                         SEQUENCE LISTING

<110>  Novozymes A/S
 
<120>  CRISPR-AID USING CATALYTICALLY INACTIVE RNA-GUIDED ENDONUCLEASE

<130>  15057-WO-PCT

<160>  166

<170>  PatentIn version 3.5

<210>  1
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer HTJP-889

<400>  1
ctagaaagta taggaacttc gctagctctg ctcgaggcca tctg                        44


<210>  2
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer HTJP-890

<400>  2
gttcgttcca atggccagcc cgatgctata cttc                                   34


<210>  3
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer HTJP-891

<400>  3
agtatagcat cgggctggcc attggaacga actcgg                                 36


<210>  4
<211>  41
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer HTJP-892

<400>  4
gattgcggga cgatagcgtc aacatcgtag tccgacaacc g                           41


<210>  5
<211>  41
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer HTJP-893

<400>  5
tcggactacg atgttgacgc tatcgtcccg caatccttcc t                           41


<210>  6
<211>  42
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer HTJP-894

<400>  6
ggtagagtaa taacgcctag gacacgcaaa acgaggtaca tt                          42


<210>  7
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer HTJP-895

<400>  7
gtcctaggcg ttattactct accgcaagg                                         29


<210>  8
<211>  49
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer HTJP-896

<400>  8
taggaacttc aatcgatcta gtcctaggct acgccaggac cgagcaagc                   49


<210>  9
<211>  500
<212>  DNA
<213>  Magnaporthe oryzae

<400>  9
tctgctcgag gccatctggc ttttctctgc tgtctgcctc gggaatggga tggaatacca       60

cgtacggtat ttggcctccg gtgccatccg aagcgagatg ctttgagctt gaaaccccct      120

cggcctgcac aggtgtctca tcgtgcattt aatccaacgg cggcgagtca aaacatcagc      180

taattgacca ggtttctgga ttgtgaatgc caactttttg ggtcttgagg agttgcgggg      240

tgggaaaaaa gtaaagaaat ttactgagga ttttatcatt gcgactataa aataaagcgg      300

cattgcaaat ccttgcgttg ctactatgta aaatggactg tagttgtgct gctgaaaata      360

gtttggcgat tgtggattgt ggattgtgga ttgtggatta tggcaagttg tcaaggggca      420

agttgacgaa aatgattgtg tggtgtctgc cagcaaattg agaacgtggg tatatatttc      480

atcttttcat gattcccttc                                                  500


<210>  10
<211>  91
<212>  DNA
<213>  Aspergillus fumigatus

<400>  10
ggcttgcttg tcaagcaatg gcatcattgg tctagtggta gaattcgtcg ttgccatcga       60

cgaggcccgt gttcgattca cggatgatgc a                                      91


<210>  11
<211>  78
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Cas9 sgRNA backbone

<400>  11
gttttagagc tagaaatagc aagttaaaat aaggctagtc cgttatcaac ttgaaaaagt       60

ggcaccgagt cggtgctt                                                     78


<210>  12
<211>  215
<212>  DNA
<213>  Magnaporthe oryzae

<400>  12
tttttttggc tcttgggttc gaactgccca aggcccatgt tttggtcatc ttttttttta       60

tgccccacca tttgggtcac ccctgccaat cattccatct ttgttcctac ccttcacgtg      120

tgctttccga agccaaagtt cccattcaac aactctcctt gcgttttttt tttcttgaag      180

cttgtcaccc gtcgatagtt tctgccattt gcaat                                 215


<210>  13
<211>  886
<212>  DNA
<213>  Aspergillus nidulans

<400>  13
cgagacagca gaatcaccgc ccaagttaag cctttgtgct gatcatgctc tcgaacgggc       60

caagttcggg aaaagcaaag gagcgtttag tgaggggcaa tttgactcac ctcccaggca      120

acagatgagg ggggcaaaaa gaaagaaatt ttcgtgagtc aatatggatt ccgagcatca      180

ttttcttgcg gtctatcttg ctacgtatgt tgatcttgac gctgtggatc aagcaacgcc      240

actcgctcgc tccatcgcag gctggtcgca gacaaattaa aaggcggcaa actcgtacag      300

ccgcggggtt gtccgctgca aagtacagag tgataaaagc cgccatgcga ccatcaacgc      360

gttgatgccc agctttttcg atccgagaat ccaccgtaga ggcgatagca agtaaagaaa      420

agctaaacaa aaaaaaattt ctgcccctaa gccatgaaaa cgagatgggg tggagcagaa      480

ccaaggaaag agtcgcgctg ggctgccgtt ccggaaggtg ttgtaaaggc tcgacgccca      540

aggtgggagt ctaggagaag aatttgcatc gggagtgggg cgggttaccc ctccatatcc      600

aatgacagat atctaccagc caagggtttg agcccgcccg cttagtcgtc gtcctcgctt      660

gcccctccat aaaaggattt cccctccccc tcccacaaaa ttttctttcc cttcctctcc      720

ttgtccgctt cagtacgtat atcttccctt ccctcgcttc tctcctccat ccttctttca      780

tccatctcct gctaacttct ctgctcagca cctctacgca ttactagccg tagtatctga      840

gcacttctcc cttttatatt ccacaaaaca taacacaacc ttcacc                     886


<210>  14
<211>  4131
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  dCas9_coding

<400>  14
atggacaaga agtatagcat cgggctggcc attggaacga actcggttgg ttgggctgtg       60

attacggacg aatacaaggt gccatccaag aagtttaagg tcctgggaaa caccgaccgt      120

cactcaatca agaagaatct cattggagcc ctgctcttcg atagtgggga gaccgccgaa      180

gctactcgac tgaagcgaac ggctcgccgg cgttatacac gacgcaagaa tcgcatctgc      240

tacctccagg agattttcag caacgaaatg gctaaggttg atgactcatt ctttcatcga      300

ctcgaagaaa gtttcttggt cgaggaggat aagaagcacg agcgccatcc gatctttggt      360

aacattgtgg atgaggttgc ctatcacgaa aagtacccaa ctatctatca tcttcgtaag      420

aagctggtcg atagcacgga caaggctgat ttgcgactta tctacctggc actcgcgcac      480

atgattaagt tccgcggcca ttttcttatc gagggtgacc tgaaccccga taattctgac      540

gttgataagc tcttcatcca gttggtccaa acctacaatc agctgtttga ggaaaaccct      600

attaatgcat ctggcgtgga cgccaaggct atcctttcgg cgcgcctgtc taagtcgcgg      660

cgtttggaga accttatcgc acaactcccc ggcgaaaaga agaacggcct cttcggtaat      720

ttgattgcgt tgtcacttgg tctgactcct aacttcaaga gtaattttga cctggcagag      780

gatgcgaagc tccagttgtc taaggatacg tatgatgacg atctcgacaa cttgcttgcc      840

caaatcggtg accagtacgc tgatcttttc ctggccgcta agaatctctc agatgcaatc      900

ctgctcagtg acattttgcg ggtcaacacc gagattacta aggcccccct gtcagctagt      960

atgatcaagc ggtatgatga gcaccatcag gacctcacct tgcttaaggc cctcgtgcgt     1020

cagcaattgc ctgagaagta caaggaaatc ttctttgacc aatccaagaa cggatacgca     1080

gggtatattg atggcggtgc gagccaggag gaattctaca agtttatcaa gccgattttg     1140

gagaagatgg acggcactga ggaactgctc gtcaagctga atcgcgaaga tttgcttcgt     1200

aagcaacgaa cgttcgacaa cggctccatc ccgcaccaga ttcatctggg cgagctccac     1260

gccatccttc gacgccagga agatttctac ccatttctga aggacaaccg tgagaagatc     1320

gaaaagattc ttacattccg aatcccctac tatgtgggac ctttggcccg tgggaattcc     1380

cgatttgctt ggatgacccg aaagagcgag gaaaccatca ctccgtggaa cttcgaggaa     1440

gtcgtggaca agggtgcatc cgcgcagagc ttcattgagc ggatgaccaa ttttgataag     1500

aaccttccga atgaaaaggt cctgccaaag cattcgctgc tctacgagta tttcaccgtg     1560

tataacgaac tgactaaggt caagtacgtg acggagggaa tgcggaagcc agccttcctc     1620

tcaggggaac aaaagaaggc tatcgtcgat ttgcttttta agaccaatcg taaagtgact     1680

gttaagcagc tgaaggagga ttatttcaag aagattgaat gtttcgactc cgtcgagatc     1740

agcggcgtgg aagatcgctt taacgcttcc ctcggtacct accacgacct gctcaagatc     1800

attaaggaca aggatttcct cgataacgag gaaaatgagg acatcttgga agatattgtc     1860

ctcacgttga cactttttga ggaccgcgaa atgatcgagg aacggctcaa gacatatgcc     1920

catttgttcg acgataaggt gatgaagcag ctgaagcggc gtcgatacac cggatggggt     1980

cgccttagcc ggaagctgat caacggcatt cgagataagc aatctggtaa gactatcttg     2040

gatttcctta agtcggacgg cttcgccaac cgcaatttta tgcagcttat tcacgacgat     2100

tccctgacgt tcaaggagga catccagaag gcacaagtct caggacaagg ggattccctg     2160

cacgagcata tcgccaacct ggctggatcc ccggcgatca agaaggggat tcttcagacc     2220

gtcaaggttg tcgacgagct ggtcaaggtg atgggccgtc ataagccaga aaacatcgtg     2280

attgagatgg cccgagaaaa tcagaccact caaaagggtc agaagaacag ccgcgagcgg     2340

atgaagcgga tcgaggaagg cattaaggaa cttggttctc agatcctgaa ggagcaccct     2400

gttgaaaaca cacagctcca aaatgagaag ctgtatctct actatttgca aaatggacgc     2460

gacatgtacg tcgatcagga gctcgacatt aaccggttgt cggactacga tgttgacgct     2520

atcgtcccgc aatccttcct taaggacgat agcattgata acaaggtgct gactcgctca     2580

gataagaacc ggggcaagtc cgacaatgtt ccaagcgagg aagtggttaa gaagatgaag     2640

aactactggc gccaattgct taatgccaag ctcatcacac agcgcaagtt tgacaacttg     2700

accaaggccg agcggggagg gctgagtgaa ctcgataagg ctggcttcat caagcgtcaa     2760

ctcgtggaga cgcgacagat cacaaagcac gttgctcaga ttctggactc ccggatgaac     2820

acaaagtacg acgagaatga taagctcatc cgtgaagtta aggtcattac cctcaagtct     2880

aagttggtgt cggatttccg caaggacttc caattttata aggttcggga gatcaacaat     2940

tatcaccatg cacatgatgc gtacctcaac gcagtcgtgg gaactgcgct catcaagaag     3000

tatcccaagt tggagtccga attcgtctac ggggattata aggtttacga cgtccgcaag     3060

atgatcgcca agagtgagca ggaaattggc aaggccacgg ctaagtattt cttttactcc     3120

aacatcatga atttctttaa gacggagatc acactcgcca atggagaaat ccgtaagcga     3180

cctttgattg agaccaacgg cgagactggt gaaatcgttt gggataaggg gcgcgacttc     3240

gctaccgtgc ggaaggttct gagcatgccg caagtcaata tcgtcaagaa aaccgaggtg     3300

cagacaggcg gtttctctaa ggaatcgatt cttccaaagc gtaactctga caagctgatc     3360

gctcgaaaga aggattggga ccccaagaag tatggagggt tcgattctcc tacagtggca     3420

tactcggttc tcgttgtcgc gaaggttgag aagggaaagt ctaagaagct gaagtcggtc     3480

aaggaactgc tcgggatcac cattatggag cgctccagct tcgaaaagaa tcccatcgac     3540

tttctcgagg ccaagggcta taaggaagtc aagaaggatc ttatcattaa gctgcctaag     3600

tactctttgt tcgagcttga aaacggtcga aagcgaatgc tcgcatcggc aggagagttg     3660

cagaagggga atgaattggc acttccctca aagtacgtga acttcctgta tctcgcgtcc     3720

cactacgaga agctgaaggg tagccctgag gacaacgaac agaagcaact ttttgttgag     3780

caacacaagc attatctgga tgagatcatt gaacagattt cagagttcag taagcgcgtc     3840

atcctcgccg atgctaatct cgacaaggtg ttgtcggcct acaacaagca ccgtgacaag     3900

ccgatccgag agcaggctga aaatatcatt catctgttca ccctcactaa cttgggagca     3960

ccagcagcgt tcaagtattt tgatacgaca atcgaccgta agcgatacac gtccacaaag     4020

gaggtgcttg atgcgaccct gattcatcaa tccatcactg ggctctatga aacccgtatc     4080

gaccttagtc aactgggggg cgaccctccc aagaagaagc gcaaggtctg a              4131


<210>  15
<211>  1376
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  dCas9_protein

<400>  15

Met Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val 
1               5                   10                  15      


Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 
            20                  25                  30          


Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 
        35                  40                  45              


Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 
    50                  55                  60                  


Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 
65                  70                  75                  80  


Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 
                85                  90                  95      


Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys 
            100                 105                 110         


His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr 
        115                 120                 125             


His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp 
    130                 135                 140                 


Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 
145                 150                 155                 160 


Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 
                165                 170                 175     


Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 
            180                 185                 190         


Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 
        195                 200                 205             


Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 
    210                 215                 220                 


Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 
225                 230                 235                 240 


Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 
                245                 250                 255     


Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 
            260                 265                 270         


Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 
        275                 280                 285             


Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 
    290                 295                 300                 


Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 
305                 310                 315                 320 


Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 
                325                 330                 335     


Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 
            340                 345                 350         


Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 
        355                 360                 365             


Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 
    370                 375                 380                 


Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg 
385                 390                 395                 400 


Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 
                405                 410                 415     


Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe 
            420                 425                 430         


Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 
        435                 440                 445             


Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 
    450                 455                 460                 


Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 
465                 470                 475                 480 


Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 
                485                 490                 495     


Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 
            500                 505                 510         


Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 
        515                 520                 525             


Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 
    530                 535                 540                 


Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 
545                 550                 555                 560 


Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 
                565                 570                 575     


Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 
            580                 585                 590         


Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 
        595                 600                 605             


Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 
    610                 615                 620                 


Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 
625                 630                 635                 640 


His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 
                645                 650                 655     


Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 
            660                 665                 670         


Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 
        675                 680                 685             


Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 
    690                 695                 700                 


Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 
705                 710                 715                 720 


His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly 
                725                 730                 735     


Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 
            740                 745                 750         


Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 
        755                 760                 765             


Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile 
    770                 775                 780                 


Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro 
785                 790                 795                 800 


Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu 
                805                 810                 815     


Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 
            820                 825                 830         


Leu Ser Asp Tyr Asp Val Asp Ala Ile Val Pro Gln Ser Phe Leu Lys 
        835                 840                 845             


Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg 
    850                 855                 860                 


Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys 
865                 870                 875                 880 


Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys 
                885                 890                 895     


Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 
            900                 905                 910         


Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 
        915                 920                 925             


Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 
    930                 935                 940                 


Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 
945                 950                 955                 960 


Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 
                965                 970                 975     


Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 
            980                 985                 990         


Val Gly Thr Ala Leu Ile Lys Lys  Tyr Pro Lys Leu Glu  Ser Glu Phe 
        995                 1000                 1005             


Val Tyr  Gly Asp Tyr Lys Val  Tyr Asp Val Arg Lys  Met Ile Ala 
    1010                 1015                 1020             


Lys Ser  Glu Gln Glu Ile Gly  Lys Ala Thr Ala Lys  Tyr Phe Phe 
    1025                 1030                 1035             


Tyr Ser  Asn Ile Met Asn Phe  Phe Lys Thr Glu Ile  Thr Leu Ala 
    1040                 1045                 1050             


Asn Gly  Glu Ile Arg Lys Arg  Pro Leu Ile Glu Thr  Asn Gly Glu 
    1055                 1060                 1065             


Thr Gly  Glu Ile Val Trp Asp  Lys Gly Arg Asp Phe  Ala Thr Val 
    1070                 1075                 1080             


Arg Lys  Val Leu Ser Met Pro  Gln Val Asn Ile Val  Lys Lys Thr 
    1085                 1090                 1095             


Glu Val  Gln Thr Gly Gly Phe  Ser Lys Glu Ser Ile  Leu Pro Lys 
    1100                 1105                 1110             


Arg Asn  Ser Asp Lys Leu Ile  Ala Arg Lys Lys Asp  Trp Asp Pro 
    1115                 1120                 1125             


Lys Lys  Tyr Gly Gly Phe Asp  Ser Pro Thr Val Ala  Tyr Ser Val 
    1130                 1135                 1140             


Leu Val  Val Ala Lys Val Glu  Lys Gly Lys Ser Lys  Lys Leu Lys 
    1145                 1150                 1155             


Ser Val  Lys Glu Leu Leu Gly  Ile Thr Ile Met Glu  Arg Ser Ser 
    1160                 1165                 1170             


Phe Glu  Lys Asn Pro Ile Asp  Phe Leu Glu Ala Lys  Gly Tyr Lys 
    1175                 1180                 1185             


Glu Val  Lys Lys Asp Leu Ile  Ile Lys Leu Pro Lys  Tyr Ser Leu 
    1190                 1195                 1200             


Phe Glu  Leu Glu Asn Gly Arg  Lys Arg Met Leu Ala  Ser Ala Gly 
    1205                 1210                 1215             


Glu Leu  Gln Lys Gly Asn Glu  Leu Ala Leu Pro Ser  Lys Tyr Val 
    1220                 1225                 1230             


Asn Phe  Leu Tyr Leu Ala Ser  His Tyr Glu Lys Leu  Lys Gly Ser 
    1235                 1240                 1245             


Pro Glu  Asp Asn Glu Gln Lys  Gln Leu Phe Val Glu  Gln His Lys 
    1250                 1255                 1260             


His Tyr  Leu Asp Glu Ile Ile  Glu Gln Ile Ser Glu  Phe Ser Lys 
    1265                 1270                 1275             


Arg Val  Ile Leu Ala Asp Ala  Asn Leu Asp Lys Val  Leu Ser Ala 
    1280                 1285                 1290             


Tyr Asn  Lys His Arg Asp Lys  Pro Ile Arg Glu Gln  Ala Glu Asn 
    1295                 1300                 1305             


Ile Ile  His Leu Phe Thr Leu  Thr Asn Leu Gly Ala  Pro Ala Ala 
    1310                 1315                 1320             


Phe Lys  Tyr Phe Asp Thr Thr  Ile Asp Arg Lys Arg  Tyr Thr Ser 
    1325                 1330                 1335             


Thr Lys  Glu Val Leu Asp Ala  Thr Leu Ile His Gln  Ser Ile Thr 
    1340                 1345                 1350             


Gly Leu  Tyr Glu Thr Arg Ile  Asp Leu Ser Gln Leu  Gly Gly Asp 
    1355                 1360                 1365             


Pro Pro  Lys Lys Lys Arg Lys  Val 
    1370                 1375     


<210>  16
<211>  1023
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  hph selection marker

<400>  16
atgtcgcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc       60

gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag cttcgatgta      120

ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaagatcgt      180

tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct tgacattggg      240

gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa      300

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg      360

atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc      420

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac      480

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg      540

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc      600

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg      660

ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt      720

atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg      780

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc      840

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc      900

gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt      960

gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa     1020

tag                                                                   1023


<210>  17
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer IF_U6cas9_fwd

<400>  17
ttttctctgc tgtctgcctc g                                                 21


<210>  18
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer IF_dCasCDA_rev

<400>  18
gtcgcccccc agttgactaa g                                                 21


<210>  19
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer IF_CDA3UTR_fwd

<400>  19
gcggacattc gatttatgcc gttatg                                            26


<210>  20
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer IF_U63UTR_rev

<400>  20
agacagcaga gaaaagccag atgg                                              24


<210>  21
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer CDAinsert_fwd

<400>  21
caactggggg gcgacagcag                                                   20


<210>  22
<211>  25
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer CDAinsert_rev

<400>  22
aaatcgaatg tccgcttatc cggag                                             25


<210>  23
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer for pTNA197

<400>  23
cagcagtcct ctgctctaga                                                   20


<210>  24
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer for pTNA198

<400>  24
tccaacccac tccctggaat                                                   20


<210>  25
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer for pTNA199

<400>  25
ccagcatgtt gactcggaat                                                   20


<210>  26
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer for pTNA200

<400>  26
tgtcccagca tagtcgtcgt                                                   20


<210>  27
<211>  60
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer wA_sense_1

<400>  27
ttcgattcac ggatgatgca cagcagtcct ctgctctaga gttttagagc tagaaatagc       60


<210>  28
<211>  60
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer wA_sense_1_rev

<400>  28
gctatttcta gctctaaaac tctagagcag aggactgctg tgcatcatcc gtgaatcgaa       60


<210>  29
<211>  60
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer wA_sense_2

<400>  29
ttcgattcac ggatgatgca tccaacccac tccctggaat gttttagagc tagaaatagc       60


<210>  30
<211>  60
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer wA_sense_2_rev

<400>  30
gctatttcta gctctaaaac attccaggga gtgggttgga tgcatcatcc gtgaatcgaa       60


<210>  31
<211>  60
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer wA_anti_1

<400>  31
ttcgattcac ggatgatgca ccagcatgtt gactcggaat gttttagagc tagaaatagc       60


<210>  32
<211>  60
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer wA_anti_1_rev

<400>  32
gctatttcta gctctaaaac attccgagtc aacatgctgg tgcatcatcc gtgaatcgaa       60


<210>  33
<211>  60
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer wA_anti_2

<400>  33
ttcgattcac ggatgatgca tgtcccagca tagtcgtcgt gttttagagc tagaaatagc       60


<210>  34
<211>  60
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer wA_anti_2_rev

<400>  34
gctatttcta gctctaaaac acgacgacta tgctgggaca tgcatcatcc gtgaatcgaa       60


<210>  35
<211>  5055
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  dCas9-AID_coding

<400>  35
atggacaaga agtatagcat cgggctggcc attggaacga actcggttgg ttgggctgtg       60

attacggacg aatacaaggt gccatccaag aagtttaagg tcctgggaaa caccgaccgt      120

cactcaatca agaagaatct cattggagcc ctgctcttcg atagtgggga gaccgccgaa      180

gctactcgac tgaagcgaac ggctcgccgg cgttatacac gacgcaagaa tcgcatctgc      240

tacctccagg agattttcag caacgaaatg gctaaggttg atgactcatt ctttcatcga      300

ctcgaagaaa gtttcttggt cgaggaggat aagaagcacg agcgccatcc gatctttggt      360

aacattgtgg atgaggttgc ctatcacgaa aagtacccaa ctatctatca tcttcgtaag      420

aagctggtcg atagcacgga caaggctgat ttgcgactta tctacctggc actcgcgcac      480

atgattaagt tccgcggcca ttttcttatc gagggtgacc tgaaccccga taattctgac      540

gttgataagc tcttcatcca gttggtccaa acctacaatc agctgtttga ggaaaaccct      600

attaatgcat ctggcgtgga cgccaaggct atcctttcgg cgcgcctgtc taagtcgcgg      660

cgtttggaga accttatcgc acaactcccc ggcgaaaaga agaacggcct cttcggtaat      720

ttgattgcgt tgtcacttgg tctgactcct aacttcaaga gtaattttga cctggcagag      780

gatgcgaagc tccagttgtc taaggatacg tatgatgacg atctcgacaa cttgcttgcc      840

caaatcggtg accagtacgc tgatcttttc ctggccgcta agaatctctc agatgcaatc      900

ctgctcagtg acattttgcg ggtcaacacc gagattacta aggcccccct gtcagctagt      960

atgatcaagc ggtatgatga gcaccatcag gacctcacct tgcttaaggc cctcgtgcgt     1020

cagcaattgc ctgagaagta caaggaaatc ttctttgacc aatccaagaa cggatacgca     1080

gggtatattg atggcggtgc gagccaggag gaattctaca agtttatcaa gccgattttg     1140

gagaagatgg acggcactga ggaactgctc gtcaagctga atcgcgaaga tttgcttcgt     1200

aagcaacgaa cgttcgacaa cggctccatc ccgcaccaga ttcatctggg cgagctccac     1260

gccatccttc gacgccagga agatttctac ccatttctga aggacaaccg tgagaagatc     1320

gaaaagattc ttacattccg aatcccctac tatgtgggac ctttggcccg tgggaattcc     1380

cgatttgctt ggatgacccg aaagagcgag gaaaccatca ctccgtggaa cttcgaggaa     1440

gtcgtggaca agggtgcatc cgcgcagagc ttcattgagc ggatgaccaa ttttgataag     1500

aaccttccga atgaaaaggt cctgccaaag cattcgctgc tctacgagta tttcaccgtg     1560

tataacgaac tgactaaggt caagtacgtg acggagggaa tgcggaagcc agccttcctc     1620

tcaggggaac aaaagaaggc tatcgtcgat ttgcttttta agaccaatcg taaagtgact     1680

gttaagcagc tgaaggagga ttatttcaag aagattgaat gtttcgactc cgtcgagatc     1740

agcggcgtgg aagatcgctt taacgcttcc ctcggtacct accacgacct gctcaagatc     1800

attaaggaca aggatttcct cgataacgag gaaaatgagg acatcttgga agatattgtc     1860

ctcacgttga cactttttga ggaccgcgaa atgatcgagg aacggctcaa gacatatgcc     1920

catttgttcg acgataaggt gatgaagcag ctgaagcggc gtcgatacac cggatggggt     1980

cgccttagcc ggaagctgat caacggcatt cgagataagc aatctggtaa gactatcttg     2040

gatttcctta agtcggacgg cttcgccaac cgcaatttta tgcagcttat tcacgacgat     2100

tccctgacgt tcaaggagga catccagaag gcacaagtct caggacaagg ggattccctg     2160

cacgagcata tcgccaacct ggctggatcc ccggcgatca agaaggggat tcttcagacc     2220

gtcaaggttg tcgacgagct ggtcaaggtg atgggccgtc ataagccaga aaacatcgtg     2280

attgagatgg cccgagaaaa tcagaccact caaaagggtc agaagaacag ccgcgagcgg     2340

atgaagcgga tcgaggaagg cattaaggaa cttggttctc agatcctgaa ggagcaccct     2400

gttgaaaaca cacagctcca aaatgagaag ctgtatctct actatttgca aaatggacgc     2460

gacatgtacg tcgatcagga gctcgacatt aaccggttgt cggactacga tgttgacgct     2520

atcgtcccgc aatccttcct taaggacgat agcattgata acaaggtgct gactcgctca     2580

gataagaacc ggggcaagtc cgacaatgtt ccaagcgagg aagtggttaa gaagatgaag     2640

aactactggc gccaattgct taatgccaag ctcatcacac agcgcaagtt tgacaacttg     2700

accaaggccg agcggggagg gctgagtgaa ctcgataagg ctggcttcat caagcgtcaa     2760

ctcgtggaga cgcgacagat cacaaagcac gttgctcaga ttctggactc ccggatgaac     2820

acaaagtacg acgagaatga taagctcatc cgtgaagtta aggtcattac cctcaagtct     2880

aagttggtgt cggatttccg caaggacttc caattttata aggttcggga gatcaacaat     2940

tatcaccatg cacatgatgc gtacctcaac gcagtcgtgg gaactgcgct catcaagaag     3000

tatcccaagt tggagtccga attcgtctac ggggattata aggtttacga cgtccgcaag     3060

atgatcgcca agagtgagca ggaaattggc aaggccacgg ctaagtattt cttttactcc     3120

aacatcatga atttctttaa gacggagatc acactcgcca atggagaaat ccgtaagcga     3180

cctttgattg agaccaacgg cgagactggt gaaatcgttt gggataaggg gcgcgacttc     3240

gctaccgtgc ggaaggttct gagcatgccg caagtcaata tcgtcaagaa aaccgaggtg     3300

cagacaggcg gtttctctaa ggaatcgatt cttccaaagc gtaactctga caagctgatc     3360

gctcgaaaga aggattggga ccccaagaag tatggagggt tcgattctcc tacagtggca     3420

tactcggttc tcgttgtcgc gaaggttgag aagggaaagt ctaagaagct gaagtcggtc     3480

aaggaactgc tcgggatcac cattatggag cgctccagct tcgaaaagaa tcccatcgac     3540

tttctcgagg ccaagggcta taaggaagtc aagaaggatc ttatcattaa gctgcctaag     3600

tactctttgt tcgagcttga aaacggtcga aagcgaatgc tcgcatcggc aggagagttg     3660

cagaagggga atgaattggc acttccctca aagtacgtga acttcctgta tctcgcgtcc     3720

cactacgaga agctgaaggg tagccctgag gacaacgaac agaagcaact ttttgttgag     3780

caacacaagc attatctgga tgagatcatt gaacagattt cagagttcag taagcgcgtc     3840

atcctcgccg atgctaatct cgacaaggtg ttgtcggcct acaacaagca ccgtgacaag     3900

ccgatccgag agcaggctga aaatatcatt catctgttca ccctcactaa cttgggagca     3960

ccagcagcgt tcaagtattt tgatacgaca atcgaccgta agcgatacac gtccacaaag     4020

gaggtgcttg atgcgaccct gattcatcaa tccatcactg ggctctatga aacccgtatc     4080

gaccttagtc aactgggggg cgacagcagg gctgacccca agaagaagag gaaggtgggt     4140

ggaggaggtt ctggaggtgg aggttctgca gagtatgtgc gggccctctt tgactttaat     4200

gggaatgatg aagaagacct tccctttaag aaaggagaca tcctgagaat ccgggataag     4260

cctgaagagc agtggtggaa tgcagaggac agcgaaggaa agagggggat gattcctgtc     4320

ccttacgtgg agaagtattc cggagactat aaggaccacg acggagacta caaggatcat     4380

gatattgatt acaaagacga tgacgataag tctaggatga ccgacgctga gtacgtgaga     4440

atccatgaga agttggacat ctacacgttt aagaaacagt ttttcaacaa caaaaaatcc     4500

gtgtcgcata gatgctacgt tctctttgaa ttaaaacgac ggggtgaacg tagagcgtgt     4560

ttttggggct atgctgtgaa taaaccacag agcgggacag aacgtggcat tcacgccgaa     4620

atctttagca ttagaaaagt cgaagaatac ctgcgcgaca accccggaca attcacgata     4680

aattggtact catcctggag tccttgtgca gattgcgctg aaaaaatctt agaatggtat     4740

aaccaggagc tgcgggggaa cggccacact ttgaaaatct gggcttgcaa actctattac     4800

gagaaaaatg cgaggaatca aattgggctg tggaacctca gagataacgg ggttgggttg     4860

aatgtaatgg taagtgaaca ctaccaatgt tgcaggaaaa tattcatcca atcgtcgcac     4920

aatcaattga atgagaatag atggcttgag aagactttga agcgagctga aaaacgacgg     4980

agcgagttgt ccattatgat tcaggtaaaa atactccaca ccactaagag tcctgctgtt     5040

tctagaggct ccgga                                                      5055


<210>  36
<211>  1685
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  dCas9-AID_protein

<400>  36

Met Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val 
1               5                   10                  15      


Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 
            20                  25                  30          


Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 
        35                  40                  45              


Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 
    50                  55                  60                  


Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 
65                  70                  75                  80  


Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 
                85                  90                  95      


Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys 
            100                 105                 110         


His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr 
        115                 120                 125             


His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp 
    130                 135                 140                 


Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 
145                 150                 155                 160 


Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 
                165                 170                 175     


Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 
            180                 185                 190         


Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 
        195                 200                 205             


Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 
    210                 215                 220                 


Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 
225                 230                 235                 240 


Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 
                245                 250                 255     


Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 
            260                 265                 270         


Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 
        275                 280                 285             


Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 
    290                 295                 300                 


Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 
305                 310                 315                 320 


Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 
                325                 330                 335     


Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 
            340                 345                 350         


Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 
        355                 360                 365             


Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 
    370                 375                 380                 


Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg 
385                 390                 395                 400 


Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 
                405                 410                 415     


Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe 
            420                 425                 430         


Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 
        435                 440                 445             


Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 
    450                 455                 460                 


Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 
465                 470                 475                 480 


Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 
                485                 490                 495     


Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 
            500                 505                 510         


Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 
        515                 520                 525             


Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 
    530                 535                 540                 


Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 
545                 550                 555                 560 


Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 
                565                 570                 575     


Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 
            580                 585                 590         


Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 
        595                 600                 605             


Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 
    610                 615                 620                 


Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 
625                 630                 635                 640 


His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 
                645                 650                 655     


Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 
            660                 665                 670         


Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 
        675                 680                 685             


Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 
    690                 695                 700                 


Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 
705                 710                 715                 720 


His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly 
                725                 730                 735     


Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 
            740                 745                 750         


Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 
        755                 760                 765             


Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile 
    770                 775                 780                 


Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro 
785                 790                 795                 800 


Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu 
                805                 810                 815     


Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 
            820                 825                 830         


Leu Ser Asp Tyr Asp Val Asp Ala Ile Val Pro Gln Ser Phe Leu Lys 
        835                 840                 845             


Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg 
    850                 855                 860                 


Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys 
865                 870                 875                 880 


Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys 
                885                 890                 895     


Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 
            900                 905                 910         


Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 
        915                 920                 925             


Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 
    930                 935                 940                 


Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 
945                 950                 955                 960 


Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 
                965                 970                 975     


Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 
            980                 985                 990         


Val Gly Thr Ala Leu Ile Lys Lys  Tyr Pro Lys Leu Glu  Ser Glu Phe 
        995                 1000                 1005             


Val Tyr  Gly Asp Tyr Lys Val  Tyr Asp Val Arg Lys  Met Ile Ala 
    1010                 1015                 1020             


Lys Ser  Glu Gln Glu Ile Gly  Lys Ala Thr Ala Lys  Tyr Phe Phe 
    1025                 1030                 1035             


Tyr Ser  Asn Ile Met Asn Phe  Phe Lys Thr Glu Ile  Thr Leu Ala 
    1040                 1045                 1050             


Asn Gly  Glu Ile Arg Lys Arg  Pro Leu Ile Glu Thr  Asn Gly Glu 
    1055                 1060                 1065             


Thr Gly  Glu Ile Val Trp Asp  Lys Gly Arg Asp Phe  Ala Thr Val 
    1070                 1075                 1080             


Arg Lys  Val Leu Ser Met Pro  Gln Val Asn Ile Val  Lys Lys Thr 
    1085                 1090                 1095             


Glu Val  Gln Thr Gly Gly Phe  Ser Lys Glu Ser Ile  Leu Pro Lys 
    1100                 1105                 1110             


Arg Asn  Ser Asp Lys Leu Ile  Ala Arg Lys Lys Asp  Trp Asp Pro 
    1115                 1120                 1125             


Lys Lys  Tyr Gly Gly Phe Asp  Ser Pro Thr Val Ala  Tyr Ser Val 
    1130                 1135                 1140             


Leu Val  Val Ala Lys Val Glu  Lys Gly Lys Ser Lys  Lys Leu Lys 
    1145                 1150                 1155             


Ser Val  Lys Glu Leu Leu Gly  Ile Thr Ile Met Glu  Arg Ser Ser 
    1160                 1165                 1170             


Phe Glu  Lys Asn Pro Ile Asp  Phe Leu Glu Ala Lys  Gly Tyr Lys 
    1175                 1180                 1185             


Glu Val  Lys Lys Asp Leu Ile  Ile Lys Leu Pro Lys  Tyr Ser Leu 
    1190                 1195                 1200             


Phe Glu  Leu Glu Asn Gly Arg  Lys Arg Met Leu Ala  Ser Ala Gly 
    1205                 1210                 1215             


Glu Leu  Gln Lys Gly Asn Glu  Leu Ala Leu Pro Ser  Lys Tyr Val 
    1220                 1225                 1230             


Asn Phe  Leu Tyr Leu Ala Ser  His Tyr Glu Lys Leu  Lys Gly Ser 
    1235                 1240                 1245             


Pro Glu  Asp Asn Glu Gln Lys  Gln Leu Phe Val Glu  Gln His Lys 
    1250                 1255                 1260             


His Tyr  Leu Asp Glu Ile Ile  Glu Gln Ile Ser Glu  Phe Ser Lys 
    1265                 1270                 1275             


Arg Val  Ile Leu Ala Asp Ala  Asn Leu Asp Lys Val  Leu Ser Ala 
    1280                 1285                 1290             


Tyr Asn  Lys His Arg Asp Lys  Pro Ile Arg Glu Gln  Ala Glu Asn 
    1295                 1300                 1305             


Ile Ile  His Leu Phe Thr Leu  Thr Asn Leu Gly Ala  Pro Ala Ala 
    1310                 1315                 1320             


Phe Lys  Tyr Phe Asp Thr Thr  Ile Asp Arg Lys Arg  Tyr Thr Ser 
    1325                 1330                 1335             


Thr Lys  Glu Val Leu Asp Ala  Thr Leu Ile His Gln  Ser Ile Thr 
    1340                 1345                 1350             


Gly Leu  Tyr Glu Thr Arg Ile  Asp Leu Ser Gln Leu  Gly Gly Asp 
    1355                 1360                 1365             


Ser Arg  Ala Asp Pro Lys Lys  Lys Arg Lys Val Gly  Gly Gly Gly 
    1370                 1375                 1380             


Ser Gly  Gly Gly Gly Ser Ala  Glu Tyr Val Arg Ala  Leu Phe Asp 
    1385                 1390                 1395             


Phe Asn  Gly Asn Asp Glu Glu  Asp Leu Pro Phe Lys  Lys Gly Asp 
    1400                 1405                 1410             


Ile Leu  Arg Ile Arg Asp Lys  Pro Glu Glu Gln Trp  Trp Asn Ala 
    1415                 1420                 1425             


Glu Asp  Ser Glu Gly Lys Arg  Gly Met Ile Pro Val  Pro Tyr Val 
    1430                 1435                 1440             


Glu Lys  Tyr Ser Gly Asp Tyr  Lys Asp His Asp Gly  Asp Tyr Lys 
    1445                 1450                 1455             


Asp His  Asp Ile Asp Tyr Lys  Asp Asp Asp Asp Lys  Ser Arg Met 
    1460                 1465                 1470             


Thr Asp  Ala Glu Tyr Val Arg  Ile His Glu Lys Leu  Asp Ile Tyr 
    1475                 1480                 1485             


Thr Phe  Lys Lys Gln Phe Phe  Asn Asn Lys Lys Ser  Val Ser His 
    1490                 1495                 1500             


Arg Cys  Tyr Val Leu Phe Glu  Leu Lys Arg Arg Gly  Glu Arg Arg 
    1505                 1510                 1515             


Ala Cys  Phe Trp Gly Tyr Ala  Val Asn Lys Pro Gln  Ser Gly Thr 
    1520                 1525                 1530             


Glu Arg  Gly Ile His Ala Glu  Ile Phe Ser Ile Arg  Lys Val Glu 
    1535                 1540                 1545             


Glu Tyr  Leu Arg Asp Asn Pro  Gly Gln Phe Thr Ile  Asn Trp Tyr 
    1550                 1555                 1560             


Ser Ser  Trp Ser Pro Cys Ala  Asp Cys Ala Glu Lys  Ile Leu Glu 
    1565                 1570                 1575             


Trp Tyr  Asn Gln Glu Leu Arg  Gly Asn Gly His Thr  Leu Lys Ile 
    1580                 1585                 1590             


Trp Ala  Cys Lys Leu Tyr Tyr  Glu Lys Asn Ala Arg  Asn Gln Ile 
    1595                 1600                 1605             


Gly Leu  Trp Asn Leu Arg Asp  Asn Gly Val Gly Leu  Asn Val Met 
    1610                 1615                 1620             


Val Ser  Glu His Tyr Gln Cys  Cys Arg Lys Ile Phe  Ile Gln Ser 
    1625                 1630                 1635             


Ser His  Asn Gln Leu Asn Glu  Asn Arg Trp Leu Glu  Lys Thr Leu 
    1640                 1645                 1650             


Lys Arg  Ala Glu Lys Arg Arg  Ser Glu Leu Ser Ile  Met Ile Gln 
    1655                 1660                 1665             


Val Lys  Ile Leu His Thr Thr  Lys Ser Pro Ala Val  Ser Arg Gly 
    1670                 1675                 1680             


Ser Gly  
    1685 


<210>  37
<211>  6680
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  PKS (wA) gene locus

<400>  37
atggagggtc catctcgtgt gtaccttttt ggagaccaga ccagcgacat cgaagctggc       60

ctgcgccgtc tgctccaagc gaagaatagt accattgtcc agtccttttt ccagcaatgc      120

ttccatgcaa ttcgtcaaga gatcgcgaag ctcccgccgt ctcatcggaa gctcttccca      180

cgcttcacga gcatcgttga tctcctttcc aggagtcgtg aatcaggtcc tagccctgtc      240

ctggagagtg cattgacatg catctaccaa ttgggttgtt tcattcagta agtcaatgag      300

ttaccatcta tacttgacaa gtctgaccag ccttcagctt ttacggggat cttggacatg      360

actaccctac accctccaac agccatcttg ttggcctgtg cactggtgtt ctgagctgca      420

cggctgtaag ttgcgccaga aatgttggag agcttattcc agctgcagtg gaatcggttg      480

taattgcact gcgactggga atctgcgttt ttcgagttcg agaactggtg gactccgccg      540

attccgagtc aacatgctgg tcagcgttgg tttctggaat cagtgaagca gaggctagcc      600

acctgatcga cgagtacagt agtaagaagg tgtgctcttc caactttaaa cccccgcatt      660

gtgggatgct gacagatgca ggctactccg ccttcttcga aaccgtatat cagcgcggta      720

agctctaatg gcgttactgt cagcgcacca cctacggtac ttgatgaatt cgtcgagacc      780

tgcatttcca agaattacaa gccagtgaag gcccctattc atggcccgta ccatgcgcca      840

catctgtatg atgataagga tatcgaccgc atcctgcagc agtcctctgc tctagaagga      900

ctgaccggct gttcacccgt tattcccatc atctccagta acactggaaa gccgatcaag      960

gccaagtcca tcaaagatct cttcaaggtc gcactggagg agatactcct acgacgacta     1020

tgctgggaca aggtcacgga gtcctgcaca tcagtctgca agaccggcac aaaccactct     1080

tgcaaattgt ttccgatctc gagtagcgcc actcaaagtt tgttcacagt cctcaagaag     1140

gccggtgtga gcatcagctt ggagactggg gtaggagaga tcgcgacgaa cccagaaatg     1200

cggaacctta ctggcaaggc agaaaattca aagattgcta tcattggtat gtctggaaga     1260

tttcctgact cggatggtac ggagagcttc tggaacctcc tgtacaaagg actcgacgta     1320

catcgcaaag tccccgcaga ccgttgggac gttgatgccc acgtcgacat gaccgggtca     1380

aagagaaaca caagcaaagt ggcttacggt tgctggatca acgaacccgg cctgtttgac     1440

ccccgattct tcaacatgtc gcctcgggaa gcactccaag cagatcctgc acaacgtctt     1500

gcgttgctta cagcgtacga ggctctcgag atggctggct tcatcccgga tagctctcca     1560

tcgacgcaga gggaccgtgt gggtattttc tacggaatga ccagtgacga ctaccgtgag     1620

atcaacagcg gccaggacat tgatacctat ttcatccctg gcggtaaccg agcatttacg     1680

ccgggtcgga taaactacta cttcaaattt agcggcccca gtgtgagcgt tgacacagcg     1740

tgctcgtcta gtcttgctgc tatccacatg gcttgcaatt cgatctggag aaatgactgc     1800

gatgccgcca tcactggagg tgtgaacatt ctgaccagcc ctgacaacca cgccggtctg     1860

gatcggggcc atttcctgtc caccactggc aactgtaaca cctttgatga cggcgccgac     1920

ggctactgta gagcggacgg agttggaagc atcgttttga agcggcttga agatgccgag     1980

gccgacaacg acccgatcct ggccgtcatc aacggtgctt acaccaacca ctcggcggag     2040

gccgtgtcaa tcactcgtcc ccatgttggc gcgcaagcat tcatcttcaa caagctgctc     2100

aatgatgcga atatcgaccc taaggacgtg agctacgtgg aaatgcatgg cactggaact     2160

caagcaggtg atgcagtcga aatgcagtcc gttcttgacg tcttcgcacc agactaccgc     2220

cggggtcccg gtcaatcgct tcatatcggt tctgccaagg caaacattgg acacggtgaa     2280

tccgcatcag gagtgactgc tcttgtcaag gtcctcctaa tgatgagaga gaacatgatt     2340

cctcctcatt gtggtatcaa gaccaagatc aattccaatt tcccgacaga cttggcgaag     2400

cgcaatgttc atatcgcctt ccaacccact ccctggaatc ggccagcttc aggaaagcgg     2460

cgaactttcg tcaacaactt ttctgctgct ggtggtaaca ctgctcttct actggaagat     2520

gctcccatac cggaacgcca agggcaggac cccaggtcgt tccatttggt ctccgtgtca     2580

gcaagatccc agtctgcatt gaagaacaac gtcgaagctc tggtgaagta cattgactct     2640

cagggcaagt cctttggtgt gaaagagact gaattccttc caaacctggc gtacacgacc     2700

accgcacgcc gtatccacca tcccttccgt gtcattgcgg ttggagcgaa cctacaatca     2760

ctgcgtgact cgctgcatgg tgctttgcac cgtgagacat ataccccagt tccctcaacg     2820

gctcctggta ttggtttcgt cttcaccggc caaggagccc aatactccgg aatgggcaag     2880

gaactctacc gcagttgttt ccaattccga accaccattg agcattttga ctgcatcgca     2940

agaagccagg gccttccttc tatccttcct cttgtcgatg gaagcgtggc tgtcgaagaa     3000

cttagccctg tcgtggtaca agtgggaact acctgtgtac aaatggctct agtaaattac     3060

tggactgctc tgggtgtgaa gccggccttt atcatcggac acagtcttgg agactatgca     3120

gcccttaaca cggccggtgt tctatccacc agcgatacaa tctatctttg tggccggcgt     3180

gctcagttgc tgacgaagga atgcaagatt gggacacatt cgatgctggc catcaaggcg     3240

tccctggcag aggtcaaaca tttcctcaga gacgagctcc acgaagtctc ttgtgttaac     3300

gcacctgcgg agaccgtcgt cagcggcctt gtcgctgata tcgacgagtt ggctcagaaa     3360

tgctccacag agggtttgaa gtcaaccaag ctcaaggttc cttacgcgtt ccattcctct     3420

caggttgatc ctatcttgga ggccttcgaa gatattgccc aaggtgtcac cttccacaag     3480

ccgacaacac ctttcgtctc agccctgttc ggggaagtga tcaccgatgc taactgggag     3540

tgtctcggcc ccaagtacct gcgcgatcat tgcagaaaga cggtcaactt ccttggcggc     3600

gtggaggcta cgaggcatgc gaagctgacc aatgacaaga ctctgtgggt tgagatcggc     3660

tcacatacca tttgctctgg aatgatcaaa gcaactcttg gaccgcaagt tacaacggtt     3720

gcatctctac gccgcgaaga agatacctgg aaggtccttt cgaacagtct tgcgagcctt     3780

catctggcgg gtattgatat caactggaag caatatcacc aggactttag ctcctctctc     3840

caggtcctcc gcctcccagc ctacaagtgg gatctcaaga actactggat tccctatacc     3900

aacaacttct gcctgagcaa gggcgctcca gttgcgacag tagcggcagg gccacagcat     3960

gagtacctga caaccgcggc tcagaaggtc attgagactc gaagtgatgg agcaacagct     4020

acagtcgtga tagagaacga cattgctgat cccgagctca accgcgtcat tcaaggccat     4080

aaggtcaacg gtactgcttt gtgtccctca gtaagttacc gctcttgccc aacgactgcg     4140

ttaagattcg tactaatcag gatatagtca ctatatgccg acatctctca aacgcttgca     4200

gagtatctca tcaaaaagta caagcctgag tacgacggac ttggactgga tgtgtgtgag     4260

gtcacagtgc cacgaccact gattgcgaaa ggcggacagc agctctttag agtatctgcg     4320

acagcggatt gggcggagaa gaagacaacc cttcagatat attcagtcac tgcggagggg     4380

aagaagacgg ctgaccacgc aacttgcact gtccgattct ttgactgcgc tgctgcggag     4440

gcggaatgga aacgagtttc ctaccttgtc aagaggagca ttgaccgact gcatgatatc     4500

gccgaaaatg gtgacgctca ccgtcttggt agaggcatgg tttacaaact cttcgctgcc     4560

ttggttgatt atgacgacaa cttcaagtcc attcgcgagg ttattcttga cagtgaacag     4620

cacgaagcga ctgcacgcgt caagttccaa gcaccacaag gcaatttcca ccgaaacccg     4680

ttctggattg acagttttgg acacctgtct gggttcatca tgaacgcaag cgatgcaacc     4740

gactccaaga accaggtctt tgtcaatcac ggatgggact ccatgcgttg tttgaagaag     4800

ttctcgcctg atgtcaccta caggacttat gttagaatgc agccttggaa agactccatc     4860

tgggctggtg atgtctacgt tttcgatggg gatgatatcg ttgcggtgta tggtgcagtc     4920

aaggtgagtt cggcccgcgc tcagttgcat aagattcaag gtgctaatca ttggtgtcac     4980

agttccaagc cttatcacgc aagattctcg atacggtcct acctccagtt ggggcttcga     5040

agggccccgc cagaccagcc gctagcgctc agaaggcggc ccctgctgct gctgccagca     5100

agagtcgtgc tagcgccccg gccccggcga agcctgctgc taagcccagc gccccaagct     5160

tggtcaaacg ggcacttacc atcctcgcag aggaagtggg tctgtctgaa tccgagatta     5220

cggatgatct ggtcttcgca gactacggtg tggactccct tctttcgttg acggtcacgg     5280

gcaggtatcg tgaagagctg gatatcgatc tcgaatcctc catcttcatc gaccagccga     5340

ccgtgaaaga cttcaagcag ttcttggccc caatgagcca gggagaagcc agcgatgggt     5400

ccaccagtga cccagagtct agtagctcct tcaatggtgg ctcttcaaca gacgagtcca     5460

gtgctgggtc ccctgtcagc tcaccaccaa atgagaaggt tacgcaggtc gagcagcatg     5520

ctacgataaa ggagattcgc gccattttgg ccgatgagat tggtgttacg gaggaggagc     5580

tgaaggacga tgagaacttg ggagagatgg ggatggactc tctgctttcg cttacggtgc     5640

ttggtaggat ccgtgagaca ttggatctgg atctaccggg cgagttcttc atcgagaatc     5700

aaactctgaa tgacgtggag gatgcattgg gcctcaaacc caaggcagct cctgcgcctg     5760

cgcctgcgcc tgctcccgta cccgcacccg tgtccgcgcc catattgaag gagcctgtcc     5820

ccaacgcaaa ctctaccatc atggcccggg cgagcccgca ccctcgatca acctccattc     5880

tgttgcaagg aaacccgaaa accgcgacca agaccctgtt cctgttccct gatgggtctg     5940

gctccgcaac atcgtatgca accattcccg gagtgtcccc ggacgtgtgt gtctacggat     6000

tgaactgccc gtacatgaag actccagaga agctcaagta tccccttgct gagatgacat     6060

tcccctatct ggccgagatc cgccgcagac agcccaaggg cccgtacaac ttcggtggat     6120

ggtctgcagg tggtatttgc gcctatgatg ccgctcgcta cctaatcctt gaagagggcg     6180

aacaggttga ccgattgctt cttcttgact cgcccttccc cattggctta gagaagttgc     6240

ccactcggct gtacggcttc atcaactcaa tgggtctctt tggtgaaggc aacaaggctc     6300

ccccggcctg gttgctccct catttcctgg ccttcattga ttccctcgat acctacaagg     6360

ccgtccccct cccctttgac gatccgaagt gggccaagaa gatgccaaag acattcatgg     6420

tctgggccaa ggacggtatc tgcagcaagc cggatgaccc gtggcccgag ccggacccgg     6480

acggcaagcc ggacacgaga gagatggtct ggctcctcaa gaaccggacc gacatgggac     6540

ccaacaagtg ggacacactc gtcgggcccc aaaacgtcgg tggaatcact gtgatagagg     6600

gtgcgaatca tttcaccatg actttgggac ccaaggctaa agaattgggc tcgttcattg     6660

gcaacgccat ggccaattaa                                                 6680


<210>  38
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer pks_seq_f2

<400>  38
tcatatcggt tctgccaagg                                                   20


<210>  39
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer pks_R

<400>  39
gttgttgacg aaagttcgcc                                                   20


<210>  40
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer pks_F

<400>  40
actgcgactg ggaatctgcg                                                   20


<210>  41
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer pks_seq_r3

<400>  41
cttgtaattc ttggaaatgc agg                                               23


<210>  42
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer BsrGI-3UTR-fwd

<400>  42
gtctaatgta cagcggacat tcgatttatg c                                      31


<210>  43
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer NheI-FRT-rev

<400>  43
agcagagcta gcgaagttcc tatactttct ag                                     32


<210>  44
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer Ex-UGItop-fwd

<400>  44
atgatctcta gaggctccgg aaccaacctg                                        30


<210>  45
<211>  25
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer UGIend-rev

<400>  45
aaatcgaatg tccgctgtac attag                                             25


<210>  46
<211>  5340
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  dCas9-AID-UGI_coding

<400>  46
atggacaaga agtatagcat cgggctggcc attggaacga actcggttgg ttgggctgtg       60

attacggacg aatacaaggt gccatccaag aagtttaagg tcctgggaaa caccgaccgt      120

cactcaatca agaagaatct cattggagcc ctgctcttcg atagtgggga gaccgccgaa      180

gctactcgac tgaagcgaac ggctcgccgg cgttatacac gacgcaagaa tcgcatctgc      240

tacctccagg agattttcag caacgaaatg gctaaggttg atgactcatt ctttcatcga      300

ctcgaagaaa gtttcttggt cgaggaggat aagaagcacg agcgccatcc gatctttggt      360

aacattgtgg atgaggttgc ctatcacgaa aagtacccaa ctatctatca tcttcgtaag      420

aagctggtcg atagcacgga caaggctgat ttgcgactta tctacctggc actcgcgcac      480

atgattaagt tccgcggcca ttttcttatc gagggtgacc tgaaccccga taattctgac      540

gttgataagc tcttcatcca gttggtccaa acctacaatc agctgtttga ggaaaaccct      600

attaatgcat ctggcgtgga cgccaaggct atcctttcgg cgcgcctgtc taagtcgcgg      660

cgtttggaga accttatcgc acaactcccc ggcgaaaaga agaacggcct cttcggtaat      720

ttgattgcgt tgtcacttgg tctgactcct aacttcaaga gtaattttga cctggcagag      780

gatgcgaagc tccagttgtc taaggatacg tatgatgacg atctcgacaa cttgcttgcc      840

caaatcggtg accagtacgc tgatcttttc ctggccgcta agaatctctc agatgcaatc      900

ctgctcagtg acattttgcg ggtcaacacc gagattacta aggcccccct gtcagctagt      960

atgatcaagc ggtatgatga gcaccatcag gacctcacct tgcttaaggc cctcgtgcgt     1020

cagcaattgc ctgagaagta caaggaaatc ttctttgacc aatccaagaa cggatacgca     1080

gggtatattg atggcggtgc gagccaggag gaattctaca agtttatcaa gccgattttg     1140

gagaagatgg acggcactga ggaactgctc gtcaagctga atcgcgaaga tttgcttcgt     1200

aagcaacgaa cgttcgacaa cggctccatc ccgcaccaga ttcatctggg cgagctccac     1260

gccatccttc gacgccagga agatttctac ccatttctga aggacaaccg tgagaagatc     1320

gaaaagattc ttacattccg aatcccctac tatgtgggac ctttggcccg tgggaattcc     1380

cgatttgctt ggatgacccg aaagagcgag gaaaccatca ctccgtggaa cttcgaggaa     1440

gtcgtggaca agggtgcatc cgcgcagagc ttcattgagc ggatgaccaa ttttgataag     1500

aaccttccga atgaaaaggt cctgccaaag cattcgctgc tctacgagta tttcaccgtg     1560

tataacgaac tgactaaggt caagtacgtg acggagggaa tgcggaagcc agccttcctc     1620

tcaggggaac aaaagaaggc tatcgtcgat ttgcttttta agaccaatcg taaagtgact     1680

gttaagcagc tgaaggagga ttatttcaag aagattgaat gtttcgactc cgtcgagatc     1740

agcggcgtgg aagatcgctt taacgcttcc ctcggtacct accacgacct gctcaagatc     1800

attaaggaca aggatttcct cgataacgag gaaaatgagg acatcttgga agatattgtc     1860

ctcacgttga cactttttga ggaccgcgaa atgatcgagg aacggctcaa gacatatgcc     1920

catttgttcg acgataaggt gatgaagcag ctgaagcggc gtcgatacac cggatggggt     1980

cgccttagcc ggaagctgat caacggcatt cgagataagc aatctggtaa gactatcttg     2040

gatttcctta agtcggacgg cttcgccaac cgcaatttta tgcagcttat tcacgacgat     2100

tccctgacgt tcaaggagga catccagaag gcacaagtct caggacaagg ggattccctg     2160

cacgagcata tcgccaacct ggctggatcc ccggcgatca agaaggggat tcttcagacc     2220

gtcaaggttg tcgacgagct ggtcaaggtg atgggccgtc ataagccaga aaacatcgtg     2280

attgagatgg cccgagaaaa tcagaccact caaaagggtc agaagaacag ccgcgagcgg     2340

atgaagcgga tcgaggaagg cattaaggaa cttggttctc agatcctgaa ggagcaccct     2400

gttgaaaaca cacagctcca aaatgagaag ctgtatctct actatttgca aaatggacgc     2460

gacatgtacg tcgatcagga gctcgacatt aaccggttgt cggactacga tgttgacgct     2520

atcgtcccgc aatccttcct taaggacgat agcattgata acaaggtgct gactcgctca     2580

gataagaacc ggggcaagtc cgacaatgtt ccaagcgagg aagtggttaa gaagatgaag     2640

aactactggc gccaattgct taatgccaag ctcatcacac agcgcaagtt tgacaacttg     2700

accaaggccg agcggggagg gctgagtgaa ctcgataagg ctggcttcat caagcgtcaa     2760

ctcgtggaga cgcgacagat cacaaagcac gttgctcaga ttctggactc ccggatgaac     2820

acaaagtacg acgagaatga taagctcatc cgtgaagtta aggtcattac cctcaagtct     2880

aagttggtgt cggatttccg caaggacttc caattttata aggttcggga gatcaacaat     2940

tatcaccatg cacatgatgc gtacctcaac gcagtcgtgg gaactgcgct catcaagaag     3000

tatcccaagt tggagtccga attcgtctac ggggattata aggtttacga cgtccgcaag     3060

atgatcgcca agagtgagca ggaaattggc aaggccacgg ctaagtattt cttttactcc     3120

aacatcatga atttctttaa gacggagatc acactcgcca atggagaaat ccgtaagcga     3180

cctttgattg agaccaacgg cgagactggt gaaatcgttt gggataaggg gcgcgacttc     3240

gctaccgtgc ggaaggttct gagcatgccg caagtcaata tcgtcaagaa aaccgaggtg     3300

cagacaggcg gtttctctaa ggaatcgatt cttccaaagc gtaactctga caagctgatc     3360

gctcgaaaga aggattggga ccccaagaag tatggagggt tcgattctcc tacagtggca     3420

tactcggttc tcgttgtcgc gaaggttgag aagggaaagt ctaagaagct gaagtcggtc     3480

aaggaactgc tcgggatcac cattatggag cgctccagct tcgaaaagaa tcccatcgac     3540

tttctcgagg ccaagggcta taaggaagtc aagaaggatc ttatcattaa gctgcctaag     3600

tactctttgt tcgagcttga aaacggtcga aagcgaatgc tcgcatcggc aggagagttg     3660

cagaagggga atgaattggc acttccctca aagtacgtga acttcctgta tctcgcgtcc     3720

cactacgaga agctgaaggg tagccctgag gacaacgaac agaagcaact ttttgttgag     3780

caacacaagc attatctgga tgagatcatt gaacagattt cagagttcag taagcgcgtc     3840

atcctcgccg atgctaatct cgacaaggtg ttgtcggcct acaacaagca ccgtgacaag     3900

ccgatccgag agcaggctga aaatatcatt catctgttca ccctcactaa cttgggagca     3960

ccagcagcgt tcaagtattt tgatacgaca atcgaccgta agcgatacac gtccacaaag     4020

gaggtgcttg atgcgaccct gattcatcaa tccatcactg ggctctatga aacccgtatc     4080

gaccttagtc aactgggggg cgacagcagg gctgacccca agaagaagag gaaggtgggt     4140

ggaggaggtt ctggaggtgg aggttctgca gagtatgtgc gggccctctt tgactttaat     4200

gggaatgatg aagaagacct tccctttaag aaaggagaca tcctgagaat ccgggataag     4260

cctgaagagc agtggtggaa tgcagaggac agcgaaggaa agagggggat gattcctgtc     4320

ccttacgtgg agaagtattc cggagactat aaggaccacg acggagacta caaggatcat     4380

gatattgatt acaaagacga tgacgataag tctaggatga ccgacgctga gtacgtgaga     4440

atccatgaga agttggacat ctacacgttt aagaaacagt ttttcaacaa caaaaaatcc     4500

gtgtcgcata gatgctacgt tctctttgaa ttaaaacgac ggggtgaacg tagagcgtgt     4560

ttttggggct atgctgtgaa taaaccacag agcgggacag aacgtggcat tcacgccgaa     4620

atctttagca ttagaaaagt cgaagaatac ctgcgcgaca accccggaca attcacgata     4680

aattggtact catcctggag tccttgtgca gattgcgctg aaaaaatctt agaatggtat     4740

aaccaggagc tgcgggggaa cggccacact ttgaaaatct gggcttgcaa actctattac     4800

gagaaaaatg cgaggaatca aattgggctg tggaacctca gagataacgg ggttgggttg     4860

aatgtaatgg taagtgaaca ctaccaatgt tgcaggaaaa tattcatcca atcgtcgcac     4920

aatcaattga atgagaatag atggcttgag aagactttga agcgagctga aaaacgacgg     4980

agcgagttgt ccattatgat tcaggtaaaa atactccaca ccactaagag tcctgctgtt     5040

tctagaggct ccggaaccaa cctgtccgac atcatcgaga aggagaccgg caagcagctc     5100

gttatccagg agtccatcct gatgctgccc gaggaggtcg aggaggtcat cggcaacaag     5160

cccgagtccg acatcctggt ccacaccgcc tacgacgagt ccaccgacga gaacgtcatg     5220

ctgctgacct ccgacgcccc cgagtacaag ccctgggccc tggtcatcca ggactccaac     5280

ggcgagaaca agatcaagat gctgtccggc ggctccccca agaagaagcg caaggtctaa     5340


<210>  47
<211>  1779
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  dCas9-AID-UGI_protein

<400>  47

Met Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val 
1               5                   10                  15      


Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 
            20                  25                  30          


Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 
        35                  40                  45              


Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 
    50                  55                  60                  


Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 
65                  70                  75                  80  


Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 
                85                  90                  95      


Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys 
            100                 105                 110         


His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr 
        115                 120                 125             


His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp 
    130                 135                 140                 


Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 
145                 150                 155                 160 


Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 
                165                 170                 175     


Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 
            180                 185                 190         


Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 
        195                 200                 205             


Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 
    210                 215                 220                 


Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 
225                 230                 235                 240 


Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 
                245                 250                 255     


Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 
            260                 265                 270         


Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 
        275                 280                 285             


Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 
    290                 295                 300                 


Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 
305                 310                 315                 320 


Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 
                325                 330                 335     


Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 
            340                 345                 350         


Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 
        355                 360                 365             


Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 
    370                 375                 380                 


Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg 
385                 390                 395                 400 


Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 
                405                 410                 415     


Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe 
            420                 425                 430         


Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 
        435                 440                 445             


Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 
    450                 455                 460                 


Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 
465                 470                 475                 480 


Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 
                485                 490                 495     


Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 
            500                 505                 510         


Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 
        515                 520                 525             


Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 
    530                 535                 540                 


Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 
545                 550                 555                 560 


Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 
                565                 570                 575     


Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 
            580                 585                 590         


Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 
        595                 600                 605             


Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 
    610                 615                 620                 


Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 
625                 630                 635                 640 


His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 
                645                 650                 655     


Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 
            660                 665                 670         


Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 
        675                 680                 685             


Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 
    690                 695                 700                 


Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 
705                 710                 715                 720 


His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly 
                725                 730                 735     


Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 
            740                 745                 750         


Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 
        755                 760                 765             


Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile 
    770                 775                 780                 


Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro 
785                 790                 795                 800 


Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu 
                805                 810                 815     


Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 
            820                 825                 830         


Leu Ser Asp Tyr Asp Val Asp Ala Ile Val Pro Gln Ser Phe Leu Lys 
        835                 840                 845             


Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg 
    850                 855                 860                 


Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys 
865                 870                 875                 880 


Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys 
                885                 890                 895     


Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 
            900                 905                 910         


Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 
        915                 920                 925             


Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 
    930                 935                 940                 


Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 
945                 950                 955                 960 


Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 
                965                 970                 975     


Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 
            980                 985                 990         


Val Gly Thr Ala Leu Ile Lys Lys  Tyr Pro Lys Leu Glu  Ser Glu Phe 
        995                 1000                 1005             


Val Tyr  Gly Asp Tyr Lys Val  Tyr Asp Val Arg Lys  Met Ile Ala 
    1010                 1015                 1020             


Lys Ser  Glu Gln Glu Ile Gly  Lys Ala Thr Ala Lys  Tyr Phe Phe 
    1025                 1030                 1035             


Tyr Ser  Asn Ile Met Asn Phe  Phe Lys Thr Glu Ile  Thr Leu Ala 
    1040                 1045                 1050             


Asn Gly  Glu Ile Arg Lys Arg  Pro Leu Ile Glu Thr  Asn Gly Glu 
    1055                 1060                 1065             


Thr Gly  Glu Ile Val Trp Asp  Lys Gly Arg Asp Phe  Ala Thr Val 
    1070                 1075                 1080             


Arg Lys  Val Leu Ser Met Pro  Gln Val Asn Ile Val  Lys Lys Thr 
    1085                 1090                 1095             


Glu Val  Gln Thr Gly Gly Phe  Ser Lys Glu Ser Ile  Leu Pro Lys 
    1100                 1105                 1110             


Arg Asn  Ser Asp Lys Leu Ile  Ala Arg Lys Lys Asp  Trp Asp Pro 
    1115                 1120                 1125             


Lys Lys  Tyr Gly Gly Phe Asp  Ser Pro Thr Val Ala  Tyr Ser Val 
    1130                 1135                 1140             


Leu Val  Val Ala Lys Val Glu  Lys Gly Lys Ser Lys  Lys Leu Lys 
    1145                 1150                 1155             


Ser Val  Lys Glu Leu Leu Gly  Ile Thr Ile Met Glu  Arg Ser Ser 
    1160                 1165                 1170             


Phe Glu  Lys Asn Pro Ile Asp  Phe Leu Glu Ala Lys  Gly Tyr Lys 
    1175                 1180                 1185             


Glu Val  Lys Lys Asp Leu Ile  Ile Lys Leu Pro Lys  Tyr Ser Leu 
    1190                 1195                 1200             


Phe Glu  Leu Glu Asn Gly Arg  Lys Arg Met Leu Ala  Ser Ala Gly 
    1205                 1210                 1215             


Glu Leu  Gln Lys Gly Asn Glu  Leu Ala Leu Pro Ser  Lys Tyr Val 
    1220                 1225                 1230             


Asn Phe  Leu Tyr Leu Ala Ser  His Tyr Glu Lys Leu  Lys Gly Ser 
    1235                 1240                 1245             


Pro Glu  Asp Asn Glu Gln Lys  Gln Leu Phe Val Glu  Gln His Lys 
    1250                 1255                 1260             


His Tyr  Leu Asp Glu Ile Ile  Glu Gln Ile Ser Glu  Phe Ser Lys 
    1265                 1270                 1275             


Arg Val  Ile Leu Ala Asp Ala  Asn Leu Asp Lys Val  Leu Ser Ala 
    1280                 1285                 1290             


Tyr Asn  Lys His Arg Asp Lys  Pro Ile Arg Glu Gln  Ala Glu Asn 
    1295                 1300                 1305             


Ile Ile  His Leu Phe Thr Leu  Thr Asn Leu Gly Ala  Pro Ala Ala 
    1310                 1315                 1320             


Phe Lys  Tyr Phe Asp Thr Thr  Ile Asp Arg Lys Arg  Tyr Thr Ser 
    1325                 1330                 1335             


Thr Lys  Glu Val Leu Asp Ala  Thr Leu Ile His Gln  Ser Ile Thr 
    1340                 1345                 1350             


Gly Leu  Tyr Glu Thr Arg Ile  Asp Leu Ser Gln Leu  Gly Gly Asp 
    1355                 1360                 1365             


Ser Arg  Ala Asp Pro Lys Lys  Lys Arg Lys Val Gly  Gly Gly Gly 
    1370                 1375                 1380             


Ser Gly  Gly Gly Gly Ser Ala  Glu Tyr Val Arg Ala  Leu Phe Asp 
    1385                 1390                 1395             


Phe Asn  Gly Asn Asp Glu Glu  Asp Leu Pro Phe Lys  Lys Gly Asp 
    1400                 1405                 1410             


Ile Leu  Arg Ile Arg Asp Lys  Pro Glu Glu Gln Trp  Trp Asn Ala 
    1415                 1420                 1425             


Glu Asp  Ser Glu Gly Lys Arg  Gly Met Ile Pro Val  Pro Tyr Val 
    1430                 1435                 1440             


Glu Lys  Tyr Ser Gly Asp Tyr  Lys Asp His Asp Gly  Asp Tyr Lys 
    1445                 1450                 1455             


Asp His  Asp Ile Asp Tyr Lys  Asp Asp Asp Asp Lys  Ser Arg Met 
    1460                 1465                 1470             


Thr Asp  Ala Glu Tyr Val Arg  Ile His Glu Lys Leu  Asp Ile Tyr 
    1475                 1480                 1485             


Thr Phe  Lys Lys Gln Phe Phe  Asn Asn Lys Lys Ser  Val Ser His 
    1490                 1495                 1500             


Arg Cys  Tyr Val Leu Phe Glu  Leu Lys Arg Arg Gly  Glu Arg Arg 
    1505                 1510                 1515             


Ala Cys  Phe Trp Gly Tyr Ala  Val Asn Lys Pro Gln  Ser Gly Thr 
    1520                 1525                 1530             


Glu Arg  Gly Ile His Ala Glu  Ile Phe Ser Ile Arg  Lys Val Glu 
    1535                 1540                 1545             


Glu Tyr  Leu Arg Asp Asn Pro  Gly Gln Phe Thr Ile  Asn Trp Tyr 
    1550                 1555                 1560             


Ser Ser  Trp Ser Pro Cys Ala  Asp Cys Ala Glu Lys  Ile Leu Glu 
    1565                 1570                 1575             


Trp Tyr  Asn Gln Glu Leu Arg  Gly Asn Gly His Thr  Leu Lys Ile 
    1580                 1585                 1590             


Trp Ala  Cys Lys Leu Tyr Tyr  Glu Lys Asn Ala Arg  Asn Gln Ile 
    1595                 1600                 1605             


Gly Leu  Trp Asn Leu Arg Asp  Asn Gly Val Gly Leu  Asn Val Met 
    1610                 1615                 1620             


Val Ser  Glu His Tyr Gln Cys  Cys Arg Lys Ile Phe  Ile Gln Ser 
    1625                 1630                 1635             


Ser His  Asn Gln Leu Asn Glu  Asn Arg Trp Leu Glu  Lys Thr Leu 
    1640                 1645                 1650             


Lys Arg  Ala Glu Lys Arg Arg  Ser Glu Leu Ser Ile  Met Ile Gln 
    1655                 1660                 1665             


Val Lys  Ile Leu His Thr Thr  Lys Ser Pro Ala Val  Ser Arg Gly 
    1670                 1675                 1680             


Ser Gly  Thr Asn Leu Ser Asp  Ile Ile Glu Lys Glu  Thr Gly Lys 
    1685                 1690                 1695             


Gln Leu  Val Ile Gln Glu Ser  Ile Leu Met Leu Pro  Glu Glu Val 
    1700                 1705                 1710             


Glu Glu  Val Ile Gly Asn Lys  Pro Glu Ser Asp Ile  Leu Val His 
    1715                 1720                 1725             


Thr Ala  Tyr Asp Glu Ser Thr  Asp Glu Asn Val Met  Leu Leu Thr 
    1730                 1735                 1740             


Ser Asp  Ala Pro Glu Tyr Lys  Pro Trp Ala Leu Val  Ile Gln Asp 
    1745                 1750                 1755             


Ser Asn  Gly Glu Asn Lys Ile  Lys Met Leu Ser Gly  Gly Ser Pro 
    1760                 1765                 1770             


Lys Lys  Lys Arg Lys Val 
    1775                 


<210>  48
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer pks_seq_r1

<400>  48
atttgcaaga gtggtttgtg                                                   20


<210>  49
<211>  3792
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Mad7_coding

<400>  49
atgaacaacg gcacaaacaa cttccagaac ttcattggaa tctcgtcgtt gcagaagact       60

ttgcgcaacg ccctcatccc cacagaaact acccagcagt tcattgtgaa gaacggaatc      120

atcaaggaag atgaactccg aggcgagaac cgccagattt tgaaggacat catggatgat      180

tactaccgtg gtttcatctc ggaaacgctc tcctccattg acgacatcga ttggacttcg      240

ttgttcgaaa agatggaaat ccagctcaaa aacggcgata acaaggatac cttgatcaag      300

gagcagaccg agtatcggaa ggcgatccat aagaagttcg ccaacgatga tcggttcaag      360

aacatgttct cggccaagtt gatttccgac attctccccg aattcgtgat ccataacaac      420

aactactcgg cgtcggagaa ggaggagaag acgcaggtca tcaagttgtt ctcgaggttc      480

gccacatcgt tcaaagacta ttttaagaat cgtgcgaact gtttctcggc agatgatatc      540

tcctcgtcct cctgtcaccg cattgtgaac gacaacgcgg aaatcttctt ctcgaacgcg      600

ttggtgtata ggcgcatcgt gaagtccctc tccaacgatg acatcaacaa aatctcggga      660

gatatgaagg attcgctcaa ggagatgtcg ttggaggaaa tctactccta tgagaagtat      720

ggcgagttca ttacgcagga gggcatttcc ttctacaacg acatttgtgg taaagtcaac      780

tcgttcatga acctctactg tcagaaaaac aaggagaaca aaaacctcta taagctccag      840

aagttgcata agcagatcct ctgtatcgca gacacctcgt acgaggtccc ttacaagttc      900

gaatccgatg aggaggtcta ccagtccgtc aacggattct tggacaacat ctcctcgaaa      960

cacattgtcg agcggctccg aaagatcggc gataactaca acggctacaa cttggacaaa     1020

atctatatcg tctccaagtt ctatgagtcc gtctcgcaga aaacctatcg tgattgggag     1080

actatcaaca ctgcgctcga gattcactat aacaacatct tgcctggtaa cggcaaatcg     1140

aaagccgaca aggtgaagaa ggccgtgaaa aacgatctcc agaagtcgat cacagaaatc     1200

aacgaactcg tctcgaacta caagctctgt tcggatgata acatcaaggc ggaaacgtac     1260

atccatgaaa tctcgcatat cttgaacaac ttcgaggccc aggaactcaa atacaacccc     1320

gagatccact tggtcgagtc ggagctcaaa gcctcggagt tgaagaacgt cttggatgtc     1380

atcatgaacg cattccactg gtgttccgtg ttcatgaccg aggaactcgt cgataaagac     1440

aacaacttct acgcggaact cgaggaaatc tacgatgaaa tctatcccgt gatctccctc     1500

tacaacctcg tgcgaaacta cgtcactcag aagccctatt ccaccaagaa gatcaagctc     1560

aacttcggca tccccactct cgcagacggt tggtcgaagt cgaaggagta ctccaacaac     1620

gccattatcc tcatgcgaga caacctctac tacttgggta tcttcaacgc aaagaacaag     1680

ccggataaga agatcattga aggcaacact tcggaaaaca agggagacta taagaagatg     1740

atctacaacc tcctccctgg acccaacaag atgattccta aagtgttcct ctcgtcgaag     1800

actggtgtgg aaacgtataa gccgtcggcc tacatcttgg agggctacaa acagaacaag     1860

catatcaagt cctcgaagga cttcgacatc actttctgtc acgacctcat cgactatttc     1920

aagaactgta ttgcaatcca tccggaatgg aagaacttcg gcttcgattt ctcggatact     1980

tcgacatacg aagatatctc gggattctac cgagaggtcg aattgcaggg ctataagatt     2040

gattggacct acatctcgga aaaggatatc gacttgctcc aggaaaaggg ccagctctac     2100

ctcttccaga tttacaacaa ggacttctcc aagaagtcga cgggtaacga caacttgcac     2160

acaatgtatc tcaaaaacct cttctcggag gagaacttga aggatatcgt gctcaaattg     2220

aacggagagg ccgaaatctt cttccgtaag tcctccatca agaacccgat catccataag     2280

aagggatcga tcttggtcaa ccggacttac gaagcagagg aaaaagatca gttcggaaac     2340

atccagattg tcaggaagaa catccctgaa aacatctatc aggagttgta taagtacttc     2400

aacgacaagt cggataagga gctctccgac gaagcagcca aactcaagaa cgtcgtcgga     2460

caccatgaag cagcaaccaa cattgtgaag gactaccggt acacttacga caagtacttc     2520

ttgcacatgc cgatcactat caacttcaaa gccaacaaga ccggattcat taacgacagg     2580

atcctccagt acattgccaa agaaaaggac ctccatgtca tcggtatcga caggggagaa     2640

cggaacctca tctacgtctc cgtgattgac acttgtggca acattgtcga acagaagtcg     2700

ttcaacatcg tcaacggtta cgattaccag attaagttga aacagcagga aggtgcgagg     2760

cagattgcgc gaaaggaatg gaaggagatt ggcaaaatca aggagattaa ggaaggctac     2820

ttgtcgttgg tcatccacga aatctcgaaa atggtgatca aatacaacgc catcatcgcc     2880

atggaagacc tctcgtacgg cttcaaaaag ggacggttca aagtggagcg tcaggtgtac     2940

cagaagttcg aaacaatgtt gatcaacaag ttgaactact tggtgttcaa ggacatttcc     3000

attaccgaga acggaggatt gctcaagggt tatcagctca cgtacatccc cgacaagttg     3060

aaaaacgtgg gacaccagtg tggctgtatc ttctacgtgc ctgcagccta cacgtcgaaa     3120

atcgacccta caacaggatt cgtgaacatc ttcaagttca aggatctcac cgtcgacgcg     3180

aagcgggagt tcatcaaaaa gttcgactcc atccgctatg attcggagaa gaacttgttc     3240

tgtttcacat tcgactacaa caacttcatt actcagaaca ccgtgatgtc caaatcgtcg     3300

tggtccgtgt acacgtatgg tgtgcgcatc aaaaggcgct tcgtcaacgg tcgcttctcc     3360

aacgaatcgg acacgatcga tatcacgaaa gacatggaga aaacattgga aatgaccgac     3420

atcaactggc gtgacggcca tgacctcagg caggacatca tcgattacga gatcgtccag     3480

cacatcttcg aaatcttccg tctcaccgtg cagatgagga actccctctc cgagctcgaa     3540

gatcgggatt acgaccggct catttcccct gtgttgaacg agaacaacat cttctacgac     3600

tcggcaaaag cgggagatgc attgccgaag gacgccgatg cgaacggtgc atattgtatt     3660

gcactcaagg gtctctacga aatcaagcag atcaccgaaa actggaagga ggacggcaaa     3720

ttctcgaggg acaagttgaa gatttcgaac aaggattggt tcgatttcat ccagaacaag     3780

aggtacttgt aa                                                         3792


<210>  50
<211>  1263
<212>  PRT
<213>  Eubacterium rectale

<400>  50

Met Asn Asn Gly Thr Asn Asn Phe Gln Asn Phe Ile Gly Ile Ser Ser 
1               5                   10                  15      


Leu Gln Lys Thr Leu Arg Asn Ala Leu Ile Pro Thr Glu Thr Thr Gln 
            20                  25                  30          


Gln Phe Ile Val Lys Asn Gly Ile Ile Lys Glu Asp Glu Leu Arg Gly 
        35                  40                  45              


Glu Asn Arg Gln Ile Leu Lys Asp Ile Met Asp Asp Tyr Tyr Arg Gly 
    50                  55                  60                  


Phe Ile Ser Glu Thr Leu Ser Ser Ile Asp Asp Ile Asp Trp Thr Ser 
65                  70                  75                  80  


Leu Phe Glu Lys Met Glu Ile Gln Leu Lys Asn Gly Asp Asn Lys Asp 
                85                  90                  95      


Thr Leu Ile Lys Glu Gln Thr Glu Tyr Arg Lys Ala Ile His Lys Lys 
            100                 105                 110         


Phe Ala Asn Asp Asp Arg Phe Lys Asn Met Phe Ser Ala Lys Leu Ile 
        115                 120                 125             


Ser Asp Ile Leu Pro Glu Phe Val Ile His Asn Asn Asn Tyr Ser Ala 
    130                 135                 140                 


Ser Glu Lys Glu Glu Lys Thr Gln Val Ile Lys Leu Phe Ser Arg Phe 
145                 150                 155                 160 


Ala Thr Ser Phe Lys Asp Tyr Phe Lys Asn Arg Ala Asn Cys Phe Ser 
                165                 170                 175     


Ala Asp Asp Ile Ser Ser Ser Ser Cys His Arg Ile Val Asn Asp Asn 
            180                 185                 190         


Ala Glu Ile Phe Phe Ser Asn Ala Leu Val Tyr Arg Arg Ile Val Lys 
        195                 200                 205             


Ser Leu Ser Asn Asp Asp Ile Asn Lys Ile Ser Gly Asp Met Lys Asp 
    210                 215                 220                 


Ser Leu Lys Glu Met Ser Leu Glu Glu Ile Tyr Ser Tyr Glu Lys Tyr 
225                 230                 235                 240 


Gly Glu Phe Ile Thr Gln Glu Gly Ile Ser Phe Tyr Asn Asp Ile Cys 
                245                 250                 255     


Gly Lys Val Asn Ser Phe Met Asn Leu Tyr Cys Gln Lys Asn Lys Glu 
            260                 265                 270         


Asn Lys Asn Leu Tyr Lys Leu Gln Lys Leu His Lys Gln Ile Leu Cys 
        275                 280                 285             


Ile Ala Asp Thr Ser Tyr Glu Val Pro Tyr Lys Phe Glu Ser Asp Glu 
    290                 295                 300                 


Glu Val Tyr Gln Ser Val Asn Gly Phe Leu Asp Asn Ile Ser Ser Lys 
305                 310                 315                 320 


His Ile Val Glu Arg Leu Arg Lys Ile Gly Asp Asn Tyr Asn Gly Tyr 
                325                 330                 335     


Asn Leu Asp Lys Ile Tyr Ile Val Ser Lys Phe Tyr Glu Ser Val Ser 
            340                 345                 350         


Gln Lys Thr Tyr Arg Asp Trp Glu Thr Ile Asn Thr Ala Leu Glu Ile 
        355                 360                 365             


His Tyr Asn Asn Ile Leu Pro Gly Asn Gly Lys Ser Lys Ala Asp Lys 
    370                 375                 380                 


Val Lys Lys Ala Val Lys Asn Asp Leu Gln Lys Ser Ile Thr Glu Ile 
385                 390                 395                 400 


Asn Glu Leu Val Ser Asn Tyr Lys Leu Cys Ser Asp Asp Asn Ile Lys 
                405                 410                 415     


Ala Glu Thr Tyr Ile His Glu Ile Ser His Ile Leu Asn Asn Phe Glu 
            420                 425                 430         


Ala Gln Glu Leu Lys Tyr Asn Pro Glu Ile His Leu Val Glu Ser Glu 
        435                 440                 445             


Leu Lys Ala Ser Glu Leu Lys Asn Val Leu Asp Val Ile Met Asn Ala 
    450                 455                 460                 


Phe His Trp Cys Ser Val Phe Met Thr Glu Glu Leu Val Asp Lys Asp 
465                 470                 475                 480 


Asn Asn Phe Tyr Ala Glu Leu Glu Glu Ile Tyr Asp Glu Ile Tyr Pro 
                485                 490                 495     


Val Ile Ser Leu Tyr Asn Leu Val Arg Asn Tyr Val Thr Gln Lys Pro 
            500                 505                 510         


Tyr Ser Thr Lys Lys Ile Lys Leu Asn Phe Gly Ile Pro Thr Leu Ala 
        515                 520                 525             


Asp Gly Trp Ser Lys Ser Lys Glu Tyr Ser Asn Asn Ala Ile Ile Leu 
    530                 535                 540                 


Met Arg Asp Asn Leu Tyr Tyr Leu Gly Ile Phe Asn Ala Lys Asn Lys 
545                 550                 555                 560 


Pro Asp Lys Lys Ile Ile Glu Gly Asn Thr Ser Glu Asn Lys Gly Asp 
                565                 570                 575     


Tyr Lys Lys Met Ile Tyr Asn Leu Leu Pro Gly Pro Asn Lys Met Ile 
            580                 585                 590         


Pro Lys Val Phe Leu Ser Ser Lys Thr Gly Val Glu Thr Tyr Lys Pro 
        595                 600                 605             


Ser Ala Tyr Ile Leu Glu Gly Tyr Lys Gln Asn Lys His Ile Lys Ser 
    610                 615                 620                 


Ser Lys Asp Phe Asp Ile Thr Phe Cys His Asp Leu Ile Asp Tyr Phe 
625                 630                 635                 640 


Lys Asn Cys Ile Ala Ile His Pro Glu Trp Lys Asn Phe Gly Phe Asp 
                645                 650                 655     


Phe Ser Asp Thr Ser Thr Tyr Glu Asp Ile Ser Gly Phe Tyr Arg Glu 
            660                 665                 670         


Val Glu Leu Gln Gly Tyr Lys Ile Asp Trp Thr Tyr Ile Ser Glu Lys 
        675                 680                 685             


Asp Ile Asp Leu Leu Gln Glu Lys Gly Gln Leu Tyr Leu Phe Gln Ile 
    690                 695                 700                 


Tyr Asn Lys Asp Phe Ser Lys Lys Ser Thr Gly Asn Asp Asn Leu His 
705                 710                 715                 720 


Thr Met Tyr Leu Lys Asn Leu Phe Ser Glu Glu Asn Leu Lys Asp Ile 
                725                 730                 735     


Val Leu Lys Leu Asn Gly Glu Ala Glu Ile Phe Phe Arg Lys Ser Ser 
            740                 745                 750         


Ile Lys Asn Pro Ile Ile His Lys Lys Gly Ser Ile Leu Val Asn Arg 
        755                 760                 765             


Thr Tyr Glu Ala Glu Glu Lys Asp Gln Phe Gly Asn Ile Gln Ile Val 
    770                 775                 780                 


Arg Lys Asn Ile Pro Glu Asn Ile Tyr Gln Glu Leu Tyr Lys Tyr Phe 
785                 790                 795                 800 


Asn Asp Lys Ser Asp Lys Glu Leu Ser Asp Glu Ala Ala Lys Leu Lys 
                805                 810                 815     


Asn Val Val Gly His His Glu Ala Ala Thr Asn Ile Val Lys Asp Tyr 
            820                 825                 830         


Arg Tyr Thr Tyr Asp Lys Tyr Phe Leu His Met Pro Ile Thr Ile Asn 
        835                 840                 845             


Phe Lys Ala Asn Lys Thr Gly Phe Ile Asn Asp Arg Ile Leu Gln Tyr 
    850                 855                 860                 


Ile Ala Lys Glu Lys Asp Leu His Val Ile Gly Ile Asp Arg Gly Glu 
865                 870                 875                 880 


Arg Asn Leu Ile Tyr Val Ser Val Ile Asp Thr Cys Gly Asn Ile Val 
                885                 890                 895     


Glu Gln Lys Ser Phe Asn Ile Val Asn Gly Tyr Asp Tyr Gln Ile Lys 
            900                 905                 910         


Leu Lys Gln Gln Glu Gly Ala Arg Gln Ile Ala Arg Lys Glu Trp Lys 
        915                 920                 925             


Glu Ile Gly Lys Ile Lys Glu Ile Lys Glu Gly Tyr Leu Ser Leu Val 
    930                 935                 940                 


Ile His Glu Ile Ser Lys Met Val Ile Lys Tyr Asn Ala Ile Ile Ala 
945                 950                 955                 960 


Met Glu Asp Leu Ser Tyr Gly Phe Lys Lys Gly Arg Phe Lys Val Glu 
                965                 970                 975     


Arg Gln Val Tyr Gln Lys Phe Glu Thr Met Leu Ile Asn Lys Leu Asn 
            980                 985                 990         


Tyr Leu Val Phe Lys Asp Ile Ser  Ile Thr Glu Asn Gly  Gly Leu Leu 
        995                 1000                 1005             


Lys Gly  Tyr Gln Leu Thr Tyr  Ile Pro Asp Lys Leu  Lys Asn Val 
    1010                 1015                 1020             


Gly His  Gln Cys Gly Cys Ile  Phe Tyr Val Pro Ala  Ala Tyr Thr 
    1025                 1030                 1035             


Ser Lys  Ile Asp Pro Thr Thr  Gly Phe Val Asn Ile  Phe Lys Phe 
    1040                 1045                 1050             


Lys Asp  Leu Thr Val Asp Ala  Lys Arg Glu Phe Ile  Lys Lys Phe 
    1055                 1060                 1065             


Asp Ser  Ile Arg Tyr Asp Ser  Glu Lys Asn Leu Phe  Cys Phe Thr 
    1070                 1075                 1080             


Phe Asp  Tyr Asn Asn Phe Ile  Thr Gln Asn Thr Val  Met Ser Lys 
    1085                 1090                 1095             


Ser Ser  Trp Ser Val Tyr Thr  Tyr Gly Val Arg Ile  Lys Arg Arg 
    1100                 1105                 1110             


Phe Val  Asn Gly Arg Phe Ser  Asn Glu Ser Asp Thr  Ile Asp Ile 
    1115                 1120                 1125             


Thr Lys  Asp Met Glu Lys Thr  Leu Glu Met Thr Asp  Ile Asn Trp 
    1130                 1135                 1140             


Arg Asp  Gly His Asp Leu Arg  Gln Asp Ile Ile Asp  Tyr Glu Ile 
    1145                 1150                 1155             


Val Gln  His Ile Phe Glu Ile  Phe Arg Leu Thr Val  Gln Met Arg 
    1160                 1165                 1170             


Asn Ser  Leu Ser Glu Leu Glu  Asp Arg Asp Tyr Asp  Arg Leu Ile 
    1175                 1180                 1185             


Ser Pro  Val Leu Asn Glu Asn  Asn Ile Phe Tyr Asp  Ser Ala Lys 
    1190                 1195                 1200             


Ala Gly  Asp Ala Leu Pro Lys  Asp Ala Asp Ala Asn  Gly Ala Tyr 
    1205                 1210                 1215             


Cys Ile  Ala Leu Lys Gly Leu  Tyr Glu Ile Lys Gln  Ile Thr Glu 
    1220                 1225                 1230             


Asn Trp  Lys Glu Asp Gly Lys  Phe Ser Arg Asp Lys  Leu Lys Ile 
    1235                 1240                 1245             


Ser Asn  Lys Asp Trp Phe Asp  Phe Ile Gln Asn Lys  Arg Tyr Leu 
    1250                 1255                 1260             


<210>  51
<211>  192
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  LbCpf1_partial_protein

<400>  51

Ile Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu 
1               5                   10                  15      


His Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln 
            20                  25                  30          


Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu 
        35                  40                  45              


Lys Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn 
    50                  55                  60                  


Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val 
65                  70                  75                  80  


Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro 
                85                  90                  95      


Ile Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu 
            100                 105                 110         


Val Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile 
        115                 120                 125             


Asp Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys 
    130                 135                 140                 


Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe 
145                 150                 155                 160 


Asn Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys 
                165                 170                 175     


Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn 
            180                 185                 190         


<210>  52
<211>  189
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  FnCpf1_partial_protein

<400>  52

Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg Pro Asn Leu 
1               5                   10                  15      


His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn Leu Gln Asp 
            20                  25                  30          


Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe Tyr Arg Lys Gln 
        35                  40                  45              


Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys Glu Ala Ile Ala Asn 
    50                  55                  60                  


Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser Val Phe Glu Tyr Asp Leu 
65                  70                  75                  80  


Ile Lys Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe His Cys Pro 
                85                  90                  95      


Ile Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe Asn Asp Glu 
            100                 105                 110         


Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His Ile Leu Ser 
        115                 120                 125             


Ile Asp Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu Val Asp Gly 
    130                 135                 140                 


Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile Gly Asn Asp 
145                 150                 155                 160 


Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala Ile Glu Lys Asp 
                165                 170                 175     


Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn 
            180                 185                 


<210>  53
<211>  229
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Mad7_partial_protein

<400>  53

Ile Tyr Asn Lys Asp Phe Ser Lys Lys Ser Thr Gly Asn Asp Asn Leu 
1               5                   10                  15      


His Thr Met Tyr Leu Lys Asn Leu Phe Ser Glu Glu Asn Leu Lys Asp 
            20                  25                  30          


Ile Val Leu Lys Leu Asn Gly Glu Ala Glu Ile Phe Phe Arg Lys Ser 
        35                  40                  45              


Ser Ile Lys Asn Pro Ile Ile His Lys Lys Gly Ser Ile Leu Val Asn 
    50                  55                  60                  


Arg Thr Tyr Glu Ala Glu Glu Lys Asp Gln Phe Gly Asn Ile Gln Ile 
65                  70                  75                  80  


Val Arg Lys Asn Ile Pro Glu Asn Ile Tyr Gln Glu Leu Tyr Lys Tyr 
                85                  90                  95      


Phe Asn Asp Lys Ser Asp Lys Glu Leu Ser Asp Glu Ala Ala Lys Leu 
            100                 105                 110         


Lys Asn Val Val Gly His His Glu Ala Ala Thr Asn Ile Val Lys Asp 
        115                 120                 125             


Tyr Arg Tyr Thr Tyr Asp Lys Tyr Phe Leu His Met Pro Ile Thr Ile 
    130                 135                 140                 


Asn Phe Lys Ala Asn Lys Thr Gly Phe Ile Asn Asp Arg Ile Leu Gln 
145                 150                 155                 160 


Tyr Ile Ala Lys Glu Lys Asp Leu His Val Ile Gly Ile Asp Arg Gly 
                165                 170                 175     


Glu Arg Asn Leu Ile Tyr Val Ser Val Ile Asp Thr Cys Gly Asn Ile 
            180                 185                 190         


Val Glu Gln Lys Ser Phe Asn Ile Val Asn Gly Tyr Asp Tyr Gln Ile 
        195                 200                 205             


Lys Leu Lys Gln Gln Glu Gly Ala Arg Gln Ile Ala Arg Lys Glu Trp 
    210                 215                 220                 


Lys Glu Ile Gly Lys 
225                 


<210>  54
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer IF-Ptef-rev

<400>  54
ggtgaaggtt gtgttatgtt ttgtgg                                            26


<210>  55
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer IF-nls-fwd

<400>  55
agcagggctg accccaagaa g                                                 21


<210>  56
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer IF-PtfMd7-fwd

<400>  56
aacacaacct tcaccatgaa caacggcaca aacaac                                 36


<210>  57
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer IF-nlsMd7-rev

<400>  57
ggggtcagcc ctgctaggca agtacctctt gttctg                                 36


<210>  58
<211>  88
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer sg-site_fwd

<400>  58
ttcgattcac ggatgatgca gtcaaaagac ctttttaatt tctactcttg tagatagatc       60

ttttttttgg ctcttgggtt cgaactgc                                          88


<210>  59
<211>  130
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer sg-site_rev

<400>  59
tttggcttcg gaaagcacac gtgaagggta ggaacaaaga tggaatgatt ggcaggggtg       60

acccaaatgg tggggcataa aaaaaaagat gaccaaaaca tgggccttgg gcagttcgaa      120

cccaagagcc                                                             130


<210>  60
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Mad7 sgRNA backbone

<400>  60
gtcaaaagac ctttttaatt tctactcttg tagat                                  35


<210>  61
<211>  5028
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Mad7d-AID-UGI_coding

<400>  61
atgaacaacg gcacaaacaa cttccagaac ttcattggaa tctcgtcgtt gcagaagact       60

ttgcgcaacg ccctcatccc cacagaaact acccagcagt tcattgtgaa gaacggaatc      120

atcaaggaag atgaactccg aggcgagaac cgccagattt tgaaggacat catggatgat      180

tactaccgtg gtttcatctc ggaaacgctc tcctccattg acgacatcga ttggacttcg      240

ttgttcgaaa agatggaaat ccagctcaaa aacggcgata acaaggatac cttgatcaag      300

gagcagaccg agtatcggaa ggcgatccat aagaagttcg ccaacgatga tcggttcaag      360

aacatgttct cggccaagtt gatttccgac attctccccg aattcgtgat ccataacaac      420

aactactcgg cgtcggagaa ggaggagaag acgcaggtca tcaagttgtt ctcgaggttc      480

gccacatcgt tcaaagacta ttttaagaat cgtgcgaact gtttctcggc agatgatatc      540

tcctcgtcct cctgtcaccg cattgtgaac gacaacgcgg aaatcttctt ctcgaacgcg      600

ttggtgtata ggcgcatcgt gaagtccctc tccaacgatg acatcaacaa aatctcggga      660

gatatgaagg attcgctcaa ggagatgtcg ttggaggaaa tctactccta tgagaagtat      720

ggcgagttca ttacgcagga gggcatttcc ttctacaacg acatttgtgg taaagtcaac      780

tcgttcatga acctctactg tcagaaaaac aaggagaaca aaaacctcta taagctccag      840

aagttgcata agcagatcct ctgtatcgca gacacctcgt acgaggtccc ttacaagttc      900

gaatccgatg aggaggtcta ccagtccgtc aacggattct tggacaacat ctcctcgaaa      960

cacattgtcg agcggctccg aaagatcggc gataactaca acggctacaa cttggacaaa     1020

atctatatcg tctccaagtt ctatgagtcc gtctcgcaga aaacctatcg tgattgggag     1080

actatcaaca ctgcgctcga gattcactat aacaacatct tgcctggtaa cggcaaatcg     1140

aaagccgaca aggtgaagaa ggccgtgaaa aacgatctcc agaagtcgat cacagaaatc     1200

aacgaactcg tctcgaacta caagctctgt tcggatgata acatcaaggc ggaaacgtac     1260

atccatgaaa tctcgcatat cttgaacaac ttcgaggccc aggaactcaa atacaacccc     1320

gagatccact tggtcgagtc ggagctcaaa gcctcggagt tgaagaacgt cttggatgtc     1380

atcatgaacg cattccactg gtgttccgtg ttcatgaccg aggaactcgt cgataaagac     1440

aacaacttct acgcggaact cgaggaaatc tacgatgaaa tctatcccgt gatctccctc     1500

tacaacctcg tgcgaaacta cgtcactcag aagccctatt ccaccaagaa gatcaagctc     1560

aacttcggca tccccactct cgcagacggt tggtcgaagt cgaaggagta ctccaacaac     1620

gccattatcc tcatgcgaga caacctctac tacttgggta tcttcaacgc aaagaacaag     1680

ccggataaga agatcattga aggcaacact tcggaaaaca agggagacta taagaagatg     1740

atctacaacc tcctccctgg acccaacaag atgattccta aagtgttcct ctcgtcgaag     1800

actggtgtgg aaacgtataa gccgtcggcc tacatcttgg agggctacaa acagaacaag     1860

catatcaagt cctcgaagga cttcgacatc actttctgtc acgacctcat cgactatttc     1920

aagaactgta ttgcaatcca tccggaatgg aagaacttcg gcttcgattt ctcggatact     1980

tcgacatacg aagatatctc gggattctac cgagaggtcg aattgcaggg ctataagatt     2040

gattggacct acatctcgga aaaggatatc gacttgctcc aggaaaaggg ccagctctac     2100

ctcttccaga tttacaacaa ggacttctcc aagaagtcga cgggtaacga caacttgcac     2160

acaatgtatc tcaaaaacct cttctcggag gagaacttga aggatatcgt gctcaaattg     2220

aacggagagg ccgaaatctt cttccgtaag tcctccatca agaacccgat catccataag     2280

aagggatcga tcttggtcaa ccggacttac gaagcagagg aaaaagatca gttcggaaac     2340

atccagattg tcaggaagaa catccctgaa aacatctatc aggagttgta taagtacttc     2400

aacgacaagt cggataagga gctctccgac gaagcagcca aactcaagaa cgtcgtcgga     2460

caccatgaag cagcaaccaa cattgtgaag gactaccggt acacttacga caagtacttc     2520

ttgcacatgc cgatcactat caacttcaaa gccaacaaga ccggattcat taacgacagg     2580

atcctccagt acattgccaa agaaaaggac ctccatgtca tcggtatcgc gaggggagaa     2640

cggaacctca tctacgtctc cgtgattgac acttgtggca acattgtcga acagaagtcg     2700

ttcaacatcg tcaacggtta cgattaccag attaagttga aacagcagga aggtgcgagg     2760

cagattgcgc gaaaggaatg gaaggagatt ggcaaaatca aggagattaa ggaaggctac     2820

ttgtcgttgg tcatccacga aatctcgaaa atggtgatca aatacaacgc catcatcgcc     2880

atggaagacc tctcgtacgg cttcaaaaag ggacggttca aagtggagcg tcaggtgtac     2940

cagaagttcg aaacaatgtt gatcaacaag ttgaactact tggtgttcaa ggacatttcc     3000

attaccgaga acggaggatt gctcaagggt tatcagctca cgtacatccc cgacaagttg     3060

aaaaacgtgg gacaccagtg tggctgtatc ttctacgtgc ctgcagccta cacgtcgaaa     3120

atcgacccta caacaggatt cgtgaacatc ttcaagttca aggatctcac cgtcgacgcg     3180

aagcgggagt tcatcaaaaa gttcgactcc atccgctatg attcggagaa gaacttgttc     3240

tgtttcacat tcgactacaa caacttcatt actcagaaca ccgtgatgtc caaatcgtcg     3300

tggtccgtgt acacgtatgg tgtgcgcatc aaaaggcgct tcgtcaacgg tcgcttctcc     3360

aacgaatcgg acacgatcga tatcacgaaa gacatggaga aaacattgga aatgaccgac     3420

atcaactggc gtgacggcca tgacctcagg caggacatca tcgattacga gatcgtccag     3480

cacatcttcg aaatcttccg tctcaccgtg cagatgagga actccctctc cgagctcgaa     3540

gatcgggatt acgaccggct catttcccct gtgttgaacg agaacaacat cttctacgac     3600

tcggcaaaag cgggagatgc attgccgaag gacgccgatg cgaacggtgc atattgtatt     3660

gcactcaagg gtctctacga aatcaagcag atcaccgaaa actggaagga ggacggcaaa     3720

ttctcgaggg acaagttgaa gatttcgaac aaggattggt tcgatttcat ccagaacaag     3780

aggtacttgc ctagcagggc tgaccccaag aagaagagga aggtgggtgg aggaggttct     3840

ggaggtggag gttctgcaga gtatgtgcgg gccctctttg actttaatgg gaatgatgaa     3900

gaagaccttc cctttaagaa aggagacatc ctgagaatcc gggataagcc tgaagagcag     3960

tggtggaatg cagaggacag cgaaggaaag agggggatga ttcctgtccc ttacgtggag     4020

aagtattccg gagactataa ggaccacgac ggagactaca aggatcatga tattgattac     4080

aaagacgatg acgataagtc taggatgacc gacgctgagt acgtgagaat ccatgagaag     4140

ttggacatct acacgtttaa gaaacagttt ttcaacaaca aaaaatccgt gtcgcataga     4200

tgctacgttc tctttgaatt aaaacgacgg ggtgaacgta gagcgtgttt ttggggctat     4260

gctgtgaata aaccacagag cgggacagaa cgtggcattc acgccgaaat ctttagcatt     4320

agaaaagtcg aagaatacct gcgcgacaac cccggacaat tcacgataaa ttggtactca     4380

tcctggagtc cttgtgcaga ttgcgctgaa aaaatcttag aatggtataa ccaggagctg     4440

cgggggaacg gccacacttt gaaaatctgg gcttgcaaac tctattacga gaaaaatgcg     4500

aggaatcaaa ttgggctgtg gaacctcaga gataacgggg ttgggttgaa tgtaatggta     4560

agtgaacact accaatgttg caggaaaata ttcatccaat cgtcgcacaa tcaattgaat     4620

gagaatagat ggcttgagaa gactttgaag cgagctgaaa aacgacggag cgagttgtcc     4680

attatgattc aggtaaaaat actccacacc actaagagtc ctgctgtttc tagaggctcc     4740

ggaaccaacc tgtccgacat catcgagaag gagaccggca agcagctcgt tatccaggag     4800

tccatcctga tgctgcccga ggaggtcgag gaggtcatcg gcaacaagcc cgagtccgac     4860

atcctggtcc acaccgccta cgacgagtcc accgacgaga acgtcatgct gctgacctcc     4920

gacgcccccg agtacaagcc ctgggccctg gtcatccagg actccaacgg cgagaacaag     4980

atcaagatgc tgtccggcgg ctcccccaag aagaagcgca aggtctaa                  5028


<210>  62
<211>  1675
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Mad7d-AID-UGI_protein

<400>  62

Met Asn Asn Gly Thr Asn Asn Phe Gln Asn Phe Ile Gly Ile Ser Ser 
1               5                   10                  15      


Leu Gln Lys Thr Leu Arg Asn Ala Leu Ile Pro Thr Glu Thr Thr Gln 
            20                  25                  30          


Gln Phe Ile Val Lys Asn Gly Ile Ile Lys Glu Asp Glu Leu Arg Gly 
        35                  40                  45              


Glu Asn Arg Gln Ile Leu Lys Asp Ile Met Asp Asp Tyr Tyr Arg Gly 
    50                  55                  60                  


Phe Ile Ser Glu Thr Leu Ser Ser Ile Asp Asp Ile Asp Trp Thr Ser 
65                  70                  75                  80  


Leu Phe Glu Lys Met Glu Ile Gln Leu Lys Asn Gly Asp Asn Lys Asp 
                85                  90                  95      


Thr Leu Ile Lys Glu Gln Thr Glu Tyr Arg Lys Ala Ile His Lys Lys 
            100                 105                 110         


Phe Ala Asn Asp Asp Arg Phe Lys Asn Met Phe Ser Ala Lys Leu Ile 
        115                 120                 125             


Ser Asp Ile Leu Pro Glu Phe Val Ile His Asn Asn Asn Tyr Ser Ala 
    130                 135                 140                 


Ser Glu Lys Glu Glu Lys Thr Gln Val Ile Lys Leu Phe Ser Arg Phe 
145                 150                 155                 160 


Ala Thr Ser Phe Lys Asp Tyr Phe Lys Asn Arg Ala Asn Cys Phe Ser 
                165                 170                 175     


Ala Asp Asp Ile Ser Ser Ser Ser Cys His Arg Ile Val Asn Asp Asn 
            180                 185                 190         


Ala Glu Ile Phe Phe Ser Asn Ala Leu Val Tyr Arg Arg Ile Val Lys 
        195                 200                 205             


Ser Leu Ser Asn Asp Asp Ile Asn Lys Ile Ser Gly Asp Met Lys Asp 
    210                 215                 220                 


Ser Leu Lys Glu Met Ser Leu Glu Glu Ile Tyr Ser Tyr Glu Lys Tyr 
225                 230                 235                 240 


Gly Glu Phe Ile Thr Gln Glu Gly Ile Ser Phe Tyr Asn Asp Ile Cys 
                245                 250                 255     


Gly Lys Val Asn Ser Phe Met Asn Leu Tyr Cys Gln Lys Asn Lys Glu 
            260                 265                 270         


Asn Lys Asn Leu Tyr Lys Leu Gln Lys Leu His Lys Gln Ile Leu Cys 
        275                 280                 285             


Ile Ala Asp Thr Ser Tyr Glu Val Pro Tyr Lys Phe Glu Ser Asp Glu 
    290                 295                 300                 


Glu Val Tyr Gln Ser Val Asn Gly Phe Leu Asp Asn Ile Ser Ser Lys 
305                 310                 315                 320 


His Ile Val Glu Arg Leu Arg Lys Ile Gly Asp Asn Tyr Asn Gly Tyr 
                325                 330                 335     


Asn Leu Asp Lys Ile Tyr Ile Val Ser Lys Phe Tyr Glu Ser Val Ser 
            340                 345                 350         


Gln Lys Thr Tyr Arg Asp Trp Glu Thr Ile Asn Thr Ala Leu Glu Ile 
        355                 360                 365             


His Tyr Asn Asn Ile Leu Pro Gly Asn Gly Lys Ser Lys Ala Asp Lys 
    370                 375                 380                 


Val Lys Lys Ala Val Lys Asn Asp Leu Gln Lys Ser Ile Thr Glu Ile 
385                 390                 395                 400 


Asn Glu Leu Val Ser Asn Tyr Lys Leu Cys Ser Asp Asp Asn Ile Lys 
                405                 410                 415     


Ala Glu Thr Tyr Ile His Glu Ile Ser His Ile Leu Asn Asn Phe Glu 
            420                 425                 430         


Ala Gln Glu Leu Lys Tyr Asn Pro Glu Ile His Leu Val Glu Ser Glu 
        435                 440                 445             


Leu Lys Ala Ser Glu Leu Lys Asn Val Leu Asp Val Ile Met Asn Ala 
    450                 455                 460                 


Phe His Trp Cys Ser Val Phe Met Thr Glu Glu Leu Val Asp Lys Asp 
465                 470                 475                 480 


Asn Asn Phe Tyr Ala Glu Leu Glu Glu Ile Tyr Asp Glu Ile Tyr Pro 
                485                 490                 495     


Val Ile Ser Leu Tyr Asn Leu Val Arg Asn Tyr Val Thr Gln Lys Pro 
            500                 505                 510         


Tyr Ser Thr Lys Lys Ile Lys Leu Asn Phe Gly Ile Pro Thr Leu Ala 
        515                 520                 525             


Asp Gly Trp Ser Lys Ser Lys Glu Tyr Ser Asn Asn Ala Ile Ile Leu 
    530                 535                 540                 


Met Arg Asp Asn Leu Tyr Tyr Leu Gly Ile Phe Asn Ala Lys Asn Lys 
545                 550                 555                 560 


Pro Asp Lys Lys Ile Ile Glu Gly Asn Thr Ser Glu Asn Lys Gly Asp 
                565                 570                 575     


Tyr Lys Lys Met Ile Tyr Asn Leu Leu Pro Gly Pro Asn Lys Met Ile 
            580                 585                 590         


Pro Lys Val Phe Leu Ser Ser Lys Thr Gly Val Glu Thr Tyr Lys Pro 
        595                 600                 605             


Ser Ala Tyr Ile Leu Glu Gly Tyr Lys Gln Asn Lys His Ile Lys Ser 
    610                 615                 620                 


Ser Lys Asp Phe Asp Ile Thr Phe Cys His Asp Leu Ile Asp Tyr Phe 
625                 630                 635                 640 


Lys Asn Cys Ile Ala Ile His Pro Glu Trp Lys Asn Phe Gly Phe Asp 
                645                 650                 655     


Phe Ser Asp Thr Ser Thr Tyr Glu Asp Ile Ser Gly Phe Tyr Arg Glu 
            660                 665                 670         


Val Glu Leu Gln Gly Tyr Lys Ile Asp Trp Thr Tyr Ile Ser Glu Lys 
        675                 680                 685             


Asp Ile Asp Leu Leu Gln Glu Lys Gly Gln Leu Tyr Leu Phe Gln Ile 
    690                 695                 700                 


Tyr Asn Lys Asp Phe Ser Lys Lys Ser Thr Gly Asn Asp Asn Leu His 
705                 710                 715                 720 


Thr Met Tyr Leu Lys Asn Leu Phe Ser Glu Glu Asn Leu Lys Asp Ile 
                725                 730                 735     


Val Leu Lys Leu Asn Gly Glu Ala Glu Ile Phe Phe Arg Lys Ser Ser 
            740                 745                 750         


Ile Lys Asn Pro Ile Ile His Lys Lys Gly Ser Ile Leu Val Asn Arg 
        755                 760                 765             


Thr Tyr Glu Ala Glu Glu Lys Asp Gln Phe Gly Asn Ile Gln Ile Val 
    770                 775                 780                 


Arg Lys Asn Ile Pro Glu Asn Ile Tyr Gln Glu Leu Tyr Lys Tyr Phe 
785                 790                 795                 800 


Asn Asp Lys Ser Asp Lys Glu Leu Ser Asp Glu Ala Ala Lys Leu Lys 
                805                 810                 815     


Asn Val Val Gly His His Glu Ala Ala Thr Asn Ile Val Lys Asp Tyr 
            820                 825                 830         


Arg Tyr Thr Tyr Asp Lys Tyr Phe Leu His Met Pro Ile Thr Ile Asn 
        835                 840                 845             


Phe Lys Ala Asn Lys Thr Gly Phe Ile Asn Asp Arg Ile Leu Gln Tyr 
    850                 855                 860                 


Ile Ala Lys Glu Lys Asp Leu His Val Ile Gly Ile Ala Arg Gly Glu 
865                 870                 875                 880 


Arg Asn Leu Ile Tyr Val Ser Val Ile Asp Thr Cys Gly Asn Ile Val 
                885                 890                 895     


Glu Gln Lys Ser Phe Asn Ile Val Asn Gly Tyr Asp Tyr Gln Ile Lys 
            900                 905                 910         


Leu Lys Gln Gln Glu Gly Ala Arg Gln Ile Ala Arg Lys Glu Trp Lys 
        915                 920                 925             


Glu Ile Gly Lys Ile Lys Glu Ile Lys Glu Gly Tyr Leu Ser Leu Val 
    930                 935                 940                 


Ile His Glu Ile Ser Lys Met Val Ile Lys Tyr Asn Ala Ile Ile Ala 
945                 950                 955                 960 


Met Glu Asp Leu Ser Tyr Gly Phe Lys Lys Gly Arg Phe Lys Val Glu 
                965                 970                 975     


Arg Gln Val Tyr Gln Lys Phe Glu Thr Met Leu Ile Asn Lys Leu Asn 
            980                 985                 990         


Tyr Leu Val Phe Lys Asp Ile Ser  Ile Thr Glu Asn Gly  Gly Leu Leu 
        995                 1000                 1005             


Lys Gly  Tyr Gln Leu Thr Tyr  Ile Pro Asp Lys Leu  Lys Asn Val 
    1010                 1015                 1020             


Gly His  Gln Cys Gly Cys Ile  Phe Tyr Val Pro Ala  Ala Tyr Thr 
    1025                 1030                 1035             


Ser Lys  Ile Asp Pro Thr Thr  Gly Phe Val Asn Ile  Phe Lys Phe 
    1040                 1045                 1050             


Lys Asp  Leu Thr Val Asp Ala  Lys Arg Glu Phe Ile  Lys Lys Phe 
    1055                 1060                 1065             


Asp Ser  Ile Arg Tyr Asp Ser  Glu Lys Asn Leu Phe  Cys Phe Thr 
    1070                 1075                 1080             


Phe Asp  Tyr Asn Asn Phe Ile  Thr Gln Asn Thr Val  Met Ser Lys 
    1085                 1090                 1095             


Ser Ser  Trp Ser Val Tyr Thr  Tyr Gly Val Arg Ile  Lys Arg Arg 
    1100                 1105                 1110             


Phe Val  Asn Gly Arg Phe Ser  Asn Glu Ser Asp Thr  Ile Asp Ile 
    1115                 1120                 1125             


Thr Lys  Asp Met Glu Lys Thr  Leu Glu Met Thr Asp  Ile Asn Trp 
    1130                 1135                 1140             


Arg Asp  Gly His Asp Leu Arg  Gln Asp Ile Ile Asp  Tyr Glu Ile 
    1145                 1150                 1155             


Val Gln  His Ile Phe Glu Ile  Phe Arg Leu Thr Val  Gln Met Arg 
    1160                 1165                 1170             


Asn Ser  Leu Ser Glu Leu Glu  Asp Arg Asp Tyr Asp  Arg Leu Ile 
    1175                 1180                 1185             


Ser Pro  Val Leu Asn Glu Asn  Asn Ile Phe Tyr Asp  Ser Ala Lys 
    1190                 1195                 1200             


Ala Gly  Asp Ala Leu Pro Lys  Asp Ala Asp Ala Asn  Gly Ala Tyr 
    1205                 1210                 1215             


Cys Ile  Ala Leu Lys Gly Leu  Tyr Glu Ile Lys Gln  Ile Thr Glu 
    1220                 1225                 1230             


Asn Trp  Lys Glu Asp Gly Lys  Phe Ser Arg Asp Lys  Leu Lys Ile 
    1235                 1240                 1245             


Ser Asn  Lys Asp Trp Phe Asp  Phe Ile Gln Asn Lys  Arg Tyr Leu 
    1250                 1255                 1260             


Pro Ser  Arg Ala Asp Pro Lys  Lys Lys Arg Lys Val  Gly Gly Gly 
    1265                 1270                 1275             


Gly Ser  Gly Gly Gly Gly Ser  Ala Glu Tyr Val Arg  Ala Leu Phe 
    1280                 1285                 1290             


Asp Phe  Asn Gly Asn Asp Glu  Glu Asp Leu Pro Phe  Lys Lys Gly 
    1295                 1300                 1305             


Asp Ile  Leu Arg Ile Arg Asp  Lys Pro Glu Glu Gln  Trp Trp Asn 
    1310                 1315                 1320             


Ala Glu  Asp Ser Glu Gly Lys  Arg Gly Met Ile Pro  Val Pro Tyr 
    1325                 1330                 1335             


Val Glu  Lys Tyr Ser Gly Asp  Tyr Lys Asp His Asp  Gly Asp Tyr 
    1340                 1345                 1350             


Lys Asp  His Asp Ile Asp Tyr  Lys Asp Asp Asp Asp  Lys Ser Arg 
    1355                 1360                 1365             


Met Thr  Asp Ala Glu Tyr Val  Arg Ile His Glu Lys  Leu Asp Ile 
    1370                 1375                 1380             


Tyr Thr  Phe Lys Lys Gln Phe  Phe Asn Asn Lys Lys  Ser Val Ser 
    1385                 1390                 1395             


His Arg  Cys Tyr Val Leu Phe  Glu Leu Lys Arg Arg  Gly Glu Arg 
    1400                 1405                 1410             


Arg Ala  Cys Phe Trp Gly Tyr  Ala Val Asn Lys Pro  Gln Ser Gly 
    1415                 1420                 1425             


Thr Glu  Arg Gly Ile His Ala  Glu Ile Phe Ser Ile  Arg Lys Val 
    1430                 1435                 1440             


Glu Glu  Tyr Leu Arg Asp Asn  Pro Gly Gln Phe Thr  Ile Asn Trp 
    1445                 1450                 1455             


Tyr Ser  Ser Trp Ser Pro Cys  Ala Asp Cys Ala Glu  Lys Ile Leu 
    1460                 1465                 1470             


Glu Trp  Tyr Asn Gln Glu Leu  Arg Gly Asn Gly His  Thr Leu Lys 
    1475                 1480                 1485             


Ile Trp  Ala Cys Lys Leu Tyr  Tyr Glu Lys Asn Ala  Arg Asn Gln 
    1490                 1495                 1500             


Ile Gly  Leu Trp Asn Leu Arg  Asp Asn Gly Val Gly  Leu Asn Val 
    1505                 1510                 1515             


Met Val  Ser Glu His Tyr Gln  Cys Cys Arg Lys Ile  Phe Ile Gln 
    1520                 1525                 1530             


Ser Ser  His Asn Gln Leu Asn  Glu Asn Arg Trp Leu  Glu Lys Thr 
    1535                 1540                 1545             


Leu Lys  Arg Ala Glu Lys Arg  Arg Ser Glu Leu Ser  Ile Met Ile 
    1550                 1555                 1560             


Gln Val  Lys Ile Leu His Thr  Thr Lys Ser Pro Ala  Val Ser Arg 
    1565                 1570                 1575             


Gly Ser  Gly Thr Asn Leu Ser  Asp Ile Ile Glu Lys  Glu Thr Gly 
    1580                 1585                 1590             


Lys Gln  Leu Val Ile Gln Glu  Ser Ile Leu Met Leu  Pro Glu Glu 
    1595                 1600                 1605             


Val Glu  Glu Val Ile Gly Asn  Lys Pro Glu Ser Asp  Ile Leu Val 
    1610                 1615                 1620             


His Thr  Ala Tyr Asp Glu Ser  Thr Asp Glu Asn Val  Met Leu Leu 
    1625                 1630                 1635             


Thr Ser  Asp Ala Pro Glu Tyr  Lys Pro Trp Ala Leu  Val Ile Gln 
    1640                 1645                 1650             


Asp Ser  Asn Gly Glu Asn Lys  Ile Lys Met Leu Ser  Gly Gly Ser 
    1655                 1660                 1665             


Pro Lys  Lys Lys Arg Lys Val  
    1670                 1675 


<210>  63
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA296 MdwA1

<400>  63
ttggagacca gaccagcgac a                                                 21


<210>  64
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA297 MdwA2

<400>  64
gagaccagac cagcgacatc g                                                 21


<210>  65
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA298 MdwA3

<400>  65
ttccagcaat gcttccatgc a                                                 21


<210>  66
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA299 MdwA4

<400>  66
cagcaatgct tccatgcaat t                                                 21


<210>  67
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA300 MdwA5

<400>  67
catgcaattc gtcaagagat c                                                 21


<210>  68
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA301 MdwA6

<400>  68
tcgagttcga gaactggtgg a                                                 21


<210>  69
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA302 MdwA7

<400>  69
cgatctcgag tagcgccact c                                                 21


<210>  70
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA303 MdwA8

<400>  70
atccctggcg gtaaccgagc a                                                 21


<210>  71
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA304 MdwA9

<400>  71
aggaaagcgg cgaactttcg t                                                 21


<210>  72
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA305 MdwA10

<400>  72
actgcatcgc aagaagccag g                                                 21


<210>  73
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA306 MdwA11

<400>  73
gccctgtcgt ggtacaagtg g                                                 21


<210>  74
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA307 MdwA12

<400>  74
tggccggcgt gctcagttgc t                                                 21


<210>  75
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA324 MdwA13

<400>  75
tacaggaggt tccagaagct c                                                 21


<210>  76
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA325 MdwA14

<400>  76
tccagatcga attgcaagcc a                                                 21


<210>  77
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA326 MdwA15

<400>  77
ctgaagctgg ccgattccag g                                                 21


<210>  78
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA327 MdwA16

<400>  78
acacccagag cagtccagta a                                                 21


<210>  79
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA328 MdwA17

<400>  79
gggccgagac actcccagtt a                                                 21


<210>  80
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA329 MdwA18

<400>  80
agatcccact tgtaggctgg g                                                 21


<210>  81
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protospacer pTNA330 MdwA19

<400>  81
ttctccgccc aatccgctgt c                                                 21


<210>  82
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 296-C9-sense

<400>  82
taatttctac tcttgtagat ttggagacca gaccagcgac atttttttgg ctcttgggtt       60

c                                                                       61


<210>  83
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 296-C9-anti

<400>  83
gaacccaaga gccaaaaaaa tgtcgctggt ctggtctcca aatctacaag agtagaaatt       60

a                                                                       61


<210>  84
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 297-C6-sense

<400>  84
taatttctac tcttgtagat gagaccagac cagcgacatc gtttttttgg ctcttgggtt       60

c                                                                       61


<210>  85
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 297-C6-anti

<400>  85
gaacccaaga gccaaaaaaa cgatgtcgct ggtctggtct catctacaag agtagaaatt       60

a                                                                       61


<210>  86
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 298-C4_7-sense

<400>  86
taatttctac tcttgtagat ttccagcaat gcttccatgc atttttttgg ctcttgggtt       60

c                                                                       61


<210>  87
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 298-C4_7-anti

<400>  87
gaacccaaga gccaaaaaaa tgcatggaag cattgctgga aatctacaag agtagaaatt       60

a                                                                       61


<210>  88
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 299-C1_4-sense

<400>  88
taatttctac tcttgtagat cagcaatgct tccatgcaat ttttttttgg ctcttgggtt       60

c                                                                       61


<210>  89
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 299-C1_4-anti

<400>  89
gaacccaaga gccaaaaaaa aattgcatgg aagcattgct gatctacaag agtagaaatt       60

a                                                                       61


<210>  90
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 300-C13-sense

<400>  90
taatttctac tcttgtagat catgcaattc gtcaagagat ctttttttgg ctcttgggtt       60

c                                                                       61


<210>  91
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 300-C13-anti

<400>  91
gaacccaaga gccaaaaaaa gatctcttga cgaattgcat gatctacaag agtagaaatt       60

a                                                                       61


<210>  92
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 301-C2_8-sense

<400>  92
taatttctac tcttgtagat tcgagttcga gaactggtgg atttttttgg ctcttgggtt       60

c                                                                       61


<210>  93
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 301-C2_8-anti

<400>  93
gaacccaaga gccaaaaaaa tccaccagtt ctcgaactcg aatctacaag agtagaaatt       60

a                                                                       61


<210>  94
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 302-C21-sense

<400>  94
taatttctac tcttgtagat cgatctcgag tagcgccact ctttttttgg ctcttgggtt       60

c                                                                       61


<210>  95
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 302-C21-anti

<400>  95
gaacccaaga gccaaaaaaa gagtggcgct actcgagatc gatctacaag agtagaaatt       60

a                                                                       61


<210>  96
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 303-C16-sense

<400>  96
taatttctac tcttgtagat atccctggcg gtaaccgagc atttttttgg ctcttgggtt       60

c                                                                       61


<210>  97
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 303-C16-anti

<400>  97
gaacccaaga gccaaaaaaa tgctcggtta ccgccaggga tatctacaag agtagaaatt       60

a                                                                       61


<210>  98
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 304-C11-sense

<400>  98
taatttctac tcttgtagat aggaaagcgg cgaactttcg ttttttttgg ctcttgggtt       60

c                                                                       61


<210>  99
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 304-C11-anti

<400>  99
gaacccaaga gccaaaaaaa acgaaagttc gccgctttcc tatctacaag agtagaaatt       60

a                                                                       61


<210>  100
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 305-C18-sense

<400>  100
taatttctac tcttgtagat actgcatcgc aagaagccag gtttttttgg ctcttgggtt       60

c                                                                       61


<210>  101
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 305-C18-anti

<400>  101
gaacccaaga gccaaaaaaa cctggcttct tgcgatgcag tatctacaag agtagaaatt       60

a                                                                       61


<210>  102
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 306-C15-sense

<400>  102
taatttctac tcttgtagat gccctgtcgt ggtacaagtg gtttttttgg ctcttgggtt       60

c                                                                       61


<210>  103
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 306-C15-anti

<400>  103
gaacccaaga gccaaaaaaa ccacttgtac cacgacaggg catctacaag agtagaaatt       60

a                                                                       61


<210>  104
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 307-C14-sense

<400>  104
taatttctac tcttgtagat tggccggcgt gctcagttgc ttttttttgg ctcttgggtt       60

c                                                                       61


<210>  105
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 307-C14-anti

<400>  105
gaacccaaga gccaaaaaaa agcaactgag cacgccggcc aatctacaag agtagaaatt       60

a                                                                       61


<210>  106
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 324-C12,13-sense

<400>  106
taatttctac tcttgtagat tacaggaggt tccagaagct ctttttttgg ctcttgggtt       60

c                                                                       61


<210>  107
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 324-C12,13-anti

<400>  107
gaacccaaga gccaaaaaaa gagcttctgg aacctcctgt aatctacaag agtagaaatt       60

a                                                                       61


<210>  108
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 325-C2,3-sense

<400>  108
taatttctac tcttgtagat tccagatcga attgcaagcc atttttttgg ctcttgggtt       60

c                                                                       61


<210>  109
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 325-C2,3-anti

<400>  109
gaacccaaga gccaaaaaaa tggcttgcaa ttcgatctgg aatctacaag agtagaaatt       60

a                                                                       61


<210>  110
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 326-C17,18-sense

<400>  110
taatttctac tcttgtagat ctgaagctgg ccgattccag gtttttttgg ctcttgggtt       60

c                                                                       61


<210>  111
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 326-C17,18-anti

<400>  111
gaacccaaga gccaaaaaaa cctggaatcg gccagcttca gatctacaag agtagaaatt       60

a                                                                       61


<210>  112
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 327-C15,16-sense

<400>  112
taatttctac tcttgtagat acacccagag cagtccagta atttttttgg ctcttgggtt       60

c                                                                       61


<210>  113
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 327-C15,16-anti

<400>  113
gaacccaaga gccaaaaaaa ttactggact gctctgggtg tatctacaag agtagaaatt       60

a                                                                       61


<210>  114
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 328-C15,16-2-sense

<400>  114
taatttctac tcttgtagat gggccgagac actcccagtt atttttttgg ctcttgggtt       60

c                                                                       61


<210>  115
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 328-C15,16-2-anti

<400>  115
gaacccaaga gccaaaaaaa taactgggag tgtctcggcc catctacaag agtagaaatt       60

a                                                                       61


<210>  116
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 329-C6,7-sense

<400>  116
taatttctac tcttgtagat agatcccact tgtaggctgg gtttttttgg ctcttgggtt       60

c                                                                       61


<210>  117
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 329-C6,7-anti

<400>  117
gaacccaaga gccaaaaaaa cccagcctac aagtgggatc tatctacaag agtagaaatt       60

a                                                                       61


<210>  118
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 330-C9,10-sense

<400>  118
taatttctac tcttgtagat ttctccgccc aatccgctgt ctttttttgg ctcttgggtt       60

c                                                                       61


<210>  119
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 330-C9,10-anti

<400>  119
gaacccaaga gccaaaaaaa gacagcggat tgggcggaga aatctacaag agtagaaatt       60

a                                                                       61


<210>  120
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer pks_seq_f5

<400>  120
ttcttcaaca tgtcgcctcg g                                                 21


<210>  121
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer pks_seq_r6

<400>  121
gtgttacagt tgccagtgg                                                    19


<210>  122
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer pks_seq_f4

<400>  122
ggtacttgat gaattcgtcg                                                   20


<210>  123
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer MS-test-wA3

<400>  123
tgaattcaac tctttacaat cg                                                22


<210>  124
<211>  3792
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Mad7d coding sequence


<220>
<221>  CDS
<222>  (1)..(3792)

<400>  124
atg aac aac ggc aca aac aac ttc cag aac ttc att gga atc tcg tcg         48
Met Asn Asn Gly Thr Asn Asn Phe Gln Asn Phe Ile Gly Ile Ser Ser           
1               5                   10                  15                

ttg cag aag act ttg cgc aac gcc ctc atc ccc aca gaa act acc cag         96
Leu Gln Lys Thr Leu Arg Asn Ala Leu Ile Pro Thr Glu Thr Thr Gln           
            20                  25                  30                    

cag ttc att gtg aag aac gga atc atc aag gaa gat gaa ctc cga ggc        144
Gln Phe Ile Val Lys Asn Gly Ile Ile Lys Glu Asp Glu Leu Arg Gly           
        35                  40                  45                        

gag aac cgc cag att ttg aag gac atc atg gat gat tac tac cgt ggt        192
Glu Asn Arg Gln Ile Leu Lys Asp Ile Met Asp Asp Tyr Tyr Arg Gly           
    50                  55                  60                            

ttc atc tcg gaa acg ctc tcc tcc att gac gac atc gat tgg act tcg        240
Phe Ile Ser Glu Thr Leu Ser Ser Ile Asp Asp Ile Asp Trp Thr Ser           
65                  70                  75                  80            

ttg ttc gaa aag atg gaa atc cag ctc aaa aac ggc gat aac aag gat        288
Leu Phe Glu Lys Met Glu Ile Gln Leu Lys Asn Gly Asp Asn Lys Asp           
                85                  90                  95                

acc ttg atc aag gag cag acc gag tat cgg aag gcg atc cat aag aag        336
Thr Leu Ile Lys Glu Gln Thr Glu Tyr Arg Lys Ala Ile His Lys Lys           
            100                 105                 110                   

ttc gcc aac gat gat cgg ttc aag aac atg ttc tcg gcc aag ttg att        384
Phe Ala Asn Asp Asp Arg Phe Lys Asn Met Phe Ser Ala Lys Leu Ile           
        115                 120                 125                       

tcc gac att ctc ccc gaa ttc gtg atc cat aac aac aac tac tcg gcg        432
Ser Asp Ile Leu Pro Glu Phe Val Ile His Asn Asn Asn Tyr Ser Ala           
    130                 135                 140                           

tcg gag aag gag gag aag acg cag gtc atc aag ttg ttc tcg agg ttc        480
Ser Glu Lys Glu Glu Lys Thr Gln Val Ile Lys Leu Phe Ser Arg Phe           
145                 150                 155                 160           

gcc aca tcg ttc aaa gag tat ttt aag aat cgt gcg aac tgt ttc tcg        528
Ala Thr Ser Phe Lys Glu Tyr Phe Lys Asn Arg Ala Asn Cys Phe Ser           
                165                 170                 175               

gca gat gat atc tcc tcg tcc tcc tgt cac cgc att gtg aac gac aac        576
Ala Asp Asp Ile Ser Ser Ser Ser Cys His Arg Ile Val Asn Asp Asn           
            180                 185                 190                   

gcg gaa atc ttc ttc tcg aac gcg ttg gtg tat agg cgc atc gtg aag        624
Ala Glu Ile Phe Phe Ser Asn Ala Leu Val Tyr Arg Arg Ile Val Lys           
        195                 200                 205                       

tcc ctc tcc aac gat gac atc aac aaa atc tcg gga gat atg aag gat        672
Ser Leu Ser Asn Asp Asp Ile Asn Lys Ile Ser Gly Asp Met Lys Asp           
    210                 215                 220                           

tcg ctc aag gag atg tcg ttg gag gaa atc tac tcc tat gag aag tat        720
Ser Leu Lys Glu Met Ser Leu Glu Glu Ile Tyr Ser Tyr Glu Lys Tyr           
225                 230                 235                 240           

ggc gag ttc att acg cag gag ggc att tcc ttc tac aac gac att tgt        768
Gly Glu Phe Ile Thr Gln Glu Gly Ile Ser Phe Tyr Asn Asp Ile Cys           
                245                 250                 255               

ggt aaa gtc aac tcg ttc atg aac ctc tac tgt cag aaa aac aag gag        816
Gly Lys Val Asn Ser Phe Met Asn Leu Tyr Cys Gln Lys Asn Lys Glu           
            260                 265                 270                   

aac aaa aac ctc tat aag ctc cag aag ttg cat aag cag atc ctc tgt        864
Asn Lys Asn Leu Tyr Lys Leu Gln Lys Leu His Lys Gln Ile Leu Cys           
        275                 280                 285                       

atc gca gac acc tcg tac gag gtc cct tac aag ttc gaa tcc gat gag        912
Ile Ala Asp Thr Ser Tyr Glu Val Pro Tyr Lys Phe Glu Ser Asp Glu           
    290                 295                 300                           

gag gtc tac cag tcc gtc aac gga ttc ttg gac aac atc tcc tcg aaa        960
Glu Val Tyr Gln Ser Val Asn Gly Phe Leu Asp Asn Ile Ser Ser Lys           
305                 310                 315                 320           

cac att gtc gag cgg ctc cga aag atc ggc gat aac tac aac ggc tac       1008
His Ile Val Glu Arg Leu Arg Lys Ile Gly Asp Asn Tyr Asn Gly Tyr           
                325                 330                 335               

aac ttg gac aaa atc tat atc gtc tcc aag ttc tat gag tcc gtc tcg       1056
Asn Leu Asp Lys Ile Tyr Ile Val Ser Lys Phe Tyr Glu Ser Val Ser           
            340                 345                 350                   

cag aaa acc tat cgt gat tgg gag act atc aac act gcg ctc gag att       1104
Gln Lys Thr Tyr Arg Asp Trp Glu Thr Ile Asn Thr Ala Leu Glu Ile           
        355                 360                 365                       

cac tat aac aac atc ttg cct ggt aac ggc aaa tcg aaa gcc gac aag       1152
His Tyr Asn Asn Ile Leu Pro Gly Asn Gly Lys Ser Lys Ala Asp Lys           
    370                 375                 380                           

gtg aag aag gcc gtg aaa aac gat ctc cag aag tcg atc aca gaa atc       1200
Val Lys Lys Ala Val Lys Asn Asp Leu Gln Lys Ser Ile Thr Glu Ile           
385                 390                 395                 400           

aac gaa ctc gtc tcg aac tac aag ctc tgt tcg gat gat aac atc aag       1248
Asn Glu Leu Val Ser Asn Tyr Lys Leu Cys Ser Asp Asp Asn Ile Lys           
                405                 410                 415               

gcg gaa acg tac atc cat gaa atc tcg cat atc ttg aac aac ttc gag       1296
Ala Glu Thr Tyr Ile His Glu Ile Ser His Ile Leu Asn Asn Phe Glu           
            420                 425                 430                   

gcc cag gaa ctc aaa tac aac ccc gag atc cac ttg gtc gag tcg gag       1344
Ala Gln Glu Leu Lys Tyr Asn Pro Glu Ile His Leu Val Glu Ser Glu           
        435                 440                 445                       

ctc aaa gcc tcg gag ttg aag aac gtc ttg gat gtc atc atg aac gca       1392
Leu Lys Ala Ser Glu Leu Lys Asn Val Leu Asp Val Ile Met Asn Ala           
    450                 455                 460                           

ttc cac tgg tgt tcc gtg ttc atg acc gag gaa ctc gtc gat aaa gac       1440
Phe His Trp Cys Ser Val Phe Met Thr Glu Glu Leu Val Asp Lys Asp           
465                 470                 475                 480           

aac aac ttc tac gcg gaa ctc gag gaa atc tac gat gaa atc tat ccc       1488
Asn Asn Phe Tyr Ala Glu Leu Glu Glu Ile Tyr Asp Glu Ile Tyr Pro           
                485                 490                 495               

gtg atc tcc ctc tac aac ctc gtg cga aac tac gtc act cag aag ccc       1536
Val Ile Ser Leu Tyr Asn Leu Val Arg Asn Tyr Val Thr Gln Lys Pro           
            500                 505                 510                   

tat tcc acc aag aag atc aag ctc aac ttc ggc atc ccc act ctc gca       1584
Tyr Ser Thr Lys Lys Ile Lys Leu Asn Phe Gly Ile Pro Thr Leu Ala           
        515                 520                 525                       

gac ggt tgg tcg aag tcg aag gag tac tcc aac aac gcc att atc ctc       1632
Asp Gly Trp Ser Lys Ser Lys Glu Tyr Ser Asn Asn Ala Ile Ile Leu           
    530                 535                 540                           

atg cga gac aac ctc tac tac ttg ggt atc ttc aac gca aag aac aag       1680
Met Arg Asp Asn Leu Tyr Tyr Leu Gly Ile Phe Asn Ala Lys Asn Lys           
545                 550                 555                 560           

ccg gat aag aag atc att gaa ggc aac act tcg gaa aac aag gga gac       1728
Pro Asp Lys Lys Ile Ile Glu Gly Asn Thr Ser Glu Asn Lys Gly Asp           
                565                 570                 575               

tat aag aag atg atc tac aac ctc ctc cct gga ccc aac aag atg att       1776
Tyr Lys Lys Met Ile Tyr Asn Leu Leu Pro Gly Pro Asn Lys Met Ile           
            580                 585                 590                   

cct aaa gtg ttc ctc tcg tcg aag act ggt gtg gaa acg tat aag ccg       1824
Pro Lys Val Phe Leu Ser Ser Lys Thr Gly Val Glu Thr Tyr Lys Pro           
        595                 600                 605                       

tcg gcc tac atc ttg gag ggc tac aaa cag aac aag cat atc aag tcc       1872
Ser Ala Tyr Ile Leu Glu Gly Tyr Lys Gln Asn Lys His Ile Lys Ser           
    610                 615                 620                           

tcg aag gac ttc gac atc act ttc tgt cac gac ctc atc gac tat ttc       1920
Ser Lys Asp Phe Asp Ile Thr Phe Cys His Asp Leu Ile Asp Tyr Phe           
625                 630                 635                 640           

aag aac tgt att gca atc cat ccg gaa tgg aag aac ttc ggc ttc gat       1968
Lys Asn Cys Ile Ala Ile His Pro Glu Trp Lys Asn Phe Gly Phe Asp           
                645                 650                 655               

ttc tcg gat act tcg aca tac gaa gat atc tcg gga ttc tac cga gag       2016
Phe Ser Asp Thr Ser Thr Tyr Glu Asp Ile Ser Gly Phe Tyr Arg Glu           
            660                 665                 670                   

gtc gaa ttg cag ggc tat aag att gat tgg acc tac atc tcg gaa aag       2064
Val Glu Leu Gln Gly Tyr Lys Ile Asp Trp Thr Tyr Ile Ser Glu Lys           
        675                 680                 685                       

gat atc gac ttg ctc cag gaa aag ggc cag ctc tac ctc ttc cag att       2112
Asp Ile Asp Leu Leu Gln Glu Lys Gly Gln Leu Tyr Leu Phe Gln Ile           
    690                 695                 700                           

tac aac aag gac ttc tcc aag aag tcg acg ggt aac gac aac ttg cac       2160
Tyr Asn Lys Asp Phe Ser Lys Lys Ser Thr Gly Asn Asp Asn Leu His           
705                 710                 715                 720           

aca atg tat ctc aaa aac ctc ttc tcg gag gag aac ttg aag gat atc       2208
Thr Met Tyr Leu Lys Asn Leu Phe Ser Glu Glu Asn Leu Lys Asp Ile           
                725                 730                 735               

gtg ctc aaa ttg aac gga gag gcc gaa atc ttc ttc cgt aag tcc tcc       2256
Val Leu Lys Leu Asn Gly Glu Ala Glu Ile Phe Phe Arg Lys Ser Ser           
            740                 745                 750                   

atc aag aac ccg atc atc cat aag aag gga tcg atc ttg gtc aac cgg       2304
Ile Lys Asn Pro Ile Ile His Lys Lys Gly Ser Ile Leu Val Asn Arg           
        755                 760                 765                       

act tac gaa gca gag gaa aaa gat cag ttc gga aac atc cag att gtc       2352
Thr Tyr Glu Ala Glu Glu Lys Asp Gln Phe Gly Asn Ile Gln Ile Val           
    770                 775                 780                           

agg aag aac atc cct gaa aac atc tat cag gag ttg tat aag tac ttc       2400
Arg Lys Asn Ile Pro Glu Asn Ile Tyr Gln Glu Leu Tyr Lys Tyr Phe           
785                 790                 795                 800           

aac gac aag tcg gat aag gag ctc tcc gac gaa gca gcc aaa ctc aag       2448
Asn Asp Lys Ser Asp Lys Glu Leu Ser Asp Glu Ala Ala Lys Leu Lys           
                805                 810                 815               

aac gtc gtc gga cac cat gaa gca gca acc aac att gtg aag gac tac       2496
Asn Val Val Gly His His Glu Ala Ala Thr Asn Ile Val Lys Asp Tyr           
            820                 825                 830                   

cgg tac act tac gac aag tac ttc ttg cac atg ccg atc act atc aac       2544
Arg Tyr Thr Tyr Asp Lys Tyr Phe Leu His Met Pro Ile Thr Ile Asn           
        835                 840                 845                       

ttc aaa gcc aac aag acc gga ttc att aac gac agg atc ctc cag tac       2592
Phe Lys Ala Asn Lys Thr Gly Phe Ile Asn Asp Arg Ile Leu Gln Tyr           
    850                 855                 860                           

att gcc aaa gaa aag gac ctc cat gtc atc ggt atc gcg agg gga gaa       2640
Ile Ala Lys Glu Lys Asp Leu His Val Ile Gly Ile Ala Arg Gly Glu           
865                 870                 875                 880           

cgg aac ctc atc tac gtc tcc gtg att gac act tgt ggc aac att gtc       2688
Arg Asn Leu Ile Tyr Val Ser Val Ile Asp Thr Cys Gly Asn Ile Val           
                885                 890                 895               

gaa cag aag tcg ttc aac atc gtc aac ggt tac gat tac cag att aag       2736
Glu Gln Lys Ser Phe Asn Ile Val Asn Gly Tyr Asp Tyr Gln Ile Lys           
            900                 905                 910                   

ttg aaa cag cag gaa ggt gcg agg cag att gcg cga aag gaa tgg aag       2784
Leu Lys Gln Gln Glu Gly Ala Arg Gln Ile Ala Arg Lys Glu Trp Lys           
        915                 920                 925                       

gag att ggc aaa atc aag gag att aag gaa ggc tac ttg tcg ttg gtc       2832
Glu Ile Gly Lys Ile Lys Glu Ile Lys Glu Gly Tyr Leu Ser Leu Val           
    930                 935                 940                           

atc cac gaa atc tcg aaa atg gtg atc aaa tac aac gcc atc atc gcc       2880
Ile His Glu Ile Ser Lys Met Val Ile Lys Tyr Asn Ala Ile Ile Ala           
945                 950                 955                 960           

atg gaa gac ctc tcg tac ggc ttc aaa aag gga cgg ttc aaa gtg gag       2928
Met Glu Asp Leu Ser Tyr Gly Phe Lys Lys Gly Arg Phe Lys Val Glu           
                965                 970                 975               

cgt cag gtg tac cag aag ttc gaa aca atg ttg atc aac aag ttg aac       2976
Arg Gln Val Tyr Gln Lys Phe Glu Thr Met Leu Ile Asn Lys Leu Asn           
            980                 985                 990                   

tac ttg gtg ttc aag gac att tcc  att acc gag aac gga  gga ttg ctc     3024
Tyr Leu Val Phe Lys Asp Ile Ser  Ile Thr Glu Asn Gly  Gly Leu Leu         
        995                 1000                 1005                     

aag ggt  tat cag ctc acg tac  atc ccc gac aag ttg  aaa aac gtg        3069
Lys Gly  Tyr Gln Leu Thr Tyr  Ile Pro Asp Lys Leu  Lys Asn Val            
    1010                 1015                 1020                        

gga cac  cag tgt ggc tgt atc  ttc tac gtg cct gca  gcc tac acg        3114
Gly His  Gln Cys Gly Cys Ile  Phe Tyr Val Pro Ala  Ala Tyr Thr            
    1025                 1030                 1035                        

tcg aaa  atc gac cct aca aca  gga ttc gtg aac atc  ttc aag ttc        3159
Ser Lys  Ile Asp Pro Thr Thr  Gly Phe Val Asn Ile  Phe Lys Phe            
    1040                 1045                 1050                        

aag gat  ctc acc gtc gac gcg  aag cgg gag ttc atc  aaa aag ttc        3204
Lys Asp  Leu Thr Val Asp Ala  Lys Arg Glu Phe Ile  Lys Lys Phe            
    1055                 1060                 1065                        

gac tcc  atc cgc tat gat tcg  gag aag aac ttg ttc  tgt ttc aca        3249
Asp Ser  Ile Arg Tyr Asp Ser  Glu Lys Asn Leu Phe  Cys Phe Thr            
    1070                 1075                 1080                        

ttc gac  tac aac aac ttc att  act cag aac acc gtg  atg tcc aaa        3294
Phe Asp  Tyr Asn Asn Phe Ile  Thr Gln Asn Thr Val  Met Ser Lys            
    1085                 1090                 1095                        

tcg tcg  tgg tcc gtg tac acg  tat ggt gtg cgc atc  aaa agg cgc        3339
Ser Ser  Trp Ser Val Tyr Thr  Tyr Gly Val Arg Ile  Lys Arg Arg            
    1100                 1105                 1110                        

ttc gtc  aac ggt cgc ttc tcc  aac gaa tcg gac acg  atc gat atc        3384
Phe Val  Asn Gly Arg Phe Ser  Asn Glu Ser Asp Thr  Ile Asp Ile            
    1115                 1120                 1125                        

acg aaa  gac atg gag aaa aca  ttg gaa atg acc gac  atc aac tgg        3429
Thr Lys  Asp Met Glu Lys Thr  Leu Glu Met Thr Asp  Ile Asn Trp            
    1130                 1135                 1140                        

cgt gac  ggc cat gac ctc agg  cag gac atc atc gat  tac gag atc        3474
Arg Asp  Gly His Asp Leu Arg  Gln Asp Ile Ile Asp  Tyr Glu Ile            
    1145                 1150                 1155                        

gtc cag  cac atc ttc gaa atc  ttc cgt ctc acc gtg  cag atg agg        3519
Val Gln  His Ile Phe Glu Ile  Phe Arg Leu Thr Val  Gln Met Arg            
    1160                 1165                 1170                        

aac tcc  ctc tcc gag ctc gaa  gat cgg gat tac gac  cgg ctc att        3564
Asn Ser  Leu Ser Glu Leu Glu  Asp Arg Asp Tyr Asp  Arg Leu Ile            
    1175                 1180                 1185                        

tcc cct  gtg ttg aac gag aac  aac atc ttc tac gac  tcg gca aaa        3609
Ser Pro  Val Leu Asn Glu Asn  Asn Ile Phe Tyr Asp  Ser Ala Lys            
    1190                 1195                 1200                        

gcg gga  gat gca ttg ccg aag  gac gcc gat gcg aac  ggt gca tat        3654
Ala Gly  Asp Ala Leu Pro Lys  Asp Ala Asp Ala Asn  Gly Ala Tyr            
    1205                 1210                 1215                        

tgt att  gca ctc aag ggt ctc  tac gaa atc aag cag  atc acc gaa        3699
Cys Ile  Ala Leu Lys Gly Leu  Tyr Glu Ile Lys Gln  Ile Thr Glu            
    1220                 1225                 1230                        

aac tgg  aag gag gac ggc aaa  ttc tcg agg gac aag  ttg aag att        3744
Asn Trp  Lys Glu Asp Gly Lys  Phe Ser Arg Asp Lys  Leu Lys Ile            
    1235                 1240                 1245                        

tcg aac  aag gat tgg ttc gat  ttc atc cag aac aag  agg tac ttg        3789
Ser Asn  Lys Asp Trp Phe Asp  Phe Ile Gln Asn Lys  Arg Tyr Leu            
    1250                 1255                 1260                        

taa                                                                   3792


<210>  125
<211>  1263
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  125

Met Asn Asn Gly Thr Asn Asn Phe Gln Asn Phe Ile Gly Ile Ser Ser 
1               5                   10                  15      


Leu Gln Lys Thr Leu Arg Asn Ala Leu Ile Pro Thr Glu Thr Thr Gln 
            20                  25                  30          


Gln Phe Ile Val Lys Asn Gly Ile Ile Lys Glu Asp Glu Leu Arg Gly 
        35                  40                  45              


Glu Asn Arg Gln Ile Leu Lys Asp Ile Met Asp Asp Tyr Tyr Arg Gly 
    50                  55                  60                  


Phe Ile Ser Glu Thr Leu Ser Ser Ile Asp Asp Ile Asp Trp Thr Ser 
65                  70                  75                  80  


Leu Phe Glu Lys Met Glu Ile Gln Leu Lys Asn Gly Asp Asn Lys Asp 
                85                  90                  95      


Thr Leu Ile Lys Glu Gln Thr Glu Tyr Arg Lys Ala Ile His Lys Lys 
            100                 105                 110         


Phe Ala Asn Asp Asp Arg Phe Lys Asn Met Phe Ser Ala Lys Leu Ile 
        115                 120                 125             


Ser Asp Ile Leu Pro Glu Phe Val Ile His Asn Asn Asn Tyr Ser Ala 
    130                 135                 140                 


Ser Glu Lys Glu Glu Lys Thr Gln Val Ile Lys Leu Phe Ser Arg Phe 
145                 150                 155                 160 


Ala Thr Ser Phe Lys Glu Tyr Phe Lys Asn Arg Ala Asn Cys Phe Ser 
                165                 170                 175     


Ala Asp Asp Ile Ser Ser Ser Ser Cys His Arg Ile Val Asn Asp Asn 
            180                 185                 190         


Ala Glu Ile Phe Phe Ser Asn Ala Leu Val Tyr Arg Arg Ile Val Lys 
        195                 200                 205             


Ser Leu Ser Asn Asp Asp Ile Asn Lys Ile Ser Gly Asp Met Lys Asp 
    210                 215                 220                 


Ser Leu Lys Glu Met Ser Leu Glu Glu Ile Tyr Ser Tyr Glu Lys Tyr 
225                 230                 235                 240 


Gly Glu Phe Ile Thr Gln Glu Gly Ile Ser Phe Tyr Asn Asp Ile Cys 
                245                 250                 255     


Gly Lys Val Asn Ser Phe Met Asn Leu Tyr Cys Gln Lys Asn Lys Glu 
            260                 265                 270         


Asn Lys Asn Leu Tyr Lys Leu Gln Lys Leu His Lys Gln Ile Leu Cys 
        275                 280                 285             


Ile Ala Asp Thr Ser Tyr Glu Val Pro Tyr Lys Phe Glu Ser Asp Glu 
    290                 295                 300                 


Glu Val Tyr Gln Ser Val Asn Gly Phe Leu Asp Asn Ile Ser Ser Lys 
305                 310                 315                 320 


His Ile Val Glu Arg Leu Arg Lys Ile Gly Asp Asn Tyr Asn Gly Tyr 
                325                 330                 335     


Asn Leu Asp Lys Ile Tyr Ile Val Ser Lys Phe Tyr Glu Ser Val Ser 
            340                 345                 350         


Gln Lys Thr Tyr Arg Asp Trp Glu Thr Ile Asn Thr Ala Leu Glu Ile 
        355                 360                 365             


His Tyr Asn Asn Ile Leu Pro Gly Asn Gly Lys Ser Lys Ala Asp Lys 
    370                 375                 380                 


Val Lys Lys Ala Val Lys Asn Asp Leu Gln Lys Ser Ile Thr Glu Ile 
385                 390                 395                 400 


Asn Glu Leu Val Ser Asn Tyr Lys Leu Cys Ser Asp Asp Asn Ile Lys 
                405                 410                 415     


Ala Glu Thr Tyr Ile His Glu Ile Ser His Ile Leu Asn Asn Phe Glu 
            420                 425                 430         


Ala Gln Glu Leu Lys Tyr Asn Pro Glu Ile His Leu Val Glu Ser Glu 
        435                 440                 445             


Leu Lys Ala Ser Glu Leu Lys Asn Val Leu Asp Val Ile Met Asn Ala 
    450                 455                 460                 


Phe His Trp Cys Ser Val Phe Met Thr Glu Glu Leu Val Asp Lys Asp 
465                 470                 475                 480 


Asn Asn Phe Tyr Ala Glu Leu Glu Glu Ile Tyr Asp Glu Ile Tyr Pro 
                485                 490                 495     


Val Ile Ser Leu Tyr Asn Leu Val Arg Asn Tyr Val Thr Gln Lys Pro 
            500                 505                 510         


Tyr Ser Thr Lys Lys Ile Lys Leu Asn Phe Gly Ile Pro Thr Leu Ala 
        515                 520                 525             


Asp Gly Trp Ser Lys Ser Lys Glu Tyr Ser Asn Asn Ala Ile Ile Leu 
    530                 535                 540                 


Met Arg Asp Asn Leu Tyr Tyr Leu Gly Ile Phe Asn Ala Lys Asn Lys 
545                 550                 555                 560 


Pro Asp Lys Lys Ile Ile Glu Gly Asn Thr Ser Glu Asn Lys Gly Asp 
                565                 570                 575     


Tyr Lys Lys Met Ile Tyr Asn Leu Leu Pro Gly Pro Asn Lys Met Ile 
            580                 585                 590         


Pro Lys Val Phe Leu Ser Ser Lys Thr Gly Val Glu Thr Tyr Lys Pro 
        595                 600                 605             


Ser Ala Tyr Ile Leu Glu Gly Tyr Lys Gln Asn Lys His Ile Lys Ser 
    610                 615                 620                 


Ser Lys Asp Phe Asp Ile Thr Phe Cys His Asp Leu Ile Asp Tyr Phe 
625                 630                 635                 640 


Lys Asn Cys Ile Ala Ile His Pro Glu Trp Lys Asn Phe Gly Phe Asp 
                645                 650                 655     


Phe Ser Asp Thr Ser Thr Tyr Glu Asp Ile Ser Gly Phe Tyr Arg Glu 
            660                 665                 670         


Val Glu Leu Gln Gly Tyr Lys Ile Asp Trp Thr Tyr Ile Ser Glu Lys 
        675                 680                 685             


Asp Ile Asp Leu Leu Gln Glu Lys Gly Gln Leu Tyr Leu Phe Gln Ile 
    690                 695                 700                 


Tyr Asn Lys Asp Phe Ser Lys Lys Ser Thr Gly Asn Asp Asn Leu His 
705                 710                 715                 720 


Thr Met Tyr Leu Lys Asn Leu Phe Ser Glu Glu Asn Leu Lys Asp Ile 
                725                 730                 735     


Val Leu Lys Leu Asn Gly Glu Ala Glu Ile Phe Phe Arg Lys Ser Ser 
            740                 745                 750         


Ile Lys Asn Pro Ile Ile His Lys Lys Gly Ser Ile Leu Val Asn Arg 
        755                 760                 765             


Thr Tyr Glu Ala Glu Glu Lys Asp Gln Phe Gly Asn Ile Gln Ile Val 
    770                 775                 780                 


Arg Lys Asn Ile Pro Glu Asn Ile Tyr Gln Glu Leu Tyr Lys Tyr Phe 
785                 790                 795                 800 


Asn Asp Lys Ser Asp Lys Glu Leu Ser Asp Glu Ala Ala Lys Leu Lys 
                805                 810                 815     


Asn Val Val Gly His His Glu Ala Ala Thr Asn Ile Val Lys Asp Tyr 
            820                 825                 830         


Arg Tyr Thr Tyr Asp Lys Tyr Phe Leu His Met Pro Ile Thr Ile Asn 
        835                 840                 845             


Phe Lys Ala Asn Lys Thr Gly Phe Ile Asn Asp Arg Ile Leu Gln Tyr 
    850                 855                 860                 


Ile Ala Lys Glu Lys Asp Leu His Val Ile Gly Ile Ala Arg Gly Glu 
865                 870                 875                 880 


Arg Asn Leu Ile Tyr Val Ser Val Ile Asp Thr Cys Gly Asn Ile Val 
                885                 890                 895     


Glu Gln Lys Ser Phe Asn Ile Val Asn Gly Tyr Asp Tyr Gln Ile Lys 
            900                 905                 910         


Leu Lys Gln Gln Glu Gly Ala Arg Gln Ile Ala Arg Lys Glu Trp Lys 
        915                 920                 925             


Glu Ile Gly Lys Ile Lys Glu Ile Lys Glu Gly Tyr Leu Ser Leu Val 
    930                 935                 940                 


Ile His Glu Ile Ser Lys Met Val Ile Lys Tyr Asn Ala Ile Ile Ala 
945                 950                 955                 960 


Met Glu Asp Leu Ser Tyr Gly Phe Lys Lys Gly Arg Phe Lys Val Glu 
                965                 970                 975     


Arg Gln Val Tyr Gln Lys Phe Glu Thr Met Leu Ile Asn Lys Leu Asn 
            980                 985                 990         


Tyr Leu Val Phe Lys Asp Ile Ser  Ile Thr Glu Asn Gly  Gly Leu Leu 
        995                 1000                 1005             


Lys Gly  Tyr Gln Leu Thr Tyr  Ile Pro Asp Lys Leu  Lys Asn Val 
    1010                 1015                 1020             


Gly His  Gln Cys Gly Cys Ile  Phe Tyr Val Pro Ala  Ala Tyr Thr 
    1025                 1030                 1035             


Ser Lys  Ile Asp Pro Thr Thr  Gly Phe Val Asn Ile  Phe Lys Phe 
    1040                 1045                 1050             


Lys Asp  Leu Thr Val Asp Ala  Lys Arg Glu Phe Ile  Lys Lys Phe 
    1055                 1060                 1065             


Asp Ser  Ile Arg Tyr Asp Ser  Glu Lys Asn Leu Phe  Cys Phe Thr 
    1070                 1075                 1080             


Phe Asp  Tyr Asn Asn Phe Ile  Thr Gln Asn Thr Val  Met Ser Lys 
    1085                 1090                 1095             


Ser Ser  Trp Ser Val Tyr Thr  Tyr Gly Val Arg Ile  Lys Arg Arg 
    1100                 1105                 1110             


Phe Val  Asn Gly Arg Phe Ser  Asn Glu Ser Asp Thr  Ile Asp Ile 
    1115                 1120                 1125             


Thr Lys  Asp Met Glu Lys Thr  Leu Glu Met Thr Asp  Ile Asn Trp 
    1130                 1135                 1140             


Arg Asp  Gly His Asp Leu Arg  Gln Asp Ile Ile Asp  Tyr Glu Ile 
    1145                 1150                 1155             


Val Gln  His Ile Phe Glu Ile  Phe Arg Leu Thr Val  Gln Met Arg 
    1160                 1165                 1170             


Asn Ser  Leu Ser Glu Leu Glu  Asp Arg Asp Tyr Asp  Arg Leu Ile 
    1175                 1180                 1185             


Ser Pro  Val Leu Asn Glu Asn  Asn Ile Phe Tyr Asp  Ser Ala Lys 
    1190                 1195                 1200             


Ala Gly  Asp Ala Leu Pro Lys  Asp Ala Asp Ala Asn  Gly Ala Tyr 
    1205                 1210                 1215             


Cys Ile  Ala Leu Lys Gly Leu  Tyr Glu Ile Lys Gln  Ile Thr Glu 
    1220                 1225                 1230             


Asn Trp  Lys Glu Asp Gly Lys  Phe Ser Arg Asp Lys  Leu Lys Ile 
    1235                 1240                 1245             


Ser Asn  Lys Asp Trp Phe Asp  Phe Ile Gln Asn Lys  Arg Tyr Leu 
    1250                 1255                 1260             


<210>  126
<211>  1263
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Mad7d mature protein

<400>  126

Met Asn Asn Gly Thr Asn Asn Phe Gln Asn Phe Ile Gly Ile Ser Ser 
1               5                   10                  15      


Leu Gln Lys Thr Leu Arg Asn Ala Leu Ile Pro Thr Glu Thr Thr Gln 
            20                  25                  30          


Gln Phe Ile Val Lys Asn Gly Ile Ile Lys Glu Asp Glu Leu Arg Gly 
        35                  40                  45              


Glu Asn Arg Gln Ile Leu Lys Asp Ile Met Asp Asp Tyr Tyr Arg Gly 
    50                  55                  60                  


Phe Ile Ser Glu Thr Leu Ser Ser Ile Asp Asp Ile Asp Trp Thr Ser 
65                  70                  75                  80  


Leu Phe Glu Lys Met Glu Ile Gln Leu Lys Asn Gly Asp Asn Lys Asp 
                85                  90                  95      


Thr Leu Ile Lys Glu Gln Thr Glu Tyr Arg Lys Ala Ile His Lys Lys 
            100                 105                 110         


Phe Ala Asn Asp Asp Arg Phe Lys Asn Met Phe Ser Ala Lys Leu Ile 
        115                 120                 125             


Ser Asp Ile Leu Pro Glu Phe Val Ile His Asn Asn Asn Tyr Ser Ala 
    130                 135                 140                 


Ser Glu Lys Glu Glu Lys Thr Gln Val Ile Lys Leu Phe Ser Arg Phe 
145                 150                 155                 160 


Ala Thr Ser Phe Lys Asp Tyr Phe Lys Asn Arg Ala Asn Cys Phe Ser 
                165                 170                 175     


Ala Asp Asp Ile Ser Ser Ser Ser Cys His Arg Ile Val Asn Asp Asn 
            180                 185                 190         


Ala Glu Ile Phe Phe Ser Asn Ala Leu Val Tyr Arg Arg Ile Val Lys 
        195                 200                 205             


Ser Leu Ser Asn Asp Asp Ile Asn Lys Ile Ser Gly Asp Met Lys Asp 
    210                 215                 220                 


Ser Leu Lys Glu Met Ser Leu Glu Glu Ile Tyr Ser Tyr Glu Lys Tyr 
225                 230                 235                 240 


Gly Glu Phe Ile Thr Gln Glu Gly Ile Ser Phe Tyr Asn Asp Ile Cys 
                245                 250                 255     


Gly Lys Val Asn Ser Phe Met Asn Leu Tyr Cys Gln Lys Asn Lys Glu 
            260                 265                 270         


Asn Lys Asn Leu Tyr Lys Leu Gln Lys Leu His Lys Gln Ile Leu Cys 
        275                 280                 285             


Ile Ala Asp Thr Ser Tyr Glu Val Pro Tyr Lys Phe Glu Ser Asp Glu 
    290                 295                 300                 


Glu Val Tyr Gln Ser Val Asn Gly Phe Leu Asp Asn Ile Ser Ser Lys 
305                 310                 315                 320 


His Ile Val Glu Arg Leu Arg Lys Ile Gly Asp Asn Tyr Asn Gly Tyr 
                325                 330                 335     


Asn Leu Asp Lys Ile Tyr Ile Val Ser Lys Phe Tyr Glu Ser Val Ser 
            340                 345                 350         


Gln Lys Thr Tyr Arg Asp Trp Glu Thr Ile Asn Thr Ala Leu Glu Ile 
        355                 360                 365             


His Tyr Asn Asn Ile Leu Pro Gly Asn Gly Lys Ser Lys Ala Asp Lys 
    370                 375                 380                 


Val Lys Lys Ala Val Lys Asn Asp Leu Gln Lys Ser Ile Thr Glu Ile 
385                 390                 395                 400 


Asn Glu Leu Val Ser Asn Tyr Lys Leu Cys Ser Asp Asp Asn Ile Lys 
                405                 410                 415     


Ala Glu Thr Tyr Ile His Glu Ile Ser His Ile Leu Asn Asn Phe Glu 
            420                 425                 430         


Ala Gln Glu Leu Lys Tyr Asn Pro Glu Ile His Leu Val Glu Ser Glu 
        435                 440                 445             


Leu Lys Ala Ser Glu Leu Lys Asn Val Leu Asp Val Ile Met Asn Ala 
    450                 455                 460                 


Phe His Trp Cys Ser Val Phe Met Thr Glu Glu Leu Val Asp Lys Asp 
465                 470                 475                 480 


Asn Asn Phe Tyr Ala Glu Leu Glu Glu Ile Tyr Asp Glu Ile Tyr Pro 
                485                 490                 495     


Val Ile Ser Leu Tyr Asn Leu Val Arg Asn Tyr Val Thr Gln Lys Pro 
            500                 505                 510         


Tyr Ser Thr Lys Lys Ile Lys Leu Asn Phe Gly Ile Pro Thr Leu Ala 
        515                 520                 525             


Asp Gly Trp Ser Lys Ser Lys Glu Tyr Ser Asn Asn Ala Ile Ile Leu 
    530                 535                 540                 


Met Arg Asp Asn Leu Tyr Tyr Leu Gly Ile Phe Asn Ala Lys Asn Lys 
545                 550                 555                 560 


Pro Asp Lys Lys Ile Ile Glu Gly Asn Thr Ser Glu Asn Lys Gly Asp 
                565                 570                 575     


Tyr Lys Lys Met Ile Tyr Asn Leu Leu Pro Gly Pro Asn Lys Met Ile 
            580                 585                 590         


Pro Lys Val Phe Leu Ser Ser Lys Thr Gly Val Glu Thr Tyr Lys Pro 
        595                 600                 605             


Ser Ala Tyr Ile Leu Glu Gly Tyr Lys Gln Asn Lys His Ile Lys Ser 
    610                 615                 620                 


Ser Lys Asp Phe Asp Ile Thr Phe Cys His Asp Leu Ile Asp Tyr Phe 
625                 630                 635                 640 


Lys Asn Cys Ile Ala Ile His Pro Glu Trp Lys Asn Phe Gly Phe Asp 
                645                 650                 655     


Phe Ser Asp Thr Ser Thr Tyr Glu Asp Ile Ser Gly Phe Tyr Arg Glu 
            660                 665                 670         


Val Glu Leu Gln Gly Tyr Lys Ile Asp Trp Thr Tyr Ile Ser Glu Lys 
        675                 680                 685             


Asp Ile Asp Leu Leu Gln Glu Lys Gly Gln Leu Tyr Leu Phe Gln Ile 
    690                 695                 700                 


Tyr Asn Lys Asp Phe Ser Lys Lys Ser Thr Gly Asn Asp Asn Leu His 
705                 710                 715                 720 


Thr Met Tyr Leu Lys Asn Leu Phe Ser Glu Glu Asn Leu Lys Asp Ile 
                725                 730                 735     


Val Leu Lys Leu Asn Gly Glu Ala Glu Ile Phe Phe Arg Lys Ser Ser 
            740                 745                 750         


Ile Lys Asn Pro Ile Ile His Lys Lys Gly Ser Ile Leu Val Asn Arg 
        755                 760                 765             


Thr Tyr Glu Ala Glu Glu Lys Asp Gln Phe Gly Asn Ile Gln Ile Val 
    770                 775                 780                 


Arg Lys Asn Ile Pro Glu Asn Ile Tyr Gln Glu Leu Tyr Lys Tyr Phe 
785                 790                 795                 800 


Asn Asp Lys Ser Asp Lys Glu Leu Ser Asp Glu Ala Ala Lys Leu Lys 
                805                 810                 815     


Asn Val Val Gly His His Glu Ala Ala Thr Asn Ile Val Lys Asp Tyr 
            820                 825                 830         


Arg Tyr Thr Tyr Asp Lys Tyr Phe Leu His Met Pro Ile Thr Ile Asn 
        835                 840                 845             


Phe Lys Ala Asn Lys Thr Gly Phe Ile Asn Asp Arg Ile Leu Gln Tyr 
    850                 855                 860                 


Ile Ala Lys Glu Lys Asp Leu His Val Ile Gly Ile Ala Arg Gly Glu 
865                 870                 875                 880 


Arg Asn Leu Ile Tyr Val Ser Val Ile Asp Thr Cys Gly Asn Ile Val 
                885                 890                 895     


Glu Gln Lys Ser Phe Asn Ile Val Asn Gly Tyr Asp Tyr Gln Ile Lys 
            900                 905                 910         


Leu Lys Gln Gln Glu Gly Ala Arg Gln Ile Ala Arg Lys Glu Trp Lys 
        915                 920                 925             


Glu Ile Gly Lys Ile Lys Glu Ile Lys Glu Gly Tyr Leu Ser Leu Val 
    930                 935                 940                 


Ile His Glu Ile Ser Lys Met Val Ile Lys Tyr Asn Ala Ile Ile Ala 
945                 950                 955                 960 


Met Glu Asp Leu Ser Tyr Gly Phe Lys Lys Gly Arg Phe Lys Val Glu 
                965                 970                 975     


Arg Gln Val Tyr Gln Lys Phe Glu Thr Met Leu Ile Asn Lys Leu Asn 
            980                 985                 990         


Tyr Leu Val Phe Lys Asp Ile Ser  Ile Thr Glu Asn Gly  Gly Leu Leu 
        995                 1000                 1005             


Lys Gly  Tyr Gln Leu Thr Tyr  Ile Pro Asp Lys Leu  Lys Asn Val 
    1010                 1015                 1020             


Gly His  Gln Cys Gly Cys Ile  Phe Tyr Val Pro Ala  Ala Tyr Thr 
    1025                 1030                 1035             


Ser Lys  Ile Asp Pro Thr Thr  Gly Phe Val Asn Ile  Phe Lys Phe 
    1040                 1045                 1050             


Lys Asp  Leu Thr Val Asp Ala  Lys Arg Glu Phe Ile  Lys Lys Phe 
    1055                 1060                 1065             


Asp Ser  Ile Arg Tyr Asp Ser  Glu Lys Asn Leu Phe  Cys Phe Thr 
    1070                 1075                 1080             


Phe Asp  Tyr Asn Asn Phe Ile  Thr Gln Asn Thr Val  Met Ser Lys 
    1085                 1090                 1095             


Ser Ser  Trp Ser Val Tyr Thr  Tyr Gly Val Arg Ile  Lys Arg Arg 
    1100                 1105                 1110             


Phe Val  Asn Gly Arg Phe Ser  Asn Glu Ser Asp Thr  Ile Asp Ile 
    1115                 1120                 1125             


Thr Lys  Asp Met Glu Lys Thr  Leu Glu Met Thr Asp  Ile Asn Trp 
    1130                 1135                 1140             


Arg Asp  Gly His Asp Leu Arg  Gln Asp Ile Ile Asp  Tyr Glu Ile 
    1145                 1150                 1155             


Val Gln  His Ile Phe Glu Ile  Phe Arg Leu Thr Val  Gln Met Arg 
    1160                 1165                 1170             


Asn Ser  Leu Ser Glu Leu Glu  Asp Arg Asp Tyr Asp  Arg Leu Ile 
    1175                 1180                 1185             


Ser Pro  Val Leu Asn Glu Asn  Asn Ile Phe Tyr Asp  Ser Ala Lys 
    1190                 1195                 1200             


Ala Gly  Asp Ala Leu Pro Lys  Asp Ala Asp Ala Asn  Gly Ala Tyr 
    1205                 1210                 1215             


Cys Ile  Ala Leu Lys Gly Leu  Tyr Glu Ile Lys Gln  Ile Thr Glu 
    1220                 1225                 1230             


Asn Trp  Lys Glu Asp Gly Lys  Phe Ser Arg Asp Lys  Leu Lys Ile 
    1235                 1240                 1245             


Ser Asn  Lys Asp Trp Phe Asp  Phe Ile Gln Asn Lys  Arg Tyr Leu 
    1250                 1255                 1260             


<210>  127
<211>  624
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  PmCDA1 coding sequence

<400>  127
atgaccgacg ctgagtacgt gagaatccat gagaagttgg acatctacac gtttaagaaa       60

cagtttttca acaacaaaaa atccgtgtcg catagatgct acgttctctt tgaattaaaa      120

cgacggggtg aacgtagagc gtgtttttgg ggctatgctg tgaataaacc acagagcggg      180

acagaacgtg gcattcacgc cgaaatcttt agcattagaa aagtcgaaga atacctgcgc      240

gacaaccccg gacaattcac gataaattgg tactcatcct ggagtccttg tgcagattgc      300

gctgaaaaaa tcttagaatg gtataaccag gagctgcggg ggaacggcca cactttgaaa      360

atctgggctt gcaaactcta ttacgagaaa aatgcgagga atcaaattgg gctgtggaac      420

ctcagagata acggggttgg gttgaatgta atggtaagtg aacactacca atgttgcagg      480

aaaatattca tccaatcgtc gcacaatcaa ttgaatgaga atagatggct tgagaagact      540

ttgaagcgag ctgaaaaacg acggagcgag ttgtccatta tgattcaggt aaaaatactc      600

cacaccacta agagtcctgc tgtt                                             624


<210>  128
<211>  208
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  PmCDA1 mature polypeptide

<400>  128

Met Thr Asp Ala Glu Tyr Val Arg Ile His Glu Lys Leu Asp Ile Tyr 
1               5                   10                  15      


Thr Phe Lys Lys Gln Phe Phe Asn Asn Lys Lys Ser Val Ser His Arg 
            20                  25                  30          


Cys Tyr Val Leu Phe Glu Leu Lys Arg Arg Gly Glu Arg Arg Ala Cys 
        35                  40                  45              


Phe Trp Gly Tyr Ala Val Asn Lys Pro Gln Ser Gly Thr Glu Arg Gly 
    50                  55                  60                  


Ile His Ala Glu Ile Phe Ser Ile Arg Lys Val Glu Glu Tyr Leu Arg 
65                  70                  75                  80  


Asp Asn Pro Gly Gln Phe Thr Ile Asn Trp Tyr Ser Ser Trp Ser Pro 
                85                  90                  95      


Cys Ala Asp Cys Ala Glu Lys Ile Leu Glu Trp Tyr Asn Gln Glu Leu 
            100                 105                 110         


Arg Gly Asn Gly His Thr Leu Lys Ile Trp Ala Cys Lys Leu Tyr Tyr 
        115                 120                 125             


Glu Lys Asn Ala Arg Asn Gln Ile Gly Leu Trp Asn Leu Arg Asp Asn 
    130                 135                 140                 


Gly Val Gly Leu Asn Val Met Val Ser Glu His Tyr Gln Cys Cys Arg 
145                 150                 155                 160 


Lys Ile Phe Ile Gln Ser Ser His Asn Gln Leu Asn Glu Asn Arg Trp 
                165                 170                 175     


Leu Glu Lys Thr Leu Lys Arg Ala Glu Lys Arg Arg Ser Glu Leu Ser 
            180                 185                 190         


Ile Met Ile Gln Val Lys Ile Leu His Thr Thr Lys Ser Pro Ala Val 
        195                 200                 205             


<210>  129
<211>  315
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Linker coding sequence

<400>  129
cctagcaggg ctgaccccaa gaagaagagg aaggtgggtg gaggaggttc tggaggtgga       60

ggttctgcag agtatgtgcg ggccctcttt gactttaatg ggaatgatga agaagacctt      120

ccctttaaga aaggagacat cctgagaatc cgggataagc ctgaagagca gtggtggaat      180

gcagaggaca gcgaaggaaa gagggggatg attcctgtcc cttacgtgga gaagtattcc      240

ggagactata aggaccacga cggagactac aaggatcatg atattgatta caaagacgat      300

gacgataagt ctagg                                                       315


<210>  130
<211>  105
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Linker mature polypeptide

<400>  130

Pro Ser Arg Ala Asp Pro Lys Lys Lys Arg Lys Val Gly Gly Gly Gly 
1               5                   10                  15      


Ser Gly Gly Gly Gly Ser Ala Glu Tyr Val Arg Ala Leu Phe Asp Phe 
            20                  25                  30          


Asn Gly Asn Asp Glu Glu Asp Leu Pro Phe Lys Lys Gly Asp Ile Leu 
        35                  40                  45              


Arg Ile Arg Asp Lys Pro Glu Glu Gln Trp Trp Asn Ala Glu Asp Ser 
    50                  55                  60                  


Glu Gly Lys Arg Gly Met Ile Pro Val Pro Tyr Val Glu Lys Tyr Ser 
65                  70                  75                  80  


Gly Asp Tyr Lys Asp His Asp Gly Asp Tyr Lys Asp His Asp Ile Asp 
                85                  90                  95      


Tyr Lys Asp Asp Asp Asp Lys Ser Arg 
            100                 105 


<210>  131
<211>  297
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  UGI coding sequence

<400>  131
tctagaggct ccggaaccaa cctgtccgac atcatcgaga aggagaccgg caagcagctc       60

gttatccagg agtccatcct gatgctgccc gaggaggtcg aggaggtcat cggcaacaag      120

cccgagtccg acatcctggt ccacaccgcc tacgacgagt ccaccgacga gaacgtcatg      180

ctgctgacct ccgacgcccc cgagtacaag ccctgggccc tggtcatcca ggactccaac      240

ggcgagaaca agatcaagat gctgtccggc ggctccccca agaagaagcg caaggtc         297


<210>  132
<211>  99
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  UGI mature polypeptide

<400>  132

Ser Arg Gly Ser Gly Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr 
1               5                   10                  15      


Gly Lys Gln Leu Val Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu 
            20                  25                  30          


Val Glu Glu Val Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His 
        35                  40                  45              


Thr Ala Tyr Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser 
    50                  55                  60                  


Asp Ala Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn 
65                  70                  75                  80  


Gly Glu Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Pro Lys Lys Lys 
                85                  90                  95      


Arg Lys Val 
            


<210>  133
<211>  17429
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAT3530

<400>  133
accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag       60

ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca      120

gcgctgcgat gataccgcga gaaccacgct caccggctcc ggatttatca gcaataaacc      180

agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt      240

ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg      300

ttgttgccat cgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca      360

gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg      420

ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca      480

tggttatggc agcgctacat aattctctta ctgtcatgcc atccgtaaga tgcttttctg      540

tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct      600

cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca      660

tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca      720

gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg      780

tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac      840

ggaaatgttg aatactcata ttcttccttt ttcaatatta ttgaagcatt tatcagggtt      900

attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggtca      960

gtgttacaac caattaacca attctgaaca ttatcgcgag cccatttata cctgaatatg     1020

gctcataaca ccccttgttt gcctggcggc agtagcgcgg tggtcccacc tgaccccatg     1080

ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg tggggactcc ccatgcgaga     1140

gtagggaact gccaggcatc aaataaaacg aaaggctcag tcgaaagact gggcctttcg     1200

cccgggctaa ttatggggtg tcgcccttat tcgactctat agtgaagttc ctattctcta     1260

gaaagtatag gaacttctga agtggggatt taaatgcggc cgcgctgagg gtttaatcga     1320

cgaagcagct gacggccagt gccaagctta acgcgtaccc gggcccagta tatgttccgc     1380

agatgactgg agctctgcca tacgtgccct ctcaagcacc atttgttcca tctacagaga     1440

ctagtcacca actagtctat caagactcac agggtacatt gctgagacca actgaccaga     1500

ggcagggtag cggattgacg gctccatctc cttcacttac aaggtctatt gaaagccctt     1560

tagcatcacc aagcggagaa tagattgtta agcttatttt ttgtatactg ttttgtgata     1620

gcacgaagtt tttccacggt atcttgtaaa aatatatatt tgtggcgggc ttacctacat     1680

caaattaata agagactaat tataaactaa acacacaagc aagctacttt agggtaaaag     1740

tttataaatg cttttgacgt ataaacgttg cttgtattta ttattacaat taaaggtgga     1800

tagaaaacct agagactagt tagaaactaa tctcaggttt gcgttaaact aaatcagagc     1860

ccgagaggtt aacagaacct agaaggggac tagatatccg ggtagggaaa caaaaaaaaa     1920

aaacaagaca gccacatatt agggagacta gttagaagct agttccagga ctaggaaaat     1980

aaaagacaat gataccacag tctagttgac aactagatag attctagatt gaggccaaag     2040

tctctgagat ccaggttagt tgcaactaat actagttagt atctagtctc ctataactct     2100

gaagctagaa taacttacta ctattatcct caccactgtt cagctgcgca aacggagtga     2160

ttgcaaggtg ttcagagact agttattgac tagtcagtga ctagcaataa ctaacaaggt     2220

attaacctac catgtctgcc atcaccctgc acttcctcgg gctcagcagc cttttcctcc     2280

tcattttcat gctcattttc cttgtttaag actgtgacta gtcaaagact agtccagaac     2340

cacaaaggag aaatgtctta ccactttctt cattgcttgt ctcttttgca ttatccatgt     2400

ctgcaactag ttagagtcta gttagtgact agtccgacga ggacttgctt gtctccggat     2460

tgttggagga actctccagg gcctcaagat ccacaacaga gccttctaga agactggtca     2520

ataactagtt ggtctttgtc tgagtctgac ttacgaggtt gcatactcgc tccctttgcc     2580

tcgtcaatcg atgagaaaaa gcgccaaaac tcgcaatatg gctttgaacc acacggtgct     2640

gagactagtt agaatctagt cccaaactag cttggatagc ttacctttgc cctttgcgtt     2700

gcgacaggtc ttgcagggta tggttccttt ctcaccagct gatttagctg ccttgctacc     2760

ctcacggcgg atctgccata aagagtggct agaggttata aattagcact gatcctaggt     2820

acggggctga atgtaacttg cctttccttt ctcatcgcgc ggcaagacag gcttgctcaa     2880

attcctacca gtcacagggg tatgcacggc gtacggacca cttgaactag tcacagatta     2940

gttagcaact agtctgcatt gaatggctgt acttacgggc cctcgccatt gtcctgatca     3000

tttccagctt caccctcgtt gctgcaaagt agttagtgac tagtcaagga ctagttgaaa     3060

tgggagaaga aactcacgaa ttctcgactc ccttagtatt gtggtccttg gacttggtgc     3120

tgctatatat tagctaatac actagttaga ctcacagaaa cttacgcagc tcgcttgcgc     3180

ttcttggtag gagtcggggt tgggagaaca gtgccttcaa acaagccttc ataccatgct     3240

acttgactag tcagggacta gtcaccaagt aatctagata ggacttgcct ttggcctcca     3300

tcagttcctt catagtggga ggaccattgt gcaatgtaaa ctccatgccg tgggagttct     3360

tgtccttcaa gtgcttgacc aatatgtttc tgttggcaga gggaacctgt caactagtta     3420

ataactagtc agaaactatg atagcagtag actcactgta cgcttgaggc atcccttcac     3480

tcggcagtag acttcatatg gatggatatc aggcacgcca ttgtcgtcct gtggactagt     3540

cagtaactag gcttaaagct agtcgggtcg gcttactatc ttgaaatccg gcagcgtaag     3600

ctccccgtcc ttaactgcct cgagatagtg acagtactct ggggactttc ggagatcgtt     3660

atcgttatcg cgaatgctcg gcatactaac tgttgactag tcttggacta gtcccgagca     3720

aaaaggattg gaggaggagg aggaaggtga gagtgagaca aagagcgaaa taagagcttc     3780

aaaggctatc tctaagcagt atgaaggtta agtatctagt tcttgactag atttaaagag     3840

atttcgacta gttatgtacc tggagtttgg atataggaat gtgttgtggt aacgaaatgt     3900

aagggggagg aaagaaaaag tcgtcaagag gtaactctaa gtcggccatt cctttttggg     3960

aggcgctaac cataaacggc atggtcgact tagagttagc tcagggaatt tagggagtta     4020

tctgcgacca ccgaggaacg gcggaatgcc aaagaatccc gatggagctc tagctggcgg     4080

ttgacaaccc caccttttgg cgtttctgcg gcgttgcagg cgggactgga tacttcgtag     4140

aaccagaaag gcaaggcaga acgcgctcag caagagtgtt ggaagtgata gcatgatgtg     4200

ccttgttaac taggtaccaa tctgcagtat gcttgatgtt atccaaagtg tgagagagga     4260

aggtccaaac atacacgatt gggagagggc ctaggtataa gagtttttga gtagaacgca     4320

tgtgagccca gccatctcga ggagattaaa cacgggccgg catttgatgg ctatgttagt     4380

accccaatgg aaacggtgag agtccagtgg tcgcagataa ctccctaaat tccctgagct     4440

aactctaagt cgaccatgcc gtttatggtt agcgcctccc aaaaaggaat ggccgactta     4500

gagttacctc ttgacgactt tttctttcct cccccttaca tttcgttacc acaacacatt     4560

cctatatcca aactccaggt acataactag tcgaaatctc tttaaatcta gtcaagaact     4620

agatacttaa ccttcatact gcttagagat agcctttgaa gctcttattt cgctctttgt     4680

ctcactctca ccttcctcct cctcctccaa tcctttttgc tcgggactag tccaagacta     4740

gtcaacagtt agtatgccga gcattcgcga taacgataac gatctccgaa agtccccaga     4800

gtactgtcac tatctcgagg cagttaagga cggggagctt acgctgccgg atttcaagat     4860

agtaagccga cccgactagc tttaagccta gttactgact agtccacagg acgacaatgg     4920

cgtgcctgat atccatccat atgaagtcta ctgccgagtg aagggatgcc tcaagcgtac     4980

agtgagtcta ctgctatcat agtttctgac tagttattaa ctagttgaca ggttccctct     5040

gccaacagaa acatattggt caagcacttg aaggacaaga actcccacgg catggagttt     5100

acattgcaca atggtcctcc cactatgaag gaactgatgg aggccaaagg caagtcctat     5160

ctagattact tggtgactag tccctgacta gtcaagtagc atggtatgaa ggcttgtttg     5220

aaggcactgt tctcccaacc ccgactccta ccaagaagcg caagcgagct gcgtaagttt     5280

ctgtgagtct aactagtgta ttagctaata tatagcagca ccaagtccaa ggaccacaat     5340

actaagggag tcgagaattc gtgagtttct tctcccattt caactagtcc ttgactagtc     5400

actaactact ttgcagcaac gagggtgaag ctggaaatga tcaggacaat ggcgagggcc     5460

cgtaagtaca gccattcaat gcagactagt tgctaactaa tctgtgacta gttcaagtgg     5520

tccgtacgcc gtgcataccc ctgtgactgg taggaatttg agcaagcctg tcttgccgcg     5580

cgatgagaaa ggaaaggcaa gttacattca gccccgtacc taggatcagt gctaatttat     5640

aacctctagc cactctttat ggcagatccg ccgtgagggt agcaaggcag ctaaatcagc     5700

tggtgagaaa ggaaccatac cctgcaagac ctgtcgcaac gcaaagggca aaggtaagct     5760

atccaagcta gtttgggact agattctaac tagtctcagc accgtgtggt tcaaagccat     5820

attgcgagtt ttggcgcttt ttctcatcga ttgacgaggc aaagggagcg agtatgcaac     5880

ctcgtaagtc agactcagac aaagaccaac tagttattga ccagtcttct agaaggctct     5940

gttgtggatc ttgaggccct ggagagttcc tccaacaatc cggagacaag caagtcctcg     6000

tcggactagt cactaactag actctaacta gttgcagaca tggataatgc aaaagagaca     6060

agcaatgaag aaagtggtaa gacatttctc ctttgtggtt ctggactagt ctttgactag     6120

tcacagtctt aaacaaggaa aatgagcatg aaaatgagga ggaaaaggct gctgagcccg     6180

aggaagtgca gggtgatggc agacatggta ggttaatacc ttgttagtta ttgctagtca     6240

ctgactagtc aataactagt ctctgaacac cttgcaatca ctccgtttgc gcagctgaac     6300

agtggtgagg ataatagtag taagttattc tagcttcaga gttataggag actagatact     6360

aactagtatt agttgcaact aacctggatc tcagagactt tggcctcaat ctagaatcta     6420

tctagttgtc aactagactg tggtatcatt gtcttttatt ttcctagtcc tggaactagc     6480

ttctaactag tctccctaat atgtggctgt cttgtttttt ttttttgttt ccctacccgg     6540

atatctagtc cccttctagg ttctgttaac ctctcgggct ctgatttagt ttaacgcaaa     6600

cctgagatta gtttctaact agtctctagg ttttctatcc acctttaatt gtaataataa     6660

atacaagcaa cgtttatacg tcaaaagcat ttataaactt ttaccctaaa gtagcttgct     6720

tgtgtgttta gtttataatt agtctcttat taatttgatg taggtaagcc cgccacaaat     6780

atatattttt acaagatacc gtggaaaaac ttcgtgctat cacaaaacag tatacaaaaa     6840

ataagcttaa caatctattc tccgcttggt gatgctaaag ggctttcaat agaccttgta     6900

agtgaaggag atggagccgt caatccgcta ccctgcctct ggtcagttgg tctcagcaat     6960

gtaccctgtg agtcttgata gactagttgg tgactagtct ctgtagatgg aacaaatggt     7020

gcttgagagg gcacgtatgg cagagctcca gtcatctgcg gaacatatac tgggcccggg     7080

aagatctcat ggtcatagct gtttccgtta attaatggtt cacttctctt tagaaatcaa     7140

ctgtgggttt tgctttttgc ttcattctct ttgtcttctc catctttgat caaatcctgg     7200

actttctcaa tccccagcta attcaatcat agtcagtttt ctatttttat tatttctttt     7260

tcttttgaaa tgtgattaac aaccagtccg ttatatatct tgtacccaga ttacgcccaa     7320

ctcgtgctcc tcagccacaa agatactcaa ttgatagcca agatacatac ataccacaaa     7380

gtaaggactc catgcattga gtattactca tcgtattcta gactactcca aaactcagca     7440

catagacaaa caatacgaac ctcgtctagg ggtgattcag aggcggcaaa gcggggtttt     7500

cgcatttgat gttcctggca cttatgtaag cccacgcttc ccgctcaact aaaccatcag     7560

ccaatcagac tgctcagatt tatcttttga agggtaaata aatcattgta aagaagaaca     7620

agtggcttgc ttgtcaagca atggcatcat tggtctagtg gtagaattcg tcgttgccat     7680

cgacgaggcc cgtgttcgat tcacggatga tgcagtcaaa agaccttttt aatttctact     7740

cttgtagatg cgatcgcttt ttttttgagc atttatcagc ttgatataga ggtaggaatg     7800

tatggaggtg cagaatggct attttgttat tggagcgggt tcgaaacgga gggcaggaga     7860

ctttttctaa atacgtcacg tgatatagag ctgctttaat taacgagaca gcagaatcac     7920

cgcccaagtt aagcctttgt gctgatcatg ctctcgaacg ggccaagttc gggaaaagca     7980

aaggagcgtt tagtgagggg caatttgact cacctcccag gcaacagatg aggggggcaa     8040

aaagaaagaa attttcgtga gtcaatatgg attccgagca tcattttctt gcggtctatc     8100

ttgctacgta tgttgatctt gacgctgtgg atcaagcaac gccactcgct cgctccatcg     8160

caggctggtc gcagacaaat taaaaggcgg caaactcgta cagccgcggg gttgtccgct     8220

gcaaagtaca gagtgataaa agccgccatg cgaccatcaa cgcgttgatg cccagctttt     8280

tcgatccgag aatccaccgt agaggcgata gcaagtaaag aaaagctaaa caaaaaaaaa     8340

tttctgcccc taagccatga aaacgagatg gggtggagca gaaccaagga aagagtcgcg     8400

ctgggctgcc gttccggaag gtgttgtaaa ggctcgacgc ccaaggtggg agtctaggag     8460

aagaatttgc atcgggagtg gggcgggtta cccctccata tccaatgaca gatatctacc     8520

agccaagggt ttgagcccgc ccgcttagtc gtcgtcctcg cttgcccctc cataaaagga     8580

tttcccctcc ccctcccaca aaattttctt tcccttcctc tccttgtccg cttcagtacg     8640

tatatcttcc cttccctcgc ttctctcctc catccttctt tcatccatct cctgctaact     8700

tctctgctca gcacctctac gcattactag ccgtagtatc tgagcacttc tcccttttat     8760

attccacaaa acataacaca accttcacca tgaacaacgg cacaaacaac ttccagaact     8820

tcattggaat ctcgtcgttg cagaagactt tgcgcaacgc cctcatcccc acagaaacta     8880

cccagcagtt cattgtgaag aacggaatca tcaaggaaga tgaactccga ggcgagaacc     8940

gccagatttt gaaggacatc atggatgatt actaccgtgg tttcatctcg gaaacgctct     9000

cctccattga cgacatcgat tggacttcgt tgttcgaaaa gatggaaatc cagctcaaaa     9060

acggcgataa caaggatacc ttgatcaagg agcagaccga gtatcggaag gcgatccata     9120

agaagttcgc caacgatgat cggttcaaga acatgttctc ggccaagttg atttccgaca     9180

ttctccccga attcgtgatc cataacaaca actactcggc gtcggagaag gaggagaaga     9240

cgcaggtcat caagttgttc tcgaggttcg ccacatcgtt caaagagtat tttaagaatc     9300

gtgcgaactg tttctcggca gatgatatct cctcgtcctc ctgtcaccgc attgtgaacg     9360

acaacgcgga aatcttcttc tcgaacgcgt tggtgtatag gcgcatcgtg aagtccctct     9420

ccaacgatga catcaacaaa atctcgggag atatgaagga ttcgctcaag gagatgtcgt     9480

tggaggaaat ctactcctat gagaagtatg gcgagttcat tacgcaggag ggcatttcct     9540

tctacaacga catttgtggt aaagtcaact cgttcatgaa cctctactgt cagaaaaaca     9600

aggagaacaa aaacctctat aagctccaga agttgcataa gcagatcctc tgtatcgcag     9660

acacctcgta cgaggtccct tacaagttcg aatccgatga ggaggtctac cagtccgtca     9720

acggattctt ggacaacatc tcctcgaaac acattgtcga gcggctccga aagatcggcg     9780

ataactacaa cggctacaac ttggacaaaa tctatatcgt ctccaagttc tatgagtccg     9840

tctcgcagaa aacctatcgt gattgggaga ctatcaacac tgcgctcgag attcactata     9900

acaacatctt gcctggtaac ggcaaatcga aagccgacaa ggtgaagaag gccgtgaaaa     9960

acgatctcca gaagtcgatc acagaaatca acgaactcgt ctcgaactac aagctctgtt    10020

cggatgataa catcaaggcg gaaacgtaca tccatgaaat ctcgcatatc ttgaacaact    10080

tcgaggccca ggaactcaaa tacaaccccg agatccactt ggtcgagtcg gagctcaaag    10140

cctcggagtt gaagaacgtc ttggatgtca tcatgaacgc attccactgg tgttccgtgt    10200

tcatgaccga ggaactcgtc gataaagaca acaacttcta cgcggaactc gaggaaatct    10260

acgatgaaat ctatcccgtg atctccctct acaacctcgt gcgaaactac gtcactcaga    10320

agccctattc caccaagaag atcaagctca acttcggcat ccccactctc gcagacggtt    10380

ggtcgaagtc gaaggagtac tccaacaacg ccattatcct catgcgagac aacctctact    10440

acttgggtat cttcaacgca aagaacaagc cggataagaa gatcattgaa ggcaacactt    10500

cggaaaacaa gggagactat aagaagatga tctacaacct cctccctgga cccaacaaga    10560

tgattcctaa agtgttcctc tcgtcgaaga ctggtgtgga aacgtataag ccgtcggcct    10620

acatcttgga gggctacaaa cagaacaagc atatcaagtc ctcgaaggac ttcgacatca    10680

ctttctgtca cgacctcatc gactatttca agaactgtat tgcaatccat ccggaatgga    10740

agaacttcgg cttcgatttc tcggatactt cgacatacga agatatctcg ggattctacc    10800

gagaggtcga attgcagggc tataagattg attggaccta catctcggaa aaggatatcg    10860

acttgctcca ggaaaagggc cagctctacc tcttccagat ttacaacaag gacttctcca    10920

agaagtcgac gggtaacgac aacttgcaca caatgtatct caaaaacctc ttctcggagg    10980

agaacttgaa ggatatcgtg ctcaaattga acggagaggc cgaaatcttc ttccgtaagt    11040

cctccatcaa gaacccgatc atccataaga agggatcgat cttggtcaac cggacttacg    11100

aagcagagga aaaagatcag ttcggaaaca tccagattgt caggaagaac atccctgaaa    11160

acatctatca ggagttgtat aagtacttca acgacaagtc ggataaggag ctctccgacg    11220

aagcagccaa actcaagaac gtcgtcggac accatgaagc agcaaccaac attgtgaagg    11280

actaccggta cacttacgac aagtacttct tgcacatgcc gatcactatc aacttcaaag    11340

ccaacaagac cggattcatt aacgacagga tcctccagta cattgccaaa gaaaaggacc    11400

tccatgtcat cggtatcgcg aggggagaac ggaacctcat ctacgtctcc gtgattgaca    11460

cttgtggcaa cattgtcgaa cagaagtcgt tcaacatcgt caacggttac gattaccaga    11520

ttaagttgaa acagcaggaa ggtgcgaggc agattgcgcg aaaggaatgg aaggagattg    11580

gcaaaatcaa ggagattaag gaaggctact tgtcgttggt catccacgaa atctcgaaaa    11640

tggtgatcaa atacaacgcc atcatcgcca tggaagacct ctcgtacggc ttcaaaaagg    11700

gacggttcaa agtggagcgt caggtgtacc agaagttcga aacaatgttg atcaacaagt    11760

tgaactactt ggtgttcaag gacatttcca ttaccgagaa cggaggattg ctcaagggtt    11820

atcagctcac gtacatcccc gacaagttga aaaacgtggg acaccagtgt ggctgtatct    11880

tctacgtgcc tgcagcctac acgtcgaaaa tcgaccctac aacaggattc gtgaacatct    11940

tcaagttcaa ggatctcacc gtcgacgcga agcgggagtt catcaaaaag ttcgactcca    12000

tccgctatga ttcggagaag aacttgttct gtttcacatt cgactacaac aacttcatta    12060

ctcagaacac cgtgatgtcc aaatcgtcgt ggtccgtgta cacgtatggt gtgcgcatca    12120

aaaggcgctt cgtcaacggt cgcttctcca acgaatcgga cacgatcgat atcacgaaag    12180

acatggagaa aacattggaa atgaccgaca tcaactggcg tgacggccat gacctcaggc    12240

aggacatcat cgattacgag atcgtccagc acatcttcga aatcttccgt ctcaccgtgc    12300

agatgaggaa ctccctctcc gagctcgaag atcgggatta cgaccggctc atttcccctg    12360

tgttgaacga gaacaacatc ttctacgact cggcaaaagc gggagatgca ttgccgaagg    12420

acgccgatgc gaacggtgca tattgtattg cactcaaggg tctctacgaa atcaagcaga    12480

tcaccgaaaa ctggaaggag gacggcaaat tctcgaggga caagttgaag atttcgaaca    12540

aggattggtt cgatttcatc cagaacaaga ggtacttgcc tagcagggct gaccccaaga    12600

agaagaggaa ggtgggtgga ggaggttctg gaggtggagg ttctgcagag tatgtgcggg    12660

ccctctttga ctttaatggg aatgatgaag aagaccttcc ctttaagaaa ggagacatcc    12720

tgagaatccg ggataagcct gaagagcagt ggtggaatgc agaggacagc gaaggaaaga    12780

gggggatgat tcctgtccct tacgtggaga agtattccgg agactataag gaccacgacg    12840

gagactacaa ggatcatgat attgattaca aagacgatga cgataagtct aggatgaccg    12900

acgctgagta cgtgagaatc catgagaagt tggacatcta cacgtttaag aaacagtttt    12960

tcaacaacaa aaaatccgtg tcgcatagat gctacgttct ctttgaatta aaacgacggg    13020

gtgaacgtag agcgtgtttt tggggctatg ctgtgaataa accacagagc gggacagaac    13080

gtggcattca cgccgaaatc tttagcatta gaaaagtcga agaatacctg cgcgacaacc    13140

ccggacaatt cacgataaat tggtactcat cctggagtcc ttgtgcagat tgcgctgaaa    13200

aaatcttaga atggtataac caggagctgc gggggaacgg ccacactttg aaaatctggg    13260

cttgcaaact ctattacgag aaaaatgcga ggaatcaaat tgggctgtgg aacctcagag    13320

ataacggggt tgggttgaat gtaatggtaa gtgaacacta ccaatgttgc aggaaaatat    13380

tcatccaatc gtcgcacaat caattgaatg agaatagatg gcttgagaag actttgaagc    13440

gagctgaaaa acgacggagc gagttgtcca ttatgattca ggtaaaaata ctccacacca    13500

ctaagagtcc tgctgtttct agaggctccg gaaccaacct gtccgacatc atcgagaagg    13560

agaccggcaa gcagctcgtt atccaggagt ccatcctgat gctgcccgag gaggtcgagg    13620

aggtcatcgg caacaagccc gagtccgaca tcctggtcca caccgcctac gacgagtcca    13680

ccgacgagaa cgtcatgctg ctgacctccg acgcccccga gtacaagccc tgggccctgg    13740

tcatccagga ctccaacggc gagaacaaga tcaagatgct gtccggcggc tcccccaaga    13800

agaagcgcaa ggtctaatgt acagcggaca ttcgatttat gccgttatga cttccttaaa    13860

aaagccttta cgaatgaaag aaatggaatt agacttgtta tgtagttgat tctacaatgg    13920

attatgattc ctgaacttca aatccgctgt tcattattaa tctcagctct tcccgtaaag    13980

ccaatgttga aactattcgt aaatgtacct cgttttgcgt gtaccttgct tatcacgtga    14040

tattacatga cctggacaga gttctgcgcg aaagtcataa cgtaaatccc gggcggtagg    14100

tgcgtcccgg gcggaaggta gttttctcgt ccaccccaac gcgtttatca acctcaactt    14160

tcaacaacca tcatgccacc aaaagcgcgt aaaacaaagc gagatttgat tgagcaagag    14220

ggcaggatcc aatgcgcgat tcaagacatt aaaaatggaa aatttcaaaa aattgcgccc    14280

gcagcgcgtg catacaaaat tcatcccaat actcctcgtg tactgtgtaa gcgcccacta    14340

ggtaatatga catgattacg aattcgagct cggtacccgg ccggaactcc acgtctagag    14400

gatccaccag tgattgacca atgttttatc ttctacagtt ctgcctgtct accccattct    14460

agctgtacct gactacagag tagtttaatt gtggttgacc ccacagtcgg aggcggagga    14520

atacagcacc gatgtggcct gtctccatcc agattggcac gcaattttta cacgcggaaa    14580

agatcgagat agagtacgac tttaaattta gtccccggcg gcttctattt tagaatattt    14640

gagatttgat tctcaagcaa ttgatttggt tgggtcaccc tcaattggat aatatacctc    14700

attgctcggc tacttcaact catcaatcac cgtcataccc cgcatataac cctccattcc    14760

cacgatgtcg tccaagtcgc aattgactta cggtgctcga gccagcaagc accccaatcc    14820

tctggcaaag agactttttg agattgccga agcaaagaag acaaacgtta ccgtctctgc    14880

tgatgtgacg acaacccgag aactcctgga cctcgctgac cgtacggaag ctgttggatc    14940

caatacatat gccgtctagc aatggactaa tcaacttttg atgatacagg tctcggtccc    15000

tacatcgccg tcatcaagac acacatcgac atcctcaccg atttcagcgt cgacactatc    15060

aatggcctga atgtgctggc tcaaaagtac aactttttga tcttcgagga ccgcaaattc    15120

atcgacatcg gcaataccgt ccagaagcaa taccacggcg gtgctctgag gatctccgaa    15180

tgggcccaca ttatcaactg cagcgttctc cctggcgagg gcatcgtcga ggctctggcc    15240

cagaccgcat ctgcgcaaga cttcccctat ggtcctgaga gaggactgtt ggtcctggca    15300

gagatgaccc ccaaaggatc gctggctacg ggcgagtata ccaaggcatc ggttgactac    15360

gctcgcaaat acaagaactt cgttatgggt ttcgtgtcga cgcgggccct gacggaagtg    15420

cagtcggatg tgtcttcagc ctcggaggat gaagatttcg tggtcttcac gacgggtgtg    15480

aacctctctt ccaaaggaga taagcttgga cagcaatacc agactcctgc atcggctatt    15540

ggacgcggtg ccgactttat catcgccggt cgaggcatct acgctgctcc cgacccggtt    15600

gaagctgcac agcggtacca gaaagaaggc tgggaagctt atatggccag agtatgcggc    15660

aagtcatgat ttcctcttgg agcaaaagtg tagtgccagt acgagtgttg tggaggaagg    15720

ctgcatacat tgtgcctgtc attaaacgat gagctcgtcc gtattggccc ctgtaatgcc    15780

atgttttccg cccccaatcg tcaaggtttt ccctttgtta gattcctacc agtcatctag    15840

caaggcggcc gcagctagca caattgaggc atccccacta ccgcattaag acctcagcgc    15900

ggccgcaaat ttaaataaaa tgaagtgaag ttcctatact ttctagagaa taggaacttc    15960

tatagtgagt cgaataaggg cgacacaaaa tttattctaa atgcataata aatactgata    16020

acatcttata gtttgtatta tattttgtat tatcgttgac atgtataatt ttgatatcaa    16080

aaactgattt tccctttatt attttcgaga tttattttct taattctctt taacaaacta    16140

gaaatattgt atatacaaaa aatcataaat aatagatgaa tagtttaatt ataggtgttc    16200

atcaatcgaa aaagcaacgt atcttattta aagtgcgttg cttttttctc atttataagg    16260

ttaaataatt ctcatatatc aagcaaagtg acaggcgccc ttaaatattc tgacaaatgc    16320

tctttcccta aactcccccc ataaaaaaac ccgccgaagc gggtttttac gttatttgcg    16380

gattaacgat tactcgttat cagaaccgcc cagggggccc gagcttaaga ctggccgtcg    16440

ttttacaaca cagaaagagt ttgtagaaac gcaaaaaggc catccgtcag gggccttctg    16500

cttagtttga tgcctggcag ttccctactc tcgccttccg cttcctcgct cactgactcg    16560

ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg    16620

ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag    16680

gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac    16740

gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga    16800

taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt    16860

accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc    16920

tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc    16980

cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta    17040

agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat    17100

gtaggcggtg ctacagagtt cttgaagtgg tgggctaact acggctacac tagaagaaca    17160

gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct    17220

tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt    17280

acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct    17340

cagtggaacg acgcgcgcgt aactcacgtt aagggatttt ggtcatgagc ttgcgccgtc    17400

ccgtcaagtc agcgtaatgc tctgctttt                                      17429


<210>  134
<211>  6651
<212>  DNA
<213>  Aspergillus oryzae

<400>  134
atggaggggc cacgcggcgt ctatctcttc ggagaccaga caagtgattt cgacgccggc       60

ttacgtcgcc tcctacaagt aaagaataac acaattgttg catcgttctt ccagagatgc      120

tttcacgctt tgcgccaaga gatcgcgagg ctttcaccat ctgaacggaa gatcttcccc      180

cggtttacga gcatagtgga tctactggcg cgtcaccggg agtcagaccc taatccggct      240

ctggagagtg cgttgacctg tatctatcaa ttgggatgct ttataaagta cgtgtaactg      300

cagatcctga cccgtttgaa cgagcctaac ctgagatagc tactacggag accttggaaa      360

cgtgtaccca tctgcttcag actgccatat agttggcctg tgcgcgggtc ttcttagttc      420

tgcagctgta agctgttcga acaatgttgg agaattgctc cccgctgcgg ttgaagcggt      480

ggtggtagct ctccgacttg gtctatgcgt ccttaaagtt cgagagctgg tgagctctga      540

ccaagcgtcg tcaacaagct ggtcagtctt gatttcaggg attagcgaga aagatgcctc      600

gcagcttata ggagaattca ctgctgaacg ggtaagtcaa ttgatctgaa atagtttgca      660

ggacagaatg ttctaaccac tggataaagg caattcctcc ttcatccaaa ccgtatatca      720

gtgcggtggg atataacagt ataaccatca gcgcaccgcc taaggtcctt gatgatttaa      780

ttgattctag gctgtctaag agccataagc cggtgagggc gcaaatccat ggtccttacc      840

atgcagcaca tctgtactat ggccgagatg tcgacaggat catcgaaagc tgccataatg      900

aggtcgtttc aaactacaca ccccgtatcc ccgtactatc aagtactacg ggacagccga      960

tagaggccaa acacatgaaa gatctactta aggccgccct tgaagagatt ctactacgtc     1020

aactatgctg ggagaaagtg accgatgcct gctattccat attaaaaact gctcgtcatc     1080

aaccatgcaa gttgttccca atttcaagca ctgcgacaca aagcttgttt acagctctta     1140

cgaaagccgg gataaccgac atcgaagtgg aaaatgggct cggagatgtt cccacgaacc     1200

cgaaggacaa ccttaacatc agcggcaggg cggactgctc caagatagct atcattggca     1260

tgtctggacg attcccagaa gctgatggca cagagagttt ctgggacctt ctgtataatg     1320

gcctcgatgt acaccggaag gtgcctgcag agcgttggga tgttgatgcc cacgttgatc     1380

ctaccggaac aaaacggaac accagcaagg ttccatacgg atgctggata aacgaaccgg     1440

ggttatttga cccccgcttc ttcaatatgt cgccacgcga agccctccag gcagatcccg     1500

ctcaaagact tgcattgctc acggcctatg aagctcttga aatggccggc tttatccccg     1560

acagcacccc ttctacacag agggatcgag tcggcctctt ctatggaatg actagcgatg     1620

actatcggga gataaatagt ggtcaagata ttgatactta ctttatccct ggtgggaatc     1680

gtgctttcac acctggccgg ataaactact atttcaagtt cagtgggccc agcgtcagcg     1740

ttgatacagc ttgttcttca agtcttgcgg ctattcatat ggcttgcaat tcgatctgga     1800

gaaatgattg cgatgctgct attgctggag gtgtcaatat attgacaaac cctgataacc     1860

atgccggtct tgaccgtggc catttcctgt ccagaaccgg gaattgcaac acatttgacg     1920

atggtgctga tggctactgt agagcagatg gagtgggtac aatcattctc aagcggctgg     1980

aagacgctca ggcggacaac gatccaatcc tcggtgtgat caatggagcc tataccaatc     2040

attcggcaga agcagtctcg attacccgcc ctcatgttgg cgcacaagcg tttatcttta     2100

ataagctatt gaacgatgcc aatatcgacc ctaaggacgt cagctacgtt gaaatgcatg     2160

gaactggtac tcaagctggg gatgcggtgg aaatgcaatc ggtcttggat acgtttgctc     2220

ccgactaccg ccgtggacca ggacagtctc tccatcttgg ttccgccaaa gcaaatgttg     2280

ggcatggaga gtcagcatct ggtgtaactg cacttgtgaa agtgctgcta atgatgaaga     2340

agaataccat accccctcat tgtggtataa agactaagat caaccacaac ttccccacgg     2400

atctcgcgca acgaaatgtc cacattgcct ttcaacctac cccttggaac agaccggctt     2460

ccggaaagcg gcagtgcttc attaacaact tttcggcggc tggtggaaat accgctcttt     2520

tgatggaaga cgctccaatc gctgaggtta aggggcagga cactcgacct gttcacgttg     2580

tgtctgtatc ggcacgatcc cagagtgcgc tcaaaaacaa catcaactct ctcgtaaaat     2640

acatcgacga acaaggaagg tcattcaatg tgaacgaggc agactttatc ccaagcttgg     2700

catacaccac cacagcacgg cgtatccatc acccattccg tgtcacagct atcgggtcta     2760

gtttgcagga gctgcgtgac tcacttaaca acagctctcg tctggaaagc tttacccctg     2820

tccctgcgac ggcccctggc gtagggttcg tgttcgctgg ccaaggagct cagcacaccg     2880

gaatgggaag gcaactatac gaaaaatgct ctcaattccg ggcaacaatg cagcacttcg     2940

attgcattag tcaaaaccaa gggtttcctt cgatccttcc cttggttgac ggaagcgtgc     3000

ccgtggagga gctgggccct atcgtgacac agctcggcac cacatgtctt cagatggctt     3060

tggtcaacta ttggggttca ctaggtataa aacctgcgtt cgttcttggg catagtctcg     3120

gggagtttgc tgctttgaat accgcaggag tattatcgac ttccgatacc atctaccttt     3180

gtggccgtcg ggctaccctc cttacagaat actgccaggt tgggacacac gccatgctgg     3240

ctgtcaaggc ttcctacccc caggtcaagc agttactgaa agaaggtgtg gatgaagttg     3300

cctgtgtcaa ctcacccagt gagacagtcg tcagtggcct caccgctgat attgatgact     3360

tggctcaaag gtgttccact gaaggttgga agtccactaa actaagggta ccgttcgctt     3420

tccattctgc ccaagttact ccaattcttg aacggtttca agaagaggcc cagggtgtca     3480

cgttccgtaa gccgtcgtta ccgtttgttt cctcactcct tggggaagtc atcaccgaat     3540

ctaattacga tgtcctggga gctcaatata tggtgaagca gtgccggaag tcggtgaact     3600

tccttggtgc tcttgaggcc accagatatg cgaaattgat gactgataag actgtctggc     3660

tggaagttgg tgcccatacc atttgctctg gtatgatcaa agcaacattc ggtccccagg     3720

ttaccactgt ggcatctctt cgccgagagg agaatgcatg gaaggtcctc tccaatagtc     3780

tatcggccct tcatttggct ggcattgata ttaattggaa agaatatcat caagacttca     3840

gctccagcca ccaggtgctc ccacttcctt cttacaagtg ggatctcaag aactactgga     3900

taccctacac taacaatttc tgccttacga agggtgctcc ccaaactgca attcaagctg     3960

caccacaaac tacattcctg accactgctg cgcaaaaggt tgttgagagt cgcgacgacg     4020

gtacaacagc gactgtcgtg gtgcaaaatg acatcgctga tcctgagttg aaccgtgtta     4080

tccaaggtca caaggtcaat ggagccgcac tttgcccatc ggtaagtatt gcatgcattg     4140

ccagactatc ttgtgttata attcggctac ttacgtattg cctagtcact ctacgcagat     4200

attgcccaga cacttggaga gtatcttatt gagaaataca aacccgagtt caaagatctt     4260

ggtctcgatg tgtgtgacat ggtcgtaccg aagccactca tcgcgaaggg aggagagcag     4320

ctctttagag tctctgctat tgctaattgg gctgagaaga aggcttcagt tcaagtatac     4380

gccgttaatg ctgacggcaa aaagaccgtg gatcatgcgt attgtacggt gaagttcttt     4440

gataccaatg cctccgagct cgagtggaag agaatctcgt acctggtcaa gagaagcatc     4500

gacagtcttc accagaatgc ggagacaggg gaggctcacc gtatccagcg aggaatggtc     4560

tataaacttt tcagcgcgtt ggtcgattat gatgaaaatt tcaagtcgat tcgcgaggtt     4620

atcctggaca gcgacaataa tgaggccacc gctcgtgtca aattccaagc accgccagga     4680

aatttccacc gaaacccatt ctggattgac agtttcggtc acttgtccgg attcattatg     4740

aatgcgagcg acgcgaccga ctctaagaac caagtatttg ttaaccatgg atgggattcg     4800

atgcgttgcc tgaagaagtt ctcgcctgat gtcacttatc gcacttatgt gaggatgcag     4860

ccatggcaaa acaacatttg ggctggagat gtttatatct ttgagggcga cgatattatt     4920

gctgtcttcg gaggtgtgaa ggtgggtacc tcactactga ttttggttcc tgcttactga     4980

catgataatt agttccaagc actggcacgc aagatacttg acactgttct tccccctgtt     5040

ggcggttcaa aggcaccaat tacagcgaaa tcaccacctc cagctcgcac tcagaaggcc     5100

aacaccggcg ccaagacccg tcctaaagca cctgttcctt ccaagtcgtt caccaaatct     5160

tctgggccga gtgttgtcgt acgcgcactc agcattctgg cctcagaagt tggcctggca     5220

gagtctgaaa tctcagacga catggtgttt gcggactacg gtgtagactc actcctctcc     5280

cttacagtta ctggcaggta tcgtgaagag ttgaacctcg atttggactc ctctgtgttt     5340

accgatcatc caactgtcaa cgacttcaag cggctcatcg cccaagtgag tccttcagag     5400

agccatgatg gttcctccag tgaacaagag tcgaatttct ctttcaacgg tggcgagtcc     5460

tcaagcgcaa gcacacctga cataacgtca ccgccgaatg agaaggtagc tcaagtcgag     5520

caaaacggca ccatgaagga aatccgtaac atcatggcgg aggagatcgg tgtacccgca     5580

gaagagatcg accctgacga gaacttggga gagatgggta tggactcgct tctctccctt     5640

actgttcttg gaagaatacg ggagactttg gacatggacc tgccaggaga gttcttcatc     5700

gaaaaccaga ccctcaatga tatagaggtg gctttggacc taaaacccaa gactacctct     5760

gctccaattc ctatgccaga gccagtgaaa ttccctgaag ctatccacga cctccagcca     5820

aagcttgctc aacatcccaa ggccacatcc atcctgttac aaggaaaccc caggacagca     5880

acaaagacgt tattcttgtt tcctgacggc tctggctcag ctacatctta cgctaccatc     5940

cccggactct ctcctgacgt ctgcgtttac gggttgaatt gcccatatat gaagacacct     6000

gagaagctca aatgcagcct agatgaactc actgcgccct atgtagcaga gattcgtcgt     6060

cggcaaccca agggtcctta cagcttcggt ggctggtcag caggagggat ctgtgcatat     6120

gatgcggcac gccatctaat gtttgaggaa ggtgaacaag tcgaccgctt gcttctcctt     6180

gataccccct tccccatcgg cctcgagaag ctgccgcaga gattgtacgg cttcttcaac     6240

tctatcggtc tcttcggtga aggtaaaacg gcaccaccct cctggctcct accccacttc     6300

ctagccttta tcgacgctct cgacgcatac aaggccgcgc cccttccatt caaagacgag     6360

aaatgggcca agaaactgcc caagacttat atcatctggg ccaaggacgg tgtttgcggt     6420

aagccgggag atccccggcc tgatcccccg acagacggtt ccaaggatcc caaggagatg     6480

gtctggcttc ttaatgaccg gaccgatctg ggacctaaca agtgggatac attggttgga     6540

cctgagaata ttggtggaat cacagtaatg gaagatgcta atcattttac gatgacgaag     6600

ggcgaaaaag cgaaagagtt gtctacattt atggctaacg ccatggctta a              6651


<210>  135
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAT3532 protospacer

<400>  135
acgctttgcg ccaagagatc                                                   20


<210>  136
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAT3533 protospacer

<400>  136
cgccaagaga tcgcgaggct                                                   20


<210>  137
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAT3534 protospacer

<400>  137
aagcactgcg acacaaagct                                                   20


<210>  138
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAT3535 protospacer

<400>  138
cattctgccc aagttactcc                                                   20


<210>  139
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAT3536 protospacer

<400>  139
cggaagccgg tctgttccaa                                                   20


<210>  140
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAT3537 protospacer

<400>  140
tacctagtga accccaatag                                                   20


<210>  141
<211>  62
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT3941

<400>  141
aatttctact cttgtagata cgctttgcgc caagagatct ttttttttga gcatttatca       60

gc                                                                      62


<210>  142
<211>  62
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT3942

<400>  142
aatttctact cttgtagatc gccaagagat cgcgaggctt ttttttttga gcatttatca       60

gc                                                                      62


<210>  143
<211>  62
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT3943

<400>  143
aatttctact cttgtagata agcactgcga cacaaagctt ttttttttga gcatttatca       60

gc                                                                      62


<210>  144
<211>  62
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT3944

<400>  144
aatttctact cttgtagatc attctgccca agttactcct ttttttttga gcatttatca       60

gc                                                                      62


<210>  145
<211>  62
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT3945

<400>  145
aatttctact cttgtagatc ggaagccggt ctgttccaat ttttttttga gcatttatca       60

gc                                                                      62


<210>  146
<211>  62
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT3946

<400>  146
aatttctact cttgtagatt acctagtgaa ccccaatagt ttttttttga gcatttatca       60

gc                                                                      62


<210>  147
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT3912

<400>  147
tccaagttct ttgcatgc                                                     18


<210>  148
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT3613

<400>  148
tatctcaggt taggctcg                                                     18


<210>  149
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oJaL188

<400>  149
ccatggtcct taccatgc                                                     18


<210>  150
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT3616

<400>  150
tatttatctc ccgatagtca tc                                                22


<210>  151
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT919

<400>  151
ctggctgtca aggcttcc                                                     18


<210>  152
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT1040

<400>  152
tttgtggtgc agcttgaat                                                    19


<210>  153
<211>  17
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT967

<400>  153
gcgaacacga accctac                                                      17


<210>  154
<211>  17
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer oAT3618

<400>  154
tcaaagcagc aaactcc                                                      17


<210>  155
<211>  1232
<212>  PRT
<213>  Sulfuricurvum sp. PC08-66

<400>  155

Met Leu His Ala Phe Thr Asn Gln Tyr Gln Leu Ser Lys Thr Leu Arg 
1               5                   10                  15      


Phe Gly Ala Thr Leu Lys Glu Asp Glu Lys Lys Cys Lys Ser His Glu 
            20                  25                  30          


Glu Leu Lys Gly Phe Val Asp Ile Ser Tyr Glu Asn Met Lys Ser Ser 
        35                  40                  45              


Ala Thr Ile Ala Glu Ser Leu Asn Glu Asn Glu Leu Val Lys Lys Cys 
    50                  55                  60                  


Glu Arg Cys Tyr Ser Glu Ile Val Lys Phe His Asn Ala Trp Glu Lys 
65                  70                  75                  80  


Ile Tyr Tyr Arg Thr Asp Gln Ile Ala Val Tyr Lys Asp Phe Tyr Arg 
                85                  90                  95      


Gln Leu Ser Arg Lys Ala Arg Phe Asp Ala Gly Lys Gln Asn Ser Gln 
            100                 105                 110         


Leu Ile Thr Leu Ala Ser Leu Cys Gly Met Tyr Gln Gly Ala Lys Leu 
        115                 120                 125             


Ser Arg Tyr Ile Thr Asn Tyr Trp Lys Asp Asn Ile Thr Arg Gln Lys 
    130                 135                 140                 


Ser Phe Leu Lys Asp Phe Ser Gln Gln Leu His Gln Tyr Thr Arg Ala 
145                 150                 155                 160 


Leu Glu Lys Ser Asp Lys Ala His Thr Lys Pro Asn Leu Ile Asn Phe 
                165                 170                 175     


Asn Lys Thr Phe Met Val Leu Ala Asn Leu Val Asn Glu Ile Val Ile 
            180                 185                 190         


Pro Leu Ser Asn Gly Ala Ile Ser Phe Pro Asn Ile Ser Lys Leu Glu 
        195                 200                 205             


Asp Gly Glu Glu Ser His Leu Ile Glu Phe Ala Leu Asn Asp Tyr Ser 
    210                 215                 220                 


Gln Leu Ser Glu Leu Ile Gly Glu Leu Lys Asp Ala Ile Ala Thr Asn 
225                 230                 235                 240 


Gly Gly Tyr Thr Pro Phe Ala Lys Val Thr Leu Asn His Tyr Thr Ala 
                245                 250                 255     


Glu Gln Lys Pro His Val Phe Lys Asn Asp Ile Asp Ala Lys Ile Arg 
            260                 265                 270         


Glu Leu Lys Leu Ile Gly Leu Val Glu Thr Leu Lys Gly Lys Ser Ser 
        275                 280                 285             


Glu Gln Ile Glu Glu Tyr Phe Ser Asn Leu Asp Lys Phe Ser Thr Tyr 
    290                 295                 300                 


Asn Asp Arg Asn Gln Ser Val Ile Val Arg Thr Gln Cys Phe Lys Tyr 
305                 310                 315                 320 


Lys Pro Ile Pro Phe Leu Val Lys His Gln Leu Ala Lys Tyr Ile Ser 
                325                 330                 335     


Glu Pro Asn Gly Trp Asp Glu Asp Ala Val Ala Lys Val Leu Asp Ala 
            340                 345                 350         


Val Gly Ala Ile Arg Ser Pro Ala His Asp Tyr Ala Asn Asn Gln Glu 
        355                 360                 365             


Gly Phe Asp Leu Asn His Tyr Pro Ile Lys Val Ala Phe Asp Tyr Ala 
    370                 375                 380                 


Trp Glu Gln Leu Ala Asn Ser Leu Tyr Thr Thr Val Thr Phe Pro Gln 
385                 390                 395                 400 


Glu Met Cys Glu Lys Tyr Leu Asn Ser Ile Tyr Gly Cys Glu Val Ser 
                405                 410                 415     


Lys Glu Pro Val Phe Lys Phe Tyr Ala Asp Leu Leu Tyr Ile Arg Lys 
            420                 425                 430         


Asn Leu Ala Val Leu Glu His Lys Asn Asn Leu Pro Ser Asn Gln Glu 
        435                 440                 445             


Glu Phe Ile Cys Lys Ile Asn Asn Thr Phe Glu Asn Ile Val Leu Pro 
    450                 455                 460                 


Tyr Lys Ile Ser Gln Phe Glu Thr Tyr Lys Lys Asp Ile Leu Ala Trp 
465                 470                 475                 480 


Ile Asn Asp Gly His Asp His Lys Lys Tyr Thr Asp Ala Lys Gln Gln 
                485                 490                 495     


Leu Gly Phe Ile Arg Gly Gly Leu Lys Gly Arg Ile Lys Ala Glu Glu 
            500                 505                 510         


Val Ser Gln Lys Asp Lys Tyr Gly Lys Ile Lys Ser Tyr Tyr Glu Asn 
        515                 520                 525             


Pro Tyr Thr Lys Leu Thr Asn Glu Phe Lys Gln Ile Ser Ser Thr Tyr 
    530                 535                 540                 


Gly Lys Thr Phe Ala Glu Leu Arg Asp Lys Phe Lys Glu Lys Asn Glu 
545                 550                 555                 560 


Ile Thr Lys Ile Thr His Phe Gly Ile Ile Ile Glu Asp Lys Asn Arg 
                565                 570                 575     


Asp Arg Tyr Leu Leu Ala Ser Glu Leu Lys His Glu Gln Ile Asn His 
            580                 585                 590         


Val Ser Thr Ile Leu Asn Lys Leu Asp Lys Ser Ser Glu Phe Ile Thr 
        595                 600                 605             


Tyr Gln Val Lys Ser Leu Thr Ser Lys Thr Leu Ile Lys Leu Ile Lys 
    610                 615                 620                 


Asn His Thr Thr Lys Lys Gly Ala Ile Ser Pro Tyr Ala Asp Phe His 
625                 630                 635                 640 


Thr Ser Lys Thr Gly Phe Asn Lys Asn Glu Ile Glu Lys Asn Trp Asp 
                645                 650                 655     


Asn Tyr Lys Arg Glu Gln Val Leu Val Glu Tyr Val Lys Asp Cys Leu 
            660                 665                 670         


Thr Asp Ser Thr Met Ala Lys Asn Gln Asn Trp Ala Glu Phe Gly Trp 
        675                 680                 685             


Asn Phe Glu Lys Cys Asn Ser Tyr Glu Asp Ile Glu His Glu Ile Asp 
    690                 695                 700                 


Gln Lys Ser Tyr Leu Leu Gln Ser Asp Thr Ile Ser Lys Gln Ser Ile 
705                 710                 715                 720 


Ala Ser Leu Val Glu Gly Gly Cys Leu Leu Leu Pro Ile Ile Asn Gln 
                725                 730                 735     


Asp Ile Thr Ser Lys Glu Arg Lys Asp Lys Asn Gln Phe Ser Lys Asp 
            740                 745                 750         


Trp Asn His Ile Phe Glu Gly Ser Lys Glu Phe Arg Leu His Pro Glu 
        755                 760                 765             


Phe Ala Val Ser Tyr Arg Thr Pro Ile Glu Gly Tyr Pro Val Gln Lys 
    770                 775                 780                 


Arg Tyr Gly Arg Leu Gln Phe Val Cys Ala Phe Asn Ala His Ile Val 
785                 790                 795                 800 


Pro Gln Asn Gly Glu Phe Ile Asn Leu Lys Lys Gln Ile Glu Asn Phe 
                805                 810                 815     


Asn Asp Glu Asp Val Gln Lys Arg Asn Val Thr Glu Phe Asn Lys Lys 
            820                 825                 830         


Val Asn His Ala Leu Ser Asp Lys Glu Tyr Val Val Ile Gly Ile Asp 
        835                 840                 845             


Arg Gly Leu Lys Gln Leu Ala Thr Leu Cys Val Leu Asp Lys Arg Gly 
    850                 855                 860                 


Lys Ile Leu Gly Asp Phe Glu Ile Tyr Lys Lys Glu Phe Val Arg Ala 
865                 870                 875                 880 


Glu Lys Arg Ser Glu Ser His Trp Glu His Thr Gln Ala Glu Thr Arg 
                885                 890                 895     


His Ile Leu Asp Leu Ser Asn Leu Arg Val Glu Thr Thr Ile Glu Gly 
            900                 905                 910         


Lys Lys Val Leu Val Asp Gln Ser Leu Thr Leu Val Lys Lys Asn Arg 
        915                 920                 925             


Asp Thr Pro Asp Glu Glu Ala Thr Glu Glu Asn Lys Gln Lys Ile Lys 
    930                 935                 940                 


Leu Lys Gln Leu Ser Tyr Ile Arg Lys Leu Gln His Lys Met Gln Thr 
945                 950                 955                 960 


Asn Glu Gln Asp Val Leu Asp Leu Ile Asn Asn Glu Pro Ser Asp Glu 
                965                 970                 975     


Glu Phe Lys Lys Arg Ile Glu Gly Leu Ile Ser Ser Phe Gly Glu Gly 
            980                 985                 990         


Gln Lys Tyr Ala Asp Leu Pro Ile  Asn Thr Met Arg Glu  Met Ile Ser 
        995                 1000                 1005             


Asp Leu  Gln Gly Val Ile Ala  Arg Gly Asn Asn Gln  Thr Glu Lys 
    1010                 1015                 1020             


Asn Lys  Ile Ile Glu Leu Asp  Ala Ala Asp Asn Leu  Lys Gln Gly 
    1025                 1030                 1035             


Ile Val  Ala Asn Met Ile Gly  Ile Val Asn Tyr Ile  Phe Ala Lys 
    1040                 1045                 1050             


Tyr Ser  Tyr Lys Ala Tyr Ile  Ser Leu Glu Asp Leu  Ser Arg Ala 
    1055                 1060                 1065             


Tyr Gly  Gly Ala Lys Ser Gly  Tyr Asp Gly Arg Tyr  Leu Pro Ser 
    1070                 1075                 1080             


Thr Ser  Gln Asp Glu Asp Val  Asp Phe Lys Glu Gln  Gln Asn Gln 
    1085                 1090                 1095             


Met Leu  Ala Gly Leu Gly Thr  Tyr Gln Phe Phe Glu  Met Gln Leu 
    1100                 1105                 1110             


Leu Lys  Lys Leu Gln Lys Ile  Gln Ser Asp Asn Thr  Val Leu Arg 
    1115                 1120                 1125             


Phe Val  Pro Ala Phe Arg Ser  Ala Asp Asn Tyr Arg  Asn Ile Leu 
    1130                 1135                 1140             


Arg Leu  Glu Glu Thr Lys Tyr  Lys Ser Lys Pro Phe  Gly Val Val 
    1145                 1150                 1155             


His Phe  Ile Asp Pro Lys Phe  Thr Ser Lys Lys Cys  Pro Val Cys 
    1160                 1165                 1170             


Ser Lys  Thr Asn Val Tyr Arg  Asp Lys Asp Asp Ile  Leu Val Cys 
    1175                 1180                 1185             


Lys Glu  Cys Gly Phe Arg Ser  Asp Ser Gln Leu Lys  Glu Arg Glu 
    1190                 1195                 1200             


Asn Asn  Ile His Tyr Ile His  Asn Gly Asp Asp Asn  Gly Ala Tyr 
    1205                 1210                 1215             


His Ile  Ala Leu Lys Ser Val  Glu Asn Leu Ile Gln  Met Lys 
    1220                 1225                 1230         


<210>  156
<211>  3699
<212>  DNA
<213>  Sulfuricurvum sp. PC08-66


<220>
<221>  CDS
<222>  (1)..(3699)

<400>  156
atg ctt cac gct ttc act aat cag tat caa ctt tct aaa aca ttg aga         48
Met Leu His Ala Phe Thr Asn Gln Tyr Gln Leu Ser Lys Thr Leu Arg           
1               5                   10                  15                

ttc gga gca act ctg aaa gaa gac gag aaa aaa tgc aag agt cat gag         96
Phe Gly Ala Thr Leu Lys Glu Asp Glu Lys Lys Cys Lys Ser His Glu           
            20                  25                  30                    

gaa ctt aaa gga ttt gta gat att tca tat gaa aac atg aaa tct tcc        144
Glu Leu Lys Gly Phe Val Asp Ile Ser Tyr Glu Asn Met Lys Ser Ser           
        35                  40                  45                        

gct aca atc gct gaa agt ttg aac gaa aat gaa ctt gtg aaa aaa tgc        192
Ala Thr Ile Ala Glu Ser Leu Asn Glu Asn Glu Leu Val Lys Lys Cys           
    50                  55                  60                            

gaa agg tgt tat tct gag atc gtg aaa ttt cat aac gct tgg gag aaa        240
Glu Arg Cys Tyr Ser Glu Ile Val Lys Phe His Asn Ala Trp Glu Lys           
65                  70                  75                  80            

atc tac tac agg aca gat caa att gct gtc tat aaa gat ttc tat agg        288
Ile Tyr Tyr Arg Thr Asp Gln Ile Ala Val Tyr Lys Asp Phe Tyr Arg           
                85                  90                  95                

caa ctg tca aga aaa gct aga ttt gat gcc ggt aag caa aat tca caa        336
Gln Leu Ser Arg Lys Ala Arg Phe Asp Ala Gly Lys Gln Asn Ser Gln           
            100                 105                 110                   

ctg ata acc tta gct tcc ctt tgc ggt atg tac caa gga gct aag tta        384
Leu Ile Thr Leu Ala Ser Leu Cys Gly Met Tyr Gln Gly Ala Lys Leu           
        115                 120                 125                       

agt aga tac ata acc aat tat tgg aaa gat aac att act agg cag aaa        432
Ser Arg Tyr Ile Thr Asn Tyr Trp Lys Asp Asn Ile Thr Arg Gln Lys           
    130                 135                 140                           

tca ttt ctt aaa gat ttt tcc caa cag tta cat caa tac act cgt gca        480
Ser Phe Leu Lys Asp Phe Ser Gln Gln Leu His Gln Tyr Thr Arg Ala           
145                 150                 155                 160           

ctg gaa aag tct gat aag gct cat aca aaa cct aat ctg atc aac ttc        528
Leu Glu Lys Ser Asp Lys Ala His Thr Lys Pro Asn Leu Ile Asn Phe           
                165                 170                 175               

aat aag acc ttt atg gtg ttg gcc aat ctc gtg aac gaa ata gtt att        576
Asn Lys Thr Phe Met Val Leu Ala Asn Leu Val Asn Glu Ile Val Ile           
            180                 185                 190                   

cct ctt tct aat gga gcc atc tct ttt cca aac atc tct aag ctg gag        624
Pro Leu Ser Asn Gly Ala Ile Ser Phe Pro Asn Ile Ser Lys Leu Glu           
        195                 200                 205                       

gac ggg gaa gag tcc cat ctt ata gaa ttt gca ctc aat gac tat tct        672
Asp Gly Glu Glu Ser His Leu Ile Glu Phe Ala Leu Asn Asp Tyr Ser           
    210                 215                 220                           

cag ttg tct gaa tta att ggt gaa ttg aag gat gca ata gcc act aac        720
Gln Leu Ser Glu Leu Ile Gly Glu Leu Lys Asp Ala Ile Ala Thr Asn           
225                 230                 235                 240           

ggt ggt tac aca cca ttt gca aag gtg acc ctt aat cat tat aca gca        768
Gly Gly Tyr Thr Pro Phe Ala Lys Val Thr Leu Asn His Tyr Thr Ala           
                245                 250                 255               

gaa cag aaa cca cac gta ttt aaa aat gat att gat gct aaa ata cgt        816
Glu Gln Lys Pro His Val Phe Lys Asn Asp Ile Asp Ala Lys Ile Arg           
            260                 265                 270                   

gag ctt aag ttg att ggg ttg gtt gag acc ttg aaa gga aaa tcc agt        864
Glu Leu Lys Leu Ile Gly Leu Val Glu Thr Leu Lys Gly Lys Ser Ser           
        275                 280                 285                       

gaa cag att gag gaa tac ttc tca aat tta gac aag ttt agc aca tac        912
Glu Gln Ile Glu Glu Tyr Phe Ser Asn Leu Asp Lys Phe Ser Thr Tyr           
    290                 295                 300                           

aac gat agg aac caa tca gta atc gta aga act caa tgc ttt aag tat        960
Asn Asp Arg Asn Gln Ser Val Ile Val Arg Thr Gln Cys Phe Lys Tyr           
305                 310                 315                 320           

aaa ccc att cct ttt ttg gtt aag cat caa ctt gca aag tac att tca       1008
Lys Pro Ile Pro Phe Leu Val Lys His Gln Leu Ala Lys Tyr Ile Ser           
                325                 330                 335               

gaa cca aac ggt tgg gat gaa gac gcc gta gct aag gtt ctg gat gct       1056
Glu Pro Asn Gly Trp Asp Glu Asp Ala Val Ala Lys Val Leu Asp Ala           
            340                 345                 350                   

gtt gga gct att cgt tct cca gca cat gat tac gct aat aac caa gag       1104
Val Gly Ala Ile Arg Ser Pro Ala His Asp Tyr Ala Asn Asn Gln Glu           
        355                 360                 365                       

ggg ttt gat tta aac cat tat cct att aaa gtc gct ttc gat tat gct       1152
Gly Phe Asp Leu Asn His Tyr Pro Ile Lys Val Ala Phe Asp Tyr Ala           
    370                 375                 380                           

tgg gag cag ttg gct aat tct ttg tat acc acc gtg act ttt ccc caa       1200
Trp Glu Gln Leu Ala Asn Ser Leu Tyr Thr Thr Val Thr Phe Pro Gln           
385                 390                 395                 400           

gaa atg tgc gaa aaa tat tta aat agt atc tac ggt tgt gaa gtc tcc       1248
Glu Met Cys Glu Lys Tyr Leu Asn Ser Ile Tyr Gly Cys Glu Val Ser           
                405                 410                 415               

aag gag cct gta ttt aaa ttc tat gct gat ctg ctt tat atc agg aag       1296
Lys Glu Pro Val Phe Lys Phe Tyr Ala Asp Leu Leu Tyr Ile Arg Lys           
            420                 425                 430                   

aat ctg gct gta ctc gaa cat aag aac aat ctg ccc agt aat cag gaa       1344
Asn Leu Ala Val Leu Glu His Lys Asn Asn Leu Pro Ser Asn Gln Glu           
        435                 440                 445                       

gag ttc ata tgt aag atc aac aac aca ttt gag aac atc gtg tta cca       1392
Glu Phe Ile Cys Lys Ile Asn Asn Thr Phe Glu Asn Ile Val Leu Pro           
    450                 455                 460                           

tat aag att tct caa ttt gaa act tat aag aag gat ata ctt gcc tgg       1440
Tyr Lys Ile Ser Gln Phe Glu Thr Tyr Lys Lys Asp Ile Leu Ala Trp           
465                 470                 475                 480           

ata aac gat ggg cat gac cat aaa aaa tat act gat gca aaa cag caa       1488
Ile Asn Asp Gly His Asp His Lys Lys Tyr Thr Asp Ala Lys Gln Gln           
                485                 490                 495               

tta ggt ttt att agg ggt gga ctc aag ggt agg att aag gca gaa gaa       1536
Leu Gly Phe Ile Arg Gly Gly Leu Lys Gly Arg Ile Lys Ala Glu Glu           
            500                 505                 510                   

gtg tcc cag aaa gac aaa tat gga aaa atc aag tct tat tat gag aac       1584
Val Ser Gln Lys Asp Lys Tyr Gly Lys Ile Lys Ser Tyr Tyr Glu Asn           
        515                 520                 525                       

cct tac act aaa ctc acc aac gaa ttt aag caa ata tcc tct act tat       1632
Pro Tyr Thr Lys Leu Thr Asn Glu Phe Lys Gln Ile Ser Ser Thr Tyr           
    530                 535                 540                           

ggg aag acc ttc gct gag tta aga gac aaa ttt aaa gag aag aat gag       1680
Gly Lys Thr Phe Ala Glu Leu Arg Asp Lys Phe Lys Glu Lys Asn Glu           
545                 550                 555                 560           

atc acc aaa att acc cac ttc ggt att ata ata gaa gat aaa aac aga       1728
Ile Thr Lys Ile Thr His Phe Gly Ile Ile Ile Glu Asp Lys Asn Arg           
                565                 570                 575               

gac aga tat tta ctt gca agc gag ttg aag cac gaa caa atc aac cac       1776
Asp Arg Tyr Leu Leu Ala Ser Glu Leu Lys His Glu Gln Ile Asn His           
            580                 585                 590                   

gtc agt act atc ctt aac aag tta gat aaa tca tct gaa ttt att acc       1824
Val Ser Thr Ile Leu Asn Lys Leu Asp Lys Ser Ser Glu Phe Ile Thr           
        595                 600                 605                       

tat caa gtt aag agc ctt aca agc aaa aca ttg att aaa ttg att aaa       1872
Tyr Gln Val Lys Ser Leu Thr Ser Lys Thr Leu Ile Lys Leu Ile Lys           
    610                 615                 620                           

aat cac acc aca aag aag gga gcc att tca cca tat gct gat ttt cac       1920
Asn His Thr Thr Lys Lys Gly Ala Ile Ser Pro Tyr Ala Asp Phe His           
625                 630                 635                 640           

acc agt aaa acc gga ttc aac aag aat gaa atc gaa aag aat tgg gat       1968
Thr Ser Lys Thr Gly Phe Asn Lys Asn Glu Ile Glu Lys Asn Trp Asp           
                645                 650                 655               

aat tat aag aga gaa cag gta ttg gtt gag tat gtc aaa gat tgt ctg       2016
Asn Tyr Lys Arg Glu Gln Val Leu Val Glu Tyr Val Lys Asp Cys Leu           
            660                 665                 670                   

acc gat agt act atg gca aaa aac cag aac tgg gca gag ttc ggt tgg       2064
Thr Asp Ser Thr Met Ala Lys Asn Gln Asn Trp Ala Glu Phe Gly Trp           
        675                 680                 685                       

aat ttt gag aaa tgc aac tcc tat gag gat atc gaa cac gaa atc gac       2112
Asn Phe Glu Lys Cys Asn Ser Tyr Glu Asp Ile Glu His Glu Ile Asp           
    690                 695                 700                           

caa aaa tca tat ttg ctg cag agc gat aca att agc aag cag agt att       2160
Gln Lys Ser Tyr Leu Leu Gln Ser Asp Thr Ile Ser Lys Gln Ser Ile           
705                 710                 715                 720           

gct tcc ctc gtg gag ggg ggc tgt ctt ctc ctt cct ata att aac caa       2208
Ala Ser Leu Val Glu Gly Gly Cys Leu Leu Leu Pro Ile Ile Asn Gln           
                725                 730                 735               

gat ata aca agc aag gag agg aag gat aaa aat caa ttt tca aaa gat       2256
Asp Ile Thr Ser Lys Glu Arg Lys Asp Lys Asn Gln Phe Ser Lys Asp           
            740                 745                 750                   

tgg aac cat att ttc gaa ggt tcc aaa gaa ttc cgt ctc cac cca gag       2304
Trp Asn His Ile Phe Glu Gly Ser Lys Glu Phe Arg Leu His Pro Glu           
        755                 760                 765                       

ttc gca gtt agc tac agg aca cct att gaa ggg tat ccg gta cag aag       2352
Phe Ala Val Ser Tyr Arg Thr Pro Ile Glu Gly Tyr Pro Val Gln Lys           
    770                 775                 780                           

agg tac ggg cgt ctg cag ttc gtt tgc gct ttt aat gca cac atc gtt       2400
Arg Tyr Gly Arg Leu Gln Phe Val Cys Ala Phe Asn Ala His Ile Val           
785                 790                 795                 800           

cca caa aat ggt gag ttc atc aat ttg aaa aag cag atc gag aac ttt       2448
Pro Gln Asn Gly Glu Phe Ile Asn Leu Lys Lys Gln Ile Glu Asn Phe           
                805                 810                 815               

aac gat gaa gac gtt cag aaa cgt aat gtg act gaa ttc aat aaa aag       2496
Asn Asp Glu Asp Val Gln Lys Arg Asn Val Thr Glu Phe Asn Lys Lys           
            820                 825                 830                   

gtg aat cat gca ctt tcc gac aaa gaa tac gtc gtt att ggt att gat       2544
Val Asn His Ala Leu Ser Asp Lys Glu Tyr Val Val Ile Gly Ile Asp           
        835                 840                 845                       

aga ggc ctc aaa cag ctt gcc aca ctc tgt gtt tta gac aaa aga ggt       2592
Arg Gly Leu Lys Gln Leu Ala Thr Leu Cys Val Leu Asp Lys Arg Gly           
    850                 855                 860                           

aaa att ctt gga gat ttt gag atc tac aaa aag gaa ttt gtg cgt gct       2640
Lys Ile Leu Gly Asp Phe Glu Ile Tyr Lys Lys Glu Phe Val Arg Ala           
865                 870                 875                 880           

gaa aaa aga agc gag agt cat tgg gaa cac aca caa gca gaa acc aga       2688
Glu Lys Arg Ser Glu Ser His Trp Glu His Thr Gln Ala Glu Thr Arg           
                885                 890                 895               

cat atc ttg gat ctt tcc aat ttg cgt gtg gag aca aca ata gag ggt       2736
His Ile Leu Asp Leu Ser Asn Leu Arg Val Glu Thr Thr Ile Glu Gly           
            900                 905                 910                   

aaa aag gtt ctc gtg gac cag agc ctc aca ctt gtg aaa aag aat cgt       2784
Lys Lys Val Leu Val Asp Gln Ser Leu Thr Leu Val Lys Lys Asn Arg           
        915                 920                 925                       

gat aca cca gat gag gaa gct act gaa gaa aat aaa cag aaa atc aag       2832
Asp Thr Pro Asp Glu Glu Ala Thr Glu Glu Asn Lys Gln Lys Ile Lys           
    930                 935                 940                           

ttg aag cag ctc agc tat att aga aaa ttg cag cat aag atg cag act       2880
Leu Lys Gln Leu Ser Tyr Ile Arg Lys Leu Gln His Lys Met Gln Thr           
945                 950                 955                 960           

aac gaa cag gac gtt tta gat tta att aat aat gaa cca tca gat gaa       2928
Asn Glu Gln Asp Val Leu Asp Leu Ile Asn Asn Glu Pro Ser Asp Glu           
                965                 970                 975               

gaa ttt aag aaa aga atc gag ggg ctt att tcc agt ttt gga gaa gga       2976
Glu Phe Lys Lys Arg Ile Glu Gly Leu Ile Ser Ser Phe Gly Glu Gly           
            980                 985                 990                   

cag aag tac gct gac ctt cca att  aat act atg aga gaa  atg atc tct     3024
Gln Lys Tyr Ala Asp Leu Pro Ile  Asn Thr Met Arg Glu  Met Ile Ser         
        995                 1000                 1005                     

gat ctc  cag gga gtt atc gct  aga gga aac aac caa  aca gag aaa        3069
Asp Leu  Gln Gly Val Ile Ala  Arg Gly Asn Asn Gln  Thr Glu Lys            
    1010                 1015                 1020                        

aat aaa  att att gaa tta gat  gct gca gac aac ctt  aaa caa ggt        3114
Asn Lys  Ile Ile Glu Leu Asp  Ala Ala Asp Asn Leu  Lys Gln Gly            
    1025                 1030                 1035                        

att gta  gct aac atg atc gga  att gtt aat tac atc  ttc gct aag        3159
Ile Val  Ala Asn Met Ile Gly  Ile Val Asn Tyr Ile  Phe Ala Lys            
    1040                 1045                 1050                        

tat tca  tac aag gct tac atc  tct ctt gag gat ttg  tca aga gcc        3204
Tyr Ser  Tyr Lys Ala Tyr Ile  Ser Leu Glu Asp Leu  Ser Arg Ala            
    1055                 1060                 1065                        

tat gga  ggt gca aag tcc ggt  tat gac gga agg tat  ctg cca tca        3249
Tyr Gly  Gly Ala Lys Ser Gly  Tyr Asp Gly Arg Tyr  Leu Pro Ser            
    1070                 1075                 1080                        

act tca  caa gac gag gat gta  gat ttc aag gaa cag  cag aat cag        3294
Thr Ser  Gln Asp Glu Asp Val  Asp Phe Lys Glu Gln  Gln Asn Gln            
    1085                 1090                 1095                        

atg ctt  gca ggt ttg ggt acc  tac caa ttc ttc gag  atg cag ctt        3339
Met Leu  Ala Gly Leu Gly Thr  Tyr Gln Phe Phe Glu  Met Gln Leu            
    1100                 1105                 1110                        

ctg aaa  aaa ctt caa aag att  cag agt gat aac acc  gtt ctg aga        3384
Leu Lys  Lys Leu Gln Lys Ile  Gln Ser Asp Asn Thr  Val Leu Arg            
    1115                 1120                 1125                        

ttc gtg  ccc gct ttc aga tct  gca gat aac tat aga  aat att ttg        3429
Phe Val  Pro Ala Phe Arg Ser  Ala Asp Asn Tyr Arg  Asn Ile Leu            
    1130                 1135                 1140                        

aga ctt  gag gaa act aaa tat  aag tct aag ccg ttc  ggc gtt gtt        3474
Arg Leu  Glu Glu Thr Lys Tyr  Lys Ser Lys Pro Phe  Gly Val Val            
    1145                 1150                 1155                        

cat ttc  ata gat cca aag ttt  aca tca aag aaa tgc  ccc gtc tgt        3519
His Phe  Ile Asp Pro Lys Phe  Thr Ser Lys Lys Cys  Pro Val Cys            
    1160                 1165                 1170                        

agc aaa  aca aat gta tac agg  gac aag gat gac atc  ttg gtt tgc        3564
Ser Lys  Thr Asn Val Tyr Arg  Asp Lys Asp Asp Ile  Leu Val Cys            
    1175                 1180                 1185                        

aaa gag  tgc ggt ttt agg agc  gac tcc caa tta aaa  gaa aga gag        3609
Lys Glu  Cys Gly Phe Arg Ser  Asp Ser Gln Leu Lys  Glu Arg Glu            
    1190                 1195                 1200                        

aat aac  att cat tat att cac  aac ggg gac gat aac  ggt gca tac        3654
Asn Asn  Ile His Tyr Ile His  Asn Gly Asp Asp Asn  Gly Ala Tyr            
    1205                 1210                 1215                        

cac atc  gcc ctt aag agc gtt  gag aat ctt att cag  atg aag taa        3699
His Ile  Ala Leu Lys Ser Val  Glu Asn Leu Ile Gln  Met Lys                
    1220                 1225                 1230                        


<210>  157
<211>  1232
<212>  PRT
<213>  Sulfuricurvum sp. PC08-66

<400>  157

Met Leu His Ala Phe Thr Asn Gln Tyr Gln Leu Ser Lys Thr Leu Arg 
1               5                   10                  15      


Phe Gly Ala Thr Leu Lys Glu Asp Glu Lys Lys Cys Lys Ser His Glu 
            20                  25                  30          


Glu Leu Lys Gly Phe Val Asp Ile Ser Tyr Glu Asn Met Lys Ser Ser 
        35                  40                  45              


Ala Thr Ile Ala Glu Ser Leu Asn Glu Asn Glu Leu Val Lys Lys Cys 
    50                  55                  60                  


Glu Arg Cys Tyr Ser Glu Ile Val Lys Phe His Asn Ala Trp Glu Lys 
65                  70                  75                  80  


Ile Tyr Tyr Arg Thr Asp Gln Ile Ala Val Tyr Lys Asp Phe Tyr Arg 
                85                  90                  95      


Gln Leu Ser Arg Lys Ala Arg Phe Asp Ala Gly Lys Gln Asn Ser Gln 
            100                 105                 110         


Leu Ile Thr Leu Ala Ser Leu Cys Gly Met Tyr Gln Gly Ala Lys Leu 
        115                 120                 125             


Ser Arg Tyr Ile Thr Asn Tyr Trp Lys Asp Asn Ile Thr Arg Gln Lys 
    130                 135                 140                 


Ser Phe Leu Lys Asp Phe Ser Gln Gln Leu His Gln Tyr Thr Arg Ala 
145                 150                 155                 160 


Leu Glu Lys Ser Asp Lys Ala His Thr Lys Pro Asn Leu Ile Asn Phe 
                165                 170                 175     


Asn Lys Thr Phe Met Val Leu Ala Asn Leu Val Asn Glu Ile Val Ile 
            180                 185                 190         


Pro Leu Ser Asn Gly Ala Ile Ser Phe Pro Asn Ile Ser Lys Leu Glu 
        195                 200                 205             


Asp Gly Glu Glu Ser His Leu Ile Glu Phe Ala Leu Asn Asp Tyr Ser 
    210                 215                 220                 


Gln Leu Ser Glu Leu Ile Gly Glu Leu Lys Asp Ala Ile Ala Thr Asn 
225                 230                 235                 240 


Gly Gly Tyr Thr Pro Phe Ala Lys Val Thr Leu Asn His Tyr Thr Ala 
                245                 250                 255     


Glu Gln Lys Pro His Val Phe Lys Asn Asp Ile Asp Ala Lys Ile Arg 
            260                 265                 270         


Glu Leu Lys Leu Ile Gly Leu Val Glu Thr Leu Lys Gly Lys Ser Ser 
        275                 280                 285             


Glu Gln Ile Glu Glu Tyr Phe Ser Asn Leu Asp Lys Phe Ser Thr Tyr 
    290                 295                 300                 


Asn Asp Arg Asn Gln Ser Val Ile Val Arg Thr Gln Cys Phe Lys Tyr 
305                 310                 315                 320 


Lys Pro Ile Pro Phe Leu Val Lys His Gln Leu Ala Lys Tyr Ile Ser 
                325                 330                 335     


Glu Pro Asn Gly Trp Asp Glu Asp Ala Val Ala Lys Val Leu Asp Ala 
            340                 345                 350         


Val Gly Ala Ile Arg Ser Pro Ala His Asp Tyr Ala Asn Asn Gln Glu 
        355                 360                 365             


Gly Phe Asp Leu Asn His Tyr Pro Ile Lys Val Ala Phe Asp Tyr Ala 
    370                 375                 380                 


Trp Glu Gln Leu Ala Asn Ser Leu Tyr Thr Thr Val Thr Phe Pro Gln 
385                 390                 395                 400 


Glu Met Cys Glu Lys Tyr Leu Asn Ser Ile Tyr Gly Cys Glu Val Ser 
                405                 410                 415     


Lys Glu Pro Val Phe Lys Phe Tyr Ala Asp Leu Leu Tyr Ile Arg Lys 
            420                 425                 430         


Asn Leu Ala Val Leu Glu His Lys Asn Asn Leu Pro Ser Asn Gln Glu 
        435                 440                 445             


Glu Phe Ile Cys Lys Ile Asn Asn Thr Phe Glu Asn Ile Val Leu Pro 
    450                 455                 460                 


Tyr Lys Ile Ser Gln Phe Glu Thr Tyr Lys Lys Asp Ile Leu Ala Trp 
465                 470                 475                 480 


Ile Asn Asp Gly His Asp His Lys Lys Tyr Thr Asp Ala Lys Gln Gln 
                485                 490                 495     


Leu Gly Phe Ile Arg Gly Gly Leu Lys Gly Arg Ile Lys Ala Glu Glu 
            500                 505                 510         


Val Ser Gln Lys Asp Lys Tyr Gly Lys Ile Lys Ser Tyr Tyr Glu Asn 
        515                 520                 525             


Pro Tyr Thr Lys Leu Thr Asn Glu Phe Lys Gln Ile Ser Ser Thr Tyr 
    530                 535                 540                 


Gly Lys Thr Phe Ala Glu Leu Arg Asp Lys Phe Lys Glu Lys Asn Glu 
545                 550                 555                 560 


Ile Thr Lys Ile Thr His Phe Gly Ile Ile Ile Glu Asp Lys Asn Arg 
                565                 570                 575     


Asp Arg Tyr Leu Leu Ala Ser Glu Leu Lys His Glu Gln Ile Asn His 
            580                 585                 590         


Val Ser Thr Ile Leu Asn Lys Leu Asp Lys Ser Ser Glu Phe Ile Thr 
        595                 600                 605             


Tyr Gln Val Lys Ser Leu Thr Ser Lys Thr Leu Ile Lys Leu Ile Lys 
    610                 615                 620                 


Asn His Thr Thr Lys Lys Gly Ala Ile Ser Pro Tyr Ala Asp Phe His 
625                 630                 635                 640 


Thr Ser Lys Thr Gly Phe Asn Lys Asn Glu Ile Glu Lys Asn Trp Asp 
                645                 650                 655     


Asn Tyr Lys Arg Glu Gln Val Leu Val Glu Tyr Val Lys Asp Cys Leu 
            660                 665                 670         


Thr Asp Ser Thr Met Ala Lys Asn Gln Asn Trp Ala Glu Phe Gly Trp 
        675                 680                 685             


Asn Phe Glu Lys Cys Asn Ser Tyr Glu Asp Ile Glu His Glu Ile Asp 
    690                 695                 700                 


Gln Lys Ser Tyr Leu Leu Gln Ser Asp Thr Ile Ser Lys Gln Ser Ile 
705                 710                 715                 720 


Ala Ser Leu Val Glu Gly Gly Cys Leu Leu Leu Pro Ile Ile Asn Gln 
                725                 730                 735     


Asp Ile Thr Ser Lys Glu Arg Lys Asp Lys Asn Gln Phe Ser Lys Asp 
            740                 745                 750         


Trp Asn His Ile Phe Glu Gly Ser Lys Glu Phe Arg Leu His Pro Glu 
        755                 760                 765             


Phe Ala Val Ser Tyr Arg Thr Pro Ile Glu Gly Tyr Pro Val Gln Lys 
    770                 775                 780                 


Arg Tyr Gly Arg Leu Gln Phe Val Cys Ala Phe Asn Ala His Ile Val 
785                 790                 795                 800 


Pro Gln Asn Gly Glu Phe Ile Asn Leu Lys Lys Gln Ile Glu Asn Phe 
                805                 810                 815     


Asn Asp Glu Asp Val Gln Lys Arg Asn Val Thr Glu Phe Asn Lys Lys 
            820                 825                 830         


Val Asn His Ala Leu Ser Asp Lys Glu Tyr Val Val Ile Gly Ile Asp 
        835                 840                 845             


Arg Gly Leu Lys Gln Leu Ala Thr Leu Cys Val Leu Asp Lys Arg Gly 
    850                 855                 860                 


Lys Ile Leu Gly Asp Phe Glu Ile Tyr Lys Lys Glu Phe Val Arg Ala 
865                 870                 875                 880 


Glu Lys Arg Ser Glu Ser His Trp Glu His Thr Gln Ala Glu Thr Arg 
                885                 890                 895     


His Ile Leu Asp Leu Ser Asn Leu Arg Val Glu Thr Thr Ile Glu Gly 
            900                 905                 910         


Lys Lys Val Leu Val Asp Gln Ser Leu Thr Leu Val Lys Lys Asn Arg 
        915                 920                 925             


Asp Thr Pro Asp Glu Glu Ala Thr Glu Glu Asn Lys Gln Lys Ile Lys 
    930                 935                 940                 


Leu Lys Gln Leu Ser Tyr Ile Arg Lys Leu Gln His Lys Met Gln Thr 
945                 950                 955                 960 


Asn Glu Gln Asp Val Leu Asp Leu Ile Asn Asn Glu Pro Ser Asp Glu 
                965                 970                 975     


Glu Phe Lys Lys Arg Ile Glu Gly Leu Ile Ser Ser Phe Gly Glu Gly 
            980                 985                 990         


Gln Lys Tyr Ala Asp Leu Pro Ile  Asn Thr Met Arg Glu  Met Ile Ser 
        995                 1000                 1005             


Asp Leu  Gln Gly Val Ile Ala  Arg Gly Asn Asn Gln  Thr Glu Lys 
    1010                 1015                 1020             


Asn Lys  Ile Ile Glu Leu Asp  Ala Ala Asp Asn Leu  Lys Gln Gly 
    1025                 1030                 1035             


Ile Val  Ala Asn Met Ile Gly  Ile Val Asn Tyr Ile  Phe Ala Lys 
    1040                 1045                 1050             


Tyr Ser  Tyr Lys Ala Tyr Ile  Ser Leu Glu Asp Leu  Ser Arg Ala 
    1055                 1060                 1065             


Tyr Gly  Gly Ala Lys Ser Gly  Tyr Asp Gly Arg Tyr  Leu Pro Ser 
    1070                 1075                 1080             


Thr Ser  Gln Asp Glu Asp Val  Asp Phe Lys Glu Gln  Gln Asn Gln 
    1085                 1090                 1095             


Met Leu  Ala Gly Leu Gly Thr  Tyr Gln Phe Phe Glu  Met Gln Leu 
    1100                 1105                 1110             


Leu Lys  Lys Leu Gln Lys Ile  Gln Ser Asp Asn Thr  Val Leu Arg 
    1115                 1120                 1125             


Phe Val  Pro Ala Phe Arg Ser  Ala Asp Asn Tyr Arg  Asn Ile Leu 
    1130                 1135                 1140             


Arg Leu  Glu Glu Thr Lys Tyr  Lys Ser Lys Pro Phe  Gly Val Val 
    1145                 1150                 1155             


His Phe  Ile Asp Pro Lys Phe  Thr Ser Lys Lys Cys  Pro Val Cys 
    1160                 1165                 1170             


Ser Lys  Thr Asn Val Tyr Arg  Asp Lys Asp Asp Ile  Leu Val Cys 
    1175                 1180                 1185             


Lys Glu  Cys Gly Phe Arg Ser  Asp Ser Gln Leu Lys  Glu Arg Glu 
    1190                 1195                 1200             


Asn Asn  Ile His Tyr Ile His  Asn Gly Asp Asp Asn  Gly Ala Tyr 
    1205                 1210                 1215             


His Ile  Ala Leu Lys Ser Val  Glu Asn Leu Ile Gln  Met Lys 
    1220                 1225                 1230         


<210>  158
<211>  4995
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pMDT452: MAD7d-AID-UGI nucleotide sequence

<400>  158
atgaataatg gcacaaataa cttccagaac ttcattggca ttagcagcct gcaaaaaaca       60

ctgagaaatg cactgattcc gacagaaaca acacagcagt ttattgtcaa aaacggcatc      120

atcaaagagg atgaactgag aggcgaaaat cgccaaattc tgaaagatat catggacgac      180

tattaccgtg gctttatttc agaaacactg tccagcattg atgatatcga ttggacaagc      240

ctgttcgaga aaatggaaat ccaactgaaa aacggcgata acaaagacac gctgattaaa      300

gaacaaacgg aatatcgcaa agcgatccac aaaaagtttg caaatgatga ccgctttaaa      360

aacatgttca gcgcgaaact gattagcgat attctgccgg aatttgtcat ccacaataat      420

aactatagcg cgagcgagaa agaagaaaaa acacaggtca ttaaactgtt tagccgcttt      480

gccacaagct tcaaagacta tttcaaaaat cgcgcaaact gctttagcgc agatgatatt      540

tcatcatcaa gctgccatcg gattgtcaat gataatgcgg aaatcttttt tagcaacgca      600

ctggtctatc gcagaattgt taaatcattg agcaacgacg acatcaacaa aatctcaggc      660

gatatgaaag acagcctgaa agaaatgtca ctggaagaaa tctacagcta cgaaaaatac      720

ggcgaattta tcacacaaga aggcatcagc ttttacaacg atatttgcgg caaagtcaac      780

agctttatga atctgtattg ccagaaaaac aaagaaaaca aaaacctgta taaactgcag      840

aaactgcaca agcagattct gtgcattgca gatacatcat atgaagtccc gtacaaattt      900

gagagcgacg aagaagttta tcaaagcgtt aatggctttc tggataacat cagcagcaaa      960

catattgttg aacgcctgag aaaaattggc gataactata atggctacaa cctggacaaa     1020

atctacatcg tcagcaaatt ttacgaaagc gtcagccaaa aaacatatcg cgattgggaa     1080

acaattaata cagcgctgga aattcattat aacaacattc tgcctggcaa cggcaaaagc     1140

aaagcagata aagttaaaaa ggcggtcaaa aatgacctgc agaaaagcat tacagaaatc     1200

aatgaactgg tcagcaacta caaactgtgc tcagatgata atatcaaggc ggaaacgtac     1260

atccatgaaa ttagccatat cctgaacaac tttgaagcgc aagaactgaa atataacccg     1320

gaaatccatc tggttgaaag cgaactgaaa gcaagcgagc tgaaaaatgt tctggatgtc     1380

attatgaatg cgtttcattg gtgcagcgtc tttatgacag aagaactggt cgataaagat     1440

aacaactttt atgcggaact ggaagagatt tacgacgaaa tttatccggt catcagcctg     1500

tataatctgg ttcgcaatta tgtcacacag aaaccgtata gcacgaagaa aatcaaactg     1560

aactttggca ttccgacact ggcagatggc tggtcaaaat caaaagaata tagcaacaac     1620

gcgatcatcc tgatgcgcga taatctttat tatctgggca ttttcaacgc gaaaaacaag     1680

ccggacaaaa aaatcatcga aggcaatacg tcagagaaca aaggcgacta taaaaagatg     1740

atctataatc tgcttccggg accgaataaa atgatcccga aagtttttct gtcaagcaaa     1800

acaggcgtcg aaacatataa accgtcagcg tatattctgg aaggctacaa acagaacaaa     1860

cacatcaaaa gcagcaagga ctttgacatc acattttgcc atgatctgat cgactacttt     1920

aagaactgca ttgcaattca tccggaatgg aaaaacttcg gctttgattt ttcagacacg     1980

agcacgtatg aagatatcag cggcttttat agagaagttg aactgcaggg ctataaaatc     2040

gactggacat atatcagcga aaaggatatt gatctgctgc aagaaaaagg ccaactgtac     2100

ctgtttcaga tctacaacaa agacttcagc aaaaaaagca cgggcaatga taacctgcat     2160

acgatgtacc tgaaaaacct ttttagcgaa gagaacctga aagacattgt cctgaaactg     2220

aatggcgaag ccgaaatttt ctttcgcaaa tccagcatta aaaacccgat catccataaa     2280

aaaggcagca ttctggttaa ccgcacatat gaagcggaag aaaaagatca gtttggcaac     2340

attcagatcg tccgcaaaaa cattccggaa aacatttatc aagaactgta caaatacttt     2400

aacgataaaa gcgataaaga actgtccgac gaagcagcga aacttaaaaa tgttgttggc     2460

catcatgaag cggcaacaaa cattgttaaa gactatcgct atacgtacga taaatacttt     2520

ctgcatatgc cgatcacgat caacttcaaa gcaaataaaa cgggctttat caacgatcgc     2580

attctgcagt atattgccaa agaaaaggat ctgcatgtca tcggcattgc tagaggcgaa     2640

cgcaatctga tttatgtcag cgttattgat acatgcggca acattgtcga acagaaaagc     2700

tttaacattg tcaacggcta tgactaccag atcaagctga aacagcaaga aggcgcaaga     2760

caaattgctc gcaaagaatg gaaagaaatc ggcaagatca aagaaattaa agagggctat     2820

ctgagcctgg tcattcatga aatttctaaa atggtcatca aatataacgc gattatcgcc     2880

atggaagatc tgtcatatgg ctttaagaaa ggccgtttta aagtcgaaag acaggtctac     2940

cagaaattcg aaacaatgct gattaacaaa ctgaattatc tggtgtttaa agacatcagc     3000

atcacggaaa atggcggact gctgaaaggc tatcaactga catatattcc ggataagctt     3060

aaaaacgtcg gccatcaatg cggctgcatc ttttatgttc cggcagcgta tacatcaaaa     3120

attgatccga caacaggctt tgtcaacatc ttcaaattca aagatctgac ggtcgatgcg     3180

aaacgcgaat tcattaagaa atttgacagc atccgctacg acagcgagaa aaatcttttc     3240

tgctttacgt tcgactacaa caactttatc acgcagaata cggttatgtc aaaaagcagc     3300

tggtcagtct atacatatgg cgttagaatt aaacgcagat ttgtgaacgg cagatttagc     3360

aatgaaagcg atacaatcga catcacgaaa gacatggaaa aaacgcttga aatgacggat     3420

attaactggc gtgatggaca tgatcttcgc caggatatta tcgattatga aatcgtccag     3480

cacatctttg aaatctttag actgacagtc caaatgcgca attcactgtc agaacttgaa     3540

gatagagatt atgatcgcct gatttctccg gtcctgaatg aaaataacat cttttacgat     3600

agcgcaaaag caggcgacgc actgccgaaa gatgcggatg caaatggcgc atattgcatt     3660

gcactgaaag gcctgtatga aatcaaacaa atcaccgaga attggaaaga ggacggcaaa     3720

ttttcacggg ataaactgaa aatcagcaac aaggactggt ttgacttcat ccaaaataag     3780

cgctacctgc cgtcaagagc agatccgaag aaaaagagaa aagttggcgg aggcggatca     3840

ggcggaggtg gctcagcaga atatgttaga gcactgtttg attttaacgg caacgatgaa     3900

gaagatctgc cgttcaaaaa aggcgatatt ctgagaattc gcgacaaacc ggaagaacaa     3960

tggtggaatg cagaagatag cgaaggcaaa agaggcatga ttccggttcc gtatgttgaa     4020

aaatactcag gcgattacaa agatcatgac ggcgactata aagaccatga catcgattat     4080

aaggacgacg atgataaaag cagaatgacg gatgcggaat atgttcgcat tcatgaaaaa     4140

ctggacatct acacgttcaa gaagcagttc ttcaacaaca aaaaaagcgt cagccataga     4200

tgctacgttc tgtttgaact gaaaagaaga ggcgaaagac gcgcatgctt ttggggctat     4260

gcagttaata aaccgcaatc aggcacagaa cgcggaattc atgcagaaat ctttagcatt     4320

cgcaaagtcg aagaatatct gagagataat ccgggacagt ttacgattaa ttggtattca     4380

tcatggtcac cgtgcgcaga ttgcgcagaa aaaattctgg aatggtataa ccaagaactg     4440

agaggcaatg gccatacact gaaaatttgg gcatgcaaac tgtactacga aaaaaatgca     4500

cgcaatcaaa ttggcctgtg gaatctgcgc gataatggcg ttggcctgaa tgttatggtt     4560

agcgaacatt atcaatgctg ccgcaaaatc tttattcaga gcagccataa tcagctgaat     4620

gaaaatagat ggctggaaaa aacactgaaa cgtgcggaaa aaagacgctc agaactgagc     4680

attatgatcc aggttaaaat cctgcataca acgaaatcac cggcagtttc aagaggctca     4740

ggcacaaatc tgagcgatat tatcgaaaaa gaaacgggca aacagctggt cattcaagaa     4800

tcaattctga tgctgccgga agaagttgaa gaagtcattg gcaataaacc ggaaagcgat     4860

atcctggttc atacagcata tgatgaaagc acagatgaaa atgtcatgct gctgacatca     4920

gatgcaccgg aatacaaacc gtgggcactt gttattcaag atagcaatgg cgagaacaag     4980

atcaaaatgc tgtaa                                                      4995


<210>  159
<211>  1664
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  pMDT452: MAD7d-AID-UGI polypeptide sequence

<400>  159

Met Asn Asn Gly Thr Asn Asn Phe Gln Asn Phe Ile Gly Ile Ser Ser 
1               5                   10                  15      


Leu Gln Lys Thr Leu Arg Asn Ala Leu Ile Pro Thr Glu Thr Thr Gln 
            20                  25                  30          


Gln Phe Ile Val Lys Asn Gly Ile Ile Lys Glu Asp Glu Leu Arg Gly 
        35                  40                  45              


Glu Asn Arg Gln Ile Leu Lys Asp Ile Met Asp Asp Tyr Tyr Arg Gly 
    50                  55                  60                  


Phe Ile Ser Glu Thr Leu Ser Ser Ile Asp Asp Ile Asp Trp Thr Ser 
65                  70                  75                  80  


Leu Phe Glu Lys Met Glu Ile Gln Leu Lys Asn Gly Asp Asn Lys Asp 
                85                  90                  95      


Thr Leu Ile Lys Glu Gln Thr Glu Tyr Arg Lys Ala Ile His Lys Lys 
            100                 105                 110         


Phe Ala Asn Asp Asp Arg Phe Lys Asn Met Phe Ser Ala Lys Leu Ile 
        115                 120                 125             


Ser Asp Ile Leu Pro Glu Phe Val Ile His Asn Asn Asn Tyr Ser Ala 
    130                 135                 140                 


Ser Glu Lys Glu Glu Lys Thr Gln Val Ile Lys Leu Phe Ser Arg Phe 
145                 150                 155                 160 


Ala Thr Ser Phe Lys Asp Tyr Phe Lys Asn Arg Ala Asn Cys Phe Ser 
                165                 170                 175     


Ala Asp Asp Ile Ser Ser Ser Ser Cys His Arg Ile Val Asn Asp Asn 
            180                 185                 190         


Ala Glu Ile Phe Phe Ser Asn Ala Leu Val Tyr Arg Arg Ile Val Lys 
        195                 200                 205             


Ser Leu Ser Asn Asp Asp Ile Asn Lys Ile Ser Gly Asp Met Lys Asp 
    210                 215                 220                 


Ser Leu Lys Glu Met Ser Leu Glu Glu Ile Tyr Ser Tyr Glu Lys Tyr 
225                 230                 235                 240 


Gly Glu Phe Ile Thr Gln Glu Gly Ile Ser Phe Tyr Asn Asp Ile Cys 
                245                 250                 255     


Gly Lys Val Asn Ser Phe Met Asn Leu Tyr Cys Gln Lys Asn Lys Glu 
            260                 265                 270         


Asn Lys Asn Leu Tyr Lys Leu Gln Lys Leu His Lys Gln Ile Leu Cys 
        275                 280                 285             


Ile Ala Asp Thr Ser Tyr Glu Val Pro Tyr Lys Phe Glu Ser Asp Glu 
    290                 295                 300                 


Glu Val Tyr Gln Ser Val Asn Gly Phe Leu Asp Asn Ile Ser Ser Lys 
305                 310                 315                 320 


His Ile Val Glu Arg Leu Arg Lys Ile Gly Asp Asn Tyr Asn Gly Tyr 
                325                 330                 335     


Asn Leu Asp Lys Ile Tyr Ile Val Ser Lys Phe Tyr Glu Ser Val Ser 
            340                 345                 350         


Gln Lys Thr Tyr Arg Asp Trp Glu Thr Ile Asn Thr Ala Leu Glu Ile 
        355                 360                 365             


His Tyr Asn Asn Ile Leu Pro Gly Asn Gly Lys Ser Lys Ala Asp Lys 
    370                 375                 380                 


Val Lys Lys Ala Val Lys Asn Asp Leu Gln Lys Ser Ile Thr Glu Ile 
385                 390                 395                 400 


Asn Glu Leu Val Ser Asn Tyr Lys Leu Cys Ser Asp Asp Asn Ile Lys 
                405                 410                 415     


Ala Glu Thr Tyr Ile His Glu Ile Ser His Ile Leu Asn Asn Phe Glu 
            420                 425                 430         


Ala Gln Glu Leu Lys Tyr Asn Pro Glu Ile His Leu Val Glu Ser Glu 
        435                 440                 445             


Leu Lys Ala Ser Glu Leu Lys Asn Val Leu Asp Val Ile Met Asn Ala 
    450                 455                 460                 


Phe His Trp Cys Ser Val Phe Met Thr Glu Glu Leu Val Asp Lys Asp 
465                 470                 475                 480 


Asn Asn Phe Tyr Ala Glu Leu Glu Glu Ile Tyr Asp Glu Ile Tyr Pro 
                485                 490                 495     


Val Ile Ser Leu Tyr Asn Leu Val Arg Asn Tyr Val Thr Gln Lys Pro 
            500                 505                 510         


Tyr Ser Thr Lys Lys Ile Lys Leu Asn Phe Gly Ile Pro Thr Leu Ala 
        515                 520                 525             


Asp Gly Trp Ser Lys Ser Lys Glu Tyr Ser Asn Asn Ala Ile Ile Leu 
    530                 535                 540                 


Met Arg Asp Asn Leu Tyr Tyr Leu Gly Ile Phe Asn Ala Lys Asn Lys 
545                 550                 555                 560 


Pro Asp Lys Lys Ile Ile Glu Gly Asn Thr Ser Glu Asn Lys Gly Asp 
                565                 570                 575     


Tyr Lys Lys Met Ile Tyr Asn Leu Leu Pro Gly Pro Asn Lys Met Ile 
            580                 585                 590         


Pro Lys Val Phe Leu Ser Ser Lys Thr Gly Val Glu Thr Tyr Lys Pro 
        595                 600                 605             


Ser Ala Tyr Ile Leu Glu Gly Tyr Lys Gln Asn Lys His Ile Lys Ser 
    610                 615                 620                 


Ser Lys Asp Phe Asp Ile Thr Phe Cys His Asp Leu Ile Asp Tyr Phe 
625                 630                 635                 640 


Lys Asn Cys Ile Ala Ile His Pro Glu Trp Lys Asn Phe Gly Phe Asp 
                645                 650                 655     


Phe Ser Asp Thr Ser Thr Tyr Glu Asp Ile Ser Gly Phe Tyr Arg Glu 
            660                 665                 670         


Val Glu Leu Gln Gly Tyr Lys Ile Asp Trp Thr Tyr Ile Ser Glu Lys 
        675                 680                 685             


Asp Ile Asp Leu Leu Gln Glu Lys Gly Gln Leu Tyr Leu Phe Gln Ile 
    690                 695                 700                 


Tyr Asn Lys Asp Phe Ser Lys Lys Ser Thr Gly Asn Asp Asn Leu His 
705                 710                 715                 720 


Thr Met Tyr Leu Lys Asn Leu Phe Ser Glu Glu Asn Leu Lys Asp Ile 
                725                 730                 735     


Val Leu Lys Leu Asn Gly Glu Ala Glu Ile Phe Phe Arg Lys Ser Ser 
            740                 745                 750         


Ile Lys Asn Pro Ile Ile His Lys Lys Gly Ser Ile Leu Val Asn Arg 
        755                 760                 765             


Thr Tyr Glu Ala Glu Glu Lys Asp Gln Phe Gly Asn Ile Gln Ile Val 
    770                 775                 780                 


Arg Lys Asn Ile Pro Glu Asn Ile Tyr Gln Glu Leu Tyr Lys Tyr Phe 
785                 790                 795                 800 


Asn Asp Lys Ser Asp Lys Glu Leu Ser Asp Glu Ala Ala Lys Leu Lys 
                805                 810                 815     


Asn Val Val Gly His His Glu Ala Ala Thr Asn Ile Val Lys Asp Tyr 
            820                 825                 830         


Arg Tyr Thr Tyr Asp Lys Tyr Phe Leu His Met Pro Ile Thr Ile Asn 
        835                 840                 845             


Phe Lys Ala Asn Lys Thr Gly Phe Ile Asn Asp Arg Ile Leu Gln Tyr 
    850                 855                 860                 


Ile Ala Lys Glu Lys Asp Leu His Val Ile Gly Ile Ala Arg Gly Glu 
865                 870                 875                 880 


Arg Asn Leu Ile Tyr Val Ser Val Ile Asp Thr Cys Gly Asn Ile Val 
                885                 890                 895     


Glu Gln Lys Ser Phe Asn Ile Val Asn Gly Tyr Asp Tyr Gln Ile Lys 
            900                 905                 910         


Leu Lys Gln Gln Glu Gly Ala Arg Gln Ile Ala Arg Lys Glu Trp Lys 
        915                 920                 925             


Glu Ile Gly Lys Ile Lys Glu Ile Lys Glu Gly Tyr Leu Ser Leu Val 
    930                 935                 940                 


Ile His Glu Ile Ser Lys Met Val Ile Lys Tyr Asn Ala Ile Ile Ala 
945                 950                 955                 960 


Met Glu Asp Leu Ser Tyr Gly Phe Lys Lys Gly Arg Phe Lys Val Glu 
                965                 970                 975     


Arg Gln Val Tyr Gln Lys Phe Glu Thr Met Leu Ile Asn Lys Leu Asn 
            980                 985                 990         


Tyr Leu Val Phe Lys Asp Ile Ser  Ile Thr Glu Asn Gly  Gly Leu Leu 
        995                 1000                 1005             


Lys Gly  Tyr Gln Leu Thr Tyr  Ile Pro Asp Lys Leu  Lys Asn Val 
    1010                 1015                 1020             


Gly His  Gln Cys Gly Cys Ile  Phe Tyr Val Pro Ala  Ala Tyr Thr 
    1025                 1030                 1035             


Ser Lys  Ile Asp Pro Thr Thr  Gly Phe Val Asn Ile  Phe Lys Phe 
    1040                 1045                 1050             


Lys Asp  Leu Thr Val Asp Ala  Lys Arg Glu Phe Ile  Lys Lys Phe 
    1055                 1060                 1065             


Asp Ser  Ile Arg Tyr Asp Ser  Glu Lys Asn Leu Phe  Cys Phe Thr 
    1070                 1075                 1080             


Phe Asp  Tyr Asn Asn Phe Ile  Thr Gln Asn Thr Val  Met Ser Lys 
    1085                 1090                 1095             


Ser Ser  Trp Ser Val Tyr Thr  Tyr Gly Val Arg Ile  Lys Arg Arg 
    1100                 1105                 1110             


Phe Val  Asn Gly Arg Phe Ser  Asn Glu Ser Asp Thr  Ile Asp Ile 
    1115                 1120                 1125             


Thr Lys  Asp Met Glu Lys Thr  Leu Glu Met Thr Asp  Ile Asn Trp 
    1130                 1135                 1140             


Arg Asp  Gly His Asp Leu Arg  Gln Asp Ile Ile Asp  Tyr Glu Ile 
    1145                 1150                 1155             


Val Gln  His Ile Phe Glu Ile  Phe Arg Leu Thr Val  Gln Met Arg 
    1160                 1165                 1170             


Asn Ser  Leu Ser Glu Leu Glu  Asp Arg Asp Tyr Asp  Arg Leu Ile 
    1175                 1180                 1185             


Ser Pro  Val Leu Asn Glu Asn  Asn Ile Phe Tyr Asp  Ser Ala Lys 
    1190                 1195                 1200             


Ala Gly  Asp Ala Leu Pro Lys  Asp Ala Asp Ala Asn  Gly Ala Tyr 
    1205                 1210                 1215             


Cys Ile  Ala Leu Lys Gly Leu  Tyr Glu Ile Lys Gln  Ile Thr Glu 
    1220                 1225                 1230             


Asn Trp  Lys Glu Asp Gly Lys  Phe Ser Arg Asp Lys  Leu Lys Ile 
    1235                 1240                 1245             


Ser Asn  Lys Asp Trp Phe Asp  Phe Ile Gln Asn Lys  Arg Tyr Leu 
    1250                 1255                 1260             


Pro Ser  Arg Ala Asp Pro Lys  Lys Lys Arg Lys Val  Gly Gly Gly 
    1265                 1270                 1275             


Gly Ser  Gly Gly Gly Gly Ser  Ala Glu Tyr Val Arg  Ala Leu Phe 
    1280                 1285                 1290             


Asp Phe  Asn Gly Asn Asp Glu  Glu Asp Leu Pro Phe  Lys Lys Gly 
    1295                 1300                 1305             


Asp Ile  Leu Arg Ile Arg Asp  Lys Pro Glu Glu Gln  Trp Trp Asn 
    1310                 1315                 1320             


Ala Glu  Asp Ser Glu Gly Lys  Arg Gly Met Ile Pro  Val Pro Tyr 
    1325                 1330                 1335             


Val Glu  Lys Tyr Ser Gly Asp  Tyr Lys Asp His Asp  Gly Asp Tyr 
    1340                 1345                 1350             


Lys Asp  His Asp Ile Asp Tyr  Lys Asp Asp Asp Asp  Lys Ser Arg 
    1355                 1360                 1365             


Met Thr  Asp Ala Glu Tyr Val  Arg Ile His Glu Lys  Leu Asp Ile 
    1370                 1375                 1380             


Tyr Thr  Phe Lys Lys Gln Phe  Phe Asn Asn Lys Lys  Ser Val Ser 
    1385                 1390                 1395             


His Arg  Cys Tyr Val Leu Phe  Glu Leu Lys Arg Arg  Gly Glu Arg 
    1400                 1405                 1410             


Arg Ala  Cys Phe Trp Gly Tyr  Ala Val Asn Lys Pro  Gln Ser Gly 
    1415                 1420                 1425             


Thr Glu  Arg Gly Ile His Ala  Glu Ile Phe Ser Ile  Arg Lys Val 
    1430                 1435                 1440             


Glu Glu  Tyr Leu Arg Asp Asn  Pro Gly Gln Phe Thr  Ile Asn Trp 
    1445                 1450                 1455             


Tyr Ser  Ser Trp Ser Pro Cys  Ala Asp Cys Ala Glu  Lys Ile Leu 
    1460                 1465                 1470             


Glu Trp  Tyr Asn Gln Glu Leu  Arg Gly Asn Gly His  Thr Leu Lys 
    1475                 1480                 1485             


Ile Trp  Ala Cys Lys Leu Tyr  Tyr Glu Lys Asn Ala  Arg Asn Gln 
    1490                 1495                 1500             


Ile Gly  Leu Trp Asn Leu Arg  Asp Asn Gly Val Gly  Leu Asn Val 
    1505                 1510                 1515             


Met Val  Ser Glu His Tyr Gln  Cys Cys Arg Lys Ile  Phe Ile Gln 
    1520                 1525                 1530             


Ser Ser  His Asn Gln Leu Asn  Glu Asn Arg Trp Leu  Glu Lys Thr 
    1535                 1540                 1545             


Leu Lys  Arg Ala Glu Lys Arg  Arg Ser Glu Leu Ser  Ile Met Ile 
    1550                 1555                 1560             


Gln Val  Lys Ile Leu His Thr  Thr Lys Ser Pro Ala  Val Ser Arg 
    1565                 1570                 1575             


Gly Ser  Gly Thr Asn Leu Ser  Asp Ile Ile Glu Lys  Glu Thr Gly 
    1580                 1585                 1590             


Lys Gln  Leu Val Ile Gln Glu  Ser Ile Leu Met Leu  Pro Glu Glu 
    1595                 1600                 1605             


Val Glu  Glu Val Ile Gly Asn  Lys Pro Glu Ser Asp  Ile Leu Val 
    1610                 1615                 1620             


His Thr  Ala Tyr Asp Glu Ser  Thr Asp Glu Asn Val  Met Leu Leu 
    1625                 1630                 1635             


Thr Ser  Asp Ala Pro Glu Tyr  Lys Pro Trp Ala Leu  Val Ile Gln 
    1640                 1645                 1650             


Asp Ser  Asn Gly Glu Asn Lys  Ile Lys Met Leu 
    1655                 1660                 


<210>  160
<211>  7682
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pMDT454 nucleotide sequence

<400>  160
aactaactca acgctagtag tggatttaat cccaaatgag ccaacagaac cagaaccaga       60

aacagaatca gaacaagtaa cattggattt agaaatggaa gaagaaaaaa gcaatgactt      120

cgtgtgaata atgcacgaaa tcgttgctta ttttttttaa aagcggtata ctagatataa      180

cgaaacaacg aactgaatag aaacgaaaaa agagccatga cacatttata aaatgtttga      240

cgacatttta taaatgcata gcccgataag attgccaaac caacgcttat cagttagtca      300

gatgaactct tccctcgtaa gaagttattt aattaacttt gtttgaagac ggtatataac      360

cgtactatca ttatataggg aaatcagaga gttttcaagt atctaagcta ctgaatttaa      420

gaattgttaa gcaatcaatc ggaaatcgtt tgattgcttt ttttgtattc atttatagaa      480

ggtggagttt gtatgaatca tgatgaatgt aaaacttata taaaaaatag tttattggag      540

ataagaaaat tagcaaatat ctatacacta gaaacgttta agaaagagtt agaaaagaga      600

aatatctact tagaaacaaa atcagataag tatttttctt cggaggggga agattatata      660

tataagttaa tagaaaataa caaaataatt tattcgatta gtggaaaaaa attgacttat      720

aaaggaaaaa aatctttttc aaaacatgca atattgaaac agttgaatga aaaagcaaac      780

caagttaatt aaacaaccta ttttatagga tttataggaa aggagaacag ctgaatgaat      840

atcccttttg ttgtagaaac tgtgcttcat gacggcttgt taaagtacaa atttaaaaat      900

agtaaaattc gctcaatcac taccaagcca ggtaaaagca aaggggctat ttttgcgtat      960

cgctcaaaat caagcatgat tggcggtcgt ggtgttgttc tgacttccga ggaagcgatt     1020

caagaaaatc aagatacatt tacacattgg acacccaacg tttatcgtta tggaacgtat     1080

gcagacgaaa accgttcata cacgaaagga cattctgaaa acaatttaag acaaatcaat     1140

accttcttta ttgattttga tattcacacg gcaaaagaaa ctatttcagc aagcgatatt     1200

ttaacaaccg ctattgattt aggttttatg cctactatga ttatcaaatc tgataaaggt     1260

tatcaagcat attttgtttt agaaacgcca gtctatgtga cttcaaaatc agaatttaaa     1320

tctgtcaaag cagccaaaat aatttcgcaa aatatccgag aatattttgg aaagtctttg     1380

ccagttgatc taacgtgtaa tcattttggt attgctcgca taccaagaac ggacaatgta     1440

gaattttttg atcctaatta ccgttattct ttcaaagaat ggcaagattg gtctttcaaa     1500

caaacagata ataagggctt tactcgttca agtctaacgg ttttaagcgg tacagaaggc     1560

aaaaaacaag tagatgaacc ctggtttaat ctcttattgc acgaaacgaa attttcagga     1620

gaaaagggtt taatagggcg taataacgtc atgtttaccc tctctttagc ctactttagt     1680

tcaggctatt caatcgaaac gtgcgaatat aatatgtttg agtttaataa tcgattagat     1740

caacccttag aagaaaaaga agtaatcaaa attgttagaa gtgcctattc agaaaactat     1800

caaggggcta atagggaata cattaccatt ctttgcaaag cttgggtatc aagtgattta     1860

accagtaaag atttatttgt ccgtcaaggg tggtttaaat tcaagaaaaa aagaagcgaa     1920

cgtcaacgtg ttcatttgtc agaatggaaa gaagatttaa tggcttatat tagcgaaaaa     1980

agcgatgtat acaagcctta tttagtgacg accaaaaaag agattagaga agtgctaggc     2040

attcctgaac ggacattaga taaattgctg aaggtactga aggcgaatca ggaaattttc     2100

tttaagatta aaccaggaag aaatggtggc attcaacttg ctagtgttaa atcattgttg     2160

ctatcgatca ttaaagtaaa aaaagaagaa aaagaaagct atataaaggc gctgacaaat     2220

tcttttgact tagagcatac attcattcaa gagactttaa acaagctagc agaacgccct     2280

aaaacggaca cacaactcga tttgtttagc tatgatacag gctgaaaata aaacccgcac     2340

tatgccatta catttatatc tatgatacgt gtttgttttt tctttgctgt ttagcgaatg     2400

attagcagaa atatacagag taagatttta attaattatt agggggagaa ggagagagta     2460

gcccgaaaac ttttagttgg cttggactga acgaagtgag ggaaaggcta ctaaaacgtc     2520

gaggggcagt gagagcgaag cgaacacttg attttttaat tttctatctt ttataggtca     2580

ttagagtata cttatttgtc ctataaacta tttagcagca taatagattt attgaatagg     2640

tcatttaagt tgagcatatt agaggaggaa aatcttggag aaatatttga agaacccgat     2700

tacatggatt ggattagttc ttgtggttac gtggttttta actaaaagta gtgaattttt     2760

gatttttggt gtgtgtgtct tgttgttagt atttgctagt caaagtgatt aaatagaatt     2820

catatccaat ttattttttt cttaacaagg gaggtgtttt ttaacatgac taaagtaggg     2880

tatgcacgtg tcagtagcaa agaacagaac ttagatagac aactgaaagc gttagagggc     2940

gtttctaagg tcttttcaga caaagcaagc ggtcaatcgg tcgaacgccc acaattacaa     3000

gctatgctta actatattcg tgaaggggat atagttgttg ttactgaatt agatcgatta     3060

ggacgaaata ataaagaatt aacagaattg atgaatcaaa ttcaaattaa gggggcaacc     3120

ctggaagtct taaatttacc ctcaatgaat ggtattgaag atgaaaattt aagacggctg     3180

attaataatt tagtgattga attgtataag taccaagcgg aatctgaacg caaacgaatt     3240

aaagaacgcc aagcccaagg aattgaaatt gctaagaaaa aaggaaaatt caaagggcga     3300

caactgaaat tcaaagaaaa tgatccacgt ttacaacacg ctttcgattt gtttttgaac     3360

ggtttatccg ataaagaagt tgaagaacaa actggaatta atcgccgaac gtttagaagg     3420

tatcgatcaa gatacaacgt gacagtcgat caaagaaaaa acaatgaaaa gagggatagt     3480

taatgagtac ggttatttta gctgaaaaac caagccaggc attagcctat gcaagtgctt     3540

taaaacaaag caccaaaaaa gacggttatt ttgagatcaa agacccaatc tttgcagatg     3600

aaacgtttat cacgtttggt tttgggcatt tagtcgagtt agcagaacca ggtcattatg     3660

acgaaaagtg gcaaaattgg aaacttgaat cattgccgat ttttcctgat cgatacgatt     3720

ttgaagtggc aacagataaa aaaaagcagt ttaaaattgt tgctgaactt ttaaaacaag     3780

caaatacaat cattgtcgca acagatagcg acagagaagg cgaaaacatt gcctggtcga     3840

tcattcataa agcaaatgcc ttttctaaag ataaaacgta taaaagacta tggatcaata     3900

gtttagaaaa agatgtgatc cgtagcggtt ttcaaaattt gcaaccagga atgaattact     3960

atccctttta tcaagaagcg caaacacgcc aaattgccga ttggttgatc ggcatgaatg     4020

caagcccttt gtatacgtta aatttacagc agaagggcgt acaaggtaca ttttcactag     4080

gacgtgttca aacgcccacc ttatatctta tttttcagcg ccaggaagcc atagaaaact     4140

ttagaaaaga accttttttc gaggtggaag ctagtataaa agtaaaccaa gggtcattta     4200

agggcgttat aagccccaca cagcgcttta aaacccaaga ggagctttta gcttttgttt     4260

cttctgaaca agctaaaata ggcaatcaag aggggataat tgctgatgtt caaaccaaag     4320

agaagaaaac gaatagtccg agtttgtttt ctttaagtag tttgcaatca aaagttaatc     4380

agctttataa agcgacagcg agccaaactt taaaagctat gcaaggactg tatgaagcaa     4440

aattattgag ttatccaaga acagatacac catttattac agagaacgaa tttgcttatt     4500

taaaagcgaa ttttggcaaa tatagcggtt ttttaggact tgatcttgaa atggttcaaa     4560

cagagcctag aaagcgttat gtggacggta gtaaggtaca ggaacaccac gccattatcc     4620

caacaaaaca agtacctacc gaatctgcat tagcgaaaat ggacgattta caacgaaaaa     4680

tatatgcttt agtcgttaaa acgaccgttg ccatgtttct acctgattat ttgtatgaag     4740

aaactaagat acaaaccaaa gtagccgact tactttttca atcaataggc aagacaccaa     4800

agcaagaagg ttggaaaatt cttttcaaac aacaaaccaa agaagaagaa gaggacgttc     4860

aaacgttacc cttggttatc attggagaac atgccgaggt tgacgttaag agtgccgaaa     4920

aagaaacaca accaccgaaa gcttttacag agggtacatt attaactgct atgaaaacgg     4980

cgaataaaac ggttgatgat gaagaagcaa tcaagatttt acaagaagtt gaggggattg     5040

gaacagaagc gacaagagca agcattattg aagccttgaa acaaaaagaa tatatccaag     5100

tgattaagaa taagcttgtt gtaactgaaa aaggaaaatt attgtgccag gcagttgaaa     5160

gtcagcacct tttaacgagt gctgaaatga cggctaaatg ggaaacgtat ttaaaaaaaa     5220

tcggtaaaag agaaggcaat caagagaact ttattacgaa tatcaaaaaa ttcattgttc     5280

atttactgga agctgtacct aacgatatag aaaaactaaa tttttctgat taccaggaac     5340

agaaagaaaa agaagcagaa aaaagtattg taggaaaatg tcctaagtgt ggcaacaata     5400

ttgtattaaa aaaatcgttt tatggttgtt caaattatcc tgaatgtaag tttactttag     5460

ctgaacattt tagaaagaaa aaactcacca aaacaaatgt aaaagaatta ctagagggaa     5520

aagaaaccct ggtaaaagga atcaaaacga aagatagaaa gtcctacaat gccgttgtaa     5580

aaatcggaga aaagggatat attgatttta tatctttctc aaaataaaca taaaagccct     5640

ttaaagaggg cttttatata ttaatcacaa atcacttatc acaaatcaca agtgatttgt     5700

gattgttgat gataaaataa gaataagaag aaatagaaag aagtgagtga ttgtgggaaa     5760

tttaggcgca caaaaagaaa aacgaaatga tacaccaatc agtgcaaaaa aagatataat     5820

gggagataag acggttcgtg ttcgtgctga cttgcaccat atcataaaaa tcgaaacagc     5880

aaagaatggc ggaaacgtaa aagaagttat ggaaataaga cttagaagca aacttaagag     5940

tgtgttgata gtgcagtatc ttaaaatttt gtataatagg aattgaagtt aaattagatg     6000

ctaaaaattt gtaattaaga aggagtgatt acatgaacaa aaatataaaa tattctcaaa     6060

actttttaac gagtgaaaaa gtactcaacc aaataataaa acaattgaat ttaaaagaaa     6120

ccgataccgt ttacgaaatt ggaacaggta aagggcattt aacgacgaaa ctggctaaaa     6180

taagtaaaca ggtaacgtct attgaattag acagtcatct attcaactta tcgtcagaaa     6240

aattaaaact gaatactcgt gtcactttaa ttcaccaaga tattctacag tttcaattcc     6300

ctaacaaaca gaggtataaa attgttggga gtattcctta ccatttaagc acacaaatta     6360

ttaaaaaagt ggtttttgaa agccatgcgt ctgacatcta tctgattgtt gaagaaggat     6420

tctacaagcg taccttggat attcaccgaa cactagggtt gctcttgcac actcaagtct     6480

cgattcagca attgcttaag ctgccagcgg aatgctttca tcctaaacca aaagtaaaca     6540

gtgtcttaat aaaacttacc cgccatacca cagatgttcc agataaatat tggaagctat     6600

atacgtactt tgtttcaaaa tgggtcaatc gagaatatcg tcaactgttt actaaaaatc     6660

agtttcatca agcaatgaaa cacgccaaag taaacaattt aagtaccgtt acttatgagc     6720

aagtattgtc tatttttaat agttatctat tatttaacgg gaggaaataa ttctatgagt     6780

cgcttttgta aatttggaaa gttacacgtt actaaaggga atgtagataa attattaggt     6840

atactactga cagcttccaa ggagctaaag agctggcgaa agggggatgt gctgcaaggc     6900

gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg     6960

aattgatcaa gctttaaatg catgctagca acgcggccgc gttgctagca tgcatttaaa     7020

gcttgatcaa ttcgagctca ttattaatct gttcagcaat cgggcgcgat tgctgaataa     7080

aagatacgag agacctctct tgtatctttt ttattttgag tggttttgtc cgttacacta     7140

gaaaaccgaa agacaataaa aattttattc ttgctgagtc tggctttcgg taagctagac     7200

aaaacggaca aaataaaaat tggcaagggt ttaaaggtgg agattttttg agtgatcttc     7260

tcaaaaaata ctacctgtcc cttgctgatt tttaaacgag cacgagagca aaacccccct     7320

ttgctgaggt ggcagagggc aggttttttt gtttcttttt tctcgtaaaa aaaagaaagg     7380

tcttaaaggt tttatggttt tggtcggcac tgccgacagc ctcgcagagc acacacttta     7440

tgaatataaa gtatagtgtg ttatacttta cttggaagtg gttgccggaa agagcgaaaa     7500

tgcctcacat tgtcgacggt atcgataagc ttcccatact gaaactgcgg actatctaca     7560

agagtagaaa ttaaaaaggt cttttgacca ttttcttata caaattatat tatacatatc     7620

agtaaaataa tgtcaacccc cctttattcc ttttttttac acagcggaca gtctggacag     7680

ca                                                                    7682


<210>  161
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  PS1-AID protospacer pMDT454

<400>  161
agtccgcagt ttcagtatgg g                                                 21


<210>  162
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  PS2-AID protospacer pMDT455

<400>  162
agatgtccca agcaaacggc a                                                 21


<210>  163
<211>  1743
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DsRed expression cassette

<400>  163
ataaatgagt agaaagcgcc atatcggcgc ttttcttttg gaagaaaata tagggaaaat       60

ggtacttgtt aaaaattcgg aatatttata caatatcata tgtatcacat tgaaaggagg      120

ggcctgctgt ccagactgtc cgctgtgtaa aaaaaaggaa taaagggggg ttgacattat      180

tttactgata tgtataatat aatttgtata agaaaatgga ggggccctcg aaacgtaaga      240

tgaaacctta gataaaagtg ctttttttgt tgcaattgaa gaattattaa tgttaagctt      300

aattaaagat aatatctttg aattgtaacg cccctcaaaa gtaagaacta caaaaaaaga      360

atacgttata tagaaatatg tttgaacctt cttcagatta caaatatatt cggacggact      420

ctacctcaaa tgcttatcta actatagaat gacatacaag cacaaccttg aaaatttgaa      480

aatataacta ccaatgaact tgttcatgtg aattatcgct gtatttaatt ttctcaattc      540

aatatataat atgccaatac attgttacaa gtagaaatta agacaccctt gatagcctta      600

ctatacctaa catgatgtag tattaaatga atatgtaaat atatttatga taagaagcga      660

cttatttata atcattacat atttttctat tggaatgatt aagattccaa tagaatagtg      720

tataaattat ttatcttgaa aggagggatg cctaaaaacg aagaacatta aaaacatata      780

tttgcaccgt ctaatggatt tatgaaaaat cattttatca gtttgaaaat tatgtattat      840

ggagctctta taaaaatgag gagggaaccg aatggcttca actgaagacg taatcaaaga      900

gttcatgcgc ttcaaagtgc gaatggaagg aagtgtaaac gggcatgagt ttgaaattga      960

aggtgaaggt gaaggaaggc cttatgaagg aacgcaaact gcaaaactta aagtgacaaa     1020

aggaggaccg ctgccgtttg cttgggacat cttaagtccg cagtttcagt atgggtcaaa     1080

agtttatgta aagcatcctg ctgacattcc tgattacaaa aagttaagtt ttcctgaagg     1140

attcaagtgg gagcgcgtaa tgaactttga agatggaggt gtcgtaactg taacgcaaga     1200

ttcaagtctg caagacggtt gcttcattta caaagtaaag ttcattggcg tgaactttcc     1260

aagtgatggt cctgtaatgc agaaaaagac aatgggttgg gagccgtcaa ctgagaggct     1320

ttatccgcgt gatggtgtct tgaaaggtga aattcacaaa gccttaaagt tgaaagatgg     1380

agggcattat cttgttgagt tcaagagcat ttacatggcg aaaaagcctg tgcagcttcc     1440

tggctactac tatgttgatt caaaacttga cataactagt cacaacgaag actacacaat     1500

tgttgagcag tatgagcgaa ctgaaggaag gcatcatctt tttctttaag agaccagact     1560

tccaattgac actaaaggga tccagaagcg gcaacacgct aatcaataaa aaaacgctgt     1620

gcggttaaag ggcacagcgt tttttgtgta tgaatcgaaa aagaggagag atcgcactga     1680

taattgccaa cacaattaac atctcaatca aggtaaatgc tagcgcggcc gcgtcgacag     1740

gcc                                                                   1743


<210>  164
<211>  678
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DsRed coding region

<400>  164
atggcttcaa ctgaagacgt aatcaaagag ttcatgcgct tcaaagtgcg aatggaagga       60

agtgtaaacg ggcatgagtt tgaaattgaa ggtgaaggtg aaggaaggcc ttatgaagga      120

acgcaaactg caaaacttaa agtgacaaaa ggaggaccgc tgccgtttgc ttgggacatc      180

ttaagtccgc agtttcagta tgggtcaaaa gtttatgtaa agcatcctgc tgacattcct      240

gattacaaaa agttaagttt tcctgaagga ttcaagtggg agcgcgtaat gaactttgaa      300

gatggaggtg tcgtaactgt aacgcaagat tcaagtctgc aagacggttg cttcatttac      360

aaagtaaagt tcattggcgt gaactttcca agtgatggtc ctgtaatgca gaaaaagaca      420

atgggttggg agccgtcaac tgagaggctt tatccgcgtg atggtgtctt gaaaggtgaa      480

attcacaaag ccttaaagtt gaaagatgga gggcattatc ttgttgagtt caagagcatt      540

tacatggcga aaaagcctgt gcagcttcct ggctactact atgttgattc aaaacttgac      600

ataactagtc acaacgaaga ctacacaatt gttgagcagt atgagcgaac tgaaggaagg      660

catcatcttt ttctttaa                                                    678


<210>  165
<211>  16
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer ID 1202334

<400>  165
ttgcaccgtc taatgg                                                       16


<210>  166
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer ID 1228373

<400>  166
gatgatgcct tccttcagtt                                                   20


