                         SEQUENCE LISTING

<110>  E. I. du Pont de Nemours and Company
 
<120>  EXPRESSION OF XYLOSE ISOMERASE ACTIVITY IN YEAST

<130>  CL5683

<160>  101   

<170>  PatentIn version 3.5

<210>  1
<211>  548
<212>  PRT
<213>  Escherichia coli

<400>  1

Met Ala Ala Lys Asp Val Lys Phe Gly Asn Asp Ala Arg Val Lys Met 
1               5                   10                  15      


Leu Arg Gly Val Asn Val Leu Ala Asp Ala Val Lys Val Thr Leu Gly 
            20                  25                  30          


Pro Lys Gly Arg Asn Val Val Leu Asp Lys Ser Phe Gly Ala Pro Thr 
        35                  40                  45              


Ile Thr Lys Asp Gly Val Ser Val Ala Arg Glu Ile Glu Leu Glu Asp 
    50                  55                  60                  


Lys Phe Glu Asn Met Gly Ala Gln Met Val Lys Glu Val Ala Ser Lys 
65                  70                  75                  80  


Ala Asn Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala 
                85                  90                  95      


Gln Ala Ile Ile Thr Glu Gly Leu Lys Ala Val Ala Ala Gly Met Asn 
            100                 105                 110         


Pro Met Asp Leu Lys Arg Gly Ile Asp Lys Ala Val Thr Ala Ala Val 
        115                 120                 125             


Glu Glu Leu Lys Ala Leu Ser Val Pro Cys Ser Asp Ser Lys Ala Ile 
    130                 135                 140                 


Ala Gln Val Gly Thr Ile Ser Ala Asn Ser Asp Glu Thr Val Gly Lys 
145                 150                 155                 160 


Leu Ile Ala Glu Ala Met Asp Lys Val Gly Lys Glu Gly Val Ile Thr 
                165                 170                 175     


Val Glu Asp Gly Thr Gly Leu Gln Asp Glu Leu Asp Val Val Glu Gly 
            180                 185                 190         


Met Gln Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe Ile Asn Lys Pro 
        195                 200                 205             


Glu Thr Gly Ala Val Glu Leu Glu Ser Pro Phe Ile Leu Leu Ala Asp 
    210                 215                 220                 


Lys Lys Ile Ser Asn Ile Arg Glu Met Leu Pro Val Leu Glu Ala Val 
225                 230                 235                 240 


Ala Lys Ala Gly Lys Pro Leu Leu Ile Ile Ala Glu Asp Val Glu Gly 
                245                 250                 255     


Glu Ala Leu Ala Thr Leu Val Val Asn Thr Met Arg Gly Ile Val Lys 
            260                 265                 270         


Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met 
        275                 280                 285             


Leu Gln Asp Ile Ala Thr Leu Thr Gly Gly Thr Val Ile Ser Glu Glu 
    290                 295                 300                 


Ile Gly Met Glu Leu Glu Lys Ala Thr Leu Glu Asp Leu Gly Gln Ala 
305                 310                 315                 320 


Lys Arg Val Val Ile Asn Lys Asp Thr Thr Thr Ile Ile Asp Gly Val 
                325                 330                 335     


Gly Glu Glu Ala Ala Ile Gln Gly Arg Val Ala Gln Ile Arg Gln Gln 
            340                 345                 350         


Ile Glu Glu Ala Thr Ser Asp Tyr Asp Arg Glu Lys Leu Gln Glu Arg 
        355                 360                 365             


Val Ala Lys Leu Ala Gly Gly Val Ala Val Ile Lys Val Gly Ala Ala 
    370                 375                 380                 


Thr Glu Val Glu Met Lys Glu Lys Lys Ala Arg Val Glu Asp Ala Leu 
385                 390                 395                 400 


His Ala Thr Arg Ala Ala Val Glu Glu Gly Val Val Ala Gly Gly Gly 
                405                 410                 415     


Val Ala Leu Ile Arg Val Ala Ser Lys Leu Ala Asp Leu Arg Gly Gln 
            420                 425                 430         


Asn Glu Asp Gln Asn Val Gly Ile Lys Val Ala Leu Arg Ala Met Glu 
        435                 440                 445             


Ala Pro Leu Arg Gln Ile Val Leu Asn Cys Gly Glu Glu Pro Ser Val 
    450                 455                 460                 


Val Ala Asn Thr Val Lys Gly Gly Asp Gly Asn Tyr Gly Tyr Asn Ala 
465                 470                 475                 480 


Ala Thr Glu Glu Tyr Gly Asn Met Ile Asp Met Gly Ile Leu Asp Pro 
                485                 490                 495     


Thr Lys Val Thr Arg Ser Ala Leu Gln Tyr Ala Ala Ser Val Ala Gly 
            500                 505                 510         


Leu Met Ile Thr Thr Glu Cys Met Val Thr Asp Leu Pro Lys Asn Asp 
        515                 520                 525             


Ala Ala Asp Leu Gly Ala Ala Gly Gly Met Gly Gly Met Gly Gly Met 
    530                 535                 540                 


Gly Gly Met Met 
545             


<210>  2
<211>  1644
<212>  DNA
<213>  artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  2
atggctgcta aagatgtaaa gttcggtaat gatgctagag taaaaatgtt gagaggtgta       60

aatgtattgg ctgacgctgt aaaagtaact ttgggtccaa aaggtagaaa tgttgtcttg      120

gataagtctt ttggtgctcc taccataact aaagacggtg tttcagtcgc aagagaaatc      180

gaattggagg ataagttcga aaacatgggt gctcaaatgg tcaaagaagt cgcctctaag      240

gctaacgatg ctgcaggtga cggtactaca accgctactg ttttggctca agcaattata      300

acagaaggtt taaaagcagt tgccgctggt atgaatccaa tggatttgaa aagaggtatt      360

gacaaggccg tcactgcagc cgtagaagaa ttgaaagcat tatcagtccc ttgttctgat      420

tcaaaggcca tcgctcaagt aggtaccatt tccgctaaca gtgatgaaac tgttggtaaa      480

ttaattgcag aagccatgga caaagtcggt aaagaaggtg taataaccgt tgaagatggt      540

actggtttgc aagatgaatt agacgtagtt gagggtatgc aatttgatag aggttatttg      600

tcaccatact tcatcaataa gcctgaaaca ggtgctgttg aattggaatc cccttttatt      660

ttgttggcag ataaaaagat tagtaacata agagaaatgt tgccagtttt agaagctgtc      720

gcaaaagccg gtaaaccttt gttaatcatt gctgaagatg ttgaaggtga agcattggca      780

acattagtcg taaataccat gagaggtatt gtaaaagttg ctgcagttaa ggctccaggt      840

ttcggtgaca gaagaaaagc tatgttgcaa gacattgcaa cattaaccgg tggtacagtt      900

atctccgaag aaattggtat ggaattggaa aaggccacct tggaagattt gggtcaagct      960

aagagagttg tcattaataa ggatactaca accatcatcg acggtgtagg tgaagaagcc     1020

gctatacaag gtagagttgc tcaaataaga caacaaatcg aagaagcaac ttctgattat     1080

gacagagaaa aattgcaaga aagagttgca aagttagccg gtggtgtcgc tgtaattaaa     1140

gttggtgcag ccaccgaagt cgaaatgaag gaaaagaaag caagagtaga agatgctttg     1200

catgcaacaa gagctgcagt tgaagaaggt gtagttgcag gtggtggtgt cgccttaatt     1260

agagtagcct ccaaattggc tgatttgaga ggtcaaaatg aagaccaaaa cgtaggtatc     1320

aaggttgcct taagagctat ggaagcacca ttgagacaaa tcgttttgaa ctgtggtgaa     1380

gaacctagtg tcgtagctaa cactgttaaa ggtggtgacg gtaattatgg ttacaacgcc     1440

gctacagaag aatacggtaa catgatcgat atgggtatat tggacccaac taaggtcaca     1500

agatctgcat tgcaatacgc agcctcagtt gccggtttaa tgattactac agaatgcatg     1560

gttacagatt tgcctaaaaa cgacgctgcc gacttgggtg ccgcaggtgg tatgggtggt     1620

atgggtggta tgggtggtat gatg                                            1644


<210>  3
<211>  550
<212>  PRT
<213>  Actinoplanes missouriensis

<400>  3

Met Ala Lys Ile Leu Ser Phe Ser Asp Asp Ala Arg His Leu Leu Glu 
1               5                   10                  15      


His Gly Val Asn Thr Leu Ala Asp Thr Val Lys Val Thr Leu Gly Pro 
            20                  25                  30          


Arg Gly Arg Asn Val Val Leu Asp Lys Lys Phe Gly Ala Pro Thr Ile 
        35                  40                  45              


Thr Asn Asp Gly Val Thr Ile Ala Lys Glu Ile Glu Leu Thr Asp Pro 
    50                  55                  60                  


Tyr Glu Asn Leu Gly Ala Gln Leu Val Lys Glu Val Ala Thr Lys Thr 
65                  70                  75                  80  


Asn Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Gln 
                85                  90                  95      


Ala Leu Val Arg Glu Gly Leu Arg Asn Val Thr Ala Gly Ala Asn Pro 
            100                 105                 110         


Ile Gly Leu Lys Arg Gly Met Asp Lys Ala Ser Glu Val Val Ser Lys 
        115                 120                 125             


Ala Leu Leu Ala Lys Ala Val Glu Val Ala Asp His Lys Ala Ile Ala 
    130                 135                 140                 


Asn Val Ala Thr Ile Ser Ala Gln Asp Ala Thr Ile Gly Glu Leu Ile 
145                 150                 155                 160 


Ala Glu Ala Met Asp Arg Val Gly Arg Asp Gly Val Ile Thr Val Glu 
                165                 170                 175     


Glu Gly Ser Ala Met Leu Thr Glu Leu Glu Val Thr Glu Gly Leu Gln 
            180                 185                 190         


Phe Asp Lys Gly Phe Ile Ser Pro Asn Phe Val Thr Asp Ala Glu Ser 
        195                 200                 205             


Gln Glu Val Val Leu Glu Asp Ala Phe Ile Leu Leu Thr Thr Gln Lys 
    210                 215                 220                 


Ile Ser Ser Ile Glu Glu Leu Leu Pro Leu Leu Glu Lys Val Leu Gln 
225                 230                 235                 240 


Ala Gly Lys Pro Leu Leu Ile Val Ala Glu Asp Val Glu Gly Gln Ala 
                245                 250                 255     


Leu Ser Thr Leu Val Val Asn Ala Leu Arg Lys Thr Ile Lys Val Ala 
            260                 265                 270         


Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Ile Leu Gln 
        275                 280                 285             


Asp Leu Ala Ile Ala Thr Gly Gly Glu Leu Ile Ala Pro Glu Leu Gly 
    290                 295                 300                 


Tyr Lys Leu Asp Gln Val Gly Ile Glu Ser Leu Gly Ser Ala Arg Arg 
305                 310                 315                 320 


Ile Val Val Asp Lys Glu Asn Thr Thr Ile Val Asp Gly Gly Gly Asn 
                325                 330                 335     


Lys Ala Asp Val Thr Asp Arg Val Ala Gln Ile Arg Lys Glu Ile Glu 
            340                 345                 350         


Ala Ser Asp Ser Asp Trp Asp Arg Glu Lys Leu Gln Glu Arg Leu Ala 
        355                 360                 365             


Lys Leu Gly Gly Gly Ile Ala Val Ile Lys Val Gly Ala Ala Thr Glu 
    370                 375                 380                 


Val Glu Met Lys Glu Arg Lys His Arg Ile Glu Asp Ala Ile Ala Ala 
385                 390                 395                 400 


Thr Lys Ala Ala Val Glu Glu Gly Thr Val Pro Gly Gly Gly Ala Ala 
                405                 410                 415     


Leu Ala Gln Val Ser Lys Glu Leu Glu Asp Asn Leu Gly Leu Thr Gly 
            420                 425                 430         


Glu Glu Ala Ile Gly Val Ser Ile Val Arg Lys Ala Leu Val Glu Pro 
        435                 440                 445             


Leu Arg Trp Ile Ala Gln Asn Ala Gly His Asp Gly Tyr Val Val Val 
    450                 455                 460                 


Gly Lys Val Gly Glu Leu Gly Trp Gly His Gly Leu Asn Ala Ala Thr 
465                 470                 475                 480 


Asp Glu Tyr Val Asp Leu Ala Ala Ala Gly Ile Ile Asp Pro Val Lys 
                485                 490                 495     


Val Thr Arg Asn Ala Val Ser Asn Ala Val Ser Ile Ala Ala Leu Leu 
            500                 505                 510         


Leu Thr Thr Glu Ser Leu Val Val Glu Lys Pro Ala Glu Ala Ala Pro 
        515                 520                 525             


Ala Ala Ala Gly Gly Gly His Gly His Ser His Gly Gly His Gly His 
    530                 535                 540                 


Gln His Gly Pro Gly Phe 
545                 550 


<210>  4
<211>  1650
<212>  DNA
<213>  artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  4
atggctaaga tcttgtcctt ctctgatgat gctagacact tgttggaaca cggtgtcaac       60

actttggctg atactgttaa ggtcactttg ggtccaagag gtagaaacgt tgtcttggat      120

aagaagttcg gtgctccaac tatcaccaac gacggtgtta ctatcgctaa ggaaatcgaa      180

ttgaccgacc catacgaaaa cttgggtgct caattggtca aggaagttgc tactaagacc      240

aacgatgtcg ctggtgacgg tactactacc gctactgtct tggctcaagc tttggttaga      300

gaaggtttga gaaacgttac cgctggtgct aacccaatcg gtttgaagag aggtatggac      360

aaggcttctg aagttgtctc caaggctttg ttggctaagg ctgtcgaagt tgctgatcac      420

aaggctatcg ctaacgtcgc tactatctct gctcaagacg ctaccatcgg tgaattgatc      480

gctgaagcta tggatagagt tggtagagac ggtgtcatca ctgttgaaga aggttctgct      540

atgttgactg aattggaagt caccgaaggt ttgcaattcg acaagggttt catctctcca      600

aacttcgtta ccgatgctga atcccaagaa gttgtcttgg aagacgcttt catcttgttg      660

actacccaaa agatctcttc catcgaagaa ttgttgccat tgttggaaaa ggtcttgcaa      720

gctggtaaac cattgttgat cgtcgctgaa gacgttgaag gtcaagcttt gtctactttg      780

gttgtcaacg ctttgagaaa gaccatcaag gtcgctgctg ttaaggctcc aggtttcggt      840

gacagaagaa aggctatctt gcaagacttg gctatcgcta ctggtggtga attgatcgct      900

ccagaattgg gttacaagtt ggaccaagtc ggtatcgaat ctttgggttc cgctagaaga      960

atcgttgtcg ataaggaaaa cactaccatc gttgacggtg gtggtaacaa ggctgatgtc     1020

actgacagag ttgctcaaat cagaaaggaa atcgaagctt ctgactccga ttgggacaga     1080

gaaaagttgc aagaaagatt ggctaagttg ggtggtggta tcgctgtcat caaggttggt     1140

gctgctaccg aagttgaaat gaaggaaaga aagcacagaa tcgaagatgc tatcgctgct     1200

actaaggctg ctgtcgaaga aggtactgtt ccaggtggtg gtgctgcttt ggctcaagtc     1260

tctaaggaat tggaagacaa cttgggtttg accggtgaag aagctatcgg tgtctccatc     1320

gttagaaagg ctttggttga accattgaga tggatcgctc aaaacgctgg tcacgacggt     1380

tacgttgtcg ttggtaaagt cggtgaattg ggttggggtc acggtttgaa cgctgctact     1440

gatgaatacg ttgacttggc tgctgctggt atcatcgacc cagtcaaggt taccagaaac     1500

gctgtctcta acgctgtttc catcgctgct ttgttgttga ctaccgaatc tttggtcgtt     1560

gaaaagccag ctgaagctgc tccagctgct gctggtggtg gtcacggtca ctcccacggt     1620

ggtcacggtc accaacacgg tccaggtttc                                      1650


<210>  5
<211>  540
<212>  PRT
<213>  Actinoplanes missouriensis

<400>  5

Met Ala Lys Ile Ile Ala Phe Asp Glu Glu Ala Arg Arg Gly Leu Glu 
1               5                   10                  15      


Arg Gly Met Asn Gln Leu Ala Asp Ala Val Lys Val Thr Leu Gly Pro 
            20                  25                  30          


Lys Gly Arg Asn Val Val Leu Glu Lys Lys Trp Gly Ala Pro Thr Ile 
        35                  40                  45              


Thr Asn Asp Gly Val Ser Ile Ala Lys Glu Ile Glu Leu Glu Asp Ser 
    50                  55                  60                  


Tyr Glu Lys Ile Gly Ala Glu Leu Val Lys Glu Val Ala Lys Lys Thr 
65                  70                  75                  80  


Asp Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Gln 
                85                  90                  95      


Ala Leu Val Arg Glu Gly Leu Arg Asn Val Ala Ala Gly Ala Asn Pro 
            100                 105                 110         


Met Ala Leu Lys Arg Gly Ile Glu Ala Ala Val Ala Ser Val Ser Glu 
        115                 120                 125             


Gly Leu Gln Gln Leu Ala Lys Asp Val Glu Thr Lys Glu Gln Ile Ala 
    130                 135                 140                 


Ser Thr Ala Ser Ile Ser Ala Gly Asp Ser Thr Val Gly Glu Ile Ile 
145                 150                 155                 160 


Ala Glu Ala Met Asp Lys Val Gly Lys Glu Gly Val Ile Thr Val Glu 
                165                 170                 175     


Glu Ser Asn Thr Phe Gly Leu Glu Leu Glu Leu Thr Glu Gly Met Arg 
            180                 185                 190         


Phe Asp Lys Gly Tyr Ile Ser Ala Tyr Phe Met Thr Asp Ala Glu Arg 
        195                 200                 205             


Met Glu Ala Val Phe Asp Asp Pro Tyr Ile Leu Ile Ala Asn Ser Lys 
    210                 215                 220                 


Ile Ser Ala Val Lys Asp Leu Leu Pro Ile Leu Glu Lys Val Met Gln 
225                 230                 235                 240 


Ser Gly Lys Pro Leu Val Ile Ile Ala Glu Asp Val Glu Gly Glu Ala 
                245                 250                 255     


Leu Ala Thr Leu Val Val Asn Lys Val Arg Gly Thr Phe Lys Ser Val 
            260                 265                 270         


Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Glu 
        275                 280                 285             


Asp Ile Ala Ile Leu Thr Gly Gly Ala Val Ile Ser Glu Glu Val Gly 
    290                 295                 300                 


Leu Lys Leu Asp Ala Ala Asp Leu Ser Leu Leu Gly Gln Ala Arg Lys 
305                 310                 315                 320 


Val Val Ile Thr Lys Asp Glu Thr Thr Val Val Asp Gly Ala Gly Asn 
                325                 330                 335     


Gly Glu Gln Ile Gln Gly Arg Val Asn Gln Ile Arg Ala Glu Ile Glu 
            340                 345                 350         


Arg Ser Asp Ser Asp Tyr Asp Arg Glu Lys Leu Gln Glu Arg Leu Ala 
        355                 360                 365             


Lys Leu Ala Gly Gly Val Ala Val Ile Lys Val Gly Ala Ala Thr Glu 
    370                 375                 380                 


Val Glu Leu Lys Glu Arg Lys His Arg Ile Glu Asp Ala Val Arg Asn 
385                 390                 395                 400 


Ala Lys Ala Ala Val Glu Glu Gly Ile Val Pro Gly Gly Gly Val Ala 
                405                 410                 415     


Leu Val Gln Ala Gly Lys Thr Ala Phe Asp Lys Leu Asp Leu Val Gly 
            420                 425                 430         


Asp Glu Ala Thr Gly Ala Asn Ile Val Lys Val Ala Leu Asp Ala Pro 
        435                 440                 445             


Leu Arg Gln Ile Ala Val Asn Ala Gly Leu Glu Gly Gly Val Val Val 
    450                 455                 460                 


Glu Lys Val Arg Asn Leu Ser Ala Gly His Gly Leu Asn Ala Ala Thr 
465                 470                 475                 480 


Gly Glu Tyr Val Asp Leu Leu Ala Ala Gly Ile Ile Asp Pro Ala Lys 
                485                 490                 495     


Val Thr Arg Ser Ala Leu Gln Asn Ala Ala Ser Ile Ala Ala Leu Phe 
            500                 505                 510         


Leu Thr Thr Glu Ala Val Val Ala Asp Lys Pro Glu Lys Asn Pro Ala 
        515                 520                 525             


Pro Ala Gly Ala Pro Gly Gly Gly Asp Met Asp Phe 
    530                 535                 540 


<210>  6
<211>  1620
<212>  DNA
<213>  artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  6
atggctaaga tcatcgcttt cgacgaagaa gctagaagag gtttggaaag aggtatgaac       60

caattggctg acgctgttaa ggtcactttg ggtccaaagg gtagaaacgt tgtcttggaa      120

aagaagtggg gtgctccaac tatcaccaac gatggtgtct ctatcgctaa ggaaatcgaa      180

ttggaagact cctacgaaaa gatcggtgct gaattggtca aggaagttgc taagaagact      240

gacgatgtcg ctggtgacgg tactactacc gctaccgtct tggctcaagc tttggttaga      300

gaaggtttga gaaacgttgc tgctggtgct aacccaatgg ctttgaagag aggtatcgaa      360

gctgctgtcg cttctgtttc cgaaggtttg caacaattgg ctaaggacgt tgaaactaag      420

gaacaaatcg cttctaccgc ttctatctct gctggtgact ccactgtcgg tgaaatcatc      480

gctgaagcta tggacaaggt tggtaaagaa ggtgtcatca ctgttgaaga atctaacacc      540

ttcggtttgg aattggaatt gactgaaggt atgagattcg ataagggtta catctccgct      600

tacttcatga ccgacgctga aagaatggaa gctgtcttcg acgatccata catcttgatc      660

gctaactcta agatctccgc tgtcaaggac ttgttgccaa tcttggaaaa ggttatgcaa      720

tctggtaaac cattggtcat catcgctgaa gacgttgaag gtgaagcttt ggctactttg      780

gttgtcaaca aggttagagg tactttcaag tctgtcgctg ttaaggctcc aggtttcggt      840

gacagaagaa aggctatgtt ggaagacatc gctatcttga ctggtggtgc tgtcatctct      900

gaagaagttg gtttgaagtt ggatgctgct gacttgtcct tgttgggtca agctagaaag      960

gttgtcatca ccaaggatga aactaccgtt gttgacggtg ctggtaacgg tgaacaaatc     1020

caaggtagag ttaaccaaat cagagctgaa atcgaaagat ctgactccga ttacgacaga     1080

gaaaagttgc aagaaagatt ggctaagttg gctggtggtg tcgctgttat caaggtcggt     1140

gctgctaccg aagttgaatt gaaggaaaga aagcacagaa tcgaagacgc tgtcagaaac     1200

gctaaggctg ctgtcgaaga aggtatcgtt ccaggtggtg gtgtcgcttt ggttcaagct     1260

ggtaaaactg ctttcgataa gttggacttg gttggtgacg aagctaccgg tgctaacatc     1320

gtcaaggttg ctttggacgc tccattgaga caaatcgctg tcaacgctgg tttggaaggt     1380

ggtgttgtcg ttgaaaaggt tagaaacttg tctgctggtc acggtttgaa cgctgctact     1440

ggtgaatacg tcgatttgtt ggctgctggt atcatcgacc cagctaaggt taccagatct     1500

gctttgcaaa acgctgcttc catcgctgct ttgttcttga ctaccgaagc tgtcgttgct     1560

gacaagccag aaaagaaccc agctccagct ggtgctccag gtggtggtga catggacttc     1620


<210>  7
<211>  545
<212>  PRT
<213>  Bacteroides thetaiotaomicron

<400>  7

Met Ala Lys Glu Ile Leu Phe Asn Ile Asp Ala Arg Asp Gln Leu Lys 
1               5                   10                  15      


Lys Gly Val Asp Ala Leu Ala Asn Ala Val Lys Val Thr Leu Gly Pro 
            20                  25                  30          


Lys Gly Arg Asn Val Ile Ile Glu Lys Lys Phe Gly Ala Pro His Ile 
        35                  40                  45              


Thr Lys Asp Gly Val Thr Val Ala Lys Glu Ile Glu Leu Ala Asp Ala 
    50                  55                  60                  


Tyr Gln Asn Thr Gly Ala Gln Leu Val Lys Glu Val Ala Ser Lys Thr 
65                  70                  75                  80  


Gly Asp Asp Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Gln 
                85                  90                  95      


Ala Ile Val Ala Glu Gly Leu Lys Asn Val Thr Ala Gly Ala Ser Pro 
            100                 105                 110         


Met Asp Ile Lys Arg Gly Ile Asp Lys Ala Val Ala Lys Val Val Glu 
        115                 120                 125             


Ser Ile Lys Ala Gln Ala Glu Thr Val Gly Asp Asn Tyr Asp Lys Ile 
    130                 135                 140                 


Glu Gln Val Ala Thr Val Ser Ala Asn Asn Asp Pro Val Ile Gly Lys 
145                 150                 155                 160 


Leu Ile Ala Asp Ala Met Arg Lys Val Ser Lys Asp Gly Val Ile Thr 
                165                 170                 175     


Ile Glu Glu Ala Lys Gly Thr Asp Thr Thr Ile Gly Val Val Glu Gly 
            180                 185                 190         


Met Gln Phe Asp Arg Gly Tyr Leu Ser Ala Tyr Phe Val Thr Asn Thr 
        195                 200                 205             


Glu Lys Met Glu Cys Glu Met Glu Lys Pro Tyr Ile Leu Ile Tyr Asp 
    210                 215                 220                 


Lys Lys Ile Ser Asn Leu Lys Asp Phe Leu Pro Ile Leu Glu Pro Ala 
225                 230                 235                 240 


Val Gln Thr Gly Arg Pro Leu Leu Val Ile Ala Glu Asp Val Asp Ser 
                245                 250                 255     


Glu Ala Leu Thr Thr Leu Val Val Asn Arg Leu Arg Ser Gln Leu Lys 
            260                 265                 270         


Ile Cys Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Glu Met 
        275                 280                 285             


Leu Glu Asp Ile Ala Ile Leu Thr Gly Gly Val Val Ile Ser Glu Glu 
    290                 295                 300                 


Lys Gly Leu Lys Leu Glu Gln Ala Thr Ile Glu Met Leu Gly Thr Ala 
305                 310                 315                 320 


Asp Lys Val Thr Val Ser Lys Asp Tyr Thr Thr Ile Val Asn Gly Ala 
                325                 330                 335     


Gly Val Lys Glu Asn Ile Lys Glu Arg Cys Asp Gln Ile Lys Ala Gln 
            340                 345                 350         


Ile Val Ala Thr Lys Ser Asp Tyr Asp Arg Glu Lys Leu Gln Glu Arg 
        355                 360                 365             


Leu Ala Lys Leu Ser Gly Gly Val Ala Val Leu Tyr Val Gly Ala Ala 
    370                 375                 380                 


Ser Glu Val Glu Met Lys Glu Lys Lys Asp Arg Val Asp Asp Ala Leu 
385                 390                 395                 400 


Arg Ala Thr Arg Ala Ala Ile Glu Glu Gly Ile Ile Pro Gly Gly Gly 
                405                 410                 415     


Val Ala Tyr Ile Arg Ala Ile Asp Ser Leu Glu Gly Met Lys Gly Asp 
            420                 425                 430         


Asn Ala Asp Glu Thr Thr Gly Ile Gly Ile Ile Lys Arg Ala Ile Glu 
        435                 440                 445             


Glu Pro Leu Arg Glu Ile Val Ala Asn Ala Gly Lys Glu Gly Ala Val 
    450                 455                 460                 


Val Val Gln Lys Val Arg Glu Gly Lys Gly Asp Phe Gly Tyr Asn Ala 
465                 470                 475                 480 


Arg Thr Asp Val Tyr Glu Asn Leu His Ala Ala Gly Val Val Asp Pro 
                485                 490                 495     


Ala Lys Val Ala Arg Val Ala Leu Glu Asn Ala Ala Ser Ile Ala Gly 
            500                 505                 510         


Met Phe Leu Thr Thr Glu Cys Val Ile Val Glu Lys Lys Glu Asp Lys 
        515                 520                 525             


Pro Glu Met Pro Met Gly Ala Pro Gly Met Gly Gly Met Gly Gly Met 
    530                 535                 540                 


Met 
545 


<210>  8
<211>  1635
<212>  DNA
<213>  artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  8
atggctaagg aaatcttgtt caacatcgac gctagagacc aattgaagaa gggtgttgac       60

gctttggcta acgctgttaa ggttactttg ggtccaaagg gtagaaacgt catcatcgaa      120

aagaagttcg gtgctccaca catcactaag gacggtgtca ccgttgctaa ggaaatcgaa      180

ttggctgacg cttaccaaaa cactggtgct caattggtca aggaagttgc ttctaagacc      240

ggtgacgatg ctggtgacgg tactactacc gctactgtct tggctcaagc tatcgttgct      300

gaaggtttga agaacgttac cgctggtgct tctccaatgg acatcaagag aggtatcgat      360

aaggctgtcg ctaaggttgt cgaatccatc aaggctcaag ctgaaaccgt tggtgacaac      420

tacgataaga tcgaacaagt cgctactgtt tctgctaaca acgacccagt catcggtaaa      480

ttgatcgctg acgctatgag aaaggtctcc aaggatggtg ttatcactat cgaagaagct      540

aagggtactg acactaccat cggtgttgtc gaaggtatgc aattcgacag aggttacttg      600

tctgcttact tcgttactaa caccgaaaag atggaatgtg aaatggaaaa gccatacatc      660

ttgatctacg acaagaagat ctccaacttg aaggatttct tgccaatctt ggaaccagct      720

gtccaaactg gtagaccatt gttggtcatc gctgaagacg ttgattctga agctttgact      780

accttggttg tcaacagatt gagatcccaa ttgaagatct gtgctgttaa ggctccaggt      840

ttcggtgaca gaagaaagga aatgttggaa gatatcgcta tcttgaccgg tggtgttgtc      900

atctctgaag aaaagggttt gaagttggaa caagctacta tcgaaatgtt gggtactgct      960

gacaaggtca ccgtttccaa ggattacact accatcgtca acggtgctgg tgttaaggaa     1020

aacatcaagg aaagatgtga ccaaatcaag gctcaaatcg tcgctaccaa gtctgactac     1080

gatagagaaa agttgcaaga aagattggct aagttgtctg gtggtgtcgc tgttttgtac     1140

gtcggtgctg cttccgaagt tgaaatgaag gaaaagaagg acagagttga cgatgctttg     1200

agagctacta gagctgctat cgaagaaggt atcatcccag gtggtggtgt tgcttacatc     1260

agagctatcg actccttgga aggtatgaag ggtgacaacg ctgatgaaac taccggtatc     1320

ggtatcatca agagagctat cgaagaacca ttgagagaaa tcgtcgctaa cgctggtaaa     1380

gaaggtgctg ttgtcgttca aaaggttaga gaaggtaaag gtgacttcgg ttacaacgct     1440

agaaccgatg tttacgaaaa cttgcacgct gctggtgtcg ttgacccagc taaggtcgct     1500

agagttgctt tggaaaacgc tgcttctatc gctggtatgt tcttgactac cgaatgtgtc     1560

atcgttgaaa agaaggaaga caagccagaa atgccaatgg gtgctccagg tatgggtggt     1620

atgggtggta tgatg                                                      1635


<210>  9
<211>  544
<212>  PRT
<213>  Bacillus subtilis

<400>  9

Met Ala Lys Glu Ile Lys Phe Ser Glu Glu Ala Arg Arg Ala Met Leu 
1               5                   10                  15      


Arg Gly Val Asp Ala Leu Ala Asp Ala Val Lys Val Thr Leu Gly Pro 
            20                  25                  30          


Lys Gly Arg Asn Val Val Leu Glu Lys Lys Phe Gly Ser Pro Leu Ile 
        35                  40                  45              


Thr Asn Asp Gly Val Thr Ile Ala Lys Glu Ile Glu Leu Glu Asp Ala 
    50                  55                  60                  


Phe Glu Asn Met Gly Ala Lys Leu Val Ala Glu Val Ala Ser Lys Thr 
65                  70                  75                  80  


Asn Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Gln 
                85                  90                  95      


Ala Met Ile Arg Glu Gly Leu Lys Asn Val Thr Ala Gly Ala Asn Pro 
            100                 105                 110         


Val Gly Val Arg Lys Gly Met Glu Gln Ala Val Ala Val Ala Ile Glu 
        115                 120                 125             


Asn Leu Lys Glu Ile Ser Lys Pro Ile Glu Gly Lys Glu Ser Ile Ala 
    130                 135                 140                 


Gln Val Ala Ala Ile Ser Ala Ala Asp Glu Glu Val Gly Ser Leu Ile 
145                 150                 155                 160 


Ala Glu Ala Met Glu Arg Val Gly Asn Asp Gly Val Ile Thr Ile Glu 
                165                 170                 175     


Glu Ser Lys Gly Phe Thr Thr Glu Leu Glu Val Val Glu Gly Met Gln 
            180                 185                 190         


Phe Asp Arg Gly Tyr Ala Ser Pro Tyr Met Val Thr Asp Ser Asp Lys 
        195                 200                 205             


Met Glu Ala Val Leu Asp Asn Pro Tyr Ile Leu Ile Thr Asp Lys Lys 
    210                 215                 220                 


Ile Thr Asn Ile Gln Glu Ile Leu Pro Val Leu Glu Gln Val Val Gln 
225                 230                 235                 240 


Gln Gly Lys Pro Leu Leu Leu Ile Ala Glu Asp Val Glu Gly Glu Ala 
                245                 250                 255     


Leu Ala Thr Leu Val Val Asn Lys Leu Arg Gly Thr Phe Asn Ala Val 
            260                 265                 270         


Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Glu 
        275                 280                 285             


Asp Ile Ala Val Leu Thr Gly Gly Glu Val Ile Thr Glu Asp Leu Gly 
    290                 295                 300                 


Leu Asp Leu Lys Ser Thr Gln Ile Ala Gln Leu Gly Arg Ala Ser Lys 
305                 310                 315                 320 


Val Val Val Thr Lys Glu Asn Thr Thr Ile Val Glu Gly Ala Gly Glu 
                325                 330                 335     


Thr Asp Lys Ile Ser Ala Arg Val Thr Gln Ile Arg Ala Gln Val Glu 
            340                 345                 350         


Glu Thr Thr Ser Glu Phe Asp Arg Glu Lys Leu Gln Glu Arg Leu Ala 
        355                 360                 365             


Lys Leu Ala Gly Gly Val Ala Val Ile Lys Val Gly Ala Ala Thr Glu 
    370                 375                 380                 


Thr Glu Leu Lys Glu Arg Lys Leu Arg Ile Glu Asp Ala Leu Asn Ser 
385                 390                 395                 400 


Thr Arg Ala Ala Val Glu Glu Gly Ile Val Ser Gly Gly Gly Thr Ala 
                405                 410                 415     


Leu Val Asn Val Tyr Asn Lys Val Ala Ala Val Glu Ala Glu Gly Asp 
            420                 425                 430         


Ala Gln Thr Gly Ile Asn Ile Val Leu Arg Ala Leu Glu Glu Pro Ile 
        435                 440                 445             


Arg Gln Ile Ala His Asn Ala Gly Leu Glu Gly Ser Val Ile Val Glu 
    450                 455                 460                 


Arg Leu Lys Asn Glu Glu Ile Gly Val Gly Phe Asn Ala Ala Thr Gly 
465                 470                 475                 480 


Glu Trp Val Asn Met Ile Glu Lys Gly Ile Val Asp Pro Thr Lys Val 
                485                 490                 495     


Thr Arg Ser Ala Leu Gln Asn Ala Ala Ser Val Ala Ala Met Phe Leu 
            500                 505                 510         


Thr Thr Glu Ala Val Val Ala Asp Lys Pro Glu Glu Asn Gly Gly Gly 
        515                 520                 525             


Ala Gly Met Pro Asp Met Gly Gly Met Gly Gly Met Gly Gly Met Met 
    530                 535                 540                 


<210>  10
<211>  1632
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  10
atggctaagg aaatcaagtt ctccgaagaa gctagaagag ctatgttgag aggtgtcgat       60

gctttggctg acgctgttaa ggttaccttg ggtccaaagg gtagaaacgt tgtcttggaa      120

aagaagttcg gttctccatt gatcactaac gacggtgtca ccatcgctaa ggaaatcgaa      180

ttggaagatg ctttcgaaaa catgggtgct aagttggtcg ctgaagttgc ttctaagact      240

aacgacgttg ctggtgacgg tactactacc gctaccgttt tggctcaagc tatgatcaga      300

gaaggtttga agaacgttac cgctggtgct aacccagtcg gtgttagaaa gggtatggaa      360

caagctgtcg ctgttgctat cgaaaacttg aaggaaatct ctaagccaat cgaaggtaaa      420

gaatccatcg ctcaagtcgc tgctatctct gctgctgacg aagaagttgg ttccttgatc      480

gctgaagcta tggaaagagt cggtaacgat ggtgttatca ctatcgaaga atctaagggt      540

ttcactaccg aattggaagt tgtcgaaggt atgcaattcg acagaggtta cgcttctcca      600

tacatggtca ccgactccga taagatggaa gctgtcttgg acaacccata catcttgatc      660

actgataaga agatcaccaa catccaagaa atcttgccag tcttggaaca agttgtccaa      720

caaggtaaac cattgttgtt gatcgctgaa gacgttgaag gtgaagcttt ggctactttg      780

gttgtcaaca agttgagagg tactttcaac gctgtcgctg ttaaggctcc aggtttcggt      840

gacagaagaa aggctatgtt ggaagatatc gctgtcttga ctggtggtga agttatcacc      900

gaagacttgg gtttggattt gaagtctact caaatcgctc aattgggtag agcttccaag      960

gttgtcgtta ccaaggaaaa cactaccatc gtcgaaggtg ctggtgaaac tgacaagatc     1020

tctgctagag tcacccaaat cagagcccaa gttgaagaaa ctacctccga atttgacaga     1080

gaaaagttgc aagaaagatt ggctaagttg gctggtggtg tcgctgttat caaggttggt     1140

gctgctactg aaaccgaatt gaaggaaaga aagttgagaa tcgaagacgc tttgaactct     1200

actagagctg ctgtcgaaga aggtatcgtt tccggtggtg gtactgcttt ggtcaacgtt     1260

tacaacaagg tcgctgctgt tgaagctgaa ggtgacgctc aaactggtat caacatcgtc     1320

ttgagagctt tggaagaacc aatcagacaa atcgctcaca acgctggttt ggaaggttct     1380

gtcatcgttg aaagattgaa gaacgaagaa atcggtgtcg gtttcaacgc tgctaccggt     1440

gaatgggtta acatgatcga aaagggtatc gttgacccaa ctaaggttac cagatctgct     1500

ttgcaaaacg ctgcttccgt tgctgctatg ttcttgacta ccgaagctgt cgttgctgac     1560

aagccagaag aaaacggtgg tggtgctggt atgccagata tgggtggcat gggcggtatg     1620

ggtggtatga tg                                                         1632


<210>  11
<211>  542
<212>  PRT
<213>  Ruminococcus champanellensis

<400>  11

Met Ala Lys Gln Ile Lys Tyr Gly Glu Glu Ala Arg Lys Ala Leu Gln 
1               5                   10                  15      


Ala Gly Ile Asp Ser Leu Ala Asp Thr Val Lys Ile Thr Leu Gly Pro 
            20                  25                  30          


Lys Gly Arg Asn Val Val Leu Asp Lys Lys Phe Gly Ala Pro Leu Ile 
        35                  40                  45              


Thr Asn Asp Gly Val Thr Ile Ala Lys Glu Val Glu Leu Glu Asp Pro 
    50                  55                  60                  


Phe Glu Asn Met Gly Ala Gln Leu Val Lys Glu Val Ala Thr Lys Thr 
65                  70                  75                  80  


Asn Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala Thr Leu Leu Ala Gln 
                85                  90                  95      


Ala Met Val Arg Glu Gly Met Lys Asn Ile Ala Ala Gly Ala Asn Pro 
            100                 105                 110         


Met Ile Val Lys Lys Gly Ile Gln Lys Ala Val Asp Ala Ala Val Asn 
        115                 120                 125             


Ala Ile Lys Ala Asn Ser Lys Pro Val Glu Gly Ser Ala Asp Ile Ala 
    130                 135                 140                 


Arg Val Gly Thr Val Ser Ser Ala Asp Glu Asn Val Gly Lys Leu Ile 
145                 150                 155                 160 


Ala Glu Ala Met Glu Lys Val Ser Thr Asp Gly Val Ile Thr Leu Glu 
                165                 170                 175     


Glu Ser Lys Thr Ala Glu Thr Tyr Ser Glu Val Val Glu Gly Met Gln 
            180                 185                 190         


Phe Asp Arg Gly Tyr Ile Ser Pro Tyr Met Val Thr Asp Ala Asp Lys 
        195                 200                 205             


Met Glu Ala Val Tyr Asp Asp Ala Tyr Ile Leu Ile Thr Asp Lys Lys 
    210                 215                 220                 


Ile Ser Ser Ile Gln Glu Ile Leu Pro Leu Leu Glu Gln Val Val Gln 
225                 230                 235                 240 


Ala Gly Lys Lys Leu Val Ile Ile Ala Glu Asp Met Glu Gly Glu Ala 
                245                 250                 255     


Leu Thr Thr Ile Ile Leu Asn Asn Leu Arg Gly Thr Phe Lys Cys Ala 
            260                 265                 270         


Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Glu Met Leu Lys 
        275                 280                 285             


Asp Ile Ala Ile Leu Thr Gly Gly Glu Val Ile Thr Ser Glu Leu Gly 
    290                 295                 300                 


Leu Glu Leu Lys Asp Thr Thr Ile Ala Gln Leu Gly Arg Ala Lys Gln 
305                 310                 315                 320 


Val Val Ile Gln Lys Glu Asn Thr Ile Ile Val Asp Gly Ala Gly Ala 
                325                 330                 335     


Ser Glu Glu Ile Lys Ala Arg Ile Ser Gln Ile Arg Ser Gln Ile Glu 
            340                 345                 350         


Thr Thr Thr Ser Asp Phe Asp Lys Glu Lys Leu Gln Glu Arg Leu Ala 
        355                 360                 365             


Lys Leu Ser Gly Gly Val Ala Val Ile Lys Val Gly Ala Ala Thr Glu 
    370                 375                 380                 


Ile Glu Met Lys Glu Lys Lys Leu Arg Ile Glu Asp Ala Leu Ala Ala 
385                 390                 395                 400 


Thr Lys Ala Ala Val Glu Glu Gly Ile Val Ala Gly Gly Gly Thr Ala 
                405                 410                 415     


Leu Ile Asn Ala Ile Pro Ala Val Glu Lys Leu Leu Pro Ser Leu Asp 
            420                 425                 430         


Gly Asp Glu Lys Thr Gly Ala Lys Ile Ile Leu Lys Ala Leu Glu Glu 
        435                 440                 445             


Pro Val Arg Gln Ile Ala Arg Asn Ala Gly Leu Glu Gly Ser Val Ile 
    450                 455                 460                 


Ile Asp Lys Ile Arg Arg Ser Arg Lys Val Gly Tyr Gly Phe Asp Ala 
465                 470                 475                 480 


Tyr Asn Glu Thr Tyr Val Asp Met Ile Pro Ala Gly Ile Val Asp Pro 
                485                 490                 495     


Thr Lys Val Thr Arg Ser Ala Leu Gln Asn Ala Ala Ser Val Ala Ala 
            500                 505                 510         


Met Val Leu Thr Thr Glu Ser Leu Val Ala Asp Ile Lys Glu Glu Asn 
        515                 520                 525             


Ala Ala Ala Ala Pro Ala Met Pro Ala Gly Gly Met Gly Phe 
    530                 535                 540         


<210>  12
<211>  1626
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  12
atggctaagc aaatcaagta cggtgaagaa gctagaaagg ctttgcaagc tggtatcgac       60

tccttggctg acactgttaa gatcactttg ggtccaaagg gtagaaacgt tgtcttggat      120

aagaagttcg gtgctccatt gatcaccaac gacggtgtta ctatcgctaa ggaagtcgaa      180

ttggaagacc cattcgaaaa catgggtgct caattggtta aggaagtcgc taccaagact      240

aacgacgctg ctggtgacgg tactactacc gctaccttgt tggctcaagc tatggttaga      300

gaaggtatga agaacatcgc tgctggtgct aacccaatga tcgtcaagaa gggtatccaa      360

aaggctgttg acgctgctgt caacgctatc aaggctaact ctaagccagt tgaaggttcc      420

gctgatatcg ctagagttgg tactgtctct tccgctgacg aaaacgtcgg taaattgatc      480

gctgaagcta tggaaaaggt ttctaccgat ggtgtcatca ctttggaaga atctaagacc      540

gctgaaactt actccgaagt tgtcgaaggt atgcaattcg acagaggtta catctcccca      600

tacatggtta ccgacgctga taagatggaa gctgtctacg acgatgctta catcttgatc      660

actgacaaga agatctcttc catccaagaa atcttgccat tgttggaaca agttgtccaa      720

gctggtaaaa agttggttat catcgctgaa gacatggaag gtgaagcttt gactaccatc      780

atcttgaaca acttgagagg tactttcaag tgtgctgctg ttaaggctcc aggtttcggt      840

gacagaagaa aggaaatgtt gaaggatatc gctatcttga ccggtggtga agtcatcact      900

tctgaattgg gtttggaatt gaaggatact accatcgctc aattgggtag agctaagcaa      960

gttgtcatcc aaaaggaaaa caccatcatc gttgacggtg ctggtgcttc tgaagaaatc     1020

aaggctagaa tctctcaaat cagatcccaa atcgaaacta ccacttctga cttcgataag     1080

gaaaagttgc aagaaagatt ggctaagttg tccggtggtg ttgctgtcat caaggtcggt     1140

gctgctactg aaatcgaaat gaaggaaaag aagttgagaa tcgaagacgc tttggctgct     1200

accaaggctg ctgttgaaga aggtatcgtc gctggtggtg gtactgcttt gatcaacgct     1260

atcccagctg ttgaaaagtt gttgccatcc ttggacggtg acgaaaagac cggtgctaag     1320

atcatcttga aggctttgga agaaccagtc agacaaatcg ctagaaacgc tggtttggaa     1380

ggttctgtta tcatcgacaa gatcagaaga tccagaaagg tcggttacgg tttcgacgct     1440

tacaacgaaa cttacgttga tatgatccca gctggtatcg ttgacccaac caaggtcact     1500

agatctgctt tgcaaaacgc tgcttccgtt gctgctatgg tcttgaccac tgaatctttg     1560

gtcgctgaca tcaaggaaga aaacgctgct gctgctccag ctatgccagc tggtggtatg     1620

ggtttc                                                                1626


<210>  13
<211>  546
<212>  PRT
<213>  Zymomonas mobilis

<400>  13

Met Ala Ala Lys Asp Val Lys Phe Ser Arg Asp Ala Arg Glu Arg Ile 
1               5                   10                  15      


Leu Arg Gly Val Asp Ile Leu Ala Asp Ala Val Lys Val Thr Leu Gly 
            20                  25                  30          


Pro Lys Gly Arg Asn Val Val Leu Asp Lys Ala Phe Gly Ala Pro Arg 
        35                  40                  45              


Ile Thr Lys Asp Gly Val Ser Val Ala Lys Glu Ile Glu Leu Lys Asp 
    50                  55                  60                  


Lys Phe Glu Asn Met Gly Ala Gln Met Leu Arg Glu Val Ala Ser Lys 
65                  70                  75                  80  


Thr Asn Asp Leu Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala 
                85                  90                  95      


Gln Ala Ile Val Arg Glu Gly Met Lys Ser Val Ala Ala Gly Met Asn 
            100                 105                 110         


Pro Met Asp Leu Lys Arg Gly Ile Asp Leu Ala Ala Thr Lys Val Val 
        115                 120                 125             


Glu Ser Leu Arg Ser Arg Ser Lys Pro Val Ser Asp Phe Asn Glu Val 
    130                 135                 140                 


Ala Gln Val Gly Ile Ile Ser Ala Asn Gly Asp Glu Glu Val Gly Arg 
145                 150                 155                 160 


Arg Ile Ala Glu Ala Met Glu Lys Val Gly Lys Glu Gly Val Ile Thr 
                165                 170                 175     


Val Glu Glu Ala Lys Gly Phe Asp Phe Glu Leu Asp Val Val Glu Gly 
            180                 185                 190         


Met Gln Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe Ile Thr Asn Pro 
        195                 200                 205             


Glu Lys Met Val Ala Glu Leu Ala Asp Pro Tyr Ile Leu Ile Tyr Glu 
    210                 215                 220                 


Lys Lys Leu Ser Asn Leu Gln Ser Ile Leu Pro Ile Leu Glu Ser Val 
225                 230                 235                 240 


Val Gln Ser Gly Arg Pro Leu Leu Ile Ile Ala Glu Asp Ile Glu Gly 
                245                 250                 255     


Glu Ala Leu Ala Thr Leu Val Val Asn Lys Leu Arg Gly Gly Leu Lys 
            260                 265                 270         


Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met 
        275                 280                 285             


Leu Glu Asp Ile Ala Ile Leu Thr Lys Gly Glu Leu Ile Ser Glu Asp 
    290                 295                 300                 


Leu Gly Ile Lys Leu Glu Asn Val Thr Leu Asn Met Leu Gly Ser Ala 
305                 310                 315                 320 


Lys Arg Val Ser Ile Thr Lys Glu Asn Thr Thr Ile Val Asp Gly Ala 
                325                 330                 335     


Gly Asp Gln Ser Thr Ile Lys Asp Arg Val Glu Ala Ile Arg Ser Gln 
            340                 345                 350         


Ile Glu Ala Thr Thr Ser Asp Tyr Asp Arg Glu Lys Leu Gln Glu Arg 
        355                 360                 365             


Val Ala Lys Leu Ala Gly Gly Val Ala Val Ile Lys Val Gly Gly Ala 
    370                 375                 380                 


Thr Glu Val Glu Val Lys Glu Arg Lys Asp Arg Val Asp Asp Ala Leu 
385                 390                 395                 400 


His Ala Thr Arg Ala Ala Val Gln Glu Gly Ile Val Pro Gly Gly Gly 
                405                 410                 415     


Thr Ala Leu Leu Tyr Ala Thr Lys Thr Leu Glu Gly Leu Asn Gly Val 
            420                 425                 430         


Asn Glu Asp Gln Gln Arg Gly Ile Asp Ile Val Arg Arg Ala Leu Gln 
        435                 440                 445             


Ala Pro Val Arg Gln Ile Ala Gln Asn Ala Gly Phe Asp Gly Ala Val 
    450                 455                 460                 


Val Ala Gly Lys Leu Ile Asp Gly Asn Asp Asp Lys Ile Gly Phe Asn 
465                 470                 475                 480 


Ala Gln Thr Glu Lys Tyr Glu Asp Leu Ala Ala Thr Gly Val Ile Asp 
                485                 490                 495     


Pro Thr Lys Val Val Arg Thr Ala Leu Gln Asp Ala Ala Ser Val Ala 
            500                 505                 510         


Gly Leu Leu Ile Thr Thr Glu Ala Ala Val Gly Asp Leu Pro Glu Asp 
        515                 520                 525             


Lys Pro Ala Pro Ala Met Pro Gly Gly Met Gly Gly Met Gly Gly Met 
    530                 535                 540                 


Asp Phe 
545     


<210>  14
<211>  1638
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  14
atggctgcta aggacgttaa gttctccaga gacgctagag aaagaatctt gagaggtgtt       60

gacatcttgg ctgacgctgt taaggtcact ttgggtccaa agggtagaaa cgttgtcttg      120

gacaaggctt tcggtgctcc aagaatcacc aaggatggtg tttctgtcgc taaggaaatc      180

gaattgaagg acaagttcga aaacatgggt gctcaaatgt tgagagaagt tgcttccaag      240

actaacgact tggctggtga cggtactact accgctaccg ttttggctca agctatcgtc      300

agagaaggta tgaagtctgt cgctgctggt atgaacccaa tggacttgaa gagaggtatc      360

gatttggctg ctaccaaggt tgtcgaatct ttgagatcta gatccaagcc agtttccgac      420

ttcaacgaag ttgctcaagt cggtatcatc tctgctaacg gtgacgaaga agttggtaga      480

agaatcgctg aagctatgga aaaggtcggt aaagaaggtg ttatcactgt cgaagaagct      540

aagggtttcg acttcgaatt ggatgttgtc gaaggtatgc aattcgacag aggttacttg      600

tctccatact tcatcaccaa cccagaaaag atggtcgctg aattggctga cccatacatc      660

ttgatctacg aaaagaagtt gtctaacttg caatccatct tgccaatctt ggaatctgtt      720

gtccaatccg gtagaccatt gttgatcatc gctgaagaca tcgaaggtga agctttggct      780

actttggttg tcaacaagtt gagaggtggt ttgaaggttg ctgctgtcaa ggctccaggt      840

ttcggtgaca gaagaaaggc tatgttggaa gatatcgcta tcttgaccaa gggtgaattg      900

atctctgaag acttgggtat caagttggaa aacgttactt tgaacatgtt gggttctgct      960

aagagagttt ccatcaccaa ggaaaacact accatcgttg acggtgctgg tgaccaatcc     1020

actatcaagg acagagtcga agctatcaga tctcaaatcg aagctactac ctccgactac     1080

gatagagaaa agttgcaaga aagagttgct aagttggctg gtggtgttgc tgtcatcaag     1140

gtcggtggtg ctaccgaagt tgaagtcaag gaaagaaagg acagagttga cgatgctttg     1200

cacgctacta gagctgctgt tcaagaaggt atcgtcccag gtggtggtac tgctttgttg     1260

tacgctacta agaccttgga aggtttgaac ggtgtcaacg aagaccaaca aagaggtatc     1320

gatatcgtta gaagagcttt gcaagctcca gtcagacaaa tcgctcaaaa cgctggtttc     1380

gacggtgctg ttgtcgctgg taaattgatc gatggtaacg acgataagat cggtttcaac     1440

gctcaaactg aaaagtacga agacttggct gctaccggtg ttatcgatcc aactaaggtt     1500

gtcagaaccg ctttgcaaga cgctgcttct gttgctggtt tgttgatcac taccgaagct     1560

gctgtcggtg acttgccaga agataagcca gctccagcta tgccaggtgg tatgggcggc     1620

atgggtggta tggacttc                                                   1638


<210>  15
<211>  97
<212>  PRT
<213>  Escherichia coli

<400>  15

Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 
1               5                   10                  15      


Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 
            20                  25                  30          


Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 
        35                  40                  45              


Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 
    50                  55                  60                  


Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 
65                  70                  75                  80  


Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 
                85                  90                  95      


Ala 
    


<210>  16
<211>  291
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  16
atgaatatta gaccattgca tgatagagtt attgttaaga gaaaggaagt tgaaaccaaa       60

tctgcaggtg gtattgtttt gactggttcc gctgcagcta agagtacaag aggtgaagtt      120

ttggctgttg gtaatggtag aattttagaa aacggtgaag ttaagccttt ggatgttaag      180

gttggtgaca ttgttatttt caatgatggt tacggtgtta agtcagaaaa gattgataac      240

gaagaagttt tgatcatgtc tgaatcagat atcttggcaa ttgttgaagc a               291


<210>  17
<211>  104
<212>  PRT
<213>  Actinoplanes missouriensis

<400>  17

Met Pro Val Thr Thr Ala Thr Lys Val Ala Ile Lys Pro Leu Glu Asp 
1               5                   10                  15      


Arg Ile Val Val Gln Ala Asn Glu Ala Glu Thr Thr Thr Ala Ser Gly 
            20                  25                  30          


Ile Val Ile Pro Asp Thr Ala Lys Glu Lys Pro Gln Glu Gly Thr Val 
        35                  40                  45              


Leu Ala Val Gly Pro Gly Arg Ile Asp Asp Lys Gly Asn Arg Val Pro 
    50                  55                  60                  


Leu Asp Val Lys Val Gly Asp Val Val Leu Tyr Ser Lys Tyr Gly Gly 
65                  70                  75                  80  


Thr Glu Val Lys Tyr Ala Gly Glu Glu Tyr Leu Val Leu Ser Ala Arg 
                85                  90                  95      


Asp Val Leu Ala Val Ile Glu Lys 
            100                 


<210>  18
<211>  312
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  18
atgccagtca ccaccgctac taaggtcgct atcaagccat tggaagacag aatcgttgtt       60

caagctaacg aagctgaaac cactaccgct tctggtatcg ttatcccaga caccgctaag      120

gaaaagccac aagaaggtac tgttttggct gtcggtccag gtagaatcga cgataagggt      180

aacagagtcc cattggacgt taaggtcggt gacgttgtct tgtactctaa gtacggtggt      240

actgaagtca agtacgctgg tgaagaatac ttggtcttgt ccgctagaga tgttttggct      300

gtcatcgaaa ag                                                          312


<210>  19
<211>  112
<212>  PRT
<213>  Actinoplanes missouriensis

<400>  19

Met Ser Ala Asp Thr Arg Thr Asp Ala Gly Leu Pro Ile Arg Met Leu 
1               5                   10                  15      


His Asp Arg Val Leu Val Arg Gln Asp Gly Gly Glu Gly Glu Arg Arg 
            20                  25                  30          


Ser Ser Ala Gly Ile Val Ile Pro Ala Thr Ala Thr Ile Gly Arg Arg 
        35                  40                  45              


Leu Ser Trp Ala Val Ala Val Gly Val Gly Pro Asn Val Arg Ser Ile 
    50                  55                  60                  


Val Val Gly Asp Arg Val Leu Phe Asp Pro Asp Asp Arg Ser Glu Val 
65                  70                  75                  80  


Glu Leu His Gly Lys Glu Tyr Val Leu Leu Arg Glu Arg Asp Val His 
                85                  90                  95      


Ala Val Ala Ala Asn Arg Val Glu Ser Asp Gly Thr Gly Leu Tyr Leu 
            100                 105                 110         


<210>  20
<211>  336
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  20
atgtccgctg atactagaac cgatgctggt ttgccaatca gaatgttgca cgatagagtt       60

ttggtcagac aagatggtgg tgaaggtgaa agaagatctt ccgctggtat cgtcatccca      120

gctaccgcta ctatcggtag aagattgtct tgggctgttg ctgtcggtgt tggtccaaac      180

gtcagatcca tcgttgtcgg tgacagagtt ttgttcgatc cagacgatag atctgaagtc      240

gaattgcacg gtaaagaata cgttttgttg agagaaagag acgttcacgc tgttgctgct      300

aacagagttg aatccgatgg tactggtttg tacttg                                336


<210>  21
<211>  90
<212>  PRT
<213>  Bacteroides thetaiotaomicron

<400>  21

Met Asn Ile Lys Pro Leu Ala Asp Arg Val Leu Ile Leu Pro Ala Pro 
1               5                   10                  15      


Ala Glu Glu Lys Thr Ile Gly Gly Ile Ile Ile Pro Asp Thr Ala Lys 
            20                  25                  30          


Glu Lys Pro Leu Lys Gly Glu Val Val Ala Val Gly His Gly Thr Lys 
        35                  40                  45              


Asp Glu Glu Met Val Leu Lys Val Gly Asp Thr Val Leu Tyr Gly Lys 
    50                  55                  60                  


Tyr Ala Gly Thr Glu Leu Glu Val Glu Gly Thr Lys Tyr Leu Ile Met 
65                  70                  75                  80  


Arg Gln Ser Asp Val Leu Ala Ile Leu Gly 
                85                  90  


<210>  22
<211>  270
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  22
atgaacatca agccattggc tgacagagtt ttgatcttgc cagctccagc tgaagaaaag       60

actatcggtg gtatcatcat cccagacacc gctaaggaaa agccattgaa gggtgaagtt      120

gtcgctgttg gtcacggtac taaggacgaa gaaatggttt tgaaggtcgg tgacactgtt      180

ttgtacggta aatacgctgg tactgaattg gaagtcgaag gtactaagta cttgatcatg      240

agacaatctg acgttttggc tatcttgggt                                       270


<210>  23
<211>  94
<212>  PRT
<213>  Bacillus subtilis

<400>  23

Met Leu Lys Pro Leu Gly Asp Arg Val Val Ile Glu Leu Val Glu Ser 
1               5                   10                  15      


Glu Glu Lys Thr Ala Ser Gly Ile Val Leu Pro Asp Ser Ala Lys Glu 
            20                  25                  30          


Lys Pro Gln Glu Gly Lys Ile Val Ala Ala Gly Ser Gly Arg Val Leu 
        35                  40                  45              


Glu Ser Gly Glu Arg Val Ala Leu Glu Val Lys Glu Gly Asp Arg Ile 
    50                  55                  60                  


Ile Phe Ser Lys Tyr Ala Gly Thr Glu Val Lys Tyr Glu Gly Thr Glu 
65                  70                  75                  80  


Tyr Leu Ile Leu Arg Glu Ser Asp Ile Leu Ala Val Ile Gly 
                85                  90                  


<210>  24
<211>  282
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  24
atgttgaagc cattgggtga cagagttgtt atcgaattgg ttgaatccga agaaaagact       60

gcttccggta tcgttttgcc agactccgct aaggaaaagc cacaagaagg taaaatcgtt      120

gctgctggtt ctggtagagt cttggaatcc ggtgaaagag ttgctttgga agtcaaggaa      180

ggtgacagaa tcatcttctc taagtacgct ggtactgaag tcaagtacga aggtactgaa      240

tacttgatct tgagagaatc cgatatcttg gctgtcatcg gt                         282


<210>  25
<211>  94
<212>  PRT
<213>  Ruminococcus champanellensis

<400>  25

Met Thr Ile Lys Pro Leu Ala Asp Arg Val Val Ile Lys Met Met Glu 
1               5                   10                  15      


Ala Glu Glu Thr Thr Lys Gly Gly Ile Ile Leu Ala Ala Ser Ala Gln 
            20                  25                  30          


Glu Lys Pro Gln Val Ala Glu Ile Val Ala Val Gly Ser Gly Gly Val 
        35                  40                  45              


Val Asp Gly Lys Glu Val Lys Met Tyr Leu Lys Val Gly Asp Lys Val 
    50                  55                  60                  


Leu Leu Ser Lys Tyr Ala Gly Thr Glu Val Lys Leu Asp Gly Glu Asp 
65                  70                  75                  80  


Tyr Thr Ile Leu Arg Gln Ser Asp Ile Leu Ala Ile Val Glu 
                85                  90                  


<210>  26
<211>  282
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  26
atgactatca agccattggc tgacagagtc gttatcaaga tgatggaagc tgaagaaact       60

actaagggtg gtatcatctt ggctgcttct gctcaagaaa agccacaagt tgctgaaatc      120

gttgctgtcg gttccggtgg tgttgttgac ggtaaagaag tcaagatgta cttgaaggtt      180

ggtgacaagg tcttgttgtc taagtacgct ggtactgaag tcaagttgga cggtgaagat      240

tacactatct tgagacaatc cgacatcttg gctatcgtcg aa                         282


<210>  27
<211>  95
<212>  PRT
<213>  Zymomonas mobilis

<400>  27

Met Asn Phe Arg Pro Leu His Asp Arg Val Leu Val Arg Arg Val Ala 
1               5                   10                  15      


Ala Glu Glu Lys Thr Ala Gly Gly Ile Ile Ile Pro Asp Thr Ala Lys 
            20                  25                  30          


Glu Lys Pro Gln Glu Gly Glu Val Ile Ala Ala Gly Asn Gly Thr His 
        35                  40                  45              


Ser Glu Asp Gly Lys Val Val Pro Leu Asp Val Lys Ala Gly Asp Arg 
    50                  55                  60                  


Val Leu Phe Gly Lys Trp Ser Gly Thr Glu Val Arg Val Asp Gly Glu 
65                  70                  75                  80  


Asp Leu Leu Ile Met Lys Glu Ser Asp Ile Leu Gly Ile Ile Ser 
                85                  90                  95  


<210>  28
<211>  285
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  28
atgaacttca gaccattgca cgacagagtt ttggttagaa gagtcgctgc tgaagaaaag       60

accgctggtg gtatcatcat cccagatacc gctaaggaaa agccacaaga aggtgaagtt      120

atcgctgctg gtaacggtac tcactctgaa gacggtaaag ttgtcccatt ggacgttaag      180

gctggtgaca gagtcttgtt cggtaaatgg tccggtactg aagttagagt tgacggtgaa      240

gatttgttga tcatgaagga atctgatatc ttgggtatca tctcc                      285


<210>  29
<211>  394
<212>  PRT
<213>  Actinoplanes missouriensis

<400>  29

Met Ser Val Gln Ala Thr Arg Glu Asp Lys Phe Ser Phe Gly Leu Trp 
1               5                   10                  15      


Thr Val Gly Trp Gln Ala Arg Asp Ala Phe Gly Asp Ala Thr Arg Thr 
            20                  25                  30          


Ala Leu Asp Pro Val Glu Ala Val His Lys Leu Ala Glu Ile Gly Ala 
        35                  40                  45              


Tyr Gly Ile Thr Phe His Asp Asp Asp Leu Val Pro Phe Gly Ser Asp 
    50                  55                  60                  


Ala Gln Thr Arg Asp Gly Ile Ile Ala Gly Phe Lys Lys Ala Leu Asp 
65                  70                  75                  80  


Glu Thr Gly Leu Ile Val Pro Met Val Thr Thr Asn Leu Phe Thr His 
                85                  90                  95      


Pro Val Phe Lys Asp Gly Gly Phe Thr Ser Asn Asp Arg Ser Val Arg 
            100                 105                 110         


Arg Tyr Ala Ile Arg Lys Val Leu Arg Gln Met Asp Leu Gly Ala Glu 
        115                 120                 125             


Leu Gly Ala Lys Thr Leu Val Leu Trp Gly Gly Arg Glu Gly Ala Glu 
    130                 135                 140                 


Tyr Asp Ser Ala Lys Asp Val Ser Ala Ala Leu Asp Arg Tyr Arg Glu 
145                 150                 155                 160 


Ala Leu Asn Leu Leu Ala Gln Tyr Ser Glu Asp Arg Gly Tyr Gly Leu 
                165                 170                 175     


Arg Phe Ala Ile Glu Pro Lys Pro Asn Glu Pro Arg Gly Asp Ile Leu 
            180                 185                 190         


Leu Pro Thr Ala Gly His Ala Ile Ala Phe Val Gln Glu Leu Glu Arg 
        195                 200                 205             


Pro Glu Leu Phe Gly Ile Asn Pro Glu Thr Gly His Glu Gln Met Ser 
    210                 215                 220                 


Asn Leu Asn Phe Thr Gln Gly Ile Ala Gln Ala Leu Trp His Lys Lys 
225                 230                 235                 240 


Leu Phe His Ile Asp Leu Asn Gly Gln His Gly Pro Lys Phe Asp Gln 
                245                 250                 255     


Asp Leu Val Phe Gly His Gly Asp Leu Leu Asn Ala Phe Ser Leu Val 
            260                 265                 270         


Asp Leu Leu Glu Asn Gly Pro Asp Gly Ala Pro Ala Tyr Asp Gly Pro 
        275                 280                 285             


Arg His Phe Asp Tyr Lys Pro Ser Arg Thr Glu Asp Tyr Asp Gly Val 
    290                 295                 300                 


Trp Glu Ser Ala Lys Ala Asn Ile Arg Met Tyr Leu Leu Leu Lys Glu 
305                 310                 315                 320 


Arg Ala Lys Ala Phe Arg Ala Asp Pro Glu Val Gln Glu Ala Leu Ala 
                325                 330                 335     


Ala Ser Lys Val Ala Glu Leu Lys Thr Pro Thr Leu Asn Pro Gly Glu 
            340                 345                 350         


Gly Tyr Ala Glu Leu Leu Ala Asp Arg Ser Ala Phe Glu Asp Tyr Asp 
        355                 360                 365             


Ala Asp Ala Val Gly Ala Lys Gly Phe Gly Phe Val Lys Leu Asn Gln 
    370                 375                 380                 


Leu Ala Ile Glu His Leu Leu Gly Ala Arg 
385                 390                 


<210>  30
<211>  1182
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  30
atgtccgttc aagccacaag agaagacaag tttagtttcg gtttatggac tgtaggttgg       60

caagcaagag acgcattcgg tgacgcaacc agaactgcct tggatccagt tgaagctgtc      120

cataaattgg cagaaatcgg tgcctacggt attacattcc acgatgacga tttggttcct      180

tttggttccg atgctcaaac cagagacggt attatagccg gtttcaaaaa ggctttagat      240

gaaactggtt tgatcgtacc aatggttact acaaatttgt ttactcatcc tgtcttcaag      300

gacggtggtt ttacatctaa cgatagatca gtcagaagat acgctataag aaaggtattg      360

agacaaatgg atttgggtgc tgaattgggt gcaaagacat tagtcttgtg gggtggtaga      420

gaaggtgcag aatacgattc cgccaaagac gttagtgctg cattggacag atatagagaa      480

gcattgaatt tgttggcaca atactctgaa gatagaggtt acggtttgag atttgctata      540

gaaccaaagc ctaacgaacc aagaggtgac atattgttac ctactgcagg tcatgcaatc      600

gccttcgttc aagaattgga aagaccagaa ttgttcggta ttaatcctga aaccggtcac      660

gaacaaatgt ctaatttgaa cttcactcaa ggtattgctc aagcattatg gcataaaaag      720

ttgttccaca tcgatttgaa cggtcaacat ggtccaaaat tcgaccaaga tttggtattt      780

ggtcacggtg acttgttgaa cgctttctca ttggttgatt tgttggaaaa cggtccagat      840

ggtgcccctg cttatgacgg tccaagacat tttgattaca aaccttctag aacagaagac      900

tatgatggtg tttgggaatc agcaaaggcc aacatcagaa tgtacttgtt gttgaaggaa      960

agagctaagg cattcagagc agatccagaa gttcaagaag ccttagccgc ttccaaagtc     1020

gcagaattga agacaccaac cttaaatcct ggtgaaggtt acgccgaatt attggctgat     1080

agaagtgcat ttgaagacta tgatgccgac gctgttggtg ctaaaggttt tggttttgtc     1140

aagttaaatc aattagcaat cgaacactta ttaggtgcca ga                        1182


<210>  31
<211>  440
<212>  PRT
<213>  Escherichia coli

<400>  31

Met Gln Ala Tyr Phe Asp Gln Leu Asp Arg Val Arg Tyr Glu Gly Ser 
1               5                   10                  15      


Lys Ser Ser Asn Pro Leu Ala Phe Arg His Tyr Asn Pro Asp Glu Leu 
            20                  25                  30          


Val Leu Gly Lys Arg Met Glu Glu His Leu Arg Phe Ala Ala Cys Tyr 
        35                  40                  45              


Trp His Thr Phe Cys Trp Asn Gly Ala Asp Met Phe Gly Val Gly Ala 
    50                  55                  60                  


Phe Asn Arg Pro Trp Gln Gln Pro Gly Glu Ala Leu Ala Leu Ala Lys 
65                  70                  75                  80  


Arg Lys Ala Asp Val Ala Phe Glu Phe Phe His Lys Leu His Val Pro 
                85                  90                  95      


Phe Tyr Cys Phe His Asp Val Asp Val Ser Pro Glu Gly Ala Ser Leu 
            100                 105                 110         


Lys Glu Tyr Ile Asn Asn Phe Ala Gln Met Val Asp Val Leu Ala Gly 
        115                 120                 125             


Lys Gln Glu Glu Ser Gly Val Lys Leu Leu Trp Gly Thr Ala Asn Cys 
    130                 135                 140                 


Phe Thr Asn Pro Arg Tyr Gly Ala Gly Ala Ala Thr Asn Pro Asp Pro 
145                 150                 155                 160 


Glu Val Phe Ser Trp Ala Ala Thr Gln Val Val Thr Ala Met Glu Ala 
                165                 170                 175     


Thr His Lys Leu Gly Gly Glu Asn Tyr Val Leu Trp Gly Gly Arg Glu 
            180                 185                 190         


Gly Tyr Glu Thr Leu Leu Asn Thr Asp Leu Arg Gln Glu Arg Glu Gln 
        195                 200                 205             


Leu Gly Arg Phe Met Gln Met Val Val Glu His Lys His Lys Ile Gly 
    210                 215                 220                 


Phe Gln Gly Thr Leu Leu Ile Glu Pro Lys Pro Gln Glu Pro Thr Lys 
225                 230                 235                 240 


His Gln Tyr Asp Tyr Asp Ala Ala Thr Val Tyr Gly Phe Leu Lys Gln 
                245                 250                 255     


Phe Gly Leu Glu Lys Glu Ile Lys Leu Asn Ile Glu Ala Asn His Ala 
            260                 265                 270         


Thr Leu Ala Gly His Ser Phe His His Glu Ile Ala Thr Ala Ile Ala 
        275                 280                 285             


Leu Gly Leu Phe Gly Ser Val Asp Ala Asn Arg Gly Asp Ala Gln Leu 
    290                 295                 300                 


Gly Trp Asp Thr Asp Gln Phe Pro Asn Ser Val Glu Glu Asn Ala Leu 
305                 310                 315                 320 


Val Met Tyr Glu Ile Leu Lys Ala Gly Gly Phe Thr Thr Gly Gly Leu 
                325                 330                 335     


Asn Phe Asp Ala Lys Val Arg Arg Gln Ser Thr Asp Lys Tyr Asp Leu 
            340                 345                 350         


Phe Tyr Gly His Ile Gly Ala Met Asp Thr Met Ala Leu Ala Leu Lys 
        355                 360                 365             


Ile Ala Ala Arg Met Ile Glu Asp Gly Glu Leu Asp Lys Arg Ile Ala 
    370                 375                 380                 


Gln Arg Tyr Ser Gly Trp Asn Ser Glu Leu Gly Gln Gln Ile Leu Lys 
385                 390                 395                 400 


Gly Gln Met Ser Leu Ala Asp Leu Ala Lys Tyr Ala Gln Glu His His 
                405                 410                 415     


Leu Ser Pro Val His Gln Ser Gly Arg Gln Glu Gln Leu Glu Asn Leu 
            420                 425                 430         


Val Asn His Tyr Leu Phe Asp Lys 
        435                 440 


<210>  32
<211>  1320
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  32
atgcaagcct attttgacca attagacaga gtaagatacg aaggttccaa gtcctccaat       60

ccattagcct ttagacacta caaccctgat gaattggtat tgggtaaaag aatggaagaa      120

catttgagat ttgctgcatg ttattggcac actttctgct ggaatggtgc tgatatgttt      180

ggtgttggtg cattcaacag accatggcaa caacctggtg aagcattggc cttagctaaa      240

agaaaggctg acgtcgcatt tgaatttttc cataaattgc acgtaccatt ctattgtttc      300

catgatgtcg acgtatcccc tgaaggtgct agtttgaagg aatacataaa caacttcgcc      360

caaatggttg atgtcttagc aggtaaacaa gaagaatctg gtgttaagtt gttatggggt      420

actgctaatt gctttacaaa cccaagatac ggtgcaggtg ccgctaccaa tccagatcct      480

gaagttttct catgggcagc cacccaagtt gtcactgcca tggaagctac acataaattg      540

ggtggtgaaa actacgtctt gtggggtggt agagaaggtt acgaaacatt gttaaacacc      600

gatttgagac aagaaagaga acaattaggt agattcatgc aaatggtagt tgaacataaa      660

cacaagattg gtttccaagg tactttgtta atagaaccaa aacctcaaga accaaccaag      720

caccaatatg attacgacgc tgcaactgtc tatggtttct tgaaacaatt cggtttggaa      780

aaggaaatta agttgaacat cgaagcaaac catgccacat tagctggtca ctcctttcat      840

cacgaaatcg caaccgccat tgctttgggt ttattcggta gtgttgatgc aaatagaggt      900

gacgcccaat tgggttggga tacagaccaa tttcctaatt ccgtagaaga aaacgctttg      960

gttatgtacg aaatcttgaa ggcaggtggt tttactacag gtggtttgaa cttcgatgct     1020

aaagttagaa gacaatctac tgataagtac gacttatttt acggtcatat tggtgctatg     1080

gacacaatgg cattggcctt aaaaatagcc gctagaatga tcgaagatgg tgaattggac     1140

aagagaatcg ctcaaagata ttctggttgg aactctgaat tgggtcaaca aatcttgaag     1200

ggtcaaatgt ctttggcaga tttggccaag tacgctcaag aacatcactt atcacctgtt     1260

catcaatcag gtagacaaga acaattagaa aacttagtca accattactt attcgacaaa     1320


<210>  33
<211>  445
<212>  PRT
<213>  Bacillus subtilis

<400>  33

Met Ala Gln Ser His Ser Ser Ser Ile Asn Tyr Phe Gly Ser Ala Asn 
1               5                   10                  15      


Lys Val Val Tyr Glu Gly Lys Asp Ser Thr Asn Pro Leu Ala Phe Lys 
            20                  25                  30          


Tyr Tyr Asn Pro Gln Glu Val Ile Gly Gly Lys Thr Leu Lys Glu His 
        35                  40                  45              


Leu Arg Phe Ser Ile Ala Tyr Trp His Thr Phe Thr Ala Asp Gly Thr 
    50                  55                  60                  


Asp Val Phe Gly Ala Ala Thr Met Gln Arg Pro Trp Asp His Tyr Lys 
65                  70                  75                  80  


Gly Met Asp Leu Ala Lys Met Arg Val Glu Ala Ala Phe Glu Met Phe 
                85                  90                  95      


Glu Lys Leu Asp Ala Pro Phe Phe Ala Phe His Asp Arg Asp Ile Ala 
            100                 105                 110         


Pro Glu Gly Ser Thr Leu Lys Glu Thr Asn Gln Asn Leu Asp Met Ile 
        115                 120                 125             


Met Gly Met Ile Lys Asp Tyr Met Arg Asn Ser Gly Val Lys Leu Leu 
    130                 135                 140                 


Trp Asn Thr Ala Asn Met Phe Thr Asn Pro Arg Phe Val His Gly Ala 
145                 150                 155                 160 


Ala Thr Ser Cys Asn Ala Asp Val Phe Ala Tyr Ala Ala Ala Gln Val 
                165                 170                 175     


Lys Lys Gly Leu Glu Thr Ala Lys Glu Leu Gly Ala Glu Asn Tyr Val 
            180                 185                 190         


Phe Trp Gly Gly Arg Glu Gly Tyr Glu Thr Leu Leu Asn Thr Asp Leu 
        195                 200                 205             


Lys Phe Glu Leu Asp Asn Leu Ala Arg Phe Met His Met Ala Val Asp 
    210                 215                 220                 


Tyr Ala Lys Glu Ile Gly Tyr Thr Gly Gln Phe Leu Ile Glu Pro Lys 
225                 230                 235                 240 


Pro Lys Glu Pro Thr Thr His Gln Tyr Asp Thr Asp Ala Ala Thr Thr 
                245                 250                 255     


Ile Ala Phe Leu Lys Gln Tyr Gly Leu Asp Asn His Phe Lys Leu Asn 
            260                 265                 270         


Leu Glu Ala Asn His Ala Thr Leu Ala Gly His Thr Phe Glu His Glu 
        275                 280                 285             


Leu Arg Met Ala Arg Val His Gly Leu Leu Gly Ser Val Asp Ala Asn 
    290                 295                 300                 


Gln Gly His Pro Leu Leu Gly Trp Asp Thr Asp Glu Phe Pro Thr Asp 
305                 310                 315                 320 


Leu Tyr Ser Thr Thr Leu Ala Met Tyr Glu Ile Leu Gln Asn Gly Gly 
                325                 330                 335     


Leu Gly Ser Gly Gly Leu Asn Phe Asp Ala Lys Val Arg Arg Ser Ser 
            340                 345                 350         


Phe Glu Pro Asp Asp Leu Ile Tyr Ala His Ile Ala Gly Met Asp Ala 
        355                 360                 365             


Phe Ala Arg Gly Leu Lys Val Ala His Lys Leu Ile Glu Asp Arg Val 
    370                 375                 380                 


Phe Glu Asp Val Ile Gln His Arg Tyr Arg Ser Phe Thr Glu Gly Ile 
385                 390                 395                 400 


Gly Leu Glu Ile Ile Glu Gly Arg Ala Asn Phe His Thr Leu Glu Gln 
                405                 410                 415     


Tyr Ala Leu Asn His Lys Ser Ile Lys Asn Glu Ser Gly Arg Gln Glu 
            420                 425                 430         


Lys Leu Lys Ala Ile Leu Asn Gln Tyr Ile Leu Glu Val 
        435                 440                 445 


<210>  34
<211>  1335
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  34
atggctcaat ctcattccag ttcaatcaac tattttggaa gcgcaaacaa agtggtttac       60

gaagggaaag attcgactaa tcctttagca tttaaatatt ataatcctca agaagtaatc      120

ggcggaaaaa cgctgaaaga gcatttgcga ttttctattg cctattggca tacatttact      180

gctgatggta cagacgtttt tggagcagct acgatgcaaa gaccatggga tcactataaa      240

ggcatggatc tagcgaagat gagagtagaa gcagcatttg agatgtttga aaaactagat      300

gcaccattct ttgcttttca tgaccgggat attgcaccag aaggcagtac gctaaaagag      360

acaaaccaaa atttagatat gatcatgggc atgattaaag attacatgag aaatagcggc      420

gttaagctat tatggaatac agcaaacatg tttacgaatc cccgtttcgt ccatggtgcc      480

gcgacttctt gcaatgcaga tgtgtttgcg tatgctgcag cacaagtgaa aaaagggtta      540

gaaacagcaa aagagcttgg cgctgagaac tatgtatttt ggggcggccg tgaaggatat      600

gaaacattgt taaataccga tttaaaattt gagcttgata atttggctag atttatgcat      660

atggcagtgg attatgcgaa ggaaatcggg tacacagggc agtttttgat tgagccaaaa      720

ccaaaagagc cgaccaccca tcaatacgat acagatgcag caacaaccat tgcctttttg      780

aagcaatatg gcttagacaa tcattttaaa ttaaatcttg aagccaatca tgccacatta      840

gccgggcata cattcgaaca tgaattacgc atggcaagag tacatggtct gcttggctct      900

gttgacgcaa accagggtca tcctctttta ggctgggaca cggatgaatt tccgacggat      960

ttatattcta cgacattagc aatgtacgaa atcctgcaaa atggcggcct tggaagcggc     1020

ggattaaact ttgacgcgaa ggtcagaaga tcttctttcg agcctgatga tctaatatat     1080

gcccatattg cagggatgga tgcatttgca agaggattga aagttgccca caaattaatc     1140

gaagatcgtg tgtttgaaga tgtgattcaa catcgttacc gcagctttac tgaagggatt     1200

ggtcttgaaa ttatagaagg aagagctaat ttccacacac ttgagcaata tgcgctaaat     1260

cataaatcaa ttaaaaacga atctggaaga caggagaaat taaaagcgat attgaaccaa     1320

tacattttag aagta                                                      1335


<210>  35
<211>  387
<212>  PRT
<213>  Streptomyces rubiginosus

<400>  35

Met Asn Tyr Gln Pro Thr Pro Glu Asp Arg Phe Thr Phe Gly Leu Trp 
1               5                   10                  15      


Thr Val Gly Trp Gln Gly Arg Asp Pro Phe Gly Asp Ala Thr Arg Arg 
            20                  25                  30          


Ala Leu Asp Pro Val Glu Ser Val Arg Arg Leu Ala Glu Leu Gly Ala 
        35                  40                  45              


His Gly Val Thr Phe His Asp Asp Asp Leu Ile Pro Phe Gly Ser Ser 
    50                  55                  60                  


Asp Ser Glu Arg Glu Glu His Val Lys Arg Phe Arg Gln Ala Leu Asp 
65                  70                  75                  80  


Asp Thr Gly Met Lys Val Pro Met Ala Thr Thr Asn Leu Phe Thr His 
                85                  90                  95      


Pro Val Phe Lys Asp Gly Gly Phe Thr Ala Asn Asp Arg Asp Val Arg 
            100                 105                 110         


Arg Tyr Ala Leu Arg Lys Thr Ile Arg Asn Ile Asp Leu Ala Val Glu 
        115                 120                 125             


Leu Gly Ala Glu Thr Tyr Val Ala Trp Gly Gly Arg Glu Gly Ala Glu 
    130                 135                 140                 


Ser Gly Gly Ala Lys Asp Val Arg Asp Ala Leu Asp Arg Met Lys Glu 
145                 150                 155                 160 


Ala Phe Asp Leu Leu Gly Glu Tyr Val Thr Ser Gln Gly Tyr Asp Ile 
                165                 170                 175     


Arg Phe Ala Ile Glu Pro Lys Pro Asn Glu Pro Arg Gly Asp Ile Leu 
            180                 185                 190         


Leu Pro Thr Val Gly His Ala Leu Ala Phe Ile Glu Arg Leu Glu Arg 
        195                 200                 205             


Pro Glu Leu Tyr Gly Val Asn Pro Glu Val Gly His Glu Gln Met Ala 
    210                 215                 220                 


Gly Leu Asn Phe Pro His Gly Ile Ala Gln Ala Leu Trp Ala Gly Lys 
225                 230                 235                 240 


Leu Phe His Ile Asp Leu Asn Gly Gln Asn Gly Ile Lys Tyr Asp Gln 
                245                 250                 255     


Asp Leu Arg Phe Gly Ala Gly Asp Leu Arg Ala Ala Phe Trp Leu Val 
            260                 265                 270         


Asp Leu Leu Glu Ser Ala Gly Tyr Ser Gly Pro Arg His Phe Asp Phe 
        275                 280                 285             


Lys Pro Pro Arg Thr Glu Asp Phe Asp Gly Val Trp Ala Ser Ala Ala 
    290                 295                 300                 


Gly Cys Met Arg Asn Tyr Leu Ile Leu Lys Glu Arg Ala Ala Ala Phe 
305                 310                 315                 320 


Arg Ala Asp Pro Glu Val Gln Glu Ala Leu Arg Ala Ser Arg Leu Asp 
                325                 330                 335     


Glu Leu Ala Arg Pro Thr Ala Ala Asp Gly Leu Gln Ala Leu Leu Asp 
            340                 345                 350         


Asp Arg Ser Ala Phe Glu Glu Phe Asp Val Asp Ala Ala Ala Ala Arg 
        355                 360                 365             


Gly Met Ala Phe Glu Arg Leu Asp Gln Leu Ala Met Asp His Leu Leu 
    370                 375                 380                 


Gly Ala Arg 
385         


<210>  36
<211>  1164
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  36
atgaactacc aaccaactcc agaagataga ttcactttcg gtttgtggac tgtcggttgg       60

caaggtagag acccattcgg tgacgctacc agaagagctt tggacccagt tgaatctgtc      120

agaagattgg ctgaattggg tgctcacggt gttactttcc acgacgatga cttgatccca      180

ttcggttctt ccgactccga aagagaagaa cacgtcaaga gattcagaca agctttggat      240

gacaccggta tgaaggttcc aatggctacc actaacttgt tcacccaccc agtcttcaag      300

gacggtggtt tcactgctaa cgatagagac gttagaagat acgctttgag aaagaccatc      360

agaaacatcg acttggctgt tgaattgggt gctgaaactt acgtcgcttg gggtggtaga      420

gaaggtgctg aatctggtgg tgctaaggat gttagagacg ctttggatag aatgaaggaa      480

gctttcgact tgttgggtga atacgtcacc tcccaaggtt acgacatcag attcgctatc      540

gaaccaaagc caaacgaacc aagaggtgac atcttgttgc caactgttgg tcacgctttg      600

gctttcatcg aaagattgga aagaccagaa ttgtacggtg ttaacccaga agtcggtcac      660

gaacaaatgg ctggtttgaa cttcccacac ggtatcgctc aagctttgtg ggctggtaaa      720

ttgttccaca tcgacttgaa cggtcaaaac ggtatcaagt acgatcaaga cttgagattc      780

ggtgctggtg acttgagagc tgctttctgg ttggttgatt tgttggaatc tgctggttac      840

tccggtccaa gacacttcga cttcaagcca ccaagaaccg aagatttcga cggtgtctgg      900

gcttctgctg ctggttgtat gagaaactac ttgatcttga aggaaagagc tgctgctttc      960

agagctgacc cagaagttca agaagctttg agagcttcta gattggacga attggctaga     1020

ccaactgctg ctgatggttt gcaagctttg ttggatgaca gatccgcttt cgaagaattt     1080

gacgttgacg ctgctgctgc tagaggtatg gctttcgaaa gattggacca attggctatg     1140

gatcacttgt tgggtgctag aggt                                            1164


<210>  37
<211>  440
<212>  PRT
<213>  Burkholderia phytofirmans

<400>  37

Met Ser Tyr Phe Glu His Ile Pro Glu Ile Arg Tyr Glu Gly Pro Gln 
1               5                   10                  15      


Ser Asp Asn Pro Leu Ala Tyr Arg His Tyr Asp Lys Ser Lys Lys Val 
            20                  25                  30          


Leu Gly Lys Thr Leu Glu Glu His Leu Arg Ile Ala Val Cys Tyr Trp 
        35                  40                  45              


His Thr Phe Val Trp Pro Gly Val Asp Ile Phe Gly Gln Gly Thr Phe 
    50                  55                  60                  


Arg Arg Pro Trp Gln Gln Ala Gly Asp Ala Met Glu Arg Ala Gln Gln 
65                  70                  75                  80  


Lys Ala Asp Ser Ala Phe Glu Phe Phe Ser Lys Leu Gly Thr Pro Tyr 
                85                  90                  95      


Tyr Thr Phe His Asp Thr Asp Val Ser Pro Glu Gly Ser Asn Leu Lys 
            100                 105                 110         


Glu Tyr Ser Glu Asn Phe Leu Arg Ile Thr Asp Tyr Leu Ala Arg Lys 
        115                 120                 125             


Gln Glu Ser Thr Gly Ile Lys Leu Leu Trp Gly Thr Ala Asn Leu Phe 
    130                 135                 140                 


Ser His Pro Arg Tyr Ala Ala Gly Ala Ala Thr Ser Pro Asp Pro Glu 
145                 150                 155                 160 


Val Phe Ala Phe Ala Ala Thr Gln Val Arg His Ala Leu Asp Ala Thr 
                165                 170                 175     


Gln Arg Leu Gly Gly Asp Asn Tyr Val Leu Trp Gly Gly Arg Glu Gly 
            180                 185                 190         


Tyr Asp Thr Leu Leu Asn Thr Asp Leu Val Arg Glu Arg Asp Gln Leu 
        195                 200                 205             


Ala Arg Phe Leu His Met Val Val Asp His Ala His Lys Ile Gly Phe 
    210                 215                 220                 


Lys Gly Ser Leu Leu Ile Glu Pro Lys Pro Gln Glu Pro Thr Lys His 
225                 230                 235                 240 


Gln Tyr Asp Tyr Asp Val Ala Thr Val His Gly Phe Leu Leu Gln His 
                245                 250                 255     


Gly Leu Asp Lys Glu Ile Arg Val Asn Ile Glu Ala Asn His Ala Thr 
            260                 265                 270         


Leu Ala Gly His Ser Phe His His Glu Ile Ala Thr Ala Tyr Ala Leu 
        275                 280                 285             


Gly Ile Phe Gly Ser Val Asp Ala Asn Arg Gly Asp Pro Gln Asn Gly 
    290                 295                 300                 


Trp Asp Thr Asp Gln Phe Pro Asn Ser Val Glu Glu Leu Thr Leu Ala 
305                 310                 315                 320 


Phe Tyr Glu Ile Leu Lys His Gly Gly Phe Thr Thr Gly Gly Met Asn 
                325                 330                 335     


Phe Asp Ser Lys Val Arg Arg Gln Ser Val Asp Pro Glu Asp Leu Phe 
            340                 345                 350         


Tyr Gly His Ile Gly Ala Ile Asp Asn Leu Ala Leu Ala Val Glu Arg 
        355                 360                 365             


Ala Ala Val Leu Ile Glu Asn Asp Arg Leu Asp Gln Phe Lys Arg Gln 
    370                 375                 380                 


Arg Tyr Ser Gly Trp Asp Ala Glu Phe Gly Arg Lys Ile Ser Ser Gly 
385                 390                 395                 400 


Asp Tyr Ser Leu Ser Ala Leu Ala Glu Glu Ala Met Ala Arg Gly Leu 
                405                 410                 415     


Asn Pro Gln His Ala Ser Gly His Gln Glu Leu Met Glu Asn Ile Val 
            420                 425                 430         


Asn Gln Ala Ile Tyr Ser Gly Arg 
        435                 440 


<210>  38
<211>  1320
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  38
atgtcctact tcgaacacat cccagaaatc agatacgaag gtccacaatc cgataaccca       60

ttggcttaca gacactacga caagtccaag aaggttttgg gtaaaacttt ggaagaacac      120

ttgagaatcg ctgtctgtta ctggcacact ttcgtttggc caggtgttga catcttcggt      180

caaggtactt tcagaagacc atggcaacaa gctggtgacg ctatggaaag agcccaacaa      240

aaggctgact ctgctttcga atttttctct aagttgggta ctccatacta cactttccac      300

gacaccgatg tttctccaga aggttccaac ttgaaggaat actctgaaaa cttcttgaga      360

atcactgact acttggctag aaagcaagaa tccactggta tcaagttgtt gtggggtact      420

gctaacttgt tctctcaccc aagatacgct gctggtgctg ctacctcccc agacccagaa      480

gttttcgctt tcgctgctac tcaagtcaga cacgctttgg atgctaccca aagattgggt      540

ggtgacaact acgttttgtg gggtggtaga gaaggttacg acactttgtt gaacaccgat      600

ttggtcagag aaagagacca attggctaga ttcttgcaca tggttgttga ccacgctcac      660

aagatcggtt tcaagggttc tttgttgatc gaaccaaagc cacaagaacc aactaagcac      720

caatacgact acgatgttgc taccgtccac ggtttcttgt tgcaacacgg tttggacaag      780

gaaatcagag tcaacatcga agctaaccac gctactttgg ctggtcactc tttccaccac      840

gaaatcgcta ccgcttacgc tttgggtatc ttcggttccg ttgacgctaa cagaggtgac      900

ccacaaaacg gttgggacac tgatcaattc ccaaactctg tcgaagaatt gaccttggct      960

ttctacgaaa tcttgaagca cggtggtttc accactggtg gtatgaactt cgactctaag     1020

gttagaagac aatccgttga cccagaagat ttgttctacg gtcacatcgg tgctatcgac     1080

aacttggctt tggctgttga aagagctgct gtcttgatcg aaaacgacag attggatcaa     1140

ttcaagagac aaagatactc tggttgggat gctgaatttg gtagaaagat ctcttccggt     1200

gactactctt tgtccgcttt ggctgaagaa gctatggcta gaggtttgaa cccacaacac     1260

gcttctggtc accaagaatt gatggaaaac atcgttaacc aagctatcta ctccggtaga     1320


<210>  39
<211>  441
<212>  PRT
<213>  Burkholderia phymatum

<400>  39

Met Ser Tyr Phe Glu His Leu Pro Ala Val Arg Tyr Glu Gly Pro Gln 
1               5                   10                  15      


Thr Asp Asn Pro Phe Ala Tyr Arg His Tyr Asp Lys Asp Lys Leu Val 
            20                  25                  30          


Leu Gly Lys Arg Met Glu Asp His Leu Arg Val Ala Val Cys Tyr Trp 
        35                  40                  45              


His Thr Phe Val Trp Pro Gly Ala Asp Met Phe Gly Pro Gly Thr Phe 
    50                  55                  60                  


Glu Arg Pro Trp His His Ala Gly Asp Ala Leu Glu Met Ala His Ala 
65                  70                  75                  80  


Lys Ala Asp His Ala Phe Glu Leu Phe Ser Lys Leu Gly Thr Pro Phe 
                85                  90                  95      


Tyr Thr Phe His Asp Leu Asp Val Ala Pro Glu Gly Asp Ser Ile Lys 
            100                 105                 110         


Ser Tyr Val Asn Asn Phe Lys Ala Met Thr Asp Val Leu Ala Arg Lys 
        115                 120                 125             


Gln Glu Gln Thr Gly Ile Lys Leu Leu Trp Gly Thr Ala Asn Leu Phe 
    130                 135                 140                 


Ser His Pro Arg Tyr Ala Ala Gly Ala Ala Thr Asn Pro Asn Pro Asp 
145                 150                 155                 160 


Val Phe Ala Phe Ala Ala Thr Gln Val Leu Asn Ala Leu Glu Ala Thr 
                165                 170                 175     


Gln Arg Leu Gly Gly Ala Asn Tyr Val Leu Trp Gly Gly Arg Glu Gly 
            180                 185                 190         


Tyr Glu Thr Leu Leu Asn Thr Asp Leu Lys Arg Glu Arg Glu Gln Leu 
        195                 200                 205             


Gly Arg Phe Met Ser Met Val Val Glu His Lys His Lys Thr Gly Phe 
    210                 215                 220                 


Lys Gly Ala Leu Leu Ile Glu Pro Lys Pro Gln Glu Pro Thr Lys His 
225                 230                 235                 240 


Gln Tyr Asp Tyr Asp Val Ala Thr Val His Gly Phe Leu Thr Gln Phe 
                245                 250                 255     


Gly Leu Gln Asp Glu Ile Arg Val Asn Ile Glu Ala Asn His Ala Thr 
            260                 265                 270         


Leu Ala Gly His Ser Phe His His Glu Ile Ala Asn Ala Phe Ala Leu 
        275                 280                 285             


Gly Ile Phe Gly Ser Val Asp Ala Asn Arg Gly Asp Ala Gln Asn Gly 
    290                 295                 300                 


Trp Asp Thr Asp Gln Phe Pro Asn Ser Val Glu Glu Leu Thr Leu Ala 
305                 310                 315                 320 


Phe Tyr Glu Ile Leu Arg Asn Gly Gly Phe Thr Thr Gly Gly Met Asn 
                325                 330                 335     


Phe Asp Ala Lys Val Arg Arg Gln Ser Ile Asp Pro Glu Asp Ile Val 
            340                 345                 350         


His Gly His Ile Gly Ala Ile Asp Val Leu Ala Val Ala Leu Glu Arg 
        355                 360                 365             


Ala Ala His Leu Ile Glu His Asp Arg Leu Ala Ala Phe Lys Gln Gln 
    370                 375                 380                 


Arg Tyr Ala Gly Trp Asp Ser Asp Phe Gly Arg Lys Ile Leu Ala Gly 
385                 390                 395                 400 


Gly Tyr Ser Leu Glu Ser Leu Ala Ser Asp Ala Val Gln Arg Asn Ile 
                405                 410                 415     


Ala Pro Arg His Val Ser Gly Gln Gln Glu Arg Leu Glu Asn Ile Val 
            420                 425                 430         


Asn Gln Ala Ile Phe Ser Ser Ala Lys 
        435                 440     


<210>  40
<211>  1323
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  40
atgtcctact tcgaacactt gccagctgtc agatacgaag gtccacaaac cgataaccca       60

ttcgcttaca gacactacga taaggataag ttggttttgg gtaaaagaat ggaagaccac      120

ttgagagttg ctgtctgtta ctggcacacc ttcgtctggc caggtgctga catgttcggt      180

ccaggtactt tcgaaagacc atggcaccac gctggtgacg ctttggaaat ggctcacgct      240

aaggctgatc acgctttcga attgttctcc aagttgggta ctccattcta cactttccac      300

gacttggatg ttgctccaga aggtgactct atcaagtcct acgttaacaa cttcaaggct      360

atgaccgatg tcttggctag aaagcaagaa caaaccggta tcaagttgtt gtggggtact      420

gctaacttgt tctctcaccc aagatacgct gctggtgctg ctactaaccc aaacccagac      480

gttttcgctt tcgctgctac ccaagtcttg aacgctttgg aagctactca aagattgggt      540

ggtgctaact acgttttgtg gggtggtaga gaaggttacg aaaccttgtt gaacactgac      600

ttgaagagag aaagagaaca attgggtaga ttcatgtcta tggttgtcga acacaagcac      660

aagaccggtt tcaagggtgc tttgttgatc gaaccaaagc cacaagaacc aactaagcac      720

caatacgact acgatgttgc taccgtccac ggtttcttga ctcaattcgg tttgcaagac      780

gaaatcagag tcaacatcga agctaaccac gctaccttgg ctggtcactc cttccaccac      840

gaaatcgcta acgctttcgc tttgggtatc ttcggttctg ttgacgctaa cagaggtgac      900

gctcaaaacg gttgggacac cgatcaattc ccaaactccg tcgaagaatt gactttggct      960

ttctacgaaa tcttgagaaa cggtggtttc accactggtg gtatgaactt cgacgctaag     1020

gttagaagac aatctatcga cccagaagat atcgtccacg gtcacatcgg tgctatcgac     1080

gttttggctg tcgctttgga aagagctgct cacttgatcg aacacgatag attggctgct     1140

ttcaagcaac aaagatacgc tggttgggac tccgatttcg gtagaaagat cttggctggt     1200

ggttactctt tggaatcctt ggcttctgac gctgttcaaa gaaacatcgc tccaagacac     1260

gtctctggtc aacaagaaag attggaaaac atcgtcaacc aagctatctt ctcttccgct     1320

aag                                                                   1323


<210>  41
<211>  444
<212>  PRT
<213>  Citrobacter youngae

<400>  41

Met Glu Leu Ile Met Gln Ala Tyr Phe Asp Gln Leu Asp Arg Val Arg 
1               5                   10                  15      


Phe Glu Gly Thr Lys Ser Thr Asn Pro Leu Ala Phe Arg His Tyr Asn 
            20                  25                  30          


Pro Asp Glu Ile Val Leu Gly Lys Arg Met Glu Asp His Leu Arg Phe 
        35                  40                  45              


Ala Ala Cys Tyr Trp His Thr Phe Cys Trp Asn Gly Ala Asp Met Phe 
    50                  55                  60                  


Gly Met Gly Ala Phe Asp Arg Pro Trp Gln Gln Pro Gly Glu Ala Leu 
65                  70                  75                  80  


Ala Leu Ala Lys Arg Lys Ala Asp Val Ala Phe Glu Phe Phe His Lys 
                85                  90                  95      


Leu Asn Val Pro Tyr Tyr Cys Phe His Asp Val Asp Val Ser Pro Glu 
            100                 105                 110         


Gly Ala Ser Leu Lys Glu Tyr Lys Asn Asn Phe Ala Gln Met Val Asp 
        115                 120                 125             


Val Leu Ala Ala Lys Gln Glu Gln Ser Gly Val Lys Leu Leu Trp Gly 
    130                 135                 140                 


Thr Ala Asn Cys Phe Thr Asn Pro Arg Tyr Gly Ala Gly Ala Ala Thr 
145                 150                 155                 160 


Asn Pro Asp Pro Glu Val Phe Ser Trp Ala Ala Thr Gln Val Val Thr 
                165                 170                 175     


Ala Met Asp Ala Thr His Lys Leu Gly Gly Glu Asn Tyr Val Leu Trp 
            180                 185                 190         


Gly Gly Arg Glu Gly Tyr Glu Thr Leu Leu Asn Thr Asp Leu Arg Gln 
        195                 200                 205             


Glu Arg Glu Gln Ile Gly Arg Phe Met Gln Leu Val Val Glu His Lys 
    210                 215                 220                 


His Lys Ile Gly Phe Gln Gly Thr Leu Leu Ile Glu Pro Lys Pro Gln 
225                 230                 235                 240 


Glu Pro Thr Lys His Gln Tyr Asp Tyr Asp Ala Ala Thr Val Tyr Gly 
                245                 250                 255     


Phe Leu Lys Gln Phe Gly Leu Glu Lys Glu Ile Lys Leu Asn Ile Glu 
            260                 265                 270         


Ala Asn His Ala Thr Leu Ala Gly His Ser Phe His His Glu Ile Ala 
        275                 280                 285             


Thr Ala Ile Ala Leu Gly Leu Phe Gly Ser Val Asp Ala Asn Arg Gly 
    290                 295                 300                 


Asp Ala Gln Leu Gly Trp Asp Thr Asp Gln Phe Pro Asn Ser Val Glu 
305                 310                 315                 320 


Glu Asn Ala Leu Val Met Tyr Glu Ile Leu Lys Ala Gly Gly Phe Thr 
                325                 330                 335     


Thr Gly Gly Leu Asn Phe Asp Ala Lys Val Arg Arg Gln Ser Thr Asp 
            340                 345                 350         


Lys Tyr Asp Leu Phe Tyr Gly His Ile Gly Ala Met Asp Thr Met Ala 
        355                 360                 365             


Leu Ser Leu Lys Ile Ala Ala Arg Met Ile Glu Asp Gly Gly Leu Asp 
    370                 375                 380                 


Gln Arg Val Ala Lys Arg Tyr Ala Gly Trp Asn Gly Glu Leu Gly Gln 
385                 390                 395                 400 


Gln Ile Leu Lys Gly Gln Met Thr Leu Thr Glu Ile Ala Gln Tyr Ala 
                405                 410                 415     


Glu Gln His Asn Leu Ala Pro Val His Gln Ser Gly His Gln Glu Gln 
            420                 425                 430         


Leu Glu Asn Leu Val Asn His Tyr Leu Phe Asp Lys 
        435                 440                 


<210>  42
<211>  1332
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  42
atggaattga tcatgcaagc ttacttcgac caattggaca gagtcagatt cgaaggtact       60

aagtctacta acccattggc tttcagacac tacaacccag acgaaatcgt tttgggtaaa      120

agaatggaag atcacttgag attcgctgct tgttactggc acaccttctg ttggaacggt      180

gctgacatgt tcggtatggg tgctttcgat agaccatggc aacaaccagg tgaagctttg      240

gctttggcta agagaaaggc tgacgttgct ttcgaatttt tccacaagtt gaacgtccca      300

tactactgtt tccacgacgt tgatgtctct ccagaaggtg cttccttgaa ggaatacaag      360

aacaacttcg ctcaaatggt tgacgttttg gctgctaagc aagaacaatc tggtgtcaag      420

ttgttgtggg gtactgctaa ctgtttcact aacccaagat acggtgctgg tgctgctacc      480

aacccagacc cagaagtttt ctcctgggct gctacccaag ttgtcactgc tatggatgct      540

actcacaagt tgggtggtga aaactacgtc ttgtggggtg gtagagaagg ttacgaaacc      600

ttgttgaaca ctgacttgag acaagaaaga gaacaaatcg gtagattcat gcaattggtt      660

gtcgaacaca agcacaagat cggtttccaa ggtactttgt tgatcgaacc aaagccacaa      720

gaaccaacca agcaccaata cgactacgat gctgctactg tttacggttt cttgaagcaa      780

ttcggtttgg aaaaggaaat caagttgaac atcgaagcta accacgctac cttggctggt      840

cactctttcc accacgaaat cgctactgct atcgctttgg gtttgttcgg ttccgttgac      900

gctaacagag gtgacgctca attgggttgg gacactgatc aattcccaaa ctctgttgaa      960

gaaaacgctt tggtcatgta cgaaatcttg aaggctggtg gtttcaccac tggtggtttg     1020

aacttcgacg ctaaggttag aagacaatct accgacaagt acgatttgtt ctacggtcac     1080

atcggtgcta tggacactat ggctttgtcc ttgaagatcg ctgctagaat gatcgaagac     1140

ggtggtttgg atcaaagagt cgctaagaga tacgctggtt ggaacggtga attgggtcaa     1200

caaatcttga agggtcaaat gaccttgact gaaatcgctc aatacgctga acaacacaac     1260

ttggctccag ttcaccaatc tggtcaccaa gaacaattgg aaaacttggt caaccactac     1320

ttgttcgaca ag                                                         1332


<210>  43
<211>  440
<212>  PRT
<213>  Escherichia blattae

<400>  43

Met Pro Thr Tyr Phe Asp Gln Ile Asp Arg Val Arg Phe Glu Gly Pro 
1               5                   10                  15      


Lys Thr Thr Asn Pro Leu Ala Phe Arg His Tyr Asn Pro Asp Glu Leu 
            20                  25                  30          


Val Leu Gly Lys Arg Met Glu Asp His Leu Arg Phe Ala Ala Cys Tyr 
        35                  40                  45              


Trp His Asn Phe Cys Trp Asn Gly Ala Asp Met Phe Gly Val Gly Ser 
    50                  55                  60                  


Phe Asp Arg Pro Trp Gln His Pro Gly Ser Ala Leu Glu Met Ala Arg 
65                  70                  75                  80  


Gln Lys Ala Asp Val Ala Phe Glu Phe Phe His Lys Leu Asn Val Pro 
                85                  90                  95      


Tyr Tyr Cys Phe His Asp Val Asp Val Ser Pro Glu Gly Ala Ser Leu 
            100                 105                 110         


Lys Glu Tyr Leu Glu Asn Phe Ala His Met Val Asp Val Leu Ala Glu 
        115                 120                 125             


Lys Gln Gln Gln Ser Gly Val Lys Leu Leu Trp Gly Thr Ala Asn Cys 
    130                 135                 140                 


Phe Thr Asn Pro Arg Phe Gly Ala Gly Ala Ala Thr Asn Pro Asp Pro 
145                 150                 155                 160 


Glu Val Phe Ala Met Ala Ala Thr Gln Val Phe Thr Ala Met Asn Ala 
                165                 170                 175     


Thr Gln Lys Leu Gly Gly Glu Asn Tyr Val Leu Trp Gly Gly Arg Glu 
            180                 185                 190         


Gly Tyr Glu Ser Leu Leu Asn Thr Asp Leu Arg Gln Glu Arg Glu Gln 
        195                 200                 205             


Ile Gly Arg Phe Met Gln Met Val Val Glu His Lys His Lys Ile Gly 
    210                 215                 220                 


Phe Arg Gly Thr Leu Leu Ile Glu Pro Lys Pro Gln Glu Pro Thr Lys 
225                 230                 235                 240 


His Gln Tyr Asp Tyr Asp Val Ala Thr Val Tyr Gly Phe Leu Lys Gln 
                245                 250                 255     


Phe Gly Leu Glu Lys Glu Ile Lys Val Asn Ile Glu Ala Asn His Ala 
            260                 265                 270         


Thr Leu Ala Gly His Ser Phe His His Glu Ile Ala Ser Ala Ile Ala 
        275                 280                 285             


Leu Gly Ile Phe Gly Ser Val Asp Ala Asn Arg Gly Asp Ala Gln Leu 
    290                 295                 300                 


Gly Trp Asp Thr Asp Gln Phe Pro Asn Ser Val Glu Glu Asn Ser Leu 
305                 310                 315                 320 


Val Met Tyr Glu Ile Leu Lys Ala Gly Gly Phe Thr Thr Gly Gly Leu 
                325                 330                 335     


Asn Phe Asp Ala Lys Val Arg Arg Gln Ser Thr Asp Lys Tyr Asp Leu 
            340                 345                 350         


Phe Tyr Gly His Ile Gly Ala Met Asp Thr Met Ala Leu Ser Leu Lys 
        355                 360                 365             


Ile Ala Ala Arg Met Ile Glu Asp Gly Glu Leu Asp Lys Arg Val Ala 
    370                 375                 380                 


Arg Arg Tyr Ser Gly Trp Ser Ser Glu Leu Gly Gln Gln Ile Leu Lys 
385                 390                 395                 400 


Gly Gln Met Ser Leu Ala Gln Leu Ala Gln Tyr Ala Gln Gln His Gln 
                405                 410                 415     


Leu Asp Pro His His Gln Ser Gly His Gln Glu Leu Leu Glu Asn Leu 
            420                 425                 430         


Val Asn His Tyr Ile Phe Asp Lys 
        435                 440 


<210>  44
<211>  1320
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  44
atgccaactt acttcgatca aatcgacaga gtcagattcg aaggtccaaa gaccactaac       60

ccattggctt tcagacacta caacccagac gaattggttt tgggtaaaag aatggaagat      120

cacttgagat tcgctgcttg ttactggcac aacttctgtt ggaacggtgc tgacatgttc      180

ggtgtcggtt ctttcgatag accatggcaa cacccaggtt ccgctttgga aatggctaga      240

caaaaggctg acgttgcttt cgaatttttc cacaagttga acgtcccata ctactgtttc      300

cacgacgttg atgtctctcc agaaggtgct tccttgaagg aatacttgga aaacttcgct      360

cacatggttg acgttttggc tgaaaagcaa caacaatctg gtgttaagtt gttgtggggt      420

actgctaact gtttcactaa cccaagattc ggtgctggtg ctgctaccaa cccagaccca      480

gaagttttcg ctatggctgc tacccaagtc ttcactgcta tgaacgctac tcaaaagttg      540

ggtggtgaaa actacgtctt gtggggtggt agagaaggtt acgaatcttt gttgaacacc      600

gacttgagac aagaaagaga acaaatcggt agattcatgc aaatggttgt cgaacacaag      660

cacaagatcg gtttcagagg tactttgttg atcgaaccaa agccacaaga accaaccaag      720

caccaatacg actacgatgt tgctactgtc tacggtttct tgaagcaatt cggtttggaa      780

aaggaaatca aggttaacat cgaagctaac cacgctacct tggctggtca ctctttccac      840

cacgaaatcg cttccgctat cgctttgggt atcttcggtt ctgttgacgc taacagaggt      900

gacgctcaat tgggttggga cactgatcaa ttcccaaact ctgttgaaga aaactccttg      960

gtcatgtacg aaatcttgaa ggctggtggt ttcaccactg gtggtttgaa cttcgacgct     1020

aaggttagaa gacaatctac cgacaagtac gatttgttct acggtcacat cggtgctatg     1080

gacactatgg ctttgtcctt gaagatcgct gctagaatga tcgaagacgg tgaattggat     1140

aagagagtcg ctagaagata ctctggttgg tcttccgaat tgggtcaaca aatcttgaag     1200

ggtcaaatgt ccttggctca attggctcaa tacgctcaac aacaccaatt ggacccacac     1260

caccaatctg gtcaccaaga attgttggaa aacttggtta accactacat cttcgataag     1320


<210>  45
<211>  438
<212>  PRT
<213>  Pseudomonas fluorescens

<400>  45

Met Pro Tyr Phe Pro Gly Val Glu Lys Val Arg Phe Glu Gly Pro Ala 
1               5                   10                  15      


Ser Thr Ser Ala Leu Ala Phe Arg His Tyr Asp Ala Asn Lys Leu Ile 
            20                  25                  30          


Leu Gly Lys Pro Met Arg Glu His Leu Arg Met Ala Ala Cys Tyr Trp 
        35                  40                  45              


His Thr Phe Val Trp Pro Gly Ala Asp Met Phe Gly Met Gly Thr Phe 
    50                  55                  60                  


Lys Arg Pro Trp Gln Arg Ser Gly Asp Pro Met Glu Val Ala Ile Gly 
65                  70                  75                  80  


Lys Ala Glu Ala Ala Phe Glu Phe Phe Ser Lys Leu Gly Ile Asp Tyr 
                85                  90                  95      


Tyr Ser Phe His Asp Thr Asp Val Ala Pro Glu Gly Ser Ser Leu Lys 
            100                 105                 110         


Glu Tyr Arg Asn His Phe Ala Gln Met Val Asp His Leu Glu Arg His 
        115                 120                 125             


Gln Glu Gln Thr Gly Ile Lys Leu Leu Trp Gly Thr Ala Asn Cys Phe 
    130                 135                 140                 


Ser Asn Pro Arg Phe Ala Ala Gly Ala Ala Ser Asn Pro Asp Pro Glu 
145                 150                 155                 160 


Val Phe Ala Phe Ala Ala Ala Gln Val Phe Ser Ala Met Asn Ala Thr 
                165                 170                 175     


Leu Arg Leu Lys Gly Ala Asn Tyr Val Leu Trp Gly Gly Arg Glu Gly 
            180                 185                 190         


Tyr Glu Thr Leu Leu Asn Thr Asp Leu Lys Arg Glu Arg Glu Gln Leu 
        195                 200                 205             


Gly Arg Phe Met Arg Met Val Val Glu His Lys His Lys Ile Gly Phe 
    210                 215                 220                 


Lys Gly Asp Leu Leu Ile Glu Pro Lys Pro Gln Glu Pro Thr Lys His 
225                 230                 235                 240 


Gln Tyr Asp Tyr Asp Ser Ala Thr Val Phe Gly Phe Leu His Glu Tyr 
                245                 250                 255     


Gly Leu Glu His Glu Ile Lys Val Asn Ile Glu Ala Asn His Ala Thr 
            260                 265                 270         


Leu Ala Gly His Ser Phe His His Glu Ile Ala Thr Ala Val Ser Leu 
        275                 280                 285             


Gly Ile Phe Gly Ser Ile Asp Ala Asn Arg Gly Asp Pro Gln Asn Gly 
    290                 295                 300                 


Trp Asp Thr Asp Gln Phe Pro Asn Ser Val Glu Glu Met Thr Leu Ala 
305                 310                 315                 320 


Thr Tyr Glu Ile Leu Lys Ala Gly Gly Phe Lys Asn Gly Gly Tyr Asn 
                325                 330                 335     


Phe Asp Ser Lys Val Arg Arg Gln Ser Leu Asp Glu Val Asp Leu Phe 
            340                 345                 350         


His Gly His Val Ala Ala Met Asp Val Leu Ala Leu Ala Leu Glu Arg 
        355                 360                 365             


Ala Ala Ala Met Val Gln Asp Asp Arg Leu Gln Gln Phe Lys Glu Gln 
    370                 375                 380                 


Arg Tyr Ala Gly Trp Gln Gln Pro Leu Gly Gln Ala Val Leu Ala Gly 
385                 390                 395                 400 


Glu Phe Ser Leu Glu Ser Leu Ala Glu His Ala Phe Ala Asn Glu Leu 
                405                 410                 415     


Asn Pro Gln Ala Val Ser Gly Arg Gln Glu Met Leu Glu Gly Val Val 
            420                 425                 430         


Asn Arg Phe Ile Tyr Arg 
        435             


<210>  46
<211>  1314
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  46
atgccatact tcccaggtgt tgaaaaggtc agattcgaag gtccagcttc cacttccgct       60

ttggctttca gacactacga cgctaacaag ttgatcttgg gtaaaccaat gagagaacac      120

ttgagaatgg ctgcttgtta ctggcacacc ttcgtctggc caggtgctga catgttcggt      180

atgggtactt tcaagagacc atggcaaaga tctggtgacc caatggaagt tgctatcggt      240

aaagctgaag ctgctttcga atttttctct aagttgggta tcgactacta ctccttccac      300

gacaccgatg ttgctccaga aggttcttcc ttgaaggaat acagaaacca cttcgctcaa      360

atggttgacc acttggaaag acaccaagaa caaaccggta tcaagttgtt gtggggtact      420

gctaactgtt tctctaaccc aagattcgct gctggtgctg cttccaaccc agacccagaa      480

gttttcgctt tcgctgctgc tcaagtcttc tctgctatga acgctacttt gagattgaag      540

ggtgctaact acgtcttgtg gggtggtaga gaaggttacg aaaccttgtt gaacactgac      600

ttgaagagag aaagagaaca attgggtaga ttcatgagaa tggttgtcga acacaagcac      660

aagatcggtt tcaagggtga cttgttgatc gaaccaaagc cacaagaacc aaccaagcac      720

caatacgact acgattctgc tactgttttc ggtttcttgc acgaatacgg tttggaacac      780

gaaatcaagg tcaacatcga agctaaccac gctaccttgg ctggtcactc cttccaccac      840

gaaatcgcta ctgctgtctc tttgggtatc ttcggttcca tcgatgctaa cagaggtgac      900

ccacaaaacg gttgggacac cgatcaattc ccaaactctg ttgaagaaat gaccttggct      960

acttacgaaa tcttgaaggc tggtggtttc aagaacggtg gttacaactt cgactctaag     1020

gttagaagac aatccttgga cgaagtcgat ttgttccacg gtcacgttgc tgctatggat     1080

gtcttggctt tggctttgga aagagctgct gctatggttc aagacgatag attgcaacaa     1140

ttcaaggaac aaagatacgc tggttggcaa caaccattgg gtcaagctgt cttggctggt     1200

gaattttctt tggaatcctt ggctgaacac gctttcgcta acgaattgaa cccacaagct     1260

gtttctggta gacaagaaat gttggaaggt gttgtcaaca gattcatcta caga           1314


<210>  47
<211>  439
<212>  PRT
<213>  Photobacterium profundum

<400>  47

Met Thr Glu Phe Phe Lys Asn Ile Asn Lys Ile Gln Phe Glu Gly Thr 
1               5                   10                  15      


Asp Ala Ile Asn Pro Leu Ala Phe Arg His Tyr Asp Ala Glu Arg Met 
            20                  25                  30          


Ile Leu Gly Lys Ser Met Lys Glu His Leu Arg Phe Ala Ala Cys Tyr 
        35                  40                  45              


Trp His Asn Phe Cys Trp Pro Gly Ser Asp Val Phe Gly Ala Ala Thr 
    50                  55                  60                  


Phe Asp Arg Pro Trp Leu Gln Ser Gly Asn Ala Met Glu Met Ala His 
65                  70                  75                  80  


Met Lys Ala Asp Ala Ala Phe Asp Phe Phe Ser Lys Leu Gly Val Pro 
                85                  90                  95      


Tyr Tyr Cys Phe His Asp Thr Asp Ile Ala Pro Glu Gly Thr Ser Leu 
            100                 105                 110         


Lys Glu Tyr Val Asn Asn Phe Ala Gln Met Val Asp Val Leu Glu Gln 
        115                 120                 125             


Lys Gln Asp Glu Thr Gly Leu Lys Leu Leu Trp Gly Thr Ala Asn Ala 
    130                 135                 140                 


Phe Ser Asn Pro Arg Tyr Met Ser Gly Ala Gly Thr Asn Pro Asp Pro 
145                 150                 155                 160 


Lys Val Phe Ala Tyr Ala Ala Thr Gln Ile Phe Asn Ala Met Gly Ala 
                165                 170                 175     


Thr Gln Arg Leu Gly Gly Glu Asn Tyr Val Leu Trp Gly Gly Arg Glu 
            180                 185                 190         


Gly Tyr Glu Thr Leu Leu Asn Thr Asp Leu Arg Gln Glu Arg Glu Gln 
        195                 200                 205             


Leu Gly Arg Leu Met Gln Met Val Val Glu His Lys His Lys Ile Gly 
    210                 215                 220                 


Phe Lys Gly Thr Ile Leu Ile Glu Pro Lys Pro Gln Glu Pro Thr Lys 
225                 230                 235                 240 


His Gln Tyr Asp Tyr Asp Thr Ala Thr Val Tyr Gly Phe Leu Lys Gln 
                245                 250                 255     


Phe Gly Leu Glu Asn Glu Ile Lys Val Asn Ile Glu Ala Asn His Ala 
            260                 265                 270         


Thr Leu Ala Gly His Ser Phe Gln His Glu Ile Ala Thr Ala Thr Ser 
        275                 280                 285             


Leu Gly Leu Phe Gly Ser Ile Asp Ala Asn Arg Gly Asp Pro Gln Leu 
    290                 295                 300                 


Gly Trp Asp Thr Asp Gln Phe Pro Asn Ser Val Glu Glu Asn Thr Leu 
305                 310                 315                 320 


Val Met Tyr Glu Ile Leu Lys Ala Gly Gly Phe Thr Thr Gly Gly Phe 
                325                 330                 335     


Asn Phe Asp Ser His Val Arg Arg Pro Ser Ile Asp Ala Glu Asp Leu 
            340                 345                 350         


Phe Tyr Gly His Ile Gly Gly Met Asp Thr Met Ala Leu Ala Leu Glu 
        355                 360                 365             


Arg Ala Ala Asn Met Ile Glu Asn Asp Val Leu Ser Lys Asn Ile Ala 
    370                 375                 380                 


Gln Arg Tyr Ala Gly Trp Asn Glu Asp Leu Gly Lys Lys Ile Leu Ser 
385                 390                 395                 400 


Gly Asp His Ser Leu Glu Thr Leu Ala Lys Phe Ala Leu Asp Ser Asn 
                405                 410                 415     


Ile Ala Pro Val Lys Glu Ser Gly Arg Gln Glu His Leu Glu Asn Ile 
            420                 425                 430         


Val Asn Gly Phe Ile Tyr Lys 
        435                 


<210>  48
<211>  1317
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  48
atgaccgagt tcttcaagaa catcaacaag atccaattcg aaggtactga cgctatcaac       60

ccattggctt tcagacacta cgacgctgaa agaatgatct tgggtaaatc tatgaaggaa      120

cacttgagat tcgctgcttg ttactggcac aacttctgtt ggccaggttc tgacgttttc      180

ggtgctgcta ccttcgatag accatggttg caatccggta acgctatgga aatggctcac      240

atgaaggctg acgctgcttt cgatttcttc tctaagttgg gtgttccata ctactgtttc      300

cacgacaccg atatcgctcc agaaggtact tccttgaagg aatacgtcaa caacttcgct      360

caaatggttg acgttttgga acaaaagcaa gatgaaaccg gtttgaagtt gttgtggggt      420

actgctaacg ctttctctaa cccaagatac atgtccggtg ctggtactaa cccagaccca      480

aaggttttcg cttacgctgc tacccaaatc ttcaacgcta tgggtgctac tcaaagattg      540

ggtggtgaaa actacgtctt gtggggtggt agagaaggtt acgaaacctt gttgaacact      600

gacttgagac aagaaagaga acaattgggt agattgatgc aaatggttgt cgaacacaag      660

cacaagatcg gtttcaaggg tactatcttg atcgaaccaa agccacaaga accaactaag      720

caccaatacg actacgatac cgctactgtt tacggtttct tgaagcaatt cggtttggaa      780

aacgaaatca aggtcaacat cgaagctaac cacgctacct tggctggtca ctctttccaa      840

cacgaaatcg ctaccgctac ttctttgggt ttgttcggtt ccatcgatgc taacagaggt      900

gacccacaat tgggttggga caccgatcaa ttcccaaact ctgttgaaga aaacactttg      960

gtcatgtacg aaatcttgaa ggctggtggt ttcaccactg gtggtttcaa cttcgactct     1020

cacgttagaa gaccatccat cgacgctgaa gatttgttct acggtcacat cggtggtatg     1080

gacaccatgg ctttggcttt ggaaagagct gctaacatga tcgaaaacga cgttttgtct     1140

aagaacatcg ctcaaagata cgctggttgg aacgaagact tgggtaaaaa gatcttgtct     1200

ggtgaccact ccttggaaac tttggctaag ttcgctttgg actccaacat cgctccagtt     1260

aaggaatctg gtagacaaga acacttggaa aacatcgtca acggtttcat ctacaag        1317


<210>  49
<211>  440
<212>  PRT
<213>  Pantoea stewartii

<400>  49

Met His Ala Tyr Phe Asp Gln Leu Asp Arg Val Arg Tyr Glu Gly Ala 
1               5                   10                  15      


Lys Thr Ile Asn Pro Leu Ala Phe Arg His Tyr Asn Pro Asp Glu Val 
            20                  25                  30          


Ile Leu Gly Lys Thr Met Ala Glu His Leu Arg Phe Ala Ala Cys Tyr 
        35                  40                  45              


Trp His Thr Phe Cys Trp Asn Gly Ala Asp Met Phe Gly Val Gly Ala 
    50                  55                  60                  


Phe Asp Arg Pro Trp Gln Lys Ala Gly Asp Ala Leu Ala Leu Ala Lys 
65                  70                  75                  80  


Leu Lys Ala Asp Val Ala Phe Glu Phe Phe His Lys Leu Asn Val Pro 
                85                  90                  95      


Tyr Tyr Cys Phe His Asp Val Asp Val Ser Pro Glu Gly Asp Ser Leu 
            100                 105                 110         


Lys Ser Tyr Arg Glu Asn Leu Ala Val Met Thr Asp Thr Leu Gln Ala 
        115                 120                 125             


Lys Gln Gln Glu Thr Gly Leu Lys Leu Leu Trp Gly Thr Ala Asn Cys 
    130                 135                 140                 


Phe Thr His Pro Arg Tyr Gly Ala Gly Ala Ala Thr Asn Pro Asp Pro 
145                 150                 155                 160 


Glu Val Phe Ser Trp Ala Ala Ser Gln Val Cys Ser Ala Met Lys Ala 
                165                 170                 175     


Thr Gln Thr Leu Gly Gly Glu Asn Tyr Val Leu Trp Gly Gly Arg Glu 
            180                 185                 190         


Gly Tyr Glu Thr Leu Leu Asn Thr Asp Leu Arg Gln Glu Arg Glu Gln 
        195                 200                 205             


Ile Gly Arg Phe Met Gln Met Val Val Glu His Lys His Lys Ile Gly 
    210                 215                 220                 


Phe Gln Gly Thr Leu Leu Ile Glu Pro Lys Pro Gln Glu Pro Thr Lys 
225                 230                 235                 240 


His Gln Tyr Asp Tyr Asp Val Ala Thr Val Tyr Gly Phe Leu Lys Gln 
                245                 250                 255     


Phe Gly Leu Glu Lys Glu Ile Lys Val Asn Val Glu Ala Asn His Ala 
            260                 265                 270         


Thr Leu Ala Gly His Ser Phe His His Glu Ile Ala Thr Ala Ile Ala 
        275                 280                 285             


Leu Gly Val Phe Gly Ser Val Asp Ala Asn Arg Gly Asp Ala Gln Cys 
    290                 295                 300                 


Gly Trp Asp Thr Asp Gln Phe Pro Val Ser Val Glu Glu Asn Ala Leu 
305                 310                 315                 320 


Val Met Tyr Glu Ile Ile Lys Ala Gly Gly Phe Thr Thr Gly Gly Leu 
                325                 330                 335     


Asn Phe Asp Ala Lys Val Arg Arg Gln Ser Thr Asp Lys Tyr Asp Leu 
            340                 345                 350         


Phe Tyr Gly His Ile Gly Ala Met Asp Thr Met Ala Leu Ala Leu Lys 
        355                 360                 365             


Val Ala Ala Arg Met Leu Ser Asp Gly Glu Leu Asp Gln Arg Val Ala 
    370                 375                 380                 


Gln Arg Tyr Ser Gly Trp Asn Gly Glu Phe Gly Gln Gln Ile Leu Lys 
385                 390                 395                 400 


Gly Glu Phe Ser Leu Glu Thr Leu Ala Ala His Ala His Gln Gln Gln 
                405                 410                 415     


Phe Asn Pro Gln His Arg Ser Gly Arg Gln Glu Gln Leu Glu Asn Leu 
            420                 425                 430         


Val Asn His Tyr Leu Tyr Asp Phe 
        435                 440 


<210>  50
<211>  1320
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  50
atgcacgctt acttcgatca attggacaga gtcagatacg aaggtgctaa gaccatcaac       60

ccattggctt tcagacacta caacccagac gaagttatct tgggtaaaac catggctgaa      120

cacttgagat tcgctgcttg ttactggcac actttctgtt ggaacggtgc tgacatgttc      180

ggtgtcggtg ctttcgatag accatggcaa aaggctggtg acgctttggc tttggctaag      240

ttgaaggctg acgttgcttt cgaatttttc cacaagttga acgtcccata ctactgtttc      300

cacgacgttg atgtctctcc agaaggtgac tctttgaagt cctacagaga aaacttggct      360

gttatgaccg acactttgca agctaagcaa caagaaaccg gtttgaagtt gttgtggggt      420

actgctaact gtttcactca cccaagatac ggtgctggtg ctgctactaa cccagaccca      480

gaagttttct cttgggctgc ttcccaagtc tgttctgcta tgaaggctac ccaaactttg      540

ggtggtgaaa actacgtctt gtggggtggt agagaaggtt acgaaacctt gttgaacact      600

gacttgagac aagaaagaga acaaatcggt agattcatgc aaatggttgt cgaacacaag      660

cacaagatcg gtttccaagg tactttgttg atcgaaccaa agccacaaga accaaccaag      720

caccaatacg actacgatgt tgctactgtc tacggtttct tgaagcaatt cggtttggaa      780

aaggaaatca aggttaacgt cgaagctaac cacgctacct tggctggtca ctccttccac      840

cacgaaatcg ctactgctat cgctttgggt gttttcggtt ctgttgacgc taacagaggt      900

gacgctcaat gtggttggga cactgatcaa ttcccagttt ccgtcgaaga aaacgctttg      960

gttatgtacg aaatcatcaa ggctggtggt ttcaccactg gtggtttgaa cttcgatgct     1020

aaggtcagaa gacaatctac cgacaagtac gatttgttct acggtcacat cggtgctatg     1080

gacactatgg ctttggcttt gaaggttgct gctagaatgt tgtccgacgg tgaattggat     1140

caaagagtcg ctcaaagata ctctggttgg aacggtgaat ttggtcaaca aatcttgaag     1200

ggtgaatttt ctttggaaac cttggctgct cacgctcacc aacaacaatt caacccacaa     1260

cacagatctg gtagacaaga acaattggaa aacttggtta accactactt gtacgacttc     1320


<210>  51
<211>  440
<212>  PRT
<213>  Plautia stali symbiont

<400>  51

Met His Ala Tyr Phe Asp Gln Leu Glu Arg Val Gly Tyr Glu Gly Ala 
1               5                   10                  15      


Asn Thr Thr Asn Ala Leu Ala Phe Arg His Tyr Asn Pro Gln Glu Val 
            20                  25                  30          


Ile Leu Gly Lys Thr Met Ala Glu His Leu Arg Phe Ala Ala Cys Tyr 
        35                  40                  45              


Trp His Thr Phe Cys Trp Asn Gly Ala Asp Met Phe Gly Val Gly Ala 
    50                  55                  60                  


Phe Asp Arg Pro Trp Gln Lys Asn Gly Asp Ala Leu Gln Leu Ala Lys 
65                  70                  75                  80  


Leu Lys Ala Asp Val Ala Phe Glu Phe Phe Tyr Lys Leu Asn Val Pro 
                85                  90                  95      


Tyr Tyr Cys Phe His Asp Val Asp Val Ser Pro Glu Gly Asp Ser Leu 
            100                 105                 110         


Arg Ser Tyr Gln Glu Asn Leu Ala Val Ile Thr Asp Lys Leu Leu Glu 
        115                 120                 125             


Lys Gln Gln Glu Thr Gly Val Lys Leu Leu Trp Gly Thr Ala Asn Cys 
    130                 135                 140                 


Phe Thr His Pro Arg Tyr Ala Ala Gly Ala Ala Thr Ser Pro Asp Pro 
145                 150                 155                 160 


Glu Ile Phe Ala Trp Ala Ala Ser Gln Val Cys Ser Ala Met Gln Ala 
                165                 170                 175     


Thr Gln Thr Leu Gly Gly Glu Asn Tyr Val Leu Trp Gly Gly Arg Glu 
            180                 185                 190         


Gly Tyr Glu Thr Leu Leu Asn Thr Asp Leu Arg Gln Glu Arg Glu Gln 
        195                 200                 205             


Ile Gly Arg Phe Met Gln Met Val Val Glu His Lys His Lys Ile Gly 
    210                 215                 220                 


Phe Gln Gly Met Leu Leu Ile Glu Pro Lys Pro Gln Glu Pro Thr Lys 
225                 230                 235                 240 


His Gln Tyr Asp Phe Asp Val Ala Met Val Tyr Gly Phe Leu Arg Gln 
                245                 250                 255     


Phe Gly Leu Glu Lys Glu Ile Lys Val Asn Val Glu Ala Asn His Ala 
            260                 265                 270         


Thr Leu Ala Gly His Ser Phe His His Glu Ile Ala Thr Ala Ile Ala 
        275                 280                 285             


Leu Gly Ile Phe Gly Ser Val Asp Ala Asn Arg Gly Asp Ser Gln Cys 
    290                 295                 300                 


Gly Trp Asp Thr Asp Gln Phe Pro Asn Ser Val Glu Glu Asn Ala Leu 
305                 310                 315                 320 


Val Met Tyr Glu Ile Leu Lys Ala Gly Gly Phe Thr Thr Gly Gly Leu 
                325                 330                 335     


Asn Phe Asp Ala Lys Val Arg Arg Gln Ser Thr Asp Lys Tyr Asp Leu 
            340                 345                 350         


Phe Tyr Gly His Ile Gly Ala Met Asp Thr Met Ala Leu Ala Leu Lys 
        355                 360                 365             


Val Ala Ala Arg Met Val Ser Asp Gly Glu Leu Asp Lys Arg Val Ala 
    370                 375                 380                 


Gln Arg Tyr Ser Gly Trp Asn Gly Glu Phe Gly Gln Gln Ile Leu Lys 
385                 390                 395                 400 


Gly Glu Phe Ser Leu Ala Ser Leu Ala Ala His Ala Gln Gln Leu Gln 
                405                 410                 415     


Leu Asn Pro Gln His Arg Ser Gly Arg Gln Glu Gln Leu Glu Asn Leu 
            420                 425                 430         


Val Asn His Tyr Leu Tyr Asn Phe 
        435                 440 


<210>  52
<211>  1320
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  52
atgcacgctt acttcgatca attggaaaga gtcggttacg aaggtgctaa cactactaac       60

gctttggctt tcagacacta caacccacaa gaagttatct tgggtaaaac catggctgaa      120

cacttgagat tcgctgcttg ttactggcac actttctgtt ggaacggtgc tgacatgttc      180

ggtgtcggtg ctttcgatag accatggcaa aagaacggtg acgctttgca attggctaag      240

ttgaaggctg acgttgcttt cgaatttttc tacaagttga acgtcccata ctactgtttc      300

cacgacgttg atgtctctcc agaaggtgac tctttgagat cctaccaaga aaacttggct      360

gttatcaccg acaagttgtt ggaaaagcaa caagaaactg gtgtcaagtt gttgtggggt      420

actgctaact gtttcactca cccaagatac gctgctggtg ctgctacctc cccagaccca      480

gaaatcttcg cttgggctgc ttctcaagtt tgttccgcta tgcaagctac ccaaactttg      540

ggtggtgaaa actacgtctt gtggggtggt agagaaggtt acgaaacctt gttgaacact      600

gacttgagac aagaaagaga acaaatcggt agattcatgc aaatggttgt cgaacacaag      660

cacaagatcg gtttccaagg tatgttgttg atcgaaccaa agccacaaga accaaccaag      720

caccaatacg acttcgatgt tgctatggtc tacggtttct tgagacaatt cggtttggaa      780

aaggaaatca aggttaacgt cgaagctaac cacgctacct tggctggtca ctctttccac      840

cacgaaatcg ctactgctat cgctttgggt atcttcggtt ctgttgacgc taacagaggt      900

gactcccaat gtggttggga cactgatcaa ttcccaaact ctgttgaaga aaacgctttg      960

gtcatgtacg aaatcttgaa ggctggtggt ttcaccactg gtggtttgaa cttcgacgct     1020

aaggttagaa gacaatccac cgacaagtac gatttgttct acggtcacat cggtgctatg     1080

gacactatgg ctttggcttt gaaggttgct gctagaatgg tctctgacgg tgaattggat     1140

aagagagtcg ctcaaagata ctccggttgg aacggtgaat ttggtcaaca aatcttgaag     1200

ggtgaatttt ctttggcttc tttggctgct cacgctcaac aattgcaatt gaacccacaa     1260

cacagatctg gtagacaaga acaattggaa aacttggtca accactactt atacaacttc     1320


<210>  53
<211>  438
<212>  PRT
<213>  Pseudomonas syringae

<400>  53

Met Ser Tyr Phe Pro Thr Val Asp Lys Val Ile Tyr Glu Gly Pro Asp 
1               5                   10                  15      


Ser Asp Ser Pro Leu Ala Phe Arg His Tyr Asp Ala Asp Arg Arg Val 
            20                  25                  30          


Leu Gly Lys Pro Met Arg Glu His Leu Arg Met Ala Ala Cys Tyr Trp 
        35                  40                  45              


His Ser Phe Val Trp Pro Gly Ala Asp Met Phe Gly Val Gly Thr Phe 
    50                  55                  60                  


Lys Arg Pro Trp Gln Arg Ala Gly Asp Pro Met Glu Leu Ala Ile Gly 
65                  70                  75                  80  


Lys Ala Glu Ala Ala Phe Glu Phe Phe Ser Lys Leu Gly Ile Asp Tyr 
                85                  90                  95      


Tyr Ser Phe His Asp Thr Asp Val Ala Pro Glu Gly Ser Ser Ile Arg 
            100                 105                 110         


Glu Tyr Gln Asn Asn Phe Ala Gln Met Val Asp Arg Leu Glu Arg His 
        115                 120                 125             


Gln Glu Gln Ser Gly Ile Lys Leu Leu Trp Gly Thr Ala Asn Cys Phe 
    130                 135                 140                 


Ser Asn Pro Arg Phe Ala Ala Gly Ala Ala Ser Asn Pro Asp Pro Glu 
145                 150                 155                 160 


Val Phe Ala Tyr Ala Gly Ala Gln Val Phe Ser Ala Met Asn Ala Thr 
                165                 170                 175     


Gln Arg Leu Lys Gly Ser Asn Tyr Val Leu Trp Gly Gly Arg Glu Gly 
            180                 185                 190         


Tyr Glu Thr Leu Leu Asn Thr Asp Leu Lys Arg Glu Arg Glu Gln Leu 
        195                 200                 205             


Gly Arg Phe Met Arg Met Val Val Glu His Lys His Lys Ile Gly Phe 
    210                 215                 220                 


Lys Gly Asp Leu Leu Ile Glu Pro Lys Pro Gln Glu Pro Thr Lys His 
225                 230                 235                 240 


Gln Tyr Asp Tyr Asp Ser Ala Thr Val Phe Gly Phe Leu His Gln Tyr 
                245                 250                 255     


Gly Leu Gln Asp Glu Ile Lys Val Asn Ile Glu Ala Asn His Ala Thr 
            260                 265                 270         


Leu Ala Gly His Ser Phe His His Glu Ile Ala Thr Ala Val Ser Leu 
        275                 280                 285             


Gly Ile Phe Gly Ser Ile Asp Ala Asn Arg Gly Asp Pro Gln Asn Gly 
    290                 295                 300                 


Trp Asp Thr Asp Gln Phe Pro Asn Ser Val Glu Glu Met Thr Leu Ala 
305                 310                 315                 320 


Thr Tyr Glu Ile Leu Lys Ala Gly Gly Phe Thr His Gly Gly Tyr Asn 
                325                 330                 335     


Phe Asp Ser Lys Val Arg Arg Gln Ser Leu Asp Asp Val Asp Leu Phe 
            340                 345                 350         


His Gly His Val Ala Ala Met Asp Val Leu Ala Leu Ser Leu Glu Arg 
        355                 360                 365             


Ala Ala Ala Met Val Gln Asn Asp Lys Leu Gln Gln Phe Lys Asp Gln 
    370                 375                 380                 


Arg Tyr Ala Gly Trp Gln Gln Pro Phe Gly Gln Ser Val Leu Ser Gly 
385                 390                 395                 400 


Gly Phe Ser Leu Ala Ser Leu Ala Glu His Ala Phe Ala Asn Glu Leu 
                405                 410                 415     


Asn Pro Gln Ala Val Ser Gly Arg Gln Glu Leu Leu Glu Gly Val Val 
            420                 425                 430         


Asn Arg Phe Ile Tyr Thr 
        435             


<210>  54
<211>  1314
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  54
atgtcctact tcccaaccgt tgataaggtc atctacgaag gtccagactc cgactcccca       60

ttggctttca gacactacga cgctgataga agagtcttgg gtaaaccaat gagagaacac      120

ttgagaatgg ctgcttgtta ctggcactct ttcgtttggc caggtgctga catgttcggt      180

gtcggtactt tcaagagacc atggcaaaga gctggtgacc caatggaatt ggctatcggt      240

aaagctgaag ctgctttcga atttttctct aagttgggta tcgactacta ctccttccac      300

gacactgatg ttgctccaga aggttcttcc atcagagaat accaaaacaa cttcgctcaa      360

atggttgaca gattggaaag acaccaagaa caatctggta tcaagttgtt gtggggtact      420

gctaactgtt tctctaaccc aagattcgct gctggtgctg cttccaaccc agacccagaa      480

gttttcgctt acgctggtgc tcaagtcttc tctgctatga acgctactca aagattgaag      540

ggttccaact acgttttgtg gggtggtaga gaaggttacg aaaccttgtt gaacactgac      600

ttgaagagag aaagagaaca attgggtaga ttcatgagaa tggttgtcga acacaagcac      660

aagatcggtt tcaagggtga cttgttgatc gaaccaaagc cacaagaacc aaccaagcac      720

caatacgact acgattctgc tactgttttc ggtttcttgc accaatacgg tttgcaagac      780

gaaatcaagg tcaacatcga agctaaccac gctaccttgg ctggtcactc cttccaccac      840

gaaatcgcta ctgctgtctc tttgggtatc ttcggttcca tcgatgctaa cagaggtgac      900

ccacaaaacg gttgggacac cgatcaattc ccaaactctg ttgaagaaat gaccttggct      960

acttacgaaa tcttgaaggc tggtggtttc actcacggtg gttacaactt cgactctaag     1020

gttagaagac aatccttgga cgacgttgac ttgttccacg gtcacgttgc tgctatggat     1080

gtcttggctt tgtctttgga aagagctgct gctatggttc aaaacgacaa gttgcaacaa     1140

ttcaaggatc aaagatacgc tggttggcaa caaccattcg gtcaatctgt cttgtccggt     1200

ggtttctctt tggcttcctt ggctgaacac gctttcgcta acgaattgaa cccacaagct     1260

gtttctggta gacaagaatt gttggaaggt gttgtcaaca gattcatcta cacc           1314


<210>  55
<211>  439
<212>  PRT
<213>  Vibrio sp.

<400>  55

Met Thr Glu Phe Phe Lys Asn Ile Asn Lys Ile Asn Phe Glu Gly Ala 
1               5                   10                  15      


Glu Ser Thr Asn Pro Leu Ala Phe Arg His Tyr Asp Ala Asp Lys Met 
            20                  25                  30          


Ile Leu Gly Lys Ser Met Ala Glu His Leu Arg Phe Ala Ala Cys Tyr 
        35                  40                  45              


Trp His Asn Phe Arg Trp Gly Gly Ala Asp Ile Phe Gly Asp Gly Thr 
    50                  55                  60                  


Phe Glu His Ala Trp Leu Asn Ala Ala Asp Pro Met Glu Gln Ala Leu 
65                  70                  75                  80  


Met Lys Ala Asp Ala Ala Phe Glu Phe Phe Thr Lys Leu Gly Val Pro 
                85                  90                  95      


Tyr Tyr Cys Phe His Asp Thr Asp Val Ala Pro Glu Gly Asn Ser Ile 
            100                 105                 110         


Lys Glu Tyr Ile Asn Asn Phe Gln Thr Met Val Asp Val Leu Glu Gln 
        115                 120                 125             


Lys Gln Glu Glu Thr Gly Met Lys Leu Leu Trp Gly Thr Ala Asn Ala 
    130                 135                 140                 


Phe Ser Asn Ala Arg Tyr Met Ala Gly Ala Gly Thr Asn Pro Asp Pro 
145                 150                 155                 160 


Lys Val Phe Ala Tyr Ala Ala Thr Gln Ile Phe Asn Ala Met Gly Ala 
                165                 170                 175     


Thr Gln Arg Leu Gly Gly Glu Asn Tyr Val Leu Trp Gly Gly Arg Glu 
            180                 185                 190         


Gly Tyr Glu Thr Leu Leu Asn Thr Asp Leu Arg Gln Glu Arg Glu Gln 
        195                 200                 205             


Leu Gly Arg Leu Met Gln Met Val Val Glu His Lys His Lys Ile Gly 
    210                 215                 220                 


Phe Lys Gly Ser Ile Leu Ile Glu Pro Lys Pro Gln Glu Pro Thr Lys 
225                 230                 235                 240 


His Gln Tyr Asp Tyr Asp Thr Ala Thr Val Tyr Gly Phe Leu Lys Gln 
                245                 250                 255     


Phe Gly Leu Glu Asn Glu Ile Lys Val Asn Ile Glu Ala Asn His Ala 
            260                 265                 270         


Thr Leu Ala Gly His Ser Phe His His Glu Val Ala Thr Ala Thr Ser 
        275                 280                 285             


Leu Gly Leu Phe Gly Ser Ile Asp Ala Asn Arg Gly Asp Pro Gln Leu 
    290                 295                 300                 


Gly Trp Asp Thr Asp Gln Phe Pro Asn Ser Val Glu Glu Asn Thr Leu 
305                 310                 315                 320 


Val Met Tyr Glu Ile Leu Lys Ala Gly Gly Phe Thr Thr Gly Gly Phe 
                325                 330                 335     


Asn Phe Asp Ala Arg Val Arg Arg Pro Ser Thr Glu Leu Glu Asp Leu 
            340                 345                 350         


Phe His Gly His Ile Gly Gly Met Asp Thr Met Ala Leu Ser Leu Glu 
        355                 360                 365             


Arg Ala Ala Asn Met Ile Glu Asn Asp Val Leu Ser Lys Asn Ile Ala 
    370                 375                 380                 


Glu Arg Tyr Ala Gly Trp Asn Asp Asp Leu Gly Gln Lys Ile Leu Lys 
385                 390                 395                 400 


Gly Asp Leu Ser Leu Ala Gly Leu Ala Ala Phe Thr Glu Glu Thr Asn 
                405                 410                 415     


Ile Asn Pro Val Lys Glu Ser Gly Arg Gln Glu Tyr Leu Glu Asn Val 
            420                 425                 430         


Val Asn Gly Phe Ile Tyr Lys 
        435                 


<210>  56
<211>  1317
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  56
atgaccgagt tcttcaagaa catcaacaag atcaacttcg aaggtgctga atccactaac       60

ccattggctt tcagacacta cgacgctgac aagatgatct tgggtaaatc tatggctgaa      120

cacttgagat tcgctgcttg ttactggcac aacttcagat ggggtggtgc tgacatcttc      180

ggtgacggta ctttcgaaca cgcttggttg aacgctgctg acccaatgga acaagctttg      240

atgaaggctg atgctgcttt cgaatttttc accaagttgg gtgttccata ctactgtttc      300

cacgacactg atgtcgctcc agaaggtaac tctatcaagg aatacatcaa caacttccaa      360

accatggttg acgttttgga acaaaagcaa gaagaaaccg gtatgaagtt gttgtggggt      420

actgctaacg ctttctccaa cgctagatac atggctggtg ctggtactaa cccagaccca      480

aaggttttcg cttacgctgc tacccaaatc ttcaacgcta tgggtgctac tcaaagattg      540

ggtggtgaaa actacgtctt gtggggtggt agagaaggtt acgaaacctt gttgaacact      600

gacttgagac aagaaagaga acaattgggt agattgatgc aaatggttgt cgaacacaag      660

cacaagatcg gtttcaaggg ttctatcttg atcgaaccaa agccacaaga accaaccaag      720

caccaatacg actacgatac cgctactgtt tacggtttct tgaagcaatt cggtttggaa      780

aacgaaatca aggtcaacat cgaagctaac cacgctactt tggctggtca ctccttccac      840

cacgaagttg ctaccgctac ttctttgggt ttgttcggtt ccatcgacgc taacagaggt      900

gacccacaat tgggttggga caccgatcaa ttcccaaact ctgttgaaga aaacactttg      960

gtcatgtacg aaatcttgaa ggctggtggt ttcaccactg gtggtttcaa cttcgacgct     1020

agagttagaa gaccatccac cgaattggaa gacttgttcc acggtcacat cggtggtatg     1080

gatactatgg ctttgtcttt ggaaagagct gctaacatga tcgaaaacga cgttttgtcc     1140

aagaacatcg ctgaaagata cgctggttgg aacgacgatt tgggtcaaaa gatcttgaag     1200

ggtgacttgt ctttggctgg tttggctgct ttcaccgaag aaactaacat caacccagtt     1260

aaggaatctg gtagacaaga atacttggaa aacgtcgtca acggtttcat ctacaag        1317


<210>  57
<211>  444
<212>  PRT
<213>  Yokenella regensburgei

<400>  57

Met Glu Phe Ile Met Gln Ser Tyr Phe Asp Gln Leu Glu Arg Val Arg 
1               5                   10                  15      


Tyr Glu Gly Pro Lys Ser Glu Asn Pro Leu Ala Phe Arg His Tyr Asn 
            20                  25                  30          


Pro Asp Glu Leu Val Leu Gly Lys Arg Met Glu Glu His Leu Arg Phe 
        35                  40                  45              


Ala Ala Cys Tyr Trp His Thr Phe Cys Trp Asn Gly Ala Asp Met Phe 
    50                  55                  60                  


Gly Val Gly Ala Phe Glu Arg Pro Trp Gln Gln Ala Gly Asp Ala Leu 
65                  70                  75                  80  


Ala Leu Ala Lys Arg Lys Ala Asp Val Ala Phe Glu Phe Phe His Lys 
                85                  90                  95      


Leu Asn Val Pro Tyr Tyr Cys Phe His Asp Val Asp Val Ser Pro Glu 
            100                 105                 110         


Gly Ala Ser Leu Lys Glu Tyr Arg Asn Asn Phe Ala Gln Met Val Asp 
        115                 120                 125             


Val Leu Ala Gln Lys Gln Gln Glu Ser Gly Val Lys Leu Leu Trp Gly 
    130                 135                 140                 


Thr Ala Asn Cys Phe Thr Asn Pro Arg Tyr Gly Ala Gly Ala Ala Thr 
145                 150                 155                 160 


Asn Pro Asp Pro Glu Val Phe Ser Trp Ala Ala Thr Gln Val Val Thr 
                165                 170                 175     


Ala Met Asp Ala Thr His Arg Leu Gly Gly Glu Asn Tyr Val Leu Trp 
            180                 185                 190         


Gly Gly Arg Glu Gly Tyr Glu Thr Leu Leu Asn Thr Asp Leu Arg Gln 
        195                 200                 205             


Glu Arg Glu Gln Ile Gly Arg Phe Met Gln Met Val Val Glu His Lys 
    210                 215                 220                 


His Lys Thr Gly Phe Gln Gly Thr Leu Leu Ile Glu Pro Lys Pro Gln 
225                 230                 235                 240 


Glu Pro Thr Lys His Gln Tyr Asp Tyr Asp Ala Ala Thr Val Tyr Gly 
                245                 250                 255     


Phe Leu Lys Gln Phe Gly Leu Glu Lys Glu Ile Lys Leu Asn Ile Glu 
            260                 265                 270         


Ala Asn His Ala Thr Leu Ala Gly His Ser Phe His His Glu Ile Ala 
        275                 280                 285             


Thr Ala Ile Ala Leu Gly Leu Phe Gly Ser Val Asp Ala Asn Arg Gly 
    290                 295                 300                 


Asp Ala Gln Leu Gly Trp Asp Thr Asp Gln Phe Pro Asn Ser Val Glu 
305                 310                 315                 320 


Glu Asn Ala Leu Val Met Tyr Glu Ile Leu Lys Ala Gly Gly Phe Thr 
                325                 330                 335     


Thr Gly Gly Leu Asn Phe Asp Ala Lys Val Arg Arg Gln Ser Thr Asp 
            340                 345                 350         


Lys Tyr Asp Leu Phe Tyr Gly His Ile Gly Ala Met Asp Thr Met Ala 
        355                 360                 365             


Leu Ala Leu Lys Val Ala Ala Arg Met Val Glu Asp Gly Gln Leu Asp 
    370                 375                 380                 


Lys Arg Val Ala Lys Arg Tyr Ala Gly Trp Asn Gly Glu Leu Gly Gln 
385                 390                 395                 400 


Gln Ile Leu Lys Gly Gln Met Ser Leu Thr Glu Leu Ala Thr Tyr Ala 
                405                 410                 415     


Glu Gln His Asn Leu Ala Pro Gln His His Ser Gly His Gln Glu Leu 
            420                 425                 430         


Leu Glu Asn Leu Val Asn His Tyr Leu Phe Asp Lys 
        435                 440                 


<210>  58
<211>  1332
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  58
atggagttca tcatgcaatc ctacttcgat caattggaaa gagttagata cgaaggtcca       60

aagtccgaaa acccattggc tttcagacac tacaacccag acgaattggt tttgggtaaa      120

agaatggaag aacacttgag attcgctgct tgttactggc acaccttctg ttggaacggt      180

gctgacatgt tcggtgtcgg tgctttcgaa agaccatggc aacaagctgg tgacgctttg      240

gctttggcta agagaaaggc tgatgttgct ttcgaatttt tccacaagtt gaacgtccca      300

tactactgtt tccacgacgt tgatgtctct ccagaaggtg cttccttgaa ggaatacaga      360

aacaacttcg ctcaaatggt tgacgttttg gctcaaaagc aacaagaatc tggtgttaag      420

ttgttgtggg gtactgctaa ctgtttcact aacccaagat acggtgctgg tgctgctacc      480

aacccagacc cagaagtttt ctcctgggct gctacccaag ttgtcactgc tatggatgct      540

actcacagat tgggtggtga aaactacgtc ttgtggggtg gtagagaagg ttacgaaacc      600

ttgttgaaca ctgacttgag acaagaaaga gaacaaatcg gtagattcat gcaaatggtt      660

gtcgaacaca agcacaagac cggtttccaa ggtactttgt tgatcgaacc aaagccacaa      720

gaaccaacca agcaccaata cgactacgat gctgctactg tttacggttt cttgaagcaa      780

ttcggtttgg aaaaggaaat caagttgaac atcgaagcta accacgctac cttggctggt      840

cactctttcc accacgaaat cgctactgct atcgctttgg gtttgttcgg ttccgttgac      900

gctaacagag gtgacgctca attgggttgg gacactgatc aattcccaaa ctctgttgaa      960

gaaaacgctt tggtcatgta cgaaatcttg aaggctggtg gtttcaccac tggtggtttg     1020

aacttcgacg ctaaggttag aagacaatcc accgacaagt acgatttgtt ctacggtcac     1080

atcggtgcta tggacactat ggctttggct ttgaaggttg ctgctagaat ggtcgaagac     1140

ggtcaattgg ataagagagt cgctaagaga tacgctggtt ggaacggtga attgggtcaa     1200

caaatcttga agggtcaaat gtctttgacc gaattggcta cttacgctga acaacacaac     1260

ttggctccac aacaccactc cggtcaccaa gaattgttgg aaaacttggt caaccactac     1320

ttgttcgata ag                                                         1332


<210>  59
<211>  1182
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  59
atgtccgttc aagctaccag agaagacaag ttctccttcg gtttgtggac tgtcggttgg       60

caagctagag acgctttcgg tgacgctacc agaactgctt tggacccagt tgaagctgtc      120

cacaagttgg ctgaaatcgg tgcttacggt atcaccttcc acgacgatga cttggttcca      180

ttcggttctg acgctcaaac tagagatggt atcatcgctg gtttcaagaa ggctttggac      240

gaaaccggtt tgatcgttcc aatggtcacc actaacttgt tcacccaccc agtcttcaag      300

gatggtggtt tcacttctaa cgacagatcc gttagaagat acgctatcag aaaggtcttg      360

agacaaatgg acttgggtgc tgaattgggt gctaagactt tggttttgtg gggtggtaga      420

gaaggtgctg aatacgactc tgctaaggat gtctccgctg ctttggatag atacagagaa      480

gctttgaact tgttggctca atactctgaa gacagaggtt acggtttgag attcgctatc      540

gaaccaaagc caaacgaacc aagaggtgac atcttgttgc caaccgctgg tcacgctatc      600

gctttcgttc aagaattgga aagaccagaa ttgttcggta tcaacccaga aaccggtcac      660

gaacaaatgt ctaacttgaa cttcactcaa ggtatcgctc aagctttgtg gcacaagaag      720

ttgttccaca tcgacttgaa cggtcaacac ggtccaaagt tcgatcaaga cttggttttc      780

ggtcacggtg acttgttgaa cgctttctct ttggttgact tgttggaaaa cggtccagac      840

ggtgctccag cttacgatgg tccaagacac ttcgactaca agccatctag aactgaagat      900

tacgacggtg tctgggaatc cgctaaggct aacatcagaa tgtacttgtt gttgaaggaa      960

agagctaagg ctttcagagc tgacccagaa gttcaagaag ctttggctgc ttctaaggtc     1020

gctgaattga agaccccaac tttgaaccca ggtgaaggtt acgctgaatt gttggctgac     1080

agatccgctt tcgaagatta cgacgctgat gctgttggtg ctaagggttt cggtttcgtt     1140

aagttgaacc aattggctat cgaacacttg ttgggtgcta ga                        1182


<210>  60
<211>  1320
<212>  DNA
<213>  Artificial sequence

<220>
<223>  coding region codon optimized for expression in Saccharomyces 
       cerevisiae

<400>  60
atgcaagcct attttgacca attagacaga gtaagatacg aaggttccaa gtcctccaat       60

ccattagcct ttagacacta caaccctgat gaattggtat tgggtaaaag aatggaagaa      120

catttgagat ttgctgcatg ttattggcac actttctgct ggaatggtgc tgatatgttt      180

ggtgttggtg cattcaacag accatggcaa caacctggtg aagcattggc cttagctaaa      240

agaaaggctg acgtcgcatt tgaatttttc cataaattgc acgtaccatt ctattgtttc      300

catgatgtcg acgtatcccc tgaaggtgct agtttgaagg aatacataaa caacttcgcc      360

caaatggttg atgtcttagc aggtaaacaa gaagaatctg gtgttaagtt gttatggggt      420

actgctaatt gctttacaaa cccaagatac ggtgcaggtg ccgctaccaa tccagatcct      480

gaagttttct catgggcagc cacccaagtt gtcactgcca tggaagctac acataaattg      540

ggtggtgaaa actacgtctt gtggggtggt agagaaggtt acgaaacatt gttaaacacc      600

gatttgagac aagaaagaga acaattaggt agattcatgc aaatggtagt tgaacataaa      660

cacaagattg gtttccaagg tactttgtta atagaaccaa aacctcaaga accaaccaag      720

caccaatatg attacgacgc tgcaactgtc tatggtttct tgaaacaatt cggtttggaa      780

aaggaaatta agttgaacat cgaagcaaac catgccacat tagctggtca ctcctttcat      840

cacgaaatcg caaccgccat tgctttgggt ttattcggta gtgttgatgc aaatagaggt      900

gacgcccaat tgggttggga tacagaccaa tttcctaatt ccgtagaaga aaacgctttg      960

gttatgtacg aaatcttgaa ggcaggtggt tttactacag gtggtttgaa cttcgatgct     1020

aaagttagaa gacaatctac tgataagtac gacttatttt acggtcatat tggtgctatg     1080

gacacaatgg cattggcctt aaaaatagcc gctagaatga tcgaagatgg tgaattggac     1140

aagagaatcg ctcaaagata ttctggttgg aactctgaat tgggtcaaca aatcttgaag     1200

ggtcaaatgt ctttggcaga tttggccaag tacgctcaag aacatcactt atcacctgtt     1260

catcaatcag gtagacaaga acaattagaa aacttagtca accattactt attcgacaaa     1320


<210>  61
<211>  3036
<212>  DNA
<213>  Artificial sequence

<220>
<223>  chimeric AMxylA expression cassette: ILV5p-Am XI coding-ILV5t 
       with a 5' NotI site and a 3' PmeI site

<400>  61
gcggccgcac ctggtaaaac ctctagtgga gtagtagatg taatcaatga agcggaagcc       60

aaaagaccag agtagaggcc tatagaagaa actgcgatac cttttgtgat ggctaaacaa      120

acagacatct ttttatatgt ttttacttct gtatatcgtg aagtagtaag tgataagcga      180

atttggctaa gaacgttgta agtgaacaag ggacctcttt tgcctttcaa aaaaggatta      240

aatggagtta atcattgaga tttagttttc gttagattct gtatccctaa ataactccct      300

tacccgacgg gaaggcacaa aagacttgaa taatagcaaa cggccagtag ccaagaccaa      360

ataatactag agttaactga tggtcttaaa caggcattac gtggtgaact ccaagaccaa      420

tatacaaaat atcgataagt tattcttgcc caccaattta aggagcctac atcaggacag      480

tagtaccatt cctcagagaa gaggtataca taacaagaaa atcgcgtgaa caccttatat      540

aacttagccc gttattgagc taaaaaacct tgcaaaattt cctatgaata agaatacttc      600

agacgtgata aaaatttact ttctaactct tctcacgctg cccctatctg ttcttccgct      660

ctaccgtgag aaataaagca tcgagtacgg cagttcgctg tcactgaact aaaacaataa      720

ggctagttcg aatgatgaac ttgcttgctg tcaaacttct gagttgccgc tgatgtgaca      780

ctgtgacaat aaattcaaac cggttatagc ggtctcctcc ggtaccggtt ctgccacctc      840

caatagagct cagtaggagt cagaacctct gcggtggctg tcagtgactc atccgcgttt      900

cgtaagttgt gcgcgtgcac atttcgcccg ttcccgctca tcttgcagca ggcggaaatt      960

ttcatcacgc tgtaggacgc aaaaaaaaaa taattaatcg tacaagaatc ttggaaaaaa     1020

aattgaaaaa ttttgtataa aagggatgac ctaacttgac tcaatggctt ttacacccag     1080

tattttccct ttccttgttt gttacaatta tagaagcaag acaaaaacat atagacaacc     1140

tattcctagg agttatattt ttttacccta ccagcaatat aagtaaaaaa ctgtttaaac     1200

agtatgtccg ttcaagccac aagagaagac aagtttagtt tcggtttatg gactgtaggt     1260

tggcaagcaa gagacgcatt cggtgacgca accagaactg ccttggatcc agttgaagct     1320

gtccataaat tggcagaaat cggtgcctac ggtattacat tccacgatga cgatttggtt     1380

ccttttggtt ccgatgctca aaccagagac ggtattatag ccggtttcaa aaaggcttta     1440

gatgaaactg gtttgatcgt accaatggtt actacaaatt tgtttactca tcctgtcttc     1500

aaggacggtg gttttacatc taacgataga tcagtcagaa gatacgctat aagaaaggta     1560

ttgagacaaa tggatttggg tgctgaattg ggtgcaaaga cattagtctt gtggggtggt     1620

agagaaggtg cagaatacga ttccgccaaa gacgttagtg ctgcattgga cagatataga     1680

gaagcattga atttgttggc acaatactct gaagatagag gttacggttt gagatttgct     1740

atagaaccaa agcctaacga accaagaggt gacatattgt tacctactgc aggtcatgca     1800

atcgccttcg ttcaagaatt ggaaagacca gaattgttcg gtattaatcc tgaaaccggt     1860

cacgaacaaa tgtctaattt gaacttcact caaggtattg ctcaagcatt atggcataaa     1920

aagttgttcc acatcgattt gaacggtcaa catggtccaa aattcgacca agatttggta     1980

tttggtcacg gtgacttgtt gaacgctttc tcattggttg atttgttgga aaacggtcca     2040

gatggtgccc ctgcttatga cggtccaaga cattttgatt acaaaccttc tagaacagaa     2100

gactatgatg gtgtttggga atcagcaaag gccaacatca gaatgtactt gttgttgaag     2160

gaaagagcta aggcattcag agcagatcca gaagttcaag aagccttagc cgcttccaaa     2220

gtcgcagaat tgaagacacc aaccttaaat cctggtgaag gttacgccga attattggct     2280

gatagaagtg catttgaaga ctatgatgcc gacgctgttg gtgctaaagg ttttggtttt     2340

gtcaagttaa atcaattagc aatcgaacac ttattaggtg ccagatgagg ccctgcaggc     2400

cagaggaaaa taatatcaag tgctggaaac tttttctctt ggaatttttg caacatcaag     2460

tcatagtcaa ttgaattgac ccaatttcac atttaagatt tttttttttt catccgacat     2520

acatctgtac actaggaagc cctgtttttc tgaagcagct tcaaatatat atatttttta     2580

catatttatt atgattcaat gaacaatcta attaaatcga aaacaagaac cgaaacgcga     2640

ataaataatt tatttagatg gtgacaagtg tataagtcct catcgggaca gctacgattt     2700

ctctttcggt tttggctgag ctactggttg ctgtgacgca gcggcattag cgcggcgtta     2760

tgagctaccc tcgtggcctg aaagatggcg ggaataaagc ggaactaaaa attactgact     2820

gagccatatt gaggtcaatt tgtcaactcg tcaagtcacg tttggtggac ggcccctttc     2880

caacgaatcg tatatactaa catgcgcgcg cttcctatat acacatatac atatatatat     2940

atatatatat gtgtgcgtgt atgtgtacac ctgtatttaa tttccttact cgcgggtttt     3000

tcttttttct caattcttgg cttcctcttt ctcgag                               3036


<210>  62
<211>  1247
<212>  DNA
<213>  Artificial sequence

<220>
<223>  GPDp-ECgroES-CYC1t with a 5' PacI site and a 3' NotI site

<400>  62
agatctagtt cgagtttatc attatcaata ctgccatttc aaagaatacg taaataatta       60

atagtagtga ttttcctaac tttatttagt caaaaaatta gccttttaat tctgctgtaa      120

cccgtacatg cccaaaatag ggggcgggtt acacagaata tataacatcg taggtgtctg      180

ggtgaacagt ttattcctgg catccactaa atataatgga gcccgctttt taagctggca      240

tccagaaaaa aaaagaatcc cagcaccaaa atattgtttt cttcaccaac catcagttca      300

taggtccatt ctcttagcgc aactacagag aacaggggca caaacaggca aaaaacgggc      360

acaacctcaa tggagtgatg caacctgcct ggagtaaatg atgacacaag gcaattgacc      420

cacgcatgta tctatctcat tttcttacac cttctattac cttctgctct ctctgatttg      480

gaaaaagctg aaaaaaaagg ttgaaaccag ttccctgaaa ttattcccct acttgactaa      540

taagtatata aagacggtag gtattgattg taattctgta aatctatttc ttaaacttct      600

taaattctac ttttatagtt agtctttttt ttagttttaa aacaccaaga acttagtttc      660

gaataaacac acataaacaa acaaaatgaa tattagacca ttgcatgata gagttattgt      720

taagagaaag gaagttgaaa ccaaatctgc aggtggtatt gttttgactg gttccgctgc      780

agctaagagt acaagaggtg aagttttggc tgttggtaat ggtagaattt tagaaaacgg      840

tgaagttaag cctttggatg ttaaggttgg tgacattgtt attttcaatg atggttacgg      900

tgttaagtca gaaaagattg ataacgaaga agttttgatc atgtctgaat cagatatctt      960

ggcaattgtt gaagcataat taattaatca tgtaattagt tatgtcacgc ttacattcac     1020

gccctcctcc cacatccgct ctaaccgaaa aggaaggagt tagacaacct gaagtctagg     1080

tccctattta ttttttttaa tagttatgtt agtattaaga acgttattta tatttcaaat     1140

ttttcttttt tttctgtaca aacgcgtgta cgcatgtaac attatactga aaaccttgct     1200

tgagaaggtt ttgggacgct cgaaggcttt aatttgcggg cggccgc                   1247


<210>  63
<211>  2678
<212>  DNA
<213>  Artificial sequence

<220>
<223>  ADH1p-ECgroEL-ADH1t with a 5' PacI site and a 3' SpeI site

<400>  63
gaattcctgc agcccggggg atccttttct ggcaaccaaa cccatacatc gggattccta       60

taataccttc gttggtctcc ctaacatgta ggtggcggag gggagatata caatagaaca      120

gataccagac aagacataat gggctaaaca agactacacc aattacactg cctcattgat      180

ggtggtacat aacgaactaa tactgtagcc ctagacttga tagccatcat catatcgaag      240

tttcactacc ctttttccat ttgccatcta ttgaagtaat aataggcgca tgcaacttct      300

tttctttttt tttcttttct ctctcccccg ttgttgtctc accatatccg caatgacaaa      360

aaaatgatgg aagacactaa aggaaaaaat taacgacaaa gacagcacca acagatgtcg      420

ttgttccaga gctgatgagg ggtatctcga agcacacgaa actttttcct tccttcattc      480

acgcacacta ctctctaatg agcaacggta tacggccttc cttccagtta cttgaatttg      540

aaataaaaaa aagtttgctg tcttgctatc aagtataaat agacctgcaa ttattaatct      600

tttgtttcct cgtcattgtt ctcgttccct ttcttccttg tttctttttc tgcacaatat      660

ttcaagctat accaagcata caatcaacta tctcatatac aatggctgct aaagatgtaa      720

agttcggtaa tgatgctaga gtaaaaatgt tgagaggtgt aaatgtattg gctgacgctg      780

taaaagtaac tttgggtcca aaaggtagaa atgttgtctt ggataagtct tttggtgctc      840

ctaccataac taaagacggt gtttcagtcg caagagaaat cgaattggag gataagttcg      900

aaaacatggg tgctcaaatg gtcaaagaag tcgcctctaa ggctaacgat gctgcaggtg      960

acggtactac aaccgctact gttttggctc aagcaattat aacagaaggt ttaaaagcag     1020

ttgccgctgg tatgaatcca atggatttga aaagaggtat tgacaaggcc gtcactgcag     1080

ccgtagaaga attgaaagca ttatcagtcc cttgttctga ttcaaaggcc atcgctcaag     1140

taggtaccat ttccgctaac agtgatgaaa ctgttggtaa attaattgca gaagccatgg     1200

acaaagtcgg taaagaaggt gtaataaccg ttgaagatgg tactggtttg caagatgaat     1260

tagacgtagt tgagggtatg caatttgata gaggttattt gtcaccatac ttcatcaata     1320

agcctgaaac aggtgctgtt gaattggaat ccccttttat tttgttggca gataaaaaga     1380

ttagtaacat aagagaaatg ttgccagttt tagaagctgt cgcaaaagcc ggtaaacctt     1440

tgttaatcat tgctgaagat gttgaaggtg aagcattggc aacattagtc gtaaatacca     1500

tgagaggtat tgtaaaagtt gctgcagtta aggctccagg tttcggtgac agaagaaaag     1560

ctatgttgca agacattgca acattaaccg gtggtacagt tatctccgaa gaaattggta     1620

tggaattgga aaaggccacc ttggaagatt tgggtcaagc taagagagtt gtcattaata     1680

aggatactac aaccatcatc gacggtgtag gtgaagaagc cgctatacaa ggtagagttg     1740

ctcaaataag acaacaaatc gaagaagcaa cttctgatta tgacagagaa aaattgcaag     1800

aaagagttgc aaagttagcc ggtggtgtcg ctgtaattaa agttggtgca gccaccgaag     1860

tcgaaatgaa ggaaaagaaa gcaagagtag aagatgcttt gcatgcaaca agagctgcag     1920

ttgaagaagg tgtagttgca ggtggtggtg tcgccttaat tagagtagcc tccaaattgg     1980

ctgatttgag aggtcaaaat gaagaccaaa acgtaggtat caaggttgcc ttaagagcta     2040

tggaagcacc attgagacaa atcgttttga actgtggtga agaacctagt gtcgtagcta     2100

acactgttaa aggtggtgac ggtaattatg gttacaacgc cgctacagaa gaatacggta     2160

acatgatcga tatgggtata ttggacccaa ctaaggtcac aagatctgca ttgcaatacg     2220

cagcctcagt tgccggttta atgattacta cagaatgcat ggttacagat ttgcctaaaa     2280

acgacgctgc cgacttgggt gccgcaggtg gtatgggtgg tatgggtggt atgggtggta     2340

tgatgtgatt aattaagagt aagcgaattt cttatgattt atgattttta ttattaaata     2400

agttataaaa aaaataagtg tatacaaatt ttaaagtgac tcttaggttt taaaacgaaa     2460

attcttattc ttgagtaact ctttcctgta ggtcaggttg ctttctcagg tatagcatga     2520

ggtcgctctt attgaccaca cctctaccgg catgccgagc aaatgcctgc aaatcgctcc     2580

ccatttcacc caattgtaga tatgctaact ccagcaatga gttgatgaat ctcggtgtgt     2640

attttatgtc ctcagaggac aacacctgtg gtactagt                             2678


<210>  64
<211>  9766
<212>  DNA
<213>  Artificial sequence

<220>
<223>  constructed plasmid

<400>  64
ggccgcacct ggtaaaacct ctagtggagt agtagatgta atcaatgaag cggaagccaa       60

aagaccagag tagaggccta tagaagaaac tgcgatacct tttgtgatgg ctaaacaaac      120

agacatcttt ttatatgttt ttacttctgt atatcgtgaa gtagtaagtg ataagcgaat      180

ttggctaaga acgttgtaag tgaacaaggg acctcttttg cctttcaaaa aaggattaaa      240

tggagttaat cattgagatt tagttttcgt tagattctgt atccctaaat aactccctta      300

cccgacggga aggcacaaaa gacttgaata atagcaaacg gccagtagcc aagaccaaat      360

aatactagag ttaactgatg gtcttaaaca ggcattacgt ggtgaactcc aagaccaata      420

tacaaaatat cgataagtta ttcttgccca ccaatttaag gagcctacat caggacagta      480

gtaccattcc tcagagaaga ggtatacata acaagaaaat cgcgtgaaca ccttatataa      540

cttagcccgt tattgagcta aaaaaccttg caaaatttcc tatgaataag aatacttcag      600

acgtgataaa aatttacttt ctaactcttc tcacgctgcc cctatctgtt cttccgctct      660

accgtgagaa ataaagcatc gagtacggca gttcgctgtc actgaactaa aacaataagg      720

ctagttcgaa tgatgaactt gcttgctgtc aaacttctga gttgccgctg atgtgacact      780

gtgacaataa attcaaaccg gttatagcgg tctcctccgg taccggttct gccacctcca      840

atagagctca gtaggagtca gaacctctgc ggtggctgtc agtgactcat ccgcgtttcg      900

taagttgtgc gcgtgcacat ttcgcccgtt cccgctcatc ttgcagcagg cggaaatttt      960

catcacgctg taggacgcaa aaaaaaaata attaatcgta caagaatctt ggaaaaaaaa     1020

ttgaaaaatt ttgtataaaa gggatgacct aacttgactc aatggctttt acacccagta     1080

ttttcccttt ccttgtttgt tacaattata gaagcaagac aaaaacatat agacaaccta     1140

ttcctaggag ttatattttt ttaccctacc agcaatataa gtaaaaaact gtttaaacag     1200

tatgtccgtt caagccacaa gagaagacaa gtttagtttc ggtttatgga ctgtaggttg     1260

gcaagcaaga gacgcattcg gtgacgcaac cagaactgcc ttggatccag ttgaagctgt     1320

ccataaattg gcagaaatcg gtgcctacgg tattacattc cacgatgacg atttggttcc     1380

ttttggttcc gatgctcaaa ccagagacgg tattatagcc ggtttcaaaa aggctttaga     1440

tgaaactggt ttgatcgtac caatggttac tacaaatttg tttactcatc ctgtcttcaa     1500

ggacggtggt tttacatcta acgatagatc agtcagaaga tacgctataa gaaaggtatt     1560

gagacaaatg gatttgggtg ctgaattggg tgcaaagaca ttagtcttgt ggggtggtag     1620

agaaggtgca gaatacgatt ccgccaaaga cgttagtgct gcattggaca gatatagaga     1680

agcattgaat ttgttggcac aatactctga agatagaggt tacggtttga gatttgctat     1740

agaaccaaag cctaacgaac caagaggtga catattgtta cctactgcag gtcatgcaat     1800

cgccttcgtt caagaattgg aaagaccaga attgttcggt attaatcctg aaaccggtca     1860

cgaacaaatg tctaatttga acttcactca aggtattgct caagcattat ggcataaaaa     1920

gttgttccac atcgatttga acggtcaaca tggtccaaaa ttcgaccaag atttggtatt     1980

tggtcacggt gacttgttga acgctttctc attggttgat ttgttggaaa acggtccaga     2040

tggtgcccct gcttatgacg gtccaagaca ttttgattac aaaccttcta gaacagaaga     2100

ctatgatggt gtttgggaat cagcaaaggc caacatcaga atgtacttgt tgttgaagga     2160

aagagctaag gcattcagag cagatccaga agttcaagaa gccttagccg cttccaaagt     2220

cgcagaattg aagacaccaa ccttaaatcc tggtgaaggt tacgccgaat tattggctga     2280

tagaagtgca tttgaagact atgatgccga cgctgttggt gctaaaggtt ttggttttgt     2340

caagttaaat caattagcaa tcgaacactt attaggtgcc agatgaggcc ctgcaggcca     2400

gaggaaaata atatcaagtg ctggaaactt tttctcttgg aatttttgca acatcaagtc     2460

atagtcaatt gaattgaccc aatttcacat ttaagatttt ttttttttca tccgacatac     2520

atctgtacac taggaagccc tgtttttctg aagcagcttc aaatatatat attttttaca     2580

tatttattat gattcaatga acaatctaat taaatcgaaa acaagaaccg aaacgcgaat     2640

aaataattta tttagatggt gacaagtgta taagtcctca tcgggacagc tacgatttct     2700

ctttcggttt tggctgagct actggttgct gtgacgcagc ggcattagcg cggcgttatg     2760

agctaccctc gtggcctgaa agatggcggg aataaagcgg aactaaaaat tactgactga     2820

gccatattga ggtcaatttg tcaactcgtc aagtcacgtt tggtggacgg cccctttcca     2880

acgaatcgta tatactaaca tgcgcgcgct tcctatatac acatatacat atatatatat     2940

atatatatgt gtgcgtgtat gtgtacacct gtatttaatt tccttactcg cgggtttttc     3000

ttttttctca attcttggct tcctctttct cgagcggacc ggatcctccg cggtgccggc     3060

agatctattt aaatggcgcg ccgacgtcag gtggcacttt tcggggaaat gtgcgcggaa     3120

cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac     3180

cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg     3240

tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc     3300

tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg     3360

atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga     3420

gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc     3480

aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag     3540

aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga     3600

gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg     3660

cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga     3720

atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt     3780

tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact     3840

ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt     3900

ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg     3960

ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta     4020

tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac     4080

tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta     4140

aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt     4200

tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt     4260

tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt     4320

gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc     4380

agataccaaa tactgttctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg     4440

tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg     4500

ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt     4560

cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac     4620

tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg     4680

acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg     4740

gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat     4800

ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt     4860

tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg     4920

attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa     4980

cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcccaata cgcaaaccgc     5040

ctctccccgc gcgttggccg attcattaat gcagctggca cgacaggttt cccgactgga     5100

aagcgggcag tgagcgcaac gcaattaatg tgagttagct cactcattag gcaccccagg     5160

ctttacactt tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga taacaatttc     5220

acacaggaaa cagctatgac catgattacg ccaagctttt tctttccaat tttttttttt     5280

tcgtcattat aaaaatcatt acgaccgaga ttcccgggta ataactgata taattaaatt     5340

gaagctctaa tttgtgagtt tagtatacat gcatttactt ataatacagt tttttagttt     5400

tgctggccgc atcttctcaa atatgcttcc cagcctgctt ttctgtaacg ttcaccctct     5460

accttagcat cccttccctt tgcaaatagt cctcttccaa caataataat gtcagatcct     5520

gtagagacca catcatccac ggttctatac tgttgaccca atgcgtctcc cttgtcatct     5580

aaacccacac cgggtgtcat aatcaaccaa tcgtaacctt catctcttcc acccatgtct     5640

ctttgagcaa taaagccgat aacaaaatct ttgtcgctct tcgcaatgtc aacagtaccc     5700

ttagtatatt ctccagtaga tagggagccc ttgcatgaca attctgctaa catcaaaagg     5760

cctctaggtt cctttgttac ttcttctgcc gcctgcttca aaccgctaac aatacctggg     5820

cccaccacac cgtgtgcatt cgtaatgtct gcccattctg ctattctgta tacacccgca     5880

gagtactgca atttgactgt attaccaatg tcagcaaatt ttctgtcttc gaagagtaaa     5940

aaattgtact tggcggataa tgcctttagc ggcttaactg tgccctccat ggaaaaatca     6000

gtcaagatat ccacatgtgt ttttagtaaa caaattttgg gacctaatgc ttcaactaac     6060

tccagtaatt ccttggtggt acgaacatcc aatgaagcac acaagtttgt ttgcttttcg     6120

tgcatgatat taaatagctt ggcagcaaca ggactaggat gagtagcagc acgttcctta     6180

tatgtagctt tcgacatgat ttatcttcgt ttcctgcagg tttttgttct gtgcagttgg     6240

gttaagaata ctgggcaatt tcatgtttct tcaacactac atatgcgtat atataccaat     6300

ctaagtctgt gctccttcct tcgttcttcc ttctgttcgg agattaccga atcaaaaaaa     6360

tttcaaggaa accgaaatca aaaaaaagaa taaaaaaaaa atgatgaatt gaaaagcttg     6420

catgcctgca ggtcgactct agtatactcc gtctactgta cgatacactt ccgctcaggt     6480

ccttgtcctt taacgaggcc ttaccactct tttgttactc tattgatcca gctcagcaaa     6540

ggcagtgtga tctaagattc tatcttcgcg atgtagtaaa actagctaga ccgagaaaga     6600

gactagaaat gcaaaaggca cttctacaat ggctgccatc attattatcc gatgtgacgc     6660

tgcatttttt tttttttttt tttttttttt tttttttttt tttttttttt ttttttgtac     6720

aaatatcata aaaaaagaga atctttttaa gcaaggattt tcttaacttc ttcggcgaca     6780

gcatcaccga cttcggtggt actgttggaa ccacctaaat caccagttct gatacctgca     6840

tccaaaacct ttttaactgc atcttcaatg gctttacctt cttcaggcaa gttcaatgac     6900

aatttcaaca tcattgcagc agacaagata gtggcgatag ggttgacctt attctttggc     6960

aaatctggag cggaaccatg gcatggttcg tacaaaccaa atgcggtgtt cttgtctggc     7020

aaagaggcca aggacgcaga tggcaacaaa cccaaggagc ctgggataac ggaggcttca     7080

tcggagatga tatcaccaaa catgttgctg gtgattataa taccatttag gtgggttggg     7140

ttcttaacta ggatcatggc ggcagaatca atcaattgat gttgaacttt caatgtaggg     7200

aattcgttct tgatggtttc ctccacagtt tttctccata atcttgaaga ggccaaaaca     7260

ttagctttat ccaaggacca aataggcaat ggtggctcat gttgtagggc catgaaagcg     7320

gccattcttg tgattctttg cacttctgga acggtgtatt gttcactatc ccaagcgaca     7380

ccatcaccat cgtcttcctt tctcttacca aagtaaatac ctcccactaa ttctctaaca     7440

acaacgaagt cagtaccttt agcaaattgt ggcttgattg gagataagtc taaaagagag     7500

tcggatgcaa agttacatgg tcttaagttg gcgtacaatt gaagttcttt acggattttt     7560

agtaaacctt gttcaggtct aacactaccg gtaccccatt taggaccacc cacagcacct     7620

aacaaaacgg catcagcctt cttggaggct tccagcgcct catctggaag tggaacacct     7680

gtagcatcga tagcagcacc accaattaaa tgattttcga aatcgaactt gacattggaa     7740

cgaacatcag aaatagcttt aagaacctta atggcttcgg ctgtgatttc ttgaccaacg     7800

tggtcacctg gcaaaacgac gatcttctta ggggcagaca ttacaatggt atatccttga     7860

aatatatata aaaaaaaaaa aaaaaaaaaa aaaaaaaaat gcagcttctc aatgatattc     7920

gaatacgctt tgaggagata cagcctaata tccgacaaac tgttttacag atttacgatc     7980

gtacttgtta cccatcattg aattttgaac atccgaacct gggagttttc cctgaaacag     8040

atagtatatt tgaacctgta taataatata tagtctagcg ctttacggaa gacaatgtat     8100

gtatttcggt tcctggagaa actattgcat ctattgcata ggtaatcttg cacgtcgcat     8160

ccccggttca ttttctgcgt ttccatcttg cacttcaata gcatatcttt gttaacgaag     8220

catctgtgct tcattttgta gaacaaaaat gcaacgcgag agcgctaatt tttcaaacaa     8280

agaatctgag ctgcattttt acagaacaga aatgcaacgc gaaagcgcta ttttaccaac     8340

gaagaatctg tgcttcattt ttgtaaaaca aaaatgcaac gcgagagcgc taatttttca     8400

aacaaagaat ctgagctgca tttttacaga acagaaatgc aacgcgagag cgctatttta     8460

ccaacaaaga atctatactt cttttttgtt ctacaaaaat gcatcccgag agcgctattt     8520

ttctaacaaa gcatcttaga ttactttttt tctcctttgt gcgctctata atgcagtctc     8580

ttgataactt tttgcactgt aggtccgtta aggttagaag aaggctactt tggtgtctat     8640

tttctcttcc ataaaaaaag cctgactcca cttcccgcgt ttactgatta ctagcgaagc     8700

tgcgggtgca ttttttcaag ataaaggcat ccccgattat attctatacc gatgtggatt     8760

gcgcatactt tgtgaacaga aagtgatagc gttgatgatt cttcattggt cagaaaatta     8820

tgaacggttt cttctatttt gtctctatat actacgtata ggaaatgttt acattttcgt     8880

attgttttcg attcactcta tgaatagttc ttactacaat ttttttgtct aaagagtaat     8940

actagagata aacataaaaa atgtagaggt cgagtttaga tgcaagttca aggagcgaaa     9000

ggtggatggg taggttatat agggatatag cacagagata tatagcaaag agatactttt     9060

gagcaatgtt tgtggaagcg gtattcgcaa tattttagta gctcgttaca gtccggtgcg     9120

tttttggttt tttgaaagtg cgtcttcaga gcgcttttgg ttttcaaaag cgctctgaag     9180

ttcctatact ttctagagaa taggaacttc ggaataggaa cttcaaagcg tttccgaaaa     9240

cgagcgcttc cgaaaatgca acgcgagctg cgcacataca gctcactgtt cacgtcgcac     9300

ctatatctgc gtgttgcctg tatatatata tacatgagaa gaacggcata gtgcgtgttt     9360

atgcttaaat gcgtacttat atgcgtctat ttatgtagga tgaaaggtag tctagtacct     9420

cctgtgatat tatcccattc catgcggggt atcgtatgct tccttcagca ctacccttta     9480

gctgttctat atgctgccac tcctcaattg gattagtctc atccttcaat gctatcattt     9540

cctttgatat tggatcatat gcatagtacc gagaaactag aggatctccc attaccgaca     9600

tttgggcgct atacgtgcat atgttcatgt atgtatctgt atttaaaaca cttttgtatt     9660

atttttcctc atatatgtgt ataggtttat acggatgatt taattattac ttcaccaccc     9720

tttatttcag gctgatatct tagccttgtt actagtcacc ggtggc                    9766


<210>  65
<211>  13921
<212>  DNA
<213>  Artificial sequence

<220>
<223>  constructed plasmid

<400>  65
ggccgcacct ggtaaaacct ctagtggagt agtagatgta atcaatgaag cggaagccaa       60

aagaccagag tagaggccta tagaagaaac tgcgatacct tttgtgatgg ctaaacaaac      120

agacatcttt ttatatgttt ttacttctgt atatcgtgaa gtagtaagtg ataagcgaat      180

ttggctaaga acgttgtaag tgaacaaggg acctcttttg cctttcaaaa aaggattaaa      240

tggagttaat cattgagatt tagttttcgt tagattctgt atccctaaat aactccctta      300

cccgacggga aggcacaaaa gacttgaata atagcaaacg gccagtagcc aagaccaaat      360

aatactagag ttaactgatg gtcttaaaca ggcattacgt ggtgaactcc aagaccaata      420

tacaaaatat cgataagtta ttcttgccca ccaatttaag gagcctacat caggacagta      480

gtaccattcc tcagagaaga ggtatacata acaagaaaat cgcgtgaaca ccttatataa      540

cttagcccgt tattgagcta aaaaaccttg caaaatttcc tatgaataag aatacttcag      600

acgtgataaa aatttacttt ctaactcttc tcacgctgcc cctatctgtt cttccgctct      660

accgtgagaa ataaagcatc gagtacggca gttcgctgtc actgaactaa aacaataagg      720

ctagttcgaa tgatgaactt gcttgctgtc aaacttctga gttgccgctg atgtgacact      780

gtgacaataa attcaaaccg gttatagcgg tctcctccgg taccggttct gccacctcca      840

atagagctca gtaggagtca gaacctctgc ggtggctgtc agtgactcat ccgcgtttcg      900

taagttgtgc gcgtgcacat ttcgcccgtt cccgctcatc ttgcagcagg cggaaatttt      960

catcacgctg taggacgcaa aaaaaaaata attaatcgta caagaatctt ggaaaaaaaa     1020

ttgaaaaatt ttgtataaaa gggatgacct aacttgactc aatggctttt acacccagta     1080

ttttcccttt ccttgtttgt tacaattata gaagcaagac aaaaacatat agacaaccta     1140

ttcctaggag ttatattttt ttaccctacc agcaatataa gtaaaaaact gtttaaacag     1200

tatgtccgtt caagccacaa gagaagacaa gtttagtttc ggtttatgga ctgtaggttg     1260

gcaagcaaga gacgcattcg gtgacgcaac cagaactgcc ttggatccag ttgaagctgt     1320

ccataaattg gcagaaatcg gtgcctacgg tattacattc cacgatgacg atttggttcc     1380

ttttggttcc gatgctcaaa ccagagacgg tattatagcc ggtttcaaaa aggctttaga     1440

tgaaactggt ttgatcgtac caatggttac tacaaatttg tttactcatc ctgtcttcaa     1500

ggacggtggt tttacatcta acgatagatc agtcagaaga tacgctataa gaaaggtatt     1560

gagacaaatg gatttgggtg ctgaattggg tgcaaagaca ttagtcttgt ggggtggtag     1620

agaaggtgca gaatacgatt ccgccaaaga cgttagtgct gcattggaca gatatagaga     1680

agcattgaat ttgttggcac aatactctga agatagaggt tacggtttga gatttgctat     1740

agaaccaaag cctaacgaac caagaggtga catattgtta cctactgcag gtcatgcaat     1800

cgccttcgtt caagaattgg aaagaccaga attgttcggt attaatcctg aaaccggtca     1860

cgaacaaatg tctaatttga acttcactca aggtattgct caagcattat ggcataaaaa     1920

gttgttccac atcgatttga acggtcaaca tggtccaaaa ttcgaccaag atttggtatt     1980

tggtcacggt gacttgttga acgctttctc attggttgat ttgttggaaa acggtccaga     2040

tggtgcccct gcttatgacg gtccaagaca ttttgattac aaaccttcta gaacagaaga     2100

ctatgatggt gtttgggaat cagcaaaggc caacatcaga atgtacttgt tgttgaagga     2160

aagagctaag gcattcagag cagatccaga agttcaagaa gccttagccg cttccaaagt     2220

cgcagaattg aagacaccaa ccttaaatcc tggtgaaggt tacgccgaat tattggctga     2280

tagaagtgca tttgaagact atgatgccga cgctgttggt gctaaaggtt ttggttttgt     2340

caagttaaat caattagcaa tcgaacactt attaggtgcc agatgaggcc ctgcaggcca     2400

gaggaaaata atatcaagtg ctggaaactt tttctcttgg aatttttgca acatcaagtc     2460

atagtcaatt gaattgaccc aatttcacat ttaagatttt ttttttttca tccgacatac     2520

atctgtacac taggaagccc tgtttttctg aagcagcttc aaatatatat attttttaca     2580

tatttattat gattcaatga acaatctaat taaatcgaaa acaagaaccg aaacgcgaat     2640

aaataattta tttagatggt gacaagtgta taagtcctca tcgggacagc tacgatttct     2700

ctttcggttt tggctgagct actggttgct gtgacgcagc ggcattagcg cggcgttatg     2760

agctaccctc gtggcctgaa agatggcggg aataaagcgg aactaaaaat tactgactga     2820

gccatattga ggtcaatttg tcaactcgtc aagtcacgtt tggtggacgg cccctttcca     2880

acgaatcgta tatactaaca tgcgcgcgct tcctatatac acatatacat atatatatat     2940

atatatatgt gtgcgtgtat gtgtacacct gtatttaatt tccttactcg cgggtttttc     3000

ttttttctca attcttggct tcctctttct cgaggtcgac ggtatcgata agcttgatat     3060

cgaattcctg cagcccgggg gatccttttc tggcaaccaa acccatacat cgggattcct     3120

ataatacctt cgttggtctc cctaacatgt aggtggcgga ggggagatat acaatagaac     3180

agataccaga caagacataa tgggctaaac aagactacac caattacact gcctcattga     3240

tggtggtaca taacgaacta atactgtagc cctagacttg atagccatca tcatatcgaa     3300

gtttcactac cctttttcca tttgccatct attgaagtaa taataggcgc atgcaacttc     3360

ttttcttttt ttttcttttc tctctccccc gttgttgtct caccatatcc gcaatgacaa     3420

aaaaatgatg gaagacacta aaggaaaaaa ttaacgacaa agacagcacc aacagatgtc     3480

gttgttccag agctgatgag gggtatctcg aagcacacga aactttttcc ttccttcatt     3540

cacgcacact actctctaat gagcaacggt atacggcctt ccttccagtt acttgaattt     3600

gaaataaaaa aaagtttgct gtcttgctat caagtataaa tagacctgca attattaatc     3660

ttttgtttcc tcgtcattgt tctcgttccc tttcttcctt gtttcttttt ctgcacaata     3720

tttcaagcta taccaagcat acaatcaact atctcatata caatggctgc taaagatgta     3780

aagttcggta atgatgctag agtaaaaatg ttgagaggtg taaatgtatt ggctgacgct     3840

gtaaaagtaa ctttgggtcc aaaaggtaga aatgttgtct tggataagtc ttttggtgct     3900

cctaccataa ctaaagacgg tgtttcagtc gcaagagaaa tcgaattgga ggataagttc     3960

gaaaacatgg gtgctcaaat ggtcaaagaa gtcgcctcta aggctaacga tgctgcaggt     4020

gacggtacta caaccgctac tgttttggct caagcaatta taacagaagg tttaaaagca     4080

gttgccgctg gtatgaatcc aatggatttg aaaagaggta ttgacaaggc cgtcactgca     4140

gccgtagaag aattgaaagc attatcagtc ccttgttctg attcaaaggc catcgctcaa     4200

gtaggtacca tttccgctaa cagtgatgaa actgttggta aattaattgc agaagccatg     4260

gacaaagtcg gtaaagaagg tgtaataacc gttgaagatg gtactggttt gcaagatgaa     4320

ttagacgtag ttgagggtat gcaatttgat agaggttatt tgtcaccata cttcatcaat     4380

aagcctgaaa caggtgctgt tgaattggaa tcccctttta ttttgttggc agataaaaag     4440

attagtaaca taagagaaat gttgccagtt ttagaagctg tcgcaaaagc cggtaaacct     4500

ttgttaatca ttgctgaaga tgttgaaggt gaagcattgg caacattagt cgtaaatacc     4560

atgagaggta ttgtaaaagt tgctgcagtt aaggctccag gtttcggtga cagaagaaaa     4620

gctatgttgc aagacattgc aacattaacc ggtggtacag ttatctccga agaaattggt     4680

atggaattgg aaaaggccac cttggaagat ttgggtcaag ctaagagagt tgtcattaat     4740

aaggatacta caaccatcat cgacggtgta ggtgaagaag ccgctataca aggtagagtt     4800

gctcaaataa gacaacaaat cgaagaagca acttctgatt atgacagaga aaaattgcaa     4860

gaaagagttg caaagttagc cggtggtgtc gctgtaatta aagttggtgc agccaccgaa     4920

gtcgaaatga aggaaaagaa agcaagagta gaagatgctt tgcatgcaac aagagctgca     4980

gttgaagaag gtgtagttgc aggtggtggt gtcgccttaa ttagagtagc ctccaaattg     5040

gctgatttga gaggtcaaaa tgaagaccaa aacgtaggta tcaaggttgc cttaagagct     5100

atggaagcac cattgagaca aatcgttttg aactgtggtg aagaacctag tgtcgtagct     5160

aacactgtta aaggtggtga cggtaattat ggttacaacg ccgctacaga agaatacggt     5220

aacatgatcg atatgggtat attggaccca actaaggtca caagatctgc attgcaatac     5280

gcagcctcag ttgccggttt aatgattact acagaatgca tggttacaga tttgcctaaa     5340

aacgacgctg ccgacttggg tgccgcaggt ggtatgggtg gtatgggtgg tatgggtggt     5400

atgatgtgat taattaagag taagcgaatt tcttatgatt tatgattttt attattaaat     5460

aagttataaa aaaaataagt gtatacaaat tttaaagtga ctcttaggtt ttaaaacgaa     5520

aattcttatt cttgagtaac tctttcctgt aggtcaggtt gctttctcag gtatagcatg     5580

aggtcgctct tattgaccac acctctaccg gcatgccgag caaatgcctg caaatcgctc     5640

cccatttcac ccaattgtag atatgctaac tccagcaatg agttgatgaa tctcggtgtg     5700

tattttatgt cctcagagga caacacctgt ggtactagtt ctagagcggc cgcccgcaaa     5760

ttaaagcctt cgagcgtccc aaaaccttct caagcaaggt tttcagtata atgttacatg     5820

cgtacacgcg tttgtacaga aaaaaaagaa aaatttgaaa tataaataac gttcttaata     5880

ctaacataac tattaaaaaa aataaatagg gacctagact tcaggttgtc taactccttc     5940

cttttcggtt agagcggatg tgggaggagg gcgtgaatgt aagcgtgaca taactaatta     6000

catgattaat taattatgct tcaacaattg ccaagatatc tgattcagac atgatcaaaa     6060

cttcttcgtt atcaatcttt tctgacttaa caccgtaacc atcattgaaa ataacaatgt     6120

caccaacctt aacatccaaa ggcttaactt caccgttttc taaaattcta ccattaccaa     6180

cagccaaaac ttcacctctt gtactcttag ctgcagcgga accagtcaaa acaataccac     6240

ctgcagattt ggtttcaact tcctttctct taacaataac tctatcatgc aatggtctaa     6300

tattcatttt gtttgtttat gtgtgtttat tcgaaactaa gttcttggtg ttttaaaact     6360

aaaaaaaaga ctaactataa aagtagaatt taagaagttt aagaaataga tttacagaat     6420

tacaatcaat acctaccgtc tttatatact tattagtcaa gtaggggaat aatttcaggg     6480

aactggtttc aacctttttt ttcagctttt tccaaatcag agagagcaga aggtaataga     6540

aggtgtaaga aaatgagata gatacatgcg tgggtcaatt gccttgtgtc atcatttact     6600

ccaggcaggt tgcatcactc cattgaggtt gtgcccgttt tttgcctgtt tgtgcccctg     6660

ttctctgtag ttgcgctaag agaatggacc tatgaactga tggttggtga agaaaacaat     6720

attttggtgc tgggattctt tttttttctg gatgccagct taaaaagcgg gctccattat     6780

atttagtgga tgccaggaat aaactgttca cccagacacc tacgatgtta tatattctgt     6840

gtaacccgcc ccctattttg ggcatgtacg ggttacagca gaattaaaag gctaattttt     6900

tgactaaata aagttaggaa aatcactact attaattatt tacgtattct ttgaaatggc     6960

agtattgata atgataaact cgaactagat ctatccgcgg tggagctcca gcttttgttc     7020

cctttagtga gggttaattg cgcgcttggc gtaatcatgg tcatagctgt ttcctgtgtg     7080

aaattgttat ccgctcacaa ttccacacaa cataggagcc ggaagcataa agtgtaaagc     7140

ctggggtgcc taatgagtga ggtaactcac attaattgcg ttgcgctcac tgcccgcttt     7200

ccagtcggga aacctgtcgt gccagaaatg gcgcgccgac gtcaggtggc acttttcggg     7260

gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc     7320

tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta     7380

ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg     7440

ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg     7500

gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac     7560

gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg     7620

acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt     7680

actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg     7740

ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac     7800

cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt     7860

gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag     7920

caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc     7980

aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc     8040

ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta     8100

tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg     8160

ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga     8220

ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac     8280

ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa     8340

tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat     8400

cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc     8460

taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg     8520

gcttcagcag agcgcagata ccaaatactg ttcttctagt gtagccgtag ttaggccacc     8580

acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg     8640

ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg     8700

ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa     8760

cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg     8820

aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga     8880

gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct     8940

gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca     9000

gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc     9060

ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg     9120

ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc     9180

caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca     9240

ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc     9300

attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga     9360

gcggataaca atttcacaca ggaaacagct atgaccatga ttacgccaag ctttttcttt     9420

ccaatttttt ttttttcgtc attataaaaa tcattacgac cgagattccc gggtaataac     9480

tgatataatt aaattgaagc tctaatttgt gagtttagta tacatgcatt tacttataat     9540

acagtttttt agttttgctg gccgcatctt ctcaaatatg cttcccagcc tgcttttctg     9600

taacgttcac cctctacctt agcatccctt ccctttgcaa atagtcctct tccaacaata     9660

ataatgtcag atcctgtaga gaccacatca tccacggttc tatactgttg acccaatgcg     9720

tctcccttgt catctaaacc cacaccgggt gtcataatca accaatcgta accttcatct     9780

cttccaccca tgtctctttg agcaataaag ccgataacaa aatctttgtc gctcttcgca     9840

atgtcaacag tacccttagt atattctcca gtagataggg agcccttgca tgacaattct     9900

gctaacatca aaaggcctct aggttccttt gttacttctt ctgccgcctg cttcaaaccg     9960

ctaacaatac ctgggcccac cacaccgtgt gcattcgtaa tgtctgccca ttctgctatt    10020

ctgtatacac ccgcagagta ctgcaatttg actgtattac caatgtcagc aaattttctg    10080

tcttcgaaga gtaaaaaatt gtacttggcg gataatgcct ttagcggctt aactgtgccc    10140

tccatggaaa aatcagtcaa gatatccaca tgtgttttta gtaaacaaat tttgggacct    10200

aatgcttcaa ctaactccag taattccttg gtggtacgaa catccaatga agcacacaag    10260

tttgtttgct tttcgtgcat gatattaaat agcttggcag caacaggact aggatgagta    10320

gcagcacgtt ccttatatgt agctttcgac atgatttatc ttcgtttcct gcaggttttt    10380

gttctgtgca gttgggttaa gaatactggg caatttcatg tttcttcaac actacatatg    10440

cgtatatata ccaatctaag tctgtgctcc ttccttcgtt cttccttctg ttcggagatt    10500

accgaatcaa aaaaatttca aggaaaccga aatcaaaaaa aagaataaaa aaaaaatgat    10560

gaattgaaaa gcttgcatgc ctgcaggtcg actctagtat actccgtcta ctgtacgata    10620

cacttccgct caggtccttg tcctttaacg aggccttacc actcttttgt tactctattg    10680

atccagctca gcaaaggcag tgtgatctaa gattctatct tcgcgatgta gtaaaactag    10740

ctagaccgag aaagagacta gaaatgcaaa aggcacttct acaatggctg ccatcattat    10800

tatccgatgt gacgctgcat tttttttttt tttttttttt tttttttttt tttttttttt    10860

tttttttttt tgtacaaata tcataaaaaa agagaatctt tttaagcaag gattttctta    10920

acttcttcgg cgacagcatc accgacttcg gtggtactgt tggaaccacc taaatcacca    10980

gttctgatac ctgcatccaa aaccttttta actgcatctt caatggcttt accttcttca    11040

ggcaagttca atgacaattt caacatcatt gcagcagaca agatagtggc gatagggttg    11100

accttattct ttggcaaatc tggagcggaa ccatggcatg gttcgtacaa accaaatgcg    11160

gtgttcttgt ctggcaaaga ggccaaggac gcagatggca acaaacccaa ggagcctggg    11220

ataacggagg cttcatcgga gatgatatca ccaaacatgt tgctggtgat tataatacca    11280

tttaggtggg ttgggttctt aactaggatc atggcggcag aatcaatcaa ttgatgttga    11340

actttcaatg tagggaattc gttcttgatg gtttcctcca cagtttttct ccataatctt    11400

gaagaggcca aaacattagc tttatccaag gaccaaatag gcaatggtgg ctcatgttgt    11460

agggccatga aagcggccat tcttgtgatt ctttgcactt ctggaacggt gtattgttca    11520

ctatcccaag cgacaccatc accatcgtct tcctttctct taccaaagta aatacctccc    11580

actaattctc taacaacaac gaagtcagta cctttagcaa attgtggctt gattggagat    11640

aagtctaaaa gagagtcgga tgcaaagtta catggtctta agttggcgta caattgaagt    11700

tctttacgga tttttagtaa accttgttca ggtctaacac taccggtacc ccatttagga    11760

ccacccacag cacctaacaa aacggcatca gccttcttgg aggcttccag cgcctcatct    11820

ggaagtggaa cacctgtagc atcgatagca gcaccaccaa ttaaatgatt ttcgaaatcg    11880

aacttgacat tggaacgaac atcagaaata gctttaagaa ccttaatggc ttcggctgtg    11940

atttcttgac caacgtggtc acctggcaaa acgacgatct tcttaggggc agacattaca    12000

atggtatatc cttgaaatat atataaaaaa aaaaaaaaaa aaaaaaaaaa aaaatgcagc    12060

ttctcaatga tattcgaata cgctttgagg agatacagcc taatatccga caaactgttt    12120

tacagattta cgatcgtact tgttacccat cattgaattt tgaacatccg aacctgggag    12180

ttttccctga aacagatagt atatttgaac ctgtataata atatatagtc tagcgcttta    12240

cggaagacaa tgtatgtatt tcggttcctg gagaaactat tgcatctatt gcataggtaa    12300

tcttgcacgt cgcatccccg gttcattttc tgcgtttcca tcttgcactt caatagcata    12360

tctttgttaa cgaagcatct gtgcttcatt ttgtagaaca aaaatgcaac gcgagagcgc    12420

taatttttca aacaaagaat ctgagctgca tttttacaga acagaaatgc aacgcgaaag    12480

cgctatttta ccaacgaaga atctgtgctt catttttgta aaacaaaaat gcaacgcgag    12540

agcgctaatt tttcaaacaa agaatctgag ctgcattttt acagaacaga aatgcaacgc    12600

gagagcgcta ttttaccaac aaagaatcta tacttctttt ttgttctaca aaaatgcatc    12660

ccgagagcgc tatttttcta acaaagcatc ttagattact ttttttctcc tttgtgcgct    12720

ctataatgca gtctcttgat aactttttgc actgtaggtc cgttaaggtt agaagaaggc    12780

tactttggtg tctattttct cttccataaa aaaagcctga ctccacttcc cgcgtttact    12840

gattactagc gaagctgcgg gtgcattttt tcaagataaa ggcatccccg attatattct    12900

ataccgatgt ggattgcgca tactttgtga acagaaagtg atagcgttga tgattcttca    12960

ttggtcagaa aattatgaac ggtttcttct attttgtctc tatatactac gtataggaaa    13020

tgtttacatt ttcgtattgt tttcgattca ctctatgaat agttcttact acaatttttt    13080

tgtctaaaga gtaatactag agataaacat aaaaaatgta gaggtcgagt ttagatgcaa    13140

gttcaaggag cgaaaggtgg atgggtaggt tatataggga tatagcacag agatatatag    13200

caaagagata cttttgagca atgtttgtgg aagcggtatt cgcaatattt tagtagctcg    13260

ttacagtccg gtgcgttttt ggttttttga aagtgcgtct tcagagcgct tttggttttc    13320

aaaagcgctc tgaagttcct atactttcta gagaatagga acttcggaat aggaacttca    13380

aagcgtttcc gaaaacgagc gcttccgaaa atgcaacgcg agctgcgcac atacagctca    13440

ctgttcacgt cgcacctata tctgcgtgtt gcctgtatat atatatacat gagaagaacg    13500

gcatagtgcg tgtttatgct taaatgcgta cttatatgcg tctatttatg taggatgaaa    13560

ggtagtctag tacctcctgt gatattatcc cattccatgc ggggtatcgt atgcttcctt    13620

cagcactacc ctttagctgt tctatatgct gccactcctc aattggatta gtctcatcct    13680

tcaatgctat catttccttt gatattggat catatgcata gtaccgagaa actagaggat    13740

ctcccattac cgacatttgg gcgctatacg tgcatatgtt catgtatgta tctgtattta    13800

aaacactttt gtattatttt tcctcatata tgtgtatagg tttatacgga tgatttaatt    13860

attacttcac caccctttat ttcaggctga tatcttagcc ttgttactag tcaccggtgg    13920

c                                                                    13921


<210>  66
<211>  9684
<212>  DNA
<213>  Artificial sequence

<220>
<223>  constructed plasmid

<400>  66
ccagcttttg ttccctttag tgagggttaa ttgcgcgctt ggcgtaatca tggtcatagc       60

tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatagga gccggaagca      120

taaagtgtaa agcctggggt gcctaatgag tgaggtaact cacattaatt gcgttgcgct      180

cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac      240

gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc      300

tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt      360

tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg      420

ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg      480

agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat      540

accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta      600

ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct      660

gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc      720

ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa      780

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg      840

taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag      900

tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt      960

gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta     1020

cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc     1080

agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca     1140

cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa     1200

cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat     1260

ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct     1320

taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt     1380

tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat     1440

ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta     1500

atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg     1560

gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt     1620

tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg     1680

cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg     1740

taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc     1800

ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa     1860

ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac     1920

cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt     1980

ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg     2040

gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa     2100

gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata     2160

aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgaacga agcatctgtg     2220

cttcattttg tagaacaaaa atgcaacgcg agagcgctaa tttttcaaac aaagaatctg     2280

agctgcattt ttacagaaca gaaatgcaac gcgaaagcgc tattttacca acgaagaatc     2340

tgtgcttcat ttttgtaaaa caaaaatgca acgcgagagc gctaattttt caaacaaaga     2400

atctgagctg catttttaca gaacagaaat gcaacgcgag agcgctattt taccaacaaa     2460

gaatctatac ttcttttttg ttctacaaaa atgcatcccg agagcgctat ttttctaaca     2520

aagcatctta gattactttt tttctccttt gtgcgctcta taatgcagtc tcttgataac     2580

tttttgcact gtaggtccgt taaggttaga agaaggctac tttggtgtct attttctctt     2640

ccataaaaaa agcctgactc cacttcccgc gtttactgat tactagcgaa gctgcgggtg     2700

cattttttca agataaaggc atccccgatt atattctata ccgatgtgga ttgcgcatac     2760

tttgtgaaca gaaagtgata gcgttgatga ttcttcattg gtcagaaaat tatgaacggt     2820

ttcttctatt ttgtctctat atactacgta taggaaatgt ttacattttc gtattgtttt     2880

cgattcactc tatgaatagt tcttactaca atttttttgt ctaaagagta atactagaga     2940

taaacataaa aaatgtagag gtcgagttta gatgcaagtt caaggagcga aaggtggatg     3000

ggtaggttat atagggatat agcacagaga tatatagcaa agagatactt ttgagcaatg     3060

tttgtggaag cggtattcgc aatattttag tagctcgtta cagtccggtg cgtttttggt     3120

tttttgaaag tgcgtcttca gagcgctttt ggttttcaaa agcgctctga agttcctata     3180

ctttctagag aataggaact tcggaatagg aacttcaaag cgtttccgaa aacgagcgct     3240

tccgaaaatg caacgcgagc tgcgcacata cagctcactg ttcacgtcgc acctatatct     3300

gcgtgttgcc tgtatatata tatacatgag aagaacggca tagtgcgtgt ttatgcttaa     3360

atgcgtactt atatgcgtct atttatgtag gatgaaaggt agtctagtac ctcctgtgat     3420

attatcccat tccatgcggg gtatcgtatg cttccttcag cactaccctt tagctgttct     3480

atatgctgcc actcctcaat tggattagtc tcatccttca atgctatcat ttcctttgat     3540

attggatcat ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca     3600

cgaggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc     3660

tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg     3720

gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca tcagagcaga     3780

ttgtactgag agtgcaccat aaattcccgt tttaagagct tggtgagcgc taggagtcac     3840

tgccaggtat cgtttgaaca cggcattagt cagggaagtc ataacacagt cctttcccgc     3900

aattttcttt ttctattact cttggcctcc tctagtacac tctatatttt tttatgcctc     3960

ggtaatgatt ttcatttttt tttttcccct agcggatgac tctttttttt tcttagcgat     4020

tggcattatc acataatgaa ttatacatta tataaagtaa tgtgatttct tcgaagaata     4080

tactaaaaaa tgagcaggca agataaacga aggcaaagat gacagagcag aaagccctag     4140

taaagcgtat tacaaatgaa accaagattc agattgcgat ctctttaaag ggtggtcccc     4200

tagcgataga gcactcgatc ttcccagaaa aagaggcaga agcagtagca gaacaggcca     4260

cacaatcgca agtgattaac gtccacacag gtatagggtt tctggaccat atgatacatg     4320

ctctggccaa gcattccggc tggtcgctaa tcgttgagtg cattggtgac ttacacatag     4380

acgaccatca caccactgaa gactgcggga ttgctctcgg tcaagctttt aaagaggccc     4440

tactggcgcg tggagtaaaa aggtttggat caggatttgc gcctttggat gaggcacttt     4500

ccagagcggt ggtagatctt tcgaacaggc cgtacgcagt tgtcgaactt ggtttgcaaa     4560

gggagaaagt aggagatctc tcttgcgaga tgatcccgca ttttcttgaa agctttgcag     4620

aggctagcag aattaccctc cacgttgatt gtctgcgagg caagaatgat catcaccgta     4680

gtgagagtgc gttcaaggct cttgcggttg ccataagaga agccacctcg cccaatggta     4740

ccaacgatgt tccctccacc aaaggtgttc ttatgtagtg acaccgatta tttaaagctg     4800

cagcatacga tatatataca tgtgtatata tgtataccta tgaatgtcag taagtatgta     4860

tacgaacagt atgatactga agatgacaag gtaatgcatc attctatacg tgtcattctg     4920

aacgaggcgc gctttccttt tttctttttg ctttttcttt ttttttctct tgaactcgac     4980

ggatctatgc ggtgtgaaat accgcacaga tgcgtaagga gaaaataccg catcaggaaa     5040

ttgtaaacgt taatattttg ttaaaattcg cgttaaattt ttgttaaatc agctcatttt     5100

ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag accgagatag     5160

ggttgagtgt tgttccagtt tggaacaaga gtccactatt aaagaacgtg gactccaacg     5220

tcaaagggcg aaaaaccgtc tatcagggcg atggcccact acgtgaacca tcaccctaat     5280

caagtttttt ggggtcgagg tgccgtaaag cactaaatcg gaaccctaaa gggagccccc     5340

gatttagagc ttgacgggga aagccggcga acgtggcgag aaaggaaggg aagaaagcga     5400

aaggagcggg cgctagggcg ctggcaagtg tagcggtcac gctgcgcgta accaccacac     5460

ccgccgcgct taatgcgccg ctacagggcg cgtcgcgcca ttcgccattc aggctgcgca     5520

actgttggga agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg     5580

gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta     5640

aaacgacggc cagtgagcgc gcgtaatacg actcactata gggcgaattg ggtaccgggc     5700

cccccctcga ggtcgacggt atcgataagc ttgatatcga attcctgcag cccgggggat     5760

ccttttctgg caaccaaacc catacatcgg gattcctata ataccttcgt tggtctccct     5820

aacatgtagg tggcggaggg gagatataca atagaacaga taccagacaa gacataatgg     5880

gctaaacaag actacaccaa ttacactgcc tcattgatgg tggtacataa cgaactaata     5940

ctgtagccct agacttgata gccatcatca tatcgaagtt tcactaccct ttttccattt     6000

gccatctatt gaagtaataa taggcgcatg caacttcttt tctttttttt tcttttctct     6060

ctcccccgtt gttgtctcac catatccgca atgacaaaaa aatgatggaa gacactaaag     6120

gaaaaaatta acgacaaaga cagcaccaac agatgtcgtt gttccagagc tgatgagggg     6180

tatctcgaag cacacgaaac tttttccttc cttcattcac gcacactact ctctaatgag     6240

caacggtata cggccttcct tccagttact tgaatttgaa ataaaaaaaa gtttgctgtc     6300

ttgctatcaa gtataaatag acctgcaatt attaatcttt tgtttcctcg tcattgttct     6360

cgttcccttt cttccttgtt tctttttctg cacaatattt caagctatac caagcataca     6420

atcaactatc tcatatacaa tggctgctaa agatgtaaag ttcggtaatg atgctagagt     6480

aaaaatgttg agaggtgtaa atgtattggc tgacgctgta aaagtaactt tgggtccaaa     6540

aggtagaaat gttgtcttgg ataagtcttt tggtgctcct accataacta aagacggtgt     6600

ttcagtcgca agagaaatcg aattggagga taagttcgaa aacatgggtg ctcaaatggt     6660

caaagaagtc gcctctaagg ctaacgatgc tgcaggtgac ggtactacaa ccgctactgt     6720

tttggctcaa gcaattataa cagaaggttt aaaagcagtt gccgctggta tgaatccaat     6780

ggatttgaaa agaggtattg acaaggccgt cactgcagcc gtagaagaat tgaaagcatt     6840

atcagtccct tgttctgatt caaaggccat cgctcaagta ggtaccattt ccgctaacag     6900

tgatgaaact gttggtaaat taattgcaga agccatggac aaagtcggta aagaaggtgt     6960

aataaccgtt gaagatggta ctggtttgca agatgaatta gacgtagttg agggtatgca     7020

atttgataga ggttatttgt caccatactt catcaataag cctgaaacag gtgctgttga     7080

attggaatcc ccttttattt tgttggcaga taaaaagatt agtaacataa gagaaatgtt     7140

gccagtttta gaagctgtcg caaaagccgg taaacctttg ttaatcattg ctgaagatgt     7200

tgaaggtgaa gcattggcaa cattagtcgt aaataccatg agaggtattg taaaagttgc     7260

tgcagttaag gctccaggtt tcggtgacag aagaaaagct atgttgcaag acattgcaac     7320

attaaccggt ggtacagtta tctccgaaga aattggtatg gaattggaaa aggccacctt     7380

ggaagatttg ggtcaagcta agagagttgt cattaataag gatactacaa ccatcatcga     7440

cggtgtaggt gaagaagccg ctatacaagg tagagttgct caaataagac aacaaatcga     7500

agaagcaact tctgattatg acagagaaaa attgcaagaa agagttgcaa agttagccgg     7560

tggtgtcgct gtaattaaag ttggtgcagc caccgaagtc gaaatgaagg aaaagaaagc     7620

aagagtagaa gatgctttgc atgcaacaag agctgcagtt gaagaaggtg tagttgcagg     7680

tggtggtgtc gccttaatta gagtagcctc caaattggct gatttgagag gtcaaaatga     7740

agaccaaaac gtaggtatca aggttgcctt aagagctatg gaagcaccat tgagacaaat     7800

cgttttgaac tgtggtgaag aacctagtgt cgtagctaac actgttaaag gtggtgacgg     7860

taattatggt tacaacgccg ctacagaaga atacggtaac atgatcgata tgggtatatt     7920

ggacccaact aaggtcacaa gatctgcatt gcaatacgca gcctcagttg ccggtttaat     7980

gattactaca gaatgcatgg ttacagattt gcctaaaaac gacgctgccg acttgggtgc     8040

cgcaggtggt atgggtggta tgggtggtat gggtggtatg atgtgattaa ttaagagtaa     8100

gcgaatttct tatgatttat gatttttatt attaaataag ttataaaaaa aataagtgta     8160

tacaaatttt aaagtgactc ttaggtttta aaacgaaaat tcttattctt gagtaactct     8220

ttcctgtagg tcaggttgct ttctcaggta tagcatgagg tcgctcttat tgaccacacc     8280

tctaccggca tgccgagcaa atgcctgcaa atcgctcccc atttcaccca attgtagata     8340

tgctaactcc agcaatgagt tgatgaatct cggtgtgtat tttatgtcct cagaggacaa     8400

cacctgtggt actagttcta gagcggccgc ccgcaaatta aagccttcga gcgtcccaaa     8460

accttctcaa gcaaggtttt cagtataatg ttacatgcgt acacgcgttt gtacagaaaa     8520

aaaagaaaaa tttgaaatat aaataacgtt cttaatacta acataactat taaaaaaaat     8580

aaatagggac ctagacttca ggttgtctaa ctccttcctt ttcggttaga gcggatgtgg     8640

gaggagggcg tgaatgtaag cgtgacataa ctaattacat gattaattaa ttatgcttca     8700

acaattgcca agatatctga ttcagacatg atcaaaactt cttcgttatc aatcttttct     8760

gacttaacac cgtaaccatc attgaaaata acaatgtcac caaccttaac atccaaaggc     8820

ttaacttcac cgttttctaa aattctacca ttaccaacag ccaaaacttc acctcttgta     8880

ctcttagctg cagcggaacc agtcaaaaca ataccacctg cagatttggt ttcaacttcc     8940

tttctcttaa caataactct atcatgcaat ggtctaatat tcattttgtt tgtttatgtg     9000

tgtttattcg aaactaagtt cttggtgttt taaaactaaa aaaaagacta actataaaag     9060

tagaatttaa gaagtttaag aaatagattt acagaattac aatcaatacc taccgtcttt     9120

atatacttat tagtcaagta ggggaataat ttcagggaac tggtttcaac cttttttttc     9180

agctttttcc aaatcagaga gagcagaagg taatagaagg tgtaagaaaa tgagatagat     9240

acatgcgtgg gtcaattgcc ttgtgtcatc atttactcca ggcaggttgc atcactccat     9300

tgaggttgtg cccgtttttt gcctgtttgt gcccctgttc tctgtagttg cgctaagaga     9360

atggacctat gaactgatgg ttggtgaaga aaacaatatt ttggtgctgg gattcttttt     9420

ttttctggat gccagcttaa aaagcgggct ccattatatt tagtggatgc caggaataaa     9480

ctgttcaccc agacacctac gatgttatat attctgtgta acccgccccc tattttgggc     9540

atgtacgggt tacagcagaa ttaaaaggct aattttttga ctaaataaag ttaggaaaat     9600

cactactatt aattatttac gtattctttg aaatggcagt attgataatg ataaactcga     9660

actagatcta tccgcggtgg agct                                            9684


<210>  67
<211>  12642
<212>  DNA
<213>  Artificial sequence

<220>
<223>  constructed plasmid

<400>  67
ggccgcacct ggtaaaacct ctagtggagt agtagatgta atcaatgaag cggaagccaa       60

aagaccagag tagaggccta tagaagaaac tgcgatacct tttgtgatgg ctaaacaaac      120

agacatcttt ttatatgttt ttacttctgt atatcgtgaa gtagtaagtg ataagcgaat      180

ttggctaaga acgttgtaag tgaacaaggg acctcttttg cctttcaaaa aaggattaaa      240

tggagttaat cattgagatt tagttttcgt tagattctgt atccctaaat aactccctta      300

cccgacggga aggcacaaaa gacttgaata atagcaaacg gccagtagcc aagaccaaat      360

aatactagag ttaactgatg gtcttaaaca ggcattacgt ggtgaactcc aagaccaata      420

tacaaaatat cgataagtta ttcttgccca ccaatttaag gagcctacat caggacagta      480

gtaccattcc tcagagaaga ggtatacata acaagaaaat cgcgtgaaca ccttatataa      540

cttagcccgt tattgagcta aaaaaccttg caaaatttcc tatgaataag aatacttcag      600

acgtgataaa aatttacttt ctaactcttc tcacgctgcc cctatctgtt cttccgctct      660

accgtgagaa ataaagcatc gagtacggca gttcgctgtc actgaactaa aacaataagg      720

ctagttcgaa tgatgaactt gcttgctgtc aaacttctga gttgccgctg atgtgacact      780

gtgacaataa attcaaaccg gttatagcgg tctcctccgg taccggttct gccacctcca      840

atagagctca gtaggagtca gaacctctgc ggtggctgtc agtgactcat ccgcgtttcg      900

taagttgtgc gcgtgcacat ttcgcccgtt cccgctcatc ttgcagcagg cggaaatttt      960

catcacgctg taggacgcaa aaaaaaaata attaatcgta caagaatctt ggaaaaaaaa     1020

ttgaaaaatt ttgtataaaa gggatgacct aacttgactc aatggctttt acacccagta     1080

ttttcccttt ccttgtttgt tacaattata gaagcaagac aaaaacatat agacaaccta     1140

ttcctaggag ttatattttt ttaccctacc agcaatataa gtaaaaaact gtttaaacag     1200

tatgtccgtt caagccacaa gagaagacaa gtttagtttc ggtttatgga ctgtaggttg     1260

gcaagcaaga gacgcattcg gtgacgcaac cagaactgcc ttggatccag ttgaagctgt     1320

ccataaattg gcagaaatcg gtgcctacgg tattacattc cacgatgacg atttggttcc     1380

ttttggttcc gatgctcaaa ccagagacgg tattatagcc ggtttcaaaa aggctttaga     1440

tgaaactggt ttgatcgtac caatggttac tacaaatttg tttactcatc ctgtcttcaa     1500

ggacggtggt tttacatcta acgatagatc agtcagaaga tacgctataa gaaaggtatt     1560

gagacaaatg gatttgggtg ctgaattggg tgcaaagaca ttagtcttgt ggggtggtag     1620

agaaggtgca gaatacgatt ccgccaaaga cgttagtgct gcattggaca gatatagaga     1680

agcattgaat ttgttggcac aatactctga agatagaggt tacggtttga gatttgctat     1740

agaaccaaag cctaacgaac caagaggtga catattgtta cctactgcag gtcatgcaat     1800

cgccttcgtt caagaattgg aaagaccaga attgttcggt attaatcctg aaaccggtca     1860

cgaacaaatg tctaatttga acttcactca aggtattgct caagcattat ggcataaaaa     1920

gttgttccac atcgatttga acggtcaaca tggtccaaaa ttcgaccaag atttggtatt     1980

tggtcacggt gacttgttga acgctttctc attggttgat ttgttggaaa acggtccaga     2040

tggtgcccct gcttatgacg gtccaagaca ttttgattac aaaccttcta gaacagaaga     2100

ctatgatggt gtttgggaat cagcaaaggc caacatcaga atgtacttgt tgttgaagga     2160

aagagctaag gcattcagag cagatccaga agttcaagaa gccttagccg cttccaaagt     2220

cgcagaattg aagacaccaa ccttaaatcc tggtgaaggt tacgccgaat tattggctga     2280

tagaagtgca tttgaagact atgatgccga cgctgttggt gctaaaggtt ttggttttgt     2340

caagttaaat caattagcaa tcgaacactt attaggtgcc agatgaggcc ctgcaggcca     2400

gaggaaaata atatcaagtg ctggaaactt tttctcttgg aatttttgca acatcaagtc     2460

atagtcaatt gaattgaccc aatttcacat ttaagatttt ttttttttca tccgacatac     2520

atctgtacac taggaagccc tgtttttctg aagcagcttc aaatatatat attttttaca     2580

tatttattat gattcaatga acaatctaat taaatcgaaa acaagaaccg aaacgcgaat     2640

aaataattta tttagatggt gacaagtgta taagtcctca tcgggacagc tacgatttct     2700

ctttcggttt tggctgagct actggttgct gtgacgcagc ggcattagcg cggcgttatg     2760

agctaccctc gtggcctgaa agatggcggg aataaagcgg aactaaaaat tactgactga     2820

gccatattga ggtcaatttg tcaactcgtc aagtcacgtt tggtggacgg cccctttcca     2880

acgaatcgta tatactaaca tgcgcgcgct tcctatatac acatatacat atatatatat     2940

atatatatgt gtgcgtgtat gtgtacacct gtatttaatt tccttactcg cgggtttttc     3000

ttttttctca attcttggct tcctctttct cgaggtcgac ggtatcgata agcttgatat     3060

cgaattcctg cagcccgggg gatccttttc tggcaaccaa acccatacat cgggattcct     3120

ataatacctt cgttggtctc cctaacatgt aggtggcgga ggggagatat acaatagaac     3180

agataccaga caagacataa tgggctaaac aagactacac caattacact gcctcattga     3240

tggtggtaca taacgaacta atactgtagc cctagacttg atagccatca tcatatcgaa     3300

gtttcactac cctttttcca tttgccatct attgaagtaa taataggcgc atgcaacttc     3360

ttttcttttt ttttcttttc tctctccccc gttgttgtct caccatatcc gcaatgacaa     3420

aaaaatgatg gaagacacta aaggaaaaaa ttaacgacaa agacagcacc aacagatgtc     3480

gttgttccag agctgatgag gggtatctcg aagcacacga aactttttcc ttccttcatt     3540

cacgcacact actctctaat gagcaacggt atacggcctt ccttccagtt acttgaattt     3600

gaaataaaaa aaagtttgct gtcttgctat caagtataaa tagacctgca attattaatc     3660

ttttgtttcc tcgtcattgt tctcgttccc tttcttcctt gtttcttttt ctgcacaata     3720

tttcaagcta taccaagcat acaatcaact atctcatata caatggctgc taaagatgta     3780

aagttcggta atgatgctag agtaaaaatg ttgagaggtg taaatgtatt ggctgacgct     3840

gtaaaagtaa ctttgggtcc aaaaggtaga aatgttgtct tggataagtc ttttggtgct     3900

cctaccataa ctaaagacgg tgtttcagtc gcaagagaaa tcgaattgga ggataagttc     3960

gaaaacatgg gtgctcaaat ggtcaaagaa gtcgcctcta aggctaacga tgctgcaggt     4020

gacggtacta caaccgctac tgttttggct caagcaatta taacagaagg tttaaaagca     4080

gttgccgctg gtatgaatcc aatggatttg aaaagaggta ttgacaaggc cgtcactgca     4140

gccgtagaag aattgaaagc attatcagtc ccttgttctg attcaaaggc catcgctcaa     4200

gtaggtacca tttccgctaa cagtgatgaa actgttggta aattaattgc agaagccatg     4260

gacaaagtcg gtaaagaagg tgtaataacc gttgaagatg gtactggttt gcaagatgaa     4320

ttagacgtag ttgagggtat gcaatttgat agaggttatt tgtcaccata cttcatcaat     4380

aagcctgaaa caggtgctgt tgaattggaa tcccctttta ttttgttggc agataaaaag     4440

attagtaaca taagagaaat gttgccagtt ttagaagctg tcgcaaaagc cggtaaacct     4500

ttgttaatca ttgctgaaga tgttgaaggt gaagcattgg caacattagt cgtaaatacc     4560

atgagaggta ttgtaaaagt tgctgcagtt aaggctccag gtttcggtga cagaagaaaa     4620

gctatgttgc aagacattgc aacattaacc ggtggtacag ttatctccga agaaattggt     4680

atggaattgg aaaaggccac cttggaagat ttgggtcaag ctaagagagt tgtcattaat     4740

aaggatacta caaccatcat cgacggtgta ggtgaagaag ccgctataca aggtagagtt     4800

gctcaaataa gacaacaaat cgaagaagca acttctgatt atgacagaga aaaattgcaa     4860

gaaagagttg caaagttagc cggtggtgtc gctgtaatta aagttggtgc agccaccgaa     4920

gtcgaaatga aggaaaagaa agcaagagta gaagatgctt tgcatgcaac aagagctgca     4980

gttgaagaag gtgtagttgc aggtggtggt gtcgccttaa ttagagtagc ctccaaattg     5040

gctgatttga gaggtcaaaa tgaagaccaa aacgtaggta tcaaggttgc cttaagagct     5100

atggaagcac cattgagaca aatcgttttg aactgtggtg aagaacctag tgtcgtagct     5160

aacactgtta aaggtggtga cggtaattat ggttacaacg ccgctacaga agaatacggt     5220

aacatgatcg atatgggtat attggaccca actaaggtca caagatctgc attgcaatac     5280

gcagcctcag ttgccggttt aatgattact acagaatgca tggttacaga tttgcctaaa     5340

aacgacgctg ccgacttggg tgccgcaggt ggtatgggtg gtatgggtgg tatgggtggt     5400

atgatgtgat taattaagag taagcgaatt tcttatgatt tatgattttt attattaaat     5460

aagttataaa aaaaataagt gtatacaaat tttaaagtga ctcttaggtt ttaaaacgaa     5520

aattcttatt cttgagtaac tctttcctgt aggtcaggtt gctttctcag gtatagcatg     5580

aggtcgctct tattgaccac acctctaccg gcatgccgag caaatgcctg caaatcgctc     5640

cccatttcac ccaattgtag atatgctaac tccagcaatg agttgatgaa tctcggtgtg     5700

tattttatgt cctcagagga caacacctgt ggtactagtt ctagagcggc cgcccgcaaa     5760

ttaaagcctt cgagcgtccc aaaaccttct caagcaaggt tttcagtata atgttacatg     5820

cgtacacgcg tttgtacaga aaaaaaagaa aaatttgaaa tataaataac gttcttaata     5880

ctaacataac tattaaaaaa aataaatagg gacctagact tcaggttgtc taactccttc     5940

cttttcggtt agagcggatg tgggaggagg gcgtgaatgt aagcgtgaca taactaatta     6000

catgattaat taattatgct tcaacaattg ccaagatatc tgattcagac atgatcaaaa     6060

cttcttcgtt atcaatcttt tctgacttaa caccgtaacc atcattgaaa ataacaatgt     6120

caccaacctt aacatccaaa ggcttaactt caccgttttc taaaattcta ccattaccaa     6180

cagccaaaac ttcacctctt gtactcttag ctgcagcgga accagtcaaa acaataccac     6240

ctgcagattt ggtttcaact tcctttctct taacaataac tctatcatgc aatggtctaa     6300

tattcatttt gtttgtttat gtgtgtttat tcgaaactaa gttcttggtg ttttaaaact     6360

aaaaaaaaga ctaactataa aagtagaatt taagaagttt aagaaataga tttacagaat     6420

tacaatcaat acctaccgtc tttatatact tattagtcaa gtaggggaat aatttcaggg     6480

aactggtttc aacctttttt ttcagctttt tccaaatcag agagagcaga aggtaataga     6540

aggtgtaaga aaatgagata gatacatgcg tgggtcaatt gccttgtgtc atcatttact     6600

ccaggcaggt tgcatcactc cattgaggtt gtgcccgttt tttgcctgtt tgtgcccctg     6660

ttctctgtag ttgcgctaag agaatggacc tatgaactga tggttggtga agaaaacaat     6720

attttggtgc tgggattctt tttttttctg gatgccagct taaaaagcgg gctccattat     6780

atttagtgga tgccaggaat aaactgttca cccagacacc tacgatgtta tatattctgt     6840

gtaacccgcc ccctattttg ggcatgtacg ggttacagca gaattaaaag gctaattttt     6900

tgactaaata aagttaggaa aatcactact attaattatt tacgtattct ttgaaatggc     6960

agtattgata atgataaact cgaactagat ctatccgcgg tggagctcca attcgcccta     7020

tagtgagtcg tattacaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc     7080

tggcgttacc caacttaatc gccttgcagc acatcccccc ttcgccagct ggcgtaatag     7140

cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg     7200

cgacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac     7260

cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc     7320

cacgttcgcc ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt     7380

tagtgcttta cggcacctcg accccaaaaa acttgattag ggtgatggtt cacgtagtgg     7440

gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag     7500

tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt     7560

ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt     7620

taacgcgaat tttaacaaaa tattaacgtt tacaatttcc tgatgcggta ttttctcctt     7680

acgcatctgt gcggtatttc acaccgcata tgatccgtcg agttcaagag aaaaaaaaag     7740

aaaaagcaaa aagaaaaaag gaaagcgcgc ctcgttcaga atgacacgta tagaatgatg     7800

cattaccttg tcatcttcag tatcatactg ttcgtataca tacttactga cattcatagg     7860

tatacatata tacacatgta tatatatcgt atgctgcagc tttaaataat cggtgtcact     7920

acataagaac acctttggtg gagggaacat cgttggtacc attgggcgag gtggcttctc     7980

ttatggcaac cgcaagagcc ttgaacgcac tctcactacg gtgatgatca ttcttgcctc     8040

gcagacaatc aacgtggagg gtaattctgc tagcctctgc aaagctttca agaaaatgcg     8100

ggatcatctc gcaagagaga tctcctactt tctccctttg caaaccaagt tcgacaactg     8160

cgtacggcct gttcgaaaga tctaccaccg ctctggaaag tgcctcatcc aaaggcgcaa     8220

atcctgatcc aaaccttttt actccacgcg ccagtagggc ctctttaaaa gcttgaccga     8280

gagcaatccc gcagtcttca gtggtgtgat ggtcgtctat gtgtaagtca ccaatgcact     8340

caacgattag cgaccagccg gaatgcttgg ccagagcatg tatcatatgg tccagaaacc     8400

ctatacctgt gtggacgtta atcacttgcg attgtgtggc ctgttctgct actgcttctg     8460

cctctttttc tgggaagatc gagtgctcta tcgctagggg accacccttt aaagagatcg     8520

caatctgaat cttggtttca tttgtaatac gctttactag ggctttctgc tctgtcatct     8580

ttgccttcgt ttatcttgcc tgctcatttt ttagtatatt cttcgaagaa atcacattac     8640

tttatataat gtataattca ttatgtgata atgccaatcg ctaagaaaaa aaaagagtca     8700

tccgctaggt ggaaaaaaaa aaatgaaaat cattaccgag gcataaaaaa atatagagtg     8760

tactagagga ggccaagagt aatagaaaaa gaaaattgcg ggaaaggact gtgttatgac     8820

ttccctgact aatgccgtgt tcaaacgata cctggcagtg actcctagcg ctcaccaagc     8880

tcttaaaacg gaattatggt gcactctcag tacaatctgc tctgatgccg catagttaag     8940

ccagccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc     9000

atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc     9060

gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata cgcctatttt tataggttaa     9120

tgtcatgata ataatggttt cttaggacgg atcgcttgcc tgtaacttac acgcgcctcg     9180

tatcttttaa tgatggaata atttgggaat ttactctgtg tttatttatt tttatgtttt     9240

gtatttggat tttagaaagt aaataaagaa ggtagaagag ttacggaatg aagaaaaaaa     9300

aataaacaaa ggtttaaaaa atttcaacaa aaagcgtact ttacatatat atttattaga     9360

caagaaaagc agattaaata gatatacatt cgattaacga taagtaaaat gtaaaatcac     9420

aggattttcg tgtgtggtct tctacacaga caagatgaaa caattcggca ttaatacctg     9480

agagcaggaa gagcaagata aaaggtagta tttgttggcg atccccctag agtcttttac     9540

atcttcggaa aacaaaaact attttttctt taatttcttt ttttactttc tatttttaat     9600

ttatatattt atattaaaaa atttaaatta taattatttt tatagcacgt gatgaaaagg     9660

acccaggtgg cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa     9720

tacattcaaa tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt     9780

gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg     9840

cattttgcct tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag     9900

atcagttggg tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg     9960

agagttttcg ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg    10020

gcgcggtatt atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt    10080

ctcagaatga cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga    10140

cagtaagaga attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac    10200

ttctgacaac gatcggagga ccgaaggagc taaccgcttt ttttcacaac atgggggatc    10260

atgtaactcg ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc    10320

gtgacaccac gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac    10380

tacttactct agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag    10440

gaccacttct gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg    10500

gtgagcgtgg gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta    10560

tcgtagttat ctacacgacg ggcagtcagg caactatgga tgaacgaaat agacagatcg    10620

ctgagatagg tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata    10680

tactttagat tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt    10740

ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc    10800

ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct    10860

tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa    10920

ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag    10980

tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc    11040

tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg    11100

actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca    11160

cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat    11220

gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg    11280

tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc    11340

ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc    11400

ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc    11460

cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg    11520

cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga    11580

gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct ccccgcgcgt tggccgattc    11640

attaatgcag ctggcacgac aggtttcccg actggaaagc gggcagtgag cgcaacgcaa    11700

ttaatgtgag ttacctcact cattaggcac cccaggcttt acactttatg cttccggctc    11760

ctatgttgtg tggaattgtg agcggataac aatttcacac aggaaacagc tatgaccatg    11820

attacgccaa gctcggaatt aaccctcact aaagggaaca aaagctgggt accgggcccc    11880

ccgtcgacgg tatcgataag cttgatatcg aattcctgca gcccgaataa aaaacacgct    11940

ttttcagttc gagtttatca ttatcaatac tgccatttca aagaatacgt aaataattaa    12000

tagtagtgat tttcctaact ttatttagtc aaaaaattag ccttttaatt ctgctgtaac    12060

ccgtacatgc ccaaaatagg gggcgggtta cacagaatat ataacatcgt aggtgtctgg    12120

gtgaacagtt tattcctggc atccactaaa tataatggag cccgcttttt aagctggcat    12180

ccagaaaaaa aaagaatccc agcaccaaaa tattgttttc ttcaccaacc atcagttcat    12240

aggtccattc tcttagcgca actacagaga acaggggcac aaacaggcaa aaaacgggca    12300

caacctcaat ggagtgatgc aacctgcctg gagtaaatga tgacacaagg caattgaccc    12360

acgcatgtat ctatctcatt ttcttacacc ttctattacc ttctgctctc tctgatttgg    12420

aaaaagctga aaaaaaaggt tgaaaccagt tccctgaaat tattccccta cttgactaat    12480

aagtatataa agacggtagg tattgattgt aattctgtaa atctatttct taaacttctt    12540

aaattctact tttatagtta gtcttttttt tagttttaaa acaccaagaa cttagtttcg    12600

aataaacaca cataaacaaa cagatcacta gtcaccggtg gc                       12642


<210>  68
<211>  8848
<212>  DNA
<213>  Artificial sequence

<220>
<223>  constructed plasmid

<400>  68
ccaattcgcc ctatagtgag tcgtattaca attcactggc cgtcgtttta caacgtcgtg       60

actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cccttcgcca      120

gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga      180

atggcgaatg gcgcgacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta      240

cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc      300

cttcctttct cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt      360

tagggttccg atttagtgct ttacggcacc tcgaccccaa aaaacttgat tagggtgatg      420

gttcacgtag tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca      480

cgttctttaa tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcggtct      540

attcttttga tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga      600

tttaacaaaa atttaacgcg aattttaaca aaatattaac gtttacaatt tcctgatgcg      660

gtattttctc cttacgcatc tgtgcggtat ttcacaccgc atatgatccg tcgagttcaa      720

gagaaaaaaa aagaaaaagc aaaaagaaaa aaggaaagcg cgcctcgttc agaatgacac      780

gtatagaatg atgcattacc ttgtcatctt cagtatcata ctgttcgtat acatacttac      840

tgacattcat aggtatacat atatacacat gtatatatat cgtatgctgc agctttaaat      900

aatcggtgtc actacataag aacacctttg gtggagggaa catcgttggt accattgggc      960

gaggtggctt ctcttatggc aaccgcaaga gccttgaacg cactctcact acggtgatga     1020

tcattcttgc ctcgcagaca atcaacgtgg agggtaattc tgctagcctc tgcaaagctt     1080

tcaagaaaat gcgggatcat ctcgcaagag agatctccta ctttctccct ttgcaaacca     1140

agttcgacaa ctgcgtacgg cctgttcgaa agatctacca ccgctctgga aagtgcctca     1200

tccaaaggcg caaatcctga tccaaacctt tttactccac gcgccagtag ggcctcttta     1260

aaagcttgac cgagagcaat cccgcagtct tcagtggtgt gatggtcgtc tatgtgtaag     1320

tcaccaatgc actcaacgat tagcgaccag ccggaatgct tggccagagc atgtatcata     1380

tggtccagaa accctatacc tgtgtggacg ttaatcactt gcgattgtgt ggcctgttct     1440

gctactgctt ctgcctcttt ttctgggaag atcgagtgct ctatcgctag gggaccaccc     1500

tttaaagaga tcgcaatctg aatcttggtt tcatttgtaa tacgctttac tagggctttc     1560

tgctctgtca tctttgcctt cgtttatctt gcctgctcat tttttagtat attcttcgaa     1620

gaaatcacat tactttatat aatgtataat tcattatgtg ataatgccaa tcgctaagaa     1680

aaaaaaagag tcatccgcta ggtggaaaaa aaaaaatgaa aatcattacc gaggcataaa     1740

aaaatataga gtgtactaga ggaggccaag agtaatagaa aaagaaaatt gcgggaaagg     1800

actgtgttat gacttccctg actaatgccg tgttcaaacg atacctggca gtgactccta     1860

gcgctcacca agctcttaaa acggaattat ggtgcactct cagtacaatc tgctctgatg     1920

ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt     1980

gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc     2040

agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat     2100

ttttataggt taatgtcatg ataataatgg tttcttagga cggatcgctt gcctgtaact     2160

tacacgcgcc tcgtatcttt taatgatgga ataatttggg aatttactct gtgtttattt     2220

atttttatgt tttgtatttg gattttagaa agtaaataaa gaaggtagaa gagttacgga     2280

atgaagaaaa aaaaataaac aaaggtttaa aaaatttcaa caaaaagcgt actttacata     2340

tatatttatt agacaagaaa agcagattaa atagatatac attcgattaa cgataagtaa     2400

aatgtaaaat cacaggattt tcgtgtgtgg tcttctacac agacaagatg aaacaattcg     2460

gcattaatac ctgagagcag gaagagcaag ataaaaggta gtatttgttg gcgatccccc     2520

tagagtcttt tacatcttcg gaaaacaaaa actatttttt ctttaatttc tttttttact     2580

ttctattttt aatttatata tttatattaa aaaatttaaa ttataattat ttttatagca     2640

cgtgatgaaa aggacccagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt     2700

ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg     2760

cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt     2820

cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta     2880

aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc     2940

ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa     3000

gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca actcggtcgc     3060

cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt     3120

acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact     3180

gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc tttttttcac     3240

aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata     3300

ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta     3360

ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg     3420

gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat     3480

aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt     3540

aagccctccc gtatcgtagt tatctacacg acgggcagtc aggcaactat ggatgaacga     3600

aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa     3660

gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag     3720

gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac     3780

tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc     3840

gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat     3900

caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat     3960

actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct     4020

acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt     4080

cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg     4140

gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta     4200

cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg     4260

gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg     4320

tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc     4380

tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg     4440

gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat     4500

aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc     4560

agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc tctccccgcg     4620

cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa agcgggcagt     4680

gagcgcaacg caattaatgt gagttacctc actcattagg caccccaggc tttacacttt     4740

atgcttccgg ctcctatgtt gtgtggaatt gtgagcggat aacaatttca cacaggaaac     4800

agctatgacc atgattacgc caagctcgga attaaccctc actaaaggga acaaaagctg     4860

ggtaccgggc cccccgtcga cggtatcgat aagcttgata tcgaattcct gcagcccggg     4920

ggatcctttt ctggcaacca aacccataca tcgggattcc tataatacct tcgttggtct     4980

ccctaacatg taggtggcgg aggggagata tacaatagaa cagataccag acaagacata     5040

atgggctaaa caagactaca ccaattacac tgcctcattg atggtggtac ataacgaact     5100

aatactgtag ccctagactt gatagccatc atcatatcga agtttcacta ccctttttcc     5160

atttgccatc tattgaagta ataataggcg catgcaactt cttttctttt tttttctttt     5220

ctctctcccc cgttgttgtc tcaccatatc cgcaatgaca aaaaaatgat ggaagacact     5280

aaaggaaaaa attaacgaca aagacagcac caacagatgt cgttgttcca gagctgatga     5340

ggggtatctc gaagcacacg aaactttttc cttccttcat tcacgcacac tactctctaa     5400

tgagcaacgg tatacggcct tccttccagt tacttgaatt tgaaataaaa aaaagtttgc     5460

tgtcttgcta tcaagtataa atagacctgc aattattaat cttttgtttc ctcgtcattg     5520

ttctcgttcc ctttcttcct tgtttctttt tctgcacaat atttcaagct ataccaagca     5580

tacaatcaac tatctcatat acaatggctg ctaaagatgt aaagttcggt aatgatgcta     5640

gagtaaaaat gttgagaggt gtaaatgtat tggctgacgc tgtaaaagta actttgggtc     5700

caaaaggtag aaatgttgtc ttggataagt cttttggtgc tcctaccata actaaagacg     5760

gtgtttcagt cgcaagagaa atcgaattgg aggataagtt cgaaaacatg ggtgctcaaa     5820

tggtcaaaga agtcgcctct aaggctaacg atgctgcagg tgacggtact acaaccgcta     5880

ctgttttggc tcaagcaatt ataacagaag gtttaaaagc agttgccgct ggtatgaatc     5940

caatggattt gaaaagaggt attgacaagg ccgtcactgc agccgtagaa gaattgaaag     6000

cattatcagt cccttgttct gattcaaagg ccatcgctca agtaggtacc atttccgcta     6060

acagtgatga aactgttggt aaattaattg cagaagccat ggacaaagtc ggtaaagaag     6120

gtgtaataac cgttgaagat ggtactggtt tgcaagatga attagacgta gttgagggta     6180

tgcaatttga tagaggttat ttgtcaccat acttcatcaa taagcctgaa acaggtgctg     6240

ttgaattgga atcccctttt attttgttgg cagataaaaa gattagtaac ataagagaaa     6300

tgttgccagt tttagaagct gtcgcaaaag ccggtaaacc tttgttaatc attgctgaag     6360

atgttgaagg tgaagcattg gcaacattag tcgtaaatac catgagaggt attgtaaaag     6420

ttgctgcagt taaggctcca ggtttcggtg acagaagaaa agctatgttg caagacattg     6480

caacattaac cggtggtaca gttatctccg aagaaattgg tatggaattg gaaaaggcca     6540

ccttggaaga tttgggtcaa gctaagagag ttgtcattaa taaggatact acaaccatca     6600

tcgacggtgt aggtgaagaa gccgctatac aaggtagagt tgctcaaata agacaacaaa     6660

tcgaagaagc aacttctgat tatgacagag aaaaattgca agaaagagtt gcaaagttag     6720

ccggtggtgt cgctgtaatt aaagttggtg cagccaccga agtcgaaatg aaggaaaaga     6780

aagcaagagt agaagatgct ttgcatgcaa caagagctgc agttgaagaa ggtgtagttg     6840

caggtggtgg tgtcgcctta attagagtag cctccaaatt ggctgatttg agaggtcaaa     6900

atgaagacca aaacgtaggt atcaaggttg ccttaagagc tatggaagca ccattgagac     6960

aaatcgtttt gaactgtggt gaagaaccta gtgtcgtagc taacactgtt aaaggtggtg     7020

acggtaatta tggttacaac gccgctacag aagaatacgg taacatgatc gatatgggta     7080

tattggaccc aactaaggtc acaagatctg cattgcaata cgcagcctca gttgccggtt     7140

taatgattac tacagaatgc atggttacag atttgcctaa aaacgacgct gccgacttgg     7200

gtgccgcagg tggtatgggt ggtatgggtg gtatgggtgg tatgatgtga ttaattaaga     7260

gtaagcgaat ttcttatgat ttatgatttt tattattaaa taagttataa aaaaaataag     7320

tgtatacaaa ttttaaagtg actcttaggt tttaaaacga aaattcttat tcttgagtaa     7380

ctctttcctg taggtcaggt tgctttctca ggtatagcat gaggtcgctc ttattgacca     7440

cacctctacc ggcatgccga gcaaatgcct gcaaatcgct ccccatttca cccaattgta     7500

gatatgctaa ctccagcaat gagttgatga atctcggtgt gtattttatg tcctcagagg     7560

acaacacctg tggtactagt tctagagcgg ccgcccgcaa attaaagcct tcgagcgtcc     7620

caaaaccttc tcaagcaagg ttttcagtat aatgttacat gcgtacacgc gtttgtacag     7680

aaaaaaaaga aaaatttgaa atataaataa cgttcttaat actaacataa ctattaaaaa     7740

aaataaatag ggacctagac ttcaggttgt ctaactcctt ccttttcggt tagagcggat     7800

gtgggaggag ggcgtgaatg taagcgtgac ataactaatt acatgattaa ttaattatgc     7860

ttcaacaatt gccaagatat ctgattcaga catgatcaaa acttcttcgt tatcaatctt     7920

ttctgactta acaccgtaac catcattgaa aataacaatg tcaccaacct taacatccaa     7980

aggcttaact tcaccgtttt ctaaaattct accattacca acagccaaaa cttcacctct     8040

tgtactctta gctgcagcgg aaccagtcaa aacaatacca cctgcagatt tggtttcaac     8100

ttcctttctc ttaacaataa ctctatcatg caatggtcta atattcattt tgtttgttta     8160

tgtgtgttta ttcgaaacta agttcttggt gttttaaaac taaaaaaaag actaactata     8220

aaagtagaat ttaagaagtt taagaaatag atttacagaa ttacaatcaa tacctaccgt     8280

ctttatatac ttattagtca agtaggggaa taatttcagg gaactggttt caaccttttt     8340

tttcagcttt ttccaaatca gagagagcag aaggtaatag aaggtgtaag aaaatgagat     8400

agatacatgc gtgggtcaat tgccttgtgt catcatttac tccaggcagg ttgcatcact     8460

ccattgaggt tgtgcccgtt ttttgcctgt ttgtgcccct gttctctgta gttgcgctaa     8520

gagaatggac ctatgaactg atggttggtg aagaaaacaa tattttggtg ctgggattct     8580

ttttttttct ggatgccagc ttaaaaagcg ggctccatta tatttagtgg atgccaggaa     8640

taaactgttc acccagacac ctacgatgtt atatattctg tgtaacccgc cccctatttt     8700

gggcatgtac gggttacagc agaattaaaa ggctaatttt ttgactaaat aaagttagga     8760

aaatcactac tattaattat ttacgtattc tttgaaatgg cagtattgat aatgataaac     8820

tcgaactaga tctatccgcg gtggagct                                        8848


<210>  69
<211>  21
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  69
agagtgcgtt caaggctctt g                                                 21


<210>  70
<211>  21
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  70
gagggaacat cgttggtacc a                                                 21


<210>  71
<211>  25
<212>  DNA
<213>  Artificial sequence

<220>
<223>  probe

<400>  71
ttgccataag agaagccacc tcgcc                                             25


<210>  72
<211>  21
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  72
ttgcgaagag cgacaaagat t                                                 21


<210>  73
<211>  22
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  73
ccttcatctc ttccacccat gt                                                22


<210>  74
<211>  24
<212>  DNA
<213>  Artificial sequence

<220>
<223>  probe

<400>  74
tgttatcggc tttattgctc aaag                                              24


<210>  75
<211>  24
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  75
cattgcaaga tgtttacaag attg                                              24


<210>  76
<211>  22
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  76
tgatgacacc ggtttcaact ct                                                22


<210>  77
<211>  23
<212>  DNA
<213>  Artificial sequence

<220>
<223>  probe

<400>  77
tggtattggt actgtgccag tcg                                               23


<210>  78
<211>  26
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  78
ccgtagaaga attgaaagca ttatca                                            26


<210>  79
<211>  25
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  79
gttagcggaa atggtaccta cttga                                             25


<210>  80
<211>  26
<212>  DNA
<213>  Artificial sequence

<220>
<223>  probe

<400>  80
cccttgttct gattcaaagg ccatcg                                            26


<210>  81
<211>  19
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  81
gcagcggaac cagtcaaaa                                                    19


<210>  82
<211>  30
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  82
gcatgataga gttattgtta agagaaagga                                        30


<210>  83
<211>  23
<212>  DNA
<213>  Artificial sequence

<220>
<223>  probe

<400>  83
ccacctgcag atttggtttc aac                                               23


<210>  84
<211>  20
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  84
ggcaagcaag agacgcattc                                                   20


<210>  85
<211>  25
<212>  DNA
<213>  Artificial sequence

<220>
<223>  primer

<400>  85
aatttatgga cagcttcaac tggat                                             25


<210>  86
<211>  22
<212>  DNA
<213>  Artificial sequence

<220>
<223>  probe

<400>  86
tgacgcaacc agaactgcct tg                                                22


<210>  87
<211>  16404
<212>  DNA
<213>  Artificial sequence

<220>
<223>  constructed plasmid

<400>  87
gatccacgat cgcattgcgg attacgtatt ctaatgttca gtaccgttcg tataatgtat       60

gctatacgaa gttatgcaga ttgtactgag agtgcaccat accacagctt ttcaattcaa      120

ttcatcattt tttttttatt cttttttttg atttcggttt ctttgaaatt tttttgattc      180

ggtaatctcc gaacagaagg aagaacgaag gaaggagcac agacttagat tggtatatat      240

acgcatatgt agtgttgaag aaacatgaaa ttgcccagta ttcttaaccc aactgcacag      300

aacaaaaacc tgcaggaaac gaagataaat catgtcgaaa gctacatata aggaacgtgc      360

tgctactcat cctagtcctg ttgctgccaa gctatttaat atcatgcacg aaaagcaaac      420

aaacttgtgt gcttcattgg atgttcgtac caccaaggaa ttactggagt tagttgaagc      480

attaggtccc aaaatttgtt tactaaaaac acatgtggat atcttgactg atttttccat      540

ggagggcaca gttaagccgc taaaggcatt atccgccaag tacaattttt tactcttcga      600

agacagaaaa tttgctgaca ttggtaatac agtcaaattg cagtactctg cgggtgtata      660

cagaatagca gaatgggcag acattacgaa tgcacacggt gtggtgggcc caggtattgt      720

tagcggtttg aagcaggcgg cagaagaagt aacaaaggaa cctagaggcc ttttgatgtt      780

agcagaattg tcatgcaagg gctccctatc tactggagaa tatactaagg gtactgttga      840

cattgcgaag agcgacaaag attttgttat cggctttatt gctcaaagag acatgggtgg      900

aagagatgaa ggttacgatt ggttgattat gacacccggt gtgggtttag atgacaaggg      960

agacgcattg ggtcaacagt atagaaccgt ggatgatgtg gtctctacag gatctgacat     1020

tattattgtt ggaagaggac tatttgcaaa gggaagggat gctaaggtag agggtgaacg     1080

ttacagaaaa gcaggctggg aagcatattt gagaagatgc ggccagcaaa actaaaaaac     1140

tgtattataa gtaaatgcat gtatactaaa ctcacaaatt agagcttcaa tttaattata     1200

tcagttatta ccctatgcgg tgtgaaatac cgcacagatg cgtaaggaga aaataccgca     1260

tcaggaaatt gtaaacgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag     1320

ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac     1380

cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa agaacgtgga     1440

ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc     1500

accctaatca agataacttc gtataatgta tgctatacga acggtacccg ccaactctgt     1560

tcgagaatga tgtaatcaag aaggtctcac aaaaccatcc aggcagtacc acttcccaag     1620

tattgcttag atgggcaact cagagaggca ttgccgtcat tccaaaatct tccaagaagg     1680

aaaggttact tggcaaccta gaaatcgaaa aaaagttcac tttaacggag caagaattga     1740

aggatatttc tgcactaaat gccaacatca gatttaatga tccatggacc tggttggatg     1800

gtaaattccc cacttttgcc tgatccagcc agtaaaatcc atactcaacg acgatatgaa     1860

caaatttccc tcattccgat gctgtatatg tgtataaatt tttacatgct cttctgttta     1920

gacacagaac agctttaaat aaaatgttgg atatactttt tctgcctgtg gtgtcatcca     1980

cgcttttaat tcatctcttg tatggttgac aatttggcta ttttttaaca gaacccaacg     2040

gtaattgaaa ttaaaaggga aacgagtggg ggcgatgagt gagtgatacg gcgcctgatg     2100

cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatggtg cactctcagt     2160

acaatctgct ctgatgccgc atagttaagc cagccccgac acccgccaac acccgctgac     2220

gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc     2280

gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag acgaaagggc     2340

ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca     2400

ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat     2460

tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa     2520

aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt     2580

tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag     2640

ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt     2700

tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg     2760

gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag     2820

aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta     2880

agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg     2940

acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta     3000

actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac     3060

accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt     3120

actctagctt cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca     3180

cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag     3240

cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta     3300

gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag     3360

ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt     3420

tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat     3480

aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta     3540

gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa     3600

acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt     3660

tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag     3720

ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta     3780

atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca     3840

agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag     3900

cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa     3960

agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga     4020

acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc     4080

gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc     4140

ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt     4200

gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt     4260

gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag     4320

gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa     4380

tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat     4440

gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg     4500

ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattag     4560

gcgcctactt ctagggggcc tatcaagtaa attactcctg gtacactgaa gtatataagg     4620

gatatagaag caaatagttg tcagtgcaat ccttcaagac gattgggaaa atactgtaat     4680

ataaatcgta aaggaaaatt ggaaattttt taaagatgtc ttcactggtt actcttaata     4740

acggtctgaa aatgccccta gtcggcttag ggtgctggaa aattgacaaa aaagtctgtg     4800

cgaatcaaat ttatgaagct atcaaattag gctaccgttt attcgatggt gcttgcgact     4860

acggcaacga aaaggaagtt ggtgaaggta tcaggaaagc catctccgaa ggtcttgttt     4920

ctagaaagga tatatttgtt gtttcaaagt tatggaacaa ttttcaccat cctgatcatg     4980

taaaattagc tttaaagaag accttaagcg atatgggact tgattattta gacctgtatt     5040

atattcactt cccaatcgcc ttcaaatatg ttccatttga agagaaatac cctccaggat     5100

tctatacggg cgcagaagga ttctatacgg gcgcagaact agtgatctcg aggttccaga     5160

gctcggatcc accacaggtg ttgtcctctg aggacataaa atacacaccg agattcatca     5220

actcattgct ggagttagca tatctacaat tgggtgaaat ggggagcgat ttgcaggcat     5280

ttgctcggca tgccggtaga ggtgtggtca ataagagcga cctcatgcta tacctgagaa     5340

agcaacctga cctacaggaa agagttactc aagaataaga attttcgttt taaaacctaa     5400

gagtcacttt aaaatttgta tacacttatt ttttttataa cttatttaat aataaaaatc     5460

ataaatcata agaaattcgc ttactcatcc cgggttagat gagagtcttt tccagttcgc     5520

ttaaggggac aatcttggaa ttatagcgat cccaattttc attatccaca tcggatatgc     5580

tttccattac atgccatgga aaattgtcat tcagaaattt atcaaaagga actgcaattt     5640

tattagagtc atataacaat gaccacatgg ccttataaca accaccaagg gcacatgagt     5700

ttggtgtttc tagcctaaaa ttaccctttg tagcaccaat gacttgagca aacttcttca     5760

caatagcatc gtttttagaa gccccaccta caaaaaaagt cctttctggc cttttattta     5820

ggtagtcccg cagcggagat tcatcgtaat caaacttcac gattgtatct tcgttcagtc     5880

tctgttgtga gcttgcgttt gaatccgaaa gcaggggaga tattcttacc ctgcaactta     5940

aagcctgtga ttctacaata tttttggcat cgtgcctctt gtctttgaac ttggccacct     6000

ctctttcaat catacccgtt tttggattga agataaccct tttgtttatg gcttttacgc     6060

taggaacgat ctcccccaga ggaaaatata cacctaattc attttcacta ctttctgagt     6120

catctagcac agcttgatta aaaagagtcc aatcgttagt cttctcataa ttattttccc     6180

gttctttgtt taactcgtct cttatcctct cccttgccaa agaaccatta caataacaaa     6240

tcatacccat ataatggttt ggcagagttg gatgaatgaa aagatgatag ttcggagagg     6300

ggtgatactt atcggtgacc agaagaactg tagtacttgt tcctagggaa acgagaacgt     6360

cattcttccg caggggtaaa gaacatatag tggctaaatt atccccagtc atgggagaga     6420

ccttgcagtt tgtattgaaa ccgtacttct caataaaata tttacagatg gtacccgcta     6480

tcaaattttt catgggtgct ctcattaatt tttgtctgat agttttatcc ttagaagaac     6540

tatcaattag atgtagtagc tcatcactga attttctttc acgtatatca taaaggttca     6600

taccacaggc atctgcctcc tctaattcaa caagatggcc cactaagata gaagtcaaaa     6660

aattagacac taaagaaatg gtctttgttt tttcgtaagc ttctggttct aattgtgcaa     6720

ttttcagaat ttgaggacca gtaaatctaa aatgggctct ggaccctgtt aattgagcca     6780

ttttttcagg cccacctatg cactcttcaa actcttgaca ttgctttgca gtactgtggt     6840

cttgccaatt gggggcggtt tgccttgcaa atgctacaga gctcacgtag tgcaataaat     6900

ctttttccgg tttcttattc aattgctcta acagagattc ggcttgggag gaccagtaga     6960

cagacccgtg ctgctggcag gaccctgaga cggccataac tttgttcaat ggaaatttag     7020

cctcgcgata tttcgagaga accagatcta gagcctctaa ccacatggct acgggacatt     7080

cgatagtgtc gccgtgtata tagacaccct tctttgtgtg ataatgcgga agatcctttt     7140

caaattccac tgtttctgaa tggacaattt ttaggtcctg gttaatggcg agacatttca     7200

gttgttgggt cgaaagatca aacccaagat agtatgagtc taaagacatt gtgttggaaa     7260

cctctcttgt ctgtctctga attactgaac acaacatact agtcgtacgg ttttattttt     7320

tacttatatt gctggtaggg taaaaaaata taactcctag gaataggttg tctatatgtt     7380

tttgtcttgc ttctataatt gtaacaaaca aggaaaggga aaatactggg tgtaaaagcc     7440

attgagtcaa gttaggtcat cccttttata caaaattttt caattttttt tccaagattc     7500

ttgtacgatt aattattttt tttttgcgtc ctacagcgtg atgaaaattt ccgcctgctg     7560

caagatgagc gggaacgggc gaaatgtgca cgcgcacaac ttacgaaacg cggatgagtc     7620

actgacagcc accgcagagg ttctgactcc tactgagctc tattggaggt ggcagaaccg     7680

gtaccggagg agaccgctat aaccggtttg aatttattgt cacagtgtca catcagcggc     7740

aactcagaag tttgacagca agcaagttca tcattcgaac tagccttatt gttttagttc     7800

agtgacagcg aactgccgta ctcgatgctt tatttctcac ggtagagcgg aagaacagat     7860

aggggcagcg tgagaagagt tagaaagtaa atttttatca cgtctgaagt attcttattc     7920

ataggaaatt ttgcaaggtt ttttagctca ataacgggct aagttatata aggtgttcac     7980

gcgattttct tgttatgtat acctcttctg gcgcgcctct ttttattaac cttaattttt     8040

attttagatt cctgacttca actcaagacg cacagatatt ataacatctg cataataggc     8100

atttgcaaga attactcgtg agtaaggaaa gagtgaggaa ctatcgcata cctgcattta     8160

aagatgccga tttgggcgcg aatcctttat tttggcttca ccctcatact attatcaggg     8220

ccagaaaaag gaagtgtttc cctccttctt gaattgatgt taccctcata aagcacgtgg     8280

cctcttatcg agaaagaaat taccgtcgct cgtgatttgt ttgcaaaaag aacaaaactg     8340

aaaaaaccca gacacgctcg acttcctgtc ttcctattga ttgcagcttc caatttcgtc     8400

acacaacaag gtcctagcga cggctcacag gttttgtaac aagcaatcga aggttctgga     8460

atggcgggaa agggtttagt accacatgct atgatgccca ctgtgatctc cagagcaaag     8520

ttcgttcgat cgtactgtta ctctctctct ttcaaacaga attgtccgaa tcgtgtgaca     8580

acaacagcct gttctcacac actcttttct tctaaccaag ggggtggttt agtttagtag     8640

aacctcgtga aacttacatt tacatatata taaacttgca taaattggtc aatgcaagaa     8700

atacatattt ggtcttttct aattcgtagt ttttcaagtt cttagatgct ttctttttct     8760

cttttttaca gatcatcaag gaagtaatta tctacttttt acaacaaata taaaacacgt     8820

acgactagta tgactcaatt cactgacatt gataagttgg ccgtctccac cataagaatt     8880

ttggctgtgg acaccgtatc caaggccaac tcaggtcacc caggtgctcc attgggtatg     8940

gcaccagctg cacacgttct atggagtcaa atgcgcatga acccaaccaa cccagactgg     9000

atcaacagag atagatttgt cttgtctaac ggtcacgcgg tcgctttgtt gtattctatg     9060

ctacatttga ctggttacga tctgtctatt gaagacttga aacagttcag acagttgggt     9120

tccagaacac caggtcatcc tgaatttgag ttgccaggtg ttgaagttac taccggtcca     9180

ttaggtcaag gtatctccaa cgctgttggt atggccatgg ctcaagctaa cctggctgcc     9240

acttacaaca agccgggctt taccttgtct gacaactaca cctatgtttt cttgggtgac     9300

ggttgtttgc aagaaggtat ttcttcagaa gcttcctcct tggctggtca tttgaaattg     9360

ggtaacttga ttgccatcta cgatgacaac aagatcacta tcgatggtgc taccagtatc     9420

tcattcgatg aagatgttgc taagagatac gaagcctacg gttgggaagt tttgtacgta     9480

gaaaatggta acgaagatct agccggtatt gccaaggcta ttgctcaagc taagttatcc     9540

aaggacaaac caactttgat caaaatgacc acaaccattg gttacggttc cttgcatgcc     9600

ggctctcact ctgtgcacgg tgccccattg aaagcagatg atgttaaaca actaaagagc     9660

aaattcggtt tcaacccaga caagtccttt gttgttccac aagaagttta cgaccactac     9720

caaaagacaa ttttaaagcc aggtgtcgaa gccaacaaca agtggaacaa gttgttcagc     9780

gaataccaaa agaaattccc agaattaggt gctgaattgg ctagaagatt gagcggccaa     9840

ctacccgcaa attgggaatc taagttgcca acttacaccg ccaaggactc tgccgtggcc     9900

actagaaaat tatcagaaac tgttcttgag gatgtttaca atcaattgcc agagttgatt     9960

ggtggttctg ccgatttaac accttctaac ttgaccagat ggaaggaagc ccttgacttc    10020

caacctcctt cttccggttc aggtaactac tctggtagat acattaggta cggtattaga    10080

gaacacgcta tgggtgccat aatgaacggt atttcagctt tcggtgccaa ctacaaacca    10140

tacggtggta ctttcttgaa cttcgtttct tatgctgctg gtgccgttag attgtccgct    10200

ttgtctggcc acccagttat ttgggttgct acacatgact ctatcggtgt cggtgaagat    10260

ggtccaacac atcaacctat tgaaacttta gcacacttca gatccctacc aaacattcaa    10320

gtttggagac cagctgatgg taacgaagtt tctgccgcct acaagaactc tttagaatcc    10380

aagcatactc caagtatcat tgctttgtcc agacaaaact tgccacaatt ggaaggtagc    10440

tctattgaaa gcgcttctaa gggtggttac gtactacaag atgttgctaa cccagatatt    10500

attttagtgg ctactggttc cgaagtgtct ttgagtgttg aagctgctaa gactttggcc    10560

gcaaagaaca tcaaggctcg tgttgtttct ctaccagatt tcttcacttt tgacaaacaa    10620

cccctagaat acagactatc agtcttacca gacaacgttc caatcatgtc tgttgaagtt    10680

ttggctacca catgttgggg caaatacgct catcaatcct tcggtattga cagatttggt    10740

gcctccggta aggcaccaga agtcttcaag ttcttcggtt tcaccccaga aggtgttgct    10800

gaaagagctc aaaagaccat tgcattctat aagggtgaca agctaatttc tcctttgaaa    10860

aaagctttct aaattctgat cgtagatcat cagatttgat atgatattat ttgtgaaaaa    10920

atgaaataaa actttataca acttaaatac aacttttttt ataaacgatt aagcaaaaaa    10980

atagtttcaa acttttaaca atattccaaa cactcagtcc ttttccttct tatattatag    11040

gtgtacgtat tatagaaaaa tttcaatgat tactttttct ttctttttcc ttgtaccagc    11100

acatggccga gcttgaatgt taaacccttc gagagaatca caccattcaa gtataaagcc    11160

aataaagaat ataactccta aaaggctaat tgaaaccctg tgatttttgc ccgggtttaa    11220

ggcgcgccct ttatcattat caatactgcc atttcaaaga atacgtaaat aattaatagt    11280

agtgattttc ctaactttat ttagtcaaaa aattagcctt ttaattctgc tgtaacccgt    11340

acatgcccaa aatagggggc gggttacaca gaatatataa catcgtaggt gtctgggtga    11400

acagtttatt cctggcatcc actaaatata atggagcccg ctttttaagc tggcatccag    11460

aaaaaaaaag aatcccagca ccaaaatatt gttttcttca ccaaccatca gttcataggt    11520

ccattctctt agcgcaacta cagagaacag gggcacaaac aggcaaaaaa cgggcacaac    11580

ctcaatggag tgatgcaacc tgcctggagt aaatgatgac acaaggcaat tgacccacgc    11640

atgtatctat ctcattttct tacaccttct attaccttct gctctctctg atttggaaaa    11700

agctgaaaaa aaaggttgaa accagttccc tgaaattatt cccctacttg actaataagt    11760

atataaagac ggtaggtatt gattgtaatt ctgtaaatct atttcttaaa cttcttaaat    11820

tctactttta tagttagtct tttttttagt tttaaaacac caagaactta gtttcgaata    11880

aacacacata aacaaacacc actagcatgg ctgccggtgt cccaaaaatt gatgcgttag    11940

aatctttggg caatcctttg gaggatgcca agagagctgc agcatacaga gcagttgatg    12000

aaaatttaaa atttgatgat cacaaaatta ttggaattgg tagtggtagc acagtggttt    12060

atgttgccga aagaattgga caatatttgc atgaccctaa attttatgaa gtagcgtcta    12120

aattcatttg cattccaaca ggattccaat caagaaactt gattttggat aacaagttgc    12180

aattaggctc cattgaacag tatcctcgca ttgatatagc gtttgacggt gctgatgaag    12240

tggatgagaa tttacaatta attaaaggtg gtggtgcttg tctatttcaa gaaaaattgg    12300

ttagtactag tgctaaaacc ttcattgtcg ttgctgattc aagaaaaaag tcaccaaaac    12360

atttaggtaa gaactggagg caaggtgttc ccattgaaat tgtaccttcc tcatacgtga    12420

gggtcaagaa tgatctatta gaacaattgc atgctgaaaa agttgacatc agacaaggag    12480

gttctgctaa agcaggtcct gttgtaactg acaataataa cttcattatc gatgcggatt    12540

tcggtgaaat ttccgatcca agaaaattgc atagagaaat caaactgtta gtgggcgtgg    12600

tggaaacagg tttattcatc gacaacgctt caaaagccta cttcggtaat tctgacggta    12660

gtgttgaagt taccgaaaag tgagcggccg cgtgaattta ctttaaatct tgcatttaaa    12720

taaattttct ttttatagct ttatgactta gtttcaattt atatactatt ttaatgacat    12780

tttcgattca ttgattgaaa gctttgtgtt ttttcttgat gcgctattgc attgttcttg    12840

tctttttcgc cacatgtaat atctgtagta gatacctgat acattgtgga tgctgagtga    12900

aattttagtt aataatggag gcgctcttaa taattttggg gatattggct ttttttttta    12960

aagtttacaa atgaattttt tccgccagga taacgattct gaagttactc ttagcgttcc    13020

tatcggtaca gccatcaaat catgcctata aatcatgcct atatttgcgt gcagtcagta    13080

tcatctacat gaaaaaaact cccgcaattt cttatagaat acgttgaaaa ttaaatgtac    13140

gcgccaagat aagataacat atatctagat gcagtaatat acacagattc ccgcggacgt    13200

gggaaggaaa aaattagata acaaaatctg agtgatatgg aaattccgct gtatagctca    13260

tatctttccc tccaccgcgg tggtcgactt tcacatacgt tgcatacgtc gatatagata    13320

ataatgataa tgacagcagg attatcgtaa tacgtaatag ctgaaaatct caaaaatgtg    13380

tgggtcatta cgtaaataat gataggaatg ggattcttct atttttcctt tttccattct    13440

agcagccgtc gggaaaacgt ggcatcctct ctttcgggct caattggagt cacgctgccg    13500

tgagcatcct ctctttccat atctaacaac tgagcacgta accaatggaa aagcatgagc    13560

ttagcgttgc tccaaaaaag tattggatgg ttaataccat ttgtctgttc tcttctgact    13620

ttgactcctc aaaaaaaaaa atctacaatc aacagatcgc ttcaattacg ccctcacaaa    13680

aacttttttc cttcttcttc gcccacgtta aattttatcc ctcatgttgt ctaacggatt    13740

tctgcacttg atttattata aaaagacaaa gacataatac ttctctatca atttcagtta    13800

ttgttcttcc ttgcgttatt cttctgttct tctttttctt ttgtcatata taaccataac    13860

caagtaatac atattcaaac ttaagactcg agatggtcaa accaattata gctcccagta    13920

tccttgcttc tgacttcgcc aacttgggtt gcgaatgtca taaggtcatc aacgccggcg    13980

cagattggtt acatatcgat gtcatggacg gccattttgt tccaaacatt actctgggcc    14040

aaccaattgt tacctcccta cgtcgttctg tgccacgccc tggcgatgct agcaacacag    14100

aaaagaagcc cactgcgttc ttcgattgtc acatgatggt tgaaaatcct gaaaaatggg    14160

tcgacgattt tgctaaatgt ggtgctgacc aatttacgtt ccactacgag gccacacaag    14220

accctttgca tttagttaag ttgattaagt ctaagggcat caaagctgca tgcgccatca    14280

aacctggtac ttctgttgac gttttatttg aactagctcc tcatttggat atggctcttg    14340

ttatgactgt ggaacctggg tttggaggcc aaaaattcat ggaagacatg atgccaaaag    14400

tggaaacttt gagagccaag ttcccccatt tgaatatcca agtcgatggt ggtttgggca    14460

aggagaccat cccgaaagcc gccaaagccg gtgccaacgt tattgtcgct ggtaccagtg    14520

ttttcactgc agctgacccg cacgatgtta tctccttcat gaaagaagaa gtctcgaagg    14580

aattgcgttc tagagatttg ctagattaga cgtctgttta aagattacgg atatttaact    14640

tacttagaat aatgccattt ttttgagtta taataatcct acgttagtgt gagcgggatt    14700

taaactgtga ggaccttaat acattcagac acttctgcgg tatcacccta cttattccct    14760

tcgagattat atctaggaac ccatcaggtt ggtggaagat tacccgttct aagacttttc    14820

agcttcctct attgatgtta cacctggaca ccccttttct ggcatccagt ttttaatctt    14880

cagtggcatg tgagattctc cgaaattaat taaagcaatc acacaattct ctcggatacc    14940

acctcggttg aaactgacag gtggtttgtt acgcatgcta atgcaaagga gcctatatac    15000

ctttggctcg gctgctgtaa cagggaatat aaagggcagc ataatttagg agtttagtga    15060

acttgcaaca tttactattt tcccttctta cgtaaatatt tttcttttta attctaaatc    15120

aatctttttc aattttttgt ttgtattctt ttcttgctta aatctataac tacaaaaaac    15180

acatacataa actaaaacgt acgactagta tgtctgaacc agctcaaaag aaacaaaagg    15240

ttgctaacaa ctctctagaa caattgaaag cctccggcac tgtcgttgtt gccgacactg    15300

gtgatttcgg ctctattgcc aagtttcaac ctcaagactc cacaactaac ccatcattga    15360

tcttggctgc tgccaagcaa ccaacttacg ccaagttgat cgatgttgcc gtggaatacg    15420

gtaagaagca tggtaagacc accgaagaac aagtcgaaaa tgctgtggac agattgttag    15480

tcgaattcgg taaggagatc ttaaagattg ttccaggcag agtctccacc gaagttgatg    15540

ctagattgtc ttttgacact caagctacca ttgaaaaggc tagacatatc attaaattgt    15600

ttgaacaaga aggtgtctcc aaggaaagag tccttattaa aattgcttcc acttgggaag    15660

gtattcaagc tgccaaagaa ttggaagaaa aggacggtat ccactgtaat ttgactctat    15720

tattctcctt cgttcaagca gttgcctgtg ccgaggccca agttactttg atttccccat    15780

ttgttggtag aattctagac tggtacaaat ccagcactgg taaagattac aagggtgaag    15840

ccgacccagg tgttatttcc gtcaagaaaa tctacaacta ctacaagaag tacggttaca    15900

agactattgt tatgggtgct tctttcagaa gcactgacga aatcaaaaac ttggctggtg    15960

ttgactatct aacaatttct ccagctttat tggacaagtt gatgaacagt actgaacctt    16020

tcccaagagt tttggaccct gtctccgcta agaaggaagc cggcgacaag atttcttaca    16080

tcagcgacga atctaaattc agattcgact tgaatgaaga cgctatggcc actgaaaaat    16140

tgtccgaagg tatcagaaaa ttctctgccg atattgttac tctattcgac ttgattgaaa    16200

agaaagttac cgcttaagga agtatctcgg aaatattaat ttaggccatg tccttatgca    16260

cgtttctttt gatacttacg ggtacatgta cacaagtata tctatatata taaattaatg    16320

aaaatcccct atttatatat atgactttaa cgagacagaa cagtttttta ttttttatcc    16380

tatttgatga atgatacagt ttcg                                           16404


<210>  88
<211>  95
<212>  DNA
<213>  Artificial sequence

<220>
<223>  URA3 deletion scar in the genome: after removal of the KanMX
       marker using Cre recombinase, a 95 bp sequence consisting of a
       loxP site flanked by the primer binding sites remained

<400>  88
gcattgcgga ttacgtattc taatgttcag ataacttcgt atagcataca ttatacgaag       60

ttatccagtg atgatacaac gagttagcca aggtg                                  95


<210>  89
<211>  100
<212>  DNA
<213>  Saccharomyces cerevisiae

<400>  89
gtccataaag cttttcaatt catctttttt ttttttgttc ttttttttga ttccggtttc       60

tttgaaattt ttttgattcg gtaatctccg agcagaagga                            100


<210>  90
<211>  100
<212>  DNA
<213>  Saccharomyces cerevisiae

<400>  90
aaaactgtat tataagtaaa tgcatgtata ctaaactcac aaattagagc ttcaatttaa       60

ttatatcagt tattacccgg gaatctcggt cgtaatgatt                            100


<210>  91
<211>  100
<212>  DNA
<213>  Saccharomyces cerevisiae

<400>  91
attggcatta tcacataatg aattatacat tatataaagt aatgtgattt cttcgaagaa       60

tatactaaaa aatgagcagg caagataaac gaaggcaaag                            100


<210>  92
<211>  100
<212>  DNA
<213>  Saccharomyces cerevisiae

<400>  92
tagtgacacc gattatttaa agctgcagca tacgatatat atacatgtgt atatatgtat       60

acctatgaat gtcagtaagt atgtatacga acagtatgat                            100


<210>  93
<211>  6728
<212>  DNA
<213>  Artificial sequence

<220>
<223>  constructed vector

<400>  93
acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa       60

aagtgccacc tgggtccttt tcatcacgtg ctataaaaat aattataatt taaatttttt      120

aatataaata tataaattaa aaatagaaag taaaaaaaga aattaaagaa aaaatagttt      180

ttgttttccg aagatgtaaa agactctagg gggatcgcca acaaatacta ccttttatct      240

tgctcttcct gctctcaggt attaatgccg aattgtttca tcttgtctgt gtagaagacc      300

acacacgaaa atcctgtgat tttacatttt acttatcgtt aatcgaatgt atatctattt      360

aatctgcttt tcttgtctaa taaatatata tgtaaagtac gctttttgtt gaaatttttt      420

aaacctttgt ttattttttt ttcttcattc cgtaactctt ctaccttctt tatttacttt      480

ctaaaatcca aatacaaaac ataaaaataa ataaacacag agtaaattcc caaattattc      540

catcattaaa agatacgagg cgcgtgtaag ttacaggcaa gcgatccgtc ctaagaaacc      600

attattatca tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtctcgcg      660

cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct      720

tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gcgtgttggc      780

gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat      840

aaattcccgt tttaagagct tggtgagcgc taggagtcac tgccaggtat cgtttgaaca      900

cggcattagt cagggaagtc ataacacagt cctttcccgc aattttcttt ttctattact      960

cttggcctcc tctagtacac tctatatttt tttatgcctc ggtaatgatt ttcatttttt     1020

tttttcccct agcggatgac tctttttttt tcttagcgat tggcattatc acataatgaa     1080

ttatacatta tataaagtaa tgtgatttct tcgaagaata tactaaaaaa tgagcaggca     1140

agataaacga aggcaaagat gacagagcag aaagccctag taaagcgtat tacaaatgaa     1200

accaagattc agattgcgat ctctttaaag ggtggtcccc tagcgataga gcactcgatc     1260

ttcccagaaa aagaggcaga agcagtagca gaacaggcca cacaatcgca agtgattaac     1320

gtccacacag gtatagggtt tctggaccat atgatacatg ctctggccaa gcattccggc     1380

tggtcgctaa tcgttgagtg cattggtgac ttacacatag acgaccatca caccactgaa     1440

gactgcggga ttgctctcgg tcaagctttt aaagaggccc tactggcgcg tggagtaaaa     1500

aggtttggat caggatttgc gcctttggat gaggcacttt ccagagcggt ggtagatctt     1560

tcgaacaggc cgtacgcagt tgtcgaactt ggtttgcaaa gggagaaagt aggagatctc     1620

tcttgcgaga tgatcccgca ttttcttgaa agctttgcag aggctagcag aattaccctc     1680

cacgttgatt gtctgcgagg caagaatgat catcaccgta gtgagagtgc gttcaaggct     1740

cttgcggttg ccataagaga agccacctcg cccaatggta ccaacgatgt tccctccacc     1800

aaaggtgttc ttatgtagtg acaccgatta tttaaagctg cagcatacga tatatataca     1860

tgtgtatata tgtataccta tgaatgtcag taagtatgta tacgaacagt atgatactga     1920

agatgacaag gtaatgcatc attctatacg tgtcattctg aacgaggcgc gctttccttt     1980

tttctttttg ctttttcttt ttttttctct tgaactcgac ggatctatgc ggtgtgaaat     2040

accgcacaga tgcgtaagga gaaaataccg catcaggaaa ttgtaaacgt taatattttg     2100

ttaaaattcg cgttaaattt ttgttaaatc agctcatttt ttaaccaata ggccgaaatc     2160

ggcaaaatcc cttataaatc aaaagaatag accgagatag ggttgagtgt tgttccagtt     2220

tggaacaaga gtccactatt aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc     2280

tatcagggcg atggcccact acgtgaacca tcaccctaat caagtttttt ggggtcgagg     2340

tgccgtaaag cactaaatcg gaaccctaaa gggagccccc gatttagagc ttgacgggga     2400

aagccggcga acgtggcgag aaaggaaggg aagaaagcga aaggagcggg cgctagggcg     2460

ctggcaagtg tagcggtcac gctgcgcgta accaccacac ccgccgcgct taatgcgccg     2520

ctacagggcg cgtcgcgcca ttcgccattc aggctgcgca actgttggga agggcgatcg     2580

gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc aaggcgatta     2640

agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc cagtgagcgc     2700

gcgtaatacg actcactata gggcgaattg ggtaccgggc cccccctcga ggtcgacggt     2760

atcgataagc ttgattagaa gccgccgagc gggcgacagc cctccgacgg aagactctcc     2820

tccgtgcgtc ctcgtcttca ccggtcgcgt tcctgaaacg cagatgtgcc tcgcgccgca     2880

ctgctccgaa caataaagat tctacaatac tagcttttat ggttatgaag aggaaaaatt     2940

ggcagtaacc tggccccaca aaccttcaaa ttaacgaatc aaattaacaa ccataggatg     3000

ataatgcgat tagtttttta gccttatttc tggggtaatt aatcagcgaa gcgatgattt     3060

ttgatctatt aacagatata taaatggaaa agctgcataa ccactttaac taatactttc     3120

aacattttca gtttgtatta cttcttattc aaatgtcata aaagtatcaa caaaaaattg     3180

ttaatatacc tctatacttt aacgtcaagg agaaaaatgt ccaatttact gcccgtacac     3240

caaaatttgc ctgcattacc ggtcgatgca acgagtgatg aggttcgcaa gaacctgatg     3300

gacatgttca gggatcgcca ggcgttttct gagcatacct ggaaaatgct tctgtccgtt     3360

tgccggtcgt gggcggcatg gtgcaagttg aataaccgga aatggtttcc cgcagaacct     3420

gaagatgttc gcgattatct tctatatctt caggcgcgcg gtctggcagt aaaaactatc     3480

cagcaacatt tgggccagct aaacatgctt catcgtcggt ccgggctgcc acgaccaagt     3540

gacagcaatg ctgtttcact ggttatgcgg cggatccgaa aagaaaacgt tgatgccggt     3600

gaacgtgcaa aacaggctct agcgttcgaa cgcactgatt tcgaccaggt tcgttcactc     3660

atggaaaata gcgatcgctg ccaggatata cgtaatctgg catttctggg gattgcttat     3720

aacaccctgt tacgtatagc cgaaattgcc aggatcaggg ttaaagatat ctcacgtact     3780

gacggtggga gaatgttaat ccatattggc agaacgaaaa cgctggttag caccgcaggt     3840

gtagagaagg cacttagcct gggggtaact aaactggtcg agcgatggat ttccgtctct     3900

ggtgtagctg atgatccgaa taactacctg ttttgccggg tcagaaaaaa tggtgttgcc     3960

gcgccatctg ccaccagcca gctatcaact cgcgccctgg aagggatttt tgaagcaact     4020

catcgattga tttacggcgc taaggatgac tctggtcaga gatacctggc ctggtctgga     4080

cacagtgccc gtgtcggagc cgcgcgagat atggcccgcg ctggagtttc aataccggag     4140

atcatgcaag ctggtggctg gaccaatgta aatattgtca tgaactatat ccgtaacctg     4200

gatagtgaaa caggggcaat ggtgcgcctg ctggaagatg gcgattagga gtaagcgaat     4260

ttcttatgat ttatgatttt tattattaaa taagttataa aaaaaataag tgtatacaaa     4320

ttttaaagtg actcttaggt tttaaaacga aaattcttat tcttgagtaa ctctttcctg     4380

taggtcaggt tgctttctca ggtatagcat gaggtcgctc ttattgacca cacctctacc     4440

ggcatgccga gcaaatgcct gcaaatcgct ccccatttca cccaattgta gatatgctaa     4500

ctccagcaat gagttgatga atctcggtgt gtattttatg tcctcagagg acaacacctg     4560

tggtgttcta gagcggccgc caccgcggtg gagctccagc ttttgttccc tttagtgagg     4620

gttaattgcg cgcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc     4680

gctcacaatt ccacacaaca taggagccgg aagcataaag tgtaaagcct ggggtgccta     4740

atgagtgagg taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa     4800

cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat     4860

tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg     4920

agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc     4980

aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt     5040

gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag     5100

tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc     5160

cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc     5220

ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt     5280

cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt     5340

atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc     5400

agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa     5460

gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa     5520

gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg     5580

tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga     5640

agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg     5700

gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg     5760

aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt     5820

aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact     5880

ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat     5940

gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg     6000

aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg     6060

ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat     6120

tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc     6180

ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt     6240

cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc     6300

agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga     6360

gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc     6420

gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa     6480

acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta     6540

acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg     6600

agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg     6660

aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat     6720

gagcggat                                                              6728


<210>  94
<211>  9353
<212>  DNA
<213>  Artificial sequence

<220>
<223>  constructed plasmid

<400>  94
ccagcttttg ttccctttag tgagggttaa ttgcgcgctt ggcgtaatca tggtcatagc       60

tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatagga gccggaagca      120

taaagtgtaa agcctggggt gcctaatgag tgaggtaact cacattaatt gcgttgcgct      180

cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac      240

gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc      300

tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt      360

tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg      420

ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg      480

agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat      540

accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta      600

ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct      660

gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc      720

ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa      780

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg      840

taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag      900

tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt      960

gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta     1020

cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc     1080

agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca     1140

cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa     1200

cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat     1260

ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct     1320

taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt     1380

tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat     1440

ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta     1500

atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg     1560

gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt     1620

tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg     1680

cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg     1740

taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc     1800

ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa     1860

ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac     1920

cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt     1980

ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg     2040

gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa     2100

gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata     2160

aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgaacga agcatctgtg     2220

cttcattttg tagaacaaaa atgcaacgcg agagcgctaa tttttcaaac aaagaatctg     2280

agctgcattt ttacagaaca gaaatgcaac gcgaaagcgc tattttacca acgaagaatc     2340

tgtgcttcat ttttgtaaaa caaaaatgca acgcgagagc gctaattttt caaacaaaga     2400

atctgagctg catttttaca gaacagaaat gcaacgcgag agcgctattt taccaacaaa     2460

gaatctatac ttcttttttg ttctacaaaa atgcatcccg agagcgctat ttttctaaca     2520

aagcatctta gattactttt tttctccttt gtgcgctcta taatgcagtc tcttgataac     2580

tttttgcact gtaggtccgt taaggttaga agaaggctac tttggtgtct attttctctt     2640

ccataaaaaa agcctgactc cacttcccgc gtttactgat tactagcgaa gctgcgggtg     2700

cattttttca agataaaggc atccccgatt atattctata ccgatgtgga ttgcgcatac     2760

tttgtgaaca gaaagtgata gcgttgatga ttcttcattg gtcagaaaat tatgaacggt     2820

ttcttctatt ttgtctctat atactacgta taggaaatgt ttacattttc gtattgtttt     2880

cgattcactc tatgaatagt tcttactaca atttttttgt ctaaagagta atactagaga     2940

taaacataaa aaatgtagag gtcgagttta gatgcaagtt caaggagcga aaggtggatg     3000

ggtaggttat atagggatat agcacagaga tatatagcaa agagatactt ttgagcaatg     3060

tttgtggaag cggtattcgc aatattttag tagctcgtta cagtccggtg cgtttttggt     3120

tttttgaaag tgcgtcttca gagcgctttt ggttttcaaa agcgctctga agttcctata     3180

ctttctagag aataggaact tcggaatagg aacttcaaag cgtttccgaa aacgagcgct     3240

tccgaaaatg caacgcgagc tgcgcacata cagctcactg ttcacgtcgc acctatatct     3300

gcgtgttgcc tgtatatata tatacatgag aagaacggca tagtgcgtgt ttatgcttaa     3360

atgcgtactt atatgcgtct atttatgtag gatgaaaggt agtctagtac ctcctgtgat     3420

attatcccat tccatgcggg gtatcgtatg cttccttcag cactaccctt tagctgttct     3480

atatgctgcc actcctcaat tggattagtc tcatccttca atgctatcat ttcctttgat     3540

attggatcat ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca     3600

cgaggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc     3660

tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg     3720

gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca tcagagcaga     3780

ttgtactgag agtgcaccat aaattcccgt tttaagagct tggtgagcgc taggagtcac     3840

tgccaggtat cgtttgaaca cggcattagt cagggaagtc ataacacagt cctttcccgc     3900

aattttcttt ttctattact cttggcctcc tctagtacac tctatatttt tttatgcctc     3960

ggtaatgatt ttcatttttt tttttcccct agcggatgac tctttttttt tcttagcgat     4020

tggcattatc acataatgaa ttatacatta tataaagtaa tgtgatttct tcgaagaata     4080

tactaaaaaa tgagcaggca agataaacga aggcaaagat gacagagcag aaagccctag     4140

taaagcgtat tacaaatgaa accaagattc agattgcgat ctctttaaag ggtggtcccc     4200

tagcgataga gcactcgatc ttcccagaaa aagaggcaga agcagtagca gaacaggcca     4260

cacaatcgca agtgattaac gtccacacag gtatagggtt tctggaccat atgatacatg     4320

ctctggccaa gcattccggc tggtcgctaa tcgttgagtg cattggtgac ttacacatag     4380

acgaccatca caccactgaa gactgcggga ttgctctcgg tcaagctttt aaagaggccc     4440

tactggcgcg tggagtaaaa aggtttggat caggatttgc gcctttggat gaggcacttt     4500

ccagagcggt ggtagatctt tcgaacaggc cgtacgcagt tgtcgaactt ggtttgcaaa     4560

gggagaaagt aggagatctc tcttgcgaga tgatcccgca ttttcttgaa agctttgcag     4620

aggctagcag aattaccctc cacgttgatt gtctgcgagg caagaatgat catcaccgta     4680

gtgagagtgc gttcaaggct cttgcggttg ccataagaga agccacctcg cccaatggta     4740

ccaacgatgt tccctccacc aaaggtgttc ttatgtagtg acaccgatta tttaaagctg     4800

cagcatacga tatatataca tgtgtatata tgtataccta tgaatgtcag taagtatgta     4860

tacgaacagt atgatactga agatgacaag gtaatgcatc attctatacg tgtcattctg     4920

aacgaggcgc gctttccttt tttctttttg ctttttcttt ttttttctct tgaactcgac     4980

ggatctatgc ggtgtgaaat accgcacaga tgcgtaagga gaaaataccg catcaggaaa     5040

ttgtaaacgt taatattttg ttaaaattcg cgttaaattt ttgttaaatc agctcatttt     5100

ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag accgagatag     5160

ggttgagtgt tgttccagtt tggaacaaga gtccactatt aaagaacgtg gactccaacg     5220

tcaaagggcg aaaaaccgtc tatcagggcg atggcccact acgtgaacca tcaccctaat     5280

caagtttttt ggggtcgagg tgccgtaaag cactaaatcg gaaccctaaa gggagccccc     5340

gatttagagc ttgacgggga aagccggcga acgtggcgag aaaggaaggg aagaaagcga     5400

aaggagcggg cgctagggcg ctggcaagtg tagcggtcac gctgcgcgta accaccacac     5460

ccgccgcgct taatgcgccg ctacagggcg cgtcgcgcca ttcgccattc aggctgcgca     5520

actgttggga agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg     5580

gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta     5640

aaacgacggc cagtgagcgc gcgtaatacg actcactata gggcgaattg ggtaccgggc     5700

cccccctcga ggtcgacggt atcgataagc ttgatatcga attcctgcag cccgggggat     5760

ccttttctgg caaccaaacc catacatcgg gattcctata ataccttcgt tggtctccct     5820

aacatgtagg tggcggaggg gagatataca atagaacaga taccagacaa gacataatgg     5880

gctaaacaag actacaccaa ttacactgcc tcattgatgg tggtacataa cgaactaata     5940

ctgtagccct agacttgata gccatcatca tatcgaagtt tcactaccct ttttccattt     6000

gccatctatt gaagtaataa taggcgcatg caacttcttt tctttttttt tcttttctct     6060

ctcccccgtt gttgtctcac catatccgca atgacaaaaa aatgatggaa gacactaaag     6120

gaaaaaatta acgacaaaga cagcaccaac agatgtcgtt gttccagagc tgatgagggg     6180

tatctcgaag cacacgaaac tttttccttc cttcattcac gcacactact ctctaatgag     6240

caacggtata cggccttcct tccagttact tgaatttgaa ataaaaaaaa gtttgctgtc     6300

ttgctatcaa gtataaatag acctgcaatt attaatcttt tgtttcctcg tcattgttct     6360

cgttcccttt cttccttgtt tctttttctg cacaatattt caagctatac caagcataca     6420

atcaactatc tcatatacaa ctagtatggc tgctaaagat gtaaagttcg gtaatgatgc     6480

tagagtaaaa atgttgagag gtgtaaatgt attggctgac gctgtaaaag taactttggg     6540

tccaaaaggt agaaatgttg tcttggataa gtcttttggt gctcctacca taactaaaga     6600

cggtgtttca gtcgcaagag aaatcgaatt ggaggataag ttcgaaaaca tgggtgctca     6660

aatggtcaaa gaagtcgcct ctaaggctaa cgatgctgca ggtgacggta ctacaaccgc     6720

tactgttttg gctcaagcaa ttataacaga aggtttaaaa gcagttgccg ctggtatgaa     6780

tccaatggat ttgaaaagag gtattgacaa ggccgtcact gcagccgtag aagaattgaa     6840

agcattatca gtcccttgtt ctgattcaaa ggccatcgct caagtaggta ccatttccgc     6900

taacagtgat gaaactgttg gtaaattaat tgcagaagcc atggacaaag tcggtaaaga     6960

aggtgtaata accgttgaag atggtactgg tttgcaagat gaattagacg tagttgaggg     7020

tatgcaattt gatagaggtt atttgtcacc atacttcatc aataagcctg aaacaggtgc     7080

tgttgaattg gaatcccctt ttattttgtt ggcagataaa aagattagta acataagaga     7140

aatgttgcca gttttagaag ctgtcgcaaa agccggtaaa cctttgttaa tcattgctga     7200

agatgttgaa ggtgaagcat tggcaacatt agtcgtaaat accatgagag gtattgtaaa     7260

agttgctgca gttaaggctc caggtttcgg tgacagaaga aaagctatgt tgcaagacat     7320

tgcaacatta accggtggta cagttatctc cgaagaaatt ggtatggaat tggaaaaggc     7380

caccttggaa gatttgggtc aagctaagag agttgtcatt aataaggata ctacaaccat     7440

catcgacggt gtaggtgaag aagccgctat acaaggtaga gttgctcaaa taagacaaca     7500

aatcgaagaa gcaacttctg attatgacag agaaaaattg caagaaagag ttgcaaagtt     7560

agccggtggt gtcgctgtaa ttaaagttgg tgcagccacc gaagtcgaaa tgaaggaaaa     7620

gaaagcaaga gtagaagatg ctttgcatgc aacaagagct gcagttgaag aaggtgtagt     7680

tgcaggtggt ggtgtcgcct taattagagt agcctccaaa ttggctgatt tgagaggtca     7740

aaatgaagac caaaacgtag gtatcaaggt tgccttaaga gctatggaag caccattgag     7800

acaaatcgtt ttgaactgtg gtgaagaacc tagtgtcgta gctaacactg ttaaaggtgg     7860

tgacggtaat tatggttaca acgccgctac agaagaatac ggtaacatga tcgatatggg     7920

tatattggac ccaactaagg tcacaagatc tgcattgcaa tacgcagcct cagttgccgg     7980

tttaatgatt actacagaat gcatggttac agatttgcct aaaaacgacg ctgccgactt     8040

gggtgccgca ggtggtatgg gtggtatggg tggtatgggt ggtatgatgt gagcggccgc     8100

acaggcccct tttcctttgt cgatatcatg taattagtta tgtcacgctt acattcacgc     8160

cctcctccca catccgctct aaccgaaaag gaaggagtta gacaacctga agtctaggtc     8220

cctatttatt ttttttaata gttatgttag tattaagaac gttatttata tttcaaattt     8280

ttcttttttt tctgtacaaa cgcgtgtacg catgtaacag gcgcgcctca cttttcgatg     8340

acagccaaaa catctctagc ggacaagacc aagtattctt caccagcgta cttgacttca     8400

gtaccaccgt acttagagta caagacaacg tcaccgacct taacgtccaa tgggactctg     8460

ttacccttat cgtcgattct acctggaccg acagccaaaa cagtaccttc ttgtggcttt     8520

tccttagcgg tgtctgggat aacgatacca gaagcggtag tggtttcagc ttcgttagct     8580

tgaacaacga ttctgtcttc caatggcttg atagcgacct tagtagcggt ggtgactggc     8640

atactgttta aactttgttt gtttatgtgt gtttattcga aactaagttc ttggtgtttt     8700

aaaactaaaa aaaagactaa ctataaaagt agaatttaag aagtttaaga aatagattta     8760

cagaattaca atcaatacct accgtcttta tatacttatt agtcaagtag gggaataatt     8820

tcagggaact ggtttcaacc ttttttttca gctttttcca aatcagagag agcagaaggt     8880

aatagaaggt gtaagaaaat gagatagata catgcgtggg tcaattgcct tgtgtcatca     8940

tttactccag gcaggttgca tcactccatt gaggttgtgc ccgttttttg cctgtttgtg     9000

cccctgttct ctgtagttgc gctaagagaa tggacctatg aactgatggt tggtgaagaa     9060

aacaatattt tggtgctggg attctttttt tttctggatg ccagcttaaa aagcgggctc     9120

cattatattt agtggatgcc aggaataaac tgttcaccca gacacctacg atgttatata     9180

ttctgtgtaa cccgccccct attttgggca tgtacgggtt acagcagaat taaaaggcta     9240

attttttgac taaataaagt taggaaaatc actactatta attatttacg tattctttga     9300

aatggcagta ttgataatga taaactcgaa ctagatctat ccgcggtgga gct            9353


<210>  95
<211>  9353
<212>  DNA
<213>  Artificial sequence

<220>
<223>  constructed plasmid

<400>  95
ccagcttttg ttccctttag tgagggttaa ttgcgcgctt ggcgtaatca tggtcatagc       60

tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatagga gccggaagca      120

taaagtgtaa agcctggggt gcctaatgag tgaggtaact cacattaatt gcgttgcgct      180

cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac      240

gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc      300

tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt      360

tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg      420

ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg      480

agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat      540

accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta      600

ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct      660

gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc      720

ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa      780

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg      840

taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag      900

tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt      960

gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta     1020

cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc     1080

agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca     1140

cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa     1200

cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat     1260

ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct     1320

taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt     1380

tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat     1440

ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta     1500

atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg     1560

gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt     1620

tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg     1680

cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg     1740

taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc     1800

ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa     1860

ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac     1920

cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt     1980

ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg     2040

gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa     2100

gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata     2160

aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgaacga agcatctgtg     2220

cttcattttg tagaacaaaa atgcaacgcg agagcgctaa tttttcaaac aaagaatctg     2280

agctgcattt ttacagaaca gaaatgcaac gcgaaagcgc tattttacca acgaagaatc     2340

tgtgcttcat ttttgtaaaa caaaaatgca acgcgagagc gctaattttt caaacaaaga     2400

atctgagctg catttttaca gaacagaaat gcaacgcgag agcgctattt taccaacaaa     2460

gaatctatac ttcttttttg ttctacaaaa atgcatcccg agagcgctat ttttctaaca     2520

aagcatctta gattactttt tttctccttt gtgcgctcta taatgcagtc tcttgataac     2580

tttttgcact gtaggtccgt taaggttaga agaaggctac tttggtgtct attttctctt     2640

ccataaaaaa agcctgactc cacttcccgc gtttactgat tactagcgaa gctgcgggtg     2700

cattttttca agataaaggc atccccgatt atattctata ccgatgtgga ttgcgcatac     2760

tttgtgaaca gaaagtgata gcgttgatga ttcttcattg gtcagaaaat tatgaacggt     2820

ttcttctatt ttgtctctat atactacgta taggaaatgt ttacattttc gtattgtttt     2880

cgattcactc tatgaatagt tcttactaca atttttttgt ctaaagagta atactagaga     2940

taaacataaa aaatgtagag gtcgagttta gatgcaagtt caaggagcga aaggtggatg     3000

ggtaggttat atagggatat agcacagaga tatatagcaa agagatactt ttgagcaatg     3060

tttgtggaag cggtattcgc aatattttag tagctcgtta cagtccggtg cgtttttggt     3120

tttttgaaag tgcgtcttca gagcgctttt ggttttcaaa agcgctctga agttcctata     3180

ctttctagag aataggaact tcggaatagg aacttcaaag cgtttccgaa aacgagcgct     3240

tccgaaaatg caacgcgagc tgcgcacata cagctcactg ttcacgtcgc acctatatct     3300

gcgtgttgcc tgtatatata tatacatgag aagaacggca tagtgcgtgt ttatgcttaa     3360

atgcgtactt atatgcgtct atttatgtag gatgaaaggt agtctagtac ctcctgtgat     3420

attatcccat tccatgcggg gtatcgtatg cttccttcag cactaccctt tagctgttct     3480

atatgctgcc actcctcaat tggattagtc tcatccttca atgctatcat ttcctttgat     3540

attggatcat ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca     3600

cgaggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc     3660

tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg     3720

gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca tcagagcaga     3780

ttgtactgag agtgcaccat aaattcccgt tttaagagct tggtgagcgc taggagtcac     3840

tgccaggtat cgtttgaaca cggcattagt cagggaagtc ataacacagt cctttcccgc     3900

aattttcttt ttctattact cttggcctcc tctagtacac tctatatttt tttatgcctc     3960

ggtaatgatt ttcatttttt tttttcccct agcggatgac tctttttttt tcttagcgat     4020

tggcattatc acataatgaa ttatacatta tataaagtaa tgtgatttct tcgaagaata     4080

tactaaaaaa tgagcaggca agataaacga aggcaaagat gacagagcag aaagccctag     4140

taaagcgtat tacaaatgaa accaagattc agattgcgat ctctttaaag ggtggtcccc     4200

tagcgataga gcactcgatc ttcccagaaa aagaggcaga agcagtagca gaacaggcca     4260

cacaatcgca agtgattaac gtccacacag gtatagggtt tctggaccat atgatacatg     4320

ctctggccaa gcattccggc tggtcgctaa tcgttgagtg cattggtgac ttacacatag     4380

acgaccatca caccactgaa gactgcggga ttgctctcgg tcaagctttt aaagaggccc     4440

tactggcgcg tggagtaaaa aggtttggat caggatttgc gcctttggat gaggcacttt     4500

ccagagcggt ggtagatctt tcgaacaggc cgtacgcagt tgtcgaactt ggtttgcaaa     4560

gggagaaagt aggagatctc tcttgcgaga tgatcccgca ttttcttgaa agctttgcag     4620

aggctagcag aattaccctc cacgttgatt gtctgcgagg caagaatgat catcaccgta     4680

gtgagagtgc gttcaaggct cttgcggttg ccataagaga agccacctcg cccaatggta     4740

ccaacgatgt tccctccacc aaaggtgttc ttatgtagtg acaccgatta tttaaagctg     4800

cagcatacga tatatataca tgtgtatata tgtataccta tgaatgtcag taagtatgta     4860

tacgaacagt atgatactga agatgacaag gtaatgcatc attctatacg tgtcattctg     4920

aacgaggcgc gctttccttt tttctttttg ctttttcttt ttttttctct tgaactcgac     4980

ggatctatgc ggtgtgaaat accgcacaga tgcgtaagga gaaaataccg catcaggaaa     5040

ttgtaaacgt taatattttg ttaaaattcg cgttaaattt ttgttaaatc agctcatttt     5100

ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag accgagatag     5160

ggttgagtgt tgttccagtt tggaacaaga gtccactatt aaagaacgtg gactccaacg     5220

tcaaagggcg aaaaaccgtc tatcagggcg atggcccact acgtgaacca tcaccctaat     5280

caagtttttt ggggtcgagg tgccgtaaag cactaaatcg gaaccctaaa gggagccccc     5340

gatttagagc ttgacgggga aagccggcga acgtggcgag aaaggaaggg aagaaagcga     5400

aaggagcggg cgctagggcg ctggcaagtg tagcggtcac gctgcgcgta accaccacac     5460

ccgccgcgct taatgcgccg ctacagggcg cgtcgcgcca ttcgccattc aggctgcgca     5520

actgttggga agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg     5580

gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta     5640

aaacgacggc cagtgagcgc gcgtaatacg actcactata gggcgaattg ggtaccgggc     5700

cccccctcga ggtcgacggt atcgataagc ttgatatcga attcctgcag cccgggggat     5760

ccttttctgg caaccaaacc catacatcgg gattcctata ataccttcgt tggtctccct     5820

aacatgtagg tggcggaggg gagatataca atagaacaga taccagacaa gacataatgg     5880

gctaaacaag actacaccaa ttacactgcc tcattgatgg tggtacataa cgaactaata     5940

ctgtagccct agacttgata gccatcatca tatcgaagtt tcactaccct ttttccattt     6000

gccatctatt gaagtaataa taggcgcatg caacttcttt tctttttttt tcttttctct     6060

ctcccccgtt gttgtctcac catatccgca atgacaaaaa aatgatggaa gacactaaag     6120

gaaaaaatta acgacaaaga cagcaccaac agatgtcgtt gttccagagc tgatgagggg     6180

tatctcgaag cacacgaaac tttttccttc cttcattcac gcacactact ctctaatgag     6240

caacggtata cggccttcct tccagttact tgaatttgaa ataaaaaaaa gtttgctgtc     6300

ttgctatcaa gtataaatag acctgcaatt attaatcttt tgtttcctcg tcattgttct     6360

cgttcccttt cttccttgtt tctttttctg cacaatattt caagctatac caagcataca     6420

atcaactatc tcatatacaa ctagtatggc taagatcatc gctttcgacg aagaagctag     6480

aagaggtttg gaaagaggta tgaaccaatt ggctgacgct gttaaggtca ctttgggtcc     6540

aaagggtaga aacgttgtct tggaaaagaa gtggggtgct ccaactatca ccaacgatgg     6600

tgtctctatc gctaaggaaa tcgaattgga agactcctac gaaaagatcg gtgctgaatt     6660

ggtcaaggaa gttgctaaga agactgacga tgtcgctggt gacggtacta ctaccgctac     6720

cgtcttggct caagctttgg ttagagaagg tttgagaaac gttgctgctg gtgctaaccc     6780

aatggctttg aagagaggta tcgaagctgc tgtcgcttct gtttccgaag gtttgcaaca     6840

attggctaag gacgttgaaa ctaaggaaca aatcgcttct accgcttcta tctctgctgg     6900

tgactccact gtcggtgaaa tcatcgctga agctatggac aaggttggta aagaaggtgt     6960

catcactgtt gaagaatcta acaccttcgg tttggaattg gaattgactg aaggtatgag     7020

attcgataag ggttacatct ccgcttactt catgaccgac gctgaaagaa tggaagctgt     7080

cttcgacgat ccatacatct tgatcgctaa ctctaagatc tccgctgtca aggacttgtt     7140

gccaatcttg gaaaaggtta tgcaatctgg taaaccattg gtcatcatcg ctgaagacgt     7200

tgaaggtgaa gctttggcta ctttggttgt caacaaggtt agaggtactt tcaagtctgt     7260

cgctgttaag gctccaggtt tcggtgacag aagaaaggct atgttggaag acatcgctat     7320

cttgactggt ggtgctgtca tctctgaaga agttggtttg aagttggatg ctgctgactt     7380

gtccttgttg ggtcaagcta gaaaggttgt catcaccaag gatgaaacta ccgttgttga     7440

cggtgctggt aacggtgaac aaatccaagg tagagttaac caaatcagag ctgaaatcga     7500

aagatctgac tccgattacg acagagaaaa gttgcaagaa agattggcta agttggctgg     7560

tggtgtcgct gttatcaagg tcggtgctgc taccgaagtt gaattgaagg aaagaaagca     7620

cagaatcgaa gacgctgtca gaaacgctaa ggctgctgtc gaagaaggta tcgttccagg     7680

tggtggtgtc gctttggttc aagctggtaa aactgctttc gataagttgg acttggttgg     7740

tgacgaagct accggtgcta acatcgtcaa ggttgctttg gacgctccat tgagacaaat     7800

cgctgtcaac gctggtttgg aaggtggtgt tgtcgttgaa aaggttagaa acttgtctgc     7860

tggtcacggt ttgaacgctg ctactggtga atacgtcgat ttgttggctg ctggtatcat     7920

cgacccagct aaggttacca gatctgcttt gcaaaacgct gcttccatcg ctgctttgtt     7980

cttgactacc gaagctgtcg ttgctgacaa gccagaaaag aacccagctc cagctggtgc     8040

tccaggtggt ggtgacatgg acttctgagc ggccgcacag gccccttttc ctttgtcgat     8100

atcatgtaat tagttatgtc acgcttacat tcacgccctc ctcccacatc cgctctaacc     8160

gaaaaggaag gagttagaca acctgaagtc taggtcccta tttatttttt ttaatagtta     8220

tgttagtatt aagaacgtta tttatatttc aaatttttct tttttttctg tacaaacgcg     8280

tgtacgcatg taacaggcgc gcctcacaag tacaaaccag taccatcgga ttcaactctg     8340

ttagcagcaa cagcgtgaac gtctctttct ctcaacaaaa cgtattcttt accgtgcaat     8400

tcgacttcag atctatcgtc tggatcgaac aaaactctgt caccgacaac gatggatctg     8460

acgtttggac caacaccgac agcaacagcc caagacaatc ttctaccgat agtagcggta     8520

gctgggatga cgataccagc ggaagatctt ctttcacctt caccaccatc ttgtctgacc     8580

aaaactctat cgtgcaacat tctgattggc aaaccagcat cggttctagt atcagcggac     8640

atactgttta aactttgttt gtttatgtgt gtttattcga aactaagttc ttggtgtttt     8700

aaaactaaaa aaaagactaa ctataaaagt agaatttaag aagtttaaga aatagattta     8760

cagaattaca atcaatacct accgtcttta tatacttatt agtcaagtag gggaataatt     8820

tcagggaact ggtttcaacc ttttttttca gctttttcca aatcagagag agcagaaggt     8880

aatagaaggt gtaagaaaat gagatagata catgcgtggg tcaattgcct tgtgtcatca     8940

tttactccag gcaggttgca tcactccatt gaggttgtgc ccgttttttg cctgtttgtg     9000

cccctgttct ctgtagttgc gctaagagaa tggacctatg aactgatggt tggtgaagaa     9060

aacaatattt tggtgctggg attctttttt tttctggatg ccagcttaaa aagcgggctc     9120

cattatattt agtggatgcc aggaataaac tgttcaccca gacacctacg atgttatata     9180

ttctgtgtaa cccgccccct attttgggca tgtacgggtt acagcagaat taaaaggcta     9240

attttttgac taaataaagt taggaaaatc actactatta attatttacg tattctttga     9300

aatggcagta ttgataatga taaactcgaa ctagatctat ccgcggtgga gct            9353


<210>  96
<211>  439
<212>  PRT
<213>  Ruminococcus flavefaciens

<400>  96

Met Glu Phe Phe Lys Asn Ile Ser Lys Ile Pro Tyr Glu Gly Lys Asp 
1               5                   10                  15      


Ser Thr Asn Pro Leu Ala Phe Lys Tyr Tyr Asn Pro Asp Glu Val Ile 
            20                  25                  30          


Asp Gly Lys Lys Met Arg Asp Ile Met Lys Phe Ala Leu Ser Trp Trp 
        35                  40                  45              


His Thr Met Gly Gly Asp Gly Thr Asp Met Phe Gly Cys Gly Thr Ala 
    50                  55                  60                  


Asp Lys Thr Trp Gly Glu Asn Asp Pro Ala Ala Arg Ala Lys Ala Lys 
65                  70                  75                  80  


Val Asp Ala Ala Phe Glu Ile Met Gln Lys Leu Ser Ile Asp Tyr Phe 
                85                  90                  95      


Cys Phe His Asp Arg Asp Leu Ser Pro Glu Tyr Gly Ser Leu Lys Asp 
            100                 105                 110         


Thr Asn Ala Gln Leu Asp Ile Val Thr Asp Tyr Ile Lys Ala Lys Gln 
        115                 120                 125             


Ala Glu Thr Gly Leu Lys Cys Leu Trp Gly Thr Ala Lys Cys Phe Asp 
    130                 135                 140                 


His Pro Arg Phe Met His Gly Ala Gly Thr Ser Pro Ser Ala Asp Val 
145                 150                 155                 160 


Phe Ala Phe Ser Ala Ala Gln Ile Lys Lys Ala Leu Glu Ser Thr Val 
                165                 170                 175     


Lys Leu Gly Gly Thr Gly Tyr Val Phe Trp Gly Gly Arg Glu Gly Tyr 
            180                 185                 190         


Glu Thr Leu Leu Asn Thr Asn Met Gly Leu Glu Leu Asp Asn Met Ala 
        195                 200                 205             


Arg Leu Met Lys Met Ala Val Glu Tyr Gly Arg Ser Ile Gly Phe Lys 
    210                 215                 220                 


Gly Asp Phe Tyr Ile Glu Pro Lys Pro Lys Glu Pro Thr Lys His Gln 
225                 230                 235                 240 


Tyr Asp Phe Asp Thr Ala Thr Val Leu Gly Phe Leu Arg Lys Tyr Gly 
                245                 250                 255     


Leu Asp Lys Asp Phe Lys Met Asn Ile Glu Ala Asn His Ala Thr Leu 
            260                 265                 270         


Ala Gln His Thr Phe Gln His Glu Leu Cys Val Ala Arg Thr Asn Gly 
        275                 280                 285             


Ala Phe Gly Ser Ile Asp Ala Asn Gln Gly Asp Pro Leu Leu Gly Trp 
    290                 295                 300                 


Asp Thr Asp Gln Phe Pro Thr Asn Ile Tyr Asp Thr Thr Met Cys Met 
305                 310                 315                 320 


Tyr Glu Val Ile Lys Ala Gly Gly Phe Thr Asn Gly Gly Leu Asn Phe 
                325                 330                 335     


Asp Ala Lys Ala Arg Arg Gly Ser Phe Thr Pro Glu Asp Ile Phe Tyr 
            340                 345                 350         


Ser Tyr Ile Ala Gly Met Asp Ala Phe Ala Leu Gly Tyr Lys Ala Ala 
        355                 360                 365             


Ser Lys Leu Ile Ala Asp Gly Arg Ile Asp Ser Phe Ile Ser Asp Arg 
    370                 375                 380                 


Tyr Ala Ser Trp Ser Glu Gly Ile Gly Leu Asp Ile Ile Ser Gly Lys 
385                 390                 395                 400 


Ala Asp Met Ala Ala Leu Glu Lys Tyr Ala Leu Glu Lys Gly Glu Val 
                405                 410                 415     


Thr Asp Ser Ile Ser Ser Gly Arg Gln Glu Leu Leu Glu Ser Ile Val 
            420                 425                 430         


Asn Asn Val Ile Phe Asn Leu 
        435                 


<210>  97
<211>  441
<212>  PRT
<213>  Ruminococcus champanellensis

<400>  97

Met Ser Glu Phe Phe Thr Gly Ile Ser Lys Ile Pro Phe Glu Gly Lys 
1               5                   10                  15      


Ala Ser Asn Asn Pro Met Ala Phe Lys Tyr Tyr Asn Pro Asp Glu Val 
            20                  25                  30          


Val Gly Gly Lys Thr Met Arg Glu Gln Leu Lys Phe Ala Leu Ser Trp 
        35                  40                  45              


Trp His Thr Met Gly Gly Asp Gly Thr Asp Met Phe Gly Val Gly Thr 
    50                  55                  60                  


Thr Asn Lys Lys Phe Gly Gly Thr Asp Pro Met Asp Ile Ala Lys Arg 
65                  70                  75                  80  


Lys Val Asn Ala Ala Phe Glu Leu Met Asp Lys Leu Ser Ile Asp Tyr 
                85                  90                  95      


Phe Cys Phe His Asp Arg Asp Leu Ala Pro Glu Ala Asp Asn Leu Lys 
            100                 105                 110         


Glu Thr Asn Gln Arg Leu Asp Glu Ile Thr Glu Tyr Ile Ala Gln Met 
        115                 120                 125             


Met Gln Leu Asn Pro Asp Lys Lys Val Leu Trp Gly Thr Ala Asn Cys 
    130                 135                 140                 


Phe Gly Asn Pro Arg Tyr Met His Gly Ala Gly Thr Ala Pro Asn Ala 
145                 150                 155                 160 


Asp Val Phe Ala Phe Ala Ala Ala Gln Ile Lys Lys Ala Ile Glu Ile 
                165                 170                 175     


Thr Val Lys Leu Gly Gly Lys Gly Tyr Val Phe Trp Gly Gly Arg Glu 
            180                 185                 190         


Gly Tyr Glu Thr Leu Leu Asn Thr Asn Met Gly Leu Glu Leu Asp Asn 
        195                 200                 205             


Met Ala Arg Leu Leu His Met Ala Val Asp Tyr Ala Arg Ser Ile Gly 
    210                 215                 220                 


Phe Thr Gly Asp Phe Tyr Ile Glu Pro Lys Pro Lys Glu Pro Thr Lys 
225                 230                 235                 240 


His Gln Tyr Asp Phe Asp Thr Ala Thr Val Ile Gly Phe Leu Arg Lys 
                245                 250                 255     


Tyr Asn Leu Asp Lys Asp Phe Lys Met Asn Ile Glu Ala Asn His Ala 
            260                 265                 270         


Thr Leu Ala Gln His Thr Phe Gln His Glu Leu Arg Val Ala Arg Glu 
        275                 280                 285             


Asn Gly Phe Phe Gly Ser Ile Asp Ala Asn Gln Gly Asp Thr Leu Leu 
    290                 295                 300                 


Gly Trp Asp Thr Asp Gln Phe Pro Thr Asn Thr Tyr Asp Ala Ala Leu 
305                 310                 315                 320 


Cys Met Tyr Glu Val Leu Lys Ala Gly Gly Phe Thr Asn Gly Gly Leu 
                325                 330                 335     


Asn Phe Asp Ser Lys Ala Arg Arg Gly Ser Phe Glu Met Glu Asp Ile 
            340                 345                 350         


Phe His Ser Tyr Ile Ala Gly Met Asp Thr Phe Ala Leu Gly Leu Lys 
        355                 360                 365             


Ile Ala Gln Lys Met Ile Asp Asp Gly Arg Ile Asp Gln Phe Val Ala 
    370                 375                 380                 


Asp Arg Tyr Ala Ser Trp Asn Thr Gly Ile Gly Ala Asp Ile Ile Ser 
385                 390                 395                 400 


Gly Lys Ala Thr Met Ala Asp Leu Glu Ala Tyr Ala Leu Ser Lys Gly 
                405                 410                 415     


Asp Val Thr Ala Ser Leu Lys Ser Gly Arg Gln Glu Leu Leu Glu Ser 
            420                 425                 430         


Ile Leu Asn Asn Ile Met Phe Asn Leu 
        435                 440     


<210>  98
<211>  439
<212>  PRT
<213>  Unknown

<220>
<223>  uncultured bacteria from cow rumen

<400>  98

Met Gly Glu Ile Phe Ser Asn Ile Pro Val Ile Lys Tyr Glu Gly Pro 
1               5                   10                  15      


Asp Ser Lys Asn Pro Leu Ala Phe Lys Tyr Tyr Asp Pro Glu Arg Val 
            20                  25                  30          


Ile Leu Gly Lys Lys Met Lys Glu His Leu Pro Phe Ala Met Ala Trp 
        35                  40                  45              


Trp His Asn Leu Cys Ala Asn Gly Val Asp Met Phe Gly Arg Gly Thr 
    50                  55                  60                  


Ile Asp Lys Leu Phe Gly Ala Ala Glu Ala Gly Thr Met Glu His Ala 
65                  70                  75                  80  


Lys Ala Lys Val Asp Ala Gly Ile Glu Phe Met Gln Lys Leu Gly Ile 
                85                  90                  95      


Glu Tyr Tyr Cys Phe His Asp Val Asp Leu Val Pro Glu Ala Asp Asp 
            100                 105                 110         


Ile Asn Glu Thr Asn Arg Arg Leu Asp Glu Leu Thr Asp Tyr Leu Lys 
        115                 120                 125             


Glu Lys Thr Ala Gly Thr Asn Ile Lys Cys Leu Trp Gly Thr Ala Asn 
    130                 135                 140                 


Met Phe Ser Asn Pro Arg Phe Met Asn Gly Ala Gly Ser Thr Asn Asp 
145                 150                 155                 160 


Val Asp Val Tyr Cys Phe Ala Ala Ala Gln Val Lys Lys Ala Ile Glu 
                165                 170                 175     


Met Thr Val Lys Leu Gly Gly Arg Gly Tyr Val Phe Trp Gly Gly Arg 
            180                 185                 190         


Glu Gly Tyr Glu Thr Leu Leu Asn Thr Lys Val Gln Met Glu Leu Glu 
        195                 200                 205             


Asn Ile Ala Asn Leu Met Lys Met Ala Arg Asp Tyr Gly Arg Ser Ile 
    210                 215                 220                 


Gly Phe Lys Gly Thr Phe Leu Ile Glu Pro Lys Pro Lys Glu Pro Met 
225                 230                 235                 240 


Lys His Gln Tyr Asp Tyr Asp Ala Ala Thr Ala Ile Gly Phe Leu Arg 
                245                 250                 255     


Gln Tyr Gly Leu Asp Gln Asp Phe Lys Met Asn Ile Glu Ala Asn His 
            260                 265                 270         


Ala Thr Leu Ala Gly His Thr Phe Gln His Glu Leu Arg Ile Ser Arg 
        275                 280                 285             


Ile Asn Gly Met Leu Gly Ser Ile Asp Ala Asn Gln Gly Asp Ile Met 
    290                 295                 300                 


Leu Gly Trp Asp Thr Asp Cys Phe Pro Ser Asn Val Tyr Asp Thr Thr 
305                 310                 315                 320 


Leu Ala Met Tyr Glu Ile Val Arg Asn Gly Gly Leu Pro Val Gly Ile 
                325                 330                 335     


Asn Phe Asp Ser Lys Asn Arg Arg Pro Ser Asn Thr Tyr Glu Asp Met 
            340                 345                 350         


Phe His Ala Phe Ile Leu Gly Met Asp Ser Phe Ala Phe Gly Leu Ile 
        355                 360                 365             


Lys Ala Ala Gln Ile Ile Glu Asp Gly Arg Ile Glu Gly Phe Thr Glu 
    370                 375                 380                 


Lys Lys Tyr Glu Ser Phe Asn Thr Glu Leu Gly Gln Lys Ile Arg Lys 
385                 390                 395                 400 


Gly Glu Ala Thr Leu Glu Glu Leu Ala Ala His Ala Ala Asp Leu Lys 
                405                 410                 415     


Ala Pro Lys Val Pro Val Ser Gly Arg Gln Glu Tyr Leu Glu Gly Val 
            420                 425                 430         


Leu Asn Asn Ile Ile Leu Ser 
        435                 


<210>  99
<211>  1317
<212>  DNA
<213>  artificial sequence

<220>
<223>  coding region for Ru2 optimized for expression in Saccharomyces 
       cerevisiae

<400>  99
atgggtgaaa tcttctctaa catcccagtc atcaagtacg aaggtccaga ctctaagaac       60

ccattggctt tcaagtacta cgatccagaa agagtcatct tgggtaaaaa gatgaaggaa      120

cacttgccat tcgctatggc ttggtggcac aacttgtgtg ctaacggtgt tgacatgttc      180

ggtagaggta ctatcgataa gttgttcggt gctgctgaag ctggtactat ggaacacgct      240

aaggctaagg ttgacgctgg tatcgagttc atgcaaaagt tgggtatcga atactactgt      300

ttccacgacg ttgatttggt cccagaagct gacgatatca acgaaaccaa cagaagattg      360

gacgaattga ctgattactt gaaggaaaag accgctggta ctaacatcaa gtgtttgtgg      420

ggtactgcta acatgttctc taacccaaga ttcatgaacg gtgctggttc cactaacgac      480

gttgatgtct actgtttcgc tgctgctcaa gttaagaagg ctatcgaaat gaccgtcaag      540

ttgggtggta gaggttacgt tttctggggt ggtagagaag gttacgaaac cttgttgaac      600

actaaggtcc aaatggaatt ggaaaacatc gctaacttga tgaagatggc tagagactac      660

ggtagatcta tcggtttcaa gggtactttc ttgatcgaac caaagccaaa ggaaccaatg      720

aagcaccaat acgactacga tgctgctact gctatcggtt tcttgagaca atacggtttg      780

gaccaagatt tcaagatgaa catcgaagct aaccacgcta ccttggctgg tcacactttc      840

caacacgaat tgagaatctc tagaatcaac ggtatgttgg gttccatcga cgctaaccaa      900

ggtgacatca tgttgggttg ggacaccgat tgtttcccat ctaacgttta cgacaccact      960

ttggctatgt acgaaatcgt tagaaacggt ggtttgccag tcggtatcaa cttcgactct     1020

aagaacagaa gaccatccaa cacttacgaa gacatgttcc acgctttcat cttgggtatg     1080

gactctttcg ctttcggttt gatcaaggct gctcaaatca tcgaagacgg tagaatcgaa     1140

ggtttcaccg aaaagaagta cgaatccttc aacactgaat tgggtcaaaa gatcagaaag     1200

ggtgaagcta ctttggaaga attggctgct cacgctgctg acttgaaggc tccaaaggtt     1260

ccagtctctg gtagacaaga atacttggaa ggtgttttga acaacatcat cttgtcc        1317


<210>  100
<211>  395
<212>  PRT
<213>  Unknown

<220>
<223>  uncultured bacteria from cow rumen

<400>  100

Met Ala Trp Trp His Asn Met Cys Ala Asn Gly Lys Asp Met Phe Gly 
1               5                   10                  15      


Thr Gly Thr Ala Asp Lys Ser Phe Gly Ala Glu Pro Gly Thr Met Glu 
            20                  25                  30          


His Ala Lys Ala Lys Val Asp Ala Ala Ile Glu Phe Met Gln Lys Leu 
        35                  40                  45              


Gly Ile Glu Tyr Tyr Cys Phe His Asp Val Asp Leu Val Pro Glu Asp 
    50                  55                  60                  


Glu Asp Asp Ile Asn Val Thr Asn Ala Arg Leu Asp Glu Ile Ser Asp 
65                  70                  75                  80  


Tyr Ile Leu Glu Lys Thr Lys Gly Thr Asn Ile Arg Cys Leu Trp Gly 
                85                  90                  95      


Thr Ala Asn Met Phe Asn Asn Pro Arg Phe Met Asn Gly Ala Gly Ser 
            100                 105                 110         


Thr Asn Ser Ala Asp Val Tyr Cys Phe Ala Ala Ala Gln Ile Lys Lys 
        115                 120                 125             


Ala Leu Asp Ile Thr Val Lys Leu Gly Gly Arg Gly Tyr Val Phe Trp 
    130                 135                 140                 


Gly Gly Arg Glu Gly Tyr Glu Thr Leu Leu Asn Thr Asp Val Lys Leu 
145                 150                 155                 160 


Glu Gln Glu Asn Ile Ala Asn Leu Met His Met Ala Val Glu Tyr Gly 
                165                 170                 175     


Arg Ser Ile Gly Phe Lys Gly Asp Phe Leu Ile Glu Pro Lys Pro Lys 
            180                 185                 190         


Glu Pro Met Lys His Gln Tyr Asp Phe Asp Ala Ala Thr Ala Ile Gly 
        195                 200                 205             


Phe Leu Arg Gln Tyr Gly Leu Asp Lys Asp Phe Lys Leu Asn Ile Glu 
    210                 215                 220                 


Ala Asn His Ala Thr Leu Ala Gly His Thr Phe Gln His Glu Leu Arg 
225                 230                 235                 240 


Ile Ser Ala Met Asn Gly Met Leu Gly Ser Ile Asp Ala Asn Gln Gly 
                245                 250                 255     


Asp Met Leu Leu Gly Trp Asp Thr Asp Glu Phe Pro Phe Asn Val Tyr 
            260                 265                 270         


Asp Thr Thr Leu Ala Met Tyr Glu Val Leu Lys Ala Gly Gly Ile Asn 
        275                 280                 285             


Gly Gly Phe Asn Phe Asp Ser Lys Asn Arg Arg Pro Ser Asn Thr Tyr 
    290                 295                 300                 


Glu Asp Met Phe Tyr Gly Tyr Ile Leu Gly Met Asp Ser Phe Ala Leu 
305                 310                 315                 320 


Gly Leu Ile Lys Ala Ala Ala Ile Ile Glu Asp Gly Arg Ile Glu Lys 
                325                 330                 335     


Gln Leu Ala Asp Arg Tyr Ser Ser Tyr Ser Asn Thr Glu Ile Gly Lys 
            340                 345                 350         


Lys Ile Arg Asn His Thr Ala Thr Leu Lys Glu Leu Ala Glu Tyr Ala 
        355                 360                 365             


Ala Thr Leu Lys Lys Pro Gly Asp Pro Gly Ser Gly Arg Gln Glu Leu 
    370                 375                 380                 


Leu Glu Gln Ile Met Asn Glu Val Met Phe Gly 
385                 390                 395 


<210>  101
<211>  1185
<212>  DNA
<213>  artificial sequence

<220>
<223>  coding region for Ru3 optimized for expression in Saccharomyces 
       cerevisiae

<400>  101
atggcttggt ggcacaacat gtgtgctaac ggcaaggata tgttcggtac tggtactgct       60

gataagtctt tcggtgctga accaggcacc atggaacacg ctaaggctaa ggttgacgct      120

gctatcgagt tcatgcaaaa gttgggtatc gaatactact gtttccacga cgttgatttg      180

gtcccagaag acgaagacga tatcaacgtc actaacgcta gattggacga aatctctgat      240

tacatcttgg aaaagaccaa gggtactaac atcagatgtt tgtggggtac tgctaacatg      300

ttcaacaacc caagattcat gaacggtgct ggttctacta actccgctga cgtttactgt      360

ttcgctgctg ctcaaatcaa gaaggctttg gacatcaccg ttaagttggg tggtagaggt      420

tacgtcttct ggggtggtag agaaggttac gaaaccttgt tgaacactga cgttaagttg      480

gaacaagaaa acatcgctaa cttgatgcac atggctgtcg aatacggtag atctatcggt      540

ttcaagggtg acttcttgat cgaaccaaag ccaaaggaac caatgaagca ccaatacgac      600

ttcgatgctg ctactgctat cggtttcttg agacaatacg gtttggacaa ggatttcaag      660

ttgaacatcg aagctaacca cgctaccttg gctggtcaca ctttccaaca cgaattgaga      720

atctctgcta tgaacggtat gttgggttcc atcgacgcta accaaggtga catgttgttg      780

ggttgggaca ccgatgaatt tccattcaac gtttacgaca ccactttggc tatgtacgaa      840

gtcttgaagg ctggtggtat caacggtggt ttcaacttcg actctaagaa cagaagacca      900

tccaacactt acgaagacat gttctacggt tacatcttgg gtatggattc tttcgctttg      960

ggtttgatca aggctgctgc tatcatcgaa gacggtagaa tcgaaaagca attggctgat     1020

agatactctt cctactccaa caccgaaatc ggtaaaaaga tcagaaacca caccgctact     1080

ttgaaggaat tggctgaata cgctgctact ttgaagaagc caggtgaccc aggttccggt     1140

agacaagaat tgttggaaca aatcatgaac gaagttatgt tcggt                     1185


