                         SEQUENCE LISTING

<110>  Chen, Zhongqiang
       Kelly, Kristen  
       Ye, Rick Weizhang
 
<120>  COW RUMEN XYLOSE ISOMERASES ACTIVE IN YEAST CELLS

<130>  CL5968

<160>  20    

<170>  PatentIn version 3.5

<210>  1
<211>  438
<212>  PRT
<213>  Unknown

<220>
<223>  uncultured bacteria from cow rumen

<400>  1

Met Ser Glu Ile Phe Ala Asn Ile Pro Val Ile Pro Tyr Glu Gly Pro 
1               5                   10                  15      


Gln Ser Lys Asn Pro Leu Ala Phe Lys Phe Tyr Asp Ala Asp Lys Val 
            20                  25                  30          


Ile Leu Gly Lys Lys Met Ser Glu His Leu Pro Phe Ala Met Ala Trp 
        35                  40                  45              


Trp His Asn Leu Cys Ala Gly Gly Thr Asp Met Phe Gly Arg Asp Thr 
    50                  55                  60                  


Ala Asp Lys Ser Phe Gly Ala Glu Lys Gly Thr Met Ala His Ala Arg 
65                  70                  75                  80  


Ala Lys Val Asp Ala Gly Phe Glu Phe Met Lys Lys Val Gly Val Lys 
                85                  90                  95      


Tyr Phe Cys Phe His Asp Val Asp Leu Val Pro Glu Ala Asp Asp Ile 
            100                 105                 110         


Lys Glu Thr Asn Arg Arg Leu Asp Glu Ile Ser Asp Tyr Ile Leu Glu 
        115                 120                 125             


Lys Met Lys Gly Thr Asp Ile Lys Cys Leu Trp Gly Thr Ala Asn Met 
    130                 135                 140                 


Phe Gly Asn Pro Arg Tyr Met Asn Gly Ala Gly Ser Thr Asn Ser Ala 
145                 150                 155                 160 


Asp Val Phe Cys Phe Ala Ala Ala Gln Ile Lys Lys Ala Leu Asp Leu 
                165                 170                 175     


Thr Val Lys Leu Gly Gly Arg Gly Tyr Val Phe Trp Gly Gly Arg Glu 
            180                 185                 190         


Gly Tyr Glu Thr Leu Leu Asn Thr Asp Met Lys Phe Glu Gln Glu Asn 
        195                 200                 205             


Ile Ala Arg Leu Met His Leu Ala Val Asp Tyr Gly Arg Ser Ile Gly 
    210                 215                 220                 


Phe Thr Gly Asp Phe Tyr Ile Glu Pro Lys Pro Lys Glu Pro Met Lys 
225                 230                 235                 240 


His Gln Tyr Asp Phe Asp Ala Ala Thr Ala Ile Gly Phe Leu Arg Gln 
                245                 250                 255     


Tyr Gly Leu Asp Lys Asp Phe Lys Met Asn Ile Glu Ala Asn His Ala 
            260                 265                 270         


Thr Leu Ala Gly His Thr Phe Gln His Asp Leu Arg Ile Ser Ala Ile 
        275                 280                 285             


Asn Gly Met Leu Gly Ser Ile Asp Ala Asn Gln Gly Asp Leu Leu Leu 
    290                 295                 300                 


Gly Trp Asp Thr Asp Glu Phe Pro Phe Asn Val Tyr Glu Ala Thr Leu 
305                 310                 315                 320 


Cys Met Tyr Glu Val Leu Lys Ala Gly Gly Leu Thr Gly Gly Phe Asn 
                325                 330                 335     


Phe Asp Ser Lys Thr Arg Arg Pro Ser Tyr Thr Leu Glu Asp Met Phe 
            340                 345                 350         


His Ala Tyr Ile Leu Gly Met Asp Thr Phe Ala Leu Gly Leu Ile Lys 
        355                 360                 365             


Ala Ala Ala Leu Ile Glu Asp Gly Arg Leu Asp Gln Phe Val Ala Asp 
    370                 375                 380                 


Arg Tyr Ala Ser Tyr Lys Thr Gly Ile Gly Ala Lys Ile Arg Ser Gly 
385                 390                 395                 400 


Glu Thr Thr Leu Ala Glu Leu Ala Ala Tyr Ala Asp Lys Leu Gly Ala 
                405                 410                 415     


Pro Ala Leu Pro Ser Ser Gly Arg Gln Glu Tyr Leu Glu Ser Ile Val 
            420                 425                 430         


Asn Ser Ile Leu Phe Gly 
        435             


<210>  2
<211>  1314
<212>  DNA
<213>  artificial sequence

<220>
<223>  coding region for Ru4 optimized for expression in Saccharomyces 
       cerevisiae

<400>  2
atgtctgaaa tcttcgctaa catcccagtc atcccatacg aaggtccaca atctaagaac       60

ccattggctt tcaagttcta cgacgctgac aaggttatct tgggtaaaaa gatgtctgaa      120

cacttgccat tcgctatggc ttggtggcac aacttgtgtg ctggtggtac tgacatgttc      180

ggtagagaca ctgctgataa gtccttcggt gctgaaaagg gtactatggc tcacgctaga      240

gctaaggttg acgctggttt cgagttcatg aagaaggttg gtgtcaagta cttctgtttc      300

cacgacgttg atttggtccc agaagctgac gatatcaagg aaactaacag aagattggac      360

gaaatctctg attacatctt ggaaaagatg aagggtactg acatcaagtg tttgtggggt      420

actgctaaca tgttcggtaa cccaagatac atgaacggtg ctggttctac caactccgct      480

gacgttttct gtttcgctgc tgctcaaatc aagaaggctt tggatttgac tgttaagttg      540

ggtggtagag gttacgtctt ctggggtggt agagaaggtt acgaaacctt gttgaacact      600

gacatgaagt tcgaacaaga aaacatcgct agattgatgc acttggctgt tgactacggt      660

agatctatcg gtttcaccgg tgacttctac atcgaaccaa agccaaagga accaatgaag      720

caccaatacg acttcgatgc tgctactgct atcggtttct tgagacaata cggtttggac      780

aaggatttca agatgaacat cgaagctaac cacgctacct tggctggtca cactttccaa      840

cacgacttga gaatctctgc tatcaacggt atgttgggtt ccatcgacgc taaccaaggt      900

gacttgttgt tgggttggga caccgatgaa tttccattca acgtttacga agctactttg      960

tgtatgtacg aagtcttgaa ggctggtggt ttgaccggtg gtttcaactt cgactctaag     1020

accagaagac catcctacac tttggaagac atgttccacg cttacatctt gggtatggat     1080

actttcgctt tgggtttgat caaggctgct gctttgatcg aagacggtag attggatcaa     1140

ttcgttgctg acagatacgc ttcttacaag accggtatcg gtgctaagat cagatccggt     1200

gaaaccactt tggctgaatt ggctgcttac gctgacaagt tgggtgctcc agctttgcca     1260

tcttccggta gacaagaata cttggaatct atcgtcaact ccatcttgtt cggt           1314


<210>  3
<211>  438
<212>  PRT
<213>  Unknown

<220>
<223>  uncultured bacteria from cow rumen

<400>  3

Met Ala Glu Ile Phe Lys Gly Ile Pro Glu Ile Arg Tyr Glu Gly Pro 
1               5                   10                  15      


Asn Ser Thr Asn Pro Leu Ser Phe Lys Tyr Tyr Asp Pro Asp Lys Val 
            20                  25                  30          


Ile Leu Gly Lys Pro Met Lys Glu His Leu Pro Phe Ala Met Ala Trp 
        35                  40                  45              


Trp His Asn Leu Gly Ala Ala Gly Thr Asp Met Phe Gly Arg Asp Thr 
    50                  55                  60                  


Ala Asp Lys Ser Phe Gly Ala Glu Lys Gly Thr Met Glu His Ala Lys 
65                  70                  75                  80  


Ala Lys Val Asp Ala Gly Phe Glu Phe Met Lys Lys Leu Gly Ile Arg 
                85                  90                  95      


Tyr Phe Cys Phe His Asp Val Asp Leu Val Pro Glu Ala Asp Asp Ile 
            100                 105                 110         


Lys Val Thr Asn Ala Arg Leu Asp Glu Ile Ser Asp Tyr Ile Leu Glu 
        115                 120                 125             


Lys Met Lys Gly Thr Asp Ile Lys Cys Leu Trp Gly Thr Ala Asn Met 
    130                 135                 140                 


Phe Ser Asn Pro Arg Phe Met Asn Gly Ala Gly Ser Thr Asn Ser Ala 
145                 150                 155                 160 


Asp Val Phe Cys Phe Ala Ala Ala Gln Val Lys Lys Ala Leu Asp Ile 
                165                 170                 175     


Thr Val Lys Leu Gly Gly Lys Gly Tyr Val Phe Trp Gly Gly Arg Glu 
            180                 185                 190         


Gly Tyr Glu Thr Leu Leu Asn Thr Asp Val Lys Phe Glu Gln Glu Asn 
        195                 200                 205             


Ile Ala Lys Leu Met His Leu Ala Val Asp Tyr Gly Arg Ser Ile Gly 
    210                 215                 220                 


Phe Lys Gly Asp Phe Phe Ile Glu Pro Lys Pro Lys Glu Pro Met Lys 
225                 230                 235                 240 


His Gln Tyr Asp Phe Asp Ala Ala Thr Ala Ile Gly Phe Val Arg Gln 
                245                 250                 255     


Tyr Gly Leu Asp Lys Asp Phe Lys Met Asn Ile Glu Ala Asn His Ala 
            260                 265                 270         


Thr Leu Ala Gly His Thr Phe Gln His Glu Leu Arg Ile Ser Ala Ile 
        275                 280                 285             


Asn Gly Met Leu Gly Ser Ile Asp Ala Asn Gln Gly Asp Met Leu Leu 
    290                 295                 300                 


Gly Trp Asp Thr Asp Glu Phe Pro Phe Asn Val Tyr Asp Thr Thr Leu 
305                 310                 315                 320 


Cys Met Tyr Glu Val Leu Lys Asn Gly Gly Ile Pro Gly Gly Phe Asn 
                325                 330                 335     


Phe Asp Ala Lys Asn Arg Arg Pro Ser Tyr Thr Ala Glu Asp Met Phe 
            340                 345                 350         


Tyr Gly Phe Ile Leu Gly Met Asp Ser Phe Ala Leu Gly Leu Ile Lys 
        355                 360                 365             


Ala Ala Lys Leu Ile Glu Asp Gly Arg Ile Asp Lys Phe Val Glu Glu 
    370                 375                 380                 


Arg Tyr Ala Ser Tyr Lys Asp Gly Ile Gly Lys Lys Ile Arg Asp Gly 
385                 390                 395                 400 


Glu Thr Thr Leu Ala Glu Leu Ala Ala Tyr Ala Asp Gln Leu Gly Ala 
                405                 410                 415     


Pro Lys Leu Pro Gly Ser Gly Arg Gln Glu Asp Leu Glu Ser Val Phe 
            420                 425                 430         


Asn Gln Val Leu Phe Gly 
        435             


<210>  4
<211>  1314
<212>  DNA
<213>  artificial sequence

<220>
<223>  coding region for Ru1 optimized for expression in Saccharomyces 
       cerevisiae

<400>  4
atggctgaaa tcttcaaggg tatcccagaa atcagatacg aaggtccaaa ctccactaac       60

ccattgtctt tcaagtacta cgatccagac aaggttatct tgggtaaacc aatgaaggaa      120

cacttgccat tcgctatggc ttggtggcac aacttgggtg ctgctggtac tgacatgttc      180

ggtagagaca ctgctgataa gtctttcggt gctgaaaagg gtactatgga acacgctaag      240

gctaaggttg acgctggttt cgagttcatg aagaagttgg gtatcagata cttctgtttc      300

cacgacgttg atttggtccc agaagctgac gatatcaagg tcactaacgc tagattggac      360

gaaatctctg attacatctt ggaaaagatg aagggtactg acatcaagtg tttgtggggt      420

actgctaaca tgttctccaa cccaagattc atgaacggtg ctggttctac caactccgct      480

gacgttttct gtttcgctgc tgctcaagtc aagaaggctt tggacatcac tgttaagttg      540

ggtggtaaag gttacgtctt ctggggtggt agagaaggtt acgaaacctt gttgaacact      600

gacgttaagt tcgaacaaga aaacatcgct aagttgatgc acttggctgt tgactacggt      660

agatctatcg gtttcaaggg tgacttcttc atcgaaccaa agccaaagga accaatgaag      720

caccaatacg acttcgatgc tgctaccgct atcggtttcg ttagacaata cggtttggac      780

aaggatttca agatgaacat cgaagctaac cacgctacct tggctggtca cactttccaa      840

cacgaattga gaatctctgc tatcaacggt atgttgggtt ccatcgacgc taaccaaggt      900

gacatgttgt tgggttggga caccgatgaa tttccattca acgtttacga caccactttg      960

tgtatgtacg aagtcttgaa gaacggtggt atcccaggtg gtttcaactt cgacgctaag     1020

aacagaagac catcttacac tgctgaagac atgttctacg gtttcatctt gggtatggat     1080

tccttcgctt tgggtttgat caaggctgct aagttgatcg aagacggtag aatcgataag     1140

ttcgtcgaag aaagatacgc ttcttacaag gacggtatcg gtaaaaagat cagagatggt     1200

gaaaccactt tggctgaatt ggctgcttac gctgaccaat tgggtgctcc aaagttgcca     1260

ggttctggta gacaagaaga cttggaatcc gttttcaacc aagtcttgtt cggt           1314


<210>  5
<211>  439
<212>  PRT
<213>  Unknown

<220>
<223>  uncultured bacteria from cow rumen

<400>  5

Met Gly Glu Ile Phe Ser Asn Ile Pro Val Ile Lys Tyr Glu Gly Pro 
1               5                   10                  15      


Asp Ser Lys Asn Pro Leu Ala Phe Lys Tyr Tyr Asp Pro Glu Arg Val 
            20                  25                  30          


Ile Leu Gly Lys Lys Met Lys Glu His Leu Pro Phe Ala Met Ala Trp 
        35                  40                  45              


Trp His Asn Leu Cys Ala Asn Gly Val Asp Met Phe Gly Arg Gly Thr 
    50                  55                  60                  


Ile Asp Lys Leu Phe Gly Ala Ala Glu Ala Gly Thr Met Glu His Ala 
65                  70                  75                  80  


Lys Ala Lys Val Asp Ala Gly Ile Glu Phe Met Gln Lys Leu Gly Ile 
                85                  90                  95      


Glu Tyr Tyr Cys Phe His Asp Val Asp Leu Val Pro Glu Ala Asp Asp 
            100                 105                 110         


Ile Asn Glu Thr Asn Arg Arg Leu Asp Glu Leu Thr Asp Tyr Leu Lys 
        115                 120                 125             


Glu Lys Thr Ala Gly Thr Asn Ile Lys Cys Leu Trp Gly Thr Ala Asn 
    130                 135                 140                 


Met Phe Ser Asn Pro Arg Phe Met Asn Gly Ala Gly Ser Thr Asn Asp 
145                 150                 155                 160 


Val Asp Val Tyr Cys Phe Ala Ala Ala Gln Val Lys Lys Ala Ile Glu 
                165                 170                 175     


Met Thr Val Lys Leu Gly Gly Arg Gly Tyr Val Phe Trp Gly Gly Arg 
            180                 185                 190         


Glu Gly Tyr Glu Thr Leu Leu Asn Thr Lys Val Gln Met Glu Leu Glu 
        195                 200                 205             


Asn Ile Ala Asn Leu Met Lys Met Ala Arg Asp Tyr Gly Arg Ser Ile 
    210                 215                 220                 


Gly Phe Lys Gly Thr Phe Leu Ile Glu Pro Lys Pro Lys Glu Pro Met 
225                 230                 235                 240 


Lys His Gln Tyr Asp Tyr Asp Ala Ala Thr Ala Ile Gly Phe Leu Arg 
                245                 250                 255     


Gln Tyr Gly Leu Asp Gln Asp Phe Lys Met Asn Ile Glu Ala Asn His 
            260                 265                 270         


Ala Thr Leu Ala Gly His Thr Phe Gln His Glu Leu Arg Ile Ser Arg 
        275                 280                 285             


Ile Asn Gly Met Leu Gly Ser Ile Asp Ala Asn Gln Gly Asp Ile Met 
    290                 295                 300                 


Leu Gly Trp Asp Thr Asp Cys Phe Pro Ser Asn Val Tyr Asp Thr Thr 
305                 310                 315                 320 


Leu Ala Met Tyr Glu Ile Val Arg Asn Gly Gly Leu Pro Val Gly Ile 
                325                 330                 335     


Asn Phe Asp Ser Lys Asn Arg Arg Pro Ser Asn Thr Tyr Glu Asp Met 
            340                 345                 350         


Phe His Ala Phe Ile Leu Gly Met Asp Ser Phe Ala Phe Gly Leu Ile 
        355                 360                 365             


Lys Ala Ala Gln Ile Ile Glu Asp Gly Arg Ile Glu Gly Phe Thr Glu 
    370                 375                 380                 


Lys Lys Tyr Glu Ser Phe Asn Thr Glu Leu Gly Gln Lys Ile Arg Lys 
385                 390                 395                 400 


Gly Glu Ala Thr Leu Glu Glu Leu Ala Ala His Ala Ala Asp Leu Lys 
                405                 410                 415     


Ala Pro Lys Val Pro Val Ser Gly Arg Gln Glu Tyr Leu Glu Gly Val 
            420                 425                 430         


Leu Asn Asn Ile Ile Leu Ser 
        435                 


<210>  6
<211>  1317
<212>  DNA
<213>  artificial sequence

<220>
<223>  coding region for Ru2 optimized for expression in Saccharomyces 
       cerevisiae

<400>  6
atgggtgaaa tcttctctaa catcccagtc atcaagtacg aaggtccaga ctctaagaac       60

ccattggctt tcaagtacta cgatccagaa agagtcatct tgggtaaaaa gatgaaggaa      120

cacttgccat tcgctatggc ttggtggcac aacttgtgtg ctaacggtgt tgacatgttc      180

ggtagaggta ctatcgataa gttgttcggt gctgctgaag ctggtactat ggaacacgct      240

aaggctaagg ttgacgctgg tatcgagttc atgcaaaagt tgggtatcga atactactgt      300

ttccacgacg ttgatttggt cccagaagct gacgatatca acgaaaccaa cagaagattg      360

gacgaattga ctgattactt gaaggaaaag accgctggta ctaacatcaa gtgtttgtgg      420

ggtactgcta acatgttctc taacccaaga ttcatgaacg gtgctggttc cactaacgac      480

gttgatgtct actgtttcgc tgctgctcaa gttaagaagg ctatcgaaat gaccgtcaag      540

ttgggtggta gaggttacgt tttctggggt ggtagagaag gttacgaaac cttgttgaac      600

actaaggtcc aaatggaatt ggaaaacatc gctaacttga tgaagatggc tagagactac      660

ggtagatcta tcggtttcaa gggtactttc ttgatcgaac caaagccaaa ggaaccaatg      720

aagcaccaat acgactacga tgctgctact gctatcggtt tcttgagaca atacggtttg      780

gaccaagatt tcaagatgaa catcgaagct aaccacgcta ccttggctgg tcacactttc      840

caacacgaat tgagaatctc tagaatcaac ggtatgttgg gttccatcga cgctaaccaa      900

ggtgacatca tgttgggttg ggacaccgat tgtttcccat ctaacgttta cgacaccact      960

ttggctatgt acgaaatcgt tagaaacggt ggtttgccag tcggtatcaa cttcgactct     1020

aagaacagaa gaccatccaa cacttacgaa gacatgttcc acgctttcat cttgggtatg     1080

gactctttcg ctttcggttt gatcaaggct gctcaaatca tcgaagacgg tagaatcgaa     1140

ggtttcaccg aaaagaagta cgaatccttc aacactgaat tgggtcaaaa gatcagaaag     1200

ggtgaagcta ctttggaaga attggctgct cacgctgctg acttgaaggc tccaaaggtt     1260

ccagtctctg gtagacaaga atacttggaa ggtgttttga acaacatcat cttgtcc        1317


<210>  7
<211>  395
<212>  PRT
<213>  Unknown

<220>
<223>  uncultured bacteria from cow rumen

<400>  7

Met Ala Trp Trp His Asn Met Cys Ala Asn Gly Lys Asp Met Phe Gly 
1               5                   10                  15      


Thr Gly Thr Ala Asp Lys Ser Phe Gly Ala Glu Pro Gly Thr Met Glu 
            20                  25                  30          


His Ala Lys Ala Lys Val Asp Ala Ala Ile Glu Phe Met Gln Lys Leu 
        35                  40                  45              


Gly Ile Glu Tyr Tyr Cys Phe His Asp Val Asp Leu Val Pro Glu Asp 
    50                  55                  60                  


Glu Asp Asp Ile Asn Val Thr Asn Ala Arg Leu Asp Glu Ile Ser Asp 
65                  70                  75                  80  


Tyr Ile Leu Glu Lys Thr Lys Gly Thr Asn Ile Arg Cys Leu Trp Gly 
                85                  90                  95      


Thr Ala Asn Met Phe Asn Asn Pro Arg Phe Met Asn Gly Ala Gly Ser 
            100                 105                 110         


Thr Asn Ser Ala Asp Val Tyr Cys Phe Ala Ala Ala Gln Ile Lys Lys 
        115                 120                 125             


Ala Leu Asp Ile Thr Val Lys Leu Gly Gly Arg Gly Tyr Val Phe Trp 
    130                 135                 140                 


Gly Gly Arg Glu Gly Tyr Glu Thr Leu Leu Asn Thr Asp Val Lys Leu 
145                 150                 155                 160 


Glu Gln Glu Asn Ile Ala Asn Leu Met His Met Ala Val Glu Tyr Gly 
                165                 170                 175     


Arg Ser Ile Gly Phe Lys Gly Asp Phe Leu Ile Glu Pro Lys Pro Lys 
            180                 185                 190         


Glu Pro Met Lys His Gln Tyr Asp Phe Asp Ala Ala Thr Ala Ile Gly 
        195                 200                 205             


Phe Leu Arg Gln Tyr Gly Leu Asp Lys Asp Phe Lys Leu Asn Ile Glu 
    210                 215                 220                 


Ala Asn His Ala Thr Leu Ala Gly His Thr Phe Gln His Glu Leu Arg 
225                 230                 235                 240 


Ile Ser Ala Met Asn Gly Met Leu Gly Ser Ile Asp Ala Asn Gln Gly 
                245                 250                 255     


Asp Met Leu Leu Gly Trp Asp Thr Asp Glu Phe Pro Phe Asn Val Tyr 
            260                 265                 270         


Asp Thr Thr Leu Ala Met Tyr Glu Val Leu Lys Ala Gly Gly Ile Asn 
        275                 280                 285             


Gly Gly Phe Asn Phe Asp Ser Lys Asn Arg Arg Pro Ser Asn Thr Tyr 
    290                 295                 300                 


Glu Asp Met Phe Tyr Gly Tyr Ile Leu Gly Met Asp Ser Phe Ala Leu 
305                 310                 315                 320 


Gly Leu Ile Lys Ala Ala Ala Ile Ile Glu Asp Gly Arg Ile Glu Lys 
                325                 330                 335     


Gln Leu Ala Asp Arg Tyr Ser Ser Tyr Ser Asn Thr Glu Ile Gly Lys 
            340                 345                 350         


Lys Ile Arg Asn His Thr Ala Thr Leu Lys Glu Leu Ala Glu Tyr Ala 
        355                 360                 365             


Ala Thr Leu Lys Lys Pro Gly Asp Pro Gly Ser Gly Arg Gln Glu Leu 
    370                 375                 380                 


Leu Glu Gln Ile Met Asn Glu Val Met Phe Gly 
385                 390                 395 


<210>  8
<211>  1185
<212>  DNA
<213>  artificial sequence

<220>
<223>  coding region for Ru3 optimized for expression in Saccharomyces 
       cerevisiae

<400>  8
atggcttggt ggcacaacat gtgtgctaac ggcaaggata tgttcggtac tggtactgct       60

gataagtctt tcggtgctga accaggcacc atggaacacg ctaaggctaa ggttgacgct      120

gctatcgagt tcatgcaaaa gttgggtatc gaatactact gtttccacga cgttgatttg      180

gtcccagaag acgaagacga tatcaacgtc actaacgcta gattggacga aatctctgat      240

tacatcttgg aaaagaccaa gggtactaac atcagatgtt tgtggggtac tgctaacatg      300

ttcaacaacc caagattcat gaacggtgct ggttctacta actccgctga cgtttactgt      360

ttcgctgctg ctcaaatcaa gaaggctttg gacatcaccg ttaagttggg tggtagaggt      420

tacgtcttct ggggtggtag agaaggttac gaaaccttgt tgaacactga cgttaagttg      480

gaacaagaaa acatcgctaa cttgatgcac atggctgtcg aatacggtag atctatcggt      540

ttcaagggtg acttcttgat cgaaccaaag ccaaaggaac caatgaagca ccaatacgac      600

ttcgatgctg ctactgctat cggtttcttg agacaatacg gtttggacaa ggatttcaag      660

ttgaacatcg aagctaacca cgctaccttg gctggtcaca ctttccaaca cgaattgaga      720

atctctgcta tgaacggtat gttgggttcc atcgacgcta accaaggtga catgttgttg      780

ggttgggaca ccgatgaatt tccattcaac gtttacgaca ccactttggc tatgtacgaa      840

gtcttgaagg ctggtggtat caacggtggt ttcaacttcg actctaagaa cagaagacca      900

tccaacactt acgaagacat gttctacggt tacatcttgg gtatggattc tttcgctttg      960

ggtttgatca aggctgctgc tatcatcgaa gacggtagaa tcgaaaagca attggctgat     1020

agatactctt cctactccaa caccgaaatc ggtaaaaaga tcagaaacca caccgctact     1080

ttgaaggaat tggctgaata cgctgctact ttgaagaagc caggtgaccc aggttccggt     1140

agacaagaat tgttggaaca aatcatgaac gaagttatgt tcggt                     1185


<210>  9
<211>  441
<212>  PRT
<213>  Ruminococcus champanellensis

<400>  9

Met Ser Glu Phe Phe Thr Gly Ile Ser Lys Ile Pro Phe Glu Gly Lys 
1               5                   10                  15      


Ala Ser Asn Asn Pro Met Ala Phe Lys Tyr Tyr Asn Pro Asp Glu Val 
            20                  25                  30          


Val Gly Gly Lys Thr Met Arg Glu Gln Leu Lys Phe Ala Leu Ser Trp 
        35                  40                  45              


Trp His Thr Met Gly Gly Asp Gly Thr Asp Met Phe Gly Val Gly Thr 
    50                  55                  60                  


Thr Asn Lys Lys Phe Gly Gly Thr Asp Pro Met Asp Ile Ala Lys Arg 
65                  70                  75                  80  


Lys Val Asn Ala Ala Phe Glu Leu Met Asp Lys Leu Ser Ile Asp Tyr 
                85                  90                  95      


Phe Cys Phe His Asp Arg Asp Leu Ala Pro Glu Ala Asp Asn Leu Lys 
            100                 105                 110         


Glu Thr Asn Gln Arg Leu Asp Glu Ile Thr Glu Tyr Ile Ala Gln Met 
        115                 120                 125             


Met Gln Leu Asn Pro Asp Lys Lys Val Leu Trp Gly Thr Ala Asn Cys 
    130                 135                 140                 


Phe Gly Asn Pro Arg Tyr Met His Gly Ala Gly Thr Ala Pro Asn Ala 
145                 150                 155                 160 


Asp Val Phe Ala Phe Ala Ala Ala Gln Ile Lys Lys Ala Ile Glu Ile 
                165                 170                 175     


Thr Val Lys Leu Gly Gly Lys Gly Tyr Val Phe Trp Gly Gly Arg Glu 
            180                 185                 190         


Gly Tyr Glu Thr Leu Leu Asn Thr Asn Met Gly Leu Glu Leu Asp Asn 
        195                 200                 205             


Met Ala Arg Leu Leu His Met Ala Val Asp Tyr Ala Arg Ser Ile Gly 
    210                 215                 220                 


Phe Thr Gly Asp Phe Tyr Ile Glu Pro Lys Pro Lys Glu Pro Thr Lys 
225                 230                 235                 240 


His Gln Tyr Asp Phe Asp Thr Ala Thr Val Ile Gly Phe Leu Arg Lys 
                245                 250                 255     


Tyr Asn Leu Asp Lys Asp Phe Lys Met Asn Ile Glu Ala Asn His Ala 
            260                 265                 270         


Thr Leu Ala Gln His Thr Phe Gln His Glu Leu Arg Val Ala Arg Glu 
        275                 280                 285             


Asn Gly Phe Phe Gly Ser Ile Asp Ala Asn Gln Gly Asp Thr Leu Leu 
    290                 295                 300                 


Gly Trp Asp Thr Asp Gln Phe Pro Thr Asn Thr Tyr Asp Ala Ala Leu 
305                 310                 315                 320 


Cys Met Tyr Glu Val Leu Lys Ala Gly Gly Phe Thr Asn Gly Gly Leu 
                325                 330                 335     


Asn Phe Asp Ser Lys Ala Arg Arg Gly Ser Phe Glu Met Glu Asp Ile 
            340                 345                 350         


Phe His Ser Tyr Ile Ala Gly Met Asp Thr Phe Ala Leu Gly Leu Lys 
        355                 360                 365             


Ile Ala Gln Lys Met Ile Asp Asp Gly Arg Ile Asp Gln Phe Val Ala 
    370                 375                 380                 


Asp Arg Tyr Ala Ser Trp Asn Thr Gly Ile Gly Ala Asp Ile Ile Ser 
385                 390                 395                 400 


Gly Lys Ala Thr Met Ala Asp Leu Glu Ala Tyr Ala Leu Ser Lys Gly 
                405                 410                 415     


Asp Val Thr Ala Ser Leu Lys Ser Gly Arg Gln Glu Leu Leu Glu Ser 
            420                 425                 430         


Ile Leu Asn Asn Ile Met Phe Asn Leu 
        435                 440     


<210>  10
<211>  1323
<212>  DNA
<213>  artificial sequence

<220>
<223>  coding region for R champ XI optimized for expression in 
       Saccharomyces cerevisiae

<400>  10
atgtccgagt tcttcactgg tatctctaag atcccattcg aaggcaaggc ttctaacaac       60

ccaatggctt tcaagtacta caacccagac gaagttgtcg gtggtaaaac catgagagaa      120

caattgaagt tcgctttgtc ttggtggcac accatgggtg gtgacggtac tgatatgttc      180

ggtgttggta ctactaacaa gaagttcggt ggtactgacc caatggatat cgctaagaga      240

aaggtcaacg ctgctttcga attgatggac aagttgtcca tcgattactt ctgtttccac      300

gacagagatt tggctccaga agctgacaac ttgaaggaaa ccaaccaaag attggatgaa      360

atcactgaat acatcgctca aatgatgcaa ttgaacccag acaagaaggt tttgtggggt      420

actgctaact gtttcggtaa cccaagatac atgcacggtg ctggtactgc tccaaacgct      480

gacgttttcg ctttcgctgc tgctcaaatc aagaaggcta tcgaaatcac cgttaagttg      540

ggtggtaaag gttacgtctt ctggggtggt agagaaggtt acgaaacctt gttgaacact      600

aacatgggtt tggaattgga caacatggct agattgttgc acatggctgt tgactacgct      660

agatctatcg gtttcaccgg tgacttctac atcgaaccaa agccaaagga accaactaag      720

caccaatacg acttcgatac cgctactgtc atcggtttct tgagaaagta caacttggac      780

aaggatttca agatgaacat cgaagctaac cacgctacct tggctcaaca cactttccaa      840

cacgaattga gagttgctag agaaaacggt ttcttcggtt ctatcgacgc taaccaaggt      900

gacaccttgt tgggttggga cactgatcaa ttcccaacca acacttacga cgctgctttg      960

tgtatgtacg aagtcttgaa ggctggtggt ttcaccaacg gtggtttgaa cttcgactct     1020

aaggctagaa gaggttcctt cgaaatggaa gacatcttcc actcctacat cgctggtatg     1080

gacactttcg ctttgggttt gaagatcgct caaaagatga tcgacgatgg tagaatcgac     1140

caattcgttg ctgatagata cgcttcttgg aacaccggta tcggtgctga catcatctcc     1200

ggtaaagcta ccatggctga cttggaagct tacgctttgt ctaagggtga cgttactgct     1260

tccttgaagt ccggtagaca agaattgttg gaatctatct tgaacaacat catgttcaac     1320

ttg                                                                   1323


<210>  11
<211>  439
<212>  PRT
<213>  Ruminococcus flavefaciens

<400>  11

Met Glu Phe Phe Lys Asn Ile Ser Lys Ile Pro Tyr Glu Gly Lys Asp 
1               5                   10                  15      


Ser Thr Asn Pro Leu Ala Phe Lys Tyr Tyr Asn Pro Asp Glu Val Ile 
            20                  25                  30          


Asp Gly Lys Lys Met Arg Asp Ile Met Lys Phe Ala Leu Ser Trp Trp 
        35                  40                  45              


His Thr Met Gly Gly Asp Gly Thr Asp Met Phe Gly Cys Gly Thr Ala 
    50                  55                  60                  


Asp Lys Thr Trp Gly Glu Asn Asp Pro Ala Ala Arg Ala Lys Ala Lys 
65                  70                  75                  80  


Val Asp Ala Ala Phe Glu Ile Met Gln Lys Leu Ser Ile Asp Tyr Phe 
                85                  90                  95      


Cys Phe His Asp Arg Asp Leu Ser Pro Glu Tyr Gly Ser Leu Lys Asp 
            100                 105                 110         


Thr Asn Ala Gln Leu Asp Ile Val Thr Asp Tyr Ile Lys Ala Lys Gln 
        115                 120                 125             


Ala Glu Thr Gly Leu Lys Cys Leu Trp Gly Thr Ala Lys Cys Phe Asp 
    130                 135                 140                 


His Pro Arg Phe Met His Gly Ala Gly Thr Ser Pro Ser Ala Asp Val 
145                 150                 155                 160 


Phe Ala Phe Ser Ala Ala Gln Ile Lys Lys Ala Leu Glu Ser Thr Val 
                165                 170                 175     


Lys Leu Gly Gly Thr Gly Tyr Val Phe Trp Gly Gly Arg Glu Gly Tyr 
            180                 185                 190         


Glu Thr Leu Leu Asn Thr Asn Met Gly Leu Glu Leu Asp Asn Met Ala 
        195                 200                 205             


Arg Leu Met Lys Met Ala Val Glu Tyr Gly Arg Ser Ile Gly Phe Lys 
    210                 215                 220                 


Gly Asp Phe Tyr Ile Glu Pro Lys Pro Lys Glu Pro Thr Lys His Gln 
225                 230                 235                 240 


Tyr Asp Phe Asp Thr Ala Thr Val Leu Gly Phe Leu Arg Lys Tyr Gly 
                245                 250                 255     


Leu Asp Lys Asp Phe Lys Met Asn Ile Glu Ala Asn His Ala Thr Leu 
            260                 265                 270         


Ala Gln His Thr Phe Gln His Glu Leu Cys Val Ala Arg Thr Asn Gly 
        275                 280                 285             


Ala Phe Gly Ser Ile Asp Ala Asn Gln Gly Asp Pro Leu Leu Gly Trp 
    290                 295                 300                 


Asp Thr Asp Gln Phe Pro Thr Asn Ile Tyr Asp Thr Thr Met Cys Met 
305                 310                 315                 320 


Tyr Glu Val Ile Lys Ala Gly Gly Phe Thr Asn Gly Gly Leu Asn Phe 
                325                 330                 335     


Asp Ala Lys Ala Arg Arg Gly Ser Phe Thr Pro Glu Asp Ile Phe Tyr 
            340                 345                 350         


Ser Tyr Ile Ala Gly Met Asp Ala Phe Ala Leu Gly Tyr Lys Ala Ala 
        355                 360                 365             


Ser Lys Leu Ile Ala Asp Gly Arg Ile Asp Ser Phe Ile Ser Asp Arg 
    370                 375                 380                 


Tyr Ala Ser Trp Ser Glu Gly Ile Gly Leu Asp Ile Ile Ser Gly Lys 
385                 390                 395                 400 


Ala Asp Met Ala Ala Leu Glu Lys Tyr Ala Leu Glu Lys Gly Glu Val 
                405                 410                 415     


Thr Asp Ser Ile Ser Ser Gly Arg Gln Glu Leu Leu Glu Ser Ile Val 
            420                 425                 430         


Asn Asn Val Ile Phe Asn Leu 
        435                 


<210>  12
<211>  440
<212>  PRT
<213>  Abiotrophia defectiva

<400>  12

Met Ser Glu Leu Phe Gln Asn Ile Pro Lys Ile Lys Tyr Glu Gly Ala 
1               5                   10                  15      


Asn Ser Lys Asn Pro Leu Ala Phe His Tyr Tyr Asp Ala Glu Lys Ile 
            20                  25                  30          


Val Leu Gly Lys Thr Met Lys Glu His Leu Pro Phe Ala Met Ala Trp 
        35                  40                  45              


Trp His Asn Leu Cys Ala Ala Gly Thr Asp Met Phe Gly Arg Asp Thr 
    50                  55                  60                  


Ala Asp Lys Ser Phe Gly Leu Glu Lys Gly Ser Met Glu His Ala Lys 
65                  70                  75                  80  


Ala Lys Val Asp Ala Gly Phe Glu Phe Met Glu Lys Leu Gly Ile Lys 
                85                  90                  95      


Tyr Phe Cys Phe His Asp Val Asp Leu Val Pro Glu Ala Cys Asp Ile 
            100                 105                 110         


Lys Glu Thr Asn Ser Arg Leu Asp Glu Ile Ser Asp Tyr Ile Leu Glu 
        115                 120                 125             


Lys Met Lys Gly Thr Asp Ile Lys Cys Leu Trp Gly Thr Ala Asn Met 
    130                 135                 140                 


Phe Ser Asn Pro Arg Phe Val Asn Gly Ala Gly Ser Thr Asn Ser Ala 
145                 150                 155                 160 


Asp Val Tyr Cys Phe Ala Ala Ala Gln Ile Lys Lys Ala Leu Asp Ile 
                165                 170                 175     


Thr Val Lys Leu Gly Gly Arg Gly Tyr Val Phe Trp Gly Gly Arg Glu 
            180                 185                 190         


Gly Tyr Glu Thr Leu Leu Asn Thr Asp Val Lys Phe Glu Gln Glu Asn 
        195                 200                 205             


Ile Ala Asn Leu Met Lys Met Ala Val Glu Tyr Gly Arg Ser Ile Gly 
    210                 215                 220                 


Phe Lys Gly Asp Phe Tyr Ile Glu Pro Lys Pro Lys Glu Pro Met Lys 
225                 230                 235                 240 


His Gln Tyr Asp Phe Asp Ala Ala Thr Ala Ile Gly Phe Leu Arg Gln 
                245                 250                 255     


Tyr Gly Leu Asp Lys Asp Phe Lys Leu Asn Ile Glu Ala Asn His Ala 
            260                 265                 270         


Thr Leu Ala Gly His Ser Phe Gln His Glu Leu Arg Ile Ser Ser Ile 
        275                 280                 285             


Asn Gly Met Leu Gly Ser Val Asp Ala Asn Gln Gly Asp Met Leu Leu 
    290                 295                 300                 


Gly Trp Asp Thr Asp Glu Phe Pro Phe Asp Val Tyr Asp Thr Thr Met 
305                 310                 315                 320 


Cys Met Tyr Glu Val Leu Lys Asn Gly Gly Leu Thr Gly Gly Phe Asn 
                325                 330                 335     


Phe Asp Ala Lys Asn Arg Arg Pro Ser Tyr Thr Tyr Glu Asp Met Phe 
            340                 345                 350         


Tyr Gly Phe Ile Leu Gly Met Asp Ser Phe Ala Leu Gly Leu Ile Lys 
        355                 360                 365             


Ala Ala Lys Leu Ile Glu Glu Gly Thr Leu Asp Asn Phe Ile Lys Glu 
    370                 375                 380                 


Arg Tyr Lys Ser Phe Glu Ser Glu Ile Gly Lys Lys Ile Arg Ser Lys 
385                 390                 395                 400 


Ser Ala Ser Leu Gln Glu Leu Ala Ala Tyr Ala Glu Glu Met Gly Ala 
                405                 410                 415     


Pro Ala Met Pro Gly Ser Gly Arg Gln Glu Tyr Leu Gln Ala Ala Leu 
            420                 425                 430         


Asn Gln Asn Leu Phe Gly Glu Val 
        435                 440 


<210>  13
<211>  9901
<212>  DNA
<213>  artificial sequence

<220>
<223>  constructed vector containing Ru2 chimeric gene

<400>  13
aggccagagg aaaataatat caagtgctgg aaactttttc tcttggaatt tttgcaacat       60

caagtcatag tcaattgaat tgacccaatt tcacatttaa gatttttttt ttttcatccg      120

acatacatct gtacactagg aagccctgtt tttctgaagc agcttcaaat atatatattt      180

tttacatatt tattatgatt caatgaacaa tctaattaaa tcgaaaacaa gaaccgaaac      240

gcgaataaat aatttattta gatggtgaca agtgtataag tcctcatcgg gacagctacg      300

atttctcttt cggttttggc tgagctactg gttgctgtga cgcagcggca ttagcgcggc      360

gttatgagct accctcgtgg cctgaaagat ggcgggaata aagcggaact aaaaattact      420

gactgagcca tattgaggtc aatttgtcaa ctcgtcaagt cacgtttggt ggacggcccc      480

tttccaacga atcgtatata ctaacatgcg cgcgcttcct atatacacat atacatatat      540

atatatatat atatgtgtgc gtgtatgtgt acacctgtat ttaatttcct tactcgcggg      600

tttttctttt ttctcaattc ttggcttcct ctttctcgag cggaccggat cctccgcggt      660

gccggcagat ctatttaaat ggcgcgccga cgtcaggtgg cacttttcgg ggaaatgtgc      720

gcggaacccc tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac      780

aataaccctg ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt      840

tccgtgtcgc ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag      900

aaacgctggt gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg      960

aactggatct caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa     1020

tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc     1080

aagagcaact cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag     1140

tcacagaaaa gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa     1200

ccatgagtga taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc     1260

taaccgcttt tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg     1320

agctgaatga agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa     1380

caacgttgcg caaactatta actggcgaac tacttactct agcttcccgg caacaattaa     1440

tagactggat ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg     1500

gctggtttat tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag     1560

cactggggcc agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg     1620

caactatgga tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt     1680

ggtaactgtc agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt     1740

aatttaaaag gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac     1800

gtgagttttc gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag     1860

atcctttttt tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg     1920

tggtttgttt gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca     1980

gagcgcagat accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga     2040

actctgtagc accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca     2100

gtggcgataa gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc     2160

agcggtcggg ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca     2220

ccgaactgag atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa     2280

aggcggacag gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc     2340

cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc     2400

gtcgattttt gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg     2460

cctttttacg gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat     2520

cccctgattc tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca     2580

gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca     2640

aaccgcctct ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg     2700

actggaaagc gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac     2760

cccaggcttt acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac     2820

aatttcacac aggaaacagc tatgaccatg attacgccaa gctttttctt tccaattttt     2880

tttttttcgt cattataaaa atcattacga ccgagattcc cgggtaataa ctgatataat     2940

taaattgaag ctctaatttg tgagtttagt atacatgcat ttacttataa tacagttttt     3000

tagttttgct ggccgcatct tctcaaatat gcttcccagc ctgcttttct gtaacgttca     3060

ccctctacct tagcatccct tccctttgca aatagtcctc ttccaacaat aataatgtca     3120

gatcctgtag agaccacatc atccacggtt ctatactgtt gacccaatgc gtctcccttg     3180

tcatctaaac ccacaccggg tgtcataatc aaccaatcgt aaccttcatc tcttccaccc     3240

atgtctcttt gagcaataaa gccgataaca aaatctttgt cgctcttcgc aatgtcaaca     3300

gtacccttag tatattctcc agtagatagg gagcccttgc atgacaattc tgctaacatc     3360

aaaaggcctc taggttcctt tgttacttct tctgccgcct gcttcaaacc gctaacaata     3420

cctgggccca ccacaccgtg tgcattcgta atgtctgccc attctgctat tctgtataca     3480

cccgcagagt actgcaattt gactgtatta ccaatgtcag caaattttct gtcttcgaag     3540

agtaaaaaat tgtacttggc ggataatgcc tttagcggct taactgtgcc ctccatggaa     3600

aaatcagtca agatatccac atgtgttttt agtaaacaaa ttttgggacc taatgcttca     3660

actaactcca gtaattcctt ggtggtacga acatccaatg aagcacacaa gtttgtttgc     3720

ttttcgtgca tgatattaaa tagcttggca gcaacaggac taggatgagt agcagcacgt     3780

tccttatatg tagctttcga catgatttat cttcgtttcc tgcaggtttt tgttctgtgc     3840

agttgggtta agaatactgg gcaatttcat gtttcttcaa cactacatat gcgtatatat     3900

accaatctaa gtctgtgctc cttccttcgt tcttccttct gttcggagat taccgaatca     3960

aaaaaatttc aaggaaaccg aaatcaaaaa aaagaataaa aaaaaaatga tgaattgaaa     4020

agcttgcatg cctgcaggtc gactctagta tactccgtct actgtacgat acacttccgc     4080

tcaggtcctt gtcctttaac gaggccttac cactcttttg ttactctatt gatccagctc     4140

agcaaaggca gtgtgatcta agattctatc ttcgcgatgt agtaaaacta gctagaccga     4200

gaaagagact agaaatgcaa aaggcacttc tacaatggct gccatcatta ttatccgatg     4260

tgacgctgca tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt     4320

ttgtacaaat atcataaaaa aagagaatct ttttaagcaa ggattttctt aacttcttcg     4380

gcgacagcat caccgacttc ggtggtactg ttggaaccac ctaaatcacc agttctgata     4440

cctgcatcca aaaccttttt aactgcatct tcaatggctt taccttcttc aggcaagttc     4500

aatgacaatt tcaacatcat tgcagcagac aagatagtgg cgatagggtt gaccttattc     4560

tttggcaaat ctggagcgga accatggcat ggttcgtaca aaccaaatgc ggtgttcttg     4620

tctggcaaag aggccaagga cgcagatggc aacaaaccca aggagcctgg gataacggag     4680

gcttcatcgg agatgatatc accaaacatg ttgctggtga ttataatacc atttaggtgg     4740

gttgggttct taactaggat catggcggca gaatcaatca attgatgttg aactttcaat     4800

gtagggaatt cgttcttgat ggtttcctcc acagtttttc tccataatct tgaagaggcc     4860

aaaacattag ctttatccaa ggaccaaata ggcaatggtg gctcatgttg tagggccatg     4920

aaagcggcca ttcttgtgat tctttgcact tctggaacgg tgtattgttc actatcccaa     4980

gcgacaccat caccatcgtc ttcctttctc ttaccaaagt aaatacctcc cactaattct     5040

ctaacaacaa cgaagtcagt acctttagca aattgtggct tgattggaga taagtctaaa     5100

agagagtcgg atgcaaagtt acatggtctt aagttggcgt acaattgaag ttctttacgg     5160

atttttagta aaccttgttc aggtctaaca ctaccggtac cccatttagg accacccaca     5220

gcacctaaca aaacggcatc agccttcttg gaggcttcca gcgcctcatc tggaagtgga     5280

acacctgtag catcgatagc agcaccacca attaaatgat tttcgaaatc gaacttgaca     5340

ttggaacgaa catcagaaat agctttaaga accttaatgg cttcggctgt gatttcttga     5400

ccaacgtggt cacctggcaa aacgacgatc ttcttagggg cagacattac aatggtatat     5460

ccttgaaata tatataaaaa aaaaaaaaaa aaaaaaaaaa aaaaatgcag cttctcaatg     5520

atattcgaat acgctttgag gagatacagc ctaatatccg acaaactgtt ttacagattt     5580

acgatcgtac ttgttaccca tcattgaatt ttgaacatcc gaacctggga gttttccctg     5640

aaacagatag tatatttgaa cctgtataat aatatatagt ctagcgcttt acggaagaca     5700

atgtatgtat ttcggttcct ggagaaacta ttgcatctat tgcataggta atcttgcacg     5760

tcgcatcccc ggttcatttt ctgcgtttcc atcttgcact tcaatagcat atctttgtta     5820

acgaagcatc tgtgcttcat tttgtagaac aaaaatgcaa cgcgagagcg ctaatttttc     5880

aaacaaagaa tctgagctgc atttttacag aacagaaatg caacgcgaaa gcgctatttt     5940

accaacgaag aatctgtgct tcatttttgt aaaacaaaaa tgcaacgcga gagcgctaat     6000

ttttcaaaca aagaatctga gctgcatttt tacagaacag aaatgcaacg cgagagcgct     6060

attttaccaa caaagaatct atacttcttt tttgttctac aaaaatgcat cccgagagcg     6120

ctatttttct aacaaagcat cttagattac tttttttctc ctttgtgcgc tctataatgc     6180

agtctcttga taactttttg cactgtaggt ccgttaaggt tagaagaagg ctactttggt     6240

gtctattttc tcttccataa aaaaagcctg actccacttc ccgcgtttac tgattactag     6300

cgaagctgcg ggtgcatttt ttcaagataa aggcatcccc gattatattc tataccgatg     6360

tggattgcgc atactttgtg aacagaaagt gatagcgttg atgattcttc attggtcaga     6420

aaattatgaa cggtttcttc tattttgtct ctatatacta cgtataggaa atgtttacat     6480

tttcgtattg ttttcgattc actctatgaa tagttcttac tacaattttt ttgtctaaag     6540

agtaatacta gagataaaca taaaaaatgt agaggtcgag tttagatgca agttcaagga     6600

gcgaaaggtg gatgggtagg ttatataggg atatagcaca gagatatata gcaaagagat     6660

acttttgagc aatgtttgtg gaagcggtat tcgcaatatt ttagtagctc gttacagtcc     6720

ggtgcgtttt tggttttttg aaagtgcgtc ttcagagcgc ttttggtttt caaaagcgct     6780

ctgaagttcc tatactttct agagaatagg aacttcggaa taggaacttc aaagcgtttc     6840

cgaaaacgag cgcttccgaa aatgcaacgc gagctgcgca catacagctc actgttcacg     6900

tcgcacctat atctgcgtgt tgcctgtata tatatataca tgagaagaac ggcatagtgc     6960

gtgtttatgc ttaaatgcgt acttatatgc gtctatttat gtaggatgaa aggtagtcta     7020

gtacctcctg tgatattatc ccattccatg cggggtatcg tatgcttcct tcagcactac     7080

cctttagctg ttctatatgc tgccactcct caattggatt agtctcatcc ttcaatgcta     7140

tcatttcctt tgatattgga tcatatgcat agtaccgaga aactagagga tctcccatta     7200

ccgacatttg ggcgctatac gtgcatatgt tcatgtatgt atctgtattt aaaacacttt     7260

tgtattattt ttcctcatat atgtgtatag gtttatacgg atgatttaat tattacttca     7320

ccacccttta tttcaggctg atatcttagc cttgttacta gtcaccggtg gcggccgcac     7380

ctggtaaaac ctctagtgga gtagtagatg taatcaatga agcggaagcc aaaagaccag     7440

agtagaggcc tatagaagaa actgcgatac cttttgtgat ggctaaacaa acagacatct     7500

ttttatatgt ttttacttct gtatatcgtg aagtagtaag tgataagcga atttggctaa     7560

gaacgttgta agtgaacaag ggacctcttt tgcctttcaa aaaaggatta aatggagtta     7620

atcattgaga tttagttttc gttagattct gtatccctaa ataactccct tacccgacgg     7680

gaaggcacaa aagacttgaa taatagcaaa cggccagtag ccaagaccaa ataatactag     7740

agttaactga tggtcttaaa caggcattac gtggtgaact ccaagaccaa tatacaaaat     7800

atcgataagt tattcttgcc caccaattta aggagcctac atcaggacag tagtaccatt     7860

cctcagagaa gaggtataca taacaagaaa atcgcgtgaa caccttatat aacttagccc     7920

gttattgagc taaaaaacct tgcaaaattt cctatgaata agaatacttc agacgtgata     7980

aaaatttact ttctaactct tctcacgctg cccctatctg ttcttccgct ctaccgtgag     8040

aaataaagca tcgagtacgg cagttcgctg tcactgaact aaaacaataa ggctagttcg     8100

aatgatgaac ttgcttgctg tcaaacttct gagttgccgc tgatgtgaca ctgtgacaat     8160

aaattcaaac cggttatagc ggtctcctcc ggtaccggtt ctgccacctc caatagagct     8220

cagtaggagt cagaacctct gcggtggctg tcagtgactc atccgcgttt cgtaagttgt     8280

gcgcgtgcac atttcgcccg ttcccgctca tcttgcagca ggcggaaatt ttcatcacgc     8340

tgtaggacgc aaaaaaaaaa taattaatcg tacaagaatc ttggaaaaaa aattgaaaaa     8400

ttttgtataa aagggatgac ctaacttgac tcaatggctt ttacacccag tattttccct     8460

ttccttgttt gttacaatta tagaagcaag acaaaaacat atagacaacc tattcctagg     8520

agttatattt ttttacccta ccagcaatat aagtaaaaaa ctgtttaaac agtatgggtg     8580

aaatcttctc taacatccca gtcatcaagt acgaaggtcc agactctaag aacccattgg     8640

ctttcaagta ctacgatcca gaaagagtca tcttgggtaa aaagatgaag gaacacttgc     8700

cattcgctat ggcttggtgg cacaacttgt gtgctaacgg tgttgacatg ttcggtagag     8760

gtactatcga taagttgttc ggtgctgctg aagctggtac tatggaacac gctaaggcta     8820

aggttgacgc tggtatcgag ttcatgcaaa agttgggtat cgaatactac tgtttccacg     8880

acgttgattt ggtcccagaa gctgacgata tcaacgaaac caacagaaga ttggacgaat     8940

tgactgatta cttgaaggaa aagaccgctg gtactaacat caagtgtttg tggggtactg     9000

ctaacatgtt ctctaaccca agattcatga acggtgctgg ttccactaac gacgttgatg     9060

tctactgttt cgctgctgct caagttaaga aggctatcga aatgaccgtc aagttgggtg     9120

gtagaggtta cgttttctgg ggtggtagag aaggttacga aaccttgttg aacactaagg     9180

tccaaatgga attggaaaac atcgctaact tgatgaagat ggctagagac tacggtagat     9240

ctatcggttt caagggtact ttcttgatcg aaccaaagcc aaaggaacca atgaagcacc     9300

aatacgacta cgatgctgct actgctatcg gtttcttgag acaatacggt ttggaccaag     9360

atttcaagat gaacatcgaa gctaaccacg ctaccttggc tggtcacact ttccaacacg     9420

aattgagaat ctctagaatc aacggtatgt tgggttccat cgacgctaac caaggtgaca     9480

tcatgttggg ttgggacacc gattgtttcc catctaacgt ttacgacacc actttggcta     9540

tgtacgaaat cgttagaaac ggtggtttgc cagtcggtat caacttcgac tctaagaaca     9600

gaagaccatc caacacttac gaagacatgt tccacgcttt catcttgggt atggactctt     9660

tcgctttcgg tttgatcaag gctgctcaaa tcatcgaaga cggtagaatc gaaggtttca     9720

ccgaaaagaa gtacgaatcc ttcaacactg aattgggtca aaagatcaga aagggtgaag     9780

ctactttgga agaattggct gctcacgctg ctgacttgaa ggctccaaag gttccagtct     9840

ctggtagaca agaatacttg gaaggtgttt tgaacaacat catcttgtcc tgaggccctg     9900

c                                                                     9901


<210>  14
<211>  16404
<212>  DNA
<213>  Artificial sequence

<220>
<223>  constructed plasmid

<400>  14
gatccacgat cgcattgcgg attacgtatt ctaatgttca gtaccgttcg tataatgtat       60

gctatacgaa gttatgcaga ttgtactgag agtgcaccat accacagctt ttcaattcaa      120

ttcatcattt tttttttatt cttttttttg atttcggttt ctttgaaatt tttttgattc      180

ggtaatctcc gaacagaagg aagaacgaag gaaggagcac agacttagat tggtatatat      240

acgcatatgt agtgttgaag aaacatgaaa ttgcccagta ttcttaaccc aactgcacag      300

aacaaaaacc tgcaggaaac gaagataaat catgtcgaaa gctacatata aggaacgtgc      360

tgctactcat cctagtcctg ttgctgccaa gctatttaat atcatgcacg aaaagcaaac      420

aaacttgtgt gcttcattgg atgttcgtac caccaaggaa ttactggagt tagttgaagc      480

attaggtccc aaaatttgtt tactaaaaac acatgtggat atcttgactg atttttccat      540

ggagggcaca gttaagccgc taaaggcatt atccgccaag tacaattttt tactcttcga      600

agacagaaaa tttgctgaca ttggtaatac agtcaaattg cagtactctg cgggtgtata      660

cagaatagca gaatgggcag acattacgaa tgcacacggt gtggtgggcc caggtattgt      720

tagcggtttg aagcaggcgg cagaagaagt aacaaaggaa cctagaggcc ttttgatgtt      780

agcagaattg tcatgcaagg gctccctatc tactggagaa tatactaagg gtactgttga      840

cattgcgaag agcgacaaag attttgttat cggctttatt gctcaaagag acatgggtgg      900

aagagatgaa ggttacgatt ggttgattat gacacccggt gtgggtttag atgacaaggg      960

agacgcattg ggtcaacagt atagaaccgt ggatgatgtg gtctctacag gatctgacat     1020

tattattgtt ggaagaggac tatttgcaaa gggaagggat gctaaggtag agggtgaacg     1080

ttacagaaaa gcaggctggg aagcatattt gagaagatgc ggccagcaaa actaaaaaac     1140

tgtattataa gtaaatgcat gtatactaaa ctcacaaatt agagcttcaa tttaattata     1200

tcagttatta ccctatgcgg tgtgaaatac cgcacagatg cgtaaggaga aaataccgca     1260

tcaggaaatt gtaaacgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag     1320

ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac     1380

cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa agaacgtgga     1440

ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc     1500

accctaatca agataacttc gtataatgta tgctatacga acggtacccg ccaactctgt     1560

tcgagaatga tgtaatcaag aaggtctcac aaaaccatcc aggcagtacc acttcccaag     1620

tattgcttag atgggcaact cagagaggca ttgccgtcat tccaaaatct tccaagaagg     1680

aaaggttact tggcaaccta gaaatcgaaa aaaagttcac tttaacggag caagaattga     1740

aggatatttc tgcactaaat gccaacatca gatttaatga tccatggacc tggttggatg     1800

gtaaattccc cacttttgcc tgatccagcc agtaaaatcc atactcaacg acgatatgaa     1860

caaatttccc tcattccgat gctgtatatg tgtataaatt tttacatgct cttctgttta     1920

gacacagaac agctttaaat aaaatgttgg atatactttt tctgcctgtg gtgtcatcca     1980

cgcttttaat tcatctcttg tatggttgac aatttggcta ttttttaaca gaacccaacg     2040

gtaattgaaa ttaaaaggga aacgagtggg ggcgatgagt gagtgatacg gcgcctgatg     2100

cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatggtg cactctcagt     2160

acaatctgct ctgatgccgc atagttaagc cagccccgac acccgccaac acccgctgac     2220

gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc     2280

gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag acgaaagggc     2340

ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca     2400

ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat     2460

tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa     2520

aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt     2580

tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag     2640

ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt     2700

tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg     2760

gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag     2820

aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta     2880

agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg     2940

acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta     3000

actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac     3060

accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt     3120

actctagctt cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca     3180

cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag     3240

cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta     3300

gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag     3360

ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt     3420

tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat     3480

aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta     3540

gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa     3600

acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt     3660

tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag     3720

ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta     3780

atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca     3840

agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag     3900

cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa     3960

agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga     4020

acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc     4080

gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc     4140

ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt     4200

gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt     4260

gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag     4320

gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa     4380

tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat     4440

gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg     4500

ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattag     4560

gcgcctactt ctagggggcc tatcaagtaa attactcctg gtacactgaa gtatataagg     4620

gatatagaag caaatagttg tcagtgcaat ccttcaagac gattgggaaa atactgtaat     4680

ataaatcgta aaggaaaatt ggaaattttt taaagatgtc ttcactggtt actcttaata     4740

acggtctgaa aatgccccta gtcggcttag ggtgctggaa aattgacaaa aaagtctgtg     4800

cgaatcaaat ttatgaagct atcaaattag gctaccgttt attcgatggt gcttgcgact     4860

acggcaacga aaaggaagtt ggtgaaggta tcaggaaagc catctccgaa ggtcttgttt     4920

ctagaaagga tatatttgtt gtttcaaagt tatggaacaa ttttcaccat cctgatcatg     4980

taaaattagc tttaaagaag accttaagcg atatgggact tgattattta gacctgtatt     5040

atattcactt cccaatcgcc ttcaaatatg ttccatttga agagaaatac cctccaggat     5100

tctatacggg cgcagaagga ttctatacgg gcgcagaact agtgatctcg aggttccaga     5160

gctcggatcc accacaggtg ttgtcctctg aggacataaa atacacaccg agattcatca     5220

actcattgct ggagttagca tatctacaat tgggtgaaat ggggagcgat ttgcaggcat     5280

ttgctcggca tgccggtaga ggtgtggtca ataagagcga cctcatgcta tacctgagaa     5340

agcaacctga cctacaggaa agagttactc aagaataaga attttcgttt taaaacctaa     5400

gagtcacttt aaaatttgta tacacttatt ttttttataa cttatttaat aataaaaatc     5460

ataaatcata agaaattcgc ttactcatcc cgggttagat gagagtcttt tccagttcgc     5520

ttaaggggac aatcttggaa ttatagcgat cccaattttc attatccaca tcggatatgc     5580

tttccattac atgccatgga aaattgtcat tcagaaattt atcaaaagga actgcaattt     5640

tattagagtc atataacaat gaccacatgg ccttataaca accaccaagg gcacatgagt     5700

ttggtgtttc tagcctaaaa ttaccctttg tagcaccaat gacttgagca aacttcttca     5760

caatagcatc gtttttagaa gccccaccta caaaaaaagt cctttctggc cttttattta     5820

ggtagtcccg cagcggagat tcatcgtaat caaacttcac gattgtatct tcgttcagtc     5880

tctgttgtga gcttgcgttt gaatccgaaa gcaggggaga tattcttacc ctgcaactta     5940

aagcctgtga ttctacaata tttttggcat cgtgcctctt gtctttgaac ttggccacct     6000

ctctttcaat catacccgtt tttggattga agataaccct tttgtttatg gcttttacgc     6060

taggaacgat ctcccccaga ggaaaatata cacctaattc attttcacta ctttctgagt     6120

catctagcac agcttgatta aaaagagtcc aatcgttagt cttctcataa ttattttccc     6180

gttctttgtt taactcgtct cttatcctct cccttgccaa agaaccatta caataacaaa     6240

tcatacccat ataatggttt ggcagagttg gatgaatgaa aagatgatag ttcggagagg     6300

ggtgatactt atcggtgacc agaagaactg tagtacttgt tcctagggaa acgagaacgt     6360

cattcttccg caggggtaaa gaacatatag tggctaaatt atccccagtc atgggagaga     6420

ccttgcagtt tgtattgaaa ccgtacttct caataaaata tttacagatg gtacccgcta     6480

tcaaattttt catgggtgct ctcattaatt tttgtctgat agttttatcc ttagaagaac     6540

tatcaattag atgtagtagc tcatcactga attttctttc acgtatatca taaaggttca     6600

taccacaggc atctgcctcc tctaattcaa caagatggcc cactaagata gaagtcaaaa     6660

aattagacac taaagaaatg gtctttgttt tttcgtaagc ttctggttct aattgtgcaa     6720

ttttcagaat ttgaggacca gtaaatctaa aatgggctct ggaccctgtt aattgagcca     6780

ttttttcagg cccacctatg cactcttcaa actcttgaca ttgctttgca gtactgtggt     6840

cttgccaatt gggggcggtt tgccttgcaa atgctacaga gctcacgtag tgcaataaat     6900

ctttttccgg tttcttattc aattgctcta acagagattc ggcttgggag gaccagtaga     6960

cagacccgtg ctgctggcag gaccctgaga cggccataac tttgttcaat ggaaatttag     7020

cctcgcgata tttcgagaga accagatcta gagcctctaa ccacatggct acgggacatt     7080

cgatagtgtc gccgtgtata tagacaccct tctttgtgtg ataatgcgga agatcctttt     7140

caaattccac tgtttctgaa tggacaattt ttaggtcctg gttaatggcg agacatttca     7200

gttgttgggt cgaaagatca aacccaagat agtatgagtc taaagacatt gtgttggaaa     7260

cctctcttgt ctgtctctga attactgaac acaacatact agtcgtacgg ttttattttt     7320

tacttatatt gctggtaggg taaaaaaata taactcctag gaataggttg tctatatgtt     7380

tttgtcttgc ttctataatt gtaacaaaca aggaaaggga aaatactggg tgtaaaagcc     7440

attgagtcaa gttaggtcat cccttttata caaaattttt caattttttt tccaagattc     7500

ttgtacgatt aattattttt tttttgcgtc ctacagcgtg atgaaaattt ccgcctgctg     7560

caagatgagc gggaacgggc gaaatgtgca cgcgcacaac ttacgaaacg cggatgagtc     7620

actgacagcc accgcagagg ttctgactcc tactgagctc tattggaggt ggcagaaccg     7680

gtaccggagg agaccgctat aaccggtttg aatttattgt cacagtgtca catcagcggc     7740

aactcagaag tttgacagca agcaagttca tcattcgaac tagccttatt gttttagttc     7800

agtgacagcg aactgccgta ctcgatgctt tatttctcac ggtagagcgg aagaacagat     7860

aggggcagcg tgagaagagt tagaaagtaa atttttatca cgtctgaagt attcttattc     7920

ataggaaatt ttgcaaggtt ttttagctca ataacgggct aagttatata aggtgttcac     7980

gcgattttct tgttatgtat acctcttctg gcgcgcctct ttttattaac cttaattttt     8040

attttagatt cctgacttca actcaagacg cacagatatt ataacatctg cataataggc     8100

atttgcaaga attactcgtg agtaaggaaa gagtgaggaa ctatcgcata cctgcattta     8160

aagatgccga tttgggcgcg aatcctttat tttggcttca ccctcatact attatcaggg     8220

ccagaaaaag gaagtgtttc cctccttctt gaattgatgt taccctcata aagcacgtgg     8280

cctcttatcg agaaagaaat taccgtcgct cgtgatttgt ttgcaaaaag aacaaaactg     8340

aaaaaaccca gacacgctcg acttcctgtc ttcctattga ttgcagcttc caatttcgtc     8400

acacaacaag gtcctagcga cggctcacag gttttgtaac aagcaatcga aggttctgga     8460

atggcgggaa agggtttagt accacatgct atgatgccca ctgtgatctc cagagcaaag     8520

ttcgttcgat cgtactgtta ctctctctct ttcaaacaga attgtccgaa tcgtgtgaca     8580

acaacagcct gttctcacac actcttttct tctaaccaag ggggtggttt agtttagtag     8640

aacctcgtga aacttacatt tacatatata taaacttgca taaattggtc aatgcaagaa     8700

atacatattt ggtcttttct aattcgtagt ttttcaagtt cttagatgct ttctttttct     8760

cttttttaca gatcatcaag gaagtaatta tctacttttt acaacaaata taaaacacgt     8820

acgactagta tgactcaatt cactgacatt gataagttgg ccgtctccac cataagaatt     8880

ttggctgtgg acaccgtatc caaggccaac tcaggtcacc caggtgctcc attgggtatg     8940

gcaccagctg cacacgttct atggagtcaa atgcgcatga acccaaccaa cccagactgg     9000

atcaacagag atagatttgt cttgtctaac ggtcacgcgg tcgctttgtt gtattctatg     9060

ctacatttga ctggttacga tctgtctatt gaagacttga aacagttcag acagttgggt     9120

tccagaacac caggtcatcc tgaatttgag ttgccaggtg ttgaagttac taccggtcca     9180

ttaggtcaag gtatctccaa cgctgttggt atggccatgg ctcaagctaa cctggctgcc     9240

acttacaaca agccgggctt taccttgtct gacaactaca cctatgtttt cttgggtgac     9300

ggttgtttgc aagaaggtat ttcttcagaa gcttcctcct tggctggtca tttgaaattg     9360

ggtaacttga ttgccatcta cgatgacaac aagatcacta tcgatggtgc taccagtatc     9420

tcattcgatg aagatgttgc taagagatac gaagcctacg gttgggaagt tttgtacgta     9480

gaaaatggta acgaagatct agccggtatt gccaaggcta ttgctcaagc taagttatcc     9540

aaggacaaac caactttgat caaaatgacc acaaccattg gttacggttc cttgcatgcc     9600

ggctctcact ctgtgcacgg tgccccattg aaagcagatg atgttaaaca actaaagagc     9660

aaattcggtt tcaacccaga caagtccttt gttgttccac aagaagttta cgaccactac     9720

caaaagacaa ttttaaagcc aggtgtcgaa gccaacaaca agtggaacaa gttgttcagc     9780

gaataccaaa agaaattccc agaattaggt gctgaattgg ctagaagatt gagcggccaa     9840

ctacccgcaa attgggaatc taagttgcca acttacaccg ccaaggactc tgccgtggcc     9900

actagaaaat tatcagaaac tgttcttgag gatgtttaca atcaattgcc agagttgatt     9960

ggtggttctg ccgatttaac accttctaac ttgaccagat ggaaggaagc ccttgacttc    10020

caacctcctt cttccggttc aggtaactac tctggtagat acattaggta cggtattaga    10080

gaacacgcta tgggtgccat aatgaacggt atttcagctt tcggtgccaa ctacaaacca    10140

tacggtggta ctttcttgaa cttcgtttct tatgctgctg gtgccgttag attgtccgct    10200

ttgtctggcc acccagttat ttgggttgct acacatgact ctatcggtgt cggtgaagat    10260

ggtccaacac atcaacctat tgaaacttta gcacacttca gatccctacc aaacattcaa    10320

gtttggagac cagctgatgg taacgaagtt tctgccgcct acaagaactc tttagaatcc    10380

aagcatactc caagtatcat tgctttgtcc agacaaaact tgccacaatt ggaaggtagc    10440

tctattgaaa gcgcttctaa gggtggttac gtactacaag atgttgctaa cccagatatt    10500

attttagtgg ctactggttc cgaagtgtct ttgagtgttg aagctgctaa gactttggcc    10560

gcaaagaaca tcaaggctcg tgttgtttct ctaccagatt tcttcacttt tgacaaacaa    10620

cccctagaat acagactatc agtcttacca gacaacgttc caatcatgtc tgttgaagtt    10680

ttggctacca catgttgggg caaatacgct catcaatcct tcggtattga cagatttggt    10740

gcctccggta aggcaccaga agtcttcaag ttcttcggtt tcaccccaga aggtgttgct    10800

gaaagagctc aaaagaccat tgcattctat aagggtgaca agctaatttc tcctttgaaa    10860

aaagctttct aaattctgat cgtagatcat cagatttgat atgatattat ttgtgaaaaa    10920

atgaaataaa actttataca acttaaatac aacttttttt ataaacgatt aagcaaaaaa    10980

atagtttcaa acttttaaca atattccaaa cactcagtcc ttttccttct tatattatag    11040

gtgtacgtat tatagaaaaa tttcaatgat tactttttct ttctttttcc ttgtaccagc    11100

acatggccga gcttgaatgt taaacccttc gagagaatca caccattcaa gtataaagcc    11160

aataaagaat ataactccta aaaggctaat tgaaaccctg tgatttttgc ccgggtttaa    11220

ggcgcgccct ttatcattat caatactgcc atttcaaaga atacgtaaat aattaatagt    11280

agtgattttc ctaactttat ttagtcaaaa aattagcctt ttaattctgc tgtaacccgt    11340

acatgcccaa aatagggggc gggttacaca gaatatataa catcgtaggt gtctgggtga    11400

acagtttatt cctggcatcc actaaatata atggagcccg ctttttaagc tggcatccag    11460

aaaaaaaaag aatcccagca ccaaaatatt gttttcttca ccaaccatca gttcataggt    11520

ccattctctt agcgcaacta cagagaacag gggcacaaac aggcaaaaaa cgggcacaac    11580

ctcaatggag tgatgcaacc tgcctggagt aaatgatgac acaaggcaat tgacccacgc    11640

atgtatctat ctcattttct tacaccttct attaccttct gctctctctg atttggaaaa    11700

agctgaaaaa aaaggttgaa accagttccc tgaaattatt cccctacttg actaataagt    11760

atataaagac ggtaggtatt gattgtaatt ctgtaaatct atttcttaaa cttcttaaat    11820

tctactttta tagttagtct tttttttagt tttaaaacac caagaactta gtttcgaata    11880

aacacacata aacaaacacc actagcatgg ctgccggtgt cccaaaaatt gatgcgttag    11940

aatctttggg caatcctttg gaggatgcca agagagctgc agcatacaga gcagttgatg    12000

aaaatttaaa atttgatgat cacaaaatta ttggaattgg tagtggtagc acagtggttt    12060

atgttgccga aagaattgga caatatttgc atgaccctaa attttatgaa gtagcgtcta    12120

aattcatttg cattccaaca ggattccaat caagaaactt gattttggat aacaagttgc    12180

aattaggctc cattgaacag tatcctcgca ttgatatagc gtttgacggt gctgatgaag    12240

tggatgagaa tttacaatta attaaaggtg gtggtgcttg tctatttcaa gaaaaattgg    12300

ttagtactag tgctaaaacc ttcattgtcg ttgctgattc aagaaaaaag tcaccaaaac    12360

atttaggtaa gaactggagg caaggtgttc ccattgaaat tgtaccttcc tcatacgtga    12420

gggtcaagaa tgatctatta gaacaattgc atgctgaaaa agttgacatc agacaaggag    12480

gttctgctaa agcaggtcct gttgtaactg acaataataa cttcattatc gatgcggatt    12540

tcggtgaaat ttccgatcca agaaaattgc atagagaaat caaactgtta gtgggcgtgg    12600

tggaaacagg tttattcatc gacaacgctt caaaagccta cttcggtaat tctgacggta    12660

gtgttgaagt taccgaaaag tgagcggccg cgtgaattta ctttaaatct tgcatttaaa    12720

taaattttct ttttatagct ttatgactta gtttcaattt atatactatt ttaatgacat    12780

tttcgattca ttgattgaaa gctttgtgtt ttttcttgat gcgctattgc attgttcttg    12840

tctttttcgc cacatgtaat atctgtagta gatacctgat acattgtgga tgctgagtga    12900

aattttagtt aataatggag gcgctcttaa taattttggg gatattggct ttttttttta    12960

aagtttacaa atgaattttt tccgccagga taacgattct gaagttactc ttagcgttcc    13020

tatcggtaca gccatcaaat catgcctata aatcatgcct atatttgcgt gcagtcagta    13080

tcatctacat gaaaaaaact cccgcaattt cttatagaat acgttgaaaa ttaaatgtac    13140

gcgccaagat aagataacat atatctagat gcagtaatat acacagattc ccgcggacgt    13200

gggaaggaaa aaattagata acaaaatctg agtgatatgg aaattccgct gtatagctca    13260

tatctttccc tccaccgcgg tggtcgactt tcacatacgt tgcatacgtc gatatagata    13320

ataatgataa tgacagcagg attatcgtaa tacgtaatag ctgaaaatct caaaaatgtg    13380

tgggtcatta cgtaaataat gataggaatg ggattcttct atttttcctt tttccattct    13440

agcagccgtc gggaaaacgt ggcatcctct ctttcgggct caattggagt cacgctgccg    13500

tgagcatcct ctctttccat atctaacaac tgagcacgta accaatggaa aagcatgagc    13560

ttagcgttgc tccaaaaaag tattggatgg ttaataccat ttgtctgttc tcttctgact    13620

ttgactcctc aaaaaaaaaa atctacaatc aacagatcgc ttcaattacg ccctcacaaa    13680

aacttttttc cttcttcttc gcccacgtta aattttatcc ctcatgttgt ctaacggatt    13740

tctgcacttg atttattata aaaagacaaa gacataatac ttctctatca atttcagtta    13800

ttgttcttcc ttgcgttatt cttctgttct tctttttctt ttgtcatata taaccataac    13860

caagtaatac atattcaaac ttaagactcg agatggtcaa accaattata gctcccagta    13920

tccttgcttc tgacttcgcc aacttgggtt gcgaatgtca taaggtcatc aacgccggcg    13980

cagattggtt acatatcgat gtcatggacg gccattttgt tccaaacatt actctgggcc    14040

aaccaattgt tacctcccta cgtcgttctg tgccacgccc tggcgatgct agcaacacag    14100

aaaagaagcc cactgcgttc ttcgattgtc acatgatggt tgaaaatcct gaaaaatggg    14160

tcgacgattt tgctaaatgt ggtgctgacc aatttacgtt ccactacgag gccacacaag    14220

accctttgca tttagttaag ttgattaagt ctaagggcat caaagctgca tgcgccatca    14280

aacctggtac ttctgttgac gttttatttg aactagctcc tcatttggat atggctcttg    14340

ttatgactgt ggaacctggg tttggaggcc aaaaattcat ggaagacatg atgccaaaag    14400

tggaaacttt gagagccaag ttcccccatt tgaatatcca agtcgatggt ggtttgggca    14460

aggagaccat cccgaaagcc gccaaagccg gtgccaacgt tattgtcgct ggtaccagtg    14520

ttttcactgc agctgacccg cacgatgtta tctccttcat gaaagaagaa gtctcgaagg    14580

aattgcgttc tagagatttg ctagattaga cgtctgttta aagattacgg atatttaact    14640

tacttagaat aatgccattt ttttgagtta taataatcct acgttagtgt gagcgggatt    14700

taaactgtga ggaccttaat acattcagac acttctgcgg tatcacccta cttattccct    14760

tcgagattat atctaggaac ccatcaggtt ggtggaagat tacccgttct aagacttttc    14820

agcttcctct attgatgtta cacctggaca ccccttttct ggcatccagt ttttaatctt    14880

cagtggcatg tgagattctc cgaaattaat taaagcaatc acacaattct ctcggatacc    14940

acctcggttg aaactgacag gtggtttgtt acgcatgcta atgcaaagga gcctatatac    15000

ctttggctcg gctgctgtaa cagggaatat aaagggcagc ataatttagg agtttagtga    15060

acttgcaaca tttactattt tcccttctta cgtaaatatt tttcttttta attctaaatc    15120

aatctttttc aattttttgt ttgtattctt ttcttgctta aatctataac tacaaaaaac    15180

acatacataa actaaaacgt acgactagta tgtctgaacc agctcaaaag aaacaaaagg    15240

ttgctaacaa ctctctagaa caattgaaag cctccggcac tgtcgttgtt gccgacactg    15300

gtgatttcgg ctctattgcc aagtttcaac ctcaagactc cacaactaac ccatcattga    15360

tcttggctgc tgccaagcaa ccaacttacg ccaagttgat cgatgttgcc gtggaatacg    15420

gtaagaagca tggtaagacc accgaagaac aagtcgaaaa tgctgtggac agattgttag    15480

tcgaattcgg taaggagatc ttaaagattg ttccaggcag agtctccacc gaagttgatg    15540

ctagattgtc ttttgacact caagctacca ttgaaaaggc tagacatatc attaaattgt    15600

ttgaacaaga aggtgtctcc aaggaaagag tccttattaa aattgcttcc acttgggaag    15660

gtattcaagc tgccaaagaa ttggaagaaa aggacggtat ccactgtaat ttgactctat    15720

tattctcctt cgttcaagca gttgcctgtg ccgaggccca agttactttg atttccccat    15780

ttgttggtag aattctagac tggtacaaat ccagcactgg taaagattac aagggtgaag    15840

ccgacccagg tgttatttcc gtcaagaaaa tctacaacta ctacaagaag tacggttaca    15900

agactattgt tatgggtgct tctttcagaa gcactgacga aatcaaaaac ttggctggtg    15960

ttgactatct aacaatttct ccagctttat tggacaagtt gatgaacagt actgaacctt    16020

tcccaagagt tttggaccct gtctccgcta agaaggaagc cggcgacaag atttcttaca    16080

tcagcgacga atctaaattc agattcgact tgaatgaaga cgctatggcc actgaaaaat    16140

tgtccgaagg tatcagaaaa ttctctgccg atattgttac tctattcgac ttgattgaaa    16200

agaaagttac cgcttaagga agtatctcgg aaatattaat ttaggccatg tccttatgca    16260

cgtttctttt gatacttacg ggtacatgta cacaagtata tctatatata taaattaatg    16320

aaaatcccct atttatatat atgactttaa cgagacagaa cagtttttta ttttttatcc    16380

tatttgatga atgatacagt ttcg                                           16404


<210>  15
<211>  95
<212>  DNA
<213>  Artificial sequence

<220>
<223>   as a URA3 deletion scar in the genome -After removal of the 
       KanMX marker using the cre recombinase, a 95 bp sequence 
       consisting of a loxP site flanked by the primer binding sites 
       remained

<400>  15
gcattgcgga ttacgtattc taatgttcag ataacttcgt atagcataca ttatacgaag       60

ttatccagtg atgatacaac gagttagcca aggtg                                  95


<210>  16
<211>  100
<212>  DNA
<213>  Saccharomyces cerevisiae

<400>  16
gtccataaag cttttcaatt catctttttt ttttttgttc ttttttttga ttccggtttc       60

tttgaaattt ttttgattcg gtaatctccg agcagaagga                            100


<210>  17
<211>  100
<212>  DNA
<213>  Saccharomyces cerevisiae

<400>  17
aaaactgtat tataagtaaa tgcatgtata ctaaactcac aaattagagc ttcaatttaa       60

ttatatcagt tattacccgg gaatctcggt cgtaatgatt                            100


<210>  18
<211>  100
<212>  DNA
<213>  saccharomyces cerevisiae

<400>  18
attggcatta tcacataatg aattatacat tatataaagt aatgtgattt cttcgaagaa       60

tatactaaaa aatgagcagg caagataaac gaaggcaaag                            100


<210>  19
<211>  100
<212>  DNA
<213>  Saccharomyces cerevisiae

<400>  19
tagtgacacc gattatttaa agctgcagca tacgatatat atacatgtgt atatatgtat       60

acctatgaat gtcagtaagt atgtatacga acagtatgat                            100


<210>  20
<211>  6728
<212>  DNA
<213>  Artificial sequence

<220>
<223>  constructed vector

<400>  20
acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa       60

aagtgccacc tgggtccttt tcatcacgtg ctataaaaat aattataatt taaatttttt      120

aatataaata tataaattaa aaatagaaag taaaaaaaga aattaaagaa aaaatagttt      180

ttgttttccg aagatgtaaa agactctagg gggatcgcca acaaatacta ccttttatct      240

tgctcttcct gctctcaggt attaatgccg aattgtttca tcttgtctgt gtagaagacc      300

acacacgaaa atcctgtgat tttacatttt acttatcgtt aatcgaatgt atatctattt      360

aatctgcttt tcttgtctaa taaatatata tgtaaagtac gctttttgtt gaaatttttt      420

aaacctttgt ttattttttt ttcttcattc cgtaactctt ctaccttctt tatttacttt      480

ctaaaatcca aatacaaaac ataaaaataa ataaacacag agtaaattcc caaattattc      540

catcattaaa agatacgagg cgcgtgtaag ttacaggcaa gcgatccgtc ctaagaaacc      600

attattatca tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtctcgcg      660

cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct      720

tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gcgtgttggc      780

gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat      840

aaattcccgt tttaagagct tggtgagcgc taggagtcac tgccaggtat cgtttgaaca      900

cggcattagt cagggaagtc ataacacagt cctttcccgc aattttcttt ttctattact      960

cttggcctcc tctagtacac tctatatttt tttatgcctc ggtaatgatt ttcatttttt     1020

tttttcccct agcggatgac tctttttttt tcttagcgat tggcattatc acataatgaa     1080

ttatacatta tataaagtaa tgtgatttct tcgaagaata tactaaaaaa tgagcaggca     1140

agataaacga aggcaaagat gacagagcag aaagccctag taaagcgtat tacaaatgaa     1200

accaagattc agattgcgat ctctttaaag ggtggtcccc tagcgataga gcactcgatc     1260

ttcccagaaa aagaggcaga agcagtagca gaacaggcca cacaatcgca agtgattaac     1320

gtccacacag gtatagggtt tctggaccat atgatacatg ctctggccaa gcattccggc     1380

tggtcgctaa tcgttgagtg cattggtgac ttacacatag acgaccatca caccactgaa     1440

gactgcggga ttgctctcgg tcaagctttt aaagaggccc tactggcgcg tggagtaaaa     1500

aggtttggat caggatttgc gcctttggat gaggcacttt ccagagcggt ggtagatctt     1560

tcgaacaggc cgtacgcagt tgtcgaactt ggtttgcaaa gggagaaagt aggagatctc     1620

tcttgcgaga tgatcccgca ttttcttgaa agctttgcag aggctagcag aattaccctc     1680

cacgttgatt gtctgcgagg caagaatgat catcaccgta gtgagagtgc gttcaaggct     1740

cttgcggttg ccataagaga agccacctcg cccaatggta ccaacgatgt tccctccacc     1800

aaaggtgttc ttatgtagtg acaccgatta tttaaagctg cagcatacga tatatataca     1860

tgtgtatata tgtataccta tgaatgtcag taagtatgta tacgaacagt atgatactga     1920

agatgacaag gtaatgcatc attctatacg tgtcattctg aacgaggcgc gctttccttt     1980

tttctttttg ctttttcttt ttttttctct tgaactcgac ggatctatgc ggtgtgaaat     2040

accgcacaga tgcgtaagga gaaaataccg catcaggaaa ttgtaaacgt taatattttg     2100

ttaaaattcg cgttaaattt ttgttaaatc agctcatttt ttaaccaata ggccgaaatc     2160

ggcaaaatcc cttataaatc aaaagaatag accgagatag ggttgagtgt tgttccagtt     2220

tggaacaaga gtccactatt aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc     2280

tatcagggcg atggcccact acgtgaacca tcaccctaat caagtttttt ggggtcgagg     2340

tgccgtaaag cactaaatcg gaaccctaaa gggagccccc gatttagagc ttgacgggga     2400

aagccggcga acgtggcgag aaaggaaggg aagaaagcga aaggagcggg cgctagggcg     2460

ctggcaagtg tagcggtcac gctgcgcgta accaccacac ccgccgcgct taatgcgccg     2520

ctacagggcg cgtcgcgcca ttcgccattc aggctgcgca actgttggga agggcgatcg     2580

gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc aaggcgatta     2640

agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc cagtgagcgc     2700

gcgtaatacg actcactata gggcgaattg ggtaccgggc cccccctcga ggtcgacggt     2760

atcgataagc ttgattagaa gccgccgagc gggcgacagc cctccgacgg aagactctcc     2820

tccgtgcgtc ctcgtcttca ccggtcgcgt tcctgaaacg cagatgtgcc tcgcgccgca     2880

ctgctccgaa caataaagat tctacaatac tagcttttat ggttatgaag aggaaaaatt     2940

ggcagtaacc tggccccaca aaccttcaaa ttaacgaatc aaattaacaa ccataggatg     3000

ataatgcgat tagtttttta gccttatttc tggggtaatt aatcagcgaa gcgatgattt     3060

ttgatctatt aacagatata taaatggaaa agctgcataa ccactttaac taatactttc     3120

aacattttca gtttgtatta cttcttattc aaatgtcata aaagtatcaa caaaaaattg     3180

ttaatatacc tctatacttt aacgtcaagg agaaaaatgt ccaatttact gcccgtacac     3240

caaaatttgc ctgcattacc ggtcgatgca acgagtgatg aggttcgcaa gaacctgatg     3300

gacatgttca gggatcgcca ggcgttttct gagcatacct ggaaaatgct tctgtccgtt     3360

tgccggtcgt gggcggcatg gtgcaagttg aataaccgga aatggtttcc cgcagaacct     3420

gaagatgttc gcgattatct tctatatctt caggcgcgcg gtctggcagt aaaaactatc     3480

cagcaacatt tgggccagct aaacatgctt catcgtcggt ccgggctgcc acgaccaagt     3540

gacagcaatg ctgtttcact ggttatgcgg cggatccgaa aagaaaacgt tgatgccggt     3600

gaacgtgcaa aacaggctct agcgttcgaa cgcactgatt tcgaccaggt tcgttcactc     3660

atggaaaata gcgatcgctg ccaggatata cgtaatctgg catttctggg gattgcttat     3720

aacaccctgt tacgtatagc cgaaattgcc aggatcaggg ttaaagatat ctcacgtact     3780

gacggtggga gaatgttaat ccatattggc agaacgaaaa cgctggttag caccgcaggt     3840

gtagagaagg cacttagcct gggggtaact aaactggtcg agcgatggat ttccgtctct     3900

ggtgtagctg atgatccgaa taactacctg ttttgccggg tcagaaaaaa tggtgttgcc     3960

gcgccatctg ccaccagcca gctatcaact cgcgccctgg aagggatttt tgaagcaact     4020

catcgattga tttacggcgc taaggatgac tctggtcaga gatacctggc ctggtctgga     4080

cacagtgccc gtgtcggagc cgcgcgagat atggcccgcg ctggagtttc aataccggag     4140

atcatgcaag ctggtggctg gaccaatgta aatattgtca tgaactatat ccgtaacctg     4200

gatagtgaaa caggggcaat ggtgcgcctg ctggaagatg gcgattagga gtaagcgaat     4260

ttcttatgat ttatgatttt tattattaaa taagttataa aaaaaataag tgtatacaaa     4320

ttttaaagtg actcttaggt tttaaaacga aaattcttat tcttgagtaa ctctttcctg     4380

taggtcaggt tgctttctca ggtatagcat gaggtcgctc ttattgacca cacctctacc     4440

ggcatgccga gcaaatgcct gcaaatcgct ccccatttca cccaattgta gatatgctaa     4500

ctccagcaat gagttgatga atctcggtgt gtattttatg tcctcagagg acaacacctg     4560

tggtgttcta gagcggccgc caccgcggtg gagctccagc ttttgttccc tttagtgagg     4620

gttaattgcg cgcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc     4680

gctcacaatt ccacacaaca taggagccgg aagcataaag tgtaaagcct ggggtgccta     4740

atgagtgagg taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa     4800

cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat     4860

tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg     4920

agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc     4980

aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt     5040

gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag     5100

tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc     5160

cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc     5220

ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt     5280

cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt     5340

atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc     5400

agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa     5460

gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa     5520

gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg     5580

tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga     5640

agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg     5700

gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg     5760

aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt     5820

aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact     5880

ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat     5940

gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg     6000

aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg     6060

ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat     6120

tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc     6180

ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt     6240

cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc     6300

agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga     6360

gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc     6420

gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa     6480

acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta     6540

acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg     6600

agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg     6660

aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat     6720

gagcggat                                                              6728


