                         SEQUENCE LISTING

<110>  Ginkgo BioWorks, Inc.
 
<120>  Methods and Systems for Chemoautotrophic Production of Organic 
       Compounds

<130>  134395-010201/PCT

<140>  PCT/US12/62540
<141>  2012-10-30


<150>  US 13/285,919
<151>  2011-10-31

<160>  66    

<170>  PatentIn version 3.5

<210>  1
<211>  1161
<212>  DNA
<213>  Burkholderia stabilis

<400>  1
atggccactg ttctatgcgt gctatatcct gatccggttg atggttatcc accgcactat       60

gttcgtgata ccatcccggt gatcacccga tatgcagatg gccaaacagc ccccactccc      120

gcggggccgc caggatttcg tcccggtgaa ctggtgggca gtgtttctgg tgcgctggga      180

cttcgcggtt accttgaagc ccatggtcac actctcatcg tcacatcgga taaagatggt      240

ccggatagtg agtttgaaag acggctgcct gatgccgatg ttgtcatcag ccagccgttt      300

tggcccgcat atcttacggc tgaacgtatc gcgagggcgc cgaagttacg tctggctctg      360

actgctggta taggctcaga ccacgttgac ctcgatgccg cggcgcgtgc tcacattacg      420

gtcgccgaag tgactggaag taacagtatt tcagtggctg aacacgttgt tatgacaacg      480

ctggccttag tgcggaacta tttacctagc cacgcaattg cgcagcaagg tggttggaac      540

atcgccgact gtgtttcacg ctcttatgac gtcgaaggaa tgcatttcgg cacagtaggg      600

gcgggtagga ttggattggc tgttctgcgc cggcttaaac cgtttggtct gcatttgcat      660

tacacccaaa gacatcgctt ggatgcagcc atcgaacaag aactcggtct tacttaccat      720

gccgatccag ccagtcttgc ggcggcagta gacattgtta atttgcagat tccgctgtat      780

ccttccactg aacacctttt tgatgctgca atgattgcac gcatgaaaag aggtgcgtac      840

ctgattaata ctgcccgtgc gaagttagtg gaccgcgatg ccgtcgtcag ggctgtcaca      900

agcggacatc tggctggtta tggcggggac gtctggtttc cccagcctgc tccggctgat      960

catccgtggc gggcgatgcc ttttaatggc atgacacctc atattagcgg tacttcactt     1020

tctgctcagg cgcggtacgc agcggggacc cttgaaatcc tccagtgttg gtttgatggc     1080

agaccgatca ggaacgagta cctgatagtg gatggaggaa cattggccgg tacaggtgcc     1140

caatcatatc ggctgaagta a                                               1161


<210>  2
<211>  1095
<212>  DNA
<213>  Candida methylica

<400>  2
atgaaaattg tactggtgct ctatgatgca ggaaaacacg ccgcagacga ggaaaagctg       60

tatggctgca ctgagaacaa gctaggaatc gccaattggc tgaaggatca gggccatgaa      120

ttaatcacta cctccgataa agaaggtgaa acctcagagt tagataagca cattcccgat      180

gccgatataa tcattacgac gccttttcac ccagcttaca ttacaaaaga gcgtctggat      240

aaagcgaaaa acctcaaatc ggttgtagtc gccggcgtcg gttccgacca cattgacctg      300

gattatatta atcagactgg taagaagatc agcgtcctgg aagtcaccgg ctctaatgtg      360

gtatctgttg ctgagcatgt tgtaatgact atgctggttt tagtgcgcaa ttttgtgccc      420

gcacacgagc agatcataaa ccatgactgg gaagtagcag caatagctaa agatgcgtat      480

gatattgaag gcaaaactat cgctacgatc ggcgcgggcc ggatcggtta ccgggttctg      540

gagcggctgc tgccgttcaa tcctaaagag ctcctatact atgattatca ggcactgccc      600

aaggaagcag aggaaaaagt tggtgcgcgg agagtggaaa acattgaaga acttgtggct      660

caggccgaca ttgtaacggt aaatgctcca cttcacgcag gcaccaaagg ccttatcaat      720

aaagagttgc tttcaaagtt taagaaaggt gcctggttgg taaatacggc ccgtggagca      780

atttgcgttg cggaggatgt cgccgccgct ctggaatcgg gacagctccg gggatacggt      840

ggggatgttt ggtttcccca gccggcgcca aaggatcacc cgtggcgtga tatgcgaaac      900

aaatatggcg cagggaacgc catgacaccg cattactccg ggacgacctt agatgcacaa      960

actcgatacg ctgaaggtac caagaacatc ctggaaagtt tctttacggg caagtttgat     1020

tatcgccctc aggatattat tctgcttaat ggagaatatg taacaaaagc ttacggcaaa     1080

catgacaaaa agtaa                                                      1095


<210>  3
<211>  1095
<212>  DNA
<213>  Candida boidinii

<400>  3
atgaaaatcg tgttggtact gtatgacgct gggaaacatg ctgctgatga ggagaaattg       60

tatggctgta ccgagaataa actgggcatt gcgaactggt tgaaagatca agggcatgag      120

cttattacca cgagcgataa ggagggcgaa accagcgagc tggacaaaca tattcctgac      180

gcggatatta ttatcactac accctttcac cctgcgtaca ttactaaaga gcgtcttgat      240

aaggctaaga atctaaaact cgttgtcgtc gctggcgttg gatctgatca tatcgatttg      300

gattacataa accagacagg aaagaagatc agcgtgctgg aagttacggg ctcgaacgtt      360

gttagcgtcg cagaacacgt ggttatgacc atgctagtcc tggtccgtaa cttcgtgccg      420

gcgcatgaac agatcattaa ccatgattgg gaagttgcgg ctattgcaaa ggatgcttat      480

gatatcgaag gtaaaaccat cgccaccatt ggcgctgggc gtattggcta tcgcgtcttg      540

gagcgcctgc tgccatttaa cccgaaagaa ctgttgtatt acgactatca agccttacca      600

aaagaagcgg aagaaaaagt gggtgcacgt cgtgtagaaa atattgaaga attggtagcg      660

caggcagata tagttaccgt taatgctccc ctccacgccg gaacgaaagg tcttattaac      720

aaagaattac tgtctaagtt taaaaaaggg gcctggcttg tgaacacagc ccgaggcgct      780

atatgtgttg cagaagatgt tgcagctgcg ctggagagtg gtcaactgcg tgggtacgga      840

ggtgatgtgt ggtttcctca gccggcccca aaggatcacc cctggcgaga tatgcgcaat      900

aaatatgggg ccggaaatgc aatgacgcca cattatagtg gtacaaccct ggacgctcag      960

accagatatg cagaaggtac taagaatata cttgagtcgt tttttaccgg aaagtttgac     1020

tacagaccgc aagatatcat tttattgaat ggggagtatg tcaccaaagc atatggaaag     1080

catgataaaa agtaa                                                      1095


<210>  4
<211>  1131
<212>  DNA
<213>  Saccharomyces cerevisiae

<400>  4
atgtcgaaag gcaaagtgct tctcgtcctg tatgaaggtg ggaagcatgc agaagaacag       60

gagaaattac tgggctgtat cgaaaatgaa ttaggaatac gaaattttat cgaagaacaa      120

ggttatgaac tcgttactac gatcgataaa gatccggaac ctaccagtac tgtcgatcgc      180

gaattaaaag atgcggaaat cgttatcacc acacctttct ttcctgccta catatctagg      240

aaccgtattg ccgaagcccc gaacctcaaa ctatgcgtga ccgccggagt tgggtctgat      300

cacgtggatc tggaggcagc caatgaacgt aaaataacag taaccgaggt tactgggagt      360

aacgtggtca gcgtagctga gcacgttatg gcgacaatcc tggtacttat ccgtaactac      420

aacgggggtc atcagcaagc gatcaatggt gaatgggata tcgctggcgt agcaaagaac      480

gaatatgatt tggaggataa gattattagt accgtgggag ccgggcggat cgggtatcgt      540

gtactggaac gtcttgtagc tttcaatccg aaaaagcttc tgtattacga ctatcaagaa      600

ttgccggccg aagccatcaa tcggcttaat gaagcctcta agctgttcaa cggccgcggg      660

gacatcgttc agcgcgttga gaagctggag gacatggtgg cgcagtcaga tgtcgttaca      720

atcaattgtc cgctacataa agactccaga ggcttgttta acaaaaaact tatatcccat      780

atgaaagatg gagcctatct tgtaaatact gcacgcggcg ctatttgcgt agcagaggac      840

gttgccgagg ctgtaaaatc gggcaagctg gctggctatg gaggcgacgt gtgggacaaa      900

caacctgcgc ccaaggacca tccttggcgt acaatggata acaaggacca cgtaggaaat      960

gcgatgacgg ttcatatcag cggcacgagt ctggatgcac agaagcgtta tgcgcagggg     1020

gtcaagaata tccttaattc ctatttttca aagaaatttg actatagacc ccaggatatc     1080

atagtgcaaa atggttcata cgccactaga gcttacggac aaaaaaagta a              1131


<210>  5
<211>  731
<212>  PRT
<213>  Clostridium pasteurianum

<400>  5

Met Tyr Lys Ile Lys Met His Cys Thr Gly Leu Leu Phe Cys Leu Ile 
1               5                   10                  15      


Gln Arg Ser Val Asn Met Glu Lys Lys Val Leu Thr Val Cys Pro Tyr 
            20                  25                  30          


Cys Gly Ser Gly Cys Asn Leu Tyr Leu Val Val Glu Gly Gly Lys Val 
        35                  40                  45              


Val Arg Ala Glu Pro Ala Lys Gly Arg Asn Asn Glu Gly Lys Leu Cys 
    50                  55                  60                  


Leu Lys Gly Tyr Tyr Gly Trp Asp Phe Leu Asn Asp Pro Lys Leu Leu 
65                  70                  75                  80  


Thr Ser Arg Leu Lys Lys Pro Met Ile Arg Lys Asn Gly Val Leu Glu 
                85                  90                  95      


Glu Val Ser Trp Asp Glu Ala Ile Lys Phe Thr Ala Glu Asn Leu Met 
            100                 105                 110         


Lys Ile Lys Ala Gln Tyr Gly Pro Asp Ala Ile Met Gly Thr Gly Ser 
        115                 120                 125             


Ala Arg Gly Pro Gly Asn Glu Pro Asn Tyr Ile Met Gln Lys Phe Met 
    130                 135                 140                 


Arg Ala Ala Ile Gly Thr Asn Asn Ile Asp His Cys Ala Arg Val Cys 
145                 150                 155                 160 


His Gly Pro Ser Val Ala Gly Leu Asp Tyr Ser Leu Gly Gly Ala Ala 
                165                 170                 175     


Met Ser Asn Ser Ile Pro Glu Ile Glu Asp Thr Asp Val Val Phe Val 
            180                 185                 190         


Phe Gly Tyr Asn Pro Ser Glu Thr His Pro Ile Val Ala Arg Arg Ile 
        195                 200                 205             


Val Lys Ala Arg Glu Lys Gly Ala Lys Ile Ile Val Ala Asp Pro Arg 
    210                 215                 220                 


Lys Ile Glu Thr Val Lys Ile Ser Asp Leu Trp Leu Gln Leu Lys Gly 
225                 230                 235                 240 


Gly Thr Asn Met Ala Leu Val Asn Ala Leu Gly Asn Val Leu Ile Asn 
                245                 250                 255     


Glu Glu Leu Tyr Asp Glu Lys Phe Val Glu Asn Cys Thr Glu Gly Phe 
            260                 265                 270         


Glu Glu Tyr Lys Glu Ala Val Lys Lys Tyr Thr Pro Glu Tyr Ala Glu 
        275                 280                 285             


Lys Ile Thr Gly Val Ser Ala Glu Tyr Ile Arg Lys Ala Met Arg Ile 
    290                 295                 300                 


Tyr Ala Lys Ala Lys Lys Ala Thr Ile Leu Tyr Gly Met Gly Val Cys 
305                 310                 315                 320 


Gln Phe Ser Gln Ala Val Asp Val Val Lys Gly Leu Ala Ser Leu Ala 
                325                 330                 335     


Leu Leu Thr Gly Asn Leu Gly Arg Pro Asn Val Gly Ile Gly Pro Val 
            340                 345                 350         


Arg Gly Gln Asn Asn Val Gln Gly Thr Cys Asp Met Gly Val Leu Pro 
        355                 360                 365             


Asn Arg Phe Pro Gly Tyr Gln Ser Val Thr Asp Glu Lys Ala Arg Glu 
    370                 375                 380                 


Lys Phe Glu Lys Ala Trp Gly Val Lys Leu Ser Asp Arg Val Gly Tyr 
385                 390                 395                 400 


Phe Leu Thr Glu Val Pro Lys His Val Leu Lys Glu Asp Lys Ile Lys 
                405                 410                 415     


Ala Tyr Tyr Ile Phe Gly Glu Asp Pro Ala Gln Ser Asp Pro Asn Ala 
            420                 425                 430         


Ala Glu Val Arg Glu Ala Leu Asp Lys Ile Asp Phe Val Ile Val Gln 
        435                 440                 445             


Asp Ile Phe Met Asn Lys Thr Ala Leu His Ala Asp Val Val Leu Pro 
    450                 455                 460                 


Ala Thr Ser Trp Gly Glu His Asp Gly Val Tyr Ser Ala Ala Asp Arg 
465                 470                 475                 480 


Ser Phe Gln Arg Ile Arg Lys Ala Val Glu Pro Met Gly Glu Ala Lys 
                485                 490                 495     


Asp Asp Trp Glu Ile Ile Cys Glu Ile Ser Thr Ala Met Gly Tyr Pro 
            500                 505                 510         


Met His Tyr Asn Asn Thr Glu Glu Ile Trp Asn Glu Met Arg Ser Leu 
        515                 520                 525             


Cys Pro Lys Phe Ala Gly Ala Ser Tyr Glu Lys Met Glu Lys Gln Gly 
    530                 535                 540                 


Ala Val Pro Trp Pro Cys Thr Ser Glu Glu Asp Pro Gly Thr Asp Tyr 
545                 550                 555                 560 


Leu Tyr Asp Asp Gly Lys Phe Met Thr Glu Asn Gly Arg Gly Lys Leu 
                565                 570                 575     


Phe Ala Cys Glu Trp Arg His Pro Phe Glu Leu Thr Asp Glu Lys Tyr 
            580                 585                 590         


Pro Leu Val Leu Ser Thr Val Arg Glu Ile Gly His Tyr Ser Val Arg 
        595                 600                 605             


Thr Met Thr Gly Asn Cys Arg Thr Leu Gln Lys Leu Ala Asp Glu Pro 
    610                 615                 620                 


Gly Tyr Ile Glu Ile Ser Val Glu Asp Ala Lys Glu Leu Asn Ile Lys 
625                 630                 635                 640 


Asp Gln Glu Leu Val Thr Val Ser Ser Arg Arg Gly Lys Ile Ile Thr 
                645                 650                 655     


Arg Ala Ala Val Ala Glu Arg Val Lys Lys Gly Ala Thr Tyr Met Thr 
            660                 665                 670         


Tyr Gln Trp Trp Val Gly Ala Cys Asn Glu Leu Thr Ile Asp Ser Leu 
        675                 680                 685             


Asp Pro Ile Ser Lys Thr Pro Glu Phe Lys Tyr Cys Ala Val Lys Val 
    690                 695                 700                 


Glu Arg Ile Lys Asp Gln Gln Lys Ala Glu Gln Glu Ile Glu Glu Arg 
705                 710                 715                 720 


Tyr Ser Ser Leu Lys Lys Gln Met Lys Ala Glu 
                725                 730     


<210>  6
<211>  211
<212>  PRT
<213>  Clostridium pasteurianum

<400>  6

Met Asp Arg Phe Lys Thr Ala Val Ile Leu Ala Gly Gly Lys Ser Ser 
1               5                   10                  15      


Arg Met Gly Phe Asp Lys Gln Phe Leu Lys Ile Gly Glu Lys Arg Leu 
            20                  25                  30          


Met Asp Ile Leu Ile Asn Glu Ile Lys Glu Glu Phe Gln Asp Ile Ile 
        35                  40                  45              


Ile Val Thr Asn Lys Pro Lys Glu Tyr Lys Ser Leu Tyr Lys Ser Cys 
    50                  55                  60                  


Arg Ile Val Ser Asp Glu Ile Glu Ser Gln Gly Pro Leu Ser Gly Ile 
65                  70                  75                  80  


His Ile Gly Leu Lys Glu Ser Lys Ser Lys Tyr Ala Tyr Phe Ile Ala 
                85                  90                  95      


Cys Asp Met Pro Lys Val Asn Ile Pro Tyr Ile Arg Tyr Met Lys Glu 
            100                 105                 110         


Glu Leu Ile Lys Thr Asp Ala Asp Ala Cys Val Thr Glu Ala Gly Cys 
        115                 120                 125             


Arg Met Gln Pro Phe Asn Ala Phe Tyr Ser Lys Glu Val Phe Tyr Lys 
    130                 135                 140                 


Ile Glu Asp Leu Leu Arg Glu Gly Lys Arg Ser Met Phe Ser Phe Ile 
145                 150                 155                 160 


Asn Ile Ile Asn Thr His Phe Ile Asp Glu Asp Thr Ala Lys Lys Tyr 
                165                 170                 175     


Asn Lys Asp Phe Asn Met Phe Phe Asn Leu Asn Thr Pro Glu Asp Leu 
            180                 185                 190         


Lys Asp Phe Gln Val Lys Leu Tyr Asn Pro Lys Asn Met Asp Lys Asn 
        195                 200                 205             


Ile Glu Lys 
    210     


<210>  7
<211>  191
<212>  PRT
<213>  Clostridium pasteurianum

<400>  7

Met Arg Asn Phe Ile Lys Leu Phe Leu Tyr Arg Leu Ser Gly Lys Val 
1               5                   10                  15      


Gly Lys Ala Met Ser Arg Glu Val Asn Ser Phe Val Ile Gly Asp Ala 
            20                  25                  30          


Ser Lys Cys Val Gly Cys Arg Ala Cys Glu Val Ala Cys Phe Lys Ala 
        35                  40                  45              


His Ser Asn Arg Glu Glu Ser Ser Lys Pro Ile Phe Val Lys Gly Lys 
    50                  55                  60                  


Arg Arg Asp Ile Ile Thr Arg Ile His Val Val Lys Asn Glu Lys Phe 
65                  70                  75                  80  


Ser Val Pro Val Gln Cys Arg Gln Cys Glu Asp Ala Pro Cys Ala Asn 
                85                  90                  95      


Ala Cys Pro Val Gly Ala Ile Lys Glu Lys Glu His Val Leu Val Val 
            100                 105                 110         


Glu Glu Glu Leu Cys Ile Gly Cys Lys Ala Cys Val Met Ala Cys Pro 
        115                 120                 125             


Phe Gly Ala Ile Glu Val Lys Arg Lys Ser Glu Glu Val Arg Lys Val 
    130                 135                 140                 


Ala Tyr Lys Cys Asp Leu Cys Arg Asn Arg Asp Thr Lys Ala Cys Val 
145                 150                 155                 160 


Glu Ile Cys Ser Lys Lys Ala Leu Lys Leu Phe Asp Pro Val Lys Glu 
                165                 170                 175     


Arg Lys Gln Arg Asn Ile Asp Thr Val Asn Asn Leu Ile Asp Asp 
            180                 185                 190     


<210>  8
<211>  206
<212>  PRT
<213>  Clostridium pasteurianum

<400>  8

Met Thr Asn Leu Cys His Phe His Arg Gln Arg Glu Glu Arg Ile Ile 
1               5                   10                  15      


Met Asn Ser Phe Val Ile Ala Asn Pro Lys Lys Cys Ile Gly Cys Lys 
            20                  25                  30          


Thr Cys Glu Ala Gly Cys Ala Met Ala His Ser Glu Lys Asn Ile Leu 
        35                  40                  45              


Asn Arg Lys Ser Asp Glu Leu Lys Phe Asn Pro Arg Leu Lys Val Ile 
    50                  55                  60                  


Lys Thr Trp Asp Val Thr Ala Pro Val Met Cys Arg His Cys Glu Asn 
65                  70                  75                  80  


Ser Pro Cys Ala Ser Val Cys Pro Asn Gly Ser Ile Thr Asn Lys Glu 
                85                  90                  95      


Gly Val Val Leu Ile Asn Gln Asp Thr Cys Ile Gly Cys Lys Ser Cys 
            100                 105                 110         


Met Val Ala Cys Pro Phe Gly Ala Ile Asn Leu Ile Val Gln Gln Asp 
        115                 120                 125             


Gly Glu Gly Lys Ala Ile Thr Gln Ser Gly Leu Lys Lys Thr Asp Gly 
    130                 135                 140                 


Lys Glu Ile Ile His Lys Glu Lys Ile Val Ala Asn Lys Cys Asp Leu 
145                 150                 155                 160 


Cys Ile Glu Arg Asp Lys Gly Pro Ala Cys Val Glu Val Cys Pro Thr 
                165                 170                 175     


Glu Ala Leu Arg Leu Val Ser Gly Glu Asp Ile Glu Glu Ser Ile Lys 
            180                 185                 190         


Glu Lys Arg Glu Ala Ala Ala Leu Gly Leu Ser Arg Ile Gly 
        195                 200                 205     


<210>  9
<211>  1293
<212>  DNA
<213>  Aquifex aeolicus

<400>  9
atggccaaac atgtggttgt tattggcggc ggcgttggag gtattgcgac ggcctataac       60

ctgcgcaact tgatgccgga tttaaaaata acgttgatca gcgatcgccc atatttcggc      120

tttacgccag cattcccgca tctggcgatg ggctggcgca aatttgaaga tatcagcgtt      180

cccctggcgc ctttgttacc gaaattcaac atagagttta ttaacgaaaa ggctgaaagc      240

atcgatccag atgcgaacac ggttaccacg cagagcggaa aaaaaatcga gtatgattat      300

ctggttattg ccacaggccc gaaactggtg ttcggagcag aaggccagga ggagaactcg      360

acgagcattt gtaccgccga acatgcgcta gaaactcaga aaaaactgca agaattatat      420

gcgaatccgg gccctgtagt tattggtgcc ataccgggcg tgagttgttt cggccctgcc      480

tatgagttcg ccttgatgtt acattatgaa ctgaagaaac gtgggattcg ctataaagtg      540

ccgatgacgt tcatcacgag cgaaccgtat ttaggccatt ttggcgtggg tggtattggt      600

gcctctaaac gtctcgttga ggatttattc gccgaacgca acattgactg gatcgcgaac      660

gttgcggtaa aagccattga accagataaa gtgatttatg aagatctcaa cggcaacacg      720

catgaagtac cggctaaatt tacgatgttc atgccgagtt tccaaggccc agaggttgtg      780

gccagcgcag gcgataaggt cgcgaacccg gcgaacaaaa tggtgattgt gaaccgctgc      840

ttccagaacc cgacttataa aaacattttc ggcgttggtg tggttaccgc cattccgcca      900

attgaaaaaa ccccaattcc gacgggagtt cccaaaaccg gtatgatgat cgagcaaatg      960

gccatggccg ttgcccataa cattgttaac gatattcgca acaacccgga taaatatgcc     1020

cctcgtttaa gcgctatttg tattgccgat ttcggcgaag atgccggctt tttcttcgcg     1080

gatccggtta ttccacctcg cgaacgtgtt attacgaaaa tgggaaaatg ggcgcattat     1140

ttcaaaacgg catttgaaaa atatttcctg tggaaagtac gcaacggcaa catagcgccg     1200

agctttgaag aaaaagttct ggaaattttc ctgaaagtgc atccgattga attatgtaaa     1260

gattgcgaag gcgcgccggg ctcccgctgc taa                                  1293


<210>  10
<211>  1302
<212>  DNA
<213>  Nostoc sp.

<400>  10
atggcccaca ttgtgatcgt aggcgccgga ttaggcggcc tgcctactgc gtatgaactg       60

agacatatcc ttcctaaaca acatcaggtg actgtaatta gtgaaactcc atactttacg      120

tttattccaa gtttaccatg ggttgccatg ggcctgacct ctttggagag tattcaagtg      180

agcctccagc agagattgaa gcagaagggg attaactgga tattgggacg agttgattac      240

ttaaacccac agaatcagaa gatatcacta ggtgagcaga gcattagcta cgattacctg      300

attattgcaa cgggcgctga actcgccctg gatgcagttg cgggcctggg gcctgatggt      360

tatacccaga gtgtttgtaa cccccatcat gccatcaagg cttttcaagc gtggcagaat      420

tttcttctgg ccccgggacc gctggttgtt ggagccctgc cgaaaacaag ctgcctgggg      480

ccagcatacg agtttacatt gctggcggac tacgttctta ggaaacaagg tctgcgggag      540

caggttagta ttacctttgt caccccggaa ccatacgccg gtcacttagg cataggcgga      600

atggcgaact cggcagagct ggtcacgaaa ttcatggccg aacgaggagt tgaggtgatt      660

gaaaatgttg ccgtgacggc cattgaggcc aaccaaattc atctcggtaa cgggcgggtt      720

ctgccgtttg cgtacagtat gttgcttccg cctttcagag gaccccgttt tgtaagacag      780

gttccgggtc tgagcaacca agatggcttt attccggttt taccaacgta ccggcatcca      840

gaatatgcaa gtatttatgc tgtcggtgtg gttgttgaaa ttaaaccgag tgaggttacg      900

ccacttcctt taggtgttcc taaaaccggt cagatgacgg aagccatggg gatggccgtt      960

gcacataaca ttgcaattga attaggtgtt ttttcggcgc ccccagtcac cccgacgcta     1020

gatgcaattt gttttgcgga ctttggcaac agtggtattc tgtttcttgc gaatcctgtc     1080

ctgccggatc tggcaacggg taaacgcaga cgagcggttg ctttaagcgg cgcgtgggtt     1140

acctgggcga aagcagcctt tgaaaggtat tttttggcga aaatgcgttt cgggaccgcg     1200

gttccatggt ttgaaaaatt ggcgttaaaa ctgttaggtc tctcgctggt ggctccactc     1260

gccgttaaga gcagccggaa catctcccag gagaactatt aa                        1302


<210>  11
<211>  1458
<212>  DNA
<213>  Chlorobium tepidum

<400>  11
atggccaaag tcgtcgtcct cggggctggc gtcagcggac acacctgcgc aagtttcctg       60

aaaaagaagt tggggaaaca gcacgaagtc gtggtcatta gcccaaactc gtactatcag      120

tggatcccga gcaatatatg ggttggcgtg ggccacatga ccattgatga cgtgcgcttt      180

aaactgaaaa aagtctacga tcgctggggc attgactata aacaggccaa ggcggtcagc      240

atccacccag aaggcgatgc gaacatttcg aaagggtatg ttaccattga atataccgat      300

gaagaacatg cgggatatac cgaaacggtt gactatgatt atttggttaa cgcgaccggt      360

cctaaattaa actttgaagc taccgaagga ttgggacccg ataagaacag cttatcagtt      420

tgcacctatt cgcatgccgc tcacgcctgg gaagaactcc aaaagtcgat tgagaaaatg      480

aaaaatggtc agaaacagcg gtttctgatc ggcaccggcc acgccatggc tacctgtcag      540

ggcgcagctt ttgaatacat tttaaacgtt gctcatgaga ttagtcggcg cggcttatcg      600

catatggccg aattaacctg gatctccaac gaatatgaat taggtgactt tggtatgggc      660

ggcgccttta ttaaacgcgg cggttatatc acgccgacca aagttttcac cgagagctta      720

ctggctgaat atggcattaa atggattcgc cgtgccggtg tttataaagt ggaaccgggc      780

gtggcgcatt atgagacgct ggatggcgag atgttgagcc aggaatttga tttcgccatg      840

ttgatcccga gctttagtgg cgtcggctta accgcgtttg ataagtcggg caacgatatt      900

accgataaaa tgttcttacc gaacaaattt atgaaagttg acgccgatta taccgcgaaa      960

ccgtttggcg aatggggcgc taacgattgg ccgaccattt atcagacgcc gatgtattcg     1020

aatatttatg cggccggcat tgcgtttgcc ccgccgcaca gcattagcaa accaatgacg     1080

tcggtgaatg gccgccagat ctttccgacg ccgccgcgca ccggcatgcc gagcggcgtc     1140

attggcaaaa ttatcgccct gaatattagt gaacagatta aaggcaacca taaagaacat     1200

caccataagg cgagcatggc gcgcatgggc gcagcgtgca tcgtgagcgc gggctttggt     1260

agcttcgatg ggctgggcgc cagcatgacc gtgtttccaa ttgtgccaga ctgggaaaaa     1320

tacccggaat ggggccgcga tatgacctat agcgttggcg aggtgggatt ggcgggtcat     1380

tggttaaaat ttatgttaca ttatctgttt tttcataaag ccaagggcta cccgttttgg     1440

tatttaatcc cggaataa                                                   1458


<210>  12
<211>  1305
<212>  DNA
<213>  Acidithiobacillus ferrooxidans

<400>  12
atggcccatg tggtaatctt gggtgccggc acaggcggaa tgccggccgc gtacgaaatg       60

aaagaagctc tgggttctgg gcatgaggtg acgctgatta gcgcgaatga ttattttcag      120

tttgtcccgt cgaacccgtg ggtgggggtg ggctggaaag agcgcgatga tattgctttt      180

cccattcgtc actatgtgga acggaaggga atacatttca ttgcccagtc ggcggaacag      240

attgatgcgg aagcccagaa tattaccctc gcggacggca acacggtaca ttacgactac      300

ctgatgattg ccacaggtcc gaaactggct tttgagaatg taccgggttc ggatccacat      360

gaaggcccgg tgcagtcgat ctgtacggtg gaccatgccg aacgtgcgtt cgcggaatac      420

caggctttgt tgcgcgagcc aggcccaatc gttattggtg cgatggcggg cgcatcctgc      480

tttggaccgg cttacgaata tgcgatgatt gttgcttcgg acttaaagaa gcgtggcatg      540

cgcgacaaaa tcccgtcgtt taccttcatt acctccgagc catacattgg tcatctgggc      600

atccagggcg tgggcgattc gaaaggcatc ctgacgaaag gcttaaaaga agaaggtatt      660

gaagcctaca cgaactgtaa agttaccaaa gttgaagaca acaaaatgta tgtaacccag      720

gtggacgaga aaggtgaaac cattaaagag atggtcctgc cggttaaatt tgggatgatg      780

attccggctt ttaaaggcgt gcccgccgtg gccggtgttg aaggattgtg caatccaggt      840

ggctttgtgc tggtggatga gcaccagcgc agcaaaaagt acgcaaatat tttcgccgcc      900

ggtattgcga ttgcgatccc gccggtagag acgaccccgg tgccgaccgg cgccccaaaa      960

accggttata tgattgaatc gatggtgagt gccgccgtgc acaacattaa agccgatctg     1020

gaaggccgca aaggcgagca gaccatgggc acctggaatg ccgtgtgttt cgcggatatg     1080

ggtgatcgcg gcgccgcatt cattgcgttg ccacagttga aaccacggaa ggtggacgtt     1140

ttcgcgtacg ggcgctgggt gcatctggcc aaagtggcgt tcgaaaagta cttcattcgc     1200

aagatgaaaa tgggtgtttc ggagccattt tatgagaaag tgctgtttaa aatgatgggc     1260

atcacccggc tgaaagaaga agatacccat cggaaagcgt cgtaa                     1305


<210>  13
<211>  1458
<212>  DNA
<213>  Allochromatium vinosum

<400>  13
atggcccgca ttctgatatt aggagcgggt attagtggcc acacgaccgc gcggtatctg       60

ggcaaatggg ttggcaaaca gcaccagatt accgttgtga gcccaaatag taagtggaac      120

tggatcccta gtaacatttg ggttggcgta ggtgagatga ccgaacgcca agtgaccttt      180

gaactggcac cagtgtacaa aaaaattaac gtgggttttc gtcaggcacg cgcggtgagc      240

attcacccag acggcggtgc gggacatgaa tcgccatttg tgaccattga atacacggat      300

ccgacacgtg cgggtcagtc ggatgaaatt gagtacgatt atctggtgaa tgcgaccggc      360

ccaaaattaa actttgatgc gacgccgggt ctggggccgg aaacgggcta caccatgagc      420

gtgtgcaccc cgagtcacgc cctggaagcg aacgaacagt tacagaaatg cgtgcaggaa      480

atgaaagcgg gtgcccgcaa aacctttgtg attggcaccg ggcacggcat gtgcacctgc      540

cagggcgccg cgtttgaata catttacaac gtggaccatg tgctgcgggg agcgggcgtt      600

cgccacctgg cccgcgttgt gtggatttcg aacgaatacg agttaggcga ttttggcatg      660

ggaggcgttc atattacccg aggcggctat ctgaccaacg gcaaagtgtt tgcggaaagc      720

ctgatggtgg aacgcggcct ggaatggatt acccgcgccg ctgtgaccaa agtggaaccg      780

ggcaaaattc actacgaaca gttagatggc tccgtgcatg aactggaatt tgactttagt      840

atgttaattc cgccatttag cggcgttgga ttaaaggcgt atgataagag tgggtcggat      900

attaccgaac aattatttgc cccaaacggc tttatgaaag tggatgcgga ttacaatcca      960

aaaccatttg aggagtggtc gaaagcggat tggccgaaaa cctatcagac cccgaaatac     1020

aaaaacattt ttgcgattgg cattgcgttt gcgccgccgc acccgatttc gaaagttatg     1080

aaatccccgt cgggtctgca aatttcgccg actccgccgc gcaccggcat gccaagcgcc     1140

accatcggga aagctgttgc ggaaaacatt cgcgacctgt taaatggcgc caccacgttg     1200

agccataccg cgagcatggg cgaaatgggc gccgcttgcg ttgcgagcac cggcatggac     1260

ttatttaaag gcacggccgc caccatgacc gtatttccgg ttgttccgga ttacgaaacc     1320

tatccagaat acggacgcga tatggacctg acctttggtg aaattggcct ggcgggacat     1380

tggatgaagt acctcctgca ccacgtgttt atttaccagg cgaaactgcg cccgggctgg     1440

agcgttctgc cagactaa                                                   1458


<210>  14
<211>  1284
<212>  DNA
<213>  Rhodobacter capsulatus

<400>  14
atggcccata ttgtggtcct gggggccggg ctcggcggcg ccattatggc atatgagctc       60

cgcgagcagg tgcgcaaaga ggataaagtt accgttatta ccaaagatcc gatgtatcat      120

tttgtgccaa gcaacccatg ggtggcggtg ggctggcgcg atcgcaaaga aattaccgtg      180

gatttagcgc cgacgatggc gcgcaaaaac attgatttta ttccggtggc agcgaaacgc      240

ctgcatccgg cggagaaccg tgttgaactg gagaacggcc agagcgtttc gtacgatcag      300

attgttattg ccaccggccc ggagctggcc tttgatgaaa ttgaaggctt cggcccagaa      360

ggccacacgc aaagcatttg ccatattgat catgccgaag aagcgcggct ggccttcgat      420

cgcttctgcg agaacccagg cccgattttg attggtgcgg cgcagggcgc ctcgtgcttt      480

ggcccggctt acgagtttac ctttatttta gacaccgcgc tgcgcaaacg caaaattcgc      540

gataaagtgc cgatgacctt tgttaccagc gaaccatatg ttggtcatct gggtctggat      600

ggtgtgggcg ataccaaagg cctgttggag ggcaacctgc gcgataaaca cattaagtgg      660

atgaccagca cccgtattaa gcgcgttgag aaaggcaaaa tggtggttga agaagtgacc      720

gaagatggca cggttaaacc agaaaaggaa ctgccatttg gctatgcgat gatgctgcca      780

gcgtttcgcg gcattaaagc gctgatgggt attgaaggtc tggttaatcc gcgcggcttt      840

gttattgttg accagcacca gcagaacccg acctttaaaa acgtttttgc ggttggcgtt      900

tgcgtggcga ttccgccgat tggtccgacg ccggtgccat gcggcgtgcc gaaaaccggc      960

tttatgattg agtcgatggt taccgccacc gcccacaaca ttggccgtat tgtgcgcggt     1020

ttcgaagccg atgaagttgg ctcgtggaac gccgtttgtc tggccgactt tggcgaccag     1080

ggcattgcct tcgttgcgca gccgcagatt ccgccgcgca acgtgaactg gagctcgcag     1140

ggcaagtggg tgcattgggc caaagaaggt tttgaacgct attttatgca caaactgcgc     1200

cgcggtacca gtgaaacctt ttatgagaaa gccgcgatga aattcctggg cattgataaa     1260

ctgaaagccg ttaagaaagg gtaa                                            1284


<210>  15
<211>  1269
<212>  DNA
<213>  Thiobacillus denitrificans

<400>  15
atggcccata ttgtaatatt aggcgccggc gtgggcggca tgaccatggc gtacgagatg       60

cgcgaaagcg cacgtgccga agataaagtg accgtgatca gtaacaatag ctattttcag      120

tttacgccga gtaacccttg ggttggtgtt aactggcgca aacgggatga tgtgacgtta      180

gaagccgcgc cttatttaaa caaaaagaac attgatttta tcccggtggg cgccgcgcgc      240

gttcacccag atagaaacca gattgattta accgatggca gaaccgtgga ttacgatttt      300

ttagtgattg cgacgggtcc aaaattagcg tttgatgagg tgccgggctt aggcccagaa      360

gggtataccc agagcgtatg cacggtggat cacgcccagg ccgcgggccg cgcgtgggat      420

gacttcgtta aaaacccggg tccgattgtt gtgggcgcgg ttcagggtgc tagttgctat      480

ggcccggcgt atgaatatgc gatgattatg gataccgatc tgcgcaaacg caaaatccgg      540

gatcgtgttc cgatgaccta tgtgacggcc gaaccgtaca ttggccacct gggactgggc      600

ggcgtgggcg acagtaaagg catgttagag agcgtgttac gcgaacgcca tattaaatgg      660

atttgcaacg ccaaagtgac caaagtggaa gctggcaaaa tgtttgtggc cgaacataac      720

gataaaggcg aggttattaa agaacatgag ctgccgtttg gctatagcat gatgctgccg      780

gcgtttaaag gcattgacgc tgtttttggc attgaaggtc tgaccaatcc gcgcggcttt      840

attacgattg atccatatca gcgcaacgcg aaatatccga acgtgtatag tgtgggcgtt      900

tgcgtggcga ttcccccagt ggaagtgaca ccagttccga ccggcacgcc gaaaaccggc      960

tatatgattg agagcatggt tacggcgacc gcgcataaca ttcgcgccgt tttagatggc     1020

cgcgaacctg ccgaaaaagc gacctggaac gcgatttgct tagccgattt tggcgatacc     1080

ggcgccgcct ttgttgccct gccgcagatc ccgccgcgca atgtgaactg gtttaaagag     1140

ggcaaatggg ttcacctggc caaagtggcc tttgaaaaat attttattcg caaaatgaag     1200

aagggcagca ccgaaccgct gtatgaaaaa tatgttttag gcttaatggg cattaaaaag     1260

ttaaagtaa                                                             1269


<210>  16
<211>  1284
<212>  DNA
<213>  Magnetococcus sp.

<400>  16
atggcccata ttgtggtgtt aggtggtggt gtagggggat ggccggctgc ctatgaatta       60

cgtggcgcct taggtaaaga acataaggtc actgtggtgc acaatagcac ccatttttct      120

tttaccccct ctaatccttg ggtagccgtc ggttggcgca aggcagagga gattcagctt      180

ccgatggagg gttatttaag caaaaagggc atccatttta tttccgtcgc gtgcgaggaa      240

attaagcccg acgacaataa attagtgctg gcggatgggc agatcgtgga ttatgactat      300

ctggttatct gtaccggtcc tgaactggca tttgatgaag tcgagggatt aggtccgcat      360

ggtggttaca cgcagtccgt ttgctcaacg ccgcatgcgg aaaccgcatg cgagggatgg      420

gaggcgttct taaaagatcc aggcccgatc gtggtgggtg ccgtgcaggg cgcgtcatgt      480

tttggacccg cctatgaatt tgcatttatc atggatgcag acctgcgcaa gcgtcgtata      540

cgtgatcagg tgccgatgac ctatgtgacc tctgaacctt acattggcca tttagggctg      600

gcaggggtgg gtgactcgcg caccatgatg gaatctgagc tgcgtggcca ccacattaac      660

tggatctgta acgcgaaggt aacccgtgtc gaacctggta aaatgtttgt ggatgaacat      720

gatatgtcgg gtaatgtggt taagcagcac gaactgccgc acaaatactc tatgatgcta      780

ccagcattcc gtggggtgcc tgccgtggcc aaggtagggg ataagctgtg caatccacgt      840

ggttttgtga aggtggataa acatcagcgc aacaccgtgt ggccaaacat ttattcagcc      900

ggtgtgtgcg tggccattcc tcctgtcgaa gccacacctg tgccgaccgg taccccgaag      960

acgggttata tgatcgaatc catggtgacg gcgattgtgc acaatattga actggacctg     1020

caaggcaagc cgttaaccca cgagggcacc tggaacgcga tctgtttagc cgatatgggg     1080

gatacagggg tggcctttgt ggccatgcca cagattggcc cgcgtaacgt cgcatggatg     1140

cgcaaaggta agtgggtgca tttagccaaa gtgggctttg aaaaatattt tatgcgcaag     1200

atgaagaccg gttcttctga accaatgttt gaaaagttta tgctgcgcat ggtgggtatt     1260

acccgcctga agaaggattc gtaa                                            1284


<210>  17
<211>  171
<212>  DNA
<213>  Clostridium pasteurianum

<400>  17
atggcctata aaatagcgga tagctgcgtg tcctgcgggg cgtgcgcgag cgaatgcccg       60

gttaacgcga ttagtcaggg cgattcgatt ttcgttattg atgcagatac ctgcatagac      120

tgcggtaatt gcgcgaatgt ttgcccggtt ggcgccccgg tgcaggaata a               171


<210>  18
<211>  219
<212>  DNA
<213>  Hydrogenobacter thermophilus

<400>  18
atggccttgc gtaccatggt cgaccctgat acgtgtacct cttgcgaatt atgttatgat       60

cgcgtcccag aggtttataa gaaccgcggc gatggcattg cggaagtggt ttcccccggt      120

ccagatggtt ggatgatggt tcctcctgaa ttggaacaag aagtcaaaga ggtcaccgac      180

gaatgtccat ccgggtcgat aattacggag gaggtataa                             219


<210>  19
<211>  204
<212>  DNA
<213>  Hydrogenobacter thermophilus

<400>  19
atgcgcattt tgatcgacat tgatacgtgc acgacctgcc gtttatgcta tgatacactg       60

ccgactgttt ttgtggaccg cggcgatggg attccaatta cgttaccaat gaaaagcttc      120

ccggaccgta acctggttga ggcgattaag gaagtgatgg aaagctgccc aagcaattcg      180

attcagatgg aggaggtcgg gtaa                                             204


<210>  20
<211>  183
<212>  DNA
<213>  Methanosarcina barkeri

<400>  20
atgccagcga tcgtgaacgc ggatgaatgc tccggctgcg gcacttgcgt cgatgaatgt       60

cctaacgatg cgattacgct ggatgaggag aagggcatag cggttgtcga caacgacgaa      120

tgtgttgaat gtggtgcgtg tgaagaagcg tgtcccaatc aggcgattaa agttgaagaa      180

taa                                                                    183


<210>  21
<211>  243
<212>  DNA
<213>  Aquifex aeolicus

<400>  21
atggccaagt tgaaaacgat ggtggatcag gaaacgtgta ccgcctgcga gctctgttat       60

gaccgtgtgc cggaggtgta taaaaaccgc ggcgatggca ttgcagacgt ggtaaaatgt      120

gatattaagg atgaggaaga ccattgctgg atgattgtac ctgaaggcct cgaagatgaa      180

gtacgtgaag ttgaggagga gtgcccgagt ggttcgatca tagtggaaga actggaagaa      240

taa                                                                    243


<210>  22
<211>  80
<212>  PRT
<213>  Aquifex aeolicus

<400>  22

Met Ala Lys Leu Lys Thr Met Val Asp Gln Glu Thr Cys Thr Ala Cys 
1               5                   10                  15      


Glu Leu Cys Tyr Asp Arg Val Pro Glu Val Tyr Lys Asn Arg Gly Asp 
            20                  25                  30          


Gly Ile Ala Asp Val Val Lys Cys Asp Ile Lys Asp Glu Glu Asp His 
        35                  40                  45              


Cys Trp Met Ile Val Pro Glu Gly Leu Glu Asp Glu Val Arg Glu Val 
    50                  55                  60                  


Glu Glu Glu Cys Pro Ser Gly Ser Ile Ile Val Glu Glu Leu Glu Glu 
65                  70                  75                  80  


<210>  23
<211>  231
<212>  DNA
<213>  Aquifex aeolicus

<400>  23
atggggttaa aggtgcgtgt tgaccaggat acgtgcacgg cctgcgagct gtgctatgac       60

cgtataccgg aagtattcaa aaacgcaggc gatggcattg cagatgttgt aaaatgcgat      120

atagaagatg atgaaggctg ctggatgata gtgccggaag gcctggagga ggaagttcag      180

gaagtggcgg atgagtgccc gagtggcagc attatagttg aggaagaata a               231


<210>  24
<211>  76
<212>  PRT
<213>  Aquifex aeolicus

<400>  24

Met Gly Leu Lys Val Arg Val Asp Gln Asp Thr Cys Thr Ala Cys Glu 
1               5                   10                  15      


Leu Cys Tyr Asp Arg Ile Pro Glu Val Phe Lys Asn Ala Gly Asp Gly 
            20                  25                  30          


Ile Ala Asp Val Val Lys Cys Asp Ile Glu Asp Asp Glu Gly Cys Trp 
        35                  40                  45              


Met Ile Val Pro Glu Gly Leu Glu Glu Glu Val Gln Glu Val Ala Asp 
    50                  55                  60                  


Glu Cys Pro Ser Gly Ser Ile Ile Val Glu Glu Glu 
65                  70                  75      


<210>  25
<211>  3633
<212>  DNA
<213>  Unknown
<220>
<223>  gamma-proteobacterium

<400>  25
atgaacacag aaactcgcac caccagcggc ggacggttgc atgacaaggt ggttatcttg       60

accggcgcgg ccggaaacat tggcagctac attagccggt ccttgctgcg cgaaggggcg      120

aatttggtaa tgaccgggcg gaatgaaccg aaattacagg cgtttgtgga aggtttagtt      180

gaggagggtt ttgatcgtga caatatatta attgccatcg gtgactccgc gaaggcggat      240

atatgtcgcg aaatcgttaa ggcgactgtt aatcatttcg gcaacattga tgtcctggtt      300

aataacgccg gaggcgcggg gcctcgtcgc acgctgcggg acattccgtt ttctgagagt      360

gaacgcttag cccgcggcga cgatgagacc atgttagatg ccgccatgaa tctattagct      420

ggcgcgtgga acatgacccg ggccgccgtt ccacacatga gcgagggcgg cagtattgtt      480

aatgttagta ccatctttag tcgcacccat tactatggac gcataccgta cgtggtgccc      540

aaaagtggat tgaatgccct gagtataggc ttagcgaaag aactcggtga agaacatggg      600

atccgcgtta atacgctgtt tccgggcccg atcgagtcgg aacgcattga caccgttttt      660

ggcaatatgg atgccctgca aagcgcacca gcaggtgcca cttcgcagga gttccgtgac      720

ctgatgatca cccgtcgaga aaacccggat ggtgaatatg agtaccgcta cccaacgccc      780

aacgatgtgg cgtcaacggt gacctggctg gcatcggagg agagtgcggc cctaagtgga      840

catcatattg aagttaccaa tggcatgcag gtgccggccc aaagccgctc aaagttagta      900

agttggccgg acaaacgact ggaggacctg tccgggcagg tcgtattttt gttagctggc      960

agcgactacg aggacgcact ggcatttgct gagcgccaca tggtttccgg tgcgaaagtc     1020

gtcctcgcgt tccgctccct ggagtcgtta gggttagctc gctctttatg cgcgtcgcgc     1080

gatttagaga gcatccacct actgcacctg gagccactgc gacgtgagtc ggcagaccgg     1140

tgtttcgatt acattcgcga tcatttcggc cgacttgacg gcatcgtcgt gcttccacgc     1200

tcgggcaacg gagaacatgg ctattcgtta tccaccgcgg gtgatgacga tgtcgaggcg     1260

tttgtccgcg atgagatcat atcaccggta gcatttgctg ctgcattggc cattaacctc     1320

gatcgctggg gcattttaga ggaggcacca gcgctgacct atgtcaccaa tccgaccgat     1380

ggccacgggg actacttaaa cgaggtaaag cgtgcagcta ttgaggccct gattcgcatc     1440

tggcggcatg aggaccgcca gatgcgcaag aagggcgaac gcgaatgggc aatgctgcct     1500

aaccagctgg tccgctatga caacaacgag gaggacaatt taacttttac cgcagactgg     1560

gcggccacgc taaccaaccg tgttcgacgc atggatccaa taaacttatg ggtgcctgag     1620

agcattatgc gcgcaaccgg caaaagcggt atgccacaaa gtatccagcg cgtgttgcca     1680

ggcctgcata aaggccggac cgccgtgatc accggtggct ccctcggcat cggcctgcaa     1740

ttgggccggt tcctggctat tgccggtgcg cgggtattgt tatcggctcg ctccaaagag     1800

aaattagaag aagcacgcca cgagatcgtt gaggagttac gcggcgtcgg ctacccgaac     1860

gcgcaccagc gtgttcatat tttaccagac attgatgttg gcgatgagga ggccctcgaa     1920

cggttgtaca atcactctat agaattattc ggaaatgtgg acttcttaat taacaacgcg     1980

ggcatcagtg gcgctgagga gatggtcgtt gatatgtcgc tcgaagcatg gaatcgcacc     2040

atgtatgcga acttaatcag taactattct ttgatccgca aatatgcgcc caagatgaaa     2100

gcgaatggct acggcgttgt actcaatgtt agtagttatt tcggcggcga gaaatatgtt     2160

gccgtggctt acccgaatcg tgccgactat gcggtctcta aggcaggtca gcgagtactg     2220

gcagagatcc tctctcggca ccttggccca gagatccgaa tcaacgcatt agcaccaggg     2280

ccagtggatg gcgcccgcct gcgcggcctt ggcggcgcac cgggattatt tgaacgacgc     2340

ggtcgactgg ttctcgagaa caaacgatta aacagcgtgc ataaagcggt gttggcggcg     2400

ttgcgggagg gcgcaacccc cgaggttatc atggcgctgt cgagaaacgc cctgggcgac     2460

gcaaaaccga ccgccggaca gtccaaagca ttggacaaac tctttgccca ggtcgaagac     2520

tcccctgaag gcggtaatag taccgcattc ttattaaacc gagatttagc agagaaactg     2580

atgaaccggc tggttaccgg cgggttattt actcccgagt ccgctacaca attcatggaa     2640

ggctttgtcg atgcgccggc tatcttcttt gacgaaaagt cggtaaacaa ggcggcggca     2700

ggaattgagg ccggtatctt aaatcgactg cacttacata agatgccgac cgatgagcag     2760

atcggcctgt ctacggtgtt tcatctcgcg gacgatatcg cgagcggcga aacctttcat     2820

ccgagtggcg gcttaaaatt tgaccgctcg gtaaccgaag gggagttgct gctaccccca     2880

gaccgtgaca gcttagcgaa gttaaaaggc aaacgtgtcg tgttaattgg tgattcgatg     2940

cgcgaggagc tgtcggccat cggcaatggc ttcattaatc agggcgtcgc ttctctgacg     3000

gtcttaacac gcagcccaga agcctgtgag gaggtgcagc atagcctcca gaaaagcaat     3060

tcggttacat tggatgtccg atgcattgaa gataatatcg aggacgcttt agatgatctg     3120

ttgcaaaacc agggtggctt tgatgtagtg gttagcgccc cgttcagtcg actgccgtat     3180

aacccattag cggccgagcg tgagggtagt tggaatcgtg tgttgtctca tacggacttt     3240

gcccgcctga ttgatgaaca gttaacccac catttccgcg tggcaagacg cgcggccctg     3300

gtaccgaact gccagattgt cttgttaaca ccagatacat ctttcgtatc gtctcgtgag     3360

gagttcgcgc tcgccctgtt cgttaaaaac tcgctgcacg cgtttacggt aaccctgggg     3420

gtcgagaccg aacgcttacc gaccgtaccg gccgttaatc aggtgcagct tacccgtcgg     3480

gctcgcgctg aagaaccagc gaccgaaagc gagttgcagg aggagatgga gcgcctggtc     3540

tcggccgtac tgcaatgcgc cgttcctgca ccgtccccgt ctgaaagccg gtacctggcg     3600

cgcattttca gaggtaatgc ggtcaccgta tga                                  3633


<210>  26
<211>  3690
<212>  DNA
<213>  Roseiflexus castenholzii

<400>  26
atgtcgactg tgcgacgact ggaaggaaag gtggcgctga ttaccggcgg cgctggcaac       60

atcggcgagg ttattacgcg ccgattcctg gcggaaggcg cgaccgttgt catcaccggt      120

cgcaatgcgg aaaagcttgc ggtgtaccgc cgtcgtctga ttgatgagga gcgcgtcgct      180

ccagagcgcg ttgtcgcgct gcggatggac ggcagtgata tcgctcaggt gcgcgcagga      240

gtcgcgcaga ttgttcatgg cggcactgac gtcccaatac cgctgcaccg gattgatatt      300

ctggttaaca acgccggcag tgccggacca cgccggcgcc tggtcgatat cccgttagaa      360

ccaagcgaag tgcaaccgcc tgactcggaa accctggcgc aggcggttgg taatctggtc      420

ggaatcacct ggaatctgac tcgcgcagcg gcgccgcaca tgccgtcagg ctcgtcggtg      480

attaatatca gcaccatttt ctcacgcacg gattattatg gtcggatcgc gtacgtcgcg      540

cccaaagcgg cgttaaatgc gctgtcggac ggtttggcgc gcgaattggg ggtgcgcggc      600

atccgcgtta atacgattta tccaggtccg attgagtcgg aacgcatcta caccatgttc      660

caggcgatgg acgcgttaaa ggggcaacca gagggcgaca cagcctccgg cttcctgaga      720

atgatgcgct tgtcgcgcat tgatcagaat ggcgaagtgg ttaagcgctt tccctcaccc      780

gtcgatgttg ccaataccgc ggtgttttta gcctcggatg agagcgcagc gtttacgggt      840

catgcctttg aggtgactca tggcatggag gtgcctacgg agagtcgcac taccttcgtg      900

agccgcccag gtctgcgctc ggttgatgcc acgggcaaag tcatcctgat ttgcgctggc      960

gatcaggtcg atgatgcggt cgcgctggcc gacaccttac gcagttgccg cgcgaccgtt     1020

gtgattggtt ttcgggatcc gcgcgcgctt gaaaaagcgt ctgtgttact gcgcgaacct     1080

cgccatgcgc ttgctgccga tatgtacggc cgcccgacca tgaccgcgga agcgcgcctg     1140

gtgcgcttag atccattaga cccgcgtgct gcggcacaga ccttagagca gatccacgcc     1200

gaattaggcg ccatccatca tgctgttgtc ctgcccggtc agagtcgtca cgcgcccagt     1260

gcatcgctga ttgaagtgga cgatcaggtt gttgagcgct ttctgcatca ggagctagta     1320

ggcaccatcg cgctggcgcg cgaactggcg cgcttttggg aggaataccc cagtggctcc     1380

tctatgcacc gcgtgctgtt cgtgtcgaat ccagacgatc agcaggggaa tcagtactcc     1440

catattctgc gcgctgcggt tgagcaattg gtgcgcgttt ggcgccacga gtcggagtat     1500

gacagcgtta atcctgcgca ccagcaggaa gggcagagct cggccgctgt gtgggcgaac     1560

caattgattc gctacgtgaa caatgagatg gccaacttag atttcacgtg cgcatgggtg     1620

gcaaagttat taggttcgga ccgtcgcatc gccgaaatta atttatactt accagaagaa     1680

attgtgggca ccatcggcgt gcacaatccg ggttttgggt gggcggaaag tctgttcggg     1740

ttacacatgg gtaaagtggc gttaattacg ggcggcagtg cgggcattgg cgggcagatt     1800

ggtcggttac tggcgttaag tggcgcgcat gtgatgctgg cggcgcggaa cgccgatcag     1860

ttagagcaga tgcgcgcgag cattgtgcgg gaggtgcgtg atgccagtta ccccgatgcc     1920

gagagccgcg tggcgatttt tccaggctcg gatgttagtg acattgacgg tctcgaacgc     1980

ctggttaacc acaccgtgcg cgtgttcggc aaagtggatt atctgattaa caatgcgggc     2040

attgccggcg ccgaggagat ggtgattgat atgccggtgg acgcctggcg ccacactctg     2100

cgcgccaatc tgatttcgaa ttacgcgctg ctgcgccgct tagcgccgca aatgaaagcg     2160

gcgggcggtg cgtacgtgct gaatgtgagc agttattttg gcggcgaaaa atacgtggcg     2220

attccttatc cgaaccgcag tgattacgcg gttagtaaag cggggcagcg cgccatggtt     2280

gaaagtctgg cgcgctttct tgggcccgag atccagatta acgcaattgc gccagggccg     2340

gtggaagggg aacgtctgaa gggcgccggt agtcggcccg ggctgtttat gcgccgggcg     2400

cgtttaatcc tggaaaacaa gcgcttaaat gaggtttttg ctgcgctgct ggcagcgcgc     2460

catgagggcg cgacgattgc cgatctgtta ccagatctgt ttgccaatga catccagagt     2520

attgccaatt cggctgcgat gccggcgccg ctgcgccgcc tggcgaccat gctgcgcgag     2580

acctcggatg ctggcggttc ggcgcagagc tatctgatga atgcgactat cgcgcgcaag     2640

ttgttaaatc gtctggagaa tggcggttat atcaccttac atgaccgacg cgcgctgacg     2700

gtcgaaccgc cggagccgtt tttcacggaa gcgcagattg agcgcgaagc gattaaagtg     2760

cgggatggca tcctgggaat gttacatctg caacgaatgc cgacggagtt tgatgtggcg     2820

ctggcgaccg ttttttacct ggcggaccgt aatgtgaccg gcgagacctt tcacccgagc     2880

ggcggtctgc gcttcgagcg caccgttacg gaaggcgaac tgttcggcaa accgggacag     2940

cagcgcctgg aacgactgaa gggctcggtt gtttacctga ttggggagca tctgcgccaa     3000

cacctggtgc tgctggcgcg cacgttttta gatgagatcc acgttgcgcg cgttgtcctg     3060

ctgactgaaa cgacccaggc ggcaacggac ctggcagccg aactgagtga ttacgaagcc     3120

gccggacgat ttgtcgttat tccgacctgt ggcgacatcg aaggcgggat cgatcgagcg     3180

atggcagaat atggccgccc agggccggtc attagcaccc cgtttcgccc gctgccggat     3240

cgcgccctga gtgcccgaaa cggggattgg agtagtgtgt taacgacggc cgaatttgag     3300

gaattggttg aacagcagat tacgcaccat tttcgcgtcg cgcgcaaagc gggtctgatt     3360

gagggcgcga acgttacgct ggtgaccccg ccgacgagcg cgcgcagcac cagtgaggag     3420

tttgcgctgg caaactttgt taaaaccacc ttacatgcat taaccgcgac ggctggcgcc     3480

gaaagtgaac gcaccgtgcc gcacgtgccg gttaatcagg tggacctgac ccgtcgtgcc     3540

cgtagtgagg agccccgtac ccctagtgag gaggaggagg aattacagcg gttcgtgaat     3600

gccgtgctgc tgacgagtgc gccgctgcct accccgttag aaagtcgtta ccgtgcgcgt     3660

atttaccggg gaaatgcgat cacggtatga                                      3690


<210>  27
<211>  3651
<212>  DNA
<213>  Unknown
<220>
<223>  marine gamma proteobacterium

<400>  27
atgaacgatt tcgtgcaatt caccgatgac atgaccagtc agtcaaagtc cggaaaaaga       60

ctggacaaca aatctattat ccttacgggc gccgcgggat ctattggccg atttattacc      120

cggcaattac tctgcgaagg tgcgagagtg atgatgacag ggcgtgatat cagcaagttg      180

gaagaatttg tagattccct ctgcgatgac ggttttgaca gggagaatat ggtggttact      240

gtgggcgatt gcgcggaccc agaggtttgt cggcggatcg ttgcagacac cgtggaagcg      300

ttcggcacca ttgacgtgct ggtcaacaac gcgggtgctg ctggcccgaa atatacgcta      360

agagatattc cgttttccga tgtagagatg aagaccgcgg gatcagatca aacgatgttc      420

gattcagcga tgaatctatt aggcgctccc tggaatatgg cacgtgcagc ggcccctcac      480

atgtccgtgg gtgcttcaat catcaatgta tctacaatct tttctcggac gcattacttt      540

ggtcgcatac cgtatgtggt tccgaagtcg ggcctaaatg cgctatcaaa gggactcgca      600

ttggaattgg gggaggaaca aggcattcgt gtcaacactg tatttccggg acccattgaa      660

tcggaacgta tcgacacggt gtttgcgcgc atggacgaat tgcagaatct ggaacccggc      720

agcacgggcc gcgagttccg cgacctcatg attacgactc gcgagggtga ggaggggttg      780

gagtatcgct atcctacacc gaccgatgtg gcgtcgtcca ttacttggct cgcgtccggg      840

gagtcggccg cagtttctgg tcacgcagtt gaagtcacta atggtatgca ggtgccagcc      900

cagagtcggt cgcagttggt gtcatggccg gataagcgac tagaggatct ttccgatcac      960

attgtgctga tattaggcgg ctcagactat gaggaggcgg tgaccttcgc cgaacggcat     1020

actgaaagtg gcgctcgggt gttactagct tttagaaact tggaatccgt aggccatgct     1080

cgatctatta tccaagctcg ggaattggag tcagttcaat tgtctcattt ggacccgtta     1140

cgccgagagt cggtagacag aaccatgcaa ttcatagacg accatttcgg ccggttggat     1200

ggcgttattg ttttgcccca gaagaaaaac ggtcaatacg gttattccat ctgttccgct     1260

accgacgacg atgtagaaaa cttcgtcaag gacgaagtcg tggctccggt agcctttgcg     1320

tcaaccctcg cgaccaatct tagccgctgg tttggaaaat gtgatccgcc ggcgattacc     1380

tacgtcacca atgcgagtga tggacacggt aaccttctga acgaggtcat tcgcgcgtcg     1440

aacgaggcgc taattcgcgg ttggcgccac gaggatgaga cactaaaagc tgctggagag     1500

ttgtcgtggt ccgtgcaacc gaatcagtta gtgcgctatg acactgaaga tagagatgcg     1560

ctcccgtttg cagctgattg ggccgctacc ctcactaatc gcgtgcgaca gatggatccc     1620

attaatctct ggattcctaa ggatattaaa cgcgcgacag gtaaaggcgc gatgccgaca     1680

tctctgatgc gtgttcttcc gggcttacat aaaggaaaaa ccgcggtcat cacaggtggc     1740

agcttgggca taggcctaca attaggacgc tatctcgcca tcgcgggagc gcgagtgcta     1800

cttagtgccc gcagtgaggc gaaattgata gaagctaagg ccgagattgt tgcagaattg     1860

agtggcattg gctatccgaa cgccgacaac cggattcacg tgttggcgaa catcgacgtt     1920

ggagatcccg cagcactaga gacattacac cagcacgcgg tagacctttt cggtcaagtc     1980

gattttctta ttaataatgc aggcatttca ggtgcggagg agatggttgt tgatatgacc     2040

ctaaaagact gggatcgcac catggaggca aatctaatct caaattactc gctaatacgt     2100

aagtttggcc ccctaatgaa agataaaggg cgaggctcta tattgaatgt ttccagttac     2160

tttggtggtg aaaaatatgt ggcggtggcg tatccgaatc gcgccgacta tgcggtgtca     2220

aaggcaggac aacgtgtgct cgcggaaatc ctgtcgcgcc acttgggacc tgagattcag     2280

atcaacgcgc tggctccagg cccagttgat ggagctcggt tgcggggatt aggcgatgcg     2340

cccggtcttt ttgatcgtcg cgggagactt gttctcgaga ataaacgctt gaaccaggtc     2400

cacgcggcga tcatatcggc cgtcgcggat ggctacccca ttgaggaaat caggaaactc     2460

tcagccaacg cggttgaagt tttgccgacc cataatctac catccgtact aagccgtttg     2520

tattcccaag taaaagactc cgggggaaca ggaagctcta gtaaatgcct gctgcatatg     2580

ggcatggctg tcaagctcgt cgaacggtta gtcaatgctg gtatttttac gactgaggac     2640

aaagacgaat ttttaggctc gttcgtcgac gcaccgtcgc cgttctttga caaggaggcg     2700

tgcaaaagat ctgcaaccca gatcgaatct ggaatcctta accggctgca tttgcacaaa     2760

atgcctacag atgaacaagt cggcttatct accgtatttc atctcgcgga tgagatcgtc     2820

agtggtgaaa ctttccatcc atcaggcggc ctgaaatttg accgctctgt aaccgagggc     2880

gagcttctct tgtcccctag tgagaaagat ttagcccgat tgagtggcaa gcgcgttgtg     2940

atacttggcg actgcatgcg taacgaaatc acagagattg caaaggggtt taaatctaac     3000

ggtgttgaga agctctggat ccttacgcgc tcggaggaga ccaaaaccac gctataccat     3060

gctctcgaat gtgacagtgt ggagaacatc gacgtgcgct gcatcggtga cgatatcgaa     3120

ggtgcgctag ataatttact gcgccacgac ggcggattcg acgttgttgt cagcagcccg     3180

tttgaacgcc ttccgctaaa cgcactggca ggagaccgcg gtggtgactg ggatcgcgtg     3240

ctatcagatg agcagttccg acaactcgtt catcagcagt tgacccacca cttccgcagt     3300

gctcgaattg ctgccttaat accgagctgc caaatcgtat tattgacacc ggaaacctcg     3360

cttgcatcca cccgtgagga atttgcactt gcactgttcg tcaaaaatag tctgcacgct     3420

tttacggtga cgctaggtgt ggagggtgag cgtcttccca ccgttccagc tgtcaaccag     3480

gtgcagctta ctcgcagggc gcataccgag gagccaagta atgatcagga attaagtgag     3540

gagatggagc ggctggttgc tgctgtcatg caatgctcgg tcccagcccc ctctcctaaa     3600

gagagtcgtt acctgagtaa gatcttccgg ggaaacgccg tgacggtatg a              3651


<210>  28
<211>  3654
<212>  DNA
<213>  Erythrobacter sp.

<400>  28
atgtcgaagg aaggaaacgc cgccaaaggt cggttagaag gtaaagtggc gctgattacg       60

ggggcggcag gcaatttagg caacgagata tcgcgggcct tcgcccgcga aggcgccttc      120

gttgttatga cggggcgcac cgaggagcgg atctctgcgg cgcgtgaaca gttaattgcg      180

gataccggcg tggcgcctga gcgaattgat accgccgtgt tagacggcgg caatcccgac      240

tcgattcgcg cagcgatggc aaaattgcgc aaggaatacg gccgtattga cattttaatt      300

aacaatgcag gttctgctgg cccaaaacag ccgttacata acgtaccgtt aagccctcag      360

gagatggaag cgtgcggcga caccgagacc gtgcgcgacg cgatgttaaa tattttgggc      420

gttacctgga acatggcgcg cattgtcgcg ccaatgatgc cggttggcgg cgctatggtt      480

aatatttcga cgatctttag ccatacgcgc tactatggac gcacggctta cgtggttcca      540

aaagctgcgc tgaacgcgct ttcgaaccag ttggccagcg agttaggacc gcgcggcatt      600

cgcgttaaca cagtgtttcc aggcccgatc gaaagcgatc gcattcgcac cgtcttcgcc      660

gcgatggatg aggttcagag ccagccaaaa gatacgaccg caaactattt taccggtcgc      720

atggcgttaa cccgcagcgt gaacggaaaa gtagatggca aacctctgcc aaaccccaaa      780

gacattgcgg ggacgtgcct gtttttggcc tcagaggaag ccgcaggaat cgcgggcgag      840

gaagttgatg ttacccatgg tcttagtgcc aaccgcacct cggcatcgac ctacatgacc      900

cgtcccagta tgcgctcgtt agatggggcg ggtttaaata tttttattgt gtcgggagag      960

aactgggatg acgcgctggt ggccgctcat acgctgattg gaagtggcgc aaaagttcgc     1020

ttaggcttag ctcgcaatgc cgatgtcgcg caggccaatg cgcgtctgaa ggcgcaaggg     1080

atcggcgagg agctgaccgt gacccgtttt aaccgtgcag agccagacgc gatggaagat     1140

gcgttagccg cgttcagtgg cgacgtggat ggggcgatta ccggcgcgat tattctgccg     1200

gtgaaaccct cgggccattt taccggatcg ctgttagccg ccgatgacga caccgtcacg     1260

aaatttatgg ataccgagtt ggttggcgcg atcgcagtgt cgcgaagctt ggcgcgttac     1320

tggcacgggc gagaggactt acagagtcct ccacgctgcg tttttatgac caatccgggc     1380

gacccactcg gcaatagttt tgcctcggtg ttaagtgccg gcattaccca gctgattcgc     1440

atttggcgcg acgaggaacg cgttcaggcg ggcaatggct cgaccgagca tgccgtttgg     1500

tcgaaccaga ttgttcgcca taccaacacc gaagatgaga acacccgctt cgcctcgggc     1560

cacgccaccc gcgtcttatt tcgcgaacag catattgccg agattgattt aaaactgcca     1620

gcgaatatta gcgaggaaac cggatcgcgc aaagccatgg tgggcttcgc cgagaacatt     1680

accgggcttc atttgggcaa agtcgctttt attaccggcg gctctgccgg gattggcggc     1740

caggttgcgc gcctcttagc gttagcaggc gcaaaagtta tgatggtggc aagacgcgaa     1800

agcgagttgg tggccgcccg ggatcgtatt gttggtgagt tgcaggacat tggctttgcg     1860

ggcgtcgaac gccgtgtgaa gtatatggcc gatattgatg tgagcgattt tgcctcgtta     1920

gataaagcgg tcgatgcgac gttagaggag tttgggcgta tcgactattt aattaataac     1980

gcaggcgtcg cgggcgccga ggatatggtt attgatatgg agccagaggc atggcgcttt     2040

acgttagacg cgaacttaat tagtaattat cacctgatgc agcgcgtggt tccgctgatg     2100

aaagaacagg gcagtggcta tgtgttaaat gtgagtagtt actttggcgg tgaaaaattt     2160

ttagcggtgg cctatccaaa ccgtgccgac tacggactga gtaaggcggg ccagcgggcg     2220

atggtggagg cgtttagtcc gtttttaggg cccgaggtac agtgcaacgc catcgcgccg     2280

ggccctgtgg acggcgatcg gcttagtggt accggtggaa agccaggtct gtttcagcgc     2340

cgtgccaaac tgattttgga gaacaaacga ctgaatgcgg tgtacagtgc agtgattcat     2400

gcgattcgcg agggcggcga cgcggcgaag attctgacgc gactctcgcg caattcgacc     2460

tcgaccttaa gccacgatgc agaagcacca gaggaactgc gcaaattagc attagatttt     2520

gcatcgcagg gtgacgggct gtgcacgtgg gaccagtact tactgaccga tgcgatggcg     2580

cagcggctct tagtgcggtt gcagttgggc ggctttctgt taggctcgaa cgaatgggcg     2640

agcctgtcga gcagcgagca gacgtggtta aagttatcgc ctccagacga taaaccattt     2700

ttaccagctg cgcaggtgga taaagtggca aacggcgtgg gcaaaggcgt tatttcgcag     2760

ttgcatttgg gtgcgatgcc gaccgaggcg gaggttgcgc aagcgaccgt gtttttttta     2820

gccgatcgcg ctgttagcgg ggaaaccttt atgccgtccg gcggcttacg tgtggaacgc     2880

agtaacaccg agcgcgagat gtttggcagc ccaaaacaag agcgcattga taaaatgaaa     2940

gggaagaccg tgtggattat tggcgagcat ctgagtgact acgtggctgc gacaattgag     3000

gagttagtct ccggctgcgg cgtggccaaa gtggttctga ttgccaaaga taaaagtggc     3060

gaaaaagcgg ttcgcgatca gctcccaaac gatttgtcga aggatgcgtt agaagttctg     3120

attgcgggtg acgggttgga ggaagcgatg gatgaggcgt tgggccactg gggcaaacca     3180

accacggtgc tgagtatgcc gggtgaacca ctcccagacc atctgtttga aggcggcaac     3240

ccgttgtcga ccaaagactt tgcgcacatg gtggaggcga acattacccg ccattaccgc     3300

gttacgcgca aagcgtcgtt gtacgatgga tgccaagtgg ttctcgtttc gccggatgtt     3360

ccgtatggca gtgacggccc aggagttgcg ttagccaatt ttgttaaaac gagcctgcat     3420

gcttttaccg cgacggtcgc ggttgagaat gagagactcg tgcatgacgt tccggtgaac     3480

cagattaact taacccgccg ggtgtcgagc gaggagccgc gcgacgctga tgaacacgcc     3540

gaggagttaa gacgctttac ccgcgctgtc ctgcttgtgg gcgcaccgct gccagacgcg     3600

caggatagtc gctatcgctc gaaaatttac cgcggcacgt cgatgacggt atga           3654


<210>  29
<211>  3660
<212>  DNA
<213>  Chloroflexus aurantiacus

<400>  29
atgtcgggaa ctggacgact ggcaggaaaa atcgccctta tcaccggcgg tgcgggtaac       60

attggttcgg aattgactcg tcgcttttta gcagagggag ccacggttat catctcggga      120

cggaaccggg ccaaattgac cgcactggcg gaacggatgc aggcagaggc aggagtgccg      180

gcaaaacgca ttgatttaga agttatggat gggtcggatc cggttgccgt acgtgcgggc      240

attgaagcca tcgtggcgcg tcatggtcag attgacatcc tggttaataa cgcaggatcg      300

gcgggcgcgc agcgtcgtct ggcggagatc ccattaacag aagctgaact tggtcccggt      360

gcggaggaga cgctccacgc gagcattgcg aaccttctcg gcatgggatg gcacctgatg      420

cgtatcgccg caccccacat gccggtagga tcggccgtta ttaacgttag taccattttt      480

tcccgggctg agtattatgg gcggatcccg tacgttaccc ccaaggctgc tctcaacgct      540

ctatcacaac tcgctgcccg tgagcttggc gcacgtggta ttcgcgtcaa cacgattttt      600

cctggtccga tcgaatcgga tcgcattcgt actgtgttcc agcgtatgga tcagttaaaa      660

gggcggcctg aaggtgacac tgcccatcac tttttgaata ccatgcgatt gtgccgtgcg      720

aatgaccagg gtgccctcga acgtcggttc ccttctgttg gcgatgtggc agacgcggct      780

gtttttctgg cgtcggcgga atctgcggct ttatctggcg agacgatcga ggtcacgcat      840

ggaatggagt tgccggcgtg ttcggagacc agcctgctgg cgcgtacaga tctgcgcacg      900

atcgatgcgt cgggtcgcac gacgttaatt tgtgcgggtg accagatcga ggaggtgatg      960

gccttaaccg gcatgttgcg tacctgcggg tcggaagtga ttattggttt ccgtagtgct     1020

gccgccctgg cgcagttcga gcaggcagtt aacgagtcgc ggcggctggc gggtgcagac     1080

tttacgcccc ctatcgcgtt gccattagat ccacgcgatc cggcaactat cgacgctgtt     1140

ttcgattggg cgggtgagaa caccggtggg atccacgcag ccgtgatcct gcccgctacc     1200

tcgcatgaac cggcaccgtg tgtgatcgag gtcgatgatg agcgggtgct gaactttctg     1260

gcggatgaaa ttaccgggac tatcgtgatc gcgtcgcgcc tggcgcgtta ttggcagagt     1320

caacggctca cccctggtgc acgtgcccgt gggccgcgtg ttatcttttt aagtaatggc     1380

gcggatcaaa acgggaacgt ctatggacgc atccaatcgg cggctattgg ccagttaatc     1440

cgtgtgtggc gtcatgaggc tgaactcgac taccagcgtg cgagcgcggc gggcgatcac     1500

gtgctgccgc cggtatgggc gaaccagatc gtgcgcttcg ctaatcgcag cctcgaaggg     1560

cttgaatttg cgtgcgcgtg gactgctcaa ttgttacact cgcaacgcca cattaacgag     1620

atcaccttaa atattcccgc gaatatcagc gcgaccaccg gtgcacgctc ggccagtgtt     1680

ggatgggccg aaagcctgat tgggttacac ttggggaagg tcgcgttgat caccggcggt     1740

agcgcgggca tcggcgggca gattgggcgc ttactggctt tatcgggtgc ccgcgtgatg     1800

ctggcagcgc gtgatcggca caaattagaa cagatgcagg ccatgattca atcagagctg     1860

gctgaggtgg ggtacaccga tgttgaagat cgcgttcata tcgcaccggg ttgtgatgtg     1920

tcgagcgaag cccagctcgc cgatctcgtc gaacgtaccc tgtccgcttt tggtaccgtt     1980

gattacctga ttaataatgc ggggattgcg ggcgttgagg agatggtcat tgatatgcca     2040

gtcgagggat ggcgccacac cttattcgcg aacctgatta gcaattatag tttgatgcgc     2100

aagctggccc cgttgatgaa gaagcagggc agcggctata ttctcaatgt ttcctcctat     2160

tttggtggcg aaaaggatgc cgcgatccct tatcctaatc gtgcggatta tgcggttagt     2220

aaagctggcc agcgggcaat ggcggaagtt tttgcccgct tcctcggtcc ggagatacag     2280

attaacgcga tcgccccggg cccggttgaa ggcgatcgct tacgcggcac cggcgaacgt     2340

cctggtttat ttgcgcgtcg ggcccggctg atcttggaga ataaacggct gaacgagctc     2400

catgctgctc tcattgccgc tgcccgcacc gatgagcgat caatgcatga actggtcgaa     2460

ctgttacttc ctaacgatgt ggcggcacta gagcagaacc cagcagcacc caccgccttg     2520

cgtgaactgg cacgacgttt tcgcagcgaa ggtgatccgg ccgcatcctc cagctcggcc     2580

ctgctgaatc gttccatcgc ggctaagttg ctggctcgtt tgcacaacgg cggttacgtg     2640

ttgcccgcgg acatttttgc aaatctgcca aatccgcctg atcctttctt cacccgagcg     2700

cagatcgatc gcgaggctcg caaagtccgt gacggtatta tggggatgtt atatctgcaa     2760

cggatgccga cagagtttga tgttgcaatg gcgaccgttt actatctcgc ggaccgcaac     2820

gtttcgggcg agactttcca tccatccggc ggcttgcgtt atgaacgcac ccccaccggc     2880

ggtgaattat tcggtttgcc ttccccggaa cggctggccg agctggttgg aagcacggtt     2940

tacctgatag gcgaacacct gacagaacat ctcaatctgc tcgcgcgtgc ctatttagaa     3000

cgttatgggg cacgtcaggt agtgatgatc gtcgagactg aaaccggggc agagactatg     3060

cgtcgcttgt tacatgatca tgttgaggct ggccggctga tgacaatcgt ggcgggcgat     3120

cagattgaag cggctattga ccaggctatt acacgctatg gccgcccagg gccggttgtt     3180

tgcacccctt tccggccact gccgacggta ccactggttg ggcgtaagga ctcggactgg     3240

agcactgtgt tgtcggaggc tgaatttgcg gagttgtgtg aacatcagtt aacccatcac     3300

ttccgggtag cccgcaaaat cgcgctgtcg gatggcgcgt cgttagccct ggttacacct     3360

gaaacaacgg ctacctccac aaccgagcaa tttgctctgg ctaatttcat taagacgacc     3420

ctccatgctt ttacggctac gatcggcgtt gagagcgaaa ggacagctca gcgcatcctg     3480

attaaccaag ttgatctgac ccggcgtgcc cgtgcggagg agccgcgtga tccgcatgag     3540

cgtcaacaag aactggaacg ttttattgag gcagttttac tggttacagc accattaccg     3600

cccgaagcgg atacccgtta tgcggggcgg atccaccgcg gacgggccat caccgtatga     3660


<210>  30
<211>  5469
<212>  DNA
<213>  Chloroflexus aurantiacus

<400>  30
atgatcgaca ctgcgcccct tgccccacca cgggcgcccc gctctaatcc gattcgggat       60

cgagttgatt gggaagctca gcgtgctgct gcgctggcag atcccggtgc ctttcatggc      120

gcgattgccc ggacagttat ccactggtac gacccacaac accattgctg gattcgcttc      180

aacgagtcta gtcagcgttg ggaagggctg gatgccgcta ccggtgcccc tgtaacggta      240

gactatcccg ccgattatca gccctggcaa caggcgtttg atgatagtga agcgccgttt      300

taccgctggt ttagtggtgg gttgacaaat gcctgcttta atgaagtaga ccggcatgtc      360

acgatgggct atggcgacga ggtggcctac tactttgaag gtgaccgctg ggataactcg      420

ctcaacaatg gtcgtggtgg tccggttgtc caggagacaa tcacgcgacg gcgtctgttg      480

gtggaggtgg tgaaggctgc gcaggtgttg cgcgatctgg gcctgaagaa gggtgatcgg      540

attgctctga atatgccgaa tattatgccg cagatttatt atacggaagc ggcaaaacga      600

ctgggtattc tgtacacgcc ggtcttcggt ggcttctcgg acaagactct ttccgaccgt      660

attcacaatg ccggtgcacg agtggtgatt acctctgatg gcgcgtatcg caacgcgcag      720

gtggtgccct acaaagaagc gtataccgat caggcgctcg ataagtatat tccggttgag      780

actgcgcagg cgattgttgc gcagaccctg gccaccttgc ccctgactga gtcgcagcgc      840

cagacgatca tcaccgaagt ggaggccgcc ctggcaggtg agattacggt tgagcgttcg      900

gacgtgatgc gtggggttgg ttctgccctc gcaaagctcc gcgatcttga tgcaagcgtg      960

caggcaaagg tgcgcacagt actggcgcag gcgctggtcg agtcgccgcc gcgggttgaa     1020

gctgtggtgg ttgtgcgtca taccggtcag gagattttgt ggaacgaggg gcgagatcgc     1080

tggagtcacg acttgctgga tgctgcgctg gcgaagattc tggccaatgc gcgtgctgca     1140

ggctttgatg tgcacagtga gaatgatctg ctcaatctcc ccgatgacca gcttatccgt     1200

gcgctctacg ccagtattcc ctgtgaaccg gttgatgctg aatatccgat gtttatcatt     1260

tacacatcgg gtagcaccgg taagcccaag ggtgtgatcc acgttcacgg cggttatgtc     1320

gccggtgtgg tgcacacctt gagggtcagt tttgacgccg agccgggtga tacgatatat     1380

gtgatcgccg atccgggctg gatcaccggc cagagctata tgctcacagc cacaatggcc     1440

ggtagactga ccggggtgat tgccgaggga tcaccgcttt tcccctcagc cgggcgttat     1500

gccagcatca tcgagcgcta tggggtgcag atctttaagg cgggtgtgac cttcctcaag     1560

acagtgatgt ccaatccgca gaatgttgaa gatgtgcgac tctatgatat gcactcgctg     1620

agagttgcaa ccttctgcgc cgagccggta agtccggcgg tgcagcagtt tggtatgcag     1680

atcatgaccc cgcagtatat caattcgtac tgggcgaccg agcacggtgg aattgtctgg     1740

acgcatttct acggtaatca ggactttccg cttcgtcccg atgcccatac ctatcccttg     1800

ccctgggtga tgggtgatgt ctgggtggcc gaaactgatg agagcgggac gacgcgctat     1860

cgggtcgctg atttcgatga gaagggcgag attgtgatta ccgccccgta tccctacctg     1920

acccgcacac tctggggtga tgtgcccggt ttcgaggcgt acctgcgcgg tgagattccg     1980

ctgcgagcct ggaagggtga tgccgagcgt ttcgtcaaga cctactggcg acgtgggcca     2040

aacggtgaat ggggctatat ccagggtgat tttgccatca agtaccccga tggtagcttc     2100

acgctccacg gacgctctga cgatgtgatc aatgtgtcgg gccaccgtat gggcaccgag     2160

gagattgagg gtgccatttt gcgtgaccgc cagatcacgc ccgactcgcc tgtcggtaat     2220

tgtattgtgg tcggtgcgcc gcatcgtgag aagggtctga ccccggttgc cttcattcaa     2280

cctgcgcctg gccgtcatct gaccggtgca gacaggcgcc gtctcgatga gctggtgcgc     2340

accgagaagg gggcggtcag tgtcccagag gattacatcg aggtcagtgc ctttcccgaa     2400

acccgcagcg ggaagtatat gaggcgcttt ttgcgcaata tgatgctcga tgaaccactg     2460

ggtgatacga cgacgttgcg caatcctgaa gtgctcgaag aaattgcagc caagatcgct     2520

gagtggaaac gccgtcagcg tatggccgaa gaacagcaga tcatcgaacg ctatcgctac     2580

ttccggatcg agtatcatcc accaacggcc agtgcgggta aactcgcggt agtgacggtg     2640

acaaatccgc cggtgaacgc actgaatgag cgtgcgttag atgagttgaa cacaattgtt     2700

gaccacctgg cccgtcgtca ggatgttgcc gcaattgtct tcaccggaca gggcgccagg     2760

agttttgtcg ccggtgctga tattcgccag ttgctcgaag aaattcatac ggttgaagaa     2820

gcaatggccc tgccgaataa cgcccatctt gctttccgca agattgagcg tatgaataag     2880

ccgtgtatcg cggcgatcaa cggtgtggcg ctcggtggtg gtctggaatt tgccatggcc     2940

tgccattacc gggttgccga tgtctatgcc gaatttggtc agccagagat taatctgcgc     3000

ttgctacctg gttatggtgg cacgcagcgc ttgccgcgtc tgttgtacaa gcgcaacaac     3060

ggcaccggtc tgctccgagc gctggagatg attctgggtg ggcgtagcgt accggctgat     3120

gaggcgctgg agctgggtct gatcgatgcc attgctaccg gcgatcagga ctcactgtcg     3180

ctggcatgcg cgttagcccg tgccgcaatc ggtgccgatg gtcagttgat cgagtcggct     3240

gcggtgaccc aggctttccg ccatcgccac gagcagcttg acgagtggcg caaaccagac     3300

ccgcgctttg ccgatgacga actgcgctcg attatcgccc atccacgtat cgagcggatt     3360

atccggcagg cccataccgt tgggcgcgat gcggcagtgc accgggcact ggatgcaatc     3420

cgctatggca ttatccacgg cttcgaggcc ggtctggagc acgaggcgaa gctctttgcc     3480

gaggcagtgg ttgacccgaa cggtggcaag cgtggtattc gcgagttcct cgaccgccag     3540

agtgcgccgt tgccaacccg ccgaccattg attacacctg aacaggagca actcttgcgc     3600

gatcagaaag aactgttgcc ggttggttca cccttcttcc ccggtgttga ccggattccg     3660

aagtggcagt acgcgcaggc ggttattcgt gatccggaca ccggtgcggc ggctcacggc     3720

gatcccatcg tggctgaaaa gcagattatt gtgccggtgg aacgcccccg cgccaatcag     3780

gcgctgattt atgttctggc ctcggaggtg aacttcaacg atatctgggc gattaccggt     3840

attccggtgt cacggtttga tgagcacgac cgcgactggc acgttaccgg ttcaggtggc     3900

atcggcctga tcgttgcgct gggtgaagaa gcgcgacgcg aaggccggct gaaggtgggt     3960

gatctggtgg cgatctactc cgggcagtcg gatctgctct caccgctgat gggccttgat     4020

ccgatggccg ccgatttcgt catccagggg aacgacacgc cagatggatc gcatcagcaa     4080

tttatgctgg cccaggcccc gcagtgtctg cccatcccaa ccgatatgtc tatcgaggca     4140

gccggcagct acatcctcaa tctcggtacg atctatcgcg ccctctttac gacgttgcaa     4200

atcaaggccg gacgcaccat ctttatcgag ggtgcggcga ccggcaccgg tctggacgca     4260

gcgcgctcgg cggcccggaa tggtctgcgc gtaattggaa tggtcagttc gtcgtcacgt     4320

gcgtctacgc tgctggctgc gggtgcccac ggtgcgatta accgtaaaga cccggaggtt     4380

gccgattgtt tcacgcgcgt gcccgaagat ccatcagcct gggcagcctg ggaagccgcc     4440

ggtcagccgt tgctggcgat gttccgggcg cagaacgacg ggcgactggc cgattatgtg     4500

gtctcgcacg cgggcgagac ggccttcccg cgcagtttcc agcttctcgg cgagccacgc     4560

gatggtcaca ttccgacgct cacattctac ggtgccacca gtggctacca cttcaccttc     4620

ctgggtaagc cagggtcagc ttcgccgacc gagatgctgc ggcgggccaa tctccgcgcc     4680

ggtgaggcgg tgttgatcta ctacggggtt gggagcgatg acctggtaga taccggcggt     4740

ctggaggcta tcgaggcggc gcggcaaatg ggagcgcgga tcgtcgtcgt taccgtcagc     4800

gatgcgcaac gcgagtttgt cctctcgttg ggcttcgggg ctgccctacg tggtgtcgtc     4860

agcctggcgg aactcaaacg acgcttcggc gatgagtttg agtggccgcg cacgatgccg     4920

ccgttgccga acgcccgcca ggacccgcag ggtctgaaag aggctgtccg ccgcttcaac     4980

gatctggtct tcaagccgct aggaagcgcg gtcggtgtct tcttgcggag tgccgacaat     5040

ccgcgtggct accccgatct gatcatcgag cgggctgccc acgatgcact ggcggtgagc     5100

gcgatgctga tcaagccctt caccggacgg attgtctact tcgaggacat tggtgggcgg     5160

cgttactcct tcttcgcacc gcaaatctgg gtgcgccagc gccgcatcta catgccgacg     5220

gcacagatct ttggtacgca cctctcaaat gcgtatgaaa ttctgcgtct gaatgatgag     5280

atcagcgccg gtctgctgac gattaccgag ccggcagtgg tgccgtggga tgaactaccc     5340

gaagcacatc aggcgatgtg ggaaaatcgc cacacggcgg ccacttatgt ggtgaatcat     5400

gccttaccac gtctcggcct aaagaacagg gacgagctgt acgaggcgtg gacggccggc     5460

gagcgctaa                                                             5469


<210>  31
<211>  1822
<212>  PRT
<213>  Chloroflexus aurantiacus

<400>  31

Met Ile Asp Thr Ala Pro Leu Ala Pro Pro Arg Ala Pro Arg Ser Asn 
1               5                   10                  15      


Pro Ile Arg Asp Arg Val Asp Trp Glu Ala Gln Arg Ala Ala Ala Leu 
            20                  25                  30          


Ala Asp Pro Gly Ala Phe His Gly Ala Ile Ala Arg Thr Val Ile His 
        35                  40                  45              


Trp Tyr Asp Pro Gln His His Cys Trp Ile Arg Phe Asn Glu Ser Ser 
    50                  55                  60                  


Gln Arg Trp Glu Gly Leu Asp Ala Ala Thr Gly Ala Pro Val Thr Val 
65                  70                  75                  80  


Asp Tyr Pro Ala Asp Tyr Gln Pro Trp Gln Gln Ala Phe Asp Asp Ser 
                85                  90                  95      


Glu Ala Pro Phe Tyr Arg Trp Phe Ser Gly Gly Leu Thr Asn Ala Cys 
            100                 105                 110         


Phe Asn Glu Val Asp Arg His Val Thr Met Gly Tyr Gly Asp Glu Val 
        115                 120                 125             


Ala Tyr Tyr Phe Glu Gly Asp Arg Trp Asp Asn Ser Leu Asn Asn Gly 
    130                 135                 140                 


Arg Gly Gly Pro Val Val Gln Glu Thr Ile Thr Arg Arg Arg Leu Leu 
145                 150                 155                 160 


Val Glu Val Val Lys Ala Ala Gln Val Leu Arg Asp Leu Gly Leu Lys 
                165                 170                 175     


Lys Gly Asp Arg Ile Ala Leu Asn Met Pro Asn Ile Met Pro Gln Ile 
            180                 185                 190         


Tyr Tyr Thr Glu Ala Ala Lys Arg Leu Gly Ile Leu Tyr Thr Pro Val 
        195                 200                 205             


Phe Gly Gly Phe Ser Asp Lys Thr Leu Ser Asp Arg Ile His Asn Ala 
    210                 215                 220                 


Gly Ala Arg Val Val Ile Thr Ser Asp Gly Ala Tyr Arg Asn Ala Gln 
225                 230                 235                 240 


Val Val Pro Tyr Lys Glu Ala Tyr Thr Asp Gln Ala Leu Asp Lys Tyr 
                245                 250                 255     


Ile Pro Val Glu Thr Ala Gln Ala Ile Val Ala Gln Thr Leu Ala Thr 
            260                 265                 270         


Leu Pro Leu Thr Glu Ser Gln Arg Gln Thr Ile Ile Thr Glu Val Glu 
        275                 280                 285             


Ala Ala Leu Ala Gly Glu Ile Thr Val Glu Arg Ser Asp Val Met Arg 
    290                 295                 300                 


Gly Val Gly Ser Ala Leu Ala Lys Leu Arg Asp Leu Asp Ala Ser Val 
305                 310                 315                 320 


Gln Ala Lys Val Arg Thr Val Leu Ala Gln Ala Leu Val Glu Ser Pro 
                325                 330                 335     


Pro Arg Val Glu Ala Val Val Val Val Arg His Thr Gly Gln Glu Ile 
            340                 345                 350         


Leu Trp Asn Glu Gly Arg Asp Arg Trp Ser His Asp Leu Leu Asp Ala 
        355                 360                 365             


Ala Leu Ala Lys Ile Leu Ala Asn Ala Arg Ala Ala Gly Phe Asp Val 
    370                 375                 380                 


His Ser Glu Asn Asp Leu Leu Asn Leu Pro Asp Asp Gln Leu Ile Arg 
385                 390                 395                 400 


Ala Leu Tyr Ala Ser Ile Pro Cys Glu Pro Val Asp Ala Glu Tyr Pro 
                405                 410                 415     


Met Phe Ile Ile Tyr Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly Val 
            420                 425                 430         


Ile His Val His Gly Gly Tyr Val Ala Gly Val Val His Thr Leu Arg 
        435                 440                 445             


Val Ser Phe Asp Ala Glu Pro Gly Asp Thr Ile Tyr Val Ile Ala Asp 
    450                 455                 460                 


Pro Gly Trp Ile Thr Gly Gln Ser Tyr Met Leu Thr Ala Thr Met Ala 
465                 470                 475                 480 


Gly Arg Leu Thr Gly Val Ile Ala Glu Gly Ser Pro Leu Phe Pro Ser 
                485                 490                 495     


Ala Gly Arg Tyr Ala Ser Ile Ile Glu Arg Tyr Gly Val Gln Ile Phe 
            500                 505                 510         


Lys Ala Gly Val Thr Phe Leu Lys Thr Val Met Ser Asn Pro Gln Asn 
        515                 520                 525             


Val Glu Asp Val Arg Leu Tyr Asp Met His Ser Leu Arg Val Ala Thr 
    530                 535                 540                 


Phe Cys Ala Glu Pro Val Ser Pro Ala Val Gln Gln Phe Gly Met Gln 
545                 550                 555                 560 


Ile Met Thr Pro Gln Tyr Ile Asn Ser Tyr Trp Ala Thr Glu His Gly 
                565                 570                 575     


Gly Ile Val Trp Thr His Phe Tyr Gly Asn Gln Asp Phe Pro Leu Arg 
            580                 585                 590         


Pro Asp Ala His Thr Tyr Pro Leu Pro Trp Val Met Gly Asp Val Trp 
        595                 600                 605             


Val Ala Glu Thr Asp Glu Ser Gly Thr Thr Arg Tyr Arg Val Ala Asp 
    610                 615                 620                 


Phe Asp Glu Lys Gly Glu Ile Val Ile Thr Ala Pro Tyr Pro Tyr Leu 
625                 630                 635                 640 


Thr Arg Thr Leu Trp Gly Asp Val Pro Gly Phe Glu Ala Tyr Leu Arg 
                645                 650                 655     


Gly Glu Ile Pro Leu Arg Ala Trp Lys Gly Asp Ala Glu Arg Phe Val 
            660                 665                 670         


Lys Thr Tyr Trp Arg Arg Gly Pro Asn Gly Glu Trp Gly Tyr Ile Gln 
        675                 680                 685             


Gly Asp Phe Ala Ile Lys Tyr Pro Asp Gly Ser Phe Thr Leu His Gly 
    690                 695                 700                 


Arg Ser Asp Asp Val Ile Asn Val Ser Gly His Arg Met Gly Thr Glu 
705                 710                 715                 720 


Glu Ile Glu Gly Ala Ile Leu Arg Asp Arg Gln Ile Thr Pro Asp Ser 
                725                 730                 735     


Pro Val Gly Asn Cys Ile Val Val Gly Ala Pro His Arg Glu Lys Gly 
            740                 745                 750         


Leu Thr Pro Val Ala Phe Ile Gln Pro Ala Pro Gly Arg His Leu Thr 
        755                 760                 765             


Gly Ala Asp Arg Arg Arg Leu Asp Glu Leu Val Arg Thr Glu Lys Gly 
    770                 775                 780                 


Ala Val Ser Val Pro Glu Asp Tyr Ile Glu Val Ser Ala Phe Pro Glu 
785                 790                 795                 800 


Thr Arg Ser Gly Lys Tyr Met Arg Arg Phe Leu Arg Asn Met Met Leu 
                805                 810                 815     


Asp Glu Pro Leu Gly Asp Thr Thr Thr Leu Arg Asn Pro Glu Val Leu 
            820                 825                 830         


Glu Glu Ile Ala Ala Lys Ile Ala Glu Trp Lys Arg Arg Gln Arg Met 
        835                 840                 845             


Ala Glu Glu Gln Gln Ile Ile Glu Arg Tyr Arg Tyr Phe Arg Ile Glu 
    850                 855                 860                 


Tyr His Pro Pro Thr Ala Ser Ala Gly Lys Leu Ala Val Val Thr Val 
865                 870                 875                 880 


Thr Asn Pro Pro Val Asn Ala Leu Asn Glu Arg Ala Leu Asp Glu Leu 
                885                 890                 895     


Asn Thr Ile Val Asp His Leu Ala Arg Arg Gln Asp Val Ala Ala Ile 
            900                 905                 910         


Val Phe Thr Gly Gln Gly Ala Arg Ser Phe Val Ala Gly Ala Asp Ile 
        915                 920                 925             


Arg Gln Leu Leu Glu Glu Ile His Thr Val Glu Glu Ala Met Ala Leu 
    930                 935                 940                 


Pro Asn Asn Ala His Leu Ala Phe Arg Lys Ile Glu Arg Met Asn Lys 
945                 950                 955                 960 


Pro Cys Ile Ala Ala Ile Asn Gly Val Ala Leu Gly Gly Gly Leu Glu 
                965                 970                 975     


Phe Ala Met Ala Cys His Tyr Arg Val Ala Asp Val Tyr Ala Glu Phe 
            980                 985                 990         


Gly Gln Pro Glu Ile Asn Leu Arg  Leu Leu Pro Gly Tyr  Gly Gly Thr 
        995                 1000                 1005             


Gln Arg  Leu Pro Arg Leu Leu  Tyr Lys Arg Asn Asn  Gly Thr Gly 
    1010                 1015                 1020             


Leu Leu  Arg Ala Leu Glu Met  Ile Leu Gly Gly Arg  Ser Val Pro 
    1025                 1030                 1035             


Ala Asp  Glu Ala Leu Glu Leu  Gly Leu Ile Asp Ala  Ile Ala Thr 
    1040                 1045                 1050             


Gly Asp  Gln Asp Ser Leu Ser  Leu Ala Cys Ala Leu  Ala Arg Ala 
    1055                 1060                 1065             


Ala Ile  Gly Ala Asp Gly Gln  Leu Ile Glu Ser Ala  Ala Val Thr 
    1070                 1075                 1080             


Gln Ala  Phe Arg His Arg His  Glu Gln Leu Asp Glu  Trp Arg Lys 
    1085                 1090                 1095             


Pro Asp  Pro Arg Phe Ala Asp  Asp Glu Leu Arg Ser  Ile Ile Ala 
    1100                 1105                 1110             


His Pro  Arg Ile Glu Arg Ile  Ile Arg Gln Ala His  Thr Val Gly 
    1115                 1120                 1125             


Arg Asp  Ala Ala Val His Arg  Ala Leu Asp Ala Ile  Arg Tyr Gly 
    1130                 1135                 1140             


Ile Ile  His Gly Phe Glu Ala  Gly Leu Glu His Glu  Ala Lys Leu 
    1145                 1150                 1155             


Phe Ala  Glu Ala Val Val Asp  Pro Asn Gly Gly Lys  Arg Gly Ile 
    1160                 1165                 1170             


Arg Glu  Phe Leu Asp Arg Gln  Ser Ala Pro Leu Pro  Thr Arg Arg 
    1175                 1180                 1185             


Pro Leu  Ile Thr Pro Glu Gln  Glu Gln Leu Leu Arg  Asp Gln Lys 
    1190                 1195                 1200             


Glu Leu  Leu Pro Val Gly Ser  Pro Phe Phe Pro Gly  Val Asp Arg 
    1205                 1210                 1215             


Ile Pro  Lys Trp Gln Tyr Ala  Gln Ala Val Ile Arg  Asp Pro Asp 
    1220                 1225                 1230             


Thr Gly  Ala Ala Ala His Gly  Asp Pro Ile Val Ala  Glu Lys Gln 
    1235                 1240                 1245             


Ile Ile  Val Pro Val Glu Arg  Pro Arg Ala Asn Gln  Ala Leu Ile 
    1250                 1255                 1260             


Tyr Val  Leu Ala Ser Glu Val  Asn Phe Asn Asp Ile  Trp Ala Ile 
    1265                 1270                 1275             


Thr Gly  Ile Pro Val Ser Arg  Phe Asp Glu His Asp  Arg Asp Trp 
    1280                 1285                 1290             


His Val  Thr Gly Ser Gly Gly  Ile Gly Leu Ile Val  Ala Leu Gly 
    1295                 1300                 1305             


Glu Glu  Ala Arg Arg Glu Gly  Arg Leu Lys Val Gly  Asp Leu Val 
    1310                 1315                 1320             


Ala Ile  Tyr Ser Gly Gln Ser  Asp Leu Leu Ser Pro  Leu Met Gly 
    1325                 1330                 1335             


Leu Asp  Pro Met Ala Ala Asp  Phe Val Ile Gln Gly  Asn Asp Thr 
    1340                 1345                 1350             


Pro Asp  Gly Ser His Gln Gln  Phe Met Leu Ala Gln  Ala Pro Gln 
    1355                 1360                 1365             


Cys Leu  Pro Ile Pro Thr Asp  Met Ser Ile Glu Ala  Ala Gly Ser 
    1370                 1375                 1380             


Tyr Ile  Leu Asn Leu Gly Thr  Ile Tyr Arg Ala Leu  Phe Thr Thr 
    1385                 1390                 1395             


Leu Gln  Ile Lys Ala Gly Arg  Thr Ile Phe Ile Glu  Gly Ala Ala 
    1400                 1405                 1410             


Thr Gly  Thr Gly Leu Asp Ala  Ala Arg Ser Ala Ala  Arg Asn Gly 
    1415                 1420                 1425             


Leu Arg  Val Ile Gly Met Val  Ser Ser Ser Ser Arg  Ala Ser Thr 
    1430                 1435                 1440             


Leu Leu  Ala Ala Gly Ala His  Gly Ala Ile Asn Arg  Lys Asp Pro 
    1445                 1450                 1455             


Glu Val  Ala Asp Cys Phe Thr  Arg Val Pro Glu Asp  Pro Ser Ala 
    1460                 1465                 1470             


Trp Ala  Ala Trp Glu Ala Ala  Gly Gln Pro Leu Leu  Ala Met Phe 
    1475                 1480                 1485             


Arg Ala  Gln Asn Asp Gly Arg  Leu Ala Asp Tyr Val  Val Ser His 
    1490                 1495                 1500             


Ala Gly  Glu Thr Ala Phe Pro  Arg Ser Phe Gln Leu  Leu Gly Glu 
    1505                 1510                 1515             


Pro Arg  Asp Gly His Ile Pro  Thr Leu Thr Phe Tyr  Gly Ala Thr 
    1520                 1525                 1530             


Ser Gly  Tyr His Phe Thr Phe  Leu Gly Lys Pro Gly  Ser Ala Ser 
    1535                 1540                 1545             


Pro Thr  Glu Met Leu Arg Arg  Ala Asn Leu Arg Ala  Gly Glu Ala 
    1550                 1555                 1560             


Val Leu  Ile Tyr Tyr Gly Val  Gly Ser Asp Asp Leu  Val Asp Thr 
    1565                 1570                 1575             


Gly Gly  Leu Glu Ala Ile Glu  Ala Ala Arg Gln Met  Gly Ala Arg 
    1580                 1585                 1590             


Ile Val  Val Val Thr Val Ser  Asp Ala Gln Arg Glu  Phe Val Leu 
    1595                 1600                 1605             


Ser Leu  Gly Phe Gly Ala Ala  Leu Arg Gly Val Val  Ser Leu Ala 
    1610                 1615                 1620             


Glu Leu  Lys Arg Arg Phe Gly  Asp Glu Phe Glu Trp  Pro Arg Thr 
    1625                 1630                 1635             


Met Pro  Pro Leu Pro Asn Ala  Arg Gln Asp Pro Gln  Gly Leu Lys 
    1640                 1645                 1650             


Glu Ala  Val Arg Arg Phe Asn  Asp Leu Val Phe Lys  Pro Leu Gly 
    1655                 1660                 1665             


Ser Ala  Val Gly Val Phe Leu  Arg Ser Ala Asp Asn  Pro Arg Gly 
    1670                 1675                 1680             


Tyr Pro  Asp Leu Ile Ile Glu  Arg Ala Ala His Asp  Ala Leu Ala 
    1685                 1690                 1695             


Val Ser  Ala Met Leu Ile Lys  Pro Phe Thr Gly Arg  Ile Val Tyr 
    1700                 1705                 1710             


Phe Glu  Asp Ile Gly Gly Arg  Arg Tyr Ser Phe Phe  Ala Pro Gln 
    1715                 1720                 1725             


Ile Trp  Val Arg Gln Arg Arg  Ile Tyr Met Pro Thr  Ala Gln Ile 
    1730                 1735                 1740             


Phe Gly  Thr His Leu Ser Asn  Ala Tyr Glu Ile Leu  Arg Leu Asn 
    1745                 1750                 1755             


Asp Glu  Ile Ser Ala Gly Leu  Leu Thr Ile Thr Glu  Pro Ala Val 
    1760                 1765                 1770             


Val Pro  Trp Asp Glu Leu Pro  Glu Ala His Gln Ala  Met Trp Glu 
    1775                 1780                 1785             


Asn Arg  His Thr Ala Ala Thr  Tyr Val Val Asn His  Ala Leu Pro 
    1790                 1795                 1800             


Arg Leu  Gly Leu Lys Asn Arg  Asp Glu Leu Tyr Glu  Ala Trp Thr 
    1805                 1810                 1815             


Ala Gly  Glu Arg 
    1820         


<210>  32
<211>  1575
<212>  DNA
<213>  Metallosphaera sedula

<400>  32
atgaccgcca cgttcgagaa gcccgatatg tcgaagttgg ttgaggagct tcgtgcgttg       60

aaagcgaaag catatatggg tggcggcgaa gaacgtgtcc aggcacaaca cgcaaaagga      120

aaactcactg ctcgcgaacg cctaaattta ttgtttgatg aagggacctt caacgaagta      180

atgaccttcg cgacaactaa agcaacggaa tttggccttg ataagtccaa agtatatggc      240

gacggagtcg tcacgggctg gggccaggta gaaggccgca cggtcttcgc cttcgcgcag      300

gactttacaa gtattggcgg aacattgggg gaaacgcatg ctagtaagat tgcaaaagtt      360

tacgaattag ccctaaaagt tggcgcccct gtcgttggga ttaatgatag tggcggcgcg      420

cgtattcaag aaggagccgt agccttggaa ggttatggta cagtattcaa agcgaacgtg      480

atggccagtg gagtcgttcc gcagattacc attatggcag gtccagcagc tggcggtgcc      540

gtttattcgc cagcgttaac ggactttatt ataatgatta aaggagacgc gtattatatg      600

ttcgtgaccg gtccagaaat aactaaagtg gtgcttggtg aagacgtttc gtttcaagac      660

ttgggtggcg cggtcatcca tgccacgaag tcgggagtgg ttcattttat cgctgaaaac      720

gagcaagata gcatcaacat taccaaacgc cttttaagtt atttgccgag taacaacatg      780

gaagaaccac cgtttatgga cacaggcgac cctgctgacc gcgagatgaa agacgtggaa      840

tccgttgttc caacggacac cgtaaaaccg ttcgatatgc gtgaagtcat ttatcgcacg      900

gtggacaacg gagaatttat ggaagtgcag aaacactggg cacagaacat ggtggttggc      960

ttcggccgcg tcgcggggaa cgtggtcggt attgtcgcca ataacagcgc gcacctcggg     1020

gccgcgattg atattgacgc gtcggacaaa gctgcgcgct ttattcgctt ttgcgacgca     1080

ttcaatatcc cgcttatctc ccttgtggac acgcctggtt atatgcccgg cactgaccag     1140

gagtacaaag gaataatccg ccatggcgcc aaaatgcttt atgcgttcgc agaagccact     1200

gttccgaaag taacggtggt ggttcgtcgc agctatggtg gagcacatat agcgatgtcc     1260

attaaatccc tgggcgctga tttaatttac gcatggccga gtgccgaaat tgctgtgacg     1320

gggcctgaag gggcggtgcg catactctat cgccgtgaga tccagaactc caaaagtcct     1380

gacgatttaa taaaagaacg tattgcagaa tataaaaaac tttttgcgaa cccgtactgg     1440

gccgcagaaa aaggccttat cgacgacgtc attgaaccga aagatacacg caaagtcatt     1500

gcttcggcgc ttaaaatgct aaaaaacaaa cgcgaatttc gctatccgaa aaaacacggc     1560

aatattccgc tctaa                                                      1575


<210>  33
<211>  1533
<212>  DNA
<213>  Metallosphaera sedula

<400>  33
atgccaccgt tctctcgtgt tcttgttgcc aaccgcggcg agatcgccgt ccgcgtcatg       60

aaagccatta aagagatggg catgactgcc attgcagttt atagtgaagc agacaaatat      120

gccgtacatg ttaaatacgc ggatgaggca tactacatcg gcccgtcacc cgcgcttgag      180

tcttatttaa acattccgca tataatcgac gccgctgaaa aagcacatgc agacgcagtt      240

caccctggct acggctttct ttcagaaaat gcagactttg tggaagccgt tgagaaagcc      300

ggcatgacgt atattggtcc gagtgcagaa gtaatgcgta aaattaaaga taaactcgat      360

gggaagcgca ttgcgcagct aagtggtgta ccgatcgcgc ctggatcaga tggaccggtc      420

gagagcatcg acgaagcact gaaacttgca gaaaaaattg gctatccgat aatggttaaa      480

gcggcatccg ggggtggtgg cgtcggtatt actaaaattg atactcctga ccagttaatc      540

gacgcctggg agcgtaacaa acgcctagca actcaagcgt ttggccgaag tgatttgtat      600

attgagaagg cggcggtcaa ccctcgccat atcgaatttc agctaatcgg agataaatat      660

ggaaactacg tagttgcatg ggaacgcgag tgcacgatcc agcgtcgtaa ccagaaactt      720

attgaagaag ccccaagtcc agccataact atggaggagc gctcgcgaat gtttgaacct      780

atttataagt acgggaaact aatcaattat ttcaccctcg gtacgtttga aactgttttt      840

agtgatgcga ctcgcgaatt ttattttctg gaactcaaca agcgcctgca agtcgagcat      900

ccagttacgg aactaatttt tcgtatcgat ctcgtcaaat tgcagattcg cttggcagcc      960

ggcgagcacc ttccatttac acaggaggag ttaaacaaac gcgctcgtgg tgccgccatt     1020

gaatttcgca ttaatgcgga agatccaatt aataattttt ccggctcctc gggttttatc     1080

acatattatc gcgaaccgac aggtcctggc gtgcgtatgg attccggtgt cacagaaggc     1140

tcctgggtcc ctccttttta tgacagtttg gtctcaaaac ttatcgtgta cggcgaggac     1200

cgccaatatg ccattcaaac ggcgatgcgc gccttggacg attataaaat cggaggcgta     1260

aagacaacga ttcccttgta taaattaata atgcgcgatc cggacttcca ggagggccgc     1320

ttttctacgg cgtacatcag ccagaaaatc gactcgatgg ttaaaaagct caaagcggag     1380

gaagaaatga tggcatcggt ggcggccgtt ctgcaatccc gcggcttact gcgtaaaaaa     1440

gcatcggcac ctcaggaaca ggctaagcca ggatcgggct ggaaatctta tggtataatg     1500

atgcagtcca cgcctcgcgt gatgtggggg taa                                  1533


<210>  34
<211>  504
<212>  DNA
<213>  Metallosphaera sedula

<400>  34
atgaaactct accgcgttca cgctgatact ggcgatacct ttatcgtggc gcatgatcaa       60

aaagagaaca aagaccgttt gaaaacagag aataacgaat ttgaaattga atacgtaggt      120

cagggtactc gcgagggcga gattatactc aaaatcaacg gtgaaatgca tcgcgtattt      180

attgacaacg gctggattat cctggacaat gcccgcattt ttcgcgccga acgtgttact      240

gaactgccga cgcaggaggg ccagactctc gacgaaatga taaagggtaa agaaggcgag      300

gtgttgtcgc ccctgcaagg acgtgtcgtt caggtacgcg ttaaagaggg agatgctgtg      360

aataaaggcc agccgctttt gtcaatcgaa gcgatgaagt ccgaaaccat tgtgtcagcc      420

ccaatttccg ggttggtgga aaaagtccta gttaaagccg gtcaaggcgt caaaaaaggc      480

gatatattgg tggtgattaa gtaa                                             504


<210>  35
<211>  1548
<212>  DNA
<213>  Nitrosopumilus maritimus

<400>  35
atgcactcgg aaaaacttga aaactataat aataaacaca aaacgtcgca gcagggaggt       60

ggtcaagatc gaataaaagc acaacatgat aaagggaaac tgacggcacg ggaacgcata      120

gatttactcc tggatgaggg tagttttact gaaatagacc cgatggttac gcatcattat      180

catgaatatg atatgcaaaa aaagaagttc tttacggatg gtgttgtggg tggttatggc      240

aacgtcaatg gtcgccagat attcgttttt gcctatgatt tcacggtgtt aggtggcacg      300

ctcagtcaga tgggtgcaaa aaaaattacg aaactgatgg atcatgcggt gcgcacgggc      360

tgcccggtga taggcataat ggattcgggt ggtgcgcgca ttcaggaagg catcatgagt      420

ttagatggct ttgcggatat tttttatcat aaccagctgg ccagcggcgt ggtgccacag      480

attaccgcga gtattggtcc atcggcgggt gggagcgtat atagcccggc catgaccgac      540

tttgtcgtta tggtagaaaa ggcgggcagc atgtttgtga cgggtccaga tgtggttaag      600

accgtgttgg gtgaagaaat tagcatggat gatttaggtg gcgccatgac ccatggtagc      660

aaaagtggcg tggcgcattt tgtggcgcag aatgaatacg aatgcatgga ttacataaaa      720

aaactgatat cgtacatccc gcagaacaac tcggaagaac cgccgaaaat caaaactgat      780

gatgatccga atcgcctgga taacaacctt attaacgtga taccggaaaa cccgctgcaa      840

ccatatgata tgaaagaaat tataaactcg attgttgata accatgagtt ctttgaagtg      900

catgaactgt ttgcgccgaa cattgtcgtt ggttatgccc gcatggatgg tcaggttgtg      960

ggcatcattg cgaataaccc gatgcatctg gcgggcgcgt tagacattga tagcagcaac     1020

aaatcggcgc gtttcattcg cttctgcgat gcgtttaata ttcctattat aacgctggtt     1080

gataccccgg gttacatgcc gggttcgaac caggaacaca atggtataat tcgccatggt     1140

agtaaattgc tttatgcgta ctgcgaagcg actgtgccgc gcattacctt agttattggc     1200

aaggcgtatg gtggggcgta cattgcgatg ggcagtaaga atttacggac ggacattaac     1260

tatgcgtggc cgacggcgcg ttgcgccgtt ctcggtggtg aagccgccgt taaaataatg     1320

aatcgcaaag atttggcgga cgcggataac ccagaagaat taaagaagaa attgattgat     1380

gagtttaccg aaaaattcga aaatccgtac gtggcggcga gccacggcac cgtggataac     1440

gttattgatc ctgcggaaac ccgcccaatg ttgattaaag cacttaaaat gttagcgaat     1500

aaacgcgaaa aacagttacc acgcaaacat ggcaacataa acctctaa                  1548


<210>  36
<211>  1488
<212>  DNA
<213>  Nitrosopumilus maritimus

<400>  36
atgatcgaga aagttttaat tgcgaatcgc ggcgaaattg cccttcgcgt tattcgcacc       60

tgcaacgcgc tgggtataaa gacggtggcg gtttactcgg atgaggatta caactcgctg      120

catgttaaga aagccgatga atcgtatcac attggagaag cggccccagc gaaatcgtat      180

ttaaaccagg aaaaaatttt agaagtaatg ctaagctcgg gtgcggatgc cgttcatccg      240

ggttatggtt tcttatcgga aaacgatgac tttgcgcgcc tgtgcgaaaa aaacaaaatt      300

aacttcattg gtccgtcggc cgactccatg aacctctgcg gtgataagat ggaatgcaaa      360

gcggcgatgc tgaaagccca ggtgccgacc gttccaggca gtccgggcct ggttgatact      420

gcggaagaag cggaaaaaat tgcgaacgaa attggttatc cagttctttt gaaaagcgtg      480

tatggtggtg gcggtcgtgg catacgcctg gtgactacgg atcaggaact ccgggaaggt      540

tttgaaaccg ttacgtcgga atcgattgcc gccgttggca aatcggcgat aattgtggaa      600

aaattcctcg aaaaaacccg ccacattgaa tatcagatgt gccgcgatca tcatggtaac      660

gccgttcacc tttttgagcg cgaatgctcg attcagcgcc gcaaccagaa actcattgaa      720

cagacgccat ccccagtggt tgatgaagcg aaacgggagg agattggtga actggtggtg      780

aaagcggcgg aagccgtcaa ctatacgaat ttaggtacgg cggaattttt acgcgcggat      840

aacggtgagt tttactttat tgagattaac gcgcgccttc aggttgaaca tccgataagt      900

gaaatggtga gcggcctgga ctttgttaaa ctgcaaattg atattgcgaa tggtgaaacc      960

ttaccgttca aacagaaaga tctcaagatg aacggttatg cgattgaatg ccgcataaac     1020

gccgaagaca cctttttgga ctttgcgcca agcacgggcc cagtgccgga tgttacaatt     1080

ccagcgggcc cgaacgtccg ctgcgacacg tatctctatc caggctgcac cgtttcgccg     1140

ttttacgata gcttgatggc gaaactttgc acctggggcc cgacctttga agaatcgcgc     1200

acgcgcatgt taacggcgct gaacgatatg tatgtgcagg gtgtggaaac cagcattccg     1260

ttatacaaaa ccattctcaa ttcggaagaa tacaaaaatg gtgaactcag cacggacttt     1320

ttgaaacgtt atgggatgat tgataaactc tcggaagact taaagaaaga aaaagaagac     1380

aagagtgaag ccgccttagc cgcggcaatt attcattcgg aatactttaa gaatcgcgtg     1440

cagaacgata atgcgtctag tgcgacgtgg aaaaacaaat tggactga                  1488


<210>  37
<211>  513
<212>  DNA
<213>  Nitrosopumilus maritimus

<400>  37
atggactata agatcgccga tgtggaaaaa agctttgaag gcaaaattac ggaaaatctg       60

ggtaacaacg attatgtaat taagataaac gacaaagaac atcagttgaa aatattatct      120

atgaacgcga aaggtatcga atttattctg gatcagcagt atcataaagc gaaatattta      180

gagacggcga cgaacgaaat gaacttagtt attgataacg tgccggtgac cctgaatatg      240

aacacgcact ttgacgaaat cgtgtacaaa aatagtgggg gtggtggggc gggtggtgcc      300

caggttgcgc ttaaaagtca gataccaggt aaagtggtaa gcattgcggt ggccgaaggt      360

gactcggtca agaaaggtga tgttgtgtgc acgctggaaa gcatgaagat gcaggtgggc      420

atcaaggcgc acaaagatgg tgaagtgaaa aaccttaaaa ttaaagaagg tgcgacggtc      480

gcgaaagggg acgtgattgc ggatctggaa taa                                   513


<210>  38
<211>  1548
<212>  DNA
<213>  Cenarchaeum symbiosum

<400>  38
atgcactctg agaaattgga taaacgtagc gcgaacaacc gctcggcgtt aatgggtggc       60

ggtgaagcgc gaatcgaagc gcagcatggc aaaggcaaat taaccgcgcg cgaacgcatt      120

gcgatcatgt tagatgaagg tagttttacg gaagtggata gcctggccac ccatcattat      180

catgaatttg atatgcagaa aaagaaattc tttggtgatg gtgtagttgg cggttatggc      240

cgcattgatg gccgcaaagt ttttgttttc gcctatgatt ttaccgtgat gggcggcacg      300

ttaagtcaga tgggcgcaaa gaaaatcact aaactgatgg atcacgcagt ccgcactggc      360

tgcccggtga ttggtgttat ggactccggt ggtgcgagaa tccaggaagg tattatgagt      420

ttagatggtt ttgccgatat tttctatcat aaccagttgg catcgggtgt ggtgccgcag      480

atcactgcta gtattggtcc aagcgccggt ggctcggtgt atagcccggc gatgacggat      540

tttgtgatta tggttgagaa aagcgcgacc atgttcgtta cgggtccgga tgtggtgcag      600

acggttttag gcgaatcgat cagctttgaa gatttaggcg gcgcgatgac ccatggttcg      660

aaaagtggcg tggcgcattt tgttgcaaaa aacgaatatg actgcatgga ttacatccgc      720

aaactgttaa gctttatccc gcagaacaac cgcgaagaac caccagtcgt caaaacagcg      780

gatgatccgg atcgcttaga tcatggcttg atcgggatga tcccggaaaa cccactgcaa      840

acctatgata tgaagaatgt gattcatagc attgtggatg atcgtacgtt cttggaggtg      900

catgagaact ttgcgacgaa tatcattgtc ggtttcggcc ggttcaacgg ccgcgcggca      960

gggattgtgg cgaaccagcc agcgagtttg gccggcgcct tagatattga tgcctcgagt     1020

aaagcggcac gcttcatccg gttctgcgat gccttcaaca ttccagtgat caccttggta     1080

gataccccag gttatatgcc gggctcggat caggaacatg gcggtattat ccggcatggc     1140

agtaaattat tatttgcata ttgcgaagcg accatcccga aaattacgct ggtcattggc     1200

aaagcgtatg gtggtgcgta tattgcgatg gcgagtaaaa acctggggac ggacatcaac     1260

tacgcctggc ccaccgcgcg ttgcgcggtg ttaggcgcag aggctgcggt caaaattatg     1320

aacaggaaag atctggctgc cgcatcggat ccggaaggtt taaagaaaga actgattggc     1380

aactttgcgg agaaattcga taacccatat gtagcggcct cgcatggtac tgtggatgcg     1440

gtcattgatc cggcagaaac ccgtccgatg ctgattaaag ccttagaaat gttaagctcg     1500

aaacgtgaag gccgtatatc gcgaaaacat gggaacatta acctctaa                  1548


<210>  39
<211>  1431
<212>  DNA
<213>  Cenarchaeum symbiosum

<400>  39
atgatccgca cctgccgcgc cttaggcttg ggtagcgtgg cagtctattc ggatgaagat       60

tacaacgccc tgcatgttaa gaaagcatcg gaatcctatc atattggcgg tgccgcgcca      120

gctgagtcgt atttaaacca gcagcgcatc attgaagccg ccttatcgtc gggcgccgac      180

gcgatacatc cagggtatgg ctttttaagc gaaaacggcg aatttgcggc cctgtgcgag      240

aaaaaccgca ttaactttat cggcccctcg gccaagagca tgaacctgtg cggcgataaa      300

atggaatgca aagcggcaat gttaaaagcg gacgtgccga cggtaccggg cagtccaggc      360

cttgtgggct ccgcggatga agccgcgggc attgcctcca aaattggcta ccccgtactg      420

ttaaaaagcg tttttggcgg tggcggccgt ggcatccgtc tggctgagga tgaaggcggt      480

ttacgcggcg ggtacgactc tgcgacagca gagagcattg ccgctgtcgg caaaagcgcg      540

attctggtgg agaaattctt aaaacgcacc cgtcacattg agtatcagat ggcccgtgat      600

aaacatggga acgcagttca tattttcgag cgcgaatgca gcattcagcg acgaaaccag      660

aaattaatcg aacagacccc gagccccgtc atggatgaag acacccgtaa acgcattggc      720

gacctggtgg ttaaagcagc ggaagcggtt gattacacca acctggggac ggcagaattt      780

cttcgtgccg actcgggcga attttacttc attgaaatca acgcccgcct gcaagtggaa      840

catccgatta cggaactggt tagcggtctg gatctcgtta aactgcaaat tgatattgca      900

aacggcgaac cgctgccgtt caaacagaat gatctgcgca tgaacggcta tgcgattgaa      960

tgccgcatta acgcagagga cacgtttttg gattttgccc caagcgttgg tccagttcca     1020

gatgttaaac tgccttcggg tccaggcgtg cggtgcgata cttatctgta tcccgggtgc     1080

actgttagcc cattctacga ctctctgatg gcaaaactgt gcacctgggg tgccactttc     1140

gaagaatccc gcttacgcat gctgggcgcg ttaggcgatt tttatgtgga gggggtggaa     1200

acttcgatcc cgttatataa aacgattatg gcatcggatg aatataagaa cggcgaatta     1260

tcgacggatt ttttatcgcg ctataatatc attgatcgcc tggataagga tatcaagaaa     1320

gaacgcgcgg caaacggcga agctgccgca gcggcggcga ttatgcatag cgaatttctc     1380

agcagtcgcg ccggcggtaa cagtgggacc gcatggaaag ggggcgcctg a              1431


<210>  40
<211>  510
<212>  DNA
<213>  Cenarchaeum symbiosum

<400>  40
atgaaatatg aaattgagga tgccggctcg ttcgaaggcc gcatggccgc aaacccgggg       60

aacggggagt atactctgga aattaacggg aaggaagtgc ggttaaaagt cattagcatg      120

ggcccgcgtg gtatggaatt tctgctggat caaaaatatc atagcgcacg atatctggaa      180

cgcagtactt cgggcataga tatgattatc gatgggacgc cggttcgcgc aggcatgcac      240

gcagacctag ataaaattgt ttataagaat agcggcggcg gtgggggcgg cggcccgggc      300

atagcgctgc ggagtcagat tcctggcaaa gttgtctccc ttgaagtctc ggaaggtgat      360

gaaattaaaa aaggcgatcc ggtggccgtt ttggaatcca tgaaaatgca ggtggcggtt      420

aaagcccata aggatggcac ggtcaagtcg gttagtatta aagaaggcgg cagtgttgca      480

aaaaacgatg ttatcgcgga aattgaataa                                       510


<210>  41
<211>  1551
<212>  DNA
<213>  Halobacterium sp.

<400>  41
atgaccatgg aggaacgcat tgaagatttg cgcgaacaga ccgaacgcgc actgctgggc       60

ggcggagaag ctcgcattga aagtcaacat gaaaaaggta aattaacggc tcgcgaacgc      120

attgattatt ttctggatga tggcacgttt aacgaattgg accagttacg cacgcatcgc      180

agtacgaact ttgatatgga tgagacgaaa ctgccaggcg atggcgtggt gaccggctat      240

ggcgatgtga acggccgtac gacgtttgtt tttgcccatg attttaccgt gtttggtggt      300

tcgctggggg aagtttttgc ggaaaaggtt acgaaggtta tggaccgcgc gatggaagtg      360

ggcgcgccag tggttggtct gaacgatagc gcgggcgcac gcattcagga aggcgtggat      420

gcgttaggcg gttttgcgga aatttttacc cgcaacgaaa aagcctcggg cgtggtgcca      480

cagattagcg cgattatggg tccgtgtgcg ggcggggccg tgtatagccc ggcgattacg      540

gactttaccg tgatggttaa agatacgtcg catatgttta ttaccggtcc ggatgtgatt      600

gaaacggtta ccggcgaaca ggtgggcttt gaagaactgg gcggcgcgac cacccatgcc      660

gccgagagcg gtgtggcaca ctttgcctgc gattcggaag aagcggcctt agataacatt      720

aaacgcttac tgagctacct gccacagaac aacgtggaag atccgcctcg tgtggaacca      780

tatgatgatc cagaacgccg cgatgatgcc ctggagacca ttgtgccgga tgaacctcgt      840

aaaccatatg atatgaccga tgtggtggat tcggtggtgg atgaacagag cttctttgaa      900

gttcaggcgg attatgcgaa aaacattgtg gtgggttttg cgcggctgga tggccgtagc      960

gtgggcattg tggcgaacca gccacgcgtt aacgccggca ccctggatat tgatgcctcc     1020

gaaaaaggct cgcgttttgt tcgtttttgt gatagcttta acgtgccgat tttaacgctg     1080

gttgatgttc caggtttttt accgggcacc gatcaagaac atggcggcat tattcgtcat     1140

ggcgcgaaac tgttatatgc gttttcggaa gcgtcggttc cattaatgac ggtgattacg     1200

cgtaaagcct atggcggcgc ctatgatgtt atggcgtcga aacatattgg cgcggatgtg     1260

aactatgcgt ggccgaccgc cgaaattgcc gttatgggtc cgaaaggcgc ggttaacgtg     1320

ctgtattccg acgagctgga ggcggccgat gataccgcgg cgcgtcgcca agaactgatt     1380

gatgaatatc gcgaagaatt tgccaaccct tatacggccg ccgatcgcgg ctatctggat     1440

gccgtgattg aaccgaccga aacccggccg cgtctgattg atgatctgga tatgttggcg     1500

tcgaaacgcg aagaaacgcc ggataaaaag catggcaaca ttccgctctg a              1551


<210>  42
<211>  1776
<212>  DNA
<213>  Halobacterium sp.

<400>  42
atgatcatga aggtacggat tggcgtgggg gcgacggatg cggaagccag tgcggtggcg       60

gcagcactgg cggcacatgt gagtgatgat gtcgcggttt acttaggcga tgccgatgaa      120

ccggcagcag tgcatgaacc agaaccaccc gccgatgatt cggccgatgc cgatgatctg      180

ggtccgaccg aacgcgaaga agttctgcgc gaagaaattg cggatattct ggatggcggc      240

ccagaaaaat ataagcagcg cttaccagaa caggataagt tatttgtgcg cgatcgcctg      300

gccctgtggt ttggcgatga tgatgatggc gatgatgacc tgctgtttga agatggcaga      360

tttgcgcact ttgatggctg gcacccaaac tcgccagacg tggatgaagc agatgatggc      420

acacgtgtgc cggccgatgg tctgattacg ggcgcggcgg actttgatgg ccgtgatctg      480

cattttatgg ccaacgattt taccgttaag gcgggtagca tggccgaacg cggcgtggaa      540

aaatttctgc gcatgcagca gcgcgcgctg aaaaccggca agccagtttt gtatttaatg      600

gatagtagcg gtggtcgcat tgatcagcag agcggttttt ttgcgaaccg cgagggcatt      660

ggcaagtatt attttaatca tagtcgtctg agtggccgcg tgccacagat ttgcgtttta      720

tatggcccat gtattgccgg tgcagcgtat acgccggttt ttgcggattt tacgattatg      780

gtggaaggta tgagtgcgat ggccattgcc tcgccacgta tggttgaaat ggttaccggc      840

gaacagattg aaatgcagga tctgggcggc ccgcaggttc atgccgaaca gagtggctcg      900

gccgatttag tggcccgcga tgaagatcat gcccgcgaat tagtggcgga tctggttcag      960

tatctgccag ataactcgga tgaaaagccg ccatcccagc cggcgaaacc gcctgcgaaa     1020

ccgccaaaag gcattgatgg cttaattccg gaagcaccga accgcgccta tgatatgcat     1080

gatctgattg gccgcgttgt ggaccaggat agtttctttg aattacgtcc agaatatggg     1140

gccgaaattt taacgggtta tgcgcgcatt gatggccgta cggtgggcat tgtggcgaac     1200

cagccagccc agcgtgccgg cgccattttt ccggatgccg ccgaaaaagc cgcggaattt     1260

gtgtggaaaa gtgatgccta taacattccg ttgttatatc tgtgtgatac gccaggcttt     1320

atgccgggta gcagtgtgga aaaggatgcc attttagaaa agggcaaaaa aatgatttat     1380

gccacctcgg aagccaccgt gccgaaacag agtgtggttg ttcgtaaagc ctatggcgcc     1440

ggcatttatg cgatgagtgg tccagcctat gatccagaaa gtaccattgc actgcctagt     1500

ggcgaaattg gtattatggg tccagaggcc gcgattaacg ccgtgtatgc gaacaaactg     1560

gatgccattg atgatccaga agaacgcaag cagcgcgaac aggaactgcg cgaagcgtat     1620

cgcgaagata ttgatgccca tcgtatggcc agtgaaacgg tgattgatga aattgttccg     1680

ccaagtgagc tgcgtacaga gctgagcaac cgtttttcgt tttatgaaga tgttgaaaag     1740

gatcgcccga gcaaaaagca tggcaccatt ctttga                               1776


<210>  43
<211>  1833
<212>  DNA
<213>  Halobacterium sp.

<400>  43
atgtttgaaa aagtactggt cgccaaccgc ggcgaaattg cggttcgagt tatgcgtgcg       60

tgtgatgatc tgggtgtgga tacggtggct gtttattcgg acgccgatgc gcatgccgga      120

catgttcgtt atgccgatga agcgtataac gtgggtccgg cgcgcgcggc ggatagttac      180

ctggatcatg atgccattat tgatgcggcc acgcgtgccg gtgccgatgc cattcatcca      240

ggctatggct ttctggccga aaacgccgaa tttgcgggca aggtggaaga taccgatggc      300

gtgacgtggg tgggcccgag tgcggatagc atgcgccagc tgggcgaaaa aacctcggcc      360

cgtaaaacga tgcgtgaagc cgatgttcca attgtgccag ggaccaccga tccagttgaa      420

agtgtggccg atattcatga atttggcgag gagcatggct atccaattgc tattaaagcc      480

gaaggcggag ggggcggccg tggcatgaaa attgttcgct cggccgatga agccgaggat      540

caattagaaa gcgcggaacg agaaggggaa gcgtattttg ataacgcgaa cgtgtactta      600

gaacgctatt tggaaaaccc acgccatatt gaagtgcaga ttctcgcgga tcatcatggt      660

aacgtgcgtc atctgggcga acgtgattgt agcttacagc gccgccatca gaaggtgatt      720

gaggagggcc cgtcgccagc gctgacggat gaacttcgcg aagaaattgg tacggcggcc      780

cgtcgaggag ccgatgccgc gggctattat aacgcaggca cctttgaatt tctggtggag      840

gaggataccg aacgtgaacc aggggatctg ctgggtccgg aaacggaatt ttattttctg      900

gaggttaaca cccgtatcca ggtggaacat accgtgacgg aagcgctgac gggcgtggat      960

atcgttaaat ggcagctgaa aattgcaagt gatgatgaac tgacctttga acaggatgat     1020

gttgcattag atggccatgc cgtggaatat cgcattaacg cggaaaacgc ggccgatgat     1080

tttgcgccag ccacgggcgg cgaattagaa acctatgatc cgccgggcgg cattggcgtt     1140

cgtgtggatg atggcctgcg tcagggcgat gacctggtga ccgattatga tagcatggtg     1200

gcgaaactga ttgttcatgg ctcggatcgc gaagaatgtt tagcgcgtag tcgtcgcgcg     1260

ctggccgaat atgatattga aggcattccg acgattattc cgtttcaccg tttaatgtta     1320

accgatgatg cgtttgtggg tggcacgcat acgacgaaat acctggatcg cgatattgaa     1380

gaaagccgta ttagtgatgc ccaggcggaa tggggtacga cgacggccag tgaaagtagt     1440

gccgacgaaa acgttgttga acgtgacttt acggttgagg tgaacggcaa acgttttgaa     1500

gtgaacttag aagaacgtgg tgcggcgcag tttgctgccc cagaggcgga taccggtggt     1560

ggtggtccgc cggaaccagc gggtggagca gatgatggtg aaacggtggt tgaaggtgat     1620

ggcgaaacgg tgacggcgga aatgcagggt acgattttag atgttgccgt tagtgaaggc     1680

gacgcggtgg atgcaggaga tgttttagtc gttttagagg cgatgaagat ggaaaacgat     1740

gtggttgcga gtcatggggg cacggtgacg caggtggccg tgtcggaaga tgattcggtg     1800

gatatggatg atgttttagt tgtgattgac tga                                  1833


<210>  44
<211>  3129
<212>  DNA
<213>  Halobacterium sp.

<400>  44
atgaccgaag attcgcgtac gatattactg attggttcgg gcccaattca gattggccag       60

gcagcggaat ttgattatag cggcgcgcag gcttgtcgcg cgctgcaaga agaaggcgcg      120

cgtgtcgttc tggttaacag taaccctgcg accattatga ccgatccaga aatggccgat      180

gcggtgtata ttgaaccaat tgaaccagat gccattgccg aagtgattga acaggaagat      240

ccagatggcg tgattgccgg tttaggcggc cagaccggct taaacgtgac ggcagcgtta      300

gccgaacagg gcgtgctgga tgagcatgat gtggatgtta tgggtacgcc gttggatacc      360

atttatgcga ccgaagatcg cgatttattt cgccagcgca tggcggatct gggccagccg      420

gttccggcca gcacgaccat tgcgctgggc gatgatgaaa cggcaacgga tattgatgaa      480

ggtgcgttac gggaacgcgt ggatgatgcg gtggaagcgg tgggcggttt accggttatt      540

gcgcgtacga cgtatacgtt aggcggctcg ggcagtgggg tggtgcatga ttttgaagcg      600

ctggttgatc gtgttcgcac gggcttacgt ctgagccgta acgccgaagt tttagtgacg      660

gaaagtatta ccggttgggt ggagttagaa tatgaagtta tgcgtgatgc cggcgatagt      720

tgcattattg tgtgcaacat ggaaaacatt gatccgatgg gcattcatac cggcgaaagt      780

acggtggtga cgccatcgca gattattccg gatgatggcc atcaggaaat gcgcaacgcc      840

gccgtggcgg tgattcgcga attgggcatt cagggcggct gcaacattca gtttgcgtgg      900

cgcgatgatg gcacgccagg cggcgaatat cgtgtggtgg aagtgaaccc gcgcgtgagc      960

cgtagtagcg cgttagcgag caaagcgacc ggctatccaa ttgcgcgtgt gaccgcgaaa     1020

gtggcgctgg gcaaacgcct gcatgaaatt gataacgaaa ttaccggcca gaccacggcg     1080

gcctttgaac cggccattga ttatgttgtt acgaaggttc cacgttggcc taacgataaa     1140

tttccagaag ttgattttga attaagcacg gcgatgaaaa gtacgggtga agcgatggcc     1200

attggtcgca cctttgaaga atcgttatta aaagcgttac gcagttcgga atatgacccg     1260

tcggtggatt gggcgaccgt gtcggatgat gagctggccg ccgattatct gcaacgcccg     1320

agtccagatc gtccatatgc cgtgtttgaa gcgtttgaac gtggttttac agtgggcgat     1380

gttaacgatc atacgggctt tcgtgaatgg tacttacagc gttttcagaa tgtggcagcg     1440

gccagtgcgg cagcaagtga aggtgatgtt gcgacgccgg cagcgctggg ttatacgaat     1500

agtgcggtgg cggcgttagc gagcgacggt ggagatgttg ccgtggatga tgttgcggcg     1560

actgccccag aacgtacgtt taaacaggtg gatacctgtg cgggcgaatt tgcggccagc     1620

acgccgtatt attatagcgc gcgcagccag ggaagcacgg gatcggatgt tcgtgccgat     1680

cgtgatgccc attcggttgt gattgtgggt ggcggcccaa ttcgcattgg tcagggcgtt     1740

gaatttgatt attgtacggt tcatgccgtt cgcgcgctgc gcgaagcggg cattgatgcc     1800

catgttgtta acaacaaccc ggaaacggtt agcacggatt atgataccag tgatggttta     1860

ttttttgaac cgattacggc ggaagaagtg gcggatgtgg ttgaagcgac gaacgccgat     1920

ggcgttatgg ttcagtttgg cggccagacc tcggttgatg ttggcgcgcc actggaggcc     1980

gaattagaac gccgcggcct ggattgtgaa attatgggta cggatgttga tgcgatggac     2040

ctggcggaag atcgcgatcg ctttaatcgt ttactggatg aacgcgatat ttcgcagcca     2100

gatggtggtt cagcgacctc cgttgcgggc gcgttagaac tggccgccga agtgggatac     2160

ccggttttgg tccgcccgtc gtatgttctg ggaggtcgcg cgatggaaat tgttcatgat     2220

gatgatgaac tgcgccgcta tgtggaagaa gccgtgcgtg tgtcgccaga aaaaccggtc     2280

ctggtggatg aatttctggc cgatgccgtg gaactggatg ttgatgccgt gtcggatggc     2340

gaagatgttt tagttggcgg cgttatggaa catattgaat cggcgggggt gcatagtggt     2400

gatagcgcgt gcgttatccc gcctcggggt ctgggcgatg atattctggc gcgtgtccgt     2460

gaagtgacca ccgaaattgc gcgtgcgtta gatacggttg gattattaaa cgtgcaactg     2520

gccgtgcagg atggcgaggt ttatgtgctg gaggcgaacc cgcgtagtag ccgcaccgtc     2580

ccgtttgtga gcaaagcgac cggcgtgccg attgcgaaac tggcggcgaa agttatggcc     2640

ggcgaaagct tagccgatct ggatgcatcg gaaggcgtac cagaacagta ttcggtgaag     2700

gaagttgttt tgccgtttga tcgtctcccg ggtagcgacc cgcgcctggg cccggaaatg     2760

aaaagtacgg gcgaagttat gggcaccgcc tcggacccgg gtatggccta ttggaaagcc     2820

caggttgcgg cgagtaacgc accagtacca ggtagcacgg cggtggtgga tctcttagtg     2880

gaaggtctcg gcgaacgttt tgaagtggtt acggtgaagg atgttccggc ggcgattcga     2940

cgcggcgaag tggaatttct ggtgtcggat gatcgcgatg cgttaaccgc ggctgtggaa     3000

gccgaaattc cgtatgtgag tacggttgcg gcagcggaag cgatgcgaga aggtattgcg     3060

gcggccgatg gggcacgaga agcaatgccg gtggccgatc gtccggtgaa tgatgaaacg     3120

tggggatga                                                             3129


<210>  45
<211>  1437
<212>  DNA
<213>  Halobacterium sp.

<400>  45
atggcccgcg agtttacgct tcccgatgtg ggagaaggcg tggcggaagg cgagctggtt       60

cgttggctgg tggatgaagg cgataccgtg accgaagacc agccggtggc ggaagtggag      120

acggataagg cacaggtgga agttccagcg ccggtggatg gtacggtgca ggagctgcat      180

tgggcggagg gcgatgttgt tccggtgggc gatctgtttg ttacgtttga tgtcgatggc      240

gaagcgagcg ccaccgcgga tgatggtgat gagagtggtg atgaagccgc gagcgcgacc      300

agtgaagcca gtggtcgtac gtttgcgccg ccgagtgtcc gtacgctggc ccgtgaatta      360

ggcgtggatc tggatagtgt ggaaggtagt ggtccgagcg gtcgtattac cgatggcgat      420

gttcgtgccg ccgcggaagg tggtgaagat accacggaac cggccaccga agccacgagt      480

gcgacggagc gtgtggatga agatgatacc gcggcgagtg caggcagtca agaaccggcg      540

ggccgtgaaa aaacgcttgc ggcaccagcg acccgtggtg tggcccgaga attaggcgtg      600

gatattaacg atgttccggc ggtggagcag cgtgatggtg aagcgtttgt gaccgcggaa      660

gcggtgcagg cgtatgcaga aggtggtcag gcagcacagg gtgaagcggg tggtgcagcc      720

acccgtgaat ttgtggccgg cggtgaaacc accgaaccat atcgtggcat tcgccgcacc      780

attggcgaac agatggccga aagcaaatat acggcgcctc atgtgaccca tcatgatacc      840

gccgtgattg attcgctggt tgaaacgcga agcaaattaa aggcgcgcgc cgaagccgaa      900

gatgttaaat taacgtatat gccgtttgtt atgaaagcgg tggtggcggc gctgaaagaa      960

tttccagttt tgaacagtga actgcgcgaa gatgatgaag aaattgcgct gaaacaggac     1020

tataacattg gcgttgcggt ggcgaccgat gccggtttaa tggttccggt ggtggaacat     1080

gtggatcaga aaagcatgct cgaaattagt acggaaatga acgatctggt ggaacaggcc     1140

cgcgaacgca gcattgcgcc agcggatatg gatggcggaa cttttacgat tacgaacttt     1200

ggcgcgattg gcggcgaata tgcgaccccg attattaact atccagaaac ggcgatttta     1260

ggtttaggcg cgattgatga acgcccggtg gccgaagatg gcgatgttcg tgcggcccag     1320

acgttaccgt tgagcttaag cattgatcat cgcgtgattg atggcgcgga agccgcgcag     1380

tttacgaacc gcgttatgga atacttaaca gatccagagc ttcttcttct ggaataa        1437


<210>  46
<211>  1170
<212>  DNA
<213>  Methylcoccus capsulatus

<400>  46
atggcaaggc ccttaatcca gctggctctg gacagtttgg atcgggatcg tacgttagag       60

ttagcgcgcg tgacggcccc ttacgtggat atttttgaaa ttggcacgcc atgtattaag      120

tataacggca ttgagattgt gcgcgaatta aaacgccgtc atccggaccg tttagtgctg      180

gtggatttaa aaaccatgga cgcgggcgag tatgaagcgg ccccgttcta cgcggcgggc      240

gctgatattt gcaccgtttt aggtgtttcg ggtccggcca ccattgcggg tgtggtcaag      300

gccgcgcagg cccataatgc ggaggtgcag gttgacctga ttaacgttcc ggataaagct      360

gcgtgcgccc gcgaagccgc gcgtttaggc gcgcagatta ttggggttca taccgggctg      420

gacgcgcagg cgcaggggca gacgccgttt gcggacttag agagcattgc gcgcctgaaa      480

ctgccggtga gaatttctgt tgctggtggt attaaccaga acaccgcgtc tcgtgtggcg      540

aaagccggtg cggatattgt ggtggtgggg gccgccattt atggcgcccc atgtccagcg      600

accgccgcgc gcacgatccg cgaactgctg gagggtgctc accataaatt tattgttagt      660

aaaattggcg gcgttcttgc ggcgactgat aaaagctatg aagcccggct gaccgggtta      720

ttagagcggg cgcgccggat ctttgtggcg ggcgcgggtc ggagtggcct ggtgggccgc      780

ttctttgcga tgcgtctgat gcatggcggc taccaggctt acatcgttgg cgaaattgtt      840

acgccaagca ttcggcaagg cgacctcctg attgttatca gtgggtcggg cgagaccgag      900

accatgattg cttatgcgaa aaaggcgaaa gagcagggtg cgagcattgc cctgattacc      960

acccgcgata aaagtacgat tggggatatg gcagatgttg tttttcgtat tggcactcca     1020

gaacagtatg gcaaagttgt ggggatgccg atgggcacca cctttgaact gagtaccctg     1080

gttctgttag aggcgacgat cagtcatatt attcacacca aaaaaattcc agaagaacag     1140

atgcgtaccc gccatgcgaa tctggagtaa                                      1170


<210>  47
<211>  648
<212>  DNA
<213>  Methylcoccus capsulatus

<400>  47
atggcaaggc ccttgatcca gttagcgctg gatacgctgg atattccgca gaccctgaaa       60

ttagcaagct taaccgcccc atacgtggac atttttgaga ttggcacccc aagcattaaa      120

cataacggca ttgcgctggt taaagaattt aagaagcgct ttccaaacaa actgttactg      180

gtggatttaa agaccatgga tgcgggggag tatgaggcga ccccattttt tgcggcgggc      240

gcggatatta ccaccgtgtt aggcgtggca ggactggcga ccattaaagg cgtgattaac      300

gcggcgaaca aacataacgc ggaagtgcag gtggatctga ttaacgtgcc agataaagcg      360

gcgtgcgcgc gggaaagtgc gaaagcgggc gcgcagattg tgggcattca taccggctta      420

gatgcgcagg cggcgggcca gaccccattt gcggatttac aggcgattgc gaaattaggc      480

ttaccagtgc gcattagtgt ggcgggcggc attaaagcga gtaccgcgca acaggtggtg      540

aagaccgggg cgaacattat tgtggtggga gcggcgattt atggcgcggc gagtccagcg      600

gacgcggccc gcgagattta tgagcaggtg gtggcggcta gtgcgtaa                   648


<210>  48
<211>  534
<212>  DNA
<213>  Methylcoccus capsulatus

<400>  48
atgcaccaga agctgattat agataagatt agtggcattt tagcggcgac cgacgcgggc       60

tacgacgcaa agctgactgc gatgttagat caggcgagtc gcatttttgt ggccggtgcg      120

ggccgttcgg gtctggtggc gaaatttttt gcgatgcgct taatgcatgg cggctacgat      180

gtgtttgtgg tgggcgagat tgtgacccca agcattcgca aaggcgattt gctgattgtt      240

attagtggca gtggggagac cgagaccatg ttagcgttta ccaagaaggc gaaagaacag      300

ggcgcgagta ttgcgttaat tagtacccgc gatagcagta gtttaggcga tttagcggat      360

agtgtgtttc gcattggcag tcccgaatta tttggaaagg tggtgggcat gccaatgggc      420

accgtgtttg aattaagtac cttattattt ttagaagcga ccatttcaca tattattcat      480

gaaaagggca ttccagagga ggagatgagg actcggcatg cgaacctgga gtaa            534


<210>  49
<211>  1221
<212>  DNA
<213>  Mycobacterium gastri

<400>  49
atgaaattac aagttgcgat tgatctgctg agtaccgagg cggcgttaga actggcgggc       60

aaagtcgcgg aatatgttga tattattgag ctgggcaccc cactgattga agcggaaggc      120

ctgagcgtta ttaccgcggt taaaaaagct catccggata aaattgtttt tgcggatatg      180

aaaaccatgg atgcgggcga attagaggcg gatattgcct ttaaagcggg cgctgatctg      240

gttacggttt taggcagcgc ggatgatagt accattgccg gggcggttaa agcggcgcag      300

gctcataaca aaggcgttgt tgttgatctg attggcattg aagataaagc gacccgggca      360

caggaggtcc gcgcgctggg ggcgaaattt gttgaaatgc atgctgggct ggatgaacag      420

gcgaaaccag gctttgatct gaacgggctg ttagcggcgg gcgaaaaagc tcgtgtcccg      480

tttagtgtgg cgggaggcgt gaaggtcgcc accattccag cagttcagaa agcgggtgca      540

gaggttgcgg ttgcgggagg tgcgatttat ggggcagcgg atccggcggc ggcggcaaaa      600

gagctgcgtg cggcaattgc gatgacgcaa gcggcagagg cggatggtgc ggtgaaagtt      660

gttggagatg atattaccaa caacctcagt cttgtccgtg atgaagttgc cgataccgcc      720

gccaaggttg atccggaaca ggtggctgtt ttagctcgcc aaattgttca gcccggacgt      780

gtctttgtgg ccggcgcggg gcgcagtggt ttagttctgc gcatggcggc gatgcgtctg      840

atgcattttg gcttaaccgt gcatgttgcc ggcgatacca ccaccccggc aatttctgcg      900

ggcgacctgc tgctggtggc tagtggcagc ggcaccacca gtggggtggt taaaagtgcg      960

gaaacggcga aaaaagcggg tgcacgtatt gcggcgttta ccaccaatcc ggactcaccg     1020

ctggcggggc tggcggatgc ggtggtgatt attccagcgg cccagaaaac cgaccatggc     1080

agccatatca gccgtcagta tgcgggatcc ctctttgaac aggtgctgtt tgttgttacc     1140

gaggcggtgt ttcagagcct gtgggaccat accgaagttg aagcggagga gttatggacg     1200

cgccatgcga acttagaatg a                                               1221


<210>  50
<211>  406
<212>  PRT
<213>  Mycobacterium gastri

<400>  50

Met Lys Leu Gln Val Ala Ile Asp Leu Leu Ser Thr Glu Ala Ala Leu 
1               5                   10                  15      


Glu Leu Ala Gly Lys Val Ala Glu Tyr Val Asp Ile Ile Glu Leu Gly 
            20                  25                  30          


Thr Pro Leu Ile Glu Ala Glu Gly Leu Ser Val Ile Thr Ala Val Lys 
        35                  40                  45              


Lys Ala His Pro Asp Lys Ile Val Phe Ala Asp Met Lys Thr Met Asp 
    50                  55                  60                  


Ala Gly Glu Leu Glu Ala Asp Ile Ala Phe Lys Ala Gly Ala Asp Leu 
65                  70                  75                  80  


Val Thr Val Leu Gly Ser Ala Asp Asp Ser Thr Ile Ala Gly Ala Val 
                85                  90                  95      


Lys Ala Ala Gln Ala His Asn Lys Gly Val Val Val Asp Leu Ile Gly 
            100                 105                 110         


Ile Glu Asp Lys Ala Thr Arg Ala Gln Glu Val Arg Ala Leu Gly Ala 
        115                 120                 125             


Lys Phe Val Glu Met His Ala Gly Leu Asp Glu Gln Ala Lys Pro Gly 
    130                 135                 140                 


Phe Asp Leu Asn Gly Leu Leu Ala Ala Gly Glu Lys Ala Arg Val Pro 
145                 150                 155                 160 


Phe Ser Val Ala Gly Gly Val Lys Val Ala Thr Ile Pro Ala Val Gln 
                165                 170                 175     


Lys Ala Gly Ala Glu Val Ala Val Ala Gly Gly Ala Ile Tyr Gly Ala 
            180                 185                 190         


Ala Asp Pro Ala Ala Ala Ala Lys Glu Leu Arg Ala Ala Ile Ala Met 
        195                 200                 205             


Thr Gln Ala Ala Glu Ala Asp Gly Ala Val Lys Val Val Gly Asp Asp 
    210                 215                 220                 


Ile Thr Asn Asn Leu Ser Leu Val Arg Asp Glu Val Ala Asp Thr Ala 
225                 230                 235                 240 


Ala Lys Val Asp Pro Glu Gln Val Ala Val Leu Ala Arg Gln Ile Val 
                245                 250                 255     


Gln Pro Gly Arg Val Phe Val Ala Gly Ala Gly Arg Ser Gly Leu Val 
            260                 265                 270         


Leu Arg Met Ala Ala Met Arg Leu Met His Phe Gly Leu Thr Val His 
        275                 280                 285             


Val Ala Gly Asp Thr Thr Thr Pro Ala Ile Ser Ala Gly Asp Leu Leu 
    290                 295                 300                 


Leu Val Ala Ser Gly Ser Gly Thr Thr Ser Gly Val Val Lys Ser Ala 
305                 310                 315                 320 


Glu Thr Ala Lys Lys Ala Gly Ala Arg Ile Ala Ala Phe Thr Thr Asn 
                325                 330                 335     


Pro Asp Ser Pro Leu Ala Gly Leu Ala Asp Ala Val Val Ile Ile Pro 
            340                 345                 350         


Ala Ala Gln Lys Thr Asp His Gly Ser His Ile Ser Arg Gln Tyr Ala 
        355                 360                 365             


Gly Ser Leu Phe Glu Gln Val Leu Phe Val Val Thr Glu Ala Val Phe 
    370                 375                 380                 


Gln Ser Leu Trp Asp His Thr Glu Val Glu Ala Glu Glu Leu Trp Thr 
385                 390                 395                 400 


Arg His Ala Asn Leu Glu 
                405     


<210>  51
<211>  1020
<212>  DNA
<213>  Synechococcus elongatus

<400>  51
atgaccattc gagttgcgat caatggcttt ggccgtattg gccggaattt tctccgttgc       60

tggtttggac ggcagaacac cgatcttgag gttgtggcca ttaacaacac ctcggatgca      120

cggacggctg ctcacctgct ggagtacgac tctgttctcg gccggttcaa cgccgacatc      180

agctacgacg aaaattcgat caccgtcaac ggcaagacga tgaaaatcgt ctgcgatcgc      240

aaccccctca acctgccttg gaaagagtgg gatatcgatc tcgtcattga atctacaggt      300

gtgttcgtca ccgctgaagg cgcatccaag cacatccaag ccggggccaa gaaagttctg      360

atcacggctc ctggtaaagg cgaaggtgtc ggcacctacg tcatcggtgt caacgattcg      420

gaataccgcc acgaagactt cgcagtcatc agcaatgcaa gctgcaccac caactgctta      480

gcaccggtcg ccaaagttct gcatgacaac tttggcatca tcaaaggcac gatgaccacc      540

acccacagct acacgctgga ccagcgcatc ttggacgcca gccaccgtga tctacgtcgg      600

gctcgggctg ccgccgttaa catcgttccc accacgaccg gcgctgctaa agccgttgct      660

ttggtgatcc ccgagctgaa aggcaaacta aacgggattg cgctgcgcgt tcctacgcca      720

aacgtgtctg tcgttgactt ggtggttcaa gtcgagaaac cgacgatcac tgagcaggtc      780

aatgaagtcc tgcaaaaagc ttctcaaacg acgatgaagg gcatcatcaa gtactcggat      840

ctgcccttgg tatcttccga cttccggggt actgacgagt cttcgatcgt tgactccagc      900

ctgaccttgg taatggatgg cgatctcgtc aaagtaattg cttggtacga caacgagtgg      960

ggctacagcc aacgagttgt cgacttggct gaactggccg ctcgcaaatg ggccgcctaa     1020


<210>  52
<211>  1038
<212>  DNA
<213>  Synechococcus elongatus

<400>  52
atggaaaaga cgatcggtct cgagattatt gaagttgtcg agcaggcagc gatcgcctcg       60

gcccgcctga tgggcaaagg cgaaaagaat gaagccgatc gcgtcgcagt agaagcgatg      120

cgggtgcgga tgaaccaagt ggaaatgctg ggccgcatcg tcatcggtga aggcgagcgc      180

gacgaagctc cgatgctcta tatcggtgaa gaagtgggca tctaccgcga tgcagacaag      240

cgggctggcg taccggctgg caagctggtg gaaatcgaca tcgccgttga cccctgcgaa      300

ggcaccaacc tctgcgccta cggtcagccc ggctcgatgg cagttttggc catctccgag      360

aaaggcggcc tgtttgcagc tcccgacttc tacatgaaga aactggctgc acccccagct      420

gccaaaggca aagtagacat caataagtcc gcgaccgaaa acctgaaaat tctctcggaa      480

tgtctcgatc gcgccatcga tgaattggtg gtcgtggtca tggatcgtcc ccgccacaaa      540

gagctaatcc aagagatccg ccaagcgggt gcccgcgtcc gtctgatcag cgatggtgac      600

gtttcggccg cgatctcctg cggttttgct ggcaccaaca cccacgccct gatgggcatc      660

ggtgcagctc ccgagggtgt gatttcggca gcagcaatgc gttgcctcgg cggtcacttc      720

caaggccagc tgatctacga cccagaagtg gtcaaaaccg gcctgatcgg tgaaagccgt      780

gagagcaaca tcgctcgcct gcaagaaatg ggcatcaccg atcccgatcg cgtctacgac      840

gccaacgaac tggcttcggg tcaagaagtg ctgtttgcgg cttgcggtat caccccgggc      900

ttgctgatgg aaggcgtgcg cttcttcaaa ggcggcgctc gcacccagag cttggtgatc      960

tccagccagt cacggacggc tcgcttcgtt gacaccgttc acatgttcga cgatgtcaaa     1020

acggttagcc tccgctaa                                                   1038


<210>  53
<211>  1002
<212>  DNA
<213>  Synechococcus elongatus

<400>  53
atgtcgaagc cagatcgtgt tgttttgatc ggcgttgccg gtgactccgg ttgcggcaaa       60

tcaaccttcc taaatcgcct tgccgacttg tttggtacgg aattgatgac ggtcatctgc      120

ttggatgact atcacagtct cgatcgcaag ggccggaagg aagcaggcgt aacggctttg      180

gatccccgcg ccaacaactt tgacttgatg tatgaacagg tcaaggcgtt gaagaacggc      240

gaaacgatca tgaagccgat ctacaaccat gaaaccggct tgatcgatcc gcccgaaaaa      300

atcgaaccca atcgcatcat tgtgatcgag ggtctgcatc cgctttacga cgagcgcgtg      360

cgtgaactgc tcgatttcag cgtttacctc gacatcgatg acgaagtcaa aatcgcttgg      420

aagatccaac gcgatatggc agaacgcggc cactcctacg aagatgtcct cgcctcgatc      480

gaagcgcgcc gccctgactt caaggcctac attgagcccc agcgtggcca tgcggacatc      540

gtcatccgcg tcatgccgac ccagctaatc cccaatgaca ccgagcgcaa ggtgctgcgg      600

gtgcagttga tccaacggga aggccgcgat ggttttgagc cggcttacct gttcgacgaa      660

ggttcgacca tccagtggac gccctgcggt cgtaagctga cctgctccta tccgggcatt      720

cgcttagcct acggccctga cacctactac ggtcacgaag tctcagtgct tgaggtcgac      780

ggtcagttcg agaacctcga ggagatgatc tacgtcgagg gccacctcag caagaccgac      840

acgcagtact acggtgagtt gacccacctg ctgctgcaac acaaagatta cccgggttcg      900

aacaacggca cgggtctgtt ccaagtgctg accggcctga aaatgcgggc ggcctatgag      960

cgtttgacct cccaagcagc acccgtcgcc gcaagcgtat aa                        1002


<210>  54
<211>  858
<212>  DNA
<213>  Escherichia coli

<400>  54
atgaaagctg acaacccttt tgatctttta cttcctgctg caatggccaa agtggccgag       60

gaggcgggtg tctataaagc aacgaaacat ccgcttaaga ctttctatct ggcgattacc      120

gccggtgttt tcatctcaat cgcattcgtc ttctatatca cagcaaccac tggcacaggc      180

acaatgccct tcggcatggc aaaactggtt ggcggcattt gcttctctct ggggctgatt      240

ctttgtgttg tctgcggagc cgatctcttt acttccaccg tgttgattgt tgttgctaag      300

gcgagtgggc gcatcacctg gggtcagttg gcgaaaaact ggctaaatgt ctattttggc      360

aacctggtcg gcgcactgct gtttgtactt ttaatgtggc tttccggcga gtatatgacc      420

gcaaatggtc aatggggact aaacgtccta caaaccgccg accacaaagt gcaccatact      480

tttattgagg ccgtctgtct tggtatcctg gcaaacctga tggtatgtct ggcagtatgg      540

atgagttatt ctggccgcag cctgatggac aaagcgttca ttatggtgct gccggtcgcg      600

atgtttgttg ccagcggttt tgagcacagt atcgcaaaca tgtttatgat cccgatgggt      660

attgtaatcc gcgacttcgc atccccggaa ttttggaccg cagtcggttc tgcaccggaa      720

aatttttctc acctgaccgt gatgaatttc atcactgata acctgattcc ggttacgatc      780

ggcaacatta tcggtggtgg tttgttggtt gggttgacat actgggtcat ttacctgcgt      840

gaaaacgacc accactaa                                                    858


<210>  55
<211>  849
<212>  DNA
<213>  Escherichia coli

<400>  55
atgcgcaaca aactctcttt cgacttgcag ttgagcgcca gaaaagcggc aatcgctgaa       60

cggattgccg cccataaaat tgcccgcagt aaagtgtcgg tctttttaat ggcgatgtcc      120

gctggcgtgt ttatggcgat cggatttact ttttaccttt ccgttatcgc cgatgccccg      180

tcttcacagg cattaaccca tctggtgggc ggcctttgct ttacactcgg ctttattttg      240

ctggcggttt gcggcaccag cctgttcacc tcgtcggtaa tgacggtgat ggcaaaaagt      300

cggggcgtta ttagttggcg aacttggctg attaacgcac ttctggtggc ctgcggtaat      360

ctggcaggta ttgcctgttt cagtttgtta atctggtttt ccgggctggt gatgagtgaa      420

aacgcgatgt ggggagtcgc ggttttacac tgcgccgagg gcaaaatgca tcatacattt      480

actgaatctg tcagcctcgg cattatgtgc aatctgatgg tttgcctggc gctgtggatg      540

agttattgcg ggcgttcgtt atgcgacaaa atcgtcgcca tgattttgcc catcaccctg      600

tttgtcgcca gtggctttga gcactgtatc gccaatttgt ttgtgattcc gttcgccatt      660

gccattcgcc atttcgcccc tccccctttc tggcagctgg cgcacagtag cgcagacaat      720

tttccggcac tgacggtcag ccattttatt accgccaatc tgctcccggt gatgctgggt      780

aatattatcg gcggtgcggt gctggtgagt atgtgttatc gggctattta tttacgtcag      840

gaaccataa                                                              849


<210>  56
<211>  3461
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA

<400>  56
aaaaaaaatc cttagctttc gctaaggatg atttctggaa ttcgcggccg cttctagagc       60

ccacagctaa caccacgtcg tccctatctg ctgccctagg tctatgagtg gttgctggat      120

aactttacgg gcatgcataa ggctcgtatg atatattcag ggagaccaca acggtttccc      180

tctacaaata attttgttta actttgcccg cgctctcccc cagtgtgaga gtgttcacac      240

aggaaagtac tagatggcca ctgttctatg cgtgctatat cctgatccgg ttgatggtta      300

tccaccgcac tatgttcgtg ataccatccc ggtgatcacc cgatatgcag atggccaaac      360

agcccccact cccgcggggc cgccaggatt tcgtcccggt gaactggtgg gcagtgtttc      420

tggtgcgctg ggacttcgcg gttaccttga agcccatggt cacactctca tcgtcacatc      480

ggataaagat ggtccggata gtgagtttga aagacggctg cctgatgccg atgttgtcat      540

cagccagccg ttttggcccg catatcttac ggctgaacgt atcgcgaggg cgccgaagtt      600

acgtctggct ctgactgctg gtataggctc agaccacgtt gacctcgatg ccgcggcgcg      660

tgctcacatt acggtcgccg aagtgactgg aagtaacagt atttcagtgg ctgaacacgt      720

tgttatgaca acgctggcct tagtgcggaa ctatttacct agccacgcaa ttgcgcagca      780

aggtggttgg aacatcgccg actgtgtttc acgctcttat gacgtcgaag gaatgcattt      840

cggcacagta ggggcgggta ggattggatt ggctgttctg cgccggctta aaccgtttgg      900

tctgcatttg cattacaccc aaagacatcg cttggatgca gccatcgaac aagaactcgg      960

tcttacttac catgccgatc cagccagtct tgcggcggca gtagacattg ttaatttgca     1020

gattccgctg tatccttcca ctgaacacct ttttgatgct gcaatgattg cacgcatgaa     1080

aagaggtgcg tacctgatta atactgcccg tgcgaagtta gtggaccgcg atgccgtcgt     1140

cagggctgtc acaagcggac atctggctgg ttatggcggg gacgtctggt ttccccagcc     1200

tgctccggct gatcatccgt ggcgggcgat gccttttaat ggcatgacac ctcatattag     1260

cggtacttca ctttctgctc aggcgcggta cgcagcgggg acccttgaaa tcctccagtg     1320

ttggtttgat ggcagaccga tcaggaacga gtacctgata gtggatggag gaacattggc     1380

cggtacaggt gcccaatcat atcggctgaa gtaagccaca cgcgctctcc cccctccggt     1440

gtaatcgggg gagagcgcgt gtccgctgca gtccggcaaa aaagggcaag gtgtcaccac     1500

cctgcccttt ttctttaaaa ccgaaaagat tacttcgcgt tatgcaggct tcctcgctca     1560

ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg     1620

taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc     1680

agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccac aggctccgcc     1740

cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac     1800

tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc     1860

tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata     1920

gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc     1980

acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca     2040

acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag     2100

cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta     2160

gaagaacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg     2220

gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc     2280

agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt     2340

ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa     2400

ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat     2460

atgagtaaac ttggtctgac agctcgaggc ttggattctc accaataaaa aacgcccggc     2520

ggcaaccgag cgttctgaac aaatccagat ggagttctga ggtcattact ggatctatca     2580

acaggagtcc aagcgagctc gatatcaaat tacgccccgc cctgccactc atcgcagtac     2640

tgttgtaatt cattaagcat tctgccgaca tggaagccat cacaaacggc atgatgaacc     2700

tgaatcgcca gcggcatcag caccttgtcg ccttgcgtat aatatttgcc catggtgaaa     2760

acgggggcga agaagttgtc catattggcc acgtttaaat caaaactggt gaaactcacc     2820

cagggattgg ctgagacgaa aaacatattc tcaataaacc ctttagggaa ataggccagg     2880

ttttcaccgt aacacgccac atcttgcgaa tatatgtgta gaaactgccg gaaatcgtcg     2940

tggtattcac tccagagcga tgaaaacgtt tcagtttgct catggaaaac ggtgtaacaa     3000

gggtgaacac tatcccatat caccagctca ccgtctttca ttgccatacg aaattccgga     3060

tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt gtgcttattt     3120

ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt ataggtacat     3180

tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga tatatcaacg     3240

gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga aaatctcgat     3300

aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt ggaacctctt     3360

acgtgcccga tcaactcgag tgccacttga cgtctaagaa accattatta tcatgacatt     3420

aacctataaa aataggcgta tcacgaggca gaatttcaga t                         3461


<210>  57
<211>  3395
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA

<400>  57
aaaaaaaatc cttagctttc gctaaggatg atttctggaa ttcgcggccg cttctagagc       60

ccacagctaa caccacgtcg tccctatctg ctgccctagg tctatgagtg gttgctggat      120

aactttacgg gcatgcataa ggctcgtatg atatattcag ggagaccaca acggtttccc      180

tctacaaata attttgttta actttgcccg cgctctcccc cagtgtgaga gtgttcacac      240

aggaaagtac tagatgaaaa ttgtactggt gctctatgat gcaggaaaac acgccgcaga      300

cgaggaaaag ctgtatggct gcactgagaa caagctagga atcgccaatt ggctgaagga      360

tcagggccat gaattaatca ctacctccga taaagaaggt gaaacctcag agttagataa      420

gcacattccc gatgccgata taatcattac gacgcctttt cacccagctt acattacaaa      480

agagcgtctg gataaagcga aaaacctcaa atcggttgta gtcgccggcg tcggttccga      540

ccacattgac ctggattata ttaatcagac tggtaagaag atcagcgtcc tggaagtcac      600

cggctctaat gtggtatctg ttgctgagca tgttgtaatg actatgctgg ttttagtgcg      660

caattttgtg cccgcacacg agcagatcat aaaccatgac tgggaagtag cagcaatagc      720

taaagatgcg tatgatattg aaggcaaaac tatcgctacg atcggcgcgg gccggatcgg      780

ttaccgggtt ctggagcggc tgctgccgtt caatcctaaa gagctcctat actatgatta      840

tcaggcactg cccaaggaag cagaggaaaa agttggtgcg cggagagtgg aaaacattga      900

agaacttgtg gctcaggccg acattgtaac ggtaaatgct ccacttcacg caggcaccaa      960

aggccttatc aataaagagt tgctttcaaa gtttaagaaa ggtgcctggt tggtaaatac     1020

ggcccgtgga gcaatttgcg ttgcggagga tgtcgccgcc gctctggaat cgggacagct     1080

ccggggatac ggtggggatg tttggtttcc ccagccggcg ccaaaggatc acccgtggcg     1140

tgatatgcga aacaaatatg gcgcagggaa cgccatgaca ccgcattact ccgggacgac     1200

cttagatgca caaactcgat acgctgaagg taccaagaac atcctggaaa gtttctttac     1260

gggcaagttt gattatcgcc ctcaggatat tattctgctt aatggagaat atgtaacaaa     1320

agcttacggc aaacatgaca aaaagtaagc cacacgcgct ctcccccctc cggtgtaatc     1380

gggggagagc gcgtgtccgc tgcagtccgg caaaaaaggg caaggtgtca ccaccctgcc     1440

ctttttcttt aaaaccgaaa agattacttc gcgttatgca ggcttcctcg ctcactgact     1500

cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac     1560

ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa     1620

aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccacaggctc cgcccccctg     1680

acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa     1740

gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc     1800

ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac     1860

gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac     1920

cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg     1980

taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt     2040

atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa     2100

cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct     2160

cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga     2220

ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg     2280

ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct     2340

tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt     2400

aaacttggtc tgacagctcg aggcttggat tctcaccaat aaaaaacgcc cggcggcaac     2460

cgagcgttct gaacaaatcc agatggagtt ctgaggtcat tactggatct atcaacagga     2520

gtccaagcga gctcgatatc aaattacgcc ccgccctgcc actcatcgca gtactgttgt     2580

aattcattaa gcattctgcc gacatggaag ccatcacaaa cggcatgatg aacctgaatc     2640

gccagcggca tcagcacctt gtcgccttgc gtataatatt tgcccatggt gaaaacgggg     2700

gcgaagaagt tgtccatatt ggccacgttt aaatcaaaac tggtgaaact cacccaggga     2760

ttggctgaga cgaaaaacat attctcaata aaccctttag ggaaataggc caggttttca     2820

ccgtaacacg ccacatcttg cgaatatatg tgtagaaact gccggaaatc gtcgtggtat     2880

tcactccaga gcgatgaaaa cgtttcagtt tgctcatgga aaacggtgta acaagggtga     2940

acactatccc atatcaccag ctcaccgtct ttcattgcca tacgaaattc cggatgagca     3000

ttcatcaggc gggcaagaat gtgaataaag gccggataaa acttgtgctt atttttcttt     3060

acggtcttta aaaaggccgt aatatccagc tgaacggtct ggttataggt acattgagca     3120

actgactgaa atgcctcaaa atgttcttta cgatgccatt gggatatatc aacggtggta     3180

tatccagtga tttttttctc cattttagct tccttagctc ctgaaaatct cgataactca     3240

aaaaatacgc ccggtagtga tcttatttca ttatggtgaa agttggaacc tcttacgtgc     3300

ccgatcaact cgagtgccac ttgacgtcta agaaaccatt attatcatga cattaaccta     3360

taaaaatagg cgtatcacga ggcagaattt cagat                                3395


<210>  58
<211>  3556
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA

<400>  58
aaaaaaaatc cttagctttc gctaaggatg atttctggaa ttcgcggccg cttctagagc       60

ccacagctaa caccacgtcg tccctatctg ctgccctagg tctatgagtg gttgctggat      120

aactttacgg gcatgcataa ggctcgtagg ctatattcag ggagaccaca acggtttccc      180

tctacaaata attttgttta acttttcaca caggaaacct actagatggc ccatattgtg      240

gtcctggggg ccgggctcgg cggcgccatt atggcatatg agctccgcga gcaggtgcgc      300

aaagaggata aagttaccgt tattaccaaa gatccgatgt atcattttgt gccaagcaac      360

ccatgggtgg cggtgggctg gcgcgatcgc aaagaaatta ccgtggattt agcgccgacg      420

atggcgcgca aaaacattga ttttattccg gtggcagcga aacgcctgca tccggcggag      480

aaccgtgttg aactggagaa cggccagagc gtttcgtacg atcagattgt tattgccacc      540

ggcccggagc tggcctttga tgaaattgaa ggcttcggcc cagaaggcca cacgcaaagc      600

atttgccata ttgatcatgc cgaagaagcg cggctggcct tcgatcgctt ctgcgagaac      660

ccaggcccga ttttgattgg tgcggcgcag ggcgcctcgt gctttggccc ggcttacgag      720

tttaccttta ttttagacac cgcgctgcgc aaacgcaaaa ttcgcgataa agtgccgatg      780

acctttgtta ccagcgaacc atatgttggt catctgggtc tggatggtgt gggcgatacc      840

aaaggcctgt tggagggcaa cctgcgcgat aaacacatta agtggatgac cagcacccgt      900

attaagcgcg ttgagaaagg caaaatggtg gttgaagaag tgaccgaaga tggcacggtt      960

aaaccagaaa aggaactgcc atttggctat gcgatgatgc tgccagcgtt tcgcggcatt     1020

aaagcgctga tgggtattga aggtctggtt aatccgcgcg gctttgttat tgttgaccag     1080

caccagcaga acccgacctt taaaaacgtt tttgcggttg gcgtttgcgt ggcgattccg     1140

ccgattggtc cgacgccggt gccatgcggc gtgccgaaaa ccggctttat gattgagtcg     1200

atggttaccg ccaccgccca caacattggc cgtattgtgc gcggtttcga agccgatgaa     1260

gttggctcgt ggaacgccgt ttgtctggcc gactttggcg accagggcat tgccttcgtt     1320

gcgcagccgc agattccgcc gcgcaacgtg aactggagct cgcagggcaa gtgggtgcat     1380

tgggccaaag aaggttttga acgctatttt atgcacaaac tgcgccgcgg taccagtgaa     1440

accttttatg agaaagccgc gatgaaattc ctgggcattg ataaactgaa agccgttaag     1500

aaagggtaag ccacacgcgc tctcccccct ccggtgtaat cgggggagag cgcgtgtccg     1560

ctgcagtccg gcaaaaaagg gcaaggtgtc accaccctgc cctttttctt taaaaccgaa     1620

aagattactt cgcgttatgc aggcttcctc gctcactgac tcgctgcgct cggtcgttcg     1680

gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg     1740

ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa     1800

ggccgcgttg ctggcgtttt tccacaggct ccgcccccct gacgagcatc acaaaaatcg     1860

acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc     1920

tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc     1980

ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc     2040

ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg     2100

ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc     2160

actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga     2220

gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg gtatctgcgc     2280

tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac     2340

caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg     2400

atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc     2460

acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa     2520

ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt ctgacagctc     2580

gaggcttgga ttctcaccaa taaaaaacgc ccggcggcaa ccgagcgttc tgaacaaatc     2640

cagatggagt tctgaggtca ttactggatc tatcaacagg agtccaagcg agctcgatat     2700

caaattacgc cccgccctgc cactcatcgc agtactgttg taattcatta agcattctgc     2760

cgacatggaa gccatcacaa acggcatgat gaacctgaat cgccagcggc atcagcacct     2820

tgtcgccttg cgtataatat ttgcccatgg tgaaaacggg ggcgaagaag ttgtccatat     2880

tggccacgtt taaatcaaaa ctggtgaaac tcacccaggg attggctgag acgaaaaaca     2940

tattctcaat aaacccttta gggaaatagg ccaggttttc accgtaacac gccacatctt     3000

gcgaatatat gtgtagaaac tgccggaaat cgtcgtggta ttcactccag agcgatgaaa     3060

acgtttcagt ttgctcatgg aaaacggtgt aacaagggtg aacactatcc catatcacca     3120

gctcaccgtc tttcattgcc atacgaaatt ccggatgagc attcatcagg cgggcaagaa     3180

tgtgaataaa ggccggataa aacttgtgct tatttttctt tacggtcttt aaaaaggccg     3240

taatatccag ctgaacggtc tggttatagg tacattgagc aactgactga aatgcctcaa     3300

aatgttcttt acgatgccat tgggatatat caacggtggt atatccagtg atttttttct     3360

ccattttagc ttccttagct cctgaaaatc tcgataactc aaaaaatacg cccggtagtg     3420

atcttatttc attatggtga aagttggaac ctcttacgtg cccgatcaac tcgagtgcca     3480

cttgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag gcgtatcacg     3540

aggcagaatt tcagat                                                     3556


<210>  59
<211>  3555
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA

<400>  59
aaaaaaaatc cttagctttc gctaaggatg atttctggaa ttcgcggccg cttctagagc       60

ccacagctaa caccacgtcg tccctatctg ctgccctagg tctatgagtg gttgctggat      120

aactttacgg gcatgcataa ggctcgtatg atatattcag ggagaccaca acggtttccc      180

tctacaaata attttgttta acttttcaca caggaaagta ctagatggcc catattgtgg      240

tcctgggggc cgggctcggc ggcgccatta tggcatatga gctccgcgag caggtgcgca      300

aagaggataa agttaccgtt attaccaaag atccgatgta tcattttgtg ccaagcaacc      360

catgggtggc ggtgggctgg cgcgatcgca aagaaattac cgtggattta gcgccgacga      420

tggcgcgcaa aaacattgat tttattccgg tggcagcgaa acgcctgcat ccggcggaga      480

accgtgttga actggagaac ggccagagcg tttcgtacga tcagattgtt attgccaccg      540

gcccggagct ggcctttgat gaaattgaag gcttcggccc agaaggccac acgcaaagca      600

tttgccatat tgatcatgcc gaagaagcgc ggctggcctt cgatcgcttc tgcgagaacc      660

caggcccgat tttgattggt gcggcgcagg gcgcctcgtg ctttggcccg gcttacgagt      720

ttacctttat tttagacacc gcgctgcgca aacgcaaaat tcgcgataaa gtgccgatga      780

cctttgttac cagcgaacca tatgttggtc atctgggtct ggatggtgtg ggcgatacca      840

aaggcctgtt ggagggcaac ctgcgcgata aacacattaa gtggatgacc agcacccgta      900

ttaagcgcgt tgagaaaggc aaaatggtgg ttgaagaagt gaccgaagat ggcacggtta      960

aaccagaaaa ggaactgcca tttggctatg cgatgatgct gccagcgttt cgcggcatta     1020

aagcgctgat gggtattgaa ggtctggtta atccgcgcgg ctttgttatt gttgaccagc     1080

accagcagaa cccgaccttt aaaaacgttt ttgcggttgg cgtttgcgtg gcgattccgc     1140

cgattggtcc gacgccggtg ccatgcggcg tgccgaaaac cggctttatg attgagtcga     1200

tggttaccgc caccgcccac aacattggcc gtattgtgcg cggtttcgaa gccgatgaag     1260

ttggctcgtg gaacgccgtt tgtctggccg actttggcga ccagggcatt gccttcgttg     1320

cgcagccgca gattccgccg cgcaacgtga actggagctc gcagggcaag tgggtgcatt     1380

gggccaaaga aggttttgaa cgctatttta tgcacaaact gcgccgcggt accagtgaaa     1440

ccttttatga gaaagccgcg atgaaattcc tgggcattga taaactgaaa gccgttaaga     1500

aagggtaagc cacacgcgct ctcccccctc cggtgtaatc gggggagagc gcgtgtccgc     1560

tgcagtccgg caaaaaaggg caaggtgtca ccaccctgcc ctttttcttt aaaaccgaaa     1620

agattacttc gcgttatgca ggcttcctcg ctcactgact cgctgcgctc ggtcgttcgg     1680

ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg     1740

gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag     1800

gccgcgttgc tggcgttttt ccacaggctc cgcccccctg acgagcatca caaaaatcga     1860

cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct     1920

ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc     1980

tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg     2040

gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc     2100

tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca     2160

ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag     2220

ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct     2280

ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc     2340

accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga     2400

tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca     2460

cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat     2520

taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagctcg     2580

aggcttggat tctcaccaat aaaaaacgcc cggcggcaac cgagcgttct gaacaaatcc     2640

agatggagtt ctgaggtcat tactggatct atcaacagga gtccaagcga gctcgatatc     2700

aaattacgcc ccgccctgcc actcatcgca gtactgttgt aattcattaa gcattctgcc     2760

gacatggaag ccatcacaaa cggcatgatg aacctgaatc gccagcggca tcagcacctt     2820

gtcgccttgc gtataatatt tgcccatggt gaaaacgggg gcgaagaagt tgtccatatt     2880

ggccacgttt aaatcaaaac tggtgaaact cacccaggga ttggctgaga cgaaaaacat     2940

attctcaata aaccctttag ggaaataggc caggttttca ccgtaacacg ccacatcttg     3000

cgaatatatg tgtagaaact gccggaaatc gtcgtggtat tcactccaga gcgatgaaaa     3060

cgtttcagtt tgctcatgga aaacggtgta acaagggtga acactatccc atatcaccag     3120

ctcaccgtct ttcattgcca tacgaaattc cggatgagca ttcatcaggc gggcaagaat     3180

gtgaataaag gccggataaa acttgtgctt atttttcttt acggtcttta aaaaggccgt     3240

aatatccagc tgaacggtct ggttataggt acattgagca actgactgaa atgcctcaaa     3300

atgttcttta cgatgccatt gggatatatc aacggtggta tatccagtga tttttttctc     3360

cattttagct tccttagctc ctgaaaatct cgataactca aaaaatacgc ccggtagtga     3420

tcttatttca ttatggtgaa agttggaacc tcttacgtgc ccgatcaact cgagtgccac     3480

ttgacgtcta agaaaccatt attatcatga cattaaccta taaaaatagg cgtatcacga     3540

ggcagaattt cagat                                                      3555


<210>  60
<211>  7740
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA

<400>  60
aaaaaaaatc cttagctttc gctaaggatg atttctggaa ttcgcggccg cttctagagc       60

ccacagctaa caccacgtcg tccctatctg ctgccctagg tctatgagtg gttgctggat      120

aactttacgg gcatgcataa ggctcgtatg atatattcag ggagaccaca acggtttccc      180

tctacaaata attttgttta acttttcaca caggaaagta ctagatgatc gacactgcgc      240

cccttgcccc accacgggcg ccccgctcta atccgattcg ggatcgagtt gattgggaag      300

ctcagcgtgc tgctgcgctg gcagatcccg gtgcctttca tggcgcgatt gcccggacag      360

ttatccactg gtacgaccca caacaccatt gctggattcg cttcaacgag tctagtcagc      420

gttgggaagg gctggatgcc gctaccggtg cccctgtaac ggtagactat cccgccgatt      480

atcagccctg gcaacaggcg tttgatgata gtgaagcgcc gttttaccgc tggtttagtg      540

gtgggttgac aaatgcctgc tttaatgaag tagaccggca tgtcacgatg ggctatggcg      600

acgaggtggc ctactacttt gaaggtgacc gctgggataa ctcgctcaac aatggtcgtg      660

gtggtccggt tgtccaggag acaatcacgc gacggcgtct gttggtggag gtggtgaagg      720

ctgcgcaggt gttgcgcgat ctgggcctga agaagggtga tcggattgct ctgaatatgc      780

cgaatattat gccgcagatt tattatacgg aagcggcaaa acgactgggt attctgtaca      840

cgccggtctt cggtggcttc tcggacaaga ctctttccga ccgtattcac aatgccggtg      900

cacgagtggt gattacctct gatggcgcgt atcgcaacgc gcaggtggtg ccctacaaag      960

aagcgtatac cgatcaggcg ctcgataagt atattccggt tgagactgcg caggcgattg     1020

ttgcgcagac cctggccacc ttgcccctga ctgagtcgca gcgccagacg atcatcaccg     1080

aagtggaggc cgccctggca ggtgagatta cggttgagcg ttcggacgtg atgcgtgggg     1140

ttggttctgc cctcgcaaag ctccgcgatc ttgatgcaag cgtgcaggca aaggtgcgca     1200

cagtactggc gcaggcgctg gtcgagtcgc cgccgcgggt tgaagctgtg gtggttgtgc     1260

gtcataccgg tcaggagatt ttgtggaacg aggggcgaga tcgctggagt cacgacttgc     1320

tggatgctgc gctggcgaag attctggcca atgcgcgtgc tgcaggcttt gatgtgcaca     1380

gtgagaatga tctgctcaat ctccccgatg accagcttat ccgtgcgctc tacgccagta     1440

ttccctgtga accggttgat gctgaatatc cgatgtttat catttacaca tcgggtagca     1500

ccggtaagcc caagggtgtg atccacgttc acggcggtta tgtcgccggt gtggtgcaca     1560

ccttgagggt cagttttgac gccgagccgg gtgatacgat atatgtgatc gccgatccgg     1620

gctggatcac cggccagagc tatatgctca cagccacaat ggccggtaga ctgaccgggg     1680

tgattgccga gggatcaccg cttttcccct cagccgggcg ttatgccagc atcatcgagc     1740

gctatggggt gcagatcttt aaggcgggtg tgaccttcct caagacagtg atgtccaatc     1800

cgcagaatgt tgaagatgtg cgactctatg atatgcactc gctgagagtt gcaaccttct     1860

gcgccgagcc ggtaagtccg gcggtgcagc agtttggtat gcagatcatg accccgcagt     1920

atatcaattc gtactgggcg accgagcacg gtggaattgt ctggacgcat ttctacggta     1980

atcaggactt tccgcttcgt cccgatgccc atacctatcc cttgccctgg gtgatgggtg     2040

atgtctgggt ggccgaaact gatgagagcg ggacgacgcg ctatcgggtc gctgatttcg     2100

atgagaaggg cgagattgtg attaccgccc cgtatcccta cctgacccgc acactctggg     2160

gtgatgtgcc cggtttcgag gcgtacctgc gcggtgagat tccgctgcga gcctggaagg     2220

gtgatgccga gcgtttcgtc aagacctact ggcgacgtgg gccaaacggt gaatggggct     2280

atatccaggg tgattttgcc atcaagtacc ccgatggtag cttcacgctc cacggacgct     2340

ctgacgatgt gatcaatgtg tcgggccacc gtatgggcac cgaggagatt gagggtgcca     2400

ttttgcgtga ccgccagatc acgcccgact cgcctgtcgg taattgtatt gtggtcggtg     2460

cgccgcatcg tgagaagggt ctgaccccgg ttgccttcat tcaacctgcg cctggccgtc     2520

atctgaccgg tgcagacagg cgccgtctcg atgagctggt gcgcaccgag aagggggcgg     2580

tcagtgtccc agaggattac atcgaggtca gtgcctttcc cgaaacccgc agcgggaagt     2640

atatgaggcg ctttttgcgc aatatgatgc tcgatgaacc actgggtgat acgacgacgt     2700

tgcgcaatcc tgaagtgctc gaagaaattg cagccaagat cgctgagtgg aaacgccgtc     2760

agcgtatggc cgaagaacag cagatcatcg aacgctatcg ctacttccgg atcgagtatc     2820

atccaccaac ggccagtgcg ggtaaactcg cggtagtgac ggtgacaaat ccgccggtga     2880

acgcactgaa tgagcgtgcg ttagatgagt tgaacacaat tgttgaccac ctggcccgtc     2940

gtcaggatgt tgccgcaatt gtcttcaccg gacagggcgc caggagtttt gtcgccggtg     3000

ctgatattcg ccagttgctc gaagaaattc atacggttga agaagcaatg gccctgccga     3060

ataacgccca tcttgctttc cgcaagattg agcgtatgaa taagccgtgt atcgcggcga     3120

tcaacggtgt ggcgctcggt ggtggtctgg aatttgccat ggcctgccat taccgggttg     3180

ccgatgtcta tgccgaattt ggtcagccag agattaatct gcgcttgcta cctggttatg     3240

gtggcacgca gcgcttgccg cgtctgttgt acaagcgcaa caacggcacc ggtctgctcc     3300

gagcgctgga gatgattctg ggtgggcgta gcgtaccggc tgatgaggcg ctggagctgg     3360

gtctgatcga tgccattgct accggcgatc aggactcact gtcgctggca tgcgcgttag     3420

cccgtgccgc aatcggtgcc gatggtcagt tgatcgagtc ggctgcggtg acccaggctt     3480

tccgccatcg ccacgagcag cttgacgagt ggcgcaaacc agacccgcgc tttgccgatg     3540

acgaactgcg ctcgattatc gcccatccac gtatcgagcg gattatccgg caggcccata     3600

ccgttgggcg cgatgcggca gtgcaccggg cactggatgc aatccgctat ggcattatcc     3660

acggcttcga ggccggtctg gagcacgagg cgaagctctt tgccgaggca gtggttgacc     3720

cgaacggtgg caagcgtggt attcgcgagt tcctcgaccg ccagagtgcg ccgttgccaa     3780

cccgccgacc attgattaca cctgaacagg agcaactctt gcgcgatcag aaagaactgt     3840

tgccggttgg ttcacccttc ttccccggtg ttgaccggat tccgaagtgg cagtacgcgc     3900

aggcggttat tcgtgatccg gacaccggtg cggcggctca cggcgatccc atcgtggctg     3960

aaaagcagat tattgtgccg gtggaacgcc cccgcgccaa tcaggcgctg atttatgttc     4020

tggcctcgga ggtgaacttc aacgatatct gggcgattac cggtattccg gtgtcacggt     4080

ttgatgagca cgaccgcgac tggcacgtta ccggttcagg tggcatcggc ctgatcgttg     4140

cgctgggtga agaagcgcga cgcgaaggcc ggctgaaggt gggtgatctg gtggcgatct     4200

actccgggca gtcggatctg ctctcaccgc tgatgggcct tgatccgatg gccgccgatt     4260

tcgtcatcca ggggaacgac acgccagatg gatcgcatca gcaatttatg ctggcccagg     4320

ccccgcagtg tctgcccatc ccaaccgata tgtctatcga ggcagccggc agctacatcc     4380

tcaatctcgg tacgatctat cgcgccctct ttacgacgtt gcaaatcaag gccggacgca     4440

ccatctttat cgagggtgcg gcgaccggca ccggtctgga cgcagcgcgc tcggcggccc     4500

ggaatggtct gcgcgtaatt ggaatggtca gttcgtcgtc acgtgcgtct acgctgctgg     4560

ctgcgggtgc ccacggtgcg attaaccgta aagacccgga ggttgccgat tgtttcacgc     4620

gcgtgcccga agatccatca gcctgggcag cctgggaagc cgccggtcag ccgttgctgg     4680

cgatgttccg ggcgcagaac gacgggcgac tggccgatta tgtggtctcg cacgcgggcg     4740

agacggcctt cccgcgcagt ttccagcttc tcggcgagcc acgcgatggt cacattccga     4800

cgctcacatt ctacggtgcc accagtggct accacttcac cttcctgggt aagccagggt     4860

cagcttcgcc gaccgagatg ctgcggcggg ccaatctccg cgccggtgag gcggtgttga     4920

tctactacgg ggttgggagc gatgacctgg tagataccgg cggtctggag gctatcgagg     4980

cggcgcggca aatgggagcg cggatcgtcg tcgttaccgt cagcgatgcg caacgcgagt     5040

ttgtcctctc gttgggcttc ggggctgccc tacgtggtgt cgtcagcctg gcggaactca     5100

aacgacgctt cggcgatgag tttgagtggc cgcgcacgat gccgccgttg ccgaacgccc     5160

gccaggaccc gcagggtctg aaagaggctg tccgccgctt caacgatctg gtcttcaagc     5220

cgctaggaag cgcggtcggt gtcttcttgc ggagtgccga caatccgcgt ggctaccccg     5280

atctgatcat cgagcgggct gcccacgatg cactggcggt gagcgcgatg ctgatcaagc     5340

ccttcaccgg acggattgtc tacttcgagg acattggtgg gcggcgttac tccttcttcg     5400

caccgcaaat ctgggtgcgc cagcgccgca tctacatgcc gacggcacag atctttggta     5460

cgcacctctc aaatgcgtat gaaattctgc gtctgaatga tgagatcagc gccggtctgc     5520

tgacgattac cgagccggca gtggtgccgt gggatgaact acccgaagca catcaggcga     5580

tgtgggaaaa tcgccacacg gcggccactt atgtggtgaa tcatgcctta ccacgtctcg     5640

gcctaaagaa cagggacgag ctgtacgagg cgtggacggc cggcgagcgc taagccacac     5700

gcgctctccc ccctccggtg taatcggggg agagcgcgtg tccgctgcag tccggcaaaa     5760

aagggcaagg tgtcaccacc ctgccctttt tctttaaaac cgaaaagatt acttcgcgtt     5820

atgcaggctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta     5880

tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag     5940

aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg     6000

tttttccaca ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg     6060

tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg     6120

cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga     6180

agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc     6240

tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt     6300

aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact     6360

ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg     6420

cctaactacg gctacactag aagaacagta tttggtatct gcgctctgct gaagccagtt     6480

accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt     6540

ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct     6600

ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg     6660

gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt     6720

aaatcaatct aaagtatata tgagtaaact tggtctgaca gctcgaggct tggattctca     6780

ccaataaaaa acgcccggcg gcaaccgagc gttctgaaca aatccagatg gagttctgag     6840

gtcattactg gatctatcaa caggagtcca agcgagctcg atatcaaatt acgccccgcc     6900

ctgccactca tcgcagtact gttgtaattc attaagcatt ctgccgacat ggaagccatc     6960

acaaacggca tgatgaacct gaatcgccag cggcatcagc accttgtcgc cttgcgtata     7020

atatttgccc atggtgaaaa cgggggcgaa gaagttgtcc atattggcca cgtttaaatc     7080

aaaactggtg aaactcaccc agggattggc tgagacgaaa aacatattct caataaaccc     7140

tttagggaaa taggccaggt tttcaccgta acacgccaca tcttgcgaat atatgtgtag     7200

aaactgccgg aaatcgtcgt ggtattcact ccagagcgat gaaaacgttt cagtttgctc     7260

atggaaaacg gtgtaacaag ggtgaacact atcccatatc accagctcac cgtctttcat     7320

tgccatacga aattccggat gagcattcat caggcgggca agaatgtgaa taaaggccgg     7380

ataaaacttg tgcttatttt tctttacggt ctttaaaaag gccgtaatat ccagctgaac     7440

ggtctggtta taggtacatt gagcaactga ctgaaatgcc tcaaaatgtt ctttacgatg     7500

ccattgggat atatcaacgg tggtatatcc agtgattttt ttctccattt tagcttcctt     7560

agctcctgaa aatctcgata actcaaaaaa tacgcccggt agtgatctta tttcattatg     7620

gtgaaagttg gaacctctta cgtgcccgat caactcgagt gccacttgac gtctaagaaa     7680

ccattattat catgacatta acctataaaa ataggcgtat cacgaggcag aatttcagat     7740


<210>  61
<211>  1959
<212>  DNA
<213>  Escherichia coli

<400>  61
atgtcgcaaa ttcacaaaca caccattcct gccaacatcg cagaccgttg cctgataaac       60

cctcagcagt acgaggcgat gtatcaacaa tctattaacg tacctgatac cttctggggc      120

gaacagggaa aaattcttga ctggatcaaa ccttaccaga aggtgaaaaa cacctccttt      180

gcccccggta atgtgtccat taaatggtac gaggacggca cgctgaatct ggcggcaaac      240

tgccttgacc gccatctgca agaaaacggc gatcgtaccg ccatcatctg ggaaggcgac      300

gacgccagcc agagcaaaca tatcagctat aaagagctgc accgcgacgt ctgccgcttc      360

gccaataccc tgctcgagct gggcattaaa aaaggtgatg tggtggcgat ttatatgccg      420

atggtgccgg aagccgcggt tgcgatgctg gcctgcgccc gcattggcgc ggtgcattcg      480

gtgattttcg gcggcttctc gccggaagcc gttgccgggc gcattattga ttccaactca      540

cgactggtga tcacttccga cgaaggtgtg cgtgccgggc gctccattcc gctgaagaaa      600

aacgttgatg acgcgctgaa aaacccgaac gtcaccagcg tagagcatgt ggtggtactg      660

aagcgtactg gcgggaaaat tgactggcag gaagggcgcg acctgtggtg gcacgacctg      720

gttgagcaag cgagcgatca gcaccaggcg gaggagatga acgccgaaga tccgctgttt      780

attctctaca cctccggttc taccggtaag ccaaaaggtg tgctgcatac taccggcggt      840

tatctggtgt acgcggcgct gacctttaaa tatgtctttg attatcatcc gggtgatatc      900

tactggtgca ccgccgatgt gggctgggtg accggacaca gttacttgct gtacggcccg      960

ctggcctgcg gtgcgaccac gctgatgttt gaaggcgtac ccaactggcc gacgcctgcc     1020

cgtatggcgc aggtggtgga caagcatcag gtcaatattc tctataccgc acccacggcg     1080

atccgcgcgc tgatggcgga aggcgataaa gcgatcgaag gcaccgaccg ttcgtcgctg     1140

cgcattctcg gttccgtggg cgagccaatt aacccggaag cgtgggagtg gtactggaaa     1200

aaaatcggca acgagaaatg tccggtggtc gatacctggt ggcagaccga aaccggcggt     1260

ttcatgatca ccccgctgcc tggcgctacc gagctgaaag ccggtagtgc aacacgtccg     1320

ttcttcggcg tgcaaccggc gctggtcgat aacgaaggta acccgctgga gggggccacc     1380

gaaggtagcc tggtaatcac cgacagttgg ccgggtcagg cgcgtacgct gtttggcgat     1440

cacgaacgtt ttgaacagac ctacttctcc accttcaaaa atatgtattt cagcggcgac     1500

ggcgcgcgtc gcgatgaaga tggctattac tggataaccg ggcgtgtgga cgacgtgctg     1560

aacgtctccg gtcaccgtct ggggacggca gagattgagt cggcgctggt ggcgcatccg     1620

aagattgccg aagccgccgt agtaggtatt ccgcacaata ttaaaggtca ggcgatctac     1680

gcctacgtca cgcttaatca cggggaggaa ccgtcaccag aactgtacgc agaagtccgc     1740

aactgggtgc gtaaagagat tggcccgctg gcgacgccag acgtgctgca ctggaccgac     1800

tccctgccta aaacccgctc cggcaaaatt atgcgccgta ttctgcgcaa aattgcggcg     1860

ggcgatacca gcaacctggg cgatacctcg acgcttgccg atcctggcgt agtcgagaag     1920

ctgcttgagg agaagcaggc tatcgcgatg ccatcgtga                            1959


<210>  62
<211>  1461
<212>  DNA
<213>  Listeria monocytogenes

<400>  62
atggcccttg aggacaaaga cctgcggagc atccaagagg taaggaactt aatagagagc       60

gcgaacaaag cccaaaaaga gctcgcggcc atgagtcaac aacaaataga cactatagtc      120

aaagctatag cggacgcagg gtatggggcg agggagaaat tagccaaaat ggcccacgag      180

gagacagggt tcggcatatg gcaagataaa gtcataaaaa acgtttttgc gtccaagcat      240

gtttacaatt acatcaaaga catgaaaacg atagggatgc tgaaagagga caacgagaag      300

aaagtcatgg aagtagccgt accgcttggg gtcgtagccg gcctgatacc atcgactaac      360

ccaacttcca cagtaatata caaaactctt atatctataa aagccggcaa ttcaatcgtc      420

ttctcgccgc acccgaacgc ccttaaagcc atactcgaga ctgtccgtat aatatcagag      480

gcggccgaga aagcgggatg tccgaaaggt gcgatcagtt gtatgacagt accgactatc      540

caaggcactg accaattgat gaaacataaa gacactgccg taatcctcgc cactggaggg      600

tcggccatgg tcaaagctgc gtatagctcg gggacaccgg ccataggcgt aggtccgggg      660

aatggtccgg ccttcatcga gcggtcagca aacataccac gggccgtgaa acacatactc      720

gactccaaaa cttttgacaa cggcactata tgcgcctcgg aacaatcggt tgtagtagag      780

agggtgaata aagaggcggt tatagcggag ttccgtaaac aaggcgccca ttttctgagc      840

gacgccgagg cggtacaact cgggaaattt atcctgaggc cgaatgggag catgaatccg      900

gccatcgtcg ggaaaagtgt gcaacacatc gcgaacctcg cggggcttac tgtaccggcg      960

gatgcccgtg tcctcatcgc ggaggagact aaagtagggg cgaaaatccc atatagccgt     1020

gagaaactgg cgccgatcct agcgttttat actgccgaga catggcaaga ggcgtgcgag     1080

ctcagtatgg acatactcta tcacgagggc gcgggccaca cactgatcat ccattcggag     1140

gacaaagaga tcatccggga gtttgccttg aaaaaaccgg tatccaggct cctggtaaat     1200

actccgggcg ctctcggtgg cataggtgcc actactaatc tcgtcccagc gctgactctc     1260

gggtgtggag ccgtaggcgg ctcaagcagc tcggacaata tcggcccaga gaatctcttt     1320

aacatacggc ggatcgcgac tggcgtactg gaactggagg acataaggaa agaggagaac     1380

caagccactt cggagctccc tgtagacgcc gatgccttga tccaatcact ggtcgagaaa     1440

gtactggccg agctgaagta a                                               1461


<210>  63
<211>  2919
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA

<400>  63
aaaaaaaatc cttagctttc gctaaggatg atttctggaa ttcgcggccg cttctagagc       60

ccacagctaa caccacgtcg tccctatctg ctgccctagg tctatgagtg gttgctggat      120

aactttacgg gcatgcataa ggctcgtatg atatattcag ggagaccaca acggtttccc      180

tctacaaata attttgttta acttttcaca caggaaagta ctagatggca aggcccttga      240

tccagttagc gctggatacg ctggatattc cgcagaccct gaaattagca agcttaaccg      300

ccccatacgt ggacattttt gagattggca ccccaagcat taaacataac ggcattgcgc      360

tggttaaaga atttaagaag cgctttccaa acaaactgtt actggtggat ttaaagacca      420

tggatgcggg ggagtatgag gcgaccccat tttttgcggc gggcgcggat attaccaccg      480

tgttaggcgt ggcaggactg gcgaccatta aaggcgtgat taacgcggcg aacaaacata      540

acgcggaagt gcaggtggat ctgattaacg tgccagataa agcggcgtgc gcgcgggaaa      600

gtgcgaaagc gggcgcgcag attgtgggca ttcataccgg cttagatgcg caggcggcgg      660

gccagacccc atttgcggat ttacaggcga ttgcgaaatt aggcttacca gtgcgcatta      720

gtgtggcggg cggcattaaa gcgagtaccg cgcaacaggt ggtgaagacc ggggcgaaca      780

ttattgtggt gggagcggcg atttatggcg cggcgagtcc agcggacgcg gcccgcgaga      840

tttatgagca ggtggtggcg gctagtgcgt aagccacacg cgctctcccc cctccggtgt      900

aatcggggga gagcgcgtgt ccgctgcagt ccggcaaaaa agggcaaggt gtcaccaccc      960

tgcccttttt ctttaaaacc gaaaagatta cttcgcgtta tgcaggcttc ctcgctcact     1020

gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta     1080

atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag     1140

caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccacag gctccgcccc     1200

cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta     1260

taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg     1320

ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc     1380

tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac     1440

gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac     1500

ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg     1560

aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga     1620

agaacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt     1680

agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag     1740

cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct     1800

gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg     1860

atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat     1920

gagtaaactt ggtctgacag ctcgaggctt ggattctcac caataaaaaa cgcccggcgg     1980

caaccgagcg ttctgaacaa atccagatgg agttctgagg tcattactgg atctatcaac     2040

aggagtccaa gcgagctcga tatcaaatta cgccccgccc tgccactcat cgcagtactg     2100

ttgtaattca ttaagcattc tgccgacatg gaagccatca caaacggcat gatgaacctg     2160

aatcgccagc ggcatcagca ccttgtcgcc ttgcgtataa tatttgccca tggtgaaaac     2220

gggggcgaag aagttgtcca tattggccac gtttaaatca aaactggtga aactcaccca     2280

gggattggct gagacgaaaa acatattctc aataaaccct ttagggaaat aggccaggtt     2340

ttcaccgtaa cacgccacat cttgcgaata tatgtgtaga aactgccgga aatcgtcgtg     2400

gtattcactc cagagcgatg aaaacgtttc agtttgctca tggaaaacgg tgtaacaagg     2460

gtgaacacta tcccatatca ccagctcacc gtctttcatt gccatacgaa attccggatg     2520

agcattcatc aggcgggcaa gaatgtgaat aaaggccgga taaaacttgt gcttattttt     2580

ctttacggtc tttaaaaagg ccgtaatatc cagctgaacg gtctggttat aggtacattg     2640

agcaactgac tgaaatgcct caaaatgttc tttacgatgc cattgggata tatcaacggt     2700

ggtatatcca gtgatttttt tctccatttt agcttcctta gctcctgaaa atctcgataa     2760

ctcaaaaaat acgcccggta gtgatcttat ttcattatgg tgaaagttgg aacctcttac     2820

gtgcccgatc aactcgagtg ccacttgacg tctaagaaac cattattatc atgacattaa     2880

cctataaaaa taggcgtatc acgaggcaga atttcagat                            2919


<210>  64
<211>  2805
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA

<400>  64
aaaaaaaatc cttagctttc gctaaggatg atttctggaa ttcgcggccg cttctagagc       60

ccacagctaa caccacgtcg tccctatctg ctgccctagg tctatgagtg gttgctggat      120

aactttacgg gcatgcataa ggctcgtatg atatattcag ggagaccaca acggtttccc      180

tctacaaata attttgttta acttttcaca caggaaagta ctagatgcac cagaagctga      240

ttatagataa gattagtggc attttagcgg cgaccgacgc gggctacgac gcaaagctga      300

ctgcgatgtt agatcaggcg agtcgcattt ttgtggccgg tgcgggccgt tcgggtctgg      360

tggcgaaatt ttttgcgatg cgcttaatgc atggcggcta cgatgtgttt gtggtgggcg      420

agattgtgac cccaagcatt cgcaaaggcg atttgctgat tgttattagt ggcagtgggg      480

agaccgagac catgttagcg tttaccaaga aggcgaaaga acagggcgcg agtattgcgt      540

taattagtac ccgcgatagc agtagtttag gcgatttagc ggatagtgtg tttcgcattg      600

gcagtcccga attatttgga aaggtggtgg gcatgccaat gggcaccgtg tttgaattaa      660

gtaccttatt atttttagaa gcgaccattt cacatattat tcatgaaaag ggcattccag      720

aggaggagat gaggactcgg catgcgaacc tggagtaagc cacacgcgct ctcccccctc      780

cggtgtaatc gggggagagc gcgtgtccgc tgcagtccgg caaaaaaggg caaggtgtca      840

ccaccctgcc ctttttcttt aaaaccgaaa agattacttc gcgttatgca ggcttcctcg      900

ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag      960

gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa     1020

ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccacaggctc     1080

cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca     1140

ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg     1200

accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct     1260

catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt     1320

gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag     1380

tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc     1440

agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac     1500

actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga     1560

gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc     1620

aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg     1680

gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca     1740

aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt     1800

atatatgagt aaacttggtc tgacagctcg aggcttggat tctcaccaat aaaaaacgcc     1860

cggcggcaac cgagcgttct gaacaaatcc agatggagtt ctgaggtcat tactggatct     1920

atcaacagga gtccaagcga gctcgatatc aaattacgcc ccgccctgcc actcatcgca     1980

gtactgttgt aattcattaa gcattctgcc gacatggaag ccatcacaaa cggcatgatg     2040

aacctgaatc gccagcggca tcagcacctt gtcgccttgc gtataatatt tgcccatggt     2100

gaaaacgggg gcgaagaagt tgtccatatt ggccacgttt aaatcaaaac tggtgaaact     2160

cacccaggga ttggctgaga cgaaaaacat attctcaata aaccctttag ggaaataggc     2220

caggttttca ccgtaacacg ccacatcttg cgaatatatg tgtagaaact gccggaaatc     2280

gtcgtggtat tcactccaga gcgatgaaaa cgtttcagtt tgctcatgga aaacggtgta     2340

acaagggtga acactatccc atatcaccag ctcaccgtct ttcattgcca tacgaaattc     2400

cggatgagca ttcatcaggc gggcaagaat gtgaataaag gccggataaa acttgtgctt     2460

atttttcttt acggtcttta aaaaggccgt aatatccagc tgaacggtct ggttataggt     2520

acattgagca actgactgaa atgcctcaaa atgttcttta cgatgccatt gggatatatc     2580

aacggtggta tatccagtga tttttttctc cattttagct tccttagctc ctgaaaatct     2640

cgataactca aaaaatacgc ccggtagtga tcttatttca ttatggtgaa agttggaacc     2700

tcttacgtgc ccgatcaact cgagtgccac ttgacgtcta agaaaccatt attatcatga     2760

cattaaccta taaaaatagg cgtatcacga ggcagaattt cagat                     2805


<210>  65
<211>  7393
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA

<400>  65
acacgcgctc tcccccgtca cgtacgtcgc agtacgtgct cttcgcggga tcgtacgtta       60

gagttagcgc gcgtgacggc cccttacgtg gatatttttg aaattggcac gccatgtatt      120

aagtataacg gcattgagat tgtgcgcgaa ttaaaacgcc gtcatccgga ccgtttagtg      180

ctggtggatt taaaaaccat ggacgcgggc gagtatgaag cggccccgtt ctacgcggcg      240

ggcgctgata tttgcaccgt tttaggtgtt tcgggtccgg ccaccattgc gggtgtggtc      300

aaggccgcgc aggcccataa tgcggaggtg caggttgacc tgattaacgt tccggataaa      360

gctgcgtgcg cccgcgaagc cgcgcgttta ggcgcgcaga ttattggggt tcataccggg      420

ctggacgcgc aggcgcaggg gcagacgccg tttgcggact tagagagcat tgcgcgcctg      480

aaactgccgg tgagaatttc tgttgctggt ggtattaacc agaacaccgc gtctcgtgtg      540

gcgaaagccg gtgcggatat tgtggtggtg ggggccgcca tttatggcgc cccatgtcca      600

gcgaccgccg cgcgcacgat ccgcgaactg ctggagggtg ctcaccataa atttattgtt      660

agtaaaattg gcggcgttct tgcggcgact gataaaagct atgaagcccg gctgaccggg      720

ttattagagc gggcgcgccg gatctttgtg gcgggcgcgg gtcggagtgg cctggtgggc      780

cgcttctttg cgatgcgtct gatgcatggc ggctaccagg cttacatcgt tggcgaaatt      840

gttacgccaa gcattcggca aggcgacctc ctgattgtta tcagtgggtc gggcgagacc      900

gagaccatga ttgcttatgc gaaaaaggcg aaagagcagg gtgcgagcat tgccctgatt      960

accacccgcg ataaaagtac gattggggat atggcagatg ttgtttttcg tattggcact     1020

ccagaacagt atggcaaagt tgtggggatg ccgatgggca ccacctttga actgagtacc     1080

ctggttctgt tagaggcgac gatcagtcat attattcaca ccaaaaaaat tccagaagaa     1140

cagatgcgta cccgccatgc gaatctggag taagcccaca gctaacacca cgtcgtccct     1200

atctgctgcc ctaggtctat gagtggttgc tggataactt tacgggcatg cataaggctc     1260

gtatgatata ttcagggaga ccacaacggt ttccctctac aaataatttt gtttaacttt     1320

aaagaggaga aatactagat gtcgcaaatt cacaaacaca ccattcctgc caacatcgca     1380

gaccgttgcc tgataaaccc tcagcagtac gaggcgatgt atcaacaatc tattaacgta     1440

cctgatacct tctggggcga acagggaaaa attcttgact ggatcaaacc ttaccagaag     1500

gtgaaaaaca cctcctttgc ccccggtaat gtgtccatta aatggtacga ggacggcacg     1560

ctgaatctgg cggcaaactg ccttgaccgc catctgcaag aaaacggcga tcgtaccgcc     1620

atcatctggg aaggcgacga cgccagccag agcaaacata tcagctataa agagctgcac     1680

cgcgacgtct gccgcttcgc caataccctg ctcgagctgg gcattaaaaa aggtgatgtg     1740

gtggcgattt atatgccgat ggtgccggaa gccgcggttg cgatgctggc ctgcgcccgc     1800

attggcgcgg tgcattcggt gattttcggc ggcttctcgc cggaagccgt tgccgggcgc     1860

attattgatt ccaactcacg actggtgatc acttccgacg aaggtgtgcg tgccgggcgc     1920

tccattccgc tgaagaaaaa cgttgatgac gcgctgaaaa acccgaacgt caccagcgta     1980

gagcatgtgg tggtactgaa gcgtactggc gggaaaattg actggcagga agggcgcgac     2040

ctgtggtggc acgacctggt tgagcaagcg agcgatcagc accaggcgga ggagatgaac     2100

gccgaagatc cgctgtttat tctctacacc tccggttcta ccggtaagcc aaaaggtgtg     2160

ctgcatacta ccggcggtta tctggtgtac gcggcgctga cctttaaata tgtctttgat     2220

tatcatccgg gtgatatcta ctggtgcacc gccgatgtgg gctgggtgac cggacacagt     2280

tacttgctgt acggcccgct ggcctgcggt gcgaccacgc tgatgtttga aggcgtaccc     2340

aactggccga cgcctgcccg tatggcgcag gtggtggaca agcatcaggt caatattctc     2400

tataccgcac ccacggcgat ccgcgcgctg atggcggaag gcgataaagc gatcgaaggc     2460

accgaccgtt cgtcgctgcg cattctcggt tccgtgggcg agccaattaa cccggaagcg     2520

tgggagtggt actggaaaaa aatcggcaac gagaaatgtc cggtggtcga tacctggtgg     2580

cagaccgaaa ccggcggttt catgatcacc ccgctgcctg gcgctaccga gctgaaagcc     2640

ggtagtgcaa cacgtccgtt cttcggcgtg caaccggcgc tggtcgataa cgaaggtaac     2700

ccgctggagg gggccaccga aggtagcctg gtaatcaccg acagttggcc gggtcaggcg     2760

cgtacgctgt ttggcgatca cgaacgtttt gaacagacct acttctccac cttcaaaaat     2820

atgtatttca gcggcgacgg cgcgcgtcgc gatgaagatg gctattactg gataaccggg     2880

cgtgtggacg acgtgctgaa cgtctccggt caccgtctgg ggacggcaga gattgagtcg     2940

gcgctggtgg cgcatccgaa gattgccgaa gccgccgtag taggtattcc gcacaatatt     3000

aaaggtcagg cgatctacgc ctacgtcacg cttaatcacg gggaggaacc gtcaccagaa     3060

ctgtacgcag aagtccgcaa ctgggtgcgt aaagagattg gcccgctggc gacgccagac     3120

gtgctgcact ggaccgactc cctgcctaaa acccgctccg gcaaaattat gcgccgtatt     3180

ctgcgcaaaa ttgcggcggg cgataccagc aacctgggcg atacctcgac gcttgccgat     3240

cctggcgtag tcgagaagct gcttgaggag aagcaggcta tcgcgatgcc atcgtgagcc     3300

cgaagagcac gtactgctaa gctgactgcg ggggagagcg cgtgtgccgg ccagtctaca     3360

tgtactcttt ttgataaaaa attggagatt cctttacaaa tatgctctta cgtgctatta     3420

tttaagtgac tatttaaaag gagttaataa atatgcggca aggtattctt aaataaactg     3480

tcaatttgat agcgggaaca aataattaga tgtccttttt taggagggct tagttttttg     3540

tacccagttt aagaatacct ttatcatgtg attctaaagt atccagagaa tatctgtatg     3600

ctttgtatac ctatggttat gcataaaaat cccggtgata aaagtattta tcactgggat     3660

ttttatgccc ttttgggttt ttgaatggag gaatactaga tgaaaatcat aaatatcggt     3720

gtattagctc acgttgatgc aggaaaaaca acattaactg aatcactttt atataactct     3780

ggtgcaatta ctgaacttgg ttcagtagat aaaggtacta ctcgtactga taatacatta     3840

ttagaacgtc aacgtggaat cacaattcaa acaggtatca catcttttca atgggaaaat     3900

acaaaagtaa atattataga tacacctgga cacatggatt tccttgcaga agtataccgt     3960

agtctttcag tattagatgg tgctatttta cttatcagcg ctaaagatgg agttcaagct     4020

caaactcgta tcttatttca cgcattacgt aaaatgggta ttccaacaat tttctttata     4080

aacaaaattg accaaaacgg aattgattta agtacagttt atcaagatat caaagaaaaa     4140

ctttctgctg aaatcgttat taaacaaaaa gttgaattat acccaaacgt ttgcgtaaca     4200

aattttactg aatcagaaca atgggataca gttatagaag gtaatgatga tttattagaa     4260

aaatacatgt caggtaaatc attagaagca ttagaattag aacaagaaga aagtattcgt     4320

ttccaaaact gttctttatt ccctttatac catggaagcg ctaaaagtaa cataggtatt     4380

gataacttaa ttgaagttat tactaacaaa ttttattctt caactcatcg tgggccttct     4440

gaattatgcg gtaacgtttt caaaattgaa tatacaaaaa aacgtcaacg tttagcttat     4500

atacgtcttt atagtggtgt tttacattta cgtgatagtg ttcgtgttag tgaaaaagaa     4560

aagattaaag ttacagaaat gtatacttct attaacggtg aattatgcaa aattgaccgt     4620

gcatattcag gtgaaattgt aattttacaa aacgaatttc ttaaacttaa tagtgtactt     4680

ggtgacacaa aacttttacc acaacgtaag aaaattgaaa atccacaccc attacttcaa     4740

acaacagtag aaccaagcaa acctgaacaa cgtgaaatgc ttttagatgc tcttttagaa     4800

attagtgact ctgacccact tttacgttac tatgtagatt ctactactca tgaaattatt     4860

ctttctttcc ttggtaaagt tcaaatggaa gttatttctg cattattaca agaaaaatat     4920

catgttgaaa tcgaattaaa agaacctact gtaatttata tggaacgtcc attaaaaaat     4980

gctgaatata caattcatat tgaagttcca ccaaatccat tttgggcttc tattggtctt     5040

tctgtttctc cacttccact tggtagcgga atgcaatatg aaagtagcgt aagtttaggt     5100

tatcttaatc aaagtttcca aaacgcagtt atggaaggta ttcgttacgg ttgcgaacaa     5160

ggtttatacg gttggaatgt tacagactgc aaaatctgtt ttaagtatgg actttactat     5220

tcacctgtat caacacctgc tgactttcgt atgcttgcac caattgtttt agaacaagtt     5280

ttaaagaaag ctggaactga acttttagaa ccataccttt cttttaaaat ctatgcacca     5340

caagaatact taagtcgtgc ttataacgat gcacctaaat actgtgctaa tattgttgat     5400

actcaattaa agaacaacga agtaatttta agcggagaaa ttcctgcacg ttgtattcaa     5460

gaatatcgta gtgatttaac atttttcact aatggacgtt ctgtttgctt aactgaatta     5520

aaaggttatc atgttactac tggtgaacct gtatgccaac cacgtcgtcc taatagtcgt     5580

attgataaag ttcgttatat gttcaacaaa atcacataat aaaaaaaaaa accccgcccc     5640

tgacagggcg gggttttttt tttagttagt tagagatgtg tataagagac agctggccat     5700

ggaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg     5760

gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt     5820

gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt     5880

caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag     5940

cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat     6000

ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct     6060

taacgtgagt tttcgttcca ctgagcgtca gaccccttaa taagatgatc ttcttgagat     6120

cgttttggtc tgcgcgtaat ctcttgctct gaaaacgaaa aaaccgcctt gcagggcggt     6180

ttttcgaagg ttctctgagc taccaactct ttgaaccgag gtaactggct tggaggagcg     6240

cagtcaccaa aacttgtcct ttcagtttag ccttaaccgg cgcatgactt caagactaac     6300

tcctctaaat caattaccag tggctgctgc cagtggtgct tttgcatgtc tttccgggtt     6360

ggactcaaga cgatagttac cggataaggc gcagcggtcg gactgaacgg ggggttcgtg     6420

catacagtcc agcttggagc gaactgccta cccggaactg agtgtcaggc gtggaatgag     6480

acaaacgcgg ccataacagc ggaatgacac cggtaaaccg aaaggcagga acaggagagc     6540

gcacgaggga gccgccaggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca     6600

ccactgattt gagcgtcaga tttcgtgatg cttgtcaggg gggcggagcc tatggaaaaa     6660

cggctttgcc gcggccctct cacttccctg ttaagtatct tcctggcatc ttccaggaaa     6720

tctccgcccc gttcgtaagc catttccgct cgccgcagtc gaacgaccga gcgtagcgag     6780

tcagtgagcg aggaagcgga atatatcctg tatcacatat tctgctgacg caccggtgca     6840

gccttttttc tcctgccaca tgaagcactt cactgacacc ctcatcagtg ccaacatagt     6900

aagccagtat acactccgct agcgctgagg tctgcctcgt gaagaaggtg ttgctgactc     6960

ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac ggttgatgag     7020

agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca cggaacggtc     7080

tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc gatttattca     7140

acaaagccac gttgtgtctc aaaatctctg atgttacatt gcacaagata aaaatatatc     7200

atcatgaaca ataaaactgt ctgcttacat aaacagtaat acaaggggtg ttgcccagct     7260

gtctcttata cacatctccg gcttatcggt cagtttcacc tgatttacgt aaaaacccgc     7320

ttcggcgggt ttttgctttt ggaggggcag aaagatgaat gactgtccac gacgctatac     7380

ccaaaagaaa gcc                                                        7393


<210>  66
<211>  4269
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA

<400>  66
gaattcgcgg ccgcttctag aggccgatct cgatcccgcg aaattaatac gactcactat       60

aggggaattg tgagcggata acaattcccc tctagaaata attttgttta actttaagaa      120

ggagactctt ctatggccct tgaggacaaa gacctgcgga gcatccaaga ggtaaggaac      180

ttaatagaga gcgcgaacaa agcccaaaaa gagctcgcgg ccatgagtca acaacaaata      240

gacactatag tcaaagctat agcggacgca gggtatgggg cgagggagaa attagccaaa      300

atggcccacg aggagacagg gttcggcata tggcaagata aagtcataaa aaacgttttt      360

gcgtccaagc atgtttacaa ttacatcaaa gacatgaaaa cgatagggat gctgaaagag      420

gacaacgaga agaaagtcat ggaagtagcc gtaccgcttg gggtcgtagc cggcctgata      480

ccatcgacta acccaacttc cacagtaata tacaaaactc ttatatctat aaaagccggc      540

aattcaatcg tcttctcgcc gcacccgaac gcccttaaag ccatactcga gactgtccgt      600

ataatatcag aggcggccga gaaagcggga tgtccgaaag gtgcgatcag ttgtatgaca      660

gtaccgacta tccaaggcac tgaccaattg atgaaacata aagacactgc cgtaatcctc      720

gccactggag ggtcggccat ggtcaaagct gcgtatagct cggggacacc ggccataggc      780

gtaggtccgg ggaatggtcc ggccttcatc gagcggtcag caaacatacc acgggccgtg      840

aaacacatac tcgactccaa aacttttgac aacggcacta tatgcgcctc ggaacaatcg      900

gttgtagtag agagggtgaa taaagaggcg gttatagcgg agttccgtaa acaaggcgcc      960

cattttctga gcgacgccga ggcggtacaa ctcgggaaat ttatcctgag gccgaatggg     1020

agcatgaatc cggccatcgt cgggaaaagt gtgcaacaca tcgcgaacct cgcggggctt     1080

actgtaccgg cggatgcccg tgtcctcatc gcggaggaga ctaaagtagg ggcgaaaatc     1140

ccatatagcc gtgagaaact ggcgccgatc ctagcgtttt atactgccga gacatggcaa     1200

gaggcgtgcg agctcagtat ggacatactc tatcacgagg gcgcgggcca cacactgatc     1260

atccattcgg aggacaaaga gatcatccgg gagtttgcct tgaaaaaacc ggtatccagg     1320

ctcctggtaa atactccggg cgctctcggt ggcataggtg ccactactaa tctcgtccca     1380

gcgctgactc tcgggtgtgg agccgtaggc ggctcaagca gctcggacaa tatcggccca     1440

gagaatctct ttaacatacg gcggatcgcg actggcgtac tggaactgga ggacataagg     1500

aaagaggaga accaagccac ttcggagctc cctgtagacg ccgatgcctt gatccaatca     1560

ctggtcgaga aagtactggc cgagctgaag taagcccgaa gagctactag tagcggccgc     1620

tgcagcgtca aaagggcgac acaaaattta ttctaaatgc ataataaata ctgataacat     1680

cttatagttt gtattatatt ttgtattatc gttgacatgt ataattttga tatcaaaaac     1740

tgattttccc tttattattt tcgagattta ttttcttaat tctctttaac aaactagaaa     1800

tattgtatat acaaaaaatc ataaataata gatgaatagt ttaattatag gtgttcatca     1860

atcgaaaaag caacgtatct tatttaaagt gcgttgcttt tttctcattt ataaggttaa     1920

ataattctca tatatcaagc aaagtgacag gcgcccttaa atattctgac aaatgctctt     1980

tccctaaact ccccccataa aaaaacccgc cgaagcgggt ttttacgtta tttgcggatt     2040

aacgattact cgttatcaga accgcccagg gggcccgagc ttaagactgg ccgtcgtttt     2100

acaacacaga aagagtttgt agaaacgcaa aaaggccatc cgtcaggggc cttctgctta     2160

gtttgatgcc tggcagttcc ctactctcgc cttccgcttc ctcgctcact gactcgctgc     2220

gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat     2280

ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca     2340

ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc     2400

atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc     2460

aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg     2520

gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta     2580

ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg     2640

ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac     2700

acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag     2760

gcggtgctac agagttcttg aagtggtggg ctaactacgg ctacactaga agaacagtat     2820

ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat     2880

ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc     2940

gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt     3000

ggaacgacgc gcgcgtaact cacgttaagg gattttggtc atgagcttgc gccgtcccgt     3060

caagtcagcg taatgctctg cttttagaaa aactcatcga gcatcaaatg aaactgcaat     3120

ttattcatat caggattatc aataccatat ttttgaaaaa gccgtttctg taatgaagga     3180

gaaaactcac cgaggcagtt ccataggatg gcaagatcct ggtatcggtc tgcgattccg     3240

actcgtccaa catcaataca acctattaat ttcccctcgt caaaaataag gttatcaagt     3300

gagaaatcac catgagtgac gactgaatcc ggtgagaatg gcaaaagttt atgcatttct     3360

ttccagactt gttcaacagg ccagccatta cgctcgtcat caaaatcact cgcatcaacc     3420

aaaccgttat tcattcgtga ttgcgcctga gcgaggcgaa atacgcgatc gctgttaaaa     3480

ggacaattac aaacaggaat cgagtgcaac cggcgcagga acactgccag cgcatcaaca     3540

atattttcac ctgaatcagg atattcttct aatacctgga acgctgtttt tccggggatc     3600

gcagtggtga gtaaccatgc atcatcagga gtacggataa aatgcttgat ggtcggaagt     3660

ggcataaatt ccgtcagcca gtttagtctg accatctcat ctgtaacatc attggcaacg     3720

ctacctttgc catgtttcag aaacaactct ggcgcatcgg gcttcccata caagcgatag     3780

attgtcgcac ctgattgccc gacattatcg cgagcccatt tatacccata taaatcagca     3840

tccatgttgg aatttaatcg cggcctcgac gtttcccgtt gaatatggct catattcttc     3900

ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt     3960

gaatgtattt agaaaaataa acaaataggg gtcagtgtta caaccaatta accaattctg     4020

aacattatcg cgagcccatt tatacctgaa tatggctcat aacacccctt gtttgcctgg     4080

cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag     4140

cgccgatggt agtgtgggga ctccccatgc gagagtaggg aactgccagg catcaaataa     4200

aacgaaaggc tcagtcgaaa gactgggcct ttcgcccggg ctaattaggg ggtgtcgccc     4260

ttcgctgaa                                                             4269



