                         SEQUENCE LISTING
<110>  BASF SE
<120>  Method for the production of acrylic acid or salts thereof
<130>  180927WO02
<160>  75
<170>  According Wipo Std 25


<210> 1
<211> 1011
<212> DNA
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 1
atgaaagaag cgattaaagt ggcgtgcgtg caggcggcgc cgatttatat ggatctggaa     60
gcgaccgtgg ataaaaccat tgaactgatg gaagaagcgg cgcgcaacaa cgcgcgcctg    120
attgcgtttc cggaaacctg gattccgggc tatccgtggt ttctgtggct ggatagcccg    180
gcgtgggcga tgcagtttgt gcgccagtat catgaaaaca gcctggaact ggatggcccg    240
caggcgaaac gcattagcga tgcggcgaaa cgcctgggca ttatggtgac cctgggcatg    300
agcgaacgcg tgggcggcac cctgtatatt agccagtggt ttattggcga taacggcgat    360
accattggcg cgcgccgcaa actgaaaccg acctttgtgg aacgcaccct gtttggcgaa    420
ggcgatggca gcagcctggc ggtgtttgaa accagcgtgg gccgcctggg cggcctgtgc    480
tgctgggaac atctgcagcc gctgaccaaa tatgcgctgt atgcgcagaa cgaagaaatt    540
cattgcgcgg cgtggccgag ctttagcctg tatccgaacg cggcgaaagc gctgggcccg    600
gatgtgaacg tggcggcgag ccgcatttat gcggtggaag gccagtgctt tgtgctggcg    660
agctgcgcgc tggtgagcca gagcatgatt gatatgctgt gcaccgatga tgaaaaacat    720
gcgctgctgc tggcgggcgg cggccatagc cgcattattg gcccggatgg cggcgatctg    780
gtggcgccgc tggcggaaaa cgaagaaggc attctgtatg cgaacctgga tccgggcgtg    840
cgcattctgg cgaaaatggc ggcggatccg gcgggccatt atagccgccc ggatattacc    900
cgcctgctga ttgatcgcag cccgaaactg ccggtggtgg aaattgaagg cgatctgcgc    960
ccgtatgcgc tgggcaaagc gagcgaaacc ggcgcgcagc tggaagaaat t            1011

<210> 2
<211> 337
<212> PRT
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 2
Met Lys Glu Ala Ile Lys Val Ala Cys Val Gln Ala Ala Pro Ile Tyr
1               5                   10                  15
Met Asp Leu Glu Ala Thr Val Asp Lys Thr Ile Glu Leu Met Glu Glu
            20                  25                  30
Ala Ala Arg Asn Asn Ala Arg Leu Ile Ala Phe Pro Glu Thr Trp Ile
        35                  40                  45
Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asp Ser Pro Ala Trp Ala Met
    50                  55                  60
Gln Phe Val Arg Gln Tyr His Glu Asn Ser Leu Glu Leu Asp Gly Pro
65                  70                  75                  80
Gln Ala Lys Arg Ile Ser Asp Ala Ala Lys Arg Leu Gly Ile Met Val
                85                  90                  95
Thr Leu Gly Met Ser Glu Arg Val Gly Gly Thr Leu Tyr Ile Ser Gln
            100                 105                 110
Trp Phe Ile Gly Asp Asn Gly Asp Thr Ile Gly Ala Arg Arg Lys Leu
        115                 120                 125
Lys Pro Thr Phe Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly Ser
    130                 135                 140
Ser Leu Ala Val Phe Glu Thr Ser Val Gly Arg Leu Gly Gly Leu Cys
145                 150                 155                 160
Cys Trp Glu His Leu Gln Pro Leu Thr Lys Tyr Ala Leu Tyr Ala Gln
                165                 170                 175
Asn Glu Glu Ile His Cys Ala Ala Trp Pro Ser Phe Ser Leu Tyr Pro
            180                 185                 190
Asn Ala Ala Lys Ala Leu Gly Pro Asp Val Asn Val Ala Ala Ser Arg
        195                 200                 205
Ile Tyr Ala Val Glu Gly Gln Cys Phe Val Leu Ala Ser Cys Ala Leu
    210                 215                 220
Val Ser Gln Ser Met Ile Asp Met Leu Cys Thr Asp Asp Glu Lys His
225                 230                 235                 240
Ala Leu Leu Leu Ala Gly Gly Gly His Ser Arg Ile Ile Gly Pro Asp
                245                 250                 255
Gly Gly Asp Leu Val Ala Pro Leu Ala Glu Asn Glu Glu Gly Ile Leu
            260                 265                 270
Tyr Ala Asn Leu Asp Pro Gly Val Arg Ile Leu Ala Lys Met Ala Ala
        275                 280                 285
Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Ile Thr Arg Leu Leu Ile
    290                 295                 300
Asp Arg Ser Pro Lys Leu Pro Val Val Glu Ile Glu Gly Asp Leu Arg
305                 310                 315                 320
Pro Tyr Ala Leu Gly Lys Ala Ser Glu Thr Gly Ala Gln Leu Glu Glu
                325                 330                 335
Ile

<210> 3
<211> 942
<212> DNA
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 3
atgggtgaat tcggtgaagt taccctgggt gttgctcagg ctgctccggt ttacttcgac     60
cgtgaagctt ctaccgaaaa agctcgtggt ctgatccgtg aagctggtga aaaaggtgtt    120
gacctgctgg ctttcggtga aacctggctg accggttacc cgtactggaa agacgctccg    180
tggtctcgtg aatacaacga cctgcgtgct cgttacgttg ctaacggtgt tatgatcccg    240
ggtccggaaa ccgacgctct gtgccaggct gctgctgaag ctggtgttga cgttgctatc    300
ggtgttgttg aactggaacc gggttctctg tcttctgttt actgcaccct gctgttcatc    360
tctcgtgaag gtgaaatcct gggtcgtcac cgtaaactga aaccgaccga ctctgaacgt    420
cgttactggt ctgaaggtga cgctaccggt ctgcgtgttt acgaacgtcc gtacggtcgt    480
ctgtctggtc tgaactgctg ggaacacctg atgatgctgc cgggttacgc tctggctgct    540
cagggtaccc agttccacgt tgctgcttgg ccgaacatgg cttcttctgc ttctgaactg    600
ctgtctcgtg cttacgctta ccaggctggt tgctacgttc tgtgcgctgg tggtctgggt    660
ccggctccgg gtgaactgcc ggacggtatc gctgctgaat ctctggacca cctgaccggt    720
gaatcttgca tcatcgaccc gtggggtaaa gttatcgctg gtccggtttc ttgcgaagaa    780
accctgatca ccgctcgtgt ttctaccgct tctatctacc gtcgtaaatc tctgaccgac    840
gttggtggtc actactctcg tccggacgtt ttccgtttcg aagttgaccg ttctgaacgt    900
ccgcgtgttg ttttccgtga cggtgacgtt gacgaccgtg gt                       942

<210> 4
<211> 314
<212> PRT
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 4
Met Gly Glu Phe Gly Glu Val Thr Leu Gly Val Ala Gln Ala Ala Pro
1               5                   10                  15
Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala Arg Gly Leu Ile
            20                  25                  30
Arg Glu Ala Gly Glu Lys Gly Val Asp Leu Leu Ala Phe Gly Glu Thr
        35                  40                  45
Trp Leu Thr Gly Tyr Pro Tyr Trp Lys Asp Ala Pro Trp Ser Arg Glu
    50                  55                  60
Tyr Asn Asp Leu Arg Ala Arg Tyr Val Ala Asn Gly Val Met Ile Pro
65                  70                  75                  80
Gly Pro Glu Thr Asp Ala Leu Cys Gln Ala Ala Ala Glu Ala Gly Val
                85                  90                  95
Asp Val Ala Ile Gly Val Val Glu Leu Glu Pro Gly Ser Leu Ser Ser
            100                 105                 110
Val Tyr Cys Thr Leu Leu Phe Ile Ser Arg Glu Gly Glu Ile Leu Gly
        115                 120                 125
Arg His Arg Lys Leu Lys Pro Thr Asp Ser Glu Arg Arg Tyr Trp Ser
    130                 135                 140
Glu Gly Asp Ala Thr Gly Leu Arg Val Tyr Glu Arg Pro Tyr Gly Arg
145                 150                 155                 160
Leu Ser Gly Leu Asn Cys Trp Glu His Leu Met Met Leu Pro Gly Tyr
                165                 170                 175
Ala Leu Ala Ala Gln Gly Thr Gln Phe His Val Ala Ala Trp Pro Asn
            180                 185                 190
Met Ala Ser Ser Ala Ser Glu Leu Leu Ser Arg Ala Tyr Ala Tyr Gln
        195                 200                 205
Ala Gly Cys Tyr Val Leu Cys Ala Gly Gly Leu Gly Pro Ala Pro Gly
    210                 215                 220
Glu Leu Pro Asp Gly Ile Ala Ala Glu Ser Leu Asp His Leu Thr Gly
225                 230                 235                 240
Glu Ser Cys Ile Ile Asp Pro Trp Gly Lys Val Ile Ala Gly Pro Val
                245                 250                 255
Ser Cys Glu Glu Thr Leu Ile Thr Ala Arg Val Ser Thr Ala Ser Ile
            260                 265                 270
Tyr Arg Arg Lys Ser Leu Thr Asp Val Gly Gly His Tyr Ser Arg Pro
        275                 280                 285
Asp Val Phe Arg Phe Glu Val Asp Arg Ser Glu Arg Pro Arg Val Val
    290                 295                 300
Phe Arg Asp Gly Asp Val Asp Asp Arg Gly
305                 310

<210> 5
<211> 969
<212> DNA
<213> Flavihumibacter solisilvae

<400> 5
atgagccata gtaccaataa taacagcagc accgttgttc gtgcagcagc cgtgcagatt     60
agcccggttc tgtatagtcg cgaaggcacc acccagaaag tggtgaatac cattcgtgaa    120
ctgggtaaac agggcgtgca gtttgcagtg tttccggaaa cctttattcc gtattatccg    180
tattttagtt tcgttcagcc gccgtatatg caggcagaac agcatctgaa actgatggaa    240
gaagcagtga ccgttccgag tgccaccacc gatgcaattg gcgaagccgc ccgtgaagcc    300
ggtattgttg ttagtattgg cgtgaatgaa cgtgatggtg gtagtctgta taatacccag    360
ctgctgtttg atgccgatgg taccctgatt cagcgccgtc gcaaaattac cccgacctat    420
catgaacgca tggtttgggg tcagggcgat ggtagcggcc tgcgcgctgt ggatagtaaa    480
gcaggccgta ttggccagct ggcatgttgg gaacattata atccgctggc ccgttatgca    540
atgattgccg atggtgaaca gattcatgca gcaatgtatc cgggcagcag ctttggcgaa    600
ctgtttagcc agcagattga agttagtgtt cgtcagcatg ccctggaaag tgccgccttt    660
gttgttagta gcaccgcatg gctggatgcc gatcagcagg cccagattat gaaagatacc    720
ggcagcccga ttggtccgat tagcggtggt aattttaccg ccattattgc cccggatggt    780
accattattg gcgaaccgat tcgtagcggc gaaggctttg tgattgcaga tttggatttt    840
aatctgattg agaaacgcaa acgtctgatg gatctgaaag gccattataa tcgcccggaa    900
ctgctgagtc tgctgattga tcgcaccccg gccgaatatg ttcaggaagt gaataagagt    960
gttagcgaa                                                            969

<210> 6
<211> 323
<212> PRT
<213> Flavihumibacter solisilvae

<400> 6
Met Ser His Ser Thr Asn Asn Asn Ser Ser Thr Val Val Arg Ala Ala
1               5                   10                  15
Ala Val Gln Ile Ser Pro Val Leu Tyr Ser Arg Glu Gly Thr Thr Gln
            20                  25                  30
Lys Val Val Asn Thr Ile Arg Glu Leu Gly Lys Gln Gly Val Gln Phe
        35                  40                  45
Ala Val Phe Pro Glu Thr Phe Ile Pro Tyr Tyr Pro Tyr Phe Ser Phe
    50                  55                  60
Val Gln Pro Pro Tyr Met Gln Ala Glu Gln His Leu Lys Leu Met Glu
65                  70                  75                  80
Glu Ala Val Thr Val Pro Ser Ala Thr Thr Asp Ala Ile Gly Glu Ala
                85                  90                  95
Ala Arg Glu Ala Gly Ile Val Val Ser Ile Gly Val Asn Glu Arg Asp
            100                 105                 110
Gly Gly Ser Leu Tyr Asn Thr Gln Leu Leu Phe Asp Ala Asp Gly Thr
        115                 120                 125
Leu Ile Gln Arg Arg Arg Lys Ile Thr Pro Thr Tyr His Glu Arg Met
    130                 135                 140
Val Trp Gly Gln Gly Asp Gly Ser Gly Leu Arg Ala Val Asp Ser Lys
145                 150                 155                 160
Ala Gly Arg Ile Gly Gln Leu Ala Cys Trp Glu His Tyr Asn Pro Leu
                165                 170                 175
Ala Arg Tyr Ala Met Ile Ala Asp Gly Glu Gln Ile His Ala Ala Met
            180                 185                 190
Tyr Pro Gly Ser Ser Phe Gly Glu Leu Phe Ser Gln Gln Ile Glu Val
        195                 200                 205
Ser Val Arg Gln His Ala Leu Glu Ser Ala Ala Phe Val Val Ser Ser
    210                 215                 220
Thr Ala Trp Leu Asp Ala Asp Gln Gln Ala Gln Ile Met Lys Asp Thr
225                 230                 235                 240
Gly Ser Pro Ile Gly Pro Ile Ser Gly Gly Asn Phe Thr Ala Ile Ile
                245                 250                 255
Ala Pro Asp Gly Thr Ile Ile Gly Glu Pro Ile Arg Ser Gly Glu Gly
            260                 265                 270
Phe Val Ile Ala Asp Leu Asp Phe Asn Leu Ile Glu Lys Arg Lys Arg
        275                 280                 285
Leu Met Asp Leu Lys Gly His Tyr Asn Arg Pro Glu Leu Leu Ser Leu
    290                 295                 300
Leu Ile Asp Arg Thr Pro Ala Glu Tyr Val Gln Glu Val Asn Lys Ser
305                 310                 315                 320
Val Ser Glu

<210> 7
<211> 1107
<212> DNA
<213> Acidovorax facilis

<220>
<223> Acidovorax facilis 72W

<400> 7
atggtttctt acaactctaa attcctggct gctaccgttc aggctgaacc ggtttggctg     60
gacgctgacg ctaccatcga caaatctatc ggtatcatcg aagaagctgc tcagaaaggt    120
gcttctctga tcgctttccc ggaagttttc atcccgggtt acccgtactg ggcttggctg    180
ggtgacgtta aatactctct gtctttcacc tctcgttacc acgaaaactc tctggaactg    240
ggtgacgacc gtatgcgtcg tctgcagctg gctgctcgtc gtaacaaaat cgctctggtt    300
atgggttact ctgaacgtga agctggttct cgttacctgt ctcaggtttt catcgacgaa    360
cgtggtgaaa tcgttgctaa ccgtcgtaaa ctgaaaccga cccacgttga acgtaccatc    420
tacggtgaag gtaacggtac cgacttcctg acccacgact tcgctttcgg tcgtgttggt    480
ggtctgaact gctgggaaca cttccagccg ctgtctaaat tcatgatgta ctctctgggt    540
gaacaggttc acgttgcttc ttggccggct atgtctccgc tgcagccgga cgttttccag    600
ctgtctatcg aagctaacgc taccgttacc cgttcttacg ctatcgaagg tcagaccttc    660
gttctgtgct ctacccaggt tatcggtccg tctgctatcg aaaccttctg cctgaacgac    720
gaacagcgtg ctctgctgcc gcagggttgc ggttgggctc gtatctacgg tccggacggt    780
tctgaactgg ctaaaccgct ggctgaagac gctgaaggta tcctgtacgc tgaaatcgac    840
ctggaacaga tcctgctggc taaagctggt gctgacccgg ttggtcacta ctctcgtccg    900
gacgttctgt ctgttcagtt cgacccgcgt aaccacaccc cggttcaccg tatcggtatc    960
gacggtcgtc tggacgttaa cacccgttct cgtgttgaaa acttccgtct gcgtcaggct   1020
gctgaacagg aacgtcaggc ttctaaacgt ctgggtacca aactgttcga acagtctctg   1080
ctggctgaag aaccggttcc ggctaaa                                       1107

<210> 8
<211> 369
<212> PRT
<213> Acidovorax facilis

<220>
<223> Acidovorax facilis 72W

<400> 8
Met Val Ser Tyr Asn Ser Lys Phe Leu Ala Ala Thr Val Gln Ala Glu
1               5                   10                  15
Pro Val Trp Leu Asp Ala Asp Ala Thr Ile Asp Lys Ser Ile Gly Ile
            20                  25                  30
Ile Glu Glu Ala Ala Gln Lys Gly Ala Ser Leu Ile Ala Phe Pro Glu
        35                  40                  45
Val Phe Ile Pro Gly Tyr Pro Tyr Trp Ala Trp Leu Gly Asp Val Lys
    50                  55                  60
Tyr Ser Leu Ser Phe Thr Ser Arg Tyr His Glu Asn Ser Leu Glu Leu
65                  70                  75                  80
Gly Asp Asp Arg Met Arg Arg Leu Gln Leu Ala Ala Arg Arg Asn Lys
                85                  90                  95
Ile Ala Leu Val Met Gly Tyr Ser Glu Arg Glu Ala Gly Ser Arg Tyr
            100                 105                 110
Leu Ser Gln Val Phe Ile Asp Glu Arg Gly Glu Ile Val Ala Asn Arg
        115                 120                 125
Arg Lys Leu Lys Pro Thr His Val Glu Arg Thr Ile Tyr Gly Glu Gly
    130                 135                 140
Asn Gly Thr Asp Phe Leu Thr His Asp Phe Ala Phe Gly Arg Val Gly
145                 150                 155                 160
Gly Leu Asn Cys Trp Glu His Phe Gln Pro Leu Ser Lys Phe Met Met
                165                 170                 175
Tyr Ser Leu Gly Glu Gln Val His Val Ala Ser Trp Pro Ala Met Ser
            180                 185                 190
Pro Leu Gln Pro Asp Val Phe Gln Leu Ser Ile Glu Ala Asn Ala Thr
        195                 200                 205
Val Thr Arg Ser Tyr Ala Ile Glu Gly Gln Thr Phe Val Leu Cys Ser
    210                 215                 220
Thr Gln Val Ile Gly Pro Ser Ala Ile Glu Thr Phe Cys Leu Asn Asp
225                 230                 235                 240
Glu Gln Arg Ala Leu Leu Pro Gln Gly Cys Gly Trp Ala Arg Ile Tyr
                245                 250                 255
Gly Pro Asp Gly Ser Glu Leu Ala Lys Pro Leu Ala Glu Asp Ala Glu
            260                 265                 270
Gly Ile Leu Tyr Ala Glu Ile Asp Leu Glu Gln Ile Leu Leu Ala Lys
        275                 280                 285
Ala Gly Ala Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Leu Ser
    290                 295                 300
Val Gln Phe Asp Pro Arg Asn His Thr Pro Val His Arg Ile Gly Ile
305                 310                 315                 320
Asp Gly Arg Leu Asp Val Asn Thr Arg Ser Arg Val Glu Asn Phe Arg
                325                 330                 335
Leu Arg Gln Ala Ala Glu Gln Glu Arg Gln Ala Ser Lys Arg Leu Gly
            340                 345                 350
Thr Lys Leu Phe Glu Gln Ser Leu Leu Ala Glu Glu Pro Val Pro Ala
        355                 360                 365
Lys

<210> 9
<211> 975
<212> DNA
<213> Pseudomonas sp

<220>
<223> Pseudomonas sp. RIT357

<400> 9
atgaccagca aacgtgaaaa aaccgtggcc attgtgcaga tgccggcagc actgctggat     60
cgcgccgaaa gtatgcgccg cgcagccgaa catattaaga aagcagccct gcaagaagca    120
cagctggtta tttttccgga aacctggctg agttgttatc cggcctgggt gtttggtatg    180
gccggttggg atgatgcaca ggcaaaaagc tggtatgcaa aactgctggc agatagtccg    240
gttattggtc agccggaaga tatgcatgat gatctggcag aactgcgtga agccgcccgc    300
gtgaatgccg tgaccgtggt tatgggcatg aatgaacgta gtcgtcatca tggtggtagc    360
ctgtataata gtctggttac cattggtccg gatggtgcaa ttctgaatgt tcatcgtaaa    420
ctgaccccga cccataccga acgtaccgtt tgggcaaatg gtgacgcagc aggtctgcgc    480
gtggttgata ccgtggttgg tcgtgtgggt ggcctggttt gctgggaaca ttggcatccg    540
ctggcccgcc aggccctgca tgctcaagat gaacagattc atgttgcagc ctggccggat    600
atgccggaaa tgcatcatgt ggccgcccgc agctatgcat ttgaaggtcg ttgttttgtt    660
ctgtgtgcag gccagtatct ggcagcaggc gatgtgccgg cagaactgct ggccgcatat    720
cgccgtggcg ttggtggtaa agccctggaa gaagatgttc tgtttaatgg tggtagtggc    780
gttattgcac cggatggtag ttgggtgacc gcaccgctgt ttggcgaacc gggtattatt    840
ctggccacca ttgatctggc ccagattgat gcccagcatc atgatctgga tgtggcaggc    900
cattatctgc gtccggatgt gtttgaactg agtattgatc gccgcgttcg caccggtctg    960
accctgcgtg atgca                                                     975

<210> 10
<211> 325
<212> PRT
<213> Pseudomonas sp.

<220>
<223> Pseudomonas sp. RIT357

<400> 10
Met Thr Ser Lys Arg Glu Lys Thr Val Ala Ile Val Gln Met Pro Ala
1               5                   10                  15
Ala Leu Leu Asp Arg Ala Glu Ser Met Arg Arg Ala Ala Glu His Ile
            20                  25                  30
Lys Lys Ala Ala Leu Gln Glu Ala Gln Leu Val Ile Phe Pro Glu Thr
        35                  40                  45
Trp Leu Ser Cys Tyr Pro Ala Trp Val Phe Gly Met Ala Gly Trp Asp
    50                  55                  60
Asp Ala Gln Ala Lys Ser Trp Tyr Ala Lys Leu Leu Ala Asp Ser Pro
65                  70                  75                  80
Val Ile Gly Gln Pro Glu Asp Met His Asp Asp Leu Ala Glu Leu Arg
                85                  90                  95
Glu Ala Ala Arg Val Asn Ala Val Thr Val Val Met Gly Met Asn Glu
            100                 105                 110
Arg Ser Arg His His Gly Gly Ser Leu Tyr Asn Ser Leu Val Thr Ile
        115                 120                 125
Gly Pro Asp Gly Ala Ile Leu Asn Val His Arg Lys Leu Thr Pro Thr
    130                 135                 140
His Thr Glu Arg Thr Val Trp Ala Asn Gly Asp Ala Ala Gly Leu Arg
145                 150                 155                 160
Val Val Asp Thr Val Val Gly Arg Val Gly Gly Leu Val Cys Trp Glu
                165                 170                 175
His Trp His Pro Leu Ala Arg Gln Ala Leu His Ala Gln Asp Glu Gln
            180                 185                 190
Ile His Val Ala Ala Trp Pro Asp Met Pro Glu Met His His Val Ala
        195                 200                 205
Ala Arg Ser Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Cys Ala Gly
    210                 215                 220
Gln Tyr Leu Ala Ala Gly Asp Val Pro Ala Glu Leu Leu Ala Ala Tyr
225                 230                 235                 240
Arg Arg Gly Val Gly Gly Lys Ala Leu Glu Glu Asp Val Leu Phe Asn
                245                 250                 255
Gly Gly Ser Gly Val Ile Ala Pro Asp Gly Ser Trp Val Thr Ala Pro
            260                 265                 270
Leu Phe Gly Glu Pro Gly Ile Ile Leu Ala Thr Ile Asp Leu Ala Gln
        275                 280                 285
Ile Asp Ala Gln His His Asp Leu Asp Val Ala Gly His Tyr Leu Arg
    290                 295                 300
Pro Asp Val Phe Glu Leu Ser Ile Asp Arg Arg Val Arg Thr Gly Leu
305                 310                 315                 320
Thr Leu Arg Asp Ala
                325

<210> 11
<211> 924
<212> DNA
<213> Nocardia brasiliensis

<220>
<223> Nocardia brasiliensis NBRC 14402

<400> 11
atgcgtattg cagcagcaca ggcccgtccg gcatggctgg accctaccgc tggtaccaaa     60
attgtggtgg attggctgac caaagcagcc gccgcaggtg cagaactggt tgcatttccg    120
gaaacctttc tgagtggcta tccgatttgg ctggcccgta ccggtggtgc acgctttgat    180
aatccggcac agaaagccgc atacgcttat tatctgggcg ccgcagtgac cctggatggt    240
ccgcagctgg ataccgtgcg caccgcagca ggtgacctgg gcgttttctg ttatctgggc    300
attaccgaac gtgttcgtgg taccgtttat tgcaccctgg tggccattga tccggatcgt    360
ggcattgtgg gtgcccatcg caaactgatg ccgacccatg aagaacgtat ggtttggggc    420
attggcgatg gtaatggcct gcgtgcccat gattttggcg tttttcgtgt tagtggcctg    480
agttgttggg aaaattggat gccgcaggcc cgccatgccc tgtatgcaga tggtaccacc    540
ctgcatgtta gcacctggcc gggtagtatt cgtaatacca aagatattac ccgttttatt    600
gccctggaag gtcgtgtgta tagcctggcc gtgggtgccg tgctggatta tgcagatgtg    660
ccgaccgatt ttccgctgta tgaagaactg agcgcactgg ataaaccggc cggctatgat    720
ggcggcagtg ccgtggcagc cccggatggt acctggctgg ttgaaccggt ggtgggcacc    780
gaacgcctga ttctggcaga tttggaccct gccgaagtgg caaaagaacg tcagaatttt    840
gatccgaccg gccattatgc acgcccggat atttttagtg tgaccgtgaa tcgccatcgt    900
cgtaccccgg caacctttct ggat                                           924

<210> 12
<211> 308
<212> PRT
<213> Nocardia brasiliensis

<220>
<223> Nocardia brasiliensis NBRC 14402

<400> 12
Met Arg Ile Ala Ala Ala Gln Ala Arg Pro Ala Trp Leu Asp Pro Thr
1               5                   10                  15
Ala Gly Thr Lys Ile Val Val Asp Trp Leu Thr Lys Ala Ala Ala Ala
            20                  25                  30
Gly Ala Glu Leu Val Ala Phe Pro Glu Thr Phe Leu Ser Gly Tyr Pro
        35                  40                  45
Ile Trp Leu Ala Arg Thr Gly Gly Ala Arg Phe Asp Asn Pro Ala Gln
    50                  55                  60
Lys Ala Ala Tyr Ala Tyr Tyr Leu Gly Ala Ala Val Thr Leu Asp Gly
65                  70                  75                  80
Pro Gln Leu Asp Thr Val Arg Thr Ala Ala Gly Asp Leu Gly Val Phe
                85                  90                  95
Cys Tyr Leu Gly Ile Thr Glu Arg Val Arg Gly Thr Val Tyr Cys Thr
            100                 105                 110
Leu Val Ala Ile Asp Pro Asp Arg Gly Ile Val Gly Ala His Arg Lys
        115                 120                 125
Leu Met Pro Thr His Glu Glu Arg Met Val Trp Gly Ile Gly Asp Gly
    130                 135                 140
Asn Gly Leu Arg Ala His Asp Phe Gly Val Phe Arg Val Ser Gly Leu
145                 150                 155                 160
Ser Cys Trp Glu Asn Trp Met Pro Gln Ala Arg His Ala Leu Tyr Ala
                165                 170                 175
Asp Gly Thr Thr Leu His Val Ser Thr Trp Pro Gly Ser Ile Arg Asn
            180                 185                 190
Thr Lys Asp Ile Thr Arg Phe Ile Ala Leu Glu Gly Arg Val Tyr Ser
        195                 200                 205
Leu Ala Val Gly Ala Val Leu Asp Tyr Ala Asp Val Pro Thr Asp Phe
    210                 215                 220
Pro Leu Tyr Glu Glu Leu Ser Ala Leu Asp Lys Pro Ala Gly Tyr Asp
225                 230                 235                 240
Gly Gly Ser Ala Val Ala Ala Pro Asp Gly Thr Trp Leu Val Glu Pro
                245                 250                 255
Val Val Gly Thr Glu Arg Leu Ile Leu Ala Asp Leu Asp Pro Ala Glu
            260                 265                 270
Val Ala Lys Glu Arg Gln Asn Phe Asp Pro Thr Gly His Tyr Ala Arg
        275                 280                 285
Pro Asp Ile Phe Ser Val Thr Val Asn Arg His Arg Arg Thr Pro Ala
    290                 295                 300
Thr Phe Leu Asp
305

<210> 13
<211> 1059
<212> DNA
<213> Pseudomonas fluorescens

<400> 13
atgacggtgc ataaaaaaca gtacaaagta gccgcggtgc aggccgcccc tgcgttcctc     60
gacctggaag ctggcgtggc caaagccatc ggactgattg ctcaggcggc ggctgagggt    120
gcctcactgg tcgctttccc cgaagcgtgg ctgccggggt atccctggtg gatctggctg    180
gactccccgg ccggcggcat gcgcttcgtc cagcgcaact tcgacaatgc tctggaggtc    240
ggcagcgaac ccttcgagcg gctctgcagg gctgcggcac agcacaaaat ctacgtcgta    300
ctgggcttca ctgaacgctc tggcggcacc ttgtatttgg ctcaggcgat cattgatgat    360
tgcggtcggg tagtcgccac acggcgtaag ctcaagccga ctcacgtgga gcgctcagtc    420
tacggagaag gcgacggtag tgaccttgct gtgcatgaca ctaccttggg tcgcttaggt    480
gccttgtgct gcgcggagca tatccagccg ctgtccaagt acgccatgta cgctcagcac    540
gaacaggtac atatcgcggc ctggcctagc ttttcggtat accggggggc tgcgtttcaa    600
ctgagcgccc aagccaataa tgccgcctcg caagtctacg cactggaagg tcagtgtttt    660
gtgctggcgc catgcgccac ggtgtccaaa gaaatgctcg acgaactgat tgattctccg    720
gccaaggctg agctgctgct ggaaggtggc ggcttcgcga tgatctacgg cccggatggc    780
gcaccgctgt gtacgccatt ggcggaaaca gaggagggca ttctctatgc ggatatcgac    840
ttgggggtga tcggggtggc caaagctgcc tacgacccgg ttggtcacta ttcacgccct    900
gatgtgctgc ggttgctggt caaccgggag ccaatgacgc gtgtgcatta tgttcagccg    960
cagtcgttac cggagacatc ggtgttggcg ttcggtgcgg gagcggatgc catcagaagt   1020
gaggagaacc cagaagagca aggcgacaag ggatcctga                          1059

<210> 14
<211> 352
<212> PRT
<213> Pseudomonas fluorescens

<400> 14
Met Thr Val His Lys Lys Gln Tyr Lys Val Ala Ala Val Gln Ala Ala
1               5                   10                  15
Pro Ala Phe Leu Asp Leu Glu Ala Gly Val Ala Lys Ala Ile Gly Leu
            20                  25                  30
Ile Ala Gln Ala Ala Ala Glu Gly Ala Ser Leu Val Ala Phe Pro Glu
        35                  40                  45
Ala Trp Leu Pro Gly Tyr Pro Trp Trp Ile Trp Leu Asp Ser Pro Ala
    50                  55                  60
Gly Gly Met Arg Phe Val Gln Arg Asn Phe Asp Asn Ala Leu Glu Val
65                  70                  75                  80
Gly Ser Glu Pro Phe Glu Arg Leu Cys Arg Ala Ala Ala Gln His Lys
                85                  90                  95
Ile Tyr Val Val Leu Gly Phe Thr Glu Arg Ser Gly Gly Thr Leu Tyr
            100                 105                 110
Leu Ala Gln Ala Ile Ile Asp Asp Cys Gly Arg Val Val Ala Thr Arg
        115                 120                 125
Arg Lys Leu Lys Pro Thr His Val Glu Arg Ser Val Tyr Gly Glu Gly
    130                 135                 140
Asp Gly Ser Asp Leu Ala Val His Asp Thr Thr Leu Gly Arg Leu Gly
145                 150                 155                 160
Ala Leu Cys Cys Ala Glu His Ile Gln Pro Leu Ser Lys Tyr Ala Met
                165                 170                 175
Tyr Ala Gln His Glu Gln Val His Ile Ala Ala Trp Pro Ser Phe Ser
            180                 185                 190
Val Tyr Arg Gly Ala Ala Phe Gln Leu Ser Ala Gln Ala Asn Asn Ala
        195                 200                 205
Ala Ser Gln Val Tyr Ala Leu Glu Gly Gln Cys Phe Val Leu Ala Pro
    210                 215                 220
Cys Ala Thr Val Ser Lys Glu Met Leu Asp Glu Leu Ile Asp Ser Pro
225                 230                 235                 240
Ala Lys Ala Glu Leu Leu Leu Glu Gly Gly Gly Phe Ala Met Ile Tyr
                245                 250                 255
Gly Pro Asp Gly Ala Pro Leu Cys Thr Pro Leu Ala Glu Thr Glu Glu
            260                 265                 270
Gly Ile Leu Tyr Ala Asp Ile Asp Leu Gly Val Ile Gly Val Ala Lys
        275                 280                 285
Ala Ala Tyr Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Leu Arg
    290                 295                 300
Leu Leu Val Asn Arg Glu Pro Met Thr Arg Val His Tyr Val Gln Pro
305                 310                 315                 320
Gln Ser Leu Pro Glu Thr Ser Val Leu Ala Phe Gly Ala Gly Ala Asp
                325                 330                 335
Ala Ile Arg Ser Glu Glu Asn Pro Glu Glu Gln Gly Asp Lys Gly Ser
            340                 345                 350

<210> 15
<211> 1041
<212> DNA
<213> Agrobacterium rubi

<400> 15
atggaaaaga gtaagaccgt gcgtgccgcc gccgcccaga ttgctcctga tctgaccagt     60
cgcgataata ccctggcacg cgttctggat accattcatg aagcagccgg caaaggtgca    120
gaactgattg tgtttccgga aacctttgtg ccgtggtatc cgtattttag ttttgttctg    180
ccgccggttc tgagtggccg tgaacatctg cgtctgtatg aagaagcagt taccgttccg    240
agtgccacca ccgatgcagt ggccaccgca gcacgcgaac atggtattgt ggtggcactg    300
ggtgtgaatg aacgtgatca tggcaccctg tataataccc agctggtgtt tgatgcagat    360
ggcgccctgg tgctgaaacg tcgcaaaatt accccgacct ttcatgaacg tatgatttgg    420
ggccagggtg acgcaagtgg cctgaaagtg gtggatagcc aggttggccg cattggtgca    480
ctggcctgct gggaacatta taatccgctg gcacgttatg ccctgatggc ccagcatgaa    540
gaaattcatg ttgcccagtt tccgggcagc atggtgggcc cgatttttgc agatcagatg    600
gaagtgacca ttcgtcatca tgcactggaa agtggttgtt ttgtggttaa tgccaccggt    660
tggctgaccg atgaacagat tcgtagtatt accccggatg aaaatctgca aaaagcactg    720
cgcggtggct gcatgaccgc cattattagt ccggaaggta aacatctggc accgccgatg    780
accgaaggtg aaggcattct ggtggcagat ttggatatga gcctgattct gaaacgtaaa    840
cgtatgatgg atagtgtggg tcattatgcc cgcccggaac tgctgcatct ggttattgat    900
aatcgtccgg ccattaccat ggtgaccgcc catccgtttc tggaaaccgc accgaccggt    960
agtaataccg atggccatca gaccagcgcc tttgatggca atccggatca gcgcgccgca   1020
attctgcgcc gtcaggcagg c                                             1041

<210> 16
<211> 347
<212> PRT
<213> Agrobacterium rubi

<400> 16
Met Glu Lys Ser Lys Thr Val Arg Ala Ala Ala Ala Gln Ile Ala Pro
1               5                   10                  15
Asp Leu Thr Ser Arg Asp Asn Thr Leu Ala Arg Val Leu Asp Thr Ile
            20                  25                  30
His Glu Ala Ala Gly Lys Gly Ala Glu Leu Ile Val Phe Pro Glu Thr
        35                  40                  45
Phe Val Pro Trp Tyr Pro Tyr Phe Ser Phe Val Leu Pro Pro Val Leu
    50                  55                  60
Ser Gly Arg Glu His Leu Arg Leu Tyr Glu Glu Ala Val Thr Val Pro
65                  70                  75                  80
Ser Ala Thr Thr Asp Ala Val Ala Thr Ala Ala Arg Glu His Gly Ile
                85                  90                  95
Val Val Ala Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn
            100                 105                 110
Thr Gln Leu Val Phe Asp Ala Asp Gly Ala Leu Val Leu Lys Arg Arg
        115                 120                 125
Lys Ile Thr Pro Thr Phe His Glu Arg Met Ile Trp Gly Gln Gly Asp
    130                 135                 140
Ala Ser Gly Leu Lys Val Val Asp Ser Gln Val Gly Arg Ile Gly Ala
145                 150                 155                 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
                165                 170                 175
Ala Gln His Glu Glu Ile His Val Ala Gln Phe Pro Gly Ser Met Val
            180                 185                 190
Gly Pro Ile Phe Ala Asp Gln Met Glu Val Thr Ile Arg His His Ala
        195                 200                 205
Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Asp
    210                 215                 220
Glu Gln Ile Arg Ser Ile Thr Pro Asp Glu Asn Leu Gln Lys Ala Leu
225                 230                 235                 240
Arg Gly Gly Cys Met Thr Ala Ile Ile Ser Pro Glu Gly Lys His Leu
                245                 250                 255
Ala Pro Pro Met Thr Glu Gly Glu Gly Ile Leu Val Ala Asp Leu Asp
            260                 265                 270
Met Ser Leu Ile Leu Lys Arg Lys Arg Met Met Asp Ser Val Gly His
        275                 280                 285
Tyr Ala Arg Pro Glu Leu Leu His Leu Val Ile Asp Asn Arg Pro Ala
    290                 295                 300
Ile Thr Met Val Thr Ala His Pro Phe Leu Glu Thr Ala Pro Thr Gly
305                 310                 315                 320
Ser Asn Thr Asp Gly His Gln Thr Ser Ala Phe Asp Gly Asn Pro Asp
                325                 330                 335
Gln Arg Ala Ala Ile Leu Arg Arg Gln Ala Gly
            340                 345

<210> 17
<211> 990
<212> DNA
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 17
atgaaggtgg ttaaagcagc agcagttcag attagcccgg ttctgtatag tcgcgaagcc     60
accgttgaaa aagttgttaa aaagattcac gagctgggcc agctgggtgt gcagtttgca    120
acctttccgg aaaccgttgt tccgtattat ccgtatttta gtgcagttca gaccggtatt    180
gaactgctga gtggcaccga acatctgcgc ctgctggatc aggccgtgac cgttccgagt    240
ccggcaaccg atgcaattgg tgaagccgcc cgcaaagccg gtatggttgt gagtattggt    300
gttaatgaac gtgatggtgg caccctgtat aatacccagc tgctgtttga tgcagatggt    360
accctgattc agcgtcgtcg taaaattacc ccgacccatt ttgaacgcat gatttggggt    420
cagggtgacg gtagcggtct gcgtgcagtt gatagtaaag ttggtcgcat tggtcagctg    480
gcatgttttg aacataataa tccgctggcc cgctatgcac tgattgcaga tggtgaacag    540
attcatagcg caatgtatcc gggcagtgcc tttggtgaag gttttgcaca gcgtatggaa    600
attaatattc gtcagcatgc actggaaagt ggcgcatttg tggtgaatgc aaccgcatgg    660
ctggatgcag atcagcaggc acagattatt aaggataccg gttgtggtat tggtccgatt    720
agcggcggtt gttttaccac cattgtggca ccggatggta tgctgatggc cgaaccgctg    780
cgtagtggcg aaggcgaagt gattgttgat ctggatttta ccctgattga tcgccgcaaa    840
atgctgatgg atagcgcagg ccattataat cgtccggaac tgctgagcct gatgattgat    900
cgcaccgcaa ccgcccatgt tcatgaacgc gccgcacatc cggtgagtgg tgccgaacag    960
ggcccggaag atttgcgcac cccggccgct                                     990

<210> 18
<211> 330
<212> PRT
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 18
Met Lys Val Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1               5                   10                  15
Ser Arg Glu Ala Thr Val Glu Lys Val Val Lys Lys Ile His Glu Leu
            20                  25                  30
Gly Gln Leu Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
        35                  40                  45
Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
    50                  55                  60
Gly Thr Glu His Leu Arg Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65                  70                  75                  80
Pro Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Lys Ala Gly Met Val
                85                  90                  95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
            100                 105                 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
        115                 120                 125
Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
    130                 135                 140
Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg Ile Gly Gln Leu
145                 150                 155                 160
Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Ile Ala
                165                 170                 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
            180                 185                 190
Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
        195                 200                 205
Glu Ser Gly Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
    210                 215                 220
Gln Gln Ala Gln Ile Ile Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225                 230                 235                 240
Ser Gly Gly Cys Phe Thr Thr Ile Val Ala Pro Asp Gly Met Leu Met
                245                 250                 255
Ala Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Val Asp Leu Asp
            260                 265                 270
Phe Thr Leu Ile Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His
        275                 280                 285
Tyr Asn Arg Pro Glu Leu Leu Ser Leu Met Ile Asp Arg Thr Ala Thr
    290                 295                 300
Ala His Val His Glu Arg Ala Ala His Pro Val Ser Gly Ala Glu Gln
305                 310                 315                 320
Gly Pro Glu Asp Leu Arg Thr Pro Ala Ala
                325                 330

<210> 19
<211> 1158
<212> DNA
<213> Candidatus Dadabacteria

<220>
<223> Candidatus Dadabacteria bacterium CSP1-2

<400> 19
atgggtcagg tgctgggtgg tcgtgaacag gttcgtgccg ccgtggttca ggcaagtccg     60
gtttttatga ataagaaagg ttgtctggaa aaggcctgcg atctgattca taaagcaggt    120
aaagaaggcg cagaaattgt ggtgtttccg gaaacctgga ttccgaccta tccgtattgg    180
ggtatgggtt gggataccgc agcagcagca tttgccgatg ttcatgccga tctgcaagat    240
aatagcctgg tggttggcag caaagatacc gatattctgg gtaaagcagc ccgcgatgcc    300
ggtgcctatg ttgttatggg ctgcaatgaa ctggatgatc gcattggcag ccgtaccctg    360
tttaatagtc tggtttatat tggcaaagac ggccgtgtta tgggtcgtca tcgtaaactg    420
attccgagtt atattgaacg catttggtgg ggtcgcggtg acgcccgtga tctgaaagtt    480
tttgataccg atatcggccg cattggtggt cagatttgtt gggaaaatca tattgttaac    540
atcaccgcct ggtttattgc ccagggcgtt gatattcatg ttgcagtttg gccgggtctg    600
tggaattgtg gtgccgcaca gggtgaaagt tttatctatg caggccatga tattaataag    660
tgcgatctga tcccggccac ccgcgaacgc gcctttaccg gtcagtgctt tgttctgagc    720
gcaaataata ttctgcgcat ggatgaaatt ccggatgatt ttccgtttaa aaataagatg    780
acctacgcag gtccgggtca gggtgaattt gttggctggg catgtggtgg tagtcatatt    840
gttgcaccga ccagcgaata tattgtgccg ccgacctttg atgttgaaac cattctgtat    900
gcagatttga atgccaaata tattaaggtt gtgaagagcg ttttcgatag tctgggccat    960
tatacccgct gggatctggt gagtctgacc aaacagccgc agccgtatga accgctggca   1020
ggcgaacgcc cgatggcaat gccggaagaa cgtattgaac aggttgccga tgcagtggcc   1080
cgtgagttta atctggatgt tgaaaaagtt gataagatcg tgcgtcaggt taccaccccg   1140
catcgtcagc gcgcagcc                                                 1158

<210> 20
<211> 386
<212> PRT
<213> Candidatus Dadabacteria

<220>
<223> Candidatus Dadabacteria bacterium CSP1-2

<400> 20
Met Gly Gln Val Leu Gly Gly Arg Glu Gln Val Arg Ala Ala Val Val
1               5                   10                  15
Gln Ala Ser Pro Val Phe Met Asn Lys Lys Gly Cys Leu Glu Lys Ala
            20                  25                  30
Cys Asp Leu Ile His Lys Ala Gly Lys Glu Gly Ala Glu Ile Val Val
        35                  40                  45
Phe Pro Glu Thr Trp Ile Pro Thr Tyr Pro Tyr Trp Gly Met Gly Trp
    50                  55                  60
Asp Thr Ala Ala Ala Ala Phe Ala Asp Val His Ala Asp Leu Gln Asp
65                  70                  75                  80
Asn Ser Leu Val Val Gly Ser Lys Asp Thr Asp Ile Leu Gly Lys Ala
                85                  90                  95
Ala Arg Asp Ala Gly Ala Tyr Val Val Met Gly Cys Asn Glu Leu Asp
            100                 105                 110
Asp Arg Ile Gly Ser Arg Thr Leu Phe Asn Ser Leu Val Tyr Ile Gly
        115                 120                 125
Lys Asp Gly Arg Val Met Gly Arg His Arg Lys Leu Ile Pro Ser Tyr
    130                 135                 140
Ile Glu Arg Ile Trp Trp Gly Arg Gly Asp Ala Arg Asp Leu Lys Val
145                 150                 155                 160
Phe Asp Thr Asp Ile Gly Arg Ile Gly Gly Gln Ile Cys Trp Glu Asn
                165                 170                 175
His Ile Val Asn Ile Thr Ala Trp Phe Ile Ala Gln Gly Val Asp Ile
            180                 185                 190
His Val Ala Val Trp Pro Gly Leu Trp Asn Cys Gly Ala Ala Gln Gly
        195                 200                 205
Glu Ser Phe Ile Tyr Ala Gly His Asp Ile Asn Lys Cys Asp Leu Ile
    210                 215                 220
Pro Ala Thr Arg Glu Arg Ala Phe Thr Gly Gln Cys Phe Val Leu Ser
225                 230                 235                 240
Ala Asn Asn Ile Leu Arg Met Asp Glu Ile Pro Asp Asp Phe Pro Phe
                245                 250                 255
Lys Asn Lys Met Thr Tyr Ala Gly Pro Gly Gln Gly Glu Phe Val Gly
            260                 265                 270
Trp Ala Cys Gly Gly Ser His Ile Val Ala Pro Thr Ser Glu Tyr Ile
        275                 280                 285
Val Pro Pro Thr Phe Asp Val Glu Thr Ile Leu Tyr Ala Asp Leu Asn
    290                 295                 300
Ala Lys Tyr Ile Lys Val Val Lys Ser Val Phe Asp Ser Leu Gly His
305                 310                 315                 320
Tyr Thr Arg Trp Asp Leu Val Ser Leu Thr Lys Gln Pro Gln Pro Tyr
                325                 330                 335
Glu Pro Leu Ala Gly Glu Arg Pro Met Ala Met Pro Glu Glu Arg Ile
            340                 345                 350
Glu Gln Val Ala Asp Ala Val Ala Arg Glu Phe Asn Leu Asp Val Glu
        355                 360                 365
Lys Val Asp Lys Ile Val Arg Gln Val Thr Thr Pro His Arg Gln Arg
    370                 375                 380
Ala Ala
385

<210> 21
<211> 963
<212> DNA
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 21
atgtcaaacg agaacaacaa cgctacattc aaagttgccg cagtacaggc tacacctgtt     60
tttctcgatc gtgaagcgac tctcgacaag gcttgcgatt tgatcgccgc cgccggaggt    120
gaaggggcac gattggttgt ctttccagaa gccttcatac cggcctatcc ggattgggta    180
tgggcaatcc caccgggtga agagggcgta cttaatgagt tgtacgcaga gctgctctcc    240
aactcggtca cgattcccag tgacgcgacg gacagactgt gccgggccgc gaggcttgct    300
aatgcttacg tggtgatggg gataagcgaa cgcaatgtcg aggcgagtgg agcaagcctg    360
tataacacgc tgttgtacat cgatgcgcag ggtgagattc taggcaaaca tcgaaagcta    420
gtgccaacgg gcggcgagcg gctggtgtgg gcgcagggcg atggcagcac actgcaggtc    480
tacgatactc cactgggaaa actcggcggt ttaatttgct gggagaatta tatgccgctg    540
gcccgctata ccatgtatgc ctggggcaca caaatctatg tcgccgctac gtgggatcgc    600
gggcaaccct ggctctccac tttgcggcat atcgccaaag aaggcagggt gtacgtgatt    660
ggttgttgta tcgcgatgcg caaagacgat atccctgatc gttacgcaat gaagcagaag    720
ttttacgcgg aggcagatga gtggatcaat ataggtgaca gcgcgattgt caatcctgaa    780
gggcaattta tcgcagggcc agtacgcaag caggaagaga ttctctacgc agagattgat    840
ccgcgcatgg tacaagggcc gaagtggatg ctcgacgtgg cggggcacta tgccaggccg    900
gatgtgttcc agttgacggt gcatacggat gtgcgacaga tgattcggat ggaacacgat    960
tct                                                                  963

<210> 22
<211> 321
<212> PRT
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 22
Met Ser Asn Glu Asn Asn Asn Ala Thr Phe Lys Val Ala Ala Val Gln
1               5                   10                  15
Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Leu Asp Lys Ala Cys
            20                  25                  30
Asp Leu Ile Ala Ala Ala Gly Gly Glu Gly Ala Arg Leu Val Val Phe
        35                  40                  45
Pro Glu Ala Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Ala Ile Pro
    50                  55                  60
Pro Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Glu Leu Leu Ser
65                  70                  75                  80
Asn Ser Val Thr Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Arg Ala
                85                  90                  95
Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly Ile Ser Glu Arg Asn
            100                 105                 110
Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp
        115                 120                 125
Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
    130                 135                 140
Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145                 150                 155                 160
Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
                165                 170                 175
Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile
            180                 185                 190
Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
        195                 200                 205
Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
    210                 215                 220
Ala Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Ala Met Lys Gln Lys
225                 230                 235                 240
Phe Tyr Ala Glu Ala Asp Glu Trp Ile Asn Ile Gly Asp Ser Ala Ile
                245                 250                 255
Val Asn Pro Glu Gly Gln Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
            260                 265                 270
Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
        275                 280                 285
Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
    290                 295                 300
Leu Thr Val His Thr Asp Val Arg Gln Met Ile Arg Met Glu His Asp
305                 310                 315                 320
Ser

<210> 23
<400> 23
000

<210> 24
<400> 24
000

<210> 25
<211> 972
<212> DNA
<213> Tepidicaulis marinus

<400> 25
atgacccgtg ttgctgctat ccagatggaa gctaaagttg ctgacctgaa cttcaacatc     60
gaccaggctt ctcgtctgat cgacgaagct ggttctaaag gtgctgaaat catcgctctg    120
ccggaattct tcaccacccg tatcgtttac gacgaacgtc tgttcgaatg ctctctgccg    180
ccggaaaacc cggctctgga catgctgaaa gctaaagctg ctaaatacgg tgctatgatc    240
ggtggttctt acctggaaat gcgtgacggt gacgtttaca acacctacac cctggttgaa    300
ccggacggta ccgttcaccg tcacgacaaa gaccgtccga ccatggttga aaacgctttc    360
tacaccggtg gttctgacga cggttacttc gaaaccgcta tgggtccggt tggtaccgct    420
gtttgctggg aactgatccg taccgctacc gttcgtcgtc tggctggtaa agttggtctg    480
atgatgaccg gttctcactg gtggtctgct ccgggttgga acttctggaa atctttcgaa    540
cgtcgtttcc acaaagctaa cggtaaagct atggaaatca ccccgccgcg tttcgcttct    600
ctggttggtg ctccgctgct gcacgctggt cacaccggta tgctggaagg tggtttcctg    660
gttctgccgg gtacccgtat ctctgttccg acccgtaccc agctgatggg tgaaacccag    720
atcatcgacg gtgaaggtgc tgttgttgct cgtcgtcact acaccgaagg tgctggtatc    780
gttggtggtg aaatcgaact gggtgctacc tctccgaaaa aagctccgcc ggaccgtttc    840
tggatcccga acctggaagg tttcccgaaa gctctgtggc tgcaccagaa cccggctggt    900
gcttctgttt accgttgggc taaacgtacc ggtcgtctga aaacctacga cttctctcgt    960
aacgctcgtc cg                                                        972

<210> 26
<211> 324
<212> PRT
<213> Tepidicaulis marinus

<400> 26
Met Thr Arg Val Ala Ala Ile Gln Met Glu Ala Lys Val Ala Asp Leu
1               5                   10                  15
Asn Phe Asn Ile Asp Gln Ala Ser Arg Leu Ile Asp Glu Ala Gly Ser
            20                  25                  30
Lys Gly Ala Glu Ile Ile Ala Leu Pro Glu Phe Phe Thr Thr Arg Ile
        35                  40                  45
Val Tyr Asp Glu Arg Leu Phe Glu Cys Ser Leu Pro Pro Glu Asn Pro
    50                  55                  60
Ala Leu Asp Met Leu Lys Ala Lys Ala Ala Lys Tyr Gly Ala Met Ile
65                  70                  75                  80
Gly Gly Ser Tyr Leu Glu Met Arg Asp Gly Asp Val Tyr Asn Thr Tyr
                85                  90                  95
Thr Leu Val Glu Pro Asp Gly Thr Val His Arg His Asp Lys Asp Arg
            100                 105                 110
Pro Thr Met Val Glu Asn Ala Phe Tyr Thr Gly Gly Ser Asp Asp Gly
        115                 120                 125
Tyr Phe Glu Thr Ala Met Gly Pro Val Gly Thr Ala Val Cys Trp Glu
    130                 135                 140
Leu Ile Arg Thr Ala Thr Val Arg Arg Leu Ala Gly Lys Val Gly Leu
145                 150                 155                 160
Met Met Thr Gly Ser His Trp Trp Ser Ala Pro Gly Trp Asn Phe Trp
                165                 170                 175
Lys Ser Phe Glu Arg Arg Phe His Lys Ala Asn Gly Lys Ala Met Glu
            180                 185                 190
Ile Thr Pro Pro Arg Phe Ala Ser Leu Val Gly Ala Pro Leu Leu His
        195                 200                 205
Ala Gly His Thr Gly Met Leu Glu Gly Gly Phe Leu Val Leu Pro Gly
    210                 215                 220
Thr Arg Ile Ser Val Pro Thr Arg Thr Gln Leu Met Gly Glu Thr Gln
225                 230                 235                 240
Ile Ile Asp Gly Glu Gly Ala Val Val Ala Arg Arg His Tyr Thr Glu
                245                 250                 255
Gly Ala Gly Ile Val Gly Gly Glu Ile Glu Leu Gly Ala Thr Ser Pro
            260                 265                 270
Lys Lys Ala Pro Pro Asp Arg Phe Trp Ile Pro Asn Leu Glu Gly Phe
        275                 280                 285
Pro Lys Ala Leu Trp Leu His Gln Asn Pro Ala Gly Ala Ser Val Tyr
    290                 295                 300
Arg Trp Ala Lys Arg Thr Gly Arg Leu Lys Thr Tyr Asp Phe Ser Arg
305                 310                 315                 320
Asn Ala Arg Pro

<210> 27
<211> 1014
<212> DNA
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 27
atgggtatcg aacacccgaa atacaaagtt gctgttgttc aggctgctcc ggcttggctg     60
gacctggacg cttctatcga caaatctatc gctctgatcg aagaagctgc tcagaaaggt    120
gctaaactga tcgctttccc ggaagctttc atcccgggtt acccgtggca catctggatg    180
gactctccgg cttgggctat cggtcgtggt ttcgttcagc gttacttcga caactctctg    240
gcttacgact ctccgcaggc tgaaaaactg cgtgctgctg ttcgtaaagc taaactgacc    300
gctgttctgg gtctgtctga acgtgacggt ggttctctgt acctggctca gtggctgatc    360
ggtccggacg gtgaaaccat cgctaaacgt cgtaaactgc gtccgaccca cgctgaacgt    420
accgtttacg gtgaaggtga cggttctgac ctggctgttc acaaccgtcc ggacatcggt    480
cgtctgggtg ctctgtgctg ctgggaacac ctgcagccgc tgtctaaata cgctatgtac    540
gctcagaacg aacaggttca cgttgctgct tggccgtctt tctctctgta cgacccgttc    600
gctgttgctc tgggtgctga agttaacaac gctgcttctc gtgtttacgc tgttgaaggt    660
tcttgcttcg ttctggctcc gtgcgctacc gtttctcagg ctatgatcga cgaactgtgc    720
gaccgtccgg acaaacacac cctgctgcac gttggtggtg gtttcgctgc tatctacggt    780
ccggacggtt ctcagatcgg tgacaaactg gctccggacc aggaaggtct gctgatcgct    840
gaaatcgacc tgggtgctat cggtgttgct aaaaacgctg ctgacccggc tggtcactac    900
tctcgtccgg acgttacccg tctgctgctg aacaaaaaac cgtacaaacg tgttgaacag    960
ttctctccgc cggctgaagc tgttgaaccg accgacatcg ctgctgctgc ttct         1014

<210> 28
<211> 338
<212> PRT
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 28
Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1               5                   10                  15
Pro Ala Trp Leu Asp Leu Asp Ala Ser Ile Asp Lys Ser Ile Ala Leu
            20                  25                  30
Ile Glu Glu Ala Ala Gln Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
        35                  40                  45
Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala
    50                  55                  60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65                  70                  75                  80
Ala Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Ala Ala Val Arg Lys
                85                  90                  95
Ala Lys Leu Thr Ala Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
            100                 105                 110
Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
        115                 120                 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
    130                 135                 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Asn Arg Pro Asp Ile Gly
145                 150                 155                 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
                165                 170                 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
            180                 185                 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Val Ala Leu Gly Ala Glu Val
        195                 200                 205
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
    210                 215                 220
Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225                 230                 235                 240
Asp Arg Pro Asp Lys His Thr Leu Leu His Val Gly Gly Gly Phe Ala
                245                 250                 255
Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Ala Pro
            260                 265                 270
Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
        275                 280                 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
    290                 295                 300
Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Tyr Lys Arg Val Glu Gln
305                 310                 315                 320
Phe Ser Pro Pro Ala Glu Ala Val Glu Pro Thr Asp Ile Ala Ala Ala
                325                 330                 335
Ala Ser

<210> 29
<211> 1014
<212> DNA
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 29
atgggtatcg aacacccgaa atacaaagtt gctgttgttc aggctgctcc ggcttggctg     60
gacctggacg gttctgttga caaatctatc gctctgatca aagaagctgc tgaaaaaggt    120
gctaaactga tcgctttccc ggaagctttc atcccgggtt acccgtggca catctggatg    180
gactctccgg cttgggctat cggtcgtggt ttcgttcagc gttacttcga caactctctg    240
tcttacgact ctccgcaggc tgaacgtctg cgtgacgctg ttaaaaaagc taaactgacc    300
gctgttttcg gtctgtctga acgtgacggt ggttctctgt acctggctca gtggctgatc    360
ggtccggacg gtgaaaccat cgctaaacgt cgtaaactgc gtccgaccca cgctgaacgt    420
accgtttacg gtgaaggtga cggttctgac ctggctgttc acgctcgtgc tgacatcggt    480
cgtatcggtg ctctgtgctg ctgggaacac ctgcagccgc tgtctaaata cgctatgtac    540
gctcagaacg aacaggttca cgttgctgct tggccgtctt tctctctgta cgacccgttc    600
gctccggctc tgggtgctga agttaacaac gctgcttctc gtgtttacgc tgttgaaggt    660
tcttgcttcg ttctggctcc gtgcgctacc gtttctcagg ctatgatcga cgaactgtgc    720
gaccgtccgg acaaaaacgc tctgctgcac gttggtggtg gtttcgctgc tatctacggt    780
ccggacggtt ctcagatcgg tgacaaactg gctccggacc aggaaggtct gctgatcgct    840
gaaatcgacc tgggtgctat cggtgttgct aaaaacgctg ctgacccggc tggtcactac    900
tctcgtccgg acgttacccg tctgctgctg aacaaaaaac gttaccagcg tgttgaacag    960
ttcgctctgc cggttgacac cgttgaaccg gctgacatcg gtgctgctgc ttct         1014

<210> 30
<211> 338
<212> PRT
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 30
Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1               5                   10                  15
Pro Ala Trp Leu Asp Leu Asp Gly Ser Val Asp Lys Ser Ile Ala Leu
            20                  25                  30
Ile Lys Glu Ala Ala Glu Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
        35                  40                  45
Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala
    50                  55                  60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65                  70                  75                  80
Ser Tyr Asp Ser Pro Gln Ala Glu Arg Leu Arg Asp Ala Val Lys Lys
                85                  90                  95
Ala Lys Leu Thr Ala Val Phe Gly Leu Ser Glu Arg Asp Gly Gly Ser
            100                 105                 110
Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
        115                 120                 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
    130                 135                 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Ala Arg Ala Asp Ile Gly
145                 150                 155                 160
Arg Ile Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
                165                 170                 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
            180                 185                 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val
        195                 200                 205
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
    210                 215                 220
Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225                 230                 235                 240
Asp Arg Pro Asp Lys Asn Ala Leu Leu His Val Gly Gly Gly Phe Ala
                245                 250                 255
Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Ala Pro
            260                 265                 270
Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
        275                 280                 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
    290                 295                 300
Val Thr Arg Leu Leu Leu Asn Lys Lys Arg Tyr Gln Arg Val Glu Gln
305                 310                 315                 320
Phe Ala Leu Pro Val Asp Thr Val Glu Pro Ala Asp Ile Gly Ala Ala
                325                 330                 335
Ala Ser

<210> 31
<211> 1011
<212> DNA
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 31
atgggtatca cccacccgaa ctacaaagtt gctgttgttc aggctgctcc ggtttggctg     60
aacctggaag ctaccgttga aaaaaccatc cgttacatcg aagaagctgc taaagctggt    120
gctaaactga tcgctttccc ggaaacctgg atcccgggtt acccgtggca catctggatc    180
ggtaccccgg cttgggctat cggtaaaggt ttcgttcagc gttacttcga caactctctg    240
tcttacgact ctccgctggc tcgtcagatc gctgacgctg ctgctaaatc taaaatcacc    300
gttgttctgg gtctgtctga acgtgacggt ggttctctgt acatcgctca gtggctgatc    360
ggtccggacg gtgaaaccat cgctaaacgt cgtaaactgc gtccgaccca cgttgaacgt    420
accgttttcg gtgacggtga cggttctcac atcgctgttc acgaccgttc tgacctgggt    480
cgtctgggtg ctctgtgctg ctgggaacac gttcagccgc tgaccaaatt cgctatgtac    540
gctcagaacg aacaggttca cgttgctgct tggccgtctt tctctatgta cgaaccgttc    600
gctcacgctc tgggttggga aaccaacaac gctgtttcta aagtttacgc tgttgaaggt    660
tcttgcttcg ttctggctcc gtgcgctgtt atctctcagg ctatggttga cgaaatgtgc    720
gacaccccgg acaaacgtga actggttcac gctggtggtg gtcacgctgt tatctacggt    780
ccggacggtt ctccgctggc tgaaaaactg ggtgaaaacg aagaaggtct gctgtacgct    840
accgttaacc tggctgctat cggtgttgct aaaaacgctg ctgacccggc tggtcactac    900
tctcgtccgg acgttctgcg tctgctgttc aacaaatctc cggctcgtcg tgttgaacac    960
ttcgctctgc cgcacgaaca gctggaaatc ggtgctggtc cgtctggtga c            1011

<210> 32
<211> 337
<212> PRT
<213> Unknown

<220>
<223> Unknown prokaryotic organism

<400> 32
Met Gly Ile Thr His Pro Asn Tyr Lys Val Ala Val Val Gln Ala Ala
1               5                   10                  15
Pro Val Trp Leu Asn Leu Glu Ala Thr Val Glu Lys Thr Ile Arg Tyr
            20                  25                  30
Ile Glu Glu Ala Ala Lys Ala Gly Ala Lys Leu Ile Ala Phe Pro Glu
        35                  40                  45
Thr Trp Ile Pro Gly Tyr Pro Trp His Ile Trp Ile Gly Thr Pro Ala
    50                  55                  60
Trp Ala Ile Gly Lys Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65                  70                  75                  80
Ser Tyr Asp Ser Pro Leu Ala Arg Gln Ile Ala Asp Ala Ala Ala Lys
                85                  90                  95
Ser Lys Ile Thr Val Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
            100                 105                 110
Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
        115                 120                 125
Lys Arg Arg Lys Leu Arg Pro Thr His Val Glu Arg Thr Val Phe Gly
    130                 135                 140
Asp Gly Asp Gly Ser His Ile Ala Val His Asp Arg Ser Asp Leu Gly
145                 150                 155                 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Val Gln Pro Leu Thr Lys
                165                 170                 175
Phe Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
            180                 185                 190
Ser Phe Ser Met Tyr Glu Pro Phe Ala His Ala Leu Gly Trp Glu Thr
        195                 200                 205
Asn Asn Ala Val Ser Lys Val Tyr Ala Val Glu Gly Ser Cys Phe Val
    210                 215                 220
Leu Ala Pro Cys Ala Val Ile Ser Gln Ala Met Val Asp Glu Met Cys
225                 230                 235                 240
Asp Thr Pro Asp Lys Arg Glu Leu Val His Ala Gly Gly Gly His Ala
                245                 250                 255
Val Ile Tyr Gly Pro Asp Gly Ser Pro Leu Ala Glu Lys Leu Gly Glu
            260                 265                 270
Asn Glu Glu Gly Leu Leu Tyr Ala Thr Val Asn Leu Ala Ala Ile Gly
        275                 280                 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
    290                 295                 300
Val Leu Arg Leu Leu Phe Asn Lys Ser Pro Ala Arg Arg Val Glu His
305                 310                 315                 320
Phe Ala Leu Pro His Glu Gln Leu Glu Ile Gly Ala Gly Pro Ser Gly
                325                 330                 335
Asp

<210> 33
<211> 996
<212> DNA
<213> Synechococcus sp.

<220>
<223> Synechococcus sp. CC9605

<400> 33
atgaccaccg ttaaagttgc tgctgctcag atccgtccgg ttctgttctc tctggacggt     60
tctctgcaga aagttctgga cgctatggct gaagctgctg ctcagggtgt tgaactgatc    120
gttttcccgg aaaccttcct gccgtactac ccgtacttct ctttcgttga accgccggtt    180
ctgatgggtc gttctcacct ggctctgtac gaacaggctg ttgttgttcc gggtccggtt    240
accgacgctg ttgctgctgc tgcttctcag tacggtatgc aggttctgct gggtgttaac    300
gaacgtgacg gtggtaccct gtacaacacc cagctgctgt tcaactcttg cggtgaactg    360
gttctgaaac gtcgtaaaat caccccgacc taccacgaac gtatggtttg gggtcagggt    420
gacggttctg gtctgaaagt tgttcagacc ccgctggctc gtgttggtgc tctggcttgc    480
tgggaacact acaacccgct ggctcgttac gctctgatgg ctcagggtga agaaatccac    540
tgcgctcagt tcccgggttc tctggttggt ccgatcttca ccgaacagac cgctgttacc    600
atgcgtcacc acgctctgga agctggttgc ttcgttatct gctctaccgg ttggctgcac    660
ccggacgact acgcttctat cacctctgaa tctggtctgc acaaagcttt ccagggtggt    720
tgccacaccg ctgttatctc tccggaaggt cgttacctgg ctggtccgct gccggacggt    780
gaaggtctgg ctatcgctga cctggacctg gctctgatca ccaaacgtaa acgtatgatg    840
gactctgttg gtcactactc tcgtccggaa ctgctgtctc tgcagatcaa ctcttctccg    900
gctgttccgg ttcagaacat gtctaccgct tctgttccgc tggaaccggc taccgctacc    960
gacgctctgt cttctatgga agctctgaac cacgtt                              996

<210> 34
<211> 332
<212> PRT
<213> Synechococcus sp.

<220>
<223> Synechococcus sp. CC9605

<400> 34
Met Thr Thr Val Lys Val Ala Ala Ala Gln Ile Arg Pro Val Leu Phe
1               5                   10                  15
Ser Leu Asp Gly Ser Leu Gln Lys Val Leu Asp Ala Met Ala Glu Ala
            20                  25                  30
Ala Ala Gln Gly Val Glu Leu Ile Val Phe Pro Glu Thr Phe Leu Pro
        35                  40                  45
Tyr Tyr Pro Tyr Phe Ser Phe Val Glu Pro Pro Val Leu Met Gly Arg
    50                  55                  60
Ser His Leu Ala Leu Tyr Glu Gln Ala Val Val Val Pro Gly Pro Val
65                  70                  75                  80
Thr Asp Ala Val Ala Ala Ala Ala Ser Gln Tyr Gly Met Gln Val Leu
                85                  90                  95
Leu Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr Gln Leu
            100                 105                 110
Leu Phe Asn Ser Cys Gly Glu Leu Val Leu Lys Arg Arg Lys Ile Thr
        115                 120                 125
Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
    130                 135                 140
Leu Lys Val Val Gln Thr Pro Leu Ala Arg Val Gly Ala Leu Ala Cys
145                 150                 155                 160
Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Gln Gly
                165                 170                 175
Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Leu Val Gly Pro Ile
            180                 185                 190
Phe Thr Glu Gln Thr Ala Val Thr Met Arg His His Ala Leu Glu Ala
        195                 200                 205
Gly Cys Phe Val Ile Cys Ser Thr Gly Trp Leu His Pro Asp Asp Tyr
    210                 215                 220
Ala Ser Ile Thr Ser Glu Ser Gly Leu His Lys Ala Phe Gln Gly Gly
225                 230                 235                 240
Cys His Thr Ala Val Ile Ser Pro Glu Gly Arg Tyr Leu Ala Gly Pro
                245                 250                 255
Leu Pro Asp Gly Glu Gly Leu Ala Ile Ala Asp Leu Asp Leu Ala Leu
            260                 265                 270
Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser Arg
        275                 280                 285
Pro Glu Leu Leu Ser Leu Gln Ile Asn Ser Ser Pro Ala Val Pro Val
    290                 295                 300
Gln Asn Met Ser Thr Ala Ser Val Pro Leu Glu Pro Ala Thr Ala Thr
305                 310                 315                 320
Asp Ala Leu Ser Ser Met Glu Ala Leu Asn His Val
                325                 330

<210> 35
<400> 35
000

<210> 36
<400> 36
000

<210> 37
<211> 951
<212> DNA
<213> Aquimarina atlantica

<400> 37
atgaaagacc agctgctgac cgttgctctg gctcagatct ctccggtttg gctggacaaa     60
accgctacca tcaaaaaaat cgaaaactct atcgctgaag ctgcttctaa aaaagctgaa    120
ctgatcgttt tcggtgaatc tctgctgccg ggttacccgt tctgggtttc tctgaccgac    180
ggtgctaaat tcgactctaa aatccagaaa gaaatccacg ctcactacgc tcagaactct    240
atcgttatcg aaaacggtga cctggacacc atctgcgaac tggctgctga atgcaacatc    300
gctatctacc tgggtatcat cgaacgtccg atcgaccgtg gtggtcactc tctgtacgct    360
tctctggttt acatcgacca gaaaggtgaa atcaaatctg ttcaccgtaa actgcagccg    420
acctacgaag aacgtctgac ctgggctccg ggtgacggta acggtctgct ggttcacccg    480
ctgaaagctt tcaccgttgg tggtctgaac tgctgggaaa actggatgcc gctgccgcgt    540
gctgctctgt acggtcaggg tgaaaacctg cacatcgctg tttggccggg ttctgactac    600
aacaccaaag acatcacccg tttcatcgct cgtgaatctc gttcttacgt tatctctgtt    660
tcttctctga tgcgtaccga agacttcccg aaaaccaccc cgcacctgga cgaaatcctg    720
aaaaaagctc cggacgttct gggtaacggt ggttcttgca tcgctggtcc ggacggtgaa    780
tgggttatga aaccggttct gcacaaagaa ggtctgctga tcgaaaccct ggacttctct    840
aaagttctgc aggaacgtca gaacttcgac ccggttggtc actactctcg tccggacgtt    900
acccagctgc acgttaaccg taaacgtcag tctaccgttc gtttcgacga a             951

<210> 38
<211> 317
<212> PRT
<213> Aquimarina atlantica

<400> 38
Met Lys Asp Gln Leu Leu Thr Val Ala Leu Ala Gln Ile Ser Pro Val
1               5                   10                  15
Trp Leu Asp Lys Thr Ala Thr Ile Lys Lys Ile Glu Asn Ser Ile Ala
            20                  25                  30
Glu Ala Ala Ser Lys Lys Ala Glu Leu Ile Val Phe Gly Glu Ser Leu
        35                  40                  45
Leu Pro Gly Tyr Pro Phe Trp Val Ser Leu Thr Asp Gly Ala Lys Phe
    50                  55                  60
Asp Ser Lys Ile Gln Lys Glu Ile His Ala His Tyr Ala Gln Asn Ser
65                  70                  75                  80
Ile Val Ile Glu Asn Gly Asp Leu Asp Thr Ile Cys Glu Leu Ala Ala
                85                  90                  95
Glu Cys Asn Ile Ala Ile Tyr Leu Gly Ile Ile Glu Arg Pro Ile Asp
            100                 105                 110
Arg Gly Gly His Ser Leu Tyr Ala Ser Leu Val Tyr Ile Asp Gln Lys
        115                 120                 125
Gly Glu Ile Lys Ser Val His Arg Lys Leu Gln Pro Thr Tyr Glu Glu
    130                 135                 140
Arg Leu Thr Trp Ala Pro Gly Asp Gly Asn Gly Leu Leu Val His Pro
145                 150                 155                 160
Leu Lys Ala Phe Thr Val Gly Gly Leu Asn Cys Trp Glu Asn Trp Met
                165                 170                 175
Pro Leu Pro Arg Ala Ala Leu Tyr Gly Gln Gly Glu Asn Leu His Ile
            180                 185                 190
Ala Val Trp Pro Gly Ser Asp Tyr Asn Thr Lys Asp Ile Thr Arg Phe
        195                 200                 205
Ile Ala Arg Glu Ser Arg Ser Tyr Val Ile Ser Val Ser Ser Leu Met
    210                 215                 220
Arg Thr Glu Asp Phe Pro Lys Thr Thr Pro His Leu Asp Glu Ile Leu
225                 230                 235                 240
Lys Lys Ala Pro Asp Val Leu Gly Asn Gly Gly Ser Cys Ile Ala Gly
                245                 250                 255
Pro Asp Gly Glu Trp Val Met Lys Pro Val Leu His Lys Glu Gly Leu
            260                 265                 270
Leu Ile Glu Thr Leu Asp Phe Ser Lys Val Leu Gln Glu Arg Gln Asn
        275                 280                 285
Phe Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Thr Gln Leu His
    290                 295                 300
Val Asn Arg Lys Arg Gln Ser Thr Val Arg Phe Asp Glu
305                 310                 315

<210> 39
<211> 945
<212> DNA
<213> Arthrobacter sp.

<220>
<223> Arthrobacter sp. Soil736

<400> 39
atgcgtatcg ctgctatcca ggctaccccg gttatcctgg acgctgaagc ttctgtttct     60
aaagctctgc gtctgctggg tgaagctgct ggtcagggtg ttaaactggc tgttttcccg    120
gaaaccttca tcccgctgta cccgtctggt gtttgggctt accaggctgc tcgtttcgac    180
ggtttcgacg aaatgtggac ccgtctgtgg gacaactctg ttgacgttcc gggtccgcag    240
atcgaccgtt tcatcaaagc ttgcgctgaa cacgacatct actgcgttct gggtgttaac    300
gaacgtgaat ctgctcgtcc gggttctctg tacaacacca tgatcctgct gggtccggaa    360
ggtctgctgt ggaaacaccg taaactgatg ccgaccatgc acgaacgtct gttccacggt    420
gttggttacg gtcaggacct gaacgttatc gaaaccccgg ttggtcgtgt tggtggtctg    480
atctgctggg aaaaccgtat gccgctggct cgttacgctg tttaccgtca gggtgttcag    540
atctgggctg ctccgaccgc tgacgactct gacggttgga tctctaccat gtctcacatc    600
gctatcgaat ctggtgcttt cgttgtttct gctccgcagt acatcccgcg ttctgctttc    660
ccggacgact tcccggttca gctgccggac gacggtcagg ctctgggtcg tggtggtgct    720
gctatcttcg aaccgctgca gggtcgtgct atcgctggtc cgctgtacga ccaggaaggt    780
atcgttgttg ctgacgttga cctgggtcgt tctctgaccg ctaaacgtat cttcgacgtt    840
gttggtcact actctcgtga agacgttctg tacccgccgg ctccgaccaa ccacgctccg    900
gaaggtccgg ctttctggcc gcgtacccgt ccgctgctgg gtaac                    945

<210> 40
<211> 315
<212> PRT
<213> Arthrobacter sp.

<220>
<223> Arthrobacter sp. Soil736

<400> 40
Met Arg Ile Ala Ala Ile Gln Ala Thr Pro Val Ile Leu Asp Ala Glu
1               5                   10                  15
Ala Ser Val Ser Lys Ala Leu Arg Leu Leu Gly Glu Ala Ala Gly Gln
            20                  25                  30
Gly Val Lys Leu Ala Val Phe Pro Glu Thr Phe Ile Pro Leu Tyr Pro
        35                  40                  45
Ser Gly Val Trp Ala Tyr Gln Ala Ala Arg Phe Asp Gly Phe Asp Glu
    50                  55                  60
Met Trp Thr Arg Leu Trp Asp Asn Ser Val Asp Val Pro Gly Pro Gln
65                  70                  75                  80
Ile Asp Arg Phe Ile Lys Ala Cys Ala Glu His Asp Ile Tyr Cys Val
                85                  90                  95
Leu Gly Val Asn Glu Arg Glu Ser Ala Arg Pro Gly Ser Leu Tyr Asn
            100                 105                 110
Thr Met Ile Leu Leu Gly Pro Glu Gly Leu Leu Trp Lys His Arg Lys
        115                 120                 125
Leu Met Pro Thr Met His Glu Arg Leu Phe His Gly Val Gly Tyr Gly
    130                 135                 140
Gln Asp Leu Asn Val Ile Glu Thr Pro Val Gly Arg Val Gly Gly Leu
145                 150                 155                 160
Ile Cys Trp Glu Asn Arg Met Pro Leu Ala Arg Tyr Ala Val Tyr Arg
                165                 170                 175
Gln Gly Val Gln Ile Trp Ala Ala Pro Thr Ala Asp Asp Ser Asp Gly
            180                 185                 190
Trp Ile Ser Thr Met Ser His Ile Ala Ile Glu Ser Gly Ala Phe Val
        195                 200                 205
Val Ser Ala Pro Gln Tyr Ile Pro Arg Ser Ala Phe Pro Asp Asp Phe
    210                 215                 220
Pro Val Gln Leu Pro Asp Asp Gly Gln Ala Leu Gly Arg Gly Gly Ala
225                 230                 235                 240
Ala Ile Phe Glu Pro Leu Gln Gly Arg Ala Ile Ala Gly Pro Leu Tyr
                245                 250                 255
Asp Gln Glu Gly Ile Val Val Ala Asp Val Asp Leu Gly Arg Ser Leu
            260                 265                 270
Thr Ala Lys Arg Ile Phe Asp Val Val Gly His Tyr Ser Arg Glu Asp
        275                 280                 285
Val Leu Tyr Pro Pro Ala Pro Thr Asn His Ala Pro Glu Gly Pro Ala
    290                 295                 300
Phe Trp Pro Arg Thr Arg Pro Leu Leu Gly Asn
305                 310                 315

<210> 41
<211> 1020
<212> DNA
<213> Cupriavidus basilensis

<400> 41
atgtctcaga aacgtatcgt tcgtgctgct gctgttcaga tctctccgga cctggaacac     60
ggtgaaggta ccctgggtaa agtttgcgaa gctatcgacc gtgctgctcg tgaaggtgtt    120
cagctgatcg ttttcccgga aaccttcctg ccgtactacc cgtacttctc tttcgttcgt    180
ccgccggttc agtctggttc tgaccacatg cgtctgtacg aacaggctgt tgttgttccg    240
ggtccggtta cccacgctgt ttctgaacgt gctcgtcgtc acgctatggt tgttgttctg    300
ggtgttaacg aacgtgacca cggttctctg tacaacaccc agctgatctt cgacaccgac    360
ggtcgtctgg ttctgaaacg tcgtaaaatc accccgacct tccacgaacg tatgatctgg    420
ggtcagggtg acgctgctgg tctgaaagtt gctgacaccg ctatcggtcg tgttggtgct    480
ctggcttgct gggaacacta caacccgctg gctcgttacg ctctgatgac ccagcacgaa    540
gaaatccact gctctcagtt cccgggttct ctggttggtc cggttttcgc tgaacagatc    600
gaagttacca tccgtcacca cgctctggaa tctggttgct tcgttgttaa cgctaccggt    660
tggctgaccg acgaacagat cgcttctgtt accaccgacc cggctctgca gaaagctctg    720
cgtggtggtt gcaacaccgc tatcgtttct ccggaaggtc agcacctggc tccgccgctg    780
cgtgaaggtg aaggtatggt tatcgctgac ctggacatgt ctctgatcac caaacgtaaa    840
cgtatgatgg actctgttgg tcactacgct cgtccggaac tgctgtctct ggctatcaac    900
gaccgtccgg ctgctaccgc ttctccgatg gctaccgctc tgtctaacta ccacggttct    960
acccaccacg aaccgcagcg tgacgacgct ggtctggacc tggaaccggt tgttggtaac   1020

<210> 42
<211> 340
<212> PRT
<213> Cupriavidus basilensis

<400> 42
Met Ser Gln Lys Arg Ile Val Arg Ala Ala Ala Val Gln Ile Ser Pro
1               5                   10                  15
Asp Leu Glu His Gly Glu Gly Thr Leu Gly Lys Val Cys Glu Ala Ile
            20                  25                  30
Asp Arg Ala Ala Arg Glu Gly Val Gln Leu Ile Val Phe Pro Glu Thr
        35                  40                  45
Phe Leu Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Arg Pro Pro Val Gln
    50                  55                  60
Ser Gly Ser Asp His Met Arg Leu Tyr Glu Gln Ala Val Val Val Pro
65                  70                  75                  80
Gly Pro Val Thr His Ala Val Ser Glu Arg Ala Arg Arg His Ala Met
                85                  90                  95
Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn
            100                 105                 110
Thr Gln Leu Ile Phe Asp Thr Asp Gly Arg Leu Val Leu Lys Arg Arg
        115                 120                 125
Lys Ile Thr Pro Thr Phe His Glu Arg Met Ile Trp Gly Gln Gly Asp
    130                 135                 140
Ala Ala Gly Leu Lys Val Ala Asp Thr Ala Ile Gly Arg Val Gly Ala
145                 150                 155                 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
                165                 170                 175
Thr Gln His Glu Glu Ile His Cys Ser Gln Phe Pro Gly Ser Leu Val
            180                 185                 190
Gly Pro Val Phe Ala Glu Gln Ile Glu Val Thr Ile Arg His His Ala
        195                 200                 205
Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Asp
    210                 215                 220
Glu Gln Ile Ala Ser Val Thr Thr Asp Pro Ala Leu Gln Lys Ala Leu
225                 230                 235                 240
Arg Gly Gly Cys Asn Thr Ala Ile Val Ser Pro Glu Gly Gln His Leu
                245                 250                 255
Ala Pro Pro Leu Arg Glu Gly Glu Gly Met Val Ile Ala Asp Leu Asp
            260                 265                 270
Met Ser Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
        275                 280                 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Arg Pro Ala
    290                 295                 300
Ala Thr Ala Ser Pro Met Ala Thr Ala Leu Ser Asn Tyr His Gly Ser
305                 310                 315                 320
Thr His His Glu Pro Gln Arg Asp Asp Ala Gly Leu Asp Leu Glu Pro
                325                 330                 335
Val Val Gly Asn
            340

<210> 43
<400> 43
000

<210> 44
<400> 44
000

<210> 45
<211> 1059
<212> DNA
<213> Sphingomonas wittichii

<220>
<223> Sphingomonas wittichii RW1

<400> 45
atgaacgaag gtttccagaa agttcgtgtt gctgctgctc agatctctcc ggctttcctg     60
gaccgtgaag gttctaccga aatcgcttgc cactggatcg ctgaagctgc tcgtggtggt    120
gctgaactgc tgtctttcgg tgaagcttgg ctgccggctt acccgttctg gatcttcatg    180
ggttctccga tctactctgc tcagttctct cgtcgtctgt acgaaaacgc tgttgaaatc    240
ccgtctgcta ccaccgaccg tctgtgcgaa gctgctcgta aagctggtat ccacgttgtt    300
atgggtctga ccgaactgtg gggtggttct ctgtacctgg ctcagctgtt catcaacgac    360
cgtggtgaaa tcgttggtca ccgtcgtaaa ctgaaaccga cccactggga acgtgctatc    420
tggggtgaag gtgacggttc tgacttcttc gttgttccga cctctatcgg tcgtctgggt    480
gctctgaact gctgggaaca cctgcagccg ctgaacctgt tcgctatgaa cgctttcggt    540
gaacagatcc acgttgctgc ttggccggct ttcgctatct acaaccgtgt tgacccgtct    600
ttcaccaacg aagctaacct ggctgcttct cgtgcttacg ctatggctac ccagaccttc    660
gttatccaca cctctgctgt tgttgacgac gctaccgttg aactgctgtg cgacgacgac    720
gacaaacgtc tgctgctgga atctggtggt ggtcagtgcg ctgttatcaa cccgctgggt    780
gctatcatct ctaccccgct gtcttctacc gctcagggtc tggttttcgc tgactgcgac    840
ttcggtgtta tcgcttctgc taaaatgtct aacgacccgg ctggtcacta ccagcgtggt    900
gacgttttcc aggttcactt caacccggct ccgcgtcgtc cgctggttcc gcgtgctgct    960
atcgctgctg acccgaccac cgctgcttct gaagacctgc cgaacatcaa acacccgccg   1020
ttctctccgg ctgttaaact gccgatcgtt gttgacgac                          1059

<210> 46
<211> 353
<212> PRT
<213> Sphingomonas wittichii

<220>
<223> Sphingomonas wittichii RW1

<400> 46
Met Asn Glu Gly Phe Gln Lys Val Arg Val Ala Ala Ala Gln Ile Ser
1               5                   10                  15
Pro Ala Phe Leu Asp Arg Glu Gly Ser Thr Glu Ile Ala Cys His Trp
            20                  25                  30
Ile Ala Glu Ala Ala Arg Gly Gly Ala Glu Leu Leu Ser Phe Gly Glu
        35                  40                  45
Ala Trp Leu Pro Ala Tyr Pro Phe Trp Ile Phe Met Gly Ser Pro Ile
    50                  55                  60
Tyr Ser Ala Gln Phe Ser Arg Arg Leu Tyr Glu Asn Ala Val Glu Ile
65                  70                  75                  80
Pro Ser Ala Thr Thr Asp Arg Leu Cys Glu Ala Ala Arg Lys Ala Gly
                85                  90                  95
Ile His Val Val Met Gly Leu Thr Glu Leu Trp Gly Gly Ser Leu Tyr
            100                 105                 110
Leu Ala Gln Leu Phe Ile Asn Asp Arg Gly Glu Ile Val Gly His Arg
        115                 120                 125
Arg Lys Leu Lys Pro Thr His Trp Glu Arg Ala Ile Trp Gly Glu Gly
    130                 135                 140
Asp Gly Ser Asp Phe Phe Val Val Pro Thr Ser Ile Gly Arg Leu Gly
145                 150                 155                 160
Ala Leu Asn Cys Trp Glu His Leu Gln Pro Leu Asn Leu Phe Ala Met
                165                 170                 175
Asn Ala Phe Gly Glu Gln Ile His Val Ala Ala Trp Pro Ala Phe Ala
            180                 185                 190
Ile Tyr Asn Arg Val Asp Pro Ser Phe Thr Asn Glu Ala Asn Leu Ala
        195                 200                 205
Ala Ser Arg Ala Tyr Ala Met Ala Thr Gln Thr Phe Val Ile His Thr
    210                 215                 220
Ser Ala Val Val Asp Asp Ala Thr Val Glu Leu Leu Cys Asp Asp Asp
225                 230                 235                 240
Asp Lys Arg Leu Leu Leu Glu Ser Gly Gly Gly Gln Cys Ala Val Ile
                245                 250                 255
Asn Pro Leu Gly Ala Ile Ile Ser Thr Pro Leu Ser Ser Thr Ala Gln
            260                 265                 270
Gly Leu Val Phe Ala Asp Cys Asp Phe Gly Val Ile Ala Ser Ala Lys
        275                 280                 285
Met Ser Asn Asp Pro Ala Gly His Tyr Gln Arg Gly Asp Val Phe Gln
    290                 295                 300
Val His Phe Asn Pro Ala Pro Arg Arg Pro Leu Val Pro Arg Ala Ala
305                 310                 315                 320
Ile Ala Ala Asp Pro Thr Thr Ala Ala Ser Glu Asp Leu Pro Asn Ile
                325                 330                 335
Lys His Pro Pro Phe Ser Pro Ala Val Lys Leu Pro Ile Val Val Asp
            340                 345                 350
Asp

<210> 47
<211> 951
<212> DNA
<213> Pseudomonas mandelii

<220>
<223> Pseudomonas mandelii JR-1

<400> 47
atggaaaacg ctatgaccaa agttgctatc atccagcgtc cgccggttct gctggaccgt     60
tctgctacca tcgctcgtgc tgttcagtct gttgctgaag ctgctgctgc tggtgcttct    120
ctgatcgttc tgccggaatc tttcatcccg ggttacccgt cttggatctg gcgtctggct    180
gctggtaaag acggtgctgt tatgggtcag ctgcacaccc gtctgctggc taacgctgtt    240
gacatcgcta acggtgacct gggtgaactg tgcgaagctg ctcgtgttca cgctgttacc    300
atcgtttgcg gtatcaacga atgcgaccgt tctaccggtg gtggtaccct gtacaactct    360
gttgttgtta tcggtgctga cggtgctgtt ctgaaccgtc accgtaaact gatgccgacc    420
aacccggaac gtatggttca cggtttcggt gacgcttctg gtctgcgtgc tgttgacacc    480
ccggttggtc gtgttggtgc tctgatctgc tgggaaaact acatgccgct ggctcgttac    540
tctctgtacg ctcagggtgt tgaaatctac atcgctccga cctacgacac cggtgaaggt    600
tggatctcta ccatgcgtca catcgctctg gaaggtcgtt gctgggttct gggttctggt    660
accgctctgc gtggttctga catcccggaa gacttcccgg ctcgtatgca gctgttcgct    720
gacccggacg aatggatcaa cgacggtgac tctgttgttg tttctccgca gggtcgtgtt    780
gttgctggtc cgctgcaccg tgaagctggt atcctgtacg ctgacatcga cgttgctctg    840
gttgctccgg ctcgtcgtgc tctggacgtt accggtcact acgctcgtcc ggacatcttc    900
gaactgcacg ttcgtcgttc tccggctatc ccggttcact acatcgacga a             951

<210> 48
<211> 317
<212> PRT
<213> Pseudomonas mandelii

<220>
<223> Pseudomonas mandelii JR-1

<400> 48
Met Glu Asn Ala Met Thr Lys Val Ala Ile Ile Gln Arg Pro Pro Val
1               5                   10                  15
Leu Leu Asp Arg Ser Ala Thr Ile Ala Arg Ala Val Gln Ser Val Ala
            20                  25                  30
Glu Ala Ala Ala Ala Gly Ala Ser Leu Ile Val Leu Pro Glu Ser Phe
        35                  40                  45
Ile Pro Gly Tyr Pro Ser Trp Ile Trp Arg Leu Ala Ala Gly Lys Asp
    50                  55                  60
Gly Ala Val Met Gly Gln Leu His Thr Arg Leu Leu Ala Asn Ala Val
65                  70                  75                  80
Asp Ile Ala Asn Gly Asp Leu Gly Glu Leu Cys Glu Ala Ala Arg Val
                85                  90                  95
His Ala Val Thr Ile Val Cys Gly Ile Asn Glu Cys Asp Arg Ser Thr
            100                 105                 110
Gly Gly Gly Thr Leu Tyr Asn Ser Val Val Val Ile Gly Ala Asp Gly
        115                 120                 125
Ala Val Leu Asn Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg
    130                 135                 140
Met Val His Gly Phe Gly Asp Ala Ser Gly Leu Arg Ala Val Asp Thr
145                 150                 155                 160
Pro Val Gly Arg Val Gly Ala Leu Ile Cys Trp Glu Asn Tyr Met Pro
                165                 170                 175
Leu Ala Arg Tyr Ser Leu Tyr Ala Gln Gly Val Glu Ile Tyr Ile Ala
            180                 185                 190
Pro Thr Tyr Asp Thr Gly Glu Gly Trp Ile Ser Thr Met Arg His Ile
        195                 200                 205
Ala Leu Glu Gly Arg Cys Trp Val Leu Gly Ser Gly Thr Ala Leu Arg
    210                 215                 220
Gly Ser Asp Ile Pro Glu Asp Phe Pro Ala Arg Met Gln Leu Phe Ala
225                 230                 235                 240
Asp Pro Asp Glu Trp Ile Asn Asp Gly Asp Ser Val Val Val Ser Pro
                245                 250                 255
Gln Gly Arg Val Val Ala Gly Pro Leu His Arg Glu Ala Gly Ile Leu
            260                 265                 270
Tyr Ala Asp Ile Asp Val Ala Leu Val Ala Pro Ala Arg Arg Ala Leu
        275                 280                 285
Asp Val Thr Gly His Tyr Ala Arg Pro Asp Ile Phe Glu Leu His Val
    290                 295                 300
Arg Arg Ser Pro Ala Ile Pro Val His Tyr Ile Asp Glu
305                 310                 315

<210> 49
<400> 49
000

<210> 50
<400> 50
000

<210> 51
<211> 1017
<212> DNA
<213> Arabidopsis thaliana

<400> 51
atgtctacct ctgaaaacac cccgttcaac ggtgttgctt cttctaccat cgttcgtgct     60
accatcgttc aggcttctac cgtttacaac gacaccccgg ctaccctgga aaaagctaac    120
aaattcatcg ttgaagctgc ttctaaaggt tctgaactgg ttgttttccc ggaagctttc    180
atcggtggtt acccgcgtgg tttccgtttc ggtctgggtg ttggtgttca caacgaagaa    240
ggtcgtgacg aattccgtaa ataccacgct tctgctatca aagttccggg tccggaagtt    300
gaaaaactgg ctgaactggc tggtaaaaac aacgtttacc tggttatggg tgctatcgaa    360
aaagacggtt acaccctgta ctgcaccgct ctgttcttct ctccgcaggg tcagttcctg    420
ggtaaacacc gtaaactgat gccgacctct ctggaacgtt gcatctgggg tcagggtgac    480
ggttctacca tcccggttta cgacaccccg atcggtaaac tgggtgctgc tatctgctgg    540
gaaaaccgta tgccgctgta ccgtaccgct ctgtacgcta aaggtatcga actgtactgc    600
gctccgaccg ctgacggttc taaagaatgg cagtcttcta tgctgcacat cgctatcgaa    660
ggtggttgct tcgttctgtc tgcttgccag ttctgcctgc gtaaagactt cccggaccac    720
ccggactacc tgttcaccga ctggtacgac gacaaagaac cggactctat cgtttctcag    780
ggtggttctg ttatcatctc tccgctgggt caggttctgg ctggtccgaa cttcgaatct    840
gaaggtctga tcaccgctga cctggacctg ggtgacgttg ctcgtgctaa actgtacttc    900
gactctgttg gtcactactc tcgtccggac gttctgcacc tgaccgttaa cgaacacccg    960
aaaaaaccgg ttaccttcat ctctaaagtt gaaaaagctg aagacgactc taacaaa      1017

<210> 52
<211> 339
<212> PRT
<213> Arabidopsis thaliana

<400> 52
Met Ser Thr Ser Glu Asn Thr Pro Phe Asn Gly Val Ala Ser Ser Thr
1               5                   10                  15
Ile Val Arg Ala Thr Ile Val Gln Ala Ser Thr Val Tyr Asn Asp Thr
            20                  25                  30
Pro Ala Thr Leu Glu Lys Ala Asn Lys Phe Ile Val Glu Ala Ala Ser
        35                  40                  45
Lys Gly Ser Glu Leu Val Val Phe Pro Glu Ala Phe Ile Gly Gly Tyr
    50                  55                  60
Pro Arg Gly Phe Arg Phe Gly Leu Gly Val Gly Val His Asn Glu Glu
65                  70                  75                  80
Gly Arg Asp Glu Phe Arg Lys Tyr His Ala Ser Ala Ile Lys Val Pro
                85                  90                  95
Gly Pro Glu Val Glu Lys Leu Ala Glu Leu Ala Gly Lys Asn Asn Val
            100                 105                 110
Tyr Leu Val Met Gly Ala Ile Glu Lys Asp Gly Tyr Thr Leu Tyr Cys
        115                 120                 125
Thr Ala Leu Phe Phe Ser Pro Gln Gly Gln Phe Leu Gly Lys His Arg
    130                 135                 140
Lys Leu Met Pro Thr Ser Leu Glu Arg Cys Ile Trp Gly Gln Gly Asp
145                 150                 155                 160
Gly Ser Thr Ile Pro Val Tyr Asp Thr Pro Ile Gly Lys Leu Gly Ala
                165                 170                 175
Ala Ile Cys Trp Glu Asn Arg Met Pro Leu Tyr Arg Thr Ala Leu Tyr
            180                 185                 190
Ala Lys Gly Ile Glu Leu Tyr Cys Ala Pro Thr Ala Asp Gly Ser Lys
        195                 200                 205
Glu Trp Gln Ser Ser Met Leu His Ile Ala Ile Glu Gly Gly Cys Phe
    210                 215                 220
Val Leu Ser Ala Cys Gln Phe Cys Leu Arg Lys Asp Phe Pro Asp His
225                 230                 235                 240
Pro Asp Tyr Leu Phe Thr Asp Trp Tyr Asp Asp Lys Glu Pro Asp Ser
                245                 250                 255
Ile Val Ser Gln Gly Gly Ser Val Ile Ile Ser Pro Leu Gly Gln Val
            260                 265                 270
Leu Ala Gly Pro Asn Phe Glu Ser Glu Gly Leu Ile Thr Ala Asp Leu
        275                 280                 285
Asp Leu Gly Asp Val Ala Arg Ala Lys Leu Tyr Phe Asp Ser Val Gly
    290                 295                 300
His Tyr Ser Arg Pro Asp Val Leu His Leu Thr Val Asn Glu His Pro
305                 310                 315                 320
Lys Lys Pro Val Thr Phe Ile Ser Lys Val Glu Lys Ala Glu Asp Asp
                325                 330                 335
Ser Asn Lys

<210> 53
<211> 1029
<212> DNA
<213> Brassica oleracea

<400> 53
atgtctaccc cgaaaaacac cacccaggct aacggtgact cttcttcttc tatcgttcgt     60
gctaccatcg ttcaggcttc taccgtttac aacgacaccc cgaaaaccat cgaaaaagct    120
gaaaaactga tcgctgaagc tgcttctaac ggttctgaac tggttgtttt cccggaaggt    180
ttcatcggtg gttacccgcg tggtttccgt ttcggtatcg ctgttggtat ccacaacgaa    240
gacggtcgtg acgacttccg taaataccac gactctgcta tccacgttcc gggtccggaa    300
gttgacaaac tggctgaact ggctcgtaaa aacaacgttt acctggttat gggtgctatc    360
gaaaaagacg gttacaccct gtactgcacc gctctgttct tcaactctga aggtcgttac    420
ctgggtaaac accgtaaagt tatgccgacc tctctggaac gttgcatctg gggtttcggt    480
gacggttcta ccatcccggt ttacgacacc ccgatcggta aactgggtgc tgctatctgc    540
tgggaaaacc gtatgccgct gtaccgtacc gctctgtacg gtaaaggtgt tgaactgtac    600
tgcgctccga ccgctgacgg ttctaaagaa tggcagtctt ctatgatgca catcgctatg    660
gaaggtggtt gcttcgttct gtctgcttgc cagttctgcc agcgtaaaga cttcccggct    720
cacgttgacc acctgttcac cgactggtac gacgaccagc acgacgaagc tatcgtttct    780
cagggtggtt ctgttatcat ctctccgctg ggtaaagttc tggctggtcc gaacttcgaa    840
tctgaaggtc tgatcaccgc tgacctggac ctgggtgaca tcgctcgtgc taaactgtac    900
ttcgacgttg ttggtcacta ctctaaaccg gacgttttca acctgaccgt taacgaacac    960
ccgaaaaaac cggttacctt cgtttctaaa accgttaaag ctgaagacgg ttctgaatct   1020
aaagaaaaa                                                           1029

<210> 54
<211> 343
<212> PRT
<213> Brassica oleracea

<400> 54
Met Ser Thr Pro Lys Asn Thr Thr Gln Ala Asn Gly Asp Ser Ser Ser
1               5                   10                  15
Ser Ile Val Arg Ala Thr Ile Val Gln Ala Ser Thr Val Tyr Asn Asp
            20                  25                  30
Thr Pro Lys Thr Ile Glu Lys Ala Glu Lys Leu Ile Ala Glu Ala Ala
        35                  40                  45
Ser Asn Gly Ser Glu Leu Val Val Phe Pro Glu Gly Phe Ile Gly Gly
    50                  55                  60
Tyr Pro Arg Gly Phe Arg Phe Gly Ile Ala Val Gly Ile His Asn Glu
65                  70                  75                  80
Asp Gly Arg Asp Asp Phe Arg Lys Tyr His Asp Ser Ala Ile His Val
                85                  90                  95
Pro Gly Pro Glu Val Asp Lys Leu Ala Glu Leu Ala Arg Lys Asn Asn
            100                 105                 110
Val Tyr Leu Val Met Gly Ala Ile Glu Lys Asp Gly Tyr Thr Leu Tyr
        115                 120                 125
Cys Thr Ala Leu Phe Phe Asn Ser Glu Gly Arg Tyr Leu Gly Lys His
    130                 135                 140
Arg Lys Val Met Pro Thr Ser Leu Glu Arg Cys Ile Trp Gly Phe Gly
145                 150                 155                 160
Asp Gly Ser Thr Ile Pro Val Tyr Asp Thr Pro Ile Gly Lys Leu Gly
                165                 170                 175
Ala Ala Ile Cys Trp Glu Asn Arg Met Pro Leu Tyr Arg Thr Ala Leu
            180                 185                 190
Tyr Gly Lys Gly Val Glu Leu Tyr Cys Ala Pro Thr Ala Asp Gly Ser
        195                 200                 205
Lys Glu Trp Gln Ser Ser Met Met His Ile Ala Met Glu Gly Gly Cys
    210                 215                 220
Phe Val Leu Ser Ala Cys Gln Phe Cys Gln Arg Lys Asp Phe Pro Ala
225                 230                 235                 240
His Val Asp His Leu Phe Thr Asp Trp Tyr Asp Asp Gln His Asp Glu
                245                 250                 255
Ala Ile Val Ser Gln Gly Gly Ser Val Ile Ile Ser Pro Leu Gly Lys
            260                 265                 270
Val Leu Ala Gly Pro Asn Phe Glu Ser Glu Gly Leu Ile Thr Ala Asp
        275                 280                 285
Leu Asp Leu Gly Asp Ile Ala Arg Ala Lys Leu Tyr Phe Asp Val Val
    290                 295                 300
Gly His Tyr Ser Lys Pro Asp Val Phe Asn Leu Thr Val Asn Glu His
305                 310                 315                 320
Pro Lys Lys Pro Val Thr Phe Val Ser Lys Thr Val Lys Ala Glu Asp
                325                 330                 335
Gly Ser Glu Ser Lys Glu Lys
            340

<210> 55
<211> 1035
<212> DNA
<213> Salinisphaera shabanensis

<220>
<223> Salinisphaera shabanensis E1L3A

<400> 55
atgacccagt ctcagatcgt taaagttgct gctgttcagc tgcagccggt tctggactct     60
gctgacggta ccgttgaacg tgttctggac gaaatcgctg ctgctgctgc tgacggtgct    120
cagctggttg ttttcccgga aaccgctgtt ccgtactacc cgtactggtc tttcgttatg    180
gctccgatgg acatgggtgc tcgtcaccgt gctctgtacg accactctcc gaccgttccg    240
ggtccggtta ccgacgctgt tgctgctgct gctcgtaccc acgaaatcgt tgttgttctg    300
ggtgttaacg aacgtgacca cggtaccctg tacaactgcc agctggtttt cgacggtaac    360
ggtgaaatcg ctctgaaacg tcgtaaaatc accccgacct accacgaacg tatggtttgg    420
ggtcagggtg acggttctgg tctgcacgct gttgacaccg ctgttggtcg tgttggtgct    480
ctggcttgct gggaacacta caacccgctg gctcgttacg ctctgatggc tgaccacgaa    540
cagatccact gctctcagtt cccgggttct ctggttggtc cgatcttcgc tgaacagcag    600
gaagttaccc tgcgtcacca cgctctggaa tctggttgct tcgttgttaa cgctaccgct    660
tggctggacg ctgaccaggt tgcttctgtt accgaagacc cggctctgca gaaaggtctg    720
ttcggtggtt gctacaccgc tatcatcgct ccggacggtt ctcacgttgt tgctccgctg    780
ctggacggtc cgggtcgtct ggttgctgac atcgacctgt ctctgatcac caaacgtaaa    840
cgtatgatgg actctgttgg tcactacgct cgtccggaac tgctgtctct gcgtatcgac    900
cgtcgttctc acgctgctca gcacgctgac gctgctccgg gtgttggtgc tgtttctgaa    960
ttcgaagaac cggaccacgg tgaaccggaa ccgtacgctg cttaccgtga cgctatcgct   1020
cgttcttcta ccggt                                                    1035

<210> 56
<211> 345
<212> PRT
<213> Salinisphaera shabanensis

<220>
<223> Salinisphaera shabanensis E1L3A

<400> 56
Met Thr Gln Ser Gln Ile Val Lys Val Ala Ala Val Gln Leu Gln Pro
1               5                   10                  15
Val Leu Asp Ser Ala Asp Gly Thr Val Glu Arg Val Leu Asp Glu Ile
            20                  25                  30
Ala Ala Ala Ala Ala Asp Gly Ala Gln Leu Val Val Phe Pro Glu Thr
        35                  40                  45
Ala Val Pro Tyr Tyr Pro Tyr Trp Ser Phe Val Met Ala Pro Met Asp
    50                  55                  60
Met Gly Ala Arg His Arg Ala Leu Tyr Asp His Ser Pro Thr Val Pro
65                  70                  75                  80
Gly Pro Val Thr Asp Ala Val Ala Ala Ala Ala Arg Thr His Glu Ile
                85                  90                  95
Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn
            100                 105                 110
Cys Gln Leu Val Phe Asp Gly Asn Gly Glu Ile Ala Leu Lys Arg Arg
        115                 120                 125
Lys Ile Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp
    130                 135                 140
Gly Ser Gly Leu His Ala Val Asp Thr Ala Val Gly Arg Val Gly Ala
145                 150                 155                 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
                165                 170                 175
Ala Asp His Glu Gln Ile His Cys Ser Gln Phe Pro Gly Ser Leu Val
            180                 185                 190
Gly Pro Ile Phe Ala Glu Gln Gln Glu Val Thr Leu Arg His His Ala
        195                 200                 205
Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala
    210                 215                 220
Asp Gln Val Ala Ser Val Thr Glu Asp Pro Ala Leu Gln Lys Gly Leu
225                 230                 235                 240
Phe Gly Gly Cys Tyr Thr Ala Ile Ile Ala Pro Asp Gly Ser His Val
                245                 250                 255
Val Ala Pro Leu Leu Asp Gly Pro Gly Arg Leu Val Ala Asp Ile Asp
            260                 265                 270
Leu Ser Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
        275                 280                 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Arg Ile Asp Arg Arg Ser His
    290                 295                 300
Ala Ala Gln His Ala Asp Ala Ala Pro Gly Val Gly Ala Val Ser Glu
305                 310                 315                 320
Phe Glu Glu Pro Asp His Gly Glu Pro Glu Pro Tyr Ala Ala Tyr Arg
                325                 330                 335
Asp Ala Ile Ala Arg Ser Ser Thr Gly
            340                 345

<210> 57
<400> 57
000

<210> 58
<400> 58
000

<210> 59
<211> 954
<212> DNA
<213> Smithella sp.

<220>
<223> Smithella sp. SDB

<400> 59
atgaaaaacc agaccaaagt tgctgctatc cagctggcta ccaaaatcgg tgactctaac     60
accaacatcg ctggttgcga acgtctggct ctgatggcta tcaaaaacgg tgctcgttgg    120
atcgctctgc cggaattctt caccaccggt gtttcttgga aaccggaaat cgcttcttct    180
atccagaccg ttgacggtgc tgctgcttct ttcatgtgcg acttctctgc taaacaccag    240
gttgttctgg gtggttcttt cctgtgccgt ctgtctgacg gttctgttcg taaccgttac    300
cagtgctacg ctaacggttc tctgatcggt cagcacgaca aagacctgcc gaccatgtgg    360
gaaaactact tctacgaagg tggtgacccg atggactctg gtgttctggg tacctacaac    420
aacatccgta tcggtgctgc tgtttgctgg gaattcatgc gtaccatgac cgctcgtcgt    480
ctgcgtaaca aagttgacgt tatcatcggt ggttcttgct ggtggtctat cccgaccaac    540
ttcccggttt tcctgcagaa actgtgggaa ccggctaacc actactgctc tctggctgct    600
atccaggact ctgctcgtct gatcggtgct ccggttatcc acgctgctca ctgcggtgaa    660
atcgaatgcc cgatgccggg tctgccgatc aaataccgtg gttacttcga aggtaacgct    720
tctatcgttg acgcttctgg taaagttctg gctcagcgtt ctgctgaaca gggtgaaggt    780
atcgtttgcg ctgacatcct gctggaagct cagccgacca tcgaagctat cccggaccgt    840
ttctggctgc gttctcgtgg tttcctgccg accttcgctt ggcaccacca gcgttggctg    900
ggtcgtcgtt ggtacaaacg taacgttcgt cagaaaaaaa acgaactgca ccac          954

<210> 60
<211> 318
<212> PRT
<213> Smithella sp.

<220>
<223> Smithella sp. SDB

<400> 60
Met Lys Asn Gln Thr Lys Val Ala Ala Ile Gln Leu Ala Thr Lys Ile
1               5                   10                  15
Gly Asp Ser Asn Thr Asn Ile Ala Gly Cys Glu Arg Leu Ala Leu Met
            20                  25                  30
Ala Ile Lys Asn Gly Ala Arg Trp Ile Ala Leu Pro Glu Phe Phe Thr
        35                  40                  45
Thr Gly Val Ser Trp Lys Pro Glu Ile Ala Ser Ser Ile Gln Thr Val
    50                  55                  60
Asp Gly Ala Ala Ala Ser Phe Met Cys Asp Phe Ser Ala Lys His Gln
65                  70                  75                  80
Val Val Leu Gly Gly Ser Phe Leu Cys Arg Leu Ser Asp Gly Ser Val
                85                  90                  95
Arg Asn Arg Tyr Gln Cys Tyr Ala Asn Gly Ser Leu Ile Gly Gln His
            100                 105                 110
Asp Lys Asp Leu Pro Thr Met Trp Glu Asn Tyr Phe Tyr Glu Gly Gly
        115                 120                 125
Asp Pro Met Asp Ser Gly Val Leu Gly Thr Tyr Asn Asn Ile Arg Ile
    130                 135                 140
Gly Ala Ala Val Cys Trp Glu Phe Met Arg Thr Met Thr Ala Arg Arg
145                 150                 155                 160
Leu Arg Asn Lys Val Asp Val Ile Ile Gly Gly Ser Cys Trp Trp Ser
                165                 170                 175
Ile Pro Thr Asn Phe Pro Val Phe Leu Gln Lys Leu Trp Glu Pro Ala
            180                 185                 190
Asn His Tyr Cys Ser Leu Ala Ala Ile Gln Asp Ser Ala Arg Leu Ile
        195                 200                 205
Gly Ala Pro Val Ile His Ala Ala His Cys Gly Glu Ile Glu Cys Pro
    210                 215                 220
Met Pro Gly Leu Pro Ile Lys Tyr Arg Gly Tyr Phe Glu Gly Asn Ala
225                 230                 235                 240
Ser Ile Val Asp Ala Ser Gly Lys Val Leu Ala Gln Arg Ser Ala Glu
                245                 250                 255
Gln Gly Glu Gly Ile Val Cys Ala Asp Ile Leu Leu Glu Ala Gln Pro
            260                 265                 270
Thr Ile Glu Ala Ile Pro Asp Arg Phe Trp Leu Arg Ser Arg Gly Phe
        275                 280                 285
Leu Pro Thr Phe Ala Trp His His Gln Arg Trp Leu Gly Arg Arg Trp
    290                 295                 300
Tyr Lys Arg Asn Val Arg Gln Lys Lys Asn Glu Leu His His
305                 310                 315

<210> 61
<211> 963
<212> DNA
<213> Bradyrhizobium diazoefficiens

<400> 61
atgatggata gtaaccgccc gaatacctat aaagcagccg tggtgcaggc agccagcgat     60
ccgaccagca gcctggttag tgcacagaaa gccgcagccc tgattgaaaa agccgccggt    120
gcaggtgcac gtctggttgt gtttccggaa gcctttattg gtggttatcc gaaaggtaat    180
agctttggtg ccccggtggg catgcgtaaa ccggaaggtc gtgaagcatt tcgtctgtat    240
tgggaagcag caattgatct ggatggcgtt gaagtggaaa ccattgccgc agcagcagca    300
gcgaccggtg cctttaccgt tattggctgt attgaacgtg aacagggcac cctgtattgc    360
accgcactgt ttttcgatgg cgcccgtggt ctggttggta aacatcgtaa actgatgccg    420
accgccggcg aacgcctgat ttggggcttt ggtgacggta gcaccatgcc ggtgtttgaa    480
accagtctgg gtaatattgg cgcagttatt tgctgggaaa attatatgcc gatgctgcgc    540
atgcacatgt atagtcaggg cattagtatc tattgtgccc cgaccgcaga tgatcgtgat    600
acctggctgc cgaccatgca gcatattgca ctggaaggcc gctgctttgt tctgaccgcc    660
tgccagcatc tgaaacgtgg cgcatttccg gccgattatg aatgcgcact gggcgcagat    720
ccggaaaccg tgctgatgcg cggtggtagt gcaattgtga atccgctggg taaagttctg    780
gccggcccgt gctttgaagg cgaaaccatt ctgtatgcag atattgcact ggatgaagtt    840
acccgtggta aatttgattt tgatgcagca ggccattata gtcgtccgga tgtgtttcag    900
ctggttgtgg atgatcgtcc gaaacgcgcc gttagcaccg tgagcgccgt gcgtgcccgc    960
aat                                                                  963

<210> 62
<211> 321
<212> PRT
<213> Bradyrhizobium diazoefficiens

<400> 62
Met Met Asp Ser Asn Arg Pro Asn Thr Tyr Lys Ala Ala Val Val Gln
1               5                   10                  15
Ala Ala Ser Asp Pro Thr Ser Ser Leu Val Ser Ala Gln Lys Ala Ala
            20                  25                  30
Ala Leu Ile Glu Lys Ala Ala Gly Ala Gly Ala Arg Leu Val Val Phe
        35                  40                  45
Pro Glu Ala Phe Ile Gly Gly Tyr Pro Lys Gly Asn Ser Phe Gly Ala
    50                  55                  60
Pro Val Gly Met Arg Lys Pro Glu Gly Arg Glu Ala Phe Arg Leu Tyr
65                  70                  75                  80
Trp Glu Ala Ala Ile Asp Leu Asp Gly Val Glu Val Glu Thr Ile Ala
                85                  90                  95
Ala Ala Ala Ala Ala Thr Gly Ala Phe Thr Val Ile Gly Cys Ile Glu
            100                 105                 110
Arg Glu Gln Gly Thr Leu Tyr Cys Thr Ala Leu Phe Phe Asp Gly Ala
        115                 120                 125
Arg Gly Leu Val Gly Lys His Arg Lys Leu Met Pro Thr Ala Gly Glu
    130                 135                 140
Arg Leu Ile Trp Gly Phe Gly Asp Gly Ser Thr Met Pro Val Phe Glu
145                 150                 155                 160
Thr Ser Leu Gly Asn Ile Gly Ala Val Ile Cys Trp Glu Asn Tyr Met
                165                 170                 175
Pro Met Leu Arg Met His Met Tyr Ser Gln Gly Ile Ser Ile Tyr Cys
            180                 185                 190
Ala Pro Thr Ala Asp Asp Arg Asp Thr Trp Leu Pro Thr Met Gln His
        195                 200                 205
Ile Ala Leu Glu Gly Arg Cys Phe Val Leu Thr Ala Cys Gln His Leu
    210                 215                 220
Lys Arg Gly Ala Phe Pro Ala Asp Tyr Glu Cys Ala Leu Gly Ala Asp
225                 230                 235                 240
Pro Glu Thr Val Leu Met Arg Gly Gly Ser Ala Ile Val Asn Pro Leu
                245                 250                 255
Gly Lys Val Leu Ala Gly Pro Cys Phe Glu Gly Glu Thr Ile Leu Tyr
            260                 265                 270
Ala Asp Ile Ala Leu Asp Glu Val Thr Arg Gly Lys Phe Asp Phe Asp
        275                 280                 285
Ala Ala Gly His Tyr Ser Arg Pro Asp Val Phe Gln Leu Val Val Asp
    290                 295                 300
Asp Arg Pro Lys Arg Ala Val Ser Thr Val Ser Ala Val Arg Ala Arg
305                 310                 315                 320
Asn

<210> 63
<211> 933
<212> DNA
<213> Actinobacteria bacterium

<220>
<223> Actinobacteria bacterium RBG_13_55_18

<400> 63
atgaaagttg ctgctgttca gatcaaagct aaactggctt gcgttgaaga aaacctggaa     60
cgtgctgaaa aactgctgga caaagctttc ggtcagggtt gcgaaatggt tatcctgccg    120
gaattcttca cctctgctgt tgctttccac ccggacatgc tgaccgctgc tctgccgttc    180
gaaggtccgg ctctgggtct gctgcgtgac gctgctaaac gttacggtgg ttacgctggt    240
ggttctttca tcgcttctcg tgaaggtaac aactacaaca ccttcgttct ggctttcccg    300
gacggtggtt acgttaccca caacaaagac cagccgacca tgtgggaaaa ctgctactac    360
atcggtggta acgacgaagg tatcatggaa accccgctgg gtccggttgg ttctgctctg    420
tgctgggaaa tggttcgtac ccgtaccgtt cgtcgtctgc gtggtcgtat cggtctggct    480
gttggtggtt cttgctggtg ggacgttccg gaccgtctgc tgccgctgcc gggtaaaaaa    540
tctgctaaac gtcgtaacct ggctatcatg aacgaaaccc cggttcgtct ggctaaaatg    600
ctgggtgttc cggttgttca cgctgctcac gctgaagctt tcgaatgccg tatgccgctg    660
gttccgggta tcccgtaccg ttctcacttc ctgggtgaca ccatgatcgt tgacgctgac    720
ggttctgttc tggctcaccg ttctcgtgaa gaaggtgaag gtctggctat cgctgacgtt    780
cgtgttggtg gtatcgaacc gtctgaagac ccgccggacc gtttctggat cccggaactg    840
ccgctgctga tccgtttcgc ttgggcttac cagaacctgc acggtcgtct gtactaccgt    900
cgtgctctgc gtaccggtcg tatccagatc aaa                                 933

<210> 64
<211> 311
<212> PRT
<213> Actinobacteria bacterium

<220>
<223> Actinobacteria bacterium RBG_13_55_18

<400> 64
Met Lys Val Ala Ala Val Gln Ile Lys Ala Lys Leu Ala Cys Val Glu
1               5                   10                  15
Glu Asn Leu Glu Arg Ala Glu Lys Leu Leu Asp Lys Ala Phe Gly Gln
            20                  25                  30
Gly Cys Glu Met Val Ile Leu Pro Glu Phe Phe Thr Ser Ala Val Ala
        35                  40                  45
Phe His Pro Asp Met Leu Thr Ala Ala Leu Pro Phe Glu Gly Pro Ala
    50                  55                  60
Leu Gly Leu Leu Arg Asp Ala Ala Lys Arg Tyr Gly Gly Tyr Ala Gly
65                  70                  75                  80
Gly Ser Phe Ile Ala Ser Arg Glu Gly Asn Asn Tyr Asn Thr Phe Val
                85                  90                  95
Leu Ala Phe Pro Asp Gly Gly Tyr Val Thr His Asn Lys Asp Gln Pro
            100                 105                 110
Thr Met Trp Glu Asn Cys Tyr Tyr Ile Gly Gly Asn Asp Glu Gly Ile
        115                 120                 125
Met Glu Thr Pro Leu Gly Pro Val Gly Ser Ala Leu Cys Trp Glu Met
    130                 135                 140
Val Arg Thr Arg Thr Val Arg Arg Leu Arg Gly Arg Ile Gly Leu Ala
145                 150                 155                 160
Val Gly Gly Ser Cys Trp Trp Asp Val Pro Asp Arg Leu Leu Pro Leu
                165                 170                 175
Pro Gly Lys Lys Ser Ala Lys Arg Arg Asn Leu Ala Ile Met Asn Glu
            180                 185                 190
Thr Pro Val Arg Leu Ala Lys Met Leu Gly Val Pro Val Val His Ala
        195                 200                 205
Ala His Ala Glu Ala Phe Glu Cys Arg Met Pro Leu Val Pro Gly Ile
    210                 215                 220
Pro Tyr Arg Ser His Phe Leu Gly Asp Thr Met Ile Val Asp Ala Asp
225                 230                 235                 240
Gly Ser Val Leu Ala His Arg Ser Arg Glu Glu Gly Glu Gly Leu Ala
                245                 250                 255
Ile Ala Asp Val Arg Val Gly Gly Ile Glu Pro Ser Glu Asp Pro Pro
            260                 265                 270
Asp Arg Phe Trp Ile Pro Glu Leu Pro Leu Leu Ile Arg Phe Ala Trp
        275                 280                 285
Ala Tyr Gln Asn Leu His Gly Arg Leu Tyr Tyr Arg Arg Ala Leu Arg
    290                 295                 300
Thr Gly Arg Ile Gln Ile Lys
305                 310

<210> 65
<211> 1062
<212> DNA
<213> Rhizobium sp.

<220>
<223> Rhizobium sp. YK2

<400> 65
atggaaaaca aatctatcgt tcgtgctgct gctgttcaga tcgctccgga cctgacctct     60
cgtgaaaaaa ccctggctcg tgttctggaa gctatccacg aagctgctgg taaaggtgct    120
gaactggctg ttttcccgga aaccttcgtt ccgtggtacc cgtacttctc tttcgttctg    180
ccgccggttc tgtctggtaa agaacacgtt cgtctgtacg acgaagctgt taccgttccg    240
tctgctgcta ccgaagctat cgctaccgct gctcgtaacc acggtatcgt tgttgttctg    300
ggtgttaacg aacgtgacca cggttctctg tacaacaccc agctggtttt caacgctgac    360
ggtaccctga tcctgaaacg tcgtaaaatc accccgacct tccacgaacg tatgatctgg    420
ggtcagggtg acgcttctgg tctgaccgtt gttgaatctc acgttggtcg tatcggtgct    480
ctggcttgct gggaacacta caacccgctg gctcgttacg ctctgatggc tcagcacgaa    540
gaaatccacg ttgctcagtt cccgggttct atggttggtc cgatcttcgc tgaacagatc    600
gaagttacca tccgtcacca cgctctggaa tctggttgct tcgttgttaa cgctaccggt    660
tggctgaccg acgaacagat cgcttctatc accccggacc agaacctgca gaaagctctg    720
cgtggtggtt gcatgaccgc tatcatctct ccggaaggta aacacctggc tccgccgctg    780
accgaaggtg aaggtatcct gatcgctgac ctggacatgt ctctgatcac caaacgtaaa    840
cgtatgatgg actctgttgg tcactacgct cgtccggaac tgctgcacct ggttatcgac    900
ggtcgtgcta ccgctccgat ggttgcttct gaatcttctt tcgaaaaccg taacccgtct    960
cagaccgctt ctccgcgttc taactctgac ggtcaccacg acaacgcttc ttctgaccgt   1020
gacccggacc agcgtgttgc tgttctgcgt tctcaggctt ct                      1062

<210> 66
<211> 354
<212> PRT
<213> Rhizobium sp.

<220>
<223> Rhizobium sp. YK2

<400> 66
Met Glu Asn Lys Ser Ile Val Arg Ala Ala Ala Val Gln Ile Ala Pro
1               5                   10                  15
Asp Leu Thr Ser Arg Glu Lys Thr Leu Ala Arg Val Leu Glu Ala Ile
            20                  25                  30
His Glu Ala Ala Gly Lys Gly Ala Glu Leu Ala Val Phe Pro Glu Thr
        35                  40                  45
Phe Val Pro Trp Tyr Pro Tyr Phe Ser Phe Val Leu Pro Pro Val Leu
    50                  55                  60
Ser Gly Lys Glu His Val Arg Leu Tyr Asp Glu Ala Val Thr Val Pro
65                  70                  75                  80
Ser Ala Ala Thr Glu Ala Ile Ala Thr Ala Ala Arg Asn His Gly Ile
                85                  90                  95
Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn
            100                 105                 110
Thr Gln Leu Val Phe Asn Ala Asp Gly Thr Leu Ile Leu Lys Arg Arg
        115                 120                 125
Lys Ile Thr Pro Thr Phe His Glu Arg Met Ile Trp Gly Gln Gly Asp
    130                 135                 140
Ala Ser Gly Leu Thr Val Val Glu Ser His Val Gly Arg Ile Gly Ala
145                 150                 155                 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
                165                 170                 175
Ala Gln His Glu Glu Ile His Val Ala Gln Phe Pro Gly Ser Met Val
            180                 185                 190
Gly Pro Ile Phe Ala Glu Gln Ile Glu Val Thr Ile Arg His His Ala
        195                 200                 205
Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Asp
    210                 215                 220
Glu Gln Ile Ala Ser Ile Thr Pro Asp Gln Asn Leu Gln Lys Ala Leu
225                 230                 235                 240
Arg Gly Gly Cys Met Thr Ala Ile Ile Ser Pro Glu Gly Lys His Leu
                245                 250                 255
Ala Pro Pro Leu Thr Glu Gly Glu Gly Ile Leu Ile Ala Asp Leu Asp
            260                 265                 270
Met Ser Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
        275                 280                 285
Tyr Ala Arg Pro Glu Leu Leu His Leu Val Ile Asp Gly Arg Ala Thr
    290                 295                 300
Ala Pro Met Val Ala Ser Glu Ser Ser Phe Glu Asn Arg Asn Pro Ser
305                 310                 315                 320
Gln Thr Ala Ser Pro Arg Ser Asn Ser Asp Gly His His Asp Asn Ala
                325                 330                 335
Ser Ser Asp Arg Asp Pro Asp Gln Arg Val Ala Val Leu Arg Ser Gln
            340                 345                 350
Ala Ser

<210> 67
<211> 951
<212> DNA
<213> bacterium YEK0313

<400> 67
atgtctgttg ttcgttacaa agctgctgtt gctcaggctg cttcttgccc ggacgacgct     60
atggcttctg ctaccaaagc tgctcgtctg atcgaagaag ctgctggtgc tggtgctcgt    120
ctgatcgttt tcccggaagc tttcctgggt ggttacccga aaggtgcttc tttcggtgct    180
ccgatcggta tgcgtaaacc ggaaggtcgt gacgctttcc gtcactactt cgaacaggct    240
atcgacctgg acggtccgga agttgctgct atcgctgctg ctaccgctac caccggtctg    300
ttcgctgtta tcggttgcat cgaacgtgac ggtggtaccc tgcactgcac cgttctgttc    360
ttcgacggtg ctgctggtct ggttggtaaa caccgtaaac tgatgccgac cgctggtgaa    420
cgtctgatct ggggtttcgg tgacggttct accatgccgg ttttcaaaac ctctctgggt    480
cgtatcggtg ctgttatctg ctgggaaaac tacatgccga tgctgcgtat gcacatgttc    540
tctcagggta tctctatcta ctgcgctccg accgctgacg accgtgacac ctggctgccg    600
tctatgcgtc acatcgctct ggaaggtcgt tgcttcgttc tgaccgcttg ccagcacatc    660
cgtcgtggtg ctttcccggc tggtcacgaa tgcgctctgg gtgacgaccc ggacaccgtt    720
ctgatgcgtg gtggttctgc tatcgttgac ccgctgggtg gtgttctggc tggtccggac    780
ttcaccggtg aaaccatcct gtacgctgac atcgacctgg gtgaagttgc tcgtggtaaa    840
ttcgacttcg acgttgttgg tcactacgct cgtccggaca tcttctctct gaccgttgac    900
gaccgtccgc gtccggctgt ttctaccctg ggtgacccgc aggctggttc t             951

<210> 68
<211> 317
<212> PRT
<213> bacterium YEK0313

<400> 68
Met Ser Val Val Arg Tyr Lys Ala Ala Val Ala Gln Ala Ala Ser Cys
1               5                   10                  15
Pro Asp Asp Ala Met Ala Ser Ala Thr Lys Ala Ala Arg Leu Ile Glu
            20                  25                  30
Glu Ala Ala Gly Ala Gly Ala Arg Leu Ile Val Phe Pro Glu Ala Phe
        35                  40                  45
Leu Gly Gly Tyr Pro Lys Gly Ala Ser Phe Gly Ala Pro Ile Gly Met
    50                  55                  60
Arg Lys Pro Glu Gly Arg Asp Ala Phe Arg His Tyr Phe Glu Gln Ala
65                  70                  75                  80
Ile Asp Leu Asp Gly Pro Glu Val Ala Ala Ile Ala Ala Ala Thr Ala
                85                  90                  95
Thr Thr Gly Leu Phe Ala Val Ile Gly Cys Ile Glu Arg Asp Gly Gly
            100                 105                 110
Thr Leu His Cys Thr Val Leu Phe Phe Asp Gly Ala Ala Gly Leu Val
        115                 120                 125
Gly Lys His Arg Lys Leu Met Pro Thr Ala Gly Glu Arg Leu Ile Trp
    130                 135                 140
Gly Phe Gly Asp Gly Ser Thr Met Pro Val Phe Lys Thr Ser Leu Gly
145                 150                 155                 160
Arg Ile Gly Ala Val Ile Cys Trp Glu Asn Tyr Met Pro Met Leu Arg
                165                 170                 175
Met His Met Phe Ser Gln Gly Ile Ser Ile Tyr Cys Ala Pro Thr Ala
            180                 185                 190
Asp Asp Arg Asp Thr Trp Leu Pro Ser Met Arg His Ile Ala Leu Glu
        195                 200                 205
Gly Arg Cys Phe Val Leu Thr Ala Cys Gln His Ile Arg Arg Gly Ala
    210                 215                 220
Phe Pro Ala Gly His Glu Cys Ala Leu Gly Asp Asp Pro Asp Thr Val
225                 230                 235                 240
Leu Met Arg Gly Gly Ser Ala Ile Val Asp Pro Leu Gly Gly Val Leu
                245                 250                 255
Ala Gly Pro Asp Phe Thr Gly Glu Thr Ile Leu Tyr Ala Asp Ile Asp
            260                 265                 270
Leu Gly Glu Val Ala Arg Gly Lys Phe Asp Phe Asp Val Val Gly His
        275                 280                 285
Tyr Ala Arg Pro Asp Ile Phe Ser Leu Thr Val Asp Asp Arg Pro Arg
    290                 295                 300
Pro Ala Val Ser Thr Leu Gly Asp Pro Gln Ala Gly Ser
305                 310                 315

<210> 69
<211> 819
<212> DNA
<213> Paenibacillus darwinianus

<400> 69
atgatccgtg aaggtaaccg tctgaccgtt gctgctgttc agatgaactg cgttctgggt     60
gacgttgaag ctaacctgcg taaagctgaa cgtctgctgg aaatcgctgc tggtcgtggt    120
gctcgtctgg ctgttctgcc ggaactgttc aacaccggtt accgtgttga agaacgtgac    180
gttgaactgg ctgaaccgat cccgggtccg accaccgaat ggatgcgtcg tcaggcttct    240
aaacacggta tgaaactggt tgctgctatc ctggaaaaag gtgctccggc tggtctggtt    300
tacgacaccg ctgttctggt tgaaccggct ggtgttatcg gttcttaccg taaaacccac    360
ctgtggaacc aggaaaacac ccgtttcacc cgtggtgaac agttcccggt ttacgaaacc    420
gacggtatcc aggttggtct gcagatctgc tacgaaatcg gtttcccgga aggtgctcgt    480
atcctgacct tccacggtgc tgacatcatc gtttacccgt ctgctttcgg taaagctcgt    540
ctgtacgctt gggacatcgc tacccgttct cgtgctctgg aaaacggtac cttcgttatc    600
gcttctaacc gtaccggtct ggaaaaaggt gaaaccgaat tcggtggtac ctctcgtatc    660
gttgacccgg ctggtaccat cctggctgaa gctgaacagg aagacgacgt tatcaccgct    720
gaactggacc tgggtctgat cgctgaacag cgtcgtgcta tcccgtacct gcgtgacttc    780
aaccgttctc tgatctctaa agaatacaac tctgaacgt                           819

<210> 70
<211> 273
<212> PRT
<213> Paenibacillus darwinianus

<400> 70
Met Ile Arg Glu Gly Asn Arg Leu Thr Val Ala Ala Val Gln Met Asn
1               5                   10                  15
Cys Val Leu Gly Asp Val Glu Ala Asn Leu Arg Lys Ala Glu Arg Leu
            20                  25                  30
Leu Glu Ile Ala Ala Gly Arg Gly Ala Arg Leu Ala Val Leu Pro Glu
        35                  40                  45
Leu Phe Asn Thr Gly Tyr Arg Val Glu Glu Arg Asp Val Glu Leu Ala
    50                  55                  60
Glu Pro Ile Pro Gly Pro Thr Thr Glu Trp Met Arg Arg Gln Ala Ser
65                  70                  75                  80
Lys His Gly Met Lys Leu Val Ala Ala Ile Leu Glu Lys Gly Ala Pro
                85                  90                  95
Ala Gly Leu Val Tyr Asp Thr Ala Val Leu Val Glu Pro Ala Gly Val
            100                 105                 110
Ile Gly Ser Tyr Arg Lys Thr His Leu Trp Asn Gln Glu Asn Thr Arg
        115                 120                 125
Phe Thr Arg Gly Glu Gln Phe Pro Val Tyr Glu Thr Asp Gly Ile Gln
    130                 135                 140
Val Gly Leu Gln Ile Cys Tyr Glu Ile Gly Phe Pro Glu Gly Ala Arg
145                 150                 155                 160
Ile Leu Thr Phe His Gly Ala Asp Ile Ile Val Tyr Pro Ser Ala Phe
                165                 170                 175
Gly Lys Ala Arg Leu Tyr Ala Trp Asp Ile Ala Thr Arg Ser Arg Ala
            180                 185                 190
Leu Glu Asn Gly Thr Phe Val Ile Ala Ser Asn Arg Thr Gly Leu Glu
        195                 200                 205
Lys Gly Glu Thr Glu Phe Gly Gly Thr Ser Arg Ile Val Asp Pro Ala
    210                 215                 220
Gly Thr Ile Leu Ala Glu Ala Glu Gln Glu Asp Asp Val Ile Thr Ala
225                 230                 235                 240
Glu Leu Asp Leu Gly Leu Ile Ala Glu Gln Arg Arg Ala Ile Pro Tyr
                245                 250                 255
Leu Arg Asp Phe Asn Arg Ser Leu Ile Ser Lys Glu Tyr Asn Ser Glu
            260                 265                 270
Arg

<210> 71
<211> 1098
<212> DNA
<213> Haloarcula sp.

<220>
<223> Haloarcula sp. CBA1115

<400> 71
atgccggctg aatctttcac cctggctgct gctcaggttg aaccggttta ccacgacaaa     60
gaaggtaccc tggacaaaac ctgccgttac atcgaacagg ctggtcgtga cggtgctgac    120
atcgttgttt tcccggaaac ctacttcccg ggttacccgt actggcgtgg ttctgtttct    180
atctctcgtt ggaccgacct gatggttgac ctgcagaaaa actctctgca cgttgacgac    240
gaagctatcg aagttctggg tgaagctgtt gctgaagctg acctgaccct ggttctgggt    300
accaacgaag tttctgaccg tcagggttct gaaaccctgt acaactctct gttctacttc    360
gactctaccg gtgaactgat gggtcgtcac cgtaaactga tgccgaccca cgaagaacgt    420
gctatctggg gtcgtggtga cccgtcttct ctggctacct acgaaaccga catcggttgg    480
ctgggtggtc tgatctgcta cgaaaaccac atgaccctgt ctaaagctgc tctgaccgct    540
atgggtgaag aaatccacgc tgctgtttgg ccgggtttct ggaaacagca cggtcacccg    600
ggtgacaaaa cccgtgctga aacctctgaa gctgttgaca cctgcgacat ctacccggct    660
atgcgtgaat acgctttcga aacccagtct ttcgttgctg cttgctctgc ttacatgtct    720
gacgctgttc cggacggttt ctctgaagac gaactgggtt tcaacgttgc tgctggtggt    780
tctatgctga tcaacccggc tggtatcgtt aaagctggtc cgctggttgg tgaagaaggt    840
ctgctgaccg ctgaattcca ggacgacgaa cgtcgtgcta ccaaagctta cttcgacgct    900
atgggtcact acacccgttg ggacgctgtt tctctgtcta tcaacgacga aaccctggct    960
ccgtctcagc cgcgtgaacc gtctaaaaac ccggttgctg gtacctcttc tctgtctgct   1020
gctcaggctc aggctgttgc tgacgaatac gacgttccgg ttgaagctgt tgaagctgtt   1080
gctgacaaac tgaccgac                                                 1098

<210> 72
<211> 366
<212> PRT
<213> Haloarcula sp.

<220>
<223> Haloarcula sp. CBA1115

<400> 72
Met Pro Ala Glu Ser Phe Thr Leu Ala Ala Ala Gln Val Glu Pro Val
1               5                   10                  15
Tyr His Asp Lys Glu Gly Thr Leu Asp Lys Thr Cys Arg Tyr Ile Glu
            20                  25                  30
Gln Ala Gly Arg Asp Gly Ala Asp Ile Val Val Phe Pro Glu Thr Tyr
        35                  40                  45
Phe Pro Gly Tyr Pro Tyr Trp Arg Gly Ser Val Ser Ile Ser Arg Trp
    50                  55                  60
Thr Asp Leu Met Val Asp Leu Gln Lys Asn Ser Leu His Val Asp Asp
65                  70                  75                  80
Glu Ala Ile Glu Val Leu Gly Glu Ala Val Ala Glu Ala Asp Leu Thr
                85                  90                  95
Leu Val Leu Gly Thr Asn Glu Val Ser Asp Arg Gln Gly Ser Glu Thr
            100                 105                 110
Leu Tyr Asn Ser Leu Phe Tyr Phe Asp Ser Thr Gly Glu Leu Met Gly
        115                 120                 125
Arg His Arg Lys Leu Met Pro Thr His Glu Glu Arg Ala Ile Trp Gly
    130                 135                 140
Arg Gly Asp Pro Ser Ser Leu Ala Thr Tyr Glu Thr Asp Ile Gly Trp
145                 150                 155                 160
Leu Gly Gly Leu Ile Cys Tyr Glu Asn His Met Thr Leu Ser Lys Ala
                165                 170                 175
Ala Leu Thr Ala Met Gly Glu Glu Ile His Ala Ala Val Trp Pro Gly
            180                 185                 190
Phe Trp Lys Gln His Gly His Pro Gly Asp Lys Thr Arg Ala Glu Thr
        195                 200                 205
Ser Glu Ala Val Asp Thr Cys Asp Ile Tyr Pro Ala Met Arg Glu Tyr
    210                 215                 220
Ala Phe Glu Thr Gln Ser Phe Val Ala Ala Cys Ser Ala Tyr Met Ser
225                 230                 235                 240
Asp Ala Val Pro Asp Gly Phe Ser Glu Asp Glu Leu Gly Phe Asn Val
                245                 250                 255
Ala Ala Gly Gly Ser Met Leu Ile Asn Pro Ala Gly Ile Val Lys Ala
            260                 265                 270
Gly Pro Leu Val Gly Glu Glu Gly Leu Leu Thr Ala Glu Phe Gln Asp
        275                 280                 285
Asp Glu Arg Arg Ala Thr Lys Ala Tyr Phe Asp Ala Met Gly His Tyr
    290                 295                 300
Thr Arg Trp Asp Ala Val Ser Leu Ser Ile Asn Asp Glu Thr Leu Ala
305                 310                 315                 320
Pro Ser Gln Pro Arg Glu Pro Ser Lys Asn Pro Val Ala Gly Thr Ser
                325                 330                 335
Ser Leu Ser Ala Ala Gln Ala Gln Ala Val Ala Asp Glu Tyr Asp Val
            340                 345                 350
Pro Val Glu Ala Val Glu Ala Val Ala Asp Lys Leu Thr Asp
        355                 360                 365

<210> 73
<211> 1062
<212> DNA
<213> Hungatella hathewayi

<400> 73
atgtctaaaa aagaaaccgt taaagaagtt acccacacca tcggtgacac cctgccgaaa     60
ctgcgtgctg ctgctgttca ggctgctccg gttttcctga accgtgacgc taccgttcag    120
aaagttgctc gtctgaccaa agaagctaaa gacaacggtg ctgacctggt tgttttcccg    180
gaatctttca tcccgacctt cccgctgtgg tgcctgttcc tgccgccggt tgaccagcac    240
ccgttctaca aacgtctgtt cgaaaacgct gttaccgttc cgggtccggc tttccacgaa    300
ctgcagaaaa tcgctcgtga caactctatc ttcctgtctg ttggtatctg cgaaaaatct    360
acctctaact tcggtaccat gtggaacacc accctgctgt tcgaccgtga aggtaacatg    420
atcggtcacc accgtaaact gctgccgacc tggggtgaaa aactggtttg gtctttcggt    480
gacggttctt ctctgaacat ccacgacacc gaaatcggtc gtatcggttc tctgatctgc    540
ggtgaaaact ctaacaccct ggctcgttac gctctggttg ctcagggtga acaggttcac    600
atctctgttt acccgccgtg ctggccgacc aaccgtgaaa aaggtaacta cgctgactgc    660
ctgcgtgttc gtacctgcgc tcacgctttc gaagctaaag ttttcaacat ctgctcttct    720
gcttctctgg acgaagacgc tatggaacag atgtctatgg gtgacccggc tctgaaagaa    780
tggctgcaca accagtcttg ggctctgacc atgatcgctg gtccgaacgg tcagccgtgc    840
tgcccgtcta tcgaaaacaa ccaggaaggt atcatctacg ctgactgcga catcgctaac    900
gaaatcaccg ctaaaggtat ccacgacatc gctggtgctt accagcgttt cgacgttttc    960
cagctgcacg ttaacaaaac cccgcgtgaa ccggcttact tctacgacga aggtatcggt   1020
gaatctcgtg aatacatccc gtacgaagaa gaagacaccg aa                      1062

<210> 74
<211> 354
<212> PRT
<213> Hungatella hathewayi

<400> 74
Met Ser Lys Lys Glu Thr Val Lys Glu Val Thr His Thr Ile Gly Asp
1               5                   10                  15
Thr Leu Pro Lys Leu Arg Ala Ala Ala Val Gln Ala Ala Pro Val Phe
            20                  25                  30
Leu Asn Arg Asp Ala Thr Val Gln Lys Val Ala Arg Leu Thr Lys Glu
        35                  40                  45
Ala Lys Asp Asn Gly Ala Asp Leu Val Val Phe Pro Glu Ser Phe Ile
    50                  55                  60
Pro Thr Phe Pro Leu Trp Cys Leu Phe Leu Pro Pro Val Asp Gln His
65                  70                  75                  80
Pro Phe Tyr Lys Arg Leu Phe Glu Asn Ala Val Thr Val Pro Gly Pro
                85                  90                  95
Ala Phe His Glu Leu Gln Lys Ile Ala Arg Asp Asn Ser Ile Phe Leu
            100                 105                 110
Ser Val Gly Ile Cys Glu Lys Ser Thr Ser Asn Phe Gly Thr Met Trp
        115                 120                 125
Asn Thr Thr Leu Leu Phe Asp Arg Glu Gly Asn Met Ile Gly His His
    130                 135                 140
Arg Lys Leu Leu Pro Thr Trp Gly Glu Lys Leu Val Trp Ser Phe Gly
145                 150                 155                 160
Asp Gly Ser Ser Leu Asn Ile His Asp Thr Glu Ile Gly Arg Ile Gly
                165                 170                 175
Ser Leu Ile Cys Gly Glu Asn Ser Asn Thr Leu Ala Arg Tyr Ala Leu
            180                 185                 190
Val Ala Gln Gly Glu Gln Val His Ile Ser Val Tyr Pro Pro Cys Trp
        195                 200                 205
Pro Thr Asn Arg Glu Lys Gly Asn Tyr Ala Asp Cys Leu Arg Val Arg
    210                 215                 220
Thr Cys Ala His Ala Phe Glu Ala Lys Val Phe Asn Ile Cys Ser Ser
225                 230                 235                 240
Ala Ser Leu Asp Glu Asp Ala Met Glu Gln Met Ser Met Gly Asp Pro
                245                 250                 255
Ala Leu Lys Glu Trp Leu His Asn Gln Ser Trp Ala Leu Thr Met Ile
            260                 265                 270
Ala Gly Pro Asn Gly Gln Pro Cys Cys Pro Ser Ile Glu Asn Asn Gln
        275                 280                 285
Glu Gly Ile Ile Tyr Ala Asp Cys Asp Ile Ala Asn Glu Ile Thr Ala
    290                 295                 300
Lys Gly Ile His Asp Ile Ala Gly Ala Tyr Gln Arg Phe Asp Val Phe
305                 310                 315                 320
Gln Leu His Val Asn Lys Thr Pro Arg Glu Pro Ala Tyr Phe Tyr Asp
                325                 330                 335
Glu Gly Ile Gly Glu Ser Arg Glu Tyr Ile Pro Tyr Glu Glu Glu Asp
            340                 345                 350
Thr Glu

<210> 75
<211> 5365
<212> DNA
<213> Artificial sequence

<220>
<223> Plasmid

<400> 75
cgatcaccac aattcagcaa attgtgaaca tcatcacgtt catctttccc tggttgccaa     60
tggcccattt tcctgtcagt aacgagaagg tcgcgaattc aggcgctttt tagactggtc    120
gtaatgaaca attcttaaga aggagatata catatgcaga caagaaaaat cgtccgggca    180
gccgccgtac aggccgcctc tcccaactac gatctggcaa cgggtgttga taaaaccatt    240
gagctggctc gtcaggcccg cgatgagggc tgtgacctga tcgtgtttgg tgaaacctgg    300
ctgcccggat atcccttcca cgtctggctg ggcgcaccgg cctggtcgct gaaatacagt    360
gcccgctact atgccaactc gctctcgctg gacagtgcag agtttcaacg cattgcccag    420
gccgcacgga ccttgggtat tttcatcgca ctgggttata gcgagcgcag cggcggcagc    480
ctttacctgg gccaatgcct gatcgacgac aagggcgaga tgctgtggtc gcgtcgcaaa    540
ctcaaaccca cgcatgtaga gcgcaccgta tttggtgaag gttatgcccg tgatctgatt    600
gtgtccgaca cagaactggg acgcgtcggt gctctatgct gctgggagca tttgtcgccc    660
ttgagcaagt acgcgctgta ctcccagcat gaagccattc acattgctgc ctggccgtcg    720
ttttcgctat acagcgaaca ggcccacgcc ctcagtgcca aggtgaacat ggctgcctcg    780
caaatctatt cggttgaagg ccagtgcttt accatcgccg ccagcagtgt ggtcacccaa    840
gagacgctag acatgctgga agtgggtgaa cacaacgccc ccttgctgaa agtgggcggc    900
ggcagttcca tgatttttgc gccggacgga cgcacactgg ctccctacct gcctcacgat    960
gccgagggct tgatcattgc cgatctgaat atggaggaga ttgccttcgc caaagcgatc   1020
aatgaccccg taggccacta ttccaaaccc gaggccaccc gtctggtgct ggacttgggg   1080
caccgagacc ccatgactcg ggtgcactcc aaaagcgtga ccagggaaga ggctcccgag   1140
caaggtgtgc aaagcaagat tgcctcagtc gctatcagcc atccacagga ctcggacaca   1200
ctgctagtgc aagagccgtc cttgaggatc cgtcgacctg cagccaagct tggctgtttt   1260
ggcggatgag agaagatttt cagcctgata cagattaaat cagaacgcag aagcggtctg   1320
ataaaacaga atttgcctgg cggcagtagc gcggtggtcc cacctgaccc catgccgaac   1380
tcagaagtga aacgccgtag cgccgatggt agtgtggggt ctccccatgc gagagtaggg   1440
aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct ttcgttttat   1500
ctgttgtttg tcggtgaacg ctctcctgag taggacaaat ccgccgggag cggatttgaa   1560
cgttgcgaag caacggcccg gagggtggcg ggcaggacgc ccgccataaa ctgccaggca   1620
tcaaattaag cagaaggcca tcctgacgga tggccttttt gcgtttctac aaactctttt   1680
gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa   1740
tgcttcaata atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta   1800
ttcccttttt tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag   1860
taaaagatgc tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca   1920
gcggtaagat ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta   1980
aagttctgct atgtggcgcg gtattatccc gtgttgacgc cgggcaagag caactcggtc   2040
gccgcataca ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc   2100
ttacggatgg catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca   2160
ctgcggccaa cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc   2220
acaacatggg ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca   2280
taccaaacga cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac   2340
tattaactgg cgaactactt actctagctt cccggcaaca attaatagac tggatggagg   2400
cggataaagt tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg   2460
ataaatctgg agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg   2520
gtaagccctc ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac   2580
gaaatagaca gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc   2640
aagtttactc atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct   2700
aggtgaagat cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc   2760
actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc   2820
gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg   2880
atcaagagct accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa   2940
atactgtcct tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc   3000
ctacatacct cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt   3060
gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa   3120
cggggggttc gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc   3180
tacagcgtga gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc   3240
cggtaagcgg cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct   3300
ggtatcttta tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat   3360
gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc   3420
tggccttttg ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg   3480
ataaccgtat taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc   3540
gcagcgagtc agtgagcgag gaagcggaag agcgcctgat gcggtatttt ctccttacgc   3600
atctgtgcgg tatttcacac cgcatatatg gtgcactctc agtacaatct gctctgatgc   3660
cgcatagtta agccagtata cactccgcta tcgctacgtg actgggtcat ggctgcgccc   3720
cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct   3780
tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca   3840
ccgaaacgcg cgaggcagct gcggtaaagc tcatcagcgt ggtcgtgaag cgattcacag   3900
atgtctgcct gttcatccgc gtccagctcg ttgagtttct ccagaagcgt taatgtctgg   3960
cttctgataa agcgggccat gttaagggcg gttttttcct gtttggtcac tgatgcctcc   4020
gtgtaagggg gatttctgtt catgggggta atgataccga tgaaacgaga gaggatgctc   4080
acgatacggg ttactgatga tgaacatgcc cggttactgg aacgttgtga gggtaaacaa   4140
ctggcggtat ggatgcggcg ggaccagaga aaaatcactc agggtcaatg ccagcgcttc   4200
gttaatacag atgtaggtgt tccacagggt agccagcagc atcctgcgat gcagatccgg   4260
aacataatgg tgcagggcgc tgacttccgc gtttccagac tttacgaaac acggaaaccg   4320
aagaccattc atgttgttgc tcaggtcgca gacgttttgc agcagcagtc gcttcacgtt   4380
cgctcgcgta tcggtgattc attctgctaa ccagtaaggc aaccccgcca gcctagccgg   4440
gtcctcaacg acaggagcac gatcatgcgc acccgtggcc aggacccaac gctgcccgag   4500
atgcgccgcg tgcggctgct ggagatggcg gacgcgatgg atatgttctg ccaagggttg   4560
gtttgcgcat tcacagttct ccgcaagaat tgattggctc caattcttgg agtggtgaat   4620
ccgttagcga ggtgccgccg gcttccattc aggtcgaggt ggcccggctc catgcaccgc   4680
gacgcaacgc ggggaggcag acaaggtata gggcggcgcc tacaatccat gccaacccgt   4740
tccatgtgct cgccgaggcg gcataaatcg ccgtgacgat cagcggtcca atgatcgaag   4800
ttaggctggt aagagccgcg agcgatcctt gaagctgtcc ctgatggtcg tcatctacct   4860
gcctggacag catggcctgc aacgcgggca tcccgatgcc gccggaagcg agaagaatca   4920
taatggggaa ggccatccag cctcgcgtcg cgaacgccag caagacgtag cccagcgcgt   4980
cggccgccat gccggcgata atggcctgct tctcgccgaa acgtttggtg gcgggaccag   5040
tgacgaaggc ttgagcgagg gcgtgcaaga ttccgaatac cgcaagcgac aggccgatca   5100
tcgtcgcgct ccagcgaaag cggtcctcgc cgaaaatgac ccagagcgct gccggcacct   5160
gtcctacgag ttgcatgata aagaagacag tcataagtgc ggcgacgata gtcatgcccc   5220
gcgcccaccg gaaggagctg actgggttga aggctctcaa gggcatcggt cgacgctctc   5280
ccttatgcga ctcctgcatt aggaagcagc ccagtagtag gttgaggccg ttgagcaccg   5340
ccgccgcaag gaatggtgca tgcat                                         5365
