                         SEQUENCE LISTING

<110>  COSKATA, INC.
       Reeves, Andrew
       Enzien, Mike
       Tobey, Richard R
 
<120>  Compositions and Methods for the Conversion of Short-Chained
       Carboxylic Acids to Alcohols Using Clostridial Enzymes

<130>  14-992

<160>  34

<170>  PatentIn version 3.5

<210>  1
<211>  1962
<212>  DNA
<213>  Clostridium autoethanogenum

<400>  1
tctgaaatgg aacaaaattg caccaagatg gaacagtttg cttggcgcaa taccctcaaa       60

acacaatatt ttattattgg catggtttat gctatatata taattgtttg aataatcaat      120

ttttttagga gggtttttat gtacggatat aagggtaagg tattaagaat taatctaagt      180

agtaaaactt atatagtgga agaattgaaa attgacaaag ctaaaaaatt tataggtgca      240

agagggttag gcgtaaaaac cttatttgac gaagtagatc caaaggtaga tccattatca      300

cctgataaca aatttattat agcagcggga ccacttacag gtgcacctgt tccaacaagc      360

ggaagattca tggtagttac taaatcacct ttaacaggaa ctattgctat tgcaaattca      420

ggtggaaaat ggggagcaga attcaaagca gctggatacg atatgataat cgttgaaggt      480

aaatctgata aagaagttta tgtaaatata gtagatgata aagtagaatt tagggatgct      540

tctcatgttt ggggaaaact aacagaagaa actacaaaaa tgcttcaaca ggaaacagat      600

tcgagagcta aggttttatg cataggacca gctggggaaa agttatcact tatggcagca      660

gttatgaatg atgttgatag aacagcagga cgtggtggtg ttggagctgt tatgggttca      720

aagaacttaa aagctattgt agttaaagga agcggaaaag taaaattatt tgatgaacaa      780

aaagtgaagg aagtagcact tgagaaaaca aatattttaa gaaaagatcc agtagctggt      840

ggaggacttc caacatacgg aacagctgta cttgttaata ttataaatga aaatggtgta      900

catccagtaa agaattttca aaaatcttat acagatcaag cagataagat cagtggagaa      960

actttaacta aagattgctt agttagaaaa aatccttgct ataggtgtcc aattgcctgt     1020

ggaagatggg taaaacttga tgatggaact gaatgtggag gaccagaata tgaaacatta     1080

tggtcatttg gatctgattg tgatgtatac gatataaatg ctgtaaatac agcaaatatg     1140

ttgtgtaatg aatatggact agataccatt acagcaggat gtactattgc agcagctatg     1200

gaactttatc aaagaggtta tattaaggat gaagaaatag cagcagatgg attgtcactt     1260

aattggggag atgctaagtc catggttgaa tgggtaaaga aaatgggact tagagaagga     1320

tttggagaca agatggcaga tggttcatac agactttgtg actcatacgg tgtacctgag     1380

tattcaatga ctgtaaaaaa acaggaactt ccagcatatg acccaagagg aatacaggga     1440

catggcatta cttatgctgt taacaatagg ggaggatgtc acattaaggg atatatggta     1500

agtcctgaaa tacttggcta tccagaaaaa cttgatagac ttgcagtgga aggaaaagca     1560

ggatatgcta gagtattcca tgatttaaca gctgttatag attcacttgg attatgtatt     1620

tttacaacat ttggtcttgg tgcacaggat tatgttgata tgtataatgc agtagttggt     1680

ggagaattac atgatgtaaa ttctttaatg ttagctggag atagaatatg gactttagaa     1740

aaaatattta acttaaaagc aggcatagat agttcacagg atactcttcc aaagagattg     1800

cttgaagaac aaattccaga aggaccatca aaaggagaag ttcataagtt agatgtacta     1860

ctacctgaat attattcagt acgtggatgg gataaaaatg gtattcctac agaggaaacg     1920

ttaaagaaat taggattaga tgaatacgta ggtaagcttt ag                        1962


<210>  2
<211>  607
<212>  PRT
<213>  Clostridium autoethanogenum

<400>  2

Met Tyr Gly Tyr Lys Gly Lys Val Leu Arg Ile Asn Leu Ser Ser Lys 
1               5                   10                  15      


Thr Tyr Ile Val Glu Glu Leu Lys Ile Asp Lys Ala Lys Lys Phe Ile 
            20                  25                  30          


Gly Ala Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Val Asp Pro 
        35                  40                  45              


Lys Val Asp Pro Leu Ser Pro Asp Asn Lys Phe Ile Ile Ala Ala Gly 
    50                  55                  60                  


Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 
65                  70                  75                  80  


Thr Lys Ser Pro Leu Thr Gly Thr Ile Ala Ile Ala Asn Ser Gly Gly 
                85                  90                  95      


Lys Trp Gly Ala Glu Phe Lys Ala Ala Gly Tyr Asp Met Ile Ile Val 
            100                 105                 110         


Glu Gly Lys Ser Asp Lys Glu Val Tyr Val Asn Ile Val Asp Asp Lys 
        115                 120                 125             


Val Glu Phe Arg Asp Ala Ser His Val Trp Gly Lys Leu Thr Glu Glu 
    130                 135                 140                 


Thr Thr Lys Met Leu Gln Gln Glu Thr Asp Ser Arg Ala Lys Val Leu 
145                 150                 155                 160 


Cys Ile Gly Pro Ala Gly Glu Lys Leu Ser Leu Met Ala Ala Val Met 
                165                 170                 175     


Asn Asp Val Asp Arg Thr Ala Gly Arg Gly Gly Val Gly Ala Val Met 
            180                 185                 190         


Gly Ser Lys Asn Leu Lys Ala Ile Val Val Lys Gly Ser Gly Lys Val 
        195                 200                 205             


Lys Leu Phe Asp Glu Gln Lys Val Lys Glu Val Ala Leu Glu Lys Thr 
    210                 215                 220                 


Asn Ile Leu Arg Lys Asp Pro Val Ala Gly Gly Gly Leu Pro Thr Tyr 
225                 230                 235                 240 


Gly Thr Ala Val Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 
                245                 250                 255     


Val Lys Asn Phe Gln Lys Ser Tyr Thr Asp Gln Ala Asp Lys Ile Ser 
            260                 265                 270         


Gly Glu Thr Leu Thr Lys Asp Cys Leu Val Arg Lys Asn Pro Cys Tyr 
        275                 280                 285             


Arg Cys Pro Ile Ala Cys Gly Arg Trp Val Lys Leu Asp Asp Gly Thr 
    290                 295                 300                 


Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Ser Phe Gly Ser Asp 
305                 310                 315                 320 


Cys Asp Val Tyr Asp Ile Asn Ala Val Asn Thr Ala Asn Met Leu Cys 
                325                 330                 335     


Asn Glu Tyr Gly Leu Asp Thr Ile Thr Ala Gly Cys Thr Ile Ala Ala 
            340                 345                 350         


Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 
        355                 360                 365             


Ala Asp Gly Leu Ser Leu Asn Trp Gly Asp Ala Lys Ser Met Val Glu 
    370                 375                 380                 


Trp Val Lys Lys Met Gly Leu Arg Glu Gly Phe Gly Asp Lys Met Ala 
385                 390                 395                 400 


Asp Gly Ser Tyr Arg Leu Cys Asp Ser Tyr Gly Val Pro Glu Tyr Ser 
                405                 410                 415     


Met Thr Val Lys Lys Gln Glu Leu Pro Ala Tyr Asp Pro Arg Gly Ile 
            420                 425                 430         


Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 
        435                 440                 445             


Ile Lys Gly Tyr Met Val Ser Pro Glu Ile Leu Gly Tyr Pro Glu Lys 
    450                 455                 460                 


Leu Asp Arg Leu Ala Val Glu Gly Lys Ala Gly Tyr Ala Arg Val Phe 
465                 470                 475                 480 


His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 
                485                 490                 495     


Thr Phe Gly Leu Gly Ala Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 
            500                 505                 510         


Val Gly Gly Glu Leu His Asp Val Asn Ser Leu Met Leu Ala Gly Asp 
        515                 520                 525             


Arg Ile Trp Thr Leu Glu Lys Ile Phe Asn Leu Lys Ala Gly Ile Asp 
    530                 535                 540                 


Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Gln Ile Pro 
545                 550                 555                 560 


Glu Gly Pro Ser Lys Gly Glu Val His Lys Leu Asp Val Leu Leu Pro 
                565                 570                 575     


Glu Tyr Tyr Ser Val Arg Gly Trp Asp Lys Asn Gly Ile Pro Thr Glu 
            580                 585                 590         


Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Val Gly Lys Leu 
        595                 600                 605         


<210>  3
<211>  2106
<212>  DNA
<213>  Clostridium autoethanogenum

<400>  3
aaatagtatt gtacgttatt tgatcacttt tttaaaaata aaatgataat attgtatttt       60

ttttgagcat tttggcaata caaagaactg tattgtttat tatttaagca ttttattata      120

aaacaaaaaa acgttattaa attatttgct atgaattcac ttgataatca atgcattgca      180

tgtgatgttg attattgagt gttttttttg taaccatatt tggcacaatt tatgctctat      240

aatatttctg aaataaatac atttatatga ggaggaattt caatgtatgg ttatgatggt      300

aaagtattaa gaattaattt aaaagaaaga acttgcaaat cagaaaattt agatttagat      360

aaagctaaaa agtttatagg ttgtagggga ctaggtgtta aaactttatt tgatgaaata      420

gatcctaaaa tagatgcatt atcaccagaa aataaattta taattgtaac aggtccttta      480

actggagctc cggttccaac tagtggaagg tttatggtag ttactaaagc accgcttaca      540

ggaactatag gaatttcaaa ttcgggtgga aaatggggag tagacttaaa aaaagctggt      600

tgggatatga taatagtaga ggataaggct gattcaccag tttacattga aatagtagat      660

gataaggtag aaattaaaga cgcgtcacag ctttggggaa aagttacatc agaaactaca      720

aaagagttag aaaagataac tgagaataaa tcaaaggtat tatgtatagg acctgctggt      780

gaacgattgt ctcttatggc agcagttatg aatgatgtag atagaactgc agcaagaggc      840

ggcgttggtg cagttatggg atctaaaaac ttaaaagcta ttacagttaa aggaactgga      900

aaaatagctt tagctgataa agaaaaagta aaaaaagtgt ccgtagaaaa aattacaaca      960

ttaaaaaatg atccagtagc tggtcaggga atgccaactt atggtacagc tatactggtt     1020

aatataataa atgaaaatgg agttcatcct gtaaagaatt ttcaagagtc ttatacgaat     1080

caagcagata aaataagtgg agagactctt actgctaacc aactagtaag gaaaaatcct     1140

tgttacagct gtcctatagg ttgtggaaga tgggttagac taaaagatgg cacagagtgc     1200

ggaggaccag aatatgaaac actgtggtgt tttggatctg actgtggttc atatgattta     1260

gatgctataa atgaagctaa tatgttatgt aatgaatatg gtattgatac tattacttgt     1320

ggtgcaacaa ttgctgcagc tatggaactt tatcaaagag gatatataaa agacgaagaa     1380

atagctggag ataacctatc tctcaagtgg ggtgatacgg aatctatgat tggctggata     1440

aagagaatgg tatatagtga aggctttgga gcaaagatga caaatggttc atataggctt     1500

tgtgaaggtt atggagcacc ggagtattct atgacagtta aaaagcagga aattccagca     1560

tatgatccaa ggggaataca gggacacggt attacctatg cagttaataa tagaggaggc     1620

tgtcatatta agggatacat gattaaccct gaaatattag gttatcctga aaaacttgat     1680

agatttgcat tagatggtaa agcagcttat gccaaattat ttcatgattt aactgctgta     1740

attgattctt taggattgtg catattcact acatttgggc ttggaataca ggattatgta     1800

gatatgtata atgcagtagt aggagaatct acttatgatg cagattcact attagaggca     1860

ggagatagaa tctggactct tgagaaatta tttaatcttg cagctggaat agacagcagc     1920

caggatactc taccaaagag attgttagaa gaacctattc cagatggccc atcaaaggga     1980

gaagttcata ggctagatgt tcttctgcca gaatattact cagtacgagg atggagtaaa     2040

gagggtatac ctacagaaga aacattaaag aaattaggat tagatgaata tataggtaag     2100

ttctag                                                                2106


<210>  4
<211>  607
<212>  PRT
<213>  Clostridium autoethanogenum

<400>  4

Met Tyr Gly Tyr Asp Gly Lys Val Leu Arg Ile Asn Leu Lys Glu Arg 
1               5                   10                  15      


Thr Cys Lys Ser Glu Asn Leu Asp Leu Asp Lys Ala Lys Lys Phe Ile 
            20                  25                  30          


Gly Cys Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Ile Asp Pro 
        35                  40                  45              


Lys Ile Asp Ala Leu Ser Pro Glu Asn Lys Phe Ile Ile Val Thr Gly 
    50                  55                  60                  


Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 
65                  70                  75                  80  


Thr Lys Ala Pro Leu Thr Gly Thr Ile Gly Ile Ser Asn Ser Gly Gly 
                85                  90                  95      


Lys Trp Gly Val Asp Leu Lys Lys Ala Gly Trp Asp Met Ile Ile Val 
            100                 105                 110         


Glu Asp Lys Ala Asp Ser Pro Val Tyr Ile Glu Ile Val Asp Asp Lys 
        115                 120                 125             


Val Glu Ile Lys Asp Ala Ser Gln Leu Trp Gly Lys Val Thr Ser Glu 
    130                 135                 140                 


Thr Thr Lys Glu Leu Glu Lys Ile Thr Glu Asn Lys Ser Lys Val Leu 
145                 150                 155                 160 


Cys Ile Gly Pro Ala Gly Glu Arg Leu Ser Leu Met Ala Ala Val Met 
                165                 170                 175     


Asn Asp Val Asp Arg Thr Ala Ala Arg Gly Gly Val Gly Ala Val Met 
            180                 185                 190         


Gly Ser Lys Asn Leu Lys Ala Ile Thr Val Lys Gly Thr Gly Lys Ile 
        195                 200                 205             


Ala Leu Ala Asp Lys Glu Lys Val Lys Lys Val Ser Val Glu Lys Ile 
    210                 215                 220                 


Thr Thr Leu Lys Asn Asp Pro Val Ala Gly Gln Gly Met Pro Thr Tyr 
225                 230                 235                 240 


Gly Thr Ala Ile Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 
                245                 250                 255     


Val Lys Asn Phe Gln Glu Ser Tyr Thr Asn Gln Ala Asp Lys Ile Ser 
            260                 265                 270         


Gly Glu Thr Leu Thr Ala Asn Gln Leu Val Arg Lys Asn Pro Cys Tyr 
        275                 280                 285             


Ser Cys Pro Ile Gly Cys Gly Arg Trp Val Arg Leu Lys Asp Gly Thr 
    290                 295                 300                 


Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Cys Phe Gly Ser Asp 
305                 310                 315                 320 


Cys Gly Ser Tyr Asp Leu Asp Ala Ile Asn Glu Ala Asn Met Leu Cys 
                325                 330                 335     


Asn Glu Tyr Gly Ile Asp Thr Ile Thr Cys Gly Ala Thr Ile Ala Ala 
            340                 345                 350         


Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 
        355                 360                 365             


Gly Asp Asn Leu Ser Leu Lys Trp Gly Asp Thr Glu Ser Met Ile Gly 
    370                 375                 380                 


Trp Ile Lys Arg Met Val Tyr Ser Glu Gly Phe Gly Ala Lys Met Thr 
385                 390                 395                 400 


Asn Gly Ser Tyr Arg Leu Cys Glu Gly Tyr Gly Ala Pro Glu Tyr Ser 
                405                 410                 415     


Met Thr Val Lys Lys Gln Glu Ile Pro Ala Tyr Asp Pro Arg Gly Ile 
            420                 425                 430         


Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 
        435                 440                 445             


Ile Lys Gly Tyr Met Ile Asn Pro Glu Ile Leu Gly Tyr Pro Glu Lys 
    450                 455                 460                 


Leu Asp Arg Phe Ala Leu Asp Gly Lys Ala Ala Tyr Ala Lys Leu Phe 
465                 470                 475                 480 


His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 
                485                 490                 495     


Thr Phe Gly Leu Gly Ile Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 
            500                 505                 510         


Val Gly Glu Ser Thr Tyr Asp Ala Asp Ser Leu Leu Glu Ala Gly Asp 
        515                 520                 525             


Arg Ile Trp Thr Leu Glu Lys Leu Phe Asn Leu Ala Ala Gly Ile Asp 
    530                 535                 540                 


Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Pro Ile Pro 
545                 550                 555                 560 


Asp Gly Pro Ser Lys Gly Glu Val His Arg Leu Asp Val Leu Leu Pro 
                565                 570                 575     


Glu Tyr Tyr Ser Val Arg Gly Trp Ser Lys Glu Gly Ile Pro Thr Glu 
            580                 585                 590         


Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Ile Gly Lys Phe 
        595                 600                 605         


<210>  5
<211>  2329
<212>  DNA
<213>  Clostridium autoethanogenum

<400>  5
gaattctgat tgattattta ttttaaaatg cctaagtgaa atatatacat attataacaa       60

taaaataagt attagtgtag gatttttaaa tagagtatct attttcagat taaatttttg      120

attatttgat ttacattata taatattgag taaagtattg actagcaaaa ttttttgata      180

ctttaatttg tgaaatttct tatcaaaagt tatatttttg aataattttt attgaaaaat      240

acaactaaaa aggattatag tataagtgtg tgtaattttg tgttaaattt aaagggagga      300

aaatgtatgg ttatgatggt aaagtattaa gaattaattt aaaagaaaga acttgcaaat      360

cagaaaattt agatttagat aaagctaaaa agtttatagg ttgtagggga ctaggtgtta      420

aaactttatt tgatgaaata gatcctaaaa tagatgcatt atcaccagaa aataaattta      480

taattgtaac aggtccttta actggagctc cggttccaac tagtggaagg tttatggtag      540

ttactaaagc accgcttaca ggaactatag gaatttcaaa ttcgggtgga aaatggggag      600

tagacttaaa aaaagctggt tgggatatga taatagtaga ggataaggct gattcaccag      660

tttacattga aatagtagat gataaggtag aaattaaaga cgcgtcacag ctttggggaa      720

aagttacatc agaaactaca aaagagttag aaaagataac tgagaataaa tcaaaggtat      780

tatgtatagg acctgctggt gaacgattgt ctcttatggc agcagttatg aatgatgtag      840

atagaactgc agcaagaggc ggcgttggtg cagttatggg atctaaaaac ttaaaagcta      900

ttacagttaa aggaactgga aaaatagctt tagctgataa agaaaaagta aaaaaagtgt      960

ccgtagaaaa aattacaaca ttaaaaaatg atccagtagc tggtcaggga atgccaactt     1020

atggtacagc tatactggtt aatataataa atgaaaatgg agttcatcct gtaaagaatt     1080

ttcaagagtc ttatacgaat caagcagata aaataagtgg agagactctt actgctaacc     1140

aactagtaag gaaaaatcct tgttacagct gtcctatagg ttgtggaaga tgggttagac     1200

taaaagatgg cacagagtgc ggaggaccag aatatgaaac actgtggtgt tttggatctg     1260

actgtggttc atatgattta gatgctataa atgaagctaa tatgttatgt aatgaatatg     1320

gtattgatac tattacttgt ggtgcaacaa ttgctgcagc tatggaactt tatcaaagag     1380

gatatataaa agacgaagaa atagctggag ataacctatc tctcaagtgg ggtgatacgg     1440

aatctatgat tggctggata aagagaatgg tatatagtga aggctttgga gcaaagatga     1500

caaatggttc atataggctt tgtgaaggtt atggagcacc ggagtattct atgacagtta     1560

aaaagcagga aattccagca tatgatccaa ggggaataca gggacacggt attacctatg     1620

cagttaataa tagaggaggc tgtcatatta agggatacat gattaaccct gaaatattag     1680

gttatcctga aaaacttgat agatttgcat tagatggtaa agcagcttat gccaaattat     1740

ttcatgattt aactgctgta attgattctt taggattgtg catattcact acatttgggc     1800

ttggaataca ggattatgta gatatgtata atgcagtagt aggagaatct acttatgatg     1860

cagattcact attagaggca ggagatagaa tctggactct tgagaaatta tttaatcttg     1920

cagctggaat agacagcagc caggatactc taccaaagag attgttagaa gaacctattc     1980

cagatggccc atcaaaggga gaagttcata ggctagatgt tcttctgcca gaatattact     2040

cagtacgagg atggagtaaa gagggtatac ctacagaaga aacattaaag aaattaggat     2100

tagatgaata tataggtaag ttctagtttg attcggtaaa ctagaaagca gactttatgt     2160

gttaaagaag atagcttctc tctatatatg aagtctgttt ttaatagaaa gatatgaatt     2220

tgagaataga agttagatta gttgcttata tatttgcaaa agtgttaagt tctgcttgga     2280

taagttcggg agatgaaatt tagttgttat gataaacttc aatgaattc                 2329


<210>  6
<211>  2155
<212>  DNA
<213>  Clostridium autoethanogenum

<400>  6
gattcttagt ataagtattc ttagtatctt tagcacttag aatacgttat cctttaggag       60

aataatccta atcagtagtt ctaataattt aatagtatac ttaaatagta tattttggag      120

gttttattat gtatggttat gatggtaaag tattaagaat taatttaaaa gaaagaactt      180

gcaaatcaga aaatttagat ttagataaag ctaaaaagtt tataggttgt aggggactag      240

gtgttaaaac tttatttgat gaaatagatc ctaaaataga tgcattatca ccagaaaata      300

aatttataat tgtaacaggt cctttaactg gagctccggt tccaactagt ggaaggttta      360

tggtagttac taaagcaccg cttacaggaa ctataggaat ttcaaattcg ggtggaaaat      420

ggggagtaga cttaaaaaaa gctggttggg atatgataat agtagaggat aaggctgatt      480

caccagttta cattgaaata gtagatgata aggtagaaat taaagacgcg tcacagcttt      540

ggggaaaagt tacatcagaa actacaaaag agttagaaaa gataactgag aataaatcaa      600

aggtattatg tataggacct gctggtgaac gattgtctct tatggcagca gttatgaatg      660

atgtagatag aactgcagca agaggcggcg ttggtgcagt tatgggatct aaaaacttaa      720

aagctattac agttaaagga actggaaaaa tagctttagc tgataaagaa aaagtaaaaa      780

aagtgtccgt agaaaaaatt acaacattaa aaaatgatcc agtagctggt cagggaatgc      840

caacttatgg tacagctata ctggttaata taataaatga aaatggagtt catcctgtaa      900

agaattttca agagtcttat acgaatcaag cagataaaat aagtggagag actcttactg      960

ctaaccaact agtaaggaaa aatccttgtt acagctgtcc tataggttgt ggaagatggg     1020

ttagactaaa agatggcaca gagtgcggag gaccagaata tgaaacactg tggtgttttg     1080

gatctgactg tggttcatat gatttagatg ctataaatga agctaatatg ttatgtaatg     1140

aatatggtat tgatactatt acttgtggtg caacaattgc tgcagctatg gaactttatc     1200

aaagaggata tataaaagac gaagaaatag ctggagataa cctatctctc aagtggggtg     1260

atacggaatc tatgattggc tggataaaga gaatggtata tagtgaaggc tttggagcaa     1320

agatgacaaa tggttcatat aggctttgtg aaggttatgg agcaccggag tattctatga     1380

cagttaaaaa gcaggaaatt ccagcatatg atccaagggg aatacaggga cacggtatta     1440

cctatgcagt taataataga ggaggctgtc atattaaggg atacatgatt aaccctgaaa     1500

tattaggtta tcctgaaaaa cttgatagat ttgcattaga tggtaaagca gcttatgcca     1560

aattatttca tgatttaact gctgtaattg attctttagg attgtgcata ttcactacat     1620

ttgggcttgg aatacaggat tatgtagata tgtataatgc agtagtagga gaatctactt     1680

atgatgcaga ttcactatta gaggcaggag atagaatctg gactcttgag aaattattta     1740

atcttgcagc tggaatagac agcagccagg atactctacc aaagagattg ttagaagaac     1800

ctattccaga tggcccatca aagggagaag ttcataggct agatgttctt ctgccagaat     1860

attactcagt acgaggatgg agtaaagagg gtatacctac agaagaaaca ttaaagaaat     1920

taggattaga tgaatatata ggtaagttct agtttgattc ggtaaactag aaagcagact     1980

ttatgtgtta aagaagatag cttctctcta tatatgaagt ctgtttttaa tagaaagata     2040

tgaatttgag aatagaagtt agattagttg cttatatatt tgcaaaagtg ttaagttctg     2100

cttggataag ttcgggagat gaaatttagt tgttatgata aacttcaatg aattc          2155


<210>  7
<211>  1371
<212>  DNA
<213>  Clostridium pharus

<400>  7
atgaggaagg tgtttttatt taaaatatta ataaattttt tggaaggggt tttaaaaatg       60

gtttttaaaa attggcagga tctttataaa agtaaatttg ttagtgcaga tgaagctgta      120

tctaaggtaa actgtgggga tactatagtt ttaggtaatg cttgtggagc acctcttaca      180

cttttagatg ctttggctgc aaataaggaa aagtataaga gtgtacagat atataatctt      240

atactgaact ataaaagtga tatatatgct gaaccaggtg cagaaaagta tattcatgga      300

aatacttttt ttgtaagtgg aggtactaag gaagctgtta attgtaacag aacagattat      360

accccatgct ttttttatga aataccaaaa ttaataaaac aaaaatatat acatatagat      420

gtagcattta ttcatgtaag taaacctgat aagcatggtt attgtagttt tggagtatca      480

actgattatt cacaggcaat ggtacagggt gctaaacttg taattgcaga agtaaacgat      540

caaatgccaa gagtttttgg agacaatttt atacacattt ctgatattga ttacatagta      600

aagacttcac gcccaattct tgagttggca cctcctaaaa taggagaagt agaaaaaaca      660

ataggaaaat attgtgcatc tcttatagaa gatggttcta cacttcaact tggaataggc      720

gctattccag atgcagtact tttgtttttg aaagacaaaa aagatttggg aatacactca      780

gaaatgatat ctgatggtgt tgtagaatta gttgaatcag gagtaattac aaataagaaa      840

aaagctcttc atccaggaaa aataattgtt acattcttaa tgggaactaa aaaattatat      900

gattttatag atgataatcc tatggtagaa ggttatccag tagattatgt aaatgatcct      960

aaagttatta tgcaaaattc caagatggta tgtataaatt cttgtgtaga ggtagatttt     1020

acaggacagg tgtgtgctga aagtatagga tttaagcaga taagcggagt aggcggacaa     1080

gttgattata tgagaggtgc tagcatgtct gatggaggaa aatcaattct tgctatacca     1140

tctactgcag ctggcggcaa aatttcaaga atagttccga tgttgactga aggagcagga     1200

gttactactt caagatatga tgttcaatat gttgttacag agtatggtat tgcacttctc     1260

aagggtaaat ccataagaga aagagctaaa gctcttataa acattgcaca tcctaaattt     1320

agagaacaat tagaaaaatc gtttgaagaa agatttagtt gtaaacttta a              1371


<210>  8
<211>  1314
<212>  DNA
<213>  Clostridium kluyveri

<400>  8
atggttttta aaaattggca ggatctttat aaaagtaaaa ttgttagtgc agacgaagct       60

gtatctaaag taagctgtgg agatagcata attttaggca atgcttgtgg agcatctctt      120

acacttttag atgccttggc tgcaaataag gaaaagtata agagtgtaaa gatacacaat      180

cttatactta attataaaaa tgatatatat actgatccgg aatcagaaaa gtatattcat      240

ggaaatactt tctttgtaag tggaggtaca aaggaagcag ttaattgtaa tagaacagat      300

tatactccat gcttttttta tgaaatacca aaattattaa aacaaaagta tataaatgca      360

gatgtagctt ttattcaagt aagtaagcct gatagccatg gatactgtag ctttggagta      420

tcaaccgatt attcacaggc aatggtacag tctgcaaagc ttataattgc agaagtaaac      480

gatcagatgc caagagtttt aggagacaat tttatacaca tttctgatat ggattacata      540

gtagaaagtt cacgtccaat tctagaattg actcctccta aaataggaga agtagagaag      600

acaataggaa aatactgtgc atctcttgta gaagatggtt ctacacttca gcttggaata      660

ggagctattc cagatgcagt acttttattc ttgaaggata aaaaggattt gggtatacat      720

tcagaaatga tatccgatgg tgttgttgaa ttagttgaag caggggtaat tacaaataag      780

aaaaagtccc ttcatccagg aaaaataatt attacattct taatgggaac taagaaatta      840

tatgatttca taaatgataa tcctatggta gaaggatacc ctgtagatta tgtaaatgat      900

cctaaggtta ttatgcaaaa ttctaagatg gtatgtataa actcctgtgt agaagtggat      960

ttcacaggac aagtgtgtgc tgaaagtgta ggatttaaac aaataagcgg tgtaggtgga     1020

caagttgatt acatgagagg agctagcatg gctgatggag gaaaatcaat tcttgctata     1080

ccatctactg cagctggcgg caaaatttca agaatagttc ctattttaac tgaaggagcg     1140

ggggttacta cttcaagata tgatgttcaa tatgttgtta cagaatatgg tattgcactt     1200

ctcaagggca aatccataag agaaagagct aaggagctta taaaaattgc acatcctaaa     1260

tttagggaag aattaacagc tcaatttgaa aaaagattca gttgtaagct ttaa           1314


<210>  9
<211>  1068
<212>  DNA
<213>  Clostridium carboxidivorans

<400>  9
atgagttata agatattagc aattaaccca ggatctactt ctacaaaaat agctttatac       60

gaagatgaaa aagaaatatt ttgcaaaacg ttagagcatc cagttgaaca aattgaaaaa      120

tatgagaatg tggcagatca atttgatatg agaaaagaag ttgttctttc atttttaaag      180

caaaatggat atgaagttaa agaattagct gcagttgttg gaagaggtgg aatggttcca      240

aaagtaaaat ctggagctta taaagttaat gaaacaatgg tagatagatt aaaaaataat      300

ccagtagtag aacatgcttc aaatttagga gctttaattg cttatgaaat agcaaattct      360

attggagtat cagcctatat atatgactct gttagagtag atgaattaga ggatatagct      420

cgtatatcag gtatgccgga tataccaaga acaagtacta gtcatgcatt aaatacaagg      480

gcaatggcaa tgaaggttgc aaaaaattat ggtaaaaagt attcagatat gaactttatt      540

gtagctcatc taggtggagg aatatcagta aatgttcata gaaaaggaca aatggtagat      600

ataatggcag atgacgaagg accattttca cctgaaagag ctggaaaagt tccttgcaat      660

gcacttatag atctttgcta ttcaggaaaa tttgataaaa aaactacgaa gaaaaaatta      720

aggggaaatg gtggattaaa agcttatctt aacactgttg atgctagaga agttgaaaga      780

atgattgaaa gtggagatga aaaagcaaag cttgtttatg aagctatggc ttatcaggtt      840

gctaagggaa taggagaact tgcaacagta gtagaaggta aggttgatgc tatcgttatt      900

acaggaggta tagcatattc tgatatgata actaactgga ttaaaaagcg tgtagagttt      960

attgcgcctg ttgagattat gcctggtgaa aatgaaatgg aatctttggc tttgggaact     1020

cttagagtgt taaagggtga agaagaagca agagaatatg ttgaataa                  1068


<210>  10
<211>  1113
<212>  DNA
<213>  Clostridium carboxidivorans

<400>  10
ttgctaatta aaatatttat taagtattgc tataatcagg agggtaaaat aatgtacaaa       60

atactagcaa taaatccagg ttcaacttca actaaaatag ctatttatga tgacacagag      120

gaattattta aaaccactat agaacattct agtgaagaag tgaaaaaata tgaaaacata      180

gctgatcaat atagtatgag atatgaagct ataatgaaat ttttaaaaga agtagatttt      240

gatgtcaaag ctttatctgc agtagttgga agaggaggaa ttctgcctcc agttaaatca      300

ggagcttaca gagtaaatga ttctatggta gaaagactgg ctaaaagacc tgtagtagag      360

catgcttcaa atttaggagc tataatttca tatgcaatag caaaaccttt aaatatacca      420

gctttcatat atgattctgt agctgtagat gaatttgagg atattgcaag aatatcagga      480

cttgcagata taaaaagaga gagttttatt catgctttaa atatgagagc tgcagcaata      540

aaaacagcaa aaaaactagg taaaccttat gaacaatgta atttagttgt tgctcattta      600

ggaggcggaa tatctcttac tgtacataaa ggtggaaaaa tgatagacgc tgttactgat      660

gaagaaggac cgttttcacc agaaaggtca ggtagagtac cttgtaagcg cttaatagaa      720

atgtgttata aaaatgatga acgcacaatg aaaaagaaaa taagaggaga tggtggatta      780

atctcttatt taggaactaa tagtgcatta gatgtagaaa aaagaattga aaatggagat      840

gctgaagcca aattagttta tgaagctatg gcatatcaaa ttgcaaaagc aataggagaa      900

cttgcaactg tagtaaaggg aaaggttgat gcagtagtaa ttacaggggg aattgcctat      960

tcaaaaatga tgacaggatg gataaaagaa agagtagaat ttatagcacc tgtagagata     1020

ttgccaggag aaaatgaatt agaatctctt gctttaggta cgcttagagt tataaaggga     1080

gaagaaaaag cacacgaata tgatttagat tag                                  1113


<210>  11
<211>  1071
<212>  DNA
<213>  Clostridium carboxidivorans

<400>  11
atgtcatata aattattaat attaaatcca ggatctacat ctaccaaaat aggagtatat       60

gatggagaaa atgaaatttt agaagaaact ttaagacatt cttcagaaga aattgagaaa      120

tatgctacta tttatgatca atttgaattt agaaaagaag ttatattgaa ggttttaaaa      180

gaaaagaatt ttgatattaa tacattagac ggagtagtag gcagaggtgg attattaaaa      240

ccaattgaaa gtggaactta taaagtcaat gatgctatgt tagaagacct aaaagttgga      300

gtgcaaggac agcatgcttc aaatttaggt ggaataatag ctaatgaaat aggaaaatct      360

ataaataaac cagcatttat agtagaccca gttgttgttg atgaattaga tgaagcagct      420

agaatatccg gaatgcctga aatagaaaga ataagtatat tccatgcttt aaatcaaaaa      480

gcagtagcaa agagatatgc aaaagaaaac aataagaagt atgatgaatt aaatttagta      540

gtgacacaca tgggtggcgg agtaactgtt ggagctcaca aaaaaggaag agttgtagat      600

gtagccaatg gtttagatgg agatggacca ttttcaccag aaagaacagg aggacttcct      660

gtaggaggtt taataaagct ttgctatagt ggaaaatata ctttagaaga aatgaagaaa      720

aagataagtg gaaaaggtgg aattgtagct tatctaaata caaatgattt tagggaagta      780

gaacaaaaag cagaaagtgg agataaaaag gcaaagttag tatttgatgc tttcatatta      840

caagtaggta aagaaattgg taaatgtgct gcagttttac atggaaaagt agatgcttta      900

attttaactg gaggaatagc ttatagtaaa actgttacag ctgcaataaa agacatggta      960

gaatttattg caccagttgt agtttatcca ggagaagatg aattattagc attagcacaa     1020

ggcggactta gagtactagg tggagaagaa caagcaaaag aatataagta a              1071


<210>  12
<211>  1068
<212>  DNA
<213>  Clostridium difficile

<400>  12
atgacttaca gaatattagc cataaatcca ggttctactt ctacaaaaat agcagtatat       60

gatggagaag aacaaattct tgtgaagacg atagaccatc cggctgaaga gattgcaaaa      120

tataatacta tacaagacca gtttgaaatg cgtaaggaag cagttttgaa tattcttaaa      180

gaaaatagta tagacttaaa atctcttagt gcaatagtag gaagaggtgg agttttacca      240

ccagtaaaat caggagcata tttagtaaat gaagaaatga ttgatgtact aagacataga      300

ccagtacttg aacacgcttc caatttaggt gctgttgtgg cacatgcaat atcagaacct      360

cttggaatca actcatatat ttatgattct gttgcagtag atgagcttat agatgtagcg      420

agaatatctg gactttgtgg aatggataga tcaagtgcag ggcatgcatt aaatactaga      480

gcaatggctt taaaatatgc taaggataaa ggaaaagatt ataagagctt aaacttaata      540

gtagctcaca ttggtggagg agtaagtatt tatcttcatg aaaaaggaag aatggttgat      600

atgctatctg atgatgaagg accattttct ccagaaaggt caggaagagt acctgctaca      660

aaattagtgg ctgcctgtta ttcaggtcaa tattcagaaa gagaaatgac taaaaagata      720

agaggtaaag gtggtatagt ttcataccta aatactgtag atgctagaga agttgaaaaa      780

atgatagcag aaggaaatga agaagcaaaa attatttatg aagcaatggc ttatcagtta      840

gcaaaaggta ttggagagtt agcaactgta gtagatggaa aggtagatgc tataattata      900

acaggtggaa ttgcatattc tgaaatgttt acttcaatgg ttaaaaagaa agttgagttt      960

atagcaccag tagaaattat ggcaggagaa aatgagttgg ataatcactt gcttttggaa     1020

ctttaagagt actaaatgga gaagaagaag ctagaattta tagtgaaa                  1068


<210>  13
<211>  172
<212>  DNA
<213>  Clostridium autoethanogenum

<400>  13
acttggatta tgtattttta caacatttgg tcttggtgca caggattatg ttgatatgta       60

taatgcagta gttggtggag aattacatga tgtaaattct ttaatgttag ctggagatag      120

aatatggact ttagaaaaaa tatttaactt aaaagcaggc atagatagtt ca              172


<210>  14
<211>  170
<212>  DNA
<213>  Clostridium autoethanogenum

<400>  14
aagaaagaac ttgcaaatca gaaaatttag atttagataa agctaaaaag tttataggtt       60

gtaggggact aggtgttaaa actttatttg atgaaataga tcctaaaata gatgcattat      120

caccagaaaa taaatttata attgtaacag gtcctttaac tggagctccg                 170


<210>  15
<211>  607
<212>  PRT
<213>  Clostridium ljungdahlii

<400>  15

Met Tyr Gly Tyr Lys Gly Lys Val Leu Arg Ile Asn Leu Ser Ser Lys 
1               5                   10                  15      


Thr Tyr Ile Val Glu Glu Leu Lys Ile Asp Lys Ala Lys Lys Phe Ile 
            20                  25                  30          


Gly Ala Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Val Asp Pro 
        35                  40                  45              


Lys Val Asp Pro Leu Ser Pro Asp Asn Lys Phe Ile Ile Ala Ala Gly 
    50                  55                  60                  


Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 
65                  70                  75                  80  


Thr Lys Ser Pro Leu Thr Gly Thr Ile Ala Ile Ala Asn Ser Gly Gly 
                85                  90                  95      


Lys Trp Gly Ala Glu Phe Lys Ala Ala Gly Tyr Asp Met Ile Ile Val 
            100                 105                 110         


Glu Gly Lys Ser Asp Lys Glu Val Tyr Val Asn Ile Val Asp Asp Lys 
        115                 120                 125             


Val Glu Phe Arg Asp Ala Ser His Val Trp Gly Lys Leu Thr Glu Glu 
    130                 135                 140                 


Thr Thr Lys Met Leu Gln Gln Glu Thr Asp Ser Arg Ala Lys Val Leu 
145                 150                 155                 160 


Cys Ile Gly Pro Ala Gly Glu Lys Leu Ser Leu Met Ala Ala Val Met 
                165                 170                 175     


Asn Asp Val Asp Arg Thr Ala Gly Arg Gly Gly Val Gly Ala Val Met 
            180                 185                 190         


Gly Ser Lys Asn Leu Lys Ala Ile Val Val Lys Gly Ser Gly Lys Val 
        195                 200                 205             


Lys Leu Phe Asp Glu Gln Lys Val Lys Glu Val Ala Leu Glu Lys Thr 
    210                 215                 220                 


Asn Ile Leu Arg Lys Asp Pro Val Ala Gly Gly Gly Leu Pro Thr Tyr 
225                 230                 235                 240 


Gly Thr Ala Val Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 
                245                 250                 255     


Val Lys Asn Phe Gln Lys Ser Tyr Thr Asp Gln Ala Asp Lys Ile Ser 
            260                 265                 270         


Gly Glu Thr Leu Thr Lys Asp Cys Leu Val Arg Lys Asn Pro Cys Tyr 
        275                 280                 285             


Arg Cys Pro Ile Ala Cys Gly Arg Trp Val Lys Leu Asp Asp Gly Thr 
    290                 295                 300                 


Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Ser Phe Gly Ser Asp 
305                 310                 315                 320 


Cys Asp Val Tyr Asp Ile Asn Ala Val Asn Thr Ala Asn Met Leu Cys 
                325                 330                 335     


Asn Glu Tyr Gly Leu Asp Thr Ile Thr Ala Gly Cys Thr Ile Ala Ala 
            340                 345                 350         


Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 
        355                 360                 365             


Ala Asp Gly Leu Ser Leu Asn Trp Gly Asp Ala Lys Ser Met Val Glu 
    370                 375                 380                 


Trp Val Lys Lys Met Gly Leu Arg Glu Gly Phe Gly Asp Lys Met Ala 
385                 390                 395                 400 


Asp Gly Ser Tyr Arg Leu Cys Asp Ser Tyr Gly Val Pro Glu Tyr Ser 
                405                 410                 415     


Met Thr Val Lys Lys Gln Glu Leu Pro Ala Tyr Asp Pro Arg Gly Ile 
            420                 425                 430         


Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 
        435                 440                 445             


Ile Lys Gly Tyr Met Val Ser Pro Glu Ile Leu Gly Tyr Pro Glu Lys 
    450                 455                 460                 


Leu Asp Arg Leu Ala Val Glu Gly Lys Ala Gly Tyr Ala Arg Val Phe 
465                 470                 475                 480 


His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 
                485                 490                 495     


Thr Phe Gly Leu Gly Ala Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 
            500                 505                 510         


Val Gly Gly Glu Leu His Asp Val Asn Ser Leu Met Leu Ala Gly Asp 
        515                 520                 525             


Arg Ile Trp Thr Leu Glu Lys Ile Phe Asn Leu Lys Ala Gly Ile Asp 
    530                 535                 540                 


Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Gln Ile Pro 
545                 550                 555                 560 


Glu Gly Pro Ser Lys Gly Glu Val His Lys Leu Asp Val Leu Leu Pro 
                565                 570                 575     


Glu Tyr Tyr Ser Val Arg Gly Trp Asp Lys Asn Gly Ile Pro Thr Glu 
            580                 585                 590         


Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Val Gly Lys Leu 
        595                 600                 605         


<210>  16
<211>  607
<212>  PRT
<213>  Clostridium ragsdalei

<400>  16

Met Tyr Gly Tyr Ser Gly Lys Val Leu Arg Ile Asn Leu Ser Asn Lys 
1               5                   10                  15      


Thr Tyr Lys Ala Glu Glu Leu Lys Ile Asp Glu Ala Lys Lys Phe Ile 
            20                  25                  30          


Gly Ala Arg Gly Leu Gly Val Lys Thr Leu Leu Asp Glu Ile Asp Pro 
        35                  40                  45              


Lys Ile Asp Pro Leu Ser Pro Asp Asn Lys Phe Ile Ile Ala Thr Gly 
    50                  55                  60                  


Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Ile 
65                  70                  75                  80  


Thr Lys Ala Pro Leu Thr Gly Thr Ile Gly Ile Ala Asn Ser Gly Gly 
                85                  90                  95      


Lys Trp Gly Ala Glu Leu Lys Thr Ala Gly Tyr Asp Met Val Ile Val 
            100                 105                 110         


Glu Gly Lys Ser Asp Lys Pro Val Tyr Val Asn Ile Val Asp Asp Lys 
        115                 120                 125             


Val Glu Phe Lys Asp Ala Ser His Val Trp Gly Lys Leu Thr Glu Glu 
    130                 135                 140                 


Thr Thr Lys Met Leu Gln Asn Glu Ile Asp Ala Lys Ala Lys Val Leu 
145                 150                 155                 160 


Cys Ile Gly Pro Ala Gly Glu Asn Leu Ser Leu Met Ala Ala Val Met 
                165                 170                 175     


Asn Asp Ile Asp Arg Thr Ala Gly Arg Gly Gly Val Gly Ala Val Met 
            180                 185                 190         


Gly Ser Lys Asn Leu Lys Ala Ile Val Val Lys Gly Ser Gly Lys Val 
        195                 200                 205             


Lys Leu Phe Asp Glu Glu Lys Val Lys Ala Val Ser Leu Gln Lys Ser 
    210                 215                 220                 


Asp Ile Leu Arg Lys Asp Pro Val Ala Gly Gly Gly Leu Pro Thr Tyr 
225                 230                 235                 240 


Gly Thr Ala Val Leu Val Asn Ile Ile Asn Glu Asn Gly Ile Asn Pro 
                245                 250                 255     


Val Arg Asn Phe Gln Glu Ser Tyr Thr Asp Glu Ala Asp Lys Val Ser 
            260                 265                 270         


Gly Glu Thr Met Thr Gln Glu Cys Leu Val Arg Lys Asn Pro Cys Tyr 
        275                 280                 285             


Arg Cys Pro Ile Ala Cys Gly Arg Trp Val Arg Leu Asp Asp Gly Thr 
    290                 295                 300                 


Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Ser Phe Gly Ser Asp 
305                 310                 315                 320 


Cys Asp Val Tyr Asp Leu Asn Ala Val Asn Lys Ala Asn Met Leu Cys 
                325                 330                 335     


Asn Glu Tyr Gly Leu Asp Thr Ile Ser Ala Gly Ala Thr Ile Ala Ser 
            340                 345                 350         


Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 
        355                 360                 365             


Ala Asp Gly Leu Ser Leu Lys Trp Gly Asp Ala Lys Ser Met Val Glu 
    370                 375                 380                 


Trp Val Lys Lys Met Gly Arg Arg Glu Gly Phe Gly Gly Lys Met Ala 
385                 390                 395                 400 


Asp Gly Ser Tyr Arg Leu Cys Glu Ser Tyr Gly Val Pro Gln Tyr Ser 
                405                 410                 415     


Met Ser Val Lys Lys Gln Glu Leu Pro Ala Tyr Asp Pro Arg Gly Ala 
            420                 425                 430         


Gln Gly His Gly Leu Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 
        435                 440                 445             


Ile Lys Gly Tyr Met Ile Ser Pro Glu Ile Leu Gly Tyr Pro Glu Lys 
    450                 455                 460                 


Leu Asp Arg Phe Ser Ile Glu Gly Lys Pro Ala Tyr Ala Lys Val Phe 
465                 470                 475                 480 


His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 
                485                 490                 495     


Thr Phe Gly Leu Gly Ala Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 
            500                 505                 510         


Val Gly Gly Glu Leu His Asp Val Asp Ser Leu Met Leu Ala Gly Asp 
        515                 520                 525             


Arg Val Trp Thr Leu Glu Lys Ile Phe Asn Leu Lys Ala Gly Val Gly 
    530                 535                 540                 


Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Glu Val Val 
545                 550                 555                 560 


Glu Gly Pro Ser Lys Gly His Val His Arg Leu Asp Glu Leu Val Pro 
                565                 570                 575     


Glu Tyr Tyr Ser Val Arg Gly Trp Asp Lys Asn Gly Val Pro Thr Glu 
            580                 585                 590         


Glu Thr Leu Lys Lys Leu Gly Leu Glu Glu Tyr Ile Gly Lys Ile 
        595                 600                 605         


<210>  17
<211>  607
<212>  PRT
<213>  Clostridium ljungdahlii

<400>  17

Met Tyr Gly Tyr Asp Gly Lys Val Leu Arg Ile Asn Leu Lys Glu Arg 
1               5                   10                  15      


Thr Cys Lys Ser Glu Asn Leu Asp Leu Asp Lys Ala Lys Lys Phe Ile 
            20                  25                  30          


Gly Cys Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Ile Asp Pro 
        35                  40                  45              


Lys Ile Asp Ala Leu Ser Pro Glu Asn Lys Phe Ile Ile Val Thr Gly 
    50                  55                  60                  


Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 
65                  70                  75                  80  


Thr Lys Ala Pro Leu Thr Gly Thr Ile Gly Ile Ser Asn Ser Gly Gly 
                85                  90                  95      


Lys Trp Gly Val Asp Leu Lys Lys Ala Gly Trp Asp Met Ile Ile Val 
            100                 105                 110         


Glu Asp Lys Ala Asp Ser Pro Val Tyr Ile Glu Ile Val Asp Asp Lys 
        115                 120                 125             


Val Glu Ile Lys Asp Ala Ser Gln Leu Trp Gly Lys Val Thr Ser Glu 
    130                 135                 140                 


Thr Thr Lys Glu Leu Glu Lys Ile Thr Glu Asn Lys Ser Lys Val Leu 
145                 150                 155                 160 


Cys Ile Gly Pro Ala Gly Glu Arg Leu Ser Leu Met Ala Ala Val Met 
                165                 170                 175     


Asn Asp Val Asp Arg Thr Ala Ala Arg Gly Gly Val Gly Ala Val Met 
            180                 185                 190         


Gly Ser Lys Asn Leu Lys Ala Ile Thr Val Lys Gly Thr Gly Lys Ile 
        195                 200                 205             


Ala Leu Ala Asp Lys Glu Lys Val Lys Lys Val Ser Val Glu Lys Ile 
    210                 215                 220                 


Thr Thr Leu Lys Asn Asp Pro Val Ala Gly Gln Gly Met Pro Thr Tyr 
225                 230                 235                 240 


Gly Thr Ala Ile Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 
                245                 250                 255     


Val Lys Asn Phe Gln Glu Ser Tyr Thr Asn Gln Ala Asp Lys Ile Ser 
            260                 265                 270         


Gly Glu Thr Leu Thr Ala Asn Gln Leu Val Arg Lys Asn Pro Cys Tyr 
        275                 280                 285             


Ser Cys Pro Ile Gly Cys Gly Arg Trp Val Arg Leu Lys Asp Gly Thr 
    290                 295                 300                 


Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Cys Phe Gly Ser Asp 
305                 310                 315                 320 


Cys Gly Ser Tyr Asp Leu Asp Ala Ile Asn Glu Ala Asn Met Leu Cys 
                325                 330                 335     


Asn Glu Tyr Gly Ile Asp Thr Ile Thr Cys Gly Ala Thr Ile Ala Ala 
            340                 345                 350         


Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 
        355                 360                 365             


Gly Asp Asn Leu Ser Leu Lys Trp Gly Asp Thr Glu Ser Met Ile Gly 
    370                 375                 380                 


Trp Ile Lys Arg Met Val Tyr Ser Glu Gly Phe Gly Ala Lys Met Thr 
385                 390                 395                 400 


Asn Gly Ser Tyr Arg Leu Cys Glu Gly Tyr Gly Ala Pro Glu Tyr Ser 
                405                 410                 415     


Met Thr Val Lys Lys Gln Glu Ile Pro Ala Tyr Asp Pro Arg Gly Ile 
            420                 425                 430         


Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 
        435                 440                 445             


Ile Lys Gly Tyr Met Ile Asn Pro Glu Ile Leu Gly Tyr Pro Glu Lys 
    450                 455                 460                 


Leu Asp Arg Phe Ala Leu Asp Gly Lys Ala Ala Tyr Ala Lys Leu Phe 
465                 470                 475                 480 


His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 
                485                 490                 495     


Thr Phe Gly Leu Gly Ile Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 
            500                 505                 510         


Val Gly Glu Ser Thr Tyr Asp Ala Asp Ser Leu Leu Glu Ala Gly Asp 
        515                 520                 525             


Arg Ile Trp Thr Leu Glu Lys Leu Phe Asn Leu Ala Ala Gly Ile Asp 
    530                 535                 540                 


Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Pro Ile Pro 
545                 550                 555                 560 


Asp Gly Pro Ser Lys Gly Glu Val His Arg Leu Asp Val Leu Leu Pro 
                565                 570                 575     


Glu Tyr Tyr Ser Val Arg Gly Trp Ser Lys Glu Gly Ile Pro Thr Glu 
            580                 585                 590         


Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Ile Gly Lys Phe 
        595                 600                 605         


<210>  18
<211>  607
<212>  PRT
<213>  Clostridium ragsdalei

<400>  18

Met Tyr Gly Tyr Asn Gly Lys Val Leu Arg Ile Asn Leu Lys Glu Arg 
1               5                   10                  15      


Thr Cys Lys Ser Glu Asn Leu Asp Leu Asp Lys Ala Lys Lys Phe Ile 
            20                  25                  30          


Gly Cys Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Ile Asp Pro 
        35                  40                  45              


Lys Ile Asp Ala Leu Ser Pro Glu Asn Lys Phe Ile Ile Val Thr Gly 
    50                  55                  60                  


Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 
65                  70                  75                  80  


Thr Lys Ala Pro Leu Thr Gly Thr Ile Gly Ile Ser Asn Ser Gly Gly 
                85                  90                  95      


Lys Trp Gly Val Asp Leu Lys Lys Ala Gly Trp Asp Met Ile Ile Val 
            100                 105                 110         


Glu Asp Lys Ala Asp Ser Pro Val Tyr Ile Glu Ile Val Asp Asp Lys 
        115                 120                 125             


Val Glu Ile Lys Asp Ala Ser Gln Leu Trp Gly Lys Val Thr Ser Glu 
    130                 135                 140                 


Thr Thr Lys Glu Leu Glu Lys Ile Thr Glu Asn Arg Ser Lys Val Leu 
145                 150                 155                 160 


Cys Ile Gly Pro Ala Gly Glu Arg Leu Ser Leu Met Ala Ala Val Met 
                165                 170                 175     


Asn Asp Val Asp Arg Thr Ala Ala Arg Gly Gly Val Gly Ala Val Met 
            180                 185                 190         


Gly Ser Lys Asn Leu Lys Ala Ile Thr Val Lys Gly Thr Gly Lys Ile 
        195                 200                 205             


Ala Leu Ala Asp Lys Glu Lys Val Lys Lys Val Ser Val Glu Lys Ile 
    210                 215                 220                 


Thr Thr Leu Lys Asn Asp Pro Val Ala Gly Gln Gly Met Pro Thr Tyr 
225                 230                 235                 240 


Gly Thr Ala Ile Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 
                245                 250                 255     


Val Asn Asn Phe Gln Glu Ser Tyr Thr Asp Gln Ala Asp Lys Ile Ser 
            260                 265                 270         


Gly Glu Thr Leu Thr Ala Asn Gln Leu Val Arg Lys Asn Pro Cys Tyr 
        275                 280                 285             


Ser Cys Pro Ile Gly Cys Gly Arg Trp Val Arg Leu Lys Asp Gly Thr 
    290                 295                 300                 


Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Cys Phe Gly Ser Asp 
305                 310                 315                 320 


Cys Gly Ser Tyr Asp Leu Asp Ala Ile Asn Glu Ala Asn Met Leu Cys 
                325                 330                 335     


Asn Glu Tyr Gly Ile Asp Thr Ile Thr Cys Gly Ala Thr Ile Ala Ala 
            340                 345                 350         


Ala Met Glu Leu Tyr Gln Arg Gly Tyr Val Lys Asp Glu Glu Ile Ala 
        355                 360                 365             


Gly Asp Asn Leu Ser Leu Lys Trp Gly Asp Thr Glu Ser Met Ile Gly 
    370                 375                 380                 


Trp Ile Lys Lys Met Val Tyr Ser Glu Gly Phe Gly Ala Lys Met Thr 
385                 390                 395                 400 


Asn Gly Ser Tyr Arg Leu Cys Glu Gly Tyr Gly Val Pro Glu Tyr Ser 
                405                 410                 415     


Met Thr Val Lys Lys Gln Glu Ile Pro Ala Tyr Asp Pro Arg Gly Ile 
            420                 425                 430         


Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 
        435                 440                 445             


Ile Lys Gly Tyr Met Ile Asn Pro Glu Ile Leu Gly Tyr Pro Glu Lys 
    450                 455                 460                 


Leu Asp Arg Phe Ala Leu Asp Gly Lys Ala Ala Tyr Ala Lys Met Met 
465                 470                 475                 480 


His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 
                485                 490                 495     


Thr Phe Gly Leu Gly Ile Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 
            500                 505                 510         


Val Gly Glu Ser Thr Cys Asp Ser Asp Ser Leu Leu Glu Ala Gly Asp 
        515                 520                 525             


Arg Val Trp Thr Leu Glu Lys Leu Phe Asn Leu Ala Ala Gly Ile Asp 
    530                 535                 540                 


Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Pro Ile Pro 
545                 550                 555                 560 


Asp Gly Pro Ser Lys Gly His Val His Arg Leu Asp Val Leu Leu Pro 
                565                 570                 575     


Glu Tyr Tyr Ser Val Arg Gly Trp Ser Lys Glu Gly Ile Pro Thr Glu 
            580                 585                 590         


Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Ile Gly Lys Phe 
        595                 600                 605         


<210>  19
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AOR1-A forward primer

<400>  19
acttggatta tgtattttta ca                                                22


<210>  20
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AOR1-A reverse primer

<400>  20
tgaactatct atgcctgctt tt                                                22


<210>  21
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AOR2-A forward primer

<400>  21
aagaaagaac ttgcaaatca                                                   20


<210>  22
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AOR2-A reverse primer

<400>  22
cggagctcca gttaaagga                                                    19


<210>  23
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  BCoAAT forward primer

<400>  23
agccatgcta gctcctctca tgta                                              24


<210>  24
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  BCoAAT reverse primer

<400>  24
ggagtatcaa ccgattattc acag                                              24


<210>  25
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Buk forward primer

<400>  25
gatatcattt ctgaatgtat accc                                              24


<210>  26
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Buk reverse primer

<400>  26
gatatcattt ctgaatgtat accc                                              24


<210>  27
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Iron-only hydrogenase forward primer

<400>  27
tgtgaacgtc ctgaaatgaa ag                                                22


<210>  28
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Iron-only hydrogenase reverse primer

<400>  28
agtgcctgca ccagaataag tt                                                22


<210>  29
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Hyg-1 forward primer

<400>  29
gcccgatata aatcctcttt                                                   20


<210>  30
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Hyg-1 reverse primer

<400>  30
ccaacaaaaa ttccatgatt                                                   20


<210>  31
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Hyg-2 forward primer

<400>  31
ctacaatttt aaacgctgca                                                   20


<210>  32
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Hyg-2 reverse primer

<400>  32
gctctggcac tgtttgttct a                                                 21


<210>  33
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Hyg-3 forward primer

<400>  33
tgatacaaac tttggtgcag                                                   20


<210>  34
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Hyg-3 reverse primer

<400>  34
atatagctcc agccatctga                                                   20


