                         SEQUENCE LISTING

<110>  The Trustees of Dartmouth College
       Shaw, Arthur J
       Lynd, Lee R
       Hogsett, David A
 
<120>  MODIFICATION OF HYDROGENASE ACTIVITIES IN THERMOPHILIC BACTERIA 
       TO ENHANCE ETHANOL PRODUCTION

<130>  483349

<150>  61/014,359
<151>  2007-12-17

<150>  61/049,238
<151>  2008-04-30

<160>  28    

<170>  PatentIn version 3.5

<210>  1
<211>  1764
<212>  DNA
<213>  Thermoanaerobacterium saccharolyticum

<400>  1
atgaaaggag tgcaaaacat ggataaagtt cgaataacta ttgatggaat tcctgcagaa       60

gtacctgcta actatacagt attgcaagct gcaaaatatg caaaaattga gattccgaca      120

ttatgctacc ttgaagagat aaacgaaata ggtgcttgca ggctatgcgt tgttgagata      180

aaaggcgtta gaaatttaca ggcatcttgt gtttatcctg taagcgacgg aatggaaata      240

tacacgaata ctcctcgtgt aagagaggca aggagatcta atttagagct tatactgtct      300

gcacacgaca gaagctgcct tacatgcgta agaagcggaa actgtgagtt gcaagattta      360

agtagaaagt ctggcataga tgaaataagg tttatgggcg aaaatataaa atatcaaaaa      420

gatgagtcgt ctccttccat cgtaagagac ccaaataaat gcgtattgtg tagaaggtgt      480

gttgctacct gcaacaatgt gcagaatgtt ttcgccatag gcatggttaa cagaggattt      540

aagactattg ttgcaccttc atttggcaga ggtctaaacg aatcaccatg tattagctgc      600

ggacagtgta tagaagcatg tcctgtcgga gcgatttatg aaaaagacca tacaaagatt      660

gtttacgatg cgcttttgga tgagaagaaa tacgttgtag ttcagacagc acctgctgtg      720

agagttgcac ttggtgaaga gtttggaatg ccttatggtt cgatagtgac agggaaaatg      780

gtatcagctt taaaaaggct tgggtttgac aaagtgtttg acacagactt tgctgcagat      840

ttaaccataa tagaagaagg aaatgaactt ttaaagaggc ttaacgaagg cggtaagctt      900

cctatgataa catcctgcag ccctggatgg ataaactatt gtgaaaggta ttatccagaa      960

tttatagaca atctttctac ttgcaaatcg cctcacatga tgatgggcgc aataataaag     1020

agctattttg cggaaaaaga aggaatagat ccaaaggata tcttcgttgt atcaattatg     1080

ccgtgtactg ccaagaagta tgagatagac aggcctcaaa tgatagtaga tggcatgaaa     1140

gatgtagatg ctgttttgac gacgagggag cttgctcgta tgataaaaca gtcaggcata     1200

gattttgtca acttgcctga cagcgaatac gacaatccgc tgggcgaatc atccggtgct     1260

ggtgtcatat tcggtgctac aggcggtgtc atggaagcgg ctttaagaac tgttgcagat     1320

atagttgaag gaaaagatat tgagaatttt gagtacgaag aagtaagagg attggaagga     1380

ataaaagaag cgaagattga cataggcgga aaagaaataa aaatagctgt agcaaatggc     1440

acagggaatg ctaagaaact cttagacaag ataaagaatg gcgaggcaga gtaccatttc     1500

atagaagtca tggggtgccc tggcggttgc ataatgggcg gcggacagcc aatacacaat     1560

ccaaatgaaa aagatttggt gaggaaaagt aggttaaaag ccatatatga agcagataaa     1620

gacttgccta tcagaaagtc tcacaaaaat ccaatgataa caaagctgta cgaagaattc     1680

ttaataagcc cattaggaga aaaatctcat cacttgcttc atacaaccta tagcaaaaaa     1740

gatctttatc ctatgaatga ttaa                                            1764


<210>  2
<211>  1791
<212>  DNA
<213>  Thermoanaerobacterium saccharolyticum

<400>  2
atgttatata gatcacatgt tatggtgtgc ggtggtactg gatgtacatc gtcaaattca       60

gatagaatag caaaatgctt tgaagaagaa attgcaaata aaggtttaga caaagaagtt      120

caggttgtaa gaactggatg ctttggactt tgtgagttgg gcccagttgt tgtcgtgtat      180

ccagaaggcg tgttttacag ctgtgtcaaa gaagaatatg ttccggaaat cgtggaagaa      240

caccttctaa aaggaagagt tgttaaaaag tatctttatg gagaaagcgt cacagaagaa      300

ggaatcaaac ctttagagga aacagcattt ttcaagaaac agcagagagt tgctttaaga      360

aactgtggtc ttataaaccc agaggatata aaagaagcaa ttgcatttga tggctataaa      420

gcattggcaa aggtattgac tgagatgacg cctgaggaag tcataaatga gattaaaaag      480

tcaggcttaa gaggtagagg tggtggtggc ttccctacag gtataaagtg ggaatttgct      540

tacaaccaaa aagagacgcc taagtacgtc gtttgtaatg ctgatgaagg ggatcctggt      600

gccttcatgg atagaagcgt attggaggga gatcctcaca gcgttttgga agctatggct      660

atagcaggat atgcaattgg tgctaaccat ggttatattt atgtaagggc tgaatatcct      720

cttgcagtaa agaggcttca aattgcgata gatcaagcaa gagaatacgg acttttaggc      780

aaaaatattt tcaatacggg atttgacttt gatatagaga taaggcttgg agcaggtgct      840

tttgtctgcg gtgaagagac tgcactttta aattctgtca tgggaaaacg cggtgaacca      900

aggccaaggc ctccattccc tgctgtaaaa ggcgtgtggg aaaaaccaac tatcataaac      960

aacgttgaaa cttatgcaaa tattcctgcg ataatattga atggtgcaga atggttcgca     1020

agtataggca ctgaaaaatc taaaggcaca aaggtatttg ctcttggcgg aaaaatcaac     1080

aatactggct tggtagaaat acctatgggt acaaccctga gagagatcat atttgaaata     1140

ggtggcggaa taccaaatgg caagaaattc aaagcagctc aaactggtgg accatctggt     1200

ggatgcattc ctgcggagca tttagataca cctattgact atgattcgct tcttaatatt     1260

ggttccatga tgggttcagg tggacttatc gtaatggacg aagacaactg tatggttgat     1320

attgcaaaat tcttcttgga atttaccgtt gatgaatcat gtggcaaatg ctcaccatgt     1380

cgcataggta cgagaagaat gttggaactg cttaataaga taacatcagg aaagggcgaa     1440

gaaggagata tcgagaaact tgaaactctt gctaattcca taaaggcgtc ttctttgtgt     1500

ggattaggtc aaacagctcc taaccctgtt ctttccacta taaggtattt tagagatgaa     1560

tatgaggcgc acataaagga gaaaaggtgt cctgcaggtg tttgccaggc acttctgaaa     1620

tttagaattg atccagataa atgtaaggga tgcggcatat gtgccaagaa ttgtcctaca     1680

aacgccatat ctggaaaagt aaagcagcct catgtgatag atcaagataa atgtataaaa     1740

tgtggaacat gtatggataa atgtccgttt gatgctatat acaagaaata g              1791


<210>  3
<211>  483
<212>  DNA
<213>  Thermoanaerobacterium saccharolyticum

<400>  3
atgcaggcaa tctacgaaaa attcagcgaa gaaaatataa ataagttaaa aaaagtgata       60

gaccaattga aagatacaga cggttctttg attgctgtca tgaatgaagc tcaagaaata      120

tttggctatt tgcctataga agttcagcaa tttatttcag aagaaatgaa tgtaccattg      180

acagagatat ttggaatcgc gactttttac tcacgtttca cattaaagcc atccgggaag      240

tataaaatcg gcgtttgcct tggcactgct tgttacgtaa aaggttctgc gatggtatta      300

gacaaattaa aagagaagct tggcataagc gtaggagacg tgacaggtga tggcaagttt      360

tcacttgaag cgactcgctg tttaggtgct tgcggtcttg cacctgtaat gatgataaac      420

ggagaagttt ttggcagatt gacacctgat gatgttgaag atatattgaa gaaatttgat      480

taa                                                                    483


<210>  4
<211>  387
<212>  DNA
<213>  Thermoanaerobacterium saccharolyticum

<400>  4
ttgtgtcatg gaggtgtaaa tatgaaatct atagaggaat tagaaaaaat aagaaaagag       60

acattggaaa aggtaaatct tcgtaaagat agaaacggca taagaattac ggtcggcatg      120

gctacgtgtg gtatagctgc tggcgcaagg ccagttatga tggctatatt agatgagctt      180

ggcaagagaa atattacgga tgtagttgtt gctgagactg gttgtatcgg catgtgcaaa      240

tatgagccta tggtagatgt ttatgttcct ggacaagaaa aagttacgta tataaaagtt      300

gatgaaaaca aggcaaggca gatagttgcg gaacatgtag ttaacggaca tccgattaaa      360

gaatggacta ttagtagtgt tgaataa                                          387


<210>  5
<211>  1716
<212>  DNA
<213>  Thermoanaerobacterium saccharolyticum

<400>  5
atgagtgtca ttaatttcaa agaagccaat tgcagaaact gctataaatg cattagatat       60

tgccctgtaa aagcgataaa agtcaatgat gaacaggctg aaatcataga atacaggtgc      120

atagcatgcg gaagatgctt aaatatctgt cctcagaatg caaaaacagt tagatcagac      180

gtagaaagag ttcaatcttt tttaaataaa ggagaaaaag ttgctttcac tgtagctcca      240

tcatatcctg ctcttgttgg acatgatggt gctttgaact ttttaaaggc tttaaaaagt      300

ttaggagccg aaatgatagt tgagacatca gtaggtgcta tgcttatatc taaggagtat      360

gaaaggtatt ataatgattt gaaatatgac aatttgatta ctacttcatg tccatcggta      420

aattatttgg ttgaaaaata ctaccctgat cttataaaat gccttgtgcc agttgtatcg      480

ccgatggtgg ctgttggaag agctataaaa aatatacacg gtgaaggtgt gaaagtcgta      540

tttataggcc cgtgccttgc taaaaaagca gagatgagcg attttagctg tgaaggcgct      600

atagatgctg tattgacttt tgaagaagta atgaatttgt ttaatacaaa taaaataggt      660

gttgaatgca cgaaagagaa tttagaagat gttgactctg aaagccgatt taaattgtat      720

ccaatagagg gcaaaaccat ggattgcatg gatgttgatt taaatttaag aaaatttatc      780

tctgtatcat cgatagaaaa tgtgaaagat attttaaatg atttaagagc tggcaatcta      840

cacggatatt ggatagaagc taatgcctgt gatggaggct gcatcaatgg ccctgcattt      900

ggaaagttag aaagtggtat tgcaaaaaga aaagaagaag ttataagcta ttctcgcatg      960

aaagaaaggt ttagcggtga tttcagcggc attaccgatt tttccttaga tttaagcaga     1020

aagtttattg atttaagtga tagatggaaa atgccaagcg agatggagat aaaagagata     1080

ttgtcgaaga ttggcaagtt ttctgtagaa gacgaattga attgcggtgc atgtggctat     1140

gacacttgca gggaaaaggc tattgcagtc tttaacggaa tggcggaacc gtatatgtgc     1200

ttgccatata tgagagggag ggctgaaacg ctgtctaata tcataataag ttctactcca     1260

aacgctataa ttgcagttaa taatgagtat gaaattcaag atatgaatag agcgtttgag     1320

aagatgtttt tggtaaattc agccatggtt aaaggtgaag atttatcgtt gatctttgat     1380

atatctgatt ttgtagaggt tattgaaaat aagaaaagca tttttaataa aaaagtttcg     1440

tttaaaaatt acggaatcat agcattggaa agcatctact atttggaaga atataaaatt     1500

gccattggaa tttttacaga tataacaaag atggagaaac aaaaggagag cttctcaaag     1560

cttaaaaggg aaaactacca attggcgcag caagtgatag atagacagat gaaggttgca     1620

caagagatag caagcttgtt aggagaaacg actgcggaga caaaagtgat actgactaag     1680

atgaaagata tgctgttaaa tcaaggtgat gatgaa                               1716


<210>  6
<211>  1158
<212>  DNA
<213>  Thermoanaerobacterium saccharolyticum

<400>  6
atgagtcatt acatcgatat tgcacatgca tcattgaata aatacgatga agaactgtgt       60

ggagatagtg ttcaaataat aagaaagaaa gattatgcaa tggcagttat ggcagatggc      120

cttggcagcg gtgttaaggc gaatattcta tctactttga caacgcgaat agtgtcaaaa      180

atgttggata tgggttctga gctaagagat gttgtagaaa cggtggctga gacattgcca      240

atatgcaaag aaagaaatat agcgtattca acatttactg ttgtttctat atatggggac      300

aatgctcatt tagttgaata tgacaatcca tcggtttttt attttaagaa tggtgtgcat      360

aagaaggtcg atagaaaatg tgttgaaata ggtgataaga aaatctttga aagcagcttc      420

aaattggatt tgaatgatgc gctgatagtt gtatctgatg gagtaattca tgcaggcgta      480

ggagggatat taaatcttgg ttggcaatgg gataatgtta aacaatattt atcaaaagta      540

ttggaagttt acagcgatgc atcagatatc tgttcacaac ttataacaac ctgcaataat      600

ttgtacaaaa ataggccagg cgatgataca actgcaatag tgataaaagt taacgaatct      660

aaaaaagtta cggtaatggt aggaccgccg attttaaaga atatggatga atgggttgtt      720

aaaaaactca tgaaaagtga aggcttaaag gtagtatgtg gtggtaccgc tgcaaaaatt      780

gtaagcagga ttttaaataa agacgtgatt acatctaccg agtatattga tcctgatata      840

cctccttatg cacatattga tgggattgat ctggtgacag agggcgtatt gactttaaga      900

aagactgttg aaattttcaa agaatacatg aatgataaag actcaaattt gctgagattt      960

tcaaaaaaag atgctgcaac tcgattattt aaaatcttaa attacgctac tgacgtaaat     1020

ttcttagtag gccaggctgt aaacagtgcc catcaaaatc ctgattttcc atccgatctt     1080

agaataaagg tcaggattgt ggaagaactt ataagcttat tagagagatt aaataaaaat     1140

gtggaagtaa attatttt                                                   1158


<210>  7
<211>  1485
<212>  DNA
<213>  Thermoanaerobacterium saccharolyticum

<400>  7
atgttaaagt acgaggtgct ttacaacgta gctaaattga cgcttgaaga taggcttgaa       60

gatgaatacg acgaaatacc ttacgagata ataccgggaa caaaaccgag gtttaggtgt      120

tgcgtgtata aggaaagggc tataattgag cagagaacta aagtcgcaat ggggaaaaat      180

ttaaagcgca ctatgaaaca tgcagttgac ggtgaagagc cgataattca agttttagat      240

attgcctgtg aggagtgtcc tatcaaaagg tatcgtgtaa ctgaagcttg tagagggtgt      300

attactcata ggtgtacaga agtatgtcca aaaggagcca taacgataat aaacaaaaag      360

gccaacatcg actacgacaa gtgcatagag tgtggcaggt gcaaagatgc gtgtccatac      420

aatgctattt ctgacaattt gaggccgtgt attagatctt gttcagcaaa ggccataact      480

atggatgaag aattgaaagc tgccataaat tacgaaaaat gtacttcgtg tggtgcttgc      540

acattggcat gtccattcgg agccataacc gataagtctt atattgtaga cattataagg      600

gcgattaaga gcgggaaaaa agtttatgca ttggtagcgc cagccatagc atcccaattt      660

aaggatgtaa ctgtaggaca gataaaatct gctttaaaag aatttggatt tgttgatgtg      720

attgaagttg ctcttggcgc agattttgta gctatggaag aagccaaaga attcagccat      780

aaaataaaag acataaaagt catgacgagt tcatgttgtc ctgcatttgt ggcacacata      840

aagaaaagtt atcctgagct atcgcaaaat atatcgacaa ctgtatctcc aatgacagct      900

atatcgaaat acatcaaaaa acacgatcct atggcagtga cagtatttat aggtccatgt      960

actgcaaaga aatcggaagt catgagagat gatgtaaagg gcataacgga ttttgccatg     1020

acatttgaag agatggttgc tgtgttggat gcggcaaaaa tagacatgaa agaacagcaa     1080

gatgtggaag tggatgatgc tacgcttttt ggaagaaagt ttgcaagatc tggaggcgtc     1140

ttagaggctg tggttgaagc cgttaaagaa ataggcgcgg atgttgaagt aaaccctgta     1200

gtatgcaatg ggcttgatga atgcaacaag acattgaaaa taatgaaagc tggcaaattg     1260

ccaaacaatt ttatagaagg catggcttgc atcggaggat gtataggcgg tgcaggcgta     1320

ataaataaca atgtaaatca ggcaaaattg gctgttaaca aatttggcga ttcatcttac     1380

cataaaagca taaaagatag aatcagccaa tttgatactg atgacgttga tttccatgtt     1440

gacagcggtg aagatgagtc aagtgaaaca tcgtttaaag aagct                     1485


<210>  8
<211>  243
<212>  DNA
<213>  Thermoanaerobacterium saccharolyticum

<400>  8
atggttatta ctgtttgtgt aggaagttca tgccacttaa aaggttccta cgatgttata       60

aacaaattaa aagaaatgat taaaaattac ggtattgagg ataaagtgga gttgaaggct      120

gacttttgca tgggcaattg tttaagggcg gtttctgtaa aaattgatgg cggtgcgtgt      180

ttatcaataa aaccaaatag cgttgagaga ttttttaaag aacatgtttt aggtgaacta      240

aaa                                                                    243


<210>  9
<211>  587
<212>  PRT
<213>  Thermoanaerobacterium saccharolyticum

<400>  9

Met Lys Gly Val Gln Asn Met Asp Lys Val Arg Ile Thr Ile Asp Gly 
1               5                   10                  15      


Ile Pro Ala Glu Val Pro Ala Asn Tyr Thr Val Leu Gln Ala Ala Lys 
            20                  25                  30          


Tyr Ala Lys Ile Glu Ile Pro Thr Leu Cys Tyr Leu Glu Glu Ile Asn 
        35                  40                  45              


Glu Ile Gly Ala Cys Arg Leu Cys Val Val Glu Ile Lys Gly Val Arg 
    50                  55                  60                  


Asn Leu Gln Ala Ser Cys Val Tyr Pro Val Ser Asp Gly Met Glu Ile 
65                  70                  75                  80  


Tyr Thr Asn Thr Pro Arg Val Arg Glu Ala Arg Arg Ser Asn Leu Glu 
                85                  90                  95      


Leu Ile Leu Ser Ala His Asp Arg Ser Cys Leu Thr Cys Val Arg Ser 
            100                 105                 110         


Gly Asn Cys Glu Leu Gln Asp Leu Ser Arg Lys Ser Gly Ile Asp Glu 
        115                 120                 125             


Ile Arg Phe Met Gly Glu Asn Ile Lys Tyr Gln Lys Asp Glu Ser Ser 
    130                 135                 140                 


Pro Ser Ile Val Arg Asp Pro Asn Lys Cys Val Leu Cys Arg Arg Cys 
145                 150                 155                 160 


Val Ala Thr Cys Asn Asn Val Gln Asn Val Phe Ala Ile Gly Met Val 
                165                 170                 175     


Asn Arg Gly Phe Lys Thr Ile Val Ala Pro Ser Phe Gly Arg Gly Leu 
            180                 185                 190         


Asn Glu Ser Pro Cys Ile Ser Cys Gly Gln Cys Ile Glu Ala Cys Pro 
        195                 200                 205             


Val Gly Ala Ile Tyr Glu Lys Asp His Thr Lys Ile Val Tyr Asp Ala 
    210                 215                 220                 


Leu Leu Asp Glu Lys Lys Tyr Val Val Val Gln Thr Ala Pro Ala Val 
225                 230                 235                 240 


Arg Val Ala Leu Gly Glu Glu Phe Gly Met Pro Tyr Gly Ser Ile Val 
                245                 250                 255     


Thr Gly Lys Met Val Ser Ala Leu Lys Arg Leu Gly Phe Asp Lys Val 
            260                 265                 270         


Phe Asp Thr Asp Phe Ala Ala Asp Leu Thr Ile Ile Glu Glu Gly Asn 
        275                 280                 285             


Glu Leu Leu Lys Arg Leu Asn Glu Gly Gly Lys Leu Pro Met Ile Thr 
    290                 295                 300                 


Ser Cys Ser Pro Gly Trp Ile Asn Tyr Cys Glu Arg Tyr Tyr Pro Glu 
305                 310                 315                 320 


Phe Ile Asp Asn Leu Ser Thr Cys Lys Ser Pro His Met Met Met Gly 
                325                 330                 335     


Ala Ile Ile Lys Ser Tyr Phe Ala Glu Lys Glu Gly Ile Asp Pro Lys 
            340                 345                 350         


Asp Ile Phe Val Val Ser Ile Met Pro Cys Thr Ala Lys Lys Tyr Glu 
        355                 360                 365             


Ile Asp Arg Pro Gln Met Ile Val Asp Gly Met Lys Asp Val Asp Ala 
    370                 375                 380                 


Val Leu Thr Thr Arg Glu Leu Ala Arg Met Ile Lys Gln Ser Gly Ile 
385                 390                 395                 400 


Asp Phe Val Asn Leu Pro Asp Ser Glu Tyr Asp Asn Pro Leu Gly Glu 
                405                 410                 415     


Ser Ser Gly Ala Gly Val Ile Phe Gly Ala Thr Gly Gly Val Met Glu 
            420                 425                 430         


Ala Ala Leu Arg Thr Val Ala Asp Ile Val Glu Gly Lys Asp Ile Glu 
        435                 440                 445             


Asn Phe Glu Tyr Glu Glu Val Arg Gly Leu Glu Gly Ile Lys Glu Ala 
    450                 455                 460                 


Lys Ile Asp Ile Gly Gly Lys Glu Ile Lys Ile Ala Val Ala Asn Gly 
465                 470                 475                 480 


Thr Gly Asn Ala Lys Lys Leu Leu Asp Lys Ile Lys Asn Gly Glu Ala 
                485                 490                 495     


Glu Tyr His Phe Ile Glu Val Met Gly Cys Pro Gly Gly Cys Ile Met 
            500                 505                 510         


Gly Gly Gly Gln Pro Ile His Asn Pro Asn Glu Lys Asp Leu Val Arg 
        515                 520                 525             


Lys Ser Arg Leu Lys Ala Ile Tyr Glu Ala Asp Lys Asp Leu Pro Ile 
    530                 535                 540                 


Arg Lys Ser His Lys Asn Pro Met Ile Thr Lys Leu Tyr Glu Glu Phe 
545                 550                 555                 560 


Leu Ile Ser Pro Leu Gly Glu Lys Ser His His Leu Leu His Thr Thr 
                565                 570                 575     


Tyr Ser Lys Lys Asp Leu Tyr Pro Met Asn Asp 
            580                 585         


<210>  10
<211>  596
<212>  PRT
<213>  Thermoanaerobacterium saccharolyticum

<400>  10

Met Leu Tyr Arg Ser His Val Met Val Cys Gly Gly Thr Gly Cys Thr 
1               5                   10                  15      


Ser Ser Asn Ser Asp Arg Ile Ala Lys Cys Phe Glu Glu Glu Ile Ala 
            20                  25                  30          


Asn Lys Gly Leu Asp Lys Glu Val Gln Val Val Arg Thr Gly Cys Phe 
        35                  40                  45              


Gly Leu Cys Glu Leu Gly Pro Val Val Val Val Tyr Pro Glu Gly Val 
    50                  55                  60                  


Phe Tyr Ser Cys Val Lys Glu Glu Tyr Val Pro Glu Ile Val Glu Glu 
65                  70                  75                  80  


His Leu Leu Lys Gly Arg Val Val Lys Lys Tyr Leu Tyr Gly Glu Ser 
                85                  90                  95      


Val Thr Glu Glu Gly Ile Lys Pro Leu Glu Glu Thr Ala Phe Phe Lys 
            100                 105                 110         


Lys Gln Gln Arg Val Ala Leu Arg Asn Cys Gly Leu Ile Asn Pro Glu 
        115                 120                 125             


Asp Ile Lys Glu Ala Ile Ala Phe Asp Gly Tyr Lys Ala Leu Ala Lys 
    130                 135                 140                 


Val Leu Thr Glu Met Thr Pro Glu Glu Val Ile Asn Glu Ile Lys Lys 
145                 150                 155                 160 


Ser Gly Leu Arg Gly Arg Gly Gly Gly Gly Phe Pro Thr Gly Ile Lys 
                165                 170                 175     


Trp Glu Phe Ala Tyr Asn Gln Lys Glu Thr Pro Lys Tyr Val Val Cys 
            180                 185                 190         


Asn Ala Asp Glu Gly Asp Pro Gly Ala Phe Met Asp Arg Ser Val Leu 
        195                 200                 205             


Glu Gly Asp Pro His Ser Val Leu Glu Ala Met Ala Ile Ala Gly Tyr 
    210                 215                 220                 


Ala Ile Gly Ala Asn His Gly Tyr Ile Tyr Val Arg Ala Glu Tyr Pro 
225                 230                 235                 240 


Leu Ala Val Lys Arg Leu Gln Ile Ala Ile Asp Gln Ala Arg Glu Tyr 
                245                 250                 255     


Gly Leu Leu Gly Lys Asn Ile Phe Asn Thr Gly Phe Asp Phe Asp Ile 
            260                 265                 270         


Glu Ile Arg Leu Gly Ala Gly Ala Phe Val Cys Gly Glu Glu Thr Ala 
        275                 280                 285             


Leu Leu Asn Ser Val Met Gly Lys Arg Gly Glu Pro Arg Pro Arg Pro 
    290                 295                 300                 


Pro Phe Pro Ala Val Lys Gly Val Trp Glu Lys Pro Thr Ile Ile Asn 
305                 310                 315                 320 


Asn Val Glu Thr Tyr Ala Asn Ile Pro Ala Ile Ile Leu Asn Gly Ala 
                325                 330                 335     


Glu Trp Phe Ala Ser Ile Gly Thr Glu Lys Ser Lys Gly Thr Lys Val 
            340                 345                 350         


Phe Ala Leu Gly Gly Lys Ile Asn Asn Thr Gly Leu Val Glu Ile Pro 
        355                 360                 365             


Met Gly Thr Thr Leu Arg Glu Ile Ile Phe Glu Ile Gly Gly Gly Ile 
    370                 375                 380                 


Pro Asn Gly Lys Lys Phe Lys Ala Ala Gln Thr Gly Gly Pro Ser Gly 
385                 390                 395                 400 


Gly Cys Ile Pro Ala Glu His Leu Asp Thr Pro Ile Asp Tyr Asp Ser 
                405                 410                 415     


Leu Leu Asn Ile Gly Ser Met Met Gly Ser Gly Gly Leu Ile Val Met 
            420                 425                 430         


Asp Glu Asp Asn Cys Met Val Asp Ile Ala Lys Phe Phe Leu Glu Phe 
        435                 440                 445             


Thr Val Asp Glu Ser Cys Gly Lys Cys Ser Pro Cys Arg Ile Gly Thr 
    450                 455                 460                 


Arg Arg Met Leu Glu Leu Leu Asn Lys Ile Thr Ser Gly Lys Gly Glu 
465                 470                 475                 480 


Glu Gly Asp Ile Glu Lys Leu Glu Thr Leu Ala Asn Ser Ile Lys Ala 
                485                 490                 495     


Ser Ser Leu Cys Gly Leu Gly Gln Thr Ala Pro Asn Pro Val Leu Ser 
            500                 505                 510         


Thr Ile Arg Tyr Phe Arg Asp Glu Tyr Glu Ala His Ile Lys Glu Lys 
        515                 520                 525             


Arg Cys Pro Ala Gly Val Cys Gln Ala Leu Leu Lys Phe Arg Ile Asp 
    530                 535                 540                 


Pro Asp Lys Cys Lys Gly Cys Gly Ile Cys Ala Lys Asn Cys Pro Thr 
545                 550                 555                 560 


Asn Ala Ile Ser Gly Lys Val Lys Gln Pro His Val Ile Asp Gln Asp 
                565                 570                 575     


Lys Cys Ile Lys Cys Gly Thr Cys Met Asp Lys Cys Pro Phe Asp Ala 
            580                 585                 590         


Ile Tyr Lys Lys 
        595     


<210>  11
<211>  160
<212>  PRT
<213>  Thermoanaerobacterium saccharolyticum

<400>  11

Met Gln Ala Ile Tyr Glu Lys Phe Ser Glu Glu Asn Ile Asn Lys Leu 
1               5                   10                  15      


Lys Lys Val Ile Asp Gln Leu Lys Asp Thr Asp Gly Ser Leu Ile Ala 
            20                  25                  30          


Val Met Asn Glu Ala Gln Glu Ile Phe Gly Tyr Leu Pro Ile Glu Val 
        35                  40                  45              


Gln Gln Phe Ile Ser Glu Glu Met Asn Val Pro Leu Thr Glu Ile Phe 
    50                  55                  60                  


Gly Ile Ala Thr Phe Tyr Ser Arg Phe Thr Leu Lys Pro Ser Gly Lys 
65                  70                  75                  80  


Tyr Lys Ile Gly Val Cys Leu Gly Thr Ala Cys Tyr Val Lys Gly Ser 
                85                  90                  95      


Ala Met Val Leu Asp Lys Leu Lys Glu Lys Leu Gly Ile Ser Val Gly 
            100                 105                 110         


Asp Val Thr Gly Asp Gly Lys Phe Ser Leu Glu Ala Thr Arg Cys Leu 
        115                 120                 125             


Gly Ala Cys Gly Leu Ala Pro Val Met Met Ile Asn Gly Glu Val Phe 
    130                 135                 140                 


Gly Arg Leu Thr Pro Asp Asp Val Glu Asp Ile Leu Lys Lys Phe Asp 
145                 150                 155                 160 


<210>  12
<211>  128
<212>  PRT
<213>  Thermoanaerobacterium saccharolyticum

<400>  12

Met Cys His Gly Gly Val Asn Met Lys Ser Ile Glu Glu Leu Glu Lys 
1               5                   10                  15      


Ile Arg Lys Glu Thr Leu Glu Lys Val Asn Leu Arg Lys Asp Arg Asn 
            20                  25                  30          


Gly Ile Arg Ile Thr Val Gly Met Ala Thr Cys Gly Ile Ala Ala Gly 
        35                  40                  45              


Ala Arg Pro Val Met Met Ala Ile Leu Asp Glu Leu Gly Lys Arg Asn 
    50                  55                  60                  


Ile Thr Asp Val Val Val Ala Glu Thr Gly Cys Ile Gly Met Cys Lys 
65                  70                  75                  80  


Tyr Glu Pro Met Val Asp Val Tyr Val Pro Gly Gln Glu Lys Val Thr 
                85                  90                  95      


Tyr Ile Lys Val Asp Glu Asn Lys Ala Arg Gln Ile Val Ala Glu His 
            100                 105                 110         


Val Val Asn Gly His Pro Ile Lys Glu Trp Thr Ile Ser Ser Val Glu 
        115                 120                 125             


<210>  13
<211>  572
<212>  PRT
<213>  Thermoanaerobacterium saccharolyticum

<400>  13

Met Ser Val Ile Asn Phe Lys Glu Ala Asn Cys Arg Asn Cys Tyr Lys 
1               5                   10                  15      


Cys Ile Arg Tyr Cys Pro Val Lys Ala Ile Lys Val Asn Asp Glu Gln 
            20                  25                  30          


Ala Glu Ile Ile Glu Tyr Arg Cys Ile Ala Cys Gly Arg Cys Leu Asn 
        35                  40                  45              


Ile Cys Pro Gln Asn Ala Lys Thr Val Arg Ser Asp Val Glu Arg Val 
    50                  55                  60                  


Gln Ser Phe Leu Asn Lys Gly Glu Lys Val Ala Phe Thr Val Ala Pro 
65                  70                  75                  80  


Ser Tyr Pro Ala Leu Val Gly His Asp Gly Ala Leu Asn Phe Leu Lys 
                85                  90                  95      


Ala Leu Lys Ser Leu Gly Ala Glu Met Ile Val Glu Thr Ser Val Gly 
            100                 105                 110         


Ala Met Leu Ile Ser Lys Glu Tyr Glu Arg Tyr Tyr Asn Asp Leu Lys 
        115                 120                 125             


Tyr Asp Asn Leu Ile Thr Thr Ser Cys Pro Ser Val Asn Tyr Leu Val 
    130                 135                 140                 


Glu Lys Tyr Tyr Pro Asp Leu Ile Lys Cys Leu Val Pro Val Val Ser 
145                 150                 155                 160 


Pro Met Val Ala Val Gly Arg Ala Ile Lys Asn Ile His Gly Glu Gly 
                165                 170                 175     


Val Lys Val Val Phe Ile Gly Pro Cys Leu Ala Lys Lys Ala Glu Met 
            180                 185                 190         


Ser Asp Phe Ser Cys Glu Gly Ala Ile Asp Ala Val Leu Thr Phe Glu 
        195                 200                 205             


Glu Val Met Asn Leu Phe Asn Thr Asn Lys Ile Gly Val Glu Cys Thr 
    210                 215                 220                 


Lys Glu Asn Leu Glu Asp Val Asp Ser Glu Ser Arg Phe Lys Leu Tyr 
225                 230                 235                 240 


Pro Ile Glu Gly Lys Thr Met Asp Cys Met Asp Val Asp Leu Asn Leu 
                245                 250                 255     


Arg Lys Phe Ile Ser Val Ser Ser Ile Glu Asn Val Lys Asp Ile Leu 
            260                 265                 270         


Asn Asp Leu Arg Ala Gly Asn Leu His Gly Tyr Trp Ile Glu Ala Asn 
        275                 280                 285             


Ala Cys Asp Gly Gly Cys Ile Asn Gly Pro Ala Phe Gly Lys Leu Glu 
    290                 295                 300                 


Ser Gly Ile Ala Lys Arg Lys Glu Glu Val Ile Ser Tyr Ser Arg Met 
305                 310                 315                 320 


Lys Glu Arg Phe Ser Gly Asp Phe Ser Gly Ile Thr Asp Phe Ser Leu 
                325                 330                 335     


Asp Leu Ser Arg Lys Phe Ile Asp Leu Ser Asp Arg Trp Lys Met Pro 
            340                 345                 350         


Ser Glu Met Glu Ile Lys Glu Ile Leu Ser Lys Ile Gly Lys Phe Ser 
        355                 360                 365             


Val Glu Asp Glu Leu Asn Cys Gly Ala Cys Gly Tyr Asp Thr Cys Arg 
    370                 375                 380                 


Glu Lys Ala Ile Ala Val Phe Asn Gly Met Ala Glu Pro Tyr Met Cys 
385                 390                 395                 400 


Leu Pro Tyr Met Arg Gly Arg Ala Glu Thr Leu Ser Asn Ile Ile Ile 
                405                 410                 415     


Ser Ser Thr Pro Asn Ala Ile Ile Ala Val Asn Asn Glu Tyr Glu Ile 
            420                 425                 430         


Gln Asp Met Asn Arg Ala Phe Glu Lys Met Phe Leu Val Asn Ser Ala 
        435                 440                 445             


Met Val Lys Gly Glu Asp Leu Ser Leu Ile Phe Asp Ile Ser Asp Phe 
    450                 455                 460                 


Val Glu Val Ile Glu Asn Lys Lys Ser Ile Phe Asn Lys Lys Val Ser 
465                 470                 475                 480 


Phe Lys Asn Tyr Gly Ile Ile Ala Leu Glu Ser Ile Tyr Tyr Leu Glu 
                485                 490                 495     


Glu Tyr Lys Ile Ala Ile Gly Ile Phe Thr Asp Ile Thr Lys Met Glu 
            500                 505                 510         


Lys Gln Lys Glu Ser Phe Ser Lys Leu Lys Arg Glu Asn Tyr Gln Leu 
        515                 520                 525             


Ala Gln Gln Val Ile Asp Arg Gln Met Lys Val Ala Gln Glu Ile Ala 
    530                 535                 540                 


Ser Leu Leu Gly Glu Thr Thr Ala Glu Thr Lys Val Ile Leu Thr Lys 
545                 550                 555                 560 


Met Lys Asp Met Leu Leu Asn Gln Gly Asp Asp Glu 
                565                 570         


<210>  14
<211>  386
<212>  PRT
<213>  Thermoanaerobacterium saccharolyticum

<400>  14

Met Ser His Tyr Ile Asp Ile Ala His Ala Ser Leu Asn Lys Tyr Asp 
1               5                   10                  15      


Glu Glu Leu Cys Gly Asp Ser Val Gln Ile Ile Arg Lys Lys Asp Tyr 
            20                  25                  30          


Ala Met Ala Val Met Ala Asp Gly Leu Gly Ser Gly Val Lys Ala Asn 
        35                  40                  45              


Ile Leu Ser Thr Leu Thr Thr Arg Ile Val Ser Lys Met Leu Asp Met 
    50                  55                  60                  


Gly Ser Glu Leu Arg Asp Val Val Glu Thr Val Ala Glu Thr Leu Pro 
65                  70                  75                  80  


Ile Cys Lys Glu Arg Asn Ile Ala Tyr Ser Thr Phe Thr Val Val Ser 
                85                  90                  95      


Ile Tyr Gly Asp Asn Ala His Leu Val Glu Tyr Asp Asn Pro Ser Val 
            100                 105                 110         


Phe Tyr Phe Lys Asn Gly Val His Lys Lys Val Asp Arg Lys Cys Val 
        115                 120                 125             


Glu Ile Gly Asp Lys Lys Ile Phe Glu Ser Ser Phe Lys Leu Asp Leu 
    130                 135                 140                 


Asn Asp Ala Leu Ile Val Val Ser Asp Gly Val Ile His Ala Gly Val 
145                 150                 155                 160 


Gly Gly Ile Leu Asn Leu Gly Trp Gln Trp Asp Asn Val Lys Gln Tyr 
                165                 170                 175     


Leu Ser Lys Val Leu Glu Val Tyr Ser Asp Ala Ser Asp Ile Cys Ser 
            180                 185                 190         


Gln Leu Ile Thr Thr Cys Asn Asn Leu Tyr Lys Asn Arg Pro Gly Asp 
        195                 200                 205             


Asp Thr Thr Ala Ile Val Ile Lys Val Asn Glu Ser Lys Lys Val Thr 
    210                 215                 220                 


Val Met Val Gly Pro Pro Ile Leu Lys Asn Met Asp Glu Trp Val Val 
225                 230                 235                 240 


Lys Lys Leu Met Lys Ser Glu Gly Leu Lys Val Val Cys Gly Gly Thr 
                245                 250                 255     


Ala Ala Lys Ile Val Ser Arg Ile Leu Asn Lys Asp Val Ile Thr Ser 
            260                 265                 270         


Thr Glu Tyr Ile Asp Pro Asp Ile Pro Pro Tyr Ala His Ile Asp Gly 
        275                 280                 285             


Ile Asp Leu Val Thr Glu Gly Val Leu Thr Leu Arg Lys Thr Val Glu 
    290                 295                 300                 


Ile Phe Lys Glu Tyr Met Asn Asp Lys Asp Ser Asn Leu Leu Arg Phe 
305                 310                 315                 320 


Ser Lys Lys Asp Ala Ala Thr Arg Leu Phe Lys Ile Leu Asn Tyr Ala 
                325                 330                 335     


Thr Asp Val Asn Phe Leu Val Gly Gln Ala Val Asn Ser Ala His Gln 
            340                 345                 350         


Asn Pro Asp Phe Pro Ser Asp Leu Arg Ile Lys Val Arg Ile Val Glu 
        355                 360                 365             


Glu Leu Ile Ser Leu Leu Glu Arg Leu Asn Lys Asn Val Glu Val Asn 
    370                 375                 380                 


Tyr Phe 
385     


<210>  15
<211>  495
<212>  PRT
<213>  Thermoanaerobacterium saccharolyticum

<400>  15

Met Leu Lys Tyr Glu Val Leu Tyr Asn Val Ala Lys Leu Thr Leu Glu 
1               5                   10                  15      


Asp Arg Leu Glu Asp Glu Tyr Asp Glu Ile Pro Tyr Glu Ile Ile Pro 
            20                  25                  30          


Gly Thr Lys Pro Arg Phe Arg Cys Cys Val Tyr Lys Glu Arg Ala Ile 
        35                  40                  45              


Ile Glu Gln Arg Thr Lys Val Ala Met Gly Lys Asn Leu Lys Arg Thr 
    50                  55                  60                  


Met Lys His Ala Val Asp Gly Glu Glu Pro Ile Ile Gln Val Leu Asp 
65                  70                  75                  80  


Ile Ala Cys Glu Glu Cys Pro Ile Lys Arg Tyr Arg Val Thr Glu Ala 
                85                  90                  95      


Cys Arg Gly Cys Ile Thr His Arg Cys Thr Glu Val Cys Pro Lys Gly 
            100                 105                 110         


Ala Ile Thr Ile Ile Asn Lys Lys Ala Asn Ile Asp Tyr Asp Lys Cys 
        115                 120                 125             


Ile Glu Cys Gly Arg Cys Lys Asp Ala Cys Pro Tyr Asn Ala Ile Ser 
    130                 135                 140                 


Asp Asn Leu Arg Pro Cys Ile Arg Ser Cys Ser Ala Lys Ala Ile Thr 
145                 150                 155                 160 


Met Asp Glu Glu Leu Lys Ala Ala Ile Asn Tyr Glu Lys Cys Thr Ser 
                165                 170                 175     


Cys Gly Ala Cys Thr Leu Ala Cys Pro Phe Gly Ala Ile Thr Asp Lys 
            180                 185                 190         


Ser Tyr Ile Val Asp Ile Ile Arg Ala Ile Lys Ser Gly Lys Lys Val 
        195                 200                 205             


Tyr Ala Leu Val Ala Pro Ala Ile Ala Ser Gln Phe Lys Asp Val Thr 
    210                 215                 220                 


Val Gly Gln Ile Lys Ser Ala Leu Lys Glu Phe Gly Phe Val Asp Val 
225                 230                 235                 240 


Ile Glu Val Ala Leu Gly Ala Asp Phe Val Ala Met Glu Glu Ala Lys 
                245                 250                 255     


Glu Phe Ser His Lys Ile Lys Asp Ile Lys Val Met Thr Ser Ser Cys 
            260                 265                 270         


Cys Pro Ala Phe Val Ala His Ile Lys Lys Ser Tyr Pro Glu Leu Ser 
        275                 280                 285             


Gln Asn Ile Ser Thr Thr Val Ser Pro Met Thr Ala Ile Ser Lys Tyr 
    290                 295                 300                 


Ile Lys Lys His Asp Pro Met Ala Val Thr Val Phe Ile Gly Pro Cys 
305                 310                 315                 320 


Thr Ala Lys Lys Ser Glu Val Met Arg Asp Asp Val Lys Gly Ile Thr 
                325                 330                 335     


Asp Phe Ala Met Thr Phe Glu Glu Met Val Ala Val Leu Asp Ala Ala 
            340                 345                 350         


Lys Ile Asp Met Lys Glu Gln Gln Asp Val Glu Val Asp Asp Ala Thr 
        355                 360                 365             


Leu Phe Gly Arg Lys Phe Ala Arg Ser Gly Gly Val Leu Glu Ala Val 
    370                 375                 380                 


Val Glu Ala Val Lys Glu Ile Gly Ala Asp Val Glu Val Asn Pro Val 
385                 390                 395                 400 


Val Cys Asn Gly Leu Asp Glu Cys Asn Lys Thr Leu Lys Ile Met Lys 
                405                 410                 415     


Ala Gly Lys Leu Pro Asn Asn Phe Ile Glu Gly Met Ala Cys Ile Gly 
            420                 425                 430         


Gly Cys Ile Gly Gly Ala Gly Val Ile Asn Asn Asn Val Asn Gln Ala 
        435                 440                 445             


Lys Leu Ala Val Asn Lys Phe Gly Asp Ser Ser Tyr His Lys Ser Ile 
    450                 455                 460                 


Lys Asp Arg Ile Ser Gln Phe Asp Thr Asp Asp Val Asp Phe His Val 
465                 470                 475                 480 


Asp Ser Gly Glu Asp Glu Ser Ser Glu Thr Ser Phe Lys Glu Ala 
                485                 490                 495 


<210>  16
<211>  81
<212>  PRT
<213>  Thermoanaerobacterium saccharolyticum

<400>  16

Met Val Ile Thr Val Cys Val Gly Ser Ser Cys His Leu Lys Gly Ser 
1               5                   10                  15      


Tyr Asp Val Ile Asn Lys Leu Lys Glu Met Ile Lys Asn Tyr Gly Ile 
            20                  25                  30          


Glu Asp Lys Val Glu Leu Lys Ala Asp Phe Cys Met Gly Asn Cys Leu 
        35                  40                  45              


Arg Ala Val Ser Val Lys Ile Asp Gly Gly Ala Cys Leu Ser Ile Lys 
    50                  55                  60                  


Pro Asn Ser Val Glu Arg Phe Phe Lys Glu His Val Leu Gly Glu Leu 
65                  70                  75                  80  


Lys 
    


<210>  17
<211>  33
<212>  DNA
<213>  Artificial

<220>
<223>  synthetic oligonucleotides

<400>  17
ttactcgaga aactggtgga acatctggtg gat                                    33


<210>  18
<211>  33
<212>  DNA
<213>  Artificial

<220>
<223>  Synthetic DNA

<400>  18
aagtctagat aaatcgctcc gacaggacat gct                                    33


<210>  19
<211>  34
<212>  DNA
<213>  Artificial

<220>
<223>  synthetic DNA

<400>  19
ctacaattgg acttgcctat cagaaagtct caca                                   34


<210>  20
<211>  33
<212>  DNA
<213>  Artificial

<220>
<223>  synthetic DNA

<400>  20
atagagctct catgggagaa ccagatgcaa gta                                    33


<210>  21
<211>  31
<212>  DNA
<213>  Artificial

<220>
<223>  Synthetic DNA

<400>  21
atatctcgag ctgtaattgt ccttgatgac g                                      31


<210>  22
<211>  33
<212>  DNA
<213>  Artificial

<220>
<223>  synthetic DNA

<400>  22
atatctgcag caggatatga tggagctaca gtg                                    33


<210>  23
<211>  32
<212>  DNA
<213>  Artificial

<220>
<223>  synthetic DNA

<400>  23
atatgaattc catatatgag agggagggct ga                                     32


<210>  24
<211>  29
<212>  DNA
<213>  Artificial

<220>
<223>  synthetic DNA

<400>  24
atatcggccg agtcgtttct cctaacaag                                         29


<210>  25
<211>  34
<212>  DNA
<213>  Artificial

<220>
<223>  synthetic DNA

<400>  25
tggatccgcc atttattatt tccttcctct tttc                                   34


<210>  26
<211>  27
<212>  DNA
<213>  Artificial

<220>
<223>  synthetic DNA

<400>  26
ttctagatgg ctgcaggtcg ataaacc                                           27


<210>  27
<211>  34
<212>  DNA
<213>  Artificial

<220>
<223>  synthetic DNA

<400>  27
gcggatccca tgaacaaaaa tataaaatat tctc                                   34


<210>  28
<211>  31
<212>  DNA
<213>  Artificial

<220>
<223>  synthetic DNA

<400>  28
gcgaattccc tttagtaacg tgtaactttc c                                      31


