
                                SEQUENCE LISTING

<110> LanzaTech New Zealand Limited

<120> RECOMBINANT MICROORGANISMS AND USES
  THEREFOR

<130> LT83   008156.00014

<160> 26

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 854
<212> PRT
<213> Clostridium autoethanogenum

<400> 1
Met Asn Asp Val Leu Asn Lys Leu Tyr Thr Ala Asn Gln Ser Lys Arg
 1               5                  10                  15
Ile Glu Lys Leu Thr Asn Asp Leu Tyr Ser Val Thr Pro Glu Ile Glu
            20                  25                  30
Ala Gln Arg Ala Val Leu Ile Thr Glu Ser Phe Lys Glu Thr Glu Ala
        35                  40                  45
Tyr Pro Met Ile Ile Arg Arg Ala Lys Ala Leu Glu Lys Ile Leu Asn
    50                  55                  60
Glu Met Asp Ile Val Ile Arg Asp Glu Glu Leu Ile Val Gly Asn Leu
65                  70                  75                  80
Thr Lys Lys Pro Arg Ala Ala Ser Ile Phe Pro Glu Phe Ser Asn Lys
                85                  90                  95
Trp Leu Leu Glu Glu Phe Asp Thr Leu Ala Lys Arg Thr Gly Asp Val
            100                 105                 110
Phe Leu Ile Ser Glu Asp Val Lys Ser Gln Leu Arg Glu Val Phe Lys
        115                 120                 125
Tyr Trp Asp Gly Lys Thr Thr Asn Glu Leu Ala Thr Glu Tyr Met Phe
    130                 135                 140
Ser Glu Thr Lys Glu Ala Met Glu Ala Gly Val Phe Thr Val Gly Asn
145                 150                 155                 160
Tyr Tyr Phe Asn Gly Ile Gly His Ile Ser Val Asp Tyr Ala Lys Val
                165                 170                 175
Leu Ser Lys Gly Phe Asn Gly Ile Ile Glu Asp Ala Glu Ser Glu Lys
            180                 185                 190
Ala Lys Ala Asp Lys Ala Asp Pro Asp Tyr Ile Lys Lys Asp Gln Phe
        195                 200                 205
Leu Thr Ala Val Ile Ile Thr Ser Lys Ala Val Ile Lys Phe Ala Arg
    210                 215                 220
Arg Phe Ala Glu Leu Ala Arg Asn Leu Ala Ser Gln Ser Leu Asp Ser
225                 230                 235                 240
Arg Arg Arg Glu Glu Leu Met Gln Ile Ala Glu Asn Cys Gln Trp Val
                245                 250                 255
Pro Glu Arg Pro Ala Arg Thr Phe Tyr Glu Ala Leu Gln Ser Phe Trp
            260                 265                 270
Phe Val Gln Ser Ile Ile Gln Ile Glu Ser Asn Gly His Ser Ile Ser
        275                 280                 285
Pro Met Arg Phe Asp Gln Tyr Met Tyr Pro Tyr Phe Lys Lys Asp Val
    290                 295                 300
Ser Asn Gly Leu Ile Thr Gln Glu Lys Ala Gln Glu Leu Leu Asp Cys
305                 310                 315                 320
Leu Trp Val Lys Phe Asn Asp Val Asn Lys Val Arg Asp Glu Gly Ser
                325                 330                 335
Thr Lys Ala Phe Gly Gly Tyr Pro Met Phe Gln Asn Leu Ile Val Gly
            340                 345                 350
Gly Gln Thr Ile Asp Gly Arg Asp Ala Thr Asn Glu Leu Ser Phe Met
        355                 360                 365
Cys Leu Glu Ala Thr Ala His Thr Lys Leu Pro Gln Pro Ser Ile Ser
    370                 375                 380
Ile Arg Ala Trp Asn Lys Thr Pro Asp Glu Leu Leu Leu Lys Ala Ala
385                 390                 395                 400
Glu Val Thr Arg Leu Gly Leu Gly Met Pro Ala Tyr Tyr Asn Asp Glu
                405                 410                 415
Val Ile Ile Pro Ser Leu Thr Ser Arg Gly Leu Thr Leu Glu Asp Ala
            420                 425                 430
Arg Asp Tyr Gly Ile Ile Gly Cys Val Glu Pro Gln Lys Gly Gly Lys
        435                 440                 445
Thr Glu Gly Trp His Asp Ala Ala Phe Phe Asn Ile Val Lys Val Leu
    450                 455                 460
Glu Ile Thr Ile Asn Asn Gly Met Asp Asn Gly Lys Gln Ile Gly Leu
465                 470                 475                 480
Arg Thr Gly Asp Phe Thr Ser Phe Thr Ser Phe Glu Lys Leu Phe Asp
                485                 490                 495
Ala Tyr Lys Leu Gln Met Glu Tyr Phe Val Lys Leu Leu Val Asn Ala
            500                 505                 510
Asp Asn Ser Val Asp Leu Ala His Gly Glu Arg Ala Pro Leu Pro Phe
        515                 520                 525
Leu Ser Ser Met Ala Asp Asp Cys Ile Ala Arg Gly Lys Ser Leu Gln
    530                 535                 540
Glu Gly Gly Ala His Tyr Asn Phe Thr Gly Pro Gln Gly Val Gly Val
545                 550                 555                 560
Ala Asn Ala Ala Asp Ser Leu Glu Ala Ile Lys Lys Leu Val Phe Glu
                565                 570                 575
Asp Lys Lys Ile Thr Leu Gln Asp Leu Lys Asn Ala Leu Asp Thr Asn
            580                 585                 590
Phe Gly Glu Cys Lys Lys Asn Pro Ile Ser Glu Leu Ala Asn Ser Ile
        595                 600                 605
Asn Glu Val Gly Asp Met Lys Gly Leu Thr Pro Glu Thr Ile Leu Lys
    610                 615                 620
Val Ile Glu Lys Leu Leu Ser Glu Glu Lys Lys Thr Ser Leu Glu Gly
625                 630                 635                 640
Leu Glu Pro Gly Lys Asp Ile Asn Leu Gly Ser Tyr Gly Asn Lys Glu
                645                 650                 655
Ser Ile Arg Gln Met Leu Leu Asn Arg Ala Pro Lys Phe Gly Asn Asp
            660                 665                 670
Ile Asp Glu Val Asp Asp Leu Ala Arg Glu Ala Ala Leu Ile Tyr Cys
        675                 680                 685
Asn Glu Val Glu Lys Tyr Thr Asn Pro Arg Asn Gly Gln Phe Gln Pro
    690                 695                 700
Gly Leu Tyr Pro Val Ser Ala Asn Val Pro Met Gly Ser Gln Thr Gly
705                 710                 715                 720
Ala Thr Pro Asp Gly Arg Lys Ala Gly Glu Pro Leu Ala Asp Gly Val
                725                 730                 735
Ser Pro Val Ser Gly Arg Asp Ala Met Gly Pro Thr Ala Ala Ala Asn
            740                 745                 750
Ser Val Ala Lys Ile Asp His Cys Lys Ala Ser Asn Gly Thr Leu Phe
        755                 760                 765
Asn Gln Lys Phe His Pro Ser Ala Leu Glu Gly Gln Thr Gly Leu Gln
    770                 775                 780
Asn Leu Ser Ser Leu Val Arg Thr Phe Phe Asp Glu Lys Gly Leu His
785                 790                 795                 800
Val Gln Phe Asn Val Val Ser Arg Glu Thr Leu Leu Asp Ala Gln Lys
                805                 810                 815
Asn Pro Glu Asn Tyr Arg Asn Leu Val Val Arg Val Ala Gly Tyr Ser
            820                 825                 830
Ala His Phe Thr Ser Leu Asp Lys Ser Ile Gln Asp Asp Ile Ile Lys
        835                 840                 845
Arg Thr Glu His Thr Phe
    850

<210> 2
<211> 259
<212> PRT
<213> Clostridium autoethanogenum

<400> 2
Met Glu Pro Gln Val Met Phe Ile Pro Ser Lys Cys Ile Gly Cys Lys
 1               5                  10                  15
Lys Cys Tyr Glu Val Cys Ser Asn Gly Ala Ile Asp Phe Asn Leu Pro
            20                  25                  30
Ser Arg Val Asp Gln Asn Lys Cys Val Lys Cys Gly Lys Cys Val Glu
        35                  40                  45
Asn Cys Tyr Ala Gly Ala Leu Asn Leu Ala Gly Asn Thr Arg Thr Val
    50                  55                  60
Lys Glu Leu Leu Leu Glu Leu Lys Lys Asp Asn Ile Tyr Tyr Arg Arg
65                  70                  75                  80
Ser Gly Gly Gly Ile Thr Leu Ser Gly Gly Glu Val Thr Ala Gln Pro
                85                  90                  95
Glu Phe Ala Glu Glu Leu Leu Lys Gly Cys Lys Gln Asn Gly Trp His
            100                 105                 110
Thr Ala Ile Glu Thr Ala Ala Phe Thr Ser Gln Ser Val Leu Glu Arg
        115                 120                 125
Met Leu Pro Trp Leu Asp Leu Val Met Leu Asp Ile Lys His Met Asp
    130                 135                 140
Ala Asn Lys His Leu Glu Tyr Thr Gly Lys Pro Asn Glu Leu Ile Leu
145                 150                 155                 160
Gln Asn Ala Lys Leu Ile Ala Gln Phe Gly Val Gln Leu Ile Ile Arg
                165                 170                 175
Val Pro Val Ile Pro Gly Val Asn Ser Asp Glu Asn Asn Ile Arg Ala
            180                 185                 190
Thr Ala Asn Phe Ala Thr Ser Leu Lys Ser Val Lys Glu Leu His Leu
        195                 200                 205
Leu Pro Tyr His Arg Leu Gly Glu Asn Lys Tyr Glu Tyr Leu Gly His
    210                 215                 220
Asp Tyr Ile Met Lys Gly Leu Gln Pro Pro Thr Lys Glu Glu Ile Asn
225                 230                 235                 240
Lys Leu Lys Glu Leu Val Glu Glu Cys Gly Leu Ile Cys Lys Val Gly
                245                 250                 255
Gly Ile Asp


<210> 3
<211> 2565
<212> DNA
<213> Clostridium autoethanogenum

<400> 3
ttgaatgatg ttttaaacaa actttatact gcaaatcaaa gtaaaagaat agaaaaatta      60
actaacgatt tatactcggt aactcctgaa atcgaagcgc aaagagcagt tttaataacg     120
gaatctttta aggaaactga agcttatcct atgattatta gaagagctaa agctttagaa     180
aaaatactaa atgaaatgga tatagttatt cgtgatgaag aacttattgt aggaaattta     240
actaaaaaac ctagagcggc ttcaatattt ccggaatttt caaataagtg gcttttggag     300
gaatttgata ctcttgcaaa aagaactggt gatgtatttc ttattagtga agatgtaaaa     360
tcacaactta gagaagtatt caaatattgg gatggaaaaa caacaaatga gcttgcaaca     420
gagtatatgt tttcagaaac gaaggaagca atggaagcag gggtatttac tgttggaaat     480
tattacttca atggtatagg tcatatttct gtagattatg caaaagtatt atctaaagga     540
tttaacggta taattgaaga tgcagaaagt gaaaaggcta aagcagataa agcagatcca     600
gattacataa agaaggatca gtttttaaca gctgtaatca taacttcaaa agctgttatt     660
aagtttgcta gacgttttgc tgaattagct agaaatttag caagtcaatc attggattca     720
cgaagacgtg aagagttaat gcaaatagct gaaaattgtc agtgggtacc tgaaagacca     780
gctagaacgt tttatgaggc tctacaatca ttttggtttg tacaatctat tattcaaata     840
gaatctaatg gacattcaat atcacctatg cgttttgacc aatacatgta tccttatttt     900
aagaaggatg tatcaaatgg acttattaca caagaaaaag cccaagaact tttagattgt     960
ctatgggtta aatttaatga tgttaataag gttcgtgatg aaggatcaac aaaagcattt    1020
ggtggatatc caatgttcca gaacttaatt gtaggtggac aaactattga tggaagagat    1080
gctacaaatg agctttcatt tatgtgcctt gaagctactg cacataccaa attaccgcaa    1140
ccatcaattt caataagagc ttggaacaaa actccagatg agttattatt aaaagctgct    1200
gaagtaactc gtttaggttt aggtatgcca gcttactata atgatgaagt tatcattcct    1260
tctttgacaa gccgcggtct tacgttagaa gatgctagag attatggtat tattggatgt    1320
gtagaacctc aaaaaggtgg aaagacggaa ggatggcatg atgctgcatt ctttaatatt    1380
gtaaaggtat tagagataac tataaataat ggtatggata atggcaaaca gataggatta    1440
agaactggag acttcacttc ttttacatca tttgagaaat tatttgatgc atacaaatta    1500
cagatggagt attttgttaa acttttagtt aatgcagata acagtgtaga tttagcacat    1560
ggagagagag caccattacc attcttatct tcaatggcag acgattgtat agctagagga    1620
aagtcattac aagaaggagg agcacattac aactttacag gaccacaagg agtaggagtt    1680
gcaaatgcag cagactcgtt agaagctatt aagaaacttg tttttgaaga taagaagata    1740
actttacagg atttaaagaa tgcgttagac actaattttg gtgaatgtaa gaaaaaccca    1800
atatctgaac ttgctaatag cataaatgaa gtgggtgata tgaaaggatt aacacctgaa    1860
actatattga aagttattga gaaattatta tcagaagaaa agaaaacctc attagaagga    1920
ttggagccgg gtaaagatat taatttaggt agttatggaa ataaagagag tattcgtcaa    1980
atgctattaa atagagcacc taagtttggt aatgatatag atgaggttga tgatttagca    2040
cgagaagccg cattaattta ctgtaatgaa gttgaaaaat acactaatcc acgtaatggt    2100
caattccaac caggacttta tcctgtttct gcaaatgttc caatgggatc acagacagga    2160
gcaacaccag atggaagaaa agctggggaa ccactagcag atggtgtatc accagtttca    2220
ggaagagatg caatgggacc aactgcagct gctaattctg ttgcgaaaat agaccattgt    2280
aaagcttcaa atggtacatt atttaatcaa aagtttcatc catctgcttt agaaggtcag    2340
actggtttac agaatttatc ttctctagta agaacctttt tcgatgaaaa aggattacat    2400
gtacaattta atgtagtaag tagagaaacg cttttagatg ctcaaaagaa tcctgaaaat    2460
tatagaaatc tggtagtacg tgtagccgga tatagtgctc actttacttc tttagataag    2520
tcaattcagg atgatattat aaaaagaaca gaacatactt tttag                    2565

<210> 4
<211> 780
<212> DNA
<213> Clostridium autoethanogenum

<400> 4
atggagccac aagtaatgtt tattccaagc aaatgtatag gatgtaaaaa atgctatgaa      60
gtttgcagta atggagcaat agatttcaac cttccttcta gagttgatca gaataaatgt     120
gttaagtgtg gtaagtgtgt tgagaattgt tatgctggag ctttgaattt agcaggaaac     180
acgagaactg taaaggaatt gttactagaa ttgaaaaagg ataacatata ttatagacga     240
tctggtggtg gaattacttt atcaggagga gaagtaacag ctcaacctga gtttgctgaa     300
gaattactaa aaggatgcaa acaaaatggt tggcacacag ctattgaaac cgcagcattt     360
acatctcaaa gtgttttaga aaggatgcta ccttggcttg atttagttat gcttgatatt     420
aagcacatgg atgcaaataa acatttagag tatacaggaa agcctaatga gttaatctta     480
cagaatgcaa aattaatagc tcagtttgga gttcagttaa taataagagt acctgttatt     540
ccaggagtta atagcgatga aaataatata agagctacag ctaattttgc aactagcctt     600
aaaagtgtta aagaattaca tcttctacca taccatcgtc ttggtgaaaa taagtatgag     660
tacttaggac atgattatat aatgaagggt ttacaaccac ctactaaaga agaaataaat     720
aagcttaaag aactagtgga agaatgtgga ctaatatgta aagttggtgg aattgactag     780

<210> 5
<211> 351
<212> PRT
<213> Clostridium autoethanogenum

<400> 5
Met Lys Gly Phe Ala Met Leu Gly Ile Asn Lys Leu Gly Trp Ile Glu
 1               5                  10                  15
Lys Lys Asn Pro Val Pro Gly Pro Tyr Asp Ala Ile Val His Pro Leu
            20                  25                  30
Ala Val Ser Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala
        35                  40                  45
Leu Gly Asn Arg Glu Asn Met Ile Leu Gly His Glu Ala Val Gly Glu
    50                  55                  60
Ile Ala Glu Val Gly Ser Glu Val Lys Asp Phe Lys Val Gly Asp Arg
65                  70                  75                  80
Val Ile Val Pro Cys Thr Thr Pro Asp Trp Arg Ser Leu Glu Val Gln
                85                  90                  95
Ala Gly Phe Gln Gln His Ser Asn Gly Met Leu Ala Gly Trp Lys Phe
            100                 105                 110
Ser Asn Phe Lys Asp Gly Val Phe Ala Asp Tyr Phe His Val Asn Asp
        115                 120                 125
Ala Asp Met Asn Leu Ala Ile Leu Pro Asp Glu Ile Pro Leu Glu Ser
    130                 135                 140
Ala Val Met Met Thr Asp Met Met Thr Thr Gly Phe His Gly Ala Glu
145                 150                 155                 160
Leu Ala Asp Ile Lys Met Gly Ser Ser Val Val Val Ile Gly Ile Gly
                165                 170                 175
Ala Val Gly Leu Met Gly Ile Ala Gly Ser Lys Leu Arg Gly Ala Gly
            180                 185                 190
Arg Ile Ile Gly Val Gly Ser Arg Pro Val Cys Val Glu Thr Ala Lys
        195                 200                 205
Phe Tyr Gly Ala Thr Asp Ile Val Asn Tyr Lys Asn Gly Asp Ile Val
    210                 215                 220
Glu Gln Ile Met Asp Leu Thr His Gly Lys Gly Val Asp Arg Val Ile
225                 230                 235                 240
Met Ala Gly Gly Gly Ala Glu Thr Leu Ala Gln Ala Val Thr Met Val
                245                 250                 255
Lys Pro Gly Gly Val Ile Ser Asn Ile Asn Tyr His Gly Ser Gly Asp
            260                 265                 270
Thr Leu Pro Ile Pro Arg Val Gln Trp Gly Cys Gly Met Ala His Lys
        275                 280                 285
Thr Ile Arg Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu Met
    290                 295                 300
Leu Arg Asp Leu Val Leu Tyr Lys Arg Val Asp Leu Ser Lys Leu Val
305                 310                 315                 320
Thr His Val Phe Asp Gly Ala Glu Asn Ile Glu Lys Ala Leu Leu Leu
                325                 330                 335
Met Lys Asn Lys Pro Lys Asp Leu Ile Lys Ser Val Val Thr Phe
            340                 345                 350

<210> 6
<211> 1056
<212> DNA
<213> Clostridium autoethanogenum

<400> 6
atgaaaggtt ttgcaatgtt aggtattaac aaattaggat ggattgaaaa gaaaaaccca      60
gtgccaggtc cttatgatgc gattgtacat cctctagctg tatccccatg tacatcagat     120
atacatacgg tttttgaagg agcacttggt aatagggaaa atatgatttt aggccatgaa     180
gctgtaggtg aaatagccga agttggcagc gaagttaaag attttaaagt tggcgataga     240
gttatcgtac catgcacaac acctgactgg agatctttag aagtccaagc tggttttcag     300
cagcattcaa acggtatgct tgcaggatgg aagttttcca attttaaaga tggtgtattt     360
gcagattact ttcatgtaaa cgatgcagat atgaatcttg ccatactccc agatgaaata     420
cctttagaaa gtgcagttat gatgacagac atgatgacta ctggttttca tggagcagaa     480
cttgcagaca taaaaatggg ctccagcgtt gtagtaattg gtataggagc tgttggatta     540
atgggaatag ccggttccaa acttcgagga gcaggcagaa ttatcggtgt tggaagcaga     600
cctgtttgtg ttgaaacagc taaattttat ggagcaactg atattgtaaa ttataaaaat     660
ggtgatatag ttgaacaaat catggactta actcatggta aaggtgtaga ccgtgtaatc     720
atggcaggcg gtggtgctga aacactagca caagcagtaa ctatggttaa acctggcggc     780
gtaatttcta acatcaacta ccatggaagc ggtgatactt taccaatacc tcgtgttcaa     840
tggggctgcg gcatggctca caaaactata agaggaggat tatgccccgg cggacgtctt     900
agaatggaaa tgctaagaga tcttgttcta tataaacgtg ttgatttgag taaacttgtt     960
actcatgtat ttgatggtgc agaaaatatt gaaaaggccc ttttgcttat gaaaaataag    1020
ccaaaagatt taattaaatc agtagttaca ttctaa                              1056

<210> 7
<211> 601
<212> PRT
<213> Artificial Sequence

<220> 
<223> designed type II methyltransferase

<400> 7
Met Phe Pro Cys Asn Ala Tyr Ile Glu Tyr Gly Asp Lys Asn Met Asn
 1               5                  10                  15
Ser Phe Ile Glu Asp Val Glu Gln Ile Tyr Asn Phe Ile Lys Lys Asn
            20                  25                  30
Ile Asp Val Glu Glu Lys Met His Phe Ile Glu Thr Tyr Lys Gln Lys
        35                  40                  45
Ser Asn Met Lys Lys Glu Ile Ser Phe Ser Glu Glu Tyr Tyr Lys Gln
    50                  55                  60
Lys Ile Met Asn Gly Lys Asn Gly Val Val Tyr Thr Pro Pro Glu Met
65                  70                  75                  80
Ala Ala Phe Met Val Lys Asn Leu Ile Asn Val Asn Asp Val Ile Gly
                85                  90                  95
Asn Pro Phe Ile Lys Ile Ile Asp Pro Ser Cys Gly Ser Gly Asn Leu
            100                 105                 110
Ile Cys Lys Cys Phe Leu Tyr Leu Asn Arg Ile Phe Ile Lys Asn Ile
        115                 120                 125
Glu Val Ile Asn Ser Lys Asn Asn Leu Asn Leu Lys Leu Glu Asp Ile
    130                 135                 140
Ser Tyr His Ile Val Arg Asn Asn Leu Phe Gly Phe Asp Ile Asp Glu
145                 150                 155                 160
Thr Ala Ile Lys Val Leu Lys Ile Asp Leu Phe Leu Ile Ser Asn Gln
                165                 170                 175
Phe Ser Glu Lys Asn Phe Gln Val Lys Asp Phe Leu Val Glu Asn Ile
            180                 185                 190
Asp Arg Lys Tyr Asp Val Phe Ile Gly Asn Pro Pro Tyr Ile Gly His
        195                 200                 205
Lys Ser Val Asp Ser Ser Tyr Ser Tyr Val Leu Arg Lys Ile Tyr Gly
    210                 215                 220
Ser Ile Tyr Arg Asp Lys Gly Asp Ile Ser Tyr Cys Phe Phe Gln Lys
225                 230                 235                 240
Ser Leu Lys Cys Leu Lys Glu Gly Gly Lys Leu Val Phe Val Thr Ser
                245                 250                 255
Arg Tyr Phe Cys Glu Ser Cys Ser Gly Lys Glu Leu Arg Lys Phe Leu
            260                 265                 270
Ile Glu Asn Thr Ser Ile Tyr Lys Ile Ile Asp Phe Tyr Gly Ile Arg
        275                 280                 285
Pro Phe Lys Arg Val Gly Ile Asp Pro Met Ile Ile Phe Leu Val Arg
    290                 295                 300
Thr Lys Asn Trp Asn Asn Asn Ile Glu Ile Ile Arg Pro Asn Lys Ile
305                 310                 315                 320
Glu Lys Asn Glu Lys Asn Lys Phe Leu Asp Ser Leu Phe Leu Asp Lys
                325                 330                 335
Ser Glu Lys Cys Lys Lys Phe Ser Ile Ser Gln Lys Ser Ile Asn Asn
            340                 345                 350
Asp Gly Trp Val Phe Val Asp Glu Val Glu Lys Asn Ile Ile Asp Lys
        355                 360                 365
Ile Lys Glu Lys Ser Lys Phe Ile Leu Lys Asp Ile Cys His Ser Cys
    370                 375                 380
Gln Gly Ile Ile Thr Gly Cys Asp Arg Ala Phe Ile Val Asp Arg Asp
385                 390                 395                 400
Ile Ile Asn Ser Arg Lys Ile Glu Leu Arg Leu Ile Lys Pro Trp Ile
                405                 410                 415
Lys Ser Ser His Ile Arg Lys Asn Glu Val Ile Lys Gly Glu Lys Phe
            420                 425                 430
Ile Ile Tyr Ser Asn Leu Ile Glu Asn Glu Thr Glu Cys Pro Asn Ala
        435                 440                 445
Ile Lys Tyr Ile Glu Gln Tyr Lys Lys Arg Leu Met Glu Arg Arg Glu
    450                 455                 460
Cys Lys Lys Gly Thr Arg Lys Trp Tyr Glu Leu Gln Trp Gly Arg Lys
465                 470                 475                 480
Pro Glu Ile Phe Glu Glu Lys Lys Ile Val Phe Pro Tyr Lys Ser Cys
                485                 490                 495
Asp Asn Arg Phe Ala Leu Asp Lys Gly Ser Tyr Phe Ser Ala Asp Ile
            500                 505                 510
Tyr Ser Leu Val Leu Lys Lys Asn Val Pro Phe Thr Tyr Glu Ile Leu
        515                 520                 525
Leu Asn Ile Leu Asn Ser Pro Leu Tyr Glu Phe Tyr Phe Lys Thr Phe
    530                 535                 540
Ala Lys Lys Leu Gly Glu Asn Leu Tyr Glu Tyr Tyr Pro Asn Asn Leu
545                 550                 555                 560
Met Lys Leu Cys Ile Pro Ser Ile Asp Phe Gly Gly Glu Asn Asn Ile
                565                 570                 575
Glu Lys Lys Leu Tyr Asp Phe Phe Gly Leu Thr Asp Lys Glu Ile Glu
            580                 585                 590
Ile Val Glu Lys Ile Lys Asp Asn Cys
        595                 600

<210> 8
<211> 1806
<212> DNA
<213> Artificial Sequence

<220> 
<223> designed type II methyltransferase

<400> 8
atgtttccgt gcaatgccta tatcgaatat ggtgataaaa atatgaacag ctttatcgaa      60
gatgtggaac agatctacaa cttcattaaa aagaacattg atgtggaaga aaagatgcat     120
ttcattgaaa cctataaaca gaaaagcaac atgaagaaag agattagctt tagcgaagaa     180
tactataaac agaagattat gaacggcaaa aatggcgttg tgtacacccc gccggaaatg     240
gcggccttta tggttaaaaa tctgatcaac gttaacgatg ttattggcaa tccgtttatt     300
aaaatcattg acccgagctg cggtagcggc aatctgattt gcaaatgttt tctgtatctg     360
aatcgcatct ttattaagaa cattgaggtg attaacagca aaaataacct gaatctgaaa     420
ctggaagaca tcagctacca catcgttcgc aacaatctgt ttggcttcga tattgacgaa     480
accgcgatca aagtgctgaa aattgatctg tttctgatca gcaaccaatt tagcgagaaa     540
aatttccagg ttaaagactt tctggtggaa aatattgatc gcaaatatga cgtgttcatt     600
ggtaatccgc cgtatatcgg tcacaaaagc gtggacagca gctacagcta cgtgctgcgc     660
aaaatctacg gcagcatcta ccgcgacaaa ggcgatatca gctattgttt ctttcagaag     720
agcctgaaat gtctgaagga aggtggcaaa ctggtgtttg tgaccagccg ctacttctgc     780
gagagctgca gcggtaaaga actgcgtaaa ttcctgatcg aaaacacgag catttacaag     840
atcattgatt tttacggcat ccgcccgttc aaacgcgtgg gtatcgatcc gatgattatt     900
tttctggttc gtacgaagaa ctggaacaat aacattgaaa ttattcgccc gaacaagatt     960
gaaaagaacg aaaagaacaa attcctggat agcctgttcc tggacaaaag cgaaaagtgt    1020
aaaaagttta gcattagcca gaaaagcatt aataacgatg gctgggtttt cgtggacgaa    1080
gtggagaaaa acattatcga caaaatcaaa gagaaaagca agttcattct gaaagatatt    1140
tgccatagct gtcaaggcat tatcaccggt tgtgatcgcg cctttattgt ggaccgtgat    1200
atcatcaata gccgtaagat cgaactgcgt ctgattaaac cgtggattaa aagcagccat    1260
atccgtaaga atgaagttat taagggcgaa aaattcatca tctatagcaa cctgattgag    1320
aatgaaaccg agtgtccgaa tgcgattaaa tatatcgaac agtacaagaa acgtctgatg    1380
gagcgccgcg aatgcaaaaa gggcacgcgt aagtggtatg aactgcaatg gggccgtaaa    1440
ccggaaatct tcgaagaaaa gaaaattgtt ttcccgtata aaagctgtga caatcgtttt    1500
gcactggata agggtagcta ttttagcgca gacatttata gcctggttct gaagaaaaat    1560
gtgccgttca cctatgagat cctgctgaat atcctgaata gcccgctgta cgagttttac    1620
tttaagacct tcgcgaaaaa gctgggcgag aatctgtacg agtactatcc gaacaacctg    1680
atgaagctgt gcatcccgag catcgatttc ggcggtgaga acaatattga gaaaaagctg    1740
tatgatttct ttggtctgac ggataaagaa attgagattg tggagaagat caaagataac    1800
tgctaa                                                               1806

<210> 9
<211> 61
<212> PRT
<213> Clostridium autoethanogenum

<400> 9
Cys Lys Lys Asn Pro Ile Ser Glu Leu Ala Asn Ser Ile Asn Glu Val
 1               5                  10                  15
Gly Asp Met Lys Gly Leu Thr Pro Glu Thr Ile Leu Lys Val Ile Glu
            20                  25                  30
Lys Leu Leu Ser Glu Glu Lys Lys Thr Ser Leu Glu Gly Leu Glu Pro
        35                  40                  45
Gly Lys Asp Ile Asn Leu Gly Ser Tyr Gly Asn Lys Glu
    50                  55                  60

<210> 10
<211> 28
<212> DNA
<213> Artificial Sequence

<220> 
<223> oligonucleotide Ppta-ack-NotI-F

<400> 10
gagcggccgc aatatgatat ttatgtcc                                         28

<210> 11
<211> 28
<212> DNA
<213> Artificial Sequence

<220> 
<223> oligonucleotide Ppta-ack-NdeI-R

<400> 11
ttccatatgt ttcatgttca tttcctcc                                         28

<210> 12
<211> 2941
<212> DNA
<213> Artificial Sequence

<220> 
<223> codon optimized pddABD operon

<400> 12
catatgagat cgaaaagatt tgaagcactg gcgaaacgcc ctgtgaatca ggacggcttc      60
gttaaggagt ggatcgaaga aggctttatc gcgatggaaa gcccgaacga cccaaaaccg     120
tcgattaaaa tcgttaacgg cgcggtgacc gagctggacg ggaaaccggt aagcgatttt     180
gacctgatcg accactttat cgcccgctac ggtatcaacc tgaaccgcgc cgaagaagtg     240
atggcgatgg attcggtcaa gctggccaac atgctgtgcg atccgaacgt taaacgcagc     300
gaaatcgtcc cgctgaccac cgcgatgacg ccggcgaaaa ttgtcgaagt ggtttcgcat     360
atgaacgtcg tcgagatgat gatggcgatg cagaaaatgc gcgcccgccg caccccgtcc     420
cagcaggcgc acgtcaccaa cgtcaaagat aacccggtac agattgccgc cgacgccgcc     480
gaaggggcat ggcgcggatt tgacgaacag gaaaccaccg ttgcggtagc gcgctatgcg     540
ccgttcaacg ccatcgcgct gctggtgggc tcgcaggtag gccgtccggg cgtgctgacg     600
cagtgctcgc tggaagaagc caccgagctg aagctcggca tgctgggcca cacctgctac     660
gccgaaacca tctccgtcta cggcaccgag ccggtcttta ccgacggcga cgacacgccg     720
tggtcgaagg gcttcctcgc ctcgtcctac gcctctcgcg ggctgaaaat gcgctttacc     780
tccggctccg gctcggaagt gcagatgggc tacgccgaag gcaaatccat gctttatctg     840
gaagcgcgct gcatctacat caccaaagcc gcgggcgtac agggtctgca aaacggttcc     900
gtaagctgca tcggcgtgcc gtctgcggtg ccttccggca ttcgcgcggt gctggcggaa     960
aacctgatct gttcgtcgct ggatctggag tgcgcctcca gcaacgacca gaccttcacc    1020
cactccgata tgcgtcgtac cgcgcgcctg ctgatgcagt tcctgccggg caccgacttt    1080
atctcctccg gttattccgc ggtgccgaac tacgacaaca tgttcgccgg ctccaacgaa    1140
gatgccgaag actttgacga ctacaacgtc atccagcgcg acctgaaggt ggacggcggt    1200
ttgcgtccgg ttcgcgaaga ggacgtcatc gccatccgta acaaagccgc ccgcgcgctg    1260
caggccgtgt ttgccggaat ggggctgccg ccgattaccg atgaagaagt tgaagccgcg    1320
acctacgccc acggttcgaa agatatgccg gagcgcaaca tcgtcgaaga catcaagttc    1380
gcccaggaaa tcatcaataa aaaccgcaac ggtctggaag tggtgaaagc gctggcgcag    1440
ggcggattca ccgacgtggc ccaggacatg ctcaacatcc agaaagctaa gctgaccggg    1500
gactacctgc atacctccgc gattatcgtc ggcgacgggc aggtgctgtc agccgtcaac    1560
gacgtcaacg actatgccgg tccggcaacg ggctatcgcc tgcagggcga acgctgggaa    1620
gagattaaaa acatccctgg cgctcttgat cccaacgaga ttgattaaaa aaaaaaaaaa    1680
aaaaaaaaaa aaaaaaaaaa aaaatggaaa ttaatgaaaa attgctgcgc cagataattg    1740
aagacgtgct cagcgagatg aagggcagcg ataaaccggt ctcgtttaat gcgccggcgg    1800
cctccgcggc gccccaggcc acgccgcccg ccggcgacgg cttcctgacg gaagtgggcg    1860
aagcgcgtca gggaacccag caggacgaag tgattatcgc cgtcggcccg gctttcggcc    1920
tggcgcagac cgtcaatatc gtcggcatcc cgcataagag cattttgcgc gaagtcattg    1980
ccggtattga agaagaaggc attaaggcgc gcgtgattcg ctgctttaaa tcctccgacg    2040
tggccttcgt cgccgttgaa ggtaatcgcc tgagcggctc cggcatctct atcggcatcc    2100
agtcgaaagg caccacggtg atccaccagc aggggctgcc gccgctctct aacctggagc    2160
tgttcccgca ggcgccgctg ctgaccctgg aaacctatcg ccagatcggc aaaaacgccg    2220
cccgctatgc gaaacgcgaa tcgccgcagc cggtcccgac gctgaatgac cagatggcgc    2280
ggccgaagta ccaggcgaaa tcggccattt tgcacattaa agagaccaag tacgtggtga    2340
cgggcaaaaa cccgcaggaa ctgcgcgtgg cgctttgaaa aaaaaaaaaa aaaaaaaaaa    2400
aaaaaaaaaa aaaatgaata ccgacgcaat tgaatcgatg gtacgcgacg tattgagccg    2460
catgaacagc ctgcagggcg aggcgcctgc ggcggctccg gcggctggcg gcgcgtcccg    2520
tagcgccagg gtcagcgact acccgctggc gaacaagcac ccggaatggg tgaaaaccgc    2580
caccaataaa acgctggacg actttacgct ggaaaacgtg ctgagcaata aagtcaccgc    2640
ccaggatatg cgtattaccc cggaaaccct gcgcttacag gcttctattg ccaaagacgc    2700
gggccgcgac cggctggcga tgaacttcga gcgcgccgcc gagctgaccg cggtaccgga    2760
cgatcgcatt cttgaaatct acaacgccct ccgcccctat cgctcgacga aagaggagct    2820
gctggcgatc gccgacgatc tcgaaagccg ctatcaggcg aagatttgcg ccgctttcgt    2880
tcgcgaagcg gccacgctgt acgtcgagcg taaaaaactc aaaggcgacg attaagaatt    2940
c                                                                    2941

<210> 13
<211> 344
<212> DNA
<213> Artificial Sequence

<220> 
<223> intron targeting region for diol dehydratase

<400> 13
aagcttataa ttatccttac gagacgccgc agtgcgccca gatagggtgt taagtcaagt      60
agtttaaggt actactctgt aagataacac agaaaacagc caacctaacc gaaaagcgaa     120
agctgatacg ggaacagagc acggttggaa agcgatgagt tacctaaaga caatcgggta     180
cgactgagtc gcaatgttaa tcagatataa ggtataagtt gtgtttactg aacgcaagtt     240
tctaatttcg gtttctcgtc gatagaggaa agtgtctgaa acctctagta caaagaaagg     300
taagttaaat gcggcgactt atctgttatc accacatttg taca                      344

<210> 14
<211> 25
<212> DNA
<213> Artificial Sequence

<220> 
<223> oligonucleotide Og84f

<400> 14
aaacctcatt agaaggattg gagcc                                            25

<210> 15
<211> 25
<212> DNA
<213> Artificial Sequence

<220> 
<223> oligonucleotide Og85r

<400> 15
gaaactggtg atacaccatc tgcta                                            25

<210> 16
<211> 37
<212> DNA
<213> Artificial Sequence

<220> 
<223> oligonucleotide fD1

<400> 16
ccgaattcgt cgacaacaga gtttgatcct ggctcag                               37

<210> 17
<211> 37
<212> DNA
<213> Artificial Sequence

<220> 
<223> oligonucleotide rP2

<400> 17
cccgggatcc aagcttacgg ctaccttgtt acgactt                               37

<210> 18
<211> 684
<212> DNA
<213> Clostridium autoethanogenum

<400> 18
cgggcrraww twaatttagg tagttatgga aataaagaga gtattcgtca aatgctatta      60
aatagagcac ctaagtttgg taatgatata gatgaggttg atgatttagc acgagaagcc     120
gcagtgcgcc cagatagggt gttaagtcaa gtagtttaag gtactactct gtaagataac     180
acagaaaaca gccaacctaa ccgaaaagcg aaagctgata cgggaacaga gcacggttgg     240
aaagcgatga gttacctaaa gacaatcggg tacgactgag tcgcaatgtt aatcagatat     300
aaggtataag ttgtgtttac tgaacgcaag tttctaattt cggtttctcg tcgatagagg     360
aaagtgtctg aaacctctag tacaaagaaa ggtaagttaa atgcggcgac ttatctgtta     420
tcaccacatt tgtacaatct gtaggagaac ctatgggaac gaaacgaaag cgatgccgag     480
aatctgaatt taccawgact taacactaac tggggatacc ctaaacaaga atgcctaata     540
kaaaggagga aaaaggctat agcactagag cttgaaaatc ttgcaagggt acggagtact     600
cgtaktagtc tgagwagggt aacgcccttt acatggmaar ggggtamwgt wawwgtktyc     660
twraattwaa wattrawtag ykat                                            684

<210> 19
<211> 1124
<212> DNA
<213> Clostridium autoethanogenum

<400> 19
tyttttcaat ctwggtgttg ctcctgtctg tgatcccatt ggaacatttg cagaaacagg      60
ataaagtcct ggttggaatt gaccattacg tggattagtg tatttttcaa cttcattaca     120
gtaaattaag tgaagtaggg aggtaccgcc ttgttcacat tactgtgact ggtttgcacc     180
accctcttcg ggaaccgtac gtacccctct cggagtatac ggctctgtta ttgttcgttc     240
gtaaaaattc actgtcgaca ttcacttgtg tttatgaatc acgtgacgat gacaatgaaa     300
gcatacaaca agagttttac gttgtttcgc tatcattgcc atttcccaac gcgtgaagtt     360
cctattctct agaaagtata ggaacttcta tattgataaa aataataata gtgggtataa     420
ttaagttgtt agagaaaacg tataaattag gagggattca tatggaccca agagatgctg     480
gtgcttctgg tgctggtatg aacaaaaata taaaatattc tcaaaacttt ttaacgagtg     540
aaaaagtact caaccaaata ataaaacaat tgaatttaaa agaaaccgat accgtttacg     600
aaattggaac aggtaaaggg catttaacga cgaaactggc taaaataagt aaacaggtaa     660
cgtctattga attagacagt catctattca acttatcgtc agaaaaatta aaactgaata     720
ctcgtgtcac tttaattcac caagatattc tacagtttca attccctaac aaacagaggt     780
ataaaattgt tgggagtatt ccttaccatt taagcacaca aattattaaa aaagtggttt     840
tgaaagccat gcgtctgaca tctatctgat tgttgaagaa ggattctaca agcgtmyttg     900
gwtattcacc raacwctakg gttgctcttg cacactcagt ctcgattyac aattgcttaa     960
gctgccwscg aatgctttcw cctaaacaaa ktaammgtgt cttataawac ttwcccscwt    1020
aycacwgatg ttccarataa awtggaakct attacgtact tgttcaaatg kgtcatcaar    1080
mywckcatsg tamtaaatcr tytmwcagca tgmacrscma kkaa                     1124

<210> 20
<211> 1279
<212> DNA
<213> Clostridium autoethanogenum

<400> 20
cggggcarra awtwatttag gtagttatgg aaataaagag agtattcgtc aaatgctatt      60
aaatagagca cctaagtttg gtaatgatat agatgaggtt gatgatttag cacgagaagc     120
cgcagtgcgc ccagataggg tgttaagtca agtagtttaa ggtactactc tgtaagataa     180
cacagaaaac agccaaccta accgaaaagc gaaagctgat acgggaacag agcacggttg     240
gaaagcgatg agttacctaa agacaatcgg gtacgactga gtcgcaatgt taatcagata     300
taaggtataa gttgtgttta ctgaacgcaa gtttctaatt tcggtttctc gtcgatagag     360
gaaagtgtct gaaacctcta gtacaaagaa aggtaagtta aatgcggcga cttatctgtt     420
atcaccacat ttgtacaatc tgtaggagaa cctatgggaa cgaaacgaaa gcgatgccga     480
gaatctgaat ttaccaagac ttaacactaa ctggggatac cctaaacaag aatgcctaat     540
agaaaggagg aaaaaggcta tagcactaga gcttgaaaat cttgcaaggg tacggagtac     600
tcgtagtagt ctgagaaggg taacgccctt tacatggcaa aggggtacag ttattgtgta     660
ctaaaattaa aaattgatta gggaggaaaa cctcaaaatg aaaccaacaa tggcaatttt     720
agaaagaatc agtaaaaatt cacaagaaaa tatagacgaa gtttttacaa gactttatcg     780
ttatctttta cgtccagata tttattacgt ggcgacgcgt gaagttccta tactttctag     840
agaataggaa cttcgcgact catagaatta tttcctcccg ttaaataata gataactatt     900
aaaaatagac aatacttgct cataagtaac ggtacttaaa ttgtttactt tggcgtgttt     960
cattgcttga tgaaactgat tttagtaaac agttgacgat atctcgattg acccatttga    1020
aacaaagtac gtatatagct tcatatttat ctgaacatct gtggtatggc ggtagtttat    1080
agacctgtta ctttggttta gatgaagcat cgctgcagct agcatgctga tcgagactga    1140
kkkcaagaca cctatgttcg kgaawtcagt asctkgatcy tctcmmmacw cratwaagtc    1200
arcatggctt cgaacactta gaatttggty twwgkgagat tcgaataacc gtgtgttaga    1260
aktacytsaa aacctctga                                                 1279

<210> 21
<211> 1297
<212> DNA
<213> Clostridium autoethanogenum

<400> 21
agggggtyag ctttttcttc atctggtgtt gctcctgtct gtgatcccat tggaacattt      60
gcagaaacag gataaagtcc tggttggaat tgaccattac gtggattagt gtatttttca     120
acttcattac agtaaattaa gtgaagtagg gaggtaccgc cttgttcaca ttactgtgac     180
tggtttgcac caccctcttc gggaaccgta cgtacccctc tcggagtata cggctctgtt     240
attgttcgtt cgtaaaaatt cactgtcgac attcacttgt gtttatgaat cacgtgacga     300
tgacaatgaa agcatacaac aagagtttta cgttgtttcg ctatcattgc catttcccaa     360
cgcgtgaagt tcctattctc tagaaagtat aggaacttct atattgataa aaataataat     420
agtgggtata attaagttgt tagagaaaac gtataaatta ggagggattc atatggaccc     480
aagagatgct ggtgcttctg gtgctggtat gaacaaaaat ataaaatatt ctcaaaactt     540
tttaacgagt gaaaaagtac tcaaccaaat aataaaacaa ttgaatttaa aagaaaccga     600
taccgtttac gaaattggaa caggtaaagg gcatttaacg acgaaactgg ctaaaataag     660
taaacaggta acgtctattg aattagacag tcatctattc aacttatcgt cagaaaaatt     720
aaaactgaat actcgtgtca ctttaattca ccaagatatt ctacagtttc aattccctaa     780
caaacagagg tataaaattg ttgggagtat tccttaccat ttaagcacac aaattattaa     840
aaaagtggtt tttgaaagcc atgcgtctga catctatctg attgttgaag aaggwttcta     900
caagcgtacc ttggatattc accgaacact agggttgctc ttgcacactc aagtctcgat     960
tcagcaattg cttaagctgc cagcggaatg ctttcatcct aaaccaaagt aaacagtgtc    1020
tataaactta cccgcatacc acagatgttc cagataatat tggagctata tacgtacttt    1080
gttcaaatgg tcatcgagaa tatcgtcact gttactaaaa tcagttcatc agcatgaamm    1140
gccagtaaca ttagtacgtt acttatgrca rgwttgcyat tttaagtwtc twtawtacgg    1200
gagaatattc wgagtcsgag ttctattyct gagwtgactc acgtkcgcag ataaaatckg    1260
acctagarac gtataggact ttgactataa ctsctgc                             1297

<210> 22
<211> 754
<212> DNA
<213> Clostridium autoethanogenum

<400> 22
cgggcasagc ctwacaymtk asaakycska wsmmggkasc cgtaagcttg gatcccggga      60
ygacgggtga gtaacmsgkg ggtraccwac sycrrrgagg gggatagcct cccsaawggg     120
akattaatac cscataataw tcagttttcw catggagact gatttaaagg agtaatccgy     180
tttgagatgg acccgcggcg cattagctwk ttggtagggt aacggcctac caaggcgack     240
atgcgtagcc gacctgasag ggtgatcggc cacattggaa ctgasagacg gyccasactc     300
ctacgggagg cascagtggg gaatattgca caatgggcga aagcctgatg cagcaacgcc     360
gcgtgagtga agaaggtttt cgrattgtaa agctctgtct ttggggacga taatgacggt     420
accsaaggag gaagccmcgg staactacgk gccascagcc kcggtaatac rtaggtggcg     480
agcgttgtcc ggaattactg ggcgtaaaga gtgcgtakgc ggatatttaa gtgagakgtg     540
raatacccgg gcttaacccg ggywctgywt ttcamaykgg atatcwakag tgcgggagag     600
gagawtggaa ktccwagkgt agcggtgaar tgcgtarasa tymgraasaa maycwktwkc     660
gwwkgcgatt ctcwgkaccr trrytgayrc tgaggytcga aascgtgggt akcaaacwgm     720
attawatacy ctggtastcc acsyygtwaa cgtg                                 754

<210> 23
<211> 885
<212> DNA
<213> Clostridium autoethanogenum

<400> 23
cattagggcc stcskacycc kwagyccrwc kragccrgga tcaarcmctg ttgtcgacka      60
attcggrctm tcatggwtsw racksgskkk gygwacaagg cccgggaacg tattcaccgc     120
gacattctga ttcgcgatta ctagcaactc caacttcatg taggcgagtt tcagcctgca     180
atccgaactg ggggcagttt ttgaggtttg ctccaccttg cggtcttgct tctctctgta     240
ctgcccattg tagcacgtgt gttgccctgg acataagggg catgatgatt tgacgtcatc     300
cccaccttcc tccgcgttaa ccgcggcagt cttgctagag tgctcaayta awtgttagca     360
actaacaaca ggggttgcgc tcgttgcagg acttaaccta acatctcacg acacgagctg     420
acgacaaccw tgcaccacct gtwtccctgc cccgaagggy ttctcttatc tctaagatat     480
tcagggtatg tcaagtccag gtaaggttct tcgcgttgct tcraattaaa ccacatgctc     540
cgctgcttgt gcgggccccc gtcaattcct ttgagtttta atcttgcgat cgtacttccc     600
aggcggagta cttattgtgt twactgcgkc acaraaggrg tcgatacctc ctacayctag     660
tactcatcgt ttacggcgtg kactaccakg gtatctaatc ctgtttgctm cccaykcttt     720
cktgccwcak cgtcwrttac rktcmaakaa tcsccttygc cwctggtgtt ttccwaatct     780
ctaskkwtty ackgsamwyt agsaattcct tytcctcycc cgymcyctar atatccyrtt     840
tgaatgcagt gmcmwgrtaa mcccggkatt mmawctytmt taatt                     885

<210> 24
<211> 1005
<212> DNA
<213> Clostridium autoethanogenum

<400> 24
agcagggcar gcctwacacm tkasaakycs krwssmggka gccgtaagct tggatcccgg      60
gaygacgggt gagtaacrsg kgggtraccw acsycrrrga gggggatagc ctcccsaawg     120
ggakattaat accscataat aatcagtttt cwcatggaga ctgwtttaaa ggagtaatcc     180
gytttgagat ggacccgcgg cgcattagct wkttggtagg gtaacggcct accmaggcga     240
ckatgcgtag ccgacctgas agggtgatcg gccacwttgg aactgasaga ckgtccasac     300
tcctacggga ggcagcagtg gggaatattg cacaakgggc gaaagcctga tgcascaacg     360
ccgcgtgagt gaagaaggtt ttcggattgt aaagctctgt ctttggggac gatratgacg     420
gtaccsaagg aggargccmc ggstaactac gkgccascmk ccgcggtaat acgtasgtgg     480
cgagcgttgt ccggaattac tgggcgtaaa gagtgcgtag gcggatattt aagtgagatg     540
tgaaataccc gggcttaacc ygggywctgc atttcaaact ggatatctag agtgcgggag     600
aggagaatgg aattcctagt gtagcggtga aatgcgtaka sattakgaag aacaccaktg     660
gcgaaggcga ttytctggac cgtractgay gctgakgcac gaaagcgtgg gtakcaaaca     720
ggattagata cyctggtagt ccacrccgta aacgatgagt actkggtgta ggaggtwtcg     780
accccttmtg tgccgcakta aacacwataa ktactccgcc tggaawgtac gatcgcaakw     840
twaaawctca arggakttga cgggggcscg cwcwagcakc ggagcawgtk gkttaattys     900
arcwcgyraa samcttacct ggamytkacw wmcctkmatw yttwaraswt agwgyrscct     960
tsrggsaggg awrrwkksgt gtrtgttsck cakwycgtyt rtaag                    1005

<210> 25
<211> 1117
<212> DNA
<213> Clostridium autoethanogenum

<400> 25
cccgcwgycr tagcykyskm kwckwagkyr wckragmcrr gatcaarctc tgttgtcgac      60
raattcggac tmtcakggtg wgacgggsgg kgtgwacaak gcccgggaac gtattcaccg     120
cgacattctg attcgcgatt actagcaact ccaacttcat gtaggcgagt ttcagcctgc     180
aatccgaact gggggcagtt tttgaggttt gctccacctt gcggtcttgc ttctctctgt     240
actgcccatt gtagcacgtg tgttgccctg gacataaggg gcatgatgat ttgacgtcat     300
ccccaccttc ctccgcgtta accgcggcag tcttgctaga gtgctcaact aaatgttagc     360
aactaacaac aggggttgcg ctcgttgcag gacttaacct aacatctcac gacacgagct     420
gacgacaacc atgcaccacc tgtatccctg ccccgaaggg yttctcttat ctctaarata     480
ttcagggtat gtcaagtcca ggtaaggttc ttcgcgttgc ttcraattaa accacatgct     540
ccgctgcttg tgcgggcccc cgtcaattcc tttgagtttt aatcttgcga tcgtacttcc     600
caggcggagt acttattgtg tttactgcgg cacagaaggg gtcgatacct cytacaccta     660
gtactcatcg tttacggcgt ggactaccag ggtatctaat cctgtttgct acccacgctt     720
tcgtgcctca kcgtcagtta cggtccagag aatcgccttc gccactggtg ttcttcctaa     780
tctctacgca tttmaccgct acactaggaw tycmttctcc tctcccgcac tctagatatc     840
yagtttkaaa tgcagtgccc gggttaagcc cgggtatttc acatctcact waatatccgc     900
ctacgcactc kttmcgccca gtaatyccga macgctcgcm yctacgtatt acgcggytgc     960
tgcacgtagt tagccgkgct tyctcttggg tmccgtmmtt ayskycccaa kamagagctt    1020
tacaatckga aacyttcttm wytcmksmgs gtgctgmtag cttygycwtg kgcaawatcc    1080
mmtgcgccyy cgatgakytg rwctgyctma ktyaagt                             1117

<210> 26
<211> 3566
<212> DNA
<213> Artificial Sequence

<220> 
<223>  C. autoethanogenum diol dehydratase operon codon
      optimized for E. coli

<400> 26
atttcacaca ggaaacagac catgaatgac gtgctgaata aactgtatac cgccaatcag      60
agcaaacgca ttgaaaaact gaccaacgat ctgtatagcg tgacaccgga aattgaagca     120
cagcgtgcag ttctgattac cgaaagcttt aaagaaaccg aagcctaccc gatgattatt     180
cgtcgtgcaa aagcactgga aaaaatcctg aacgaaatgg atattgtgat ccgtgatgaa     240
gaactgattg ttggcaatct gaccaaaaaa ccgcgtgcag caagcatttt tccggaattt     300
agcaataaat ggctgctgga agaatttgat accctggcaa aacgtaccgg tgatgttttt     360
ctgattagcg aggatgttaa aagccagctg cgtgaagttt tcaaatattg ggatggtaaa     420
accaccaatg aactggcaac cgaatatatg tttagcgaaa ccaaagaagc aatggaagca     480
ggcgttttta ccgttggcaa ctattatttc aatggcattg gtcatatcag cgtggattat     540
gcaaaagttc tgagcaaagg ctttaacggc attattgaag atgccgaaag cgaaaaagca     600
aaagcagata aagccgatcc ggattatatc aaaaaagatc agtttctgac cgcagtgatc     660
attaccagca aagccgttat caaatttgca cgtcgttttg cagaactggc acgtaatctg     720
gcaagccaga gcctggatag ccgtcgtcgt gaggaactga tgcagattgc agaaaattgt     780
cagtgggtgc cggaacgtcc tgcacgtacc ttttatgaag cactgcagag cttttggttt     840
gtgcagagca ttattcagat tgaaagcaac ggtcatagca ttagcccgat gcgttttgat     900
cagtatatgt acccgtactt caaaaaagac gttagcaatg gtctgatcac ccaagaaaaa     960
gcacaagaac tgctggattg tctgtgggtc aaattcaatg atgtgaacaa agttcgtgat    1020
gagggtagca ccaaagcatt tggcggttat ccgatgtttc agaatctgat tgtgggtggc    1080
cagacaattg atggtcgtga tgcaacaaat gaactgagct ttatgtgtct ggaagcaacc    1140
gcacatacca aactgccgca gccgagcatt agcattcgtg catggaataa aacaccggat    1200
gaactgctgc tgaaagcagc agaagttacc cgtctgggtc tgggtatgcc tgcctattat    1260
aacgatgaag ttattattcc gagcctgacc agccgtggtc tgaccctgga agatgcacgt    1320
gattatggta ttattggttg tgttgaaccg cagaaaggtg gcaaaaccga aggttggcat    1380
gatgcagcct ttttcaatat tgttaaagtg ctggaaatca ccatcaacaa cggtatggat    1440
aacggtaaac aaattggtct gcgcaccggt gattttacca gctttaccag ttttgagaaa    1500
ctgttcgatg cctacaaact gcagatggaa tattttgtga aactgctggt gaatgccgat    1560
aattcagttg atctggcaca tggtgaacgt gcaccgctgc cgtttctgag cagcatggca    1620
gatgattgta ttgcacgtgg taaaagcctg caagaaggtg gcgcacatta taactttacc    1680
ggtccgcagg gtgttggtgt tgcaaatgca gcagatagcc tggaagccat caaaaaactg    1740
gtgttcgagg acaaaaaaat caccctgcag gatctgaaaa atgccctgga taccaatttt    1800
ggcgagtgca aaaaaaaccc gattagcgaa ctggccaata gcattaatga agtgggtgat    1860
atgaaaggcc tgactccgga aaccattctg aaagttatcg aaaaactgct gagcgaagag    1920
aaaaaaacca gtctggaagg tctggaaccg ggtaaagata tcaatctggg tagctatggt    1980
aacaaagaaa gcattcgtca gatgctgctg aatcgtgcac cgaaatttgg caatgatatt    2040
gatgaagttg atgacctggc acgtgaagca gcactgattt attgtaacga agtggaaaag    2100
tataccaatc cgcgtaatgg ccagtttcag cctggtctgt atccggttag cgcaaatgtt    2160
ccgatgggta gccagaccgg tgcaactccg gatggtcgta aagccggtga accgctggca    2220
gatggtgtta gtccggtgag tggccgtgat gccatgggtc cgaccgcagc agcaaatagc    2280
gttgcaaaaa ttgatcattg caaagccagc aatggcaccc tgtttaacca gaaatttcat    2340
ccgagcgcac tggaaggcca gacaggtctg cagaatctgt caagcctggt tcgtaccttc    2400
tttgatgaga aaggtctgca tgttcagttt aatgttgtta gccgtgaaac cctgctggat    2460
gcacagaaaa atccggaaaa ttatcgcaat ctggttgttc gtgttgcagg ttatagcgca    2520
cattttacct cactggataa aagcattcag gacgatatta tcaaacgcac cgaacacacc    2580
ttttaaagct ttttatagga ggaaaagtta tgctgaacta ccaggtgaac ctggataaaa    2640
aaggcatcat ctttgatatc cagcgcttta gcgttcatga tggtccgggt attcgtacca    2700
ttgttttttt caaaggttgt ccgctgagct gtcgttggtg tagcaatccg gaatcacagt    2760
gtatggaacc gcaggtaatg tttattccgt caaaatgcat tggctgcaaa aaatgttatg    2820
aggtgtgcag caatggtgcc atcgatttta atctgccgag ccgtgttgat cagaataaat    2880
gtgttaaatg cggcaaatgc gtggaaaatt gttatgccgg tgcactgaat ctggcaggta    2940
atacccgtac cgttaaagag ctgctgctgg aactgaaaaa agacaacatc tattatcgtc    3000
gtagcggtgg tggtattacc ctgagtggtg gtgaagttac cgcacagccg gaatttgcag    3060
aggaactgct gaaaggttgt aaacagaatg gctggcatac cgcaattgaa accgcagcat    3120
ttaccagcca gagcgttctg gaacgtatgc tgccgtggct ggatctggtt atgctggata    3180
ttaaacatat ggacgccaac aaacacctgg aatataccgg taaaccgaac gaactgatcc    3240
tgcaaaatgc aaaactgatt gcacagtttg gtgttcagct gattatccgt gttccggtga    3300
ttcctggtgt taatagtgat gaaaacaata ttcgtgccac cgccaatttt gcaaccagcc    3360
tgaaaagcgt gaaagaactg catctgctgc cgtatcatcg tctgggtgaa aacaaatatg    3420
aatatctggg ccacgattac atcatgaaag gactgcagcc tccgacaaaa gaagaaatca    3480
ataaactgaa agaactggtg gaagaatgcg gtctgatttg taaagtgggt ggcattgatt    3540
aaagcttggc tgttttggcg gatgag                                         3566
