                         SEQUENCE LISTING

<110>  STRING BIO PRIVATE LIMITED
       KUMAR S, RAJEEV et al.
 
<120>  "RECOMBINANT METHANOTROPHIC BACTERIA FOR INDIGO BIOSYNTHESIS AND 
       METHODS THEREOF"

<130>  IP49153/VJ/mrm

<150>  IN201941031391
<151>  2019-08-02

<160>  60    

<170>  PatentIn version 3.5

<210>  1
<211>  1416
<212>  DNA
<213>  TnaA gene - Escherichia coli strain K-12 (XL1-Blue/S17-1)

<400>  1
atggaaaact ttaaacatct ccctgaaccg ttccgcattc gtgttattga gccagtaaaa       60

cgtaccactc gcgcttatcg tgaagaggca attattaaat ccggtatgaa cccgttcctg      120

ctggatagcg aagatgtttt tatcgattta ctgaccgaca gcggcaccgg ggcggtgacg      180

cagagcatgc aggctgcgat gatgcgcggc gacgaagcct acagcggcag tcgtagctac      240

tatgcgttag ccgagtcagt gaaaaatatc tttggttatc aatacaccat tccgactcac      300

cagggccgtg gcgcagagca aatctatatt ccggtactga ttaaaaaacg cgagcaggaa      360

aaaggcctgg atcgcagcaa aatggtggcg ttctctaact atttctttga taccacgcag      420

ggccatagcc agatcaacgg ctgtaccgtg cgtaacgtct atatcaaaga agccttcgat      480

acgggcgtgc gttacgactt taaaggcaac tttgaccttg agggattaga acgcggtatt      540

gaagaagttg gtccgaataa cgtgccgtat atcgttgcaa ccatcaccag taactctgca      600

ggtggtcagc cggtttcact ggcaaactta aaagcgatgt acagcatcgc gaagaaatac      660

gatattccgg tggtaatgga ctccgcgcgc tttgctgaaa acgcctattt catcaagcag      720

cgtgaagcag aatacaaaga ctggaccatc gagcagatca cccgcgaaac ctacaaatat      780

gccgatatgc tggcgatgtc cgccaagaaa gatgcgatgg tgccgatggg cggcctgctg      840

tgcatgaaag acgacagctt ctttgatgtg tacaccgagt gcagaaccct ttgcgtggtg      900

caggaaggct tcccgacata tggcggcctg gaaggcggcg cgatggagcg tctggcggta      960

ggtctgtatg acggcatgaa tctcgactgg ctggcttatc gtatcgcgca ggtacagtat     1020

ctggtcgatg gtctggaaga gattggcgtt gtctgccagc aggcgggcgg tcacgcggca     1080

ttcgttgatg ccggtaaact gttgccgcat atcccggcag accagttccc ggcacaggcg     1140

ctggcctgcg agctgtataa agtcgccggt atccgtgcgg tagaaattgg ctctttcctg     1200

ttaggccgcg atccgaaaac cggtaaacaa ctgccatgcc cggctgaact gctgcgttta     1260

accattccgc gcgcaacata tactcaaaca catatggact tcattattga agcctttaaa     1320

catgtgaaag agaacgcggc gaatattaaa ggattaacct ttacgtacga accgaaagta     1380

ttgcgtcact tcaccgcaaa acttaaagaa gtttaa                               1416


<210>  2
<211>  173
<212>  PRT
<213>  TnaA protein - Escherichia coli strain K-12 (XL1-Blue/S17-1)

<400>  2

Met Leu Arg Ala Glu Trp Leu Asn Ala Ala Val Gln Pro Gly Met Ala 
1               5                   10                  15      


Val Val Tyr Arg Phe Ser Asp Arg Gly Leu Thr Gly Lys Ser Gln Phe 
            20                  25                  30          


Leu Pro His Gly Tyr Arg Arg Leu Tyr Thr Ala Arg Arg Pro Ala Pro 
        35                  40                  45              


Val Pro Gly Thr Gly Leu Pro Gly Tyr Ala Ala Thr Val Tyr Arg His 
    50                  55                  60                  


Gln Arg Met Pro Arg Asp Arg Pro Pro Ala Gly Arg Gln Arg Gln Ser 
65                  70                  75                  80  


Leu Pro Asp His Arg Pro Asp Thr Val Pro Ala Arg Tyr Asp Lys Pro 
                85                  90                  95      


Ala Ser Arg Asp Ser Cys Arg His Thr Asp Leu Pro Pro Asp Ala Pro 
            100                 105                 110         


Ser Arg Arg Leu Pro Gly Arg His Met Ser Gly Ser Leu Pro Ala Pro 
        115                 120                 125             


Arg Lys Gly Phe Cys Thr Arg Cys Thr His Gln Arg Ser Cys Arg Leu 
    130                 135                 140                 


Ser Cys Thr Ala Gly Arg Pro Ser Ala Pro Ser His Leu Ser Trp Arg 
145                 150                 155                 160 


Thr Ser Pro Ala Tyr Arg His Ile Cys Arg Phe Arg Gly 
                165                 170             


<210>  3
<211>  1212
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mutant TrpB gene (R389P K392M mutant)

<400>  3
atgcaagacg agattcagcc ctatgacctg cccgatgagc tcggccactt tggaccttac       60

ggtggcattt tcgtcgccga gaccttgatg gagccgctgg aagagctgaa agccgcctac      120

catcgctacc tgaaggaccc ggaattcctc gccgagctgg atcacgatct gaaccactac      180

gtcggccgcc cctcaccgat ctaccatgcc gaacgcctga gccgggagct cggcggcgca      240

cagatcttct tcaagcgcga agatctcaat cataccggtg cacacaaggt caacaacacc      300

gtcggccagg cactgctggc caagcgcatg ggcaagcggc gggtgatcgc cgagaccggt      360

gccggccagc acggcgtggc cacggccacc gtggcggccc ggctggggat ggagtgcgtg      420

gtctacatgg gggcggtcga cgtccagcgc caggcgctca acgtattccg catgaagctg      480

ctcggcgcca ccgtgatagc ggtcgactcg ggttcccgga cgctcaagga cgcgctgaac      540

gaagccatgc gcgactgggt gaccaacgtc gacgatacct tctacatcat cggtacggtg      600

gcgggtcccc atccctatcc cgccatggtg cgcgatttcc aggccgtgat cggccgcgag      660

gcgcgccggc agatgctgga gatgacgggg cgtctgcccg atgccctggt cgcctgcgtg      720

ggcggcggct cgaatgccat cggcctgttt catccgttcg tcgatgaccg cgaggtcgcc      780

atgtacgggg tcgaggccgc cggggatggt atcgaaaccg gtcgccactc ggctccgctg      840

agcgccggcc gccccggcgt gctgcacggc aaccgtacct acctgatgga agacgaagac      900

ggcgagatca tcgagaccca ttccatttcc gccgggctgg actatccggg cgtcgggccg      960

gaacacgcct ggctcaagga ctgcggccgg gcgagctatg tcagtgccac cgacgccgaa     1020

gcgctcgagg cgttccatat cctgacccgg tccgagggga tcatcccggc actggaatcc     1080

agccatgccg tggcctacgc cctcaagctg gcgccgactc tcagttccga caagatcgtc     1140

ctggtcaacc tgtctggccc tggcgacatg gatatccaca ccatcgccac ccgggagggc     1200

atcgttctgt ga                                                         1212


<210>  4
<211>  297
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  mutant TrpB protein (R389P K392M mutant)

<400>  4

Met Ala Gly Phe Gln Cys Arg Asp Asp Pro Leu Gly Pro Gly Gln Asp 
1               5                   10                  15      


Met Glu Arg Leu Glu Arg Phe Gly Val Gly Gly Thr Asp Ile Ala Arg 
            20                  25                  30          


Pro Ala Ala Val Leu Glu Pro Gly Val Phe Arg Pro Asp Ala Arg Ile 
        35                  40                  45              


Val Gln Pro Gly Gly Asn Gly Met Gly Leu Asp Asp Leu Ala Val Phe 
    50                  55                  60                  


Val Phe His Gln Val Gly Thr Val Ala Val Gln His Ala Gly Ala Ala 
65                  70                  75                  80  


Gly Ala Gln Arg Ser Arg Val Ala Thr Gly Phe Asp Thr Ile Pro Gly 
                85                  90                  95      


Gly Leu Asp Pro Val His Gly Asp Leu Ala Val Ile Asp Glu Arg Met 
            100                 105                 110         


Lys Gln Ala Asp Gly Ile Arg Ala Ala Ala His Ala Gly Asp Gln Gly 
        115                 120                 125             


Ile Gly Gln Thr Pro Arg His Leu Gln His Leu Pro Ala Arg Leu Ala 
    130                 135                 140                 


Ala Asp His Gly Leu Glu Ile Ala His His Gly Gly Ile Gly Met Gly 
145                 150                 155                 160 


Thr Arg His Arg Thr Asp Asp Val Glu Gly Ile Val Asp Val Gly His 
                165                 170                 175     


Pro Val Ala His Gly Phe Val Gln Arg Val Leu Glu Arg Pro Gly Thr 
            180                 185                 190         


Arg Val Asp Arg Tyr His Gly Gly Ala Glu Gln Leu His Ala Glu Tyr 
        195                 200                 205             


Val Glu Arg Leu Ala Leu Asp Val Asp Arg Pro His Val Asp His Ala 
    210                 215                 220                 


Leu His Pro Gln Pro Gly Arg His Gly Gly Arg Gly His Ala Val Leu 
225                 230                 235                 240 


Ala Gly Thr Gly Leu Gly Asp His Pro Pro Leu Ala His Ala Leu Gly 
                245                 250                 255     


Gln Gln Cys Leu Ala Asp Gly Val Val Asp Leu Val Cys Thr Gly Met 
            260                 265                 270         


Ile Glu Ile Phe Ala Leu Glu Glu Asp Leu Cys Ala Ala Glu Leu Pro 
        275                 280                 285             


Ala Gln Ala Phe Gly Met Val Asp Arg 
    290                 295         


<210>  5
<211>  1371
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  FMO gene (codon-optimized) - Methylophaga aminisulfidivorans

<400>  5
atggcaaccc gtattgcaat tctcggagca ggaccttccg gcatggcgca actccgtgcc       60

tttcaatccg cccaagagaa gggtgccgag atccccgaac tcgtctgttt cgaaaagcaa      120

gccgactggg gcggacaatg gaattacacg tggcggacgg ggctcgacga aaatggagag      180

cccgtccact cctcgatgta ccgctatctc tggtcgaatg gacccaagga gtgtttggaa      240

tttgccgact acacgttcga cgagcacttc gggaagccca tcgcgtccta cccgccccgt      300

gaggtcttgt gggattatat caaggggcgt gtcgagaagg caggggtccg gaagtacatt      360

cggttcaata cggcagtccg gcacgtcgag ttcaatgagg actcgcaaac gttcacggtc      420

acggtccaag accacacgac ggacacgatc tattccgagg agttcgacta cgtcgtctgt      480

tgtacggggc acttctccac gccctacgtc cctgagttcg agggtttcga gaagtttggc      540

gggcggattt tgcacgccca cgactttcgt gacgccttgg agttcaagga caagacggtc      600

ctcttggtcg gatcgtccta ctccgccgaa gatattggtt cgcaatgtta caagtacggt      660

gccaagaagt tgatctcgtg ttaccggacg gcccccatgg ggtacaaatg gcccgagaat      720

tgggacgagc gtcccaattt ggtccgggtc gatacggaga atgcatactt cgccgacggt      780

tcctcggaga aagtcgatgc aatcatcttg tgtacggggt atatccacca cttccccttc      840

ctcaatgacg atctccggct cgtcacgaat aatcggctct ggcctctcaa tttgtacaag      900

ggtgtcgtct gggaagacaa tcccaagttc ttctacattg gaatgcaaga ccaatggtac      960

tccttcaata tgtttgacgc ccaagcctgg tacgcacgtg acgtcatcat ggggcgtctc     1020

cctctccctt ccaaggaaga gatgaaagcc gactcgatgg cctggcgtga aaaggagctg     1080

acgttggtca cggccgagga aatgtacacg taccaaggag attatatcca aaatctcatt     1140

gacatgacgg actacccctc gttcgacatc cccgccacga ataagacgtt cttggagtgg     1200

aagcatcaca agaaggagaa tattatgacg ttccgtgatc actcgtaccg gtcgttgatg     1260

acggggacga tggcccccaa gcaccacacg ccttggattg acgccttgga cgactccctg     1320

gaagcctact tgtccgacaa gtcggaaatc cccgtcgcca aagaagcata g              1371


<210>  6
<211>  456
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  FMO protein - Methylophaga aminisulfidivorans

<400>  6

Met Ala Thr Arg Ile Ala Ile Leu Gly Ala Gly Pro Ser Gly Met Ala 
1               5                   10                  15      


Gln Leu Arg Ala Phe Gln Ser Ala Gln Glu Lys Gly Ala Glu Ile Pro 
            20                  25                  30          


Glu Leu Val Cys Phe Glu Lys Gln Ala Asp Trp Gly Gly Gln Trp Asn 
        35                  40                  45              


Tyr Thr Trp Arg Thr Gly Leu Asp Glu Asn Gly Glu Pro Val His Ser 
    50                  55                  60                  


Ser Met Tyr Arg Tyr Leu Trp Ser Asn Gly Pro Lys Glu Cys Leu Glu 
65                  70                  75                  80  


Phe Ala Asp Tyr Thr Phe Asp Glu His Phe Gly Lys Pro Ile Ala Ser 
                85                  90                  95      


Tyr Pro Pro Arg Glu Val Leu Trp Asp Tyr Ile Lys Gly Arg Val Glu 
            100                 105                 110         


Lys Ala Gly Val Arg Lys Tyr Ile Arg Phe Asn Thr Ala Val Arg His 
        115                 120                 125             


Val Glu Phe Asn Glu Asp Ser Gln Thr Phe Thr Val Thr Val Gln Asp 
    130                 135                 140                 


His Thr Thr Asp Thr Ile Tyr Ser Glu Glu Phe Asp Tyr Val Val Cys 
145                 150                 155                 160 


Cys Thr Gly His Phe Ser Thr Pro Tyr Val Pro Glu Phe Glu Gly Phe 
                165                 170                 175     


Glu Lys Phe Gly Gly Arg Ile Leu His Ala His Asp Phe Arg Asp Ala 
            180                 185                 190         


Leu Glu Phe Lys Asp Lys Thr Val Leu Leu Val Gly Ser Ser Tyr Ser 
        195                 200                 205             


Ala Glu Asp Ile Gly Ser Gln Cys Tyr Lys Tyr Gly Ala Lys Lys Leu 
    210                 215                 220                 


Ile Ser Cys Tyr Arg Thr Ala Pro Met Gly Tyr Lys Trp Pro Glu Asn 
225                 230                 235                 240 


Trp Asp Glu Arg Pro Asn Leu Val Arg Val Asp Thr Glu Asn Ala Tyr 
                245                 250                 255     


Phe Ala Asp Gly Ser Ser Glu Lys Val Asp Ala Ile Ile Leu Cys Thr 
            260                 265                 270         


Gly Tyr Ile His His Phe Pro Phe Leu Asn Asp Asp Leu Arg Leu Val 
        275                 280                 285             


Thr Asn Asn Arg Leu Trp Pro Leu Asn Leu Tyr Lys Gly Val Val Trp 
    290                 295                 300                 


Glu Asp Asn Pro Lys Phe Phe Tyr Ile Gly Met Gln Asp Gln Trp Tyr 
305                 310                 315                 320 


Ser Phe Asn Met Phe Asp Ala Gln Ala Trp Tyr Ala Arg Asp Val Ile 
                325                 330                 335     


Met Gly Arg Leu Pro Leu Pro Ser Lys Glu Glu Met Lys Ala Asp Ser 
            340                 345                 350         


Met Ala Trp Arg Glu Lys Glu Leu Thr Leu Val Thr Ala Glu Glu Met 
        355                 360                 365             


Tyr Thr Tyr Gln Gly Asp Tyr Ile Gln Asn Leu Ile Asp Met Thr Asp 
    370                 375                 380                 


Tyr Pro Ser Phe Asp Ile Pro Ala Thr Asn Lys Thr Phe Leu Glu Trp 
385                 390                 395                 400 


Lys His His Lys Lys Glu Asn Ile Met Thr Phe Arg Asp His Ser Tyr 
                405                 410                 415     


Arg Ser Leu Met Thr Gly Thr Met Ala Pro Lys His His Thr Pro Trp 
            420                 425                 430         


Ile Asp Ala Leu Asp Asp Ser Leu Glu Ala Tyr Leu Ser Asp Lys Ser 
        435                 440                 445             


Glu Ile Pro Val Ala Lys Glu Ala 
    450                 455     


<210>  7
<211>  1167
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  IacA gene (codon-optimized) - Acinetobacter baumannii 

<400>  7
atgaacaagc tgtcgaagat ggagttcgcc tcgcaggaca aggccgtgga cctggacgcc       60

ctgtgccagg agatccggga gcgcgcctgc accggcgaat ttgacaacca ggcctacgtg      120

agccaggaca tcatcgagaa gctgaagcag atcggcgtct accgggccct ggtcccgaag      180

cgcttcggcg gcgaagagtg gagcccgcgc cagttctgcg agctgatcga gaccctgtcg      240

aaggcggacg gcagcgtcgg ctgggtcgcc agcttcggca tgtcgccggc ctacctgggc      300

agcctgccgg aagagaccct gaaggagctg taccagaacg gcccggacgt cgtcttcgcg      360

ggcggcatct tcccgccgca gccggccgag atcaccgacg aaggcgtcgt cgtccgcggc      420

cgctggaagt tctccagcgg ctgcatgggc gcggacatcg tcggcgtggg catctcgccg      480

ctgaagaaca acgaaatgca gggcctgccg cgcatggccg tgatgccggc caagaaggcc      540

aagatcgaga tgacctggga caccgtgggc ctgaagggca ccggctcgca cgacctggtc      600

gtcgaggacg tcctggtcga gaagaagtgg accttcgtgc gcggcgagcc gagcaagctg      660

tcggagccgt tcttcaagta cccgtcgctg tcgctggcca cccaggtcct gaccgtggtc      720

ggcatcggcg tcgcggccgc cgccctggag gagttcgaaa agctggcccc gggcaaggcc      780

agcatcaccg gcggcagcga aatcgccaac cgcccggtca cccagtacga atttgcccag      840

gccgacgccg agttccaggc cgccaagagc tggttctacc agaccatgga catcgtctgg      900

aacgaaatca tcgccggccg cgaggccacc gccgaacaga tcagcgacat gcgcctggcc      960

tgcacccacg ccgcccgcgt ctgcgccaag gtcacccgca agatgcagat gctggccggc     1020

atgaccgcca tctacaccaa caaccccttc agccgcttcg tcaacgacac caacgtcgtc     1080

acccagcacg ccttcatggg cgacgccacc ctgcaaaacg ccggcctggt cagcttcggc     1140

ctgaagcccg cccccggcta cctgtga                                         1167


<210>  8
<211>  388
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IacA protein - Acinetobacter baumannii 

<400>  8

Met Asn Lys Leu Ser Lys Met Glu Phe Ala Ser Gln Asp Lys Ala Val 
1               5                   10                  15      


Asp Leu Asp Ala Leu Cys Gln Glu Ile Arg Glu Arg Ala Cys Thr Gly 
            20                  25                  30          


Glu Phe Asp Asn Gln Ala Tyr Val Ser Gln Asp Ile Ile Glu Lys Leu 
        35                  40                  45              


Lys Gln Ile Gly Val Tyr Arg Ala Leu Val Pro Lys Arg Phe Gly Gly 
    50                  55                  60                  


Glu Glu Trp Ser Pro Arg Gln Phe Cys Glu Leu Ile Glu Thr Leu Ser 
65                  70                  75                  80  


Lys Ala Asp Gly Ser Val Gly Trp Val Ala Ser Phe Gly Met Ser Pro 
                85                  90                  95      


Ala Tyr Leu Gly Ser Leu Pro Glu Glu Thr Leu Lys Glu Leu Tyr Gln 
            100                 105                 110         


Asn Gly Pro Asp Val Val Phe Ala Gly Gly Ile Phe Pro Pro Gln Pro 
        115                 120                 125             


Ala Glu Ile Thr Asp Glu Gly Val Val Val Arg Gly Arg Trp Lys Phe 
    130                 135                 140                 


Ser Ser Gly Cys Met Gly Ala Asp Ile Val Gly Val Gly Ile Ser Pro 
145                 150                 155                 160 


Leu Lys Asn Asn Glu Met Gln Gly Leu Pro Arg Met Ala Val Met Pro 
                165                 170                 175     


Ala Lys Lys Ala Lys Ile Glu Met Thr Trp Asp Thr Val Gly Leu Lys 
            180                 185                 190         


Gly Thr Gly Ser His Asp Leu Val Val Glu Asp Val Leu Val Glu Lys 
        195                 200                 205             


Lys Trp Thr Phe Val Arg Gly Glu Pro Ser Lys Leu Ser Glu Pro Phe 
    210                 215                 220                 


Phe Lys Tyr Pro Ser Leu Ser Leu Ala Thr Gln Val Leu Thr Val Val 
225                 230                 235                 240 


Gly Ile Gly Val Ala Ala Ala Ala Leu Glu Glu Phe Glu Lys Leu Ala 
                245                 250                 255     


Pro Gly Lys Ala Ser Ile Thr Gly Gly Ser Glu Ile Ala Asn Arg Pro 
            260                 265                 270         


Val Thr Gln Tyr Glu Phe Ala Gln Ala Asp Ala Glu Phe Gln Ala Ala 
        275                 280                 285             


Lys Ser Trp Phe Tyr Gln Thr Met Asp Ile Val Trp Asn Glu Ile Ile 
    290                 295                 300                 


Ala Gly Arg Glu Ala Thr Ala Glu Gln Ile Ser Asp Met Arg Leu Ala 
305                 310                 315                 320 


Cys Thr His Ala Ala Arg Val Cys Ala Lys Val Thr Arg Lys Met Gln 
                325                 330                 335     


Met Leu Ala Gly Met Thr Ala Ile Tyr Thr Asn Asn Pro Phe Ser Arg 
            340                 345                 350         


Phe Val Asn Asp Thr Asn Val Val Thr Gln His Ala Phe Met Gly Asp 
        355                 360                 365             


Ala Thr Leu Gln Asn Ala Gly Leu Val Ser Phe Gly Leu Lys Pro Ala 
    370                 375                 380                 


Pro Gly Tyr Leu 
385             


<210>  9
<211>  1092
<212>  DNA
<213>  DAHP Synthase gene - Methylococcus capsulatus

<400>  9
atgcccagcg tgtacaacac cgacgatctt cgcatctgcg agatcaagga agtcattccg       60

cccgtccagg ttcatgagga attcccgatc acggaccggg ccgcactcac gacactgacc      120

gcccgccgag ggattcacgc aatcctttcc aaggaggacg accgcctgct ggtggtgatc      180

gggccctgtt cgatccatga ccccaaggcc gcgctcgaat acggggagcg gctgctgcca      240

ctccgccaga aactggcgag acatctggaa atcgtgatgc gggtctattt cgagaagccg      300

cgaacgaccg tcggctggaa gggcctgatc aatgatcccg atctggacga gagtttcaac      360

atcaacaaag gcttgcgcct cgcccgcaag ctgttgctcg atctgaacga actgggcatg      420

cccgcggcca ccgagtacct cgatctcatc accccgcagt atgtctccga cctgatcgct      480

tggggcgcca tcggtgctcg taccacggag agccagtctc accgtgaact ggcatcgggg      540

ctgtcatgtc cggttggatt caagaacgcc accgacggca cgatcaaggt tgctgtcgac      600

gccataggtg cggcacggcg gccacatcat ttcctgtctt tgaccaaggc cggtcattcg      660

gcgatcttct ccacgaccgg taacgccgac tgtcacatca tccttcgtgg cggagcccgg      720

ccgaattacg acgcggccag cgtcgaagcg gcggccaggg cgctggaagc cgtcggcctg      780

ccgcccaaca tcatggtgga ctgcagccat gccaacagca tgaaggatta cctgaagcag      840

ctgcgggtgg ccgaggacgt ggccgaacag atagacggcg gcgacaggcg gatcatcggc      900

ttgatggtgg aaagtcacct caagccgggc aatcagaaac tccacaaggg catggttccc      960

gaatacggcg tcagcatcac cgatgcctgc atcggctggg atgacagcgt ggccgtgctg     1020

gaacggctcg ccgccgcggt ggagagccgg cgcggccggt cggcaggcat ccggaacgtg     1080

cggggggcct ga                                                         1092


<210>  10
<211>  363
<212>  PRT
<213>  DAHP Synthase protein - Methylococcus capsulatus

<400>  10

Met Pro Ser Val Tyr Asn Thr Asp Asp Leu Arg Ile Cys Glu Ile Lys 
1               5                   10                  15      


Glu Val Ile Pro Pro Val Gln Val His Glu Glu Phe Pro Ile Thr Asp 
            20                  25                  30          


Arg Ala Ala Leu Thr Thr Leu Thr Ala Arg Arg Gly Ile His Ala Ile 
        35                  40                  45              


Leu Ser Lys Glu Asp Asp Arg Leu Leu Val Val Ile Gly Pro Cys Ser 
    50                  55                  60                  


Ile His Asp Pro Lys Ala Ala Leu Glu Tyr Gly Glu Arg Leu Leu Pro 
65                  70                  75                  80  


Leu Arg Gln Lys Leu Ala Arg His Leu Glu Ile Val Met Arg Val Tyr 
                85                  90                  95      


Phe Glu Lys Pro Arg Thr Thr Val Gly Trp Lys Gly Leu Ile Asn Asp 
            100                 105                 110         


Pro Asp Leu Asp Glu Ser Phe Asn Ile Asn Lys Gly Leu Arg Leu Ala 
        115                 120                 125             


Arg Lys Leu Leu Leu Asp Leu Asn Glu Leu Gly Met Pro Ala Ala Thr 
    130                 135                 140                 


Glu Tyr Leu Asp Leu Ile Thr Pro Gln Tyr Val Ser Asp Leu Ile Ala 
145                 150                 155                 160 


Trp Gly Ala Ile Gly Ala Arg Thr Thr Glu Ser Gln Ser His Arg Glu 
                165                 170                 175     


Leu Ala Ser Gly Leu Ser Cys Pro Val Gly Phe Lys Asn Ala Thr Asp 
            180                 185                 190         


Gly Thr Ile Lys Val Ala Val Asp Ala Ile Gly Ala Ala Arg Arg Pro 
        195                 200                 205             


His His Phe Leu Ser Leu Thr Lys Ala Gly His Ser Ala Ile Phe Ser 
    210                 215                 220                 


Thr Thr Gly Asn Ala Asp Cys His Ile Ile Leu Arg Gly Gly Ala Arg 
225                 230                 235                 240 


Pro Asn Tyr Asp Ala Ala Ser Val Glu Ala Ala Ala Arg Ala Leu Glu 
                245                 250                 255     


Ala Val Gly Leu Pro Pro Asn Ile Met Val Asp Cys Ser His Ala Asn 
            260                 265                 270         


Ser Met Lys Asp Tyr Leu Lys Gln Leu Arg Val Ala Glu Asp Val Ala 
        275                 280                 285             


Glu Gln Ile Asp Gly Gly Asp Arg Arg Ile Ile Gly Leu Met Val Glu 
    290                 295                 300                 


Ser His Leu Lys Pro Gly Asn Gln Lys Leu His Lys Gly Met Val Pro 
305                 310                 315                 320 


Glu Tyr Gly Val Ser Ile Thr Asp Ala Cys Ile Gly Trp Asp Asp Ser 
                325                 330                 335     


Val Ala Val Leu Glu Arg Leu Ala Ala Ala Val Glu Ser Arg Arg Gly 
            340                 345                 350         


Arg Ser Ala Gly Ile Arg Asn Val Arg Gly Ala 
        355                 360             


<210>  11
<211>  1071
<212>  DNA
<213>  AroF gene - Escherichia coli

<400>  11
atgcaaaaag acgcgctgaa taacgtacat attaccgacg aacaggtttt aatgactccg       60

gaacaactga aggccgcttt tccattgagc ctgcaacaag aagcccagat tgctgactcg      120

cgtaaaagca tttcagatat tatcgccggg cgcgatcctc gtctgctggt agtatgtggt      180

ccttgttcca ttcatgatcc ggaaactgct ctggaatatg ctcgtcgatt taaagccctt      240

gccgcagagg tcagcgatag cctctatctg gtaatgcgcg tctattttga aaaaccccgt      300

accactgtcg gctggaaagg gttaattaac gatccccata tggatggctc ttttgatgta      360

gaagccgggc tgcagatcgc gcgtaaattg ctgcttgagc tggtgaatat gggactgcca      420

ctggcgacgg aagcgttaga tccgaatagc ccgcaatacc tgggcgatct gtttagctgg      480

tcagcaattg gtgctcgtac aacggaatcg caaactcacc gtgaaatggc ctccgggctt      540

tccatgccgg ttggttttaa aaacggcacc gacggcagtc tggcaacagc aattaacgct      600

atgcgcgccg ccgcccagcc gcaccgtttt gttggcatta accaggcagg gcaggttgcg      660

ttgctacaaa ctcaggggaa tccggacggc catgtgatcc tgcgcggtgg taaagcgccg      720

aactatagcc ctgcggatgt tgcgcaatgt gaaaaagaga tggaacaggc gggactgcgc      780

ccgtctctga tggtagattg cagccacggt aattccaata aagattatcg ccgtcagcct      840

gcggtggcag aatccgtggt tgctcaaatc aaagatggca atcgctcaat tattggtctg      900

atgatcgaaa gtaatatcca cgagggcaat cagtcttccg agcaaccgcg cagtgaaatg      960

aaatacggtg tatccgtaac cgatgcctgc attagctggg aaatgaccga tgccttgctg     1020

cgtgaaattc atcaggatct gaacgggcag ctgacggctc gcgtggctta a              1071


<210>  12
<211>  356
<212>  PRT
<213>  AroF protein - Escherichia coli

<400>  12

Met Gln Lys Asp Ala Leu Asn Asn Val His Ile Thr Asp Glu Gln Val 
1               5                   10                  15      


Leu Met Thr Pro Glu Gln Leu Lys Ala Ala Phe Pro Leu Ser Leu Gln 
            20                  25                  30          


Gln Glu Ala Gln Ile Ala Asp Ser Arg Lys Ser Ile Ser Asp Ile Ile 
        35                  40                  45              


Ala Gly Arg Asp Pro Arg Leu Leu Val Val Cys Gly Pro Cys Ser Ile 
    50                  55                  60                  


His Asp Pro Glu Thr Ala Leu Glu Tyr Ala Arg Arg Phe Lys Ala Leu 
65                  70                  75                  80  


Ala Ala Glu Val Ser Asp Ser Leu Tyr Leu Val Met Arg Val Tyr Phe 
                85                  90                  95      


Glu Lys Pro Arg Thr Thr Val Gly Trp Lys Gly Leu Ile Asn Asp Pro 
            100                 105                 110         


His Met Asp Gly Ser Phe Asp Val Glu Ala Gly Leu Gln Ile Ala Arg 
        115                 120                 125             


Lys Leu Leu Leu Glu Leu Val Asn Met Gly Leu Pro Leu Ala Thr Glu 
    130                 135                 140                 


Ala Leu Asp Pro Asn Ser Pro Gln Tyr Leu Gly Asp Leu Phe Ser Trp 
145                 150                 155                 160 


Ser Ala Ile Gly Ala Arg Thr Thr Glu Ser Gln Thr His Arg Glu Met 
                165                 170                 175     


Ala Ser Gly Leu Ser Met Pro Val Gly Phe Lys Asn Gly Thr Asp Gly 
            180                 185                 190         


Ser Leu Ala Thr Ala Ile Asn Ala Met Arg Ala Ala Ala Gln Pro His 
        195                 200                 205             


Arg Phe Val Gly Ile Asn Gln Ala Gly Gln Val Ala Leu Leu Gln Thr 
    210                 215                 220                 


Gln Gly Asn Pro Asp Gly His Val Ile Leu Arg Gly Gly Lys Ala Pro 
225                 230                 235                 240 


Asn Tyr Ser Pro Ala Asp Val Ala Gln Cys Glu Lys Glu Met Glu Gln 
                245                 250                 255     


Ala Gly Leu Arg Pro Ser Leu Met Val Asp Cys Ser His Gly Asn Ser 
            260                 265                 270         


Asn Lys Asp Tyr Arg Arg Gln Pro Ala Val Ala Glu Ser Val Val Ala 
        275                 280                 285             


Gln Ile Lys Asp Gly Asn Arg Ser Ile Ile Gly Leu Met Ile Glu Ser 
    290                 295                 300                 


Asn Ile His Glu Gly Asn Gln Ser Ser Glu Gln Pro Arg Ser Glu Met 
305                 310                 315                 320 


Lys Tyr Gly Val Ser Val Thr Asp Ala Cys Ile Ser Trp Glu Met Thr 
                325                 330                 335     


Asp Ala Leu Leu Arg Glu Ile His Gln Asp Leu Asn Gly Gln Leu Thr 
            340                 345                 350         


Ala Arg Val Ala 
        355     


<210>  13
<211>  1080
<212>  DNA
<213>  AroB gene - Methylococcus capsulatus

<400>  13
atgaaaacct tacacgtcga gctgggggag cgcggctacc ccatttatat aggacggggc       60

ctgctgggcc atcccgacct gatacaggcc catctgccgg gcgggcaggt cctggtggtg      120

accaacgaag tggtggcgcc gctgtacctc gaccgcatgc ttgcatccct ggccggcaag      180

gacacgggca gtgtcgtgct tcccgacggc gaggcccaca agaccctgga ctcggcgatg      240

gccgtgttcg atgccttgct ggcccggcgt ttcggccgca acgccgccat cgtggcgctc      300

ggcggcgggg tgatcggcga tctggccggt ttcgcggcag cctgctatca gcgcggcgtg      360

cctttcatcc aggtgcccac caccctgttg tctcaggtcg actcctcggt gggaggcaag      420

accgcggtca accatccgcg cggcaagaac atgatcggcg ccttctacca gccgcgctgc      480

gttctggccg acaccgacac tctggatacg ttgcccgacc gcgaactgag cgcgggtctg      540

gccgaggtca tcaagtacgg cttcatccgt gacccggaat tcctggcctg gctcgaagcg      600

aacgtcgagc gcttgctgca gcgcgatccc gaagcgctcg cctatgccat cgagcggtcc      660

tgcatcaaca aggcggaaat cgtggcggaa gacgagaccg aaaccggggt gcgggcgacg      720

ctgaacctgg ggcacacttt cggccacgcc atcgaaaccg gcatgggcta tggtgtatgt      780

ctgcacggcg aagcggtggc gatcggtatg tgccaggcgg ccgatctgtc ccgtcgcttg      840

ggctggatcg gtgacgacga ggtggcgagg gtgatccgcc tgctggagcg ggcgcggctg      900

ccggtcgtcc cgccgcgcga gttggatgcg gacgcctttc tcgaacacat ggcggtcgac      960

aagaagaacg tcgacggcgg tctgcgactg gttctgctca aatccctggg tgaggcgacc     1020

ctgccggtgg ccgtggacgc cggactgtta cgggccacat tggaatgcta cggccgctga     1080


<210>  14
<211>  359
<212>  PRT
<213>  AroB protein - Methylococcus capsulatus

<400>  14

Met Lys Thr Leu His Val Glu Leu Gly Glu Arg Gly Tyr Pro Ile Tyr 
1               5                   10                  15      


Ile Gly Arg Gly Leu Leu Gly His Pro Asp Leu Ile Gln Ala His Leu 
            20                  25                  30          


Pro Gly Gly Gln Val Leu Val Val Thr Asn Glu Val Val Ala Pro Leu 
        35                  40                  45              


Tyr Leu Asp Arg Met Leu Ala Ser Leu Ala Gly Lys Asp Thr Gly Ser 
    50                  55                  60                  


Val Val Leu Pro Asp Gly Glu Ala His Lys Thr Leu Asp Ser Ala Met 
65                  70                  75                  80  


Ala Val Phe Asp Ala Leu Leu Ala Arg Arg Phe Gly Arg Asn Ala Ala 
                85                  90                  95      


Ile Val Ala Leu Gly Gly Gly Val Ile Gly Asp Leu Ala Gly Phe Ala 
            100                 105                 110         


Ala Ala Cys Tyr Gln Arg Gly Val Pro Phe Ile Gln Val Pro Thr Thr 
        115                 120                 125             


Leu Leu Ser Gln Val Asp Ser Ser Val Gly Gly Lys Thr Ala Val Asn 
    130                 135                 140                 


His Pro Arg Gly Lys Asn Met Ile Gly Ala Phe Tyr Gln Pro Arg Cys 
145                 150                 155                 160 


Val Leu Ala Asp Thr Asp Thr Leu Asp Thr Leu Pro Asp Arg Glu Leu 
                165                 170                 175     


Ser Ala Gly Leu Ala Glu Val Ile Lys Tyr Gly Phe Ile Arg Asp Pro 
            180                 185                 190         


Glu Phe Leu Ala Trp Leu Glu Ala Asn Val Glu Arg Leu Leu Gln Arg 
        195                 200                 205             


Asp Pro Glu Ala Leu Ala Tyr Ala Ile Glu Arg Ser Cys Ile Asn Lys 
    210                 215                 220                 


Ala Glu Ile Val Ala Glu Asp Glu Thr Glu Thr Gly Val Arg Ala Thr 
225                 230                 235                 240 


Leu Asn Leu Gly His Thr Phe Gly His Ala Ile Glu Thr Gly Met Gly 
                245                 250                 255     


Tyr Gly Val Cys Leu His Gly Glu Ala Val Ala Ile Gly Met Cys Gln 
            260                 265                 270         


Ala Ala Asp Leu Ser Arg Arg Leu Gly Trp Ile Gly Asp Asp Glu Val 
        275                 280                 285             


Ala Arg Val Ile Arg Leu Leu Glu Arg Ala Arg Leu Pro Val Val Pro 
    290                 295                 300                 


Pro Arg Glu Leu Asp Ala Asp Ala Phe Leu Glu His Met Ala Val Asp 
305                 310                 315                 320 


Lys Lys Asn Val Asp Gly Gly Leu Arg Leu Val Leu Leu Lys Ser Leu 
                325                 330                 335     


Gly Glu Ala Thr Leu Pro Val Ala Val Asp Ala Gly Leu Leu Arg Ala 
            340                 345                 350         


Thr Leu Glu Cys Tyr Gly Arg 
        355                 


<210>  15
<211>  447
<212>  DNA
<213>  Aro D gene - Methylococcus capsulatus

<400>  15
atggcgggta tcttggtgct gaacgggcct aacctcaatc tgttgggggt acgtgagccg       60

ggtatctatg gcagcgacac gctttcggat atcgaatcgc gtctgcaggc acaggccagg      120

gtggcaggca tgccgatcga tttcttccag agcaatgccg agcatgctct gatcgaacgc      180

attcaccagg cgttccgcga tgcggtcgac atgatcatca tcaatcccgg cgccctcacc      240

cataccagcg tcgctttgcg cgatgcgttg ctggccaccg ccgtgccttt cattgaagta      300

cacatttcga acgttcatgc gcgcgagccg ttccgccgcc attcctatct ttccgatatt      360

gccagggggg tcatctgcgg attgggcccc atgggctacg aactggcgct ccaggccgcc      420

ctgcaaatga cacataggtc gttatag                                          447


<210>  16
<211>  148
<212>  PRT
<213>  Aro D protein - Methylococcus capsulatus

<400>  16

Met Ala Gly Ile Leu Val Leu Asn Gly Pro Asn Leu Asn Leu Leu Gly 
1               5                   10                  15      


Val Arg Glu Pro Gly Ile Tyr Gly Ser Asp Thr Leu Ser Asp Ile Glu 
            20                  25                  30          


Ser Arg Leu Gln Ala Gln Ala Arg Val Ala Gly Met Pro Ile Asp Phe 
        35                  40                  45              


Phe Gln Ser Asn Ala Glu His Ala Leu Ile Glu Arg Ile His Gln Ala 
    50                  55                  60                  


Phe Arg Asp Ala Val Asp Met Ile Ile Ile Asn Pro Gly Ala Leu Thr 
65                  70                  75                  80  


His Thr Ser Val Ala Leu Arg Asp Ala Leu Leu Ala Thr Ala Val Pro 
                85                  90                  95      


Phe Ile Glu Val His Ile Ser Asn Val His Ala Arg Glu Pro Phe Arg 
            100                 105                 110         


Arg His Ser Tyr Leu Ser Asp Ile Ala Arg Gly Val Ile Cys Gly Leu 
        115                 120                 125             


Gly Pro Met Gly Tyr Glu Leu Ala Leu Gln Ala Ala Leu Gln Met Thr 
    130                 135                 140                 


His Arg Ser Leu 
145             


<210>  17
<211>  840
<212>  DNA
<213>  Aro E gene - Methylococcus capsulatus

<400>  17
atgacccagc ccgaccgata cgccgtgttc gggcacccga tcgaacacag ccagtcaccc       60

cgcatccatg ccctgttcgc cgcccagacc ggccaggacc tgatctacac cgccgaggac      120

gtgccacccg accggttcga atcctgcgtc cgcgcgttct tcgacggcgg tggccgcggc      180

ctcaactgca cgatcccgct caaggagatg gcctggctgc tcgcggacag ccgcagcggc      240

agggcaaagc gggcgcgtgc ggtcaacacg ctgctcctgc gggccgatgg ctcgatcttc      300

ggcgacaaca ccgatggcat cggtctgctc cgcgacctgc gggacaacct cggactgaac      360

ctcgcgggca cgaaaatcct catactcggc gccggcgggg cgacgcgggg aatcctggcg      420

cccctgctgg ccgagcggcc ggaccggctg gtcatcgcca accgcaccgt cgccacggcg      480

gaaaccctga ccgtggaatt cggcgacctg ggccccgtcg aaggctgcgg cttcgctgca      540

ttggccggtc gccgcttcga cctgatcatc aacgccaccg ccgccagtct gagcggcgaa      600

ctcccgccgc tccccgccga catactcgcc cccggcggca gttgctacga cctggcctat      660

gccgccgaac cgacgccctt cgtgcggtgg ggccaggaaa agcaagcggt cgtcagtgcc      720

gacggcatcg gcatgctggt ggaacaggcc gccgaagcct tcctgctctg gcgcggtgtg      780

cgcccgcaaa cacgcccggt gatcgagacg ctcgaagccg agcgacgaac cgcgaagtga      840


<210>  18
<211>  279
<212>  PRT
<213>  Aro E protein - Methylococcus capsulatus

<400>  18

Met Thr Gln Pro Asp Arg Tyr Ala Val Phe Gly His Pro Ile Glu His 
1               5                   10                  15      


Ser Gln Ser Pro Arg Ile His Ala Leu Phe Ala Ala Gln Thr Gly Gln 
            20                  25                  30          


Asp Leu Ile Tyr Thr Ala Glu Asp Val Pro Pro Asp Arg Phe Glu Ser 
        35                  40                  45              


Cys Val Arg Ala Phe Phe Asp Gly Gly Gly Arg Gly Leu Asn Cys Thr 
    50                  55                  60                  


Ile Pro Leu Lys Glu Met Ala Trp Leu Leu Ala Asp Ser Arg Ser Gly 
65                  70                  75                  80  


Arg Ala Lys Arg Ala Arg Ala Val Asn Thr Leu Leu Leu Arg Ala Asp 
                85                  90                  95      


Gly Ser Ile Phe Gly Asp Asn Thr Asp Gly Ile Gly Leu Leu Arg Asp 
            100                 105                 110         


Leu Arg Asp Asn Leu Gly Leu Asn Leu Ala Gly Thr Lys Ile Leu Ile 
        115                 120                 125             


Leu Gly Ala Gly Gly Ala Thr Arg Gly Ile Leu Ala Pro Leu Leu Ala 
    130                 135                 140                 


Glu Arg Pro Asp Arg Leu Val Ile Ala Asn Arg Thr Val Ala Thr Ala 
145                 150                 155                 160 


Glu Thr Leu Thr Val Glu Phe Gly Asp Leu Gly Pro Val Glu Gly Cys 
                165                 170                 175     


Gly Phe Ala Ala Leu Ala Gly Arg Arg Phe Asp Leu Ile Ile Asn Ala 
            180                 185                 190         


Thr Ala Ala Ser Leu Ser Gly Glu Leu Pro Pro Leu Pro Ala Asp Ile 
        195                 200                 205             


Leu Ala Pro Gly Gly Ser Cys Tyr Asp Leu Ala Tyr Ala Ala Glu Pro 
    210                 215                 220                 


Thr Pro Phe Val Arg Trp Gly Gln Glu Lys Gln Ala Val Val Ser Ala 
225                 230                 235                 240 


Asp Gly Ile Gly Met Leu Val Glu Gln Ala Ala Glu Ala Phe Leu Leu 
                245                 250                 255     


Trp Arg Gly Val Arg Pro Gln Thr Arg Pro Val Ile Glu Thr Leu Glu 
            260                 265                 270         


Ala Glu Arg Arg Thr Ala Lys 
        275                 


<210>  19
<211>  540
<212>  DNA
<213>  Aro K gene - Methylococcus capsulatus

<400>  19
atgcgaaacc gtcgaaacat cttcctgatc ggcccgatgg gagcgggcaa gaccaccgtg       60

ggacgtctgc tcgcccgtgc cctggggatg gagttctggg acagcgacaa ggaaatcgaa      120

cgccggaccg gcgtcacggt gccgatgatt ttcgaatacg agggcgaggc cggattccgg      180

cgccgcgaat cggaagtcat cgccgatctc acgggcaagg aaaggatcgt gctggccacc      240

ggcggcggtt cggtgctggc agcggagaac cgggagcatc tggcggcacg ggggctggta      300

atttacctgc agtgttcggt ccagaagcag ttggagagga cgcacaagga catgaaccgg      360

cccttgttgc agacggagaa tcccaggcaa aggctggaag aactgctgcg ggtgagggat      420

cccatctacc gcgagcttgc cgactacgtc gtcgataccg gccagcattc gagccgcagt      480

gccgtgcgcc ggatcatcaa cgcctacgag aaatccggaa ccagactgcg gacggaatga      540


<210>  20
<211>  179
<212>  PRT
<213>  Aro K protein - Methylococcus capsulatus

<400>  20

Met Arg Asn Arg Arg Asn Ile Phe Leu Ile Gly Pro Met Gly Ala Gly 
1               5                   10                  15      


Lys Thr Thr Val Gly Arg Leu Leu Ala Arg Ala Leu Gly Met Glu Phe 
            20                  25                  30          


Trp Asp Ser Asp Lys Glu Ile Glu Arg Arg Thr Gly Val Thr Val Pro 
        35                  40                  45              


Met Ile Phe Glu Tyr Glu Gly Glu Ala Gly Phe Arg Arg Arg Glu Ser 
    50                  55                  60                  


Glu Val Ile Ala Asp Leu Thr Gly Lys Glu Arg Ile Val Leu Ala Thr 
65                  70                  75                  80  


Gly Gly Gly Ser Val Leu Ala Ala Glu Asn Arg Glu His Leu Ala Ala 
                85                  90                  95      


Arg Gly Leu Val Ile Tyr Leu Gln Cys Ser Val Gln Lys Gln Leu Glu 
            100                 105                 110         


Arg Thr His Lys Asp Met Asn Arg Pro Leu Leu Gln Thr Glu Asn Pro 
        115                 120                 125             


Arg Gln Arg Leu Glu Glu Leu Leu Arg Val Arg Asp Pro Ile Tyr Arg 
    130                 135                 140                 


Glu Leu Ala Asp Tyr Val Val Asp Thr Gly Gln His Ser Ser Arg Ser 
145                 150                 155                 160 


Ala Val Arg Arg Ile Ile Asn Ala Tyr Glu Lys Ser Gly Thr Arg Leu 
                165                 170                 175     


Arg Thr Glu 
            


<210>  21
<211>  1269
<212>  DNA
<213>  Aro A gene - Methylococcus capsulatus

<400>  21
atgcagggcg acatccgggt accgggcgac aagtccatct cccaccggtc ggtgatgctg       60

ggctcgctcg ccgagggcgt gactgaggtg agtggcttcc tccaggctga ggactgtttg      120

gcgaccatgg cggcgttccg ggccatgggc gtcgaaatcg aaggcccgac ggagggccgg      180

ctgcggatcc acggcgtcgg cctgcacggc ctgaagccac ctgccgcccc cctggatctc      240

ggcaattccg gcacctccat gcggctattg agcggactgt tggcgggaca ggcattcgac      300

accacgctga ccggcgatgc ctccctggtg cgccggccga tgcggcgggt gaccgaaccg      360

ctgcgcgcca tgggcgcgcg gatcgacacc accgaagccg gcaccgcgcc actgcgcatc      420

gccggcggaa gccgcctcaa agggatcgac tatgcgatgc cggtcgccag cgcccaggtg      480

aaatcctgtc tgctgctggc gggcctctac gcggaaggga agacctgtgt caccgagccg      540

gcgccgaccc gcgaccacac cgaacgcatg ctggcgggtt tcggctatcc ggtggcgcga      600

gatggcaacc gtgtatgcat ccaatccggc ggcaagcttt ccgcgacccg tatcgacgta      660

ccggcggaca tttcctcggc ggcgttcttc atgataggcg cagcgatcag ccctgggtcc      720

gacgtgttcc tccgccatgt cgggatcaat ccgacccgga ccggcgtcat cgaaatcctg      780

cgcgaaatgg gcgccgacat cgagatactc gctccgcgcg aagtcggcgg tgaaccggtg      840

gcggacctcc gcatccgtta ccgggaactg cgcggcatcc gcattcccga acataccgtg      900

ccgctggcca ttgacgaatt cccggccctg ttcatcgccg cagcctgcgc cacaggcgaa      960

acggtgctga ccggggccga ggagctgcga gtcaaggaaa gcgaccgtat ccaggccatg     1020

gccgacggcc tgaccacgct gggcatcgat gcccgcccga cccccgatgg catggtcatc     1080

cggggcggga gtttccgcgg cggcgcagtc gattcgcgcg gcgatcatcg catcgccatg     1140

tcattctcga tcgcggcatt gcgcgctccc atccccatcg agattcacga ctgcgccaac     1200

gtggcgacat cttttcccaa tttcgtcgaa ctggcgcgga ccctgggttt ggacatcgag     1260

gtcagctga                                                             1269


<210>  22
<211>  422
<212>  PRT
<213>  Aro A protein - Methylococcus capsulatus

<400>  22

Met Gln Gly Asp Ile Arg Val Pro Gly Asp Lys Ser Ile Ser His Arg 
1               5                   10                  15      


Ser Val Met Leu Gly Ser Leu Ala Glu Gly Val Thr Glu Val Ser Gly 
            20                  25                  30          


Phe Leu Gln Ala Glu Asp Cys Leu Ala Thr Met Ala Ala Phe Arg Ala 
        35                  40                  45              


Met Gly Val Glu Ile Glu Gly Pro Thr Glu Gly Arg Leu Arg Ile His 
    50                  55                  60                  


Gly Val Gly Leu His Gly Leu Lys Pro Pro Ala Ala Pro Leu Asp Leu 
65                  70                  75                  80  


Gly Asn Ser Gly Thr Ser Met Arg Leu Leu Ser Gly Leu Leu Ala Gly 
                85                  90                  95      


Gln Ala Phe Asp Thr Thr Leu Thr Gly Asp Ala Ser Leu Val Arg Arg 
            100                 105                 110         


Pro Met Arg Arg Val Thr Glu Pro Leu Arg Ala Met Gly Ala Arg Ile 
        115                 120                 125             


Asp Thr Thr Glu Ala Gly Thr Ala Pro Leu Arg Ile Ala Gly Gly Ser 
    130                 135                 140                 


Arg Leu Lys Gly Ile Asp Tyr Ala Met Pro Val Ala Ser Ala Gln Val 
145                 150                 155                 160 


Lys Ser Cys Leu Leu Leu Ala Gly Leu Tyr Ala Glu Gly Lys Thr Cys 
                165                 170                 175     


Val Thr Glu Pro Ala Pro Thr Arg Asp His Thr Glu Arg Met Leu Ala 
            180                 185                 190         


Gly Phe Gly Tyr Pro Val Ala Arg Asp Gly Asn Arg Val Cys Ile Gln 
        195                 200                 205             


Ser Gly Gly Lys Leu Ser Ala Thr Arg Ile Asp Val Pro Ala Asp Ile 
    210                 215                 220                 


Ser Ser Ala Ala Phe Phe Met Ile Gly Ala Ala Ile Ser Pro Gly Ser 
225                 230                 235                 240 


Asp Val Phe Leu Arg His Val Gly Ile Asn Pro Thr Arg Thr Gly Val 
                245                 250                 255     


Ile Glu Ile Leu Arg Glu Met Gly Ala Asp Ile Glu Ile Leu Ala Pro 
            260                 265                 270         


Arg Glu Val Gly Gly Glu Pro Val Ala Asp Leu Arg Ile Arg Tyr Arg 
        275                 280                 285             


Glu Leu Arg Gly Ile Arg Ile Pro Glu His Thr Val Pro Leu Ala Ile 
    290                 295                 300                 


Asp Glu Phe Pro Ala Leu Phe Ile Ala Ala Ala Cys Ala Thr Gly Glu 
305                 310                 315                 320 


Thr Val Leu Thr Gly Ala Glu Glu Leu Arg Val Lys Glu Ser Asp Arg 
                325                 330                 335     


Ile Gln Ala Met Ala Asp Gly Leu Thr Thr Leu Gly Ile Asp Ala Arg 
            340                 345                 350         


Pro Thr Pro Asp Gly Met Val Ile Arg Gly Gly Ser Phe Arg Gly Gly 
        355                 360                 365             


Ala Val Asp Ser Arg Gly Asp His Arg Ile Ala Met Ser Phe Ser Ile 
    370                 375                 380                 


Ala Ala Leu Arg Ala Pro Ile Pro Ile Glu Ile His Asp Cys Ala Asn 
385                 390                 395                 400 


Val Ala Thr Ser Phe Pro Asn Phe Val Glu Leu Ala Arg Thr Leu Gly 
                405                 410                 415     


Leu Asp Ile Glu Val Ser 
            420         


<210>  23
<211>  1101
<212>  DNA
<213>  Aro C gene - Methylococcus capsulatus

<400>  23
atgtccggaa acaccatcgg caaactgttt accgtcacga ccttcggcga aagccacggg       60

cctgcgctcg gctgcatcgt cgacggctgc ccgccgggac ttgcgttgtc cgaggccgat      120

ctgcagcacg atctgtatcg ccgccggccg ggccagtccc gccacaccac ccagcggcgt      180

gagtcggaca ccgtcaagat cctgtccggg gtgttcgagg gactcaccac cgggacgccg      240

atcggtctcc tgatcgagaa cgaggaccag cggtccaagg attacgccag catcgccgac      300

cgcttccgcc ccggccatgc cgactacacc taccacatga aatacggctt ccgcgactac      360

cgtggcggcg gtcgctcgtc ggcgcgtgaa accgcgatgc gggtggcggc gggaggcatc      420

gccaagaaat acctgcgtga gcggttgggt gtcgaaatcc gcggctacct ggcccagctc      480

gggccgatcc ggatcgaccc ggtggactgg aacgccatcg acgacaaccc cttcttctgt      540

cccgatcccg ccagggttcc cgagcttgaa gcttacatgg atgccctgcg caaggaaggt      600

gattcgagcg gcgcccgggt caacgtggtg gccaggggcg tgccgccggg cttgggcgag      660

ccggtcttcg accggctcga cgccgagctg gcgtatgcgc tgatgagcat caacgccgtc      720

aagggtgtgg aaatcggcgc cggtttcggc tgtgtcgaag ccaagggttc ggtgttccgc      780

gatgagatga gtccggaagg tttcctgggg aattcggcgg gcggtattct gggcgggata      840

tccaccggcc aggacatcgt tgccagcatc gcgctgaagc ctacctccag tctgcgtctc      900

ccgggccggt cggtgaacat ccgcggggaa tcggtggaag tcgtgaccac cggacgccat      960

gatccctgtg tcggcatccg ggccacgccg atcgccgagg cgatgatggc catcgtgctg     1020

atggatcatt atctgcgcca ccggggtcag aaccaggacg tcgtgcgcac gctcgatccc     1080

atcccgccca gcgcgttcta g                                               1101


<210>  24
<211>  366
<212>  PRT
<213>  Aro C protein - Methylococcus capsulatus

<400>  24

Met Ser Gly Asn Thr Ile Gly Lys Leu Phe Thr Val Thr Thr Phe Gly 
1               5                   10                  15      


Glu Ser His Gly Pro Ala Leu Gly Cys Ile Val Asp Gly Cys Pro Pro 
            20                  25                  30          


Gly Leu Ala Leu Ser Glu Ala Asp Leu Gln His Asp Leu Tyr Arg Arg 
        35                  40                  45              


Arg Pro Gly Gln Ser Arg His Thr Thr Gln Arg Arg Glu Ser Asp Thr 
    50                  55                  60                  


Val Lys Ile Leu Ser Gly Val Phe Glu Gly Leu Thr Thr Gly Thr Pro 
65                  70                  75                  80  


Ile Gly Leu Leu Ile Glu Asn Glu Asp Gln Arg Ser Lys Asp Tyr Ala 
                85                  90                  95      


Ser Ile Ala Asp Arg Phe Arg Pro Gly His Ala Asp Tyr Thr Tyr His 
            100                 105                 110         


Met Lys Tyr Gly Phe Arg Asp Tyr Arg Gly Gly Gly Arg Ser Ser Ala 
        115                 120                 125             


Arg Glu Thr Ala Met Arg Val Ala Ala Gly Gly Ile Ala Lys Lys Tyr 
    130                 135                 140                 


Leu Arg Glu Arg Leu Gly Val Glu Ile Arg Gly Tyr Leu Ala Gln Leu 
145                 150                 155                 160 


Gly Pro Ile Arg Ile Asp Pro Val Asp Trp Asn Ala Ile Asp Asp Asn 
                165                 170                 175     


Pro Phe Phe Cys Pro Asp Pro Ala Arg Val Pro Glu Leu Glu Ala Tyr 
            180                 185                 190         


Met Asp Ala Leu Arg Lys Glu Gly Asp Ser Ser Gly Ala Arg Val Asn 
        195                 200                 205             


Val Val Ala Arg Gly Val Pro Pro Gly Leu Gly Glu Pro Val Phe Asp 
    210                 215                 220                 


Arg Leu Asp Ala Glu Leu Ala Tyr Ala Leu Met Ser Ile Asn Ala Val 
225                 230                 235                 240 


Lys Gly Val Glu Ile Gly Ala Gly Phe Gly Cys Val Glu Ala Lys Gly 
                245                 250                 255     


Ser Val Phe Arg Asp Glu Met Ser Pro Glu Gly Phe Leu Gly Asn Ser 
            260                 265                 270         


Ala Gly Gly Ile Leu Gly Gly Ile Ser Thr Gly Gln Asp Ile Val Ala 
        275                 280                 285             


Ser Ile Ala Leu Lys Pro Thr Ser Ser Leu Arg Leu Pro Gly Arg Ser 
    290                 295                 300                 


Val Asn Ile Arg Gly Glu Ser Val Glu Val Val Thr Thr Gly Arg His 
305                 310                 315                 320 


Asp Pro Cys Val Gly Ile Arg Ala Thr Pro Ile Ala Glu Ala Met Met 
                325                 330                 335     


Ala Ile Val Leu Met Asp His Tyr Leu Arg His Arg Gly Gln Asn Gln 
            340                 345                 350         


Asp Val Val Arg Thr Leu Asp Pro Ile Pro Pro Ser Ala Phe 
        355                 360                 365     


<210>  25
<211>  1212
<212>  DNA
<213>  TrpB gene - Methylococcus capsulatus

<400>  25
atgcaagacg agattcagcc ctatgacctg cccgatgagc tcggccactt tggaccttac       60

ggtggcattt tcgtcgccga gaccttgatg gagccgctgg aagagctgaa agccgcctac      120

catcgctacc tgaaggaccc ggaattcctc gccgagctgg atcacgatct gaaccactac      180

gtcggccgcc cctcaccgat ctaccatgcc gaacgcctga gccgggagct cggcggcgca      240

cagatcttct tcaagcgcga agatctcaat cataccggtg cacacaaggt caacaacacc      300

gtcggccagg cactgctggc caagcgcatg ggcaagcggc gggtgatcgc cgagaccggt      360

gccggccagc acggcgtggc cacggccacc gtggcggccc ggctggggat ggagtgcgtg      420

gtctacatgg gggcggtcga cgtccagcgc caggcgctca acgtattccg catgaagctg      480

ctcggcgcca ccgtgatagc ggtcgactcg ggttcccgga cgctcaagga cgcgctgaac      540

gaagccatgc gcgactgggt gaccaacgtc gacgatacct tctacatcat cggtacggtg      600

gcgggtcccc atccctatcc cgccatggtg cgcgatttcc aggccgtgat cggccgcgag      660

gcgcgccggc agatgctgga gatgacgggg cgtctgcccg atgccctggt cgcctgcgtg      720

ggcggcggct cgaatgccat cggcctgttt catccgttcg tcgatgaccg cgaggtcgcc      780

atgtacgggg tcgaggccgc cggggatggt atcgaaaccg gtcgccactc ggctccgctg      840

agcgccggcc gccccggcgt gctgcacggc aaccgtacct acctgatgga agacgaagac      900

ggcgagatca tcgagaccca ttccatttcc gccgggctgg actatccggg cgtcgggccg      960

gaacacgcct ggctcaagga ctgcggccgg gcgagctatg tcagtgccac cgacgccgaa     1020

gcgctcgagg cgttccatat cctgacccgg tccgagggga tcatcccggc actggaatcc     1080

agccatgccg tggcctacgc cctcaagctg gcgccgactc tcagttccga caagatcgtc     1140

ctggtcaacc tgtctggccg tggcgacaag gatatccaca ccatcgccac ccgggagggc     1200

atcgttctgt ga                                                         1212


<210>  26
<211>  403
<212>  PRT
<213>  TrpB protein - Methylococcus capsulatus

<400>  26

Met Gln Asp Glu Ile Gln Pro Tyr Asp Leu Pro Asp Glu Leu Gly His 
1               5                   10                  15      


Phe Gly Pro Tyr Gly Gly Ile Phe Val Ala Glu Thr Leu Met Glu Pro 
            20                  25                  30          


Leu Glu Glu Leu Lys Ala Ala Tyr His Arg Tyr Leu Lys Asp Pro Glu 
        35                  40                  45              


Phe Leu Ala Glu Leu Asp His Asp Leu Asn His Tyr Val Gly Arg Pro 
    50                  55                  60                  


Ser Pro Ile Tyr His Ala Glu Arg Leu Ser Arg Glu Leu Gly Gly Ala 
65                  70                  75                  80  


Gln Ile Phe Phe Lys Arg Glu Asp Leu Asn His Thr Gly Ala His Lys 
                85                  90                  95      


Val Asn Asn Thr Val Gly Gln Ala Leu Leu Ala Lys Arg Met Gly Lys 
            100                 105                 110         


Arg Arg Val Ile Ala Glu Thr Gly Ala Gly Gln His Gly Val Ala Thr 
        115                 120                 125             


Ala Thr Val Ala Ala Arg Leu Gly Met Glu Cys Val Val Tyr Met Gly 
    130                 135                 140                 


Ala Val Asp Val Gln Arg Gln Ala Leu Asn Val Phe Arg Met Lys Leu 
145                 150                 155                 160 


Leu Gly Ala Thr Val Ile Ala Val Asp Ser Gly Ser Arg Thr Leu Lys 
                165                 170                 175     


Asp Ala Leu Asn Glu Ala Met Arg Asp Trp Val Thr Asn Val Asp Asp 
            180                 185                 190         


Thr Phe Tyr Ile Ile Gly Thr Val Ala Gly Pro His Pro Tyr Pro Ala 
        195                 200                 205             


Met Val Arg Asp Phe Gln Ala Val Ile Gly Arg Glu Ala Arg Arg Gln 
    210                 215                 220                 


Met Leu Glu Met Thr Gly Arg Leu Pro Asp Ala Leu Val Ala Cys Val 
225                 230                 235                 240 


Gly Gly Gly Ser Asn Ala Ile Gly Leu Phe His Pro Phe Val Asp Asp 
                245                 250                 255     


Arg Glu Val Ala Met Tyr Gly Val Glu Ala Ala Gly Asp Gly Ile Glu 
            260                 265                 270         


Thr Gly Arg His Ser Ala Pro Leu Ser Ala Gly Arg Pro Gly Val Leu 
        275                 280                 285             


His Gly Asn Arg Thr Tyr Leu Met Glu Asp Glu Asp Gly Glu Ile Ile 
    290                 295                 300                 


Glu Thr His Ser Ile Ser Ala Gly Leu Asp Tyr Pro Gly Val Gly Pro 
305                 310                 315                 320 


Glu His Ala Trp Leu Lys Asp Cys Gly Arg Ala Ser Tyr Val Ser Ala 
                325                 330                 335     


Thr Asp Ala Glu Ala Leu Glu Ala Phe His Ile Leu Thr Arg Ser Glu 
            340                 345                 350         


Gly Ile Ile Pro Ala Leu Glu Ser Ser His Ala Val Ala Tyr Ala Leu 
        355                 360                 365             


Lys Leu Ala Pro Thr Leu Ser Ser Asp Lys Ile Val Leu Val Asn Leu 
    370                 375                 380                 


Ser Gly Arg Gly Asp Lys Asp Ile His Thr Ile Ala Thr Arg Glu Gly 
385                 390                 395                 400 


Ile Val Leu 
            


<210>  27
<211>  6109
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Vector backbone (derived from vector pMHA201)

<400>  27
taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt       60

aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct      120

cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa      180

aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa      240

aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc      300

tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga      360

caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc      420

cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt      480

ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct      540

gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg      600

agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta      660

gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct      720

acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa      780

gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt      840

gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta      900

cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat      960

caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa     1020

gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct     1080

cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta     1140

cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct     1200

caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg     1260

gtcagcttgg ctgccatttt tggggtgagg ccgttcgcgg ccgaggggcg cagcccctgg     1320

ggggatggga ggcccgcgtt agcgggccgg gagggttcga gaaggggggg cacccccctt     1380

cggcgtgcgc ggtcacgcgc acagggcgca gccctggtta aaaacaaggt ttataaatat     1440

tggtttaaaa gcaggttaaa agacaggtta gcggtggccg aaaaacgggc ggaaaccctt     1500

gcaaatgctg gattttctgc ctgtggacag cccctcaaat gtcaataggt gcgcccctca     1560

tctgtcagca ctctgcccct caagtgtcaa ggatcgcgcc cctcatctgt cagtagtcgc     1620

gcccctcaag tgtcaatacc gcagggcact tatccccagg cttgtccaca tcatctgtgg     1680

gaaactcgcg taaaatcagg cgttttcgcc gatttgcgag gctggccagc tccacgtcgc     1740

cggccgaaat cgagcctgcc cctcatctgt caacgccgcg ccgggtgagt cggcccctca     1800

agtgtcaacg tccgcccctc atctgtcagt gagggccaag ttttccgcga ggtatccaca     1860

acgccggcgg ccgcggtgtc tcgcacacgg cttcgacggc gtttctggcg cgtttgcagg     1920

gccatagacg gccgccagcc cagcggcgag ggcaaccagc ccggtgagcg tcggaaaggg     1980

tcgacggatc ttttccgctg cataaccctg cttcggggtc attatagcga ttttttcggt     2040

atatccatcc tttttcgcac gatatacagg attttgccaa agggttcgtg tagactttcc     2100

ttggtgtatc caacggcgtc agccgggcag gataggtgaa gtaggcccac ccgcgagcgg     2160

gtgttccttc ttcactgtcc cttattcgca cctggcggtg ctcaacggga atcctgctct     2220

gcgaggctgg ccggctaccg ccggcgtaac agatgagggc aagcggatgg ctgatgaaac     2280

caagccaacc aggaagggca gcccacctat caaggtgtac tgccttccag acgaacgaag     2340

agcgattgag gaaaaggcgg cggcggccgg catgagcctg taggcctacc tgctggccgt     2400

cggccagggc tacaaaatca cgggcgtcgt ggactatgag cacgtccgcg agctggcccg     2460

catcaatggc gacctgggcc gcctgggcgg cctgctgaaa ctctggctca ccgacgaccc     2520

gcgcacggcg cggttcggtg atgccacgat cctcgccctg ctggcgaaga tcgaagagaa     2580

gcaggacgag cttggcaagg tcatgatggg cgtggtccgc ccgagggcag agccatgact     2640

tttttagccg ctaaaacggc cggggggtgc gcgtgattgc caagcacgtc cccatgcgct     2700

ccatcaagaa gagcgacttc gcggagctgg tattcgtgca gggcaagatt cggaatacca     2760

agtacgagaa ggacggccag acggtctacg ggaccgactt cattgccgat aaggtggatt     2820

atctggacac caaggcacca ggcgggtcaa atcaggaata agggcacatt gccccggcgt     2880

gagtcggggc aatcccgcaa ggagggtgaa tgaatcggac gtttgaccgg aaggcataca     2940

ggcaagaact gatcgacgcg gggttttccg ccgaggatgc cgaaaccatc gcaagccgca     3000

ccgtcatgcg tgcgccccgc gaaaccttcc agtccgtcgg ctcgatggtc cagcaagcta     3060

cggccaagat cgagcgcgac agcgtgcaac tggctccccc tgccctgccc gcgccatcgg     3120

ccgccgtgga gcgttcgcgt cgtctcgaac aggaggcggc aggtttggcg aagtcgatga     3180

ccatcgacac gcgaggaact atgacgacca agaagcgaaa aaccgccggc gaggacctgg     3240

caaaacaggt cagcgaggcc aagcaggccg cgttgctgaa acacacgaag cagcagatca     3300

aggaaatgca gctttccttg ttcgatattg cgccgtggcc ggacacgatg cgagcgatgc     3360

caaacgacac ggcccgctct gccctgttca ccacgcgcaa caagaaaatc ccgcgcgagg     3420

cgctgcaaaa caaggtcatt ttccacgtca acaaggacgt gaagatcacc tacaccggcg     3480

tcgagctgcg ggccgacgat gacgaactgg tgtggcagca ggtgttggag tacgcgaagc     3540

gcacccctat cggcgagccg atcaccttca cgttctacga gctttgccag gacctgggct     3600

ggtcgatcaa tggccggtat tacacgaagg ccgaggaatg cctgtcgcgc ctacaggcga     3660

cggcgatggg cttcacgtcc gaccgcgttg ggcacctgga atcggtgtcg ctgctgcacc     3720

gcttccgcgt cctggaccgt ggcaagaaaa cgtcccgttg ccaggtcctg atcgacgagg     3780

aaatcgtcgt gctgtttgct ggcgaccact acacgaaatt catatgggag aagtaccgca     3840

agctgtcgcc gacggcccga cggatgttcg actatttcag ctcgcaccgg gagccgtacc     3900

cgctcaagct ggaaaccttc cgcctcatgt gcggatcgga ttccacccgc gtgaagaagt     3960

ggcgcgagca ggtcggcgaa gcctgcgaag agttgcgagg cagcggcctg gtggaacacg     4020

cctgggtcaa tgatgacctg gtgcattgca aacgctaggg ccttgtgggg tcagttccgg     4080

ctgggggttc agcagccagc gctttactgg catttcagga acaagcgggc actgctcgac     4140

gcacttgctt cgctcagtat cgctcgggac gcacggcgcg ctctacgaac tgccgataaa     4200

cagaggatta aaattgacaa ttctagggcg cgtatagctt gccggaagtc gccttgaccc     4260

gcatggcata ggcctatcgt ttccacgatc agcgatcggc tcgttgccct gcgccgctcc     4320

aaagcccgcg acgcagcgcc ggcaggcaga gcaagtagag ggcagcgcct gcaatccatg     4380

cccacccgtt ccacgttgtt atagaagccg catagatcgc cgtgaagagg aggggtccga     4440

cgatcgaggt caggctggtg agcgccgcca gtgagccttg cagctgcccc tgacgttcct     4500

catccacctg cctggacaac attgcttgca gcgccggcat tccgatgcca cccgaagcaa     4560

gcaggaccat gatcgggaac gccatccatc cccgtgtcgg acctgcaggg ggggggggga     4620

aagccacgtt gtgtctcaaa atctctgatg ttacattgca caagataaaa atatatcatc     4680

atgaacaata aaactgtctg cttacataaa cagtaataca aggggtgtta tgagccatat     4740

tcaacgggaa acgtcttgct cgaggccgcg attaaattcc aacatggatg ctgatttata     4800

tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt gcgacaatct atcgattgta     4860

tgggaagccc gatgcgccag agttgtttct gaaacatggc aaaggtagcg ttgccaatga     4920

tgttacagat gagatggtca gactaaactg gctgacggaa tttatgcctc ttccgaccat     4980

caagcatttt atccgtactc ctgatgatgc atggttactc accactgcga tccccgggaa     5040

aacagcattc caggtattag aagaatatcc tgattcaggt gaaaatattg ttgatgcgct     5100

ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt aattgtcctt ttaacagcga     5160

tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat aacggtttgg ttgatgcgag     5220

tgattttgat gacgagcgta atggctggcc tgttgaacaa gtctggaaag aaatgcataa     5280

gcttttgcca ttctcaccgg attcagtcgt cactcatggt gatttctcac ttgataacct     5340

tatttttgac gaggggaaat taataggttg tattgatgtt ggacgagtcg gaatcgcaga     5400

ccgataccag gatcttgcca tcctatggaa ctgcctcggt gagttttctc cttcattaca     5460

gaaacggctt tttcaaaaat atggtattga taatcctgat atgaataaat tgcagtttca     5520

tttgatgctc gatgagtttt tctaatcaga attggttaat tggttgtaac actggcagag     5580

cattacgctg acttgacggg acggcggctt tgttgaataa atcgaacttt tgctgagttg     5640

aaggatcaga tcacgcatct tcccgacaac gcagaccgtt ccgtggcaaa gcaaaagttc     5700

aaaatcacca actggtccac ctacaacaaa gctctcatca accgtggctc cctcactttc     5760

tggctggatg atggggcgat tcaggcctgg tatgagtcag caacaccttc ttcacgaggc     5820

agacctcagc gccccccccc ccctgcaggt catcggcaat ataagcgccg gctaccgccc     5880

cagtcgcccc ggtgatgccg gccacgatcc gcccgatata gagaacccaa aggaaaggcg     5940

ctgtcgccat gatggcgtag tcgacagtgg cgccggccag cgagacgagc aagattggcc     6000

gccgcccgaa acgatccgac agcgcgccca gcacaggtgc gcaggcaaat tgcaccaacg     6060

catacagcgc cagcagaatg ccatagtggg cggtgacgtc gttcgagtg                 6109


<210>  28
<211>  861
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AmpR gene (from vector pCR2.1-TOPO - Ali and Murrell 2009)

<400>  28
atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct       60

gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca      120

cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc      180

gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc      240

cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg      300

gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta      360

tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc      420

ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt      480

gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg      540

cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct      600

tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc      660

tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct      720

cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac      780

acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc      840

tcactgatta agcattggta a                                                861


<210>  29
<211>  286
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AmpR protein

<400>  29

Met Ser Ile Gln His Phe Arg Val Ala Leu Ile Pro Phe Phe Ala Ala 
1               5                   10                  15      


Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr Leu Val Lys Val Lys 
            20                  25                  30          


Asp Ala Glu Asp Gln Leu Gly Ala Arg Val Gly Tyr Ile Glu Leu Asp 
        35                  40                  45              


Leu Asn Ser Gly Lys Ile Leu Glu Ser Phe Arg Pro Glu Glu Arg Phe 
    50                  55                  60                  


Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys Gly Ala Val Leu Ser 
65                  70                  75                  80  


Arg Ile Asp Ala Gly Gln Glu Gln Leu Gly Arg Arg Ile His Tyr Ser 
                85                  90                  95      


Gln Asn Asp Leu Val Glu Tyr Ser Pro Val Thr Glu Lys His Leu Thr 
            100                 105                 110         


Asp Gly Met Thr Val Arg Glu Leu Cys Ser Ala Ala Ile Thr Met Ser 
        115                 120                 125             


Asp Asn Thr Ala Ala Asn Leu Leu Leu Thr Thr Ile Gly Gly Pro Lys 
    130                 135                 140                 


Glu Leu Thr Ala Phe Leu His Asn Met Gly Asp His Val Thr Arg Leu 
145                 150                 155                 160 


Asp Arg Trp Glu Pro Glu Leu Asn Glu Ala Ile Pro Asn Asp Glu Arg 
                165                 170                 175     


Asp Thr Thr Met Pro Val Ala Met Ala Thr Thr Leu Arg Lys Leu Leu 
            180                 185                 190         


Thr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gln Gln Leu Ile Asp Trp 
        195                 200                 205             


Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu Arg Ser Ala Leu Pro 
    210                 215                 220                 


Ala Gly Trp Phe Ile Ala Asp Lys Ser Gly Ala Gly Glu Arg Gly Ser 
225                 230                 235                 240 


Arg Gly Ile Ile Ala Ala Leu Gly Pro Asp Gly Lys Pro Ser Arg Ile 
                245                 250                 255     


Val Val Ile Tyr Thr Thr Gly Ser Gln Ala Thr Met Asp Glu Arg Asn 
            260                 265                 270         


Arg Gln Ile Ala Glu Ile Gly Ala Ser Leu Ile Lys His Trp 
        275                 280                 285     


<210>  30
<211>  327
<212>  DNA
<213>  TrpR gene - Escherichia coli

<400>  30
atggcccaac aatcacccta ttcagcagcg atggcagaac agcgtcacca ggagtggtta       60

cgttttgtcg acctgcttaa gaatgcctac caaaacgatc tccatttacc gttgttaaac      120

ctgatgctga cgccagatga gcgcgaagcg ttggggactc gcgtgcgtat tgtcgaagag      180

ctgttgcgcg gcgaaatgag ccagcgtgag ttaaaaaatg aactcggcgc aggcatcgcg      240

acgattacgc gtggatctaa cagcctgaaa gccgcgcccg tcgagctgcg ccagtggctg      300

gaagaggtgt tgctgaaaag cgattga                                          327


<210>  31
<211>  108
<212>  PRT
<213>  TrpR protein - Escherichia coli

<400>  31

Met Ala Gln Gln Ser Pro Tyr Ser Ala Ala Met Ala Glu Gln Arg His 
1               5                   10                  15      


Gln Glu Trp Leu Arg Phe Val Asp Leu Leu Lys Asn Ala Tyr Gln Asn 
            20                  25                  30          


Asp Leu His Leu Pro Leu Leu Asn Leu Met Leu Thr Pro Asp Glu Arg 
        35                  40                  45              


Glu Ala Leu Gly Thr Arg Val Arg Ile Val Glu Glu Leu Leu Arg Gly 
    50                  55                  60                  


Glu Met Ser Gln Arg Glu Leu Lys Asn Glu Leu Gly Ala Gly Ile Ala 
65                  70                  75                  80  


Thr Ile Thr Arg Gly Ser Asn Ser Leu Lys Ala Ala Pro Val Glu Leu 
                85                  90                  95      


Arg Gln Trp Leu Glu Glu Val Leu Leu Lys Ser Asp 
            100                 105             


<210>  32
<211>  48
<212>  DNA
<213>  T7 Terminator sequence (T7 RNA polymerase of T7 phage)

<400>  32
ctagcataac cccttggggc ctctaaacgg gtcttgaggg gttttttg                    48


<210>  33
<211>  105
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AmpR Promoter gene - from vector pCR2.1-TOPO (Ali and Murrell 
       2009)

<400>  33
cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga       60

caataaccct gataaatgct tcaataatat tgaaaaagga agagt                      105


<210>  34
<211>  1182
<212>  DNA
<213>  Aspartate transaminase gene - Methylococcus capsulatus

<400>  34
atgagcataa aactttccgg cagagtccaa tcggttaaac catcgccaac cctggctatt       60

accgcaagag ccgccgcaat gcgcgccgcc ggcaaggaca tcatcggcct cggcgcgggc      120

gaacccgact tcgacacgcc ggaccacatc aaagccgcag caatagaagc aatgaacaaa      180

ggctttacga aatacactcc ggtcgacggc accgcgagct taaaaaaagc gatcatcgaa      240

aaattcaaaa aagacaacgg cctcgattat caaccgaaac aaatcttggt ttcctgcggc      300

ggtaagcaaa gttcttacaa cctgacgcaa gcgttgctga acgacggaga cgaagtcatt      360

attccagccc cttattgggt ctcgtatcct gatatggtgc tgcttgccgg cggcgtgccg      420

gtcgtcatcg aaacaacaca ggcgcagcac tttaaaatat cgccggaaca actgcgcgcg      480

gcgattaccg acaagacccg attaattttc atcaacagcc cgtcgaatcc gaccggcgtc      540

gcctattcgc tcgacgaact gaaagcactc ggcgatgtgt tgaaagattt tccggacatc      600

atcatcgcga ccgacgacat gtacgaacat atcacctgga aaaaaggcgc gttcgtcaac      660

attctgaacg cgcacccgga gttctacgac cgcaccgtcg ttatgaacgg cgtgtctaaa      720

gcttattcga tgaccggctg gcgcatcggt tacgcggcag gccctatcga tttgattgaa      780

gcgatgggca cgattcaatc gcaaagcacc tcgaatccga cctcgatttc acaatatgcc      840

gccgaagccg cgctgaacgg cgatcaaggc ttcatcgaca tgatgatgac cgaattcaag      900

aagcgccatg atttcgtggt ctcggaactc aacaaaatcg acggcatcga ttgccttgaa      960

accgacggca cattctacgt attcccgaac gtggaacaag caatcgccaa aatggacaac     1020

atcaaagacg acttggattt ttcagaatac ctgatcgaaa atgccggcgt agcgctagtg     1080

ccgggctcgg ccttcggttg tccgggacac gtcagaatat cgatcgcgac cagtatgaaa     1140

aacttggaaa acgcgctgga gagaattaaa aaggcggttt ga                        1182


<210>  35
<211>  393
<212>  PRT
<213>  Aspartate transaminase protein - Methylococcus capsulatus

<400>  35

Met Ser Ile Lys Leu Ser Gly Arg Val Gln Ser Val Lys Pro Ser Pro 
1               5                   10                  15      


Thr Leu Ala Ile Thr Ala Arg Ala Ala Ala Met Arg Ala Ala Gly Lys 
            20                  25                  30          


Asp Ile Ile Gly Leu Gly Ala Gly Glu Pro Asp Phe Asp Thr Pro Asp 
        35                  40                  45              


His Ile Lys Ala Ala Ala Ile Glu Ala Met Asn Lys Gly Phe Thr Lys 
    50                  55                  60                  


Tyr Thr Pro Val Asp Gly Thr Ala Ser Leu Lys Lys Ala Ile Ile Glu 
65                  70                  75                  80  


Lys Phe Lys Lys Asp Asn Gly Leu Asp Tyr Gln Pro Lys Gln Ile Leu 
                85                  90                  95      


Val Ser Cys Gly Gly Lys Gln Ser Ser Tyr Asn Leu Thr Gln Ala Leu 
            100                 105                 110         


Leu Asn Asp Gly Asp Glu Val Ile Ile Pro Ala Pro Tyr Trp Val Ser 
        115                 120                 125             


Tyr Pro Asp Met Val Leu Leu Ala Gly Gly Val Pro Val Val Ile Glu 
    130                 135                 140                 


Thr Thr Gln Ala Gln His Phe Lys Ile Ser Pro Glu Gln Leu Arg Ala 
145                 150                 155                 160 


Ala Ile Thr Asp Lys Thr Arg Leu Ile Phe Ile Asn Ser Pro Ser Asn 
                165                 170                 175     


Pro Thr Gly Val Ala Tyr Ser Leu Asp Glu Leu Lys Ala Leu Gly Asp 
            180                 185                 190         


Val Leu Lys Asp Phe Pro Asp Ile Ile Ile Ala Thr Asp Asp Met Tyr 
        195                 200                 205             


Glu His Ile Thr Trp Lys Lys Gly Ala Phe Val Asn Ile Leu Asn Ala 
    210                 215                 220                 


His Pro Glu Phe Tyr Asp Arg Thr Val Val Met Asn Gly Val Ser Lys 
225                 230                 235                 240 


Ala Tyr Ser Met Thr Gly Trp Arg Ile Gly Tyr Ala Ala Gly Pro Ile 
                245                 250                 255     


Asp Leu Ile Glu Ala Met Gly Thr Ile Gln Ser Gln Ser Thr Ser Asn 
            260                 265                 270         


Pro Thr Ser Ile Ser Gln Tyr Ala Ala Glu Ala Ala Leu Asn Gly Asp 
        275                 280                 285             


Gln Gly Phe Ile Asp Met Met Met Thr Glu Phe Lys Lys Arg His Asp 
    290                 295                 300                 


Phe Val Val Ser Glu Leu Asn Lys Ile Asp Gly Ile Asp Cys Leu Glu 
305                 310                 315                 320 


Thr Asp Gly Thr Phe Tyr Val Phe Pro Asn Val Glu Gln Ala Ile Ala 
                325                 330                 335     


Lys Met Asp Asn Ile Lys Asp Asp Leu Asp Phe Ser Glu Tyr Leu Ile 
            340                 345                 350         


Glu Asn Ala Gly Val Ala Leu Val Pro Gly Ser Ala Phe Gly Cys Pro 
        355                 360                 365             


Gly His Val Arg Ile Ser Ile Ala Thr Ser Met Lys Asn Leu Glu Asn 
    370                 375                 380                 


Ala Leu Glu Arg Ile Lys Lys Ala Val 
385                 390             


<210>  36
<211>  1413
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  FMO gene (codon-optimized) - Corynebacterium glutamicum

<400>  36
atggagatgg tcatgaagaa caagcgggtg gcgatcatcg gcgcgggccc gagcggcatc       60

gcccagctgc gcgccttcga gtcggcggaa aagcagggcc acgagatccc ggagctggtc      120

tgcttcgaga agcaggacac ctggggcggc cagtggaact acagctggcg caccggcacc      180

gactcgtacg gcgagccggt ccacagcagc atgtaccgca acctgtggag caacggcccg      240

aaggaggtcc tggagttcgc cgagtactcc ttcgacgagc acttcggcaa gccgatctcg      300

tcctacccgc cgcgcgaggt cctgtgggac tacatcgccg gccgcgccaa gaagtccaac      360

gtggagaagt acatcaagtt cgcccacgtc gtccgctggg tgagcttcga cgaggccacc      420

aagctgttca ccgtgaccgt cgagaacctg cgcaccggcg agaccagcag cgacacctac      480

gacaacgtca tcgtcggcgc gggccacttc tcgttcccga acgtcccgca cttcgacggc      540

gtcgagacct tccccggcca gatcatgcac gcccacgagt tccggggcgc ggaagccgtc      600

gccgacaagg acatcctgct gatcggcgcg tcgtactccg ccgaggacat cggcacccag      660

gcctacaaga tgggcgcgcg cagcgtcacc ttcagctacc gcagcaaccc gatgggctac      720

gagtggccgg aggaaatgac cgaactgccc ctggtcgaac ggttcgacgg cagcgaggtg      780

cacttcgtga acggcgagaa gcggaaggtc gatatcgtcg tcttctgcac cggctacctg      840

caccactacc cgttcatgcc gtccgagctg accctgagca gcccgaacaa cctgtacccc      900

gacaccctgt accgcggcgt cgtctcggaa gccaacaacc agctgttctg gctgggcgcg      960

caggaccagt ggctgacctt caatatgttt gacgcccagg cctggtacgt ccgcgacgtc     1020

atcctgggcc gcgtcgccct gccgagcaag gaagcccagc gcaaccacat ggacaagtgg     1080

ctgtcccgct tcgagggcct gaagagcgag aacgaccaga tcgacttcca gtgcgactac     1140

gtcgaggacc tgatcgacca gaccgactac ccgagcttcg acctgaagga agtcgccaac     1200

atcctgaagg gctgggtcaa gagcaaggaa gaggacatcc tgaactaccg ggactacacc     1260

tacacctccg tcatgaccgg caccaccagc gtcgagcacc acaccccgtg gatgatcgag     1320

ctggacgaca gcctggaacg ctacctgagc gaaccccagg aagacgaagc ccgccaggtc     1380

taccgcggca agaaggtccg cgacaaggcg tga                                  1413


<210>  37
<211>  470
<212>  PRT
<213>  FMO protein - Corynebacterium glutamicum

<400>  37

Met Glu Met Val Met Lys Asn Lys Arg Val Ala Ile Ile Gly Ala Gly 
1               5                   10                  15      


Pro Ser Gly Ile Ala Gln Leu Arg Ala Phe Glu Ser Ala Glu Lys Gln 
            20                  25                  30          


Gly His Glu Ile Pro Glu Leu Val Cys Phe Glu Lys Gln Asp Thr Trp 
        35                  40                  45              


Gly Gly Gln Trp Asn Tyr Ser Trp Arg Thr Gly Thr Asp Ser Tyr Gly 
    50                  55                  60                  


Glu Pro Val His Ser Ser Met Tyr Arg Asn Leu Trp Ser Asn Gly Pro 
65                  70                  75                  80  


Lys Glu Val Leu Glu Phe Ala Glu Tyr Ser Phe Asp Glu His Phe Gly 
                85                  90                  95      


Lys Pro Ile Ser Ser Tyr Pro Pro Arg Glu Val Leu Trp Asp Tyr Ile 
            100                 105                 110         


Ala Gly Arg Ala Lys Lys Ser Asn Val Glu Lys Tyr Ile Lys Phe Ala 
        115                 120                 125             


His Val Val Arg Trp Val Ser Phe Asp Glu Ala Thr Lys Leu Phe Thr 
    130                 135                 140                 


Val Thr Val Glu Asn Leu Arg Thr Gly Glu Thr Ser Ser Asp Thr Tyr 
145                 150                 155                 160 


Asp Asn Val Ile Val Gly Ala Gly His Phe Ser Phe Pro Asn Val Pro 
                165                 170                 175     


His Phe Asp Gly Val Glu Thr Phe Pro Gly Gln Ile Met His Ala His 
            180                 185                 190         


Glu Phe Arg Gly Ala Glu Ala Val Ala Asp Lys Asp Ile Leu Leu Ile 
        195                 200                 205             


Gly Ala Ser Tyr Ser Ala Glu Asp Ile Gly Thr Gln Ala Tyr Lys Met 
    210                 215                 220                 


Gly Ala Arg Ser Val Thr Phe Ser Tyr Arg Ser Asn Pro Met Gly Tyr 
225                 230                 235                 240 


Glu Trp Pro Glu Glu Met Thr Glu Leu Pro Leu Val Glu Arg Phe Asp 
                245                 250                 255     


Gly Ser Glu Val His Phe Val Asn Gly Glu Lys Arg Lys Val Asp Ile 
            260                 265                 270         


Val Val Phe Cys Thr Gly Tyr Leu His His Tyr Pro Phe Met Pro Ser 
        275                 280                 285             


Glu Leu Thr Leu Ser Ser Pro Asn Asn Leu Tyr Pro Asp Thr Leu Tyr 
    290                 295                 300                 


Arg Gly Val Val Ser Glu Ala Asn Asn Gln Leu Phe Trp Leu Gly Ala 
305                 310                 315                 320 


Gln Asp Gln Trp Leu Thr Phe Asn Met Phe Asp Ala Gln Ala Trp Tyr 
                325                 330                 335     


Val Arg Asp Val Ile Leu Gly Arg Val Ala Leu Pro Ser Lys Glu Ala 
            340                 345                 350         


Gln Arg Asn His Met Asp Lys Trp Leu Ser Arg Phe Glu Gly Leu Lys 
        355                 360                 365             


Ser Glu Asn Asp Gln Ile Asp Phe Gln Cys Asp Tyr Val Glu Asp Leu 
    370                 375                 380                 


Ile Asp Gln Thr Asp Tyr Pro Ser Phe Asp Leu Lys Glu Val Ala Asn 
385                 390                 395                 400 


Ile Leu Lys Gly Trp Val Lys Ser Lys Glu Glu Asp Ile Leu Asn Tyr 
                405                 410                 415     


Arg Asp Tyr Thr Tyr Thr Ser Val Met Thr Gly Thr Thr Ser Val Glu 
            420                 425                 430         


His His Thr Pro Trp Met Ile Glu Leu Asp Asp Ser Leu Glu Arg Tyr 
        435                 440                 445             


Leu Ser Glu Pro Gln Glu Asp Glu Ala Arg Gln Val Tyr Arg Gly Lys 
    450                 455                 460                 


Lys Val Arg Asp Lys Ala 
465                 470 


<210>  38
<211>  1071
<212>  DNA
<213>  mutant AroF gene (N8K mutant) - Escherichia coli

<400>  38
atgcaaaaag acgcgctgaa taaggtacat attaccgacg aacaggtttt aatgactccg       60

gaacaactga aggccgcttt tccattgagc ctgcaacaag aagcccagat tgctgactcg      120

cgtaaaagca tttcagatat tatcgccggg cgcgatcctc gtctgctggt agtatgtggt      180

ccttgttcca ttcatgatcc ggaaactgct ctggaatatg ctcgtcgatt taaagccctt      240

gccgcagagg tcagcgatag cctctatctg gtaatgcgcg tctattttga aaaaccccgt      300

accactgtcg gctggaaagg gttaattaac gatccccata tggatggctc ttttgatgta      360

gaagccgggc tgcagatcgc gcgtaaattg ctgcttgagc tggtgaatat gggactgcca      420

ctggcgacgg aagcgttaga tccgaatagc ccgcaatacc tgggcgatct gtttagctgg      480

tcagcaattg gtgctcgtac aacggaatcg caaactcacc gtgaaatggc ctccgggctt      540

tccatgccgg ttggttttaa aaacggcacc gacggcagtc tggcaacagc aattaacgct      600

atgcgcgccg ccgcccagcc gcaccgtttt gttggcatta accaggcagg gcaggttgcg      660

ttgctacaaa ctcaggggaa tccggacggc catgtgatcc tgcgcggtgg taaagcgccg      720

aactatagcc ctgcggatgt tgcgcaatgt gaaaaagaga tggaacaggc gggactgcgc      780

ccgtctctga tggtagattg cagccacggt aattccaata aagattatcg ccgtcagcct      840

gcggtggcag aatccgtggt tgctcaaatc aaagatggca atcgctcaat tattggtctg      900

atgatcgaaa gtaatatcca cgagggcaat cagtcttccg agcaaccgcg cagtgaaatg      960

aaatacggtg tatccgtaac cgatgcctgc attagctggg aaatgaccga tgccttgctg     1020

cgtgaaattc atcaggatct gaacgggcag ctgacggctc gcgtggctta a              1071


<210>  39
<211>  356
<212>  PRT
<213>  mutant AroF protein (N8K mutant) - Escherichia coli

<400>  39

Met Gln Lys Asp Ala Leu Asn Lys Val His Ile Thr Asp Glu Gln Val 
1               5                   10                  15      


Leu Met Thr Pro Glu Gln Leu Lys Ala Ala Phe Pro Leu Ser Leu Gln 
            20                  25                  30          


Gln Glu Ala Gln Ile Ala Asp Ser Arg Lys Ser Ile Ser Asp Ile Ile 
        35                  40                  45              


Ala Gly Arg Asp Pro Arg Leu Leu Val Val Cys Gly Pro Cys Ser Ile 
    50                  55                  60                  


His Asp Pro Glu Thr Ala Leu Glu Tyr Ala Arg Arg Phe Lys Ala Leu 
65                  70                  75                  80  


Ala Ala Glu Val Ser Asp Ser Leu Tyr Leu Val Met Arg Val Tyr Phe 
                85                  90                  95      


Glu Lys Pro Arg Thr Thr Val Gly Trp Lys Gly Leu Ile Asn Asp Pro 
            100                 105                 110         


His Met Asp Gly Ser Phe Asp Val Glu Ala Gly Leu Gln Ile Ala Arg 
        115                 120                 125             


Lys Leu Leu Leu Glu Leu Val Asn Met Gly Leu Pro Leu Ala Thr Glu 
    130                 135                 140                 


Ala Leu Asp Pro Asn Ser Pro Gln Tyr Leu Gly Asp Leu Phe Ser Trp 
145                 150                 155                 160 


Ser Ala Ile Gly Ala Arg Thr Thr Glu Ser Gln Thr His Arg Glu Met 
                165                 170                 175     


Ala Ser Gly Leu Ser Met Pro Val Gly Phe Lys Asn Gly Thr Asp Gly 
            180                 185                 190         


Ser Leu Ala Thr Ala Ile Asn Ala Met Arg Ala Ala Ala Gln Pro His 
        195                 200                 205             


Arg Phe Val Gly Ile Asn Gln Ala Gly Gln Val Ala Leu Leu Gln Thr 
    210                 215                 220                 


Gln Gly Asn Pro Asp Gly His Val Ile Leu Arg Gly Gly Lys Ala Pro 
225                 230                 235                 240 


Asn Tyr Ser Pro Ala Asp Val Ala Gln Cys Glu Lys Glu Met Glu Gln 
                245                 250                 255     


Ala Gly Leu Arg Pro Ser Leu Met Val Asp Cys Ser His Gly Asn Ser 
            260                 265                 270         


Asn Lys Asp Tyr Arg Arg Gln Pro Ala Val Ala Glu Ser Val Val Ala 
        275                 280                 285             


Gln Ile Lys Asp Gly Asn Arg Ser Ile Ile Gly Leu Met Ile Glu Ser 
    290                 295                 300                 


Asn Ile His Glu Gly Asn Gln Ser Ser Glu Gln Pro Arg Ser Glu Met 
305                 310                 315                 320 


Lys Tyr Gly Val Ser Val Thr Asp Ala Cys Ile Ser Trp Glu Met Thr 
                325                 330                 335     


Asp Ala Leu Leu Arg Glu Ile His Gln Asp Leu Asn Gly Gln Leu Thr 
            340                 345                 350         


Ala Arg Val Ala 
        355     


<210>  40
<211>  1071
<212>  DNA
<213>  mutant AroF gene (P148L mutant) - Escherichia coli

<400>  40
atgcaaaaag acgcgctgaa taacgtacat attaccgacg aacaggtttt aatgactccg       60

gaacaactga aggccgcttt tccattgagc ctgcaacaag aagcccagat tgctgactcg      120

cgtaaaagca tttcagatat tatcgccggg cgcgatcctc gtctgctggt agtatgtggt      180

ccttgttcca ttcatgatcc ggaaactgct ctggaatatg ctcgtcgatt taaagccctt      240

gccgcagagg tcagcgatag cctctatctg gtaatgcgcg tctattttga aaaaccccgt      300

accactgtcg gctggaaagg gttaattaac gatccccata tggatggctc ttttgatgta      360

gaagccgggc tgcagatcgc gcgtaaattg ctgcttgagc tggtgaatat gggactgcca      420

ctggcgacgg aagcgttaga tctgaatagc ccgcaatacc tgggcgatct gtttagctgg      480

tcagcaattg gtgctcgtac aacggaatcg caaactcacc gtgaaatggc ctccgggctt      540

tccatgccgg ttggttttaa aaacggcacc gacggcagtc tggcaacagc aattaacgct      600

atgcgcgccg ccgcccagcc gcaccgtttt gttggcatta accaggcagg gcaggttgcg      660

ttgctacaaa ctcaggggaa tccggacggc catgtgatcc tgcgcggtgg taaagcgccg      720

aactatagcc ctgcggatgt tgcgcaatgt gaaaaagaga tggaacaggc gggactgcgc      780

ccgtctctga tggtagattg cagccacggt aattccaata aagattatcg ccgtcagcct      840

gcggtggcag aatccgtggt tgctcaaatc aaagatggca atcgctcaat tattggtctg      900

atgatcgaaa gtaatatcca cgagggcaat cagtcttccg agcaaccgcg cagtgaaatg      960

aaatacggtg tatccgtaac cgatgcctgc attagctggg aaatgaccga tgccttgctg     1020

cgtgaaattc atcaggatct gaacgggcag ctgacggctc gcgtggctta a              1071


<210>  41
<211>  356
<212>  PRT
<213>  mutant AroF protein (P148L mutant) - Escherichia coli

<400>  41

Met Gln Lys Asp Ala Leu Asn Asn Val His Ile Thr Asp Glu Gln Val 
1               5                   10                  15      


Leu Met Thr Pro Glu Gln Leu Lys Ala Ala Phe Pro Leu Ser Leu Gln 
            20                  25                  30          


Gln Glu Ala Gln Ile Ala Asp Ser Arg Lys Ser Ile Ser Asp Ile Ile 
        35                  40                  45              


Ala Gly Arg Asp Pro Arg Leu Leu Val Val Cys Gly Pro Cys Ser Ile 
    50                  55                  60                  


His Asp Pro Glu Thr Ala Leu Glu Tyr Ala Arg Arg Phe Lys Ala Leu 
65                  70                  75                  80  


Ala Ala Glu Val Ser Asp Ser Leu Tyr Leu Val Met Arg Val Tyr Phe 
                85                  90                  95      


Glu Lys Pro Arg Thr Thr Val Gly Trp Lys Gly Leu Ile Asn Asp Pro 
            100                 105                 110         


His Met Asp Gly Ser Phe Asp Val Glu Ala Gly Leu Gln Ile Ala Arg 
        115                 120                 125             


Lys Leu Leu Leu Glu Leu Val Asn Met Gly Leu Pro Leu Ala Thr Glu 
    130                 135                 140                 


Ala Leu Asp Leu Asn Ser Pro Gln Tyr Leu Gly Asp Leu Phe Ser Trp 
145                 150                 155                 160 


Ser Ala Ile Gly Ala Arg Thr Thr Glu Ser Gln Thr His Arg Glu Met 
                165                 170                 175     


Ala Ser Gly Leu Ser Met Pro Val Gly Phe Lys Asn Gly Thr Asp Gly 
            180                 185                 190         


Ser Leu Ala Thr Ala Ile Asn Ala Met Arg Ala Ala Ala Gln Pro His 
        195                 200                 205             


Arg Phe Val Gly Ile Asn Gln Ala Gly Gln Val Ala Leu Leu Gln Thr 
    210                 215                 220                 


Gln Gly Asn Pro Asp Gly His Val Ile Leu Arg Gly Gly Lys Ala Pro 
225                 230                 235                 240 


Asn Tyr Ser Pro Ala Asp Val Ala Gln Cys Glu Lys Glu Met Glu Gln 
                245                 250                 255     


Ala Gly Leu Arg Pro Ser Leu Met Val Asp Cys Ser His Gly Asn Ser 
            260                 265                 270         


Asn Lys Asp Tyr Arg Arg Gln Pro Ala Val Ala Glu Ser Val Val Ala 
        275                 280                 285             


Gln Ile Lys Asp Gly Asn Arg Ser Ile Ile Gly Leu Met Ile Glu Ser 
    290                 295                 300                 


Asn Ile His Glu Gly Asn Gln Ser Ser Glu Gln Pro Arg Ser Glu Met 
305                 310                 315                 320 


Lys Tyr Gly Val Ser Val Thr Asp Ala Cys Ile Ser Trp Glu Met Thr 
                325                 330                 335     


Asp Ala Leu Leu Arg Glu Ile His Gln Asp Leu Asn Gly Gln Leu Thr 
            340                 345                 350         


Ala Arg Val Ala 
        355     


<210>  42
<211>  1071
<212>  DNA
<213>  mutant AroF gene (Q152I mutant) - Escherichia coli

<400>  42
atgcaaaaag acgcgctgaa taacgtacat attaccgacg aacaggtttt aatgactccg       60

gaacaactga aggccgcttt tccattgagc ctgcaacaag aagcccagat tgctgactcg      120

cgtaaaagca tttcagatat tatcgccggg cgcgatcctc gtctgctggt agtatgtggt      180

ccttgttcca ttcatgatcc ggaaactgct ctggaatatg ctcgtcgatt taaagccctt      240

gccgcagagg tcagcgatag cctctatctg gtaatgcgcg tctattttga aaaaccccgt      300

accactgtcg gctggaaagg gttaattaac gatccccata tggatggctc ttttgatgta      360

gaagccgggc tgcagatcgc gcgtaaattg ctgcttgagc tggtgaatat gggactgcca      420

ctggcgacgg aagcgttaga tccgaatagc ccgatatacc tgggcgatct gtttagctgg      480

tcagcaattg gtgctcgtac aacggaatcg caaactcacc gtgaaatggc ctccgggctt      540

tccatgccgg ttggttttaa aaacggcacc gacggcagtc tggcaacagc aattaacgct      600

atgcgcgccg ccgcccagcc gcaccgtttt gttggcatta accaggcagg gcaggttgcg      660

ttgctacaaa ctcaggggaa tccggacggc catgtgatcc tgcgcggtgg taaagcgccg      720

aactatagcc ctgcggatgt tgcgcaatgt gaaaaagaga tggaacaggc gggactgcgc      780

ccgtctctga tggtagattg cagccacggt aattccaata aagattatcg ccgtcagcct      840

gcggtggcag aatccgtggt tgctcaaatc aaagatggca atcgctcaat tattggtctg      900

atgatcgaaa gtaatatcca cgagggcaat cagtcttccg agcaaccgcg cagtgaaatg      960

aaatacggtg tatccgtaac cgatgcctgc attagctggg aaatgaccga tgccttgctg     1020

cgtgaaattc atcaggatct gaacgggcag ctgacggctc gcgtggctta a              1071


<210>  43
<211>  356
<212>  PRT
<213>  mutant AroF protein (Q152I mutant) - Escherichia coli

<400>  43

Met Gln Lys Asp Ala Leu Asn Asn Val His Ile Thr Asp Glu Gln Val 
1               5                   10                  15      


Leu Met Thr Pro Glu Gln Leu Lys Ala Ala Phe Pro Leu Ser Leu Gln 
            20                  25                  30          


Gln Glu Ala Gln Ile Ala Asp Ser Arg Lys Ser Ile Ser Asp Ile Ile 
        35                  40                  45              


Ala Gly Arg Asp Pro Arg Leu Leu Val Val Cys Gly Pro Cys Ser Ile 
    50                  55                  60                  


His Asp Pro Glu Thr Ala Leu Glu Tyr Ala Arg Arg Phe Lys Ala Leu 
65                  70                  75                  80  


Ala Ala Glu Val Ser Asp Ser Leu Tyr Leu Val Met Arg Val Tyr Phe 
                85                  90                  95      


Glu Lys Pro Arg Thr Thr Val Gly Trp Lys Gly Leu Ile Asn Asp Pro 
            100                 105                 110         


His Met Asp Gly Ser Phe Asp Val Glu Ala Gly Leu Gln Ile Ala Arg 
        115                 120                 125             


Lys Leu Leu Leu Glu Leu Val Asn Met Gly Leu Pro Leu Ala Thr Glu 
    130                 135                 140                 


Ala Leu Asp Pro Asn Ser Pro Ile Tyr Leu Gly Asp Leu Phe Ser Trp 
145                 150                 155                 160 


Ser Ala Ile Gly Ala Arg Thr Thr Glu Ser Gln Thr His Arg Glu Met 
                165                 170                 175     


Ala Ser Gly Leu Ser Met Pro Val Gly Phe Lys Asn Gly Thr Asp Gly 
            180                 185                 190         


Ser Leu Ala Thr Ala Ile Asn Ala Met Arg Ala Ala Ala Gln Pro His 
        195                 200                 205             


Arg Phe Val Gly Ile Asn Gln Ala Gly Gln Val Ala Leu Leu Gln Thr 
    210                 215                 220                 


Gln Gly Asn Pro Asp Gly His Val Ile Leu Arg Gly Gly Lys Ala Pro 
225                 230                 235                 240 


Asn Tyr Ser Pro Ala Asp Val Ala Gln Cys Glu Lys Glu Met Glu Gln 
                245                 250                 255     


Ala Gly Leu Arg Pro Ser Leu Met Val Asp Cys Ser His Gly Asn Ser 
            260                 265                 270         


Asn Lys Asp Tyr Arg Arg Gln Pro Ala Val Ala Glu Ser Val Val Ala 
        275                 280                 285             


Gln Ile Lys Asp Gly Asn Arg Ser Ile Ile Gly Leu Met Ile Glu Ser 
    290                 295                 300                 


Asn Ile His Glu Gly Asn Gln Ser Ser Glu Gln Pro Arg Ser Glu Met 
305                 310                 315                 320 


Lys Tyr Gly Val Ser Val Thr Asp Ala Cys Ile Ser Trp Glu Met Thr 
                325                 330                 335     


Asp Ala Leu Leu Arg Glu Ile His Gln Asp Leu Asn Gly Gln Leu Thr 
            340                 345                 350         


Ala Arg Val Ala 
        355     


<210>  44
<211>  1068
<212>  DNA
<213>  mutant AroF gene (del Ile11 mutant) - Escherichia coli

<400>  44
atgcaaaaag acgcgctgaa taacgtacat accgacgaac aggttttaat gactccggaa       60

caactgaagg ccgcttttcc attgagcctg caacaagaag cccagattgc tgactcgcgt      120

aaaagcattt cagatattat cgccgggcgc gatcctcgtc tgctggtagt atgtggtcct      180

tgttccattc atgatccgga aactgctctg gaatatgctc gtcgatttaa agcccttgcc      240

gcagaggtca gcgatagcct ctatctggta atgcgcgtct attttgaaaa accccgtacc      300

actgtcggct ggaaagggtt aattaacgat ccccatatgg atggctcttt tgatgtagaa      360

gccgggctgc agatcgcgcg taaattgctg cttgagctgg tgaatatggg actgccactg      420

gcgacggaag cgttagatcc gaatagcccg caatacctgg gcgatctgtt tagctggtca      480

gcaattggtg ctcgtacaac ggaatcgcaa actcaccgtg aaatggcctc cgggctttcc      540

atgccggttg gttttaaaaa cggcaccgac ggcagtctgg caacagcaat taacgctatg      600

cgcgccgccg cccagccgca ccgttttgtt ggcattaacc aggcagggca ggttgcgttg      660

ctacaaactc aggggaatcc ggacggccat gtgatcctgc gcggtggtaa agcgccgaac      720

tatagccctg cggatgttgc gcaatgtgaa aaagagatgg aacaggcggg actgcgcccg      780

tctctgatgg tagattgcag ccacggtaat tccaataaag attatcgccg tcagcctgcg      840

gtggcagaat ccgtggttgc tcaaatcaaa gatggcaatc gctcaattat tggtctgatg      900

atcgaaagta atatccacga gggcaatcag tcttccgagc aaccgcgcag tgaaatgaaa      960

tacggtgtat ccgtaaccga tgcctgcatt agctgggaaa tgaccgatgc cttgctgcgt     1020

gaaattcatc aggatctgaa cgggcagctg acggctcgcg tggcttaa                  1068


<210>  45
<211>  355
<212>  PRT
<213>  mutant AroF protein (del Ile11 mutant) - Escherichia coli

<400>  45

Met Gln Lys Asp Ala Leu Asn Asn Val His Thr Asp Glu Gln Val Leu 
1               5                   10                  15      


Met Thr Pro Glu Gln Leu Lys Ala Ala Phe Pro Leu Ser Leu Gln Gln 
            20                  25                  30          


Glu Ala Gln Ile Ala Asp Ser Arg Lys Ser Ile Ser Asp Ile Ile Ala 
        35                  40                  45              


Gly Arg Asp Pro Arg Leu Leu Val Val Cys Gly Pro Cys Ser Ile His 
    50                  55                  60                  


Asp Pro Glu Thr Ala Leu Glu Tyr Ala Arg Arg Phe Lys Ala Leu Ala 
65                  70                  75                  80  


Ala Glu Val Ser Asp Ser Leu Tyr Leu Val Met Arg Val Tyr Phe Glu 
                85                  90                  95      


Lys Pro Arg Thr Thr Val Gly Trp Lys Gly Leu Ile Asn Asp Pro His 
            100                 105                 110         


Met Asp Gly Ser Phe Asp Val Glu Ala Gly Leu Gln Ile Ala Arg Lys 
        115                 120                 125             


Leu Leu Leu Glu Leu Val Asn Met Gly Leu Pro Leu Ala Thr Glu Ala 
    130                 135                 140                 


Leu Asp Pro Asn Ser Pro Gln Tyr Leu Gly Asp Leu Phe Ser Trp Ser 
145                 150                 155                 160 


Ala Ile Gly Ala Arg Thr Thr Glu Ser Gln Thr His Arg Glu Met Ala 
                165                 170                 175     


Ser Gly Leu Ser Met Pro Val Gly Phe Lys Asn Gly Thr Asp Gly Ser 
            180                 185                 190         


Leu Ala Thr Ala Ile Asn Ala Met Arg Ala Ala Ala Gln Pro His Arg 
        195                 200                 205             


Phe Val Gly Ile Asn Gln Ala Gly Gln Val Ala Leu Leu Gln Thr Gln 
    210                 215                 220                 


Gly Asn Pro Asp Gly His Val Ile Leu Arg Gly Gly Lys Ala Pro Asn 
225                 230                 235                 240 


Tyr Ser Pro Ala Asp Val Ala Gln Cys Glu Lys Glu Met Glu Gln Ala 
                245                 250                 255     


Gly Leu Arg Pro Ser Leu Met Val Asp Cys Ser His Gly Asn Ser Asn 
            260                 265                 270         


Lys Asp Tyr Arg Arg Gln Pro Ala Val Ala Glu Ser Val Val Ala Gln 
        275                 280                 285             


Ile Lys Asp Gly Asn Arg Ser Ile Ile Gly Leu Met Ile Glu Ser Asn 
    290                 295                 300                 


Ile His Glu Gly Asn Gln Ser Ser Glu Gln Pro Arg Ser Glu Met Lys 
305                 310                 315                 320 


Tyr Gly Val Ser Val Thr Asp Ala Cys Ile Ser Trp Glu Met Thr Asp 
                325                 330                 335     


Ala Leu Leu Arg Glu Ile His Gln Asp Leu Asn Gly Gln Leu Thr Ala 
            340                 345                 350         


Arg Val Ala 
        355 


<210>  46
<211>  500
<212>  DNA
<213>  Pyrroloquinoline quinone biosynthesis protein A (pqqA) promoter gene - Methylococcus capsulatus str. Bath

<400>  46
ctcgctgctt cagggcagcg tcttcgccac ggaatacatc tatcgcgacc tgatgggcaa       60

tacgctcccg ccccagaaat gctcgtcgag aacggaagcc gaagccgccg catccgacga      120

ctacaacctc aaaaagcacg cccggatatt ctgtgaatcc cagggctatg gctggcatgt      180

cgaacagcgc aaaagtacgg gcaagctggt ctgcgaggaa tgcagcgaag gcggcgacaa      240

cggccgcttc cgctgccata tggaagacgt ggtcgtacag tgcaagcgga tcaaacccgg      300

ttctgtcggg ttgattccag gccagggctg aggggcccgc ctgacgttca ctgatgagcg      360

atacaagccg gcgtacggcc gctgtttcgg gaaaaactct gttgcgtagg agcggggcgg      420

tacgtatact ggctgccgag ctgtaacctg ctctttctct tgaagaccag acgtataaca      480

ttatctggag gacatttaag                                                  500


<210>  47
<211>  549
<212>  DNA
<213>  3-hexulose-6-phosphate synthase (hps) promoter gene - Methylococcus capsulatus str. Bath

<400>  47
gctcggatta ttacctgttc ggcggcgtgg tcgccgccct catcggcttc gtgctctgga       60

gcagccgccg tccggccacc gcccacccgg ccgcaaccgc cgcagcggcc gcccccgccg      120

ccgcaccggc gccggcagcc accggcgtcg ccaaatacct ccaggcccaa gggctcgcca      180

ccggccccga aaccggtgtc gccaaatacc tcaaggcgct gccggaaccc gtccgcaccc      240

ccgaaaccgg cgtggcccgc tacctcaaga acctgccact gcccgaagtg gctgccgccg      300

ccgaaaccgg tgtcgccagg tacatcaaga acctgcccaa gcccgccgtc gtggccacgg      360

gcgaaaccgg cgtcaccaag tacctgaaaa gcctcaacgg ctgaccctgc gaaagggggg      420

gctgcattga cagcggcccc ctcactcttt acagttggga aattgtgcgt ttagcaatac      480

cccataactc ctaaggctta agcccgactt ttttctctct accttacttt tttgatcgga      540

ggagatctc                                                              549


<210>  48
<211>  809
<212>  DNA
<213>  Sigma 70 gene promoter - Methylococcus capsulatus str. Bath

<400>  48
tgcttcagat agcgggcggc cgtggcaccg ccataaccgc caccgacgac cacgacccgg       60

ccgcctgagc gaagaccctt gccggaagcg cagccaccca gtccccagcc catgccgcag      120

actgccagca ggcgcaggaa ccgccgccgg cggatcatgg cctctttccc agaaagccgg      180

cgatcgcttc aatgtcctgg ttcgtgagtc cggcagcgat ccggttcatg accgtgcccg      240

atctttttcc ttcacggtac tcccgcaaca gagatgccat ctccttcgca tcgaagcggc      300

gtaacgatgc cggttcggga atctgctcct cctcgtcggc atggcagccg aggcaaccga      360

gcgcagccaa aaccatgtcc ggtttttcgg cccgggccgg gaaacagaga gcgacagcga      420

tcgtaccgat ctggacgcgt ctcacaaacg acaaaacgtc acgatgggtg ttcggtagct      480

gagtcacggg gatttgtaga agtataggac cgacggattt tatgcaagca tgtcgctttg      540

accaagccgg gattccatgg aagggatgtc atcgggagag ttatttatgt cgttgattta      600

taagaaacta cccctgcgtc aaaatgtcgc agatttttct tgacagtttg ggggagggtg      660

atagactccc tccaccgatg gaccggtacc gcctctgttg cggggtccat gaaatgcccg      720

ttagaggcag aaccgatagg gaattagaga agcgggcgtc ggcgccgaat gccggcccct      780

gtcaaccatc actttaggag gaacaaaca                                        809


<210>  49
<211>  500
<212>  DNA
<213>  Formaldehyde activating enzyme 2 (fae2) promoter gene - Methylococcus capsulatus str. Bath

<400>  49
caatacacct gcgactatac ggatgttttt gatttcatga ttccacgcta tcatagtgcg       60

ctgtcccgat gggacaatga aggctgggcg gcggtggctg cgttttccgg catgcttgct      120

cgcataccgg atagccacat cgaacgcaaa ttcgggcccc gctactcccc gtgggtatcg      180

gagaagatgg tgctcctcga gaaaacgctg tcctatgccg tgcgacctga ttcggttttg      240

gggcttctac gggacgtgga tgccgaattc aagacgcgcg gaattaaccc tggaacgact      300

gccgacctga cggtcgccgg tctgctcgcc gtgcgcctgg aggcgatttt taccgggacg      360

ggccggggtt aaaccttcgg acacgtgggg ccgatgcggg cactccgcga gaaacttcgg      420

tctcgaaaac ctttcccggg ggaacaacca tgtccttatt aaaaaagttt ttttcgtttt      480

tttcacaaga ggaaattcag                                                  500


<210>  50
<211>  816
<212>  DNA
<213>  Kanamycin Resistance (KanR) gene - from vector pCR2.1-TOPO (Ali and Murrell 2009)

<400>  50
atgagccata ttcaacggga aacgtcttgc tcgaggccgc gattaaattc caacatggat       60

gctgatttat atgggtataa atgggctcgc gataatgtcg ggcaatcagg tgcgacaatc      120

tatcgattgt atgggaagcc cgatgcgcca gagttgtttc tgaaacatgg caaaggtagc      180

gttgccaatg atgttacaga tgagatggtc agactaaact ggctgacgga atttatgcct      240

cttccgacca tcaagcattt tatccgtact cctgatgatg catggttact caccactgcg      300

atccccggga aaacagcatt ccaggtatta gaagaatatc ctgattcagg tgaaaatatt      360

gttgatgcgc tggcagtgtt cctgcgccgg ttgcattcga ttcctgtttg taattgtcct      420

tttaacagcg atcgcgtatt tcgtctcgct caggcgcaat cacgaatgaa taacggtttg      480

gttgatgcga gtgattttga tgacgagcgt aatggctggc ctgttgaaca agtctggaaa      540

gaaatgcata agcttttgcc attctcaccg gattcagtcg tcactcatgg tgatttctca      600

cttgataacc ttatttttga cgaggggaaa ttaataggtt gtattgatgt tggacgagtc      660

ggaatcgcag accgatacca ggatcttgcc atcctatgga actgcctcgg tgagttttct      720

ccttcattac agaaacggct ttttcaaaaa tatggtattg ataatcctga tatgaataaa      780

ttgcagtttc atttgatgct cgatgagttt ttctaa                                816


<210>  51
<211>  271
<212>  PRT
<213>  Kanamycin Resistance (KanR) protein

<400>  51

Met Ser His Ile Gln Arg Glu Thr Ser Cys Ser Arg Pro Arg Leu Asn 
1               5                   10                  15      


Ser Asn Met Asp Ala Asp Leu Tyr Gly Tyr Lys Trp Ala Arg Asp Asn 
            20                  25                  30          


Val Gly Gln Ser Gly Ala Thr Ile Tyr Arg Leu Tyr Gly Lys Pro Asp 
        35                  40                  45              


Ala Pro Glu Leu Phe Leu Lys His Gly Lys Gly Ser Val Ala Asn Asp 
    50                  55                  60                  


Val Thr Asp Glu Met Val Arg Leu Asn Trp Leu Thr Glu Phe Met Pro 
65                  70                  75                  80  


Leu Pro Thr Ile Lys His Phe Ile Arg Thr Pro Asp Asp Ala Trp Leu 
                85                  90                  95      


Leu Thr Thr Ala Ile Pro Gly Lys Thr Ala Phe Gln Val Leu Glu Glu 
            100                 105                 110         


Tyr Pro Asp Ser Gly Glu Asn Ile Val Asp Ala Leu Ala Val Phe Leu 
        115                 120                 125             


Arg Arg Leu His Ser Ile Pro Val Cys Asn Cys Pro Phe Asn Ser Asp 
    130                 135                 140                 


Arg Val Phe Arg Leu Ala Gln Ala Gln Ser Arg Met Asn Asn Gly Leu 
145                 150                 155                 160 


Val Asp Ala Ser Asp Phe Asp Asp Glu Arg Asn Gly Trp Pro Val Glu 
                165                 170                 175     


Gln Val Trp Lys Glu Met His Lys Leu Leu Pro Phe Ser Pro Asp Ser 
            180                 185                 190         


Val Val Thr His Gly Asp Phe Ser Leu Asp Asn Leu Ile Phe Asp Glu 
        195                 200                 205             


Gly Lys Leu Ile Gly Cys Ile Asp Val Gly Arg Val Gly Ile Ala Asp 
    210                 215                 220                 


Arg Tyr Gln Asp Leu Ala Ile Leu Trp Asn Cys Leu Gly Glu Phe Ser 
225                 230                 235                 240 


Pro Ser Leu Gln Lys Arg Leu Phe Gln Lys Tyr Gly Ile Asp Asn Pro 
                245                 250                 255     


Asp Met Asn Lys Leu Gln Phe His Leu Met Leu Asp Glu Phe Phe 
            260                 265                 270     


<210>  52
<211>  589
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ColE1 plasmid Ori - from pUC18/19 vector (Ali and Murrell 2009)

<400>  52
tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg       60

gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg      120

ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag      180

cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc      240

caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa      300

ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg      360

taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc      420

taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac      480

cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg      540

tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaa                  589


<210>  53
<211>  110
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  OriT (origin of transfer) - from pMJ153 vector (Ali and Murrell 
       2009)

<400>  53
gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca ctgtccctta       60

ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg                 110


<210>  54
<211>  632
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  OriV (origin of replication)

<400>  54
agcgggccgg gagggttcga gaaggggggg cacccccctt cggcgtgcgc ggtcacgcgc       60

acagggcgca gccctggtta aaaacaaggt ttataaatat tggtttaaaa gcaggttaaa      120

agacaggtta gcggtggccg aaaaacgggc ggaaaccctt gcaaatgctg gattttctgc      180

ctgtggacag cccctcaaat gtcaataggt gcgcccctca tctgtcagca ctctgcccct      240

caagtgtcaa ggatcgcgcc cctcatctgt cagtagtcgc gcccctcaag tgtcaatacc      300

gcagggcact tatccccagg cttgtccaca tcatctgtgg gaaactcgcg taaaatcagg      360

cgttttcgcc gatttgcgag gctggccagc tccacgtcgc cggccgaaat cgagcctgcc      420

cctcatctgt caacgccgcg ccgggtgagt cggcccctca agtgtcaacg tccgcccctc      480

atctgtcagt gagggccaag ttttccgcga ggtatccaca acgccggcgg ccgcggtgtc      540

tcgcacacgg cttcgacggc gtttctggcg cgtttgcagg gccatagacg gccgccagcc      600

cagcggcgag ggcaaccagc ccggtgagcg tc                                    632


<210>  55
<211>  372
<212>  DNA
<213>  TraJ - Escherichia coli S17-1

<400>  55
atggctgatg aaaccaagcc aaccaggaag ggcagcccac ctatcaaggt gtactgcctt       60

ccagacgaac gaagagcgat tgaggaaaag gcggcggcgg ccggcatgag cctgtaggcc      120

tacctgctgg ccgtcggcca gggctacaaa atcacgggcg tcgtggacta tgagcacgtc      180

cgcgagctgg cccgcatcaa tggcgacctg ggccgcctgg gcggcctgct gaaactctgg      240

ctcaccgacg acccgcgcac ggcgcggttc ggtgatgcca cgatcctcgc cctgctggcg      300

aagatcgaag agaagcagga cgagcttggc aaggtcatga tgggcgtggt ccgcccgagg      360

gcagagccat ga                                                          372


<210>  56
<211>  122
<212>  PRT
<213>  TraJ protein - Escherichia coli S17-1

<400>  56

Met Ala Asp Glu Thr Lys Pro Thr Arg Lys Gly Ser Pro Pro Ile Lys 
1               5                   10                  15      


Val Tyr Cys Leu Pro Asp Glu Arg Arg Ala Ile Glu Glu Lys Ala Ala 
            20                  25                  30          


Ala Ala Gly Met Ser Leu Ala Tyr Leu Leu Ala Val Gly Gln Gly Tyr 
        35                  40                  45              


Lys Ile Thr Gly Val Val Asp Tyr Glu His Val Arg Glu Leu Ala Arg 
    50                  55                  60                  


Ile Asn Gly Asp Leu Gly Arg Leu Gly Gly Leu Leu Lys Leu Trp Leu 
65                  70                  75                  80  


Thr Asp Asp Pro Arg Thr Ala Arg Phe Gly Asp Ala Thr Ile Leu Ala 
                85                  90                  95      


Leu Leu Ala Lys Ile Glu Glu Lys Gln Asp Glu Leu Gly Lys Val Met 
            100                 105                 110         


Met Gly Val Val Arg Pro Arg Ala Glu Pro 
        115                 120         


<210>  57
<211>  1149
<212>  DNA
<213>  TrfA (Plasmid replication initiator) gene - Escherichia coli S17-1

<400>  57
atgaatcgga cgtttgaccg gaaggcatac aggcaagaac tgatcgacgc ggggttttcc       60

gccgaggatg ccgaaaccat cgcaagccgc accgtcatgc gtgcgccccg cgaaaccttc      120

cagtccgtcg gctcgatggt ccagcaagct acggccaaga tcgagcgcga cagcgtgcaa      180

ctggctcccc ctgccctgcc cgcgccatcg gccgccgtgg agcgttcgcg tcgtctcgaa      240

caggaggcgg caggtttggc gaagtcgatg accatcgaca cgcgaggaac tatgacgacc      300

aagaagcgaa aaaccgccgg cgaggacctg gcaaaacagg tcagcgaggc caagcaggcc      360

gcgttgctga aacacacgaa gcagcagatc aaggaaatgc agctttcctt gttcgatatt      420

gcgccgtggc cggacacgat gcgagcgatg ccaaacgaca cggcccgctc tgccctgttc      480

accacgcgca acaagaaaat cccgcgcgag gcgctgcaaa acaaggtcat tttccacgtc      540

aacaaggacg tgaagatcac ctacaccggc gtcgagctgc gggccgacga tgacgaactg      600

gtgtggcagc aggtgttgga gtacgcgaag cgcaccccta tcggcgagcc gatcaccttc      660

acgttctacg agctttgcca ggacctgggc tggtcgatca atggccggta ttacacgaag      720

gccgaggaat gcctgtcgcg cctacaggcg acggcgatgg gcttcacgtc cgaccgcgtt      780

gggcacctgg aatcggtgtc gctgctgcac cgcttccgcg tcctggaccg tggcaagaaa      840

acgtcccgtt gccaggtcct gatcgacgag gaaatcgtcg tgctgtttgc tggcgaccac      900

tacacgaaat tcatatggga gaagtaccgc aagctgtcgc cgacggcccg acggatgttc      960

gactatttca gctcgcaccg ggagccgtac ccgctcaagc tggaaacctt ccgcctcatg     1020

tgcggatcgg attccacccg cgtgaagaag tggcgcgagc aggtcggcga agcctgcgaa     1080

gagttgcgag gcagcggcct ggtggaacac gcctgggtca atgatgacct ggtgcattgc     1140

aaacgctag                                                             1149


<210>  58
<211>  382
<212>  PRT
<213>  TrfA (Plasmid replication initiator) protein - Escherichia coli S17-1

<400>  58

Met Asn Arg Thr Phe Asp Arg Lys Ala Tyr Arg Gln Glu Leu Ile Asp 
1               5                   10                  15      


Ala Gly Phe Ser Ala Glu Asp Ala Glu Thr Ile Ala Ser Arg Thr Val 
            20                  25                  30          


Met Arg Ala Pro Arg Glu Thr Phe Gln Ser Val Gly Ser Met Val Gln 
        35                  40                  45              


Gln Ala Thr Ala Lys Ile Glu Arg Asp Ser Val Gln Leu Ala Pro Pro 
    50                  55                  60                  


Ala Leu Pro Ala Pro Ser Ala Ala Val Glu Arg Ser Arg Arg Leu Glu 
65                  70                  75                  80  


Gln Glu Ala Ala Gly Leu Ala Lys Ser Met Thr Ile Asp Thr Arg Gly 
                85                  90                  95      


Thr Met Thr Thr Lys Lys Arg Lys Thr Ala Gly Glu Asp Leu Ala Lys 
            100                 105                 110         


Gln Val Ser Glu Ala Lys Gln Ala Ala Leu Leu Lys His Thr Lys Gln 
        115                 120                 125             


Gln Ile Lys Glu Met Gln Leu Ser Leu Phe Asp Ile Ala Pro Trp Pro 
    130                 135                 140                 


Asp Thr Met Arg Ala Met Pro Asn Asp Thr Ala Arg Ser Ala Leu Phe 
145                 150                 155                 160 


Thr Thr Arg Asn Lys Lys Ile Pro Arg Glu Ala Leu Gln Asn Lys Val 
                165                 170                 175     


Ile Phe His Val Asn Lys Asp Val Lys Ile Thr Tyr Thr Gly Val Glu 
            180                 185                 190         


Leu Arg Ala Asp Asp Asp Glu Leu Val Trp Gln Gln Val Leu Glu Tyr 
        195                 200                 205             


Ala Lys Arg Thr Pro Ile Gly Glu Pro Ile Thr Phe Thr Phe Tyr Glu 
    210                 215                 220                 


Leu Cys Gln Asp Leu Gly Trp Ser Ile Asn Gly Arg Tyr Tyr Thr Lys 
225                 230                 235                 240 


Ala Glu Glu Cys Leu Ser Arg Leu Gln Ala Thr Ala Met Gly Phe Thr 
                245                 250                 255     


Ser Asp Arg Val Gly His Leu Glu Ser Val Ser Leu Leu His Arg Phe 
            260                 265                 270         


Arg Val Leu Asp Arg Gly Lys Lys Thr Ser Arg Cys Gln Val Leu Ile 
        275                 280                 285             


Asp Glu Glu Ile Val Val Leu Phe Ala Gly Asp His Tyr Thr Lys Phe 
    290                 295                 300                 


Ile Trp Glu Lys Tyr Arg Lys Leu Ser Pro Thr Ala Arg Arg Met Phe 
305                 310                 315                 320 


Asp Tyr Phe Ser Ser His Arg Glu Pro Tyr Pro Leu Lys Leu Glu Thr 
                325                 330                 335     


Phe Arg Leu Met Cys Gly Ser Asp Ser Thr Arg Val Lys Lys Trp Arg 
            340                 345                 350         


Glu Gln Val Gly Glu Ala Cys Glu Glu Leu Arg Gly Ser Gly Leu Val 
        355                 360                 365             


Glu His Ala Trp Val Asn Asp Asp Leu Val His Cys Lys Arg 
    370                 375                 380         



<210>  59
<211>  1194
<212>  DNA
<213>  tyrB (Aromatic-amino-acid aminotransferase) gene - Escherichia coli

<400>  59
gtgtttcaaa aagttgacgc ctacgctggc gacccgattc ttacgcttat ggagcgtttt       60

aaagaagacc ctcgcagcga caaagtgaat ttaagtatcg gtctgtacta caacgaagac      120

ggaattattc cacaactgca agccgtggcg gaggcggaag cgcgcctgaa tgcgcagcct      180

catggcgctt cgctttattt accgatggaa gggcttaact gctatcgcca tgccattgcg      240

ccgctgctgt ttggtgcgga ccatccggta ctgaaacaac agcgcgtagc aaccattcaa      300

acccttggcg gctccggggc attgaaagtg ggcgcggatt tcctgaaacg ctacttcccg      360

gaatcaggcg tctgggtcag cgatcctacc tgggaaaacc acgtagcaat attcgccggg      420

gctggattcg aagtgagtac ttacccctgg tatgacgaag cgactaacgg cgtgcgcttt      480

aatgacctgt tggcgacgct gaaaacatta cctgcccgca gtattgtgtt gctgcatcca      540

tgttgccaca acccaacggg tgccgatctc actaatgatc agtgggatgc ggtgattgaa      600

attctcaaag cccgcgagct tattccattc ctcgatattg cctatcaagg atttggtgcc      660

ggtatggaag aggatgccta cgctattcgc gccattgcca gcgctggatt acccgctctg      720

gtgagcaatt cgttctcgaa aattttctcc ctttacggcg agcgcgtcgg cggactttct      780

gttatgtgtg aagatgccga agccgctggc cgcgtactgg ggcaattgaa agcaacagtt      840

cgccgcaact actccagccc gccgaatttt ggtgcgcagg tggtggctgc agtgctgaat      900

gacgaggcat tgaaagccag ctggctggcg gaagtagaag agatgcgtac tcgcattctg      960

gcaatgcgtc aggaattggt gaaggtatta agcacagaga tgccagaacg caatttcgat     1020

tatctgctta atcagcgcgg catgttcagt tataccggtt taagtgccgc tcaggttgac     1080

cgactacgtg aagaatttgg tgtctatctc atcgccagcg gtcgcatgtg tgtcgccggg     1140

ttaaatacgg caaatgtaca acgtgtggca aaggcgtttg ctgcggtgat gtaa           1194


<210>  60
<211>  397
<212>  PRT
<213>  tyrB (Aromatic-amino-acid aminotransferase) protein - Escherichia coli

<400>  60

Met Phe Gln Lys Val Asp Ala Tyr Ala Gly Asp Pro Ile Leu Thr Leu 
1               5                   10                  15      


Met Glu Arg Phe Lys Glu Asp Pro Arg Ser Asp Lys Val Asn Leu Ser 
            20                  25                  30          


Ile Gly Leu Tyr Tyr Asn Glu Asp Gly Ile Ile Pro Gln Leu Gln Ala 
        35                  40                  45              


Val Ala Glu Ala Glu Ala Arg Leu Asn Ala Gln Pro His Gly Ala Ser 
    50                  55                  60                  


Leu Tyr Leu Pro Met Glu Gly Leu Asn Cys Tyr Arg His Ala Ile Ala 
65                  70                  75                  80  


Pro Leu Leu Phe Gly Ala Asp His Pro Val Leu Lys Gln Gln Arg Val 
                85                  90                  95      


Ala Thr Ile Gln Thr Leu Gly Gly Ser Gly Ala Leu Lys Val Gly Ala 
            100                 105                 110         


Asp Phe Leu Lys Arg Tyr Phe Pro Glu Ser Gly Val Trp Val Ser Asp 
        115                 120                 125             


Pro Thr Trp Glu Asn His Val Ala Ile Phe Ala Gly Ala Gly Phe Glu 
    130                 135                 140                 


Val Ser Thr Tyr Pro Trp Tyr Asp Glu Ala Thr Asn Gly Val Arg Phe 
145                 150                 155                 160 


Asn Asp Leu Leu Ala Thr Leu Lys Thr Leu Pro Ala Arg Ser Ile Val 
                165                 170                 175     


Leu Leu His Pro Cys Cys His Asn Pro Thr Gly Ala Asp Leu Thr Asn 
            180                 185                 190         


Asp Gln Trp Asp Ala Val Ile Glu Ile Leu Lys Ala Arg Glu Leu Ile 
        195                 200                 205             


Pro Phe Leu Asp Ile Ala Tyr Gln Gly Phe Gly Ala Gly Met Glu Glu 
    210                 215                 220                 


Asp Ala Tyr Ala Ile Arg Ala Ile Ala Ser Ala Gly Leu Pro Ala Leu 
225                 230                 235                 240 


Val Ser Asn Ser Phe Ser Lys Ile Phe Ser Leu Tyr Gly Glu Arg Val 
                245                 250                 255     


Gly Gly Leu Ser Val Met Cys Glu Asp Ala Glu Ala Ala Gly Arg Val 
            260                 265                 270         


Leu Gly Gln Leu Lys Ala Thr Val Arg Arg Asn Tyr Ser Ser Pro Pro 
        275                 280                 285             


Asn Phe Gly Ala Gln Val Val Ala Ala Val Leu Asn Asp Glu Ala Leu 
    290                 295                 300                 


Lys Ala Ser Trp Leu Ala Glu Val Glu Glu Met Arg Thr Arg Ile Leu 
305                 310                 315                 320 


Ala Met Arg Gln Glu Leu Val Lys Val Leu Ser Thr Glu Met Pro Glu 
                325                 330                 335     


Arg Asn Phe Asp Tyr Leu Leu Asn Gln Arg Gly Met Phe Ser Tyr Thr 
            340                 345                 350         


Gly Leu Ser Ala Ala Gln Val Asp Arg Leu Arg Glu Glu Phe Gly Val 
        355                 360                 365             


Tyr Leu Ile Ala Ser Gly Arg Met Cys Val Ala Gly Leu Asn Thr Ala 
    370                 375                 380                 


Asn Val Gln Arg Val Ala Lys Ala Phe Ala Ala Val Met 
385                 390                 395         


