                         SEQUENCE LISTING

<110>  BASF SE
 
<120>  AMYCOLATOPSIS STRAINS FOR VANILLIN PRODUCTION WITH SUPPRESSED 
       VANILLIC ACID FORMATION

<130>  074008-2039-00-WO-000325

<150>  US 63/127,519
<151>  2020-12-18

<160>  6     

<170>  PatentIn version 3.5

<210>  1
<211>  4575
<212>  DNA
<213>  Amycolatopsis sp. ATCC 39116


<220>
<221>  misc_feature
<222>  (1)..(4575)
<223>  Nucleic Acid Sequence of gltB

<220>
<221>  misc_feature
<222>  (4573)..(4575)
<223>  stop codon

<400>  1
atgatcttct ccgccaaccc gggcaagcag ggcctgtacg accctgccat ggagcaggat       60

tcctgcggtg tggcgatggt ggccgacatt cagggccggc gcacgcacgc catcgtcacg      120

gacgggctga cggcgctgat caacctggac caccggggcg ccgcgggcgc cgaaccgact      180

tccggcgacg gcgccgggat cctcgtgcag ctgcccgacc agctgctccg cgaggaagcc      240

ggcttcgagc tgcccgaacc cgacgcgcag ggccaccacc gctacgcggc cggcatcgcg      300

ttcctgcccg ccgaggagga ggcgcgcggc aaggccgtgg cgctgatcga acgcctcgcc      360

gacgaggaga gcctcgaggt gctgggctgg cgcgaggtcc cggtcgacgc cgacggggcg      420

gacatcggcc ccaccgcgcg ttcggtgatg ccgcacttcg ccatgctgtt cgtggcgggc      480

aggccggacg ccgagggcgt gcggccctcc ggcctcgcgc tggaccggct caccttctgc      540

ctgcgcaagc gcgtcgagca cgagagcgtc gtggccgagt gcggcacgta cttcccgtcg      600

ctgtcctcgc gcaccttggt ctacaaggga atgctcacgc ccgagcagct ccccgcgttc      660

ttcggcgacc tgcgcgaccc gcggctcacc agcgcgatcg cactggtgca cagccgcttt      720

tccaccaaca cgttcccgtc gtggccgctg gcgcacccgt tccggttcgt cgcgcacaac      780

ggtgagatca acacgatccg cggcaaccgc aaccgcatgc gggcccgcga ggcgctgctc      840

gaatcgggcc tgatcccggg cgacctgacc cggctgtacc cgatctgctc gccggaggcg      900

tccgactcgg cgtccttcga cgaggtgctc gaactgctgc acctgggcgg tcgcagcctg      960

ccgcacgcgg tgctgatgat gatcccggag gcgtgggaga accactcgac catggacgcg     1020

cagcgccgcg cgttctacca gttccacgcc agcctgatgg agccgtggga cggccccgcg     1080

tgcgtcacct tcaccgacgg cacgctcgtc ggcgcggtgc tggaccgcaa cggcctgcgc     1140

ccgtgccgct ggtggcgcac cgccgacgac cgcgtcgtgc tggccagcga ggccggcgtc     1200

ctggacgtgc cgccggacca ggtggtcgcc aagggccgcc tcaagccggg ccgcatgttc     1260

ctggtggaca ccgaggcagg ccgcatcgtc gccgacgacg aggtcaagtc ggagctggcg     1320

aagcagcacc cgtacgagga gtggctgcac gccggcctgc tgcagctggc cgacctgcag     1380

gaccgcgacc acgtcacgca gagccacgac tcggtgctgc gccgccagct cgccttcggc     1440

tactccgagg aggagctgaa gatcctgctc gcgccgatgg ccgagaaggg cgccgagccg     1500

ctgggctcga tgggcaccga caccccgccc gccgtgctgt ccaagcgctc gcggcagctc     1560

tacgactact tcaaacaggc cttcgcccag gtgaccaacc cgccgctgga cgcgatccgc     1620

gaggagctgg tcacctcgat gagccggatc atgggtcccg agcgcaacct gctcgaccct     1680

ggcccggcct cgtgccggca catccagctg ccgtacccgg tcatcgacaa cgacgagctg     1740

gccaagctca tccacatcaa cgacgacggt gacctgcccg gcttcgcctg caccgtcctg     1800

tccggactgt tcgaagtgga cggcggcggc aaggcgctgg cggaggcgat cgagcgggtg     1860

cgccgcgagg cgtccgaggc gatcgcggcg ggcgcgcgca cgctcgtgct gtccgaccgg     1920

gactccgacc acaagatggc gccgatcccg tcgctgctgc tggtttccgc ggtgcaccac     1980

cacctggtgc gcaccaagga gcggctgcgc gtcgcgctcg tcgtcgagac cggtgacgcc     2040

cgcgaggtgc accacatcgc gctgctgctc ggttacggcg cggccgcggt gaacccgtac     2100

ctggccttcg agacgatcga ggacatgatc gcgcagggcg cgatcaccgg catcgagccg     2160

cgcaaggccg tgcgcaacta cgtcaacgcg ctcgtcaagg gcgtcctgaa gatcatgtcc     2220

aagatgggca tctcgaccgt cggcgcctac accgcggcgc aggtgttcga gtccttcggg     2280

ctgtcgcagg aactgctcga cgagtacttc accggcacgg tgtccaagct cggcggcgtc     2340

ggtctcgacg tgctcgccga ggaggtcgcc gtccggcacc gccgggcgta cccggacaac     2400

cccaccgacc gggtgcaccg ccgcctggac agcggcggcg agtacgccta ccgccgcgag     2460

ggcgagctgc acctgttcac cccggagacc gtgttcctgc tgcagcacgc cagcaagacg     2520

cgccgcgacg aggtgtaccg caagtacacc gaagaggtgc accgcctgtc ccgcgagggc     2580

ggtgcgctgc gcgggttgtt caagttccgc aaggaaggcc gcgccccggt gccgatcgac     2640

gaggtcgagt cggtcgagtc gatctgcaag cggttcaaca ccggcgcgat gtcctacggt     2700

tcgatctcgg ccgaggcgca ccagacgctc gcgatcgcga tgaaccgcat cggcggccgc     2760

tccaacaccg gtgagggcgg cgaggacccg gagcggctct acgaccccga gcggcgcagc     2820

gcgatcaagc aggtcgccag cggctggttc ggcgtgacga gcgagtacct ggtcaacgcc     2880

gacgacatcc agatcaagat ggcgcagggc gccaagcccg gcgagggcgg ccagctgccg     2940

ccgaacaagg tgtacccgtg gatcgcgcgc acccggcact ccacgccggg cgtcggcctc     3000

atctcgccgc cgccgcacca cgacatctac tcgatcgagg acctggcgca gctgatccac     3060

gacctgaaga acgccaacga gcaggcccgc atccacgtga agctggtcag ctcgctcggc     3120

gtcggcacgg tcgcggccgg cgtgtccaag gcgcacgcgg acgtcgtgct gatctccggc     3180

cacgacggcg gcaccggcgc ctcgccgctg aactcgctca agcacgcggg cacgccgtgg     3240

gagatcggcc ttgccgagac ccagcagacc ctgatgctga acgggctgcg cgaccgcatc     3300

accgtgcagg tcgacggcgc gatgaagacc ggccgcgacg tcgtggtcgc cgcgctgctc     3360

ggcgccgagg agtacggctt cgcgaccgcg ccgctgatcg tggccggctg catcatgatg     3420

cgcgtctgcc acctcgacac ctgccccgtc ggtgtcgcca cccagagccc ggagctgcgc     3480

aagcgctaca ccgggcaggc cgagcacgtg gtgaactact tccggttcgt cgcgcaggag     3540

gtccgggaac tgctggcgga gctgggtttc cgcaccctgg acgaggcgat cggccgggcc     3600

gacgtgctgg acaccgacga cgccgtcgac cactggaagg ccagcggcct ggacctgtcg     3660

ccgatcttcc agatgccgac cgacaccccg tacggcggcg cccggcgcaa gatccgcgag     3720

caggaccacg gcctcgagca cgctctggac cgcacgctga tccagttgtc cgaggcggcg     3780

ctggaggacg cgcacccggt ccggctggaa ctgccggtgc gcaacgtcaa tcgcaccgtc     3840

ggcacactgc tgggctcgga gatcacccgc cgctacggcg gggagggcct gcccgacggc     3900

acgatccaca tccggctcac cgggtcggcg gggcagtcgc tgggcgcgtt cctgccgcgc     3960

ggcgtcacgc tggagatggt gggcgacgcc aacgactacg tcggcaaggg cctgtccggc     4020

ggccgcatca tcgtgcggcc gcacccggac gcgacgttcg ccgctgaacg tcaggtcatc     4080

gccggcaaca cgctggccta cggcgccacc gcgggggaga tgttcctgcg cgggcatgtg     4140

ggcgagcggt tctgcgtacg caactcgggc gccaccgtcg tcgccgaggg cgtgggcgac     4200

cacgccttcg aatacatgac cggtggccgt gccgtggtgc tcggtccgac cggccgcaac     4260

ctcgccgcgg gcatgtccgg cggtatcggc tacgtcctcg acctcgacca gggcagcgtc     4320

aaccgcgaga tggtcgagct gctcccgctc gagcccgagg atctgaactg gttgaaggac     4380

atcgtgaccc gtcaccacga actcacccgc tcggcggtcg ccgcctcgct gctcggcgat     4440

tggccgcgcc ggtcggcgag cttcacgaag gtcatgccgc gcgactacaa gcgcgtgctg     4500

gaggcgacca aggccgcgaa ggccgcgggc cgcgacgtcg acgaggcgat catggaggtg     4560

gcgtctcgtg gctga                                                      4575


<210>  2
<211>  1524
<212>  PRT
<213>  Amycolatopsis sp. ATCC 39116


<220>
<221>  MISC_FEATURE
<222>  (1)..(1524)
<223>  Amino Acid Sequence of GltB

<400>  2

Met Ile Phe Ser Ala Asn Pro Gly Lys Gln Gly Leu Tyr Asp Pro Ala 
1               5                   10                  15      


Met Glu Gln Asp Ser Cys Gly Val Ala Met Val Ala Asp Ile Gln Gly 
            20                  25                  30          


Arg Arg Thr His Ala Ile Val Thr Asp Gly Leu Thr Ala Leu Ile Asn 
        35                  40                  45              


Leu Asp His Arg Gly Ala Ala Gly Ala Glu Pro Thr Ser Gly Asp Gly 
    50                  55                  60                  


Ala Gly Ile Leu Val Gln Leu Pro Asp Gln Leu Leu Arg Glu Glu Ala 
65                  70                  75                  80  


Gly Phe Glu Leu Pro Glu Pro Asp Ala Gln Gly His His Arg Tyr Ala 
                85                  90                  95      


Ala Gly Ile Ala Phe Leu Pro Ala Glu Glu Glu Ala Arg Gly Lys Ala 
            100                 105                 110         


Val Ala Leu Ile Glu Arg Leu Ala Asp Glu Glu Ser Leu Glu Val Leu 
        115                 120                 125             


Gly Trp Arg Glu Val Pro Val Asp Ala Asp Gly Ala Asp Ile Gly Pro 
    130                 135                 140                 


Thr Ala Arg Ser Val Met Pro His Phe Ala Met Leu Phe Val Ala Gly 
145                 150                 155                 160 


Arg Pro Asp Ala Glu Gly Val Arg Pro Ser Gly Leu Ala Leu Asp Arg 
                165                 170                 175     


Leu Thr Phe Cys Leu Arg Lys Arg Val Glu His Glu Ser Val Val Ala 
            180                 185                 190         


Glu Cys Gly Thr Tyr Phe Pro Ser Leu Ser Ser Arg Thr Leu Val Tyr 
        195                 200                 205             


Lys Gly Met Leu Thr Pro Glu Gln Leu Pro Ala Phe Phe Gly Asp Leu 
    210                 215                 220                 


Arg Asp Pro Arg Leu Thr Ser Ala Ile Ala Leu Val His Ser Arg Phe 
225                 230                 235                 240 


Ser Thr Asn Thr Phe Pro Ser Trp Pro Leu Ala His Pro Phe Arg Phe 
                245                 250                 255     


Val Ala His Asn Gly Glu Ile Asn Thr Ile Arg Gly Asn Arg Asn Arg 
            260                 265                 270         


Met Arg Ala Arg Glu Ala Leu Leu Glu Ser Gly Leu Ile Pro Gly Asp 
        275                 280                 285             


Leu Thr Arg Leu Tyr Pro Ile Cys Ser Pro Glu Ala Ser Asp Ser Ala 
    290                 295                 300                 


Ser Phe Asp Glu Val Leu Glu Leu Leu His Leu Gly Gly Arg Ser Leu 
305                 310                 315                 320 


Pro His Ala Val Leu Met Met Ile Pro Glu Ala Trp Glu Asn His Ser 
                325                 330                 335     


Thr Met Asp Ala Gln Arg Arg Ala Phe Tyr Gln Phe His Ala Ser Leu 
            340                 345                 350         


Met Glu Pro Trp Asp Gly Pro Ala Cys Val Thr Phe Thr Asp Gly Thr 
        355                 360                 365             


Leu Val Gly Ala Val Leu Asp Arg Asn Gly Leu Arg Pro Cys Arg Trp 
    370                 375                 380                 


Trp Arg Thr Ala Asp Asp Arg Val Val Leu Ala Ser Glu Ala Gly Val 
385                 390                 395                 400 


Leu Asp Val Pro Pro Asp Gln Val Val Ala Lys Gly Arg Leu Lys Pro 
                405                 410                 415     


Gly Arg Met Phe Leu Val Asp Thr Glu Ala Gly Arg Ile Val Ala Asp 
            420                 425                 430         


Asp Glu Val Lys Ser Glu Leu Ala Lys Gln His Pro Tyr Glu Glu Trp 
        435                 440                 445             


Leu His Ala Gly Leu Leu Gln Leu Ala Asp Leu Gln Asp Arg Asp His 
    450                 455                 460                 


Val Thr Gln Ser His Asp Ser Val Leu Arg Arg Gln Leu Ala Phe Gly 
465                 470                 475                 480 


Tyr Ser Glu Glu Glu Leu Lys Ile Leu Leu Ala Pro Met Ala Glu Lys 
                485                 490                 495     


Gly Ala Glu Pro Leu Gly Ser Met Gly Thr Asp Thr Pro Pro Ala Val 
            500                 505                 510         


Leu Ser Lys Arg Ser Arg Gln Leu Tyr Asp Tyr Phe Lys Gln Ala Phe 
        515                 520                 525             


Ala Gln Val Thr Asn Pro Pro Leu Asp Ala Ile Arg Glu Glu Leu Val 
    530                 535                 540                 


Thr Ser Met Ser Arg Ile Met Gly Pro Glu Arg Asn Leu Leu Asp Pro 
545                 550                 555                 560 


Gly Pro Ala Ser Cys Arg His Ile Gln Leu Pro Tyr Pro Val Ile Asp 
                565                 570                 575     


Asn Asp Glu Leu Ala Lys Leu Ile His Ile Asn Asp Asp Gly Asp Leu 
            580                 585                 590         


Pro Gly Phe Ala Cys Thr Val Leu Ser Gly Leu Phe Glu Val Asp Gly 
        595                 600                 605             


Gly Gly Lys Ala Leu Ala Glu Ala Ile Glu Arg Val Arg Arg Glu Ala 
    610                 615                 620                 


Ser Glu Ala Ile Ala Ala Gly Ala Arg Thr Leu Val Leu Ser Asp Arg 
625                 630                 635                 640 


Asp Ser Asp His Lys Met Ala Pro Ile Pro Ser Leu Leu Leu Val Ser 
                645                 650                 655     


Ala Val His His His Leu Val Arg Thr Lys Glu Arg Leu Arg Val Ala 
            660                 665                 670         


Leu Val Val Glu Thr Gly Asp Ala Arg Glu Val His His Ile Ala Leu 
        675                 680                 685             


Leu Leu Gly Tyr Gly Ala Ala Ala Val Asn Pro Tyr Leu Ala Phe Glu 
    690                 695                 700                 


Thr Ile Glu Asp Met Ile Ala Gln Gly Ala Ile Thr Gly Ile Glu Pro 
705                 710                 715                 720 


Arg Lys Ala Val Arg Asn Tyr Val Asn Ala Leu Val Lys Gly Val Leu 
                725                 730                 735     


Lys Ile Met Ser Lys Met Gly Ile Ser Thr Val Gly Ala Tyr Thr Ala 
            740                 745                 750         


Ala Gln Val Phe Glu Ser Phe Gly Leu Ser Gln Glu Leu Leu Asp Glu 
        755                 760                 765             


Tyr Phe Thr Gly Thr Val Ser Lys Leu Gly Gly Val Gly Leu Asp Val 
    770                 775                 780                 


Leu Ala Glu Glu Val Ala Val Arg His Arg Arg Ala Tyr Pro Asp Asn 
785                 790                 795                 800 


Pro Thr Asp Arg Val His Arg Arg Leu Asp Ser Gly Gly Glu Tyr Ala 
                805                 810                 815     


Tyr Arg Arg Glu Gly Glu Leu His Leu Phe Thr Pro Glu Thr Val Phe 
            820                 825                 830         


Leu Leu Gln His Ala Ser Lys Thr Arg Arg Asp Glu Val Tyr Arg Lys 
        835                 840                 845             


Tyr Thr Glu Glu Val His Arg Leu Ser Arg Glu Gly Gly Ala Leu Arg 
    850                 855                 860                 


Gly Leu Phe Lys Phe Arg Lys Glu Gly Arg Ala Pro Val Pro Ile Asp 
865                 870                 875                 880 


Glu Val Glu Ser Val Glu Ser Ile Cys Lys Arg Phe Asn Thr Gly Ala 
                885                 890                 895     


Met Ser Tyr Gly Ser Ile Ser Ala Glu Ala His Gln Thr Leu Ala Ile 
            900                 905                 910         


Ala Met Asn Arg Ile Gly Gly Arg Ser Asn Thr Gly Glu Gly Gly Glu 
        915                 920                 925             


Asp Pro Glu Arg Leu Tyr Asp Pro Glu Arg Arg Ser Ala Ile Lys Gln 
    930                 935                 940                 


Val Ala Ser Gly Trp Phe Gly Val Thr Ser Glu Tyr Leu Val Asn Ala 
945                 950                 955                 960 


Asp Asp Ile Gln Ile Lys Met Ala Gln Gly Ala Lys Pro Gly Glu Gly 
                965                 970                 975     


Gly Gln Leu Pro Pro Asn Lys Val Tyr Pro Trp Ile Ala Arg Thr Arg 
            980                 985                 990         


His Ser Thr Pro Gly Val Gly Leu  Ile Ser Pro Pro Pro  His His Asp 
        995                 1000                 1005             


Ile Tyr  Ser Ile Glu Asp Leu  Ala Gln Leu Ile His  Asp Leu Lys 
    1010                 1015                 1020             


Asn Ala  Asn Glu Gln Ala Arg  Ile His Val Lys Leu  Val Ser Ser 
    1025                 1030                 1035             


Leu Gly  Val Gly Thr Val Ala  Ala Gly Val Ser Lys  Ala His Ala 
    1040                 1045                 1050             


Asp Val  Val Leu Ile Ser Gly  His Asp Gly Gly Thr  Gly Ala Ser 
    1055                 1060                 1065             


Pro Leu  Asn Ser Leu Lys His  Ala Gly Thr Pro Trp  Glu Ile Gly 
    1070                 1075                 1080             


Leu Ala  Glu Thr Gln Gln Thr  Leu Met Leu Asn Gly  Leu Arg Asp 
    1085                 1090                 1095             


Arg Ile  Thr Val Gln Val Asp  Gly Ala Met Lys Thr  Gly Arg Asp 
    1100                 1105                 1110             


Val Val  Val Ala Ala Leu Leu  Gly Ala Glu Glu Tyr  Gly Phe Ala 
    1115                 1120                 1125             


Thr Ala  Pro Leu Ile Val Ala  Gly Cys Ile Met Met  Arg Val Cys 
    1130                 1135                 1140             


His Leu  Asp Thr Cys Pro Val  Gly Val Ala Thr Gln  Ser Pro Glu 
    1145                 1150                 1155             


Leu Arg  Lys Arg Tyr Thr Gly  Gln Ala Glu His Val  Val Asn Tyr 
    1160                 1165                 1170             


Phe Arg  Phe Val Ala Gln Glu  Val Arg Glu Leu Leu  Ala Glu Leu 
    1175                 1180                 1185             


Gly Phe  Arg Thr Leu Asp Glu  Ala Ile Gly Arg Ala  Asp Val Leu 
    1190                 1195                 1200             


Asp Thr  Asp Asp Ala Val Asp  His Trp Lys Ala Ser  Gly Leu Asp 
    1205                 1210                 1215             


Leu Ser  Pro Ile Phe Gln Met  Pro Thr Asp Thr Pro  Tyr Gly Gly 
    1220                 1225                 1230             


Ala Arg  Arg Lys Ile Arg Glu  Gln Asp His Gly Leu  Glu His Ala 
    1235                 1240                 1245             


Leu Asp  Arg Thr Leu Ile Gln  Leu Ser Glu Ala Ala  Leu Glu Asp 
    1250                 1255                 1260             


Ala His  Pro Val Arg Leu Glu  Leu Pro Val Arg Asn  Val Asn Arg 
    1265                 1270                 1275             


Thr Val  Gly Thr Leu Leu Gly  Ser Glu Ile Thr Arg  Arg Tyr Gly 
    1280                 1285                 1290             


Gly Glu  Gly Leu Pro Asp Gly  Thr Ile His Ile Arg  Leu Thr Gly 
    1295                 1300                 1305             


Ser Ala  Gly Gln Ser Leu Gly  Ala Phe Leu Pro Arg  Gly Val Thr 
    1310                 1315                 1320             


Leu Glu  Met Val Gly Asp Ala  Asn Asp Tyr Val Gly  Lys Gly Leu 
    1325                 1330                 1335             


Ser Gly  Gly Arg Ile Ile Val  Arg Pro His Pro Asp  Ala Thr Phe 
    1340                 1345                 1350             


Ala Ala  Glu Arg Gln Val Ile  Ala Gly Asn Thr Leu  Ala Tyr Gly 
    1355                 1360                 1365             


Ala Thr  Ala Gly Glu Met Phe  Leu Arg Gly His Val  Gly Glu Arg 
    1370                 1375                 1380             


Phe Cys  Val Arg Asn Ser Gly  Ala Thr Val Val Ala  Glu Gly Val 
    1385                 1390                 1395             


Gly Asp  His Ala Phe Glu Tyr  Met Thr Gly Gly Arg  Ala Val Val 
    1400                 1405                 1410             


Leu Gly  Pro Thr Gly Arg Asn  Leu Ala Ala Gly Met  Ser Gly Gly 
    1415                 1420                 1425             


Ile Gly  Tyr Val Leu Asp Leu  Asp Gln Gly Ser Val  Asn Arg Glu 
    1430                 1435                 1440             


Met Val  Glu Leu Leu Pro Leu  Glu Pro Glu Asp Leu  Asn Trp Leu 
    1445                 1450                 1455             


Lys Asp  Ile Val Thr Arg His  His Glu Leu Thr Arg  Ser Ala Val 
    1460                 1465                 1470             


Ala Ala  Ser Leu Leu Gly Asp  Trp Pro Arg Arg Ser  Ala Ser Phe 
    1475                 1480                 1485             


Thr Lys  Val Met Pro Arg Asp  Tyr Lys Arg Val Leu  Glu Ala Thr 
    1490                 1495                 1500             


Lys Ala  Ala Lys Ala Ala Gly  Arg Asp Val Asp Glu  Ala Ile Met 
    1505                 1510                 1515             


Glu Val  Ala Ser Arg Gly 
    1520                 


<210>  3
<211>  1509
<212>  DNA
<213>  Amycolatopsis sp. ATCC 39116


<220>
<221>  misc_feature
<222>  (1)..(1509)
<223>  Nucleic Acid Sequence of gltD

<220>
<221>  misc_feature
<222>  (1507)..(1509)
<223>  stop codon

<400>  3
gtggctgacc ccaagggctt cctgaagtac gagcgggtcg agccgcccaa gcgccccaag       60

gagcaccgcg ccgaggactg gcgcgaggtc tacgtcgacc tcgaaccggc cgagcgcgac      120

cagcaggtgc gcacccaggc cacccgctgc atggactgcg gcatcccgtt ctgccactcg      180

gccggttccg gctgcccgct cggcaacctg atcccggagt ggaacgacct ggtgcgccgc      240

ggtgactgga ccgcggccag cgaccggctg cacgccacca acaacttccc ggagttcacc      300

gggaagctgt gcccggcgcc gtgcgaggcg ggctgcacgc tgtccatctc gccgctgtcc      360

ggcggcccgg tcgcgatcaa gcgcgtcgag gcgacgatcg cggagaagtc gtgggagctg      420

ggcctggccc agccgcaggt cgccgaggtg gccagcggtc agcgcgtcgc cgtggtcggg      480

tccggcccgg ccggtctcgc cgccgcccag cagctcaccc gcgccgggca cgacgtgacc      540

gtcttcgagc gggacgaccg gctcggcggg ctgctccgat acggcatccc cgagttcaag      600

atggagaaga agcacctcga caagcgcctg gcccagctca agaaggaggg cacgcagttc      660

gtcacgggct gcgaggtggg cgtcgacatc accgtcgagg agctgcgggc ccgctacgac      720

gcggtcgtgc tcgccgtcgg cgcgctgcgc ggccgcgacg acaccaccac gcccggccgg      780

gagctcaagg gcatccacct ggcgatggag cacctggtgc cggccaacaa gcagtgcgag      840

ggcgacggcc cgtcgccggt ccacgcgcac ggcaagcacg tggtgatcat cggcggtggt      900

gacaccggcg ccgactccta cggcaccgcg atccgccagg gcgcggcctc ggtggtccag      960

ctggaccagt acccgatgcc gccgacgacc cgcgacgacg agcggtcgcc gtggccgacc     1020

tggccgtacg tgctgcgcac ctacccggcg cacgaggagg gcggcgagcg gaagttcggt     1080

gtcgccgtgc ggcggttcgt gggcgacgag aacgggcacg tccgcgcgat cgagctgcag     1140

caggtcaagg tcgtcaagga cccggagacc gggcgccgcg aggtgctgcc ggtgtcggac     1200

gagatcgagg agatcccggc cgacctggtg ctcttcgcca tcgggttcga gggcgtggag     1260

cacatgcggc tgctcgacga cctgggcatc cggctgaccc ggcgcggcac catctcgtgc     1320

ggcccggact ggcagaccga ggccccgggc gtgttcgtct gcggtgacgc ccaccgcggc     1380

gcgtcgctgg tcgtgtgggc gatcgcggag ggccgctcgg tggccaacgc cgtcgacgcc     1440

tacctgaccg gcgcgtcgga cctgccggcc ccggtgcatc cgacggctct gccgctcgct     1500

gtggtgtaa                                                             1509


<210>  4
<211>  502
<212>  PRT
<213>  Amycolatopsis sp. ATCC 39116


<220>
<221>  MISC_FEATURE
<222>  (1)..(502)
<223>  Amino Acid Sequence of GltD

<400>  4

Met Ala Asp Pro Lys Gly Phe Leu Lys Tyr Glu Arg Val Glu Pro Pro 
1               5                   10                  15      


Lys Arg Pro Lys Glu His Arg Ala Glu Asp Trp Arg Glu Val Tyr Val 
            20                  25                  30          


Asp Leu Glu Pro Ala Glu Arg Asp Gln Gln Val Arg Thr Gln Ala Thr 
        35                  40                  45              


Arg Cys Met Asp Cys Gly Ile Pro Phe Cys His Ser Ala Gly Ser Gly 
    50                  55                  60                  


Cys Pro Leu Gly Asn Leu Ile Pro Glu Trp Asn Asp Leu Val Arg Arg 
65                  70                  75                  80  


Gly Asp Trp Thr Ala Ala Ser Asp Arg Leu His Ala Thr Asn Asn Phe 
                85                  90                  95      


Pro Glu Phe Thr Gly Lys Leu Cys Pro Ala Pro Cys Glu Ala Gly Cys 
            100                 105                 110         


Thr Leu Ser Ile Ser Pro Leu Ser Gly Gly Pro Val Ala Ile Lys Arg 
        115                 120                 125             


Val Glu Ala Thr Ile Ala Glu Lys Ser Trp Glu Leu Gly Leu Ala Gln 
    130                 135                 140                 


Pro Gln Val Ala Glu Val Ala Ser Gly Gln Arg Val Ala Val Val Gly 
145                 150                 155                 160 


Ser Gly Pro Ala Gly Leu Ala Ala Ala Gln Gln Leu Thr Arg Ala Gly 
                165                 170                 175     


His Asp Val Thr Val Phe Glu Arg Asp Asp Arg Leu Gly Gly Leu Leu 
            180                 185                 190         


Arg Tyr Gly Ile Pro Glu Phe Lys Met Glu Lys Lys His Leu Asp Lys 
        195                 200                 205             


Arg Leu Ala Gln Leu Lys Lys Glu Gly Thr Gln Phe Val Thr Gly Cys 
    210                 215                 220                 


Glu Val Gly Val Asp Ile Thr Val Glu Glu Leu Arg Ala Arg Tyr Asp 
225                 230                 235                 240 


Ala Val Val Leu Ala Val Gly Ala Leu Arg Gly Arg Asp Asp Thr Thr 
                245                 250                 255     


Thr Pro Gly Arg Glu Leu Lys Gly Ile His Leu Ala Met Glu His Leu 
            260                 265                 270         


Val Pro Ala Asn Lys Gln Cys Glu Gly Asp Gly Pro Ser Pro Val His 
        275                 280                 285             


Ala His Gly Lys His Val Val Ile Ile Gly Gly Gly Asp Thr Gly Ala 
    290                 295                 300                 


Asp Ser Tyr Gly Thr Ala Ile Arg Gln Gly Ala Ala Ser Val Val Gln 
305                 310                 315                 320 


Leu Asp Gln Tyr Pro Met Pro Pro Thr Thr Arg Asp Asp Glu Arg Ser 
                325                 330                 335     


Pro Trp Pro Thr Trp Pro Tyr Val Leu Arg Thr Tyr Pro Ala His Glu 
            340                 345                 350         


Glu Gly Gly Glu Arg Lys Phe Gly Val Ala Val Arg Arg Phe Val Gly 
        355                 360                 365             


Asp Glu Asn Gly His Val Arg Ala Ile Glu Leu Gln Gln Val Lys Val 
    370                 375                 380                 


Val Lys Asp Pro Glu Thr Gly Arg Arg Glu Val Leu Pro Val Ser Asp 
385                 390                 395                 400 


Glu Ile Glu Glu Ile Pro Ala Asp Leu Val Leu Phe Ala Ile Gly Phe 
                405                 410                 415     


Glu Gly Val Glu His Met Arg Leu Leu Asp Asp Leu Gly Ile Arg Leu 
            420                 425                 430         


Thr Arg Arg Gly Thr Ile Ser Cys Gly Pro Asp Trp Gln Thr Glu Ala 
        435                 440                 445             


Pro Gly Val Phe Val Cys Gly Asp Ala His Arg Gly Ala Ser Leu Val 
    450                 455                 460                 


Val Trp Ala Ile Ala Glu Gly Arg Ser Val Ala Asn Ala Val Asp Ala 
465                 470                 475                 480 


Tyr Leu Thr Gly Ala Ser Asp Leu Pro Ala Pro Val His Pro Thr Ala 
                485                 490                 495     


Leu Pro Leu Ala Val Val 
            500         


<210>  5
<211>  483
<212>  DNA
<213>  Amycolatopsis sp. ATCC 39116


<220>
<221>  misc_feature
<222>  (1)..(483)
<223>  Nucleic Acid Sequence of echR (repressor of ech-fcs operon)

<220>
<221>  misc_feature
<222>  (481)..(483)
<223>  stop codon

<400>  5
gtggtgaccg aatcccgcgc cgaggacgcc ccgctgaccc tctacctggt caagcggctg       60

gagctggtga tccgctcgct gatggacgac gcgctgcgcc cgttcgggct gaccaccctg      120

cagtacaccg cgctgaccgc gctgcggcac cgcaacgggc tgtcgtccgc gcagctcgcg      180

cgccgctcgt tcgtccggcc ccagaccatg cacaccatgg tgctcacgct ggagaagtac      240

gggctcatcg agcgcgcgga ggacccggcc aaccgccggg tcctgctcgc caccctcacc      300

gagcgcggca agcaggtcct cgacgagtgc acgccgctgg tccgggagct cgaagaccgg      360

atgctctccg gcatggacga cgaccgccgc gccgggttcc gccgggacct ggaggacggc      420

tacggcatgc tcgcctcgca cgccaacgct cagcgcgcgt tgacgaacgg cggcggcgag      480

taa                                                                    483


<210>  6
<211>  160
<212>  PRT
<213>  Amycolatopsis sp. ATCC 39116


<220>
<221>  MISC_FEATURE
<222>  (1)..(160)
<223>  Amino Acid Sequence of EchR (repressor of ech-fcs operon)

<400>  6

Met Val Thr Glu Ser Arg Ala Glu Asp Ala Pro Leu Thr Leu Tyr Leu 
1               5                   10                  15      


Val Lys Arg Leu Glu Leu Val Ile Arg Ser Leu Met Asp Asp Ala Leu 
            20                  25                  30          


Arg Pro Phe Gly Leu Thr Thr Leu Gln Tyr Thr Ala Leu Thr Ala Leu 
        35                  40                  45              


Arg His Arg Asn Gly Leu Ser Ser Ala Gln Leu Ala Arg Arg Ser Phe 
    50                  55                  60                  


Val Arg Pro Gln Thr Met His Thr Met Val Leu Thr Leu Glu Lys Tyr 
65                  70                  75                  80  


Gly Leu Ile Glu Arg Ala Glu Asp Pro Ala Asn Arg Arg Val Leu Leu 
                85                  90                  95      


Ala Thr Leu Thr Glu Arg Gly Lys Gln Val Leu Asp Glu Cys Thr Pro 
            100                 105                 110         


Leu Val Arg Glu Leu Glu Asp Arg Met Leu Ser Gly Met Asp Asp Asp 
        115                 120                 125             


Arg Arg Ala Gly Phe Arg Arg Asp Leu Glu Asp Gly Tyr Gly Met Leu 
    130                 135                 140                 


Ala Ser His Ala Asn Ala Gln Arg Ala Leu Thr Asn Gly Gly Gly Glu 
145                 150                 155                 160 


