                         SEQUENCE LISTING

<110>  INVISTA North America S.A.R.L.
       Combe, Jonathan
       Foster, Alexander
       Barman, Arghya
 
<120>  Methods and Materials for the Biosynthesis of Hydroxy Fatty Acid 
       Anions and/or Derivatives Thereof and/or Compounds Related 
       Thereto

<130>  INV0153WO

<150>  US 62/624,910
<151>  2018-02-01

<160>  26    

<170>  PatentIn version 3.5

<210>  1
<211>  327
<212>  PRT
<213>  A. tertiaricarbonis

<400>  1

Met Thr Tyr Val Pro Ser Ser Ala Leu Leu Glu Gln Leu Arg Ala Gly 
1               5                   10                  15      


Asn Thr Trp Ala Leu Gly Arg Leu Ile Ser Arg Ala Glu Ala Gly Val 
            20                  25                  30          


Ala Glu Ala Arg Pro Ala Leu Ala Glu Val Tyr Arg His Ala Gly Ser 
        35                  40                  45              


Ala His Val Ile Gly Leu Thr Gly Val Pro Gly Ser Gly Lys Ser Thr 
    50                  55                  60                  


Leu Val Ala Lys Leu Thr Ala Ala Leu Arg Lys Arg Gly Glu Lys Val 
65                  70                  75                  80  


Gly Ile Val Ala Ile Asp Pro Ser Ser Pro Tyr Ser Gly Gly Ala Ile 
                85                  90                  95      


Leu Gly Asp Arg Ile Arg Met Thr Glu Leu Ala Asn Asp Ser Gly Val 
            100                 105                 110         


Phe Ile Arg Ser Met Ala Thr Arg Gly Ala Thr Gly Gly Met Ala Arg 
        115                 120                 125             


Ala Ala Leu Asp Ala Val Asp Leu Leu Asp Val Ala Gly Tyr His Thr 
    130                 135                 140                 


Ile Ile Leu Glu Thr Val Gly Val Gly Gln Asp Glu Val Glu Val Ala 
145                 150                 155                 160 


His Ala Ser Asp Thr Thr Val Val Val Ser Ala Pro Gly Leu Gly Asp 
                165                 170                 175     


Glu Ile Gln Ala Ile Lys Ala Gly Val Leu Glu Ile Ala Asp Ile His 
            180                 185                 190         


Val Val Ser Lys Cys Asp Arg Asp Asp Ala Asn Arg Thr Leu Thr Asp 
        195                 200                 205             


Leu Lys Gln Met Leu Thr Leu Gly Thr Met Val Gly Pro Lys Arg Ala 
    210                 215                 220                 


Trp Ala Ile Pro Val Val Gly Val Ser Ser Tyr Thr Gly Glu Gly Val 
225                 230                 235                 240 


Asp Asp Leu Leu Gly Arg Ile Ala Ala His Arg Gln Ala Thr Ala Asp 
                245                 250                 255     


Thr Glu Leu Gly Arg Glu Arg Arg Arg Arg Val Ala Glu Phe Arg Leu 
            260                 265                 270         


Gln Lys Thr Ala Glu Thr Leu Leu Leu Glu Arg Phe Thr Thr Gly Ala 
        275                 280                 285             


Gln Pro Phe Ser Pro Ala Leu Ala Asp Ser Leu Ser Asn Arg Ala Ser 
    290                 295                 300                 


Asp Pro Tyr Ala Ala Ala Arg Glu Leu Ile Ala Arg Thr Ile Arg Lys 
305                 310                 315                 320 


Glu Tyr Ser Asn Asp Leu Ala 
                325         


<210>  2
<211>  984
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  2
atgacctatg tgccctcctc ggcgctgctc gaacagctcc gcgccggtaa cacctgggcc       60

ctgggccgcc tgatctcccg cgcggaagcg ggcgtggcgg aagcccgccc cgccctggcc      120

gaggtgtacc ggcatgccgg cagcgcccac gtgatcggtc tgaccggcgt cccgggctcg      180

ggcaagtcca ccctggtcgc caagctcacc gccgccctgc gcaagcgcgg cgaaaaggtc      240

ggcatcgtcg cgatcgatcc gtcgtcgccg tacagcggcg gcgcgatcct gggcgaccgc      300

attcgcatga cggaactggc caacgatagc ggcgtgttca tccgcagcat ggccacccgc      360

ggcgccacgg gcggcatggc ccgggccgcc ctggacgcgg tggacctcct ggacgtggcg      420

ggctaccaca ccatcatcct ggaaaccgtg ggcgtcggcc aagacgaagt cgaagtggcc      480

catgccagcg acacgaccgt ggtcgtgtcg gcccccggcc tgggcgacga gatccaggcg      540

atcaaggccg gcgtgctgga aatcgcggat atccatgtgg tgtcgaagtg cgatcgtgac      600

gacgcgaatc gcaccctcac ggacctgaag caaatgctca ccctgggtac gatggtgggc      660

ccgaagcgcg cgtgggccat cccggtcgtg ggcgtcagct cctacaccgg cgagggcgtg      720

gacgacctcc tgggccgcat tgcggcccac cggcaggcca ccgcggacac ggaactgggc      780

cgcgagcgcc gtcgccgggt cgcggagttc cgcctgcaaa agacggccga aaccctgctg      840

ctggagcgtt tcaccaccgg cgcccagccg ttttcccccg cgctggccga tagcctgagc      900

aaccgcgcgt cggaccccta cgccgccgcc cgcgagctga tcgcgcgcac catccgcaag      960

gaatatagca acgacctggc ctga                                             984


<210>  3
<211>  562
<212>  PRT
<213>  A. tertiaricarbonis

<400>  3

Met Thr Trp Leu Glu Pro Gln Ile Lys Ser Gln Leu Gln Ser Glu Arg 
1               5                   10                  15      


Lys Asp Trp Glu Ala Asn Glu Val Gly Ala Phe Leu Lys Lys Ala Pro 
            20                  25                  30          


Glu Arg Lys Glu Gln Phe His Thr Ile Gly Asp Phe Pro Val Gln Arg 
        35                  40                  45              


Thr Tyr Thr Ala Ala Asp Ile Ala Asp Thr Pro Leu Glu Asp Ile Gly 
    50                  55                  60                  


Leu Pro Gly Arg Tyr Pro Phe Thr Arg Gly Pro Tyr Pro Thr Met Tyr 
65                  70                  75                  80  


Arg Ser Arg Thr Trp Thr Met Arg Gln Ile Ala Gly Phe Gly Thr Gly 
                85                  90                  95      


Glu Asp Thr Asn Lys Arg Phe Lys Tyr Leu Ile Ala Gln Gly Gln Thr 
            100                 105                 110         


Gly Ile Ser Thr Asp Phe Asp Met Pro Thr Leu Met Gly Tyr Asp Ser 
        115                 120                 125             


Asp His Pro Met Ser Asp Gly Glu Val Gly Arg Glu Gly Val Ala Ile 
    130                 135                 140                 


Asp Thr Leu Ala Asp Met Glu Ala Leu Leu Ala Asp Ile Asp Leu Glu 
145                 150                 155                 160 


Lys Ile Ser Val Ser Phe Thr Ile Asn Pro Ser Ala Trp Ile Leu Leu 
                165                 170                 175     


Ala Met Tyr Val Ala Leu Gly Glu Lys Arg Gly Tyr Asp Leu Asn Lys 
            180                 185                 190         


Leu Ser Gly Thr Val Gln Ala Asp Ile Leu Lys Glu Tyr Met Ala Gln 
        195                 200                 205             


Lys Glu Tyr Ile Tyr Pro Ile Ala Pro Ser Val Arg Ile Val Arg Asp 
    210                 215                 220                 


Ile Ile Thr Tyr Ser Ala Lys Asn Leu Lys Arg Tyr Asn Pro Ile Asn 
225                 230                 235                 240 


Ile Ser Gly Tyr His Ile Ser Glu Ala Gly Ser Ser Pro Leu Gln Glu 
                245                 250                 255     


Ala Ala Phe Thr Leu Ala Asn Leu Ile Thr Tyr Val Asn Glu Val Thr 
            260                 265                 270         


Lys Thr Gly Met His Val Asp Glu Phe Ala Pro Arg Leu Ala Phe Phe 
        275                 280                 285             


Phe Val Ser Gln Gly Asp Phe Phe Glu Glu Val Ala Lys Phe Arg Ala 
    290                 295                 300                 


Leu Arg Arg Cys Tyr Ala Lys Ile Met Lys Glu Arg Phe Gly Ala Arg 
305                 310                 315                 320 


Asn Pro Glu Ser Met Arg Leu Arg Phe His Cys Gln Thr Ala Ala Ala 
                325                 330                 335     


Thr Leu Thr Lys Pro Gln Tyr Met Val Asn Val Val Arg Thr Ser Leu 
            340                 345                 350         


Gln Ala Leu Ser Ala Val Leu Gly Gly Ala Gln Ser Leu His Thr Asn 
        355                 360                 365             


Gly Tyr Asp Glu Ala Phe Ala Ile Pro Thr Glu Asp Ala Met Lys Met 
    370                 375                 380                 


Ala Leu Arg Thr Gln Gln Ile Ile Ala Glu Glu Ser Gly Val Ala Asp 
385                 390                 395                 400 


Val Ile Asp Pro Leu Gly Gly Ser Tyr Tyr Val Glu Ala Leu Thr Thr 
                405                 410                 415     


Glu Tyr Glu Lys Lys Ile Phe Glu Ile Leu Glu Glu Val Glu Lys Arg 
            420                 425                 430         


Gly Gly Thr Ile Lys Leu Ile Glu Gln Gly Trp Phe Gln Lys Gln Ile 
        435                 440                 445             


Ala Asp Phe Ala Tyr Glu Thr Ala Leu Arg Lys Gln Ser Gly Gln Lys 
    450                 455                 460                 


Pro Val Ile Gly Val Asn Arg Phe Val Glu Asn Glu Glu Asp Val Lys 
465                 470                 475                 480 


Ile Glu Ile His Pro Tyr Asp Asn Thr Thr Ala Glu Arg Gln Ile Ser 
                485                 490                 495     


Arg Thr Arg Arg Val Arg Ala Glu Arg Asp Glu Ala Lys Val Gln Ala 
            500                 505                 510         


Met Leu Asp Gln Leu Val Ala Val Ala Lys Asp Glu Ser Gln Asn Leu 
        515                 520                 525             


Met Pro Leu Thr Ile Glu Leu Val Lys Ala Gly Ala Thr Met Gly Asp 
    530                 535                 540                 


Ile Val Glu Lys Leu Lys Gly Ile Trp Gly Thr Tyr Arg Glu Thr Pro 
545                 550                 555                 560 


Val Phe 
        


<210>  4
<211>  1689
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  4
atgacctggc tggagccgca aatcaagagc cagctgcaat cggagcggaa ggattgggaa       60

gcgaacgaag tgggtgcctt cctcaagaag gcccccgaac gcaaggaaca gtttcacacg      120

atcggcgact tccccgtcca gcgcacctac acggccgccg atatcgccga caccccgctg      180

gaggacatcg gcctgccggg ccgctacccg ttcacgcgcg gtccgtaccc gacgatgtat      240

cgctcccgca cctggaccat gcggcagatc gccggcttcg gcacgggcga ggacaccaac      300

aagcgtttta agtatctgat cgcccagggt caaaccggca tctcgaccga cttcgacatg      360

cccaccctga tgggctatga ttcggaccac ccgatgtccg acggcgaagt gggccgcgaa      420

ggcgtcgcga tcgacacgct cgccgacatg gaagcgctgc tggccgatat cgacctggaa      480

aagatcagcg tgagcttcac catcaacccc tcagcctgga ttctgctggc gatgtacgtc      540

gccctcggcg aaaagcgcgg ctacgacctg aacaagctga gcggcaccgt gcaagccgac      600

atcctgaagg agtacatggc gcaaaaggaa tatatctacc ccatcgcccc gtcggtccgc      660

atcgtgcgtg acatcattac gtattcggcg aagaacctca agcgctataa ccccatcaac      720

attagcggct accatatctc ggaggcgggc agcagccccc tccaggaggc cgcgttcacg      780

ctcgcgaacc tgattacgta cgtgaacgag gtcaccaaga ccggcatgca tgtcgatgag      840

tttgcgccgc ggctggcgtt cttcttcgtg tcgcagggcg acttctttga agaagtcgcg      900

aagtttcgcg cgctccgccg ctgctacgcc aagatcatga aggaacgctt cggcgcccgc      960

aacccggaga gcatgcgcct ccggttccac tgccaaaccg ccgccgcgac gctgaccaag     1020

ccgcaataca tggtcaacgt ggtccgcacc tcgctgcagg cgctgtcggc cgtcctgggc     1080

ggtgcgcagt cgctccatac caatggctac gacgaagcgt tcgccatccc gacggaagat     1140

gccatgaaga tggccctgcg cacgcagcag atcattgccg aagaatccgg cgtcgccgac     1200

gtgatcgacc cgctgggcgg cagctattac gtcgaggcgc tgacgaccga atatgaaaag     1260

aagattttcg aaattctgga agaggtggag aagcgcggtg gcaccatcaa gctgatcgaa     1320

cagggctggt tccagaagca aatcgccgac ttcgcctacg aaacggcgct gcgcaagcag     1380

tcgggccaga agcccgtcat cggtgtgaac cgcttcgtgg aaaacgagga ggacgtgaag     1440

atcgagatcc acccgtacga taacaccacg gccgaacgcc agatcagccg cacccggcgg     1500

gtgcgtgccg agcgcgacga ggccaaggtc caggcgatgc tggaccagct cgtcgcggtg     1560

gcgaaggatg agagccagaa cctgatgccg ctgaccatcg agctcgtgaa ggccggcgcc     1620

acgatgggcg acatcgtgga aaagctcaag ggcatctggg gcacgtaccg cgaaacgccg     1680

gtgttctga                                                             1689


<210>  5
<211>  136
<212>  PRT
<213>  A. tertiaricarbonis

<400>  5

Met Asp Gln Thr Pro Ile Arg Val Leu Leu Ala Lys Val Gly Leu Asp 
1               5                   10                  15      


Gly His Asp Arg Gly Val Lys Val Val Ala Arg Ala Leu Arg Asp Ala 
            20                  25                  30          


Gly Met Asp Val Ile Tyr Ser Gly Leu His Arg Thr Pro Glu Glu Val 
        35                  40                  45              


Val Asn Thr Ala Ile Gln Glu Asp Val Asp Val Leu Gly Val Ser Leu 
    50                  55                  60                  


Leu Ser Gly Val Gln Leu Thr Val Phe Pro Lys Ile Phe Lys Leu Leu 
65                  70                  75                  80  


Asp Glu Arg Gly Ala Gly Asp Leu Ile Val Ile Ala Gly Gly Val Met 
                85                  90                  95      


Pro Asp Glu Asp Ala Ala Ala Ile Arg Lys Leu Gly Val Arg Glu Val 
            100                 105                 110         


Leu Leu Gln Asp Thr Pro Pro Gln Ala Ile Ile Asp Ser Ile Arg Ser 
        115                 120                 125             


Leu Val Ala Ala Arg Gly Ala Arg 
    130                 135     


<210>  6
<211>  411
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  6
atggaccaga cgccgatccg cgtcctgctg gccaaggtcg gcctggacgg ccatgatcgc       60

ggcgtcaagg tcgtcgcccg cgccctgcgc gacgccggta tggacgtgat ctactccggc      120

ctgcaccgca cgcccgaaga agtcgtcaac accgcgatcc aggaagatgt ggacgtgctg      180

ggcgtgtcgc tcctgagcgg cgtgcagctg accgtgttcc cgaagatttt caagctgctc      240

gacgagcggg gcgccggcga cctgatcgtg atcgccggtg gcgtgatgcc ggacgaagat      300

gcggcggcca tccggaagct gggcgtgcgc gaggtgctgc tgcaagacac ccccccgcag      360

gcgatcatcg actcgatccg cagcctcgtg gcggcccgtg gcgcgcgctg a               411


<210>  7
<211>  312
<212>  PRT
<213>  K. tusciae

<400>  7

Met Gln Glu Leu Leu Ser Arg Phe Asp Ala Gly Asp Pro Val Ala Leu 
1               5                   10                  15      


Gly Lys Leu Leu Lys Glu Val Glu Asn Gly Thr Ser Ser Gly Lys Glu 
            20                  25                  30          


Ala Leu Arg Cys Thr Ala Ser Arg Gln Gly Arg Ala His Val Val Gly 
        35                  40                  45              


Ile Thr Gly Pro Pro Gly Ala Gly Lys Ser Thr Leu Thr Ala Lys Leu 
    50                  55                  60                  


Ser Lys Arg Trp Ala Glu Ala Gly Arg Glu Val Gly Ile Val Cys Val 
65                  70                  75                  80  


Asp Pro Thr Ser Pro Phe Ser Gly Gly Ala Leu Leu Gly Asp Arg Ile 
                85                  90                  95      


Arg Met Leu Glu Leu Ser Ser Phe Pro Asn Val Phe Ile Lys Ser Leu 
            100                 105                 110         


Ala Thr Arg Gly Ser Leu Gly Gly Met Ala Ala Ser Thr Ala Asp Ile 
        115                 120                 125             


Ile Gln Leu Met Asp Ala Tyr Gly Lys Glu Val Val Val Val Glu Thr 
    130                 135                 140                 


Val Gly Val Gly Gln Val Glu Phe Asp Val Met Asp Leu Ser Asp Thr 
145                 150                 155                 160 


Val Val Leu Val Asn Val Pro Gly Leu Gly Asp Ser Ile Gln Ala Leu 
                165                 170                 175     


Lys Ala Gly Ile Leu Glu Ile Ala Asp Ile Phe Val Ile Asn Gln Ala 
            180                 185                 190         


Asp Arg Pro Gly Ala Glu Asp Ser Val Arg Asp Leu Arg Gln Met Leu 
        195                 200                 205             


Ala Asp Arg Lys Glu Thr Gly Trp Leu Trp Pro Val Val Lys Thr Val 
    210                 215                 220                 


Ala Thr Arg Gly Glu Gly Ile Asp Arg Leu Ala Glu Ala Ile Glu Ser 
225                 230                 235                 240 


His Arg Ala Tyr Leu Lys Arg Glu Gln Leu Trp Glu Glu Lys Arg Cys 
                245                 250                 255     


Arg Arg Asn Arg Gln Arg Leu Met Gln Glu Met Asp Arg Leu Phe Arg 
            260                 265                 270         


Gln His Val Leu Thr Arg Ile Arg Thr Asp Pro Thr Ala Arg Ala Leu 
        275                 280                 285             


Phe Glu Glu Val Glu Lys Gly Thr Gln Asp Pro Tyr Ser Ala Ala Arg 
    290                 295                 300                 


His Leu Phe Gln Glu Ile Val Asn 
305                 310         


<210>  8
<211>  939
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  8
atgcaggaac tgctgtcgcg cttcgacgcg ggcgaccccg tcgccctggg caagctgctg       60

aaggaagtgg agaacggcac gtccagcggc aaggaagcgc tgcgctgcac ggccagccgt      120

cagggccggg cgcacgtcgt cggcatcacc ggccccccgg gtgcgggcaa gtccacgctg      180

acggccaagc tgtcgaagcg ctgggccgaa gcgggccgcg aggtcggcat cgtctgcgtc      240

gatccgacga gccccttctc cggcggcgcg ctgctgggtg accggattcg catgctggag      300

ctgagctcct tcccgaatgt gttcatcaag tcgctcgcca cccgcggctc gctgggcggt      360

atggccgcct cgacggcgga catcatccag ctgatggacg cgtacggtaa ggaagtcgtg      420

gtggtggaaa ccgtgggtgt gggccaagtg gagtttgacg tgatggacct gtcggatacg      480

gtcgtgctcg tcaacgtgcc gggcctcggc gattcgatcc aggcgctgaa ggccggcatc      540

ctggagattg ccgatatctt cgtgatcaac caggccgacc gcccgggcgc cgaggactcg      600

gtgcgggatc tgcgccagat gctggcggat cgcaaggaaa ccggctggct gtggccggtc      660

gtcaagaccg tggcgacgcg cggcgaaggc atcgatcgtc tggccgaagc gatcgagtcc      720

catcgcgcct acctgaagcg cgagcagctg tgggaagaga agcgctgccg gcgcaaccgg      780

caacgcctga tgcaggaaat ggaccgcctg ttccgccaac acgtgctcac ccgcatccgc      840

acggacccca ccgcccgtgc cctgttcgaa gaagtggaaa agggcaccca ggacccgtat      900

agcgccgccc gccatctctt ccaggaaatc gtgaattga                             939


<210>  9
<211>  563
<212>  PRT
<213>  K. tusciae

<400>  9

Met Ala Asp Gln Glu Lys Leu Phe Asn Gly Asp Glu Ile Arg Arg Ile 
1               5                   10                  15      


Arg Gln Glu Lys Glu Arg Trp Tyr Arg Glu Thr Val Lys Gly Asn Asp 
            20                  25                  30          


Gly Gly Asn Asp Tyr Val Thr Asp Ser Gly Ile Pro Val Asn Leu Ile 
        35                  40                  45              


Tyr Gly Pro Asp Asp Ile Ala Asp Phe Asp Tyr Leu Lys Glu Ser Gly 
    50                  55                  60                  


Phe Ser Gly Glu Pro Pro Tyr Val Arg Gly Val Tyr Pro Asn Met Tyr 
65                  70                  75                  80  


Arg Gly Arg Leu Phe Thr Ile Arg Gln Ile Ala Gly Phe Gly Thr Pro 
                85                  90                  95      


Glu Asp Thr Asn Arg Arg Phe Lys Phe Leu Leu Glu Asn Gly Ala Thr 
            100                 105                 110         


Gly Thr Ser Val Val Leu Asp Leu Pro Thr Ile Arg Gly Tyr Asp Ser 
        115                 120                 125             


Asp Asp Pro Lys Ala Glu Gly His Val Gly Ala Ala Gly Val Ala Ile 
    130                 135                 140                 


Asp Ser Leu Glu Asp Met Glu Ala Leu Tyr Asp Gly Ile Pro Ile Asp 
145                 150                 155                 160 


Gln Val Ser Ser Asn Ile Val Thr His Leu Pro Ser Thr Thr Val Val 
                165                 170                 175     


Leu Met Ala Met Phe Val Ala Met Ala Glu Lys Arg Gly Leu Pro Leu 
            180                 185                 190         


Glu Lys Leu Ser Gly Thr Asn Gln Asn Asp Phe Leu Met Glu Thr Thr 
        195                 200                 205             


Ile Gly Ser Ser Leu Glu Ile Leu Pro Pro Lys Ala Ser Phe Arg Leu 
    210                 215                 220                 


Gln Cys Asp Ser Ile Glu Tyr Ala Ser Lys Arg Leu Pro Arg Trp Asn 
225                 230                 235                 240 


Pro Val Ser Tyr Asn Gly Tyr Asn Leu Arg Glu Ala Gly Thr Thr Ala 
                245                 250                 255     


Val Gln Glu Val Gly Cys Ala Ile Ala Asn Ala Ile Ala Thr Thr Glu 
            260                 265                 270         


Glu Leu Ile Arg Arg Gly Asn Asp Val Asp Asp Phe Ala Lys Arg Leu 
        275                 280                 285             


Ser Phe Phe Trp Asn Leu Phe Asn Asp Phe Phe Glu Glu Ile Ala Lys 
    290                 295                 300                 


Cys Arg Ala Ser Arg Leu Val Trp Tyr Asp Val Met Lys Asn Arg Phe 
305                 310                 315                 320 


Gly Ala Lys Asn Pro Arg Ser Tyr Leu Met Arg Phe His Val Gln Thr 
                325                 330                 335     


Gly Gly Ile Thr Leu Thr Lys Val Glu Pro Leu Asn Asn Ile Ala Arg 
            340                 345                 350         


Ser Ala Ile Gln Gly Leu Ala Ala Val Leu Gly Gly Ala Gln Ser Leu 
        355                 360                 365             


His Ile Asp Ser Tyr Asp Glu Ala Tyr Ser Ala Pro Thr Glu Gln Ala 
    370                 375                 380                 


Ala Leu Val Ser Leu Arg Thr Gln Gln Ile Ile Gln Val Glu Thr Gly 
385                 390                 395                 400 


Val Val Asn Thr Val Asp Pro Leu Ala Gly Ser Tyr Tyr Val Glu Tyr 
                405                 410                 415     


Leu Thr Arg Glu Met Ala Glu His Ile Arg Ala Tyr Ile Asp Gln Ile 
            420                 425                 430         


Glu Ser Arg Gly Gly Ile Ile Ala Val Val Glu Ser Gly Trp Leu His 
        435                 440                 445             


Arg Glu Ile Ala Glu Phe Ala Tyr Arg Thr Gln Gln Asp Ile Glu Thr 
    450                 455                 460                 


Gly Lys Arg Lys Val Val Gly Leu Asn Tyr Phe Pro Ser Lys Glu Ala 
465                 470                 475                 480 


Glu Thr Lys Val Glu Val Phe Arg Tyr Pro Glu Asp Ala Glu Arg Met 
                485                 490                 495     


Gln Lys Glu Lys Leu Ala Lys Leu Arg Ala Arg Arg Asp Pro Val Lys 
            500                 505                 510         


Val Glu Gln Thr Leu Arg Val Leu Arg Glu Lys Cys His Glu Asp Val 
        515                 520                 525             


Asn Ile Leu Pro Tyr Val Lys Asp Ala Val Glu Ala Tyr Cys Thr Leu 
    530                 535                 540                 


Gly Glu Ile Gln Asn Val Phe Arg Glu Glu Phe Gly Leu Trp Gln Phe 
545                 550                 555                 560 


Pro Leu Val 
            


<210>  10
<211>  1692
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  10
atggcggacc aagaaaagct gttcaatggc gacgagattc ggcgcatccg gcaggaaaag       60

gaacggtggt atcgcgaaac cgtcaagggc aatgacggtg gcaatgacta cgtgaccgac      120

tcgggtattc cggtcaacct catctacggc ccggacgata tcgccgattt tgactatctg      180

aaggaaagcg gcttctccgg tgaaccgccg tacgtgcgcg gcgtctaccc caatatgtac      240

cgcggccgtc tgttcaccat ccggcagatc gccggcttcg gcacgcccga ggacacgaat      300

cggcgcttca agtttctcct ggagaacggc gcgaccggca ccagcgtggt gctcgacctc      360

ccgaccatcc ggggctatga ctcggacgat ccgaaggccg agggccacgt gggcgccgcc      420

ggcgtggcga tcgactcgct ggaggacatg gaggccctgt acgatggcat cccgatcgac      480

caggtgtcct ccaacatcgt cacgcacctc ccctcgacca cggtggtcct gatggccatg      540

ttcgtcgcaa tggcggaaaa gcgtggcctg ccgctcgaaa agctgagcgg caccaaccag      600

aacgactttc tcatggaaac caccatcggc agctccctgg agattctgcc cccgaaggcc      660

tccttccgcc tccaatgcga tagcatcgag tatgcctcga agcgcctgcc ccgctggaac      720

cccgtgagct acaacggcta taatctgcgc gaggcgggca ccacggccgt ccaagaagtc      780

ggctgcgcca tcgcgaatgc catcgccacg acggaggaac tgatccgccg cggtaacgat      840

gtggacgatt tcgccaagcg cctctccttc ttctggaatc tgtttaatga cttcttcgag      900

gaaatcgcca agtgccgggc gagccgcctg gtgtggtacg acgtgatgaa gaatcgcttc      960

ggcgccaaga acccgcgctc gtacctgatg cgctttcatg tccagacggg tggcatcacc     1020

ctgaccaagg tggagccgct caacaacatt gcgcggtcgg ccattcaggg cctggccgcg     1080

gtcctgggtg gcgcccagag cctgcatatc gattcgtatg atgaagcgta cagcgcgccg     1140

acggagcaag cggccctggt gtccctccgt acccagcaga tcattcaagt cgaaaccggc     1200

gtcgtgaaca cggtggaccc cctggccggc agctattacg tggagtacct gacccgcgaa     1260

atggccgagc atatccgtgc ctacatcgac caaattgaat cgcgcggtgg catcatcgcc     1320

gtggtggaga gcggttggct gcaccgcgaa atcgccgagt ttgcctatcg cacgcaacaa     1380

gacattgaaa ccggcaagcg caaggtggtg ggcctgaact acttcccgag caaggaggcc     1440

gaaaccaagg tcgaagtgtt ccgttacccg gaagatgcgg aacgcatgca aaaggaaaag     1500

ctggccaagc tgcgtgcgcg ccgcgacccg gtgaaggtcg aacagaccct ccgcgtgctg     1560

cgtgagaagt gccacgaaga tgtcaacatc ctcccctatg tgaaggacgc cgtcgaggcc     1620

tactgcaccc tgggcgagat ccaaaacgtg ttccgcgagg agttcggcct gtggcagttc     1680

cccctcgtct ga                                                         1692


<210>  11
<211>  132
<212>  PRT
<213>  K. tusciae

<400>  11

Met Glu Lys Lys Ile Lys Val Ile Met Val Lys Leu Gly Leu Asp Ile 
1               5                   10                  15      


His Trp Arg Gly Ala Leu Val Val Ser Lys Met Leu Arg Asp Arg Gly 
            20                  25                  30          


Met Glu Val Val Tyr Leu Gly Asn Leu Phe Pro Glu Gln Ile Val Gln 
        35                  40                  45              


Ala Ala Val Gln Glu Gly Ala Asp Val Val Gly Leu Ser Thr Leu Gly 
    50                  55                  60                  


Gly Asn His Leu Thr Leu Gly Pro Lys Val Val Glu Leu Leu Arg Ala 
65                  70                  75                  80  


Lys Gly Met Glu Glu Val Leu Val Ile Met Gly Gly Val Ile Pro Glu 
                85                  90                  95      


Glu Asp Val Pro Ala Leu Lys Glu Ala Gly Ile Ala Glu Val Phe Gly 
            100                 105                 110         


Pro Glu Thr Pro Ile Asp Ala Ile Glu Ser Phe Ile Arg Ser Arg Phe 
        115                 120                 125             


Pro Asp Arg Asp 
    130         


<210>  12
<211>  399
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  12
atggagaaga agattaaggt catcatggtg aagctgggcc tggatatcca ttggcgcggt       60

gccctggtgg tgtcgaagat gctgcgtgac cgcggcatgg aagtggtcta cctgggcaac      120

ctgtttccgg agcagatcgt ccaagccgcg gtgcaggaag gcgcggacgt ggtcggcctg      180

tccacgctgg gcggcaacca cctcaccctg ggccccaagg tcgtggagct cctgcgcgcc      240

aagggcatgg aagaagtcct ggtgatcatg ggcggcgtga tcccggaaga agatgtgccg      300

gccctcaagg aagccggcat cgcggaagtg ttcggtcccg aaaccccgat cgacgcgatc      360

gagtcgttca tccgcagccg cttcccggac cgggactga                             399


<210>  13
<211>  317
<212>  PRT
<213>  B. massiliosenegalensis

<400>  13

Met Lys Gln Ala Ala Ser Tyr Phe Glu Lys Lys Ser Asp Val Met Leu 
1               5                   10                  15      


Gly Lys Leu Leu Lys Glu Val Glu Asn Gln Thr Pro Thr Ser Ile Glu 
            20                  25                  30          


Ile Leu Lys Glu Ser Ser Ser Arg Lys Gly Asn Ala His Ile Val Gly 
        35                  40                  45              


Ile Thr Gly Pro Pro Gly Ser Gly Lys Ser Thr Leu Val Asn Lys Leu 
    50                  55                  60                  


Cys Lys Thr Leu Ala Thr Ser Gly Leu Glu Ile Gly Ile Val Ala Val 
65                  70                  75                  80  


Asp Pro Thr Ser Pro Phe Thr Lys Gly Ala Leu Leu Gly Asp Arg Thr 
                85                  90                  95      


Arg Met Gln Glu Leu Ser Gly Leu Ser Asn Val Phe Ile Lys Ser Leu 
            100                 105                 110         


Ala Thr Arg Gly Asn Leu Gly Gly Leu Ala Pro Thr Thr Ala Asp Ile 
        115                 120                 125             


Val His Val Leu Asp Ala Tyr Gly Lys Glu Leu Ile Ile Ile Glu Thr 
    130                 135                 140                 


Val Gly Val Gly Gln Ile Glu Phe Asp Val Leu Glu Ile Ala Asp Thr 
145                 150                 155                 160 


Val Val Leu Val Asn Val Pro Gly Leu Gly Asp Ser Leu Gln Thr Leu 
                165                 170                 175     


Lys Ala Gly Ile Met Glu Ile Ala Asp Ile Tyr Val Val Asn Gln Ala 
            180                 185                 190         


Asp Arg Pro Gly Ala Asp Glu Ser Ala Arg Asp Leu Lys Leu Met Val 
        195                 200                 205             


Arg Glu Lys Met Gln Asp Asn Trp Glu Gln Pro Ile Leu Lys Thr Val 
    210                 215                 220                 


Ala Thr Asn Asn Glu Gly Ile Thr Glu Leu Ile Glu Gln Ile Gln Lys 
225                 230                 235                 240 


His Lys Asp Tyr Ile Lys Ser Ser Asn Ile Trp Asn Glu Lys Arg Lys 
                245                 250                 255     


Asn Gln Asn Leu Thr Lys Phe Asn His Leu Ile Ile Gln Thr Leu Glu 
            260                 265                 270         


Arg Glu Val Glu Lys Tyr Ile Ser Gly Lys Gln Asp Leu Gln Ile Lys 
        275                 280                 285             


Arg Gln Gln Val Lys Asp Gly Lys Leu Asp Pro Tyr Thr Leu Ser Ala 
    290                 295                 300                 


Tyr Ile Val Gly Gln Leu Ile Glu Lys His Gly Gly Met 
305                 310                 315         


<210>  14
<211>  954
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  14
atgaagcaag cggccagcta ttttgagaag aagtccgacg tgatgctggg caagctgctg       60

aaggaagtgg agaaccagac ccccacctcg attgagatcc tcaaggaaag ctcgtcgcgc      120

aagggcaacg cccacatcgt gggcatcacg ggcccgccgg gcagcggcaa gtccaccctg      180

gtgaataagc tgtgcaagac cctggccacg tcgggcctgg aaatcggcat cgtcgcggtg      240

gaccccacgt cgccgttcac caagggcgcg ctgctgggcg accgcacccg catgcaggaa      300

ctgagcggcc tctccaacgt gttcatcaag tcgctggcga cgcggggcaa cctgggcggt      360

ctggcgccca cgaccgccga catcgtgcat gtcctggatg cctacggcaa ggaactgatc      420

atcatcgaaa cggtgggcgt cggccagatc gagttcgacg tgctggagat tgccgacacc      480

gtcgtgctgg tcaatgtgcc gggtctgggc gactcgctgc agaccctgaa ggccggcatc      540

atggaaatcg cggacatcta cgtggtcaac caggccgatc gtccgggcgc cgacgagtcc      600

gcgcgcgacc tcaagctgat ggtccgcgag aagatgcagg ataactggga gcagccgatc      660

ctcaagaccg tggccaccaa caacgaaggc attaccgaac tgatcgagca gatccagaag      720

cacaaggact acatcaagtc gagcaacatc tggaacgaga agcgcaagaa ccagaacctg      780

accaagttca atcacctgat catccagacg ctggagcggg aagtcgagaa gtatatcagc      840

ggtaagcaag acctccagat caagcgccag caagtgaagg atggcaagct cgacccgtac      900

accctgagcg cctacatcgt cggccagctg atcgagaagc acggcggcat gtga            954


<210>  15
<211>  569
<212>  PRT
<213>  B. massiliosenegalensis

<400>  15

Met Thr Lys Ala Asn Val Asn Gln Glu Thr Lys Leu Phe Asn Gln Glu 
1               5                   10                  15      


Val Val Lys Glu Ile Glu Ala Gln Lys Glu Arg Trp Lys Lys Glu Thr 
            20                  25                  30          


Val Lys Gly Lys Thr Gly Asp Gly Glu Tyr Phe Ser Asp Ser Gly Ile 
        35                  40                  45              


Pro Val Asn Leu Leu Tyr Thr Pro Asp Asp Met Lys Asp Ile Asp Tyr 
    50                  55                  60                  


Met Lys Asp Ile Gly Leu Ser Gly Glu Ala Pro Tyr Val Arg Gly Val 
65                  70                  75                  80  


Tyr Pro Asn Met Tyr Arg Gly Arg Leu Phe Thr Val Arg Gln Ile Ala 
                85                  90                  95      


Gly Tyr Gly Thr Pro Glu Asp Thr Asn Asp Arg Phe Lys Phe Leu Leu 
            100                 105                 110         


Lys Asn Gly Ala Thr Gly Thr Ser Val Val Leu Asp Leu Pro Thr Ile 
        115                 120                 125             


Arg Gly Tyr Asp Ser Asp Asp Pro Glu Ala Glu Gly His Val Gly Ala 
    130                 135                 140                 


Ala Gly Val Ala Ile Asp Ser Leu Glu Asp Ile Glu Ala Leu Tyr Asp 
145                 150                 155                 160 


Gly Ile Pro Ile Asp Glu Ile Ser Ser Asn Ile Val Thr His Leu Pro 
                165                 170                 175     


Ser Thr Thr Val Val Ile Met Ala Met Phe Ala Ala Met Ala Glu Lys 
            180                 185                 190         


Lys Gly Ile Pro Phe Glu Lys Leu Ser Gly Thr Asn Gln Asn Asp Phe 
        195                 200                 205             


Leu Met Glu Thr Ala Ile Gly Ser Ser Leu Glu Val Leu Pro Pro Lys 
    210                 215                 220                 


Ala Ser Phe Arg Leu Gln Cys Asp Ala Ile Glu Phe Ala Ser Lys Asn 
225                 230                 235                 240 


Leu Pro Arg Trp Asn Pro Val Ser Tyr Asn Gly Tyr Asn Leu Arg Glu 
                245                 250                 255     


Ala Gly Thr Asp Ala Val Ala Glu Val Ala Cys Ala Leu Ala Asn Ala 
            260                 265                 270         


Ile Ala Thr Ser Glu Glu Leu Ile Arg Arg Gly Asn Lys Ile Asp Asp 
        275                 280                 285             


Phe Ala Lys Arg Leu Ser Phe Phe Trp Asn Leu Tyr Asn Asp Phe Phe 
    290                 295                 300                 


Glu Glu Ile Ala Lys Cys Arg Ala Ser Arg Val Val Tyr Gln Glu Ile 
305                 310                 315                 320 


Met Lys Glu Arg Phe His Ala Glu Glu Met Lys Ser Gln Leu Met Arg 
                325                 330                 335     


Phe His Val Gln Thr Ala Gly Ile Thr Leu Thr Lys Val Glu Pro Leu 
            340                 345                 350         


Asn Asn Ile Ala Arg Ser Ala Ile Gln Gly Leu Ala Ala Val Leu Gly 
        355                 360                 365             


Gly Ala Gln Ser Leu His Val Asp Ser Tyr Asp Glu Ala Tyr Ser Ala 
    370                 375                 380                 


Pro Thr Glu Glu Ser Ala Leu Ile Ser Ile Arg Thr Gln Gln Ile Ile 
385                 390                 395                 400 


Gln Thr Glu Thr Asn Val Val Asn Thr Val Asp Pro Leu Ala Gly Ser 
                405                 410                 415     


Tyr Phe Val Glu Tyr Leu Thr Lys Glu Met Ala Gln Arg Ile Arg Asp 
            420                 425                 430         


Tyr Ile Ser Glu Ile Glu Ser Arg Gly Gly Leu Val Ala Cys Val Asp 
        435                 440                 445             


Ser Gly Trp Leu His Arg Glu Ile Ala Asp Phe Ala Tyr Gln Thr Gln 
    450                 455                 460                 


Lys Glu Ile Glu Asn Gly Thr Arg Lys Ile Val Gly Leu Asn Tyr Phe 
465                 470                 475                 480 


Pro Ser Glu Asp His Ala Gly Gln Lys Val Glu Val Phe Arg Tyr Pro 
                485                 490                 495     


Glu Thr Ala Glu Ala Lys Gln Lys Glu Lys Leu Glu Arg Leu Arg Gln 
            500                 505                 510         


Lys Arg Asp Ala Lys Lys Val Glu Glu Lys Leu Asn Val Ile Arg Glu 
        515                 520                 525             


Met Cys His Gln Asp Val Asn Leu Met Pro Tyr Ile Lys Asp Ala Val 
    530                 535                 540                 


Leu Glu Tyr Ala Thr Leu Gly Glu Ile Glu Glu Val Phe Arg Glu Glu 
545                 550                 555                 560 


Phe Gly Leu Trp Gln Phe Pro Leu Ala 
                565                 


<210>  16
<211>  1710
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  16
atgacgaagg cgaatgtgaa ccaggaaacc aagctgttca accaagaagt ggtgaaggaa       60

atcgaggccc aaaaggaacg ctggaagaag gaaaccgtca agggcaagac cggcgacggc      120

gagtacttca gcgactcggg catcccggtg aacctcctgt acacgcccga cgatatgaaa      180

gacatcgact acatgaagga catcggcctg tcgggcgagg cgccctatgt gcgtggcgtc      240

tacccgaata tgtaccgcgg ccgtctgttc accgtgcgcc agatcgccgg ttacggcacc      300

cccgaggaca ccaacgaccg ctttaagttc ctgctgaaga acggcgccac gggcacgtcg      360

gtcgtcctcg atctgcccac gatccgtggc tacgattcgg acgacccgga agccgagggc      420

catgtgggtg ccgcgggcgt cgcgatcgac tcgctggaag atatcgaggc gctgtacgac      480

ggtatcccga tcgacgagat cagcagcaac atcgtgaccc atctcccgtc caccaccgtc      540

gtgatcatgg cgatgttcgc ggcgatggcc gagaagaagg gcatcccgtt cgagaagctg      600

agcggcacca accagaacga cttcctgatg gaaaccgcca tcggctccag cctggaagtc      660

ctgccgccga aggccagctt ccggctgcag tgcgacgcga tcgagtttgc ctcgaagaac      720

ctcccgcgct ggaaccccgt gtcgtataac ggctacaatc tgcgcgaggc cggtacggac      780

gcggtggccg aagtggcctg cgcgctggcc aatgccatcg cgacctcgga agaactgatc      840

cgccgcggca acaagatcga tgacttcgcc aagcgcctgt cgttcttctg gaacctgtat      900

aacgatttct tcgaagagat tgcgaagtgc cgcgcctcgc gcgtggtgta ccaggaaatc      960

atgaaggaac gcttccacgc cgaagagatg aagtcgcagc tcatgcgctt ccacgtgcag     1020

accgccggca tcaccctgac caaggtcgag ccgctgaaca acattgcgcg cagcgccatt     1080

cagggcctgg cggccgtcct cggcggtgcc cagagcctgc atgtggactc ctacgacgaa     1140

gcctactccg ccccgaccga ggaatccgcc ctgatttcga tccgcaccca gcagatcatc     1200

cagacggaaa ccaacgtcgt caacacggtg gacccgctgg cgggcagcta tttcgtggag     1260

tacctgacga aggaaatggc ccagcgcatc cgggactata tcagcgagat cgaatcgcgc     1320

ggcggcctgg tggcctgcgt ggactccggc tggctgcacc gggagatcgc ggatttcgcc     1380

taccaaaccc aaaaggaaat cgagaacggc acccgcaaga tcgtgggcct gaactacttc     1440

ccgagcgagg accacgcggg ccaaaaggtc gaggtgttcc ggtacccgga aacggcggaa     1500

gccaagcaga aggaaaagct ggagcgcctg cgccagaagc gggacgccaa gaaggtcgag     1560

gaaaagctga acgtcatccg cgagatgtgc caccaagacg tgaacctcat gccctatatc     1620

aaggacgccg tgctggaata cgcgacgctg ggcgagatcg aagaagtgtt ccgcgaagag     1680

ttcggcctgt ggcagtttcc gctggcgtga                                      1710


<210>  17
<211>  130
<212>  PRT
<213>  B. massiliosenegalensis

<400>  17

Met Gln Val Lys Val Val Met Ala Lys Leu Gly Leu Asp Ile His Trp 
1               5                   10                  15      


Arg Gly Ala Leu Val Val Ser Arg Met Leu Arg Asp Glu Gly Met Glu 
            20                  25                  30          


Val Val Tyr Leu Gly Asn Gln Phe Pro Glu Gln Ile Val Glu Ala Ala 
        35                  40                  45              


Ile Gln Glu Gly Ala Asp Val Ile Gly Leu Ser Thr Leu Gly Gly Asn 
    50                  55                  60                  


His Leu Thr Leu Gly Pro Lys Val Val Lys Ile Ala Arg Glu Lys Gly 
65                  70                  75                  80  


Val Glu Ser Leu Val Ile Met Gly Gly Val Ile Pro Glu Asp Asp Ile 
                85                  90                  95      


Pro Leu Leu Lys Glu Ser Gly Ile Ala Glu Val Phe Gly Pro Glu Thr 
            100                 105                 110         


Lys Val Glu Ser Ile Ala Ser Phe Ile Arg Glu His Val Gly Lys Lys 
        115                 120                 125             


Ile Gly 
    130 


<210>  18
<211>  393
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  18
atgcaagtga aggtcgtgat ggccaagctg ggcctggaca tccattggcg cggtgcgctc       60

gtggtgagcc gtatgctgcg cgacgaaggc atggaagtgg tgtacctggg caaccagttc      120

ccggagcaga tcgtcgaggc cgcgatccag gaaggcgcgg acgtgatcgg cctgtccacg      180

ctgggcggca accacctgac cctgggcccc aaggtcgtca agattgcccg cgaaaagggc      240

gtggagtcgc tggtcatcat gggcggtgtg atccccgaag atgacatccc gctgctcaag      300

gaatcgggca tcgccgaggt gttcggcccg gaaaccaagg tcgagagcat cgcctcgttc      360

atccgggagc acgtgggcaa gaagatcggc tga                                   393


<210>  19
<211>  393
<212>  PRT
<213>  C. necator

<400>  19

Met Thr Asp Val Val Ile Val Ser Ala Ala Arg Thr Ala Val Gly Lys 
1               5                   10                  15      


Phe Gly Gly Ser Leu Ala Lys Ile Pro Ala Pro Glu Leu Gly Ala Val 
            20                  25                  30          


Val Ile Lys Ala Ala Leu Glu Arg Ala Gly Val Lys Pro Glu Gln Val 
        35                  40                  45              


Ser Glu Val Ile Met Gly Gln Val Leu Thr Ala Gly Ser Gly Gln Asn 
    50                  55                  60                  


Pro Ala Arg Gln Ala Ala Ile Lys Ala Gly Leu Pro Ala Met Val Pro 
65                  70                  75                  80  


Ala Met Thr Ile Asn Lys Val Cys Gly Ser Gly Leu Lys Ala Val Met 
                85                  90                  95      


Leu Ala Ala Asn Ala Ile Met Ala Gly Asp Ala Glu Ile Val Val Ala 
            100                 105                 110         


Gly Gly Gln Glu Asn Met Ser Ala Ala Pro His Val Leu Pro Gly Ser 
        115                 120                 125             


Arg Asp Gly Phe Arg Met Gly Asp Ala Lys Leu Val Asp Thr Met Ile 
    130                 135                 140                 


Val Asp Gly Leu Trp Asp Val Tyr Asn Gln Tyr His Met Gly Ile Thr 
145                 150                 155                 160 


Ala Glu Asn Val Ala Lys Glu Tyr Gly Ile Thr Arg Glu Ala Gln Asp 
                165                 170                 175     


Glu Phe Ala Val Gly Ser Gln Asn Lys Ala Glu Ala Ala Gln Lys Ala 
            180                 185                 190         


Gly Lys Phe Asp Glu Glu Ile Val Pro Val Leu Ile Pro Gln Arg Lys 
        195                 200                 205             


Gly Asp Pro Val Ala Phe Lys Thr Asp Glu Phe Val Arg Gln Gly Ala 
    210                 215                 220                 


Thr Leu Asp Ser Met Ser Gly Leu Lys Pro Ala Phe Asp Lys Ala Gly 
225                 230                 235                 240 


Thr Val Thr Ala Ala Asn Ala Ser Gly Leu Asn Asp Gly Ala Ala Ala 
                245                 250                 255     


Val Val Val Met Ser Ala Ala Lys Ala Lys Glu Leu Gly Leu Thr Pro 
            260                 265                 270         


Leu Ala Thr Ile Lys Ser Tyr Ala Asn Ala Gly Val Asp Pro Lys Val 
        275                 280                 285             


Met Gly Met Gly Pro Val Pro Ala Ser Lys Arg Ala Leu Ser Arg Ala 
    290                 295                 300                 


Glu Trp Thr Pro Gln Asp Leu Asp Leu Met Glu Ile Asn Glu Ala Phe 
305                 310                 315                 320 


Ala Ala Gln Ala Leu Ala Val His Gln Gln Met Gly Trp Asp Thr Ser 
                325                 330                 335     


Lys Val Asn Val Asn Gly Gly Ala Ile Ala Ile Gly His Pro Ile Gly 
            340                 345                 350         


Ala Ser Gly Cys Arg Ile Leu Val Thr Leu Leu His Glu Met Lys Arg 
        355                 360                 365             


Arg Asp Ala Lys Lys Gly Leu Ala Ser Leu Cys Ile Gly Gly Gly Met 
    370                 375                 380                 


Gly Val Ala Leu Ala Val Glu Arg Lys 
385                 390             


<210>  20
<211>  1182
<212>  DNA
<213>  C. necator

<400>  20
atgactgacg ttgtcatcgt atccgccgcc cgcaccgcgg tcggcaagtt tggcggctcg       60

ctggccaaga tcccggcacc ggaactgggt gccgtggtca tcaaggccgc gctggagcgc      120

gccggcgtca agccggagca ggtgagcgaa gtcatcatgg gccaggtgct gaccgccggt      180

tcgggccaga accccgcacg ccaggccgcg atcaaggccg gcctgccggc gatggtgccg      240

gccatgacca tcaacaaggt gtgcggctcg ggcctgaagg ccgtgatgct ggccgccaac      300

gcgatcatgg cgggcgacgc cgagatcgtg gtggccggcg gccaggaaaa catgagcgcc      360

gccccgcacg tgctgccggg ctcgcgcgat ggtttccgca tgggcgatgc caagctggtc      420

gacaccatga tcgtcgacgg cctgtgggac gtgtacaacc agtaccacat gggcatcacc      480

gccgagaacg tggccaagga atacggcatc acacgcgagg cgcaggatga gttcgccgtc      540

ggctcgcaga acaaggccga agccgcgcag aaggccggca agtttgacga agagatcgtc      600

ccggtgctga tcccgcagcg caagggcgac ccggtggcct tcaagaccga cgagttcgtg      660

cgccagggcg ccacgctgga cagcatgtcc ggcctcaagc ccgccttcga caaggccggc      720

acggtgaccg cggccaacgc ctcgggcctg aacgacggcg ccgccgcggt ggtggtgatg      780

tcggcggcca aggccaagga actgggcctg accccgctgg ccacgatcaa gagctatgcc      840

aacgccggtg tcgatcccaa ggtgatgggc atgggcccgg tgccggcctc caagcgcgcc      900

ctgtcgcgcg ccgagtggac cccgcaagac ctggacctga tggagatcaa cgaggccttt      960

gccgcgcagg cgctggcggt gcaccagcag atgggctggg acacctccaa ggtcaatgtg     1020

aacggcggcg ccatcgccat cggccacccg atcggcgcgt cgggctgccg tatcctggtg     1080

acgctgctgc acgagatgaa gcgccgtgac gcgaagaagg gcctggcctc gctgtgcatc     1140

ggcggcggca tgggcgtggc gctggcagtc gagcgcaaat ga                        1182


<210>  21
<211>  284
<212>  PRT
<213>  C. necator

<400>  21

Met Ser Ile Arg Thr Val Gly Ile Val Gly Ala Gly Thr Met Gly Asn 
1               5                   10                  15      


Gly Ile Ala Gln Ala Cys Ala Val Val Gly Leu Asn Val Val Met Val 
            20                  25                  30          


Asp Ile Ser Asp Ala Ala Val Gln Lys Gly Val Ala Thr Val Ala Ser 
        35                  40                  45              


Ser Leu Asp Arg Leu Ile Lys Lys Glu Lys Leu Thr Glu Ala Asp Lys 
    50                  55                  60                  


Ala Ser Ala Leu Ala Arg Ile Lys Gly Ser Thr Ser Tyr Asp Asp Leu 
65                  70                  75                  80  


Lys Ala Thr Asp Ile Val Ile Glu Ala Ala Thr Glu Asn Tyr Asp Leu 
                85                  90                  95      


Lys Val Lys Ile Leu Lys Gln Ile Asp Gly Ile Val Gly Glu Asn Val 
            100                 105                 110         


Ile Ile Ala Ser Asn Thr Ser Ser Ile Ser Ile Thr Lys Leu Ala Ala 
        115                 120                 125             


Val Thr Ser Arg Ala Asp Arg Phe Ile Gly Met His Phe Phe Asn Pro 
    130                 135                 140                 


Val Pro Val Met Ala Leu Val Glu Leu Ile Arg Gly Leu Gln Thr Ser 
145                 150                 155                 160 


Asp Thr Thr His Ala Ala Val Glu Ala Leu Ser Lys Gln Leu Gly Lys 
                165                 170                 175     


Tyr Pro Ile Thr Val Lys Asn Ser Pro Gly Phe Val Val Asn Arg Ile 
            180                 185                 190         


Leu Cys Pro Met Ile Asn Glu Ala Phe Cys Val Leu Gly Glu Gly Leu 
        195                 200                 205             


Ala Ser Pro Glu Glu Ile Asp Glu Gly Met Lys Leu Gly Cys Asn His 
    210                 215                 220                 


Pro Ile Gly Pro Leu Ala Leu Ala Asp Met Ile Gly Leu Asp Thr Met 
225                 230                 235                 240 


Leu Ala Val Met Glu Val Leu Tyr Thr Glu Phe Ala Asp Pro Lys Tyr 
                245                 250                 255     


Arg Pro Ala Met Leu Met Arg Glu Met Val Ala Ala Gly Tyr Leu Gly 
            260                 265                 270         


Arg Lys Thr Gly Arg Gly Val Tyr Val Tyr Ser Lys 
        275                 280                 


<210>  22
<211>  855
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  22
atgagcatcc gcaccgtggg catcgtcggt gccggcacaa tgggcaacgg catcgcgcag       60

gcctgcgcgg tggtcggcct gaatgtcgtg atggtggata tcagcgacgc cgccgtgcaa      120

aagggcgtgg ccaccgtggc ctcgtccctg gaccgtctga ttaagaagga aaagctgacc      180

gaggccgaca aggcctcggc cctcgcgcgc atcaagggca gcacgtcgta tgatgacctg      240

aaggcgaccg acatcgtgat cgaagcggcc accgagaact acgacctgaa ggtgaagatc      300

ctgaagcaaa tcgacggtat cgtcggcgaa aacgtcatca tcgcctcgaa cacgagctcc      360

atcagcatca ccaagctcgc cgccgtgacc tcgcgggcgg accgcttcat cggcatgcat      420

ttcttcaacc cggtcccggt gatggccctc gtcgaactga tccgcggcct ccagacctcg      480

gatacgacgc acgcggcggt cgaagccctc agcaagcagc tgggcaagta ccccatcacc      540

gtgaagaact cgccgggctt cgtggtcaat cgcatcctct gccccatgat caacgaagcg      600

ttttgcgtcc tgggcgaggg cctggcgtcc cccgaagaga ttgacgaagg catgaagctg      660

ggctgcaacc acccgatcgg cccgctggcg ctggccgata tgatcggcct ggacacgatg      720

ctggcggtga tggaagtgct gtacaccgag ttcgcggacc ccaagtaccg tccggcgatg      780

ctgatgcgcg agatggtggc ggccggctat ctgggccgca agacgggtcg cggcgtgtat      840

gtgtacagca agtga                                                       855


<210>  23
<211>  282
<212>  PRT
<213>  Clostridium acetobutylicum

<400>  23

Met Lys Lys Val Cys Val Ile Gly Ala Gly Thr Met Gly Ser Gly Ile 
1               5                   10                  15      


Ala Gln Ala Phe Ala Ala Lys Gly Phe Glu Val Val Leu Arg Asp Ile 
            20                  25                  30          


Lys Asp Glu Phe Val Asp Arg Gly Leu Asp Phe Ile Asn Lys Asn Leu 
        35                  40                  45              


Ser Lys Leu Val Lys Lys Gly Lys Ile Glu Glu Ala Thr Lys Val Glu 
    50                  55                  60                  


Ile Leu Thr Arg Ile Ser Gly Thr Val Asp Leu Asn Met Ala Ala Asp 
65                  70                  75                  80  


Cys Asp Leu Val Ile Glu Ala Ala Val Glu Arg Met Asp Ile Lys Lys 
                85                  90                  95      


Gln Ile Phe Ala Asp Leu Asp Asn Ile Cys Lys Pro Glu Thr Ile Leu 
            100                 105                 110         


Ala Ser Asn Thr Ser Ser Leu Ser Ile Thr Glu Val Ala Ser Ala Thr 
        115                 120                 125             


Lys Arg Pro Asp Lys Val Ile Gly Met His Phe Phe Asn Pro Ala Pro 
    130                 135                 140                 


Val Met Lys Leu Val Glu Val Ile Arg Gly Ile Ala Thr Ser Gln Glu 
145                 150                 155                 160 


Thr Phe Asp Ala Val Lys Glu Thr Ser Ile Ala Ile Gly Lys Asp Pro 
                165                 170                 175     


Val Glu Val Ala Glu Ala Pro Gly Phe Val Val Asn Arg Ile Leu Ile 
            180                 185                 190         


Pro Met Ile Asn Glu Ala Val Gly Ile Leu Ala Glu Gly Ile Ala Ser 
        195                 200                 205             


Val Glu Asp Ile Asp Lys Ala Met Lys Leu Gly Ala Asn His Pro Met 
    210                 215                 220                 


Gly Pro Leu Glu Leu Gly Asp Phe Ile Gly Leu Asp Ile Cys Leu Ala 
225                 230                 235                 240 


Ile Met Asp Val Leu Tyr Ser Glu Thr Gly Asp Ser Lys Tyr Arg Pro 
                245                 250                 255     


His Thr Leu Leu Lys Lys Tyr Val Arg Ala Gly Trp Leu Gly Arg Lys 
            260                 265                 270         


Ser Gly Lys Gly Phe Tyr Asp Tyr Ser Lys 
        275                 280         


<210>  24
<211>  849
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  24
atgaagaagg tgtgcgtgat cggcgccggc acgatgggct ccggtatcgc ccaggcgttc       60

gccgcgaagg gcttcgaggt cgtcctgcgg gacatcaagg acgagttcgt ggaccgcggt      120

ctggacttta tcaataagaa cctgagcaag ctcgtgaaga agggcaagat tgaagaagcc      180

accaaggtcg agatcctgac gcgcatcagc ggcaccgtgg atctgaacat ggcggccgac      240

tgcgatctgg tgatcgaggc cgccgtcgag cgcatggata tcaagaagca aatcttcgcc      300

gacctcgaca acatctgcaa gccggaaacc atcctggcga gcaacacgtc gtcgctgtcg      360

attaccgaag tggcgagcgc caccaagcgg cccgacaagg tcatcggcat gcacttcttc      420

aacccggccc cggtgatgaa gctcgtcgag gtgatccgcg gcatcgcgac cagccaggaa      480

acgttcgacg cggtgaagga aacctccatc gcgatcggca aggacccggt cgaggtcgcc      540

gaggcccccg gctttgtggt gaaccgcatc ctgatcccca tgatcaacga agcggtgggc      600

atcctggccg agggcatcgc ctcggtcgaa gatatcgaca aggccatgaa gctgggcgcg      660

aatcacccga tgggcccgct ggagctgggc gacttcatcg gcctggacat ctgcctcgcc      720

attatggacg tgctgtactc ggaaacgggc gactcgaagt atcgcccgca taccctgctg      780

aagaagtacg tgcgcgcggg ctggctgggt cgtaagtccg gcaagggctt ctacgactac      840

agcaagtga                                                              849


<210>  25
<211>  246
<212>  PRT
<213>  C. necator

<400>  25

Met Thr Gln Arg Ile Ala Tyr Val Thr Gly Gly Met Gly Gly Ile Gly 
1               5                   10                  15      


Thr Ala Ile Cys Gln Arg Leu Ala Lys Asp Gly Phe Arg Val Val Ala 
            20                  25                  30          


Gly Cys Gly Pro Asn Ser Pro Arg Arg Glu Lys Trp Leu Glu Gln Gln 
        35                  40                  45              


Lys Ala Leu Gly Phe Asp Phe Ile Ala Ser Glu Gly Asn Val Ala Asp 
    50                  55                  60                  


Trp Asp Ser Thr Lys Thr Ala Phe Asp Lys Val Lys Ser Glu Val Gly 
65                  70                  75                  80  


Glu Val Asp Val Leu Ile Asn Asn Ala Gly Ile Thr Arg Asp Val Val 
                85                  90                  95      


Phe Arg Lys Met Thr Arg Ala Asp Trp Asp Ala Val Ile Asp Thr Asn 
            100                 105                 110         


Leu Thr Ser Leu Phe Asn Val Thr Lys Gln Val Ile Asp Gly Met Ala 
        115                 120                 125             


Asp Arg Gly Trp Gly Arg Ile Val Asn Ile Ser Ser Val Asn Gly Gln 
    130                 135                 140                 


Lys Gly Gln Phe Gly Gln Thr Asn Tyr Ser Thr Ala Lys Ala Gly Leu 
145                 150                 155                 160 


His Gly Phe Thr Met Ala Leu Ala Gln Glu Val Ala Thr Lys Gly Val 
                165                 170                 175     


Thr Val Asn Thr Val Ser Pro Gly Tyr Ile Ala Thr Asp Met Val Lys 
            180                 185                 190         


Ala Ile Arg Gln Asp Val Leu Asp Lys Ile Val Ala Thr Ile Pro Val 
        195                 200                 205             


Lys Arg Leu Gly Leu Pro Glu Glu Ile Ala Ser Ile Cys Ala Trp Leu 
    210                 215                 220                 


Ser Ser Glu Glu Ser Gly Phe Ser Thr Gly Ala Asp Phe Ser Leu Asn 
225                 230                 235                 240 


Gly Gly Leu His Met Gly 
                245     


<210>  26
<211>  741
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  26
atgacgcagc gcattgcgta tgtcacgggc ggcatgggcg gcatcggcac cgccatctgc       60

cagcgcctgg cgaaggacgg cttccgcgtg gtggccggct gcggccccaa ctccccgcgc      120

cgtgaaaagt ggctggagca gcaaaaggcc ctcggcttcg atttcatcgc gagcgagggc      180

aacgtcgccg actgggactc gaccaagacc gccttcgaca aggtcaagtc ggaagtcggc      240

gaggtcgatg tgctcatcaa taatgccggc atcacccgtg acgtggtgtt ccggaagatg      300

acccgcgcgg actgggacgc ggtgattgac accaacctga cctccctgtt caacgtcacc      360

aagcaagtca tcgacggcat ggcggatcgc ggctggggcc gcatcgtgaa catcagcagc      420

gtcaacggcc agaagggtca gttcggtcag accaactact cgaccgccaa ggccggcctg      480

cacggcttta caatggccct cgcgcaagaa gtggcgacca agggcgtcac cgtgaacacg      540

gtgagccccg gttacatcgc gacggacatg gtgaaggcca tccgccaaga cgtgctggat      600

aagatcgtcg ccacgatccc ggtgaagcgc ctgggcctgc cggaagagat cgcctccatc      660

tgcgcctggc tgtcgtcgga agagtcgggc ttcagcaccg gcgcggactt ctcgctgaac      720

ggcggcctgc acatgggctg a                                                741


