                         SEQUENCE LISTING

<110>  INVISTA North America S.A.R.L.
       Amatriain, Cristina
       Foster, Alexander
       Cartman, Stephen
 
<120>  Methods and Materials for the Biosynthesis of Beta Hydroxy Acids 
       and/or Derivatives Thereof and/or Compounds Related Thereto

<130>  INV0155WO

<150>  US 62/659,306
<151>  2018-04-18

<150>  US 62/625,066
<151>  2018-02-01

<150>  US 62/625,013
<151>  2018-02-01

<160>  34    

<170>  PatentIn version 3.5

<210>  1
<211>  1668
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  1
atgaagcgct cgaagcgctt cgcggtgctg gcccagcgcc cggtgaacca agatggcctc       60

atcggggagt ggcccgaaga gggcctcatc gcaatggact cgccgttcga tcccgtgtcc      120

tcggtgaagg tcgataacgg cctgatcgtg gagctggacg gcaagcgccg cgaccagttc      180

gatatgatcg accggttcat tgcggactac gcgatcaatg tggaacgcac cgaacaggcg      240

atgcgcctgg aagcggtcga gatcgcccgg atgctcgtgg acatccatgt gagccgcgaa      300

gagatcatcg cgatcaccac ggcgatcacc ccggccaaag ccgtggaagt gatggcccag      360

atgaacgtcg tcgagatgat gatggcgctg cagaagatgc gcgcccgccg caccccgtcc      420

aaccagtgcc atgtcaccaa cctgaaggat aacccggtgc agatcgccgc ggacgcggcc      480

gaggccggca tccggggctt ctcggaacag gaaaccaccg tgggcattgc ccgctacgcc      540

cccttcaacg cgctggccct gctggtcggc tcgcagtgcg gccggccggg cgtgctgacc      600

cagtgcagcg tggaagaagc gaccgagctg gagctgggca tgcgcggcct gacctcgtac      660

gcggaaaccg tgtcggtcta cgggaccgag gccgtcttta ccgacggcga cgacacgccg      720

tggtccaagg cctttctggc gagcgcctat gccagccgcg gcctgaagat gcggtacacg      780

agcggcaccg gctccgaggc cctgatgggc tacagcgagt cgaagtccat gctgtatctg      840

gagtcccggt gcatcttcat cacgaagggc gcgggcgtgc aagggctgca gaatggcgcc      900

gtgtcgtgca tcggcatgac cggcgcggtg cccagcggca tccgcgcggt gctcgccgaa      960

aacctgattg cctccatgct ggacctggaa gtcgcgagcg cgaacgacca gacgttcagc     1020

cacagcgaca tccgccgcac ggcgcgcacg ctgatgcaga tgctgccggg caccgacttc     1080

atcttcagcg gctactccgc ggtgccgaac tatgataata tgttcgccgg cagcaacttc     1140

gatgccgagg atttcgacga ctacaacatc ctgcagcgcg atctgatggt cgatggcggg     1200

ctgcgccccg tcaccgaagc ggaaaccatc gccatccgcc agaaagccgc gcgggccatc     1260

caggccgtgt tccgcgagct ggggctgccg ccgatcgccg acgaagaagt cgaggccgcc     1320

acctacgcgc acggctccaa tgaaatgccc ccgcgcaacg tcgtggagga cctgtcggcg     1380

gtggaagaga tgatgaagcg caacatcacc ggcctggaca tcgtcggcgc gctgtcgcgc     1440

agcggcttcg aggacatcgc gagcaatatc ctgaacatgc tgcgccaacg cgtgaccggc     1500

gactacctcc agacctcggc gattctggac cgccagtttg aggtcgtgtc ggccgtgaac     1560

gacatcaacg actaccaggg cccgggcacg ggctaccgca tctcggccga gcgctgggcc     1620

gagatcaaga acatcccggg cgtggtgcag ccggacacga tcgagtga                  1668


<210>  2
<211>  555
<212>  PRT
<213>  Klebsiella pneumoniae

<400>  2

Met Lys Arg Ser Lys Arg Phe Ala Val Leu Ala Gln Arg Pro Val Asn 
1               5                   10                  15      


Gln Asp Gly Leu Ile Gly Glu Trp Pro Glu Glu Gly Leu Ile Ala Met 
            20                  25                  30          


Asp Ser Pro Phe Asp Pro Val Ser Ser Val Lys Val Asp Asn Gly Leu 
        35                  40                  45              


Ile Val Glu Leu Asp Gly Lys Arg Arg Asp Gln Phe Asp Met Ile Asp 
    50                  55                  60                  


Arg Phe Ile Ala Asp Tyr Ala Ile Asn Val Glu Arg Thr Glu Gln Ala 
65                  70                  75                  80  


Met Arg Leu Glu Ala Val Glu Ile Ala Arg Met Leu Val Asp Ile His 
                85                  90                  95      


Val Ser Arg Glu Glu Ile Ile Ala Ile Thr Thr Ala Ile Thr Pro Ala 
            100                 105                 110         


Lys Ala Val Glu Val Met Ala Gln Met Asn Val Val Glu Met Met Met 
        115                 120                 125             


Ala Leu Gln Lys Met Arg Ala Arg Arg Thr Pro Ser Asn Gln Cys His 
    130                 135                 140                 


Val Thr Asn Leu Lys Asp Asn Pro Val Gln Ile Ala Ala Asp Ala Ala 
145                 150                 155                 160 


Glu Ala Gly Ile Arg Gly Phe Ser Glu Gln Glu Thr Thr Val Gly Ile 
                165                 170                 175     


Ala Arg Tyr Ala Pro Phe Asn Ala Leu Ala Leu Leu Val Gly Ser Gln 
            180                 185                 190         


Cys Gly Arg Pro Gly Val Leu Thr Gln Cys Ser Val Glu Glu Ala Thr 
        195                 200                 205             


Glu Leu Glu Leu Gly Met Arg Gly Leu Thr Ser Tyr Ala Glu Thr Val 
    210                 215                 220                 


Ser Val Tyr Gly Thr Glu Ala Val Phe Thr Asp Gly Asp Asp Thr Pro 
225                 230                 235                 240 


Trp Ser Lys Ala Phe Leu Ala Ser Ala Tyr Ala Ser Arg Gly Leu Lys 
                245                 250                 255     


Met Arg Tyr Thr Ser Gly Thr Gly Ser Glu Ala Leu Met Gly Tyr Ser 
            260                 265                 270         


Glu Ser Lys Ser Met Leu Tyr Leu Glu Ser Arg Cys Ile Phe Ile Thr 
        275                 280                 285             


Lys Gly Ala Gly Val Gln Gly Leu Gln Asn Gly Ala Val Ser Cys Ile 
    290                 295                 300                 


Gly Met Thr Gly Ala Val Pro Ser Gly Ile Arg Ala Val Leu Ala Glu 
305                 310                 315                 320 


Asn Leu Ile Ala Ser Met Leu Asp Leu Glu Val Ala Ser Ala Asn Asp 
                325                 330                 335     


Gln Thr Phe Ser His Ser Asp Ile Arg Arg Thr Ala Arg Thr Leu Met 
            340                 345                 350         


Gln Met Leu Pro Gly Thr Asp Phe Ile Phe Ser Gly Tyr Ser Ala Val 
        355                 360                 365             


Pro Asn Tyr Asp Asn Met Phe Ala Gly Ser Asn Phe Asp Ala Glu Asp 
    370                 375                 380                 


Phe Asp Asp Tyr Asn Ile Leu Gln Arg Asp Leu Met Val Asp Gly Gly 
385                 390                 395                 400 


Leu Arg Pro Val Thr Glu Ala Glu Thr Ile Ala Ile Arg Gln Lys Ala 
                405                 410                 415     


Ala Arg Ala Ile Gln Ala Val Phe Arg Glu Leu Gly Leu Pro Pro Ile 
            420                 425                 430         


Ala Asp Glu Glu Val Glu Ala Ala Thr Tyr Ala His Gly Ser Asn Glu 
        435                 440                 445             


Met Pro Pro Arg Asn Val Val Glu Asp Leu Ser Ala Val Glu Glu Met 
    450                 455                 460                 


Met Lys Arg Asn Ile Thr Gly Leu Asp Ile Val Gly Ala Leu Ser Arg 
465                 470                 475                 480 


Ser Gly Phe Glu Asp Ile Ala Ser Asn Ile Leu Asn Met Leu Arg Gln 
                485                 490                 495     


Arg Val Thr Gly Asp Tyr Leu Gln Thr Ser Ala Ile Leu Asp Arg Gln 
            500                 505                 510         


Phe Glu Val Val Ser Ala Val Asn Asp Ile Asn Asp Tyr Gln Gly Pro 
        515                 520                 525             


Gly Thr Gly Tyr Arg Ile Ser Ala Glu Arg Trp Ala Glu Ile Lys Asn 
    530                 535                 540                 


Ile Pro Gly Val Val Gln Pro Asp Thr Ile Glu 
545                 550                 555 


<210>  3
<211>  1668
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  3
atgaagcgct cgaagcgctt cgcggtgctg gcccagcgcc cggtgaacca agatggcctc       60

atcggggagt ggcccgaaga gggcctcatc gcaatggact cgccgttcga tcccgtgtcc      120

tcggtgaagg tcgataacgg cctgatcgtg gagctggacg gcaagcgccg cgaccagttc      180

gatatgatcg accggttcat tgcggactac gcgatcaatg tggaacgcac cgaacaggcg      240

atgcgcctgg aagcggtcga gatcgcccgg atgctcgtgg acatccatgt gagccgcgaa      300

gagatcatcg cgatcaccac ggcgatcacc ccggccaaag ccgtggaagt gatggcccag      360

atgaacgtcg tcgagatgat gatggcgctg cagaagatgc gcgcccgccg caccccgtcc      420

aaccagtgcc atgtcaccaa cctgaaggat aacccggtgc agatcgccgc ggacgcggcc      480

gaggccggca tccggggctt ctcggaacag gaaaccaccg tgggcattgc ccgctacgcc      540

cccttcaacg cgctggccct gctggtcggc tcgcagtgcg gccggccggg cgtgctgacc      600

cagtgcagcg tggaagaagc gaccgagctg gagctgggca tgcgcggcct gacctcgtac      660

gcggaaaccg tgtcggtcta cgggaccgag gccgtcttta ccgacggcga cgacacgccg      720

tggtccaagg cctttctggc gagcgcctat gccagccgcg gcctgaagat gcggtacacg      780

agcggcaccg gctccgaggc cctgatgggc tacagcgagt cgaagtccat gctgtatctg      840

gagtcccggt gcatcttcat cacgaagggc gcgggcgtgc aagggctgca gaatggcgcc      900

gtgtcgtgca tcggcatgac cggcgcggtg cccagcggca tccgcgcggt gctcgccgaa      960

aacctgattg cctccatgct ggacctggaa gtcgcgagcg cgaacgacca gacgttcagc     1020

cacagcgaca tccgccgcac ggcgcgcacg ctgatgcaga tgctgccggg caccgacttc     1080

atcttcagcg gctactccgc ggtgccgaac tatgataata tgttcgccgg cagcaacttc     1140

gatgccgagg atttcgacga ctacaacatc ctgcagcgcg atctgatggt cgatggcggg     1200

ctgcgccccg tcaccgaagc ggaaaccatc gccatccgcc agaaagccgc gcgggccatc     1260

caggccgtgt tccgcgagct ggggctgccg ccgatcgccg acgaagaagt cgaggccgcc     1320

acctacgcgc acggctccaa tgaaatgccc ccgcgcaacg tcgtggagga cctgtcggcg     1380

gtggaagaga tgatgaagcg caacatcacc ggcctggaca tcgtcggcgc gctgtcgcgc     1440

agcggcttcg aggacatcgc gagcaatatc ctgaacatgc tgcgccaacg cgtgaccggc     1500

gactacctcc agacctcggc gattctggac cgccagtttg aggtcgtgtc ggccgtgaac     1560

gacatcaacg actaccaggg cccgggcacg ggctaccgca tctcggccga gcgctgggcc     1620

gagatcaaga acatcccggg cgtggtgcag ccggacacga tcgagtga                  1668


<210>  4
<211>  426
<212>  DNA
<213>  Klebsiella pneumoniae

<400>  4
atgagcgaga aaaccatgcg cgtgcaggat tatccgttag ccacccgctg cccggagcat       60

atcctgacgc ctaccggcaa accattgacc gatattaccc tcgagaaggt gctctctggc      120

gaggtgggcc cgcaggatgt gcggatctcc cgccagaccc ttgagtacca ggcgcagatt      180

gccgagcaga tgcagcgcca tgcggtggcg cgcaatttcc gccgcgcggc ggagcttatc      240

gccattcctg acgagcgcat tctggctatc tataacgcgc tgcgcccgtt ccgctcctcg      300

caggcggagc tgctggcgat cgccgacgag ctggagcaca cctggcatgc gacagtgaat      360

gccgcctttg tccgggagtc ggcggaagtg tatcagcagc ggcataagct gcgtaaagga      420

agctaa                                                                 426


<210>  5
<211>  141
<212>  PRT
<213>  Klebsiella pneumoniae

<400>  5

Met Ser Glu Lys Thr Met Arg Val Gln Asp Tyr Pro Leu Ala Thr Arg 
1               5                   10                  15      


Cys Pro Glu His Ile Leu Thr Pro Thr Gly Lys Pro Leu Thr Asp Ile 
            20                  25                  30          


Thr Leu Glu Lys Val Leu Ser Gly Glu Val Gly Pro Gln Asp Val Arg 
        35                  40                  45              


Ile Ser Arg Gln Thr Leu Glu Tyr Gln Ala Gln Ile Ala Glu Gln Met 
    50                  55                  60                  


Gln Arg His Ala Val Ala Arg Asn Phe Arg Arg Ala Ala Glu Leu Ile 
65                  70                  75                  80  


Ala Ile Pro Asp Glu Arg Ile Leu Ala Ile Tyr Asn Ala Leu Arg Pro 
                85                  90                  95      


Phe Arg Ser Ser Gln Ala Glu Leu Leu Ala Ile Ala Asp Glu Leu Glu 
            100                 105                 110         


His Thr Trp His Ala Thr Val Asn Ala Ala Phe Val Arg Glu Ser Ala 
        115                 120                 125             


Glu Val Tyr Gln Gln Arg His Lys Leu Arg Lys Gly Ser 
    130                 135                 140     


<210>  6
<211>  1824
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  6
atgccgttaa tagccgggat tgatatcggc aacgccacca ccgaggtggc gctggcgtcc       60

gacgacccgc aggcgagggc gtttgttgcc agcgggatcg ttgcgacgac gggcatgaaa      120

gggacgcggg acaatatcgc cgggaccctc gccgcgctgg agcaggccct ggcgaaaaca      180

ccgtggtcga tgagcgatgt ctctcgcatc tatcttaacg aagccgcgcc ggtgattggc      240

gatgtggcga tggagaccat caccgagacc attatcaccg aatcgaccat gatcggtcat      300

aacccgcaga cgccgggcgg ggtgggcgtt ggcgtgggga cgactatcgc cctcgggcgg      360

ctggcgacgc tgccggcggc gcagtatgcc gaggggtgga tcgtactgat tgacgatgcc      420

gtcgatttcc ttgacgccgt gtggtggctc aatgaggcgc tcgaccgggg gatcaacgtg      480

gtggcggcga tccttaaaaa ggacgacggc gtgctggtga acaaccgcct gcgtaaaacc      540

ctgccggtgg tggatgaagt gacgctgctg gagcaggtcc ccgagggggt gatggcggcg      600

gtggaagtgg ccgcgccggg ccaggtggtg cggatcctgt cgaatcccta cgggatcgcc      660

accttcttcg ggctaagccc ggaagagacc caggccatcg tccccatcgc ccgcgccctg      720

attggcaacc gttccgcggt ggtgctcaag accccgcagg gggacgtgca gtcgcgggtg      780

atcccggcgg gcaacctcta cattagcggc gaaaagcgcc gcggagaggc cgatgttgcc      840

gagggcgcgg aagccatcat gcaggcgatg agcgcctgcg ctccggtacg cgacatccgc      900

ggcgaaccgg gcactcacgc cggcggcatg cttgagcggg tgcgcaaggt aatggcgtcc      960

ctgaccgacc atgagatgag cgcgatatac atccaggatc tgctggcggt ggatacgttt     1020

attccgcgca aggtgcaggg cgggatggcc ggcgagtgcg ccatggaaaa tgccgtcggg     1080

atggcggcga tggtgaaagc ggatcgtctg caaatgcagg ttatcgcccg cgaactgagc     1140

gcccgactgc agaccgaggt ggtggtgggc ggcgtggagg ccaacatggc catcgccggg     1200

gcgttaacca ctcccggctg tgcggcgccg ctggcgatcc tcgacctcgg cgccggctcg     1260

acggatgcgg cgatcgtcaa cgcggagggg cagataacgg cggtccatct cgccggggcg     1320

gggaatatgg tcagcctgtt gattaaaacc gagctgggcc tcgaggatct ttcgctggcg     1380

gaagcgataa aaaaataccc gctggccaaa gtggaaagcc tgttcagtat tcgtcatgag     1440

aatggcgcgg tggagttctt tcgggaagcc ctcagcccgg cggtgttcgc caaagtggtg     1500

tacatcaagg agggcgaact ggtgccgatc gataacgcca gcccgctgga aaaaattcgt     1560

ctcgtgcgcc ggcaggcgaa agagaaagtg tttgtcacca actgcctgcg cgcgctgcgc     1620

caggtctcac ccggcggttc cattcgcgat atcgcctttg tggtgctggt gggcggctca     1680

tcgctggact ttgagatccc gcagcttatc acggaagcct tgtcgcacta tggcgtagtc     1740

gccgggcagg gcaatattcg gggaacagaa gggccgcgca atgcggtcgc caccgggctg     1800

ctactggccg gtcaggcgaa ttaa                                            1824


<210>  7
<211>  607
<212>  PRT
<213>  Klebsiella pneumoniae

<400>  7

Met Pro Leu Ile Ala Gly Ile Asp Ile Gly Asn Ala Thr Thr Glu Val 
1               5                   10                  15      


Ala Leu Ala Ser Asp Tyr Pro Gln Ala Arg Ala Phe Val Ala Ser Gly 
            20                  25                  30          


Ile Val Ala Thr Thr Gly Met Lys Gly Thr Arg Asp Asn Ile Ala Gly 
        35                  40                  45              


Thr Leu Ala Ala Leu Glu Gln Ala Leu Ala Lys Thr Pro Trp Ser Met 
    50                  55                  60                  


Ser Asp Val Ser Arg Ile Tyr Leu Asn Glu Ala Ala Pro Val Ile Gly 
65                  70                  75                  80  


Asp Val Ala Met Glu Thr Ile Thr Glu Thr Ile Ile Thr Glu Ser Thr 
                85                  90                  95      


Met Ile Gly His Asn Pro Gln Thr Pro Gly Gly Val Gly Val Gly Val 
            100                 105                 110         


Gly Thr Thr Ile Ala Leu Gly Arg Leu Ala Thr Leu Pro Ala Ala Gln 
        115                 120                 125             


Tyr Ala Glu Gly Trp Ile Val Leu Ile Asp Asp Ala Val Asp Phe Leu 
    130                 135                 140                 


Asp Ala Val Trp Trp Leu Asn Glu Ala Leu Asp Arg Gly Ile Asn Val 
145                 150                 155                 160 


Val Ala Ala Ile Leu Lys Lys Asp Asp Gly Val Leu Val Asn Asn Arg 
                165                 170                 175     


Leu Arg Lys Thr Leu Pro Val Val Asp Glu Val Thr Leu Leu Glu Gln 
            180                 185                 190         


Val Pro Glu Gly Val Met Ala Ala Val Glu Val Ala Ala Pro Gly Gln 
        195                 200                 205             


Val Val Arg Ile Leu Ser Asn Pro Tyr Gly Ile Ala Thr Phe Phe Gly 
    210                 215                 220                 


Leu Ser Pro Glu Glu Thr Gln Ala Ile Val Pro Ile Ala Arg Ala Leu 
225                 230                 235                 240 


Ile Gly Asn Arg Ser Ala Val Val Leu Lys Thr Pro Gln Gly Asp Val 
                245                 250                 255     


Gln Ser Arg Val Ile Pro Ala Gly Asn Leu Tyr Ile Ser Gly Glu Lys 
            260                 265                 270         


Arg Arg Gly Glu Ala Asp Val Ala Glu Gly Ala Glu Ala Ile Met Gln 
        275                 280                 285             


Ala Met Ser Ala Cys Ala Pro Val Arg Asp Ile Arg Gly Glu Pro Gly 
    290                 295                 300                 


Thr His Ala Gly Gly Met Leu Glu Arg Val Arg Lys Val Met Ala Ser 
305                 310                 315                 320 


Leu Thr Gly His Glu Met Ser Ala Ile Tyr Ile Gln Asp Leu Leu Ala 
                325                 330                 335     


Val Asp Thr Phe Ile Pro Arg Lys Val Gln Gly Gly Met Ala Gly Glu 
            340                 345                 350         


Cys Ala Met Glu Asn Ala Val Gly Met Ala Ala Met Val Lys Ala Asp 
        355                 360                 365             


Arg Leu Gln Met Gln Val Ile Ala Arg Glu Leu Ser Ala Arg Leu Gln 
    370                 375                 380                 


Thr Glu Val Val Val Gly Gly Val Glu Ala Asn Met Ala Ile Ala Gly 
385                 390                 395                 400 


Ala Leu Thr Thr Pro Gly Cys Ala Ala Pro Leu Ala Ile Leu Asp Leu 
                405                 410                 415     


Gly Ala Gly Ser Thr Asp Ala Ala Ile Val Asn Ala Glu Gly Gln Ile 
            420                 425                 430         


Thr Ala Val His Leu Ala Gly Ala Gly Asn Met Val Ser Leu Leu Ile 
        435                 440                 445             


Lys Thr Glu Leu Gly Leu Glu Asp Leu Ser Leu Ala Glu Ala Ile Lys 
    450                 455                 460                 


Lys Tyr Pro Leu Ala Lys Val Glu Ser Leu Phe Ser Ile Arg His Glu 
465                 470                 475                 480 


Asn Gly Ala Val Glu Phe Phe Arg Glu Ala Leu Ser Pro Ala Val Phe 
                485                 490                 495     


Ala Lys Val Val Tyr Ile Lys Glu Gly Glu Leu Val Pro Ile Asp Asn 
            500                 505                 510         


Ala Ser Pro Leu Glu Lys Ile Arg Leu Val Arg Arg Gln Ala Lys Glu 
        515                 520                 525             


Lys Val Phe Val Thr Asn Cys Leu Arg Ala Leu Arg Gln Val Ser Pro 
    530                 535                 540                 


Gly Gly Ser Ile Arg Asp Ile Ala Phe Val Val Leu Val Gly Gly Ser 
545                 550                 555                 560 


Ser Leu Asp Phe Glu Ile Pro Gln Leu Ile Thr Glu Ala Leu Ser His 
                565                 570                 575     


Tyr Gly Val Val Ala Gly Gln Gly Asn Ile Arg Gly Thr Glu Gly Pro 
            580                 585                 590         


Arg Asn Ala Val Ala Thr Gly Leu Leu Leu Ala Gly Gln Ala Asn 
        595                 600                 605         


<210>  8
<211>  2558
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  8
acttttcata ctcccgccat tcagagaaga aaccaattgt ccatattgca tcagacattg       60

ccgtcactgc gtcttttact ggctcttctc gctaaccaaa ccggtaaccc cgcttattaa      120

aagcattctg taacaaagcg ggaccaaagc catgacaaaa acgcgtaaca aaagtgtcta      180

taatcacggc agaaaagtcc acattgatta tttgcacggc gtcacacttt gctatgccat      240

agcattttta tccataagat tagcggatcc tacctgacgc tttttatcgc aactctctac      300

tgtttctcca tacccgtttt ttgggctaga aataattttg tttaacttta aaaggaggta      360

tatcgatgcc cctgatcgcc ggcattgata tcggcaacgc gaccacggag gtcgcgctgg      420

cgtccgatta tccccaggcc cgggccttcg tggcgtccgg catcgtcgcc accaccggca      480

tgaagggcac gcgggacaac atcgccggca cactcgccgc cctggagcag gcgctggcca      540

agaccccgtg gagcatgtcg gacgtgagcc gcatctacct gaacgaagcg gccccggtga      600

tcggcgatgt ggcgatggaa accattaccg aaacgattat taccgagtcc accatgatcg      660

gccataaccc gcagacgccg gggggggtgg gcgtgggcgt gggcaccacg attgcgctgg      720

ggcgcctggc caccctcccc gcggcgcagt atgccgaagg gtggattgtg ctgatcgatg      780

atgcggtgga tttcctcgac gcggtctggt ggctgaatga ggcgctggat cgcgggatca      840

atgtcgtggc ggcgatcctc aagaaagatg acggcgtgct cgtgaataac cgcctgcgca      900

agacgctccc cgtggtggac gaagtgaccc tgctggaaca ggtgccggag ggcgtcatgg      960

ccgcggtcga agtggcggcc cccggccagg tcgtgcgcat cctcagcaac ccgtacggca     1020

tcgccacgtt cttcggcctc agcccggagg aaacccaggc gatcgtcccg atcgcccgcg     1080

cgctgatcgg gaaccgctcg gcggttgtcc tgaaaacccc gcagggggat gtgcagagcc     1140

gcgtgatccc cgccggcaac ctgtatatca gcggcgaaaa gcgccgcggc gaagccgacg     1200

tggccgaggg cgccgaagcc atcatgcaag ccatgagcgc gtgcgccccg gtccgcgata     1260

tccggggcga gcccggcacc cacgcgggcg gcatgctgga acgcgtccgg aaggtgatgg     1320

cctcgctgac ggaccacgag atgtcggcga tctatatcca ggatctgctc gccgtggaca     1380

cgtttatccc gcggaaagtc cagggcggca tggccggcga gtgcgcgatg gagaacgccg     1440

tgggcatggc ggcgatggtg aaggccgatc gcctgcagat gcaagtcatc gcccgggaac     1500

tgagcgcgcg cctgcagacc gaagtggtcg tcgggggggt cgaggcgaac atggcgattg     1560

cgggcgcgct gacgacgccc gggtgcgcgg cgccgctggc cattctcgac ctgggcgcgg     1620

gctccaccga cgcggcgatt gtgaatgcgg agggccagat caccgcggtc cacctggcgg     1680

gcgcgggcaa catggtcagc ctcctgatca agaccgaact gggcctggaa gatttgagcc     1740

tggccgaagc catcaagaag tacccgctgg cgaaggtcga aagcctgttt agcatccgcc     1800

atgagaatgg cgccgtggag ttctttcgcg aggcgctctc ccccgccgtg ttcgccaaag     1860

tcgtgtacat caaggaaggg gagctggtgc cgatcgacaa tgcgtcgccg ctggaaaaga     1920

tccgcctggt ccgccgccag gccaaggaga aggtgttcgt gacgaactgc ctgcgcgcgc     1980

tgcgccaagt gtcgccgggc ggctcgatcc gcgacatcgc cttcgtggtc ctggtggggg     2040

gctcctcgct ggatttcgaa atcccgcaac tgatcaccga agcgctctcg cactacgggg     2100

tcgtcgcggg ccagggcaac atccgcggca ccgagggccc ccgcaacgcg gtcgccaccg     2160

gcctgctgct ggccggccag gccaactgaa aaggaggtat atcgatgtcg ctgagcccgc     2220

cgggcgtccg cctgttctat gacccccgcg gccatcacgc cggggccatc aatgaactgt     2280

gctggggcct ggaagaacag ggcgtgccct gccagaccat cacgtacgac ggcggcggcg     2340

acgcggcggc gctgggcgcc ctcgccgccc ggagctcccc gctgcgcgtg ggcatcggcc     2400

tgagcgcctc gggcgagatc gccctgacgc acgcgcagct gaccgcggat gccccgctcg     2460

ccaccgggca cgtgacggat tcggacgacc atctgcgcac cctgggcgcg aacgcgggcc     2520

aactggtgaa ggtcctcccg ctgtccgagc gcaactga                             2558


<210>  9
<211>  607
<212>  PRT
<213>  Klebsiella pneumoniae

<400>  9

Met Pro Leu Ile Ala Gly Ile Asp Ile Gly Asn Ala Thr Thr Glu Val 
1               5                   10                  15      


Ala Leu Ala Ser Asp Tyr Pro Gln Ala Arg Ala Phe Val Ala Ser Gly 
            20                  25                  30          


Ile Val Ala Thr Thr Gly Met Lys Gly Thr Arg Asp Asn Ile Ala Gly 
        35                  40                  45              


Thr Leu Ala Ala Leu Glu Gln Ala Leu Ala Lys Thr Pro Trp Ser Met 
    50                  55                  60                  


Ser Asp Val Ser Arg Ile Tyr Leu Asn Glu Ala Ala Pro Val Ile Gly 
65                  70                  75                  80  


Asp Val Ala Met Glu Thr Ile Thr Glu Thr Ile Ile Thr Glu Ser Thr 
                85                  90                  95      


Met Ile Gly His Asn Pro Gln Thr Pro Gly Gly Val Gly Val Gly Val 
            100                 105                 110         


Gly Thr Thr Ile Ala Leu Gly Arg Leu Ala Thr Leu Pro Ala Ala Gln 
        115                 120                 125             


Tyr Ala Glu Gly Trp Ile Val Leu Ile Asp Asp Ala Val Asp Phe Leu 
    130                 135                 140                 


Asp Ala Val Trp Trp Leu Asn Glu Ala Leu Asp Arg Gly Ile Asn Val 
145                 150                 155                 160 


Val Ala Ala Ile Leu Lys Lys Asp Asp Gly Val Leu Val Asn Asn Arg 
                165                 170                 175     


Leu Arg Lys Thr Leu Pro Val Val Asp Glu Val Thr Leu Leu Glu Gln 
            180                 185                 190         


Val Pro Glu Gly Val Met Ala Ala Val Glu Val Ala Ala Pro Gly Gln 
        195                 200                 205             


Val Val Arg Ile Leu Ser Asn Pro Tyr Gly Ile Ala Thr Phe Phe Gly 
    210                 215                 220                 


Leu Ser Pro Glu Glu Thr Gln Ala Ile Val Pro Ile Ala Arg Ala Leu 
225                 230                 235                 240 


Ile Gly Asn Arg Ser Ala Val Val Leu Lys Thr Pro Gln Gly Asp Val 
                245                 250                 255     


Gln Ser Arg Val Ile Pro Ala Gly Asn Leu Tyr Ile Ser Gly Glu Lys 
            260                 265                 270         


Arg Arg Gly Glu Ala Asp Val Ala Glu Gly Ala Glu Ala Ile Met Gln 
        275                 280                 285             


Ala Met Ser Ala Cys Ala Pro Val Arg Asp Ile Arg Gly Glu Pro Gly 
    290                 295                 300                 


Thr His Ala Gly Gly Met Leu Glu Arg Val Arg Lys Val Met Ala Ser 
305                 310                 315                 320 


Leu Thr Asp His Glu Met Ser Ala Ile Tyr Ile Gln Asp Leu Leu Ala 
                325                 330                 335     


Val Asp Thr Phe Ile Pro Arg Lys Val Gln Gly Gly Met Ala Gly Glu 
            340                 345                 350         


Cys Ala Met Glu Asn Ala Val Gly Met Ala Ala Met Val Lys Ala Asp 
        355                 360                 365             


Arg Leu Gln Met Gln Val Ile Ala Arg Glu Leu Ser Ala Arg Leu Gln 
    370                 375                 380                 


Thr Glu Val Val Val Gly Gly Val Glu Ala Asn Met Ala Ile Ala Gly 
385                 390                 395                 400 


Ala Leu Thr Thr Pro Gly Cys Ala Ala Pro Leu Ala Ile Leu Asp Leu 
                405                 410                 415     


Gly Ala Gly Ser Thr Asp Ala Ala Ile Val Asn Ala Glu Gly Gln Ile 
            420                 425                 430         


Thr Ala Val His Leu Ala Gly Ala Gly Asn Met Val Ser Leu Leu Ile 
        435                 440                 445             


Lys Thr Glu Leu Gly Leu Glu Asp Leu Ser Leu Ala Glu Ala Ile Lys 
    450                 455                 460                 


Lys Tyr Pro Leu Ala Lys Val Glu Ser Leu Phe Ser Ile Arg His Glu 
465                 470                 475                 480 


Asn Gly Ala Val Glu Phe Phe Arg Glu Ala Leu Ser Pro Ala Val Phe 
                485                 490                 495     


Ala Lys Val Val Tyr Ile Lys Glu Gly Glu Leu Val Pro Ile Asp Asn 
            500                 505                 510         


Ala Ser Pro Leu Glu Lys Ile Arg Leu Val Arg Arg Gln Ala Lys Glu 
        515                 520                 525             


Lys Val Phe Val Thr Asn Cys Leu Arg Ala Leu Arg Gln Val Ser Pro 
    530                 535                 540                 


Gly Gly Ser Ile Arg Asp Ile Ala Phe Val Val Leu Val Gly Gly Ser 
545                 550                 555                 560 


Ser Leu Asp Phe Glu Ile Pro Gln Leu Ile Thr Glu Ala Leu Ser His 
                565                 570                 575     


Tyr Gly Val Val Ala Gly Gln Gly Asn Ile Arg Gly Thr Glu Gly Pro 
            580                 585                 590         


Arg Asn Ala Val Ala Thr Gly Leu Leu Leu Ala Gly Gln Ala Asn 
        595                 600                 605         


<210>  10
<211>  117
<212>  PRT
<213>  Klebsiella pneumoniae

<400>  10

Met Ser Leu Ser Pro Pro Gly Val Arg Leu Phe Tyr Asp Pro Arg Gly 
1               5                   10                  15      


His His Ala Gly Ala Ile Asn Glu Leu Cys Trp Gly Leu Glu Glu Gln 
            20                  25                  30          


Gly Val Pro Cys Gln Thr Ile Thr Tyr Asp Gly Gly Gly Asp Ala Ala 
        35                  40                  45              


Ala Leu Gly Ala Leu Ala Ala Arg Ser Ser Pro Leu Arg Val Gly Ile 
    50                  55                  60                  


Gly Leu Ser Ala Ser Gly Glu Ile Ala Leu Thr His Ala Gln Leu Thr 
65                  70                  75                  80  


Ala Asp Ala Pro Leu Ala Thr Gly His Val Thr Asp Ser Asp Asp His 
                85                  90                  95      


Leu Arg Thr Leu Gly Ala Asn Ala Gly Gln Leu Val Lys Val Leu Pro 
            100                 105                 110         


Leu Ser Glu Arg Asn 
        115         


<210>  11
<211>  1488
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  11
atgaattttc atcatctggc ttactggcag gataaagcgt taagtctcgc cattgaaaac       60

cgcttattta ttaacggtga atatactgct gcggcggaaa atgaaacctt tgaaaccgtt      120

gatccggtca cccaggcacc gctggcgaaa attgcccgcg gcaagagcgt cgatatcgac      180

cgtgcgatga gcgcagcacg cggcgtattt gaacgcggcg actggtcact ctcttctccg      240

gctaaacgta aagcggtact gaataaactc gccgatttaa tggaagccca cgccgaagag      300

ctggcactgc tggaaactct cgacaccggc aaaccgattc gtcacagtct gcgtgatgat      360

attcccggcg cggcgcgcgc cattcgctgg tacgccgaag cgatcgacaa agtgtatggc      420

gaagtggcga ccaccagtag ccatgagctg gcgatgatcg tgcgtgaacc ggtcggcgtg      480

attgccgcca tcgtgccgtg gaacttcccg ctgttgctga cttgctggaa actcggcccg      540

gcgctggcgg cgggaaacag cgtgattcta aaaccgtctg aaaaatcacc gctcagtgcg      600

attcgtctcg cggggctggc gaaagaagca ggcttgccgg atggtgtgtt gaacgtggtg      660

acgggttttg gtcatgaagc cgggcaggcg ctgtcgcgtc ataacgatat cgacgccatt      720

gcctttaccg gttcaacccg taccgggaaa cagctgctga aagatgcggg cgacagcaac      780

atgaaacgcg tctggctgga agcgggcggc aaaagcgcca acatcgtttt cgctgactgc      840

ccggatttgc aacaggcggc aagcgccacc gcagcaggca ttttctacaa ccagggacag      900

gtgtgcatcg ccggaacgcg cctgttgctg gaagagagca tcgccgatga attcttagcc      960

ctgttaaaac agcaggcgca aaactggcag ccgggccatc cacttgatcc cgcaaccacc     1020

atgggcacct taatcgactg cgcccacgcc gactcggtcc atagctttat tcgggaaggc     1080

gaaagcaaag ggcaactgtt gttggatggc cgtaacgccg ggctggctgc cgccatcggc     1140

ccgaccatct ttgtggatgt ggacccgaat gcgtccttaa gtcgcgaaga gattttcggt     1200

ccggtgctgg tggtcacgcg tttcacatca gaagaacagg cgctacagct tgccaacgac     1260

agccagtacg gccttggcgc ggcggtatgg acgcgcgacc tctcccgcgc gcaccgcatg     1320

agccgacgcc tgaaagccgg ttccgtcttc gtcaataact acaacgacgg cgatatgacc     1380

gtgccgtttg gcggctataa gcagagcggc aacggtcgcg acaaatccct gcatgccctt     1440

gaaaaattca ctgaactgaa aaccatctgg ataagcctgg aggcctga                  1488


<210>  12
<211>  495
<212>  PRT
<213>  E. coli

<400>  12

Met Asn Phe His His Leu Ala Tyr Trp Gln Asp Lys Ala Leu Ser Leu 
1               5                   10                  15      


Ala Ile Glu Asn Arg Leu Phe Ile Asn Gly Glu Tyr Thr Ala Ala Ala 
            20                  25                  30          


Glu Asn Glu Thr Phe Glu Thr Val Asp Pro Val Thr Gln Ala Pro Leu 
        35                  40                  45              


Ala Lys Ile Ala Arg Gly Lys Ser Val Asp Ile Asp Arg Ala Met Ser 
    50                  55                  60                  


Ala Ala Arg Gly Val Phe Glu Arg Gly Asp Trp Ser Leu Ser Ser Pro 
65                  70                  75                  80  


Ala Lys Arg Lys Ala Val Leu Asn Lys Leu Ala Asp Leu Met Glu Ala 
                85                  90                  95      


His Ala Glu Glu Leu Ala Leu Leu Glu Thr Leu Asp Thr Gly Lys Pro 
            100                 105                 110         


Ile Arg His Ser Leu Arg Asp Asp Ile Pro Gly Ala Ala Arg Ala Ile 
        115                 120                 125             


Arg Trp Tyr Ala Glu Ala Ile Asp Lys Val Tyr Gly Glu Val Ala Thr 
    130                 135                 140                 


Thr Ser Ser His Glu Leu Ala Met Ile Val Arg Glu Pro Val Gly Val 
145                 150                 155                 160 


Ile Ala Ala Ile Val Pro Trp Asn Phe Pro Leu Leu Leu Thr Cys Trp 
                165                 170                 175     


Lys Leu Gly Pro Ala Leu Ala Ala Gly Asn Ser Val Ile Leu Lys Pro 
            180                 185                 190         


Ser Glu Lys Ser Pro Leu Ser Ala Ile Arg Leu Ala Gly Leu Ala Lys 
        195                 200                 205             


Glu Ala Gly Leu Pro Asp Gly Val Leu Asn Val Val Thr Gly Phe Gly 
    210                 215                 220                 


His Glu Ala Gly Gln Ala Leu Ser Arg His Asn Asp Ile Asp Ala Ile 
225                 230                 235                 240 


Ala Phe Thr Gly Ser Thr Arg Thr Gly Lys Gln Leu Leu Lys Asp Ala 
                245                 250                 255     


Gly Asp Ser Asn Met Lys Arg Val Trp Leu Glu Ala Gly Gly Lys Ser 
            260                 265                 270         


Ala Asn Ile Val Phe Ala Asp Cys Pro Asp Leu Gln Gln Ala Ala Ser 
        275                 280                 285             


Ala Thr Ala Ala Gly Ile Phe Tyr Asn Gln Gly Gln Val Cys Ile Ala 
    290                 295                 300                 


Gly Thr Arg Leu Leu Leu Glu Glu Arg Ile Ala Asp Glu Phe Leu Ala 
305                 310                 315                 320 


Leu Leu Lys Gln Gln Ala Gln Asn Trp Gln Pro Gly His Pro Leu Asp 
                325                 330                 335     


Pro Ala Thr Thr Met Gly Thr Leu Ile Asp Cys Ala His Ala Asp Ser 
            340                 345                 350         


Val His Ser Phe Ile Arg Glu Gly Glu Ser Lys Gly Gln Leu Leu Leu 
        355                 360                 365             


Asp Gly Arg Asn Ala Gly Leu Ala Ala Ala Ile Gly Pro Thr Ile Phe 
    370                 375                 380                 


Val Asp Val Asp Pro Asn Ala Ser Leu Ser Arg Glu Glu Ile Phe Gly 
385                 390                 395                 400 


Pro Val Leu Val Val Thr Arg Phe Thr Ser Glu Glu Gln Ala Leu Gln 
                405                 410                 415     


Leu Ala Asn Asp Ser Gln Tyr Gly Leu Gly Ala Ala Val Trp Thr Arg 
            420                 425                 430         


Asp Leu Ser Arg Ala His Arg Met Ser Arg Arg Leu Lys Ala Gly Ser 
        435                 440                 445             


Val Phe Val Asn Asn Tyr Asn Asp Gly Asp Met Thr Val Pro Phe Gly 
    450                 455                 460                 


Gly Tyr Lys Gln Ser Gly Asn Gly Arg Asp Lys Ser Leu His Ala Leu 
465                 470                 475                 480 


Glu Lys Phe Thr Glu Leu Lys Thr Ile Trp Ile Ser Leu Glu Ala 
                485                 490                 495 


<210>  13
<211>  1491
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  13
atgatgaatt ttcagcacct ggcttactgg caggaaaaag cgaaaaacct ggccattgaa       60

acgcgcttat ttattaacgg cgaatattgc gccgcggccg ataataccac ctttgagact      120

atcgaccccg ccgcgcagca gacattagcc caggtcgccc gcggtaaaaa agccgacgtc      180

gaacgggcgg tgaaagccgc gcgccaggct tttgataacg gcgactggtc gcaggcctcc      240

cccgcacagc gtaaagcgat cctcactcgc tttgctaatc tgatggaggc ccatcgtgaa      300

gagctggcgc tgctggaaac gctggatacc ggcaagccga ttcgccacag cctgcgcgac      360

gatattcccg gcgccgcccg cgccattcgc tggtatgccg aagcgctgga taaagtctat      420

ggcgaagtgg cccccaccgg cagcaacgag ctggcgatga tcgttcgcga accaattggc      480

gtgatcgccg cggtggtgcc gtggaacttc ccgctgctgc tggcctgctg gaaactcggc      540

ccggcgctgg cggcaggcaa tagcgtaatc ctcaaaccct cggaaaaatc gccgcttacc      600

gccctgcgtc tggccgggct ggcgaaagag gccggcctgc cggacggcgt gttgaacgtg      660

gtcagcggct ttggccacga ggccgggcag gcgctggccc tgcatcctga tgttgaagtc      720

atcaccttca ccggctccac ccgcaccggc aagcagctgc tgaaagacgc cggcgacagc      780

aatatgaagc gcgtgtggct ggaagcgggc ggcaagagcg ccaacattgt cttcgccgat      840

tgcccggatc tgcaacaagc ggttcgcgcc accgccggcg gcatcttcta caaccaggga      900

caggtgtgca tcgccgggac ccgtctgctg ctcgaggaga gcatcgctga cgagttcctg      960

gcgcggctga aagctgaggc gcaacactgg cagccgggca acccgctcga tccggacacc     1020

accatgggca tgctgattga caatacccat gccgacaacg tgcatagctt tattcgcggc     1080

ggcgaaagcc aaagcaccct gttcctcgac ggacggaaaa acccgtggcc tgccgccgtt     1140

ggcccgacca ttttcgttga cgtcgacccg gcatcaaccc tcagccggga agagatcttc     1200

ggcccggtgc tggtggtgac ccgcttcaaa agcgaagaag aggcgctaaa gctcgccaat     1260

gacagcgact acggcttggg cgccgcggtg tggacccgcg atctctcccg cgcccaccgc     1320

atgagccgcc gcctgaaggc cggctcggtc ttcgtcaaca actataacga tggtgatatg     1380

accgttccgt tcggcggcta caagcagagc ggcaacgggc gcgataaatc gctgcacgcg     1440

ctggaaaaat tcaccgaact gaaaaccatc tggattgccc tggagtcttg a              1491


<210>  14
<211>  496
<212>  PRT
<213>  Klebsiella pneumoniae

<400>  14

Met Met Asn Phe Gln His Leu Ala Tyr Trp Gln Glu Lys Ala Lys Asn 
1               5                   10                  15      


Leu Ala Ile Glu Thr Arg Leu Phe Ile Asn Gly Glu Tyr Cys Ala Ala 
            20                  25                  30          


Ala Asp Asn Thr Thr Phe Glu Thr Ile Asp Pro Ala Ala Gln Gln Thr 
        35                  40                  45              


Leu Ala Gln Val Ala Arg Gly Lys Lys Ala Asp Val Glu Arg Ala Val 
    50                  55                  60                  


Lys Ala Ala Arg Gln Ala Phe Asp Asn Gly Asp Trp Ser Gln Ala Ser 
65                  70                  75                  80  


Pro Ala Gln Arg Lys Ala Ile Leu Thr Arg Phe Ala Asn Leu Met Glu 
                85                  90                  95      


Ala His Arg Glu Glu Leu Ala Leu Leu Glu Thr Leu Asp Thr Gly Lys 
            100                 105                 110         


Pro Ile Arg His Ser Leu Arg Asp Asp Ile Pro Gly Ala Ala Arg Ala 
        115                 120                 125             


Ile Arg Trp Tyr Ala Glu Ala Leu Asp Lys Val Tyr Gly Glu Val Ala 
    130                 135                 140                 


Pro Thr Gly Ser Asn Glu Leu Ala Met Ile Val Arg Glu Pro Ile Gly 
145                 150                 155                 160 


Val Ile Ala Ala Val Val Pro Trp Asn Phe Pro Leu Leu Leu Ala Cys 
                165                 170                 175     


Trp Lys Leu Gly Pro Ala Leu Ala Ala Gly Asn Ser Val Ile Leu Lys 
            180                 185                 190         


Pro Ser Glu Lys Ser Pro Leu Thr Ala Leu Arg Leu Ala Gly Leu Ala 
        195                 200                 205             


Lys Glu Ala Gly Leu Pro Asp Gly Val Leu Asn Val Val Ser Gly Phe 
    210                 215                 220                 


Gly His Glu Ala Gly Gln Ala Leu Ala Leu His Pro Asp Val Glu Val 
225                 230                 235                 240 


Ile Thr Phe Thr Gly Ser Thr Arg Thr Gly Lys Gln Leu Leu Lys Asp 
                245                 250                 255     


Ala Gly Asp Ser Asn Met Lys Arg Val Trp Leu Glu Ala Gly Gly Lys 
            260                 265                 270         


Ser Ala Asn Ile Val Phe Ala Asp Cys Pro Asp Leu Gln Gln Ala Val 
        275                 280                 285             


Arg Ala Thr Ala Gly Gly Ile Phe Tyr Asn Gln Gly Gln Val Cys Ile 
    290                 295                 300                 


Ala Gly Thr Arg Leu Leu Leu Glu Glu Ser Ile Ala Asp Glu Phe Leu 
305                 310                 315                 320 


Ala Arg Leu Lys Ala Glu Ala Gln His Trp Gln Pro Gly Asn Pro Leu 
                325                 330                 335     


Asp Pro Asp Thr Thr Met Gly Met Leu Ile Asp Asn Thr His Ala Asp 
            340                 345                 350         


Asn Val His Ser Phe Ile Arg Gly Gly Glu Ser Gln Ser Thr Leu Phe 
        355                 360                 365             


Leu Asp Gly Arg Lys Asn Pro Trp Pro Ala Ala Val Gly Pro Thr Ile 
    370                 375                 380                 


Phe Val Asp Val Asp Pro Ala Ser Thr Leu Ser Arg Glu Glu Ile Phe 
385                 390                 395                 400 


Gly Pro Val Leu Val Val Thr Arg Phe Lys Ser Glu Glu Glu Ala Leu 
                405                 410                 415     


Lys Leu Ala Asn Asp Ser Asp Tyr Gly Leu Gly Ala Ala Val Trp Thr 
            420                 425                 430         


Arg Asp Leu Ser Arg Ala His Arg Met Ser Arg Arg Leu Lys Ala Gly 
        435                 440                 445             


Ser Val Phe Val Asn Asn Tyr Asn Asp Gly Asp Met Thr Val Pro Phe 
    450                 455                 460                 


Gly Gly Tyr Lys Gln Ser Gly Asn Gly Arg Asp Lys Ser Leu His Ala 
465                 470                 475                 480 


Leu Glu Lys Phe Thr Glu Leu Lys Thr Ile Trp Ile Ala Leu Glu Ser 
                485                 490                 495     


<210>  15
<211>  1176
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  15
atgtctgctg ctgctgatag attaaactta acttccggcc acttgaatgc tggtagaaag       60

agaagttcct cttctgtttc tttgaaggct gccgaaaagc ctttcaaggt tactgtgatt      120

ggatctggta actggggtac tactattgcc aaggtggttg ccgaaaattg taagggatac      180

ccagaagttt tcgctccaat agtacaaatg tgggtgttcg aagaagagat caatggtgaa      240

aaattgactg aaatcataaa tactagacat caaaacgtga aatacttgcc tggcatcact      300

ctacccgaca atttggttgc taatccagac ttgattgatt cagtcaagga tgtcgacatc      360

atcgttttca acattccaca tcaatttttg ccccgtatct gtagccaatt gaaaggtcat      420

gttgattcac acgtcagagc tatctcctgt ctaaagggtt ttgaagttgg tgctaaaggt      480

gtccaattgc tatcctctta catcactgag gaactaggta ttcaatgtgg tgctctatct      540

ggtgctaaca ttgccaccga agtcgctcaa gaacactggt ctgaaacaac agttgcttac      600

cacattccaa aggatttcag aggcgagggc aaggacgtcg accataaggt tctaaaggcc      660

ttgttccaca gaccttactt ccacgttagt gtcatcgaag atgttgctgg tatctccatc      720

tgtggtgctt tgaagaacgt tgttgcctta ggttgtggtt tcgtcgaagg tctaggctgg      780

ggtaacaacg cttctgctgc catccaaaga gtcggtttgg gtgagatcat cagattcggt      840

caaatgtttt tcccagaatc tagagaagaa acatactacc aagagtctgc tggtgttgct      900

gatttgatca ccacctgcgc tggtggtaga aacgtcaagg ttgctaggct aatggctact      960

tctggtaagg acgcctggga atgtgaaaag gagttgttga atggccaatc cgctcaaggt     1020

ttaattacct gcaaagaagt tcacgaatgg ttggaaacat gtggctctgt cgaagacttc     1080

ccattatttg aagccgtata ccaaatcgtt tacaacaact acccaatgaa gaacctgccg     1140

gacatgattg aagaattaga tctacatgaa gattag                               1176


<210>  16
<211>  391
<212>  PRT
<213>  S. cerevisiae

<400>  16

Met Ser Ala Ala Ala Asp Arg Leu Asn Leu Thr Ser Gly His Leu Asn 
1               5                   10                  15      


Ala Gly Arg Lys Arg Ser Ser Ser Ser Val Ser Leu Lys Ala Ala Glu 
            20                  25                  30          


Lys Pro Phe Lys Val Thr Val Ile Gly Ser Gly Asn Trp Gly Thr Thr 
        35                  40                  45              


Ile Ala Lys Val Val Ala Glu Asn Cys Lys Gly Tyr Pro Glu Val Phe 
    50                  55                  60                  


Ala Pro Ile Val Gln Met Trp Val Phe Glu Glu Glu Ile Asn Gly Glu 
65                  70                  75                  80  


Lys Leu Thr Glu Ile Ile Asn Thr Arg His Gln Asn Val Lys Tyr Leu 
                85                  90                  95      


Pro Gly Ile Thr Leu Pro Asp Asn Leu Val Ala Asn Pro Asp Leu Ile 
            100                 105                 110         


Asp Ser Val Lys Asp Val Asp Ile Ile Val Phe Asn Ile Pro His Gln 
        115                 120                 125             


Phe Leu Pro Arg Ile Cys Ser Gln Leu Lys Gly His Val Asp Ser His 
    130                 135                 140                 


Val Arg Ala Ile Ser Cys Leu Lys Gly Phe Glu Val Gly Ala Lys Gly 
145                 150                 155                 160 


Val Gln Leu Leu Ser Ser Tyr Ile Thr Glu Glu Leu Gly Ile Gln Cys 
                165                 170                 175     


Gly Ala Leu Ser Gly Ala Asn Ile Ala Thr Glu Val Ala Gln Glu His 
            180                 185                 190         


Trp Ser Glu Thr Thr Val Ala Tyr His Ile Pro Lys Asp Phe Arg Gly 
        195                 200                 205             


Glu Gly Lys Asp Val Asp His Lys Val Leu Lys Ala Leu Phe His Arg 
    210                 215                 220                 


Pro Tyr Phe His Val Ser Val Ile Glu Asp Val Ala Gly Ile Ser Ile 
225                 230                 235                 240 


Cys Gly Ala Leu Lys Asn Val Val Ala Leu Gly Cys Gly Phe Val Glu 
                245                 250                 255     


Gly Leu Gly Trp Gly Asn Asn Ala Ser Ala Ala Ile Gln Arg Val Gly 
            260                 265                 270         


Leu Gly Glu Ile Ile Arg Phe Gly Gln Met Phe Phe Pro Glu Ser Arg 
        275                 280                 285             


Glu Glu Thr Tyr Tyr Gln Glu Ser Ala Gly Val Ala Asp Leu Ile Thr 
    290                 295                 300                 


Thr Cys Ala Gly Gly Arg Asn Val Lys Val Ala Arg Leu Met Ala Thr 
305                 310                 315                 320 


Ser Gly Lys Asp Ala Trp Glu Cys Glu Lys Glu Leu Leu Asn Gly Gln 
                325                 330                 335     


Ser Ala Gln Gly Leu Ile Thr Cys Lys Glu Val His Glu Trp Leu Glu 
            340                 345                 350         


Thr Cys Gly Ser Val Glu Asp Phe Pro Leu Phe Glu Ala Val Tyr Gln 
        355                 360                 365             


Ile Val Tyr Asn Asn Tyr Pro Met Lys Asn Leu Pro Asp Met Ile Glu 
    370                 375                 380                 


Glu Leu Asp Leu His Glu Asp 
385                 390     


<210>  17
<211>  753
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  17
atgggattga ctactaaacc tctatctttg aaagttaacg ccgctttgtt cgacgtcgac       60

ggtaccatta tcatctctca accagccatt gctgcattct ggagggattt cggtaaggac      120

aaaccttatt tcgatgctga acacgttatc caagtctcgc atggttggag aacgtttgat      180

gccattgcta agttcgctcc agactttgcc aatgaagagt atgttaacaa attagaagct      240

gaaattccgg tcaagtacgg tgaaaaatcc attgaagtcc caggtgcagt taagctgtgc      300

aacgctttga acgctctacc aaaagagaaa tgggctgtgg caacttccgg tacccgtgat      360

atggcacaaa aatggttcga gcatctggga atcaggagac caaagtactt cattaccgct      420

aatgatgtca aacagggtaa gcctcatcca gaaccatatc tgaagggcag gaatggctta      480

ggatatccga tcaatgagca agacccttcc aaatctaagg tagtagtatt tgaagacgct      540

ccagcaggta ttgccgccgg aaaagccgcc ggttgtaaga tcattggtat tgccactact      600

ttcgacttgg acttcctaaa ggaaaaaggc tgtgacatca ttgtcaaaaa ccacgaatcc      660

atcagagttg gcggctacaa tgccgaaaca gacgaagttg aattcatttt tgacgactac      720

ttatatgcta aggacgatct gttgaaatgg taa                                   753


<210>  18
<211>  250
<212>  PRT
<213>  S. cerevisiae

<400>  18

Met Gly Leu Thr Thr Lys Pro Leu Ser Leu Lys Val Asn Ala Ala Leu 
1               5                   10                  15      


Phe Asp Val Asp Gly Thr Ile Ile Ile Ser Gln Pro Ala Ile Ala Ala 
            20                  25                  30          


Phe Trp Arg Asp Phe Gly Lys Asp Lys Pro Tyr Phe Asp Ala Glu His 
        35                  40                  45              


Val Ile Gln Val Ser His Gly Trp Arg Thr Phe Asp Ala Ile Ala Lys 
    50                  55                  60                  


Phe Ala Pro Asp Phe Ala Asn Glu Glu Tyr Val Asn Lys Leu Glu Ala 
65                  70                  75                  80  


Glu Ile Pro Val Lys Tyr Gly Glu Lys Ser Ile Glu Val Pro Gly Ala 
                85                  90                  95      


Val Lys Leu Cys Asn Ala Leu Asn Ala Leu Pro Lys Glu Lys Trp Ala 
            100                 105                 110         


Val Ala Thr Ser Gly Thr Arg Asp Met Ala Gln Lys Trp Phe Glu His 
        115                 120                 125             


Leu Gly Ile Arg Arg Pro Lys Tyr Phe Ile Thr Ala Asn Asp Val Lys 
    130                 135                 140                 


Gln Gly Lys Pro His Pro Glu Pro Tyr Leu Lys Gly Arg Asn Gly Leu 
145                 150                 155                 160 


Gly Tyr Pro Ile Asn Glu Gln Asp Pro Ser Lys Ser Lys Val Val Val 
                165                 170                 175     


Phe Glu Asp Ala Pro Ala Gly Ile Ala Ala Gly Lys Ala Ala Gly Cys 
            180                 185                 190         


Lys Ile Ile Gly Ile Ala Thr Thr Phe Asp Leu Asp Phe Leu Lys Glu 
        195                 200                 205             


Lys Gly Cys Asp Ile Ile Val Lys Asn His Glu Ser Ile Arg Val Gly 
    210                 215                 220                 


Gly Tyr Asn Ala Glu Thr Asp Glu Val Glu Phe Ile Phe Asp Asp Tyr 
225                 230                 235                 240 


Leu Tyr Ala Lys Asp Asp Leu Leu Lys Trp 
                245                 250 


<210>  19
<211>  542
<212>  PRT
<213>  C. necator

<400>  19

Met Lys Val Ile Thr Ala Arg Glu Ala Ala Ala Leu Val Gln Asp Gly 
1               5                   10                  15      


Trp Thr Val Ala Ser Ala Gly Phe Val Gly Ala Gly His Ala Glu Ala 
            20                  25                  30          


Val Thr Glu Ala Leu Glu Gln Arg Phe Leu Gln Ser Gly Leu Pro Arg 
        35                  40                  45              


Asp Leu Thr Leu Val Tyr Ser Ala Gly Gln Gly Asp Arg Gly Ala Arg 
    50                  55                  60                  


Gly Val Asn His Phe Gly Asn Ala Gly Met Thr Ala Ser Ile Val Gly 
65                  70                  75                  80  


Gly His Trp Arg Ser Ala Thr Arg Leu Ala Thr Leu Ala Met Ala Glu 
                85                  90                  95      


Gln Cys Glu Gly Tyr Asn Leu Pro Gln Gly Val Leu Thr His Leu Tyr 
            100                 105                 110         


Arg Ala Ile Ala Gly Gly Lys Pro Gly Val Met Thr Lys Ile Gly Leu 
        115                 120                 125             


His Thr Phe Val Asp Pro Arg Thr Ala Gln Asp Ala Arg Tyr His Gly 
    130                 135                 140                 


Gly Ala Val Asn Glu Arg Ala Arg Gln Ala Ile Ala Glu Gly Lys Ala 
145                 150                 155                 160 


Cys Trp Val Asp Ala Val Asp Phe Arg Gly Asp Glu Tyr Leu Phe Tyr 
                165                 170                 175     


Pro Ser Phe Pro Ile His Cys Ala Leu Ile Arg Cys Thr Ala Ala Asp 
            180                 185                 190         


Ala Arg Gly Asn Leu Ser Thr His Arg Glu Ala Phe His His Glu Leu 
        195                 200                 205             


Leu Ala Met Ala Gln Ala Ala His Asn Ser Gly Gly Ile Val Ile Ala 
    210                 215                 220                 


Gln Val Glu Ser Leu Val Asp His His Glu Ile Leu Gln Ala Ile His 
225                 230                 235                 240 


Val Pro Gly Ile Leu Val Asp Tyr Val Val Val Cys Asp Asn Pro Ala 
                245                 250                 255     


Asn His Gln Met Thr Phe Ala Glu Ser Tyr Asn Pro Ala Tyr Val Thr 
            260                 265                 270         


Pro Trp Gln Gly Glu Ala Ala Val Ala Glu Ala Glu Ala Ala Pro Val 
        275                 280                 285             


Ala Ala Gly Pro Leu Asp Ala Arg Thr Ile Val Gln Arg Arg Ala Val 
    290                 295                 300                 


Met Glu Leu Ala Arg Arg Ala Pro Arg Val Val Asn Leu Gly Val Gly 
305                 310                 315                 320 


Met Pro Ala Ala Val Gly Met Leu Ala His Gln Ala Gly Leu Asp Gly 
                325                 330                 335     


Phe Thr Leu Thr Val Glu Ala Gly Pro Ile Gly Gly Thr Pro Ala Asp 
            340                 345                 350         


Gly Leu Ser Phe Gly Ala Ser Ala Tyr Pro Glu Ala Val Val Asp Gln 
        355                 360                 365             


Pro Ala Gln Phe Asp Phe Tyr Glu Gly Gly Gly Ile Asp Leu Ala Ile 
    370                 375                 380                 


Leu Gly Leu Ala Glu Leu Asp Gly His Gly Asn Val Asn Val Ser Lys 
385                 390                 395                 400 


Phe Gly Glu Gly Glu Gly Ala Ser Ile Ala Gly Val Gly Gly Phe Ile 
                405                 410                 415     


Asn Ile Thr Gln Ser Ala Arg Ala Val Val Phe Met Gly Thr Leu Thr 
            420                 425                 430         


Ala Gly Gly Leu Glu Val Arg Ala Gly Asp Gly Gly Leu Gln Ile Val 
        435                 440                 445             


Arg Glu Gly Arg Val Lys Lys Ile Val Pro Glu Val Ser His Leu Ser 
    450                 455                 460                 


Phe Asn Gly Pro Tyr Val Ala Ser Leu Gly Ile Pro Val Leu Tyr Ile 
465                 470                 475                 480 


Thr Glu Arg Ala Val Phe Glu Met Arg Ala Gly Ala Asp Gly Glu Ala 
                485                 490                 495     


Arg Leu Thr Leu Val Glu Ile Ala Pro Gly Val Asp Leu Gln Arg Asp 
            500                 505                 510         


Val Leu Asp Gln Cys Ser Thr Pro Ile Ala Val Ala Gln Asp Leu Arg 
        515                 520                 525             


Glu Met Asp Ala Arg Leu Phe Gln Ala Gly Pro Leu His Leu 
    530                 535                 540         


<210>  20
<211>  1628
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  20
atgaaggtga tcaccgcacg cgaagcggcg gcactggtgc aggacggctg gaccgtggcc       60

agcgcgggct tgtcggcgcc ggccatgccg aggccgtgac cgaggcgctg gagcagcgct      120

tcctgcagag cgggctgccg cgcgacctga cgctggtgta ctcggccggg cagggcgacc      180

gcggcgcgcg cggcgtgaac cacttcggca atgccggcat gaccgccagc atcgtcggcg      240

gccactggcg ctcggccacg cggctggcca cgctggccat ggccgagcag tgcgagggct      300

acaacctgcc gcagggcgtg ctgacgcacc tataccgcgc catcgccggc ggcaagcccg      360

gcgtgatgac caagatcggc ctgcacacct tcgtcgaccc gcgcaccgcg caggatgcgc      420

gctaccacgg cggcgccgtc aacgagcgcg cgcgccaggc cattgccgag ggcaaggcat      480

gctgggtcga tgcggtcgac ttccgcggcg acgaatacct gttctacccg agcttcccga      540

tccactgcgc gctgatccgc tgcaccgcgg ccgacgcccg cggcaacctc agcacccatc      600

gcgaagcctt ccaccatgag ctgctggcga tggcgcaggc ggcccacaac tcgggcggca      660

tcgtgatcgc gcaggtggaa agcctggtcg accaccacga gatcctgcag gccatccacg      720

tgcccggcat cctggtcgac tacgtggtgg tctgcgacaa ccccgccaac caccagatga      780

cgtttgccga gtcctacaac ccggcctacg tgacgccatg gcaaggcgag gcagcggtgg      840

ccgaagcgga agcggcgccg gtggctgccg gcccgctcga cgcgcgcacc atcgtgcagc      900

gccgtgcggt gatggaactg gcgcgccgtg cgccgcgcgt ggtcaacctg ggcgtgggca      960

tgccggcagc ggtcggcatg ctggcgcacc aggccgggct ggacggcttc acgctgaccg     1020

tcgaggccgg ccccatcggc ggcacgcccg cggatggcct cagcttcggt gcctcggcct     1080

acccggaggc ggtggtggat cagcccgcgc agttcgattt ctacgagggc ggcggcatcg     1140

acctggccat cctcggcctg gccgagctgg atggccacgg caacgtcaat gtcagcaagt     1200

tcggcgaagg cgagggcgca tcgattgccg gcgtcggcgg ctttatcaac atcacgcaga     1260

gcgcgcgcgc ggtggtgttc atgggcacgc tgacggcggg cgggctggaa gtccgcgccg     1320

gcgacggcgg cctgcagatc gtgcgcgaag gccgcgtgaa gaagatcgtg cctgaggtgt     1380

cgcacctgag cttcaacggg ccctatgtgg cgtcgctcgg catcccggtg ctgtacatca     1440

ccgagcgcgc ggtgttcgag atgcgcgctg gcgcagacgg cgaagcccgc ctcacgctgg     1500

tcgagatcgc ccccggcgtg gacctgcagc gcgacgtgct cgaccagtgc tcgacgccca     1560

tcgccgttgc gcaggacctg cgcgaaatgg atgcgcggct gttccaggcc gggcccctgc     1620

acctgtaa                                                              1628


<210>  21
<211>  630
<212>  PRT
<213>  C. necator

<400>  21

Met Thr Ala Ser His Ala Val His Ala Arg Ser Leu Ala Asp Pro Glu 
1               5                   10                  15      


Gly Phe Trp Ala Glu Gln Ala Ala Arg Ile Asp Trp Glu Thr Pro Phe 
            20                  25                  30          


Gly Gln Val Leu Asp Asn Ser Arg Ala Pro Phe Thr Arg Trp Phe Val 
        35                  40                  45              


Gly Gly Arg Thr Asn Leu Cys His Asn Ala Val Asp Arg His Leu Ala 
    50                  55                  60                  


Ala Arg Ala Ser Gln Pro Ala Leu His Trp Val Ser Thr Glu Thr Asp 
65                  70                  75                  80  


Gln Ala Arg Thr Phe Thr Tyr Ala Glu Leu His Asp Glu Val Ser Arg 
                85                  90                  95      


Met Ala Ala Ile Leu Gln Gly Leu Asp Val Gln Lys Gly Asp Arg Val 
            100                 105                 110         


Leu Ile Tyr Met Pro Met Ile Pro Glu Ala Ala Phe Ala Met Leu Ala 
        115                 120                 125             


Cys Ala Arg Ile Gly Ala Ile His Ser Val Val Phe Gly Gly Phe Ala 
    130                 135                 140                 


Ser Val Ser Leu Ala Ala Arg Ile Glu Asp Ala Arg Pro Arg Val Val 
145                 150                 155                 160 


Val Ser Ala Asp Ala Gly Ser Arg Ala Gly Lys Val Val Pro Tyr Lys 
                165                 170                 175     


Pro Leu Leu Asp Glu Ala Ile Arg Leu Ser Ser His Gln Pro Gly Lys 
            180                 185                 190         


Val Leu Leu Val Asp Arg Gln Leu Ala Gln Met Pro Arg Thr Glu Gly 
        195                 200                 205             


Arg Asp Glu Asp Tyr Ala Ala Trp Arg Glu Arg Val Ala Gly Val Gln 
    210                 215                 220                 


Val Pro Cys Val Trp Leu Glu Ser Ser Glu Pro Ser Tyr Val Leu Tyr 
225                 230                 235                 240 


Thr Ser Gly Thr Thr Gly Lys Pro Lys Gly Val Gln Arg Asp Thr Gly 
                245                 250                 255     


Gly Tyr Ala Val Ala Leu Ala Thr Ser Met Glu Tyr Ile Phe Cys Gly 
            260                 265                 270         


Lys Pro Gly Asp Thr Met Phe Thr Ala Ser Asp Ile Gly Trp Val Val 
        275                 280                 285             


Gly His Ser Tyr Ile Val Tyr Gly Pro Leu Leu Ala Gly Met Ala Thr 
    290                 295                 300                 


Leu Met Tyr Glu Gly Thr Pro Ile Arg Pro Asp Gly Gly Ile Leu Trp 
305                 310                 315                 320 


Arg Leu Val Glu Gln Tyr Lys Val Asn Leu Met Phe Ser Ala Pro Thr 
                325                 330                 335     


Ala Ile Arg Val Leu Lys Lys Gln Asp Pro Ala Trp Leu Thr Arg Tyr 
            340                 345                 350         


Asp Leu Ser Ser Leu Arg Leu Leu Phe Leu Ala Gly Glu Pro Leu Asp 
        355                 360                 365             


Glu Pro Thr Ala Arg Trp Ile Gln Asp Gly Leu Gly Lys Pro Val Val 
    370                 375                 380                 


Asp Asn Tyr Trp Gln Thr Glu Ser Gly Trp Pro Ile Leu Ala Ile Gln 
385                 390                 395                 400 


Arg Gly Ile Glu Ala Leu Pro Pro Lys Leu Gly Ser Pro Gly Val Pro 
                405                 410                 415     


Ala Tyr Gly Tyr Asp Leu Lys Ile Val Asp Glu Asn Thr Gly Ala Glu 
            420                 425                 430         


Cys Pro Pro Gly Gln Lys Gly Val Val Ala Ile Asp Gly Pro Leu Pro 
        435                 440                 445             


Pro Gly Cys Met Ser Thr Val Trp Gly Asp Asp Asp Arg Phe Val Arg 
    450                 455                 460                 


Thr Tyr Trp Gln Ala Val Pro Asn Arg Leu Cys Tyr Ser Thr Phe Asp 
465                 470                 475                 480 


Trp Gly Val Arg Asp Ala Asp Gly Tyr Val Phe Ile Leu Gly Arg Thr 
                485                 490                 495     


Asp Asp Val Ile Asn Val Ala Gly His Arg Leu Gly Thr Arg Glu Ile 
            500                 505                 510         


Glu Glu Ser Leu Ser Ser Asn Ala Ala Val Ala Glu Val Ala Val Val 
        515                 520                 525             


Gly Val Gln Asp Ala Leu Lys Gly Gln Val Ala Met Ala Phe Cys Ile 
    530                 535                 540                 


Ala Arg Asp Pro Ala Arg Thr Ala Thr Ala Glu Ala Arg Leu Ala Leu 
545                 550                 555                 560 


Glu Gly Glu Leu Met Lys Thr Val Glu Gln Gln Leu Gly Ala Val Ala 
                565                 570                 575     


Arg Pro Ala Arg Val Phe Phe Val Asn Ala Leu Pro Lys Thr Arg Ser 
            580                 585                 590         


Gly Lys Leu Leu Arg Arg Ala Met Gln Ala Val Ala Glu Gly Arg Asp 
        595                 600                 605             


Pro Gly Asp Leu Thr Thr Ile Glu Asp Pro Gly Ala Leu Glu Gln Leu 
    610                 615                 620                 


Gln Ala Ala Leu Lys Gly 
625                 630 


<210>  22
<211>  1893
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  22
atgacggcaa gccatgccgt gcatgcccgt tcgctggccg accccgaggg gttctgggcc       60

gaacaggcgg cgcgcatcga ctgggaaacc ccgttcggcc aggtgctcga caacagccgc      120

gcgcccttta cgcgctggtt cgtcggcggg cgcaccaacc tgtgccacaa cgcggtcgac      180

cgccacctgg cggcccgcgc cagccagccg gcgctgcact gggtctcgac cgagaccgac      240

caggcccgca cctttaccta cgccgagctg cacgacgaag tcagccgcat ggccgcgatc      300

ctgcagggcc tggacgtgca gaagggcgac cgcgtgctga tctacatgcc gatgatcccg      360

gaagccgcct ttgccatgct ggcctgcgcg cgcatcggcg cgatccattc ggtggtgttc      420

ggcggctttg cctcggtcag cctggccgcg cgcatcgagg atgcccggcc gcgcgtggtg      480

gtcagcgccg acgccggctc gcgtgccggc aaggtggtgc cctacaagcc gctgctggac      540

gaggccatcc ggctctcgtc gcaccagccc gggaaggtgc tgctggtgga ccggcaactg      600

gcgcaaatgc cccgtaccga gggccgcgat gaggactacg ccgcctggcg cgaacgcgtg      660

gccggcgtgc aggtgccgtg cgtgtggctg gaatcgagcg agccgtcgta cgtgctatac      720

acctccggca ccaccggcaa gcccaagggc gtgcagcgcg ataccggcgg ctacgcggtg      780

gcgctggcca cctcgatgga atacatcttc tgcggcaagc ccggcgacac catgttcacc      840

gcgtcggaca tcggctgggt ggtggggcac agctatatcg tctacggccc gctgctggcc      900

ggcatggcca cgctgatgta tgaaggcacg ccgatccgcc ccgacggtgg catcctgtgg      960

cggctggtgg agcaatacaa ggtcaacctg atgttcagcg cgccgaccgc gatccgcgtg     1020

ctgaagaagc aggacccggc ctggctgacc cgctacgacc tgtccagcct gcgcctgctg     1080

ttcctggccg gcgagccgct ggacgagccc accgcgcgct ggatccagga cggcctgggc     1140

aagcccgtgg tcgacaacta ctggcagacc gaatccggct ggccgatcct cgcgatccag     1200

cgcggcatcg aggcgctgcc gcccaagctg ggctcgcccg gcgtgcccgc ctacggctat     1260

gacctgaaga tcgtcgacga gaacaccggc gctgaatgcc cgccggggca gaagggtgtg     1320

gtcgccatcg acggcccgct gccgccggga tgcatgagca cggtctgggg cgacgacgac     1380

cgcttcgtgc gcacctactg gcaggcggtg ccgaaccggc tgtgctattc gaccttcgac     1440

tggggcgtgc gcgacgccga cggctatgtt tttatcctgg gccgcaccga cgacgtgatc     1500

aacgttgccg gccaccggct gggcacccgc gagatcgagg aaagcctgtc gtccaacgct     1560

gccgtggccg aggtggcggt ggtgggcgtg caggacgcgc tcaaggggca ggtggcgatg     1620

gccttctgca tcgcccgcga tccggcgcgc acggccacgg ccgaagcgcg gctggcattg     1680

gagggcgagt tgatgaagac ggtggagcag caactgggtg ccgtggcgcg gccggcgcgc     1740

gtattctttg tcaatgcact gcccaagacc cgctccggca agttgctgcg gcgcgccatg     1800

caggcggtgg ccgaagggcg cgatccgggc gacctgacca cgatcgagga cccgggtgcg     1860

ctggaacagt tgcaggcagc gctgaaaggc tag                                  1893


<210>  23
<211>  576
<212>  PRT
<213>  C. necator

<400>  23

Met Ala Ala Ala Ala Leu Pro Ala Ser Arg Arg Asp Asp Tyr Arg Ala 
1               5                   10                  15      


Leu Tyr Glu Ser Phe Arg Trp Glu Ile Pro Pro His Phe Asn Ile Ala 
            20                  25                  30          


Glu Ala Cys Cys Gly Arg Trp Ala Arg Asp Pro Ala Thr Met Asp Arg 
        35                  40                  45              


Ile Ala Val Tyr Thr Glu His Glu Asp Gly Arg Arg Asn Ala His Thr 
    50                  55                  60                  


Phe Ala His Ile Gln Ala Glu Ala Asn Arg Leu Ser Ala Ala Leu Arg 
65                  70                  75                  80  


Ala Leu Gly Val Ala Arg Gly Asp Arg Val Ala Ile Val Met Pro Gln 
                85                  90                  95      


Arg Ile Glu Thr Val Ile Ala His Met Ala Ile Tyr Gln Leu Gly Ala 
            100                 105                 110         


Ile Ala Met Pro Leu Ser Met Leu Phe Gly Pro Glu Ala Leu Ala Tyr 
        115                 120                 125             


Arg Ile Ala His Ser Glu Ala Asn Val Ala Ile Ala Asp Glu Thr Ser 
    130                 135                 140                 


Ile Asp Asn Val Leu Ala Ala Arg Pro Glu Cys Pro Thr Leu Ala Thr 
145                 150                 155                 160 


Val Ile Ala Ala Gly Gly Ala His Gly Arg Gly Asp His Asp Trp Asp 
                165                 170                 175     


Val Leu Leu Ala Ala Gln Leu Pro Thr Phe Val Ala Glu Gln Thr Lys 
            180                 185                 190         


Ala Asp Glu Ala Ala Val Leu Ile Tyr Thr Ser Gly Thr Thr Gly Pro 
        195                 200                 205             


Pro Lys Gly Ala Leu Ile Pro His Arg Ala Leu Ile Gly Asn Leu Thr 
    210                 215                 220                 


Gly Phe Val Cys Ser Gln Asn Trp Tyr Pro Gln Asp Asp Asp Val Phe 
225                 230                 235                 240 


Trp Ser Pro Ala Asp Trp Ala Trp Thr Gly Gly Leu Trp Asp Ala Leu 
                245                 250                 255     


Met Pro Ala Leu Tyr Phe Gly Lys Pro Ile Val Gly Tyr Gln Gly Arg 
            260                 265                 270         


Phe Ser Ala Glu Arg Ala Phe Glu Leu Leu Glu Arg Tyr Ala Val Thr 
        275                 280                 285             


Asn Thr Phe Leu Phe Pro Thr Ala Leu Lys Gln Met Met Lys Ala Cys 
    290                 295                 300                 


Pro Glu Pro Arg Gln Arg Tyr Asp Ile Arg Leu Arg Ala Leu Met Ser 
305                 310                 315                 320 


Ala Gly Glu Ala Val Gly Glu Thr Val Phe Gly Trp Cys Arg Asp Ala 
                325                 330                 335     


Leu Gly Val Ile Val Asn Glu Met Phe Gly Gln Thr Glu Ile Asn Tyr 
            340                 345                 350         


Ile Val Gly Asn Cys Thr Ala Gln Asn Asp Asp Lys Gln Leu Gly Trp 
        355                 360                 365             


Pro Ala Arg Pro Gly Ser Met Gly Arg Pro Tyr Pro Gly His Arg Val 
    370                 375                 380                 


Gln Val Ile Asp Asp Glu Gly Gln Pro Cys Ala Pro Gly Glu Asp Gly 
385                 390                 395                 400 


Glu Val Ala Val Cys Ala Thr Asp Ser Ala Gly His Pro Asp Pro Val 
                405                 410                 415     


Phe Phe Leu Gly Tyr Trp Lys Asn Glu Ala Ala Thr Ala Gly Lys Tyr 
            420                 425                 430         


Ala Glu Arg Asp Gly Leu Arg Trp Cys Arg Thr Gly Asp Leu Ala Arg 
        435                 440                 445             


Val Asp Ala Asp Gly Tyr Leu Trp Tyr Gln Gly Arg Ala Asp Asp Val 
    450                 455                 460                 


Phe Lys Ser Ser Gly Tyr Arg Ile Gly Pro Ser Glu Ile Glu Asn Cys 
465                 470                 475                 480 


Leu Leu Lys His Pro Ala Val Ser Asn Cys Ala Val Val Pro Ser Pro 
                485                 490                 495     


Asp Pro Glu Arg Gly Ala Val Val Lys Ala Phe Val Val Leu Thr Pro 
            500                 505                 510         


Ser Val Ala Arg Ser Phe Asp Gly Asp Ala Ala Leu Val Thr Glu Leu 
        515                 520                 525             


Gln Ala His Val Arg Gly Gln Leu Ala Pro Tyr Glu Tyr Pro Lys Ala 
    530                 535                 540                 


Ile Glu Phe Ile Asp Gln Leu Pro Met Thr Thr Thr Gly Lys Ile Gln 
545                 550                 555                 560 


Arg Arg Val Leu Arg Leu Leu Glu Glu Ala Arg Ala Gly Lys Arg Ala 
                565                 570                 575     


<210>  24
<211>  1731
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  24
atggccgcag ctgcgttgcc ggcaagccgg cgcgacgact atcgcgccct gtatgaatcc       60

ttccgctggg aaatcccccc gcatttcaat atcgccgagg cctgctgcgg gcgctgggcg      120

cgcgacccgg ccacgatgga ccgcatcgcg gtctataccg agcatgagga cggccgccgc      180

aacgcgcata cctttgccca tatccaggcc gaagccaacc gcctgtcggc ggcgctgcgc      240

gcactgggcg tggcgcgcgg cgaccgcgtg gcaatcgtga tgccgcagcg gatcgagacc      300

gtgatcgcgc atatggcgat ctaccagctc ggcgccatcg ccatgccgct gtcgatgctg      360

ttcgggcccg aggcgctggc ctaccgtatc gcacacagcg aagccaatgt ggcgatcgcg      420

gacgagactt ccatcgacaa tgtgctggcc gcgcgcccgg aatgcccgac gctggccacc      480

gtgattgccg ccggcggcgc gcatggccgc ggcgaccacg actgggacgt gctgctggcc      540

gcgcagctgc cgacttttgt cgccgagcag accaaggccg acgaggccgc ggtgctgatc      600

tacaccagcg gcaccaccgg cccgcccaag ggcgcgctga tcccgcaccg cgcgctgatc      660

ggcaacctga ccggctttgt ctgctcgcag aactggtatc cgcaggacga cgacgtgttc      720

tggagcccgg ccgactgggc ctggaccggc ggcctgtggg atgcgctgat gccggcgctg      780

tatttcggca agcccatcgt cggctaccag ggccgcttct ccgccgagcg cgccttcgag      840

ctgctggagc gctacgccgt caccaacacc ttcctgttcc cgaccgcgct caagcagatg      900

atgaaggcct gccccgagcc gcggcagcgc tacgacatca ggctgcgtgc gctgatgagc      960

gccggcgagg ccgtgggcga gaccgtgttc ggctggtgcc gcgatgcgct gggcgtgatc     1020

gtcaacgaga tgttcggcca gaccgagatc aactacatcg tcggcaactg caccgcgcag     1080

aacgacgaca agcagctggg ctggccggca cgaccgggct cgatggggcg tccctatccg     1140

ggccaccgcg tgcaggtgat cgacgacgaa ggccagccct gcgcgccggg cgaggacggc     1200

gaggtcgcgg tatgcgccac cgacagcgcc gggcatccgg acccggtgtt cttcctcggc     1260

tactggaaga acgaagccgc caccgcgggc aagtacgccg agcgcgacgg cctgcgctgg     1320

tgccgcaccg gcgacctggc gcgcgtcgat gccgatggct acctgtggta ccaggggcgt     1380

gccgacgatg tgttcaagtc ctcgggctac cgcatcgggc cgagcgagat cgagaactgc     1440

ctgctcaagc atccggcggt gtccaactgc gccgtggtgc cctcgcccga ccccgagcgc     1500

ggcgccgtgg tcaaggcctt cgtggtgctg acaccgtcgg tggcgcgctc gttcgacggc     1560

gacgcggcgc tggtcacgga gctgcaggcg catgtgcgcg gccagctggc gccgtatgaa     1620

tacccgaagg cgatcgaatt catcgaccag ctgccgatga ccaccaccgg caagatccag     1680

cggcgcgtgc tgcgcttgct ggaggaagcg cgcgcgggca agcgcgccta g              1731


<210>  25
<211>  685
<212>  PRT
<213>  C. necator

<400>  25

Met Ser Glu Gly Lys Ala Pro Arg His Ala Ala Gln Gln Glu Leu Ala 
1               5                   10                  15      


Asp Val Ser Glu Ala Glu Ile Ala Val His Trp Pro Glu Glu Asp Tyr 
            20                  25                  30          


Val Pro Pro Ala Gly Gln Phe Ile Ala Gln Ala Asn Leu Thr Asp Pro 
        35                  40                  45              


His Ile Phe Glu Arg Phe Ser Leu Glu Arg Phe Pro Glu Cys Phe Lys 
    50                  55                  60                  


Glu Phe Ala Asp Leu Leu Asp Trp Tyr Lys Tyr Trp Glu Thr Thr Leu 
65                  70                  75                  80  


Asp Thr Ser Asn Pro Pro Phe Trp Arg Trp Phe Val Gly Gly Arg Ile 
                85                  90                  95      


Asn Ala Cys His Asn Cys Val Asp Arg His Leu Ala Ala Tyr Arg Asn 
            100                 105                 110         


Lys Thr Ala Ile His Phe Val Pro Glu Pro Glu Asp Glu Ala Val His 
        115                 120                 125             


His Leu Thr Tyr Gln Glu Leu Phe Val Arg Val Asn Glu Leu Ala Ala 
    130                 135                 140                 


Leu Leu Arg Glu Phe Cys Gly Leu Lys Ala Gly Asp Arg Val Thr Leu 
145                 150                 155                 160 


His Met Pro Met Val Ala Glu Leu Pro Ile Thr Met Leu Ala Cys Ala 
                165                 170                 175     


Arg Ile Gly Val Ile His Ser Gln Val Phe Ser Gly Phe Ser Gly Lys 
            180                 185                 190         


Ala Cys Ala Glu Arg Ile Ala Asp Ser Glu Ser Arg Leu Leu Ile Thr 
        195                 200                 205             


Met Asp Ala Tyr His Arg Gly Gly Glu Leu Leu Asp His Lys Glu Lys 
    210                 215                 220                 


Ala Asp Ile Ala Val Ala Glu Ala Ala Ser Ala Gly Gln Gln Val Glu 
225                 230                 235                 240 


Lys Val Leu Ile Trp Gln Arg Tyr Pro Gly Lys Tyr Ser Ser Ala Ala 
                245                 250                 255     


Leu Leu Val Lys Gly Arg Asp Val Ile Leu Asn Asp Val Leu Ala Gly 
            260                 265                 270         


Phe Arg Gly Arg Arg Val Glu Pro Glu Pro Met Pro Ala Glu Ala Pro 
        275                 280                 285             


Leu Phe Leu Met Tyr Thr Ser Gly Thr Thr Gly Arg Pro Lys Gly Cys 
    290                 295                 300                 


Gln His Ser Thr Gly Gly Tyr Leu Ser Tyr Val Ala Trp Thr Ser Lys 
305                 310                 315                 320 


Tyr Ile Gln Asp Ile His Pro Glu Asp Val Tyr Trp Cys Met Ala Asp 
                325                 330                 335     


Ile Gly Trp Ile Thr Gly His Ser Tyr Ile Val Tyr Gly Pro Leu Ala 
            340                 345                 350         


Leu Ala Ala Ser Ser Val Val Tyr Glu Gly Val Pro Thr Trp Pro Asp 
        355                 360                 365             


Ala Gly Arg Pro Trp Arg Ile Ala Glu Ser Leu Gly Val Asn Ile Phe 
    370                 375                 380                 


His Thr Ser Pro Thr Ala Ile Arg Ala Leu Arg Arg Asn Gly Pro Asp 
385                 390                 395                 400 


Glu Pro Ala Lys Tyr Asp Cys His Phe Lys His Met Thr Thr Val Gly 
                405                 410                 415     


Glu Pro Ile Glu Pro Glu Val Trp Lys Trp Tyr His Arg Glu Val Gly 
            420                 425                 430         


Lys Gly Glu Ala Val Ile Val Asp Thr Trp Trp Gln Thr Glu Asn Gly 
        435                 440                 445             


Gly Phe Leu Cys Ser Thr Leu Pro Gly Ile His Pro Met Lys Pro Gly 
    450                 455                 460                 


Ser Thr Gly Pro Gly Ile Pro Gly Ile His Pro Val Ile Phe Asp Glu 
465                 470                 475                 480 


Glu Gly Asn Glu Val Pro Ala Gly Ser Gly Lys Ala Gly Asn Ile Cys 
                485                 490                 495     


Ile Arg Asn Pro Trp Pro Gly Ile Phe Gln Thr Val Trp Lys Asp Pro 
            500                 505                 510         


Asp Arg Tyr Val Arg Gln Tyr Tyr Ala Arg Tyr Cys Lys Asn Pro Asp 
        515                 520                 525             


Ser Lys Asp Trp His Asp Trp Pro Tyr Met Ala Gly Asp Gly Ala Met 
    530                 535                 540                 


Gln Ala Ala Asp Gly Tyr Phe Arg Ile Leu Gly Arg Ile Asp Asp Val 
545                 550                 555                 560 


Ile Asn Val Ser Gly His Arg Leu Gly Thr Lys Glu Ile Glu Ser Ala 
                565                 570                 575     


Ala Leu Leu Val Pro Asp Val Ala Glu Ala Ala Val Val Pro Val Ala 
            580                 585                 590         


Asp Glu Val Lys Gly Lys Val Pro Asp Leu Tyr Val Ser Leu Lys Pro 
        595                 600                 605             


Gly Leu Ser Pro Ser Ile Lys Ile Ala Asn Lys Val Ser Ala Ala Val 
    610                 615                 620                 


Val Ser Gln Ile Gly Ala Ile Ala Arg Pro His Arg Val Val Ile Val 
625                 630                 635                 640 


Pro Asp Met Pro Lys Thr Arg Ser Gly Lys Ile Met Arg Arg Val Leu 
                645                 650                 655     


Ala Ala Ile Ser Asn His Gln Glu Pro Gly Asp Val Ser Thr Leu Ala 
            660                 665                 670         


Asn Pro Glu Val Val Glu Lys Ile Arg Glu Leu Ala Thr 
        675                 680                 685 


<210>  26
<211>  2058
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  26
atgtctgaag gcaaagcgcc acgccatgct gcccagcagg aattggccga tgtgtccgag       60

gccgaaatcg cggtccattg gcccgaggag gactatgtcc cgccggccgg ccagttcatt      120

gcgcaggcca atctgaccga tccccatatt ttcgagcgct tctccctcga acgtttcccc      180

gagtgcttca aggagttcgc agacctgctg gactggtaca aatactggga aacgaccctg      240

gataccagca acccgccttt ctggcgctgg ttcgtcggcg gcaggatcaa cgcctgccac      300

aattgcgtgg atcgccacct cgctgcatac aggaacaaga ccgcgattca tttcgtgccc      360

gagccggagg atgaggcggt gcatcacctc acctaccagg agctcttcgt tcgcgtcaat      420

gagctggccg ccctgctgcg cgagttctgc ggcctgaagg ccggcgaccg cgtcacgctg      480

catatgccga tggtggccga actgcccatc accatgctcg cctgcgcccg catcggcgtg      540

attcattcgc aggtattcag cggcttcagc ggcaaggcct gcgccgagcg catcgcggac      600

tccgagagcc ggctgctgat caccatggac gcctatcacc gcggcggtga attgctcgat      660

cacaaggaaa aggccgacat cgccgtggca gaagccgcca gcgccggtca gcaggtcgag      720

aaggtcctga tctggcagcg ctacccgggc aagtattcca gtgccgccct actggtgaag      780

ggccgcgatg tcattctcaa tgacgtgctc gccgggttcc gcggcaggcg tgtcgagccc      840

gagccgatgc cggcggaggc gccgctgttc ctgatgtaca cgagcggcac cacgggccgg      900

cccaagggct gccagcattc cactggcggc tatctgtcct atgtggcgtg gacctctaag      960

tacatccagg atatccaccc cgaggacgtc tactggtgca tggccgatat tggctggatc     1020

accgggcatt cctacatcgt ctatggcccg ctcgcgctcg ccgcttcgtc tgtcgtctat     1080

gaaggcgtgc cgacctggcc cgacgccggc cggccctggc gtattgcgga aagccttggc     1140

gtcaatatct tccacacctc gcccaccgca atccgcgcgc tgcggcgcaa cgggcccgac     1200

gagccggcga agtacgactg ccatttcaag cacatgacca cggtgggcga gccgatcgag     1260

cccgaagtct ggaagtggta ccaccgtgaa gtcggcaaag gcgaggcggt gatcgtggac     1320

acctggtggc aaaccgagaa tggcggcttc ctctgcagca cgctgccggg catccacccg     1380

atgaagcccg gcagcactgg cccgggaatc ccgggcattc atccggtgat ctttgacgag     1440

gaaggcaatg aggtcccggc cggctcgggc aaggcgggca acatctgcat ccgcaatccc     1500

tggccgggca tattccagac cgtctggaag gatccggacc gctacgtgcg ccagtactat     1560

gcgcgctatt gcaagaatcc cgacagcaag gactggcacg actggccgta tatggcgggc     1620

gatggcgcaa tgcaggcggc ggacggctac tttcgcatcc ttggccgcat cgacgacgtg     1680

atcaatgttt ccggccatcg cctcggcacc aaggagatcg aatccgcagc actgctggtg     1740

ccggacgtcg ccgaggcggc ggtggtgccg gtggccgacg aggtcaaggg caaggtgcct     1800

gatctctatg tatcgctcaa gccgggactg tcgccctcca tcaagatcgc gaacaaggtc     1860

tcggccgcgg tggtatccca gattggcgcg attgcgcgtc cgcatcgggt cgtgatcgtc     1920

cccgacatgc ccaagacacg ctcgggcaag atcatgcgcc gcgtgctggc ggcgatctcc     1980

aaccaccagg agcctggcga cgtatccacg cttgccaatc cggaggtcgt cgagaagatc     2040

agggagctgg cgacatag                                                   2058


<210>  27
<211>  660
<212>  PRT
<213>  C. necator

<400>  27

Met Ser Ala Ile Glu Ser Val Met Gln Glu His Arg Val Phe Asn Pro 
1               5                   10                  15      


Pro Glu Gly Phe Ala Ser Gln Ala Ala Ile Pro Ser Met Glu Ala Tyr 
            20                  25                  30          


Gln Ala Leu Cys Asp Glu Ala Glu Arg Asp Tyr Glu Gly Phe Trp Ala 
        35                  40                  45              


Arg His Ala Arg Glu Leu Leu His Trp Thr Lys Pro Phe Thr Lys Val 
    50                  55                  60                  


Leu Asp Gln Ser Asn Ala Pro Phe Tyr Lys Trp Phe Glu Asp Gly Glu 
65                  70                  75                  80  


Leu Asn Ala Ser Tyr Asn Cys Leu Asp Arg Asn Leu Gln Asn Gly Asn 
                85                  90                  95      


Ala Asp Lys Val Ala Ile Val Phe Glu Ala Asp Asp Gly Ser Val Thr 
            100                 105                 110         


Arg Val Thr Tyr Arg Glu Leu His Gly Lys Val Cys Arg Phe Ala Asn 
        115                 120                 125             


Gly Leu Lys Ala Leu Gly Ile Arg Lys Gly Asp Arg Val Val Ile Tyr 
    130                 135                 140                 


Met Pro Met Ser Val Glu Gly Val Val Ala Met Gln Ala Cys Ala Arg 
145                 150                 155                 160 


Leu Gly Ala Thr His Ser Val Val Phe Gly Gly Phe Ser Ala Lys Ser 
                165                 170                 175     


Leu Gln Glu Arg Leu Val Asp Val Gly Ala Val Ala Leu Ile Thr Ala 
            180                 185                 190         


Asp Glu Gln Met Arg Gly Gly Lys Ala Leu Pro Leu Lys Ala Ile Ala 
        195                 200                 205             


Asp Asp Ala Leu Ala Leu Gly Gly Cys Glu Ala Val Arg Asn Val Ile 
    210                 215                 220                 


Val Tyr Arg Arg Thr Gly Gly Lys Val Ala Trp Thr Glu Gly Arg Asp 
225                 230                 235                 240 


Arg Trp Met Glu Asp Val Ser Ala Gly Gln Pro Asp Thr Cys Glu Ala 
                245                 250                 255     


Glu Pro Val Ser Ala Glu His Pro Leu Phe Val Leu Tyr Thr Ser Gly 
            260                 265                 270         


Ser Thr Gly Lys Pro Lys Gly Val Gln His Ser Thr Gly Gly Tyr Leu 
        275                 280                 285             


Leu Trp Ala Leu Met Thr Met Lys Trp Thr Phe Asp Ile Lys Pro Asp 
    290                 295                 300                 


Asp Leu Phe Trp Cys Thr Ala Asp Ile Gly Trp Val Thr Gly His Thr 
305                 310                 315                 320 


Tyr Ile Ala Tyr Gly Pro Leu Ala Ala Gly Ala Thr Gln Val Val Phe 
                325                 330                 335     


Glu Gly Val Pro Thr Tyr Pro Asn Ala Gly Arg Phe Trp Asp Met Ile 
            340                 345                 350         


Ala Arg His Lys Val Ser Ile Phe Tyr Thr Ala Pro Thr Ala Ile Arg 
        355                 360                 365             


Ser Leu Ile Lys Ala Ala Glu Ala Asp Glu Lys Ile His Pro Lys Gln 
    370                 375                 380                 


Tyr Asp Leu Ser Ser Leu Arg Leu Leu Gly Thr Val Gly Glu Pro Ile 
385                 390                 395                 400 


Asn Pro Glu Ala Trp Met Trp Tyr Tyr Lys Asn Ile Gly Asn Glu Arg 
                405                 410                 415     


Cys Pro Ile Val Asp Thr Phe Trp Gln Thr Glu Thr Gly Gly His Met 
            420                 425                 430         


Ile Thr Pro Leu Pro Gly Ala Thr Pro Leu Val Pro Gly Ser Cys Thr 
        435                 440                 445             


Leu Pro Leu Pro Gly Ile Met Ala Ala Ile Val Asp Glu Thr Gly His 
    450                 455                 460                 


Asp Val Pro Asn Gly Asn Gly Gly Ile Leu Val Val Lys Arg Pro Trp 
465                 470                 475                 480 


Pro Ala Met Ile Arg Thr Ile Trp Gly Asp Pro Glu Arg Phe Arg Lys 
                485                 490                 495     


Ser Tyr Phe Pro Glu Glu Leu Gly Gly Lys Leu Tyr Leu Ala Gly Asp 
            500                 505                 510         


Gly Ser Ile Arg Asp Lys Asp Thr Gly Tyr Phe Thr Ile Met Gly Arg 
        515                 520                 525             


Ile Asp Asp Val Leu Asn Val Ser Gly His Arg Met Gly Thr Met Glu 
    530                 535                 540                 


Ile Glu Ser Ala Leu Val Ser Asn Pro Leu Val Ala Glu Ala Ala Val 
545                 550                 555                 560 


Val Gly Arg Pro Asp Asp Met Thr Gly Glu Ala Ile Cys Ala Phe Val 
                565                 570                 575     


Val Leu Lys Arg Ser Arg Pro Thr Gly Glu Glu Ala Val Lys Ile Ala 
            580                 585                 590         


Thr Glu Leu Arg Asn Trp Val Gly Lys Glu Ile Gly Pro Ile Ala Lys 
        595                 600                 605             


Pro Lys Asp Ile Arg Phe Gly Asp Asn Leu Pro Lys Thr Arg Ser Gly 
    610                 615                 620                 


Lys Ile Met Arg Arg Leu Leu Arg Ser Leu Ala Lys Gly Glu Glu Ile 
625                 630                 635                 640 


Thr Gln Asp Thr Ser Thr Leu Glu Asn Pro Ala Ile Leu Glu Gln Leu 
                645                 650                 655     


Lys Gln Ala Gln 
            660 


<210>  28
<211>  1983
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  28
atgtccgcca tcgaatcggt gatgcaagag catcgcgtgt tcaacccgcc cgaaggcttc       60

gccagccagg ccgcgatccc cagcatggag gcctaccagg cgctgtgcga cgaagccgag      120

cgtgactatg aaggtttctg ggcgcgccac gcgcgcgagc tgctgcactg gaccaagccc      180

ttcaccaagg tgctggacca aagcaacgca ccgttctaca agtggttcga agacggcgag      240

ctcaacgcct cttacaactg cctggaccgc aatctgcaga acggcaatgc ggacaaggtc      300

gcgatcgtgt tcgaggccga cgacggcagc gtgacgcgcg tcacctaccg cgagctgcat      360

ggcaaggtgt gccgcttcgc caacggcctg aaggcgctcg gcatcaggaa gggcgaccgc      420

gtggtgatct acatgccgat gtcggtcgaa ggcgtggtcg cgatgcaggc ctgcgcacgc      480

ctgggcgcca cgcactcggt ggtgttcggc ggcttctcgg ccaagtcgct gcaggagcgg      540

ctggtggacg tgggcgcggt ggcgctgatc accgccgacg agcagatgcg cggcggcaag      600

gcgctgccgc tcaaggccat cgccgatgac gcgctggcgc tgggcggctg cgaggccgtc      660

aggaacgtga tcgtctaccg ccgcaccggc ggcaaggttg cctggaccga aggccgcgac      720

cgctggatgg aagatgtcag cgccggccag ccggatacct gcgaagccga gccggtgagc      780

gccgagcacc cgctgttcgt gctctacacc tccggctcca ccggcaagcc caagggcgtg      840

cagcacagca ccggcggcta cctgctgtgg gcgctgatga caatgaagtg gaccttcgac      900

atcaagcccg acgacctgtt ctggtgtacc gcggacatcg gctgggtcac cggccacacc      960

tatattgcct acggcccgct ggccgcgggc gccacccagg tggtgttcga aggcgtgccg     1020

acctacccca acgccggccg cttctgggac atgatcgcgc gccacaaggt cagcatcttc     1080

tacaccgcgc cgaccgcgat ccgctcgctg atcaaggccg ccgaggccga cgagaagatc     1140

cacccgaaac agtacgacct gtccagcctg cgcctgctcg gcaccgtggg cgagccgatc     1200

aaccccgaag cctggatgtg gtactacaag aacatcggca acgagcgctg cccgatcgtc     1260

gacaccttct ggcagaccga gaccggcggc cacatgatca cgccgctgcc gggcgcgacg     1320

ccgctggtgc cgggttcgtg cacgctgccg ctgccgggca tcatggccgc catcgtcgac     1380

gagaccggcc atgacgtgcc caacggcaac ggcggcatcc tggtggtcaa gcgtccgtgg     1440

ccggccatga tccgcaccat ctggggcgat ccggagcgct tcaggaagag ctacttcccc     1500

gaagagctcg gcggcaagct ctacctggcc ggcgacggct cgatccgcga caaggacacc     1560

ggctacttca ccatcatggg ccgcatcgac gacgtgctga acgtgtcggg ccaccgcatg     1620

gggacgatgg agatcgagtc cgcgctggtg tccaacccgc tggtggctga agccgccgtg     1680

gtgggccgcc ccgacgacat gaccggcgag gccatctgcg ccttcgtcgt gctcaagcgt     1740

tcgcgtccga ctggcgaaga ggccgtcaag atcgcgacgg agctgcgcaa ctgggtcggc     1800

aaggagatcg gcccgatcgc caagcccaag gacatccgct ttggcgacaa cctgcccaag     1860

acgcgctcgg gcaagatcat gcggcgcctg ctgcggtcgc tggccaaggg ggaggagatc     1920

acgcaggaca cctcgacgct ggagaatccg gccatcctgg agcagctcaa gcaggcgcag     1980

tga                                                                   1983


<210>  29
<211>  714
<212>  PRT
<213>  C. necator

<400>  29

Met Ser Thr Arg Asp Leu Tyr Thr His Ala Gln Leu Arg Arg Leu Phe 
1               5                   10                  15      


His Pro Arg Thr Ile Ala Val Val Gly Ala Thr Pro Asn Ala Arg Ser 
            20                  25                  30          


Phe Ala Gly Arg Ala Met Thr Asn Leu Gln Gln Phe Asp Gly Asn Val 
        35                  40                  45              


Leu Leu Val Asn Pro Arg Tyr Pro Glu Val Asn Gly Gln Val Cys Tyr 
    50                  55                  60                  


Pro Ser Leu Ser Ala Leu Pro Glu Ala Pro Asp Cys Val Leu Ile Ala 
65                  70                  75                  80  


Thr Ala Arg Glu Thr Val Glu Pro Ile Val Arg Glu Cys Ala Gly Leu 
                85                  90                  95      


Gly Val Gly Gly Val Val Leu Phe Ala Ser Gly Tyr Ala Glu Thr Gly 
            100                 105                 110         


Asn Pro Glu Gln Ile Ala Glu Gln Ala Arg Leu Val Ala Ile Ala Arg 
        115                 120                 125             


Glu Ser Gly Met Leu Leu Leu Gly Pro Asn Ser Ile Gly Tyr Ala Asn 
    130                 135                 140                 


Tyr Ile Asn His Ala Leu Val Ser Phe Thr Pro Leu Pro Ala Arg Gly 
145                 150                 155                 160 


Gly Glu Leu Pro Ala His Ala Ile Gly Leu Val Ser Gln Ser Gly Ala 
                165                 170                 175     


Leu Ala Phe Ala Leu Glu Gln Ala Ala Asn His Gly Thr Ala Phe Ser 
            180                 185                 190         


His Val Phe Ser Cys Gly Asn Ala Cys Asp Ile Asp Val Thr Asp Gln 
        195                 200                 205             


Ile Ala Tyr Leu Ala Gly Asp Pro Ser Cys Ala Ala Ile Ala Cys Val 
    210                 215                 220                 


Phe Glu Gly Leu Ser Asp Ala Ser Arg Ile Ile Arg Ala Ala Gln Val 
225                 230                 235                 240 


Cys Ala Glu Ala Gly Lys Pro Leu Val Val Tyr Lys Met Ala Arg Gly 
                245                 250                 255     


Thr Ala Gly Ala Ala Ala Ala Met Ser His Thr Gly Ser Met Ala Gly 
            260                 265                 270         


Ser Asp Arg Ala Tyr Ser Thr Ala Leu Arg Glu Ala Gly Val Val Gln 
        275                 280                 285             


Val Asp Thr Ile Glu Gln Leu Val Pro Thr Thr Val Phe Phe Ala Lys 
    290                 295                 300                 


Ala Pro Arg Pro Thr Thr Ser Gly Val Ala Ile Val Ser Gly Ser Gly 
305                 310                 315                 320 


Gly Ala Gly Ile Val Ala Ala Asp Glu Ala Glu Arg Phe Asn Val Pro 
                325                 330                 335     


Leu Pro Gln Pro Cys Asp Ala Thr Arg Ala Val Leu Glu Ser His Ile 
            340                 345                 350         


Pro Asp Phe Gly Ala Ala Arg Asn Pro Cys Asp Leu Thr Ala Gln Ala 
        355                 360                 365             


Ala Asn Asn Phe Asp Ser Phe Ile Gln Cys Gly Asp Ala Val Phe Ala 
    370                 375                 380                 


Asp Pro Ala Tyr Gly Ala Ala Val Val Pro Leu Val Val Thr Gly Asp 
385                 390                 395                 400 


Gly Asn Gly Arg Arg Phe Gln Val Phe Asn Asp Leu Ala Val Lys His 
                405                 410                 415     


Gly Lys Met Ala Cys Gly Leu Trp Met Ser Asn Trp Met Glu Gly Pro 
            420                 425                 430         


Glu Ala Val Glu Ser Glu Ala Leu Pro Arg Leu Ala Leu Phe Arg Ser 
        435                 440                 445             


Val Ser His Cys Phe Ala Ala Leu Ala Ala Trp Gln Ala Arg Glu Gln 
    450                 455                 460                 


Trp Leu Leu Ser Arg Ala Thr Pro Lys Pro Pro Arg Leu Thr His Ala 
465                 470                 475                 480 


Ser Val Ala Ala Glu Ala Arg Ala Arg Ile Val Ala Ala Pro Ala Asp 
                485                 490                 495     


Thr Leu Thr Glu Arg Glu Ala Lys Asp Val Leu Ala Met Tyr Gly Val 
            500                 505                 510         


Pro Val Val Gly Glu Ser Leu Ala Thr Ser Glu Gln Asp Ala Val Arg 
        515                 520                 525             


Ala Ala Asp Ala Cys Gly Tyr Pro Val Val Leu Lys Val Glu Ser Pro 
    530                 535                 540                 


Ala Ile Pro His Lys Ser Glu Ala Gly Val Ile Arg Leu Gly Val Asn 
545                 550                 555                 560 


Ser Ala Gln Glu Val Ala Val Ala Tyr Arg Glu Val Met Ala Asn Ala 
                565                 570                 575     


Arg Lys Val Thr Ala Asp Asp Arg Ile Asn Gly Val Leu Val Gln Ser 
            580                 585                 590         


Gln Val Pro Thr Gly Ile Glu Ile Leu Val Gly Ala Arg Val Asp Pro 
        595                 600                 605             


His Leu Gly Ala Leu Leu Val Val Gly Leu Gly Gly Val Met Val Glu 
    610                 615                 620                 


Leu Met Gln Asp Thr Val Ala Thr Ile Ala Pro Cys Ser Ala Gln Gln 
625                 630                 635                 640 


Ala Arg Ala Met Leu Glu Gln Leu Arg Gly Val Ala Leu Leu Lys Gly 
                645                 650                 655     


Phe Arg Gly Ala Ala Gly Val Asp Met Asp Leu Leu Ala Glu Ile Val 
            660                 665                 670         


Ala Ser Leu Ser Glu Phe Ala Ala Asp Gln Arg Asp Val Ile Ala Glu 
        675                 680                 685             


Phe Asp Val Asn Pro Leu Ile Cys Thr Pro Asp Arg Ile Val Ala Val 
    690                 695                 700                 


Asp Ala Leu Ile Glu Arg Arg Val Gly Ala 
705                 710                 


<210>  30
<211>  2145
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  30
atgtcgacac gcgatctcta tacccacgcg caactgcggc gcctcttcca tccgcgcacc       60

atcgcggtgg tcggcgcgac gccgaacgct cgctcgttcg ccggccgggc catgacgaac      120

ctgcagcagt tcgacggcaa cgtgctgctg gtcaaccccc gctaccccga ggtgaacggg      180

caggtctgct atccgtcgct gtcggcgctg cccgaggcgc ccgactgcgt gctgatcgcc      240

accgcgcgcg aaacggtgga gcccatcgtg cgcgagtgcg cggggctggg cgtgggcggc      300

gtggtgctgt tcgcgtcggg ctatgccgag accggcaatc cggagcagat tgccgagcag      360

gctcggctgg tcgccattgc ccgggaaagc ggcatgctgc tgctcggtcc gaacagcatc      420

ggctatgcga actacatcaa ccatgcgctg gtgtcgttca cgccgctgcc cgcgcgtggc      480

ggcgaactgc cggcccatgc gatcgggctg gtcagccagt ccggcgcgct ggcatttgcg      540

ctggaacagg cggccaacca cggcacggcg ttcagccacg tgttctcgtg cggcaatgcg      600

tgcgatatcg acgtgaccga ccagatcgcc tatctcgccg gggatccctc gtgcgcggcg      660

atcgcatgcg tattcgaagg gctgtccgac gccagccgga tcattcgcgc ggcgcaagtc      720

tgcgcggaag ccggcaagcc gctggtggtc tacaagatgg cgcgcgggac ggcgggcgcg      780

gcggcggcca tgtcgcatac cggctcgatg gcgggatccg accgcgccta cagcacggcg      840

ctgcgcgaag ctggcgtggt gcaggtcgat accatcgagc agctcgtgcc gacgacggtg      900

ttcttcgcca aggccccccg gccgacgacg tccggcgtgg ccatcgtctc gggttcgggc      960

ggcgcgggca ttgtcgccgc cgacgaggcc gagcgtttca acgtgccgct gccgcagccg     1020

tgtgacgcga cccgcgccgt gctcgaatcg cacattcctg acttcggcgc cgcgcgcaac     1080

ccgtgcgacc tgaccgccca ggccgccaac aacttcgact ccttcatcca gtgcggcgac     1140

gcggtcttcg ccgatcccgc ctacggcgcc gccgtggtgc cgctggtggt gaccggcgac     1200

ggcaacggcc gccgcttcca ggtgttcaac gacctagccg tcaagcacgg caagatggcg     1260

tgcggcctgt ggatgtcgaa ctggatggaa gggccggagg cggtcgagtc cgaggcgctg     1320

ccgcgccttg cgctgttccg ctcggtctcg cactgcttcg cggcgctggc cgcgtggcag     1380

gcacgggagc aatggctgtt gtcgcgcgcc acgccgaagc cgccgcgcct gacacacgct     1440

tcggtggccg ccgaagcgcg cgcgcgcatc gttgccgcgc cggccgatac gctcaccgag     1500

cgtgaagcca aggacgtcct tgccatgtac ggcgtgccgg tggtgggcga gtccctggcg     1560

acgagcgagc aggacgccgt gcgcgccgcc gatgcctgcg gctatccggt cgtgctgaag     1620

gtcgagagcc cggccatccc gcacaagtcg gaagcgggcg tgatccgcct cggcgtgaac     1680

tcggcgcagg aggttgccgt cgcgtaccgc gaggtcatgg cgaatgcgcg caaggtgacc     1740

gccgacgacc gcatcaacgg cgtgctggtg cagagccagg tgccgaccgg catcgagatc     1800

ttggtcggcg cccgcgtgga cccgcacctc ggcgcgctgc tggtggtggg gctgggcggg     1860

gtgatggtcg agctgatgca ggacacggtc gcgaccatcg cgccgtgctc ggcgcagcag     1920

gcgcgcgcca tgctggagca gctgcgcggc gtggcgctgc tgaagggctt ccgcggcgcg     1980

gcgggcgtgg acatggacct gctggcggaa atcgtcgcca gcctgtccga gttcgcggcg     2040

gaccagcgcg acgtgatcgc cgagttcgat gtgaatccgc tgatctgcac gccggaccgc     2100

atcgtggcgg tggatgcgct gatcgaacgg agagtggggg cctga                     2145


<210>  31
<211>  660
<212>  PRT
<213>  C. necator

<400>  31

Met Thr Ser Ile Gln Ser Val Val His Glu Gly Arg Met Phe Pro Pro 
1               5                   10                  15      


Ser Arg His Ala Ser Ala Lys Ala Ala Ile Pro Ser Met Glu Ala Tyr 
            20                  25                  30          


Gln Ala Leu Cys Asp Glu Ala Glu Arg Asp Tyr Glu Gly Phe Trp Ala 
        35                  40                  45              


Arg His Ala Arg Glu Leu Leu His Trp Thr Lys Pro Phe Thr Lys Val 
    50                  55                  60                  


Leu Asp Gln Ser Asn Ala Pro Phe Tyr Lys Trp Phe Glu Asp Gly Glu 
65                  70                  75                  80  


Leu Asn Ala Ser Tyr Asn Cys Leu Asp Arg Asn Leu Gln Asn Gly Asn 
                85                  90                  95      


Ala Asp Lys Val Ala Ile Val Phe Glu Ala Asp Asp Gly Ser Val Thr 
            100                 105                 110         


Arg Val Thr Tyr Arg Glu Leu His Gly Lys Val Cys Arg Phe Ala Asn 
        115                 120                 125             


Gly Leu Lys Ala Leu Gly Ile Arg Lys Gly Asp Arg Val Val Ile Tyr 
    130                 135                 140                 


Met Pro Met Ser Val Glu Gly Val Val Ala Met Gln Ala Cys Ala Arg 
145                 150                 155                 160 


Leu Gly Ala Thr His Ser Val Val Phe Gly Gly Phe Ser Ala Lys Ser 
                165                 170                 175     


Leu Gln Glu Arg Leu Val Asp Val Gly Ala Val Ala Leu Ile Thr Ala 
            180                 185                 190         


Asp Glu Gln Met Arg Gly Gly Lys Ala Leu Pro Leu Lys Pro Ile Ala 
        195                 200                 205             


Asp Asp Ala Leu Ala Leu Gly Gly Cys Glu Ala Val Arg Asn Val Ile 
    210                 215                 220                 


Val Tyr Arg Arg Thr Gly Gly Lys Val Ala Trp Thr Glu Gly Arg Asp 
225                 230                 235                 240 


Arg Trp Met Glu Asp Val Ser Ala Gly Gln Pro Glu Thr Cys Glu Ala 
                245                 250                 255     


Glu Pro Val Ser Ala Glu His Pro Leu Phe Val Leu Tyr Thr Ser Gly 
            260                 265                 270         


Ser Thr Gly Lys Pro Lys Gly Val Gln His Ser Thr Gly Gly Tyr Leu 
        275                 280                 285             


Leu Trp Ala Leu Met Thr Met Lys Trp Thr Phe Asp Ile Lys Pro Asp 
    290                 295                 300                 


Asp Leu Phe Trp Cys Thr Ala Asp Ile Gly Trp Val Thr Gly His Thr 
305                 310                 315                 320 


Tyr Ile Ala Tyr Gly Pro Leu Ala Ala Gly Ala Thr Gln Val Val Phe 
                325                 330                 335     


Glu Gly Val Pro Thr Tyr Pro Asn Ala Gly Arg Phe Trp Asp Met Ile 
            340                 345                 350         


Ala Arg His Lys Val Ser Ile Phe Tyr Thr Ala Pro Thr Ala Ile Arg 
        355                 360                 365             


Ser Leu Ile Lys Ala Ala Glu Ala Asp Glu Lys Ile His Pro Lys Gln 
    370                 375                 380                 


Tyr Asp Leu Ser Ser Leu Arg Leu Leu Gly Thr Val Gly Glu Pro Ile 
385                 390                 395                 400 


Asn Pro Glu Ala Trp Met Trp Tyr Tyr Lys Asn Ile Gly Asn Glu Arg 
                405                 410                 415     


Cys Pro Ile Val Asp Thr Phe Trp Gln Thr Glu Thr Gly Gly His Met 
            420                 425                 430         


Ile Thr Pro Leu Pro Gly Ala Thr Pro Leu Val Pro Gly Ser Cys Thr 
        435                 440                 445             


Leu Pro Leu Pro Gly Ile Met Ala Ala Ile Val Asp Glu Thr Gly His 
    450                 455                 460                 


Asp Val Pro Asn Gly Asn Gly Gly Ile Leu Val Val Lys Arg Pro Trp 
465                 470                 475                 480 


Pro Ala Met Ile Arg Thr Ile Trp Gly Asp Pro Glu Arg Phe Arg Lys 
                485                 490                 495     


Ser Tyr Phe Pro Glu Glu Leu Gly Gly Lys Leu Tyr Leu Ala Gly Asp 
            500                 505                 510         


Gly Ser Ile Arg Asp Lys Asp Thr Gly Tyr Phe Thr Ile Met Gly Arg 
        515                 520                 525             


Ile Asp Asp Val Leu Asn Val Ser Gly His Arg Met Gly Thr Met Glu 
    530                 535                 540                 


Ile Glu Ser Ala Leu Val Ser Asn Pro Leu Val Ala Glu Ala Ala Val 
545                 550                 555                 560 


Val Gly Arg Pro Asp Asp Met Thr Gly Glu Ala Ile Cys Ala Phe Val 
                565                 570                 575     


Val Leu Lys Arg Ser Arg Pro Thr Gly Glu Glu Ala Val Lys Ile Ala 
            580                 585                 590         


Thr Glu Leu Arg Asn Trp Val Gly Lys Glu Ile Gly Pro Ile Ala Lys 
        595                 600                 605             


Pro Lys Asp Ile Arg Phe Gly Asp Asn Leu Pro Lys Thr Arg Ser Gly 
    610                 615                 620                 


Lys Ile Met Arg Arg Leu Leu Arg Ser Leu Ala Lys Gly Glu Glu Ile 
625                 630                 635                 640 


Thr Gln Asp Thr Ser Thr Leu Glu Asn Pro Ala Ile Leu Glu Gln Leu 
                645                 650                 655     


Gly Gln Ala Arg 
            660 


<210>  32
<211>  1983
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  32
atgacaagca ttcaatccgt tgtgcacgaa gggcggatgt tcccgccatc ccgccacgcc       60

agcgctaagg ccgcgattcc cagcatggag gcctaccagg cactgtgcga cgaagccgag      120

cgtgactatg aaggtttctg ggcgcgccac gcgcgcgagc tgctgcactg gaccaagccc      180

ttcaccaagg tgctggacca aagcaacgca ccgttctaca agtggttcga agacggcgag      240

ctcaacgcct cttacaactg cctggaccgc aatctgcaga acggcaatgc ggacaaggtc      300

gcgatcgtgt tcgaggccga cgacggcagc gtgacgcgcg tcacctaccg cgagctgcat      360

ggcaaggtgt gccgctttgc caacggcctg aaggcgctcg gcatcaggaa gggcgaccgc      420

gtggtgatct acatgccgat gtcggtcgaa ggcgtggtcg cgatgcaggc ctgcgcacgc      480

ctgggcgcca cgcactcggt ggtgttcggc ggcttctcgg ccaagtcgct gcaggagcgg      540

ctggtggacg tgggcgcggt ggcgctgatc accgccgacg agcagatgcg cggcggcaag      600

gcgctgccgc tcaagcccat cgccgatgac gcgctggcgc tggggggctg cgaggccgtc      660

aggaacgtga tcgtctaccg ccgcaccggc ggcaaggttg cctggaccga aggccgcgac      720

cgctggatgg aagatgtcag cgccggccag ccggagacct gcgaagccga gccggtgagc      780

gccgagcacc cgctgttcgt gctctacacc tccggctcca ccggcaagcc caagggcgtg      840

cagcacagca ccggcggcta cctgctgtgg gcgctgatga caatgaagtg gaccttcgac      900

atcaagcccg acgacctgtt ctggtgtacc gcggacatcg gctgggtcac cggccacacc      960

tatattgcct acggcccgct ggccgcgggc gccacccagg tggtgttcga aggcgtgccg     1020

acctacccca acgccggccg cttctgggac atgatcgcgc gccacaaggt cagcatcttc     1080

tacaccgcgc cgaccgcgat ccgctcgctg atcaaggccg ccgaggccga cgagaagatc     1140

cacccgaaac agtacgacct gtccagcctg cgcctgctcg gcaccgtggg cgagccgatc     1200

aaccccgaag cctggatgtg gtactacaag aacatcggca acgagcgctg cccgatcgtc     1260

gacaccttct ggcagaccga gaccggcggc cacatgatca cgccgctgcc gggcgcgacg     1320

ccgctggtgc cgggttcgtg cacgctgccg ctgccgggca tcatggccgc catcgtcgac     1380

gagaccggcc atgacgtgcc caacggcaac ggcggcatcc tggtggtcaa gcgtccgtgg     1440

ccggccatga tccgcaccat ctggggcgat ccggagcgct tcaggaagag ctacttcccc     1500

gaagagctcg gcggcaagct ctacctggcc ggcgacggct cgatccgcga caaggacacc     1560

ggctacttca ccatcatggg ccgcatcgac gacgtgctga acgtgtcggg ccaccgcatg     1620

gggacgatgg agatcgagtc cgcgctggtg tccaacccgc tggtggccga agccgccgtg     1680

gtgggccgcc ccgacgacat gaccggcgag gccatctgcg ccttcgtcgt gctcaagcgt     1740

tcgcgtccga ctggcgaaga ggccgtcaag atcgcgacgg agctgcgcaa ctgggtcggc     1800

aaggagatcg gcccgatcgc caagcccaag gacatccgct ttggcgacaa cctgcccaag     1860

acgcgctcgg gcaagatcat gcggcgcctg ctgcggtcgc tggccaaggg ggaggagatc     1920

acgcaggaca cctcgacgct ggagaatccg gccatcctgg agcagcttgg ccaggcacgc     1980

tga                                                                   1983


<210>  33
<211>  550
<212>  PRT
<213>  C. necator

<400>  33

Met Arg Asp Tyr Ala Gln Ala Phe Asp Gly Phe Ser Tyr Asp Asp Ala 
1               5                   10                  15      


Val Ala Arg Gln Leu His Gly Ser Gln Glu Ala Met Asn Ala Cys Val 
            20                  25                  30          


Glu Cys Cys Asp Arg His Ala Leu Pro Gly Arg Ile Ala Leu Phe Trp 
        35                  40                  45              


Glu Gly Arg Asp Gly Asn Ser Arg Ser Trp Thr Phe Thr Glu Leu Gln 
    50                  55                  60                  


Ala Leu Ser Ala Gln Phe Ala Gly Phe Leu Lys Ala Gln Gly Val Gln 
65                  70                  75                  80  


Pro Gly Asp Arg Val Ala Gly Leu Leu Pro Arg Asn Ala Glu Leu Leu 
                85                  90                  95      


Val Thr Ile Leu Gly Thr Trp Arg Ala Gly Ala Val Tyr Gln Pro Leu 
            100                 105                 110         


Phe Thr Ala Phe Gly Pro Lys Ala Ile Glu His Arg Leu Asn Ala Ser 
        115                 120                 125             


Gly Ala Lys Val Val Val Thr Asp Gly Ala Asn Arg Pro Lys Leu Asp 
    130                 135                 140                 


Asp Val Asp Gly Cys Pro Ala Ile Val Thr Val Ala Gly Asp Lys Gly 
145                 150                 155                 160 


Arg Gly Leu Val Arg Gly Asp Phe Ser Phe Trp Ala Glu Leu Glu Arg 
                165                 170                 175     


Gln Pro Ala Ser Phe Glu Pro Val Pro Arg Arg Gly Asp Asp Pro Phe 
            180                 185                 190         


Leu Met Met Phe Thr Ser Gly Thr Thr Gly Pro Ala Lys Pro Leu Leu 
        195                 200                 205             


Val Pro Leu Lys Ala Ile Ala Ala Phe Ala Gly Tyr Met Ser Asp Ala 
    210                 215                 220                 


Val Asp Leu Arg Ala Glu Asp Ala Phe Trp Asn Leu Ala Asp Pro Gly 
225                 230                 235                 240 


Trp Ala Tyr Gly Leu Tyr Tyr Ala Val Thr Gly Pro Leu Ala Leu Gly 
                245                 250                 255     


His Pro Thr Thr Phe Tyr Asp Gly Pro Phe Thr Val Glu Ser Thr Cys 
            260                 265                 270         


Arg Val Ile Arg Lys Tyr Gly Ile Thr Asn Leu Ala Gly Ser Pro Thr 
        275                 280                 285             


Ala Tyr Arg Leu Leu Ile Ala Ala Gly Glu Ala Val Ser Gly Pro Leu 
    290                 295                 300                 


Arg Gly Arg Leu Arg Ala Val Ser Ser Ala Gly Glu Pro Leu Asn Pro 
305                 310                 315                 320 


Glu Val Ile Arg Trp Phe Ala Ser Glu Leu Gly Val Thr Ile His Asp 
                325                 330                 335     


His Tyr Gly Gln Thr Glu Leu Gly Met Val Leu Cys Asn His His Ala 
            340                 345                 350         


Leu Ala His Pro Val Arg Met Gly Ala Ala Gly Phe Ala Ser Pro Gly 
        355                 360                 365             


His Arg Val Val Val Val Asp Asp Glu Gln Arg Glu Leu Pro Pro Gly 
    370                 375                 380                 


Arg Pro Gly Thr Leu Ala Leu Asp Leu Lys Arg Ser Pro Met Cys Trp 
385                 390                 395                 400 


Phe Gly Gly Tyr His Gly Thr Pro Thr Ser Gly Phe Ala Gly Gly Tyr 
                405                 410                 415     


Tyr Leu Thr Gly Asp Ser Ala Glu Leu Asn Asp Asp Gly Ser Ile Ser 
            420                 425                 430         


Phe Ile Gly Arg Ala Asp Asp Val Ile Thr Thr Ser Gly Tyr Arg Val 
        435                 440                 445             


Gly Pro Phe Asp Val Glu Ser Ala Leu Ile Glu His Pro Ala Val Val 
    450                 455                 460                 


Glu Ala Ala Val Ile Gly Lys Pro Asp Pro Glu Arg Thr Glu Leu Ile 
465                 470                 475                 480 


Lys Ala Phe Val Val Leu Asp Pro Gln Tyr Arg Ala Ala Pro Glu Leu 
                485                 490                 495     


Ala Glu Ala Leu Arg Gln His Val Arg Lys Arg Leu Ala Ala His Ala 
            500                 505                 510         


Tyr Pro Arg Glu Ile Glu Phe Val Val Glu Leu Pro Lys Thr Pro Ser 
        515                 520                 525             


Gly Lys Val Gln Arg Phe Ile Leu Arg Asn Gln Glu Val Ala Arg Ala 
    530                 535                 540                 


Arg Glu Ala Ala Ala Ala 
545                 550 


<210>  34
<211>  1653
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic

<400>  34
atgcgcgact acgcccaagc cttcgacgga ttttcctatg acgacgccgt ggcacggcaa       60

ctgcacggca gccaggaggc aatgaacgcc tgcgtcgaat gctgcgaccg ccacgcgctg      120

ccgggccgta tcgcgctgtt ctgggaaggg cgagacggca attcgcgcag ctggaccttt      180

accgagctgc aggcactgtc cgcgcagttt gccggcttcc tgaaggcgca gggcgtgcag      240

ccgggcgacc gcgtggcggg cctgctgccg cgcaatgcgg aactgctggt gacgattctc      300

ggcacctggc gcgccggcgc ggtgtaccag ccgctgttca cggccttcgg ccccaaggcc      360

atcgagcacc ggctcaatgc gtccggcgcg aaggttgtgg tcaccgatgg cgccaaccgc      420

cccaagctgg atgacgtgga tggctgtccc gccattgtca ccgtggccgg cgacaagggc      480

cgcggcctgg tgcgcggcga cttcagcttc tgggccgaac tggaacgcca gccggcgtcg      540

ttcgagccgg tgccgcgccg gggcgacgac cccttcctga tgatgttcac ctccggcacc      600

accggcccgg ccaagccgct gctggtgccg ctcaaggcca ttgccgcgtt tgccggctat      660

atgagcgacg cggtcgacct gcgcgcggaa gacgctttct ggaacctggc cgatccgggc      720

tgggcctatg gcctgtatta cgcggtcacg ggcccgctgg cgctgggcca tcccaccacc      780

ttctacgatg gcccgttcac cgtggagagc acatgccgtg tgatccgcaa gtacggcatc      840

accaacctgg ccggctcgcc cacggcatac cggctgctga tcgccgcggg cgaggccgtg      900

tcaggcccgc tgcgcgggcg gctgcgcgcg gtcagcagcg cgggcgagcc gctcaacccg      960

gaagtgatcc gctggttcgc cagcgagctg ggcgtgacca tccacgacca ctacggccag     1020

accgagctgg gcatggtgct gtgcaaccac catgcgctgg cgcatccggt gcgcatgggc     1080

gcggccggct ttgccagccc cgggcaccgc gtggtggtgg tggacgatga acagcgcgaa     1140

ctgccgccgg gccggccggg cacgctggcg ctggacctga agcgctcgcc gatgtgctgg     1200

ttcggcggct atcacggcac gcccaccagc gggtttgccg gcggctacta cctgaccggc     1260

gattccgccg agctgaatga cgacggcagc atcagcttca taggccgggc cgacgacgtc     1320

atcaccacct ctggctaccg cgtgggcccg ttcgacgtgg aaagcgcgct gatcgagcac     1380

ccggccgtgg tcgaggccgc ggtgatcggc aagcccgatc cggagcgcac cgagctgatc     1440

aaggcctttg tcgtgctgga cccgcaatat cgcgccgcgc cggaactggc cgaggcgctg     1500

cgccagcacg tgcgtaagcg cctggccgcc catgcctacc cgcgcgagat cgagttcgtc     1560

gtcgagctgc ccaagacccc cagcggcaag gtccagcgct ttatcctgcg caaccaggaa     1620

gtggcccgcg cgcgcgaggc ggccgctgcc tga                                  1653


