                         SEQUENCE LISTING

<110>  E.I.duPont de Nemours and Company, Inc.
       DiCosimo, Robert
       Payne, Mark
       Gavagan, John
 
<120>  PERHYDROLASE FOR ENZYMATIC PERACID PRODUCTION

<130>  CL4550 US NA

<160>  41    

<170>  PatentIn version 3.5

<210>  1
<211>  957
<212>  DNA
<213>  Bacillus subtilis

<400>  1
atgcaactat tcgatctgcc gctcgaccaa ttgcaaacat ataagcctga aaaaacagca       60

ccgaaagatt tttctgagtt ttggaaattg tctttggagg aacttgcaaa agtccaagca      120

gaacctgatc tacagccggt tgactatcct gctgacggag taaaagtgta ccgtctcaca      180

tataaaagct tcggaaacgc ccgcattacc ggatggtacg cggtgcctga caagcaaggc      240

ccgcatccgg cgatcgtgaa atatcatggc tacaatgcaa gctatgatgg tgagattcat      300

gaaatggtaa actgggcact ccatggctac gccgcattcg gcatgcttgt ccgcggccag      360

cagagcagcg aggatacgag tatttcactg cacggtcatg ctttgggctg gatgacgaaa      420

ggaattcttg ataaagatac atactattac cgcggtgttt atttggacgc cgtccgcgcg      480

cttgaggtca tcagcagctt cgacgaggtt gacgaaacaa ggatcggtgt gacaggagga      540

agccaaggcg gaggtttaac cattgccgca gcagcgctgt cagacattcc aaaagccgcg      600

gttgccgatt atccttattt aagcaacttc gaacgggcca ttgatgtggc gcttgaacag      660

ccgtaccttg aaatcaattc cttcttcaga agaaatggca gcccggaaac agaagtgcag      720

gcgatgaaga cactttcata tttcgatatt atgaatctcg ctgaccgagt gaaggtgcct      780

gtcctgatgt caatcggcct gattgacaag gtcacgccgc cgtccaccgt gtttgccgcc      840

tacaatcatt tggaaacaga gaaagagctg aaggtgtacc gctacttcgg acatgagtat      900

atccctgctt ttcaaacgga aaaacttgct ttctttaagc agcatcttaa aggctga         957


<210>  2
<211>  318
<212>  PRT
<213>  Bacillus subtilis

<400>  2

Met Gln Leu Phe Asp Leu Pro Leu Asp Gln Leu Gln Thr Tyr Lys Pro 
1               5                   10                  15      


Glu Lys Thr Ala Pro Lys Asp Phe Ser Glu Phe Trp Lys Leu Ser Leu 
            20                  25                  30          


Glu Glu Leu Ala Lys Val Gln Ala Glu Pro Asp Leu Gln Pro Val Asp 
        35                  40                  45              


Tyr Pro Ala Asp Gly Val Lys Val Tyr Arg Leu Thr Tyr Lys Ser Phe 
    50                  55                  60                  


Gly Asn Ala Arg Ile Thr Gly Trp Tyr Ala Val Pro Asp Lys Gln Gly 
65                  70                  75                  80  


Pro His Pro Ala Ile Val Lys Tyr His Gly Tyr Asn Ala Ser Tyr Asp 
                85                  90                  95      


Gly Glu Ile His Glu Met Val Asn Trp Ala Leu His Gly Tyr Ala Ala 
            100                 105                 110         


Phe Gly Met Leu Val Arg Gly Gln Gln Ser Ser Glu Asp Thr Ser Ile 
        115                 120                 125             


Ser Leu His Gly His Ala Leu Gly Trp Met Thr Lys Gly Ile Leu Asp 
    130                 135                 140                 


Lys Asp Thr Tyr Tyr Tyr Arg Gly Val Tyr Leu Asp Ala Val Arg Ala 
145                 150                 155                 160 


Leu Glu Val Ile Ser Ser Phe Asp Glu Val Asp Glu Thr Arg Ile Gly 
                165                 170                 175     


Val Thr Gly Gly Ser Gln Gly Gly Gly Leu Thr Ile Ala Ala Ala Ala 
            180                 185                 190         


Leu Ser Asp Ile Pro Lys Ala Ala Val Ala Asp Tyr Pro Tyr Leu Ser 
        195                 200                 205             


Asn Phe Glu Arg Ala Ile Asp Val Ala Leu Glu Gln Pro Tyr Leu Glu 
    210                 215                 220                 


Ile Asn Ser Phe Phe Arg Arg Asn Gly Ser Pro Glu Thr Glu Val Gln 
225                 230                 235                 240 


Ala Met Lys Thr Leu Ser Tyr Phe Asp Ile Met Asn Leu Ala Asp Arg 
                245                 250                 255     


Val Lys Val Pro Val Leu Met Ser Ile Gly Leu Ile Asp Lys Val Thr 
            260                 265                 270         


Pro Pro Ser Thr Val Phe Ala Ala Tyr Asn His Leu Glu Thr Glu Lys 
        275                 280                 285             


Glu Leu Lys Val Tyr Arg Tyr Phe Gly His Glu Tyr Ile Pro Ala Phe 
    290                 295                 300                 


Gln Thr Glu Lys Leu Ala Phe Phe Lys Gln His Leu Lys Gly 
305                 310                 315             


<210>  3
<211>  939
<212>  DNA
<213>  Lactococcus lactis

<400>  3
atgacaaaaa taaacaattg gcaagattat caaggaagtt cacttaaacc agaggatttt       60

gataaatttt gggatgaaaa aattaatttg gtttcaaatc atcaatttga atttgaatta      120

atagaaaaaa atctttcctc taaggtagtt aacttttatc atttgtggtt tacagctatt      180

gatggagcta aaattcatgc tcagttaatt gttcccaaga atttgaaaga gaaataccca      240

gccatcttac aatttcatgg ttatcattgc gatagtgggg attgggtcga taaaataggg      300

atagttgccg aagggaatgt agttcttgcg cttgattgtc gaggacaagg tggtttaagt      360

caagataata ttcaaactat ggggatgaca atgaagggac tcattgttcg aggaattgat      420

gaagggtatg aaaatctcta ttacgttcgc caatttatgg acttaataac tgcaaccaaa      480

attttatccg agtttgattt tgttgatgaa acaaatataa gtgcacaagg tgcttctcaa      540

ggtggagcgc ttgccgttgc ttgcgccgca ctttctcctc ttataaaaaa ggtgactgcc      600

acttacccct ttctttcaga ttatcgcaaa gcttatgagc ttggtgccga ggaatctgct      660

ttcgaagaac ttccatattg gtttcagttt aaagatccac ttcatctaag agaagactgg      720

ttttttaatc agttggaata cattgatatt caaaatttag caccaagaat taaggctgag      780

gtcatttgga tcctaggcgg caaagatact gttgttcctc cgattacgca aatggcggct      840

tacaataaaa tacaaagtaa aaaatctctc tatgtcttac ctgaatacgg ccatgaatat      900

cttcctaaaa ttagcgactg gttaagagag aatcaataa                             939


<210>  4
<211>  312
<212>  PRT
<213>  Lactococcus lactis

<400>  4

Met Thr Lys Ile Asn Asn Trp Gln Asp Tyr Gln Gly Ser Ser Leu Lys 
1               5                   10                  15      


Pro Glu Asp Phe Asp Lys Phe Trp Asp Glu Lys Ile Asn Leu Val Ser 
            20                  25                  30          


Asn His Gln Phe Glu Phe Glu Leu Ile Glu Lys Asn Leu Ser Ser Lys 
        35                  40                  45              


Val Val Asn Phe Tyr His Leu Trp Phe Thr Ala Ile Asp Gly Ala Lys 
    50                  55                  60                  


Ile His Ala Gln Leu Ile Val Pro Lys Asn Leu Lys Glu Lys Tyr Pro 
65                  70                  75                  80  


Ala Ile Leu Gln Phe His Gly Tyr His Cys Asp Ser Gly Asp Trp Val 
                85                  90                  95      


Asp Lys Ile Gly Ile Val Ala Glu Gly Asn Val Val Leu Ala Leu Asp 
            100                 105                 110         


Cys Arg Gly Gln Gly Gly Leu Ser Gln Asp Asn Ile Gln Thr Met Gly 
        115                 120                 125             


Met Thr Met Lys Gly Leu Ile Val Arg Gly Ile Asp Glu Gly Tyr Glu 
    130                 135                 140                 


Asn Leu Tyr Tyr Val Arg Gln Phe Met Asp Leu Ile Thr Ala Thr Lys 
145                 150                 155                 160 


Ile Leu Ser Glu Phe Asp Phe Val Asp Glu Thr Asn Ile Ser Ala Gln 
                165                 170                 175     


Gly Ala Ser Gln Gly Gly Ala Leu Ala Val Ala Cys Ala Ala Leu Ser 
            180                 185                 190         


Pro Leu Ile Lys Lys Val Thr Ala Thr Tyr Pro Phe Leu Ser Asp Tyr 
        195                 200                 205             


Arg Lys Ala Tyr Glu Leu Gly Ala Glu Glu Ser Ala Phe Glu Glu Leu 
    210                 215                 220                 


Pro Tyr Trp Phe Gln Phe Lys Asp Pro Leu His Leu Arg Glu Asp Trp 
225                 230                 235                 240 


Phe Phe Asn Gln Leu Glu Tyr Ile Asp Ile Gln Asn Leu Ala Pro Arg 
                245                 250                 255     


Ile Lys Ala Glu Val Ile Trp Ile Leu Gly Gly Lys Asp Thr Val Val 
            260                 265                 270         


Pro Pro Ile Thr Gln Met Ala Ala Tyr Asn Lys Ile Gln Ser Lys Lys 
        275                 280                 285             


Ser Leu Tyr Val Leu Pro Glu Tyr Gly His Glu Tyr Leu Pro Lys Ile 
    290                 295                 300                 


Ser Asp Trp Leu Arg Glu Asn Gln 
305                 310         


<210>  5
<211>  972
<212>  DNA
<213>  Mesorhizobium loti

<400>  5
atgccgttcc cggatctgat ccagcccgaa ctgggcgctt atgtcagcag tgtcggcatg       60

ccggacgact ttgcccaatt ctggacgtcg accatcgccg aggctcgcca ggccggcggt      120

gaggtcagta tcgtgcaggc gcagacgaca ctgaaggcgg tccagtcctt cgatgtcacg      180

tttccaggat acggcggtca tccaatcaaa ggatggctga tcttgccgac gcaccacaag      240

gggcggcttc ccctcgtcgt gcagtatatc ggctatggcg gcggccgcgg cttggcgcat      300

gagcaactgc attgggcggc gtcaggcttt gcctatttcc gaatggatac acgcgggcag      360

ggaagcgact ggagcgtcgg tgagaccgcc gatcccgtcg gctcgacctc gtccattccc      420

ggctttatga cgcgtggcgt gctggacaag aatgactact attaccggcg cctgttcacc      480

gatgccgtga gggcgataga tgctctgctc ggactggact tcgtcgatcc cgaacgcatc      540

gcggtttgcg gtgacagtca gggaggcggt atttcgctcg ccgttggcgg catcgacccg      600

cgcgtcaagg ccgtaatgcc cgacgttcca tttctgtgcg actttccgcg cgctgtgcag      660

actgccgtgc gcgatcccta tttggaaatc gttcgctttc tggcccagca tcgcgaaaag      720

aaggcggcag tctttgaaac gctcaactat ttcgactgcg tcaacttcgc ccggcggtcc      780

aaggcgccgg cgctgttttc ggtggccctg atggacgaag tctgcccgcc ctctaccgtg      840

tatggcgcat tcaatgccta tgcaggcgaa aagaccatca cagagtacga attcaacaat      900

catgaaggcg ggcaaggcta tcaagagcgc caacagatga cgtggctcag caggctgttc      960

ggtgtcggct ga                                                          972


<210>  6
<211>  323
<212>  PRT
<213>  Mesorhizobium loit

<400>  6

Met Pro Phe Pro Asp Leu Ile Gln Pro Glu Leu Gly Ala Tyr Val Ser 
1               5                   10                  15      


Ser Val Gly Met Pro Asp Asp Phe Ala Gln Phe Trp Thr Ser Thr Ile 
            20                  25                  30          


Ala Glu Ala Arg Gln Ala Gly Gly Glu Val Ser Ile Val Gln Ala Gln 
        35                  40                  45              


Thr Thr Leu Lys Ala Val Gln Ser Phe Asp Val Thr Phe Pro Gly Tyr 
    50                  55                  60                  


Gly Gly His Pro Ile Lys Gly Trp Leu Ile Leu Pro Thr His His Lys 
65                  70                  75                  80  


Gly Arg Leu Pro Leu Val Val Gln Tyr Ile Gly Tyr Gly Gly Gly Arg 
                85                  90                  95      


Gly Leu Ala His Glu Gln Leu His Trp Ala Ala Ser Gly Phe Ala Tyr 
            100                 105                 110         


Phe Arg Met Asp Thr Arg Gly Gln Gly Ser Asp Trp Ser Val Gly Glu 
        115                 120                 125             


Thr Ala Asp Pro Val Gly Ser Thr Ser Ser Ile Pro Gly Phe Met Thr 
    130                 135                 140                 


Arg Gly Val Leu Asp Lys Asn Asp Tyr Tyr Tyr Arg Arg Leu Phe Thr 
145                 150                 155                 160 


Asp Ala Val Arg Ala Ile Asp Ala Leu Leu Gly Leu Asp Phe Val Asp 
                165                 170                 175     


Pro Glu Arg Ile Ala Val Cys Gly Asp Ser Gln Gly Gly Gly Ile Ser 
            180                 185                 190         


Leu Ala Val Gly Gly Ile Asp Pro Arg Val Lys Ala Val Met Pro Asp 
        195                 200                 205             


Val Pro Phe Leu Cys Asp Phe Pro Arg Ala Val Gln Thr Ala Val Arg 
    210                 215                 220                 


Asp Pro Tyr Leu Glu Ile Val Arg Phe Leu Ala Gln His Arg Glu Lys 
225                 230                 235                 240 


Lys Ala Ala Val Phe Glu Thr Leu Asn Tyr Phe Asp Cys Val Asn Phe 
                245                 250                 255     


Ala Arg Arg Ser Lys Ala Pro Ala Leu Phe Ser Val Ala Leu Met Asp 
            260                 265                 270         


Glu Val Cys Pro Pro Ser Thr Val Tyr Gly Ala Phe Asn Ala Tyr Ala 
        275                 280                 285             


Gly Glu Lys Thr Ile Thr Glu Tyr Glu Phe Asn Asn His Glu Gly Gly 
    290                 295                 300                 


Gln Gly Tyr Gln Glu Arg Gln Gln Met Thr Trp Leu Ser Arg Leu Phe 
305                 310                 315                 320 


Gly Val Gly 
            


<210>  7
<211>  990
<212>  DNA
<213>  Geobacillus stearothermophilus

<400>  7
atgttcgata tgccgttagc acaattacag aaatacatgg ggacaaatcc gaagccggct       60

gattttgctg acttttggag tcgagcgttg gaggaattat ctgcccaatc gttgcattat      120

gagctgattc cggcaacatt tcaaacgaca gtggcgagtt gctaccattt gtatttcacg      180

ggagtcggcg gggctagagt ccattgtcag ttagtaaaac cgagagagca gaagcagaaa      240

ggcccggggt tggtatggtt tcatggctac catacgaata gcggcgattg ggtcgataaa      300

ctggcatatg ctgcggcagg ttttactgta ttggcgatgg attgccgcgg ccaaggagga      360

aaatcagagg ataatttgca agtgaaaggc ccaacattga agggccatat tattcgcgga      420

attgaggatc caaatcctca tcatctttat tatcgaaatg tttttttaga tacagttcag      480

gcggtaagaa ttttatgctc tatggatcat attgatcgtg aacgaattgg tgtatatggc      540

gcttcccaag gaggagcgtt ggcattagcg tgtgctgctc tggaaccatc ggtggtgaaa      600

aaagcggttg tgctctatcc atttttatcg gattataagc gggcgcaaga gttggatatg      660

aaaaataccg cgtatgagga aattcattat tattttcgat ttttagatcc cacacatgag      720

cgggaagaag aagtatttta caaactaggc tatattgata ttcaactctt agccgatcgg      780

atttgtgccg atgttttatg ggctgttgcg ctagaagacc atatttgtcc cccgtccaca      840

caatttgctg tttataataa aattaagtca aaaaaagaca tggttttgtt ttacgagtat      900

ggtcatgagt atttaccgac tatgggagac cgtgcttatc tgtttttttg cccgatcttc      960

tttccaatcc aaaagagaaa cgttaagtaa                                       990


<210>  8
<211>  329
<212>  PRT
<213>  Geobacillus stearothermophilus

<400>  8

Met Phe Asp Met Pro Leu Ala Gln Leu Gln Lys Tyr Met Gly Thr Asn 
1               5                   10                  15      


Pro Lys Pro Ala Asp Phe Ala Asp Phe Trp Ser Arg Ala Leu Glu Glu 
            20                  25                  30          


Leu Ser Ala Gln Ser Leu His Tyr Glu Leu Ile Pro Ala Thr Phe Gln 
        35                  40                  45              


Thr Thr Val Ala Ser Cys Tyr His Leu Tyr Phe Thr Gly Val Gly Gly 
    50                  55                  60                  


Ala Arg Val His Cys Gln Leu Val Lys Pro Arg Glu Gln Lys Gln Lys 
65                  70                  75                  80  


Gly Pro Gly Leu Val Trp Phe His Gly Tyr His Thr Asn Ser Gly Asp 
                85                  90                  95      


Trp Val Asp Lys Leu Ala Tyr Ala Ala Ala Gly Phe Thr Val Leu Ala 
            100                 105                 110         


Met Asp Cys Arg Gly Gln Gly Gly Lys Ser Glu Asp Asn Leu Gln Val 
        115                 120                 125             


Lys Gly Pro Thr Leu Lys Gly His Ile Ile Arg Gly Ile Glu Asp Pro 
    130                 135                 140                 


Asn Pro His His Leu Tyr Tyr Arg Asn Val Phe Leu Asp Thr Val Gln 
145                 150                 155                 160 


Ala Val Arg Ile Leu Cys Ser Met Asp His Ile Asp Arg Glu Arg Ile 
                165                 170                 175     


Gly Val Tyr Gly Ala Ser Gln Gly Gly Ala Leu Ala Leu Ala Cys Ala 
            180                 185                 190         


Ala Leu Glu Pro Ser Val Val Lys Lys Ala Val Val Leu Tyr Pro Phe 
        195                 200                 205             


Leu Ser Asp Tyr Lys Arg Ala Gln Glu Leu Asp Met Lys Asn Thr Ala 
    210                 215                 220                 


Tyr Glu Glu Ile His Tyr Tyr Phe Arg Phe Leu Asp Pro Thr His Glu 
225                 230                 235                 240 


Arg Glu Glu Glu Val Phe Tyr Lys Leu Gly Tyr Ile Asp Ile Gln Leu 
                245                 250                 255     


Leu Ala Asp Arg Ile Cys Ala Asp Val Leu Trp Ala Val Ala Leu Glu 
            260                 265                 270         


Asp His Ile Cys Pro Pro Ser Thr Gln Phe Ala Val Tyr Asn Lys Ile 
        275                 280                 285             


Lys Ser Lys Lys Asp Met Val Leu Phe Tyr Glu Tyr Gly His Glu Tyr 
    290                 295                 300                 


Leu Pro Thr Met Gly Asp Arg Ala Tyr Leu Phe Phe Cys Pro Ile Phe 
305                 310                 315                 320 


Phe Pro Ile Gln Lys Arg Asn Val Lys 
                325                 


<210>  9
<211>  795
<212>  DNA
<213>  artificial sequence

<220>
<223>  synthetic construct

<400>  9
atgattgaac aagatggatt gcacgcaggt tctccggccg cttgggtgga gaggctattc       60

ggctatgact gggcacaaca gacaatcggc tgctctgatg ccgccgtgtt ccggctgtca      120

gcgcaggggc gcccggttct ttttgtcaag accgacctgt ccggtgccct gaatgaactg      180

caggacgagg cagcgcggct atcgtggctg gccacgacgg gcgttccttg cgcagctgtg      240

ctcgacgttg tcactgaagc gggaagggac tggctgctat tgggcgaagt gccggggcag      300

gatctcctgt catctcacct tgctcctgcc gagaaagtat ccatcatggc tgatgcaatg      360

cggcggctgc atacgcttga tccggctacc tgcccattcg accaccaagc gaaacatcgc      420

atcgagcgag cacgtactcg gatggaagcc ggtcttgtcg atcaggatga tctggacgaa      480

gagcatcagg ggctcgcgcc agccgaactg ttcgccaggc tcaaggcgcg catgcccgac      540

ggcgaggatc tcgtcgtgac ccatggcgat gcctgcttgc cgaatatcat ggtggaaaat      600

ggccgctttt ctggattcat cgactgtggc cggctgggtg tggcggaccg ctatcaggac      660

atagcgttgg ctacccgtga tattgctgaa gagcttggcg gcgaatgggc tgaccgcttc      720

ctcgtgcttt acggtatcgc cgctcccgat tcgcagcgca tcgccttcta tcgccttctt      780

gacgagttct tctaa                                                       795


<210>  10
<211>  3434
<212>  DNA
<213>  artificial sequence

<220>
<223>  Plasmid pKD13

<400>  10
agattgcagc attacacgtc ttgagcgatt gtgtaggctg gagctgcttc gaagttccta       60

tactttctag agaataggaa cttcggaata ggaacttcaa gatcccctta ttagaagaac      120

tcgtcaagaa ggcgatagaa ggcgatgcgc tgcgaatcgg gagcggcgat accgtaaagc      180

acgaggaagc ggtcagccca ttcgccgcca agctcttcag caatatcacg ggtagccaac      240

gctatgtcct gatagcggtc cgccacaccc agccggccac agtcgatgaa tccagaaaag      300

cggccatttt ccaccatgat attcggcaag caggcatcgc catgggtcac gacgagatcc      360

tcgccgtcgg gcatgcgcgc cttgagcctg gcgaacagtt cggctggcgc gagcccctga      420

tgctcttcgt ccagatcatc ctgatcgaca agaccggctt ccatccgagt acgtgctcgc      480

tcgatgcgat gtttcgcttg gtggtcgaat gggcaggtag ccggatcaag cgtatgcagc      540

cgccgcattg catcagccat gatggatact ttctcggcag gagcaaggtg agatgacagg      600

agatcctgcc ccggcacttc gcccaatagc agccagtccc ttcccgcttc agtgacaacg      660

tcgagcacag ctgcgcaagg aacgcccgtc gtggccagcc acgatagccg cgctgcctcg      720

tcctgcagtt cattcagggc accggacagg tcggtcttga caaaaagaac cgggcgcccc      780

tgcgctgaca gccggaacac ggcggcatca gagcagccga ttgtctgttg tgcccagtca      840

tagccgaata gcctctccac ccaagcggcc ggagaacctg cgtgcaatcc atcttgttca      900

atcatgcgaa acgatcctca tcctgtctct tgatcagatc ttgatcccct gcgccatcag      960

atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac cttaccagag     1020

ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca gtctagctat     1080

cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt ttcccttgtc     1140

cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg actggctttc     1200

tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca gcgtgagctt     1260

caaaagcgct ctgaagttcc tatactttct agagaatagg aacttcgaac tgcaggtcga     1320

cggatccccg gaattaattc tcatgtttga cagcttatca ctgatcagtg aattaatggc     1380

gatgacgcat cctcacgata atatccgggt aggcgcaatc actttcgtct ctactccgtt     1440

acaaagcgag gctgggtatt tcccggcctt tctgttatcc gaaatccact gaaagcacag     1500

cggctggctg aggagataaa taataaacga ggggctgtat gcacaaagca tcttctgttg     1560

agttaagaac gagtatcgag atggcacata gccttgctca aattggaatc aggtttgtgc     1620

caataccagt agaaacagac gaagaagcta gctttgcact ggattgcgag gctttgccat     1680

ggctaattcc catgtcagcc gttaagtgtt cctgtgtcac tgaaaattgc tttgagaggc     1740

tctaagggct tctcagtgcg ttacatccct ggcttgttgt ccacaaccgt taaaccttaa     1800

aagctttaaa agccttatat attctttttt ttcttataaa acttaaaacc ttagaggcta     1860

tttaagttgc tgatttatat taattttatt gttcaaacat gagagcttag tacgtgaaac     1920

atgagagctt agtacgttag ccatgagagc ttagtacgtt agccatgagg gtttagttcg     1980

ttaaacatga gagcttagta cgttaaacat gagagcttag tacgtgaaac atgagagctt     2040

agtacgtact atcaacaggt tgaactgcgg atcttgcggc cgcaaaaatt aaaaatgaag     2100

ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat     2160

cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc     2220

cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat     2280

accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc cagccggaag     2340

ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg     2400

ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg ttgccattgc     2460

tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct ccggttccca     2520

acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta gctccttcgg     2580

tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg ttatggcagc     2640

actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga ctggtgagta     2700

ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt gcccggcgtc     2760

aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca ttggaaaacg     2820

ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt cgatgtaacc     2880

cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt ctgggtgagc     2940

aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat     3000

actcatactc ttcctttttc aatattattg aagcatttat cagggttatt gtctcatgag     3060

cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc     3120

ccgaaaagtg ccacctgcat cgatggcccc ccgatggtag tgtggggtct ccccatgcga     3180

gagtagggaa ctgccaggca tcaaataaaa cgaaaggctc agtcgaaaga ctgggccttt     3240

cgttttatct gttgtttgtc ggtgaacgct ctcctgagta ggacaaatcc gccgggagcg     3300

gatttgaacg ttgcgaagca acggcccgga gggtggcggg caggacgccc gccataaact     3360

gccaggcatc aaattaagca gaaggccatc ctgacggatg gcctttttgc gtggccagtg     3420

ccaagcttgc atgc                                                       3434


<210>  11
<211>  80
<212>  DNA
<213>  artificial sequence

<220>
<223>  Primer

<400>  11
atgagcacgt cagacgatat ccataacacc acagccactg gcaaatgccc gttccatcag       60

gtgtaggctg gagctgcttc                                                   80


<210>  12
<211>  82
<212>  DNA
<213>  artificial sequence

<220>
<223>  Primer

<400>  12
taacagcagg tcgaaacggt cgaggttcat cactttcacc catgccgcca cgaagtcttt       60

attccgggga tccgtcgacc tg                                                82


<210>  13
<211>  1424
<212>  DNA
<213>  artificial sequence

<220>
<223>  Synthetic construct

<400>  13
taacagcagg tcgaaacggt cgaggttcat cactttcacc catgccgcca cgaagtcttt       60

attccgggga tccgtcgacc tgcagttcga agttcctatt ctctagaaag tataggaact      120

tcagagcgct tttgaagctc acgctgccgc aagcactcag ggcgcaaggg ctgctaaagg      180

aagcggaaca cgtagaaagc cagtccgcag aaacggtgct gaccccggat gaatgtcagc      240

tactgggcta tctggacaag ggaaaacgca agcgcaaaga gaaagcaggt agcttgcagt      300

gggcttacat ggcgatagct agactgggcg gttttatgga cagcaagcga accggaattg      360

ccagctgggg cgccctctgg taaggttggg aagccctgca aagtaaactg gatggctttc      420

ttgccgccaa ggatctgatg gcgcagggga tcaagatctg atcaagagac aggatgagga      480

tcgtttcgca tgattgaaca agatggattg cacgcaggtt ctccggccgc ttgggtggag      540

aggctattcg gctatgactg ggcacaacag acaatcggct gctctgatgc cgccgtgttc      600

cggctgtcag cgcaggggcg cccggttctt tttgtcaaga ccgacctgtc cggtgccctg      660

aatgaactgc aggacgaggc agcgcggcta tcgtggctgg ccacgacggg cgttccttgc      720

gcagctgtgc tcgacgttgt cactgaagcg ggaagggact ggctgctatt gggcgaagtg      780

ccggggcagg atctcctgtc atctcacctt gctcctgccg agaaagtatc catcatggct      840

gatgcaatgc ggcggctgca tacgcttgat ccggctacct gcccattcga ccaccaagcg      900

aaacatcgca tcgagcgagc acgtactcgg atggaagccg gtcttgtcga tcaggatgat      960

ctggacgaag agcatcaggg gctcgcgcca gccgaactgt tcgccaggct caaggcgcgc     1020

atgcccgacg gcgaggatct cgtcgtgacc catggcgatg cctgcttgcc gaatatcatg     1080

gtggaaaatg gccgcttttc tggattcatc gactgtggcc ggctgggtgt ggcggaccgc     1140

tatcaggaca tagcgttggc tacccgtgat attgctgaag agcttggcgg cgaatgggct     1200

gaccgcttcc tcgtgcttta cggtatcgcc gctcccgatt cgcagcgcat cgccttctat     1260

cgccttcttg acgagttctt ctaataaggg gatcttgaag ttcctattcc gaagttccta     1320

ttctctagaa agtataggaa cttcgaagca gctccagcct acacctgatg gaacgggcat     1380

ttgccagtgg ctgtggtgtt atggatatcg tctgacgtgc tcat                      1424


<210>  14
<211>  2181
<212>  DNA
<213>  Escherichia coli


<220>
<221>  CDS
<222>  (1)..(2181)

<400>  14
atg agc acg tca gac gat atc cat aac acc aca gcc act ggc aaa tgc         48
Met Ser Thr Ser Asp Asp Ile His Asn Thr Thr Ala Thr Gly Lys Cys           
1               5                   10                  15                

ccg ttc cat cag ggc ggt cac gac cag agt gcg ggg gcg ggc aca acc         96
Pro Phe His Gln Gly Gly His Asp Gln Ser Ala Gly Ala Gly Thr Thr           
            20                  25                  30                    

act cgc gac tgg tgg cca aat caa ctt cgt gtt gac ctg tta aac caa        144
Thr Arg Asp Trp Trp Pro Asn Gln Leu Arg Val Asp Leu Leu Asn Gln           
        35                  40                  45                        

cat tct aat cgt tct aac cca ctg ggt gag gac ttt gac tac cgc aaa        192
His Ser Asn Arg Ser Asn Pro Leu Gly Glu Asp Phe Asp Tyr Arg Lys           
    50                  55                  60                            

gaa ttc agc aaa tta gat tac tac ggc ctg aaa aaa gat ctg aaa gcc        240
Glu Phe Ser Lys Leu Asp Tyr Tyr Gly Leu Lys Lys Asp Leu Lys Ala           
65                  70                  75                  80            

ctg ttg aca gaa tct caa ccg tgg tgg cca gcc gac tgg ggc agt tac        288
Leu Leu Thr Glu Ser Gln Pro Trp Trp Pro Ala Asp Trp Gly Ser Tyr           
                85                  90                  95                

gcc ggt ctg ttt att cgt atg gcc tgg cac ggc gcg ggg act tac cgt        336
Ala Gly Leu Phe Ile Arg Met Ala Trp His Gly Ala Gly Thr Tyr Arg           
            100                 105                 110                   

tca atc gat gga cgc ggt ggc gcg ggt cgt ggt cag caa cgt ttt gca        384
Ser Ile Asp Gly Arg Gly Gly Ala Gly Arg Gly Gln Gln Arg Phe Ala           
        115                 120                 125                       

ccg ctg aac tcc tgg ccg gat aac gta agc ctc gat aaa gcg cgt cgc        432
Pro Leu Asn Ser Trp Pro Asp Asn Val Ser Leu Asp Lys Ala Arg Arg           
    130                 135                 140                           

ctg ttg tgg cca atc aaa cag aaa tat ggt cag aaa atc tcc tgg gcc        480
Leu Leu Trp Pro Ile Lys Gln Lys Tyr Gly Gln Lys Ile Ser Trp Ala           
145                 150                 155                 160           

gac ctg ttt atc ctc gcg ggt aac gtg gcg cta gaa aac tcc ggc ttc        528
Asp Leu Phe Ile Leu Ala Gly Asn Val Ala Leu Glu Asn Ser Gly Phe           
                165                 170                 175               

cgt acc ttc ggt ttt ggt gcc ggt cgt gaa gac gtc tgg gaa ccg gat        576
Arg Thr Phe Gly Phe Gly Ala Gly Arg Glu Asp Val Trp Glu Pro Asp           
            180                 185                 190                   

ctg gat gtt aac tgg ggt gat gaa aaa gcc tgg ctg act cac cgt cat        624
Leu Asp Val Asn Trp Gly Asp Glu Lys Ala Trp Leu Thr His Arg His           
        195                 200                 205                       

ccg gaa gcg ctg gcg aaa gca ccg ctg ggt gca acc gag atg ggt ctg        672
Pro Glu Ala Leu Ala Lys Ala Pro Leu Gly Ala Thr Glu Met Gly Leu           
    210                 215                 220                           

att tac gtt aac ccg gaa ggc ccg gat cac agc ggc gaa ccg ctt tct        720
Ile Tyr Val Asn Pro Glu Gly Pro Asp His Ser Gly Glu Pro Leu Ser           
225                 230                 235                 240           

gcg gca gca gct atc cgc gcg acc ttc ggc aac atg ggc atg aac gac        768
Ala Ala Ala Ala Ile Arg Ala Thr Phe Gly Asn Met Gly Met Asn Asp           
                245                 250                 255               

gaa gaa acc gtg gcg ctg att gcg ggt ggt cat acg ctg ggt aaa acc        816
Glu Glu Thr Val Ala Leu Ile Ala Gly Gly His Thr Leu Gly Lys Thr           
            260                 265                 270                   

cac ggt gcc ggt ccg aca tca aat gta ggt cct gat cca gaa gct gca        864
His Gly Ala Gly Pro Thr Ser Asn Val Gly Pro Asp Pro Glu Ala Ala           
        275                 280                 285                       

ccg att gaa gaa caa ggt tta ggt tgg gcg agc act tac ggc agc ggc        912
Pro Ile Glu Glu Gln Gly Leu Gly Trp Ala Ser Thr Tyr Gly Ser Gly           
    290                 295                 300                           

gtt ggc gca gat gcc att acc tct ggt ctg gaa gta gtc tgg acc cag        960
Val Gly Ala Asp Ala Ile Thr Ser Gly Leu Glu Val Val Trp Thr Gln           
305                 310                 315                 320           

acg ccg acc cag tgg agc aac tat ttc ttc gag aac ctg ttc aag tat       1008
Thr Pro Thr Gln Trp Ser Asn Tyr Phe Phe Glu Asn Leu Phe Lys Tyr           
                325                 330                 335               

gag tgg gta cag acc cgc agc ccg gct ggc gca atc cag ttc gaa gcg       1056
Glu Trp Val Gln Thr Arg Ser Pro Ala Gly Ala Ile Gln Phe Glu Ala           
            340                 345                 350                   

gta gac gca ccg gaa att atc ccg gat ccg ttt gat ccg tcg aag aaa       1104
Val Asp Ala Pro Glu Ile Ile Pro Asp Pro Phe Asp Pro Ser Lys Lys           
        355                 360                 365                       

cgt aaa ccg aca atg ctg gtg acc gac ctg acg ctg cgt ttt gat cct       1152
Arg Lys Pro Thr Met Leu Val Thr Asp Leu Thr Leu Arg Phe Asp Pro           
    370                 375                 380                           

gag ttc gag aag atc tct cgt cgt ttc ctc aac gat ccg cag gcg ttc       1200
Glu Phe Glu Lys Ile Ser Arg Arg Phe Leu Asn Asp Pro Gln Ala Phe           
385                 390                 395                 400           

aac gaa gcc ttt gcc cgt gcc tgg ttc aaa ctg acg cac agg gat atg       1248
Asn Glu Ala Phe Ala Arg Ala Trp Phe Lys Leu Thr His Arg Asp Met           
                405                 410                 415               

ggg ccg aaa tct cgc tac atc ggg ccg gaa gtg ccg aaa gaa gat ctg       1296
Gly Pro Lys Ser Arg Tyr Ile Gly Pro Glu Val Pro Lys Glu Asp Leu           
            420                 425                 430                   

atc tgg caa gat ccg ctg ccg cag ccg atc tac aac ccg acc gag cag       1344
Ile Trp Gln Asp Pro Leu Pro Gln Pro Ile Tyr Asn Pro Thr Glu Gln           
        435                 440                 445                       

gac att atc gat ctg aaa ttc gcg att gcg gat tct ggt ctg tct gtt       1392
Asp Ile Ile Asp Leu Lys Phe Ala Ile Ala Asp Ser Gly Leu Ser Val           
    450                 455                 460                           

agt gag ctg gta tcg gtg gcc tgg gca tct gct tct acc ttc cgt ggt       1440
Ser Glu Leu Val Ser Val Ala Trp Ala Ser Ala Ser Thr Phe Arg Gly           
465                 470                 475                 480           

ggc gac aaa cgc ggt ggt gcc aac ggt gcg cgt ctg gca tta atg ccg       1488
Gly Asp Lys Arg Gly Gly Ala Asn Gly Ala Arg Leu Ala Leu Met Pro           
                485                 490                 495               

cag cgc gac tgg gat gtg aac gcc gca gcc gtt cgt gct ctg cct gtt       1536
Gln Arg Asp Trp Asp Val Asn Ala Ala Ala Val Arg Ala Leu Pro Val           
            500                 505                 510                   

ctg gag aaa atc cag aaa gag tct ggt aaa gcc tcg ctg gcg gat atc       1584
Leu Glu Lys Ile Gln Lys Glu Ser Gly Lys Ala Ser Leu Ala Asp Ile           
        515                 520                 525                       

ata gtg ctg gct ggt gtg gtt ggt gtt gag aaa gcc gca agc gcc gca       1632
Ile Val Leu Ala Gly Val Val Gly Val Glu Lys Ala Ala Ser Ala Ala           
    530                 535                 540                           

ggt ttg agc att cat gta ccg ttt gcg ccg ggt cgc gtt gat gcg cgt       1680
Gly Leu Ser Ile His Val Pro Phe Ala Pro Gly Arg Val Asp Ala Arg           
545                 550                 555                 560           

cag gat cag act gac att gag atg ttt gag ctg ctg gag cca att gct       1728
Gln Asp Gln Thr Asp Ile Glu Met Phe Glu Leu Leu Glu Pro Ile Ala           
                565                 570                 575               

gac ggt ttc cgt aac tat cgc gct cgt ctg gac gtt tcc acc acc gag       1776
Asp Gly Phe Arg Asn Tyr Arg Ala Arg Leu Asp Val Ser Thr Thr Glu           
            580                 585                 590                   

tca ctg ctg atc gac aaa gca cag caa ctg acg ctg acc gcg ccg gaa       1824
Ser Leu Leu Ile Asp Lys Ala Gln Gln Leu Thr Leu Thr Ala Pro Glu           
        595                 600                 605                       

atg act gcg ctg gtg ggc ggc atg cgt gta ctg ggt gcc aac ttc gat       1872
Met Thr Ala Leu Val Gly Gly Met Arg Val Leu Gly Ala Asn Phe Asp           
    610                 615                 620                           

ggc agc aaa aac ggc gtc ttc act gac cgc gtt ggc gta ttg agc aat       1920
Gly Ser Lys Asn Gly Val Phe Thr Asp Arg Val Gly Val Leu Ser Asn           
625                 630                 635                 640           

gac ttc ttc gtg aac ttg ctg gat atg cgt tac gag tgg aaa gcg acc       1968
Asp Phe Phe Val Asn Leu Leu Asp Met Arg Tyr Glu Trp Lys Ala Thr           
                645                 650                 655               

gac gaa tcg aaa gag ctg ttc gaa ggc cgt gac cgt gaa acc ggc gaa       2016
Asp Glu Ser Lys Glu Leu Phe Glu Gly Arg Asp Arg Glu Thr Gly Glu           
            660                 665                 670                   

gtg aaa ttt acg gcc agc cgt gcg gat ctg gtg ttt ggt tct aac tcc       2064
Val Lys Phe Thr Ala Ser Arg Ala Asp Leu Val Phe Gly Ser Asn Ser           
        675                 680                 685                       

gtc ctg cgt gcg gtg gcg gaa gtt tac gcc agt agc gat gcc cac gag       2112
Val Leu Arg Ala Val Ala Glu Val Tyr Ala Ser Ser Asp Ala His Glu           
    690                 695                 700                           

aag ttt gtt aaa gac ttc gtg gcg gca tgg gtg aaa gtg atg aac ctc       2160
Lys Phe Val Lys Asp Phe Val Ala Ala Trp Val Lys Val Met Asn Leu           
705                 710                 715                 720           

gac cgt ttc gac ctg ctg taa                                           2181
Asp Arg Phe Asp Leu Leu                                                   
                725                                                       


<210>  15
<211>  726
<212>  PRT
<213>  Escherichia coli

<400>  15

Met Ser Thr Ser Asp Asp Ile His Asn Thr Thr Ala Thr Gly Lys Cys 
1               5                   10                  15      


Pro Phe His Gln Gly Gly His Asp Gln Ser Ala Gly Ala Gly Thr Thr 
            20                  25                  30          


Thr Arg Asp Trp Trp Pro Asn Gln Leu Arg Val Asp Leu Leu Asn Gln 
        35                  40                  45              


His Ser Asn Arg Ser Asn Pro Leu Gly Glu Asp Phe Asp Tyr Arg Lys 
    50                  55                  60                  


Glu Phe Ser Lys Leu Asp Tyr Tyr Gly Leu Lys Lys Asp Leu Lys Ala 
65                  70                  75                  80  


Leu Leu Thr Glu Ser Gln Pro Trp Trp Pro Ala Asp Trp Gly Ser Tyr 
                85                  90                  95      


Ala Gly Leu Phe Ile Arg Met Ala Trp His Gly Ala Gly Thr Tyr Arg 
            100                 105                 110         


Ser Ile Asp Gly Arg Gly Gly Ala Gly Arg Gly Gln Gln Arg Phe Ala 
        115                 120                 125             


Pro Leu Asn Ser Trp Pro Asp Asn Val Ser Leu Asp Lys Ala Arg Arg 
    130                 135                 140                 


Leu Leu Trp Pro Ile Lys Gln Lys Tyr Gly Gln Lys Ile Ser Trp Ala 
145                 150                 155                 160 


Asp Leu Phe Ile Leu Ala Gly Asn Val Ala Leu Glu Asn Ser Gly Phe 
                165                 170                 175     


Arg Thr Phe Gly Phe Gly Ala Gly Arg Glu Asp Val Trp Glu Pro Asp 
            180                 185                 190         


Leu Asp Val Asn Trp Gly Asp Glu Lys Ala Trp Leu Thr His Arg His 
        195                 200                 205             


Pro Glu Ala Leu Ala Lys Ala Pro Leu Gly Ala Thr Glu Met Gly Leu 
    210                 215                 220                 


Ile Tyr Val Asn Pro Glu Gly Pro Asp His Ser Gly Glu Pro Leu Ser 
225                 230                 235                 240 


Ala Ala Ala Ala Ile Arg Ala Thr Phe Gly Asn Met Gly Met Asn Asp 
                245                 250                 255     


Glu Glu Thr Val Ala Leu Ile Ala Gly Gly His Thr Leu Gly Lys Thr 
            260                 265                 270         


His Gly Ala Gly Pro Thr Ser Asn Val Gly Pro Asp Pro Glu Ala Ala 
        275                 280                 285             


Pro Ile Glu Glu Gln Gly Leu Gly Trp Ala Ser Thr Tyr Gly Ser Gly 
    290                 295                 300                 


Val Gly Ala Asp Ala Ile Thr Ser Gly Leu Glu Val Val Trp Thr Gln 
305                 310                 315                 320 


Thr Pro Thr Gln Trp Ser Asn Tyr Phe Phe Glu Asn Leu Phe Lys Tyr 
                325                 330                 335     


Glu Trp Val Gln Thr Arg Ser Pro Ala Gly Ala Ile Gln Phe Glu Ala 
            340                 345                 350         


Val Asp Ala Pro Glu Ile Ile Pro Asp Pro Phe Asp Pro Ser Lys Lys 
        355                 360                 365             


Arg Lys Pro Thr Met Leu Val Thr Asp Leu Thr Leu Arg Phe Asp Pro 
    370                 375                 380                 


Glu Phe Glu Lys Ile Ser Arg Arg Phe Leu Asn Asp Pro Gln Ala Phe 
385                 390                 395                 400 


Asn Glu Ala Phe Ala Arg Ala Trp Phe Lys Leu Thr His Arg Asp Met 
                405                 410                 415     


Gly Pro Lys Ser Arg Tyr Ile Gly Pro Glu Val Pro Lys Glu Asp Leu 
            420                 425                 430         


Ile Trp Gln Asp Pro Leu Pro Gln Pro Ile Tyr Asn Pro Thr Glu Gln 
        435                 440                 445             


Asp Ile Ile Asp Leu Lys Phe Ala Ile Ala Asp Ser Gly Leu Ser Val 
    450                 455                 460                 


Ser Glu Leu Val Ser Val Ala Trp Ala Ser Ala Ser Thr Phe Arg Gly 
465                 470                 475                 480 


Gly Asp Lys Arg Gly Gly Ala Asn Gly Ala Arg Leu Ala Leu Met Pro 
                485                 490                 495     


Gln Arg Asp Trp Asp Val Asn Ala Ala Ala Val Arg Ala Leu Pro Val 
            500                 505                 510         


Leu Glu Lys Ile Gln Lys Glu Ser Gly Lys Ala Ser Leu Ala Asp Ile 
        515                 520                 525             


Ile Val Leu Ala Gly Val Val Gly Val Glu Lys Ala Ala Ser Ala Ala 
    530                 535                 540                 


Gly Leu Ser Ile His Val Pro Phe Ala Pro Gly Arg Val Asp Ala Arg 
545                 550                 555                 560 


Gln Asp Gln Thr Asp Ile Glu Met Phe Glu Leu Leu Glu Pro Ile Ala 
                565                 570                 575     


Asp Gly Phe Arg Asn Tyr Arg Ala Arg Leu Asp Val Ser Thr Thr Glu 
            580                 585                 590         


Ser Leu Leu Ile Asp Lys Ala Gln Gln Leu Thr Leu Thr Ala Pro Glu 
        595                 600                 605             


Met Thr Ala Leu Val Gly Gly Met Arg Val Leu Gly Ala Asn Phe Asp 
    610                 615                 620                 


Gly Ser Lys Asn Gly Val Phe Thr Asp Arg Val Gly Val Leu Ser Asn 
625                 630                 635                 640 


Asp Phe Phe Val Asn Leu Leu Asp Met Arg Tyr Glu Trp Lys Ala Thr 
                645                 650                 655     


Asp Glu Ser Lys Glu Leu Phe Glu Gly Arg Asp Arg Glu Thr Gly Glu 
            660                 665                 670         


Val Lys Phe Thr Ala Ser Arg Ala Asp Leu Val Phe Gly Ser Asn Ser 
        675                 680                 685             


Val Leu Arg Ala Val Ala Glu Val Tyr Ala Ser Ser Asp Ala His Glu 
    690                 695                 700                 


Lys Phe Val Lys Asp Phe Val Ala Ala Trp Val Lys Val Met Asn Leu 
705                 710                 715                 720 


Asp Arg Phe Asp Leu Leu 
                725     


<210>  16
<211>  6329
<212>  DNA
<213>  artificial sequence

<220>
<223>  Plasmid pKD46

<400>  16
catcgattta ttatgacaac ttgacggcta catcattcac tttttcttca caaccggcac       60

ggaactcgct cgggctggcc ccggtgcatt ttttaaatac ccgcgagaaa tagagttgat      120

cgtcaaaacc aacattgcga ccgacggtgg cgataggcat ccgggtggtg ctcaaaagca      180

gcttcgcctg gctgatacgt tggtcctcgc gccagcttaa gacgctaatc cctaactgct      240

ggcggaaaag atgtgacaga cgcgacggcg acaagcaaac atgctgtgcg acgctggcga      300

tatcaaaatt gctgtctgcc aggtgatcgc tgatgtactg acaagcctcg cgtacccgat      360

tatccatcgg tggatggagc gactcgttaa tcgcttccat gcgccgcagt aacaattgct      420

caagcagatt tatcgccagc agctccgaat agcgcccttc cccttgcccg gcgttaatga      480

tttgcccaaa caggtcgctg aaatgcggct ggtgcgcttc atccgggcga aagaaccccg      540

tattggcaaa tattgacggc cagttaagcc attcatgcca gtaggcgcgc ggacgaaagt      600

aaacccactg gtgataccat tcgcgagcct ccggatgacg accgtagtga tgaatctctc      660

ctggcgggaa cagcaaaata tcacccggtc ggcaaacaaa ttctcgtccc tgatttttca      720

ccaccccctg accgcgaatg gtgagattga gaatataacc tttcattccc agcggtcggt      780

cgataaaaaa atcgagataa ccgttggcct caatcggcgt taaacccgcc accagatggg      840

cattaaacga gtatcccggc agcaggggat cattttgcgc ttcagccata cttttcatac      900

tcccgccatt cagagaagaa accaattgtc catattgcat cagacattgc cgtcactgcg      960

tcttttactg gctcttctcg ctaaccaaac cggtaacccc gcttattaaa agcattctgt     1020

aacaaagcgg gaccaaagcc atgacaaaaa cgcgtaacaa aagtgtctat aatcacggca     1080

gaaaagtcca cattgattat ttgcacggcg tcacactttg ctatgccata gcatttttat     1140

ccataagatt agcggatcct acctgacgct ttttatcgca actctctact gtttctccat     1200

acccgttttt ttgggaattc gagctctaag gaggttataa aaaatggata ttaatactga     1260

aactgagatc aagcaaaagc attcactaac cccctttcct gttttcctaa tcagcccggc     1320

atttcgcggg cgatattttc acagctattt caggagttca gccatgaacg cttattacat     1380

tcaggatcgt cttgaggctc agagctgggc gcgtcactac cagcagctcg cccgtgaaga     1440

gaaagaggca gaactggcag acgacatgga aaaaggcctg ccccagcacc tgtttgaatc     1500

gctatgcatc gatcatttgc aacgccacgg ggccagcaaa aaatccatta cccgtgcgtt     1560

tgatgacgat gttgagtttc aggagcgcat ggcagaacac atccggtaca tggttgaaac     1620

cattgctcac caccaggttg atattgattc agaggtataa aacgaatgag tactgcactc     1680

gcaacgctgg ctgggaagct ggctgaacgt gtcggcatgg attctgtcga cccacaggaa     1740

ctgatcacca ctcttcgcca gacggcattt aaaggtgatg ccagcgatgc gcagttcatc     1800

gcattactga tcgttgccaa ccagtacggc cttaatccgt ggacgaaaga aatttacgcc     1860

tttcctgata agcagaatgg catcgttccg gtggtgggcg ttgatggctg gtcccgcatc     1920

atcaatgaaa accagcagtt tgatggcatg gactttgagc aggacaatga atcctgtaca     1980

tgccggattt accgcaagga ccgtaatcat ccgatctgcg ttaccgaatg gatggatgaa     2040

tgccgccgcg aaccattcaa aactcgcgaa ggcagagaaa tcacggggcc gtggcagtcg     2100

catcccaaac ggatgttacg tcataaagcc atgattcagt gtgcccgtct ggccttcgga     2160

tttgctggta tctatgacaa ggatgaagcc gagcgcattg tcgaaaatac tgcatacact     2220

gcagaacgtc agccggaacg cgacatcact ccggttaacg atgaaaccat gcaggagatt     2280

aacactctgc tgatcgccct ggataaaaca tgggatgacg acttattgcc gctctgttcc     2340

cagatatttc gccgcgacat tcgtgcatcg tcagaactga cacaggccga agcagtaaaa     2400

gctcttggat tcctgaaaca gaaagccgca gagcagaagg tggcagcatg acaccggaca     2460

ttatcctgca gcgtaccggg atcgatgtga gagctgtcga acagggggat gatgcgtggc     2520

acaaattacg gctcggcgtc atcaccgctt cagaagttca caacgtgata gcaaaacccc     2580

gctccggaaa gaagtggcct gacatgaaaa tgtcctactt ccacaccctg cttgctgagg     2640

tttgcaccgg tgtggctccg gaagttaacg ctaaagcact ggcctgggga aaacagtacg     2700

agaacgacgc cagaaccctg tttgaattca cttccggcgt gaatgttact gaatccccga     2760

tcatctatcg cgacgaaagt atgcgtaccg cctgctctcc cgatggttta tgcagtgacg     2820

gcaacggcct tgaactgaaa tgcccgttta cctcccggga tttcatgaag ttccggctcg     2880

gtggtttcga ggccataaag tcagcttaca tggcccaggt gcagtacagc atgtgggtga     2940

cgcgaaaaaa tgcctggtac tttgccaact atgacccgcg tatgaagcgt gaaggcctgc     3000

attatgtcgt gattgagcgg gatgaaaagt acatggcgag ttttgacgag atcgtgccgg     3060

agttcatcga aaaaatggac gaggcactgg ctgaaattgg ttttgtattt ggggagcaat     3120

ggcgatgacg catcctcacg ataatatccg ggtaggcgca atcactttcg tctactccgt     3180

tacaaagcga ggctgggtat ttcccggcct ttctgttatc cgaaatccac tgaaagcaca     3240

gcggctggct gaggagataa ataataaacg aggggctgta tgcacaaagc atcttctgtt     3300

gagttaagaa cgagtatcga gatggcacat agccttgctc aaattggaat caggtttgtg     3360

ccaataccag tagaaacaga cgaagaatcc atgggtatgg acagttttcc ctttgatatg     3420

taacggtgaa cagttgttct acttttgttt gttagtcttg atgcttcact gatagataca     3480

agagccataa gaacctcaga tccttccgta tttagccagt atgttctcta gtgtggttcg     3540

ttgtttttgc gtgagccatg agaacgaacc attgagatca tacttacttt gcatgtcact     3600

caaaaatttt gcctcaaaac tggtgagctg aatttttgca gttaaagcat cgtgtagtgt     3660

ttttcttagt ccgttacgta ggtaggaatc tgatgtaatg gttgttggta ttttgtcacc     3720

attcattttt atctggttgt tctcaagttc ggttacgaga tccatttgtc tatctagttc     3780

aacttggaaa atcaacgtat cagtcgggcg gcctcgctta tcaaccacca atttcatatt     3840

gctgtaagtg tttaaatctt tacttattgg tttcaaaacc cattggttaa gccttttaaa     3900

ctcatggtag ttattttcaa gcattaacat gaacttaaat tcatcaaggc taatctctat     3960

atttgccttg tgagttttct tttgtgttag ttcttttaat aaccactcat aaatcctcat     4020

agagtatttg ttttcaaaag acttaacatg ttccagatta tattttatga atttttttaa     4080

ctggaaaaga taaggcaata tctcttcact aaaaactaat tctaattttt cgcttgagaa     4140

cttggcatag tttgtccact ggaaaatctc aaagccttta accaaaggat tcctgatttc     4200

cacagttctc gtcatcagct ctctggttgc tttagctaat acaccataag cattttccct     4260

actgatgttc atcatctgag cgtattggtt ataagtgaac gataccgtcc gttctttcct     4320

tgtagggttt tcaatcgtgg ggttgagtag tgccacacag cataaaatta gcttggtttc     4380

atgctccgtt aagtcatagc gactaatcgc tagttcattt gctttgaaaa caactaattc     4440

agacatacat ctcaattggt ctaggtgatt ttaatcacta taccaattga gatgggctag     4500

tcaatgataa ttactagtcc ttttcctttg agttgtgggt atctgtaaat tctgctagac     4560

ctttgctgga aaacttgtaa attctgctag accctctgta aattccgcta gacctttgtg     4620

tgtttttttt gtttatattc aagtggttat aatttataga ataaagaaag aataaaaaaa     4680

gataaaaaga atagatccca gccctgtgta taactcacta ctttagtcag ttccgcagta     4740

ttacaaaagg atgtcgcaaa cgctgtttgc tcctctacaa aacagacctt aaaaccctaa     4800

aggcttaagt agcaccctcg caagctcggt tgcggccgca atcgggcaaa tcgctgaata     4860

ttccttttgt ctccgaccat caggcacctg agtcgctgtc tttttcgtga cattcagttc     4920

gctgcgctca cggctctggc agtgaatggg ggtaaatggc actacaggcg ccttttatgg     4980

attcatgcaa ggaaactacc cataatacaa gaaaagcccg tcacgggctt ctcagggcgt     5040

tttatggcgg gtctgctatg tggtgctatc tgactttttg ctgttcagca gttcctgccc     5100

tctgattttc cagtctgacc acttcggatt atcccgtgac aggtcattca gactggctaa     5160

tgcacccagt aaggcagcgg tatcatcaac ggggtctgac gctcagtgga acgaaaactc     5220

acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa     5280

ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta     5340

ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt     5400

tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag     5460

tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag caataaacca     5520

gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc     5580

tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt     5640

tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag     5700

ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt     5760

tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat     5820

ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt     5880

gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc     5940

ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa aagtgctcat     6000

cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag     6060

ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt tcaccagcgt     6120

ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg     6180

gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt atcagggtta     6240

ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc     6300

gcgcacattt ccccgaaaag tgccacctg                                       6329


<210>  17
<211>  25
<212>  DNA
<213>  artificial sequence

<220>
<223>  Primer

<400>  17
aacaatatgt aagatctcaa ctatc                                             25


<210>  18
<211>  25
<212>  DNA
<213>  artificial sequence

<220>
<223>  Primer

<400>  18
cagacatgag agatccagtg tgtag                                             25


<210>  19
<211>  9332
<212>  DNA
<213>  artificial sequence

<220>
<223>  Plasmid pCP20

<400>  19
gagacacaac gtggctttgt tgaataaatc gaacttttgc tgagttgaag gatcagatca       60

cgcatcttcc cgacaacgca gaccgttccg tggcaaagca aaagttcaaa atcaccaact      120

ggtccaccta caacaaagct ctcatcaacc gtggctccct cactttctgg ctggatgatg      180

gggcgattca ggcctggtat gagtcagcaa caccttcttc acgaggcaga cctcagcgcc      240

acaggtgcgg ttgctggcgc taaccgtttt tatcaggctc tgggaggcag aataaatgat      300

catatcgtca attattacct ccacggggag agcctgagca aactggcctc aggcatttga      360

gaagcacacg gtcacactgc ttccggtagt caataaaccg gtaaaccagc aatagacata      420

agcggctatt taacgaccct gccctgaacc gacgaccggg tcgaatttgc tttcgaattt      480

ctgccattca tccgcttatt atcacttatt caggcgtagc aaccaggcgt ttaagggcac      540

caataactgc cttaaaaaaa ttacgccccg ccctgccact catcgcagta ctgttgtaat      600

tcattaagca ttctgccgac atggaagcca tcacaaacgg catgatgaac ctgaatcgcc      660

agcggcatca gcaccttgtc gccttgcgta taatatttgc ccatggtgaa aacgggggcg      720

aagaagttgt ccatattggc cacgtttaaa tcaaaactgg tgaaactcac ccagggattg      780

gctgagacga aaaacatatt ctcaataaac cctttaggga aataggccag gttttcaccg      840

taacacgcca catcttgcga atatatgtgt agaaactgcc ggaaatcgtc gtggtattca      900

ctccagagcg atgaaaacgt ttcagtttgc tcatggaaaa cggtgtaaca agggtgaaca      960

ctatcccata tcaccagctc accgtctttc attgccatac ggaattccgg atgagcattc     1020

atcaggcggg caagaatgtg aataaaggcc ggataaaact tgtgcttatt tttctttacg     1080

gtctttaaaa aggccgtaat atccagctga acggtctggt tataggtaca ttgagcaact     1140

gactgaaatg cctcaaaatg ttctttacga tgccattggg atatatcaac ggtggtatat     1200

ccagtgattt ttttctccat tttagcttcc ttagctcctg aaaatctcga taactcaaaa     1260

aatacgcccg gtagtgatct tatttcatta tggtgaaagt tggaacctct tacgtgccga     1320

tcaacgtctc attttcgcca aaagttggcc cagggcttcc cggtatcaac agggacacca     1380

ggatttattt attctgcgaa gtgatcttcc gtcacaggta tttattcggc gcaaagtgcg     1440

tcgggtgatg ctgccaactt actgatttag tgtatgatgg tgtttttgag gtgctccagt     1500

ggcttctgtt tctatcagct gtccctcctg ttcagctact gacggggtgg tgcgtaacgg     1560

caaaagcacc gccggacatc agcgcttgtt tcggcgtggg tatggtggca ggccccgtgg     1620

ccgggggact gttgggcgcc tgtagtgcca tttaccccca ttcactgcca gagccgtgag     1680

cgcagcgaac tgaatgtcac gaaaaagaca gcgactcagg tgcctgatgg tcggagacaa     1740

aaggaatatt cagcgatttg cccgagcttg cgagggtgct acttaagcct ttagggtttt     1800

aaggtctgtt ttgtagagga gcaaacagcg tttgcgacat ccttttgtaa tactgcggaa     1860

ctgactaaag tagtgagtta tacacagggc tgggatctat tctttttatc tttttttatt     1920

ctttctttat tctataaatt ataaccactt gaatataaac aaaaaaaaca cacaaaggtc     1980

tagcggaatt tacagagggt ctagcagaat ttacaagttt tccagcaaag gtctagcaga     2040

atttacagat acccacaact caaaggaaaa ggactagtaa ttatcattga ctagcccatc     2100

tcaattggta tagtgattaa aatcacctag accaattgag atgtatgtct gaattagttg     2160

ttttcaaagc aaatgaacta gcgattagtc gctatgactt aacggagcat gaaaccaagc     2220

taattttatg ctgtgtggca ctactcaacc ccacgattga aaaccctaca aggaaagaac     2280

ggacggtatc gttcacttat aaccaatacg ttcagatgat gaacatcagt agggaaaatg     2340

cttatggtgt attagctaaa gcaaccagag agctgatgac gagaactgtg gaaatcagga     2400

atcctttggt taaaggcttt gagattttcc agtggacaaa ctatgccaag ttctcaagcg     2460

aaaaattaga attagttttt agtgaagaga tattgcctta tcttttccag ttaaaaaaat     2520

tcataaaata taatctggaa catgttaagt cttttgaaaa caaatactct atgaggattt     2580

atgagtggtt attaaaagaa ctaacacaaa agaaaactca caaggcaaat atagagatta     2640

gccttgatga atttaagttc atgttaatgc ttgaaaataa ctaccatgag tttaaaaggc     2700

ttaaccaatg ggttttgaaa ccaataagta aagatttaaa cacttacagc aatatgaaat     2760

tggtggttga taagcgaggc cgcccgactg atacgttgat tttccaagtt gaactagata     2820

gacaaatgga tctcgtaacc gaacttgaga acaaccagat aaaaatgaat ggtgacaaaa     2880

taccaacaac cattacatca gattcctacc tacataacgg actaagaaaa acactacacg     2940

atgctttaac tgcaaaaatt cagctcacca gttttgaggc aaaatttttg agtgacatgc     3000

aaagtaagta tgatctcaat ggttcgttct catggctcac gcaaaaacaa cgaaccacac     3060

tagagaacat actggctaaa tacggaagga tctgaggttc ttatggctct tgtatctatc     3120

agtgaagcat caagactaac aaacaaaagt agaacaactg ttcaccgtta catatcaaag     3180

ggaaaactgt ccatatgcac agatgaaaac ggtgtaaaaa agatagatac atcagagctt     3240

ttacgagttt ttggtgcatt taaagctgtt caccatgaac agatcgacaa tgtaacagat     3300

gaacagcatg taacacctaa tagaacaggt gaaaccagta aaacaaagca actagaacat     3360

gaaattgaac acctgagaca acttgttaca gctcaacagt cacacataga cagcctgaaa     3420

caggcgatgc tgcttatcga atcaaagctg ccgacaacac gggagccagt gacgcctccc     3480

gtggggaaaa aatcatggca attctggaag aaatagcgcc tgtttcgttt caggcaggtt     3540

atcagggagt gtcagcgtcc tgcggttctc cggggcgttc gggtcatgca gcccgtaatg     3600

gtgatttacc agcgtctgcc aggcatcaat tctaggcctg tctgcgcggt cgtagtacgg     3660

ctggaggcgt tttccggtct gtagctccat gttcggaatg acaaaattca gctcaagccg     3720

tcccttgtcc tggtgctcca cccacaggat gctgtactga tttttttcga gaccgggcat     3780

cagtacacgc tcaaagctcg ccatcacttt ttcacgtcct cccggcggca gctccttctc     3840

cgcgaacgac agaacaccgg acgtgtattt cttcgcaaat ggcgtggcat cgatgagttc     3900

ccggacttct tccggattac cctgaagcac cgttgcgcct tcgcggttac gctccctccc     3960

cagcaggtaa tcaaccggac cactgccacc accttttccc ctggcatgaa atttaactat     4020

catcccgcgc cccctgttcc ctgacagcca gacgcagccg gcgcagctca tccccgatgg     4080

ccatcagtgc ggccaccacc tgaacccggt caccggaaga ccactgcccg ctgttcacct     4140

tacgggctgt ctgattcagg ttatttccga tggcggccag ctgacgcagt aacggcggtg     4200

ccagtgtcgg cagttttccg gaacgggcaa ccggctcccc caggcagacc cgccgcatcc     4260

ataccgccag ttgtttaccc tcacagcgtt caagtaaccg ggcatgttca tcatcagtaa     4320

cccgtattgt gagcatcctc tcgcgtttca tcggtatcat taccccatga acagaaatcc     4380

cccttacacg gaggcatcag tgactaaacg gggtctgacg ctcagtggaa cgaaaactca     4440

cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat     4500

taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac     4560

caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt     4620

gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt     4680

gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag     4740

ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct     4800

attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt     4860

gttgccattg ctgcaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc     4920

tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt     4980

agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg     5040

gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg     5100

actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct     5160

tgcccggcgt caacacggga taataccgcg ccacatagca gaactttaaa agtgctcatc     5220

attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt     5280

tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt     5340

tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg     5400

aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta tcagggttat     5460

tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg     5520

cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat catgacatta     5580

acctataaaa ataggcgtat cacgaggccc tttcgtcttc aagaatttta taaaccgtgg     5640

agcgggcaat actgagctga tgagcaattt ccgttgcacc agtgcccttc tgatgaagcg     5700

tcagcacgac gttcctgtcc acggtacgcc tgcggccaaa tttgattcct ttcagctttg     5760

cttcctgtcg gccctcattc gtgcgctcta ggatcctcta cgccggacgc atcgtggccg     5820

gcatcaccgg cgctgaggtc tgcctcgtga agaaggtgtt gctgactcat accaggcctg     5880

aatcgcccca tcatccagcc agaaagtgag ggagccacgg ttgatgagag ctttgttgta     5940

ggtggaccag ttggtgattt tgaacttttg ctttgccacg gaacggtctg cgttgtcggg     6000

aagatgcgtg atctgatcct tcaactcagc aaaagttcga tttattcaac aaagccgccg     6060

tcccgtcaag tcagcgtaat gctctgccag tgttacaacc aattaaccaa ttctgattag     6120

aaaaactcat cgagcatcaa atgaaactgc aatttattca tatcaggatt atcaatacca     6180

tatttttgaa aaagccgttt ctgtaatgaa ggagaaaact caccgaggca gttccatagg     6240

atggcaagat cctggtatcg gtctgcgatt ccgactcgtc caacatcaat acaacctatt     6300

aatttcccct cgtcaaaaat aaggttatca agtgagaaat caccatgagt gacgactgaa     6360

tccggtgaga atggcagaat aggaacttcg gaataggaac ttcaaagcgt ttccgaaaac     6420

gagcgcttcc gaaaatgcaa cgcgagctgc gcacatacag ctcactgttc acgtcgcacc     6480

tatatctgcg tgttgcctgt atatatatat acatgagaag aacggcatag tgcgtgttta     6540

tgcttaaatg cgtacttata tgcgtctatt tatgtaggat gaaaggtagt ctagtacctc     6600

ctgtgatatt atcccattcc atgcggggta tcgtatgctt ccttcagcac taccctttag     6660

ctgttctata tgctgccact cctcaattgg attagtctca tccttcaatg ctatcatttc     6720

ctttgatatt ggatcatatg catagtaccg agaaactagt gcgaagtagt gatcaggtat     6780

tgctgttatc tgatgagtat acgttgtcct ggccacggca gaagcacgct tatcgctcca     6840

atttcccaca acattagtca actccgttag gcccttcatt gaaagaaatg aggtcatcaa     6900

atgtcttcca atgtgagatt ttgggccatt ttttatagca aagattgaat aaggcgcatt     6960

tttcttcaaa gctttattgt acgatctgac taagttatct tttaataatt ggtattcctg     7020

tttattgctt gaagaattgc cggtcctatt tactcgtttt aggactggtt cagaattcct     7080

caaaaattca tccaaatata caagtggatc gatcctaccc cttgcgctaa agaagtatat     7140

gtgcctacta acgcttgtct ttgtctctgt cactaaacac tggattatta ctcccagata     7200

cttattttgg actaatttaa atgatttcgg atcaacgttc ttaatatcgc tgaatcttcc     7260

acaattgatg aaagtagcta ggaagaggaa ttggtataaa gtttttgttt ttgtaaatct     7320

cgaagtatac tcaaacgaat ttagtatttt ctcagtgatc tcccagatgc tttcaccctc     7380

acttagaagt gctttaagca tttttttact gtggctattt cccttatctg cttcttccga     7440

tgattcgaac tgtaattgca aactacttac aatatcagtg atatcagatt gatgtttttg     7500

tccatagtaa ggaataattg taaattccca agcaggaatc aatttcttta atgaggcttc     7560

cagaattgtt gctttttgcg tcttgtattt aaactggagt gatttattga caatatcgaa     7620

actcagcgaa ttgcttatga tagtattata gctcatgaat gtggctctct tgattgctgt     7680

tccgttatgt gtaatcatcc aacataaata ggttagttca gcagcacata atgctatttt     7740

ctcacctgaa ggtctttcaa acctttccac aaactgacga acaagcacct taggtggtgt     7800

tttacataat atatcaaatt gtggcataca acctccttag tacatgcaac cattatcacc     7860

gccagaggta aaatagtcaa cacgcacggt gttagatatt tatcccttgc ggtgatagat     7920

ttaacgtatg agcacaaaaa agaaaccatt aacacaagag cagcttgagg acgcacgtcg     7980

ccttaaagca atttatgaaa aaaagaaaaa tgaacttggc ttatcccagg aatctgtcgc     8040

agacaagatg gggatggggc agtcaggcgt tggtgcttta tttaatggca tcaatgcatt     8100

aaatgcttat aacgccgcat tgcttacaaa aattctcaaa gttagcgttg aagaatttag     8160

cccttcaatc gccagagaaa tctacgagat gtatgaagcg gttagtatgc agccgtcact     8220

tagaagtgag tatgagtacc ctgttttttc tcatgttcag gcagggatgt tctcacctaa     8280

gcttagaacc tttaccaaag gtgatgcgga gagatgggta agcacaacca aaaaagccag     8340

tgattctgca ttctggcttg aggttgaagg taattccatg accgcaccaa caggctccaa     8400

gccaagcttt cctgacggaa tgttaattct cgttgaccct gagcaggctg ttgagccagg     8460

tgatttctgc atagccagac ttgggggtga tgagtttacc ttcaagaaac tgatcaggga     8520

tagcggtcag gtgtttttac aaccactaaa cccacagtac ccaatgatcc catgcaatga     8580

gagttgttcc gttgtgggga aagttatcgc tagtcagtgg cctgaagaga cgtttggctg     8640

atcggcaagg tgttctggtc ggcgcatagc tgataacaat tgagcaagaa tctgcatttc     8700

tttccagact tgttcaacag gccagccatt acgctcgtca tcaaaatcac tcgcatcaac     8760

caaaccgtta ttcattcgtg attgcgcctg agcgagacga aatacgcgat cgctgttaaa     8820

aggacaatta caaacaggaa tcgaatgcaa ccggcgcagg aacactgcca gcgcatcaac     8880

aatattttca cctgaatcag gatattcttc taatacctgg aatgctgttt tcccggggat     8940

cgcagtggtg agtaaccatg catcatcagg agtacggata aaatgcttga tggtcggaag     9000

aggcataaat tccgtcagcc agtttagtct gaccatctca tctgtaacat cattggcaac     9060

gctacctttg ccatgtttca gaaacaactc tggcgcatcg ggcttcccat acaatcgata     9120

gattgtcgca cctgattgcc cgacattatc gcgagcccat ttatacccat ataaatcagc     9180

atccatgttg gaatttaatc gcggcctcga gcaagacgtt tcccgttgaa tatggctcat     9240

aacacccctt gtattactgt ttatgtaagc agacagtttt attgttcatg atgatatatt     9300

tttatcttgt gcaatgtaac atcagagatt tt                                   9332


<210>  20
<211>  80
<212>  DNA
<213>  artificial sequence

<220>
<223>  Primer

<400>  20
atgtcgcaac ataacgaaaa gaacccacat cagcaccagt caccactaca cgattccagc       60

gtgtaggctg gagctgcttc                                                   80


<210>  21
<211>  82
<212>  DNA
<213>  artificial sequence

<220>
<223>  Primer

<400>  21
ttacgccggg attttgtcaa tcttaggaat gcgtgaccac acgcggtgtg ctgtcatcag       60

attccgggga tccgtcgacc tg                                                82


<210>  22
<211>  1424
<212>  DNA
<213>  artificial sequence

<220>
<223>  Synthetic construct

<400>  22
ttacgccggg attttgtcaa tcttaggaat gcgtgaccac acgcggtgtg ctgtcatcag       60

attccgggga tccgtcgacc tgcagttcga agttcctatt ctctagaaag tataggaact      120

tcagagcgct tttgaagctc acgctgccgc aagcactcag ggcgcaaggg ctgctaaagg      180

aagcggaaca cgtagaaagc cagtccgcag aaacggtgct gaccccggat gaatgtcagc      240

tactgggcta tctggacaag ggaaaacgca agcgcaaaga gaaagcaggt agcttgcagt      300

gggcttacat ggcgatagct agactgggcg gttttatgga cagcaagcga accggaattg      360

ccagctgggg cgccctctgg taaggttggg aagccctgca aagtaaactg gatggctttc      420

ttgccgccaa ggatctgatg gcgcagggga tcaagatctg atcaagagac aggatgagga      480

tcgtttcgca tgattgaaca agatggattg cacgcaggtt ctccggccgc ttgggtggag      540

aggctattcg gctatgactg ggcacaacag acaatcggct gctctgatgc cgccgtgttc      600

cggctgtcag cgcaggggcg cccggttctt tttgtcaaga ccgacctgtc cggtgccctg      660

aatgaactgc aggacgaggc agcgcggcta tcgtggctgg ccacgacggg cgttccttgc      720

gcagctgtgc tcgacgttgt cactgaagcg ggaagggact ggctgctatt gggcgaagtg      780

ccggggcagg atctcctgtc atctcacctt gctcctgccg agaaagtatc catcatggct      840

gatgcaatgc ggcggctgca tacgcttgat ccggctacct gcccattcga ccaccaagcg      900

aaacatcgca tcgagcgagc acgtactcgg atggaagccg gtcttgtcga tcaggatgat      960

ctggacgaag agcatcaggg gctcgcgcca gccgaactgt tcgccaggct caaggcgcgc     1020

atgcccgacg gcgaggatct cgtcgtgacc catggcgatg cctgcttgcc gaatatcatg     1080

gtggaaaatg gccgcttttc tggattcatc gactgtggcc ggctgggtgt ggcggaccgc     1140

tatcaggaca tagcgttggc tacccgtgat attgctgaag agcttggcgg cgaatgggct     1200

gaccgcttcc tcgtgcttta cggtatcgcc gctcccgatt cgcagcgcat cgccttctat     1260

cgccttcttg acgagttctt ctaataaggg gatcttgaag ttcctattcc gaagttccta     1320

ttctctagaa agtataggaa cttcgaagca gctccagcct acacgctgga atcgtgtagt     1380

ggtgactggt gctgatgtgg gttcttttcg ttatgttgcg acat                      1424


<210>  23
<211>  2262
<212>  DNA
<213>  Escherichia coli


<220>
<221>  CDS
<222>  (1)..(2262)

<400>  23
atg tcg caa cat aac gaa aag aac cca cat cag cac cag tca cca cta         48
Met Ser Gln His Asn Glu Lys Asn Pro His Gln His Gln Ser Pro Leu           
1               5                   10                  15                

cac gat tcc agc gaa gcg aaa ccg ggg atg gac tca ctg gca cct gag         96
His Asp Ser Ser Glu Ala Lys Pro Gly Met Asp Ser Leu Ala Pro Glu           
            20                  25                  30                    

gac ggc tct cat cgt cca gcg gct gaa cca aca ccg cca ggt gca caa        144
Asp Gly Ser His Arg Pro Ala Ala Glu Pro Thr Pro Pro Gly Ala Gln           
        35                  40                  45                        

cct acc gcc cca ggg agc ctg aaa gcc cct gat acg cgt aac gaa aaa        192
Pro Thr Ala Pro Gly Ser Leu Lys Ala Pro Asp Thr Arg Asn Glu Lys           
    50                  55                  60                            

ctt aat tct ctg gaa gac gta cgc aaa ggc agt gaa aat tat gcg ctg        240
Leu Asn Ser Leu Glu Asp Val Arg Lys Gly Ser Glu Asn Tyr Ala Leu           
65                  70                  75                  80            

acc act aat cag ggc gtg cgc atc gcc gac gat caa aac tca ctg cgt        288
Thr Thr Asn Gln Gly Val Arg Ile Ala Asp Asp Gln Asn Ser Leu Arg           
                85                  90                  95                

gcc ggt agc cgt ggt cca acg ctg ctg gaa gat ttt att ctg cgc gag        336
Ala Gly Ser Arg Gly Pro Thr Leu Leu Glu Asp Phe Ile Leu Arg Glu           
            100                 105                 110                   

aaa atc acc cac ttt gac cat gag cgc att ccg gaa cgt att gtt cat        384
Lys Ile Thr His Phe Asp His Glu Arg Ile Pro Glu Arg Ile Val His           
        115                 120                 125                       

gca cgc gga tca gcc gct cac ggt tat ttc cag cca tat aaa agc tta        432
Ala Arg Gly Ser Ala Ala His Gly Tyr Phe Gln Pro Tyr Lys Ser Leu           
    130                 135                 140                           

agc gat att acc aaa gcg gat ttc ctc tca gat ccg aac aaa atc acc        480
Ser Asp Ile Thr Lys Ala Asp Phe Leu Ser Asp Pro Asn Lys Ile Thr           
145                 150                 155                 160           

cca gta ttt gta cgt ttc tct acc gtt cag ggt ggt gct ggc tct gct        528
Pro Val Phe Val Arg Phe Ser Thr Val Gln Gly Gly Ala Gly Ser Ala           
                165                 170                 175               

gat acc gtg cgt gat atc cgt ggc ttt gcc acc aag ttc tat acc gaa        576
Asp Thr Val Arg Asp Ile Arg Gly Phe Ala Thr Lys Phe Tyr Thr Glu           
            180                 185                 190                   

gag ggt att ttt gac ctc gtt ggc aat aac acg cca atc ttc ttt atc        624
Glu Gly Ile Phe Asp Leu Val Gly Asn Asn Thr Pro Ile Phe Phe Ile           
        195                 200                 205                       

cag gat gcg cat aaa ttc ccc gat ttt gtt cat gcg gta aaa cca gaa        672
Gln Asp Ala His Lys Phe Pro Asp Phe Val His Ala Val Lys Pro Glu           
    210                 215                 220                           

ccg cac tgg gca att cca caa ggg caa agt gcc cac gat act ttc tgg        720
Pro His Trp Ala Ile Pro Gln Gly Gln Ser Ala His Asp Thr Phe Trp           
225                 230                 235                 240           

gat tat gtt tct ctg caa cct gaa act ctg cac aac gtg atg tgg gcg        768
Asp Tyr Val Ser Leu Gln Pro Glu Thr Leu His Asn Val Met Trp Ala           
                245                 250                 255               

atg tcg gat cgc ggc atc ccc cgc agt tac cgc acc atg gaa ggc ttc        816
Met Ser Asp Arg Gly Ile Pro Arg Ser Tyr Arg Thr Met Glu Gly Phe           
            260                 265                 270                   

ggt att cac acc ttc cgc ctg att aat gcc gaa ggg aag gca acg ttt        864
Gly Ile His Thr Phe Arg Leu Ile Asn Ala Glu Gly Lys Ala Thr Phe           
        275                 280                 285                       

gta cgt ttc cac tgg aaa cca ctg gca ggt aaa gcc tca ctc gtt tgg        912
Val Arg Phe His Trp Lys Pro Leu Ala Gly Lys Ala Ser Leu Val Trp           
    290                 295                 300                           

gat gaa gca caa aaa ctc acc gga cgt gac ccg gac ttc cac cgc cgc        960
Asp Glu Ala Gln Lys Leu Thr Gly Arg Asp Pro Asp Phe His Arg Arg           
305                 310                 315                 320           

gag ttg tgg gaa gcc att gaa gca ggc gat ttt ccg gaa tac gaa ctg       1008
Glu Leu Trp Glu Ala Ile Glu Ala Gly Asp Phe Pro Glu Tyr Glu Leu           
                325                 330                 335               

ggc ttc cag ttg att cct gaa gaa gat gaa ttc aag ttc gac ttc gat       1056
Gly Phe Gln Leu Ile Pro Glu Glu Asp Glu Phe Lys Phe Asp Phe Asp           
            340                 345                 350                   

ctt ctc gat cca acc aaa ctt atc ccg gaa gaa ctg gtg ccc gtt cag       1104
Leu Leu Asp Pro Thr Lys Leu Ile Pro Glu Glu Leu Val Pro Val Gln           
        355                 360                 365                       

cgt gtc ggc aaa atg gtg ctc aat cgc aac ccg gat aac ttc ttt gct       1152
Arg Val Gly Lys Met Val Leu Asn Arg Asn Pro Asp Asn Phe Phe Ala           
    370                 375                 380                           

gaa aac gaa cag gcg gct ttc cat cct ggg cat atc gtg ccg gga ctg       1200
Glu Asn Glu Gln Ala Ala Phe His Pro Gly His Ile Val Pro Gly Leu           
385                 390                 395                 400           

gac ttc acc aac gat ccg ctg ttg cag gga cgt ttg ttc tcc tat acc       1248
Asp Phe Thr Asn Asp Pro Leu Leu Gln Gly Arg Leu Phe Ser Tyr Thr           
                405                 410                 415               

gat aca caa atc agt cgt ctt ggt ggg ccg aat ttc cat gag att ccg       1296
Asp Thr Gln Ile Ser Arg Leu Gly Gly Pro Asn Phe His Glu Ile Pro           
            420                 425                 430                   

att aac cgt ccg acc tgc cct tac cat aat ttc cag cgt gac ggc atg       1344
Ile Asn Arg Pro Thr Cys Pro Tyr His Asn Phe Gln Arg Asp Gly Met           
        435                 440                 445                       

cat cgc atg ggg atc gac act aac ccg gcg aat tac gaa ccg aac tcg       1392
His Arg Met Gly Ile Asp Thr Asn Pro Ala Asn Tyr Glu Pro Asn Ser           
    450                 455                 460                           

att aac gat aac tgg ccg cgc gaa aca ccg ccg ggg ccg aaa cgc ggc       1440
Ile Asn Asp Asn Trp Pro Arg Glu Thr Pro Pro Gly Pro Lys Arg Gly           
465                 470                 475                 480           

ggt ttt gaa tca tac cag gag cgc gtg gaa ggc aat aaa gtt cgc gag       1488
Gly Phe Glu Ser Tyr Gln Glu Arg Val Glu Gly Asn Lys Val Arg Glu           
                485                 490                 495               

cgc agc cca tcg ttt ggc gaa tat tat tcc cat ccg cgt ctg ttc tgg       1536
Arg Ser Pro Ser Phe Gly Glu Tyr Tyr Ser His Pro Arg Leu Phe Trp           
            500                 505                 510                   

cta agt cag acg cca ttt gag cag cgc cat att gtc gat ggt ttc agt       1584
Leu Ser Gln Thr Pro Phe Glu Gln Arg His Ile Val Asp Gly Phe Ser           
        515                 520                 525                       

ttt gag tta agc aaa gtc gtt cgt ccg tat att cgt gag cgc gtt gtt       1632
Phe Glu Leu Ser Lys Val Val Arg Pro Tyr Ile Arg Glu Arg Val Val           
    530                 535                 540                           

gac cag ctg gcg cat att gat ctc act ctg gcc cag gcg gtg gcg aaa       1680
Asp Gln Leu Ala His Ile Asp Leu Thr Leu Ala Gln Ala Val Ala Lys           
545                 550                 555                 560           

aat ctc ggt atc gaa ctg act gac gac cag ctg aat atc acc cca cct       1728
Asn Leu Gly Ile Glu Leu Thr Asp Asp Gln Leu Asn Ile Thr Pro Pro           
                565                 570                 575               

ccg gac gtc aac ggt ctg aaa aag gat cca tcc tta agt ttg tac gcc       1776
Pro Asp Val Asn Gly Leu Lys Lys Asp Pro Ser Leu Ser Leu Tyr Ala           
            580                 585                 590                   

att cct gac ggt gat gtg aaa ggt cgc gtg gta gcg att tta ctt aat       1824
Ile Pro Asp Gly Asp Val Lys Gly Arg Val Val Ala Ile Leu Leu Asn           
        595                 600                 605                       

gat gaa gtg aga tcg gca gac ctt ctg gcc att ctc aag gcg ctg aag       1872
Asp Glu Val Arg Ser Ala Asp Leu Leu Ala Ile Leu Lys Ala Leu Lys           
    610                 615                 620                           

gcc aaa ggc gtt cat gcc aaa ctg ctc tac tcc cga atg ggt gaa gtg       1920
Ala Lys Gly Val His Ala Lys Leu Leu Tyr Ser Arg Met Gly Glu Val           
625                 630                 635                 640           

act gcg gat gac ggt acg gtg ttg cct ata gcc gct acc ttt gcc ggt       1968
Thr Ala Asp Asp Gly Thr Val Leu Pro Ile Ala Ala Thr Phe Ala Gly           
                645                 650                 655               

gca cct tcg ctg acg gtc gat gcg gtc att gtc cct tgc ggc aat atc       2016
Ala Pro Ser Leu Thr Val Asp Ala Val Ile Val Pro Cys Gly Asn Ile           
            660                 665                 670                   

gcg gat atc gct gac aac ggc gat gcc aac tac tac ctg atg gaa gcc       2064
Ala Asp Ile Ala Asp Asn Gly Asp Ala Asn Tyr Tyr Leu Met Glu Ala           
        675                 680                 685                       

tac aaa cac ctt aaa ccg att gcg ctg gcg ggt gac gcg cgc aag ttt       2112
Tyr Lys His Leu Lys Pro Ile Ala Leu Ala Gly Asp Ala Arg Lys Phe           
    690                 695                 700                           

aaa gca aca atc aag atc gct gac cag ggt gaa gaa ggg att gtg gaa       2160
Lys Ala Thr Ile Lys Ile Ala Asp Gln Gly Glu Glu Gly Ile Val Glu           
705                 710                 715                 720           

gct gac agc gct gac ggt agt ttt atg gat gaa ctg cta acg ctg atg       2208
Ala Asp Ser Ala Asp Gly Ser Phe Met Asp Glu Leu Leu Thr Leu Met           
                725                 730                 735               

gca gca cac cgc gtg tgg tca cgc att cct aag att gac aaa att cct       2256
Ala Ala His Arg Val Trp Ser Arg Ile Pro Lys Ile Asp Lys Ile Pro           
            740                 745                 750                   

gcc tga                                                               2262
Ala                                                                       
                                                                          


<210>  24
<211>  753
<212>  PRT
<213>  Escherichia coli

<400>  24

Met Ser Gln His Asn Glu Lys Asn Pro His Gln His Gln Ser Pro Leu 
1               5                   10                  15      


His Asp Ser Ser Glu Ala Lys Pro Gly Met Asp Ser Leu Ala Pro Glu 
            20                  25                  30          


Asp Gly Ser His Arg Pro Ala Ala Glu Pro Thr Pro Pro Gly Ala Gln 
        35                  40                  45              


Pro Thr Ala Pro Gly Ser Leu Lys Ala Pro Asp Thr Arg Asn Glu Lys 
    50                  55                  60                  


Leu Asn Ser Leu Glu Asp Val Arg Lys Gly Ser Glu Asn Tyr Ala Leu 
65                  70                  75                  80  


Thr Thr Asn Gln Gly Val Arg Ile Ala Asp Asp Gln Asn Ser Leu Arg 
                85                  90                  95      


Ala Gly Ser Arg Gly Pro Thr Leu Leu Glu Asp Phe Ile Leu Arg Glu 
            100                 105                 110         


Lys Ile Thr His Phe Asp His Glu Arg Ile Pro Glu Arg Ile Val His 
        115                 120                 125             


Ala Arg Gly Ser Ala Ala His Gly Tyr Phe Gln Pro Tyr Lys Ser Leu 
    130                 135                 140                 


Ser Asp Ile Thr Lys Ala Asp Phe Leu Ser Asp Pro Asn Lys Ile Thr 
145                 150                 155                 160 


Pro Val Phe Val Arg Phe Ser Thr Val Gln Gly Gly Ala Gly Ser Ala 
                165                 170                 175     


Asp Thr Val Arg Asp Ile Arg Gly Phe Ala Thr Lys Phe Tyr Thr Glu 
            180                 185                 190         


Glu Gly Ile Phe Asp Leu Val Gly Asn Asn Thr Pro Ile Phe Phe Ile 
        195                 200                 205             


Gln Asp Ala His Lys Phe Pro Asp Phe Val His Ala Val Lys Pro Glu 
    210                 215                 220                 


Pro His Trp Ala Ile Pro Gln Gly Gln Ser Ala His Asp Thr Phe Trp 
225                 230                 235                 240 


Asp Tyr Val Ser Leu Gln Pro Glu Thr Leu His Asn Val Met Trp Ala 
                245                 250                 255     


Met Ser Asp Arg Gly Ile Pro Arg Ser Tyr Arg Thr Met Glu Gly Phe 
            260                 265                 270         


Gly Ile His Thr Phe Arg Leu Ile Asn Ala Glu Gly Lys Ala Thr Phe 
        275                 280                 285             


Val Arg Phe His Trp Lys Pro Leu Ala Gly Lys Ala Ser Leu Val Trp 
    290                 295                 300                 


Asp Glu Ala Gln Lys Leu Thr Gly Arg Asp Pro Asp Phe His Arg Arg 
305                 310                 315                 320 


Glu Leu Trp Glu Ala Ile Glu Ala Gly Asp Phe Pro Glu Tyr Glu Leu 
                325                 330                 335     


Gly Phe Gln Leu Ile Pro Glu Glu Asp Glu Phe Lys Phe Asp Phe Asp 
            340                 345                 350         


Leu Leu Asp Pro Thr Lys Leu Ile Pro Glu Glu Leu Val Pro Val Gln 
        355                 360                 365             


Arg Val Gly Lys Met Val Leu Asn Arg Asn Pro Asp Asn Phe Phe Ala 
    370                 375                 380                 


Glu Asn Glu Gln Ala Ala Phe His Pro Gly His Ile Val Pro Gly Leu 
385                 390                 395                 400 


Asp Phe Thr Asn Asp Pro Leu Leu Gln Gly Arg Leu Phe Ser Tyr Thr 
                405                 410                 415     


Asp Thr Gln Ile Ser Arg Leu Gly Gly Pro Asn Phe His Glu Ile Pro 
            420                 425                 430         


Ile Asn Arg Pro Thr Cys Pro Tyr His Asn Phe Gln Arg Asp Gly Met 
        435                 440                 445             


His Arg Met Gly Ile Asp Thr Asn Pro Ala Asn Tyr Glu Pro Asn Ser 
    450                 455                 460                 


Ile Asn Asp Asn Trp Pro Arg Glu Thr Pro Pro Gly Pro Lys Arg Gly 
465                 470                 475                 480 


Gly Phe Glu Ser Tyr Gln Glu Arg Val Glu Gly Asn Lys Val Arg Glu 
                485                 490                 495     


Arg Ser Pro Ser Phe Gly Glu Tyr Tyr Ser His Pro Arg Leu Phe Trp 
            500                 505                 510         


Leu Ser Gln Thr Pro Phe Glu Gln Arg His Ile Val Asp Gly Phe Ser 
        515                 520                 525             


Phe Glu Leu Ser Lys Val Val Arg Pro Tyr Ile Arg Glu Arg Val Val 
    530                 535                 540                 


Asp Gln Leu Ala His Ile Asp Leu Thr Leu Ala Gln Ala Val Ala Lys 
545                 550                 555                 560 


Asn Leu Gly Ile Glu Leu Thr Asp Asp Gln Leu Asn Ile Thr Pro Pro 
                565                 570                 575     


Pro Asp Val Asn Gly Leu Lys Lys Asp Pro Ser Leu Ser Leu Tyr Ala 
            580                 585                 590         


Ile Pro Asp Gly Asp Val Lys Gly Arg Val Val Ala Ile Leu Leu Asn 
        595                 600                 605             


Asp Glu Val Arg Ser Ala Asp Leu Leu Ala Ile Leu Lys Ala Leu Lys 
    610                 615                 620                 


Ala Lys Gly Val His Ala Lys Leu Leu Tyr Ser Arg Met Gly Glu Val 
625                 630                 635                 640 


Thr Ala Asp Asp Gly Thr Val Leu Pro Ile Ala Ala Thr Phe Ala Gly 
                645                 650                 655     


Ala Pro Ser Leu Thr Val Asp Ala Val Ile Val Pro Cys Gly Asn Ile 
            660                 665                 670         


Ala Asp Ile Ala Asp Asn Gly Asp Ala Asn Tyr Tyr Leu Met Glu Ala 
        675                 680                 685             


Tyr Lys His Leu Lys Pro Ile Ala Leu Ala Gly Asp Ala Arg Lys Phe 
    690                 695                 700                 


Lys Ala Thr Ile Lys Ile Ala Asp Gln Gly Glu Glu Gly Ile Val Glu 
705                 710                 715                 720 


Ala Asp Ser Ala Asp Gly Ser Phe Met Asp Glu Leu Leu Thr Leu Met 
                725                 730                 735     


Ala Ala His Arg Val Trp Ser Arg Ile Pro Lys Ile Asp Lys Ile Pro 
            740                 745                 750         


Ala 
    


<210>  25
<211>  25
<212>  DNA
<213>  artificial sequence

<220>
<223>  Primer

<400>  25
gatctgactg gtggtctata gttag                                             25


<210>  26
<211>  25
<212>  DNA
<213>  artificial sequence

<220>
<223>  Primer

<400>  26
gtagttatca tgatgtgtaa gtaag                                             25


<210>  27
<211>  24
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  27
atgaccaaga tcaataactg gcag                                              24


<210>  28
<211>  25
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  28
ttactggttt tcacgcagcc agtcg                                             25


<210>  29
<211>  939
<212>  DNA
<213>  artificial sequence

<220>
<223>  synthetic construct

<400>  29
atgaccaaga tcaataactg gcaggattat cagggttcca gcctgaaacc tgaagatttc       60

gacaaattct gggacgagaa aatcaacctg gttagcaatc accagttcga gtttgaattg      120

attgagaaga acctgagcag caaggtggtg aacttttacc acctgtggtt tacggctatc      180

gacggcgcga agattcacgc acaactgatt gttccgaaga atctgaaaga gaaatacccg      240

gcgatcctgc aatttcatgg ctatcactgt gactccggtg actgggttga caagattggt      300

attgttgccg aaggcaatgt ggttctggca ctggactgcc gtggtcaggg cggtttgagc      360

caagacaata tccaaaccat gggcatgacg atgaaaggtc tgattgtgcg tggtatcgat      420

gagggctatg agaatctgta ctatgtgcgc cagttcatgg acctgatcac ggctacgaag      480

attctgagcg agtttgactt cgttgacgaa accaatatct cggcacaagg cgcgtcgcag      540

ggtggcgcgt tggctgtggc gtgcgcggct ctgtctccgc tgattaagaa ggtgacggca      600

acgtacccgt tcttgagcga ttatcgcaag gcatatgagc tgggtgcgga agaaagcgcc      660

tttgaggagc tgccgtattg gtttcagttc aaagacccgc tgcacctgcg tgaagattgg      720

ttcttcaacc agctggagta tatcgacatt cagaatctgg cccctcgcat taaggcagag      780

gtgatttgga tcctgggtgg caaggatacc gtcgtcccgc cgattacgca aatggcagcg      840

tacaataaga tccagagcaa gaaaagcctg tatgttctgc cggagtatgg ccatgagtac      900

ttgccgaaga ttagcgactg gctgcgtgaa aaccagtaa                             939


<210>  30
<211>  5320
<212>  DNA
<213>  artificial sequence

<220>
<223>  plasmid

<400>  30
gtttgacagc ttatcatcga ctgcacggtg caccaatgct tctggcgtca ggcagccatc       60

ggaagctgtg gtatggctgt gcaggtcgta aatcactgca taattcgtgt cgctcaaggc      120

gcactcccgt tctggataat gttttttgcg ccgacatcat aacggttctg gcaaatattc      180

tgaaatgagc tgttgacaat taatcatccg gctcgtataa tgtgtggaat tgtgagcgga      240

taacaatttc acacaggaaa cagcgccgct gagaaaaagc gaagcggcac tgctctttaa      300

caatttatca gacaatctgt gtgggcactc gaccggaatt atcgattaac tttattatta      360

aaaattaaag aggtatatat taatgtatcg attaaataag gaggaataaa ccatggccct      420

tatgaccaag atcaataact ggcaggatta tcagggttcc agcctgaaac ctgaagattt      480

cgacaaattc tgggacgaga aaatcaacct ggttagcaat caccagttcg agtttgaatt      540

gattgagaag aacctgagca gcaaggtggt gaacttttac cacctgtggt ttacggctat      600

cgacggcgcg aagattcacg cacaactgat tgttccgaag aatctgaaag agaaataccc      660

ggcgatcctg caatttcatg gctatcactg tgactccggt gactgggttg acaagattgg      720

tattgttgcc gaaggcaatg tggttctggc actggactgc cgtggtcagg gcggtttgag      780

ccaagacaat atccaaacca tgggcatgac gatgaaaggt ctgattgtgc gtggtatcga      840

tgagggctat gagaatctgt actatgtgcg ccagttcatg gacctgatca cggctacgaa      900

gattctgagc gagtttgact tcgttgacga aaccaatatc tcggcacaag gcgcgtcgca      960

gggtggcgcg ttggctgtgg cgtgcgcggc tctgtctccg ctgattaaga aggtgacggc     1020

aacgtacccg ttcttgagcg attatcgcaa ggcatatgag ctgggtgcgg aagaaagcgc     1080

ctttgaggag ctgccgtatt ggtttcagtt caaagacccg ctgcacctgc gtgaagattg     1140

gttcttcaac cagctggagt atatcgacat tcagaatctg gcccctcgca ttaaggcaga     1200

ggtgatttgg atcctgggtg gcaaggatac cgtcgtcccg ccgattacgc aaatggcagc     1260

gtacaataag atccagagca agaaaagcct gtatgttctg ccggagtatg gccatgagta     1320

cttgccgaag attagcgact ggctgcgtga aaaccagtaa aagggcgaat tcgaagctta     1380

cgtagaacaa aaactcatct cagaagagga tctgaatagc gccgtcgacc atcatcatca     1440

tcatcattga gtttaaacgg tctccagctt ggctgttttg gcggatgaga gaagattttc     1500

agcctgatac agattaaatc agaacgcaga agcggtctga taaaacagaa tttgcctggc     1560

ggcagtagcg cggtggtccc acctgacccc atgccgaact cagaagtgaa acgccgtagc     1620

gccgatggta gtgtggggtc tccccatgcg agagtaggga actgccaggc atcaaataaa     1680

acgaaaggct cagtcgaaag actgggcctt tcgttttatc tgttgtttgt cggtgaacgc     1740

tctcctgagt aggacaaatc cgccgggagc ggatttgaac gttgcgaagc aacggcccgg     1800

agggtggcgg gcaggacgcc cgccataaac tgccaggcat caaattaagc agaaggccat     1860

cctgacggat ggcctttttg cgtttctaca aactcttttt gtttattttt ctaaatacat     1920

tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa     1980

aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt     2040

tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag     2100

ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt     2160

tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg     2220

gtattatccc gtgttgacgc cgggcaagag caactcggtc gccgcataca ctattctcag     2280

aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta     2340

agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg     2400

acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta     2460

actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac     2520

accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt     2580

actctagctt cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca     2640

cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag     2700

cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta     2760

gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag     2820

ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt     2880

tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat     2940

aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta     3000

gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa     3060

acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt     3120

tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag     3180

ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta     3240

atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca     3300

agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag     3360

cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa     3420

agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga     3480

acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc     3540

gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc     3600

ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt     3660

gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt     3720

gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag     3780

gaagcggaag agcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac     3840

cgcatatggt gcactctcag tacaatctgc tctgatgccg catagttaag ccagtataca     3900

ctccgctatc gctacgtgac tgggtcatgg ctgcgccccg acacccgcca acacccgctg     3960

acgcgccctg acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct     4020

ccgggagctg catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg aggcagcaga     4080

tcaattcgcg cgcgaaggcg aagcggcatg catttacgtt gacaccatcg aatggtgcaa     4140

aacctttcgc ggtatggcat gatagcgccc ggaagagagt caattcaggg tggtgaatgt     4200

gaaaccagta acgttatacg atgtcgcaga gtatgccggt gtctcttatc agaccgtttc     4260

ccgcgtggtg aaccaggcca gccacgtttc tgcgaaaacg cgggaaaaag tggaagcggc     4320

gatggcggag ctgaattaca ttcccaaccg cgtggcacaa caactggcgg gcaaacagtc     4380

gttgctgatt ggcgttgcca cctccagtct ggccctgcac gcgccgtcgc aaattgtcgc     4440

ggcgattaaa tctcgcgccg atcaactggg tgccagcgtg gtggtgtcga tggtagaacg     4500

aagcggcgtc gaagcctgta aagcggcggt gcacaatctt ctcgcgcaac gcgtcagtgg     4560

gctgatcatt aactatccgc tggatgacca ggatgccatt gctgtggaag ctgcctgcac     4620

taatgttccg gcgttatttc ttgatgtctc tgaccagaca cccatcaaca gtattatttt     4680

ctcccatgaa gacggtacgc gactgggcgt ggagcatctg gtcgcattgg gtcaccagca     4740

aatcgcgctg ttagcgggcc cattaagttc tgtctcggcg cgtctgcgtc tggctggctg     4800

gcataaatat ctcactcgca atcaaattca gccgatagcg gaacgggaag gcgactggag     4860

tgccatgtcc ggttttcaac aaaccatgca aatgctgaat gagggcatcg ttcccactgc     4920

gatgctggtt gccaacgatc agatggcgct gggcgcaatg cgcgccatta ccgagtccgg     4980

gctgcgcgtt ggtgcggata tctcggtagt gggatacgac gataccgaag acagctcatg     5040

ttatatcccg ccgtcaacca ccatcaaaca ggattttcgc ctgctggggc aaaccagcgt     5100

ggaccgcttg ctgcaactct ctcagggcca ggcggtgaag ggcaatcagc tgttgcccgt     5160

ctcactggtg aaaagaaaaa ccaccctggc gcccaatacg caaaccgcct ctccccgcgc     5220

gttggccgat tcattaatgc agctggcacg acaggtttcc cgactggaaa gcgggcagtg     5280

agcgcaacgc aattaatgtg agttagcgcg aattgatctg                           5320


<210>  31
<211>  27
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  31
atgccgttcc cggatctgat ccagccg                                           27


<210>  32
<211>  28
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  32
ttaacccaca ccgaacagac ggctcaac                                          28


<210>  33
<211>  972
<212>  DNA
<213>  artificial sequence

<220>
<223>  synthetic construct

<400>  33
atgccgttcc cggatctgat ccagccggaa ctgggcgcat acgtgagcag cgttggtatg       60

cctgatgact tcgcgcaatt ctggacgagc acgattgctg aggcgcgtca agctggcggc      120

gaggtcagca tcgtccaggc ccaaaccacc ctgaaggctg ttcagagctt tgatgttacc      180

ttccctggct acggcggcca tccgatcaag ggctggttga tcctgccgac tcatcacaaa      240

ggtcgtctgc cgctggtggt gcaatacatc ggttatggtg gcggccgtgg tttggcacat      300

gaacagctgc attgggccgc ctcgggcttt gcctatttcc gtatggatac ccgcggtcaa      360

ggttccgatt ggagcgtggg cgagactgcc gatccggtcg gtagcaccag cagcattccg      420

ggtttcatga cccgtggcgt tctggacaag aacgactatt actatcgtcg cttgtttacc      480

gacgcggttc gtgcgattga cgcgctgttg ggtctggact ttgttgatcc ggagcgtatc      540

gcggtttgcg gtgactccca aggtggcggt attagcctgg cagttggcgg catcgacccg      600

cgtgtgaagg cggtgatgcc ggacgtcccg ttcttgtgtg actttccgcg tgcggtccag      660

accgcggtgc gtgatccgta tctggagatc gtccgtttcc tggctcagca ccgtgagaag      720

aaagcggcag tcttcgaaac cttgaactac tttgactgcg tcaatttcgc ccgtcgctcc      780

aaagccccgg cactgtttag cgtggccctg atggacgagg tttgccctcc aagcactgtc      840

tatggtgcgt ttaacgctta tgctggcgag aaaaccatta cggaatacga gtttaacaac      900

cacgagggcg gtcagggtta ccaggaacgt caacaaatga cctggttgag ccgtctgttc      960

ggtgtgggtt aa                                                          972


<210>  34
<211>  5353
<212>  DNA
<213>  artificial sequence

<220>
<223>  plasmid

<400>  34
gtttgacagc ttatcatcga ctgcacggtg caccaatgct tctggcgtca ggcagccatc       60

ggaagctgtg gtatggctgt gcaggtcgta aatcactgca taattcgtgt cgctcaaggc      120

gcactcccgt tctggataat gttttttgcg ccgacatcat aacggttctg gcaaatattc      180

tgaaatgagc tgttgacaat taatcatccg gctcgtataa tgtgtggaat tgtgagcgga      240

taacaatttc acacaggaaa cagcgccgct gagaaaaagc gaagcggcac tgctctttaa      300

caatttatca gacaatctgt gtgggcactc gaccggaatt atcgattaac tttattatta      360

aaaattaaag aggtatatat taatgtatcg attaaataag gaggaataaa ccatggccct      420

tatgccgttc ccggatctga tccagccgga actgggcgca tacgtgagca gcgttggtat      480

gcctgatgac ttcgcgcaat tctggacgag cacgattgct gaggcgcgtc aagctggcgg      540

cgaggtcagc atcgtccagg cccaaaccac cctgaaggct gttcagagct ttgatgttac      600

cttccctggc tacggcggcc atccgatcaa gggctggttg atcctgccga ctcatcacaa      660

aggtcgtctg ccgctggtgg tgcaatacat cggttatggt ggcggccgtg gtttggcaca      720

tgaacagctg cattgggccg cctcgggctt tgcctatttc cgtatggata cccgcggtca      780

aggttccgat tggagcgtgg gcgagactgc cgatccggtc ggtagcacca gcagcattcc      840

gggtttcatg acccgtggcg ttctggacaa gaacgactat tactatcgtc gcttgtttac      900

cgacgcggtt cgtgcgattg acgcgctgtt gggtctggac tttgttgatc cggagcgtat      960

cgcggtttgc ggtgactccc aaggtggcgg tattagcctg gcagttggcg gcatcgaccc     1020

gcgtgtgaag gcggtgatgc cggacgtccc gttcttgtgt gactttccgc gtgcggtcca     1080

gaccgcggtg cgtgatccgt atctggagat cgtccgtttc ctggctcagc accgtgagaa     1140

gaaagcggca gtcttcgaaa ccttgaacta ctttgactgc gtcaatttcg cccgtcgctc     1200

caaagccccg gcactgttta gcgtggccct gatggacgag gtttgccctc caagcactgt     1260

ctatggtgcg tttaacgctt atgctggcga gaaaaccatt acggaatacg agtttaacaa     1320

ccacgagggc ggtcagggtt accaggaacg tcaacaaatg acctggttga gccgtctgtt     1380

cggtgtgggt taaaagggcg aattcgaagc ttacgtagaa caaaaactca tctcagaaga     1440

ggatctgaat agcgccgtcg accatcatca tcatcatcat tgagtttaaa cggtctccag     1500

cttggctgtt ttggcggatg agagaagatt ttcagcctga tacagattaa atcagaacgc     1560

agaagcggtc tgataaaaca gaatttgcct ggcggcagta gcgcggtggt cccacctgac     1620

cccatgccga actcagaagt gaaacgccgt agcgccgatg gtagtgtggg gtctccccat     1680

gcgagagtag ggaactgcca ggcatcaaat aaaacgaaag gctcagtcga aagactgggc     1740

ctttcgtttt atctgttgtt tgtcggtgaa cgctctcctg agtaggacaa atccgccggg     1800

agcggatttg aacgttgcga agcaacggcc cggagggtgg cgggcaggac gcccgccata     1860

aactgccagg catcaaatta agcagaaggc catcctgacg gatggccttt ttgcgtttct     1920

acaaactctt tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa     1980

taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc     2040

cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa     2100

acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa     2160

ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg     2220

atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtgttga cgccgggcaa     2280

gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc     2340

acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc     2400

atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta     2460

accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag     2520

ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca     2580

acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca acaattaata     2640

gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc     2700

tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca     2760

ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca     2820

actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg     2880

taactgtcag accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa     2940

tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt     3000

gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat     3060

cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg     3120

gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga     3180

gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac     3240

tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt     3300

ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag     3360

cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc     3420

gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag     3480

gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca     3540

gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt     3600

cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc     3660

tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc     3720

cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc     3780

cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgcct gatgcggtat     3840

tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc     3900

tgctctgatg ccgcatagtt aagccagtat acactccgct atcgctacgt gactgggtca     3960

tggctgcgcc ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc     4020

cggcatccgc ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt     4080

caccgtcatc accgaaacgc gcgaggcagc agatcaattc gcgcgcgaag gcgaagcggc     4140

atgcatttac gttgacacca tcgaatggtg caaaaccttt cgcggtatgg catgatagcg     4200

cccggaagag agtcaattca gggtggtgaa tgtgaaacca gtaacgttat acgatgtcgc     4260

agagtatgcc ggtgtctctt atcagaccgt ttcccgcgtg gtgaaccagg ccagccacgt     4320

ttctgcgaaa acgcgggaaa aagtggaagc ggcgatggcg gagctgaatt acattcccaa     4380

ccgcgtggca caacaactgg cgggcaaaca gtcgttgctg attggcgttg ccacctccag     4440

tctggccctg cacgcgccgt cgcaaattgt cgcggcgatt aaatctcgcg ccgatcaact     4500

gggtgccagc gtggtggtgt cgatggtaga acgaagcggc gtcgaagcct gtaaagcggc     4560

ggtgcacaat cttctcgcgc aacgcgtcag tgggctgatc attaactatc cgctggatga     4620

ccaggatgcc attgctgtgg aagctgcctg cactaatgtt ccggcgttat ttcttgatgt     4680

ctctgaccag acacccatca acagtattat tttctcccat gaagacggta cgcgactggg     4740

cgtggagcat ctggtcgcat tgggtcacca gcaaatcgcg ctgttagcgg gcccattaag     4800

ttctgtctcg gcgcgtctgc gtctggctgg ctggcataaa tatctcactc gcaatcaaat     4860

tcagccgata gcggaacggg aaggcgactg gagtgccatg tccggttttc aacaaaccat     4920

gcaaatgctg aatgagggca tcgttcccac tgcgatgctg gttgccaacg atcagatggc     4980

gctgggcgca atgcgcgcca ttaccgagtc cgggctgcgc gttggtgcgg atatctcggt     5040

agtgggatac gacgataccg aagacagctc atgttatatc ccgccgtcaa ccaccatcaa     5100

acaggatttt cgcctgctgg ggcaaaccag cgtggaccgc ttgctgcaac tctctcaggg     5160

ccaggcggtg aagggcaatc agctgttgcc cgtctcactg gtgaaaagaa aaaccaccct     5220

ggcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc     5280

acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc     5340

gcgaattgat ctg                                                        5353


<210>  35
<211>  27
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  35
atgtttgata tgccgctggc ccagttg                                           27


<210>  36
<211>  26
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  36
ttatttgaca ttgcgctttt gaatcg                                            26


<210>  37
<211>  990
<212>  DNA
<213>  artificial sequence

<220>
<223>  synthetic construct

<400>  37
atgtttgata tgccgctggc ccagttgcag aaatacatgg gtacgaaccc taaaccggca       60

gattttgcag acttttggag ccgtgctctg gaggagctga gcgcccagtc gctgcactac      120

gagctgatcc cagcgacgtt ccagactacc gtcgcaagct gctaccacct gtactttacg      180

ggtgttggcg gtgcacgcgt tcactgtcaa ctggtgaaac cgcgcgagca aaaacagaaa      240

ggtccgggcc tggtgtggtt tcatggctat cataccaata gcggtgactg ggtcgacaag      300

ctggcatacg cggcagcagg cttcaccgtt ttggcgatgg actgccgtgg tcaaggtggt      360

aagagcgagg ataatctgca agtgaaaggc ccgaccctga agggccatat cattcgtggt      420

atcgaggacc cgaatccgca tcatttgtat taccgtaacg tgtttctgga cactgtccaa      480

gctgtccgta ttctgtgttc catggatcac atcgatcgtg aacgcatcgg tgtgtatggc      540

gctagccagg gtggtgccct ggcgctggcg tgtgcggcgc tggagccgtc tgtcgttaag      600

aaagccgttg ttctgtaccc attcctgagc gactataagc gtgcgcaaga gctggacatg      660

aaaaacacgg cgtatgaaga aatccactac tatttccgtt tcctggatcc gacccacgaa      720

cgcgaggaag aggttttcta taagctgggc tacattgaca tccagctgct ggcggatcgt      780

atctgcgcgg acgtgctgtg ggccgttgct ctggaagatc acatttgccc tccaagcacc      840

cagttcgcgg tgtataacaa gattaagtcc aagaaagata tggtgttgtt ctacgaatac      900

ggtcatgaat acctgccgac catgggcgat cgcgcctatt tgttcttctg tccgattttc      960

tttccgattc aaaagcgcaa tgtcaaataa                                       990


<210>  38
<211>  5371
<212>  DNA
<213>  artificial sequence

<220>
<223>  plasmid

<400>  38
gtttgacagc ttatcatcga ctgcacggtg caccaatgct tctggcgtca ggcagccatc       60

ggaagctgtg gtatggctgt gcaggtcgta aatcactgca taattcgtgt cgctcaaggc      120

gcactcccgt tctggataat gttttttgcg ccgacatcat aacggttctg gcaaatattc      180

tgaaatgagc tgttgacaat taatcatccg gctcgtataa tgtgtggaat tgtgagcgga      240

taacaatttc acacaggaaa cagcgccgct gagaaaaagc gaagcggcac tgctctttaa      300

caatttatca gacaatctgt gtgggcactc gaccggaatt atcgattaac tttattatta      360

aaaattaaag aggtatatat taatgtatcg attaaataag gaggaataaa ccatggccct      420

tatgtttgat atgccgctgg cccagttgca gaaatacatg ggtacgaacc ctaaaccggc      480

agattttgca gacttttgga gccgtgctct ggaggagctg agcgcccagt cgctgcacta      540

cgagctgatc ccagcgacgt tccagactac cgtcgcaagc tgctaccacc tgtactttac      600

gggtgttggc ggtgcacgcg ttcactgtca actggtgaaa ccgcgcgagc aaaaacagaa      660

aggtccgggc ctggtgtggt ttcatggcta tcataccaat agcggtgact gggtcgacaa      720

gctggcatac gcggcagcag gcttcaccgt tttggcgatg gactgccgtg gtcaaggtgg      780

taagagcgag gataatctgc aagtgaaagg cccgaccctg aagggccata tcattcgtgg      840

tatcgaggac ccgaatccgc atcatttgta ttaccgtaac gtgtttctgg acactgtcca      900

agctgtccgt attctgtgtt ccatggatca catcgatcgt gaacgcatcg gtgtgtatgg      960

cgctagccag ggtggtgccc tggcgctggc gtgtgcggcg ctggagccgt ctgtcgttaa     1020

gaaagccgtt gttctgtacc cattcctgag cgactataag cgtgcgcaag agctggacat     1080

gaaaaacacg gcgtatgaag aaatccacta ctatttccgt ttcctggatc cgacccacga     1140

acgcgaggaa gaggttttct ataagctggg ctacattgac atccagctgc tggcggatcg     1200

tatctgcgcg gacgtgctgt gggccgttgc tctggaagat cacatttgcc ctccaagcac     1260

ccagttcgcg gtgtataaca agattaagtc caagaaagat atggtgttgt tctacgaata     1320

cggtcatgaa tacctgccga ccatgggcga tcgcgcctat ttgttcttct gtccgatttt     1380

ctttccgatt caaaagcgca atgtcaaata aaagggcgaa ttcgaagctt acgtagaaca     1440

aaaactcatc tcagaagagg atctgaatag cgccgtcgac catcatcatc atcatcattg     1500

agtttaaacg gtctccagct tggctgtttt ggcggatgag agaagatttt cagcctgata     1560

cagattaaat cagaacgcag aagcggtctg ataaaacaga atttgcctgg cggcagtagc     1620

gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag cgccgatggt     1680

agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa aacgaaaggc     1740

tcagtcgaaa gactgggcct ttcgttttat ctgttgtttg tcggtgaacg ctctcctgag     1800

taggacaaat ccgccgggag cggatttgaa cgttgcgaag caacggcccg gagggtggcg     1860

ggcaggacgc ccgccataaa ctgccaggca tcaaattaag cagaaggcca tcctgacgga     1920

tggccttttt gcgtttctac aaactctttt tgtttatttt tctaaataca ttcaaatatg     1980

tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt     2040

atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct     2100

gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca     2160

cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc     2220

gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc     2280

cgtgttgacg ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg     2340

gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta     2400

tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc     2460

ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt     2520

gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg     2580

cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct     2640

tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc     2700

tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct     2760

cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac     2820

acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc     2880

tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat     2940

ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg     3000

accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc     3060

aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa     3120

ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag     3180

gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta     3240

ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta     3300

ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag     3360

ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg     3420

gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg     3480

cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag     3540

cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc     3600

cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa     3660

aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg     3720

ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct     3780

gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa     3840

gagcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcatatgg     3900

tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat     3960

cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct     4020

gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct     4080

gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagcag atcaattcgc     4140

gcgcgaaggc gaagcggcat gcatttacgt tgacaccatc gaatggtgca aaacctttcg     4200

cggtatggca tgatagcgcc cggaagagag tcaattcagg gtggtgaatg tgaaaccagt     4260

aacgttatac gatgtcgcag agtatgccgg tgtctcttat cagaccgttt cccgcgtggt     4320

gaaccaggcc agccacgttt ctgcgaaaac gcgggaaaaa gtggaagcgg cgatggcgga     4380

gctgaattac attcccaacc gcgtggcaca acaactggcg ggcaaacagt cgttgctgat     4440

tggcgttgcc acctccagtc tggccctgca cgcgccgtcg caaattgtcg cggcgattaa     4500

atctcgcgcc gatcaactgg gtgccagcgt ggtggtgtcg atggtagaac gaagcggcgt     4560

cgaagcctgt aaagcggcgg tgcacaatct tctcgcgcaa cgcgtcagtg ggctgatcat     4620

taactatccg ctggatgacc aggatgccat tgctgtggaa gctgcctgca ctaatgttcc     4680

ggcgttattt cttgatgtct ctgaccagac acccatcaac agtattattt tctcccatga     4740

agacggtacg cgactgggcg tggagcatct ggtcgcattg ggtcaccagc aaatcgcgct     4800

gttagcgggc ccattaagtt ctgtctcggc gcgtctgcgt ctggctggct ggcataaata     4860

tctcactcgc aatcaaattc agccgatagc ggaacgggaa ggcgactgga gtgccatgtc     4920

cggttttcaa caaaccatgc aaatgctgaa tgagggcatc gttcccactg cgatgctggt     4980

tgccaacgat cagatggcgc tgggcgcaat gcgcgccatt accgagtccg ggctgcgcgt     5040

tggtgcggat atctcggtag tgggatacga cgataccgaa gacagctcat gttatatccc     5100

gccgtcaacc accatcaaac aggattttcg cctgctgggg caaaccagcg tggaccgctt     5160

gctgcaactc tctcagggcc aggcggtgaa gggcaatcag ctgttgcccg tctcactggt     5220

gaaaagaaaa accaccctgg cgcccaatac gcaaaccgcc tctccccgcg cgttggccga     5280

ttcattaatg cagctggcac gacaggtttc ccgactggaa agcgggcagt gagcgcaacg     5340

caattaatgt gagttagcgc gaattgatct g                                    5371


<210>  39
<211>  325
<212>  PRT
<213>  Thermotoga neapolitana

<400>  39

Met Ala Phe Phe Asp Met Pro Leu Glu Glu Leu Lys Lys Tyr Arg Pro 
1               5                   10                  15      


Glu Arg Tyr Glu Glu Lys Asp Phe Asp Glu Phe Trp Arg Glu Thr Leu 
            20                  25                  30          


Lys Glu Ser Glu Gly Phe Pro Leu Asp Pro Val Phe Glu Lys Val Asp 
        35                  40                  45              


Phe His Leu Lys Thr Val Glu Thr Tyr Asp Val Thr Phe Ser Gly Tyr 
    50                  55                  60                  


Arg Gly Gln Arg Ile Lys Gly Trp Leu Leu Val Pro Lys Leu Ala Glu 
65                  70                  75                  80  


Glu Lys Leu Pro Cys Val Val Gln Tyr Ile Gly Tyr Asn Gly Gly Arg 
                85                  90                  95      


Gly Phe Pro His Asp Trp Leu Phe Trp Pro Ser Met Gly Tyr Ile Cys 
            100                 105                 110         


Phe Val Met Asp Thr Arg Gly Gln Gly Ser Gly Trp Met Lys Gly Asp 
        115                 120                 125             


Thr Pro Asp Tyr Pro Glu Gly Pro Val Asp Pro Gln Tyr Pro Gly Phe 
    130                 135                 140                 


Met Thr Arg Gly Ile Leu Asp Pro Gly Thr Tyr Tyr Tyr Arg Arg Val 
145                 150                 155                 160 


Phe Val Asp Ala Val Arg Ala Val Glu Ala Ala Ile Ser Phe Pro Arg 
                165                 170                 175     


Val Asp Ser Arg Lys Val Val Val Ala Gly Gly Ser Gln Gly Gly Gly 
            180                 185                 190         


Ile Ala Leu Ala Val Ser Ala Leu Ser Asn Arg Val Lys Ala Leu Leu 
        195                 200                 205             


Cys Asp Val Pro Phe Leu Cys His Phe Arg Arg Ala Val Gln Leu Val 
    210                 215                 220                 


Asp Thr His Pro Tyr Val Glu Ile Thr Asn Phe Leu Lys Thr His Arg 
225                 230                 235                 240 


Asp Lys Glu Glu Ile Val Phe Arg Thr Leu Ser Tyr Phe Asp Gly Val 
                245                 250                 255     


Asn Phe Ala Ala Arg Ala Lys Val Pro Ala Leu Phe Ser Val Gly Leu 
            260                 265                 270         


Met Asp Thr Ile Cys Pro Pro Ser Thr Val Phe Ala Ala Tyr Asn His 
        275                 280                 285             


Tyr Ala Gly Pro Lys Glu Ile Arg Ile Tyr Pro Tyr Asn Asn His Glu 
    290                 295                 300                 


Gly Gly Gly Ser Phe Gln Ala Ile Glu Gln Val Lys Phe Leu Lys Arg 
305                 310                 315                 320 


Leu Phe Glu Glu Gly 
                325 


<210>  40
<211>  325
<212>  PRT
<213>  Thermotoga maritima MSB8

<400>  40

Met Ala Phe Phe Asp Leu Pro Leu Glu Glu Leu Lys Lys Tyr Arg Pro 
1               5                   10                  15      


Glu Arg Tyr Glu Glu Lys Asp Phe Asp Glu Phe Trp Glu Glu Thr Leu 
            20                  25                  30          


Ala Glu Ser Glu Lys Phe Pro Leu Asp Pro Val Phe Glu Arg Met Glu 
        35                  40                  45              


Ser His Leu Lys Thr Val Glu Ala Tyr Asp Val Thr Phe Ser Gly Tyr 
    50                  55                  60                  


Arg Gly Gln Arg Ile Lys Gly Trp Leu Leu Val Pro Lys Leu Glu Glu 
65                  70                  75                  80  


Glu Lys Leu Pro Cys Val Val Gln Tyr Ile Gly Tyr Asn Gly Gly Arg 
                85                  90                  95      


Gly Phe Pro His Asp Trp Leu Phe Trp Pro Ser Met Gly Tyr Ile Cys 
            100                 105                 110         


Phe Val Met Asp Thr Arg Gly Gln Gly Ser Gly Trp Leu Lys Gly Asp 
        115                 120                 125             


Thr Pro Asp Tyr Pro Glu Gly Pro Val Asp Pro Gln Tyr Pro Gly Phe 
    130                 135                 140                 


Met Thr Arg Gly Ile Leu Asp Pro Arg Thr Tyr Tyr Tyr Arg Arg Val 
145                 150                 155                 160 


Phe Thr Asp Ala Val Arg Ala Val Glu Ala Ala Ala Ser Phe Pro Gln 
                165                 170                 175     


Val Asp Gln Glu Arg Ile Val Ile Ala Gly Gly Ser Gln Gly Gly Gly 
            180                 185                 190         


Ile Ala Leu Ala Val Ser Ala Leu Ser Lys Lys Ala Lys Ala Leu Leu 
        195                 200                 205             


Cys Asp Val Pro Phe Leu Cys His Phe Arg Arg Ala Val Gln Leu Val 
    210                 215                 220                 


Asp Thr His Pro Tyr Ala Glu Ile Thr Asn Phe Leu Lys Thr His Arg 
225                 230                 235                 240 


Asp Lys Glu Glu Ile Val Phe Arg Thr Leu Ser Tyr Phe Asp Gly Val 
                245                 250                 255     


Asn Phe Ala Ala Arg Ala Lys Ile Pro Ala Leu Phe Ser Val Gly Leu 
            260                 265                 270         


Met Asp Asn Ile Cys Pro Pro Ser Thr Val Phe Ala Ala Tyr Asn Tyr 
        275                 280                 285             


Tyr Ala Gly Pro Lys Glu Ile Arg Ile Tyr Pro Tyr Asn Asn His Glu 
    290                 295                 300                 


Gly Gly Gly Ser Phe Gln Ala Val Glu Gln Val Lys Phe Leu Lys Lys 
305                 310                 315                 320 


Leu Phe Glu Lys Gly 
                325 


<210>  41
<211>  320
<212>  PRT
<213>  Bacillus pumilus

<400>  41

Met Gln Leu Phe Asp Leu Ser Leu Glu Glu Leu Lys Lys Tyr Lys Pro 
1               5                   10                  15      


Lys Lys Thr Ala Arg Pro Asp Phe Ser Asp Phe Trp Lys Lys Ser Leu 
            20                  25                  30          


Glu Glu Leu Arg Gln Val Glu Ala Glu Pro Thr Leu Glu Ser Tyr Asp 
        35                  40                  45              


Tyr Pro Val Lys Gly Val Lys Val Tyr Arg Leu Thr Tyr Gln Ser Phe 
    50                  55                  60                  


Gly His Ser Lys Ile Glu Gly Phe Tyr Ala Val Pro Asp Gln Thr Gly 
65                  70                  75                  80  


Pro His Pro Ala Leu Val Arg Phe His Gly Tyr Asn Ala Ser Tyr Asp 
                85                  90                  95      


Gly Gly Ile His Asp Ile Val Asn Trp Ala Leu His Gly Tyr Ala Thr 
            100                 105                 110         


Phe Gly Met Leu Val Arg Gly Gln Gly Gly Ser Glu Asp Thr Ser Val 
        115                 120                 125             


Thr Pro Gly Gly His Ala Leu Gly Trp Met Thr Lys Gly Ile Leu Ser 
    130                 135                 140                 


Lys Asp Thr Tyr Tyr Tyr Arg Gly Val Tyr Leu Asp Ala Val Arg Ala 
145                 150                 155                 160 


Leu Glu Val Ile Gln Ser Phe Pro Glu Val Asp Glu His Arg Ile Gly 
                165                 170                 175     


Val Ile Gly Gly Ser Gln Gly Gly Ala Leu Ala Ile Ala Ala Ala Ala 
            180                 185                 190         


Leu Ser Asp Ile Pro Lys Val Val Val Ala Asp Tyr Pro Tyr Leu Ser 
        195                 200                 205             


Asn Phe Glu Arg Ala Val Asp Val Ala Leu Glu Gln Pro Tyr Leu Glu 
    210                 215                 220                 


Ile Asn Ser Tyr Phe Arg Arg Asn Ser Asp Pro Lys Val Glu Glu Lys 
225                 230                 235                 240 


Ala Phe Glu Thr Leu Ser Tyr Phe Asp Leu Ile Asn Leu Ala Gly Trp 
                245                 250                 255     


Val Lys Gln Pro Thr Leu Met Ala Ile Gly Leu Ile Asp Lys Ile Thr 
            260                 265                 270         


Pro Pro Ser Thr Val Phe Ala Ala Tyr Asn His Leu Glu Thr Asp Lys 
        275                 280                 285             


Asp Leu Lys Val Tyr Arg Tyr Phe Gly His Glu Phe Ile Pro Ala Phe 
    290                 295                 300                 


Gln Thr Glu Lys Leu Ser Phe Leu Gln Lys His Leu Leu Leu Ser Thr 
305                 310                 315                 320 

