                         SEQUENCE LISTING

<110>  Glycosyn LLC
       Heidtman, Matthew I.
       Merighi, Massimo
       McCoy, John M.
 
<120>  Alpha (1,2) Fucosyltransferases Suitable for Use in the 
       Production of Fucosylated Oligosaccharides

<130>  37847-510001WO

<140>  PCT/US13/051777
<141>  2013-07-24

<150>  US 13/557,655
<151>  2012-07-25

<160>  23    

<170>  PatentIn version 3.5

<210>  1
<211>  6244
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pG171

<400>  1
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatatatg cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc gcatcaggcg      240

cctcctcaac ctgtatattc gtaaaccacg cccaatggga gctgtctcag gtttgttcct      300

gattggttac ggcgcgtttc gcatcattgt tgagtttttc cgccagcccg acgcgcagtt      360

taccggtgcc tgggtgcagt acatcagcat ggggcaaatt ctttccatcc cgatgattgt      420

cgcgggtgtg atcatgatgg tctgggcata tcgtcgcagc ccacagcaac acgtttcctg      480

aggaaccatg aaacagtatt tagaactgat gcaaaaagtg ctcgacgaag gcacacagaa      540

aaacgaccgt accggaaccg gaacgctttc catttttggt catcagatgc gttttaacct      600

gcaagatgga ttcccgctgg tgacaactaa acgttgccac ctgcgttcca tcatccatga      660

actgctgtgg tttctgcagg gcgacactaa cattgcttat ctacacgaaa acaatgtcac      720

catctgggac gaatgggccg atgaaaacgg cgacctcggg ccagtgtatg gtaaacagtg      780

gcgcgcctgg ccaacgccag atggtcgtca tattgaccag atcactacgg tactgaacca      840

gctgaaaaac gacccggatt cgcgccgcat tattgtttca gcgtggaacg taggcgaact      900

ggataaaatg gcgctggcac cgtgccatgc attcttccag ttctatgtgg cagacggcaa      960

actctcttgc cagctttatc agcgctcctg tgacgtcttc ctcggcctgc cgttcaacat     1020

tgccagctac gcgttattgg tgcatatgat ggcgcagcag tgcgatctgg aagtgggtga     1080

ttttgtctgg accggtggcg acacgcatct gtacagcaac catatggatc aaactcatct     1140

gcaattaagc cgcgaaccgc gtccgctgcc gaagttgatt atcaaacgta aacccgaatc     1200

catcttcgac taccgtttcg aagactttga gattgaaggc tacgatccgc atccgggcat     1260

taaagcgccg gtggctatct aattacgaaa catcctgcca gagccgacgc cagtgtgcgt     1320

cggttttttt accctccgtt aaattcttcg agacgccttc ccgaaggcgc cattcgccat     1380

tcaggctgcg caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc     1440

tggcgaaagg gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt     1500

cacgacgttg taaaacgacg gccagtgcca agctttcttt aatgaagcag ggcatcagga     1560

cggtatcttt gtggagaaag cagagtaatc ttattcagcc tgactggtgg gaaaccacca     1620

gtcagaatgt gttagcgcat gttgacaaaa ataccattag tcacattatc cgtcagtcgg     1680

acgacatggt agataacctg tttattatgc gttttgatct tacgtttaat attaccttta     1740

tgcgatgaaa cggtcttggc tttgatattc atttggtcag agatttgaat ggttccctga     1800

cctgccatcc acattcgcaa catactcgat tcggttcggc tcaatgataa cgtcggcata     1860

tttaaaaacg aggttatcgt tgtctctttt ttcagaatat cgccaaggat atcgtcgaga     1920

gattccggtt taatcgattt agaactgatc aataaatttt ttctgaccaa tagatattca     1980

tcaaaatgaa cattggcaat tgccataaaa acgataaata acgtattggg atgttgatta     2040

atgatgagct tgatacgctg actgttagaa gcatcgtgga tgaaacagtc ctcattaata     2100

aacaccactg aagggcgctg tgaatcacaa gctatggcaa ggtcatcaac ggtttcaatg     2160

tcgttgattt ctcttttttt aacccctcta ctcaacagat acccggttaa acctagtcgg     2220

gtgtaactac ataaatccat aataatcgtt gacatggcat accctcactc aatgcgtaac     2280

gataattccc cttacctgaa tatttcatca tgactaaacg gaacaacatg ggtcacctaa     2340

tgcgccactc tcgcgatttt tcaggcggac ttactatccc gtaaagtgtt gtataatttg     2400

cctggaattg tcttaaagta aagtaaatgt tgcgatatgt gagtgagctt aaaacaaata     2460

tttcgctgca ggagtatcct ggaagatgtt cgtagaagct tactgctcac aagaaaaaag     2520

gcacgtcatc tgacgtgcct tttttatttg tactaccctg tacgattact gcagctcgag     2580

tttaattcaa atcttcttca gaaatcaatt tttgttcagc gttatacttt tgggatttta     2640

cctcaaaatg ggattctatt ttcacccact ccttacaaag gatattctca tgcccaaaaa     2700

gccagtgttt ggggccaata atgatttttt ctggattttc tatcaaatag gccgcccacc     2760

agctataagt gctattagcg ataatgccat gctgacaaga ttgcatgagc agcatgtccc     2820

aatacgcctc ttcttcttta tccctagtgg tcatgtccat aaaagggtag ccaagatcaa     2880

gattttgcgt gaattctaag tcttcgcaaa acacaaaaag ctccatgttt ggcacgcgct     2940

ttgccatata ctcaagcgcc tttttttgat agtcaatacc aagctgacag ccaatcccca     3000

cataatcccc tcttcttata tgcacaaaca cgctgttttt agcggctaaa atcaaagaaa     3060

gcttgcactg atattcttcc tcttttttat tattattctt attattttcg ggtggtggtg     3120

gtagagtgaa ggtttgcttg attaaagggg atatagcatc aaagtatcgt ggatcttgga     3180

aatagccaaa aaaataagtc aagcggcttg gctttagcaa tttaggctcg tattcaaaaa     3240

cgatttcttg actcacccta tcaaatccca tgcatttgag cgcgtctctt actagcttgg     3300

ggaggtgttg cattttagct atagcgattt ctttcgcgct cgcatagggc aaatcaatag     3360

ggaaaagttc taattgcatt ttcctatcgc tccaatcaaa agaagtgata tctaacagca     3420

caggcgtatt agagtgtttt tgcaaacttt tagcgaaagc gtattgaaac atttgattcc     3480

caagccctcc gcaaatttgc accaccttaa aagccatatg tatatctcct tcttgaattc     3540

taaaaattga ttgaatgtat gcaaataaat gcatacacca taggtgtggt ttaatttgat     3600

gccctttttc agggctggaa tgtgtaagag cggggttatt tatgctgttg tttttttgtt     3660

actcgggaag ggctttacct cttccgcata aacgcttcca tcagcgttta tagttaaaaa     3720

aatctttcgg aactggtttt gcgcttaccc caaccaacag gggatttgct gctttccatt     3780

gagcctgttt ctctgcgcga cgttcgcggc ggcgtgtttg tgcatccatc tggattctcc     3840

tgtcagttag ctttggtggt gtgtggcagt tgtagtcctg aacgaaaacc ccccgcgatt     3900

ggcacattgg cagctaatcc ggaatcgcac ttacggccaa tgcttcgttt cgtatcacac     3960

accccaaagc cttctgcttt gaatgctgcc cttcttcagg gcttaatttt taagagcgtc     4020

accttcatgg tggtcagtgc gtcctgctga tgtgctcagt atcaccgcca gtggtattta     4080

tgtcaacacc gccagagata atttatcacc gcagatggtt atctgtatgt tttttatatg     4140

aatttatttt ttgcaggggg gcattgtttg gtaggtgaga gatcaattct gcattaatga     4200

atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc     4260

actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg     4320

gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc     4380

cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc     4440

ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga     4500

ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc     4560

ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat     4620

agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg     4680

cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc     4740

aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga     4800

gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact     4860

agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt     4920

ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag     4980

cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg     5040

tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa     5100

aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata     5160

tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg     5220

atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata     5280

cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg     5340

gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct     5400

gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt     5460

tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc     5520

tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga     5580

tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt     5640

aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc     5700

atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa     5760

tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca     5820

catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca     5880

aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct     5940

tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc     6000

gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa     6060

tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt     6120

tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc     6180

taagaaacca ttattatcat gacattaacc tataaaaata ggcgtatcac gaggcccttt     6240

cgtc                                                                  6244


<210>  2
<211>  298
<212>  PRT
<213>  Helicobacter pylori

<400>  2

Met Ala Phe Lys Val Val Gln Ile Cys Gly Gly Leu Gly Asn Gln Met 
1               5                   10                  15      


Phe Gln Tyr Ala Phe Ala Lys Ser Leu Gln Lys His Leu Asn Thr Pro 
            20                  25                  30          


Val Leu Leu Asp Ile Thr Ser Phe Asp Trp Ser Asn Arg Lys Met Gln 
        35                  40                  45              


Leu Glu Leu Phe Pro Ile Asp Leu Pro Tyr Ala Ser Ala Lys Glu Ile 
    50                  55                  60                  


Ala Ile Ala Lys Met Gln His Leu Pro Lys Leu Val Arg Asp Thr Leu 
65                  70                  75                  80  


Lys Cys Met Gly Phe Asp Arg Val Ser Gln Glu Ile Val Phe Glu Tyr 
                85                  90                  95      


Glu Pro Gly Leu Leu Lys Pro Ser Arg Leu Thr Tyr Phe Tyr Gly Tyr 
            100                 105                 110         


Phe Gln Asp Pro Arg Tyr Phe Asp Ala Ile Ser Pro Leu Ile Lys Gln 
        115                 120                 125             


Thr Phe Thr Leu Pro Pro Pro Glu Asn Gly Asn Asn Lys Lys Lys Glu 
    130                 135                 140                 


Glu Glu Tyr His Arg Lys Leu Ala Leu Ile Leu Ala Ala Lys Asn Ser 
145                 150                 155                 160 


Val Phe Val His Val Arg Arg Gly Asp Tyr Val Gly Ile Gly Cys Gln 
                165                 170                 175     


Leu Gly Ile Asp Tyr Gln Lys Lys Ala Leu Glu Tyr Ile Ala Lys Arg 
            180                 185                 190         


Val Pro Asn Met Glu Leu Phe Val Phe Cys Glu Asp Leu Lys Phe Thr 
        195                 200                 205             


Gln Asn Leu Asp Leu Gly Tyr Pro Phe Met Asp Met Thr Thr Arg Asp 
    210                 215                 220                 


Lys Glu Glu Glu Ala Tyr Trp Asp Met Leu Leu Met Gln Ser Cys Lys 
225                 230                 235                 240 


His Gly Ile Ile Ala Asn Ser Thr Tyr Ser Trp Trp Ala Ala Tyr Leu 
                245                 250                 255     


Ile Asn Asn Pro Glu Lys Ile Ile Ile Gly Pro Lys His Trp Leu Phe 
            260                 265                 270         


Gly His Glu Asn Ile Leu Cys Lys Glu Trp Val Lys Ile Glu Ser His 
        275                 280                 285             


Phe Glu Val Lys Ser Lys Lys Tyr Asn Ala 
    290                 295             


<210>  3
<211>  281
<212>  PRT
<213>  Vibrio cholerae

<400>  3

Met Ile Val Met Lys Ile Ser Gly Gly Leu Gly Asn Gln Leu Phe Gln 
1               5                   10                  15      


Tyr Ala Val Gly Arg Ala Ile Ala Ile Gln Tyr Gly Val Pro Leu Lys 
            20                  25                  30          


Leu Asp Val Ser Ala Tyr Lys Asn Tyr Lys Leu His Asn Gly Tyr Arg 
        35                  40                  45              


Leu Asp Gln Phe Asn Ile Asn Ala Asp Ile Ala Asn Glu Asp Glu Ile 
    50                  55                  60                  


Phe His Leu Lys Gly Ser Ser Asn Arg Leu Ser Arg Ile Leu Arg Arg 
65                  70                  75                  80  


Leu Gly Trp Leu Lys Lys Asn Thr Tyr Tyr Ala Glu Lys Gln Arg Thr 
                85                  90                  95      


Ile Tyr Asp Val Ser Val Phe Met Gln Ala Pro Arg Tyr Leu Asp Gly 
            100                 105                 110         


Tyr Trp Gln Asn Glu Gln Tyr Phe Ser Gln Ile Arg Ala Val Leu Leu 
        115                 120                 125             


Gln Glu Leu Trp Pro Asn Gln Pro Leu Ser Ile Asn Ala Gln Ala His 
    130                 135                 140                 


Gln Ile Lys Ile Gln Gln Thr His Ala Val Ser Ile His Val Arg Arg 
145                 150                 155                 160 


Gly Asp Tyr Leu Asn His Pro Glu Ile Gly Val Leu Asp Ile Asp Tyr 
                165                 170                 175     


Tyr Lys Arg Ala Val Asp Tyr Ile Lys Glu Lys Ile Glu Ala Pro Val 
            180                 185                 190         


Phe Phe Val Phe Ser Asn Asp Val Ala Trp Cys Lys Asp Asn Phe Asn 
        195                 200                 205             


Phe Ile Asp Ser Pro Val Phe Ile Glu Asp Thr Gln Thr Glu Ile Asp 
    210                 215                 220                 


Asp Leu Met Leu Met Cys Gln Cys Gln His Asn Ile Val Ala Asn Ser 
225                 230                 235                 240 


Ser Phe Ser Trp Trp Ala Ala Trp Leu Asn Ser Asn Val Asp Lys Ile 
                245                 250                 255     


Val Ile Ala Pro Lys Thr Trp Met Ala Glu Asn Pro Lys Gly Tyr Lys 
            260                 265                 270         


Trp Val Pro Asp Ser Trp Arg Glu Ile 
        275                 280     


<210>  4
<211>  297
<212>  PRT
<213>  Escherichia coli

<400>  4

Met Ser Ile Ile Arg Leu Gln Gly Gly Leu Gly Asn Gln Leu Phe Gln 
1               5                   10                  15      


Phe Ser Phe Gly Tyr Ala Leu Ser Lys Ile Asn Gly Thr Pro Leu Tyr 
            20                  25                  30          


Phe Asp Ile Ser His Tyr Ala Glu Asn Asp Asp His Gly Gly Tyr Arg 
        35                  40                  45              


Leu Asn Asn Leu Gln Ile Pro Glu Glu Tyr Leu Gln Tyr Tyr Thr Pro 
    50                  55                  60                  


Lys Ile Asn Asn Ile Tyr Lys Phe Leu Val Arg Gly Ser Arg Leu Tyr 
65                  70                  75                  80  


Pro Glu Ile Phe Leu Phe Leu Gly Phe Cys Asn Glu Phe His Ala Tyr 
                85                  90                  95      


Gly Tyr Asp Phe Glu Tyr Ile Ala Gln Lys Trp Lys Ser Lys Lys Tyr 
            100                 105                 110         


Ile Gly Tyr Trp Gln Ser Glu His Phe Phe His Lys His Ile Leu Asp 
        115                 120                 125             


Leu Lys Glu Phe Phe Ile Pro Lys Asn Val Ser Glu Gln Ala Asn Leu 
    130                 135                 140                 


Leu Ala Ala Lys Ile Leu Glu Ser Gln Ser Ser Leu Ser Ile His Ile 
145                 150                 155                 160 


Arg Arg Gly Asp Tyr Ile Lys Asn Lys Thr Ala Thr Leu Thr His Gly 
                165                 170                 175     


Val Cys Ser Leu Glu Tyr Tyr Lys Lys Ala Leu Asn Lys Ile Arg Asp 
            180                 185                 190         


Leu Ala Met Ile Arg Asp Val Phe Ile Phe Ser Asp Asp Ile Phe Trp 
        195                 200                 205             


Cys Lys Glu Asn Ile Glu Thr Leu Leu Ser Lys Lys Tyr Asn Ile Tyr 
    210                 215                 220                 


Tyr Ser Glu Asp Leu Ser Gln Glu Glu Asp Leu Trp Leu Met Ser Leu 
225                 230                 235                 240 


Ala Asn His His Ile Ile Ala Asn Ser Ser Phe Ser Trp Trp Gly Ala 
                245                 250                 255     


Tyr Leu Gly Thr Ser Ala Ser Gln Ile Val Ile Tyr Pro Thr Pro Trp 
            260                 265                 270         


Tyr Asp Ile Thr Pro Lys Asn Thr Tyr Ile Pro Ile Val Asn His Trp 
        275                 280                 285             


Ile Asn Val Asp Lys His Ser Ser Cys 
    290                 295         


<210>  5
<211>  293
<212>  PRT
<213>  Helicobacter bilis

<400>  5

Met Gly Asp Tyr Lys Ile Val Glu Leu Thr Cys Gly Leu Gly Asn Gln 
1               5                   10                  15      


Met Phe Gln Tyr Ala Phe Ala Lys Ala Leu Gln Lys His Leu Gln Val 
            20                  25                  30          


Pro Val Leu Leu Asp Lys Thr Trp Tyr Asp Thr Gln Asp Asn Ser Thr 
        35                  40                  45              


Gln Phe Ser Leu Asp Ile Phe Asn Val Asp Leu Glu Tyr Ala Thr Asn 
    50                  55                  60                  


Thr Gln Ile Glu Lys Ala Lys Ala Arg Val Ser Lys Leu Pro Gly Leu 
65                  70                  75                  80  


Leu Arg Lys Met Phe Gly Leu Lys Lys His Asn Ile Ala Tyr Ser Gln 
                85                  90                  95      


Ser Phe Asp Phe His Asp Glu Tyr Leu Leu Pro Asn Asp Phe Thr Tyr 
            100                 105                 110         


Phe Ser Gly Phe Phe Gln Asn Ala Lys Tyr Leu Lys Gly Leu Glu Gln 
        115                 120                 125             


Glu Leu Lys Ser Ile Phe Tyr Tyr Asp Ser Asn Asn Phe Ser Asn Phe 
    130                 135                 140                 


Gly Lys Gln Arg Leu Glu Leu Ile Leu Gln Ala Lys Asn Ser Ile Phe 
145                 150                 155                 160 


Ile His Ile Arg Arg Gly Asp Tyr Cys Lys Ile Gly Trp Glu Leu Gly 
                165                 170                 175     


Met Asp Tyr Tyr Lys Arg Ala Ile Gln Tyr Ile Met Asp Arg Val Glu 
            180                 185                 190         


Glu Pro Lys Phe Phe Ile Phe Gly Ala Thr Asp Met Ser Phe Thr Glu 
        195                 200                 205             


Gln Phe Gln Lys Asn Leu Gly Leu Asn Glu Asn Asn Ser Ala Asn Leu 
    210                 215                 220                 


Ser Glu Lys Thr Ile Thr Gln Asp Asn Gln His Glu Asp Met Phe Leu 
225                 230                 235                 240 


Met Cys Tyr Cys Lys His Ala Ile Leu Ala Asn Ser Ser Tyr Ser Phe 
                245                 250                 255     


Trp Ser Ala Tyr Leu Asn Asn Asp Ala Asn Asn Ile Val Ile Ala Pro 
            260                 265                 270         


Thr Pro Trp Leu Leu Asp Asn Asp Asn Ile Ile Cys Asp Asp Trp Ile 
        275                 280                 285             


Lys Ile Ser Ser Lys 
    290             


<210>  6
<211>  257
<212>  PRT
<213>  Helicobacter cinaedi

<400>  6

Met Leu Phe Pro Phe Lys Phe Ile Tyr Asn Arg Leu Arg Tyr Lys Ala 
1               5                   10                  15      


Ile Arg Leu Ile Arg Arg Arg Ala Ser Tyr Arg Pro Phe Tyr Glu Phe 
            20                  25                  30          


Tyr Ala His Ile Val Trp Gly Glu Glu Gly Val Val Asn Asp Arg Ile 
        35                  40                  45              


Met Lys His Tyr Arg Glu Ser Ser Phe Lys Pro Tyr Ala Phe Pro Tyr 
    50                  55                  60                  


Gly Ile Asn Met Ser Phe Val Tyr Ser Asn Asp Val Tyr Ala Leu Leu 
65                  70                  75                  80  


Lys Asp Asp Phe Arg Leu Lys Ile Pro Leu Arg Tyr Asp Asn Ala Met 
                85                  90                  95      


Leu Lys Lys Gln Ile Gln Asn Thr Asp Lys Ser Val Phe Leu His Ile 
            100                 105                 110         


Arg Arg Gly Asp Tyr Leu Gln Ser Glu Gly Leu Tyr Val Val Leu Gly 
        115                 120                 125             


Val Thr Tyr Tyr Gln Lys Ala Leu Glu Ile Leu Lys Ser Lys Ile Thr 
    130                 135                 140                 


Asn Pro His Ile Phe Val Phe Ser Asn Asp Met Cys Trp Cys Lys Glu 
145                 150                 155                 160 


Tyr Leu Met Arg Tyr Val Asp Phe Ser Gly Cys Thr Ile Asp Phe Ile 
                165                 170                 175     


Glu Gly Asn Thr Glu Gly Asn Ala Val Glu Glu Met Glu Leu Met Arg 
            180                 185                 190         


Ser Cys Gln His Ala Ile Ile Ala Asn Ser Thr Phe Ser Trp Trp Ala 
        195                 200                 205             


Ala Tyr Leu Ile Glu Asn Pro Asp Lys Ile Val Ile Met Pro Lys Glu 
    210                 215                 220                 


Tyr Leu Asn Asp Ser Ser Arg Phe Leu Pro Lys Gln Phe Leu Ala Leu 
225                 230                 235                 240 


Lys Asn Trp Phe Leu Val Asp His Ile Trp Gly Ser Val Glu Leu Ala 
                245                 250                 255     


Asn 
    


<210>  7
<211>  286
<212>  PRT
<213>  Helicobacter mustelae

<400>  7

Met Asp Phe Lys Ile Val Gln Val His Gly Gly Leu Gly Asn Gln Met 
1               5                   10                  15      


Phe Gln Tyr Ala Phe Ala Lys Ser Leu Gln Thr His Leu Asn Ile Pro 
            20                  25                  30          


Val Leu Leu Asp Thr Thr Trp Phe Asp Tyr Gly Asn Arg Glu Leu Gly 
        35                  40                  45              


Leu His Leu Phe Pro Ile Asp Leu Gln Cys Ala Ser Ala Gln Gln Ile 
    50                  55                  60                  


Ala Ala Ala His Met Gln Asn Leu Pro Arg Leu Val Arg Gly Ala Leu 
65                  70                  75                  80  


Arg Arg Met Gly Leu Gly Arg Val Ser Lys Glu Ile Val Phe Glu Tyr 
                85                  90                  95      


Met Pro Glu Leu Phe Glu Pro Ser Arg Ile Ala Tyr Phe His Gly Tyr 
            100                 105                 110         


Phe Gln Asp Pro Arg Tyr Phe Glu Asp Ile Ser Pro Leu Ile Lys Gln 
        115                 120                 125             


Thr Phe Thr Leu Pro His Pro Thr Glu His Ala Glu Gln Tyr Ser Arg 
    130                 135                 140                 


Lys Leu Ser Gln Ile Leu Ala Ala Lys Asn Ser Val Phe Val His Ile 
145                 150                 155                 160 


Arg Arg Gly Asp Tyr Met Arg Leu Gly Trp Gln Leu Asp Ile Ser Tyr 
                165                 170                 175     


Gln Leu Arg Ala Ile Ala Tyr Met Ala Lys Arg Val Gln Asn Leu Glu 
            180                 185                 190         


Leu Phe Leu Phe Cys Glu Asp Leu Glu Phe Val Gln Asn Leu Asp Leu 
        195                 200                 205             


Gly Tyr Pro Phe Val Asp Met Thr Thr Arg Asp Gly Ala Ala His Trp 
    210                 215                 220                 


Asp Met Met Leu Met Gln Ser Cys Lys His Gly Ile Ile Thr Asn Ser 
225                 230                 235                 240 


Thr Tyr Ser Trp Trp Ala Ala Tyr Leu Ile Lys Asn Pro Glu Lys Ile 
                245                 250                 255     


Ile Ile Gly Pro Ser His Trp Ile Tyr Gly Asn Glu Asn Ile Leu Cys 
            260                 265                 270         


Lys Asp Trp Val Lys Ile Glu Ser Gln Phe Glu Thr Lys Ser 
        275                 280                 285     


<210>  8
<211>  281
<212>  PRT
<213>  Bacteroides vulgatus

<400>  8

Met Arg Leu Ile Lys Val Thr Gly Gly Leu Gly Asn Gln Met Phe Ile 
1               5                   10                  15      


Tyr Ala Phe Tyr Leu Arg Met Lys Lys Tyr Tyr Pro Lys Val Arg Ile 
            20                  25                  30          


Asp Leu Ser Asp Met Met His Tyr Lys Val His Tyr Gly Tyr Glu Met 
        35                  40                  45              


His Arg Val Phe Asn Leu Pro His Thr Glu Phe Cys Ile Asn Gln Pro 
    50                  55                  60                  


Leu Lys Lys Val Ile Glu Phe Leu Phe Phe Lys Lys Ile Tyr Glu Arg 
65                  70                  75                  80  


Lys Gln Ala Pro Asn Ser Leu Arg Ala Phe Glu Lys Lys Tyr Phe Trp 
                85                  90                  95      


Pro Leu Leu Tyr Phe Lys Gly Phe Tyr Gln Ser Glu Arg Phe Phe Ala 
            100                 105                 110         


Asp Ile Lys Asp Glu Val Arg Glu Ser Phe Thr Phe Asp Lys Asn Lys 
        115                 120                 125             


Ala Asn Ser Arg Ser Leu Asn Met Leu Glu Ile Leu Asp Lys Asp Glu 
    130                 135                 140                 


Asn Ala Val Ser Leu His Ile Arg Arg Gly Asp Tyr Leu Gln Pro Lys 
145                 150                 155                 160 


His Trp Ala Thr Thr Gly Ser Val Cys Gln Leu Pro Tyr Tyr Gln Asn 
                165                 170                 175     


Ala Ile Ala Glu Met Ser Arg Arg Val Ala Ser Pro Ser Tyr Tyr Ile 
            180                 185                 190         


Phe Ser Asp Asp Ile Ala Trp Val Lys Glu Asn Leu Pro Leu Gln Asn 
        195                 200                 205             


Ala Val Tyr Ile Asp Trp Asn Thr Asp Glu Asp Ser Trp Gln Asp Met 
    210                 215                 220                 


Met Leu Met Ser His Cys Lys His His Ile Ile Cys Asn Ser Thr Phe 
225                 230                 235                 240 


Ser Trp Trp Gly Ala Trp Leu Asn Pro Asn Met Asp Lys Thr Val Ile 
                245                 250                 255     


Val Pro Ser Arg Trp Phe Gln His Ser Glu Ala Pro Asp Ile Tyr Pro 
            260                 265                 270         


Thr Gly Trp Ile Lys Val Pro Val Ser 
        275                 280     


<210>  9
<211>  292
<212>  PRT
<213>  Bacteroides ovatus

<400>  9

Met Lys Ile Val Asn Ile Leu Gly Gly Leu Gly Asn Gln Met Phe Val 
1               5                   10                  15      


Tyr Ala Met Tyr Leu Ala Leu Lys Glu Ala His Pro Glu Glu Glu Ile 
            20                  25                  30          


Leu Leu Cys Arg Arg Ser Tyr Lys Gly Tyr Pro Leu His Asn Gly Tyr 
        35                  40                  45              


Glu Leu Glu Arg Ile Phe Gly Val Glu Ala Pro Glu Ala Ala Leu Ser 
    50                  55                  60                  


Gln Leu Ala Arg Val Ala Tyr Pro Phe Phe Asn Tyr Lys Ser Trp Gln 
65                  70                  75                  80  


Leu Met Arg His Phe Leu Pro Leu Arg Lys Ser Met Ala Ser Gly Thr 
                85                  90                  95      


Thr Gln Ile Pro Phe Asp Tyr Ser Glu Val Thr Arg Asn Asp Asn Val 
            100                 105                 110         


Tyr Tyr Asp Gly Tyr Trp Gln Asn Glu Lys Asn Phe Leu Ser Ile Arg 
        115                 120                 125             


Asp Lys Val Ile Lys Ala Phe Thr Phe Pro Glu Phe Arg Asp Glu Lys 
    130                 135                 140                 


Asn Lys Ala Leu Ser Asp Lys Leu Lys Ser Val Lys Thr Ala Ser Cys 
145                 150                 155                 160 


His Ile Arg Arg Gly Asp Tyr Leu Lys Asp Pro Ile Tyr Gly Val Cys 
                165                 170                 175     


Asn Ser Asp Tyr Tyr Thr Arg Ala Ile Thr Glu Leu Asn Gln Ser Val 
            180                 185                 190         


Asn Pro Asp Met Tyr Cys Ile Phe Ser Asp Asp Ile Gly Trp Cys Lys 
        195                 200                 205             


Glu Asn Phe Lys Phe Leu Ile Gly Asp Lys Glu Val Val Phe Val Asp 
    210                 215                 220                 


Trp Asn Lys Gly Gln Glu Ser Phe Tyr Asp Met Gln Leu Met Ser Leu 
225                 230                 235                 240 


Cys His Tyr Asn Ile Ile Ala Asn Ser Ser Phe Ser Trp Trp Gly Ala 
                245                 250                 255     


Trp Leu Asn Asn Asn Asp Asp Lys Val Val Val Ala Pro Glu Arg Trp 
            260                 265                 270         


Met Asn Lys Thr Leu Glu Asn Asp Pro Ile Cys Asp Asn Trp Lys Arg 
        275                 280                 285             


Ile Lys Val Glu 
    290         


<210>  10
<211>  290
<212>  PRT
<213>  Escherichia coli

<400>  10

Met Ser Ile Val Val Ala Arg Leu Ala Gly Gly Leu Gly Asn Gln Met 
1               5                   10                  15      


Phe Gln Tyr Ala Lys Gly Tyr Ala Glu Ser Val Glu Arg Asn Ser Ser 
            20                  25                  30          


Leu Lys Leu Asp Leu Arg Gly Tyr Lys Asn Tyr Thr Leu His Gly Gly 
        35                  40                  45              


Phe Arg Leu Asp Lys Leu Asn Ile Asp Asn Thr Phe Val Met Ser Lys 
    50                  55                  60                  


Lys Glu Met Cys Ile Phe Pro Asn Phe Ile Val Arg Ala Ile Asn Lys 
65                  70                  75                  80  


Phe Pro Lys Leu Ser Leu Cys Ser Lys Arg Phe Glu Ser Glu Gln Tyr 
                85                  90                  95      


Ser Lys Lys Ile Asn Gly Ser Met Lys Gly Ser Val Glu Phe Ile Gly 
            100                 105                 110         


Phe Trp Gln Asn Glu Arg Tyr Phe Leu Glu His Lys Glu Lys Leu Arg 
        115                 120                 125             


Glu Ile Phe Thr Pro Ile Asn Ile Asn Leu Asp Ala Lys Glu Leu Ser 
    130                 135                 140                 


Asp Val Ile Arg Cys Thr Asn Ser Val Ser Val His Ile Arg Arg Gly 
145                 150                 155                 160 


Asp Tyr Val Ser Asn Val Glu Ala Leu Lys Ile His Gly Leu Cys Thr 
                165                 170                 175     


Glu Arg Tyr Tyr Ile Asp Ser Ile Arg Tyr Leu Lys Glu Arg Phe Asn 
            180                 185                 190         


Asn Leu Val Phe Phe Val Phe Ser Asp Asp Ile Glu Trp Cys Lys Lys 
        195                 200                 205             


Tyr Lys Asn Glu Ile Phe Ser Arg Ser Asp Asp Val Lys Phe Ile Glu 
    210                 215                 220                 


Gly Asn Thr Gln Glu Val Asp Met Trp Leu Met Ser Asn Ala Lys Tyr 
225                 230                 235                 240 


His Ile Ile Ala Asn Ser Ser Phe Ser Trp Trp Gly Ala Trp Leu Lys 
                245                 250                 255     


Asn Tyr Asp Leu Gly Ile Thr Ile Ala Pro Thr Pro Trp Phe Glu Arg 
            260                 265                 270         


Glu Glu Leu Asn Ser Phe Asp Pro Cys Pro Glu Lys Trp Val Arg Ile 
        275                 280                 285             


Glu Lys 
    290 


<210>  11
<211>  289
<212>  PRT
<213>  Bacteroides fragilis

<400>  11

Met Phe Phe Arg Cys Cys Met Lys Ile Val Gln Ile Ile Gly Gly Leu 
1               5                   10                  15      


Gly Asn Gln Met Phe Gln Phe Ala Phe Tyr Leu Ala Leu Lys Glu Lys 
            20                  25                  30          


Tyr Val Asn Val Lys Leu Asp Thr Ser Ser Phe Gly Ala Tyr Thr His 
        35                  40                  45              


Asn Gly Phe Glu Leu Asp Lys Val Phe His Val Glu Tyr Leu Lys Ala 
    50                  55                  60                  


Ser Ile Arg Glu Arg Ile Lys Leu Ser Tyr Gln Gly Ser Glu Ile Trp 
65                  70                  75                  80  


Ile Arg Val Leu Arg Lys Leu Leu Lys Arg Lys Lys Thr Glu Tyr Val 
                85                  90                  95      


Glu Pro Tyr Leu Cys Phe Asp Glu Asn Ala Ile Ser Leu Ser Cys Asp 
            100                 105                 110         


Lys Tyr Tyr Ile Gly Tyr Trp Gln Ser Tyr Lys Tyr Phe Thr Asn Ile 
        115                 120                 125             


Glu Ala Ala Ile Arg Gly Gln Phe His Phe Ser Lys Val Leu Ser Asp 
    130                 135                 140                 


Lys Asn Glu Phe Ile Lys Lys Gln Met Gln Asn Ser Asn Ser Val Ser 
145                 150                 155                 160 


Leu His Val Arg Leu Gly Asp Tyr Val Asn Asn Pro Ala Tyr Ser Asn 
                165                 170                 175     


Ile Cys Thr Ser Ala Tyr Tyr Asn Lys Ala Ile Asn Ile Ile Gln Ser 
            180                 185                 190         


Lys Val Ser Glu Pro Lys Phe Phe Val Phe Ser Asp Asp Thr Val Trp 
        195                 200                 205             


Cys Lys Asp His Leu Lys Ile Pro Asn Cys His Ile Ile Asp Trp Asn 
    210                 215                 220                 


Asn Lys Glu Glu Ser Tyr Trp Asp Met Cys Leu Met Thr Tyr Cys Lys 
225                 230                 235                 240 


His Asn Ile Ile Ala Asn Ser Ser Phe Ser Trp Trp Gly Ala Trp Leu 
                245                 250                 255     


Asn Thr Asn Pro Glu Arg Ile Val Ile Ala Pro Gly Lys Trp Ile Asn 
            260                 265                 270         


Asp Asp Arg Val Gln Val Ser Asp Ile Ile Pro Ser Asp Trp Ile Cys 
        275                 280                 285             


Val 
    


<210>  12
<211>  287
<212>  PRT
<213>  Bacteroides fragilis

<400>  12

Met Leu Tyr Val Ile Leu Arg Gly Arg Leu Gly Asn Asn Leu Phe Gln 
1               5                   10                  15      


Ile Ala Thr Ala Ala Ser Leu Thr Gln Asn Phe Ile Phe Cys Thr Val 
            20                  25                  30          


Asn Lys Asp Gln Glu Arg Gln Val Leu Leu Tyr Lys Asp Ser Phe Phe 
        35                  40                  45              


Lys Asn Ile Lys Val Met Lys Gly Val Pro Asp Gly Ile Pro Tyr Tyr 
    50                  55                  60                  


Lys Glu Pro Phe His Glu Phe Ser Arg Ile Pro Tyr Glu Glu Gly Lys 
65                  70                  75                  80  


Asp Leu Ile Ile Asp Gly Tyr Phe Gln Ser Glu Lys Tyr Phe Lys Arg 
                85                  90                  95      


Ser Val Val Leu Asp Leu Tyr Arg Ile Thr Asp Glu Leu Arg Lys Lys 
            100                 105                 110         


Ile Trp Asn Ile Cys Gly Asn Ile Leu Glu Lys Gly Glu Thr Val Ser 
        115                 120                 125             


Ile His Val Arg Arg Gly Asp Tyr Leu Lys Leu Pro His Ala Leu Pro 
    130                 135                 140                 


Phe Cys Gly Lys Ser Tyr Tyr Lys Asn Ala Ile Gln Tyr Ile Gly Glu 
145                 150                 155                 160 


Asp Lys Ile Phe Ile Ile Cys Ser Asp Asp Ile Asp Trp Cys Lys Lys 
                165                 170                 175     


Asn Phe Ile Gly Lys Arg Tyr Tyr Phe Ile Glu Asn Thr Thr Pro Leu 
            180                 185                 190         


Leu Asp Leu Tyr Ile Gln Ser Leu Cys Thr His Asn Ile Ile Ser Asn 
        195                 200                 205             


Ser Ser Phe Ser Trp Trp Gly Ala Trp Leu Asn Glu Asn Ser Asn Lys 
    210                 215                 220                 


Ile Val Ile Ala Pro Gln Met Trp Phe Gly Ile Ser Val Lys Leu Gly 
225                 230                 235                 240 


Val Ser Asp Leu Leu Pro Val Ser Trp Val Arg Leu Pro Asn Asn Tyr 
                245                 250                 255     


Thr Leu Gly Arg Tyr Cys Phe Ala Leu Tyr Lys Val Val Glu Asp Tyr 
            260                 265                 270         


Leu Leu Asn Ile Leu Arg Leu Ile Trp Lys Arg Lys Lys Asn Met 
        275                 280                 285         


<210>  13
<211>  3075
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  lacY chromosomal construct

<400>  13
atgaccatga ttacggattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct       60

ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc      120

gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc      180

tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct      240

gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc      300

tacaccaacg tgacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg      360

acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg      420

cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc      480

ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc      540

ggagaaaacc gcctcgcggt gatggtgctg cgctggagtg acggcagtta tctggaagat      600

caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact      660

acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta      720

ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct      780

ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc      840

gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa      900

ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac      960

ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat     1020

ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat     1080

catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg     1140

aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac     1200

acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc     1260

atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc     1320

gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg     1380

aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat     1440

ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt     1500

tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc     1560

atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc     1620

cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat     1680

ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat     1740

gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc     1800

cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa     1860

gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc     1920

agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat     1980

ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg     2040

attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc     2100

gtagtgcaac cgaacgcgac cgcatggtca gaagccgggc acatcagcgc ctggcagcag     2160

tggcgtctgg cggaaaacct cagtgtgacg ctccccgccg cgtcccacgc catcccgcat     2220

ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac     2280

cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg     2340

ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc     2400

cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa     2460

gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct     2520

cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat     2580

ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg     2640

gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga     2700

ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat     2760

ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc     2820

gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc     2880

agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa     2940

gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg     3000

agcccgtcag tatcggcgga attccagctg agcgccggtc gctaccatta ccagttggtc     3060

tggtgtcaaa aataa                                                      3075


<210>  14
<211>  2203
<212>  DNA
<213>  Artificial Sequence

<220>
<223>   del wcaJ::FRT mutation

<400>  14
gttcggttat atcaatgtca aaaacctcac gccgctcaag ctggtgatca actccgggaa       60

cggcgcagcg ggtccggtgg tggacgccat tgaagcccgc tttaaagccc tcggcgcgcc      120

cgtggaatta atcaaagtgc acaacacgcc ggacggcaat ttccccaacg gtattcctaa      180

cccactactg ccggaatgcc gcgacgacac ccgcaatgcg gtcatcaaac acggcgcgga      240

tatgggcatt gcttttgatg gcgattttga ccgctgtttc ctgtttgacg aaaaagggca      300

gtttattgag ggctactaca ttgtcggcct gttggcagaa gcattcctcg aaaaaaatcc      360

cggcgcgaag atcatccacg atccacgtct ctcctggaac accgttgatg tggtgactgc      420

cgcaggtggc acgccggtaa tgtcgaaaac cggacacgcc tttattaaag aacgtatgcg      480

caaggaagac gccatctatg gtggcgaaat gagcgcccac cattacttcc gtgatttcgc      540

ttactgcgac agcggcatga tcccgtggct gctggtcgcc gaactggtgt gcctgaaaga      600

taaaacgctg ggcgaactgg tacgcgaccg gatggcggcg tttccggcaa gcggtgagat      660

caacagcaaa ctggcgcaac ccgttgaggc gattaaccgc gtggaacagc attttagccg      720

tgaggcgctg gcggtggatc gcaccgatgg catcagcatg acctttgccg actggcgctt      780

taacctgcgc acctccaata ccgaaccggt ggtgcgcctg aatgtggaat cgcgcggtga      840

tgtgccgctg atggaagcgc gaacgcgaac tctgctgacg ttgctgaacg agtaatgtcg      900

gatcttccct taccccactg cgggtaaggg gctaataaca ggaacaacga tgattccggg      960

gatccgtcga cctgcagttc gaagttccta ttctctagaa agtataggaa cttcgaagca     1020

gctccagcct acagttaaca aagcggcata ttgatatgag cttacgtgaa aaaaccatca     1080

gcggcgcgaa gtggtcggcg attgccacgg tgatcatcat cggcttattt ttgacaccag     1140

accaactggt aatttatttt tgacaccaga ccaactggta atttattttt gacaccagac     1200

caactggtta tttttgacac cagaccaact ggttattttt gacaccagac caactggctc     1260

gggctggtgc agatgaccgt gctggcgcgg attatcgaca accaccagtt cggcctgctt     1320

accgtgtcgc tggtgattat cgcgctggca gatacgcttt ctgacttcgg tatcgctaac     1380

tcgattattc agcgaaaaga aatcagtcac cttgaactca ccacgttgta ctggctgaac     1440

gtcgggctgg ggatcgtggt gtgcgtggcg gtgtttttgt tgagtgatct catcggcgac     1500

gtgctgaata acccggacct ggcaccgttg attaaaacat tatcgctggc gtttgtggta     1560

atcccccacg ggcaacagtt ccgcgcgttg atgcaaaaag agctggagtt caacaaaatc     1620

ggcatgatcg aaaccagcgc ggtgctggcg ggcttcactt gtacggtggt tagcgcccat     1680

ttctggccgc tggcgatgac cgcgatcctc ggttatctgg tcaatagtgc ggtgagaacg     1740

ctgctgtttg gctactttgg ccgcaaaatt tatcgccccg gtctgcattt ctcgctggcg     1800

tcggtggcac cgaacttacg ctttggtgcc tggctgacgg cggacagcat catcaactat     1860

ctcaatacca acctttcaac gctcgtgctg gcgcgtattc tcggcgcggg cgtggcaggg     1920

ggatacaacc tggcgtacaa cgtggccgtt gtgccaccga tgaagctgaa cccaatcatc     1980

acccgcgtgt tgtttccggc attcgccaaa attcaggacg ataccgaaaa gctgcgtgtt     2040

aacttctaca agctgctgtc ggtagtgggg attatcaact ttccggcgct gctcgggcta     2100

atggtggtgt cgaataactt tgtaccgctg gtctttggtg agaagtggaa cagcattatt     2160

ccggtgctgc aattgctgtg tgtggtgggt ctgctgcgct ccg                       2203


<210>  15
<211>  5046
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  lacZ insertion in lon

<400>  15
gtggatggaa gaggtggaaa aagtggttat ggaggagtgg gtaattgatg gtgaaaggaa       60

agggttggtg atttatggga agggggaagg ggaagaggga tgtggtgaat aattaaggat      120

tgggatagaa ttagttaagg aaaaaggggg gattttatgt ggggtttaat ttttggtgta      180

ttgtgggggt tgaatgtggg ggaaagatgg ggatatagtg aggtagatgt taatagatgg      240

ggtgaaggag agtggtgtga tgtgattagg tgggggaaat taaagtaaga gagaggtgta      300

tgattggggg gatgggtgga ggtggagttg gaagttggta ttgtgtagaa agtataggaa      360

gttgagaggg gttttgaagg tgagggtggg ggaaggagtg aggggggaag gggtggtaaa      420

ggaaggggaa gaggtagaaa gggagtgggg agaaagggtg gtgagggggg atgaatgtga      480

ggtagtgggg tatgtggaga agggaaaagg gaaggggaaa gagaaaggag gtaggttgga      540

gtggggttag atggggatag gtagagtggg gggttttatg gagaggaagg gaaggggaat      600

tgggaggtgg gggggggtgt ggtaaggttg ggaaggggtg gaaagtaaag tggatgggtt      660

tgttgggggg aaggatgtga tgggggaggg gatgaagatg tgatgaagag agaggatgag      720

gatggtttgg gatgattgaa gaagatggat tggagggagg ttgtgggggg ggttgggtgg      780

agagggtatt ggggtatgag tggggagaag agagaatggg gtggtgtgat gggggggtgt      840

tgggggtgtg aggggagggg gggggggttg tttttgtgaa gagggaggtg tggggtgggg      900

tgaatgaagt ggaggaggag ggaggggggg tatggtgggt ggggaggagg ggggttggtt      960

ggggaggtgt ggtggaggtt gtgagtgaag ggggaaggga gtgggtggta ttgggggaag     1020

tgggggggga ggatgtggtg tgatgtgagg ttggtggtgg ggagaaagta tggatgatgg     1080

gtgatggaat ggggggggtg gatagggttg atgggggtag gtggggattg gaggaggaag     1140

ggaaagatgg gatggaggga ggaggtagtg ggatggaagg gggtgttgtg gatgaggatg     1200

atgtggagga agaggatgag ggggtggggg gaggggaagt gttggggagg gtgaaggggg     1260

gatgggggag ggggaggatg tggtggtgag ggatggggat gggtggttgg ggaatatgat     1320

ggtggaaaat ggggggtttt gtggattgat ggagtgtggg ggggtgggtg tgggggaggg     1380

gtatgaggag atagggttgg gtaggggtga tattggtgaa gaggttgggg gggaatgggg     1440

tgaggggttg gtggtggttt agggtatggg gggtggggat tgggagggga tggggttgta     1500

tggggttgtt gaggagttgt tgtaataagg ggatgttgaa gttggtattg ggaagttggt     1560

attgtgtaga aagtatagga agttggaagg aggtggaggg tagataaagg ggggggttat     1620

ttttgagagg agaggaagtg gtaatggtag ggaggggggg tgaggtggaa ttggggggat     1680

agtgaggggg tggaggagtg gtggggagga atggggatat ggaaagggtg gatattgagg     1740

gatgtgggtt gttgggggtg gaggagatgg ggatgggtgg tttggatgag ttggtgttga     1800

gtgtaggggg tgatgttgaa gtggaagtgg gggggggagt ggtgtggggg ataattgaat     1860

tggggggtgg gggaggggag agggttttgg gtggggaaga ggtagggggt atagatgttg     1920

agaatgggag atgggagggg tgaaaagagg ggggagtaag ggggtgggga tagttttgtt     1980

gggggggtaa tgggagggag tttagggggt gtggtaggtg ggggaggtgg gagttgaggg     2040

gaatgggggg gggatggggt gtatgggtgg ggagttgaag atgaagggta atggggattt     2100

gaggagtagg atgaatgggg taggttttgg gggtgataaa taaggttttg gggtgatggt     2160

gggaggggtg aggggtggta atgaggaggg gatgaggaag tgtatgtggg gtggagtgga     2220

agaagggtgg ttgggggtgg taatgggggg gggggttgga gggttggagg gaggggttag     2280

ggtgaatggg ggtgggttga gttaggggaa tgtggttatg gaggggtgga ggggtgaagt     2340

gatgggggag gggggtgagg agttgttttt tatggggaat ggagatgtgt gaaagaaagg     2400

gtgagtgggg gttaaattgg gaagggttat tagggaggtg gatggaaaaa tggatttggg     2460

tggtggtgag atgggggatg gggtgggagg ggggggggag ggtgagagtg aggttttggg     2520

ggagagggga gtggtgggag ggggtgatgt gggggggttg tgaggatggg gtggggttgg     2580

gttggagtag gggtagtgtg agggagagtt gggggggggt gtgggggtgg ggtagttgag     2640

ggagttgaat gaagtgttta ggttgtggag ggagatggag agggagttga ggggttggga     2700

gggggttagg atggaggggg aggatggagt ggaggaggtg gttatgggta tgagggaaga     2760

ggtattgggt ggtgagttgg atggtttggg gggataaagg gaagtggaaa aagtggtggt     2820

ggtgttttgg ttgggtgagg ggtggatggg gggtggggtg gggaaagagg agagggttga     2880

tagagaagtg gggatggttg ggggtatggg gaaaatgagg ggggtaaggg gaggaggggt     2940

tggggttttg atgatattta atgagggagt gatggaggga gtgggagagg aagggggggt     3000

gtaaaggggg atagtgagga aaggggtggg agtatttagg gaaaggggga agagtgttag     3060

ggatggggtg ggggtattgg gaaaggatga gggggggggt gtgtggaggt agggaaaggg     3120

attttttgat ggaggatttg gggagagggg ggaaggggtg gtgttgatgg aggggggggt     3180

agatggggga aataatatgg gtgggggtgg tgtggggtgg ggggggttga tagtggaggg     3240

ggggggaagg atggagagat ttgatggagg gatagagggg gtggtgatta ggggggtggg     3300

gtgattgatt ggggagggag gagatgatga gagtggggtg attaggatgg gggtggagga     3360

ttggggttag gggttgggtg atggggggta gggagggggg atgatgggtg agaggattga     3420

ttgggaggat ggggtgggtt tgaatattgg gttgatggag gagatagagg gggtaggggt     3480

gggagagggt gtaggagagg ggatggttgg gataatggga agaggggagg gggttaaagt     3540

tgttgtggtt gatgaggagg atatggtgga ggatggtgtg gtgatggatg aggtgaggat     3600

ggagaggatg atggtggtga gggttaaggg gtggaatgag gaaggggttg gggttgagga     3660

ggaggagagg attttgaatg gggaggtggg ggaaagggag atgggagggt tgtggttgaa     3720

tgagggtggg gtggggggtg tggagttgaa ggaggggagg atagagattg gggatttggg     3780

gggtggagag tttggggttt tggaggttga gaggtagtgt gaggggatgg ggataaggag     3840

gagggtgatg gataatttga ggggggaaag ggggggtggg ggtggggagg tgggtttgag     3900

ggtgggataa agaaagtgtt aggggtaggt agtgagggaa gtggggggag atgtgaagtt     3960

gagggtggag tagagggggg gtgaaatgat gattaaaggg agtgggaaga tggaaatggg     4020

tgatttgtgt agtgggttta tggaggaagg agaggtgagg gaaaatgggg gtgatggggg     4080

agatatggtg atgttggaga taagtggggt gagtggaggg gaggaggatg agggggaggg     4140

ggttttgtgg ggggggtaaa aatggggtga ggtgaaattg agaggggaaa ggagtgtggt     4200

gggggtaagg gagggagggg gggttggagg agagatgaaa gggggagtta aggggatgaa     4260

aaataattgg ggtgtggggt tggtgtaggg aggtttgatg aagattaaat gtgagggagt     4320

aagaaggggt gggattgtgg gtgggaagaa aggggggatt gagggtaatg ggataggtga     4380

ggttggtgta gatgggggga tggtaagggt ggatgtggga gtttgagggg aggaggagag     4440

tatgggggtg aggaagatgg gagggaggga ggtttggggg aggggttgtg gtgggggaaa     4500

ggagggaaag ggggattggg gattgagggt ggggaagtgt tgggaagggg gatgggtggg     4560

ggggtgttgg gtattagggg aggtggggaa agggggatgt ggtggaaggg gattaagttg     4620

ggtaagggga gggttttggg agtgaggagg ttgtaaaagg agggggagtg aatgggtaat     4680

gatggtgata gtaggtttgg tgaggttgtg agtggaaaat agtgaggtgg gggaaaatgg     4740

agtaataaaa agaggggtgg gagggtaatt gggggttggg agggtttttt tgtgtgggta     4800

agttagatgg gggatggggg ttggggttat taaggggtgt tgtaagggga tgggtggggt     4860

gatataagtg gtgggggttg gtaggttgaa ggattgaagt gggatataaa ttataaagag     4920

gaagagaaga gtgaataaat gtgaattgat ggagaagatt ggtggagggg gtgatatgtg     4980

taaaggtggg ggtgggggtg ggttagatgg tattattggt tgggtaagtg aatgtgtgaa     5040

agaagg                                                                5046


<210>  16
<211>  6199
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pG204

<400>  16
aattctaaaa attgattgaa tgtatgcaaa taaatgcata caccataggt gtggtttaat       60

ttgatgccct ttttcagggc tggaatgtgt aagagcgggg ttatttatgc tgttgttttt      120

ttgttactcg ggaagggctt tacctcttcc gcataaacgc ttccatcagc gtttatagtt      180

aaaaaaatct ttcggaactg gttttgcgct taccccaacc aacaggggat ttgctgcttt      240

ccattgagcc tgtttctctg cgcgacgttc gcggcggcgt gtttgtgcat ccatctggat      300

tctcctgtca gttagctttg gtggtgtgtg gcagttgtag tcctgaacga aaaccccccg      360

cgattggcac attggcagct aatccggaat cgcacttacg gccaatgctt cgtttcgtat      420

cacacacccc aaagccttct gctttgaatg ctgcccttct tcagggctta atttttaaga      480

gcgtcacctt catggtggtc agtgcgtcct gctgatgtgc tcagtatcac cgccagtggt      540

atttatgtca acaccgccag agataattta tcaccgcaga tggttatctg tatgtttttt      600

atatgaattt attttttgca ggggggcatt gtttggtagg tgagagatca attctgcatt      660

aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct      720

cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa      780

aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa      840

aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc      900

tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga      960

caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc     1020

cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt     1080

ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct     1140

gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg     1200

agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta     1260

gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct     1320

acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa     1380

gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt     1440

gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta     1500

cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat     1560

caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa     1620

gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct     1680

cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta     1740

cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct     1800

caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg     1860

gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa     1920

gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt     1980

cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta     2040

catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca     2100

gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta     2160

ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct     2220

gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg     2280

cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac     2340

tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact     2400

gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa     2460

atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt     2520

ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat     2580

gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg     2640

acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc     2700

cctttcgtct cgcgcgtttc ggtgatgacg gtgaaaacct ctgacacatg cagctcccgg     2760

agacggtcac agcttgtctg taagcggatg ccgggagcag acaagcccgt cagggcgcgt     2820

cagcgggtgt tggcgggtgt cggggctggc ttaactatgc ggcatcagag cagattgtac     2880

tgagagtgca ccatatatgc ggtgtgaaat accgcacaga tgcgtaagga gaaaataccg     2940

catcaggcgc ctcctcaacc tgtatattcg taaaccacgc ccaatgggag ctgtctcagg     3000

tttgttcctg attggttacg gcgcgtttcg catcattgtt gagtttttcc gccagcccga     3060

cgcgcagttt accggtgcct gggtgcagta catcagcatg gggcaaattc tttccatccc     3120

gatgattgtc gcgggtgtga tcatgatggt ctgggcatat cgtcgcagcc cacagcaaca     3180

cgtttcctga ggaaccatga aacagtattt agaactgatg caaaaagtgc tcgacgaagg     3240

cacacagaaa aacgaccgta ccggaaccgg aacgctttcc atttttggtc atcagatgcg     3300

ttttaacctg caagatggat tcccgctggt gacaactaaa cgttgccacc tgcgttccat     3360

catccatgaa ctgctgtggt ttctgcaggg cgacactaac attgcttatc tacacgaaaa     3420

caatgtcacc atctgggacg aatgggccga tgaaaacggc gacctcgggc cagtgtatgg     3480

taaacagtgg cgcgcctggc caacgccaga tggtcgtcat attgaccaga tcactacggt     3540

actgaaccag ctgaaaaacg acccggattc gcgccgcatt attgtttcag cgtggaacgt     3600

aggcgaactg gataaaatgg cgctggcacc gtgccatgca ttcttccagt tctatgtggc     3660

agacggcaaa ctctcttgcc agctttatca gcgctcctgt gacgtcttcc tcggcctgcc     3720

gttcaacatt gccagctacg cgttattggt gcatatgatg gcgcagcagt gcgatctgga     3780

agtgggtgat tttgtctgga ccggtggcga cacgcatctg tacagcaacc atatggatca     3840

aactcatctg caattaagcc gcgaaccgcg tccgctgccg aagttgatta tcaaacgtaa     3900

acccgaatcc atcttcgact accgtttcga agactttgag attgaaggct acgatccgca     3960

tccgggcatt aaagcgccgg tggctatcta attacgaaac atcctgccag agccgacgcc     4020

agtgtgcgtc ggttttttta ccctccgtta aattcttcga gacgccttcc cgaaggcgcc     4080

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat     4140

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt     4200

tttcccagtc acgacgttgt aaaacgacgg ccagtgccaa gctttcttta atgaagcagg     4260

gcatcaggac ggtatctttg tggagaaagc agagtaatct tattcagcct gactggtggg     4320

aaaccaccag tcagaatgtg ttagcgcatg ttgacaaaaa taccattagt cacattatcc     4380

gtcagtcgga cgacatggta gataacctgt ttattatgcg ttttgatctt acgtttaata     4440

ttacctttat gcgatgaaac ggtcttggct ttgatattca tttggtcaga gatttgaatg     4500

gttccctgac ctgccatcca cattcgcaac atactcgatt cggttcggct caatgataac     4560

gtcggcatat ttaaaaacga ggttatcgtt gtctcttttt tcagaatatc gccaaggata     4620

tcgtcgagag attccggttt aatcgattta gaactgatca ataaattttt tctgaccaat     4680

agatattcat caaaatgaac attggcaatt gccataaaaa cgataaataa cgtattggga     4740

tgttgattaa tgatgagctt gatacgctga ctgttagaag catcgtggat gaaacagtcc     4800

tcattaataa acaccactga agggcgctgt gaatcacaag ctatggcaag gtcatcaacg     4860

gtttcaatgt cgttgatttc tcttttttta acccctctac tcaacagata cccggttaaa     4920

cctagtcggg tgtaactaca taaatccata ataatcgttg acatggcata ccctcactca     4980

atgcgtaacg ataattcccc ttacctgaat atttcatcat gactaaacgg aacaacatgg     5040

gtcacctaat gcgccactct cgcgattttt caggcggact tactatcccg taaagtgttg     5100

tataatttgc ctggaattgt cttaaagtaa agtaaatgtt gcgatatgtg agtgagctta     5160

aaacaaatat ttcgctgcag gagtatcctg gaagatgttc gtgagaagct tactgctcac     5220

aagaaaaaag gcacgtcatc tgacgtgcct tttttatttg tactaccctg tacgattact     5280

gcagctcgag ctaacacgag ctatgtttat ccacgtttat ccagtgattg actatgggga     5340

tataagtatt ttttggagtt atatcgtacc aaggagtagg ataaataaca atctgtgacg     5400

ctgatgtacc taaataagcc ccccaccaac taaaactact attcgctata atatgatggt     5460

tagctaagct cattaaccat aaatcttctt cttgtgataa atcttctgaa taatatatat     5520

tatatttttt actgagtaat gtttcgatat tttctttaca ccaaaaaata tcatcactga     5580

aaataaacac gtcacgtatc attgccaaat cgcgtatttt atttaaagct tttttgtaat     5640

actctaacga acaaacgcca tgagttaaag tagctgtttt gttttttata taatctcctc     5700

ttcttatatg aatagaaagt gatgattgag attcaagaat ttttgctgca agtaaatttg     5760

cttgttcaga cacattcttt ggaataaaaa attcttttag atctaatata tgtttatgga     5820

aaaagtgctc agattgccaa taccctatat attttttgga tttccatttt tgcgctatat     5880

attcaaaatc ataaccatag gcatgaaatt cattgcaaaa acctaaaaaa agaaagattt     5940

caggatataa tcttgaccca cgaaccaaaa atttataaat attattaatt tttggtgtgt     6000

aatactgtaa atattcctct ggaatttgta gattgtttag cctgtaacca ccatgatcat     6060

cattttcagc ataatgactt atatcaaaat ataatggtgt cccattaatt ttggaaagcg     6120

catacccaaa tgagaactga aaaagttgat ttccaagtcc gccttgtaat cttataatag     6180

acattatatc tccttcttg                                                  6199


<210>  17
<211>  6170
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pG216

<400>  17
tctagaattc taaaaattga ttgaatgtat gcaaataaat gcatacacca taggtgtggt       60

ttaatttgat gccctttttc agggctggaa tgtgtaagag cggggttatt tatgctgttg      120

tttttttgtt actcgggaag ggctttacct cttccgcata aacgcttcca tcagcgttta      180

tagttaaaaa aatctttcgg aactggtttt gcgcttaccc caaccaacag gggatttgct      240

gctttccatt gagcctgttt ctctgcgcga cgttcgcggc ggcgtgtttg tgcatccatc      300

tggattctcc tgtcagttag ctttggtggt gtgtggcagt tgtagtcctg aacgaaaacc      360

ccccgcgatt ggcacattgg cagctaatcc ggaatcgcac ttacggccaa tgcttcgttt      420

cgtatcacac accccaaagc cttctgcttt gaatgctgcc cttcttcagg gcttaatttt      480

taagagcgtc accttcatgg tggtcagtgc gtcctgctga tgtgctcagt atcaccgcca      540

gtggtattta tgtcaacacc gccagagata atttatcacc gcagatggtt atctgtatgt      600

tttttatatg aatttatttt ttgcaggggg gcattgtttg gtaggtgaga gatcaattct      660

gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc      720

ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca      780

ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg      840

agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca      900

taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa      960

cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc     1020

tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc     1080

gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct     1140

gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg     1200

tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag     1260

gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta     1320

cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg     1380

aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt     1440

tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt     1500

ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag     1560

attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat     1620

ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc     1680

tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat     1740

aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc     1800

acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag     1860

aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag     1920

agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt     1980

ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg     2040

agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt     2100

tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc     2160

tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc     2220

attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa     2280

taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg     2340

aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc     2400

caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag     2460

gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt     2520

cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt     2580

tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc     2640

acctgacgtc taagaaacca ttattatcat gacattaacc tataaaaata ggcgtatcac     2700

gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct     2760

cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg     2820

cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat cagagcagat     2880

tgtactgaga gtgcaccata tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa     2940

taccgcatca ggcgcctcct caacctgtat attcgtaaac cacgcccaat gggagctgtc     3000

tcaggtttgt tcctgattgg ttacggcgcg tttcgcatca ttgttgagtt tttccgccag     3060

cccgacgcgc agtttaccgg tgcctgggtg cagtacatca gcatggggca aattctttcc     3120

atcccgatga ttgtcgcggg tgtgatcatg atggtctggg catatcgtcg cagcccacag     3180

caacacgttt cctgaggaac catgaaacag tatttagaac tgatgcaaaa agtgctcgac     3240

gaaggcacac agaaaaacga ccgtaccgga accggaacgc tttccatttt tggtcatcag     3300

atgcgtttta acctgcaaga tggattcccg ctggtgacaa ctaaacgttg ccacctgcgt     3360

tccatcatcc atgaactgct gtggtttctg cagggcgaca ctaacattgc ttatctacac     3420

gaaaacaatg tcaccatctg ggacgaatgg gccgatgaaa acggcgacct cgggccagtg     3480

tatggtaaac agtggcgcgc ctggccaacg ccagatggtc gtcatattga ccagatcact     3540

acggtactga accagctgaa aaacgacccg gattcgcgcc gcattattgt ttcagcgtgg     3600

aacgtaggcg aactggataa aatggcgctg gcaccgtgcc atgcattctt ccagttctat     3660

gtggcagacg gcaaactctc ttgccagctt tatcagcgct cctgtgacgt cttcctcggc     3720

ctgccgttca acattgccag ctacgcgtta ttggtgcata tgatggcgca gcagtgcgat     3780

ctggaagtgg gtgattttgt ctggaccggt ggcgacacgc atctgtacag caaccatatg     3840

gatcaaactc atctgcaatt aagccgcgaa ccgcgtccgc tgccgaagtt gattatcaaa     3900

cgtaaacccg aatccatctt cgactaccgt ttcgaagact ttgagattga aggctacgat     3960

ccgcatccgg gcattaaagc gccggtggct atctaattac gaaacatcct gccagagccg     4020

acgccagtgt gcgtcggttt ttttaccctc cgttaaattc ttcgagacgc cttcccgaag     4080

gcgccattcg ccattcaggc tgcgcaactg ttgggaaggg cgatcggtgc gggcctcttc     4140

gctattacgc cagctggcga aagggggatg tgctgcaagg cgattaagtt gggtaacgcc     4200

agggttttcc cagtcacgac gttgtaaaac gacggccagt gccaagcttt ctttaatgaa     4260

gcagggcatc aggacggtat ctttgtggag aaagcagagt aatcttattc agcctgactg     4320

gtgggaaacc accagtcaga atgtgttagc gcatgttgac aaaaatacca ttagtcacat     4380

tatccgtcag tcggacgaca tggtagataa cctgtttatt atgcgttttg atcttacgtt     4440

taatattacc tttatgcgat gaaacggtct tggctttgat attcatttgg tcagagattt     4500

gaatggttcc ctgacctgcc atccacattc gcaacatact cgattcggtt cggctcaatg     4560

ataacgtcgg catatttaaa aacgaggtta tcgttgtctc ttttttcaga atatcgccaa     4620

ggatatcgtc gagagattcc ggtttaatcg atttagaact gatcaataaa ttttttctga     4680

ccaatagata ttcatcaaaa tgaacattgg caattgccat aaaaacgata aataacgtat     4740

tgggatgttg attaatgatg agcttgatac gctgactgtt agaagcatcg tggatgaaac     4800

agtcctcatt aataaacacc actgaagggc gctgtgaatc acaagctatg gcaaggtcat     4860

caacggtttc aatgtcgttg atttctcttt ttttaacccc tctactcaac agatacccgg     4920

ttaaacctag tcgggtgtaa ctacataaat ccataataat cgttgacatg gcataccctc     4980

actcaatgcg taacgataat tccccttacc tgaatatttc atcatgacta aacggaacaa     5040

catgggtcac ctaatgcgcc actctcgcga tttttcaggc ggacttacta tcccgtaaag     5100

tgttgtataa tttgcctgga attgtcttaa agtaaagtaa atgttgcgat atgtgagtga     5160

gcttaaaaca aatatttcgc tgcaggagta tcctggaaga tgttcgtaga agcttactgc     5220

tcacaagaaa aaaggcacgt catctgacgt gcctttttta tttgtactac cctgtacgat     5280

tactgcagct cgagttagga tttcgtttcg aattgggatt cgattttaac ccagtctttg     5340

cacaggatgt tttcgttacc gtaaatccag tgggacggac caatgataat tttttccgga     5400

tttttgatca ggtaggctgc ccaccaggag taagtgctgt tagtgatgat accgtgtttg     5460

caagactgca tcagcatcat gtcccagtgg gctgcaccat cacgcgtcgt catgtcaaca     5520

aacgggtaac ccagatccag gttctgtacg aattccagat cctcgcagaa caggaacagt     5580

tccagatttt gaacacgttt tgccatatac gcaatggcgc gcagctggta ggagatgtcc     5640

agctgccagc ccaggcgcat gtaatcgcca cggcggatgt gaacgaacac agagtttttc     5700

gcagccagga tctgggacag tttacgagag tactgttccg cgtgttcggt cgggtgaggc     5760

agggtgaaag tttgtttgat cagaggggag atatcttcga aatagcgcgg gtcctgaaag     5820

tagccatgga aatacgcaat gcggctcggt tcaaacagtt ccggcatgta ctcgaataca     5880

atttctttgc taacgcggcc cagacccata cgacgcagtg caccacgcac cagacgcggc     5940

aggttctgca tgtgtgccgc ggcgatctgc tgggcggacg cacactgcag gtcgatcggg     6000

aacaggtgca ggcccagttc acggttaccg taatcgaacc aagtggtatc cagcagtacc     6060

ggaatgttca ggtgagtctg cagagattta gcgaatgcgt actggaacat ctggttaccc     6120

aggccgccgt gcacctgaac gattttgaaa tccattatat ctccttcttg                6170


<210>  18
<211>  6155
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pG217

<400>  18
tctagaattc taaaaattga ttgaatgtat gcaaataaat gcatacacca taggtgtggt       60

ttaatttgat gccctttttc agggctggaa tgtgtaagag cggggttatt tatgctgttg      120

tttttttgtt actcgggaag ggctttacct cttccgcata aacgcttcca tcagcgttta      180

tagttaaaaa aatctttcgg aactggtttt gcgcttaccc caaccaacag gggatttgct      240

gctttccatt gagcctgttt ctctgcgcga cgttcgcggc ggcgtgtttg tgcatccatc      300

tggattctcc tgtcagttag ctttggtggt gtgtggcagt tgtagtcctg aacgaaaacc      360

ccccgcgatt ggcacattgg cagctaatcc ggaatcgcac ttacggccaa tgcttcgttt      420

cgtatcacac accccaaagc cttctgcttt gaatgctgcc cttcttcagg gcttaatttt      480

taagagcgtc accttcatgg tggtcagtgc gtcctgctga tgtgctcagt atcaccgcca      540

gtggtattta tgtcaacacc gccagagata atttatcacc gcagatggtt atctgtatgt      600

tttttatatg aatttatttt ttgcaggggg gcattgtttg gtaggtgaga gatcaattct      660

gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc      720

ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca      780

ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg      840

agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca      900

taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa      960

cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc     1020

tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc     1080

gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct     1140

gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg     1200

tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag     1260

gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta     1320

cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg     1380

aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt     1440

tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt     1500

ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag     1560

attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat     1620

ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc     1680

tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat     1740

aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc     1800

acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag     1860

aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag     1920

agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt     1980

ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg     2040

agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt     2100

tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc     2160

tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc     2220

attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa     2280

taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg     2340

aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc     2400

caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag     2460

gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt     2520

cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt     2580

tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc     2640

acctgacgtc taagaaacca ttattatcat gacattaacc tataaaaata ggcgtatcac     2700

gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct     2760

cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg     2820

cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat cagagcagat     2880

tgtactgaga gtgcaccata tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa     2940

taccgcatca ggcgcctcct caacctgtat attcgtaaac cacgcccaat gggagctgtc     3000

tcaggtttgt tcctgattgg ttacggcgcg tttcgcatca ttgttgagtt tttccgccag     3060

cccgacgcgc agtttaccgg tgcctgggtg cagtacatca gcatggggca aattctttcc     3120

atcccgatga ttgtcgcggg tgtgatcatg atggtctggg catatcgtcg cagcccacag     3180

caacacgttt cctgaggaac catgaaacag tatttagaac tgatgcaaaa agtgctcgac     3240

gaaggcacac agaaaaacga ccgtaccgga accggaacgc tttccatttt tggtcatcag     3300

atgcgtttta acctgcaaga tggattcccg ctggtgacaa ctaaacgttg ccacctgcgt     3360

tccatcatcc atgaactgct gtggtttctg cagggcgaca ctaacattgc ttatctacac     3420

gaaaacaatg tcaccatctg ggacgaatgg gccgatgaaa acggcgacct cgggccagtg     3480

tatggtaaac agtggcgcgc ctggccaacg ccagatggtc gtcatattga ccagatcact     3540

acggtactga accagctgaa aaacgacccg gattcgcgcc gcattattgt ttcagcgtgg     3600

aacgtaggcg aactggataa aatggcgctg gcaccgtgcc atgcattctt ccagttctat     3660

gtggcagacg gcaaactctc ttgccagctt tatcagcgct cctgtgacgt cttcctcggc     3720

ctgccgttca acattgccag ctacgcgtta ttggtgcata tgatggcgca gcagtgcgat     3780

ctggaagtgg gtgattttgt ctggaccggt ggcgacacgc atctgtacag caaccatatg     3840

gatcaaactc atctgcaatt aagccgcgaa ccgcgtccgc tgccgaagtt gattatcaaa     3900

cgtaaacccg aatccatctt cgactaccgt ttcgaagact ttgagattga aggctacgat     3960

ccgcatccgg gcattaaagc gccggtggct atctaattac gaaacatcct gccagagccg     4020

acgccagtgt gcgtcggttt ttttaccctc cgttaaattc ttcgagacgc cttcccgaag     4080

gcgccattcg ccattcaggc tgcgcaactg ttgggaaggg cgatcggtgc gggcctcttc     4140

gctattacgc cagctggcga aagggggatg tgctgcaagg cgattaagtt gggtaacgcc     4200

agggttttcc cagtcacgac gttgtaaaac gacggccagt gccaagcttt ctttaatgaa     4260

gcagggcatc aggacggtat ctttgtggag aaagcagagt aatcttattc agcctgactg     4320

gtgggaaacc accagtcaga atgtgttagc gcatgttgac aaaaatacca ttagtcacat     4380

tatccgtcag tcggacgaca tggtagataa cctgtttatt atgcgttttg atcttacgtt     4440

taatattacc tttatgcgat gaaacggtct tggctttgat attcatttgg tcagagattt     4500

gaatggttcc ctgacctgcc atccacattc gcaacatact cgattcggtt cggctcaatg     4560

ataacgtcgg catatttaaa aacgaggtta tcgttgtctc ttttttcaga atatcgccaa     4620

ggatatcgtc gagagattcc ggtttaatcg atttagaact gatcaataaa ttttttctga     4680

ccaatagata ttcatcaaaa tgaacattgg caattgccat aaaaacgata aataacgtat     4740

tgggatgttg attaatgatg agcttgatac gctgactgtt agaagcatcg tggatgaaac     4800

agtcctcatt aataaacacc actgaagggc gctgtgaatc acaagctatg gcaaggtcat     4860

caacggtttc aatgtcgttg atttctcttt ttttaacccc tctactcaac agatacccgg     4920

ttaaacctag tcgggtgtaa ctacataaat ccataataat cgttgacatg gcataccctc     4980

actcaatgcg taacgataat tccccttacc tgaatatttc atcatgacta aacggaacaa     5040

catgggtcac ctaatgcgcc actctcgcga tttttcaggc ggacttacta tcccgtaaag     5100

tgttgtataa tttgcctgga attgtcttaa agtaaagtaa atgttgcgat atgtgagtga     5160

gcttaaaaca aatatttcgc tgcaggagta tcctggaaga tgttcgtaga agcttactgc     5220

tcacaagaaa aaaggcacgt catctgacgt gcctttttta tttgtactac cctgtacgat     5280

tactgcagct cgagttagga taccggcact ttgatccaac cagtcgggta gatatccggt     5340

gcttcggagt gctggaacca acggctcggc acaataacag tcttatccat attagggttc     5400

agccaggcac cccaccaaga aaacgtgctg ttacaaatga tgtgatgttt gcaatgagac     5460

atcagcatca tatcctgcca ggagtcttca tcagtgttcc agtcaatata aaccgcattc     5520

tgcagtggca gattttcttt aacccacgcg atatcgtcgg agaagatata gtaagatggg     5580

ctagcaacac gacgggacat ttccgcgata gcattctggt aatacggcag ctggcacacg     5640

gaaccggtag tagcccagtg tttcggctgc agatagtcac cacgacgaat gtgcagggaa     5700

accgcgtttt catctttgtc caggatttcc agcatgttca ggctgcggga atttgctttg     5760

ttcttatcaa aggtgaagga ttcacgcact tcgtctttga tatcagcgaa gaaacgctcg     5820

ctctgataga aacctttaaa gtacagcagc ggccagaaat acttcttctc gaacgcacgc     5880

agagagttcg gcgcctgctt gcgttcgtag atttttttaa aaaacaggaa ttcgataact     5940

tttttcagcg gttggttgat gcagaattcg gtgtgcggca ggttgaacac gcggtgcatt     6000

tcgtaaccgt aatggacttt gtaatgcatc atgtcgctca ggtcgatacg gaccttcggg     6060

taatactttt tcatacgcag atagaaagca tagataaaca tctggttgcc cagaccgcca     6120

gtcactttga tcagacgcat tatatctcct tcttg                                6155


<210>  19
<211>  5048
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  lacZ insertion into the lon gene

<400>  19
gtccatggaa gacgtcgaaa aagtggttat cgacgagtcg gtaattgatg gtcaaagcaa       60

accgttgctg atttatggca agccggaagc gcaacaggca tctggtgaat aattaaccat      120

tcccatacaa ttagttaacc aaaaaggggg gattttatct cccctttaat ttttcctcta      180

ttctcggcgt tgaatgtggg ggaaacatcc ccatatactg acgtacatgt taatagatgg      240

cgtgaagcac agtcgtgtca tctgattacc tggcggaaat taaactaaga gagagctcta      300

tgattccggg gatccgtcga cctgcagttc gaagttccta ttctctagaa agtataggaa      360

cttcagagcg cttttgaagc tcacgctgcc gcaagcactc agggcgcaag ggctgctaaa      420

ggaagcggaa cacgtagaaa gccagtccgc agaaacggtg ctgaccccgg atgaatgtca      480

gctactgggc tatctggaca agggaaaacg caagcgcaaa gagaaagcag gtagcttgca      540

gtgggcttac atggcgatag ctagactggg cggttttatg gacagcaagc gaaccggaat      600

tgccagctgg ggcgccctct ggtaaggttg ggaagccctg caaagtaaac tggatggctt      660

tcttgccgcc aaggatctga tggcgcaggg gatcaagatc tgatcaagag acaggatgag      720

gatcgtttcg catgattgaa caagatggat tgcacgcagg ttctccggcc gcttgggtgg      780

agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat gccgccgtgt      840

tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaccgacctg tccggtgccc      900

tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt      960

gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga ctggctgcta ttgggcgaag     1020

tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta tccatcatgg     1080

ctgatgcaat gcggcggctg catacgcttg atccggctac ctgcccattc gaccaccaag     1140

cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc gatcaggatg     1200

atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc     1260

gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca     1320

tggtggaaaa tggccgcttt tctggattca tcgactgtgg ccggctgggt gtggcggacc     1380

gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc ggcgaatggg     1440

ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc atcgccttct     1500

atcgccttct tgacgagttc ttctaataag gggatcttga agttcctatt ccgaagttcc     1560

tattctctag aaagtatagg aacttcgaag cagctccagc ctacataagc ggccgcttat     1620

ttttgacacc agaccaactg gtaatggtag cgaccggcgc tcagctggaa ttccgccgat     1680

actgacgggc tccaggagtc gtcgccacca atccccatat ggaaaccgtc gatattcagc     1740

catgtgcctt cttccgcgtg cagcagatgg cgatggctgg tttccatcag ttgctgttga     1800

ctgtagcggc tgatgttgaa ctggaagtcg ccgcgccact ggtgtgggcc ataattcaat     1860

tcgcgcgtcc cgcagcgcag accgttttcg ctcgggaaga cgtacggggt atacatgtct     1920

gacaatggca gatcccagcg gtcaaaacag gcggcagtaa ggcggtcggg atagttttct     1980

tgcggcccta atccgagcca gtttacccgc tctgctacct gcgccagctg gcagttcagg     2040

ccaatccgcg ccggatgcgg tgtatcgctc gccacttcaa catcaacggt aatcgccatt     2100

tgaccactac catcaatccg gtaggttttc cggctgataa ataaggtttt cccctgatgc     2160

tgccacgcgt gagcggtcgt aatcagcacc gcatcagcaa gtgtatctgc cgtgcactgc     2220

aacaacgctg cttcggcctg gtaatggccc gccgccttcc agcgttcgac ccaggcgtta     2280

gggtcaatgc gggtcgcttc acttacgcca atgtcgttat ccagcggtgc acgggtgaac     2340

tgatcgcgca gcggcgtcag cagttgtttt ttatcgccaa tccacatctg tgaaagaaag     2400

cctgactggc ggttaaattg ccaacgctta ttacccagct cgatgcaaaa atccatttcg     2460

ctggtggtca gatgcgggat ggcgtgggac gcggcgggga gcgtcacact gaggttttcc     2520

gccagacgcc actgctgcca ggcgctgatg tgcccggctt ctgaccatgc ggtcgcgttc     2580

ggttgcacta cgcgtactgt gagccagagt tgcccggcgc tctccggctg cggtagttca     2640

ggcagttcaa tcaactgttt accttgtgga gcgacatcca gaggcacttc accgcttgcc     2700

agcggcttac catccagcgc caccatccag tgcaggagct cgttatcgct atgacggaac     2760

aggtattcgc tggtcacttc gatggtttgc ccggataaac ggaactggaa aaactgctgc     2820

tggtgttttg cttccgtcag cgctggatgc ggcgtgcggt cggcaaagac cagaccgttc     2880

atacagaact ggcgatcgtt cggcgtatcg ccaaaatcac cgccgtaagc cgaccacggg     2940

ttgccgtttt catcatattt aatcagcgac tgatccaccc agtcccagac gaagccgccc     3000

tgtaaacggg gatactgacg aaacgcctgc cagtatttag cgaaaccgcc aagactgtta     3060

cccatcgcgt gggcgtattc gcaaaggatc agcgggcgcg tctctccagg tagcgaaagc     3120

cattttttga tggaccattt cggcacagcc gggaagggct ggtcttcatc cacgcgcgcg     3180

tacatcgggc aaataatatc ggtggccgtg gtgtcggctc cgccgccttc atactgcacc     3240

gggcgggaag gatcgacaga tttgatccag cgatacagcg cgtcgtgatt agcgccgtgg     3300

cctgattcat tccccagcga ccagatgatc acactcgggt gattacgatc gcgctgcacc     3360

attcgcgtta cgcgttcgct catcgccggt agccagcgcg gatcatcggt cagacgattc     3420

attggcacca tgccgtgggt ttcaatattg gcttcatcca ccacatacag gccgtagcgg     3480

tcgcacagcg tgtaccacag cggatggttc ggataatgcg aacagcgcac ggcgttaaag     3540

ttgttctgct tcatcagcag gatatcctgc accatcgtct gctcatccat gacctgacca     3600

tgcagaggat gatgctcgtg acggttaacg cctcgaatca gcaacggctt gccgttcagc     3660

agcagcagac cattttcaat ccgcacctcg cggaaaccga catcgcaggc ttctgcttca     3720

atcagcgtgc cgtcggcggt gtgcagttca accaccgcac gatagagatt cgggatttcg     3780

gcgctccaca gtttcgggtt ttcgacgttc agacgtagtg tgacgcgatc ggcataacca     3840

ccacgctcat cgataatttc accgccgaaa ggcgcggtgc cgctggcgac ctgcgtttca     3900

ccctgccata aagaaactgt tacccgtagg tagtcacgca actcgccgca catctgaact     3960

tcagcctcca gtacagcgcg gctgaaatca tcattaaagc gagtggcaac atggaaatcg     4020

ctgatttgtg tagtcggttt atgcagcaac gagacgtcac ggaaaatgcc gctcatccgc     4080

cacatatcct gatcttccag ataactgccg tcactccagc gcagcaccat caccgcgagg     4140

cggttttctc cggcgcgtaa aaatgcgctc aggtcaaatt cagacggcaa acgactgtcc     4200

tggccgtaac cgacccagcg cccgttgcac cacagatgaa acgccgagtt aacgccatca     4260

aaaataattc gcgtctggcc ttcctgtagc cagctttcat caacattaaa tgtgagcgag     4320

taacaacccg tcggattctc cgtgggaaca aacggcggat tgaccgtaat gggataggtc     4380

acgttggtgt agatgggcgc atcgtaaccg tgcatctgcc agtttgaggg gacgacgaca     4440

gtatcggcct caggaagatc gcactccagc cagctttccg gcaccgcttc tggtgccgga     4500

aaccaggcaa agcgccattc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg     4560

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt     4620

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaatccgta     4680

atcatggtca tagtaggttt cctcaggttg tgactgcaaa atagtgacct cgcgcaaaat     4740

gcactaataa aaacagggct ggcaggctaa ttcgggcttg ccagcctttt tttgtctcgc     4800

taagttagat ggcggatcgg gcttgccctt attaaggggt gttgtaaggg gatggctggc     4860

ctgatataac tgctgcgcgt tcgtaccttg aaggattcaa gtgcgatata aattataaag     4920

aggaagagaa gagtgaataa atctcaattg atcgacaaga ttgctgcagg ggctgatatc     4980

tctaaagctg cggctggccg tgcgttagat gctattattg cttccgtaac tgaatctctg     5040

aaagaagg                                                              5048


<210>  20
<211>  3075
<212>  DNA
<213>  Escherichia coli

<400>  20
atgaccatga ttacggattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct       60

ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc      120

gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc      180

tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct      240

gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc      300

tacaccaacg tgacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg      360

acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg      420

cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc      480

ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc      540

ggagaaaacc gcctcgcggt gatggtgctg cgctggagtg acggcagtta tctggaagat      600

caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact      660

acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta      720

ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct      780

ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc      840

gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa      900

ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac      960

ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat     1020

ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat     1080

catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg     1140

aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac     1200

acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc     1260

atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc     1320

gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg     1380

aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat     1440

ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt     1500

tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc     1560

atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc     1620

cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat     1680

ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat     1740

gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc     1800

cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa     1860

gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc     1920

agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat     1980

ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg     2040

attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc     2100

gtagtgcaac cgaacgcgac cgcatggtca gaagccgggc acatcagcgc ctggcagcag     2160

tggcgtctgg cggaaaacct cagtgtgacg ctccccgccg cgtcccacgc catcccgcat     2220

ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac     2280

cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg     2340

ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc     2400

cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa     2460

gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct     2520

cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat     2580

ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg     2640

gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga     2700

ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat     2760

ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc     2820

gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc     2880

agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa     2940

gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg     3000

agcccgtcag tatcggcgga attccagctg agcgccggtc gctaccatta ccagttggtc     3060

tggtgtcaaa aataa                                                      3075


<210>  21
<211>  894
<212>  DNA
<213>  Escherichia coli

<400>  21
atgtctatta taagattaca aggcggactt ggaaatcaac tttttcagtt ctcatttggg       60

tatgcgcttt ccaaaattaa tgggacacca ttatattttg atataagtca ttatgctgaa      120

aatgatgatc atggtggtta caggctaaac aatctacaaa ttccagagga atatttacag      180

tattacacac caaaaattaa taatatttat aaatttttgg ttcgtgggtc aagattatat      240

cctgaaatct ttcttttttt aggtttttgc aatgaatttc atgcctatgg ttatgatttt      300

gaatatatag cgcaaaaatg gaaatccaaa aaatatatag ggtattggca atctgagcac      360

tttttccata aacatatatt agatctaaaa gaatttttta ttccaaagaa tgtgtctgaa      420

caagcaaatt tacttgcagc aaaaattctt gaatctcaat catcactttc tattcatata      480

agaagaggag attatataaa aaacaaaaca gctactttaa ctcatggcgt ttgttcgtta      540

gagtattaca aaaaagcttt aaataaaata cgcgatttgg caatgatacg tgacgtgttt      600

attttcagtg atgatatttt ttggtgtaaa gaaaatatcg aaacattact cagtaaaaaa      660

tataatatat attattcaga agatttatca caagaagaag atttatggtt aatgagctta      720

gctaaccatc atattatagc gaatagtagt tttagttggt ggggggctta tttaggtaca      780

tcagcgtcac agattgttat ttatcctact ccttggtacg atataactcc aaaaaatact      840

tatatcccca tagtcaatca ctggataaac gtggataaac atagctcgtg ttag            894


<210>  22
<211>  861
<212>  DNA
<213>  Helicobacter mustelae

<400>  22
atggatttca aaatcgttca ggtgcacggc ggcctgggta accagatgtt ccagtacgca       60

ttcgctaaat ctctgcagac tcacctgaac attccggtac tgctggatac cacttggttc      120

gattacggta accgtgaact gggcctgcac ctgttcccga tcgacctgca gtgtgcgtcc      180

gcccagcaga tcgccgcggc acacatgcag aacctgccgc gtctggtgcg tggtgcactg      240

cgtcgtatgg gtctgggccg cgttagcaaa gaaattgtat tcgagtacat gccggaactg      300

tttgaaccga gccgcattgc gtatttccat ggctactttc aggacccgcg ctatttcgaa      360

gatatctccc ctctgatcaa acaaactttc accctgcctc acccgaccga acacgcggaa      420

cagtactctc gtaaactgtc ccagatcctg gctgcgaaaa actctgtgtt cgttcacatc      480

cgccgtggcg attacatgcg cctgggctgg cagctggaca tctcctacca gctgcgcgcc      540

attgcgtata tggcaaaacg tgttcaaaat ctggaactgt tcctgttctg cgaggatctg      600

gaattcgtac agaacctgga tctgggttac ccgtttgttg acatgacgac gcgtgatggt      660

gcagcccact gggacatgat gctgatgcag tcttgcaaac acggtatcat cactaacagc      720

acttactcct ggtgggcagc ctacctgatc aaaaatccgg aaaaaattat cattggtccg      780

tcccactgga tttacggtaa cgaaaacatc ctgtgcaaag actgggttaa aatcgaatcc      840

caattcgaaa cgaaatccta a                                                861


<210>  23
<211>  846
<212>  DNA
<213>  Bacteroides vulgatus

<400>  23
atgcgtctga tcaaagtgac tggcggtctg ggcaaccaga tgtttatcta tgctttctat       60

ctgcgtatga aaaagtatta cccgaaggtc cgtatcgacc tgagcgacat gatgcattac      120

aaagtccatt acggttacga aatgcaccgc gtgttcaacc tgccgcacac cgaattctgc      180

atcaaccaac cgctgaaaaa agttatcgaa ttcctgtttt ttaaaaaaat ctacgaacgc      240

aagcaggcgc cgaactctct gcgtgcgttc gagaagaagt atttctggcc gctgctgtac      300

tttaaaggtt tctatcagag cgagcgtttc ttcgctgata tcaaagacga agtgcgtgaa      360

tccttcacct ttgataagaa caaagcaaat tcccgcagcc tgaacatgct ggaaatcctg      420

gacaaagatg aaaacgcggt ttccctgcac attcgtcgtg gtgactatct gcagccgaaa      480

cactgggcta ctaccggttc cgtgtgccag ctgccgtatt accagaatgc tatcgcggaa      540

atgtcccgtc gtgttgctag cccatcttac tatatcttct ccgacgatat cgcgtgggtt      600

aaagaaaatc tgccactgca gaatgcggtt tatattgact ggaacactga tgaagactcc      660

tggcaggata tgatgctgat gtctcattgc aaacatcaca tcatttgtaa cagcacgttt      720

tcttggtggg gtgcctggct gaaccctaat atggataaga ctgttattgt gccgagccgt      780

tggttccagc actccgaagc accggatatc tacccgactg gttggatcaa agtgccggta      840

tcctaa                                                                 846


