                         SEQUENCE LISTING

<110>  OXFORD NANOPORE TECHNOLOGIES LIMITED
 
<120>  MUTANT PORE

<130>  N407106WO

<140>  
<141>  

<150>  GB 1605899.2
<151>  2016-04-06

<150>  GB 1608274.5
<151>  2016-05-11

<160>  24    

<170>  PatentIn version 3.5

<210>  1
<211>  897
<212>  DNA
<213>  Eisenia fetida

<400>  1
atgagtgcga aggctgctga aggttatgaa caaatcgaag ttgatgtggt tgctgtgtgg       60

aaggaaggtt atgtgtatga aaatcgtggt agtacctccg tggatcaaaa aattaccatc      120

acgaaaggca tgaagaacgt taatagcgaa acccgtacgg tcaccgcgac gcattctatt      180

ggcagtacca tctccacggg tgacgccttt gaaatcggct ccgtggaagt ttcatattcg      240

catagccacg aagaatcaca agtttcgatg accgaaacgg aagtctacga atcaaaagtg      300

attgaacaca ccattacgat cccgccgacc tcgaagttca cgcgctggca gctgaacgca      360

gatgtcggcg gtgctgacat tgaatatatg tacctgatcg atgaagttac cccgattggc      420

ggtacgcaga gtattccgca agtgatcacc tcccgtgcaa aaattatcgt tggtcgccag      480

attatcctgg gcaagaccga aattcgtatc aaacatgctg aacgcaagga atatatgacc      540

gtggttagcc gtaaatcttg gccggcggcc acgctgggtc acagtaaact gtttaagttc      600

gtgctgtacg aagattgggg cggttttcgc atcaaaaccc tgaatacgat gtattctggt      660

tatgaatacg cgtatagctc tgaccagggc ggtatctact tcgatcaagg caccgacaac      720

ccgaaacagc gttgggccat taataagagc ctgccgctgc gccatggtga tgtcgtgacc      780

tttatgaaca aatacttcac gcgttctggt ctgtgctatg atgacggccc ggcgaccaat      840

gtgtattgtc tggataaacg cgaagacaag tggattctgg aagttgtcgg ctaatga         897


<210>  2
<211>  297
<212>  PRT
<213>  Eisenia fetida

<400>  2

Met Ser Ala Lys Ala Ala Glu Gly Tyr Glu Gln Ile Glu Val Asp Val 
1               5                   10                  15      


Val Ala Val Trp Lys Glu Gly Tyr Val Tyr Glu Asn Arg Gly Ser Thr 
            20                  25                  30          


Ser Val Asp Gln Lys Ile Thr Ile Thr Lys Gly Met Lys Asn Val Asn 
        35                  40                  45              


Ser Glu Thr Arg Thr Val Thr Ala Thr His Ser Ile Gly Ser Thr Ile 
    50                  55                  60                  


Ser Thr Gly Asp Ala Phe Glu Ile Gly Ser Val Glu Val Ser Tyr Ser 
65                  70                  75                  80  


His Ser His Glu Glu Ser Gln Val Ser Met Thr Glu Thr Glu Val Tyr 
                85                  90                  95      


Glu Ser Lys Val Ile Glu His Thr Ile Thr Ile Pro Pro Thr Ser Lys 
            100                 105                 110         


Phe Thr Arg Trp Gln Leu Asn Ala Asp Val Gly Gly Ala Asp Ile Glu 
        115                 120                 125             


Tyr Met Tyr Leu Ile Asp Glu Val Thr Pro Ile Gly Gly Thr Gln Ser 
    130                 135                 140                 


Ile Pro Gln Val Ile Thr Ser Arg Ala Lys Ile Ile Val Gly Arg Gln 
145                 150                 155                 160 


Ile Ile Leu Gly Lys Thr Glu Ile Arg Ile Lys His Ala Glu Arg Lys 
                165                 170                 175     


Glu Tyr Met Thr Val Val Ser Arg Lys Ser Trp Pro Ala Ala Thr Leu 
            180                 185                 190         


Gly His Ser Lys Leu Phe Lys Phe Val Leu Tyr Glu Asp Trp Gly Gly 
        195                 200                 205             


Phe Arg Ile Lys Thr Leu Asn Thr Met Tyr Ser Gly Tyr Glu Tyr Ala 
    210                 215                 220                 


Tyr Ser Ser Asp Gln Gly Gly Ile Tyr Phe Asp Gln Gly Thr Asp Asn 
225                 230                 235                 240 


Pro Lys Gln Arg Trp Ala Ile Asn Lys Ser Leu Pro Leu Arg His Gly 
                245                 250                 255     


Asp Val Val Thr Phe Met Asn Lys Tyr Phe Thr Arg Ser Gly Leu Cys 
            260                 265                 270         


Tyr Asp Asp Gly Pro Ala Thr Asn Val Tyr Cys Leu Asp Lys Arg Glu 
        275                 280                 285             


Asp Lys Trp Ile Leu Glu Val Val Gly 
    290                 295         


<210>  3
<211>  1830
<212>  DNA
<213>  Bacteriophage phi-29

<400>  3
atgaaacaca tgccgcgtaa aatgtatagc tgcgcgtttg aaaccacgac caaagtggaa       60

gattgtcgcg tttgggccta tggctacatg aacatcgaag atcattctga atacaaaatc      120

ggtaacagtc tggatgaatt tatggcatgg gtgctgaaag ttcaggcgga tctgtacttc      180

cacaacctga aatttgatgg cgcattcatt atcaactggc tggaacgtaa tggctttaaa      240

tggagcgcgg atggtctgcc gaacacgtat aataccatta tctctcgtat gggccagtgg      300

tatatgattg atatctgcct gggctacaaa ggtaaacgca aaattcatac cgtgatctat      360

gatagcctga aaaaactgcc gtttccggtg aagaaaattg cgaaagattt caaactgacg      420

gttctgaaag gcgatattga ttatcacaaa gaacgtccgg ttggttacaa aatcaccccg      480

gaagaatacg catacatcaa aaacgatatc cagatcatcg cagaagcgct gctgattcag      540

tttaaacagg gcctggatcg catgaccgcg ggcagtgata gcctgaaagg tttcaaagat      600

atcatcacga ccaaaaaatt caaaaaagtg ttcccgacgc tgagcctggg tctggataaa      660

gaagttcgtt atgcctaccg cggcggtttt acctggctga acgatcgttt caaagaaaaa      720

gaaattggcg agggtatggt gtttgatgtt aatagtctgt atccggcaca gatgtacagc      780

cgcctgctgc cgtatggcga accgatcgtg ttcgagggta aatatgtttg ggatgaagat      840

tacccgctgc atattcagca catccgttgt gaatttgaac tgaaagaagg ctatattccg      900

accattcaga tcaaacgtag tcgcttctat aagggtaacg aatacctgaa aagctctggc      960

ggtgaaatcg cggatctgtg gctgagtaac gtggatctgg aactgatgaa agaacactac     1020

gatctgtaca acgttgaata catcagcggc ctgaaattta aagccacgac cggtctgttc     1080

aaagatttca tcgataaatg gacctacatc aaaacgacct ctgaaggcgc gattaaacag     1140

ctggccaaac tgatgctgaa cagcctgtat ggcaaattcg cctctaatcc ggatgtgacc     1200

ggtaaagttc cgtacctgaa agaaaatggc gcactgggtt ttcgcctggg cgaagaagaa     1260

acgaaagatc cggtgtatac cccgatgggt gttttcatta cggcctgggc acgttacacg     1320

accatcaccg cggcccaggc atgctatgat cgcattatct actgtgatac cgattctatt     1380

catctgacgg gcaccgaaat cccggatgtg attaaagata tcgttgatcc gaaaaaactg     1440

ggttattggg cccacgaaag tacgtttaaa cgtgcaaaat acctgcgcca gaaaacctac     1500

atccaggata tctacatgaa agaagtggat ggcaaactgg ttgaaggttc tccggatgat     1560

tacaccgata tcaaattcag tgtgaaatgc gccggcatga cggataaaat caaaaaagaa     1620

gtgaccttcg aaaacttcaa agttggtttc agccgcaaaa tgaaaccgaa accggtgcag     1680

gttccgggcg gtgtggttct ggtggatgat acgtttacca ttaaatctgg cggtagtgcg     1740

tggagccatc cgcagttcga aaaaggcggt ggctctggtg gcggttctgg cggtagtgcc     1800

tggagccacc cgcagtttga aaaataataa                                      1830


<210>  4
<211>  608
<212>  PRT
<213>  Bacteriophage phi-29

<400>  4

Met Lys His Met Pro Arg Lys Met Tyr Ser Cys Ala Phe Glu Thr Thr 
1               5                   10                  15      


Thr Lys Val Glu Asp Cys Arg Val Trp Ala Tyr Gly Tyr Met Asn Ile 
            20                  25                  30          


Glu Asp His Ser Glu Tyr Lys Ile Gly Asn Ser Leu Asp Glu Phe Met 
        35                  40                  45              


Ala Trp Val Leu Lys Val Gln Ala Asp Leu Tyr Phe His Asn Leu Lys 
    50                  55                  60                  


Phe Asp Gly Ala Phe Ile Ile Asn Trp Leu Glu Arg Asn Gly Phe Lys 
65                  70                  75                  80  


Trp Ser Ala Asp Gly Leu Pro Asn Thr Tyr Asn Thr Ile Ile Ser Arg 
                85                  90                  95      


Met Gly Gln Trp Tyr Met Ile Asp Ile Cys Leu Gly Tyr Lys Gly Lys 
            100                 105                 110         


Arg Lys Ile His Thr Val Ile Tyr Asp Ser Leu Lys Lys Leu Pro Phe 
        115                 120                 125             


Pro Val Lys Lys Ile Ala Lys Asp Phe Lys Leu Thr Val Leu Lys Gly 
    130                 135                 140                 


Asp Ile Asp Tyr His Lys Glu Arg Pro Val Gly Tyr Lys Ile Thr Pro 
145                 150                 155                 160 


Glu Glu Tyr Ala Tyr Ile Lys Asn Asp Ile Gln Ile Ile Ala Glu Ala 
                165                 170                 175     


Leu Leu Ile Gln Phe Lys Gln Gly Leu Asp Arg Met Thr Ala Gly Ser 
            180                 185                 190         


Asp Ser Leu Lys Gly Phe Lys Asp Ile Ile Thr Thr Lys Lys Phe Lys 
        195                 200                 205             


Lys Val Phe Pro Thr Leu Ser Leu Gly Leu Asp Lys Glu Val Arg Tyr 
    210                 215                 220                 


Ala Tyr Arg Gly Gly Phe Thr Trp Leu Asn Asp Arg Phe Lys Glu Lys 
225                 230                 235                 240 


Glu Ile Gly Glu Gly Met Val Phe Asp Val Asn Ser Leu Tyr Pro Ala 
                245                 250                 255     


Gln Met Tyr Ser Arg Leu Leu Pro Tyr Gly Glu Pro Ile Val Phe Glu 
            260                 265                 270         


Gly Lys Tyr Val Trp Asp Glu Asp Tyr Pro Leu His Ile Gln His Ile 
        275                 280                 285             


Arg Cys Glu Phe Glu Leu Lys Glu Gly Tyr Ile Pro Thr Ile Gln Ile 
    290                 295                 300                 


Lys Arg Ser Arg Phe Tyr Lys Gly Asn Glu Tyr Leu Lys Ser Ser Gly 
305                 310                 315                 320 


Gly Glu Ile Ala Asp Leu Trp Leu Ser Asn Val Asp Leu Glu Leu Met 
                325                 330                 335     


Lys Glu His Tyr Asp Leu Tyr Asn Val Glu Tyr Ile Ser Gly Leu Lys 
            340                 345                 350         


Phe Lys Ala Thr Thr Gly Leu Phe Lys Asp Phe Ile Asp Lys Trp Thr 
        355                 360                 365             


Tyr Ile Lys Thr Thr Ser Glu Gly Ala Ile Lys Gln Leu Ala Lys Leu 
    370                 375                 380                 


Met Leu Asn Ser Leu Tyr Gly Lys Phe Ala Ser Asn Pro Asp Val Thr 
385                 390                 395                 400 


Gly Lys Val Pro Tyr Leu Lys Glu Asn Gly Ala Leu Gly Phe Arg Leu 
                405                 410                 415     


Gly Glu Glu Glu Thr Lys Asp Pro Val Tyr Thr Pro Met Gly Val Phe 
            420                 425                 430         


Ile Thr Ala Trp Ala Arg Tyr Thr Thr Ile Thr Ala Ala Gln Ala Cys 
        435                 440                 445             


Tyr Asp Arg Ile Ile Tyr Cys Asp Thr Asp Ser Ile His Leu Thr Gly 
    450                 455                 460                 


Thr Glu Ile Pro Asp Val Ile Lys Asp Ile Val Asp Pro Lys Lys Leu 
465                 470                 475                 480 


Gly Tyr Trp Ala His Glu Ser Thr Phe Lys Arg Ala Lys Tyr Leu Arg 
                485                 490                 495     


Gln Lys Thr Tyr Ile Gln Asp Ile Tyr Met Lys Glu Val Asp Gly Lys 
            500                 505                 510         


Leu Val Glu Gly Ser Pro Asp Asp Tyr Thr Asp Ile Lys Phe Ser Val 
        515                 520                 525             


Lys Cys Ala Gly Met Thr Asp Lys Ile Lys Lys Glu Val Thr Phe Glu 
    530                 535                 540                 


Asn Phe Lys Val Gly Phe Ser Arg Lys Met Lys Pro Lys Pro Val Gln 
545                 550                 555                 560 


Val Pro Gly Gly Val Val Leu Val Asp Asp Thr Phe Thr Ile Lys Ser 
                565                 570                 575     


Gly Gly Ser Ala Trp Ser His Pro Gln Phe Glu Lys Gly Gly Gly Ser 
            580                 585                 590         


Gly Gly Gly Ser Gly Gly Ser Ala Trp Ser His Pro Gln Phe Glu Lys 
        595                 600                 605             


<210>  5
<211>  1390
<212>  DNA
<213>  Escherichia coli

<400>  5
atgatgaacg atggcaaaca gcagagcacc ttcctgtttc atgattatga aaccttcggt       60

acccatccgg ccctggatcg tccggcgcag tttgcggcca ttcgcaccga tagcgaattc      120

aatgtgattg gcgaaccgga agtgttttat tgcaaaccgg ccgatgatta tctgccgcag      180

ccgggtgcgg tgctgattac cggtattacc ccgcaggaag cgcgcgcgaa aggtgaaaac      240

gaagcggcgt ttgccgcgcg cattcatagc ctgtttaccg tgccgaaaac ctgcattctg      300

ggctataaca atgtgcgctt cgatgatgaa gttacccgta atatctttta tcgtaacttt      360

tatgatccgt atgcgtggag ctggcagcat gataacagcc gttgggatct gctggatgtg      420

atgcgcgcgt gctatgcgct gcgcccggaa ggcattaatt ggccggaaaa cgatgatggc      480

ctgccgagct ttcgtctgga acatctgacc aaagccaacg gcattgaaca tagcaatgcc      540

catgatgcga tggccgatgt ttatgcgacc attgcgatgg cgaaactggt taaaacccgt      600

cagccgcgcc tgtttgatta tctgtttacc caccgtaaca aacacaaact gatggcgctg      660

attgatgttc cgcagatgaa accgctggtg catgtgagcg gcatgtttgg cgcctggcgc      720

ggcaacacca gctgggtggc cccgctggcc tggcacccgg aaaatcgtaa cgccgtgatt      780

atggttgatc tggccggtga tattagcccg ctgctggaac tggatagcga taccctgcgt      840

gaacgcctgt ataccgccaa aaccgatctg ggcgataatg ccgccgtgcc ggtgaaactg      900

gttcacatta acaaatgccc ggtgctggcc caggcgaaca ccctgcgccc ggaagatgcg      960

gatcgtctgg gtattaatcg ccagcattgt ctggataatc tgaaaatcct gcgtgaaaac     1020

ccgcaggtgc gtgaaaaagt ggtggcgatc ttcgcggaag cggaaccgtt caccccgagc     1080

gataacgtgg atgcgcagct gtataacggc ttctttagcg atgccgatcg cgcggcgatg     1140

aaaatcgttc tggaaaccga accgcgcaat ctgccggcgc tggatattac ctttgttgat     1200

aaacgtattg aaaaactgct gtttaattat cgtgcgcgca attttccggg taccctggat     1260

tatgccgaac agcagcgttg gctggaacat cgtcgtcagg ttttcacccc ggaatttctg     1320

cagggttatg cggatgaact gcagatgctg gttcagcagt atgccgatga taaagaaaaa     1380

gtggcgctgc                                                            1390


<210>  6
<211>  485
<212>  PRT
<213>  Escherichia coli

<400>  6

Met Met Asn Asp Gly Lys Gln Gln Ser Thr Phe Leu Phe His Asp Tyr 
1               5                   10                  15      


Glu Thr Phe Gly Thr His Pro Ala Leu Asp Arg Pro Ala Gln Phe Ala 
            20                  25                  30          


Ala Ile Arg Thr Asp Ser Glu Phe Asn Val Ile Gly Glu Pro Glu Val 
        35                  40                  45              


Phe Tyr Cys Lys Pro Ala Asp Asp Tyr Leu Pro Gln Pro Gly Ala Val 
    50                  55                  60                  


Leu Ile Thr Gly Ile Thr Pro Gln Glu Ala Arg Ala Lys Gly Glu Asn 
65                  70                  75                  80  


Glu Ala Ala Phe Ala Ala Arg Ile His Ser Leu Phe Thr Val Pro Lys 
                85                  90                  95      


Thr Cys Ile Leu Gly Tyr Asn Asn Val Arg Phe Asp Asp Glu Val Thr 
            100                 105                 110         


Arg Asn Ile Phe Tyr Arg Asn Phe Tyr Asp Pro Tyr Ala Trp Ser Trp 
        115                 120                 125             


Gln His Asp Asn Ser Arg Trp Asp Leu Leu Asp Val Met Arg Ala Cys 
    130                 135                 140                 


Tyr Ala Leu Arg Pro Glu Gly Ile Asn Trp Pro Glu Asn Asp Asp Gly 
145                 150                 155                 160 


Leu Pro Ser Phe Arg Leu Glu His Leu Thr Lys Ala Asn Gly Ile Glu 
                165                 170                 175     


His Ser Asn Ala His Asp Ala Met Ala Asp Val Tyr Ala Thr Ile Ala 
            180                 185                 190         


Met Ala Lys Leu Val Lys Thr Arg Gln Pro Arg Leu Phe Asp Tyr Leu 
        195                 200                 205             


Phe Thr His Arg Asn Lys His Lys Leu Met Ala Leu Ile Asp Val Pro 
    210                 215                 220                 


Gln Met Lys Pro Leu Val His Val Ser Gly Met Phe Gly Ala Trp Arg 
225                 230                 235                 240 


Gly Asn Thr Ser Trp Val Ala Pro Leu Ala Trp His Pro Glu Asn Arg 
                245                 250                 255     


Asn Ala Val Ile Met Val Asp Leu Ala Gly Asp Ile Ser Pro Leu Leu 
            260                 265                 270         


Glu Leu Asp Ser Asp Thr Leu Arg Glu Arg Leu Tyr Thr Ala Lys Thr 
        275                 280                 285             


Asp Leu Gly Asp Asn Ala Ala Val Pro Val Lys Leu Val His Ile Asn 
    290                 295                 300                 


Lys Cys Pro Val Leu Ala Gln Ala Asn Thr Leu Arg Pro Glu Asp Ala 
305                 310                 315                 320 


Asp Arg Leu Gly Ile Asn Arg Gln His Cys Leu Asp Asn Leu Lys Ile 
                325                 330                 335     


Leu Arg Glu Asn Pro Gln Val Arg Glu Lys Val Val Ala Ile Phe Ala 
            340                 345                 350         


Glu Ala Glu Pro Phe Thr Pro Ser Asp Asn Val Asp Ala Gln Leu Tyr 
        355                 360                 365             


Asn Gly Phe Phe Ser Asp Ala Asp Arg Ala Ala Met Lys Ile Val Leu 
    370                 375                 380                 


Glu Thr Glu Pro Arg Asn Leu Pro Ala Leu Asp Ile Thr Phe Val Asp 
385                 390                 395                 400 


Lys Arg Ile Glu Lys Leu Leu Phe Asn Tyr Arg Ala Arg Asn Phe Pro 
                405                 410                 415     


Gly Thr Leu Asp Tyr Ala Glu Gln Gln Arg Trp Leu Glu His Arg Arg 
            420                 425                 430         


Gln Val Phe Thr Pro Glu Phe Leu Gln Gly Tyr Ala Asp Glu Leu Gln 
        435                 440                 445             


Met Leu Val Gln Gln Tyr Ala Asp Asp Lys Glu Lys Val Ala Leu Leu 
    450                 455                 460                 


Lys Ala Leu Trp Gln Tyr Ala Glu Glu Ile Val Ser Gly Ser Gly His 
465                 470                 475                 480 


His His His His His 
                485 


<210>  7
<211>  804
<212>  DNA
<213>  Escherichia coli

<400>  7
atgaaatttg tctcttttaa tatcaacggc ctgcgcgcca gacctcacca gcttgaagcc       60

atcgtcgaaa agcaccaacc ggatgtgatt ggcctgcagg agacaaaagt tcatgacgat      120

atgtttccgc tcgaagaggt ggcgaagctc ggctacaacg tgttttatca cgggcagaaa      180

ggccattatg gcgtggcgct gctgaccaaa gagacgccga ttgccgtgcg tcgcggcttt      240

cccggtgacg acgaagaggc gcagcggcgg attattatgg cggaaatccc ctcactgctg      300

ggtaatgtca ccgtgatcaa cggttacttc ccgcagggtg aaagccgcga ccatccgata      360

aaattcccgg caaaagcgca gttttatcag aatctgcaaa actacctgga aaccgaactc      420

aaacgtgata atccggtact gattatgggc gatatgaata tcagccctac agatctggat      480

atcggcattg gcgaagaaaa ccgtaagcgc tggctgcgta ccggtaaatg ctctttcctg      540

ccggaagagc gcgaatggat ggacaggctg atgagctggg ggttggtcga taccttccgc      600

catgcgaatc cgcaaacagc agatcgtttc tcatggtttg attaccgctc aaaaggtttt      660

gacgataacc gtggtctgcg catcgacctg ctgctcgcca gccaaccgct ggcagaatgt      720

tgcgtagaaa ccggcatcga ctatgaaatc cgcagcatgg aaaaaccgtc cgatcacgcc      780

cccgtctggg cgaccttccg ccgc                                             804


<210>  8
<211>  268
<212>  PRT
<213>  Escherichia coli

<400>  8

Met Lys Phe Val Ser Phe Asn Ile Asn Gly Leu Arg Ala Arg Pro His 
1               5                   10                  15      


Gln Leu Glu Ala Ile Val Glu Lys His Gln Pro Asp Val Ile Gly Leu 
            20                  25                  30          


Gln Glu Thr Lys Val His Asp Asp Met Phe Pro Leu Glu Glu Val Ala 
        35                  40                  45              


Lys Leu Gly Tyr Asn Val Phe Tyr His Gly Gln Lys Gly His Tyr Gly 
    50                  55                  60                  


Val Ala Leu Leu Thr Lys Glu Thr Pro Ile Ala Val Arg Arg Gly Phe 
65                  70                  75                  80  


Pro Gly Asp Asp Glu Glu Ala Gln Arg Arg Ile Ile Met Ala Glu Ile 
                85                  90                  95      


Pro Ser Leu Leu Gly Asn Val Thr Val Ile Asn Gly Tyr Phe Pro Gln 
            100                 105                 110         


Gly Glu Ser Arg Asp His Pro Ile Lys Phe Pro Ala Lys Ala Gln Phe 
        115                 120                 125             


Tyr Gln Asn Leu Gln Asn Tyr Leu Glu Thr Glu Leu Lys Arg Asp Asn 
    130                 135                 140                 


Pro Val Leu Ile Met Gly Asp Met Asn Ile Ser Pro Thr Asp Leu Asp 
145                 150                 155                 160 


Ile Gly Ile Gly Glu Glu Asn Arg Lys Arg Trp Leu Arg Thr Gly Lys 
                165                 170                 175     


Cys Ser Phe Leu Pro Glu Glu Arg Glu Trp Met Asp Arg Leu Met Ser 
            180                 185                 190         


Trp Gly Leu Val Asp Thr Phe Arg His Ala Asn Pro Gln Thr Ala Asp 
        195                 200                 205             


Arg Phe Ser Trp Phe Asp Tyr Arg Ser Lys Gly Phe Asp Asp Asn Arg 
    210                 215                 220                 


Gly Leu Arg Ile Asp Leu Leu Leu Ala Ser Gln Pro Leu Ala Glu Cys 
225                 230                 235                 240 


Cys Val Glu Thr Gly Ile Asp Tyr Glu Ile Arg Ser Met Glu Lys Pro 
                245                 250                 255     


Ser Asp His Ala Pro Val Trp Ala Thr Phe Arg Arg 
            260                 265             


<210>  9
<211>  1275
<212>  DNA
<213>  Thermus thermophilus

<400>  9
atgtttcgtc gtaaagaaga tctggatccg ccgctggcac tgctgccgct gaaaggcctg       60

cgcgaagccg ccgcactgct ggaagaagcg ctgcgtcaag gtaaacgcat tcgtgttcac      120

ggcgactatg atgcggatgg cctgaccggc accgcgatcc tggttcgtgg tctggccgcc      180

ctgggtgcgg atgttcatcc gtttatcccg caccgcctgg aagaaggcta tggtgtcctg      240

atggaacgcg tcccggaaca tctggaagcc tcggacctgt ttctgaccgt tgactgcggc      300

attaccaacc atgcggaact gcgcgaactg ctggaaaatg gcgtggaagt cattgttacc      360

gatcatcata cgccgggcaa aacgccgccg ccgggtctgg tcgtgcatcc ggcgctgacg      420

ccggatctga aagaaaaacc gaccggcgca ggcgtggcgt ttctgctgct gtgggcactg      480

catgaacgcc tgggcctgcc gccgccgctg gaatacgcgg acctggcagc cgttggcacc      540

attgccgacg ttgccccgct gtggggttgg aatcgtgcac tggtgaaaga aggtctggca      600

cgcatcccgg cttcatcttg ggtgggcctg cgtctgctgg ctgaagccgt gggctatacc      660

ggcaaagcgg tcgaagtcgc tttccgcatc gcgccgcgca tcaatgcggc ttcccgcctg      720

ggcgaagcgg aaaaagccct gcgcctgctg ctgacggatg atgcggcaga agctcaggcg      780

ctggtcggcg aactgcaccg tctgaacgcc cgtcgtcaga ccctggaaga agcgatgctg      840

cgcaaactgc tgccgcaggc cgacccggaa gcgaaagcca tcgttctgct ggacccggaa      900

ggccatccgg gtgttatggg tattgtggcc tctcgcatcc tggaagcgac cctgcgcccg      960

gtctttctgg tggcccaggg caaaggcacc gtgcgttcgc tggctccgat ttccgccgtc     1020

gaagcactgc gcagcgcgga agatctgctg ctgcgttatg gtggtcataa agaagcggcg     1080

ggtttcgcaa tggatgaagc gctgtttccg gcgttcaaag cacgcgttga agcgtatgcc     1140

gcacgtttcc cggatccggt tcgtgaagtg gcactgctgg atctgctgcc ggaaccgggc     1200

ctgctgccgc aggtgttccg tgaactggca ctgctggaac cgtatggtga aggtaacccg     1260

gaaccgctgt tcctg                                                      1275


<210>  10
<211>  425
<212>  PRT
<213>  Thermus thermophilus

<400>  10

Met Phe Arg Arg Lys Glu Asp Leu Asp Pro Pro Leu Ala Leu Leu Pro 
1               5                   10                  15      


Leu Lys Gly Leu Arg Glu Ala Ala Ala Leu Leu Glu Glu Ala Leu Arg 
            20                  25                  30          


Gln Gly Lys Arg Ile Arg Val His Gly Asp Tyr Asp Ala Asp Gly Leu 
        35                  40                  45              


Thr Gly Thr Ala Ile Leu Val Arg Gly Leu Ala Ala Leu Gly Ala Asp 
    50                  55                  60                  


Val His Pro Phe Ile Pro His Arg Leu Glu Glu Gly Tyr Gly Val Leu 
65                  70                  75                  80  


Met Glu Arg Val Pro Glu His Leu Glu Ala Ser Asp Leu Phe Leu Thr 
                85                  90                  95      


Val Asp Cys Gly Ile Thr Asn His Ala Glu Leu Arg Glu Leu Leu Glu 
            100                 105                 110         


Asn Gly Val Glu Val Ile Val Thr Asp His His Thr Pro Gly Lys Thr 
        115                 120                 125             


Pro Pro Pro Gly Leu Val Val His Pro Ala Leu Thr Pro Asp Leu Lys 
    130                 135                 140                 


Glu Lys Pro Thr Gly Ala Gly Val Ala Phe Leu Leu Leu Trp Ala Leu 
145                 150                 155                 160 


His Glu Arg Leu Gly Leu Pro Pro Pro Leu Glu Tyr Ala Asp Leu Ala 
                165                 170                 175     


Ala Val Gly Thr Ile Ala Asp Val Ala Pro Leu Trp Gly Trp Asn Arg 
            180                 185                 190         


Ala Leu Val Lys Glu Gly Leu Ala Arg Ile Pro Ala Ser Ser Trp Val 
        195                 200                 205             


Gly Leu Arg Leu Leu Ala Glu Ala Val Gly Tyr Thr Gly Lys Ala Val 
    210                 215                 220                 


Glu Val Ala Phe Arg Ile Ala Pro Arg Ile Asn Ala Ala Ser Arg Leu 
225                 230                 235                 240 


Gly Glu Ala Glu Lys Ala Leu Arg Leu Leu Leu Thr Asp Asp Ala Ala 
                245                 250                 255     


Glu Ala Gln Ala Leu Val Gly Glu Leu His Arg Leu Asn Ala Arg Arg 
            260                 265                 270         


Gln Thr Leu Glu Glu Ala Met Leu Arg Lys Leu Leu Pro Gln Ala Asp 
        275                 280                 285             


Pro Glu Ala Lys Ala Ile Val Leu Leu Asp Pro Glu Gly His Pro Gly 
    290                 295                 300                 


Val Met Gly Ile Val Ala Ser Arg Ile Leu Glu Ala Thr Leu Arg Pro 
305                 310                 315                 320 


Val Phe Leu Val Ala Gln Gly Lys Gly Thr Val Arg Ser Leu Ala Pro 
                325                 330                 335     


Ile Ser Ala Val Glu Ala Leu Arg Ser Ala Glu Asp Leu Leu Leu Arg 
            340                 345                 350         


Tyr Gly Gly His Lys Glu Ala Ala Gly Phe Ala Met Asp Glu Ala Leu 
        355                 360                 365             


Phe Pro Ala Phe Lys Ala Arg Val Glu Ala Tyr Ala Ala Arg Phe Pro 
    370                 375                 380                 


Asp Pro Val Arg Glu Val Ala Leu Leu Asp Leu Leu Pro Glu Pro Gly 
385                 390                 395                 400 


Leu Leu Pro Gln Val Phe Arg Glu Leu Ala Leu Leu Glu Pro Tyr Gly 
                405                 410                 415     


Glu Gly Asn Pro Glu Pro Leu Phe Leu 
            420                 425 


<210>  11
<211>  738
<212>  DNA
<213>  Bacteriophage lambda

<400>  11
tccggaagcg gctctggtag tggttctggc atgacaccgg acattatcct gcagcgtacc       60

gggatcgatg tgagagctgt cgaacagggg gatgatgcgt ggcacaaatt acggctcggc      120

gtcatcaccg cttcagaagt tcacaacgtg atagcaaaac cccgctccgg aaagaagtgg      180

cctgacatga aaatgtccta cttccacacc ctgcttgctg aggtttgcac cggtgtggct      240

ccggaagtta acgctaaagc actggcctgg ggaaaacagt acgagaacga cgccagaacc      300

ctgtttgaat tcacttccgg cgtgaatgtt actgaatccc cgatcatcta tcgcgacgaa      360

agtatgcgta ccgcctgctc tcccgatggt ttatgcagtg acggcaacgg ccttgaactg      420

aaatgcccgt ttacctcccg ggatttcatg aagttccggc tcggtggttt cgaggccata      480

aagtcagctt acatggccca ggtgcagtac agcatgtggg tgacgcgaaa aaatgcctgg      540

tactttgcca actatgaccc gcgtatgaag cgtgaaggcc tgcattatgt cgtgattgag      600

cgggatgaaa agtacatggc gagttttgac gagatcgtgc cggagttcat cgaaaaaatg      660

gacgaggcac tggctgaaat tggttttgta tttggggagc aatggcgatc tggctctggt      720

tccggcagcg gttccgga                                                    738


<210>  12
<211>  226
<212>  PRT
<213>  Bacteriophage lambda

<400>  12

Met Thr Pro Asp Ile Ile Leu Gln Arg Thr Gly Ile Asp Val Arg Ala 
1               5                   10                  15      


Val Glu Gln Gly Asp Asp Ala Trp His Lys Leu Arg Leu Gly Val Ile 
            20                  25                  30          


Thr Ala Ser Glu Val His Asn Val Ile Ala Lys Pro Arg Ser Gly Lys 
        35                  40                  45              


Lys Trp Pro Asp Met Lys Met Ser Tyr Phe His Thr Leu Leu Ala Glu 
    50                  55                  60                  


Val Cys Thr Gly Val Ala Pro Glu Val Asn Ala Lys Ala Leu Ala Trp 
65                  70                  75                  80  


Gly Lys Gln Tyr Glu Asn Asp Ala Arg Thr Leu Phe Glu Phe Thr Ser 
                85                  90                  95      


Gly Val Asn Val Thr Glu Ser Pro Ile Ile Tyr Arg Asp Glu Ser Met 
            100                 105                 110         


Arg Thr Ala Cys Ser Pro Asp Gly Leu Cys Ser Asp Gly Asn Gly Leu 
        115                 120                 125             


Glu Leu Lys Cys Pro Phe Thr Ser Arg Asp Phe Met Lys Phe Arg Leu 
    130                 135                 140                 


Gly Gly Phe Glu Ala Ile Lys Ser Ala Tyr Met Ala Gln Val Gln Tyr 
145                 150                 155                 160 


Ser Met Trp Val Thr Arg Lys Asn Ala Trp Tyr Phe Ala Asn Tyr Asp 
                165                 170                 175     


Pro Arg Met Lys Arg Glu Gly Leu His Tyr Val Val Ile Glu Arg Asp 
            180                 185                 190         


Glu Lys Tyr Met Ala Ser Phe Asp Glu Ile Val Pro Glu Phe Ile Glu 
        195                 200                 205             


Lys Met Asp Glu Ala Leu Ala Glu Ile Gly Phe Val Phe Gly Glu Gln 
    210                 215                 220                 


Trp Arg 
225     


<210>  13
<211>  760
<212>  PRT
<213>  Methanococcoides burtonii

<400>  13

Met Met Ile Arg Glu Leu Asp Ile Pro Arg Asp Ile Ile Gly Phe Tyr 
1               5                   10                  15      


Glu Asp Ser Gly Ile Lys Glu Leu Tyr Pro Pro Gln Ala Glu Ala Ile 
            20                  25                  30          


Glu Met Gly Leu Leu Glu Lys Lys Asn Leu Leu Ala Ala Ile Pro Thr 
        35                  40                  45              


Ala Ser Gly Lys Thr Leu Leu Ala Glu Leu Ala Met Ile Lys Ala Ile 
    50                  55                  60                  


Arg Glu Gly Gly Lys Ala Leu Tyr Ile Val Pro Leu Arg Ala Leu Ala 
65                  70                  75                  80  


Ser Glu Lys Phe Glu Arg Phe Lys Glu Leu Ala Pro Phe Gly Ile Lys 
                85                  90                  95      


Val Gly Ile Ser Thr Gly Asp Leu Asp Ser Arg Ala Asp Trp Leu Gly 
            100                 105                 110         


Val Asn Asp Ile Ile Val Ala Thr Ser Glu Lys Thr Asp Ser Leu Leu 
        115                 120                 125             


Arg Asn Gly Thr Ser Trp Met Asp Glu Ile Thr Thr Val Val Val Asp 
    130                 135                 140                 


Glu Ile His Leu Leu Asp Ser Lys Asn Arg Gly Pro Thr Leu Glu Val 
145                 150                 155                 160 


Thr Ile Thr Lys Leu Met Arg Leu Asn Pro Asp Val Gln Val Val Ala 
                165                 170                 175     


Leu Ser Ala Thr Val Gly Asn Ala Arg Glu Met Ala Asp Trp Leu Gly 
            180                 185                 190         


Ala Ala Leu Val Leu Ser Glu Trp Arg Pro Thr Asp Leu His Glu Gly 
        195                 200                 205             


Val Leu Phe Gly Asp Ala Ile Asn Phe Pro Gly Ser Gln Lys Lys Ile 
    210                 215                 220                 


Asp Arg Leu Glu Lys Asp Asp Ala Val Asn Leu Val Leu Asp Thr Ile 
225                 230                 235                 240 


Lys Ala Glu Gly Gln Cys Leu Val Phe Glu Ser Ser Arg Arg Asn Cys 
                245                 250                 255     


Ala Gly Phe Ala Lys Thr Ala Ser Ser Lys Val Ala Lys Ile Leu Asp 
            260                 265                 270         


Asn Asp Ile Met Ile Lys Leu Ala Gly Ile Ala Glu Glu Val Glu Ser 
        275                 280                 285             


Thr Gly Glu Thr Asp Thr Ala Ile Val Leu Ala Asn Cys Ile Arg Lys 
    290                 295                 300                 


Gly Val Ala Phe His His Ala Gly Leu Asn Ser Asn His Arg Lys Leu 
305                 310                 315                 320 


Val Glu Asn Gly Phe Arg Gln Asn Leu Ile Lys Val Ile Ser Ser Thr 
                325                 330                 335     


Pro Thr Leu Ala Ala Gly Leu Asn Leu Pro Ala Arg Arg Val Ile Ile 
            340                 345                 350         


Arg Ser Tyr Arg Arg Phe Asp Ser Asn Phe Gly Met Gln Pro Ile Pro 
        355                 360                 365             


Val Leu Glu Tyr Lys Gln Met Ala Gly Arg Ala Gly Arg Pro His Leu 
    370                 375                 380                 


Asp Pro Tyr Gly Glu Ser Val Leu Leu Ala Lys Thr Tyr Asp Glu Phe 
385                 390                 395                 400 


Ala Gln Leu Met Glu Asn Tyr Val Glu Ala Asp Ala Glu Asp Ile Trp 
                405                 410                 415     


Ser Lys Leu Gly Thr Glu Asn Ala Leu Arg Thr His Val Leu Ser Thr 
            420                 425                 430         


Ile Val Asn Gly Phe Ala Ser Thr Arg Gln Glu Leu Phe Asp Phe Phe 
        435                 440                 445             


Gly Ala Thr Phe Phe Ala Tyr Gln Gln Asp Lys Trp Met Leu Glu Glu 
    450                 455                 460                 


Val Ile Asn Asp Cys Leu Glu Phe Leu Ile Asp Lys Ala Met Val Ser 
465                 470                 475                 480 


Glu Thr Glu Asp Ile Glu Asp Ala Ser Lys Leu Phe Leu Arg Gly Thr 
                485                 490                 495     


Arg Leu Gly Ser Leu Val Ser Met Leu Tyr Ile Asp Pro Leu Ser Gly 
            500                 505                 510         


Ser Lys Ile Val Asp Gly Phe Lys Asp Ile Gly Lys Ser Thr Gly Gly 
        515                 520                 525             


Asn Met Gly Ser Leu Glu Asp Asp Lys Gly Asp Asp Ile Thr Val Thr 
    530                 535                 540                 


Asp Met Thr Leu Leu His Leu Val Cys Ser Thr Pro Asp Met Arg Gln 
545                 550                 555                 560 


Leu Tyr Leu Arg Asn Thr Asp Tyr Thr Ile Val Asn Glu Tyr Ile Val 
                565                 570                 575     


Ala His Ser Asp Glu Phe His Glu Ile Pro Asp Lys Leu Lys Glu Thr 
            580                 585                 590         


Asp Tyr Glu Trp Phe Met Gly Glu Val Lys Thr Ala Met Leu Leu Glu 
        595                 600                 605             


Glu Trp Val Thr Glu Val Ser Ala Glu Asp Ile Thr Arg His Phe Asn 
    610                 615                 620                 


Val Gly Glu Gly Asp Ile His Ala Leu Ala Asp Thr Ser Glu Trp Leu 
625                 630                 635                 640 


Met His Ala Ala Ala Lys Leu Ala Glu Leu Leu Gly Val Glu Tyr Ser 
                645                 650                 655     


Ser His Ala Tyr Ser Leu Glu Lys Arg Ile Arg Tyr Gly Ser Gly Leu 
            660                 665                 670         


Asp Leu Met Glu Leu Val Gly Ile Arg Gly Val Gly Arg Val Arg Ala 
        675                 680                 685             


Arg Lys Leu Tyr Asn Ala Gly Phe Val Ser Val Ala Lys Leu Lys Gly 
    690                 695                 700                 


Ala Asp Ile Ser Val Leu Ser Lys Leu Val Gly Pro Lys Val Ala Tyr 
705                 710                 715                 720 


Asn Ile Leu Ser Gly Ile Gly Val Arg Val Asn Asp Lys His Phe Asn 
                725                 730                 735     


Ser Ala Pro Ile Ser Ser Asn Thr Leu Asp Thr Leu Leu Asp Lys Asn 
            740                 745                 750         


Gln Lys Thr Phe Asn Asp Phe Gln 
        755                 760 


<210>  14
<211>  300
<212>  PRT
<213>  Eisenia fetida

<400>  14

Met Ser Ser Ser Thr Val Met Ala Asp Gly Phe Glu Glu Ile Glu Val 
1               5                   10                  15      


Asp Val Val Ser Val Trp Lys Glu Gly Tyr Ala Tyr Glu Asn Arg Gly 
            20                  25                  30          


Asn Ser Ser Val Gln Gln Lys Ile Thr Met Thr Lys Gly Met Lys Asn 
        35                  40                  45              


Leu Asn Ser Glu Thr Lys Thr Leu Thr Ala Thr His Thr Leu Gly Arg 
    50                  55                  60                  


Thr Leu Lys Val Gly Asp Pro Phe Glu Ile Ala Ser Val Glu Val Ser 
65                  70                  75                  80  


Tyr Thr Phe Ser His Gln Lys Ser Gln Val Ser Met Thr Gln Thr Glu 
                85                  90                  95      


Val Tyr Ser Ser Gln Val Ile Glu His Thr Val Thr Ile Pro Pro Asn 
            100                 105                 110         


Lys Lys Phe Thr Arg Trp Lys Leu Asn Ala Asp Val Gly Gly Thr Gly 
        115                 120                 125             


Ile Glu Tyr Met Tyr Leu Ile Asp Glu Val Thr Ala Ile Gly Ala Asp 
    130                 135                 140                 


Leu Thr Ile Pro Glu Val Asn Lys Ser Arg Ala Lys Ile Leu Val Gly 
145                 150                 155                 160 


Arg Gln Ile His Leu Gly Glu Thr Glu Ile Arg Ile Lys His Ala Glu 
                165                 170                 175     


Arg Lys Glu Tyr Met Thr Val Ile Ser Arg Lys Ser Trp Pro Ala Ala 
            180                 185                 190         


Thr Leu Gly Asn Ser Asn Leu Phe Lys Phe Val Leu Phe Glu Asp Ser 
        195                 200                 205             


Ser Gly Ile Arg Ile Lys Thr Leu Asn Thr Met Tyr Pro Gly Tyr Glu 
    210                 215                 220                 


Trp Ala Tyr Ser Ser Asp Gln Gly Gly Ile Tyr Phe Asp Glu Ser Ser 
225                 230                 235                 240 


Asp Asn Pro Lys Gln Arg Trp Ala Leu Ser Lys Ala Met Pro Leu Arg 
                245                 250                 255     


His Gly Asp Val Val Thr Phe Arg Asn Asn Phe Phe Thr Asn Ser Gly 
            260                 265                 270         


Met Cys Tyr Asp Asp Gly Pro Ala Thr Asn Val Tyr Cys Leu Glu Lys 
        275                 280                 285             


Arg Glu Asp Lys Trp Ile Leu Glu Val Val Asn Thr 
    290                 295                 300 


<210>  15
<211>  300
<212>  PRT
<213>  Eisenia fetida

<400>  15

Met Ser Ser Arg Ala Gly Ile Ala Glu Gly Tyr Glu Gln Ile Glu Val 
1               5                   10                  15      


Asp Val Val Ala Val Trp Lys Glu Gly Tyr Val Tyr Glu Asn Arg Gly 
            20                  25                  30          


Ser Thr Ser Val Glu Gln Lys Ile Lys Ile Thr Lys Gly Met Arg Asn 
        35                  40                  45              


Leu Asn Ser Glu Thr Lys Thr Leu Thr Ala Ser His Ser Ile Gly Ser 
    50                  55                  60                  


Thr Ile Ser Thr Gly Asp Leu Phe Glu Ile Ala Thr Val Asp Val Ser 
65                  70                  75                  80  


Tyr Ser Tyr Ser His Glu Glu Ser Gln Val Ser Met Thr Glu Thr Glu 
                85                  90                  95      


Val Tyr Glu Ser Lys Glu Ile Glu His Thr Ile Thr Ile Pro Pro Thr 
            100                 105                 110         


Ser Lys Phe Thr Arg Trp Gln Leu Asn Ala Asp Val Gly Gly Ala Asp 
        115                 120                 125             


Ile Glu Tyr Met Tyr Leu Ile Asp Glu Val Thr Pro Ile Gly Gly Thr 
    130                 135                 140                 


Leu Ser Ile Pro Gln Val Ile Lys Ser Arg Ala Lys Ile Leu Val Gly 
145                 150                 155                 160 


Arg Glu Ile Tyr Leu Gly Glu Thr Glu Ile Arg Ile Lys His Ala Asp 
                165                 170                 175     


Arg Lys Glu Tyr Met Thr Val Val Ser Arg Lys Ser Trp Pro Ala Ala 
            180                 185                 190         


Thr Leu Gly His Ser Lys Leu Tyr Lys Phe Val Leu Tyr Glu Asp Met 
        195                 200                 205             


Tyr Gly Phe Arg Ile Lys Thr Leu Asn Thr Met Tyr Ser Gly Tyr Glu 
    210                 215                 220                 


Tyr Ala Tyr Ser Ser Asp Gln Gly Gly Ile Tyr Phe Asp Gln Gly Ser 
225                 230                 235                 240 


Asp Asn Pro Lys Gln Arg Trp Ala Ile Asn Lys Ser Leu Pro Leu Arg 
                245                 250                 255     


His Gly Asp Val Val Thr Phe Met Asn Lys Tyr Phe Thr Arg Ser Gly 
            260                 265                 270         


Leu Cys Tyr Tyr Asp Gly Pro Ala Thr Asp Val Tyr Cys Leu Asp Lys 
        275                 280                 285             


Arg Glu Asp Lys Trp Ile Leu Glu Val Val Lys Pro 
    290                 295                 300 


<210>  16
<211>  300
<212>  PRT
<213>  Eisenia fetida

<400>  16

Met Ser Ala Thr Ala Val Thr Ala Asp Gly Leu Glu Glu Ile Glu Val 
1               5                   10                  15      


Asp Val Val Ala Val Trp Lys Glu Gly Tyr Val Tyr Glu Asn Arg Gly 
            20                  25                  30          


Asp Thr Ser Val Glu Gln Lys Ile Thr Met Thr Lys Gly Met Lys Asn 
        35                  40                  45              


Leu Asn Ser Glu Thr Lys Thr Leu Thr Ala Thr His Thr Val Gly Arg 
    50                  55                  60                  


Thr Leu Lys Val Gly Asp Pro Phe Glu Ile Gly Ser Val Glu Val Ser 
65                  70                  75                  80  


Tyr Ser Phe Ser His Gln Glu Ser Gln Val Ser Met Thr Gln Thr Glu 
                85                  90                  95      


Val Tyr Ser Ser Gln Val Ile Glu His Thr Val Thr Ile Pro Pro Thr 
            100                 105                 110         


Ser Lys Phe Thr Arg Trp Lys Leu Asn Ala Asp Val Gly Gly Thr Asp 
        115                 120                 125             


Ile Glu Tyr Met Tyr Leu Ile Asp Glu Val Thr Pro Ile Ser Val Thr 
    130                 135                 140                 


Gln Thr Ile Pro Gln Val Ile Arg Ser Arg Ala Lys Ile Leu Val Gly 
145                 150                 155                 160 


Arg Gln Ile His Leu Gly Thr Thr Ala Val Arg Ile Lys His Ala Glu 
                165                 170                 175     


Arg Gln Glu Tyr Met Thr Val Ile Glu Arg Lys Lys Trp Pro Ala Ala 
            180                 185                 190         


Thr Leu Gly Lys Ser Asn Leu Phe Lys Phe Val Leu Phe Glu Asp Ser 
        195                 200                 205             


Ser Gly Thr Arg Ile Lys Thr Leu Asn Thr Met Tyr Pro Gly Tyr Glu 
    210                 215                 220                 


Trp Ala Tyr Ser Ser Asp Gln Gly Gly Val Tyr Phe Asp Glu Ser Ser 
225                 230                 235                 240 


Asp Asn Pro Lys Gln Arg Trp Ala Leu Ser Lys Ala Leu Pro Leu Arg 
                245                 250                 255     


His Gly Asp Val Val Thr Phe Met Asn Lys Tyr Phe Thr Asn Ser Gly 
            260                 265                 270         


Leu Cys Tyr Asp Asp Gly Pro Ala Thr Asn Val Tyr Cys Leu Asp Lys 
        275                 280                 285             


Arg Glu Asp Lys Trp Ile Leu Glu Val Val Asn Pro 
    290                 295                 300 


<210>  17
<211>  252
<212>  PRT
<213>  Bacillus thuringiensis

<400>  17

Met Asp Val Ile Arg Glu Tyr Leu Met Phe Asn Glu Leu Ser Ala Leu 
1               5                   10                  15      


Ser Ser Ser Pro Glu Ser Val Arg Ser Arg Phe Ser Ser Ile Tyr Gly 
            20                  25                  30          


Thr Asn Pro Asp Gly Ile Ala Leu Asn Asn Glu Thr Tyr Phe Asn Ala 
        35                  40                  45              


Val Lys Pro Pro Ile Thr Ala Gln Tyr Gly Tyr Tyr Cys Tyr Lys Asn 
    50                  55                  60                  


Val Gly Thr Val Gln Tyr Val Asn Arg Pro Thr Asp Ile Asn Pro Asn 
65                  70                  75                  80  


Val Ile Leu Ala Gln Asp Thr Leu Thr Asn Asn Thr Asn Glu Pro Phe 
                85                  90                  95      


Thr Thr Thr Ile Thr Ile Thr Gly Ser Phe Thr Asn Thr Ser Thr Val 
            100                 105                 110         


Thr Ser Ser Thr Thr Thr Gly Phe Lys Phe Thr Ser Lys Leu Ser Ile 
        115                 120                 125             


Lys Lys Val Phe Glu Ile Gly Gly Glu Val Ser Phe Ser Thr Thr Ile 
    130                 135                 140                 


Gly Thr Ser Glu Thr Thr Thr Glu Thr Ile Thr Val Ser Lys Ser Val 
145                 150                 155                 160 


Thr Val Thr Val Pro Ala Gln Ser Arg Arg Thr Ile Gln Leu Thr Ala 
                165                 170                 175     


Lys Ile Ala Lys Glu Ser Ala Asp Phe Ser Ala Pro Ile Thr Val Asp 
            180                 185                 190         


Gly Tyr Phe Gly Ala Asn Phe Pro Lys Arg Val Gly Pro Gly Gly His 
        195                 200                 205             


Tyr Phe Trp Phe Asn Pro Ala Arg Asp Val Leu Asn Thr Thr Ser Gly 
    210                 215                 220                 


Thr Leu Arg Gly Thr Val Thr Asn Val Ser Ser Phe Asp Phe Gln Thr 
225                 230                 235                 240 


Ile Val Gln Pro Ala Arg Ser Leu Leu Asp Glu Gln 
                245                 250         


<210>  18
<211>  439
<212>  PRT
<213>  Enterobacteria phage T4

<400>  18

Met Thr Phe Asp Asp Leu Thr Glu Gly Gln Lys Asn Ala Phe Asn Ile 
1               5                   10                  15      


Val Met Lys Ala Ile Lys Glu Lys Lys His His Val Thr Ile Asn Gly 
            20                  25                  30          


Pro Ala Gly Thr Gly Lys Thr Thr Leu Thr Lys Phe Ile Ile Glu Ala 
        35                  40                  45              


Leu Ile Ser Thr Gly Glu Thr Gly Ile Ile Leu Ala Ala Pro Thr His 
    50                  55                  60                  


Ala Ala Lys Lys Ile Leu Ser Lys Leu Ser Gly Lys Glu Ala Ser Thr 
65                  70                  75                  80  


Ile His Ser Ile Leu Lys Ile Asn Pro Val Thr Tyr Glu Glu Asn Val 
                85                  90                  95      


Leu Phe Glu Gln Lys Glu Val Pro Asp Leu Ala Lys Cys Arg Val Leu 
            100                 105                 110         


Ile Cys Asp Glu Val Ser Met Tyr Asp Arg Lys Leu Phe Lys Ile Leu 
        115                 120                 125             


Leu Ser Thr Ile Pro Pro Trp Cys Thr Ile Ile Gly Ile Gly Asp Asn 
    130                 135                 140                 


Lys Gln Ile Arg Pro Val Asp Pro Gly Glu Asn Thr Ala Tyr Ile Ser 
145                 150                 155                 160 


Pro Phe Phe Thr His Lys Asp Phe Tyr Gln Cys Glu Leu Thr Glu Val 
                165                 170                 175     


Lys Arg Ser Asn Ala Pro Ile Ile Asp Val Ala Thr Asp Val Arg Asn 
            180                 185                 190         


Gly Lys Trp Ile Tyr Asp Lys Val Val Asp Gly His Gly Val Arg Gly 
        195                 200                 205             


Phe Thr Gly Asp Thr Ala Leu Arg Asp Phe Met Val Asn Tyr Phe Ser 
    210                 215                 220                 


Ile Val Lys Ser Leu Asp Asp Leu Phe Glu Asn Arg Val Met Ala Phe 
225                 230                 235                 240 


Thr Asn Lys Ser Val Asp Lys Leu Asn Ser Ile Ile Arg Lys Lys Ile 
                245                 250                 255     


Phe Glu Thr Asp Lys Asp Phe Ile Val Gly Glu Ile Ile Val Met Gln 
            260                 265                 270         


Glu Pro Leu Phe Lys Thr Tyr Lys Ile Asp Gly Lys Pro Val Ser Glu 
        275                 280                 285             


Ile Ile Phe Asn Asn Gly Gln Leu Val Arg Ile Ile Glu Ala Glu Tyr 
    290                 295                 300                 


Thr Ser Thr Phe Val Lys Ala Arg Gly Val Pro Gly Glu Tyr Leu Ile 
305                 310                 315                 320 


Arg His Trp Asp Leu Thr Val Glu Thr Tyr Gly Asp Asp Glu Tyr Tyr 
                325                 330                 335     


Arg Glu Lys Ile Lys Ile Ile Ser Ser Asp Glu Glu Leu Tyr Lys Phe 
            340                 345                 350         


Asn Leu Phe Leu Gly Lys Thr Ala Glu Thr Tyr Lys Asn Trp Asn Lys 
        355                 360                 365             


Gly Gly Lys Ala Pro Trp Ser Asp Phe Trp Asp Ala Lys Ser Gln Phe 
    370                 375                 380                 


Ser Lys Val Lys Ala Leu Pro Ala Ser Thr Phe His Lys Ala Gln Gly 
385                 390                 395                 400 


Met Ser Val Asp Arg Ala Phe Ile Tyr Thr Pro Cys Ile His Tyr Ala 
                405                 410                 415     


Asp Val Glu Leu Ala Gln Gln Leu Leu Tyr Val Gly Val Thr Arg Gly 
            420                 425                 430         


Arg Tyr Asp Val Phe Tyr Val 
        435                 


<210>  19
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Adaptor portion B of Fig 5

<400>  19
ggcgtctgct tgggtgttta accttttttt ttttt                                  35


<210>  20
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Adaptor portion D of Fig 5

<400>  20
ggttgtttct gttggtgctg atattgct                                          28


<210>  21
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Adaptor portion E of Fig 5

<400>  21
aacacccaag cagacgcctt                                                   20


<210>  22
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Adaptor portion F of Fig 5

<400>  22
gcaatatcag caccaacaga aacaaccttt gaggcgagcg gtcaa                       45


<210>  23
<211>  10178
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  10kb Lambda cDNA

<400>  23
caaagtccat gccatcaaac tgctggtttt cattgatgat gcgggaccag ccatcaacgc       60

ccaccaccgg aacgatgcca ttctgcttat caggaaaggc gtaaatttct ttcgtccacg      120

gattaaggcc gtactggttg gcaacgatca gtaatgcgat gaactgcgca tcgctggcat      180

cacctttaaa tgccgtctgg cgaagagtgg tgatcagttc ctgtgggtcg acagaatcca      240

tgccgacacg ttcagccagc ttcccagcca gcgttgcgag tgcagtactc attcgtttta      300

tacctctgaa tcaatatcaa cctggtggtg agcaatggtt tcaaccatgt accggatgtg      360

ttctgccatg cgctcctgaa actcaacatc gtcatcaaac gcacgggtaa tggatttttt      420

gctggccccg tggcgttgca aatgatcgat gcatagcgat tcaaacaggt gctggggcag      480

gcctttttcc atgtcgtctg ccagttctgc ctctttctct tcacgggcga gctgctggta      540

gtgacgcgcc cagctctgag cctcaagacg atcctgaatg taataagcgt tcatggctga      600

actcctgaaa tagctgtgaa aatatcgccc gcgaaatgcc gggctgatta ggaaaacagg      660

aaagggggtt agtgaatgct tttgcttgat ctcagtttca gtattaatat ccatttttta      720

taagcgtcga cggcttcacg aaacatcttt tcatcgccaa taaaagtggc gatagtgaat      780

ttagtctgga tagccataag tgtttgatcc attctttggg actcctggct gattaagtat      840

gtcgataagg cgtttccatc cgtcacgtaa tttacgggtg attcgttcaa gtaaagattc      900

ggaagggcag ccagcaacag gccaccctgc aatggcatat tgcatggtgt gctccttatt      960

tatacataac gaaaaacgcc tcgagtgaag cgttattggt atgcggtaaa accgcactca     1020

ggcggccttg atagtcatat catctgaatc aaatattcct gatgtatcga tatcggtaat     1080

tcttattcct tcgctaccat ccattggagg ccatccttcc tgaccatttc catcattcca     1140

gtcgaactca cacacaacac catatgcatt taagtcgctt gaaattgcta taagcagagc     1200

atgttgcgcc agcatgatta atacagcatt taatacagag ccgtgtttat tgagtcggta     1260

ttcagagtct gaccagaaat tattaatctg gtgaagtttt tcctctgtca ttacgtcatg     1320

gtcgatttca atttctattg atgctttcca gtcgtaatca atgatgtatt ttttgatgtt     1380

tgacatctgt tcatatcctc acagataaaa aatcgccctc acactggagg gcaaagaaga     1440

tttccaataa tcagaacaag tcggctcctg tttagttacg agcgacattg ctccgtgtat     1500

tcactcgttg gaatgaatac acagtgcagt gtttattctg ttatttatgc caaaaataaa     1560

ggccactatc aggcagcttt gttgttctgt ttaccaagtt ctctggcaat cattgccgtc     1620

gttcgtattg cccatttatc gacatatttc ccatcttcca ttacaggaaa catttcttca     1680

ggcttaacca tgcattccga ttgcagcttg catccattgc atcgcttgaa ttgtccacac     1740

cattgatttt tatcaatagt cgtagtcata cggatagtcc tggtattgtt ccatcacatc     1800

ctgaggatgc tcttcgaact cttcaaattc ttcttccata tatcacctta aatagtggat     1860

tgcggtagta aagattgtgc ctgtctttta accacatcag gctcggtggt tctcgtgtac     1920

ccctacagcg agaaatcgga taaactatta caacccctac agtttgatga gtatagaaat     1980

ggatccactc gttattctcg gacgagtgtt cagtaatgaa cctctggaga gaaccatgta     2040

tatgatcgtt atctgggttg gacttctgct tttaagccca gataactggc ctgaatatgt     2100

taatgagaga atcggtattc ctcatgtgtg gcatgttttc gtctttgctc ttgcattttc     2160

gctagcaatt aatgtgcatc gattatcagc tattgccagc gccagatata agcgatttaa     2220

gctaagaaaa cgcattaaga tgcaaaacga taaagtgcga tcagtaattc aaaaccttac     2280

agaagagcaa tctatggttt tgtgcgcagc ccttaatgaa ggcaggaagt atgtggttac     2340

atcaaaacaa ttcccataca ttagtgagtt gattgagctt ggtgtgttga acaaaacttt     2400

ttcccgatgg aatggaaagc atatattatt ccctattgag gatatttact ggactgaatt     2460

agttgccagc tatgatccat ataatattga gataaagcca aggccaatat ctaagtaact     2520

agataagagg aatcgatttt cccttaattt tctggcgtcc actgcatgtt atgccgcgtt     2580

cgccaggctt gctgtaccat gtgcgctgat tcttgcgctc aatacgttgc aggttgcttt     2640

caatctgttt gtggtattca gccagcactg taaggtctat cggatttagt gcgctttcta     2700

ctcgtgattt cggtttgcga ttcagcgaga gaatagggcg gttaactggt tttgcgctta     2760

ccccaaccaa caggggattt gctgctttcc attgagcctg tttctctgcg cgacgttcgc     2820

ggcggcgtgt ttgtgcatcc atctggattc tcctgtcagt tagctttggt ggtgtgtggc     2880

agttgtagtc ctgaacgaaa accccccgcg attggcacat tggcagctaa tccggaatcg     2940

cacttacggc caatgcttcg tttcgtatca cacaccccaa agccttctgc tttgaatgct     3000

gcccttcttc agggcttaat ttttaagagc gtcaccttca tggtggtcag tgcgtcctgc     3060

tgatgtgctc agtatcaccg ccagtggtat ttatgtcaac accgccagag ataatttatc     3120

accgcagatg gttatctgta tgttttttat atgaatttat tttttgcagg ggggcattgt     3180

ttggtaggtg agagatctga attgctatgt ttagtgagtt gtatctattt atttttcaat     3240

aaatacaatt ggttatgtgt tttgggggcg atcgtgaggc aaagaaaacc cggcgctgag     3300

gccgggttat tcttgttctc tggtcaaatt atatagttgg aaaacaagga tgcatatatg     3360

aatgaacgat gcagaggcaa tgccgatggc gatagtgggt atcatgtagc cgcttatgct     3420

ggaaagaagc aataacccgc agaaaaacaa agctccaagc tcaacaaaac taagggcata     3480

gacaataact accgatgtca tatacccata ctctctaatc ttggccagtc ggcgcgttct     3540

gcttccgatt agaaacgtca aggcagcaat caggattgca atcatggttc ctgcatatga     3600

tgacaatgtc gccccaagac catctctatg agctgaaaaa gaaacaccag gaatgtagtg     3660

gcggaaaagg agatagcaaa tgcttacgat aacgtaagga attattacta tgtaaacacc     3720

aggcatgatt ctgttccgca taattactcc tgataattaa tccttaactt tgcccacctg     3780

ccttttaaaa cattccagta tatcactttt cattcttgcg tagcaatatg ccatctcttc     3840

agctatctca gcattggtga ccttgttcag aggcgctgag agatggcctt tttctgatag     3900

ataatgttct gttaaaatat ctccggcctc atcttttgcc cgcaggctaa tgtctgaaaa     3960

ttgaggtgac gggttaaaaa taatatcctt ggcaaccttt tttatatccc ttttaaattt     4020

tggcttaatg actatatcca atgagtcaaa aagctcccct tcaatatctg ttgcccctaa     4080

gacctttaat atatcgccaa atacaggtag cttggcttct accttcaccg ttgttcggcc     4140

gatgaaatgc atatgcataa catcgtcttt ggtggttccc ctcatcagtg gctctatctg     4200

aacgcgctct ccactgctta atgacattcc tttcccgatt aaaaaatctg tcagatcgga     4260

tgtggtcggc ccgaaaacag ttctggcaaa accaatggtg tcgccttcaa caaacaaaaa     4320

agatgggaat cccaatgatt cgtcatctgc gaggctgttc ttaatatctt caactgaagc     4380

tttagagcga tttatcttct gaaccagact cttgtcattt gttttggtaa agagaaaagt     4440

ttttccatcg attttatgaa tatacaaata attggagcca acctgcaggt gatgattatc     4500

agccagcaga gaattaagga aaacagacag gtttattgag cgcttatctt tccctttatt     4560

tttgctgcgg taagtcgcat aaaaaccatt cttcataatt caatccattt actatgttat     4620

gttctgaggg gagtgaaaat tcccctaatt cgatgaagat tcttgctcaa ttgttatcag     4680

ctatgcgccg accagaacac cttgccgatc agccaaacgt ctcttcaggc cactgactag     4740

cgataacttt ccccacaacg gaacaactct cattgcatgg gatcattggg tactgtgggt     4800

ttagtggttg taaaaacacc tgaccgctat ccctgatcag tttcttgaag gtaaactcat     4860

cacccccaag tctggctatg cagaaatcac ctggctcaac agcctgctca gggtcaacga     4920

gaattaacat tccgtcagga aagcttggct tggagcctgt tggtgcggtc atggaattac     4980

cttcaacctc aagccagaat gcagaatcac tggctttttt ggttgtgctt acccatctct     5040

ccgcatcacc tttggtaaag gttctaagct taggtgagaa catccctgcc tgaacatgag     5100

aaaaaacagg gtactcatac tcacttctaa gtgacggctg catactaacc gcttcataca     5160

tctcgtagat ttctctggcg attgaagggc taaattcttc aacgctaact ttgagaattt     5220

ttgtaagcaa tgcggcgtta taagcattta atgcattgat gccattaaat aaagcaccaa     5280

cgcctgactg ccccatcccc atcttgtctg cgacagattc ctgggataag ccaagttcat     5340

ttttcttttt ttcataaatt gctttaaggc gacgtgcgtc ctcaagctgc tcttgtgtta     5400

atggtttctt ttttgtgctc atacgttaaa tctatcaccg caagggataa atatctaaca     5460

ccgtgcgtgt tgactatttt acctctggcg gtgataatgg ttgcatgtac taaggaggtt     5520

gtatggaaca acgcataacc ctgaaagatt atgcaatgcg ctttgggcaa accaagacag     5580

ctaaagatct cggcgtatat caaagcgcga tcaacaaggc cattcatgca ggccgaaaga     5640

tttttttaac tataaacgct gatggaagcg tttatgcgga agaggtaaag cccttcccga     5700

gtaacaaaaa aacaacagca taaataaccc cgctcttaca cattccagcc ctgaaaaagg     5760

gcatcaaatt aaaccacacc tatggtgtat gcatttattt gcatacattc aatcaattgt     5820

tatctaagga aatacttaca tatggttcgt gcaaacaaac gcaacgaggc tctacgaatc     5880

gagagtgcgt tgcttaacaa aatcgcaatg cttggaactg agaagacagc ggaagctgtg     5940

ggcgttgata agtcgcagat cagcaggtgg aagagggact ggattccaaa gttctcaatg     6000

ctgcttgctg ttcttgaatg gggggtcgtt gacgacgaca tggctcgatt ggcgcgacaa     6060

gttgctgcga ttctcaccaa taaaaaacgc ccggcggcaa ccgagcgttc tgaacaaatc     6120

cagatggagt tctgaggtca ttactggatc tatcaacagg agtcattatg acaaatacag     6180

caaaaatact caacttcggc agaggtaact ttgccggaca ggagcgtaat gtggcagatc     6240

tcgatgatgg ttacgccaga ctatcaaata tgctgcttga ggcttattcg ggcgcagatc     6300

tgaccaagcg acagtttaaa gtgctgcttg ccattctgcg taaaacctat gggtggaata     6360

aaccaatgga cagaatcacc gattctcaac ttagcgagat tacaaagtta cctgtcaaac     6420

ggtgcaatga agccaagtta gaactcgtca gaatgaatat tatcaagcag caaggcggca     6480

tgtttggacc aaataaaaac atctcagaat ggtgcatccc tcaaaacgag ggaaaatccc     6540

ctaaaacgag ggataaaaca tccctcaaat tgggggattg ctatccctca aaacaggggg     6600

acacaaaaga cactattaca aaagaaaaaa gaaaagatta ttcgtcagag aattctggcg     6660

aatcctctga ccagccagaa aacgaccttt ctgtggtgaa accggatgct gcaattcaga     6720

gcggcagcaa gtgggggaca gcagaagacc tgaccgccgc agagtggatg tttgacatgg     6780

tgaagactat cgcaccatca gccagaaaac cgaattttgc tgggtgggct aacgatatcc     6840

gcctgatgcg tgaacgtgac ggacgtaacc accgcgacat gtgtgtgctg ttccgctggg     6900

catgccagga caacttctgg tccggtaacg tgctgagccc ggccaaactc cgcgataagt     6960

ggacccaact cgaaatcaac cgtaacaagc aacaggcagg cgtgacagcc agcaaaccaa     7020

aactcgacct gacaaacaca gactggattt acggggtgga tctatgaaaa acatcgccgc     7080

acagatggtt aactttgacc gtgagcagat gcgtcggatc gccaacaaca tgccggaaca     7140

gtacgacgaa aagccgcagg tacagcaggt agcgcagatc atcaacggtg tgttcagcca     7200

gttactggca actttcccgg cgagcctggc taaccgtgac cagaacgaag tgaacgaaat     7260

ccgtcgccag tgggttctgg cttttcggga aaacgggatc accacgatgg aacaggttaa     7320

cgcaggaatg cgcgtagccc gtcggcagaa tcgaccattt ctgccatcac ccgggcagtt     7380

tgttgcatgg tgccgggaag aagcatccgt taccgccgga ctgccaaacg tcagcgagct     7440

ggttgatatg gtttacgagt attgccggaa gcgaggcctg tatccggatg cggagtctta     7500

tccgtggaaa tcaaacgcgc actactggct ggttaccaac ctgtatcaga acatgcgggc     7560

caatgcgctt actgatgcgg aattacgccg taaggccgca gatgagcttg tccatatgac     7620

tgcgagaatt aaccgtggtg aggcgatccc tgaaccagta aaacaacttc ctgtcatggg     7680

cggtagacct ctaaatcgtg cacaggctct ggcgaagatc gcagaaatca aagctaagtt     7740

cggactgaaa ggagcaagtg tatgacgggc aaagaggcaa ttattcatta cctggggacg     7800

cataatagct tctgtgcgcc ggacgttgcc gcgctaacag gcgcaacagt aaccagcata     7860

aatcaggccg cggctaaaat ggcacgggca ggtcttctgg ttatcgaagg taaggtctgg     7920

cgaacggtgt attaccggtt tgctaccagg gaagaacggg aaggaaagat gagcacgaac     7980

ctggttttta aggagtgtcg ccagagtgcc gcgatgaaac gggtattggc ggtatatgga     8040

gttaaaagat gaccatctac attactgagc taataacagg cctgctggta atcgcaggcc     8100

tttttatttg ggggagaggg aagtcatgaa aaaactaacc tttgaaattc gatctccagc     8160

acatcagcaa aacgctattc acgcagtaca gcaaatcctt ccagacccaa ccaaaccaat     8220

cgtagtaacc attcaggaac gcaaccgcag cttagaccaa aacaggaagc tatgggcctg     8280

cttaggtgac gtctctcgtc aggttgaatg gcatggtcgc tggctggatg cagaaagctg     8340

gaagtgtgtg tttaccgcag cattaaagca gcaggatgtt gttcctaacc ttgccgggaa     8400

tggctttgtg gtaataggcc agtcaaccag caggatgcgt gtaggcgaat ttgcggagct     8460

attagagctt atacaggcat tcggtacaga gcgtggcgtt aagtggtcag acgaagcgag     8520

actggctctg gagtggaaag cgagatgggg agacagggct gcatgataaa tgtcgttagt     8580

ttctccggtg gcaggacgtc agcatatttg ctctggctaa tggagcaaaa gcgacgggca     8640

ggtaaagacg tgcattacgt tttcatggat acaggttgtg aacatccaat gacatatcgg     8700

tttgtcaggg aagttgtgaa gttctgggat ataccgctca ccgtattgca ggttgatatc     8760

aacccggagc ttggacagcc aaatggttat acggtatggg aaccaaagga tattcagacg     8820

cgaatgcctg ttctgaagcc atttatcgat atggtaaaga aatatggcac tccatacgtc     8880

ggcggcgcgt tctgcactga cagattaaaa ctcgttccct tcaccaaata ctgtgatgac     8940

catttcgggc gagggaatta caccacgtgg attggcatca gagctgatga accgaagcgg     9000

ctaaagccaa agcctggaat cagatatctt gctgaactgt cagactttga gaaggaagat     9060

atcctcgcat ggtggaagca acaaccattc gatttgcaaa taccggaaca tctcggtaac     9120

tgcatattct gcattaaaaa atcaacgcaa aaaatcggac ttgcctgcaa agatgaggag     9180

ggattgcagc gtgtttttaa tgaggtcatc acgggatccc atgtgcgtga cggacatcgg     9240

gaaacgccaa aggagattat gtaccgagga agaatgtcgc tggacggtat cgcgaaaatg     9300

tattcagaaa atgattatca agccctgtat caggacatgg tacgagctaa aagattcgat     9360

accggctctt gttctgagtc atgcgaaata tttggagggc agcttgattt cgacttcggg     9420

agggaagctg catgatgcga tgttatcggt gcggtgaatg caaagaagat aaccgcttcc     9480

gaccaaatca accttactgg aatcgatggt gtctccggtg tgaaagaaca ccaacagggg     9540

tgttaccact accgcaggaa aaggaggacg tgtggcgaga cagcgacgaa gtatcaccga     9600

cataatctgc gaaaactgca aataccttcc aacgaaacgc accagaaata aacccaagcc     9660

aatcccaaaa gaatctgacg taaaaacctt caactacacg gctcacctgt gggatatccg     9720

gtggctaaga cgtcgtgcga ggaaaacaag gtgattgacc aaaatcgaag ttacgaacaa     9780

gaaagcgtcg agcgagcttt aacgtgcgct aactgcggtc agaagctgca tgtgctggaa     9840

gttcacgtgt gtgagcactg ctgcgcagaa ctgatgagcg atccgaatag ctcgatgcac     9900

gaggaagaag atgatggcta aaccagcgcg aagacgatgt aaaaacgatg aatgccggga     9960

atggtttcac cctgcattcg ctaatcagtg gtggtgctct ccagagtgtg gaaccaagat    10020

agcactcgaa cgacgaagta aagaacgcga aaaagcggaa aaagcagcag agaagaaacg    10080

acgacgagag gagcagaaac agaaagataa acttaagatt cgaaaactcg ccttaaagcc    10140

ccgcagttac tggattaaac aagcccaaca agccagga                            10178


<210>  24
<211>  15
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Adaptor portion G of Fig 5

<400>  24
ttgaccgctc gcctc                                                        15


