                         SEQUENCE LISTING

<110>  OXFORD NANOPORE TECHNOLOGIES LIMITED
 
<120>  MODIFIED ENZYMES

<130>  N401334WO-B

<150>  GB 1318464.3
<151>  2013-10-18

<150>  GB 1406151.9
<151>  2014-04-04

<150>  GB 1404718.7
<151>  2014-03-17

<150>  PCT/GB2014/050175
<151>  2014-01-22

<160>  80    

<170>  PatentIn version 3.5

<210>  1
<211>  558
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Mycobacterium smegmatis porin A mutant 
       (D90N/D91N/D93N/D118R/D134R/E139K)

<400>  1
atgggtctgg ataatgaact gagcctggtg gacggtcaag atcgtaccct gacggtgcaa       60

caatgggata cctttctgaa tggcgttttt ccgctggatc gtaatcgcct gacccgtgaa      120

tggtttcatt ccggtcgcgc aaaatatatc gtcgcaggcc cgggtgctga cgaattcgaa      180

ggcacgctgg aactgggtta tcagattggc tttccgtggt cactgggcgt tggtatcaac      240

ttctcgtaca ccacgccgaa tattctgatc aacaatggta acattaccgc accgccgttt      300

ggcctgaaca gcgtgattac gccgaacctg tttccgggtg ttagcatctc tgcccgtctg      360

ggcaatggtc cgggcattca agaagtggca acctttagtg tgcgcgtttc cggcgctaaa      420

ggcggtgtcg cggtgtctaa cgcccacggt accgttacgg gcgcggccgg cggtgtcctg      480

ctgcgtccgt tcgcgcgcct gattgcctct accggcgaca gcgttacgac ctatggcgaa      540

ccgtggaata tgaactaa                                                    558


<210>  2
<211>  184
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Mycobacterium smegmatis porin A mutant 
       (D90N/D91N/D93N/D118R/D134R/E139K)

<400>  2

Gly Leu Asp Asn Glu Leu Ser Leu Val Asp Gly Gln Asp Arg Thr Leu 
1               5                   10                  15      


Thr Val Gln Gln Trp Asp Thr Phe Leu Asn Gly Val Phe Pro Leu Asp 
            20                  25                  30          


Arg Asn Arg Leu Thr Arg Glu Trp Phe His Ser Gly Arg Ala Lys Tyr 
        35                  40                  45              


Ile Val Ala Gly Pro Gly Ala Asp Glu Phe Glu Gly Thr Leu Glu Leu 
    50                  55                  60                  


Gly Tyr Gln Ile Gly Phe Pro Trp Ser Leu Gly Val Gly Ile Asn Phe 
65                  70                  75                  80  


Ser Tyr Thr Thr Pro Asn Ile Leu Ile Asn Asn Gly Asn Ile Thr Ala 
                85                  90                  95      


Pro Pro Phe Gly Leu Asn Ser Val Ile Thr Pro Asn Leu Phe Pro Gly 
            100                 105                 110         


Val Ser Ile Ser Ala Arg Leu Gly Asn Gly Pro Gly Ile Gln Glu Val 
        115                 120                 125             


Ala Thr Phe Ser Val Arg Val Ser Gly Ala Lys Gly Gly Val Ala Val 
    130                 135                 140                 


Ser Asn Ala His Gly Thr Val Thr Gly Ala Ala Gly Gly Val Leu Leu 
145                 150                 155                 160 


Arg Pro Phe Ala Arg Leu Ile Ala Ser Thr Gly Asp Ser Val Thr Thr 
                165                 170                 175     


Tyr Gly Glu Pro Trp Asn Met Asn 
            180                 


<210>  3
<211>  885
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  alpha-hemolysin mutant E111N/K147N

<400>  3
atggcagatt ctgatattaa tattaaaacc ggtactacag atattggaag caatactaca       60

gtaaaaacag gtgatttagt cacttatgat aaagaaaatg gcatgcacaa aaaagtattt      120

tatagtttta tcgatgataa aaatcacaat aaaaaactgc tagttattag aacaaaaggt      180

accattgctg gtcaatatag agtttatagc gaagaaggtg ctaacaaaag tggtttagcc      240

tggccttcag cctttaaggt acagttgcaa ctacctgata atgaagtagc tcaaatatct      300

gattactatc caagaaattc gattgataca aaaaactata tgagtacttt aacttatgga      360

ttcaacggta atgttactgg tgatgataca ggaaaaattg gcggccttat tggtgcaaat      420

gtttcgattg gtcatacact gaactatgtt caacctgatt tcaaaacaat tttagagagc      480

ccaactgata aaaaagtagg ctggaaagtg atatttaaca atatggtgaa tcaaaattgg      540

ggaccatacg atcgagattc ttggaacccg gtatatggca atcaactttt catgaaaact      600

agaaatggtt ctatgaaagc agcagataac ttccttgatc ctaacaaagc aagttctcta      660

ttatcttcag ggttttcacc agacttcgct acagttatta ctatggatag aaaagcatcc      720

aaacaacaaa caaatataga tgtaatatac gaacgagttc gtgatgatta ccaattgcat      780

tggacttcaa caaattggaa aggtaccaat actaaagata aatggacaga tcgttcttca      840

gaaagatata aaatcgattg ggaaaaagaa gaaatgacaa attaa                      885


<210>  4
<211>  293
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  alpha-hemolysin mutant E111N/K147N

<400>  4

Ala Asp Ser Asp Ile Asn Ile Lys Thr Gly Thr Thr Asp Ile Gly Ser 
1               5                   10                  15      


Asn Thr Thr Val Lys Thr Gly Asp Leu Val Thr Tyr Asp Lys Glu Asn 
            20                  25                  30          


Gly Met His Lys Lys Val Phe Tyr Ser Phe Ile Asp Asp Lys Asn His 
        35                  40                  45              


Asn Lys Lys Leu Leu Val Ile Arg Thr Lys Gly Thr Ile Ala Gly Gln 
    50                  55                  60                  


Tyr Arg Val Tyr Ser Glu Glu Gly Ala Asn Lys Ser Gly Leu Ala Trp 
65                  70                  75                  80  


Pro Ser Ala Phe Lys Val Gln Leu Gln Leu Pro Asp Asn Glu Val Ala 
                85                  90                  95      


Gln Ile Ser Asp Tyr Tyr Pro Arg Asn Ser Ile Asp Thr Lys Asn Tyr 
            100                 105                 110         


Met Ser Thr Leu Thr Tyr Gly Phe Asn Gly Asn Val Thr Gly Asp Asp 
        115                 120                 125             


Thr Gly Lys Ile Gly Gly Leu Ile Gly Ala Asn Val Ser Ile Gly His 
    130                 135                 140                 


Thr Leu Asn Tyr Val Gln Pro Asp Phe Lys Thr Ile Leu Glu Ser Pro 
145                 150                 155                 160 


Thr Asp Lys Lys Val Gly Trp Lys Val Ile Phe Asn Asn Met Val Asn 
                165                 170                 175     


Gln Asn Trp Gly Pro Tyr Asp Arg Asp Ser Trp Asn Pro Val Tyr Gly 
            180                 185                 190         


Asn Gln Leu Phe Met Lys Thr Arg Asn Gly Ser Met Lys Ala Ala Asp 
        195                 200                 205             


Asn Phe Leu Asp Pro Asn Lys Ala Ser Ser Leu Leu Ser Ser Gly Phe 
    210                 215                 220                 


Ser Pro Asp Phe Ala Thr Val Ile Thr Met Asp Arg Lys Ala Ser Lys 
225                 230                 235                 240 


Gln Gln Thr Asn Ile Asp Val Ile Tyr Glu Arg Val Arg Asp Asp Tyr 
                245                 250                 255     


Gln Leu His Trp Thr Ser Thr Asn Trp Lys Gly Thr Asn Thr Lys Asp 
            260                 265                 270         


Lys Trp Thr Asp Arg Ser Ser Glu Arg Tyr Lys Ile Asp Trp Glu Lys 
        275                 280                 285             


Glu Glu Met Thr Asn 
    290             


<210>  5
<211>  184
<212>  PRT
<213>  Mycobacterium smegmatis

<400>  5

Gly Leu Asp Asn Glu Leu Ser Leu Val Asp Gly Gln Asp Arg Thr Leu 
1               5                   10                  15      


Thr Val Gln Gln Trp Asp Thr Phe Leu Asn Gly Val Phe Pro Leu Asp 
            20                  25                  30          


Arg Asn Arg Leu Thr Arg Glu Trp Phe His Ser Gly Arg Ala Lys Tyr 
        35                  40                  45              


Ile Val Ala Gly Pro Gly Ala Asp Glu Phe Glu Gly Thr Leu Glu Leu 
    50                  55                  60                  


Gly Tyr Gln Ile Gly Phe Pro Trp Ser Leu Gly Val Gly Ile Asn Phe 
65                  70                  75                  80  


Ser Tyr Thr Thr Pro Asn Ile Leu Ile Asp Asp Gly Asp Ile Thr Ala 
                85                  90                  95      


Pro Pro Phe Gly Leu Asn Ser Val Ile Thr Pro Asn Leu Phe Pro Gly 
            100                 105                 110         


Val Ser Ile Ser Ala Asp Leu Gly Asn Gly Pro Gly Ile Gln Glu Val 
        115                 120                 125             


Ala Thr Phe Ser Val Asp Val Ser Gly Pro Ala Gly Gly Val Ala Val 
    130                 135                 140                 


Ser Asn Ala His Gly Thr Val Thr Gly Ala Ala Gly Gly Val Leu Leu 
145                 150                 155                 160 


Arg Pro Phe Ala Arg Leu Ile Ala Ser Thr Gly Asp Ser Val Thr Thr 
                165                 170                 175     


Tyr Gly Glu Pro Trp Asn Met Asn 
            180                 


<210>  6
<211>  184
<212>  PRT
<213>  Mycobacterium smegmatis

<400>  6

Gly Leu Asp Asn Glu Leu Ser Leu Val Asp Gly Gln Asp Arg Thr Leu 
1               5                   10                  15      


Thr Val Gln Gln Trp Asp Thr Phe Leu Asn Gly Val Phe Pro Leu Asp 
            20                  25                  30          


Arg Asn Arg Leu Thr Arg Glu Trp Phe His Ser Gly Arg Ala Lys Tyr 
        35                  40                  45              


Ile Val Ala Gly Pro Gly Ala Asp Glu Phe Glu Gly Thr Leu Glu Leu 
    50                  55                  60                  


Gly Tyr Gln Ile Gly Phe Pro Trp Ser Leu Gly Val Gly Ile Asn Phe 
65                  70                  75                  80  


Ser Tyr Thr Thr Pro Asn Ile Leu Ile Asp Asp Gly Asp Ile Thr Gly 
                85                  90                  95      


Pro Pro Phe Gly Leu Glu Ser Val Ile Thr Pro Asn Leu Phe Pro Gly 
            100                 105                 110         


Val Ser Ile Ser Ala Asp Leu Gly Asn Gly Pro Gly Ile Gln Glu Val 
        115                 120                 125             


Ala Thr Phe Ser Val Asp Val Ser Gly Pro Ala Gly Gly Val Ala Val 
    130                 135                 140                 


Ser Asn Ala His Gly Thr Val Thr Gly Ala Ala Gly Gly Val Leu Leu 
145                 150                 155                 160 


Arg Pro Phe Ala Arg Leu Ile Ala Ser Thr Gly Asp Ser Val Thr Thr 
                165                 170                 175     


Tyr Gly Glu Pro Trp Asn Met Asn 
            180                 


<210>  7
<211>  183
<212>  PRT
<213>  Mycobacterium smegmatis

<400>  7

Val Asp Asn Gln Leu Ser Val Val Asp Gly Gln Gly Arg Thr Leu Thr 
1               5                   10                  15      


Val Gln Gln Ala Glu Thr Phe Leu Asn Gly Val Phe Pro Leu Asp Arg 
            20                  25                  30          


Asn Arg Leu Thr Arg Glu Trp Phe His Ser Gly Arg Ala Thr Tyr His 
        35                  40                  45              


Val Ala Gly Pro Gly Ala Asp Glu Phe Glu Gly Thr Leu Glu Leu Gly 
    50                  55                  60                  


Tyr Gln Val Gly Phe Pro Trp Ser Leu Gly Val Gly Ile Asn Phe Ser 
65                  70                  75                  80  


Tyr Thr Thr Pro Asn Ile Leu Ile Asp Gly Gly Asp Ile Thr Gln Pro 
                85                  90                  95      


Pro Phe Gly Leu Asp Thr Ile Ile Thr Pro Asn Leu Phe Pro Gly Val 
            100                 105                 110         


Ser Ile Ser Ala Asp Leu Gly Asn Gly Pro Gly Ile Gln Glu Val Ala 
        115                 120                 125             


Thr Phe Ser Val Asp Val Lys Gly Ala Lys Gly Ala Val Ala Val Ser 
    130                 135                 140                 


Asn Ala His Gly Thr Val Thr Gly Ala Ala Gly Gly Val Leu Leu Arg 
145                 150                 155                 160 


Pro Phe Ala Arg Leu Ile Ala Ser Thr Gly Asp Ser Val Thr Thr Tyr 
                165                 170                 175     


Gly Glu Pro Trp Asn Met Asn 
            180             


<210>  8
<211>  439
<212>  PRT
<213>  Enterobacteria phage T4

<400>  8

Met Thr Phe Asp Asp Leu Thr Glu Gly Gln Lys Asn Ala Phe Asn Ile 
1               5                   10                  15      


Val Met Lys Ala Ile Lys Glu Lys Lys His His Val Thr Ile Asn Gly 
            20                  25                  30          


Pro Ala Gly Thr Gly Lys Thr Thr Leu Thr Lys Phe Ile Ile Glu Ala 
        35                  40                  45              


Leu Ile Ser Thr Gly Glu Thr Gly Ile Ile Leu Ala Ala Pro Thr His 
    50                  55                  60                  


Ala Ala Lys Lys Ile Leu Ser Lys Leu Ser Gly Lys Glu Ala Ser Thr 
65                  70                  75                  80  


Ile His Ser Ile Leu Lys Ile Asn Pro Val Thr Tyr Glu Glu Asn Val 
                85                  90                  95      


Leu Phe Glu Gln Lys Glu Val Pro Asp Leu Ala Lys Cys Arg Val Leu 
            100                 105                 110         


Ile Cys Asp Glu Val Ser Met Tyr Asp Arg Lys Leu Phe Lys Ile Leu 
        115                 120                 125             


Leu Ser Thr Ile Pro Pro Trp Cys Thr Ile Ile Gly Ile Gly Asp Asn 
    130                 135                 140                 


Lys Gln Ile Arg Pro Val Asp Pro Gly Glu Asn Thr Ala Tyr Ile Ser 
145                 150                 155                 160 


Pro Phe Phe Thr His Lys Asp Phe Tyr Gln Cys Glu Leu Thr Glu Val 
                165                 170                 175     


Lys Arg Ser Asn Ala Pro Ile Ile Asp Val Ala Thr Asp Val Arg Asn 
            180                 185                 190         


Gly Lys Trp Ile Tyr Asp Lys Val Val Asp Gly His Gly Val Arg Gly 
        195                 200                 205             


Phe Thr Gly Asp Thr Ala Leu Arg Asp Phe Met Val Asn Tyr Phe Ser 
    210                 215                 220                 


Ile Val Lys Ser Leu Asp Asp Leu Phe Glu Asn Arg Val Met Ala Phe 
225                 230                 235                 240 


Thr Asn Lys Ser Val Asp Lys Leu Asn Ser Ile Ile Arg Lys Lys Ile 
                245                 250                 255     


Phe Glu Thr Asp Lys Asp Phe Ile Val Gly Glu Ile Ile Val Met Gln 
            260                 265                 270         


Glu Pro Leu Phe Lys Thr Tyr Lys Ile Asp Gly Lys Pro Val Ser Glu 
        275                 280                 285             


Ile Ile Phe Asn Asn Gly Gln Leu Val Arg Ile Ile Glu Ala Glu Tyr 
    290                 295                 300                 


Thr Ser Thr Phe Val Lys Ala Arg Gly Val Pro Gly Glu Tyr Leu Ile 
305                 310                 315                 320 


Arg His Trp Asp Leu Thr Val Glu Thr Tyr Gly Asp Asp Glu Tyr Tyr 
                325                 330                 335     


Arg Glu Lys Ile Lys Ile Ile Ser Ser Asp Glu Glu Leu Tyr Lys Phe 
            340                 345                 350         


Asn Leu Phe Leu Gly Lys Thr Ala Glu Thr Tyr Lys Asn Trp Asn Lys 
        355                 360                 365             


Gly Gly Lys Ala Pro Trp Ser Asp Phe Trp Asp Ala Lys Ser Gln Phe 
    370                 375                 380                 


Ser Lys Val Lys Ala Leu Pro Ala Ser Thr Phe His Lys Ala Gln Gly 
385                 390                 395                 400 


Met Ser Val Asp Arg Ala Phe Ile Tyr Thr Pro Cys Ile His Tyr Ala 
                405                 410                 415     


Asp Val Glu Leu Ala Gln Gln Leu Leu Tyr Val Gly Val Thr Arg Gly 
            420                 425                 430         


Arg Tyr Asp Val Phe Tyr Val 
        435                 


<210>  9
<211>  678
<212>  PRT
<213>  Rhodothermus marinus

<400>  9

Met Glu Glu Leu Ser Asn Glu Gln Gln Arg Val Leu Asp His Val Leu 
1               5                   10                  15      


Ala Trp Leu Glu Arg Asn Asp Ala Pro Pro Ile Phe Ile Leu Thr Gly 
            20                  25                  30          


Ser Ala Gly Thr Gly Lys Thr Leu Leu Ile Arg His Leu Val Arg Ala 
        35                  40                  45              


Leu Gln Asp Arg Arg Ile His Tyr Ala Leu Ala Ala Pro Thr Gly Arg 
    50                  55                  60                  


Ala Ala Arg Ile Leu Ser Glu Arg Thr Gly Asp His Ala Arg Thr Leu 
65                  70                  75                  80  


His Ser Leu Ile Tyr Ile Phe Asp Arg Tyr Gln Leu Val Glu Glu Ala 
                85                  90                  95      


Asp Arg Gln Thr Asp Glu Pro Leu Ser Leu Gln Leu His Phe Ala Leu 
            100                 105                 110         


Arg Ser Ala Glu His Asp Ala Arg Leu Ile Ile Val Asp Glu Ala Ser 
        115                 120                 125             


Met Val Ser Asp Thr Ala Gly Glu Glu Glu Leu Tyr Arg Phe Gly Ser 
    130                 135                 140                 


Gly Arg Leu Leu Asn Asp Leu Leu Thr Phe Ala Arg Leu Ile Pro Lys 
145                 150                 155                 160 


Arg Asp Arg Pro Pro Thr Thr Arg Leu Leu Phe Val Gly Asp Pro Ala 
                165                 170                 175     


Gln Leu Pro Pro Val Gly Gln Ser Val Ser Pro Ala Leu Ser Ala Gln 
            180                 185                 190         


Tyr Leu Arg Asp Thr Phe Gly Leu Ser Ala Glu Thr Ala His Leu Arg 
        195                 200                 205             


Ser Val Tyr Arg Gln Arg Lys Gly His Pro Ile Leu Glu Thr Ala Thr 
    210                 215                 220                 


Ala Leu Arg Asn Ala Leu Glu Lys Gly His Tyr His Thr Phe Arg Leu 
225                 230                 235                 240 


Pro Glu Gln Pro Pro Asp Leu Arg Pro Val Gly Leu Glu Glu Ala Ile 
                245                 250                 255     


Glu Thr Thr Ala Thr Asp Phe Arg Arg Gln Asn Pro Ser Val Leu Leu 
            260                 265                 270         


Cys Arg Thr Asn Ala Leu Ala Arg Lys Leu Asn Ala Ala Val Arg Ala 
        275                 280                 285             


Arg Leu Trp Gly Arg Glu Gly Leu Pro Pro Gln Pro Gly Asp Leu Leu 
    290                 295                 300                 


Leu Val Asn Arg Asn Ala Pro Leu His Gly Leu Phe Asn Gly Asp Leu 
305                 310                 315                 320 


Val Leu Val Glu Thr Val Gly Pro Leu Glu His Arg Arg Val Gly Arg 
                325                 330                 335     


Arg Gly Arg Pro Pro Val Asp Leu Tyr Phe Arg Asp Val Glu Leu Leu 
            340                 345                 350         


Tyr Pro His Glu Lys Pro Arg Asn Arg Ile Arg Cys Lys Leu Leu Glu 
        355                 360                 365             


Asn Leu Leu Glu Ser Pro Asp Gly Gln Leu Ser Pro Asp Ile Ile Gln 
    370                 375                 380                 


Ala Leu Leu Ile Asp Phe Tyr Arg Arg His Pro Ser Leu Lys His Gly 
385                 390                 395                 400 


Ser Ser Glu Phe Arg Leu Met Leu Ala Asn Asp Ala Tyr Phe Asn Ala 
                405                 410                 415     


Leu His Val Arg Tyr Gly Tyr Ala Met Thr Val His Lys Ala Gln Gly 
            420                 425                 430         


Gly Glu Trp Lys Arg Ala Thr Val Val Phe Asn Asp Trp Arg His Phe 
        435                 440                 445             


Arg His Ala Glu Phe Phe Arg Trp Ala Tyr Thr Ala Ile Thr Arg Ala 
    450                 455                 460                 


Arg Glu Glu Leu Leu Thr Ile Gly Ala Pro Ser Phe Glu Ala Leu Ser 
465                 470                 475                 480 


Asp Met Arg Trp Gln Pro Ala Pro Ser Val Pro Ala Pro Glu Gln Ala 
                485                 490                 495     


Ala Glu Asn Ala Thr Arg Phe Pro Leu Lys Ala Leu Glu Thr Tyr His 
            500                 505                 510         


Gln Arg Leu Ser Glu Ala Leu Thr Ala Ala Gly Ile Glu Thr Thr Gly 
        515                 520                 525             


Val Glu Leu Leu Gln Tyr Ala Val Arg Tyr His Leu Ala Arg Ala Asp 
    530                 535                 540                 


Arg Thr Thr Arg Ile Gln Tyr Tyr Tyr Arg Gly Asp Gly Gln Ile Ser 
545                 550                 555                 560 


Arg Ile Val Thr Leu Gly Gly Ala Asp Asp Pro Glu Leu Thr Gln Gln 
                565                 570                 575     


Ala Tyr Ala Leu Phe Glu Arg Ile Leu Ser Glu Pro Pro Ala Asp Ser 
            580                 585                 590         


Gly Glu Leu Pro Glu Asn Pro Leu Leu Arg Glu Phe Leu Glu Arg Ala 
        595                 600                 605             


His Leu Arg Leu Glu Gly Ser Gly Ile Arg Ile Val His Trp Lys Glu 
    610                 615                 620                 


Met Pro Tyr Ala Leu Arg Leu Tyr Phe Ser Ala Asp Gly Glu Asn Val 
625                 630                 635                 640 


Thr Ile Asp Phe Tyr Tyr Asn Arg Arg Gly Val Trp Thr His Ala Gln 
                645                 650                 655     


Glu Val Gly Arg Ser Ser Ser Gly Ala Leu Phe Ala Arg Ile Gln Ser 
            660                 665                 670         


Leu Leu Gln Ala Asp Ser 
        675             


<210>  10
<211>  496
<212>  PRT
<213>  Cyanothece ATCC51142

<400>  10

Met Ser Gln Ser Val Val Val Pro Asp Glu Leu Gly Glu Ile Ile Thr 
1               5                   10                  15      


Ala Val Ile Glu Phe Tyr Gln Asp Ala Val Asp Lys Ile Glu Pro Lys 
            20                  25                  30          


Ile Val Phe Leu Glu Leu Arg Lys Asn Val Val Asp Trp Val Ser Arg 
        35                  40                  45              


Thr Gln Leu Lys Ile Glu Glu Lys Glu Ile Gln Ala Thr Gly Leu Thr 
    50                  55                  60                  


Arg Gln Gln Gln Thr Ala Tyr Lys Glu Met Ile Asn Phe Ile Glu Asn 
65                  70                  75                  80  


Ser Ser Glu Gln Tyr Phe Arg Leu Ser Gly Tyr Ala Gly Thr Gly Lys 
                85                  90                  95      


Ser Phe Leu Met Ala Lys Val Ile Glu Trp Leu Lys Gln Glu Asp Tyr 
            100                 105                 110         


Lys Tyr Ser Val Ala Ala Pro Thr Asn Lys Ala Ala Lys Asn Leu Thr 
        115                 120                 125             


Gln Ile Ala Arg Ser Gln Gly Ile Lys Ile Glu Ala Thr Thr Val Ala 
    130                 135                 140                 


Lys Leu Leu Lys Leu Gln Pro Thr Ile Asp Val Asp Thr Gly Gln Gln 
145                 150                 155                 160 


Ser Phe Glu Phe Asn Ser Glu Lys Glu Leu Glu Leu Lys Asp Tyr Asp 
                165                 170                 175     


Val Ile Ile Ile Asp Glu Tyr Ser Met Leu Asn Lys Asp Asn Phe Arg 
            180                 185                 190         


Asp Leu Gln Gln Ala Val Lys Gly Gly Glu Ser Lys Phe Ile Phe Val 
        195                 200                 205             


Gly Asp Ser Ser Gln Leu Pro Pro Val Lys Glu Lys Glu Pro Ile Val 
    210                 215                 220                 


Ala Asn His Pro Asp Ile Arg Lys Ser Ala Asn Leu Thr Gln Ile Val 
225                 230                 235                 240 


Arg Tyr Asp Gly Glu Ile Val Lys Val Ala Glu Ser Ile Arg Arg Asn 
                245                 250                 255     


Pro Arg Trp Asn His Gln Thr Tyr Pro Phe Glu Thr Val Ala Asp Gly 
            260                 265                 270         


Thr Ile Ile Lys Leu Asn Thr Glu Asp Trp Leu Gln Gln Ala Leu Ser 
        275                 280                 285             


His Phe Glu Lys Glu Asp Trp Leu Ser Asn Pro Asp Tyr Val Arg Met 
    290                 295                 300                 


Ile Thr Trp Arg Asn Lys Thr Ala Asp Lys Tyr Asn Gln Ala Ile Arg 
305                 310                 315                 320 


Glu Ala Leu Tyr Gly Glu Asn Val Glu Gln Leu Val Val Gly Asp Arg 
                325                 330                 335     


Leu Ile Ala Lys Lys Pro Val Phe Arg Ser Leu Pro Gly Gly Lys Lys 
            340                 345                 350         


Lys Glu Lys Lys Ile Ile Leu Asn Asn Ser Glu Glu Cys Lys Val Ile 
        355                 360                 365             


Glu Thr Pro Lys Ile Asn Tyr Asn Glu Lys Tyr Lys Trp Glu Phe Tyr 
    370                 375                 380                 


Gln Val Lys Val Arg Thr Asp Glu Gly Gly Met Ile Glu Leu Arg Ile 
385                 390                 395                 400 


Leu Thr Ser Glu Ser Glu Glu Lys Arg Gln Lys Lys Leu Lys Glu Leu 
                405                 410                 415     


Ala Lys Arg Ala Arg Glu Glu Glu Asn Tyr Ser Glu Lys Lys Lys Gln 
            420                 425                 430         


Trp Ala Ile Tyr Tyr Glu Leu Asp Glu Leu Phe Asp Asn Met Ala Tyr 
        435                 440                 445             


Ala Tyr Ala Leu Thr Cys His Lys Ala Gln Gly Ser Ser Ile Asp Asn 
    450                 455                 460                 


Val Phe Leu Leu Val Ser Asp Met His Tyr Cys Arg Asp Lys Thr Lys 
465                 470                 475                 480 


Met Ile Tyr Thr Gly Leu Thr Arg Ala Lys Lys Cys Cys Tyr Val Gly 
                485                 490                 495     


<210>  11
<211>  421
<212>  PRT
<213>  Salinibacter ruber

<400>  11

Met Ser Thr Phe Ala Asp Ala Pro Phe Thr Glu Asp Gln Glu Glu Ala 
1               5                   10                  15      


Tyr Asp His Val Tyr Asp Arg Leu Ala Gln Gly Glu Arg Phe Thr Gly 
            20                  25                  30          


Leu Arg Gly Tyr Ala Gly Thr Gly Lys Thr Tyr Leu Val Ser Arg Leu 
        35                  40                  45              


Val Glu Gln Leu Leu Asp Glu Asp Cys Thr Val Thr Val Cys Ala Pro 
    50                  55                  60                  


Thr His Lys Ala Val Gln Val Leu Ser Asp Glu Leu Gly Asp Ala Pro 
65                  70                  75                  80  


Val Gln Met Gln Thr Leu His Ser Phe Leu Gly Leu Arg Leu Gln Pro 
                85                  90                  95      


Lys Gln Asp Gly Glu Tyr Glu Leu Val Ala Glu Glu Glu Arg Asn Phe 
            100                 105                 110         


Ala Glu Gly Val Val Ile Val Asp Glu Ala Ser Met Ile Gly Arg Glu 
        115                 120                 125             


Glu Trp Ser His Ile Gln Asp Ala Pro Phe Trp Val Gln Trp Leu Phe 
    130                 135                 140                 


Val Gly Asp Pro Ala Gln Leu Pro Pro Val Asn Glu Asp Pro Ser Pro 
145                 150                 155                 160 


Ala Leu Asp Val Pro Gly Pro Thr Leu Glu Thr Ile His Arg Gln Ala 
                165                 170                 175     


Ala Asp Asn Pro Ile Leu Glu Leu Ala Thr Lys Ile Arg Thr Gly Ala 
            180                 185                 190         


Asp Gly Arg Phe Gly Ser Thr Phe Glu Asp Gly Lys Gly Val Ala Val 
        195                 200                 205             


Thr Arg Asn Arg Glu Glu Phe Leu Asp Ser Ile Leu Arg Ala Phe Asp 
    210                 215                 220                 


Ala Asp Ala Phe Ala Glu Asp Ala Thr His Ala Arg Val Leu Ala Tyr 
225                 230                 235                 240 


Arg Asn Lys Thr Val Arg Arg Tyr Asn Arg Glu Ile Arg Ala Glu Arg 
                245                 250                 255     


Tyr Gly Ala Asp Ala Asp Arg Phe Val Glu Gly Glu Trp Leu Val Gly 
            260                 265                 270         


Thr Glu Thr Trp Tyr Tyr Asp Gly Val Gln Arg Leu Thr Asn Ser Glu 
        275                 280                 285             


Glu Val Arg Val Lys Lys Ala Gln Val Glu Thr Phe Glu Ala Asp Asp 
    290                 295                 300                 


Gln Ser Glu Trp Thr Val Trp Glu Leu Lys Ile Arg Thr Pro Gly Arg 
305                 310                 315                 320 


Gly Leu Thr Arg Thr Ile His Val Leu His Glu Glu Glu Arg Glu Arg 
                325                 330                 335     


Tyr Glu Asn Ala Leu Glu Arg Arg Arg Gly Lys Ala Glu Asp Asp Pro 
            340                 345                 350         


Ser Lys Trp Asp Arg Phe Phe Glu Leu Arg Glu Arg Phe Ala Arg Val 
        355                 360                 365             


Asp Tyr Ala Tyr Ala Thr Thr Val His Arg Ala Gln Gly Ser Thr Tyr 
    370                 375                 380                 


Asp Thr Val Phe Val Asp His Arg Asp Leu Arg Val Cys Arg Gly Glu 
385                 390                 395                 400 


Glu Arg Gly Ala Leu Leu Tyr Val Ala Val Thr Arg Pro Ser Arg Arg 
                405                 410                 415     


Leu Ala Leu Leu Val 
            420     


<210>  12
<211>  500
<212>  PRT
<213>  Sullfurimonas gotlandica GD1

<400>  12

Met Lys Ile Leu Asn Lys Glu Thr Tyr Lys Leu Ser Leu His Gln Glu 
1               5                   10                  15      


Glu Val Phe Thr Gln Ile Val Ser Gln Leu Asp Thr Lys Val Ser Ser 
            20                  25                  30          


Ile Leu Lys Ser Thr Asn Ile Glu Asp Tyr Leu Leu Ser Leu Thr Gly 
        35                  40                  45              


Pro Ala Gly Thr Gly Lys Thr Phe Leu Thr Thr Gln Ile Ala Lys Tyr 
    50                  55                  60                  


Leu Val Glu Lys Arg Lys Glu Ser Glu Tyr Pro Met Ser Ser Asp Phe 
65                  70                  75                  80  


Asp Phe Thr Ile Thr Ala Pro Thr His Lys Ala Val Gly Val Leu Ser 
                85                  90                  95      


Lys Leu Leu Arg Glu Asn Asn Ile Gln Ser Ser Cys Lys Thr Ile His 
            100                 105                 110         


Ser Phe Leu Gly Ile Lys Pro Phe Ile Asp Tyr Thr Thr Gly Glu Glu 
        115                 120                 125             


Lys Phe Val Val Asp Lys Thr Asn Lys Arg Lys Asp Arg Thr Ser Ile 
    130                 135                 140                 


Leu Ile Val Asp Glu Ser Ser Met Ile Gly Asn Thr Leu Tyr Glu Tyr 
145                 150                 155                 160 


Ile Leu Glu Ala Ile Glu Asp Lys Arg Val Asn Val Val Leu Phe Ile 
                165                 170                 175     


Gly Asp Pro Tyr Gln Leu Leu Pro Ile Glu Asn Ser Lys Asn Glu Ile 
            180                 185                 190         


Tyr Asp Leu Pro Asn Arg Phe Phe Leu Ser Glu Val Val Arg Gln Ala 
        195                 200                 205             


Glu Asn Ser Tyr Ile Ile Arg Val Ala Thr Lys Leu Arg Glu Arg Ile 
    210                 215                 220                 


Lys Asn Gln Asp Phe Ile Ser Leu Gln Gln Phe Phe Gln Glu Asn Met 
225                 230                 235                 240 


Glu Asp Glu Ile Thr Phe Phe His Asn Lys Glu Ala Phe Leu Glu Asp 
                245                 250                 255     


Phe Tyr Lys Glu Glu Glu Trp Tyr Lys Glu Asn Lys Ile Leu Ala Thr 
            260                 265                 270         


Tyr Lys Asn Lys Asp Val Asp Ala Phe Asn Lys Ile Ile Arg Asn Lys 
        275                 280                 285             


Phe Trp Glu Gln Lys Gly Asn Thr Thr Pro Ser Thr Leu Leu Ala Gly 
    290                 295                 300                 


Asp Met Ile Arg Phe Lys Asp Ala Tyr Thr Val Gly Asp Ile Thr Ile 
305                 310                 315                 320 


Tyr His Asn Gly Gln Glu Leu Gln Leu Gly Ser Thr Glu Val Lys Tyr 
                325                 330                 335     


His Asp Ser Leu His Ile Glu Tyr Trp Glu Cys Lys Ser Ile Tyr Ala 
            340                 345                 350         


Leu Glu Gln Gln Val Phe Arg Val Val Asn Pro Asp Ser Glu Ala Val 
        355                 360                 365             


Phe Asn Gln Lys Leu Gln Ser Leu Ala Thr Lys Ala Lys Gln Ala Lys 
    370                 375                 380                 


Phe Pro Asp Asn Lys Lys Leu Trp Lys Leu Tyr Tyr Glu Thr Arg Asn 
385                 390                 395                 400 


Met Phe Ala Asn Val Gln Tyr Ile His Ala Ser Thr Ile His Lys Leu 
                405                 410                 415     


Gln Gly Ser Thr Tyr Asp Val Ser Tyr Ile Asp Ile Phe Ser Leu Val 
            420                 425                 430         


His Asn His Tyr Met Ser Asp Glu Glu Lys Tyr Arg Leu Leu Tyr Val 
        435                 440                 445             


Ala Ile Thr Arg Ala Ser Lys Asp Ile Lys Ile Phe Met Ser Ala Phe 
    450                 455                 460                 


Asp Arg Thr Ser Asp Glu Lys Val Ile Ile Asn Asn Gln Asn Ser Glu 
465                 470                 475                 480 


Thr Met Asn Thr Leu Lys Gln Leu His Asp Ile Asp Ile Ile Leu Lys 
                485                 490                 495     


Asp Leu Asp Leu 
            500 


<210>  13
<211>  450
<212>  PRT
<213>  Vibrio phage henriette 12B8

<400>  13

Met Ala Asp Phe Glu Leu Thr Leu Gly Gln Lys Thr Val Leu Gly Glu 
1               5                   10                  15      


Val Ile Ser Thr Ile Leu Lys Pro Val Asn Leu Asn Asp Thr Ser Arg 
            20                  25                  30          


Phe His Thr Met His Gly Pro Ala Gly Ser Gly Lys Thr Thr Val Leu 
        35                  40                  45              


Gln Arg Ile Ile Ser Gln Ile Pro Ala Tyr Lys Thr Ile Gly Phe Cys 
    50                  55                  60                  


Ser Pro Thr His Lys Ser Val Lys Val Ile Arg Arg Met Ala Arg Glu 
65                  70                  75                  80  


Ala Gly Ile Ser His Arg Val Asp Ile Arg Thr Ile His Ser Ala Leu 
                85                  90                  95      


Gly Leu Val Met Lys Pro Val Arg Gly Asp Glu Val Leu Val Lys Glu 
            100                 105                 110         


Pro Phe Ala Glu Glu Arg Ile Tyr Asp Val Leu Ile Ile Asp Glu Ala 
        115                 120                 125             


Gly Met Leu Asn Asp Glu Leu Ile Met Tyr Ile Leu Glu Ser Gln Ser 
    130                 135                 140                 


Ser Lys Val Ile Phe Val Gly Asp Met Cys Gln Ile Gly Pro Ile Gln 
145                 150                 155                 160 


Ser Asn Leu Pro Glu Glu Asp Gly Tyr Thr Pro Thr Ser Thr Asp Asp 
                165                 170                 175     


Val Ser Lys Val Phe Thr Glu Val Glu Met Met Ser Ala Leu Thr Glu 
            180                 185                 190         


Val Val Arg Gln Ala Glu Gly Ser Pro Ile Ile Gln Leu Ala Thr Glu 
        195                 200                 205             


Phe Arg Leu Ala Gln Asp Asp Ile Tyr Ala Asp Leu Pro Arg Ile Val 
    210                 215                 220                 


Thr Asn Thr Thr Pro Asp Gly Asn Gly Ile Ile Thr Met Pro Asn Gly 
225                 230                 235                 240 


Asn Trp Val Asp Ser Ala Val Ala Arg Phe Gln Ser Asp Gln Phe Lys 
                245                 250                 255     


Glu Asp Pro Asp His Cys Arg Ile Val Cys Tyr Thr Asn Ala Met Val 
            260                 265                 270         


Asp Leu Cys Asn Asp Leu Val Arg Lys Arg Leu Phe Gly Ala Asp Val 
        275                 280                 285             


Pro Glu Trp Leu Glu Asp Glu Ile Leu Val Ala Gln Glu Met Gly Ser 
    290                 295                 300                 


Thr Trp Asn Asn Ala Asp Glu Leu Arg Ile Val Ser Ile Asp Asp His 
305                 310                 315                 320 


Phe Asp Gln Gln Tyr Glu Val Pro Cys Trp Arg Met Gln Leu Glu Ser 
                325                 330                 335     


Val Glu Asp His Lys Leu His Asn Ala Leu Val Val Lys Gly Asp Tyr 
            340                 345                 350         


Ile Glu Asp Phe Lys Phe Arg Leu Asn Ala Ile Ala Glu Arg Ala Asn 
        355                 360                 365             


Thr Asp Lys Asn Met Ser Gly Met His Trp Lys Glu Phe Trp Gly Met 
    370                 375                 380                 


Arg Lys Lys Phe Asn Thr Phe Lys Asn Val Tyr Ala Gly Thr Ala His 
385                 390                 395                 400 


Lys Ser Gln Gly Ser Thr Phe Asp Tyr Thr Tyr Val Phe Thr Pro Asp 
                405                 410                 415     


Phe Tyr Lys Phe Gly Ala Thr Met Thr Ile Lys Arg Leu Leu Tyr Thr 
            420                 425                 430         


Ala Ile Thr Arg Ser Arg Tyr Thr Thr Tyr Phe Ala Met Asn Thr Gly 
        435                 440                 445             


Ala Gln 
    450 


<210>  14
<211>  421
<212>  PRT
<213>  Vibrio phage phi-pp2

<400>  14

Met Gly Leu Thr Asn Cys Gln Gln Gly Ala Met Asp Ala Phe Leu Glu 
1               5                   10                  15      


Ser Asp Gly His Met Thr Ile Ser Gly Pro Ala Gly Ser Gly Lys Thr 
            20                  25                  30          


Phe Leu Met Lys Ser Ile Leu Glu Ala Leu Glu Ser Lys Gly Lys Asn 
        35                  40                  45              


Val Thr Met Val Thr Pro Thr His Gln Ala Lys Asn Val Leu His Lys 
    50                  55                  60                  


Ala Thr Gly Gln Glu Val Ser Thr Ile His Ser Leu Leu Lys Ile His 
65                  70                  75                  80  


Pro Asp Thr Tyr Glu Asp Gln Lys His Phe Thr Gln Ser Gly Glu Val 
                85                  90                  95      


Glu Gly Leu Asp Glu Ile Asp Val Leu Val Val Glu Glu Ala Ser Met 
            100                 105                 110         


Val Asp Glu Glu Leu Phe Gln Ile Thr Gly Arg Thr Met Pro Arg Lys 
        115                 120                 125             


Cys Arg Ile Leu Ala Val Gly Asp Lys Tyr Gln Leu Gln Pro Val Lys 
    130                 135                 140                 


His Asp Pro Gly Val Ile Ser Pro Phe Phe Thr Lys Phe Thr Thr Phe 
145                 150                 155                 160 


Glu Met Asn Glu Val Val Arg Gln Ala Lys Asp Asn Pro Leu Ile Gln 
                165                 170                 175     


Val Ala Thr Glu Val Arg Asn Gly Gln Trp Leu Arg Thr Asn Trp Ser 
            180                 185                 190         


Lys Glu Arg Arg Gln Gly Val Leu His Val Pro Asn Val Asn Lys Met 
        195                 200                 205             


Leu Asp Thr Tyr Leu Ser Lys Val Asn Ser Pro Glu Asp Leu Leu Asp 
    210                 215                 220                 


Tyr Arg Ile Leu Ala Tyr Thr Asn Asp Cys Val Asp Thr Phe Asn Gly 
225                 230                 235                 240 


Ile Ile Arg Glu His Val Tyr Asn Thr Ser Glu Pro Phe Ile Pro Gly 
                245                 250                 255     


Glu Tyr Leu Val Thr Gln Met Pro Val Met Val Ser Asn Gly Lys Tyr 
            260                 265                 270         


Pro Val Cys Val Ile Glu Asn Gly Glu Val Val Lys Ile Leu Asp Val 
        275                 280                 285             


Arg Gln Lys Thr Ile Asp Gly Met Leu Pro Lys Val Asp Asn Glu Ala 
    290                 295                 300                 


Phe Asp Val Ala Val Leu Thr Val Glu Lys Glu Asp Gly Asn Val Tyr 
305                 310                 315                 320 


Glu Phe Thr Val Leu Trp Asp Asp Leu Gln Lys Glu Arg Phe Ala Arg 
                325                 330                 335     


Tyr Leu Ser Val Ala Ala Gly Thr Tyr Lys Ser Met Arg Gly Asn Thr 
            340                 345                 350         


Lys Arg Tyr Trp Arg Ala Phe Trp Gly Leu Lys Glu Gln Met Ile Glu 
        355                 360                 365             


Thr Lys Ser Leu Gly Ala Ser Thr Val His Lys Ser Gln Gly Thr Thr 
    370                 375                 380                 


Val Lys Gly Val Cys Leu Tyr Thr Gln Asp Met Gly Tyr Ala Glu Pro 
385                 390                 395                 400 


Glu Ile Leu Gln Gln Leu Val Tyr Val Gly Leu Thr Arg Pro Thr Asp 
                405                 410                 415     


Trp Ala Leu Tyr Asn 
            420     


<210>  15
<211>  434
<212>  PRT
<213>  Aeromonas phage 65

<400>  15

Met Ser Glu Ser Glu Ile Thr Leu Thr Pro Ser Gln Asn Met Ala Val 
1               5                   10                  15      


Asn Glu Val Lys Asn Gly Thr Gly His Ile Thr Ile Ser Gly Pro Pro 
            20                  25                  30          


Gly Ser Gly Lys Thr Phe Leu Val Lys Tyr Leu Ile Lys Met Leu Gly 
        35                  40                  45              


Asp Glu Leu Gly Thr Val Leu Ala Ala Pro Thr His Gln Ala Lys Ile 
    50                  55                  60                  


Val Leu Thr Glu Met Ser Gly Ile Glu Ala Cys Thr Ile His Ser Leu 
65                  70                  75                  80  


Met Lys Ile His Pro Glu Thr Leu Glu Asp Ile Gln Ile Phe Asp Gln 
                85                  90                  95      


Ser Lys Leu Pro Asp Leu Ser Asn Ile Arg Tyr Leu Ile Val Glu Glu 
            100                 105                 110         


Ala Ser Met His Ser Lys Thr Leu Phe Lys Ile Thr Met Lys Ser Ile 
        115                 120                 125             


Pro Pro Thr Cys Arg Ile Ile Ala Ile Gly Asp Lys Asp Gln Ile Gln 
    130                 135                 140                 


Pro Glu Glu His Ala Gln Gly Glu Leu Ser Pro Tyr Phe Thr Asp Pro 
145                 150                 155                 160 


Arg Phe Ser Gln Ile Arg Leu Thr Asp Ile Met Arg Gln Ser Leu Asp 
                165                 170                 175     


Asn Pro Ile Ile Gln Val Ala Thr Lys Ile Arg Glu Gly Gly Trp Ile 
            180                 185                 190         


Glu Pro Asn Trp Asn Arg Asp Thr Lys Thr Gly Val Tyr Lys Val Ser 
        195                 200                 205             


Gly Ile Thr Asp Leu Val Asn Ser Tyr Leu Arg Ala Val Lys Thr Pro 
    210                 215                 220                 


Glu Asp Leu Thr Lys Tyr Arg Phe Leu Ala Tyr Thr Asn Lys Val Val 
225                 230                 235                 240 


Asn Lys Val Asn Ser Ile Val Arg Glu His Val Tyr Lys Thr Lys Leu 
                245                 250                 255     


Pro Phe Ile Glu Gly Glu Lys Ile Val Leu Gln Glu Pro Val Met Val 
            260                 265                 270         


Glu His Glu Asp Asp Thr Ile Glu Thr Ile Phe Thr Asn Gly Glu Val 
        275                 280                 285             


Val Thr Ile Asn Glu Ile Glu Val Phe Asp Arg Thr Ile Arg Ile Asp 
    290                 295                 300                 


Gly Ser Pro Glu Phe Lys Val Asn Ala Ala Lys Leu Ser Val Ser Ser 
305                 310                 315                 320 


Asp Tyr Ser Gly Ile Glu His Asp Phe Cys Val Leu Tyr Gly Ser Glu 
                325                 330                 335     


Ser Arg Leu Glu Phe Glu Tyr Gln Leu Ser Glu Ser Ala Gly Asn Ile 
            340                 345                 350         


Lys Gln Met Gly Lys Gly Gly Asn Gln Arg Ser Ala Trp Lys Ser Phe 
        355                 360                 365             


Trp Ala Ala Lys Lys Met Phe Ile Glu Thr Lys Ser Leu Gly Ala Ser 
    370                 375                 380                 


Thr Ile His Lys Ser Gln Gly Ser Thr Val Lys Gly Val Trp Leu Ala 
385                 390                 395                 400 


Leu His Asp Ile His Tyr Ala Asp Glu Glu Leu Lys Gln Gln Leu Val 
                405                 410                 415     


Tyr Val Gly Val Thr Arg Pro Thr Asp Phe Cys Leu Tyr Phe Asp Gly 
            420                 425                 430         


Thr Lys 
        


<210>  16
<211>  420
<212>  PRT
<213>  Aeromonas phage CC2

<400>  16

Met Ala Val Asp Ala Val Gln Ser Gly Thr Gly His Ile Thr Ile Ser 
1               5                   10                  15      


Gly Pro Pro Gly Ser Gly Lys Thr Phe Leu Val Lys Tyr Ile Ile Lys 
            20                  25                  30          


Met Leu Gly Asp Glu Leu Gly Thr Val Leu Ala Ala Pro Thr His Gln 
        35                  40                  45              


Ala Lys Ile Val Leu Thr Glu Met Ser Gly Ile Glu Ala Cys Thr Ile 
    50                  55                  60                  


His Ser Leu Met Lys Ile His Pro Glu Thr Leu Glu Asp Ile Gln Ile 
65                  70                  75                  80  


Phe Asp Gln Ser Lys Met Pro Asp Leu Ser Thr Val Arg Tyr Leu Ile 
                85                  90                  95      


Ile Glu Glu Ala Ser Met His Ser Lys Ala Leu Phe Asn Ile Thr Met 
            100                 105                 110         


Lys Ser Ile Pro Pro Thr Cys Arg Ile Ile Ala Ile Gly Asp Lys Asp 
        115                 120                 125             


Gln Ile Gln Pro Val Asp His Ala Pro Gly Glu Leu Ser Pro Tyr Phe 
    130                 135                 140                 


Thr Asp Ser Arg Phe Thr Gln Ile Arg Met Thr Asp Ile Met Arg Gln 
145                 150                 155                 160 


Ser Leu Asp Asn Pro Ile Ile Gln Val Ala Thr Thr Ile Arg Glu Gly 
                165                 170                 175     


Gly Trp Ile Tyr Gln Asn Trp Asn Lys Glu Lys Lys Ser Gly Val Tyr 
            180                 185                 190         


Lys Val Lys Ser Ile Thr Asp Leu Ile Asn Ser Tyr Leu Arg Val Val 
        195                 200                 205             


Lys Thr Pro Glu Asp Leu Thr Lys Tyr Arg Phe Leu Ala Phe Thr Asn 
    210                 215                 220                 


Lys Val Val Asp Lys Val Asn Ser Ile Val Arg Lys His Val Tyr Lys 
225                 230                 235                 240 


Thr Asp Leu Pro Phe Ile Glu Gly Glu Lys Leu Val Leu Gln Glu Pro 
                245                 250                 255     


Val Met Val Glu Tyr Asp Asp Asp Thr Ile Glu Thr Ile Phe Thr Asn 
            260                 265                 270         


Gly Glu Val Val Thr Val Asp Glu Ile Glu Val Ser Asp Met Asn Ile 
        275                 280                 285             


Arg Ile Asp Gly Ser Pro Ala Phe Ser Ile Ser Val Ala Lys Leu Lys 
    290                 295                 300                 


Val Thr Ser Asp Phe Ser Gly Val Thr His Asp Ile Met Ser Val Tyr 
305                 310                 315                 320 


Gly Glu Asp Ser Lys Ala Glu Phe Asn Tyr Gln Leu Ser Glu Ala Ala 
                325                 330                 335     


Ala Val Ile Lys Gln Met Gln Arg Gly Gln Thr Lys Ala Ala Trp Ala 
            340                 345                 350         


Ser Phe Trp Asp Ala Lys Lys Thr Phe Thr Glu Thr Lys Ser Leu Gly 
        355                 360                 365             


Ala Cys Thr Ile His Lys Ser Gln Gly Ser Thr Val Lys Gly Val Trp 
    370                 375                 380                 


Leu Gly Leu His Asp Ile Ser Tyr Ala Asp Thr Asp Leu Gln Gln Gln 
385                 390                 395                 400 


Leu Val Tyr Val Gly Val Thr Arg Pro Thr Asp Phe Cys Leu Tyr Phe 
                405                 410                 415     


Asp Gly Ser Lys 
            420 


<210>  17
<211>  443
<212>  PRT
<213>  Cronobacter phage vB CsaM GAP161

<400>  17

Met Ser Glu Leu Thr Phe Asp Asp Leu Ser Asp Asp Gln Lys Ser Ala 
1               5                   10                  15      


His Asp Arg Val Ile His Asn Ile Gln Asn Ala Ile His Thr Thr Ile 
            20                  25                  30          


Thr Gly Gly Pro Gly Val Gly Lys Thr Thr Leu Val Lys Phe Val Phe 
        35                  40                  45              


Asn Thr Leu Lys Gly Leu Gly Ile Ser Gly Ile Trp Leu Thr Ala Pro 
    50                  55                  60                  


Thr His Gln Ala Lys Asn Val Leu Ala Ala Ala Thr Gly Met Asp Ala 
65                  70                  75                  80  


Thr Thr Ile His Ser Ala Leu Lys Ile Ser Pro Val Thr Asn Glu Glu 
                85                  90                  95      


Leu Arg Val Phe Glu Gln Gln Lys Gly Lys Lys Ala Pro Asp Leu Ser 
            100                 105                 110         


Thr Cys Arg Val Phe Val Val Glu Glu Val Ser Met Val Asp Met Asp 
        115                 120                 125             


Leu Phe Arg Ile Ile Arg Arg Ser Ile Pro Ser Asn Ala Val Ile Leu 
    130                 135                 140                 


Gly Leu Gly Asp Lys Asp Gln Ile Arg Pro Val Asn Ala Asp Gly Arg 
145                 150                 155                 160 


Val Glu Leu Ser Pro Phe Phe Asp Glu Glu Ile Phe Asp Val Ile Arg 
                165                 170                 175     


Met Asp Lys Ile Met Arg Gln Ala Glu Gly Asn Pro Ile Ile Gln Val 
            180                 185                 190         


Ser Arg Ala Val Arg Asp Gly Lys Met Leu Lys Pro Met Ser Val Gly 
        195                 200                 205             


Asp Leu Gly Val Phe Gln His Ala Asn Ala Val Asp Phe Leu Arg Gln 
    210                 215                 220                 


Tyr Phe Arg Arg Val Lys Thr Pro Asp Asp Leu Ile Glu Asn Arg Met 
225                 230                 235                 240 


Phe Ala Tyr Thr Asn Asp Asn Val Asp Lys Leu Asn Ala Thr Ile Arg 
                245                 250                 255     


Lys His Leu Tyr Lys Thr Thr Glu Pro Phe Ile Leu Asp Glu Val Ile 
            260                 265                 270         


Val Met Gln Glu Pro Leu Val Gln Glu Met Arg Leu Asn Gly Gln Ile 
        275                 280                 285             


Phe Thr Glu Ile Val Tyr Asn Asn Asn Glu Lys Ile Arg Val Leu Glu 
    290                 295                 300                 


Ile Ile Pro Arg Arg Glu Val Ile Lys Ala Glu Lys Cys Asp Glu Lys 
305                 310                 315                 320 


Ile Glu Ile Glu Phe Tyr Leu Leu Lys Thr Val Ser Leu Glu Glu Glu 
                325                 330                 335     


Thr Glu Ala Gln Ile Gln Val Val Val Asp Pro Val Met Lys Asp Arg 
            340                 345                 350         


Leu Gly Asn Tyr Leu Ala Tyr Val Ala Ser Thr Tyr Lys Arg Ile Lys 
        355                 360                 365             


Gln Gln Thr Gly Tyr Lys Ala Pro Trp His Ser Phe Trp Ala Ile Lys 
    370                 375                 380                 


Asn Lys Phe Gln Asp Val Lys Pro Leu Pro Val Cys Thr Tyr His Lys 
385                 390                 395                 400 


Ser Gln Gly Ser Thr Tyr Asp His Ala Tyr Met Tyr Thr Arg Asp Ala 
                405                 410                 415     


Tyr Ala Phe Ala Asp Tyr Asp Leu Cys Lys Gln Leu Ile Tyr Val Gly 
            420                 425                 430         


Val Thr Arg Ala Arg Tyr Thr Val Asp Tyr Val 
        435                 440             


<210>  18
<211>  442
<212>  PRT
<213>  Klebsiella phage KP15

<400>  18

Met Ser Glu Leu Thr Phe Asp Asp Leu Ser Glu Asp Gln Lys Asn Ala 
1               5                   10                  15      


His Asp Arg Val Ile Lys Asn Ile Arg Asn Lys Ile His Thr Thr Ile 
            20                  25                  30          


Thr Gly Gly Pro Gly Val Gly Lys Thr Thr Leu Val Lys Phe Val Phe 
        35                  40                  45              


Glu Thr Leu Lys Lys Leu Gly Ile Ser Gly Ile Trp Leu Thr Ala Pro 
    50                  55                  60                  


Thr His Gln Ala Lys Asn Val Leu Ser Glu Ala Val Gly Met Asp Ala 
65                  70                  75                  80  


Thr Thr Ile His Ser Ala Leu Lys Ile Ser Pro Val Thr Asn Glu Glu 
                85                  90                  95      


Leu Arg Val Phe Glu Gln Gln Lys Gly Lys Lys Ala Ala Asp Leu Ser 
            100                 105                 110         


Glu Cys Arg Val Phe Val Val Glu Glu Val Ser Met Val Asp Lys Glu 
        115                 120                 125             


Leu Phe Arg Ile Ile Lys Arg Thr Ile Pro Ser Cys Ala Val Ile Leu 
    130                 135                 140                 


Gly Leu Gly Asp Lys Asp Gln Ile Arg Pro Val Asn Thr Glu Gly Ile 
145                 150                 155                 160 


Thr Glu Leu Ser Pro Phe Phe Asp Glu Glu Ile Phe Asp Val Ile Arg 
                165                 170                 175     


Met Asp Lys Ile Met Arg Gln Ala Glu Gly Asn Pro Ile Ile Gln Val 
            180                 185                 190         


Ser Arg Ala Ile Arg Asp Gly Lys Pro Leu Met Pro Leu Met Asn Gly 
        195                 200                 205             


Glu Leu Gly Val Met Lys His Glu Asn Ala Ser Asp Phe Leu Arg Arg 
    210                 215                 220                 


Tyr Phe Ser Arg Val Lys Thr Pro Asp Asp Leu Asn Asn Asn Arg Met 
225                 230                 235                 240 


Phe Ala Tyr Thr Asn Ala Asn Val Asp Lys Leu Asn Ala Val Ile Arg 
                245                 250                 255     


Lys His Leu Tyr Lys Thr Asp Gln Pro Phe Ile Val Gly Glu Val Val 
            260                 265                 270         


Val Met Gln Glu Pro Leu Val Thr Glu Gly Arg Val Asn Gly Val Ser 
        275                 280                 285             


Phe Val Glu Val Ile Tyr Asn Asn Asn Glu Gln Ile Lys Ile Leu Glu 
    290                 295                 300                 


Ile Ile Pro Arg Ser Asp Thr Ile Lys Ala Asp Arg Cys Asp Pro Val 
305                 310                 315                 320 


Gln Ile Asp Tyr Phe Leu Met Lys Thr Glu Ser Met Phe Glu Asp Thr 
                325                 330                 335     


Lys Ala Asp Ile Gln Val Ile Ala Asp Pro Val Met Gln Glu Arg Leu 
            340                 345                 350         


Gly Asp Tyr Leu Asn Tyr Val Ala Phe Gln Tyr Lys Lys Met Lys Gln 
        355                 360                 365             


Glu Thr Gly Tyr Lys Ala Pro Trp Tyr Ser Phe Trp Gln Ile Lys Asn 
    370                 375                 380                 


Lys Phe Gln Thr Val Lys Ala Leu Pro Val Cys Thr Tyr His Lys Gly 
385                 390                 395                 400 


Gln Gly Ser Thr Tyr Asp His Ser Tyr Met Tyr Thr Arg Asp Ala Tyr 
                405                 410                 415     


Ala Tyr Ala Asp Tyr Glu Leu Cys Lys Gln Leu Leu Tyr Val Gly Thr 
            420                 425                 430         


Thr Arg Ala Arg Phe Thr Val Asp Tyr Val 
        435                 440         


<210>  19
<211>  438
<212>  PRT
<213>  Strnotrophomonas phage IME13

<400>  19

Met Val Thr Tyr Asp Asp Leu Thr Val Gly Gln Lys Asp Ala Ile Glu 
1               5                   10                  15      


Lys Ala Leu Gln Ala Met Arg Thr Lys Arg His Ile Thr Ile Arg Gly 
            20                  25                  30          


Pro Ala Gly Ser Gly Lys Thr Thr Met Thr Arg Phe Leu Leu Glu Arg 
        35                  40                  45              


Leu Phe Gln Thr Gly Gln Gln Gly Ile Val Leu Thr Ala Pro Thr His 
    50                  55                  60                  


Gln Ala Lys Lys Glu Leu Ser Lys His Ala Leu Arg Lys Ser Tyr Thr 
65                  70                  75                  80  


Ile Gln Ser Val Leu Lys Ile Asn Pro Ser Thr Leu Glu Glu Asn Gln 
                85                  90                  95      


Ile Phe Glu Gln Lys Gly Thr Pro Asp Phe Ser Lys Thr Arg Val Leu 
            100                 105                 110         


Ile Cys Asp Glu Val Ser Phe Tyr Thr Arg Lys Leu Phe Asp Ile Leu 
        115                 120                 125             


Met Arg Asn Val Pro Ser His Cys Val Val Ile Gly Ile Gly Asp Lys 
    130                 135                 140                 


Ala Gln Ile Arg Gly Val Ser Glu Asp Asp Thr His Glu Leu Ser Pro 
145                 150                 155                 160 


Phe Phe Thr Asp Asn Arg Phe Glu Gln Val Glu Leu Thr Glu Val Lys 
                165                 170                 175     


Arg His Gln Gly Pro Ile Ile Glu Val Ala Thr Asp Ile Arg Asn Gly 
            180                 185                 190         


Lys Trp Ile Tyr Glu Lys Leu Asp Asp Ser Gly Asn Gly Val Lys Gln 
        195                 200                 205             


Phe His Thr Val Lys Asp Phe Leu Ser Lys Tyr Phe Glu Arg Thr Lys 
    210                 215                 220                 


Thr Pro Asn Asp Leu Leu Glu Asn Arg Ile Met Ala Tyr Thr Asn Asn 
225                 230                 235                 240 


Ser Val Asp Lys Leu Asn Ser Val Ile Arg Lys Gln Leu Tyr Gly Ala 
                245                 250                 255     


Asn Ala Ala Pro Phe Leu Pro Asp Glu Ile Leu Val Met Gln Glu Pro 
            260                 265                 270         


Leu Met Phe Asp Ile Asp Ile Gly Gly Gln Thr Leu Lys Glu Val Ile 
        275                 280                 285             


Phe Asn Asn Gly Gln Asn Val Arg Val Ile Asn Val Lys Pro Ser Arg 
    290                 295                 300                 


Lys Thr Leu Lys Ala Lys Gly Val Gly Glu Ile Glu Val Glu Cys Thr 
305                 310                 315                 320 


Met Leu Glu Cys Glu Ser Tyr Glu Glu Asp Glu Asp Asp Tyr Arg Arg 
                325                 330                 335     


Ala Trp Phe Thr Val Val His Asp Gln Asn Thr Gln Tyr Ala Ile Asn 
            340                 345                 350         


Glu Phe Leu Ser Ile Ile Ala Glu Lys Tyr Arg Ser Arg Glu Val Phe 
        355                 360                 365             


Pro Asn Trp Lys Asp Phe Trp Ala Ile Arg Asn Thr Phe Val Lys Val 
    370                 375                 380                 


Arg Pro Leu Gly Ala Met Thr Phe His Lys Ser Gln Gly Ser Thr Phe 
385                 390                 395                 400 


Asp Asn Ala Tyr Leu Phe Thr Pro Cys Leu His Gln Tyr Cys Arg Asp 
                405                 410                 415     


Pro Asp Val Ala Gln Glu Leu Ile Tyr Val Gly Asn Thr Arg Ala Arg 
            420                 425                 430         


Lys Asn Val Cys Phe Val 
        435             


<210>  20
<211>  442
<212>  PRT
<213>  Acinetobacter phage Ac42

<400>  20

Met Asn Phe Glu Asp Leu Thr Glu Gly Gln Lys Asn Ala Tyr Thr Ala 
1               5                   10                  15      


Ala Ile Lys Ala Ile Glu Thr Val Pro Ser Ser Ser Ala Glu Lys Arg 
            20                  25                  30          


His Leu Thr Ile Asn Gly Pro Ala Gly Thr Gly Lys Thr Thr Leu Thr 
        35                  40                  45              


Lys Phe Leu Ile Ala Glu Leu Ile Arg Arg Gly Glu Arg Gly Val Tyr 
    50                  55                  60                  


Leu Ala Ala Pro Thr His Gln Ala Lys Lys Val Leu Ser Gln His Ala 
65                  70                  75                  80  


Gly Met Glu Ala Ser Thr Ile His Ser Leu Leu Lys Ile Asn Pro Thr 
                85                  90                  95      


Thr Tyr Glu Asp Ser Thr Thr Phe Glu Gln Lys Asp Val Pro Asp Met 
            100                 105                 110         


Ser Glu Cys Arg Val Leu Ile Cys Asp Glu Ala Ser Met Tyr Asp Leu 
        115                 120                 125             


Lys Leu Phe Gln Ile Leu Met Ser Ser Ile Pro Leu Cys Cys Thr Val 
    130                 135                 140                 


Ile Ala Leu Gly Asp Ile Ala Gln Ile Arg Pro Val Glu Pro Gly Ala 
145                 150                 155                 160 


Phe Glu Gly Gln Val Ser Pro Phe Phe Thr Tyr Glu Lys Phe Glu Gln 
                165                 170                 175     


Val Ser Leu Thr Glu Val Met Arg Ser Asn Ala Pro Ile Ile Asp Val 
            180                 185                 190         


Ala Thr Ser Ile Arg Thr Gly Asn Trp Ile Tyr Glu Asn Val Ile Asp 
        195                 200                 205             


Gly Ala Gly Val His Asn Leu Thr Ser Glu Arg Ser Val Lys Ser Phe 
    210                 215                 220                 


Met Glu Lys Tyr Phe Ser Ile Val Lys Thr Pro Glu Asp Leu Phe Glu 
225                 230                 235                 240 


Asn Arg Leu Leu Ala Phe Thr Asn Lys Ser Val Asp Asp Leu Asn Lys 
                245                 250                 255     


Ile Val Arg Lys Lys Ile Tyr Asn Thr Leu Glu Pro Phe Ile Asp Gly 
            260                 265                 270         


Glu Val Leu Val Met Gln Glu Pro Leu Ile Lys Ser Tyr Thr Tyr Glu 
        275                 280                 285             


Gly Lys Lys Val Ser Glu Ile Val Phe Asn Asn Gly Glu Met Val Lys 
    290                 295                 300                 


Val Leu Cys Cys Ser Gln Thr Ser Asp Glu Ile Ser Val Arg Gly Cys 
305                 310                 315                 320 


Ser Thr Lys Tyr Met Val Arg Tyr Trp Gln Leu Asp Leu Gln Ser Leu 
                325                 330                 335     


Asp Asp Pro Asp Leu Thr Gly Ser Ile Asn Val Ile Val Asp Glu Ala 
            340                 345                 350         


Glu Ile Asn Lys Leu Asn Leu Val Leu Gly Lys Ser Ala Glu Gln Phe 
        355                 360                 365             


Lys Ser Gly Ala Val Lys Ala Ala Trp Ala Asp Trp Trp Lys Leu Lys 
    370                 375                 380                 


Arg Asn Phe His Lys Val Lys Ala Leu Pro Cys Ser Thr Ile His Lys 
385                 390                 395                 400 


Ser Gln Gly Thr Ser Val Asp Asn Val Phe Leu Tyr Thr Pro Cys Ile 
                405                 410                 415     


His Lys Ala Asp Ser Gln Leu Ala Gln Gln Leu Leu Tyr Val Gly Ala 
            420                 425                 430         


Thr Arg Ala Arg His Asn Val Tyr Tyr Ile 
        435                 440         


<210>  21
<211>  442
<212>  PRT
<213>  Shigella phage SP18

<400>  21

Met Ile Lys Phe Glu Asp Leu Asn Thr Gly Gln Lys Glu Ala Phe Asp 
1               5                   10                  15      


Tyr Ile Thr Glu Ala Ile Gln Arg Arg Ser Gly Glu Cys Ile Thr Leu 
            20                  25                  30          


Asn Gly Pro Ala Gly Thr Gly Lys Thr Thr Leu Thr Lys Phe Val Ile 
        35                  40                  45              


Asp His Leu Val Arg Asn Gly Val Met Gly Ile Val Leu Ala Ala Pro 
    50                  55                  60                  


Thr His Gln Ala Lys Lys Val Leu Ser Lys Leu Ser Gly Gln Thr Ala 
65                  70                  75                  80  


Asn Thr Ile His Ser Ile Leu Lys Ile Asn Pro Thr Thr Tyr Glu Asp 
                85                  90                  95      


Gln Asn Ile Phe Glu Gln Arg Glu Met Pro Asp Met Ser Lys Cys Asn 
            100                 105                 110         


Val Leu Val Cys Asp Glu Ala Ser Met Tyr Asp Gly Ser Leu Phe Lys 
        115                 120                 125             


Ile Ile Cys Asn Ser Val Pro Glu Trp Cys Thr Ile Leu Gly Ile Gly 
    130                 135                 140                 


Asp Met His Gln Leu Gln Pro Val Asp Pro Gly Ser Thr Gln Gln Lys 
145                 150                 155                 160 


Ile Ser Pro Phe Phe Thr His Pro Lys Phe Lys Gln Ile His Leu Thr 
                165                 170                 175     


Glu Val Met Arg Ser Asn Ala Pro Ile Ile Glu Val Ala Thr Glu Ile 
            180                 185                 190         


Arg Asn Gly Gly Trp Phe Arg Asp Cys Met Tyr Asp Gly His Gly Val 
        195                 200                 205             


Gln Gly Phe Thr Ser Gln Thr Ala Leu Lys Asp Phe Met Val Asn Tyr 
    210                 215                 220                 


Phe Gly Ile Val Lys Asp Ala Asp Met Leu Met Glu Asn Arg Met Tyr 
225                 230                 235                 240 


Ala Tyr Thr Asn Lys Ser Val Glu Lys Leu Asn Asn Ile Ile Arg Arg 
                245                 250                 255     


Lys Leu Tyr Glu Thr Asp Lys Ala Phe Leu Pro Tyr Glu Val Leu Val 
            260                 265                 270         


Met Gln Glu Pro His Met Lys Glu Leu Glu Phe Glu Gly Lys Lys Phe 
        275                 280                 285             


Ser Glu Thr Ile Phe Asn Asn Gly Gln Leu Val Arg Ile Lys Asp Cys 
    290                 295                 300                 


Lys Tyr Thr Ser Thr Ile Leu Arg Cys Lys Gly Glu Ser His Gln Leu 
305                 310                 315                 320 


Val Ile Asn Tyr Trp Asp Leu Glu Val Glu Ser Ile Asp Glu Asp Glu 
                325                 330                 335     


Glu Tyr Gln Val Asp Arg Ile Lys Val Leu Pro Glu Asp Gln Gln Pro 
            340                 345                 350         


Lys Phe Gln Ala Tyr Leu Ala Lys Val Ala Asp Thr Tyr Lys Gln Met 
        355                 360                 365             


Lys Ala Ala Gly Lys Arg Pro Glu Trp Lys Asp Phe Trp Lys Ala Arg 
    370                 375                 380                 


Arg Thr Phe Leu Lys Val Arg Ala Leu Pro Val Ser Thr Ile His Lys 
385                 390                 395                 400 


Ala Gln Gly Val Ser Val Asp Lys Ala Phe Ile Tyr Thr Pro Cys Ile 
                405                 410                 415     


His Met Ala Glu Ala Ser Leu Ala Ser Gln Leu Ala Tyr Val Gly Ile 
            420                 425                 430         


Thr Arg Ala Arg Tyr Asp Ala Tyr Tyr Val 
        435                 440         


<210>  22
<211>  439
<212>  PRT
<213>  Yersinia phage phiR1-RT

<400>  22

Met Ile Thr Tyr Asp Asp Leu Thr Asp Gly Gln Lys Ser Ala Phe Asp 
1               5                   10                  15      


Asn Thr Met Glu Ala Ile Lys Asn Lys Lys Gly His Ile Thr Ile Asn 
            20                  25                  30          


Gly Pro Ala Gly Thr Gly Lys Thr Thr Leu Thr Lys Phe Ile Ile Asp 
        35                  40                  45              


His Leu Ile Lys Thr Gly Glu Ala Gly Ile Ile Leu Cys Ala Pro Thr 
    50                  55                  60                  


His Gln Ala Lys Lys Val Leu Ser Lys Leu Ser Gly Met Asp Ala Ser 
65                  70                  75                  80  


Thr Ile His Ser Val Leu Lys Ile Asn Pro Thr Thr Tyr Glu Glu Asn 
                85                  90                  95      


Gln Ile Phe Glu Gln Arg Glu Val Pro Asp Leu Ala Ala Cys Arg Val 
            100                 105                 110         


Leu Ile Cys Asp Glu Ala Ser Phe Tyr Asp Arg Lys Leu Phe Gly Ile 
        115                 120                 125             


Ile Leu Ala Thr Val Pro Ser Trp Cys Thr Val Ile Ala Leu Gly Asp 
    130                 135                 140                 


Lys Asp Gln Leu Arg Pro Val Thr Pro Gly Glu Ser Glu Gln Gln Leu 
145                 150                 155                 160 


Ser Pro Phe Phe Ser His Ala Lys Phe Lys Gln Val His Leu Thr Glu 
                165                 170                 175     


Ile Lys Arg Ser Asn Gly Pro Ile Ile Gln Val Ala Thr Asp Ile Arg 
            180                 185                 190         


Asn Gly Gly Trp Leu Ser Glu Asn Ile Val Asp Gly Glu Gly Val His 
        195                 200                 205             


Ala Phe Asn Ser Asn Thr Ala Leu Lys Asp Phe Met Ile Arg Tyr Phe 
    210                 215                 220                 


Asp Val Val Lys Thr Ala Asp Asp Leu Ile Glu Ser Arg Met Leu Ala 
225                 230                 235                 240 


Tyr Thr Asn Lys Ser Val Asp Lys Leu Asn Gly Ile Ile Arg Arg Lys 
                245                 250                 255     


Leu Tyr Glu Thr Asp Lys Pro Phe Ile Asn Gly Glu Val Leu Val Met 
            260                 265                 270         


Gln Glu Pro Leu Met Lys Glu Leu Glu Phe Asp Gly Lys Lys Phe His 
        275                 280                 285             


Glu Ile Val Phe Asn Asn Gly Gln Leu Val Lys Ile Leu Tyr Ala Ser 
    290                 295                 300                 


Glu Thr Ser Thr Phe Ile Ser Ala Arg Asn Val Pro Gly Glu Tyr Met 
305                 310                 315                 320 


Ile Arg Tyr Trp Asn Leu Glu Val Glu Thr Ala Asp Ser Asp Asp Asp 
                325                 330                 335     


Tyr Ala Thr Ser Gln Ile Gln Val Ile Cys Asp Pro Ala Glu Met Thr 
            340                 345                 350         


Lys Phe Gln Met Phe Leu Ala Lys Thr Ala Asp Thr Tyr Lys Asn Ser 
        355                 360                 365             


Gly Val Lys Ala Tyr Trp Lys Asp Phe Trp Ser Val Lys Asn Lys Phe 
    370                 375                 380                 


Lys Lys Val Lys Ala Leu Pro Val Ser Thr Ile His Lys Ser Gln Gly 
385                 390                 395                 400 


Cys Thr Val Asn Asn Thr Phe Leu Tyr Thr Pro Cys Ile His Met Ala 
                405                 410                 415     


Asp Ala Gln Leu Ala Lys Gln Leu Leu Tyr Val Gly Ala Thr Arg Ala 
            420                 425                 430         


Arg Thr Asn Leu Tyr Tyr Ile 
        435                 


<210>  23
<211>  441
<212>  PRT
<213>  Salmonella phage S16

<400>  23

Met Ile Thr Phe Glu Gln Leu Thr Ser Gly Gln Lys Leu Ala Phe Asp 
1               5                   10                  15      


Glu Thr Ile Arg Ala Ile Lys Glu Lys Lys Asn His Val Thr Ile Asn 
            20                  25                  30          


Gly Pro Ala Gly Thr Gly Lys Thr Thr Leu Thr Lys Phe Ile Met Glu 
        35                  40                  45              


His Leu Val Ser Thr Gly Glu Thr Gly Ile Ile Leu Thr Ala Pro Thr 
    50                  55                  60                  


His Ala Ala Lys Lys Val Leu Thr Lys Leu Ser Gly Met Glu Ala Asn 
65                  70                  75                  80  


Thr Ile His Lys Ile Leu Lys Ile Asn Pro Thr Thr Tyr Glu Glu Ser 
                85                  90                  95      


Met Leu Phe Glu Gln Lys Glu Val Pro Asp Leu Ala Ser Cys Arg Val 
            100                 105                 110         


Leu Ile Cys Asp Glu Ala Ser Met Trp Asp Arg Lys Leu Phe Lys Ile 
        115                 120                 125             


Leu Met Ala Ser Ile Pro Lys Trp Cys Thr Ile Val Ala Ile Gly Asp 
    130                 135                 140                 


Val Ala Gln Ile Arg Pro Val Asp Pro Gly Glu Thr Glu Ala His Ile 
145                 150                 155                 160 


Ser Pro Phe Phe Ile His Lys Asp Phe Lys Gln Leu Asn Leu Thr Glu 
                165                 170                 175     


Val Met Arg Ser Asn Ala Pro Ile Ile Asp Val Ala Thr Asp Ile Arg 
            180                 185                 190         


Asn Gly Ser Trp Ile Tyr Glu Lys Thr Val Asp Gly His Gly Val His 
        195                 200                 205             


Gly Phe Thr Ser Thr Thr Ala Leu Lys Asp Phe Met Met Gln Tyr Phe 
    210                 215                 220                 


Ser Ile Val Lys Ser Pro Glu Asp Leu Phe Glu Asn Arg Met Leu Ala 
225                 230                 235                 240 


Phe Thr Asn Lys Ser Val Asp Lys Leu Asn Ser Ile Ile Arg Arg Arg 
                245                 250                 255     


Leu Tyr Gln Thr Glu Glu Ala Phe Val Val Gly Glu Val Ile Val Met 
            260                 265                 270         


Gln Glu Pro Leu Met Arg Glu Leu Val Phe Glu Gly Lys Lys Phe His 
        275                 280                 285             


Glu Thr Leu Phe Thr Asn Gly Gln Tyr Val Arg Ile Leu Ser Ala Asp 
    290                 295                 300                 


Tyr Thr Ser Ser Phe Leu Gly Ala Lys Gly Val Ser Gly Glu His Leu 
305                 310                 315                 320 


Ile Arg His Trp Val Leu Asp Val Glu Thr Tyr Asp Asp Glu Glu Tyr 
                325                 330                 335     


Ala Arg Glu Lys Ile Asn Val Ile Ser Asp Glu Gln Glu Met Asn Lys 
            340                 345                 350         


Phe Gln Phe Phe Leu Ala Lys Thr Ala Asp Thr Tyr Lys Asn Trp Asn 
        355                 360                 365             


Lys Gly Gly Lys Ala Pro Trp Ser Glu Phe Trp Asp Ala Lys Arg Lys 
    370                 375                 380                 


Phe His Lys Val Lys Ala Leu Pro Cys Ser Thr Phe His Lys Ala Gln 
385                 390                 395                 400 


Gly Ile Ser Val Asp Ser Ser Phe Ile Tyr Thr Pro Cys Ile His Val 
                405                 410                 415     


Ser Ser Asp Asn Lys Phe Lys Leu Glu Leu Leu Tyr Val Gly Ala Thr 
            420                 425                 430         


Arg Gly Arg His Asp Val Phe Phe Val 
        435                 440     


<210>  24
<211>  65
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  preferred HhH domain

<400>  24

Gly Thr Gly Ser Gly Ala Trp Lys Glu Trp Leu Glu Arg Lys Val Gly 
1               5                   10                  15      


Glu Gly Arg Ala Arg Arg Leu Ile Glu Tyr Phe Gly Ser Ala Gly Glu 
            20                  25                  30          


Val Gly Lys Leu Val Glu Asn Ala Glu Val Ser Lys Leu Leu Glu Val 
        35                  40                  45              


Pro Gly Ile Gly Asp Glu Ala Val Ala Arg Leu Val Pro Gly Gly Ser 
    50                  55                  60                  


Ser 
65  


<210>  25
<211>  299
<212>  PRT
<213>  Bacteriophage RB69

<400>  25

Met Phe Lys Arg Lys Ser Thr Ala Asp Leu Ala Ala Gln Met Ala Lys 
1               5                   10                  15      


Leu Asn Gly Asn Lys Gly Phe Ser Ser Glu Asp Lys Gly Glu Trp Lys 
            20                  25                  30          


Leu Lys Leu Asp Ala Ser Gly Asn Gly Gln Ala Val Ile Arg Phe Leu 
        35                  40                  45              


Pro Ala Lys Thr Asp Asp Ala Leu Pro Phe Ala Ile Leu Val Asn His 
    50                  55                  60                  


Gly Phe Lys Lys Asn Gly Lys Trp Tyr Ile Glu Thr Cys Ser Ser Thr 
65                  70                  75                  80  


His Gly Asp Tyr Asp Ser Cys Pro Val Cys Gln Tyr Ile Ser Lys Asn 
                85                  90                  95      


Asp Leu Tyr Asn Thr Asn Lys Thr Glu Tyr Ser Gln Leu Lys Arg Lys 
            100                 105                 110         


Thr Ser Tyr Trp Ala Asn Ile Leu Val Val Lys Asp Pro Gln Ala Pro 
        115                 120                 125             


Asp Asn Glu Gly Lys Val Phe Lys Tyr Arg Phe Gly Lys Lys Ile Trp 
    130                 135                 140                 


Asp Lys Ile Asn Ala Met Ile Ala Val Asp Thr Glu Met Gly Glu Thr 
145                 150                 155                 160 


Pro Val Asp Val Thr Cys Pro Trp Glu Gly Ala Asn Phe Val Leu Lys 
                165                 170                 175     


Val Lys Gln Val Ser Gly Phe Ser Asn Tyr Asp Glu Ser Lys Phe Leu 
            180                 185                 190         


Asn Gln Ser Ala Ile Pro Asn Ile Asp Asp Glu Ser Phe Gln Lys Glu 
        195                 200                 205             


Leu Phe Glu Gln Met Val Asp Leu Ser Glu Met Thr Ser Lys Asp Lys 
    210                 215                 220                 


Phe Lys Ser Phe Glu Glu Leu Asn Thr Lys Phe Asn Gln Val Leu Gly 
225                 230                 235                 240 


Thr Ala Ala Leu Gly Gly Ala Ala Ala Ala Ala Ala Ser Val Ala Asp 
                245                 250                 255     


Lys Val Ala Ser Asp Leu Asp Asp Phe Asp Lys Asp Met Glu Ala Phe 
            260                 265                 270         


Ser Ser Ala Lys Thr Glu Asp Asp Phe Met Ser Ser Ser Ser Ser Asp 
        275                 280                 285             


Asp Gly Asp Leu Asp Asp Leu Leu Ala Gly Leu 
    290                 295                 


<210>  26
<211>  232
<212>  PRT
<213>  Bacteriophage T7

<400>  26

Met Ala Lys Lys Ile Phe Thr Ser Ala Leu Gly Thr Ala Glu Pro Tyr 
1               5                   10                  15      


Ala Tyr Ile Ala Lys Pro Asp Tyr Gly Asn Glu Glu Arg Gly Phe Gly 
            20                  25                  30          


Asn Pro Arg Gly Val Tyr Lys Val Asp Leu Thr Ile Pro Asn Lys Asp 
        35                  40                  45              


Pro Arg Cys Gln Arg Met Val Asp Glu Ile Val Lys Cys His Glu Glu 
    50                  55                  60                  


Ala Tyr Ala Ala Ala Val Glu Glu Tyr Glu Ala Asn Pro Pro Ala Val 
65                  70                  75                  80  


Ala Arg Gly Lys Lys Pro Leu Lys Pro Tyr Glu Gly Asp Met Pro Phe 
                85                  90                  95      


Phe Asp Asn Gly Asp Gly Thr Thr Thr Phe Lys Phe Lys Cys Tyr Ala 
            100                 105                 110         


Ser Phe Gln Asp Lys Lys Thr Lys Glu Thr Lys His Ile Asn Leu Val 
        115                 120                 125             


Val Val Asp Ser Lys Gly Lys Lys Met Glu Asp Val Pro Ile Ile Gly 
    130                 135                 140                 


Gly Gly Ser Lys Leu Lys Val Lys Tyr Ser Leu Val Pro Tyr Lys Trp 
145                 150                 155                 160 


Asn Thr Ala Val Gly Ala Ser Val Lys Leu Gln Leu Glu Ser Val Met 
                165                 170                 175     


Leu Val Glu Leu Ala Thr Phe Gly Gly Gly Glu Asp Asp Trp Ala Asp 
            180                 185                 190         


Glu Val Glu Glu Asn Gly Tyr Val Ala Ser Gly Ser Ala Lys Ala Ser 
        195                 200                 205             


Lys Pro Arg Asp Glu Glu Ser Trp Asp Glu Asp Asp Glu Glu Ser Glu 
    210                 215                 220                 


Glu Ala Asp Glu Asp Gly Asp Phe 
225                 230         


<210>  27
<211>  324
<212>  PRT
<213>  Herpes virus 1

<400>  27

Met Asp Ser Pro Gly Gly Val Ala Pro Ala Ser Pro Val Glu Asp Ala 
1               5                   10                  15      


Ser Asp Ala Ser Leu Gly Gln Pro Glu Glu Gly Ala Pro Cys Gln Val 
            20                  25                  30          


Val Leu Gln Gly Ala Glu Leu Asn Gly Ile Leu Gln Ala Phe Ala Pro 
        35                  40                  45              


Leu Arg Thr Ser Leu Leu Asp Ser Leu Leu Val Met Gly Asp Arg Gly 
    50                  55                  60                  


Ile Leu Ile His Asn Thr Ile Phe Gly Glu Gln Val Phe Leu Pro Leu 
65                  70                  75                  80  


Glu His Ser Gln Phe Ser Arg Tyr Arg Trp Arg Gly Pro Thr Ala Ala 
                85                  90                  95      


Phe Leu Ser Leu Val Asp Gln Lys Arg Ser Leu Leu Ser Val Phe Arg 
            100                 105                 110         


Ala Asn Gln Tyr Pro Asp Leu Arg Arg Val Glu Leu Ala Ile Thr Gly 
        115                 120                 125             


Gln Ala Pro Phe Arg Thr Leu Val Gln Arg Ile Trp Thr Thr Thr Ser 
    130                 135                 140                 


Asp Gly Glu Ala Val Glu Leu Ala Ser Glu Thr Leu Met Lys Arg Glu 
145                 150                 155                 160 


Leu Thr Ser Phe Val Val Leu Val Pro Gln Gly Thr Pro Asp Val Gln 
                165                 170                 175     


Leu Arg Leu Thr Arg Pro Gln Leu Thr Lys Val Leu Asn Ala Thr Gly 
            180                 185                 190         


Ala Asp Ser Ala Thr Pro Thr Thr Phe Glu Leu Gly Val Asn Gly Lys 
        195                 200                 205             


Phe Ser Val Phe Thr Thr Ser Thr Cys Val Thr Phe Ala Ala Arg Glu 
    210                 215                 220                 


Glu Gly Val Ser Ser Ser Thr Ser Thr Gln Val Gln Ile Leu Ser Asn 
225                 230                 235                 240 


Ala Leu Thr Lys Ala Gly Gln Ala Ala Ala Asn Ala Lys Thr Val Tyr 
                245                 250                 255     


Gly Glu Asn Thr His Arg Thr Phe Ser Val Val Val Asp Asp Cys Ser 
            260                 265                 270         


Met Arg Ala Val Leu Arg Arg Leu Gln Val Gly Gly Gly Thr Leu Lys 
        275                 280                 285             


Phe Phe Leu Thr Thr Pro Val Pro Ser Leu Cys Val Thr Ala Thr Gly 
    290                 295                 300                 


Pro Asn Ala Val Ser Ala Val Phe Leu Leu Lys Pro Gln Lys His His 
305                 310                 315                 320 


His His His His 
                


<210>  28
<211>  251
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  subunit 1 of PCNA

<400>  28

Met Phe Lys Ile Val Tyr Pro Asn Ala Lys Asp Phe Phe Ser Phe Ile 
1               5                   10                  15      


Asn Ser Ile Thr Asn Val Thr Asp Ser Ile Ile Leu Asn Phe Thr Glu 
            20                  25                  30          


Asp Gly Ile Phe Ser Arg His Leu Thr Glu Asp Lys Val Leu Met Ala 
        35                  40                  45              


Ile Met Arg Ile Pro Lys Asp Val Leu Ser Glu Tyr Ser Ile Asp Ser 
    50                  55                  60                  


Pro Thr Ser Val Lys Leu Asp Val Ser Ser Val Lys Lys Ile Leu Ser 
65                  70                  75                  80  


Lys Ala Ser Ser Lys Lys Ala Thr Ile Glu Leu Thr Glu Thr Asp Ser 
                85                  90                  95      


Gly Leu Lys Ile Ile Ile Arg Asp Glu Lys Ser Gly Ala Lys Ser Thr 
            100                 105                 110         


Ile Tyr Ile Lys Ala Glu Lys Gly Gln Val Glu Gln Leu Thr Glu Pro 
        115                 120                 125             


Lys Val Asn Leu Ala Val Asn Phe Thr Thr Asp Glu Ser Val Leu Asn 
    130                 135                 140                 


Val Ile Ala Ala Asp Val Thr Leu Val Gly Glu Glu Met Arg Ile Ser 
145                 150                 155                 160 


Thr Glu Glu Asp Lys Ile Lys Ile Glu Ala Gly Glu Glu Gly Lys Arg 
                165                 170                 175     


Tyr Val Ala Phe Leu Met Lys Asp Lys Pro Leu Lys Glu Leu Ser Ile 
            180                 185                 190         


Asp Thr Ser Ala Ser Ser Ser Tyr Ser Ala Glu Met Phe Lys Asp Ala 
        195                 200                 205             


Val Lys Gly Leu Arg Gly Phe Ser Ala Pro Thr Met Val Ser Phe Gly 
    210                 215                 220                 


Glu Asn Leu Pro Met Lys Ile Asp Val Glu Ala Val Ser Gly Gly His 
225                 230                 235                 240 


Met Ile Phe Trp Ile Ala Pro Arg Leu Leu Glu 
                245                 250     


<210>  29
<211>  245
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  subunit 2 of PCNA

<400>  29

Met Lys Ala Lys Val Ile Asp Ala Val Ser Phe Ser Tyr Ile Leu Arg 
1               5                   10                  15      


Thr Val Gly Asp Phe Leu Ser Glu Ala Asn Phe Ile Val Thr Lys Glu 
            20                  25                  30          


Gly Ile Arg Val Ser Gly Ile Asp Pro Ser Arg Val Val Phe Leu Asp 
        35                  40                  45              


Ile Phe Leu Pro Ser Ser Tyr Phe Glu Gly Phe Glu Val Ser Gln Glu 
    50                  55                  60                  


Lys Glu Ile Ile Gly Phe Lys Leu Glu Asp Val Asn Asp Ile Leu Lys 
65                  70                  75                  80  


Arg Val Leu Lys Asp Asp Thr Leu Ile Leu Ser Ser Asn Glu Ser Lys 
                85                  90                  95      


Leu Thr Leu Thr Phe Asp Gly Glu Phe Thr Arg Ser Phe Glu Leu Pro 
            100                 105                 110         


Leu Ile Gln Val Glu Ser Thr Gln Pro Pro Ser Val Asn Leu Glu Phe 
        115                 120                 125             


Pro Phe Lys Ala Gln Leu Leu Thr Ile Thr Phe Ala Asp Ile Ile Asp 
    130                 135                 140                 


Glu Leu Ser Asp Leu Gly Glu Val Leu Asn Ile His Ser Lys Glu Asn 
145                 150                 155                 160 


Lys Leu Tyr Phe Glu Val Ile Gly Asp Leu Ser Thr Ala Lys Val Glu 
                165                 170                 175     


Leu Ser Thr Asp Asn Gly Thr Leu Leu Glu Ala Ser Gly Ala Asp Val 
            180                 185                 190         


Ser Ser Ser Tyr Gly Met Glu Tyr Val Ala Asn Thr Thr Lys Met Arg 
        195                 200                 205             


Arg Ala Ser Asp Ser Met Glu Leu Tyr Phe Gly Ser Gln Ile Pro Leu 
    210                 215                 220                 


Lys Leu Arg Phe Lys Leu Pro Gln Glu Gly Tyr Gly Asp Phe Tyr Ile 
225                 230                 235                 240 


Ala Pro Arg Ala Asp 
                245 


<210>  30
<211>  246
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  sununit 3 of PCNA

<400>  30

Met Lys Val Val Tyr Asp Asp Val Arg Val Leu Lys Asp Ile Ile Gln 
1               5                   10                  15      


Ala Leu Ala Arg Leu Val Asp Glu Ala Val Leu Lys Phe Lys Gln Asp 
            20                  25                  30          


Ser Val Glu Leu Val Ala Leu Asp Arg Ala His Ile Ser Leu Ile Ser 
        35                  40                  45              


Val Asn Leu Pro Arg Glu Met Phe Lys Glu Tyr Asp Val Asn Asp Glu 
    50                  55                  60                  


Phe Lys Phe Gly Phe Asn Thr Gln Tyr Leu Met Lys Ile Leu Lys Val 
65                  70                  75                  80  


Ala Lys Arg Lys Glu Ala Ile Glu Ile Ala Ser Glu Ser Pro Asp Ser 
                85                  90                  95      


Val Ile Ile Asn Ile Ile Gly Ser Thr Asn Arg Glu Phe Asn Val Arg 
            100                 105                 110         


Asn Leu Glu Val Ser Glu Gln Glu Ile Pro Glu Ile Asn Leu Gln Phe 
        115                 120                 125             


Asp Ile Ser Ala Thr Ile Ser Ser Asp Gly Phe Lys Ser Ala Ile Ser 
    130                 135                 140                 


Glu Val Ser Thr Val Thr Asp Asn Val Val Val Glu Gly His Glu Asp 
145                 150                 155                 160 


Arg Ile Leu Ile Lys Ala Glu Gly Glu Ser Glu Val Glu Val Glu Phe 
                165                 170                 175     


Ser Lys Asp Thr Gly Gly Leu Gln Asp Leu Glu Phe Ser Lys Glu Ser 
            180                 185                 190         


Lys Asn Ser Tyr Ser Ala Glu Tyr Leu Asp Asp Val Leu Ser Leu Thr 
        195                 200                 205             


Lys Leu Ser Asp Tyr Val Lys Ile Ser Phe Gly Asn Gln Lys Pro Leu 
    210                 215                 220                 


Gln Leu Phe Phe Asn Met Glu Gly Gly Gly Lys Val Thr Tyr Leu Leu 
225                 230                 235                 240 


Ala Pro Lys Val Leu Glu 
                245     


<210>  31
<211>  608
<212>  PRT
<213>  Bacillus subtilis phage phi29

<400>  31

Met Lys His Met Pro Arg Lys Met Tyr Ser Cys Ala Phe Glu Thr Thr 
1               5                   10                  15      


Thr Lys Val Glu Asp Cys Arg Val Trp Ala Tyr Gly Tyr Met Asn Ile 
            20                  25                  30          


Glu Asp His Ser Glu Tyr Lys Ile Gly Asn Ser Leu Asp Glu Phe Met 
        35                  40                  45              


Ala Trp Val Leu Lys Val Gln Ala Asp Leu Tyr Phe His Asn Leu Lys 
    50                  55                  60                  


Phe Asp Gly Ala Phe Ile Ile Asn Trp Leu Glu Arg Asn Gly Phe Lys 
65                  70                  75                  80  


Trp Ser Ala Asp Gly Leu Pro Asn Thr Tyr Asn Thr Ile Ile Ser Arg 
                85                  90                  95      


Met Gly Gln Trp Tyr Met Ile Asp Ile Cys Leu Gly Tyr Lys Gly Lys 
            100                 105                 110         


Arg Lys Ile His Thr Val Ile Tyr Asp Ser Leu Lys Lys Leu Pro Phe 
        115                 120                 125             


Pro Val Lys Lys Ile Ala Lys Asp Phe Lys Leu Thr Val Leu Lys Gly 
    130                 135                 140                 


Asp Ile Asp Tyr His Lys Glu Arg Pro Val Gly Tyr Lys Ile Thr Pro 
145                 150                 155                 160 


Glu Glu Tyr Ala Tyr Ile Lys Asn Asp Ile Gln Ile Ile Ala Glu Ala 
                165                 170                 175     


Leu Leu Ile Gln Phe Lys Gln Gly Leu Asp Arg Met Thr Ala Gly Ser 
            180                 185                 190         


Asp Ser Leu Lys Gly Phe Lys Asp Ile Ile Thr Thr Lys Lys Phe Lys 
        195                 200                 205             


Lys Val Phe Pro Thr Leu Ser Leu Gly Leu Asp Lys Glu Val Arg Tyr 
    210                 215                 220                 


Ala Tyr Arg Gly Gly Phe Thr Trp Leu Asn Asp Arg Phe Lys Glu Lys 
225                 230                 235                 240 


Glu Ile Gly Glu Gly Met Val Phe Asp Val Asn Ser Leu Tyr Pro Ala 
                245                 250                 255     


Gln Met Tyr Ser Arg Leu Leu Pro Tyr Gly Glu Pro Ile Val Phe Glu 
            260                 265                 270         


Gly Lys Tyr Val Trp Asp Glu Asp Tyr Pro Leu His Ile Gln His Ile 
        275                 280                 285             


Arg Cys Glu Phe Glu Leu Lys Glu Gly Tyr Ile Pro Thr Ile Gln Ile 
    290                 295                 300                 


Lys Arg Ser Arg Phe Tyr Lys Gly Asn Glu Tyr Leu Lys Ser Ser Gly 
305                 310                 315                 320 


Gly Glu Ile Ala Asp Leu Trp Leu Ser Asn Val Asp Leu Glu Leu Met 
                325                 330                 335     


Lys Glu His Tyr Asp Leu Tyr Asn Val Glu Tyr Ile Ser Gly Leu Lys 
            340                 345                 350         


Phe Lys Ala Thr Thr Gly Leu Phe Lys Asp Phe Ile Asp Lys Trp Thr 
        355                 360                 365             


Tyr Ile Lys Thr Thr Ser Glu Gly Ala Ile Lys Gln Leu Ala Lys Leu 
    370                 375                 380                 


Met Leu Asn Ser Leu Tyr Gly Lys Phe Ala Ser Asn Pro Asp Val Thr 
385                 390                 395                 400 


Gly Lys Val Pro Tyr Leu Lys Glu Asn Gly Ala Leu Gly Phe Arg Leu 
                405                 410                 415     


Gly Glu Glu Glu Thr Lys Asp Pro Val Tyr Thr Pro Met Gly Val Phe 
            420                 425                 430         


Ile Thr Ala Trp Ala Arg Tyr Thr Thr Ile Thr Ala Ala Gln Ala Cys 
        435                 440                 445             


Tyr Asp Arg Ile Ile Tyr Cys Asp Thr Asp Ser Ile His Leu Thr Gly 
    450                 455                 460                 


Thr Glu Ile Pro Asp Val Ile Lys Asp Ile Val Asp Pro Lys Lys Leu 
465                 470                 475                 480 


Gly Tyr Trp Ala His Glu Ser Thr Phe Lys Arg Ala Lys Tyr Leu Arg 
                485                 490                 495     


Gln Lys Thr Tyr Ile Gln Asp Ile Tyr Met Lys Glu Val Asp Gly Lys 
            500                 505                 510         


Leu Val Glu Gly Ser Pro Asp Asp Tyr Thr Asp Ile Lys Phe Ser Val 
        515                 520                 525             


Lys Cys Ala Gly Met Thr Asp Lys Ile Lys Lys Glu Val Thr Phe Glu 
    530                 535                 540                 


Asn Phe Lys Val Gly Phe Ser Arg Lys Met Lys Pro Lys Pro Val Gln 
545                 550                 555                 560 


Val Pro Gly Gly Val Val Leu Val Asp Asp Thr Phe Thr Ile Lys Ser 
                565                 570                 575     


Gly Gly Ser Ala Trp Ser His Pro Gln Phe Glu Lys Gly Gly Gly Ser 
            580                 585                 590         


Gly Gly Gly Ser Gly Gly Ser Ala Trp Ser His Pro Gln Phe Glu Lys 
        595                 600                 605             


<210>  32
<211>  318
<212>  PRT
<213>  Herpes virus 1

<400>  32

Thr Asp Ser Pro Gly Gly Val Ala Pro Ala Ser Pro Val Glu Asp Ala 
1               5                   10                  15      


Ser Asp Ala Ser Leu Gly Gln Pro Glu Glu Gly Ala Pro Cys Gln Val 
            20                  25                  30          


Val Leu Gln Gly Ala Glu Leu Asn Gly Ile Leu Gln Ala Phe Ala Pro 
        35                  40                  45              


Leu Arg Thr Ser Leu Leu Asp Ser Leu Leu Val Met Gly Asp Arg Gly 
    50                  55                  60                  


Ile Leu Ile His Asn Thr Ile Phe Gly Glu Gln Val Phe Leu Pro Leu 
65                  70                  75                  80  


Glu His Ser Gln Phe Ser Arg Tyr Arg Trp Arg Gly Pro Thr Ala Ala 
                85                  90                  95      


Phe Leu Ser Leu Val Asp Gln Lys Arg Ser Leu Leu Ser Val Phe Arg 
            100                 105                 110         


Ala Asn Gln Tyr Pro Asp Leu Arg Arg Val Glu Leu Ala Ile Thr Gly 
        115                 120                 125             


Gln Ala Pro Phe Arg Thr Leu Val Gln Arg Ile Trp Thr Thr Thr Ser 
    130                 135                 140                 


Asp Gly Glu Ala Val Glu Leu Ala Ser Glu Thr Leu Met Lys Arg Glu 
145                 150                 155                 160 


Leu Thr Ser Phe Val Val Leu Val Pro Gln Gly Thr Pro Asp Val Gln 
                165                 170                 175     


Leu Arg Leu Thr Arg Pro Gln Leu Thr Lys Val Leu Asn Ala Thr Gly 
            180                 185                 190         


Ala Asp Ser Ala Thr Pro Thr Thr Phe Glu Leu Gly Val Asn Gly Lys 
        195                 200                 205             


Phe Ser Val Phe Thr Thr Ser Thr Cys Val Thr Phe Ala Ala Arg Glu 
    210                 215                 220                 


Glu Gly Val Ser Ser Ser Thr Ser Thr Gln Val Gln Ile Leu Ser Asn 
225                 230                 235                 240 


Ala Leu Thr Lys Ala Gly Gln Ala Ala Ala Asn Ala Lys Thr Val Tyr 
                245                 250                 255     


Gly Glu Asn Thr His Arg Thr Phe Ser Val Val Val Asp Asp Cys Ser 
            260                 265                 270         


Met Arg Ala Val Leu Arg Arg Leu Gln Val Gly Gly Gly Thr Leu Lys 
        275                 280                 285             


Phe Phe Leu Thr Thr Pro Val Pro Ser Leu Cys Val Thr Ala Thr Gly 
    290                 295                 300                 


Pro Asn Ala Val Ser Ala Val Phe Leu Leu Lys Pro Gln Lys 
305                 310                 315             


<210>  33
<211>  233
<212>  PRT
<213>  Bacteriophage RB69

<400>  33

Lys Gly Phe Ser Ser Glu Asp Lys Gly Glu Trp Lys Leu Lys Leu Asp 
1               5                   10                  15      


Ala Ser Gly Asn Gly Gln Ala Val Ile Arg Phe Leu Pro Ala Lys Thr 
            20                  25                  30          


Asp Asp Ala Leu Pro Phe Ala Ile Leu Val Asn His Gly Phe Lys Lys 
        35                  40                  45              


Asn Gly Lys Trp Tyr Ile Glu Thr Cys Ser Ser Thr His Gly Asp Tyr 
    50                  55                  60                  


Asp Ser Cys Pro Val Cys Gln Tyr Ile Ser Lys Asn Asp Leu Tyr Asn 
65                  70                  75                  80  


Thr Asn Lys Thr Glu Tyr Ser Gln Leu Lys Arg Lys Thr Ser Tyr Trp 
                85                  90                  95      


Ala Asn Ile Leu Val Val Lys Asp Pro Gln Ala Pro Asp Asn Glu Gly 
            100                 105                 110         


Lys Val Phe Lys Tyr Arg Phe Gly Lys Lys Ile Trp Asp Lys Ile Asn 
        115                 120                 125             


Ala Met Ile Ala Val Asp Thr Glu Met Gly Glu Thr Pro Val Asp Val 
    130                 135                 140                 


Thr Cys Pro Trp Glu Gly Ala Asn Phe Val Leu Lys Val Lys Gln Val 
145                 150                 155                 160 


Ser Gly Phe Ser Asn Tyr Asp Glu Ser Lys Phe Leu Asn Gln Ser Ala 
                165                 170                 175     


Ile Pro Asn Ile Asp Asp Glu Ser Phe Gln Lys Glu Leu Phe Glu Gln 
            180                 185                 190         


Met Val Asp Leu Ser Glu Met Thr Ser Lys Asp Lys Phe Lys Ser Phe 
        195                 200                 205             


Glu Glu Leu Asn Thr Lys Phe Asn Gln Val Leu Gly Thr Ala Ala Leu 
    210                 215                 220                 


Gly Gly Ala Ala Ala Ala Ala Ala Ser 
225                 230             


<210>  34
<211>  210
<212>  PRT
<213>  Bacteriophage T7

<400>  34

Ala Lys Lys Ile Phe Thr Ser Ala Leu Gly Thr Ala Glu Pro Tyr Ala 
1               5                   10                  15      


Tyr Ile Ala Lys Pro Asp Tyr Gly Asn Glu Glu Arg Gly Phe Gly Asn 
            20                  25                  30          


Pro Arg Gly Val Tyr Lys Val Asp Leu Thr Ile Pro Asn Lys Asp Pro 
        35                  40                  45              


Arg Cys Gln Arg Met Val Asp Glu Ile Val Lys Cys His Glu Glu Ala 
    50                  55                  60                  


Tyr Ala Ala Ala Val Glu Glu Tyr Glu Ala Asn Pro Pro Ala Val Ala 
65                  70                  75                  80  


Arg Gly Lys Lys Pro Leu Lys Pro Tyr Glu Gly Asp Met Pro Phe Phe 
                85                  90                  95      


Asp Asn Gly Asp Gly Thr Thr Thr Phe Lys Phe Lys Cys Tyr Ala Ser 
            100                 105                 110         


Phe Gln Asp Lys Lys Thr Lys Glu Thr Lys His Ile Asn Leu Val Val 
        115                 120                 125             


Val Asp Ser Lys Gly Lys Lys Met Glu Asp Val Pro Ile Ile Gly Gly 
    130                 135                 140                 


Gly Ser Lys Leu Lys Val Lys Tyr Ser Leu Val Pro Tyr Lys Trp Asn 
145                 150                 155                 160 


Thr Ala Val Gly Ala Ser Val Lys Leu Gln Leu Glu Ser Val Met Leu 
                165                 170                 175     


Val Glu Leu Ala Thr Phe Gly Gly Gly Glu Asp Asp Trp Ala Asp Glu 
            180                 185                 190         


Val Glu Glu Asn Gly Tyr Val Ala Ser Gly Ser Ala Lys Ala Ser Lys 
        195                 200                 205             


Pro Arg 
    210 


<210>  35
<211>  99
<212>  PRT
<213>  Halorubrum lacusprofundi

<400>  35

Ser Gly Glu Glu Leu Leu Asp Leu Ala Gly Val Arg Asn Val Gly Arg 
1               5                   10                  15      


Lys Arg Ala Arg Arg Leu Phe Glu Ala Gly Ile Glu Thr Arg Ala Asp 
            20                  25                  30          


Leu Arg Glu Ala Asp Lys Ala Val Val Leu Gly Ala Leu Arg Gly Arg 
        35                  40                  45              


Glu Arg Thr Ala Glu Arg Ile Leu Glu His Ala Gly Arg Glu Asp Pro 
    50                  55                  60                  


Ser Met Asp Asp Val Arg Pro Asp Lys Ser Ala Ser Ala Ala Ala Thr 
65                  70                  75                  80  


Ala Gly Ser Ala Ser Asp Glu Asp Gly Glu Gly Gln Ala Ser Leu Gly 
                85                  90                  95      


Asp Phe Arg 
            


<210>  36
<211>  102
<212>  PRT
<213>  Haloferax volcanii

<400>  36

Ser Gly Glu Glu Leu Leu Asp Leu Ala Gly Val Arg Gly Val Gly Arg 
1               5                   10                  15      


Lys Arg Ala Arg Arg Leu Phe Glu Ala Gly Val Glu Thr Arg Ala Asp 
            20                  25                  30          


Leu Arg Glu Ala Asp Lys Pro Arg Val Leu Ala Ala Leu Arg Gly Arg 
        35                  40                  45              


Arg Lys Thr Ala Glu Asn Ile Leu Glu Ala Ala Gly Arg Lys Asp Pro 
    50                  55                  60                  


Ser Met Asp Ala Val Asp Glu Asp Asp Ala Pro Asp Asp Ala Val Pro 
65                  70                  75                  80  


Asp Asp Ala Gly Phe Glu Thr Ala Lys Glu Arg Ala Asp Gln Gln Ala 
                85                  90                  95      


Ser Leu Gly Asp Phe Glu 
            100         


<210>  37
<211>  55
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  (HhH)2 domain

<400>  37

Trp Lys Glu Trp Leu Glu Arg Lys Val Gly Glu Gly Arg Ala Arg Arg 
1               5                   10                  15      


Leu Ile Glu Tyr Phe Gly Ser Ala Gly Glu Val Gly Lys Leu Val Glu 
            20                  25                  30          


Asn Ala Glu Val Ser Lys Leu Leu Glu Val Pro Gly Ile Gly Asp Glu 
        35                  40                  45              


Ala Val Ala Arg Leu Val Pro 
    50                  55  


<210>  38
<211>  107
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  (HhH)2-(HhH)2 domain

<400>  38

Trp Lys Glu Trp Leu Glu Arg Lys Val Gly Glu Gly Arg Ala Arg Arg 
1               5                   10                  15      


Leu Ile Glu Tyr Phe Gly Ser Ala Gly Glu Val Gly Lys Leu Val Glu 
            20                  25                  30          


Asn Ala Glu Val Ser Lys Leu Leu Glu Val Pro Gly Ile Gly Asp Glu 
        35                  40                  45              


Ala Val Ala Arg Leu Val Pro Gly Tyr Lys Thr Leu Arg Asp Ala Gly 
    50                  55                  60                  


Leu Thr Pro Ala Glu Ala Glu Arg Val Leu Lys Arg Tyr Gly Ser Val 
65                  70                  75                  80  


Ser Lys Val Gln Glu Gly Ala Thr Pro Asp Glu Leu Arg Glu Leu Gly 
                85                  90                  95      


Leu Gly Asp Ala Lys Ile Ala Arg Ile Leu Gly 
            100                 105         


<210>  39
<211>  132
<212>  PRT
<213>  Homo sapiens

<400>  39

Glu Ser Glu Thr Thr Thr Ser Leu Val Leu Glu Arg Ser Leu Asn Arg 
1               5                   10                  15      


Val His Leu Leu Gly Arg Val Gly Gln Asp Pro Val Leu Arg Gln Val 
            20                  25                  30          


Glu Gly Lys Asn Pro Val Thr Ile Phe Ser Leu Ala Thr Asn Glu Met 
        35                  40                  45              


Trp Arg Ser Gly Asp Ser Glu Val Tyr Gln Leu Gly Asp Val Ser Gln 
    50                  55                  60                  


Lys Thr Thr Trp His Arg Ile Ser Val Phe Arg Pro Gly Leu Arg Asp 
65                  70                  75                  80  


Val Ala Tyr Gln Tyr Val Lys Lys Gly Ser Arg Ile Tyr Leu Glu Gly 
                85                  90                  95      


Lys Ile Asp Tyr Gly Glu Tyr Met Asp Lys Asn Asn Val Arg Arg Gln 
            100                 105                 110         


Ala Thr Thr Ile Ile Ala Asp Asn Ile Ile Phe Leu Ser Asp Gln Thr 
        115                 120                 125             


Lys Glu Lys Glu 
    130         


<210>  40
<211>  123
<212>  PRT
<213>  Bacillus subtilis phage phi29

<400>  40

Glu Asn Thr Asn Ile Val Lys Ala Thr Phe Asp Thr Glu Thr Leu Glu 
1               5                   10                  15      


Gly Gln Ile Lys Ile Phe Asn Ala Gln Thr Gly Gly Gly Gln Ser Phe 
            20                  25                  30          


Lys Asn Leu Pro Asp Gly Thr Ile Ile Glu Ala Asn Ala Ile Ala Gln 
        35                  40                  45              


Tyr Lys Gln Val Ser Asp Thr Tyr Gly Asp Ala Lys Glu Glu Thr Val 
    50                  55                  60                  


Thr Thr Ile Phe Ala Ala Asp Gly Ser Leu Tyr Ser Ala Ile Ser Lys 
65                  70                  75                  80  


Thr Val Ala Glu Ala Ala Ser Asp Leu Ile Asp Leu Val Thr Arg His 
                85                  90                  95      


Lys Leu Glu Thr Phe Lys Val Lys Val Val Gln Gly Thr Ser Ser Lys 
            100                 105                 110         


Gly Asn Val Phe Phe Ser Leu Gln Leu Ser Leu 
        115                 120             


<210>  41
<211>  177
<212>  PRT
<213>  Escherichia coli

<400>  41

Ala Ser Arg Gly Val Asn Lys Val Ile Leu Val Gly Asn Leu Gly Gln 
1               5                   10                  15      


Asp Pro Glu Val Arg Tyr Met Pro Asn Gly Gly Ala Val Ala Asn Ile 
            20                  25                  30          


Thr Leu Ala Thr Ser Glu Ser Trp Arg Asp Lys Ala Thr Gly Glu Met 
        35                  40                  45              


Lys Glu Gln Thr Glu Trp His Arg Val Val Leu Phe Gly Lys Leu Ala 
    50                  55                  60                  


Glu Val Ala Ser Glu Tyr Leu Arg Lys Gly Ser Gln Val Tyr Ile Glu 
65                  70                  75                  80  


Gly Gln Leu Arg Thr Arg Lys Trp Thr Asp Gln Ser Gly Gln Asp Arg 
                85                  90                  95      


Tyr Thr Thr Glu Val Val Val Asn Val Gly Gly Thr Met Gln Met Leu 
            100                 105                 110         


Gly Gly Arg Gln Gly Gly Gly Ala Pro Ala Gly Gly Asn Ile Gly Gly 
        115                 120                 125             


Gly Gln Pro Gln Gly Gly Trp Gly Gln Pro Gln Gln Pro Gln Gly Gly 
    130                 135                 140                 


Asn Gln Phe Ser Gly Gly Ala Gln Ser Arg Pro Gln Gln Ser Ala Pro 
145                 150                 155                 160 


Ala Ala Pro Ser Asn Glu Pro Pro Met Asp Phe Asp Asp Asp Ile Pro 
                165                 170                 175     


Phe 
    


<210>  42
<211>  301
<212>  PRT
<213>  Enterobacteria phage T4

<400>  42

Met Phe Lys Arg Lys Ser Thr Ala Glu Leu Ala Ala Gln Met Ala Lys 
1               5                   10                  15      


Leu Asn Gly Asn Lys Gly Phe Ser Ser Glu Asp Lys Gly Glu Trp Lys 
            20                  25                  30          


Leu Lys Leu Asp Asn Ala Gly Asn Gly Gln Ala Val Ile Arg Phe Leu 
        35                  40                  45              


Pro Ser Lys Asn Asp Glu Gln Ala Pro Phe Ala Ile Leu Val Asn His 
    50                  55                  60                  


Gly Phe Lys Lys Asn Gly Lys Trp Tyr Ile Glu Thr Cys Ser Ser Thr 
65                  70                  75                  80  


His Gly Asp Tyr Asp Ser Cys Pro Val Cys Gln Tyr Ile Ser Lys Asn 
                85                  90                  95      


Asp Leu Tyr Asn Thr Asp Asn Lys Glu Tyr Ser Leu Val Lys Arg Lys 
            100                 105                 110         


Thr Ser Tyr Trp Ala Asn Ile Leu Val Val Lys Asp Pro Ala Ala Pro 
        115                 120                 125             


Glu Asn Glu Gly Lys Val Phe Lys Tyr Arg Phe Gly Lys Lys Ile Trp 
    130                 135                 140                 


Asp Lys Ile Asn Ala Met Ile Ala Val Asp Val Glu Met Gly Glu Thr 
145                 150                 155                 160 


Pro Val Asp Val Thr Cys Pro Trp Glu Gly Ala Asn Phe Val Leu Lys 
                165                 170                 175     


Val Lys Gln Val Ser Gly Phe Ser Asn Tyr Asp Glu Ser Lys Phe Leu 
            180                 185                 190         


Asn Gln Ser Ala Ile Pro Asn Ile Asp Asp Glu Ser Phe Gln Lys Glu 
        195                 200                 205             


Leu Phe Glu Gln Met Val Asp Leu Ser Glu Met Thr Ser Lys Asp Lys 
    210                 215                 220                 


Phe Lys Ser Phe Glu Glu Leu Asn Thr Lys Phe Gly Gln Val Met Gly 
225                 230                 235                 240 


Thr Ala Val Met Gly Gly Ala Ala Ala Thr Ala Ala Lys Lys Ala Asp 
                245                 250                 255     


Lys Val Ala Asp Asp Leu Asp Ala Phe Asn Val Asp Asp Phe Asn Thr 
            260                 265                 270         


Lys Thr Glu Asp Asp Phe Met Ser Ser Ser Ser Gly Ser Ser Ser Ser 
        275                 280                 285             


Ala Asp Asp Thr Asp Leu Asp Asp Leu Leu Asn Asp Leu 
    290                 295                 300     


<210>  43
<211>  177
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  EcoSSB-CterAla

<400>  43

Ala Ser Arg Gly Val Asn Lys Val Ile Leu Val Gly Asn Leu Gly Gln 
1               5                   10                  15      


Asp Pro Glu Val Arg Tyr Met Pro Asn Gly Gly Ala Val Ala Asn Ile 
            20                  25                  30          


Thr Leu Ala Thr Ser Glu Ser Trp Arg Asp Lys Ala Thr Gly Glu Met 
        35                  40                  45              


Lys Glu Gln Thr Glu Trp His Arg Val Val Leu Phe Gly Lys Leu Ala 
    50                  55                  60                  


Glu Val Ala Ser Glu Tyr Leu Arg Lys Gly Ser Gln Val Tyr Ile Glu 
65                  70                  75                  80  


Gly Gln Leu Arg Thr Arg Lys Trp Thr Asp Gln Ser Gly Gln Asp Arg 
                85                  90                  95      


Tyr Thr Thr Glu Val Val Val Asn Val Gly Gly Thr Met Gln Met Leu 
            100                 105                 110         


Gly Gly Arg Gln Gly Gly Gly Ala Pro Ala Gly Gly Asn Ile Gly Gly 
        115                 120                 125             


Gly Gln Pro Gln Gly Gly Trp Gly Gln Pro Gln Gln Pro Gln Gly Gly 
    130                 135                 140                 


Asn Gln Phe Ser Gly Gly Ala Gln Ser Arg Pro Gln Gln Ser Ala Pro 
145                 150                 155                 160 


Ala Ala Pro Ser Asn Glu Pro Pro Met Ala Phe Ala Ala Ala Ile Pro 
                165                 170                 175     


Phe 
    


<210>  44
<211>  177
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  EcoSSB-CterNGGN

<400>  44

Ala Ser Arg Gly Val Asn Lys Val Ile Leu Val Gly Asn Leu Gly Gln 
1               5                   10                  15      


Asp Pro Glu Val Arg Tyr Met Pro Asn Gly Gly Ala Val Ala Asn Ile 
            20                  25                  30          


Thr Leu Ala Thr Ser Glu Ser Trp Arg Asp Lys Ala Thr Gly Glu Met 
        35                  40                  45              


Lys Glu Gln Thr Glu Trp His Arg Val Val Leu Phe Gly Lys Leu Ala 
    50                  55                  60                  


Glu Val Ala Ser Glu Tyr Leu Arg Lys Gly Ser Gln Val Tyr Ile Glu 
65                  70                  75                  80  


Gly Gln Leu Arg Thr Arg Lys Trp Thr Asp Gln Ser Gly Gln Asp Arg 
                85                  90                  95      


Tyr Thr Thr Glu Val Val Val Asn Val Gly Gly Thr Met Gln Met Leu 
            100                 105                 110         


Gly Gly Arg Gln Gly Gly Gly Ala Pro Ala Gly Gly Asn Ile Gly Gly 
        115                 120                 125             


Gly Gln Pro Gln Gly Gly Trp Gly Gln Pro Gln Gln Pro Gln Gly Gly 
    130                 135                 140                 


Asn Gln Phe Ser Gly Gly Ala Gln Ser Arg Pro Gln Gln Ser Ala Pro 
145                 150                 155                 160 


Ala Ala Pro Ser Asn Glu Pro Pro Met Asn Phe Gly Gly Asn Ile Pro 
                165                 170                 175     


Phe 
    


<210>  45
<211>  152
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  EcoSSB-Q152del

<400>  45

Ala Ser Arg Gly Val Asn Lys Val Ile Leu Val Gly Asn Leu Gly Gln 
1               5                   10                  15      


Asp Pro Glu Val Arg Tyr Met Pro Asn Gly Gly Ala Val Ala Asn Ile 
            20                  25                  30          


Thr Leu Ala Thr Ser Glu Ser Trp Arg Asp Lys Ala Thr Gly Glu Met 
        35                  40                  45              


Lys Glu Gln Thr Glu Trp His Arg Val Val Leu Phe Gly Lys Leu Ala 
    50                  55                  60                  


Glu Val Ala Ser Glu Tyr Leu Arg Lys Gly Ser Gln Val Tyr Ile Glu 
65                  70                  75                  80  


Gly Gln Leu Arg Thr Arg Lys Trp Thr Asp Gln Ser Gly Gln Asp Arg 
                85                  90                  95      


Tyr Thr Thr Glu Val Val Val Asn Val Gly Gly Thr Met Gln Met Leu 
            100                 105                 110         


Gly Gly Arg Gln Gly Gly Gly Ala Pro Ala Gly Gly Asn Ile Gly Gly 
        115                 120                 125             


Gly Gln Pro Gln Gly Gly Trp Gly Gln Pro Gln Gln Pro Gln Gly Gly 
    130                 135                 140                 


Asn Gln Phe Ser Gly Gly Ala Gln 
145                 150         


<210>  46
<211>  117
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  EcoSSB-G117del

<400>  46

Ala Ser Arg Gly Val Asn Lys Val Ile Leu Val Gly Asn Leu Gly Gln 
1               5                   10                  15      


Asp Pro Glu Val Arg Tyr Met Pro Asn Gly Gly Ala Val Ala Asn Ile 
            20                  25                  30          


Thr Leu Ala Thr Ser Glu Ser Trp Arg Asp Lys Ala Thr Gly Glu Met 
        35                  40                  45              


Lys Glu Gln Thr Glu Trp His Arg Val Val Leu Phe Gly Lys Leu Ala 
    50                  55                  60                  


Glu Val Ala Ser Glu Tyr Leu Arg Lys Gly Ser Gln Val Tyr Ile Glu 
65                  70                  75                  80  


Gly Gln Leu Arg Thr Arg Lys Trp Thr Asp Gln Ser Gly Gln Asp Arg 
                85                  90                  95      


Tyr Thr Thr Glu Val Val Val Asn Val Gly Gly Thr Met Gln Met Leu 
            100                 105                 110         


Gly Gly Arg Gln Gly 
        115         


<210>  47
<211>  984
<212>  PRT
<213>  Methanopyrus kandleri

<400>  47

Met Ala Leu Val Tyr Asp Ala Glu Phe Val Gly Ser Glu Arg Glu Phe 
1               5                   10                  15      


Glu Glu Glu Arg Glu Thr Phe Leu Lys Gly Val Lys Ala Tyr Asp Gly 
            20                  25                  30          


Val Leu Ala Thr Arg Tyr Leu Met Glu Arg Ser Ser Ser Ala Lys Asn 
        35                  40                  45              


Asp Glu Glu Leu Leu Glu Leu His Gln Asn Phe Ile Leu Leu Thr Gly 
    50                  55                  60                  


Ser Tyr Ala Cys Ser Ile Asp Pro Thr Glu Asp Arg Tyr Gln Asn Val 
65                  70                  75                  80  


Ile Val Arg Gly Val Asn Phe Asp Glu Arg Val Gln Arg Leu Ser Thr 
                85                  90                  95      


Gly Gly Ser Pro Ala Arg Tyr Ala Ile Val Tyr Arg Arg Gly Trp Arg 
            100                 105                 110         


Ala Ile Ala Lys Ala Leu Asp Ile Asp Glu Glu Asp Val Pro Ala Ile 
        115                 120                 125             


Glu Val Arg Ala Val Lys Arg Asn Pro Leu Gln Pro Ala Leu Tyr Arg 
    130                 135                 140                 


Ile Leu Val Arg Tyr Gly Arg Val Asp Leu Met Pro Val Thr Val Asp 
145                 150                 155                 160 


Glu Val Pro Pro Glu Met Ala Gly Glu Phe Glu Arg Leu Ile Glu Arg 
                165                 170                 175     


Tyr Asp Val Pro Ile Asp Glu Lys Glu Glu Arg Ile Leu Glu Ile Leu 
            180                 185                 190         


Arg Glu Asn Pro Trp Thr Pro His Asp Glu Ile Ala Arg Arg Leu Gly 
        195                 200                 205             


Leu Ser Val Ser Glu Val Glu Gly Glu Lys Asp Pro Glu Ser Ser Gly 
    210                 215                 220                 


Ile Tyr Ser Leu Trp Ser Arg Val Val Val Asn Ile Glu Tyr Asp Glu 
225                 230                 235                 240 


Arg Thr Ala Lys Arg His Val Lys Arg Arg Asp Arg Leu Leu Glu Glu 
                245                 250                 255     


Leu Tyr Glu His Leu Glu Glu Leu Ser Glu Arg Tyr Leu Arg His Pro 
            260                 265                 270         


Leu Thr Arg Arg Trp Ile Val Glu His Lys Arg Asp Ile Met Arg Arg 
        275                 280                 285             


Tyr Leu Glu Gln Arg Ile Val Glu Cys Ala Leu Lys Leu Gln Asp Arg 
    290                 295                 300                 


Tyr Gly Ile Arg Glu Asp Val Ala Leu Cys Leu Ala Arg Ala Phe Asp 
305                 310                 315                 320 


Gly Ser Ile Ser Met Ile Ala Thr Thr Pro Tyr Arg Thr Leu Lys Asp 
                325                 330                 335     


Val Cys Pro Asp Leu Thr Leu Glu Glu Ala Lys Ser Val Asn Arg Thr 
            340                 345                 350         


Leu Ala Thr Leu Ile Asp Glu His Gly Leu Ser Pro Asp Ala Ala Asp 
        355                 360                 365             


Glu Leu Ile Glu His Phe Glu Ser Ile Ala Gly Ile Leu Ala Thr Asp 
    370                 375                 380                 


Leu Glu Glu Ile Glu Arg Met Tyr Glu Glu Gly Arg Leu Ser Glu Glu 
385                 390                 395                 400 


Ala Tyr Arg Ala Ala Val Glu Ile Gln Leu Ala Glu Leu Thr Lys Lys 
                405                 410                 415     


Glu Gly Val Gly Arg Lys Thr Ala Glu Arg Leu Leu Arg Ala Phe Gly 
            420                 425                 430         


Asn Pro Glu Arg Val Lys Gln Leu Ala Arg Glu Phe Glu Ile Glu Lys 
        435                 440                 445             


Leu Ala Ser Val Glu Gly Val Gly Glu Arg Val Leu Arg Ser Leu Val 
    450                 455                 460                 


Pro Gly Tyr Ala Ser Leu Ile Ser Ile Arg Gly Ile Asp Arg Glu Arg 
465                 470                 475                 480 


Ala Glu Arg Leu Leu Lys Lys Tyr Gly Gly Tyr Ser Lys Val Arg Glu 
                485                 490                 495     


Ala Gly Val Glu Glu Leu Arg Glu Asp Gly Leu Thr Asp Ala Gln Ile 
            500                 505                 510         


Arg Glu Leu Lys Gly Leu Lys Thr Leu Glu Ser Ile Val Gly Asp Leu 
        515                 520                 525             


Glu Lys Ala Asp Glu Leu Lys Arg Lys Tyr Gly Ser Ala Ser Ala Val 
    530                 535                 540                 


Arg Arg Leu Pro Val Glu Glu Leu Arg Glu Leu Gly Phe Ser Asp Asp 
545                 550                 555                 560 


Glu Ile Ala Glu Ile Lys Gly Ile Pro Lys Lys Leu Arg Glu Ala Phe 
                565                 570                 575     


Asp Leu Glu Thr Ala Ala Glu Leu Tyr Glu Arg Tyr Gly Ser Leu Lys 
            580                 585                 590         


Glu Ile Gly Arg Arg Leu Ser Tyr Asp Asp Leu Leu Glu Leu Gly Ala 
        595                 600                 605             


Thr Pro Lys Ala Ala Ala Glu Ile Lys Gly Pro Glu Phe Lys Phe Leu 
    610                 615                 620                 


Leu Asn Ile Glu Gly Val Gly Pro Lys Leu Ala Glu Arg Ile Leu Glu 
625                 630                 635                 640 


Ala Val Asp Tyr Asp Leu Glu Arg Leu Ala Ser Leu Asn Pro Glu Glu 
                645                 650                 655     


Leu Ala Glu Lys Val Glu Gly Leu Gly Glu Glu Leu Ala Glu Arg Val 
            660                 665                 670         


Val Tyr Ala Ala Arg Glu Arg Val Glu Ser Arg Arg Lys Ser Gly Arg 
        675                 680                 685             


Gln Glu Arg Ser Glu Glu Glu Trp Lys Glu Trp Leu Glu Arg Lys Val 
    690                 695                 700                 


Gly Glu Gly Arg Ala Arg Arg Leu Ile Glu Tyr Phe Gly Ser Ala Gly 
705                 710                 715                 720 


Glu Val Gly Lys Leu Val Glu Asn Ala Glu Val Ser Lys Leu Leu Glu 
                725                 730                 735     


Val Pro Gly Ile Gly Asp Glu Ala Val Ala Arg Leu Val Pro Gly Tyr 
            740                 745                 750         


Lys Thr Leu Arg Asp Ala Gly Leu Thr Pro Ala Glu Ala Glu Arg Val 
        755                 760                 765             


Leu Lys Arg Tyr Gly Ser Val Ser Lys Val Gln Glu Gly Ala Thr Pro 
    770                 775                 780                 


Asp Glu Leu Arg Glu Leu Gly Leu Gly Asp Ala Lys Ile Ala Arg Ile 
785                 790                 795                 800 


Leu Gly Leu Arg Ser Leu Val Asn Lys Arg Leu Asp Val Asp Thr Ala 
                805                 810                 815     


Tyr Glu Leu Lys Arg Arg Tyr Gly Ser Val Ser Ala Val Arg Lys Ala 
            820                 825                 830         


Pro Val Lys Glu Leu Arg Glu Leu Gly Leu Ser Asp Arg Lys Ile Ala 
        835                 840                 845             


Arg Ile Lys Gly Ile Pro Glu Thr Met Leu Gln Val Arg Gly Met Ser 
    850                 855                 860                 


Val Glu Lys Ala Glu Arg Leu Leu Glu Arg Phe Asp Thr Trp Thr Lys 
865                 870                 875                 880 


Val Lys Glu Ala Pro Val Ser Glu Leu Val Arg Val Pro Gly Val Gly 
                885                 890                 895     


Leu Ser Leu Val Lys Glu Ile Lys Ala Gln Val Asp Pro Ala Trp Lys 
            900                 905                 910         


Ala Leu Leu Asp Val Lys Gly Val Ser Pro Glu Leu Ala Asp Arg Leu 
        915                 920                 925             


Val Glu Glu Leu Gly Ser Pro Tyr Arg Val Leu Thr Ala Lys Lys Ser 
    930                 935                 940                 


Asp Leu Met Arg Val Glu Arg Val Gly Pro Lys Leu Ala Glu Arg Ile 
945                 950                 955                 960 


Arg Ala Ala Gly Lys Arg Tyr Val Glu Glu Arg Arg Ser Arg Arg Glu 
                965                 970                 975     


Arg Ile Arg Arg Lys Leu Arg Gly 
            980                 


<210>  48
<211>  299
<212>  PRT
<213>  Methanopyrus kandleri

<400>  48

Ser Gly Arg Gln Glu Arg Ser Glu Glu Glu Trp Lys Glu Trp Leu Glu 
1               5                   10                  15      


Arg Lys Val Gly Glu Gly Arg Ala Arg Arg Leu Ile Glu Tyr Phe Gly 
            20                  25                  30          


Ser Ala Gly Glu Val Gly Lys Leu Val Glu Asn Ala Glu Val Ser Lys 
        35                  40                  45              


Leu Leu Glu Val Pro Gly Ile Gly Asp Glu Ala Val Ala Arg Leu Val 
    50                  55                  60                  


Pro Gly Tyr Lys Thr Leu Arg Asp Ala Gly Leu Thr Pro Ala Glu Ala 
65                  70                  75                  80  


Glu Arg Val Leu Lys Arg Tyr Gly Ser Val Ser Lys Val Gln Glu Gly 
                85                  90                  95      


Ala Thr Pro Asp Glu Leu Arg Glu Leu Gly Leu Gly Asp Ala Lys Ile 
            100                 105                 110         


Ala Arg Ile Leu Gly Leu Arg Ser Leu Val Asn Lys Arg Leu Asp Val 
        115                 120                 125             


Asp Thr Ala Tyr Glu Leu Lys Arg Arg Tyr Gly Ser Val Ser Ala Val 
    130                 135                 140                 


Arg Lys Ala Pro Val Lys Glu Leu Arg Glu Leu Gly Leu Ser Asp Arg 
145                 150                 155                 160 


Lys Ile Ala Arg Ile Lys Gly Ile Pro Glu Thr Met Leu Gln Val Arg 
                165                 170                 175     


Gly Met Ser Val Glu Lys Ala Glu Arg Leu Leu Glu Arg Phe Asp Thr 
            180                 185                 190         


Trp Thr Lys Val Lys Glu Ala Pro Val Ser Glu Leu Val Arg Val Pro 
        195                 200                 205             


Gly Val Gly Leu Ser Leu Val Lys Glu Ile Lys Ala Gln Val Asp Pro 
    210                 215                 220                 


Ala Trp Lys Ala Leu Leu Asp Val Lys Gly Val Ser Pro Glu Leu Ala 
225                 230                 235                 240 


Asp Arg Leu Val Glu Glu Leu Gly Ser Pro Tyr Arg Val Leu Thr Ala 
                245                 250                 255     


Lys Lys Ser Asp Leu Met Arg Val Glu Arg Val Gly Pro Lys Leu Ala 
            260                 265                 270         


Glu Arg Ile Arg Ala Ala Gly Lys Arg Tyr Val Glu Glu Arg Arg Ser 
        275                 280                 285             


Arg Arg Glu Arg Ile Arg Arg Lys Leu Arg Gly 
    290                 295                 


<210>  49
<211>  853
<212>  PRT
<213>  Escherichia coli

<400>  49

Met Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln Gln 
1               5                   10                  15      


Tyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr Arg 
            20                  25                  30          


Met Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala Ser 
        35                  40                  45              


Gln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly Glu 
    50                  55                  60                  


Pro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr Leu 
65                  70                  75                  80  


Ala Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln Ile 
                85                  90                  95      


Gly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val Arg 
            100                 105                 110         


Ile Val Thr Pro Gly Thr Ile Ser Asp Glu Ala Leu Leu Gln Glu Arg 
        115                 120                 125             


Gln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe Gly 
    130                 135                 140                 


Tyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Leu Ser Glu Pro 
145                 150                 155                 160 


Ala Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro Ala 
                165                 170                 175     


Glu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu Gly 
            180                 185                 190         


Arg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp Thr 
        195                 200                 205             


Ala Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val Gly 
    210                 215                 220                 


Phe Gly Val Glu Asn Ala Pro Arg Gly Leu Cys Ala Ala Gly Cys Leu 
225                 230                 235                 240 


Leu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile Arg 
                245                 250                 255     


Ser Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala Ala 
            260                 265                 270         


Thr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala Glu 
        275                 280                 285             


Asn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly Ser 
    290                 295                 300                 


Arg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg Val 
305                 310                 315                 320 


Leu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr Ala 
                325                 330                 335     


Gly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile Leu 
            340                 345                 350         


Ala Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg Met 
        355                 360                 365             


Arg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu Thr 
    370                 375                 380                 


Val Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu Phe 
385                 390                 395                 400 


Ala Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro Pro 
                405                 410                 415     


Val Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu Glu 
            420                 425                 430         


Leu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu Glu 
        435                 440                 445             


Arg Leu Glu Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu Lys 
    450                 455                 460                 


Val Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg Gly 
465                 470                 475                 480 


Gln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu Lys 
                485                 490                 495     


Asn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp Lys 
            500                 505                 510         


Val Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu Tyr 
        515                 520                 525             


Glu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln Gln 
    530                 535                 540                 


Ser Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala Glu 
545                 550                 555                 560 


Arg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys Pro 
                565                 570                 575     


Gly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val Leu 
            580                 585                 590         


Asn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg Arg 
        595                 600                 605             


Met Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr Met 
    610                 615                 620                 


Arg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr Val 
625                 630                 635                 640 


Pro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr Arg 
                645                 650                 655     


Val Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met Val 
            660                 665                 670         


Glu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr Ser 
        675                 680                 685             


Leu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp Gly 
    690                 695                 700                 


Leu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile Lys 
705                 710                 715                 720 


Ala Leu Thr Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu Pro 
                725                 730                 735     


Glu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu His 
            740                 745                 750         


Gly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala Ser 
        755                 760                 765             


Lys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys Glu 
    770                 775                 780                 


Val Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile Ser 
785                 790                 795                 800 


Pro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu Leu 
                805                 810                 815     


Ser Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn Leu 
            820                 825                 830         


Asp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr Arg 
        835                 840                 845             


Leu Lys Ser Leu Val 
    850             


<210>  50
<211>  64
<212>  PRT
<213>  Sufolobus solfataricus

<400>  50

Met Ala Thr Val Lys Phe Lys Tyr Lys Gly Glu Glu Lys Glu Val Asp 
1               5                   10                  15      


Ile Ser Lys Ile Lys Lys Val Trp Arg Val Gly Lys Met Ile Ser Phe 
            20                  25                  30          


Thr Tyr Asp Glu Gly Gly Gly Lys Thr Gly Arg Gly Ala Val Ser Glu 
        35                  40                  45              


Lys Asp Ala Pro Lys Glu Leu Leu Gln Met Leu Glu Lys Gln Lys Lys 
    50                  55                  60                  


<210>  51
<211>  99
<212>  PRT
<213>  Sufolobus solfataricus P2

<400>  51

Glu Lys Met Ser Ser Gly Thr Pro Thr Pro Ser Asn Val Val Leu Ile 
1               5                   10                  15      


Gly Lys Lys Pro Val Met Asn Tyr Val Leu Ala Ala Leu Thr Leu Leu 
            20                  25                  30          


Asn Gln Gly Val Ser Glu Ile Val Ile Lys Ala Arg Gly Arg Ala Ile 
        35                  40                  45              


Ser Lys Ala Val Asp Thr Val Glu Ile Val Arg Asn Arg Phe Leu Pro 
    50                  55                  60                  


Asp Lys Ile Glu Ile Lys Glu Ile Arg Val Gly Ser Gln Val Val Thr 
65                  70                  75                  80  


Ser Gln Asp Gly Arg Gln Ser Arg Val Ser Thr Ile Glu Ile Ala Ile 
                85                  90                  95      


Arg Lys Lys 
            


<210>  52
<211>  88
<212>  PRT
<213>  Sufolobus solfataricus P2

<400>  52

Thr Glu Lys Leu Asn Glu Ile Val Val Arg Lys Thr Lys Asn Val Glu 
1               5                   10                  15      


Asp His Val Leu Asp Val Ile Val Leu Phe Asn Gln Gly Ile Asp Glu 
            20                  25                  30          


Val Ile Leu Lys Gly Thr Gly Arg Glu Ile Ser Lys Ala Val Asp Val 
        35                  40                  45              


Tyr Asn Ser Leu Lys Asp Arg Leu Gly Asp Gly Val Gln Leu Val Asn 
    50                  55                  60                  


Val Gln Thr Gly Ser Glu Val Arg Asp Arg Arg Arg Ile Ser Tyr Ile 
65                  70                  75                  80  


Leu Leu Arg Leu Lys Arg Val Tyr 
                85              


<210>  53
<211>  107
<212>  PRT
<213>  Escherichia coli

<400>  53

Ala Gln Gln Ser Pro Tyr Ser Ala Ala Met Ala Glu Gln Arg His Gln 
1               5                   10                  15      


Glu Trp Leu Arg Phe Val Asp Leu Leu Lys Asn Ala Tyr Gln Asn Asp 
            20                  25                  30          


Leu His Leu Pro Leu Leu Asn Leu Met Leu Thr Pro Asp Glu Arg Glu 
        35                  40                  45              


Ala Leu Gly Thr Arg Val Arg Ile Val Glu Glu Leu Leu Arg Gly Glu 
    50                  55                  60                  


Met Ser Gln Arg Glu Leu Lys Asn Glu Leu Gly Ala Gly Ile Ala Thr 
65                  70                  75                  80  


Ile Thr Arg Gly Ser Asn Ser Leu Lys Ala Ala Pro Val Glu Leu Arg 
                85                  90                  95      


Gln Trp Leu Glu Glu Val Leu Leu Lys Ser Asp 
            100                 105         


<210>  54
<211>  237
<212>  PRT
<213>  Enterobacteria phage lambda

<400>  54

Met Ser Thr Lys Lys Lys Pro Leu Thr Gln Glu Gln Leu Glu Asp Ala 
1               5                   10                  15      


Arg Arg Leu Lys Ala Ile Tyr Glu Lys Lys Lys Asn Glu Leu Gly Leu 
            20                  25                  30          


Ser Gln Glu Ser Val Ala Asp Lys Met Gly Met Gly Gln Ser Gly Val 
        35                  40                  45              


Gly Ala Leu Phe Asn Gly Ile Asn Ala Leu Asn Ala Tyr Asn Ala Ala 
    50                  55                  60                  


Leu Leu Ala Lys Ile Leu Lys Val Ser Val Glu Glu Phe Ser Pro Ser 
65                  70                  75                  80  


Ile Ala Arg Glu Ile Tyr Glu Met Tyr Glu Ala Val Ser Met Gln Pro 
                85                  90                  95      


Ser Leu Arg Ser Glu Tyr Glu Tyr Pro Val Phe Ser His Val Gln Ala 
            100                 105                 110         


Gly Met Phe Ser Pro Glu Leu Arg Thr Phe Thr Lys Gly Asp Ala Glu 
        115                 120                 125             


Arg Trp Val Ser Thr Thr Lys Lys Ala Ser Asp Ser Ala Phe Trp Leu 
    130                 135                 140                 


Glu Val Glu Gly Asn Ser Met Thr Ala Pro Thr Gly Ser Lys Pro Ser 
145                 150                 155                 160 


Phe Pro Asp Gly Met Leu Ile Leu Val Asp Pro Glu Gln Ala Val Glu 
                165                 170                 175     


Pro Gly Asp Phe Cys Ile Ala Arg Leu Gly Gly Asp Glu Phe Thr Phe 
            180                 185                 190         


Lys Lys Leu Ile Arg Asp Ser Gly Gln Val Phe Leu Gln Pro Leu Asn 
        195                 200                 205             


Pro Gln Tyr Pro Met Ile Pro Cys Asn Glu Ser Cys Ser Val Val Gly 
    210                 215                 220                 


Lys Val Ile Ala Ser Gln Trp Pro Glu Glu Thr Phe Gly 
225                 230                 235         


<210>  55
<211>  60
<212>  PRT
<213>  Crenarchaea

<400>  55

Met Ser Ser Gly Lys Lys Pro Val Lys Val Lys Thr Pro Ala Gly Lys 
1               5                   10                  15      


Glu Ala Glu Leu Val Pro Glu Lys Val Trp Ala Leu Ala Pro Lys Gly 
            20                  25                  30          


Arg Lys Gly Val Lys Ile Gly Leu Phe Lys Asp Pro Glu Thr Gly Lys 
        35                  40                  45              


Tyr Phe Arg His Lys Leu Pro Asp Asp Tyr Pro Ile 
    50                  55                  60  


<210>  56
<211>  136
<212>  PRT
<213>  Homo sapiens

<400>  56

Met Ala Arg Thr Lys Gln Thr Ala Arg Lys Ser Thr Gly Gly Lys Ala 
1               5                   10                  15      


Pro Arg Lys Gln Leu Ala Thr Lys Ala Ala Arg Lys Ser Ala Pro Ala 
            20                  25                  30          


Thr Gly Gly Val Lys Lys Pro His Arg Tyr Arg Pro Gly Thr Val Ala 
        35                  40                  45              


Leu Arg Glu Ile Arg Arg Tyr Gln Lys Ser Thr Glu Leu Leu Ile Arg 
    50                  55                  60                  


Lys Leu Pro Phe Gln Arg Leu Val Arg Glu Ile Ala Gln Asp Phe Lys 
65                  70                  75                  80  


Thr Asp Leu Arg Phe Gln Ser Ser Ala Val Met Ala Leu Gln Glu Ala 
                85                  90                  95      


Ser Glu Ala Tyr Leu Val Gly Leu Phe Glu Asp Thr Asn Leu Cys Ala 
            100                 105                 110         


Ile His Ala Lys Arg Val Thr Ile Met Pro Lys Asp Ile Gln Leu Ala 
        115                 120                 125             


Arg Arg Ile Arg Gly Glu Arg Ala 
    130                 135     


<210>  57
<211>  89
<212>  PRT
<213>  Enterobacteria phage T4

<400>  57

Met Ala Lys Lys Glu Met Val Glu Phe Asp Glu Ala Ile His Gly Glu 
1               5                   10                  15      


Asp Leu Ala Lys Phe Ile Lys Glu Ala Ser Asp His Lys Leu Lys Ile 
            20                  25                  30          


Ser Gly Tyr Asn Glu Leu Ile Lys Asp Ile Arg Ile Arg Ala Lys Asp 
        35                  40                  45              


Glu Leu Gly Val Asp Gly Lys Met Phe Asn Arg Leu Leu Ala Leu Tyr 
    50                  55                  60                  


His Lys Asp Asn Arg Asp Val Phe Glu Ala Glu Thr Glu Glu Val Val 
65                  70                  75                  80  


Glu Leu Tyr Asp Thr Val Phe Ser Lys 
                85                  


<210>  58
<211>  339
<212>  PRT
<213>  Homo sapiens

<400>  58

Met Ala Met Gln Met Gln Leu Glu Ala Asn Ala Asp Thr Ser Val Glu 
1               5                   10                  15      


Glu Glu Ser Phe Gly Pro Gln Pro Ile Ser Arg Leu Glu Gln Cys Gly 
            20                  25                  30          


Ile Asn Ala Asn Asp Val Lys Lys Leu Glu Glu Ala Gly Phe His Thr 
        35                  40                  45              


Val Glu Ala Val Ala Tyr Ala Pro Lys Lys Glu Leu Ile Asn Ile Lys 
    50                  55                  60                  


Gly Ile Ser Glu Ala Lys Ala Asp Lys Ile Leu Ala Glu Ala Ala Lys 
65                  70                  75                  80  


Leu Val Pro Met Gly Phe Thr Thr Ala Thr Glu Phe His Gln Arg Arg 
                85                  90                  95      


Ser Glu Ile Ile Gln Ile Thr Thr Gly Ser Lys Glu Leu Asp Lys Leu 
            100                 105                 110         


Leu Gln Gly Gly Ile Glu Thr Gly Ser Ile Thr Glu Met Phe Gly Glu 
        115                 120                 125             


Phe Arg Thr Gly Lys Thr Gln Ile Cys His Thr Leu Ala Val Thr Cys 
    130                 135                 140                 


Gln Leu Pro Ile Asp Arg Gly Gly Gly Glu Gly Lys Ala Met Tyr Ile 
145                 150                 155                 160 


Asp Thr Glu Gly Thr Phe Arg Pro Glu Arg Leu Leu Ala Val Ala Glu 
                165                 170                 175     


Arg Tyr Gly Leu Ser Gly Ser Asp Val Leu Asp Asn Val Ala Tyr Ala 
            180                 185                 190         


Arg Ala Phe Asn Thr Asp His Gln Thr Gln Leu Leu Tyr Gln Ala Ser 
        195                 200                 205             


Ala Met Met Val Glu Ser Arg Tyr Ala Leu Leu Ile Val Asp Ser Ala 
    210                 215                 220                 


Thr Ala Leu Tyr Arg Thr Asp Tyr Ser Gly Arg Gly Glu Leu Ser Ala 
225                 230                 235                 240 


Arg Gln Met His Leu Ala Arg Phe Leu Arg Met Leu Leu Arg Leu Ala 
                245                 250                 255     


Asp Glu Phe Gly Val Ala Val Val Ile Thr Asn Gln Val Val Ala Gln 
            260                 265                 270         


Val Asp Gly Ala Ala Met Phe Ala Ala Asp Pro Lys Lys Pro Ile Gly 
        275                 280                 285             


Gly Asn Ile Ile Ala His Ala Ser Thr Thr Arg Leu Tyr Leu Arg Lys 
    290                 295                 300                 


Gly Arg Gly Glu Thr Arg Ile Cys Lys Ile Tyr Asp Ser Pro Cys Leu 
305                 310                 315                 320 


Pro Glu Ala Glu Ala Met Phe Ala Ile Asn Ala Asp Gly Val Gly Asp 
                325                 330                 335     


Ala Lys Asp 
            


<210>  59
<211>  375
<212>  PRT
<213>  Citromicrobium bathyomarinum JL354

<400>  59

Met Lys Ala Thr Ile Glu Arg Ala Thr Leu Leu Arg Cys Leu Ser His 
1               5                   10                  15      


Val Gln Ser Val Val Glu Arg Arg Asn Thr Ile Pro Ile Leu Ser Asn 
            20                  25                  30          


Val Leu Ile Asp Ala Asp Ala Gly Gly Gly Val Lys Val Met Ala Thr 
        35                  40                  45              


Asp Leu Asp Leu Gln Val Val Glu Thr Met Thr Ala Ala Ser Val Glu 
    50                  55                  60                  


Ser Ala Gly Ala Ile Thr Val Ser Ala His Leu Leu Phe Asp Ile Ala 
65                  70                  75                  80  


Arg Lys Leu Pro Asp Gly Ser Gln Val Ser Leu Glu Thr Ala Asp Asn 
                85                  90                  95      


Arg Met Val Val Lys Ala Gly Arg Ser Arg Phe Gln Leu Pro Thr Leu 
            100                 105                 110         


Pro Arg Asp Asp Phe Pro Val Ile Val Glu Gly Glu Leu Pro Thr Ser 
        115                 120                 125             


Phe Glu Leu Pro Ala Arg Glu Leu Ala Glu Met Ile Asp Arg Thr Arg 
    130                 135                 140                 


Phe Ala Ile Ser Thr Glu Glu Thr Arg Tyr Tyr Leu Asn Gly Ile Phe 
145                 150                 155                 160 


Leu His Val Ser Asp Glu Ala Arg Pro Val Leu Lys Ala Ala Ala Thr 
                165                 170                 175     


Asp Gly His Arg Leu Ala Arg Tyr Thr Leu Asp Arg Pro Glu Gly Ala 
            180                 185                 190         


Glu Gly Met Pro Asp Val Ile Val Pro Arg Lys Ala Val Gly Glu Leu 
        195                 200                 205             


Arg Lys Leu Leu Glu Glu Ala Leu Asp Ser Asn Val Gln Ile Asp Leu 
    210                 215                 220                 


Ser Ala Ser Lys Ile Arg Phe Ala Leu Gly Gly Glu Gly Gly Val Val 
225                 230                 235                 240 


Leu Thr Ser Lys Leu Ile Asp Gly Thr Phe Pro Asp Tyr Ser Arg Val 
                245                 250                 255     


Ile Pro Thr Gly Asn Asp Lys Leu Leu Arg Leu Asp Pro Lys Ala Phe 
            260                 265                 270         


Phe Gln Gly Val Asp Arg Val Ala Thr Ile Ala Thr Glu Lys Thr Arg 
        275                 280                 285             


Ala Val Lys Met Gly Leu Asp Glu Asp Lys Val Thr Leu Ser Val Thr 
    290                 295                 300                 


Ser Pro Asp Asn Gly Thr Ala Ala Glu Glu Ile Ala Ala Glu Tyr Lys 
305                 310                 315                 320 


Ala Glu Gly Phe Glu Ile Gly Phe Asn Ala Asn Tyr Leu Lys Asp Ile 
                325                 330                 335     


Leu Gly Gln Ile Asp Ser Asp Thr Val Glu Leu His Leu Ala Asp Ala 
            340                 345                 350         


Gly Ala Pro Thr Leu Ile Arg Arg Asp Glu Asn Ser Pro Ala Leu Tyr 
        355                 360                 365             


Val Leu Met Pro Met Arg Val 
    370                 375 


<210>  60
<211>  50
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 1

<400>  60

Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
1               5                   10                  15      


Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
            20                  25                  30          


Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
        35                  40                  45              


Thr Thr 
    50  


<210>  61
<211>  27
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Examples 1, 3, 4 and 6

<400>  61
ggttgtttct gttggtgctg atattgc                                           27


<210>  62
<211>  97138
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 1

<400>  62
gctccactaa agggccgatt gacgggcggc gacctcgcgg gttttcgcta tttatgaaaa       60

ttttccggtt taaggcgttt ccgttcttct tcgtcataac ttaatgtttt tatttaaaat      120

accctctgaa aagaaaggaa acgacaggtg ctgaaagcga ggctttttgg cctctgtcgt      180

ttcctttctc tgtttttgtc cgtggaatga acaatggaag tcaacaaaaa gcagctggct      240

gacattttcg gtgcgagtat ccgtaccatt cagaactggc aggaacaggg aatgcccgtt      300

ctgcgaggcg gtggcaaggg taatgaggtg ctttatgact ctgccgccgt cataaaatgg      360

tatgccgaaa gggatgctga aattgagaac gaaaagctgc gccgggaggt tgaagaactg      420

cggcaggcca gcgaggcaga tctccagcca ggaactattg agtacgaacg ccatcgactt      480

acgcgtgcgc aggccgacgc acaggaactg aagaatgcca gagactccgc tgaagtggtg      540

gaaaccgcat tctgtacttt cgtgctgtcg cggatcgcag gtgaaattgc cagtattctc      600

gacgggctcc ccctgtcggt gcagcggcgt tttccggaac tggaaaaccg acatgttgat      660

ttcctgaaac gggatatcat caaagccatg aacaaagcag ccgcgctgga tgaactgata      720

ccggggttgc tgagtgaata tatcgaacag tcaggttaac aggctgcggc attttgtccg      780

cgccgggctt cgctcactgt tcaggccgga gccacagacc gccgttgaat gggcggatgc      840

taattactat ctcccgaaag aatccgcata ccaggaaggg cgctgggaaa cactgccctt      900

tcagcgggcc atcatgaatg cgatgggcag cgactacatc cgtgaggtga atgtggtgaa      960

gtctgcccgt gtcggttatt ccaaaatgct gctgggtgtt tatgcctact ttatagagca     1020

taagcagcgc aacaccctta tctggttgcc gacggatggt gatgccgaga actttatgaa     1080

aacccacgtt gagccgacta ttcgtgatat tccgtcgctg ctggcgctgg ccccgtggta     1140

tggcaaaaag caccgggata acacgctcac catgaagcgt ttcactaatg ggcgtggctt     1200

ctggtgcctg ggcggtaaag cggcaaaaaa ctaccgtgaa aagtcggtgg atgtggcggg     1260

ttatgatgaa cttgctgctt ttgatgatga tattgaacag gaaggctctc cgacgttcct     1320

gggtgacaag cgtattgaag gctcggtctg gccaaagtcc atccgtggct ccacgccaaa     1380

agtgagaggc acctgtcaga ttgagcgtgc agccagtgaa tccccgcatt ttatgcgttt     1440

tcatgttgcc tgcccgcatt gcggggagga gcagtatctt aaatttggcg acaaagagac     1500

gccgtttggc ctcaaatgga cgccggatga cccctccagc gtgttttatc tctgcgagca     1560

taatgcctgc gtcatccgcc agcaggagct ggactttact gatgcccgtt atatctgcga     1620

aaagaccggg atctggaccc gtgatggcat tctctggttt tcgtcatccg gtgaagagat     1680

tgagccacct gacagtgtga cctttcacat ctggacagcg tacagcccgt tcaccacctg     1740

ggtgcagatt gtcaaagact ggatgaaaac gaaaggggat acgggaaaac gtaaaacctt     1800

cgtaaacacc acgctcggtg agacgtggga ggcgaaaatt ggcgaacgtc cggatgctga     1860

agtgatggca gagcggaaag agcattattc agcgcccgtt cctgaccgtg tggcttacct     1920

gaccgccggt atcgactccc agctggaccg ctacgaaatg cgcgtatggg gatgggggcc     1980

gggtgaggaa agctggctga ttgaccggca gattattatg ggccgccacg acgatgaaca     2040

gacgctgctg cgtgtggatg aggccatcaa taaaacctat acccgccgga atggtgcaga     2100

aatgtcgata tcccgtatct gctgggatac tggcgggatt gacccgacca ttgtgtatga     2160

acgctcgaaa aaacatgggc tgttccgggt gatccccatt aaaggggcat ccgtctacgg     2220

aaagccggtg gccagcatgc cacgtaagcg aaacaaaaac ggggtttacc ttaccgaaat     2280

cggtacggat accgcgaaag agcagattta taaccgcttc acactgacgc cggaagggga     2340

tgaaccgctt cccggtgccg ttcacttccc gaataacccg gatatttttg atctgaccga     2400

agcgcagcag ctgactgctg aagagcaggt cgaaaaatgg gtggatggca ggaaaaaaat     2460

actgtgggac agcaaaaagc gacgcaatga ggcactcgac tgcttcgttt atgcgctggc     2520

ggcgctgcgc atcagtattt cccgctggca gctggatctc agtgcgctgc tggcgagcct     2580

gcaggaagag gatggtgcag caaccaacaa gaaaacactg gcagattacg cccgtgcctt     2640

atccggagag gatgaatgac gcgacaggaa gaacttgccg ctgcccgtgc ggcactgcat     2700

gacctgatga caggtaaacg ggtggcaaca gtacagaaag acggacgaag ggtggagttt     2760

acggccactt ccgtgtctga cctgaaaaaa tatattgcag agctggaagt gcagaccggc     2820

atgacacagc gacgcagggg acctgcagga ttttatgtat gaaaacgccc accattccca     2880

cccttctggg gccggacggc atgacatcgc tgcgcgaata tgccggttat cacggcggtg     2940

gcagcggatt tggagggcag ttgcggtcgt ggaacccacc gagtgaaagt gtggatgcag     3000

ccctgttgcc caactttacc cgtggcaatg cccgcgcaga cgatctggta cgcaataacg     3060

gctatgccgc caacgccatc cagctgcatc aggatcatat cgtcgggtct tttttccggc     3120

tcagtcatcg cccaagctgg cgctatctgg gcatcgggga ggaagaagcc cgtgcctttt     3180

cccgcgaggt tgaagcggca tggaaagagt ttgccgagga tgactgctgc tgcattgacg     3240

ttgagcgaaa acgcacgttt accatgatga ttcgggaagg tgtggccatg cacgccttta     3300

acggtgaact gttcgttcag gccacctggg ataccagttc gtcgcggctt ttccggacac     3360

agttccggat ggtcagcccg aagcgcatca gcaacccgaa caataccggc gacagccgga     3420

actgccgtgc cggtgtgcag attaatgaca gcggtgcggc gctgggatat tacgtcagcg     3480

aggacgggta tcctggctgg atgccgcaga aatggacatg gataccccgt gagttacccg     3540

gcgggcgcgc ctcgttcatt cacgtttttg aacccgtgga ggacgggcag actcgcggtg     3600

caaatgtgtt ttacagcgtg atggagcaga tgaagatgct cgacacgctg cagaacacgc     3660

agctgcagag cgccattgtg aaggcgatgt atgccgccac cattgagagt gagctggata     3720

cgcagtcagc gatggatttt attctgggcg cgaacagtca ggagcagcgg gaaaggctga     3780

ccggctggat tggtgaaatt gccgcgtatt acgccgcagc gccggtccgg ctgggaggcg     3840

caaaagtacc gcacctgatg ccgggtgact cactgaacct gcagacggct caggatacgg     3900

ataacggcta ctccgtgttt gagcagtcac tgctgcggta tatcgctgcc gggctgggtg     3960

tctcgtatga gcagctttcc cggaattacg cccagatgag ctactccacg gcacgggcca     4020

gtgcgaacga gtcgtgggcg tactttatgg ggcggcgaaa attcgtcgca tcccgtcagg     4080

cgagccagat gtttctgtgc tggctggaag aggccatcgt tcgccgcgtg gtgacgttac     4140

cttcaaaagc gcgcttcagt tttcaggaag cccgcagtgc ctgggggaac tgcgactgga     4200

taggctccgg tcgtatggcc atcgatggtc tgaaagaagt tcaggaagcg gtgatgctga     4260

tagaagccgg actgagtacc tacgagaaag agtgcgcaaa acgcggtgac gactatcagg     4320

aaatttttgc ccagcaggtc cgtgaaacga tggagcgccg tgcagccggt cttaaaccgc     4380

ccgcctgggc ggctgcagca tttgaatccg ggctgcgaca atcaacagag gaggagaaga     4440

gtgacagcag agctgcgtaa tctcccgcat attgccagca tggcctttaa tgagccgctg     4500

atgcttgaac ccgcctatgc gcgggttttc ttttgtgcgc ttgcaggcca gcttgggatc     4560

agcagcctga cggatgcggt gtccggcgac agcctgactg cccaggaggc actcgcgacg     4620

ctggcattat ccggtgatga tgacggacca cgacaggccc gcagttatca ggtcatgaac     4680

ggcatcgccg tgctgccggt gtccggcacg ctggtcagcc ggacgcgggc gctgcagccg     4740

tactcgggga tgaccggtta caacggcatt atcgcccgtc tgcaacaggc tgccagcgat     4800

ccgatggtgg acggcattct gctcgatatg gacacgcccg gcgggatggt ggcgggggca     4860

tttgactgcg ctgacatcat cgcccgtgtg cgtgacataa aaccggtatg ggcgcttgcc     4920

aacgacatga actgcagtgc aggtcagttg cttgccagtg ccgcctcccg gcgtctggtc     4980

acgcagaccg cccggacagg ctccatcggc gtcatgatgg ctcacagtaa ttacggtgct     5040

gcgctggaga aacagggtgt ggaaatcacg ctgatttaca gcggcagcca taaggtggat     5100

ggcaacccct acagccatct tccggatgac gtccgggaga cactgcagtc ccggatggac     5160

gcaacccgcc agatgtttgc gcagaaggtg tcggcatata ccggcctgtc cgtgcaggtt     5220

gtgctggata ccgaggctgc agtgtacagc ggtcaggagg ccattgatgc cggactggct     5280

gatgaacttg ttaacagcac cgatgcgatc accgtcatgc gtgatgcact ggatgcacgt     5340

aaatcccgtc tctcaggagg gcgaatgacc aaagagactc aatcaacaac tgtttcagcc     5400

actgcttcgc aggctgacgt tactgacgtg gtgccagcga cggagggcga gaacgccagc     5460

gcggcgcagc cggacgtgaa cgcgcagatc accgcagcgg ttgcggcaga aaacagccgc     5520

attatgggga tcctcaactg tgaggaggct cacggacgcg aagaacaggc acgcgtgctg     5580

gcagaaaccc ccggtatgac cgtgaaaacg gcccgccgca ttctggccgc agcaccacag     5640

agtgcacagg cgcgcagtga cactgcgctg gatcgtctga tgcagggggc accggcaccg     5700

ctggctgcag gtaacccggc atctgatgcc gttaacgatt tgctgaacac accagtgtaa     5760

gggatgttta tgacgagcaa agaaaccttt acccattacc agccgcaggg caacagtgac     5820

ccggctcata ccgcaaccgc gcccggcgga ttgagtgcga aagcgcctgc aatgaccccg     5880

ctgatgctgg acacctccag ccgtaagctg gttgcgtggg atggcaccac cgacggtgct     5940

gccgttggca ttcttgcggt tgctgctgac cagaccagca ccacgctgac gttctacaag     6000

tccggcacgt tccgttatga ggatgtgctc tggccggagg ctgccagcga cgagacgaaa     6060

aaacggaccg cgtttgccgg aacggcaatc agcatcgttt aactttaccc ttcatcacta     6120

aaggccgcct gtgcggcttt ttttacggga tttttttatg tcgatgtaca caaccgccca     6180

actgctggcg gcaaatgagc agaaatttaa gtttgatccg ctgtttctgc gtctcttttt     6240

ccgtgagagc tatcccttca ccacggagaa agtctatctc tcacaaattc cgggactggt     6300

aaacatggcg ctgtacgttt cgccgattgt ttccggtgag gttatccgtt cccgtggcgg     6360

ctccacctct gaatttacgc cgggatatgt caagccgaag catgaagtga atccgcagat     6420

gaccctgcgt cgcctgccgg atgaagatcc gcagaatctg gcggacccgg cttaccgccg     6480

ccgtcgcatc atcatgcaga acatgcgtga cgaagagctg gccattgctc aggtcgaaga     6540

gatgcaggca gtttctgccg tgcttaaggg caaatacacc atgaccggtg aagccttcga     6600

tccggttgag gtggatatgg gccgcagtga ggagaataac atcacgcagt ccggcggcac     6660

ggagtggagc aagcgtgaca agtccacgta tgacccgacc gacgatatcg aagcctacgc     6720

gctgaacgcc agcggtgtgg tgaatatcat cgtgttcgat ccgaaaggct gggcgctgtt     6780

ccgttccttc aaagccgtca aggagaagct ggatacccgt cgtggctcta attccgagct     6840

ggagacagcg gtgaaagacc tgggcaaagc ggtgtcctat aaggggatgt atggcgatgt     6900

ggccatcgtc gtgtattccg gacagtacgt ggaaaacggc gtcaaaaaga acttcctgcc     6960

ggacaacacg atggtgctgg ggaacactca ggcacgcggt ctgcgcacct atggctgcat     7020

tcaggatgcg gacgcacagc gcgaaggcat taacgcctct gcccgttacc cgaaaaactg     7080

ggtgaccacc ggcgatccgg cgcgtgagtt caccatgatt cagtcagcac cgctgatgct     7140

gctggctgac cctgatgagt tcgtgtccgt acaactggcg taatcatggc ccttcggggc     7200

cattgtttct ctgtggagga gtccatgacg aaagatgaac tgattgcccg tctccgctcg     7260

ctgggtgaac aactgaaccg tgatgtcagc ctgacgggga cgaaagaaga actggcgctc     7320

cgtgtggcag agctgaaaga ggagcttgat gacacggatg aaactgccgg tcaggacacc     7380

cctctcagcc gggaaaatgt gctgaccgga catgaaaatg aggtgggatc agcgcagccg     7440

gataccgtga ttctggatac gtctgaactg gtcacggtcg tggcactggt gaagctgcat     7500

actgatgcac ttcacgccac gcgggatgaa cctgtggcat ttgtgctgcc gggaacggcg     7560

tttcgtgtct ctgccggtgt ggcagccgaa atgacagagc gcggcctggc cagaatgcaa     7620

taacgggagg cgctgtggct gatttcgata acctgttcga tgctgccatt gcccgcgccg     7680

atgaaacgat acgcgggtac atgggaacgt cagccaccat tacatccggt gagcagtcag     7740

gtgcggtgat acgtggtgtt tttgatgacc ctgaaaatat cagctatgcc ggacagggcg     7800

tgcgcgttga aggctccagc ccgtccctgt ttgtccggac tgatgaggtg cggcagctgc     7860

ggcgtggaga cacgctgacc atcggtgagg aaaatttctg ggtagatcgg gtttcgccgg     7920

atgatggcgg aagttgtcat ctctggcttg gacggggcgt accgcctgcc gttaaccgtc     7980

gccgctgaaa gggggatgta tggccataaa aggtcttgag caggccgttg aaaacctcag     8040

ccgtatcagc aaaacggcgg tgcctggtgc cgccgcaatg gccattaacc gcgttgcttc     8100

atccgcgata tcgcagtcgg cgtcacaggt tgcccgtgag acaaaggtac gccggaaact     8160

ggtaaaggaa agggccaggc tgaaaagggc cacggtcaaa aatccgcagg ccagaatcaa     8220

agttaaccgg ggggatttgc ccgtaatcaa gctgggtaat gcgcgggttg tcctttcgcg     8280

ccgcaggcgt cgtaaaaagg ggcagcgttc atccctgaaa ggtggcggca gcgtgcttgt     8340

ggtgggtaac cgtcgtattc ccggcgcgtt tattcagcaa ctgaaaaatg gccggtggca     8400

tgtcatgcag cgtgtggctg ggaaaaaccg ttaccccatt gatgtggtga aaatcccgat     8460

ggcggtgccg ctgaccacgg cgtttaaaca aaatattgag cggatacggc gtgaacgtct     8520

tccgaaagag ctgggctatg cgctgcagca tcaactgagg atggtaataa agcgatgaaa     8580

catactgaac tccgtgcagc cgtactggat gcactggaga agcatgacac cggggcgacg     8640

ttttttgatg gtcgccccgc tgtttttgat gaggcggatt ttccggcagt tgccgtttat     8700

ctcaccggcg ctgaatacac gggcgaagag ctggacagcg atacctggca ggcggagctg     8760

catatcgaag ttttcctgcc tgctcaggtg ccggattcag agctggatgc gtggatggag     8820

tcccggattt atccggtgat gagcgatatc ccggcactgt cagatttgat caccagtatg     8880

gtggccagcg gctatgacta ccggcgcgac gatgatgcgg gcttgtggag ttcagccgat     8940

ctgacttatg tcattaccta tgaaatgtga ggacgctatg cctgtaccaa atcctacaat     9000

gccggtgaaa ggtgccggga ccaccctgtg ggtttataag gggagcggtg acccttacgc     9060

gaatccgctt tcagacgttg actggtcgcg tctggcaaaa gttaaagacc tgacgcccgg     9120

cgaactgacc gctgagtcct atgacgacag ctatctcgat gatgaagatg cagactggac     9180

tgcgaccggg caggggcaga aatctgccgg agataccagc ttcacgctgg cgtggatgcc     9240

cggagagcag gggcagcagg cgctgctggc gtggtttaat gaaggcgata cccgtgccta     9300

taaaatccgc ttcccgaacg gcacggtcga tgtgttccgt ggctgggtca gcagtatcgg     9360

taaggcggtg acggcgaagg aagtgatcac ccgcacggtg aaagtcacca atgtgggacg     9420

tccgtcgatg gcagaagatc gcagcacggt aacagcggca accggcatga ccgtgacgcc     9480

tgccagcacc tcggtggtga aagggcagag caccacgctg accgtggcct tccagccgga     9540

gggcgtaacc gacaagagct ttcgtgcggt gtctgcggat aaaacaaaag ccaccgtgtc     9600

ggtcagtggt atgaccatca ccgtgaacgg cgttgctgca ggcaaggtca acattccggt     9660

tgtatccggt aatggtgagt ttgctgcggt tgcagaaatt accgtcaccg ccagttaatc     9720

cggagagtca gcgatgttcc tgaaaaccga atcatttgaa cataacggtg tgaccgtcac     9780

gctttctgaa ctgtcagccc tgcagcgcat tgagcatctc gccctgatga aacggcaggc     9840

agaacaggcg gagtcagaca gcaaccggaa gtttactgtg gaagacgcca tcagaaccgg     9900

cgcgtttctg gtggcgatgt ccctgtggca taaccatccg cagaagacgc agatgccgtc     9960

catgaatgaa gccgttaaac agattgagca ggaagtgctt accacctggc ccacggaggc    10020

aatttctcat gctgaaaacg tggtgtaccg gctgtctggt atgtatgagt ttgtggtgaa    10080

taatgcccct gaacagacag aggacgccgg gcccgcagag cctgtttctg cgggaaagtg    10140

ttcgacggtg agctgagttt tgccctgaaa ctggcgcgtg agatggggcg acccgactgg    10200

cgtgccatgc ttgccgggat gtcatccacg gagtatgccg actggcaccg cttttacagt    10260

acccattatt ttcatgatgt tctgctggat atgcactttt ccgggctgac gtacaccgtg    10320

ctcagcctgt ttttcagcga tccggatatg catccgctgg atttcagtct gctgaaccgg    10380

cgcgaggctg acgaagagcc tgaagatgat gtgctgatgc agaaagcggc agggcttgcc    10440

ggaggtgtcc gctttggccc ggacgggaat gaagttatcc ccgcttcccc ggatgtggcg    10500

gacatgacgg aggatgacgt aatgctgatg acagtatcag aagggatcgc aggaggagtc    10560

cggtatggct gaaccggtag gcgatctggt cgttgatttg agtctggatg cggccagatt    10620

tgacgagcag atggccagag tcaggcgtca tttttctggt acggaaagtg atgcgaaaaa    10680

aacagcggca gtcgttgaac agtcgctgag ccgacaggcg ctggctgcac agaaagcggg    10740

gatttccgtc gggcagtata aagccgccat gcgtatgctg cctgcacagt tcaccgacgt    10800

ggccacgcag cttgcaggcg ggcaaagtcc gtggctgatc ctgctgcaac agggggggca    10860

ggtgaaggac tccttcggcg ggatgatccc catgttcagg gggcttgccg gtgcgatcac    10920

cctgccgatg gtgggggcca cctcgctggc ggtggcgacc ggtgcgctgg cgtatgcctg    10980

gtatcagggc aactcaaccc tgtccgattt caacaaaacg ctggtccttt ccggcaatca    11040

ggcgggactg acggcagatc gtatgctggt cctgtccaga gccgggcagg cggcagggct    11100

gacgtttaac cagaccagcg agtcactcag cgcactggtt aaggcggggg taagcggtga    11160

ggctcagatt gcgtccatca gccagagtgt ggcgcgtttc tcctctgcat ccggcgtgga    11220

ggtggacaag gtcgctgaag ccttcgggaa gctgaccaca gacccgacgt cggggctgac    11280

ggcgatggct cgccagttcc ataacgtgtc ggcggagcag attgcgtatg ttgctcagtt    11340

gcagcgttcc ggcgatgaag ccggggcatt gcaggcggcg aacgaggccg caacgaaagg    11400

gtttgatgac cagacccgcc gcctgaaaga gaacatgggc acgctggaga cctgggcaga    11460

caggactgcg cgggcattca aatccatgtg ggatgcggtg ctggatattg gtcgtcctga    11520

taccgcgcag gagatgctga ttaaggcaga ggctgcgtat aagaaagcag acgacatctg    11580

gaatctgcgc aaggatgatt attttgttaa cgatgaagcg cgggcgcgtt actgggatga    11640

tcgtgaaaag gcccgtcttg cgcttgaagc cgcccgaaag aaggctgagc agcagactca    11700

acaggacaaa aatgcgcagc agcagagcga taccgaagcg tcacggctga aatataccga    11760

agaggcgcag aaggcttacg aacggctgca gacgccgctg gagaaatata ccgcccgtca    11820

ggaagaactg aacaaggcac tgaaagacgg gaaaatcctg caggcggatt acaacacgct    11880

gatggcggcg gcgaaaaagg attatgaagc gacgctgaaa aagccgaaac agtccagcgt    11940

gaaggtgtct gcgggcgatc gtcaggaaga cagtgctcat gctgccctgc tgacgcttca    12000

ggcagaactc cggacgctgg agaagcatgc cggagcaaat gagaaaatca gccagcagcg    12060

ccgggatttg tggaaggcgg agagtcagtt cgcggtactg gaggaggcgg cgcaacgtcg    12120

ccagctgtct gcacaggaga aatccctgct ggcgcataaa gatgagacgc tggagtacaa    12180

acgccagctg gctgcacttg gcgacaaggt tacgtatcag gagcgcctga acgcgctggc    12240

gcagcaggcg gataaattcg cacagcagca acgggcaaaa cgggccgcca ttgatgcgaa    12300

aagccggggg ctgactgacc ggcaggcaga acgggaagcc acggaacagc gcctgaagga    12360

acagtatggc gataatccgc tggcgctgaa taacgtcatg tcagagcaga aaaagacctg    12420

ggcggctgaa gaccagcttc gcgggaactg gatggcaggc ctgaagtccg gctggagtga    12480

gtgggaagag agcgccacgg acagtatgtc gcaggtaaaa agtgcagcca cgcagacctt    12540

tgatggtatt gcacagaata tggcggcgat gctgaccggc agtgagcaga actggcgcag    12600

cttcacccgt tccgtgctgt ccatgatgac agaaattctg cttaagcagg caatggtggg    12660

gattgtcggg agtatcggca gcgccattgg cggggctgtt ggtggcggcg catccgcgtc    12720

aggcggtaca gccattcagg ccgctgcggc gaaattccat tttgcaaccg gaggatttac    12780

gggaaccggc ggcaaatatg agccagcggg gattgttcac cgtggtgagt ttgtcttcac    12840

gaaggaggca accagccgga ttggcgtggg gaatctttac cggctgatgc gcggctatgc    12900

caccggcggt tatgtcggta caccgggcag catggcagac agccggtcgc aggcgtccgg    12960

gacgtttgag cagaataacc atgtggtgat taacaacgac ggcacgaacg ggcagatagg    13020

tccggctgct ctgaaggcgg tgtatgacat ggcccgcaag ggtgcccgtg atgaaattca    13080

gacacagatg cgtgatggtg gcctgttctc cggaggtgga cgatgaagac cttccgctgg    13140

aaagtgaaac ccggtatgga tgtggcttcg gtcccttctg taagaaaggt gcgctttggt    13200

gatggctatt ctcagcgagc gcctgccggg ctgaatgcca acctgaaaac gtacagcgtg    13260

acgctttctg tcccccgtga ggaggccacg gtactggagt cgtttctgga agagcacggg    13320

ggctggaaat cctttctgtg gacgccgcct tatgagtggc ggcagataaa ggtgacctgc    13380

gcaaaatggt cgtcgcgggt cagtatgctg cgtgttgagt tcagcgcaga gtttgaacag    13440

gtggtgaact gatgcaggat atccggcagg aaacactgaa tgaatgcacc cgtgcggagc    13500

agtcggccag cgtggtgctc tgggaaatcg acctgacaga ggtcggtgga gaacgttatt    13560

ttttctgtaa tgagcagaac gaaaaaggtg agccggtcac ctggcagggg cgacagtatc    13620

agccgtatcc cattcagggg agcggttttg aactgaatgg caaaggcacc agtacgcgcc    13680

ccacgctgac ggtttctaac ctgtacggta tggtcaccgg gatggcggaa gatatgcaga    13740

gtctggtcgg cggaacggtg gtccggcgta aggtttacgc ccgttttctg gatgcggtga    13800

acttcgtcaa cggaaacagt tacgccgatc cggagcagga ggtgatcagc cgctggcgca    13860

ttgagcagtg cagcgaactg agcgcggtga gtgcctcctt tgtactgtcc acgccgacgg    13920

aaacggatgg cgctgttttt ccgggacgta tcatgctggc caacacctgc acctggacct    13980

atcgcggtga cgagtgcggt tatagcggtc cggctgtcgc ggatgaatat gaccagccaa    14040

cgtccgatat cacgaaggat aaatgcagca aatgcctgag cggttgtaag ttccgcaata    14100

acgtcggcaa ctttggcggc ttcctttcca ttaacaaact ttcgcagtaa atcccatgac    14160

acagacagaa tcagcgattc tggcgcacgc ccggcgatgt gcgccagcgg agtcgtgcgg    14220

cttcgtggta agcacgccgg agggggaaag atatttcccc tgcgtgaata tctccggtga    14280

gccggaggct atttccgtat gtcgccggaa gactggctgc aggcagaaat gcagggtgag    14340

attgtggcgc tggtccacag ccaccccggt ggtctgccct ggctgagtga ggccgaccgg    14400

cggctgcagg tgcagagtga tttgccgtgg tggctggtct gccgggggac gattcataag    14460

ttccgctgtg tgccgcatct caccgggcgg cgctttgagc acggtgtgac ggactgttac    14520

acactgttcc gggatgctta tcatctggcg gggattgaga tgccggactt tcatcgtgag    14580

gatgactggt ggcgtaacgg ccagaatctc tatctggata atctggaggc gacggggctg    14640

tatcaggtgc cgttgtcagc ggcacagccg ggcgatgtgc tgctgtgctg ttttggttca    14700

tcagtgccga atcacgccgc aatttactgc ggcgacggcg agctgctgca ccatattcct    14760

gaacaactga gcaaacgaga gaggtacacc gacaaatggc agcgacgcac acactccctc    14820

tggcgtcacc gggcatggcg cgcatctgcc tttacgggga tttacaacga tttggtcgcc    14880

gcatcgacct tcgtgtgaaa acgggggctg aagccatccg ggcactggcc acacagctcc    14940

cggcgtttcg tcagaaactg agcgacggct ggtatcaggt acggattgcc gggcgggacg    15000

tcagcacgtc cgggttaacg gcgcagttac atgagactct gcctgatggc gctgtaattc    15060

atattgttcc cagagtcgcc ggggccaagt caggtggcgt attccagatt gtcctggggg    15120

ctgccgccat tgccggatca ttctttaccg ccggagccac ccttgcagca tggggggcag    15180

ccattggggc cggtggtatg accggcatcc tgttttctct cggtgccagt atggtgctcg    15240

gtggtgtggc gcagatgctg gcaccgaaag ccagaactcc ccgtatacag acaacggata    15300

acggtaagca gaacacctat ttctcctcac tggataacat ggttgcccag ggcaatgttc    15360

tgcctgttct gtacggggaa atgcgcgtgg ggtcacgcgt ggtttctcag gagatcagca    15420

cggcagacga aggggacggt ggtcaggttg tggtgattgg tcgctgatgc aaaatgtttt    15480

atgtgaaacc gcctgcgggc ggttttgtca tttatggagc gtgaggaatg ggtaaaggaa    15540

gcagtaaggg gcataccccg cgcgaagcga aggacaacct gaagtccacg cagttgctga    15600

gtgtgatcga tgccatcagc gaagggccga ttgaaggtcc ggtggatggc ttaaaaagcg    15660

tgctgctgaa cagtacgccg gtgctggaca ctgaggggaa taccaacata tccggtgtca    15720

cggtggtgtt ccgggctggt gagcaggagc agactccgcc ggagggattt gaatcctccg    15780

gctccgagac ggtgctgggt acggaagtga aatatgacac gccgatcacc cgcaccatta    15840

cgtctgcaaa catcgaccgt ctgcgcttta ccttcggtgt acaggcactg gtggaaacca    15900

cctcaaaggg tgacaggaat ccgtcggaag tccgcctgct ggttcagata caacgtaacg    15960

gtggctgggt gacggaaaaa gacatcacca ttaagggcaa aaccacctcg cagtatctgg    16020

cctcggtggt gatgggtaac ctgccgccgc gcccgtttaa tatccggatg cgcaggatga    16080

cgccggacag caccacagac cagctgcaga acaaaacgct ctggtcgtca tacactgaaa    16140

tcatcgatgt gaaacagtgc tacccgaaca cggcactggt cggcgtgcag gtggactcgg    16200

agcagttcgg cagccagcag gtgagccgta attatcatct gcgcgggcgt attctgcagg    16260

tgccgtcgaa ctataacccg cagacgcggc aatacagcgg tatctgggac ggaacgttta    16320

aaccggcata cagcaacaac atggcctggt gtctgtggga tatgctgacc catccgcgct    16380

acggcatggg gaaacgtctt ggtgcggcgg atgtggataa atgggcgctg tatgtcatcg    16440

gccagtactg cgaccagtca gtgccggacg gctttggcgg cacggagccg cgcatcacct    16500

gtaatgcgta cctgaccaca cagcgtaagg cgtgggatgt gctcagcgat ttctgctcgg    16560

cgatgcgctg tatgccggta tggaacgggc agacgctgac gttcgtgcag gaccgaccgt    16620

cggataagac gtggacctat aaccgcagta atgtggtgat gccggatgat ggcgcgccgt    16680

tccgctacag cttcagcgcc ctgaaggacc gccataatgc cgttgaggtg aactggattg    16740

acccgaacaa cggctgggag acggcgacag agcttgttga agatacgcag gccattgccc    16800

gttacggtcg taatgttacg aagatggatg cctttggctg taccagccgg gggcaggcac    16860

accgcgccgg gctgtggctg attaaaacag aactgctgga aacgcagacc gtggatttca    16920

gcgtcggcgc agaagggctt cgccatgtac cgggcgatgt tattgaaatc tgcgatgatg    16980

actatgccgg tatcagcacc ggtggtcgtg tgctggcggt gaacagccag acccggacgc    17040

tgacgctcga ccgtgaaatc acgctgccat cctccggtac cgcgctgata agcctggttg    17100

acggaagtgg caatccggtc agcgtggagg ttcagtccgt caccgacggc gtgaaggtaa    17160

aagtgagccg tgttcctgac ggtgttgctg aatacagcgt atgggagctg aagctgccga    17220

cgctgcgcca gcgactgttc cgctgcgtga gtatccgtga gaacgacgac ggcacgtatg    17280

ccatcaccgc cgtgcagcat gtgccggaaa aagaggccat cgtggataac ggggcgcact    17340

ttgacggcga acagagtggc acggtgaatg gtgtcacgcc gccagcggtg cagcacctga    17400

ccgcagaagt cactgcagac agcggggaat atcaggtgct ggcgcgatgg gacacaccga    17460

aggtggtgaa gggcgtgagt ttcctgctcc gtctgaccgt aacagcggac gacggcagtg    17520

agcggctggt cagcacggcc cggacgacgg aaaccacata ccgcttcacg caactggcgc    17580

tggggaacta caggctgaca gtccgggcgg taaatgcgtg ggggcagcag ggcgatccgg    17640

cgtcggtatc gttccggatt gccgcaccgg cagcaccgtc gaggattgag ctgacgccgg    17700

gctattttca gataaccgcc acgccgcatc ttgccgttta tgacccgacg gtacagtttg    17760

agttctggtt ctcggaaaag cagattgcgg atatcagaca ggttgaaacc agcacgcgtt    17820

atcttggtac ggcgctgtac tggatagccg ccagtatcaa tatcaaaccg ggccatgatt    17880

attactttta tatccgcagt gtgaacaccg ttggcaaatc ggcattcgtg gaggccgtcg    17940

gtcgggcgag cgatgatgcg gaaggttacc tggatttttt caaaggcaag ataaccgaat    18000

cccatctcgg caaggagctg ctggaaaaag tcgagctgac ggaggataac gccagcagac    18060

tggaggagtt ttcgaaagag tggaaggatg ccagtgataa gtggaatgcc atgtgggctg    18120

tcaaaattga gcagaccaaa gacggcaaac attatgtcgc gggtattggc ctcagcatgg    18180

aggacacgga ggaaggcaaa ctgagccagt ttctggttgc cgccaatcgt atcgcattta    18240

ttgacccggc aaacgggaat gaaacgccga tgtttgtggc gcagggcaac cagatattca    18300

tgaacgacgt gttcctgaag cgcctgacgg cccccaccat taccagcggc ggcaatcctc    18360

cggccttttc cctgacaccg gacggaaagc tgaccgctaa aaatgcggat atcagtggca    18420

gtgtgaatgc gaactccggg acgctcagta atgtgacgat agctgaaaac tgtacgataa    18480

acggtacgct gagggcggaa aaaatcgtcg gggacattgt aaaggcggcg agcgcggctt    18540

ttccgcgcca gcgtgaaagc agtgtggact ggccgtcagg tacccgtact gtcaccgtga    18600

ccgatgacca tccttttgat cgccagatag tggtgcttcc gctgacgttt cgcggaagta    18660

agcgtactgt cagcggcagg acaacgtatt cgatgtgtta tctgaaagta ctgatgaacg    18720

gtgcggtgat ttatgatggc gcggcgaacg aggcggtaca ggtgttctcc cgtattgttg    18780

acatgccagc gggtcgggga aacgtgatcc tgacgttcac gcttacgtcc acacggcatt    18840

cggcagatat tccgccgtat acgtttgcca gcgatgtgca ggttatggtg attaagaaac    18900

aggcgctggg catcagcgtg gtctgagtgt gttacagagg ttcgtccggg aacgggcgtt    18960

ttattataaa acagtgagag gtgaacgatg cgtaatgtgt gtattgccgt tgctgtcttt    19020

gccgcacttg cggtgacagt cactccggcc cgtgcggaag gtggacatgg tacgtttacg    19080

gtgggctatt ttcaagtgaa accgggtaca ttgccgtcgt tgtcgggcgg ggataccggt    19140

gtgagtcatc tgaaagggat taacgtgaag taccgttatg agctgacgga cagtgtgggg    19200

gtgatggctt ccctggggtt cgccgcgtcg aaaaagagca gcacagtgat gaccggggag    19260

gatacgtttc actatgagag cctgcgtgga cgttatgtga gcgtgatggc cggaccggtt    19320

ttacaaatca gtaagcaggt cagtgcgtac gccatggccg gagtggctca cagtcggtgg    19380

tccggcagta caatggatta ccgtaagacg gaaatcactc ccgggtatat gaaagagacg    19440

accactgcca gggacgaaag tgcaatgcgg catacctcag tggcgtggag tgcaggtata    19500

cagattaatc cggcagcgtc cgtcgttgtt gatattgctt atgaaggctc cggcagtggc    19560

gactggcgta ctgacggatt catcgttggg gtcggttata aattctgatt agccaggtaa    19620

cacagtgtta tgacagcccg ccggaaccgg tgggcttttt tgtggggtga atatggcagt    19680

aaagatttca ggagtcctga aagacggcac aggaaaaccg gtacagaact gcaccattca    19740

gctgaaagcc agacgtaaca gcaccacggt ggtggtgaac acggtgggct cagagaatcc    19800

ggatgaagcc gggcgttaca gcatggatgt ggagtacggt cagtacagtg tcatcctgca    19860

ggttgacggt tttccaccat cgcacgccgg gaccatcacc gtgtatgaag attcacaacc    19920

ggggacgctg aatgattttc tctgtgccat gacggaggat gatgcccggc cggaggtgct    19980

gcgtcgtctt gaactgatgg tggaagaggt ggcgcgtaac gcgtccgtgg tggcacagag    20040

tacggcagac gcgaagaaat cagccggcga tgccagtgca tcagctgctc aggtcgcggc    20100

ccttgtgact gatgcaactg actcagcacg cgccgccagc acgtccgccg gacaggctgc    20160

atcgtcagct caggaagcgt cctccggcgc agaagcggca tcagcaaagg ccactgaagc    20220

ggaaaaaagt gccgcagccg cagagtcctc aaaaaacgcg gcggccacca gtgccggtgc    20280

ggcgaaaacg tcagaaacga atgctgcagc gtcacaacaa tcagccgcca cgtctgcctc    20340

caccgcggcc acgaaagcgt cagaggccgc cacttcagca cgagatgcgg tggcctcaaa    20400

agaggcagca aaatcatcag aaacgaacgc atcatcaagt gccggtcgtg cagcttcctc    20460

ggcaacggcg gcagaaaatt ctgccagggc ggcaaaaacg tccgagacga atgccaggtc    20520

atctgaaaca gcagcggaac ggagcgcctc tgccgcggca gacgcaaaaa cagcggcggc    20580

ggggagtgcg tcaacggcat ccacgaaggc gacagaggct gcgggaagtg cggtatcagc    20640

atcgcagagc aaaagtgcgg cagaagcggc ggcaatacgt gcaaaaaatt cggcaaaacg    20700

tgcagaagat atagcttcag ctgtcgcgct tgaggatgcg gacacaacga gaaaggggat    20760

agtgcagctc agcagtgcaa ccaacagcac gtctgaaacg cttgctgcaa cgccaaaggc    20820

ggttaaggtg gtaatggatg aaacgaacag aaaagcccac tggacagtcc ggcactgacc    20880

ggaacgccaa cagcaccaac cgcgctcagg ggaacaaaca atacccagat tgcgaacacc    20940

gcttttgtac tggccgcgat tgcagatgtt atcgacgcgt cacctgacgc actgaatacg    21000

ctgaatgaac tggccgcagc gctcgggaat gatccagatt ttgctaccac catgactaac    21060

gcgcttgcgg gtaaacaacc gaagaatgcg acactgacgg cgctggcagg gctttccacg    21120

gcgaaaaata aattaccgta ttttgcggaa aatgatgccg ccagcctgac tgaactgact    21180

caggttggca gggatattct ggcaaaaaat tccgttgcag atgttcttga ataccttggg    21240

gccggtgaga attcggcctt tccggcaggt gcgccgatcc cgtggccatc agatatcgtt    21300

ccgtctggct acgtcctgat gcaggggcag gcgtttgaca aatcagccta cccaaaactt    21360

gctgtcgcgt atccatcggg tgtgcttcct gatatgcgag gctggacaat caaggggaaa    21420

cccgccagcg gtcgtgctgt attgtctcag gaacaggatg gaattaagtc gcacacccac    21480

agtgccagtg catccggtac ggatttgggg acgaaaacca catcgtcgtt tgattacggg    21540

acgaaaacaa caggcagttt cgattacggc accaaatcga cgaataacac gggggctcat    21600

gctcacagtc tgagcggttc aacaggggcc gcgggtgctc atgcccacac aagtggttta    21660

aggatgaaca gttctggctg gagtcagtat ggaacagcaa ccattacagg aagtttatcc    21720

acagttaaag gaaccagcac acagggtatt gcttatttat cgaaaacgga cagtcagggc    21780

agccacagtc actcattgtc cggtacagcc gtgagtgccg gtgcacatgc gcatacagtt    21840

ggtattggtg cgcaccagca tccggttgtt atcggtgctc atgcccattc tttcagtatt    21900

ggttcacacg gacacaccat caccgttaac gctgcgggta acgcggaaaa caccgtcaaa    21960

aacattgcat ttaactatat tgtgaggctt gcataatggc attcagaatg agtgaacaac    22020

cacggaccat aaaaatttat aatctgctgg ccggaactaa tgaatttatt ggtgaaggtg    22080

acgcatatat tccgcctcat accggtctgc ctgcaaacag taccgatatt gcaccgccag    22140

atattccggc tggctttgtg gctgttttca acagtgatga ggcatcgtgg catctcgttg    22200

aagaccatcg gggtaaaacc gtctatgacg tggcttccgg cgacgcgtta tttatttctg    22260

aactcggtcc gttaccggaa aattttacct ggttatcgcc gggaggggaa tatcagaagt    22320

ggaacggcac agcctgggtg aaggatacgg aagcagaaaa actgttccgg atccgggagg    22380

cggaagaaac aaaaaaaagc ctgatgcagg tagccagtga gcatattgcg ccgcttcagg    22440

atgctgcaga tctggaaatt gcaacgaagg aagaaacctc gttgctggaa gcctggaaga    22500

agtatcgggt gttgctgaac cgtgttgata catcaactgc acctgatatt gagtggcctg    22560

ctgtccctgt tatggagtaa tcgttttgtg atatgccgca gaaacgttgt atgaaataac    22620

gttctgcggt tagttagtat attgtaaagc tgagtattgg tttatttggc gattattatc    22680

ttcaggagaa taatggaagt tctatgactc aattgttcat agtgtttaca tcaccgccaa    22740

ttgcttttaa gactgaacgc atgaaatatg gtttttcgtc atgttttgag tctgctgttg    22800

atatttctaa agtcggtttt ttttcttcgt tttctctaac tattttccat gaaatacatt    22860

tttgattatt atttgaatca attccaatta cctgaagtct ttcatctata attggcattg    22920

tatgtattgg tttattggag tagatgcttg cttttctgag ccatagctct gatatccaaa    22980

tgaagccata ggcatttgtt attttggctc tgtcagctgc ataacgccaa aaaatatatt    23040

tatctgcttg atcttcaaat gttgtattga ttaaatcaat tggatggaat tgtttatcat    23100

aaaaaattaa tgtttgaatg tgataaccgt cctttaaaaa agtcgtttct gcaagcttgg    23160

ctgtatagtc aactaactct tctgtcgaag tgatattttt aggcttatct accagtttta    23220

gacgctcttt aatatcttca ggaattattt tattgtcata ttgtatcatg ctaaatgaca    23280

atttgcttat ggagtaatct tttaatttta aataagttat tctcctggct tcatcaaata    23340

aagagtcgaa tgatgttggc gaaatcacat cgtcacccat tggattgttt atttgtatgc    23400

caagagagtt acagcagtta tacattctgc catagattat agctaaggca tgtaataatt    23460

cgtaatcttt tagcgtatta gcgacccatc gtctttctga tttaataata gatgattcag    23520

ttaaatatga aggtaatttc ttttgtgcaa gtctgactaa cttttttata ccaatgttta    23580

acatactttc atttgtaata aactcaatgt cattttcttc aatgtaagat gaaataagag    23640

tagcctttgc ctcgctatac atttctaaat cgccttgttt ttctatcgta ttgcgagaat    23700

ttttagccca agccattaat ggatcatttt tccatttttc aataacatta ttgttatacc    23760

aaatgtcata tcctataatc tggtttttgt ttttttgaat aataaatgtt actgttcttg    23820

cggtttggag gaattgattc aaattcaagc gaaataattc agggtcaaaa tatgtatcaa    23880

tgcagcattt gagcaagtgc gataaatctt taagtcttct ttcccatggt tttttagtca    23940

taaaactctc cattttgata ggttgcatgc tagatgctga tatattttag aggtgataaa    24000

attaactgct taactgtcaa tgtaatacaa gttgtttgat ctttgcaatg attcttatca    24060

gaaaccatat agtaaattag ttacacagga aatttttaat attattatta tcattcatta    24120

tgtattaaaa ttagagttgt ggcttggctc tgctaacacg ttgctcatag gagatatggt    24180

agagccgcag acacgtcgta tgcaggaacg tgctgcggct ggctggtgaa cttccgatag    24240

tgcgggtgtt gaatgatttc cagttgctac cgattttaca tattttttgc atgagagaat    24300

ttgtaccacc tcccaccgac catctatgac tgtacgccac tgtccctagg actgctatgt    24360

gccggagcgg acattacaaa cgtccttctc ggtgcatgcc actgttgcca atgacctgcc    24420

taggaattgg ttagcaagtt actaccggat tttgtaaaaa cagccctcct catataaaaa    24480

gtattcgttc acttccgata agcgtcgtaa ttttctatct ttcatcatat tctagatccc    24540

tctgaaaaaa tcttccgagt ttgctaggca ctgatacata actcttttcc aataattggg    24600

gaagtcattc aaatctataa taggtttcag atttgcttca ataaattctg actgtagctg    24660

ctgaaacgtt gcggttgaac tatatttcct tataactttt acgaaagagt ttctttgagt    24720

aatcacttca ctcaagtgct tccctgcctc caaacgatac ctgttagcaa tatttaatag    24780

cttgaaatga tgaagagctc tgtgtttgtc ttcctgcctc cagttcgccg ggcattcaac    24840

ataaaaactg atagcacccg gagttccgga aacgaaattt gcatataccc attgctcacg    24900

aaaaaaaatg tccttgtcga tatagggatg aatcgcttgg tgtacctcat ctactgcgaa    24960

aacttgacct ttctctccca tattgcagtc gcggcacgat ggaactaaat taataggcat    25020

caccgaaaat tcaggataat gtgcaatagg aagaaaatga tctatatttt ttgtctgtcc    25080

tatatcacca caaaatggac atttttcacc tgatgaaaca agcatgtcat cgtaatatgt    25140

tctagcgggt ttgtttttat ctcggagatt attttcataa agcttttcta atttaacctt    25200

tgtcaggtta ccaactacta aggttgtagg ctcaagaggg tgtgtcctgt cgtaggtaaa    25260

taactgacct gtcgagctta atattctata ttgttgttct ttctgcaaaa aagtggggaa    25320

gtgagtaatg aaattatttc taacatttat ctgcatcata ccttccgagc atttattaag    25380

catttcgcta taagttctcg ctggaagagg tagttttttc attgtacttt accttcatct    25440

ctgttcatta tcatcgcttt taaaacggtt cgaccttcta atcctatctg accattataa    25500

ttttttagaa tggtttcata agaaagctct gaatcaacgg actgcgataa taagtggtgg    25560

tatccagaat ttgtcacttc aagtaaaaac acctcacgag ttaaaacacc taagttctca    25620

ccgaatgtct caatatccgg acggataata tttattgctt ctcttgaccg taggactttc    25680

cacatgcagg attttggaac ctcttgcagt actactgggg aatgagttgc aattattgct    25740

acaccattgc gtgcatcgag taagtcgctt aatgttcgta aaaaagcaga gagcaaaggt    25800

ggatgcagat gaacctctgg ttcatcgaat aaaactaatg acttttcgcc aacgacatct    25860

actaatcttg tgatagtaaa taaaacaatt gcatgtccag agctcattcg aagcagatat    25920

ttctggatat tgtcataaaa caatttagtg aatttatcat cgtccacttg aatctgtggt    25980

tcattacgtc ttaactcttc atatttagaa atgaggctga tgagttccat atttgaaaag    26040

ttttcatcac tacttagttt tttgatagct tcaagccaga gttgtctttt tctatctact    26100

ctcatacaac caataaatgc tgaaatgaat tctaagcgga gatcgcctag tgattttaaa    26160

ctattgctgg cagcattctt gagtccaata taaaagtatt gtgtaccttt tgctgggtca    26220

ggttgttctt taggaggagt aaaaggatca aatgcactaa acgaaactga aacaagcgat    26280

cgaaaatatc cctttgggat tcttgactcg ataagtctat tattttcaga gaaaaaatat    26340

tcattgtttt ctgggttggt gattgcacca atcattccat tcaaaattgt tgttttacca    26400

cacccattcc gcccgataaa agcatgaatg ttcgtgctgg gcatagaatt aaccgtcacc    26460

tcaaaaggta tagttaaatc actgaatccg ggagcacttt ttctattaaa tgaaaagtgg    26520

aaatctgaca attctggcaa accatttaac acacgtgcga actgtccatg aatttctgaa    26580

agagttaccc ctctaagtaa tgaggtgtta aggacgcttt cattttcaat gtcggctaat    26640

cgatttggcc atactactaa atcctgaata gctttaagaa ggttatgttt aaaaccatcg    26700

cttaatttgc tgagattaac atagtagtca atgctttcac ctaaggaaaa aaacatttca    26760

gggagttgac tgaatttttt atctattaat gaataagtgc ttacttcttc tttttgacct    26820

acaaaaccaa ttttaacatt tccgatatcg catttttcac catgctcatc aaagacagta    26880

agataaaaca ttgtaacaaa ggaatagtca ttccaaccat ctgctcgtag gaatgcctta    26940

tttttttcta ctgcaggaat atacccgcct ctttcaataa cactaaactc caacatatag    27000

taacccttaa ttttattaaa ataaccgcaa tttatttggc ggcaacacag gatctctctt    27060

ttaagttact ctctattaca tacgttttcc atctaaaaat tagtagtatt gaacttaacg    27120

gggcatcgta ttgtagtttt ccatatttag ctttctgctt ccttttggat aacccactgt    27180

tattcatgtt gcatggtgca ctgtttatac caacgatata gtctattaat gcatatatag    27240

tatcgccgaa cgattagctc ttcaggcttc tgaagaagcg tttcaagtac taataagccg    27300

atagatagcc acggacttcg tagccatttt tcataagtgt taacttccgc tcctcgctca    27360

taacagacat tcactacagt tatggcggaa aggtatgcat gctgggtgtg gggaagtcgt    27420

gaaagaaaag aagtcagctg cgtcgtttga catcactgct atcttcttac tggttatgca    27480

ggtcgtagtg ggtggcacac aaagctttgc actggattgc gaggctttgt gcttctctgg    27540

agtgcgacag gtttgatgac aaaaaattag cgcaagaaga caaaaatcac cttgcgctaa    27600

tgctctgtta caggtcacta ataccatcta agtagttgat tcatagtgac tgcatatgtt    27660

gtgttttaca gtattatgta gtctgttttt tatgcaaaat ctaatttaat atattgatat    27720

ttatatcatt ttacgtttct cgttcagctt ttttatacta agttggcatt ataaaaaagc    27780

attgcttatc aatttgttgc aacgaacagg tcactatcag tcaaaataaa atcattattt    27840

gatttcaatt ttgtcccact ccctgcctct gtcatcacga tactgtgatg ccatggtgtc    27900

cgacttatgc ccgagaagat gttgagcaaa cttatcgctt atctgcttct catagagtct    27960

tgcagacaaa ctgcgcaact cgtgaaaggt aggcggatcc ccttcgaagg aaagacctga    28020

tgcttttcgt gcgcgcataa aataccttga tactgtgccg gatgaaagcg gttcgcgacg    28080

agtagatgca attatggttt ctccgccaag aatctctttg catttatcaa gtgtttcctt    28140

cattgatatt ccgagagcat caatatgcaa tgctgttggg atggcaattt ttacgcctgt    28200

tttgctttgc tcgacataaa gatatccatc tacgatatca gaccacttca tttcgcataa    28260

atcaccaact cgttgcccgg taacaacagc cagttccatt gcaagtctga gccaacatgg    28320

tgatgattct gctgcttgat aaattttcag gtattcgtca gccgtaagtc ttgatctcct    28380

tacctctgat tttgctgcgc gagtggcagc gacatggttt gttgttatat ggccttcagc    28440

tattgcctct cggaatgcat cgctcagtgt tgatctgatt aacttggctg acgccgcctt    28500

gccctcgtct atgtatccat tgagcattgc cgcaatttct tttgtggtga tgtcttcaag    28560

tggagcatca ggcagacccc tccttattgc tttaattttg ctcatgtaat ttatgagtgt    28620

cttctgcttg attcctctgc tggccaggat tttttcgtag cgatcaagcc atgaatgtaa    28680

cgtaacggaa ttatcactgt tgattctcgc tgtcagaggc ttgtgtttgt gtcctgaaaa    28740

taactcaatg ttggcctgta tagcttcagt gattgcgatt cgcctgtctc tgcctaatcc    28800

aaactcttta cccgtccttg ggtccctgta gcagtaatat ccattgtttc ttatataaag    28860

gttagggggt aaatcccggc gctcatgact tcgccttctt cccatttctg atcctcttca    28920

aaaggccacc tgttactggt cgatttaagt caacctttac cgctgattcg tggaacagat    28980

actctcttcc atccttaacc ggaggtggga atatcctgca ttcccgaacc catcgacgaa    29040

ctgtttcaag gcttcttgga cgtcgctggc gtgcgttcca ctcctgaagt gtcaagtaca    29100

tcgcaaagtc tccgcaatta cacgcaagaa aaaaccgcca tcaggcggct tggtgttctt    29160

tcagttcttc aattcgaata ttggttacgt ctgcatgtgc tatctgcgcc catatcatcc    29220

agtggtcgta gcagtcgttg atgttctccg cttcgataac tctgttgaat ggctctccat    29280

tccattctcc tgtgactcgg aagtgcattt atcatctcca taaaacaaaa cccgccgtag    29340

cgagttcaga taaaataaat ccccgcgagt gcgaggattg ttatgtaata ttgggtttaa    29400

tcatctatat gttttgtaca gagagggcaa gtatcgtttc caccgtactc gtgataataa    29460

ttttgcacgg tatcagtcat ttctcgcaca ttgcagaatg gggatttgtc ttcattagac    29520

ttataaacct tcatggaata tttgtatgcc gactctatat ctataccttc atctacataa    29580

acaccttcgt gatgtctgca tggagacaag acaccggatc tgcacaacat tgataacgcc    29640

caatcttttt gctcagactc taactcattg atactcattt ataaactcct tgcaatgtat    29700

gtcgtttcag ctaaacggta tcagcaatgt ttatgtaaag aaacagtaag ataatactca    29760

acccgatgtt tgagtacggt catcatctga cactacagac tctggcatcg ctgtgaagac    29820

gacgcgaaat tcagcatttt cacaagcgtt atcttttaca aaaccgatct cactctcctt    29880

tgatgcgaat gccagcgtca gacatcatat gcagatactc acctgcatcc tgaacccatt    29940

gacctccaac cccgtaatag cgatgcgtaa tgatgtcgat agttactaac gggtcttgtt    30000

cgattaactg ccgcagaaac tcttccaggt caccagtgca gtgcttgata acaggagtct    30060

tcccaggatg gcgaacaaca agaaactggt ttccgtcttc acggacttcg ttgctttcca    30120

gtttagcaat acgcttactc ccatccgaga taacaccttc gtaatactca cgctgctcgt    30180

tgagttttga ttttgctgtt tcaagctcaa cacgcagttt ccctactgtt agcgcaatat    30240

cctcgttctc ctggtcgcgg cgtttgatgt attgctggtt tctttcccgt tcatccagca    30300

gttccagcac aatcgatggt gttaccaatt catggaaaag gtctgcgtca aatccccagt    30360

cgtcatgcat tgcctgctct gccgcttcac gcagtgcctg agagttaatt tcgctcactt    30420

cgaacctctc tgtttactga taagttccag atcctcctgg caacttgcac aagtccgaca    30480

accctgaacg accaggcgtc ttcgttcatc tatcggatcg ccacactcac aacaatgagt    30540

ggcagatata gcctggtggt tcaggcggcg catttttatt gctgtgttgc gctgtaattc    30600

ttctatttct gatgctgaat caatgatgtc tgccatcttt cattaatccc tgaactgttg    30660

gttaatacgc ttgagggtga atgcgaataa taaaaaagga gcctgtagct ccctgatgat    30720

tttgcttttc atgttcatcg ttccttaaag acgccgttta acatgccgat tgccaggctt    30780

aaatgagtcg gtgtgaatcc catcagcgtt accgtttcgc ggtgcttctt cagtacgcta    30840

cggcaaatgt catcgacgtt tttatccgga aactgctgtc tggctttttt tgatttcaga    30900

attagcctga cgggcaatgc tgcgaagggc gttttcctgc tgaggtgtca ttgaacaagt    30960

cccatgtcgg caagcataag cacacagaat atgaagcccg ctgccagaaa aatgcattcc    31020

gtggttgtca tacctggttt ctctcatctg cttctgcttt cgccaccatc atttccagct    31080

tttgtgaaag ggatgcggct aacgtatgaa attcttcgtc tgtttctact ggtattggca    31140

caaacctgat tccaatttga gcaaggctat gtgccatctc gatactcgtt cttaactcaa    31200

cagaagatgc tttgtgcata cagcccctcg tttattattt atctcctcag ccagccgctg    31260

tgctttcagt ggatttcgga taacagaaag gccgggaaat acccagcctc gctttgtaac    31320

ggagtagacg aaagtgattg cgcctacccg gatattatcg tgaggatgcg tcatcgccat    31380

tgctccccaa atacaaaacc aatttcagcc agtgcctcgt ccattttttc gatgaactcc    31440

ggcacgatct cgtcaaaact cgccatgtac ttttcatccc gctcaatcac gacataatgc    31500

aggccttcac gcttcatacg cgggtcatag ttggcaaagt accaggcatt ttttcgcgtc    31560

acccacatgc tgtactgcac ctgggccatg taagctgact ttatggcctc gaaaccaccg    31620

agccggaact tcatgaaatc ccgggaggta aacgggcatt tcagttcaag gccgttgccg    31680

tcactgcata aaccatcggg agagcaggcg gtacgcatac tttcgtcgcg atagatgatc    31740

ggggattcag taacattcac gccggaagtg aattcaaaca gggttctggc gtcgttctcg    31800

tactgttttc cccaggccag tgctttagcg ttaacttccg gagccacacc ggtgcaaacc    31860

tcagcaagca gggtgtggaa gtaggacatt ttcatgtcag gccacttctt tccggagcgg    31920

ggttttgcta tcacgttgtg aacttctgaa gcggtgatga cgccgagccg taatttgtgc    31980

cacgcatcat ccccctgttc gacagctctc acatcgatcc cggtacgctg caggataatg    32040

tccggtgtca tgctgccacc ttctgctctg cggctttctg tttcaggaat ccaagagctt    32100

ttactgcttc ggcctgtgtc agttctgacg atgcacgaat gtcgcggcga aatatctggg    32160

aacagagcgg caataagtcg tcatcccatg ttttatccag ggcgatcagc agagtgttaa    32220

tctcctgcat ggtttcatcg ttaaccggag tgatgtcgcg ttccggctga cgttctgcag    32280

tgtatgcagt attttcgaca atgcgctcgg cttcatcctt gtcatagata ccagcaaatc    32340

cgaaggccag acgggcacac tgaatcatgg ctttatgacg taacatccgt ttgggatgcg    32400

actgccacgg ccccgtgatt tctctgcctt cgcgagtttt gaatggttcg cggcggcatt    32460

catccatcca ttcggtaacg cagatcggat gattacggtc cttgcggtaa atccggcatg    32520

tacaggattc attgtcctgc tcaaagtcca tgccatcaaa ctgctggttt tcattgatga    32580

tgcgggacca gccatcaacg cccaccaccg gaacgatgcc attctgctta tcaggaaagg    32640

cgtaaatttc tttcgtccac ggattaaggc cgtactggtt ggcaacgatc agtaatgcga    32700

tgaactgcgc atcgctggca tcacctttaa atgccgtctg gcgaagagtg gtgatcagtt    32760

cctgtgggtc gacagaatcc atgccgacac gttcagccag cttcccagcc agcgttgcga    32820

gtgcagtact cattcgtttt atacctctga atcaatatca acctggtggt gagcaatggt    32880

ttcaaccatg taccggatgt gttctgccat gcgctcctga aactcaacat cgtcatcaaa    32940

cgcacgggta atggattttt tgctggcccc gtggcgttgc aaatgatcga tgcatagcga    33000

ttcaaacagg tgctggggca ggcctttttc catgtcgtct gccagttctg cctctttctc    33060

ttcacgggcg agctgctggt agtgacgcgc ccagctctga gcctcaagac gatcctgaat    33120

gtaataagcg ttcatggctg aactcctgaa atagctgtga aaatatcgcc cgcgaaatgc    33180

cgggctgatt aggaaaacag gaaagggggt tagtgaatgc ttttgcttga tctcagtttc    33240

agtattaata tccatttttt ataagcgtcg acggcttcac gaaacatctt ttcatcgcca    33300

ataaaagtgg cgatagtgaa tttagtctgg atagccataa gtgtttgatc cattctttgg    33360

gactcctggc tgattaagta tgtcgataag gcgtttccat ccgtcacgta atttacgggt    33420

gattcgttca agtaaagatt cggaagggca gccagcaaca ggccaccctg caatggcata    33480

ttgcatggtg tgctccttat ttatacataa cgaaaaacgc ctcgagtgaa gcgttattgg    33540

tatgcggtaa aaccgcactc aggcggcctt gatagtcata tcatctgaat caaatattcc    33600

tgatgtatcg atatcggtaa ttcttattcc ttcgctacca tccattggag gccatccttc    33660

ctgaccattt ccatcattcc agtcgaactc acacacaaca ccatatgcat ttaagtcgct    33720

tgaaattgct ataagcagag catgttgcgc cagcatgatt aatacagcat ttaatacaga    33780

gccgtgttta ttgagtcggt attcagagtc tgaccagaaa ttattaatct ggtgaagttt    33840

ttcctctgtc attacgtcat ggtcgatttc aatttctatt gatgctttcc agtcgtaatc    33900

aatgatgtat tttttgatgt ttgacatctg ttcatatcct cacagataaa aaatcgccct    33960

cacactggag ggcaaagaag atttccaata atcagaacaa gtcggctcct gtttagttac    34020

gagcgacatt gctccgtgta ttcactcgtt ggaatgaata cacagtgcag tgtttattct    34080

gttatttatg ccaaaaataa aggccactat caggcagctt tgttgttctg tttaccaagt    34140

tctctggcaa tcattgccgt cgttcgtatt gcccatttat cgacatattt cccatcttcc    34200

attacaggaa acatttcttc aggcttaacc atgcattccg attgcagctt gcatccattg    34260

catcgcttga attgtccaca ccattgattt ttatcaatag tcgtagtcat acggatagtc    34320

ctggtattgt tccatcacat cctgaggatg ctcttcgaac tcttcaaatt cttcttccat    34380

atatcacctt aaatagtgga ttgcggtagt aaagattgtg cctgtctttt aaccacatca    34440

ggctcggtgg ttctcgtgta cccctacagc gagaaatcgg ataaactatt acaaccccta    34500

cagtttgatg agtatagaaa tggatccact cgttattctc ggacgagtgt tcagtaatga    34560

acctctggag agaaccatgt atatgatcgt tatctgggtt ggacttctgc ttttaagccc    34620

agataactgg cctgaatatg ttaatgagag aatcggtatt cctcatgtgt ggcatgtttt    34680

cgtctttgct cttgcatttt cgctagcaat taatgtgcat cgattatcag ctattgccag    34740

cgccagatat aagcgattta agctaagaaa acgcattaag atgcaaaacg ataaagtgcg    34800

atcagtaatt caaaacctta cagaagagca atctatggtt ttgtgcgcag cccttaatga    34860

aggcaggaag tatgtggtta catcaaaaca attcccatac attagtgagt tgattgagct    34920

tggtgtgttg aacaaaactt tttcccgatg gaatggaaag catatattat tccctattga    34980

ggatatttac tggactgaat tagttgccag ctatgatcca tataatattg agataaagcc    35040

aaggccaata tctaagtaac tagataagag gaatcgattt tcccttaatt ttctggcgtc    35100

cactgcatgt tatgccgcgt tcgccaggct tgctgtacca tgtgcgctga ttcttgcgct    35160

caatacgttg caggttgctt tcaatctgtt tgtggtattc agccagcact gtaaggtcta    35220

tcggatttag tgcgctttct actcgtgatt tcggtttgcg attcagcgag agaatagggc    35280

ggttaactgg ttttgcgctt accccaacca acaggggatt tgctgctttc cattgagcct    35340

gtttctctgc gcgacgttcg cggcggcgtg tttgtgcatc catctggatt ctcctgtcag    35400

ttagctttgg tggtgtgtgg cagttgtagt cctgaacgaa aaccccccgc gattggcaca    35460

ttggcagcta atccggaatc gcacttacgg ccaatgcttc gtttcgtatc acacacccca    35520

aagccttctg ctttgaatgc tgcccttctt cagggcttaa tttttaagag cgtcaccttc    35580

atggtggtca gtgcgtcctg ctgatgtgct cagtatcacc gccagtggta tttatgtcaa    35640

caccgccaga gataatttat caccgcagat ggttatctgt atgtttttta tatgaattta    35700

ttttttgcag gggggcattg tttggtaggt gagagatctg aattgctatg tttagtgagt    35760

tgtatctatt tatttttcaa taaatacaat tggttatgtg ttttgggggc gatcgtgagg    35820

caaagaaaac ccggcgctga ggccgggtta ttcttgttct ctggtcaaat tatatagttg    35880

gaaaacaagg atgcatatat gaatgaacga tgcagaggca atgccgatgg cgatagtggg    35940

tatcatgtag ccgcttatgc tggaaagaag caataacccg cagaaaaaca aagctccaag    36000

ctcaacaaaa ctaagggcat agacaataac taccgatgtc atatacccat actctctaat    36060

cttggccagt cggcgcgttc tgcttccgat tagaaacgtc aaggcagcaa tcaggattgc    36120

aatcatggtt cctgcatatg atgacaatgt cgccccaaga ccatctctat gagctgaaaa    36180

agaaacacca ggaatgtagt ggcggaaaag gagatagcaa atgcttacga taacgtaagg    36240

aattattact atgtaaacac caggcatgat tctgttccgc ataattactc ctgataatta    36300

atccttaact ttgcccacct gccttttaaa acattccagt atatcacttt tcattcttgc    36360

gtagcaatat gccatctctt cagctatctc agcattggtg accttgttca gaggcgctga    36420

gagatggcct ttttctgata gataatgttc tgttaaaata tctccggcct catcttttgc    36480

ccgcaggcta atgtctgaaa attgaggtga cgggttaaaa ataatatcct tggcaacctt    36540

ttttatatcc cttttaaatt ttggcttaat gactatatcc aatgagtcaa aaagctcccc    36600

ttcaatatct gttgccccta agacctttaa tatatcgcca aatacaggta gcttggcttc    36660

taccttcacc gttgttcggc cgatgaaatg catatgcata acatcgtctt tggtggttcc    36720

cctcatcagt ggctctatct gaacgcgctc tccactgctt aatgacattc ctttcccgat    36780

taaaaaatct gtcagatcgg atgtggtcgg cccgaaaaca gttctggcaa aaccaatggt    36840

gtcgccttca acaaacaaaa aagatgggaa tcccaatgat tcgtcatctg cgaggctgtt    36900

cttaatatct tcaactgaag ctttagagcg atttatcttc tgaaccagac tcttgtcatt    36960

tgttttggta aagagaaaag tttttccatc gattttatga atatacaaat aattggagcc    37020

aacctgcagg tgatgattat cagccagcag agaattaagg aaaacagaca ggtttattga    37080

gcgcttatct ttccctttat ttttgctgcg gtaagtcgca taaaaaccat tcttcataat    37140

tcaatccatt tactatgtta tgttctgagg ggagtgaaaa ttcccctaat tcgatgaaga    37200

ttcttgctca attgttatca gctatgcgcc gaccagaaca ccttgccgat cagccaaacg    37260

tctcttcagg ccactgacta gcgataactt tccccacaac ggaacaactc tcattgcatg    37320

ggatcattgg gtactgtggg tttagtggtt gtaaaaacac ctgaccgcta tccctgatca    37380

gtttcttgaa ggtaaactca tcacccccaa gtctggctat gcagaaatca cctggctcaa    37440

cagcctgctc agggtcaacg agaattaaca ttccgtcagg aaagcttggc ttggagcctg    37500

ttggtgcggt catggaatta ccttcaacct caagccagaa tgcagaatca ctggcttttt    37560

tggttgtgct tacccatctc tccgcatcac ctttggtaaa ggttctaagc ttaggtgaga    37620

acatccctgc ctgaacatga gaaaaaacag ggtactcata ctcacttcta agtgacggct    37680

gcatactaac cgcttcatac atctcgtaga tttctctggc gattgaaggg ctaaattctt    37740

caacgctaac tttgagaatt tttgtaagca atgcggcgtt ataagcattt aatgcattga    37800

tgccattaaa taaagcacca acgcctgact gccccatccc catcttgtct gcgacagatt    37860

cctgggataa gccaagttca tttttctttt tttcataaat tgctttaagg cgacgtgcgt    37920

cctcaagctg ctcttgtgtt aatggtttct tttttgtgct catacgttaa atctatcacc    37980

gcaagggata aatatctaac accgtgcgtg ttgactattt tacctctggc ggtgataatg    38040

gttgcatgta ctaaggaggt tgtatggaac aacgcataac cctgaaagat tatgcaatgc    38100

gctttgggca aaccaagaca gctaaagatc tcggcgtata tcaaagcgcg atcaacaagg    38160

ccattcatgc aggccgaaag atttttttaa ctataaacgc tgatggaagc gtttatgcgg    38220

aagaggtaaa gcccttcccg agtaacaaaa aaacaacagc ataaataacc ccgctcttac    38280

acattccagc cctgaaaaag ggcatcaaat taaaccacac ctatggtgta tgcatttatt    38340

tgcatacatt caatcaattg ttatctaagg aaatacttac atatggttcg tgcaaacaaa    38400

cgcaacgagg ctctacgaat cgagagtgcg ttgcttaaca aaatcgcaat gcttggaact    38460

gagaagacag cggaagctgt gggcgttgat aagtcgcaga tcagcaggtg gaagagggac    38520

tggattccaa agttctcaat gctgcttgct gttcttgaat ggggggtcgt tgacgacgac    38580

atggctcgat tggcgcgaca agttgctgcg attctcacca ataaaaaacg cccggcggca    38640

accgagcgtt ctgaacaaat ccagatggag ttctgaggtc attactggat ctatcaacag    38700

gagtcattat gacaaataca gcaaaaatac tcaacttcgg cagaggtaac tttgccggac    38760

aggagcgtaa tgtggcagat ctcgatgatg gttacgccag actatcaaat atgctgcttg    38820

aggcttattc gggcgcagat ctgaccaagc gacagtttaa agtgctgctt gccattctgc    38880

gtaaaaccta tgggtggaat aaaccaatgg acagaatcac cgattctcaa cttagcgaga    38940

ttacaaagtt acctgtcaaa cggtgcaatg aagccaagtt agaactcgtc agaatgaata    39000

ttatcaagca gcaaggcggc atgtttggac caaataaaaa catctcagaa tggtgcatcc    39060

ctcaaaacga gggaaaatcc cctaaaacga gggataaaac atccctcaaa ttgggggatt    39120

gctatccctc aaaacagggg gacacaaaag acactattac aaaagaaaaa agaaaagatt    39180

attcgtcaga gaattctggc gaatcctctg accagccaga aaacgacctt tctgtggtga    39240

aaccggatgc tgcaattcag agcggcagca agtgggggac agcagaagac ctgaccgccg    39300

cagagtggat gtttgacatg gtgaagacta tcgcaccatc agccagaaaa ccgaattttg    39360

ctgggtgggc taacgatatc cgcctgatgc gtgaacgtga cggacgtaac caccgcgaca    39420

tgtgtgtgct gttccgctgg gcatgccagg acaacttctg gtccggtaac gtgctgagcc    39480

cggccaaact ccgcgataag tggacccaac tcgaaatcaa ccgtaacaag caacaggcag    39540

gcgtgacagc cagcaaacca aaactcgacc tgacaaacac agactggatt tacggggtgg    39600

atctatgaaa aacatcgccg cacagatggt taactttgac cgtgagcaga tgcgtcggat    39660

cgccaacaac atgccggaac agtacgacga aaagccgcag gtacagcagg tagcgcagat    39720

catcaacggt gtgttcagcc agttactggc aactttcccg gcgagcctgg ctaaccgtga    39780

ccagaacgaa gtgaacgaaa tccgtcgcca gtgggttctg gcttttcggg aaaacgggat    39840

caccacgatg gaacaggtta acgcaggaat gcgcgtagcc cgtcggcaga atcgaccatt    39900

tctgccatca cccgggcagt ttgttgcatg gtgccgggaa gaagcatccg ttaccgccgg    39960

actgccaaac gtcagcgagc tggttgatat ggtttacgag tattgccgga agcgaggcct    40020

gtatccggat gcggagtctt atccgtggaa atcaaacgcg cactactggc tggttaccaa    40080

cctgtatcag aacatgcggg ccaatgcgct tactgatgcg gaattacgcc gtaaggccgc    40140

agatgagctt gtccatatga ctgcgagaat taaccgtggt gaggcgatcc ctgaaccagt    40200

aaaacaactt cctgtcatgg gcggtagacc tctaaatcgt gcacaggctc tggcgaagat    40260

cgcagaaatc aaagctaagt tcggactgaa aggagcaagt gtatgacggg caaagaggca    40320

attattcatt acctggggac gcataatagc ttctgtgcgc cggacgttgc cgcgctaaca    40380

ggcgcaacag taaccagcat aaatcaggcc gcggctaaaa tggcacgggc aggtcttctg    40440

gttatcgaag gtaaggtctg gcgaacggtg tattaccggt ttgctaccag ggaagaacgg    40500

gaaggaaaga tgagcacgaa cctggttttt aaggagtgtc gccagagtgc cgcgatgaaa    40560

cgggtattgg cggtatatgg agttaaaaga tgaccatcta cattactgag ctaataacag    40620

gcctgctggt aatcgcaggc ctttttattt gggggagagg gaagtcatga aaaaactaac    40680

ctttgaaatt cgatctccag cacatcagca aaacgctatt cacgcagtac agcaaatcct    40740

tccagaccca accaaaccaa tcgtagtaac cattcaggaa cgcaaccgca gcttagacca    40800

aaacaggaag ctatgggcct gcttaggtga cgtctctcgt caggttgaat ggcatggtcg    40860

ctggctggat gcagaaagct ggaagtgtgt gtttaccgca gcattaaagc agcaggatgt    40920

tgttcctaac cttgccggga atggctttgt ggtaataggc cagtcaacca gcaggatgcg    40980

tgtaggcgaa tttgcggagc tattagagct tatacaggca ttcggtacag agcgtggcgt    41040

taagtggtca gacgaagcga gactggctct ggagtggaaa gcgagatggg gagacagggc    41100

tgcatgataa atgtcgttag tttctccggt ggcaggacgt cagcatattt gctctggcta    41160

atggagcaaa agcgacgggc aggtaaagac gtgcattacg ttttcatgga tacaggttgt    41220

gaacatccaa tgacatatcg gtttgtcagg gaagttgtga agttctggga tataccgctc    41280

accgtattgc aggttgatat caacccggag cttggacagc caaatggtta tacggtatgg    41340

gaaccaaagg atattcagac gcgaatgcct gttctgaagc catttatcga tatggtaaag    41400

aaatatggca ctccatacgt cggcggcgcg ttctgcactg acagattaaa actcgttccc    41460

ttcaccaaat actgtgatga ccatttcggg cgagggaatt acaccacgtg gattggcatc    41520

agagctgatg aaccgaagcg gctaaagcca aagcctggaa tcagatatct tgctgaactg    41580

tcagactttg agaaggaaga tatcctcgca tggtggaagc aacaaccatt cgatttgcaa    41640

ataccggaac atctcggtaa ctgcatattc tgcattaaaa aatcaacgca aaaaatcgga    41700

cttgcctgca aagatgagga gggattgcag cgtgttttta atgaggtcat cacgggatcc    41760

catgtgcgtg acggacatcg ggaaacgcca aaggagatta tgtaccgagg aagaatgtcg    41820

ctggacggta tcgcgaaaat gtattcagaa aatgattatc aagccctgta tcaggacatg    41880

gtacgagcta aaagattcga taccggctct tgttctgagt catgcgaaat atttggaggg    41940

cagcttgatt tcgacttcgg gagggaagct gcatgatgcg atgttatcgg tgcggtgaat    42000

gcaaagaaga taaccgcttc cgaccaaatc aaccttactg gaatcgatgg tgtctccggt    42060

gtgaaagaac accaacaggg gtgttaccac taccgcagga aaaggaggac gtgtggcgag    42120

acagcgacga agtatcaccg acataatctg cgaaaactgc aaataccttc caacgaaacg    42180

caccagaaat aaacccaagc caatcccaaa agaatctgac gtaaaaacct tcaactacac    42240

ggctcacctg tgggatatcc ggtggctaag acgtcgtgcg aggaaaacaa ggtgattgac    42300

caaaatcgaa gttacgaaca agaaagcgtc gagcgagctt taacgtgcgc taactgcggt    42360

cagaagctgc atgtgctgga agttcacgtg tgtgagcact gctgcgcaga actgatgagc    42420

gatccgaata gctcgatgca cgaggaagaa gatgatggct aaaccagcgc gaagacgatg    42480

taaaaacgat gaatgccggg aatggtttca ccctgcattc gctaatcagt ggtggtgctc    42540

tccagagtgt ggaaccaaga tagcactcga acgacgaagt aaagaacgcg aaaaagcgga    42600

aaaagcagca gagaagaaac gacgacgaga ggagcagaaa cagaaagata aacttaagat    42660

tcgaaaactc gccttaaagc cccgcagtta ctggattaaa caagcccaac aagccgtaaa    42720

cgccttcatc agagaaagag accgcgactt accatgtatc tcgtgcggaa cgctcacgtc    42780

tgctcagtgg gatgccggac attaccggac aactgctgcg gcacctcaac tccgatttaa    42840

tgaacgcaat attcacaagc aatgcgtggt gtgcaaccag cacaaaagcg gaaatctcgt    42900

tccgtatcgc gtcgaactga ttagccgcat cgggcaggaa gcagtagacg aaatcgaatc    42960

aaaccataac cgccatcgct ggactatcga agagtgcaag gcgatcaagg cagagtacca    43020

acagaaactc aaagacctgc gaaatagcag aagtgaggcc gcatgacgtt ctcagtaaaa    43080

accattccag acatgctcgt tgaaacatac ggaaatcaga cagaagtagc acgcagactg    43140

aaatgtagtc gcggtacggt cagaaaatac gttgatgata aagacgggaa aatgcacgcc    43200

atcgtcaacg acgttctcat ggttcatcgc ggatggagtg aaagagatgc gctattacga    43260

aaaaattgat ggcagcaaat accgaaatat ttgggtagtt ggcgatctgc acggatgcta    43320

cacgaacctg atgaacaaac tggatacgat tggattcgac aacaaaaaag acctgcttat    43380

ctcggtgggc gatttggttg atcgtggtgc agagaacgtt gaatgcctgg aattaatcac    43440

attcccctgg ttcagagctg tacgtggaaa ccatgagcaa atgatgattg atggcttatc    43500

agagcgtgga aacgttaatc actggctgct taatggcggt ggctggttct ttaatctcga    43560

ttacgacaaa gaaattctgg ctaaagctct tgcccataaa gcagatgaac ttccgttaat    43620

catcgaactg gtgagcaaag ataaaaaata tgttatctgc cacgccgatt atccctttga    43680

cgaatacgag tttggaaagc cagttgatca tcagcaggta atctggaacc gcgaacgaat    43740

cagcaactca caaaacggga tcgtgaaaga aatcaaaggc gcggacacgt tcatctttgg    43800

tcatacgcca gcagtgaaac cactcaagtt tgccaaccaa atgtatatcg ataccggcgc    43860

agtgttctgc ggaaacctaa cattgattca ggtacaggga gaaggcgcat gagactcgaa    43920

agcgtagcta aatttcattc gccaaaaagc ccgatgatga gcgactcacc acgggccacg    43980

gcttctgact ctctttccgg tactgatgtg atggctgcta tggggatggc gcaatcacaa    44040

gccggattcg gtatggctgc attctgcggt aagcacgaac tcagccagaa cgacaaacaa    44100

aaggctatca actatctgat gcaatttgca cacaaggtat cggggaaata ccgtggtgtg    44160

gcaaagcttg aaggaaatac taaggcaaag gtactgcaag tgctcgcaac attcgcttat    44220

gcggattatt gccgtagtgc cgcgacgccg ggggcaagat gcagagattg ccatggtaca    44280

ggccgtgcgg ttgatattgc caaaacagag ctgtggggga gagttgtcga gaaagagtgc    44340

ggaagatgca aaggcgtcgg ctattcaagg atgccagcaa gcgcagcata tcgcgctgtg    44400

acgatgctaa tcccaaacct tacccaaccc acctggtcac gcactgttaa gccgctgtat    44460

gacgctctgg tggtgcaatg ccacaaagaa gagtcaatcg cagacaacat tttgaatgcg    44520

gtcacacgtt agcagcatga ttgccacgga tggcaacata ttaacggcat gatattgact    44580

tattgaataa aattgggtaa atttgactca acgatgggtt aattcgctcg ttgtggtagt    44640

gagatgaaaa gaggcggcgc ttactaccga ttccgcctag ttggtcactt cgacgtatcg    44700

tctggaactc caaccatcgc aggcagagag gtctgcaaaa tgcaatcccg aaacagttcg    44760

caggtaatag ttagagcctg cataacggtt tcgggatttt ttatatctgc acaacaggta    44820

agagcattga gtcgataatc gtgaagagtc ggcgagcctg gttagccagt gctctttccg    44880

ttgtgctgaa ttaagcgaat accggaagca gaaccggatc accaaatgcg tacaggcgtc    44940

atcgccgccc agcaacagca caacccaaac tgagccgtag ccactgtctg tcctgaattc    45000

attagtaata gttacgctgc ggccttttac acatgacctt cgtgaaagcg ggtggcagga    45060

ggtcgcgcta acaacctcct gccgttttgc ccgtgcatat cggtcacgaa caaatctgat    45120

tactaaacac agtagcctgg atttgttcta tcagtaatcg accttattcc taattaaata    45180

gagcaaatcc ccttattggg ggtaagacat gaagatgcca gaaaaacatg acctgttggc    45240

cgccattctc gcggcaaagg aacaaggcat cggggcaatc cttgcgtttg caatggcgta    45300

ccttcgcggc agatataatg gcggtgcgtt tacaaaaaca gtaatcgacg caacgatgtg    45360

cgccattatc gcctagttca ttcgtgacct tctcgacttc gccggactaa gtagcaatct    45420

cgcttatata acgagcgtgt ttatcggcta catcggtact gactcgattg gttcgcttat    45480

caaacgcttc gctgctaaaa aagccggagt agaagatggt agaaatcaat aatcaacgta    45540

aggcgttcct cgatatgctg gcgtggtcgg agggaactga taacggacgt cagaaaacca    45600

gaaatcatgg ttatgacgtc attgtaggcg gagagctatt tactgattac tccgatcacc    45660

ctcgcaaact tgtcacgcta aacccaaaac tcaaatcaac aggcgccgga cgctaccagc    45720

ttctttcccg ttggtgggat gcctaccgca agcagcttgg cctgaaagac ttctctccga    45780

aaagtcagga cgctgtggca ttgcagcaga ttaaggagcg tggcgcttta cctatgattg    45840

atcgtggtga tatccgtcag gcaatcgacc gttgcagcaa tatctgggct tcactgccgg    45900

gcgctggtta tggtcagttc gagcataagg ctgacagcct gattgcaaaa ttcaaagaag    45960

cgggcggaac ggtcagagag attgatgtat gagcagagtc accgcgatta tctccgctct    46020

ggttatctgc atcatcgtct gcctgtcatg ggctgttaat cattaccgtg ataacgccat    46080

tacctacaaa gcccagcgcg acaaaaatgc cagagaactg aagctggcga acgcggcaat    46140

tactgacatg cagatgcgtc agcgtgatgt tgctgcgctc gatgcaaaat acacgaagga    46200

gttagctgat gctaaagctg aaaatgatgc tctgcgtgat gatgttgccg ctggtcgtcg    46260

tcggttgcac atcaaagcag tctgtcagtc agtgcgtgaa gccaccaccg cctccggcgt    46320

ggataatgca gcctcccccc gactggcaga caccgctgaa cgggattatt tcaccctcag    46380

agagaggctg atcactatgc aaaaacaact ggaaggaacc cagaagtata ttaatgagca    46440

gtgcagatag agttgcccat atcgatgggc aactcatgca attattgtga gcaatacaca    46500

cgcgcttcca gcggagtata aatgcctaaa gtaataaaac cgagcaatcc atttacgaat    46560

gtttgctggg tttctgtttt aacaacattt tctgcgccgc cacaaatttt ggctgcatcg    46620

acagttttct tctgcccaat tccagaaacg aagaaatgat gggtgatggt ttcctttggt    46680

gctactgctg ccggtttgtt ttgaacagta aacgtctgtt gagcacatcc tgtaataagc    46740

agggccagcg cagtagcgag tagcattttt ttcatggtgt tattcccgat gctttttgaa    46800

gttcgcagaa tcgtatgtgt agaaaattaa acaaacccta aacaatgagt tgaaatttca    46860

tattgttaat atttattaat gtatgtcagg tgcgatgaat cgtcattgta ttcccggatt    46920

aactatgtcc acagccctga cggggaactt ctctgcggga gtgtccggga ataattaaaa    46980

cgatgcacac agggtttagc gcgtacacgt attgcattat gccaacgccc cggtgctgac    47040

acggaagaaa ccggacgtta tgatttagcg tggaaagatt tgtgtagtgt tctgaatgct    47100

ctcagtaaat agtaatgaat tatcaaaggt atagtaatat cttttatgtt catggatatt    47160

tgtaacccat cggaaaactc ctgctttagc aagattttcc ctgtattgct gaaatgtgat    47220

ttctcttgat ttcaacctat cataggacgt ttctataaga tgcgtgtttc ttgagaattt    47280

aacatttaca acctttttaa gtccttttat taacacggtg ttatcgtttt ctaacacgat    47340

gtgaatatta tctgtggcta gatagtaaat ataatgtgag acgttgtgac gttttagttc    47400

agaataaaac aattcacagt ctaaatcttt tcgcacttga tcgaatattt ctttaaaaat    47460

ggcaacctga gccattggta aaaccttcca tgtgatacga gggcgcgtag tttgcattat    47520

cgtttttatc gtttcaatct ggtctgacct ccttgtgttt tgttgatgat ttatgtcaaa    47580

tattaggaat gttttcactt aatagtattg gttgcgtaac aaagtgcggt cctgctggca    47640

ttctggaggg aaatacaacc gacagatgta tgtaaggcca acgtgctcaa atcttcatac    47700

agaaagattt gaagtaatat tttaaccgct agatgaagag caagcgcatg gagcgacaaa    47760

atgaataaag aacaatctgc tgatgatccc tccgtggatc tgattcgtgt aaaaaatatg    47820

cttaatagca ccatttctat gagttaccct gatgttgtaa ttgcatgtat agaacataag    47880

gtgtctctgg aagcattcag agcaattgag gcagcgttgg tgaagcacga taataatatg    47940

aaggattatt ccctggtggt tgactgatca ccataactgc taatcattca aactatttag    48000

tctgtgacag agccaacacg cagtctgtca ctgtcaggaa agtggtaaaa ctgcaactca    48060

attactgcaa tgccctcgta attaagtgaa tttacaatat cgtcctgttc ggagggaaga    48120

acgcgggatg ttcattcttc atcactttta attgatgtat atgctctctt ttctgacgtt    48180

agtctccgac ggcaggcttc aatgacccag gctgagaaat tcccggaccc tttttgctca    48240

agagcgatgt taatttgttc aatcatttgg ttaggaaagc ggatgttgcg ggttgttgtt    48300

ctgcgggttc tgttcttcgt tgacatgagg ttgccccgta ttcagtgtcg ctgatttgta    48360

ttgtctgaag ttgtttttac gttaagttga tgcagatcaa ttaatacgat acctgcgtca    48420

taattgatta tttgacgtgg tttgatggcc tccacgcacg ttgtgatatg tagatgataa    48480

tcattatcac tttacgggtc ctttccggtg atccgacagg ttacggggcg gcgacctcgt    48540

tctgtttatg tttcttgttt gttagccttt tggctaacaa acaagaaaca taaacagaac    48600

gcgtaacctg tcggatcacc ggaaaggacc cgtaaagtga taatgattat catctacata    48660

tcacaacgtg cgtggaggcc atcaaaccac gtcaaataat caattatgac gcaggtatcg    48720

tattaattga tctgcatcaa cttaacgtaa aaacaacttc agacaataca aatcagcgac    48780

actgaatacg gggcaacctc atgtcaacga agaacagaac ccgcagaaca acaacccgca    48840

acatccgctt tcctaaccaa atgattgaac aaattaacat cgctcttgag caaaaagggt    48900

ccgggaattt ctcagcctgg gtcattgaag cctgccgtcg gagactaacg tcagaaaaga    48960

gagcatatac atcaattaaa agtgatgaag aatgaacatc ccgcgttctt ccctccgaac    49020

aggacgatat tgtaaattca cttaattacg agggcattgc agtaattgag ttgcagtttt    49080

accactttcc tgacagtgac agactgcgtg ttggctctgt cacagactaa atagtttgaa    49140

tgattagcag ttatggtgat cagtcaacca ccagggaata atccttcata ttattatcgt    49200

gcttcaccaa cgctgcctca attgctctga atgcttccag agacacctta tgttctatac    49260

atgcaattac aacatcaggg taactcatag aaatggtgct attaagcata ttttttacac    49320

gaatcagatc cacggaggga tcatcagcag attgttcttt attcattttg tcgctccatg    49380

cgcttgctct tcatctagcg gttaaaatat tacttcaaat ctttctgtat gaagatttga    49440

gcacgttggc cttacataca tctgtcggtt gtatttccct ccagaatgcc agcaggaccg    49500

cactttgtta cgcaaccaat actattaagt gaaaacattc ctaatatttg acataaatca    49560

tcaacaaaac acaaggaggt cagaccagat tgaaacgata aaaacgataa tgcaaactac    49620

gcgccctcgt atcacatgga aggttttacc aatggctcag gttgccattt ttaaagaaat    49680

attcgatcaa gtgcgaaaag atttagactg tgaattgttt tattctgaac taaaacgtca    49740

caacgtctca cattatattt actatctagc cacagataat attcacatcg tgttagaaaa    49800

cgataacacc gtgttaataa aaggacttaa aaaggttgta aatgttaaat tctcaagaaa    49860

cacgcatctt atagaaacgt cctatgatag gttgaaatca agagaaatca catttcagca    49920

atacagggaa aatcttgcta aagcaggagt tttccgatgg gttacaaata tccatgaaca    49980

taaaagatat tactatacct ttgataattc attactattt actgagagca ttcagaacac    50040

tacacaaatc tttccacgct aaatcataac gtccggtttc ttccgtgtca gcaccggggc    50100

gttggcataa tgcaatacgt gtacgcgcta aaccctgtgt gcatcgtttt aattattccc    50160

ggacactccc gcagagaagt tccccgtcag ggctgtggac atagttaatc cgggaataca    50220

atgacgattc atcgcacctg acatacatta ataaatatta acaatatgaa atttcaactc    50280

attgtttagg gtttgtttaa ttttctacac atacgattct gcgaacttca aaaagcatcg    50340

ggaataacac catgaaaaaa atgctactcg ctactgcgct ggccctgctt attacaggat    50400

gtgctcaaca gacgtttact gttcaaaaca aaccggcagc agtagcacca aaggaaacca    50460

tcacccatca tttcttcgtt tctggaattg ggcagaagaa aactgtcgat gcagccaaaa    50520

tttgtggcgg cgcagaaaat gttgttaaaa cagaaaccca gcaaacattc gtaaatggat    50580

tgctcggttt tattacttta ggcatttata ctccgctgga agcgcgtgtg tattgctcac    50640

aataattgca tgagttgccc atcgatatgg gcaactctat ctgcactgct cattaatata    50700

cttctgggtt ccttccagtt gtttttgcat agtgatcagc ctctctctga gggtgaaata    50760

atcccgttca gcggtgtctg ccagtcgggg ggaggctgca ttatccacgc cggaggcggt    50820

ggtggcttca cgcactgact gacagactgc tttgatgtgc aaccgacgac gaccagcggc    50880

aacatcatca cgcagagcat cattttcagc tttagcatca gctaactcct tcgtgtattt    50940

tgcatcgagc gcagcaacat cacgctgacg catctgcatg tcagtaattg ccgcgttcgc    51000

cagcttcagt tctctggcat ttttgtcgcg ctgggctttg taggtaatgg cgttatcacg    51060

gtaatgatta acagcccatg acaggcagac gatgatgcag ataaccagag cggagataat    51120

cgcggtgact ctgctcatac atcaatctct ctgaccgttc cgcccgcttc tttgaatttt    51180

gcaatcaggc tgtcagcctt atgctcgaac tgaccataac cagcgcccgg cagtgaagcc    51240

cagatattgc tgcaacggtc gattgcctga cggatatcac cacgatcaat cataggtaaa    51300

gcgccacgct ccttaatctg ctgcaatgcc acagcgtcct gacttttcgg agagaagtct    51360

ttcaggccaa gctgcttgcg gtaggcatcc caccaacggg aaagaagctg gtagcgtccg    51420

gcgcctgttg atttgagttt tgggtttagc gtgacaagtt tgcgagggtg atcggagtaa    51480

tcagtaaata gctctccgcc tacaatgacg tcataaccat gatttctggt tttctgacgt    51540

ccgttatcag ttccctccga ccacgccagc atatcgagga acgccttacg ttgattattg    51600

atttctacca tcttctactc cggctttttt agcagcgaag cgtttgataa gcgaaccaat    51660

cgagtcagta ccgatgtagc cgataaacac gctcgttata taagcgagat tgctacttag    51720

tccggcgaag tcgagaaggt cacgaatgaa ctaggcgata atggcgcaca tcgttgcgtc    51780

gattactgtt tttgtaaacg caccgccatt atatctgccg cgaaggtacg ccattgcaaa    51840

cgcaaggatt gccccgatgc cttgttcctt tgccgcgaga atggcggcca acaggtcatg    51900

tttttctggc atcttcatgt cttaccccca ataaggggat ttgctctatt taattaggaa    51960

taaggtcgat tactgataga acaaatccag gctactgtgt ttagtaatca gatttgttcg    52020

tgaccgatat gcacgggcaa aacggcagga ggttgttagc gcgacctcct gccacccgct    52080

ttcacgaagg tcatgtgtaa aaggccgcag cgtaactatt actaatgaat tcaggacaga    52140

cagtggctac ggctcagttt gggttgtgct gttgctgggc ggcgatgacg cctgtacgca    52200

tttggtgatc cggttctgct tccggtattc gcttaattca gcacaacgga aagagcactg    52260

gctaaccagg ctcgccgact cttcacgatt atcgactcaa tgctcttacc tgttgtgcag    52320

atataaaaaa tcccgaaacc gttatgcagg ctctaactat tacctgcgaa ctgtttcggg    52380

attgcatttt gcagacctct ctgcctgcga tggttggagt tccagacgat acgtcgaagt    52440

gaccaactag gcggaatcgg tagtaagcgc cgcctctttt catctcacta ccacaacgag    52500

cgaattaacc catcgttgag tcaaatttac ccaattttat tcaataagtc aatatcatgc    52560

cgttaatatg ttgccatccg tggcaatcat gctgctaacg tgtgaccgca ttcaaaatgt    52620

tgtctgcgat tgactcttct ttgtggcatt gcaccaccag agcgtcatac agcggcttaa    52680

cagtgcgtga ccaggtgggt tgggtaaggt ttgggattag catcgtcaca gcgcgatatg    52740

ctgcgcttgc tggcatcctt gaatagccga cgcctttgca tcttccgcac tctttctcga    52800

caactctccc ccacagctct gttttggcaa tatcaaccgc acggcctgta ccatggcaat    52860

ctctgcatct tgcccccggc gtcgcggcac tacggcaata atccgcataa gcgaatgttg    52920

cgagcacttg cagtaccttt gccttagtat ttccttcaag ctttgccaca ccacggtatt    52980

tccccgatac cttgtgtgca aattgcatca gatagttgat agccttttgt ttgtcgttct    53040

ggctgagttc gtgcttaccg cagaatgcag ccataccgaa tccggcttgt gattgcgcca    53100

tccccatagc agccatcaca tcagtaccgg aaagagagtc agaagccgtg gcccgtggtg    53160

agtcgctcat catcgggctt tttggcgaat gaaatttagc tacgctttcg agtctcatgc    53220

gccttctccc tgtacctgaa tcaatgttag gtttccgcag aacactgcgc cggtatcgat    53280

atacatttgg ttggcaaact tgagtggttt cactgctggc gtatgaccaa agatgaacgt    53340

gtccgcgcct ttgatttctt tcacgatccc gttttgtgag ttgctgattc gttcgcggtt    53400

ccagattacc tgctgatgat caactggctt tccaaactcg tattcgtcaa agggataatc    53460

ggcgtggcag ataacatatt ttttatcttt gctcaccagt tcgatgatta acggaagttc    53520

atctgcttta tgggcaagag ctttagccag aatttctttg tcgtaatcga gattaaagaa    53580

ccagccaccg ccattaagca gccagtgatt aacgtttcca cgctctgata agccatcaat    53640

catcatttgc tcatggtttc cacgtacagc tctgaaccag gggaatgtga ttaattccag    53700

gcattcaacg ttctctgcac cacgatcaac caaatcgccc accgagataa gcaggtcttt    53760

tttgttgtcg aatccaatcg tatccagttt gttcatcagg ttcgtgtagc atccgtgcag    53820

atcgccaact acccaaatat ttcggtattt gctgccatca attttttcgt aatagcgcat    53880

ctctttcact ccatccgcga tgaaccatga gaacgtcgtt gacgatggcg tgcattttcc    53940

cgtctttatc atcaacgtat tttctgaccg taccgcgact acatttcagt ctgcgtgcta    54000

cttctgtctg atttccgtat gtttcaacga gcatgtctgg aatggttttt actgagaacg    54060

tcatgcggcc tcacttctgc tatttcgcag gtctttgagt ttctgttggt actctgcctt    54120

gatcgccttg cactcttcga tagtccagcg atggcggtta tggtttgatt cgatttcgtc    54180

tactgcttcc tgcccgatgc ggctaatcag ttcgacgcga tacggaacga gatttccgct    54240

tttgtgctgg ttgcacacca cgcattgctt gtgaatattg cgttcattaa atcggagttg    54300

aggtgccgca gcagttgtcc ggtaatgtcc ggcatcccac tgagcagacg tgagcgttcc    54360

gcacgagata catggtaagt cgcggtctct ttctctgatg aaggcgttta cggcttgttg    54420

ggcttgttta atccagtaac tgcggggctt taaggcgagt tttcgaatct taagtttatc    54480

tttctgtttc tgctcctctc gtcgtcgttt cttctctgct gctttttccg ctttttcgcg    54540

ttctttactt cgtcgttcga gtgctatctt ggttccacac tctggagagc accaccactg    54600

attagcgaat gcagggtgaa accattcccg gcattcatcg tttttacatc gtcttcgcgc    54660

tggtttagcc atcatcttct tcctcgtgca tcgagctatt cggatcgctc atcagttctg    54720

cgcagcagtg ctcacacacg tgaacttcca gcacatgcag cttctgaccg cagttagcgc    54780

acgttaaagc tcgctcgacg ctttcttgtt cgtaacttcg attttggtca atcaccttgt    54840

tttcctcgca cgacgtctta gccaccggat atcccacagg tgagccgtgt agttgaaggt    54900

ttttacgtca gattcttttg ggattggctt gggtttattt ctggtgcgtt tcgttggaag    54960

gtatttgcag ttttcgcaga ttatgtcggt gatacttcgt cgctgtctcg ccacacgtcc    55020

tccttttcct gcggtagtgg taacacccct gttggtgttc tttcacaccg gagacaccat    55080

cgattccagt aaggttgatt tggtcggaag cggttatctt ctttgcattc accgcaccga    55140

taacatcgca tcatgcagct tccctcccga agtcgaaatc aagctgccct ccaaatattt    55200

cgcatgactc agaacaagag ccggtatcga atcttttagc tcgtaccatg tcctgataca    55260

gggcttgata atcattttct gaatacattt tcgcgatacc gtccagcgac attcttcctc    55320

ggtacataat ctcctttggc gtttcccgat gtccgtcacg cacatgggat cccgtgatga    55380

cctcattaaa aacacgctgc aatccctcct catctttgca ggcaagtccg attttttgcg    55440

ttgatttttt aatgcagaat atgcagttac cgagatgttc cggtatttgc aaatcgaatg    55500

gttgttgctt ccaccatgcg aggatatctt ccttctcaaa gtctgacagt tcagcaagat    55560

atctgattcc aggctttggc tttagccgct tcggttcatc agctctgatg ccaatccacg    55620

tggtgtaatt ccctcgcccg aaatggtcat cacagtattt ggtgaaggga acgagtttta    55680

atctgtcagt gcagaacgcg ccgccgacgt atggagtgcc atatttcttt accatatcga    55740

taaatggctt cagaacaggc attcgcgtct gaatatcctt tggttcccat accgtataac    55800

catttggctg tccaagctcc gggttgatat caacctgcaa tacggtgagc ggtatatccc    55860

agaacttcac aacttccctg acaaaccgat atgtcattgg atgttcacaa cctgtatcca    55920

tgaaaacgta atgcacgtct ttacctgccc gtcgcttttg ctccattagc cagagcaaat    55980

atgctgacgt cctgccaccg gagaaactaa cgacatttat catgcagccc tgtctcccca    56040

tctcgctttc cactccagag ccagtctcgc ttcgtctgac cacttaacgc cacgctctgt    56100

accgaatgcc tgtataagct ctaatagctc cgcaaattcg cctacacgca tcctgctggt    56160

tgactggcct attaccacaa agccattccc ggcaaggtta ggaacaacat cctgctgctt    56220

taatgctgcg gtaaacacac acttccagct ttctgcatcc agccagcgac catgccattc    56280

aacctgacga gagacgtcac ctaagcaggc ccatagcttc ctgttttggt ctaagctgcg    56340

gttgcgttcc tgaatggtta ctacgattgg tttggttggg tctggaagga tttgctgtac    56400

tgcgtgaata gcgttttgct gatgtgctgg agatcgaatt tcaaaggtta gttttttcat    56460

gacttccctc tcccccaaat aaaaaggcct gcgattacca gcaggcctgt tattagctca    56520

gtaatgtaga tggtcatctt ttaactccat ataccgccaa tacccgtttc atcgcggcac    56580

tctggcgaca ctccttaaaa accaggttcg tgctcatctt tccttcccgt tcttccctgg    56640

tagcaaaccg gtaatacacc gttcgccaga ccttaccttc gataaccaga agacctgccc    56700

gtgccatttt agccgcggcc tgatttatgc tggttactgt tgcgcctgtt agcgcggcaa    56760

cgtccggcgc acagaagcta ttatgcgtcc ccaggtaatg aataattgcc tctttgcccg    56820

tcatacactt gctcctttca gtccgaactt agctttgatt tctgcgatct tcgccagagc    56880

ctgtgcacga tttagaggtc taccgcccat gacaggaagt tgttttactg gttcagggat    56940

cgcctcacca cggttaattc tcgcagtcat atggacaagc tcatctgcgg ccttacggcg    57000

taattccgca tcagtaagcg cattggcccg catgttctga tacaggttgg taaccagcca    57060

gtagtgcgcg tttgatttcc acggataaga ctccgcatcc ggatacaggc ctcgcttccg    57120

gcaatactcg taaaccatat caaccagctc gctgacgttt ggcagtccgg cggtaacgga    57180

tgcttcttcc cggcaccatg caacaaactg cccgggtgat ggcagaaatg gtcgattctg    57240

ccgacgggct acgcgcattc ctgcgttaac ctgttccatc gtggtgatcc cgttttcccg    57300

aaaagccaga acccactggc gacggatttc gttcacttcg ttctggtcac ggttagccag    57360

gctcgccggg aaagttgcca gtaactggct gaacacaccg ttgatgatct gcgctacctg    57420

ctgtacctgc ggcttttcgt cgtactgttc cggcatgttg ttggcgatcc gacgcatctg    57480

ctcacggtca aagttaacca tctgtgcggc gatgtttttc atagatccac cccgtaaatc    57540

cagtctgtgt ttgtcaggtc gagttttggt ttgctggctg tcacgcctgc ctgttgcttg    57600

ttacggttga tttcgagttg ggtccactta tcgcggagtt tggccgggct cagcacgtta    57660

ccggaccaga agttgtcctg gcatgcccag cggaacagca cacacatgtc gcggtggtta    57720

cgtccgtcac gttcacgcat caggcggata tcgttagccc acccagcaaa attcggtttt    57780

ctggctgatg gtgcgatagt cttcaccatg tcaaacatcc actctgcggc ggtcaggtct    57840

tctgctgtcc cccacttgct gccgctctga attgcagcat ccggtttcac cacagaaagg    57900

tcgttttctg gctggtcaga ggattcgcca gaattctctg acgaataatc ttttcttttt    57960

tcttttgtaa tagtgtcttt tgtgtccccc tgttttgagg gatagcaatc ccccaatttg    58020

agggatgttt tatccctcgt tttaggggat tttccctcgt tttgagggat gcaccattct    58080

gagatgtttt tatttggtcc aaacatgccg ccttgctgct tgataatatt cattctgacg    58140

agttctaact tggcttcatt gcaccgtttg acaggtaact ttgtaatctc gctaagttga    58200

gaatcggtga ttctgtccat tggtttattc cacccatagg ttttacgcag aatggcaagc    58260

agcactttaa actgtcgctt ggtcagatct gcgcccgaat aagcctcaag cagcatattt    58320

gatagtctgg cgtaaccatc atcgagatct gccacattac gctcctgtcc ggcaaagtta    58380

cctctgccga agttgagtat ttttgctgta tttgtcataa tgactcctgt tgatagatcc    58440

agtaatgacc tcagaactcc atctggattt gttcagaacg ctcggttgcc gccgggcgtt    58500

ttttattggt gagaatcgca gcaacttgtc gcgccaatcg agccatgtcg tcgtcaacga    58560

ccccccattc aagaacagca agcagcattg agaactttgg aatccagtcc ctcttccacc    58620

tgctgatctg cgacttatca acgcccacag cttccgctgt cttctcagtt ccaagcattg    58680

cgattttgtt aagcaacgca ctctcgattc gtagagcctc gttgcgtttg tttgcacgaa    58740

ccatatgtaa gtatttcctt agataacaat tgattgaatg tatgcaaata aatgcataca    58800

ccataggtgt ggtttaattt gatgcccttt ttcagggctg gaatgtgtaa gagcggggtt    58860

atttatgctg ttgttttttt gttactcggg aagggcttta cctcttccgc ataaacgctt    58920

ccatcagcgt ttatagttaa aaaaatcttt cggcctgcat gaatggcctt gttgatcgcg    58980

ctttgatata cgccgagatc tttagctgtc ttggtttgcc caaagcgcat tgcataatct    59040

ttcagggtta tgcgttgttc catacaacct ccttagtaca tgcaaccatt atcaccgcca    59100

gaggtaaaat agtcaacacg cacggtgtta gatatttatc ccttgcggtg atagatttaa    59160

cgtatgagca caaaaaagaa accattaaca caagagcagc ttgaggacgc acgtcgcctt    59220

aaagcaattt atgaaaaaaa gaaaaatgaa cttggcttat cccaggaatc tgtcgcagac    59280

aagatgggga tggggcagtc aggcgttggt gctttattta atggcatcaa tgcattaaat    59340

gcttataacg ccgcattgct tacaaaaatt ctcaaagtta gcgttgaaga atttagccct    59400

tcaatcgcca gagaaatcta cgagatgtat gaagcggtta gtatgcagcc gtcacttaga    59460

agtgagtatg agtaccctgt tttttctcat gttcaggcag ggatgttctc acctaagctt    59520

agaaccttta ccaaaggtga tgcggagaga tgggtaagca caaccaaaaa agccagtgat    59580

tctgcattct ggcttgaggt tgaaggtaat tccatgaccg caccaacagg ctccaagcca    59640

agctttcctg acggaatgtt aattctcgtt gaccctgagc aggctgttga gccaggtgat    59700

ttctgcatag ccagacttgg gggtgatgag tttaccttca agaaactgat cagggatagc    59760

ggtcaggtgt ttttacaacc actaaaccca cagtacccaa tgatcccatg caatgagagt    59820

tgttccgttg tggggaaagt tatcgctagt cagtggcctg aagagacgtt tggctgatcg    59880

gcaaggtgtt ctggtcggcg catagctgat aacaattgag caagaatctt catcgaatta    59940

ggggaatttt cactcccctc agaacataac atagtaaatg gattgaatta tgaagaatgg    60000

tttttatgcg acttaccgca gcaaaaataa agggaaagat aagcgctcaa taaacctgtc    60060

tgttttcctt aattctctgc tggctgataa tcatcacctg caggttggct ccaattattt    60120

gtatattcat aaaatcgatg gaaaaacttt tctctttacc aaaacaaatg acaagagtct    60180

ggttcagaag ataaatcgct ctaaagcttc agttgaagat attaagaaca gcctcgcaga    60240

tgacgaatca ttgggattcc catctttttt gtttgttgaa ggcgacacca ttggttttgc    60300

cagaactgtt ttcgggccga ccacatccga tctgacagat tttttaatcg ggaaaggaat    60360

gtcattaagc agtggagagc gcgttcagat agagccactg atgaggggaa ccaccaaaga    60420

cgatgttatg catatgcatt tcatcggccg aacaacggtg aaggtagaag ccaagctacc    60480

tgtatttggc gatatattaa aggtcttagg ggcaacagat attgaagggg agctttttga    60540

ctcattggat atagtcatta agccaaaatt taaaagggat ataaaaaagg ttgccaagga    60600

tattattttt aacccgtcac ctcaattttc agacattagc ctgcgggcaa aagatgaggc    60660

cggagatatt ttaacagaac attatctatc agaaaaaggc catctctcag cgcctctgaa    60720

caaggtcacc aatgctgaga tagctgaaga gatggcatat tgctacgcaa gaatgaaaag    60780

tgatatactg gaatgtttta aaaggcaggt gggcaaagtt aaggattaat tatcaggagt    60840

aattatgcgg aacagaatca tgcctggtgt ttacatagta ataattcctt acgttatcgt    60900

aagcatttgc tatctccttt tccgccacta cattcctggt gtttcttttt cagctcatag    60960

agatggtctt ggggcgacat tgtcatcata tgcaggaacc atgattgcaa tcctgattgc    61020

tgccttgacg tttctaatcg gaagcagaac gcgccgactg gccaagatta gagagtatgg    61080

gtatatgaca tcggtagtta ttgtctatgc ccttagtttt gttgagcttg gagctttgtt    61140

tttctgcggg ttattgcttc tttccagcat aagcggctac atgataccca ctatcgccat    61200

cggcattgcc tctgcatcgt tcattcatat atgcatcctt gttttccaac tatataattt    61260

gaccagagaa caagaataac ccggcctcag cgccgggttt tctttgcctc acgatcgccc    61320

ccaaaacaca taaccaattg tatttattga aaaataaata gatacaactc actaaacata    61380

gcaattcaga tctctcacct accaaacaat gcccccctgc aaaaaataaa ttcatataaa    61440

aaacatacag ataaccatct gcggtgataa attatctctg gcggtgttga cataaatacc    61500

actggcggtg atactgagca catcagcagg acgcactgac caccatgaag gtgacgctct    61560

taaaaattaa gccctgaaga agggcagcat tcaaagcaga aggctttggg gtgtgtgata    61620

cgaaacgaag cattggccgt aagtgcgatt ccggattagc tgccaatgtg ccaatcgcgg    61680

ggggttttcg ttcaggacta caactgccac acaccaccaa agctaactga caggagaatc    61740

cagatggatg cacaaacacg ccgccgcgaa cgtcgcgcag agaaacaggc tcaatggaaa    61800

gcagcaaatc ccctgttggt tggggtaagc gcaaaaccag ttaaccgccc tattctctcg    61860

ctgaatcgca aaccgaaatc acgagtagaa agcgcactaa atccgataga ccttacagtg    61920

ctggctgaat accacaaaca gattgaaagc aacctgcaac gtattgagcg caagaatcag    61980

cgcacatggt acagcaagcc tggcgaacgc ggcataacat gcagtggacg ccagaaaatt    62040

aagggaaaat cgattcctct tatctagtta cttagatatt ggccttggct ttatctcaat    62100

attatatgga tcatagctgg caactaattc agtccagtaa atatcctcaa tagggaataa    62160

tatatgcttt ccattccatc gggaaaaagt tttgttcaac acaccaagct caatcaactc    62220

actaatgtat gggaattgtt ttgatgtaac cacatacttc ctgccttcat taagggctgc    62280

gcacaaaacc atagattgct cttctgtaag gttttgaatt actgatcgca ctttatcgtt    62340

ttgcatctta atgcgttttc ttagcttaaa tcgcttatat ctggcgctgg caatagctga    62400

taatcgatgc acattaattg ctagcgaaaa tgcaagagca aagacgaaaa catgccacac    62460

atgaggaata ccgattctct cattaacata ttcaggccag ttatctgggc ttaaaagcag    62520

aagtccaacc cagataacga tcatatacat ggttctctcc agaggttcat tactgaacac    62580

tcgtccgaga ataacgagtg gatccatttc tatactcatc aaactgtagg ggttgtaata    62640

gtttatccga tttctcgctg taggggtaca cgagaaccac cgagcctgat gtggttaaaa    62700

gacaggcaca atctttacta ccgcaatcca ctatttaagg tgatatatgg aagaagaatt    62760

tgaagagttc gaagagcatc ctcaggatgt gatggaacaa taccaggact atccgtatga    62820

ctacgactat tgataaaaat caatggtgtg gacaattcaa gcgatgcaat ggatgcaagc    62880

tgcaatcgga atgcatggtt aagcctgaag aaatgtttcc tgtaatggaa gatgggaaat    62940

atgtcgataa atgggcaata cgaacgacgg caatgattgc cagagaactt ggtaaacaga    63000

acaacaaagc tgcctgatag tggcctttat ttttggcata aataacagaa taaacactgc    63060

actgtgtatt cattccaacg agtgaataca cggagcaatg tcgctcgtaa ctaaacagga    63120

gccgacttgt tctgattatt ggaaatcttc tttgccctcc agtgtgaggg cgatttttta    63180

tctgtgagga tatgaacaga tgtcaaacat caaaaaatac atcattgatt acgactggaa    63240

agcatcaata gaaattgaaa tcgaccatga cgtaatgaca gaggaaaaac ttcaccagat    63300

taataatttc tggtcagact ctgaataccg actcaataaa cacggctctg tattaaatgc    63360

tgtattaatc atgctggcgc aacatgctct gcttatagca atttcaagcg acttaaatgc    63420

atatggtgtt gtgtgtgagt tcgactggaa tgatggaaat ggtcaggaag gatggcctcc    63480

aatggatggt agcgaaggaa taagaattac cgatatcgat acatcaggaa tatttgattc    63540

agatgatatg actatcaagg ccgcctgagt gcggttttac cgcataccaa taacgcttca    63600

ctcgaggcgt ttttcgttat gtataaataa ggagcacacc atgcaatatg ccattgcagg    63660

gtggcctgtt gctggctgcc cttccgaatc tttacttgaa cgaatcaccc gtaaattacg    63720

tgacggatgg aaacgcctta tcgacatact taatcagcca ggagtcccaa agaatggatc    63780

aaacacttat ggctatccag actaaattca ctatcgccac ttttattggc gatgaaaaga    63840

tgtttcgtga agccgtcgac gcttataaaa aatggatatt aatactgaaa ctgagatcaa    63900

gcaaaagcat tcactaaccc cctttcctgt tttcctaatc agcccggcat ttcgcgggcg    63960

atattttcac agctatttca ggagttcagc catgaacgct tattacattc aggatcgtct    64020

tgaggctcag agctgggcgc gtcactacca gcagctcgcc cgtgaagaga aagaggcaga    64080

actggcagac gacatggaaa aaggcctgcc ccagcacctg tttgaatcgc tatgcatcga    64140

tcatttgcaa cgccacgggg ccagcaaaaa atccattacc cgtgcgtttg atgacgatgt    64200

tgagtttcag gagcgcatgg cagaacacat ccggtacatg gttgaaacca ttgctcacca    64260

ccaggttgat attgattcag aggtataaaa cgaatgagta ctgcactcgc aacgctggct    64320

gggaagctgg ctgaacgtgt cggcatggat tctgtcgacc cacaggaact gatcaccact    64380

cttcgccaga cggcatttaa aggtgatgcc agcgatgcgc agttcatcgc attactgatc    64440

gttgccaacc agtacggcct taatccgtgg acgaaagaaa tttacgcctt tcctgataag    64500

cagaatggca tcgttccggt ggtgggcgtt gatggctggt cccgcatcat caatgaaaac    64560

cagcagtttg atggcatgga ctttgagcag gacaatgaat cctgtacatg ccggatttac    64620

cgcaaggacc gtaatcatcc gatctgcgtt accgaatgga tggatgaatg ccgccgcgaa    64680

ccattcaaaa ctcgcgaagg cagagaaatc acggggccgt ggcagtcgca tcccaaacgg    64740

atgttacgtc ataaagccat gattcagtgt gcccgtctgg ccttcggatt tgctggtatc    64800

tatgacaagg atgaagccga gcgcattgtc gaaaatactg catacactgc agaacgtcag    64860

ccggaacgcg acatcactcc ggttaacgat gaaaccatgc aggagattaa cactctgctg    64920

atcgccctgg ataaaacatg ggatgacgac ttattgccgc tctgttccca gatatttcgc    64980

cgcgacattc gtgcatcgtc agaactgaca caggccgaag cagtaaaagc tcttggattc    65040

ctgaaacaga aagccgcaga gcagaaggtg gcagcatgac accggacatt atcctgcagc    65100

gtaccgggat cgatgtgaga gctgtcgaac agggggatga tgcgtggcac aaattacggc    65160

tcggcgtcat caccgcttca gaagttcaca acgtgatagc aaaaccccgc tccggaaaga    65220

agtggcctga catgaaaatg tcctacttcc acaccctgct tgctgaggtt tgcaccggtg    65280

tggctccgga agttaacgct aaagcactgg cctggggaaa acagtacgag aacgacgcca    65340

gaaccctgtt tgaattcact tccggcgtga atgttactga atccccgatc atctatcgcg    65400

acgaaagtat gcgtaccgcc tgctctcccg atggtttatg cagtgacggc aacggccttg    65460

aactgaaatg cccgtttacc tcccgggatt tcatgaagtt ccggctcggt ggtttcgagg    65520

ccataaagtc agcttacatg gcccaggtgc agtacagcat gtgggtgacg cgaaaaaatg    65580

cctggtactt tgccaactat gacccgcgta tgaagcgtga aggcctgcat tatgtcgtga    65640

ttgagcggga tgaaaagtac atggcgagtt ttgacgagat cgtgccggag ttcatcgaaa    65700

aaatggacga ggcactggct gaaattggtt ttgtatttgg ggagcaatgg cgatgacgca    65760

tcctcacgat aatatccggg taggcgcaat cactttcgtc tactccgtta caaagcgagg    65820

ctgggtattt cccggccttt ctgttatccg aaatccactg aaagcacagc ggctggctga    65880

ggagataaat aataaacgag gggctgtatg cacaaagcat cttctgttga gttaagaacg    65940

agtatcgaga tggcacatag ccttgctcaa attggaatca ggtttgtgcc aataccagta    66000

gaaacagacg aagaatttca tacgttagcc gcatcccttt cacaaaagct ggaaatgatg    66060

gtggcgaaag cagaagcaga tgagagaaac caggtatgac aaccacggaa tgcatttttc    66120

tggcagcggg cttcatattc tgtgtgctta tgcttgccga catgggactt gttcaatgac    66180

acctcagcag gaaaacgccc ttcgcagcat tgcccgtcag gctaattctg aaatcaaaaa    66240

aagccagaca gcagtttccg gataaaaacg tcgatgacat ttgccgtagc gtactgaaga    66300

agcaccgcga aacggtaacg ctgatgggat tcacaccgac tcatttaagc ctggcaatcg    66360

gcatgttaaa cggcgtcttt aaggaacgat gaacatgaaa agcaaaatca tcagggagct    66420

acaggctcct tttttattat tcgcattcac cctcaagcgt attaaccaac agttcaggga    66480

ttaatgaaag atggcagaca tcattgattc agcatcagaa atagaagaat tacagcgcaa    66540

cacagcaata aaaatgcgcc gcctgaacca ccaggctata tctgccactc attgttgtga    66600

gtgtggcgat ccgatagatg aacgaagacg cctggtcgtt cagggttgtc ggacttgtgc    66660

aagttgccag gaggatctgg aacttatcag taaacagaga ggttcgaagt gagcgaaatt    66720

aactctcagg cactgcgtga agcggcagag caggcaatgc atgacgactg gggatttgac    66780

gcagaccttt tccatgaatt ggtaacacca tcgattgtgc tggaactgct ggatgaacgg    66840

gaaagaaacc agcaatacat caaacgccgc gaccaggaga acgaggatat tgcgctaaca    66900

gtagggaaac tgcgtgttga gcttgaaaca gcaaaatcaa aactcaacga gcagcgtgag    66960

tattacgaag gtgttatctc ggatgggagt aagcgtattg ctaaactgga aagcaacgaa    67020

gtccgtgaag acggaaacca gtttcttgtt gttcgccatc ctgggaagac tcctgttatc    67080

aagcactgca ctggtgacct ggaagagttt ctgcggcagt taatcgaaca agacccgtta    67140

gtaactatcg acatcattac gcatcgctat tacggggttg gaggtcaatg ggttcaggat    67200

gcaggtgagt atctgcatat gatgtctgac gctggcattc gcatcaaagg agagtgagat    67260

cggttttgta aaagataacg cttgtgaaaa tgctgaattt cgcgtcgtct tcacagcgat    67320

gccagagtct gtagtgtcag atgatgaccg tactcaaaca tcgggttgag tattatctta    67380

ctgtttcttt acataaacat tgctgatacc gtttagctga aacgacatac attgcaagga    67440

gtttataaat gagtatcaat gagttagagt ctgagcaaaa agattgggcg ttatcaatgt    67500

tgtgcagatc cggtgtcttg tctccatgca gacatcacga aggtgtttat gtagatgaag    67560

gtatagatat agagtcggca tacaaatatt ccatgaaggt ttataagtct aatgaagaca    67620

aatccccatt ctgcaatgtg cgagaaatga ctgataccgt gcaaaattat tatcacgagt    67680

acggtggaaa cgatacttgc cctctctgta caaaacatat agatgattaa acccaatatt    67740

acataacaat cctcgcactc gcggggattt attttatctg aactcgctac ggcgggtttt    67800

gttttatgga gatgataaat gcacttccga gtcacaggag aatggaatgg agagccattc    67860

aacagagtta tcgaagcgga gaacatcaac gactgctacg accactggat gatatgggcg    67920

cagatagcac atgcagacgt aaccaatatt cgaattgaag aactgaaaga acaccaagcc    67980

gcctgatggc ggttttttct tgcgtgtaat tgcggagact ttgcgatgta cttgacactt    68040

caggagtgga acgcacgcca gcgacgtcca agaagccttg aaacagttcg tcgatgggtt    68100

cgggaatgca ggatattccc acctccggtt aaggatggaa gagagtatct gttccacgaa    68160

tcagcggtaa aggttgactt aaatcgacca gtaacaggtg gccttttgaa gaggatcaga    68220

aatgggaaga aggcgaagtc atgagcgccg ggatttaccc cctaaccttt atataagaaa    68280

caatggatat tactgctaca gggacccaag gacgggtaaa gagtttggat taggcagaga    68340

caggcgaatc gcaatcactg aagctataca ggccaacatt gagttatttt caggacacaa    68400

acacaagcct ctgacagcga gaatcaacag tgataattcc gttacgttac attcatggct    68460

tgatcgctac gaaaaaatcc tggccagcag aggaatcaag cagaagacac tcataaatta    68520

catgagcaaa attaaagcaa taaggagggg tctgcctgat gctccacttg aagacatcac    68580

cacaaaagaa attgcggcaa tgctcaatgg atacatagac gagggcaagg cggcgtcagc    68640

caagttaatc agatcaacac tgagcgatgc attccgagag gcaatagctg aaggccatat    68700

aacaacaaac catgtcgctg ccactcgcgc agcaaaatca gaggtaagga gatcaagact    68760

tacggctgac gaatacctga aaatttatca agcagcagaa tcatcaccat gttggctcag    68820

acttgcaatg gaactggctg ttgttaccgg gcaacgagtt ggtgatttat gcgaaatgaa    68880

gtggtctgat atcgtagatg gatatcttta tgtcgagcaa agcaaaacag gcgtaaaaat    68940

tgccatccca acagcattgc atattgatgc tctcggaata tcaatgaagg aaacacttga    69000

taaatgcaaa gagattcttg gcggagaaac cataattgca tctactcgtc gcgaaccgct    69060

ttcatccggc acagtatcaa ggtattttat gcgcgcacga aaagcatcag gtctttcctt    69120

cgaaggggat ccgcctacct ttcacgagtt gcgcagtttg tctgcaagac tctatgagaa    69180

gcagataagc gataagtttg ctcaacatct tctcgggcat aagtcggaca ccatggcatc    69240

acagtatcgt gatgacagag gcagggagtg ggacaaaatt gaaatcaaat aatgatttta    69300

ttttgactga tagtgacctg ttcgttgcaa caaattgata agcaatgctt ttttataatg    69360

ccaacttagt ataaaaaagc tgaacgagaa acgtaaaatg atataaatat caatatatta    69420

aattagattt tgcataaaaa acagactaca taatactgta aaacacaaca tatgcagtca    69480

ctatgaatca actacttaga tggtattagt gacctgtaac agagcattag cgcaaggtga    69540

tttttgtctt cttgcgctaa ttttttgtca tcaaacctgt cgcactccag agaagcacaa    69600

agcctcgcaa tccagtgcaa agctttgtgt gccacccact acgacctgca taaccagtaa    69660

gaagatagca gtgatgtcaa acgacgcagc tgacttcttt tctttcacga cttccccaca    69720

cccagcatgc atacctttcc gccataactg tagtgaatgt ctgttatgag cgaggagcgg    69780

aagttaacac ttatgaaaaa tggctacgaa gtccgtggct atctatcggc ttattagtac    69840

ttgaaacgct tcttcagaag cctgaagagc taatcgttcg gcgatactat atatgcatta    69900

atagactata tcgttggtat aaacagtgca ccatgcaaca tgaataacag tgggttatcc    69960

aaaaggaagc agaaagctaa atatggaaaa ctacaatacg atgccccgtt aagttcaata    70020

ctactaattt ttagatggaa aacgtatgta atagagagta acttaaaaga gagatcctgt    70080

gttgccgcca aataaattgc ggttatttta ataaaattaa gggttactat atgttggagt    70140

ttagtgttat tgaaagaggc gggtatattc ctgcagtaga aaaaaataag gcattcctac    70200

gagcagatgg ttggaatgac tattcctttg ttacaatgtt ttatcttact gtctttgatg    70260

agcatggtga aaaatgcgat atcggaaatg ttaaaattgg ttttgtaggt caaaaagaag    70320

aagtaagcac ttattcatta atagataaaa aattcagtca actccctgaa atgttttttt    70380

ccttaggtga aagcattgac tactatgtta atctcagcaa attaagcgat ggttttaaac    70440

ataaccttct taaagctatt caggatttag tagtatggcc aaatcgatta gccgacattg    70500

aaaatgaaag cgtccttaac acctcattac ttagaggggt aactctttca gaaattcatg    70560

gacagttcgc acgtgtgtta aatggtttgc cagaattgtc agatttccac ttttcattta    70620

atagaaaaag tgctcccgga ttcagtgatt taactatacc ttttgaggtg acggttaatt    70680

ctatgcccag cacgaacatt catgctttta tcgggcggaa tgggtgtggt aaaacaacaa    70740

ttttgaatgg aatgattggt gcaatcacca acccagaaaa caatgaatat tttttctctg    70800

aaaataatag acttatcgag tcaagaatcc caaagggata ttttcgatcg cttgtttcag    70860

tttcgtttag tgcatttgat ccttttactc ctcctaaaga acaacctgac ccagcaaaag    70920

gtacacaata cttttatatt ggactcaaga atgctgccag caatagttta aaatcactag    70980

gcgatctccg cttagaattc atttcagcat ttattggttg tatgagagta gatagaaaaa    71040

gacaactctg gcttgaagct atcaaaaaac taagtagtga tgaaaacttt tcaaatatgg    71100

aactcatcag cctcatttct aaatatgaag agttaagacg taatgaacca cagattcaag    71160

tggacgatga taaattcact aaattgtttt atgacaatat ccagaaatat ctgcttcgaa    71220

tgagctctgg acatgcaatt gttttattta ctatcacaag attagtagat gtcgttggcg    71280

aaaagtcatt agttttattc gatgaaccag aggttcatct gcatccacct ttgctctctg    71340

cttttttacg aacattaagc gacttactcg atgcacgcaa tggtgtagca ataattgcaa    71400

ctcattcccc agtagtactg caagaggttc caaaatcctg catgtggaaa gtcctacggt    71460

caagagaagc aataaatatt atccgtccgg atattgagac attcggtgag aacttaggtg    71520

ttttaactcg tgaggtgttt ttacttgaag tgacaaattc tggataccac cacttattat    71580

cgcagtccgt tgattcagag ctttcttatg aaaccattct aaaaaattat aatggtcaga    71640

taggattaga aggtcgaacc gttttaaaag cgatgataat gaacagagat gaaggtaaag    71700

tacaatgaaa aaactacctc ttccagcgag aacttatagc gaaatgctta ataaatgctc    71760

ggaaggtatg atgcagataa atgttagaaa taatttcatt actcacttcc ccactttttt    71820

gcagaaagaa caacaatata gaatattaag ctcgacaggt cagttattta cctacgacag    71880

gacacaccct cttgagccta caaccttagt agttggtaac ctgacaaagg ttaaattaga    71940

aaagctttat gaaaataatc tccgagataa aaacaaaccc gctagaacat attacgatga    72000

catgcttgtt tcatcaggtg aaaaatgtcc attttgtggt gatataggac agacaaaaaa    72060

tatagatcat tttcttccta ttgcacatta tcctgaattt tcggtgatgc ctattaattt    72120

agttccatcg tgccgcgact gcaatatggg agagaaaggt caagttttcg cagtagatga    72180

ggtacaccaa gcgattcatc cctatatcga caaggacatt ttttttcgtg agcaatgggt    72240

atatgcaaat ttcgtttccg gaactccggg tgctatcagt ttttatgttg aatgcccggc    72300

gaactggagg caggaagaca aacacagagc tcttcatcat ttcaagctat taaatattgc    72360

taacaggtat cgtttggagg cagggaagca cttgagtgaa gtgattactc aaagaaactc    72420

tttcgtaaaa gttataagga aatatagttc aaccgcaacg tttcagcagc tacagtcaga    72480

atttattgaa gcaaatctga aacctattat agatttgaat gacttcccca attattggaa    72540

aagagttatg tatcagtgcc tagcaaactc ggaagatttt ttcagaggga tctagaatat    72600

gatgaaagat agaaaattac gacgcttatc ggaagtgaac gaatactttt tatatgagga    72660

gggctgtttt tacaaaatcc ggtagtaact tgctaaccaa ttcctaggca ggtcattggc    72720

aacagtggca tgcaccgaga aggacgtttg taatgtccgc tccggcacat agcagtccta    72780

gggacagtgg cgtacagtca tagatggtcg gtgggaggtg gtacaaattc tctcatgcaa    72840

aaaatatgta aaatcggtag caactggaaa tcattcaaca cccgcactat cggaagttca    72900

ccagccagcc gcagcacgtt cctgcatacg acgtgtctgc ggctctacca tatctcctat    72960

gagcaacgtg ttagcagagc caagccacaa ctctaatttt aatacataat gaatgataat    73020

aataatatta aaaatttcct gtgtaactaa tttactatat ggtttctgat aagaatcatt    73080

gcaaagatca aacaacttgt attacattga cagttaagca gttaatttta tcacctctaa    73140

aatatatcag catctagcat gcaacctatc aaaatggaga gttttatgac taaaaaacca    73200

tgggaaagaa gacttaaaga tttatcgcac ttgctcaaat gctgcattga tacatatttt    73260

gaccctgaat tatttcgctt gaatttgaat caattcctcc aaaccgcaag aacagtaaca    73320

tttattattc aaaaaaacaa aaaccagatt ataggatatg acatttggta taacaataat    73380

gttattgaaa aatggaaaaa tgatccatta atggcttggg ctaaaaattc tcgcaatacg    73440

atagaaaaac aaggcgattt agaaatgtat agcgaggcaa aggctactct tatttcatct    73500

tacattgaag aaaatgacat tgagtttatt acaaatgaaa gtatgttaaa cattggtata    73560

aaaaagttag tcagacttgc acaaaagaaa ttaccttcat atttaactga atcatctatt    73620

attaaatcag aaagacgatg ggtcgctaat acgctaaaag attacgaatt attacatgcc    73680

ttagctataa tctatggcag aatgtataac tgctgtaact ctcttggcat acaaataaac    73740

aatccaatgg gtgacgatgt gatttcgcca acatcattcg actctttatt tgatgaagcc    73800

aggagaataa cttatttaaa attaaaagat tactccataa gcaaattgtc atttagcatg    73860

atacaatatg acaataaaat aattcctgaa gatattaaag agcgtctaaa actggtagat    73920

aagcctaaaa atatcacttc gacagaagag ttagttgact atacagccaa gcttgcagaa    73980

acgacttttt taaaggacgg ttatcacatt caaacattaa ttttttatga taaacaattc    74040

catccaattg atttaatcaa tacaacattt gaagatcaag cagataaata tattttttgg    74100

cgttatgcag ctgacagagc caaaataaca aatgcctatg gcttcatttg gatatcagag    74160

ctatggctca gaaaagcaag catctactcc aataaaccaa tacatacaat gccaattata    74220

gatgaaagac ttcaggtaat tggaattgat tcaaataata atcaaaaatg tatttcatgg    74280

aaaatagtta gagaaaacga agaaaaaaaa ccgactttag aaatatcaac agcagactca    74340

aaacatgacg aaaaaccata tttcatgcgt tcagtcttaa aagcaattgg cggtgatgta    74400

aacactatga acaattgagt catagaactt ccattattct cctgaagata ataatcgcca    74460

aataaaccaa tactcagctt tacaatatac taactaaccg cagaacgtta tttcatacaa    74520

cgtttctgcg gcatatcaca aaacgattac tccataacag ggacagcagg ccactcaata    74580

tcaggtgcag ttgatgtatc aacacggttc agcaacaccc gatacttctt ccaggcttcc    74640

agcaacgagg tttcttcctt cgttgcaatt tccagatctg cagcatcctg aagcggcgca    74700

atatgctcac tggctacctg catcaggctt ttttttgttt cttccgcctc ccggatccgg    74760

aacagttttt ctgcttccgt atccttcacc caggctgtgc cgttccactt ctgatattcc    74820

cctcccggcg ataaccaggt aaaattttcc ggtaacggac cgagttcaga aataaataac    74880

gcgtcgccgg aagccacgtc atagacggtt ttaccccgat ggtcttcaac gagatgccac    74940

gatgcctcat cactgttgaa aacagccaca aagccagccg gaatatctgg cggtgcaata    75000

tcggtactgt ttgcaggcag accggtatga ggcggaatat atgcgtcacc ttcaccaata    75060

aattcattag ttccggccag cagattataa atttttatgg tccgtggttg ttcactcatt    75120

ctgaatgcca ttatgcaagc ctcacaatat agttaaatgc aatgtttttg acggtgtttt    75180

ccgcgttacc cgcagcgtta acggtgatgg tgtgtccgtg tgaaccaata ctgaaagaat    75240

gggcatgagc accgataaca accggatgct ggtgcgcacc aataccaact gtatgcgcat    75300

gtgcaccggc actcacggct gtaccggaca atgagtgact gtggctgccc tgactgtccg    75360

ttttcgataa ataagcaata ccctgtgtgc tggttccttt aactgtggat aaacttcctg    75420

taatggttgc tgttccatac tgactccagc cagaactgtt catccttaaa ccacttgtgt    75480

gggcatgagc acccgcggcc cctgttgaac cgctcagact gtgagcatga gcccccgtgt    75540

tattcgtcga tttggtgccg taatcgaaac tgcctgttgt tttcgtcccg taatcaaacg    75600

acgatgtggt tttcgtcccc aaatccgtac cggatgcact ggcactgtgg gtgtgcgact    75660

taattccatc ctgttcctga gacaatacag cacgaccgct ggcgggtttc cccttgattg    75720

tccagcctcg catatcagga agcacacccg atggatacgc gacagcaagt tttgggtagg    75780

ctgatttgtc aaacgcctgc ccctgcatca ggacgtagcc agacggaacg atatctgatg    75840

gccacgggat cggcgcacct gccggaaagg ccgaattctc accggcccca aggtattcaa    75900

gaacatctgc aacggaattt tttgccagaa tatccctgcc aacctgagtc agttcagtca    75960

ggctggcggc atcattttcc gcaaaatacg gtaatttatt tttcgccgtg gaaagccctg    76020

ccagcgccgt cagtgtcgca ttcttcggtt gtttacccgc aagcgcgtta gtcatggtgg    76080

tagcaaaatc tggatcattc ccgagcgctg cggccagttc attcagcgta ttcagtgcgt    76140

caggtgacgc gtcgataaca tctgcaatcg cggccagtac aaaagcggtg ttcgcaatct    76200

gggtattgtt tgttcccctg agcgcggttg gtgctgttgg cgttccggtc agtgccggac    76260

tgtccagtgg gcttttctgt tcgtttcatc cattaccacc ttaaccgcct ttggcgttgc    76320

agcaagcgtt tcagacgtgc tgttggttgc actgctgagc tgcactatcc cctttctcgt    76380

tgtgtccgca tcctcaagcg cgacagctga agctatatct tctgcacgtt ttgccgaatt    76440

ttttgcacgt attgccgccg cttctgccgc acttttgctc tgcgatgctg ataccgcact    76500

tcccgcagcc tctgtcgcct tcgtggatgc cgttgacgca ctccccgccg ccgctgtttt    76560

tgcgtctgcc gcggcagagg cgctccgttc cgctgctgtt tcagatgacc tggcattcgt    76620

ctcggacgtt tttgccgccc tggcagaatt ttctgccgcc gttgccgagg aagctgcacg    76680

accggcactt gatgatgcgt tcgtttctga tgattttgct gcctcttttg aggccaccgc    76740

atctcgtgct gaagtggcgg cctctgacgc tttcgtggcc gcggtggagg cagacgtggc    76800

ggctgattgt tgtgacgctg cagcattcgt ttctgacgtt ttcgccgcac cggcactggt    76860

ggccgccgcg ttttttgagg actctgcggc tgcggcactt ttttccgctt cagtggcctt    76920

tgctgatgcc gcttctgcgc cggaggacgc ttcctgagct gacgatgcag cctgtccggc    76980

ggacgtgctg gcggcgcgtg ctgagtcagt tgcatcagtc acaagggccg cgacctgagc    77040

agctgatgca ctggcatcgc cggctgattt cttcgcgtct gccgtactct gtgccaccac    77100

ggacgcgtta cgcgccacct cttccaccat cagttcaaga cgacgcagca cctccggccg    77160

ggcatcatcc tccgtcatgg cacagagaaa atcattcagc gtccccggtt gtgaatcttc    77220

atacacggtg atggtcccgg cgtgcgatgg tggaaaaccg tcaacctgca ggatgacact    77280

gtactgaccg tactccacat ccatgctgta acgcccggct tcatccggat tctctgagcc    77340

caccgtgttc accaccaccg tggtgctgtt acgtctggct ttcagctgaa tggtgcagtt    77400

ctgtaccggt tttcctgtgc cgtctttcag gactcctgaa atctttactg ccatattcac    77460

cccacaaaaa agcccaccgg ttccggcggg ctgtcataac actgtgttac ctggctaatc    77520

agaatttata accgacccca acgatgaatc cgtcagtacg ccagtcgcca ctgccggagc    77580

cttcataagc aatatcaaca acgacggacg ctgccggatt aatctgtata cctgcactcc    77640

acgccactga ggtatgccgc attgcacttt cgtccctggc agtggtcgtc tctttcatat    77700

acccgggagt gatttccgtc ttacggtaat ccattgtact gccggaccac cgactgtgag    77760

ccactccggc catggcgtac gcactgacct gcttactgat ttgtaaaacc ggtccggcca    77820

tcacgctcac ataacgtcca cgcaggctct catagtgaaa cgtatcctcc ccggtcatca    77880

ctgtgctgct ctttttcgac gcggcgaacc ccagggaagc catcaccccc acactgtccg    77940

tcagctcata acggtacttc acgttaatcc ctttcagatg actcacaccg gtatccccgc    78000

ccgacaacga cggcaatgta cccggtttca cttgaaaata gcccaccgta aacgtaccat    78060

gtccaccttc cgcacgggcc ggagtgactg tcaccgcaag tgcggcaaag acagcaacgg    78120

caatacacac attacgcatc gttcacctct cactgtttta taataaaacg cccgttcccg    78180

gacgaacctc tgtaacacac tcagaccacg ctgatgccca gcgcctgttt cttaatcacc    78240

ataacctgca catcgctggc aaacgtatac ggcggaatat ctgccgaatg ccgtgtggac    78300

gtaagcgtga acgtcaggat cacgtttccc cgacccgctg gcatgtcaac aatacgggag    78360

aacacctgta ccgcctcgtt cgccgcgcca tcataaatca ccgcaccgtt catcagtact    78420

ttcagataac acatcgaata cgttgtcctg ccgctgacag tacgcttact tccgcgaaac    78480

gtcagcggaa gcaccactat ctggcgatca aaaggatggt catcggtcac ggtgacagta    78540

cgggtacctg acggccagtc cacactgctt tcacgctggc gcggaaaagc cgcgctcgcc    78600

gcctttacaa tgtccccgac gattttttcc gccctcagcg taccgtttat cgtacagttt    78660

tcagctatcg tcacattact gagcgtcccg gagttcgcat tcacactgcc actgatatcc    78720

gcatttttag cggtcagctt tccgtccggt gtcagggaaa aggccggagg attgccgccg    78780

ctggtaatgg tgggggccgt caggcgcttc aggaacacgt cgttcatgaa tatctggttg    78840

ccctgcgcca caaacatcgg cgtttcattc ccgtttgccg ggtcaataaa tgcgatacga    78900

ttggcggcaa ccagaaactg gctcagtttg ccttcctccg tgtcctccat gctgaggcca    78960

atacccgcga cataatgttt gccgtctttg gtctgctcaa ttttgacagc ccacatggca    79020

ttccacttat cactggcatc cttccactct ttcgaaaact cctccagtct gctggcgtta    79080

tcctccgtca gctcgacttt ttccagcagc tccttgccga gatgggattc ggttatcttg    79140

cctttgaaaa aatccaggta accttccgca tcatcgctcg cccgaccgac ggcctccacg    79200

aatgccgatt tgccaacggt gttcacactg cggatataaa agtaataatc atggcccggt    79260

ttgatattga tactggcggc tatccagtac agcgccgtac caagataacg cgtgctggtt    79320

tcaacctgtc tgatatccgc aatctgcttt tccgagaacc agaactcaaa ctgtaccgtc    79380

gggtcataaa cggcaagatg cggcgtggcg gttatctgaa aatagcccgg cgtcagctca    79440

atcctcgacg gtgctgccgg tgcggcaatc cggaacgata ccgacgccgg atcgccctgc    79500

tgcccccacg catttaccgc ccggactgtc agcctgtagt tccccagcgc cagttgcgtg    79560

aagcggtatg tggtttccgt cgtccgggcc gtgctgacca gccgctcact gccgtcgtcc    79620

gctgttacgg tcagacggag caggaaactc acgcccttca ccaccttcgg tgtgtcccat    79680

cgcgccagca cctgatattc cccgctgtct gcagtgactt ctgcggtcag gtgctgcacc    79740

gctggcggcg tgacaccatt caccgtgcca ctctgttcgc cgtcaaagtg cgccccgtta    79800

tccacgatgg cctctttttc cggcacatgc tgcacggcgg tgatggcata cgtgccgtcg    79860

tcgttctcac ggatactcac gcagcggaac agtcgctggc gcagcgtcgg cagcttcagc    79920

tcccatacgc tgtattcagc aacaccgtca ggaacacggc tcacttttac cttcacgccg    79980

tcggtgacgg actgaacctc cacgctgacc ggattgccac ttccgtcaac caggcttatc    80040

agcgcggtac cggaggatgg cagcgtgatt tcacggtcga gcgtcagcgt ccgggtctgg    80100

ctgttcaccg ccagcacacg accaccggtg ctgataccgg catagtcatc atcgcagatt    80160

tcaataacat cgcccggtac atggcgaagc ccttctgcgc cgacgctgaa atccacggtc    80220

tgcgtttcca gcagttctgt tttaatcagc cacagcccgg cgcggtgtgc ctgcccccgg    80280

ctggtacagc caaaggcatc catcttcgta acattacgac cgtaacgggc aatggcctgc    80340

gtatcttcaa caagctctgt cgccgtctcc cagccgttgt tcgggtcaat ccagttcacc    80400

tcaacggcat tatggcggtc cttcagggcg ctgaagctgt agcggaacgg cgcgccatca    80460

tccggcatca ccacattact gcggttatag gtccacgtct tatccgacgg tcggtcctgc    80520

acgaacgtca gcgtctgccc gttccatacc ggcatacagc gcatcgccga gcagaaatcg    80580

ctgagcacat cccacgcctt acgctgtgtg gtcaggtacg cattacaggt gatgcgcggc    80640

tccgtgccgc caaagccgtc cggcactgac tggtcgcagt actggccgat gacatacagc    80700

gcccatttat ccacatccgc cgcaccaaga cgtttcccca tgccgtagcg cggatgggtc    80760

agcatatccc acagacacca ggccatgttg ttgctgtatg ccggtttaaa cgttccgtcc    80820

cagataccgc tgtattgccg cgtctgcggg ttatagttcg acggcacctg cagaatacgc    80880

ccgcgcagat gataattacg gctcacctgc tggctgccga actgctccga gtccacctgc    80940

acgccgacca gtgccgtgtt cgggtagcac tgtttcacat cgatgatttc agtgtatgac    81000

gaccagagcg ttttgttctg cagctggtct gtggtgctgt ccggcgtcat cctgcgcatc    81060

cggatattaa acgggcgcgg cggcaggtta cccatcacca ccgaggccag atactgcgag    81120

gtggttttgc ccttaatggt gatgtctttt tccgtcaccc agccaccgtt acgttgtatc    81180

tgaaccagca ggcggacttc cgacggattc ctgtcaccct ttgaggtggt ttccaccagt    81240

gcctgtacac cgaaggtaaa gcgcagacgg tcgatgtttg cagacgtaat ggtgcgggtg    81300

atcggcgtgt catatttcac ttccgtaccc agcaccgtct cggagccgga ggattcaaat    81360

ccctccggcg gagtctgctc ctgctcacca gcccggaaca ccaccgtgac accggatatg    81420

ttggtattcc cctcagtgtc cagcaccggc gtactgttca gcagcacgct ttttaagcca    81480

tccaccggac cttcaatcgg cccttcgctg atggcatcga tcacactcag caactgcgtg    81540

gacttcaggt tgtccttcgc ttcgcgcggg gtatgcccct tactgcttcc tttacccatt    81600

cctcacgctc cataaatgac aaaaccgccc gcaggcggtt tcacataaaa cattttgcat    81660

cagcgaccaa tcaccacaac ctgaccaccg tccccttcgt ctgccgtgct gatctcctga    81720

gaaaccacgc gtgaccccac gcgcatttcc ccgtacagaa caggcagaac attgccctgg    81780

gcaaccatgt tatccagtga ggagaaatag gtgttctgct taccgttatc cgttgtctgt    81840

atacggggag ttctggcttt cggtgccagc atctgcgcca caccaccgag caccatactg    81900

gcaccgagag aaaacaggat gccggtcata ccaccggccc caatggctgc cccccatgct    81960

gcaagggtgg ctccggcggt aaagaatgat ccggcaatgg cggcagcccc caggacaatc    82020

tggaatacgc cacctgactt ggccccggcg actctgggaa caatatgaat tacagcgcca    82080

tcaggcagag tctcatgtaa ctgcgccgtt aacccggacg tgctgacgtc ccgcccggca    82140

atccgtacct gataccagcc gtcgctcagt ttctgacgaa acgccgggag ctgtgtggcc    82200

agtgcccgga tggcttcagc ccccgttttc acacgaaggt cgatgcggcg accaaatcgt    82260

tgtaaatccc cgtaaaggca gatgcgcgcc atgcccggtg acgccagagg gagtgtgtgc    82320

gtcgctgcca tttgtcggtg tacctctctc gtttgctcag ttgttcagga atatggtgca    82380

gcagctcgcc gtcgccgcag taaattgcgg cgtgattcgg cactgatgaa ccaaaacagc    82440

acagcagcac atcgcccggc tgtgccgctg acaacggcac ctgatacagc cccgtcgcct    82500

ccagattatc cagatagaga ttctggccgt tacgccacca gtcatcctca cgatgaaagt    82560

ccggcatctc aatccccgcc agatgataag catcccggaa cagtgtgtaa cagtccgtca    82620

caccgtgctc aaagcgccgc ccggtgagat gcggcacaca gcggaactta tgaatcgtcc    82680

cccggcagac cagccaccac ggcaaatcac tctgcacctg cagccgccgg tcggcctcac    82740

tcagccaggg cagaccaccg gggtggctgt ggaccagcgc cacaatctca ccctgcattt    82800

ctgcctgcag ccagtcttcc ggcgacatac ggaaatagcc tccggctcac cggagatatt    82860

cacgcagggg aaatatcttt ccccctccgg cgtgcttacc acgaagccgc acgactccgc    82920

tggcgcacat cgccgggcgt gcgccagaat cgctgattct gtctgtgtca tgggatttac    82980

tgcgaaagtt tgttaatgga aaggaagccg ccaaagttgc cgacgttatt gcggaactta    83040

caaccgctca ggcatttgct gcatttatcc ttcgtgatat cggacgttgg ctggtcatat    83100

tcatccgcga cagccggacc gctataaccg cactcgtcac cgcgataggt ccaggtgcag    83160

gtgttggcca gcatgatacg tcccggaaaa acagcgccat ccgtttccgt cggcgtggac    83220

agtacaaagg aggcactcac cgcgctcagt tcgctgcact gctcaatgcg ccagcggctg    83280

atcacctcct gctccggatc ggcgtaactg tttccgttga cgaagttcac cgcatccaga    83340

aaacgggcgt aaaccttacg ccggaccacc gttccgccga ccagactctg catatcttcc    83400

gccatcccgg tgaccatacc gtacaggtta gaaaccgtca gcgtggggcg cgtactggtg    83460

cctttgccat tcagttcaaa accgctcccc tgaatgggat acggctgata ctgtcgcccc    83520

tgccaggtga ccggctcacc tttttcgttc tgctcattac agaaaaaata acgttctcca    83580

ccgacctctg tcaggtcgat ttcccagagc accacgctgg ccgactgctc cgcacgggtg    83640

cattcattca gtgtttcctg ccggatatcc tgcatcagtt caccacctgt tcaaactctg    83700

cgctgaactc aacacgcagc atactgaccc gcgacgacca ttttgcgcag gtcaccttta    83760

tctgccgcca ctcataaggc ggcgtccaca gaaaggattt ccagcccccg tgctcttcca    83820

gaaacgactc cagtaccgtg gcctcctcac gggggacaga aagcgtcacg ctgtacgttt    83880

tcaggttggc attcagcccg gcaggcgctc gctgagaata gccatcacca aagcgcacct    83940

ttcttacaga agggaccgaa gccacatcca taccgggttt cactttccag cggaaggtct    84000

tcatcgtcca cctccggaga acaggccacc atcacgcatc tgtgtctgaa tttcatcacg    84060

ggcacccttg cgggccatgt catacaccgc cttcagagca gccggaccta tctgcccgtt    84120

cgtgccgtcg ttgttaatca ccacatggtt attctgctca aacgtcccgg acgcctgcga    84180

ccggctgtct gccatgctgc ccggtgtacc gacataaccg ccggtggcat agccgcgcat    84240

cagccggtaa agattcccca cgccaatccg gctggttgcc tccttcgtga agacaaactc    84300

accacggtga acaatccccg ctggctcata tttgccgccg gttcccgtaa atcctccggt    84360

tgcaaaatgg aatttcgccg cagcggcctg aatggctgta ccgcctgacg cggatgcgcc    84420

gccaccaaca gccccgccaa tggcgctgcc gatactcccg acaatcccca ccattgcctg    84480

cttaagcaga atttctgtca tcatggacag cacggaacgg gtgaagctgc gccagttctg    84540

ctcactgccg gtcagcatcg ccgccatatt ctgtgcaata ccatcaaagg tctgcgtggc    84600

tgcacttttt acctgcgaca tactgtccgt ggcgctctct tcccactcac tccagccgga    84660

cttcaggcct gccatccagt tcccgcgaag ctggtcttca gccgcccagg tctttttctg    84720

ctctgacatg acgttattca gcgccagcgg attatcgcca tactgttcct tcaggcgctg    84780

ttccgtggct tcccgttctg cctgccggtc agtcagcccc cggcttttcg catcaatggc    84840

ggcccgtttt gcccgttgct gctgtgcgaa tttatccgcc tgctgcgcca gcgcgttcag    84900

gcgctcctga tacgtaacct tgtcgccaag tgcagccagc tggcgtttgt actccagcgt    84960

ctcatcttta tgcgccagca gggatttctc ctgtgcagac agctggcgac gttgcgccgc    85020

ctcctccagt accgcgaact gactctccgc cttccacaaa tcccggcgct gctggctgat    85080

tttctcattt gctccggcat gcttctccag cgtccggagt tctgcctgaa gcgtcagcag    85140

ggcagcatga gcactgtctt cctgacgatc gcccgcagac accttcacgc tggactgttt    85200

cggctttttc agcgtcgctt cataatcctt tttcgccgcc gccatcagcg tgttgtaatc    85260

cgcctgcagg attttcccgt ctttcagtgc cttgttcagt tcttcctgac gggcggtata    85320

tttctccagc ggcgtctgca gccgttcgta agccttctgc gcctcttcgg tatatttcag    85380

ccgtgacgct tcggtatcgc tctgctgctg cgcatttttg tcctgttgag tctgctgctc    85440

agccttcttt cgggcggctt caagcgcaag acgggccttt tcacgatcat cccagtaacg    85500

cgcccgcgct tcatcgttaa caaaataatc atccttgcgc agattccaga tgtcgtctgc    85560

tttcttatac gcagcctctg ccttaatcag catctcctgc gcggtatcag gacgaccaat    85620

atccagcacc gcatcccaca tggatttgaa tgcccgcgca gtcctgtctg cccaggtctc    85680

cagcgtgccc atgttctctt tcaggcggcg ggtctggtca tcaaaccctt tcgttgcggc    85740

ctcgttcgcc gcctgcaatg ccccggcttc atcgccggaa cgctgcaact gagcaacata    85800

cgcaatctgc tccgccgaca cgttatggaa ctggcgagcc atcgccgtca gccccgacgt    85860

cgggtctgtg gtcagcttcc cgaaggcttc agcgaccttg tccacctcca cgccggatgc    85920

agaggagaaa cgcgccacac tctggctgat ggacgcaatc tgagcctcac cgcttacccc    85980

cgccttaacc agtgcgctga gtgactcgct ggtctggtta aacgtcagcc ctgccgcctg    86040

cccggctctg gacaggacca gcatacgatc tgccgtcagt cccgcctgat tgccggaaag    86100

gaccagcgtt ttgttgaaat cggacagggt tgagttgccc tgataccagg catacgccag    86160

cgcaccggtc gccaccgcca gcgaggtggc ccccaccatc ggcagggtga tcgcaccggc    86220

aagccccctg aacatgggga tcatcccgcc gaaggagtcc ttcacctgcc ccccctgttg    86280

cagcaggatc agccacggac tttgcccgcc tgcaagctgc gtggccacgt cggtgaactg    86340

tgcaggcagc atacgcatgg cggctttata ctgcccgacg gaaatccccg ctttctgtgc    86400

agccagcgcc tgtcggctca gcgactgttc aacgactgcc gctgtttttt tcgcatcact    86460

ttccgtacca gaaaaatgac gcctgactct ggccatctgc tcgtcaaatc tggccgcatc    86520

cagactcaaa tcaacgacca gatcgcctac cggttcagcc ataccggact cctcctgcga    86580

tcccttctga tactgtcatc agcattacgt catcctccgt catgtccgcc acatccgggg    86640

aagcggggat aacttcattc ccgtccgggc caaagcggac acctccggca agccctgccg    86700

ctttctgcat cagcacatca tcttcaggct cttcgtcagc ctcgcgccgg ttcagcagac    86760

tgaaatccag cggatgcata tccggatcgc tgaaaaacag gctgagcacg gtgtacgtca    86820

gcccggaaaa gtgcatatcc agcagaacat catgaaaata atgggtactg taaaagcggt    86880

gccagtcggc atactccgtg gatgacatcc cggcaagcat ggcacgccag tcgggtcgcc    86940

ccatctcacg cgccagtttc agggcaaaac tcagctcacc gtcgaacact ttcccgcaga    87000

aacaggctct gcgggcccgg cgtcctctgt ctgttcaggg gcattattca ccacaaactc    87060

atacatacca gacagccggt acaccacgtt ttcagcatga gaaattgcct ccgtgggcca    87120

ggtggtaagc acttcctgct caatctgttt aacggcttca ttcatggacg gcatctgcgt    87180

cttctgcgga tggttatgcc acagggacat cgccaccaga aacgcgccgg ttctgatggc    87240

gtcttccaca gtaaacttcc ggttgctgtc tgactccgcc tgttctgcct gccgtttcat    87300

cagggcgaga tgctcaatgc gctgcagggc tgacagttca gaaagcgtga cggtcacacc    87360

gttatgttca aatgattcgg ttttcaggaa catcgctgac tctccggatt aactggcggt    87420

gacggtaatt tctgcaaccg cagcaaactc accattaccg gatacaaccg gaatgttgac    87480

cttgcctgca gcaacgccgt tcacggtgat ggtcatacca ctgaccgaca cggtggcttt    87540

tgttttatcc gcagacaccg cacgaaagct cttgtcggtt acgccctccg gctggaaggc    87600

cacggtcagc gtggtgctct gccctttcac caccgaggtg ctggcaggcg tcacggtcat    87660

gccggttgcc gctgttaccg tgctgcgatc ttctgccatc gacggacgtc ccacattggt    87720

gactttcacc gtgcgggtga tcacttcctt cgccgtcacc gccttaccga tactgctgac    87780

ccagccacgg aacacatcga ccgtgccgtt cgggaagcgg attttatagg cacgggtatc    87840

gccttcatta aaccacgcca gcagcgcctg ctgcccctgc tctccgggca tccacgccag    87900

cgtgaagctg gtatctccgg cagatttctg cccctgcccg gtcgcagtcc agtctgcatc    87960

ttcatcatcg agatagctgt cgtcatagga ctcagcggtc agttcgccgg gcgtcaggtc    88020

tttaactttt gccagacgcg accagtcaac gtctgaaagc ggattcgcgt aagggtcacc    88080

gctcccctta taaacccaca gggtggtccc ggcacctttc accggcattg taggatttgg    88140

tacaggcata gcgtcctcac atttcatagg taatgacata agtcagatcg gctgaactcc    88200

acaagcccgc atcatcgtcg cgccggtagt catagccgct ggccaccata ctggtgatca    88260

aatctgacag tgccgggata tcgctcatca ccggataaat ccgggactcc atccacgcat    88320

ccagctctga atccggcacc tgagcaggca ggaaaacttc gatatgcagc tccgcctgcc    88380

aggtatcgct gtccagctct tcgcccgtgt attcagcgcc ggtgagataa acggcaactg    88440

ccggaaaatc cgcctcatca aaaacagcgg ggcgaccatc aaaaaacgtc gccccggtgt    88500

catgcttctc cagtgcatcc agtacggctg cacggagttc agtatgtttc atcgctttat    88560

taccatcctc agttgatgct gcagcgcata gcccagctct ttcggaagac gttcacgccg    88620

tatccgctca atattttgtt taaacgccgt ggtcagcggc accgccatcg ggattttcac    88680

cacatcaatg gggtaacggt ttttcccagc cacacgctgc atgacatgcc accggccatt    88740

tttcagttgc tgaataaacg cgccgggaat acgacggtta cccaccacaa gcacgctgcc    88800

gccacctttc agggatgaac gctgcccctt tttacgacgc ctgcggcgcg aaaggacaac    88860

ccgcgcatta cccagcttga ttacgggcaa atccccccgg ttaactttga ttctggcctg    88920

cggatttttg accgtggccc ttttcagcct ggccctttcc tttaccagtt tccggcgtac    88980

ctttgtctca cgggcaacct gtgacgccga ctgcgatatc gcggatgaag caacgcggtt    89040

aatggccatt gcggcggcac caggcaccgc cgttttgctg atacggctga ggttttcaac    89100

ggcctgctca agacctttta tggccataca tccccctttc agcggcgacg gttaacggca    89160

ggcggtacgc cccgtccaag ccagagatga caacttccgc catcatccgg cgaaacccga    89220

tctacccaga aattttcctc accgatggtc agcgtgtctc cacgccgcag ctgccgcacc    89280

tcatcagtcc ggacaaacag ggacgggctg gagccttcaa cgcgcacgcc ctgtccggca    89340

tagctgatat tttcagggtc atcaaaaaca ccacgtatca ccgcacctga ctgctcaccg    89400

gatgtaatgg tggctgacgt tcccatgtac ccgcgtatcg tttcatcggc gcgggcaatg    89460

gcagcatcga acaggttatc gaaatcagcc acagcgcctc ccgttattgc attctggcca    89520

ggccgcgctc tgtcatttcg gctgccacac cggcagagac acgaaacgcc gttcccggca    89580

gcacaaatgc cacaggttca tcccgcgtgg cgtgaagtgc atcagtatgc agcttcacca    89640

gtgccacgac cgtgaccagt tcagacgtat ccagaatcac ggtatccggc tgcgctgatc    89700

ccacctcatt ttcatgtccg gtcagcacat tttcccggct gagaggggtg tcctgaccgg    89760

cagtttcatc cgtgtcatca agctcctctt tcagctctgc cacacggagc gccagttctt    89820

ctttcgtccc cgtcaggctg acatcacggt tcagttgttc acccagcgag cggagacggg    89880

caatcagttc atctttcgtc atggactcct ccacagagaa acaatggccc cgaagggcca    89940

tgattacgcc agttgtacgg acacgaactc atcagggtca gccagcagca tcagcggtgc    90000

tgactgaatc atggtgaact cacgcgccgg atcgccggtg gtcacccagt ttttcgggta    90060

acgggcagag gcgttaatgc cttcgcgctg tgcgtccgca tcctgaatgc agccataggt    90120

gcgcagaccg cgtgcctgag tgttccccag caccatcgtg ttgtccggca ggaagttctt    90180

tttgacgccg ttttccacgt actgtccgga atacacgacg atggccacat cgccatacat    90240

ccccttatag gacaccgctt tgcccaggtc tttcaccgct gtctccagct cggaattaga    90300

gccacgacgg gtatccagct tctccttgac ggctttgaag gaacggaaca gcgcccagcc    90360

tttcggatcg aacacgatga tattcaccac accgctggcg ttcagcgcgt aggcttcgat    90420

atcgtcggtc gggtcatacg tggacttgtc acgcttgctc cactccgtgc cgccggactg    90480

cgtgatgtta ttctcctcac tgcggcccat atccacctca accggatcga aggcttcacc    90540

ggtcatggtg tatttgccct taagcacggc agaaactgcc tgcatctctt cgacctgagc    90600

aatggccagc tcttcgtcac gcatgttctg catgatgatg cgacggcggc ggtaagccgg    90660

gtccgccaga ttctgcggat cttcatccgg caggcgacgc agggtcatct gcggattcac    90720

ttcatgcttc ggcttgacat atcccggcgt aaattcagag gtggagccgc cacgggaacg    90780

gataacctca ccggaaacaa tcggcgaaac gtacagcgcc atgtttacca gtcccggaat    90840

ttgtgagaga tagactttct ccgtggtgaa gggatagctc tcacggaaaa agagacgcag    90900

aaacagcgga tcaaacttaa atttctgctc atttgccgcc agcagttggg cggttgtgta    90960

catcgacata aaaaaatccc gtaaaaaaag ccgcacaggc ggcctttagt gatgaagggt    91020

aaagttaaac gatgctgatt gccgttccgg caaacgcggt ccgttttttc gtctcgtcgc    91080

tggcagcctc cggccagagc acatcctcat aacggaacgt gccggacttg tagaacgtca    91140

gcgtggtgct ggtctggtca gcagcaaccg caagaatgcc aacggcagca ccgtcggtgg    91200

tgccatccca cgcaaccagc ttacggctgg aggtgtccag catcagcggg gtcattgcag    91260

gcgctttcgc actcaatccg ccgggcgcgg ttgcggtatg agccgggtca ctgttgccct    91320

gcggctggta atgggtaaag gtttctttgc tcgtcataaa catcccttac actggtgtgt    91380

tcagcaaatc gttaacggca tcagatgccg ggttacctgc agccagcggt gccggtgccc    91440

cctgcatcag acgatccagc gcagtgtcac tgcgcgcctg tgcactctgt ggtgctgcgg    91500

ccagaatgcg gcgggccgtt ttcacggtca taccgggggt ttctgccagc acgcgtgcct    91560

gttcttcgcg tccgtgagcc tcctcacagt tgaggatccc cataatgcgg ctgttttctg    91620

ccgcaaccgc tgcggtgatc tgcgcgttca cgtccggctg cgccgcgctg gcgttctcgc    91680

cctccgtcgc tggcaccacg tcagtaacgt cagcctgcga agcagtggct gaaacagttg    91740

ttgattgagt ctctttggtc attcgccctc ctgagagacg ggatttacgt gcatccagtg    91800

catcacgcat gacggtgatc gcatcggtgc tgttaacaag ttcatcagcc agtccggcat    91860

caatggcctc ctgaccgctg tacactgcag cctcggtatc cagcacaacc tgcacggaca    91920

ggccggtata tgccgacacc ttctgcgcaa acatctggcg ggttgcgtcc atccgggact    91980

gcagtgtctc ccggacgtca tccggaagat ggctgtaggg gttgccatcc accttatggc    92040

tgccgctgta aatcagcgtg atttccacac cctgtttctc cagcgcagca ccgtaattac    92100

tgtgagccat catgacgccg atggagcctg tccgggcggt ctgcgtgacc agacgccggg    92160

aggcggcact ggcaagcaac tgacctgcac tgcagttcat gtcgttggca agcgcccata    92220

ccggttttat gtcacgcaca cgggcgatga tgtcagcgca gtcaaatgcc cccgccacca    92280

tcccgccggg cgtgtccata tcgagcagaa tgccgtccac catcggatcg ctggcagcct    92340

gttgcagacg ggcgataatg ccgttgtaac cggtcatccc cgagtacggc tgcagcgccc    92400

gcgtccggct gaccagcgtg ccggacaccg gcagcacggc gatgccgttc atgacctgat    92460

aactgcgggc ctgtcgtggt ccgtcatcat caccggataa tgccagcgtc gcgagtgcct    92520

cctgggcagt caggctgtcg ccggacaccg catccgtcag gctgctgatc ccaagctggc    92580

ctgcaagcgc acaaaagaaa acccgcgcat aggcgggttc aagcatcagc ggctcattaa    92640

aggccatgct ggcaatatgc gggagattac gcagctctgc tgtcactctt ctcctcctct    92700

gttgattgtc gcagcccgga ttcaaatgct gcagccgccc aggcgggcgg tttaagaccg    92760

gctgcacggc gctccatcgt ttcacggacc tgctgggcaa aaatttcctg atagtcgtca    92820

ccgcgttttg cgcactcttt ctcgtaggta ctcagtccgg cttctatcag catcaccgct    92880

tcctgaactt ctttcagacc atcgatggcc atacgaccgg agcctatcca gtcgcagttc    92940

ccccaggcac tgcgggcttc ctgaaaactg aagcgcgctt ttgaaggtaa cgtcaccacg    93000

cggcgaacga tggcctcttc cagccagcac agaaacatct ggctcgcctg acgggatgcg    93060

acgaattttc gccgccccat aaagtacgcc cacgactcgt tcgcactggc ccgtgccgtg    93120

gagtagctca tctgggcgta attccgggaa agctgctcat acgagacacc cagcccggca    93180

gcgatatacc gcagcagtga ctgctcaaac acggagtagc cgttatccgt atcctgagcc    93240

gtctgcaggt tcagtgagtc acccggcatc aggtgcggta cttttgcgcc tcccagccgg    93300

accggcgctg cggcgtaata cgcggcaatt tcaccaatcc agccggtcag cctttcccgc    93360

tgctcctgac tgttcgcgcc cagaataaaa tccatcgctg actgcgtatc cagctcactc    93420

tcaatggtgg cggcatacat cgccttcaca atggcgctct gcagctgcgt gttctgcagc    93480

gtgtcgagca tcttcatctg ctccatcacg ctgtaaaaca catttgcacc gcgagtctgc    93540

ccgtcctcca cgggttcaaa aacgtgaatg aacgaggcgc gcccgccggg taactcacgg    93600

ggtatccatg tccatttctg cggcatccag ccaggatacc cgtcctcgct gacgtaatat    93660

cccagcgccg caccgctgtc attaatctgc acaccggcac ggcagttccg gctgtcgccg    93720

gtattgttcg ggttgctgat gcgcttcggg ctgaccatcc ggaactgtgt ccggaaaagc    93780

cgcgacgaac tggtatccca ggtggcctga acgaacagtt caccgttaaa ggcgtgcatg    93840

gccacacctt cccgaatcat catggtaaac gtgcgttttc gctcaacgtc aatgcagcag    93900

cagtcatcct cggcaaactc tttccatgcc gcttcaacct cgcgggaaaa ggcacgggct    93960

tcttcctccc cgatgcccag atagcgccag cttgggcgat gactgagccg gaaaaaagac    94020

ccgacgatat gatcctgatg cagctggatg gcgttggcgg catagccgtt attgcgtacc    94080

agatcgtctg cgcgggcatt gccacgggta aagttgggca acagggctgc atccacactt    94140

tcactcggtg ggttccacga ccgcaactgc cctccaaatc cgctgccacc gccgtgataa    94200

ccggcatatt cgcgcagcga tgtcatgccg tccggcccca gaagggtggg aatggtgggc    94260

gttttcatac ataaaatcct gcaggtcccc tgcgtcgctg tgtcatgccg gtctgcactt    94320

ccagctctgc aatatatttt ttcaggtcag acacggaagt ggccgtaaac tccacccttc    94380

gtccgtcttt ctgtactgtt gccacccgtt tacctgtcat caggtcatgc agtgccgcac    94440

gggcagcggc aagttcttcc tgtcgcgtca ttcatcctct ccggataagg cacgggcgta    94500

atctgccagt gttttcttgt tggttgctgc accatcctct tcctgcaggc tcgccagcag    94560

cgcactgaga tccagctgcc agcgggaaat actgatgcgc agcgccgcca gcgcataaac    94620

gaagcagtcg agtgcctcat tgcgtcgctt tttgctgtcc cacagtattt ttttcctgcc    94680

atccacccat ttttcgacct gctcttcagc agtcagctgc tgcgcttcgg tcagatcaaa    94740

aatatccggg ttattcggga agtgaacggc accgggaagc ggttcatccc cttccggcgt    94800

cagtgtgaag cggttataaa tctgctcttt cgcggtatcc gtaccgattt cggtaaggta    94860

aaccccgttt ttgtttcgct tacgtggcat gctggccacc ggctttccgt agacggatgc    94920

ccctttaatg gggatcaccc ggaacagccc atgttttttc gagcgttcat acacaatggt    94980

cgggtcaatc ccgccagtat cccagcagat acgggatatc gacatttctg caccattccg    95040

gcgggtatag gttttattga tggcctcatc cacacgcagc agcgtctgtt catcgtcgtg    95100

gcggcccata ataatctgcc ggtcaatcag ccagctttcc tcacccggcc cccatcccca    95160

tacgcgcatt tcgtagcggt ccagctggga gtcgataccg gcggtcaggt aagccacacg    95220

gtcaggaacg ggcgctgaat aatgctcttt ccgctctgcc atcacttcag catccggacg    95280

ttcgccaatt ttcgcctccc acgtctcacc gagcgtggtg tttacgaagg ttttacgttt    95340

tcccgtatcc cctttcgttt tcatccagtc tttgacaatc tgcacccagg tggtgaacgg    95400

gctgtacgct gtccagatgt gaaaggtcac actgtcaggt ggctcaatct cttcaccgga    95460

tgacgaaaac cagagaatgc catcacgggt ccagatcccg gtcttttcgc agatataacg    95520

ggcatcagta aagtccagct cctgctggcg gatgacgcag gcattatgct cgcagagata    95580

aaacacgctg gaggggtcat ccggcgtcca tttgaggcca aacggcgtct ctttgtcgcc    95640

aaatttaaga tactgctcct ccccgcaatg cgggcaggca acatgaaaac gcataaaatg    95700

cggggattca ctggctgcac gctcaatctg acaggtgcct ctcacttttg gcgtggagcc    95760

acggatggac tttggccaga ccgagccttc aatacgcttg tcacccagga acgtcggaga    95820

gccttcctgt tcaatatcat catcaaaagc agcaagttca tcataacccg ccacatccac    95880

cgacttttca cggtagtttt ttgccgcttt accgcccagg caccagaagc cacgcccatt    95940

agtgaaacgc ttcatggtga gcgtgttatc ccggtgcttt ttgccatacc acggggccag    96000

cgccagcagc gacggaatat cacgaatagt cggctcaacg tgggttttca taaagttctc    96060

ggcatcacca tccgtcggca accagataag ggtgttgcgc tgcttatgct ctataaagta    96120

ggcataaaca cccagcagca ttttggaata accgacacgg gcagacttca ccacattcac    96180

ctcacggatg tagtcgctgc ccatcgcatt catgatggcc cgctgaaagg gcagtgtttc    96240

ccagcgccct tcctggtatg cggattcttt cgggagatag taattagcat ccgcccattc    96300

aacggcggtc tgtggctccg gcctgaacag tgagcgaagc ccggcgcgga caaaatgccg    96360

cagcctgtta acctgactgt tcgatatatt cactcagcaa ccccggtatc agttcatcca    96420

gcgcggctgc tttgttcatg gctttgatga tatcccgttt caggaaatca acatgtcggt    96480

tttccagttc cggaaaacgc cgctgcaccg acagggggag cccgtcgaga atactggcaa    96540

tttcacctgc gatccgcgac agcacgaaag tacagaatgc ggtttccacc acttcagcgg    96600

agtctctggc attcttcagt tcctgtgcgt cggcctgcgc acgcgtaagt cgatggcgtt    96660

cgtactcaat agttcctggc tggagatctg cctcgctggc ctgccgcagt tcttcaacct    96720

cccggcgcag cttttcgttc tcaatttcag catccctttc ggcataccat tttatgacgg    96780

cggcagagtc ataaagcacc tcattaccct tgccaccgcc tcgcagaacg ggcattccct    96840

gttcctgcca gttctgaatg gtacggatac tcgcaccgaa aatgtcagcc agctgctttt    96900

tgttgacttc cattgttcat tccacggaca aaaacagaga aaggaaacga cagaggccaa    96960

aaagcctcgc tttcagcacc tgtcgtttcc tttcttttca gagggtattt taaataaaaa    97020

cattaagtta tgacgaagaa gaacggaaac gccttaaacc ggaaaatttt cataaatagc    97080

gaaaacccgc gaggtcgccg cccaggtcgc cgcccgtcaa tcggcccttt agtggagc      97138


<210>  63
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 1

<400>  63
gcaatatcag caccaacaga aacaacct                                          28


<210>  64
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 2


<220>
<221>  misc_feature
<222>  (37)..(37)
<223>  Carboxyfluorescein (FAM) attached to T

<400>  64
tttttttttt tttttttttt tttttttttt tttttttttt tttt                        44


<210>  65
<211>  75
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Circular sequence used in Example 2


<220>
<221>  MISC_FEATURE
<222>  (1)..(75)
<223>  Carboxyfluorescein (FAM) attached to one T in sequence

<400>  65

Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
1               5                   10                  15      


Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
            20                  25                  30          


Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
        35                  40                  45              


Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
    50                  55                  60                  


Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
65                  70                  75  


<210>  66
<211>  970
<212>  PRT
<213>  Clostridium botulinum

<400>  66

Met Leu Ser Val Ala Asn Val Arg Ser Pro Ser Ala Ala Ala Ser Tyr 
1               5                   10                  15      


Phe Ala Ser Asp Asn Tyr Tyr Ala Ser Ala Asp Ala Asp Arg Ser Gly 
            20                  25                  30          


Gln Trp Ile Gly Asp Gly Ala Lys Arg Leu Gly Leu Glu Gly Lys Val 
        35                  40                  45              


Glu Ala Arg Ala Phe Asp Ala Leu Leu Arg Gly Glu Leu Pro Asp Gly 
    50                  55                  60                  


Ser Ser Val Gly Asn Pro Gly Gln Ala His Arg Pro Gly Thr Asp Leu 
65                  70                  75                  80  


Thr Phe Ser Val Pro Lys Ser Trp Ser Leu Leu Ala Leu Val Gly Lys 
                85                  90                  95      


Asp Glu Arg Ile Ile Ala Ala Tyr Arg Glu Ala Val Val Glu Ala Leu 
            100                 105                 110         


His Trp Ala Glu Lys Asn Ala Ala Glu Thr Arg Val Val Glu Lys Gly 
        115                 120                 125             


Met Val Val Thr Gln Ala Thr Gly Asn Leu Ala Ile Gly Leu Phe Gln 
    130                 135                 140                 


His Asp Thr Asn Arg Asn Gln Glu Pro Asn Leu His Phe His Ala Val 
145                 150                 155                 160 


Ile Ala Asn Val Thr Gln Gly Lys Asp Gly Lys Trp Arg Thr Leu Lys 
                165                 170                 175     


Asn Asp Arg Leu Trp Gln Leu Asn Thr Thr Leu Asn Ser Ile Ala Met 
            180                 185                 190         


Ala Arg Phe Arg Val Ala Val Glu Lys Leu Gly Tyr Glu Pro Gly Pro 
        195                 200                 205             


Val Leu Lys His Gly Asn Phe Glu Ala Arg Gly Ile Ser Arg Glu Gln 
    210                 215                 220                 


Val Met Ala Phe Ser Thr Arg Arg Lys Glu Val Leu Glu Ala Arg Arg 
225                 230                 235                 240 


Gly Pro Gly Leu Asp Ala Gly Arg Ile Ala Ala Leu Asp Thr Arg Ala 
                245                 250                 255     


Ser Lys Glu Gly Ile Glu Asp Arg Ala Thr Leu Ser Lys Gln Trp Ser 
            260                 265                 270         


Glu Ala Ala Gln Ser Ile Gly Leu Asp Leu Lys Pro Leu Val Asp Arg 
        275                 280                 285             


Ala Arg Thr Lys Ala Leu Gly Gln Gly Met Glu Ala Thr Arg Ile Gly 
    290                 295                 300                 


Ser Leu Val Glu Arg Gly Arg Ala Trp Leu Ser Arg Phe Ala Ala His 
305                 310                 315                 320 


Val Arg Gly Asp Pro Ala Asp Pro Leu Val Pro Pro Ser Val Leu Lys 
                325                 330                 335     


Gln Asp Arg Gln Thr Ile Ala Ala Ala Gln Ala Val Ala Ser Ala Val 
            340                 345                 350         


Arg His Leu Ser Gln Arg Glu Ala Ala Phe Glu Arg Thr Ala Leu Tyr 
        355                 360                 365             


Lys Ala Ala Leu Asp Phe Gly Leu Pro Thr Thr Ile Ala Asp Val Glu 
    370                 375                 380                 


Lys Arg Thr Arg Ala Leu Val Arg Ser Gly Asp Leu Ile Ala Gly Lys 
385                 390                 395                 400 


Gly Glu His Lys Gly Trp Leu Ala Ser Arg Asp Ala Val Val Thr Glu 
                405                 410                 415     


Gln Arg Ile Leu Ser Glu Val Ala Ala Gly Lys Gly Asp Ser Ser Pro 
            420                 425                 430         


Ala Ile Thr Pro Gln Lys Ala Ala Ala Ser Val Gln Ala Ala Ala Leu 
        435                 440                 445             


Thr Gly Gln Gly Phe Arg Leu Asn Glu Gly Gln Leu Ala Ala Ala Arg 
    450                 455                 460                 


Leu Ile Leu Ile Ser Lys Asp Arg Thr Ile Ala Val Gln Gly Ile Ala 
465                 470                 475                 480 


Gly Ala Gly Lys Ser Ser Val Leu Lys Pro Val Ala Glu Val Leu Arg 
                485                 490                 495     


Asp Glu Gly His Pro Val Ile Gly Leu Ala Ile Gln Asn Thr Leu Val 
            500                 505                 510         


Gln Met Leu Glu Arg Asp Thr Gly Ile Gly Ser Gln Thr Leu Ala Arg 
        515                 520                 525             


Phe Leu Gly Gly Trp Asn Lys Leu Leu Asp Asp Pro Gly Asn Val Ala 
    530                 535                 540                 


Leu Arg Ala Glu Ala Gln Ala Ser Leu Lys Asp His Val Leu Val Leu 
545                 550                 555                 560 


Asp Glu Ala Ser Met Val Ser Asn Glu Asp Lys Glu Lys Leu Val Arg 
                565                 570                 575     


Leu Ala Asn Leu Ala Gly Val His Arg Leu Val Leu Ile Gly Asp Arg 
            580                 585                 590         


Lys Gln Leu Gly Ala Val Asp Ala Gly Lys Pro Phe Ala Leu Leu Gln 
        595                 600                 605             


Arg Ala Gly Ile Ala Arg Ala Glu Met Ala Thr Asn Leu Arg Ala Arg 
    610                 615                 620                 


Asp Pro Val Val Arg Glu Ala Gln Ala Ala Ala Gln Ala Gly Asp Val 
625                 630                 635                 640 


Arg Lys Ala Leu Arg His Leu Lys Ser His Thr Val Glu Ala Arg Gly 
                645                 650                 655     


Asp Gly Ala Gln Val Ala Ala Glu Thr Trp Leu Ala Leu Asp Lys Glu 
            660                 665                 670         


Thr Arg Ala Arg Thr Ser Ile Tyr Ala Ser Gly Arg Ala Ile Arg Ser 
        675                 680                 685             


Ala Val Asn Ala Ala Val Gln Gln Gly Leu Leu Ala Ser Arg Glu Ile 
    690                 695                 700                 


Gly Pro Ala Lys Met Lys Leu Glu Val Leu Asp Arg Val Asn Thr Thr 
705                 710                 715                 720 


Arg Glu Glu Leu Arg His Leu Pro Ala Tyr Arg Ala Gly Arg Val Leu 
                725                 730                 735     


Glu Val Ser Arg Lys Gln Gln Ala Leu Gly Leu Phe Ile Gly Glu Tyr 
            740                 745                 750         


Arg Val Ile Gly Gln Asp Arg Lys Gly Lys Leu Val Glu Val Glu Asp 
        755                 760                 765             


Lys Arg Gly Lys Arg Phe Arg Phe Asp Pro Ala Arg Ile Arg Ala Gly 
    770                 775                 780                 


Lys Gly Asp Asp Asn Leu Thr Leu Leu Glu Pro Arg Lys Leu Glu Ile 
785                 790                 795                 800 


His Glu Gly Asp Arg Ile Arg Trp Thr Arg Asn Asp His Arg Arg Gly 
                805                 810                 815     


Leu Phe Asn Ala Asp Gln Ala Arg Val Val Glu Ile Ala Asn Gly Lys 
            820                 825                 830         


Val Thr Phe Glu Thr Ser Lys Gly Asp Leu Val Glu Leu Lys Lys Asp 
        835                 840                 845             


Asp Pro Met Leu Lys Arg Ile Asp Leu Ala Tyr Ala Leu Asn Val His 
    850                 855                 860                 


Met Ala Gln Gly Leu Thr Ser Asp Arg Gly Ile Ala Val Met Asp Ser 
865                 870                 875                 880 


Arg Glu Arg Asn Leu Ser Asn Gln Lys Thr Phe Leu Val Thr Val Thr 
                885                 890                 895     


Arg Leu Arg Asp His Leu Thr Leu Val Val Asp Ser Ala Asp Lys Leu 
            900                 905                 910         


Gly Ala Ala Val Ala Arg Asn Lys Gly Glu Lys Ala Ser Ala Ile Glu 
        915                 920                 925             


Val Thr Gly Ser Val Lys Pro Thr Ala Thr Lys Gly Ser Gly Val Asp 
    930                 935                 940                 


Gln Pro Lys Ser Val Glu Ala Asn Lys Ala Glu Lys Glu Leu Thr Arg 
945                 950                 955                 960 


Ser Lys Ser Lys Thr Leu Asp Phe Gly Ile 
                965                 970 


<210>  67
<211>  46
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence used in Examples 3 and 4

<400>  67

Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
1               5                   10                  15      


Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
            20                  25                  30          


Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
        35                  40                  45      


<210>  68
<211>  1292
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 3

<400>  68
gccatcagat tgtgtttgtt agtcgctttt tttttttgga attttttttt tggaattttt       60

tttttgacgc tcagtaatgt gacgatagct gaaaactgta cgataaacgg tacgctgagg      120

gcggaaaaaa tcgtcgggga cattgtaaag gcggcgagcg cggcttttcc gcgccagcgt      180

gaaagcagtg tggactggcc gtcaggtacc cgtactgtca ccgtgaccga tgaccatcct      240

tttgatcgcc agatagtggt gcttccgctg acgtttcgcg gaagtaagcg tactgtcagc      300

ggcaggacaa cgtattcgat gtgttatctg aaagtactga tgaacggtgc ggtgatttat      360

gatggcgcgg cgaacgaggc ggtacaggtg ttctcccgta ttgttgacat gccagcgggt      420

cggggaaacg tgatcctgac gttcacgctt acgtccacac ggcattcggc agatattccg      480

ccgtatacgt ttgccagcga tgtgcaggtt atggtgatta agaaacaggc gctgggcatc      540

agcgtggtct gagtgtgttt tttttttgga attttttttt tggaattttt tttttcatcg      600

tcgtgagtag tgaaccgtaa gctgcgttct gtttcggatg tatgaaaaca tacatccgaa      660

acagaacgca gcttacggtt cactactcac gacgatgaaa aaaaaaattc caaaaaaaaa      720

attccaaaaa aaaaacacac tcagaccacg ctgatgccca gcgcctgttt cttaatcacc      780

ataacctgca catcgctggc aaacgtatac ggcggaatat ctgccgaatg ccgtgtggac      840

gtaagcgtga acgtcaggat cacgtttccc cgacccgctg gcatgtcaac aatacgggag      900

aacacctgta ccgcctcgtt cgccgcgcca tcataaatca ccgcaccgtt catcagtact      960

ttcagataac acatcgaata cgttgtcctg ccgctgacag tacgcttact tccgcgaaac     1020

gtcagcggaa gcaccactat ctggcgatca aaaggatggt catcggtcac ggtgacagta     1080

cgggtacctg acggccagtc cacactgctt tcacgctggc gcggaaaagc cgcgctcgcc     1140

gcctttacaa tgtccccgac gattttttcc gccctcagcg taccgtttat cgtacagttt     1200

tcagctatcg tcacattact gagcgtcaaa aaaaaaattc caaaaaaaaa attccaaaaa     1260

aaaaaagcga ctaacaaaca caatctgatg gc                                   1292


<210>  69
<211>  7240
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 4

<400>  69
gccatcagat tgtgtttgtt agtcgctgcc atcagattgt gtttgttagt cgcttttttt       60

ttttggaatt ttttttttgg aatttttttt ttgcgctaac aacctcctgc cgttttgccc      120

gtgcatatcg gtcacgaaca aatctgatta ctaaacacag tagcctggat ttgttctatc      180

agtaatcgac cttattccta attaaataga gcaaatcccc ttattggggg taagacatga      240

agatgccaga aaaacatgac ctgttggccg ccattctcgc ggcaaaggaa caaggcatcg      300

gggcaatcct tgcgtttgca atggcgtacc ttcgcggcag atataatggc ggtgcgttta      360

caaaaacagt aatcgacgca acgatgtgcg ccattatcgc ctagttcatt cgtgaccttc      420

tcgacttcgc cggactaagt agcaatctcg cttatataac gagcgtgttt atcggctaca      480

tcggtactga ctcgattggt tcgcttatca aacgcttcgc tgctaaaaaa gccggagtag      540

aagatggtag aaatcaataa tcaacgtaag gcgttcctcg atatgctggc gtggtcggag      600

ggaactgata acggacgtca gaaaaccaga aatcatggtt atgacgtcat tgtaggcgga      660

gagctattta ctgattactc cgatcaccct cgcaaacttg tcacgctaaa cccaaaactc      720

aaatcaacag gcgccggacg ctaccagctt ctttcccgtt ggtgggatgc ctaccgcaag      780

cagcttggcc tgaaagactt ctctccgaaa agtcaggacg ctgtggcatt gcagcagatt      840

aaggagcgtg gcgctttacc tatgattgat cgtggtgata tccgtcaggc aatcgaccgt      900

tgcagcaata tctgggcttc actgccgggc gctggttatg gtcagttcga gcataaggct      960

gacagcctga ttgcaaaatt caaagaagcg ggcggaacgg tcagagagat tgatgtatga     1020

gcagagtcac cgcgattatc tccgctctgg ttatctgcat catcgtctgc ctgtcatggg     1080

ctgttaatca ttaccgtgat aacgccatta cctacaaagc ccagcgcgac aaaaatgcca     1140

gagaactgaa gctggcgaac gcggcaatta ctgacatgca gatgcgtcag cgtgatgttg     1200

ctgcgctcga tgcaaaatac acgaaggagt tagctgatgc taaagctgaa aatgatgctc     1260

tgcgtgatga tgttgccgct ggtcgtcgtc ggttgcacat caaagcagtc tgtcagtcag     1320

tgcgtgaagc caccaccgcc tccggcgtgg ataatgcagc ctccccccga ctggcagaca     1380

ccgctgaacg ggattatttc accctcagag agaggctgat cactatgcaa aaacaactgg     1440

aaggaaccca gaagtatatt aatgagcagt gcagatagag ttgcccatat cgatgggcaa     1500

ctcatgcaat tattgtgagc aatacacacg cgcttccagc ggagtataaa tgcctaaagt     1560

aataaaaccg agcaatccat ttacgaatgt ttgctgggtt tctgttttaa caacattttc     1620

tgcgccgcca caaattttgg ctgcatcgac agttttcttc tgcccaattc cagaaacgaa     1680

gaaatgatgg gtgatggttt cctttggtgc tactgctgcc ggtttgtttt gaacagtaaa     1740

cgtctgttga gcacatcctg taataagcag ggccagcgca gtagcgagta gcattttttt     1800

catggtgtta ttcccgatgc tttttgaagt tcgcagaatc gtatgtgtag aaaattaaac     1860

aaaccctaaa caatgagttg aaatttcata ttgttaatat ttattaatgt atgtcaggtg     1920

cgatgaatcg tcattgtatt cccggattaa ctatgtccac agccctgacg gggaacttct     1980

ctgcgggagt gtccgggaat aattaaaacg atgcacacag ggtttagcgc gtacacgtat     2040

tgcattatgc caacgccccg gtgctgacac ggaagaaacc ggacgttatg atttagcgtg     2100

gaaagatttg tgtagtgttc tgaatgctct cagtaaatag taatgaatta tcaaaggtat     2160

agtaatatct tttatgttca tggatatttg taacccatcg gaaaactcct gctttagcaa     2220

gattttccct gtattgctga aatgtgattt ctcttgattt caacctatca taggacgttt     2280

ctataagatg cgtgtttctt gagaatttaa catttacaac ctttttaagt ccttttatta     2340

acacggtgtt atcgttttct aacacgatgt gaatattatc tgtggctaga tagtaaatat     2400

aatgtgagac gttgtgacgt tttagttcag aataaaacaa ttcacagtct aaatcttttc     2460

gcacttgatc gaatatttct ttaaaaatgg caacctgagc cattggtaaa accttccatg     2520

tgatacgagg gcgcgtagtt tgcattatcg tttttatcgt ttcaatctgg tctgacctcc     2580

ttgtgttttg ttgatgattt atgtcaaata ttaggaatgt tttcacttaa tagtattggt     2640

tgcgtaacaa agtgcggtcc tgctggcatt ctggagggaa atacaaccga cagatgtatg     2700

taaggccaac gtgctcaaat cttcatacag aaagatttga agtaatattt taaccgctag     2760

atgaagagca agcgcatgga gcgacaaaat gaataaagaa caatctgctg atgatccctc     2820

cgtggatctg attcgtgtaa aaaatatgct taatagcacc atttctatga gttaccctga     2880

tgttgtaatt gcatgtatag aacataaggt gtctctggaa gcattcagag caattgaggc     2940

agcgttggtg aagcacgata ataatatgaa ggattattcc ctggtggttg actgatcacc     3000

ataactgcta atcattcaaa ctatttagtc tgtgacagag ccaacacgca gtctgtcact     3060

gtcaggaaag tggtaaaact gcaactcaat tactgcaatg ccctcgtaat taagtgaatt     3120

tacaatatcg tcctgttcgg agggaagaac gcgggatgtt cattcttcat cacttttaat     3180

tgatgtatat gctctctttt ctgacgttag tctccgacgg caggcttcaa tgacccaggc     3240

tgagaaattc ccggaccctt tttgctcaag agcgatgtta atttgttcaa tcatttggtt     3300

aggaaagcgg atgttgcggg ttgttgttct gcgggttctg ttcttcgttg acatgaggtt     3360

gccccgtatt cagtgtcgct gatttgtatt gtctgaagtt gtttttacgt taagttgatg     3420

cagatcaatt aatacgatac ctgcgtcata attgattatt tgacgtggtt tgatggcctc     3480

cacgcacgtt gtgatatgta gatgataatc attatcactt tacgggtcct ttccggtgaa     3540

aaaaaaggta ccaaaaaaaa catcgtcgtg agtagtgaac cgtaagcacg ttctgtttat     3600

gtttcttgtt tgttagcctt ttggctaaca aacaagaaac ataaacagaa cgtgcttacg     3660

gttcactact cacgacgatg ttttttttgg tacctttttt ttcaccggaa aggacccgta     3720

aagtgataat gattatcatc tacatatcac aacgtgcgtg gaggccatca aaccacgtca     3780

aataatcaat tatgacgcag gtatcgtatt aattgatctg catcaactta acgtaaaaac     3840

aacttcagac aatacaaatc agcgacactg aatacggggc aacctcatgt caacgaagaa     3900

cagaacccgc agaacaacaa cccgcaacat ccgctttcct aaccaaatga ttgaacaaat     3960

taacatcgct cttgagcaaa aagggtccgg gaatttctca gcctgggtca ttgaagcctg     4020

ccgtcggaga ctaacgtcag aaaagagagc atatacatca attaaaagtg atgaagaatg     4080

aacatcccgc gttcttccct ccgaacagga cgatattgta aattcactta attacgaggg     4140

cattgcagta attgagttgc agttttacca ctttcctgac agtgacagac tgcgtgttgg     4200

ctctgtcaca gactaaatag tttgaatgat tagcagttat ggtgatcagt caaccaccag     4260

ggaataatcc ttcatattat tatcgtgctt caccaacgct gcctcaattg ctctgaatgc     4320

ttccagagac accttatgtt ctatacatgc aattacaaca tcagggtaac tcatagaaat     4380

ggtgctatta agcatatttt ttacacgaat cagatccacg gagggatcat cagcagattg     4440

ttctttattc attttgtcgc tccatgcgct tgctcttcat ctagcggtta aaatattact     4500

tcaaatcttt ctgtatgaag atttgagcac gttggcctta catacatctg tcggttgtat     4560

ttccctccag aatgccagca ggaccgcact ttgttacgca accaatacta ttaagtgaaa     4620

acattcctaa tatttgacat aaatcatcaa caaaacacaa ggaggtcaga ccagattgaa     4680

acgataaaaa cgataatgca aactacgcgc cctcgtatca catggaaggt tttaccaatg     4740

gctcaggttg ccatttttaa agaaatattc gatcaagtgc gaaaagattt agactgtgaa     4800

ttgttttatt ctgaactaaa acgtcacaac gtctcacatt atatttacta tctagccaca     4860

gataatattc acatcgtgtt agaaaacgat aacaccgtgt taataaaagg acttaaaaag     4920

gttgtaaatg ttaaattctc aagaaacacg catcttatag aaacgtccta tgataggttg     4980

aaatcaagag aaatcacatt tcagcaatac agggaaaatc ttgctaaagc aggagttttc     5040

cgatgggtta caaatatcca tgaacataaa agatattact atacctttga taattcatta     5100

ctatttactg agagcattca gaacactaca caaatctttc cacgctaaat cataacgtcc     5160

ggtttcttcc gtgtcagcac cggggcgttg gcataatgca atacgtgtac gcgctaaacc     5220

ctgtgtgcat cgttttaatt attcccggac actcccgcag agaagttccc cgtcagggct     5280

gtggacatag ttaatccggg aatacaatga cgattcatcg cacctgacat acattaataa     5340

atattaacaa tatgaaattt caactcattg tttagggttt gtttaatttt ctacacatac     5400

gattctgcga acttcaaaaa gcatcgggaa taacaccatg aaaaaaatgc tactcgctac     5460

tgcgctggcc ctgcttatta caggatgtgc tcaacagacg tttactgttc aaaacaaacc     5520

ggcagcagta gcaccaaagg aaaccatcac ccatcatttc ttcgtttctg gaattgggca     5580

gaagaaaact gtcgatgcag ccaaaatttg tggcggcgca gaaaatgttg ttaaaacaga     5640

aacccagcaa acattcgtaa atggattgct cggttttatt actttaggca tttatactcc     5700

gctggaagcg cgtgtgtatt gctcacaata attgcatgag ttgcccatcg atatgggcaa     5760

ctctatctgc actgctcatt aatatacttc tgggttcctt ccagttgttt ttgcatagtg     5820

atcagcctct ctctgagggt gaaataatcc cgttcagcgg tgtctgccag tcggggggag     5880

gctgcattat ccacgccgga ggcggtggtg gcttcacgca ctgactgaca gactgctttg     5940

atgtgcaacc gacgacgacc agcggcaaca tcatcacgca gagcatcatt ttcagcttta     6000

gcatcagcta actccttcgt gtattttgca tcgagcgcag caacatcacg ctgacgcatc     6060

tgcatgtcag taattgccgc gttcgccagc ttcagttctc tggcattttt gtcgcgctgg     6120

gctttgtagg taatggcgtt atcacggtaa tgattaacag cccatgacag gcagacgatg     6180

atgcagataa ccagagcgga gataatcgcg gtgactctgc tcatacatca atctctctga     6240

ccgttccgcc cgcttctttg aattttgcaa tcaggctgtc agccttatgc tcgaactgac     6300

cataaccagc gcccggcagt gaagcccaga tattgctgca acggtcgatt gcctgacgga     6360

tatcaccacg atcaatcata ggtaaagcgc cacgctcctt aatctgctgc aatgccacag     6420

cgtcctgact tttcggagag aagtctttca ggccaagctg cttgcggtag gcatcccacc     6480

aacgggaaag aagctggtag cgtccggcgc ctgttgattt gagttttggg tttagcgtga     6540

caagtttgcg agggtgatcg gagtaatcag taaatagctc tccgcctaca atgacgtcat     6600

aaccatgatt tctggttttc tgacgtccgt tatcagttcc ctccgaccac gccagcatat     6660

cgaggaacgc cttacgttga ttattgattt ctaccatctt ctactccggc ttttttagca     6720

gcgaagcgtt tgataagcga accaatcgag tcagtaccga tgtagccgat aaacacgctc     6780

gttatataag cgagattgct acttagtccg gcgaagtcga gaaggtcacg aatgaactag     6840

gcgataatgg cgcacatcgt tgcgtcgatt actgtttttg taaacgcacc gccattatat     6900

ctgccgcgaa ggtacgccat tgcaaacgca aggattgccc cgatgccttg ttcctttgcc     6960

gcgagaatgg cggccaacag gtcatgtttt tctggcatct tcatgtctta cccccaataa     7020

ggggatttgc tctatttaat taggaataag gtcgattact gatagaacaa atccaggcta     7080

ctgtgtttag taatcagatt tgttcgtgac cgatatgcac gggcaaaacg gcaggaggtt     7140

gttagcgcaa aaaaaaaatt ccaaaaaaaa aattccaaaa aaaaaaagcg actaacaaac     7200

acaatctgat ggcagcgact aacaaacaca atctgatggc                           7240


<210>  70
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 5 and 6

<400>  70
tttttttttt tttttttttt                                                   20


<210>  71
<211>  7240
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 5

<400>  71
gccatcagat tgtgtttgtt agtcgctgcc atcagattgt gtttgttagt cgcttttttt       60

ttttggaatt ttttttttgg aatttttttt ttgcgctaac aacctcctgc cgttttgccc      120

gtgcatatcg gtcacgaaca aatctgatta ctaaacacag tagcctggat ttgttctatc      180

agtaatcgac cttattccta attaaataga gcaaatcccc ttattggggg taagacatga      240

agatgccaga aaaacatgac ctgttggccg ccattctcgc ggcaaaggaa caaggcatcg      300

gggcaatcct tgcgtttgca atggcgtacc ttcgcggcag atataatggc ggtgcgttta      360

caaaaacagt aatcgacgca acgatgtgcg ccattatcgc ctagttcatt cgtgaccttc      420

tcgacttcgc cggactaagt agcaatctcg cttatataac gagcgtgttt atcggctaca      480

tcggtactga ctcgattggt tcgcttatca aacgcttcgc tgctaaaaaa gccggagtag      540

aagatggtag aaatcaataa tcaacgtaag gcgttcctcg atatgctggc gtggtcggag      600

ggaactgata acggacgtca gaaaaccaga aatcatggtt atgacgtcat tgtaggcgga      660

gagctattta ctgattactc cgatcaccct cgcaaacttg tcacgctaaa cccaaaactc      720

aaatcaacag gcgccggacg ctaccagctt ctttcccgtt ggtgggatgc ctaccgcaag      780

cagcttggcc tgaaagactt ctctccgaaa agtcaggacg ctgtggcatt gcagcagatt      840

aaggagcgtg gcgctttacc tatgattgat cgtggtgata tccgtcaggc aatcgaccgt      900

tgcagcaata tctgggcttc actgccgggc gctggttatg gtcagttcga gcataaggct      960

gacagcctga ttgcaaaatt caaagaagcg ggcggaacgg tcagagagat tgatgtatga     1020

gcagagtcac cgcgattatc tccgctctgg ttatctgcat catcgtctgc ctgtcatggg     1080

ctgttaatca ttaccgtgat aacgccatta cctacaaagc ccagcgcgac aaaaatgcca     1140

gagaactgaa gctggcgaac gcggcaatta ctgacatgca gatgcgtcag cgtgatgttg     1200

ctgcgctcga tgcaaaatac acgaaggagt tagctgatgc taaagctgaa aatgatgctc     1260

tgcgtgatga tgttgccgct ggtcgtcgtc ggttgcacat caaagcagtc tgtcagtcag     1320

tgcgtgaagc caccaccgcc tccggcgtgg ataatgcagc ctccccccga ctggcagaca     1380

ccgctgaacg ggattatttc accctcagag agaggctgat cactatgcaa aaacaactgg     1440

aaggaaccca gaagtatatt aatgagcagt gcagatagag ttgcccatat cgatgggcaa     1500

ctcatgcaat tattgtgagc aatacacacg cgcttccagc ggagtataaa tgcctaaagt     1560

aataaaaccg agcaatccat ttacgaatgt ttgctgggtt tctgttttaa caacattttc     1620

tgcgccgcca caaattttgg ctgcatcgac agttttcttc tgcccaattc cagaaacgaa     1680

gaaatgatgg gtgatggttt cctttggtgc tactgctgcc ggtttgtttt gaacagtaaa     1740

cgtctgttga gcacatcctg taataagcag ggccagcgca gtagcgagta gcattttttt     1800

catggtgtta ttcccgatgc tttttgaagt tcgcagaatc gtatgtgtag aaaattaaac     1860

aaaccctaaa caatgagttg aaatttcata ttgttaatat ttattaatgt atgtcaggtg     1920

cgatgaatcg tcattgtatt cccggattaa ctatgtccac agccctgacg gggaacttct     1980

ctgcgggagt gtccgggaat aattaaaacg atgcacacag ggtttagcgc gtacacgtat     2040

tgcattatgc caacgccccg gtgctgacac ggaagaaacc ggacgttatg atttagcgtg     2100

gaaagatttg tgtagtgttc tgaatgctct cagtaaatag taatgaatta tcaaaggtat     2160

agtaatatct tttatgttca tggatatttg taacccatcg gaaaactcct gctttagcaa     2220

gattttccct gtattgctga aatgtgattt ctcttgattt caacctatca taggacgttt     2280

ctataagatg cgtgtttctt gagaatttaa catttacaac ctttttaagt ccttttatta     2340

acacggtgtt atcgttttct aacacgatgt gaatattatc tgtggctaga tagtaaatat     2400

aatgtgagac gttgtgacgt tttagttcag aataaaacaa ttcacagtct aaatcttttc     2460

gcacttgatc gaatatttct ttaaaaatgg caacctgagc cattggtaaa accttccatg     2520

tgatacgagg gcgcgtagtt tgcattatcg tttttatcgt ttcaatctgg tctgacctcc     2580

ttgtgttttg ttgatgattt atgtcaaata ttaggaatgt tttcacttaa tagtattggt     2640

tgcgtaacaa agtgcggtcc tgctggcatt ctggagggaa atacaaccga cagatgtatg     2700

taaggccaac gtgctcaaat cttcatacag aaagatttga agtaatattt taaccgctag     2760

atgaagagca agcgcatgga gcgacaaaat gaataaagaa caatctgctg atgatccctc     2820

cgtggatctg attcgtgtaa aaaatatgct taatagcacc atttctatga gttaccctga     2880

tgttgtaatt gcatgtatag aacataaggt gtctctggaa gcattcagag caattgaggc     2940

agcgttggtg aagcacgata ataatatgaa ggattattcc ctggtggttg actgatcacc     3000

ataactgcta atcattcaaa ctatttagtc tgtgacagag ccaacacgca gtctgtcact     3060

gtcaggaaag tggtaaaact gcaactcaat tactgcaatg ccctcgtaat taagtgaatt     3120

tacaatatcg tcctgttcgg agggaagaac gcgggatgtt cattcttcat cacttttaat     3180

tgatgtatat gctctctttt ctgacgttag tctccgacgg caggcttcaa tgacccaggc     3240

tgagaaattc ccggaccctt tttgctcaag agcgatgtta atttgttcaa tcatttggtt     3300

aggaaagcgg atgttgcggg ttgttgttct gcgggttctg ttcttcgttg acatgaggtt     3360

gccccgtatt cagtgtcgct gatttgtatt gtctgaagtt gtttttacgt taagttgatg     3420

cagatcaatt aatacgatac ctgcgtcata attgattatt tgacgtggtt tgatggcctc     3480

cacgcacgtt gtgatatgta gatgataatc attatcactt tacgggtcct ttccggtgaa     3540

aaaaaaggta ccaaaaaaaa catcgtcgtg agtagtgaac cgtaagcacg ttctgtttat     3600

gtttcttgtt tgttagcctt ttggctaaca aacaagaaac ataaacagaa cgtgcttacg     3660

gttcactact cacgacgatg ttttttttgg tacctttttt ttcaccggaa aggacccgta     3720

aagtgataat gattatcatc tacatatcac aacgtgcgtg gaggccatca aaccacgtca     3780

aataatcaat tatgacgcag gtatcgtatt aattgatctg catcaactta acgtaaaaac     3840

aacttcagac aatacaaatc agcgacactg aatacggggc aacctcatgt caacgaagaa     3900

cagaacccgc agaacaacaa cccgcaacat ccgctttcct aaccaaatga ttgaacaaat     3960

taacatcgct cttgagcaaa aagggtccgg gaatttctca gcctgggtca ttgaagcctg     4020

ccgtcggaga ctaacgtcag aaaagagagc atatacatca attaaaagtg atgaagaatg     4080

aacatcccgc gttcttccct ccgaacagga cgatattgta aattcactta attacgaggg     4140

cattgcagta attgagttgc agttttacca ctttcctgac agtgacagac tgcgtgttgg     4200

ctctgtcaca gactaaatag tttgaatgat tagcagttat ggtgatcagt caaccaccag     4260

ggaataatcc ttcatattat tatcgtgctt caccaacgct gcctcaattg ctctgaatgc     4320

ttccagagac accttatgtt ctatacatgc aattacaaca tcagggtaac tcatagaaat     4380

ggtgctatta agcatatttt ttacacgaat cagatccacg gagggatcat cagcagattg     4440

ttctttattc attttgtcgc tccatgcgct tgctcttcat ctagcggtta aaatattact     4500

tcaaatcttt ctgtatgaag atttgagcac gttggcctta catacatctg tcggttgtat     4560

ttccctccag aatgccagca ggaccgcact ttgttacgca accaatacta ttaagtgaaa     4620

acattcctaa tatttgacat aaatcatcaa caaaacacaa ggaggtcaga ccagattgaa     4680

acgataaaaa cgataatgca aactacgcgc cctcgtatca catggaaggt tttaccaatg     4740

gctcaggttg ccatttttaa agaaatattc gatcaagtgc gaaaagattt agactgtgaa     4800

ttgttttatt ctgaactaaa acgtcacaac gtctcacatt atatttacta tctagccaca     4860

gataatattc acatcgtgtt agaaaacgat aacaccgtgt taataaaagg acttaaaaag     4920

gttgtaaatg ttaaattctc aagaaacacg catcttatag aaacgtccta tgataggttg     4980

aaatcaagag aaatcacatt tcagcaatac agggaaaatc ttgctaaagc aggagttttc     5040

cgatgggtta caaatatcca tgaacataaa agatattact atacctttga taattcatta     5100

ctatttactg agagcattca gaacactaca caaatctttc cacgctaaat cataacgtcc     5160

ggtttcttcc gtgtcagcac cggggcgttg gcataatgca atacgtgtac gcgctaaacc     5220

ctgtgtgcat cgttttaatt attcccggac actcccgcag agaagttccc cgtcagggct     5280

gtggacatag ttaatccggg aatacaatga cgattcatcg cacctgacat acattaataa     5340

atattaacaa tatgaaattt caactcattg tttagggttt gtttaatttt ctacacatac     5400

gattctgcga acttcaaaaa gcatcgggaa taacaccatg aaaaaaatgc tactcgctac     5460

tgcgctggcc ctgcttatta caggatgtgc tcaacagacg tttactgttc aaaacaaacc     5520

ggcagcagta gcaccaaagg aaaccatcac ccatcatttc ttcgtttctg gaattgggca     5580

gaagaaaact gtcgatgcag ccaaaatttg tggcggcgca gaaaatgttg ttaaaacaga     5640

aacccagcaa acattcgtaa atggattgct cggttttatt actttaggca tttatactcc     5700

gctggaagcg cgtgtgtatt gctcacaata attgcatgag ttgcccatcg atatgggcaa     5760

ctctatctgc actgctcatt aatatacttc tgggttcctt ccagttgttt ttgcatagtg     5820

atcagcctct ctctgagggt gaaataatcc cgttcagcgg tgtctgccag tcggggggag     5880

gctgcattat ccacgccgga ggcggtggtg gcttcacgca ctgactgaca gactgctttg     5940

atgtgcaacc gacgacgacc agcggcaaca tcatcacgca gagcatcatt ttcagcttta     6000

gcatcagcta actccttcgt gtattttgca tcgagcgcag caacatcacg ctgacgcatc     6060

tgcatgtcag taattgccgc gttcgccagc ttcagttctc tggcattttt gtcgcgctgg     6120

gctttgtagg taatggcgtt atcacggtaa tgattaacag cccatgacag gcagacgatg     6180

atgcagataa ccagagcgga gataatcgcg gtgactctgc tcatacatca atctctctga     6240

ccgttccgcc cgcttctttg aattttgcaa tcaggctgtc agccttatgc tcgaactgac     6300

cataaccagc gcccggcagt gaagcccaga tattgctgca acggtcgatt gcctgacgga     6360

tatcaccacg atcaatcata ggtaaagcgc cacgctcctt aatctgctgc aatgccacag     6420

cgtcctgact tttcggagag aagtctttca ggccaagctg cttgcggtag gcatcccacc     6480

aacgggaaag aagctggtag cgtccggcgc ctgttgattt gagttttggg tttagcgtga     6540

caagtttgcg agggtgatcg gagtaatcag taaatagctc tccgcctaca atgacgtcat     6600

aaccatgatt tctggttttc tgacgtccgt tatcagttcc ctccgaccac gccagcatat     6660

cgaggaacgc cttacgttga ttattgattt ctaccatctt ctactccggc ttttttagca     6720

gcgaagcgtt tgataagcga accaatcgag tcagtaccga tgtagccgat aaacacgctc     6780

gttatataag cgagattgct acttagtccg gcgaagtcga gaaggtcacg aatgaactag     6840

gcgataatgg cgcacatcgt tgcgtcgatt actgtttttg taaacgcacc gccattatat     6900

ctgccgcgaa ggtacgccat tgcaaacgca aggattgccc cgatgccttg ttcctttgcc     6960

gcgagaatgg cggccaacag gtcatgtttt tctggcatct tcatgtctta cccccaataa     7020

ggggatttgc tctatttaat taggaataag gtcgattact gatagaacaa atccaggcta     7080

ctgtgtttag taatcagatt tgttcgtgac cgatatgcac gggcaaaacg gcaggaggtt     7140

gttagcgcaa aaaaaaaatt ccaaaaaaaa aattccaaaa aaaaaaagcg actaacaaac     7200

acaatctgat ggcagcgact aacaaacaca atctgatggc                           7240


<210>  72
<211>  3653
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 6


<220>
<221>  misc_feature
<222>  (3648)..(3648)
<223>  n is uridine

<400>  72
gccatcagat tgtgtttgtt agtcgctgcc atcagattgt gtttgttagt cgcttttttt       60

ttttggaatt ttttttttgg aatttttttt ttgcgctaac aacctcctgc cgttttgccc      120

gtgcatatcg gtcacgaaca aatctgatta ctaaacacag tagcctggat ttgttctatc      180

agtaatcgac cttattccta attaaataga gcaaatcccc ttattggggg taagacatga      240

agatgccaga aaaacatgac ctgttggccg ccattctcgc ggcaaaggaa caaggcatcg      300

gggcaatcct tgcgtttgca atggcgtacc ttcgcggcag atataatggc ggtgcgttta      360

caaaaacagt aatcgacgca acgatgtgcg ccattatcgc ctagttcatt cgtgaccttc      420

tcgacttcgc cggactaagt agcaatctcg cttatataac gagcgtgttt atcggctaca      480

tcggtactga ctcgattggt tcgcttatca aacgcttcgc tgctaaaaaa gccggagtag      540

aagatggtag aaatcaataa tcaacgtaag gcgttcctcg atatgctggc gtggtcggag      600

ggaactgata acggacgtca gaaaaccaga aatcatggtt atgacgtcat tgtaggcgga      660

gagctattta ctgattactc cgatcaccct cgcaaacttg tcacgctaaa cccaaaactc      720

aaatcaacag gcgccggacg ctaccagctt ctttcccgtt ggtgggatgc ctaccgcaag      780

cagcttggcc tgaaagactt ctctccgaaa agtcaggacg ctgtggcatt gcagcagatt      840

aaggagcgtg gcgctttacc tatgattgat cgtggtgata tccgtcaggc aatcgaccgt      900

tgcagcaata tctgggcttc actgccgggc gctggttatg gtcagttcga gcataaggct      960

gacagcctga ttgcaaaatt caaagaagcg ggcggaacgg tcagagagat tgatgtatga     1020

gcagagtcac cgcgattatc tccgctctgg ttatctgcat catcgtctgc ctgtcatggg     1080

ctgttaatca ttaccgtgat aacgccatta cctacaaagc ccagcgcgac aaaaatgcca     1140

gagaactgaa gctggcgaac gcggcaatta ctgacatgca gatgcgtcag cgtgatgttg     1200

ctgcgctcga tgcaaaatac acgaaggagt tagctgatgc taaagctgaa aatgatgctc     1260

tgcgtgatga tgttgccgct ggtcgtcgtc ggttgcacat caaagcagtc tgtcagtcag     1320

tgcgtgaagc caccaccgcc tccggcgtgg ataatgcagc ctccccccga ctggcagaca     1380

ccgctgaacg ggattatttc accctcagag agaggctgat cactatgcaa aaacaactgg     1440

aaggaaccca gaagtatatt aatgagcagt gcagatagag ttgcccatat cgatgggcaa     1500

ctcatgcaat tattgtgagc aatacacacg cgcttccagc ggagtataaa tgcctaaagt     1560

aataaaaccg agcaatccat ttacgaatgt ttgctgggtt tctgttttaa caacattttc     1620

tgcgccgcca caaattttgg ctgcatcgac agttttcttc tgcccaattc cagaaacgaa     1680

gaaatgatgg gtgatggttt cctttggtgc tactgctgcc ggtttgtttt gaacagtaaa     1740

cgtctgttga gcacatcctg taataagcag ggccagcgca gtagcgagta gcattttttt     1800

catggtgtta ttcccgatgc tttttgaagt tcgcagaatc gtatgtgtag aaaattaaac     1860

aaaccctaaa caatgagttg aaatttcata ttgttaatat ttattaatgt atgtcaggtg     1920

cgatgaatcg tcattgtatt cccggattaa ctatgtccac agccctgacg gggaacttct     1980

ctgcgggagt gtccgggaat aattaaaacg atgcacacag ggtttagcgc gtacacgtat     2040

tgcattatgc caacgccccg gtgctgacac ggaagaaacc ggacgttatg atttagcgtg     2100

gaaagatttg tgtagtgttc tgaatgctct cagtaaatag taatgaatta tcaaaggtat     2160

agtaatatct tttatgttca tggatatttg taacccatcg gaaaactcct gctttagcaa     2220

gattttccct gtattgctga aatgtgattt ctcttgattt caacctatca taggacgttt     2280

ctataagatg cgtgtttctt gagaatttaa catttacaac ctttttaagt ccttttatta     2340

acacggtgtt atcgttttct aacacgatgt gaatattatc tgtggctaga tagtaaatat     2400

aatgtgagac gttgtgacgt tttagttcag aataaaacaa ttcacagtct aaatcttttc     2460

gcacttgatc gaatatttct ttaaaaatgg caacctgagc cattggtaaa accttccatg     2520

tgatacgagg gcgcgtagtt tgcattatcg tttttatcgt ttcaatctgg tctgacctcc     2580

ttgtgttttg ttgatgattt atgtcaaata ttaggaatgt tttcacttaa tagtattggt     2640

tgcgtaacaa agtgcggtcc tgctggcatt ctggagggaa atacaaccga cagatgtatg     2700

taaggccaac gtgctcaaat cttcatacag aaagatttga agtaatattt taaccgctag     2760

atgaagagca agcgcatgga gcgacaaaat gaataaagaa caatctgctg atgatccctc     2820

cgtggatctg attcgtgtaa aaaatatgct taatagcacc atttctatga gttaccctga     2880

tgttgtaatt gcatgtatag aacataaggt gtctctggaa gcattcagag caattgaggc     2940

agcgttggtg aagcacgata ataatatgaa ggattattcc ctggtggttg actgatcacc     3000

ataactgcta atcattcaaa ctatttagtc tgtgacagag ccaacacgca gtctgtcact     3060

gtcaggaaag tggtaaaact gcaactcaat tactgcaatg ccctcgtaat taagtgaatt     3120

tacaatatcg tcctgttcgg agggaagaac gcgggatgtt cattcttcat cacttttaat     3180

tgatgtatat gctctctttt ctgacgttag tctccgacgg caggcttcaa tgacccaggc     3240

tgagaaattc ccggaccctt tttgctcaag agcgatgtta atttgttcaa tcatttggtt     3300

aggaaagcgg atgttgcggg ttgttgttct gcgggttctg ttcttcgttg acatgaggtt     3360

gccccgtatt cagtgtcgct gatttgtatt gtctgaagtt gtttttacgt taagttgatg     3420

cagatcaatt aatacgatac ctgcgtcata attgattatt tgacgtggtt tgatggcctc     3480

cacgcacgtt gtgatatgta gatgataatc attatcactt tacgggtcct ttccggtgaa     3540

aaaaaaggta ccaaaaaaaa catcgtcgtg agtagtgaac cgtaagcagc gacggctgag     3600

aagttccact caagcctctg acactgattg acacggttta gtagaacntt ttt            3653


<210>  73
<211>  3643
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 6

<400>  73
cttctcaatg tgtacgtgtc ctctagaggc ttgagtggaa cttctcagcc gtcgctgctt       60

acggttcact actcacgacg atgttttttt tggtaccttt tttttcaccg gaaaggaccc      120

gtaaagtgat aatgattatc atctacatat cacaacgtgc gtggaggcca tcaaaccacg      180

tcaaataatc aattatgacg caggtatcgt attaattgat ctgcatcaac ttaacgtaaa      240

aacaacttca gacaatacaa atcagcgaca ctgaatacgg ggcaacctca tgtcaacgaa      300

gaacagaacc cgcagaacaa caacccgcaa catccgcttt cctaaccaaa tgattgaaca      360

aattaacatc gctcttgagc aaaaagggtc cgggaatttc tcagcctggg tcattgaagc      420

ctgccgtcgg agactaacgt cagaaaagag agcatataca tcaattaaaa gtgatgaaga      480

atgaacatcc cgcgttcttc cctccgaaca ggacgatatt gtaaattcac ttaattacga      540

gggcattgca gtaattgagt tgcagtttta ccactttcct gacagtgaca gactgcgtgt      600

tggctctgtc acagactaaa tagtttgaat gattagcagt tatggtgatc agtcaaccac      660

cagggaataa tccttcatat tattatcgtg cttcaccaac gctgcctcaa ttgctctgaa      720

tgcttccaga gacaccttat gttctataca tgcaattaca acatcagggt aactcataga      780

aatggtgcta ttaagcatat tttttacacg aatcagatcc acggagggat catcagcaga      840

ttgttcttta ttcattttgt cgctccatgc gcttgctctt catctagcgg ttaaaatatt      900

acttcaaatc tttctgtatg aagatttgag cacgttggcc ttacatacat ctgtcggttg      960

tatttccctc cagaatgcca gcaggaccgc actttgttac gcaaccaata ctattaagtg     1020

aaaacattcc taatatttga cataaatcat caacaaaaca caaggaggtc agaccagatt     1080

gaaacgataa aaacgataat gcaaactacg cgccctcgta tcacatggaa ggttttacca     1140

atggctcagg ttgccatttt taaagaaata ttcgatcaag tgcgaaaaga tttagactgt     1200

gaattgtttt attctgaact aaaacgtcac aacgtctcac attatattta ctatctagcc     1260

acagataata ttcacatcgt gttagaaaac gataacaccg tgttaataaa aggacttaaa     1320

aaggttgtaa atgttaaatt ctcaagaaac acgcatctta tagaaacgtc ctatgatagg     1380

ttgaaatcaa gagaaatcac atttcagcaa tacagggaaa atcttgctaa agcaggagtt     1440

ttccgatggg ttacaaatat ccatgaacat aaaagatatt actatacctt tgataattca     1500

ttactattta ctgagagcat tcagaacact acacaaatct ttccacgcta aatcataacg     1560

tccggtttct tccgtgtcag caccggggcg ttggcataat gcaatacgtg tacgcgctaa     1620

accctgtgtg catcgtttta attattcccg gacactcccg cagagaagtt ccccgtcagg     1680

gctgtggaca tagttaatcc gggaatacaa tgacgattca tcgcacctga catacattaa     1740

taaatattaa caatatgaaa tttcaactca ttgtttaggg tttgtttaat tttctacaca     1800

tacgattctg cgaacttcaa aaagcatcgg gaataacacc atgaaaaaaa tgctactcgc     1860

tactgcgctg gccctgctta ttacaggatg tgctcaacag acgtttactg ttcaaaacaa     1920

accggcagca gtagcaccaa aggaaaccat cacccatcat ttcttcgttt ctggaattgg     1980

gcagaagaaa actgtcgatg cagccaaaat ttgtggcggc gcagaaaatg ttgttaaaac     2040

agaaacccag caaacattcg taaatggatt gctcggtttt attactttag gcatttatac     2100

tccgctggaa gcgcgtgtgt attgctcaca ataattgcat gagttgccca tcgatatggg     2160

caactctatc tgcactgctc attaatatac ttctgggttc cttccagttg tttttgcata     2220

gtgatcagcc tctctctgag ggtgaaataa tcccgttcag cggtgtctgc cagtcggggg     2280

gaggctgcat tatccacgcc ggaggcggtg gtggcttcac gcactgactg acagactgct     2340

ttgatgtgca accgacgacg accagcggca acatcatcac gcagagcatc attttcagct     2400

ttagcatcag ctaactcctt cgtgtatttt gcatcgagcg cagcaacatc acgctgacgc     2460

atctgcatgt cagtaattgc cgcgttcgcc agcttcagtt ctctggcatt tttgtcgcgc     2520

tgggctttgt aggtaatggc gttatcacgg taatgattaa cagcccatga caggcagacg     2580

atgatgcaga taaccagagc ggagataatc gcggtgactc tgctcataca tcaatctctc     2640

tgaccgttcc gcccgcttct ttgaattttg caatcaggct gtcagcctta tgctcgaact     2700

gaccataacc agcgcccggc agtgaagccc agatattgct gcaacggtcg attgcctgac     2760

ggatatcacc acgatcaatc ataggtaaag cgccacgctc cttaatctgc tgcaatgcca     2820

cagcgtcctg acttttcgga gagaagtctt tcaggccaag ctgcttgcgg taggcatccc     2880

accaacggga aagaagctgg tagcgtccgg cgcctgttga tttgagtttt gggtttagcg     2940

tgacaagttt gcgagggtga tcggagtaat cagtaaatag ctctccgcct acaatgacgt     3000

cataaccatg atttctggtt ttctgacgtc cgttatcagt tccctccgac cacgccagca     3060

tatcgaggaa cgccttacgt tgattattga tttctaccat cttctactcc ggctttttta     3120

gcagcgaagc gtttgataag cgaaccaatc gagtcagtac cgatgtagcc gataaacacg     3180

ctcgttatat aagcgagatt gctacttagt ccggcgaagt cgagaaggtc acgaatgaac     3240

taggcgataa tggcgcacat cgttgcgtcg attactgttt ttgtaaacgc accgccatta     3300

tatctgccgc gaaggtacgc cattgcaaac gcaaggattg ccccgatgcc ttgttccttt     3360

gccgcgagaa tggcggccaa caggtcatgt ttttctggca tcttcatgtc ttacccccaa     3420

taaggggatt tgctctattt aattaggaat aaggtcgatt actgatagaa caaatccagg     3480

ctactgtgtt tagtaatcag atttgttcgt gaccgatatg cacgggcaaa acggcaggag     3540

gttgttagcg caaaaaaaaa attccaaaaa aaaaattcca aaaaaaaaaa gcgactaaca     3600

aacacaatct gatggcagcg actaacaaac acaatctgat ggc                       3643


<210>  74
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 6, 7 and 8

<400>  74
gcaatatcag caccaacaga aacaaccttt gaggcgagcg gtcaa                       45


<210>  75
<211>  15
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 6, 7 and 8

<400>  75
ttgaccgctc gcctc                                                        15


<210>  76
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 7 and 8

<400>  76
tttttttttt                                                              10


<210>  77
<211>  59
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 7 and 8

<400>  77
ggttgtttct gttggtgctg atattgcact gagtgaccaa tcagctacgt ttttttttt        59


<210>  78
<211>  3636
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 7 and 8

<400>  78
ggttgtttct gttggtgctg atattgctgc catcagattg tgtttgttag tcgctttttt       60

tttttggaat tttttttttg gaattttttt tttgcgctaa caacctcctg ccgttttgcc      120

cgtgcatatc ggtcacgaac aaatctgatt actaaacaca gtagcctgga tttgttctat      180

cagtaatcga ccttattcct aattaaatag agcaaatccc cttattgggg gtaagacatg      240

aagatgccag aaaaacatga cctgttggcc gccattctcg cggcaaagga acaaggcatc      300

ggggcaatcc ttgcgtttgc aatggcgtac cttcgcggca gatataatgg cggtgcgttt      360

acaaaaacag taatcgacgc aacgatgtgc gccattatcg cctagttcat tcgtgacctt      420

ctcgacttcg ccggactaag tagcaatctc gcttatataa cgagcgtgtt tatcggctac      480

atcggtactg actcgattgg ttcgcttatc aaacgcttcg ctgctaaaaa agccggagta      540

gaagatggta gaaatcaata atcaacgtaa ggcgttcctc gatatgctgg cgtggtcgga      600

gggaactgat aacggacgtc agaaaaccag aaatcatggt tatgacgtca ttgtaggcgg      660

agagctattt actgattact ccgatcaccc tcgcaaactt gtcacgctaa acccaaaact      720

caaatcaaca ggcgccggac gctaccagct tctttcccgt tggtgggatg cctaccgcaa      780

gcagcttggc ctgaaagact tctctccgaa aagtcaggac gctgtggcat tgcagcagat      840

taaggagcgt ggcgctttac ctatgattga tcgtggtgat atccgtcagg caatcgaccg      900

ttgcagcaat atctgggctt cactgccggg cgctggttat ggtcagttcg agcataaggc      960

tgacagcctg attgcaaaat tcaaagaagc gggcggaacg gtcagagaga ttgatgtatg     1020

agcagagtca ccgcgattat ctccgctctg gttatctgca tcatcgtctg cctgtcatgg     1080

gctgttaatc attaccgtga taacgccatt acctacaaag cccagcgcga caaaaatgcc     1140

agagaactga agctggcgaa cgcggcaatt actgacatgc agatgcgtca gcgtgatgtt     1200

gctgcgctcg atgcaaaata cacgaaggag ttagctgatg ctaaagctga aaatgatgct     1260

ctgcgtgatg atgttgccgc tggtcgtcgt cggttgcaca tcaaagcagt ctgtcagtca     1320

gtgcgtgaag ccaccaccgc ctccggcgtg gataatgcag cctccccccg actggcagac     1380

accgctgaac gggattattt caccctcaga gagaggctga tcactatgca aaaacaactg     1440

gaaggaaccc agaagtatat taatgagcag tgcagataga gttgcccata tcgatgggca     1500

actcatgcaa ttattgtgag caatacacac gcgcttccag cggagtataa atgcctaaag     1560

taataaaacc gagcaatcca tttacgaatg tttgctgggt ttctgtttta acaacatttt     1620

ctgcgccgcc acaaattttg gctgcatcga cagttttctt ctgcccaatt ccagaaacga     1680

agaaatgatg ggtgatggtt tcctttggtg ctactgctgc cggtttgttt tgaacagtaa     1740

acgtctgttg agcacatcct gtaataagca gggccagcgc agtagcgagt agcatttttt     1800

tcatggtgtt attcccgatg ctttttgaag ttcgcagaat cgtatgtgta gaaaattaaa     1860

caaaccctaa acaatgagtt gaaatttcat attgttaata tttattaatg tatgtcaggt     1920

gcgatgaatc gtcattgtat tcccggatta actatgtcca cagccctgac ggggaacttc     1980

tctgcgggag tgtccgggaa taattaaaac gatgcacaca gggtttagcg cgtacacgta     2040

ttgcattatg ccaacgcccc ggtgctgaca cggaagaaac cggacgttat gatttagcgt     2100

ggaaagattt gtgtagtgtt ctgaatgctc tcagtaaata gtaatgaatt atcaaaggta     2160

tagtaatatc ttttatgttc atggatattt gtaacccatc ggaaaactcc tgctttagca     2220

agattttccc tgtattgctg aaatgtgatt tctcttgatt tcaacctatc ataggacgtt     2280

tctataagat gcgtgtttct tgagaattta acatttacaa cctttttaag tccttttatt     2340

aacacggtgt tatcgttttc taacacgatg tgaatattat ctgtggctag atagtaaata     2400

taatgtgaga cgttgtgacg ttttagttca gaataaaaca attcacagtc taaatctttt     2460

cgcacttgat cgaatatttc tttaaaaatg gcaacctgag ccattggtaa aaccttccat     2520

gtgatacgag ggcgcgtagt ttgcattatc gtttttatcg tttcaatctg gtctgacctc     2580

cttgtgtttt gttgatgatt tatgtcaaat attaggaatg ttttcactta atagtattgg     2640

ttgcgtaaca aagtgcggtc ctgctggcat tctggaggga aatacaaccg acagatgtat     2700

gtaaggccaa cgtgctcaaa tcttcataca gaaagatttg aagtaatatt ttaaccgcta     2760

gatgaagagc aagcgcatgg agcgacaaaa tgaataaaga acaatctgct gatgatccct     2820

ccgtggatct gattcgtgta aaaaatatgc ttaatagcac catttctatg agttaccctg     2880

atgttgtaat tgcatgtata gaacataagg tgtctctgga agcattcaga gcaattgagg     2940

cagcgttggt gaagcacgat aataatatga aggattattc cctggtggtt gactgatcac     3000

cataactgct aatcattcaa actatttagt ctgtgacaga gccaacacgc agtctgtcac     3060

tgtcaggaaa gtggtaaaac tgcaactcaa ttactgcaat gccctcgtaa ttaagtgaat     3120

ttacaatatc gtcctgttcg gagggaagaa cgcgggatgt tcattcttca tcacttttaa     3180

ttgatgtata tgctctcttt tctgacgtta gtctccgacg gcaggcttca atgacccagg     3240

ctgagaaatt cccggaccct ttttgctcaa gagcgatgtt aatttgttca atcatttggt     3300

taggaaagcg gatgttgcgg gttgttgttc tgcgggttct gttcttcgtt gacatgaggt     3360

tgccccgtat tcagtgtcgc tgatttgtat tgtctgaagt tgtttttacg ttaagttgat     3420

gcagatcaat taatacgata cctgcgtcat aattgattat ttgacgtggt ttgatggcct     3480

ccacgcacgt tgtgatatgt agatgataat cattatcact ttacgggtcc tttccggtga     3540

aaaaaaaggt accaaaaaaa acatcgtcgt gagtagtgaa ccgtaagccg tcctgtcgct     3600

gtgtctcgga cactgattga cacggtttag tagagc                               3636


<210>  79
<211>  52
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 7 and 8

<400>  79
tttttttttt tttttttttt ttttttttcg agacacagcg acaggacgtc ct               52


<210>  80
<211>  83
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 7 and 8

<400>  80
cgtagctgat tgaggtcact cagtgcaata tcagcaccaa cagaaacaac ctttgaggcg       60

agcggtcaag cgacgaggtg tcc                                               83


