                         SEQUENCE LISTING

<110>  Weinberger, Leor S.
 
<120>  COMPOSITIONS AND COMPOSITIONS AND METHODS OF USE THEREOF FOR 
       IDENTIFYING ANTI-VIRAL AGENTS

<130>  GLAD-406WO

<150>  US 61/764,854
<151>  2013-02-14

<160>  47    

<170>  PatentIn version 3.5

<210>  1
<211>  580
<212>  PRT
<213>  Human herpesvirus 5

<400>  1

Met Glu Ser Ser Ala Lys Arg Lys Met Asp Pro Asp Asn Pro Asp Glu 
1               5                   10                  15      


Gly Pro Ser Ser Lys Val Pro Arg Pro Glu Thr Pro Val Thr Lys Ala 
            20                  25                  30          


Thr Thr Phe Leu Gln Thr Met Leu Arg Lys Glu Val Asn Ser Gln Leu 
        35                  40                  45              


Ser Leu Gly Asp Pro Leu Phe Pro Glu Leu Ala Glu Glu Ser Leu Lys 
    50                  55                  60                  


Thr Phe Glu Gln Val Thr Glu Asp Cys Asn Glu Asn Pro Glu Lys Asp 
65                  70                  75                  80  


Val Leu Ala Glu Leu Gly Asp Ile Leu Ala Gln Ala Val Asn His Ala 
                85                  90                  95      


Gly Ile Asp Ser Ser Ser Thr Gly Pro Thr Leu Thr Thr His Ser Cys 
            100                 105                 110         


Ser Val Ser Ser Ala Pro Leu Asn Lys Pro Thr Pro Thr Ser Val Ala 
        115                 120                 125             


Val Thr Asn Thr Pro Leu Pro Gly Ala Ser Ala Thr Pro Glu Leu Ser 
    130                 135                 140                 


Pro Arg Lys Lys Pro Arg Lys Thr Thr Arg Pro Phe Lys Val Ile Ile 
145                 150                 155                 160 


Lys Pro Pro Val Pro Pro Ala Pro Ile Met Leu Pro Leu Ile Lys Gln 
                165                 170                 175     


Glu Asp Ile Lys Pro Glu Pro Asp Phe Thr Ile Gln Tyr Arg Asn Lys 
            180                 185                 190         


Ile Ile Asp Thr Ala Gly Cys Ile Val Ile Ser Asp Ser Glu Glu Glu 
        195                 200                 205             


Gln Gly Glu Glu Val Glu Thr Arg Gly Ala Thr Ala Ser Ser Pro Ser 
    210                 215                 220                 


Thr Gly Ser Gly Thr Pro Arg Val Thr Ser Pro Thr His Pro Leu Ser 
225                 230                 235                 240 


Gln Met Asn His Pro Pro Leu Pro Asp Pro Leu Gly Arg Pro Asp Glu 
                245                 250                 255     


Asp Ser Ser Ser Ser Ser Ser Ser Ser Cys Ser Ser Ala Ser Asp Ser 
            260                 265                 270         


Glu Ser Glu Ser Glu Glu Met Lys Cys Ser Ser Gly Gly Gly Ala Ser 
        275                 280                 285             


Val Thr Ser Ser His His Gly Arg Gly Gly Phe Gly Gly Ala Ala Ser 
    290                 295                 300                 


Ser Ser Leu Leu Ser Cys Gly His Gln Ser Ser Gly Gly Ala Ser Thr 
305                 310                 315                 320 


Gly Pro Arg Lys Lys Lys Ser Lys Arg Ile Ser Glu Leu Asp Asn Glu 
                325                 330                 335     


Lys Val Arg Asn Ile Met Lys Asp Lys Asn Thr Pro Phe Cys Thr Pro 
            340                 345                 350         


Asn Val Gln Thr Arg Arg Gly Arg Val Lys Ile Asp Glu Val Ser Arg 
        355                 360                 365             


Met Phe Arg Asn Thr Asn Arg Ser Leu Glu Tyr Lys Asn Leu Pro Phe 
    370                 375                 380                 


Thr Ile Pro Ser Met His Gln Val Leu Asp Glu Ala Ile Lys Ala Cys 
385                 390                 395                 400 


Lys Thr Met Gln Val Asn Asn Lys Gly Ile Gln Ile Ile Tyr Thr Arg 
                405                 410                 415     


Asn His Glu Val Lys Ser Glu Val Asp Ala Val Arg Cys Arg Leu Gly 
            420                 425                 430         


Thr Met Cys Asn Leu Ala Leu Ser Thr Pro Phe Leu Met Glu His Thr 
        435                 440                 445             


Met Pro Val Thr His Pro Pro Glu Val Ala Gln Arg Thr Ala Asp Ala 
    450                 455                 460                 


Cys Asn Glu Gly Val Lys Ala Ala Trp Ser Leu Lys Glu Leu His Thr 
465                 470                 475                 480 


His Gln Leu Cys Pro Arg Ser Ser Asp Tyr Arg Asn Met Ile Ile His 
                485                 490                 495     


Ala Ala Thr Pro Val Asp Leu Leu Gly Ala Leu Asn Leu Cys Leu Pro 
            500                 505                 510         


Leu Met Gln Lys Phe Pro Lys Gln Val Met Val Arg Ile Phe Ser Thr 
        515                 520                 525             


Asn Gln Gly Gly Phe Met Leu Pro Ile Tyr Glu Thr Ala Ala Lys Ala 
    530                 535                 540                 


Tyr Ala Val Gly Gln Phe Glu Gln Pro Thr Glu Thr Pro Pro Glu Asp 
545                 550                 555                 560 


Leu Asp Thr Leu Ser Leu Ala Ile Glu Ala Ala Ile Gln Asp Leu Arg 
                565                 570                 575     


Asn Lys Ser Gln 
            580 


<210>  2
<211>  502
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  2
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca       60

ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt      120

caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg      180

ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag      240

tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt      300

accatggtga tgcggttttg gcagtacatc aatgggcgtg gatagcggtt tgactcacgg      360

ggatttccaa gtctccaccc cattgacgtc aatgggagtt tgttttggca ccaaaatcaa      420

cgggactttc caaaatgtcg taacaactcc gccccattga cgcaaatggg cggtaggcgt      480

gtacggtggg aggtctatat aa                                               502


<210>  3
<211>  776
<212>  PRT
<213>  Human herpesvirus 1

<400>  3

Met Glu Pro Arg Pro Gly Ala Ser Thr Arg Arg Pro Glu Gly Arg Pro 
1               5                   10                  15      


Gln Arg Glu Pro Ala Pro Asp Val Trp Val Phe Pro Cys Asp Arg Asp 
            20                  25                  30          


Leu Pro Asp Ser Ser Asp Ser Glu Ala Glu Thr Glu Val Gly Gly Arg 
        35                  40                  45              


Gly Asp Ala Asp His His Asp Asp Asp Ser Ala Ser Glu Ala Asp Ser 
    50                  55                  60                  


Thr Asp Thr Glu Leu Phe Glu Thr Gly Leu Leu Gly Pro Gln Gly Val 
65                  70                  75                  80  


Asp Gly Gly Ala Val Ser Gly Gly Ser Pro Pro Arg Glu Glu Asp Pro 
                85                  90                  95      


Gly Ser Cys Gly Gly Ala Pro Pro Arg Glu Asp Gly Gly Ser Asp Glu 
            100                 105                 110         


Gly Asp Val Cys Ala Val Cys Thr Asp Glu Ile Ala Pro His Leu Arg 
        115                 120                 125             


Cys Asp Thr Phe Pro Cys Met His Arg Phe Cys Ile Pro Cys Met Lys 
    130                 135                 140                 


Thr Trp Met Gln Leu Arg Asn Thr Cys Pro Leu Cys Asn Ala Lys Leu 
145                 150                 155                 160 


Val Tyr Leu Ile Val Gly Val Thr Pro Ser Gly Ser Phe Ser Thr Ile 
                165                 170                 175     


Pro Ile Val Asn Asp Pro Gln Thr Arg Met Glu Ala Glu Glu Ala Val 
            180                 185                 190         


Arg Ala Gly Thr Ala Val Asp Phe Ile Trp Thr Gly Asn Gln Arg Phe 
        195                 200                 205             


Ala Pro Arg Tyr Leu Thr Leu Gly Gly His Thr Val Arg Ala Leu Ser 
    210                 215                 220                 


Pro Thr His Pro Glu Pro Thr Thr Asp Glu Asp Asp Asp Asp Leu Asp 
225                 230                 235                 240 


Asp Ala Asp Tyr Val Pro Pro Ala Pro Arg Arg Thr Pro Arg Ala Pro 
                245                 250                 255     


Pro Arg Arg Gly Ala Ala Ala Pro Pro Val Thr Gly Gly Ala Ser His 
            260                 265                 270         


Ala Ala Pro Gln Pro Ala Ala Ala Arg Thr Ala Pro Pro Ser Ala Pro 
        275                 280                 285             


Ile Gly Pro His Gly Ser Ser Asn Thr Asn Thr Thr Thr Asn Ser Ser 
    290                 295                 300                 


Gly Gly Gly Gly Ser Arg Gln Ser Arg Ala Ala Val Pro Arg Gly Ala 
305                 310                 315                 320 


Ser Gly Pro Ser Gly Gly Val Gly Val Val Glu Ala Glu Ala Gly Arg 
                325                 330                 335     


Pro Arg Gly Arg Thr Gly Pro Leu Val Asn Arg Pro Ala Pro Leu Ala 
            340                 345                 350         


Asn Asn Arg Asp Pro Ile Val Ile Ser Asp Ser Pro Pro Ala Ser Pro 
        355                 360                 365             


His Arg Pro Pro Ala Ala Pro Met Pro Gly Ser Ala Pro Arg Pro Gly 
    370                 375                 380                 


Pro Pro Ala Ser Ala Ala Ala Ser Gly Pro Ala Arg Pro Arg Ala Ala 
385                 390                 395                 400 


Val Ala Pro Cys Val Arg Ala Pro Pro Pro Gly Pro Gly Pro Arg Ala 
                405                 410                 415     


Pro Ala Pro Gly Ala Glu Pro Ala Ala Arg Pro Ala Asp Ala Arg Arg 
            420                 425                 430         


Val Pro Gln Ser His Ser Ser Leu Ala Gln Ala Ala Asn Gln Glu Gln 
        435                 440                 445             


Ser Leu Cys Arg Ala Arg Ala Thr Val Ala Arg Gly Ser Gly Gly Pro 
    450                 455                 460                 


Gly Val Glu Gly Gly His Gly Pro Ser Arg Gly Ala Ala Pro Ser Gly 
465                 470                 475                 480 


Ala Ala Pro Ser Gly Ala Pro Pro Leu Pro Ser Ala Ala Ser Val Glu 
                485                 490                 495     


Gln Glu Ala Ala Val Arg Pro Arg Lys Arg Arg Gly Ser Gly Gln Glu 
            500                 505                 510         


Asn Pro Ser Pro Gln Ser Thr Arg Pro Pro Leu Ala Pro Ala Gly Ala 
        515                 520                 525             


Lys Arg Ala Ala Thr His Pro Pro Ser Asp Ser Gly Pro Gly Gly Arg 
    530                 535                 540                 


Gly Gln Gly Gly Pro Gly Thr Pro Leu Thr Ser Ser Ala Ala Ser Ala 
545                 550                 555                 560 


Ser Ser Ser Ser Ala Ser Ser Ser Ser Ala Pro Thr Pro Ala Gly Ala 
                565                 570                 575     


Thr Ser Ser Ala Thr Gly Ala Ala Ser Ser Ser Ala Ser Ala Ser Ser 
            580                 585                 590         


Gly Gly Ala Val Gly Ala Leu Gly Gly Arg Gln Glu Glu Thr Ser Leu 
        595                 600                 605             


Gly Pro Arg Ala Ala Ser Gly Pro Arg Gly Pro Arg Lys Cys Ala Arg 
    610                 615                 620                 


Lys Thr Arg His Ala Glu Thr Ser Gly Ala Val Pro Ala Gly Gly Leu 
625                 630                 635                 640 


Thr Arg Tyr Leu Pro Ile Ser Gly Val Ser Ser Val Val Ala Leu Ser 
                645                 650                 655     


Pro Tyr Val Asn Lys Thr Ile Thr Gly Asp Cys Leu Pro Ile Leu Asp 
            660                 665                 670         


Met Glu Thr Gly Asn Ile Gly Ala Tyr Val Val Leu Val Asp Gln Thr 
        675                 680                 685             


Gly Asn Met Ala Thr Arg Leu Arg Ala Ala Val Pro Gly Trp Ser Arg 
    690                 695                 700                 


Arg Thr Leu Leu Pro Glu Thr Ala Gly Asn His Val Thr Pro Pro Glu 
705                 710                 715                 720 


Tyr Pro Thr Ala Pro Ala Ser Glu Trp Asn Ser Leu Trp Met Thr Pro 
                725                 730                 735     


Val Gly Asn Met Leu Phe Asp Gln Gly Thr Leu Val Gly Ala Leu Asp 
            740                 745                 750         


Phe Arg Ser Leu Arg Ser Arg His Pro Trp Ser Gly Glu Gln Gly Ala 
        755                 760                 765             


Ser Thr Arg Asp Glu Gly Lys Gln 
    770                 775     


<210>  4
<211>  499
<212>  DNA
<213>  HSV1 strain F alpha

<400>  4
cctggggttc cgggtatggt aatgagtttc ttcgggaagg cgggaagccc cggggcaccg       60

acgcaggcca agcccctgtt gcgtcggtgg gaggggcatg ctaatggggt tctttggggg      120

acaccgggtt ggtcccccaa atcgggggcc gggccgtgca tgctaatgat attctttggg      180

ggcgccgggt tggtccccgg ggacgggccc gccccgcggt gggcctgcct cccctgggac      240

gcgcggccat tgggggaatc gtcactgccg cccctttggg gaggggaaag gcgtggggta      300

taagttagcc ctggcccgac agtctggtcg catttgcacc tcggcactcg gagcgagacg      360

cagcagccag gcagactcgg gccgccccct ctccgcatca ccacagaagc cccgcctacg      420

ttgcgacccc cagggaccct ccgtccgcga ccctccagcc gcatacgacc cccatggagc      480

cccgccccgg agcgggtac                                                   499


<210>  5
<211>  245
<212>  PRT
<213>  Human herpesvirus 4

<400>  5

Met Met Asp Pro Asn Ser Thr Ser Glu Asp Val Lys Phe Thr Pro Asp 
1               5                   10                  15      


Pro Tyr Gln Val Pro Phe Val Gln Ala Phe Asp Gln Ala Thr Arg Val 
            20                  25                  30          


Tyr Gln Asp Leu Gly Gly Pro Ser Gln Ala Pro Leu Pro Cys Val Leu 
        35                  40                  45              


Trp Pro Val Leu Pro Glu Pro Leu Pro Gln Gly Gln Leu Thr Ala Tyr 
    50                  55                  60                  


His Val Ser Thr Ala Pro Thr Gly Ser Trp Phe Ser Ala Pro Gln Pro 
65                  70                  75                  80  


Ala Pro Glu Asn Ala Tyr Gln Ala Tyr Ala Ala Pro Gln Leu Phe Pro 
                85                  90                  95      


Val Ser Asp Ile Thr Gln Asn Gln Gln Thr Asn Gln Ala Gly Gly Glu 
            100                 105                 110         


Ala Pro Gln Pro Gly Asp Asn Ser Thr Val Gln Thr Ala Ala Ala Val 
        115                 120                 125             


Val Phe Ala Cys Pro Gly Ala Asn Gln Gly Gln Gln Leu Ala Asp Ile 
    130                 135                 140                 


Gly Val Pro Gln Pro Ala Pro Val Ala Ala Pro Ala Arg Arg Thr Arg 
145                 150                 155                 160 


Lys Pro Gln Gln Pro Glu Ser Leu Glu Glu Cys Asp Ser Glu Leu Glu 
                165                 170                 175     


Ile Lys Arg Tyr Lys Asn Arg Val Ala Ser Arg Lys Cys Arg Ala Lys 
            180                 185                 190         


Phe Lys Gln Leu Leu Gln His Tyr Arg Glu Val Ala Ala Ala Lys Ser 
        195                 200                 205             


Ser Glu Asn Asp Arg Leu Arg Leu Leu Leu Lys Gln Met Cys Pro Ser 
    210                 215                 220                 


Leu Asp Val Asp Ser Ile Ile Pro Arg Thr Pro Asp Val Leu His Glu 
225                 230                 235                 240 


Asp Leu Leu Asn Phe 
                245 


<210>  6
<211>  222
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  6
gcatatttca actgggctgt ctatttttga caccagctta ttttagacac ttctgaaaac       60

tgcctcctcc tcttttggaa actatgcatg agccacaggc attgctaatg tgcctcagag      120

acacacctaa atttagcacg tcccaaacca tgacatcaca gaggaggctg gtgccttggc      180

tttaaagggg agatgttaga caggtaactc actaaacatt gc                         222


<210>  7
<211>  691
<212>  PRT
<213>  Human herpesvirus 8

<400>  7

Met Ala Gln Asp Asp Lys Gly Lys Lys Leu Arg Arg Ser Cys Val Glu 
1               5                   10                  15      


Ser Phe Val Gly Leu Ser Asp Glu Leu Lys Ala Gln Leu Tyr Gln Cys 
            20                  25                  30          


Val Leu Leu Ile Asn Asp Ala Tyr Glu Thr Ile Tyr Asp Pro Ser Asp 
        35                  40                  45              


Leu Asn Arg Val Val Glu Asp Val Cys Ile Arg Ile Met Lys Glu Cys 
    50                  55                  60                  


Ser Lys Leu Gly Ala Leu Cys Gly Leu Phe Thr Asp Ile Asn Met Phe 
65                  70                  75                  80  


Asn Leu Phe Cys Phe Phe Arg Ala Ser Arg Met Arg Thr Lys Gly Ala 
                85                  90                  95      


Ala Gly Tyr Asn Val Pro Cys Ala Glu Ala Ser Gln Gly Ile Ile Arg 
            100                 105                 110         


Ile Leu Thr Glu Arg Ile Leu Phe Cys Thr Glu Lys Ala Phe Leu Thr 
        115                 120                 125             


Ala Ala Cys Ser Gly Val Ser Leu Pro Pro Ala Ile Cys Lys Leu Leu 
    130                 135                 140                 


His Glu Ile Tyr Thr Glu Met Lys Ala Lys Cys Leu Gly Ala Trp Arg 
145                 150                 155                 160 


Arg Leu Val Cys Asn Arg Arg Pro Ile Met Ile Leu Thr Ser Ser Leu 
                165                 170                 175     


Leu Lys Leu Tyr Asn Thr Tyr Asp Thr Ala Gly Leu Leu Ser Glu Gln 
            180                 185                 190         


Ser Arg Ala Leu Cys Leu Leu Val Phe Gln Pro Val Tyr Leu Pro Arg 
        195                 200                 205             


Ile Met Ala Pro Leu Glu Ile Met Thr Lys Gly Gln Leu Ala Pro Glu 
    210                 215                 220                 


Asn Phe Tyr Ser Ile Thr Gly Ser Ala Glu Lys Arg Arg Pro Ile Thr 
225                 230                 235                 240 


Thr Gly Lys Val Thr Gly Leu Ser Tyr Pro Gly Ser Gly Leu Met Pro 
                245                 250                 255     


Glu Ser Leu Ile Leu Pro Ile Leu Glu Pro Gly Leu Leu Pro Ala Ser 
            260                 265                 270         


Met Val Asp Leu Ser Asp Val Leu Ala Lys Pro Ala Val Ile Leu Ser 
        275                 280                 285             


Ala Pro Ala Leu Ser Gln Phe Val Ile Ser Lys Pro His Pro Asn Met 
    290                 295                 300                 


Pro His Thr Val Ser Ile Ile Pro Phe Asn Pro Ser Gly Thr Asp Pro 
305                 310                 315                 320 


Ala Phe Ile Ser Thr Trp Gln Ala Ala Ser Gln Asn Met Val Tyr Asn 
                325                 330                 335     


Thr Ser Thr Ala Pro Leu Lys Pro Ala Thr Gly Ser Ser Gln Thr Val 
            340                 345                 350         


Ser Val Lys Ala Val Ala Gln Gly Ala Val Ile Thr Ala Thr Thr Val 
        355                 360                 365             


Pro Gln Ala Met Pro Ala Arg Gly Thr Gly Gly Glu Leu Pro Val Met 
    370                 375                 380                 


Ser Ala Ser Thr Pro Ala Arg Asp Gln Val Ala Ala Cys Phe Val Ala 
385                 390                 395                 400 


Glu Asn Thr Gly Asp Ser Pro Asp Asn Pro Ser Ser Phe Leu Thr Ser 
                405                 410                 415     


Cys His Pro Cys Asp Pro Asn Thr Val Ile Val Ala Gln Gln Phe Gln 
            420                 425                 430         


Pro Pro Gln Cys Val Thr Leu Leu Gln Val Thr Cys Ala Pro Ser Ser 
        435                 440                 445             


Thr Pro Pro Pro Asp Ser Thr Val Arg Ala Pro Val Val Gln Leu Pro 
    450                 455                 460                 


Thr Val Val Pro Leu Pro Ala Ser Ala Phe Leu Pro Ala Leu Ala Gln 
465                 470                 475                 480 


Pro Glu Ala Ser Gly Glu Glu Leu Pro Gly Gly His Asp Gly Asp Gln 
                485                 490                 495     


Gly Val Pro Cys Arg Asp Ser Thr Ala Ala Ala Thr Ala Ala Glu Ala 
            500                 505                 510         


Thr Thr Pro Lys Arg Lys Gln Arg Ser Lys Glu Arg Ser Ser Lys Lys 
        515                 520                 525             


Arg Lys Ala Leu Thr Val Pro Glu Ala Asp Thr Thr Pro Ser Thr Thr 
    530                 535                 540                 


Thr Pro Gly Thr Ser Leu Gly Ser Ile Thr Thr Pro Gln Asp Val His 
545                 550                 555                 560 


Ala Thr Asp Val Ala Thr Ser Glu Gly Pro Ser Glu Ala Gln Pro Pro 
                565                 570                 575     


Leu Leu Ser Leu Pro Pro Pro Leu Asp Val Asp Gln Ser Leu Phe Ala 
            580                 585                 590         


Leu Leu Asp Glu Ala Gly Pro Glu Thr Trp Asp Val Gly Ser Pro Leu 
        595                 600                 605             


Ser Pro Thr Asp Asp Ala Leu Leu Ser Ser Ile Leu Gln Gly Leu Tyr 
    610                 615                 620                 


Gln Leu Asp Thr Pro Pro Pro Leu Arg Ser Pro Ser Pro Ala Ser Phe 
625                 630                 635                 640 


Gly Pro Glu Ser Pro Ala Asp Ile Pro Ser Pro Ser Gly Gly Glu Tyr 
                645                 650                 655     


Thr Gln Leu Gln Pro Val Arg Ala Thr Ser Ala Thr Pro Ala Asn Glu 
            660                 665                 670         


Val Gln Glu Ser Gly Thr Leu Tyr Gln Leu His Gln Trp Arg Asn Tyr 
        675                 680                 685             


Phe Arg Asp 
    690     


<210>  8
<211>  217
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  8
gttcagtcac atgtacgcta gggtctcccc acccaacccc cataggaccc agctacagct       60

tatcctccac taaataccag gcagctaccg gcgactcatt aagccccgcc cagaaaccag      120

tagctgggtg gcaatgacac gtccccttta aaaagtcaac cttactccgc aaggggtagt      180

ctgttgtgag aatactgtcc aggcagccac aaaaatg                               217


<210>  9
<211>  236
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  9

Met Val Ser Lys Gly Glu Glu Asp Asn Met Ala Ile Ile Lys Glu Phe 
1               5                   10                  15      


Met Arg Phe Lys Val His Met Glu Gly Ser Val Asn Gly His Glu Phe 
            20                  25                  30          


Glu Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr 
        35                  40                  45              


Ala Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp 
    50                  55                  60                  


Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala Tyr Val Lys His 
65                  70                  75                  80  


Pro Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe 
                85                  90                  95      


Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly Val Val Thr Val 
            100                 105                 110         


Thr Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys 
        115                 120                 125             


Leu Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys 
    130                 135                 140                 


Thr Met Gly Trp Glu Ala Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly 
145                 150                 155                 160 


Ala Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly 
                165                 170                 175     


His Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val 
            180                 185                 190         


Gln Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp Ile Thr Ser 
        195                 200                 205             


His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly 
    210                 215                 220                 


Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys 
225                 230                 235     


<210>  10
<211>  1298
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  10

Met Ala Ser Glu Asn Lys Gln Arg Pro Gly Ser Pro Gly Pro Thr Asp 
1               5                   10                  15      


Gly Pro Pro Pro Thr Pro Ser Pro Asp Arg Asp Glu Arg Gly Ala Leu 
            20                  25                  30          


Gly Trp Gly Ala Glu Thr Glu Glu Gly Gly Asp Asp Pro Asp His Asp 
        35                  40                  45              


Pro Asp His Pro His Asp Leu Asp Asp Ala Arg Arg Asp Gly Arg Ala 
    50                  55                  60                  


Pro Ala Ala Gly Thr Asp Ala Gly Glu Asp Ala Gly Asp Ala Val Ser 
65                  70                  75                  80  


Pro Arg Gln Leu Ala Leu Leu Ala Ser Met Val Glu Glu Ala Val Arg 
                85                  90                  95      


Thr Ile Pro Thr Pro Asp Pro Ala Ala Ser Pro Pro Arg Thr Pro Ala 
            100                 105                 110         


Phe Arg Ala Asp Asp Asp Asp Gly Asp Glu Tyr Asp Asp Ala Ala Asp 
        115                 120                 125             


Ala Ala Gly Asp Arg Ala Pro Ala Arg Gly Arg Glu Arg Glu Ala Pro 
    130                 135                 140                 


Leu Arg Gly Ala Tyr Pro Asp Pro Thr Asp Arg Leu Ser Pro Arg Pro 
145                 150                 155                 160 


Pro Ala Gln Pro Pro Arg Arg Arg Arg His Gly Arg Trp Arg Pro Ser 
                165                 170                 175     


Ala Ser Ser Thr Ser Ser Asp Ser Gly Ser Ser Ser Ser Ser Ser Ala 
            180                 185                 190         


Ser Ser Ser Ser Ser Ser Ser Asp Glu Asp Glu Asp Asp Asp Gly Asn 
        195                 200                 205             


Asp Ala Ala Asp His Ala Arg Glu Ala Arg Ala Val Gly Arg Gly Pro 
    210                 215                 220                 


Ser Ser Ala Ala Pro Ala Ala Pro Gly Arg Thr Pro Pro Pro Pro Gly 
225                 230                 235                 240 


Pro Pro Pro Leu Ser Glu Ala Ala Pro Lys Pro Arg Ala Ala Ala Arg 
                245                 250                 255     


Thr Pro Ala Ala Ser Ala Gly Arg Ile Glu Arg Arg Arg Ala Arg Ala 
            260                 265                 270         


Ala Val Ala Gly Arg Asp Ala Thr Gly Arg Phe Thr Ala Gly Gln Pro 
        275                 280                 285             


Arg Arg Val Glu Leu Asp Ala Asp Ala Thr Ser Gly Ala Phe Tyr Ala 
    290                 295                 300                 


Arg Tyr Arg Asp Gly Tyr Val Ser Gly Glu Pro Trp Pro Gly Ala Gly 
305                 310                 315                 320 


Pro Pro Pro Pro Gly Arg Val Leu Tyr Gly Gly Leu Gly Asp Ser Arg 
                325                 330                 335     


Pro Gly Leu Trp Gly Ala Pro Glu Ala Glu Glu Ala Arg Arg Arg Phe 
            340                 345                 350         


Glu Ala Ser Gly Ala Pro Ala Ala Val Trp Ala Pro Glu Leu Gly Asp 
        355                 360                 365             


Ala Ala Gln Gln Tyr Ala Leu Ile Thr Arg Leu Leu Tyr Thr Pro Asp 
    370                 375                 380                 


Ala Glu Ala Met Gly Trp Leu Gln Asn Pro Arg Val Val Pro Gly Asp 
385                 390                 395                 400 


Val Ala Leu Asp Gln Ala Cys Phe Arg Ile Ser Gly Ala Ala Arg Asn 
                405                 410                 415     


Ser Ser Ser Phe Ile Thr Gly Ser Val Ala Arg Ala Val Pro His Leu 
            420                 425                 430         


Gly Tyr Ala Met Ala Ala Gly Arg Phe Gly Trp Gly Leu Ala His Ala 
        435                 440                 445             


Ala Ala Ala Val Ala Met Ser Arg Arg Tyr Asp Arg Ala Gln Lys Gly 
    450                 455                 460                 


Phe Leu Leu Thr Ser Leu Arg Arg Ala Tyr Ala Pro Leu Leu Ala Arg 
465                 470                 475                 480 


Glu Asn Ala Ala Leu Thr Gly Ala Ala Gly Ser Pro Gly Ala Gly Ala 
                485                 490                 495     


Asp Asp Glu Gly Val Ala Ala Val Ala Ala Ala Ala Pro Gly Glu Arg 
            500                 505                 510         


Ala Val Pro Ala Gly Tyr Gly Ala Ala Gly Ile Leu Ala Ala Leu Gly 
        515                 520                 525             


Arg Leu Ser Ala Ala Pro Ala Ser Pro Ala Gly Gly Asp Asp Pro Asp 
    530                 535                 540                 


Ala Ala Arg His Ala Asp Ala Asp Asp Asp Ala Gly Arg Arg Ala Gln 
545                 550                 555                 560 


Ala Gly Arg Val Ala Val Glu Cys Leu Ala Ala Cys Arg Gly Ile Leu 
                565                 570                 575     


Glu Ala Leu Ala Glu Gly Phe Asp Gly Asp Leu Ala Ala Val Pro Gly 
            580                 585                 590         


Leu Ala Gly Ala Arg Pro Ala Ser Pro Pro Arg Pro Glu Gly Pro Ala 
        595                 600                 605             


Gly Pro Ala Ser Pro Pro Pro Pro His Ala Asp Ala Pro Arg Leu Arg 
    610                 615                 620                 


Ala Trp Leu Arg Glu Leu Arg Phe Val Arg Asp Ala Leu Val Leu Met 
625                 630                 635                 640 


Arg Leu Arg Gly Asp Leu Arg Val Ala Gly Gly Ser Glu Ala Ala Val 
                645                 650                 655     


Ala Ala Val Arg Ala Val Ser Leu Val Ala Gly Ala Leu Gly Pro Ala 
            660                 665                 670         


Leu Pro Arg Asp Pro Arg Leu Pro Ser Ser Ala Ala Ala Ala Ala Ala 
        675                 680                 685             


Asp Leu Leu Phe Asp Asn Gln Ser Leu Arg Pro Leu Leu Ala Ala Ala 
    690                 695                 700                 


Ala Ser Ala Pro Asp Ala Ala Asp Ala Leu Ala Ala Ala Ala Ala Ser 
705                 710                 715                 720 


Ala Ala Pro Arg Glu Gly Arg Lys Arg Lys Ser Pro Gly Pro Ala Arg 
                725                 730                 735     


Pro Pro Gly Gly Gly Gly Pro Arg Pro Pro Lys Thr Lys Lys Ser Gly 
            740                 745                 750         


Ala Asp Ala Pro Gly Ser Asp Ala Arg Ala Pro Leu Pro Ala Pro Ala 
        755                 760                 765             


Pro Pro Ser Thr Pro Pro Gly Pro Glu Pro Ala Pro Ala Gln Pro Ala 
    770                 775                 780                 


Ala Pro Arg Ala Ala Ala Ala Gln Ala Arg Pro Arg Pro Val Ala Val 
785                 790                 795                 800 


Ser Arg Arg Pro Ala Glu Gly Pro Asp Pro Leu Gly Gly Trp Arg Arg 
                805                 810                 815     


Gln Pro Pro Gly Pro Ser His Thr Ala Ala Pro Ala Ala Ala Ala Leu 
            820                 825                 830         


Glu Ala Tyr Cys Ser Pro Arg Ala Val Ala Glu Leu Thr Asp His Pro 
        835                 840                 845             


Leu Phe Pro Val Pro Trp Arg Pro Ala Leu Met Phe Asp Pro Arg Ala 
    850                 855                 860                 


Leu Ala Ser Ile Ala Ala Arg Cys Ala Gly Pro Ala Pro Ala Ala Gln 
865                 870                 875                 880 


Ala Ala Cys Gly Gly Gly Asp Asp Asp Asp Asn Pro His Pro His Gly 
                885                 890                 895     


Ala Ala Gly Gly Arg Leu Phe Gly Pro Leu Arg Ala Ser Gly Pro Leu 
            900                 905                 910         


Arg Arg Met Ala Ala Trp Met Arg Gln Ile Pro Asp Pro Glu Asp Val 
        915                 920                 925             


Arg Val Val Val Leu Tyr Ser Pro Leu Pro Gly Glu Asp Leu Ala Gly 
    930                 935                 940                 


Gly Gly Ala Ser Gly Gly Pro Pro Glu Trp Ser Ala Glu Arg Gly Gly 
945                 950                 955                 960 


Leu Ser Cys Leu Leu Ala Ala Leu Ala Asn Arg Leu Cys Gly Pro Asp 
                965                 970                 975     


Thr Ala Ala Trp Ala Gly Asn Trp Thr Gly Ala Pro Asp Val Ser Ala 
            980                 985                 990         


Leu Gly Ala Gln Gly Val Leu Leu  Leu Ser Thr Arg Asp  Leu Ala Phe 
        995                 1000                 1005             


Ala Gly  Ala Val Glu Phe Leu  Gly Leu Leu Ala Ser  Ala Gly Asp 
    1010                 1015                 1020             


Arg Arg  Leu Ile Val Val Asn  Thr Val Arg Ala Cys  Asp Trp Pro 
    1025                 1030                 1035             


Ala Asp  Gly Pro Ala Val Ser  Arg Gln His Ala Tyr  Leu Ala Cys 
    1040                 1045                 1050             


Glu Leu  Leu Pro Ala Val Gln  Cys Ala Val Arg Trp  Pro Ala Ala 
    1055                 1060                 1065             


Arg Asp  Leu Arg Arg Thr Val  Leu Ala Ser Gly Arg  Val Phe Gly 
    1070                 1075                 1080             


Pro Gly  Val Phe Ala Arg Val  Glu Ala Ala His Ala  Arg Leu Tyr 
    1085                 1090                 1095             


Pro Asp  Ala Pro Pro Leu Arg  Leu Cys Arg Gly Gly  Asn Val Arg 
    1100                 1105                 1110             


Tyr Arg  Val Arg Thr Arg Phe  Gly Pro Asp Thr Pro  Val Pro Met 
    1115                 1120                 1125             


Ser Pro  Arg Glu Tyr Arg Arg  Ala Val Leu Pro Ala  Leu Asp Gly 
    1130                 1135                 1140             


Arg Ala  Ala Ala Ser Gly Thr  Thr Asp Ala Met Ala  Pro Gly Ala 
    1145                 1150                 1155             


Pro Asp  Phe Cys Glu Glu Glu  Ala His Ser His Ala  Ala Cys Ala 
    1160                 1165                 1170             


Arg Trp  Gly Leu Gly Ala Pro  Leu Arg Pro Val Tyr  Val Ala Leu 
    1175                 1180                 1185             


Gly Arg  Glu Ala Val Arg Ala  Gly Pro Ala Arg Trp  Arg Gly Pro 
    1190                 1195                 1200             


Arg Arg  Asp Phe Cys Ala Arg  Ala Leu Leu Glu Pro  Asp Asp Asp 
    1205                 1210                 1215             


Ala Pro  Pro Leu Val Leu Arg  Gly Asp Asp Asp Gly  Pro Gly Ala 
    1220                 1225                 1230             


Leu Pro  Pro Ala Pro Pro Gly  Ile Arg Trp Ala Ser  Ala Thr Gly 
    1235                 1240                 1245             


Arg Ser  Gly Thr Val Leu Ala  Ala Ala Gly Ala Val  Glu Val Leu 
    1250                 1255                 1260             


Gly Ala  Glu Ala Gly Leu Ala  Thr Pro Pro Arg Arg  Glu Val Val 
    1265                 1270                 1275             


Asp Trp  Glu Gly Ala Trp Asp  Glu Asp Asp Gly Gly  Ala Phe Glu 
    1280                 1285                 1290             


Gly Asp  Gly Val Leu 
    1295             


<210>  11
<211>  467
<212>  PRT
<213>  Human herpesvirus 3

<400>  11

Met Asp Thr Ile Leu Ala Gly Gly Ser Gly Thr Ser Asp Ala Ser Asp 
1               5                   10                  15      


Asn Thr Cys Thr Ile Cys Met Ser Thr Val Ser Asp Leu Gly Lys Thr 
            20                  25                  30          


Met Pro Cys Leu His Asp Phe Cys Phe Val Cys Ile Arg Ala Trp Thr 
        35                  40                  45              


Ser Thr Ser Val Gln Cys Pro Leu Cys Arg Cys Pro Val Gln Ser Ile 
    50                  55                  60                  


Leu His Lys Ile Val Ser Asp Thr Ser Tyr Lys Glu Tyr Glu Val His 
65                  70                  75                  80  


Pro Ser Asp Asp Asp Gly Phe Ser Glu Pro Ser Phe Glu Asp Ser Ile 
                85                  90                  95      


Asp Ile Leu Pro Gly Asp Val Ile Asp Leu Leu Pro Pro Ser Pro Gly 
            100                 105                 110         


Pro Ser Arg Glu Ser Ile Gln Gln Pro Thr Ser Arg Ser Ser Arg Glu 
        115                 120                 125             


Pro Ile Gln Ser Pro Asn Pro Gly Pro Leu Gln Ser Ser Ala Arg Glu 
    130                 135                 140                 


Pro Thr Ala Glu Ser Pro Ser Asp Ser Gln Gln Asp Ser Ile Gln Pro 
145                 150                 155                 160 


Pro Thr Arg Asp Ser Ser Pro Gly Val Thr Lys Thr Cys Ser Thr Ala 
                165                 170                 175     


Ser Phe Leu Arg Lys Val Phe Phe Lys Asp Gln Pro Ala Val Arg Ser 
            180                 185                 190         


Ala Thr Pro Val Val Tyr Gly Ser Ile Glu Ser Ala Gln Gln Pro Arg 
        195                 200                 205             


Thr Gly Gly Gln Asp Tyr Arg Asp Arg Pro Val Ser Val Gly Ile Asn 
    210                 215                 220                 


Gln Asp Pro Arg Thr Met Asp Arg Leu Pro Phe Arg Ala Thr Asp Arg 
225                 230                 235                 240 


Gly Thr Glu Gly Asn Ala Arg Phe Pro Cys Tyr Met Gln Pro Leu Leu 
                245                 250                 255     


Gly Trp Leu Asp Asp Gln Leu Ala Glu Leu Tyr Gln Pro Glu Ile Val 
            260                 265                 270         


Glu Pro Thr Lys Met Leu Ile Leu Asn Tyr Ile Gly Ile Tyr Gly Arg 
        275                 280                 285             


Asp Glu Ala Gly Leu Lys Thr Ser Leu Arg Cys Leu Leu His Asp Ser 
    290                 295                 300                 


Thr Gly Pro Phe Val Thr Asn Met Leu Phe Leu Leu Asp Arg Cys Thr 
305                 310                 315                 320 


Asp Pro Thr Arg Leu Thr Met Gln Thr Trp Thr Trp Lys Asp Thr Ala 
                325                 330                 335     


Ile Gln Leu Ile Thr Gly Pro Ile Val Arg Pro Glu Thr Thr Ser Thr 
            340                 345                 350         


Gly Glu Thr Ser Arg Gly Asp Glu Arg Asp Thr Arg Leu Val Asn Thr 
        355                 360                 365             


Pro Gln Lys Val Arg Leu Phe Ser Val Leu Pro Gly Ile Lys Pro Gly 
    370                 375                 380                 


Ser Ala Arg Gly Ala Lys Arg Arg Leu Phe His Thr Gly Arg Asp Val 
385                 390                 395                 400 


Lys Arg Cys Leu Thr Ile Asp Leu Thr Ser Glu Ser Asp Ser Ala Cys 
                405                 410                 415     


Lys Gly Ser Lys Thr Arg Lys Val Ala Ser Pro Gln Gly Glu Ser Asn 
            420                 425                 430         


Thr Pro Ser Thr Ser Gly Ser Thr Ser Gly Ser Leu Lys His Leu Thr 
        435                 440                 445             


Lys Lys Ser Ser Ala Gly Lys Ala Gly Lys Gly Ile Pro Asn Lys Met 
    450                 455                 460                 


Lys Lys Ser 
465         


<210>  12
<211>  1310
<212>  PRT
<213>  Human herpesvirus 3


<220>
<221>  misc_feature
<222>  (99)..(99)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (512)..(512)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (1275)..(1275)
<223>  Xaa can be any naturally occurring amino acid

<400>  12

Met Asp Thr Pro Pro Met Gln Arg Ser Thr Pro Gln Arg Ala Gly Ser 
1               5                   10                  15      


Pro Asp Thr Leu Glu Leu Met Asp Leu Leu Asp Ala Ala Ala Ala Ala 
            20                  25                  30          


Ala Glu His Arg Ala Arg Val Val Thr Ser Ser Gln Pro Asp Asp Leu 
        35                  40                  45              


Leu Phe Gly Glu Asn Gly Val Met Val Gly Arg Glu His Glu Ile Val 
    50                  55                  60                  


Ser Ile Pro Ser Val Ser Gly Leu Gln Pro Glu Pro Arg Thr Glu Asp 
65                  70                  75                  80  


Val Gly Glu Glu Leu Thr Gln Asp Asp Tyr Val Cys Glu Asp Gly Gln 
                85                  90                  95      


Asp Leu Xaa Gly Ser Pro Val Ile Pro Leu Ala Glu Val Phe His Thr 
            100                 105                 110         


Arg Phe Ser Glu Ala Gly Ala Arg Glu Pro Thr Gly Ala Asp Arg Ser 
        115                 120                 125             


Leu Glu Thr Val Ser Leu Gly Thr Lys Leu Ala Arg Ser Pro Lys Pro 
    130                 135                 140                 


Pro Met Asn Asp Gly Glu Thr Gly Arg Gly Thr Thr Pro Pro Phe Pro 
145                 150                 155                 160 


Gln Ala Phe Ser Pro Val Ser Pro Ala Ser Pro Val Gly Asp Ala Ala 
                165                 170                 175     


Gly Asn Asp Gln Arg Glu Asp Gln Arg Ser Ile Pro Arg Gln Thr Thr 
            180                 185                 190         


Arg Gly Asn Ser Pro Gly Leu Pro Ser Val Val His Arg Asp Arg Gln 
        195                 200                 205             


Thr Gln Ser Ile Ser Gly Lys Lys Pro Gly Asp Glu Gln Ala Gly His 
    210                 215                 220                 


Ala His Ala Ser Gly Asp Gly Val Val Leu Gln Lys Thr Gln Arg Pro 
225                 230                 235                 240 


Ala Gln Gly Lys Ser Pro Lys Lys Lys Thr Leu Lys Val Lys Val Pro 
                245                 250                 255     


Leu Pro Ala Arg Lys Pro Gly Gly Pro Val Pro Gly Pro Val Glu Gln 
            260                 265                 270         


Leu Tyr His Val Leu Ser Asp Ser Val Pro Ala Lys Gly Ala Lys Ala 
        275                 280                 285             


Asp Leu Pro Phe Glu Thr Asp Asp Thr Arg Pro Arg Lys His Asp Ala 
    290                 295                 300                 


Arg Gly Ile Thr Pro Arg Val Pro Gly Arg Ser Ser Gly Gly Lys Pro 
305                 310                 315                 320 


Arg Ala Phe Leu Ala Leu Pro Gly Arg Ser His Ala Pro Asp Pro Ile 
                325                 330                 335     


Glu Asp Asp Ser Pro Val Glu Lys Lys Pro Lys Ser Arg Glu Phe Val 
            340                 345                 350         


Ser Ser Ser Ser Ser Ser Ser Ser Trp Gly Ser Ser Ser Glu Asp Glu 
        355                 360                 365             


Asp Asp Glu Pro Arg Arg Val Ser Val Gly Ser Glu Thr Thr Gly Ser 
    370                 375                 380                 


Arg Ser Gly Arg Glu His Ala Pro Ser Pro Ser Asn Ser Asp Asp Ser 
385                 390                 395                 400 


Asp Ser Asn Asp Gly Gly Ser Thr Lys Gln Asn Ile Gln Pro Gly Tyr 
                405                 410                 415     


Arg Ser Ile Ser Gly Pro Asp Pro Arg Ile Arg Lys Thr Lys Arg Leu 
            420                 425                 430         


Ala Gly Glu Pro Gly Arg Gln Arg Gln Lys Ser Phe Ser Leu Pro Arg 
        435                 440                 445             


Ser Arg Thr Pro Ile Ile Pro Pro Val Ser Gly Pro Leu Met Met Pro 
    450                 455                 460                 


Asp Gly Ser Pro Trp Pro Gly Ser Ala Pro Leu Pro Ser Asn Arg Val 
465                 470                 475                 480 


Arg Phe Gly Pro Ser Gly Glu Thr Arg Glu Gly His Trp Glu Asp Glu 
                485                 490                 495     


Ala Val Arg Ala Ala Arg Ala Arg Tyr Glu Ala Ser Thr Glu Pro Xaa 
            500                 505                 510         


Pro Leu Tyr Val Pro Glu Leu Gly Asp Pro Ala Arg Gln Tyr Arg Ala 
        515                 520                 525             


Leu Ile Asn Leu Ile Tyr Cys Pro Asp Arg Asp Pro Ile Ala Trp Leu 
    530                 535                 540                 


Gln Asn Pro Lys Leu Thr Gly Val Asn Ser Ala Leu Asn Gln Phe Tyr 
545                 550                 555                 560 


Gln Lys Leu Leu Pro Pro Gly Arg Ala Gly Thr Ala Val Thr Gly Ser 
                565                 570                 575     


Val Ala Ser Pro Val Pro His Val Gly Glu Ala Met Ala Thr Gly Glu 
            580                 585                 590         


Ala Leu Trp Ala Leu Pro His Ala Ala Ala Ala Val Ala Met Ser Arg 
        595                 600                 605             


Arg Tyr Asp Arg Ala Gln Lys His Phe Ile Leu Gln Ser Leu Arg Arg 
    610                 615                 620                 


Ala Phe Ala Gly Met Ala Tyr Pro Glu Ala Thr Gly Ser Ser Pro Ala 
625                 630                 635                 640 


Ala Arg Ile Ser Arg Gly His Pro Ser Pro Thr Thr Pro Ala Thr Gln 
                645                 650                 655     


Thr Pro Asp Pro Gln Pro Ser Ala Ala Ala Arg Ser Leu Ser Val Cys 
            660                 665                 670         


Pro Pro Asp Asp Arg Leu Arg Thr Pro Arg Lys Arg Lys Ser Gln Pro 
        675                 680                 685             


Val Glu Ser Arg Ser Leu Leu Asp Lys Ile Arg Glu Thr Pro Val Ala 
    690                 695                 700                 


Asp Ala Arg Val Ala Asp Asp His Val Val Ser Lys Ala Lys Arg Arg 
705                 710                 715                 720 


Val Ser Glu Pro Val Thr Ile Thr Ser Gly Pro Val Val Asp Pro Pro 
                725                 730                 735     


Ala Val Ile Thr Met Pro Leu Asp Gly Pro Ala Pro Asn Gly Gly Phe 
            740                 745                 750         


Arg Arg Ile Pro Arg Gly Ala Leu His Thr Pro Val Pro Ser Asp Gln 
        755                 760                 765             


Ala Arg Lys Ala Tyr Cys Thr Pro Glu Thr Ile Ala Arg Leu Val Asp 
    770                 775                 780                 


Asp Pro Leu Phe Pro Thr Ala Trp Arg Pro Ala Leu Ser Phe Asp Pro 
785                 790                 795                 800 


Gly Ala Leu Ala Glu Ile Ala Ala Arg Arg Pro Gly Gly Gly Asp Arg 
                805                 810                 815     


Arg Phe Gly Pro Pro Ser Gly Val Glu Ala Leu Arg Arg Arg Cys Ala 
            820                 825                 830         


Trp Met Arg Gln Ile Pro Asp Pro Glu Asp Val Arg Leu Leu Ile Ile 
        835                 840                 845             


Tyr Asp Pro Leu Pro Gly Glu Asp Ile Asn Gly Pro Leu Glu Ser Thr 
    850                 855                 860                 


Leu Ala Thr Asp Pro Gly Pro Ser Trp Ser Pro Ser Arg Gly Gly Leu 
865                 870                 875                 880 


Ser Val Val Leu Ala Ala Leu Ser Asn Arg Leu Cys Leu Pro Ser Thr 
                885                 890                 895     


His Ala Trp Ala Gly Asn Trp Thr Gly Pro Pro Asp Val Ser Ala Leu 
            900                 905                 910         


Asn Ala Arg Gly Val Leu Leu Leu Ser Thr Arg Asp Leu Ala Phe Ala 
        915                 920                 925             


Gly Ala Val Glu Tyr Leu Gly Ser Arg Leu Ala Ser Ala Arg Arg Arg 
    930                 935                 940                 


Leu Leu Val Leu Asp Ala Val Ala Leu Glu Arg Trp Pro Gly Asp Gly 
945                 950                 955                 960 


Pro Ala Leu Ser Gln Tyr His Val Tyr Val Arg Ala Pro Ala Arg Pro 
                965                 970                 975     


Asp Ala Gln Ala Val Val Arg Trp Pro Asp Ser Ala Val Thr Glu Gly 
            980                 985                 990         


Leu Ala Arg Ala Val Phe Ala Ser  Ser Arg Thr Phe Gly  Pro Ala Ser 
        995                 1000                 1005             


Phe Ala  Arg Ile Glu Thr Ala  Phe Ala Asn Leu Tyr  Pro Gly Glu 
    1010                 1015                 1020             


Gln Pro  Leu Cys Leu Cys Arg  Gly Gly Asn Val Ala  Tyr Thr Val 
    1025                 1030                 1035             


Cys Thr  Arg Ala Gly Pro Lys  Thr Arg Val Pro Leu  Ser Pro Arg 
    1040                 1045                 1050             


Glu Tyr  Arg Gln Tyr Val Leu  Pro Gly Phe Asp Gly  Cys Lys Asp 
    1055                 1060                 1065             


Leu Ala  Arg Gln Ser Arg Gly  Leu Gly Leu Gly Ala  Ala Asp Phe 
    1070                 1075                 1080             


Val Asp  Glu Ala Ala His Ser  His Arg Ala Ala Asn  Arg Trp Gly 
    1085                 1090                 1095             


Leu Gly  Ala Ala Leu Arg Pro  Val Phe Leu Pro Glu  Gly Arg Arg 
    1100                 1105                 1110             


Pro Gly  Ala Ala Gly Pro Glu  Ala Gly Asp Val Pro  Thr Trp Ala 
    1115                 1120                 1125             


Arg Val  Phe Cys Arg His Ala  Leu Leu Glu Pro Asp  Pro Ala Ala 
    1130                 1135                 1140             


Glu Pro  Leu Val Leu Pro Pro  Val Ala Gly Arg Ser  Val Ala Leu 
    1145                 1150                 1155             


Tyr Ala  Ser Ala Asp Glu Ala  Arg Asn Ala Leu Pro  Pro Ile Pro 
    1160                 1165                 1170             


Arg Val  Met Trp Pro Pro Gly  Phe Gly Ala Ala Glu  Thr Val Leu 
    1175                 1180                 1185             


Glu Gly  Ser Asp Gly Thr Arg  Phe Ala Phe Gly His  His Gly Gly 
    1190                 1195                 1200             


Ser Glu  Arg Pro Ala Glu Thr  Gln Ala Gly Arg Gln  Arg Arg Thr 
    1205                 1210                 1215             


Ala Asp  Asp Arg Glu His Ala  Leu Glu Pro Asp Asp  Trp Glu Val 
    1220                 1225                 1230             


Gly Cys  Glu Asp Ala Trp Asp  Ser Glu Glu Gly Gly  Gly Asp Asp 
    1235                 1240                 1245             


Gly Asp  Ala Pro Gly Ser Ser  Phe Gly Val Ser Val  Val Ser Val 
    1250                 1255                 1260             


Ala Pro  Gly Val Leu Arg Asp  Arg Arg Val Gly Xaa  Arg Pro Ala 
    1265                 1270                 1275             


Val Lys  Val Glu Leu Leu Ser  Ser Ser Ser Ser Ser  Glu Asp Glu 
    1280                 1285                 1290             


Asp Asp  Val Trp Gly Gly Arg  Gly Gly Arg Ser Pro  Pro Gln Ser 
    1295                 1300                 1305             


Arg Gly  
    1310 


<210>  13
<211>  180
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  13
acattttata cccacgtttt agtgggtggg acttaaaaga aatgggtgga gggatatagg       60

ggtgtgtctt cgttggtacc aattataaaa atgtactcgc cacaactcac aatttagaac      120

gcatggcagt tctgctacgt gtttggatgc ccggacatta gaatacagcc agttgttacc      180


<210>  14
<211>  11771
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  14
ctagcatgga gtcctctgcc aagagaaaga tggaccctga taatcctgac gagggccctt       60

cctccaaggt gccacggccc gagacacccg tgaccaaggc cacgacgttc ctgcagacta      120

tgttgaggaa ggaggttaac agtcagctga gtctgggaga cccgctgttt ccagagttgg      180

ccgaagaatc cctcaaaact tttgaacaag tgaccgagga ttgcaacgag aaccccgaga      240

aagatgtcct ggcagaactc ggtgacatcc tcgcccaggc tgtcaatcat gccggtatcg      300

attccagtag caccggcccc acgctgacaa cccactcttg cagcgttagc agcgcccctc      360

ttaacaagcc gacccccacc agcgtcgcgg ttactaacac tcctctcccc ggggcatccg      420

ctactcccga gctcagcccg cgtaagaaac cgcgcaaaac cacgcgtcct ttcaaggtga      480

ttattaaacc gcccgtgcct cccgcgccta tcatgctgcc cctcatcaaa caggaagaca      540

tcaagcccga gcccgacttt accatccagt accgcaacaa gattatcgat accgccggct      600

gtatcgtgat ctctgatagc gaggaagaac agggtgaaga agtcgaaacc cgcggtgcta      660

ccgcgtcttc cccttccacc ggcagcggca cgccgcgagt gacctctccc acgcacccgc      720

tctcccagat gaaccaccct cctcttcccg atcccttggg ccggcccgat gaagatagtt      780

cctcttcgtc ttcctcctcc tgcagttcgg cttcggactc ggagagtgag tccgaggaga      840

tgaaatgcag cagtggcgga ggagcatccg tgacctcgag ccaccatggg cgcggcggtt      900

ttggtggcgc ggcctcctcc tctctgctga gctgcggcca tcagagcagc ggcggggcga      960

gcaccggacc ccgcaagaag aagagcaaac gcatctccga gttggacaac gagaaggtgc     1020

gcaatatcat gaaagataag aacaccccct tctgcacacc caacgtgcag actcggcggg     1080

gtcgcgtcaa gattgacgag gtgagccgca tgttccgcaa caccaatcgc tctcttgagt     1140

acaagaacct gcccttcacg attcccagta tgcaccaggt gttagatgag gccatcaaag     1200

cctgcaaaac catgcaggtg aacaacaagg gcatccagat tatctacacc cgcaatcatg     1260

aggtgaagag tgaggtggat gcggtgcggt gtcgcctggg caccatgtgc aacctggccc     1320

tctccactcc cttcctcatg gagcacacca tgcccgtgac acatccaccc gaagtggcgc     1380

agcgcacagc cgatgcttgt aacgaaggcg tcaaggccgc gtggagcctc aaagaattgc     1440

acacccacca attatgcccc cgttcctccg attaccgcaa catgatcatc cacgctgcca     1500

cccccgtgga cctgttgggc gctctcaacc tgtgcctgcc cctgatgcaa aagtttccca     1560

aacaggtcat ggtgcgcatc ttctccacca accagggtgg gttcatgctg cctatctacg     1620

agacggccgc gaaggcctac gccgtggggc agtttgagca gcccaccgag acccctcccg     1680

aagacctgga caccctgagc ctggccatcg aggcagccat ccaggacctg aggaacaagt     1740

ctcagtaagg atccgcccct ctccctcccc cccccctaac gttactggcc gaagccgctt     1800

ggaataaggc cggtgtgcgt ttgtctatat gttattttcc accatattgc cgtcttttgg     1860

caatgtgagg gcccggaaac ctggccctgt cttcttgacg agcattccta ggggtctttc     1920

ccctctcgcc aaaggaatgc aaggtctgtt gaatgtcgtg aaggaagcag ttcctctgga     1980

agcttcttga agacaaacaa cgtctgtagc gaccctttgc aggcagcgga accccccacc     2040

tggcgacagg tgcctctgcg gccaaaagcc acgtgtataa gatacacctg caaaggcggc     2100

acaaccccag tgccacgttg tgagttggat agttgtggaa agagtcaaat ggctctcctc     2160

aagcgtattc aacaaggggc tgaaggatgc ccagaaggta ccccattgta tgggatctga     2220

tctggggcct cggtgcacat gctttacatg tgtttagtcg aggttaaaaa aacgtctagg     2280

ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgatgataat atggccacaa     2340

ccatggcctc ctccgaggac gtcatcaagg agttcatgcg cttcaaggtg cgcatggagg     2400

gctccgtgaa cggccacgag ttcgagatcg agggcgaggg cgagggccgc ccctacgagg     2460

gcacccagac cgccaagctg aaggtgacca agggcggccc cctgcccttc gcctgggaca     2520

tcctgtcccc ccagttccag tacggctcca aggtgtacgt gaagcacccc gccgacatcc     2580

ccgactacaa gaagctgtcc ttccccgagg gcttcaagtg ggagcgcgtg atgaacttcg     2640

aggacggcgg cgtggtgacc gtgacccagg actcctccct gcaggacggc tccttcatct     2700

acaaggtgaa gttcatcggc gtgaacttcc cctccgacgg ccccgtaatg cagaagaaga     2760

ctatgggctg ggaggcctcc accgagcgcc tgtacccccg cgacggcgtg ctgaagggcg     2820

agatccacaa ggccctgaag ctgaaggacg gcggccacta cctggtggag ttcaagtcca     2880

tctacatggc caagaagccc gtgcagctgc ccggctacta ctacgtggac tccaagctgg     2940

acatcacctc ccacaacgag gactacacca tcgtggagca gtacgagcgc gccgagggcc     3000

gccaccacct gttcctgtag gcggccgcaa tcaacctctg gattacaaaa tttgtgaaag     3060

attgactggt attcttaact atgttgctcc ttttacgcta tgtggatacg ctgctttaat     3120

gcctttgtat catgctatta cttcccgtac ggctttcatt ttctcctcct tgtataaatc     3180

ctggttgctg tctctttatg aggagttgtg gcccgttgtc aggcaacgtg gcgtggtgtg     3240

cactgtgttt gctgacgcaa cccccactgg ttggggcatt gccaccacct atcaactcct     3300

ttccgggact ttcgctttcc ccctccctat tgccacggcg gaactcattg ccgcctgcct     3360

tgcccgctgc tggacagggg ctcggctgtt gggcactgac aattccgtgg tgttgtcggg     3420

gaagctgacg tcctttccat ggctgctcgc ctgtgttgcc aactggattc tgcgcgggac     3480

gtccttctgc tacgtccctt cggccctcaa tccagcggac cttccttccc gcggcctgct     3540

gccggttctg cggcctcttc cgcgtcttcg ccttcgccct cagacgagtc ggatctccct     3600

ttgggccgcc tccccgcctg cctgcaggtt tgtcgagacc tagaaaaaca tggagcaatc     3660

acaagtagca atacagcagc taccaatgct gattgtgcct ggctagaagc acaagaggag     3720

gaggaggtgg gttttccagt cacacctcag gtacctttaa gaccaatgac ttacaaggca     3780

gctgtagatc ttagccactt tttaaaagaa aaggggggac tggaagggct aattcactcc     3840

caacgaagac aagatctgct ttttgcttgt actgggtctc tctggttaga ccagatctga     3900

gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct     3960

tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc     4020

agaccctttt agtcagtgtg gaaaatctct agcagggccc gtttaaaccc gctgatcagc     4080

ctcgactgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt     4140

gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca     4200

ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga     4260

ggattgggaa gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc     4320

ggaaagaacc agctggggct ctagggggta tccccacgcg ccctgtagcg gcgcattaag     4380

cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc     4440

cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc cccgtcaagc     4500

tctaaatcgg ggcatccctt tagggttccg atttagtgct ttacggcacc tcgaccccaa     4560

aaaacttgat tagggtgatg gttcacgtag tgggccatcg ccctgataga cggtttttcg     4620

ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa ctggaacaac     4680

actcaaccct atctcggtct attcttttga tttataaggg attttgggga tttcggccta     4740

ttggttaaaa aatgagctga tttaacaaaa atttaacgcg aattaattct gtggaatgtg     4800

tgtcagttag ggtgtggaaa gtccccaggc tccccaggca ggcagaagta tgcaaagcat     4860

gcatctcaat tagtcagcaa ccaggtgtgg aaagtcccca ggctccccag caggcagaag     4920

tatgcaaagc atgcatctca attagtcagc aaccatagtc ccgcccctaa ctccgcccat     4980

cccgccccta actccgccca gttccgccca ttctccgccc catggctgac taattttttt     5040

tatttatgca gaggccgagg ccgcctctgc ctctgagcta ttccagaagt agtgaggagg     5100

cttttttgga ggcctaggct tttgcaaaaa gctcccggga gcttgtatat ccattttcgg     5160

atctgatcag cacgtgttga caattaatca tcggcatagt atatcggcat agtataatac     5220

gacaaggtga ggaactaaac catggccaag ttgaccagtg ccgttccggt gctcaccgcg     5280

cgcgacgtcg ccggagcggt cgagttctgg accgaccggc tcgggttctc ccgggacttc     5340

gtggaggacg acttcgccgg tgtggtccgg gacgacgtga ccctgttcat cagcgcggtc     5400

caggaccagg tggtgccgga caacaccctg gcctgggtgt gggtgcgcgg cctggacgag     5460

ctgtacgccg agtggtcgga ggtcgtgtcc acgaacttcc gggacgcctc cgggccggcc     5520

atgaccgaga tcggcgagca gccgtggggg cgggagttcg ccctgcgcga cccggccggc     5580

aactgcgtgc acttcgtggc cgaggagcag gactgacacg tgctacgaga tttcgattcc     5640

accgccgcct tctatgaaag gttgggcttc ggaatcgttt tccgggacgc cggctggatg     5700

atcctccagc gcggggatct catgctggag ttcttcgccc accccaactt gtttattgca     5760

gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa agcatttttt     5820

tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca tgtctgtata     5880

ccgtcgacct ctagctagag cttggcgtaa tcatggtcat agctgtttcc tgtgtgaaat     5940

tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctgg     6000

ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag     6060

tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt     6120

ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg     6180

ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg     6240

gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag     6300

gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga     6360

cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct     6420

ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc     6480

tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta tctcagttcg     6540

gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc     6600

tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca     6660

ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag     6720

ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct     6780

ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc     6840

accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga     6900

tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca     6960

cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat     7020

taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac     7080

caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt     7140

gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt     7200

gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag     7260

ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct     7320

attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt     7380

gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc     7440

tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt     7500

agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg     7560

gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg     7620

actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct     7680

tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc     7740

attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt     7800

tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt     7860

tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg     7920

aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta tcagggttat     7980

tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg     8040

cgcacatttc cccgaaaagt gccacctgac gtcgacggat cgggagatct cccgatcccc     8100

tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag tatctgctcc     8160

ctgcttgtgt gttggaggtc gctgagtagt gcgcgagcaa aatttaagct acaacaaggc     8220

aaggcttgac cgacaattgc atgaagaatc tgcttagggt taggcgtttt gcgctgcttc     8280

gcgatgtacg ggccagatat acgcgttgac attgattatt gactagttat taatagtaat     8340

caattacggg gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg     8400

taaatggccc gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt     8460

atgttcccat agtaacgcca atagggactt tccattgacg tcaatgggtg gactatttac     8520

ggtaaactgc ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg     8580

acgtcaatga cggtaaatgg cccgcctggc attatgccca gtacatgacc ttatgggact     8640

ttcctacttg gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt     8700

ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc     8760

ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc     8820

gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata     8880

taagcagagc tctctggcta actagagaac ccactgctta ctggcttatc gaaattaata     8940

cgactcacta tagggagacc caagctggtt taaacttaag cttggtaccg agctcactag     9000

tccagtgtgg tggcagatat ccagcacagt ggcggccgct cgagtctaga gggcccgttt     9060

tgcctgtact gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact     9120

agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc     9180

ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa     9240

aatctctagc agtggcgccc gaacagggac ttgaaagcga aagggaaacc agaggagctc     9300

tctcgacgca ggactcggct tgctgaagcg cgcacggcaa gaggcgaggg gcggcgactg     9360

gtgagtacgc caaaaatttt gactagcgga ggctagaagg agagagatgg gtgcgagagc     9420

gtcagtatta agcgggggag aattagatcg cgatgggaaa aaattcggtt aaggccaggg     9480

ggaaagaaaa aatataaatt aaaacatata gtatgggcaa gcagggagct agaacgattc     9540

gcagttaatc ctggcctgtt agaaacatca gaaggctgta gacaaatact gggacagcta     9600

caaccatccc ttcagacagg atcagaagaa cttagatcat tatataatac agtagcaacc     9660

ctctattgtg tgcatcaaag gatagagata aaagacacca aggaagcttt agacaagata     9720

gaggaagagc aaaacaaaag taagaccacc gcacagcaag cggccgctga tcttcagacc     9780

tggaggagga gatatgaggg acaattggag aagtgaatta tataaatata aagtagtaaa     9840

aattgaacca ttaggagtag cacccaccaa ggcaaagaga agagtggtgc agagagaaaa     9900

aagagcagtg ggaataggag ctttgttcct tgggttcttg ggagcagcag gaagcactat     9960

gggcgcagcg tcaatgacgc tgacggtaca ggccagacaa ttattgtctg gtatagtgca    10020

gcagcagaac aatttgctga gggctattga ggcgcaacag catctgttgc aactcacagt    10080

ctggggcatc aagcagctcc aggcaagaat cctggctgtg gaaagatacc taaaggatca    10140

acagctcctg gggatttggg gttgctctgg aaaactcatt tgcaccactg ctgtgccttg    10200

gaatgctagt tggagtaata aatctctgga acagatttgg aatcacacga cctggatgga    10260

gtgggacaga gaaattaaca attacacaag cttaatacac tccttaattg aagaatcgca    10320

aaaccagcaa gaaaagaatg aacaagaatt attggaatta gataaatggg caagtttgtg    10380

gaattggttt aacataacaa attggctgtg gtatataaaa ttattcataa tgatagtagg    10440

aggcttggta ggtttaagaa tagtttttgc tgtactttct atagtgaata gagttaggca    10500

gggatattca ccattatcgt ttcagaccca cctcccaacc ccgaggggac ccgacaggcc    10560

cttaattaat tggctccggt gcccgtcagt gggcagagcg cacatcgccc acagtccccg    10620

agaagttggg gggaggggtc ggcaattgaa ccggtgccta gagaaggtgg cgcggggtaa    10680

actgggaaag tgatgtcgtg tactggctcc gcctttttcc cgagggtggg ggagaaccgt    10740

atataagtgc agtagtcgcc gtgaacgttc tttttcgcaa cgggtttgcc gccagaacac    10800

aggtaagtgc cgtgtgtggt tcccgcgggc ctggcctctt tacgggttat ggcccttgcg    10860

tgccttgaat tacttccacc tggctgcagt acgtgattct tgatcccgag cttcgggttg    10920

gaagtgggtg ggagagttcg aggccttgcg cttaaggagc cccttcgcct cgtgcttgag    10980

ttgaggcctg gcctgggcgc tggggccgcc gcgtgcgaat ctggtggcac cttcgcgcct    11040

gtctcgctgc tttcgataag tctctagcca tttaaaattt ttgatgacct gctgcgacgc    11100

tttttttctg gcaagatagt cttgtaaatg cgggccaaga tctgcacact ggtatttcgg    11160

tttttggggc cgcgggcggc gacggggccc gtgcgtccca gcgcacatgt tcggcgaggc    11220

ggggcctgcg agcgcggcca ccgagaatcg gacgggggta gtctcaagct ggccggcctg    11280

ctctggtgcc tggcctcgcg ccgccgtgta tcgccccgcc ctgggcggca aggctggccc    11340

ggtcggcacc agttgcgtga gcggaaagat ggccgcttcc cggccctgct gcagggagct    11400

caaaatggag gacgcggcgc tcgggagagc gggcgggtga gtcacccaca caaaggaaaa    11460

gggcctttcc gtcctcagcc gtcgcttcat gtgactccac ggagtaccgg gcgccgtcca    11520

ggcacctcga ttagttctcg agcttttgga gtacgtcgtc tttaggttgg ggggaggggt    11580

tttatgcgat ggagtttccc cacactgagt gggtggagac tgaagttagg ccagcttggc    11640

acttgatgta attctccttg gaatttgccc tttttgagtt tggatcttgg ttcattctca    11700

agcctcagac agtggttcaa agtttttttc ttccatttca ggtgtcgtga ggaattcggc    11760

cattacggcc g                                                         11771


<210>  15
<211>  10760
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  15
tcgagaccta gaaaaacatg gagcaatcac aagtagcaat acagcagcta ccaatgctga       60

ttgtgcctgg ctagaagcac aagaggagga ggaggtgggt tttccagtca cacctcaggt      120

acctttaaga ccaatgactt acaaggcagc tgtagatctt agccactttt taaaagaaaa      180

ggggggactg gaagggctaa ttcactccca acgaagacaa gatctgcttt ttgcttgtac      240

tgggtctctc tggttagacc agatctgagc ctgggagctc tctggctaac tagggaaccc      300

actgcttaag cctcaataaa gcttgccttg agtgcttcaa gtagtgtgtg cccgtctgtt      360

gtgtgactct ggtaactaga gatccctcag acccttttag tcagtgtgga aaatctctag      420

cagggcccgt ttaaacccgc tgatcagcct cgactgtgcc ttctagttgc cagccatctg      480

ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt      540

cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg      600

gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg      660

atgcggtggg ctctatggct tctgaggcgg aaagaaccag ctggggctct agggggtatc      720

cccacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga      780

ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg      840

ccacgttcgc cggctttccc cgtcaagctc taaatcgggg catcccttta gggttccgat      900

ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg      960

ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata     1020

gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt     1080

tataagggat tttggggatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat     1140

ttaacgcgaa ttaattctgt ggaatgtgtg tcagttaggg tgtggaaagt ccccaggctc     1200

cccaggcagg cagaagtatg caaagcatgc atctcaatta gtcagcaacc aggtgtggaa     1260

agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa     1320

ccatagtccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt tccgcccatt     1380

ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctctgcct     1440

ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcaaaaagc     1500

tcccgggagc ttgtatatcc attttcggat ctgatcagca cgtgttgaca attaatcatc     1560

ggcatagtat atcggcatag tataatacga caaggtgagg aactaaacca tggccaagtt     1620

gaccagtgcc gttccggtgc tcaccgcgcg cgacgtcgcc ggagcggtcg agttctggac     1680

cgaccggctc gggttctccc gggacttcgt ggaggacgac ttcgccggtg tggtccggga     1740

cgacgtgacc ctgttcatca gcgcggtcca ggaccaggtg gtgccggaca acaccctggc     1800

ctgggtgtgg gtgcgcggcc tggacgagct gtacgccgag tggtcggagg tcgtgtccac     1860

gaacttccgg gacgcctccg ggccggccat gaccgagatc ggcgagcagc cgtgggggcg     1920

ggagttcgcc ctgcgcgacc cggccggcaa ctgcgtgcac ttcgtggccg aggagcagga     1980

ctgacacgtg ctacgagatt tcgattccac cgccgccttc tatgaaaggt tgggcttcgg     2040

aatcgttttc cgggacgccg gctggatgat cctccagcgc ggggatctca tgctggagtt     2100

cttcgcccac cccaacttgt ttattgcagc ttataatggt tacaaataaa gcaatagcat     2160

cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt tgtccaaact     2220

catcaatgta tcttatcatg tctgtatacc gtcgacctct agctagagct tggcgtaatc     2280

atggtcatag ctgtttcctg tgtgaaattg ttatccgctc acaattccac acaacatacg     2340

agccggaagc ataaagtgta aagcctgggg tgcctaatga gtgagctaac tcacattaat     2400

tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg     2460

aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct     2520

cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc     2580

ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg     2640

ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg     2700

cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg     2760

actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac     2820

cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca     2880

atgctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt     2940

gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc     3000

caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag     3060

agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac     3120

tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt     3180

tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa     3240

gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg     3300

gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa     3360

aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat     3420

atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc     3480

gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat     3540

acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc     3600

ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc     3660

tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag     3720

ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg tggtgtcacg     3780

ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg     3840

atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag     3900

taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt     3960

catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga     4020

atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata ataccgcgcc     4080

acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc     4140

aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc     4200

ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc     4260

cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca     4320

atattattga agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat     4380

ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt     4440

cgacggatcg ggagatctcc cgatccccta tggtgcactc tcagtacaat ctgctctgat     4500

gccgcatagt taagccagta tctgctccct gcttgtgtgt tggaggtcgc tgagtagtgc     4560

gcgagcaaaa tttaagctac aacaaggcaa ggcttgaccg acaattgcat gaagaatctg     4620

cttagggtta ggcgttttgc gctgcttcgc gatgtacggg ccagatatac gcgttgacat     4680

tgattattga ctagttatta atagtaatca attacggggt cattagttca tagcccatat     4740

atggagttcc gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac     4800

ccccgcccat tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc     4860

cattgacgtc aatgggtgga ctatttacgg taaactgccc acttggcagt acatcaagtg     4920

tatcatatgc caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat     4980

tatgcccagt acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc     5040

atcgctatta ccatggtgat gcggttttgg cagtacatca atgggcgtgg atagcggttt     5100

gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt gttttggcac     5160

caaaatcaac gggactttcc aaaatgtcgt aacaactccg ccccattgac gcaaatgggc     5220

ggtaggcgtg tacggtggga ggtctatata agcagagctc tctggctaac tagagaaccc     5280

actgcttact ggcttatcga aattaatacg actcactata gggagaccca agctggttta     5340

aacttaagct tggtaccgag ctcactagtc cagtgtggtg gcagatatcc agcacagtgg     5400

cggccgctcg agtctagagg gcccgttttg cctgtactgg gtctctctgg ttagaccaga     5460

tctgagcctg ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct     5520

tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat     5580

ccctcagacc cttttagtca gtgtggaaaa tctctagcag tggcgcccga acagggactt     5640

gaaagcgaaa gggaaaccag aggagctctc tcgacgcagg actcggcttg ctgaagcgcg     5700

cacggcaaga ggcgaggggc ggcgactggt gagtacgcca aaaattttga ctagcggagg     5760

ctagaaggag agagatgggt gcgagagcgt cagtattaag cgggggagaa ttagatcgcg     5820

atgggaaaaa attcggttaa ggccaggggg aaagaaaaaa tataaattaa aacatatagt     5880

atgggcaagc agggagctag aacgattcgc agttaatcct ggcctgttag aaacatcaga     5940

aggctgtaga caaatactgg gacagctaca accatccctt cagacaggat cagaagaact     6000

tagatcatta tataatacag tagcaaccct ctattgtgtg catttaatta actggaatac     6060

gacaagataa cccggatcgt gggcctggat cagtacctgg agagcgttaa aaaacacaaa     6120

cggctggatg tgtgccgcgc taaaatgggc tatatgctgc agtgaataat aaaatgtgtg     6180

tttgtccgaa atacgcgttt tgagatttct gtcgccgact aaattcatgt cgcgcgatag     6240

tggtgtttat cgccgataga gatggcgata ttggaaaaat cgatatttga aaatatggca     6300

tattgaaaat gtcgccgatg tgagtttctg tgtaactgat atcgccattt ttccaaaagt     6360

gatttttggg catacgcgat atctggcgat agcgcttata tcgtttacgg gggatggcga     6420

tagacgactt tggtgacttg ggcgattctg tgtgtcgcaa atatcgcagt ttcgatatag     6480

gtgacagacg atatgaggct atatcgccga tagaggcgac atcaagctgg cacatggcca     6540

atgcatatcg atctatacat tgaatcaata ttggccatta gccatattat tcattggtta     6600

tatagcataa atcaatattg gctattggcc attgcatacg ttgtatccat atcataatat     6660

gtacatttat attggctcat gtccaacatt accgccatgt tgacattgat tattgactag     6720

ttattaatag taatcaatta cggggtcatt agttcatagc ccatatatgg agttccgcgt     6780

tacataactt acggtaaatg gcccgcctgg ctgaccgccc aacgaccccc gcccattgac     6840

gtcaataatg acgtatgttc ccatagtaac gccaataggg actttccatt gacgtcaatg     6900

ggtggagtat ttacggtaaa ctgcccactt ggcagtacat caagtgtatc atatgccaag     6960

tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg cccagtacat     7020

gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg ctattaccat     7080

ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact cacggggatt     7140

tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa atcaacggga     7200

ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta ggcgtgtacg     7260

gtgggaggtc tatataagca gagctcgttt agtgaaccgt cagatcgcct ggagacgcca     7320

tccacgctgt tttgacctcc atagaagaca ccgggaccga tccagcctcc gcggccggga     7380

acggtgcatt ggaacgcgga ttccccgtgc caagagtgac gtaagtaccg cctatagagt     7440

ctataggccc acccccttgg cttcttatgc atgctatact gtttttggct tggggtctat     7500

acacccccgc ttcctcatgt tataggtgat ggtatagctt agcctatagg tgtgggttat     7560

tgaccattat tgaccactcc cctattggtg acgatacttt ccattactaa tccataacat     7620

ggctctttgc cacaactctc tttattggct atatgccaat acactgtcct tcagagactg     7680

acacggactc tgtattttta caggatgggg tctcatttat tatttacaaa ttcacatata     7740

caacaccacc gtccccagtg cccgcagttt ttattaaaca taacgtggga tctccacgcg     7800

aatctcgggt acgtgttccg gacatgggct cttctccggt agcggcggag cttctacatc     7860

cgagccctgc tcccatgcct ccagcgactc atggtcgctc ggcagctcct tgctcctaac     7920

agtggaggcc agacttaggc acagcacgat gcccaccacc accagtgtgc cgcacaaggc     7980

cgtggcggta gggtatgtgt ctgaaaatga gctcggggag cgggcttgca ccgctgacgc     8040

atttggaaga cttaaggcag cggcagaaga agatgcaggc agctgagttg ttgtgttctg     8100

ataagagtca gaggtaactc ccgttgcggt gctgttaacg gtggagggca gtgtagtctg     8160

agcagtactc gttgctgccg cgcgcgccac cagacataat agctgacaga ctaacagact     8220

gttcctttcc atgggtcttt tctgcagtca ccgtccttga cacggctagc atggagtcct     8280

ctgccaagag aaagatggac cctgataatc ctgacgaggg cccttcctcc aaggtgccac     8340

ggcccgagac acccgtgacc aaggccacga cgttcctgca gactatgttg aggaaggagg     8400

ttaacagtca gctgagtctg ggagacccgc tgtttccaga gttggccgaa gaatccctca     8460

aaacttttga acaagtgacc gaggattgca acgagaaccc cgagaaagat gtcctggcag     8520

aactcggtga catcctcgcc caggctgtca atcatgccgg tatcgattcc agtagcaccg     8580

gccccacgct gacaacccac tcttgcagcg ttagcagcgc ccctcttaac aagccgaccc     8640

ccaccagcgt cgcggttact aacactcctc tccccggggc atccgctact cccgagctca     8700

gcccgcgtaa gaaaccgcgc aaaaccacgc gtcctttcaa ggtgattatt aaaccgcccg     8760

tgcctcccgc gcctatcatg ctgcccctca tcaaacagga agacatcaag cccgagcccg     8820

actttaccat ccagtaccgc aacaagatta tcgataccgc cggctgtatc gtgatctctg     8880

atagcgagga agaacagggt gaagaagtcg aaacccgcgg tgctaccgcg tcttcccctt     8940

ccaccggcag cggcacgccg cgagtgacct ctcccacgca cccgctctcc cagatgaacc     9000

accctcctct tcccgatccc ttgggccggc ccgatgaaga tagttcctct tcgtcttcct     9060

cctcctgcag ttcggcttcg gactcggaga gtgagtccga ggagatgaaa tgcagcagtg     9120

gcggaggagc atccgtgacc tcgagccacc atgggcgcgg cggttttggt ggcgcggcct     9180

cctcctctct gctgagctgc ggccatcaga gcagcggcgg ggcgagcacc ggaccccgca     9240

agaagaagag caaacgcatc tccgagttgg acaacgagaa ggtgcgcaat atcatgaaag     9300

ataagaacac ccccttctgc acacccaacg tgcagactcg gcggggtcgc gtcaagattg     9360

acgaggtgag ccgcatgttc cgcaacacca atcgctctct tgagtacaag aacctgccct     9420

tcacgattcc cagtatgcac caggtgttag atgaggccat caaagcctgc aaaaccatgc     9480

aggtgaacaa caagggcatc cagattatct acacccgcaa tcatgaggtg aagagtgagg     9540

tggatgcggt gcggtgtcgc ctgggcacca tgtgcaacct ggccctctcc actcccttcc     9600

tcatggagca caccatgccc gtgacacatc cacccgaagt ggcgcagcgc acagccgatg     9660

cttgtaacga aggcgtcaag gccgcgtgga gcctcaaaga attgcacacc caccaattat     9720

gcccccgttc ctccgattac cgcaacatga tcatccacgc tgccaccccc gtggacctgt     9780

tgggcgctct caacctgtgc ctgcccctga tgcaaaagtt tcccaaacag gtcatggtgc     9840

gcatcttctc caccaaccag ggtgggttca tgctgcctat ctacgagacg gccgcgaagg     9900

cctacgccgt ggggcagttt gagcagccca ccgagacccc tcccgaagac ctggacaccc     9960

tgagcctggc catcgaggca gccatccagg acctgaggaa caagtctcag taaggtgctg    10020

gtgctggtgc tggtgctggt gctgtgagca agggcgagga gctgttcacc ggggtggtgc    10080

ccatcctggt cgagctggac ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg    10140

gcgagggcga tgccacctac ggcaagctga ccctgaagtt catctgcacc accggcaagc    10200

tgcccgtgcc ctggcccacc ctcgtgacca ccctgaccta cggcgtgcag tgcttcagcc    10260

gctaccccga ccacatgaag cagcacgact tcttcaagtc cgccatgccc gaaggctacg    10320

tccaggagcg caccatcttc ttcaaggacg acggcaacta caagacccgc gccgaggtga    10380

agttcgaggg cgacaccctg gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg    10440

acggcaacat cctggggcac aagctggagt acaactacaa cagccacaac gtctatatca    10500

tggccgacaa gcagaagaac ggcatcaagg tgaacttcaa gatccgccac aacatcgagg    10560

acggcagcgt gcagctcgcc gaccactacc agcagaacac ccccatcggc gacggccccg    10620

tgctgctgcc cgacaaccac tacctgagca cccagtccgc cctgagcaaa gaccccaacg    10680

agaagcgcga tcacatggtc ctgctggagt tcgtgaccgc cgccgggatc actctcggca    10740

tggacgagct gtacaagtaa                                                10760


<210>  16
<211>  11362
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  16
ggtttgtcga gacctagaaa aacatggagc aatcacaagt agcaatacag cagctaccaa       60

tgctgattgt gcctggctag aagcacaaga ggaggaggag gtgggttttc cagtcacacc      120

tcaggtacct ttaagaccaa tgacttacaa ggcagctgta gatcttagcc actttttaaa      180

agaaaagggg ggactggaag ggctaattca ctcccaacga agacaagatc tgctttttgc      240

ttgtactggg tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg      300

gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg      360

tctgttgtgt gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat      420

ctctagcagg gcccgtttaa acccgctgat cagcctcgac tgtgccttct agttgccagc      480

catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg      540

tcctttccta ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc      600

tggggggtgg ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg      660

ctggggatgc ggtgggctct atggcttctg aggcggaaag aaccagctgg ggctctaggg      720

ggtatcccca cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca      780

gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct      840

ttctcgccac gttcgccggc tttccccgtc aagctctaaa tcggggcatc cctttagggt      900

tccgatttag tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac      960

gtagtgggcc atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct     1020

ttaatagtgg actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt     1080

ttgatttata agggattttg gggatttcgg cctattggtt aaaaaatgag ctgatttaac     1140

aaaaatttaa cgcgaattaa ttctgtggaa tgtgtgtcag ttagggtgtg gaaagtcccc     1200

aggctcccca ggcaggcaga agtatgcaaa gcatgcatct caattagtca gcaaccaggt     1260

gtggaaagtc cccaggctcc ccagcaggca gaagtatgca aagcatgcat ctcaattagt     1320

cagcaaccat agtcccgccc ctaactccgc ccatcccgcc cctaactccg cccagttccg     1380

cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc gaggccgcct     1440

ctgcctctga gctattccag aagtagtgag gaggcttttt tggaggccta ggcttttgca     1500

aaaagctccc gggagcttgt atatccattt tcggatctga tcagcacgtg ttgacaatta     1560

atcatcggca tagtatatcg gcatagtata atacgacaag gtgaggaact aaaccatggc     1620

caagttgacc agtgccgttc cggtgctcac cgcgcgcgac gtcgccggag cggtcgagtt     1680

ctggaccgac cggctcgggt tctcccggga cttcgtggag gacgacttcg ccggtgtggt     1740

ccgggacgac gtgaccctgt tcatcagcgc ggtccaggac caggtggtgc cggacaacac     1800

cctggcctgg gtgtgggtgc gcggcctgga cgagctgtac gccgagtggt cggaggtcgt     1860

gtccacgaac ttccgggacg cctccgggcc ggccatgacc gagatcggcg agcagccgtg     1920

ggggcgggag ttcgccctgc gcgacccggc cggcaactgc gtgcacttcg tggccgagga     1980

gcaggactga cacgtgctac gagatttcga ttccaccgcc gccttctatg aaaggttggg     2040

cttcggaatc gttttccggg acgccggctg gatgatcctc cagcgcgggg atctcatgct     2100

ggagttcttc gcccacccca acttgtttat tgcagcttat aatggttaca aataaagcaa     2160

tagcatcaca aatttcacaa ataaagcatt tttttcactg cattctagtt gtggtttgtc     2220

caaactcatc aatgtatctt atcatgtctg tataccgtcg acctctagct agagcttggc     2280

gtaatcatgg tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa     2340

catacgagcc ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac     2400

attaattgcg ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca     2460

ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt attgggcgct cttccgcttc     2520

ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat cagctcactc     2580

aaaggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc     2640

aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag     2700

gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc     2760

gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt     2820

tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct     2880

ttctcaatgc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg     2940

ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct     3000

tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat     3060

tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg     3120

ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa     3180

aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt     3240

ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc     3300

tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt     3360

atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta     3420

aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat     3480

ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac     3540

tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg     3600

ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag     3660

tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt     3720

aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt     3780

gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt     3840

tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt     3900

cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct     3960

tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt     4020

ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac     4080

cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa     4140

actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa     4200

ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca     4260

aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct     4320

ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga     4380

atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc     4440

tgacgtcgac ggatcgggag atctcccgat cccctatggt gcactctcag tacaatctgc     4500

tctgatgccg catagttaag ccagtatctg ctccctgctt gtgtgttgga ggtcgctgag     4560

tagtgcgcga gcaaaattta agctacaaca aggcaaggct tgaccgacaa ttgcatgaag     4620

aatctgctta gggttaggcg ttttgcgctg cttcgcgatg tacgggccag atatacgcgt     4680

tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt agttcatagc     4740

ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg ctgaccgccc     4800

aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac gccaataggg     4860

actttccatt gacgtcaatg ggtggactat ttacggtaaa ctgcccactt ggcagtacat     4920

caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc     4980

tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta catctacgta     5040

ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag     5100

cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt     5160

tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa     5220

atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagctctctg gctaactaga     5280

gaacccactg cttactggct tatcgaaatt aatacgactc actataggga gacccaagct     5340

ggtttaaact taagcttggt accgagctca ctagtccagt gtggtggcag atatccagca     5400

cagtggcggc cgctcgagtc tagagggccc gttttgcctg tactgggtct ctctggttag     5460

accagatctg agcctgggag ctctctggct aactagggaa cccactgctt aagcctcaat     5520

aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac tctggtaact     5580

agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtggc gcccgaacag     5640

ggacttgaaa gcgaaaggga aaccagagga gctctctcga cgcaggactc ggcttgctga     5700

agcgcgcacg gcaagaggcg aggggcggcg actggtgagt acgccaaaaa ttttgactag     5760

cggaggctag aaggagagag atgggtgcga gagcgtcagt attaagcggg ggagaattag     5820

atcgcgatgg gaaaaaattc ggttaaggcc agggggaaag aaaaaatata aattaaaaca     5880

tatagtatgg gcaagcaggg agctagaacg attcgcagtt aatcctggcc tgttagaaac     5940

atcagaaggc tgtagacaaa tactgggaca gctacaacca tcccttcaga caggatcaga     6000

agaacttaga tcattatata atacagtagc aaccctctat tgtgtgcatt taattaactg     6060

gaatacgaca agataacccg gatcgtgggc ctggatcagt acctggagag cgttaaaaaa     6120

cacaaacggc tggatgtgtg ccgcgctaaa atgggctata tgctgcagtg aataataaaa     6180

tgtgtgtttg tccgaaatac gcgttttgag atttctgtcg ccgactaaat tcatgtcgcg     6240

cgatagtggt gtttatcgcc gatagagatg gcgatattgg aaaaatcgat atttgaaaat     6300

atggcatatt gaaaatgtcg ccgatgtgag tttctgtgta actgatatcg ccatttttcc     6360

aaaagtgatt tttgggcata cgcgatatct ggcgatagcg cttatatcgt ttacggggga     6420

tggcgataga cgactttggt gacttgggcg attctgtgtg tcgcaaatat cgcagtttcg     6480

atataggtga cagacgatat gaggctatat cgccgataga ggcgacatca agctggcaca     6540

tggccaatgc atatcgatct atacattgaa tcaatattgg ccattagcca tattattcat     6600

tggttatata gcataaatca atattggcta ttggccattg catacgttgt atccatatca     6660

taatatgtac atttatattg gctcatgtcc aacattaccg ccatgttgac attgattatt     6720

gactagttat taatagtaat caattacggg gtcattagtt catagcccat atatggagtt     6780

ccgcgttaca taacttacgg taaatggccc gcctggctga ccgcccaacg acccccgccc     6840

attgacgtca ataatgacgt atgttcccat agtaacgcca atagggactt tccattgacg     6900

tcaatgggtg gagtatttac ggtaaactgc ccacttggca gtacatcaag tgtatcatat     6960

gccaagtacg ccccctattg acgtcaatga cggtaaatgg cccgcctggc attatgccca     7020

gtacatgacc ttatgggact ttcctacttg gcagtacatc tacgtattag tcatcgctat     7080

taccatggtg atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg     7140

gggatttcca agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca     7200

acgggacttt ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg     7260

tgtacggtgg gaggtctata taagcagagc tcgtttagtg aaccgtcaga tcgcctggag     7320

acgccatcca cgctgttttg acctccatag aagacaccgg gaccgatcca gcctccgcgg     7380

ccgggaacgg tgcattggaa cgcggattcc ccgtgccaag agtgacgtaa gtaccgccta     7440

tagagtctat aggcccaccc ccttggcttc ttatgcatgc tatactgttt ttggcttggg     7500

gtctatacac ccccgcttcc tcatgttata ggtgatggta tagcttagcc tataggtgtg     7560

ggttattgac cattattgac cactccccta ttggtgacga tactttccat tactaatcca     7620

taacatggct ctttgccaca actctcttta ttggctatat gccaatacac tgtccttcag     7680

agactgacac ggactctgta tttttacagg atggggtctc atttattatt tacaaattca     7740

catatacaac accaccgtcc ccagtgcccg cagtttttat taaacataac gtgggatctc     7800

cacgcgaatc tcgggtacgt gttccggaca tgggctcttc tccggtagcg gcggagcttc     7860

tacatccgag ccctgctccc atgcctccag cgactcatgg tcgctcggca gctccttgct     7920

cctaacagtg gaggccagac ttaggcacag cacgatgccc accaccacca gtgtgccgca     7980

caaggccgtg gcggtagggt atgtgtctga aaatgagctc ggggagcggg cttgcaccgc     8040

tgacgcattt ggaagactta aggcagcggc agaagaagat gcaggcagct gagttgttgt     8100

gttctgataa gagtcagagg taactcccgt tgcggtgctg ttaacggtgg agggcagtgt     8160

agtctgagca gtactcgttg ctgccgcgcg cgccaccaga cataatagct gacagactaa     8220

cagactgttc ctttccatgg gtcttttctg cagtcaccgt ccttgacacg gctagcatgg     8280

agtcctctgc caagagaaag atggaccctg ataatcctga cgagggccct tcctccaagg     8340

tgccacggcc cgagacaccc gtgaccaagg ccacgacgtt cctgcagact atgttgagga     8400

aggaggttaa cagtcagctg agtctgggag acccgctgtt tccagagttg gccgaagaat     8460

ccctcaaaac ttttgaacga gtgaccgagg attgcaacga gaaccccgag aaagatgtcc     8520

tggcagaact cggtgacatc ctcgcccagg ctgtcaatca tgccggtatc gattccagta     8580

gcaccggccc cacgctgaca acccactctt gcagcgttag cagcgcccct cttaacaagc     8640

cgacccccac cagcgtcgcg gttactaaca ctcctctccc cggggcatcc gctactcccg     8700

agctcagccc gcgtaagaaa ccgcgcaaaa ccacgcgtcc tttcaaggtg attattaaac     8760

cgcccgtgcc tcccgcgcct atcatgctgc ccctcatcaa acaggaagac atcaagcccg     8820

agcccgactt taccatccag taccgcaaca agattatcga taccgccggc tgtatcgtga     8880

tctctgatag cgaggaagaa cagggtgaag aagtcgaaac ccgcggtgct accgcgtctt     8940

ccccttccac cggcagcggc acgccgcgag tgacctctcc cacgcacccg ctctcccaga     9000

taaaccaccc tcctcttccc gatcccttgg gccggcccga tgaagatagt tcctcttcgt     9060

cttcctcctg cagttcggct tcggactcgg agagtgagtc cgaggagatg aaatgcagca     9120

gtggcggagg agcatccgtg acctcgagcc accatgggcg cggcggtttt ggtggcgcgg     9180

cctcctcctc tctgctgagc tgcggccatc agagcagcgg cggggcgagc accggacccc     9240

gcaagaagaa gagcaaacgc atctccgagt tggacaacga gaaggtgcgc aatatcatga     9300

aagataagaa cacccccttc tgcacaccca acgtgcagac tcggcggggt cgcgtcaaga     9360

ttgacgaggt gagccgcatg ttccgcaaca ccaatcgctc tcttgagtac aagaacctgc     9420

ccttcacgat tcccagtatg caccaggtgt tagatgaggc catcaaagcc tgcaaaacca     9480

tgcaggtgaa caacaagggc atccagatta tctacacccg caatcatgag gtgaagagtg     9540

aggtggatgc ggtgcggtgt cgcctgggca ccatgtgcaa cctggccctc tccactccct     9600

tcctcatgga gcacaccatg cccgtgacac atccacccga agtggcgcag cgcacagccg     9660

atacttgtaa cgaaggcgtc aaggccgcgt ggagcctcaa agaattgcac acccaccaat     9720

tatgcccccg ttcctccgat taccgcaaca tgatcatcca cgctgccacc cccgtggacc     9780

tgttgggcgc tctcaacctg tgcctgcccc tgatgcaaaa gtttcccaaa caggtcatgg     9840

tgcgcatctt ctccaccaac cagggtgggt tcatgctgcc tatctacgag acggccgcga     9900

aggcctacgc cgtggggcag tttgagcagc ccaccgagac ccctcccgaa gacctggaca     9960

ccctgagcct ggccatcgag gcagccatcc aggacctgag gaacaagtct cagtaaggat    10020

ccgcccctct ccctcccccc cccctaacgt tactggccga agccgcttgg aataaggccg    10080

gtgtgcgttt gtctatatgt tattttccac catattgccg tcttttggca atgtgagggc    10140

ccggaaacct ggccctgtct tcttgacgag cattcctagg ggtctttccc ctctcgccaa    10200

aggaatgcaa ggtctgttga atgtcgtgaa ggaagcagtt cctctggaag cttcttgaag    10260

acaaacaacg tctgtagcga ccctttgcag gcagcggaac cccccacctg gcgacaggtg    10320

cctctgcggc caaaagccac gtgtataaga tacacctgca aaggcggcac aaccccagtg    10380

ccacgttgtg agttggatag ttgtggaaag agtcaaatgg ctctcctcaa gcgtattcaa    10440

caaggggctg aaggatgccc agaaggtacc ccattgtatg ggatctgatc tggggcctcg    10500

gtacacatgc tttacatgtg tttagtcgag gttaaaaaaa cgtctaggcc ccccgaacca    10560

cggggacgtg gttttccttt gaaaaacacg atgataatat ggccacaacc atggtgagca    10620

agggcgagga ggataacatg gccatcatca aggagttcat gcgcttcaag gtgcacatgg    10680

agggctccgt gaacggccac gagttcgaga tcgagggcga gggcgagggc cgcccctacg    10740

agggcaccca gaccgccaag ctgaaggtga ccaagggtgg ccccctgccc ttcgcctggg    10800

acatcctgtc ccctcagttc atgtacggct ccaaggccta cgtgaagcac cccgccgaca    10860

tccccgacta cttgaagctg tccttccccg agggcttcaa gtgggagcgc gtgatgaact    10920

tcgaggacgg cggcgtggtg accgtgaccc aggactcctc cctgcaggac ggcgagttca    10980

tctacaaggt gaagctgcgc ggcaccaact tcccctccga cggccccgta atgcagaaga    11040

agaccatggg ctgggaggcc tcctccgagc ggatgtaccc cgaggacggc gccctgaagg    11100

gcgagatcaa gcagaggctg aagctgaagg acggcggcca ctacgacgct gaggtcaaga    11160

ccacctacaa ggccaagaag cccgtgcagc tgcccggcgc ctacaacgtc aacatcaagt    11220

tggacatcac ctcccacaac gaggactaca ccatcgtgga acagtacgaa cgcgccgagg    11280

gccgccactc caccggcggc atggacgagc tgtacaagag cagcctgagg cctcctaaga    11340

agaagaggaa ggtttgaatg ca                                             11362


<210>  17
<211>  11375
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  17
gatccgcccc tctccctccc ccccccctaa cgttactggc cgaagccgct tggaataagg       60

ccggtgtgcg tttgtctata tgttattttc caccatattg ccgtcttttg gcaatgtgag      120

ggcccggaaa cctggccctg tcttcttgac gagcattcct aggggtcttt cccctctcgc      180

caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca gttcctctgg aagcttcttg      240

aagacaaaca acgtctgtag cgaccctttg caggcagcgg aaccccccac ctggcgacag      300

gtgcctctgc ggccaaaagc cacgtgtata agatacacct gcaaaggcgg cacaacccca      360

gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa tggctctcct caagcgtatt      420

caacaagggg ctgaaggatg cccagaaggt accccattgt atgggatctg atctggggcc      480

tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa aaacgtctag gccccccgaa      540

ccacggggac gtggttttcc tttgaaaaac acgatgataa tatggccaca accatggtga      600

gcaagggcga ggagctgttc accggggtgg tgcccatcct ggtcgagctg gacggcgacg      660

taaacggcca caagttcagc gtgtccggcg agggcgaggg cgatgccacc tacggcaagc      720

tgaccctgaa gttcatctgc accaccggca agctgcccgt gccctggccc accctcgtga      780

ccaccctgac ctacggcgtg cagtgcttca gccgctaccc cgaccacatg aagcagcacg      840

acttcttcaa gtccgccatg cccgaaggct acgtccagga gcgcaccatc ttcttcaagg      900

acgacggcaa ctacaagacc cgcgccgagg tgaagttcga gggcgacacc ctggtgaacc      960

gcatcgagct gaagggcatc gacttcaagg aggacggcaa catcctgggg cacaagctgg     1020

agtacaacta caacagccac aacgtctata tcatggccga caagcagaag aacggcatca     1080

aggtgaactt caagatccgc cacaacatcg aggacggcag cgtgcagctc gccgaccact     1140

accagcagaa cacccccatc ggcgacggcc ccgtgctgct gcccgacaac cactacctga     1200

gcacccagtc cgccctgagc aaagacccca acgagaagcg cgatcacatg gtcctgctgg     1260

agttcgtgac cgccgccggg atcactctcg gcatggacga gctgtacaag agcagcctga     1320

ggcctcctaa gaagaagagg aaggtttgac ctgcaggttt gtcgagacct agaaaaacat     1380

ggagcaatca caagtagcaa tacagcagct accaatgctg attgtgcctg gctagaagca     1440

caagaggagg aggaggtggg ttttccagtc acacctcagg tacctttaag accaatgact     1500

tacaaggcag ctgtagatct tagccacttt ttaaaagaaa aggggggact ggaagggcta     1560

attcactccc aacgaagaca agatctgctt tttgcttgta ctgggtctct ctggttagac     1620

cagatctgag cctgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa     1680

agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt tgtgtgactc tggtaactag     1740

agatccctca gaccctttta gtcagtgtgg aaaatctcta gcagggcccg tttaaacccg     1800

ctgatcagcc tcgactgtgc cttctagttg ccagccatct gttgtttgcc cctcccccgt     1860

gccttccttg accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat     1920

tgcatcgcat tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag     1980

caagggggag gattgggaag acaatagcag gcatgctggg gatgcggtgg gctctatggc     2040

ttctgaggcg gaaagaacca gctggggctc tagggggtat ccccacgcgc cctgtagcgg     2100

cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac ttgccagcgc     2160

cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg ccggctttcc     2220

ccgtcaagct ctaaatcggg gcatcccttt agggttccga tttagtgctt tacggcacct     2280

cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt gggccatcgc cctgatagac     2340

ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac     2400

tggaacaaca ctcaacccta tctcggtcta ttcttttgat ttataaggga ttttggggat     2460

ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attaattctg     2520

tggaatgtgt gtcagttagg gtgtggaaag tccccaggct ccccaggcag gcagaagtat     2580

gcaaagcatg catctcaatt agtcagcaac caggtgtgga aagtccccag gctccccagc     2640

aggcagaagt atgcaaagca tgcatctcaa ttagtcagca accatagtcc cgcccctaac     2700

tccgcccatc ccgcccctaa ctccgcccag ttccgcccat tctccgcccc atggctgact     2760

aatttttttt atttatgcag aggccgaggc cgcctctgcc tctgagctat tccagaagta     2820

gtgaggaggc ttttttggag gcctaggctt ttgcaaaaag ctcccgggag cttgtatatc     2880

cattttcgga tctgatcagc acgtgttgac aattaatcat cggcatagta tatcggcata     2940

gtataatacg acaaggtgag gaactaaacc atggccaagt tgaccagtgc cgttccggtg     3000

ctcaccgcgc gcgacgtcgc cggagcggtc gagttctgga ccgaccggct cgggttctcc     3060

cgggacttcg tggaggacga cttcgccggt gtggtccggg acgacgtgac cctgttcatc     3120

agcgcggtcc aggaccaggt ggtgccggac aacaccctgg cctgggtgtg ggtgcgcggc     3180

ctggacgagc tgtacgccga gtggtcggag gtcgtgtcca cgaacttccg ggacgcctcc     3240

gggccggcca tgaccgagat cggcgagcag ccgtgggggc gggagttcgc cctgcgcgac     3300

ccggccggca actgcgtgca cttcgtggcc gaggagcagg actgacacgt gctacgagat     3360

ttcgattcca ccgccgcctt ctatgaaagg ttgggcttcg gaatcgtttt ccgggacgcc     3420

ggctggatga tcctccagcg cggggatctc atgctggagt tcttcgccca ccccaacttg     3480

tttattgcag cttataatgg ttacaaataa agcaatagca tcacaaattt cacaaataaa     3540

gcattttttt cactgcattc tagttgtggt ttgtccaaac tcatcaatgt atcttatcat     3600

gtctgtatac cgtcgacctc tagctagagc ttggcgtaat catggtcata gctgtttcct     3660

gtgtgaaatt gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt     3720

aaagcctggg gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc     3780

gctttccagt cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg     3840

agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg     3900

gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca     3960

gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac     4020

cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac     4080

aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg     4140

tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac     4200

ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat     4260

ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag     4320

cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac     4380

ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt     4440

gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt     4500

atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc     4560

aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga     4620

aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac     4680

gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc     4740

cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct     4800

gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca     4860

tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct     4920

ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca     4980

ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc     5040

atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg     5100

cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct     5160

tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa     5220

aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta     5280

tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc     5340

ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg     5400

agttgctctt gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa     5460

gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg     5520

agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc     5580

accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg     5640

gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat     5700

cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata     5760

ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg tcgacggatc gggagatctc     5820

ccgatcccct atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagt     5880

atctgctccc tgcttgtgtg ttggaggtcg ctgagtagtg cgcgagcaaa atttaagcta     5940

caacaaggca aggcttgacc gacaattgca tgaagaatct gcttagggtt aggcgttttg     6000

cgctgcttcg cgatgtacgg gccagatata cgcgttgaca ttgattattg actagttatt     6060

aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc cgcgttacat     6120

aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca ttgacgtcaa     6180

taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt caatgggtgg     6240

actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg ccaagtacgc     6300

cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag tacatgacct     6360

tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt accatggtga     6420

tgcggttttg gcagtacatc aatgggcgtg gatagcggtt tgactcacgg ggatttccaa     6480

gtctccaccc cattgacgtc aatgggagtt tgttttggca ccaaaatcaa cgggactttc     6540

caaaatgtcg taacaactcc gccccattga cgcaaatggg cggtaggcgt gtacggtggg     6600

aggtctatat aagcagagct ctctggctaa ctagagaacc cactgcttac tggcttatcg     6660

aaattaatac gactcactat agggagaccc aagctggttt aaacttaagc ttggtaccga     6720

gctcactagt ccagtgtggt ggcagatatc cagcacagtg gcggccgctc gagtctagag     6780

ggcccgtttt gcctgtactg ggtctctctg gttagaccag atctgagcct gggagctctc     6840

tggctaacta gggaacccac tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt     6900

agtgtgtgcc cgtctgttgt gtgactctgg taactagaga tccctcagac ccttttagtc     6960

agtgtggaaa atctctagca gtggcgcccg aacagggact tgaaagcgaa agggaaacca     7020

gaggagctct ctcgacgcag gactcggctt gctgaagcgc gcacggcaag aggcgagggg     7080

cggcgactgg tgagtacgcc aaaaattttg actagcggag gctagaagga gagagatggg     7140

tgcgagagcg tcagtattaa gcgggggaga attagatcgc gatgggaaaa aattcggtta     7200

aggccagggg gaaagaaaaa atataaatta aaacatatag tatgggcaag cagggagcta     7260

gaacgattcg cagttaatcc tggcctgtta gaaacatcag aaggctgtag acaaatactg     7320

ggacagctac aaccatccct tcagacagga tcagaagaac ttagatcatt atataataca     7380

gtagcaaccc tctattgtgt gcatttaatt aactggaata cgacaagata acccggatcg     7440

tgggcctgga tcagtacctg gagagcgtta aaaaacacaa acggctggat gtgtgccgcg     7500

ctaaaatggg ctatatgctg cagtgaataa taaaatgtgt gtttgtccga aatacgcgtt     7560

ttgagatttc tgtcgccgac taaattcatg tcgcgcgata gtggtgttta tcgccgatag     7620

agatggcgat attggaaaaa tcgatatttg aaaatatggc atattgaaaa tgtcgccgat     7680

gtgagtttct gtgtaactga tatcgccatt tttccaaaag tgatttttgg gcatacgcga     7740

tatctggcga tagcgcttat atcgtttacg ggggatggcg atagacgact ttggtgactt     7800

gggcgattct gtgtgtcgca aatatcgcag tttcgatata ggtgacagac gatatgaggc     7860

tatatcgccg atagaggcga catcaagctg gcacatggcc aatgcatatc gatctataca     7920

ttgaatcaat attggccatt agccatatta ttcattggtt atatagcata aatcaatatt     7980

ggctattggc cattgcatac gttgtatcca tatcataata tgtacattta tattggctca     8040

tgtccaacat taccgccatg ttgacattga ttattgacta gttattaata gtaatcaatt     8100

acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact tacggtaaat     8160

ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat gacgtatgtt     8220

cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta tttacggtaa     8280

actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc tattgacgtc     8340

aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg ggactttcct     8400

acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg gttttggcag     8460

tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct ccaccccatt     8520

gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa atgtcgtaac     8580

aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt ctatataagc     8640

agagctcgtt tagtgaaccg tcagatcgcc tggagacgcc atccacgctg ttttgacctc     8700

catagaagac accgggaccg atccagcctc cgcggccggg aacggtgcat tggaacgcgg     8760

attccccgtg ccaagagtga cgtaagtacc gcctatagag tctataggcc cacccccttg     8820

gcttcttatg catgctatac tgtttttggc ttggggtcta tacacccccg cttcctcatg     8880

ttataggtga tggtatagct tagcctatag gtgtgggtta ttgaccatta ttgaccactc     8940

ccctattggt gacgatactt tccattacta atccataaca tggctctttg ccacaactct     9000

ctttattggc tatatgccaa tacactgtcc ttcagagact gacacggact ctgtattttt     9060

acaggatggg gtctcattta ttatttacaa attcacatat acaacaccac cgtccccagt     9120

gcccgcagtt tttattaaac ataacgtggg atctccacgc gaatctcggg tacgtgttcc     9180

ggacatgggc tcttctccgg tagcggcgga gcttctacat ccgagccctg ctcccatgcc     9240

tccagcgact catggtcgct cggcagctcc ttgctcctaa cagtggaggc cagacttagg     9300

cacagcacga tgcccaccac caccagtgtg ccgcacaagg ccgtggcggt agggtatgtg     9360

tctgaaaatg agctcgggga gcgggcttgc accgctgacg catttggaag acttaaggca     9420

gcggcagaag aagatgcagg cagctgagtt gttgtgttct gataagagtc agaggtaact     9480

cccgttgcgg tgctgttaac ggtggagggc agtgtagtct gagcagtact cgttgctgcc     9540

gcgcgcgcca ccagacataa tagctgacag actaacagac tgttcctttc catgggtctt     9600

ttctgcagtc accgtccttg acacggctag catggagtcc tctgccaaga gaaagatgga     9660

ccctgataat cctgacgagg gcccttcctc caaggtgcca cggcccgaga cacccgtgac     9720

caaggccacg acgttcctgc agactatgtt gaggaaggag gttaacagtc agctgagtct     9780

gggagacccg ctgtttccag agttggccga agaatccctc aaaacttttg aacaagtgac     9840

cgaggattgc aacgagaacc ccgagaaaga tgtcctggca gaactcggtg acatcctcgc     9900

ccaggctgtc aatcatgccg gtatcgattc cagtagcacc ggccccacgc tgacaaccca     9960

ctcttgcagc gttagcagcg cccctcttaa caagccgacc cccaccagcg tcgcggttac    10020

taacactcct ctccccgggg catccgctac tcccgagctc agcccgcgta agaaaccgcg    10080

caaaaccacg cgtcctttca aggtgattat taaaccgccc gtgcctcccg cgcctatcat    10140

gctgcccctc atcaaacagg aagacatcaa gcccgagccc gactttacca tccagtaccg    10200

caacaagatt atcgataccg ccggctgtat cgtgatctct gatagcgagg aagaacaggg    10260

tgaagaagtc gaaacccgcg gtgctaccgc gtcttcccct tccaccggca gcggcacgcc    10320

gcgagtgacc tctcccacgc acccgctctc ccagatgaac caccctcctc ttcccgatcc    10380

cttgggccgg cccgatgaag atagttcctc ttcgtcttcc tcctcctgca gttcggcttc    10440

ggactcggag agtgagtccg aggagatgaa atgcagcagt ggcggaggag catccgtgac    10500

ctcgagccac catgggcgcg gcggttttgg tggcgcggcc tcctcctctc tgctgagctg    10560

cggccatcag agcagcggcg gggcgagcac cggaccccgc aagaagaaga gcaaacgcat    10620

ctccgagttg gacaacgaga aggtgcgcaa tatcatgaaa gataagaaca cccccttctg    10680

cacacccaac gtgcagactc ggcggggtcg cgtcaagatt gacgaggtga gccgcatgtt    10740

ccgcaacacc aatcgctctc ttgagtacaa gaacctgccc ttcacgattc ccagtatgca    10800

ccaggtgtta gatgaggcca tcaaagcctg caaaaccatg caggtgaaca acaagggcat    10860

ccagattatc tacacccgca atcatgaggt gaagagtgag gtggatgcgg tgcggtgtcg    10920

cctgggcacc atgtgcaacc tggccctctc cactcccttc ctcatggagc acaccatgcc    10980

cgtgacacat ccacccgaag tggcgcagcg cacagccgat gcttgtaacg aaggcgtcaa    11040

ggccgcgtgg agcctcaaag aattgcacac ccaccaatta tgcccccgtt cctccgatta    11100

ccgcaacatg atcatccacg ctgccacccc cgtggacctg ttgggcgctc tcaacctgtg    11160

cctgcccctg atgcaaaagt ttcccaaaca ggtcatggtg cgcatcttct ccaccaacca    11220

gggtgggttc atgctgccta tctacgagac ggccgcgaag gcctacgccg tggggcagtt    11280

tgagcagccc accgagaccc ctcccgaaga cctggacacc ctgagcctgg ccatcgaggc    11340

agccatccag gacctgagga acaagtctca gtaag                               11375


<210>  18
<211>  11939
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  18
gatccgcccc tctccctccc ccccccctaa cgttactggc cgaagccgct tggaataagg       60

ccggtgtgcg tttgtctata tgttattttc caccatattg ccgtcttttg gcaatgtgag      120

ggcccggaaa cctggccctg tcttcttgac gagcattcct aggggtcttt cccctctcgc      180

caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca gttcctctgg aagcttcttg      240

aagacaaaca acgtctgtag cgaccctttg caggcagcgg aaccccccac ctggcgacag      300

gtgcctctgc ggccaaaagc cacgtgtata agatacacct gcaaaggcgg cacaacccca      360

gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa tggctctcct caagcgtatt      420

caacaagggg ctgaaggatg cccagaaggt accccattgt atgggatctg atctggggcc      480

tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa aaacgtctag gccccccgaa      540

ccacggggac gtggttttcc tttgaaaaac acgatgataa tatggccaca accatggtga      600

gcaagggcga ggagctgttc accggggtgg tgcccatcct ggtcgagctg gacggcgacg      660

taaacggcca caagttcagc gtgtccggcg agggcgaggg cgatgccacc tacggcaagc      720

tgaccctgaa gttcatctgc accaccggca agctgcccgt gccctggccc accctcgtga      780

ccaccctgac ctacggcgtg cagtgcttca gccgctaccc cgaccacatg aagcagcacg      840

acttcttcaa gtccgccatg cccgaaggct acgtccagga gcgcaccatc ttcttcaagg      900

acgacggcaa ctacaagacc cgcgccgagg tgaagttcga gggcgacacc ctggtgaacc      960

gcatcgagct gaagggcatc gacttcaagg aggacggcaa catcctgggg cacaagctgg     1020

agtacaacta caacagccac aacgtctata tcatggccga caagcagaag aacggcatca     1080

aggtgaactt caagatccgc cacaacatcg aggacggcag cgtgcagctc gccgaccact     1140

accagcagaa cacccccatc ggcgacggcc ccgtgctgct gcccgacaac cactacctga     1200

gcacccagtc cgccctgagc aaagacccca acgagaagcg cgatcacatg gtcctgctgg     1260

agttcgtgac cgccgccggg atcactctcg gcatggacga gctgtacaag taagcggccg     1320

caatcaacct ctggattaca aaatttgtga aagattgact ggtattctta actatgttgc     1380

tccttttacg ctatgtggat acgctgcttt aatgcctttg tatcatgcta ttacttcccg     1440

tacggctttc attttctcct ccttgtataa atcctggttg ctgtctcttt atgaggagtt     1500

gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg tttgctgacg caacccccac     1560

tggttggggc attgccacca cctatcaact cctttccggg actttcgctt tccccctccc     1620

tattgccacg gcggaactca ttgccgcctg ccttgcccgc tgctggacag gggctcggct     1680

gttgggcact gacaattccg tggtgttgtc ggggaagctg acgtcctttc catggctgct     1740

cgcctgtgtt gccaactgga ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct     1800

caatccagcg gaccttcctt cccgcggcct gctgccggtt ctgcggcctc ttccgcgtct     1860

tcgccttcgc cctcagacga gtcggatctc cctttgggcc gcctccccgc ctgcctgcag     1920

gtttgtcgag acctagaaaa acatggagca atcacaagta gcaatacagc agctaccaat     1980

gctgattgtg cctggctaga agcacaagag gaggaggagg tgggttttcc agtcacacct     2040

caggtacctt taagaccaat gacttacaag gcagctgtag atcttagcca ctttttaaaa     2100

gaaaaggggg gactggaagg gctaattcac tcccaacgaa gacaagatct gctttttgct     2160

tgtactgggt ctctctggtt agaccagatc tgagcctggg agctctctgg ctaactaggg     2220

aacccactgc ttaagcctca ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt     2280

ctgttgtgtg actctggtaa ctagagatcc ctcagaccct tttagtcagt gtggaaaatc     2340

tctagcaggg cccgtttaaa cccgctgatc agcctcgact gtgccttcta gttgccagcc     2400

atctgttgtt tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt     2460

cctttcctaa taaaatgagg aaattgcatc gcattgtctg agtaggtgtc attctattct     2520

ggggggtggg gtggggcagg acagcaaggg ggaggattgg gaagacaata gcaggcatgc     2580

tggggatgcg gtgggctcta tggcttctga ggcggaaaga accagctggg gctctagggg     2640

gtatccccac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag     2700

cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct tcccttcctt     2760

tctcgccacg ttcgccggct ttccccgtca agctctaaat cggggcatcc ctttagggtt     2820

ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg atggttcacg     2880

tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt     2940

taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg tctattcttt     3000

tgatttataa gggattttgg ggatttcggc ctattggtta aaaaatgagc tgatttaaca     3060

aaaatttaac gcgaattaat tctgtggaat gtgtgtcagt tagggtgtgg aaagtcccca     3120

ggctccccag gcaggcagaa gtatgcaaag catgcatctc aattagtcag caaccaggtg     3180

tggaaagtcc ccaggctccc cagcaggcag aagtatgcaa agcatgcatc tcaattagtc     3240

agcaaccata gtcccgcccc taactccgcc catcccgccc ctaactccgc ccagttccgc     3300

ccattctccg ccccatggct gactaatttt ttttatttat gcagaggccg aggccgcctc     3360

tgcctctgag ctattccaga agtagtgagg aggctttttt ggaggcctag gcttttgcaa     3420

aaagctcccg ggagcttgta tatccatttt cggatctgat cagcacgtgt tgacaattaa     3480

tcatcggcat agtatatcgg catagtataa tacgacaagg tgaggaacta aaccatggcc     3540

aagttgacca gtgccgttcc ggtgctcacc gcgcgcgacg tcgccggagc ggtcgagttc     3600

tggaccgacc ggctcgggtt ctcccgggac ttcgtggagg acgacttcgc cggtgtggtc     3660

cgggacgacg tgaccctgtt catcagcgcg gtccaggacc aggtggtgcc ggacaacacc     3720

ctggcctggg tgtgggtgcg cggcctggac gagctgtacg ccgagtggtc ggaggtcgtg     3780

tccacgaact tccgggacgc ctccgggccg gccatgaccg agatcggcga gcagccgtgg     3840

gggcgggagt tcgccctgcg cgacccggcc ggcaactgcg tgcacttcgt ggccgaggag     3900

caggactgac acgtgctacg agatttcgat tccaccgccg ccttctatga aaggttgggc     3960

ttcggaatcg ttttccggga cgccggctgg atgatcctcc agcgcgggga tctcatgctg     4020

gagttcttcg cccaccccaa cttgtttatt gcagcttata atggttacaa ataaagcaat     4080

agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc     4140

aaactcatca atgtatctta tcatgtctgt ataccgtcga cctctagcta gagcttggcg     4200

taatcatggt catagctgtt tcctgtgtga aattgttatc cgctcacaat tccacacaac     4260

atacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca     4320

ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat     4380

taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc     4440

tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca     4500

aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca     4560

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg     4620

ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg     4680

acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt     4740

ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt     4800

tctcaatgct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc     4860

tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt     4920

gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt     4980

agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc     5040

tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa     5100

agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt     5160

tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct     5220

acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta     5280

tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa     5340

agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc     5400

tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact     5460

acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc     5520

tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt     5580

ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta     5640

agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg     5700

tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt     5760

acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc     5820

agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt     5880

actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc     5940

tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc     6000

gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa     6060

ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac     6120

tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa     6180

aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt     6240

tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa     6300

tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct     6360

gacgtcgacg gatcgggaga tctcccgatc ccctatggtg cactctcagt acaatctgct     6420

ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt     6480

agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga     6540

atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt     6600

gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc     6660

catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca     6720

acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga     6780

ctttccattg acgtcaatgg gtggactatt tacggtaaac tgcccacttg gcagtacatc     6840

aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct     6900

ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat     6960

tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc     7020

ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt     7080

ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa     7140

tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctctctgg ctaactagag     7200

aacccactgc ttactggctt atcgaaatta atacgactca ctatagggag acccaagctg     7260

gtttaaactt aagcttggta ccgagctcac tagtccagtg tggtggcaga tatccagcac     7320

agtggcggcc gctcgagtct agagggcccg ttttgcctgt actgggtctc tctggttaga     7380

ccagatctga gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata     7440

aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta     7500

gagatccctc agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg     7560

gacttgaaag cgaaagggaa accagaggag ctctctcgac gcaggactcg gcttgctgaa     7620

gcgcgcacgg caagaggcga ggggcggcga ctggtgagta cgccaaaaat tttgactagc     7680

ggaggctaga aggagagaga tgggtgcgag agcgtcagta ttaagcgggg gagaattaga     7740

tcgcgatggg aaaaaattcg gttaaggcca gggggaaaga aaaaatataa attaaaacat     7800

atagtatggg caagcaggga gctagaacga ttcgcagtta atcctggcct gttagaaaca     7860

tcagaaggct gtagacaaat actgggacag ctacaaccat cccttcagac aggatcagaa     7920

gaacttagat cattatataa tacagtagca accctctatt gtgtgcattt aattaactgg     7980

aatacgacaa gataacccgg atcgtgggcc tggatcagta cctggagagc gttaaaaaac     8040

acaaacggct ggatgtgtgc cgcgctaaaa tgggctatat gctgcagtga ataataaaat     8100

gtgtgtttgt ccgaaatacg cgttttgaga tttctgtcgc cgactaaatt catgtcgcgc     8160

gatagtggtg tttatcgccg atagagatgg cgatattgga aaaatcgata tttgaaaata     8220

tggcatattg aaaatgtcgc cgatgtgagt ttctgtgtaa ctgatatcgc catttttcca     8280

aaagtgattt ttgggcatac gcgatatctg gcgatagcgc ttatatcgtt tacgggggat     8340

ggcgatagac gactttggtg acttgggcga ttctgtgtgt cgcaaatatc gcagtttcga     8400

tataggtgac agacgatatg aggctatatc gccgatagag gcgacatcaa gctggcacat     8460

ggccaatgca tatcgatcta tacattgaat caatattggc cattagccat attattcatt     8520

ggttatatag cataaatcaa tattggctat tggccattgc atacgttgta tccatatcat     8580

aatatgtaca tttatattgg ctcatgtcca acattaccgc catgttgaca ttgattattg     8640

actagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc     8700

cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca     8760

ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt     8820

caatgggtgg agtatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg     8880

ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag     8940

tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt     9000

accatggtga tgcggttttg gcagtacatc aatgggcgtg gatagcggtt tgactcacgg     9060

ggatttccaa gtctccaccc cattgacgtc aatgggagtt tgttttggca ccaaaatcaa     9120

cgggactttc caaaatgtcg taacaactcc gccccattga cgcaaatggg cggtaggcgt     9180

gtacggtggg aggtctatat aagcagagct cgtttagtga accgtcagat cgcctggaga     9240

cgccatccac gctgttttga cctccataga agacaccggg accgatccag cctccgcggc     9300

cgggaacggt gcattggaac gcggattccc cgtgccaaga gtgacgtaag taccgcctat     9360

agagtctata ggcccacccc cttggcttct tatgcatgct atactgtttt tggcttgggg     9420

tctatacacc cccgcttcct catgttatag gtgatggtat agcttagcct ataggtgtgg     9480

gttattgacc attattgacc actcccctat tggtgacgat actttccatt actaatccat     9540

aacatggctc tttgccacaa ctctctttat tggctatatg ccaatacact gtccttcaga     9600

gactgacacg gactctgtat ttttacagga tggggtctca tttattattt acaaattcac     9660

atatacaaca ccaccgtccc cagtgcccgc agtttttatt aaacataacg tgggatctcc     9720

acgcgaatct cgggtacgtg ttccggacat gggctcttct ccggtagcgg cggagcttct     9780

acatccgagc cctgctccca tgcctccagc gactcatggt cgctcggcag ctccttgctc     9840

ctaacagtgg aggccagact taggcacagc acgatgccca ccaccaccag tgtgccgcac     9900

aaggccgtgg cggtagggta tgtgtctgaa aatgagctcg gggagcgggc ttgcaccgct     9960

gacgcatttg gaagacttaa ggcagcggca gaagaagatg caggcagctg agttgttgtg    10020

ttctgataag agtcagaggt aactcccgtt gcggtgctgt taacggtgga gggcagtgta    10080

gtctgagcag tactcgttgc tgccgcgcgc gccaccagac ataatagctg acagactaac    10140

agactgttcc tttccatggg tcttttctgc agtcaccgtc cttgacacgg ctagcatgga    10200

gtcctctgcc aagagaaaga tggaccctga taatcctgac gagggccctt cctccaaggt    10260

gccacggccc gagacacccg tgaccaaggc cacgacgttc ctgcagacta tgttgaggaa    10320

ggaggttaac agtcagctga gtctgggaga cccgctgttt ccagagttgg ccgaagaatc    10380

cctcaaaact tttgaacaag tgaccgagga ttgcaacgag aaccccgaga aagatgtcct    10440

ggcagaactc ggtgacatcc tcgcccaggc tgtcaatcat gccggtatcg attccagtag    10500

caccggcccc acgctgacaa cccactcttg cagcgttagc agcgcccctc ttaacaagcc    10560

gacccccacc agcgtcgcgg ttactaacac tcctctcccc ggggcatccg ctactcccga    10620

gctcagcccg cgtaagaaac cgcgcaaaac cacgcgtcct ttcaaggtga ttattaaacc    10680

gcccgtgcct cccgcgccta tcatgctgcc cctcatcaaa caggaagaca tcaagcccga    10740

gcccgacttt accatccagt accgcaacaa gattatcgat accgccggct gtatcgtgat    10800

ctctgatagc gaggaagaac agggtgaaga agtcgaaacc cgcggtgcta ccgcgtcttc    10860

cccttccacc ggcagcggca cgccgcgagt gacctctccc acgcacccgc tctcccagat    10920

gaaccaccct cctcttcccg atcccttggg ccggcccgat gaagatagtt cctcttcgtc    10980

ttcctcctcc tgcagttcgg cttcggactc ggagagtgag tccgaggaga tgaaatgcag    11040

cagtggcgga ggagcatccg tgacctcgag ccaccatggg cgcggcggtt ttggtggcgc    11100

ggcctcctcc tctctgctga gctgcggcca tcagagcagc ggcggggcga gcaccggacc    11160

ccgcaagaag aagagcaaac gcatctccga gttggacaac gagaaggtgc gcaatatcat    11220

gaaagataag aacaccccct tctgcacacc caacgtgcag actcggcggg gtcgcgtcaa    11280

gattgacgag gtgagccgca tgttccgcaa caccaatcgc tctcttgagt acaagaacct    11340

gcccttcacg attcccagta tgcaccaggt gttagatgag gccatcaaag cctgcaaaac    11400

catgcaggtg aacaacaagg gcatccagat tatctacacc cgcaatcatg aggtgaagag    11460

tgaggtggat gcggtgcggt gtcgcctggg caccatgtgc aacctggccc tctccactcc    11520

cttcctcatg gagcacacca tgcccgtgac acatccaccc gaagtggcgc agcgcacagc    11580

cgatgcttgt aacgaaggcg tcaaggccgc gtggagcctc aaagaattgc acacccacca    11640

attatgcccc cgttcctccg attaccgcaa catgatcatc cacgctgcca cccccgtgga    11700

cctgttgggc gctctcaacc tgtgcctgcc cctgatgcaa aagtttccca aacaggtcat    11760

ggtgcgcatc ttctccacca accagggtgg gttcatgctg cctatctacg agacggccgc    11820

gaaggcctac gccgtggggc agtttgagca gcccaccgag acccctcccg aagacctgga    11880

caccctgagc ctggccatcg aggcagccat ccaggacctg aggaacaagt ctcagtaag     11939


<210>  19
<211>  10343
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  19
ttaattaact ggaatacgac aagataaccc ggatcgtggg cctggatcag tacctggaga       60

gcgttaaaaa acacaaacgg ctggatgtgt gccgcgctaa aatgggctat atgctgcagt      120

gaataataaa atgtgtgttt gtccgaaata cgcgttttga gatttctgtc gccgactaaa      180

ttcatgtcgc gcgatagtgg tgtttatcgc cgatagagat ggcgatattg gaaaaatcga      240

tatttgaaaa tatggcatat tgaaaatgtc gccgatgtga gtttctgtgt aactgatatc      300

gccatttttc caaaagtgat ttttgggcat acgcgatatc tggcgatagc gcttatatcg      360

tttacggggg atggcgatag acgactttgg tgacttgggc gattctgtgt gtcgcaaata      420

tcgcagtttc gatataggtg acagacgata tgaggctata tcgccgatag aggcgacatc      480

aagctggcac atggccaatg catatcgatc tatacattga atcaatattg gccattagcc      540

atattattca ttggttatat agcataaatc aatattggct attggccatt gcatacgttg      600

tatccatatc ataatatgta catttatatt ggctcatgtc caacattacc gccatgttga      660

cattgattat tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca      720

tatatggagt tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac      780

gacccccgcc cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact      840

ttccattgac gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa      900

gtgtatcata tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg      960

cattatgccc agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta     1020

gtcatcgcta ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg     1080

tttgactcac ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg     1140

caccaaaatc aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg     1200

ggcggtaggc gtgtacggtg ggaggtctat ataagcagag ctcgtttagt gaaccgtcag     1260

atcgcctgga gacgccatcc acgctgtttt gacctccata gaagacaccg ggaccgatcc     1320

agcctccgcg gccgggaacg gtgcattgga acgcggattc cccgtgccaa gagtgacgta     1380

agtaccgcct atagagtcta taggcccacc cccttggctt cttatgcatg ctatactgtt     1440

tttggcttgg ggtctataca cccccgcttc ctcatgttat aggtgatggt atagcttagc     1500

ctataggtgt gggttattga ccattattga ccactcccct attggtgacg atactttcca     1560

ttactaatcc ataacatggc tctttgccac aactctcttt attggctata tgccaataca     1620

ctgtccttca gagactgaca cggactctgt atttttacag gatggggtct catttattat     1680

ttacaaattc acatatacaa caccaccgtc cccagtgccc gcagttttta ttaaacataa     1740

cgtgggatct ccacgcgaat ctcgggtacg tgttccggac atgggctctt ctccggtagc     1800

ggcggagctt ctacatccga gccctgctcc catgcctcca gcgactcatg gtcgctcggc     1860

agctccttgc tcctaacagt ggaggccaga cttaggcaca gcacgatgcc caccaccacc     1920

agtgtgccgc acaaggccgt ggcggtaggg tatgtgtctg aaaatgagct cggggagcgg     1980

gcttgcaccg ctgacgcatt tggaagactt aaggcagcgg cagaagaaga tgcaggcagc     2040

tgagttgttg tgttctgata agagtcagag gtaactcccg ttgcggtgct gttaacggtg     2100

gagggcagtg tagtctgagc agtactcgtt gctgccgcgc gcgccaccag acataatagc     2160

tgacagacta acagactgtt cctttccatg ggtcttttct gcagtcaccg tccttgacac     2220

ggctagcatg gtgagcaagg gcgaggagga taacatggcc atcatcaagg agttcatgcg     2280

cttcaaggtg cacatggagg gctccgtgaa cggccacgag ttcgagatcg agggcgaggg     2340

cgagggccgc ccctacgagg gcacccagac cgccaagctg aaggtgacca agggtggccc     2400

cctgcccttc gcctgggaca tcctgtcccc tcagttcatg tacggctcca aggcctacgt     2460

gaagcacccc gccgacatcc ccgactactt gaagctgtcc ttccccgagg gcttcaagtg     2520

ggagcgcgtg atgaacttcg aggacggcgg cgtggtgacc gtgacccagg actcctccct     2580

gcaggacggc gagttcatct acaaggtgaa gctgcgcggc accaacttcc cctccgacgg     2640

ccccgtaatg cagaagaaga ccatgggctg ggaggcctcc tccgagcgga tgtaccccga     2700

ggacggcgcc ctgaagggcg agatcaagca gaggctgaag ctgaaggacg gcggccacta     2760

cgacgctgag gtcaagacca cctacaaggc caagaagccc gtgcagctgc ccggcgccta     2820

caacgtcaac atcaagttgg acatcacctc ccacaacgag gactacacca tcgtggaaca     2880

gtacgaacgc gccgagggcc gccactccac cggcggcatg gacgagctgt acaagtaagg     2940

atccgcccct ctccctcccc cccccctaac gttactggcc gaagccgctt ggaataaggc     3000

cggtgtgcgt ttgtctatat gttattttcc accatattgc cgtcttttgg caatgtgagg     3060

gcccggaaac ctggccctgt cttcttgacg agcattccta ggggtctttc ccctctcgcc     3120

aaaggaatgc aaggtctgtt gaatgtcgtg aaggaagcag ttcctctgga agcttcttga     3180

agacaaacaa cgtctgtagc gaccctttgc aggcagcgga accccccacc tggcgacagg     3240

tgcctctgcg gccaaaagcc acgtgtataa gatacacctg caaaggcggc acaaccccag     3300

tgccacgttg tgagttggat agttgtggaa agagtcaaat ggctctcctc aagcgtattc     3360

aacaaggggc tgaaggatgc ccagaaggta ccccattgta tgggatctga tctggggcct     3420

cggtgcacat gctttacatg tgtttagtcg aggttaaaaa aacgtctagg ccccccgaac     3480

cacggggacg tggttttcct ttgaaaaaca cgatgataat atggccacaa ccatggtgag     3540

caagggcgag gagctgttca ccggggtggt gcccatcctg gtcgagctgg acggcgacgt     3600

aaacggccac aagttcagcg tgtccggcga gggcgagggc gatgccacct acggcaagct     3660

gaccctgaag ttcatctgca ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac     3720

caccctgacc tacggcgtgc agtgcttcag ccgctacccc gaccacatga agcagcacga     3780

cttcttcaag tccgccatgc ccgaaggcta cgtccaggag cgcaccatct tcttcaagga     3840

cgacggcaac tacaagaccc gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg     3900

catcgagctg aagggcatcg acttcaagga ggacggcaac atcctggggc acaagctgga     3960

gtacaactac aacagccaca acgtctatat catggccgac aagcagaaga acggcatcaa     4020

ggtgaacttc aagatccgcc acaacatcga ggacggcagc gtgcagctcg ccgaccacta     4080

ccagcagaac acccccatcg gcgacggccc cgtgctgctg cccgacaacc actacctgag     4140

cacccagtcc gccctgagca aagaccccaa cgagaagcgc gatcacatgg tcctgctgga     4200

gttcgtgacc gccgccggga tcactctcgg catggacgag ctgtacaaga gcagcctgag     4260

gcctcctaag aagaagagga aggtttgacc tgcaggtttg tcgagaccta gaaaaacatg     4320

gagcaatcac aagtagcaat acagcagcta ccaatgctga ttgtgcctgg ctagaagcac     4380

aagaggagga ggaggtgggt tttccagtca cacctcaggt acctttaaga ccaatgactt     4440

acaaggcagc tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa     4500

ttcactccca acgaagacaa gatctgcttt ttgcttgtac tgggtctctc tggttagacc     4560

agatctgagc ctgggagctc tctggctaac tagggaaccc actgcttaag cctcaataaa     4620

gcttgccttg agtgcttcaa gtagtgtgtg cccgtctgtt gtgtgactct ggtaactaga     4680

gatccctcag acccttttag tcagtgtgga aaatctctag cagggcccgt ttaaacccgc     4740

tgatcagcct cgactgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg     4800

ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt     4860

gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc     4920

aagggggagg attgggaaga caatagcagg catgctgggg atgcggtggg ctctatggct     4980

tctgaggcgg aaagaaccag ctggggctct agggggtatc cccacgcgcc ctgtagcggc     5040

gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc     5100

ctagcgcccg ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc     5160

cgtcaagctc taaatcgggg catcccttta gggttccgat ttagtgcttt acggcacctc     5220

gaccccaaaa aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg     5280

gtttttcgcc ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact     5340

ggaacaacac tcaaccctat ctcggtctat tcttttgatt tataagggat tttggggatt     5400

tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttaattctgt     5460

ggaatgtgtg tcagttaggg tgtggaaagt ccccaggctc cccaggcagg cagaagtatg     5520

caaagcatgc atctcaatta gtcagcaacc aggtgtggaa agtccccagg ctccccagca     5580

ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa ccatagtccc gcccctaact     5640

ccgcccatcc cgcccctaac tccgcccagt tccgcccatt ctccgcccca tggctgacta     5700

atttttttta tttatgcaga ggccgaggcc gcctctgcct ctgagctatt ccagaagtag     5760

tgaggaggct tttttggagg cctaggcttt tgcaaaaagc tcccgggagc ttgtatatcc     5820

attttcggat ctgatcagca cgtgttgaca attaatcatc ggcatagtat atcggcatag     5880

tataatacga caaggtgagg aactaaacca tggccaagtt gaccagtgcc gttccggtgc     5940

tcaccgcgcg cgacgtcgcc ggagcggtcg agttctggac cgaccggctc gggttctccc     6000

gggacttcgt ggaggacgac ttcgccggtg tggtccggga cgacgtgacc ctgttcatca     6060

gcgcggtcca ggaccaggtg gtgccggaca acaccctggc ctgggtgtgg gtgcgcggcc     6120

tggacgagct gtacgccgag tggtcggagg tcgtgtccac gaacttccgg gacgcctccg     6180

ggccggccat gaccgagatc ggcgagcagc cgtgggggcg ggagttcgcc ctgcgcgacc     6240

cggccggcaa ctgcgtgcac ttcgtggccg aggagcagga ctgacacgtg ctacgagatt     6300

tcgattccac cgccgccttc tatgaaaggt tgggcttcgg aatcgttttc cgggacgccg     6360

gctggatgat cctccagcgc ggggatctca tgctggagtt cttcgcccac cccaacttgt     6420

ttattgcagc ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag     6480

catttttttc actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg     6540

tctgtatacc gtcgacctct agctagagct tggcgtaatc atggtcatag ctgtttcctg     6600

tgtgaaattg ttatccgctc acaattccac acaacatacg agccggaagc ataaagtgta     6660

aagcctgggg tgcctaatga gtgagctaac tcacattaat tgcgttgcgc tcactgcccg     6720

ctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga     6780

gaggcggttt gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg     6840

tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag     6900

aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc     6960

gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca     7020

aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt     7080

ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc     7140

tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca atgctcacgc tgtaggtatc     7200

tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc     7260

ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact     7320

tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg     7380

ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca gtatttggta     7440

tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca     7500

aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa     7560

aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg     7620

aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc     7680

ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg     7740

acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat     7800

ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg     7860

gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa     7920

taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca     7980

tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc     8040

gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt     8100

cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa     8160

aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat     8220

cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct     8280

tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga     8340

gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag     8400

tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga     8460

gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca     8520

ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg     8580

cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc     8640

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag     8700

gggttccgcg cacatttccc cgaaaagtgc cacctgacgt cgacggatcg ggagatctcc     8760

cgatccccta tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagta     8820

tctgctccct gcttgtgtgt tggaggtcgc tgagtagtgc gcgagcaaaa tttaagctac     8880

aacaaggcaa ggcttgaccg acaattgcat gaagaatctg cttagggtta ggcgttttgc     8940

gctgcttcgc gatgtacggg ccagatatac gcgttgacat tgattattga ctagttatta     9000

atagtaatca attacggggt cattagttca tagcccatat atggagttcc gcgttacata     9060

acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat tgacgtcaat     9120

aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc aatgggtgga     9180

ctatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc caagtacgcc     9240

ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt acatgacctt     9300

atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta ccatggtgat     9360

gcggttttgg cagtacatca atgggcgtgg atagcggttt gactcacggg gatttccaag     9420

tctccacccc attgacgtca atgggagttt gttttggcac caaaatcaac gggactttcc     9480

aaaatgtcgt aacaactccg ccccattgac gcaaatgggc ggtaggcgtg tacggtggga     9540

ggtctatata agcagagctc tctggctaac tagagaaccc actgcttact ggcttatcga     9600

aattaatacg actcactata gggagaccca agctggttta aacttaagct tggtaccgag     9660

ctcactagtc cagtgtggtg gcagatatcc agcacagtgg cggccgctcg agtctagagg     9720

gcccgttttg cctgtactgg gtctctctgg ttagaccaga tctgagcctg ggagctctct     9780

ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt gcttcaagta     9840

gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc cttttagtca     9900

gtgtggaaaa tctctagcag tggcgcccga acagggactt gaaagcgaaa gggaaaccag     9960

aggagctctc tcgacgcagg actcggcttg ctgaagcgcg cacggcaaga ggcgaggggc    10020

ggcgactggt gagtacgcca aaaattttga ctagcggagg ctagaaggag agagatgggt    10080

gcgagagcgt cagtattaag cgggggagaa ttagatcgcg atgggaaaaa attcggttaa    10140

ggccaggggg aaagaaaaaa tataaattaa aacatatagt atgggcaagc agggagctag    10200

aacgattcgc agttaatcct ggcctgttag aaacatcaga aggctgtaga caaatactgg    10260

gacagctaca accatccctt cagacaggat cagaagaact tagatcatta tataatacag    10320

tagcaaccct ctattgtgtg cat                                            10343


<210>  20
<211>  11369
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  20
gatccagcag cctgaggcct cctaagaaga agaggaaggt ttgagaattc gcccctctcc       60

ctcccccccc cctaacgtta ctggccgaag ccgcttggaa taaggccggt gtgcgtttgt      120

ctatatgtta ttttccacca tattgccgtc ttttggcaat gtgagggccc ggaaacctgg      180

ccctgtcttc ttgacgagca ttcctagggg tctttcccct ctcgccaaag gaatgcaagg      240

tctgttgaat gtcgtgaagg aagcagttcc tctggaagct tcttgaagac aaacaacgtc      300

tgtagcgacc ctttgcaggc agcggaaccc cccacctggc gacaggtgcc tctgcggcca      360

aaagccacgt gtataagata cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag      420

ttggatagtt gtggaaagag tcaaatggct ctcctcaagc gtattcaaca aggggctgaa      480

ggatgcccag aaggtacccc attgtatggg atctgatctg gggcctcggt acacatgctt      540

tacatgtgtt tagtcgaggt taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt      600

tttcctttga aaaacacgat gataatatgg ccacaaccat ggagtcctct gccaagagaa      660

agatggaccc tgataatcct gacgagggcc cttcctccaa ggtgccacgg cccgagacac      720

ccgtgaccaa ggccacgacg ttcctgcaga ctatgttgag gaaggaggtt aacagtcagc      780

tgagtctggg agacccgctg tttccagagt tggccgaaga atccctcaaa acttttgaac      840

gagtgaccga ggattgcaac gagaaccccg agaaagatgt cctggcagaa ctcggtgaca      900

tcctcgccca ggctgtcaat catgccggta tcgattccag tagcaccggc cccacgctga      960

caacccactc ttgcagcgtt agcagcgccc ctcttaacaa gccgaccccc accagcgtcg     1020

cggttactaa cactcctctc cccggggcat ccgctactcc cgagctcagc ccgcgtaaga     1080

aaccgcgcaa aaccacgcgt cctttcaagg tgattattaa accgcccgtg cctcccgcgc     1140

ctatcatgct gcccctcatc aaacaggaag acatcaagcc cgagcccgac tttaccatcc     1200

agtaccgcaa caagattatc gataccgccg gctgtatcgt gatctctgat agcgaggaag     1260

aacagggtga agaagtcgaa acccgcggtg ctaccgcgtc ttccccttcc accggcagcg     1320

gcacgccgcg agtgacctct cccacgcacc cgctctccca gataaaccac cctcctcttc     1380

ccgatccctt gggccggccc gatgaagata gttcctcttc gtcttcctcc tgcagttcgg     1440

cttcggactc ggagagtgag tccgaggaga tgaaatgcag cagtggcgga ggagcatccg     1500

tgacctcgag ccaccatggg cgcggcggtt ttggtggcgc ggcctcctcc tctctgctga     1560

gctgcggcca tcagagcagc ggcggggcga gcaccggacc ccgcaagaag aagagcaaac     1620

gcatctccga gttggacaac gagaaggtgc gcaatatcat gaaagataag aacaccccct     1680

tctgcacacc caacgtgcag actcggcggg gtcgcgtcaa gattgacgag gtgagccgca     1740

tgttccgcaa caccaatcgc tctcttgagt acaagaacct gcccttcacg attcccagta     1800

tgcaccaggt gttagatgag gccatcaaag cctgcaaaac catgcaggtg aacaacaagg     1860

gcatccagat tatctacacc cgcaatcatg aggtgaagag tgaggtggat gcggtgcggt     1920

gtcgcctggg caccatgtgc aacctggccc tctccactcc cttcctcatg gagcacacca     1980

tgcccgtgac acatccaccc gaagtggcgc agcgcacagc cgatacttgt aacgaaggcg     2040

tcaaggccgc gtggagcctc aaagaattgc acacccacca attatgcccc cgttcctccg     2100

attaccgcaa catgatcatc cacgctgcca cccccgtgga cctgttgggc gctctcaacc     2160

tgtgcctgcc cctgatgcaa aagtttccca aacaggtcat ggtgcgcatc ttctccacca     2220

accagggtgg gttcatgctg cctatctacg agacggccgc gaaggcctac gccgtggggc     2280

agtttgagca gcccaccgag acccctcccg aagacctgga caccctgagc ctggccatcg     2340

aggcagccat ccaggacctg aggaacaagt ctcagtaacc tgcaggtttg tcgagaccta     2400

gaaaaacatg gagcaatcac aagtagcaat acagcagcta ccaatgctga ttgtgcctgg     2460

ctagaagcac aagaggagga ggaggtgggt tttccagtca cacctcaggt acctttaaga     2520

ccaatgactt acaaggcagc tgtagatctt agccactttt taaaagaaaa ggggggactg     2580

gaagggctaa ttcactccca acgaagacaa gatctgcttt ttgcttgtac tgggtctctc     2640

tggttagacc agatctgagc ctgggagctc tctggctaac tagggaaccc actgcttaag     2700

cctcaataaa gcttgccttg agtgcttcaa gtagtgtgtg cccgtctgtt gtgtgactct     2760

ggtaactaga gatccctcag acccttttag tcagtgtgga aaatctctag cagggcccgt     2820

ttaaacccgc tgatcagcct cgactgtgcc ttctagttgc cagccatctg ttgtttgccc     2880

ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa     2940

tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg     3000

gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg atgcggtggg     3060

ctctatggct tctgaggcgg aaagaaccag ctggggctct agggggtatc cccacgcgcc     3120

ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga ccgctacact     3180

tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg ccacgttcgc     3240

cggctttccc cgtcaagctc taaatcgggg catcccttta gggttccgat ttagtgcttt     3300

acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg ggccatcgcc     3360

ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata gtggactctt     3420

gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt tataagggat     3480

tttggggatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa     3540

ttaattctgt ggaatgtgtg tcagttaggg tgtggaaagt ccccaggctc cccaggcagg     3600

cagaagtatg caaagcatgc atctcaatta gtcagcaacc aggtgtggaa agtccccagg     3660

ctccccagca ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa ccatagtccc     3720

gcccctaact ccgcccatcc cgcccctaac tccgcccagt tccgcccatt ctccgcccca     3780

tggctgacta atttttttta tttatgcaga ggccgaggcc gcctctgcct ctgagctatt     3840

ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcaaaaagc tcccgggagc     3900

ttgtatatcc attttcggat ctgatcagca cgtgttgaca attaatcatc ggcatagtat     3960

atcggcatag tataatacga caaggtgagg aactaaacca tggccaagtt gaccagtgcc     4020

gttccggtgc tcaccgcgcg cgacgtcgcc ggagcggtcg agttctggac cgaccggctc     4080

gggttctccc gggacttcgt ggaggacgac ttcgccggtg tggtccggga cgacgtgacc     4140

ctgttcatca gcgcggtcca ggaccaggtg gtgccggaca acaccctggc ctgggtgtgg     4200

gtgcgcggcc tggacgagct gtacgccgag tggtcggagg tcgtgtccac gaacttccgg     4260

gacgcctccg ggccggccat gaccgagatc ggcgagcagc cgtgggggcg ggagttcgcc     4320

ctgcgcgacc cggccggcaa ctgcgtgcac ttcgtggccg aggagcagga ctgacacgtg     4380

ctacgagatt tcgattccac cgccgccttc tatgaaaggt tgggcttcgg aatcgttttc     4440

cgggacgccg gctggatgat cctccagcgc ggggatctca tgctggagtt cttcgcccac     4500

cccaacttgt ttattgcagc ttataatggt tacaaataaa gcaatagcat cacaaatttc     4560

acaaataaag catttttttc actgcattct agttgtggtt tgtccaaact catcaatgta     4620

tcttatcatg tctgtatacc gtcgacctct agctagagct tggcgtaatc atggtcatag     4680

ctgtttcctg tgtgaaattg ttatccgctc acaattccac acaacatacg agccggaagc     4740

ataaagtgta aagcctgggg tgcctaatga gtgagctaac tcacattaat tgcgttgcgc     4800

tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa     4860

cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct cactgactcg     4920

ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg     4980

ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag     5040

gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac     5100

gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga     5160

taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt     5220

accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca atgctcacgc     5280

tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc     5340

cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta     5400

agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat     5460

gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca     5520

gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct     5580

tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt     5640

acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct     5700

cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc     5760

acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa     5820

acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta     5880

tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc     5940

ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat     6000

ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta     6060

tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt     6120

aatagtttgc gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt     6180

ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg     6240

ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc     6300

gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc     6360

gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg     6420

cggcgaccga gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga     6480

actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta     6540

ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct     6600

tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag     6660

ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga     6720

agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat     6780

aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt cgacggatcg     6840

ggagatctcc cgatccccta tggtgcactc tcagtacaat ctgctctgat gccgcatagt     6900

taagccagta tctgctccct gcttgtgtgt tggaggtcgc tgagtagtgc gcgagcaaaa     6960

tttaagctac aacaaggcaa ggcttgaccg acaattgcat gaagaatctg cttagggtta     7020

ggcgttttgc gctgcttcgc gatgtacggg ccagatatac gcgttgacat tgattattga     7080

ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc     7140

gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat     7200

tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc     7260

aatgggtgga ctatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc     7320

caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt     7380

acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta     7440

ccatggtgat gcggttttgg cagtacatca atgggcgtgg atagcggttt gactcacggg     7500

gatttccaag tctccacccc attgacgtca atgggagttt gttttggcac caaaatcaac     7560

gggactttcc aaaatgtcgt aacaactccg ccccattgac gcaaatgggc ggtaggcgtg     7620

tacggtggga ggtctatata agcagagctc tctggctaac tagagaaccc actgcttact     7680

ggcttatcga aattaatacg actcactata gggagaccca agctggttta aacttaagct     7740

tggtaccgag ctcactagtc cagtgtggtg gcagatatcc agcacagtgg cggccgctcg     7800

agtctagagg gcccgttttg cctgtactgg gtctctctgg ttagaccaga tctgagcctg     7860

ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt     7920

gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc     7980

cttttagtca gtgtggaaaa tctctagcag tggcgcccga acagggactt gaaagcgaaa     8040

gggaaaccag aggagctctc tcgacgcagg actcggcttg ctgaagcgcg cacggcaaga     8100

ggcgaggggc ggcgactggt gagtacgcca aaaattttga ctagcggagg ctagaaggag     8160

agagatgggt gcgagagcgt cagtattaag cgggggagaa ttagatcgcg atgggaaaaa     8220

attcggttaa ggccaggggg aaagaaaaaa tataaattaa aacatatagt atgggcaagc     8280

agggagctag aacgattcgc agttaatcct ggcctgttag aaacatcaga aggctgtaga     8340

caaatactgg gacagctaca accatccctt cagacaggat cagaagaact tagatcatta     8400

tataatacag tagcaaccct ctattgtgtg catttaatta actggaatac gacaagataa     8460

cccggatcgt gggcctggat cagtacctgg agagcgttaa aaaacacaaa cggctggatg     8520

tgtgccgcgc taaaatgggc tatatgctgc agtgaataat aaaatgtgtg tttgtccgaa     8580

atacgcgttt tgagatttct gtcgccgact aaattcatgt cgcgcgatag tggtgtttat     8640

cgccgataga gatggcgata ttggaaaaat cgatatttga aaatatggca tattgaaaat     8700

gtcgccgatg tgagtttctg tgtaactgat atcgccattt ttccaaaagt gatttttggg     8760

catacgcgat atctggcgat agcgcttata tcgtttacgg gggatggcga tagacgactt     8820

tggtgacttg ggcgattctg tgtgtcgcaa atatcgcagt ttcgatatag gtgacagacg     8880

atatgaggct atatcgccga tagaggcgac atcaagctgg cacatggcca atgcatatcg     8940

atctatacat tgaatcaata ttggccatta gccatattat tcattggtta tatagcataa     9000

atcaatattg gctattggcc attgcatacg ttgtatccat atcataatat gtacatttat     9060

attggctcat gtccaacatt accgccatgt tgacattgat tattgactag ttattaatag     9120

taatcaatta cggggtcatt agttcatagc ccatatatgg agttccgcgt tacataactt     9180

acggtaaatg gcccgcctgg ctgaccgccc aacgaccccc gcccattgac gtcaataatg     9240

acgtatgttc ccatagtaac gccaataggg actttccatt gacgtcaatg ggtggagtat     9300

ttacggtaaa ctgcccactt ggcagtacat caagtgtatc atatgccaag tacgccccct     9360

attgacgtca atgacggtaa atggcccgcc tggcattatg cccagtacat gaccttatgg     9420

gactttccta cttggcagta catctacgta ttagtcatcg ctattaccat ggtgatgcgg     9480

ttttggcagt acatcaatgg gcgtggatag cggtttgact cacggggatt tccaagtctc     9540

caccccattg acgtcaatgg gagtttgttt tggcaccaaa atcaacggga ctttccaaaa     9600

tgtcgtaaca actccgcccc attgacgcaa atgggcggta ggcgtgtacg gtgggaggtc     9660

tatataagca gagctcgttt agtgaaccgt cagatcgcct ggagacgcca tccacgctgt     9720

tttgacctcc atagaagaca ccgggaccga tccagcctcc gcggccggga acggtgcatt     9780

ggaacgcgga ttccccgtgc caagagtgac gtaagtaccg cctatagagt ctataggccc     9840

acccccttgg cttcttatgc atgctatact gtttttggct tggggtctat acacccccgc     9900

ttcctcatgt tataggtgat ggtatagctt agcctatagg tgtgggttat tgaccattat     9960

tgaccactcc cctattggtg acgatacttt ccattactaa tccataacat ggctctttgc    10020

cacaactctc tttattggct atatgccaat acactgtcct tcagagactg acacggactc    10080

tgtattttta caggatgggg tctcatttat tatttacaaa ttcacatata caacaccacc    10140

gtccccagtg cccgcagttt ttattaaaca taacgtggga tctccacgcg aatctcgggt    10200

acgtgttccg gacatgggct cttctccggt agcggcggag cttctacatc cgagccctgc    10260

tcccatgcct ccagcgactc atggtcgctc ggcagctcct tgctcctaac agtggaggcc    10320

agacttaggc acagcacgat gcccaccacc accagtgtgc cgcacaaggc cgtggcggta    10380

gggtatgtgt ctgaaaatga gctcggggag cgggcttgca ccgctgacgc atttggaaga    10440

cttaaggcag cggcagaaga agatgcaggc agctgagttg ttgtgttctg ataagagtca    10500

gaggtaactc ccgttgcggt gctgttaacg gtggagggca gtgtagtctg agcagtactc    10560

gttgctgccg cgcgcgccac cagacataat agctgacaga ctaacagact gttcctttcc    10620

atgggtcttt tctgcagtca ccgtccttga cacggctagc atggtgagca agggcgagga    10680

ggataacatg gccatcatca aggagttcat gcgcttcaag gtgcacatgg agggctccgt    10740

gaacggccac gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca    10800

gaccgccaag ctgaaggtga ccaagggtgg ccccctgccc ttcgcctggg acatcctgtc    10860

ccctcagttc atgtacggct ccaaggccta cgtgaagcac cccgccgaca tccccgacta    10920

cttgaagctg tccttccccg agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg    10980

cggcgtggtg accgtgaccc aggactcctc cctgcaggac ggcgagttca tctacaaggt    11040

gaagctgcgc ggcaccaact tcccctccga cggccccgta atgcagaaga agaccatggg    11100

ctgggaggcc tcctccgagc ggatgtaccc cgaggacggc gccctgaagg gcgagatcaa    11160

gcagaggctg aagctgaagg acggcggcca ctacgacgct gaggtcaaga ccacctacaa    11220

ggccaagaag cccgtgcagc tgcccggcgc ctacaacgtc aacatcaagt tggacatcac    11280

ctcccacaac gaggactaca ccatcgtgga acagtacgaa cgcgccgagg gccgccactc    11340

caccggcggc atggacgagc tgtacaagg                                      11369


<210>  21
<211>  11
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  21

Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg 
1               5                   10      


<210>  22
<211>  12
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  22

Arg Arg Gln Arg Arg Thr Ser Lys Leu Met Lys Arg 
1               5                   10          


<210>  23
<211>  27
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  23

Gly Trp Thr Leu Asn Ser Ala Gly Tyr Leu Leu Gly Lys Ile Asn Leu 
1               5                   10                  15      


Lys Ala Leu Ala Ala Leu Ala Lys Lys Ile Leu 
            20                  25          


<210>  24
<211>  33
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  24

Lys Ala Leu Ala Trp Glu Ala Lys Leu Ala Lys Ala Leu Ala Lys Ala 
1               5                   10                  15      


Leu Ala Lys His Leu Ala Lys Ala Leu Ala Lys Ala Leu Lys Cys Glu 
            20                  25                  30          


Ala 
    


<210>  25
<211>  16
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  25

Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg Arg Met Lys Trp Lys Lys 
1               5                   10                  15      


<210>  26
<211>  9
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  26

Arg Lys Lys Arg Arg Gln Arg Arg Arg 
1               5                   


<210>  27
<211>  11
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  27

Tyr Ala Arg Ala Ala Ala Arg Gln Ala Arg Ala 
1               5                   10      


<210>  28
<211>  11
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  28

Thr His Arg Leu Pro Arg Arg Arg Arg Arg Arg 
1               5                   10      


<210>  29
<211>  11
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  29

Gly Gly Arg Arg Ala Arg Arg Arg Arg Arg Arg 
1               5                   10      


<210>  30
<211>  580
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  30

Met Glu Ser Ser Ala Lys Arg Lys Met Asp Pro Asp Asn Pro Asp Glu 
1               5                   10                  15      


Gly Pro Ser Ser Lys Val Pro Arg Pro Glu Thr Pro Val Thr Lys Ala 
            20                  25                  30          


Thr Thr Phe Leu Gln Thr Met Leu Arg Lys Glu Val Asn Ser Gln Leu 
        35                  40                  45              


Ser Leu Gly Asp Pro Leu Phe Pro Glu Leu Ala Glu Glu Ser Leu Lys 
    50                  55                  60                  


Thr Phe Glu Gln Val Thr Glu Asp Cys Asn Glu Asn Pro Glu Lys Asp 
65                  70                  75                  80  


Val Leu Ala Glu Leu Gly Asp Ile Leu Ala Gln Ala Val Asn His Ala 
                85                  90                  95      


Gly Ile Asp Ser Ser Ser Thr Gly Pro Thr Leu Thr Thr His Ser Cys 
            100                 105                 110         


Ser Val Ser Ser Ala Pro Leu Asn Lys Pro Thr Pro Thr Ser Val Ala 
        115                 120                 125             


Val Thr Asn Thr Pro Leu Pro Gly Ala Ser Ala Thr Pro Glu Leu Ser 
    130                 135                 140                 


Pro Arg Lys Lys Pro Arg Lys Thr Thr Arg Pro Phe Lys Val Ile Ile 
145                 150                 155                 160 


Lys Pro Pro Val Pro Pro Ala Pro Ile Met Leu Pro Leu Ile Lys Gln 
                165                 170                 175     


Glu Asp Ile Lys Pro Glu Pro Asp Phe Thr Ile Gln Tyr Arg Asn Lys 
            180                 185                 190         


Ile Ile Asp Thr Ala Gly Cys Ile Val Ile Ser Asp Ser Glu Glu Glu 
        195                 200                 205             


Gln Gly Glu Glu Val Glu Thr Arg Gly Ala Thr Ala Ser Ser Pro Ser 
    210                 215                 220                 


Thr Gly Ser Gly Thr Pro Arg Val Thr Ser Pro Thr His Pro Leu Ser 
225                 230                 235                 240 


Gln Met Asn His Pro Pro Leu Pro Asp Pro Leu Gly Arg Pro Asp Glu 
                245                 250                 255     


Asp Ser Ser Ser Ser Ser Ser Ser Ser Cys Ser Ser Ala Ser Asp Ser 
            260                 265                 270         


Glu Ser Glu Ser Glu Glu Met Lys Cys Ser Ser Gly Gly Gly Ala Ser 
        275                 280                 285             


Val Thr Ser Ser His His Gly Arg Gly Gly Phe Gly Gly Ala Ala Ser 
    290                 295                 300                 


Ser Ser Leu Leu Ser Cys Gly His Gln Ser Ser Gly Gly Ala Ser Thr 
305                 310                 315                 320 


Gly Pro Arg Lys Lys Lys Ser Lys Arg Ile Ser Glu Leu Asp Asn Glu 
                325                 330                 335     


Lys Val Arg Asn Ile Met Lys Asp Lys Asn Thr Pro Phe Cys Thr Pro 
            340                 345                 350         


Asn Val Gln Thr Arg Arg Gly Arg Val Lys Ile Asp Glu Val Ser Arg 
        355                 360                 365             


Met Phe Arg Asn Thr Asn Arg Ser Leu Glu Tyr Lys Asn Leu Pro Phe 
    370                 375                 380                 


Thr Ile Pro Ser Met His Gln Val Leu Asp Glu Ala Ile Lys Ala Cys 
385                 390                 395                 400 


Lys Thr Met Gln Val Asn Asn Lys Gly Ile Gln Ile Ile Tyr Thr Arg 
                405                 410                 415     


Asn His Glu Val Lys Ser Glu Val Asp Ala Val Arg Cys Arg Leu Gly 
            420                 425                 430         


Thr Met Cys Asn Leu Ala Leu Ser Thr Pro Phe Leu Met Glu His Thr 
        435                 440                 445             


Met Pro Val Thr His Pro Pro Glu Val Ala Gln Arg Thr Ala Asp Ala 
    450                 455                 460                 


Cys Asn Glu Gly Val Lys Ala Ala Trp Ser Leu Lys Glu Leu His Thr 
465                 470                 475                 480 


His Gln Leu Cys Pro Arg Ser Ser Asp Tyr Arg Asn Met Ile Ile His 
                485                 490                 495     


Ala Ala Thr Pro Val Asp Leu Leu Gly Ala Leu Asn Leu Cys Leu Pro 
            500                 505                 510         


Leu Met Gln Lys Phe Pro Lys Gln Val Met Val Arg Ile Phe Ser Thr 
        515                 520                 525             


Asn Gln Gly Gly Phe Met Leu Pro Ile Tyr Glu Thr Ala Ala Lys Ala 
    530                 535                 540                 


Tyr Ala Val Gly Gln Phe Glu Gln Pro Thr Glu Thr Pro Pro Glu Asp 
545                 550                 555                 560 


Leu Asp Thr Leu Ser Leu Ala Ile Glu Ala Ala Ile Gln Asp Leu Arg 
                565                 570                 575     


Asn Lys Ser Gln 
            580 


<210>  31
<211>  131
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  31

Met Cys Gln Leu Asp Val Ala Ser Ile Gly Asp Ile Ala Ser Tyr Arg 
1               5                   10                  15      


Leu Ser Pro Ile Ser Lys Leu Arg Tyr Leu Arg His Thr Glu Ser Pro 
            20                  25                  30          


Lys Ser Pro Lys Ser Ser Ile Ala Ile Pro Arg Lys Arg Tyr Lys Arg 
        35                  40                  45              


Tyr Arg Gln Ile Ser Arg Met Pro Lys Asn His Phe Trp Lys Asn Gly 
    50                  55                  60                  


Asp Ile Ser Tyr Thr Glu Thr His Ile Gly Asp Ile Phe Asn Met Pro 
65                  70                  75                  80  


Tyr Phe Gln Ile Ser Ile Phe Pro Ile Ser Pro Ser Leu Ser Ala Ile 
                85                  90                  95      


Asn Thr Thr Ile Ala Arg His Glu Phe Ser Arg Arg Gln Lys Ser Gln 
            100                 105                 110         


Asn Ala Tyr Phe Gly Gln Thr His Ile Leu Leu Phe Thr Ala Ala Tyr 
        115                 120                 125             


Ser Pro Phe 
    130     


<210>  32
<211>  124
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  32

Met Ala Lys Leu Thr Ser Ala Val Pro Val Leu Thr Ala Arg Asp Val 
1               5                   10                  15      


Ala Gly Ala Val Glu Phe Trp Thr Asp Arg Leu Gly Phe Ser Arg Asp 
            20                  25                  30          


Phe Val Glu Asp Asp Phe Ala Gly Val Val Arg Asp Asp Val Thr Leu 
        35                  40                  45              


Phe Ile Ser Ala Val Gln Asp Gln Val Val Pro Asp Asn Thr Leu Ala 
    50                  55                  60                  


Trp Val Trp Val Arg Gly Leu Asp Glu Leu Tyr Ala Glu Trp Ser Glu 
65                  70                  75                  80  


Val Val Ser Thr Asn Phe Arg Asp Ala Ser Gly Pro Ala Met Thr Glu 
                85                  90                  95      


Ile Gly Glu Gln Pro Trp Gly Arg Glu Phe Ala Leu Arg Asp Pro Ala 
            100                 105                 110         


Gly Asn Cys Val His Phe Val Ala Glu Glu Gln Asp 
        115                 120                 


<210>  33
<211>  286
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  33

Met Ser Ile Gln His Phe Arg Val Ala Leu Ile Pro Phe Phe Ala Ala 
1               5                   10                  15      


Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr Leu Val Lys Val Lys 
            20                  25                  30          


Asp Ala Glu Asp Gln Leu Gly Ala Arg Val Gly Tyr Ile Glu Leu Asp 
        35                  40                  45              


Leu Asn Ser Gly Lys Ile Leu Glu Ser Phe Arg Pro Glu Glu Arg Phe 
    50                  55                  60                  


Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys Gly Ala Val Leu Ser 
65                  70                  75                  80  


Arg Ile Asp Ala Gly Gln Glu Gln Leu Gly Arg Arg Ile His Tyr Ser 
                85                  90                  95      


Gln Asn Asp Leu Val Glu Tyr Ser Pro Val Thr Glu Lys His Leu Thr 
            100                 105                 110         


Asp Gly Met Thr Val Arg Glu Leu Cys Ser Ala Ala Ile Thr Met Ser 
        115                 120                 125             


Asp Asn Thr Ala Ala Asn Leu Leu Leu Thr Thr Ile Gly Gly Pro Lys 
    130                 135                 140                 


Glu Leu Thr Ala Phe Leu His Asn Met Gly Asp His Val Thr Arg Leu 
145                 150                 155                 160 


Asp Arg Trp Glu Pro Glu Leu Asn Glu Ala Ile Pro Asn Asp Glu Arg 
                165                 170                 175     


Asp Thr Thr Met Pro Val Ala Met Ala Thr Thr Leu Arg Lys Leu Leu 
            180                 185                 190         


Thr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gln Gln Leu Ile Asp Trp 
        195                 200                 205             


Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu Arg Ser Ala Leu Pro 
    210                 215                 220                 


Ala Gly Trp Phe Ile Ala Asp Lys Ser Gly Ala Gly Glu Arg Gly Ser 
225                 230                 235                 240 


Arg Gly Ile Ile Ala Ala Leu Gly Pro Asp Gly Lys Pro Ser Arg Ile 
                245                 250                 255     


Val Val Ile Tyr Thr Thr Gly Ser Gln Ala Thr Met Asp Glu Arg Asn 
            260                 265                 270         


Arg Gln Ile Ala Glu Ile Gly Ala Ser Leu Ile Lys His Trp 
        275                 280                 285     


<210>  34
<211>  225
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  34

Met Ala Ser Ser Glu Asp Val Ile Lys Glu Phe Met Arg Phe Lys Val 
1               5                   10                  15      


Arg Met Glu Gly Ser Val Asn Gly His Glu Phe Glu Ile Glu Gly Glu 
            20                  25                  30          


Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys Val 
        35                  40                  45              


Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln 
    50                  55                  60                  


Phe Gln Tyr Gly Ser Lys Val Tyr Val Lys His Pro Ala Asp Ile Pro 
65                  70                  75                  80  


Asp Tyr Lys Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val 
                85                  90                  95      


Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser 
            100                 105                 110         


Leu Gln Asp Gly Ser Phe Ile Tyr Lys Val Lys Phe Ile Gly Val Asn 
        115                 120                 125             


Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu 
    130                 135                 140                 


Ala Ser Thr Glu Arg Leu Tyr Pro Arg Asp Gly Val Leu Lys Gly Glu 
145                 150                 155                 160 


Ile His Lys Ala Leu Lys Leu Lys Asp Gly Gly His Tyr Leu Val Glu 
                165                 170                 175     


Phe Lys Ser Ile Tyr Met Ala Lys Lys Pro Val Gln Leu Pro Gly Tyr 
            180                 185                 190         


Tyr Tyr Val Asp Ser Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr 
        195                 200                 205             


Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly Arg His His Leu Phe 
    210                 215                 220                 


Leu 
225 


<210>  35
<211>  239
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  35

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu 
1               5                   10                  15      


Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
            20                  25                  30          


Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile 
        35                  40                  45              


Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
    50                  55                  60                  


Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65                  70                  75                  80  


Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu 
                85                  90                  95      


Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
            100                 105                 110         


Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly 
        115                 120                 125             


Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr 
    130                 135                 140                 


Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn 
145                 150                 155                 160 


Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser 
                165                 170                 175     


Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly 
            180                 185                 190         


Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu 
        195                 200                 205             


Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
    210                 215                 220                 


Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225                 230                 235                 


<210>  36
<211>  7
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  36

Pro Lys Lys Lys Arg Lys Val 
1               5           


<210>  37
<211>  16
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  37

Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys 
1               5                   10                  15      


<210>  38
<211>  40
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  38

Ser Val Gly Arg Ala Thr Ser Thr Ala Glu Leu Leu Val Gln Gly Glu 
1               5                   10                  15      


Glu Glu Val Pro Ala Lys Lys Thr Lys Thr Ile Val Ser Thr Ala Gln 
            20                  25                  30          


Ile Ser Glu Ser Arg Gln Thr Arg 
        35                  40  


<210>  39
<211>  16
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  39

Val Gln Gly Glu Glu Glu Val Pro Ala Lys Lys Thr Lys Thr Ile Val 
1               5                   10                  15      


<210>  40
<211>  10
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  40

Val Pro Ala Lys Lys Thr Lys Thr Ile Val 
1               5                   10  


<210>  41
<211>  7
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  41

Pro Ala Lys Lys Thr Lys Thr 
1               5           


<210>  42
<211>  12
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  42

Ser Ser Leu Arg Pro Pro Lys Lys Lys Arg Lys Val 
1               5                   10          


<210>  43
<211>  12
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic amino acid sequence

<400>  43

Ser Ser Leu Arg Pro Pro Lys Lys Arg Gly Arg Phe 
1               5                   10          


<210>  44
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  44
tgcatgagcc acaggcatt                                                    19


<210>  45
<211>  27
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleotide sequence

<400>  45
gctgtctatt tttgacacca gcttatt                                           27


<210>  46
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  46
agtgggtggg acttaaaaga aatgggtgga gggatatagg ggtgtgtctt                  50


<210>  47
<211>  16
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic amino acid sequence


<220>
<221>  misc_feature
<222>  (3)..(12)
<223>  Xaa can be any amino acid

<400>  47

Lys Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Lys Lys Leu 
1               5                   10                  15      


