                               SEQUENCE LISTING

<110> PRESIDENT AND FELLOWS OF HARVARD COLLEGE
      DANA-FARBER CANCER INSTITUTE, INC.
 
<120> IMMUNOTHERAPEUTIC VIRUS FOR THE TREATMENT OF CANCER

<130> 002806-098100WOPT

<140> PCT/US2021/051956
<141> 2021-09-24

<150> 63/083,487
<151> 2020-09-25

<160> 30    

<170> PatentIn version 3.5

<210> 1
<211> 775
<212> PRT
<213> Human alphaherpesvirus 1

<400> 1
Met Glu Pro Arg Pro Gly Ala Ser Thr Arg Arg Pro Glu Gly Arg Pro 
1               5                   10                  15      


Gln Arg Glu Pro Ala Pro Asp Val Trp Val Phe Pro Cys Asp Arg Asp 
            20                  25                  30          


Leu Pro Asp Ser Ser Asp Ser Glu Ala Glu Thr Glu Val Gly Gly Arg 
        35                  40                  45              


Gly Asp Ala Asp His His Asp Asp Asp Ser Ala Ser Glu Ala Asp Ser 
    50                  55                  60                  


Thr Asp Thr Glu Leu Phe Glu Thr Gly Leu Leu Gly Pro Gln Gly Val 
65                  70                  75                  80  


Asp Gly Gly Ala Val Ser Gly Gly Ser Pro Pro Arg Glu Glu Asp Pro 
                85                  90                  95      


Gly Ser Cys Gly Gly Ala Pro Pro Arg Glu Asp Gly Gly Ser Asp Glu 
            100                 105                 110         


Gly Asp Val Cys Ala Val Cys Thr Asp Glu Ile Ala Pro His Leu Arg 
        115                 120                 125             


Cys Asp Thr Phe Pro Cys Met His Arg Phe Cys Ile Pro Cys Met Lys 
    130                 135                 140                 


Thr Trp Met Gln Leu Arg Asn Thr Cys Pro Leu Cys Asn Ala Lys Leu 
145                 150                 155                 160 


Val Tyr Leu Ile Val Gly Val Thr Pro Ser Gly Ser Phe Ser Thr Ile 
                165                 170                 175     


Pro Ile Val Asn Asp Pro Gln Thr Arg Met Glu Ala Glu Glu Ala Val 
            180                 185                 190         


Arg Ala Gly Thr Ala Val Asp Phe Ile Trp Thr Gly Asn Gln Arg Phe 
        195                 200                 205             


Ala Pro Arg Tyr Leu Thr Leu Gly Gly His Thr Val Arg Ala Leu Ser 
    210                 215                 220                 


Pro Thr His Pro Glu Pro Thr Thr Asp Glu Asp Asp Asp Asp Leu Asp 
225                 230                 235                 240 


Asp Ala Asp Tyr Val Pro Pro Ala Pro Arg Arg Thr Pro Arg Ala Pro 
                245                 250                 255     


Pro Arg Arg Gly Ala Ala Ala Pro Pro Val Thr Gly Gly Ala Ser His 
            260                 265                 270         


Ala Ala Pro Gln Pro Ala Ala Ala Arg Thr Ala Pro Pro Ser Ala Pro 
        275                 280                 285             


Ile Gly Pro His Gly Ser Ser Asn Thr Asn Thr Thr Thr Asn Ser Ser 
    290                 295                 300                 


Gly Gly Gly Gly Ser Arg Gln Ser Arg Ala Ala Ala Pro Arg Gly Ala 
305                 310                 315                 320 


Ser Gly Pro Ser Gly Gly Val Gly Val Gly Val Gly Val Val Glu Ala 
                325                 330                 335     


Glu Ala Gly Arg Pro Arg Gly Arg Thr Gly Pro Leu Val Asn Arg Pro 
            340                 345                 350         


Ala Pro Leu Ala Asn Asn Arg Asp Pro Ile Val Ile Ser Asp Ser Pro 
        355                 360                 365             


Pro Ala Ser Pro His Arg Pro Pro Ala Ala Pro Met Pro Gly Ser Ala 
    370                 375                 380                 


Pro Arg Pro Gly Pro Pro Ala Ser Ala Ala Ala Ser Gly Pro Ala Arg 
385                 390                 395                 400 


Pro Arg Ala Ala Val Ala Pro Cys Val Arg Ala Pro Pro Pro Gly Pro 
                405                 410                 415     


Gly Pro Arg Ala Pro Ala Pro Gly Ala Glu Pro Ala Ala Arg Pro Ala 
            420                 425                 430         


Asp Ala Arg Arg Val Pro Gln Ser His Ser Ser Leu Ala Gln Ala Ala 
        435                 440                 445             


Asn Gln Glu Gln Ser Leu Cys Arg Ala Arg Ala Thr Val Ala Arg Gly 
    450                 455                 460                 


Ser Gly Gly Pro Gly Val Glu Gly Gly His Gly Pro Ser Arg Gly Ala 
465                 470                 475                 480 


Ala Pro Ser Gly Ala Ala Pro Leu Pro Ser Ala Ala Ser Val Glu Gln 
                485                 490                 495     


Glu Ala Ala Val Arg Pro Arg Lys Arg Arg Gly Ser Gly Gln Glu Asn 
            500                 505                 510         


Pro Ser Pro Gln Ser Thr Arg Pro Pro Leu Ala Pro Ala Gly Ala Lys 
        515                 520                 525             


Arg Ala Ala Thr His Pro Pro Ser Asp Ser Gly Pro Gly Gly Arg Gly 
    530                 535                 540                 


Gln Gly Gly Pro Gly Thr Pro Leu Thr Ser Ser Ala Ala Ser Ala Ser 
545                 550                 555                 560 


Ser Ser Ser Ala Ser Ser Ser Ser Ala Pro Thr Pro Ala Gly Ala Ala 
                565                 570                 575     


Ser Ser Ala Ala Gly Ala Ala Ser Ser Ser Ala Ser Ala Ser Ser Gly 
            580                 585                 590         


Gly Ala Val Gly Ala Leu Gly Gly Arg Gln Glu Glu Thr Ser Leu Gly 
        595                 600                 605             


Pro Arg Ala Ala Ser Gly Pro Arg Gly Pro Arg Lys Cys Ala Arg Lys 
    610                 615                 620                 


Thr Arg His Ala Glu Thr Ser Gly Ala Val Pro Ala Gly Gly Leu Thr 
625                 630                 635                 640 


Arg Tyr Leu Pro Ile Ser Gly Val Ser Ser Val Val Ala Leu Ser Pro 
                645                 650                 655     


Tyr Val Asn Lys Thr Ile Thr Gly Asp Cys Leu Pro Ile Leu Asp Met 
            660                 665                 670         


Glu Thr Gly Asn Ile Gly Ala Tyr Val Val Leu Val Asp Gln Thr Gly 
        675                 680                 685             


Asn Met Ala Thr Arg Leu Arg Ala Ala Val Pro Gly Trp Ser Arg Arg 
    690                 695                 700                 


Thr Leu Leu Pro Glu Thr Ala Gly Asn His Val Met Pro Pro Glu Tyr 
705                 710                 715                 720 


Pro Thr Ala Pro Ala Ser Glu Trp Asn Ser Leu Trp Met Thr Pro Val 
                725                 730                 735     


Gly Asn Met Leu Phe Asp Gln Gly Thr Leu Val Gly Ala Leu Asp Phe 
            740                 745                 750         


Arg Ser Leu Arg Ser Arg His Pro Trp Ser Gly Glu Gln Gly Ala Ser 
        755                 760                 765             


Thr Arg Asp Glu Gly Lys Gln 
    770                 775 


<210> 2
<211> 3233
<212> DNA
<213> Human alphaherpesvirus 1

<400> 2
atggagcccc gccccggagc gagtacccgc cggcctgagg gccgccccca gcgcgaggtg       60

aggggccggg cgccatgtct ggggcgccat gttggggggc gccatgttgg ggggcgccat      120

gttgggggac ccccgaccct tacactggaa ccggccgcca tgttggggga cccccactca      180

tacacgggag ccgggcgcca tgttggggcg ccatgttagg gggcgtggaa ccccgtgaca      240

ctatatatac agggaccggg ggcgccatgt tagggggcgc ggaaccccct gaccctatat      300

atacagggac cggggtcgcc ctgttagggg tcgccatgtg accccctgac tttatatata      360

cagaccccca acacctacac atggcccctt tgactcagac gcagggcccg gggtcgccgt      420

gggacccccc tgactcatac acagagacac gcccccacaa caaacacaca gggaccgggg      480

tcgccgtgtt agggggcgtg gtccccactg actcatacgc agggccccct tactcacacg      540

catctagggg ggtggggagg agccgcccgc catatttggg ggacgccgtg ggacccccga      600

ctccggtgcg tctggagggc gggagaagag ggaagaagag gggtcgggat ccaaaggacg      660

gacccagacc acctttggtt gcagacccct ttctcccccc tcttccgagg ccagcagggg      720

ggcaggactt tgtgaggcgg ggggggaggg ggaactcgtg ggcgctgatt gacgcgggaa      780

atccccccat tcttacccgc cccccctttt ttcccctcag cccgccccgg atgtctgggt      840

gtttccctgc gaccgagacc tgccggacag cagcgactcg gaggcggaga ccgaagtggg      900

ggggcggggg gacgccgacc accatgacga cgactccgcc tccgaggcgg acagcacgga      960

cacggaactg ttcgagacgg ggctgctggg gccgcagggc gtggatgggg gggcggtctc     1020

gggggggagc cccccccgcg aggaagaccc cggcagttgc gggggcgccc cccctcgaga     1080

ggacgggggg agcgacgagg gcgacgtgtg cgccgtgtgc acggatgaga tcgcgcccca     1140

cctgcgctgc gacaccttcc cgtgcatgca ccgcttctgc atcccgtgca tgaaaacctg     1200

gatgcaattg cgcaacacct gcccgctgtg caacgccaag ctggtgtacc tgatagtggg     1260

cgtgacgccc agcgggtcgt tcagcaccat cccgatcgtg aacgaccccc agacccgcat     1320

ggaggccgag gaggccgtca gggcgggcac ggccgtggac tttatctgga cgggcaatca     1380

gcggttcgcc ccgcggtacc tgaccctggg ggggcacacg gtgagggccc tgtcgcccac     1440

ccaccctgag cccaccacgg acgaggatga cgacgacctg gacgacggtg aggcgggggg     1500

gcggcgagga ccctggggga ggaggaggag ggggggggga gggaggaata ggcgggcggg     1560

cgggcgagga aagggcgggc cggggagggg gcgtaacctg atcgcgcccc ccgttgtctc     1620

ttgcagcaga ctacgtaccg cccgcccccc gccggacgcc ccgcgccccc ccacgcagag     1680

gcgccgccgc gccccccgtg acgggcgggg cgtctcacgc agccccccag ccggccgcgg     1740

ctcggacagc gcccccctcg gcgcccatcg ggccacacgg cagcagtaac actaacacca     1800

ccaccaacag cagcggcggc ggcggctccc gccagtcgcg agccgcggtg ccgcgggggg     1860

cgtctggccc ctccgggggg gttggggttg ttgaagcgga ggcggggcgg ccgaggggcc     1920

ggacgggccc ccttgtcaac agacccgccc cccttgcaaa caacagagac cccatagtga     1980

tcagcgactc ccccccggcc tctccccaca ggccccccgc ggcgcccatg ccaggctccg     2040

ccccccgccc cggtcccccc gcgtccgcgg ccgcgtcggg ccccgcgcgc ccccgcgcgg     2100

ccgtggcccc gtgtgtgcgg gcgccgcctc cggggcccgg cccccgcgcc ccggcccccg     2160

gggcggagcc ggccgcccgc cccgcggacg cgcgccgtgt gccccagtcg cactcgtccc     2220

tggctcaggc cgcgaaccaa gaacagagtc tgtgccgggc gcgtgcgacg gtggcgcgcg     2280

gctcgggggg gccgggcgtg gagggtggac acgggccctc ccgcggcgcc gccccctccg     2340

gcgccgcccc ctccggcgcc cccccgctcc cctccgccgc ctctgtcgag caggaggcgg     2400

cggtgcgtcc gaggaagagg cgcgggtcgg gccaggaaaa cccctccccc cagtccacgc     2460

gtccccccct cgcgccggca ggggccaaga gggcggcgac gcaccccccc tccgactcag     2520

ggccgggggg gcgcggccag ggagggcccg ggacccccct gacgtcctcg gcggcctccg     2580

cctcttcctc ctccgcctct tcctcctcgg ccccgactcc cgcgggggcc acctcttccg     2640

ccaccggggc cgcgtcctcc tccgcttccg cctcctcggg cggggccgtc ggtgccctgg     2700

gagggagaca agaggaaacc tccctcggcc cccgcgctgc ttctgggccg cgggggccga     2760

ggaagtgtgc ccggaagacg cgccacgcgg agacttccgg ggccgtcccc gcgggcggcc     2820

tcacgcgcta cctgcccatc tcgggggtct ctagcgtggt cgccctgtcg ccttacgtga     2880

acaagacgat cacgggggac tgcctgccca tcctggacat ggagacgggg aacatcgggg     2940

cgtacgtggt cctggtggac cagacgggaa acatggcgac ccggctgcgg gccgcggtcc     3000

ccggctggag ccgccgcacc ctgctccccg agaccgcggg taaccacgtg acgccccccg     3060

agtacccgac ggcccccgcg tcggagtgga acagcctctg gatgaccccc gtggggaaca     3120

tgctgttcga ccagggcacc ctagtgggcg ccctggactt ccgcagcctg cggtctcggc     3180

acccgtggtc cggggagcag ggggcgtcga cccgggacga gggaaaacaa taa            3233


<210> 3
<211> 1298
<212> PRT
<213> Human alphaherpesvirus 1

<400> 3
Met Ala Ser Glu Asn Lys Gln Arg Pro Gly Ser Pro Gly Pro Thr Asp 
1               5                   10                  15      


Gly Pro Pro Pro Thr Pro Ser Pro Asp Arg Asp Glu Arg Gly Ala Leu 
            20                  25                  30          


Gly Trp Gly Ala Glu Thr Glu Glu Gly Gly Asp Asp Pro Asp His Asp 
        35                  40                  45              


Pro Asp His Pro His Asp Leu Asp Asp Ala Arg Arg Asp Gly Arg Ala 
    50                  55                  60                  


Pro Ala Ala Gly Thr Asp Ala Gly Glu Asp Ala Gly Asp Ala Val Ser 
65                  70                  75                  80  


Pro Arg Gln Leu Ala Leu Leu Ala Ser Met Val Glu Glu Ala Val Arg 
                85                  90                  95      


Thr Ile Pro Thr Pro Asp Pro Ala Ala Ser Pro Pro Arg Thr Pro Ala 
            100                 105                 110         


Phe Arg Ala Asp Asp Asp Asp Gly Asp Glu Tyr Asp Asp Ala Ala Asp 
        115                 120                 125             


Ala Ala Gly Asp Arg Ala Pro Ala Arg Gly Arg Glu Arg Glu Ala Pro 
    130                 135                 140                 


Leu Arg Gly Ala Tyr Pro Asp Pro Thr Asp Arg Leu Ser Pro Arg Pro 
145                 150                 155                 160 


Pro Ala Gln Pro Pro Arg Arg Arg Arg His Gly Arg Trp Arg Pro Ser 
                165                 170                 175     


Ala Ser Ser Thr Ser Ser Asp Ser Gly Ser Ser Ser Ser Ser Ser Ala 
            180                 185                 190         


Ser Ser Ser Ser Ser Ser Ser Asp Glu Asp Glu Asp Asp Asp Gly Asn 
        195                 200                 205             


Asp Ala Ala Asp His Ala Arg Glu Ala Arg Ala Val Gly Arg Gly Pro 
    210                 215                 220                 


Ser Ser Ala Ala Pro Ala Ala Pro Gly Arg Thr Pro Pro Pro Pro Gly 
225                 230                 235                 240 


Pro Pro Pro Leu Ser Glu Ala Ala Pro Lys Pro Arg Ala Ala Ala Arg 
                245                 250                 255     


Thr Pro Ala Ala Ser Ala Gly Arg Ile Glu Arg Arg Arg Ala Arg Ala 
            260                 265                 270         


Ala Val Ala Gly Arg Asp Ala Thr Gly Arg Phe Thr Ala Gly Gln Pro 
        275                 280                 285             


Arg Arg Val Glu Leu Asp Ala Asp Ala Thr Ser Gly Ala Phe Tyr Ala 
    290                 295                 300                 


Arg Tyr Arg Asp Gly Tyr Val Ser Gly Glu Pro Trp Pro Gly Ala Gly 
305                 310                 315                 320 


Pro Pro Pro Pro Gly Arg Val Leu Tyr Gly Gly Leu Gly Asp Ser Arg 
                325                 330                 335     


Pro Gly Leu Trp Gly Ala Pro Glu Ala Glu Glu Ala Arg Arg Arg Phe 
            340                 345                 350         


Glu Ala Ser Gly Ala Pro Ala Ala Val Trp Ala Pro Glu Leu Gly Asp 
        355                 360                 365             


Ala Ala Gln Gln Tyr Ala Leu Ile Thr Arg Leu Leu Tyr Thr Pro Asp 
    370                 375                 380                 


Ala Glu Ala Met Gly Trp Leu Gln Asn Pro Arg Val Val Pro Gly Asp 
385                 390                 395                 400 


Val Ala Leu Asp Gln Ala Cys Phe Arg Ile Ser Gly Ala Ala Arg Asn 
                405                 410                 415     


Ser Ser Ser Phe Ile Thr Gly Ser Val Ala Arg Ala Val Pro His Leu 
            420                 425                 430         


Gly Tyr Ala Met Ala Ala Gly Arg Phe Gly Trp Gly Leu Ala His Ala 
        435                 440                 445             


Ala Ala Ala Val Ala Met Ser Arg Arg Tyr Asp Arg Ala Gln Lys Gly 
    450                 455                 460                 


Phe Leu Leu Thr Ser Leu Arg Arg Ala Tyr Ala Pro Leu Leu Ala Arg 
465                 470                 475                 480 


Glu Asn Ala Ala Leu Thr Gly Ala Ala Gly Ser Pro Gly Ala Gly Ala 
                485                 490                 495     


Asp Asp Glu Gly Val Ala Ala Val Ala Ala Ala Ala Pro Gly Glu Arg 
            500                 505                 510         


Ala Val Pro Ala Gly Tyr Gly Ala Ala Gly Ile Leu Ala Ala Leu Gly 
        515                 520                 525             


Arg Leu Ser Ala Ala Pro Ala Ser Pro Ala Gly Gly Asp Asp Pro Asp 
    530                 535                 540                 


Ala Ala Arg His Ala Asp Ala Asp Asp Asp Ala Gly Arg Arg Ala Gln 
545                 550                 555                 560 


Ala Gly Arg Val Ala Val Glu Cys Leu Ala Ala Cys Arg Gly Ile Leu 
                565                 570                 575     


Glu Ala Leu Ala Glu Gly Phe Asp Gly Asp Leu Ala Ala Val Pro Gly 
            580                 585                 590         


Leu Ala Gly Ala Arg Pro Ala Ser Pro Pro Arg Pro Glu Gly Pro Ala 
        595                 600                 605             


Gly Pro Ala Ser Pro Pro Pro Pro His Ala Asp Ala Pro Arg Leu Arg 
    610                 615                 620                 


Ala Trp Leu Arg Glu Leu Arg Phe Val Arg Asp Ala Leu Val Leu Met 
625                 630                 635                 640 


Arg Leu Arg Gly Asp Leu Arg Val Ala Gly Gly Ser Glu Ala Ala Val 
                645                 650                 655     


Ala Ala Val Arg Ala Val Ser Leu Val Ala Gly Ala Leu Gly Pro Ala 
            660                 665                 670         


Leu Pro Arg Asp Pro Arg Leu Pro Ser Ser Ala Ala Ala Ala Ala Ala 
        675                 680                 685             


Asp Leu Leu Phe Asp Asn Gln Ser Leu Arg Pro Leu Leu Ala Ala Ala 
    690                 695                 700                 


Ala Ser Ala Pro Asp Ala Ala Asp Ala Leu Ala Ala Ala Ala Ala Ser 
705                 710                 715                 720 


Ala Ala Pro Arg Glu Gly Arg Lys Arg Lys Ser Pro Gly Pro Ala Arg 
                725                 730                 735     


Pro Pro Gly Gly Gly Gly Pro Arg Pro Pro Lys Thr Lys Lys Ser Gly 
            740                 745                 750         


Ala Asp Ala Pro Gly Ser Asp Ala Arg Ala Pro Leu Pro Ala Pro Ala 
        755                 760                 765             


Pro Pro Ser Thr Pro Pro Gly Pro Glu Pro Ala Pro Ala Gln Pro Ala 
    770                 775                 780                 


Ala Pro Arg Ala Ala Ala Ala Gln Ala Arg Pro Arg Pro Val Ala Val 
785                 790                 795                 800 


Ser Arg Arg Pro Ala Glu Gly Pro Asp Pro Leu Gly Gly Trp Arg Arg 
                805                 810                 815     


Gln Pro Pro Gly Pro Ser His Thr Ala Ala Pro Ala Ala Ala Ala Leu 
            820                 825                 830         


Glu Ala Tyr Cys Ser Pro Arg Ala Val Ala Glu Leu Thr Asp His Pro 
        835                 840                 845             


Leu Phe Pro Val Pro Trp Arg Pro Ala Leu Met Phe Asp Pro Arg Ala 
    850                 855                 860                 


Leu Ala Ser Ile Ala Ala Arg Cys Ala Gly Pro Ala Pro Ala Ala Gln 
865                 870                 875                 880 


Ala Ala Cys Gly Gly Gly Asp Asp Asp Asp Asn Pro His Pro His Gly 
                885                 890                 895     


Ala Ala Gly Gly Arg Leu Phe Gly Pro Leu Arg Ala Ser Gly Pro Leu 
            900                 905                 910         


Arg Arg Met Ala Ala Trp Met Arg Gln Ile Pro Asp Pro Glu Asp Val 
        915                 920                 925             


Arg Val Val Val Leu Tyr Ser Pro Leu Pro Gly Glu Asp Leu Ala Gly 
    930                 935                 940                 


Gly Gly Ala Ser Gly Gly Pro Pro Glu Trp Ser Ala Glu Arg Gly Gly 
945                 950                 955                 960 


Leu Ser Cys Leu Leu Ala Ala Leu Ala Asn Arg Leu Cys Gly Pro Asp 
                965                 970                 975     


Thr Ala Ala Trp Ala Gly Asn Trp Thr Gly Ala Pro Asp Val Ser Ala 
            980                 985                 990         


Leu Gly Ala Gln Gly Val Leu Leu  Leu Ser Thr Arg Asp  Leu Ala Phe 
        995                 1000                 1005             


Ala Gly  Ala Val Glu Phe Leu  Gly Leu Leu Ala Ser  Ala Gly Asp 
    1010                 1015                 1020             


Arg Arg  Leu Ile Val Val Asn  Thr Val Arg Ala Cys  Asp Trp Pro 
    1025                 1030                 1035             


Ala Asp  Gly Pro Ala Val Ser  Arg Gln His Ala Tyr  Leu Ala Cys 
    1040                 1045                 1050             


Glu Leu  Leu Pro Ala Val Gln  Cys Ala Val Arg Trp  Pro Ala Ala 
    1055                 1060                 1065             


Arg Asp  Leu Arg Arg Thr Val  Leu Ala Ser Gly Arg  Val Phe Gly 
    1070                 1075                 1080             


Pro Gly  Val Phe Ala Arg Val  Glu Ala Ala His Ala  Arg Leu Tyr 
    1085                 1090                 1095             


Pro Asp  Ala Pro Pro Leu Arg  Leu Cys Arg Gly Gly  Asn Val Arg 
    1100                 1105                 1110             


Tyr Arg  Val Arg Thr Arg Phe  Gly Pro Asp Thr Pro  Val Pro Met 
    1115                 1120                 1125             


Ser Pro  Arg Glu Tyr Arg Arg  Ala Val Leu Pro Ala  Leu Asp Gly 
    1130                 1135                 1140             


Arg Ala  Ala Ala Ser Gly Thr  Thr Asp Ala Met Ala  Pro Gly Ala 
    1145                 1150                 1155             


Pro Asp  Phe Cys Glu Glu Glu  Ala His Ser His Ala  Ala Cys Ala 
    1160                 1165                 1170             


Arg Trp  Gly Leu Gly Ala Pro  Leu Arg Pro Val Tyr  Val Ala Leu 
    1175                 1180                 1185             


Gly Arg  Glu Ala Val Arg Ala  Gly Pro Ala Arg Trp  Arg Gly Pro 
    1190                 1195                 1200             


Arg Arg  Asp Phe Cys Ala Arg  Ala Leu Leu Glu Pro  Asp Asp Asp 
    1205                 1210                 1215             


Ala Pro  Pro Leu Val Leu Arg  Gly Asp Asp Asp Gly  Pro Gly Ala 
    1220                 1225                 1230             


Leu Pro  Pro Ala Pro Pro Gly  Ile Arg Trp Ala Ser  Ala Thr Gly 
    1235                 1240                 1245             


Arg Ser  Gly Thr Val Leu Ala  Ala Ala Gly Ala Val  Glu Val Leu 
    1250                 1255                 1260             


Gly Ala  Glu Ala Gly Leu Ala  Thr Pro Pro Arg Arg  Glu Val Val 
    1265                 1270                 1275             


Asp Trp  Glu Gly Ala Trp Asp  Glu Asp Asp Gly Gly  Ala Phe Glu 
    1280                 1285                 1290             


Gly Asp  Gly Val Leu 
    1295             


<210> 4
<211> 3885
<212> DNA
<213> Human alphaherpesvirus 1

<400> 4
atggcgtcgg agaacaagca gcgccccggc tccccgggcc ccaccgacgg gccgccgccc       60

accccgagcc cagaccgcga cgagcggggg gccctcgggt ggggcgcgga gacggaggag      120

ggcggggacg accccgacca cgaccccgac cacccccacg acctcgacga cgcccggcgg      180

gacgggaggg cccccgcggc gggcaccgac gccggcgagg acgccgggga cgccgtctcg      240

tcgcgacagc tggctctgct ggcctccatg gtagaggagg ccgtccggac gatcccgacg      300

cccgaccccg cggcctcgcc gccccggacc cccgcctttc tagccgacga cgatgacggg      360

gacgagtacg acgacgcagc cgacgccgcc ggcgaccggg ccccggcccg gggccgcgaa      420

cgggaggccc cgctacgcgg cgcgtatccg gaccccacgg accgcctgtc gccgcgcccg      480

ccggcccagc cgccgcggag acgtcgtcac ggccggcggc ggccatcggc gtcatcgacc      540

tcgtcggact ccgggtcctc gtcctcgtcg tccgcatcct cttcgtcctc gtcgtccgac      600

gaggacgagg acgacgacgg caacgacgcg gccgaccacg cacgcgaggc gcgggccgtc      660

gggcggggtc cgtcgagcgc ggcgccggaa gcccccgggc ggacgccgcc cccgcccggg      720

ccaccccccc tctccgaggc cgcgcccaag ccccgggcgg cggcgaggac ccccgcggcc      780

tccgcgggcc gcatcgagcg ccgccgggcc cgcgcggcgg tggccggccg cgacgccacg      840

ggccgcttca cggccgggca gccccggcgg gtcgagctgg acgccgacgc ggcctccggc      900

gccttctacg cgcgctatcg cgacgggtac gtcagcgggg agccgtggcc cggcgccggg      960

cccccgcccc cggggcgggt gctgtacggc ggcctgggcg acagccgccc gggcctctgg     1020

ggggcgcccg aggcggagga ggcgcgacgc cggttcgagg cctcgggcgc cccggcggcc     1080

gtgtgggcgc ccgagctggg cgacgccgcg cagcagtacg ccctgatcac gcggctgctg     1140

tacaccccgg acgcggaggc catggggtgg ctccagaacc cgcgcgtggt ccccggggac     1200

gtggcgctgg accaggcctg cttccggatc tcgggcgccg cgcgcaacag cagctccttc     1260

atcaccggca gcgtggcgcg ggccgtgccc cacctgggct acgccatggc ggccggccgc     1320

ttcggctggg gcctggcgca cgcggcggcc gccgtggcca tgagccgccg atacgaccgc     1380

gcgcagaagg gcttcctgct gaccagcctg cgccgcgcct acgcgcccct gttggcgcgc     1440

gagaacgcgg cgctgacggg ggccgcgggg agccccggcg ccggcgcaga tgacgagggg     1500

gtcgccgccg tcgccgccgc cgcaccgggc gagcgcgcgg tgcccgccgg gtacggcgcc     1560

gcggggatcc tcgccgccct ggggcggctg tccgccgcgc ccgcctcccc cgtggggggc     1620

gacgaccccg acgccgcccg ccacgccgac gccgacgccg ggcgccgcgc ccaggccggc     1680

cgcgtggccg tcgagtgcct ggccgcctgc cgcgggatcc tggaggcgct ggccgagggc     1740

ttcgacggcg acctggcggc cgtcccgggg ctggccgggg cccggcccgc cagccccccg     1800

cggccggagg gacccgcggg ccccgcttcc ccgccgccgc cgcacgccga cgcgccccgc     1860

ctgcgcgcgt ggctgcgcga gctgcggttc gtgcgcgacg cgctggtgct catgcgcctg     1920

cgcggggacc tgcgcgtggc cggcggcagc gaggccgccg tggccgccgt gcgcgccgtg     1980

agcctggtcg ccggggccct gggccccgcg ctgccgcggg acccgcgcct gccgagctcc     2040

gcggccgccg ccgccgcgga cctgctgttt gagaaccaga gcctccgccc cctgctggcg     2100

gcggcggcca gcgcaccgga cgccgccgac gcgctggcgg ccgccgccgc ctccgccgcg     2160

ccgcgggagg ggcgcaagcg caagagtccc ggcccggccc ggccgcccgg aggcggcggc     2220

ccgcgacccc cgaagacgaa gaagagcggc gcggacgccc ccggctcgga cgcccgcgcc     2280

cccctccccg cgcccccctc cacgcccccg gggcccgagc ccacccccgc ccagcccgcg     2340

gcggcccggg gcgccgcggc gcaggcccgc ccgcgccccg tggcgctgtc gcgccggccc     2400

gccgagggcc ccgaccccct gggcggctgg cggcggcagc cccgggggcc cagccacacg     2460

gcggcgcccg cggccgccgc cctggaggcc tactgctccc cgcgcgccgt ggccgagctc     2520

acggaccacc cgctgttccc cgtcccctgg cgaccggccc tcatgtttga cccgcgggcc     2580

ctggcctcga tcgccgcgcg gtgcgccggg cccgcccccg ccgcccaggc cgcgtgcggc     2640

ggcgacgacg acgagaaccc ccacccccac ggggccgccg ggggccgcct ctttggcccc     2700

ctgcgcgcct cgggcccgct gcgccgcatg gcggcctgga tgcgccagat ccccgacccc     2760

gaggacgtgc gcgtggtggt gctgtactcg ccgctgccgg gcgaggacct ggccggcggc     2820

ggggcctcgg gggggccgcc ggagtggtcc gccgagcgcg gcgggctgtc ctgcctgctg     2880

gcggccctgg ccaaccggct gtgcgggccg gacacggccg cctgggcggg caactggacc     2940

ggcgcccccg acgtgtcggc gctgggcgcg cagggcgtgc tgctgctgtc cacgcgggac     3000

ctggccttcg ccggggccgt ggagtttctg gggctgctcg ccagcgccgg cgaccggcgg     3060

ctcatcgtgg tcaacaccgt gcgcgcctgc gactggcccg ccgacgggcc cgcggtgtcg     3120

cggcagcacg cctacctggc gtgcgacctg ctgcccgccg tgcagtgcgc cgtgcgctgg     3180

ccggcggcgc gggacctgcg ccgcacggtg ctggcctcgg gccgcgtgtt cggcccgggg     3240

gtcttcgcgc gcgtggaggc cgcgcacgcg cgcctgtacc ccgacgcgcc gccgctgcgc     3300

ctgtgccgcg gcggcaacgt gcgctaccgc gtgcgcacgc gcttcggccc ggacacgccg     3360

gtgcccatgt ccccgcgcga gtaccgccgg gccgtgctgc cggcgctgga cggccgggcg     3420

gcggcctcgg ggaccaccga cgccatggcg cccggcgcgc cggacttctg cgaggaggag     3480

gcccactcgc accgcgcctg cgcgcgctgg ggcctgggcg cgccgctgcg gcccgtgtac     3540

gtggcgctgg ggcgcgaggc ggtgcgcgcc ggcccggccc ggtggcgcgg gccgcggagg     3600

gacttttgcg cccgcgccct gctggagccc gacgacgacg cccccccgct ggtgctgcgc     3660

ggcgacgacg acgacggccc gggggccctg ccgccggcgc cgcccgggat tcgctgggcc     3720

tcggccacgg gccgcagcgg caccgtgctg gcggcggcgg gggccgtgga ggtgctgggg     3780

gcggaggcgg gcttggccac gcccccgcga cgggacgttg tggactggga aggcgcctgg     3840

gacgaagacg acggcggcgc gttcgagggg gacggggtgc tgtaa                     3885


<210> 5
<211> 420
<212> PRT
<213> Human alphaherpesvirus 1

<400> 5
Met Ala Asp Ile Ser Pro Gly Ala Phe Ala Pro Cys Val Lys Ala Arg 
1               5                   10                  15      


Arg Pro Ala Leu Arg Ser Pro Pro Leu Gly Thr Arg Lys Arg Lys Arg 
            20                  25                  30          


Pro Ser Arg Pro Leu Ser Ser Glu Ser Glu Val Glu Ser Asp Thr Ala 
        35                  40                  45              


Leu Glu Ser Glu Val Glu Ser Glu Thr Ala Ser Asp Ser Thr Glu Ser 
    50                  55                  60                  


Gly Asp Gln Asp Glu Ala Pro Arg Ile Gly Gly Arg Arg Ala Pro Arg 
65                  70                  75                  80  


Arg Leu Gly Gly Arg Phe Phe Leu Asp Met Ser Ala Glu Ser Thr Thr 
                85                  90                  95      


Gly Thr Glu Thr Asp Ala Ser Val Ser Asp Asp Pro Asp Asp Thr Ser 
            100                 105                 110         


Asp Trp Ser Tyr Asp Asp Ile Pro Pro Arg Pro Lys Arg Ala Arg Val 
        115                 120                 125             


Asn Leu Arg Leu Thr Ser Ser Pro Asp Arg Arg Asp Gly Val Ile Phe 
    130                 135                 140                 


Pro Lys Met Gly Arg Val Arg Ser Thr Arg Glu Thr Gln Pro Arg Ala 
145                 150                 155                 160 


Pro Thr Pro Ser Ala Pro Ser Pro Asn Ala Met Leu Arg Arg Ser Val 
                165                 170                 175     


Arg Gln Ala Gln Arg Arg Ser Ser Ala Arg Trp Thr Pro Asp Leu Gly 
            180                 185                 190         


Tyr Met Arg Gln Cys Ile Asn Gln Leu Phe Arg Val Leu Arg Val Ala 
        195                 200                 205             


Arg Asp Pro His Gly Ser Ala Asn Arg Leu Arg His Leu Ile Arg Asp 
    210                 215                 220                 


Cys Tyr Leu Met Gly Tyr Cys Arg Ala Arg Leu Ala Pro Arg Thr Trp 
225                 230                 235                 240 


Cys Arg Leu Leu Gln Val Ser Gly Gly Thr Trp Gly Met His Leu Arg 
                245                 250                 255     


Asn Thr Ile Arg Glu Val Glu Ala Arg Phe Asp Ala Thr Ala Glu Pro 
            260                 265                 270         


Val Cys Lys Leu Pro Cys Leu Glu Thr Arg Arg Tyr Gly Pro Glu Cys 
        275                 280                 285             


Asp Leu Ser Asn Leu Glu Ile His Leu Ser Ala Thr Ser Asp Asp Glu 
    290                 295                 300                 


Ile Ser Asp Ala Thr Asp Leu Glu Ala Ala Gly Ser Asp His Thr Leu 
305                 310                 315                 320 


Ala Ser Gln Ser Asp Thr Glu Asp Ala Pro Ser Pro Val Thr Leu Glu 
                325                 330                 335     


Thr Pro Glu Pro Arg Gly Ser Leu Ala Val Arg Leu Glu Asp Glu Phe 
            340                 345                 350         


Gly Glu Phe Asp Trp Thr Pro Gln Glu Gly Ser Gln Pro Trp Leu Ser 
        355                 360                 365             


Ala Val Val Ala Asp Thr Ser Ser Val Glu Arg Pro Gly Pro Ser Asp 
    370                 375                 380                 


Ser Gly Ala Gly Arg Ala Ala Glu Asp Arg Lys Cys Leu Asp Gly Cys 
385                 390                 395                 400 


Arg Lys Met Arg Phe Ser Thr Ala Cys Pro Tyr Pro Cys Ser Asp Thr 
                405                 410                 415     


Phe Leu Arg Pro 
            420 


<210> 6
<211> 2831
<212> DNA
<213> Human alphaherpesvirus 1

<400> 6
ggccaccgcc gcgcgggccc ggcggcgctc gatgcggccc gcggaggccg cgggggtcct       60

cgccgccgcc cggggcttgg gcgcggcctc ggagaggggg ggtggcccgg gcgggggcgg      120

cgtccgcccg ggggcttccg gcgccgcgct cgacggaccc cgcccgacgg cccgcgcctc      180

gcgtgcgtgg tcggccgcgt cgttgccgtc gtcgtcctcg tcctcgtcgg acgacgagga      240

cgaagaggat gcggacgacg aggacgagga cccggagtcc gacgaggtcg atgacgccga      300

tggccgccgc cggccgtgac gacgtctccg cggcggctgg gccggcgggc gcggcgacag      360

gcggtccgtg gggtccggat acgcgccgcg tagcggggcc tcccgttcgc ggccccgggc      420

cggggcccgg tcgccggcgg cgtcggctgc gtcgtcgtac tcgtccccgt catcgtcgtc      480

ggctagaaag gcgggggtcc ggggcggcga ggccgcgggg tcgggcgtcg ggatcgtccg      540

gacggcctcc tctaccatgg aggccagcag agccagctgt cgcgacgaga cggcgtcccc      600

ggcgtcctcg ccggcgtcgg tgcccgccgc gggggccctc ccgtcccgcc gggcgtcgtc      660

gaggtcgtgg gggtggtcgg ggtcgtggtc ggggtcgtcc ccgccctcct ccgtctccgc      720

gccccacccg agggcccccc gctcgtcgcg gtctgggctc ggggtgggcg gcggcccgtc      780

ggtggggccc ggggagccgg ggcgctgctt gttctccgac gccatcgccg atgcggggcg      840

atcctccggg gatacggctg cgacggcgga cgtagcacgg taggtcacct acggactctc      900

gatgggggga gggggcgaga cccacggacc ccgacgaccc ccgccgtcga cgcggaacta      960

gcgcggaccg gtcgatgctt gggtggggaa aaaggacagg gacggccgat ccccctcccg     1020

cgcttcgtcc gcgtatcggc gtcccggcgc ggcgagcgtc tgacggtctg tctctggcgg     1080

tcccgcgtcg ggtcgtggat ccgtgtcggc agccgcgctc cgtgtggacg atcggggcgt     1140

cctcgggctc atatagtccc aggggccggc gggaaggagg agcagcggag gccgccggcc     1200

ccccgccccc cggcgggccc accccgaacg gaattccatt atgcacgacc ccgccccgac     1260

gccggcacgc cgggggcccg tggccgcggc ccgttggtcg aacccccggc cccgcccatc     1320

cgcgccatct gccatggacg gggcgcgagg gcgggtgggt ccgcgccccg ccccgcatgg     1380

catctcatta ccgcccgatc cggcggtttc cgcttccgtt ccgcatgcta acgaggaacg     1440

ggcagggggc ggggcccggg ccccgacttc ccggttcggc ggtaatgaga tacgagcccc     1500

gcgcgcccgt tggccgtccc cgggcccccg gtcccgcccg ccggacgccg ggaccaacgg     1560

gacggcgggc ggcccttggg ccgcccgcct tgccgccccc ccattggccg gcgggcggga     1620

ccgccccaag ggggcggggc cgccgggtaa aagaagtgag aacgcgaagc gttcgcactt     1680

cgtcccaata tatatatatt attagggcga agtgcgagca ctggcgccgt gcccgactcc     1740

gcgccggccc cgggggcgga cccgggcggc ggggggcggg tctctccggc gcacataaag     1800

gcccggcgcg accgacgccc gcagacggcg ccagccacga acgacgggag cggctgcgga     1860

gcacgcggac cgggagcggg agtcgcagag ggccgtcgga gcggacggcg tcggcatcgc     1920

gacgccccgg ctcgggatcg ggatcgcatc ggaaagggac acgcggacgc gggggggaaa     1980

gacccgccca ccccacccac gaaacacagg ggacgcaccc cgggggcctc cgacgacaga     2040

aacccaccgg tccgcctttt ttgcacgggt aagcaccttg ggtgggcaga ggagggggga     2100

cgcgggggcg gaggaggggg gacgcggggg cggaggaggg gggacgcggg ggcggaggag     2160

gggggacgcg ggggcggagg aggggggacg cgggggcgga ggagggggct cacccgcgtt     2220

cgtgccttcc cgcaggagga acgccctcgt cgaggcgacc ggcggcgacc gttgcgtgga     2280

ccgcttcctg ctcgtcgggg gggggggagc cactgtggtc ctccgggacg ttttctggat     2340

ggccgacatt tccccaggcg cttttgtgcc ttgtgtaaaa gcgcggcgtc ccgctctccg     2400

atccccgccc ctgggcacgc gcaagcgcaa gcgccctgcc cgccccctct catcggagtc     2460

tgaggtcgaa tccgagacag ccttggagtc tgaggtcgaa tccgagacag catcggattc     2520

gaccgagtct ggggaccagg aggaagcccc ccgcatcggt ggccgtaggg ccccccggag     2580

gcttgggggg cggttttttc tggacatgtc ggcggaatcc accacgggga cggaaacgga     2640

tgcgtcggtg tcggacgacc ccgacgacac gtccgactgg tcttgtgacg acattccccc     2700

acgacccaag cgggcccggg taaacctgcg gctcactagc tctcccgatc ggcgggatgg     2760

ggttattttt cctaagatgg ggcgggtccg gtctacccgg gaaacgcagc cccgggcccc     2820

caccccgtcg g                                                          2831


<210> 7
<211> 512
<212> PRT
<213> Human alphaherpesvirus 1

<400> 7
Met Ala Thr Asp Ile Asp Met Leu Ile Asp Leu Gly Leu Asp Leu Ser 
1               5                   10                  15      


Asp Ser Asp Leu Asp Glu Asp Pro Pro Glu Pro Ala Glu Ser Arg Arg 
            20                  25                  30          


Asp Asp Leu Glu Ser Asp Ser Ser Gly Glu Cys Ser Ser Ser Asp Glu 
        35                  40                  45              


Asp Met Glu Asp Pro His Gly Glu Asp Gly Pro Glu Pro Ile Leu Asp 
    50                  55                  60                  


Ala Ala Arg Pro Ala Val Arg Pro Ser Arg Pro Glu Asp Pro Gly Val 
65                  70                  75                  80  


Pro Ser Thr Gln Thr Pro Arg Pro Thr Glu Arg Gln Gly Pro Asn Asp 
                85                  90                  95      


Pro Gln Pro Ala Pro His Ser Val Trp Ser Arg Leu Gly Ala Arg Arg 
            100                 105                 110         


Pro Ser Cys Ser Pro Glu Gln His Gly Gly Lys Val Ala Arg Leu Gln 
        115                 120                 125             


Pro Pro Pro Thr Lys Ala Gln Pro Ala Arg Gly Gly Arg Arg Gly Arg 
    130                 135                 140                 


Arg Arg Gly Arg Gly Arg Gly Gly Pro Gly Ala Ala Asp Gly Leu Ser 
145                 150                 155                 160 


Asp Pro Arg Arg Arg Ala Pro Arg Thr Asn Arg Asn Pro Gly Gly Pro 
                165                 170                 175     


Arg Pro Gly Ala Gly Trp Thr Asp Gly Pro Gly Ala Pro His Gly Glu 
            180                 185                 190         


Ala Trp Arg Gly Ser Glu Gln Pro Asp Pro Pro Gly Gly Gln Arg Thr 
        195                 200                 205             


Arg Gly Val Arg Gln Ala Pro Pro Pro Leu Met Thr Leu Ala Ile Ala 
    210                 215                 220                 


Pro Pro Pro Ala Asp Pro Arg Ala Pro Ala Pro Glu Arg Lys Ala Pro 
225                 230                 235                 240 


Ala Ala Asp Thr Ile Asp Ala Thr Thr Arg Leu Val Leu Arg Ser Ile 
                245                 250                 255     


Ser Glu Arg Ala Ala Val Asp Arg Ile Ser Glu Ser Phe Gly Arg Ser 
            260                 265                 270         


Ala Gln Val Met His Asp Pro Phe Gly Gly Gln Pro Phe Pro Ala Ala 
        275                 280                 285             


Asn Ser Pro Trp Ala Pro Val Leu Ala Gly Gln Gly Gly Pro Phe Asp 
    290                 295                 300                 


Ala Glu Thr Arg Arg Val Ser Trp Glu Thr Leu Val Ala His Gly Pro 
305                 310                 315                 320 


Ser Leu Tyr Arg Thr Phe Ala Gly Asn Pro Arg Ala Ala Ser Thr Ala 
                325                 330                 335     


Lys Ala Met Arg Asp Cys Val Leu Arg Gln Glu Asn Phe Ile Glu Ala 
            340                 345                 350         


Leu Ala Ser Ala Asp Glu Thr Leu Ala Trp Cys Lys Met Cys Ile His 
        355                 360                 365             


His Asn Leu Pro Leu Arg Pro Gln Asp Pro Ile Ile Gly Thr Thr Ala 
    370                 375                 380                 


Ala Val Leu Asp Asn Leu Ala Thr Arg Leu Arg Pro Phe Leu Gln Cys 
385                 390                 395                 400 


Tyr Leu Lys Ala Arg Gly Leu Cys Gly Leu Asp Glu Leu Cys Ser Arg 
                405                 410                 415     


Arg Arg Leu Ala Asp Ile Lys Asp Ile Ala Ser Phe Val Phe Val Ile 
            420                 425                 430         


Leu Ala Arg Leu Ala Asn Arg Val Glu Arg Gly Val Ala Glu Ile Asp 
        435                 440                 445             


Tyr Ala Thr Leu Gly Val Gly Val Gly Glu Lys Met His Phe Tyr Leu 
    450                 455                 460                 


Pro Gly Ala Cys Met Ala Gly Leu Ile Glu Ile Leu Asp Thr His Arg 
465                 470                 475                 480 


Gln Glu Cys Ser Ser Arg Val Cys Glu Leu Thr Ala Ser His Ile Val 
                485                 490                 495     


Ala Pro Pro Tyr Val His Gly Lys Tyr Phe Tyr Cys Asn Ser Leu Phe 
            500                 505                 510         


<210> 8
<211> 1539
<212> DNA
<213> Human alphaherpesvirus 1

<400> 8
atggcgactg acattgatat gctaattgac ctcggcctgg acctctccga cagcgatctg       60

gacgaggacc cccccgagcc ggcggagagc cgccgcgacg acctggaatc ggacagcaac      120

ggggagtgtt cctcgtcgga cgaggacatg gaagaccccc acggagagga cggaccggag      180

ccgatactcg acgccgctcg cccggcggtc cgcccgtctc gtccagaaga ccccggcgta      240

cccagcaccc agacgcctcg tccgacggag cggcagggcc ccaacgatcc tcaaccagcg      300

ccccacagtg tgtggtcgcg cctcggggcc cggcgaccgt cttgctcccc cgagcggcac      360

gggggcaagg tggcccgcct ccaaccccca ccgaccaaag cccagcctgc ccgcggcgga      420

cgccgtgggc gtcgcagggg tcggggtcgc ggtggtcccg gggccgccga tggtttgtcg      480

gacccccgcc ggcgtgcccc cagaaccaat cgcaacccgg ggggaccccg ccccggggcg      540

gggtggacgg acggccccgg cgccccccat ggcgaggcgt ggcgcggaag tgagcagccc      600

gacccacccg gaggcccgcg gacacggagc gtgcgccaag cacccccccc gctaatgacg      660

ctggcgattg cccccccgcc cgcggacccc cgcgccccgg ccccggagcg aaaggcgccc      720

gccgccgaca ccatcgacgc caccacgcgg ttggtcctgc gctccatctc cgagcgcgcg      780

gcggtcgacc gcatcagcga gagcttcggc cgcagcgcac aggtcatgca cgaccccttt      840

ggggggcagc cgtttcccgc cgcgaatagc ccctgggccc cggtgctggc gggccaagga      900

gggccctttg acgccgagac cagacgggtc tcctgggaaa ccttggtcgc ccacggcccg      960

agcctctatc gcacttttgc cggcaatcct cgggccgcat cgaccgccaa ggccatgcgc     1020

gactgcgtgc tgcgccaaga aaatttcatc gaggcgctgg cctccgccga cgagacgctg     1080

gcgtggtgca agatgtgcat ccaccacaac ctgccgctgc gcccccagga ccccattatc     1140

gggacggccg cggcggtgct ggataacctc gccacgcgcc tgcggccctt tctccagtgc     1200

tacctgaagg cgcgaggcct gtgcggcctg gacgaactgt gttcgcggcg gcgtctggcg     1260

gacattaagg acattgcatc cttcgtgttt gtcattctgg ccaggctcgc caaccgcgtc     1320

gagcgtggcg tcgcggagat cgactacgcg acccttggtg tcggggtcgg agagaagatg     1380

catttctacc tccccggggc ctgcatggcg ggcctgatcg aaatcctaga cacgcaccgc     1440

caggagtgtt cgagtcgtgt ctgcgagttg acggccagtc acatcgtcgc ccccccgtac     1500

gtgcacggca aatattttta ttgcaactcc ctgttttag                            1539


<210> 9
<211> 88
<212> PRT
<213> Human alphaherpesvirus 1

<400> 9
Met Ser Trp Ala Leu Glu Met Ala Asp Thr Phe Leu Asp Thr Met Arg 
1               5                   10                  15      


Val Gly Pro Arg Thr Tyr Ala Asp Val Arg Asp Glu Ile Asn Lys Arg 
            20                  25                  30          


Gly Arg Glu Asp Arg Glu Ala Ala Arg Thr Ala Val His Asp Pro Glu 
        35                  40                  45              


Arg Pro Leu Leu Arg Ser Pro Gly Leu Leu Pro Glu Ile Ala Pro Asn 
    50                  55                  60                  


Ala Ser Leu Gly Val Ala His Arg Arg Thr Gly Gly Thr Val Thr Asp 
65                  70                  75                  80  


Ser Pro Arg Asn Pro Val Thr Arg 
                85              


<210> 10
<211> 267
<212> DNA
<213> Human alphaherpesvirus 1

<400> 10
tcaacgggtt accggattac ggggactgtc ggtcacggtc ccgccggttc ttcgatgtgc       60

cacacccaag gatgcgttgg gggcgatttc gggcagcagc ccgggagagc gcagcagggg      120

acgctccggg tcgtgcacgg cggttctggc cgcctcccgg tcctcacgcc cccttttatt      180

gatctcatcg cgtacgtcgg cgtacgtcct gggcccaacc cgcatgttgt ccaggaaggt      240

gtccgccatt tccagggccc acgacat                                          267


<210> 11
<211> 271
<212> DNA
<213> Human alphaherpesvirus 1

<400> 11
aattccatta tgcacgaccc cgccccgacg ccggcacgcc gggggcccgt ggccgcggcc       60

cgttggtcga acccccggcc ccgcccatcc gcgccatctg ccatggacgg ggcgcgaggg      120

cgggtgggtc cgcgccccgc cccgcatggc atctcattac cgcccgatcc ggcggtttcc      180

gcttccgttc cgcatgctaa cgaggaacgg gcagggggcg gggcccgggc cccgacttcc      240

cggttcggcg gtaatgagat acgagccccg c                                     271


<210> 12
<211> 271
<212> DNA
<213> Human alphaherpesvirus 1

<400> 12
cgcgcggggc tcgtatctca ttaccgccga accgggaagt cggggcccgg gccccgcccc       60

ctgcccgttc ctcgttagca tgcggaacgg aagcggaaac cgccggatcg ggcggtaatg      120

agatgccatg cggggcgggg cgcggaccca cccgccctcg cgccccgtcc atggcagatg      180

gcgcggatgg gcggggccgg gggttcgacc aacgggccgc ggccacgggc ccccggcgtg      240

ccggcgtcgg ggcggggtcg tgcataatgg a                                     271


<210> 13
<211> 1474
<212> DNA
<213> Human alphaherpesvirus 1

<400> 13
cgccgatgcg gggcgatcct ccggggatac ggctgcgacg gcggacgtag cacggtaggt       60

cacctacgga ctctcgatgg ggggaggggg cgagacccac ggaccccgac gacccccgcc      120

gtcgacgcgg aactagcgcg gaccggtcga tgcttgggtg gggaaaaagg acagggacgg      180

ccgatccccc tcccgcgctt cgtccgcgta tcggcgtccc ggcgcggcga gcgtctgacg      240

gtctgtctct ggcggtcccg cgtcgggtcg tggatccgtg tcggcagccg cgctccgtgt      300

ggacgatcgg ggcgtcctcg ggctcatata gtcccagggg ccggcgggaa ggaggagcag      360

cggaggccgc cggccccccg ccccccggcg ggcccacccc gaacggaatt ccattatgca      420

cgaccccgcc ccgacgccgg cacgccgggg gcccgtggcc gcggcccgtt ggtcgaaccc      480

ccggccccgc ccatccgcgc catctgccat ggacggggcg cgagggcggg tgggtccgcg      540

ccccgccccg catggcatct cattaccgcc cgatccggcg gtttccgctt ccgttccgca      600

tgctaacgag gaacgggcag ggggcggggc ccgggccccg acttcccggt tcggcggtaa      660

tgagatacga gccccgcgcg cccgttggcc gtccccgggc ccccggtccc gcccgccgga      720

cgccgggacc aacgggacgg cgggcggccc ttgggccgcc cgccttgccg cccccccatt      780

ggccggcggg cgggaccgcc ccaagggggc ggggccgccg ggtaaaagaa gtgagaacgc      840

gaagcgttcg cacttcgtcc caatatatat atattattag ggcgaagtgc gagcactggc      900

gccgtgcccg actccgcgcc ggccccgggg gcggacccgg gcggcggggg gcgggtctct      960

ccggcgcaca taaaggcccg gcgcgaccga cgcccgcaga cggcgccagc cacgaacgac     1020

gggagcggct gcggagcacg cggaccggga gcgggagtcg cagagggccg tcggagcgga     1080

cggcgtcggc atcgcgacgc cccggctcgg gatcgggatc gcatcggaaa gggacacgcg     1140

gacgcggggg ggaaagaccc gcccacccca cccacgaaac acaggggacg caccccgggg     1200

gcctccgacg acagaaaccc accggtccgc cttttttgca cgggtaagca ccttgggtgg     1260

gcagaggagg ggggacgcgg gggcggagga ggggggacgc gggggcggag gaggggggac     1320

gcgggggcgg aggagggggg acgcgggggc ggaggagggg ggacgcgggg gcggaggagg     1380

gggctcaccc gcgttcgtgc cttcccgcag gaggaacgcc ctcgtcgagg cgaccggcgg     1440

cgaccgttgc gtggaccgct tcctgctcgt cggg                                 1474


<210> 14
<211> 7185
<212> DNA
<213> Homo sapiens

<400> 14
atttcgcttt cattttgggc cgagctggag gcggcggggc cgtcccggaa cggctgcggc       60

cgggcacccc gggagttaat ccgaaagcgc cgcaagcccc gcgggccggc cgcaccgcac      120

gtgtcaccga gaagctgatg tagagagaga cacagaagga gacagaaagc aagagaccag      180

agtcccggga aagtcctgcc gcgcctcggg acaattataa aaatgtggcc ccctgggtca      240

gcctcccagc caccgccctc acctgccgcg gccacaggtc tgcatccagc ggctcgccct      300

gtgtccctgc agtgccggct cagcatgtgt ccagcgcgca gtgagtactc agcccgccag      360

gtctttggct cgctcgggtg cggaggggcg gctgcttggg aagagtgggt agaaactcaa      420

gtctgcctaa ggagagactg gaacagcagc agccgtctcc tgcaaagagg aataagaata      480

ggggtctcac cgtgtgctaa atagggcatg tctcttttca tctcgaaaca gcctgatgag      540

ctatcattat gcccacttcg tagaggagaa actgaggccc agcgtgggag atttcctgct      600

ttcctgccta gggtccctca gctacaaaca gaaggcagat cctagccagt tctaaagtca      660

ggcttggccg ggtgcaatgg ctcacgcctg taatcccagc actttgggag gccaaggcgg      720

acagatctcc tgaggtcagg agttcaagac cagcctgacc aacatggtga aaccccgtct      780

ctactaacaa atacacaaat tagccgagtg tggtggtggg tgcctgtaat cccatctact      840

cgggagtcta aggcagggaa aattgcttga actcggaggt ggaggttgca gtgagccaaa      900

atcgtgccac tgcactccag tctgggccac agagtgagac tccgtcttaa aaaaacaaaa      960

aaagtcaggc tttctcactc cactgttttt aaaacactgg gtcccaagtc tgactcagcc     1020

acttcaccac ctggtctggg tttccctgat caaaaacatg taggctttct ctgggtgagt     1080

gttgggttca acacccttgt ccaagctcat ctctagaccc tcacagggag ctggcttcta     1140

gccctagaat agagtggcgc tttgtaataa ctcgagtcat ctctcaggtg ttgaaggaaa     1200

agtgttggaa atgggttgag ggaggtgggt gccaagcatg aatggatggg tgaaggtggc     1260

cagaatggag ggaaggtggt gcaggcaggc catccaggct gaagctcctc cacctgctcc     1320

tcttccttcc aggcctcctc cttgtggcta ccctggtcct cctggaccac ctcagtttgg     1380

ccagaaacct ccccgtggcc actccagacc caggaatgtt cccatgcctt caccactccc     1440

aaaacctgct gagggccgtc agcaacatgc tccagaaggt gagcctttcc tgtcctctcc     1500

actgtggacc tgcaccctcc ctgaggaagg ggcctctgat cctcccctct ggtacctgat     1560

ggaactgcag agaaattgtg gaagttcatt agcagctgtc aacagcagga gagggaactt     1620

tacaaatggc ccaagtgtta aagagtccta gtgaaattgt gtctccagag aaagcaagag     1680

tagtaataaa ttataatgat gttggttttg gctatgtcta cactgagtaa agtggatatt     1740

tgcagtgttc ctgtagcctg ccaacagaga tagaattgtg tcaggtccac atggtttctc     1800

tggcaccaca ccactgatca atcccggaaa tagttacttg ggtgcatact ctgtgttggg     1860

gggcaggggt acaaagatga aatagccttg tccctcctgc tgcccaccag cttctacttg     1920

gtgacacagt tccttcttgg agcaacacat ctcagggagg actatgacag tgcacatgca     1980

gccttcattt tctagggaag agttacctaa ttttatttac agcttcgttc ttttaggttt     2040

catattttaa tgtaaaaaac attgccgtta aaaactgctc cagtagctac tgctgcaaag     2100

gctaaaggct aatgttttaa aagatgtcct ctgcctttct gctctctagg aattttttct     2160

tgagtatttt ctagcttgtt gccttaggaa tatcttaaaa taagctcaat gccaccaagc     2220

cttcttaaag ggctcctttc ccctttcaat ctttcataat gtgctgtgca ttgctcctca     2280

caattcaatt aatcttaatt ggacaggggg ctcaagtgaa aactgctttg tttcccgaag     2340

aaaggttcaa aatgggtaac ttttaggtgc tccaattcat atagattttt ttgtgcaaaa     2400

gctgacaccc ctactcccag atatagtcct aagggtcaaa agattataga aatcaattta     2460

atgttttgta catcatttgt caaatttgct gtagtagaaa acaaatgagg tagatgcaag     2520

tgattcaccg gccatatcca gggcacaggt ttgtagaatt tgggctatat gtttattttt     2580

tagtttgatg ccattcaaaa accaaacatt tcaattaaga tttcaaaatt ctagcttctc     2640

ttgaaaagat ctgaagaaca acactggact cacacctcca aatctaactc tttataacct     2700

gctcagaata gtggggttgc ccaggtctgt ttaaacacct ccgggaatag tgtattcatt     2760

tcctactgct gctgtcacaa atgtagtggg tttaaatgac acaaatttat tatcttacag     2820

ctctgggcat cagaagtcca aaatgggtcc tactgggcta aagtcaatgt gtcagcagag     2880

ctgtagccct ttctggaggc tcttggggag aatctgttgc cttgctttgt caggtctaga     2940

ggctgcccac attccttgac ttgtggcccc ttccattgtc cataccagca atgtctggtt     3000

gagtctttct catgctgctc cctctggact gaccttctac ccgcttctac ttttgaggac     3060

actgcttaca ctggattcac ccagataatc caggatcatc tctctactct tagatcagct     3120

gattagcaaa ttgaatgtca tctgcacctt cattcctctt tggcaggtca catagctcat     3180

taccgggttc cagggattgg gaggtggatg tctttggagg tggagaattg gtctagctac     3240

cacagaaagg gattttacta cttgtgccca aacgtattct gaattttcag gccttgcttc     3300

ttggctcatg ctgttacctc tgtctgaaat ttccctcctc cattgtcgac actttactcc     3360

tcttaatgtg ggtgtcccaa ctctaggaag ctttgcttcc cttcctgccc ctcctcagaa     3420

gcttcatgcc cttctcacca tgtctcacct actgatgcct cctggtcatt tgggcaattg     3480

tctgtctcag cagctaaact tgcaatccag gagggtatac atggtcctct tctctgtcat     3540

gttcttagta cctaatggag tgcctgccac ccattaggct gttgaatgga aaaatgatcg     3600

taatcttcaa tcatacagtc ctttacttcc taagatacat ttcataattt tacccaacgg     3660

tgctttccaa cacactaaga tgacatctct gtgtatgtgt gtgcgtgtgt gcactgcaaa     3720

ttaaatctca gcttgtctta agggtttgca tgtttgttat atccatcaga ctgtgacctt     3780

aacagttccc cctttagaag ttatacttga aataggtatt catagaatac agtcacttta     3840

gcaaaagaag aaacaatttt caccagaagc aagtgtcaca gtgaggtgtg gaggctggtt     3900

agcacaggct gttgacatga tttattgtgt ttacataatg aaaaaatatg taatccagag     3960

aaacatgccc caagcttagg gagaggaggg tgagagacag aatgtaagga atctattttc     4020

ttttgtatta tgaagaacca gaatttcctg caagtcaaag tgacagaggt caagggcagc     4080

cagccggagc cttcctggca ccctggctta ccagccttgt ggggtgccag gtgcattatc     4140

aatgttataa actgagtttc tccttcattt tttataggcc agacaaactc tagaatttta     4200

cccttgcact tctgaagaga ttgatcatga agatatcaca aaagataaaa ccagcacagt     4260

ggaggcctgt ttaccattgg aattaaccaa ggtataaagg attttcctcc cagagcatgc     4320

agtgtggtta aaaactgtgc atcaaatctc acctgcttct aaaaattcac gtttctggta     4380

tccatcatta tgggatttta actggtccta cttcagaagt tagatttggg agaaaaatta     4440

tttttaaaaa atggtttttt ttgcaacaat gtgaatacac ttaacactta aaaatagttc     4500

agatgctaaa tattatgata tgtgttttta ccataataaa aaaaatttga gacctgaaga     4560

gtcagaaaga tttatactaa gaattaatac tttgatatat gattttttcc ctctagaatg     4620

agagttgcct aaattccaga gagacctctt tcataactgt aagtcaaaaa atgaaaagtt     4680

tcagcctgta tgatgaattc atatcactga tgtctgatta ttttttcctc tagaatggga     4740

gttgcctggc ctccagaaag acctctttta tgatggtaag acacacagct ctttcctcaa     4800

atgcaatggg ggaaatgttt ttagcccatc tcaatggata cttccccatc ttgtcatgtc     4860

acccaggccc tgtgccttag tagtatttat gaagacttga agatgtacca ggtggagttc     4920

aagaccatga atgcaaagct tctgatggat cctaagaggc agatctttct agatcaaaac     4980

atgctggcag ttattgatga gctgatgcag gtaagacttc attctatcag tgagagcacc     5040

tttttcatgc taaagataac cagccagggt ctttgataaa gagatataaa aagaggtctg     5100

gaggcctttt aaaggcctga cagacctaca ttttcaagaa gacagccttg agggtgccgt     5160

ctatagggag cacaaatgtg agcagatcac attatgaaaa gcagactcca aagtattcac     5220

tctgtggtat cccccacact cggcaaatgt ttttgtgcat tttcttatgt cagcctcaaa     5280

acaaccatat gggatctgca caaaggaggg aaccaaacct taggagagtt aaaccacttg     5340

tctgaggctc tgcctcttaa aatccttcaa gaggatttca ctacacttac cttctcactc     5400

acctctaaag gctccaggac gtgctccagg atgtgcatgc aaggggccag tttgcatatc     5460

tgcaaatatt gctgggaaac aagcaagatt ggtgcctgat tatgacccat tgtgaaccaa     5520

aatgtgaacc aaataaaaga atgacctatc atctggggca tctataatgt tattcatagg     5580

caagaacttg ctttgttatg ttctgaatca ccataacaca gatgctaata taaaacaaat     5640

atttatataa gcgagggtga ctgctttggt gacgaggaca tgggataaaa atatggtggc     5700

agaaatcatt gtctgaaaag taattgtttt acttttattc ttttcgtgtg tgtgtgtgtg     5760

tgtgtgtgtg tgtgtgtgtg tgtgcatgtg ccagatttct tgtttgaaag gcaatgagct     5820

tcatccaagt atcaaagaat gttagcatct agagagctgt agttgctatt tcatttttag     5880

gaccaagagt tgggtgattt gggtgctaga atcaattcta ccagtaacca gacaactatt     5940

cccaagtcac ttaacccctc tgtgcctcag tttcctccag tataaaatgg ggtgacttta     6000

ttctagcttt cttttagact ttttgtgagg aagatatgaa agtatttatt catcaagggt     6060

gcaaatgtaa ggttttatat tctgttatca aatcaaagtg ctaaacttgg gaaattcatt     6120

gccaggttta tctgacacaa atggcatgtc ttcagtaaac aggcctgtta ctgatagtga     6180

gttctgatca atagcaatcc atcacctccc tgtgctaaac agaagtgggc tttttaatgt     6240

aacatatata aaattaatta gatattgcag cagatgtcat tttaaaggaa ctgtttcttt     6300

ctaagacaca actcccactg atgatttttt ctaaatagtt ttaagggtct tttcagagct     6360

cattgaagat ggatgtgctt ggaaaatgag tatttctttt ctcattctgc ctggtgatct     6420

ggctgagagt agatttggat tgggtttagg agtggcataa gggactgagt tgcaggctct     6480

gagacatgta ctggcttcac tcatttttat gaatgaatat ttgaattttg gaataccatg     6540

taagtcatgc ttactgttca ttctcctagg ccctgaattt caacagtgag actgtgccac     6600

aaaaatcctc ccttgaagaa ccggattttt ataaaactaa aatcaagctc tgcatacttc     6660

ttcatgcttt cagaattcgg gcagtgacta ttgatagagt gatgagctat ctgaatgctt     6720

cctaaaaagc gaggtccctc caaaccgttg tcatttttat aaaactttga aatgaggaaa     6780

ctttgatagg atgtggatta agaactaggg agggggaaag aaggatggga ctattacatc     6840

cacatgatac ctctgatcaa gtatttttga catttactgt ggataaattg tttttaagtt     6900

ttcatgaatg aattgctaag aagggaaaat atccatcctg aaggtgtttt tcattcactt     6960

taatagaagg gcaaatattt ataagctatt tctgtaccaa agtgtttgtg gaaacaaaca     7020

tgtaagcata acttatttta aaatatttat ttatataact tggtaatcat gaaagcatct     7080

gagctaactt atatttattt atgttatatt tattaaatta tttatcaagt gtatttgaaa     7140

aatattttta agtgttctaa aaataaaagt attgaattaa agtga                     7185


<210> 15
<211> 1444
<212> DNA
<213> Homo sapiens

<400> 15
atttcgcttt cattttgggc cgagctggag gcggcggggc cgtcccggaa cggctgcggc       60

cgggcacccc gggagttaat ccgaaagcgc cgcaagcccc gcgggccggc cgcaccgcac      120

gtgtcaccga gaagctgatg tagagagaga cacagaagga gacagaaagc aagagaccag      180

agtcccggga aagtcctgcc gcgcctcggg acaattataa aaatgtggcc ccctgggtca      240

gcctcccagc caccgccctc acctgccgcg gccacaggtc tgcatccagc ggctcgccct      300

gtgtccctgc agtgccggct cagcatgtgt ccagcgcgca gcctcctcct tgtggctacc      360

ctggtcctcc tggaccacct cagtttggcc agaaacctcc ccgtggccac tccagaccca      420

ggaatgttcc catgccttca ccactcccaa aacctgctga gggccgtcag caacatgctc      480

cagaaggcca gacaaactct agaattttac ccttgcactt ctgaagagat tgatcatgaa      540

gatatcacaa aagataaaac cagcacagtg gaggcctgtt taccattgga attaaccaag      600

aatgagagtt gcctaaattc cagagagacc tctttcataa ctaatgggag ttgcctggcc      660

tccagaaaga cctcttttat gatggccctg tgccttagta gtatttatga agacttgaag      720

atgtaccagg tggagttcaa gaccatgaat gcaaagcttc tgatggatcc taagaggcag      780

atctttctag atcaaaacat gctggcagtt attgatgagc tgatgcaggc cctgaatttc      840

aacagtgaga ctgtgccaca aaaatcctcc cttgaagaac cggattttta taaaactaaa      900

atcaagctct gcatacttct tcatgctttc agaattcggg cagtgactat tgatagagtg      960

atgagctatc tgaatgcttc ctaaaaagcg aggtccctcc aaaccgttgt catttttata     1020

aaactttgaa atgaggaaac tttgatagga tgtggattaa gaactaggga gggggaaaga     1080

aggatgggac tattacatcc acatgatacc tctgatcaag tatttttgac atttactgtg     1140

gataaattgt ttttaagttt tcatgaatga attgctaaga agggaaaata tccatcctga     1200

aggtgttttt cattcacttt aatagaaggg caaatattta taagctattt ctgtaccaaa     1260

gtgtttgtgg aaacaaacat gtaagcataa cttattttaa aatatttatt tatataactt     1320

ggtaatcatg aaagcatctg agctaactta tatttattta tgttatattt attaaattat     1380

ttatcaagtg tatttgaaaa atatttttaa gtgttctaaa aataaaagta ttgaattaaa     1440

gtga                                                                  1444


<210> 16
<211> 253
<212> PRT
<213> Homo sapiens

<400> 16
Met Trp Pro Pro Gly Ser Ala Ser Gln Pro Pro Pro Ser Pro Ala Ala 
1               5                   10                  15      


Ala Thr Gly Leu His Pro Ala Ala Arg Pro Val Ser Leu Gln Cys Arg 
            20                  25                  30          


Leu Ser Met Cys Pro Ala Arg Ser Leu Leu Leu Val Ala Thr Leu Val 
        35                  40                  45              


Leu Leu Asp His Leu Ser Leu Ala Arg Asn Leu Pro Val Ala Thr Pro 
    50                  55                  60                  


Asp Pro Gly Met Phe Pro Cys Leu His His Ser Gln Asn Leu Leu Arg 
65                  70                  75                  80  


Ala Val Ser Asn Met Leu Gln Lys Ala Arg Gln Thr Leu Glu Phe Tyr 
                85                  90                  95      


Pro Cys Thr Ser Glu Glu Ile Asp His Glu Asp Ile Thr Lys Asp Lys 
            100                 105                 110         


Thr Ser Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn Glu 
        115                 120                 125             


Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly Ser Cys 
    130                 135                 140                 


Leu Ala Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser Ser 
145                 150                 155                 160 


Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val Glu Phe Lys Thr Met Asn 
                165                 170                 175     


Ala Lys Leu Leu Met Asp Pro Lys Arg Gln Ile Phe Leu Asp Gln Asn 
            180                 185                 190         


Met Leu Ala Val Ile Asp Glu Leu Met Gln Ala Leu Asn Phe Asn Ser 
        195                 200                 205             


Glu Thr Val Pro Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr Lys 
    210                 215                 220                 


Thr Lys Ile Lys Leu Cys Ile Leu Leu His Ala Phe Arg Ile Arg Ala 
225                 230                 235                 240 


Val Thr Ile Asp Arg Val Met Ser Tyr Leu Asn Ala Ser 
                245                 250             


<210> 17
<211> 15708
<212> DNA
<213> Homo sapiens

<400> 17
tatgattaca aagaagagtt tttattagtt cagcctcaga atgcaaaaat aaataaataa       60

ataaacaaac aggaaacaaa tgtaatcact ttacagagcg cacatacatt acttaaaagt      120

agcaccttca tggagccata ttttctggtc ataattgtgt atcaggttca ttcatgctaa      180

tgagaaaggg attccagatt ttctttgcat ctgtctgctt ctcacagggc tgttaagaag      240

ccacctgcca ttctgacaat ttcatgtcct tagccataac tacttgtcct ctctcttgaa      300

tcttaagatc tttttgcctt ccagacactt acggtgtttc tgtgtcatcc tcctgtgtct      360

tttagagagg tgggggtgag gactgccatg gaagctaaag ctgaatttta atttcaatca      420

tttttttttt tgagacaggg tctcattctg tcacctaggc tggtatgcag tggcacaatc      480

acggctcacc atagccttga cctcctgggc tcaagagatc ctcctgcctc atcctcctga      540

acagctggga ccacaagcat gagccaccac gcctggctaa ttttaaattt tttttgtaga      600

gacggggtct tgttatgttt cccaggctgg tcaatcattt ttttctagcc cttttaaaat      660

tcaggcatcc aaagattaaa cttgcttaga agtgtaagtg gccctaaatt gctttatcaa      720

caccatctcc aggaagtctt cacagacatg ggaactagca tcttgttctc ctggatcttc      780

tcaacaggtt tgcattgtca ggtttccatg taagtatctc ttgcgttccc atccatcaca      840

agtgtatgat ggcactggta tcagaacact gcattcttcc tgattgtcat aaaactgatg      900

tacttgcagc cttgcttgaa aagttgtcag tacaaataaa attaaattca cattttgcat      960

aatagggact gatcctgatg gatcaggtca taagagtatg aaacattcca tacatcctgg     1020

cagacaaacg ttaaataaca gtaatatact aataaataca taaattactt aaatatttaa     1080

atagcatgaa ggcccatggc aacttgagag ctggaaaatc tatacataaa ttagctgatt     1140

gtttcaatga gcatttagca tctaactata caaatacagc aaagatatca ttgtgatcct     1200

aaaaaaacgt tttaaagcaa atcagataga aattatcttt ttgggtctat tccgttgtgt     1260

ctttaaacat tttgcttaat atcttccact tttcctccaa attttcatcc tggatcagaa     1320

cctggaagag aatgccaaaa gttgatgtgg ggtgacattg taacagcaat gtctcttctt     1380

atttctcaca acatatgatc ctgggcaact gggtttcagg gatttcatgc cagaaggccc     1440

aggccttcct tatgtggcct ggaatttggc tggcaccatg cttgccagag gctttcttga     1500

gggttttctt aaaataatat ctgattgtgt tacttccttg ctgaaaaccc ttcagtgggt     1560

ttcagggccc ggggccccca gaacaagatt ctgagtcctg caagcttgca agtcctccat     1620

gctctgcctc ctggctacct ctctcttttc tttgcctttc tctttaggag gccagaaccc     1680

cggtctgttt tctttcctgc aatatccctg tggccagcac agtgtcctac ataacaaagg     1740

caccaaataa atatctgtta gtgaataaat gtatgtttct gattctggca actgggtgct     1800

ggccactcca tccccctttc ctctccaaca cagcccccaa tcatatccct gcatccaggt     1860

gcactgagag tgcaggcctg ggctggcctt tgagggcctg ctcacctaac tgcagggcac     1920

agatgcccat tcgctccaag atgagctata gtagcggtcc tgggcccgca cgctaatgct     1980

ggcatttttg cggcagatga ccgtggctga ggtcttgtcc gtgaagactc tatctttctg     2040

caaaagagaa ggaaagctgt gaagacccct tggcaacata gtcacagggt aagctgagcc     2100

tgtttctgca atgcatactc tcccaaaaca agcccatctt ggtcttaggg cactgtgctt     2160

gcaattcaca ggggtgtgca tagaacttgc accacctact ggcagacact ccacatgtag     2220

gtgcagcttt tgtactttgt aagcccttga gggagaaatc gtttggccca gctttcatct     2280

ttctagcaca attgccttgc ctcaagaccc catggccact cacccagtcc tgagctgatg     2340

gtgagagaga gacagatttg ctctagccat ccctgagata ttgagaaata ccatatcctg     2400

atcatttcat cagaaaactt gccttcaaat tctggcactg ctacttaata gctgtgtaac     2460

ttcaggaaaa tgtcttaggt tctctgtgtc tgtttcctca cttataaata gggataacaa     2520

taatgcctac ttcatagaat tatagttcaa ggtaaaaatc acgtcaaact cttagcaagt     2580

ctttagcaca taggaagcac tcaatatcac ctattagtca tacagatctt aaatagggaa     2640

agtacttgcc aagatgtaaa ataatattta ggtaaatatc tattccagga tagcctccct     2700

acctaattat tttcccagag agtaactagc tcactgaatt tctaccacat gctaaatgct     2760

atgctgaatt agggctttgt ccagtgattt taaaagtggg gtgaaaggag tctggggcgg     2820

tacaaaaggg cctctggaac cttgcaacag gcaaaggaat tctgctgtaa ggtgaggaag     2880

ctgggaagcc aatatcttag cctctataag tgtagacatt ctgtttagta aaataatttt     2940

ataatatctg gaacagccag gagctatcca ttttgggggc gatatctctt gcctgttcta     3000

tcattcatta catgcactca gttgaaacaa caattttagg cttctggagc ccaggtctcc     3060

tctatcagga ctaattctga gtgccaagat caatgaccaa catcagaggt agtggagtgg     3120

ttaggtgcat ggcctttctg tctggcagat tgagtttatg tcttggttat gtcatttcct     3180

agctgggtga ccttggtaaa gtcacttaac ctctctgagt cttcaatcac ttgtgaaatg     3240

atgataatac tactggctac caaccatcct tttcttgggg ttgggctact gtccaatgag     3300

cacgtagtga gggcagtgct gctaacacct acacaaaatt cctgcatcag ctacagcttt     3360

actttacctt gccacagttt ctggaaaaaa ggaaagcctc ttttccacaa aaaagggggt     3420

aaaaaaacaa gaataacatc agctaccttt gttgcgttaa ttttgtagat taagtgaaat     3480

aaacatggaa aacccttggc acagctctag acacatagaa aagtgctaag aaaagtaatt     3540

atgcatcaca taataacatt tcagtcaaag aaggatcaca tatatgacca gtggtcctat     3600

attgttataa tattgtattt ttgctatatc ttttctgttt agatatacaa accattgtgt     3660

tgtaattgcc tacagtattc agtacaggaa catgctgtac aggtttgcag cctaggagca     3720

ataggctata ccataaagcc tagaagcgta gtaggctata ccatctaggt ttgtgtgggt     3780

acaccatata tgtttgcaca atgatatagc tgcctaaaga tgaatttctc agaatgtatc     3840

cctgttgtta agtgatacat ggatgtcatt gttattatca tcattcatct tctcacttta     3900

tggtgggcct ccgtaaaact gaccaaggaa tatactgcac ctgaatcact tcttaccttt     3960

tctctcttgc tcttgccctg gacctgaacg cagaatgtca gggagaagta ggaatgtgga     4020

gtactccagg tgtcagggta ctcccagctg acctccacct gccgagaatt ctttaatggc     4080

ttcagctgca agttcttggg tgggtcaggt ttgactgtgg aagaggataa acatgcttta     4140

ttttcctaat aaggctttgg agttcagtgg ttttggctta tgatctcact tgcacctttg     4200

atcagctgtg agtccactgg atagttactt aacttctgtg agcacctggc tggcaaatgt     4260

gaggcacctc tgccgcaggt gctcacgctg gtgcctcggg aaggcagtca ttatcaatgg     4320

ccacatctca gatccacatg tattgcagga tctgagagat gctcagaatc ctcatcactg     4380

aacaatcaga tgggaccgca cagtgagaag acctctgctt taatggttat gggccatgca     4440

ttgaaggacc accctgtctg tgctaatccc tcactttgca ctgaacatgg aactaagctg     4500

agcctctccc tggggatgag atgatagatt ttctatttac tgccctttct tttgtctttt     4560

catagctttt ggtgcggaca tgtcttggag cagttacagt caattgtctc tatgctcaat     4620

ttgcttgttt attccatgaa tactgagcgc ccattatgtt ccagccactg gactgtgcat     4680

gaaggataca gcagtgagtt tcacaaagac tccttcctca attaggtctt ctagataaga     4740

ggaggcagag acatgctagg gtgtgtggtt gggctctgta ggcctggcac tgctccacac     4800

aaagatcacg gctcctagca ggtgcttcgc acatatgctt ccctctgctc ttaacacaac     4860

tgacatcttt ccttttggtg tcagctaaat gtcacctcct cagagaggcc ttccttggcc     4920

accctatgtg aactagaccc cacccaccaa gtacaaaccc tccataagtt ttgaagcaag     4980

ttctctggtt ctattacact ttatcatgct gcctggtcaa tgaataaata gtaaaggaaa     5040

tgacttatgg tgaaatccct atctgccaat taaaaaaaaa gatgtccatt ttggaagggt     5100

ttatattcta agaaagaagt cccttttcaa atgctaagga aggattggtt gggaataatt     5160

gccattagta aggaagccct ggttctgaat atgaatatta tgcttgatta ggctctggga     5220

ttcatcatgg aaactatctg gcagctgaat caaagtgcag taaagagcag ctgtagtgga     5280

gtggatctgg atttgaacgg ctagctgtaa gatctgagat ctggggcaag ttgcttaact     5340

tttctaagac tcaatttcct cttccttaaa atgcagataa cagtatctac atcgtatttt     5400

gtttttggat gcagaataaa agagatgatg cttgtcaacc acctacccca cagtgcatgg     5460

ggcattgtga acacgtgaca aatagtatct ttccttatgg agcacatata atcatccaaa     5520

actcactgat gtccctgatg aagaagctgc tggtgtagtt ttcatacttg agcttgtgaa     5580

cggcatccac catgacctca atgggcagac tctcctcagc agctgggcag gcactgtcct     5640

cctggcactc cactgagtac tcatactcct tgttgtcccc tctgactctc tctgcagaga     5700

gtgtagcagc tccgcacgtc accccttggg ggtcagaaga gctgaagtca aagacagaaa     5760

ttagcctgtg ttacacattg gggagagagt tcctagtgat tgtagccagt aaggcaggta     5820

aggcctcaac tgttgtctga ggacacagtt tctccaactg ggctgatttc tacccagagg     5880

gtaagaaact gccctcccca ggagaaaaag cttaaattca agagcactta catgtctctg     5940

gactaaatga atatggaagt ttttgtttgt tttagatcta gaatctctga ttattaaacc     6000

cccttgttaa aaacttaaat tcattttttc ccattttaat tataaaatca atacacaaat     6060

cagtacacac taattataaa agtaaagatg ataaagatcc atataaagga aagaagtgta     6120

tgtctccttc atccttaact gctctccaca ggtttaagta ctaataatag cttgttgtat     6180

ccttccagac cttccttttt ctttttcttt ctctctctct ctctcttttt ttttttaaga     6240

gacagggttt tgctctgcca cccaggctgg agtgaagtgg tggaatcaga ccccactgta     6300

gcctcaaact cctaggctca agtgatcccc tccctctgcc tcccaagtag ctgggactac     6360

aggtgtgtgc caccacaccc acctaatttt tttattttta atattttctg agatggggtc     6420

tcactgtgtt gcctaggctg gtcttgaact cctggcctca agcaatcatt ctgtctcagc     6480

ctcccaaaga gttaggattt caggtgtgag ccactgtgcc tggccagaac tttttcaatg     6540

aatattcaag ataattgtat acacatttta tatatatata tatatataca cacacacaca     6600

cacacacata tgtatacaca cattatatat ataatccatg ttatatacat ctctacatta     6660

tatatatcca ctatatatat tttacttata catatagatt ttatttttat gaactaggat     6720

caaattgtat acatatgatt atgtaaccat ccttttcccc tctgaacata ttatggggag     6780

ctctccatga caaaacatat gggttgggtt atccacttca atgactgcac attaagcaag     6840

agtatagtgt accatgtttt atttaaccat tcctctgctg attatgtctt tatgcacttg     6900

gagaaacatt tctttagtaa gcattttcct tttaaagatg aaaaagtgag accccaatgc     6960

ttaatttact cagtgaaata atggtaaagt caggatgatc acctggggtt tgcttcggtg     7020

atgattaaag taagccacat gggggttaac acataggtct tgtatttatg gaagttgctt     7080

tcttacggaa agtccaggtg cattggacca acagcgaact tcagattagt ggtgtcagca     7140

gaagtaaaag gagttgaggt ggccctggga aaatttcggg aggcctcatc gagttttgga     7200

gtctgcttca gggcccctaa gatctacgcc ctggagctct tgtttttatt tttgactcaa     7260

ggtgcaattt cagcaagtca tttgtagctt tgaattctcc gtttatccct ttctttggtg     7320

ctatgaggct tcaggaagca tggccaggca atttggatga gtgggttcaa acacagcaga     7380

gactattctc agttcccaat aatatcctgc cccaacacat acatttattc aagcaatcgg     7440

ctaaaatctc catgttcttt cttcaatgta gacaaggaag ggcggttgaa aaaacacggg     7500

tttgactttg cttttcccat tatcattaac tgataggtca ctgagaggtt gcccttaatt     7560

tcttaatgaa atagttctag aaaaatgctg agaaaccaga gcagtttcac tcaccctctg     7620

ctgcttttga cactgaatgt caaatcagta ctgattgtcg tcagccacca gcaggtgaaa     7680

cgtccagaat aattcttggc ctcgcatctt agaaaggtct tatttttggg ttctggattt     7740

gaaaaaaaca aaaattcaat atttaggagt tttttgcaga aaggttttga ttgtgatgaa     7800

tcacagatca ccagatggta cagggtgaag ttcttccatg cgttgcctct tgggtagctc     7860

cataaggatt gggagtatcc cagtggggtg ttgattctgc agagtacact acacacctaa     7920

catgcctggc agaggcattc attttttgaa ggctatcacc ccagtgcaga gtgatacact     7980

atcggttaat ttccattagc cattaaataa tatcaccatt atttaaagaa aacatgaggc     8040

taagcattca gactgatcat gaaactccct taatatgaga ttttgatggt tgataaccca     8100

aagggtccag gaaagcaaag gaaaatggag gttaacatca attaacatca ataagagact     8160

tgatgttaat tcattacact caccatgact tggcttttca atttgttgtt gttgttgttt     8220

ttaactctta tgagcgaaag agaaaattga tactatccaa gggtatagaa ttacctttct     8280

ggtcctttaa aatatcagtg gaccaaattc catcttcctt tttgtgaagc agcaggagcg     8340

aatggcttag aacctcgcct cctttgtgac aggtgtactg gccagcatct ccaaactctt     8400

tgacttggat ggtcagggtt ttgccagagc ctaagacctc actgctctgg tccaaggtcc     8460

aggtgatacc atcttcttca ggggtgtcac aggtgaggac caccatttct ccaggggcat     8520

ccggatacca atccaattct acgacataaa ctggaatgca cataaagtgg agaaccaggc     8580

tgtaagctcc aggaactctg ggactgtgtt ttattaaccc ttcgggcatc cccattgcct     8640

aaacacaacc ttgtttacaa caagtatttg gcaaatgctt gctgagatga tctcaaggca     8700

tgagaccctg cttgggctca aggtcatgga aaagagaaac aaagagtttt atagctgcca     8760

tcgactcatt gactggaagg ctgcctttaa tagtaacctt tgattattta gcagattgga     8820

aacaccttaa tataccaaaa actgcaaaca gcacaagact ctttgccaaa ggtctggggg     8880

agagaaactt ccagcacaat ttcagtttca tagagaatac ggcagggcac aatattcagc     8940

agagtaacat agtggttaaa agctcagggt gtcgagaaca acgaaccaag actgtcatcc     9000

tgtctccact aaccagctgg gggatttgga acaaggtatt tcattatcat gagcctcagt     9060

ttcctcatct gtaaaatgat aataataaca gtatctgcct tacatttgac tgaggattaa     9120

atgaaaaaaa aaaaaagcac gtaaagtact tagcacagtg tctgccacac agtaaattcg     9180

gtgttagtta tcgttactta tagactgagg agtcagccaa ctgtacagag aaactctctt     9240

aacaattttc catggatatt taaggatttc gttccctctg ttttaaatca ccagtggaga     9300

ttttcattct ctctgcatta ttattattat tattattatt tttagctatc ttacactctt     9360

atgaagcagt ccagtagagc ttagtcttcc catttaatga agaagcgtac tgaggccaac     9420

gatctaagca tggtcacagc aagtcagaag tacaagggct acagctcaga ccttttgtct     9480

cttgggcttt gcaagggatg cctaatgcta gtgtctaaac tggcctttga ggaatggctt     9540

agtatagtat ttcagagtgt gtcatagcaa agcttcattc atttttttaa tccatgcata     9600

aattattaat tgaatagaat tgtttaatgt agtagttccc aaagcatgat cgatgaaaca     9660

ccatcagcag catcaccttg gaatagatca gaaatgcaaa ttctcagctg ctgaatcaga     9720

aaccttggga tggatttagt tatatggttc ttaagctctc cagatggtgc tgatgcctgc     9780

tcaagtttat gagtcacagt ggttgagagc ctgcattttg gagtggcaga ggcctggatt     9840

tagaagcagc tctagccctt gctccagtct tgtggccttt ggcaagacat tcaactactg     9900

taagcctcag tgtcatcatc tttaaaatga gaaaaatatt agcccttgct ttagaaggtg     9960

gatatgtgga ttatataagc tcatacacag tgcccttaca tgctttataa atggtggttc    10020

tcgttattgg cctctggatg taaagcattg tagaatataa agttttcact ttacatcaat    10080

tggattataa ggactgaatg gataacaaaa ataagaggtc aaaagcattt ggggtccaac    10140

tctggggaaa tgattagctg tgctgatgta aaaccatgtg gcacttttgg ctttttcttg    10200

gttttttttt tgtccccctg acacctgaag agcatggtta ctttcctgtc ccaactgagg    10260

ctcggaagcc ctagaagatg tttgctcaac agcagggaaa acagctgcag gaggaaagct    10320

gaagaggagc atcaagaacc tggcacttag cctcactgca tcgtctgccc tcctaagtca    10380

cacctcggag agtttctcac actgattttc tggaatttct gtggtttacc aaatcagaag    10440

tcactaaggg gtcaaatatg ccaggaccaa ggttcctgag ctcgctaaca atcccagggt    10500

tggaagggat ctgccatata cccctggctc aggcctttgc ctttacttaa gagagtgtct    10560

aaatctctca aattctcatg gagaaggggt ggctggattc tgtcttccta aaattcttga    10620

gaagagagga gtggtcccag atttctttca ggtacccatt tcaagatggt actttctaga    10680

atatcaaaag attatttctg atcatcagct gaaatatttt catccttaat ttatgtccat    10740

ttcctcttat tcagctctcc aaagacctgg gaaaaaatta cttcatgtcc ttgtaaaaac    10800

gttttcacaa atttaagtgc accagctttg aagcaagaca gaattgggat caagtccctg    10860

ttccaccctt accaggtatg tgacattggg caagttactt aatctctctg gcctcagttt    10920

ccaacatctg taaaatggga tacagacagt ctctacttca tatggctctt gtgagaatta    10980

agtgagacag tgcactcaat aaacattaac tattattatt atacaattaa agatagtaat    11040

cgccaacatt atgatgttgg cttcagtgat aaaaatctga gggtctttca ggtacagcca    11100

acggtcagaa tgcacattta ctgagtgtga gatactagaa ggatgggcag aactccttat    11160

tttaccatta actattctat tagtctacta ataattttta ctgatcattg ggataataaa    11220

aagtatctct tgaatcatga aagatatttt aaacaaccaa aatagtaata ctagctgaca    11280

tttactttca ctttactatc tgtcgagttg tattctgcat tcattaaccc actaaataac    11340

ccacgtggta ggcatccact ttgtaacatc cagtttacag ttgaggaaat tcaggattag    11400

aggtgaccca gcttaaagtc acataaccag tcattgacag acctaggact tgaacccaag    11460

tcaatgtgag tctgaatcct atgcttcctc caaagtgctt ggtcagaggt aagtcatcac    11520

agtgccaact gagggggcag tatcatgtgg tggaatgaga aacgaacaca gttgttatac    11580

tttgcaaatc ctctactttg gaattttaat tgtgtgaaac agtttttagt aatattggaa    11640

aatatcatat gtcatatgtc tttccccacc atatcatata tcatatgtct ttccccacca    11700

tagtgtacca atgtcatcca atgtgaaaac caaacaatat agataccatc aaagtagaaa    11760

agagagatat ggcattgcat tttcccagag tgatggcttt ctctagagga cccagtgcat    11820

ggttgcccat ttcagtttga tttcccacca gtggctgccc aagagtcctg gcttagaagt    11880

ggggcctcca cagagaataa tgagaggctg gttaccatct ttcttcagtt cccatatggc    11940

cacgagggga gatgccagaa aaaccaggga aaaccaagag atgaccaact gctggtgaca    12000

catctataag aagggagaga agcaggggga agagaaggag gaaaagagaa ggaagaggaa    12060

gaaaaaagag aagaaagaat gtcaataatt cttgttacct ttgtgaacca tatacacatt    12120

ctagctactt tttttttaaa taaaatcttg attctcccct ttcttctcca tactgtcaag    12180

caaatactaa gtaaattaca ggagttctat ggcagtctgt ggggaagaac atctctaaat    12240

agagaaccat agggtatttg gggctctgga gctgaaagag acctaaggca agccatctga    12300

tacaatcctc ttatttttaa gatgagaaac ttaaagctta gagaaggaat gtgactttct    12360

ggatcaacat ctagcagttg tttatttagt gcttactaca taaagagcac tgggctagaa    12420

gcagttgaga gagaaaaaaa gggcttacct ggatcccgct tcctaggagc aaatactttt    12480

actcaataaa tatttattaa gtcagtgtgt taaatgcctc cccgccagct ggggagatga    12540

gccatacatt tgtatcactg cagcacaaag cagtgtgtgc tggagcaccc agaactgaag    12600

gacttgggtt agggacagga acggtaatac agaggcgaac tttcaggttc tggcaacgac    12660

ctggtcacca gcccttgctg taggggttta gcttctcttg ttttccaagt tcaaagacta    12720

ctctctccca tatagagaac ctagtggttc taaaatttga gtgactgtca ggataacctg    12780

gaagcactgc tacaacagac ggctgagtcc cacccccaga gtgtctgatt cagcaggcat    12840

gagggcctga gaatatgcat ttctagaaag tttccagggg aagcagatgc tgctggcgct    12900

aagaccacac cttgagaacc actggtttaa cccatgttca gggtccagaa cccctgaaaa    12960

tcaaagtgtt cattgggaaa agaccacaac atattgatat atccaccatg aaatagtaaa    13020

atgcataatt ttatgttaaa taagccacca tctgatatag aaaccagtct aggttagtat    13080

ccagggttta tgaaaatctt ccgattgcaa aaattacatg ccacttacga atgaggtagg    13140

gatgattatt gaaacattct gaaacaactc cattcacctt gcagagcacc catggtctgg    13200

aggaaagtgg ttctgaaacc actgttttaa actatatgaa gggcaatgct caactgtttc    13260

agtcaaatac cttaaaaatg agcattcctg ggttgggtga cggaatattg acaaattaca    13320

gctttgtcag aactgctact aactctaggc ggaccttgct atgtacttta ttcccttata    13380

aagtttgtga gtggcagaga caggcctaga agtcaagcct tcttggacac tgctcagtgc    13440

tgtcaccaca gcatggagtt ttaggggcat ggctatcact tcagtctagg ttatgactgt    13500

agcaaagtac tagctggcac agggcaccgg ttagtacatt cacaggcttg aagttaaaca    13560

gacacaggct tgaattttgg tcccaccgct catttgctgt tgagcagtgg gagcaacttg    13620

ttggccaagt tactcgctga gcctcagtct ctttgtctat aaaatggacc taatacttat    13680

ctcaaaggct tgttgggaaa ggcaatgaga taacatatta tagaaggcaa ccaataacat    13740

attaacttga acctagagga agaggtaagg gaacaattcg gtatctgtgt tttatatatc    13800

atggatagtc aagaaagtca ttctaggcag tagggaaatg tgcacagagg tactgcaatt    13860

acatgcgcct gtaaaaccca gaggctggca cactggaagg gaaatagtgg agattcctgg    13920

catctcctgg ccacttacgc tgaggtattc ccactagctg tgtgcccagc acttcctctg    13980

catgcctcag atgcatttga caatctcagg tgaactgcac ttcagggtca agggaacccc    14040

ggccatggtt ctaagaagca actcccattt tagtatcacc tacatttgaa accacagagc    14100

actgtccagg agaggtgatg gtggtgggtc tcctcctttg gctctctggc ccatcagctg    14160

atactaggga gcgctatccc tcagcccaca tttctcagca tggtaagtct gagagtcttg    14220

atgattaaaa cacatttttg tgtgtgcaat ttcataggtt ttctaaactc ttgaaaccct    14280

tgcatgaagg acaaagggac cccacgttac acactcctac ttcctggagt atgctaaggg    14340

cttcccaaca ctggtgccaa catcagcatg ccaatgacgc ttgggtgggc atctcttttt    14400

taagttttgt ttgttacctt taattagaaa agggagttat tattaaagaa taaaaagata    14460

tagtcacaga tggcatgggt ggcacatgca aagatcccta gtttgagggc caggcagact    14520

tctagaactg tgtgactctg ggtgagtcac ttggcccctt taagccccac actcctgatc    14580

tataaaatgg gcatgctact gtggtaagga gttagtgaca taattcacgt gaagtgtcat    14640

ggtgcctaga tgaagtcagg acccactatt tgctcccctc cttccacagc atttactaca    14700

atggcaacaa tacagtctct tctttactgt ttgtcttctc tgccacactg agtgaactct    14760

cttatgccat ggatcatgtc tatcttgttt tgcactttac ccccagagtc cagcacagag    14820

ggattccatg acttattagt agcatgaagg aatgtacgga aatcatacaa tttcagattc    14880

cagtaaacta aactctaaga ggttacccac attccatcta gtatcttgca attccttgtt    14940

tgaaaaagtg ggtcatcacc tggggtttag taatcccttt gttaaaatga cattgttgtt    15000

gcacacaatt ttacctgggc aagtaaaaaa accttttttt cccccttctc acagatttaa    15060

aaactcttta ggaagcaata atagcttttc attttttaac tggggccaaa gttagttaat    15120

ccacaagaat ggggatccca gctgtcattt tggttgatat cacaactgac gaccaagacc    15180

atcacaaata tgggagcaag tctgatttgt aacattatta taattatgaa tccaattact    15240

ttaaggaatg cacgaaaggc tttttaaaaa tttcaatagt aaggcaggca ccactcactt    15300

tagctctata caatccttta actgatgggc attaacattt aggaatatac ttttatacat    15360

caatacacac cacttcaaaa taagataaaa caaacatttc cacaaactat gcatacacac    15420

agatcttgct atacagttat tattaggtgg cttttaaaat ggacactgta catcggtccc    15480

tagagtacta agttttcact aatgttccag cagatgactc accagggatg cttccaggaa    15540

tccaccaact ccctcctgcc agctgctggg gtagcacact aacggtttct acacctggac    15600

agctgcaggc ccacagggag gggagggagg taggggcttg ggaagtgctt accttgctct    15660

gggcaggacg gagagtccaa tggccctgaa acagatgttg tttcttct                 15708


<210> 18
<211> 2364
<212> DNA
<213> Homo sapiens

<400> 18
agaagaaaca acatctgttt cagggccatt ggactctccg tcctgcccag agcaagatgt       60

gtcaccagca gttggtcatc tcttggtttt ccctggtttt tctggcatct cccctcgtgg      120

ccatatggga actgaagaaa gatgtttatg tcgtagaatt ggattggtat ccggatgccc      180

ctggagaaat ggtggtcctc acctgtgaca cccctgaaga agatggtatc acctggacct      240

tggaccagag cagtgaggtc ttaggctctg gcaaaaccct gaccatccaa gtcaaagagt      300

ttggagatgc tggccagtac acctgtcaca aaggaggcga ggttctaagc cattcgctcc      360

tgctgcttca caaaaaggaa gatggaattt ggtccactga tattttaaag gaccagaaag      420

aacccaaaaa taagaccttt ctaagatgcg aggccaagaa ttattctgga cgtttcacct      480

gctggtggct gacgacaatc agtactgatt tgacattcag tgtcaaaagc agcagaggct      540

cttctgaccc ccaaggggtg acgtgcggag ctgctacact ctctgcagag agagtcagag      600

gggacaacaa ggagtatgag tactcagtgg agtgccagga ggacagtgcc tgcccagctg      660

ctgaggagag tctgcccatt gaggtcatgg tggatgccgt tcacaagctc aagtatgaaa      720

actacaccag cagcttcttc atcagggaca tcatcaaacc tgacccaccc aagaacttgc      780

agctgaagcc attaaagaat tctcggcagg tggaggtcag ctgggagtac cctgacacct      840

ggagtactcc acattcctac ttctccctga cattctgcgt tcaggtccag ggcaagagca      900

agagagaaaa gaaagataga gtcttcacgg acaagacctc agccacggtc atctgccgca      960

aaaatgccag cattagcgtg cgggcccagg accgctacta tagctcatct tggagcgaat     1020

gggcatctgt gccctgcagt taggttctga tccaggatga aaatttggag gaaaagtgga     1080

agatattaag caaaatgttt aaagacacaa cggaatagac ccaaaaagat aatttctatc     1140

tgatttgctt taaaacgttt ttttaggatc acaatgatat ctttgctgta tttgtatagt     1200

tagatgctaa atgctcattg aaacaatcag ctaatttatg tatagatttt ccagctctca     1260

agttgccatg ggccttcatg ctatttaaat atttaagtaa tttatgtatt tattagtata     1320

ttactgttat ttaacgtttg tctgccagga tgtatggaat gtttcatact cttatgacct     1380

gatccatcag gatcagtccc tattatgcaa aatgtgaatt taattttatt tgtactgaca     1440

acttttcaag caaggctgca agtacatcag ttttatgaca atcaggaaga atgcagtgtt     1500

ctgataccag tgccatcata cacttgtgat ggatgggaac gcaagagata cttacatgga     1560

aacctgacaa tgcaaacctg ttgagaagat ccaggagaac aagatgctag ttcccatgtc     1620

tgtgaagact tcctggagat ggtgttgata aagcaattta gggccactta cacttctaag     1680

caagtttaat ctttggatgc ctgaatttta aaagggctag aaaaaaatga ttgaccagcc     1740

tgggaaacat aacaagaccc cgtctctaca aaaaaaattt aaaattagcc aggcgtggtg     1800

gctcatgctt gtggtcccag ctgttcagga ggatgaggca ggaggatctc ttgagcccag     1860

gaggtcaagg ctatggtgag ccgtgattgt gccactgcat accagcctag gtgacagaat     1920

gagaccctgt ctcaaaaaaa aaaatgattg aaattaaaat tcagctttag cttccatggc     1980

agtcctcacc cccacctctc taaaagacac aggaggatga cacagaaaca ccgtaagtgt     2040

ctggaaggca aaaagatctt aagattcaag agagaggaca agtagttatg gctaaggaca     2100

tgaaattgtc agaatggcag gtggcttctt aacagccctg tgagaagcag acagatgcaa     2160

agaaaatctg gaatcccttt ctcattagca tgaatgaacc tgatacacaa ttatgaccag     2220

aaaatatggc tccatgaagg tgctactttt aagtaatgta tgtgcgctct gtaaagtgat     2280

tacatttgtt tcctgtttgt ttatttattt atttattttt gcattctgag gctgaactaa     2340

taaaaactct tctttgtaat cata                                            2364


<210> 19
<211> 328
<212> PRT
<213> Homo sapiens

<400> 19
Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
1               5                   10                  15      


Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
            20                  25                  30          


Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
        35                  40                  45              


Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
    50                  55                  60                  


Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
65                  70                  75                  80  


Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                85                  90                  95      


Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110         


Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125             


Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140                 


Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 


Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175     


Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190         


Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205             


Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220                 


Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Lys Asn 
225                 230                 235                 240 


Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255     


Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270         


Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285             


Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300                 


Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 


Glu Trp Ala Ser Val Pro Cys Ser 
                325             


<210> 20
<211> 708
<212> DNA
<213> Mus musculus

<400> 20
atggtcagcg ttccaacagc ctcaccctcg gcatccagca gctcctctca gtgccggtcc       60

agcatgtgtc aatcacgcta cctcctcttt ttggccaccc ttgccctcct aaaccacctc      120

agtttggcca gggtcattcc agtctctgga cctgccaggt gtcttagcca gtcccgaaac      180

ctgctgaaga ccacagatga catggtgaag acggccagag aaaaactgaa acattattcc      240

tgcactgctg aagacatcga tcatgaagac atcacacggg accaaaccag cacattgaag      300

acctgtttac cactggaact acacaagaac gagagttgcc tggctactag agagacttct      360

tccacaacaa gagggagctg cctgccccca cagaagacgt ctttgatgat gaccctgtgc      420

cttggtagca tctatgagga cttgaagatg taccagacag agttccaggc catcaacgca      480

gcacttcaga atcacaacca tcagcagatc attctagaca agggcatgct ggtggccatc      540

gatgagctga tgcagtctct gaatcataat ggcgagactc tgcgccagaa acctcctgtg      600

ggagaagcag acccttacag agtgaaaatg aagctctgca tcctgcttca cgccttcagc      660

acccgcgtcg tgaccatcaa cagggtgatg ggctatctga gctccgcc                   708


<210> 21
<211> 236
<212> PRT
<213> Mus musculus

<400> 21
Met Val Ser Val Pro Thr Ala Ser Pro Ser Ala Ser Ser Ser Ser Ser 
1               5                   10                  15      


Gln Cys Arg Ser Ser Met Cys Gln Ser Arg Tyr Leu Leu Phe Leu Ala 
            20                  25                  30          


Thr Leu Ala Leu Leu Asn His Leu Ser Leu Ala Arg Val Ile Pro Val 
        35                  40                  45              


Ser Gly Pro Ala Arg Cys Leu Ser Gln Ser Arg Asn Leu Leu Lys Thr 
    50                  55                  60                  


Thr Asp Asp Met Val Lys Thr Ala Arg Glu Lys Leu Lys His Tyr Ser 
65                  70                  75                  80  


Cys Thr Ala Glu Asp Ile Asp His Glu Asp Ile Thr Arg Asp Gln Thr 
                85                  90                  95      


Ser Thr Leu Lys Thr Cys Leu Pro Leu Glu Leu His Lys Asn Glu Ser 
            100                 105                 110         


Cys Leu Ala Thr Arg Glu Thr Ser Ser Thr Thr Arg Gly Ser Cys Leu 
        115                 120                 125             


Pro Pro Gln Lys Thr Ser Leu Met Met Thr Leu Cys Leu Gly Ser Ile 
    130                 135                 140                 


Tyr Glu Asp Leu Lys Met Tyr Gln Thr Glu Phe Gln Ala Ile Asn Ala 
145                 150                 155                 160 


Ala Leu Gln Asn His Asn His Gln Gln Ile Ile Leu Asp Lys Gly Met 
                165                 170                 175     


Leu Val Ala Ile Asp Glu Leu Met Gln Ser Leu Asn His Asn Gly Glu 
            180                 185                 190         


Thr Leu Arg Gln Lys Pro Pro Val Gly Glu Ala Asp Pro Tyr Arg Val 
        195                 200                 205             


Lys Met Lys Leu Cys Ile Leu Leu His Ala Phe Ser Thr Arg Val Val 
    210                 215                 220                 


Thr Ile Asn Arg Val Met Gly Tyr Leu Ser Ser Ala 
225                 230                 235     


<210> 22
<211> 1005
<212> DNA
<213> Mus musculus

<400> 22
atgtgtcctc agaagctaac catctcctgg tttgccatcg ttttgctggt gtctccactc       60

atggccatgt gggagctgga gaaagacgtt tatgttgtag aggtggactg gactcccgat      120

gcccctggag aaacagtgaa cctcacctgt gacacgcctg aagaagatga catcacctgg      180

acctcagacc agagacatgg agtcataggc tctggaaaga ccctgaccat cactgtcaaa      240

gagtttctag atgctggcca gtacacctgc cacaaaggag gcgagactct gagccactca      300

catctgctgc tccacaagaa ggaaaatgga atttggtcca ctgaaatttt aaaaaatttc      360

aaaaacaaga ctttcctgaa gtgtgaagca ccaaattact ccggacggtt cacgtgctca      420

tggctggtgc aaagaaacat ggacttgaag ttcaacatca agagcagtag cagttcccct      480

gactctcggg cagtgacatg tggaatggcg tctctgtctg cagagaaggt cacactggac      540

caaagggact atgagaagta ttcagtgtcc tgccaggagg atgtcacctg cccaactgcc      600

gaggagaccc tgcccattga actggcgttg gaagcacggc agcagaataa atatgagaac      660

tacagcacca gcttcttcat cagggacatc atcaaaccag acccgcccaa gaacttgcag      720

atgaagcctt tgaagaactc acaggtggag gtcagctggg agtaccctga ctcctggagc      780

actccccatt cctacttctc cctcaagttc tttgttcgaa tccagcgcaa gaaagaaaag      840

atgaaggaga cagaggaggg gtgtaaccag aaaggtgcgt tcctcgtaga gaagacatct      900

accgaagtcc aatgcaaagg cgggaatgtc tgcgtgcaag ctcaggatcg ctattacaat      960

tcctcatgca gcaagtgggc atgtgttccc tgcagggtcc gatcc                     1005


<210> 23
<211> 335
<212> PRT
<213> Mus musculus

<400> 23
Met Cys Pro Gln Lys Leu Thr Ile Ser Trp Phe Ala Ile Val Leu Leu 
1               5                   10                  15      


Val Ser Pro Leu Met Ala Met Trp Glu Leu Glu Lys Asp Val Tyr Val 
            20                  25                  30          


Val Glu Val Asp Trp Thr Pro Asp Ala Pro Gly Glu Thr Val Asn Leu 
        35                  40                  45              


Thr Cys Asp Thr Pro Glu Glu Asp Asp Ile Thr Trp Thr Ser Asp Gln 
    50                  55                  60                  


Arg His Gly Val Ile Gly Ser Gly Lys Thr Leu Thr Ile Thr Val Lys 
65                  70                  75                  80  


Glu Phe Leu Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Thr 
                85                  90                  95      


Leu Ser His Ser His Leu Leu Leu His Lys Lys Glu Asn Gly Ile Trp 
            100                 105                 110         


Ser Thr Glu Ile Leu Lys Asn Phe Lys Asn Lys Thr Phe Leu Lys Cys 
        115                 120                 125             


Glu Ala Pro Asn Tyr Ser Gly Arg Phe Thr Cys Ser Trp Leu Val Gln 
    130                 135                 140                 


Arg Asn Met Asp Leu Lys Phe Asn Ile Lys Ser Ser Ser Ser Ser Pro 
145                 150                 155                 160 


Asp Ser Arg Ala Val Thr Cys Gly Met Ala Ser Leu Ser Ala Glu Lys 
                165                 170                 175     


Val Thr Leu Asp Gln Arg Asp Tyr Glu Lys Tyr Ser Val Ser Cys Gln 
            180                 185                 190         


Glu Asp Val Thr Cys Pro Thr Ala Glu Glu Thr Leu Pro Ile Glu Leu 
        195                 200                 205             


Ala Leu Glu Ala Arg Gln Gln Asn Lys Tyr Glu Asn Tyr Ser Thr Ser 
    210                 215                 220                 


Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Lys Asn Leu Gln 
225                 230                 235                 240 


Met Lys Pro Leu Lys Asn Ser Gln Val Glu Val Ser Trp Glu Tyr Pro 
                245                 250                 255     


Asp Ser Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Lys Phe Phe Val 
            260                 265                 270         


Arg Ile Gln Arg Lys Lys Glu Lys Met Lys Glu Thr Glu Glu Gly Cys 
        275                 280                 285             


Asn Gln Lys Gly Ala Phe Leu Val Glu Lys Thr Ser Thr Glu Val Gln 
    290                 295                 300                 


Cys Lys Gly Gly Asn Val Cys Val Gln Ala Gln Asp Arg Tyr Tyr Asn 
305                 310                 315                 320 


Ser Ser Cys Ser Lys Trp Ala Cys Val Pro Cys Arg Val Arg Ser 
                325                 330                 335 


<210> 24
<211> 8239
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 24
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc      240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat      300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt      360

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgatatcact agtatttaaa      420

taacgttgag acgtccttaa tcgtcccgac gctaaacggc cgcgcacacc gcagccgcac      480

cccgcgctta tcctccagtt cgcgtaggac cggcgggtgg ttaaccaggt ccgcaaagtt      540

gcggagctcg gtaatcagcg gaggggtatg gtgggtgtcc ttgtataccg caaagaaaaa      600

gcagtggatt gtgccgctgg tctcacagga ggcgcggacc aggtaactcc gcacggccac      660

gcaagcggag tccgttttgc tggtgtgcat ggccgtttcg gcctgccagg tggcgttgag      720

gcagtaaggg ggggccacgt gggttatgtc cggggcccgt aagaacaggt tggtgagggg      780

ggtcgctgtc atagtgcaaa aggggggatg cgcccgggcg ggaagcccct aagggcacta      840

tgacaccggc cttggagcgg ggacggattt atacgttggg ttagttccct ccgcccaccc      900

aggccgtacg ccgggcccac ccccgccatc tgccgtgacc cacgccccgc cggccatgag      960

caaagaagga caacacgtgg ggcgatttgt ttgaaatgtt ttgtttttat tgtacctaaa     1020

acaaggagtt gcaatgaaaa tatttgccgt gcacgtacgg gggggcgacg atgtgactgg     1080

ccgtcaactc gcagacacga ctcgaacact cctggcggtg cgtgtctagg atttcgatca     1140

ggcccgccat gcaggccccg gggaggtaga aatgcatctt ctctccgacc ccgacaccaa     1200

gggtcgcgta gtcgatctcc gcgacgccac gctcgacgcg gttggcgagc ctggccagaa     1260

tgacaaacac gaaggatgca atgtccttaa tgtccgccag acgccgccgc gaacacagtt     1320

cgtccaggcc gcacaggcct cgcgccttca ggtagcactg gagaaagggc cgcaggcgcg     1380

tggcgaggtt atccagcacc gccgcggccg tcccgataat ggggtcctgg gggcgcagcg     1440

gcaggttgtg gtggatgcac atcttgcacc acgccagcgt ctcgtcggcg gaggccagcg     1500

cctcgatgaa attttcttgg cgcagcacgc agtcgcgcat ggccttggcg gtcgatgcgg     1560

cccgaggatt gccggcaaag tgcgatagag gctcgggccg tgggcgacca aggtttccca     1620

ggagacccgt ctggtctcgg cgtcaaaggg ccctccttgg cccgccagca ccggggccca     1680

ggggctattc gcggcgggaa acggctgccc cccaaagggg tcgtacatga cctgtgcgct     1740

gcggccggat cgatcttcaa tattggccat tagccatatt attcattggt tatatagcat     1800

aaatcaatat tggctattgg ccattgcata cgttgtatct atatcataat atgtacattt     1860

atattggctc atgtccaata tgaccgccat gttggcattg attattgact agttattaat     1920

agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac     1980

ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa     2040

tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt     2100

atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca agtccgcccc     2160

ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttac     2220

gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc atggtgatgc     2280

ggttttggca gtacaccaat gggcgtggat agcggtttga ctcacgggga tttccaagtc     2340

tccaccccat tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa     2400

aatgtcgtaa taaccccgcc ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg     2460

tctatataag cagagctcgt ttagtgaacc gtcagatcac tagaagcttt attgcggtag     2520

tttatcacag ttaaattgct aacgcagtca gtgcttctga cacaacagtc tcgaacttaa     2580

ggctagagta cttaatacga ctcactatag gctagcctcg agatgtgtcc tcagaagcta     2640

accatctcct ggtttgccat cgttttgctg gtgtctccac tcatggccat gtgggagctg     2700

gagaaagacg tttatgttgt agaggtggac tggactcccg atgcccctgg agaaacagtg     2760

aacctcacct gtgacacgcc tgaagaagat gacatcacct ggacctcaga ccagagacat     2820

ggagtcatag gctctggaaa gaccctgacc atcactgtca aagagtttct agatgctggc     2880

cagtacacct gccacaaagg aggcgagact ctgagccact cacatctgct gctccacaag     2940

aaggaaaatg gaatttggtc cactgaaatt ttaaaaaatt tcaaaaacaa gactttcctg     3000

aagtgtgaag caccaaatta ctccggacgg ttcacgtgct catggctggt gcaaagaaac     3060

atggacttga agttcaacat caagagcagt agcagttccc ctgactctcg ggcagtgaca     3120

tgtggaatgg cgtctctgtc tgcagagaag gtcacactgg accaaaggga ctatgagaag     3180

tattcagtgt cctgccagga ggatgtcacc tgcccaactg ccgaggagac cctgcccatt     3240

gaactggcgt tggaagcacg gcagcagaat aaatatgaga actacagcac cagcttcttc     3300

atcagggaca tcatcaaacc agacccgccc aagaacttgc agatgaagcc tttgaagaac     3360

tcacaggtgg aggtcagctg ggagtaccct gactcctgga gcactcccca ttcctacttc     3420

tccctcaagt tctttgttcg aatccagcgc aagaaagaaa agatgaagga gacagaggag     3480

gggtgtaacc agaaaggtgc gttcctcgta gagaagacat ctaccgaagt ccaatgcaaa     3540

ggcgggaatg tctgcgtgca agctcaggat cgctattaca attcctcatg cagcaagtgg     3600

gcatgtgttc cctgcagggt ccgatccggc ggcggcggga gtggcggcgg gggttctggc     3660

ggaggcctcg ctagcggtgg ctccatggtc agcgttccaa cagcctcacc ctcggcatcc     3720

agcagctcct ctcagtgccg gtccagcatg tgtcaatcac gctacctcct ctttttggcc     3780

acccttgccc tcctaaacca cctcagtttg gccagggtca ttccagtctc tggacctgcc     3840

aggtgtctta gccagtcccg aaacctgctg aagaccacag atgacatggt gaagacggcc     3900

agagaaaaac tgaaacatta ttcctgcact gctgaagaca tcgatcatga agacatcaca     3960

cgggaccaaa ccagcacatt gaagacctgt ttaccactgg aactacacaa gaacgagagt     4020

tgcctggcta ctagagagac ttcttccaca acaagaggga gctgcctgcc cccacagaag     4080

acgtctttga tgatgaccct gtgccttggt agcatctatg aggacttgaa gatgtaccag     4140

acagagttcc aggccatcaa cgcagcactt cagaatcaca accatcagca gatcattcta     4200

gacaagggca tgctggtggc catcgatgag ctgatgcagt ctctgaatca taatggcgag     4260

actctgcgcc agaaacctcc tgtgggagaa gcagaccctt acagagtgaa aatgaagctc     4320

tgcatcctgc ttcacgcctt cagcacccgc gtcgtgacca tcaacagggt gatgggctat     4380

ctgagctccg cctgagcggc cgcttcgagc agacatgata agatacattg atgagtttgg     4440

acaaaccaca actagaatgc agtgaaaaaa atgctttatt tgtgaaattt gtgatgctat     4500

tgctttattt gtaaccatta taagctgcaa taaacaagtt aacaacaaca attgcattca     4560

ttttatgttt caggttcagg gggagatgtg ggaggttttt taaagcaagt aaaacctcta     4620

caaatgtggt aaaatcgata aggatcgatc tccaggctac acgtggatta tcatggtatt     4680

tttcatttac atatgactat acatttcaaa tgggccttgc actcaactcg tttccagttt     4740

gcatatgccg ttatgcgcga ataatgcctg gatgtgacgt catacgtcaa acaggcgcct     4800

ctggatctcc tgctcgtagt gaagcgccac gagcaccacc ccggccacca cggcgatata     4860

acacaatcgc attgcgatgc ccgacaggat gatggaacaa cagcgcccgc agacgcccga     4920

cagccccttg gatcgccccg gggcggcggc cttgtctgcg ttcttggggg ccgggccccg     4980

ccgcagaata caatacagct ctgtcaggcc gatggtggag acaaaacacc aggtggtgat     5040

ggtcagaaac agggggtatg tgatcgcaca tgccccccgg gatatgaaag cggtgccgac     5100

gatgagaccc acggccacaa agcgtagcat caactcgcag cctacgatga ccccgatggc     5160

ggggcggtgg tacaagaagg tgaccgggtc cgtctcaaac aactgaacca ggttttgccg     5220

ctggaccgac agctcgcaga gcaggcgggt aattttcgtg taggggtact gcaggaacac     5280

gctcgatacg atgcggcctg cgtagttcaa gaggtaggag gccggggcca ccatcttgtg     5340

ggcgggactc acgacaccaa acatacatcg gcgttggtgg agggcgacga acgccagata     5400

caggaaccac cctacgacca ccagacgcac ccgtgtgtac catagggtct ccagacagtt     5460

aactgcctcg tggacgttca tgatccgacg attcgtggcg tcgggtggga cctggaaggg     5520

cacgacccta cccgcgataa gattggcgta gcagatatgg gcgtggttgc gccagccccc     5580

gttggggggg tgtgtcgggg cccccagaaa caatagggtc tggttcattt tcatccacac     5640

gagggcggtg tcgttgttgg tgccggtggg gcgtaccgcg taaatacatc ggtgcagcgg     5700

actggcaccg aagacggtgt accacacgag cacgaggccg tacgccgtta tcaagacgac     5760

ggttgagagg tgctgcaggg aacggacggc gagcatggcg tgccggcgtc aatggtaaac     5820

agcgtgtgca ggcggttgct gtcgcatttg gcggcaaagc actgctgaca caaggacacg     5880

cacaggcggt tgttggcccc gacgctcagc gcgacgaatg tccgcgccgt ggcgcgactc     5940

gcccggccgt gcttaaagcg cagacacgac aggcaacgtt atttaatact agtgatatca     6000

agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt     6060

ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc     6120

taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc     6180

cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct     6240

tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca     6300

gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac     6360

atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt     6420

ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg     6480

cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc     6540

tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc     6600

gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc     6660

aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac     6720

tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt     6780

aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct     6840

aactacggct acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc     6900

ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt     6960

ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg     7020

atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc     7080

atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa     7140

tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag     7200

gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg     7260

tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga     7320

gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag     7380

cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa     7440

gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc     7500

atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca     7560

aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg     7620

atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat     7680

aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc     7740

aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg     7800

gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg     7860

gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt     7920

gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca     7980

ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata     8040

ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac     8100

atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa     8160

gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt     8220

atcacgaggc cctttcgtc                                                  8239


<210> 25
<211> 60
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide


<220>
<221> SITE
<222> (1)..(60)
<223> This sequence may encompass 1-10 "(Gly)x-Ser"
      repeating units where x=2-5

<400> 25
Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
1               5                   10                  15      


Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly 
            20                  25                  30          


Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser 
        35                  40                  45              


Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser 
    50                  55                  60  


<210> 26
<211> 50
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide


<220>
<221> SITE
<222> (1)..(50)
<223> This sequence may encompass 1-10 "Gly Gly Gly Gly Ser"
      repeating units

<400> 26
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1               5                   10                  15      


Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
            20                  25                  30          


Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
        35                  40                  45              


Gly Ser 
    50  


<210> 27
<211> 19
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 27
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Leu Ala Ser 
1               5                   10                  15      


Gly Gly Ser 
            


<210> 28
<211> 57
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 28
ggcggcggcg ggagtggcgg cgggggttct ggcggaggcc tcgctagcgg tggctcc          57


<210> 29
<211> 33
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 29
gcgagtctcg agatgtgtcc tcagaagcta acc                                    33


<210> 30
<211> 33
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 30
atagaagcgg ccgctcaggc ggagctcaga tag                                    33


