                               SEQUENCE LISTING

<110> THE JOHNS HOPKINS UNIVERSITY
 
<120> AN INNOVATIVE DNA VACCINE FOR SARS-COV, SARS-COV-2, AND MERS-COV

<130> 0184.0148-PCT (P16311-03)

<140>
<141>

<150> 63/042,260
<151> 2020-06-22

<150> 63/007,608
<151> 2020-04-09

<160> 21    

<170> PatentIn version 3.5

<210> 1
<211> 1827
<212> DNA
<213> Homo sapiens

<400> 1
atgaagtggg taacctttat ttcccttctt tttctcttta gctcggctta ttccaggggt       60

gtgtttcgtc gagatgcaca caagagtgag gttgctcatc ggtttaaaga tttgggagaa      120

gaaaatttca aagccttggt gttgattgcc tttgctcagt atcttcagca gtgtccattt      180

gaagatcatg taaaattagt gaatgaagta actgaatttg caaaaacatg tgttgctgat      240

gagtcagctg aaaattgtga caaatcactt catacccttt ttggagacaa attatgcaca      300

gttgcaactc ttcgtgaaac ctatggtgaa atggctgact gctgtgcaaa acaagaacct      360

gagagaaatg aatgcttctt gcaacacaaa gatgacaacc caaacctccc ccgattggtg      420

agaccagagg ttgatgtgat gtgcactgct tttcatgaca atgaagagac atttttgaaa      480

aaatacttat atgaaattgc cagaagacat ccttactttt atgccccgga actccttttc      540

tttgctaaaa ggtataaagc tgcttttaca gaatgttgcc aagctgctga taaagctgcc      600

tgcctgttgc caaagctcga tgaacttcgg gatgaaggga aggcttcgtc tgccaaacag      660

agactcaagt gtgccagtct ccaaaaattt ggagaaagag ctttcaaagc atgggcagta      720

gctcgcctga gccagagatt tcccaaagct gagtttgcag aagtttccaa gttagtgaca      780

gatcttacca aagtccacac ggaatgctgc catggagatc tgcttgaatg tgctgatgac      840

agggcggacc ttgccaagta tatctgtgaa aatcaagatt cgatctccag taaactgaag      900

gaatgctgtg aaaaacctct gttggaaaaa tcccactgca ttgccgaagt ggaaaatgat      960

gagatgcctg ctgacttgcc ttcattagct gctgattttg ttgaaagtaa ggatgtttgc     1020

aaaaactatg ctgaggcaaa ggatgtcttc ctgggcatgt ttttgtatga atatgcaaga     1080

aggcatcctg attactctgt cgtgctgctg ctgagacttg ccaagacata tgaaaccact     1140

ctagagaagt gctgtgccgc tgcagatcct catgaatgct atgccaaagt gttcgatgaa     1200

tttaaacctc ttgtggaaga gcctcagaat ttaatcaaac aaaattgtga gctttttgag     1260

cagcttggag agtacaaatt ccagaatgcg ctattagttc gttacaccaa gaaagtaccc     1320

caagtgtcaa ctccaactct tgtagaggtc tcaagaaacc taggaaaagt gggcagcaaa     1380

tgttgtaaac atcctgaagc aaaaagaatg ccctgtgcag aagactatct atccgtggtc     1440

ctgaaccagt tatgtgtgtt gcatgagaaa acgccagtaa gtgacagagt caccaaatgc     1500

tgcacagaat ccttggtgaa caggcgacca tgcttttcag ctctggaagt cgatgaaaca     1560

tacgttccca aagagtttaa tgctgaaacg ttcaccttcc atgcagatat atgcacactt     1620

tctgagaagg agagacaaat caagaaacaa actgcacttg ttgagcttgt gaaacacaag     1680

cccaaggcaa caaaagagca actgaaagct gttatggatg atttcgcagc ttttgtagag     1740

aagtgctgca aggctgacga taaggagacc tgctttgccg aggagggtaa aaaacttgtt     1800

gctgcaagtc aagctgcctt aggctta                                         1827


<210> 2
<211> 609
<212> PRT
<213> Homo sapiens

<400> 2
Met Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala 
1               5                   10                  15      


Tyr Ser Arg Gly Val Phe Arg Arg Asp Ala His Lys Ser Glu Val Ala 
            20                  25                  30          


His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys Ala Leu Val Leu 
        35                  40                  45              


Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu Asp His Val 
    50                  55                  60                  


Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp 
65                  70                  75                  80  


Glu Ser Ala Glu Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp 
                85                  90                  95      


Lys Leu Cys Thr Val Ala Thr Leu Arg Glu Thr Tyr Gly Glu Met Ala 
            100                 105                 110         


Asp Cys Cys Ala Lys Gln Glu Pro Glu Arg Asn Glu Cys Phe Leu Gln 
        115                 120                 125             


His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg Pro Glu Val 
    130                 135                 140                 


Asp Val Met Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys 
145                 150                 155                 160 


Lys Tyr Leu Tyr Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro 
                165                 170                 175     


Glu Leu Leu Phe Phe Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu Cys 
            180                 185                 190         


Cys Gln Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro Lys Leu Asp Glu 
        195                 200                 205             


Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg Leu Lys Cys 
    210                 215                 220                 


Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val 
225                 230                 235                 240 


Ala Arg Leu Ser Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser 
                245                 250                 255     


Lys Leu Val Thr Asp Leu Thr Lys Val His Thr Glu Cys Cys His Gly 
            260                 265                 270         


Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu Ala Lys Tyr Ile 
        275                 280                 285             


Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu Cys Cys Glu 
    290                 295                 300                 


Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp 
305                 310                 315                 320 


Glu Met Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser 
                325                 330                 335     


Lys Asp Val Cys Lys Asn Tyr Ala Glu Ala Lys Asp Val Phe Leu Gly 
            340                 345                 350         


Met Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp Tyr Ser Val Val 
        355                 360                 365             


Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys 
    370                 375                 380                 


Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu 
385                 390                 395                 400 


Phe Lys Pro Leu Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys 
                405                 410                 415     


Glu Leu Phe Glu Gln Leu Gly Glu Tyr Lys Phe Gln Asn Ala Leu Leu 
            420                 425                 430         


Val Arg Tyr Thr Lys Lys Val Pro Gln Val Ser Thr Pro Thr Leu Val 
        435                 440                 445             


Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His 
    450                 455                 460                 


Pro Glu Ala Lys Arg Met Pro Cys Ala Glu Asp Tyr Leu Ser Val Val 
465                 470                 475                 480 


Leu Asn Gln Leu Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg 
                485                 490                 495     


Val Thr Lys Cys Cys Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe 
            500                 505                 510         


Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys Glu Phe Asn Ala 
        515                 520                 525             


Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser Glu Lys Glu 
    530                 535                 540                 


Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys 
545                 550                 555                 560 


Pro Lys Ala Thr Lys Glu Gln Leu Lys Ala Val Met Asp Asp Phe Ala 
                565                 570                 575     


Ala Phe Val Glu Lys Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Phe 
            580                 585                 590         


Ala Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gln Ala Ala Leu Gly 
        595                 600                 605             


Leu 
    


<210> 3
<211> 675
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 3
atgagggtcc aaccaacaga gagcattgtg aggtttccaa acatcaccaa cctgtgtcca       60

tttggagagg tgttcaatgc caccaggttt gcctctgtct atgcctggaa caggaagagg      120

attagcaact gtgtggctga ctactctgtg ctctacaact ctgcctcctt cagcaccttc      180

aagtgttatg gagtgagccc aaccaaactg aatgacctgt gtttcaccaa tgtctatgct      240

gactcctttg tgattagggg agatgaggtg agacagattg cccctggaca aacaggcaag      300

attgctgact acaactacaa actgcctgat gacttcacag gctgtgtgat tgcctggaac      360

agcaacaacc tggacagcaa ggtgggaggc aactacaact acctctacag actgttcagg      420

aagagcaacc tgaaaccatt tgagagggac atcagcacag agatttacca ggctggcagc      480

acaccatgta atggagtgga gggcttcaac tgttactttc cactccaatc ctatggcttc      540

caaccaacca atggagtggg ctaccaacca tacagggtgg tggtgctgtc ctttgaactg      600

ctccatgccc ctgccacagt gtgtggacca aagaagagca ccaacctggt gaagaacaag      660

tgtgtgaact tctga                                                       675


<210> 4
<211> 224
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 4
Met Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr 
1               5                   10                  15      


Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser 
            20                  25                  30          


Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr 
        35                  40                  45              


Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly 
    50                  55                  60                  


Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala 
65                  70                  75                  80  


Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly 
                85                  90                  95      


Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe 
            100                 105                 110         


Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val 
        115                 120                 125             


Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu 
    130                 135                 140                 


Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser 
145                 150                 155                 160 


Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln 
                165                 170                 175     


Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg 
            180                 185                 190         


Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys 
        195                 200                 205             


Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe 
    210                 215                 220                 


<210> 5
<211> 2508
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 5
atgaagtggg taacctttat ttcccttctt tttctcttta gctcggctta ttccaggggt       60

gtgtttcgtc gagatgcaca caagagtgag gttgctcatc ggtttaaaga tttgggagaa      120

gaaaatttca aagccttggt gttgattgcc tttgctcagt atcttcagca gtgtccattt      180

gaagatcatg taaaattagt gaatgaagta actgaatttg caaaaacatg tgttgctgat      240

gagtcagctg aaaattgtga caaatcactt catacccttt ttggagacaa attatgcaca      300

gttgcaactc ttcgtgaaac ctatggtgaa atggctgact gctgtgcaaa acaagaacct      360

gagagaaatg aatgcttctt gcaacacaaa gatgacaacc caaacctccc ccgattggtg      420

agaccagagg ttgatgtgat gtgcactgct tttcatgaca atgaagagac atttttgaaa      480

aaatacttat atgaaattgc cagaagacat ccttactttt atgccccgga actccttttc      540

tttgctaaaa ggtataaagc tgcttttaca gaatgttgcc aagctgctga taaagctgcc      600

tgcctgttgc caaagctcga tgaacttcgg gatgaaggga aggcttcgtc tgccaaacag      660

agactcaagt gtgccagtct ccaaaaattt ggagaaagag ctttcaaagc atgggcagta      720

gctcgcctga gccagagatt tcccaaagct gagtttgcag aagtttccaa gttagtgaca      780

gatcttacca aagtccacac ggaatgctgc catggagatc tgcttgaatg tgctgatgac      840

agggcggacc ttgccaagta tatctgtgaa aatcaagatt cgatctccag taaactgaag      900

gaatgctgtg aaaaacctct gttggaaaaa tcccactgca ttgccgaagt ggaaaatgat      960

gagatgcctg ctgacttgcc ttcattagct gctgattttg ttgaaagtaa ggatgtttgc     1020

aaaaactatg ctgaggcaaa ggatgtcttc ctgggcatgt ttttgtatga atatgcaaga     1080

aggcatcctg attactctgt cgtgctgctg ctgagacttg ccaagacata tgaaaccact     1140

ctagagaagt gctgtgccgc tgcagatcct catgaatgct atgccaaagt gttcgatgaa     1200

tttaaacctc ttgtggaaga gcctcagaat ttaatcaaac aaaattgtga gctttttgag     1260

cagcttggag agtacaaatt ccagaatgcg ctattagttc gttacaccaa gaaagtaccc     1320

caagtgtcaa ctccaactct tgtagaggtc tcaagaaacc taggaaaagt gggcagcaaa     1380

tgttgtaaac atcctgaagc aaaaagaatg ccctgtgcag aagactatct atccgtggtc     1440

ctgaaccagt tatgtgtgtt gcatgagaaa acgccagtaa gtgacagagt caccaaatgc     1500

tgcacagaat ccttggtgaa caggcgacca tgcttttcag ctctggaagt cgatgaaaca     1560

tacgttccca aagagtttaa tgctgaaacg ttcaccttcc atgcagatat atgcacactt     1620

tctgagaagg agagacaaat caagaaacaa actgcacttg ttgagcttgt gaaacacaag     1680

cccaaggcaa caaaagagca actgaaagct gttatggatg atttcgcagc ttttgtagag     1740

aagtgctgca aggctgacga taaggagacc tgctttgccg aggagggtaa aaaacttgtt     1800

gctgcaagtc aagctgcctt aggcttagaa ttcatgaggg tccaaccaac agagagcatt     1860

gtgaggtttc caaacatcac caacctgtgt ccatttggag aggtgttcaa tgccaccagg     1920

tttgcctctg tctatgcctg gaacaggaag aggattagca actgtgtggc tgactactct     1980

gtgctctaca actctgcctc cttcagcacc ttcaagtgtt atggagtgag cccaaccaaa     2040

ctgaatgacc tgtgtttcac caatgtctat gctgactcct ttgtgattag gggagatgag     2100

gtgagacaga ttgcccctgg acaaacaggc aagattgctg actacaacta caaactgcct     2160

gatgacttca caggctgtgt gattgcctgg aacagcaaca acctggacag caaggtggga     2220

ggcaactaca actacctcta cagactgttc aggaagagca acctgaaacc atttgagagg     2280

gacatcagca cagagattta ccaggctggc agcacaccat gtaatggagt ggagggcttc     2340

aactgttact ttccactcca atcctatggc ttccaaccaa ccaatggagt gggctaccaa     2400

ccatacaggg tggtggtgct gtcctttgaa ctgctccatg cccctgccac agtgtgtgga     2460

ccaaagaaga gcaccaacct ggtgaagaac aagtgtgtga acttctga                  2508


<210> 6
<211> 787
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 6
Met Lys Trp Val Thr Phe Ile Ser Leu Leu Phe His Leu Phe Ser Ser 
1               5                   10                  15      


Ala Tyr Ser Arg Gly Val Phe Arg Arg Asp Ala His Lys Ser Glu Val 
            20                  25                  30          


Ala His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys Ala Leu Val 
        35                  40                  45              


Leu Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu Asp His 
    50                  55                  60                  


Val Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala 
65                  70                  75                  80  


Asp Glu Ser Ala Glu Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly 
                85                  90                  95      


Asp Lys Leu Cys Thr Glu Phe His Asp Asn Glu Glu Thr Phe Leu Lys 
            100                 105                 110         


Lys Tyr Leu Tyr Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro 
        115                 120                 125             


Glu Leu Leu Phe Phe Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu Cys 
    130                 135                 140                 


Cys Gln Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro Lys Leu Asp Glu 
145                 150                 155                 160 


Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg Leu Lys Cys 
                165                 170                 175     


Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val 
            180                 185                 190         


Ala Arg Leu Ser Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser 
        195                 200                 205             


Lys Leu Val Thr Asp Leu Thr Lys Val His Thr Glu Cys Cys His Gly 
    210                 215                 220                 


Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu Ala Lys Tyr Ile 
225                 230                 235                 240 


Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu Cys Cys Glu 
                245                 250                 255     


Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp 
            260                 265                 270         


Glu Met Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser 
        275                 280                 285             


Lys Asp Val Cys Lys Asn Tyr Ala Glu Ala Lys Asp Val Phe Leu Gly 
    290                 295                 300                 


Met Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp Tyr Ser Val Val 
305                 310                 315                 320 


Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys 
                325                 330                 335     


Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu 
            340                 345                 350         


Phe Lys Pro Leu Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys 
        355                 360                 365             


Glu Leu Phe Glu Gln Leu Gly Glu Tyr Lys Phe Gln Asn Ala Leu Leu 
    370                 375                 380                 


Val Arg Tyr Thr Lys Lys Val Pro Gln Val Ser Thr Pro Thr Leu Val 
385                 390                 395                 400 


Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His 
                405                 410                 415     


Pro Glu Ala Lys Arg Met Pro Cys Ala Glu Asp Tyr Leu Ser Val Val 
            420                 425                 430         


Leu Asn Gln Leu Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg 
        435                 440                 445             


Val Thr Lys Cys Cys Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe 
    450                 455                 460                 


Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys Glu Phe Asn Ala 
465                 470                 475                 480 


Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser Glu Lys Glu 
                485                 490                 495     


Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys 
            500                 505                 510         


Pro Lys Ala Thr Lys Glu Gln Leu Lys Ala Val Met Asp Asp Phe Ala 
        515                 520                 525             


Ala Phe Val Glu Lys Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Phe 
    530                 535                 540                 


Ala Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gln Ala Ala Leu Gly 
545                 550                 555                 560 


Leu Glu Phe Met Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro 
                565                 570                 575     


Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg 
            580                 585                 590         


Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val 
        595                 600                 605             


Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys 
    610                 615                 620                 


Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn 
625                 630                 635                 640 


Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile 
                645                 650                 655     


Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro 
            660                 665                 670         


Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp 
        675                 680                 685             


Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys 
    690                 695                 700                 


Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln 
705                 710                 715                 720 


Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe 
                725                 730                 735     


Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln 
            740                 745                 750         


Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala 
        755                 760                 765             


Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys 
    770                 775                 780                 


Val Asn Phe 
785         


<210> 7
<211> 33
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 7
aaactcgagg ccaccatgaa gtgggtaacc ttt                                    33


<210> 8
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 8
tttgaattct aagcctaagg cagcttg                                           27


<210> 9
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 9
aaagaattca tgagggtcca accaacagag                                        30


<210> 10
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 10
tttggatcct cagaagttca cacacttgtt                                        30


<210> 11
<211> 6988
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 11
ggccattgca tacgttgtat ccatatcata atatgtacat ttatattggc tcatgtccaa       60

cattaccgcc atgttgacat tgattattga ctagttatta atagtaatca attacggggt      120

cattagttca tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc      180

ctggctgacc gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag      240

taacgccaat agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc      300

acttggcagt acatcaagtg tatcatatgc caagtacgcc ccctattgac gtcaatgacg      360

gtaaatggcc cgcctggcat tatgcccagt acatgacctt atgggacttt cctacttggc      420

agtacatcta cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacatca      480

atgggcgtgg atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca      540

atgggagttt gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactccg      600

ccccattgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata agcagagctc      660

gtttagtgaa ccgtcagatc gcctggagac gccatccacg ctgttttgac ctccatagaa      720

gacaccggga ccgatccagc ctccgcggcc gggaacggtg cattggaacg cggattcccc      780

gtgccaagag tgacgtaagt accgcctata gactctatag gcacacccct ttggctctta      840

tgcatgctat actgtttttg gcttggggcc tatacacccc cgcttcctta tgctataggt      900

gatggtatag cttagcctat aggtgtgggt tattgaccat tattgaccac tccaacggtg      960

gagggcagtg tagtctgagc agtactcgtt gctgccgcgc gcgccaccag acataatagc     1020

tgacagacta acagactgtt cctttccatg ggtcttttct gcagtcaccg tcgtcgacgg     1080

tatcgataag cttgatatcg aattcacgtg ggcccggtac cgtatactct agagccacca     1140

tgaagtgggt caccttcatc agcctgctgt ttctgttcag cagcgcctac agcagaggcg     1200

tgttcagaag agatgcccac aagagcgagg tggcccacag attcaaggac ctgggcgaag     1260

agaacttcaa ggccctggtg ctgatcgcct tcgctcagta tctgcagcag tgccccttcg     1320

aggatcacgt gaagctggtc aacgaagtga ccgagttcgc caagacctgt gtggccgatg     1380

agagcgccga gaactgtgat aagagcctgc acaccctgtt cggcgacaag ctgtgtacag     1440

tggccacact gagagaaacc tacggcgaga tggccgactg ctgtgccaag caagagcccg     1500

agagaaacga gtgcttcctg cagcacaagg acgacaaccc caacctgcct agactcgtgc     1560

gacccgaagt ggatgtgatg tgcaccgcct tccacgacaa cgaggaaacc ttcctgaaga     1620

agtacctgta cgagatcgcc agacggcacc cctactttta tgcccctgag ctgctgttct     1680

tcgccaagcg gtataaggcc gccttcaccg aatgttgcca ggccgctgat aaggctgcct     1740

gtctgctgcc taagctggac gagctgagag atgagggcaa agccagctct gccaagcaga     1800

gactgaagtg cgccagcctg cagaagttcg gcgagagagc ctttaaagcc tgggccgttg     1860

ccagactgag ccagagattt cctaaggccg agtttgccga ggtgtccaag ctcgtgaccg     1920

atctgacaaa ggtgcacacc gagtgctgtc acggcgatct gctggaatgt gccgacgata     1980

gagccgacct ggccaagtac atctgcgaga accaggacag catcagcagc aagctgaaag     2040

agtgctgcga gaagcccctg ctggaaaagt ctcactgtat cgccgaggtg gaaaacgacg     2100

agatgcctgc cgatctgcct agcctggctg ccgatttcgt ggaaagcaag gacgtgtgca     2160

agaactacgc cgaggccaag gatgtgttcc tgggcatgtt tctgtatgag tacgcccgca     2220

gacaccccga ctattctgtg gttctgctgc tgcggctggc caaaacctac gagacaaccc     2280

tggaaaaatg ctgcgccgct gccgatcctc acgagtgtta tgccaaggtg ttcgacgagt     2340

tcaagcctct ggtggaagaa ccccagaacc tgatcaagca gaactgcgag ctgttcgagc     2400

agctgggcga gtacaagttc cagaatgccc tgctcgtgcg gtacaccaag aaagtgcctc     2460

aggtgtccac acctacactg gttgaggtgt cccggaatct gggcaaagtg ggcagcaagt     2520

gttgcaagca ccctgaggcc aagagaatgc cttgcgccga ggattacctg agcgtggtgc     2580

tgaatcagct gtgcgtgctg cacgagaaaa cccctgtgtc cgacagagtg accaagtgct     2640

gtaccgagag cctcgtgaac agaaggcctt gctttagcgc cctggaagtg gacgagacat     2700

acgtgcccaa agagttcaac gccgagacat tcaccttcca cgccgacatc tgcaccctgt     2760

ccgagaaaga gcggcagatc aagaagcaga cagccctggt cgagctggtt aagcacaagc     2820

ccaaggccac caaagaacag ctgaaggccg tgatggacga cttcgccgcc tttgtcgaga     2880

agtgctgcaa ggccgacgac aaagagacat gcttcgccga agagggcaag aaactggtgg     2940

ctgcctctca ggctgccctg ggccttgagt ttatgagagt gcagcctacc gagtccatcg     3000

tgcggttccc caacatcacc aatctgtgcc cctttggcga ggtgttcaat gccaccagat     3060

ttgccagcgt gtacgcctgg aaccggaaga gaatcagcaa ctgcgtggcc gactacagcg     3120

tgctgtacaa tagcgccagc ttcagcacct tcaagtgcta cggcgtgtcc cctaccaagc     3180

tgaacgacct gtgcttcacc aatgtgtacg ccgacagctt cgtgatcaga ggcgacgaag     3240

tgcggcagat tgctcctgga cagaccggca agatcgccga ttacaactac aagctgcccg     3300

acgacttcac cggctgcgtg atcgcctgga atagcaacaa cctggacagc aaagtcggcg     3360

gcaactacaa ctacctgtac cggctgttcc ggaagtccaa cctgaagcct ttcgagcggg     3420

acatcagcac cgagatctat caggccggca gcaccccttg taatggcgtc gagggcttca     3480

actgctactt cccactgcag tcctacggct tccagcctac caatggcgtg ggctaccagc     3540

cttatagagt ggtggtgctg tccttcgaac tgctgcatgc ccctgctacc gtgtgcggcc     3600

ctaagaagtc taccaacctg gtcaagaaca aatgcgtgaa cttctaataa ggatccagat     3660

ctttttccct ctgccaaaaa ttatggggac atcatgaagc cccttgagca tctgacttct     3720

ggctaataaa ggaaatttat tttcattgca atagtgtgtt ggaatttttt gtgtctctca     3780

ctcggaagga catatgggag ggcaaatcat ttaaaacatc agaatgagta tttggtttag     3840

agtttggcaa catatgccca ttcttccgct tcctcgctca ctgactcgct gcgctcggtc     3900

gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa     3960

tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt     4020

aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa     4080

aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt     4140

ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg     4200

tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc     4260

agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc     4320

gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta     4380

tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct     4440

acagagttct tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc     4500

tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa     4560

caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa     4620

aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa     4680

aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt     4740

ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac     4800

agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc     4860

atagttgcct gactcggggg gggggggcgc tgaggtctgc ctcgtgaaga aggtgttgct     4920

gactcatacc agggcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg     4980

tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc     5040

atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg     5100

gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca     5160

tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt     5220

atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc     5280

agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc     5340

ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacctgaatc gccccatcat     5400

ccagccagaa agtgagggag ccacggttga tgagagcttt gttgtaggtg gaccagttgg     5460

tgattttgaa cttttgcttt gccacggaac ggtctgcgtt gtcgggaaga tgcgtgatct     5520

gatccttcaa ctcagcaaaa gttcgattta ttcaacaaag ccgccgtccc gtcaagtcag     5580

cgtaatgctc tgccagtgtt acaaccaatt aaccaattct gattagaaaa actcatcgag     5640

catcaaatga aactgcaatt tattcatatc aggattatca ataccatatt tttgaaaaag     5700

ccgtttctgt aatgaaggag aaaactcacc gaggcagttc cataggatgg caagatcctg     5760

gtatcggtct gcgattccga ctcgtccaac atcaatacaa cctattaatt tcccctcgtc     5820

aaaaataagg ttatcaagtg agaaatcacc atgagtgacg actgaatccg gtgagaatgg     5880

caaaagctta tgcatttctt tccagacttg ttcaacaggc cagccattac gctcgtcatc     5940

aaaatcactc gcatcaacca aaccgttatt cattcgtgat tgcgcctgag cgagacgaaa     6000

tacgcgatcg ctgttaaaag gacaattaca aacaggaatc gaatgcaacc ggcgcaggaa     6060

cactgccagc gcatcaacaa tattttcacc tgaatcagga tattcttcta atacctggaa     6120

tgctgttttc ccggggatcg cagtggtgag taaccatgca tcatcaggag tacggataaa     6180

atgcttgatg gtcggaagag gcataaattc cgtcagccag tttagtctga ccatctcatc     6240

tgtaacatca ttggcaacgc tacctttgcc atgtttcaga aacaactctg gcgcatcggg     6300

cttcccatac aatcgataga ttgtcgcacc tgattgcccg acattatcgc gagcccattt     6360

atacccatat aaatcagcat ccatgttgga atttaatcgc ggcctcgagc aagacgtttc     6420

ccgttgaata tggctcataa caccccttgt attactgttt atgtaagcag acagttttat     6480

tgttcatgat gatatatttt tatcttgtgc aatgtaacat cagagatttt gagacacaac     6540

gtggctttcc cccccccccc attattgaag catttatcag ggttattgtc tcatgagcgg     6600

atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg     6660

aaaagtgcca cctgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag     6720

gcgtatcacg aggccctttc gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca     6780

catgcagctc ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc     6840

ccgtcagggc gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact atgcggcatc     6900

agagcagatt gtactgagag tgcaccatat gcggtgtgaa ataccgcaca gatgcgtaag     6960

gagaaaatac cgcatcagat tggctatt                                        6988


<210> 12
<211> 835
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 12
Met Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala 
1               5                   10                  15      


Tyr Ser Arg Gly Val Phe Arg Arg Asp Ala His Lys Ser Glu Val Ala 
            20                  25                  30          


His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys Ala Leu Val Leu 
        35                  40                  45              


Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu Asp His Val 
    50                  55                  60                  


Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp 
65                  70                  75                  80  


Glu Ser Ala Glu Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp 
                85                  90                  95      


Lys Leu Cys Thr Val Ala Thr Leu Arg Glu Thr Tyr Gly Glu Met Ala 
            100                 105                 110         


Asp Cys Cys Ala Lys Gln Glu Pro Glu Arg Asn Glu Cys Phe Leu Gln 
        115                 120                 125             


His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg Pro Glu Val 
    130                 135                 140                 


Asp Val Met Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys 
145                 150                 155                 160 


Lys Tyr Leu Tyr Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro 
                165                 170                 175     


Glu Leu Leu Phe Phe Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu Cys 
            180                 185                 190         


Cys Gln Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro Lys Leu Asp Glu 
        195                 200                 205             


Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg Leu Lys Cys 
    210                 215                 220                 


Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val 
225                 230                 235                 240 


Ala Arg Leu Ser Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser 
                245                 250                 255     


Lys Leu Val Thr Asp Leu Thr Lys Val His Thr Glu Cys Cys His Gly 
            260                 265                 270         


Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu Ala Lys Tyr Ile 
        275                 280                 285             


Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu Cys Cys Glu 
    290                 295                 300                 


Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp 
305                 310                 315                 320 


Glu Met Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser 
                325                 330                 335     


Lys Asp Val Cys Lys Asn Tyr Ala Glu Ala Lys Asp Val Phe Leu Gly 
            340                 345                 350         


Met Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp Tyr Ser Val Val 
        355                 360                 365             


Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys 
    370                 375                 380                 


Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu 
385                 390                 395                 400 


Phe Lys Pro Leu Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys 
                405                 410                 415     


Glu Leu Phe Glu Gln Leu Gly Glu Tyr Lys Phe Gln Asn Ala Leu Leu 
            420                 425                 430         


Val Arg Tyr Thr Lys Lys Val Pro Gln Val Ser Thr Pro Thr Leu Val 
        435                 440                 445             


Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His 
    450                 455                 460                 


Pro Glu Ala Lys Arg Met Pro Cys Ala Glu Asp Tyr Leu Ser Val Val 
465                 470                 475                 480 


Leu Asn Gln Leu Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg 
                485                 490                 495     


Val Thr Lys Cys Cys Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe 
            500                 505                 510         


Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys Glu Phe Asn Ala 
        515                 520                 525             


Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser Glu Lys Glu 
    530                 535                 540                 


Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys 
545                 550                 555                 560 


Pro Lys Ala Thr Lys Glu Gln Leu Lys Ala Val Met Asp Asp Phe Ala 
                565                 570                 575     


Ala Phe Val Glu Lys Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Phe 
            580                 585                 590         


Ala Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gln Ala Ala Leu Gly 
        595                 600                 605             


Leu Glu Phe Met Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro 
    610                 615                 620                 


Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg 
625                 630                 635                 640 


Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val 
                645                 650                 655     


Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys 
            660                 665                 670         


Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn 
        675                 680                 685             


Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile 
    690                 695                 700                 


Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro 
705                 710                 715                 720 


Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp 
                725                 730                 735     


Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys 
            740                 745                 750         


Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln 
        755                 760                 765             


Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe 
    770                 775                 780                 


Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln 
785                 790                 795                 800 


Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala 
                805                 810                 815     


Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys 
            820                 825                 830         


Val Asn Phe 
        835 


<210> 13
<211> 8368
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 13
ggccattgca tacgttgtat ccatatcata atatgtacat ttatattggc tcatgtccaa       60

cattaccgcc atgttgacat tgattattga ctagttatta atagtaatca attacggggt      120

cattagttca tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc      180

ctggctgacc gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag      240

taacgccaat agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc      300

acttggcagt acatcaagtg tatcatatgc caagtacgcc ccctattgac gtcaatgacg      360

gtaaatggcc cgcctggcat tatgcccagt acatgacctt atgggacttt cctacttggc      420

agtacatcta cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacatca      480

atgggcgtgg atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca      540

atgggagttt gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactccg      600

ccccattgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata agcagagctc      660

gtttagtgaa ccgtcagatc gcctggagac gccatccacg ctgttttgac ctccatagaa      720

gacaccggga ccgatccagc ctccgcggcc gggaacggtg cattggaacg cggattcccc      780

gtgccaagag tgacgtaagt accgcctata gactctatag gcacacccct ttggctctta      840

tgcatgctat actgtttttg gcttggggcc tatacacccc cgcttcctta tgctataggt      900

gatggtatag cttagcctat aggtgtgggt tattgaccat tattgaccac tccaacggtg      960

gagggcagtg tagtctgagc agtactcgtt gctgccgcgc gcgccaccag acataatagc     1020

tgacagacta acagactgtt cctttccatg ggtcttttct gcagtcaccg tcgtcgacgg     1080

tatcgataag cttgatatcg aattcacgtg ggcccggtac cgtatactct agagccacca     1140

tgaagtgggt caccttcatc agcctgctgt ttctgttcag cagcgcctac agcagaggcg     1200

tgttcagaag agatgcccac aagagcgagg tggcccacag attcaaggac ctgggcgaag     1260

agaacttcaa ggccctggtg ctgatcgcct tcgctcagta tctgcagcag tgccccttcg     1320

aggatcacgt gaagctggtc aacgaagtga ccgagttcgc caagacctgt gtggccgatg     1380

agagcgccga gaactgtgat aagagcctgc acaccctgtt cggcgacaag ctgtgtacag     1440

tggccacact gagagaaacc tacggcgaga tggccgactg ctgtgccaag caagagcccg     1500

agagaaacga gtgcttcctg cagcacaagg acgacaaccc caacctgcct agactcgtgc     1560

gacccgaagt ggatgtgatg tgcaccgcct tccacgacaa cgaggaaacc ttcctgaaga     1620

agtacctgta cgagatcgcc agacggcacc cctactttta tgcccctgag ctgctgttct     1680

tcgccaagcg gtataaggcc gccttcaccg aatgttgcca ggccgctgat aaggctgcct     1740

gtctgctgcc taagctggac gagctgagag atgagggcaa agccagctct gccaagcaga     1800

gactgaagtg cgccagcctg cagaagttcg gcgagagagc ctttaaagcc tgggccgttg     1860

ccagactgag ccagagattt cctaaggccg agtttgccga ggtgtccaag ctcgtgaccg     1920

atctgacaaa ggtgcacacc gagtgctgtc acggcgatct gctggaatgt gccgacgata     1980

gagccgacct ggccaagtac atctgcgaga accaggacag catcagcagc aagctgaaag     2040

agtgctgcga gaagcccctg ctggaaaagt ctcactgtat cgccgaggtg gaaaacgacg     2100

agatgcctgc cgatctgcct agcctggctg ccgatttcgt ggaaagcaag gacgtgtgca     2160

agaactacgc cgaggccaag gatgtgttcc tgggcatgtt tctgtatgag tacgcccgca     2220

gacaccccga ctattctgtg gttctgctgc tgcggctggc caaaacctac gagacaaccc     2280

tggaaaaatg ctgcgccgct gccgatcctc acgagtgtta tgccaaggtg ttcgacgagt     2340

tcaagcctct ggtggaagaa ccccagaacc tgatcaagca gaactgcgag ctgttcgagc     2400

agctgggcga gtacaagttc cagaatgccc tgctcgtgcg gtacaccaag aaagtgcctc     2460

aggtgtccac acctacactg gttgaggtgt cccggaatct gggcaaagtg ggcagcaagt     2520

gttgcaagca ccctgaggcc aagagaatgc cttgcgccga ggattacctg agcgtggtgc     2580

tgaatcagct gtgcgtgctg cacgagaaaa cccctgtgtc cgacagagtg accaagtgct     2640

gtaccgagag cctcgtgaac agaaggcctt gctttagcgc cctggaagtg gacgagacat     2700

acgtgcccaa agagttcaac gccgagacat tcaccttcca cgccgacatc tgcaccctgt     2760

ccgagaaaga gcggcagatc aagaagcaga cagccctggt cgagctggtt aagcacaagc     2820

ccaaggccac caaagaacag ctgaaggccg tgatggacga cttcgccgcc tttgtcgaga     2880

agtgctgcaa ggccgacgac aaagagacat gcttcgccga agagggcaag aaactggtgg     2940

ctgcctctca ggctgctctg ggacttagag tgcagcctac agagtccatc gtgcggttcc     3000

ccaacatcac caatctgtgc ccctttggcg aggtgttcaa tgccaccaga tttgccagcg     3060

tgtacgcctg gaaccggaag agaatcagca actgcgtggc cgactacagc gtgctgtaca     3120

atagcgccag cttcagcacc ttcaagtgct acggcgtgtc ccctaccaag ctgaacgacc     3180

tgtgcttcac caatgtgtac gccgacagct tcgtgatcag aggcgacgaa gtgcggcaga     3240

ttgctcctgg acagaccggc aagatcgccg attacaacta caagctgccc gacgacttca     3300

ccggctgcgt gatcgcctgg aatagcaaca acctggacag caaagtcggc ggcaactaca     3360

actacctgta ccggctgttc cggaagtcca acctgaagcc tttcgagcgg gacatcagca     3420

ccgagatcta tcaggccggc agcacccctt gtaatggcgt cgagggcttc aactgctact     3480

tcccactgca gtcctacggc ttccagccta ccaatggcgt gggctaccag ccttatagag     3540

tggtggtgct gtccttcgaa ctgctgcatg cccctgctac cgtgtgcggc cctaagaagt     3600

ctaccaacct ggtcaagaac aaatgcgtga acttcgaggc taagcccagc ggctctgtgg     3660

ttgaacaagc cgaaggcgtg gaatgcgact tctctccact gctgtctggc acccctccac     3720

aggtgtacaa cttcaagcgg ctggtgttca ccaactgcaa ttacaacctg acaaagctgc     3780

tgagcctgtt cagcgtgaac gactttacct gcagccagat ctctcctgcc gccattgcca     3840

gcaactgtta cagctccctg atcctggact acttcagcta ccctctgagc atgaagtccg     3900

acctgtctgt gtctagcgcc ggacctatca gccagttcaa ttacaagcag tccttcagca     3960

accccacctg tctgattctg gccaccgtgc ctcacaatct gaccaccatc accaagccac     4020

tgaagtacag ctacatcaac aagtgcagcc ggttcctgag cgacgacaga acagaagtgc     4080

cacagctcgt caacgccaac cagtacagcc cctgtgtgtc tatcgtgcct agcacagtgt     4140

gggaggacgg cgactactac agaaagcagc tgtctccact cgaaggcgga ggatggctgg     4200

tggcttctgg aagcacagtg gctatgacag agcagctgca gatgggcttt ggcatcaccg     4260

tgcagtacgg caccgatacc aatagcgtgt gccccaagct ggaattcgcc aacgacacca     4320

agattgccag ccagctgggc aattgcgtcg agtacagagt ggtgcctagc ggcgacgttg     4380

tgcgctttcc taatatcaca aacctgtgtc cattcgggga agtgtttaac gccacaaagt     4440

tcccttccgt gtatgcctgg gagcgcaaga aaatctccaa ctgtgtggct gattactccg     4500

tcctgtacaa cagcaccttt ttctccacgt tcaaatgtta tggggtgtcc gccaccaaac     4560

tcaatgacct ctgttttagc aacgtctacg ccgactcctt cgtcgtgaaa ggggatgatg     4620

ttcgccagat cgccccagga caaaccggcg ttatcgccga ctataattac aaactccccg     4680

atgatttcat gggctgtgtg ctggcctgga acaccagaaa tatcgatgcc acctccaccg     4740

ggaactataa ctacaagtac agatacctgc ggcacggcaa gctgaggccc tttgagaggg     4800

atatctccaa cgtgccattc agccccgacg gcaagccttg tacaccacca gctctgaatt     4860

gctactggcc cctgaacgat tacggcttct acaccacaac cggcatcggc taccaaccat     4920

acagggtcgt cgtgctgagc ttcgaattgc tgaacgcccc agccacagtg tgtggcccaa     4980

agctgagcac cgacctgatt aagaaccagt gcgtcaactt caactaataa ggatccagat     5040

ctttttccct ctgccaaaaa ttatggggac atcatgaagc cccttgagca tctgacttct     5100

ggctaataaa ggaaatttat tttcattgca atagtgtgtt ggaatttttt gtgtctctca     5160

ctcggaagga catatgggag ggcaaatcat ttaaaacatc agaatgagta tttggtttag     5220

agtttggcaa catatgccca ttcttccgct tcctcgctca ctgactcgct gcgctcggtc     5280

gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa     5340

tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt     5400

aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa     5460

aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt     5520

ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg     5580

tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc     5640

agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc     5700

gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta     5760

tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct     5820

acagagttct tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc     5880

tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa     5940

caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa     6000

aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa     6060

aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt     6120

ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac     6180

agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc     6240

atagttgcct gactcggggg gggggggcgc tgaggtctgc ctcgtgaaga aggtgttgct     6300

gactcatacc agggcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg     6360

tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc     6420

atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg     6480

gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca     6540

tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt     6600

atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc     6660

agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc     6720

ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacctgaatc gccccatcat     6780

ccagccagaa agtgagggag ccacggttga tgagagcttt gttgtaggtg gaccagttgg     6840

tgattttgaa cttttgcttt gccacggaac ggtctgcgtt gtcgggaaga tgcgtgatct     6900

gatccttcaa ctcagcaaaa gttcgattta ttcaacaaag ccgccgtccc gtcaagtcag     6960

cgtaatgctc tgccagtgtt acaaccaatt aaccaattct gattagaaaa actcatcgag     7020

catcaaatga aactgcaatt tattcatatc aggattatca ataccatatt tttgaaaaag     7080

ccgtttctgt aatgaaggag aaaactcacc gaggcagttc cataggatgg caagatcctg     7140

gtatcggtct gcgattccga ctcgtccaac atcaatacaa cctattaatt tcccctcgtc     7200

aaaaataagg ttatcaagtg agaaatcacc atgagtgacg actgaatccg gtgagaatgg     7260

caaaagctta tgcatttctt tccagacttg ttcaacaggc cagccattac gctcgtcatc     7320

aaaatcactc gcatcaacca aaccgttatt cattcgtgat tgcgcctgag cgagacgaaa     7380

tacgcgatcg ctgttaaaag gacaattaca aacaggaatc gaatgcaacc ggcgcaggaa     7440

cactgccagc gcatcaacaa tattttcacc tgaatcagga tattcttcta atacctggaa     7500

tgctgttttc ccggggatcg cagtggtgag taaccatgca tcatcaggag tacggataaa     7560

atgcttgatg gtcggaagag gcataaattc cgtcagccag tttagtctga ccatctcatc     7620

tgtaacatca ttggcaacgc tacctttgcc atgtttcaga aacaactctg gcgcatcggg     7680

cttcccatac aatcgataga ttgtcgcacc tgattgcccg acattatcgc gagcccattt     7740

atacccatat aaatcagcat ccatgttgga atttaatcgc ggcctcgagc aagacgtttc     7800

ccgttgaata tggctcataa caccccttgt attactgttt atgtaagcag acagttttat     7860

tgttcatgat gatatatttt tatcttgtgc aatgtaacat cagagatttt gagacacaac     7920

gtggctttcc cccccccccc attattgaag catttatcag ggttattgtc tcatgagcgg     7980

atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg     8040

aaaagtgcca cctgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag     8100

gcgtatcacg aggccctttc gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca     8160

catgcagctc ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc     8220

ccgtcagggc gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact atgcggcatc     8280

agagcagatt gtactgagag tgcaccatat gcggtgtgaa ataccgcaca gatgcgtaag     8340

gagaaaatac cgcatcagat tggctatt                                        8368


<210> 14
<211> 1295
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 14
Met Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala 
1               5                   10                  15      


Tyr Ser Arg Gly Val Phe Arg Arg Asp Ala His Lys Ser Glu Val Ala 
            20                  25                  30          


His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys Ala Leu Val Leu 
        35                  40                  45              


Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu Asp His Val 
    50                  55                  60                  


Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp 
65                  70                  75                  80  


Glu Ser Ala Glu Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp 
                85                  90                  95      


Lys Leu Cys Thr Val Ala Thr Leu Arg Glu Thr Tyr Gly Glu Met Ala 
            100                 105                 110         


Asp Cys Cys Ala Lys Gln Glu Pro Glu Arg Asn Glu Cys Phe Leu Gln 
        115                 120                 125             


His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg Pro Glu Val 
    130                 135                 140                 


Asp Val Met Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys 
145                 150                 155                 160 


Lys Tyr Leu Tyr Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro 
                165                 170                 175     


Glu Leu Leu Phe Phe Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu Cys 
            180                 185                 190         


Cys Gln Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro Lys Leu Asp Glu 
        195                 200                 205             


Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg Leu Lys Cys 
    210                 215                 220                 


Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val 
225                 230                 235                 240 


Ala Arg Leu Ser Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser 
                245                 250                 255     


Lys Leu Val Thr Asp Leu Thr Lys Val His Thr Glu Cys Cys His Gly 
            260                 265                 270         


Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu Ala Lys Tyr Ile 
        275                 280                 285             


Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu Cys Cys Glu 
    290                 295                 300                 


Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp 
305                 310                 315                 320 


Glu Met Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser 
                325                 330                 335     


Lys Asp Val Cys Lys Asn Tyr Ala Glu Ala Lys Asp Val Phe Leu Gly 
            340                 345                 350         


Met Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp Tyr Ser Val Val 
        355                 360                 365             


Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys 
    370                 375                 380                 


Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu 
385                 390                 395                 400 


Phe Lys Pro Leu Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys 
                405                 410                 415     


Glu Leu Phe Glu Gln Leu Gly Glu Tyr Lys Phe Gln Asn Ala Leu Leu 
            420                 425                 430         


Val Arg Tyr Thr Lys Lys Val Pro Gln Val Ser Thr Pro Thr Leu Val 
        435                 440                 445             


Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His 
    450                 455                 460                 


Pro Glu Ala Lys Arg Met Pro Cys Ala Glu Asp Tyr Leu Ser Val Val 
465                 470                 475                 480 


Leu Asn Gln Leu Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg 
                485                 490                 495     


Val Thr Lys Cys Cys Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe 
            500                 505                 510         


Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys Glu Phe Asn Ala 
        515                 520                 525             


Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser Glu Lys Glu 
    530                 535                 540                 


Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys 
545                 550                 555                 560 


Pro Lys Ala Thr Lys Glu Gln Leu Lys Ala Val Met Asp Asp Phe Ala 
                565                 570                 575     


Ala Phe Val Glu Lys Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Phe 
            580                 585                 590         


Ala Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gln Ala Ala Leu Gly 
        595                 600                 605             


Leu Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr 
    610                 615                 620                 


Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser 
625                 630                 635                 640 


Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr 
                645                 650                 655     


Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly 
            660                 665                 670         


Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala 
        675                 680                 685             


Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly 
    690                 695                 700                 


Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe 
705                 710                 715                 720 


Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val 
                725                 730                 735     


Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu 
            740                 745                 750         


Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser 
        755                 760                 765             


Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln 
    770                 775                 780                 


Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg 
785                 790                 795                 800 


Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys 
                805                 810                 815     


Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe 
            820                 825                 830         


Glu Ala Lys Pro Ser Gly Ser Val Val Glu Gln Ala Glu Gly Val Glu 
        835                 840                 845             


Cys Asp Phe Ser Pro Leu Leu Ser Gly Thr Pro Pro Gln Val Tyr Asn 
    850                 855                 860                 


Phe Lys Arg Leu Val Phe Thr Asn Cys Asn Tyr Asn Leu Thr Lys Leu 
865                 870                 875                 880 


Leu Ser Leu Phe Ser Val Asn Asp Phe Thr Cys Ser Gln Ile Ser Pro 
                885                 890                 895     


Ala Ala Ile Ala Ser Asn Cys Tyr Ser Ser Leu Ile Leu Asp Tyr Phe 
            900                 905                 910         


Ser Tyr Pro Leu Ser Met Lys Ser Asp Leu Ser Val Ser Ser Ala Gly 
        915                 920                 925             


Pro Ile Ser Gln Phe Asn Tyr Lys Gln Ser Phe Ser Asn Pro Thr Cys 
    930                 935                 940                 


Leu Ile Leu Ala Thr Val Pro His Asn Leu Thr Thr Ile Thr Lys Pro 
945                 950                 955                 960 


Leu Lys Tyr Ser Tyr Ile Asn Lys Cys Ser Arg Phe Leu Ser Asp Asp 
                965                 970                 975     


Arg Thr Glu Val Pro Gln Leu Val Asn Ala Asn Gln Tyr Ser Pro Cys 
            980                 985                 990         


Val Ser Ile Val Pro Ser Thr Val  Trp Glu Asp Gly Asp  Tyr Tyr Arg 
        995                 1000                 1005             


Lys Gln  Leu Ser Pro Leu Glu  Gly Gly Gly Trp Leu  Val Ala Ser 
    1010                 1015                 1020             


Gly Ser  Thr Val Ala Met Thr  Glu Gln Leu Gln Met  Gly Phe Gly 
    1025                 1030                 1035             


Ile Thr  Val Gln Tyr Gly Thr  Asp Thr Asn Ser Val  Cys Pro Lys 
    1040                 1045                 1050             


Leu Glu  Phe Ala Asn Asp Thr  Lys Ile Ala Ser Gln  Leu Gly Asn 
    1055                 1060                 1065             


Cys Val  Glu Tyr Arg Val Val  Pro Ser Gly Asp Val  Val Arg Phe 
    1070                 1075                 1080             


Pro Asn  Ile Thr Asn Leu Cys  Pro Phe Gly Glu Val  Phe Asn Ala 
    1085                 1090                 1095             


Thr Lys  Phe Pro Ser Val Tyr  Ala Trp Glu Arg Lys  Lys Ile Ser 
    1100                 1105                 1110             


Asn Cys  Val Ala Asp Tyr Ser  Val Leu Tyr Asn Ser  Thr Phe Phe 
    1115                 1120                 1125             


Ser Thr  Phe Lys Cys Tyr Gly  Val Ser Ala Thr Lys  Leu Asn Asp 
    1130                 1135                 1140             


Leu Cys  Phe Ser Asn Val Tyr  Ala Asp Ser Phe Val  Val Lys Gly 
    1145                 1150                 1155             


Asp Asp  Val Arg Gln Ile Ala  Pro Gly Gln Thr Gly  Val Ile Ala 
    1160                 1165                 1170             


Asp Tyr  Asn Tyr Lys Leu Pro  Asp Asp Phe Met Gly  Cys Val Leu 
    1175                 1180                 1185             


Ala Trp  Asn Thr Arg Asn Ile  Asp Ala Thr Ser Thr  Gly Asn Tyr 
    1190                 1195                 1200             


Asn Tyr  Lys Tyr Arg Tyr Leu  Arg His Gly Lys Leu  Arg Pro Phe 
    1205                 1210                 1215             


Glu Arg  Asp Ile Ser Asn Val  Pro Phe Ser Pro Asp  Gly Lys Pro 
    1220                 1225                 1230             


Cys Thr  Pro Pro Ala Leu Asn  Cys Tyr Trp Pro Leu  Asn Asp Tyr 
    1235                 1240                 1245             


Gly Phe  Tyr Thr Thr Thr Gly  Ile Gly Tyr Gln Pro  Tyr Arg Val 
    1250                 1255                 1260             


Val Val  Leu Ser Phe Glu Leu  Leu Asn Ala Pro Ala  Thr Val Cys 
    1265                 1270                 1275             


Gly Pro  Lys Leu Ser Thr Asp  Leu Ile Lys Asn Gln  Cys Val Asn 
    1280                 1285                 1290             


Phe Asn  
    1295 


<210> 15
<211> 723
<212> DNA
<213> Middle East respiratory syndrome-related coronavirus

<400> 15
gaggctaagc ccagcggctc tgtggttgaa caagccgaag gcgtggaatg cgacttctct       60

ccactgctgt ctggcacccc tccacaggtg tacaacttca agcggctggt gttcaccaac      120

tgcaattaca acctgacaaa gctgctgagc ctgttcagcg tgaacgactt tacctgcagc      180

cagatctctc ctgccgccat tgccagcaac tgttacagct ccctgatcct ggactacttc      240

agctaccctc tgagcatgaa gtccgacctg tctgtgtcta gcgccggacc tatcagccag      300

ttcaattaca agcagtcctt cagcaacccc acctgtctga ttctggccac cgtgcctcac      360

aatctgacca ccatcaccaa gccactgaag tacagctaca tcaacaagtg cagccggttc      420

ctgagcgacg acagaacaga agtgccacag ctcgtcaacg ccaaccagta cagcccctgt      480

gtgtctatcg tgcctagcac agtgtgggag gacggcgact actacagaaa gcagctgtct      540

ccactcgaag gcggaggatg gctggtggct tctggaagca cagtggctat gacagagcag      600

ctgcagatgg gctttggcat caccgtgcag tacggcaccg ataccaatag cgtgtgcccc      660

aagctggaat tcgccaacga caccaagatt gccagccagc tgggcaattg cgtcgagtac      720

aga                                                                    723


<210> 16
<211> 242
<212> PRT
<213> Middle East respiratory syndrome-related coronavirus

<400> 16
Phe Glu Ala Lys Pro Ser Gly Ser Val Val Glu Gln Ala Glu Gly Val 
1               5                   10                  15      


Glu Cys Asp Phe Ser Pro Leu Leu Ser Gly Thr Pro Pro Gln Val Tyr 
            20                  25                  30          


Asn Phe Lys Arg Leu Val Phe Thr Asn Cys Asn Tyr Asn Leu Thr Lys 
        35                  40                  45              


Leu Leu Ser Leu Phe Ser Val Asn Asp Phe Thr Cys Ser Gln Ile Ser 
    50                  55                  60                  


Pro Ala Ala Ile Ala Ser Asn Cys Tyr Ser Ser Leu Ile Leu Asp Tyr 
65                  70                  75                  80  


Phe Ser Tyr Pro Leu Ser Met Lys Ser Asp Leu Ser Val Ser Ser Ala 
                85                  90                  95      


Gly Pro Ile Ser Gln Phe Asn Tyr Lys Gln Ser Phe Ser Asn Pro Thr 
            100                 105                 110         


Cys Leu Ile Leu Ala Thr Val Pro His Asn Leu Thr Thr Ile Thr Lys 
        115                 120                 125             


Pro Leu Lys Tyr Ser Tyr Ile Asn Lys Cys Ser Arg Phe Leu Ser Asp 
    130                 135                 140                 


Asp Arg Thr Glu Val Pro Gln Leu Val Asn Ala Asn Gln Tyr Ser Pro 
145                 150                 155                 160 


Cys Val Ser Ile Val Pro Ser Thr Val Trp Glu Asp Gly Asp Tyr Tyr 
                165                 170                 175     


Arg Lys Gln Leu Ser Pro Leu Glu Gly Gly Gly Trp Leu Val Ala Ser 
            180                 185                 190         


Gly Ser Thr Val Ala Met Thr Glu Gln Leu Gln Met Gly Phe Gly Ile 
        195                 200                 205             


Thr Val Gln Tyr Gly Thr Asp Thr Asn Ser Val Cys Pro Lys Leu Glu 
    210                 215                 220                 


Phe Ala Asn Asp Thr Lys Ile Ala Ser Gln Leu Gly Asn Cys Val Glu 
225                 230                 235                 240 


Tyr Arg 
        


<210> 17
<211> 666
<212> DNA
<213> Severe acute respiratory syndrome-related coronavirus

<400> 17
gtggtgccta gcggcgacgt tgtgcgcttt cctaatatca caaacctgtg tccattcggg       60

gaagtgttta acgccacaaa gttcccttcc gtgtatgcct gggagcgcaa gaaaatctcc      120

aactgtgtgg ctgattactc cgtcctgtac aacagcacct ttttctccac gttcaaatgt      180

tatggggtgt ccgccaccaa actcaatgac ctctgtttta gcaacgtcta cgccgactcc      240

ttcgtcgtga aaggggatga tgttcgccag atcgccccag gacaaaccgg cgttatcgcc      300

gactataatt acaaactccc cgatgatttc atgggctgtg tgctggcctg gaacaccaga      360

aatatcgatg ccacctccac cgggaactat aactacaagt acagatacct gcggcacggc      420

aagctgaggc cctttgagag ggatatctcc aacgtgccat tcagccccga cggcaagcct      480

tgtacaccac cagctctgaa ttgctactgg cccctgaacg attacggctt ctacaccaca      540

accggcatcg gctaccaacc atacagggtc gtcgtgctga gcttcgaatt gctgaacgcc      600

ccagccacag tgtgtggccc aaagctgagc accgacctga ttaagaacca gtgcgtcaac      660

ttcaac                                                                 666


<210> 18
<211> 222
<212> PRT
<213> Severe acute respiratory syndrome-related coronavirus

<400> 18
Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr Asn Leu 
1               5                   10                  15      


Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser Val Tyr 
            20                  25                  30          


Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr Ser Val 
        35                  40                  45              


Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser 
    50                  55                  60                  


Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala Asp Ser 
65                  70                  75                  80  


Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly Gln Thr 
                85                  90                  95      


Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Met Gly 
            100                 105                 110         


Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser Thr Gly 
        115                 120                 125             


Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu Arg Pro 
    130                 135                 140                 


Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly Lys Pro 
145                 150                 155                 160 


Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp Tyr Gly 
                165                 170                 175     


Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val Val Val 
            180                 185                 190         


Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly Pro Lys 
        195                 200                 205             


Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn 
    210                 215                 220         


<210> 19
<211> 8011
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 19
ggccattgca tacgttgtat ccatatcata atatgtacat ttatattggc tcatgtccaa       60

cattaccgcc atgttgacat tgattattga ctagttatta atagtaatca attacggggt      120

cattagttca tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc      180

ctggctgacc gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag      240

taacgccaat agggactttc cattgacgtc aatgggtgga ctatttacgg taaactgccc      300

acttggcagt acatcaagtg tatcatatgc caagtacgcc ccctattgac gtcaatgacg      360

gtaaatggcc cgcctggcat tatgcccagt acatgacctt atgggacttt cctacttggc      420

agtacatcta cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacatca      480

atgggcgtgg atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca      540

atgggagttt gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactccg      600

ccccattgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata agcagagctc      660

tctggctaac tagagaaccc actgcttact ggcttatcga aattaatacg actcactata      720

gggagaccca agctggctag cgtttaaacg ggccctctag actcgaggcc accatgaagt      780

gggtaacctt tatttccctt ctttttctct ttagctcggc ttattccagg ggtgtgtttc      840

gtcgagatgc acacaagagt gaggttgctc atcggtttaa agatttggga gaagaaaatt      900

tcaaagcctt ggtgttgatt gcctttgctc agtatcttca gcagtgtcca tttgaagatc      960

atgtaaaatt agtgaatgaa gtaactgaat ttgcaaaaac atgtgttgct gatgagtcag     1020

ctgaaaattg tgacaaatca cttcataccc tttttggaga caaattatgc acagttgcaa     1080

ctcttcgtga aacctatggt gaaatggctg actgctgtgc aaaacaagaa cctgagagaa     1140

atgaatgctt cttgcaacac aaagatgaca acccaaacct cccccgattg gtgagaccag     1200

aggttgatgt gatgtgcact gcttttcatg acaatgaaga gacatttttg aaaaaatact     1260

tatatgaaat tgccagaaga catccttact tttatgcccc ggaactcctt ttctttgcta     1320

aaaggtataa agctgctttt acagaatgtt gccaagctgc tgataaagct gcctgcctgt     1380

tgccaaagct cgatgaactt cgggatgaag ggaaggcttc gtctgccaaa cagagactca     1440

agtgtgccag tctccaaaaa tttggagaaa gagctttcaa agcatgggca gtagctcgcc     1500

tgagccagag atttcccaaa gctgagtttg cagaagtttc caagttagtg acagatctta     1560

ccaaagtcca cacggaatgc tgccatggag atctgcttga atgtgctgat gacagggcgg     1620

accttgccaa gtatatctgt gaaaatcaag attcgatctc cagtaaactg aaggaatgct     1680

gtgaaaaacc tctgttggaa aaatcccact gcattgccga agtggaaaat gatgagatgc     1740

ctgctgactt gccttcatta gctgctgatt ttgttgaaag taaggatgtt tgcaaaaact     1800

atgctgaggc aaaggatgtc ttcctgggca tgtttttgta tgaatatgca agaaggcatc     1860

ctgattactc tgtcgtgctg ctgctgagac ttgccaagac atatgaaacc actctagaga     1920

agtgctgtgc cgctgcagat cctcatgaat gctatgccaa agtgttcgat gaatttaaac     1980

ctcttgtgga agagcctcag aatttaatca aacaaaattg tgagcttttt gagcagcttg     2040

gagagtacaa attccagaat gcgctattag ttcgttacac caagaaagta ccccaagtgt     2100

caactccaac tcttgtagag gtctcaagaa acctaggaaa agtgggcagc aaatgttgta     2160

aacatcctga agcaaaaaga atgccctgtg cagaagacta tctatccgtg gtcctgaacc     2220

agttatgtgt gttgcatgag aaaacgccag taagtgacag agtcaccaaa tgctgcacag     2280

aatccttggt gaacaggcga ccatgctttt cagctctgga agtcgatgaa acatacgttc     2340

ccaaagagtt taatgctgaa acgttcacct tccatgcaga tatatgcaca ctttctgaga     2400

aggagagaca aatcaagaaa caaactgcac ttgttgagct tgtgaaacac aagcccaagg     2460

caacaaaaga gcaactgaaa gctgttatgg atgatttcgc agcttttgta gagaagtgct     2520

gcaaggctga cgataaggag acctgctttg ccgaggaggg taaaaaactt gttgctgcaa     2580

gtcaagctgc cttaggctta gaattcatgc gcgtgcagcc cactgagtcc atagtgaggt     2640

ttcctaacat aactaacctc tgcccattcg gggaagtgtt taacgccacc cggttcgcta     2700

gcgtgtacgc ctggaaccgt aagaggatta gtaactgcgt agctgactat agcgtactgt     2760

ataatagcgc tagttttagc acctttaagt gctacggggt cagccccact aagctgaatg     2820

atttgtgctt cactaacgtg tacgctgata gcttcgtaat taggggggat gaggtgagac     2880

agatagcccc cggacagacc ggcaagatcg ctgattacaa ctacaagctg cctgatgact     2940

tcacaggctg cgtgatcgcc tggaactcta acaacttgga ctctaaggtc ggcggaaact     3000

acaattacct ttaccgcctg tttagaaagt ccaacctgaa acccttcgag cgggatatca     3060

gcacagagat ctaccaggcc gggagcaccc cctgcaacgg ggtggagggc ttcaactgct     3120

acttccccct gcagtcctac gggttccagc caaccaacgg cgtgggctac cagccctaca     3180

gagtcgtggt gctgtccttt gagctgctgc acgcccccgc taccgtctgc ggccccaaga     3240

agtccacaaa cctcgtgaag aacaagtgcg tgaactttga ggccaagcca tccgggagcg     3300

tcgtggagca ggccgaaggg gtcgagtgcg acttcagccc actgctgagc ggcacccccc     3360

cacaggtgta caactttaag aggctggtgt tcactaactg caactacaac ctcaccaagc     3420

tcctgtccct gttctccgtg aacgacttca catgcagcca gatcagccca gccgccatcg     3480

cctccaactg ctacagcagc ctgatcctcg actacttctc ctaccccctc agcatgaagt     3540

ccgacctctc cgtgtccagc gccggcccca ttagccagtt taactacaag cagagctttt     3600

ccaaccccac ctgcctgatc ctggccacag tgccccataa cctgaccaca attaccaagc     3660

ccctgaagta cagctacatt aacaagtgct ccaggttcct cagcgatgat cggaccgagg     3720

tgccccagct cgtcaacgcc aaccagtaca gcccttgcgt gagcattgtc cccagcaccg     3780

tgtgggagga cggcgactac tacagaaagc agctgagccc tctggagggc ggcgggtggc     3840

tggtggcctc cgggagcaca gtggccatga cagagcagct gcagatgggg ttcggcatta     3900

ctgtgcagta cggaacagat acaaacagcg tgtgccctaa gctggagttc gccaacgaca     3960

ctaagatcgc ctcccagctg ggcaactgcg tcgagtacag ggtggtgccc agcggggacg     4020

tggtgcggtt cccaaacatc accaacctgt gccccttcgg ggaggtgttc aacgccacaa     4080

agttccctag cgtctacgcc tgggagcgga agaagattag caactgcgtg gccgactact     4140

ccgtgctgta caactccacc ttcttctcca cattcaagtg ctacggcgtg agcgccacaa     4200

agctgaacga cctctgcttc agcaacgtgt acgccgacag cttcgtggtc aagggcgatg     4260

atgtgcggca gatcgccccc ggccagaccg gcgtgatcgc cgattacaac tataagctgc     4320

ccgacgactt catggggtgc gtgctggcct ggaacacaag gaacattgat gccaccagca     4380

caggcaacta caactacaag tacaggtacc tgaggcacgg gaagctgcgg cccttcgagc     4440

gggacatctc caacgtgccc ttcagccccg acggcaagcc ctgcaccccc cccgccctga     4500

actgctactg gcccctgaac gattacggct tctacacaac caccggcatt ggctaccagc     4560

cttaccgggt cgtggtgctg agctttgagc tgctgaacgc ccccgccacc gtgtgcgggc     4620

ctaagctgag cactgacctg attaagaacc agtgcgtgaa ctttaactga tgaggatcca     4680

gatctttttc cctctgccaa aaattatggg gacatcatga agccccttga gcatctgact     4740

tctggctaat aaaggaaatt tattttcatt gcaatagtgt gttggaattt tttgtgtctc     4800

tcactcggaa ggacatatgg gagggcaaat catttaaaac atcagaatga gtatttggtt     4860

tagagtttgg caacatatgc ccattcttcc gcttcctcgc tcactgactc gctgcgctcg     4920

gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca     4980

gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac     5040

cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac     5100

aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg     5160

tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac     5220

ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat     5280

ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag     5340

cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac     5400

ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt     5460

gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaagaac agtatttggt     5520

atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc     5580

aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga     5640

aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac     5700

gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc     5760

cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct     5820

gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca     5880

tccatagttg cctgactcgg gggggggggg cgctgaggtc tgcctcgtga agaaggtgtt     5940

gctgactcat accagggcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg     6000

tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc     6060

cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag     6120

ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg     6180

ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag     6240

tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat     6300

agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg     6360

atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacctga atcgccccat     6420

catccagcca gaaagtgagg gagccacggt tgatgagagc tttgttgtag gtggaccagt     6480

tggtgatttt gaacttttgc tttgccacgg aacggtctgc gttgtcggga agatgcgtga     6540

tctgatcctt caactcagca aaagttcgat ttattcaaca aagccgccgt cccgtcaagt     6600

cagcgtaatg ctctgccagt gttacaacca attaaccaat tctgattaga aaaactcatc     6660

gagcatcaaa tgaaactgca atttattcat atcaggatta tcaataccat atttttgaaa     6720

aagccgtttc tgtaatgaag gagaaaactc accgaggcag ttccatagga tggcaagatc     6780

ctggtatcgg tctgcgattc cgactcgtcc aacatcaata caacctatta atttcccctc     6840

gtcaaaaata aggttatcaa gtgagaaatc accatgagtg acgactgaat ccggtgagaa     6900

tggcaaaagc ttatgcattt ctttccagac ttgttcaaca ggccagccat tacgctcgtc     6960

atcaaaatca ctcgcatcaa ccaaaccgtt attcattcgt gattgcgcct gagcgagacg     7020

aaatacgcga tcgctgttaa aaggacaatt acaaacagga atcgaatgca accggcgcag     7080

gaacactgcc agcgcatcaa caatattttc acctgaatca ggatattctt ctaatacctg     7140

gaatgctgtt ttcccgggga tcgcagtggt gagtaaccat gcatcatcag gagtacggat     7200

aaaatgcttg atggtcggaa gaggcataaa ttccgtcagc cagtttagtc tgaccatctc     7260

atctgtaaca tcattggcaa cgctaccttt gccatgtttc agaaacaact ctggcgcatc     7320

gggcttccca tacaatcgat agattgtcgc acctgattgc ccgacattat cgcgagccca     7380

tttataccca tataaatcag catccatgtt ggaatttaat cgcggcctcg agcaagacgt     7440

ttcccgttga atatggctca taacacccct tgtattactg tttatgtaag cagacagttt     7500

tattgttcat gatgatatat ttttatcttg tgcaatgtaa catcagagat tttgagacac     7560

aacgtggctt tccccccccc cccattattg aagcatttat cagggttatt gtctcatgag     7620

cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc     7680

ccgaaaagtg ccacctgacg tctaagaaac cattattatc atgacattaa cctataaaaa     7740

taggcgtatc acgaggccct ttcgtctcgc gcgtttcggt gatgacggtg aaaacctctg     7800

acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca     7860

agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggctggctta actatgcggc     7920

atcagagcag attgtactga gagtgcacca tatgcggtgt gaaataccgc acagatgcgt     7980

aaggagaaaa taccgcatca gattggctat t                                    8011


<210> 20
<211> 999
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 20
Met Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala 
1               5                   10                  15      


Tyr Ser Arg Gly Val Phe Arg Arg Asp Ala His Lys Ser Glu Val Ala 
            20                  25                  30          


His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys Ala Leu Val Leu 
        35                  40                  45              


Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu Asp His Val 
    50                  55                  60                  


Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp 
65                  70                  75                  80  


Glu Ser Ala Glu Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp 
                85                  90                  95      


Lys Leu Cys Thr Val Ala Thr Leu Arg Glu Thr Tyr Gly Glu Met Ala 
            100                 105                 110         


Asp Cys Cys Ala Lys Gln Glu Pro Glu Arg Asn Glu Cys Phe Leu Gln 
        115                 120                 125             


His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg Pro Glu Val 
    130                 135                 140                 


Asp Val Met Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys 
145                 150                 155                 160 


Lys Tyr Leu Tyr Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro 
                165                 170                 175     


Glu Leu Leu Phe Phe Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu Cys 
            180                 185                 190         


Cys Gln Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro Lys Leu Asp Glu 
        195                 200                 205             


Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg Leu Lys Cys 
    210                 215                 220                 


Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val 
225                 230                 235                 240 


Ala Arg Leu Ser Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser 
                245                 250                 255     


Lys Leu Val Thr Asp Leu Thr Lys Val His Thr Glu Cys Cys His Gly 
            260                 265                 270         


Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu Ala Lys Tyr Ile 
        275                 280                 285             


Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu Cys Cys Glu 
    290                 295                 300                 


Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp 
305                 310                 315                 320 


Glu Met Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser 
                325                 330                 335     


Lys Asp Val Cys Lys Asn Tyr Ala Glu Ala Lys Asp Val Phe Leu Gly 
            340                 345                 350         


Met Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp Tyr Ser Val Val 
        355                 360                 365             


Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys 
    370                 375                 380                 


Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu 
385                 390                 395                 400 


Phe Lys Pro Leu Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys 
                405                 410                 415     


Glu Leu Phe Glu Gln Leu Gly Glu Tyr Lys Phe Gln Asn Ala Leu Leu 
            420                 425                 430         


Val Arg Tyr Thr Lys Lys Val Pro Gln Val Ser Thr Pro Thr Leu Val 
        435                 440                 445             


Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His 
    450                 455                 460                 


Pro Glu Ala Lys Arg Met Pro Cys Ala Glu Asp Tyr Leu Ser Val Val 
465                 470                 475                 480 


Leu Asn Gln Leu Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg 
                485                 490                 495     


Val Thr Lys Cys Cys Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe 
            500                 505                 510         


Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys Glu Phe Asn Ala 
        515                 520                 525             


Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser Glu Lys Glu 
    530                 535                 540                 


Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys 
545                 550                 555                 560 


Pro Lys Ala Thr Lys Glu Gln Leu Lys Ala Val Met Asp Asp Phe Ala 
                565                 570                 575     


Ala Phe Val Glu Lys Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Phe 
            580                 585                 590         


Ala Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gln Ala Ala Leu Gly 
        595                 600                 605             


Leu Glu Phe Met Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro 
    610                 615                 620                 


Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg 
625                 630                 635                 640 


Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val 
                645                 650                 655     


Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys 
            660                 665                 670         


Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn 
        675                 680                 685             


Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile 
    690                 695                 700                 


Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro 
705                 710                 715                 720 


Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp 
                725                 730                 735     


Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys 
            740                 745                 750         


Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln 
        755                 760                 765             


Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe 
    770                 775                 780                 


Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln 
785                 790                 795                 800 


Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala 
                805                 810                 815     


Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys 
            820                 825                 830         


Val Asn Phe Glu Ala Lys Pro Ser Gly Ser Val Val Glu Gln Ala Glu 
        835                 840                 845             


Gly Val Glu Cys Asp Phe Ser Pro Leu Leu Ser Gly Thr Pro Pro Gln 
    850                 855                 860                 


Val Tyr Asn Phe Lys Arg Leu Val Phe Thr Asn Cys Asn Tyr Asn Leu 
865                 870                 875                 880 


Thr Lys Leu Leu Ser Leu Phe Ser Val Asn Asp Phe Thr Cys Ser Gln 
                885                 890                 895     


Ile Ser Pro Ala Ala Ile Ala Ser Asn Cys Tyr Ser Ser Leu Ile Leu 
            900                 905                 910         


Asp Tyr Phe Ser Tyr Pro Leu Ser Met Lys Ser Asp Leu Ser Val Ser 
        915                 920                 925             


Ser Ala Gly Pro Ile Ser Gln Phe Asn Tyr Lys Gln Ser Phe Ser Asn 
    930                 935                 940                 


Pro Thr Cys Leu Ile Leu Ala Thr Val Pro His Asn Leu Thr Thr Ile 
945                 950                 955                 960 


Thr Lys Pro Leu Lys Tyr Ser Tyr Ile Asn Lys Cys Ser Arg Phe Leu 
                965                 970                 975     


Ser Asp Asp Arg Thr Glu Val Pro Gln Leu Val Asn Ala Asn Gln Tyr 
            980                 985                 990         


Ser Pro Cys Val Ser Ile Val 
        995                 


<210> 21
<211> 7929
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 21
ggccattgca tacgttgtat ccatatcata atatgtacat ttatattggc tcatgtccaa       60

cattaccgcc atgttgacat tgattattga ctagttatta atagtaatca attacggggt      120

cattagttca tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc      180

ctggctgacc gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag      240

taacgccaat agggactttc cattgacgtc aatgggtgga ctatttacgg taaactgccc      300

acttggcagt acatcaagtg tatcatatgc caagtacgcc ccctattgac gtcaatgacg      360

gtaaatggcc cgcctggcat tatgcccagt acatgacctt atgggacttt cctacttggc      420

agtacatcta cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacatca      480

atgggcgtgg atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca      540

atgggagttt gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactccg      600

ccccattgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata agcagagctc      660

tctggctaac tagagaaccc actgcttact ggcttatcga aattaatacg actcactata      720

gggagaccca agctggctag cgtttaaacg ggccctctag actcgaggcc accatgaagt      780

gggtaacctt tatttccctt ctttttctct ttagctcggc ttattccagg ggtgtgtttc      840

gtcgagatgc acacaagagt gaggttgctc atcggtttaa agatttggga gaagaaaatt      900

tcaaagcctt ggtgttgatt gcctttgctc agtatcttca gcagtgtcca tttgaagatc      960

atgtaaaatt agtgaatgaa gtaactgaat ttgcaaaaac atgtgttgct gatgagtcag     1020

ctgaaaattg tgacaaatca cttcataccc tttttggaga caaattatgc acagttgcaa     1080

ctcttcgtga aacctatggt gaaatggctg actgctgtgc aaaacaagaa cctgagagaa     1140

atgaatgctt cttgcaacac aaagatgaca acccaaacct cccccgattg gtgagaccag     1200

aggttgatgt gatgtgcact gcttttcatg acaatgaaga gacatttttg aaaaaatact     1260

tatatgaaat tgccagaaga catccttact tttatgcccc ggaactcctt ttctttgcta     1320

aaaggtataa agctgctttt acagaatgtt gccaagctgc tgataaagct gcctgcctgt     1380

tgccaaagct cgatgaactt cgggatgaag ggaaggcttc gtctgccaaa cagagactca     1440

agtgtgccag tctccaaaaa tttggagaaa gagctttcaa agcatgggca gtagctcgcc     1500

tgagccagag atttcccaaa gctgagtttg cagaagtttc caagttagtg acagatctta     1560

ccaaagtcca cacggaatgc tgccatggag atctgcttga atgtgctgat gacagggcgg     1620

accttgccaa gtatatctgt gaaaatcaag attcgatctc cagtaaactg aaggaatgct     1680

gtgaaaaacc tctgttggaa aaatcccact gcattgccga agtggaaaat gatgagatgc     1740

ctgctgactt gccttcatta gctgctgatt ttgttgaaag taaggatgtt tgcaaaaact     1800

atgctgaggc aaaggatgtc ttcctgggca tgtttttgta tgaatatgca agaaggcatc     1860

ctgattactc tgtcgtgctg ctgctgagac ttgccaagac atatgaaacc actctagaga     1920

agtgctgtgc cgctgcagat cctcatgaat gctatgccaa agtgttcgat gaatttaaac     1980

ctcttgtgga agagcctcag aatttaatca aacaaaattg tgagcttttt gagcagcttg     2040

gagagtacaa attccagaat gcgctattag ttcgttacac caagaaagta ccccaagtgt     2100

caactccaac tcttgtagag gtctcaagaa acctaggaaa agtgggcagc aaatgttgta     2160

aacatcctga agcaaaaaga atgccctgtg cagaagacta tctatccgtg gtcctgaacc     2220

agttatgtgt gttgcatgag aaaacgccag taagtgacag agtcaccaaa tgctgcacag     2280

aatccttggt gaacaggcga ccatgctttt cagctctgga agtcgatgaa acatacgttc     2340

ccaaagagtt taatgctgaa acgttcacct tccatgcaga tatatgcaca ctttctgaga     2400

aggagagaca aatcaagaaa caaactgcac ttgttgagct tgtgaaacac aagcccaagg     2460

caacaaaaga gcaactgaaa gctgttatgg atgatttcgc agcttttgta gagaagtgct     2520

gcaaggctga cgataaggag acctgctttg ccgaggaggg taaaaaactt gttgctgcaa     2580

gtcaagctgc cttaggctta gaattcatgc gcgtgcagcc cactgagtcc atagtgaggt     2640

ttcctaacat aactaacctc tgcccattcg gggaagtgtt taacgccacc cggttcgcta     2700

gcgtgtacgc ctggaaccgt aagaggatta gtaactgcgt agctgactat agcgtactgt     2760

ataatagcgc tagttttagc acctttaagt gctacggggt cagccccact aagctgaatg     2820

atttgtgctt cactaacgtg tacgctgata gcttcgtaat taggggggat gaggtgagac     2880

agatagcccc cggacagacc ggcaagatcg ctgattacaa ctacaagctg cctgatgact     2940

tcacaggctg cgtgatcgcc tggaactcta acaacttgga ctctaaggtc ggcggaaact     3000

acaattacct ttaccgcctg tttagaaagt ccaacctgaa acccttcgag cgggatatca     3060

gcacagagat ctaccaggcc gggagcaccc cctgcaacgg ggtggagggc ttcaactgct     3120

acttccccct gcagtcctac gggttccagc caaccaacgg cgtgggctac cagccctaca     3180

gagtcgtggt gctgtccttt gagctgctgc acgcccccgc taccgtctgc ggccccaaga     3240

agtccacaaa cctcgtgaag aacaagtgcg tgaactttga ggccaagcca tccgggagcg     3300

tcgtggagca ggccgaaggg gtcgagtgcg acttcagccc actgctgagc ggcacccccc     3360

cacaggtgta caactttaag aggctggtgt tcactaactg caactacaac ctcaccaagc     3420

tcctgtccct gttctccgtg aacgacttca catgcagcca gatcagccca gccgccatcg     3480

cctccaactg ctacagcagc ctgatcctcg actacttctc ctaccccctc agcatgaagt     3540

ccgacctctc cgtgtccagc gccggcccca ttagccagtt taactacaag cagagctttt     3600

ccaaccccac ctgcctgatc ctggccacag tgccccataa cctgaccaca attaccaagc     3660

ccctgaagta cagctacatt aacaagtgct ccaggttcct cagcgatgat cggaccgagg     3720

tgccccagct cgtcaacgcc aaccagtaca gcccttgcgt gagcattgtc cccagcaccg     3780

tgtgggagga cggcgactac tacagaaagc agctgagccc tctggagggc ggcgggtggc     3840

tggtggcctc cgggagcaca gtggccatga cagagcagct gcagatgggg ttcggcatta     3900

ctgtgcagta cggaacagat acaaacagcg tgtgccctaa gctggagttc gccaacgaca     3960

ctaagatcgc ctcccagctg ggcaactgcg tcgagtacag ggtggtgccc agcggggacg     4020

tggtgcggtt cccaaacatc accaacctgt gccccttcgg ggaggtgttc aacgccacaa     4080

agttccctag cgtctacgcc tgggagcgga agaagattag caactgcgtg gccgactact     4140

ccgtgctgta caactccacc ttcttctcca cattcaagtg ctacggcgtg agcgccacaa     4200

agctgaacga cctctgcttc agcaacgtgt acgccgacag cttcgtggtc aagggcgatg     4260

atgtgcggca gatcgccccc ggccagaccg gcgtgatcgc cgattacaac tataagctgc     4320

ccgacgactt catggggtgc gtgctggcct ggaacacaag gaacattgat gccaccagca     4380

caggcaacta caactacaag tacaggtacc tgaggcacgg gaagctgcgg cccttcgagc     4440

gggacatctc caacgtgccc ttcagccccg acggcaagcc ctgcaccccc cccgccctga     4500

actgctactg gcccctgaac gattacggct tctacacaac caccggcatt ggctaccagc     4560

cttaccgggt cgtggtgctg agctttgagc tgctgaacgc ccccgccacc gtgtgcgggc     4620

ctaagctgag cactgacctg attaagaacc agtgcgtgaa ctttaactga tgaggatcca     4680

gatctttttc cctctgccaa aaattatggg gacatcatga agccccttga gcatctgact     4740

tctggctaat aaaggaaatt tattttcatt gcaatagtgt gttggaattt tttgtgtctc     4800

tcactcggaa ggacatatgg gagggcaaat catttaaaac atcagaatga gtatttggtt     4860

tagagtttgg caacatatgc ccattcttcc gcttcctcgc tcactgactc gctgcgctcg     4920

gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca     4980

gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac     5040

cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac     5100

aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg     5160

tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac     5220

ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat     5280

ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag     5340

cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac     5400

ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt     5460

gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaagaac agtatttggt     5520

atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc     5580

aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga     5640

aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac     5700

gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc     5760

cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct     5820

gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca     5880

tccatagttg cctgactcgg gggggggggg cgctgaggtc tgcctcgtga agaaggtgtt     5940

gctgactcat accagggcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg     6000

tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc     6060

cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag     6120

ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg     6180

ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag     6240

tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat     6300

agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg     6360

atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacctga atcgccccat     6420

catccagcca gaaagtgagg gagccacggt tgatgagagc tttgttgtag gtggaccagt     6480

tggtgatttt gaacttttgc tttgccacgg aacggtctgc gttgtcggga agatgcgtga     6540

tctgatcctt caactcagca aaagttcgat ttattcaaca aagccgccgt cccgtcaagt     6600

cagcgtaatg ctctgccagt gttacaacca attaaccaat tctgattaga aaaactcatc     6660

gagcatcaaa tgaaactgca atttattcat atcaggatta tcaataccat atttttgaaa     6720

aagccgtttc tgtaatgaag gagaaaactc accgaggcag ttccatagga tggcaagatc     6780

ctggtatcgg tctgcgattc cgactcgtcc aacatcaata caacctatta atttcccctc     6840

gtcaaaaata aggttatcaa gtgagaaatc accatgagtg acgactgaat ccggtgagaa     6900

tggcaaaagc ttatgcattt ctttccagac ttgttcaaca ggccagccat tacgctcgtc     6960

atcaaaatca ctcgcatcaa ccaaaccgtt attcattcgt gattgcgcct gagcgagacg     7020

aaatacgcga tcgctgttaa aaggacaatt acaaacagga atcgaatgca accggcgcag     7080

gaacactgcc agcgcatcaa caatattttc acctgaatca ggatattctt ctaatacctg     7140

gaatgctgtt ttcccgggga tcgcagtggt gagtaaccat gcatcatcag gagtacggat     7200

aaaatgcttg atggtcggaa gaggcataaa ttccgtcagc cagtttagtc tgaccatctc     7260

atctgtaaca tcattggcaa cgctaccttt gccatgtttc agaaacaact ctggcgcatc     7320

gggcttccca tacaatcgat agattgtcgc acctgattgc ccgacattat cgcgagccca     7380

tttataccca tataaatcag catccatgtt ggaatttaat cgcggcctcg agcaagacgt     7440

ttcccgttga atatggctca taacacccct tgtattactg tttatgtaag cagacagttt     7500

tattgttcat gatgatatat ttttatcttg tgcaatgtaa catcagagat tttgagacac     7560

aacgtggctt tccccccccc cccattattg aagcatttat cagggttatt gtctcatgag     7620

cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc     7680

ccgaaaagtg ccacctgacg tctaagaaac cattattatc atgacattaa cctataaaaa     7740

taggcgtatc acgaggccct ttcgtctcgc gcgtttcggt gatgacggtg aaaacctctg     7800

acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca     7860

agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggctggctta actatgcggc     7920

atcagagca                                                             7929


