                         SEQUENCE LISTING

<110>  The Board of Regents of The University of Texas System
       Westport Bio, LLC
 
<120>  PROTEINS, POLYNUCLEOTIDES, AND METHODS FOR TREATING CORONAVIRUS 
       INFECTION

<130>  0265-000094WO01

<150>  63/064,083
<151>  2020-08-11

<160>  33    

<170>  PatentIn version 3.5

<210>  1
<211>  674
<212>  PRT
<213>  SARS-CoV-2

<400>  1

Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser 
1               5                   10                  15      


Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val 
            20                  25                  30          


Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr 
        35                  40                  45              


Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe 
    50                  55                  60                  


Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr 
65                  70                  75                  80  


Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp 
                85                  90                  95      


Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val 
            100                 105                 110         


Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val 
        115                 120                 125             


Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val 
    130                 135                 140                 


Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe 
145                 150                 155                 160 


Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu 
                165                 170                 175     


Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His 
            180                 185                 190         


Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu 
        195                 200                 205             


Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln 
    210                 215                 220                 


Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser 
225                 230                 235                 240 


Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln 
                245                 250                 255     


Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp 
            260                 265                 270         


Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu 
        275                 280                 285             


Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg 
    290                 295                 300                 


Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu 
305                 310                 315                 320 


Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr 
                325                 330                 335     


Val Glu Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala 
            340                 345                 350         


Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys 
        355                 360                 365             


Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val 
    370                 375                 380                 


Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala 
385                 390                 395                 400 


Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp 
                405                 410                 415     


Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser 
            420                 425                 430         


Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser 
        435                 440                 445             


Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala 
    450                 455                 460                 


Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro 
465                 470                 475                 480 


Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro 
                485                 490                 495     


Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr 
            500                 505                 510         


Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val 
        515                 520                 525             


Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser 
    530                 535                 540                 


Asn Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp 
545                 550                 555                 560 


Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile 
                565                 570                 575     


Thr Pro Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn 
            580                 585                 590         


Thr Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu 
        595                 600                 605             


Val Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val 
    610                 615                 620                 


Tyr Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile 
625                 630                 635                 640 


Gly Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly 
                645                 650                 655     


Ala Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg 
            660                 665                 670         


Ala Arg 
        


<210>  2
<211>  208
<212>  PRT
<213>  SARS-CoV-2

<400>  2

Val Glu Cys Asp Phe Ser Pro Leu Leu Ser Gly Thr Pro Pro Gln Val 
1               5                   10                  15      


Tyr Asn Phe Lys Arg Leu Val Phe Thr Asn Cys Asn Tyr Asn Leu Thr 
            20                  25                  30          


Lys Leu Leu Ser Leu Phe Ser Val Asn Asp Phe Thr Cys Ser Gln Ile 
        35                  40                  45              


Ser Pro Ala Ala Ile Ala Ser Asn Cys Tyr Ser Ser Leu Ile Leu Asp 
    50                  55                  60                  


Tyr Phe Ser Tyr Pro Leu Ser Met Lys Ser Asp Leu Ser Val Ser Ser 
65                  70                  75                  80  


Ala Gly Pro Ile Ser Gln Phe Asn Tyr Lys Gln Ser Phe Ser Asn Pro 
                85                  90                  95      


Thr Cys Leu Ile Leu Ala Thr Val Pro His Asn Leu Thr Thr Ile Thr 
            100                 105                 110         


Lys Pro Leu Lys Tyr Ser Tyr Ile Asn Lys Cys Ser Arg Leu Leu Ser 
        115                 120                 125             


Asp Asp Arg Thr Glu Val Pro Gln Leu Val Asn Ala Asn Gln Tyr Ser 
    130                 135                 140                 


Pro Cys Val Ser Ile Val Pro Ser Thr Val Trp Glu Asp Gly Asp Tyr 
145                 150                 155                 160 


Tyr Arg Lys Gln Leu Ser Pro Leu Glu Gly Gly Gly Trp Leu Val Ala 
                165                 170                 175     


Ser Gly Ser Thr Val Ala Met Thr Glu Gln Leu Gln Met Gly Phe Gly 
            180                 185                 190         


Ile Thr Val Gln Tyr Gly Thr Asp Thr Asn Ser Val Cys Pro Lys Leu 
        195                 200                 205             


<210>  3
<211>  242
<212>  PRT
<213>  SARS-CoV-2

<400>  3

Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln 
1               5                   10                  15      


Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro 
            20                  25                  30          


Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr Val Glu 
        35                  40                  45              


Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr 
    50                  55                  60                  


Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly 
65                  70                  75                  80  


Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala 
                85                  90                  95      


Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly 
            100                 105                 110         


Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe 
        115                 120                 125             


Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val 
    130                 135                 140                 


Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu 
145                 150                 155                 160 


Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser 
                165                 170                 175     


Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln 
            180                 185                 190         


Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg 
        195                 200                 205             


Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys 
    210                 215                 220                 


Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe 
225                 230                 235                 240 


Asn Phe 
        


<210>  4
<211>  164
<212>  PRT
<213>  SARS-CoV-2

<400>  4

Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro His Gly Val Val Phe 
1               5                   10                  15      


Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala 
            20                  25                  30          


Pro Ala Ile Cys His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val 
        35                  40                  45              


Phe Val Ser Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr 
    50                  55                  60                  


Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys 
65                  70                  75                  80  


Asp Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln 
                85                  90                  95      


Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn 
            100                 105                 110         


His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala 
        115                 120                 125             


Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala 
    130                 135                 140                 


Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr 
145                 150                 155                 160 


Glu Gln Tyr Ile 
                


<210>  5
<211>  218
<212>  PRT
<213>  SARS-CoV-2

<400>  5

Met Ala Asp Ser Asn Gly Thr Ile Thr Val Glu Glu Leu Lys Lys Leu 
1               5                   10                  15      


Leu Glu Gln Trp Asn Leu Val Ile Gly Phe Leu Phe Leu Thr Trp Ile 
            20                  25                  30          


Cys Leu Leu Gln Phe Ala Tyr Ala Asn Arg Asn Arg Phe Leu Tyr Ile 
        35                  40                  45              


Ile Lys Leu Ile Phe Leu Trp Leu Leu Trp Pro Val Thr Leu Ala Cys 
    50                  55                  60                  


Phe Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Ile Thr Gly Gly Ile 
65                  70                  75                  80  


Ala Ile Ala Met Ala Cys Leu Val Gly Leu Met Trp Leu Ser Tyr Phe 
                85                  90                  95      


Ile Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met Trp Ser Phe 
            100                 105                 110         


Asn Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu His Gly Thr Ile 
        115                 120                 125             


Leu Thr Arg Pro Leu Leu Glu Ser Glu Leu Val Ile Gly Ala Val Ile 
    130                 135                 140                 


Leu Arg Gly His Leu Arg Ile Ala Gly His His Leu Gly Arg Cys Asp 
145                 150                 155                 160 


Ile Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser Arg Thr Leu 
                165                 170                 175     


Ser Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Ala Gly Asp Ser Gly 
            180                 185                 190         


Phe Ala Ala Tyr Ser Arg Tyr Arg Ile Gly Asn Tyr Lys Leu Asn Thr 
        195                 200                 205             


Asp His Ser Ser Ser Ser Asp Asn Ile Ala 
    210                 215             


<210>  6
<211>  419
<212>  PRT
<213>  SARS-CoV-2

<400>  6

Met Ser Asp Asn Gly Pro Gln Asn Gln Arg Asn Ala Pro Arg Ile Thr 
1               5                   10                  15      


Phe Gly Gly Pro Ser Asp Ser Thr Gly Ser Asn Gln Asn Gly Glu Arg 
            20                  25                  30          


Ser Gly Ala Arg Ser Lys Gln Arg Arg Pro Gln Gly Leu Pro Asn Asn 
        35                  40                  45              


Thr Ala Ser Trp Phe Thr Ala Leu Thr Gln His Gly Lys Glu Asp Leu 
    50                  55                  60                  


Lys Phe Pro Arg Gly Gln Gly Val Pro Ile Asn Thr Asn Ser Ser Pro 
65                  70                  75                  80  


Asp Asp Gln Ile Gly Tyr Tyr Arg Arg Ala Thr Arg Arg Ile Arg Gly 
                85                  90                  95      


Gly Asp Gly Lys Met Lys Asp Leu Ser Pro Arg Trp Tyr Phe Tyr Tyr 
            100                 105                 110         


Leu Gly Thr Gly Pro Glu Ala Gly Leu Pro Tyr Gly Ala Asn Lys Asp 
        115                 120                 125             


Gly Ile Ile Trp Val Ala Thr Glu Gly Ala Leu Asn Thr Pro Lys Asp 
    130                 135                 140                 


His Ile Gly Thr Arg Asn Pro Ala Asn Asn Ala Ala Ile Val Leu Gln 
145                 150                 155                 160 


Leu Pro Gln Gly Thr Thr Leu Pro Lys Gly Phe Tyr Ala Glu Gly Ser 
                165                 170                 175     


Arg Gly Gly Ser Gln Ala Ser Ser Arg Ser Ser Ser Arg Ser Arg Asn 
            180                 185                 190         


Ser Ser Arg Asn Ser Thr Pro Gly Ser Ser Arg Gly Thr Ser Pro Ala 
        195                 200                 205             


Arg Met Ala Gly Asn Gly Gly Asp Ala Ala Leu Ala Leu Leu Leu Leu 
    210                 215                 220                 


Asp Arg Leu Asn Gln Leu Glu Ser Lys Met Ser Gly Lys Gly Gln Gln 
225                 230                 235                 240 


Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala Glu Ala Ser Lys 
                245                 250                 255     


Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Ala Tyr Asn Val Thr Gln 
            260                 265                 270         


Ala Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly Asn Phe Gly Asp 
        275                 280                 285             


Gln Glu Leu Ile Arg Gln Gly Thr Asp Tyr Lys His Trp Pro Gln Ile 
    290                 295                 300                 


Ala Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly Met Ser Arg Ile 
305                 310                 315                 320 


Gly Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr Tyr Thr Gly Ala 
                325                 330                 335     


Ile Lys Leu Asp Asp Lys Asp Pro Asn Phe Lys Asp Gln Val Ile Leu 
            340                 345                 350         


Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro Pro Thr Glu Pro 
        355                 360                 365             


Lys Lys Asp Lys Lys Lys Lys Ala Asp Glu Thr Gln Ala Leu Pro Gln 
    370                 375                 380                 


Arg Gln Lys Lys Gln Gln Thr Val Thr Leu Leu Pro Ala Ala Asp Leu 
385                 390                 395                 400 


Asp Asp Phe Ser Lys Gln Leu Gln Gln Ser Met Ser Ser Ala Asp Ser 
                405                 410                 415     


Thr Gln Ala 
            


<210>  7
<211>  1945
<212>  PRT
<213>  SARS-CoV-2

<400>  7

Ala Pro Thr Lys Val Thr Phe Gly Asp Asp Thr Val Ile Glu Val Gln 
1               5                   10                  15      


Gly Tyr Lys Ser Val Asn Ile Thr Phe Glu Leu Asp Glu Arg Ile Asp 
            20                  25                  30          


Lys Val Leu Asn Glu Lys Cys Ser Ala Tyr Thr Val Glu Leu Gly Thr 
        35                  40                  45              


Glu Val Asn Glu Phe Ala Cys Val Val Ala Asp Ala Val Ile Lys Thr 
    50                  55                  60                  


Leu Gln Pro Val Ser Glu Leu Leu Thr Pro Leu Gly Ile Asp Leu Asp 
65                  70                  75                  80  


Glu Trp Ser Met Ala Thr Tyr Tyr Leu Phe Asp Glu Ser Gly Glu Phe 
                85                  90                  95      


Lys Leu Ala Ser His Met Tyr Cys Ser Phe Tyr Pro Pro Asp Glu Asp 
            100                 105                 110         


Glu Glu Glu Gly Asp Cys Glu Glu Glu Glu Phe Glu Pro Ser Thr Gln 
        115                 120                 125             


Tyr Glu Tyr Gly Thr Glu Asp Asp Tyr Gln Gly Lys Pro Leu Glu Phe 
    130                 135                 140                 


Gly Ala Thr Ser Ala Ala Leu Gln Pro Glu Glu Glu Gln Glu Glu Asp 
145                 150                 155                 160 


Trp Leu Asp Asp Asp Ser Gln Gln Thr Val Gly Gln Gln Asp Gly Ser 
                165                 170                 175     


Glu Asp Asn Gln Thr Thr Thr Ile Gln Thr Ile Val Glu Val Gln Pro 
            180                 185                 190         


Gln Leu Glu Met Glu Leu Thr Pro Val Val Gln Thr Ile Glu Val Asn 
        195                 200                 205             


Ser Phe Ser Gly Tyr Leu Lys Leu Thr Asp Asn Val Tyr Ile Lys Asn 
    210                 215                 220                 


Ala Asp Ile Val Glu Glu Ala Lys Lys Val Lys Pro Thr Val Val Val 
225                 230                 235                 240 


Asn Ala Ala Asn Val Tyr Leu Lys His Gly Gly Gly Val Ala Gly Ala 
                245                 250                 255     


Leu Asn Lys Ala Thr Asn Asn Ala Met Gln Val Glu Ser Asp Asp Tyr 
            260                 265                 270         


Ile Ala Thr Asn Gly Pro Leu Lys Val Gly Gly Ser Cys Val Leu Ser 
        275                 280                 285             


Gly His Asn Leu Ala Lys His Cys Leu His Val Val Gly Pro Asn Val 
    290                 295                 300                 


Asn Lys Gly Glu Asp Ile Gln Leu Leu Lys Ser Ala Tyr Glu Asn Phe 
305                 310                 315                 320 


Asn Gln His Glu Val Leu Leu Ala Pro Leu Leu Ser Ala Gly Ile Phe 
                325                 330                 335     


Gly Ala Asp Pro Ile His Ser Leu Arg Val Cys Val Asp Thr Val Arg 
            340                 345                 350         


Thr Asn Val Tyr Leu Ala Val Phe Asp Lys Asn Leu Tyr Asp Lys Leu 
        355                 360                 365             


Val Ser Ser Phe Leu Glu Met Lys Ser Glu Lys Gln Val Glu Gln Lys 
    370                 375                 380                 


Ile Ala Glu Ile Pro Lys Glu Glu Val Lys Pro Phe Ile Thr Glu Ser 
385                 390                 395                 400 


Lys Pro Ser Val Glu Gln Arg Lys Gln Asp Asp Lys Lys Ile Lys Ala 
                405                 410                 415     


Cys Val Glu Glu Val Thr Thr Thr Leu Glu Glu Thr Lys Phe Leu Thr 
            420                 425                 430         


Glu Asn Leu Leu Leu Tyr Ile Asp Ile Asn Gly Asn Leu His Pro Asp 
        435                 440                 445             


Ser Ala Thr Leu Val Ser Asp Ile Asp Ile Thr Phe Leu Lys Lys Asp 
    450                 455                 460                 


Ala Pro Tyr Ile Val Gly Asp Val Val Gln Glu Gly Val Leu Thr Ala 
465                 470                 475                 480 


Val Val Ile Pro Thr Lys Lys Ala Gly Gly Thr Thr Glu Met Leu Ala 
                485                 490                 495     


Lys Ala Leu Arg Lys Val Pro Thr Asp Asn Tyr Ile Thr Thr Tyr Pro 
            500                 505                 510         


Gly Gln Gly Leu Asn Gly Tyr Thr Val Glu Glu Ala Lys Thr Val Leu 
        515                 520                 525             


Lys Lys Cys Lys Ser Ala Phe Tyr Ile Leu Pro Ser Ile Ile Ser Asn 
    530                 535                 540                 


Glu Lys Gln Glu Ile Leu Gly Thr Val Ser Trp Asn Leu Arg Glu Met 
545                 550                 555                 560 


Leu Ala His Ala Glu Glu Thr Arg Lys Leu Met Pro Val Cys Val Glu 
                565                 570                 575     


Thr Lys Ala Ile Val Ser Thr Ile Gln Arg Lys Tyr Lys Gly Ile Lys 
            580                 585                 590         


Ile Gln Glu Gly Val Val Asp Tyr Gly Ala Arg Phe Tyr Phe Tyr Thr 
        595                 600                 605             


Ser Lys Thr Thr Val Ala Ser Leu Ile Asn Thr Leu Asn Asp Leu Asn 
    610                 615                 620                 


Glu Thr Leu Val Thr Met Pro Leu Gly Tyr Val Thr His Gly Leu Asn 
625                 630                 635                 640 


Leu Glu Glu Ala Ala Arg Tyr Met Arg Ser Leu Lys Val Pro Ala Thr 
                645                 650                 655     


Val Ser Val Ser Ser Pro Asp Ala Val Thr Ala Tyr Asn Gly Tyr Leu 
            660                 665                 670         


Thr Ser Ser Ser Lys Thr Pro Glu Glu His Phe Ile Glu Thr Ile Ser 
        675                 680                 685             


Leu Ala Gly Ser Tyr Lys Asp Trp Ser Tyr Ser Gly Gln Ser Thr Gln 
    690                 695                 700                 


Leu Gly Ile Glu Phe Leu Lys Arg Gly Asp Lys Ser Val Tyr Tyr Thr 
705                 710                 715                 720 


Ser Asn Pro Thr Thr Phe His Leu Asp Gly Glu Val Ile Thr Phe Asp 
                725                 730                 735     


Asn Leu Lys Thr Leu Leu Ser Leu Arg Glu Val Arg Thr Ile Lys Val 
            740                 745                 750         


Phe Thr Thr Val Asp Asn Ile Asn Leu His Thr Gln Val Val Asp Met 
        755                 760                 765             


Ser Met Thr Tyr Gly Gln Gln Phe Gly Pro Thr Tyr Leu Asp Gly Ala 
    770                 775                 780                 


Asp Val Thr Lys Ile Lys Pro His Asn Ser His Glu Gly Lys Thr Phe 
785                 790                 795                 800 


Tyr Val Leu Pro Asn Asp Asp Thr Leu Arg Val Glu Ala Phe Glu Tyr 
                805                 810                 815     


Tyr His Thr Thr Asp Pro Ser Phe Leu Gly Arg Tyr Met Ser Ala Leu 
            820                 825                 830         


Asn His Thr Lys Lys Trp Lys Tyr Pro Gln Val Asn Gly Leu Thr Ser 
        835                 840                 845             


Ile Lys Trp Ala Asp Asn Asn Cys Tyr Leu Ala Thr Ala Leu Leu Thr 
    850                 855                 860                 


Leu Gln Gln Ile Glu Leu Lys Phe Asn Pro Pro Ala Leu Gln Asp Ala 
865                 870                 875                 880 


Tyr Tyr Arg Ala Arg Ala Gly Glu Ala Ala Asn Phe Cys Ala Leu Ile 
                885                 890                 895     


Leu Ala Tyr Cys Asn Lys Thr Val Gly Glu Leu Gly Asp Val Arg Glu 
            900                 905                 910         


Thr Met Ser Tyr Leu Phe Gln His Ala Asn Leu Asp Ser Cys Lys Arg 
        915                 920                 925             


Val Leu Asn Val Val Cys Lys Thr Cys Gly Gln Gln Gln Thr Thr Leu 
    930                 935                 940                 


Lys Gly Val Glu Ala Val Met Tyr Met Gly Thr Leu Ser Tyr Glu Gln 
945                 950                 955                 960 


Phe Lys Lys Gly Val Gln Ile Pro Cys Thr Cys Gly Lys Gln Ala Thr 
                965                 970                 975     


Lys Tyr Leu Val Gln Gln Glu Ser Pro Phe Val Met Met Ser Ala Pro 
            980                 985                 990         


Pro Ala Gln Tyr Glu Leu Lys His  Gly Thr Phe Thr Cys  Ala Ser Glu 
        995                 1000                 1005             


Tyr Thr  Gly Asn Tyr Gln Cys  Gly His Tyr Lys His  Ile Thr Ser 
    1010                 1015                 1020             


Lys Glu  Thr Leu Tyr Cys Ile  Asp Gly Ala Leu Leu  Thr Lys Ser 
    1025                 1030                 1035             


Ser Glu  Tyr Lys Gly Pro Ile  Thr Asp Val Phe Tyr  Lys Glu Asn 
    1040                 1045                 1050             


Ser Tyr  Thr Thr Thr Ile Lys  Pro Val Thr Tyr Lys  Leu Asp Gly 
    1055                 1060                 1065             


Val Val  Cys Thr Glu Ile Asp  Pro Lys Leu Asp Asn  Tyr Tyr Lys 
    1070                 1075                 1080             


Lys Asp  Asn Ser Tyr Phe Thr  Glu Gln Pro Ile Asp  Leu Val Pro 
    1085                 1090                 1095             


Asn Gln  Pro Tyr Pro Asn Ala  Ser Phe Asp Asn Phe  Lys Phe Val 
    1100                 1105                 1110             


Cys Asp  Asn Ile Lys Phe Ala  Asp Asp Leu Asn Gln  Leu Thr Gly 
    1115                 1120                 1125             


Tyr Lys  Lys Pro Ala Ser Arg  Glu Leu Lys Val Thr  Phe Phe Pro 
    1130                 1135                 1140             


Asp Leu  Asn Gly Asp Val Val  Ala Ile Asp Tyr Lys  His Tyr Thr 
    1145                 1150                 1155             


Pro Ser  Phe Lys Lys Gly Ala  Lys Leu Leu His Lys  Pro Ile Val 
    1160                 1165                 1170             


Trp His  Val Asn Asn Ala Thr  Asn Lys Ala Thr Tyr  Lys Pro Asn 
    1175                 1180                 1185             


Thr Trp  Cys Ile Arg Cys Leu  Trp Ser Thr Lys Pro  Val Glu Thr 
    1190                 1195                 1200             


Ser Asn  Ser Phe Asp Val Leu  Lys Ser Glu Asp Ala  Gln Gly Met 
    1205                 1210                 1215             


Asp Asn  Leu Ala Cys Glu Asp  Leu Lys Pro Val Ser  Glu Glu Val 
    1220                 1225                 1230             


Val Glu  Asn Pro Thr Ile Gln  Lys Asp Val Leu Glu  Cys Asn Val 
    1235                 1240                 1245             


Lys Thr  Thr Glu Val Val Gly  Asp Ile Ile Leu Lys  Pro Ala Asn 
    1250                 1255                 1260             


Asn Ser  Leu Lys Ile Thr Glu  Glu Val Gly His Thr  Asp Leu Met 
    1265                 1270                 1275             


Ala Ala  Tyr Val Asp Asn Ser  Ser Leu Thr Ile Lys  Lys Pro Asn 
    1280                 1285                 1290             


Glu Leu  Ser Arg Val Leu Gly  Leu Lys Thr Leu Ala  Thr His Gly 
    1295                 1300                 1305             


Leu Ala  Ala Val Asn Ser Val  Pro Trp Asp Thr Ile  Ala Asn Tyr 
    1310                 1315                 1320             


Ala Lys  Pro Phe Leu Asn Lys  Val Val Ser Thr Thr  Thr Asn Ile 
    1325                 1330                 1335             


Val Thr  Arg Cys Leu Asn Arg  Val Cys Thr Asn Tyr  Met Pro Tyr 
    1340                 1345                 1350             


Phe Phe  Thr Leu Leu Leu Gln  Leu Cys Thr Phe Thr  Arg Ser Thr 
    1355                 1360                 1365             


Asn Ser  Arg Ile Lys Ala Ser  Met Pro Thr Thr Ile  Ala Lys Asn 
    1370                 1375                 1380             


Thr Val  Lys Ser Val Gly Lys  Phe Cys Leu Glu Ala  Ser Phe Asn 
    1385                 1390                 1395             


Tyr Leu  Lys Ser Pro Asn Phe  Ser Lys Leu Ile Asn  Ile Ile Ile 
    1400                 1405                 1410             


Trp Phe  Leu Leu Leu Ser Val  Cys Leu Gly Ser Leu  Ile Tyr Ser 
    1415                 1420                 1425             


Thr Ala  Ala Leu Gly Val Leu  Met Ser Asn Leu Gly  Met Pro Ser 
    1430                 1435                 1440             


Tyr Cys  Thr Gly Tyr Arg Glu  Gly Tyr Leu Asn Ser  Thr Asn Val 
    1445                 1450                 1455             


Thr Ile  Ala Thr Tyr Cys Thr  Gly Ser Ile Pro Cys  Ser Val Cys 
    1460                 1465                 1470             


Leu Ser  Gly Leu Asp Ser Leu  Asp Thr Tyr Pro Ser  Leu Glu Thr 
    1475                 1480                 1485             


Ile Gln  Ile Thr Ile Ser Ser  Phe Lys Trp Asp Leu  Thr Ala Phe 
    1490                 1495                 1500             


Gly Leu  Val Ala Glu Trp Phe  Leu Ala Tyr Ile Leu  Phe Thr Arg 
    1505                 1510                 1515             


Phe Phe  Tyr Val Leu Gly Leu  Ala Ala Ile Met Gln  Leu Phe Phe 
    1520                 1525                 1530             


Ser Tyr  Phe Ala Val His Phe  Ile Ser Asn Ser Trp  Leu Met Trp 
    1535                 1540                 1545             


Leu Ile  Ile Asn Leu Val Gln  Met Ala Pro Ile Ser  Ala Met Val 
    1550                 1555                 1560             


Arg Met  Tyr Ile Phe Phe Ala  Ser Phe Tyr Tyr Val  Trp Lys Ser 
    1565                 1570                 1575             


Tyr Val  His Val Val Asp Gly  Cys Asn Ser Ser Thr  Cys Met Met 
    1580                 1585                 1590             


Cys Tyr  Lys Arg Asn Arg Ala  Thr Arg Val Glu Cys  Thr Thr Ile 
    1595                 1600                 1605             


Val Asn  Gly Val Arg Arg Ser  Phe Tyr Val Tyr Ala  Asn Gly Gly 
    1610                 1615                 1620             


Lys Gly  Phe Cys Lys Leu His  Asn Trp Asn Cys Val  Asn Cys Asp 
    1625                 1630                 1635             


Thr Phe  Cys Ala Gly Ser Thr  Phe Ile Ser Asp Glu  Val Ala Arg 
    1640                 1645                 1650             


Asp Leu  Ser Leu Gln Phe Lys  Arg Pro Ile Asn Pro  Thr Asp Gln 
    1655                 1660                 1665             


Ser Ser  Tyr Ile Val Asp Ser  Val Thr Val Lys Asn  Gly Ser Ile 
    1670                 1675                 1680             


His Leu  Tyr Phe Asp Lys Ala  Gly Gln Lys Thr Tyr  Glu Arg His 
    1685                 1690                 1695             


Ser Leu  Ser His Phe Val Asn  Leu Asp Asn Leu Arg  Ala Asn Asn 
    1700                 1705                 1710             


Thr Lys  Gly Ser Leu Pro Ile  Asn Val Ile Val Phe  Asp Gly Lys 
    1715                 1720                 1725             


Ser Lys  Cys Glu Glu Ser Ser  Ala Lys Ser Ala Ser  Val Tyr Tyr 
    1730                 1735                 1740             


Ser Gln  Leu Met Cys Gln Pro  Ile Leu Leu Leu Asp  Gln Ala Leu 
    1745                 1750                 1755             


Val Ser  Asp Val Gly Asp Ser  Ala Glu Val Ala Val  Lys Met Phe 
    1760                 1765                 1770             


Asp Ala  Tyr Val Asn Thr Phe  Ser Ser Thr Phe Asn  Val Pro Met 
    1775                 1780                 1785             


Glu Lys  Leu Lys Thr Leu Val  Ala Thr Ala Glu Ala  Glu Leu Ala 
    1790                 1795                 1800             


Lys Asn  Val Ser Leu Asp Asn  Val Leu Ser Thr Phe  Ile Ser Ala 
    1805                 1810                 1815             


Ala Arg  Gln Gly Phe Val Asp  Ser Asp Val Glu Thr  Lys Asp Val 
    1820                 1825                 1830             


Val Glu  Cys Leu Lys Leu Ser  His Gln Ser Asp Ile  Glu Val Thr 
    1835                 1840                 1845             


Gly Asp  Ser Cys Asn Asn Tyr  Met Leu Thr Tyr Asn  Lys Val Glu 
    1850                 1855                 1860             


Asn Met  Thr Pro Arg Asp Leu  Gly Ala Cys Ile Asp  Cys Ser Ala 
    1865                 1870                 1875             


Arg His  Ile Asn Ala Gln Val  Ala Lys Ser His Asn  Ile Ala Leu 
    1880                 1885                 1890             


Ile Trp  Asn Val Lys Asp Phe  Met Ser Leu Ser Glu  Gln Leu Arg 
    1895                 1900                 1905             


Lys Gln  Ile Arg Ser Ala Ala  Lys Lys Asn Asn Leu  Pro Phe Lys 
    1910                 1915                 1920             


Leu Thr  Cys Ala Thr Thr Arg  Gln Val Val Asn Val  Val Thr Thr 
    1925                 1930                 1935             


Lys Ile  Ala Leu Lys Gly Gly  
    1940                 1945 


<210>  8
<211>  111
<212>  PRT
<213>  SARS-CoV-2

<400>  8

Ala Pro Thr Lys Val Thr Phe Gly Asp Asp Thr Val Ile Glu Val Gln 
1               5                   10                  15      


Gly Tyr Lys Ser Val Asn Ile Thr Phe Glu Leu Asp Glu Arg Ile Asp 
            20                  25                  30          


Lys Val Leu Asn Glu Lys Cys Ser Ala Tyr Thr Val Glu Leu Gly Thr 
        35                  40                  45              


Glu Val Asn Glu Phe Ala Cys Val Val Ala Asp Ala Val Ile Lys Thr 
    50                  55                  60                  


Leu Gln Pro Val Ser Glu Leu Leu Thr Pro Leu Gly Ile Asp Leu Asp 
65                  70                  75                  80  


Glu Trp Ser Met Ala Thr Tyr Tyr Leu Phe Asp Glu Ser Gly Glu Phe 
                85                  90                  95      


Lys Leu Ala Ser His Met Tyr Cys Ser Phe Tyr Pro Pro Asp Glu 
            100                 105                 110     


<210>  9
<211>  82
<212>  PRT
<213>  SARS-CoV-2

<400>  9

Ser Asn Leu Gly Met Pro Ser Tyr Cys Thr Gly Tyr Arg Glu Gly Tyr 
1               5                   10                  15      


Leu Asn Ser Thr Asn Val Thr Ile Ala Thr Tyr Cys Thr Gly Ser Ile 
            20                  25                  30          


Pro Cys Ser Val Cys Leu Ser Gly Leu Asp Ser Leu Asp Thr Tyr Pro 
        35                  40                  45              


Ser Leu Glu Thr Ile Gln Ile Thr Ile Ser Ser Phe Lys Trp Asp Leu 
    50                  55                  60                  


Thr Ala Phe Gly Leu Val Ala Glu Trp Phe Leu Ala Tyr Ile Leu Phe 
65                  70                  75                  80  


Thr Arg 
        


<210>  10
<211>  198
<212>  PRT
<213>  SARS-CoV-2

<400>  10

Ala Ile Ala Ser Glu Phe Ser Ser Leu Pro Ser Tyr Ala Ala Phe Ala 
1               5                   10                  15      


Thr Ala Gln Glu Ala Tyr Glu Gln Ala Val Ala Asn Gly Asp Ser Glu 
            20                  25                  30          


Val Val Leu Lys Lys Leu Lys Lys Ser Leu Asn Val Ala Lys Ser Glu 
        35                  40                  45              


Phe Asp Arg Asp Ala Ala Met Gln Arg Lys Leu Glu Lys Met Ala Asp 
    50                  55                  60                  


Gln Ala Met Thr Gln Met Tyr Lys Gln Ala Arg Ser Glu Asp Lys Arg 
65                  70                  75                  80  


Ala Lys Val Thr Ser Ala Met Gln Thr Met Leu Phe Thr Met Leu Arg 
                85                  90                  95      


Lys Leu Asp Asn Asp Ala Leu Asn Asn Ile Ile Asn Asn Ala Arg Asp 
            100                 105                 110         


Gly Cys Val Pro Leu Asn Ile Ile Pro Leu Thr Thr Ala Ala Lys Leu 
        115                 120                 125             


Met Val Val Ile Pro Asp Tyr Asn Thr Tyr Lys Asn Thr Cys Asp Gly 
    130                 135                 140                 


Thr Thr Phe Thr Tyr Ala Ser Ala Leu Trp Glu Ile Gln Gln Val Val 
145                 150                 155                 160 


Asp Ala Asp Ser Lys Ile Val Gln Leu Ser Glu Ile Ser Met Asp Asn 
                165                 170                 175     


Ser Pro Asn Leu Ala Trp Pro Leu Ile Val Thr Ala Leu Arg Ala Asn 
            180                 185                 190         


Ser Ala Val Lys Leu Gln 
        195             


<210>  11
<211>  912
<212>  PRT
<213>  artificial

<220>
<223>  Fusion protein including SARS-CoV-2 S1 and MERS S1-RBD

<400>  11

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr Val 
            340                 345                 350         


Glu Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp 
        355                 360                 365             


Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr 
    370                 375                 380                 


Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr 
385                 390                 395                 400 


Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro 
                405                 410                 415     


Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp 
            420                 425                 430         


Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys 
        435                 440                 445             


Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn 
    450                 455                 460                 


Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly 
465                 470                 475                 480 


Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu 
                485                 490                 495     


Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr 
            500                 505                 510         


Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val 
        515                 520                 525             


Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn 
    530                 535                 540                 


Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn 
545                 550                 555                 560 


Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr 
                565                 570                 575     


Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr 
            580                 585                 590         


Pro Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr 
        595                 600                 605             


Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val 
    610                 615                 620                 


Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr 
625                 630                 635                 640 


Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly 
                645                 650                 655     


Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala 
            660                 665                 670         


Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala 
        675                 680                 685             


Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
    690                 695                 700                 


Val Glu Cys Asp Phe Ser Pro Leu Leu Ser Gly Thr Pro Pro Gln Val 
705                 710                 715                 720 


Tyr Asn Phe Lys Arg Leu Val Phe Thr Asn Cys Asn Tyr Asn Leu Thr 
                725                 730                 735     


Lys Leu Leu Ser Leu Phe Ser Val Asn Asp Phe Thr Cys Ser Gln Ile 
            740                 745                 750         


Ser Pro Ala Ala Ile Ala Ser Asn Cys Tyr Ser Ser Leu Ile Leu Asp 
        755                 760                 765             


Tyr Phe Ser Tyr Pro Leu Ser Met Lys Ser Asp Leu Ser Val Ser Ser 
    770                 775                 780                 


Ala Gly Pro Ile Ser Gln Phe Asn Tyr Lys Gln Ser Phe Ser Asn Pro 
785                 790                 795                 800 


Thr Cys Leu Ile Leu Ala Thr Val Pro His Asn Leu Thr Thr Ile Thr 
                805                 810                 815     


Lys Pro Leu Lys Tyr Ser Tyr Ile Asn Lys Cys Ser Arg Leu Leu Ser 
            820                 825                 830         


Asp Asp Arg Thr Glu Val Pro Gln Leu Val Asn Ala Asn Gln Tyr Ser 
        835                 840                 845             


Pro Cys Val Ser Ile Val Pro Ser Thr Val Trp Glu Asp Gly Asp Tyr 
    850                 855                 860                 


Tyr Arg Lys Gln Leu Ser Pro Leu Glu Gly Gly Gly Trp Leu Val Ala 
865                 870                 875                 880 


Ser Gly Ser Thr Val Ala Met Thr Glu Gln Leu Gln Met Gly Phe Gly 
                885                 890                 895     


Ile Thr Val Gln Tyr Gly Thr Asp Thr Asn Ser Val Cys Pro Lys Leu 
            900                 905                 910         


<210>  12
<211>  2739
<212>  DNA
<213>  artificial

<220>
<223>  Nucleotide sequence encoding SEQ ID NO:11

<400>  12
atgttcgtgt ttttggtact gctgcctctg gtgtcctcac aatgcgtaaa tctcaccact       60

agaacccaac tgccacccgc ttatacgaac tccttcacac gaggtgttta ctaccccgat      120

aaggtcttta gatcatccgt actccactca acacaggact tgttcttgcc ttttttcagt      180

aatgtcacgt ggtttcacgc gatacatgtg tcaggtacaa acggcacaaa gcgattcgac      240

aaccctgtgc tcccctttaa cgatggcgtc tattttgcct ctactgagaa gtcaaatatt      300

atccgcgggt ggatctttgg gacaaccttg gattcaaaga ctcagtctct gcttatagta      360

aataacgcca ccaacgtcgt catcaaggtt tgtgagttcc agttttgtaa tgatccattt      420

ttgggtgtct actaccataa aaacaacaag tcatggatgg aatccgagtt cagagtttat      480

tcctcagcga acaattgtac ttttgaatat gtgagccagc cgtttctgat ggatctggag      540

ggaaagcagg gaaacttcaa gaacctgaga gagttcgttt tcaaaaacat cgatggctac      600

ttcaaaatct atagcaagca cacaccaatt aacctggtgc gcgatctgcc ccaaggcttc      660

tccgcactgg aacctctggt cgatctgcca attggaataa atatcacaag atttcagacc      720

ctccttgctc ttcacagaag ttatctgaca ccaggcgata gtagctcagg ctggacagca      780

ggggcagctg cgtattatgt gggctatctg caacccagaa cctttctcct caagtacaac      840

gaaaacggaa ctatcaccga cgccgtagat tgtgctcttg acccactctc cgaaactaag      900

tgtaccttga aatcctttac tgtggaaaag ggcatttacc aaacctcaaa ttttagagtt      960

caacccaccg aaagtattgt acgctttccg aacattacaa atttgtgccc cttcggcgaa     1020

gtgtttaacg ccactagatt cgctagtttt actgtagaag tgtatgcctg gaaccggaag     1080

cggattagta actgtgtcgc cgattacagc gtgctctata acagcgcttc cttcagcaca     1140

tttaagtgct atggggtcag tcccaccaaa cttaacgacc tgtgcttcac caacgtgtat     1200

gccgacagct ttgtgatacg aggcgatgag gtgagacaga ttgctccagg gcagacaggt     1260

aagatcgccg attacaacta taaactccca gacgatttta ccggttgcgt gattgcttgg     1320

aattccaata atttggacag caaggtcggg ggcaactata attatctgta cagactgttt     1380

cgcaagtcca acctgaagcc ctttgagaga gacatttcaa ccgagatcta tcaagctggt     1440

tcaacgcctt gtaatggcgt cgaaggattt aactgctatt tccccctcca gagttacggc     1500

ttccagccca ccaacggagt tggataccaa ccttatcgcg tagtcgtact ttcctttgaa     1560

ctgctgcatg cccctgctac tgtctgcgga cccaaaaaat caactaatct ggtaaagaac     1620

aaatgcgtga actttaattt caatggcctg actggtaccg gtgtcctcac cgaatccaac     1680

aagaaattcc tgccttttca gcaattcggt cgcgatatcg ctgacacaac tgacgcagtg     1740

agggacccac aaacactgga aatattggac ataacgccct gtagctttgg cggagtgtct     1800

gtcattaccc ctggcacaaa tacaagcaac caggtcgccg tcctgtacca agatgtcaat     1860

tgtacagagg tgcctgtcgc aatacatgcc gaccaactca cccccacctg gcgagtgtac     1920

agcaccggat caaatgtctt ccagacccgc gctgggtgcc tgattggagc agaacatgtc     1980

aataactcct atgaatgtga tatacccatt ggcgctggaa tttgcgcttc ttaccaaact     2040

cagaccaata gtcctcggcg ggcccgcggc ggaggtggtt caggcggcgg cggatctggc     2100

ggaggagggt cagtcgagtg tgatttctcc cccctgttgt caggtacgcc tcctcaggtg     2160

tataatttca agaggctggt gttcacaaat tgcaactaca accttacaaa gctcctgagc     2220

ctgttctccg taaatgattt cacctgttct cagatctccc cagcagctat tgcaagcaac     2280

tgttattcca gcctcattct ggattacttt agctatcccc tgagcatgaa gtctgatctg     2340

tccgtttcat ctgctggacc tataagtcaa ttcaactata aacagagctt tagtaacccc     2400

acatgtctga tcctcgccac cgtgcctcat aatctcacca ccatcacgaa accactgaag     2460

tatagttaca ttaacaagtg ttccaggctt ttgagtgatg atcggactga ggtaccacag     2520

cttgtcaatg ccaaccagta ttccccttgc gttagcatcg ttccctccac agtgtgggag     2580

gacggagact actatcgcaa acagctgtcc cccttggagg gcggagggtg gctcgtcgct     2640

tcagggtcta ccgttgctat gacagagcag ctgcaaatgg gcttcggtat caccgttcag     2700

tatggcacag acaccaattc cgtctgcccc aagctgtga                            2739


<210>  13
<211>  669
<212>  PRT
<213>  artificial

<220>
<223>  Fusion protein including S1-RBD, S2-HR2 and M

<400>  13

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Phe 
1               5                   10                  15      


Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln Pro 
            20                  25                  30          


Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe 
        35                  40                  45              


Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr Val Glu Val 
    50                  55                  60                  


Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser 
65                  70                  75                  80  


Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val 
                85                  90                  95      


Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp 
            100                 105                 110         


Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln 
        115                 120                 125             


Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr 
    130                 135                 140                 


Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly 
145                 150                 155                 160 


Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys 
                165                 170                 175     


Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr 
            180                 185                 190         


Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser 
        195                 200                 205             


Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val 
    210                 215                 220                 


Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly 
225                 230                 235                 240 


Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn 
                245                 250                 255     


Phe Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
            260                 265                 270         


Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro His Gly Val Val Phe 
        275                 280                 285             


Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala 
    290                 295                 300                 


Pro Ala Ile Cys His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val 
305                 310                 315                 320 


Phe Val Ser Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr 
                325                 330                 335     


Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys 
            340                 345                 350         


Asp Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln 
        355                 360                 365             


Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn 
    370                 375                 380                 


His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala 
385                 390                 395                 400 


Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala 
                405                 410                 415     


Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr 
            420                 425                 430         


Glu Gln Tyr Ile Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
        435                 440                 445             


Gly Gly Ser Met Ala Asp Ser Asn Gly Thr Ile Thr Val Glu Glu Leu 
    450                 455                 460                 


Lys Lys Leu Leu Glu Gln Trp Asn Leu Val Ile Gly Phe Leu Phe Leu 
465                 470                 475                 480 


Thr Trp Ile Cys Leu Leu Gln Phe Ala Tyr Ala Asn Arg Asn Arg Phe 
                485                 490                 495     


Leu Tyr Ile Ile Lys Leu Ile Phe Leu Trp Leu Leu Trp Pro Val Thr 
            500                 505                 510         


Leu Ala Cys Phe Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Ile Thr 
        515                 520                 525             


Gly Gly Ile Ala Ile Ala Met Ala Cys Leu Val Gly Leu Met Trp Leu 
    530                 535                 540                 


Ser Tyr Phe Ile Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met 
545                 550                 555                 560 


Trp Ser Phe Asn Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu His 
                565                 570                 575     


Gly Thr Ile Leu Thr Arg Pro Leu Leu Glu Ser Glu Leu Val Ile Gly 
            580                 585                 590         


Ala Val Ile Leu Arg Gly His Leu Arg Ile Ala Gly His His Leu Gly 
        595                 600                 605             


Arg Cys Asp Ile Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser 
    610                 615                 620                 


Arg Thr Leu Ser Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Ala Gly 
625                 630                 635                 640 


Asp Ser Gly Phe Ala Ala Tyr Ser Arg Tyr Arg Ile Gly Asn Tyr Lys 
                645                 650                 655     


Leu Asn Thr Asp His Ser Ser Ser Ser Asp Asn Ile Ala 
            660                 665                 


<210>  14
<211>  2010
<212>  DNA
<213>  artificial

<220>
<223>  Nucleotide sequence encoding SEQ ID NO:13

<400>  14
atgtttgtgt tcctggtgct tcttccgctg gtctcttccc aatgtttcac tgtggagaaa       60

ggcatctatc agacttcaaa tttccgggtt cagccaaccg agagtattgt ccgattccca      120

aatataacaa atctgtgccc tttcggagaa gtatttaacg ccaccagatt cgcatccttt      180

accgttgagg tctacgcatg gaatagaaaa aggatctcaa actgcgtagc tgattacagc      240

gttctgtaca actccgcatc attctcaacc ttcaagtgct atggcgtcag tcccacaaag      300

cttaacgatc tctgcttcac caatgtgtac gccgactcat tcgtgattcg aggagatgag      360

gtacggcaaa ttgccccggg tcaaactggt aagattgcag actataacta taagctgcct      420

gatgacttta cagggtgtgt catagcctgg aattcaaaca atctggactc taaagtgggt      480

ggcaattata attatttgta tagactgttc aggaaatcta acctcaagcc ctttgagcgg      540

gacatcagca ccgaaatcta tcaagccgga tctaccccct gcaacggagt agagggcttc      600

aattgttatt tccctttgca gagctacggg ttccagccaa ccaatggagt gggatatcaa      660

ccatatcgcg tggtggtgct tagctttgag ttgttgcacg ctcctgcaac tgtatgtggc      720

ccgaaaaaat caacaaatct ggtaaaaaac aaatgtgtta actttaactt cggtggaggc      780

ggcagcggtg gcggcggaag tggcggcgga ggtagctacc atctgatgtc attcccacag      840

agcgctccgc acggcgtggt cttcttgcac gtgacgtacg tacccgcaca ggaaaagaac      900

tttacgacag ctcctgccat ttgtcacgac ggtaaggccc atttccccag agagggagtc      960

tttgtatcaa atggaaccca ctggtttgtt acacaaagaa atttttatga gccacagata     1020

atcactaccg acaacacttt cgtgagtggc aattgtgacg ttgtgatcgg gatagtaaac     1080

aataccgtct atgacccact gcagcccgaa ctggatagtt tcaaggaaga gcttgataaa     1140

tacttcaaaa atcatacatc acctgatgtt gatctcggag acatcagtgg catcaacgca     1200

tcagtggtaa atattcaaaa agagatcgac aggctgaatg aagttgcaaa aaatctcaat     1260

gaatctctta tcgatctgca ggagctggga aagtatgaac agtacatagg tgggggggga     1320

agtggaggcg gcggctccgg cggaggcggt tctatggctg acagcaatgg tacgatcact     1380

gtggaagagc ttaaaaagct cctggagcaa tggaacttgg tcataggttt cctctttctg     1440

acatggattt gtttgctgca gtttgcctat gccaatcgca atcgcttctt gtatattatc     1500

aagctcattt ttttgtggct tctgtggcca gtcactcttg catgcttcgt gctcgccgcc     1560

gtatatcgaa ttaactggat tactggcgga attgccatag cgatggcgtg tctcgtcggc     1620

cttatgtggc tgtcctactt cattgcaagc tttagactgt ttgccagaac cagatccatg     1680

tggagtttta acccagagac aaacattctg cttaatgttc cccttcacgg aactatcctg     1740

acccggccac tgttggaatc cgaactcgtg attggcgcag tgatcctgcg ggggcacctg     1800

agaatcgcag gtcatcatct cggaagatgc gacatcaagg atttgccgaa ggaaattacc     1860

gtggccacct caaggactct gtcttactac aagctgggag cgtcacagag ggtcgccggc     1920

gactccggct ttgcagcgta ttctcgctac aggattggca attacaagct taacaccgac     1980

cacagttcat cctcagacaa cattgcgtga                                      2010


<210>  15
<211>  870
<212>  PRT
<213>  artificial

<220>
<223>  Fusion protein including S1-RBD, S2-HR2, and N

<400>  15

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Phe 
1               5                   10                  15      


Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln Pro 
            20                  25                  30          


Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe 
        35                  40                  45              


Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr Val Glu Val 
    50                  55                  60                  


Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser 
65                  70                  75                  80  


Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val 
                85                  90                  95      


Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp 
            100                 105                 110         


Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln 
        115                 120                 125             


Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr 
    130                 135                 140                 


Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly 
145                 150                 155                 160 


Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys 
                165                 170                 175     


Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr 
            180                 185                 190         


Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser 
        195                 200                 205             


Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val 
    210                 215                 220                 


Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly 
225                 230                 235                 240 


Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn 
                245                 250                 255     


Phe Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
            260                 265                 270         


Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro His Gly Val Val Phe 
        275                 280                 285             


Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala 
    290                 295                 300                 


Pro Ala Ile Cys His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val 
305                 310                 315                 320 


Phe Val Ser Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr 
                325                 330                 335     


Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys 
            340                 345                 350         


Asp Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln 
        355                 360                 365             


Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn 
    370                 375                 380                 


His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala 
385                 390                 395                 400 


Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala 
                405                 410                 415     


Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr 
            420                 425                 430         


Glu Gln Tyr Ile Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
        435                 440                 445             


Gly Gly Ser Met Ser Asp Asn Gly Pro Gln Asn Gln Arg Asn Ala Pro 
    450                 455                 460                 


Arg Ile Thr Phe Gly Gly Pro Ser Asp Ser Thr Gly Ser Asn Gln Asn 
465                 470                 475                 480 


Gly Glu Arg Ser Gly Ala Arg Ser Lys Gln Arg Arg Pro Gln Gly Leu 
                485                 490                 495     


Pro Asn Asn Thr Ala Ser Trp Phe Thr Ala Leu Thr Gln His Gly Lys 
            500                 505                 510         


Glu Asp Leu Lys Phe Pro Arg Gly Gln Gly Val Pro Ile Asn Thr Asn 
        515                 520                 525             


Ser Ser Pro Asp Asp Gln Ile Gly Tyr Tyr Arg Arg Ala Thr Arg Arg 
    530                 535                 540                 


Ile Arg Gly Gly Asp Gly Lys Met Lys Asp Leu Ser Pro Arg Trp Tyr 
545                 550                 555                 560 


Phe Tyr Tyr Leu Gly Thr Gly Pro Glu Ala Gly Leu Pro Tyr Gly Ala 
                565                 570                 575     


Asn Lys Asp Gly Ile Ile Trp Val Ala Thr Glu Gly Ala Leu Asn Thr 
            580                 585                 590         


Pro Lys Asp His Ile Gly Thr Arg Asn Pro Ala Asn Asn Ala Ala Ile 
        595                 600                 605             


Val Leu Gln Leu Pro Gln Gly Thr Thr Leu Pro Lys Gly Phe Tyr Ala 
    610                 615                 620                 


Glu Gly Ser Arg Gly Gly Ser Gln Ala Ser Ser Arg Ser Ser Ser Arg 
625                 630                 635                 640 


Ser Arg Asn Ser Ser Arg Asn Ser Thr Pro Gly Ser Ser Arg Gly Thr 
                645                 650                 655     


Ser Pro Ala Arg Met Ala Gly Asn Gly Gly Asp Ala Ala Leu Ala Leu 
            660                 665                 670         


Leu Leu Leu Asp Arg Leu Asn Gln Leu Glu Ser Lys Met Ser Gly Lys 
        675                 680                 685             


Gly Gln Gln Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala Glu 
    690                 695                 700                 


Ala Ser Lys Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Ala Tyr Asn 
705                 710                 715                 720 


Val Thr Gln Ala Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly Asn 
                725                 730                 735     


Phe Gly Asp Gln Glu Leu Ile Arg Gln Gly Thr Asp Tyr Lys His Trp 
            740                 745                 750         


Pro Gln Ile Ala Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly Met 
        755                 760                 765             


Ser Arg Ile Gly Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr Tyr 
    770                 775                 780                 


Thr Gly Ala Ile Lys Leu Asp Asp Lys Asp Pro Asn Phe Lys Asp Gln 
785                 790                 795                 800 


Val Ile Leu Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro Pro 
                805                 810                 815     


Thr Glu Pro Lys Lys Asp Lys Lys Lys Lys Ala Asp Glu Thr Gln Ala 
            820                 825                 830         


Leu Pro Gln Arg Gln Lys Lys Gln Gln Thr Val Thr Leu Leu Pro Ala 
        835                 840                 845             


Ala Asp Leu Asp Asp Phe Ser Lys Gln Leu Gln Gln Ser Met Ser Ser 
    850                 855                 860                 


Ala Asp Ser Thr Gln Ala 
865                 870 


<210>  16
<211>  2613
<212>  DNA
<213>  artificial

<220>
<223>  Nucleotide sequence encoding SEQ ID NO:15

<400>  16
atgttcgtgt ttctcgtgct tctccctctg gtctcatctc agtgctttac cgtcgaaaag       60

ggaatctatc aaacgagtaa tttccgggtg cagccaaccg aaagtatcgt aagatttccc      120

aacatcacta acctttgccc attcggcgaa gtttttaatg ctactagatt cgccagtttc      180

acagtcgaag tgtatgcctg gaatagaaag agaatctcca actgcgttgc tgactatagc      240

gtgctgtaca actccgcgtc attctctact ttcaagtgtt acggcgtatc tcctaccaag      300

ctgaacgatc tctgctttac caatgtgtac gccgattcat tcgtgatacg cggtgatgag      360

gtacgccaaa ttgccccggg ccagacagga aagatcgctg actataacta taaactcccg      420

gacgacttta ctggatgcgt tatcgcctgg aatagtaata atctggattc caaagtgggg      480

ggcaactata attatttgta tcgcctgttc aggaaatcca atctgaaacc tttcgagcgc      540

gacatatcaa ctgagatata tcaggccggt tcaaccccat gtaatggcgt cgaaggattc      600

aactgttatt ttcccctgca aagctatggt tttcagccga caaacggcgt aggataccaa      660

ccttacagag tggtggtgtt gagtttcgag ctcctgcacg cccctgctac agtctgcggt      720

cccaagaaga gtacaaacct ggttaagaat aaatgcgtca attttaactt tgggggaggt      780

gggtctgggg gtggcggctc aggggggggc ggttcctacc atctgatgag ctttccccag      840

tccgcaccac atggcgtggt attcctccac gttacctacg tacccgccca agagaagaat      900

tttactaccg ccccagcaat atgtcacgac gggaaagcac atttcccaag agagggtgtt      960

tttgtgtcca atggtacaca ctggtttgtt acgcagcgca atttctacga accacaaatc     1020

ataaccacag ataatacatt cgtgtccgga aattgtgacg tcgtaatcgg tattgtcaac     1080

aacactgtgt acgatcccct gcagccagag ctggatagct ttaaagaaga gttggacaaa     1140

tattttaaaa atcacacatc acccgatgtc gacctgggag acattagtgg gatcaacgcc     1200

tctgtcgtaa acatccaaaa agagattgac cggctgaatg aggttgctaa gaacctgaat     1260

gagagcctga ttgacttgca agaactgggc aaatacgagc agtatatcgg cggagggggt     1320

tcaggtgggg gtggctccgg gggaggcgga tcaatgagcg acaatggacc ccagaaccaa     1380

agaaacgctc cccgcatcac atttggggga ccctctgact caaccggaag caaccagaat     1440

ggtgaacggt ccggcgccag gtcaaagcag aggagacccc aggggcttcc gaataatact     1500

gcctcctggt tcactgccct cacccagcac ggaaaggagg acctcaagtt tcctagagga     1560

cagggagtgc caatcaacac aaattcaagc ccagacgacc aaatcggcta ttatagacgc     1620

gccactcgac gaattcgcgg aggagacggt aaaatgaagg atctttctcc ccgctggtat     1680

ttttactatc tcggaacagg accagaggca ggactccctt atggagctaa caaggacggt     1740

atcatttggg tggccacaga gggggccctt aacacaccca aggaccacat tggtacaagg     1800

aatcccgcta acaacgcagc gattgttctc caactgcctc agggaaccac cctccccaag     1860

ggtttctacg ctgaggggag ccgcgggggc agtcaggcga gctcacgctc atcttccaga     1920

agtcgaaata gctcccggaa tagtacacct ggttcaagtc gcggaacttc ccctgcacgg     1980

atggccggca atggcggaga cgctgccctt gcactgctgc tgcttgacag gcttaaccag     2040

ctggaatcca aaatgtccgg taagggtcag caacagcagg gacagaccgt gacgaaaaaa     2100

agtgcagccg aggccagtaa aaaaccaaga caaaagcgga ccgcgacaaa ggcctacaat     2160

gtgacccagg ctttcggccg gcggggccca gagcagacac aaggtaattt cggcgaccag     2220

gagctgatca gacaaggaac tgactacaag cattggccgc agatcgcgca attcgcacct     2280

tctgcatccg ccttcttcgg tatgtcacgg attggaatgg aagtgacccc tagcggaacc     2340

tggctgacgt acacaggagc cataaaactg gatgacaagg acccaaactt caaagatcag     2400

gtaatcctct tgaataagca catcgacgcg tacaagacct ttcctccaac cgaacccaaa     2460

aaagacaaaa agaagaaagc tgacgagacc caagcgctgc cccagagaca gaagaagcaa     2520

cagaccgtga ctctcttgcc cgccgcagat ctcgatgact tctccaaaca gctgcaacag     2580

tcaatgtcca gtgctgacag cacccaagcc taa                                  2613


<210>  17
<211>  1103
<212>  PRT
<213>  artificial

<220>
<223>  Fusion protein including S1-RBD, S2-HR2, M, and N

<400>  17

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Phe 
1               5                   10                  15      


Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln Pro 
            20                  25                  30          


Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe 
        35                  40                  45              


Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr Val Glu Val 
    50                  55                  60                  


Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser 
65                  70                  75                  80  


Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val 
                85                  90                  95      


Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp 
            100                 105                 110         


Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln 
        115                 120                 125             


Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr 
    130                 135                 140                 


Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly 
145                 150                 155                 160 


Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys 
                165                 170                 175     


Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr 
            180                 185                 190         


Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser 
        195                 200                 205             


Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val 
    210                 215                 220                 


Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly 
225                 230                 235                 240 


Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn 
                245                 250                 255     


Phe Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
            260                 265                 270         


Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro His Gly Val Val Phe 
        275                 280                 285             


Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala 
    290                 295                 300                 


Pro Ala Ile Cys His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val 
305                 310                 315                 320 


Phe Val Ser Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr 
                325                 330                 335     


Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys 
            340                 345                 350         


Asp Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln 
        355                 360                 365             


Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn 
    370                 375                 380                 


His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala 
385                 390                 395                 400 


Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala 
                405                 410                 415     


Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr 
            420                 425                 430         


Glu Gln Tyr Ile Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
        435                 440                 445             


Gly Gly Ser Met Ala Asp Ser Asn Gly Thr Ile Thr Val Glu Glu Leu 
    450                 455                 460                 


Lys Lys Leu Leu Glu Gln Trp Asn Leu Val Ile Gly Phe Leu Phe Leu 
465                 470                 475                 480 


Thr Trp Ile Cys Leu Leu Gln Phe Ala Tyr Ala Asn Arg Asn Arg Phe 
                485                 490                 495     


Leu Tyr Ile Ile Lys Leu Ile Phe Leu Trp Leu Leu Trp Pro Val Thr 
            500                 505                 510         


Leu Ala Cys Phe Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Ile Thr 
        515                 520                 525             


Gly Gly Ile Ala Ile Ala Met Ala Cys Leu Val Gly Leu Met Trp Leu 
    530                 535                 540                 


Ser Tyr Phe Ile Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met 
545                 550                 555                 560 


Trp Ser Phe Asn Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu His 
                565                 570                 575     


Gly Thr Ile Leu Thr Arg Pro Leu Leu Glu Ser Glu Leu Val Ile Gly 
            580                 585                 590         


Ala Val Ile Leu Arg Gly His Leu Arg Ile Ala Gly His His Leu Gly 
        595                 600                 605             


Arg Cys Asp Ile Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser 
    610                 615                 620                 


Arg Thr Leu Ser Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Ala Gly 
625                 630                 635                 640 


Asp Ser Gly Phe Ala Ala Tyr Ser Arg Tyr Arg Ile Gly Asn Tyr Lys 
                645                 650                 655     


Leu Asn Thr Asp His Ser Ser Ser Ser Asp Asn Ile Ala Gly Gly Gly 
            660                 665                 670         


Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Met Ser Asp Asn 
        675                 680                 685             


Gly Pro Gln Asn Gln Arg Asn Ala Pro Arg Ile Thr Phe Gly Gly Pro 
    690                 695                 700                 


Ser Asp Ser Thr Gly Ser Asn Gln Asn Gly Glu Arg Ser Gly Ala Arg 
705                 710                 715                 720 


Ser Lys Gln Arg Arg Pro Gln Gly Leu Pro Asn Asn Thr Ala Ser Trp 
                725                 730                 735     


Phe Thr Ala Leu Thr Gln His Gly Lys Glu Asp Leu Lys Phe Pro Arg 
            740                 745                 750         


Gly Gln Gly Val Pro Ile Asn Thr Asn Ser Ser Pro Asp Asp Gln Ile 
        755                 760                 765             


Gly Tyr Tyr Arg Arg Ala Thr Arg Arg Ile Arg Gly Gly Asp Gly Lys 
    770                 775                 780                 


Met Lys Asp Leu Ser Pro Arg Trp Tyr Phe Tyr Tyr Leu Gly Thr Gly 
785                 790                 795                 800 


Pro Glu Ala Gly Leu Pro Tyr Gly Ala Asn Lys Asp Gly Ile Ile Trp 
                805                 810                 815     


Val Ala Thr Glu Gly Ala Leu Asn Thr Pro Lys Asp His Ile Gly Thr 
            820                 825                 830         


Arg Asn Pro Ala Asn Asn Ala Ala Ile Val Leu Gln Leu Pro Gln Gly 
        835                 840                 845             


Thr Thr Leu Pro Lys Gly Phe Tyr Ala Glu Gly Ser Arg Gly Gly Ser 
    850                 855                 860                 


Gln Ala Ser Ser Arg Ser Ser Ser Arg Ser Arg Asn Ser Ser Arg Asn 
865                 870                 875                 880 


Ser Thr Pro Gly Ser Ser Arg Gly Thr Ser Pro Ala Arg Met Ala Gly 
                885                 890                 895     


Asn Gly Gly Asp Ala Ala Leu Ala Leu Leu Leu Leu Asp Arg Leu Asn 
            900                 905                 910         


Gln Leu Glu Ser Lys Met Ser Gly Lys Gly Gln Gln Gln Gln Gly Gln 
        915                 920                 925             


Thr Val Thr Lys Lys Ser Ala Ala Glu Ala Ser Lys Lys Pro Arg Gln 
    930                 935                 940                 


Lys Arg Thr Ala Thr Lys Ala Tyr Asn Val Thr Gln Ala Phe Gly Arg 
945                 950                 955                 960 


Arg Gly Pro Glu Gln Thr Gln Gly Asn Phe Gly Asp Gln Glu Leu Ile 
                965                 970                 975     


Arg Gln Gly Thr Asp Tyr Lys His Trp Pro Gln Ile Ala Gln Phe Ala 
            980                 985                 990         


Pro Ser Ala Ser Ala Phe Phe Gly  Met Ser Arg Ile Gly  Met Glu Val 
        995                 1000                 1005             


Thr Pro  Ser Gly Thr Trp Leu  Thr Tyr Thr Gly Ala  Ile Lys Leu 
    1010                 1015                 1020             


Asp Asp  Lys Asp Pro Asn Phe  Lys Asp Gln Val Ile  Leu Leu Asn 
    1025                 1030                 1035             


Lys His  Ile Asp Ala Tyr Lys  Thr Phe Pro Pro Thr  Glu Pro Lys 
    1040                 1045                 1050             


Lys Asp  Lys Lys Lys Lys Ala  Asp Glu Thr Gln Ala  Leu Pro Gln 
    1055                 1060                 1065             


Arg Gln  Lys Lys Gln Gln Thr  Val Thr Leu Leu Pro  Ala Ala Asp 
    1070                 1075                 1080             


Leu Asp  Asp Phe Ser Lys Gln  Leu Gln Gln Ser Met  Ser Ser Ala 
    1085                 1090                 1095             


Asp Ser  Thr Gln Ala 
    1100             


<210>  18
<211>  3312
<212>  DNA
<213>  artificial

<220>
<223>  Nucleotide sequence encoding SEQ ID NO:17

<400>  18
atgttcgtgt tcctggtgct gctgccattg gtgtcctccc agtgctttac ggtagaaaag       60

ggaatttacc agacgtctaa cttccgagtc cagcccaccg agtctatcgt gcggttcccc      120

aacataacca atctgtgccc attcggtgag gtgttcaatg ccacacgctt cgctagtttt      180

actgtggagg tctatgcatg gaatcgcaaa cggattagta actgcgtagc tgattatagc      240

gtgctctaca acagtgcatc tttctctacg ttcaaatgtt atggcgtgtc cccgaccaaa      300

ctgaacgatc tgtgcttcac caatgtttac gccgactcat tcgtcatcag aggggatgaa      360

gtgaggcaaa tcgcccctgg tcaaaccggt aagatcgctg actacaatta taaacttcct      420

gacgacttca ccggttgtgt gatagcatgg aattccaaca atctggactc caaagtaggt      480

ggaaactata actatctcta tcgactgttt agaaaaagta atctgaaacc cttcgagcgc      540

gacatatcca ccgagattta tcaggccggc agcactccct gcaacggcgt agagggattt      600

aattgctatt tcccattgca aagctacggg ttccagccca caaatggcgt gggttaccaa      660

ccctacaggg tcgtggtcct ttcctttgaa cttttgcacg cacccgctac agtttgtggg      720

cccaaaaaaa gcacaaatct cgtgaaaaat aaatgcgtca acttcaattt cgggggtggc      780

ggctccggcg gcggtggctc cgggggaggg ggatcttacc atttgatgag ctttcctcag      840

tctgcacctc atggggttgt ctttttgcat gtgacctacg tccctgctca ggaaaagaat      900

tttaccaccg cccccgctat ttgccacgat ggtaaagcgc attttccgag ggaaggggtg      960

ttcgtctcca acggaaccca ctggtttgtc actcagagaa atttttacga gccccagatc     1020

atcacaacag ataacacctt tgtaagcggt aactgtgacg tggtcatagg tatagttaat     1080

aacacggtct atgatcctct gcaacccgag ctggacagct tcaaggaaga actcgataag     1140

tattttaaaa accacacttc acctgatgtc gatcttggcg atatctccgg gataaatgcc     1200

agcgtagtca acatccagaa agagatagat aggctgaacg aggtcgctaa aaatctcaac     1260

gaatccctga tagatctgca ggagttgggt aagtatgagc agtatatagg ggggggcggc     1320

agtggaggcg gagggtcagg aggcggcggg tctatggccg actccaacgg aacaataacc     1380

gtcgaggagc tgaaaaaact cctggaacag tggaatcttg tgattggatt tctcttcctc     1440

acatggattt gcctgttgca gttcgcctat gccaatagga atagatttct ttacatcatc     1500

aagctgatct tcctctggct gctttggcct gtgaccttgg catgttttgt gctggcggct     1560

gtgtacagaa ttaattggat taccggaggt atagccattg ccatggcttg tctggtaggg     1620

ctgatgtggt tgagctactt tatagcctcc ttccgcctgt ttgcgcgaac aagaagcatg     1680

tggtctttca atcctgaaac gaacatcttg cttaatgtgc ccctgcacgg tacaatcctt     1740

actagaccac tgctggagtc tgaactcgtg atcggagccg tgattcttcg cggacatctg     1800

agaatcgctg gccaccacct gggccgctgc gacattaaag accttcctaa agaaattacc     1860

gtcgccacat cccgcacact gtcatactac aaacttggcg ccagccaacg cgtggctgga     1920

gactccgggt ttgccgccta tagccgatac aggatcggca actataagct gaacactgac     1980

cattctagta gcagcgacaa cattgctgga gggggaggct ccggtggggg tgggtctggg     2040

ggagggggca gcatgtcaga taacggccct caaaatcagc ggaacgcccc gcgcataacc     2100

ttcggaggcc caagtgactc aacaggaagt aaccagaacg gagagaggag tggcgcgcgc     2160

agtaaacaga gacgacctca gggcctgcca aacaacacag cttcttggtt caccgccctc     2220

actcagcacg ggaaggaaga tctcaagttc ccaagagggc agggagttcc aattaacacc     2280

aacagcagcc ccgatgatca aatcggctac tatcgaaggg ctacaagaag aattcggggc     2340

ggtgatggaa agatgaagga tctgtcccca agatggtact tttactacct tggaacaggc     2400

cccgaagcag gtctgccgta tggtgcaaac aaggatggaa ttatttgggt cgccacggaa     2460

ggagccctga atacacctaa ggatcatatc ggcacccgga accctgctaa caatgcagca     2520

attgtgctgc agctccccca gggcactacg ctgccaaagg gattctacgc agaaggcagt     2580

aggggtggat ctcaagcttc ctcaaggtct tctagccggt caagaaactc aagtagaaac     2640

agtaccccag gatctagccg cggaacatcc ccagcacgca tggccggaaa tgggggcgat     2700

gcggccctcg ctcttcttct tcttgatcgc ctcaaccagt tggagtccaa aatgagcggg     2760

aagggccaac agcagcaggg acagaccgta accaagaaaa gcgcagccga agcgtctaaa     2820

aaacccagac agaagcgaac cgcgacaaaa gcatataacg ttacgcaagc cttcggaagg     2880

agaggccctg aacaaacaca gggaaacttc ggagatcaag aactgattag acaagggact     2940

gactacaaac attggccaca gatcgcacag tttgccccct ccgcgtctgc cttttttggg     3000

atgtctcgga taggcatgga agtcactccc tccgggacct ggttgacata taccggagca     3060

ataaaactgg atgataaaga ccctaatttt aaggaccaag ttatactgtt gaacaagcac     3120

attgatgctt acaagacatt cccccctaca gaaccaaaga aagacaaaaa gaagaaagcc     3180

gatgaaacac aggctctccc ccaacgacaa aaaaaacaac agacggtgac tttgctcccc     3240

gcggccgatc ttgatgactt ctccaaacag ctgcaacaga gcatgtctag cgctgatagt     3300

acccaggctt ag                                                         3312


<210>  19
<211>  2217
<212>  PRT
<213>  artificial

<220>
<223>  Fusion protein including S1-RBD and Nsp3

<400>  19

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Phe 
1               5                   10                  15      


Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln Pro 
            20                  25                  30          


Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe 
        35                  40                  45              


Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr Val Glu Val 
    50                  55                  60                  


Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser 
65                  70                  75                  80  


Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val 
                85                  90                  95      


Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp 
            100                 105                 110         


Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln 
        115                 120                 125             


Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr 
    130                 135                 140                 


Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly 
145                 150                 155                 160 


Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys 
                165                 170                 175     


Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr 
            180                 185                 190         


Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser 
        195                 200                 205             


Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val 
    210                 215                 220                 


Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly 
225                 230                 235                 240 


Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn 
                245                 250                 255     


Phe Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
            260                 265                 270         


Ala Pro Thr Lys Val Thr Phe Gly Asp Asp Thr Val Ile Glu Val Gln 
        275                 280                 285             


Gly Tyr Lys Ser Val Asn Ile Thr Phe Glu Leu Asp Glu Arg Ile Asp 
    290                 295                 300                 


Lys Val Leu Asn Glu Lys Cys Ser Ala Tyr Thr Val Glu Leu Gly Thr 
305                 310                 315                 320 


Glu Val Asn Glu Phe Ala Cys Val Val Ala Asp Ala Val Ile Lys Thr 
                325                 330                 335     


Leu Gln Pro Val Ser Glu Leu Leu Thr Pro Leu Gly Ile Asp Leu Asp 
            340                 345                 350         


Glu Trp Ser Met Ala Thr Tyr Tyr Leu Phe Asp Glu Ser Gly Glu Phe 
        355                 360                 365             


Lys Leu Ala Ser His Met Tyr Cys Ser Phe Tyr Pro Pro Asp Glu Asp 
    370                 375                 380                 


Glu Glu Glu Gly Asp Cys Glu Glu Glu Glu Phe Glu Pro Ser Thr Gln 
385                 390                 395                 400 


Tyr Glu Tyr Gly Thr Glu Asp Asp Tyr Gln Gly Lys Pro Leu Glu Phe 
                405                 410                 415     


Gly Ala Thr Ser Ala Ala Leu Gln Pro Glu Glu Glu Gln Glu Glu Asp 
            420                 425                 430         


Trp Leu Asp Asp Asp Ser Gln Gln Thr Val Gly Gln Gln Asp Gly Ser 
        435                 440                 445             


Glu Asp Asn Gln Thr Thr Thr Ile Gln Thr Ile Val Glu Val Gln Pro 
    450                 455                 460                 


Gln Leu Glu Met Glu Leu Thr Pro Val Val Gln Thr Ile Glu Val Asn 
465                 470                 475                 480 


Ser Phe Ser Gly Tyr Leu Lys Leu Thr Asp Asn Val Tyr Ile Lys Asn 
                485                 490                 495     


Ala Asp Ile Val Glu Glu Ala Lys Lys Val Lys Pro Thr Val Val Val 
            500                 505                 510         


Asn Ala Ala Asn Val Tyr Leu Lys His Gly Gly Gly Val Ala Gly Ala 
        515                 520                 525             


Leu Asn Lys Ala Thr Asn Asn Ala Met Gln Val Glu Ser Asp Asp Tyr 
    530                 535                 540                 


Ile Ala Thr Asn Gly Pro Leu Lys Val Gly Gly Ser Cys Val Leu Ser 
545                 550                 555                 560 


Gly His Asn Leu Ala Lys His Cys Leu His Val Val Gly Pro Asn Val 
                565                 570                 575     


Asn Lys Gly Glu Asp Ile Gln Leu Leu Lys Ser Ala Tyr Glu Asn Phe 
            580                 585                 590         


Asn Gln His Glu Val Leu Leu Ala Pro Leu Leu Ser Ala Gly Ile Phe 
        595                 600                 605             


Gly Ala Asp Pro Ile His Ser Leu Arg Val Cys Val Asp Thr Val Arg 
    610                 615                 620                 


Thr Asn Val Tyr Leu Ala Val Phe Asp Lys Asn Leu Tyr Asp Lys Leu 
625                 630                 635                 640 


Val Ser Ser Phe Leu Glu Met Lys Ser Glu Lys Gln Val Glu Gln Lys 
                645                 650                 655     


Ile Ala Glu Ile Pro Lys Glu Glu Val Lys Pro Phe Ile Thr Glu Ser 
            660                 665                 670         


Lys Pro Ser Val Glu Gln Arg Lys Gln Asp Asp Lys Lys Ile Lys Ala 
        675                 680                 685             


Cys Val Glu Glu Val Thr Thr Thr Leu Glu Glu Thr Lys Phe Leu Thr 
    690                 695                 700                 


Glu Asn Leu Leu Leu Tyr Ile Asp Ile Asn Gly Asn Leu His Pro Asp 
705                 710                 715                 720 


Ser Ala Thr Leu Val Ser Asp Ile Asp Ile Thr Phe Leu Lys Lys Asp 
                725                 730                 735     


Ala Pro Tyr Ile Val Gly Asp Val Val Gln Glu Gly Val Leu Thr Ala 
            740                 745                 750         


Val Val Ile Pro Thr Lys Lys Ala Gly Gly Thr Thr Glu Met Leu Ala 
        755                 760                 765             


Lys Ala Leu Arg Lys Val Pro Thr Asp Asn Tyr Ile Thr Thr Tyr Pro 
    770                 775                 780                 


Gly Gln Gly Leu Asn Gly Tyr Thr Val Glu Glu Ala Lys Thr Val Leu 
785                 790                 795                 800 


Lys Lys Cys Lys Ser Ala Phe Tyr Ile Leu Pro Ser Ile Ile Ser Asn 
                805                 810                 815     


Glu Lys Gln Glu Ile Leu Gly Thr Val Ser Trp Asn Leu Arg Glu Met 
            820                 825                 830         


Leu Ala His Ala Glu Glu Thr Arg Lys Leu Met Pro Val Cys Val Glu 
        835                 840                 845             


Thr Lys Ala Ile Val Ser Thr Ile Gln Arg Lys Tyr Lys Gly Ile Lys 
    850                 855                 860                 


Ile Gln Glu Gly Val Val Asp Tyr Gly Ala Arg Phe Tyr Phe Tyr Thr 
865                 870                 875                 880 


Ser Lys Thr Thr Val Ala Ser Leu Ile Asn Thr Leu Asn Asp Leu Asn 
                885                 890                 895     


Glu Thr Leu Val Thr Met Pro Leu Gly Tyr Val Thr His Gly Leu Asn 
            900                 905                 910         


Leu Glu Glu Ala Ala Arg Tyr Met Arg Ser Leu Lys Val Pro Ala Thr 
        915                 920                 925             


Val Ser Val Ser Ser Pro Asp Ala Val Thr Ala Tyr Asn Gly Tyr Leu 
    930                 935                 940                 


Thr Ser Ser Ser Lys Thr Pro Glu Glu His Phe Ile Glu Thr Ile Ser 
945                 950                 955                 960 


Leu Ala Gly Ser Tyr Lys Asp Trp Ser Tyr Ser Gly Gln Ser Thr Gln 
                965                 970                 975     


Leu Gly Ile Glu Phe Leu Lys Arg Gly Asp Lys Ser Val Tyr Tyr Thr 
            980                 985                 990         


Ser Asn Pro Thr Thr Phe His Leu  Asp Gly Glu Val Ile  Thr Phe Asp 
        995                 1000                 1005             


Asn Leu  Lys Thr Leu Leu Ser  Leu Arg Glu Val Arg  Thr Ile Lys 
    1010                 1015                 1020             


Val Phe  Thr Thr Val Asp Asn  Ile Asn Leu His Thr  Gln Val Val 
    1025                 1030                 1035             


Asp Met  Ser Met Thr Tyr Gly  Gln Gln Phe Gly Pro  Thr Tyr Leu 
    1040                 1045                 1050             


Asp Gly  Ala Asp Val Thr Lys  Ile Lys Pro His Asn  Ser His Glu 
    1055                 1060                 1065             


Gly Lys  Thr Phe Tyr Val Leu  Pro Asn Asp Asp Thr  Leu Arg Val 
    1070                 1075                 1080             


Glu Ala  Phe Glu Tyr Tyr His  Thr Thr Asp Pro Ser  Phe Leu Gly 
    1085                 1090                 1095             


Arg Tyr  Met Ser Ala Leu Asn  His Thr Lys Lys Trp  Lys Tyr Pro 
    1100                 1105                 1110             


Gln Val  Asn Gly Leu Thr Ser  Ile Lys Trp Ala Asp  Asn Asn Cys 
    1115                 1120                 1125             


Tyr Leu  Ala Thr Ala Leu Leu  Thr Leu Gln Gln Ile  Glu Leu Lys 
    1130                 1135                 1140             


Phe Asn  Pro Pro Ala Leu Gln  Asp Ala Tyr Tyr Arg  Ala Arg Ala 
    1145                 1150                 1155             


Gly Glu  Ala Ala Asn Phe Cys  Ala Leu Ile Leu Ala  Tyr Cys Asn 
    1160                 1165                 1170             


Lys Thr  Val Gly Glu Leu Gly  Asp Val Arg Glu Thr  Met Ser Tyr 
    1175                 1180                 1185             


Leu Phe  Gln His Ala Asn Leu  Asp Ser Cys Lys Arg  Val Leu Asn 
    1190                 1195                 1200             


Val Val  Cys Lys Thr Cys Gly  Gln Gln Gln Thr Thr  Leu Lys Gly 
    1205                 1210                 1215             


Val Glu  Ala Val Met Tyr Met  Gly Thr Leu Ser Tyr  Glu Gln Phe 
    1220                 1225                 1230             


Lys Lys  Gly Val Gln Ile Pro  Cys Thr Cys Gly Lys  Gln Ala Thr 
    1235                 1240                 1245             


Lys Tyr  Leu Val Gln Gln Glu  Ser Pro Phe Val Met  Met Ser Ala 
    1250                 1255                 1260             


Pro Pro  Ala Gln Tyr Glu Leu  Lys His Gly Thr Phe  Thr Cys Ala 
    1265                 1270                 1275             


Ser Glu  Tyr Thr Gly Asn Tyr  Gln Cys Gly His Tyr  Lys His Ile 
    1280                 1285                 1290             


Thr Ser  Lys Glu Thr Leu Tyr  Cys Ile Asp Gly Ala  Leu Leu Thr 
    1295                 1300                 1305             


Lys Ser  Ser Glu Tyr Lys Gly  Pro Ile Thr Asp Val  Phe Tyr Lys 
    1310                 1315                 1320             


Glu Asn  Ser Tyr Thr Thr Thr  Ile Lys Pro Val Thr  Tyr Lys Leu 
    1325                 1330                 1335             


Asp Gly  Val Val Cys Thr Glu  Ile Asp Pro Lys Leu  Asp Asn Tyr 
    1340                 1345                 1350             


Tyr Lys  Lys Asp Asn Ser Tyr  Phe Thr Glu Gln Pro  Ile Asp Leu 
    1355                 1360                 1365             


Val Pro  Asn Gln Pro Tyr Pro  Asn Ala Ser Phe Asp  Asn Phe Lys 
    1370                 1375                 1380             


Phe Val  Cys Asp Asn Ile Lys  Phe Ala Asp Asp Leu  Asn Gln Leu 
    1385                 1390                 1395             


Thr Gly  Tyr Lys Lys Pro Ala  Ser Arg Glu Leu Lys  Val Thr Phe 
    1400                 1405                 1410             


Phe Pro  Asp Leu Asn Gly Asp  Val Val Ala Ile Asp  Tyr Lys His 
    1415                 1420                 1425             


Tyr Thr  Pro Ser Phe Lys Lys  Gly Ala Lys Leu Leu  His Lys Pro 
    1430                 1435                 1440             


Ile Val  Trp His Val Asn Asn  Ala Thr Asn Lys Ala  Thr Tyr Lys 
    1445                 1450                 1455             


Pro Asn  Thr Trp Cys Ile Arg  Cys Leu Trp Ser Thr  Lys Pro Val 
    1460                 1465                 1470             


Glu Thr  Ser Asn Ser Phe Asp  Val Leu Lys Ser Glu  Asp Ala Gln 
    1475                 1480                 1485             


Gly Met  Asp Asn Leu Ala Cys  Glu Asp Leu Lys Pro  Val Ser Glu 
    1490                 1495                 1500             


Glu Val  Val Glu Asn Pro Thr  Ile Gln Lys Asp Val  Leu Glu Cys 
    1505                 1510                 1515             


Asn Val  Lys Thr Thr Glu Val  Val Gly Asp Ile Ile  Leu Lys Pro 
    1520                 1525                 1530             


Ala Asn  Asn Ser Leu Lys Ile  Thr Glu Glu Val Gly  His Thr Asp 
    1535                 1540                 1545             


Leu Met  Ala Ala Tyr Val Asp  Asn Ser Ser Leu Thr  Ile Lys Lys 
    1550                 1555                 1560             


Pro Asn  Glu Leu Ser Arg Val  Leu Gly Leu Lys Thr  Leu Ala Thr 
    1565                 1570                 1575             


His Gly  Leu Ala Ala Val Asn  Ser Val Pro Trp Asp  Thr Ile Ala 
    1580                 1585                 1590             


Asn Tyr  Ala Lys Pro Phe Leu  Asn Lys Val Val Ser  Thr Thr Thr 
    1595                 1600                 1605             


Asn Ile  Val Thr Arg Cys Leu  Asn Arg Val Cys Thr  Asn Tyr Met 
    1610                 1615                 1620             


Pro Tyr  Phe Phe Thr Leu Leu  Leu Gln Leu Cys Thr  Phe Thr Arg 
    1625                 1630                 1635             


Ser Thr  Asn Ser Arg Ile Lys  Ala Ser Met Pro Thr  Thr Ile Ala 
    1640                 1645                 1650             


Lys Asn  Thr Val Lys Ser Val  Gly Lys Phe Cys Leu  Glu Ala Ser 
    1655                 1660                 1665             


Phe Asn  Tyr Leu Lys Ser Pro  Asn Phe Ser Lys Leu  Ile Asn Ile 
    1670                 1675                 1680             


Ile Ile  Trp Phe Leu Leu Leu  Ser Val Cys Leu Gly  Ser Leu Ile 
    1685                 1690                 1695             


Tyr Ser  Thr Ala Ala Leu Gly  Val Leu Met Ser Asn  Leu Gly Met 
    1700                 1705                 1710             


Pro Ser  Tyr Cys Thr Gly Tyr  Arg Glu Gly Tyr Leu  Asn Ser Thr 
    1715                 1720                 1725             


Asn Val  Thr Ile Ala Thr Tyr  Cys Thr Gly Ser Ile  Pro Cys Ser 
    1730                 1735                 1740             


Val Cys  Leu Ser Gly Leu Asp  Ser Leu Asp Thr Tyr  Pro Ser Leu 
    1745                 1750                 1755             


Glu Thr  Ile Gln Ile Thr Ile  Ser Ser Phe Lys Trp  Asp Leu Thr 
    1760                 1765                 1770             


Ala Phe  Gly Leu Val Ala Glu  Trp Phe Leu Ala Tyr  Ile Leu Phe 
    1775                 1780                 1785             


Thr Arg  Phe Phe Tyr Val Leu  Gly Leu Ala Ala Ile  Met Gln Leu 
    1790                 1795                 1800             


Phe Phe  Ser Tyr Phe Ala Val  His Phe Ile Ser Asn  Ser Trp Leu 
    1805                 1810                 1815             


Met Trp  Leu Ile Ile Asn Leu  Val Gln Met Ala Pro  Ile Ser Ala 
    1820                 1825                 1830             


Met Val  Arg Met Tyr Ile Phe  Phe Ala Ser Phe Tyr  Tyr Val Trp 
    1835                 1840                 1845             


Lys Ser  Tyr Val His Val Val  Asp Gly Cys Asn Ser  Ser Thr Cys 
    1850                 1855                 1860             


Met Met  Cys Tyr Lys Arg Asn  Arg Ala Thr Arg Val  Glu Cys Thr 
    1865                 1870                 1875             


Thr Ile  Val Asn Gly Val Arg  Arg Ser Phe Tyr Val  Tyr Ala Asn 
    1880                 1885                 1890             


Gly Gly  Lys Gly Phe Cys Lys  Leu His Asn Trp Asn  Cys Val Asn 
    1895                 1900                 1905             


Cys Asp  Thr Phe Cys Ala Gly  Ser Thr Phe Ile Ser  Asp Glu Val 
    1910                 1915                 1920             


Ala Arg  Asp Leu Ser Leu Gln  Phe Lys Arg Pro Ile  Asn Pro Thr 
    1925                 1930                 1935             


Asp Gln  Ser Ser Tyr Ile Val  Asp Ser Val Thr Val  Lys Asn Gly 
    1940                 1945                 1950             


Ser Ile  His Leu Tyr Phe Asp  Lys Ala Gly Gln Lys  Thr Tyr Glu 
    1955                 1960                 1965             


Arg His  Ser Leu Ser His Phe  Val Asn Leu Asp Asn  Leu Arg Ala 
    1970                 1975                 1980             


Asn Asn  Thr Lys Gly Ser Leu  Pro Ile Asn Val Ile  Val Phe Asp 
    1985                 1990                 1995             


Gly Lys  Ser Lys Cys Glu Glu  Ser Ser Ala Lys Ser  Ala Ser Val 
    2000                 2005                 2010             


Tyr Tyr  Ser Gln Leu Met Cys  Gln Pro Ile Leu Leu  Leu Asp Gln 
    2015                 2020                 2025             


Ala Leu  Val Ser Asp Val Gly  Asp Ser Ala Glu Val  Ala Val Lys 
    2030                 2035                 2040             


Met Phe  Asp Ala Tyr Val Asn  Thr Phe Ser Ser Thr  Phe Asn Val 
    2045                 2050                 2055             


Pro Met  Glu Lys Leu Lys Thr  Leu Val Ala Thr Ala  Glu Ala Glu 
    2060                 2065                 2070             


Leu Ala  Lys Asn Val Ser Leu  Asp Asn Val Leu Ser  Thr Phe Ile 
    2075                 2080                 2085             


Ser Ala  Ala Arg Gln Gly Phe  Val Asp Ser Asp Val  Glu Thr Lys 
    2090                 2095                 2100             


Asp Val  Val Glu Cys Leu Lys  Leu Ser His Gln Ser  Asp Ile Glu 
    2105                 2110                 2115             


Val Thr  Gly Asp Ser Cys Asn  Asn Tyr Met Leu Thr  Tyr Asn Lys 
    2120                 2125                 2130             


Val Glu  Asn Met Thr Pro Arg  Asp Leu Gly Ala Cys  Ile Asp Cys 
    2135                 2140                 2145             


Ser Ala  Arg His Ile Asn Ala  Gln Val Ala Lys Ser  His Asn Ile 
    2150                 2155                 2160             


Ala Leu  Ile Trp Asn Val Lys  Asp Phe Met Ser Leu  Ser Glu Gln 
    2165                 2170                 2175             


Leu Arg  Lys Gln Ile Arg Ser  Ala Ala Lys Lys Asn  Asn Leu Pro 
    2180                 2185                 2190             


Phe Lys  Leu Thr Cys Ala Thr  Thr Arg Gln Val Val  Asn Val Val 
    2195                 2200                 2205             


Thr Thr  Lys Ile Ala Leu Lys  Gly Gly 
    2210                 2215         


<210>  20
<211>  6654
<212>  DNA
<213>  artificial

<220>
<223>  Nucleotide sequence encoding SEQ ID NO:19

<400>  20
atgtttgtgt tcctggtcct gctccccttg gtgtcttctc agtgctttac agtggagaaa       60

gggatttacc aaacctcaaa ctttcgggtg cagcccaccg aatcaattgt gcgctttcca      120

aacattacca atctgtgtcc ctttggcgaa gtattcaacg ccaccaggtt tgcaagtttt      180

accgtagaag tctatgcttg gaatcggaaa cgaatctcta attgtgtagc cgactactct      240

gtgctttaca atagtgcgtc attctctacc ttcaagtgct acggagttag tcctacgaag      300

cttaacgacc tgtgcttcac taacgtatac gcggatagtt ttgtgattcg cggagatgag      360

gttcggcaga tcgctcctgg acagacagga aaaattgcgg attacaatta caagctgcct      420

gacgacttca cggggtgcgt tatcgcatgg aattccaaca accttgacag caaagtgggc      480

gggaactaca actatctgta ccggctcttt agaaagagta acctgaagcc cttcgagcgg      540

gacataagta ccgagatata tcaggccggg agcactccct gcaatggcgt agaaggattc      600

aattgctatt ttcccctgca gtcctatgga tttcagccaa ctaatggagt gggttatcag      660

ccttatcggg tcgtggtcct tagttttgag ctcttgcatg cgcctgccac tgtgtgcggg      720

ccaaaaaagt ctacaaacct tgttaagaac aaatgcgtga attttaattt cggcggaggc      780

gggagtggcg gtgggggcag tggtgggggc ggctcagccc ctactaaggt caccttcggg      840

gacgacaccg taattgaagt acaaggatac aaaagtgtga acattacctt cgagctggac      900

gagcggattg acaaagtgct gaatgagaaa tgtagtgctt acacagtaga gcttggcact      960

gaagtgaatg aattcgcatg cgttgtggct gatgccgtga taaagaccct gcagcccgtg     1020

agcgagctcc tcacccctct gggcatcgac ctggacgagt ggagcatggc gacctactac     1080

ctgttcgacg aatctggcga atttaagctg gcttctcata tgtactgtag cttctatccc     1140

cccgatgagg acgaggaaga gggagattgc gaagaggaag aatttgaacc cagcactcag     1200

tatgaatatg gaactgaaga tgattaccaa ggtaaacctc tggaatttgg agcaacgagt     1260

gcagctctgc aacccgagga ggagcaggag gaagactggc tcgacgatga ttcccaacag     1320

acggtcggcc aacaggatgg gtccgaggat aatcagacga ccactatcca gactatagtc     1380

gaggttcagc cacaactgga aatggagctt actccagtag tgcagaccat agaggtcaat     1440

agctttagcg gatacctgaa actgactgat aatgtctaca ttaagaatgc agatatagtc     1500

gaagaagcca aaaaggtgaa acctaccgtg gttgtcaatg ccgccaacgt ctacctgaaa     1560

cacgggggag gcgtagccgg cgccctgaat aaagcaacaa acaacgccat gcaggtagag     1620

tcagatgact acatcgcaac caatgggcct ttgaaggtgg gaggcagctg tgtcctgtct     1680

ggccacaatc tggccaaaca ttgtctccac gtggttggac cgaatgtgaa caagggcgag     1740

gatattcagt tgctcaagag cgcatatgag aattttaacc agcacgaggt actgttggcc     1800

ccactgctta gcgcagggat tttcggcgct gaccctattc atagtcttcg agtgtgtgtg     1860

gatactgtta gaacaaatgt ctacctggca gtcttcgaca aaaatctgta tgacaaactt     1920

gtctcatcat tccttgaaat gaagtcagaa aagcaagtcg agcagaagat cgcagagatc     1980

ccaaaagagg aagtgaagcc atttatcacc gagagcaaac ccagtgtgga gcaaaggaaa     2040

caggatgaca agaaaattaa ggcatgtgtg gaggaagtca ccacaactct ggaggaaacg     2100

aaattcctga cagagaatct cctcttgtat attgacatta atggaaacct tcaccccgac     2160

agcgcaacgc tggtctctga catcgatatt acgtttctta agaaagatgc tccttatatc     2220

gtgggcgacg tcgtgcaaga aggagtgctg accgccgttg tcatcccgac aaagaaggcc     2280

ggcgggacta cagaaatgct ggcaaaggcc ctgcggaaag tgccaacaga taattacatc     2340

acgacctatc ctgggcaagg cctcaacggc tacaccgtgg aagaggccaa gacagtgctg     2400

aaaaagtgca aaagcgcctt ttatatcttg ccctctatta tcagtaatga gaaacaggaa     2460

attctgggaa ctgtgagctg gaacctgaga gagatgctgg cccacgcgga ggaaacgaga     2520

aaattgatgc ccgtgtgtgt ggaaactaag gctatcgtga gcactattca gcggaagtac     2580

aaaggcataa aaatccagga aggcgtggtg gattatggcg ctagattcta tttttataca     2640

tccaagacga ctgttgcatc tctgatcaac acactcaacg atctgaatga gaccctggtt     2700

actatgcctc ttgggtacgt gacacatggt ctgaacctgg aagaagccgc tcgatatatg     2760

aggagcctga aggtccccgc caccgttagc gtctcctccc ctgacgccgt gacagcctac     2820

aacggatacc tgacctcctc ctctaagacc ccagaggagc acttcatcga aactatctcc     2880

ttggccggaa gctataagga ttggagttat tctggacaga gcacacaatt gggcattgag     2940

ttcctgaaga gaggcgacaa gagcgtgtac tacacctcca atcctaccac cttccatctg     3000

gacggagaag taattacatt tgacaatctg aagacacttc tcagccttag ggaggtgcgc     3060

accattaagg tattcaccac cgttgataac attaacctcc atacccaggt ggtggacatg     3120

agtatgacgt acggtcaaca gttcgggcca acatatttgg atggggcaga cgtgaccaag     3180

attaagcccc ataattccca tgagggaaag acattctacg tcctgcccaa cgatgacacc     3240

cttcgagtag aggcatttga atactaccac acaacagacc cgtctttctt gggacgctac     3300

atgagcgcac tcaatcacac taagaaatgg aaatacccgc aggttaacgg acttacctcc     3360

attaagtggg cagataataa ctgttacctc gctacagccc tgctgacatt gcaacagatc     3420

gagctgaaat tcaacccccc cgcactccaa gatgcctact accgagcacg agccggcgag     3480

gctgccaact tttgcgccct gatcttggct tactgcaaca aaactgtagg agaactgggt     3540

gacgtgcgag aaacgatgag ctatctgttc cagcacgcaa atctggactc atgcaaacga     3600

gtactcaacg tggtctgcaa gacatgcggt cagcagcaaa ctacactcaa gggagtagag     3660

gctgtgatgt atatgggcac actgtcctac gagcagttca agaagggcgt gcagattcct     3720

tgtacttgcg gtaagcaggc caccaaatat ttggtacagc aggaaagccc attcgtgatg     3780

atgtccgcac cccccgctca gtatgaactc aaacatggta cattcacctg cgcctcagag     3840

tacacaggaa attaccagtg cggccactat aagcatatca cctcaaagga gaccctgtat     3900

tgtattgacg gcgcccttct gaccaagagc tctgagtaca agggcccaat cacagacgtt     3960

ttttataagg agaactctta caccacaacc atcaagcccg tgacctacaa gctggatggg     4020

gtggtgtgca cagaaataga tccgaagctg gataactatt ataaaaaaga caacagctac     4080

ttcaccgagc aacctatcga cttggtccca aaccagccat acccgaacgc cagttttgac     4140

aattttaagt ttgtctgcga caacattaag tttgccgacg acctgaatca gctcactgga     4200

tacaagaagc cagcaagcag ggagctgaag gtgacctttt tccccgacct caacggcgac     4260

gtggtggcca ttgactataa acactatacc ccaagtttca agaagggcgc caaactcctt     4320

cataagccta tcgtgtggca tgttaataat gcaactaata aagctacata caaacctaac     4380

acatggtgta tacgctgcct ttggtctact aaaccagtcg agactagtaa cagctttgat     4440

gtgcttaaat ccgaggacgc ccaaggaatg gacaaccttg cctgtgaaga cttgaagccg     4500

gtcagcgagg aggtggtcga gaacccgaca atacaaaagg acgtgcttga gtgtaacgtt     4560

aagaccacgg aggtagttgg ggacatcatt ctcaagccag ccaataactc acttaaaatc     4620

accgaggagg tgggtcacac tgatctgatg gctgcctacg tggacaactc ctcacttact     4680

ataaaaaaac ccaatgagct tagcagggtg cttggtctga agacacttgc aacccacggg     4740

cttgcggcgg tcaattcagt gccctgggat accattgcca attacgcaaa gccctttctt     4800

aataaagtcg tttctaccac aactaacatc gttaccagat gcctgaacag agtttgtaca     4860

aactacatgc cttacttctt tacactcctg ctgcagctct gtacctttac tcggagtaca     4920

aattcacgca taaaggcctc catgccaaca acgatcgcga agaacaccgt gaagtcagta     4980

ggcaagttct gtctggaagc atccttcaac tatctcaagt ctcccaactt ttctaagctg     5040

atcaacataa tcatttggtt cctcttgctg tcagtatgcc tggggtcact catctacagc     5100

accgctgcac tgggcgtatt gatgtcaaat ctgggaatgc catcttattg tactggatac     5160

agagagggat atctgaacag tactaatgtg acaattgcca cctattgcac tggtagcatc     5220

ccttgctctg tatgcctttc aggcctggat tcccttgata catatcccag cctcgagacc     5280

attcagatta caatatcctc attcaagtgg gatctgaccg cttttggact ggttgctgaa     5340

tggttcctgg cttacatcct cttcacacgc ttcttttatg tgctgggcct ggccgctatc     5400

atgcaactgt tcttcagtta ttttgctgtc cacttcatct ctaacagttg gctgatgtgg     5460

ctcatcatca acctggtgca aatggcccct atttccgcca tggtgcgcat gtatatattt     5520

ttcgcatcct tctactatgt ttggaaatct tacgtgcatg tagtggacgg ttgcaacagt     5580

tcaacttgta tgatgtgcta caaaagaaat cgagccacac gcgtcgagtg cacaacgatt     5640

gtcaacggtg tccgaaggtc cttctacgtg tacgccaatg gaggtaaagg gttctgcaag     5700

cttcataatt ggaattgcgt gaattgcgac acattctgcg ccggtagtac cttcatctca     5760

gacgaagtgg cccgcgatct gtccctccaa tttaagcgac ctataaatcc aaccgatcag     5820

agtagctaca ttgtggactc tgtgaccgtt aagaacggca gcattcatct gtactttgat     5880

aaagcaggac aaaagacgta cgagcggcac tctctgtctc atttcgttaa cctcgacaat     5940

ctccgcgcca ataacaccaa gggaagtttg ccaatcaacg taattgtttt cgatggtaag     6000

agtaagtgcg aagagtcttc cgctaaaagt gcttctgtgt actactcaca gttgatgtgt     6060

cagcctatcc ttctgctgga ccaggccctg gtgtccgatg tcggagattc agccgaggtt     6120

gctgtcaaga tgttcgacgc ttacgtaaac accttcagct ctacgtttaa tgtgcctatg     6180

gagaaactta agacacttgt ggcaacagcc gaggcagagc tggcgaaaaa cgtgtcactg     6240

gacaacgtcc tgtccacttt tatatccgcc gctcggcagg ggttcgtgga ttccgacgtg     6300

gagacaaaag atgtggtcga gtgcttgaaa ctgtctcacc aatccgatat agaagtgacc     6360

ggcgacagct gcaacaacta catgctcaca tacaataagg tggagaacat gactccgagg     6420

gatttgggag catgcattga ctgttcagct agacatatta acgctcaagt ggccaaaagc     6480

cataacattg cactgatctg gaacgttaaa gacttcatgt ccctgtccga gcagctccgc     6540

aaacaaatca ggtccgccgc caagaagaat aacttgccct ttaaactcac ttgcgccaca     6600

acaagacagg tggttaatgt cgttactact aagattgcac tgaagggagg ctga           6654


<210>  21
<211>  872
<212>  PRT
<213>  artificial

<220>
<223>  Fusion protein including S1-RBD, S2-HR2, Ubl1-nsp3, 3Ecto-Nsp3 
       and Nsp8

<400>  21

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Phe 
1               5                   10                  15      


Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln Pro 
            20                  25                  30          


Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe 
        35                  40                  45              


Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr Val Glu Val 
    50                  55                  60                  


Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser 
65                  70                  75                  80  


Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val 
                85                  90                  95      


Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp 
            100                 105                 110         


Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln 
        115                 120                 125             


Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr 
    130                 135                 140                 


Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly 
145                 150                 155                 160 


Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys 
                165                 170                 175     


Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr 
            180                 185                 190         


Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser 
        195                 200                 205             


Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val 
    210                 215                 220                 


Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly 
225                 230                 235                 240 


Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn 
                245                 250                 255     


Phe Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
            260                 265                 270         


Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro His Gly Val Val Phe 
        275                 280                 285             


Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala 
    290                 295                 300                 


Pro Ala Ile Cys His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val 
305                 310                 315                 320 


Phe Val Ser Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr 
                325                 330                 335     


Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys 
            340                 345                 350         


Asp Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln 
        355                 360                 365             


Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn 
    370                 375                 380                 


His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala 
385                 390                 395                 400 


Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala 
                405                 410                 415     


Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr 
            420                 425                 430         


Glu Gln Tyr Ile Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
        435                 440                 445             


Gly Gly Ser Ala Pro Thr Lys Val Thr Phe Gly Asp Asp Thr Val Ile 
    450                 455                 460                 


Glu Val Gln Gly Tyr Lys Ser Val Asn Ile Thr Phe Glu Leu Asp Glu 
465                 470                 475                 480 


Arg Ile Asp Lys Val Leu Asn Glu Lys Cys Ser Ala Tyr Thr Val Glu 
                485                 490                 495     


Leu Gly Thr Glu Val Asn Glu Phe Ala Cys Val Val Ala Asp Ala Val 
            500                 505                 510         


Ile Lys Thr Leu Gln Pro Val Ser Glu Leu Leu Thr Pro Leu Gly Ile 
        515                 520                 525             


Asp Leu Asp Glu Trp Ser Met Ala Thr Tyr Tyr Leu Phe Asp Glu Ser 
    530                 535                 540                 


Gly Glu Phe Lys Leu Ala Ser His Met Tyr Cys Ser Phe Tyr Pro Pro 
545                 550                 555                 560 


Asp Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 
                565                 570                 575     


Ser Ser Asn Leu Gly Met Pro Ser Tyr Cys Thr Gly Tyr Arg Glu Gly 
            580                 585                 590         


Tyr Leu Asn Ser Thr Asn Val Thr Ile Ala Thr Tyr Cys Thr Gly Ser 
        595                 600                 605             


Ile Pro Cys Ser Val Cys Leu Ser Gly Leu Asp Ser Leu Asp Thr Tyr 
    610                 615                 620                 


Pro Ser Leu Glu Thr Ile Gln Ile Thr Ile Ser Ser Phe Lys Trp Asp 
625                 630                 635                 640 


Leu Thr Ala Phe Gly Leu Val Ala Glu Trp Phe Leu Ala Tyr Ile Leu 
                645                 650                 655     


Phe Thr Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
            660                 665                 670         


Gly Ser Ala Ile Ala Ser Glu Phe Ser Ser Leu Pro Ser Tyr Ala Ala 
        675                 680                 685             


Phe Ala Thr Ala Gln Glu Ala Tyr Glu Gln Ala Val Ala Asn Gly Asp 
    690                 695                 700                 


Ser Glu Val Val Leu Lys Lys Leu Lys Lys Ser Leu Asn Val Ala Lys 
705                 710                 715                 720 


Ser Glu Phe Asp Arg Asp Ala Ala Met Gln Arg Lys Leu Glu Lys Met 
                725                 730                 735     


Ala Asp Gln Ala Met Thr Gln Met Tyr Lys Gln Ala Arg Ser Glu Asp 
            740                 745                 750         


Lys Arg Ala Lys Val Thr Ser Ala Met Gln Thr Met Leu Phe Thr Met 
        755                 760                 765             


Leu Arg Lys Leu Asp Asn Asp Ala Leu Asn Asn Ile Ile Asn Asn Ala 
    770                 775                 780                 


Arg Asp Gly Cys Val Pro Leu Asn Ile Ile Pro Leu Thr Thr Ala Ala 
785                 790                 795                 800 


Lys Leu Met Val Val Ile Pro Asp Tyr Asn Thr Tyr Lys Asn Thr Cys 
                805                 810                 815     


Asp Gly Thr Thr Phe Thr Tyr Ala Ser Ala Leu Trp Glu Ile Gln Gln 
            820                 825                 830         


Val Val Asp Ala Asp Ser Lys Ile Val Gln Leu Ser Glu Ile Ser Met 
        835                 840                 845             


Asp Asn Ser Pro Asn Leu Ala Trp Pro Leu Ile Val Thr Ala Leu Arg 
    850                 855                 860                 


Ala Asn Ser Ala Val Lys Leu Gln 
865                 870         


<210>  22
<211>  2619
<212>  DNA
<213>  artificial

<220>
<223>  Nucleotide sequence encoding SEQ ID NO:21

<400>  22
atgtttgtgt ttctggtgct gcttcccctg gtatcatctc agtgcttcac agtggagaaa       60

gggatttacc agacgagcaa ctttcgggtc cagccaaccg aaagtatagt gaggttcccc      120

aatattacta acctttgtcc cttcggtgaa gtgttcaatg caacccgatt tgcttctttc      180

acggtcgagg tctacgcttg gaataggaaa agaatctcca attgtgtggc cgattactcc      240

gttctgtata atagtgcgtc attttccacc ttcaagtgct atggtgtgtc cccaacaaaa      300

ttgaatgatc tttgttttac caacgtatac gcagacagct tcgtgataag gggggacgag      360

gtgcggcaga tagccccagg tcagaccgga aaaatagcag attacaatta taagctccca      420

gatgatttta caggctgcgt gatcgcctgg aactctaata acctcgattc aaaggttggc      480

ggaaattaca actacttgta taggcttttc aggaagtcta acttgaagcc cttcgaacgc      540

gacattagca cagagattta ccaggccggt tccactcctt gtaatggtgt agaaggcttc      600

aattgctact tccctctgca atcttatggt tttcagccaa ccaatggggt gggctatcag      660

ccttatcgcg tggtggtgct gtcattcgag cttcttcatg ctccagcaac agtgtgcgga      720

ccaaaaaaat ctacaaactt ggtgaagaat aagtgcgtta atttcaattt tggcggagga      780

ggatcaggag gcggcggcag cggcggcggg ggttcttacc acctcatgtc tttcccgcag      840

tctgctcccc acggcgttgt atttctgcat gtcacttatg tccctgctca ggagaagaac      900

ttcacaacag cgcctgcaat ttgccacgat ggaaaagcac acttcccccg agagggcgtc      960

ttcgtcagca acgggaccca ctggttcgtg actcagagga atttctacga accgcagatc     1020

ataacaaccg acaacacgtt tgtgtcaggg aactgcgacg tcgtgatcgg gatagtcaac     1080

aacaccgtat acgatcctct ccagcctgag ctggacagct ttaaggagga gctcgataaa     1140

tattttaaga atcacacttc tccggacgtg gacctcggcg atatttctgg cattaatgct     1200

tcagtggtga acattcaaaa ggagattgac aggctgaacg aggttgccaa gaatctgaat     1260

gaatctctca ttgaccttca ggaactgggg aaatatgaac aatatattgg tggcggagga     1320

tccggaggcg gaggatcagg cggcggagga agcgccccta caaaggtcac tttcggagat     1380

gacaccgtta ttgaggtgca gggctacaaa tctgtgaata ttacctttga gctggatgaa     1440

agaatcgaca aagtgttgaa tgagaaatgc tccgcttata ctgtcgaact ggggactgaa     1500

gttaatgaat tcgcgtgtgt ggtggcagac gccgttataa aaactttgca gcctgtatcc     1560

gagttgctga cccctttggg gatcgatctt gatgaatggt ccatggcaac atactacctc     1620

tttgacgagt caggagaatt caagctggcc tctcacatgt actgttcatt ttatccaccc     1680

gacgaaggag gcggagggag cggtggcggt ggttctggcg gaggcggttc atcaaatctc     1740

ggaatgccaa gctactgcac tgggtatagg gagggttacc ttaattctac aaacgtgacc     1800

atagccactt actgcactgg atcaatacct tgcagcgtgt gtctcagcgg gcttgactcc     1860

ctcgatacat acccctcact ggaaactatt cagatcacta tatcttcctt caaatgggac     1920

ctgactgcct tcgggctggt cgccgaatgg tttctggcct acattctttt tactagggga     1980

ggcgggggct ccggtggagg aggctccggg ggcggcggca gcgcaattgc ctcagaattc     2040

tcctctctgc catcatacgc tgctttcgca actgcccagg aagcctacga acaagctgtt     2100

gcgaatgggg actccgaagt ggtgcttaag aaactgaaaa aatctctcaa cgtcgcgaag     2160

agtgaattcg atcgagacgc agcaatgcag aggaagctgg agaagatggc cgatcaagca     2220

atgacacaaa tgtataaaca ggcccgaagt gaggataagc gggcgaaggt cacatccgcc     2280

atgcagacta tgctctttac tatgctgaga aagctcgaca atgacgccct gaacaacatt     2340

attaataatg caagggatgg ttgtgtgccc ctcaacatta tacctctgac tacagccgct     2400

aaacttatgg tggtgattcc tgattacaat acttataaaa atacatgtga cggcacaact     2460

ttcacatatg ccagtgcact gtgggagatt cagcaggtgg ttgacgcaga cagtaagatt     2520

gtgcaactgt cagaaattag tatggataat agcccgaacc tcgcgtggcc actgattgtt     2580

accgccctga gggctaattc agctgtcaag ctgcagtag                            2619


<210>  23
<211>  902
<212>  PRT
<213>  artificial

<220>
<223>  Fusion protein including S1-Spike protein and Nsp8

<400>  23

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr Val 
            340                 345                 350         


Glu Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp 
        355                 360                 365             


Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr 
    370                 375                 380                 


Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr 
385                 390                 395                 400 


Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro 
                405                 410                 415     


Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp 
            420                 425                 430         


Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys 
        435                 440                 445             


Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn 
    450                 455                 460                 


Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly 
465                 470                 475                 480 


Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu 
                485                 490                 495     


Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr 
            500                 505                 510         


Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val 
        515                 520                 525             


Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn 
    530                 535                 540                 


Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn 
545                 550                 555                 560 


Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr 
                565                 570                 575     


Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr 
            580                 585                 590         


Pro Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr 
        595                 600                 605             


Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val 
    610                 615                 620                 


Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr 
625                 630                 635                 640 


Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly 
                645                 650                 655     


Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala 
            660                 665                 670         


Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala 
        675                 680                 685             


Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
    690                 695                 700                 


Ala Ile Ala Ser Glu Phe Ser Ser Leu Pro Ser Tyr Ala Ala Phe Ala 
705                 710                 715                 720 


Thr Ala Gln Glu Ala Tyr Glu Gln Ala Val Ala Asn Gly Asp Ser Glu 
                725                 730                 735     


Val Val Leu Lys Lys Leu Lys Lys Ser Leu Asn Val Ala Lys Ser Glu 
            740                 745                 750         


Phe Asp Arg Asp Ala Ala Met Gln Arg Lys Leu Glu Lys Met Ala Asp 
        755                 760                 765             


Gln Ala Met Thr Gln Met Tyr Lys Gln Ala Arg Ser Glu Asp Lys Arg 
    770                 775                 780                 


Ala Lys Val Thr Ser Ala Met Gln Thr Met Leu Phe Thr Met Leu Arg 
785                 790                 795                 800 


Lys Leu Asp Asn Asp Ala Leu Asn Asn Ile Ile Asn Asn Ala Arg Asp 
                805                 810                 815     


Gly Cys Val Pro Leu Asn Ile Ile Pro Leu Thr Thr Ala Ala Lys Leu 
            820                 825                 830         


Met Val Val Ile Pro Asp Tyr Asn Thr Tyr Lys Asn Thr Cys Asp Gly 
        835                 840                 845             


Thr Thr Phe Thr Tyr Ala Ser Ala Leu Trp Glu Ile Gln Gln Val Val 
    850                 855                 860                 


Asp Ala Asp Ser Lys Ile Val Gln Leu Ser Glu Ile Ser Met Asp Asn 
865                 870                 875                 880 


Ser Pro Asn Leu Ala Trp Pro Leu Ile Val Thr Ala Leu Arg Ala Asn 
                885                 890                 895     


Ser Ala Val Lys Leu Gln 
            900         


<210>  24
<211>  2709
<212>  DNA
<213>  artificial

<220>
<223>  Nucleotide sequence encoding SEQ ID NO:23

<400>  24
atgttcgtct tccttgtgct cctgcctctg gtgtcatccc agtgcgtaaa cctgacaaca       60

agaacccagc ttcctccagc ctacactaat tccttcacta gaggggtgta ctaccccgat      120

aaagtattta ggtcctccgt gctccactcc acacaagatc ttttcctccc gttcttctcc      180

aatgtcacat ggtttcatgc gatccatgtt agtgggacaa atggcacaaa acgctttgat      240

aacccagtcc tcccttttaa tgacggggtg tatttcgcat ctactgagaa aagcaacatc      300

atcagaggat ggattttcgg gaccacactg gattctaaaa cacagagcct gctgatagta      360

aacaatgcaa ctaatgtggt gatcaaagtg tgcgaatttc agttctgcaa cgacccattc      420

cttggcgttt actatcacaa gaacaataaa agttggatgg agtccgaatt tagagtgtat      480

tcaagcgcta acaactgtac tttcgagtac gtgtcccagc catttctgat ggatctcgaa      540

ggcaaacagg gcaactttaa aaatctgcgg gaattcgtct tcaagaatat cgacgggtac      600

ttcaaaatct attccaaaca tacccccata aaccttgtga gggacctgcc ccaaggattt      660

agcgcattgg aacccttggt ggacctgcct attggaatca atatcactag gtttcagaca      720

ctgctggccc tgcaccgctc ctaccttacg cctggggact cctcatccgg ttggaccgct      780

ggcgcagctg cttattatgt gggttacctg caaccccgga cattcctcct taaatacaat      840

gaaaacggga ccataacgga cgccgtcgat tgtgccctcg atccactcag cgagacgaag      900

tgcacactga agtccttcac agtggagaag gggatatatc agaccagtaa ttttcgggtc      960

cagcctacag agtcaattgt gcgctttcca aatatcacca acctttgtcc attcggcgag     1020

gtttttaatg caaccagatt cgcatccttt accgtagaag tttatgcttg gaatcgaaag     1080

aggatttcaa attgcgtcgc tgactatagc gtgctgtata atagcgcttc attttcaacc     1140

ttcaaatgtt atggagttag cccaaccaag ttgaatgatc tttgcttcac aaatgtgtat     1200

gccgactcat ttgttatccg cggagacgag gtcagacaaa tcgcccccgg acagacgggc     1260

aagatcgcag actataacta caaactcccc gacgatttta ccgggtgcgt gattgcctgg     1320

aactctaata accttgatag taaagttggg ggaaattata attatctgta caggctcttt     1380

cggaaatcaa acctgaaacc attcgagcgg gatatttcca cagagatcta tcaggctggc     1440

tcaacgcctt gcaacggggt agagggattt aactgttatt ttccgttgca atcctatggt     1500

ttccaaccta ccaacggtgt gggatatcag ccgtataggg tggtcgtgct tagcttcgag     1560

ctgctgcacg ctccagccac agtctgtggc ccgaaaaagt caactaatct tgtgaaaaat     1620

aaatgcgtga attttaattt caatgggttg accggaacag gagttctgac cgagagcaat     1680

aaaaagtttt tgccctttca gcaatttggc cgggatatag ccgacacaac cgatgcggtc     1740

cgcgatcctc aaacactgga aattttggat atcacccctt gtagctttgg tggcgtctct     1800

gtcatcaccc ccggaaccaa tacttctaac caggtggccg tcttgtacca ggatgttaac     1860

tgtaccgagg tgcctgtggc gattcacgca gaccagctta cccccacatg gagagtgtat     1920

tctacaggat ctaacgtctt ccagacacga gcgggctgct tgattggcgc tgaacacgtg     1980

aataattcct acgagtgtga cattccgata ggcgcgggaa tctgcgcatc atatcagaca     2040

cagactaata gtccgaggag agctagaggg ggcggcgggt cagggggggg aggcagtggg     2100

ggagggggct cagcgatcgc gtccgagttt tctagtctgc cgagctatgc tgcattcgcc     2160

acagcccaag aagcatatga acaagccgtc gctaacggtg actctgaggt ggtgctgaag     2220

aagctgaaga agagccttaa tgtggcaaag agcgagttcg acagggacgc cgcaatgcag     2280

cgcaagctgg aaaaaatggc tgaccaggcc atgacccaga tgtataagca ggctagatca     2340

gaagataaga gagccaaagt gacttccgcg atgcaaacca tgcttttcac aatgctgcgg     2400

aaactggata acgatgctct gaataacata attaacaatg cccgagacgg ctgtgtccct     2460

ctcaatatta tcccccttac caccgcggca aaacttatgg tggtgatacc cgactacaac     2520

acttacaaga acacatgcga cgggacgaca ttcacgtacg cgtccgctct ctgggaaatt     2580

caacaagtgg ttgacgctga ttccaagatt gtgcagctgt cagaaatctc aatggataat     2640

tcacctaatt tggcctggcc tctgatcgtg actgcattga gggcaaattc cgccgtcaag     2700

ttgcagtga                                                             2709


<210>  25
<211>  1277
<212>  PRT
<213>  artificial

<220>
<223>  Protein including full spike protein

<400>  25

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr Val 
            340                 345                 350         


Glu Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp 
        355                 360                 365             


Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr 
    370                 375                 380                 


Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr 
385                 390                 395                 400 


Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro 
                405                 410                 415     


Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp 
            420                 425                 430         


Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys 
        435                 440                 445             


Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn 
    450                 455                 460                 


Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly 
465                 470                 475                 480 


Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu 
                485                 490                 495     


Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr 
            500                 505                 510         


Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val 
        515                 520                 525             


Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn 
    530                 535                 540                 


Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn 
545                 550                 555                 560 


Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr 
                565                 570                 575     


Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr 
            580                 585                 590         


Pro Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr 
        595                 600                 605             


Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val 
    610                 615                 620                 


Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr 
625                 630                 635                 640 


Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly 
                645                 650                 655     


Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala 
            660                 665                 670         


Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala 
        675                 680                 685             


Arg Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly 
    690                 695                 700                 


Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr 
705                 710                 715                 720 


Asn Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr 
                725                 730                 735     


Lys Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu 
            740                 745                 750         


Cys Ser Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn 
        755                 760                 765             


Arg Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu 
    770                 775                 780                 


Val Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp 
785                 790                 795                 800 


Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro 
                805                 810                 815     


Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu 
            820                 825                 830         


Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile 
        835                 840                 845             


Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val 
    850                 855                 860                 


Leu Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala 
865                 870                 875                 880 


Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala 
                885                 890                 895     


Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly 
            900                 905                 910         


Ile Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala 
        915                 920                 925             


Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser 
    930                 935                 940                 


Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala 
945                 950                 955                 960 


Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala 
                965                 970                 975     


Ile Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu 
            980                 985                 990         


Ala Glu Val Gln Ile Asp Arg Leu  Ile Thr Gly Arg Leu  Gln Ser Leu 
        995                 1000                 1005             


Gln Thr  Tyr Val Thr Gln Gln  Leu Ile Arg Ala Ala  Glu Ile Arg 
    1010                 1015                 1020             


Ala Ser  Ala Asn Leu Ala Ala  Thr Lys Met Ser Glu  Cys Val Leu 
    1025                 1030                 1035             


Gly Gln  Ser Lys Arg Val Asp  Phe Cys Gly Lys Gly  Tyr His Leu 
    1040                 1045                 1050             


Met Ser  Phe Pro Gln Ser Ala  Pro His Gly Val Val  Phe Leu His 
    1055                 1060                 1065             


Val Thr  Tyr Val Pro Ala Gln  Glu Lys Asn Phe Thr  Thr Ala Pro 
    1070                 1075                 1080             


Ala Ile  Cys His Asp Gly Lys  Ala His Phe Pro Arg  Glu Gly Val 
    1085                 1090                 1095             


Phe Val  Ser Asn Gly Thr His  Trp Phe Val Thr Gln  Arg Asn Phe 
    1100                 1105                 1110             


Tyr Glu  Pro Gln Ile Ile Thr  Thr Asp Asn Thr Phe  Val Ser Gly 
    1115                 1120                 1125             


Asn Cys  Asp Val Val Ile Gly  Ile Val Asn Asn Thr  Val Tyr Asp 
    1130                 1135                 1140             


Pro Leu  Gln Pro Glu Leu Asp  Ser Phe Lys Glu Glu  Leu Asp Lys 
    1145                 1150                 1155             


Tyr Phe  Lys Asn His Thr Ser  Pro Asp Val Asp Leu  Gly Asp Ile 
    1160                 1165                 1170             


Ser Gly  Ile Asn Ala Ser Val  Val Asn Ile Gln Lys  Glu Ile Asp 
    1175                 1180                 1185             


Arg Leu  Asn Glu Val Ala Lys  Asn Leu Asn Glu Ser  Leu Ile Asp 
    1190                 1195                 1200             


Leu Gln  Glu Leu Gly Lys Tyr  Glu Gln Tyr Ile Lys  Trp Pro Trp 
    1205                 1210                 1215             


Tyr Ile  Trp Leu Gly Phe Ile  Ala Gly Leu Ile Ala  Ile Val Met 
    1220                 1225                 1230             


Val Thr  Ile Met Leu Cys Cys  Met Thr Ser Cys Cys  Ser Cys Leu 
    1235                 1240                 1245             


Lys Gly  Cys Cys Ser Cys Gly  Ser Cys Cys Lys Phe  Asp Glu Asp 
    1250                 1255                 1260             


Asp Ser  Glu Pro Val Leu Lys  Gly Val Lys Leu His  Tyr Thr 
    1265                 1270                 1275         


<210>  26
<211>  3834
<212>  DNA
<213>  artificial

<220>
<223>  Nucleotide sequence encoding SEQ ID NO:25

<400>  26
atgttcgtct tcctggtcct cctccctctt gtctcttctc agtgtgtgaa cctcactaca       60

aggacccaac tccccccagc ttatacaaac tccttcacgc gaggagtgta ctaccccgat      120

aaagtcttca gaagcagcgt tctccatagc acgcaggatc tgttcctgcc gttcttttct      180

aacgtaactt ggtttcacgc cattcatgta tcaggcacta atggaaccaa acggtttgac      240

aatcctgtgt tgccctttaa cgacggggtt tacttcgcct ctaccgagaa atcaaacatc      300

atcaggggat ggattttcgg cactaccctt gactctaaga cacagagcct cctcatcgtg      360

aataacgcca ccaacgtagt tatcaaggtg tgtgaatttc agttctgcaa cgacccgttc      420

ttgggagtat attaccacaa aaacaacaaa tcatggatgg agtccgagtt tcgggtgtac      480

tccagcgcca ataactgcac attcgagtac gtttcccagc ccttccttat ggaccttgaa      540

gggaagcagg gtaacttcaa aaaccttcgg gaattcgtat ttaaaaacat cgatggctat      600

tttaagatat actctaaaca cacacctatc aacctcgtga gagaccttcc tcagggattc      660

tctgctcttg agcctctcgt tgatttgcct ataggcataa atattacccg attccagacc      720

ctgcttgctc tgcacagatc ctatttgacc cccggcgata gctcctccgg gtggaccgcc      780

ggcgcagccg cctattatgt ggggtatctg cagcccagaa ccttcttgct caagtacaac      840

gagaatggaa caataaccga tgcagtggac tgtgccctcg atccccttag cgagactaaa      900

tgcacgctga agagctttac cgtcgaaaag ggaatctacc agacaagcaa ttttcgggtc      960

cagccaaccg agagtatcgt gagatttccg aacattacca acctgtgccc tttcggggag     1020

gttttcaacg ctacccgctt tgcttccttc acagtcgaag tgtacgcctg gaaccgaaag     1080

cggatctcaa attgtgtagc cgactatagt gtcttgtata atagcgcctc tttttcaaca     1140

tttaaatgct atggagtgag tccaacgaaa cttaatgacc tctgtttcac caacgtgtac     1200

gcggatagct tcgtgatcag aggagacgag gtgagacaaa tcgcacctgg ccagacagga     1260

aagatagctg actacaacta taagctgccc gatgatttta cgggatgtgt gatagcctgg     1320

aactcaaaca acctggactc caaggtcgga ggtaattata attatctgta ccggctcttc     1380

aggaaaagta acctcaaacc tttcgagcgg gacatttcta cggagatcta ccaggctggc     1440

agcaccccct gcaatggagt agagggtttc aactgttatt ttcctttgca gagctatggc     1500

tttcagccca ccaacggagt gggttaccaa ccctacagag tggttgtact gtccttcgag     1560

cttctccacg cgcccgcaac tgtgtgcggt ccaaagaaat ctactaacct tgtgaagaac     1620

aagtgcgtca atttcaattt caacgggctt accggaactg gtgtcctgac agagtctaat     1680

aagaaattcc tgcccttcca gcaattcgga cgcgatatcg ctgacactac tgatgcagtg     1740

agggatcccc agacactcga gattctggat atcaccccat gctccttcgg aggggtatca     1800

gtaatcactc caggcacaaa cacaagtaat caggtggccg tgctttatca ggatgtaaat     1860

tgcactgaag tgcctgtggc cattcacgcc gaccaactca cccccacatg gcgagtgtac     1920

agcaccggca gtaacgtatt ccaaactcgc gcaggctgtc tgattggcgc agagcacgtg     1980

aacaacagtt atgagtgtga tattcctatt ggggccggca tatgcgcttc ttaccagacc     2040

cagacaaatt ctcctcggcg cgccagatca gtagccagtc aatctataat cgcgtatacc     2100

atgtctttgg gcgccgagaa ctccgtcgct tactccaaca acagtattgc cattccgacc     2160

aatttcacca tttcagtcac cacagaaatt cttcccgtgt caatgaccaa gactagcgta     2220

gattgcacga tgtacatttg tggcgacagt acagaatgta gtaacctcct gttgcaatat     2280

ggaagcttct gtacgcaatt gaacagagcc ttgactggta ttgctgttga gcaagacaaa     2340

aatacacagg aggtgttcgc ccaggttaag cagatttaca agactccccc catcaaggac     2400

ttcggtggtt tcaacttctc tcaaattctg ccagatccca gtaagcctag caagcggtcc     2460

ttcattgaag acctgctgtt caataaggtg acactggccg atgcgggctt tataaagcag     2520

tacggcgatt gcctgggaga tatcgccgca agagatctga tatgcgctca aaaatttaat     2580

gggttgactg tcctgccccc tctgctcacg gacgagatga tcgcacaata cactagcgcc     2640

ctcctggccg gtacgataac atctggctgg acattcggcg caggggccgc cctgcagata     2700

cccttcgcta tgcagatggc atataggttc aatggcattg gggtaactca gaacgtactg     2760

tatgagaatc agaagctgat tgcgaaccaa ttcaattctg ctatcggtaa gattcaggac     2820

tcactgagct ccaccgcatc cgcactcggc aagcttcagg atgttgtaaa tcagaacgct     2880

caggctttga ataccctggt aaaacaactc tcatcaaact tcggcgccat ctccagcgta     2940

ctcaatgaca ttctcagccg gttggacaag gtggaggcgg aagttcagat cgacagactg     3000

atcaccggcc gactccagtc actgcagacg tacgttacac aacaactcat tcgggccgct     3060

gagattcggg ccagtgccaa cctcgccgct acgaagatga gcgaatgcgt gctcgggcag     3120

tctaagaggg ttgacttttg cggcaagggc tatcatctga tgagttttcc gcaatccgct     3180

ccccatggag ttgtattcct ccatgtgaca tatgttcccg ctcaggaaaa gaactttacc     3240

acagctcctg ccatctgtca tgacggcaag gcccattttc ctcgagaagg ggttttcgtt     3300

tctaacggaa ctcactggtt cgtgacccag cgcaatttct acgaaccgca gattatcaca     3360

acagacaata cttttgtgtc aggaaactgt gatgtggtaa taggcatagt caacaataca     3420

gtttacgacc cccttcagcc tgagctggat agctttaagg aagaactgga taagtacttc     3480

aaaaaccata cctctcccga cgtagatctt ggggatatct ctggcattaa tgcttccgtt     3540

gtgaatatcc agaaagaaat agaccggctt aacgaagttg ccaaaaattt gaacgaaagc     3600

ctgatcgatc ttcaagaatt gggtaaatac gaacagtaca taaaatggcc ctggtatatc     3660

tggctgggct tcatcgctgg cttgatagct attgtcatgg ttacgattat gctctgctgt     3720

atgacaagct gctgcagctg tctcaaagga tgttgttcat gcgggtcatg ttgcaaattt     3780

gatgaagacg attcagagcc agtattgaaa ggcgtgaaac tgcactatac ctag           3834


<210>  27
<211>  1713
<212>  PRT
<213>  artificial

<220>
<223>  Fusion protein including full spike protein, UBl1-Nsp3, 
       3Ecto-Nsp3,and Nsp8

<400>  27

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr Val 
            340                 345                 350         


Glu Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp 
        355                 360                 365             


Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr 
    370                 375                 380                 


Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr 
385                 390                 395                 400 


Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro 
                405                 410                 415     


Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp 
            420                 425                 430         


Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys 
        435                 440                 445             


Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn 
    450                 455                 460                 


Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly 
465                 470                 475                 480 


Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu 
                485                 490                 495     


Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr 
            500                 505                 510         


Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val 
        515                 520                 525             


Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn 
    530                 535                 540                 


Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn 
545                 550                 555                 560 


Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr 
                565                 570                 575     


Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr 
            580                 585                 590         


Pro Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr 
        595                 600                 605             


Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val 
    610                 615                 620                 


Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr 
625                 630                 635                 640 


Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly 
                645                 650                 655     


Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala 
            660                 665                 670         


Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala 
        675                 680                 685             


Arg Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly 
    690                 695                 700                 


Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr 
705                 710                 715                 720 


Asn Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr 
                725                 730                 735     


Lys Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu 
            740                 745                 750         


Cys Ser Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn 
        755                 760                 765             


Arg Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu 
    770                 775                 780                 


Val Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp 
785                 790                 795                 800 


Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro 
                805                 810                 815     


Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu 
            820                 825                 830         


Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile 
        835                 840                 845             


Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val 
    850                 855                 860                 


Leu Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala 
865                 870                 875                 880 


Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala 
                885                 890                 895     


Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly 
            900                 905                 910         


Ile Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala 
        915                 920                 925             


Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser 
    930                 935                 940                 


Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala 
945                 950                 955                 960 


Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala 
                965                 970                 975     


Ile Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu 
            980                 985                 990         


Ala Glu Val Gln Ile Asp Arg Leu  Ile Thr Gly Arg Leu  Gln Ser Leu 
        995                 1000                 1005             


Gln Thr  Tyr Val Thr Gln Gln  Leu Ile Arg Ala Ala  Glu Ile Arg 
    1010                 1015                 1020             


Ala Ser  Ala Asn Leu Ala Ala  Thr Lys Met Ser Glu  Cys Val Leu 
    1025                 1030                 1035             


Gly Gln  Ser Lys Arg Val Asp  Phe Cys Gly Lys Gly  Tyr His Leu 
    1040                 1045                 1050             


Met Ser  Phe Pro Gln Ser Ala  Pro His Gly Val Val  Phe Leu His 
    1055                 1060                 1065             


Val Thr  Tyr Val Pro Ala Gln  Glu Lys Asn Phe Thr  Thr Ala Pro 
    1070                 1075                 1080             


Ala Ile  Cys His Asp Gly Lys  Ala His Phe Pro Arg  Glu Gly Val 
    1085                 1090                 1095             


Phe Val  Ser Asn Gly Thr His  Trp Phe Val Thr Gln  Arg Asn Phe 
    1100                 1105                 1110             


Tyr Glu  Pro Gln Ile Ile Thr  Thr Asp Asn Thr Phe  Val Ser Gly 
    1115                 1120                 1125             


Asn Cys  Asp Val Val Ile Gly  Ile Val Asn Asn Thr  Val Tyr Asp 
    1130                 1135                 1140             


Pro Leu  Gln Pro Glu Leu Asp  Ser Phe Lys Glu Glu  Leu Asp Lys 
    1145                 1150                 1155             


Tyr Phe  Lys Asn His Thr Ser  Pro Asp Val Asp Leu  Gly Asp Ile 
    1160                 1165                 1170             


Ser Gly  Ile Asn Ala Ser Val  Val Asn Ile Gln Lys  Glu Ile Asp 
    1175                 1180                 1185             


Arg Leu  Asn Glu Val Ala Lys  Asn Leu Asn Glu Ser  Leu Ile Asp 
    1190                 1195                 1200             


Leu Gln  Glu Leu Gly Lys Tyr  Glu Gln Tyr Ile Lys  Trp Pro Trp 
    1205                 1210                 1215             


Tyr Ile  Trp Leu Gly Phe Ile  Ala Gly Leu Ile Ala  Ile Val Met 
    1220                 1225                 1230             


Val Thr  Ile Met Leu Cys Cys  Met Thr Ser Cys Cys  Ser Cys Leu 
    1235                 1240                 1245             


Lys Gly  Cys Cys Ser Cys Gly  Ser Cys Cys Lys Phe  Asp Glu Asp 
    1250                 1255                 1260             


Asp Ser  Glu Pro Val Leu Lys  Gly Val Lys Leu His  Tyr Thr Gly 
    1265                 1270                 1275             


Gly Gly  Gly Ser Gly Gly Gly  Gly Ser Gly Gly Gly  Gly Ser Ala 
    1280                 1285                 1290             


Pro Thr  Lys Val Thr Phe Gly  Asp Asp Thr Val Ile  Glu Val Gln 
    1295                 1300                 1305             


Gly Tyr  Lys Ser Val Asn Ile  Thr Phe Glu Leu Asp  Glu Arg Ile 
    1310                 1315                 1320             


Asp Lys  Val Leu Asn Glu Lys  Cys Ser Ala Tyr Thr  Val Glu Leu 
    1325                 1330                 1335             


Gly Thr  Glu Val Asn Glu Phe  Ala Cys Val Val Ala  Asp Ala Val 
    1340                 1345                 1350             


Ile Lys  Thr Leu Gln Pro Val  Ser Glu Leu Leu Thr  Pro Leu Gly 
    1355                 1360                 1365             


Ile Asp  Leu Asp Glu Trp Ser  Met Ala Thr Tyr Tyr  Leu Phe Asp 
    1370                 1375                 1380             


Glu Ser  Gly Glu Phe Lys Leu  Ala Ser His Met Tyr  Cys Ser Phe 
    1385                 1390                 1395             


Tyr Pro  Pro Asp Glu Gly Gly  Gly Gly Ser Gly Gly  Gly Gly Ser 
    1400                 1405                 1410             


Gly Gly  Gly Gly Ser Ser Asn  Leu Gly Met Pro Ser  Tyr Cys Thr 
    1415                 1420                 1425             


Gly Tyr  Arg Glu Gly Tyr Leu  Asn Ser Thr Asn Val  Thr Ile Ala 
    1430                 1435                 1440             


Thr Tyr  Cys Thr Gly Ser Ile  Pro Cys Ser Val Cys  Leu Ser Gly 
    1445                 1450                 1455             


Leu Asp  Ser Leu Asp Thr Tyr  Pro Ser Leu Glu Thr  Ile Gln Ile 
    1460                 1465                 1470             


Thr Ile  Ser Ser Phe Lys Trp  Asp Leu Thr Ala Phe  Gly Leu Val 
    1475                 1480                 1485             


Ala Glu  Trp Phe Leu Ala Tyr  Ile Leu Phe Thr Arg  Gly Gly Gly 
    1490                 1495                 1500             


Gly Ser  Gly Gly Gly Gly Ser  Gly Gly Gly Gly Ser  Ala Ile Ala 
    1505                 1510                 1515             


Ser Glu  Phe Ser Ser Leu Pro  Ser Tyr Ala Ala Phe  Ala Thr Ala 
    1520                 1525                 1530             


Gln Glu  Ala Tyr Glu Gln Ala  Val Ala Asn Gly Asp  Ser Glu Val 
    1535                 1540                 1545             


Val Leu  Lys Lys Leu Lys Lys  Ser Leu Asn Val Ala  Lys Ser Glu 
    1550                 1555                 1560             


Phe Asp  Arg Asp Ala Ala Met  Gln Arg Lys Leu Glu  Lys Met Ala 
    1565                 1570                 1575             


Asp Gln  Ala Met Thr Gln Met  Tyr Lys Gln Ala Arg  Ser Glu Asp 
    1580                 1585                 1590             


Lys Arg  Ala Lys Val Thr Ser  Ala Met Gln Thr Met  Leu Phe Thr 
    1595                 1600                 1605             


Met Leu  Arg Lys Leu Asp Asn  Asp Ala Leu Asn Asn  Ile Ile Asn 
    1610                 1615                 1620             


Asn Ala  Arg Asp Gly Cys Val  Pro Leu Asn Ile Ile  Pro Leu Thr 
    1625                 1630                 1635             


Thr Ala  Ala Lys Leu Met Val  Val Ile Pro Asp Tyr  Asn Thr Tyr 
    1640                 1645                 1650             


Lys Asn  Thr Cys Asp Gly Thr  Thr Phe Thr Tyr Ala  Ser Ala Leu 
    1655                 1660                 1665             


Trp Glu  Ile Gln Gln Val Val  Asp Ala Asp Ser Lys  Ile Val Gln 
    1670                 1675                 1680             


Leu Ser  Glu Ile Ser Met Asp  Asn Ser Pro Asn Leu  Ala Trp Pro 
    1685                 1690                 1695             


Leu Ile  Val Thr Ala Leu Arg  Ala Asn Ser Ala Val  Lys Leu Gln 
    1700                 1705                 1710             


<210>  28
<211>  5142
<212>  DNA
<213>  artificial

<220>
<223>  Nucleotide sequence encoding SEQ ID NO:27

<400>  28
atgtttgttt tcctcgtgct cttgcctctt gtgagctctc agtgcgtgaa tttgaccaca       60

agaacacagc ttcctcccgc atacaccaat agtttcacca ggggcgtgta ctatcctgat      120

aaggtcttca ggagctcagt gcttcatagc acccaagatc ttttcctccc attttttagt      180

aatgtcactt ggtttcatgc aattcatgta tccggaacga acgggacgaa gcgcttcgat      240

aatcctgtgc tcccttttaa cgatggggtt tattttgcct ctacagagaa atccaatatt      300

atcagaggat ggattttcgg cactactctt gactctaaga cacagtccct gttgatcgtg      360

aacaacgcca ctaatgtggt gattaaagtg tgtgaatttc agttttgtaa cgaccctttc      420

ctgggggtct attatcataa gaataataaa agctggatgg aatccgaatt ccgggtgtat      480

agctctgcca acaactgcac ctttgaatat gtaagccagc ccttcctcat ggaccttgag      540

ggcaagcagg gaaactttaa gaatctgcgc gaatttgtgt ttaagaacat cgacggctac      600

ttcaaaatct attctaaaca tacaccgatc aatttggtga gagaccttcc acaaggattc      660

tccgccctgg agcccctggt cgatctgccc atcggaatca acatcactag gttccagact      720

ttgctggccc tgcataggtc atatctgacc cccggggatt catcttctgg atggaccgca      780

ggtgcagctg cctattatgt gggctatttg caacccagga ccttcctgct gaaatataat      840

gaaaatggta caatcactga cgctgtagac tgcgcacttg atcctctgtc cgagacaaaa      900

tgtacactga agtcattcac ggtggaaaaa ggcatttatc agacatctaa ctttcgggta      960

cagcctacag aaagtatcgt gagattccct aacattacca atctctgtcc ttttggagag     1020

gtgtttaatg ccacacgatt tgcatccttc acagtggagg tatacgcctg gaaccgaaag     1080

cggatctcca actgcgtcgc cgactattct gtcctgtata attcagcctc attcagtacc     1140

ttcaagtgtt atggcgtgtc cccaaccaaa ttgaatgacc tgtgtttcac caatgtgtac     1200

gcggattctt tcgtgattcg gggtgacgaa gtgagacaga tcgcgccagg acaaacaggg     1260

aaaatcgcgg attataatta caagctgccc gatgacttca ctggatgtgt gattgcttgg     1320

aactcaaaca atttggacag taaagttggt ggcaattata actacctgta tagactcttt     1380

aggaagtcaa acctgaagcc ttttgaacgc gatataagca ctgagatcta tcaggcaggg     1440

tctactccgt gcaatggggt agaggggttc aattgctatt ttcctctcca atcatacggc     1500

tttcagccga ctaatggcgt gggctatcag ccttacaggg tagtcgtcct gagctttgaa     1560

ctgctccatg caccagccac agtatgtggg ccgaagaaat ccacgaacct cgtgaagaat     1620

aaatgcgtta acttcaactt caatggcctc acaggcacag gagttctgac tgagtctaac     1680

aagaagttcc tcccattcca acagttcggc cgcgacattg ctgatacaac tgacgctgtc     1740

agagatcctc agacccttga gatactggat ataacaccat gctcattcgg gggcgtcagc     1800

gtgatcacac ccggcaccaa tacaagtaac caggtcgcgg tgctgtatca ggatgtgaac     1860

tgtactgagg tccctgtcgc cattcatgcc gatcagctga ctcctacatg gagggtgtac     1920

tctacgggat ctaatgtctt tcaaacacgc gctggctgtc ttataggggc cgaacatgtt     1980

aataactcat acgaatgcga catacctatc ggcgccggaa tttgcgcctc atatcaaaca     2040

caaaccaata gcccccgccg cgcgaggagc gttgctagtc aaagcatcat tgcctacact     2100

atgtcactcg gggcagagaa ttccgttgcc tacagcaaca atagtattgc aattccaact     2160

aatttcacca ttagcgtgac taccgaaatc ttgcctgtta gcatgaccaa gacctctgtg     2220

gattgtacaa tgtatatatg cggggacagc actgaatgtt caaatttgct gttgcaatat     2280

gggtcattct gcactcaact caacagggca ctgaccggga ttgctgtgga gcaggacaag     2340

aacacccaag aggtattcgc tcaggtaaaa caaatttaca aaaccccacc tattaaggat     2400

ttcggggggt tcaatttttc tcagatcctc cccgacccca gtaaaccctc aaagcggagc     2460

ttcattgaag acctgctctt taacaaggta actctggcgg atgccggctt tattaagcag     2520

tacggagatt gtctgggtga tatcgccgct cgagacctca tttgcgctca aaaatttaat     2580

ggtcttacag tactgccacc actgctcact gacgagatga tagcccagta cacatctgcc     2640

ctccttgcgg gcactatcac aagcggctgg acgttcggcg caggagcggc gctgcaaatt     2700

ccattcgcaa tgcagatggc ctatagattt aatggcattg gcgttacaca aaatgttttg     2760

tatgagaacc agaagctgat tgccaaccaa ttcaatagcg ctatcggtaa aatccaggac     2820

agcctgtcct caacagcatc agccctgggg aagttgcagg acgtggtaaa tcagaacgca     2880

caagccctca atacccttgt caagcagctg agtagtaact ttggggcaat ttcctccgtg     2940

ctgaacgaca tcttgtcacg actggacaag gttgaggccg aagttcagat tgaccggctc     3000

attaccggga gactccagtc cttgcaaacg tacgtcaccc agcagttgat cagggctgcg     3060

gagatcagag ccagtgctaa cctggcggca accaagatga gcgagtgtgt tctcggccaa     3120

tccaagcggg tcgatttctg cggcaaagga taccacctga tgtctttccc gcaaagcgcg     3180

ccccatgggg tggtctttct tcatgtgacg tatgttccgg ctcaagagaa gaattttaca     3240

accgcccccg cgatttgtca cgacggtaaa gcccacttcc caagagaggg agttttcgtg     3300

tctaatggaa cccattggtt tgtgactcag cgcaatttct acgaacccca gatcataacg     3360

acagataaca cgttcgtgtc tggtaattgc gatgtagtga ttggcatagt caacaatacc     3420

gtatatgacc ctcttcagcc cgagctcgat agctttaaag aagaactgga taaatatttc     3480

aagaatcaca ctagccctga cgtcgacctc ggtgacatct ccggaattaa cgcttccgtg     3540

gtaaatattc agaaggaaat tgaccggctc aacgaggtgg ctaaaaacct gaacgagagt     3600

ctgattgatc tccaagaact ggggaaatat gagcaataca tcaagtggcc ctggtatatt     3660

tggctgggct tcattgcagg cctgatcgca atagtgatgg tgaccattat gctgtgctgc     3720

atgacgagct gttgttcatg tcttaagggt tgttgctctt gtggatcttg ctgtaagttt     3780

gacgaggacg attccgagcc agttctgaag ggagtgaaac tccactatac cggaggcggt     3840

ggcagcggag gaggaggctc aggcggtggc gggtcagctc ctacaaaggt taccttcggt     3900

gatgacacgg tcatcgaggt ccaggggtat aagtccgtca acattacgtt tgaactcgac     3960

gagcggatcg ataaagtgct gaacgaaaaa tgcagcgctt acactgtcga actggggaca     4020

gaggtgaatg aatttgcttg tgtggttgct gatgccgtca tcaagaccct gcaacctgtg     4080

tccgagcttt tgactccttt gggaatcgat cttgatgagt ggagcatggc aacgtactat     4140

ctctttgacg agagcgggga gttcaaactg gccagtcaca tgtactgctc cttttaccct     4200

ccggacgaag gtggcggagg ctcaggggga ggtggtagcg gtgggggtgg gagctccaat     4260

ctgggaatgc cgagttactg cactgggtat agggaaggtt accttaacag cactaatgtt     4320

accatcgcta cgtattgcac aggcagtatc ccttgttcag tgtgtcttag cggtctggat     4380

tccctcgata catacccctc tctcgaaaca atacaaatca cgatctcatc ttttaagtgg     4440

gacctcaccg cgtttggcct ggttgctgag tggttcctgg cgtatatcct gttcactaga     4500

ggcggaggag gatccggagg tggaggttct ggaggcgggg gatctgccat agcttccgaa     4560

ttctcatcct tgcccagcta tgccgcattt gccactgctc aggaggcgta cgagcaggcc     4620

gtggccaacg gtgattccga ggtggttctt aagaagctga agaagagtct taatgttgct     4680

aagtccgagt ttgatcgcga tgccgcaatg cagaggaaac tggaaaagat ggcagaccag     4740

gctatgactc agatgtataa acaggcccgc agtgaggaca aacgggctaa agtaaccagc     4800

gcaatgcaga caatgttgtt tacgatgctg aggaaattgg acaatgacgc tctcaataac     4860

attattaata acgctaggga cggatgtgtg cccctgaaca tcatccccct tactacggcg     4920

gccaaattga tggtggtgat tcctgattat aacacataca agaacacttg cgacggtaca     4980

acttttactt atgctagtgc actgtgggag atccaacaag tggtggatgc tgattccaaa     5040

atagtacaat tgtctgagat cagtatggat aattccccaa accttgcctg gcctttgata     5100

gtgactgccc tgcgagccaa cagtgccgtt aagctgcaat ga                        5142


<210>  29
<211>  1569
<212>  PRT
<213>  artificial

<220>
<223>  Fusion protein including S1, M, N, and Nsp8

<400>  29

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Phe Thr Val 
            340                 345                 350         


Glu Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp 
        355                 360                 365             


Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr 
    370                 375                 380                 


Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr 
385                 390                 395                 400 


Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro 
                405                 410                 415     


Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp 
            420                 425                 430         


Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys 
        435                 440                 445             


Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn 
    450                 455                 460                 


Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly 
465                 470                 475                 480 


Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu 
                485                 490                 495     


Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr 
            500                 505                 510         


Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val 
        515                 520                 525             


Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn 
    530                 535                 540                 


Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn 
545                 550                 555                 560 


Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr 
                565                 570                 575     


Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr 
            580                 585                 590         


Pro Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr 
        595                 600                 605             


Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val 
    610                 615                 620                 


Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr 
625                 630                 635                 640 


Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly 
                645                 650                 655     


Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala 
            660                 665                 670         


Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala 
        675                 680                 685             


Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
    690                 695                 700                 


Met Ala Asp Ser Asn Gly Thr Ile Thr Val Glu Glu Leu Lys Lys Leu 
705                 710                 715                 720 


Leu Glu Gln Trp Asn Leu Val Ile Gly Phe Leu Phe Leu Thr Trp Ile 
                725                 730                 735     


Cys Leu Leu Gln Phe Ala Tyr Ala Asn Arg Asn Arg Phe Leu Tyr Ile 
            740                 745                 750         


Ile Lys Leu Ile Phe Leu Trp Leu Leu Trp Pro Val Thr Leu Ala Cys 
        755                 760                 765             


Phe Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Ile Thr Gly Gly Ile 
    770                 775                 780                 


Ala Ile Ala Met Ala Cys Leu Val Gly Leu Met Trp Leu Ser Tyr Phe 
785                 790                 795                 800 


Ile Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met Trp Ser Phe 
                805                 810                 815     


Asn Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu His Gly Thr Ile 
            820                 825                 830         


Leu Thr Arg Pro Leu Leu Glu Ser Glu Leu Val Ile Gly Ala Val Ile 
        835                 840                 845             


Leu Arg Gly His Leu Arg Ile Ala Gly His His Leu Gly Arg Cys Asp 
    850                 855                 860                 


Ile Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser Arg Thr Leu 
865                 870                 875                 880 


Ser Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Ala Gly Asp Ser Gly 
                885                 890                 895     


Phe Ala Ala Tyr Ser Arg Tyr Arg Ile Gly Asn Tyr Lys Leu Asn Thr 
            900                 905                 910         


Asp His Ser Ser Ser Ser Asp Asn Ile Ala Gly Gly Gly Gly Ser Gly 
        915                 920                 925             


Gly Gly Gly Ser Gly Gly Gly Gly Ser Met Ser Asp Asn Gly Pro Gln 
    930                 935                 940                 


Asn Gln Arg Asn Ala Pro Arg Ile Thr Phe Gly Gly Pro Ser Asp Ser 
945                 950                 955                 960 


Thr Gly Ser Asn Gln Asn Gly Glu Arg Ser Gly Ala Arg Ser Lys Gln 
                965                 970                 975     


Arg Arg Pro Gln Gly Leu Pro Asn Asn Thr Ala Ser Trp Phe Thr Ala 
            980                 985                 990         


Leu Thr Gln His Gly Lys Glu Asp  Leu Lys Phe Pro Arg  Gly Gln Gly 
        995                 1000                 1005             


Val Pro  Ile Asn Thr Asn Ser  Ser Pro Asp Asp Gln  Ile Gly Tyr 
    1010                 1015                 1020             


Tyr Arg  Arg Ala Thr Arg Arg  Ile Arg Gly Gly Asp  Gly Lys Met 
    1025                 1030                 1035             


Lys Asp  Leu Ser Pro Arg Trp  Tyr Phe Tyr Tyr Leu  Gly Thr Gly 
    1040                 1045                 1050             


Pro Glu  Ala Gly Leu Pro Tyr  Gly Ala Asn Lys Asp  Gly Ile Ile 
    1055                 1060                 1065             


Trp Val  Ala Thr Glu Gly Ala  Leu Asn Thr Pro Lys  Asp His Ile 
    1070                 1075                 1080             


Gly Thr  Arg Asn Pro Ala Asn  Asn Ala Ala Ile Val  Leu Gln Leu 
    1085                 1090                 1095             


Pro Gln  Gly Thr Thr Leu Pro  Lys Gly Phe Tyr Ala  Glu Gly Ser 
    1100                 1105                 1110             


Arg Gly  Gly Ser Gln Ala Ser  Ser Arg Ser Ser Ser  Arg Ser Arg 
    1115                 1120                 1125             


Asn Ser  Ser Arg Asn Ser Thr  Pro Gly Ser Ser Arg  Gly Thr Ser 
    1130                 1135                 1140             


Pro Ala  Arg Met Ala Gly Asn  Gly Gly Asp Ala Ala  Leu Ala Leu 
    1145                 1150                 1155             


Leu Leu  Leu Asp Arg Leu Asn  Gln Leu Glu Ser Lys  Met Ser Gly 
    1160                 1165                 1170             


Lys Gly  Gln Gln Gln Gln Gly  Gln Thr Val Thr Lys  Lys Ser Ala 
    1175                 1180                 1185             


Ala Glu  Ala Ser Lys Lys Pro  Arg Gln Lys Arg Thr  Ala Thr Lys 
    1190                 1195                 1200             


Ala Tyr  Asn Val Thr Gln Ala  Phe Gly Arg Arg Gly  Pro Glu Gln 
    1205                 1210                 1215             


Thr Gln  Gly Asn Phe Gly Asp  Gln Glu Leu Ile Arg  Gln Gly Thr 
    1220                 1225                 1230             


Asp Tyr  Lys His Trp Pro Gln  Ile Ala Gln Phe Ala  Pro Ser Ala 
    1235                 1240                 1245             


Ser Ala  Phe Phe Gly Met Ser  Arg Ile Gly Met Glu  Val Thr Pro 
    1250                 1255                 1260             


Ser Gly  Thr Trp Leu Thr Tyr  Thr Gly Ala Ile Lys  Leu Asp Asp 
    1265                 1270                 1275             


Lys Asp  Pro Asn Phe Lys Asp  Gln Val Ile Leu Leu  Asn Lys His 
    1280                 1285                 1290             


Ile Asp  Ala Tyr Lys Thr Phe  Pro Pro Thr Glu Pro  Lys Lys Asp 
    1295                 1300                 1305             


Lys Lys  Lys Lys Ala Asp Glu  Thr Gln Ala Leu Pro  Gln Arg Gln 
    1310                 1315                 1320             


Lys Lys  Gln Gln Thr Val Thr  Leu Leu Pro Ala Ala  Asp Leu Asp 
    1325                 1330                 1335             


Asp Phe  Ser Lys Gln Leu Gln  Gln Ser Met Ser Ser  Ala Asp Ser 
    1340                 1345                 1350             


Thr Gln  Ala Gly Gly Gly Gly  Ser Gly Gly Gly Gly  Ser Gly Gly 
    1355                 1360                 1365             


Gly Gly  Ser Ala Ile Ala Ser  Glu Phe Ser Ser Leu  Pro Ser Tyr 
    1370                 1375                 1380             


Ala Ala  Phe Ala Thr Ala Gln  Glu Ala Tyr Glu Gln  Ala Val Ala 
    1385                 1390                 1395             


Asn Gly  Asp Ser Glu Val Val  Leu Lys Lys Leu Lys  Lys Ser Leu 
    1400                 1405                 1410             


Asn Val  Ala Lys Ser Glu Phe  Asp Arg Asp Ala Ala  Met Gln Arg 
    1415                 1420                 1425             


Lys Leu  Glu Lys Met Ala Asp  Gln Ala Met Thr Gln  Met Tyr Lys 
    1430                 1435                 1440             


Gln Ala  Arg Ser Glu Asp Lys  Arg Ala Lys Val Thr  Ser Ala Met 
    1445                 1450                 1455             


Gln Thr  Met Leu Phe Thr Met  Leu Arg Lys Leu Asp  Asn Asp Ala 
    1460                 1465                 1470             


Leu Asn  Asn Ile Ile Asn Asn  Ala Arg Asp Gly Cys  Val Pro Leu 
    1475                 1480                 1485             


Asn Ile  Ile Pro Leu Thr Thr  Ala Ala Lys Leu Met  Val Val Ile 
    1490                 1495                 1500             


Pro Asp  Tyr Asn Thr Tyr Lys  Asn Thr Cys Asp Gly  Thr Thr Phe 
    1505                 1510                 1515             


Thr Tyr  Ala Ser Ala Leu Trp  Glu Ile Gln Gln Val  Val Asp Ala 
    1520                 1525                 1530             


Asp Ser  Lys Ile Val Gln Leu  Ser Glu Ile Ser Met  Asp Asn Ser 
    1535                 1540                 1545             


Pro Asn  Leu Ala Trp Pro Leu  Ile Val Thr Ala Leu  Arg Ala Asn 
    1550                 1555                 1560             


Ser Ala  Val Lys Leu Gln 
    1565                 


<210>  30
<211>  4710
<212>  DNA
<213>  artificial

<220>
<223>  Nucleotide sequence (SEQ ID NO:30) encoding SEQ ID NO:29

<400>  30
atgttcgtgt ttcttgtgct gctgccactg gtctcttccc agtgcgttaa cctcactacg       60

agaacacaac tccctcctgc ttatacgaat agtttcacgc gcggtgttta ttaccccgac      120

aaggtgttcc ggtcatccgt gcttcactca acgcaagacc ttttcttgcc ctttttctcc      180

aacgtgactt ggttccacgc catccatgtt agcggcacaa atgggaccaa gcgattcgac      240

aatcccgtac tgccctttaa cgatggcgtg tactttgctt ccactgagaa gtccaacatc      300

attcgcggat ggatattcgg caccaccctc gacagcaaaa cacagtctct gctgatcgtt      360

aacaatgcaa ccaatgtggt catcaaggtc tgcgagtttc agttctgcaa cgaccctttc      420

ctcggcgtct attatcataa aaataacaaa agttggatgg aatcagagtt cagagtgtac      480

tcctcagcta ataactgcac gtttgagtac gtctcacaac cttttctcat ggacctcgag      540

ggtaaacagg ggaatttcaa gaatctgcga gagttcgtct tcaaaaatat cgatggatac      600

ttcaaaatct atagtaaaca cacaccaatc aacctggtca gggacttgcc ccagggtttt      660

tccgcactgg agcctctcgt ggatcttccc atcgggatca atatcacccg gttccagact      720

ctcctggctc ttcatcgcag ctatctgact cctggggata gcagtagcgg ctggaccgct      780

ggcgccgctg cctactacgt cgggtatctc cagcctcgaa cctttttgct gaaatacaat      840

gagaatggga ctatcactga tgcagttgac tgcgccctgg accccctgtc tgagactaag      900

tgtaccctga aaagcttcac tgttgaaaag gggatatatc aaacatccaa cttccgggta      960

cagccaactg aaagcatcgt taggtttccc aatattacaa acctgtgccc ttttggggaa     1020

gttttcaacg ctactagatt cgctagtttt accgtggaag tttatgcttg gaaccggaaa     1080

aggatttcaa attgtgttgc agattatagc gtcctgtata atagtgccag cttctctacg     1140

tttaagtgtt acggcgtgag ccccaccaag ctgaatgacc tgtgtttcac taatgtgtat     1200

gcagacagct ttgtcattag aggggatgaa gtaagacaga tcgcccccgg ccagaccggg     1260

aaaattgccg actataatta caagctgccc gacgacttta ccggctgtgt tatcgcttgg     1320

aactcaaata acctcgattc caaggtagga gggaactata actatttgta caggctcttc     1380

agaaaaagca acttgaagcc cttcgagagg gatatcagca ccgagattta tcaagctgga     1440

tcaacacctt gtaatggcgt ggagggattc aactgctact tcccactgca aagctacggc     1500

tttcagccaa caaatggggt cgggtatcag ccttacagag tagtggtcct cagttttgaa     1560

ctgctccacg ctcctgcgac agtatgtggc cctaagaagt ctactaacct cgtgaaaaac     1620

aaatgtgtta atttcaattt caatggactg acaggcacag gcgtgctcac agaaagtaat     1680

aaaaagttcc tgccctttca gcagtttggc cgagacatcg ctgataccac cgacgctgtg     1740

agagatcccc agacgctcga gattctggat atcacccctt gcagctttgg cggggtgtca     1800

gtcatcaccc ccggcactaa tacatccaat caggtggccg tactctatca ggacgtcaac     1860

tgcacggagg tcccggttgc catccacgcc gaccaattga ctcctacgtg gcgagtgtat     1920

tccactggaa gcaacgtatt tcaaacacga gctggttgtc tgatcggagc cgaacacgtg     1980

aataactctt acgaatgcga catcccaata ggcgctggca tctgcgcatc ttatcagact     2040

cagacaaaca gtcccagaag agcaaggggg ggtggtggtt ctggcggagg gggttcaggg     2100

gggggaggct ccatggccga ttccaatggt actattaccg tagaggaact caagaaactg     2160

ctcgaacagt ggaatctcgt gatcggattt ctctttctca cctggatatg cttgcttcaa     2220

ttcgcctacg ccaatcgcaa ccgatttctc tacatcatca agttgatctt tctttggctt     2280

ctgtggcccg tgacactggc atgctttgtg ctggccgctg tttatcggat caattggata     2340

acggggggaa tcgccattgc aatggcctgt ctcgtgggac tcatgtggtt gtcctacttc     2400

atcgcaagtt ttaggctgtt tgctaggacc cgcagcatgt ggagttttaa ccccgaaacc     2460

aacattctgc tcaacgtgcc ccttcacggt acaatcttga caaggcctct gctcgaatca     2520

gaattggtga tcggcgccgt gatcctgcgc ggacacctca gaatcgcagg acatcatttg     2580

ggcaggtgtg acatcaagga tctccccaaa gaaataactg tggccacttc tcgcacactg     2640

agttactaca agctgggtgc cagtcagcgc gtcgcgggcg actctggctt cgctgcctac     2700

tctcgctacc ggattggcaa ttacaaattg aacaccgacc actcatctag ctccgacaac     2760

attgcaggtg gcggcggaag tggcggtgga ggctctggcg gaggaggctc aatgtcagat     2820

aatggcccac agaaccagcg gaacgcccca agaatcacct ttggtggtcc atctgatagc     2880

accggcagca accagaacgg ggagcgcagc ggagcaaggt caaaacagcg caggccccag     2940

ggactgccga acaacacagc ctcatggttc acagccctca cacagcatgg taaagaagat     3000

ctcaaattcc cacgaggtca gggcgtgcca attaatacca atagttctcc tgacgatcaa     3060

atcggatatt acagaagagc cacccggcgg atccgcggtg gtgatgggaa aatgaaggac     3120

ctcagcccaa gatggtattt ttattacttg gggactggcc cagaggcagg actcccttac     3180

ggggctaata aagacggaat tatatgggtg gcaaccgaag gagctcttaa cacacccaaa     3240

gatcatattg gcaccagaaa ccctgccaac aacgcagcaa ttgtgctgca actgccgcag     3300

ggaaccacac tgccaaaagg attttatgcc gaaggatcac gcggcggatc acaagcctcc     3360

agtcggtcct ccagtcggtc acgcaactcc agccgaaatt ctaccccggg ctctagcagg     3420

gggacttcac ccgccagaat ggccggcaat ggcggagatg cagcactcgc acttttgctg     3480

ctggataggc tgaaccagct ggaaagtaaa atgtctggca agggccagca gcagcaagga     3540

caaaccgtga ccaagaagag cgccgccgag gcctctaaga agccaagaca aaaacgcaca     3600

gccacgaagg cctataacgt gacacaggca ttcggtcgcc gcggccctga gcagacgcaa     3660

ggcaattttg gtgatcagga gttgattaga caagggactg actacaagca ttggccccaa     3720

attgcccagt ttgctccttc agcctctgct tttttcggga tgtccagaat aggaatggaa     3780

gtgactccca gtggcacttg gcttacatac acaggggcta tcaagctgga cgataaggac     3840

ccgaatttca aggatcaggt tatactcctt aacaaacata tcgacgccta taagacgttt     3900

ccacctactg aacctaagaa agacaagaag aagaaggctg atgagactca agcacttcct     3960

cagcgccaga agaaacagca gactgttaca ctgctgcctg ccgccgatct cgacgacttc     4020

agcaagcaac tgcagcaaag catgagttca gccgacagta cccaggccgg aggaggcgga     4080

tctgggggcg gtgggtcagg ggggggggga tctgctatcg cgagcgaatt ttcatctttg     4140

ccgtcctacg ctgcatttgc cactgcgcag gaggcctatg agcaggcggt ggcgaatggc     4200

gactctgaag tggtgcttaa aaagttgaag aaatccctca acgtagccaa atcagaattc     4260

gaccgagatg ctgccatgca gcgcaagctt gaaaagatgg ccgaccaggc aatgacacag     4320

atgtacaagc aggcgagatc cgaggataaa cgagctaagg tgacgtccgc gatgcagaca     4380

atgctgttca ctatgctgcg caaactggat aacgatgctc tgaacaacat cattaataac     4440

gccagagatg gatgtgttcc actgaatatc atacctttga caactgccgc taagttgatg     4500

gtagtcatcc ccgattacaa tacttacaaa aacacttgcg acgggaccac gttcacttac     4560

gcttccgccc tttgggagat ccagcaggtc gtggatgccg attcaaagat tgtgcaactc     4620

tccgaaattt caatggataa ctcacccaac ctcgcgtggc ccctgatcgt gaccgcactg     4680

cgagctaatt ccgctgttaa acttcaatga                                      4710


<210>  31
<211>  1088
<212>  PRT
<213>  artificial

<220>
<223>  Fusion protein including M, N, Ubl1-Nsp3, 3Ecto-Nsp3, Nsp8

<400>  31

Met Ala Asp Ser Asn Gly Thr Ile Thr Val Glu Glu Leu Lys Lys Leu 
1               5                   10                  15      


Leu Glu Gln Trp Asn Leu Val Ile Gly Phe Leu Phe Leu Thr Trp Ile 
            20                  25                  30          


Cys Leu Leu Gln Phe Ala Tyr Ala Asn Arg Asn Arg Phe Leu Tyr Ile 
        35                  40                  45              


Ile Lys Leu Ile Phe Leu Trp Leu Leu Trp Pro Val Thr Leu Ala Cys 
    50                  55                  60                  


Phe Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Ile Thr Gly Gly Ile 
65                  70                  75                  80  


Ala Ile Ala Met Ala Cys Leu Val Gly Leu Met Trp Leu Ser Tyr Phe 
                85                  90                  95      


Ile Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met Trp Ser Phe 
            100                 105                 110         


Asn Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu His Gly Thr Ile 
        115                 120                 125             


Leu Thr Arg Pro Leu Leu Glu Ser Glu Leu Val Ile Gly Ala Val Ile 
    130                 135                 140                 


Leu Arg Gly His Leu Arg Ile Ala Gly His His Leu Gly Arg Cys Asp 
145                 150                 155                 160 


Ile Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser Arg Thr Leu 
                165                 170                 175     


Ser Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Ala Gly Asp Ser Gly 
            180                 185                 190         


Phe Ala Ala Tyr Ser Arg Tyr Arg Ile Gly Asn Tyr Lys Leu Asn Thr 
        195                 200                 205             


Asp His Ser Ser Ser Ser Asp Asn Ile Ala Gly Gly Gly Gly Ser Gly 
    210                 215                 220                 


Gly Gly Gly Ser Gly Gly Gly Gly Ser Met Ser Asp Asn Gly Pro Gln 
225                 230                 235                 240 


Asn Gln Arg Asn Ala Pro Arg Ile Thr Phe Gly Gly Pro Ser Asp Ser 
                245                 250                 255     


Thr Gly Ser Asn Gln Asn Gly Glu Arg Ser Gly Ala Arg Ser Lys Gln 
            260                 265                 270         


Arg Arg Pro Gln Gly Leu Pro Asn Asn Thr Ala Ser Trp Phe Thr Ala 
        275                 280                 285             


Leu Thr Gln His Gly Lys Glu Asp Leu Lys Phe Pro Arg Gly Gln Gly 
    290                 295                 300                 


Val Pro Ile Asn Thr Asn Ser Ser Pro Asp Asp Gln Ile Gly Tyr Tyr 
305                 310                 315                 320 


Arg Arg Ala Thr Arg Arg Ile Arg Gly Gly Asp Gly Lys Met Lys Asp 
                325                 330                 335     


Leu Ser Pro Arg Trp Tyr Phe Tyr Tyr Leu Gly Thr Gly Pro Glu Ala 
            340                 345                 350         


Gly Leu Pro Tyr Gly Ala Asn Lys Asp Gly Ile Ile Trp Val Ala Thr 
        355                 360                 365             


Glu Gly Ala Leu Asn Thr Pro Lys Asp His Ile Gly Thr Arg Asn Pro 
    370                 375                 380                 


Ala Asn Asn Ala Ala Ile Val Leu Gln Leu Pro Gln Gly Thr Thr Leu 
385                 390                 395                 400 


Pro Lys Gly Phe Tyr Ala Glu Gly Ser Arg Gly Gly Ser Gln Ala Ser 
                405                 410                 415     


Ser Arg Ser Ser Ser Arg Ser Arg Asn Ser Ser Arg Asn Ser Thr Pro 
            420                 425                 430         


Gly Ser Ser Arg Gly Thr Ser Pro Ala Arg Met Ala Gly Asn Gly Gly 
        435                 440                 445             


Asp Ala Ala Leu Ala Leu Leu Leu Leu Asp Arg Leu Asn Gln Leu Glu 
    450                 455                 460                 


Ser Lys Met Ser Gly Lys Gly Gln Gln Gln Gln Gly Gln Thr Val Thr 
465                 470                 475                 480 


Lys Lys Ser Ala Ala Glu Ala Ser Lys Lys Pro Arg Gln Lys Arg Thr 
                485                 490                 495     


Ala Thr Lys Ala Tyr Asn Val Thr Gln Ala Phe Gly Arg Arg Gly Pro 
            500                 505                 510         


Glu Gln Thr Gln Gly Asn Phe Gly Asp Gln Glu Leu Ile Arg Gln Gly 
        515                 520                 525             


Thr Asp Tyr Lys His Trp Pro Gln Ile Ala Gln Phe Ala Pro Ser Ala 
    530                 535                 540                 


Ser Ala Phe Phe Gly Met Ser Arg Ile Gly Met Glu Val Thr Pro Ser 
545                 550                 555                 560 


Gly Thr Trp Leu Thr Tyr Thr Gly Ala Ile Lys Leu Asp Asp Lys Asp 
                565                 570                 575     


Pro Asn Phe Lys Asp Gln Val Ile Leu Leu Asn Lys His Ile Asp Ala 
            580                 585                 590         


Tyr Lys Thr Phe Pro Pro Thr Glu Pro Lys Lys Asp Lys Lys Lys Lys 
        595                 600                 605             


Ala Asp Glu Thr Gln Ala Leu Pro Gln Arg Gln Lys Lys Gln Gln Thr 
    610                 615                 620                 


Val Thr Leu Leu Pro Ala Ala Asp Leu Asp Asp Phe Ser Lys Gln Leu 
625                 630                 635                 640 


Gln Gln Ser Met Ser Ser Ala Asp Ser Thr Gln Ala Gly Gly Gly Gly 
                645                 650                 655     


Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Pro Thr Lys Val 
            660                 665                 670         


Thr Phe Gly Asp Asp Thr Val Ile Glu Val Gln Gly Tyr Lys Ser Val 
        675                 680                 685             


Asn Ile Thr Phe Glu Leu Asp Glu Arg Ile Asp Lys Val Leu Asn Glu 
    690                 695                 700                 


Lys Cys Ser Ala Tyr Thr Val Glu Leu Gly Thr Glu Val Asn Glu Phe 
705                 710                 715                 720 


Ala Cys Val Val Ala Asp Ala Val Ile Lys Thr Leu Gln Pro Val Ser 
                725                 730                 735     


Glu Leu Leu Thr Pro Leu Gly Ile Asp Leu Asp Glu Trp Ser Met Ala 
            740                 745                 750         


Thr Tyr Tyr Leu Phe Asp Glu Ser Gly Glu Phe Lys Leu Ala Ser His 
        755                 760                 765             


Met Tyr Cys Ser Phe Tyr Pro Pro Asp Glu Gly Gly Gly Gly Ser Gly 
    770                 775                 780                 


Gly Gly Gly Ser Gly Gly Gly Gly Ser Ser Asn Leu Gly Met Pro Ser 
785                 790                 795                 800 


Tyr Cys Thr Gly Tyr Arg Glu Gly Tyr Leu Asn Ser Thr Asn Val Thr 
                805                 810                 815     


Ile Ala Thr Tyr Cys Thr Gly Ser Ile Pro Cys Ser Val Cys Leu Ser 
            820                 825                 830         


Gly Leu Asp Ser Leu Asp Thr Tyr Pro Ser Leu Glu Thr Ile Gln Ile 
        835                 840                 845             


Thr Ile Ser Ser Phe Lys Trp Asp Leu Thr Ala Phe Gly Leu Val Ala 
    850                 855                 860                 


Glu Trp Phe Leu Ala Tyr Ile Leu Phe Thr Arg Gly Gly Gly Gly Ser 
865                 870                 875                 880 


Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Ile Ala Ser Glu Phe 
                885                 890                 895     


Ser Ser Leu Pro Ser Tyr Ala Ala Phe Ala Thr Ala Gln Glu Ala Tyr 
            900                 905                 910         


Glu Gln Ala Val Ala Asn Gly Asp Ser Glu Val Val Leu Lys Lys Leu 
        915                 920                 925             


Lys Lys Ser Leu Asn Val Ala Lys Ser Glu Phe Asp Arg Asp Ala Ala 
    930                 935                 940                 


Met Gln Arg Lys Leu Glu Lys Met Ala Asp Gln Ala Met Thr Gln Met 
945                 950                 955                 960 


Tyr Lys Gln Ala Arg Ser Glu Asp Lys Arg Ala Lys Val Thr Ser Ala 
                965                 970                 975     


Met Gln Thr Met Leu Phe Thr Met Leu Arg Lys Leu Asp Asn Asp Ala 
            980                 985                 990         


Leu Asn Asn Ile Ile Asn Asn Ala  Arg Asp Gly Cys Val  Pro Leu Asn 
        995                 1000                 1005             


Ile Ile  Pro Leu Thr Thr Ala  Ala Lys Leu Met Val  Val Ile Pro 
    1010                 1015                 1020             


Asp Tyr  Asn Thr Tyr Lys Asn  Thr Cys Asp Gly Thr  Thr Phe Thr 
    1025                 1030                 1035             


Tyr Ala  Ser Ala Leu Trp Glu  Ile Gln Gln Val Val  Asp Ala Asp 
    1040                 1045                 1050             


Ser Lys  Ile Val Gln Leu Ser  Glu Ile Ser Met Asp  Asn Ser Pro 
    1055                 1060                 1065             


Asn Leu  Ala Trp Pro Leu Ile  Val Thr Ala Leu Arg  Ala Asn Ser 
    1070                 1075                 1080             


Ala Val  Lys Leu Gln 
    1085             


<210>  32
<211>  3267
<212>  DNA
<213>  artificial

<220>
<223>  Nucleotide sequence encoding SEQ ID NO:31

<400>  32
atggctgata gcaacggcac catcacagtt gaagagctca agaagcttct ggaacagtgg       60

aacctggtca ttggcttttt gtttctgaca tggatttgcc tgctccagtt tgcctacgct      120

aaccggaaca gattcctgta cattatcaaa ctgatcttcc tgtggcttct ttggcccgtg      180

acccttgcat gcttcgtgct cgccgccgtg tacaggatca actggataac cggaggaatc      240

gctatcgcta tggcttgcct cgttgggttg atgtggctgt cctacttcat cgcttctttc      300

cgcctcttcg cacgcacaag atccatgtgg tcatttaacc ctgaaactaa catcctgctt      360

aatgtgcctc tccatggcac tatcctcacc cgcccactgc tggagtcaga gctcgtgatt      420

ggggcggtta tcttgcgcgg tcatctgagg atagctgggc accatctggg gcggtgtgac      480

ataaaggatc tgcccaaaga gatcacggtt gcaacaagta gaactctgag ctattacaaa      540

ctcggagctt cacaaagggt ggccggggac tccggctttg ccgcctattc acggtacaga      600

atcgggaact acaaactcaa tacagatcac tccagttcct ctgataacat tgccggcggt      660

gggggcagtg gcggcggcgg gtcaggcggg gggggcagca tgagcgacaa cggaccccag      720

aatcagagaa acgctcctcg aatcacattt ggtggaccta gcgattccac tgggagcaat      780

cagaatggtg agaggtccgg cgcccgcagc aagcagcggc ggccccaggg cctgccaaac      840

aacacagcta gttggtttac cgctcttacc cagcacggaa aggaggattt gaagtttccc      900

aggggacagg gggtcccaat aaacaccaac agctcacctg atgatcagat tggctattac      960

cggagggcca cccggcgcat ccgcggaggc gatgggaaga tgaaagacct gtctccacgg     1020

tggtacttct attatctggg aactggaccc gaggcagggc tgccctacgg tgcgaataag     1080

gatggcatta tttgggttgc aacagaaggt gcactgaata cgcctaagga ccacatcggt     1140

acaagaaatc cagcaaacaa cgctgccatt gtgcttcaac tcccacaggg cacgactctg     1200

cctaagggct tttacgcaga gggaagccgc ggtggcagcc aagcttccag cagatcctct     1260

agtaggtccc ggaactcttc tcggaactca acccctggct ccagtcgcgg gacaagtcca     1320

gctagaatgg ccggcaatgg gggggatgca gcactcgcac tcctgttgct cgacagattg     1380

aaccaactgg agtctaaaat gtctggtaaa ggacagcaac agcagggcca gaccgtaaca     1440

aagaaatctg ccgctgaggc gtctaaaaag ccccgccaga agaggaccgc caccaaggca     1500

tataatgtta ctcaggcatt tggtagacgc ggacctgaac agactcaagg aaacttcgga     1560

gaccaggagc tgatacgcca gggaacggac tacaagcact ggccgcagat agcacagttc     1620

gcgccaagcg caagcgcctt ttttggcatg tcccgcatcg gaatggaagt aacaccttct     1680

ggaacttggt tgacctatac cggagccata aaattggacg ataaggatcc taacttcaaa     1740

gaccaagtga tcttgctcaa caaacatatt gacgcctata aaacctttcc cccgacagag     1800

cctaagaaag ataaaaaaaa gaaggctgac gaaacacagg ccctcccaca acggcaaaag     1860

aagcagcaga ctgtcacatt gcttcctgcc gcagacctgg acgacttctc caagcagctg     1920

caacagtcta tgtcctccgc agactccaca caagcaggtg gtggcggatc tggagggggt     1980

ggctctggtg gtggtggcag cgctccaacg aaagtaacct tcggggatga cacagtgata     2040

gaggtgcagg gatataaatc agtgaacatc acatttgaac tggacgagag gatcgataaa     2100

gtactgaacg aaaaatgctc agcttacacc gtagaactgg gcacagaagt caacgaattc     2160

gcatgcgtcg tggctgacgc tgttattaaa accctgcagc ctgtcagcga gttgctcaca     2220

cccctgggaa tcgatctgga cgaatggagc atggcaacat attacctctt cgacgaatct     2280

ggagagttca aactcgcttc tcacatgtat tgctccttct atcctccgga tgaaggcggc     2340

ggcggatccg ggggaggcgg gtctggcggc ggcggctctt ctaatctcgg gatgccttca     2400

tactgtaccg gctacagaga gggctacctc aacagcacaa acgtaaccat agccacctat     2460

tgcaccggaa gtataccgtg ctccgtgtgt ctgagtggac ttgatagcct cgacacttat     2520

ccatcactgg aaacaatcca aataaccatt agctctttca aatgggatct taccgccttc     2580

ggactcgtgg ccgagtggtt tttggcttat attctgttca ccaggggcgg cggtggatct     2640

ggaggcggtg ggagtggggg cggcggctcc gctatcgcca gtgaatttag cagcctccca     2700

tcttatgctg cctttgctac cgctcaggag gcttacgagc aggccgtcgc caacggagat     2760

agcgaagtgg tgctgaaaaa gctgaaaaaa tccctgaatg tggccaaaag cgagttcgac     2820

cgcgatgctg caatgcagag gaagctggaa aagatggcag atcaggccat gacgcagatg     2880

tacaaacagg ctcgctcaga ggataaacgg gccaaggtga cctctgctat gcagactatg     2940

ctgtttacca tgctcagaaa gctggacaac gatgccctca ataacatcat aaataatgca     3000

agagacgggt gcgtgcctct taacatcatc ccccttacta cggccgccaa actgatggtc     3060

gttataccag attacaatac ttacaagaat acgtgcgatg ggacaacctt tacttatgcc     3120

agcgctctgt gggagataca gcaggtagtg gacgcggata gcaaaattgt tcaactgagc     3180

gagatctcca tggataacag tccgaacctg gcctggccac tgatcgtcac cgccctcaga     3240

gctaatagtg ccgtcaagct gcaatga                                         3267


<210>  33
<211>  15
<212>  PRT
<213>  artificial

<220>
<223>  linker

<400>  33

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
1               5                   10                  15  


