﻿                         序列表

<110>  武汉大学

<120>  新型冠状病毒突变株S蛋白及其亚单位疫苗

<160>  8

<170>  SIPOSequenceListing 1.0

<210>  1
<211>  3688
<212>  DNA
<213>  人工序列(Artificial Sequence)

<220>
<221>  modified_base
<222>  (2035)..(2035)
<223>  r=(ggcagcgccagc)或者(ggcggcggcagc)n或者(ggcggcggcggcagc)n或者(ggc)n，其中1≤n≤3，且n为整数


<400>  1
atgttcgtgt tcctggtgct gctgcccctg gtgagcagcc agtgcgtgaa cctgaccacc        60
cgcacccagc tgccccccgc ctacaccaac agcttcaccc gcggcgtgta ctaccccgac       120
aaggtgttcc gcagcagcgt gctgcacagc acccaggacc tgttcctgcc cttcttcagc       180
aacgtgacct ggttccacgc catcagcggc accaacggca ccaagcgctt cgacaacccc       240
gtgctgccct tcaacgacgg cgtgtacttc gccagcaccg agaagagcaa catcatccgc       300
ggctggatct tcggcaccac cctggacagc aagacccaga gcctgctgat cgtgaacaac       360
gccaccaacg tggtgatcaa ggtgtgcgag ttccagttct gcaacgaccc cttcctgggc       420
gtgtaccaca agaacaacaa gagctggatg gagagcgagt tccgcgtgta cagcagcgcc       480
aacaactgca ccttcgagta cgtgagccag cccttcctga tggacctgga gggcaagcag       540
ggcaacttca agaacctgcg cgagttcgtg ttcaagaaca tcgacggcta cttcaagatc       600
tacagcaagc acacccccat caacctggtg cgcgacctgc cccagggctt cagcgccctg       660
gagcccctgg tggacctgcc catcggcatc aacatcaccc gcttccagac cctgctggcc       720
ctgcaccgca gctacctgac ccccggcgac agcagcagcg gctggaccgc cggcgccgcc       780
gcctactacg tgggctacct gcagccccgc accttcctgc tgaagtacaa cgagaacggc       840
accatcaccg acgccgtgga ctgcgccctg gaccccctga gcgagaccaa gtgcaccctg       900
aagagcttca ccgtggagaa gggcatctac cagaccagca acttccgcgt gcagcccacc       960
gagagcatcg tgcgcttccc caacatcacc aacctgtgcc ccttcggcga ggtgttcaac      1020
gccacccgct tcgccagcgt gtacgcctgg aaccgcaagc gcatcagcaa ctgcgtggcc      1080
gactacagcg tgctgtacaa cagcgccagc ttcagcacct tcaagtgcta cggcgtgagc      1140
cccaccaagc tgaacgacct gtgcttcacc aacgtgtacg ccgacagctt cgtgatccgc      1200
ggcgacgagg tgcgccagat cgcccccggc cagaccggca agatcgccga ctacaactac      1260
aagctgcccg acgacttcac cggctgcgtg atcgcctgga acagcaacaa cctggacagc      1320
aaggtgggcg gcaactacaa ctacctgtac cgcctgttcc gcaagagcaa cctgaagccc      1380
ttcgagcgcg acatcagcac cgagatctac caggccggca gcaccccctg caacggcgtg      1440
aagggcttca actgctactt ccccctgcag agctacggct tccagcccac ctacggcgtg      1500
ggctaccagc cctaccgcgt ggtggtgctg agcttcgagc tgctgcacgc ccccgccacc      1560
gtgtgcggcc ccaagaagag caccaacctg gtgaagaaca agtgcgtgaa cttcaacttc      1620
aacggcctga ccggcaccgg cgtgctgacc gagagcaaca agaagttcct gcccttccag      1680
cagttcggcc gcgacatcgc cgacaccacc gacgccgtgc gcgaccccca gaccctggag      1740
atcctggaca tcaccccctg cagcttcggc ggcgtgagcg tgatcacccc cggcaccaac      1800
accagcaacc aggtggccgt gctgtaccag ggcgtgaact gcaccgaggt gcccgtggcc      1860
atccacgccg accagctgac ccccacctgg cgcgtgtaca gcaccggcag caacgtgttc      1920
cagacccgcg ccggctgcct gatcggcgcc gagcacgtga acaacagcta cgagtgcgac      1980
atccccatcg gcgccggcat ctgcgccagc taccagaccc agaccaacag ccacragcgt      2040
ggccagccag agcatcatcg cctacaccat gagcctgggc gccgagaaca gcgtggccta      2100
cagcaacaac agcatcgcca tccccaccaa cttcaccatc agcgtgacca ccgagatcct      2160
gcccgtgagc atgaccaaga ccagcgtgga ctgcaccatg tacatctgcg gcgacagcac      2220
cgagtgcagc aacctgctgc tgcagtacgg cagcttctgc acccagctga accgcgccct      2280
gaccggcatc gccgtggagc aggacaagaa cacccaggag gtgttcgccc aggtgaagca      2340
gatctacaag acccccccca tcaaggactt cggcggcttc aacttcagcc agatcctgcc      2400
cgaccccagc aagcccagca agcgcagctt catcgaggac ctgctgttca acaaggtgac      2460
cctggccgac gccggcttca tcaagcagta cggcgactgc ctgggcgaca tcgccgcccg      2520
cgacctgatc tgcgcccaga agttcaacgg cctgaccgtg ctgccccccc tgctgaccga      2580
cgagatgatc gcccagtaca ccagcgccct gctggccggc accatcacca gcggctggac      2640
cttcggcgcc ggcgccgccc tgcagatccc cttcgccatg cagatggcct accgcttcaa      2700
cggcatcggc gtgacccaga acgtgctgta cgagaaccag aagctgatcg ccaaccagtt      2760
caacagcgcc atcggcaaga tccaggacag cctgagcagc accgccagcg ccctgggcaa      2820
gctgcaggac gtggtgaacc agaacgccca ggccctgaac accctggtga agcagctgag      2880
cagcaacttc ggcgccatca gcagcgtgct gaacgacatc ctgagccgcc tggacaaggt      2940
ggaggccgag gtgcagatcg accgcctgat caccggccgc ctgcagagcc tgcagaccta      3000
cgtgacccag cagctgatcc gcgccgccga gatccgcgcc agcgccaacc tggccgccac      3060
caagatgagc gagtgcgtgc tgggccagag caagcgcgtg gacttctgcg gcaagggcta      3120
ccacctgatg agcttccccc agagcgcccc ccacggcgtg gtgttcctgc acgtgaccta      3180
cgtgcccgcc caggagaaga acttcaccac cgcccccgcc atctgccacg acggcaaggc      3240
ccacttcccc cgcgagggcg tgttcgtgag caacggcacc cactggttcg tgacccagcg      3300
caacttctac gagccccaga tcatcaccac cgacaacacc ttcgtgagcg gcaactgcga      3360
cgtggtgatc ggcatcgtga acaacaccgt gtacgacccc ctgcagcccg agctggacag      3420
cttcaaggag gagctggaca agtacttcaa gaaccacacc agccccgacg tggacctggg      3480
cgacatcagc ggcatcaacg ccagcgtggt gaacatccag aaggagatcg accgcctgaa      3540
cgaggtggcc aagaacctga acgagagcct gatcgacctg caggagctgg gcaagtacga      3600
gcagggctac atccccgagg ccccccgcga cggccaggcc tacgtgcgca aggacggcga      3660
gtgggtgctg ctgagcacct tcctgtga                                         3688

<210>  2
<211>  1229
<212>  PRT
<213>  人工序列(Artificial Sequence)

<220>
<221>  UNSURE
<222>  (679)..(679)
<223>  Xaa=GSAS或者(GGGS)n或者(GGGGS)n或者(G)n，其中1≤n≤3，且n为整数


<400>  2
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  
Phe His Ala Ile Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp Asn Pro 
65                  70                  75                  80  
Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu Lys Ser 
                85                  90                  95      
Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser Lys Thr 
            100                 105                 110         
Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile Lys Val 
        115                 120                 125             
Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr His Lys 
    130                 135                 140                 
Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr Ser Ser Ala 
145                 150                 155                 160 
Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu Met Asp Leu 
                165                 170                 175     
Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe Val Phe Lys 
            180                 185                 190         
Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr Pro Ile Asn 
        195                 200                 205             
Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu Pro Leu Val 
    210                 215                 220                 
Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr Leu Leu Ala 
225                 230                 235                 240 
Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser Gly Trp Thr 
                245                 250                 255     
Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro Arg Thr Phe 
            260                 265                 270         
Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala Val Asp Cys 
        275                 280                 285             
Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys Ser Phe Thr 
    290                 295                 300                 
Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln Pro Thr 
305                 310                 315                 320 
Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly 
                325                 330                 335     
Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg 
            340                 345                 350         
Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser 
        355                 360                 365             
Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu 
    370                 375                 380                 
Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg 
385                 390                 395                 400 
Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala 
                405                 410                 415     
Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala 
            420                 425                 430         
Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr 
        435                 440                 445             
Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp 
    450                 455                 460                 
Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val 
465                 470                 475                 480 
Lys Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro 
                485                 490                 495     
Thr Tyr Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe 
            500                 505                 510         
Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr 
        515                 520                 525             
Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr 
    530                 535                 540                 
Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln 
545                 550                 555                 560 
Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val Arg Asp Pro 
                565                 570                 575     
Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe Gly Gly Val 
            580                 585                 590         
Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val Ala Val Leu 
        595                 600                 605             
Tyr Gln Gly Val Asn Cys Thr Glu Val Pro Val Ala Ile His Ala Asp 
    610                 615                 620                 
Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser Asn Val Phe 
625                 630                 635                 640 
Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val Asn Asn Ser 
                645                 650                 655     
Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala Ser Tyr Gln 
            660                 665                 670         
Thr Gln Thr Asn Ser His Xaa Ser Val Ala Ser Gln Ser Ile Ile Ala 
        675                 680                 685             
Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn 
    690                 695                 700                 
Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile Ser Val Thr Thr Glu Ile 
705                 710                 715                 720 
Leu Pro Val Ser Met Thr Lys Thr Ser Val Asp Cys Thr Met Tyr Ile 
                725                 730                 735     
Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu Leu Leu Gln Tyr Gly Ser 
            740                 745                 750         
Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr Gly Ile Ala Val Glu Gln 
        755                 760                 765             
Asp Lys Asn Thr Gln Glu Val Phe Ala Gln Val Lys Gln Ile Tyr Lys 
    770                 775                 780                 
Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu 
785                 790                 795                 800 
Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu 
                805                 810                 815     
Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly 
            820                 825                 830         
Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys 
        835                 840                 845             
Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr Asp Glu Met Ile 
    850                 855                 860                 
Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp 
865                 870                 875                 880 
Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met 
                885                 890                 895     
Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu 
            900                 905                 910         
Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile 
        915                 920                 925             
Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp 
    930                 935                 940                 
Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu 
945                 950                 955                 960 
Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser 
                965                 970                 975     
Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp Arg Leu Ile Thr 
            980                 985                 990         
Gly Arg Leu Gln Ser Leu Gln Thr  Tyr Val Thr Gln Gln  Leu Ile Arg 
        995                 1000                 1005             
Ala Ala  Glu Ile Arg Ala Ser  Ala Asn Leu Ala Ala  Thr Lys Met Ser 
    1010                 1015                 1020                 
Glu  Cys Val Leu Gly Gln  Ser Lys Arg Val Asp  Phe Cys Gly Lys Gly  
1025                 1030                 1035                 1040 
Tyr His Leu Met Ser  Phe Pro Gln Ser Ala  Pro His Gly Val Val  Phe 
                1045                 1050                 1055     
Leu His Val Thr  Tyr Val Pro Ala Gln  Glu Lys Asn Phe Thr  Thr Ala 
            1060                 1065                 1070         
Pro Ala Ile  Cys His Asp Gly Lys  Ala His Phe Pro Arg  Glu Gly Val 
        1075                 1080                 1085             
Phe Val  Ser Asn Gly Thr His  Trp Phe Val Thr Gln  Arg Asn Phe Tyr 
    1090                 1095                 1100                 
Glu  Pro Gln Ile Ile Thr  Thr Asp Asn Thr Phe  Val Ser Gly Asn Cys  
1105                 1110                 1115                 1120 
Asp Val Val Ile Gly  Ile Val Asn Asn Thr  Val Tyr Asp Pro Leu  Gln 
                1125                 1130                 1135     
Pro Glu Leu Asp  Ser Phe Lys Glu Glu  Leu Asp Lys Tyr Phe  Lys Asn 
            1140                 1145                 1150         
His Thr Ser  Pro Asp Val Asp Leu  Gly Asp Ile Ser Gly  Ile Asn Ala 
        1155                 1160                 1165             
Ser Val  Val Asn Ile Gln Lys  Glu Ile Asp Arg Leu  Asn Glu Val Ala 
    1170                 1175                 1180                 
Lys  Asn Leu Asn Glu Ser  Leu Ile Asp Leu Gln  Glu Leu Gly Lys Tyr  
1185                 1190                 1195                 1200 
Glu Gln Gly Tyr Ile  Pro Glu Ala Pro Arg  Asp Gly Gln Ala Tyr  Val 
                1205                 1210                 1215     
Arg Lys Asp Gly  Glu Trp Val Leu Leu  Ser Thr Phe Leu 
            1220                 1225                 

<210>  3
<211>  3679
<212>  DNA
<213>  人工序列(Artificial Sequence)

<220>
<221>  modified_base
<222>  (2026)..(2026)
<223>  r=(ggcagcgccagc)或者(ggcggcggcagc)n或者(ggcggcggcggcagc)n或者(ggc)n，其中1≤n≤3，且n为整数


<400>  3
atgttcgtgt tcctggtgct gctgcccctg gtgagcagcc agtgcgtgaa cctgaccacc        60
cgcacccagc tgccccccgc ctacaccaac agcttcaccc gcggcgtgta ctaccccgac       120
aaggtgttcc gcagcagcgt gctgcacagc acccaggacc tgttcctgcc cttcttcagc       180
aacgtgacct ggttccacgc catcagcggc accaacggca ccaagcgctt cgacaacccc       240
gtgctgccct tcaacgacgg cgtgtacttc gccagcaccg agaagagcaa catcatccgc       300
ggctggatct tcggcaccac cctggacagc aagacccaga gcctgctgat cgtgaacaac       360
gccaccaacg tggtgatcaa ggtgtgcgag ttccagttct gcaacgaccc cttcctgggc       420
gtgtaccaca agaacaacaa gagctggatg gagagcgagt tccgcgtgta cagcagcgcc       480
aacaactgca ccttcgagta cgtgagccag cccttcctga tggacctgga gggcaagcag       540
ggcaacttca agaacctgcg cgagttcgtg ttcaagaaca tcgacggcta cttcaagatc       600
tacagcaagc acacccccat caacctggtg cgcgacctgc cccagggctt cagcgtgctg       660
gagcccctgg tggacctgcc catcggcatc aacatcaccc gcttccagac cctgcaccgc       720
agctacctga cccccggcga cagcagcagc ggctggaccg ccggcgccgc cgcctactac       780
gtgggctacc tgcagccccg caccttcctg ctgaagtaca acgagaacgg caccatcacc       840
gacgccgtgg actgcgccct ggaccccctg agcgagacca agtgcaccct gaagagcttc       900
accgtggaga agggcatcta ccagaccagc aacttccgcg tgcagcccac cgagagcatc       960
gtgcgcttcc ccaacatcac caacctgtgc cccttcggcg aggtgttcaa cgccacccgc      1020
ttcgccagcg tgtacgcctg gaaccgcaag cgcatcagca actgcgtggc cgactacagc      1080
gtgctgtaca acagcgccag cttcagcacc ttcaagtgct acggcgtgag ccccaccaag      1140
ctgaacgacc tgtgcttcac caacgtgtac gccgacagct tcgtgatccg cggcgacgag      1200
gtgcgccaga tcgcccccgg ccagaccggc aacatcgccg actacaacta caagctgccc      1260
gacgacttca ccggctgcgt gatcgcctgg aacagcaaca acctggacag caaggtgggc      1320
ggcaactaca actaccgcta ccgcctgttc cgcaagagca acctgaagcc cttcgagcgc      1380
gacatcagca ccgagatcta ccaggccggc agcaccccct gcaacggcgt gaagggcttc      1440
aactgctact tccccctgca gagctacggc ttccagccca cctacggcgt gggctaccag      1500
ccctaccgcg tggtggtgct gagcttcgag ctgctgcacg cccccgccac cgtgtgcggc      1560
cccaagaaga gcaccaacct ggtgaagaac aagtgcgtga acttcaactt caacggcctg      1620
accggcaccg gcgtgctgac cgagagcaac aagaagttcc tgcccttcca gcagttcggc      1680
cgcgacatcg acgacaccac cgacgccgtg cgcgaccccc agaccctgga gatcctggac      1740
atcaccccct gcagcttcgg cggcgtgagc gtgatcaccc ccggcaccaa caccagcaac      1800
caggtggccg tgctgtacca gggcgtgaac tgcaccgagg tgcccgtggc catccacgcc      1860
gaccagctga cccccacctg gcgcgtgtac agcaccggca gcaacgtgtt ccagacccgc      1920
gccggctgcc tgatcggcgc cgagcacgtg aacaacagct acgagtgcga catccccatc      1980
ggcgccggca tctgcgccag ctaccagacc cacaccaaca gccacragcg tggccagcca      2040
gagcatcatc gcctacacca tgagcctggg cgccgagaac agcgtggcct acagcaacaa      2100
cagcatcgcc atccccatca acttcaccat cagcgtgacc accgagatcc tgcccgtgag      2160
catgaccaag accagcgtgg actgcaccat gtacatctgc ggcgacagca ccgagtgcag      2220
caacctgctg ctgcagtacg gcagcttctg cacccagctg aaccgcgccc tgaccggcat      2280
cgccgtggag caggacaaga acacccagga ggtgttcgcc caggtgaagc agatctacaa      2340
gacccccccc atcaaggact tcggcggctt caacttcagc cagatcctgc ccgaccccag      2400
caagcccagc aagcgcagct tcatcgagga cctgctgttc aacaaggtga ccctggccga      2460
cgccggcttc atcaagcagt acggcgactg cctgggcgac atcgccgccc gcgacctgat      2520
ctgcgcccag aagttcaacg gcctgaccgt gctgcccccc ctgctgaccg acgagatgat      2580
cgcccagtac accagcgccc tgctggccgg caccatcacc agcggctgga ccttcggcgc      2640
cggcgccgcc ctgcagatcc ccttcgccat gcagatggcc taccgcttca acggcatcgg      2700
cgtgacccag aacgtgctgt acgagaacca gaagctgatc gccaaccagt tcaacagcgc      2760
catcggcaag atccaggaca gcctgagcag caccgccagc gccctgggca agctgcagga      2820
cgtggtgaac cagaacgccc aggccctgaa caccctggtg aagcagctga gcagcaactt      2880
cggcgccatc agcagcgtgc tgaacgacat cctggcccgc ctggacaagg tggaggccga      2940
ggtgcagatc gaccgcctga tcaccggccg cctgcagagc ctgcagacct acgtgaccca      3000
gcagctgatc cgcgccgccg agatccgcgc cagcgccaac ctggccgcca ccaagatgag      3060
cgagtgcgtg ctgggccaga gcaagcgcgt ggacttctgc ggcaagggct accacctgat      3120
gagcttcccc cagagcgccc cccacggcgt ggtgttcctg cacgtgacct acgtgcccgc      3180
ccaggagaag aacttcacca ccgcccccgc catctgccac gacggcaagg cccacttccc      3240
ccgcgagggc gtgttcgtga gcaacggcac ccactggttc gtgacccagc gcaacttcta      3300
cgagccccag atcatcacca cccacaacac cttcgtgagc ggcaactgcg acgtggtgat      3360
cggcatcgtg aacaacaccg tgtacgaccc cctgcagccc gagctggaca gcttcaagga      3420
ggagctggac aagtacttca agaaccacac cagccccgac gtggacctgg gcgacatcag      3480
cggcatcaac gccagcgtgg tgaacatcca gaaggagatc gaccgcctga acgaggtggc      3540
caagaacctg aacgagagcc tgatcgacct gcaggagctg ggcaagtacg agcagggcta      3600
catccccgag gccccccgcg acggccaggc ctacgtgcgc aaggacggcg agtgggtgct      3660
gctgagcacc ttcctgtga                                                   3679

<210>  4
<211>  1226
<212>  PRT
<213>  人工序列(Artificial Sequence)

<220>
<221>  UNSURE
<222>  (676)..(676)
<223>  Xaa=GSAS或者(GGGS)n或者(GGGGS)n或者(G)n，其中1≤n≤3，且n为整数


<400>  4
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  
Phe His Ala Ile Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp Asn Pro 
65                  70                  75                  80  
Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu Lys Ser 
                85                  90                  95      
Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser Lys Thr 
            100                 105                 110         
Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile Lys Val 
        115                 120                 125             
Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr His Lys 
    130                 135                 140                 
Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr Ser Ser Ala 
145                 150                 155                 160 
Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu Met Asp Leu 
                165                 170                 175     
Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe Val Phe Lys 
            180                 185                 190         
Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr Pro Ile Asn 
        195                 200                 205             
Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Val Leu Glu Pro Leu Val 
    210                 215                 220                 
Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr Leu His Arg 
225                 230                 235                 240 
Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser Gly Trp Thr Ala Gly Ala 
                245                 250                 255     
Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro Arg Thr Phe Leu Leu Lys 
            260                 265                 270         
Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala Val Asp Cys Ala Leu Asp 
        275                 280                 285             
Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys Ser Phe Thr Val Glu Lys 
    290                 295                 300                 
Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln Pro Thr Glu Ser Ile 
305                 310                 315                 320 
Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe 
                325                 330                 335     
Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile 
            340                 345                 350         
Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe 
        355                 360                 365             
Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu 
    370                 375                 380                 
Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu 
385                 390                 395                 400 
Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Asn Ile Ala Asp Tyr Asn 
                405                 410                 415     
Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser 
            420                 425                 430         
Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Arg Tyr Arg 
        435                 440                 445             
Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr 
    450                 455                 460                 
Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Lys Gly Phe 
465                 470                 475                 480 
Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Tyr Gly 
                485                 490                 495     
Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu 
            500                 505                 510         
His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val 
        515                 520                 525             
Lys Asn Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly 
    530                 535                 540                 
Val Leu Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly 
545                 550                 555                 560 
Arg Asp Ile Asp Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu 
                565                 570                 575     
Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe Gly Gly Val Ser Val Ile 
            580                 585                 590         
Thr Pro Gly Thr Asn Thr Ser Asn Gln Val Ala Val Leu Tyr Gln Gly 
        595                 600                 605             
Val Asn Cys Thr Glu Val Pro Val Ala Ile His Ala Asp Gln Leu Thr 
    610                 615                 620                 
Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser Asn Val Phe Gln Thr Arg 
625                 630                 635                 640 
Ala Gly Cys Leu Ile Gly Ala Glu His Val Asn Asn Ser Tyr Glu Cys 
                645                 650                 655     
Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala Ser Tyr Gln Thr His Thr 
            660                 665                 670         
Asn Ser His Xaa Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met 
        675                 680                 685             
Ser Leu Gly Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala 
    690                 695                 700                 
Ile Pro Ile Asn Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val 
705                 710                 715                 720 
Ser Met Thr Lys Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp 
                725                 730                 735     
Ser Thr Glu Cys Ser Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr 
            740                 745                 750         
Gln Leu Asn Arg Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn 
        755                 760                 765             
Thr Gln Glu Val Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro 
    770                 775                 780                 
Ile Lys Asp Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro 
785                 790                 795                 800 
Ser Lys Pro Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys 
                805                 810                 815     
Val Thr Leu Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu 
            820                 825                 830         
Gly Asp Ile Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly 
        835                 840                 845             
Leu Thr Val Leu Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr 
    850                 855                 860                 
Thr Ser Ala Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly 
865                 870                 875                 880 
Ala Gly Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg 
                885                 890                 895     
Phe Asn Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys 
            900                 905                 910         
Leu Ile Ala Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser 
        915                 920                 925             
Leu Ser Ser Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn 
    930                 935                 940                 
Gln Asn Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn 
945                 950                 955                 960 
Phe Gly Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ala Arg Leu Asp 
                965                 970                 975     
Lys Val Glu Ala Glu Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu 
            980                 985                 990         
Gln Ser Leu Gln Thr Tyr Val Thr  Gln Gln Leu Ile Arg  Ala Ala Glu 
        995                 1000                 1005             
Ile Arg  Ala Ser Ala Asn Leu  Ala Ala Thr Lys Met  Ser Glu Cys Val 
    1010                 1015                 1020                 
Leu  Gly Gln Ser Lys Arg  Val Asp Phe Cys Gly  Lys Gly Tyr His Leu  
1025                 1030                 1035                 1040 
Met Ser Phe Pro Gln  Ser Ala Pro His Gly  Val Val Phe Leu His  Val 
                1045                 1050                 1055     
Thr Tyr Val Pro  Ala Gln Glu Lys Asn  Phe Thr Thr Ala Pro  Ala Ile 
            1060                 1065                 1070         
Cys His Asp  Gly Lys Ala His Phe  Pro Arg Glu Gly Val  Phe Val Ser 
        1075                 1080                 1085             
Asn Gly  Thr His Trp Phe Val  Thr Gln Arg Asn Phe  Tyr Glu Pro Gln 
    1090                 1095                 1100                 
Ile  Ile Thr Thr His Asn  Thr Phe Val Ser Gly  Asn Cys Asp Val Val  
1105                 1110                 1115                 1120 
Ile Gly Ile Val Asn  Asn Thr Val Tyr Asp  Pro Leu Gln Pro Glu  Leu 
                1125                 1130                 1135     
Asp Ser Phe Lys  Glu Glu Leu Asp Lys  Tyr Phe Lys Asn His  Thr Ser 
            1140                 1145                 1150         
Pro Asp Val  Asp Leu Gly Asp Ile  Ser Gly Ile Asn Ala  Ser Val Val 
        1155                 1160                 1165             
Asn Ile  Gln Lys Glu Ile Asp  Arg Leu Asn Glu Val  Ala Lys Asn Leu 
    1170                 1175                 1180                 
Asn  Glu Ser Leu Ile Asp  Leu Gln Glu Leu Gly  Lys Tyr Glu Gln Gly  
1185                 1190                 1195                 1200 
Tyr Ile Pro Glu Ala  Pro Arg Asp Gly Gln  Ala Tyr Val Arg Lys  Asp 
                1205                 1210                 1215     
Gly Glu Trp Val  Leu Leu Ser Thr Phe  Leu 
            1220                 1225     

<210>  5
<211>  25
<212>  PRT
<213>  人工序列(Artificial Sequence)

<400>  5
Pro Trp Tyr Ile Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val 
1               5                   10                  15      
Met Val Thr Ile Met Leu Cys Cys Met 
            20                  25  

<210>  6
<211>  27
<212>  PRT
<213>  人工序列(Artificial Sequence)

<400>  6
Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr Val Arg Lys 
1               5                   10                  15      
Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu 
            20                  25          

<210>  7
<211>  31
<212>  PRT
<213>  人工序列(Artificial Sequence)

<400>  7
Met Lys Gln Ile Glu Asp Lys Ile Glu Glu Ile Leu Ser Lys Ile Tyr 
1               5                   10                  15      
His Ile Glu Asn Glu Ile Ala Arg Ile Lys Lys Leu Ile Gly Glu 
            20                  25                  30      

<210>  8
<211>  3708
<212>  DNA
<213>  人工序列(Artificial Sequence)

<400>  8
atgttcgtgt tcctggtgct gctgcccctg gtgagcagcc agtgcgtgaa cctgaccacc        60
cgcacccagc tgccccccgc ctacaccaac agcttcaccc gcggcgtgta ctaccccgac       120
aaggtgttcc gcagcagcgt gctgcacagc acccaggacc tgttcctgcc cttcttcagc       180
aacgtgacct ggttccacgc catccacgtg agcggcacca acggcaccaa gcgcttcgac       240
aaccccgtgc tgcccttcaa cgacggcgtg tacttcgcca gcaccgagaa gagcaacatc       300
atccgcggct ggatcttcgg caccaccctg gacagcaaga cccagagcct gctgatcgtg       360
aacaacgcca ccaacgtggt gatcaaggtg tgcgagttcc agttctgcaa cgaccccttc       420
ctgggcgtgt actaccacaa gaacaacaag agctggatgg agagcgagtt ccgcgtgtac       480
agcagcgcca acaactgcac cttcgagtac gtgagccagc ccttcctgat ggacctggag       540
ggcaagcagg gcaacttcaa gaacctgcgc gagttcgtgt tcaagaacat cgacggctac       600
ttcaagatct acagcaagca cacccccatc aacctggtgc gcgacctgcc ccagggcttc       660
agcgccctgg agcccctggt ggacctgccc atcggcatca acatcacccg cttccagacc       720
ctgctggccc tgcaccgcag ctacctgacc cccggcgaca gcagcagcgg ctggaccgcc       780
ggcgccgccg cctactacgt gggctacctg cagccccgca ccttcctgct gaagtacaac       840
gagaacggca ccatcaccga cgccgtggac tgcgccctgg accccctgag cgagaccaag       900
tgcaccctga agagcttcac cgtggagaag ggcatctacc agaccagcaa cttccgcgtg       960
cagcccaccg agagcatcgt gcgcttcccc aacatcacca acctgtgccc cttcggcgag      1020
gtgttcaacg ccacccgctt cgccagcgtg tacgcctgga accgcaagcg catcagcaac      1080
tgcgtggccg actacagcgt gctgtacaac agcgccagct tcagcacctt caagtgctac      1140
ggcgtgagcc ccaccaagct gaacgacctg tgcttcacca acgtgtacgc cgacagcttc      1200
gtgatccgcg gcgacgaggt gcgccagatc gcccccggcc agaccggcaa gatcgccgac      1260
tacaactaca agctgcccga cgacttcacc ggctgcgtga tcgcctggaa cagcaacaac      1320
ctggacagca aggtgggcgg caactacaac tacctgtacc gcctgttccg caagagcaac      1380
ctgaagccct tcgagcgcga catcagcacc gagatctacc aggccggcag caccccctgc      1440
aacggcgtgg agggcttcaa ctgctacttc cccctgcaga gctacggctt ccagcccacc      1500
aacggcgtgg gctaccagcc ctaccgcgtg gtggtgctga gcttcgagct gctgcacgcc      1560
cccgccaccg tgtgcggccc caagaagagc accaacctgg tgaagaacaa gtgcgtgaac      1620
ttcaacttca acggcctgac cggcaccggc gtgctgaccg agagcaacaa gaagttcctg      1680
cccttccagc agttcggccg cgacatcgcc gacaccaccg acgccgtgcg cgacccccag      1740
accctggaga tcctggacat caccccctgc agcttcggcg gcgtgagcgt gatcaccccc      1800
ggcaccaaca ccagcaacca ggtggccgtg ctgtaccagg acgtgaactg caccgaggtg      1860
cccgtggcca tccacgccga ccagctgacc cccacctggc gcgtgtacag caccggcagc      1920
aacgtgttcc agacccgcgc cggctgcctg atcggcgccg agcacgtgaa caacagctac      1980
gagtgcgaca tccccatcgg cgccggcatc tgcgccagct accagaccca gaccaacagc      2040
cccggcagcg ccagcagcgt ggccagccag agcatcatcg cctacaccat gagcctgggc      2100
gccgagaaca gcgtggccta cagcaacaac agcatcgcca tccccaccaa cttcaccatc      2160
agcgtgacca ccgagatcct gcccgtgagc atgaccaaga ccagcgtgga ctgcaccatg      2220
tacatctgcg gcgacagcac cgagtgcagc aacctgctgc tgcagtacgg cagcttctgc      2280
acccagctga accgcgccct gaccggcatc gccgtggagc aggacaagaa cacccaggag      2340
gtgttcgccc aggtgaagca gatctacaag acccccccca tcaaggactt cggcggcttc      2400
aacttcagcc agatcctgcc cgaccccagc aagcccagca agcgcagctt catcgaggac      2460
ctgctgttca acaaggtgac cctggccgac gccggcttca tcaagcagta cggcgactgc      2520
ctgggcgaca tcgccgcccg cgacctgatc tgcgcccaga agttcaacgg cctgaccgtg      2580
ctgccccccc tgctgaccga cgagatgatc gcccagtaca ccagcgccct gctggccggc      2640
accatcacca gcggctggac cttcggcgcc ggcgccgccc tgcagatccc cttcgccatg      2700
cagatggcct accgcttcaa cggcatcggc gtgacccaga acgtgctgta cgagaaccag      2760
aagctgatcg ccaaccagtt caacagcgcc atcggcaaga tccaggacag cctgagcagc      2820
accgccagcg ccctgggcaa gctgcaggac gtggtgaacc agaacgccca ggccctgaac      2880
accctggtga agcagctgag cagcaacttc ggcgccatca gcagcgtgct gaacgacatc      2940
ctgagccgcc tggacgtgaa ggaggccgag gtgcagatcg accgcctgat caccggccgc      3000
ctgcagagcc tgcagaccta cgtgacccag cagctgatcc gcgccgccga gatccgcgcc      3060
agcgccaacc tggccgccac caagatgagc gagtgcgtgc tgggccagag caagcgcgtg      3120
gacttctgcg gcaagggcta ccacctgatg agcttccccc agagcgcccc ccacggcgtg      3180
gtgttcctgc acgtgaccta cgtgcccgcc caggagaaga acttcaccac cgcccccgcc      3240
atctgccacg acggcaaggc ccacttcccc cgcgagggcg tgttcgtgag caacggcacc      3300
cactggttcg tgacccagcg caacttctac gagccccaga tcatcaccac cgacaacacc      3360
ttcgtgagcg gcaactgcga cgtggtgatc ggcatcgtga acaacaccgt gtacgacccc      3420
ctgcagcccg agctggacag cttcaaggag gagctggaca agtacttcaa gaaccacacc      3480
agccccgacg tggacctggg cgacatcagc ggcatcaacg ccagcgtggt gaacatccag      3540
aaggagatcg accgcctgaa cgaggtggcc aagaacctga acgagagcct gatcgacctg      3600
caggagctgg gcaagtacga gcagggctac atccccgagg ccccccgcga cggccaggcc      3660
tacgtgcgca aggacggcga gtgggtgctg ctgagcacct tcctgtga                   3708

