                         SEQUENCE LISTING

<110>  SUZHOU ABOGEN BIOSCIENCES CO., LTD.
 
<120>  NUCLEIC ACID VACCINES FOR CORONAVIRUS

<130>  14639-009-228

<140>  TBA
<141>  

<150>  202010276288.0
<151>  2020-04-09

<150>  63/011,116
<151>  2020-04-16

<150>  202110293284.8
<151>  2021-03-19

<160>  59    

<170>  PatentIn version 3.5

<210>  1
<211>  29903
<212>  DNA
<213>  Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)

<220>
<223>  Severe acute respiratory syndrome coronavirus 2 isolate
 Wuhan-Hu-1, complete genome

<300>
<308>  GenBank/MN908947.3
<309>  2020-03-18
<313>  (1)..(29903)

<400>  1
attaaaggtt tataccttcc caggtaacaa accaaccaac tttcgatctc ttgtagatct       60

gttctctaaa cgaactttaa aatctgtgtg gctgtcactc ggctgcatgc ttagtgcact      120

cacgcagtat aattaataac taattactgt cgttgacagg acacgagtaa ctcgtctatc      180

ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga tcatcagcac atctaggttt      240

cgtccgggtg tgaccgaaag gtaagatgga gagccttgtc cctggtttca acgagaaaac      300

acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac gtgctcgtac gtggctttgg      360

agactccgtg gaggaggtct tatcagaggc acgtcaacat cttaaagatg gcacttgtgg      420

cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa cagccctatg tgttcatcaa      480

acgttcggat gctcgaactg cacctcatgg tcatgttatg gttgagctgg tagcagaact      540

cgaaggcatt cagtacggtc gtagtggtga gacacttggt gtccttgtcc ctcatgtggg      600

cgaaatacca gtggcttacc gcaaggttct tcttcgtaag aacggtaata aaggagctgg      660

tggccatagt tacggcgccg atctaaagtc atttgactta ggcgacgagc ttggcactga      720

tccttatgaa gattttcaag aaaactggaa cactaaacat agcagtggtg ttacccgtga      780

actcatgcgt gagcttaacg gaggggcata cactcgctat gtcgataaca acttctgtgg      840

ccctgatggc taccctcttg agtgcattaa agaccttcta gcacgtgctg gtaaagcttc      900

atgcactttg tccgaacaac tggactttat tgacactaag aggggtgtat actgctgccg      960

tgaacatgag catgaaattg cttggtacac ggaacgttct gaaaagagct atgaattgca     1020

gacacctttt gaaattaaat tggcaaagaa atttgacacc ttcaatgggg aatgtccaaa     1080

ttttgtattt cccttaaatt ccataatcaa gactattcaa ccaagggttg aaaagaaaaa     1140

gcttgatggc tttatgggta gaattcgatc tgtctatcca gttgcgtcac caaatgaatg     1200

caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat tgtggtgaaa cttcatggca     1260

gacgggcgat tttgttaaag ccacttgcga attttgtggc actgagaatt tgactaaaga     1320

aggtgccact acttgtggtt acttacccca aaatgctgtt gttaaaattt attgtccagc     1380

atgtcacaat tcagaagtag gacctgagca tagtcttgcc gaataccata atgaatctgg     1440

cttgaaaacc attcttcgta agggtggtcg cactattgcc tttggaggct gtgtgttctc     1500

ttatgttggt tgccataaca agtgtgccta ttgggttcca cgtgctagcg ctaacatagg     1560

ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt cttaatgaca accttcttga     1620

aatactccaa aaagagaaag tcaacatcaa tattgttggt gactttaaac ttaatgaaga     1680

gatcgccatt attttggcat ctttttctgc ttccacaagt gcttttgtgg aaactgtgaa     1740

aggtttggat tataaagcat tcaaacaaat tgttgaatcc tgtggtaatt ttaaagttac     1800

aaaaggaaaa gctaaaaaag gtgcctggaa tattggtgaa cagaaatcaa tactgagtcc     1860

tctttatgca tttgcatcag aggctgctcg tgttgtacga tcaattttct cccgcactct     1920

tgaaactgct caaaattctg tgcgtgtttt acagaaggcc gctataacaa tactagatgg     1980

aatttcacag tattcactga gactcattga tgctatgatg ttcacatctg atttggctac     2040

taacaatcta gttgtaatgg cctacattac aggtggtgtt gttcagttga cttcgcagtg     2100

gctaactaac atctttggca ctgtttatga aaaactcaaa cccgtccttg attggcttga     2160

agagaagttt aaggaaggtg tagagtttct tagagacggt tgggaaattg ttaaatttat     2220

ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc acctgtgcaa aggaaattaa     2280

ggagagtgtt cagacattct ttaagcttgt aaataaattt ttggctttgt gtgctgactc     2340

tatcattatt ggtggagcta aacttaaagc cttgaattta ggtgaaacat ttgtcacgca     2400

ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa gaaactggcc tactcatgcc     2460

tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa acacttccca cagaagtgtt     2520

aacagaggaa gttgtcttga aaactggtga tttacaacca ttagaacaac ctactagtga     2580

agctgttgaa gctccattgg ttggtacacc agtttgtatt aacgggctta tgttgctcga     2640

aatcaaagac acagaaaagt actgtgccct tgcacctaat atgatggtaa caaacaatac     2700

cttcacactc aaaggcggtg caccaacaaa ggttactttt ggtgatgaca ctgtgataga     2760

agtgcaaggt tacaagagtg tgaatatcac ttttgaactt gatgaaagga ttgataaagt     2820

acttaatgag aagtgctctg cctatacagt tgaactcggt acagaagtaa atgagttcgc     2880

ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca gtatctgaat tacttacacc     2940

actgggcatt gatttagatg agtggagtat ggctacatac tacttatttg atgagtctgg     3000

tgagtttaaa ttggcttcac atatgtattg ttctttctac cctccagatg aggatgaaga     3060

agaaggtgat tgtgaagaag aagagtttga gccatcaact caatatgagt atggtactga     3120

agatgattac caaggtaaac ctttggaatt tggtgccact tctgctgctc ttcaacctga     3180

agaagagcaa gaagaagatt ggttagatga tgatagtcaa caaactgttg gtcaacaaga     3240

cggcagtgag gacaatcaga caactactat tcaaacaatt gttgaggttc aacctcaatt     3300

agagatggaa cttacaccag ttgttcagac tattgaagtg aatagtttta gtggttattt     3360

aaaacttact gacaatgtat acattaaaaa tgcagacatt gtggaagaag ctaaaaaggt     3420

aaaaccaaca gtggttgtta atgcagccaa tgtttacctt aaacatggag gaggtgttgc     3480

aggagcctta aataaggcta ctaacaatgc catgcaagtt gaatctgatg attacatagc     3540

tactaatgga ccacttaaag tgggtggtag ttgtgtttta agcggacaca atcttgctaa     3600

acactgtctt catgttgtcg gcccaaatgt taacaaaggt gaagacattc aacttcttaa     3660

gagtgcttat gaaaatttta atcagcacga agttctactt gcaccattat tatcagctgg     3720

tatttttggt gctgacccta tacattcttt aagagtttgt gtagatactg ttcgcacaaa     3780

tgtctactta gctgtctttg ataaaaatct ctatgacaaa cttgtttcaa gctttttgga     3840

aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag attcctaaag aggaagttaa     3900

gccatttata actgaaagta aaccttcagt tgaacagaga aaacaagatg ataagaaaat     3960

caaagcttgt gttgaagaag ttacaacaac tctggaagaa actaagttcc tcacagaaaa     4020

cttgttactt tatattgaca ttaatggcaa tcttcatcca gattctgcca ctcttgttag     4080

tgacattgac atcactttct taaagaaaga tgctccatat atagtgggtg atgttgttca     4140

agagggtgtt ttaactgctg tggttatacc tactaaaaag gctggtggca ctactgaaat     4200

gctagcgaaa gctttgagaa aagtgccaac agacaattat ataaccactt acccgggtca     4260

gggtttaaat ggttacactg tagaggaggc aaagacagtg cttaaaaagt gtaaaagtgc     4320

cttttacatt ctaccatcta ttatctctaa tgagaagcaa gaaattcttg gaactgtttc     4380

ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca cgcaaattaa tgcctgtctg     4440

tgtggaaact aaagccatag tttcaactat acagcgtaaa tataagggta ttaaaataca     4500

agagggtgtg gttgattatg gtgctagatt ttacttttac accagtaaaa caactgtagc     4560

gtcacttatc aacacactta acgatctaaa tgaaactctt gttacaatgc cacttggcta     4620

tgtaacacat ggcttaaatt tggaagaagc tgctcggtat atgagatctc tcaaagtgcc     4680

agctacagtt tctgtttctt cacctgatgc tgttacagcg tataatggtt atcttacttc     4740

ttcttctaaa acacctgaag aacattttat tgaaaccatc tcacttgctg gttcctataa     4800

agattggtcc tattctggac aatctacaca actaggtata gaatttctta agagaggtga     4860

taaaagtgta tattacacta gtaatcctac cacattccac ctagatggtg aagttatcac     4920

ctttgacaat cttaagacac ttctttcttt gagagaagtg aggactatta aggtgtttac     4980

aacagtagac aacattaacc tccacacgca agttgtggac atgtcaatga catatggaca     5040

acagtttggt ccaacttatt tggatggagc tgatgttact aaaataaaac ctcataattc     5100

acatgaaggt aaaacatttt atgttttacc taatgatgac actctacgtg ttgaggcttt     5160

tgagtactac cacacaactg atcctagttt tctgggtagg tacatgtcag cattaaatca     5220

cactaaaaag tggaaatacc cacaagttaa tggtttaact tctattaaat gggcagataa     5280

caactgttat cttgccactg cattgttaac actccaacaa atagagttga agtttaatcc     5340

acctgctcta caagatgctt attacagagc aagggctggt gaagctgcta acttttgtgc     5400

acttatctta gcctactgta ataagacagt aggtgagtta ggtgatgtta gagaaacaat     5460

gagttacttg tttcaacatg ccaatttaga ttcttgcaaa agagtcttga acgtggtgtg     5520

taaaacttgt ggacaacagc agacaaccct taagggtgta gaagctgtta tgtacatggg     5580

cacactttct tatgaacaat ttaagaaagg tgttcagata ccttgtacgt gtggtaaaca     5640

agctacaaaa tatctagtac aacaggagtc accttttgtt atgatgtcag caccacctgc     5700

tcagtatgaa cttaagcatg gtacatttac ttgtgctagt gagtacactg gtaattacca     5760

gtgtggtcac tataaacata taacttctaa agaaactttg tattgcatag acggtgcttt     5820

acttacaaag tcctcagaat acaaaggtcc tattacggat gttttctaca aagaaaacag     5880

ttacacaaca accataaaac cagttactta taaattggat ggtgttgttt gtacagaaat     5940

tgaccctaag ttggacaatt attataagaa agacaattct tatttcacag agcaaccaat     6000

tgatcttgta ccaaaccaac catatccaaa cgcaagcttc gataatttta agtttgtatg     6060

tgataatatc aaatttgctg atgatttaaa ccagttaact ggttataaga aacctgcttc     6120

aagagagctt aaagttacat ttttccctga cttaaatggt gatgtggtgg ctattgatta     6180

taaacactac acaccctctt ttaagaaagg agctaaattg ttacataaac ctattgtttg     6240

gcatgttaac aatgcaacta ataaagccac gtataaacca aatacctggt gtatacgttg     6300

tctttggagc acaaaaccag ttgaaacatc aaattcgttt gatgtactga agtcagagga     6360

cgcgcaggga atggataatc ttgcctgcga agatctaaaa ccagtctctg aagaagtagt     6420

ggaaaatcct accatacaga aagacgttct tgagtgtaat gtgaaaacta ccgaagttgt     6480

aggagacatt atacttaaac cagcaaataa tagtttaaaa attacagaag aggttggcca     6540

cacagatcta atggctgctt atgtagacaa ttctagtctt actattaaga aacctaatga     6600

attatctaga gtattaggtt tgaaaaccct tgctactcat ggtttagctg ctgttaatag     6660

tgtcccttgg gatactatag ctaattatgc taagcctttt cttaacaaag ttgttagtac     6720

aactactaac atagttacac ggtgtttaaa ccgtgtttgt actaattata tgccttattt     6780

ctttacttta ttgctacaat tgtgtacttt tactagaagt acaaattcta gaattaaagc     6840

atctatgccg actactatag caaagaatac tgttaagagt gtcggtaaat tttgtctaga     6900

ggcttcattt aattatttga agtcacctaa tttttctaaa ctgataaata ttataatttg     6960

gtttttacta ttaagtgttt gcctaggttc tttaatctac tcaaccgctg ctttaggtgt     7020

tttaatgtct aatttaggca tgccttctta ctgtactggt tacagagaag gctatttgaa     7080

ctctactaat gtcactattg caacctactg tactggttct ataccttgta gtgtttgtct     7140

tagtggttta gattctttag acacctatcc ttctttagaa actatacaaa ttaccatttc     7200

atcttttaaa tgggatttaa ctgcttttgg cttagttgca gagtggtttt tggcatatat     7260

tcttttcact aggtttttct atgtacttgg attggctgca atcatgcaat tgtttttcag     7320

ctattttgca gtacatttta ttagtaattc ttggcttatg tggttaataa ttaatcttgt     7380

acaaatggcc ccgatttcag ctatggttag aatgtacatc ttctttgcat cattttatta     7440

tgtatggaaa agttatgtgc atgttgtaga cggttgtaat tcatcaactt gtatgatgtg     7500

ttacaaacgt aatagagcaa caagagtcga atgtacaact attgttaatg gtgttagaag     7560

gtccttttat gtctatgcta atggaggtaa aggcttttgc aaactacaca attggaattg     7620

tgttaattgt gatacattct gtgctggtag tacatttatt agtgatgaag ttgcgagaga     7680

cttgtcacta cagtttaaaa gaccaataaa tcctactgac cagtcttctt acatcgttga     7740

tagtgttaca gtgaagaatg gttccatcca tctttacttt gataaagctg gtcaaaagac     7800

ttatgaaaga cattctctct ctcattttgt taacttagac aacctgagag ctaataacac     7860

taaaggttca ttgcctatta atgttatagt ttttgatggt aaatcaaaat gtgaagaatc     7920

atctgcaaaa tcagcgtctg tttactacag tcagcttatg tgtcaaccta tactgttact     7980

agatcaggca ttagtgtctg atgttggtga tagtgcggaa gttgcagtta aaatgtttga     8040

tgcttacgtt aatacgtttt catcaacttt taacgtacca atggaaaaac tcaaaacact     8100

agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc ttagacaatg tcttatctac     8160

ttttatttca gcagctcggc aagggtttgt tgattcagat gtagaaacta aagatgttgt     8220

tgaatgtctt aaattgtcac atcaatctga catagaagtt actggcgata gttgtaataa     8280

ctatatgctc acctataaca aagttgaaaa catgacaccc cgtgaccttg gtgcttgtat     8340

tgactgtagt gcgcgtcata ttaatgcgca ggtagcaaaa agtcacaaca ttgctttgat     8400

atggaacgtt aaagatttca tgtcattgtc tgaacaacta cgaaaacaaa tacgtagtgc     8460

tgctaaaaag aataacttac cttttaagtt gacatgtgca actactagac aagttgttaa     8520

tgttgtaaca acaaagatag cacttaaggg tggtaaaatt gttaataatt ggttgaagca     8580

gttaattaaa gttacacttg tgttcctttt tgttgctgct attttctatt taataacacc     8640

tgttcatgtc atgtctaaac atactgactt ttcaagtgaa atcataggat acaaggctat     8700

tgatggtggt gtcactcgtg acatagcatc tacagatact tgttttgcta acaaacatgc     8760

tgattttgac acatggttta gccagcgtgg tggtagttat actaatgaca aagcttgccc     8820

attgattgct gcagtcataa caagagaagt gggttttgtc gtgcctggtt tgcctggcac     8880

gatattacgc acaactaatg gtgacttttt gcatttctta cctagagttt ttagtgcagt     8940

tggtaacatc tgttacacac catcaaaact tatagagtac actgactttg caacatcagc     9000

ttgtgttttg gctgctgaat gtacaatttt taaagatgct tctggtaagc cagtaccata     9060

ttgttatgat accaatgtac tagaaggttc tgttgcttat gaaagtttac gccctgacac     9120

acgttatgtg ctcatggatg gctctattat tcaatttcct aacacctacc ttgaaggttc     9180

tgttagagtg gtaacaactt ttgattctga gtactgtagg cacggcactt gtgaaagatc     9240

agaagctggt gtttgtgtat ctactagtgg tagatgggta cttaacaatg attattacag     9300

atctttacca ggagttttct gtggtgtaga tgctgtaaat ttacttacta atatgtttac     9360

accactaatt caacctattg gtgctttgga catatcagca tctatagtag ctggtggtat     9420

tgtagctatc gtagtaacat gccttgccta ctattttatg aggtttagaa gagcttttgg     9480

tgaatacagt catgtagttg cctttaatac tttactattc cttatgtcat tcactgtact     9540

ctgtttaaca ccagtttact cattcttacc tggtgtttat tctgttattt acttgtactt     9600

gacattttat cttactaatg atgtttcttt tttagcacat attcagtgga tggttatgtt     9660

cacaccttta gtacctttct ggataacaat tgcttatatc atttgtattt ccacaaagca     9720

tttctattgg ttctttagta attacctaaa gagacgtgta gtctttaatg gtgtttcctt     9780

tagtactttt gaagaagctg cgctgtgcac ctttttgtta aataaagaaa tgtatctaaa     9840

gttgcgtagt gatgtgctat tacctcttac gcaatataat agatacttag ctctttataa     9900

taagtacaag tattttagtg gagcaatgga tacaactagc tacagagaag ctgcttgttg     9960

tcatctcgca aaggctctca atgacttcag taactcaggt tctgatgttc tttaccaacc    10020

accacaaacc tctatcacct cagctgtttt gcagagtggt tttagaaaaa tggcattccc    10080

atctggtaaa gttgagggtt gtatggtaca agtaacttgt ggtacaacta cacttaacgg    10140

tctttggctt gatgacgtag tttactgtcc aagacatgtg atctgcacct ctgaagacat    10200

gcttaaccct aattatgaag atttactcat tcgtaagtct aatcataatt tcttggtaca    10260

ggctggtaat gttcaactca gggttattgg acattctatg caaaattgtg tacttaagct    10320

taaggttgat acagccaatc ctaagacacc taagtataag tttgttcgca ttcaaccagg    10380

acagactttt tcagtgttag cttgttacaa tggttcacca tctggtgttt accaatgtgc    10440

tatgaggccc aatttcacta ttaagggttc attccttaat ggttcatgtg gtagtgttgg    10500

ttttaacata gattatgact gtgtctcttt ttgttacatg caccatatgg aattaccaac    10560

tggagttcat gctggcacag acttagaagg taacttttat ggaccttttg ttgacaggca    10620

aacagcacaa gcagctggta cggacacaac tattacagtt aatgttttag cttggttgta    10680

cgctgctgtt ataaatggag acaggtggtt tctcaatcga tttaccacaa ctcttaatga    10740

ctttaacctt gtggctatga agtacaatta tgaacctcta acacaagacc atgttgacat    10800

actaggacct ctttctgctc aaactggaat tgccgtttta gatatgtgtg cttcattaaa    10860

agaattactg caaaatggta tgaatggacg taccatattg ggtagtgctt tattagaaga    10920

tgaatttaca ccttttgatg ttgttagaca atgctcaggt gttactttcc aaagtgcagt    10980

gaaaagaaca atcaagggta cacaccactg gttgttactc acaattttga cttcactttt    11040

agttttagtc cagagtactc aatggtcttt gttctttttt ttgtatgaaa atgccttttt    11100

accttttgct atgggtatta ttgctatgtc tgcttttgca atgatgtttg tcaaacataa    11160

gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc actgtagctt attttaatat    11220

ggtctatatg cctgctagtt gggtgatgcg tattatgaca tggttggata tggttgatac    11280

tagtttgtct ggttttaagc taaaagactg tgttatgtat gcatcagctg tagtgttact    11340

aatccttatg acagcaagaa ctgtgtatga tgatggtgct aggagagtgt ggacacttat    11400

gaatgtcttg acactcgttt ataaagttta ttatggtaat gctttagatc aagccatttc    11460

catgtgggct cttataatct ctgttacttc taactactca ggtgtagtta caactgtcat    11520

gtttttggcc agaggtattg tttttatgtg tgttgagtat tgccctattt tcttcataac    11580

tggtaataca cttcagtgta taatgctagt ttattgtttc ttaggctatt tttgtacttg    11640

ttactttggc ctcttttgtt tactcaaccg ctactttaga ctgactcttg gtgtttatga    11700

ttacttagtt tctacacagg agtttagata tatgaattca cagggactac tcccacccaa    11760

gaatagcata gatgccttca aactcaacat taaattgttg ggtgttggtg gcaaaccttg    11820

tatcaaagta gccactgtac agtctaaaat gtcagatgta aagtgcacat cagtagtctt    11880

actctcagtt ttgcaacaac tcagagtaga atcatcatct aaattgtggg ctcaatgtgt    11940

ccagttacac aatgacattc tcttagctaa agatactact gaagcctttg aaaaaatggt    12000

ttcactactt tctgttttgc tttccatgca gggtgctgta gacataaaca agctttgtga    12060

agaaatgctg gacaacaggg caaccttaca agctatagcc tcagagttta gttcccttcc    12120

atcatatgca gcttttgcta ctgctcaaga agcttatgag caggctgttg ctaatggtga    12180

ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat gtggctaaat ctgaatttga    12240

ccgtgatgca gccatgcaac gtaagttgga aaagatggct gatcaagcta tgacccaaat    12300

gtataaacag gctagatctg aggacaagag ggcaaaagtt actagtgcta tgcagacaat    12360

gcttttcact atgcttagaa agttggataa tgatgcactc aacaacatta tcaacaatgc    12420

aagagatggt tgtgttccct tgaacataat acctcttaca acagcagcca aactaatggt    12480

tgtcatacca gactataaca catataaaaa tacgtgtgat ggtacaacat ttacttatgc    12540

atcagcattg tgggaaatcc aacaggttgt agatgcagat agtaaaattg ttcaacttag    12600

tgaaattagt atggacaatt cacctaattt agcatggcct cttattgtaa cagctttaag    12660

ggccaattct gctgtcaaat tacagaataa tgagcttagt cctgttgcac tacgacagat    12720

gtcttgtgct gccggtacta cacaaactgc ttgcactgat gacaatgcgt tagcttacta    12780

caacacaaca aagggaggta ggtttgtact tgcactgtta tccgatttac aggatttgaa    12840

atgggctaga ttccctaaga gtgatggaac tggtactatc tatacagaac tggaaccacc    12900

ttgtaggttt gttacagaca cacctaaagg tcctaaagtg aagtatttat actttattaa    12960

aggattaaac aacctaaata gaggtatggt acttggtagt ttagctgcca cagtacgtct    13020

acaagctggt aatgcaacag aagtgcctgc caattcaact gtattatctt tctgtgcttt    13080

tgctgtagat gctgctaaag cttacaaaga ttatctagct agtgggggac aaccaatcac    13140

taattgtgtt aagatgttgt gtacacacac tggtactggt caggcaataa cagttacacc    13200

ggaagccaat atggatcaag aatcctttgg tggtgcatcg tgttgtctgt actgccgttg    13260

ccacatagat catccaaatc ctaaaggatt ttgtgactta aaaggtaagt atgtacaaat    13320

acctacaact tgtgctaatg accctgtggg ttttacactt aaaaacacag tctgtaccgt    13380

ctgcggtatg tggaaaggtt atggctgtag ttgtgatcaa ctccgcgaac ccatgcttca    13440

gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg taagtgcagc ccgtcttaca    13500

ccgtgcggca caggcactag tactgatgtc gtatacaggg cttttgacat ctacaatgat    13560

aaagtagctg gttttgctaa attcctaaaa actaattgtt gtcgcttcca agaaaaggac    13620

gaagatgaca atttaattga ttcttacttt gtagttaaga gacacacttt ctctaactac    13680

caacatgaag aaacaattta taatttactt aaggattgtc cagctgttgc taaacatgac    13740

ttctttaagt ttagaataga cggtgacatg gtaccacata tatcacgtca acgtcttact    13800

aaatacacaa tggcagacct cgtctatgct ttaaggcatt ttgatgaagg taattgtgac    13860

acattaaaag aaatacttgt cacatacaat tgttgtgatg atgattattt caataaaaag    13920

gactggtatg attttgtaga aaacccagat atattacgcg tatacgccaa cttaggtgaa    13980

cgtgtacgcc aagctttgtt aaaaacagta caattctgtg atgccatgcg aaatgctggt    14040

attgttggtg tactgacatt agataatcaa gatctcaatg gtaactggta tgatttcggt    14100

gatttcatac aaaccacgcc aggtagtgga gttcctgttg tagattctta ttattcattg    14160

ttaatgccta tattaacctt gaccagggct ttaactgcag agtcacatgt tgacactgac    14220

ttaacaaagc cttacattaa gtgggatttg ttaaaatatg acttcacgga agagaggtta    14280

aaactctttg accgttattt taaatattgg gatcagacat accacccaaa ttgtgttaac    14340

tgtttggatg acagatgcat tctgcattgt gcaaacttta atgttttatt ctctacagtg    14400

ttcccaccta caagttttgg accactagtg agaaaaatat ttgttgatgg tgttccattt    14460

gtagtttcaa ctggatacca cttcagagag ctaggtgttg tacataatca ggatgtaaac    14520

ttacatagct ctagacttag ttttaaggaa ttacttgtgt atgctgctga ccctgctatg    14580

cacgctgctt ctggtaatct attactagat aaacgcacta cgtgcttttc agtagctgca    14640

cttactaaca atgttgcttt tcaaactgtc aaacccggta attttaacaa agacttctat    14700

gactttgctg tgtctaaggg tttctttaag gaaggaagtt ctgttgaatt aaaacacttc    14760

ttctttgctc aggatggtaa tgctgctatc agcgattatg actactatcg ttataatcta    14820

ccaacaatgt gtgatatcag acaactacta tttgtagttg aagttgttga taagtacttt    14880

gattgttacg atggtggctg tattaatgct aaccaagtca tcgtcaacaa cctagacaaa    14940

tcagctggtt ttccatttaa taaatggggt aaggctagac tttattatga ttcaatgagt    15000

tatgaggatc aagatgcact tttcgcatat acaaaacgta atgtcatccc tactataact    15060

caaatgaatc ttaagtatgc cattagtgca aagaatagag ctcgcaccgt agctggtgtc    15120

tctatctgta gtactatgac caatagacag tttcatcaaa aattattgaa atcaatagcc    15180

gccactagag gagctactgt agtaattgga acaagcaaat tctatggtgg ttggcacaac    15240

atgttaaaaa ctgtttatag tgatgtagaa aaccctcacc ttatgggttg ggattatcct    15300

aaatgtgata gagccatgcc taacatgctt agaattatgg cctcacttgt tcttgctcgc    15360

aaacatacaa cgtgttgtag cttgtcacac cgtttctata gattagctaa tgagtgtgct    15420

caagtattga gtgaaatggt catgtgtggc ggttcactat atgttaaacc aggtggaacc    15480

tcatcaggag atgccacaac tgcttatgct aatagtgttt ttaacatttg tcaagctgtc    15540

acggccaatg ttaatgcact tttatctact gatggtaaca aaattgccga taagtatgtc    15600

cgcaatttac aacacagact ttatgagtgt ctctatagaa atagagatgt tgacacagac    15660

tttgtgaatg agttttacgc atatttgcgt aaacatttct caatgatgat actctctgac    15720

gatgctgttg tgtgtttcaa tagcacttat gcatctcaag gtctagtggc tagcataaag    15780

aactttaagt cagttcttta ttatcaaaac aatgttttta tgtctgaagc aaaatgttgg    15840

actgagactg accttactaa aggacctcat gaattttgct ctcaacatac aatgctagtt    15900

aaacagggtg atgattatgt gtaccttcct tacccagatc catcaagaat cctaggggcc    15960

ggctgttttg tagatgatat cgtaaaaaca gatggtacac ttatgattga acggttcgtg    16020

tctttagcta tagatgctta cccacttact aaacatccta atcaggagta tgctgatgtc    16080

tttcatttgt acttacaata cataagaaag ctacatgatg agttaacagg acacatgtta    16140

gacatgtatt ctgttatgct tactaatgat aacacttcaa ggtattggga acctgagttt    16200

tatgaggcta tgtacacacc gcatacagtc ttacaggctg ttggggcttg tgttctttgc    16260

aattcacaga cttcattaag atgtggtgct tgcatacgta gaccattctt atgttgtaaa    16320

tgctgttacg accatgtcat atcaacatca cataaattag tcttgtctgt taatccgtat    16380

gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc aactttactt aggaggtatg    16440

agctattatt gtaaatcaca taaaccaccc attagttttc cattgtgtgc taatggacaa    16500

gtttttggtt tatataaaaa tacatgtgtt ggtagcgata atgttactga ctttaatgca    16560

attgcaacat gtgactggac aaatgctggt gattacattt tagctaacac ctgtactgaa    16620

agactcaagc tttttgcagc agaaacgctc aaagctactg aggagacatt taaactgtct    16680

tatggtattg ctactgtacg tgaagtgctg tctgacagag aattacatct ttcatgggaa    16740

gttggtaaac ctagaccacc acttaaccga aattatgtct ttactggtta tcgtgtaact    16800

aaaaacagta aagtacaaat aggagagtac acctttgaaa aaggtgacta tggtgatgct    16860

gttgtttacc gaggtacaac aacttacaaa ttaaatgttg gtgattattt tgtgctgaca    16920

tcacatacag taatgccatt aagtgcacct acactagtgc cacaagagca ctatgttaga    16980

attactggct tatacccaac actcaatatc tcagatgagt tttctagcaa tgttgcaaat    17040

tatcaaaagg ttggtatgca aaagtattct acactccagg gaccacctgg tactggtaag    17100

agtcattttg ctattggcct agctctctac tacccttctg ctcgcatagt gtatacagct    17160

tgctctcatg ccgctgttga tgcactatgt gagaaggcat taaaatattt gcctatagat    17220

aaatgtagta gaattatacc tgcacgtgct cgtgtagagt gttttgataa attcaaagtg    17280

aattcaacat tagaacagta tgtcttttgt actgtaaatg cattgcctga gacgacagca    17340

gatatagttg tctttgatga aatttcaatg gccacaaatt atgatttgag tgttgtcaat    17400

gccagattac gtgctaagca ctatgtgtac attggcgacc ctgctcaatt acctgcacca    17460

cgcacattgc taactaaggg cacactagaa ccagaatatt tcaattcagt gtgtagactt    17520

atgaaaacta taggtccaga catgttcctc ggaacttgtc ggcgttgtcc tgctgaaatt    17580

gttgacactg tgagtgcttt ggtttatgat aataagctta aagcacataa agacaaatca    17640

gctcaatgct ttaaaatgtt ttataagggt gttatcacgc atgatgtttc atctgcaatt    17700

aacaggccac aaataggcgt ggtaagagaa ttccttacac gtaaccctgc ttggagaaaa    17760

gctgtcttta tttcacctta taattcacag aatgctgtag cctcaaagat tttgggacta    17820

ccaactcaaa ctgttgattc atcacagggc tcagaatatg actatgtcat attcactcaa    17880

accactgaaa cagctcactc ttgtaatgta aacagattta atgttgctat taccagagca    17940

aaagtaggca tactttgcat aatgtctgat agagaccttt atgacaagtt gcaatttaca    18000

agtcttgaaa ttccacgtag gaatgtggca actttacaag ctgaaaatgt aacaggactc    18060

tttaaagatt gtagtaaggt aatcactggg ttacatccta cacaggcacc tacacacctc    18120

agtgttgaca ctaaattcaa aactgaaggt ttatgtgttg acatacctgg catacctaag    18180

gacatgacct atagaagact catctctatg atgggtttta aaatgaatta tcaagttaat    18240

ggttacccta acatgtttat cacccgcgaa gaagctataa gacatgtacg tgcatggatt    18300

ggcttcgatg tcgaggggtg tcatgctact agagaagctg ttggtaccaa tttaccttta    18360

cagctaggtt tttctacagg tgttaaccta gttgctgtac ctacaggtta tgttgataca    18420

cctaataata cagatttttc cagagttagt gctaaaccac cgcctggaga tcaatttaaa    18480

cacctcatac cacttatgta caaaggactt ccttggaatg tagtgcgtat aaagattgta    18540

caaatgttaa gtgacacact taaaaatctc tctgacagag tcgtatttgt cttatgggca    18600

catggctttg agttgacatc tatgaagtat tttgtgaaaa taggacctga gcgcacctgt    18660

tgtctatgtg atagacgtgc cacatgcttt tccactgctt cagacactta tgcctgttgg    18720

catcattcta ttggatttga ttacgtctat aatccgttta tgattgatgt tcaacaatgg    18780

ggttttacag gtaacctaca aagcaaccat gatctgtatt gtcaagtcca tggtaatgca    18840

catgtagcta gttgtgatgc aatcatgact aggtgtctag ctgtccacga gtgctttgtt    18900

aagcgtgttg actggactat tgaatatcct ataattggtg atgaactgaa gattaatgcg    18960

gcttgtagaa aggttcaaca catggttgtt aaagctgcat tattagcaga caaattccca    19020

gttcttcacg acattggtaa ccctaaagct attaagtgtg tacctcaagc tgatgtagaa    19080

tggaagttct atgatgcaca gccttgtagt gacaaagctt ataaaataga agaattattc    19140

tattcttatg ccacacattc tgacaaattc acagatggtg tatgcctatt ttggaattgc    19200

aatgtcgata gatatcctgc taattccatt gtttgtagat ttgacactag agtgctatct    19260

aaccttaact tgcctggttg tgatggtggc agtttgtatg taaataaaca tgcattccac    19320

acaccagctt ttgataaaag tgcttttgtt aatttaaaac aattaccatt tttctattac    19380

tctgacagtc catgtgagtc tcatggaaaa caagtagtgt cagatataga ttatgtacca    19440

ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg gtgctgtctg tagacatcat    19500

gctaatgagt acagattgta tctcgatgct tataacatga tgatctcagc tggctttagc    19560

ttgtgggttt acaaacaatt tgatacttat aacctctgga acacttttac aagacttcag    19620

agtttagaaa atgtggcttt taatgttgta aataagggac actttgatgg acaacagggt    19680

gaagtaccag tttctatcat taataacact gtttacacaa aagttgatgg tgttgatgta    19740

gaattgtttg aaaataaaac aacattacct gttaatgtag catttgagct ttgggctaag    19800

cgcaacatta aaccagtacc agaggtgaaa atactcaata atttgggtgt ggacattgct    19860

gctaatactg tgatctggga ctacaaaaga gatgctccag cacatatatc tactattggt    19920

gtttgttcta tgactgacat agccaagaaa ccaactgaaa cgatttgtgc accactcact    19980

gtcttttttg atggtagagt tgatggtcaa gtagacttat ttagaaatgc ccgtaatggt    20040

gttcttatta cagaaggtag tgttaaaggt ttacaaccat ctgtaggtcc caaacaagct    20100

agtcttaatg gagtcacatt aattggagaa gccgtaaaaa cacagttcaa ttattataag    20160

aaagttgatg gtgttgtcca acaattacct gaaacttact ttactcagag tagaaattta    20220

caagaattta aacccaggag tcaaatggaa attgatttct tagaattagc tatggatgaa    20280

ttcattgaac ggtataaatt agaaggctat gccttcgaac atatcgttta tggagatttt    20340

agtcatagtc agttaggtgg tttacatcta ctgattggac tagctaaacg ttttaaggaa    20400

tcaccttttg aattagaaga ttttattcct atggacagta cagttaaaaa ctatttcata    20460

acagatgcgc aaacaggttc atctaagtgt gtgtgttctg ttattgattt attacttgat    20520

gattttgttg aaataataaa atcccaagat ttatctgtag tttctaaggt tgtcaaagtg    20580

actattgact atacagaaat ttcatttatg ctttggtgta aagatggcca tgtagaaaca    20640

ttttacccaa aattacaatc tagtcaagcg tggcaaccgg gtgttgctat gcctaatctt    20700

tacaaaatgc aaagaatgct attagaaaag tgtgaccttc aaaattatgg tgatagtgca    20760

acattaccta aaggcataat gatgaatgtc gcaaaatata ctcaactgtg tcaatattta    20820

aacacattaa cattagctgt accctataat atgagagtta tacattttgg tgctggttct    20880

gataaaggag ttgcaccagg tacagctgtt ttaagacagt ggttgcctac gggtacgctg    20940

cttgtcgatt cagatcttaa tgactttgtc tctgatgcag attcaacttt gattggtgat    21000

tgtgcaactg tacatacagc taataaatgg gatctcatta ttagtgatat gtacgaccct    21060

aagactaaaa atgttacaaa agaaaatgac tctaaagagg gttttttcac ttacatttgt    21120

gggtttatac aacaaaagct agctcttgga ggttccgtgg ctataaagat aacagaacat    21180

tcttggaatg ctgatcttta taagctcatg ggacacttcg catggtggac agcctttgtt    21240

actaatgtga atgcgtcatc atctgaagca tttttaattg gatgtaatta tcttggcaaa    21300

ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt acatattttg gaggaataca    21360

aatccaattc agttgtcttc ctattcttta tttgacatga gtaaatttcc ccttaaatta    21420

aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca atgatatgat tttatctctt    21480

cttagtaaag gtagacttat aattagagaa aacaacagag ttgttatttc tagtgatgtt    21540

cttgttaaca actaaacgaa caatgtttgt ttttcttgtt ttattgccac tagtctctag    21600

tcagtgtgtt aatcttacaa ccagaactca attaccccct gcatacacta attctttcac    21660

acgtggtgtt tattaccctg acaaagtttt cagatcctca gttttacatt caactcagga    21720

cttgttctta cctttctttt ccaatgttac ttggttccat gctatacatg tctctgggac    21780

caatggtact aagaggtttg ataaccctgt cctaccattt aatgatggtg tttattttgc    21840

ttccactgag aagtctaaca taataagagg ctggattttt ggtactactt tagattcgaa    21900

gacccagtcc ctacttattg ttaataacgc tactaatgtt gttattaaag tctgtgaatt    21960

tcaattttgt aatgatccat ttttgggtgt ttattaccac aaaaacaaca aaagttggat    22020

ggaaagtgag ttcagagttt attctagtgc gaataattgc acttttgaat atgtctctca    22080

gccttttctt atggaccttg aaggaaaaca gggtaatttc aaaaatctta gggaatttgt    22140

gtttaagaat attgatggtt attttaaaat atattctaag cacacgccta ttaatttagt    22200

gcgtgatctc cctcagggtt tttcggcttt agaaccattg gtagatttgc caataggtat    22260

taacatcact aggtttcaaa ctttacttgc tttacataga agttatttga ctcctggtga    22320

ttcttcttca ggttggacag ctggtgctgc agcttattat gtgggttatc ttcaacctag    22380

gacttttcta ttaaaatata atgaaaatgg aaccattaca gatgctgtag actgtgcact    22440

tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc actgtagaaa aaggaatcta    22500

tcaaacttct aactttagag tccaaccaac agaatctatt gttagatttc ctaatattac    22560

aaacttgtgc ccttttggtg aagtttttaa cgccaccaga tttgcatctg tttatgcttg    22620

gaacaggaag agaatcagca actgtgttgc tgattattct gtcctatata attccgcatc    22680

attttccact tttaagtgtt atggagtgtc tcctactaaa ttaaatgatc tctgctttac    22740

taatgtctat gcagattcat ttgtaattag aggtgatgaa gtcagacaaa tcgctccagg    22800

gcaaactgga aagattgctg attataatta taaattacca gatgatttta caggctgcgt    22860

tatagcttgg aattctaaca atcttgattc taaggttggt ggtaattata attacctgta    22920

tagattgttt aggaagtcta atctcaaacc ttttgagaga gatatttcaa ctgaaatcta    22980

tcaggccggt agcacacctt gtaatggtgt tgaaggtttt aattgttact ttcctttaca    23040

atcatatggt ttccaaccca ctaatggtgt tggttaccaa ccatacagag tagtagtact    23100

ttcttttgaa cttctacatg caccagcaac tgtttgtgga cctaaaaagt ctactaattt    23160

ggttaaaaac aaatgtgtca atttcaactt caatggttta acaggcacag gtgttcttac    23220

tgagtctaac aaaaagtttc tgcctttcca acaatttggc agagacattg ctgacactac    23280

tgatgctgtc cgtgatccac agacacttga gattcttgac attacaccat gttcttttgg    23340

tggtgtcagt gttataacac caggaacaaa tacttctaac caggttgctg ttctttatca    23400

ggatgttaac tgcacagaag tccctgttgc tattcatgca gatcaactta ctcctacttg    23460

gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt gcaggctgtt taataggggc    23520

tgaacatgtc aacaactcat atgagtgtga catacccatt ggtgcaggta tatgcgctag    23580

ttatcagact cagactaatt ctcctcggcg ggcacgtagt gtagctagtc aatccatcat    23640

tgcctacact atgtcacttg gtgcagaaaa ttcagttgct tactctaata actctattgc    23700

catacccaca aattttacta ttagtgttac cacagaaatt ctaccagtgt ctatgaccaa    23760

gacatcagta gattgtacaa tgtacatttg tggtgattca actgaatgca gcaatctttt    23820

gttgcaatat ggcagttttt gtacacaatt aaaccgtgct ttaactggaa tagctgttga    23880

acaagacaaa aacacccaag aagtttttgc acaagtcaaa caaatttaca aaacaccacc    23940

aattaaagat tttggtggtt ttaatttttc acaaatatta ccagatccat caaaaccaag    24000

caagaggtca tttattgaag atctactttt caacaaagtg acacttgcag atgctggctt    24060

catcaaacaa tatggtgatt gccttggtga tattgctgct agagacctca tttgtgcaca    24120

aaagtttaac ggccttactg ttttgccacc tttgctcaca gatgaaatga ttgctcaata    24180

cacttctgca ctgttagcgg gtacaatcac ttctggttgg acctttggtg caggtgctgc    24240

attacaaata ccatttgcta tgcaaatggc ttataggttt aatggtattg gagttacaca    24300

gaatgttctc tatgagaacc aaaaattgat tgccaaccaa tttaatagtg ctattggcaa    24360

aattcaagac tcactttctt ccacagcaag tgcacttgga aaacttcaag atgtggtcaa    24420

ccaaaatgca caagctttaa acacgcttgt taaacaactt agctccaatt ttggtgcaat    24480

ttcaagtgtt ttaaatgata tcctttcacg tcttgacaaa gttgaggctg aagtgcaaat    24540

tgataggttg atcacaggca gacttcaaag tttgcagaca tatgtgactc aacaattaat    24600

tagagctgca gaaatcagag cttctgctaa tcttgctgct actaaaatgt cagagtgtgt    24660

acttggacaa tcaaaaagag ttgatttttg tggaaagggc tatcatctta tgtccttccc    24720

tcagtcagca cctcatggtg tagtcttctt gcatgtgact tatgtccctg cacaagaaaa    24780

gaacttcaca actgctcctg ccatttgtca tgatggaaaa gcacactttc ctcgtgaagg    24840

tgtctttgtt tcaaatggca cacactggtt tgtaacacaa aggaattttt atgaaccaca    24900

aatcattact acagacaaca catttgtgtc tggtaactgt gatgttgtaa taggaattgt    24960

caacaacaca gtttatgatc ctttgcaacc tgaattagac tcattcaagg aggagttaga    25020

taaatatttt aagaatcata catcaccaga tgttgattta ggtgacatct ctggcattaa    25080

tgcttcagtt gtaaacattc aaaaagaaat tgaccgcctc aatgaggttg ccaagaattt    25140

aaatgaatct ctcatcgatc tccaagaact tggaaagtat gagcagtata taaaatggcc    25200

atggtacatt tggctaggtt ttatagctgg cttgattgcc atagtaatgg tgacaattat    25260

gctttgctgt atgaccagtt gctgtagttg tctcaagggc tgttgttctt gtggatcctg    25320

ctgcaaattt gatgaagacg actctgagcc agtgctcaaa ggagtcaaat tacattacac    25380

ataaacgaac ttatggattt gtttatgaga atcttcacaa ttggaactgt aactttgaag    25440

caaggtgaaa tcaaggatgc tactccttca gattttgttc gcgctactgc aacgataccg    25500

atacaagcct cactcccttt cggatggctt attgttggcg ttgcacttct tgctgttttt    25560

cagagcgctt ccaaaatcat aaccctcaaa aagagatggc aactagcact ctccaagggt    25620

gttcactttg tttgcaactt gctgttgttg tttgtaacag tttactcaca ccttttgctc    25680

gttgctgctg gccttgaagc cccttttctc tatctttatg ctttagtcta cttcttgcag    25740

agtataaact ttgtaagaat aataatgagg ctttggcttt gctggaaatg ccgttccaaa    25800

aacccattac tttatgatgc caactatttt ctttgctggc atactaattg ttacgactat    25860

tgtatacctt acaatagtgt aacttcttca attgtcatta cttcaggtga tggcacaaca    25920

agtcctattt ctgaacatga ctaccagatt ggtggttata ctgaaaaatg ggaatctgga    25980

gtaaaagact gtgttgtatt acacagttac ttcacttcag actattacca gctgtactca    26040

actcaattga gtacagacac tggtgttgaa catgttacct tcttcatcta caataaaatt    26100

gttgatgagc ctgaagaaca tgtccaaatt cacacaatcg acggttcatc cggagttgtt    26160

aatccagtaa tggaaccaat ttatgatgaa ccgacgacga ctactagcgt gcctttgtaa    26220

gcacaagctg atgagtacga acttatgtac tcattcgttt cggaagagac aggtacgtta    26280

atagttaata gcgtacttct ttttcttgct ttcgtggtat tcttgctagt tacactagcc    26340

atccttactg cgcttcgatt gtgtgcgtac tgctgcaata ttgttaacgt gagtcttgta    26400

aaaccttctt tttacgttta ctctcgtgtt aaaaatctga attcttctag agttcctgat    26460

cttctggtct aaacgaacta aatattatat tagtttttct gtttggaact ttaattttag    26520

ccatggcaga ttccaacggt actattaccg ttgaagagct taaaaagctc cttgaacaat    26580

ggaacctagt aataggtttc ctattcctta catggatttg tcttctacaa tttgcctatg    26640

ccaacaggaa taggtttttg tatataatta agttaatttt cctctggctg ttatggccag    26700

taactttagc ttgttttgtg cttgctgctg tttacagaat aaattggatc accggtggaa    26760

ttgctatcgc aatggcttgt cttgtaggct tgatgtggct cagctacttc attgcttctt    26820

tcagactgtt tgcgcgtacg cgttccatgt ggtcattcaa tccagaaact aacattcttc    26880

tcaacgtgcc actccatggc actattctga ccagaccgct tctagaaagt gaactcgtaa    26940

tcggagctgt gatccttcgt ggacatcttc gtattgctgg acaccatcta ggacgctgtg    27000

acatcaagga cctgcctaaa gaaatcactg ttgctacatc acgaacgctt tcttattaca    27060

aattgggagc ttcgcagcgt gtagcaggtg actcaggttt tgctgcatac agtcgctaca    27120

ggattggcaa ctataaatta aacacagacc attccagtag cagtgacaat attgctttgc    27180

ttgtacagta agtgacaaca gatgtttcat ctcgttgact ttcaggttac tatagcagag    27240

atattactaa ttattatgag gacttttaaa gtttccattt ggaatcttga ttacatcata    27300

aacctcataa ttaaaaattt atctaagtca ctaactgaga ataaatattc tcaattagat    27360

gaagagcaac caatggagat tgattaaacg aacatgaaaa ttattctttt cttggcactg    27420

ataacactcg ctacttgtga gctttatcac taccaagagt gtgttagagg tacaacagta    27480

cttttaaaag aaccttgctc ttctggaaca tacgagggca attcaccatt tcatcctcta    27540

gctgataaca aatttgcact gacttgcttt agcactcaat ttgcttttgc ttgtcctgac    27600

ggcgtaaaac acgtctatca gttacgtgcc agatcagttt cacctaaact gttcatcaga    27660

caagaggaag ttcaagaact ttactctcca atttttctta ttgttgcggc aatagtgttt    27720

ataacacttt gcttcacact caaaagaaag acagaatgat tgaactttca ttaattgact    27780

tctatttgtg ctttttagcc tttctgctat tccttgtttt aattatgctt attatctttt    27840

ggttctcact tgaactgcaa gatcataatg aaacttgtca cgcctaaacg aacatgaaat    27900

ttcttgtttt cttaggaatc atcacaactg tagctgcatt tcaccaagaa tgtagtttac    27960

agtcatgtac tcaacatcaa ccatatgtag ttgatgaccc gtgtcctatt cacttctatt    28020

ctaaatggta tattagagta ggagctagaa aatcagcacc tttaattgaa ttgtgcgtgg    28080

atgaggctgg ttctaaatca cccattcagt acatcgatat cggtaattat acagtttcct    28140

gtttaccttt tacaattaat tgccaggaac ctaaattggg tagtcttgta gtgcgttgtt    28200

cgttctatga agacttttta gagtatcatg acgttcgtgt tgttttagat ttcatctaaa    28260

cgaacaaact aaaatgtctg ataatggacc ccaaaatcag cgaaatgcac cccgcattac    28320

gtttggtgga ccctcagatt caactggcag taaccagaat ggagaacgca gtggggcgcg    28380

atcaaaacaa cgtcggcccc aaggtttacc caataatact gcgtcttggt tcaccgctct    28440

cactcaacat ggcaaggaag accttaaatt ccctcgagga caaggcgttc caattaacac    28500

caatagcagt ccagatgacc aaattggcta ctaccgaaga gctaccagac gaattcgtgg    28560

tggtgacggt aaaatgaaag atctcagtcc aagatggtat ttctactacc taggaactgg    28620

gccagaagct ggacttccct atggtgctaa caaagacggc atcatatggg ttgcaactga    28680

gggagccttg aatacaccaa aagatcacat tggcacccgc aatcctgcta acaatgctgc    28740

aatcgtgcta caacttcctc aaggaacaac attgccaaaa ggcttctacg cagaagggag    28800

cagaggcggc agtcaagcct cttctcgttc ctcatcacgt agtcgcaaca gttcaagaaa    28860

ttcaactcca ggcagcagta ggggaacttc tcctgctaga atggctggca atggcggtga    28920

tgctgctctt gctttgctgc tgcttgacag attgaaccag cttgagagca aaatgtctgg    28980

taaaggccaa caacaacaag gccaaactgt cactaagaaa tctgctgctg aggcttctaa    29040

gaagcctcgg caaaaacgta ctgccactaa agcatacaat gtaacacaag ctttcggcag    29100

acgtggtcca gaacaaaccc aaggaaattt tggggaccag gaactaatca gacaaggaac    29160

tgattacaaa cattggccgc aaattgcaca atttgccccc agcgcttcag cgttcttcgg    29220

aatgtcgcgc attggcatgg aagtcacacc ttcgggaacg tggttgacct acacaggtgc    29280

catcaaattg gatgacaaag atccaaattt caaagatcaa gtcattttgc tgaataagca    29340

tattgacgca tacaaaacat tcccaccaac agagcctaaa aaggacaaaa agaagaaggc    29400

tgatgaaact caagccttac cgcagagaca gaagaaacag caaactgtga ctcttcttcc    29460

tgctgcagat ttggatgatt tctccaaaca attgcaacaa tccatgagca gtgctgactc    29520

aactcaggcc taaactcatg cagaccacac aaggcagatg ggctatataa acgttttcgc    29580

ttttccgttt acgatatata gtctactctt gtgcagaatg aattctcgta actacatagc    29640

acaagtagat gtagttaact ttaatctcac atagcaatct ttaatcagtg tgtaacatta    29700

gggaggactt gaaagagcca ccacattttc accgaggcca cgcggagtac gatcgagtgt    29760

acagtgaaca atgctaggga gagctgccta tatggaagag ccctaatgtg taaaattaat    29820

tttagtagtg ctatccccat gtgattttaa tagcttctta ggagaatgac aaaaaaaaaa    29880

aaaaaaaaaa aaaaaaaaaa aaa                                            29903


<210>  2
<211>  1273
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein with native signal peptide

<400>  2

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
            340                 345                 350         


Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
        355                 360                 365             


Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
    370                 375                 380                 


Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
385                 390                 395                 400 


Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                405                 410                 415     


Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
            420                 425                 430         


Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
        435                 440                 445             


Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
    450                 455                 460                 


Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
465                 470                 475                 480 


Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                485                 490                 495     


Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
            500                 505                 510         


Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
        515                 520                 525             


Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
    530                 535                 540                 


Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
545                 550                 555                 560 


Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                565                 570                 575     


Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
            580                 585                 590         


Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
        595                 600                 605             


Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
    610                 615                 620                 


His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
625                 630                 635                 640 


Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                645                 650                 655     


Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
            660                 665                 670         


Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala 
        675                 680                 685             


Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser 
    690                 695                 700                 


Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile 
705                 710                 715                 720 


Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val 
                725                 730                 735     


Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu 
            740                 745                 750         


Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr 
        755                 760                 765             


Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln 
    770                 775                 780                 


Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe 
785                 790                 795                 800 


Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser 
                805                 810                 815     


Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly 
            820                 825                 830         


Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp 
        835                 840                 845             


Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu 
    850                 855                 860                 


Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly 
865                 870                 875                 880 


Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile 
                885                 890                 895     


Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr 
            900                 905                 910         


Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn 
        915                 920                 925             


Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala 
    930                 935                 940                 


Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn 
945                 950                 955                 960 


Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val 
                965                 970                 975     


Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln 
            980                 985                 990         


Ile Asp Arg Leu Ile Thr Gly Arg  Leu Gln Ser Leu Gln  Thr Tyr Val 
        995                 1000                 1005             


Thr Gln  Gln Leu Ile Arg Ala  Ala Glu Ile Arg Ala  Ser Ala Asn 
    1010                 1015                 1020             


Leu Ala  Ala Thr Lys Met Ser  Glu Cys Val Leu Gly  Gln Ser Lys 
    1025                 1030                 1035             


Arg Val  Asp Phe Cys Gly Lys  Gly Tyr His Leu Met  Ser Phe Pro 
    1040                 1045                 1050             


Gln Ser  Ala Pro His Gly Val  Val Phe Leu His Val  Thr Tyr Val 
    1055                 1060                 1065             


Pro Ala  Gln Glu Lys Asn Phe  Thr Thr Ala Pro Ala  Ile Cys His 
    1070                 1075                 1080             


Asp Gly  Lys Ala His Phe Pro  Arg Glu Gly Val Phe  Val Ser Asn 
    1085                 1090                 1095             


Gly Thr  His Trp Phe Val Thr  Gln Arg Asn Phe Tyr  Glu Pro Gln 
    1100                 1105                 1110             


Ile Ile  Thr Thr Asp Asn Thr  Phe Val Ser Gly Asn  Cys Asp Val 
    1115                 1120                 1125             


Val Ile  Gly Ile Val Asn Asn  Thr Val Tyr Asp Pro  Leu Gln Pro 
    1130                 1135                 1140             


Glu Leu  Asp Ser Phe Lys Glu  Glu Leu Asp Lys Tyr  Phe Lys Asn 
    1145                 1150                 1155             


His Thr  Ser Pro Asp Val Asp  Leu Gly Asp Ile Ser  Gly Ile Asn 
    1160                 1165                 1170             


Ala Ser  Val Val Asn Ile Gln  Lys Glu Ile Asp Arg  Leu Asn Glu 
    1175                 1180                 1185             


Val Ala  Lys Asn Leu Asn Glu  Ser Leu Ile Asp Leu  Gln Glu Leu 
    1190                 1195                 1200             


Gly Lys  Tyr Glu Gln Tyr Ile  Lys Trp Pro Trp Tyr  Ile Trp Leu 
    1205                 1210                 1215             


Gly Phe  Ile Ala Gly Leu Ile  Ala Ile Val Met Val  Thr Ile Met 
    1220                 1225                 1230             


Leu Cys  Cys Met Thr Ser Cys  Cys Ser Cys Leu Lys  Gly Cys Cys 
    1235                 1240                 1245             


Ser Cys  Gly Ser Cys Cys Lys  Phe Asp Glu Asp Asp  Ser Glu Pro 
    1250                 1255                 1260             


Val Leu  Lys Gly Val Lys Leu  His Tyr Thr 
    1265                 1270             


<210>  3
<211>  3819
<212>  DNA
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein with native signal peptide

<400>  3
atgtttgttt ttcttgtttt attgccatta gtctctagtc agtgtgttaa tcttacaacc       60

agaactcaat taccccctgc atacactaat tctttcacac gtggtgttta ttaccctgac      120

aaagttttca gatcctcagt tttacattca actcaggact tgttcttacc tttcttttcc      180

aatgttactt ggttccatgc tatacatgtc tctgggacca atggtactaa gaggtttgat      240

aaccctgtcc taccatttaa tgatggtgtt tattttgctt ccactgagaa gtctaacata      300

ataagaggct ggatttttgg tactacttta gattcgaaga cccagtccct acttattgtt      360

aataacgcta ctaatgttgt tattaaagtc tgtgaatttc aattttgtaa tgatccattt      420

ttgggtgttt attaccacaa aaacaacaaa agttggatgg aaagtgagtt cagagtttat      480

tctagtgcga ataattgcac ttttgaatat gtctctcagc cttttcttat ggaccttgaa      540

ggaaaacagg gtaatttcaa aaatcttagg gaatttgtgt ttaagaatat tgatggttat      600

tttaaaatat attctaagca cacgcctatt aatttagtgc gtgatctccc tcagggtttt      660

tcggctttag aaccattggt agatttgcca ataggtatta acatcactag gtttcaaact      720

ttacttgctt tacatagaag ttatttgact cctggtgatt cttcttcagg ttggacagct      780

ggtgctgcag cttattatgt gggttatctt caacctagga cttttctatt aaaatataat      840

gaaaatggaa ccattacaga tgctgtagac tgtgcacttg accctctctc agaaacaaag      900

tgtacgttga aatccttcac tgtagaaaaa ggaatctatc aaacttctaa ctttagagtc      960

caaccaacag aatctattgt tagatttcct aatattacaa acttgtgccc ttttggtgaa     1020

gtttttaacg ccaccagatt tgcatctgtt tatgcttgga acaggaagag aatcagcaac     1080

tgtgttgctg attattctgt cctatataat tccgcatcat tttccacttt taagtgttat     1140

ggagtgtctc ctactaaatt aaatgatctc tgctttacta atgtctatgc agattcattt     1200

gtaattagag gtgatgaagt cagacaaatc gctccagggc aaactggaaa gattgctgat     1260

tataattata aattaccaga tgattttaca ggctgcgtta tagcttggaa ttctaacaat     1320

cttgattcta aggttggtgg taattataat tacctgtata gattgtttag gaagtctaat     1380

ctcaaacctt ttgagagaga tatttcaact gaaatctatc aggccggtag cacaccttgt     1440

aatggtgttg aaggttttaa ttgttacttt cctttacaat catatggttt ccaacccact     1500

aatggtgttg gttaccaacc atacagagta gtagtacttt cttttgaact tctacatgca     1560

ccagcaactg tttgtggacc taaaaagtct actaatttgg ttaaaaacaa atgtgtcaat     1620

ttcaacttca atggtttaac aggcacaggt gttcttactg agtctaacaa aaagtttctg     1680

cctttccaac aatttggcag agacattgct gacactactg atgctgtccg tgatccacag     1740

acacttgaga ttcttgacat tacaccatgt tcttttggtg gtgtcagtgt tataacacca     1800

ggaacaaata cttctaacca ggttgctgtt ctttatcagg atgttaactg cacagaagtc     1860

cctgttgcta ttcatgcaga tcaacttact cctacttggc gtgtttattc tacaggttct     1920

aatgtttttc aaacacgtgc aggctgttta ataggggctg aacatgtcaa caactcatat     1980

gagtgtgaca tacccattgg tgcaggtata tgcgctagtt atcagactca gactaattct     2040

cctcggcggg cacgtagtgt agctagtcaa tccatcattg cctacactat gtcacttggt     2100

gcagaaaatt cagttgctta ctctaataac tctattgcca tacccacaaa ttttactatt     2160

agtgttacca cagaaattct accagtgtct atgaccaaga catcagtaga ttgtacaatg     2220

tacatttgtg gtgattcaac tgaatgcagc aatcttttgt tgcaatatgg cagtttttgt     2280

acacaattaa accgtgcttt aactggaata gctgttgaac aagacaaaaa cacccaagaa     2340

gtttttgcac aagtcaaaca aatttacaaa acaccaccaa ttaaagattt tggtggtttt     2400

aatttttcac aaatattacc agatccatca aaaccaagca agaggtcatt tattgaagat     2460

ctacttttca acaaagtgac acttgcagat gctggcttca tcaaacaata tggtgattgc     2520

cttggtgata ttgctgctag agacctcatt tgtgcacaaa agtttaacgg ccttactgtt     2580

ttgccacctt tgctcacaga tgaaatgatt gctcaataca cttctgcact gttagcgggt     2640

acaatcactt ctggttggac ctttggtgca ggtgctgcat tacaaatacc atttgctatg     2700

caaatggctt ataggtttaa tggtattgga gttacacaga atgttctcta tgagaaccaa     2760

aaattgattg ccaaccaatt taatagtgct attggcaaaa ttcaagactc actttcttcc     2820

acagcaagtg cacttggaaa acttcaagat gtggtcaacc aaaatgcaca agctttaaac     2880

acgcttgtta aacaacttag ctccaatttt ggtgcaattt caagtgtttt aaatgatatc     2940

ctttcacgtc ttgacaaagt tgaggctgaa gtgcaaattg ataggttgat cacaggcaga     3000

cttcaaagtt tgcagacata tgtgactcaa caattaatta gagctgcaga aatcagagct     3060

tctgctaatc ttgctgctac taaaatgtca gagtgtgtac ttggacaatc aaaaagagtt     3120

gatttttgtg gaaagggcta tcatcttatg tccttccctc agtcagcacc tcatggtgta     3180

gtcttcttgc atgtgactta tgtccctgca caagaaaaga acttcacaac tgctcctgcc     3240

atttgtcatg atggaaaagc acactttcct cgtgaaggtg tctttgtttc aaatggcaca     3300

cactggtttg taacacaaag gaatttttat gaaccacaaa tcattactac agacaacaca     3360

tttgtgtctg gtaactgtga tgttgtaata ggaattgtca acaacacagt ttatgatcct     3420

ttgcaacctg aattagactc attcaaggag gagttagata aatattttaa gaatcataca     3480

tcaccagatg ttgatttagg tgacatctct ggcattaatg cttcagttgt aaacattcaa     3540

aaagaaattg accgcctcaa tgaggttgcc aagaatttaa atgaatctct catcgatctc     3600

caagaacttg gaaagtatga gcagtatata aaatggccat ggtacatttg gctaggtttt     3660

atagctggct tgattgccat agtaatggtg acaattatgc tttgctgtat gaccagttgc     3720

tgtagttgtc tcaagggctg ttgttcttgt ggatcctgct gcaaatttga tgaagacgac     3780

tctgagccag tgctcaaagg agtcaaatta cattacaca                            3819


<210>  4
<211>  1198
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein ectodomain (ECD) 

<400>  4

Gln Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr 
1               5                   10                  15      


Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser 
            20                  25                  30          


Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn 
        35                  40                  45              


Val Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys 
    50                  55                  60                  


Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala 
65                  70                  75                  80  


Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr 
                85                  90                  95      


Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn 
            100                 105                 110         


Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu 
        115                 120                 125             


Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe 
    130                 135                 140                 


Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln 
145                 150                 155                 160 


Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu 
                165                 170                 175     


Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser 
            180                 185                 190         


Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser 
        195                 200                 205             


Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg 
    210                 215                 220                 


Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp 
225                 230                 235                 240 


Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr 
                245                 250                 255     


Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile 
            260                 265                 270         


Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys 
        275                 280                 285             


Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn 
    290                 295                 300                 


Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr 
305                 310                 315                 320 


Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser 
                325                 330                 335     


Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr 
            340                 345                 350         


Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly 
        355                 360                 365             


Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala 
    370                 375                 380                 


Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly 
385                 390                 395                 400 


Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe 
                405                 410                 415     


Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val 
            420                 425                 430         


Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu 
        435                 440                 445             


Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser 
    450                 455                 460                 


Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln 
465                 470                 475                 480 


Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg 
                485                 490                 495     


Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys 
            500                 505                 510         


Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe 
        515                 520                 525             


Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys 
    530                 535                 540                 


Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr 
545                 550                 555                 560 


Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro 
                565                 570                 575     


Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser 
            580                 585                 590         


Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro 
        595                 600                 605             


Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser 
    610                 615                 620                 


Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala 
625                 630                 635                 640 


Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly 
                645                 650                 655     


Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg 
            660                 665                 670         


Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala 
        675                 680                 685             


Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn 
    690                 695                 700                 


Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys 
705                 710                 715                 720 


Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys 
                725                 730                 735     


Ser Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg 
            740                 745                 750         


Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val 
        755                 760                 765             


Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe 
    770                 775                 780                 


Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser 
785                 790                 795                 800 


Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala 
                805                 810                 815     


Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala 
            820                 825                 830         


Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu 
        835                 840                 845             


Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu 
    850                 855                 860                 


Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala 
865                 870                 875                 880 


Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile 
                885                 890                 895     


Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn 
            900                 905                 910         


Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr 
        915                 920                 925             


Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln 
    930                 935                 940                 


Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile 
945                 950                 955                 960 


Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala 
                965                 970                 975     


Glu Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln 
            980                 985                 990         


Thr Tyr Val Thr Gln Gln Leu Ile  Arg Ala Ala Glu Ile  Arg Ala Ser 
        995                 1000                 1005             


Ala Asn  Leu Ala Ala Thr Lys  Met Ser Glu Cys Val  Leu Gly Gln 
    1010                 1015                 1020             


Ser Lys  Arg Val Asp Phe Cys  Gly Lys Gly Tyr His  Leu Met Ser 
    1025                 1030                 1035             


Phe Pro  Gln Ser Ala Pro His  Gly Val Val Phe Leu  His Val Thr 
    1040                 1045                 1050             


Tyr Val  Pro Ala Gln Glu Lys  Asn Phe Thr Thr Ala  Pro Ala Ile 
    1055                 1060                 1065             


Cys His  Asp Gly Lys Ala His  Phe Pro Arg Glu Gly  Val Phe Val 
    1070                 1075                 1080             


Ser Asn  Gly Thr His Trp Phe  Val Thr Gln Arg Asn  Phe Tyr Glu 
    1085                 1090                 1095             


Pro Gln  Ile Ile Thr Thr Asp  Asn Thr Phe Val Ser  Gly Asn Cys 
    1100                 1105                 1110             


Asp Val  Val Ile Gly Ile Val  Asn Asn Thr Val Tyr  Asp Pro Leu 
    1115                 1120                 1125             


Gln Pro  Glu Leu Asp Ser Phe  Lys Glu Glu Leu Asp  Lys Tyr Phe 
    1130                 1135                 1140             


Lys Asn  His Thr Ser Pro Asp  Val Asp Leu Gly Asp  Ile Ser Gly 
    1145                 1150                 1155             


Ile Asn  Ala Ser Val Val Asn  Ile Gln Lys Glu Ile  Asp Arg Leu 
    1160                 1165                 1170             


Asn Glu  Val Ala Lys Asn Leu  Asn Glu Ser Leu Ile  Asp Leu Gln 
    1175                 1180                 1185             


Glu Leu  Gly Lys Tyr Glu Gln  Tyr Ile Lys 
    1190                 1195             


<210>  5
<211>  3594
<212>  DNA
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein ECD coding sequence 

<400>  5
cagtgtgtta atcttacaac cagaactcaa ttaccccctg catacactaa ttctttcaca       60

cgtggtgttt attaccctga caaagttttc agatcctcag ttttacattc aactcaggac      120

ttgttcttac ctttcttttc caatgttact tggttccatg ctatacatgt ctctgggacc      180

aatggtacta agaggtttga taaccctgtc ctaccattta atgatggtgt ttattttgct      240

tccactgaga agtctaacat aataagaggc tggatttttg gtactacttt agattcgaag      300

acccagtccc tacttattgt taataacgct actaatgttg ttattaaagt ctgtgaattt      360

caattttgta atgatccatt tttgggtgtt tattaccaca aaaacaacaa aagttggatg      420

gaaagtgagt tcagagttta ttctagtgcg aataattgca cttttgaata tgtctctcag      480

ccttttctta tggaccttga aggaaaacag ggtaatttca aaaatcttag ggaatttgtg      540

tttaagaata ttgatggtta ttttaaaata tattctaagc acacgcctat taatttagtg      600

cgtgatctcc ctcagggttt ttcggcttta gaaccattgg tagatttgcc aataggtatt      660

aacatcacta ggtttcaaac tttacttgct ttacatagaa gttatttgac tcctggtgat      720

tcttcttcag gttggacagc tggtgctgca gcttattatg tgggttatct tcaacctagg      780

acttttctat taaaatataa tgaaaatgga accattacag atgctgtaga ctgtgcactt      840

gaccctctct cagaaacaaa gtgtacgttg aaatccttca ctgtagaaaa aggaatctat      900

caaacttcta actttagagt ccaaccaaca gaatctattg ttagatttcc taatattaca      960

aacttgtgcc cttttggtga agtttttaac gccaccagat ttgcatctgt ttatgcttgg     1020

aacaggaaga gaatcagcaa ctgtgttgct gattattctg tcctatataa ttccgcatca     1080

ttttccactt ttaagtgtta tggagtgtct cctactaaat taaatgatct ctgctttact     1140

aatgtctatg cagattcatt tgtaattaga ggtgatgaag tcagacaaat cgctccaggg     1200

caaactggaa agattgctga ttataattat aaattaccag atgattttac aggctgcgtt     1260

atagcttgga actctaacaa tcttgattct aaggttggtg gtaattataa ttacctgtat     1320

agattgttta ggaagtctaa tctcaaacct tttgagagag atatttcaac tgaaatctat     1380

caggccggta gcacaccttg taatggtgtt gaaggtttta attgttactt tcctttacaa     1440

tcatatggtt tccaacccac taatggtgtt ggttaccaac catacagagt agtagtactt     1500

tcttttgaac ttctacatgc accagcaact gtttgtggac ctaaaaagtc tactaatttg     1560

gttaaaaaca aatgtgtcaa tttcaacttc aatggtttaa caggcacagg tgttcttact     1620

gagtctaaca aaaagtttct gcctttccaa caatttggca gagacattgc tgacactact     1680

gatgctgtcc gtgatccaca gacacttgag attcttgaca ttacaccatg ttcttttggt     1740

ggtgtcagtg ttataacacc aggaacaaat acttctaacc aggttgctgt tctttatcag     1800

gatgttaact gcacagaagt ccctgttgct attcatgcag atcaacttac tcctacttgg     1860

cgtgtttatt ctacaggttc taatgttttt caaacacgtg caggctgttt aataggggct     1920

gaacatgtca acaactcata tgagtgtgac atacccattg gtgcaggtat atgcgctagt     1980

tatcagactc agactaattc tcctcggcgg gcacgtagtg tagctagtca atccatcatt     2040

gcctacacta tgtcacttgg tgcagaaaat tcagttgctt actctaataa ctctattgcc     2100

atacccacaa attttactat tagtgttacc acagaaattc taccagtgtc tatgaccaag     2160

acatcagtag attgtacaat gtacatttgt ggtgattcaa ctgaatgcag caatcttttg     2220

ttgcaatatg gcagtttttg tacacaatta aaccgtgctt taactggaat agctgttgaa     2280

caagacaaaa acacccaaga agtttttgca caagtcaaac aaatttacaa aacaccacca     2340

attaaagatt ttggtggttt taatttttca caaatattac cagatccatc aaaaccaagc     2400

aagaggtcat ttattgaaga tctacttttc aacaaagtga cacttgcaga tgctggcttc     2460

atcaaacaat atggtgattg ccttggtgat attgctgcta gagacctcat ttgtgcacaa     2520

aagtttaacg gccttactgt tttgccacct ttgctcacag atgaaatgat tgctcaatac     2580

acttctgcac tgttagcggg tacaatcact tctggttgga cctttggtgc aggtgctgca     2640

ttacaaatac catttgctat gcaaatggct tataggttta atggtattgg agttacacag     2700

aatgttctct atgagaacca aaaattgatt gccaaccaat ttaatagtgc tattggcaaa     2760

attcaagact cactttcttc cacagcaagt gcacttggaa aacttcaaga tgtggtcaac     2820

caaaatgcac aagctttaaa cacgcttgtt aaacaactta gctccaattt tggtgcaatt     2880

tcaagtgttt taaatgatat cctttcacgt cttgacaaag ttgaggctga agtgcaaatt     2940

gataggttga tcacaggcag acttcaaagt ttgcagacat atgtgactca acaattaatt     3000

agagctgcag aaatcagagc ttctgctaat cttgctgcta ctaaaatgtc agagtgtgta     3060

cttggacaat caaaaagagt tgatttttgt ggaaagggct atcatcttat gtccttccct     3120

cagtcagcac ctcatggtgt agtcttcttg catgtgactt atgtccctgc acaagaaaag     3180

aacttcacaa ctgctcctgc catttgtcat gatggaaaag cacactttcc tcgtgaaggt     3240

gtctttgttt caaatggcac acactggttt gtaacacaaa ggaattttta tgaaccacaa     3300

atcattacta cagacaacac atttgtgtct ggtaactgtg atgttgtaat aggaattgtc     3360

aacaacacag tttatgatcc tttgcaacct gaattagact cattcaagga ggagttagat     3420

aaatatttta agaatcatac atcaccagat gttgatttag gtgacatctc tggcattaat     3480

gcttcagttg taaacattca aaaagaaatt gaccgcctca atgaggttgc caagaattta     3540

aatgaatctc tcatcgatct ccaagaactt ggaaagtatg agcagtatat aaaa           3594


<210>  6
<211>  672
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein S1 subunit

<400>  6

Gln Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr 
1               5                   10                  15      


Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser 
            20                  25                  30          


Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn 
        35                  40                  45              


Val Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys 
    50                  55                  60                  


Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala 
65                  70                  75                  80  


Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr 
                85                  90                  95      


Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn 
            100                 105                 110         


Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu 
        115                 120                 125             


Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe 
    130                 135                 140                 


Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln 
145                 150                 155                 160 


Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu 
                165                 170                 175     


Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser 
            180                 185                 190         


Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser 
        195                 200                 205             


Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg 
    210                 215                 220                 


Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp 
225                 230                 235                 240 


Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr 
                245                 250                 255     


Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile 
            260                 265                 270         


Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys 
        275                 280                 285             


Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn 
    290                 295                 300                 


Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr 
305                 310                 315                 320 


Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser 
                325                 330                 335     


Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr 
            340                 345                 350         


Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly 
        355                 360                 365             


Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala 
    370                 375                 380                 


Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly 
385                 390                 395                 400 


Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe 
                405                 410                 415     


Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val 
            420                 425                 430         


Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu 
        435                 440                 445             


Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser 
    450                 455                 460                 


Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln 
465                 470                 475                 480 


Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg 
                485                 490                 495     


Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys 
            500                 505                 510         


Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe 
        515                 520                 525             


Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys 
    530                 535                 540                 


Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr 
545                 550                 555                 560 


Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro 
                565                 570                 575     


Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser 
            580                 585                 590         


Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro 
        595                 600                 605             


Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser 
    610                 615                 620                 


Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala 
625                 630                 635                 640 


Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly 
                645                 650                 655     


Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg 
            660                 665                 670         


<210>  7
<211>  2016
<212>  DNA
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein S1 subunit coding sequence

<400>  7
cagtgtgtta atcttacaac cagaactcaa ttaccccctg catacactaa ttctttcaca       60

cgtggtgttt attaccctga caaagttttc agatcctcag ttttacattc aactcaggac      120

ttgttcttac ctttcttttc caatgttact tggttccatg ctatacatgt ctctgggacc      180

aatggtacta agaggtttga taaccctgtc ctaccattta atgatggtgt ttattttgct      240

tccactgaga agtctaacat aataagaggc tggatttttg gtactacttt agattcgaag      300

acccagtccc tacttattgt taataacgct actaatgttg ttattaaagt ctgtgaattt      360

caattttgta atgatccatt tttgggtgtt tattaccaca aaaacaacaa aagttggatg      420

gaaagtgagt tcagagttta ttctagtgcg aataattgca cttttgaata tgtctctcag      480

ccttttctta tggaccttga aggaaaacag ggtaatttca aaaatcttag ggaatttgtg      540

tttaagaata ttgatggtta ttttaaaata tattctaagc acacgcctat taatttagtg      600

cgtgatctcc ctcagggttt ttcggcttta gaaccattgg tagatttgcc aataggtatt      660

aacatcacta ggtttcaaac tttacttgct ttacatagaa gttatttgac tcctggtgat      720

tcttcttcag gttggacagc tggtgctgca gcttattatg tgggttatct tcaacctagg      780

acttttctat taaaatataa tgaaaatgga accattacag atgctgtaga ctgtgcactt      840

gaccctctct cagaaacaaa gtgtacgttg aaatccttca ctgtagaaaa aggaatctat      900

caaacttcta actttagagt ccaaccaaca gaatctattg ttagatttcc taatattaca      960

aacttgtgcc cttttggtga agtttttaac gccaccagat ttgcatctgt ttatgcttgg     1020

aacaggaaga gaatcagcaa ctgtgttgct gattattctg tcctatataa ttccgcatca     1080

ttttccactt ttaagtgtta tggagtgtct cctactaaat taaatgatct ctgctttact     1140

aatgtctatg cagattcatt tgtaattaga ggtgatgaag tcagacaaat cgctccaggg     1200

caaactggaa agattgctga ttataattat aaattaccag atgattttac aggctgcgtt     1260

atagcttgga attctaacaa tcttgattct aaggttggtg gtaattataa ttacctgtat     1320

agattgttta ggaagtctaa tctcaaacct tttgagagag atatttcaac tgaaatctat     1380

caggccggta gcacaccttg taatggtgtt gaaggtttta attgttactt tcctttacaa     1440

tcatatggtt tccaacccac taatggtgtt ggttaccaac catacagagt agtagtactt     1500

tcttttgaac ttctacatgc accagcaact gtttgtggac ctaaaaagtc tactaatttg     1560

gttaaaaaca aatgtgtcaa tttcaacttc aatggtttaa caggcacagg tgttcttact     1620

gagtctaaca aaaagtttct gcctttccaa caatttggca gagacattgc tgacactact     1680

gatgctgtcc gtgatccaca gacacttgag attcttgaca ttacaccatg ttcttttggt     1740

ggtgtcagtg ttataacacc aggaacaaat acttctaacc aggttgctgt tctttatcag     1800

gatgttaact gcacagaagt ccctgttgct attcatgcag atcaacttac tcctacttgg     1860

cgtgtttatt ctacaggttc taatgttttt caaacacgtg caggctgttt aataggggct     1920

gaacatgtca acaactcata tgagtgtgac atacccattg gtgcaggtat atgcgctagt     1980

tatcagactc agactaattc tcctcggcgg gcacgt                               2016


<210>  8
<211>  223
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein receptor binding domain (RBD) 
spanning positions 319-541 (RBD-1)

<400>  8

Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn 
1               5                   10                  15      


Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val 
            20                  25                  30          


Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser 
        35                  40                  45              


Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val 
    50                  55                  60                  


Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp 
65                  70                  75                  80  


Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln 
                85                  90                  95      


Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr 
            100                 105                 110         


Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly 
        115                 120                 125             


Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys 
    130                 135                 140                 


Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr 
145                 150                 155                 160 


Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser 
                165                 170                 175     


Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val 
            180                 185                 190         


Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly 
        195                 200                 205             


Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe 
    210                 215                 220             


<210>  9
<211>  669
<212>  DNA
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein RBD-1 coding sequence 

<400>  9
agagtccaac caacagaatc tattgttaga tttcctaata ttacaaactt gtgccctttt       60

ggtgaagttt ttaacgccac cagatttgca tctgtttatg cttggaacag gaagagaatc      120

agcaactgtg ttgctgatta ttctgtccta tataattccg catcattttc cacttttaag      180

tgttatggag tgtctcctac taaattaaat gatctctgct ttactaatgt ctatgcagat      240

tcatttgtaa ttagaggtga tgaagtcaga caaatcgctc cagggcaaac tggaaagatt      300

gctgattata attataaatt accagatgat tttacaggct gcgttatagc ttggaactct      360

aacaatcttg attctaaggt tggtggtaat tataattacc tgtatagatt gtttaggaag      420

tctaatctca aaccttttga gagagatatt tcaactgaaa tctatcaggc cggtagcaca      480

ccttgtaatg gtgttgaagg ttttaattgt tactttcctt tacaatcata tggtttccaa      540

cccactaatg gtgttggtta ccaaccatac agagtagtag tactttcttt tgaacttcta      600

catgcaccag caactgtttg tggacctaaa aagtctacta atttggttaa aaacaaatgt      660

gtcaatttc                                                              669


<210>  10
<211>  199
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein RBD spanning positions 331-529 (RBD-2) 

<400>  10

Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg 
1               5                   10                  15      


Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val 
            20                  25                  30          


Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys 
        35                  40                  45              


Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn 
    50                  55                  60                  


Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile 
65                  70                  75                  80  


Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro 
                85                  90                  95      


Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp 
            100                 105                 110         


Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys 
        115                 120                 125             


Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln 
    130                 135                 140                 


Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe 
145                 150                 155                 160 


Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln 
                165                 170                 175     


Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala 
            180                 185                 190         


Thr Val Cys Gly Pro Lys Lys 
        195                 


<210>  11
<211>  597
<212>  DNA
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein RBD-2 coding sequence 

<400>  11
aatattacaa acttgtgccc ttttggtgaa gtttttaacg ccaccagatt tgcatctgtt       60

tatgcttgga acaggaagag aatcagcaac tgtgttgctg attattctgt cctatataat      120

tccgcatcat tttccacttt taagtgttat ggagtgtctc ctactaaatt aaatgatctc      180

tgctttacta atgtctatgc agattcattt gtaattagag gtgatgaagt cagacaaatc      240

gctccagggc aaactggaaa gattgctgat tataattata aattaccaga tgattttaca      300

ggctgcgtta tagcttggaa ctctaacaat cttgattcta aggttggtgg taattataat      360

tacctgtata gattgtttag gaagtctaat ctcaaacctt ttgagagaga tatttcaact      420

gaaatctatc aggccggtag cacaccttgt aatggtgttg aaggttttaa ttgttacttt      480

cctttacaat catatggttt ccaacccact aatggtgttg gttaccaacc atacagagta      540

gtagtacttt cttttgaact tctacatgca ccagcaactg tttgtggacc taaaaag         597


<210>  12
<211>  194
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein RBD spanning positions 331-524 (RBD-3) 

<400>  12

Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg 
1               5                   10                  15      


Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val 
            20                  25                  30          


Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys 
        35                  40                  45              


Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn 
    50                  55                  60                  


Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile 
65                  70                  75                  80  


Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro 
                85                  90                  95      


Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp 
            100                 105                 110         


Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys 
        115                 120                 125             


Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln 
    130                 135                 140                 


Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe 
145                 150                 155                 160 


Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln 
                165                 170                 175     


Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala 
            180                 185                 190         


Thr Val 
        


<210>  13
<211>  582
<212>  DNA
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein RBD-3 coding sequence 

<400>  13
aatattacaa acttgtgccc ttttggtgaa gtttttaacg ccaccagatt tgcatctgtt       60

tatgcttgga acaggaagag aatcagcaac tgtgttgctg attattctgt cctatataat      120

tccgcatcat tttccacttt taagtgttat ggagtgtctc ctactaaatt aaatgatctc      180

tgctttacta atgtctatgc agattcattt gtaattagag gtgatgaagt cagacaaatc      240

gctccagggc aaactggaaa gattgctgat tataattata aattaccaga tgattttaca      300

ggctgcgtta tagcttggaa ctctaacaat cttgattcta aggttggtgg taattataat      360

tacctgtata gattgtttag gaagtctaat ctcaaacctt ttgagagaga tatttcaact      420

gaaatctatc aggccggtag cacaccttgt aatggtgttg aaggttttaa ttgttacttt      480

cctttacaat catatggttt ccaacccact aatggtgttg gttaccaacc atacagagta      540

gtagtacttt cttttgaact tctacatgca ccagcaactg tt                         582


<210>  14
<211>  211
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein RBD spanning positions 319-529 (RBD-4) 

<400>  14

Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn 
1               5                   10                  15      


Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val 
            20                  25                  30          


Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser 
        35                  40                  45              


Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val 
    50                  55                  60                  


Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp 
65                  70                  75                  80  


Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln 
                85                  90                  95      


Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr 
            100                 105                 110         


Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly 
        115                 120                 125             


Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys 
    130                 135                 140                 


Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr 
145                 150                 155                 160 


Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser 
                165                 170                 175     


Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val 
            180                 185                 190         


Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly 
        195                 200                 205             


Pro Lys Lys 
    210     


<210>  15
<211>  633
<212>  DNA
<213>  SARS-CoV-2 

<220>
<223>  SARS-CoV-2 spike protein RBD-4 coding sequence 

<400>  15
agagtccaac caacagaatc tattgttaga tttcctaata ttacaaactt gtgccctttt       60

ggtgaagttt ttaacgccac cagatttgca tctgtttatg cttggaacag gaagagaatc      120

agcaactgtg ttgctgatta ttctgtccta tataattccg catcattttc cacttttaag      180

tgttatggag tgtctcctac taaattaaat gatctctgct ttactaatgt ctatgcagat      240

tcatttgtaa ttagaggtga tgaagtcaga caaatcgctc cagggcaaac tggaaagatt      300

gctgattata attataaatt accagatgat tttacaggct gcgttatagc ttggaactct      360

aacaatcttg attctaaggt tggtggtaat tataattacc tgtatagatt gtttaggaag      420

tctaatctca aaccttttga gagagatatt tcaactgaaa tctatcaggc cggtagcaca      480

ccttgtaatg gtgttgaagg ttttaattgt tactttcctt tacaatcata tggtttccaa      540

cccactaatg gtgttggtta ccaaccatac agagtagtag tactttcttt tgaacttcta      600

catgcaccag caactgtttg tggacctaaa aag                                   633


<210>  16
<211>  56
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein receptor binding motif (RBM) 

<400>  16

Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn 
1               5                   10                  15      


Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly 
            20                  25                  30          


Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu 
        35                  40                  45              


Gln Ser Tyr Gly Phe Gln Pro Thr 
    50                  55      


<210>  17
<211>  168
<212>  DNA
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein RBM coding sequence 

<400>  17
gttggtggta attataatta cctgtataga ttgtttagga agtctaatct caaacctttt       60

gagagagata tttcaactga aatctatcag gccggtagca caccttgtaa tggtgttgaa      120

ggttttaatt gttactttcc tttacaatca tatggtttcc aacccact                   168


<210>  18
<211>  419
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 nucleocapsid protein 

<400>  18

Met Ser Asp Asn Gly Pro Gln Asn Gln Arg Asn Ala Pro Arg Ile Thr 
1               5                   10                  15      


Phe Gly Gly Pro Ser Asp Ser Thr Gly Ser Asn Gln Asn Gly Glu Arg 
            20                  25                  30          


Ser Gly Ala Arg Ser Lys Gln Arg Arg Pro Gln Gly Leu Pro Asn Asn 
        35                  40                  45              


Thr Ala Ser Trp Phe Thr Ala Leu Thr Gln His Gly Lys Glu Asp Leu 
    50                  55                  60                  


Lys Phe Pro Arg Gly Gln Gly Val Pro Ile Asn Thr Asn Ser Ser Pro 
65                  70                  75                  80  


Asp Asp Gln Ile Gly Tyr Tyr Arg Arg Ala Thr Arg Arg Ile Arg Gly 
                85                  90                  95      


Gly Asp Gly Lys Met Lys Asp Leu Ser Pro Arg Trp Tyr Phe Tyr Tyr 
            100                 105                 110         


Leu Gly Thr Gly Pro Glu Ala Gly Leu Pro Tyr Gly Ala Asn Lys Asp 
        115                 120                 125             


Gly Ile Ile Trp Val Ala Thr Glu Gly Ala Leu Asn Thr Pro Lys Asp 
    130                 135                 140                 


His Ile Gly Thr Arg Asn Pro Ala Asn Asn Ala Ala Ile Val Leu Gln 
145                 150                 155                 160 


Leu Pro Gln Gly Thr Thr Leu Pro Lys Gly Phe Tyr Ala Glu Gly Ser 
                165                 170                 175     


Arg Gly Gly Ser Gln Ala Ser Ser Arg Ser Ser Ser Arg Ser Arg Asn 
            180                 185                 190         


Ser Ser Arg Asn Ser Thr Pro Gly Ser Ser Arg Gly Thr Ser Pro Ala 
        195                 200                 205             


Arg Met Ala Gly Asn Gly Gly Asp Ala Ala Leu Ala Leu Leu Leu Leu 
    210                 215                 220                 


Asp Arg Leu Asn Gln Leu Glu Ser Lys Met Ser Gly Lys Gly Gln Gln 
225                 230                 235                 240 


Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala Glu Ala Ser Lys 
                245                 250                 255     


Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Ala Tyr Asn Val Thr Gln 
            260                 265                 270         


Ala Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly Asn Phe Gly Asp 
        275                 280                 285             


Gln Glu Leu Ile Arg Gln Gly Thr Asp Tyr Lys His Trp Pro Gln Ile 
    290                 295                 300                 


Ala Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly Met Ser Arg Ile 
305                 310                 315                 320 


Gly Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr Tyr Thr Ala Ala 
                325                 330                 335     


Ile Lys Leu Asp Asp Lys Asp Pro Asn Phe Lys Asp Gln Val Ile Leu 
            340                 345                 350         


Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro Pro Thr Glu Pro 
        355                 360                 365             


Lys Lys Asp Lys Lys Lys Lys Ala Asp Glu Thr Gln Ala Leu Pro Gln 
    370                 375                 380                 


Arg Gln Lys Lys Gln Gln Thr Val Thr Leu Leu Pro Ala Ala Asp Leu 
385                 390                 395                 400 


Asp Asp Phe Ser Lys Gln Leu Gln Gln Ser Met Ser Ser Ala Asp Ser 
                405                 410                 415     


Thr Gln Ala 
            


<210>  19
<211>  1257
<212>  DNA
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 nucleocapsid protein coding sequence 

<400>  19
atgtctgata atggacccca aaatcagcga aatgcacccc gcattacgtt tggtggaccc       60

tcagattcaa ctggcagtaa ccagaatgga gaacgcagtg gggcgcgatc aaaacaacgt      120

cggccccaag gtttacccaa taatactgcg tcttggttca ccgctctcac tcaacatggc      180

aaggaagacc ttaaattccc tcgaggacaa ggcgttccaa ttaacaccaa tagcagtcca      240

gatgaccaaa ttggctacta ccgaagagct accagacgaa ttcgtggtgg tgacggtaaa      300

atgaaagatc tcagtccaag atggtatttc tactacctag gaactgggcc agaagctgga      360

cttccctatg gtgctaacaa agacggcatc atatgggttg caactgaggg agccttgaat      420

acaccaaaag atcacattgg cacccgcaat cctgctaaca atgctgcaat cgtgctacaa      480

cttcctcaag gaacaacatt gccaaaaggc ttctacgcag aagggagcag aggcggcagt      540

caagcctctt ctcgttcctc atcacgtagt cgcaacagtt caagaaattc aactccaggc      600

agcagtaggg gaacttctcc tgctagaatg gctggcaatg gcggtgatgc tgctcttgct      660

ttgctgctgc ttgacagatt gaaccagctt gagagcaaaa tgtctggtaa aggccaacaa      720

caacaaggcc aaactgtcac taagaaatct gctgctgagg cttctaagaa gcctcggcaa      780

aaacgtactg ccactaaagc atacaatgta acacaagctt tcggcagacg tggtccagaa      840

caaacccaag gaaattttgg ggaccaggaa ctaatcagac aaggaactga ttacaaacat      900

tggccgcaaa ttgcacaatt tgcccccagc gcttcagcgt tcttcggaat gtcgcgcatt      960

ggcatggaag tcacaccttc gggaacgtgg ttgacctaca cagctgccat caaattggat     1020

gacaaagatc caaatttcaa agatcaagtc attttgctga ataagcatat tgacgcatac     1080

aaaacattcc caccaacaga gcctaaaaag gacaaaaaga agaaggctga tgaaactcaa     1140

gccttaccgc agagacagaa gaaacagcaa actgtgactc ttcttcctgc tgcagatttg     1200

gatgatttct ccaaacaatt gcaacaatcc atgagcagtg ctgactcaac tcaggcc        1257


<210>  20
<211>  1273
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein with native signal peptide and 
an N501T substitution

<400>  20

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
            340                 345                 350         


Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
        355                 360                 365             


Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
    370                 375                 380                 


Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
385                 390                 395                 400 


Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                405                 410                 415     


Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
            420                 425                 430         


Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
        435                 440                 445             


Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
    450                 455                 460                 


Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
465                 470                 475                 480 


Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                485                 490                 495     


Phe Gln Pro Thr Thr Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
            500                 505                 510         


Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
        515                 520                 525             


Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
    530                 535                 540                 


Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
545                 550                 555                 560 


Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                565                 570                 575     


Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
            580                 585                 590         


Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
        595                 600                 605             


Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
    610                 615                 620                 


His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
625                 630                 635                 640 


Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                645                 650                 655     


Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
            660                 665                 670         


Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala 
        675                 680                 685             


Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser 
    690                 695                 700                 


Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile 
705                 710                 715                 720 


Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val 
                725                 730                 735     


Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu 
            740                 745                 750         


Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr 
        755                 760                 765             


Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln 
    770                 775                 780                 


Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe 
785                 790                 795                 800 


Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser 
                805                 810                 815     


Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly 
            820                 825                 830         


Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp 
        835                 840                 845             


Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu 
    850                 855                 860                 


Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly 
865                 870                 875                 880 


Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile 
                885                 890                 895     


Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr 
            900                 905                 910         


Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn 
        915                 920                 925             


Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala 
    930                 935                 940                 


Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn 
945                 950                 955                 960 


Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val 
                965                 970                 975     


Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln 
            980                 985                 990         


Ile Asp Arg Leu Ile Thr Gly Arg  Leu Gln Ser Leu Gln  Thr Tyr Val 
        995                 1000                 1005             


Thr Gln  Gln Leu Ile Arg Ala  Ala Glu Ile Arg Ala  Ser Ala Asn 
    1010                 1015                 1020             


Leu Ala  Ala Thr Lys Met Ser  Glu Cys Val Leu Gly  Gln Ser Lys 
    1025                 1030                 1035             


Arg Val  Asp Phe Cys Gly Lys  Gly Tyr His Leu Met  Ser Phe Pro 
    1040                 1045                 1050             


Gln Ser  Ala Pro His Gly Val  Val Phe Leu His Val  Thr Tyr Val 
    1055                 1060                 1065             


Pro Ala  Gln Glu Lys Asn Phe  Thr Thr Ala Pro Ala  Ile Cys His 
    1070                 1075                 1080             


Asp Gly  Lys Ala His Phe Pro  Arg Glu Gly Val Phe  Val Ser Asn 
    1085                 1090                 1095             


Gly Thr  His Trp Phe Val Thr  Gln Arg Asn Phe Tyr  Glu Pro Gln 
    1100                 1105                 1110             


Ile Ile  Thr Thr Asp Asn Thr  Phe Val Ser Gly Asn  Cys Asp Val 
    1115                 1120                 1125             


Val Ile  Gly Ile Val Asn Asn  Thr Val Tyr Asp Pro  Leu Gln Pro 
    1130                 1135                 1140             


Glu Leu  Asp Ser Phe Lys Glu  Glu Leu Asp Lys Tyr  Phe Lys Asn 
    1145                 1150                 1155             


His Thr  Ser Pro Asp Val Asp  Leu Gly Asp Ile Ser  Gly Ile Asn 
    1160                 1165                 1170             


Ala Ser  Val Val Asn Ile Gln  Lys Glu Ile Asp Arg  Leu Asn Glu 
    1175                 1180                 1185             


Val Ala  Lys Asn Leu Asn Glu  Ser Leu Ile Asp Leu  Gln Glu Leu 
    1190                 1195                 1200             


Gly Lys  Tyr Glu Gln Tyr Ile  Lys Trp Pro Trp Tyr  Ile Trp Leu 
    1205                 1210                 1215             


Gly Phe  Ile Ala Gly Leu Ile  Ala Ile Val Met Val  Thr Ile Met 
    1220                 1225                 1230             


Leu Cys  Cys Met Thr Ser Cys  Cys Ser Cys Leu Lys  Gly Cys Cys 
    1235                 1240                 1245             


Ser Cys  Gly Ser Cys Cys Lys  Phe Asp Glu Asp Asp  Ser Glu Pro 
    1250                 1255                 1260             


Val Leu  Lys Gly Val Lys Leu  His Tyr Thr 
    1265                 1270             


<210>  21
<211>  1198
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein ECD with an N501T substitution 

<400>  21

Gln Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr 
1               5                   10                  15      


Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser 
            20                  25                  30          


Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn 
        35                  40                  45              


Val Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys 
    50                  55                  60                  


Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala 
65                  70                  75                  80  


Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr 
                85                  90                  95      


Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn 
            100                 105                 110         


Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu 
        115                 120                 125             


Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe 
    130                 135                 140                 


Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln 
145                 150                 155                 160 


Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu 
                165                 170                 175     


Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser 
            180                 185                 190         


Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser 
        195                 200                 205             


Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg 
    210                 215                 220                 


Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp 
225                 230                 235                 240 


Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr 
                245                 250                 255     


Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile 
            260                 265                 270         


Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys 
        275                 280                 285             


Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn 
    290                 295                 300                 


Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr 
305                 310                 315                 320 


Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser 
                325                 330                 335     


Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr 
            340                 345                 350         


Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly 
        355                 360                 365             


Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala 
    370                 375                 380                 


Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly 
385                 390                 395                 400 


Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe 
                405                 410                 415     


Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val 
            420                 425                 430         


Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu 
        435                 440                 445             


Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser 
    450                 455                 460                 


Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln 
465                 470                 475                 480 


Ser Tyr Gly Phe Gln Pro Thr Thr Gly Val Gly Tyr Gln Pro Tyr Arg 
                485                 490                 495     


Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys 
            500                 505                 510         


Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe 
        515                 520                 525             


Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys 
    530                 535                 540                 


Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr 
545                 550                 555                 560 


Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro 
                565                 570                 575     


Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser 
            580                 585                 590         


Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro 
        595                 600                 605             


Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser 
    610                 615                 620                 


Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala 
625                 630                 635                 640 


Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly 
                645                 650                 655     


Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg 
            660                 665                 670         


Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala 
        675                 680                 685             


Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn 
    690                 695                 700                 


Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys 
705                 710                 715                 720 


Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys 
                725                 730                 735     


Ser Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg 
            740                 745                 750         


Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val 
        755                 760                 765             


Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe 
    770                 775                 780                 


Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser 
785                 790                 795                 800 


Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala 
                805                 810                 815     


Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala 
            820                 825                 830         


Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu 
        835                 840                 845             


Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu 
    850                 855                 860                 


Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala 
865                 870                 875                 880 


Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile 
                885                 890                 895     


Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn 
            900                 905                 910         


Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr 
        915                 920                 925             


Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln 
    930                 935                 940                 


Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile 
945                 950                 955                 960 


Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala 
                965                 970                 975     


Glu Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln 
            980                 985                 990         


Thr Tyr Val Thr Gln Gln Leu Ile  Arg Ala Ala Glu Ile  Arg Ala Ser 
        995                 1000                 1005             


Ala Asn  Leu Ala Ala Thr Lys  Met Ser Glu Cys Val  Leu Gly Gln 
    1010                 1015                 1020             


Ser Lys  Arg Val Asp Phe Cys  Gly Lys Gly Tyr His  Leu Met Ser 
    1025                 1030                 1035             


Phe Pro  Gln Ser Ala Pro His  Gly Val Val Phe Leu  His Val Thr 
    1040                 1045                 1050             


Tyr Val  Pro Ala Gln Glu Lys  Asn Phe Thr Thr Ala  Pro Ala Ile 
    1055                 1060                 1065             


Cys His  Asp Gly Lys Ala His  Phe Pro Arg Glu Gly  Val Phe Val 
    1070                 1075                 1080             


Ser Asn  Gly Thr His Trp Phe  Val Thr Gln Arg Asn  Phe Tyr Glu 
    1085                 1090                 1095             


Pro Gln  Ile Ile Thr Thr Asp  Asn Thr Phe Val Ser  Gly Asn Cys 
    1100                 1105                 1110             


Asp Val  Val Ile Gly Ile Val  Asn Asn Thr Val Tyr  Asp Pro Leu 
    1115                 1120                 1125             


Gln Pro  Glu Leu Asp Ser Phe  Lys Glu Glu Leu Asp  Lys Tyr Phe 
    1130                 1135                 1140             


Lys Asn  His Thr Ser Pro Asp  Val Asp Leu Gly Asp  Ile Ser Gly 
    1145                 1150                 1155             


Ile Asn  Ala Ser Val Val Asn  Ile Gln Lys Glu Ile  Asp Arg Leu 
    1160                 1165                 1170             


Asn Glu  Val Ala Lys Asn Leu  Asn Glu Ser Leu Ile  Asp Leu Gln 
    1175                 1180                 1185             


Glu Leu  Gly Lys Tyr Glu Gln  Tyr Ile Lys 
    1190                 1195             


<210>  22
<211>  672
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein S1 subunit with an N501T substitution

<400>  22

Gln Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr 
1               5                   10                  15      


Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser 
            20                  25                  30          


Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn 
        35                  40                  45              


Val Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys 
    50                  55                  60                  


Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala 
65                  70                  75                  80  


Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr 
                85                  90                  95      


Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn 
            100                 105                 110         


Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu 
        115                 120                 125             


Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe 
    130                 135                 140                 


Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln 
145                 150                 155                 160 


Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu 
                165                 170                 175     


Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser 
            180                 185                 190         


Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser 
        195                 200                 205             


Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg 
    210                 215                 220                 


Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp 
225                 230                 235                 240 


Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr 
                245                 250                 255     


Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile 
            260                 265                 270         


Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys 
        275                 280                 285             


Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn 
    290                 295                 300                 


Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr 
305                 310                 315                 320 


Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser 
                325                 330                 335     


Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr 
            340                 345                 350         


Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly 
        355                 360                 365             


Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala 
    370                 375                 380                 


Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly 
385                 390                 395                 400 


Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe 
                405                 410                 415     


Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val 
            420                 425                 430         


Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu 
        435                 440                 445             


Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser 
    450                 455                 460                 


Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln 
465                 470                 475                 480 


Ser Tyr Gly Phe Gln Pro Thr Thr Gly Val Gly Tyr Gln Pro Tyr Arg 
                485                 490                 495     


Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys 
            500                 505                 510         


Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe 
        515                 520                 525             


Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys 
    530                 535                 540                 


Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr 
545                 550                 555                 560 


Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro 
                565                 570                 575     


Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser 
            580                 585                 590         


Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro 
        595                 600                 605             


Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser 
    610                 615                 620                 


Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala 
625                 630                 635                 640 


Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly 
                645                 650                 655     


Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg 
            660                 665                 670         


<210>  23
<211>  223
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein RBD-1 with an N501T substitution

<400>  23

Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn 
1               5                   10                  15      


Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val 
            20                  25                  30          


Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser 
        35                  40                  45              


Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val 
    50                  55                  60                  


Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp 
65                  70                  75                  80  


Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln 
                85                  90                  95      


Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr 
            100                 105                 110         


Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly 
        115                 120                 125             


Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys 
    130                 135                 140                 


Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr 
145                 150                 155                 160 


Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser 
                165                 170                 175     


Tyr Gly Phe Gln Pro Thr Thr Gly Val Gly Tyr Gln Pro Tyr Arg Val 
            180                 185                 190         


Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly 
        195                 200                 205             


Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe 
    210                 215                 220             


<210>  24
<211>  198
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein RBD-2 with an N501T 
substitution (RBD-6)

<400>  24

Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe 
1               5                   10                  15      


Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala 
            20                  25                  30          


Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys 
        35                  40                  45              


Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val 
    50                  55                  60                  


Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala 
65                  70                  75                  80  


Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp 
                85                  90                  95      


Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser 
            100                 105                 110         


Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser 
        115                 120                 125             


Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala 
    130                 135                 140                 


Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro 
145                 150                 155                 160 


Leu Gln Ser Tyr Gly Phe Gln Pro Thr Thr Gly Val Gly Tyr Gln Pro 
                165                 170                 175     


Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr 
            180                 185                 190         


Val Cys Gly Pro Lys Lys 
        195             


<210>  25
<211>  194
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein RBD-3 with an N501T 
substitution (RBD-7) 

<400>  25

Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg 
1               5                   10                  15      


Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val 
            20                  25                  30          


Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys 
        35                  40                  45              


Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn 
    50                  55                  60                  


Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile 
65                  70                  75                  80  


Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro 
                85                  90                  95      


Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp 
            100                 105                 110         


Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys 
        115                 120                 125             


Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln 
    130                 135                 140                 


Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe 
145                 150                 155                 160 


Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Thr Gly Val Gly Tyr Gln 
                165                 170                 175     


Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala 
            180                 185                 190         


Thr Val 
        


<210>  26
<211>  211
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein RBD-4 with an N501T 
substitution (RBD-8) 

<400>  26

Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn 
1               5                   10                  15      


Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val 
            20                  25                  30          


Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser 
        35                  40                  45              


Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val 
    50                  55                  60                  


Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp 
65                  70                  75                  80  


Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln 
                85                  90                  95      


Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr 
            100                 105                 110         


Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly 
        115                 120                 125             


Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys 
    130                 135                 140                 


Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr 
145                 150                 155                 160 


Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser 
                165                 170                 175     


Tyr Gly Phe Gln Pro Thr Thr Gly Val Gly Tyr Gln Pro Tyr Arg Val 
            180                 185                 190         


Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly 
        195                 200                 205             


Pro Lys Lys 
    210     


<210>  27
<211>  633
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein RBD-8 coding sequence 

<400>  27
agagtccaac caacagaatc tattgttaga tttcctaata ttacaaactt gtgccctttt       60

ggtgaagttt ttaacgccac cagatttgca tctgtttatg cttggaacag gaagagaatc      120

agcaactgtg ttgctgatta ttctgtccta tataattccg catcattttc cacttttaag      180

tgttatggag tgtctcctac taaattaaat gatctctgct ttactaatgt ctatgcagat      240

tcatttgtaa ttagaggtga tgaagtcaga caaatcgctc cagggcaaac tggaaagatt      300

gctgattata attataaatt accagatgat tttacaggct gcgttatagc ttggaactct      360

aacaatcttg attctaaggt tggtggtaat tataattacc tgtatagatt gtttaggaag      420

tctaatctca aaccttttga gagagatatt tcaactgaaa tctatcaggc cggtagcaca      480

ccttgtaatg gtgttgaagg ttttaattgt tactttcctt tacaatcata tggtttccaa      540

cccactactg gtgttggtta ccaaccatac agagtagtag tactttcttt tgaacttcta      600

catgcaccag caactgtttg tggacctaaa aag                                   633


<210>  28
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  (G3S)2 linker peptide 

<400>  28

Gly Gly Gly Ser Gly Gly Gly Ser 
1               5               


<210>  29
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  (G3S)2 linker peptide coding sequence 

<400>  29
ggaggaggaa gtggaggagg aagt                                              24


<210>  30
<211>  28
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Trimmerization peptide

<400>  30

Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr Val Arg Lys 
1               5                   10                  15      


Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly 
            20                  25              


<210>  31
<211>  84
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Trimmerization peptide coding sequence 

<400>  31
ggctatattc cggaagcgcc gcgcgatggc caggcgtatg tgcgcaaaga tggcgaatgg       60

gtgctgctga gcacctttct gggc                                              84


<210>  32
<211>  1234
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein ECD with C terminal fusion of a 
(G3S)2 linker and a trimmerization peptide 

<400>  32

Gln Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr 
1               5                   10                  15      


Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser 
            20                  25                  30          


Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn 
        35                  40                  45              


Val Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys 
    50                  55                  60                  


Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala 
65                  70                  75                  80  


Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr 
                85                  90                  95      


Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn 
            100                 105                 110         


Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu 
        115                 120                 125             


Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe 
    130                 135                 140                 


Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln 
145                 150                 155                 160 


Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu 
                165                 170                 175     


Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser 
            180                 185                 190         


Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser 
        195                 200                 205             


Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg 
    210                 215                 220                 


Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp 
225                 230                 235                 240 


Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr 
                245                 250                 255     


Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile 
            260                 265                 270         


Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys 
        275                 280                 285             


Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn 
    290                 295                 300                 


Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr 
305                 310                 315                 320 


Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser 
                325                 330                 335     


Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr 
            340                 345                 350         


Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly 
        355                 360                 365             


Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala 
    370                 375                 380                 


Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly 
385                 390                 395                 400 


Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe 
                405                 410                 415     


Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val 
            420                 425                 430         


Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu 
        435                 440                 445             


Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser 
    450                 455                 460                 


Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln 
465                 470                 475                 480 


Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg 
                485                 490                 495     


Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys 
            500                 505                 510         


Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe 
        515                 520                 525             


Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys 
    530                 535                 540                 


Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr 
545                 550                 555                 560 


Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro 
                565                 570                 575     


Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser 
            580                 585                 590         


Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro 
        595                 600                 605             


Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser 
    610                 615                 620                 


Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala 
625                 630                 635                 640 


Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly 
                645                 650                 655     


Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg 
            660                 665                 670         


Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala 
        675                 680                 685             


Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn 
    690                 695                 700                 


Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys 
705                 710                 715                 720 


Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys 
                725                 730                 735     


Ser Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg 
            740                 745                 750         


Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val 
        755                 760                 765             


Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe 
    770                 775                 780                 


Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser 
785                 790                 795                 800 


Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala 
                805                 810                 815     


Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala 
            820                 825                 830         


Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu 
        835                 840                 845             


Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu 
    850                 855                 860                 


Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala 
865                 870                 875                 880 


Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile 
                885                 890                 895     


Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn 
            900                 905                 910         


Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr 
        915                 920                 925             


Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln 
    930                 935                 940                 


Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile 
945                 950                 955                 960 


Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala 
                965                 970                 975     


Glu Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln 
            980                 985                 990         


Thr Tyr Val Thr Gln Gln Leu Ile  Arg Ala Ala Glu Ile  Arg Ala Ser 
        995                 1000                 1005             


Ala Asn  Leu Ala Ala Thr Lys  Met Ser Glu Cys Val  Leu Gly Gln 
    1010                 1015                 1020             


Ser Lys  Arg Val Asp Phe Cys  Gly Lys Gly Tyr His  Leu Met Ser 
    1025                 1030                 1035             


Phe Pro  Gln Ser Ala Pro His  Gly Val Val Phe Leu  His Val Thr 
    1040                 1045                 1050             


Tyr Val  Pro Ala Gln Glu Lys  Asn Phe Thr Thr Ala  Pro Ala Ile 
    1055                 1060                 1065             


Cys His  Asp Gly Lys Ala His  Phe Pro Arg Glu Gly  Val Phe Val 
    1070                 1075                 1080             


Ser Asn  Gly Thr His Trp Phe  Val Thr Gln Arg Asn  Phe Tyr Glu 
    1085                 1090                 1095             


Pro Gln  Ile Ile Thr Thr Asp  Asn Thr Phe Val Ser  Gly Asn Cys 
    1100                 1105                 1110             


Asp Val  Val Ile Gly Ile Val  Asn Asn Thr Val Tyr  Asp Pro Leu 
    1115                 1120                 1125             


Gln Pro  Glu Leu Asp Ser Phe  Lys Glu Glu Leu Asp  Lys Tyr Phe 
    1130                 1135                 1140             


Lys Asn  His Thr Ser Pro Asp  Val Asp Leu Gly Asp  Ile Ser Gly 
    1145                 1150                 1155             


Ile Asn  Ala Ser Val Val Asn  Ile Gln Lys Glu Ile  Asp Arg Leu 
    1160                 1165                 1170             


Asn Glu  Val Ala Lys Asn Leu  Asn Glu Ser Leu Ile  Asp Leu Gln 
    1175                 1180                 1185             


Glu Leu  Gly Lys Tyr Glu Gln  Tyr Ile Lys Gly Gly  Gly Ser Gly 
    1190                 1195                 1200             


Gly Gly  Ser Gly Tyr Ile Pro  Glu Ala Pro Arg Asp  Gly Gln Ala 
    1205                 1210                 1215             


Tyr Val  Arg Lys Asp Gly Glu  Trp Val Leu Leu Ser  Thr Phe Leu 
    1220                 1225                 1230             


Gly 
    


<210>  33
<211>  3702
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein ECD with C terminal fusion of a 
(G3S)2 linker and a trimmerization peptide coding sequence

<400>  33
cagtgtgtta atcttacaac cagaactcaa ttaccccctg catacactaa ttctttcaca       60

cgtggtgttt attaccctga caaagttttc agatcctcag ttttacattc aactcaggac      120

ttgttcttac ctttcttttc caatgttact tggttccatg ctatacatgt ctctgggacc      180

aatggtacta agaggtttga taaccctgtc ctaccattta atgatggtgt ttattttgct      240

tccactgaga agtctaacat aataagaggc tggatttttg gtactacttt agattcgaag      300

acccagtccc tacttattgt taataacgct actaatgttg ttattaaagt ctgtgaattt      360

caattttgta atgatccatt tttgggtgtt tattaccaca aaaacaacaa aagttggatg      420

gaaagtgagt tcagagttta ttctagtgcg aataattgca cttttgaata tgtctctcag      480

ccttttctta tggaccttga aggaaaacag ggtaatttca aaaatcttag ggaatttgtg      540

tttaagaata ttgatggtta ttttaaaata tattctaagc acacgcctat taatttagtg      600

cgtgatctcc ctcagggttt ttcggcttta gaaccattgg tagatttgcc aataggtatt      660

aacatcacta ggtttcaaac tttacttgct ttacatagaa gttatttgac tcctggtgat      720

tcttcttcag gttggacagc tggtgctgca gcttattatg tgggttatct tcaacctagg      780

acttttctat taaaatataa tgaaaatgga accattacag atgctgtaga ctgtgcactt      840

gaccctctct cagaaacaaa gtgtacgttg aaatccttca ctgtagaaaa aggaatctat      900

caaacttcta actttagagt ccaaccaaca gaatctattg ttagatttcc taatattaca      960

aacttgtgcc cttttggtga agtttttaac gccaccagat ttgcatctgt ttatgcttgg     1020

aacaggaaga gaatcagcaa ctgtgttgct gattattctg tcctatataa ttccgcatca     1080

ttttccactt ttaagtgtta tggagtgtct cctactaaat taaatgatct ctgctttact     1140

aatgtctatg cagattcatt tgtaattaga ggtgatgaag tcagacaaat cgctccaggg     1200

caaactggaa agattgctga ttataattat aaattaccag atgattttac aggctgcgtt     1260

atagcttgga actctaacaa tcttgattct aaggttggtg gtaattataa ttacctgtat     1320

agattgttta ggaagtctaa tctcaaacct tttgagagag atatttcaac tgaaatctat     1380

caggccggta gcacaccttg taatggtgtt gaaggtttta attgttactt tcctttacaa     1440

tcatatggtt tccaacccac taatggtgtt ggttaccaac catacagagt agtagtactt     1500

tcttttgaac ttctacatgc accagcaact gtttgtggac ctaaaaagtc tactaatttg     1560

gttaaaaaca aatgtgtcaa tttcaacttc aatggtttaa caggcacagg tgttcttact     1620

gagtctaaca aaaagtttct gcctttccaa caatttggca gagacattgc tgacactact     1680

gatgctgtcc gtgatccaca gacacttgag attcttgaca ttacaccatg ttcttttggt     1740

ggtgtcagtg ttataacacc aggaacaaat acttctaacc aggttgctgt tctttatcag     1800

gatgttaact gcacagaagt ccctgttgct attcatgcag atcaacttac tcctacttgg     1860

cgtgtttatt ctacaggttc taatgttttt caaacacgtg caggctgttt aataggggct     1920

gaacatgtca acaactcata tgagtgtgac atacccattg gtgcaggtat atgcgctagt     1980

tatcagactc agactaattc tcctcggcgg gcacgtagtg tagctagtca atccatcatt     2040

gcctacacta tgtcacttgg tgcagaaaat tcagttgctt actctaataa ctctattgcc     2100

atacccacaa attttactat tagtgttacc acagaaattc taccagtgtc tatgaccaag     2160

acatcagtag attgtacaat gtacatttgt ggtgattcaa ctgaatgcag caatcttttg     2220

ttgcaatatg gcagtttttg tacacaatta aaccgtgctt taactggaat agctgttgaa     2280

caagacaaaa acacccaaga agtttttgca caagtcaaac aaatttacaa aacaccacca     2340

attaaagatt ttggtggttt taatttttca caaatattac cagatccatc aaaaccaagc     2400

aagaggtcat ttattgaaga tctacttttc aacaaagtga cacttgcaga tgctggcttc     2460

atcaaacaat atggtgattg ccttggtgat attgctgcta gagacctcat ttgtgcacaa     2520

aagtttaacg gccttactgt tttgccacct ttgctcacag atgaaatgat tgctcaatac     2580

acttctgcac tgttagcggg tacaatcact tctggttgga cctttggtgc aggtgctgca     2640

ttacaaatac catttgctat gcaaatggct tataggttta atggtattgg agttacacag     2700

aatgttctct atgagaacca aaaattgatt gccaaccaat ttaatagtgc tattggcaaa     2760

attcaagact cactttcttc cacagcaagt gcacttggaa aacttcaaga tgtggtcaac     2820

caaaatgcac aagctttaaa cacgcttgtt aaacaactta gctccaattt tggtgcaatt     2880

tcaagtgttt taaatgatat cctttcacgt cttgacaaag ttgaggctga agtgcaaatt     2940

gataggttga tcacaggcag acttcaaagt ttgcagacat atgtgactca acaattaatt     3000

agagctgcag aaatcagagc ttctgctaat cttgctgcta ctaaaatgtc agagtgtgta     3060

cttggacaat caaaaagagt tgatttttgt ggaaagggct atcatcttat gtccttccct     3120

cagtcagcac ctcatggtgt agtcttcttg catgtgactt atgtccctgc acaagaaaag     3180

aacttcacaa ctgctcctgc catttgtcat gatggaaaag cacactttcc tcgtgaaggt     3240

gtctttgttt caaatggcac acactggttt gtaacacaaa ggaattttta tgaaccacaa     3300

atcattacta cagacaacac atttgtgtct ggtaactgtg atgttgtaat aggaattgtc     3360

aacaacacag tttatgatcc tttgcaacct gaattagact cattcaagga ggagttagat     3420

aaatatttta agaatcatac atcaccagat gttgatttag gtgacatctc tggcattaat     3480

gcttcagttg taaacattca aaaagaaatt gaccgcctca atgaggttgc caagaattta     3540

aatgaatctc tcatcgatct ccaagaactt ggaaagtatg agcagtatat aaaaggagga     3600

ggaagtggag gaggaagtgg ctatattccg gaagcgccgc gcgatggcca ggcgtatgtg     3660

cgcaaagatg gcgaatgggt gctgctgagc acctttctgg gc                        3702


<210>  34
<211>  235
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein RBD-2 with C-terminal fusion of a 
(G3S)2 linker and a trimmerization peptide 

<400>  34

Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg 
1               5                   10                  15      


Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val 
            20                  25                  30          


Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys 
        35                  40                  45              


Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn 
    50                  55                  60                  


Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile 
65                  70                  75                  80  


Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro 
                85                  90                  95      


Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp 
            100                 105                 110         


Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys 
        115                 120                 125             


Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln 
    130                 135                 140                 


Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe 
145                 150                 155                 160 


Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln 
                165                 170                 175     


Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala 
            180                 185                 190         


Thr Val Cys Gly Pro Lys Lys Gly Gly Gly Ser Gly Gly Gly Ser Gly 
        195                 200                 205             


Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr Val Arg Lys Asp 
    210                 215                 220                 


Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly 
225                 230                 235 


<210>  35
<211>  705
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein RBD-2 with C-terminal fusion of a 
(G3S)2 linker and a trimmerization peptide coding sequence 

<400>  35
aatattacaa acttgtgccc ttttggtgaa gtttttaacg ccaccagatt tgcatctgtt       60

tatgcttgga acaggaagag aatcagcaac tgtgttgctg attattctgt cctatataat      120

tccgcatcat tttccacttt taagtgttat ggagtgtctc ctactaaatt aaatgatctc      180

tgctttacta atgtctatgc agattcattt gtaattagag gtgatgaagt cagacaaatc      240

gctccagggc aaactggaaa gattgctgat tataattata aattaccaga tgattttaca      300

ggctgcgtta tagcttggaa ctctaacaat cttgattcta aggttggtgg taattataat      360

tacctgtata gattgtttag gaagtctaat ctcaaacctt ttgagagaga tatttcaact      420

gaaatctatc aggccggtag cacaccttgt aatggtgttg aaggttttaa ttgttacttt      480

cctttacaat catatggttt ccaacccact aatggtgttg gttaccaacc atacagagta      540

gtagtacttt cttttgaact tctacatgca ccagcaactg tttgtggacc taaaaaggga      600

ggaggaagtg gaggaggaag tggctatatt ccggaagcgc cgcgcgatgg ccaggcgtat      660

gtgcgcaaag atggcgaatg ggtgctgctg agcacctttc tgggc                      705


<210>  36
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein native signal peptide

<400>  36

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser 
1               5                   10              


<210>  37
<211>  39
<212>  DNA
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein native signal peptide coding sequence

<400>  37
atgtttgttt ttcttgtttt attgccatta gtctctagt                              39


<210>  38
<211>  18
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein native signal peptide coding sequence 

<400>  38

Met Asp Trp Thr Trp Ile Leu Phe Leu Val Ala Ala Ala Thr Arg Val 
1               5                   10                  15      


His Ser 
        


<210>  39
<211>  54
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Human IgE signal peptide amino acid sequence

<400>  39
atggactgga cctggattct cttcttggtg gcagcagcca cgcgagtcca ctcc             54


<210>  40
<211>  1260
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein without native signal peptide 

<400>  40

Gln Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr 
1               5                   10                  15      


Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser 
            20                  25                  30          


Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn 
        35                  40                  45              


Val Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys 
    50                  55                  60                  


Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala 
65                  70                  75                  80  


Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr 
                85                  90                  95      


Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn 
            100                 105                 110         


Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu 
        115                 120                 125             


Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe 
    130                 135                 140                 


Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln 
145                 150                 155                 160 


Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu 
                165                 170                 175     


Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser 
            180                 185                 190         


Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser 
        195                 200                 205             


Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg 
    210                 215                 220                 


Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp 
225                 230                 235                 240 


Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr 
                245                 250                 255     


Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile 
            260                 265                 270         


Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys 
        275                 280                 285             


Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn 
    290                 295                 300                 


Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr 
305                 310                 315                 320 


Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser 
                325                 330                 335     


Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr 
            340                 345                 350         


Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly 
        355                 360                 365             


Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala 
    370                 375                 380                 


Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly 
385                 390                 395                 400 


Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe 
                405                 410                 415     


Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val 
            420                 425                 430         


Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu 
        435                 440                 445             


Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser 
    450                 455                 460                 


Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln 
465                 470                 475                 480 


Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg 
                485                 490                 495     


Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys 
            500                 505                 510         


Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe 
        515                 520                 525             


Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys 
    530                 535                 540                 


Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr 
545                 550                 555                 560 


Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro 
                565                 570                 575     


Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser 
            580                 585                 590         


Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro 
        595                 600                 605             


Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser 
    610                 615                 620                 


Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala 
625                 630                 635                 640 


Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly 
                645                 650                 655     


Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg 
            660                 665                 670         


Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala 
        675                 680                 685             


Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn 
    690                 695                 700                 


Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys 
705                 710                 715                 720 


Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys 
                725                 730                 735     


Ser Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg 
            740                 745                 750         


Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val 
        755                 760                 765             


Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe 
    770                 775                 780                 


Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser 
785                 790                 795                 800 


Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala 
                805                 810                 815     


Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala 
            820                 825                 830         


Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu 
        835                 840                 845             


Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu 
    850                 855                 860                 


Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala 
865                 870                 875                 880 


Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile 
                885                 890                 895     


Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn 
            900                 905                 910         


Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr 
        915                 920                 925             


Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln 
    930                 935                 940                 


Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile 
945                 950                 955                 960 


Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala 
                965                 970                 975     


Glu Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln 
            980                 985                 990         


Thr Tyr Val Thr Gln Gln Leu Ile  Arg Ala Ala Glu Ile  Arg Ala Ser 
        995                 1000                 1005             


Ala Asn  Leu Ala Ala Thr Lys  Met Ser Glu Cys Val  Leu Gly Gln 
    1010                 1015                 1020             


Ser Lys  Arg Val Asp Phe Cys  Gly Lys Gly Tyr His  Leu Met Ser 
    1025                 1030                 1035             


Phe Pro  Gln Ser Ala Pro His  Gly Val Val Phe Leu  His Val Thr 
    1040                 1045                 1050             


Tyr Val  Pro Ala Gln Glu Lys  Asn Phe Thr Thr Ala  Pro Ala Ile 
    1055                 1060                 1065             


Cys His  Asp Gly Lys Ala His  Phe Pro Arg Glu Gly  Val Phe Val 
    1070                 1075                 1080             


Ser Asn  Gly Thr His Trp Phe  Val Thr Gln Arg Asn  Phe Tyr Glu 
    1085                 1090                 1095             


Pro Gln  Ile Ile Thr Thr Asp  Asn Thr Phe Val Ser  Gly Asn Cys 
    1100                 1105                 1110             


Asp Val  Val Ile Gly Ile Val  Asn Asn Thr Val Tyr  Asp Pro Leu 
    1115                 1120                 1125             


Gln Pro  Glu Leu Asp Ser Phe  Lys Glu Glu Leu Asp  Lys Tyr Phe 
    1130                 1135                 1140             


Lys Asn  His Thr Ser Pro Asp  Val Asp Leu Gly Asp  Ile Ser Gly 
    1145                 1150                 1155             


Ile Asn  Ala Ser Val Val Asn  Ile Gln Lys Glu Ile  Asp Arg Leu 
    1160                 1165                 1170             


Asn Glu  Val Ala Lys Asn Leu  Asn Glu Ser Leu Ile  Asp Leu Gln 
    1175                 1180                 1185             


Glu Leu  Gly Lys Tyr Glu Gln  Tyr Ile Lys Trp Pro  Trp Tyr Ile 
    1190                 1195                 1200             


Trp Leu  Gly Phe Ile Ala Gly  Leu Ile Ala Ile Val  Met Val Thr 
    1205                 1210                 1215             


Ile Met  Leu Cys Cys Met Thr  Ser Cys Cys Ser Cys  Leu Lys Gly 
    1220                 1225                 1230             


Cys Cys  Ser Cys Gly Ser Cys  Cys Lys Phe Asp Glu  Asp Asp Ser 
    1235                 1240                 1245             


Glu Pro  Val Leu Lys Gly Val  Lys Leu His Tyr Thr  
    1250                 1255                 1260 


<210>  41
<211>  3633
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2 spike protein without native signal peptide 
coding sequence 

<400>  41
atgtttgttt ttcttgtttt attgccatta gtctctagtc agtgtgttaa tcttacaacc       60

agaactcaat taccccctgc atacactaat tctttcacac gtggtgttta ttaccctgac      120

aaagttttca gatcctcagt tttacattca actcaggact tgttcttacc tttcttttcc      180

aatgttactt ggttccatgc tatacatgtc tctgggacca atggtactaa gaggtttgat      240

aaccctgtcc taccatttaa tgatggtgtt tattttgctt ccactgagaa gtctaacata      300

ataagaggct ggatttttgg tactacttta gattcgaaga cccagtccct acttattgtt      360

aataacgcta ctaatgttgt tattaaagtc tgtgaatttc aattttgtaa tgatccattt      420

ttgggtgttt attaccacaa aaacaacaaa agttggatgg aaagtgagtt cagagtttat      480

tctagtgcga ataattgcac ttttgaatat gtctctcagc cttttcttat ggaccttgaa      540

ggaaaacagg gtaatttcaa aaatcttagg gaatttgtgt ttaagaatat tgatggttat      600

tttaaaatat attctaagca cacgcctatt aatttagtgc gtgatctccc tcagggtttt      660

tcggctttag aaccattggt agatttgcca ataggtatta acatcactag gtttcaaact      720

ttacttgctt tacatagaag ttatttgact cctggtgatt cttcttcagg ttggacagct      780

ggtgctgcag cttattatgt gggttatctt caacctagga cttttctatt aaaatataat      840

gaaaatggaa ccattacaga tgctgtagac tgtgcacttg accctctctc agaaacaaag      900

tgtacgttga aatccttcac tgtagaaaaa ggaatctatc aaacttctaa ctttagagtc      960

caaccaacag aatctattgt tagatttcct aatattacaa acttgtgccc ttttggtgaa     1020

gtttttaacg ccaccagatt tgcatctgtt tatgcttgga acaggaagag aatcagcaac     1080

tgtgttgctg attattctgt cctatataat tccgcatcat tttccacttt taagtgttat     1140

ggagtgtctc ctactaaatt aaatgatctc tgctttacta atgtctatgc agattcattt     1200

gtaattagag gtgatgaagt cagacaaatc gctccagggc aaactggaaa gattgctgat     1260

tataattata aattaccaga tgattttaca ggctgcgtta tagcttggaa ctctaacaat     1320

cttgattcta aggttggtgg taattataat tacctgtata gattgtttag gaagtctaat     1380

ctcaaacctt ttgagagaga tatttcaact gaaatctatc aggccggtag cacaccttgt     1440

aatggtgttg aaggttttaa ttgttacttt cctttacaat catatggttt ccaacccact     1500

aatggtgttg gttaccaacc atacagagta gtagtacttt cttttgaact tctacatgca     1560

ccagcaactg tttgtggacc taaaaagtct actaatttgg ttaaaaacaa atgtgtcaat     1620

ttcaacttca atggtttaac aggcacaggt gttcttactg agtctaacaa aaagtttctg     1680

cctttccaac aatttggcag agacattgct gacactactg atgctgtccg tgatccacag     1740

acacttgaga ttcttgacat tacaccatgt tcttttggtg gtgtcagtgt tataacacca     1800

ggaacaaata cttctaacca ggttgctgtt ctttatcagg atgttaactg cacagaagtc     1860

cctgttgcta ttcatgcaga tcaacttact cctacttggc gtgtttattc tacaggttct     1920

aatgtttttc aaacacgtgc aggctgttta ataggggctg aacatgtcaa caactcatat     1980

gagtgtgaca tacccattgg tgcaggtata tgcgctagtt atcagactca gactaattct     2040

cctcggcggg cacgtagtgt agctagtcaa tccatcattg cctacactat gtcacttggt     2100

gcagaaaatt cagttgctta ctctaataac tctattgcca tacccacaaa ttttactatt     2160

agtgttacca cagaaattct accagtgtct atgaccaaga catcagtaga ttgtacaatg     2220

tacatttgtg gtgattcaac tgaatgcagc aatcttttgt tgcaatatgg cagtttttgt     2280

acacaattaa accgtgcttt aactggaata gctgttgaac aagacaaaaa cacccaagaa     2340

gtttttgcac aagtcaaaca aatttacaaa acaccaccaa ttaaagattt tggtggtttt     2400

aatttttcac aaatattacc agatccatca aaaccaagca agaggtcatt tattgaagat     2460

ctacttttca acaaagtgac acttgcagat gctggcttca tcaaacaata tggtgattgc     2520

cttggtgata ttgctgctag agacctcatt tgtgcacaaa agtttaacgg ccttactgtt     2580

ttgccacctt tgctcacaga tgaaatgatt gctcaataca cttctgcact gttagcgggt     2640

acaatcactt ctggttggac ctttggtgca ggtgctgcat tacaaatacc atttgctatg     2700

caaatggctt ataggtttaa tggtattgga gttacacaga atgttctcta tgagaaccaa     2760

aaattgattg ccaaccaatt taatagtgct attggcaaaa ttcaagactc actttcttcc     2820

acagcaagtg cacttggaaa acttcaagat gtggtcaacc aaaatgcaca agctttaaac     2880

acgcttgtta aacaacttag ctccaatttt ggtgcaattt caagtgtttt aaatgatatc     2940

ctttcacgtc ttgacaaagt tgaggctgaa gtgcaaattg ataggttgat cacaggcaga     3000

cttcaaagtt tgcagacata tgtgactcaa caattaatta gagctgcaga aatcagagct     3060

tctgctaatc ttgctgctac taaaatgtca gagtgtgtac ttggacaatc aaaaagagtt     3120

gatttttgtg gaaagggcta tcatcttatg tccttccctc agtcagcacc tcatggtgta     3180

gtcttcttgc atgtgactta tgtccctgca caagaaaaga acttcacaac tgctcctgcc     3240

atttgtcatg atggaaaagc acactttcct cgtgaaggtg tctttgtttc aaatggcaca     3300

cactggtttg taacacaaag gaatttttat gaaccacaaa tcattactac agacaacaca     3360

tttgtgtctg gtaactgtga tgttgtaata ggaattgtca acaacacagt ttatgatcct     3420

ttgcaacctg aattagactc attcaaggag gagttagata aatattttaa gaatcataca     3480

tcaccagatg ttgatttagg tgacatctct ggcattaatg cttcagttgt aaacattcaa     3540

aaagaaattg accgcctcaa tgaggttgcc aagaatttaa atgaatctct catcgatctc     3600

caagaacttg gaaagtatga gcagtatata aaa                                  3633


<210>  42
<211>  1211
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein ectodomain (ECD) with native 
signal peptide 

<400>  42

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
            340                 345                 350         


Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
        355                 360                 365             


Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
    370                 375                 380                 


Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
385                 390                 395                 400 


Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                405                 410                 415     


Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
            420                 425                 430         


Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
        435                 440                 445             


Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
    450                 455                 460                 


Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
465                 470                 475                 480 


Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                485                 490                 495     


Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
            500                 505                 510         


Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
        515                 520                 525             


Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
    530                 535                 540                 


Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
545                 550                 555                 560 


Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                565                 570                 575     


Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
            580                 585                 590         


Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
        595                 600                 605             


Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
    610                 615                 620                 


His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
625                 630                 635                 640 


Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                645                 650                 655     


Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
            660                 665                 670         


Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala 
        675                 680                 685             


Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser 
    690                 695                 700                 


Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile 
705                 710                 715                 720 


Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val 
                725                 730                 735     


Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu 
            740                 745                 750         


Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr 
        755                 760                 765             


Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln 
    770                 775                 780                 


Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe 
785                 790                 795                 800 


Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser 
                805                 810                 815     


Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly 
            820                 825                 830         


Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp 
        835                 840                 845             


Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu 
    850                 855                 860                 


Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly 
865                 870                 875                 880 


Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile 
                885                 890                 895     


Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr 
            900                 905                 910         


Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn 
        915                 920                 925             


Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala 
    930                 935                 940                 


Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn 
945                 950                 955                 960 


Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val 
                965                 970                 975     


Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln 
            980                 985                 990         


Ile Asp Arg Leu Ile Thr Gly Arg  Leu Gln Ser Leu Gln  Thr Tyr Val 
        995                 1000                 1005             


Thr Gln  Gln Leu Ile Arg Ala  Ala Glu Ile Arg Ala  Ser Ala Asn 
    1010                 1015                 1020             


Leu Ala  Ala Thr Lys Met Ser  Glu Cys Val Leu Gly  Gln Ser Lys 
    1025                 1030                 1035             


Arg Val  Asp Phe Cys Gly Lys  Gly Tyr His Leu Met  Ser Phe Pro 
    1040                 1045                 1050             


Gln Ser  Ala Pro His Gly Val  Val Phe Leu His Val  Thr Tyr Val 
    1055                 1060                 1065             


Pro Ala  Gln Glu Lys Asn Phe  Thr Thr Ala Pro Ala  Ile Cys His 
    1070                 1075                 1080             


Asp Gly  Lys Ala His Phe Pro  Arg Glu Gly Val Phe  Val Ser Asn 
    1085                 1090                 1095             


Gly Thr  His Trp Phe Val Thr  Gln Arg Asn Phe Tyr  Glu Pro Gln 
    1100                 1105                 1110             


Ile Ile  Thr Thr Asp Asn Thr  Phe Val Ser Gly Asn  Cys Asp Val 
    1115                 1120                 1125             


Val Ile  Gly Ile Val Asn Asn  Thr Val Tyr Asp Pro  Leu Gln Pro 
    1130                 1135                 1140             


Glu Leu  Asp Ser Phe Lys Glu  Glu Leu Asp Lys Tyr  Phe Lys Asn 
    1145                 1150                 1155             


His Thr  Ser Pro Asp Val Asp  Leu Gly Asp Ile Ser  Gly Ile Asn 
    1160                 1165                 1170             


Ala Ser  Val Val Asn Ile Gln  Lys Glu Ile Asp Arg  Leu Asn Glu 
    1175                 1180                 1185             


Val Ala  Lys Asn Leu Asn Glu  Ser Leu Ile Asp Leu  Gln Glu Leu 
    1190                 1195                 1200             


Gly Lys  Tyr Glu Gln Tyr Ile  Lys 
    1205                 1210     


<210>  43
<211>  3633
<212>  DNA
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein ectodomain (ECD) with native signal
 peptide coding sequence 

<400>  43
atgtttgttt ttcttgtttt attgccatta gtctctagtc agtgtgttaa tcttacaacc       60

agaactcaat taccccctgc atacactaat tctttcacac gtggtgttta ttaccctgac      120

aaagttttca gatcctcagt tttacattca actcaggact tgttcttacc tttcttttcc      180

aatgttactt ggttccatgc tatacatgtc tctgggacca atggtactaa gaggtttgat      240

aaccctgtcc taccatttaa tgatggtgtt tattttgctt ccactgagaa gtctaacata      300

ataagaggct ggatttttgg tactacttta gattcgaaga cccagtccct acttattgtt      360

aataacgcta ctaatgttgt tattaaagtc tgtgaatttc aattttgtaa tgatccattt      420

ttgggtgttt attaccacaa aaacaacaaa agttggatgg aaagtgagtt cagagtttat      480

tctagtgcga ataattgcac ttttgaatat gtctctcagc cttttcttat ggaccttgaa      540

ggaaaacagg gtaatttcaa aaatcttagg gaatttgtgt ttaagaatat tgatggttat      600

tttaaaatat attctaagca cacgcctatt aatttagtgc gtgatctccc tcagggtttt      660

tcggctttag aaccattggt agatttgcca ataggtatta acatcactag gtttcaaact      720

ttacttgctt tacatagaag ttatttgact cctggtgatt cttcttcagg ttggacagct      780

ggtgctgcag cttattatgt gggttatctt caacctagga cttttctatt aaaatataat      840

gaaaatggaa ccattacaga tgctgtagac tgtgcacttg accctctctc agaaacaaag      900

tgtacgttga aatccttcac tgtagaaaaa ggaatctatc aaacttctaa ctttagagtc      960

caaccaacag aatctattgt tagatttcct aatattacaa acttgtgccc ttttggtgaa     1020

gtttttaacg ccaccagatt tgcatctgtt tatgcttgga acaggaagag aatcagcaac     1080

tgtgttgctg attattctgt cctatataat tccgcatcat tttccacttt taagtgttat     1140

ggagtgtctc ctactaaatt aaatgatctc tgctttacta atgtctatgc agattcattt     1200

gtaattagag gtgatgaagt cagacaaatc gctccagggc aaactggaaa gattgctgat     1260

tataattata aattaccaga tgattttaca ggctgcgtta tagcttggaa ctctaacaat     1320

cttgattcta aggttggtgg taattataat tacctgtata gattgtttag gaagtctaat     1380

ctcaaacctt ttgagagaga tatttcaact gaaatctatc aggccggtag cacaccttgt     1440

aatggtgttg aaggttttaa ttgttacttt cctttacaat catatggttt ccaacccact     1500

aatggtgttg gttaccaacc atacagagta gtagtacttt cttttgaact tctacatgca     1560

ccagcaactg tttgtggacc taaaaagtct actaatttgg ttaaaaacaa atgtgtcaat     1620

ttcaacttca atggtttaac aggcacaggt gttcttactg agtctaacaa aaagtttctg     1680

cctttccaac aatttggcag agacattgct gacactactg atgctgtccg tgatccacag     1740

acacttgaga ttcttgacat tacaccatgt tcttttggtg gtgtcagtgt tataacacca     1800

ggaacaaata cttctaacca ggttgctgtt ctttatcagg atgttaactg cacagaagtc     1860

cctgttgcta ttcatgcaga tcaacttact cctacttggc gtgtttattc tacaggttct     1920

aatgtttttc aaacacgtgc aggctgttta ataggggctg aacatgtcaa caactcatat     1980

gagtgtgaca tacccattgg tgcaggtata tgcgctagtt atcagactca gactaattct     2040

cctcggcggg cacgtagtgt agctagtcaa tccatcattg cctacactat gtcacttggt     2100

gcagaaaatt cagttgctta ctctaataac tctattgcca tacccacaaa ttttactatt     2160

agtgttacca cagaaattct accagtgtct atgaccaaga catcagtaga ttgtacaatg     2220

tacatttgtg gtgattcaac tgaatgcagc aatcttttgt tgcaatatgg cagtttttgt     2280

acacaattaa accgtgcttt aactggaata gctgttgaac aagacaaaaa cacccaagaa     2340

gtttttgcac aagtcaaaca aatttacaaa acaccaccaa ttaaagattt tggtggtttt     2400

aatttttcac aaatattacc agatccatca aaaccaagca agaggtcatt tattgaagat     2460

ctacttttca acaaagtgac acttgcagat gctggcttca tcaaacaata tggtgattgc     2520

cttggtgata ttgctgctag agacctcatt tgtgcacaaa agtttaacgg ccttactgtt     2580

ttgccacctt tgctcacaga tgaaatgatt gctcaataca cttctgcact gttagcgggt     2640

acaatcactt ctggttggac ctttggtgca ggtgctgcat tacaaatacc atttgctatg     2700

caaatggctt ataggtttaa tggtattgga gttacacaga atgttctcta tgagaaccaa     2760

aaattgattg ccaaccaatt taatagtgct attggcaaaa ttcaagactc actttcttcc     2820

acagcaagtg cacttggaaa acttcaagat gtggtcaacc aaaatgcaca agctttaaac     2880

acgcttgtta aacaacttag ctccaatttt ggtgcaattt caagtgtttt aaatgatatc     2940

ctttcacgtc ttgacaaagt tgaggctgaa gtgcaaattg ataggttgat cacaggcaga     3000

cttcaaagtt tgcagacata tgtgactcaa caattaatta gagctgcaga aatcagagct     3060

tctgctaatc ttgctgctac taaaatgtca gagtgtgtac ttggacaatc aaaaagagtt     3120

gatttttgtg gaaagggcta tcatcttatg tccttccctc agtcagcacc tcatggtgta     3180

gtcttcttgc atgtgactta tgtccctgca caagaaaaga acttcacaac tgctcctgcc     3240

atttgtcatg atggaaaagc acactttcct cgtgaaggtg tctttgtttc aaatggcaca     3300

cactggtttg taacacaaag gaatttttat gaaccacaaa tcattactac agacaacaca     3360

tttgtgtctg gtaactgtga tgttgtaata ggaattgtca acaacacagt ttatgatcct     3420

ttgcaacctg aattagactc attcaaggag gagttagata aatattttaa gaatcataca     3480

tcaccagatg ttgatttagg tgacatctct ggcattaatg cttcagttgt aaacattcaa     3540

aaagaaattg accgcctcaa tgaggttgcc aagaatttaa atgaatctct catcgatctc     3600

caagaacttg gaaagtatga gcagtatata aaa                                  3633


<210>  44
<211>  685
<212>  PRT
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein S1 subunit with native signal peptide 

<400>  44

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
            340                 345                 350         


Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
        355                 360                 365             


Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
    370                 375                 380                 


Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
385                 390                 395                 400 


Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                405                 410                 415     


Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
            420                 425                 430         


Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
        435                 440                 445             


Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
    450                 455                 460                 


Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
465                 470                 475                 480 


Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                485                 490                 495     


Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
            500                 505                 510         


Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
        515                 520                 525             


Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
    530                 535                 540                 


Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
545                 550                 555                 560 


Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                565                 570                 575     


Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
            580                 585                 590         


Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
        595                 600                 605             


Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
    610                 615                 620                 


His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
625                 630                 635                 640 


Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                645                 650                 655     


Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
            660                 665                 670         


Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg 
        675                 680                 685 


<210>  45
<211>  2055
<212>  DNA
<213>  SARS-CoV-2

<220>
<223>  SARS-CoV-2 spike protein S1 subunit with native signal 
peptide coding sequence 

<400>  45
atgtttgttt ttcttgtttt attgccatta gtctctagtc agtgtgttaa tcttacaacc       60

agaactcaat taccccctgc atacactaat tctttcacac gtggtgttta ttaccctgac      120

aaagttttca gatcctcagt tttacattca actcaggact tgttcttacc tttcttttcc      180

aatgttactt ggttccatgc tatacatgtc tctgggacca atggtactaa gaggtttgat      240

aaccctgtcc taccatttaa tgatggtgtt tattttgctt ccactgagaa gtctaacata      300

ataagaggct ggatttttgg tactacttta gattcgaaga cccagtccct acttattgtt      360

aataacgcta ctaatgttgt tattaaagtc tgtgaatttc aattttgtaa tgatccattt      420

ttgggtgttt attaccacaa aaacaacaaa agttggatgg aaagtgagtt cagagtttat      480

tctagtgcga ataattgcac ttttgaatat gtctctcagc cttttcttat ggaccttgaa      540

ggaaaacagg gtaatttcaa aaatcttagg gaatttgtgt ttaagaatat tgatggttat      600

tttaaaatat attctaagca cacgcctatt aatttagtgc gtgatctccc tcagggtttt      660

tcggctttag aaccattggt agatttgcca ataggtatta acatcactag gtttcaaact      720

ttacttgctt tacatagaag ttatttgact cctggtgatt cttcttcagg ttggacagct      780

ggtgctgcag cttattatgt gggttatctt caacctagga cttttctatt aaaatataat      840

gaaaatggaa ccattacaga tgctgtagac tgtgcacttg accctctctc agaaacaaag      900

tgtacgttga aatccttcac tgtagaaaaa ggaatctatc aaacttctaa ctttagagtc      960

caaccaacag aatctattgt tagatttcct aatattacaa acttgtgccc ttttggtgaa     1020

gtttttaacg ccaccagatt tgcatctgtt tatgcttgga acaggaagag aatcagcaac     1080

tgtgttgctg attattctgt cctatataat tccgcatcat tttccacttt taagtgttat     1140

ggagtgtctc ctactaaatt aaatgatctc tgctttacta atgtctatgc agattcattt     1200

gtaattagag gtgatgaagt cagacaaatc gctccagggc aaactggaaa gattgctgat     1260

tataattata aattaccaga tgattttaca ggctgcgtta tagcttggaa ttctaacaat     1320

cttgattcta aggttggtgg taattataat tacctgtata gattgtttag gaagtctaat     1380

ctcaaacctt ttgagagaga tatttcaact gaaatctatc aggccggtag cacaccttgt     1440

aatggtgttg aaggttttaa ttgttacttt cctttacaat catatggttt ccaacccact     1500

aatggtgttg gttaccaacc atacagagta gtagtacttt cttttgaact tctacatgca     1560

ccagcaactg tttgtggacc taaaaagtct actaatttgg ttaaaaacaa atgtgtcaat     1620

ttcaacttca atggtttaac aggcacaggt gttcttactg agtctaacaa aaagtttctg     1680

cctttccaac aatttggcag agacattgct gacactactg atgctgtccg tgatccacag     1740

acacttgaga ttcttgacat tacaccatgt tcttttggtg gtgtcagtgt tataacacca     1800

ggaacaaata cttctaacca ggttgctgtt ctttatcagg atgttaactg cacagaagtc     1860

cctgttgcta ttcatgcaga tcaacttact cctacttggc gtgtttattc tacaggttct     1920

aatgtttttc aaacacgtgc aggctgttta ataggggctg aacatgtcaa caactcatat     1980

gagtgtgaca tacccattgg tgcaggtata tgcgctagtt atcagactca gactaattct     2040

cctcggcggg cacgt                                                      2055


<210>  46
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 5-Prime-UTR DNA Sequence 

<400>  46
gaaataagag agaaaagaag agtaagaaga aatataaga                              39


<210>  47
<211>  39
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 5-Prime-UTR RNA Sequence 

<400>  47
gaaauaagag agaaaagaag aguaagaaga aauauaaga                              39


<210>  48
<211>  43
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 5-Prime-UTR DNA Sequence 

<400>  48
cttgttcttt ttgcagaagc tcagaataaa cgctcaactt tgg                         43


<210>  49
<211>  43
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 5-Prime-UTR RNA Sequence 

<400>  49
cuuguucuuu uugcagaagc ucagaauaaa cgcucaacuu ugg                         43


<210>  50
<211>  100
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 5-Prime-UTR DNA Sequence 

<400>  50
gcaggagcca gggctgggca taaaagtcag ggcagagcca tctattgctt acatttgctt       60

ctgacacaac tgtgttcact agcaacctca aacagacacc                            100


<210>  51
<211>  100
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 5-Prime-UTR RNA Sequence 

<400>  51
gcaggagcca gggcugggca uaaaagucag ggcagagcca ucuauugcuu acauuugcuu       60

cugacacaac uguguucacu agcaaccuca aacagacacc                            100


<210>  52
<211>  113
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 3-Prime-UTR DNA Sequence 

<400>  52
taggctggag cctcggtggc catgcttctt gccccttggg cctcccccca gcccctcctc       60

cccttcctgc acccgtaccc ccgtggtctt tgaataaagt ctgagtgggc ggc             113


<210>  53
<211>  113
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 3-Prime-UTR RNA Sequence 

<400>  53
uaggcuggag ccucgguggc caugcuucuu gccccuuggg ccucccccca gccccuccuc       60

cccuuccugc acccguaccc ccguggucuu ugaauaaagu cugagugggc ggc             113


<210>  54
<211>  132
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 3-Prime-UTR DNA Sequence 

<400>  54
gctcgctttc ttgctgtcca atttctatta aaggttcctt tgttccctaa gtccaactac       60

taaactgggg gatattatga agggccttga gcatctggat tctgcctaat aaaaaacatt      120

tattttcatt gc                                                          132


<210>  55
<211>  132
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 3-Prime-UTR RNA Sequence 

<400>  55
gcucgcuuuc uugcugucca auuucuauua aagguuccuu uguucccuaa guccaacuac       60

uaaacugggg gauauuauga agggccuuga gcaucuggau ucugccuaau aaaaaacauu      120

uauuuucauu gc                                                          132


<210>  56
<211>  278
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 3-Prime-UTR DNA Sequence 

<400>  56
ctggtactgc atgcacgcaa tgctagctgc ccctttcccg tcctgggtac cccgagtctc       60

ccccgacctc gggtcccagg tatgctccca cctccacctg ccccactcac cacctctgct      120

agttccagac acctcccaag cacgcagcaa tgcagctcaa aacgcttagc ctagccacac      180

ccccacggga aacagcagtg attaaccttt agcaataaac gaaagtttaa ctaagctata      240

ctaaccccag ggttggtcaa tttcgtgcca gccacacc                              278


<210>  57
<211>  278
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Exemplary 3-Prime-UTR RNA Sequence 

<400>  57
cugguacugc augcacgcaa ugcuagcugc cccuuucccg uccuggguac cccgagucuc       60

ccccgaccuc gggucccagg uaugcuccca ccuccaccug ccccacucac caccucugcu      120

aguuccagac accucccaag cacgcagcaa ugcagcucaa aacgcuuagc cuagccacac      180

ccccacggga aacagcagug auuaaccuuu agcaauaaac gaaaguuuaa cuaagcuaua      240

cuaaccccag gguuggucaa uuucgugcca gccacacc                              278


<210>  58
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  stem-loop sequence 

<400>  58
caaaggctct tttcagagcc acca                                              24


<210>  59
<211>  24
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  stem-loop sequence 

<400>  59
caaaggcucu uuucagagcc acca                                              24


