               Sequence listing

<110> JSC "BIOCAD"

<120> AAV5-based vaccine for induction of specific immunity to SARS-CoV-2 and/or prevention of SARS-CoV-2-related coronavirus infection

<150> RU2020142220
<151> 21-12-2020

<160> 19

<170> BiSSAP 1.3.6

<210> 1
<211> 278
<212> PRT
<213> Artificial sequence


<220> 
<223> isolated recombinant receptor-binding domain of glycoprotein S (RED-S) of 
      SARS-CoV-2 virus with amino acid substitution at position 272

<400> 1
Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn 
1               5                   10                  15      
Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val 
            20                  25                  30          
Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser 
        35                  40                  45              
Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val 
    50                  55                  60                  
Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp 
65                  70                  75                  80  
Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln 
                85                  90                  95      
Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr 
            100                 105                 110         
Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly 
        115                 120                 125             
Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys 
    130                 135                 140                 
Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr 
145                 150                 155                 160 
Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser 
                165                 170                 175     
Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val 
            180                 185                 190         
Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly 
        195                 200                 205             
Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn 
    210                 215                 220                 
Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys 
225                 230                 235                 240 
Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp 
                245                 250                 255     
Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Ser 
            260                 265                 270         
Ser Phe Gly Gly Val Ser 
        275             

<210> 2
<211> 834
<212> DNA
<213> Artificial sequence


<220> 
<223> Nucleic acid encoding the recombinant receptor-binding 
      domain of glycoprotein S (RED-S) of the SARS-CoV-2 virus

<400> 2
agagtccaac caacagaatc tattgttaga tttcctaata ttacaaactt gtgccctttt      60

ggtgaagttt ttaacgccac cagatttgca tctgtttatg cttggaacag gaagagaatc     120

agcaactgtg ttgctgatta ttctgtccta tataattccg catcattttc cacttttaag     180

tgttatggag tgtctcctac taaattaaat gatctctgct ttactaatgt ctatgcagat     240

tcatttgtaa ttagaggtga tgaagtcaga caaatcgctc cagggcaaac tggaaagatt     300

gctgattata attataaatt accagatgat tttacaggct gcgttatagc ttggaattct     360

aacaatcttg attctaaggt tggtggtaat tataattacc tgtatagatt gtttaggaag     420

tctaatctca aaccttttga gagagatatt tcaactgaaa tctatcaggc cggtagcaca     480

ccttgtaatg gtgttgaagg ttttaattgt tactttcctt tacaatcata tggtttccaa     540

cccactaatg gtgttggtta ccaaccatac agagtagtag tactttcttt tgaacttcta     600

catgcaccag caactgtttg tggacctaaa aagtctacta atttggttaa aaacaaatgt     660

gtcaatttca acttcaatgg tttaacaggc acaggtgttc ttactgagtc taacaaaaag     720

tttctgcctt tccaacaatt tggcagagac attgctgaca ctactgatgc tgtccgtgat     780

ccacagacac ttgagattct tgacattaca ccatcttctt ttggtggtgt cagt           834


<210> 3
<211> 2940
<212> DNA
<213> Artificial sequence


<220> 
<223> expression cassette with the gene of the recombinant 
      receptor-binding domain of glycoprotein S (RED-S) of the SARS-CoV-2 virus

<400> 3
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcgtcg ggcgaccttt      60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact     120

aggggttcct gcggccgcac gcgtctagtt attaatagta atcaattacg gggtcattag     180

ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct     240

gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc     300

caatagggac tttccattga cgtcaatggg tggagtattt acggtaaact gcccacttgg     360

cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat     420

ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca     480

tctacgtatt agtcatcgct attaccatgg tgatgcggtt ttggcagtac atcaatgggc     540

gtggatagcg gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga     600

gtttgttttg gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat     660

tgacgcaaat gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctcgtttag     720

tgaaccgtca gatcgcctgg agacgccatc cacgctgttt tgacctccat agaagacacc     780

gggaccgatc cagcctccgc ggattcgaat cccggccggg aacggtgcat tggaacgcgg     840

attccccgtg ccaagagtga cgtaagtacc gcctatagag tctataggcc cacaaaaaat     900

gctttcttct tttaatatac ttttttgttt atcttatttc taatactttc cctaatctct     960

ttctttcagg gcaataatga tacaatgtat catgcctctt tgcaccattc taaagaataa    1020

cagtgataat ttctgggtta aggcaatagc aatatttctg catataaata tttctgcata    1080

taaattgtaa ctgatgtaag aggtttcata ttgctaatag cagctacaat ccagctacca    1140

ttctgctttt attttatggt tgggataagg ctggattatt ctgagtccaa gctaggccct    1200

tttgctaatc atgttcatac ctcttatctt cctcccacag ctcctgggca acgtgctggt    1260

ctgtgtgctg gcccatcact ttggcaaaga attgggattc gaacatcgcg ataattagcc    1320

gccaccatgg agaccgacac cctgctgctg tgggtgctgc tgctgtgggt gcccgggtcg    1380

accgggagag tccaaccaac agaatctatt gttagatttc ctaatattac aaacttgtgc    1440

ccttttggtg aagtttttaa cgccaccaga tttgcatctg tttatgcttg gaacaggaag    1500

agaatcagca actgtgttgc tgattattct gtcctatata attccgcatc attttccact    1560

tttaagtgtt atggagtgtc tcctactaaa ttaaatgatc tctgctttac taatgtctat    1620

gcagattcat ttgtaattag aggtgatgaa gtcagacaaa tcgctccagg gcaaactgga    1680

aagattgctg attataatta taaattacca gatgatttta caggctgcgt tatagcttgg    1740

aattctaaca atcttgattc taaggttggt ggtaattata attacctgta tagattgttt    1800

aggaagtcta atctcaaacc ttttgagaga gatatttcaa ctgaaatcta tcaggccggt    1860

agcacacctt gtaatggtgt tgaaggtttt aattgttact ttcctttaca atcatatggt    1920

ttccaaccca ctaatggtgt tggttaccaa ccatacagag tagtagtact ttcttttgaa    1980

cttctacatg caccagcaac tgtttgtgga cctaaaaagt ctactaattt ggttaaaaac    2040

aaatgtgtca atttcaactt caatggttta acaggcacag gtgttcttac tgagtctaac    2100

aaaaagtttc tgcctttcca acaatttggc agagacattg ctgacactac tgatgctgtc    2160

cgtgatccac agacacttga gattcttgac attacaccat cttcttttgg tggtgtcagt    2220

taaggatcct ctagagtcga cctgcagaag cttgcctcga gcagcgctgc tcgagagatc    2280

tacgggtggc atccctgtga cccctcccca gtgcctctcc tggccctgga agttgccact    2340

ccagtgccca ccagccttgt cctaataaaa ttaagttgca tcattttgtc tgactaggtg    2400

tccttctata atattatggg gtggaggggg gtggtatgga gcaaggggca agttgggaag    2460

acaacctgta gggcctgcgg ggtctattgg gaaccaagct ggagtgcagt ggcacaatct    2520

tggctcactg caatctccgc ctcctgggtt caagcgattc tcctgcctca gcctcccgag    2580

ttgttgggat tccaggcatg catgaccagg ctcagctaat ttttgttttt ttggtagaga    2640

cggggtttca ccatattggc caggctggtc tccaactcct aatctcaggt gatctaccca    2700

ccttggcctc ccaaattgct gggattacag gcgtgaacca ctgctccctt ccctgtcctt    2760

ctgattttgt aggtaaccac gtgcggaccg agcggccgca ggaaccccta gtgatggagt    2820

tggccactcc ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca aaggtcgccc    2880

gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg agcgcgcagc tgcctgcagg    2940


<210> 4
<211> 724
<212> PRT
<213> Natural sequence


<220> 
<223> Natural sequence of the wild-type AAV5 capsid VP1 protein

<400> 4
Met Ser Phe Val Asp His Pro Pro Asp Trp Leu Glu Glu Val Gly Glu 
1               5                   10                  15      
Gly Leu Arg Glu Phe Leu Gly Leu Glu Ala Gly Pro Pro Lys Pro Lys 
            20                  25                  30          
Pro Asn Gln Gln His Gln Asp Gln Ala Arg Gly Leu Val Leu Pro Gly 
        35                  40                  45              
Tyr Asn Tyr Leu Gly Pro Gly Asn Gly Leu Asp Arg Gly Glu Pro Val 
    50                  55                  60                  
Asn Arg Ala Asp Glu Val Ala Arg Glu His Asp Ile Ser Tyr Asn Glu 
65                  70                  75                  80  
Gln Leu Glu Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 
                85                  90                  95      
Ala Glu Phe Gln Glu Lys Leu Ala Asp Asp Thr Ser Phe Gly Gly Asn 
            100                 105                 110         
Leu Gly Lys Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro Phe 
        115                 120                 125             
Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Thr Gly Lys Arg Ile 
    130                 135                 140                 
Asp Asp His Phe Pro Lys Arg Lys Lys Ala Arg Thr Glu Glu Asp Ser 
145                 150                 155                 160 
Lys Pro Ser Thr Ser Ser Asp Ala Glu Ala Gly Pro Ser Gly Ser Gln 
                165                 170                 175     
Gln Leu Gln Ile Pro Ala Gln Pro Ala Ser Ser Leu Gly Ala Asp Thr 
            180                 185                 190         
Met Ser Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gln Gly Ala 
        195                 200                 205             
Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys Asp Ser Thr Trp 
    210                 215                 220                 
Met Gly Asp Arg Val Val Thr Lys Ser Thr Arg Thr Trp Val Leu Pro 
225                 230                 235                 240 
Ser Tyr Asn Asn His Gln Tyr Arg Glu Ile Lys Ser Gly Ser Val Asp 
                245                 250                 255     
Gly Ser Asn Ala Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
            260                 265                 270         
Phe Asp Phe Asn Arg Phe His Ser His Trp Ser Pro Arg Asp Trp Gln 
        275                 280                 285             
Arg Leu Ile Asn Asn Tyr Trp Gly Phe Arg Pro Arg Ser Leu Arg Val 
    290                 295                 300                 
Lys Ile Phe Asn Ile Gln Val Lys Glu Val Thr Val Gln Asp Ser Thr 
305                 310                 315                 320 
Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
                325                 330                 335     
Asp Asp Tyr Gln Leu Pro Tyr Val Val Gly Asn Gly Thr Glu Gly Cys 
            340                 345                 350         
Leu Pro Ala Phe Pro Pro Gln Val Phe Thr Leu Pro Gln Tyr Gly Tyr 
        355                 360                 365             
Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arg Ser Ser 
    370                 375                 380                 
Phe Phe Cys Leu Glu Tyr Phe Pro Ser Lys Met Leu Arg Thr Gly Asn 
385                 390                 395                 400 
Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu Val Pro Phe His Ser Ser 
                405                 410                 415     
Phe Ala Pro Ser Gln Asn Leu Phe Lys Leu Ala Asn Pro Leu Val Asp 
            420                 425                 430         
Gln Tyr Leu Tyr Arg Phe Val Ser Thr Asn Asn Thr Gly Gly Val Gln 
        435                 440                 445             
Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 
    450                 455                 460                 
Phe Pro Gly Pro Met Gly Arg Thr Gln Gly Trp Asn Leu Gly Ser Gly 
465                 470                 475                 480 
Val Asn Arg Ala Ser Val Ser Ala Phe Ala Thr Thr Asn Arg Met Glu 
                485                 490                 495     
Leu Glu Gly Ala Ser Tyr Gln Val Pro Pro Gln Pro Asn Gly Met Thr 
            500                 505                 510         
Asn Asn Leu Gln Gly Ser Asn Thr Tyr Ala Leu Glu Asn Thr Met Ile 
        515                 520                 525             
Phe Asn Ser Gln Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 
    530                 535                 540                 
Gly Asn Met Leu Ile Thr Ser Glu Ser Glu Thr Gln Pro Val Asn Arg 
545                 550                 555                 560 
Val Ala Tyr Asn Val Gly Gly Gln Met Ala Thr Asn Asn Gln Ser Ser 
                565                 570                 575     
Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gln Glu Ile Val Pro 
            580                 585                 590         
Gly Ser Val Trp Met Glu Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
        595                 600                 605             
Ala Lys Ile Pro Glu Thr Gly Ala His Phe His Pro Ser Pro Ala Met 
    610                 615                 620                 
Gly Gly Phe Gly Leu Lys His Pro Pro Pro Met Met Leu Ile Lys Asn 
625                 630                 635                 640 
Thr Pro Val Pro Gly Asn Ile Thr Ser Phe Ser Asp Val Pro Val Ser 
                645                 650                 655     
Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Thr Val Glu Met Glu 
            660                 665                 670         
Trp Glu Leu Lys Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln 
        675                 680                 685             
Tyr Thr Asn Asn Tyr Asn Asp Pro Gln Phe Val Asp Phe Ala Pro Asp 
    690                 695                 700                 
Ser Thr Gly Glu Tyr Arg Thr Thr Arg Pro Ile Gly Thr Arg Tyr Leu 
705                 710                 715                 720 
Thr Arg Pro Leu 
                

<210> 5
<211> 724
<212> PRT
<213> Artificial sequence


<220> 
<223> isolated modified VP1 protein of AAV5 capsid, 
      which includes S2A and T711S substitutions

<400> 5
Met Ala Phe Val Asp His Pro Pro Asp Trp Leu Glu Glu Val Gly Glu 
1               5                   10                  15      
Gly Leu Arg Glu Phe Leu Gly Leu Glu Ala Gly Pro Pro Lys Pro Lys 
            20                  25                  30          
Pro Asn Gln Gln His Gln Asp Gln Ala Arg Gly Leu Val Leu Pro Gly 
        35                  40                  45              
Tyr Asn Tyr Leu Gly Pro Gly Asn Gly Leu Asp Arg Gly Glu Pro Val 
    50                  55                  60                  
Asn Arg Ala Asp Glu Val Ala Arg Glu His Asp Ile Ser Tyr Asn Glu 
65                  70                  75                  80  
Gln Leu Glu Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 
                85                  90                  95      
Ala Glu Phe Gln Glu Lys Leu Ala Asp Asp Thr Ser Phe Gly Gly Asn 
            100                 105                 110         
Leu Gly Lys Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro Phe 
        115                 120                 125             
Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Thr Gly Lys Arg Ile 
    130                 135                 140                 
Asp Asp His Phe Pro Lys Arg Lys Lys Ala Arg Thr Glu Glu Asp Ser 
145                 150                 155                 160 
Lys Pro Ser Thr Ser Ser Asp Ala Glu Ala Gly Pro Ser Gly Ser Gln 
                165                 170                 175     
Gln Leu Gln Ile Pro Ala Gln Pro Ala Ser Ser Leu Gly Ala Asp Thr 
            180                 185                 190         
Met Ser Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gln Gly Ala 
        195                 200                 205             
Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys Asp Ser Thr Trp 
    210                 215                 220                 
Met Gly Asp Arg Val Val Thr Lys Ser Thr Arg Thr Trp Val Leu Pro 
225                 230                 235                 240 
Ser Tyr Asn Asn His Gln Tyr Arg Glu Ile Lys Ser Gly Ser Val Asp 
                245                 250                 255     
Gly Ser Asn Ala Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
            260                 265                 270         
Phe Asp Phe Asn Arg Phe His Ser His Trp Ser Pro Arg Asp Trp Gln 
        275                 280                 285             
Arg Leu Ile Asn Asn Tyr Trp Gly Phe Arg Pro Arg Ser Leu Arg Val 
    290                 295                 300                 
Lys Ile Phe Asn Ile Gln Val Lys Glu Val Thr Val Gln Asp Ser Thr 
305                 310                 315                 320 
Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
                325                 330                 335     
Asp Asp Tyr Gln Leu Pro Tyr Val Val Gly Asn Gly Thr Glu Gly Cys 
            340                 345                 350         
Leu Pro Ala Phe Pro Pro Gln Val Phe Thr Leu Pro Gln Tyr Gly Tyr 
        355                 360                 365             
Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arg Ser Ser 
    370                 375                 380                 
Phe Phe Cys Leu Glu Tyr Phe Pro Ser Lys Met Leu Arg Thr Gly Asn 
385                 390                 395                 400 
Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu Val Pro Phe His Ser Ser 
                405                 410                 415     
Phe Ala Pro Ser Gln Asn Leu Phe Lys Leu Ala Asn Pro Leu Val Asp 
            420                 425                 430         
Gln Tyr Leu Tyr Arg Phe Val Ser Thr Asn Asn Thr Gly Gly Val Gln 
        435                 440                 445             
Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 
    450                 455                 460                 
Phe Pro Gly Pro Met Gly Arg Thr Gln Gly Trp Asn Leu Gly Ser Gly 
465                 470                 475                 480 
Val Asn Arg Ala Ser Val Ser Ala Phe Ala Thr Thr Asn Arg Met Glu 
                485                 490                 495     
Leu Glu Gly Ala Ser Tyr Gln Val Pro Pro Gln Pro Asn Gly Met Thr 
            500                 505                 510         
Asn Asn Leu Gln Gly Ser Asn Thr Tyr Ala Leu Glu Asn Thr Met Ile 
        515                 520                 525             
Phe Asn Ser Gln Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 
    530                 535                 540                 
Gly Asn Met Leu Ile Thr Ser Glu Ser Glu Thr Gln Pro Val Asn Arg 
545                 550                 555                 560 
Val Ala Tyr Asn Val Gly Gly Gln Met Ala Thr Asn Asn Gln Ser Ser 
                565                 570                 575     
Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gln Glu Ile Val Pro 
            580                 585                 590         
Gly Ser Val Trp Met Glu Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
        595                 600                 605             
Ala Lys Ile Pro Glu Thr Gly Ala His Phe His Pro Ser Pro Ala Met 
    610                 615                 620                 
Gly Gly Phe Gly Leu Lys His Pro Pro Pro Met Met Leu Ile Lys Asn 
625                 630                 635                 640 
Thr Pro Val Pro Gly Asn Ile Thr Ser Phe Ser Asp Val Pro Val Ser 
                645                 650                 655     
Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Thr Val Glu Met Glu 
            660                 665                 670         
Trp Glu Leu Lys Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln 
        675                 680                 685             
Tyr Thr Asn Asn Tyr Asn Asp Pro Gln Phe Val Asp Phe Ala Pro Asp 
    690                 695                 700                 
Ser Thr Gly Glu Tyr Arg Ser Thr Arg Pro Ile Gly Thr Arg Tyr Leu 
705                 710                 715                 720 
Thr Arg Pro Leu 
                

<210> 6
<211> 1273
<212> PRT
<213> Natural sequence


<220> 
<223> Natural sequence of full-size glycoprotein S of the SARS-CoV-2 virus

<400> 6
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 
Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 
Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
            340                 345                 350         
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
        355                 360                 365             
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
    370                 375                 380                 
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
385                 390                 395                 400 
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                405                 410                 415     
Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
            420                 425                 430         
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
        435                 440                 445             
Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
    450                 455                 460                 
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
465                 470                 475                 480 
Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                485                 490                 495     
Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
            500                 505                 510         
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
        515                 520                 525             
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
    530                 535                 540                 
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
545                 550                 555                 560 
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                565                 570                 575     
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
            580                 585                 590         
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
        595                 600                 605             
Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
    610                 615                 620                 
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
625                 630                 635                 640 
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                645                 650                 655     
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
            660                 665                 670         
Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala 
        675                 680                 685             
Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser 
    690                 695                 700                 
Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile 
705                 710                 715                 720 
Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val 
                725                 730                 735     
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu 
            740                 745                 750         
Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr 
        755                 760                 765             
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln 
    770                 775                 780                 
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe 
785                 790                 795                 800 
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser 
                805                 810                 815     
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly 
            820                 825                 830         
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp 
        835                 840                 845             
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu 
    850                 855                 860                 
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly 
865                 870                 875                 880 
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile 
                885                 890                 895     
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr 
            900                 905                 910         
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn 
        915                 920                 925             
Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala 
    930                 935                 940                 
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn 
945                 950                 955                 960 
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val 
                965                 970                 975     
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln 
            980                 985                 990         
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val 
        995                 1000                1005            
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu 
    1010                1015                1020                
Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val 
1025                1030                1035                1040
Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala 
                1045                1050                1055    
Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu 
            1060                1065                1070        
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His 
        1075                1080                1085            
Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val 
    1090                1095                1100                
Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr 
1105                1110                1115                1120
Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr 
                1125                1130                1135    
Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu 
            1140                1145                1150        
Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp 
        1155                1160                1165            
Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp 
    1170                1175                1180                
Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu 
1185                1190                1195                1200
Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile 
                1205                1210                1215    
Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile 
            1220                1225                1230        
Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys 
        1235                1240                1245            
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val 
    1250                1255                1260                
Leu Lys Gly Val Lys Leu His Tyr Thr 
1265                1270            

<210> 7
<211> 278
<212> PRT
<213> Artificial sequence


<220> 
<223> isolated receptor-binding domain of glycoprotein S (RBD-S) of the SARS-CoV-2 virus


<400> 7
Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn 
1               5                   10                  15      
Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val 
            20                  25                  30          
Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser 
        35                  40                  45              
Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val 
    50                  55                  60                  
Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp 
65                  70                  75                  80  
Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln 
                85                  90                  95      
Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr 
            100                 105                 110         
Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly 
        115                 120                 125             
Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys 
    130                 135                 140                 
Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr 
145                 150                 155                 160 
Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser 
                165                 170                 175     
Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val 
            180                 185                 190         
Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly 
        195                 200                 205             
Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn 
    210                 215                 220                 
Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys 
225                 230                 235                 240 
Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp 
                245                 250                 255     
Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys 
            260                 265                 270         
Ser Phe Gly Gly Val Ser 
        275             

<210> 8
<211> 130
<212> DNA
<213> Natural sequence


<220> 
<223> left (first) ITR (inverted end repeats)

<400> 8
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcgtcg ggcgaccttt     60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact    120

aggggttcct                                                           130


<210> 9
<211> 304
<212> DNA
<213> Natural sequence


<220> 
<223> CMV (cytomegalovirus) enhancer

<400> 9
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt     60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca    120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc    180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta    240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac    300

catg                                                                 304


<210> 10
<211> 204
<212> DNA
<213> Natural sequence


<220> 
<223> CMV (Cytomegalovirus) promoter

<400> 10
gtgatgcggt tttggcagta catcaatggg cgtggatagc ggtttgactc acggggattt     60

ccaagtctcc accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac    120

tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg    180

tgggaggtct atataagcag agct                                           204


<210> 11
<211> 493
<212> DNA
<213> Natural sequence


<220> 
<223> intron of the hmg1 gene (hemoglobin gamma-1 subunit gene)

<400> 11
cgaatcccgg ccgggaacgg tgcattggaa cgcggattcc ccgtgccaag agtgacgtaa      60

gtaccgccta tagagtctat aggcccacaa aaaatgcttt cttcttttaa tatacttttt     120

tgtttatctt atttctaata ctttccctaa tctctttctt tcagggcaat aatgatacaa     180

tgtatcatgc ctctttgcac cattctaaag aataacagtg ataatttctg ggttaaggca     240

atagcaatat ttctgcatat aaatatttct gcatataaat tgtaactgat gtaagaggtt     300

tcatattgct aatagcagct acaatccagc taccattctg cttttatttt atggttggga     360

taaggctgga ttattctgag tccaagctag gcccttttgc taatcatgtt catacctctt     420

atcttcctcc cacagctcct gggcaacgtg ctggtctgtg tgctggccca tcactttggc     480

aaagaattgg gat                                                        493


<210> 12
<211> 479
<212> DNA
<213> Natural sequence


<220> 
<223> hGH1 polyadenylation signal (human growth hormone gene polyadenylation signal)


<400> 12
acgggtggca tccctgtgac ccctccccag tgcctctcct ggccctggaa gttgccactc      60

cagtgcccac cagccttgtc ctaataaaat taagttgcat cattttgtct gactaggtgt     120

ccttctataa tattatgggg tggagggggg tggtatggag caaggggcaa gttgggaaga     180

caacctgtag ggcctgcggg gtctattggg aaccaagctg gagtgcagtg gcacaatctt     240

ggctcactgc aatctccgcc tcctgggttc aagcgattct cctgcctcag cctcccgagt     300

tgttgggatt ccaggcatgc atgaccaggc tcagctaatt tttgtttttt tggtagagac     360

ggggtttcac catattggcc aggctggtct ccaactccta atctcaggtg atctacccac     420

cttggcctcc caaattgctg ggattacagg cgtgaaccac tgctcccttc cctgtcctt      479


<210> 13
<211> 141
<212> DNA
<213> Natural sequence


<220> 
<223> right (second) ITR

<400> 13
aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg     60

ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc    120

gagcgcgcag ctgcctgcag g                                              141


<210> 14
<211> 588
<212> PRT
<213> Natural sequence


<220> 
<223> Natural sequence of the wild-type AAV5 capsid VP2 protein

<400> 14
Thr Ala Pro Thr Gly Lys Arg Ile Asp Asp His Phe Pro Lys Arg Lys 
1               5                   10                  15      
Lys Ala Arg Thr Glu Glu Asp Ser Lys Pro Ser Thr Ser Ser Asp Ala 
            20                  25                  30          
Glu Ala Gly Pro Ser Gly Ser Gln Gln Leu Gln Ile Pro Ala Gln Pro 
        35                  40                  45              
Ala Ser Ser Leu Gly Ala Asp Thr Met Ser Ala Gly Gly Gly Gly Pro 
    50                  55                  60                  
Leu Gly Asp Asn Asn Gln Gly Ala Asp Gly Val Gly Asn Ala Ser Gly 
65                  70                  75                  80  
Asp Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val Val Thr Lys 
                85                  90                  95      
Ser Thr Arg Thr Trp Val Leu Pro Ser Tyr Asn Asn His Gln Tyr Arg 
            100                 105                 110         
Glu Ile Lys Ser Gly Ser Val Asp Gly Ser Asn Ala Asn Ala Tyr Phe 
        115                 120                 125             
Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Ser 
    130                 135                 140                 
His Trp Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Tyr Trp Gly 
145                 150                 155                 160 
Phe Arg Pro Arg Ser Leu Arg Val Lys Ile Phe Asn Ile Gln Val Lys 
                165                 170                 175     
Glu Val Thr Val Gln Asp Ser Thr Thr Thr Ile Ala Asn Asn Leu Thr 
            180                 185                 190         
Ser Thr Val Gln Val Phe Thr Asp Asp Asp Tyr Gln Leu Pro Tyr Val 
        195                 200                 205             
Val Gly Asn Gly Thr Glu Gly Cys Leu Pro Ala Phe Pro Pro Gln Val 
    210                 215                 220                 
Phe Thr Leu Pro Gln Tyr Gly Tyr Ala Thr Leu Asn Arg Asp Asn Thr 
225                 230                 235                 240 
Glu Asn Pro Thr Glu Arg Ser Ser Phe Phe Cys Leu Glu Tyr Phe Pro 
                245                 250                 255     
Ser Lys Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Thr Tyr Asn Phe 
            260                 265                 270         
Glu Glu Val Pro Phe His Ser Ser Phe Ala Pro Ser Gln Asn Leu Phe 
        275                 280                 285             
Lys Leu Ala Asn Pro Leu Val Asp Gln Tyr Leu Tyr Arg Phe Val Ser 
    290                 295                 300                 
Thr Asn Asn Thr Gly Gly Val Gln Phe Asn Lys Asn Leu Ala Gly Arg 
305                 310                 315                 320 
Tyr Ala Asn Thr Tyr Lys Asn Trp Phe Pro Gly Pro Met Gly Arg Thr 
                325                 330                 335     
Gln Gly Trp Asn Leu Gly Ser Gly Val Asn Arg Ala Ser Val Ser Ala 
            340                 345                 350         
Phe Ala Thr Thr Asn Arg Met Glu Leu Glu Gly Ala Ser Tyr Gln Val 
        355                 360                 365             
Pro Pro Gln Pro Asn Gly Met Thr Asn Asn Leu Gln Gly Ser Asn Thr 
    370                 375                 380                 
Tyr Ala Leu Glu Asn Thr Met Ile Phe Asn Ser Gln Pro Ala Asn Pro 
385                 390                 395                 400 
Gly Thr Thr Ala Thr Tyr Leu Glu Gly Asn Met Leu Ile Thr Ser Glu 
                405                 410                 415     
Ser Glu Thr Gln Pro Val Asn Arg Val Ala Tyr Asn Val Gly Gly Gln 
            420                 425                 430         
Met Ala Thr Asn Asn Gln Ser Ser Thr Thr Ala Pro Ala Thr Gly Thr 
        435                 440                 445             
Tyr Asn Leu Gln Glu Ile Val Pro Gly Ser Val Trp Met Glu Arg Asp 
    450                 455                 460                 
Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro Glu Thr Gly Ala 
465                 470                 475                 480 
His Phe His Pro Ser Pro Ala Met Gly Gly Phe Gly Leu Lys His Pro 
                485                 490                 495     
Pro Pro Met Met Leu Ile Lys Asn Thr Pro Val Pro Gly Asn Ile Thr 
            500                 505                 510         
Ser Phe Ser Asp Val Pro Val Ser Ser Phe Ile Thr Gln Tyr Ser Thr 
        515                 520                 525             
Gly Gln Val Thr Val Glu Met Glu Trp Glu Leu Lys Lys Glu Asn Ser 
    530                 535                 540                 
Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Asn Asn Tyr Asn Asp Pro 
545                 550                 555                 560 
Gln Phe Val Asp Phe Ala Pro Asp Ser Thr Gly Glu Tyr Arg Thr Thr 
                565                 570                 575     
Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
            580                 585             

<210> 15
<211> 532
<212> PRT
<213> Natural sequence


<220> 
<223> Natural sequence of the wild-type AAV5 capsid VP3 protein

<400> 15
Met Ser Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gln Gly Ala 
1               5                   10                  15      
Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          
Met Gly Asp Arg Val Val Thr Lys Ser Thr Arg Thr Trp Val Leu Pro 
        35                  40                  45              
Ser Tyr Asn Asn His Gln Tyr Arg Glu Ile Lys Ser Gly Ser Val Asp 
    50                  55                  60                  
Gly Ser Asn Ala Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  
Phe Asp Phe Asn Arg Phe His Ser His Trp Ser Pro Arg Asp Trp Gln 
                85                  90                  95      
Arg Leu Ile Asn Asn Tyr Trp Gly Phe Arg Pro Arg Ser Leu Arg Val 
            100                 105                 110         
Lys Ile Phe Asn Ile Gln Val Lys Glu Val Thr Val Gln Asp Ser Thr 
        115                 120                 125             
Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 
Asp Asp Tyr Gln Leu Pro Tyr Val Val Gly Asn Gly Thr Glu Gly Cys 
145                 150                 155                 160 
Leu Pro Ala Phe Pro Pro Gln Val Phe Thr Leu Pro Gln Tyr Gly Tyr 
                165                 170                 175     
Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arg Ser Ser 
            180                 185                 190         
Phe Phe Cys Leu Glu Tyr Phe Pro Ser Lys Met Leu Arg Thr Gly Asn 
        195                 200                 205             
Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu Val Pro Phe His Ser Ser 
    210                 215                 220                 
Phe Ala Pro Ser Gln Asn Leu Phe Lys Leu Ala Asn Pro Leu Val Asp 
225                 230                 235                 240 
Gln Tyr Leu Tyr Arg Phe Val Ser Thr Asn Asn Thr Gly Gly Val Gln 
                245                 250                 255     
Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 
            260                 265                 270         
Phe Pro Gly Pro Met Gly Arg Thr Gln Gly Trp Asn Leu Gly Ser Gly 
        275                 280                 285             
Val Asn Arg Ala Ser Val Ser Ala Phe Ala Thr Thr Asn Arg Met Glu 
    290                 295                 300                 
Leu Glu Gly Ala Ser Tyr Gln Val Pro Pro Gln Pro Asn Gly Met Thr 
305                 310                 315                 320 
Asn Asn Leu Gln Gly Ser Asn Thr Tyr Ala Leu Glu Asn Thr Met Ile 
                325                 330                 335     
Phe Asn Ser Gln Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 
            340                 345                 350         
Gly Asn Met Leu Ile Thr Ser Glu Ser Glu Thr Gln Pro Val Asn Arg 
        355                 360                 365             
Val Ala Tyr Asn Val Gly Gly Gln Met Ala Thr Asn Asn Gln Ser Ser 
    370                 375                 380                 
Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gln Glu Ile Val Pro 
385                 390                 395                 400 
Gly Ser Val Trp Met Glu Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     
Ala Lys Ile Pro Glu Thr Gly Ala His Phe His Pro Ser Pro Ala Met 
            420                 425                 430         
Gly Gly Phe Gly Leu Lys His Pro Pro Pro Met Met Leu Ile Lys Asn 
        435                 440                 445             
Thr Pro Val Pro Gly Asn Ile Thr Ser Phe Ser Asp Val Pro Val Ser 
    450                 455                 460                 
Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Thr Val Glu Met Glu 
465                 470                 475                 480 
Trp Glu Leu Lys Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln 
                485                 490                 495     
Tyr Thr Asn Asn Tyr Asn Asp Pro Gln Phe Val Asp Phe Ala Pro Asp 
            500                 505                 510         
Ser Thr Gly Glu Tyr Arg Thr Thr Arg Pro Ile Gly Thr Arg Tyr Leu 
        515                 520                 525             
Thr Arg Pro Leu 
    530         

<210> 16
<211> 588
<212> PRT
<213> Artificial sequence


<220> 
<223> isolated modified VP2 protein of AAV5 capsid, which includes the replacement of T575S

<400> 16
Thr Ala Pro Thr Gly Lys Arg Ile Asp Asp His Phe Pro Lys Arg Lys 
1               5                   10                  15      
Lys Ala Arg Thr Glu Glu Asp Ser Lys Pro Ser Thr Ser Ser Asp Ala 
            20                  25                  30          
Glu Ala Gly Pro Ser Gly Ser Gln Gln Leu Gln Ile Pro Ala Gln Pro 
        35                  40                  45              
Ala Ser Ser Leu Gly Ala Asp Thr Met Ser Ala Gly Gly Gly Gly Pro 
    50                  55                  60                  
Leu Gly Asp Asn Asn Gln Gly Ala Asp Gly Val Gly Asn Ala Ser Gly 
65                  70                  75                  80  
Asp Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val Val Thr Lys 
                85                  90                  95      
Ser Thr Arg Thr Trp Val Leu Pro Ser Tyr Asn Asn His Gln Tyr Arg 
            100                 105                 110         
Glu Ile Lys Ser Gly Ser Val Asp Gly Ser Asn Ala Asn Ala Tyr Phe 
        115                 120                 125             
Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Ser 
    130                 135                 140                 
His Trp Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Tyr Trp Gly 
145                 150                 155                 160 
Phe Arg Pro Arg Ser Leu Arg Val Lys Ile Phe Asn Ile Gln Val Lys 
                165                 170                 175     
Glu Val Thr Val Gln Asp Ser Thr Thr Thr Ile Ala Asn Asn Leu Thr 
            180                 185                 190         
Ser Thr Val Gln Val Phe Thr Asp Asp Asp Tyr Gln Leu Pro Tyr Val 
        195                 200                 205             
Val Gly Asn Gly Thr Glu Gly Cys Leu Pro Ala Phe Pro Pro Gln Val 
    210                 215                 220                 
Phe Thr Leu Pro Gln Tyr Gly Tyr Ala Thr Leu Asn Arg Asp Asn Thr 
225                 230                 235                 240 
Glu Asn Pro Thr Glu Arg Ser Ser Phe Phe Cys Leu Glu Tyr Phe Pro 
                245                 250                 255     
Ser Lys Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Thr Tyr Asn Phe 
            260                 265                 270         
Glu Glu Val Pro Phe His Ser Ser Phe Ala Pro Ser Gln Asn Leu Phe 
        275                 280                 285             
Lys Leu Ala Asn Pro Leu Val Asp Gln Tyr Leu Tyr Arg Phe Val Ser 
    290                 295                 300                 
Thr Asn Asn Thr Gly Gly Val Gln Phe Asn Lys Asn Leu Ala Gly Arg 
305                 310                 315                 320 
Tyr Ala Asn Thr Tyr Lys Asn Trp Phe Pro Gly Pro Met Gly Arg Thr 
                325                 330                 335     
Gln Gly Trp Asn Leu Gly Ser Gly Val Asn Arg Ala Ser Val Ser Ala 
            340                 345                 350         
Phe Ala Thr Thr Asn Arg Met Glu Leu Glu Gly Ala Ser Tyr Gln Val 
        355                 360                 365             
Pro Pro Gln Pro Asn Gly Met Thr Asn Asn Leu Gln Gly Ser Asn Thr 
    370                 375                 380                 
Tyr Ala Leu Glu Asn Thr Met Ile Phe Asn Ser Gln Pro Ala Asn Pro 
385                 390                 395                 400 
Gly Thr Thr Ala Thr Tyr Leu Glu Gly Asn Met Leu Ile Thr Ser Glu 
                405                 410                 415     
Ser Glu Thr Gln Pro Val Asn Arg Val Ala Tyr Asn Val Gly Gly Gln 
            420                 425                 430         
Met Ala Thr Asn Asn Gln Ser Ser Thr Thr Ala Pro Ala Thr Gly Thr 
        435                 440                 445             
Tyr Asn Leu Gln Glu Ile Val Pro Gly Ser Val Trp Met Glu Arg Asp 
    450                 455                 460                 
Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro Glu Thr Gly Ala 
465                 470                 475                 480 
His Phe His Pro Ser Pro Ala Met Gly Gly Phe Gly Leu Lys His Pro 
                485                 490                 495     
Pro Pro Met Met Leu Ile Lys Asn Thr Pro Val Pro Gly Asn Ile Thr 
            500                 505                 510         
Ser Phe Ser Asp Val Pro Val Ser Ser Phe Ile Thr Gln Tyr Ser Thr 
        515                 520                 525             
Gly Gln Val Thr Val Glu Met Glu Trp Glu Leu Lys Lys Glu Asn Ser 
    530                 535                 540                 
Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Asn Asn Tyr Asn Asp Pro 
545                 550                 555                 560 
Gln Phe Val Asp Phe Ala Pro Asp Ser Thr Gly Glu Tyr Arg Ser Thr 
                565                 570                 575     
Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
            580                 585             

<210> 17
<211> 532
<212> PRT
<213> Artificial sequence


<220> 
<223> isolated modified VP3 protein of the AAV5 capsid, which includes the replacement of T519S

<400> 17
Met Ser Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gln Gly Ala 
1               5                   10                  15      
Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          
Met Gly Asp Arg Val Val Thr Lys Ser Thr Arg Thr Trp Val Leu Pro 
        35                  40                  45              
Ser Tyr Asn Asn His Gln Tyr Arg Glu Ile Lys Ser Gly Ser Val Asp 
    50                  55                  60                  
Gly Ser Asn Ala Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  
Phe Asp Phe Asn Arg Phe His Ser His Trp Ser Pro Arg Asp Trp Gln 
                85                  90                  95      
Arg Leu Ile Asn Asn Tyr Trp Gly Phe Arg Pro Arg Ser Leu Arg Val 
            100                 105                 110         
Lys Ile Phe Asn Ile Gln Val Lys Glu Val Thr Val Gln Asp Ser Thr 
        115                 120                 125             
Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 
Asp Asp Tyr Gln Leu Pro Tyr Val Val Gly Asn Gly Thr Glu Gly Cys 
145                 150                 155                 160 
Leu Pro Ala Phe Pro Pro Gln Val Phe Thr Leu Pro Gln Tyr Gly Tyr 
                165                 170                 175     
Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arg Ser Ser 
            180                 185                 190         
Phe Phe Cys Leu Glu Tyr Phe Pro Ser Lys Met Leu Arg Thr Gly Asn 
        195                 200                 205             
Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu Val Pro Phe His Ser Ser 
    210                 215                 220                 
Phe Ala Pro Ser Gln Asn Leu Phe Lys Leu Ala Asn Pro Leu Val Asp 
225                 230                 235                 240 
Gln Tyr Leu Tyr Arg Phe Val Ser Thr Asn Asn Thr Gly Gly Val Gln 
                245                 250                 255     
Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 
            260                 265                 270         
Phe Pro Gly Pro Met Gly Arg Thr Gln Gly Trp Asn Leu Gly Ser Gly 
        275                 280                 285             
Val Asn Arg Ala Ser Val Ser Ala Phe Ala Thr Thr Asn Arg Met Glu 
    290                 295                 300                 
Leu Glu Gly Ala Ser Tyr Gln Val Pro Pro Gln Pro Asn Gly Met Thr 
305                 310                 315                 320 
Asn Asn Leu Gln Gly Ser Asn Thr Tyr Ala Leu Glu Asn Thr Met Ile 
                325                 330                 335     
Phe Asn Ser Gln Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 
            340                 345                 350         
Gly Asn Met Leu Ile Thr Ser Glu Ser Glu Thr Gln Pro Val Asn Arg 
        355                 360                 365             
Val Ala Tyr Asn Val Gly Gly Gln Met Ala Thr Asn Asn Gln Ser Ser 
    370                 375                 380                 
Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gln Glu Ile Val Pro 
385                 390                 395                 400 
Gly Ser Val Trp Met Glu Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     
Ala Lys Ile Pro Glu Thr Gly Ala His Phe His Pro Ser Pro Ala Met 
            420                 425                 430         
Gly Gly Phe Gly Leu Lys His Pro Pro Pro Met Met Leu Ile Lys Asn 
        435                 440                 445             
Thr Pro Val Pro Gly Asn Ile Thr Ser Phe Ser Asp Val Pro Val Ser 
    450                 455                 460                 
Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Thr Val Glu Met Glu 
465                 470                 475                 480 
Trp Glu Leu Lys Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln 
                485                 490                 495     
Tyr Thr Asn Asn Tyr Asn Asp Pro Gln Phe Val Asp Phe Ala Pro Asp 
            500                 505                 510         
Ser Thr Gly Glu Tyr Arg Ser Thr Arg Pro Ile Gly Thr Arg Tyr Leu 
        515                 520                 525             
Thr Arg Pro Leu 
    530         

<210> 18
<211> 834
<212> DNA
<213> Artificial sequence


<220> 
<223> Codon is an optimized nucleic acid encoding the recombinant 
      receptor-binding domain of glycoprotein S (RBD-SCO) of the SARS-CoV-2 virus

<400> 18
agagtgcagc ctaccgagag catcgtgaga ttccccaaca tcaccaacct gtgcccattc      60

ggagaggtgt tcaacgccac tagatttgcc agcgtgtatg cctggaatag gaagaggatc     120

tccaattgtg tggccgacta ctccgtgctg tataattccg cctcctttag caccttcaag     180

tgttatggcg tgtcccccac aaagctgaat gacctgtgct tcaccaacgt gtacgccgat     240

tccttcgtga ttagaggcga cgaggtgagg cagattgcac caggacagac tggcaagatt     300

gccgactaca actacaagct gcccgatgat ttcacaggct gtgtgatcgc ctggaacagc     360

aataacctgg acagcaaagt gggaggcaac tacaactacc tgtacagact gttcaggaag     420

tccaatctga agcctttcga gagagacatc agcaccgaga tctaccaggc cggctcaaca     480

ccatgtaatg gagtggaggg ctttaactgt tacttccccc tgcagtctta cggcttccag     540

cccactaatg gcgtgggata tcagccctat agagtggtgg tgctgagctt tgagctgctg     600

catgctccag ctaccgtgtg tggccctaag aagagcacca atctggtgaa gaataagtgc     660

gtgaacttca acttcaacgg cctgaccggc acaggagtgc tgacagaaag caataagaag     720

ttcctgccct tccagcagtt cggcagagat attgccgaca caaccgatgc cgtgagggac     780

ccacagactc tggagatcct ggatattaca cctagcagct ttgggggcgt gtcc           834


<210> 19
<211> 2936
<212> DNA
<213> Artificial sequence


<220> 
<223> expression cassette with codon-optimized gene of recombinant 
      receptor-binding domain of glycoprotein S (RBD-SCO) of SARS-CoV-2 virus

<400> 19
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcgtcg ggcgaccttt      60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact     120

aggggttcct gcggccgcac gcgtctagtt attaatagta atcaattacg gggtcattag     180

ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct     240

gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc     300

caatagggac tttccattga cgtcaatggg tggagtattt acggtaaact gcccacttgg     360

cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat     420

ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca     480

tctacgtatt agtcatcgct attaccatgg tgatgcggtt ttggcagtac atcaatgggc     540

gtggatagcg gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga     600

gtttgttttg gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat     660

tgacgcaaat gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctcgtttag     720

tgaaccgtca gatcgcctgg agacgccatc cacgctgttt tgacctccat agaagacacc     780

gggaccgatc cagcctccgc ggattcgaat cccggccggg aacggtgcat tggaacgcgg     840

attccccgtg ccaagagtga cgtaagtacc gcctatagag tctataggcc cacaaaaaat     900

gctttcttct tttaatatac ttttttgttt atcttatttc taatactttc cctaatctct     960

ttctttcagg gcaataatga tacaatgtat catgcctctt tgcaccattc taaagaataa    1020

cagtgataat ttctgggtta aggcaatagc aatatttctg catataaata tttctgcata    1080

taaattgtaa ctgatgtaag aggtttcata ttgctaatag cagctacaat ccagctacca    1140

ttctgctttt attttatggt tgggataagg ctggattatt ctgagtccaa gctaggccct    1200

tttgctaatc atgttcatac ctcttatctt cctcccacag ctcctgggca acgtgctggt    1260

ctgtgtgctg gcccatcact ttggcaaaga attgggattc gaacatcgat tgagccacca    1320

tggagaccga caccctgctg ctgtgggtgc tgctgctgtg ggtgcccggg tcgaccggga    1380

gagtgcagcc taccgagagc atcgtgagat tccccaacat caccaacctg tgcccattcg    1440

gagaggtgtt caacgccact agatttgcca gcgtgtatgc ctggaatagg aagaggatct    1500

ccaattgtgt ggccgactac tccgtgctgt ataattccgc ctcctttagc accttcaagt    1560

gttatggcgt gtcccccaca aagctgaatg acctgtgctt caccaacgtg tacgccgatt    1620

ccttcgtgat tagaggcgac gaggtgaggc agattgcacc aggacagact ggcaagattg    1680

ccgactacaa ctacaagctg cccgatgatt tcacaggctg tgtgatcgcc tggaacagca    1740

ataacctgga cagcaaagtg ggaggcaact acaactacct gtacagactg ttcaggaagt    1800

ccaatctgaa gcctttcgag agagacatca gcaccgagat ctaccaggcc ggctcaacac    1860

catgtaatgg agtggagggc tttaactgtt acttccccct gcagtcttac ggcttccagc    1920

ccactaatgg cgtgggatat cagccctata gagtggtggt gctgagcttt gagctgctgc    1980

atgctccagc taccgtgtgt ggccctaaga agagcaccaa tctggtgaag aataagtgcg    2040

tgaacttcaa cttcaacggc ctgaccggca caggagtgct gacagaaagc aataagaagt    2100

tcctgccctt ccagcagttc ggcagagata ttgccgacac aaccgatgcc gtgagggacc    2160

cacagactct ggagatcctg gatattacac ctagcagctt tgggggcgtg tcctaatagg    2220

gatcctctag agtcgacctg cagaagcttg cctcgagcag cgctgctcga gagatctacg    2280

ggtggcatcc ctgtgacccc tccccagtgc ctctcctggc cctggaagtt gccactccag    2340

tgcccaccag ccttgtccta ataaaattaa gttgcatcat tttgtctgac taggtgtcct    2400

tctataatat tatggggtgg aggggggtgg tatggagcaa ggggcaagtt gggaagacaa    2460

cctgtagggc ctgcggggtc tattgggaac caagctggag tgcagtggca caatcttggc    2520

tcactgcaat ctccgcctcc tgggttcaag cgattctcct gcctcagcct cccgagttgt    2580

tgggattcca ggcatgcatg accaggctca gctaattttt gtttttttgg tagagacggg    2640

gtttcaccat attggccagg ctggtctcca actcctaatc tcaggtgatc tacccacctt    2700

ggcctcccaa attgctggga ttacaggcgt gaaccactgc tcccttccct gtccttctga    2760

ttttgtaggt aaccacgtgc ggaccgagcg gccgcaggaa cccctagtga tggagttggc    2820

cactccctct ctgcgcgctc gctcgctcac tgaggccggg cgaccaaagg tcgcccgacg    2880

cccgggcttt gcccgggcgg cctcagtgag cgagcgagcg cgcagctgcc tgcagg        2936


