                         SEQUENCE LISTING

<110>  University of Cape Town
 
<120>  INTEGRATED MOLECULAR AND GLYCO-ENGINEERING OF COMPLEX VIRAL 
       GLYCOPROTEINS

<130>  PA176489/P

<160>  45    

<170>  PatentIn version 3.5

<210>  1
<211>  1266
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Modified Human Calreticulin

<400>  1
accggtatgc tgctgagcgt gcctctgctg ctggggctgc tggggctggc cgtcgccgaa       60

cctgctgtct acttcaagga acagtttctg gatggcgacg gatggaccag ccggtggatc      120

gagagcaagc acaagtccga tttcggcaag tttgtgctga gctccggcaa gttctatggc      180

gatgaggaga aggacaaggg cctgcagaca agccaggacg cccggttcta cgccctgtct      240

gccagcttcg agccattttc caacaagggc cagaccctgg tggtgcagtt cacagtgaag      300

cacgagcaga acatcgattg cggcggcggc tatgtgaagc tgtttcccaa ttccctggat      360

cagaccgaca tgcacggcga ctctgagtac aacatcatgt tcggcccaga tatctgcggc      420

cccggcacaa agaaggtgca cgtgatcttt aattataagg gcaagaacgt gctgatcaat      480

aaggacatca ggtgtaagga cgatgagttc acccacctgt acacactgat cgtgcgccct      540

gacaacacct atgaggtgaa gatcgataat tcccaggtgg agtccggctc tctggaggac      600

gattgggatt ttctgccccc taagaagatc aaggaccctg atgcctctaa gccagaggac      660

tgggatgaga gggccaagat cgacgatccc acagacagca agcctgagga ctgggataag      720

cccgagcaca tccctgaccc agatgccaag aagcccgaag actgggatga ggagatggat      780

ggcgagtggg agccacccgt gatccagaac cccgagtaca agggcgagtg gaagcctaga      840

cagatcgata atccagacta taagggcacc tggatccacc cagagatcga taaccccgag      900

tactctcccg accctagcat ctacgcctat gataatttcg gcgtgctggg cctggacctg      960

tggcaggtga agtctggcac catcttcgac aactttctga tcacaaatga tgaggcctac     1020

gccgaggagt ttggcaatga gacatggggc gtgacaaagg ccgccgagaa gcagatgaag     1080

gataagcagg acgaggagca gcggctgaag gaagaggagg aggacaagaa gagaaaggag     1140

gaggaggagg ccgaggataa ggaggacgat gaggacaagg atgaggacga ggaggacgag     1200

gaggataagg aggaagatga agaggaggat gtcccagggc aggcaaaaga tgaactgtga     1260

ctcgag                                                                1266


<210>  2
<211>  417
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Modified Human Calreticulin

<400>  2

Met Leu Leu Ser Val Pro Leu Leu Leu Gly Leu Leu Gly Leu Ala Val 
1               5                   10                  15      


Ala Glu Pro Ala Val Tyr Phe Lys Glu Gln Phe Leu Asp Gly Asp Gly 
            20                  25                  30          


Trp Thr Ser Arg Trp Ile Glu Ser Lys His Lys Ser Asp Phe Gly Lys 
        35                  40                  45              


Phe Val Leu Ser Ser Gly Lys Phe Tyr Gly Asp Glu Glu Lys Asp Lys 
    50                  55                  60                  


Gly Leu Gln Thr Ser Gln Asp Ala Arg Phe Tyr Ala Leu Ser Ala Ser 
65                  70                  75                  80  


Phe Glu Pro Phe Ser Asn Lys Gly Gln Thr Leu Val Val Gln Phe Thr 
                85                  90                  95      


Val Lys His Glu Gln Asn Ile Asp Cys Gly Gly Gly Tyr Val Lys Leu 
            100                 105                 110         


Phe Pro Asn Ser Leu Asp Gln Thr Asp Met His Gly Asp Ser Glu Tyr 
        115                 120                 125             


Asn Ile Met Phe Gly Pro Asp Ile Cys Gly Pro Gly Thr Lys Lys Val 
    130                 135                 140                 


His Val Ile Phe Asn Tyr Lys Gly Lys Asn Val Leu Ile Asn Lys Asp 
145                 150                 155                 160 


Ile Arg Cys Lys Asp Asp Glu Phe Thr His Leu Tyr Thr Leu Ile Val 
                165                 170                 175     


Arg Pro Asp Asn Thr Tyr Glu Val Lys Ile Asp Asn Ser Gln Val Glu 
            180                 185                 190         


Ser Gly Ser Leu Glu Asp Asp Trp Asp Phe Leu Pro Pro Lys Lys Ile 
        195                 200                 205             


Lys Asp Pro Asp Ala Ser Lys Pro Glu Asp Trp Asp Glu Arg Ala Lys 
    210                 215                 220                 


Ile Asp Asp Pro Thr Asp Ser Lys Pro Glu Asp Trp Asp Lys Pro Glu 
225                 230                 235                 240 


His Ile Pro Asp Pro Asp Ala Lys Lys Pro Glu Asp Trp Asp Glu Glu 
                245                 250                 255     


Met Asp Gly Glu Trp Glu Pro Pro Val Ile Gln Asn Pro Glu Tyr Lys 
            260                 265                 270         


Gly Glu Trp Lys Pro Arg Gln Ile Asp Asn Pro Asp Tyr Lys Gly Thr 
        275                 280                 285             


Trp Ile His Pro Glu Ile Asp Asn Pro Glu Tyr Ser Pro Asp Pro Ser 
    290                 295                 300                 


Ile Tyr Ala Tyr Asp Asn Phe Gly Val Leu Gly Leu Asp Leu Trp Gln 
305                 310                 315                 320 


Val Lys Ser Gly Thr Ile Phe Asp Asn Phe Leu Ile Thr Asn Asp Glu 
                325                 330                 335     


Ala Tyr Ala Glu Glu Phe Gly Asn Glu Thr Trp Gly Val Thr Lys Ala 
            340                 345                 350         


Ala Glu Lys Gln Met Lys Asp Lys Gln Asp Glu Glu Gln Arg Leu Lys 
        355                 360                 365             


Glu Glu Glu Glu Asp Lys Lys Arg Lys Glu Glu Glu Glu Ala Glu Asp 
    370                 375                 380                 


Lys Glu Asp Asp Glu Asp Lys Asp Glu Asp Glu Glu Asp Glu Glu Asp 
385                 390                 395                 400 


Lys Glu Glu Asp Glu Glu Glu Asp Val Pro Gly Gln Ala Lys Asp Glu 
                405                 410                 415     


Leu 
    


<210>  3
<211>  1791
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Modified Human Calnexin

<400>  3
accggtatgg aggggaaatg gctgctgtgc atgctgctgg tgctgggaac tgctattgtg       60

gaggctcacg acggacacga tgacgatgtg attgacatcg aggacgatct ggacgatgtg      120

atcgaggagg tggaggacag caagcctgat accacagccc cccctagctc cccaaaggtg      180

acctacaagg cccccgtgcc tacaggcgag gtgtatttcg ccgactcctt tgataggggc      240

accctgagcg gatggatcct gtccaaggcc aagaaggacg atacagacga tgagatcgcc      300

aagtacgacg gcaagtggga agtggaggag atgaaggagt ctaagctgcc tggcgataag      360

ggcctggtgc tgatgtcccg ggccaagcac cacgccatct ctgccaagct gaataagcca      420

ttcctgtttg acaccaagcc cctgatcgtg cagtacgagg tgaacttcca gaatggcatc      480

gagtgcggcg gcgcctatgt gaagctgctg tccaagacac cagagctgaa tctggaccag      540

ttccacgata agacccctta cacaatcatg tttggcccag acaagtgtgg cgaggattat      600

aagctgcact tcatctttag acacaagaac ccaaagaccg gcatctatga ggagaagcac      660

gccaagaggc ccgacgccga tctgaagacc tacttcacag acaagaagac ccacctgtat      720

acactgatcc tgaacccaga caattctttt gagatcctgg tggatcagtc cgtggtgaac      780

tctggcaatc tgctgaacga tatgacccca cccgtgaatc ccagcaggga gatcgaggac      840

cccgaggatc gcaagcctga ggactgggat gagcggccca agatcccaga cccagaggca      900

gtgaagcctg acgattggga cgaggatgcc cctgccaaga tcccagatga ggaggccaca      960

aagcccgagg gctggctgga cgatgagcct gagtacgtgc ctgacccaga tgccgagaag     1020

cccgaggact gggatgagga catggatggc gagtgggagg ccccacagat cgcaaaccca     1080

agatgcgaga gcgcccctgg atgtggcgtg tggcagaggc ctgtgatcga caacccaaat     1140

tacaagggca agtggaagcc tccaatgatc gataatccat cctatcaggg catctggaag     1200

ccccgcaaga tccccaaccc tgacttcttt gaggatctgg agcccttccg gatgacccct     1260

ttttctgcca tcggcctgga gctgtggtct atgacaagcg acatcttctt tgataacttc     1320

atcatctgcg ccgaccggag aatcgtggac gattgggcca acgacggatg gggcctgaag     1380

aaggcagcag atggagcagc agagccagga gtggtgggac agatgatcga ggcagcagag     1440

gagcggccct ggctgtgggt ggtgtacatc ctgaccgtgg ccctgcccgt gttcctggtc     1500

atcctgttct gctgttctgg caagaagcag accagcggca tggagtataa gaagacagac     1560

gccccacagc ccgatgtgaa agaggaggag gaggagaagg aggaggagaa ggacaagggc     1620

gatgaggagg aggagggcga ggagaagctg gaggagaagc agaagagcga cgccgaggag     1680

gatggcggca cagtgtccca ggaggaggag gaccggaagc ctaaggcaga agaagacgaa     1740

atcctgaatc ggtcaccaag aaatagaaaa ccacggaggg aatgactcga g              1791


<210>  4
<211>  592
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Modified Human Calnexin

<400>  4

Met Glu Gly Lys Trp Leu Leu Cys Met Leu Leu Val Leu Gly Thr Ala 
1               5                   10                  15      


Ile Val Glu Ala His Asp Gly His Asp Asp Asp Val Ile Asp Ile Glu 
            20                  25                  30          


Asp Asp Leu Asp Asp Val Ile Glu Glu Val Glu Asp Ser Lys Pro Asp 
        35                  40                  45              


Thr Thr Ala Pro Pro Ser Ser Pro Lys Val Thr Tyr Lys Ala Pro Val 
    50                  55                  60                  


Pro Thr Gly Glu Val Tyr Phe Ala Asp Ser Phe Asp Arg Gly Thr Leu 
65                  70                  75                  80  


Ser Gly Trp Ile Leu Ser Lys Ala Lys Lys Asp Asp Thr Asp Asp Glu 
                85                  90                  95      


Ile Ala Lys Tyr Asp Gly Lys Trp Glu Val Glu Glu Met Lys Glu Ser 
            100                 105                 110         


Lys Leu Pro Gly Asp Lys Gly Leu Val Leu Met Ser Arg Ala Lys His 
        115                 120                 125             


His Ala Ile Ser Ala Lys Leu Asn Lys Pro Phe Leu Phe Asp Thr Lys 
    130                 135                 140                 


Pro Leu Ile Val Gln Tyr Glu Val Asn Phe Gln Asn Gly Ile Glu Cys 
145                 150                 155                 160 


Gly Gly Ala Tyr Val Lys Leu Leu Ser Lys Thr Pro Glu Leu Asn Leu 
                165                 170                 175     


Asp Gln Phe His Asp Lys Thr Pro Tyr Thr Ile Met Phe Gly Pro Asp 
            180                 185                 190         


Lys Cys Gly Glu Asp Tyr Lys Leu His Phe Ile Phe Arg His Lys Asn 
        195                 200                 205             


Pro Lys Thr Gly Ile Tyr Glu Glu Lys His Ala Lys Arg Pro Asp Ala 
    210                 215                 220                 


Asp Leu Lys Thr Tyr Phe Thr Asp Lys Lys Thr His Leu Tyr Thr Leu 
225                 230                 235                 240 


Ile Leu Asn Pro Asp Asn Ser Phe Glu Ile Leu Val Asp Gln Ser Val 
                245                 250                 255     


Val Asn Ser Gly Asn Leu Leu Asn Asp Met Thr Pro Pro Val Asn Pro 
            260                 265                 270         


Ser Arg Glu Ile Glu Asp Pro Glu Asp Arg Lys Pro Glu Asp Trp Asp 
        275                 280                 285             


Glu Arg Pro Lys Ile Pro Asp Pro Glu Ala Val Lys Pro Asp Asp Trp 
    290                 295                 300                 


Asp Glu Asp Ala Pro Ala Lys Ile Pro Asp Glu Glu Ala Thr Lys Pro 
305                 310                 315                 320 


Glu Gly Trp Leu Asp Asp Glu Pro Glu Tyr Val Pro Asp Pro Asp Ala 
                325                 330                 335     


Glu Lys Pro Glu Asp Trp Asp Glu Asp Met Asp Gly Glu Trp Glu Ala 
            340                 345                 350         


Pro Gln Ile Ala Asn Pro Arg Cys Glu Ser Ala Pro Gly Cys Gly Val 
        355                 360                 365             


Trp Gln Arg Pro Val Ile Asp Asn Pro Asn Tyr Lys Gly Lys Trp Lys 
    370                 375                 380                 


Pro Pro Met Ile Asp Asn Pro Ser Tyr Gln Gly Ile Trp Lys Pro Arg 
385                 390                 395                 400 


Lys Ile Pro Asn Pro Asp Phe Phe Glu Asp Leu Glu Pro Phe Arg Met 
                405                 410                 415     


Thr Pro Phe Ser Ala Ile Gly Leu Glu Leu Trp Ser Met Thr Ser Asp 
            420                 425                 430         


Ile Phe Phe Asp Asn Phe Ile Ile Cys Ala Asp Arg Arg Ile Val Asp 
        435                 440                 445             


Asp Trp Ala Asn Asp Gly Trp Gly Leu Lys Lys Ala Ala Asp Gly Ala 
    450                 455                 460                 


Ala Glu Pro Gly Val Val Gly Gln Met Ile Glu Ala Ala Glu Glu Arg 
465                 470                 475                 480 


Pro Trp Leu Trp Val Val Tyr Ile Leu Thr Val Ala Leu Pro Val Phe 
                485                 490                 495     


Leu Val Ile Leu Phe Cys Cys Ser Gly Lys Lys Gln Thr Ser Gly Met 
            500                 505                 510         


Glu Tyr Lys Lys Thr Asp Ala Pro Gln Pro Asp Val Lys Glu Glu Glu 
        515                 520                 525             


Glu Glu Lys Glu Glu Glu Lys Asp Lys Gly Asp Glu Glu Glu Glu Gly 
    530                 535                 540                 


Glu Glu Lys Leu Glu Glu Lys Gln Lys Ser Asp Ala Glu Glu Asp Gly 
545                 550                 555                 560 


Gly Thr Val Ser Gln Glu Glu Glu Asp Arg Lys Pro Lys Ala Glu Glu 
                565                 570                 575     


Asp Glu Ile Leu Asn Arg Ser Pro Arg Asn Arg Lys Pro Arg Arg Glu 
            580                 585                 590         


<210>  5
<211>  2583
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Leishmania major LmSTT3D

<400>  5
tctagaatgg gtaagagaaa gggaaacagc ctgggagatt ctggttctgc tgctactgct       60

tcaagagagg cttctgctca agctgaggat gctgcttctc agactaagac tgctagccct      120

cctgctaagg tgatcctttt gcctaagacc ctgaccgatg agaaggattt tatcggtatc      180

ttccctttcc cattctggcc tgtgcatttc gtgcttactg tggtggctct tttcgtgttg      240

gctgcttctt gcttccaggc tttcaccgtg aggatgatct ctgtgcagat ctacggttac      300

ctgatccacg agttcgatcc atggttcaac tatagggctg ctgagtacat gtctacccac      360

ggttggtctg ctttcttcag ctggttcgat tacatgagct ggtatcctct tggtaggcct      420

gtgggttcta ctacctatcc tggacttcag cttaccgctg tggctattca tagagctttg      480

gctgctgctg gtatgcctat gtctctgaac aatgtgtgcg tgctgatgcc tgcttggttt      540

ggtgctattg ctaccgctac tcttgctttc tgtacctacg aggcttcagg ttctactgtt      600

gctgctgcag ctgctgctct gagcttctct attattcctg ctcacctgat gagatccatg      660

gctggtgagt tcgataacga gtgcattgct gtggctgcta tgctgcttac tttctactgc      720

tgggtgagaa gccttaggac cagatcttct tggcctattg gtgtgcttac cggtgttgct      780

tacggttaca tggctgcagc ttggggaggt tacattttcg tgctgaacat ggtggctatg      840

cacgctggaa tcagctctat ggttgattgg gctaggaata cctacaaccc atctctgctt      900

agggcttaca ccctgttcta cgttgtggga accgctattg ctgtttgtgt gcctcctgtt      960

ggtatgagcc ctttcaagtc tcttgagcag cttggtgctc tgcttgtgct tgttttcttg     1020

tgcggacttc aggtgtgcga ggttttgaga gctagagctg gtgttgaggt taggtccagg     1080

gctaacttca agatcagggt gagggtgttc tcagtgatgg ctggtgttgc tgctctggct     1140

atttctgtgc ttgctcctac tggttacttc ggtcctttgt ctgttagggt tagggctctg     1200

ttcgttgagc ataccaggac tggtaaccct ctggttgatt ctgttgctga gcatcagcct     1260

gcttctccag aggctatgtg ggcttttctt catgtgtgcg gtgtgacttg gggtctgggt     1320

tctatcgttc ttgctgtgtc taccttcgtg cactacagcc cttctaaggt gttctggctt     1380

ctgaactctg gtgctgtgta ctacttctct accaggatgg ctaggcttct tctgttgtct     1440

ggacctgctg cttgcctgtc tactggtatt ttcgtgggaa ccatccttga ggctgctgtg     1500

cagctttcat tctgggattc tgatgctacc aaggctaaga aacagcaaaa gcaggctcag     1560

aggcatcaga gaggtgctgg taagggttct ggaagggatg atgctaagaa tgctactacc     1620

gctagggctt tctgtgatgt gttcgctggt tcttctcttg cttggggtca taggatggtg     1680

ctgtctattg caatgtgggc tcttgtgact accaccgctg tgtctttctt ctcctccgaa     1740

tttgcttccc actccaccaa gttcgctgag cagtcatcta accctatgat cgtgttcgca     1800

gctgtggtgc agaatagggc tactggaaag cctatgaacc tgctggtgga tgattacctg     1860

aaggcttacg agtggctgag ggattctact ccagaggatg ctagagttct ggcatggtgg     1920

gattacggat accagattac cggtatcggt aacaggacct ctctggctga tggtaatact     1980

tggaaccacg agcacattgc taccatcggt aagatgctta ccagccctgt tgttgaggct     2040

cactctcttg ttaggcacat ggctgattac gtgctgattt gggctggtca gtctggtgat     2100

ctgatgaagt ctcctcacat ggctaggatc ggtaactccg tgtaccacga tatctgccct     2160

gatgatcctc tttgtcagca gttcggtttc cataggaacg attactctag gcctacccct     2220

atgatgaggg cttctcttct ttacaacctg cacgaggcag gtaaaagaaa aggtgtgaag     2280

gtgaacccta gcctgttcca agaggtgtac agctctaagt acggtctggt gaggatcttc     2340

aaggtgatga acgtgagcgc tgagagcaag aagtgggttg cagatcctgc taatagggtg     2400

tgccatcctc ctggttcttg gatttgtcct ggtcagtacc ctccagctaa agaaattcaa     2460

gagatgctgg ctcacagggt gccattcgat caggttacca acgctgatag gaagaacaac     2520

gttggaagct atcaagagga gtacatgaga agaatgaggg aatccgagaa cagaagggga     2580

tcc                                                                   2583


<210>  6
<211>  859
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Leishmania major LmSTT3D

<400>  6

Met Gly Lys Arg Lys Gly Asn Ser Leu Gly Asp Ser Gly Ser Ala Ala 
1               5                   10                  15      


Thr Ala Ser Arg Glu Ala Ser Ala Gln Ala Glu Asp Ala Ala Ser Gln 
            20                  25                  30          


Thr Lys Thr Ala Ser Pro Pro Ala Lys Val Ile Leu Leu Pro Lys Thr 
        35                  40                  45              


Leu Thr Asp Glu Lys Asp Phe Ile Gly Ile Phe Pro Phe Pro Phe Trp 
    50                  55                  60                  


Pro Val His Phe Val Leu Thr Val Val Ala Leu Phe Val Leu Ala Ala 
65                  70                  75                  80  


Ser Cys Phe Gln Ala Phe Thr Val Arg Met Ile Ser Val Gln Ile Tyr 
                85                  90                  95      


Gly Tyr Leu Ile His Glu Phe Asp Pro Trp Phe Asn Tyr Arg Ala Ala 
            100                 105                 110         


Glu Tyr Met Ser Thr His Gly Trp Ser Ala Phe Phe Ser Trp Phe Asp 
        115                 120                 125             


Tyr Met Ser Trp Tyr Pro Leu Gly Arg Pro Val Gly Ser Thr Thr Tyr 
    130                 135                 140                 


Pro Gly Leu Gln Leu Thr Ala Val Ala Ile His Arg Ala Leu Ala Ala 
145                 150                 155                 160 


Ala Gly Met Pro Met Ser Leu Asn Asn Val Cys Val Leu Met Pro Ala 
                165                 170                 175     


Trp Phe Gly Ala Ile Ala Thr Ala Thr Leu Ala Phe Cys Thr Tyr Glu 
            180                 185                 190         


Ala Ser Gly Ser Thr Val Ala Ala Ala Ala Ala Ala Leu Ser Phe Ser 
        195                 200                 205             


Ile Ile Pro Ala His Leu Met Arg Ser Met Ala Gly Glu Phe Asp Asn 
    210                 215                 220                 


Glu Cys Ile Ala Val Ala Ala Met Leu Leu Thr Phe Tyr Cys Trp Val 
225                 230                 235                 240 


Arg Ser Leu Arg Thr Arg Ser Ser Trp Pro Ile Gly Val Leu Thr Gly 
                245                 250                 255     


Val Ala Tyr Gly Tyr Met Ala Ala Ala Trp Gly Gly Tyr Ile Phe Val 
            260                 265                 270         


Leu Asn Met Val Ala Met His Ala Gly Ile Ser Ser Met Val Asp Trp 
        275                 280                 285             


Ala Arg Asn Thr Tyr Asn Pro Ser Leu Leu Arg Ala Tyr Thr Leu Phe 
    290                 295                 300                 


Tyr Val Val Gly Thr Ala Ile Ala Val Cys Val Pro Pro Val Gly Met 
305                 310                 315                 320 


Ser Pro Phe Lys Ser Leu Glu Gln Leu Gly Ala Leu Leu Val Leu Val 
                325                 330                 335     


Phe Leu Cys Gly Leu Gln Val Cys Glu Val Leu Arg Ala Arg Ala Gly 
            340                 345                 350         


Val Glu Val Arg Ser Arg Ala Asn Phe Lys Ile Arg Val Arg Val Phe 
        355                 360                 365             


Ser Val Met Ala Gly Val Ala Ala Leu Ala Ile Ser Val Leu Ala Pro 
    370                 375                 380                 


Thr Gly Tyr Phe Gly Pro Leu Ser Val Arg Val Arg Ala Leu Phe Val 
385                 390                 395                 400 


Glu His Thr Arg Thr Gly Asn Pro Leu Val Asp Ser Val Ala Glu His 
                405                 410                 415     


Gln Pro Ala Ser Pro Glu Ala Met Trp Ala Phe Leu His Val Cys Gly 
            420                 425                 430         


Val Thr Trp Gly Leu Gly Ser Ile Val Leu Ala Val Ser Thr Phe Val 
        435                 440                 445             


His Tyr Ser Pro Ser Lys Val Phe Trp Leu Leu Asn Ser Gly Ala Val 
    450                 455                 460                 


Tyr Tyr Phe Ser Thr Arg Met Ala Arg Leu Leu Leu Leu Ser Gly Pro 
465                 470                 475                 480 


Ala Ala Cys Leu Ser Thr Gly Ile Phe Val Gly Thr Ile Leu Glu Ala 
                485                 490                 495     


Ala Val Gln Leu Ser Phe Trp Asp Ser Asp Ala Thr Lys Ala Lys Lys 
            500                 505                 510         


Gln Gln Lys Gln Ala Gln Arg His Gln Arg Gly Ala Gly Lys Gly Ser 
        515                 520                 525             


Gly Arg Asp Asp Ala Lys Asn Ala Thr Thr Ala Arg Ala Phe Cys Asp 
    530                 535                 540                 


Val Phe Ala Gly Ser Ser Leu Ala Trp Gly His Arg Met Val Leu Ser 
545                 550                 555                 560 


Ile Ala Met Trp Ala Leu Val Thr Thr Thr Ala Val Ser Phe Phe Ser 
                565                 570                 575     


Ser Glu Phe Ala Ser His Ser Thr Lys Phe Ala Glu Gln Ser Ser Asn 
            580                 585                 590         


Pro Met Ile Val Phe Ala Ala Val Val Gln Asn Arg Ala Thr Gly Lys 
        595                 600                 605             


Pro Met Asn Leu Leu Val Asp Asp Tyr Leu Lys Ala Tyr Glu Trp Leu 
    610                 615                 620                 


Arg Asp Ser Thr Pro Glu Asp Ala Arg Val Leu Ala Trp Trp Asp Tyr 
625                 630                 635                 640 


Gly Tyr Gln Ile Thr Gly Ile Gly Asn Arg Thr Ser Leu Ala Asp Gly 
                645                 650                 655     


Asn Thr Trp Asn His Glu His Ile Ala Thr Ile Gly Lys Met Leu Thr 
            660                 665                 670         


Ser Pro Val Val Glu Ala His Ser Leu Val Arg His Met Ala Asp Tyr 
        675                 680                 685             


Val Leu Ile Trp Ala Gly Gln Ser Gly Asp Leu Met Lys Ser Pro His 
    690                 695                 700                 


Met Ala Arg Ile Gly Asn Ser Val Tyr His Asp Ile Cys Pro Asp Asp 
705                 710                 715                 720 


Pro Leu Cys Gln Gln Phe Gly Phe His Arg Asn Asp Tyr Ser Arg Pro 
                725                 730                 735     


Thr Pro Met Met Arg Ala Ser Leu Leu Tyr Asn Leu His Glu Ala Gly 
            740                 745                 750         


Lys Arg Lys Gly Val Lys Val Asn Pro Ser Leu Phe Gln Glu Val Tyr 
        755                 760                 765             


Ser Ser Lys Tyr Gly Leu Val Arg Ile Phe Lys Val Met Asn Val Ser 
    770                 775                 780                 


Ala Glu Ser Lys Lys Trp Val Ala Asp Pro Ala Asn Arg Val Cys His 
785                 790                 795                 800 


Pro Pro Gly Ser Trp Ile Cys Pro Gly Gln Tyr Pro Pro Ala Lys Glu 
                805                 810                 815     


Ile Gln Glu Met Leu Ala His Arg Val Pro Phe Asp Gln Val Thr Asn 
            820                 825                 830         


Ala Asp Arg Lys Asn Asn Val Gly Ser Tyr Gln Glu Glu Tyr Met Arg 
        835                 840                 845             


Arg Met Arg Glu Ser Glu Asn Arg Arg Gly Ser 
    850                 855                 


<210>  7
<211>  220
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  HEXO3RNAi sense strand

<400>  7
gcaaaaacag tttatggggc gttgcatggt ttgcagacat ttagtcaagt atgccatttt       60

aactttacaa ccagaacaat tgaagttcat caagttccat ggaccatagt tgatcgacca      120

agattctctt atcgagggct tttaattgat acttcccgtc actatctgcc gttgcctgtg      180

atattgaagg ttatcgattc aatggcttat gcaaaactga                            220


<210>  8
<211>  220
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  HEXO3RNAi antisense strand

<400>  8
tcagttttgc ataagccatt gaatcgataa ccttcaatat cacaggcaac ggcagatagt       60

gacgggaagt atcaattaaa agccctcgat aagagaatct tggtcgatca actatggtcc      120

atggaacttg atgaacttca attgttctgg ttgtaaagtt aaaatggcat acttgactaa      180

atgtctgcaa accatgcaac gccccataaa ctgtttttgc                            220


<210>  9
<211>  1984
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Modified HIV envelope gp140 for expression in mammalian cells

<400>  9
atggatgcta tgaagcgggg actctgttgc gtgctgctgc tgtgcggagc cgtgttcgtg       60

tcaccctctg ccggagggct gtgggtcact gtctactatg gcgtgcctgt ctggagagag      120

gccaagacca cactgttctg cgcttccgat gcaaagtctt acgaaaaaga ggtgcacaac      180

gtctgggcca cacatgcttg cgtgccaact gaccccaacc ctcaggaact ggtgctgaag      240

aatgtcaccg agaactttaa tatgtggaaa aatgacatgg tggatcagat gcacgaggat      300

atcattagtc tgtgggacca gtcactgaag ccctgcgtga aactgacacc tctgtgcgtc      360

actctgaact gtagcgatgc aaaggtgaac attaatgcca catacaatgg cactcgcgag      420

gaaatcaaaa actgttcctt caatgcaact accgaactga gggacaagaa gaagaaggag      480

tacgccctgt tttatcgcct ggacatcgtg cccctgaaca aggaagggaa caataacagt      540

gagtatcggc tgattaactg caataccagc gtgattaccc aggcctgtcc taaagtcacc      600

ttcgatccaa ttcccatcca ctactgcgca ccagccggat atgctattct gaagtgtaac      660

aacaaaactt ttaacgggac cggaccctgc aataacgtgt ctacagtcca gtgtactcat      720

ggcatcaagc ctgtggtctc aacccagctg ctgctgaatg ggagcctggc cgaggaagag      780

atcattatca gaagcgagaa cctgaccgac aatgtgaaga caattatcgt ccacctgaac      840

gaatccgtgg agattaattg caccaggcca aacaacaaca cacgaaaatc tattcggatc      900

ggaccaggac agaccttcta cgcaacaggg gacattatcg gagatatcag gcaggctcat      960

tgtaacattt ctgaaatcaa gtgggagaaa accctgcagc gcgtgagtga aaagctgcga     1020

gagcacttca acaaaacaat catctttaat cagagctccg gcggggacct ggaaatcaca     1080

actcattcat tcaactgcgg aggcgagttc ttttactgta acactagcga tctgttcttt     1140

aataagacct ttgacgagac ctattccaca ggctcaaaca gcactaattc taccattaca     1200

ctgccatgcc gaatcaaaca gattatcaac atgtggcagg aagtgggccg ggcaatgtat     1260

gccagcccca ttgccggaga gatcacctgt aagtccaata tcactggact gctgctgacc     1320

agagatgggg gaggcaacaa ttctactgaa gagaccttta ggcccggggg aggcaacatg     1380

agagacaatt ggaggagcga actgtacaag tataaagtgg tcgaggtgaa gcctctggga     1440

atcgcaccaa ccgaggcccg gagaagggtg gtccagcagg gcggtggagg ctcaggtgga     1500

ggcggatccg ctgtggtcgg actgggagca gtgttcctgg ggtttctggg aactgctggc     1560

agcaccatgg gagccgcttc cattactctg accgtgcagg cacgccagct gctgtctggc     1620

atcgtccagc agcagagtaa cctgctgcgg gctcctgaag cacagcagca tatgctgcag     1680

ctgaccgtgt gggggattaa gcagctgcag gcccgggtcc tggctatcga gagatacctg     1740

aaggatcagc agctgctggg gatgtgggga tgcagtggca aactgatttg caccacaaac     1800

gtgtactgga acagcagctg gtccaacaag acatataatg aaatctggga caacatgact     1860

tggatgcagt gggaccgcga gatcgataac tacacagaca ctatctataa actgctggaa     1920

gtctcacaga aacagcagga gtcaaatgaa aaggacctgc tggcactgga tgcggccgca     1980

tgat                                                                  1984


<210>  10
<211>  658
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Modified HIV envelope gp140 expressed in mammalian cells

<400>  10

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1               5                   10                  15      


Ala Val Phe Val Ser Pro Ala Gly Leu Trp Val Thr Val Tyr Tyr Gly 
            20                  25                  30          


Val Pro Val Trp Arg Glu Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp 
        35                  40                  45              


Ala Lys Ser Tyr Glu Lys Glu Val His Asn Val Trp Ala Thr His Ala 
    50                  55                  60                  


Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Leu Val Leu Lys Asn Val 
65                  70                  75                  80  


Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met Val Asp Gln Met His 
                85                  90                  95      


Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys 
            100                 105                 110         


Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Ser Asp Ala Lys Val Asn 
        115                 120                 125             


Ile Asn Ala Thr Tyr Asn Gly Thr Arg Glu Glu Ile Lys Asn Cys Ser 
    130                 135                 140                 


Phe Asn Ala Thr Thr Glu Leu Arg Asp Lys Lys Lys Lys Glu Tyr Ala 
145                 150                 155                 160 


Leu Phe Tyr Arg Leu Asp Ile Val Pro Leu Asn Lys Glu Gly Asn Asn 
                165                 170                 175     


Asn Ser Glu Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln 
            180                 185                 190         


Ala Cys Pro Lys Val Thr Phe Asp Pro Ile Pro Ile His Tyr Cys Ala 
        195                 200                 205             


Pro Ala Gly Tyr Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly 
    210                 215                 220                 


Thr Gly Pro Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile 
225                 230                 235                 240 


Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu 
                245                 250                 255     


Glu Glu Ile Ile Ile Arg Ser Glu Asn Leu Thr Asp Asn Val Lys Thr 
            260                 265                 270         


Ile Ile Val His Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro 
        275                 280                 285             


Asn Asn Asn Thr Arg Lys Ser Ile Arg Ile Gly Pro Gly Gln Thr Phe 
    290                 295                 300                 


Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn 
305                 310                 315                 320 


Ile Ser Glu Ile Lys Trp Glu Lys Thr Leu Gln Arg Val Ser Glu Lys 
                325                 330                 335     


Leu Arg Glu His Phe Asn Lys Thr Ile Ile Phe Asn Gln Ser Ser Gly 
            340                 345                 350         


Gly Asp Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe 
        355                 360                 365             


Phe Tyr Cys Asn Thr Ser Asp Leu Phe Phe Asn Lys Thr Phe Asp Glu 
    370                 375                 380                 


Thr Tyr Ser Thr Gly Ser Asn Ser Thr Asn Ser Thr Ile Thr Leu Pro 
385                 390                 395                 400 


Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Arg Ala 
                405                 410                 415     


Met Tyr Ala Ser Pro Ile Ala Gly Glu Ile Thr Cys Lys Ser Asn Ile 
            420                 425                 430         


Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Gly Asn Asn Ser Thr Glu 
        435                 440                 445             


Glu Thr Phe Arg Pro Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser 
    450                 455                 460                 


Glu Leu Tyr Lys Tyr Lys Val Val Glu Val Lys Pro Leu Gly Ile Ala 
465                 470                 475                 480 


Pro Thr Glu Ala Arg Arg Arg Val Val Gln Gln Gly Gly Gly Gly Ser 
                485                 490                 495     


Gly Gly Gly Gly Ser Ala Val Val Gly Leu Gly Ala Val Phe Leu Gly 
            500                 505                 510         


Phe Leu Gly Thr Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu 
        515                 520                 525             


Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser 
    530                 535                 540                 


Asn Leu Leu Arg Ala Pro Glu Ala Gln Gln His Met Leu Gln Leu Thr 
545                 550                 555                 560 


Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Ile Glu Arg 
                565                 570                 575     


Tyr Leu Lys Asp Gln Gln Leu Leu Gly Met Trp Gly Cys Ser Gly Lys 
            580                 585                 590         


Leu Ile Cys Thr Thr Asn Val Tyr Trp Asn Ser Ser Trp Ser Asn Lys 
        595                 600                 605             


Thr Tyr Asn Glu Ile Trp Asp Asn Met Thr Trp Met Gln Trp Asp Arg 
    610                 615                 620                 


Glu Ile Asp Asn Tyr Thr Asp Thr Ile Tyr Lys Leu Leu Glu Val Ser 
625                 630                 635                 640 


Gln Lys Gln Gln Glu Ser Asn Glu Lys Asp Leu Leu Ala Leu Asp Ala 
                645                 650                 655     


Ala Ala 
        


<210>  11
<211>  2018
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Modified HIV envelope gp140 for expression in plant cells

<400>  11
tcatgaaagc ttaccggtgc caccatggag tggtcttgga tcttcctgtt tctgctgagc       60

gggactgctg gagtgcattc ttccggaggg ctgtgggtca ctgtctacta tggcgtgcct      120

gtctggagag aggccaagac cacactgttc tgcgcttccg atgcaaagtc ttacgaaaaa      180

gaggtgcaca acgtctgggc cacacatgct tgcgtgccaa ctgaccccaa ccctcaggaa      240

ctggtgctga agaatgtcac cgagaacttt aatatgtgga aaaatgacat ggtggatcag      300

atgcacgagg atatcattag tctgtgggac cagtcactga agccctgcgt gaaactgaca      360

cctctgtgcg tcactctgaa ctgtagcgat gcaaaggtga acattaatgc cacatacaat      420

ggcactcgcg aggaaatcaa aaactgttcc ttcaatgcaa ctaccgaact gagggacaag      480

aagaagaagg agtacgccct gttttatcgc ctggacatcg tgcccctgaa caaggaaggg      540

aacaataaca gtgagtatcg gctgattaac tgcaatacca gcgtgattac ccaggcctgt      600

cctaaagtca ccttcgatcc aattcccatc cactactgcg caccagccgg atatgctatt      660

ctgaagtgta acaacaaaac ttttaacggg accggaccct gcaataacgt gtctacagtc      720

cagtgtactc atggcatcaa gcctgtggtc tcaacccagc tgctgctgaa tgggagcctg      780

gccgaggaag agatcattat cagaagcgag aacctgaccg acaatgtgaa gacaattatc      840

gtccacctga acgaatccgt ggagattaat tgcaccaggc caaacaacaa cacacgaaaa      900

tctattcgga tcggaccagg acagaccttc tacgcaacag gggacattat cggagatatc      960

aggcaggctc attgtaacat ttctgaaatc aagtgggaga aaaccctgca gcgcgtgagt     1020

gaaaagctgc gagagcactt caacaaaaca atcatcttta atcagagctc cggcggggac     1080

ctggaaatca caactcattc attcaactgc ggaggcgagt tcttttactg taacactagc     1140

gatctgttct ttaataagac ctttgacgag acctattcca caggctcaaa cagcactaat     1200

tctaccatta cactgccatg ccgaatcaaa cagattatca acatgtggca ggaagtgggc     1260

cgggcaatgt atgccagccc cattgccgga gagatcacct gtaagtccaa tatcactgga     1320

ctgctgctga ccagagatgg gggaggcaac aattctactg aagagacctt taggcccggg     1380

ggaggcaaca tgagagacaa ttggaggagc gaactgtaca agtataaagt ggtcgaggtg     1440

aagcctctgg gaatcgcacc aaccgaggcc cggagaaggg tggtccagca gggcggtgga     1500

ggctcaggtg gaggcggatc cgctgtggtc ggactgggag cagtgttcct ggggtttctg     1560

ggaactgctg gcagcaccat gggagccgct tccattactc tgaccgtgca ggcacgccag     1620

ctgctgtctg gcatcgtcca gcagcagagt aacctgctgc gggctcctga agcacagcag     1680

catatgctgc agctgaccgt gtgggggatt aagcagctgc aggcccgggt cctggctatc     1740

gagagatacc tgaaggatca gcagctgctg gggatgtggg gatgcagtgg caaactgatt     1800

tgcaccacaa acgtgtactg gaacagcagc tggtccaaca agacatataa tgaaatctgg     1860

gacaacatga cttggatgca gtgggaccgc gagatcgata actacacaga cactatctat     1920

aaactgctgg aagtctcaca gaaacagcag gagtcaaatg aaaaggacct gctggcactg     1980

gatgcggccg catgattttt ctgaattcta gactcgag                             2018


<210>  12
<211>  656
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Modified HIV envelope gp140 expressed in plant cells

<400>  12

Met Glu Trp Ser Trp Ile Phe Leu Phe Leu Leu Ser Gly Thr Ala Gly 
1               5                   10                  15      


Val His Ser Ser Gly Gly Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
            20                  25                  30          


Val Trp Arg Glu Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
        35                  40                  45              


Ser Tyr Glu Lys Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
    50                  55                  60                  


Pro Thr Asp Pro Asn Pro Gln Glu Leu Val Leu Lys Asn Val Thr Glu 
65                  70                  75                  80  


Asn Phe Asn Met Trp Lys Asn Asp Met Val Asp Gln Met His Glu Asp 
                85                  90                  95      


Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr 
            100                 105                 110         


Pro Leu Cys Val Thr Leu Asn Cys Ser Asp Ala Lys Val Asn Ile Asn 
        115                 120                 125             


Ala Thr Tyr Asn Gly Thr Arg Glu Glu Ile Lys Asn Cys Ser Phe Asn 
    130                 135                 140                 


Ala Thr Thr Glu Leu Arg Asp Lys Lys Lys Lys Glu Tyr Ala Leu Phe 
145                 150                 155                 160 


Tyr Arg Leu Asp Ile Val Pro Leu Asn Lys Glu Gly Asn Asn Asn Ser 
                165                 170                 175     


Glu Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys 
            180                 185                 190         


Pro Lys Val Thr Phe Asp Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 
        195                 200                 205             


Gly Tyr Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly 
    210                 215                 220                 


Pro Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro 
225                 230                 235                 240 


Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 
                245                 250                 255     


Ile Ile Ile Arg Ser Glu Asn Leu Thr Asp Asn Val Lys Thr Ile Ile 
            260                 265                 270         


Val His Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn 
        275                 280                 285             


Asn Thr Arg Lys Ser Ile Arg Ile Gly Pro Gly Gln Thr Phe Tyr Ala 
    290                 295                 300                 


Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser 
305                 310                 315                 320 


Glu Ile Lys Trp Glu Lys Thr Leu Gln Arg Val Ser Glu Lys Leu Arg 
                325                 330                 335     


Glu His Phe Asn Lys Thr Ile Ile Phe Asn Gln Ser Ser Gly Gly Asp 
            340                 345                 350         


Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 
        355                 360                 365             


Cys Asn Thr Ser Asp Leu Phe Phe Asn Lys Thr Phe Asp Glu Thr Tyr 
    370                 375                 380                 


Ser Thr Gly Ser Asn Ser Thr Asn Ser Thr Ile Thr Leu Pro Cys Arg 
385                 390                 395                 400 


Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Arg Ala Met Tyr 
                405                 410                 415     


Ala Ser Pro Ile Ala Gly Glu Ile Thr Cys Lys Ser Asn Ile Thr Gly 
            420                 425                 430         


Leu Leu Leu Thr Arg Asp Gly Gly Gly Asn Asn Ser Thr Glu Glu Thr 
        435                 440                 445             


Phe Arg Pro Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu 
    450                 455                 460                 


Tyr Lys Tyr Lys Val Val Glu Val Lys Pro Leu Gly Ile Ala Pro Thr 
465                 470                 475                 480 


Glu Ala Arg Arg Arg Val Val Gln Gln Gly Gly Gly Gly Ser Gly Gly 
                485                 490                 495     


Gly Gly Ser Ala Val Val Gly Leu Gly Ala Val Phe Leu Gly Phe Leu 
            500                 505                 510         


Gly Thr Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val 
        515                 520                 525             


Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu 
    530                 535                 540                 


Leu Arg Ala Pro Glu Ala Gln Gln His Met Leu Gln Leu Thr Val Trp 
545                 550                 555                 560 


Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Ile Glu Arg Tyr Leu 
                565                 570                 575     


Lys Asp Gln Gln Leu Leu Gly Met Trp Gly Cys Ser Gly Lys Leu Ile 
            580                 585                 590         


Cys Thr Thr Asn Val Tyr Trp Asn Ser Ser Trp Ser Asn Lys Thr Tyr 
        595                 600                 605             


Asn Glu Ile Trp Asp Asn Met Thr Trp Met Gln Trp Asp Arg Glu Ile 
    610                 615                 620                 


Asp Asn Tyr Thr Asp Thr Ile Tyr Lys Leu Leu Glu Val Ser Gln Lys 
625                 630                 635                 640 


Gln Gln Glu Ser Asn Glu Lys Asp Leu Leu Ala Leu Asp Ala Ala Ala 
                645                 650                 655     


<210>  13
<211>  2007
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Marburg viral glycoprotein for expression in mammalian cells

<400>  13
aagcttccac catggatgca atgaagagag ggctctgctg tgtgctgctg ctgtgtggag       60

cagtcttcgt ttcgcccagc gctctgccta ttctggaaat cgcctcaaac aatcagcctc      120

agaacgtgga cagcgtgtgc tctggcaccc tgcagaagac agaggatgtg cacctgatgg      180

gcttcaccct gagcggccag aaggtggcag actccccact ggaggcctct aagaggtggg      240

cctttcgcac cggcgtgcca cctaagaatg tggagtacac cgagggcgag gaggccaaga      300

catgctataa catctccgtg accgatccca gcggcaagtc cctgctgctg gacccaccca      360

ccaatatccg ggattaccca aagtgtaaga caatccacca catccaggga cagaacccac      420

acgcacaggg aatcgccctg cacctgtggg gcgccttctt tctgtacgac cggatcgcca      480

gcaccacaat gtatagaggc aaggtgttca ccgagggcaa tatcgccgcc atgatcgtga      540

acaagacagt gcacaagatg atctttagcc ggcagggcca gggctacaga cacatgaatc      600

tgacatccac caacaagtat tggaccagct ccaatggcac acagaccaac gatacaggct      660

gcttcggcgc cctgcaggag tataattcta ccaagaacca gacatgtgca ccaagcaaga      720

tccctccacc actgccaacc gcccggcctg agatcaagct gacaagcacc cctacagacg      780

ccacaaagct gaacaccaca gacccatcta gcgacgatga ggatctggcc acctccggct      840

ctggctccgg agagagagag cctcacacca catccgatgc cgtgaccaag cagggcctgt      900

cctctaccat gcctccaaca cctagcccac agccatccac acctcagcag ggaggcaaca      960

ataccaatca ctcccaggac gccgtgacag agctggataa gaacaatacc acagcccagc     1020

catctatgcc ccctcacaac accacaacca tcagcaccaa caatacatcc aagcacaatt     1080

tttctaccct gagcgccccc ctgcagaaca caaccaacga caatacccag tctaccatca     1140

cagagaatga gcagacatcc gccccctcta tcacaaccct gccacccacc ggcaacccta     1200

caaccgccaa gagcacaagc tccaagaagg gccctgccac aaccgcccca aatacaacca     1260

acgagcactt caccagccct ccacccaccc catctagcac agcccagcac ctggtgtact     1320

ttggcggtgg aggctcaggt ggaggcggat cctctatcct gtggagggag ggcgacatgt     1380

tcccctttct ggatggcctg atcaatgccc ctatcgattt cgatcctgtg ccaaacacca     1440

agacaatctt tgacgagagc tcctctagcg gcgcatccgc cgaggaggat cagcacgcct     1500

ctccaaatat cagcctgacc ctgtcctact tccccaacat caatgagaac acagcctatt     1560

ctggcgagaa tgagaacgac tgcgatgccg agctgaggat ctggtccgtg caggaggacg     1620

atctggcagc aggcctgtct tggattccct tcttcggacc aggaatcgag ggcctgtata     1680

ccgccgtgct gatcaagaat cagaacaacc tggtgtgcag gctgcggaga ctggcaaacc     1740

agaccgccaa gtctctggag ctgctgctga gggtgacaac cgaggagcgc accttcagcc     1800

tgatcaatag gcacgccatc gactttctgc tgaccagatg gggaggaaca tgcaaggtgc     1860

tgggccctga ctgctgtatc ggcatcgagg atctgtctaa gaacatcagc gagcagatcg     1920

accagatcaa gaaggatgag cagaaggagg gaaccggctg gggcctgggc ggcaagtggt     1980

ggacatccga ttgatttttc tgaattc                                         2007


<210>  14
<211>  660
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Marburg viral glycoprotein expressed in mammalian cells

<400>  14

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1               5                   10                  15      


Ala Val Phe Val Ser Pro Ser Ala Leu Pro Ile Leu Glu Ile Ala Ser 
            20                  25                  30          


Asn Asn Gln Pro Gln Asn Val Asp Ser Val Cys Ser Gly Thr Leu Gln 
        35                  40                  45              


Lys Thr Glu Asp Val His Leu Met Gly Phe Thr Leu Ser Gly Gln Lys 
    50                  55                  60                  


Val Ala Asp Ser Pro Leu Glu Ala Ser Lys Arg Trp Ala Phe Arg Thr 
65                  70                  75                  80  


Gly Val Pro Pro Lys Asn Val Glu Tyr Thr Glu Gly Glu Glu Ala Lys 
                85                  90                  95      


Thr Cys Tyr Asn Ile Ser Val Thr Asp Pro Ser Gly Lys Ser Leu Leu 
            100                 105                 110         


Leu Asp Pro Pro Thr Asn Ile Arg Asp Tyr Pro Lys Cys Lys Thr Ile 
        115                 120                 125             


His His Ile Gln Gly Gln Asn Pro His Ala Gln Gly Ile Ala Leu His 
    130                 135                 140                 


Leu Trp Gly Ala Phe Phe Leu Tyr Asp Arg Ile Ala Ser Thr Thr Met 
145                 150                 155                 160 


Tyr Arg Gly Lys Val Phe Thr Glu Gly Asn Ile Ala Ala Met Ile Val 
                165                 170                 175     


Asn Lys Thr Val His Lys Met Ile Phe Ser Arg Gln Gly Gln Gly Tyr 
            180                 185                 190         


Arg His Met Asn Leu Thr Ser Thr Asn Lys Tyr Trp Thr Ser Ser Asn 
        195                 200                 205             


Gly Thr Gln Thr Asn Asp Thr Gly Cys Phe Gly Ala Leu Gln Glu Tyr 
    210                 215                 220                 


Asn Ser Thr Lys Asn Gln Thr Cys Ala Pro Ser Lys Ile Pro Pro Pro 
225                 230                 235                 240 


Leu Pro Thr Ala Arg Pro Glu Ile Lys Leu Thr Ser Thr Pro Thr Asp 
                245                 250                 255     


Ala Thr Lys Leu Asn Thr Thr Asp Pro Ser Ser Asp Asp Glu Asp Leu 
            260                 265                 270         


Ala Thr Ser Gly Ser Gly Ser Gly Glu Arg Glu Pro His Thr Thr Ser 
        275                 280                 285             


Asp Ala Val Thr Lys Gln Gly Leu Ser Ser Thr Met Pro Pro Thr Pro 
    290                 295                 300                 


Ser Pro Gln Pro Ser Thr Pro Gln Gln Gly Gly Asn Asn Thr Asn His 
305                 310                 315                 320 


Ser Gln Asp Ala Val Thr Glu Leu Asp Lys Asn Asn Thr Thr Ala Gln 
                325                 330                 335     


Pro Ser Met Pro Pro His Asn Thr Thr Thr Ile Ser Thr Asn Asn Thr 
            340                 345                 350         


Ser Lys His Asn Phe Ser Thr Leu Ser Ala Pro Leu Gln Asn Thr Thr 
        355                 360                 365             


Asn Asp Asn Thr Gln Ser Thr Ile Thr Glu Asn Glu Gln Thr Ser Ala 
    370                 375                 380                 


Pro Ser Ile Thr Thr Leu Pro Pro Thr Gly Asn Pro Thr Thr Ala Lys 
385                 390                 395                 400 


Ser Thr Ser Ser Lys Lys Gly Pro Ala Thr Thr Ala Pro Asn Thr Thr 
                405                 410                 415     


Asn Glu His Phe Thr Ser Pro Pro Pro Thr Pro Ser Ser Thr Ala Gln 
            420                 425                 430         


His Leu Val Tyr Phe Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ser 
        435                 440                 445             


Ile Leu Trp Arg Glu Gly Asp Met Phe Pro Phe Leu Asp Gly Leu Ile 
    450                 455                 460                 


Asn Ala Pro Ile Asp Phe Asp Pro Val Pro Asn Thr Lys Thr Ile Phe 
465                 470                 475                 480 


Asp Glu Ser Ser Ser Ser Gly Ala Ser Ala Glu Glu Asp Gln His Ala 
                485                 490                 495     


Ser Pro Asn Ile Ser Leu Thr Leu Ser Tyr Phe Pro Asn Ile Asn Glu 
            500                 505                 510         


Asn Thr Ala Tyr Ser Gly Glu Asn Glu Asn Asp Cys Asp Ala Glu Leu 
        515                 520                 525             


Arg Ile Trp Ser Val Gln Glu Asp Asp Leu Ala Ala Gly Leu Ser Trp 
    530                 535                 540                 


Ile Pro Phe Phe Gly Pro Gly Ile Glu Gly Leu Tyr Thr Ala Val Leu 
545                 550                 555                 560 


Ile Lys Asn Gln Asn Asn Leu Val Cys Arg Leu Arg Arg Leu Ala Asn 
                565                 570                 575     


Gln Thr Ala Lys Ser Leu Glu Leu Leu Leu Arg Val Thr Thr Glu Glu 
            580                 585                 590         


Arg Thr Phe Ser Leu Ile Asn Arg His Ala Ile Asp Phe Leu Leu Thr 
        595                 600                 605             


Arg Trp Gly Gly Thr Cys Lys Val Leu Gly Pro Asp Cys Cys Ile Gly 
    610                 615                 620                 


Ile Glu Asp Leu Ser Lys Asn Ile Ser Glu Gln Ile Asp Gln Ile Lys 
625                 630                 635                 640 


Lys Asp Glu Gln Lys Glu Gly Thr Gly Trp Gly Leu Gly Gly Lys Trp 
                645                 650                 655     


Trp Thr Ser Asp 
            660 


<210>  15
<211>  2006
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Marburg viral glycoprotein for expression in plant cells

<400>  15
accggtaagc ttccaccatg gagtggagtt ggatctttct gttcctgctg tcagggaccg       60

ctggagtgca tagcgctctg cctattctgg aaatcgcctc aaacaatcag cctcagaacg      120

tggacagcgt gtgctctggc accctgcaga agacagagga tgtgcacctg atgggcttca      180

ccctgagcgg ccagaaggtg gcagactccc cactggaggc ctctaagagg tgggcctttc      240

gcaccggcgt gccacctaag aatgtggagt acaccgaggg cgaggaggcc aagacatgct      300

ataacatctc cgtgaccgat cccagcggca agtccctgct gctggaccca cccaccaata      360

tccgggatta cccaaagtgt aagacaatcc accacatcca gggacagaac ccacacgcac      420

agggaatcgc cctgcacctg tggggcgcct tctttctgta cgaccggatc gccagcacca      480

caatgtatag aggcaaggtg ttcaccgagg gcaatatcgc cgccatgatc gtgaacaaga      540

cagtgcacaa gatgatcttt agccggcagg gccagggcta cagacacatg aatctgacat      600

ccaccaacaa gtattggacc agctccaatg gcacacagac caacgataca ggctgcttcg      660

gcgccctgca ggagtataat tctaccaaga accagacatg tgcaccaagc aagatccctc      720

caccactgcc aaccgcccgg cctgagatca agctgacaag cacccctaca gacgccacaa      780

agctgaacac cacagaccca tctagcgacg atgaggatct ggccacctcc ggctctggct      840

ccggagagag agagcctcac accacatccg atgccgtgac caagcagggc ctgtcctcta      900

ccatgcctcc aacacctagc ccacagccat ccacacctca gcagggaggc aacaatacca      960

atcactccca ggacgccgtg acagagctgg ataagaacaa taccacagcc cagccatcta     1020

tgccccctca caacaccaca accatcagca ccaacaatac atccaagcac aatttttcta     1080

ccctgagcgc ccccctgcag aacacaacca acgacaatac ccagtctacc atcacagaga     1140

atgagcagac atccgccccc tctatcacaa ccctgccacc caccggcaac cctacaaccg     1200

ccaagagcac aagctccaag aagggccctg ccacaaccgc cccaaataca accaacgagc     1260

acttcaccag ccctccaccc accccatcta gcacagccca gcacctggtg tactttggcg     1320

gtggaggctc aggtggaggc ggatcctcta tcctgtggag ggagggcgac atgttcccct     1380

ttctggatgg cctgatcaat gcccctatcg atttcgatcc tgtgccaaac accaagacaa     1440

tctttgacga gagctcctct agcggcgcat ccgccgagga ggatcagcac gcctctccaa     1500

atatcagcct gaccctgtcc tacttcccca acatcaatga gaacacagcc tattctggcg     1560

agaatgagaa cgactgcgat gccgagctga ggatctggtc cgtgcaggag gacgatctgg     1620

cagcaggcct gtcttggatt cccttcttcg gaccaggaat cgagggcctg tataccgccg     1680

tgctgatcaa gaatcagaac aacctggtgt gcaggctgcg gagactggca aaccagaccg     1740

ccaagtctct ggagctgctg ctgagggtga caaccgagga gcgcaccttc agcctgatca     1800

ataggcacgc catcgacttt ctgctgacca gatggggagg aacatgcaag gtgctgggcc     1860

ctgactgctg tatcggcatc gaggatctgt ctaagaacat cagcgagcag atcgaccaga     1920

tcaagaagga tgagcagaag gagggaaccg gctggggcct gggcggcaag tggtggacat     1980

ccgattgatt tttctgaatt ctcgag                                          2006


<210>  16
<211>  656
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Marburg viral glycoprotein expressed in plant cells

<400>  16

Met Glu Trp Ser Trp Ile Phe Leu Phe Leu Leu Ser Gly Thr Ala Gly 
1               5                   10                  15      


Val His Ser Ala Leu Pro Ile Leu Glu Ile Ala Ser Asn Asn Gln Pro 
            20                  25                  30          


Gln Asn Val Asp Ser Val Cys Ser Gly Thr Leu Gln Lys Thr Glu Asp 
        35                  40                  45              


Val His Leu Met Gly Phe Thr Leu Ser Gly Gln Lys Val Ala Asp Ser 
    50                  55                  60                  


Pro Leu Glu Ala Ser Lys Arg Trp Ala Phe Arg Thr Gly Val Pro Pro 
65                  70                  75                  80  


Lys Asn Val Glu Tyr Thr Glu Gly Glu Glu Ala Lys Thr Cys Tyr Asn 
                85                  90                  95      


Ile Ser Val Thr Asp Pro Ser Gly Lys Ser Leu Leu Leu Asp Pro Pro 
            100                 105                 110         


Thr Asn Ile Arg Asp Tyr Pro Lys Cys Lys Thr Ile His His Ile Gln 
        115                 120                 125             


Gly Gln Asn Pro His Ala Gln Gly Ile Ala Leu His Leu Trp Gly Ala 
    130                 135                 140                 


Phe Phe Leu Tyr Asp Arg Ile Ala Ser Thr Thr Met Tyr Arg Gly Lys 
145                 150                 155                 160 


Val Phe Thr Glu Gly Asn Ile Ala Ala Met Ile Val Asn Lys Thr Val 
                165                 170                 175     


His Lys Met Ile Phe Ser Arg Gln Gly Gln Gly Tyr Arg His Met Asn 
            180                 185                 190         


Leu Thr Ser Thr Asn Lys Tyr Trp Thr Ser Ser Asn Gly Thr Gln Thr 
        195                 200                 205             


Asn Asp Thr Gly Cys Phe Gly Ala Leu Gln Glu Tyr Asn Ser Thr Lys 
    210                 215                 220                 


Asn Gln Thr Cys Ala Pro Ser Lys Ile Pro Pro Pro Leu Pro Thr Ala 
225                 230                 235                 240 


Arg Pro Glu Ile Lys Leu Thr Ser Thr Pro Thr Asp Ala Thr Lys Leu 
                245                 250                 255     


Asn Thr Thr Asp Pro Ser Ser Asp Asp Glu Asp Leu Ala Thr Ser Gly 
            260                 265                 270         


Ser Gly Ser Gly Glu Arg Glu Pro His Thr Thr Ser Asp Ala Val Thr 
        275                 280                 285             


Lys Gln Gly Leu Ser Ser Thr Met Pro Pro Thr Pro Ser Pro Gln Pro 
    290                 295                 300                 


Ser Thr Pro Gln Gln Gly Gly Asn Asn Thr Asn His Ser Gln Asp Ala 
305                 310                 315                 320 


Val Thr Glu Leu Asp Lys Asn Asn Thr Thr Ala Gln Pro Ser Met Pro 
                325                 330                 335     


Pro His Asn Thr Thr Thr Ile Ser Thr Asn Asn Thr Ser Lys His Asn 
            340                 345                 350         


Phe Ser Thr Leu Ser Ala Pro Leu Gln Asn Thr Thr Asn Asp Asn Thr 
        355                 360                 365             


Gln Ser Thr Ile Thr Glu Asn Glu Gln Thr Ser Ala Pro Ser Ile Thr 
    370                 375                 380                 


Thr Leu Pro Pro Thr Gly Asn Pro Thr Thr Ala Lys Ser Thr Ser Ser 
385                 390                 395                 400 


Lys Lys Gly Pro Ala Thr Thr Ala Pro Asn Thr Thr Asn Glu His Phe 
                405                 410                 415     


Thr Ser Pro Pro Pro Thr Pro Ser Ser Thr Ala Gln His Leu Val Tyr 
            420                 425                 430         


Phe Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ser Ile Leu Trp Arg 
        435                 440                 445             


Glu Gly Asp Met Phe Pro Phe Leu Asp Gly Leu Ile Asn Ala Pro Ile 
    450                 455                 460                 


Asp Phe Asp Pro Val Pro Asn Thr Lys Thr Ile Phe Asp Glu Ser Ser 
465                 470                 475                 480 


Ser Ser Gly Ala Ser Ala Glu Glu Asp Gln His Ala Ser Pro Asn Ile 
                485                 490                 495     


Ser Leu Thr Leu Ser Tyr Phe Pro Asn Ile Asn Glu Asn Thr Ala Tyr 
            500                 505                 510         


Ser Gly Glu Asn Glu Asn Asp Cys Asp Ala Glu Leu Arg Ile Trp Ser 
        515                 520                 525             


Val Gln Glu Asp Asp Leu Ala Ala Gly Leu Ser Trp Ile Pro Phe Phe 
    530                 535                 540                 


Gly Pro Gly Ile Glu Gly Leu Tyr Thr Ala Val Leu Ile Lys Asn Gln 
545                 550                 555                 560 


Asn Asn Leu Val Cys Arg Leu Arg Arg Leu Ala Asn Gln Thr Ala Lys 
                565                 570                 575     


Ser Leu Glu Leu Leu Leu Arg Val Thr Thr Glu Glu Arg Thr Phe Ser 
            580                 585                 590         


Leu Ile Asn Arg His Ala Ile Asp Phe Leu Leu Thr Arg Trp Gly Gly 
        595                 600                 605             


Thr Cys Lys Val Leu Gly Pro Asp Cys Cys Ile Gly Ile Glu Asp Leu 
    610                 615                 620                 


Ser Lys Asn Ile Ser Glu Gln Ile Asp Gln Ile Lys Lys Asp Glu Gln 
625                 630                 635                 640 


Lys Glu Gly Thr Gly Trp Gly Leu Gly Gly Lys Trp Trp Thr Ser Asp 
                645                 650                 655     


<210>  17
<211>  68
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  TPA leader sequence for expression with modified HIV Env gp140

<400>  17
atggatgcta tgaagcgggg actctgttgc gtgctgctgc tgtgcggagc cgtgttcgtg       60

tcaccctc                                                                68


<210>  18
<211>  66
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  TPA leader sequence for expression with modified Marburg viral 
       glycoprotein

<400>  18
atggatgcaa tgaagagagg gctctgctgt gtgctgctgc tgtgtggagc agtcttcgtt       60

tcgccc                                                                  66


<210>  19
<211>  66
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  TPA leader sequence for expression with cleaved SOSIP.664

<400>  19
atggatgcta tgaaaagggg gctgtgctgc gtcctgctgc tgtgcggggc tgtcttcgtg       60

tcacca                                                                  66


<210>  20
<211>  22
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Amino acid sequence for tissue plasminogen activator leader 
       sequence

<400>  20

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1               5                   10                  15      


Ala Val Phe Val Ser Pro 
            20          


<210>  21
<211>  57
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Murine monoclonal leader peptide heavy chain for the modified HIV
       env gp140

<400>  21
atggagtggt cttggatctt cctgtttctg ctgagcggga ctgctggagt gcattct          57


<210>  22
<211>  57
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Murine monoclonal leader peptide heavy chain for the Marburg 
       viral glycoprotein

<400>  22
atggagtgga gttggatctt tctgttcctg ctgtcaggga ccgctggagt gcatagc          57


<210>  23
<211>  66
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Murine monoclonal leader peptide heavy chain for the modified 
       Epstein-Barr virus gp350

<400>  23
atggatgcta tgaaaagggg gctgtgctgc gtcctgctgc tgtgcggggc tgtcttcgtg       60

tcacca                                                                  66


<210>  24
<211>  19
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Amino acid sequence for the murine monoclonal leader peptide 
       heavy chain

<400>  24

Met Glu Trp Ser Trp Ile Phe Leu Phe Leu Leu Ser Gly Thr Ala Gly 
1               5                   10                  15      


Val His Ser 
            


<210>  25
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Amino acid sequence of the furin cleavage site for the modified 
       HIV env gp140 polypeptide

<400>  25

Arg Glu Arg Arg 
1               


<210>  26
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Amino acid sequence of the furin cleavage site for the modified 
       Marburg viral glycoprotein

<400>  26

Arg Arg Lys Arg 
1               


<210>  27
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Flexible linker for modified HIV env gp140 for expression in 
       plant cells

<400>  27
ggcggtggag gctcaggtgg aggcggatcc                                        30


<210>  28
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Flexible linker for modified HIV env gp140 for expression in 
       mammalian cells

<400>  28
ggcggtggag gctcaggtgg aggcggatcc gc                                     32


<210>  29
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Flexible linker for modified Marburg viral glycoprotein for 
       expression in plant cells

<400>  29
ggcggtggag gctcaggtgg aggcggatcc                                        30


<210>  30
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Flexible linker for modified Marburg viral glycoprotein for 
       expression in mammalian cells

<400>  30
ggcggtggag gctcaggtgg aggcggatcc                                        30


<210>  31
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Amino acid sequence of the flexible linker

<400>  31

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
1               5                   10  


<210>  32
<211>  2689
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Modified Epstein-Barr virus gp350

<400>  32
aagcttccac catggagtgg tcatggattt ttctgtttct gctgtctggg actgccggag       60

tgcatagcgc tgaggctgct ctgctggtct gtcagtatac catccagagc ctgatccacc      120

tgacaggcga ggaccccggc ttctttaacg tggagatccc agagttcccc ttttacccta      180

catgcaacgt gtgcaccgcc gacgtgaacg tgaccatcaa tttcgatgtg ggcggcaaga      240

agcaccagct ggacctggat tttggccagc tgacccctca cacaaaggcc gtgtatcagc      300

caagaggcgc cttcggcgga agcgagaacg caaccaatct gtttctgctg gagctgctgg      360

gcgcaggaga gctggccctg accatgaggt ccaagaagct gccaatcaac gtgaccacag      420

gcgaggagca gcaggtgtcc ctggagtctg tggacgtgta cttccaggac gtgttcggca      480

ccatgtggtg ccaccacgcc gagatgcaga atcccgtgta cctgatccct gagacagtgc      540

catatatcaa gtgggacaac tgtaatagca ccaacatcac agcagtggtg agagcacagg      600

gcctggacgt gaccctgcca ctgagcctgc ctacatccgc ccaggatagc aacttctccg      660

tgaagaccga gatgctgggc aatgagatcg acatcgagtg catcatggag gatggcgaga      720

tctcccaggt gctgcctggc gataacaagt ttaatatcac ctgttctgga tacgagagcc      780

acgtgccttc cggaggcatc ctgacctcta caagcccagt ggcaacacca atccctggaa      840

ccggctacgc ctatagcctg cggctgaccc caagacccgt gtctaggttc ctgggcaaca      900

atagcatcct gtacgtgttt tattccggaa acggaccaaa ggcctctgga ggcgactatt      960

gcatccagag caatatcgtg ttctccgacg agatccccgc ctctcaggat atgcctacca     1020

acaccacaga catcacatac gtgggcgata atgccaccta tagcgtgccc atggtgacat     1080

ctgaggacgc caacagccct aatgtgaccg tgacagcctt ctgggcctgg ccaaacaata     1140

ccgagacaga cttcaagtgc aagtggaccc tgacatctgg cacccccagc ggctgtgaga     1200

acatctccgg cgccttcgcc tctaatagga catttgatat caccgtgtcc ggcctgggca     1260

cagcccctaa gaccctgatc atcacccgca cagccaccaa cgccaccaca accacacaca     1320

aagtgatctt cagcaaggcc cctaggtcca ccacaacctc tccaaccctg aacacaaccg     1380

gctttgccga cccaaataca accacaggcc tgccaagctc cacccacgtg ccaaccaacc     1440

tgacagcccc tgcctctacc ggcccaacag tgagcaccgc cgatgtgaca tcccctaccc     1500

cagcaggaac cacatctgga gcaagccccg tgactccatc cccttctcca tgggacaatg     1560

gcacagagtc taaggccccc gatatgacat ctagcaccag ccctgtgacc acacccaccc     1620

ctaacgccac atctccaacc cccgccgtga ccacacctac cccaaatgcc acaagcccaa     1680

cccctgcagt gaccacacca accccaaacg ccacatcccc caccctgggc aagacatccc     1740

ctacctctgc cgtcactacc ccaaccccaa atgccacatc cccaaccctg ggcaagacaa     1800

gccccacctc cgccgtgacc acaccaactc caaacgccac atctcctacc ctgggcaaga     1860

catctccaac cagcgccgtc actacaccaa ccccaaatgc aacaggccca accgtgggag     1920

agacaagccc tcaggccaac gccacaaatc acaccctggg cggcacaagc ccaaccccag     1980

tggtgacctc ccagcccaag aacgccacat ctgccgtgac cacaggccag cacaatatca     2040

catcctctag cacctcctct atgagcctgc gcccaagctc caaccccgag acactgagcc     2100

cttccacctc tgacaatagc acatcccaca tgccactgct gaccagcgcc cacccaacag     2160

gcggagagaa catcacacag gtgaccccag cctctatcag cacccaccac gtgtccacat     2220

ctagcccagc acctcggcca ggaaccacat cccaggcctc tggaccaggc aattcctcta     2280

caagcaccaa gcccggcgaa gtgaacgtga caaagggaac cccacctcag aatgccacca     2340

gcccacaggc accatccgga cagaagacag cagtgccaac agtgaccagc acaggcggca     2400

aggccaactc caccacaggc ggcaagcaca ccacaggcca cggagcacgg acctccacag     2460

agcctaccac agactacggc ggcgattcta ccacaccccg gcctagatac aatgccacca     2520

catatctgcc accaagcacc agctccaagc tgaggccccg ctggaccttc acatcccctc     2580

cagtgaccac agcccaggca accgtcccag tcccccccac ctcacagcca agattttcca     2640

acctgcatat gcatcatcac catcaccatt gatttttctg aattctcga                 2689


<210>  33
<211>  886
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Modified Epstein-Barr virus gp350

<400>  33

Met Glu Trp Ser Trp Ile Phe Leu Phe Leu Leu Ser Gly Thr Ala Gly 
1               5                   10                  15      


Val His Ser Ala Glu Ala Ala Leu Leu Val Cys Gln Tyr Thr Ile Gln 
            20                  25                  30          


Ser Leu Ile His Leu Thr Gly Glu Asp Pro Gly Phe Phe Asn Val Glu 
        35                  40                  45              


Ile Pro Glu Phe Pro Phe Tyr Pro Thr Cys Asn Val Cys Thr Ala Asp 
    50                  55                  60                  


Val Asn Val Thr Ile Asn Phe Asp Val Gly Gly Lys Lys His Gln Leu 
65                  70                  75                  80  


Asp Leu Asp Phe Gly Gln Leu Thr Pro His Thr Lys Ala Val Tyr Gln 
                85                  90                  95      


Pro Arg Gly Ala Phe Gly Gly Ser Glu Asn Ala Thr Asn Leu Phe Leu 
            100                 105                 110         


Leu Glu Leu Leu Gly Ala Gly Glu Leu Ala Leu Thr Met Arg Ser Lys 
        115                 120                 125             


Lys Leu Pro Ile Asn Val Thr Thr Gly Glu Glu Gln Gln Val Ser Leu 
    130                 135                 140                 


Glu Ser Val Asp Val Tyr Phe Gln Asp Val Phe Gly Thr Met Trp Cys 
145                 150                 155                 160 


His His Ala Glu Met Gln Asn Pro Val Tyr Leu Ile Pro Glu Thr Val 
                165                 170                 175     


Pro Tyr Ile Lys Trp Asp Asn Cys Asn Ser Thr Asn Ile Thr Ala Val 
            180                 185                 190         


Val Arg Ala Gln Gly Leu Asp Val Thr Leu Pro Leu Ser Leu Pro Thr 
        195                 200                 205             


Ser Ala Gln Asp Ser Asn Phe Ser Val Lys Thr Glu Met Leu Gly Asn 
    210                 215                 220                 


Glu Ile Asp Ile Glu Cys Ile Met Glu Asp Gly Glu Ile Ser Gln Val 
225                 230                 235                 240 


Leu Pro Gly Asp Asn Lys Phe Asn Ile Thr Cys Ser Gly Tyr Glu Ser 
                245                 250                 255     


His Val Pro Ser Gly Gly Ile Leu Thr Ser Thr Ser Pro Val Ala Thr 
            260                 265                 270         


Pro Ile Pro Gly Thr Gly Tyr Ala Tyr Ser Leu Arg Leu Thr Pro Arg 
        275                 280                 285             


Pro Val Ser Arg Phe Leu Gly Asn Asn Ser Ile Leu Tyr Val Phe Tyr 
    290                 295                 300                 


Ser Gly Asn Gly Pro Lys Ala Ser Gly Gly Asp Tyr Cys Ile Gln Ser 
305                 310                 315                 320 


Asn Ile Val Phe Ser Asp Glu Ile Pro Ala Ser Gln Asp Met Pro Thr 
                325                 330                 335     


Asn Thr Thr Asp Ile Thr Tyr Val Gly Asp Asn Ala Thr Tyr Ser Val 
            340                 345                 350         


Pro Met Val Thr Ser Glu Asp Ala Asn Ser Pro Asn Val Thr Val Thr 
        355                 360                 365             


Ala Phe Trp Ala Trp Pro Asn Asn Thr Glu Thr Asp Phe Lys Cys Lys 
    370                 375                 380                 


Trp Thr Leu Thr Ser Gly Thr Pro Ser Gly Cys Glu Asn Ile Ser Gly 
385                 390                 395                 400 


Ala Phe Ala Ser Asn Arg Thr Phe Asp Ile Thr Val Ser Gly Leu Gly 
                405                 410                 415     


Thr Ala Pro Lys Thr Leu Ile Ile Thr Arg Thr Ala Thr Asn Ala Thr 
            420                 425                 430         


Thr Thr Thr His Lys Val Ile Phe Ser Lys Ala Pro Arg Ser Thr Thr 
        435                 440                 445             


Thr Ser Pro Thr Leu Asn Thr Thr Gly Phe Ala Asp Pro Asn Thr Thr 
    450                 455                 460                 


Thr Gly Leu Pro Ser Ser Thr His Val Pro Thr Asn Leu Thr Ala Pro 
465                 470                 475                 480 


Ala Ser Thr Gly Pro Thr Val Ser Thr Ala Asp Val Thr Ser Pro Thr 
                485                 490                 495     


Pro Ala Gly Thr Thr Ser Gly Ala Ser Pro Val Thr Pro Ser Pro Ser 
            500                 505                 510         


Pro Trp Asp Asn Gly Thr Glu Ser Lys Ala Pro Asp Met Thr Ser Ser 
        515                 520                 525             


Thr Ser Pro Val Thr Thr Pro Thr Pro Asn Ala Thr Ser Pro Thr Pro 
    530                 535                 540                 


Ala Val Thr Thr Pro Thr Pro Asn Ala Thr Ser Pro Thr Pro Ala Val 
545                 550                 555                 560 


Thr Thr Pro Thr Pro Asn Ala Thr Ser Pro Thr Leu Gly Lys Thr Ser 
                565                 570                 575     


Pro Thr Ser Ala Val Thr Thr Pro Thr Pro Asn Ala Thr Ser Pro Thr 
            580                 585                 590         


Leu Gly Lys Thr Ser Pro Thr Ser Ala Val Thr Thr Pro Thr Pro Asn 
        595                 600                 605             


Ala Thr Ser Pro Thr Leu Gly Lys Thr Ser Pro Thr Ser Ala Val Thr 
    610                 615                 620                 


Thr Pro Thr Pro Asn Ala Thr Gly Pro Thr Val Gly Glu Thr Ser Pro 
625                 630                 635                 640 


Gln Ala Asn Ala Thr Asn His Thr Leu Gly Gly Thr Ser Pro Thr Pro 
                645                 650                 655     


Val Val Thr Ser Gln Pro Lys Asn Ala Thr Ser Ala Val Thr Thr Gly 
            660                 665                 670         


Gln His Asn Ile Thr Ser Ser Ser Thr Ser Ser Met Ser Leu Arg Pro 
        675                 680                 685             


Ser Ser Asn Pro Glu Thr Leu Ser Pro Ser Thr Ser Asp Asn Ser Thr 
    690                 695                 700                 


Ser His Met Pro Leu Leu Thr Ser Ala His Pro Thr Gly Gly Glu Asn 
705                 710                 715                 720 


Ile Thr Gln Val Thr Pro Ala Ser Ile Ser Thr His His Val Ser Thr 
                725                 730                 735     


Ser Ser Pro Ala Pro Arg Pro Gly Thr Thr Ser Gln Ala Ser Gly Pro 
            740                 745                 750         


Gly Asn Ser Ser Thr Ser Thr Lys Pro Gly Glu Val Asn Val Thr Lys 
        755                 760                 765             


Gly Thr Pro Pro Gln Asn Ala Thr Ser Pro Gln Ala Pro Ser Gly Gln 
    770                 775                 780                 


Lys Thr Ala Val Pro Thr Val Thr Ser Thr Gly Gly Lys Ala Asn Ser 
785                 790                 795                 800 


Thr Thr Gly Gly Lys His Thr Thr Gly His Gly Ala Arg Thr Ser Thr 
                805                 810                 815     


Glu Pro Thr Thr Asp Tyr Gly Gly Asp Ser Thr Thr Pro Arg Pro Arg 
            820                 825                 830         


Tyr Asn Ala Thr Thr Tyr Leu Pro Pro Ser Thr Ser Ser Lys Leu Arg 
        835                 840                 845             


Pro Arg Trp Thr Phe Thr Ser Pro Pro Val Thr Thr Ala Gln Ala Thr 
    850                 855                 860                 


Val Pro Val Pro Pro Thr Ser Gln Pro Arg Phe Ser Asn Leu His Met 
865                 870                 875                 880 


His His His His His His 
                885     


<210>  34
<211>  1964
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Modified SOSIP.664

<400>  34
accggtatgg atgctatgaa aagggggctg tgctgcgtcc tgctgctgtg cggggctgtc       60

ttcgtgtcac caagcgggct gtgggtcact gtctactatg gcgtgcctgt ctggagagag      120

gccaagacca cactgttctg cgcttccgat gcaaagtctt acgaaaaaga ggtgcacaac      180

gtctgggcca cacatgcttg cgtgccaact gaccccaacc ctcaggaact ggtgctgaag      240

aatgtcaccg agaactttaa tatgtggaaa aatgacatgg tggatcagat gcacgaggat      300

atcattagtc tgtgggacca gtcactgaag ccctgcgtga aactgacacc tctgtgcgtc      360

actctgaact gtagcgatgc aaaggtgaac attaatgcca catacaatgg cactcgcgag      420

gaaatcaaaa actgttcctt caatgcaact accgaactga gggacaagaa gaagaaggag      480

tacgccctgt tttatcgcct ggacatcgtg cccctgaaca aggaagggaa caataacagt      540

gagtatcggc tgattaactg caataccagc gtgattaccc aggcctgtcc taaagtcacc      600

ttcgatccaa ttcccatcca ctactgcgca ccagccggat atgctattct gaagtgtaac      660

aacaaaactt ttaacgggac cggaccctgc aataacgtgt ctacagtcca gtgtactcat      720

ggcatcaagc ctgtggtctc aacccagctg ctgctgaatg ggagcctggc cgaggaagag      780

atcattatca gaagcgagaa cctgaccgac aatgtgaaga caattatcgt ccacctgaac      840

gaatccgtgg agattaattg caccaggcca aacaacaaca cacgaaaatc tattcggatc      900

ggaccaggac agaccttcta cgcaacaggg gacattatcg gagatatcag gcaggctcat      960

tgtaacattt ctgaaatcaa gtgggagaaa accctgcagc gcgtgagtga aaagctgcga     1020

gagcacttca acaaaacaat catctttaat cagagctccg gcggggacct ggaaatcaca     1080

actcattcat tcaactgcgg aggcgagttc ttttactgta acactagcga tctgttcttt     1140

aataagacct ttgacgagac ctattccaca ggctcaaaca gcactaattc taccattaca     1200

ctgccatgcc gaatcaaaca gattatcaac atgtggcagg aagtgggccg ggcaatgtat     1260

gccagcccca ttgccggaga gatcacctgt aagtccaata tcactggact gctgctgacc     1320

agagatgggg gaggcaacaa ttctactgaa gagaccttta ggcccggggg aggcaacatg     1380

agagacaatt ggaggagcga actgtacaag tataaagtgg tcgaggtgaa gcctctggga     1440

atcgcaccaa ccgagtgccg gagaagggtg gtccagcgcc gacggagaag gcgcgctgtg     1500

gtcggactgg gagcagtgtt cctggggttt ctgggaactg ctggcagcac catgggagcc     1560

gcttccatta ctctgaccgt gcaggcacgc cagctgctgt ctggcatcgt ccagcagcag     1620

agtaacctgc tgcgggctcc tgaagcacag cagcatatgc tgcagctgac cgtgtggggg     1680

attaagcagc tgcaggcccg ggtcctggct atcgagagat acctgaagga tcagcagctg     1740

ctggggatgt ggggatgcag tggcaaactg atttgctgta caaacgtgta ctggaacagc     1800

agctggtcca acaagacata taatgaaatc tgggacaaca tgacttggat gcagtgggac     1860

cgcgagatcg ataactacac agacactatc tataaactgc tggaagtctc acagaaacag     1920

caggagtcaa atgaaaagga cctgctggca ctggattaac tcga                      1964


<210>  35
<211>  794
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Modified SOSIP.664

<400>  35

Met Glu Leu Arg Pro Trp Leu Leu Trp Val Val Ala Ala Thr Gly Thr 
1               5                   10                  15      


Leu Val Leu Leu Ala Ala Asp Ala Gln Gly Gln Lys Val Phe Thr Asn 
            20                  25                  30          


Thr Trp Ala Val Arg Ile Pro Gly Gly Pro Ala Val Ala Asn Ser Val 
        35                  40                  45              


Ala Arg Lys His Gly Phe Leu Asn Leu Gly Gln Ile Phe Gly Asp Tyr 
    50                  55                  60                  


Tyr His Phe Trp His Arg Gly Val Thr Lys Arg Ser Leu Ser Pro His 
65                  70                  75                  80  


Arg Pro Arg His Ser Arg Leu Gln Arg Glu Pro Gln Val Gln Trp Leu 
                85                  90                  95      


Glu Gln Gln Val Ala Lys Arg Arg Thr Lys Arg Asp Val Tyr Gln Glu 
            100                 105                 110         


Pro Thr Asp Pro Lys Phe Pro Gln Gln Trp Tyr Leu Ser Gly Val Thr 
        115                 120                 125             


Gln Arg Asp Leu Asn Val Lys Ala Ala Trp Ala Gln Gly Tyr Thr Gly 
    130                 135                 140                 


His Gly Ile Val Val Ser Ile Leu Asp Asp Gly Ile Glu Lys Asn His 
145                 150                 155                 160 


Pro Asp Leu Ala Gly Asn Tyr Asp Pro Gly Ala Ser Phe Asp Val Asn 
                165                 170                 175     


Asp Gln Asp Pro Asp Pro Gln Pro Arg Tyr Thr Gln Met Asn Asp Asn 
            180                 185                 190         


Arg His Gly Thr Arg Cys Ala Gly Glu Val Ala Ala Val Ala Asn Asn 
        195                 200                 205             


Gly Val Cys Gly Val Gly Val Ala Tyr Asn Ala Arg Ile Gly Gly Val 
    210                 215                 220                 


Arg Met Leu Asp Gly Glu Val Thr Asp Ala Val Glu Ala Arg Ser Leu 
225                 230                 235                 240 


Gly Leu Asn Pro Asn His Ile His Ile Tyr Ser Ala Ser Trp Gly Pro 
                245                 250                 255     


Glu Asp Asp Gly Lys Thr Val Asp Gly Pro Ala Arg Leu Ala Glu Glu 
            260                 265                 270         


Ala Phe Phe Arg Gly Val Ser Gln Gly Arg Gly Gly Leu Gly Ser Ile 
        275                 280                 285             


Phe Val Trp Ala Ser Gly Asn Gly Gly Arg Glu His Asp Ser Cys Asn 
    290                 295                 300                 


Cys Asp Gly Tyr Thr Asn Ser Ile Tyr Thr Leu Ser Ile Ser Ser Ala 
305                 310                 315                 320 


Thr Gln Phe Gly Asn Val Pro Trp Tyr Ser Glu Ala Cys Ser Ser Thr 
                325                 330                 335     


Leu Ala Thr Thr Tyr Ser Ser Gly Asn Gln Asn Glu Lys Gln Ile Val 
            340                 345                 350         


Thr Thr Asp Leu Arg Gln Lys Cys Thr Glu Ser His Thr Gly Thr Ser 
        355                 360                 365             


Ala Ser Ala Pro Leu Ala Ala Gly Ile Ile Ala Leu Thr Leu Glu Ala 
    370                 375                 380                 


Asn Lys Asn Leu Thr Trp Arg Asp Met Gln His Leu Val Val Gln Thr 
385                 390                 395                 400 


Ser Lys Pro Ala His Leu Asn Ala Asn Asp Trp Ala Thr Asn Gly Val 
                405                 410                 415     


Gly Arg Lys Val Ser His Ser Tyr Gly Tyr Gly Leu Leu Asp Ala Gly 
            420                 425                 430         


Ala Met Val Ala Leu Ala Gln Asn Trp Thr Thr Val Ala Pro Gln Arg 
        435                 440                 445             


Lys Cys Ile Ile Asp Ile Leu Thr Glu Pro Lys Asp Ile Gly Lys Arg 
    450                 455                 460                 


Leu Glu Val Arg Lys Thr Val Thr Ala Cys Leu Gly Glu Pro Asn His 
465                 470                 475                 480 


Ile Thr Arg Leu Glu His Ala Gln Ala Arg Leu Thr Leu Ser Tyr Asn 
                485                 490                 495     


Arg Arg Gly Asp Leu Ala Ile His Leu Val Ser Pro Met Gly Thr Arg 
            500                 505                 510         


Ser Thr Leu Leu Ala Ala Arg Pro His Asp Tyr Ser Ala Asp Gly Phe 
        515                 520                 525             


Asn Asp Trp Ala Phe Met Thr Thr His Ser Trp Asp Glu Asp Pro Ser 
    530                 535                 540                 


Gly Glu Trp Val Leu Glu Ile Glu Asn Thr Ser Glu Ala Asn Asn Tyr 
545                 550                 555                 560 


Gly Thr Leu Thr Lys Phe Thr Leu Val Leu Tyr Gly Thr Ala Pro Glu 
                565                 570                 575     


Gly Leu Pro Val Pro Pro Glu Ser Ser Gly Cys Lys Thr Leu Thr Ser 
            580                 585                 590         


Ser Gln Ala Cys Val Val Cys Glu Glu Gly Phe Ser Leu His Gln Lys 
        595                 600                 605             


Ser Cys Val Gln His Cys Pro Pro Gly Phe Ala Pro Gln Val Leu Asp 
    610                 615                 620                 


Thr His Tyr Ser Thr Glu Asn Asp Val Glu Thr Ile Arg Ala Ser Val 
625                 630                 635                 640 


Cys Ala Pro Cys His Ala Ser Cys Ala Thr Cys Gln Gly Pro Ala Leu 
                645                 650                 655     


Thr Asp Cys Leu Ser Cys Pro Ser His Ala Ser Leu Asp Pro Val Glu 
            660                 665                 670         


Gln Thr Cys Ser Arg Gln Ser Gln Ser Ser Arg Glu Ser Pro Pro Gln 
        675                 680                 685             


Gln Gln Pro Pro Arg Leu Pro Pro Glu Val Glu Ala Gly Gln Arg Leu 
    690                 695                 700                 


Arg Ala Gly Leu Leu Pro Ser His Leu Pro Glu Val Val Ala Gly Leu 
705                 710                 715                 720 


Ser Cys Ala Phe Ile Val Leu Val Phe Val Thr Val Phe Leu Val Leu 
                725                 730                 735     


Gln Leu Arg Ser Gly Phe Ser Phe Arg Gly Val Lys Val Tyr Thr Met 
            740                 745                 750         


Asp Arg Gly Leu Ile Ser Tyr Lys Gly Leu Pro Pro Glu Ala Trp Gln 
        755                 760                 765             


Glu Glu Cys Pro Ser Asp Ser Glu Glu Asp Glu Gly Arg Gly Glu Arg 
    770                 775                 780                 


Thr Ala Phe Ile Lys Asp Gln Ser Ala Leu 
785                 790                 


<210>  36
<211>  3830
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2

<400>  36
ccaccatgga tgctatgaaa agaggactgt gctgcgtgct gctgctgtgc ggagccgtgt       60

ttgtgagccc cgtgaacctg actaccagaa ctcagctgcc ccctgcctat accaatagct      120

tcacaagggg cgtgtactat cctgacaagg tgtttcgcag ctccgtgctg cactccacac      180

aggatctgtt tctgccattc ttttctaacg tgacctggtt ccacgccatc cacgtgtccg      240

gcaccaatgg cacaaagagg ttcgacaatc cagtgctgcc ctttaacgat ggcgtgtact      300

tcgcctccac cgagaagtct aacatcatcc gcggctggat ctttggcacc acactggaca      360

gcaagacaca gtccctgctg atcgtgaaca atgccaccaa cgtggtcatc aaggtgtgcg      420

agttccagtt ttgtaatgat cccttcctgg gcgtgtacta tcacaagaac aataagtctt      480

ggatggagag cgagtttagg gtgtattcta gcgccaacaa ttgcacattt gagtacgtga      540

gccagccttt cctgatggac ctggagggca agcagggcaa tttcaagaac ctgcgggagt      600

tcgtgtttaa gaatatcgat ggctacttca agatctactc taagcacacc cccatcaacc      660

tggtgcggga cctgcctcag ggcttcagcg ccctggagcc tctggtggat ctgccaatcg      720

gcatcaacat caccaggttt cagacactgc tggccctgca ccgctcctac ctgacaccag      780

gcgactcctc tagcggatgg accgcaggag cagcagccta ctatgtgggc tatctgcagc      840

ctcggacctt cctgctgaag tacaacgaga atggcaccat cacagacgca gtggattgcg      900

cactggaccc cctgtctgag acaaagtgta cactgaagag ctttaccgtg gagaagggca      960

tctatcagac aagcaatttc cgggtgcagc caaccgagtc catcgtgaga tttccaaata     1020

tcacaaacct gtgccccttt ggcgaggtgt tcaacgccac cagattcgcc agcgtgtacg     1080

cctggaatag gaagcgcatc tctaactgcg tggccgacta tagcgtgctg tacaactccg     1140

cctctttcag cacctttaag tgctatggcg tgtcccccac aaagctgaat gacctgtgct     1200

ttaccaacgt gtacgccgat tctttcgtga tcaggggcga cgaggtgcgc cagatcgcac     1260

ctggacagac aggcaagatc gccgactaca attataagct gccagacgat ttcaccggct     1320

gcgtgatcgc ctggaactct aacaatctgg atagcaaagt gggcggcaac tacaattatc     1380

tgtaccggct gtttagaaag tccaatctga agcccttcga gcgggacatc tccacagaga     1440

tctaccaggc cggctctacc ccttgcaatg gcgtggaggg ctttaactgt tatttcccac     1500

tgcagtctta cggcttccag cccaccaacg gcgtgggcta tcagccttac agagtggtgg     1560

tgctgagctt tgagctgctg cacgcaccag caacagtgtg cggacctaag aagtccacca     1620

atctggtgaa gaacaagtgc gtgaacttca acttcaacgg cctgaccggc acaggcgtgc     1680

tgaccgagag caacaagaag ttcctgccat ttcagcagtt cggccgggac atcgcagata     1740

ccacagacgc cgtgcgggac ccccagaccc tggagatcct ggatatcaca ccctgctcct     1800

tcggcggcgt gtctgtgatc acacccggca ccaatacatc caaccaggtg gccgtgctgt     1860

atcaggacgt gaattgtacc gaggtgcctg tggccatcca cgccgatcag ctgaccccaa     1920

catggagggt gtacagcacc ggctccaacg tgttccagac acgcgccgga tgcctgatcg     1980

gagcagagca cgtgaacaat tcttatgagt gcgacatccc aatcggcgcc ggcatctgtg     2040

cctcctacca gacccagaca aactctccac ggagaaggcg ccggagaagc gtggcatccc     2100

agtctatcat cgcctatacc atgtccctgg gcgccgagaa cagcgtggcc tactctaaca     2160

atagcatcgc catcccaacc aacttcacaa tcagcgtgac cacagagatc ctgcccgtga     2220

gcatgaccaa gacatccgtg gactgcacaa tgtatatctg tggcgattcc accgagtgct     2280

ctaacctgct gctgcagtac ggctcctttt gtacccagct gaatagggcc ctgacaggca     2340

tcgcagtgga gcaggataag aacacacagg aggtgttcgc ccaggtgaag cagatctaca     2400

agaccccacc catcaaggac tttggcggct tcaacttcag ccagatcctg cccgatcctt     2460

ccaagccttc taagaggagc tttatcgagg acctgctgtt caacaaggtg accctggccg     2520

atgccggctt catcaagcag tatggcgatt gcctgggcga catcgcagcc cgcgacctga     2580

tctgtgccca gaagtttaat ggcctgaccg tgctgcctcc actgctgaca gatgagatga     2640

tcgcccagta cacatctgcc ctgctggcag gaaccatcac aagcggatgg accttcggcg     2700

caggagccgc cctgcagatc ccctttgcca tgcagatggc ctatcgcttc aacggcatcg     2760

gcgtgaccca gaatgtgctg tacgagaacc agaagctgat cgccaatcag tttaacagcg     2820

ccatcggcaa gatccaggac agcctgtcct ctacagcctc cgccctgggc aagctgcagg     2880

atgtggtgaa tcagaacgcc caggccctga ataccctggt gaagcagctg agctccaact     2940

tcggcgccat ctctagcgtg ctgaatgata tcctgtccag gctggacaag gtggaggccg     3000

aggtgcagat cgaccggctg atcacaggca gactgcagtc tctgcagacc tacgtgacac     3060

agcagctgat cagggcagca gagatcaggg caagcgccaa tctggcagca accaagatga     3120

gcgagtgcgt gctgggacag tccaagaggg tggacttttg tggcaagggc tatcacctga     3180

tgagcttccc acagtccgcc ccacacggag tggtgtttct gcacgtgacc tacgtgcccg     3240

cccaggagaa gaacttcacc acagcccctg ccatctgcca cgatggcaag gcccactttc     3300

cacgggaggg cgtgttcgtg tccaacggca cccactggtt tgtgacacag agaaatttct     3360

acgagcccca gatcatcacc acagacaata ccttcgtgag cggcaactgt gacgtggtca     3420

tcggcatcgt gaacaatacc gtgtatgatc ctctgcagcc agagctggac agctttaagg     3480

aggagctgga taagtacttc aagaatcaca cctcccccga cgtggatctg ggcgacatct     3540

ctggcatcaa tgccagcgtg gtgaacatcc agaaggagat cgacagactg aacgaggtgg     3600

ccaagaatct gaacgagagc ctgatcgatc tgcaggagct gggcaagtat gagcagtaca     3660

tcaagtggcc tggatccggc tccggctctc caggctccgg atatatccca gaggcacctc     3720

gggacggaca ggcctacgtg agaaaggatg gcgagtgggt gctgctgagc accttcctgg     3780

gagggagcgg cggcagcggg ggcagcgggc accatcacca ccaccactga                3830


<210>  37
<211>  1274
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2

<400>  37

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1               5                   10                  15      


Ala Val Phe Val Ser Pro Val Asn Leu Thr Thr Arg Thr Gln Leu Pro 
            20                  25                  30          


Pro Ala Tyr Thr Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys 
        35                  40                  45              


Val Phe Arg Ser Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro 
    50                  55                  60                  


Phe Phe Ser Asn Val Thr Trp Phe His Ala Ile His Val Ser Gly Thr 
65                  70                  75                  80  


Asn Gly Thr Lys Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly 
                85                  90                  95      


Val Tyr Phe Ala Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile 
            100                 105                 110         


Phe Gly Thr Thr Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn 
        115                 120                 125             


Asn Ala Thr Asn Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn 
    130                 135                 140                 


Asp Pro Phe Leu Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met 
145                 150                 155                 160 


Glu Ser Glu Phe Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu 
                165                 170                 175     


Tyr Val Ser Gln Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn 
            180                 185                 190         


Phe Lys Asn Leu Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe 
        195                 200                 205             


Lys Ile Tyr Ser Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro 
    210                 215                 220                 


Gln Gly Phe Ser Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile 
225                 230                 235                 240 


Asn Ile Thr Arg Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu 
                245                 250                 255     


Thr Pro Gly Asp Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr 
            260                 265                 270         


Tyr Val Gly Tyr Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu 
        275                 280                 285             


Asn Gly Thr Ile Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser 
    290                 295                 300                 


Glu Thr Lys Cys Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr 
305                 310                 315                 320 


Gln Thr Ser Asn Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe 
                325                 330                 335     


Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr 
            340                 345                 350         


Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys 
        355                 360                 365             


Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe 
    370                 375                 380                 


Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr 
385                 390                 395                 400 


Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln 
                405                 410                 415     


Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu 
            420                 425                 430         


Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu 
        435                 440                 445             


Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg 
    450                 455                 460                 


Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr 
465                 470                 475                 480 


Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr 
                485                 490                 495     


Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr 
            500                 505                 510         


Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro 
        515                 520                 525             


Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys 
    530                 535                 540                 


Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr 
545                 550                 555                 560 


Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile 
                565                 570                 575     


Ala Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu 
            580                 585                 590         


Asp Ile Thr Pro Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly 
        595                 600                 605             


Thr Asn Thr Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys 
    610                 615                 620                 


Thr Glu Val Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp 
625                 630                 635                 640 


Arg Val Tyr Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys 
                645                 650                 655     


Leu Ile Gly Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro 
            660                 665                 670         


Ile Gly Ala Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro 
        675                 680                 685             


Arg Arg Arg Arg Arg Arg Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr 
    690                 695                 700                 


Thr Met Ser Leu Gly Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser 
705                 710                 715                 720 


Ile Ala Ile Pro Thr Asn Phe Thr Ile Ser Val Thr Thr Glu Ile Leu 
                725                 730                 735     


Pro Val Ser Met Thr Lys Thr Ser Val Asp Cys Thr Met Tyr Ile Cys 
            740                 745                 750         


Gly Asp Ser Thr Glu Cys Ser Asn Leu Leu Leu Gln Tyr Gly Ser Phe 
        755                 760                 765             


Cys Thr Gln Leu Asn Arg Ala Leu Thr Gly Ile Ala Val Glu Gln Asp 
    770                 775                 780                 


Lys Asn Thr Gln Glu Val Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr 
785                 790                 795                 800 


Pro Pro Ile Lys Asp Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro 
                805                 810                 815     


Asp Pro Ser Lys Pro Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe 
            820                 825                 830         


Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp 
        835                 840                 845             


Cys Leu Gly Asp Ile Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe 
    850                 855                 860                 


Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr Asp Glu Met Ile Ala 
865                 870                 875                 880 


Gln Tyr Thr Ser Ala Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr 
                885                 890                 895     


Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala 
            900                 905                 910         


Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu Asn 
        915                 920                 925             


Gln Lys Leu Ile Ala Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln 
    930                 935                 940                 


Asp Ser Leu Ser Ser Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp Val 
945                 950                 955                 960 


Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser 
                965                 970                 975     


Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser Arg 
            980                 985                 990         


Leu Asp Lys Val Glu Ala Glu Val  Gln Ile Asp Arg Leu  Ile Thr Gly 
        995                 1000                 1005             


Arg Leu  Gln Ser Leu Gln Thr  Tyr Val Thr Gln Gln  Leu Ile Arg 
    1010                 1015                 1020             


Ala Ala  Glu Ile Arg Ala Ser  Ala Asn Leu Ala Ala  Thr Lys Met 
    1025                 1030                 1035             


Ser Glu  Cys Val Leu Gly Gln  Ser Lys Arg Val Asp  Phe Cys Gly 
    1040                 1045                 1050             


Lys Gly  Tyr His Leu Met Ser  Phe Pro Gln Ser Ala  Pro His Gly 
    1055                 1060                 1065             


Val Val  Phe Leu His Val Thr  Tyr Val Pro Ala Gln  Glu Lys Asn 
    1070                 1075                 1080             


Phe Thr  Thr Ala Pro Ala Ile  Cys His Asp Gly Lys  Ala His Phe 
    1085                 1090                 1095             


Pro Arg  Glu Gly Val Phe Val  Ser Asn Gly Thr His  Trp Phe Val 
    1100                 1105                 1110             


Thr Gln  Arg Asn Phe Tyr Glu  Pro Gln Ile Ile Thr  Thr Asp Asn 
    1115                 1120                 1125             


Thr Phe  Val Ser Gly Asn Cys  Asp Val Val Ile Gly  Ile Val Asn 
    1130                 1135                 1140             


Asn Thr  Val Tyr Asp Pro Leu  Gln Pro Glu Leu Asp  Ser Phe Lys 
    1145                 1150                 1155             


Glu Glu  Leu Asp Lys Tyr Phe  Lys Asn His Thr Ser  Pro Asp Val 
    1160                 1165                 1170             


Asp Leu  Gly Asp Ile Ser Gly  Ile Asn Ala Ser Val  Val Asn Ile 
    1175                 1180                 1185             


Gln Lys  Glu Ile Asp Arg Leu  Asn Glu Val Ala Lys  Asn Leu Asn 
    1190                 1195                 1200             


Glu Ser  Leu Ile Asp Leu Gln  Glu Leu Gly Lys Tyr  Glu Gln Tyr 
    1205                 1210                 1215             


Ile Lys  Trp Pro Gly Ser Gly  Ser Gly Ser Pro Gly  Ser Gly Tyr 
    1220                 1225                 1230             


Ile Pro  Glu Ala Pro Arg Asp  Gly Gln Ala Tyr Val  Arg Lys Asp 
    1235                 1240                 1245             


Gly Glu  Trp Val Leu Leu Ser  Thr Phe Leu Gly Gly  Ser Gly Gly 
    1250                 1255                 1260             


Ser Gly  Gly Ser Gly His His  His His His His 
    1265                 1270                 


<210>  38
<211>  3806
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2

<400>  38
ccaccatgtt tgtctttctg gtcctgctgc ccctggtctc ttcacagtgc gtcaatctga       60

ctacacgaac tcagctgcct cctgcctata ctaattcctt cacccggggc gtgtactatc      120

cagacaaggt gtttagaagc tccgtgctgc actccacaca ggatctgttt ctgcccttct      180

tttctaacgt gacctggttc cacgccatcc acgtgagcgg caccaatggc acaaagcggt      240

tcgacaatcc tgtgctgcca tttaacgatg gcgtgtactt cgcctccacc gagaagtcta      300

acatcatcag aggctggatc tttggcacca cactggacag caagacacag tccctgctga      360

tcgtgaacaa tgccaccaac gtggtgatca aggtgtgcga gttccagttt tgtaatgatc      420

ctttcctggg cgtgtactat cacaagaaca ataagtcttg gatggagagc gagtttagag      480

tgtattctag cgccaacaat tgcacatttg agtacgtgtc ccagccattc ctgatggacc      540

tggagggcaa gcagggcaat ttcaagaacc tgagggagtt cgtgtttaag aatatcgatg      600

gctacttcaa gatctacagc aagcacaccc caatcaacct ggtgcgcgac ctgccacagg      660

gattctccgc cctggagcca ctggtggatc tgcctatcgg catcaacatc acccggtttc      720

agacactgct ggccctgcac agaagctacc tgacacctgg cgactcctct agcggatgga      780

ccgcaggagc agcagcctac tatgtgggct atctgcagcc acggaccttc ctgctgaagt      840

acaacgagaa tggcaccatc acagacgcag tggattgcgc cctggacccc ctgtctgaga      900

caaagtgtac actgaagagc tttaccgtgg agaagggcat ctatcagaca agcaatttca      960

gggtgcagcc caccgagtcc atcgtgcgct ttcccaatat cacaaacctg tgcccttttg     1020

gcgaggtgtt caacgccacc agattcgcca gcgtgtacgc ctggaatcgg aagagaatct     1080

ccaactgcgt ggccgactat tctgtgctgt acaacagcgc ctccttctct acctttaagt     1140

gctatggcgt gtctcctaca aagctgaatg atctgtgctt taccaacgtg tacgccgata     1200

gcttcgtgat caggggagac gaagtgagac agatcgcacc aggacagaca ggcaagatcg     1260

cagactacaa ttataagctg cctgacgatt tcaccggctg cgtgatcgcc tggaactcta     1320

acaatctgga tagcaaagtg ggcggcaact acaattatct gtacaggctg tttcgcaagt     1380

ccaatctgaa gcctttcgag agggacatct ccacagagat ctaccaggcc ggctctaccc     1440

catgcaatgg cgtggagggc tttaactgtt atttccctct gcagtcttac ggcttccagc     1500

caacaaacgg cgtgggctat cagccctacc gcgtggtggt gctgtccttt gagctgctgc     1560

acgcacctgc aacagtgtgc ggaccaaaga agtctaccaa tctggtgaag aacaagtgcg     1620

tgaacttcaa cttcaacggc ctgaccggca caggcgtgct gaccgagtcc aacaagaagt     1680

tcctgccctt tcagcagttc ggcagggaca tcgcagatac cacagacgcc gtgcgcgacc     1740

ctcagaccct ggagatcctg gatatcacac catgctcctt cggcggcgtg tctgtgatca     1800

cacctggcac caatacaagc aaccaggtgg ccgtgctgta tcaggacgtg aattgtaccg     1860

aggtgcccgt ggcaatccac gcagatcagc tgacccctac atggcgggtg tactctaccg     1920

gcagcaacgt gttccagaca agagccggat gcctgatcgg agcagagcac gtgaacaata     1980

gctatgagtg cgacatcccc atcggcgccg gcatctgtgc ctcctaccag acccagacaa     2040

actcccctgg atccgcctcc tctgtggcaa gccagtccat catcgcctat accatgtccc     2100

tgggcgccga gaacagcgtg gcctacagca acaattccat cgccatccct accaacttca     2160

caatctccgt gaccacagag atcctgccag tgagcatgac caagacatcc gtggactgca     2220

caatgtatat ctgtggcgat tccaccgagt gctctaacct gctgctgcag tacggcagct     2280

tttgtaccca gctgaatagg gccctgacag gcatcgcagt ggagcaggat aagaacacac     2340

aggaggtgtt cgcccaggtg aagcagatct acaagacccc ccctatcaag gactttggcg     2400

gcttcaactt cagccagatc ctgcccgatc cttctaagcc aagcaagcgg tcccccatcg     2460

aggacctgct gtttaacaag gtgaccctgg ccgatgccgg cttcatcaag cagtatggag     2520

attgcctggg agacatcgca gcccgggacc tgatctgtgc ccagaagttc aatggcctga     2580

ccgtgctgcc acccctgctg acagatgaga tgatcgccca gtacacatct gccctgctgg     2640

caggaaccat cacaagcgga tggacctttg gcgcaggacc agccctgcag atcccatttc     2700

ccatgcagat ggcctatcgc ttcaacggca tcggcgtgac ccagaatgtg ctgtacgaga     2760

accagaagct gatcgccaat cagttcaact ccgccatcgg caagatccag gacagcctga     2820

gctccacacc atccgccctg ggaaagctgc aggatgtggt gaatcagaac gcccaggccc     2880

tgaataccct ggtgaagcag ctgtctagca actttggcgc catctcctct gtgctgaatg     2940

atatcctgag caggctggac cctccagagg cagaggtgca gatcgacagg ctgatcacag     3000

gccgcctgca gagcctgcag acctatgtga cacagcagct gatcagggca gcagagatca     3060

gagcatccgc caatctggcc gccaccaaga tgagcgagtg cgtgctggga cagtccaaga     3120

gggtggactt ttgtggcaag ggctatcacc tgatgagctt ccctcagtcc gccccacacg     3180

gagtggtgtt tctgcacgtg acctacgtgc cagcccagga gaagaacttc accacagcac     3240

cagcaatctg ccacgatgga aaggcacact ttcccaggga gggcgtgttc gtgtctaacg     3300

gcacccactg gtttgtgaca cagcgcaatt tctacgagcc tcagatcatc accacagaca     3360

atacattcgt gagcggcaac tgtgatgtgg tgatcggcat cgtgaacaat accgtgtatg     3420

atcccctgca gcctgagctg gactctttta aggaggagct ggataagtac ttcaagaatc     3480

acaccagccc cgacgtggat ctgggcgaca tctctggcat caatgccagc gtggtgaaca     3540

tccagaagga gatcgaccgg ctgaacgagg tggccaagaa tctgaacgag tccctgatcg     3600

atctgcagga gctgggcaag tatgagcagg gctctggata catcccagag gcacccaggg     3660

acggacaggc ctacgtgcgc aaggatggcg agtgggtgct gctgtctacc tttctgggca     3720

gaagcctgga ggtgctgttc cagggaccag gacaccatca tcaccaccac caccactctg     3780

cctggagcca cccccagttc gagaag                                          3806


<210>  39
<211>  1288
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SARS-CoV-2

<400>  39

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
            340                 345                 350         


Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
        355                 360                 365             


Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
    370                 375                 380                 


Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
385                 390                 395                 400 


Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                405                 410                 415     


Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
            420                 425                 430         


Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
        435                 440                 445             


Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
    450                 455                 460                 


Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
465                 470                 475                 480 


Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                485                 490                 495     


Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
            500                 505                 510         


Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
        515                 520                 525             


Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
    530                 535                 540                 


Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
545                 550                 555                 560 


Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                565                 570                 575     


Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
            580                 585                 590         


Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
        595                 600                 605             


Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
    610                 615                 620                 


His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
625                 630                 635                 640 


Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                645                 650                 655     


Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
            660                 665                 670         


Ser Tyr Gln Thr Gln Thr Asn Ser Pro Gly Ser Ala Ser Ser Val Ala 
        675                 680                 685             


Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser 
    690                 695                 700                 


Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile 
705                 710                 715                 720 


Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val 
                725                 730                 735     


Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu 
            740                 745                 750         


Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr 
        755                 760                 765             


Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln 
    770                 775                 780                 


Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe 
785                 790                 795                 800 


Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser 
                805                 810                 815     


Pro Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly 
            820                 825                 830         


Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp 
        835                 840                 845             


Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu 
    850                 855                 860                 


Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly 
865                 870                 875                 880 


Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Pro Ala Leu Gln Ile 
                885                 890                 895     


Pro Phe Pro Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr 
            900                 905                 910         


Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn 
        915                 920                 925             


Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Pro Ser Ala 
    930                 935                 940                 


Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn 
945                 950                 955                 960 


Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val 
                965                 970                 975     


Leu Asn Asp Ile Leu Ser Arg Leu Asp Pro Pro Glu Ala Glu Val Gln 
            980                 985                 990         


Ile Asp Arg Leu Ile Thr Gly Arg  Leu Gln Ser Leu Gln  Thr Tyr Val 
        995                 1000                 1005             


Thr Gln  Gln Leu Ile Arg Ala  Ala Glu Ile Arg Ala  Ser Ala Asn 
    1010                 1015                 1020             


Leu Ala  Ala Thr Lys Met Ser  Glu Cys Val Leu Gly  Gln Ser Lys 
    1025                 1030                 1035             


Arg Val  Asp Phe Cys Gly Lys  Gly Tyr His Leu Met  Ser Phe Pro 
    1040                 1045                 1050             


Gln Ser  Ala Pro His Gly Val  Val Phe Leu His Val  Thr Tyr Val 
    1055                 1060                 1065             


Pro Ala  Gln Glu Lys Asn Phe  Thr Thr Ala Pro Ala  Ile Cys His 
    1070                 1075                 1080             


Asp Gly  Lys Ala His Phe Pro  Arg Glu Gly Val Phe  Val Ser Asn 
    1085                 1090                 1095             


Gly Thr  His Trp Phe Val Thr  Gln Arg Asn Phe Tyr  Glu Pro Gln 
    1100                 1105                 1110             


Ile Ile  Thr Thr Asp Asn Thr  Phe Val Ser Gly Asn  Cys Asp Val 
    1115                 1120                 1125             


Val Ile  Gly Ile Val Asn Asn  Thr Val Tyr Asp Pro  Leu Gln Pro 
    1130                 1135                 1140             


Glu Leu  Asp Ser Phe Lys Glu  Glu Leu Asp Lys Tyr  Phe Lys Asn 
    1145                 1150                 1155             


His Thr  Ser Pro Asp Val Asp  Leu Gly Asp Ile Ser  Gly Ile Asn 
    1160                 1165                 1170             


Ala Ser  Val Val Asn Ile Gln  Lys Glu Ile Asp Arg  Leu Asn Glu 
    1175                 1180                 1185             


Val Ala  Lys Asn Leu Asn Glu  Ser Leu Ile Asp Leu  Gln Glu Leu 
    1190                 1195                 1200             


Gly Lys  Tyr Glu Gln Gly Ser  Gly Tyr Ile Pro Glu  Ala Pro Arg 
    1205                 1210                 1215             


Asp Gly  Gln Ala Tyr Val Arg  Lys Asp Gly Glu Trp  Val Leu Leu 
    1220                 1225                 1230             


Ser Thr  Phe Leu Gly Arg Ser  Leu Glu Val Leu Phe  Gln Gly Pro 
    1235                 1240                 1245             


Gly His  His His His His His  His His Ser Ala Trp  Ser His Pro 
    1250                 1255                 1260             


Gln Phe  Glu Lys Gly Gly Gly  Ser Gly Gly Gly Gly  Ser Gly Gly 
    1265                 1270                 1275             


Ser Ala  Trp Ser His Pro Gln  Phe Glu Lys 
    1280                 1285             


<210>  40
<211>  1496
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ebola virus

<400>  40
ccaccatgga tgccatgaag aggggcctgt gctgcgtcct gctgctgtgc ggggccgtgt       60

tcgtgtcccc ctccattccc ctcggcgtga tccacaacag caccctgcag gtgagcgatg      120

tggataagct ggtgtgccgg gacaagttga gcagcactaa ccagctgcgg agcgtcggcc      180

tgaacctgga ggggaacggc gtggccacag atgtgccaag cgccacaaag aggtgggggt      240

tcaggagcgg cgtgccccct aaggtggtga actacgaggc cggggagtgg gccgagaact      300

gctacaacct ggagatcaag aagcccgacg ggagcgagtg cctgccagcc gcccccgacg      360

gcattagggg ctttcctagg tgcaggtacg tgcacaaggt gagcgggacc ggcccttgcg      420

ccggggactt cgcctttcac aaggagggcg ccttcttcct gtacgataga ctggccagca      480

ccgtgatcta ccggggcaca accttcgccg agggggtggt cgcctttctc atcctgcccc      540

aggccaagaa ggatttcttc tccagccacc ccctgcggga gccagtgaac gccaccgagg      600

accccagctc cggctactac tccaccacaa tccggtacca ggctactggc ttcgggacca      660

acgagactga gtacctgttc gaggtcgaca acctgacata cgtccagctc gaaagccggt      720

tcacccccca gttcctgctg cagctgaacg agactatcta caccagcggg aagaggtcta      780

acaccacagg gaagctgatc tggaaggtga accccgagat tgatacaacc atcggggagt      840

gggccttttg ggagaccaag aagaacctga cacggaagat tcggagcgag gagctgagct      900

tcaccgtggt ctccaacggc gccaagaaca ttagcggcaa gctggggctc atcaccaaca      960

caatcgctgg cgtcgccggc ctgatcaccg gggggaggcg gacacggcgg gaggccattg     1020

tgaacgccca gccaaagtgc aaccctaacc tccactactg gacaacacag gacgagggcg     1080

ccgccatcgg gctggcctgg atcccttact tcggccccgc cgccgagggc atctacattg     1140

agggcctgat gcacaaccag gacggcctga tctgcgggct gaggcagctg gccaacgaga     1200

caacccaggc cctgcagctg tttctgcggg ccacacccga gctgcggaca ttttccattc     1260

tgaacaggtt cgccatcgat ttcctgctgc agcggtgggg cgggacctgc cacattctcg     1320

gccctgattg ctgcattgag ccccacgact ggaccaagaa catcacagat aagatcgacc     1380

agatcatcca cgatttcgtg gataagacac tgcccgacca gggggataac gacaactggt     1440

ggacaggcgg gagcggcggc agcggcggca gcggccacca ccatcaccac cactga         1496


<210>  41
<211>  496
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Ebola virus

<400>  41

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1               5                   10                  15      


Ala Val Phe Val Ser Pro Ser Ile Pro Leu Gly Val Ile His Asn Ser 
            20                  25                  30          


Thr Leu Gln Val Ser Asp Val Asp Lys Leu Val Cys Arg Asp Lys Leu 
        35                  40                  45              


Ser Ser Thr Asn Gln Leu Arg Ser Val Gly Leu Asn Leu Glu Gly Asn 
    50                  55                  60                  


Gly Val Ala Thr Asp Val Pro Ser Ala Thr Lys Arg Trp Gly Phe Arg 
65                  70                  75                  80  


Ser Gly Val Pro Pro Lys Val Val Asn Tyr Glu Ala Gly Glu Trp Ala 
                85                  90                  95      


Glu Asn Cys Tyr Asn Leu Glu Ile Lys Lys Pro Asp Gly Ser Glu Cys 
            100                 105                 110         


Leu Pro Ala Ala Pro Asp Gly Ile Arg Gly Phe Pro Arg Cys Arg Tyr 
        115                 120                 125             


Val His Lys Val Ser Gly Thr Gly Pro Cys Ala Gly Asp Phe Ala Phe 
    130                 135                 140                 


His Lys Glu Gly Ala Phe Phe Leu Tyr Asp Arg Leu Ala Ser Thr Val 
145                 150                 155                 160 


Ile Tyr Arg Gly Thr Thr Phe Ala Glu Gly Val Val Ala Phe Leu Ile 
                165                 170                 175     


Leu Pro Gln Ala Lys Lys Asp Phe Phe Ser Ser His Pro Leu Arg Glu 
            180                 185                 190         


Pro Val Asn Ala Thr Glu Asp Pro Ser Ser Gly Tyr Tyr Ser Thr Thr 
        195                 200                 205             


Ile Arg Tyr Gln Ala Thr Gly Phe Gly Thr Asn Glu Thr Glu Tyr Leu 
    210                 215                 220                 


Phe Glu Val Asp Asn Leu Thr Tyr Val Gln Leu Glu Ser Arg Phe Thr 
225                 230                 235                 240 


Pro Gln Phe Leu Leu Gln Leu Asn Glu Thr Ile Tyr Thr Ser Gly Lys 
                245                 250                 255     


Arg Ser Asn Thr Thr Gly Lys Leu Ile Trp Lys Val Asn Pro Glu Ile 
            260                 265                 270         


Asp Thr Thr Ile Gly Glu Trp Ala Phe Trp Glu Thr Lys Lys Asn Leu 
        275                 280                 285             


Thr Arg Lys Ile Arg Ser Glu Glu Leu Ser Phe Thr Val Val Ser Asn 
    290                 295                 300                 


Gly Ala Lys Asn Ile Ser Gly Lys Leu Gly Leu Ile Thr Asn Thr Ile 
305                 310                 315                 320 


Ala Gly Val Ala Gly Leu Ile Thr Gly Gly Arg Arg Thr Arg Arg Glu 
                325                 330                 335     


Ala Ile Val Asn Ala Gln Pro Lys Cys Asn Pro Asn Leu His Tyr Trp 
            340                 345                 350         


Thr Thr Gln Asp Glu Gly Ala Ala Ile Gly Leu Ala Trp Ile Pro Tyr 
        355                 360                 365             


Phe Gly Pro Ala Ala Glu Gly Ile Tyr Ile Glu Gly Leu Met His Asn 
    370                 375                 380                 


Gln Asp Gly Leu Ile Cys Gly Leu Arg Gln Leu Ala Asn Glu Thr Thr 
385                 390                 395                 400 


Gln Ala Leu Gln Leu Phe Leu Arg Ala Thr Pro Glu Leu Arg Thr Phe 
                405                 410                 415     


Ser Ile Leu Asn Arg Phe Ala Ile Asp Phe Leu Leu Gln Arg Trp Gly 
            420                 425                 430         


Gly Thr Cys His Ile Leu Gly Pro Asp Cys Cys Ile Glu Pro His Asp 
        435                 440                 445             


Trp Thr Lys Asn Ile Thr Asp Lys Ile Asp Gln Ile Ile His Asp Phe 
    450                 455                 460                 


Val Asp Lys Thr Leu Pro Asp Gln Gly Asp Asn Asp Asn Trp Trp Thr 
465                 470                 475                 480 


Gly Gly Ser Gly Gly Ser Gly Gly Ser Gly His His His His His His 
                485                 490                 495     


<210>  42
<211>  1610
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Nipah virus

<400>  42
ccaccatggt ggtgatcctg gacaagcggt gctactgtaa cctgctgatc ctgatcctga       60

tgatctccga gtgctctgtg ggcatcctgc actacgagaa gctgtccaag atcggcctgg      120

tgaagggcgt gacccggaag tataagatca agtctaatcc tctgacaaag gatatcgtga      180

tcaagatgat cccaaacgtg agcaatatgt cccagtgtac cggctccgtg atggagaact      240

acaagacccg cctgaatggc atcctgacac caatcaaggg cgccctggag atctataaga      300

acaatacaca cgactgcgtg ggcgatgtgc ggctggcagg cgtgtgcatg gcaggagtgg      360

caatcggaat cgcaaccgca gcacagatca cagcaggagt ggccctgtat gaggccatga      420

agaacgccga caacatcaat aagctgaaga gctccatcga gagcaccaat gaggccgtgg      480

tgaagctgca ggagaccgcc gagaagacag tgtacgtgtt cacagccctg caggactata      540

tcaacaccaa tctggtgccc acaatcgata agatcccttg caagcagacc gagctgagcc      600

tggacctggc cctgtccaag tacctgtctg atctgctgtt cgtgtttggc cctaacctgc      660

aggatccagt gtccaattct atgacaatcc aggccatctc ccaggccttc ggcggcaact      720

acgagaccct gctgagaaca ctgggctatg ccaccgagga ctttgacgat ctgctggaga      780

gcgattccat cacaggccag atcatctatg tggacctgtc tagctactat atcatcgtga      840

gggtgtactt ccccatcctg accgagatcc agcaggccta tatccaggag ctgctgcccg      900

tgagcttcaa caatgataac agcgagtgga tctccatcgt gccaaacttc atcctggtgc      960

ggaacaccct gatctctaat atcgagatcg gcttttgcct gatcacaaag agaagcgtga     1020

tctgtaacca ggactacgcc acccccatga caaacaatat gagggagtgc ctgaccggca     1080

gcacagagaa gtgtccaagg gagctggtgg tgtcctctca cgtgccaagg ttcgcactga     1140

gcaacggcgt gctgtttgcc aattgcatct ccgtgacctg ccagtgtcag accacaggca     1200

gagccatctc tcagagcggc gagcagaccc tgctgatgat cgataacacc acatgtccca     1260

cagccgtgct gggcaatgtg atcatctccc tgggcaagta cctgggctcc gtgaactata     1320

attctgaggg aatcgcaatc ggaccccccg tgttcaccga caaggtggat atcagctccc     1380

agatctctag catgaaccag tctctgcagc agagcaagga ctacatcaag gaggcccagc     1440

ggctgctgga tacagtgaat ccttctctgg gcagcggcta catcccagag gcaccccggg     1500

acggacaggc ctatgtgaga aaggatggcg agtgggtgct gctgagcacc tttctgggat     1560

ccggaggatc tggaggaagc ggacaccatc atcaccacca ccaccactga                1610


<210>  43
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Nipah virus

<400>  43

Met Val Val Ile Leu Asp Lys Arg Cys Tyr Cys Asn Leu Leu Ile Leu 
1               5                   10                  15      


Ile Leu Met Ile Ser Glu Cys Ser Val Gly Ile Leu His Tyr Glu Lys 
            20                  25                  30          


Leu Ser Lys Ile Gly Leu Val Lys Gly Val Thr Arg Lys Tyr Lys Ile 
        35                  40                  45              


Lys Ser Asn Pro Leu Thr Lys Asp Ile Val Ile Lys Met Ile Pro Asn 
    50                  55                  60                  


Val Ser Asn Met Ser Gln Cys Thr Gly Ser Val Met Glu Asn Tyr Lys 
65                  70                  75                  80  


Thr Arg Leu Asn Gly Ile Leu Thr Pro Ile Lys Gly Ala Leu Glu Ile 
                85                  90                  95      


Tyr Lys Asn Asn Thr His Asp Cys Val Gly Asp Val Arg Leu Ala Gly 
            100                 105                 110         


Val Cys Met Ala Gly Val Ala Ile Gly Ile Ala Thr Ala Ala Gln Ile 
        115                 120                 125             


Thr Ala Gly Val Ala Leu Tyr Glu Ala Met Lys Asn Ala Asp Asn Ile 
    130                 135                 140                 


Asn Lys Leu Lys Ser Ser Ile Glu Ser Thr Asn Glu Ala Val Val Lys 
145                 150                 155                 160 


Leu Gln Glu Thr Ala Glu Lys Thr Val Tyr Val Phe Thr Ala Leu Gln 
                165                 170                 175     


Asp Tyr Ile Asn Thr Asn Leu Val Pro Thr Ile Asp Lys Ile Pro Cys 
            180                 185                 190         


Lys Gln Thr Glu Leu Ser Leu Asp Leu Ala Leu Ser Lys Tyr Leu Ser 
        195                 200                 205             


Asp Leu Leu Phe Val Phe Gly Pro Asn Leu Gln Asp Pro Val Ser Asn 
    210                 215                 220                 


Ser Met Thr Ile Gln Ala Ile Ser Gln Ala Phe Gly Gly Asn Tyr Glu 
225                 230                 235                 240 


Thr Leu Leu Arg Thr Leu Gly Tyr Ala Thr Glu Asp Phe Asp Asp Leu 
                245                 250                 255     


Leu Glu Ser Asp Ser Ile Thr Gly Gln Ile Ile Tyr Val Asp Leu Ser 
            260                 265                 270         


Ser Tyr Tyr Ile Ile Val Arg Val Tyr Phe Pro Ile Leu Thr Glu Ile 
        275                 280                 285             


Gln Gln Ala Tyr Ile Gln Glu Leu Leu Pro Val Ser Phe Asn Asn Asp 
    290                 295                 300                 


Asn Ser Glu Trp Ile Ser Ile Val Pro Asn Phe Ile Leu Val Arg Asn 
305                 310                 315                 320 


Thr Leu Ile Ser Asn Ile Glu Ile Gly Phe Cys Leu Ile Thr Lys Arg 
                325                 330                 335     


Ser Val Ile Cys Asn Gln Asp Tyr Ala Thr Pro Met Thr Asn Asn Met 
            340                 345                 350         


Arg Glu Cys Leu Thr Gly Ser Thr Glu Lys Cys Pro Arg Glu Leu Val 
        355                 360                 365             


Val Ser Ser His Val Pro Arg Phe Ala Leu Ser Asn Gly Val Leu Phe 
    370                 375                 380                 


Ala Asn Cys Ile Ser Val Thr Cys Gln Cys Gln Thr Thr Gly Arg Ala 
385                 390                 395                 400 


Ile Ser Gln Ser Gly Glu Gln Thr Leu Leu Met Ile Asp Asn Thr Thr 
                405                 410                 415     


Cys Pro Thr Ala Val Leu Gly Asn Val Ile Ile Ser Leu Gly Lys Tyr 
            420                 425                 430         


Leu Gly Ser Val Asn Tyr Asn Ser Glu Gly Ile Ala Ile Gly Pro Pro 
        435                 440                 445             


Val Phe Thr Asp Lys Val Asp Ile Ser Ser Gln Ile Ser Ser Met Asn 
    450                 455                 460                 


Gln Ser Leu Gln Gln Ser Lys Asp Tyr Ile Lys Glu Ala Gln Arg Leu 
465                 470                 475                 480 


Leu Asp Thr Val Asn Pro Ser Leu Gly Ser Gly Tyr Ile Pro Glu Ala 
                485                 490                 495     


Pro Arg Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu 
            500                 505                 510         


Leu Ser Thr Phe Leu Gly Ser Gly Gly Ser Gly Gly Ser Gly His His 
        515                 520                 525             


His His His His His His 
    530                 


<210>  44
<211>  1238
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lujo virus

<400>  44
ccaccatggg gcagattgtc gctgtcttcc aggctattcc cgagattctg aacgaggcta       60

tcaacatcgt catcattgtc atcattatgt ttaccctgat caagggcgtg tttaatctgt      120

acaagtccgg cctgtttcag ctggtcatct tcctgctgct gtgcggcaag aggtgtgata      180

gctccctgct gtctggcttt aacctggaga cagtgcactt caatatgagc ctgctgtcta      240

gcatccccat ggtgtccgag cagcagcact gtatccagca caatcactcc tctatcacct      300

tttctctgct gacaaacaag agcgacctgg agaagtgcaa tttcaccagg ctgcaggccg      360

tggatcgcgt gatcttcgac ctgtttaggg agttccacca ccgcgtgggc gattttcctg      420

tgaccagcga cctgaagtgt agccacaaca catcctacag agtgatcgag tatgaggtga      480

ccaaggagtc tctgccaaga ctgcaggagg ccgtgagcac actgtttccc gatctgcacc      540

tgtccgagga ccgcttcctg cagatccagg cccacgacga taagaactgt accggcctgc      600

acccactgaa ttacctgagg ctgctgaagg agaactccga gacacactat aaggtgcgca      660

agctgatgaa gctgttccag tggagcctga gcgatgagac aggcagcccc ctgcctggag      720

gacactgcct ggagcggtgg ctgatctttg ccagcgatat caagtgcttc gacaacgccg      780

ccatcgccaa gtgtaataag gagcacgatg aggagttttg cgacatgctg cggctgttcg      840

attacaacaa ggccagcatc gccaagctga gaggcgaggc cagctcctct atcaacctgc      900

tgtccggcag gatcaatgcc atcatctctg acacactgct gatgcggagc tccctgaaga      960

gactgatggg catcccttac tgtaattata ccaagttttg gtacctgaac cacacaaagc     1020

tgggcatcca ctccctgcca cggtgctggc tggtgtccaa cggctcttat ctgaatgaga     1080

caaagttcac acacgacatg gaggatgagg ccgacaagct gctgaccgag atgctgaaga     1140

aggagtatgt gcggagacag gagaagacac ccatcaccct gatggacatt ggaagcggag     1200

gcagcggcgg atccggacac caccaccacc accactga                             1238


<210>  45
<211>  410
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Lujo virus

<400>  45

Met Gly Gln Ile Val Ala Val Phe Gln Ala Ile Pro Glu Ile Leu Asn 
1               5                   10                  15      


Glu Ala Ile Asn Ile Val Ile Ile Val Ile Ile Met Phe Thr Leu Ile 
            20                  25                  30          


Lys Gly Val Phe Asn Leu Tyr Lys Ser Gly Leu Phe Gln Leu Val Ile 
        35                  40                  45              


Phe Leu Leu Leu Cys Gly Lys Arg Cys Asp Ser Ser Leu Leu Ser Gly 
    50                  55                  60                  


Phe Asn Leu Glu Thr Val His Phe Asn Met Ser Leu Leu Ser Ser Ile 
65                  70                  75                  80  


Pro Met Val Ser Glu Gln Gln His Cys Ile Gln His Asn His Ser Ser 
                85                  90                  95      


Ile Thr Phe Ser Leu Leu Thr Asn Lys Ser Asp Leu Glu Lys Cys Asn 
            100                 105                 110         


Phe Thr Arg Leu Gln Ala Val Asp Arg Val Ile Phe Asp Leu Phe Arg 
        115                 120                 125             


Glu Phe His His Arg Val Gly Asp Phe Pro Val Thr Ser Asp Leu Lys 
    130                 135                 140                 


Cys Ser His Asn Thr Ser Tyr Arg Val Ile Glu Tyr Glu Val Thr Lys 
145                 150                 155                 160 


Glu Ser Leu Pro Arg Leu Gln Glu Ala Val Ser Thr Leu Phe Pro Asp 
                165                 170                 175     


Leu His Leu Ser Glu Asp Arg Phe Leu Gln Ile Gln Ala His Asp Asp 
            180                 185                 190         


Lys Asn Cys Thr Gly Leu His Pro Leu Asn Tyr Leu Arg Leu Leu Lys 
        195                 200                 205             


Glu Asn Ser Glu Thr His Tyr Lys Val Arg Lys Leu Met Lys Leu Phe 
    210                 215                 220                 


Gln Trp Ser Leu Ser Asp Glu Thr Gly Ser Pro Leu Pro Gly Gly His 
225                 230                 235                 240 


Cys Leu Glu Arg Trp Leu Ile Phe Ala Ser Asp Ile Lys Cys Phe Asp 
                245                 250                 255     


Asn Ala Ala Ile Ala Lys Cys Asn Lys Glu His Asp Glu Glu Phe Cys 
            260                 265                 270         


Asp Met Leu Arg Leu Phe Asp Tyr Asn Lys Ala Ser Ile Ala Lys Leu 
        275                 280                 285             


Arg Gly Glu Ala Ser Ser Ser Ile Asn Leu Leu Ser Gly Arg Ile Asn 
    290                 295                 300                 


Ala Ile Ile Ser Asp Thr Leu Leu Met Arg Ser Ser Leu Lys Arg Leu 
305                 310                 315                 320 


Met Gly Ile Pro Tyr Cys Asn Tyr Thr Lys Phe Trp Tyr Leu Asn His 
                325                 330                 335     


Thr Lys Leu Gly Ile His Ser Leu Pro Arg Cys Trp Leu Val Ser Asn 
            340                 345                 350         


Gly Ser Tyr Leu Asn Glu Thr Lys Phe Thr His Asp Met Glu Asp Glu 
        355                 360                 365             


Ala Asp Lys Leu Leu Thr Glu Met Leu Lys Lys Glu Tyr Val Arg Arg 
    370                 375                 380                 


Gln Glu Lys Thr Pro Ile Thr Leu Met Asp Ile Gly Ser Gly Gly Ser 
385                 390                 395                 400 


Gly Gly Ser Gly His His His His His His 
                405                 410 


