                         SEQUENCE LISTING

<110>  Janssen Sciences Ireland Unlimited Company
 
<120>  Self-replicating RNA molecules for Hepatitis B Virus (HBV) Vaccines 
       and Uses Thereof

<130>  065814.11217/9WO1

<150>  US 63/006,925
<151>  2020-04-08

<150>  US 62/863,961
<151>  2019-06-20

<160>  67    

<170>  PatentIn version 3.5

<210>  1
<211>  444
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  HBV truncated core antigen gene

<400>  1
gacatcgacc cttacaagga gttcggcgcc agcgtggaac tgctgtcttt tctgcccagt       60

gatttctttc cttccattcg agacctgctg gataccgcct ctgctctgta tcgggaagcc      120

ctggagagcc cagaacactg ctccccacac cataccgctc tgcgacaggc aatcctgtgc      180

tggggggagc tgatgaacct ggccacatgg gtgggatcga atctggagga ccccgcttca      240

cgggaactgg tggtcagcta cgtgaacgtc aatatgggcc tgaaaatccg ccagctgctg      300

tggttccata ttagctgcct gacttttgga cgagagaccg tgctggaata cctggtgtcc      360

ttcggcgtct ggattcgcac tccccctgct tatcgaccac ccaacgcacc aattctgtcc      420

accctgcccg agaccacagt ggtc                                             444


<210>  2
<211>  148
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HBV truncated core antigen

<400>  2

Asp Ile Asp Pro Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu Ser 
1               5                   10                  15      


Phe Leu Pro Ser Asp Phe Phe Pro Ser Ile Arg Asp Leu Leu Asp Thr 
            20                  25                  30          


Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser 
        35                  40                  45              


Pro His His Thr Ala Leu Arg Gln Ala Ile Leu Cys Trp Gly Glu Leu 
    50                  55                  60                  


Met Asn Leu Ala Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala Ser 
65                  70                  75                  80  


Arg Glu Leu Val Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys Ile 
                85                  90                  95      


Arg Gln Leu Leu Trp Phe His Ile Ser Cys Leu Thr Phe Gly Arg Glu 
            100                 105                 110         


Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp Ile Arg Thr Pro 
        115                 120                 125             


Pro Ala Tyr Arg Pro Pro Asn Ala Pro Ile Leu Ser Thr Leu Pro Glu 
    130                 135                 140                 


Thr Thr Val Val 
145             


<210>  3
<211>  447
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  HBV truncated core antigen gene

<400>  3
atggacatcg acccttacaa ggagttcggc gccagcgtgg aactgctgtc ttttctgccc       60

agtgatttct ttccttccat tcgagacctg ctggataccg cctctgctct gtatcgggaa      120

gccctggaga gcccagaaca ctgctcccca caccataccg ctctgcgaca ggcaatcctg      180

tgctgggggg agctgatgaa cctggccaca tgggtgggat ccaatctgga ggaccccgct      240

tcacgggaac tggtggtcag ctacgtgaac gtcaatatgg gcctgaaaat ccgccagctg      300

ctgtggttcc atattagctg cctgactttt ggacgagaga ccgtgctgga atacctggtg      360

tccttcggcg tctggatccg cactccccct gcttatcgac cacccaacgc accaattctg      420

tccaccctgc ccgagaccac agtggtc                                          447


<210>  4
<211>  149
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HBV truncated core antigen

<400>  4

Met Asp Ile Asp Pro Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu 
1               5                   10                  15      


Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Ile Arg Asp Leu Leu Asp 
            20                  25                  30          


Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
        35                  40                  45              


Ser Pro His His Thr Ala Leu Arg Gln Ala Ile Leu Cys Trp Gly Glu 
    50                  55                  60                  


Leu Met Asn Leu Ala Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala 
65                  70                  75                  80  


Ser Arg Glu Leu Val Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys 
                85                  90                  95      


Ile Arg Gln Leu Leu Trp Phe His Ile Ser Cys Leu Thr Phe Gly Arg 
            100                 105                 110         


Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp Ile Arg Thr 
        115                 120                 125             


Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro Ile Leu Ser Thr Leu Pro 
    130                 135                 140                 


Glu Thr Thr Val Val 
145                 


<210>  5
<211>  2529
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  HBV pol antigen gene

<400>  5
atgcccctgt cttaccagca ctttagaaag cttctgctgc tggacgatga agccgggcct       60

ctggaggaag agctgccaag gctggcagac gaggggctga accggagagt ggccgaagat      120

ctgaatctgg gaaacctgaa cgtgagcatc ccttggactc ataaagtcgg caacttcacc      180

gggctgtaca gctccacagt gcctgtcttc aatccagagt ggcagacacc atcctttccc      240

aacattcacc tgcaggagga catcattaat agatgcgaac agttcgtggg acctctgaca      300

gtcaacgaaa agaggcgcct gaaactgatc atgcctgcca ggttttaccc aaatgtgact      360

aagtatctgc cactggataa gggcatcaag ccttactatc cagagcacct ggtgaaccat      420

tacttccaga ctagacacta tctgcatacc ctgtggaagg ccggaatcct gtacaaacga      480

gaaactaccc ggagtgcttc attttgtggc tccccatatt cttgggaaca ggagctgcag      540

catggcaggc tggtgttcca gaccagcaca cgccacgggg atgagtcctt ttgccagcag      600

tctagtggca tcctgagcag atcccccgtg gggccttgtc tgcagtctca gctgcggaag      660

agtagactgg gactgcagcc acagcaggga cacctggcac gacggcagca gggaaggtct      720

ggcagtatcc gggctagagt gcatcccaca actagaaggc ctttcggcgt cgagccatca      780

ggaagcggcc acaccacaaa caccgcatca agctcctcta gttgcctgca tcagtcagcc      840

gtgagaaagg ccgcttacag ccacctgtcc acatctaaaa ggcactcaag ctccgggcat      900

gctgtggagc tgcacaacat ccctccaaat tctgcacgca gtcagtcaga aggacccgtg      960

ttcagctgct ggtggctgca gtttcggaac tcaaagcctt gcagcgacta ttgtctgagc     1020

catattgtga atctgctgga ggattggggc ccttgtaccg agcacgggga acaccatatc     1080

aggattccac gaacaccagc acgagtgact ggaggggtgt tcctggtgga caagaacccc     1140

cacaatacta ccgagagccg gctggtggtc gatttcagtc agttttcaag aggcaacaca     1200

agggtgtcat ggcccaaatt cgccgtccct aatctgcaga gtctgactaa cctgctgtct     1260

agtaatctga gctggctgtc cctggacgtg tccgcagcct tttaccacct gcctctgcat     1320

ccagctgcaa tgccccatct gctggtgggg tcaagcggac tgagtcgcta cgtcgcccga     1380

ctgtcctcta actcacgcat cattaatcac cagcatggca ccatgcagaa cctgcacgat     1440

agctgttccc ggaatctgta cgtgtctctg ctgctgctgt ataagacatt cggcagaaaa     1500

ctgcacctgt acagccatcc tatcattctg gggtttagga agatcccaat gggagtggga     1560

ctgagcccct tcctgctggc acagtttacc tccgccattt gctctgtggt ccgccgagcc     1620

ttcccacact gtctggcttt ttcctatatg aacaatgtgg tcctgggcgc caaatccgtg     1680

cagcatctgg agtctctgtt cacagctgtc actaactttc tgctgagcct ggggatccac     1740

ctgaacccaa ataagactaa acgctggggg tacagcctga atttcatggg atatgtgatt     1800

ggatcctggg ggaccctgcc acaggagcac atcgtgcaga agatcaagga atgctttcgg     1860

aagctgcccg tcaacagacc tatcgactgg aaagtgtgcc agcggattgt cggactgctg     1920

ggcttcgccg ctccctttac ccagtgcggg tacccagcac tgatgcccct gtatgcctgt     1980

atccagtcta agcaggcttt cacctttagt cctacataca aggcattcct gtgcaaacag     2040

tacctgaacc tgtatccagt ggcaaggcag cgacctggac tgtgccaggt ctttgcaaat     2100

gccactccta ccggctgggg gctggctatc ggacatcagc gaatgcgggg cacattcgtg     2160

gcccccctgc ctattcacac tgctcagctg ctggcagcct gctttgctag atctaggagt     2220

ggagcaaagc tgatcggcac cgacaatagt gtggtcctgt caagaaaata cacatccttc     2280

ccatggctgc tgggatgtgc tgcaaactgg attctgaggg gcaccagctt cgtgtacgtc     2340

ccctcagccc tgaatcctgc tgacgatcca tcccgcgggc gactgggact gtaccgacct     2400

ctgctgagac tgcccttcag gcctacaact ggccggacat ctctgtatgc cgattcacca     2460

agcgtgccct cacacctgcc tgacagagtc cactttgctt cacccctgca cgtcgcttgg     2520

cggcctcca                                                             2529


<210>  6
<211>  2529
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  HBV pol antigen gene

<400>  6
atgcccctgt cttaccagca ctttagaaag ctgctgctgc tggacgatga agccgggcct       60

ctggaggaag agctgccaag gctggcagac gaggggctga accggagagt ggccgaagat      120

ctgaatctgg gaaacctgaa cgtgagcatc ccttggactc ataaagtcgg caacttcacc      180

gggctgtaca gctccacagt gcctgtcttc aatccagagt ggcagacacc atcctttccc      240

aacattcacc tgcaggagga catcattaat agatgcgaac agttcgtggg acctctgaca      300

gtcaacgaaa agaggcgcct gaaactgatc atgcctgcca ggttttaccc aaatgtgact      360

aagtatctgc cactggataa gggcatcaag ccttactatc cagagcacct ggtgaaccat      420

tacttccaga ctagacacta tctgcatacc ctgtggaagg ccggaatcct gtacaaacga      480

gaaactaccc ggagtgcttc attttgtggc tccccatatt cttgggaaca ggagctgcag      540

catggcaggc tggtgttcca gaccagcaca cgccacgggg atgagtcctt ttgccagcag      600

tctagtggca tcctgagcag atcccccgtg gggccttgtc tgcagtctca gctgcggaag      660

agtagactgg gactgcagcc acagcaggga cacctggcac gacggcagca gggaaggtct      720

ggcagtatcc gggctagagt gcatcccaca actagaaggc ctttcggcgt cgagccatca      780

ggaagcggcc acaccacaaa caccgcatca agctcctcta gttgcctgca tcagtcagcc      840

gtgagaaagg ccgcttacag ccacctgtcc acatctaaaa ggcactcaag ctccgggcat      900

gctgtggagc tgcacaacat ccctccaaat tctgcacgca gtcagtcaga aggacccgtg      960

ttcagctgct ggtggctgca gtttcggaac tcaaagcctt gcagcgacta ttgtctgagc     1020

catattgtga atctgctgga ggattggggc ccttgtaccg agcacgggga acaccatatc     1080

aggattccac gaacaccagc acgagtgact ggaggggtgt tcctggtgga caagaacccc     1140

cacaatacta ccgagagccg gctggtggtc gatttcagtc agttttcaag aggcaacaca     1200

agggtgtcat ggcccaaatt cgccgtccct aatctgcaga gtctgactaa cctgctgtct     1260

agtaatctga gctggctgtc cctggacgtg tccgcagcct tttaccacct gcctctgcat     1320

ccagctgcaa tgccccatct gctggtgggg tcaagcggac tgagtcgcta cgtcgcccga     1380

ctgtcctcta actcacgcat cattaatcac cagcatggca ccatgcagaa cctgcacgat     1440

agctgttccc ggaatctgta cgtgtctctg ctgctgctgt ataagacatt cggcagaaaa     1500

ctgcacctgt acagccatcc tatcattctg gggtttagga agatcccaat gggagtggga     1560

ctgagcccct tcctgctggc acagtttacc tccgccattt gctctgtggt ccgccgagcc     1620

ttcccacact gtctggcttt ttcctatatg aacaatgtgg tcctgggcgc caaatccgtg     1680

cagcatctgg agtctctgtt cacagctgtc actaactttc tgctgagcct ggggatccac     1740

ctgaacccaa ataagactaa acgctggggg tacagcctga atttcatggg atatgtgatt     1800

ggatcctggg ggaccctgcc acaggagcac atcgtgcaga agatcaagga atgctttcgg     1860

aagctgcccg tcaacagacc tatcgactgg aaagtgtgcc agcggattgt cggactgctg     1920

ggcttcgccg ctccctttac ccagtgcggg tacccagcac tgatgcccct gtatgcctgt     1980

atccagtcta agcaggcttt cacctttagt cctacataca aggcattcct gtgcaaacag     2040

tacctgaacc tgtatccagt ggcaaggcag cgacctggac tgtgccaggt ctttgcaaat     2100

gccactccta ccggctgggg gctggctatc ggacatcagc gaatgcgggg cacattcgtg     2160

gcccccctgc ctattcacac tgctcagctg ctggcagcct gctttgctag atctaggagt     2220

ggagcaaagc tgatcggcac cgacaatagt gtggtcctgt caagaaaata cacatccttc     2280

ccatggctgc tgggatgtgc tgcaaactgg attctgaggg gcaccagctt cgtgtacgtc     2340

ccctcagccc tgaatcctgc tgacgatcca tcccgcgggc gactgggact gtaccgacct     2400

ctgctgagac tgcccttcag gcctacaact ggccggacat ctctgtatgc cgattcacca     2460

agcgtgccct cacacctgcc tgacagagtc cactttgctt cacccctgca cgtcgcttgg     2520

cggcctcca                                                             2529


<210>  7
<211>  843
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HBV pol antigen

<400>  7

Met Pro Leu Ser Tyr Gln His Phe Arg Lys Leu Leu Leu Leu Asp Asp 
1               5                   10                  15      


Glu Ala Gly Pro Leu Glu Glu Glu Leu Pro Arg Leu Ala Asp Glu Gly 
            20                  25                  30          


Leu Asn Arg Arg Val Ala Glu Asp Leu Asn Leu Gly Asn Leu Asn Val 
        35                  40                  45              


Ser Ile Pro Trp Thr His Lys Val Gly Asn Phe Thr Gly Leu Tyr Ser 
    50                  55                  60                  


Ser Thr Val Pro Val Phe Asn Pro Glu Trp Gln Thr Pro Ser Phe Pro 
65                  70                  75                  80  


Asn Ile His Leu Gln Glu Asp Ile Ile Asn Arg Cys Glu Gln Phe Val 
                85                  90                  95      


Gly Pro Leu Thr Val Asn Glu Lys Arg Arg Leu Lys Leu Ile Met Pro 
            100                 105                 110         


Ala Arg Phe Tyr Pro Asn Val Thr Lys Tyr Leu Pro Leu Asp Lys Gly 
        115                 120                 125             


Ile Lys Pro Tyr Tyr Pro Glu His Leu Val Asn His Tyr Phe Gln Thr 
    130                 135                 140                 


Arg His Tyr Leu His Thr Leu Trp Lys Ala Gly Ile Leu Tyr Lys Arg 
145                 150                 155                 160 


Glu Thr Thr Arg Ser Ala Ser Phe Cys Gly Ser Pro Tyr Ser Trp Glu 
                165                 170                 175     


Gln Glu Leu Gln His Gly Arg Leu Val Phe Gln Thr Ser Thr Arg His 
            180                 185                 190         


Gly Asp Glu Ser Phe Cys Gln Gln Ser Ser Gly Ile Leu Ser Arg Ser 
        195                 200                 205             


Pro Val Gly Pro Cys Leu Gln Ser Gln Leu Arg Lys Ser Arg Leu Gly 
    210                 215                 220                 


Leu Gln Pro Gln Gln Gly His Leu Ala Arg Arg Gln Gln Gly Arg Ser 
225                 230                 235                 240 


Gly Ser Ile Arg Ala Arg Val His Pro Thr Thr Arg Arg Pro Phe Gly 
                245                 250                 255     


Val Glu Pro Ser Gly Ser Gly His Thr Thr Asn Thr Ala Ser Ser Ser 
            260                 265                 270         


Ser Ser Cys Leu His Gln Ser Ala Val Arg Lys Ala Ala Tyr Ser His 
        275                 280                 285             


Leu Ser Thr Ser Lys Arg His Ser Ser Ser Gly His Ala Val Glu Leu 
    290                 295                 300                 


His Asn Ile Pro Pro Asn Ser Ala Arg Ser Gln Ser Glu Gly Pro Val 
305                 310                 315                 320 


Phe Ser Cys Trp Trp Leu Gln Phe Arg Asn Ser Lys Pro Cys Ser Asp 
                325                 330                 335     


Tyr Cys Leu Ser His Ile Val Asn Leu Leu Glu Asp Trp Gly Pro Cys 
            340                 345                 350         


Thr Glu His Gly Glu His His Ile Arg Ile Pro Arg Thr Pro Ala Arg 
        355                 360                 365             


Val Thr Gly Gly Val Phe Leu Val Asp Lys Asn Pro His Asn Thr Thr 
    370                 375                 380                 


Glu Ser Arg Leu Val Val Asp Phe Ser Gln Phe Ser Arg Gly Asn Thr 
385                 390                 395                 400 


Arg Val Ser Trp Pro Lys Phe Ala Val Pro Asn Leu Gln Ser Leu Thr 
                405                 410                 415     


Asn Leu Leu Ser Ser Asn Leu Ser Trp Leu Ser Leu Asp Val Ser Ala 
            420                 425                 430         


Ala Phe Tyr His Leu Pro Leu His Pro Ala Ala Met Pro His Leu Leu 
        435                 440                 445             


Val Gly Ser Ser Gly Leu Ser Arg Tyr Val Ala Arg Leu Ser Ser Asn 
    450                 455                 460                 


Ser Arg Ile Ile Asn His Gln His Gly Thr Met Gln Asn Leu His Asp 
465                 470                 475                 480 


Ser Cys Ser Arg Asn Leu Tyr Val Ser Leu Leu Leu Leu Tyr Lys Thr 
                485                 490                 495     


Phe Gly Arg Lys Leu His Leu Tyr Ser His Pro Ile Ile Leu Gly Phe 
            500                 505                 510         


Arg Lys Ile Pro Met Gly Val Gly Leu Ser Pro Phe Leu Leu Ala Gln 
        515                 520                 525             


Phe Thr Ser Ala Ile Cys Ser Val Val Arg Arg Ala Phe Pro His Cys 
    530                 535                 540                 


Leu Ala Phe Ser Tyr Met Asn Asn Val Val Leu Gly Ala Lys Ser Val 
545                 550                 555                 560 


Gln His Leu Glu Ser Leu Phe Thr Ala Val Thr Asn Phe Leu Leu Ser 
                565                 570                 575     


Leu Gly Ile His Leu Asn Pro Asn Lys Thr Lys Arg Trp Gly Tyr Ser 
            580                 585                 590         


Leu Asn Phe Met Gly Tyr Val Ile Gly Ser Trp Gly Thr Leu Pro Gln 
        595                 600                 605             


Glu His Ile Val Gln Lys Ile Lys Glu Cys Phe Arg Lys Leu Pro Val 
    610                 615                 620                 


Asn Arg Pro Ile Asp Trp Lys Val Cys Gln Arg Ile Val Gly Leu Leu 
625                 630                 635                 640 


Gly Phe Ala Ala Pro Phe Thr Gln Cys Gly Tyr Pro Ala Leu Met Pro 
                645                 650                 655     


Leu Tyr Ala Cys Ile Gln Ser Lys Gln Ala Phe Thr Phe Ser Pro Thr 
            660                 665                 670         


Tyr Lys Ala Phe Leu Cys Lys Gln Tyr Leu Asn Leu Tyr Pro Val Ala 
        675                 680                 685             


Arg Gln Arg Pro Gly Leu Cys Gln Val Phe Ala Asn Ala Thr Pro Thr 
    690                 695                 700                 


Gly Trp Gly Leu Ala Ile Gly His Gln Arg Met Arg Gly Thr Phe Val 
705                 710                 715                 720 


Ala Pro Leu Pro Ile His Thr Ala Gln Leu Leu Ala Ala Cys Phe Ala 
                725                 730                 735     


Arg Ser Arg Ser Gly Ala Lys Leu Ile Gly Thr Asp Asn Ser Val Val 
            740                 745                 750         


Leu Ser Arg Lys Tyr Thr Ser Phe Pro Trp Leu Leu Gly Cys Ala Ala 
        755                 760                 765             


Asn Trp Ile Leu Arg Gly Thr Ser Phe Val Tyr Val Pro Ser Ala Leu 
    770                 775                 780                 


Asn Pro Ala Asp Asp Pro Ser Arg Gly Arg Leu Gly Leu Tyr Arg Pro 
785                 790                 795                 800 


Leu Leu Arg Leu Pro Phe Arg Pro Thr Thr Gly Arg Thr Ser Leu Tyr 
                805                 810                 815     


Ala Asp Ser Pro Ser Val Pro Ser His Leu Pro Asp Arg Val His Phe 
            820                 825                 830         


Ala Ser Pro Leu His Val Ala Trp Arg Pro Pro 
        835                 840             


<210>  8
<211>  63
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Cystatin S signal peptide coding sequence

<400>  8
atggctcgac ctctgtgtac cctgctactc ctgatggcta ccctggctgg agctctggcc       60

agc                                                                     63


<210>  9
<211>  21
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Cystatin S signal peptide sequence

<400>  9

Met Ala Arg Pro Leu Cys Thr Leu Leu Leu Leu Met Ala Thr Leu Ala 
1               5                   10                  15      


Gly Ala Leu Ala Ser 
            20      


<210>  10
<211>  378
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  triple enhancer regulatory sequence

<400>  10
ggctcgcatc tctccttcac gcgcccgccg ccctacctga ggccgccatc cacgccggtt       60

gagtcgcgtt ctgccgcctc ccgcctgtgg tgcctcctga actgcgtccg ccgtctaggt      120

aagtttaaag ctcaggtcga gaccgggcct ttgtccggcg ctcccttgga gcctacctag      180

actcagccgg ctctccacgc tttgcctgac cctgcttgct caactctagt tctctcgtta      240

acttaatgag acagatagaa actggtcttg tagaaacaga gtagtcgcct gcttttctgc      300

caggtgctga cttctctccc ctgggctttt ttctttttct caggttgaaa agaagaagac      360

gaagaagacg aagaagac                                                    378


<210>  11
<211>  12
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  linker coding sequence

<400>  11
gccggagctg gc                                                           12


<210>  12
<211>  248
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ApoAI gene fragment

<400>  12
ttggccgtgc tcttcctgac gggtaggtgt cccctaacct agggagccaa ccatcggggg       60

gccttctccc taaatccccg tggcccaccc tcctgggcag aggcagcagg tttctcactg      120

gccccctctc ccccacctcc aagcttggcc tttcggctca gatctcagcc cacagctggc      180

ctgatctggg tctcccctcc caccctcagg gagccaggct cggcatttcg tcgacaagct      240

tagccacc                                                               248


<210>  13
<211>  130
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SV40 polyadenylation signal sequence

<400>  13
aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca       60

aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct      120

tatcatgtct                                                             130


<210>  14
<211>  81
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  immunoglobulin secretion signal coding sequence

<400>  14
atggagttcg gcctgtcttg ggtctttctg gtggcaatcc tgaagggcgt gcagtgtgaa       60

gtgcagctgc tggagtctgg a                                                 81


<210>  15
<211>  27
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  immunoglobulin secretion signal sequence

<400>  15

Met Glu Phe Gly Leu Ser Trp Val Phe Leu Val Ala Ile Leu Lys Gly 
1               5                   10                  15      


Val Gln Cys Glu Val Gln Leu Leu Glu Ser Gly 
            20                  25          


<210>  16
<211>  996
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HBV core-pol fusion antigen sequence

<400>  16

Met Asp Ile Asp Pro Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu 
1               5                   10                  15      


Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Ile Arg Asp Leu Leu Asp 
            20                  25                  30          


Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
        35                  40                  45              


Ser Pro His His Thr Ala Leu Arg Gln Ala Ile Leu Cys Trp Gly Glu 
    50                  55                  60                  


Leu Met Asn Leu Ala Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala 
65                  70                  75                  80  


Ser Arg Glu Leu Val Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys 
                85                  90                  95      


Ile Arg Gln Leu Leu Trp Phe His Ile Ser Cys Leu Thr Phe Gly Arg 
            100                 105                 110         


Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp Ile Arg Thr 
        115                 120                 125             


Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro Ile Leu Ser Thr Leu Pro 
    130                 135                 140                 


Glu Thr Thr Val Val Ala Gly Ala Gly Met Pro Leu Ser Tyr Gln His 
145                 150                 155                 160 


Phe Arg Lys Leu Leu Leu Leu Asp Asp Glu Ala Gly Pro Leu Glu Glu 
                165                 170                 175     


Glu Leu Pro Arg Leu Ala Asp Glu Gly Leu Asn Arg Arg Val Ala Glu 
            180                 185                 190         


Asp Leu Asn Leu Gly Asn Leu Asn Val Ser Ile Pro Trp Thr His Lys 
        195                 200                 205             


Val Gly Asn Phe Thr Gly Leu Tyr Ser Ser Thr Val Pro Val Phe Asn 
    210                 215                 220                 


Pro Glu Trp Gln Thr Pro Ser Phe Pro Asn Ile His Leu Gln Glu Asp 
225                 230                 235                 240 


Ile Ile Asn Arg Cys Glu Gln Phe Val Gly Pro Leu Thr Val Asn Glu 
                245                 250                 255     


Lys Arg Arg Leu Lys Leu Ile Met Pro Ala Arg Phe Tyr Pro Asn Val 
            260                 265                 270         


Thr Lys Tyr Leu Pro Leu Asp Lys Gly Ile Lys Pro Tyr Tyr Pro Glu 
        275                 280                 285             


His Leu Val Asn His Tyr Phe Gln Thr Arg His Tyr Leu His Thr Leu 
    290                 295                 300                 


Trp Lys Ala Gly Ile Leu Tyr Lys Arg Glu Thr Thr Arg Ser Ala Ser 
305                 310                 315                 320 


Phe Cys Gly Ser Pro Tyr Ser Trp Glu Gln Glu Leu Gln His Gly Arg 
                325                 330                 335     


Leu Val Phe Gln Thr Ser Thr Arg His Gly Asp Glu Ser Phe Cys Gln 
            340                 345                 350         


Gln Ser Ser Gly Ile Leu Ser Arg Ser Pro Val Gly Pro Cys Leu Gln 
        355                 360                 365             


Ser Gln Leu Arg Lys Ser Arg Leu Gly Leu Gln Pro Gln Gln Gly His 
    370                 375                 380                 


Leu Ala Arg Arg Gln Gln Gly Arg Ser Gly Ser Ile Arg Ala Arg Val 
385                 390                 395                 400 


His Pro Thr Thr Arg Arg Pro Phe Gly Val Glu Pro Ser Gly Ser Gly 
                405                 410                 415     


His Thr Thr Asn Thr Ala Ser Ser Ser Ser Ser Cys Leu His Gln Ser 
            420                 425                 430         


Ala Val Arg Lys Ala Ala Tyr Ser His Leu Ser Thr Ser Lys Arg His 
        435                 440                 445             


Ser Ser Ser Gly His Ala Val Glu Leu His Asn Ile Pro Pro Asn Ser 
    450                 455                 460                 


Ala Arg Ser Gln Ser Glu Gly Pro Val Phe Ser Cys Trp Trp Leu Gln 
465                 470                 475                 480 


Phe Arg Asn Ser Lys Pro Cys Ser Asp Tyr Cys Leu Ser His Ile Val 
                485                 490                 495     


Asn Leu Leu Glu Asp Trp Gly Pro Cys Thr Glu His Gly Glu His His 
            500                 505                 510         


Ile Arg Ile Pro Arg Thr Pro Ala Arg Val Thr Gly Gly Val Phe Leu 
        515                 520                 525             


Val Asp Lys Asn Pro His Asn Thr Thr Glu Ser Arg Leu Val Val Asp 
    530                 535                 540                 


Phe Ser Gln Phe Ser Arg Gly Asn Thr Arg Val Ser Trp Pro Lys Phe 
545                 550                 555                 560 


Ala Val Pro Asn Leu Gln Ser Leu Thr Asn Leu Leu Ser Ser Asn Leu 
                565                 570                 575     


Ser Trp Leu Ser Leu Asp Val Ser Ala Ala Phe Tyr His Leu Pro Leu 
            580                 585                 590         


His Pro Ala Ala Met Pro His Leu Leu Val Gly Ser Ser Gly Leu Ser 
        595                 600                 605             


Arg Tyr Val Ala Arg Leu Ser Ser Asn Ser Arg Ile Ile Asn His Gln 
    610                 615                 620                 


His Gly Thr Met Gln Asn Leu His Asp Ser Cys Ser Arg Asn Leu Tyr 
625                 630                 635                 640 


Val Ser Leu Leu Leu Leu Tyr Lys Thr Phe Gly Arg Lys Leu His Leu 
                645                 650                 655     


Tyr Ser His Pro Ile Ile Leu Gly Phe Arg Lys Ile Pro Met Gly Val 
            660                 665                 670         


Gly Leu Ser Pro Phe Leu Leu Ala Gln Phe Thr Ser Ala Ile Cys Ser 
        675                 680                 685             


Val Val Arg Arg Ala Phe Pro His Cys Leu Ala Phe Ser Tyr Met Asn 
    690                 695                 700                 


Asn Val Val Leu Gly Ala Lys Ser Val Gln His Leu Glu Ser Leu Phe 
705                 710                 715                 720 


Thr Ala Val Thr Asn Phe Leu Leu Ser Leu Gly Ile His Leu Asn Pro 
                725                 730                 735     


Asn Lys Thr Lys Arg Trp Gly Tyr Ser Leu Asn Phe Met Gly Tyr Val 
            740                 745                 750         


Ile Gly Ser Trp Gly Thr Leu Pro Gln Glu His Ile Val Gln Lys Ile 
        755                 760                 765             


Lys Glu Cys Phe Arg Lys Leu Pro Val Asn Arg Pro Ile Asp Trp Lys 
    770                 775                 780                 


Val Cys Gln Arg Ile Val Gly Leu Leu Gly Phe Ala Ala Pro Phe Thr 
785                 790                 795                 800 


Gln Cys Gly Tyr Pro Ala Leu Met Pro Leu Tyr Ala Cys Ile Gln Ser 
                805                 810                 815     


Lys Gln Ala Phe Thr Phe Ser Pro Thr Tyr Lys Ala Phe Leu Cys Lys 
            820                 825                 830         


Gln Tyr Leu Asn Leu Tyr Pro Val Ala Arg Gln Arg Pro Gly Leu Cys 
        835                 840                 845             


Gln Val Phe Ala Asn Ala Thr Pro Thr Gly Trp Gly Leu Ala Ile Gly 
    850                 855                 860                 


His Gln Arg Met Arg Gly Thr Phe Val Ala Pro Leu Pro Ile His Thr 
865                 870                 875                 880 


Ala Gln Leu Leu Ala Ala Cys Phe Ala Arg Ser Arg Ser Gly Ala Lys 
                885                 890                 895     


Leu Ile Gly Thr Asp Asn Ser Val Val Leu Ser Arg Lys Tyr Thr Ser 
            900                 905                 910         


Phe Pro Trp Leu Leu Gly Cys Ala Ala Asn Trp Ile Leu Arg Gly Thr 
        915                 920                 925             


Ser Phe Val Tyr Val Pro Ser Ala Leu Asn Pro Ala Asp Asp Pro Ser 
    930                 935                 940                 


Arg Gly Arg Leu Gly Leu Tyr Arg Pro Leu Leu Arg Leu Pro Phe Arg 
945                 950                 955                 960 


Pro Thr Thr Gly Arg Thr Ser Leu Tyr Ala Asp Ser Pro Ser Val Pro 
                965                 970                 975     


Ser His Leu Pro Asp Arg Val His Phe Ala Ser Pro Leu His Val Ala 
            980                 985                 990         


Trp Arg Pro Pro 
        995     


<210>  17
<211>  1023
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HBV core-pol fusion antigen sequence with Ig signal sequence

<400>  17

Met Glu Phe Gly Leu Ser Trp Val Phe Leu Val Ala Ile Leu Lys Gly 
1               5                   10                  15      


Val Gln Cys Glu Val Gln Leu Leu Glu Ser Gly Met Asp Ile Asp Pro 
            20                  25                  30          


Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu Ser Phe Leu Pro Ser 
        35                  40                  45              


Asp Phe Phe Pro Ser Ile Arg Asp Leu Leu Asp Thr Ala Ser Ala Leu 
    50                  55                  60                  


Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His His Thr 
65                  70                  75                  80  


Ala Leu Arg Gln Ala Ile Leu Cys Trp Gly Glu Leu Met Asn Leu Ala 
                85                  90                  95      


Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala Ser Arg Glu Leu Val 
            100                 105                 110         


Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys Ile Arg Gln Leu Leu 
        115                 120                 125             


Trp Phe His Ile Ser Cys Leu Thr Phe Gly Arg Glu Thr Val Leu Glu 
    130                 135                 140                 


Tyr Leu Val Ser Phe Gly Val Trp Ile Arg Thr Pro Pro Ala Tyr Arg 
145                 150                 155                 160 


Pro Pro Asn Ala Pro Ile Leu Ser Thr Leu Pro Glu Thr Thr Val Val 
                165                 170                 175     


Ala Gly Ala Gly Met Pro Leu Ser Tyr Gln His Phe Arg Lys Leu Leu 
            180                 185                 190         


Leu Leu Asp Asp Glu Ala Gly Pro Leu Glu Glu Glu Leu Pro Arg Leu 
        195                 200                 205             


Ala Asp Glu Gly Leu Asn Arg Arg Val Ala Glu Asp Leu Asn Leu Gly 
    210                 215                 220                 


Asn Leu Asn Val Ser Ile Pro Trp Thr His Lys Val Gly Asn Phe Thr 
225                 230                 235                 240 


Gly Leu Tyr Ser Ser Thr Val Pro Val Phe Asn Pro Glu Trp Gln Thr 
                245                 250                 255     


Pro Ser Phe Pro Asn Ile His Leu Gln Glu Asp Ile Ile Asn Arg Cys 
            260                 265                 270         


Glu Gln Phe Val Gly Pro Leu Thr Val Asn Glu Lys Arg Arg Leu Lys 
        275                 280                 285             


Leu Ile Met Pro Ala Arg Phe Tyr Pro Asn Val Thr Lys Tyr Leu Pro 
    290                 295                 300                 


Leu Asp Lys Gly Ile Lys Pro Tyr Tyr Pro Glu His Leu Val Asn His 
305                 310                 315                 320 


Tyr Phe Gln Thr Arg His Tyr Leu His Thr Leu Trp Lys Ala Gly Ile 
                325                 330                 335     


Leu Tyr Lys Arg Glu Thr Thr Arg Ser Ala Ser Phe Cys Gly Ser Pro 
            340                 345                 350         


Tyr Ser Trp Glu Gln Glu Leu Gln His Gly Arg Leu Val Phe Gln Thr 
        355                 360                 365             


Ser Thr Arg His Gly Asp Glu Ser Phe Cys Gln Gln Ser Ser Gly Ile 
    370                 375                 380                 


Leu Ser Arg Ser Pro Val Gly Pro Cys Leu Gln Ser Gln Leu Arg Lys 
385                 390                 395                 400 


Ser Arg Leu Gly Leu Gln Pro Gln Gln Gly His Leu Ala Arg Arg Gln 
                405                 410                 415     


Gln Gly Arg Ser Gly Ser Ile Arg Ala Arg Val His Pro Thr Thr Arg 
            420                 425                 430         


Arg Pro Phe Gly Val Glu Pro Ser Gly Ser Gly His Thr Thr Asn Thr 
        435                 440                 445             


Ala Ser Ser Ser Ser Ser Cys Leu His Gln Ser Ala Val Arg Lys Ala 
    450                 455                 460                 


Ala Tyr Ser His Leu Ser Thr Ser Lys Arg His Ser Ser Ser Gly His 
465                 470                 475                 480 


Ala Val Glu Leu His Asn Ile Pro Pro Asn Ser Ala Arg Ser Gln Ser 
                485                 490                 495     


Glu Gly Pro Val Phe Ser Cys Trp Trp Leu Gln Phe Arg Asn Ser Lys 
            500                 505                 510         


Pro Cys Ser Asp Tyr Cys Leu Ser His Ile Val Asn Leu Leu Glu Asp 
        515                 520                 525             


Trp Gly Pro Cys Thr Glu His Gly Glu His His Ile Arg Ile Pro Arg 
    530                 535                 540                 


Thr Pro Ala Arg Val Thr Gly Gly Val Phe Leu Val Asp Lys Asn Pro 
545                 550                 555                 560 


His Asn Thr Thr Glu Ser Arg Leu Val Val Asp Phe Ser Gln Phe Ser 
                565                 570                 575     


Arg Gly Asn Thr Arg Val Ser Trp Pro Lys Phe Ala Val Pro Asn Leu 
            580                 585                 590         


Gln Ser Leu Thr Asn Leu Leu Ser Ser Asn Leu Ser Trp Leu Ser Leu 
        595                 600                 605             


Asp Val Ser Ala Ala Phe Tyr His Leu Pro Leu His Pro Ala Ala Met 
    610                 615                 620                 


Pro His Leu Leu Val Gly Ser Ser Gly Leu Ser Arg Tyr Val Ala Arg 
625                 630                 635                 640 


Leu Ser Ser Asn Ser Arg Ile Ile Asn His Gln His Gly Thr Met Gln 
                645                 650                 655     


Asn Leu His Asp Ser Cys Ser Arg Asn Leu Tyr Val Ser Leu Leu Leu 
            660                 665                 670         


Leu Tyr Lys Thr Phe Gly Arg Lys Leu His Leu Tyr Ser His Pro Ile 
        675                 680                 685             


Ile Leu Gly Phe Arg Lys Ile Pro Met Gly Val Gly Leu Ser Pro Phe 
    690                 695                 700                 


Leu Leu Ala Gln Phe Thr Ser Ala Ile Cys Ser Val Val Arg Arg Ala 
705                 710                 715                 720 


Phe Pro His Cys Leu Ala Phe Ser Tyr Met Asn Asn Val Val Leu Gly 
                725                 730                 735     


Ala Lys Ser Val Gln His Leu Glu Ser Leu Phe Thr Ala Val Thr Asn 
            740                 745                 750         


Phe Leu Leu Ser Leu Gly Ile His Leu Asn Pro Asn Lys Thr Lys Arg 
        755                 760                 765             


Trp Gly Tyr Ser Leu Asn Phe Met Gly Tyr Val Ile Gly Ser Trp Gly 
    770                 775                 780                 


Thr Leu Pro Gln Glu His Ile Val Gln Lys Ile Lys Glu Cys Phe Arg 
785                 790                 795                 800 


Lys Leu Pro Val Asn Arg Pro Ile Asp Trp Lys Val Cys Gln Arg Ile 
                805                 810                 815     


Val Gly Leu Leu Gly Phe Ala Ala Pro Phe Thr Gln Cys Gly Tyr Pro 
            820                 825                 830         


Ala Leu Met Pro Leu Tyr Ala Cys Ile Gln Ser Lys Gln Ala Phe Thr 
        835                 840                 845             


Phe Ser Pro Thr Tyr Lys Ala Phe Leu Cys Lys Gln Tyr Leu Asn Leu 
    850                 855                 860                 


Tyr Pro Val Ala Arg Gln Arg Pro Gly Leu Cys Gln Val Phe Ala Asn 
865                 870                 875                 880 


Ala Thr Pro Thr Gly Trp Gly Leu Ala Ile Gly His Gln Arg Met Arg 
                885                 890                 895     


Gly Thr Phe Val Ala Pro Leu Pro Ile His Thr Ala Gln Leu Leu Ala 
            900                 905                 910         


Ala Cys Phe Ala Arg Ser Arg Ser Gly Ala Lys Leu Ile Gly Thr Asp 
        915                 920                 925             


Asn Ser Val Val Leu Ser Arg Lys Tyr Thr Ser Phe Pro Trp Leu Leu 
    930                 935                 940                 


Gly Cys Ala Ala Asn Trp Ile Leu Arg Gly Thr Ser Phe Val Tyr Val 
945                 950                 955                 960 


Pro Ser Ala Leu Asn Pro Ala Asp Asp Pro Ser Arg Gly Arg Leu Gly 
                965                 970                 975     


Leu Tyr Arg Pro Leu Leu Arg Leu Pro Phe Arg Pro Thr Thr Gly Arg 
            980                 985                 990         


Thr Ser Leu Tyr Ala Asp Ser Pro  Ser Val Pro Ser His  Leu Pro Asp 
        995                 1000                 1005             


Arg Val  His Phe Ala Ser Pro  Leu His Val Ala Trp  Arg Pro Pro 
    1010                 1015                 1020             


<210>  18
<211>  584
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  hCMV promoter

<400>  18
tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt agttcatagc       60

ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg ctgaccgccc      120

aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac gccaataggg      180

actttccatt gacgtcaatg ggtggactat ttacggtaaa ctgcccactt ggcagtacat      240

caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc      300

tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta catctacgta      360

ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag      420

cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt      480

tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa      540

atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagc                       584


<210>  19
<211>  684
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  hCMV promoter sequence

<400>  19
accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt       60

agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg      120

ctgaccgccc aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac      180

gccaataggg actttccatt gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt      240

ggcagtacat caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa      300

atggcccgcc tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta      360

catctacgta ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg      420

gcgtggatag cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg      480

gagtttgttt tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc      540

attgacgcaa atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagctcgttt      600

agtgaaccgt cagatcgcct ggagacgcca tccacgctgt tttgacctcc atagaagaca      660

ccgggaccga tccagcctcc gcgg                                             684


<210>  20
<211>  225
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  bGH polyA signal

<400>  20
ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc       60

tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc      120

tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt      180

gggaagacaa tagcaggcat gctggggatg cggtgggctc tatgg                      225


<210>  21
<211>  671
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pUC ORI

<400>  21
cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc       60

ttgcaaacaa aaaaaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact      120

ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt tcttctagtg      180

tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg      240

ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac      300

tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca      360

cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga      420

gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc      480

ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct      540

gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg      600

agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct      660

tttgctcaca t                                                           671


<210>  22
<211>  795
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  KanR coding sequence

<400>  22
atgattgagc aagatggtct tcacgctggc tcgccagctg cgtgggtgga acgcctgttt       60

ggttatgatt gggcgcagca gactattgga tgttccgacg cggctgtatt tcggctgtct      120

gctcagggtc gccccgtgct gtttgtgaag acggatttgt ctggcgcatt aaatgagtta      180

caggacgagg cggctcgtct gagttggttg gccaccaccg gcgtgccctg cgccgcagtg      240

ctggatgtcg tgacagaagc aggccgcgat tggctccttc tcggcgaagt gccgggccag      300

gacctgctca gcagccactt ggcaccggca gaaaaagttt ctatcatggc cgacgccatg      360

cgtcgtcttc acactctcga tccggccacg tgcccctttg accaccaggc caagcatcgt      420

attgaacgtg cgcgtactcg gatggaagca ggtttagtag accaggacga tttggatgag      480

gaacatcaag gcctggcccc ggctgaactg tttgcgcgct taaaagcgtc gatgccagat      540

ggcgaagatt tggtagtcac ccatggagat gcgtgtttgc caaacatcat ggttgaaaat      600

ggccgcttct caggctttat tgactgtggg cgcctgggtg ttgccgaccg ctatcaagat      660

attgcgctcg caactcgtga catcgctgaa gagctgggcg gagaatgggc tgaccgtttc      720

ctggtactgt atggcattgc agcgcccgat tcccaacgca tcgcatttta tcgtctgctg      780

gatgagtttt tctaa                                                       795


<210>  23
<211>  264
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized Kanr

<400>  23

Met Ile Glu Gln Asp Gly Leu His Ala Gly Ser Pro Ala Ala Trp Val 
1               5                   10                  15      


Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gln Gln Thr Ile Gly Cys Ser 
            20                  25                  30          


Asp Ala Ala Val Phe Arg Leu Ser Ala Gln Gly Arg Pro Val Leu Phe 
        35                  40                  45              


Val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gln Asp Glu Ala 
    50                  55                  60                  


Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly Val Pro Cys Ala Ala Val 
65                  70                  75                  80  


Leu Asp Val Val Thr Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu 
                85                  90                  95      


Val Pro Gly Gln Asp Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys 
            100                 105                 110         


Val Ser Ile Met Ala Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro 
        115                 120                 125             


Ala Thr Cys Pro Phe Asp His Gln Ala Lys His Arg Ile Glu Arg Ala 
    130                 135                 140                 


Arg Thr Arg Met Glu Ala Gly Leu Val Asp Gln Asp Asp Leu Asp Glu 
145                 150                 155                 160 


Glu His Gln Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala 
                165                 170                 175     


Ser Met Pro Asp Gly Glu Asp Leu Val Val Thr His Gly Asp Ala Cys 
            180                 185                 190         


Leu Pro Asn Ile Met Val Glu Asn Gly Arg Phe Ser Gly Phe Ile Asp 
        195                 200                 205             


Cys Gly Arg Leu Gly Val Ala Asp Arg Tyr Gln Asp Ile Ala Leu Ala 
    210                 215                 220                 


Thr Arg Asp Ile Ala Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe 
225                 230                 235                 240 


Leu Val Leu Tyr Gly Ile Ala Ala Pro Asp Ser Gln Arg Ile Ala Phe 
                245                 250                 255     


Tyr Arg Leu Leu Asp Glu Phe Phe 
            260                 


<210>  24
<211>  99
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  bla promoter

<400>  24
acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa       60

ccctgataaa tgcttcaata atattgaaaa aggaagagt                              99


<210>  25
<211>  8364
<212>  DNA
<213>  Artificial sequence

<220>
<223>  alphavirus genome sequence

<400>  25
taatacgact cactatagag aggcggcgca tgagagaagc ccagaccaat tacctaccca       60

aaatggagaa agttcacgtt gacatcgagg aagacagccc attcctcaga gctttgcagc      120

ggagcttccc gcagtttgag gtagaagcca agcaggtcac tgataatgac catgctaatg      180

ccagagcgtt ttcgcatctg gcttcaaaac tgatcgaaac ggaggtggac ccatccgaca      240

cgatccttga cattggaagt gcgcccgccc gcagaatgta ttctaagcac aagtatcatt      300

gtatctgtcc gatgagatgt gcggaagatc cggacagatt gtataagtat gcaactaagc      360

tgaagaaaaa ctgtaaggaa ataactgata aggaattgga caagaaaatg aaggagctcg      420

ccgccgtcat gagcgaccct gacctggaaa ctgagactat gtgcctccac gacgacgagt      480

cgtgtcgcta cgaagggcaa gtcgctgttt accaggatgt atacgcggtt gacggaccga      540

caagtctcta tcaccaagcc aataagggag ttagagtcgc ctactggata ggctttgaca      600

ccaccccttt tatgtttaag aacttggctg gagcatatcc atcatactct accaactggg      660

ccgacgaaac cgtgttaacg gctcgtaaca taggcctatg cagctctgac gttatggagc      720

ggtcacgtag agggatgtcc attcttagaa agaagtattt gaaaccatcc aacaatgttc      780

tattctctgt tggctcgacc atctaccacg agaagaggga cttactgagg agctggcacc      840

tgccgtctgt atttcactta cgtggcaagc aaaattacac atgtcggtgt gagactatag      900

ttagttgcga cgggtacgtc gttaaaagaa tagctatcag tccaggcctg tatgggaagc      960

cttcaggcta tgctgctacg atgcaccgcg agggattctt gtgctgcaaa gtgacagaca     1020

cattgaacgg ggagagggtc tcttttcccg tgtgcacgta tgtgccagct acattgtgtg     1080

accaaatgac tggcatactg gcaacagatg tcagtgcgga cgacgcgcaa aaactgctgg     1140

ttgggctcaa ccagcgtata gtcgtcaacg gtcgcaccca gagaaacacc aataccatga     1200

aaaattacct tttgcccgta gtggcccagg catttgctag gtgggcaaag gaatataagg     1260

aagatcaaga agatgaaagg ccactaggac tacgagatag acagttagtc atggggtgtt     1320

gttgggcttt tagaaggcac aagataacat ctatttataa gcgcccggat acccaaacca     1380

tcatcaaagt gaacagcgat ttccactcat tcgtgctgcc caggataggc agtaacacat     1440

tggagatcgg gctgagaaca agaatcagga aaatgttaga ggagcacaag gagccgtcac     1500

ctctcattac cgccgaggac gtacaagaag ctaagtgcgc agccgatgag gctaaggagg     1560

tgcgtgaagc cgaggagttg cgcgcagctc taccaccttt ggcagctgat gttgaggagc     1620

ccactctgga agccgatgtc gacttgatgt tacaagaggc tggggccggc tcagtggaga     1680

cacctcgtgg cttgataaag gttaccagct acgatggcga ggacaagatc ggctcttacg     1740

ctgtgctttc tccgcaggct gtactcaaga gtgaaaaatt atcttgcatc caccctctcg     1800

ctgaacaagt catagtgata acacactctg gccgaaaagg gcgttatgcc gtggaaccat     1860

accatggtaa agtagtggtg ccagagggac atgcaatacc cgtccaggac tttcaagctc     1920

tgagtgaaag tgccaccatt gtgtacaacg aacgtgagtt cgtaaacagg tacctgcacc     1980

atattgccac acatggagga gcgctgaaca ctgatgaaga atattacaaa actgtcaagc     2040

ccagcgagca cgacggcgaa tacctgtacg acatcgacag gaaacagtgc gtcaagaaag     2100

aactagtcac tgggctaggg ctcacaggcg agctggtgga tcctcccttc catgaattcg     2160

cctacgagag tctgagaaca cgaccagccg ctccttacca agtaccaacc ataggggtgt     2220

atggcgtgcc aggatcaggc aagtctggca tcattaaaag cgcagtcacc aaaaaagatc     2280

tagtggtgag cgccaagaaa gaaaactgtg cagaaattat aagggacgtc aagaaaatga     2340

aagggctgga cgtcaatgcc agaactgtgg actcagtgct cttgaatgga tgcaaacacc     2400

ccgtagagac cctgtatatt gacgaagctt ttgcttgtca tgcaggtact ctcagagcgc     2460

tcatagccat tataagacct aaaaaggcag tgctctgcgg ggatcccaaa cagtgcggtt     2520

tttttaacat gatgtgcctg aaagtgcatt ttaaccacga gatttgcaca caagtcttcc     2580

acaaaagcat ctctcgccgt tgcactaaat ctgtgacttc ggtcgtctca accttgtttt     2640

acgacaaaaa aatgagaacg acgaatccga aagagactaa gattgtgatt gacactaccg     2700

gcagtaccaa acctaagcag gacgatctca ttctcacttg tttcagaggg tgggtgaagc     2760

agttgcaaat agattacaaa ggcaacgaaa taatgacggc agctgcctct caagggctga     2820

cccgtaaagg tgtgtatgcc gttcggtaca aggtgaatga aaatcctctg tacgcaccca     2880

cctctgaaca tgtgaacgtc ctactgaccc gcacggagga ccgcatcgtg tggaaaacac     2940

tagccggcga cccatggata aaaacactga ctgccaagta ccctgggaat ttcactgcca     3000

cgatagagga gtggcaagca gagcatgatg ccatcatgag gcacatcttg gagagaccgg     3060

accctaccga cgtcttccag aataaggcaa acgtgtgttg ggccaaggct ttagtgccgg     3120

tgctgaagac cgctggcata gacatgacca ctgaacaatg gaacactgtg gattattttg     3180

aaacggacaa agctcactca gcagagatag tattgaacca actatgcgtg aggttctttg     3240

gactcgatct ggactccggt ctattttctg cacccactgt tccgttatcc attaggaata     3300

atcactggga taactccccg tcgcctaaca tgtacgggct gaataaagaa gtggtccgtc     3360

agctctctcg caggtaccca caactgcctc gggcagttgc cactggaaga gtctatgaca     3420

tgaacactgg tacactgcgc aattatgatc cgcgcataaa cctagtacct gtaaacagaa     3480

gactgcctca tgctttagtc ctccaccata atgaacaccc acagagtgac ttttcttcat     3540

tcgtcagcaa attgaagggc agaactgtcc tggtggtcgg ggaaaagttg tccgtcccag     3600

gcaaaatggt tgactggttg tcagaccggc ctgaggctac cttcagagct cggctggatt     3660

taggcatccc aggtgatgtg cccaaatatg acataatatt tgttaatgtg aggaccccat     3720

ataaatacca tcactatcag cagtgtgaag accatgccat taagcttagc atgttgacca     3780

agaaagcttg tctgcatctg aatcccggcg gaacctgtgt cagcataggt tatggttacg     3840

ctgacagggc cagcgaaagc atcattggtg ctatagcgcg gcagttcaag ttttcccggg     3900

tatgcaaacc gaaatcctca cttgaagaga cggaagttct gtttgtattc attgggtacg     3960

atcgcaaggc ccgtacgcac aatccttaca agctttcatc aaccttgacc aacatttata     4020

caggttccag actccacgaa gccggatgtg caccctcata tcatgtggtg cgaggggata     4080

ttgccacggc caccgaagga gtgattataa atgctgctaa cagcaaagga caacctggcg     4140

gaggggtgtg cggagcgctg tataagaaat tcccggaaag cttcgattta cagccgatcg     4200

aagtaggaaa agcgcgactg gtcaaaggtg cagctaaaca tatcattcat gccgtaggac     4260

caaacttcaa caaagtttcg gaggttgaag gtgacaaaca gttggcagag gcttatgagt     4320

ccatcgctaa gattgtcaac gataacaatt acaagtcagt agcgattcca ctgttgtcca     4380

ccggcatctt ttccgggaac aaagatcgac taacccaatc attgaaccat ttgctgacag     4440

ctttagacac cactgatgca gatgtagcca tatactgcag ggacaagaaa tgggaaatga     4500

ctctcaagga agcagtggct aggagagaag cagtggagga gatatgcata tccgacgact     4560

cttcagtgac agaacctgat gcagagctgg tgagggtgca tccgaagagt tctttggctg     4620

gaaggaaggg ctacagcaca agcgatggca aaactttctc atatttggaa gggaccaagt     4680

ttcaccaggc ggccaaggat atagcagaaa ttaatgccat gtggcccgtt gcaacggagg     4740

ccaatgagca ggtatgcatg tatatcctcg gagaaagcat gagcagtatt aggtcgaaat     4800

gccccgtcga agagtcggaa gcctccacac cacctagcac gctgccttgc ttgtgcatcc     4860

atgccatgac tccagaaaga gtacagcgcc taaaagcctc acgtccagaa caaattactg     4920

tgtgctcatc ctttccattg ccgaagtata gaatcactgg tgtgcagaag atccaatgct     4980

cccagcctat attgttctca ccgaaagtgc ctgcgtatat tcatccaagg aagtatctcg     5040

tggaaacacc accggtagac gagactccgg agccatcggc agagaaccaa tccacagagg     5100

ggacacctga acaaccacca cttataaccg aggatgagac caggactaga acgcctgagc     5160

cgatcatcat cgaagaggaa gaagaggata gcataagttt gctgtcagat ggcccgaccc     5220

accaggtgct gcaagtcgag gcagacattc acgggccgcc ctctgtatct agctcatcct     5280

ggtccattcc tcatgcatcc gactttgatg tggacagttt atccatactt gacaccctgg     5340

agggagctag cgtgaccagc ggggcaacgt cagccgagac taactcttac ttcgcaaaga     5400

gtatggagtt tctggcgcga ccggtgcctg cgcctcgaac agtattcagg aaccctccac     5460

atcccgctcc gcgcacaaga acaccgtcac ttgcacccag cagggcctgc tcgagaacca     5520

gcctagtttc caccccgcca ggcgtgaata gggtgatcac tagagaggag ctcgaggcgc     5580

ttaccccgtc acgcactcct agcaggtcgg tctcgagaac cagcctggtc tccaacccgc     5640

caggcgtaaa tagggtgatt acaagagagg agtttgaggc gttcgtagca caacaacaat     5700

gacggtttga tgcgggtgca tacatctttt cctccgacac cggtcaaggg catttacaac     5760

aaaaatcagt aaggcaaacg gtgctatccg aagtggtgtt ggagaggacc gaattggaga     5820

tttcgtatgc cccgcgcctc gaccaagaaa aagaagaatt actacgcaag aaattacagt     5880

taaatcccac acctgctaac agaagcagat accagtccag gaaggtggag aacatgaaag     5940

ccataacagc tagacgtatt ctgcaaggcc tagggcatta tttgaaggca gaaggaaaag     6000

tggagtgcta ccgaaccctg catcctgttc ctttgtattc atctagtgtg aaccgtgcct     6060

tttcaagccc caaggtcgca gtggaagcct gtaacgccat gttgaaagag aactttccga     6120

ctgtggcttc ttactgtatt attccagagt acgatgccta tttggacatg gttgacggag     6180

cttcatgctg cttagacact gccagttttt gccctgcaaa gctgcgcagc tttccaaaga     6240

aacactccta tttggaaccc acaatacgat cggcagtgcc ttcagcgatc cagaacacgc     6300

tccagaacgt cctggcagct gccacaaaaa gaaattgcaa tgtcacgcaa atgagagaat     6360

tgcccgtatt ggattcggcg gcctttaatg tggaatgctt caagaaatat gcgtgtaata     6420

atgaatattg ggaaacgttt aaagaaaacc ccatcaggct tactgaagaa aacgtggtaa     6480

attacattac caaattaaaa ggaccaaaag ctgctgctct ttttgcgaag acacataatt     6540

tgaatatgtt gcaggacata ccaatggaca ggtttgtaat ggacttaaag agagacgtga     6600

aagtgactcc aggaacaaaa catactgaag aacggcccaa ggtacaggtg atccaggctg     6660

ccgatccgct agcaacagcg tatctgtgcg gaatccaccg agagctggtt aggagattaa     6720

atgcggtcct gcttccgaac attcatacac tgtttgatat gtcggctgaa gactttgacg     6780

ctattatagc cgagcacttc cagcctgggg attgtgttct ggaaactgac atcgcgtcgt     6840

ttgataaaag tgaggacgac gccatggctc tgaccgcgtt aatgattctg gaagacttag     6900

gtgtggacgc agagctgttg acgctgattg aggcggcttt cggcgaaatt tcatcaatac     6960

atttgcccac taaaactaaa tttaaattcg gagccatgat gaaatctgga atgttcctca     7020

cactgtttgt gaacacagtc attaacattg taatcgcaag cagagtgttg agagaacggc     7080

taaccggatc accatgtgca gcattcattg gagatgacaa tatcgtgaaa ggagtcaaat     7140

cggacaaatt aatggcagac aggtgcgcca cctggttgaa tatggaagtc aagattatag     7200

atgctgtggt gggcgagaaa gcgccttatt tctgtggagg gtttattttg tgtgactccg     7260

tgaccggcac agcgtgccgt gtggcagacc ccctaaaaag gctgtttaag cttggcaaac     7320

ctctggcagc agacgatgaa catgatgatg acaggagaag ggcattgcat gaagagtcaa     7380

cacgctggaa ccgagtgggt attctttcag agctgtgcaa ggcagtagaa tcaaggtatg     7440

aaaccgtagg aacttccatc atagttatgg ccatgactac tctagctagc agtgttaaat     7500

cattcagcta cctgagaggg gcccctataa ctctctacgg ctaacctgaa tggactacga     7560

catagtctag tccgccaaga tatcatcgat acagcagcaa ttggcaagct gcttacatag     7620

aaggcgcgcc gtttaaacgg ccggccttaa ttaagtaacg atacagcagc aattggcaag     7680

ctgcttacat agaactcgcg gcgattggca tgccgcttta aaatttttat tttatttttc     7740

ttttcttttc cgaatcggat tttgttttta atatttcaaa aaaaaaaaaa aaaaaaaaaa     7800

aaaaaaaaaa aaaaaaaccc ctctctaaac ggaggggttt ttttcagcgt aactggactg     7860

gccacagtta ggcggccgcg catgttcatc atcagtaacc cgtatcgtga gcatcctctc     7920

tcgtttcatc ggtatcatta cctccatgaa cagaaatccc ccttacacgg aggcatcagt     7980

gaccaaacag gaaaaaaccg cccttaacat ggcccgcttt atcagaagcc agacattaac     8040

gcttctggag aaactcaacg agctggacgc ggatgaacag gcagacatct gtgaatcgct     8100

tcacgaccac gctgatgagc tttaccgcag ctgcctcgcg cgtttcggtg atgacggtga     8160

aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg     8220

gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gcgcagccat     8280

gacccagtca cgtagcgata gcggagtgta tactggctta actatgcggc atcagagcag     8340

attgtactga gagtgcacca tatg                                            8364


<210>  26
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Aura virus

<400>  26
atagcggacg gactagtact tgtactacag aattaactgc cgtgtgccgc                  50


<210>  27
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Chikungunya virus

<400>  27
atggctgcgt gagacacacg tagcctacca gtttcttact gctctactct                  50


<210>  28
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of O'nyong-nyong virus

<400>  28
atagctgcgt gatacacaca cgcagcttac gggtttcata ctgctctact                  50


<210>  29
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Bebaru virus

<400>  29
atggcggctg tgtgacacac gagccgtcga tttcaacctt cttgctccct                  50


<210>  30
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Semliki forest virus

<400>  30
atggcggatg tgtgacatac acgacgccaa aagattttgt tccagctcct                  50


<210>  31
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Mayaro virus

<400>  31
atggcgggca agtgacactt gttccgccgg tcgtctctaa gctcttcctc                  50


<210>  32
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Getah virus

<400>  32
atggcggacg tgtgacatca ccgttcgctc tttctaggat cctttgctac                  50


<210>  33
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Sagiyama virus

<400>  33
atggcggacg tgtgacatca ccgttcgctc tttctaggat cctttgctac                  50


<210>  34
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Ndumu virus

<400>  34
atggtgcgga gttgagagac gaagcaccaa acaactacgc ggctcaccat                  50


<210>  35
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Middelburg virus

<400>  35
attggtggtt acgtacacgt gccaccaccc cccaccctcc aagcgatcca                  50


<210>  36
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Eastern equine encephalitis virus

<400>  36
atagggtacg gtgtagaggc aaccacccta tttccaccta tccaaaatgg                  50


<210>  37
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Fort Morgan virus

<400>  37
atagggtatg gtttagaggc gcctacccta cttaaccgat ccaaacatgg                  50


<210>  38
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Buggy Creek virus

<400>  38
atagggtatg gtttagaggc gcctacccta cttaaccgat ccaaacatgg                  50


<210>  39
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Venezuelan equine encephalitis virus

<400>  39
atgggcggcg caagagagaa gcccaaacca attacctacc caaaatggag                  50


<210>  40
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Whataroa virus

<400>  40
attggcggca tagtacatac tatataaaag aaacagccga ccaattgcac                  50


<210>  41
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Sindbis virus

<400>  41
attgacggcg tagtacacac tattgaatca aacagccgac caattgcact                  50


<210>  42
<211>  50
<212>  DNA
<213>  Artificial sequence

<220>
<223>  5' UTR of Bebaru virus

<400>  42
attggcggcg tagtacacac tattgaatca aacagccgac caattgcact                  50


<210>  43
<211>  145
<212>  DNA
<213>  Artificial sequence

<220>
<223>  DLP from Sindbis virus

<400>  43
atagtcagca tagtacattt catctgacta atactacaac accaccacca tgaatagagg       60

attctttaac atgctcggcc gccgcccctt cccggccccc actgccatgt ggaggccgcg      120

gagaaggagg caggcggccc cgatg                                            145


<210>  44
<211>  104
<212>  DNA
<213>  Artificial sequence

<220>
<223>  DLP from Sindbis virus

<400>  44
atgaatagag gattctttaa catgctcggc cgccgcccct tcccggcccc cactgccatg       60

tggaggccgc ggagaaggag gcaggcggcc ccgatgcctg cccg                       104


<210>  45
<211>  120
<212>  DNA
<213>  Artificial sequence

<220>
<223>  DLP from Aura virus

<400>  45
atgaactctg tcttttacaa tccgtttggc cgaggtgcct acgctcaacc tccaatagca       60

tggaggccaa gacgtagggc tgcacctgcg cctcgaccat ccgggttgac tacccagatc      120


<210>  46
<211>  71
<212>  DNA
<213>  Artificial sequence

<220>
<223>  DLP from Eastern Equine Encephalitis virus SA

<400>  46
atgtttccgt atccaacatt gaactacccg cctatggcac cggttaatcc gatggcatac       60

agggacccca a                                                            71


<210>  47
<211>  91
<212>  DNA
<213>  Artificial sequence

<220>
<223>  DLP from O'nyong-nyong virus

<400>  47
atggagttca taccagcaca aacttactac aatagaagat accagcctag accctggact       60

caacgcccta ctatccaggt gatcaggcca a                                      91


<210>  48
<211>  67
<212>  DNA
<213>  Artificial sequence

<220>
<223>  DLP from Semliki forest virus

<400>  48
atgaattaca tccctacgca aacgttttac ggccgccggt ggcgcccgcg cccggcggcc       60

cgtcctt                                                                 67


<210>  49
<211>  69
<212>  DNA
<213>  Artificial sequence

<220>
<223>  DLP from Ross River virus

<400>  49
atgaattaca taccaaccca gactttttac ggacgccgtt ggcggcctcg cccggcgttc       60

cgtccatgg                                                               69


<210>  50
<211>  91
<212>  DNA
<213>  Artificial sequence

<220>
<223>  DLP from Mayaro virus

<400>  50
atggatttcc taccaacaca agtgttttat ggcaggcgat ggagaccacg aatgccgcca       60

cgcccttgga ggccacgccc acctacaatt c                                      91


<210>  51
<211>  2492
<212>  PRT
<213>  Artificial sequence

<220>
<223>  non-structural polyprotein (P1234), PRT, Venezuelan equine 
       encephalitis virus (VEEV)

<400>  51

Met Glu Lys Val His Val Asp Ile Glu Glu Asp Ser Pro Phe Leu Arg 
1               5                   10                  15      


Ala Leu Gln Arg Ser Phe Pro Gln Phe Glu Val Glu Ala Lys Gln Val 
            20                  25                  30          


Thr Asp Asn Asp His Ala Asn Ala Arg Ala Phe Ser His Leu Ala Ser 
        35                  40                  45              


Lys Leu Ile Glu Thr Glu Val Asp Pro Ser Asp Thr Ile Leu Asp Ile 
    50                  55                  60                  


Gly Ser Ala Pro Ala Arg Arg Met Tyr Ser Lys His Lys Tyr His Cys 
65                  70                  75                  80  


Ile Cys Pro Met Arg Cys Ala Glu Asp Pro Asp Arg Leu Tyr Lys Tyr 
                85                  90                  95      


Ala Thr Lys Leu Lys Lys Asn Cys Lys Glu Ile Thr Asp Lys Glu Leu 
            100                 105                 110         


Asp Lys Lys Met Lys Glu Leu Ala Ala Val Met Ser Asp Pro Asp Leu 
        115                 120                 125             


Glu Thr Glu Thr Met Cys Leu His Asp Asp Glu Ser Cys Arg Tyr Glu 
    130                 135                 140                 


Gly Gln Val Ala Val Tyr Gln Asp Val Tyr Ala Val Asp Gly Pro Thr 
145                 150                 155                 160 


Ser Leu Tyr His Gln Ala Asn Lys Gly Val Arg Val Ala Tyr Trp Ile 
                165                 170                 175     


Gly Phe Asp Thr Thr Pro Phe Met Phe Lys Asn Leu Ala Gly Ala Tyr 
            180                 185                 190         


Pro Ser Tyr Ser Thr Asn Trp Ala Asp Glu Thr Val Leu Thr Ala Arg 
        195                 200                 205             


Asn Ile Gly Leu Cys Ser Ser Asp Val Met Glu Arg Ser Arg Arg Gly 
    210                 215                 220                 


Met Ser Ile Leu Arg Lys Lys Tyr Leu Lys Pro Ser Asn Asn Val Leu 
225                 230                 235                 240 


Phe Ser Val Gly Ser Thr Ile Tyr His Glu Lys Arg Asp Leu Leu Arg 
                245                 250                 255     


Ser Trp His Leu Pro Ser Val Phe His Leu Arg Gly Lys Gln Asn Tyr 
            260                 265                 270         


Thr Cys Arg Cys Glu Thr Ile Val Ser Cys Asp Gly Tyr Val Val Lys 
        275                 280                 285             


Arg Ile Ala Ile Ser Pro Gly Leu Tyr Gly Lys Pro Ser Gly Tyr Ala 
    290                 295                 300                 


Ala Thr Met His Arg Glu Gly Phe Leu Cys Cys Lys Val Thr Asp Thr 
305                 310                 315                 320 


Leu Asn Gly Glu Arg Val Ser Phe Pro Val Cys Thr Tyr Val Pro Ala 
                325                 330                 335     


Thr Leu Cys Asp Gln Met Thr Gly Ile Leu Ala Thr Asp Val Ser Ala 
            340                 345                 350         


Asp Asp Ala Gln Lys Leu Leu Val Gly Leu Asn Gln Arg Ile Val Val 
        355                 360                 365             


Asn Gly Arg Thr Gln Arg Asn Thr Asn Thr Met Lys Asn Tyr Leu Leu 
    370                 375                 380                 


Pro Val Val Ala Gln Ala Phe Ala Arg Trp Ala Lys Glu Tyr Lys Glu 
385                 390                 395                 400 


Asp Gln Glu Asp Glu Arg Pro Leu Gly Leu Arg Asp Arg Gln Leu Val 
                405                 410                 415     


Met Gly Cys Cys Trp Ala Phe Arg Arg His Lys Ile Thr Ser Ile Tyr 
            420                 425                 430         


Lys Arg Pro Asp Thr Gln Thr Ile Ile Lys Val Asn Ser Asp Phe His 
        435                 440                 445             


Ser Phe Val Leu Pro Arg Ile Gly Ser Asn Thr Leu Glu Ile Gly Leu 
    450                 455                 460                 


Arg Thr Arg Ile Arg Lys Met Leu Glu Glu His Lys Glu Pro Ser Pro 
465                 470                 475                 480 


Leu Ile Thr Ala Glu Asp Val Gln Glu Ala Lys Cys Ala Ala Asp Glu 
                485                 490                 495     


Ala Lys Glu Val Arg Glu Ala Glu Glu Leu Arg Ala Ala Leu Pro Pro 
            500                 505                 510         


Leu Ala Ala Asp Val Glu Glu Pro Thr Leu Glu Ala Asp Val Asp Leu 
        515                 520                 525             


Met Leu Gln Glu Ala Gly Ala Gly Ser Val Glu Thr Pro Arg Gly Leu 
    530                 535                 540                 


Ile Lys Val Thr Ser Tyr Ala Gly Glu Asp Lys Ile Gly Ser Tyr Ala 
545                 550                 555                 560 


Val Leu Ser Pro Gln Ala Val Leu Lys Ser Glu Lys Leu Ser Cys Ile 
                565                 570                 575     


His Pro Leu Ala Glu Gln Val Ile Val Ile Thr His Ser Gly Arg Lys 
            580                 585                 590         


Gly Arg Tyr Ala Val Glu Pro Tyr His Gly Lys Val Val Val Pro Glu 
        595                 600                 605             


Gly His Ala Ile Pro Val Gln Asp Phe Gln Ala Leu Ser Glu Ser Ala 
    610                 615                 620                 


Thr Ile Val Tyr Asn Glu Arg Glu Phe Val Asn Arg Tyr Leu His His 
625                 630                 635                 640 


Ile Ala Thr His Gly Gly Ala Leu Asn Thr Asp Glu Glu Tyr Tyr Lys 
                645                 650                 655     


Thr Val Lys Pro Ser Glu His Asp Gly Glu Tyr Leu Tyr Asp Ile Asp 
            660                 665                 670         


Arg Lys Gln Cys Val Lys Lys Glu Leu Val Thr Gly Leu Gly Leu Thr 
        675                 680                 685             


Gly Glu Leu Val Asp Pro Pro Phe His Glu Phe Ala Tyr Glu Ser Leu 
    690                 695                 700                 


Arg Thr Arg Pro Ala Ala Pro Tyr Gln Val Pro Thr Ile Gly Val Tyr 
705                 710                 715                 720 


Gly Val Pro Gly Ser Gly Lys Ser Gly Ile Ile Lys Ser Ala Val Thr 
                725                 730                 735     


Lys Lys Asp Leu Val Val Ser Ala Lys Lys Glu Asn Cys Ala Glu Ile 
            740                 745                 750         


Ile Arg Asp Val Lys Lys Met Lys Gly Leu Asp Val Asn Ala Arg Thr 
        755                 760                 765             


Val Asp Ser Val Leu Leu Asn Gly Cys Lys His Pro Val Glu Thr Leu 
    770                 775                 780                 


Tyr Ile Asp Glu Ala Phe Ala Cys His Ala Gly Thr Leu Arg Ala Leu 
785                 790                 795                 800 


Ile Ala Ile Ile Arg Pro Lys Lys Ala Val Leu Cys Gly Asp Pro Lys 
                805                 810                 815     


Gln Cys Gly Phe Phe Asn Met Met Cys Leu Lys Val His Phe Asn His 
            820                 825                 830         


Glu Ile Cys Thr Gln Val Phe His Lys Ser Ile Ser Arg Arg Cys Thr 
        835                 840                 845             


Lys Ser Val Thr Ser Val Val Ser Thr Leu Phe Tyr Asp Lys Lys Met 
    850                 855                 860                 


Arg Thr Thr Asn Pro Lys Glu Thr Lys Ile Val Ile Asp Thr Thr Gly 
865                 870                 875                 880 


Ser Thr Lys Pro Lys Gln Asp Asp Leu Ile Leu Thr Cys Phe Arg Gly 
                885                 890                 895     


Trp Val Lys Gln Leu Gln Ile Asp Tyr Lys Gly Asn Glu Ile Met Thr 
            900                 905                 910         


Ala Ala Ala Ser Gln Gly Leu Thr Arg Lys Gly Val Tyr Ala Val Arg 
        915                 920                 925             


Tyr Lys Val Asn Glu Asn Pro Leu Tyr Ala Pro Thr Ser Glu His Val 
    930                 935                 940                 


Asn Val Leu Leu Thr Arg Thr Glu Asp Arg Ile Val Trp Lys Thr Leu 
945                 950                 955                 960 


Ala Gly Asp Pro Trp Ile Lys Thr Leu Thr Ala Lys Tyr Pro Gly Asn 
                965                 970                 975     


Phe Thr Ala Thr Ile Glu Glu Trp Gln Ala Glu His Asp Ala Ile Met 
            980                 985                 990         


Arg His Ile Leu Glu Arg Pro Asp  Pro Thr Asp Val Phe  Gln Asn Lys 
        995                 1000                 1005             


Ala Asn  Val Cys Trp Ala Lys  Ala Leu Val Pro Val  Leu Lys Thr 
    1010                 1015                 1020             


Ala Gly  Ile Asp Met Thr Thr  Glu Gln Trp Asn Thr  Val Asp Tyr 
    1025                 1030                 1035             


Phe Glu  Thr Asp Lys Ala His  Ser Ala Glu Ile Val  Leu Asn Gln 
    1040                 1045                 1050             


Leu Cys  Val Arg Phe Phe Gly  Leu Asp Leu Asp Ser  Gly Leu Phe 
    1055                 1060                 1065             


Ser Ala  Pro Thr Val Pro Leu  Ser Ile Arg Asn Asn  His Trp Asp 
    1070                 1075                 1080             


Asn Ser  Pro Ser Pro Asn Met  Tyr Gly Leu Asn Lys  Glu Val Val 
    1085                 1090                 1095             


Arg Gln  Leu Ser Arg Arg Tyr  Pro Gln Leu Pro Arg  Ala Val Ala 
    1100                 1105                 1110             


Thr Gly  Arg Val Tyr Asp Met  Asn Thr Gly Thr Leu  Arg Asn Tyr 
    1115                 1120                 1125             


Asp Pro  Arg Ile Asn Leu Val  Pro Val Asn Arg Arg  Leu Pro His 
    1130                 1135                 1140             


Ala Leu  Val Leu His His Asn  Glu His Pro Gln Ser  Asp Phe Ser 
    1145                 1150                 1155             


Ser Phe  Val Ser Lys Leu Lys  Gly Arg Thr Val Leu  Val Val Gly 
    1160                 1165                 1170             


Glu Lys  Leu Ser Val Pro Gly  Lys Met Val Asp Trp  Leu Ser Asp 
    1175                 1180                 1185             


Arg Pro  Glu Ala Thr Phe Arg  Ala Arg Leu Asp Leu  Gly Ile Pro 
    1190                 1195                 1200             


Gly Asp  Val Pro Lys Tyr Asp  Ile Ile Phe Val Asn  Val Arg Thr 
    1205                 1210                 1215             


Pro Tyr  Lys Tyr His His Tyr  Gln Gln Cys Glu Asp  His Ala Ile 
    1220                 1225                 1230             


Lys Leu  Ser Met Leu Thr Lys  Lys Ala Cys Leu His  Leu Asn Pro 
    1235                 1240                 1245             


Gly Gly  Thr Cys Val Ser Ile  Gly Tyr Gly Tyr Ala  Asp Arg Ala 
    1250                 1255                 1260             


Ser Glu  Ser Ile Ile Gly Ala  Ile Ala Arg Gln Phe  Lys Phe Ser 
    1265                 1270                 1275             


Arg Val  Cys Lys Pro Lys Ser  Ser Leu Glu Glu Thr  Glu Val Leu 
    1280                 1285                 1290             


Phe Val  Phe Ile Gly Tyr Asp  Arg Lys Ala Arg Thr  His Asn Pro 
    1295                 1300                 1305             


Tyr Lys  Leu Ser Ser Thr Leu  Thr Asn Ile Tyr Thr  Gly Ser Arg 
    1310                 1315                 1320             


Leu His  Glu Ala Gly Cys Ala  Pro Ser Tyr His Val  Val Arg Gly 
    1325                 1330                 1335             


Asp Ile  Ala Thr Ala Thr Glu  Gly Val Ile Ile Asn  Ala Ala Asn 
    1340                 1345                 1350             


Ser Lys  Gly Gln Pro Gly Gly  Gly Val Cys Gly Ala  Leu Tyr Lys 
    1355                 1360                 1365             


Lys Phe  Pro Glu Ser Phe Asp  Leu Gln Pro Ile Glu  Val Gly Lys 
    1370                 1375                 1380             


Ala Arg  Leu Val Lys Gly Ala  Ala Lys His Ile Ile  His Ala Val 
    1385                 1390                 1395             


Gly Pro  Asn Phe Asn Lys Val  Ser Glu Val Glu Gly  Asp Lys Gln 
    1400                 1405                 1410             


Leu Ala  Glu Ala Tyr Glu Ser  Ile Ala Lys Ile Val  Asn Asp Asn 
    1415                 1420                 1425             


Asn Tyr  Lys Ser Val Ala Ile  Pro Leu Leu Ser Thr  Gly Ile Phe 
    1430                 1435                 1440             


Ser Gly  Asn Lys Asp Arg Leu  Thr Gln Ser Leu Asn  His Leu Leu 
    1445                 1450                 1455             


Thr Ala  Leu Asp Thr Thr Asp  Ala Asp Val Ala Ile  Tyr Cys Arg 
    1460                 1465                 1470             


Asp Lys  Lys Trp Glu Met Thr  Leu Lys Glu Ala Val  Ala Arg Arg 
    1475                 1480                 1485             


Glu Ala  Val Glu Glu Ile Cys  Ile Ser Asp Asp Ser  Ser Val Thr 
    1490                 1495                 1500             


Glu Pro  Asp Ala Glu Leu Val  Arg Val His Pro Lys  Ser Ser Leu 
    1505                 1510                 1515             


Ala Gly  Arg Lys Gly Tyr Ser  Thr Ser Asp Gly Lys  Thr Phe Ser 
    1520                 1525                 1530             


Tyr Leu  Glu Gly Thr Lys Phe  His Gln Ala Ala Lys  Asp Ile Ala 
    1535                 1540                 1545             


Glu Ile  Asn Ala Met Trp Pro  Val Ala Thr Glu Ala  Asn Glu Gln 
    1550                 1555                 1560             


Val Cys  Met Tyr Ile Leu Gly  Glu Ser Met Ser Ser  Ile Arg Ser 
    1565                 1570                 1575             


Lys Cys  Pro Val Glu Glu Ser  Glu Ala Ser Thr Pro  Pro Ser Thr 
    1580                 1585                 1590             


Leu Pro  Cys Leu Cys Ile His  Ala Met Thr Pro Glu  Arg Val Gln 
    1595                 1600                 1605             


Arg Leu  Lys Ala Ser Arg Pro  Glu Gln Ile Thr Val  Cys Ser Ser 
    1610                 1615                 1620             


Phe Pro  Leu Pro Lys Tyr Arg  Ile Thr Gly Val Gln  Lys Ile Gln 
    1625                 1630                 1635             


Cys Ser  Gln Pro Ile Leu Phe  Ser Pro Lys Val Pro  Ala Tyr Ile 
    1640                 1645                 1650             


His Pro  Arg Lys Tyr Leu Val  Glu Thr Pro Pro Val  Asp Glu Thr 
    1655                 1660                 1665             


Pro Glu  Pro Ser Ala Glu Asn  Gln Ser Thr Glu Gly  Thr Pro Glu 
    1670                 1675                 1680             


Gln Pro  Pro Leu Ile Thr Glu  Asp Glu Thr Arg Thr  Arg Thr Pro 
    1685                 1690                 1695             


Glu Pro  Ile Ile Ile Glu Glu  Glu Glu Glu Asp Ser  Ile Ser Leu 
    1700                 1705                 1710             


Leu Ser  Asp Gly Pro Thr His  Gln Val Leu Gln Val  Glu Ala Asp 
    1715                 1720                 1725             


Ile His  Gly Pro Pro Ser Val  Ser Ser Ser Ser Trp  Ser Ile Pro 
    1730                 1735                 1740             


His Ala  Ser Asp Phe Asp Val  Asp Ser Leu Ser Ile  Leu Asp Thr 
    1745                 1750                 1755             


Leu Glu  Gly Ala Ser Val Thr  Ser Gly Ala Thr Ser  Ala Glu Thr 
    1760                 1765                 1770             


Asn Ser  Tyr Phe Ala Lys Ser  Met Glu Phe Leu Ala  Arg Pro Val 
    1775                 1780                 1785             


Pro Ala  Pro Arg Thr Val Phe  Arg Asn Pro Pro His  Pro Ala Pro 
    1790                 1795                 1800             


Arg Thr  Arg Thr Pro Ser Leu  Ala Pro Ser Arg Ala  Cys Ser Arg 
    1805                 1810                 1815             


Thr Ser  Leu Val Ser Thr Pro  Pro Gly Val Asn Arg  Val Ile Thr 
    1820                 1825                 1830             


Arg Glu  Glu Leu Glu Ala Leu  Thr Pro Ser Arg Thr  Pro Ser Arg 
    1835                 1840                 1845             


Ser Val  Ser Arg Thr Ser Leu  Val Ser Asn Pro Pro  Gly Val Asn 
    1850                 1855                 1860             


Arg Val  Ile Thr Arg Glu Glu  Phe Glu Ala Phe Val  Ala Gln Gln 
    1865                 1870                 1875             


Gln Arg  Phe Asp Ala Gly Ala  Tyr Ile Phe Ser Ser  Asp Thr Gly 
    1880                 1885                 1890             


Gln Gly  His Leu Gln Gln Lys  Ser Val Arg Gln Thr  Val Leu Ser 
    1895                 1900                 1905             


Glu Val  Val Leu Glu Arg Thr  Glu Leu Glu Ile Ser  Tyr Ala Pro 
    1910                 1915                 1920             


Arg Leu  Asp Gln Glu Lys Glu  Glu Leu Leu Arg Lys  Lys Leu Gln 
    1925                 1930                 1935             


Leu Asn  Pro Thr Pro Ala Asn  Arg Ser Arg Tyr Gln  Ser Arg Lys 
    1940                 1945                 1950             


Val Glu  Asn Met Lys Ala Ile  Thr Ala Arg Arg Ile  Leu Gln Gly 
    1955                 1960                 1965             


Leu Gly  His Tyr Leu Lys Ala  Glu Gly Lys Val Glu  Cys Tyr Arg 
    1970                 1975                 1980             


Thr Leu  His Pro Val Pro Leu  Tyr Ser Ser Ser Val  Asn Arg Ala 
    1985                 1990                 1995             


Phe Ser  Ser Pro Lys Val Ala  Val Glu Ala Cys Asn  Ala Met Leu 
    2000                 2005                 2010             


Lys Glu  Asn Phe Pro Thr Val  Ala Ser Tyr Cys Ile  Ile Pro Glu 
    2015                 2020                 2025             


Tyr Asp  Ala Tyr Leu Asp Met  Val Asp Gly Ala Ser  Cys Cys Leu 
    2030                 2035                 2040             


Asp Thr  Ala Ser Phe Cys Pro  Ala Lys Leu Arg Ser  Phe Pro Lys 
    2045                 2050                 2055             


Lys His  Ser Tyr Leu Glu Pro  Thr Ile Arg Ser Ala  Val Pro Ser 
    2060                 2065                 2070             


Ala Ile  Gln Asn Thr Leu Gln  Asn Val Leu Ala Ala  Ala Thr Lys 
    2075                 2080                 2085             


Arg Asn  Cys Asn Val Thr Gln  Met Arg Glu Leu Pro  Val Leu Asp 
    2090                 2095                 2100             


Ser Ala  Ala Phe Asn Val Glu  Cys Phe Lys Lys Tyr  Ala Cys Asn 
    2105                 2110                 2115             


Asn Glu  Tyr Trp Glu Thr Phe  Lys Glu Asn Pro Ile  Arg Leu Thr 
    2120                 2125                 2130             


Glu Glu  Asn Val Val Asn Tyr  Ile Thr Lys Leu Lys  Gly Pro Lys 
    2135                 2140                 2145             


Ala Ala  Ala Leu Phe Ala Lys  Thr His Asn Leu Asn  Met Leu Gln 
    2150                 2155                 2160             


Asp Ile  Pro Met Asp Arg Phe  Val Met Asp Leu Lys  Arg Asp Val 
    2165                 2170                 2175             


Lys Val  Thr Pro Gly Thr Lys  His Thr Glu Glu Arg  Pro Lys Val 
    2180                 2185                 2190             


Gln Val  Ile Gln Ala Ala Asp  Pro Leu Ala Thr Ala  Tyr Leu Cys 
    2195                 2200                 2205             


Gly Ile  His Arg Glu Leu Val  Arg Arg Leu Asn Ala  Val Leu Leu 
    2210                 2215                 2220             


Pro Asn  Ile His Thr Leu Phe  Asp Met Ser Ala Glu  Asp Phe Asp 
    2225                 2230                 2235             


Ala Ile  Ile Ala Glu His Phe  Gln Pro Gly Asp Cys  Val Leu Glu 
    2240                 2245                 2250             


Thr Asp  Ile Ala Ser Phe Asp  Lys Ser Glu Asp Asp  Ala Met Ala 
    2255                 2260                 2265             


Leu Thr  Ala Leu Met Ile Leu  Glu Asp Leu Gly Val  Asp Ala Glu 
    2270                 2275                 2280             


Leu Leu  Thr Leu Ile Glu Ala  Ala Phe Gly Glu Ile  Ser Ser Ile 
    2285                 2290                 2295             


His Leu  Pro Thr Lys Thr Lys  Phe Lys Phe Gly Ala  Met Met Lys 
    2300                 2305                 2310             


Ser Gly  Met Phe Leu Thr Leu  Phe Val Asn Thr Val  Ile Asn Ile 
    2315                 2320                 2325             


Val Ile  Ala Ser Arg Val Leu  Arg Glu Arg Leu Thr  Gly Ser Pro 
    2330                 2335                 2340             


Cys Ala  Ala Phe Ile Gly Asp  Asp Asn Ile Val Lys  Gly Val Lys 
    2345                 2350                 2355             


Ser Asp  Lys Leu Met Ala Asp  Arg Cys Ala Thr Trp  Leu Asn Met 
    2360                 2365                 2370             


Glu Val  Lys Ile Ile Asp Ala  Val Val Gly Glu Lys  Ala Pro Tyr 
    2375                 2380                 2385             


Phe Cys  Gly Gly Phe Ile Leu  Cys Asp Ser Val Thr  Gly Thr Ala 
    2390                 2395                 2400             


Cys Arg  Val Ala Asp Pro Leu  Lys Arg Leu Phe Lys  Leu Gly Lys 
    2405                 2410                 2415             


Pro Leu  Ala Ala Asp Asp Glu  His Asp Asp Asp Arg  Arg Arg Ala 
    2420                 2425                 2430             


Leu His  Glu Glu Ser Thr Arg  Trp Asn Arg Val Gly  Ile Leu Ser 
    2435                 2440                 2445             


Glu Leu  Cys Lys Ala Val Glu  Ser Arg Tyr Glu Thr  Val Gly Thr 
    2450                 2455                 2460             


Ser Ile  Ile Val Met Ala Met  Thr Thr Leu Ala Ser  Ser Val Lys 
    2465                 2470                 2475             


Ser Phe  Ser Tyr Leu Arg Gly  Ala Pro Ile Thr Leu  Tyr Gly 
    2480                 2485                 2490         


<210>  52
<211>  2493
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  non-structural polyprotein (P1234), PRT, eastern equine 
       encephalitis virus (EEEV)

<400>  52

Met Glu Lys Val His Val Asp Leu Asp Ala Asp Ser Pro Phe Val Lys 
1               5                   10                  15      


Ser Leu Gln Arg Cys Phe Pro His Phe Glu Ile Glu Ala Thr Gln Val 
            20                  25                  30          


Thr Asp Asn Asp His Ala Asn Ala Arg Ala Phe Ser His Leu Ala Thr 
        35                  40                  45              


Lys Leu Ile Glu Gly Glu Val Asp Thr Asp Gln Val Ile Leu Asp Ile 
    50                  55                  60                  


Gly Ser Ala Pro Val Arg His Thr His Ser Lys His Lys Tyr His Cys 
65                  70                  75                  80  


Ile Cys Pro Met Lys Ser Ala Glu Asp Pro Asp Arg Leu Tyr Arg Tyr 
                85                  90                  95      


Ala Asp Lys Leu Arg Lys Ser Asp Val Thr Asp Lys Cys Ile Ala Ser 
            100                 105                 110         


Lys Ala Ala Asp Leu Leu Thr Val Met Ser Thr Pro Asp Ala Glu Thr 
        115                 120                 125             


Pro Ser Leu Cys Met His Thr Asp Ser Thr Cys Arg Tyr His Gly Ser 
    130                 135                 140                 


Val Ala Val Tyr Gln Asp Val Tyr Ala Val His Ala Pro Thr Ser Ile 
145                 150                 155                 160 


Tyr Tyr Gln Ala Leu Lys Gly Val Arg Thr Ile Tyr Trp Ile Gly Phe 
                165                 170                 175     


Asp Thr Thr Pro Phe Met Tyr Lys Asn Met Ala Gly Ala Tyr Pro Thr 
            180                 185                 190         


Tyr Asn Thr Asn Trp Ala Asp Glu Ser Val Leu Glu Ala Arg Asn Ile 
        195                 200                 205             


Gly Leu Gly Ser Ser Asp Leu His Glu Lys Ser Phe Gly Lys Val Ser 
    210                 215                 220                 


Ile Met Arg Lys Lys Lys Leu Gln Pro Thr Asn Lys Val Ile Phe Ser 
225                 230                 235                 240 


Val Gly Ser Thr Ile Tyr Thr Glu Glu Arg Ile Leu Leu Arg Ser Trp 
                245                 250                 255     


His Leu Pro Asn Val Phe His Leu Lys Gly Lys Thr Ser Phe Thr Gly 
            260                 265                 270         


Arg Cys Asn Thr Ile Val Ser Cys Glu Gly Tyr Val Val Lys Lys Ile 
        275                 280                 285             


Thr Leu Ser Pro Gly Ile Tyr Gly Lys Val Asp Asn Leu Ala Ser Thr 
    290                 295                 300                 


Met His Arg Glu Gly Phe Leu Ser Cys Lys Val Thr Asp Thr Leu Arg 
305                 310                 315                 320 


Gly Glu Arg Val Ser Phe Pro Val Cys Thr Tyr Val Pro Ala Thr Leu 
                325                 330                 335     


Cys Asp Gln Met Thr Gly Ile Leu Ala Thr Asp Val Ser Val Asp Asp 
            340                 345                 350         


Ala Gln Lys Leu Leu Val Gly Leu Asn Gln Arg Ile Val Val Asn Gly 
        355                 360                 365             


Arg Thr Gln Arg Asn Thr Asn Thr Met Gln Asn Tyr Leu Leu Pro Val 
    370                 375                 380                 


Val Ala Gln Ala Phe Ser Arg Trp Ala Arg Glu His Arg Ala Asp Leu 
385                 390                 395                 400 


Glu Asp Glu Lys Gly Leu Gly Val Arg Glu Arg Ser Leu Val Met Gly 
                405                 410                 415     


Cys Cys Trp Ala Phe Lys Thr His Lys Ile Thr Ser Ile Tyr Lys Arg 
            420                 425                 430         


Pro Gly Thr Gln Thr Ile Lys Lys Val Pro Ala Val Phe Asn Ser Phe 
        435                 440                 445             


Val Ile Pro Gln Pro Thr Ser Tyr Gly Leu Asp Ile Gly Leu Arg Arg 
    450                 455                 460                 


Arg Ile Lys Met Leu Phe Asp Ala Lys Lys Ala Pro Ala Pro Ile Ile 
465                 470                 475                 480 


Thr Glu Ala Asp Val Ala His Leu Lys Gly Leu Gln Asp Glu Ala Glu 
                485                 490                 495     


Ala Val Ala Glu Ala Glu Ala Val Arg Ala Ala Leu Pro Pro Leu Leu 
            500                 505                 510         


Pro Glu Val Asp Lys Glu Thr Val Glu Ala Asp Ile Asp Leu Ile Met 
        515                 520                 525             


Gln Glu Ala Gly Ala Gly Ser Val Glu Thr Pro Arg Arg His Ile Lys 
    530                 535                 540                 


Val Thr Thr Tyr Pro Gly Glu Glu Met Ile Gly Ser Tyr Ala Val Leu 
545                 550                 555                 560 


Ser Pro Gln Ala Val Leu Asn Ser Glu Lys Leu Ala Cys Ile His Pro 
                565                 570                 575     


Leu Ala Glu Gln Val Leu Val Met Thr His Lys Gly Arg Ala Gly Arg 
            580                 585                 590         


Tyr Lys Val Glu Pro Tyr His Gly Arg Val Ile Val Pro Ser Gly Thr 
        595                 600                 605             


Ala Ile Pro Ile Leu Asp Phe Gln Ala Leu Ser Glu Ser Ala Thr Ile 
    610                 615                 620                 


Val Phe Asn Glu Arg Glu Phe Val Asn Arg Tyr Leu His His Ile Ala 
625                 630                 635                 640 


Val Asn Gly Gly Ala Leu Asn Thr Asp Glu Glu Tyr Tyr Lys Val Val 
                645                 650                 655     


Lys Ser Thr Glu Thr Asp Ser Glu Tyr Val Phe Asp Ile Asp Ala Lys 
            660                 665                 670         


Lys Cys Val Lys Lys Gly Asp Ala Gly Pro Met Cys Leu Val Gly Glu 
        675                 680                 685             


Leu Val Asp Pro Pro Phe His Glu Phe Ala Tyr Glu Ser Leu Lys Thr 
    690                 695                 700                 


Arg Pro Ala Ala Pro His Lys Val Pro Thr Ile Gly Val Tyr Gly Val 
705                 710                 715                 720 


Pro Gly Ser Gly Lys Ser Gly Ile Ile Lys Ser Ala Val Thr Lys Arg 
                725                 730                 735     


Asp Leu Val Val Ser Ala Lys Lys Glu Asn Cys Met Glu Ile Ile Lys 
            740                 745                 750         


Asp Val Lys Arg Met Arg Gly Met Asp Ile Ala Ala Arg Thr Val Asp 
        755                 760                 765             


Ser Val Leu Leu Asn Gly Val Lys His Ser Val Asp Thr Leu Tyr Ile 
    770                 775                 780                 


Asp Glu Ala Phe Ala Cys His Ala Gly Thr Leu Leu Ala Leu Ile Ala 
785                 790                 795                 800 


Ile Val Lys Pro Lys Lys Val Val Leu Cys Gly Asp Pro Lys Gln Cys 
                805                 810                 815     


Gly Phe Phe Asn Met Met Cys Leu Lys Val His Phe Asn His Glu Ile 
            820                 825                 830         


Cys Thr Glu Val Tyr His Lys Ser Ile Ser Arg Arg Cys Thr Lys Thr 
        835                 840                 845             


Val Thr Ser Ile Val Ser Thr Leu Phe Tyr Asp Lys Arg Met Arg Thr 
    850                 855                 860                 


Val Asn Pro Cys Asn Asp Lys Ile Ile Ile Asp Thr Thr Ser Thr Thr 
865                 870                 875                 880 


Lys Pro Leu Lys Asp Asp Ile Ile Leu Thr Cys Phe Arg Gly Trp Val 
                885                 890                 895     


Lys Gln Leu Gln Ile Asp Tyr Lys Asn His Glu Ile Met Thr Ala Ala 
            900                 905                 910         


Ala Ser Gln Gly Leu Thr Arg Lys Gly Val Tyr Ala Val Arg Tyr Lys 
        915                 920                 925             


Val Asn Glu Asn Pro Leu Tyr Ala Gln Thr Ser Glu His Val Asn Val 
    930                 935                 940                 


Leu Leu Thr Arg Thr Glu Lys Arg Ile Val Trp Lys Thr Leu Ala Gly 
945                 950                 955                 960 


Asp Pro Trp Ile Lys Thr Leu Thr Ala Ser Tyr Pro Gly Asn Phe Thr 
                965                 970                 975     


Ala Thr Leu Glu Glu Trp Gln Ala Glu His Asp Ala Ile Met Ala Lys 
            980                 985                 990         


Ile Leu Glu Thr Pro Ala Ser Ser  Asp Val Phe Gln Asn  Lys Val Asn 
        995                 1000                 1005             


Val Cys  Trp Ala Lys Ala Leu  Glu Pro Val Leu Ala  Thr Ala Asn 
    1010                 1015                 1020             


Ile Thr  Leu Thr Arg Ser Gln  Trp Glu Thr Ile Pro  Ala Phe Lys 
    1025                 1030                 1035             


Asp Asp  Lys Ala Tyr Ser Pro  Glu Met Ala Leu Asn  Phe Phe Cys 
    1040                 1045                 1050             


Thr Arg  Phe Phe Gly Val Asp  Ile Asp Ser Gly Leu  Phe Ser Ala 
    1055                 1060                 1065             


Pro Thr  Val Pro Leu Thr Tyr  Thr Asn Glu His Trp  Asp Asn Ser 
    1070                 1075                 1080             


Pro Gly  Pro Asn Met Tyr Gly  Leu Cys Met Arg Thr  Ala Lys Glu 
    1085                 1090                 1095             


Leu Ala  Arg Arg Tyr Pro Cys  Ile Leu Lys Ala Val  Asp Thr Gly 
    1100                 1105                 1110             


Arg Val  Ala Asp Val Arg Thr  Asp Thr Ile Lys Asp  Tyr Asn Pro 
    1115                 1120                 1125             


Leu Ile  Asn Val Val Pro Leu  Asn Arg Arg Leu Pro  His Ser Leu 
    1130                 1135                 1140             


Val Val  Thr His Arg Tyr Thr  Gly Asn Gly Asp Tyr  Ser Gln Leu 
    1145                 1150                 1155             


Val Thr  Lys Met Thr Gly Lys  Thr Val Leu Val Val  Gly Thr Pro 
    1160                 1165                 1170             


Met Asn  Ile Pro Gly Lys Arg  Val Glu Thr Leu Gly  Pro Ser Pro 
    1175                 1180                 1185             


Gln Cys  Thr Tyr Lys Ala Glu  Leu Asp Leu Gly Ile  Pro Ala Ala 
    1190                 1195                 1200             


Leu Gly  Lys Tyr Asp Ile Ile  Phe Ile Asn Val Arg  Thr Pro Tyr 
    1205                 1210                 1215             


Arg His  His His Tyr Gln Gln  Cys Glu Asp His Ala  Ile His His 
    1220                 1225                 1230             


Ser Met  Leu Thr Arg Lys Ala  Val Asp His Leu Asn  Lys Gly Gly 
    1235                 1240                 1245             


Thr Cys  Ile Ala Leu Gly Tyr  Gly Thr Ala Asp Arg  Ala Thr Glu 
    1250                 1255                 1260             


Asn Ile  Ile Ser Ala Val Ala  Arg Ser Phe Arg Phe  Ser Arg Val 
    1265                 1270                 1275             


Cys Gln  Pro Lys Cys Ala Trp  Glu Asn Thr Glu Val  Ala Phe Val 
    1280                 1285                 1290             


Phe Phe  Gly Lys Asp Asn Gly  Asn His Leu Gln Asp  Gln Asp Arg 
    1295                 1300                 1305             


Leu Ser  Val Val Leu Asn Asn  Ile Tyr Gln Gly Ser  Thr Gln His 
    1310                 1315                 1320             


Glu Ala  Gly Arg Ala Pro Ala  Tyr Arg Val Val Arg  Gly Asp Ile 
    1325                 1330                 1335             


Thr Lys  Ser Asn Asp Glu Val  Ile Val Asn Ala Ala  Asn Asn Lys 
    1340                 1345                 1350             


Gly Gln  Pro Gly Ser Gly Val  Cys Gly Ala Leu Tyr  Arg Lys Trp 
    1355                 1360                 1365             


Pro Gly  Ala Phe Asp Lys Gln  Pro Val Ala Thr Gly  Lys Ala His 
    1370                 1375                 1380             


Leu Val  Lys His Ser Pro Asn  Val Ile His Ala Val  Gly Pro Asn 
    1385                 1390                 1395             


Phe Ser  Arg Leu Ser Glu Asn  Glu Gly Asp Gln Lys  Leu Ser Glu 
    1400                 1405                 1410             


Val Tyr  Met Asp Ile Ala Arg  Ile Ile Asn Asn Glu  Arg Phe Thr 
    1415                 1420                 1425             


Lys Val  Ser Ile Pro Leu Leu  Ser Thr Gly Ile Tyr  Ala Gly Gly 
    1430                 1435                 1440             


Lys Asp  Arg Val Met Gln Ser  Leu Asn His Leu Phe  Thr Ala Met 
    1445                 1450                 1455             


Asp Thr  Thr Asp Ala Asp Ile  Thr Ile Tyr Cys Leu  Asp Lys Gln 
    1460                 1465                 1470             


Trp Glu  Ser Arg Ile Lys Glu  Ala Ile Thr Arg Lys  Glu Ser Val 
    1475                 1480                 1485             


Glu Glu  Leu Thr Glu Asp Asp  Arg Pro Val Asp Ile  Glu Leu Val 
    1490                 1495                 1500             


Arg Val  His Pro Leu Ser Ser  Leu Ala Gly Arg Pro  Gly Tyr Ser 
    1505                 1510                 1515             


Thr Thr  Glu Gly Lys Val Tyr  Ser Tyr Leu Glu Gly  Thr Arg Phe 
    1520                 1525                 1530             


His Gln  Thr Ala Lys Asp Ile  Ala Glu Ile Tyr Ala  Met Trp Pro 
    1535                 1540                 1545             


Asn Lys  Gln Glu Ala Asn Glu  Gln Ile Cys Leu Tyr  Val Leu Gly 
    1550                 1555                 1560             


Glu Ser  Met Asn Ser Ile Arg  Ser Lys Cys Pro Val  Glu Glu Ser 
    1565                 1570                 1575             


Glu Ala  Ser Ser Pro Pro His  Thr Ile Pro Cys Leu  Cys Asn Tyr 
    1580                 1585                 1590             


Ala Met  Thr Ala Glu Arg Val  Tyr Arg Leu Arg Met  Ala Lys Asn 
    1595                 1600                 1605             


Glu Gln  Phe Ala Val Cys Ser  Ser Phe Gln Leu Pro  Lys Tyr Arg 
    1610                 1615                 1620             


Ile Thr  Gly Val Gln Lys Ile  Gln Cys Ser Lys Pro  Val Ile Phe 
    1625                 1630                 1635             


Ser Gly  Thr Val Pro Pro Ala  Ile His Pro Arg Lys  Phe Ala Ser 
    1640                 1645                 1650             


Val Thr  Val Glu Asp Thr Pro  Val Val Gln Pro Glu  Arg Leu Val 
    1655                 1660                 1665             


Pro Arg  Arg Pro Ala Pro Pro  Val Pro Val Pro Ala  Arg Ile Pro 
    1670                 1675                 1680             


Ser Pro  Pro Cys Thr Ser Thr  Asn Gly Ser Thr Thr  Ser Ile Gln 
    1685                 1690                 1695             


Ser Leu  Gly Glu Asp Gln Ser  Ala Ser Ala Ser Ser  Gly Ala Glu 
    1700                 1705                 1710             


Ile Ser  Val Asp Gln Val Ser  Leu Trp Ser Ile Pro  Ser Ala Thr 
    1715                 1720                 1725             


Gly Phe  Asp Val Arg Thr Ser  Ser Ser Leu Ser Leu  Glu Gln Pro 
    1730                 1735                 1740             


Thr Phe  Pro Thr Met Val Val  Glu Ala Glu Ile His  Ala Ser Gln 
    1745                 1750                 1755             


Gly Ser  Leu Trp Ser Ile Pro  Ser Ile Thr Gly Ser  Glu Thr Arg 
    1760                 1765                 1770             


Ala Pro  Ser Pro Pro Ser Gln  Asp Ser Arg Pro Ser  Thr Pro Ser 
    1775                 1780                 1785             


Ala Ser  Gly Ser His Thr Ser  Val Asp Leu Ile Thr  Phe Asp Ser 
    1790                 1795                 1800             


Val Ala  Glu Ile Leu Glu Asp  Phe Ser Arg Ser Pro  Phe Gln Phe 
    1805                 1810                 1815             


Leu Ser  Glu Ile Lys Pro Ile  Pro Ala Pro Arg Thr  Arg Val Asn 
    1820                 1825                 1830             


Asn Met  Ser Arg Ser Ala Asp  Thr Ile Lys Pro Ile  Pro Lys Pro 
    1835                 1840                 1845             


Arg Lys  Cys Gln Val Lys Tyr  Thr Gln Pro Pro Gly  Val Ala Arg 
    1850                 1855                 1860             


Val Ile  Ser Ala Ala Glu Phe  Asp Glu Phe Val Arg  Arg His Ser 
    1865                 1870                 1875             


Asn Arg  Tyr Glu Ala Gly Ala  Tyr Ile Phe Ser Ser  Glu Thr Gly 
    1880                 1885                 1890             


Gln Gly  His Leu Gln Gln Lys  Ser Thr Arg Gln Cys  Lys Leu Gln 
    1895                 1900                 1905             


Tyr Pro  Ile Leu Glu Arg Ser  Val His Glu Lys Phe  Tyr Ala Pro 
    1910                 1915                 1920             


Arg Leu  Asp Leu Glu Arg Glu  Lys Leu Leu Gln Lys  Lys Leu Gln 
    1925                 1930                 1935             


Leu Cys  Ala Ser Glu Gly Asn  Arg Ser Arg Tyr Gln  Ser Arg Lys 
    1940                 1945                 1950             


Val Glu  Asn Met Lys Ala Ile  Thr Val Glu Arg Leu  Leu Gln Gly 
    1955                 1960                 1965             


Ile Gly  Ser Tyr Leu Ser Ala  Glu Pro Gln Pro Val  Glu Cys Tyr 
    1970                 1975                 1980             


Lys Val  Thr Tyr Pro Ala Pro  Met Tyr Ser Ser Thr  Ala Ser Asn 
    1985                 1990                 1995             


Ser Phe  Ser Ser Ala Glu Val  Ala Val Lys Val Cys  Asn Leu Val 
    2000                 2005                 2010             


Leu Gln  Glu Asn Phe Pro Thr  Val Ala Ser Tyr Asn  Ile Thr Asp 
    2015                 2020                 2025             


Glu Tyr  Asp Ala Tyr Leu Asp  Met Val Asp Gly Ala  Ser Cys Cys 
    2030                 2035                 2040             


Leu Asp  Thr Ala Thr Phe Cys  Pro Ala Lys Leu Arg  Ser Phe Pro 
    2045                 2050                 2055             


Lys Lys  His Ser Tyr Leu Arg  Pro Glu Ile Arg Ser  Ala Val Pro 
    2060                 2065                 2070             


Ser Pro  Ile Gln Asn Thr Leu  Gln Asn Val Leu Ala  Ala Ala Thr 
    2075                 2080                 2085             


Lys Arg  Asn Cys Asn Val Thr  Gln Met Arg Glu Leu  Pro Val Leu 
    2090                 2095                 2100             


Asp Ser  Ala Ala Phe Asn Val  Glu Cys Phe Lys Lys  Tyr Ala Cys 
    2105                 2110                 2115             


Asn Asp  Glu Tyr Trp Asp Phe  Tyr Lys Thr Asn Pro  Ile Arg Leu 
    2120                 2125                 2130             


Thr Ala  Glu Asn Val Thr Gln  Tyr Val Thr Lys Leu  Lys Gly Pro 
    2135                 2140                 2145             


Lys Ala  Ala Ala Leu Phe Ala  Lys Thr His Asn Leu  Gln Pro Leu 
    2150                 2155                 2160             


His Glu  Ile Pro Met Asp Arg  Phe Val Met Asp Leu  Lys Arg Asp 
    2165                 2170                 2175             


Val Lys  Val Thr Pro Gly Thr  Lys His Thr Glu Glu  Arg Pro Lys 
    2180                 2185                 2190             


Val Gln  Val Ile Gln Ala Ala  Asp Pro Leu Ala Thr  Ala Tyr Leu 
    2195                 2200                 2205             


Cys Gly  Ile His Arg Glu Leu  Val Arg Arg Leu Asn  Ala Val Leu 
    2210                 2215                 2220             


Leu Pro  Asn Ile His Thr Leu  Phe Asp Met Ser Ala  Glu Asp Phe 
    2225                 2230                 2235             


Asp Ala  Ile Ile Ala Glu His  Phe Gln Phe Gly Asp  Ala Val Leu 
    2240                 2245                 2250             


Glu Thr  Asp Ile Ala Ser Phe  Asp Lys Ser Glu Asp  Asp Ala Ile 
    2255                 2260                 2265             


Ala Met  Ser Ala Leu Met Ile  Leu Glu Asp Leu Gly  Val Asp Gln 
    2270                 2275                 2280             


Ala Leu  Leu Asn Leu Ile Glu  Ala Ala Phe Gly Asn  Ile Thr Ser 
    2285                 2290                 2295             


Val His  Leu Pro Thr Gly Thr  Arg Phe Lys Phe Gly  Ala Met Met 
    2300                 2305                 2310             


Lys Ser  Gly Met Phe Leu Thr  Leu Phe Ile Asn Thr  Val Val Asn 
    2315                 2320                 2325             


Ile Met  Ile Ala Ser Arg Val  Leu Arg Glu Arg Leu  Thr Thr Ser 
    2330                 2335                 2340             


Pro Cys  Ala Ala Phe Ile Gly  Asp Asp Asn Ile Val  Lys Gly Val 
    2345                 2350                 2355             


Thr Ser  Asp Ala Leu Met Ala  Glu Arg Cys Ala Thr  Trp Leu Asn 
    2360                 2365                 2370             


Met Glu  Val Lys Ile Ile Asp  Ala Val Val Gly Val  Lys Ala Pro 
    2375                 2380                 2385             


Tyr Phe  Cys Gly Gly Phe Ile  Val Val Asp Gln Ile  Thr Gly Thr 
    2390                 2395                 2400             


Ala Cys  Arg Val Ala Asp Pro  Leu Lys Arg Leu Phe  Lys Leu Gly 
    2405                 2410                 2415             


Lys Pro  Leu Pro Leu Asp Asp  Asp Gln Asp Val Asp  Arg Arg Arg 
    2420                 2425                 2430             


Ala Leu  His Asp Glu Ala Ala  Arg Trp Asn Arg Ile  Gly Ile Thr 
    2435                 2440                 2445             


Glu Glu  Leu Val Lys Ala Val  Glu Ser Arg Tyr Glu  Val Asn Tyr 
    2450                 2455                 2460             


Val Ser  Leu Ile Ile Thr Ala  Leu Thr Thr Leu Ala  Ser Ser Val 
    2465                 2470                 2475             


Ser Asn  Phe Lys His Ile Arg  Gly His Pro Ile Thr  Leu Tyr Gly 
    2480                 2485                 2490             


<210>  53
<211>  2431
<212>  PRT
<213>  Artificial sequence

<220>
<223>  non-structural polyprotein (P1234), PRT, Semliki Forest virus 
       (SFV)

<400>  53

Met Ala Ala Lys Val His Val Asp Ile Glu Ala Asp Ser Pro Phe Ile 
1               5                   10                  15      


Lys Ser Leu Gln Lys Ala Phe Pro Ser Phe Glu Val Glu Ser Leu Gln 
            20                  25                  30          


Val Thr Pro Asn Asp His Ala Asn Ala Arg Ala Phe Ser His Leu Ala 
        35                  40                  45              


Thr Lys Leu Ile Glu Gln Glu Thr Asp Lys Asp Thr Leu Ile Leu Asp 
    50                  55                  60                  


Ile Gly Ser Ala Pro Ser Arg Arg Met Met Ser Thr His Lys Tyr His 
65                  70                  75                  80  


Cys Val Cys Pro Met Arg Ser Ala Glu Asp Pro Glu Arg Leu Asp Ser 
                85                  90                  95      


Tyr Ala Lys Lys Leu Ala Ala Ala Ser Gly Lys Val Leu Asp Arg Glu 
            100                 105                 110         


Ile Ala Gly Lys Ile Thr Asp Leu Gln Thr Val Met Ala Thr Pro Asp 
        115                 120                 125             


Ala Glu Ser Pro Thr Phe Cys Leu His Thr Asp Val Thr Cys Arg Thr 
    130                 135                 140                 


Ala Ala Glu Val Ala Val Tyr Gln Asp Val Tyr Ala Val His Ala Pro 
145                 150                 155                 160 


Thr Ser Leu Tyr His Gln Ala Met Lys Gly Val Arg Thr Ala Tyr Trp 
                165                 170                 175     


Ile Gly Phe Asp Thr Thr Pro Phe Met Phe Asp Ala Leu Ala Gly Ala 
            180                 185                 190         


Tyr Pro Thr Tyr Ala Thr Asn Trp Ala Asp Glu Gln Val Leu Gln Ala 
        195                 200                 205             


Arg Asn Ile Gly Leu Cys Ala Ala Ser Leu Thr Glu Gly Arg Leu Gly 
    210                 215                 220                 


Lys Leu Ser Ile Leu Arg Lys Lys Gln Leu Lys Pro Cys Asp Thr Val 
225                 230                 235                 240 


Met Phe Ser Val Gly Ser Thr Leu Tyr Thr Glu Ser Arg Lys Leu Leu 
                245                 250                 255     


Arg Ser Trp His Leu Pro Ser Val Phe His Leu Lys Gly Lys Gln Ser 
            260                 265                 270         


Phe Thr Cys Arg Cys Asp Thr Ile Val Ser Cys Glu Gly Tyr Val Val 
        275                 280                 285             


Lys Lys Ile Thr Met Cys Pro Gly Leu Tyr Gly Lys Thr Val Gly Tyr 
    290                 295                 300                 


Ala Val Thr Tyr His Ala Glu Gly Phe Leu Val Cys Lys Thr Thr Asp 
305                 310                 315                 320 


Thr Val Lys Gly Glu Arg Val Ser Phe Pro Val Cys Thr Tyr Val Pro 
                325                 330                 335     


Ser Thr Ile Cys Asp Gln Met Thr Gly Ile Leu Ala Thr Asp Val Thr 
            340                 345                 350         


Pro Glu Asp Ala Gln Lys Leu Leu Val Gly Leu Asn Gln Arg Ile Val 
        355                 360                 365             


Val Asn Gly Arg Thr Gln Arg Asn Thr Asn Thr Met Lys Asn Tyr Leu 
    370                 375                 380                 


Leu Pro Ile Val Ala Val Ala Phe Ser Lys Trp Ala Arg Glu Tyr Lys 
385                 390                 395                 400 


Ala Asp Leu Asp Asp Glu Lys Pro Leu Gly Val Arg Glu Arg Ser Leu 
                405                 410                 415     


Thr Cys Cys Cys Leu Trp Ala Phe Lys Thr Arg Lys Met His Thr Met 
            420                 425                 430         


Tyr Lys Lys Pro Asp Thr Gln Thr Ile Val Lys Val Pro Ser Glu Phe 
        435                 440                 445             


Asn Ser Phe Val Ile Pro Ser Leu Trp Ser Thr Gly Leu Ala Ile Pro 
    450                 455                 460                 


Val Arg Ser Arg Ile Lys Met Leu Leu Ala Lys Lys Thr Lys Arg Glu 
465                 470                 475                 480 


Leu Ile Pro Val Leu Asp Ala Ser Ser Ala Arg Asp Ala Glu Gln Glu 
                485                 490                 495     


Glu Lys Glu Arg Leu Glu Ala Glu Leu Thr Arg Glu Ala Leu Pro Pro 
            500                 505                 510         


Leu Val Pro Ile Ala Pro Ala Glu Thr Gly Val Val Asp Val Asp Val 
        515                 520                 525             


Glu Glu Leu Glu Tyr His Ala Gly Ala Gly Val Val Glu Thr Pro Arg 
    530                 535                 540                 


Ser Ala Leu Lys Val Thr Ala Gln Pro Asn Asp Val Leu Leu Gly Asn 
545                 550                 555                 560 


Tyr Val Val Leu Ser Pro Gln Thr Val Leu Lys Ser Ser Lys Leu Ala 
                565                 570                 575     


Pro Val His Pro Leu Ala Glu Gln Val Lys Ile Ile Thr His Asn Gly 
            580                 585                 590         


Arg Ala Gly Gly Tyr Gln Val Asp Gly Tyr Asp Gly Arg Val Leu Leu 
        595                 600                 605             


Pro Cys Gly Ser Ala Ile Pro Val Pro Glu Phe Gln Ala Leu Ser Glu 
    610                 615                 620                 


Ser Ala Thr Met Val Tyr Asn Glu Arg Glu Phe Val Asn Arg Lys Leu 
625                 630                 635                 640 


Tyr His Ile Ala Val His Gly Pro Ser Leu Asn Thr Asp Glu Glu Asn 
                645                 650                 655     


Tyr Glu Lys Val Arg Ala Glu Arg Thr Asp Ala Glu Tyr Val Phe Asp 
            660                 665                 670         


Val Asp Lys Lys Cys Cys Val Lys Arg Glu Glu Ala Ser Gly Leu Val 
        675                 680                 685             


Leu Val Gly Glu Leu Thr Asn Pro Pro Phe His Glu Phe Ala Tyr Glu 
    690                 695                 700                 


Gly Leu Lys Ile Arg Pro Ser Ala Pro Tyr Lys Thr Thr Val Val Gly 
705                 710                 715                 720 


Val Phe Gly Val Pro Gly Ser Gly Lys Ser Ala Ile Ile Lys Ser Leu 
                725                 730                 735     


Val Thr Lys His Asp Leu Val Thr Ser Gly Lys Lys Glu Asn Cys Gln 
            740                 745                 750         


Glu Ile Val Asn Asp Val Lys Lys His Arg Gly Lys Gly Thr Ser Arg 
        755                 760                 765             


Glu Asn Ser Asp Ser Ile Leu Leu Asn Gly Cys Arg Arg Ala Val Asp 
    770                 775                 780                 


Ile Leu Tyr Val Asp Glu Ala Phe Ala Cys His Ser Gly Thr Leu Leu 
785                 790                 795                 800 


Ala Leu Ile Ala Leu Val Lys Pro Arg Ser Lys Val Val Leu Cys Gly 
                805                 810                 815     


Asp Pro Lys Gln Cys Gly Phe Phe Asn Met Met Gln Leu Lys Val Asn 
            820                 825                 830         


Phe Asn His Asn Ile Cys Thr Glu Val Cys His Lys Ser Ile Ser Arg 
        835                 840                 845             


Arg Cys Thr Arg Pro Val Thr Ala Ile Val Ser Thr Leu His Tyr Gly 
    850                 855                 860                 


Gly Lys Met Arg Thr Thr Asn Pro Cys Asn Lys Pro Ile Ile Ile Asp 
865                 870                 875                 880 


Thr Thr Gly Gln Thr Lys Pro Lys Pro Gly Asp Ile Val Leu Thr Cys 
                885                 890                 895     


Phe Arg Gly Trp Ala Lys Gln Leu Gln Leu Asp Tyr Arg Gly His Glu 
            900                 905                 910         


Val Met Thr Ala Ala Ala Ser Gln Gly Leu Thr Arg Lys Gly Val Tyr 
        915                 920                 925             


Ala Val Arg Gln Lys Val Asn Glu Asn Pro Leu Tyr Ala Pro Ala Ser 
    930                 935                 940                 


Glu His Val Asn Val Leu Leu Thr Arg Thr Glu Asp Arg Leu Val Trp 
945                 950                 955                 960 


Lys Thr Leu Ala Gly Asp Pro Trp Ile Lys Val Leu Ser Asn Ile Pro 
                965                 970                 975     


Gln Gly Asn Phe Thr Ala Thr Leu Glu Glu Trp Gln Glu Glu His Asp 
            980                 985                 990         


Lys Ile Met Lys Val Ile Glu Gly  Pro Ala Ala Pro Val  Asp Ala Phe 
        995                 1000                 1005             


Gln Asn  Lys Ala Asn Val Cys  Trp Ala Lys Ser Leu  Val Pro Val 
    1010                 1015                 1020             


Leu Asp  Thr Ala Gly Ile Arg  Leu Thr Ala Glu Glu  Trp Ser Thr 
    1025                 1030                 1035             


Ile Ile  Thr Ala Phe Lys Glu  Asp Arg Ala Tyr Ser  Pro Val Val 
    1040                 1045                 1050             


Ala Leu  Asn Glu Ile Cys Thr  Lys Tyr Tyr Gly Val  Asp Leu Asp 
    1055                 1060                 1065             


Ser Gly  Leu Phe Ser Ala Pro  Lys Val Ser Leu Tyr  Tyr Glu Asn 
    1070                 1075                 1080             


Asn His  Trp Asp Asn Arg Pro  Gly Gly Arg Met Tyr  Gly Phe Asn 
    1085                 1090                 1095             


Ala Ala  Thr Ala Ala Arg Leu  Glu Ala Arg His Thr  Phe Leu Lys 
    1100                 1105                 1110             


Gly Gln  Trp His Thr Gly Lys  Gln Ala Val Ile Ala  Glu Arg Lys 
    1115                 1120                 1125             


Ile Gln  Pro Leu Ser Val Leu  Asp Asn Val Ile Pro  Ile Asn Arg 
    1130                 1135                 1140             


Arg Leu  Pro His Ala Leu Val  Ala Glu Tyr Lys Thr  Val Lys Gly 
    1145                 1150                 1155             


Ser Arg  Val Glu Trp Leu Val  Asn Lys Val Arg Gly  Tyr His Val 
    1160                 1165                 1170             


Leu Leu  Val Ser Glu Tyr Asn  Leu Ala Leu Pro Arg  Arg Arg Val 
    1175                 1180                 1185             


Thr Trp  Leu Ser Pro Leu Asn  Val Thr Gly Ala Asp  Arg Cys Tyr 
    1190                 1195                 1200             


Asp Leu  Ser Leu Gly Leu Pro  Ala Asp Ala Gly Arg  Phe Asp Leu 
    1205                 1210                 1215             


Val Phe  Val Asn Ile His Thr  Glu Phe Arg Ile His  His Tyr Gln 
    1220                 1225                 1230             


Gln Cys  Val Asp His Ala Met  Lys Leu Gln Met Leu  Gly Gly Asp 
    1235                 1240                 1245             


Ala Leu  Arg Leu Leu Lys Pro  Gly Gly Ile Leu Met  Arg Ala Tyr 
    1250                 1255                 1260             


Gly Tyr  Ala Asp Lys Ile Ser  Glu Ala Val Val Ser  Ser Leu Ser 
    1265                 1270                 1275             


Arg Lys  Phe Ser Ser Ala Arg  Val Leu Arg Pro Asp  Cys Val Thr 
    1280                 1285                 1290             


Ser Asn  Thr Glu Val Phe Leu  Leu Phe Ser Asn Phe  Asp Asn Gly 
    1295                 1300                 1305             


Lys Arg  Pro Ser Thr Leu His  Gln Met Asn Thr Lys  Leu Ser Ala 
    1310                 1315                 1320             


Val Tyr  Ala Gly Glu Ala Met  His Thr Ala Gly Cys  Ala Pro Ser 
    1325                 1330                 1335             


Tyr Arg  Val Lys Arg Ala Asp  Ile Ala Thr Cys Thr  Glu Ala Ala 
    1340                 1345                 1350             


Val Val  Asn Ala Ala Asn Ala  Arg Gly Thr Val Gly  Asp Gly Val 
    1355                 1360                 1365             


Cys Arg  Ala Val Ala Lys Lys  Trp Pro Ser Ala Phe  Lys Gly Ala 
    1370                 1375                 1380             


Ala Thr  Pro Val Gly Thr Ile  Lys Thr Val Met Cys  Gly Ser Tyr 
    1385                 1390                 1395             


Pro Val  Ile His Ala Val Ala  Pro Asn Phe Ser Ala  Thr Thr Glu 
    1400                 1405                 1410             


Ala Glu  Gly Asp Arg Glu Leu  Ala Ala Val Tyr Arg  Ala Val Ala 
    1415                 1420                 1425             


Ala Glu  Val Asn Arg Leu Ser  Leu Ser Ser Val Ala  Ile Pro Leu 
    1430                 1435                 1440             


Leu Ser  Thr Gly Val Phe Ser  Gly Gly Arg Asp Arg  Leu Gln Gln 
    1445                 1450                 1455             


Ser Leu  Asn His Leu Phe Thr  Ala Met Asp Ala Thr  Asp Ala Asp 
    1460                 1465                 1470             


Val Thr  Ile Tyr Cys Arg Asp  Lys Ser Trp Glu Lys  Lys Ile Gln 
    1475                 1480                 1485             


Glu Ala  Ile Asp Met Arg Thr  Ala Val Glu Leu Leu  Asn Asp Asp 
    1490                 1495                 1500             


Val Glu  Leu Thr Thr Asp Leu  Val Arg Val His Pro  Asp Ser Ser 
    1505                 1510                 1515             


Leu Val  Gly Arg Lys Gly Tyr  Ser Thr Thr Asp Gly  Ser Leu Tyr 
    1520                 1525                 1530             


Ser Tyr  Phe Glu Gly Thr Lys  Phe Asn Gln Ala Ala  Ile Asp Met 
    1535                 1540                 1545             


Ala Glu  Ile Leu Thr Leu Trp  Pro Arg Leu Gln Glu  Ala Asn Glu 
    1550                 1555                 1560             


Arg Ile  Cys Leu Tyr Ala Leu  Gly Glu Thr Met Asp  Asn Ile Gly 
    1565                 1570                 1575             


Ser Lys  Cys Pro Val Asn Asp  Ser Asp Ser Ser Thr  Pro Pro Arg 
    1580                 1585                 1590             


Thr Val  Pro Cys Leu Cys Arg  Tyr Ala Met Thr Ala  Glu Arg Ile 
    1595                 1600                 1605             


Ala Arg  Leu Arg Ser His Gln  Val Lys Ser Met Val  Val Cys Ser 
    1610                 1615                 1620             


Ser Phe  Pro Leu Pro Lys Tyr  His Val Asp Gly Val  Gln Lys Val 
    1625                 1630                 1635             


Lys Cys  Glu Lys Val Leu Leu  Phe Asp Pro Thr Val  Pro Ser Val 
    1640                 1645                 1650             


Val Ser  Pro Arg Lys Tyr Ala  Ala Ser Thr Thr Asp  His Ser Asp 
    1655                 1660                 1665             


Arg Ser  Leu Arg Gly Phe Asp  Leu Asp Trp Thr Thr  Asp Ser Ser 
    1670                 1675                 1680             


Ser Thr  Ala Ser Asp Thr Met  Ser Leu Pro Ser Leu  Gln Ser Cys 
    1685                 1690                 1695             


Asp Ile  Asp Ser Ile Tyr Glu  Pro Met Ala Pro Ile  Val Val Thr 
    1700                 1705                 1710             


Ala Asp  Val His Pro Glu Pro  Ala Gly Ile Ala Asp  Leu Ala Ala 
    1715                 1720                 1725             


Asp Val  His Pro Glu Pro Ala  Asp His Val Asp Leu  Glu Asn Pro 
    1730                 1735                 1740             


Ile Pro  Pro Pro Arg Pro Lys  Arg Ala Ala Tyr Leu  Ala Ser Arg 
    1745                 1750                 1755             


Ala Ala  Glu Arg Pro Val Pro  Ala Pro Arg Lys Pro  Thr Pro Ala 
    1760                 1765                 1770             


Pro Arg  Thr Ala Phe Arg Asn  Lys Leu Pro Leu Thr  Phe Gly Asp 
    1775                 1780                 1785             


Phe Asp  Glu His Glu Val Asp  Ala Leu Ala Ser Gly  Ile Thr Phe 
    1790                 1795                 1800             


Gly Asp  Phe Asp Asp Val Leu  Arg Leu Gly Arg Ala  Gly Ala Tyr 
    1805                 1810                 1815             


Ile Phe  Ser Ser Asp Thr Gly  Ser Gly His Leu Gln  Gln Lys Ser 
    1820                 1825                 1830             


Val Arg  Gln His Asn Leu Gln  Cys Ala Gln Leu Asp  Ala Val Gln 
    1835                 1840                 1845             


Glu Glu  Lys Met Tyr Pro Pro  Lys Leu Asp Thr Glu  Arg Glu Lys 
    1850                 1855                 1860             


Leu Leu  Leu Leu Lys Met Gln  Met His Pro Ser Glu  Ala Asn Lys 
    1865                 1870                 1875             


Ser Arg  Tyr Gln Ser Arg Lys  Val Glu Asn Met Lys  Ala Thr Val 
    1880                 1885                 1890             


Val Asp  Arg Leu Thr Ser Gly  Ala Arg Leu Tyr Thr  Gly Ala Asp 
    1895                 1900                 1905             


Val Gly  Arg Ile Pro Thr Tyr  Ala Val Arg Tyr Pro  Arg Pro Val 
    1910                 1915                 1920             


Tyr Ser  Pro Thr Val Ile Glu  Arg Phe Ser Ser Pro  Asp Val Ala 
    1925                 1930                 1935             


Ile Ala  Ala Cys Asn Glu Tyr  Leu Ser Arg Asn Tyr  Pro Thr Val 
    1940                 1945                 1950             


Ala Ser  Tyr Gln Ile Thr Asp  Glu Tyr Asp Ala Tyr  Leu Asp Met 
    1955                 1960                 1965             


Val Asp  Gly Ser Asp Ser Cys  Leu Asp Arg Ala Thr  Phe Cys Pro 
    1970                 1975                 1980             


Ala Lys  Leu Arg Cys Tyr Pro  Lys His His Ala Tyr  His Gln Pro 
    1985                 1990                 1995             


Thr Val  Arg Ser Ala Val Pro  Ser Pro Phe Gln Asn  Thr Leu Gln 
    2000                 2005                 2010             


Asn Val  Leu Ala Ala Ala Thr  Lys Arg Asn Cys Asn  Val Thr Gln 
    2015                 2020                 2025             


Met Arg  Glu Leu Pro Thr Met  Asp Ser Ala Val Phe  Asn Val Glu 
    2030                 2035                 2040             


Cys Phe  Lys Arg Tyr Ala Cys  Ser Gly Glu Tyr Trp  Glu Glu Tyr 
    2045                 2050                 2055             


Ala Lys  Gln Pro Ile Arg Ile  Thr Thr Glu Asn Ile  Thr Thr Tyr 
    2060                 2065                 2070             


Val Thr  Lys Leu Lys Gly Pro  Lys Ala Ala Ala Leu  Phe Ala Lys 
    2075                 2080                 2085             


Thr His  Asn Leu Val Pro Leu  Gln Glu Val Pro Met  Asp Arg Phe 
    2090                 2095                 2100             


Thr Val  Asp Met Lys Arg Asp  Val Lys Val Thr Pro  Gly Thr Lys 
    2105                 2110                 2115             


His Thr  Glu Glu Arg Pro Lys  Val Gln Val Ile Gln  Ala Ala Glu 
    2120                 2125                 2130             


Pro Leu  Ala Thr Ala Tyr Leu  Cys Gly Ile His Arg  Glu Leu Val 
    2135                 2140                 2145             


Arg Arg  Leu Asn Ala Val Leu  Arg Pro Asn Val His  Thr Leu Phe 
    2150                 2155                 2160             


Asp Met  Ser Ala Glu Asp Phe  Asp Ala Ile Ile Ala  Ser His Phe 
    2165                 2170                 2175             


His Pro  Gly Asp Pro Val Leu  Glu Thr Asp Ile Ala  Ser Phe Asp 
    2180                 2185                 2190             


Lys Ser  Gln Asp Asp Ser Leu  Ala Leu Thr Gly Leu  Met Ile Leu 
    2195                 2200                 2205             


Glu Asp  Leu Gly Val Asp Gln  Tyr Leu Leu Asp Leu  Ile Glu Ala 
    2210                 2215                 2220             


Ala Phe  Gly Glu Ile Ser Ser  Cys His Leu Pro Thr  Gly Thr Arg 
    2225                 2230                 2235             


Phe Lys  Phe Gly Ala Met Met  Lys Ser Gly Met Phe  Leu Thr Leu 
    2240                 2245                 2250             


Phe Ile  Asn Thr Val Leu Asn  Ile Thr Ile Ala Ser  Arg Val Leu 
    2255                 2260                 2265             


Glu Gln  Arg Leu Thr Asp Ser  Ala Cys Ala Ala Phe  Ile Gly Asp 
    2270                 2275                 2280             


Asp Asn  Ile Val His Gly Val  Ile Ser Asp Lys Leu  Met Ala Glu 
    2285                 2290                 2295             


Arg Cys  Ala Ser Trp Val Asn  Met Glu Val Lys Ile  Ile Asp Ala 
    2300                 2305                 2310             


Val Met  Gly Glu Lys Pro Pro  Tyr Phe Cys Gly Gly  Phe Ile Val 
    2315                 2320                 2325             


Phe Asp  Ser Val Thr Gln Thr  Ala Cys Arg Val Ser  Asp Pro Leu 
    2330                 2335                 2340             


Lys Arg  Leu Phe Lys Leu Gly  Lys Pro Leu Thr Ala  Glu Asp Lys 
    2345                 2350                 2355             


Gln Asp  Glu Asp Arg Arg Arg  Ala Leu Ser Asp Glu  Val Ser Lys 
    2360                 2365                 2370             


Trp Phe  Arg Thr Gly Leu Gly  Ala Glu Leu Glu Val  Ala Leu Thr 
    2375                 2380                 2385             


Ser Arg  Tyr Glu Val Glu Gly  Cys Lys Ser Ile Leu  Ile Ala Met 
    2390                 2395                 2400             


Thr Thr  Leu Ala Arg Asp Ile  Lys Ala Phe Lys Lys  Leu Arg Gly 
    2405                 2410                 2415             


Pro Val  Ile His Leu Tyr Gly  Gly Pro Arg Leu Val  Arg 
    2420                 2425                 2430     


<210>  54
<211>  2512
<212>  PRT
<213>  Artificial sequence

<220>
<223>  non-structural polyprotein (P1234), PRT, Sindbis virus (SINV)

<400>  54

Met Glu Lys Pro Val Val Asn Val Asp Val Asp Pro Gln Ser Pro Phe 
1               5                   10                  15      


Val Val Gln Leu Gln Lys Ser Phe Pro Gln Phe Glu Val Val Ala Gln 
            20                  25                  30          


Gln Val Thr Pro Asn Asp His Ala Asn Ala Arg Ala Phe Ser His Leu 
        35                  40                  45              


Ala Ser Lys Leu Ile Glu Leu Glu Val Pro Thr Thr Ala Thr Ile Leu 
    50                  55                  60                  


Asp Ile Gly Ser Ala Pro Ala Arg Arg Met Phe Ser Glu His Gln Tyr 
65                  70                  75                  80  


His Cys Val Cys Pro Met Arg Ser Pro Glu Asp Pro Asp Arg Met Met 
                85                  90                  95      


Lys Tyr Ala Ser Lys Leu Ala Glu Lys Ala Cys Lys Ile Thr Asn Lys 
            100                 105                 110         


Asn Leu His Glu Lys Ile Lys Asp Leu Arg Thr Val Leu Asp Thr Pro 
        115                 120                 125             


Asp Ala Glu Thr Pro Ser Leu Cys Phe His Asn Asp Val Thr Cys Asn 
    130                 135                 140                 


Met Arg Ala Glu Tyr Ser Val Met Gln Asp Val Tyr Ile Asn Ala Pro 
145                 150                 155                 160 


Gly Thr Ile Tyr His Gln Ala Met Lys Gly Val Arg Thr Leu Tyr Trp 
                165                 170                 175     


Ile Gly Phe Asp Thr Thr Gln Phe Met Phe Ser Ala Met Ala Gly Ser 
            180                 185                 190         


Tyr Pro Ala Tyr Asn Thr Asn Trp Ala Asp Glu Lys Val Leu Glu Ala 
        195                 200                 205             


Arg Asn Ile Gly Leu Cys Ser Thr Lys Leu Ser Glu Gly Arg Thr Gly 
    210                 215                 220                 


Lys Leu Ser Ile Met Arg Lys Lys Glu Leu Lys Pro Gly Ser Arg Val 
225                 230                 235                 240 


Tyr Phe Ser Val Gly Ser Thr Leu Tyr Pro Glu His Arg Ala Ser Leu 
                245                 250                 255     


Gln Ser Trp His Leu Pro Ser Val Phe His Leu Asn Gly Lys Gln Ser 
            260                 265                 270         


Tyr Thr Cys Arg Cys Asp Thr Val Val Ser Cys Glu Gly Tyr Val Val 
        275                 280                 285             


Lys Lys Ile Thr Ile Ser Pro Gly Ile Thr Gly Glu Thr Val Gly Tyr 
    290                 295                 300                 


Ala Val Thr His Asn Ser Glu Gly Phe Leu Leu Cys Lys Val Thr Asp 
305                 310                 315                 320 


Thr Val Lys Gly Glu Arg Val Ser Phe Pro Val Cys Thr Tyr Ile Pro 
                325                 330                 335     


Ala Thr Ile Cys Asp Gln Met Thr Gly Ile Met Ala Thr Asp Ile Ser 
            340                 345                 350         


Pro Asp Asp Ala Gln Lys Leu Leu Val Gly Leu Asn Gln Arg Ile Val 
        355                 360                 365             


Ile Asn Gly Arg Thr Asn Arg Asn Thr Asn Thr Met Gln Asn Tyr Leu 
    370                 375                 380                 


Leu Pro Ile Ile Ala Gln Gly Phe Ser Lys Trp Ala Lys Glu Arg Lys 
385                 390                 395                 400 


Asp Asp Leu Asp Asn Glu Lys Met Leu Gly Thr Arg Glu Arg Lys Leu 
                405                 410                 415     


Thr Tyr Gly Cys Leu Trp Ala Phe Arg Thr Lys Lys Val His Ser Phe 
            420                 425                 430         


Tyr Arg Pro Pro Gly Thr Gln Thr Cys Val Lys Val Pro Ala Ser Phe 
        435                 440                 445             


Ser Ala Phe Pro Met Ser Ser Val Trp Thr Thr Ser Leu Pro Met Ser 
    450                 455                 460                 


Leu Arg Gln Lys Leu Lys Leu Ala Leu Gln Pro Lys Lys Glu Glu Lys 
465                 470                 475                 480 


Leu Leu Gln Val Ser Glu Glu Leu Val Met Glu Ala Lys Ala Ala Phe 
                485                 490                 495     


Glu Asp Ala Gln Glu Glu Ala Arg Ala Glu Lys Leu Arg Glu Ala Leu 
            500                 505                 510         


Pro Pro Leu Val Ala Asp Lys Gly Ile Glu Ala Ala Ala Glu Val Val 
        515                 520                 525             


Cys Glu Val Glu Gly Leu Gln Ala Asp Ile Gly Ala Ala Leu Val Glu 
    530                 535                 540                 


Thr Pro Arg Gly His Val Arg Ile Ile Pro Gln Ala Asn Asp Arg Met 
545                 550                 555                 560 


Ile Gly Gln Tyr Ile Val Val Ser Pro Asn Ser Val Leu Lys Asn Ala 
                565                 570                 575     


Lys Leu Ala Pro Ala His Pro Leu Ala Asp Gln Val Lys Ile Ile Thr 
            580                 585                 590         


His Ser Gly Arg Ser Gly Arg Tyr Ala Val Glu Pro Tyr Asp Ala Lys 
        595                 600                 605             


Val Leu Met Pro Ala Gly Gly Ala Val Pro Trp Pro Glu Phe Leu Ala 
    610                 615                 620                 


Leu Ser Glu Ser Ala Thr Leu Val Tyr Asn Glu Arg Glu Phe Val Asn 
625                 630                 635                 640 


Arg Lys Leu Tyr His Ile Ala Met His Gly Pro Ala Lys Asn Thr Glu 
                645                 650                 655     


Glu Glu Gln Tyr Lys Val Thr Lys Ala Glu Leu Ala Glu Thr Glu Tyr 
            660                 665                 670         


Val Phe Asp Val Asp Lys Lys Arg Cys Val Lys Lys Glu Glu Ala Ser 
        675                 680                 685             


Gly Leu Val Leu Ser Gly Glu Leu Thr Asn Pro Pro Tyr His Glu Leu 
    690                 695                 700                 


Ala Leu Glu Gly Leu Lys Thr Arg Pro Ala Val Pro Tyr Lys Val Glu 
705                 710                 715                 720 


Thr Ile Gly Val Ile Gly Thr Pro Gly Ser Gly Lys Ser Ala Ile Ile 
                725                 730                 735     


Lys Ser Thr Val Thr Ala Arg Asp Leu Val Thr Ser Gly Lys Lys Glu 
            740                 745                 750         


Asn Cys Arg Glu Ile Glu Ala Asp Val Leu Arg Leu Arg Gly Met Gln 
        755                 760                 765             


Ile Thr Ser Lys Thr Val Asp Ser Val Met Leu Asn Gly Cys His Lys 
    770                 775                 780                 


Ala Val Glu Val Leu Tyr Val Asp Glu Ala Phe Ala Cys His Ala Gly 
785                 790                 795                 800 


Ala Leu Leu Ala Leu Ile Ala Ile Val Arg Pro Arg Lys Lys Val Val 
                805                 810                 815     


Leu Cys Gly Asp Pro Met Gln Cys Gly Phe Phe Asn Met Met Gln Leu 
            820                 825                 830         


Lys Val His Phe Asn His Pro Glu Lys Asp Ile Cys Thr Lys Thr Phe 
        835                 840                 845             


Tyr Lys Tyr Ile Ser Arg Arg Cys Thr Gln Pro Val Thr Ala Ile Val 
    850                 855                 860                 


Ser Thr Leu His Tyr Asp Gly Lys Met Lys Thr Thr Asn Pro Cys Lys 
865                 870                 875                 880 


Lys Asn Ile Glu Ile Asp Ile Thr Gly Ala Thr Lys Pro Lys Pro Gly 
                885                 890                 895     


Asp Ile Ile Leu Thr Cys Phe Arg Gly Trp Val Lys Gln Leu Gln Ile 
            900                 905                 910         


Asp Tyr Pro Gly His Glu Val Met Thr Ala Ala Ala Ser Gln Gly Leu 
        915                 920                 925             


Thr Arg Lys Gly Val Tyr Ala Val Arg Gln Lys Val Asn Glu Asn Pro 
    930                 935                 940                 


Leu Tyr Ala Ile Thr Ser Glu His Val Asn Val Leu Leu Thr Arg Thr 
945                 950                 955                 960 


Glu Asp Arg Leu Val Trp Lys Thr Leu Gln Gly Asp Pro Trp Ile Lys 
                965                 970                 975     


Gln Pro Thr Asn Ile Pro Lys Gly Asn Phe Gln Ala Thr Ile Glu Asp 
            980                 985                 990         


Trp Glu Ala Glu His Lys Gly Ile  Ile Ala Ala Ile Asn  Ser Pro Thr 
        995                 1000                 1005             


Pro Arg  Ala Asn Pro Phe Ser  Cys Lys Thr Asn Val  Cys Trp Ala 
    1010                 1015                 1020             


Lys Ala  Leu Glu Pro Ile Leu  Ala Thr Ala Gly Ile  Val Leu Thr 
    1025                 1030                 1035             


Gly Cys  Gln Trp Ser Glu Leu  Phe Pro Gln Phe Ala  Asp Asp Lys 
    1040                 1045                 1050             


Pro His  Ser Ala Ile Tyr Ala  Leu Asp Val Ile Cys  Ile Lys Phe 
    1055                 1060                 1065             


Phe Gly  Met Asp Leu Thr Ser  Gly Leu Phe Ser Lys  Gln Ser Ile 
    1070                 1075                 1080             


Pro Leu  Thr Tyr His Pro Ala  Asp Ser Ala Arg Pro  Val Ala His 
    1085                 1090                 1095             


Trp Asp  Asn Ser Pro Gly Thr  Arg Lys Tyr Gly Tyr  Asp His Ala 
    1100                 1105                 1110             


Ile Ala  Ala Glu Leu Ser Arg  Arg Phe Pro Val Phe  Gln Leu Ala 
    1115                 1120                 1125             


Gly Lys  Gly Thr Gln Leu Asp  Leu Gln Thr Gly Arg  Thr Arg Val 
    1130                 1135                 1140             


Ile Ser  Ala Gln His Asn Leu  Val Pro Val Asn Arg  Asn Leu Pro 
    1145                 1150                 1155             


His Ala  Leu Val Pro Glu Tyr  Lys Glu Lys Gln Pro  Gly Pro Val 
    1160                 1165                 1170             


Lys Lys  Phe Leu Asn Gln Phe  Lys His His Ser Val  Leu Val Val 
    1175                 1180                 1185             


Ser Glu  Glu Lys Ile Glu Ala  Pro Arg Lys Arg Ile  Glu Trp Ile 
    1190                 1195                 1200             


Ala Pro  Ile Gly Ile Ala Gly  Ala Asp Lys Asn Tyr  Asn Leu Ala 
    1205                 1210                 1215             


Phe Gly  Phe Pro Pro Gln Ala  Arg Tyr Asp Leu Val  Phe Ile Asn 
    1220                 1225                 1230             


Ile Gly  Thr Lys Tyr Arg Asn  His His Phe Gln Gln  Cys Glu Asp 
    1235                 1240                 1245             


His Ala  Ala Thr Leu Lys Thr  Leu Ser Arg Ser Ala  Leu Asn Cys 
    1250                 1255                 1260             


Leu Asn  Pro Gly Gly Thr Leu  Val Val Lys Ser Tyr  Gly Tyr Ala 
    1265                 1270                 1275             


Asp Arg  Asn Ser Glu Asp Val  Val Thr Ala Leu Ala  Arg Lys Phe 
    1280                 1285                 1290             


Val Arg  Val Ser Ala Ala Arg  Pro Asp Cys Val Ser  Ser Asn Thr 
    1295                 1300                 1305             


Glu Met  Tyr Leu Ile Phe Arg  Gln Leu Asp Asn Ser  Arg Thr Arg 
    1310                 1315                 1320             


Gln Phe  Thr Pro His His Leu  Asn Cys Val Ile Ser  Ser Val Tyr 
    1325                 1330                 1335             


Glu Gly  Thr Arg Asp Gly Val  Gly Ala Ala Pro Ser  Tyr Arg Thr 
    1340                 1345                 1350             


Lys Arg  Glu Asn Ile Ala Asp  Cys Gln Glu Glu Ala  Val Val Asn 
    1355                 1360                 1365             


Ala Ala  Asn Pro Leu Gly Arg  Pro Gly Glu Gly Val  Cys Arg Ala 
    1370                 1375                 1380             


Ile Tyr  Lys Arg Trp Pro Thr  Ser Phe Thr Asp Ser  Ala Thr Glu 
    1385                 1390                 1395             


Thr Gly  Thr Ala Arg Met Thr  Val Cys Leu Gly Lys  Lys Val Ile 
    1400                 1405                 1410             


His Ala  Val Gly Pro Asp Phe  Arg Lys His Pro Glu  Ala Glu Ala 
    1415                 1420                 1425             


Leu Lys  Leu Leu Gln Asn Ala  Tyr His Ala Val Ala  Asp Leu Val 
    1430                 1435                 1440             


Asn Glu  His Asn Ile Lys Ser  Val Ala Ile Pro Leu  Leu Ser Thr 
    1445                 1450                 1455             


Gly Ile  Tyr Ala Ala Gly Lys  Asp Arg Leu Glu Val  Ser Leu Asn 
    1460                 1465                 1470             


Cys Leu  Thr Thr Ala Leu Asp  Arg Thr Asp Ala Asp  Val Thr Ile 
    1475                 1480                 1485             


Tyr Cys  Leu Asp Lys Lys Trp  Lys Glu Arg Ile Asp  Ala Ala Leu 
    1490                 1495                 1500             


Gln Leu  Lys Glu Ser Val Thr  Glu Leu Lys Asp Glu  Asp Met Glu 
    1505                 1510                 1515             


Ile Asp  Asp Glu Leu Val Trp  Ile His Pro Asp Ser  Cys Leu Lys 
    1520                 1525                 1530             


Gly Arg  Lys Gly Phe Ser Thr  Thr Lys Gly Lys Leu  Tyr Ser Tyr 
    1535                 1540                 1545             


Phe Glu  Gly Thr Lys Phe His  Gln Ala Ala Lys Asp  Met Ala Glu 
    1550                 1555                 1560             


Ile Lys  Val Leu Phe Pro Asn  Asp Gln Glu Ser Asn  Glu Gln Leu 
    1565                 1570                 1575             


Cys Ala  Tyr Ile Leu Gly Glu  Thr Met Glu Ala Ile  Arg Glu Lys 
    1580                 1585                 1590             


Cys Pro  Val Asp His Asn Pro  Ser Ser Ser Pro Pro  Lys Thr Leu 
    1595                 1600                 1605             


Pro Cys  Leu Cys Met Tyr Ala  Met Thr Pro Glu Arg  Val His Arg 
    1610                 1615                 1620             


Leu Arg  Ser Asn Asn Val Lys  Glu Val Thr Val Cys  Ser Ser Thr 
    1625                 1630                 1635             


Pro Leu  Pro Lys His Lys Ile  Lys Asn Val Gln Lys  Val Gln Cys 
    1640                 1645                 1650             


Thr Lys  Val Val Leu Phe Asn  Pro His Thr Pro Ala  Phe Val Pro 
    1655                 1660                 1665             


Ala Arg  Lys Tyr Ile Glu Val  Pro Glu Gln Pro Thr  Ala Pro Pro 
    1670                 1675                 1680             


Ala Gln  Ala Glu Glu Ala Pro  Glu Val Val Ala Thr  Pro Ser Pro 
    1685                 1690                 1695             


Ser Thr  Ala Asp Asn Thr Ser  Leu Asp Val Thr Asp  Ile Ser Leu 
    1700                 1705                 1710             


Asp Met  Asp Asp Ser Ser Glu  Gly Ser Leu Phe Ser  Ser Phe Ser 
    1715                 1720                 1725             


Gly Ser  Asp Asn Ser Ile Thr  Ser Met Asp Ser Trp  Ser Ser Gly 
    1730                 1735                 1740             


Pro Ser  Ser Leu Glu Ile Val  Asp Arg Arg Gln Val  Val Val Ala 
    1745                 1750                 1755             


Asp Val  His Ala Val Gln Glu  Pro Ala Pro Ile Pro  Pro Pro Arg 
    1760                 1765                 1770             


Leu Lys  Lys Met Ala Arg Leu  Ala Ala Ala Arg Lys  Glu Pro Thr 
    1775                 1780                 1785             


Pro Pro  Ala Ser Asn Ser Ser  Glu Ser Leu His Leu  Ser Phe Gly 
    1790                 1795                 1800             


Gly Val  Ser Met Ser Leu Gly  Ser Ile Phe Asp Gly  Glu Thr Ala 
    1805                 1810                 1815             


Arg Gln  Ala Ala Val Gln Pro  Leu Ala Thr Gly Pro  Thr Asp Val 
    1820                 1825                 1830             


Pro Met  Ser Phe Gly Ser Phe  Ser Asp Gly Glu Ile  Asp Glu Leu 
    1835                 1840                 1845             


Ser Arg  Arg Val Thr Glu Ser  Glu Pro Val Leu Phe  Gly Ser Phe 
    1850                 1855                 1860             


Glu Pro  Gly Glu Val Asn Ser  Ile Ile Ser Ser Arg  Ser Ala Val 
    1865                 1870                 1875             


Ser Phe  Pro Leu Arg Lys Gln  Arg Arg Arg Arg Arg  Ser Arg Arg 
    1880                 1885                 1890             


Thr Glu  Tyr Leu Thr Gly Val  Gly Gly Tyr Ile Phe  Ser Thr Asp 
    1895                 1900                 1905             


Thr Gly  Pro Gly His Leu Gln  Lys Lys Ser Val Leu  Gln Asn Gln 
    1910                 1915                 1920             


Leu Thr  Glu Pro Thr Leu Glu  Arg Asn Val Leu Glu  Arg Ile His 
    1925                 1930                 1935             


Ala Pro  Val Leu Asp Thr Ser  Lys Glu Glu Gln Leu  Lys Leu Arg 
    1940                 1945                 1950             


Tyr Gln  Met Met Pro Thr Glu  Ala Asn Lys Ser Arg  Tyr Gln Ser 
    1955                 1960                 1965             


Arg Lys  Val Glu Asn Gln Lys  Ala Ile Thr Thr Glu  Arg Leu Leu 
    1970                 1975                 1980             


Ser Gly  Leu Arg Leu Tyr Asn  Ser Ala Thr Asp Gln  Pro Glu Cys 
    1985                 1990                 1995             


Tyr Lys  Ile Thr Tyr Pro Lys  Pro Leu Tyr Ser Ser  Ser Val Pro 
    2000                 2005                 2010             


Ala Asn  Tyr Ser Asp Pro Gln  Phe Ala Val Ala Val  Cys Asn Asn 
    2015                 2020                 2025             


Tyr Leu  His Glu Asn Tyr Pro  Thr Val Ala Ser Tyr  Gln Ile Thr 
    2030                 2035                 2040             


Asp Glu  Tyr Asp Ala Tyr Leu  Asp Met Val Asp Gly  Thr Val Ala 
    2045                 2050                 2055             


Cys Leu  Asp Thr Ala Thr Phe  Cys Pro Ala Lys Leu  Arg Ser Tyr 
    2060                 2065                 2070             


Pro Lys  Lys His Glu Tyr Arg  Ala Pro Asn Ile Arg  Ser Ala Val 
    2075                 2080                 2085             


Pro Ser  Ala Met Gln Asn Thr  Leu Gln Asn Val Leu  Ile Ala Ala 
    2090                 2095                 2100             


Thr Lys  Arg Asn Cys Asn Val  Thr Gln Met Arg Glu  Leu Pro Thr 
    2105                 2110                 2115             


Leu Asp  Ser Ala Thr Phe Asn  Val Glu Cys Phe Arg  Lys Tyr Ala 
    2120                 2125                 2130             


Cys Asn  Asp Glu Tyr Trp Glu  Glu Phe Ala Arg Lys  Pro Ile Arg 
    2135                 2140                 2145             


Ile Thr  Thr Glu Phe Val Thr  Ala Tyr Val Ala Arg  Leu Lys Gly 
    2150                 2155                 2160             


Pro Lys  Ala Ala Ala Leu Phe  Ala Lys Thr Tyr Asn  Leu Val Pro 
    2165                 2170                 2175             


Leu Gln  Glu Val Pro Met Asp  Arg Phe Val Met Asp  Met Lys Arg 
    2180                 2185                 2190             


Asp Val  Lys Val Thr Pro Gly  Thr Lys His Thr Glu  Glu Arg Pro 
    2195                 2200                 2205             


Lys Val  Gln Val Ile Gln Ala  Ala Glu Pro Leu Ala  Thr Ala Tyr 
    2210                 2215                 2220             


Leu Cys  Gly Ile His Arg Glu  Leu Val Arg Arg Leu  Thr Ala Val 
    2225                 2230                 2235             


Leu Leu  Pro Asn Ile His Thr  Leu Phe Asp Met Ser  Ala Glu Asp 
    2240                 2245                 2250             


Phe Asp  Ala Ile Ile Ala Glu  His Phe Lys Gln Gly  Asp Pro Val 
    2255                 2260                 2265             


Leu Glu  Thr Asp Ile Ala Ser  Phe Asp Lys Ser Gln  Asp Asp Ala 
    2270                 2275                 2280             


Met Ala  Leu Thr Gly Leu Met  Ile Leu Glu Asp Leu  Gly Val Asp 
    2285                 2290                 2295             


Gln Pro  Leu Leu Asp Leu Ile  Glu Cys Ala Phe Gly  Glu Ile Ser 
    2300                 2305                 2310             


Ser Thr  His Leu Pro Thr Gly  Thr Arg Phe Lys Phe  Gly Ala Met 
    2315                 2320                 2325             


Met Lys  Ser Gly Met Phe Leu  Thr Leu Phe Val Asn  Thr Val Leu 
    2330                 2335                 2340             


Asn Val  Val Ile Ala Ser Arg  Val Leu Glu Glu Arg  Leu Lys Thr 
    2345                 2350                 2355             


Ser Arg  Cys Ala Ala Phe Ile  Gly Asp Asp Asn Ile  Ile His Gly 
    2360                 2365                 2370             


Val Val  Ser Asp Lys Glu Met  Ala Glu Arg Cys Ala  Thr Trp Leu 
    2375                 2380                 2385             


Asn Met  Glu Val Lys Ile Ile  Asp Ala Val Ile Gly  Glu Arg Pro 
    2390                 2395                 2400             


Pro Tyr  Phe Cys Gly Gly Phe  Ile Leu Gln Asp Ser  Val Thr Ser 
    2405                 2410                 2415             


Thr Ala  Cys Arg Val Ala Asp  Pro Leu Lys Arg Leu  Phe Lys Leu 
    2420                 2425                 2430             


Gly Lys  Pro Leu Pro Ala Asp  Asp Glu Gln Asp Glu  Asp Arg Arg 
    2435                 2440                 2445             


Arg Ala  Leu Leu Asp Glu Thr  Lys Ala Trp Phe Arg  Val Gly Ile 
    2450                 2455                 2460             


Thr Gly  Thr Leu Ala Val Ala  Val Thr Thr Arg Tyr  Glu Val Asp 
    2465                 2470                 2475             


Asn Ile  Thr Pro Val Leu Leu  Ala Leu Arg Thr Phe  Ala Gln Ser 
    2480                 2485                 2490             


Lys Arg  Ala Phe Gln Ala Ile  Arg Gly Glu Ile Lys  His Leu Tyr 
    2495                 2500                 2505             


Gly Gly  Pro Lys 
    2510         


<210>  55
<211>  2474
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  non-structural polyprotein (P1234), PRT, Chikungunya virus 
       (CHIKV)

<400>  55

Met Asp Pro Val Tyr Val Asp Ile Asp Ala Asp Ser Ala Phe Leu Lys 
1               5                   10                  15      


Ala Leu Gln Arg Ala Tyr Pro Met Phe Glu Val Glu Pro Arg Gln Val 
            20                  25                  30          


Thr Pro Asn Asp His Ala Asn Ala Arg Ala Phe Ser His Leu Ala Ile 
        35                  40                  45              


Lys Leu Ile Glu Gln Glu Ile Asp Pro Asp Ser Thr Ile Leu Asp Ile 
    50                  55                  60                  


Gly Ser Ala Pro Ala Arg Arg Met Met Ser Asp Arg Lys Tyr His Cys 
65                  70                  75                  80  


Val Cys Pro Met Arg Ser Ala Glu Asp Pro Glu Arg Leu Ala Asn Tyr 
                85                  90                  95      


Ala Arg Lys Leu Ala Ser Ala Ala Gly Lys Val Leu Asp Arg Asn Ile 
            100                 105                 110         


Ser Gly Lys Ile Gly Asp Leu Gln Ala Val Met Ala Val Pro Asp Thr 
        115                 120                 125             


Glu Thr Pro Thr Phe Cys Leu His Thr Asp Val Ser Cys Arg Gln Arg 
    130                 135                 140                 


Ala Asp Val Ala Ile Tyr Gln Asp Val Tyr Ala Val His Ala Pro Thr 
145                 150                 155                 160 


Ser Leu Tyr His Gln Ala Ile Lys Gly Val Arg Leu Ala Tyr Trp Val 
                165                 170                 175     


Gly Phe Asp Thr Thr Pro Phe Met Tyr Asn Ala Met Ala Gly Ala Tyr 
            180                 185                 190         


Pro Ser Tyr Ser Thr Asn Trp Ala Asp Glu Gln Val Leu Lys Ala Lys 
        195                 200                 205             


Asn Ile Gly Leu Cys Ser Thr Asp Leu Thr Glu Gly Arg Arg Gly Lys 
    210                 215                 220                 


Leu Ser Ile Met Arg Gly Lys Lys Leu Glu Pro Cys Asp Arg Val Leu 
225                 230                 235                 240 


Phe Ser Val Gly Ser Thr Leu Tyr Pro Glu Ser Arg Lys Leu Leu Lys 
                245                 250                 255     


Ser Trp His Leu Pro Ser Val Phe His Leu Lys Gly Lys Leu Ser Phe 
            260                 265                 270         


Thr Cys Arg Cys Asp Thr Val Val Ser Cys Glu Gly Tyr Val Val Lys 
        275                 280                 285             


Arg Ile Thr Met Ser Pro Gly Leu Tyr Gly Lys Thr Thr Gly Tyr Ala 
    290                 295                 300                 


Val Thr His His Ala Asp Gly Phe Leu Met Cys Lys Thr Thr Asp Thr 
305                 310                 315                 320 


Val Asp Gly Glu Arg Val Ser Phe Ser Val Cys Thr Tyr Val Pro Ala 
                325                 330                 335     


Thr Ile Cys Asp Gln Met Thr Gly Ile Leu Ala Thr Glu Val Thr Pro 
            340                 345                 350         


Glu Asp Ala Gln Lys Leu Leu Val Gly Leu Asn Gln Arg Ile Val Val 
        355                 360                 365             


Asn Gly Arg Thr Gln Arg Asn Thr Asn Thr Met Lys Asn Tyr Met Ile 
    370                 375                 380                 


Pro Val Val Ala Gln Ala Phe Ser Lys Trp Ala Lys Glu Cys Arg Lys 
385                 390                 395                 400 


Asp Met Glu Asp Glu Lys Leu Leu Gly Val Arg Glu Arg Thr Leu Thr 
                405                 410                 415     


Cys Cys Cys Leu Trp Ala Phe Lys Lys Gln Lys Thr His Thr Val Tyr 
            420                 425                 430         


Lys Arg Pro Asp Thr Gln Ser Ile Gln Lys Val Gln Ala Glu Phe Asp 
        435                 440                 445             


Ser Phe Val Val Pro Ser Leu Trp Ser Ser Gly Leu Ser Ile Pro Leu 
    450                 455                 460                 


Arg Thr Arg Ile Lys Trp Leu Leu Ser Lys Val Pro Lys Thr Asp Leu 
465                 470                 475                 480 


Thr Pro Tyr Ser Gly Asp Ala Gln Glu Ala Arg Asp Ala Glu Lys Glu 
                485                 490                 495     


Ala Glu Glu Glu Arg Glu Ala Glu Leu Thr Leu Glu Ala Leu Pro Pro 
            500                 505                 510         


Leu Gln Ala Ala Gln Glu Asp Val Gln Val Glu Ile Asp Val Glu Gln 
        515                 520                 525             


Leu Glu Asp Arg Ala Gly Ala Gly Ile Ile Glu Thr Pro Arg Gly Ala 
    530                 535                 540                 


Ile Lys Val Thr Ala Gln Pro Thr Asp His Val Val Gly Glu Tyr Leu 
545                 550                 555                 560 


Val Leu Ser Pro Gln Thr Val Leu Arg Ser Gln Lys Leu Ser Leu Ile 
                565                 570                 575     


His Ala Leu Ala Glu Gln Val Lys Thr Cys Thr His Ser Gly Arg Ala 
            580                 585                 590         


Gly Arg Tyr Ala Val Glu Ala Tyr Asp Gly Arg Val Leu Val Pro Ser 
        595                 600                 605             


Gly Tyr Ala Ile Ser Pro Glu Asp Phe Gln Ser Leu Ser Glu Ser Ala 
    610                 615                 620                 


Thr Met Val Tyr Asn Glu Arg Glu Phe Val Asn Arg Lys Leu His His 
625                 630                 635                 640 


Ile Ala Met His Gly Pro Ala Leu Asn Thr Asp Glu Glu Ser Tyr Glu 
                645                 650                 655     


Leu Val Arg Ala Glu Arg Thr Glu His Glu Tyr Val Tyr Asp Val Asp 
            660                 665                 670         


Gln Arg Arg Cys Cys Lys Lys Glu Glu Ala Ala Gly Leu Val Leu Val 
        675                 680                 685             


Gly Asp Leu Thr Asn Pro Pro Tyr His Glu Phe Ala Tyr Glu Gly Leu 
    690                 695                 700                 


Lys Ile Arg Pro Ala Cys Pro Tyr Lys Ile Ala Val Ile Gly Val Phe 
705                 710                 715                 720 


Gly Val Pro Gly Ser Gly Lys Ser Ala Ile Ile Lys Asn Leu Val Thr 
                725                 730                 735     


Arg Gln Asp Leu Val Thr Ser Gly Lys Lys Glu Asn Cys Gln Glu Ile 
            740                 745                 750         


Thr Thr Asp Val Met Arg Gln Arg Gly Leu Glu Ile Ser Ala Arg Thr 
        755                 760                 765             


Val Asp Ser Leu Leu Leu Asn Gly Cys Asn Arg Pro Val Asp Val Leu 
    770                 775                 780                 


Tyr Val Asp Glu Ala Phe Ala Cys His Ser Gly Thr Leu Leu Ala Leu 
785                 790                 795                 800 


Ile Ala Leu Val Arg Pro Arg Gln Lys Val Val Leu Cys Gly Asp Pro 
                805                 810                 815     


Lys Gln Cys Gly Phe Phe Asn Met Met Gln Met Lys Val Asn Tyr Asn 
            820                 825                 830         


His Asn Ile Cys Thr Gln Val Tyr His Lys Ser Ile Ser Arg Arg Cys 
        835                 840                 845             


Thr Leu Pro Val Thr Ala Ile Val Ser Ser Leu His Tyr Glu Gly Lys 
    850                 855                 860                 


Met Arg Thr Thr Asn Glu Tyr Asn Lys Pro Ile Val Val Asp Thr Thr 
865                 870                 875                 880 


Gly Ser Thr Lys Pro Asp Pro Gly Asp Leu Val Leu Thr Cys Phe Arg 
                885                 890                 895     


Gly Trp Val Lys Gln Leu Gln Ile Asp Tyr Arg Gly His Glu Val Met 
            900                 905                 910         


Thr Ala Ala Ala Ser Gln Gly Leu Thr Arg Lys Gly Val Tyr Ala Val 
        915                 920                 925             


Arg Gln Lys Val Asn Glu Asn Pro Leu Tyr Ala Ser Thr Ser Glu His 
    930                 935                 940                 


Val Asn Val Leu Leu Thr Arg Thr Glu Gly Lys Leu Val Trp Lys Thr 
945                 950                 955                 960 


Leu Ser Gly Asp Pro Trp Ile Lys Thr Leu Gln Asn Pro Pro Lys Gly 
                965                 970                 975     


Asn Phe Lys Ala Thr Ile Lys Glu Trp Glu Val Glu His Ala Ser Ile 
            980                 985                 990         


Met Ala Gly Ile Cys Ser His Gln  Met Thr Phe Asp Thr  Phe Gln Asn 
        995                 1000                 1005             


Lys Ala  Asn Val Cys Trp Ala  Lys Ser Leu Val Pro  Ile Leu Glu 
    1010                 1015                 1020             


Thr Ala  Gly Ile Lys Leu Asn  Asp Arg Gln Trp Ser  Gln Ile Ile 
    1025                 1030                 1035             


Gln Ala  Phe Lys Glu Asp Lys  Ala Tyr Ser Pro Glu  Val Ala Leu 
    1040                 1045                 1050             


Asn Glu  Ile Cys Thr Arg Met  Tyr Gly Val Asp Leu  Asp Ser Gly 
    1055                 1060                 1065             


Leu Phe  Ser Lys Pro Leu Val  Ser Val Tyr Tyr Ala  Asp Asn His 
    1070                 1075                 1080             


Trp Asp  Asn Arg Pro Gly Gly  Lys Met Phe Gly Phe  Asn Pro Glu 
    1085                 1090                 1095             


Ala Ala  Ser Ile Leu Glu Arg  Lys Tyr Pro Phe Thr  Lys Gly Lys 
    1100                 1105                 1110             


Trp Asn  Ile Asn Lys Gln Ile  Cys Val Thr Thr Arg  Arg Ile Glu 
    1115                 1120                 1125             


Asp Phe  Asn Pro Thr Thr Asn  Ile Ile Pro Ala Asn  Arg Arg Leu 
    1130                 1135                 1140             


Pro His  Ser Leu Val Ala Glu  His Arg Pro Val Lys  Gly Glu Arg 
    1145                 1150                 1155             


Met Glu  Trp Leu Val Asn Lys  Ile Asn Gly His His  Val Leu Leu 
    1160                 1165                 1170             


Val Ser  Gly Cys Ser Leu Ala  Leu Pro Thr Lys Arg  Val Thr Trp 
    1175                 1180                 1185             


Val Ala  Pro Leu Gly Val Arg  Gly Ala Asp Tyr Thr  Tyr Asn Leu 
    1190                 1195                 1200             


Glu Leu  Gly Leu Pro Ala Thr  Leu Gly Arg Tyr Asp  Leu Val Val 
    1205                 1210                 1215             


Ile Asn  Ile His Thr Pro Phe  Arg Ile His His Tyr  Gln Gln Cys 
    1220                 1225                 1230             


Val Asp  His Ala Met Lys Leu  Gln Met Leu Gly Gly  Asp Ser Leu 
    1235                 1240                 1245             


Arg Leu  Leu Lys Pro Gly Gly  Ser Leu Leu Ile Arg  Ala Tyr Gly 
    1250                 1255                 1260             


Tyr Ala  Asp Arg Thr Ser Glu  Arg Val Ile Cys Val  Leu Gly Arg 
    1265                 1270                 1275             


Lys Phe  Arg Ser Ser Arg Ala  Leu Lys Pro Pro Cys  Val Thr Ser 
    1280                 1285                 1290             


Asn Thr  Glu Met Phe Phe Leu  Phe Ser Asn Phe Asp  Asn Gly Arg 
    1295                 1300                 1305             


Arg Asn  Phe Thr Thr His Val  Met Asn Asn Gln Leu  Asn Ala Ala 
    1310                 1315                 1320             


Phe Val  Gly Gln Ala Thr Arg  Ala Gly Cys Ala Pro  Ser Tyr Arg 
    1325                 1330                 1335             


Val Lys  Arg Met Asp Ile Ala  Lys Asn Asp Glu Glu  Cys Val Val 
    1340                 1345                 1350             


Asn Ala  Ala Asn Pro Arg Gly  Leu Pro Gly Asp Gly  Val Cys Lys 
    1355                 1360                 1365             


Ala Val  Tyr Lys Lys Trp Pro  Glu Ser Phe Lys Asn  Ser Ala Thr 
    1370                 1375                 1380             


Pro Val  Gly Thr Ala Lys Thr  Val Met Cys Gly Thr  Tyr Pro Val 
    1385                 1390                 1395             


Ile His  Ala Val Gly Pro Asn  Phe Ser Asn Tyr Ser  Glu Ser Glu 
    1400                 1405                 1410             


Gly Asp  Arg Glu Leu Ala Ala  Ala Tyr Arg Glu Val  Ala Lys Glu 
    1415                 1420                 1425             


Val Thr  Arg Leu Gly Val Asn  Ser Val Ala Ile Pro  Leu Leu Ser 
    1430                 1435                 1440             


Thr Gly  Val Tyr Ser Gly Gly  Lys Asp Arg Leu Thr  Gln Ser Leu 
    1445                 1450                 1455             


Asn His  Leu Phe Thr Ala Met  Asp Ser Thr Asp Ala  Asp Val Val 
    1460                 1465                 1470             


Ile Tyr  Cys Arg Asp Lys Glu  Trp Glu Lys Lys Ile  Ser Glu Ala 
    1475                 1480                 1485             


Ile Gln  Met Arg Thr Gln Val  Glu Leu Leu Asp Glu  His Ile Ser 
    1490                 1495                 1500             


Ile Asp  Cys Asp Val Val Arg  Val His Pro Asp Ser  Ser Leu Ala 
    1505                 1510                 1515             


Gly Arg  Lys Gly Tyr Ser Thr  Thr Glu Gly Ala Leu  Tyr Ser Tyr 
    1520                 1525                 1530             


Leu Glu  Gly Thr Arg Phe His  Gln Thr Ala Val Asp  Met Ala Glu 
    1535                 1540                 1545             


Ile Tyr  Thr Met Trp Pro Lys  Gln Thr Glu Ala Asn  Glu Gln Val 
    1550                 1555                 1560             


Cys Leu  Tyr Ala Leu Gly Glu  Ser Ile Glu Ser Ile  Arg Gln Lys 
    1565                 1570                 1575             


Cys Pro  Val Asp Asp Ala Asp  Ala Ser Ser Pro Pro  Lys Thr Val 
    1580                 1585                 1590             


Pro Cys  Leu Cys Arg Tyr Ala  Met Thr Pro Glu Arg  Val Thr Arg 
    1595                 1600                 1605             


Leu Arg  Met Asn His Val Thr  Ser Ile Ile Val Cys  Ser Ser Phe 
    1610                 1615                 1620             


Pro Leu  Pro Lys Tyr Lys Ile  Glu Gly Val Gln Lys  Val Lys Cys 
    1625                 1630                 1635             


Ser Lys  Val Met Leu Phe Asp  His Asn Val Pro Ser  Arg Val Ser 
    1640                 1645                 1650             


Pro Arg  Glu Tyr Arg Pro Ser  Gln Glu Ser Val Gln  Glu Ala Ser 
    1655                 1660                 1665             


Thr Thr  Thr Ser Leu Thr His  Ser Gln Phe Asp Leu  Ser Val Asp 
    1670                 1675                 1680             


Gly Lys  Ile Leu Pro Val Pro  Ser Asp Leu Asp Ala  Asp Ala Pro 
    1685                 1690                 1695             


Ala Leu  Glu Pro Ala Leu Asp  Asp Gly Ala Ile His  Thr Leu Pro 
    1700                 1705                 1710             


Ser Ala  Thr Gly Asn Leu Ala  Ala Val Ser Asp Trp  Val Met Ser 
    1715                 1720                 1725             


Thr Val  Pro Val Ala Pro Pro  Arg Arg Arg Arg Gly  Arg Asn Leu 
    1730                 1735                 1740             


Thr Val  Thr Cys Asp Glu Arg  Glu Gly Asn Ile Thr  Pro Met Ala 
    1745                 1750                 1755             


Ser Val  Arg Phe Phe Arg Ala  Glu Leu Cys Pro Val  Val Gln Glu 
    1760                 1765                 1770             


Thr Ala  Glu Thr Arg Asp Thr  Ala Met Ser Leu Gln  Ala Pro Pro 
    1775                 1780                 1785             


Ser Thr  Ala Thr Glu Leu Ser  His Pro Pro Ile Ser  Phe Gly Ala 
    1790                 1795                 1800             


Pro Ser  Glu Thr Phe Pro Ile  Thr Phe Gly Asp Phe  Asn Glu Gly 
    1805                 1810                 1815             


Glu Ile  Glu Ser Leu Ser Ser  Glu Leu Leu Thr Phe  Gly Asp Phe 
    1820                 1825                 1830             


Leu Pro  Gly Glu Val Asp Asp  Leu Thr Asp Ser Asp  Trp Ser Thr 
    1835                 1840                 1845             


Cys Ser  Asp Thr Asp Asp Glu  Leu Arg Leu Asp Arg  Ala Gly Gly 
    1850                 1855                 1860             


Tyr Ile  Phe Ser Ser Asp Thr  Gly Pro Gly His Leu  Gln Gln Lys 
    1865                 1870                 1875             


Ser Val  Arg Gln Ser Val Leu  Pro Val Asn Thr Leu  Glu Glu Val 
    1880                 1885                 1890             


His Glu  Glu Lys Cys Tyr Pro  Pro Lys Leu Asp Glu  Ala Lys Glu 
    1895                 1900                 1905             


Gln Leu  Leu Leu Lys Lys Leu  Gln Glu Ser Ala Ser  Met Ala Asn 
    1910                 1915                 1920             


Arg Ser  Arg Tyr Gln Ser Arg  Lys Val Glu Asn Met  Lys Ala Thr 
    1925                 1930                 1935             


Ile Ile  Gln Arg Leu Lys Arg  Gly Cys Arg Leu Tyr  Leu Met Ser 
    1940                 1945                 1950             


Glu Thr  Pro Lys Val Pro Thr  Tyr Arg Thr Thr Tyr  Pro Ala Pro 
    1955                 1960                 1965             


Val Tyr  Ser Pro Pro Ile Asn  Val Arg Leu Ser Asn  Pro Glu Ser 
    1970                 1975                 1980             


Ala Val  Ala Ala Cys Asn Glu  Phe Leu Ala Arg Asn  Tyr Pro Thr 
    1985                 1990                 1995             


Val Ser  Ser Tyr Gln Ile Thr  Asp Glu Tyr Asp Ala  Tyr Leu Asp 
    2000                 2005                 2010             


Met Val  Asp Gly Ser Glu Ser  Cys Leu Asp Arg Ala  Thr Phe Asn 
    2015                 2020                 2025             


Pro Ser  Lys Leu Arg Ser Tyr  Pro Lys Gln His Ala  Tyr His Ala 
    2030                 2035                 2040             


Pro Ser  Ile Arg Ser Ala Val  Pro Ser Pro Phe Gln  Asn Thr Leu 
    2045                 2050                 2055             


Gln Asn  Val Leu Ala Ala Ala  Thr Lys Arg Asn Cys  Asn Val Thr 
    2060                 2065                 2070             


Gln Met  Arg Glu Leu Pro Thr  Leu Asp Ser Ala Val  Phe Asn Val 
    2075                 2080                 2085             


Glu Cys  Phe Lys Lys Phe Ala  Cys Asn Gln Glu Tyr  Trp Glu Glu 
    2090                 2095                 2100             


Phe Ala  Ala Ser Pro Ile Arg  Ile Thr Thr Glu Asn  Leu Thr Thr 
    2105                 2110                 2115             


Tyr Val  Thr Lys Leu Lys Gly  Pro Lys Ala Ala Ala  Leu Phe Ala 
    2120                 2125                 2130             


Lys Thr  His Asn Leu Leu Pro  Leu Gln Glu Val Pro  Met Asp Arg 
    2135                 2140                 2145             


Phe Thr  Val Asp Met Lys Arg  Asp Val Lys Val Thr  Pro Gly Thr 
    2150                 2155                 2160             


Lys His  Thr Glu Glu Arg Pro  Lys Val Gln Val Ile  Gln Ala Ala 
    2165                 2170                 2175             


Glu Pro  Leu Ala Thr Ala Tyr  Leu Cys Gly Ile His  Arg Glu Leu 
    2180                 2185                 2190             


Val Arg  Arg Leu Asn Ala Val  Leu Leu Pro Asn Val  His Thr Leu 
    2195                 2200                 2205             


Phe Asp  Met Ser Ala Glu Asp  Phe Asp Ala Ile Ile  Ala Ala His 
    2210                 2215                 2220             


Phe Lys  Pro Gly Asp Thr Val  Leu Glu Thr Asp Ile  Ala Ser Phe 
    2225                 2230                 2235             


Asp Lys  Ser Gln Asp Asp Ser  Leu Ala Leu Thr Ala  Leu Met Leu 
    2240                 2245                 2250             


Leu Glu  Asp Leu Gly Val Asp  His Ser Leu Leu Asp  Leu Ile Glu 
    2255                 2260                 2265             


Ala Ala  Phe Gly Glu Ile Ser  Ser Cys His Leu Pro  Thr Gly Thr 
    2270                 2275                 2280             


Arg Phe  Lys Phe Gly Ala Met  Met Lys Ser Gly Met  Phe Leu Thr 
    2285                 2290                 2295             


Leu Phe  Val Asn Thr Leu Leu  Asn Ile Thr Ile Ala  Ser Arg Val 
    2300                 2305                 2310             


Leu Glu  Asp Arg Leu Thr Lys  Ser Ala Cys Ala Ala  Phe Ile Gly 
    2315                 2320                 2325             


Asp Asp  Asn Ile Ile His Gly  Val Val Ser Asp Glu  Leu Met Ala 
    2330                 2335                 2340             


Ala Arg  Cys Ala Thr Trp Met  Asn Met Glu Val Lys  Ile Ile Asp 
    2345                 2350                 2355             


Ala Val  Val Ser Gln Lys Ala  Pro Tyr Phe Cys Gly  Gly Phe Ile 
    2360                 2365                 2370             


Leu His  Asp Ile Val Thr Gly  Thr Ala Cys Arg Val  Ala Asp Pro 
    2375                 2380                 2385             


Leu Lys  Arg Leu Phe Lys Leu  Gly Lys Pro Leu Ala  Ala Gly Asp 
    2390                 2395                 2400             


Glu Gln  Asp Glu Asp Arg Arg  Arg Ala Leu Ala Asp  Glu Val Val 
    2405                 2410                 2415             


Arg Trp  Gln Arg Thr Gly Leu  Ile Asp Glu Leu Glu  Lys Ala Val 
    2420                 2425                 2430             


Tyr Ser  Arg Tyr Glu Val Gln  Gly Ile Ser Val Val  Val Met Ser 
    2435                 2440                 2445             


Met Ala  Thr Phe Ala Ser Ser  Arg Ser Asn Phe Glu  Lys Leu Arg 
    2450                 2455                 2460             


Gly Pro  Val Val Thr Leu Tyr  Gly Gly Pro Lys 
    2465                 2470                 


<210>  56
<211>  14
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  483-495 of CHIKV, PRT, chikungunya virus

<400>  56

Asn Glu Gly Glu Ile Glu Ser Leu Ser Ser Glu Leu Leu Thr 
1               5                   10                  


<210>  57
<211>  20
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  494-512 of SINV, PRT, Sindbis virus

<400>  57

Ser Asp Gly Glu Ile Asp Glu Leu Ser Arg Arg Val Thr Thr Glu Ser 
1               5                   10                  15      


Glu Pro Val Leu 
            20  


<210>  58
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  448-459 of SFV, PRT, Semliki Forest virus

<400>  58

Asp Glu His Glu Val Asp Ala Leu Ala Ser Gly Ile Thr 
1               5                   10              


<210>  59
<211>  30
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  501-530 of CHIKV, PRT, chikungunya virus

<400>  59

Leu Pro Gly Glu Val Asp Asp Leu Thr Asp Ser Asp Trp Ser Thr Cys 
1               5                   10                  15      


Ser Asp Thr Asp Asp Glu Leu Arg Leu Asp Arg Ala Gly Gly 
            20                  25                  30  


<210>  60
<211>  33
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  517-549 of SINV, PRT, Sindbis virus

<400>  60

Glu Pro Gly Glu Val Asn Ser Ile Ile Ser Ser Arg Ser Ala Val Ser 
1               5                   10                  15      


Phe Pro Leu Arg Lys Gln Arg Arg Arg Arg Arg Ser Arg Arg Thr Glu 
            20                  25                  30          


Tyr 
    


<210>  61
<211>  11
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  465-475 of SFV, PRT, Semliki Forest virus

<400>  61

Asp Asp Val Leu Arg Leu Gly Arg Ala Gly Ala 
1               5                   10      


<210>  62
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  nsP2/nsP3 junction, PRT, Venezuelan equine encephalitis virus

<400>  62

Leu His Glu Ala Gly Cys Ala Pro Ser Tyr 
1               5                   10  


<210>  63
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  nsP3/nsP4 junction, PRT, Venezuelan equine encephalitis virus

<400>  63

Arg Phe Asp Ala Gly Ala Tyr Ile Phe Ser 
1               5                   10  


<210>  64
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  nsP2/nsP3 junction, PRT, eastern equine encephalitis virus

<400>  64

Gln His Glu Ala Gly Arg Ala Pro Ala Tyr 
1               5                   10  


<210>  65
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  nsP3/nsP4 junction, PRT, eastern equine encephalitis virus

<400>  65

Arg Tyr Glu Ala Gly Ala Tyr Ile Phe Ser 
1               5                   10  


<210>  66
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  nsP2/nsP3 junction, PRT, western equine encephalitis virus

<400>  66

Arg Tyr Glu Ala Gly Arg Ala Pro Ala Tyr 
1               5                   10  


<210>  67
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  nsP3/nsP4 junction, PRT, western equine encephalitis virus

<400>  67

Arg Tyr Glu Ala Gly Ala Tyr Ile Phe Ser 
1               5                   10  


