                         SEQUENCE LISTING

<110>  Janssen Sciences Ireland Unlimited Company
 
<120>  Arenavirus Vectors for Hepatitis V Virus (HBV) Vaccines and Uses
       Thereof	 
       
<130>  065814.11194/11WO1

<150>  US 62/862813
<151>  2019-06-18

<160>  29    

<170>  PatentIn version 3.5

<210>  1
<211>  444
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  HBV truncated core antigen gene

<400>  1
gacatcgacc cttacaagga gttcggcgcc agcgtggaac tgctgtcttt tctgcccagt       60

gatttctttc cttccattcg agacctgctg gataccgcct ctgctctgta tcgggaagcc      120

ctggagagcc cagaacactg ctccccacac cataccgctc tgcgacaggc aatcctgtgc      180

tggggggagc tgatgaacct ggccacatgg gtgggatcga atctggagga ccccgcttca      240

cgggaactgg tggtcagcta cgtgaacgtc aatatgggcc tgaaaatccg ccagctgctg      300

tggttccata ttagctgcct gacttttgga cgagagaccg tgctggaata cctggtgtcc      360

ttcggcgtct ggattcgcac tccccctgct tatcgaccac ccaacgcacc aattctgtcc      420

accctgcccg agaccacagt ggtc                                             444


<210>  2
<211>  148
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HBV truncated core antigen

<400>  2

Asp Ile Asp Pro Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu Ser 
1               5                   10                  15      


Phe Leu Pro Ser Asp Phe Phe Pro Ser Ile Arg Asp Leu Leu Asp Thr 
            20                  25                  30          


Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser 
        35                  40                  45              


Pro His His Thr Ala Leu Arg Gln Ala Ile Leu Cys Trp Gly Glu Leu 
    50                  55                  60                  


Met Asn Leu Ala Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala Ser 
65                  70                  75                  80  


Arg Glu Leu Val Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys Ile 
                85                  90                  95      


Arg Gln Leu Leu Trp Phe His Ile Ser Cys Leu Thr Phe Gly Arg Glu 
            100                 105                 110         


Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp Ile Arg Thr Pro 
        115                 120                 125             


Pro Ala Tyr Arg Pro Pro Asn Ala Pro Ile Leu Ser Thr Leu Pro Glu 
    130                 135                 140                 


Thr Thr Val Val 
145             


<210>  3
<211>  447
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  HBV truncated core antigen gene

<400>  3
atggacatcg acccttacaa ggagttcggc gccagcgtgg aactgctgtc ttttctgccc       60

agtgatttct ttccttccat tcgagacctg ctggataccg cctctgctct gtatcgggaa      120

gccctggaga gcccagaaca ctgctcccca caccataccg ctctgcgaca ggcaatcctg      180

tgctgggggg agctgatgaa cctggccaca tgggtgggat ccaatctgga ggaccccgct      240

tcacgggaac tggtggtcag ctacgtgaac gtcaatatgg gcctgaaaat ccgccagctg      300

ctgtggttcc atattagctg cctgactttt ggacgagaga ccgtgctgga atacctggtg      360

tccttcggcg tctggatccg cactccccct gcttatcgac cacccaacgc accaattctg      420

tccaccctgc ccgagaccac agtggtc                                          447


<210>  4
<211>  149
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HBV truncated core antigen

<400>  4

Met Asp Ile Asp Pro Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu 
1               5                   10                  15      


Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Ile Arg Asp Leu Leu Asp 
            20                  25                  30          


Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
        35                  40                  45              


Ser Pro His His Thr Ala Leu Arg Gln Ala Ile Leu Cys Trp Gly Glu 
    50                  55                  60                  


Leu Met Asn Leu Ala Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala 
65                  70                  75                  80  


Ser Arg Glu Leu Val Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys 
                85                  90                  95      


Ile Arg Gln Leu Leu Trp Phe His Ile Ser Cys Leu Thr Phe Gly Arg 
            100                 105                 110         


Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp Ile Arg Thr 
        115                 120                 125             


Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro Ile Leu Ser Thr Leu Pro 
    130                 135                 140                 


Glu Thr Thr Val Val 
145                 


<210>  5
<211>  2529
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  HBV pol antigen gene

<400>  5
atgcccctgt cttaccagca ctttagaaag cttctgctgc tggacgatga agccgggcct       60

ctggaggaag agctgccaag gctggcagac gaggggctga accggagagt ggccgaagat      120

ctgaatctgg gaaacctgaa cgtgagcatc ccttggactc ataaagtcgg caacttcacc      180

gggctgtaca gctccacagt gcctgtcttc aatccagagt ggcagacacc atcctttccc      240

aacattcacc tgcaggagga catcattaat agatgcgaac agttcgtggg acctctgaca      300

gtcaacgaaa agaggcgcct gaaactgatc atgcctgcca ggttttaccc aaatgtgact      360

aagtatctgc cactggataa gggcatcaag ccttactatc cagagcacct ggtgaaccat      420

tacttccaga ctagacacta tctgcatacc ctgtggaagg ccggaatcct gtacaaacga      480

gaaactaccc ggagtgcttc attttgtggc tccccatatt cttgggaaca ggagctgcag      540

catggcaggc tggtgttcca gaccagcaca cgccacgggg atgagtcctt ttgccagcag      600

tctagtggca tcctgagcag atcccccgtg gggccttgtc tgcagtctca gctgcggaag      660

agtagactgg gactgcagcc acagcaggga cacctggcac gacggcagca gggaaggtct      720

ggcagtatcc gggctagagt gcatcccaca actagaaggc ctttcggcgt cgagccatca      780

ggaagcggcc acaccacaaa caccgcatca agctcctcta gttgcctgca tcagtcagcc      840

gtgagaaagg ccgcttacag ccacctgtcc acatctaaaa ggcactcaag ctccgggcat      900

gctgtggagc tgcacaacat ccctccaaat tctgcacgca gtcagtcaga aggacccgtg      960

ttcagctgct ggtggctgca gtttcggaac tcaaagcctt gcagcgacta ttgtctgagc     1020

catattgtga atctgctgga ggattggggc ccttgtaccg agcacgggga acaccatatc     1080

aggattccac gaacaccagc acgagtgact ggaggggtgt tcctggtgga caagaacccc     1140

cacaatacta ccgagagccg gctggtggtc gatttcagtc agttttcaag aggcaacaca     1200

agggtgtcat ggcccaaatt cgccgtccct aatctgcaga gtctgactaa cctgctgtct     1260

agtaatctga gctggctgtc cctggacgtg tccgcagcct tttaccacct gcctctgcat     1320

ccagctgcaa tgccccatct gctggtgggg tcaagcggac tgagtcgcta cgtcgcccga     1380

ctgtcctcta actcacgcat cattaatcac cagcatggca ccatgcagaa cctgcacgat     1440

agctgttccc ggaatctgta cgtgtctctg ctgctgctgt ataagacatt cggcagaaaa     1500

ctgcacctgt acagccatcc tatcattctg gggtttagga agatcccaat gggagtggga     1560

ctgagcccct tcctgctggc acagtttacc tccgccattt gctctgtggt ccgccgagcc     1620

ttcccacact gtctggcttt ttcctatatg aacaatgtgg tcctgggcgc caaatccgtg     1680

cagcatctgg agtctctgtt cacagctgtc actaactttc tgctgagcct ggggatccac     1740

ctgaacccaa ataagactaa acgctggggg tacagcctga atttcatggg atatgtgatt     1800

ggatcctggg ggaccctgcc acaggagcac atcgtgcaga agatcaagga atgctttcgg     1860

aagctgcccg tcaacagacc tatcgactgg aaagtgtgcc agcggattgt cggactgctg     1920

ggcttcgccg ctccctttac ccagtgcggg tacccagcac tgatgcccct gtatgcctgt     1980

atccagtcta agcaggcttt cacctttagt cctacataca aggcattcct gtgcaaacag     2040

tacctgaacc tgtatccagt ggcaaggcag cgacctggac tgtgccaggt ctttgcaaat     2100

gccactccta ccggctgggg gctggctatc ggacatcagc gaatgcgggg cacattcgtg     2160

gcccccctgc ctattcacac tgctcagctg ctggcagcct gctttgctag atctaggagt     2220

ggagcaaagc tgatcggcac cgacaatagt gtggtcctgt caagaaaata cacatccttc     2280

ccatggctgc tgggatgtgc tgcaaactgg attctgaggg gcaccagctt cgtgtacgtc     2340

ccctcagccc tgaatcctgc tgacgatcca tcccgcgggc gactgggact gtaccgacct     2400

ctgctgagac tgcccttcag gcctacaact ggccggacat ctctgtatgc cgattcacca     2460

agcgtgccct cacacctgcc tgacagagtc cactttgctt cacccctgca cgtcgcttgg     2520

cggcctcca                                                             2529


<210>  6
<211>  2529
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  HBV pol antigen gene

<400>  6
atgcccctgt cttaccagca ctttagaaag ctgctgctgc tggacgatga agccgggcct       60

ctggaggaag agctgccaag gctggcagac gaggggctga accggagagt ggccgaagat      120

ctgaatctgg gaaacctgaa cgtgagcatc ccttggactc ataaagtcgg caacttcacc      180

gggctgtaca gctccacagt gcctgtcttc aatccagagt ggcagacacc atcctttccc      240

aacattcacc tgcaggagga catcattaat agatgcgaac agttcgtggg acctctgaca      300

gtcaacgaaa agaggcgcct gaaactgatc atgcctgcca ggttttaccc aaatgtgact      360

aagtatctgc cactggataa gggcatcaag ccttactatc cagagcacct ggtgaaccat      420

tacttccaga ctagacacta tctgcatacc ctgtggaagg ccggaatcct gtacaaacga      480

gaaactaccc ggagtgcttc attttgtggc tccccatatt cttgggaaca ggagctgcag      540

catggcaggc tggtgttcca gaccagcaca cgccacgggg atgagtcctt ttgccagcag      600

tctagtggca tcctgagcag atcccccgtg gggccttgtc tgcagtctca gctgcggaag      660

agtagactgg gactgcagcc acagcaggga cacctggcac gacggcagca gggaaggtct      720

ggcagtatcc gggctagagt gcatcccaca actagaaggc ctttcggcgt cgagccatca      780

ggaagcggcc acaccacaaa caccgcatca agctcctcta gttgcctgca tcagtcagcc      840

gtgagaaagg ccgcttacag ccacctgtcc acatctaaaa ggcactcaag ctccgggcat      900

gctgtggagc tgcacaacat ccctccaaat tctgcacgca gtcagtcaga aggacccgtg      960

ttcagctgct ggtggctgca gtttcggaac tcaaagcctt gcagcgacta ttgtctgagc     1020

catattgtga atctgctgga ggattggggc ccttgtaccg agcacgggga acaccatatc     1080

aggattccac gaacaccagc acgagtgact ggaggggtgt tcctggtgga caagaacccc     1140

cacaatacta ccgagagccg gctggtggtc gatttcagtc agttttcaag aggcaacaca     1200

agggtgtcat ggcccaaatt cgccgtccct aatctgcaga gtctgactaa cctgctgtct     1260

agtaatctga gctggctgtc cctggacgtg tccgcagcct tttaccacct gcctctgcat     1320

ccagctgcaa tgccccatct gctggtgggg tcaagcggac tgagtcgcta cgtcgcccga     1380

ctgtcctcta actcacgcat cattaatcac cagcatggca ccatgcagaa cctgcacgat     1440

agctgttccc ggaatctgta cgtgtctctg ctgctgctgt ataagacatt cggcagaaaa     1500

ctgcacctgt acagccatcc tatcattctg gggtttagga agatcccaat gggagtggga     1560

ctgagcccct tcctgctggc acagtttacc tccgccattt gctctgtggt ccgccgagcc     1620

ttcccacact gtctggcttt ttcctatatg aacaatgtgg tcctgggcgc caaatccgtg     1680

cagcatctgg agtctctgtt cacagctgtc actaactttc tgctgagcct ggggatccac     1740

ctgaacccaa ataagactaa acgctggggg tacagcctga atttcatggg atatgtgatt     1800

ggatcctggg ggaccctgcc acaggagcac atcgtgcaga agatcaagga atgctttcgg     1860

aagctgcccg tcaacagacc tatcgactgg aaagtgtgcc agcggattgt cggactgctg     1920

ggcttcgccg ctccctttac ccagtgcggg tacccagcac tgatgcccct gtatgcctgt     1980

atccagtcta agcaggcttt cacctttagt cctacataca aggcattcct gtgcaaacag     2040

tacctgaacc tgtatccagt ggcaaggcag cgacctggac tgtgccaggt ctttgcaaat     2100

gccactccta ccggctgggg gctggctatc ggacatcagc gaatgcgggg cacattcgtg     2160

gcccccctgc ctattcacac tgctcagctg ctggcagcct gctttgctag atctaggagt     2220

ggagcaaagc tgatcggcac cgacaatagt gtggtcctgt caagaaaata cacatccttc     2280

ccatggctgc tgggatgtgc tgcaaactgg attctgaggg gcaccagctt cgtgtacgtc     2340

ccctcagccc tgaatcctgc tgacgatcca tcccgcgggc gactgggact gtaccgacct     2400

ctgctgagac tgcccttcag gcctacaact ggccggacat ctctgtatgc cgattcacca     2460

agcgtgccct cacacctgcc tgacagagtc cactttgctt cacccctgca cgtcgcttgg     2520

cggcctcca                                                             2529


<210>  7
<211>  843
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HBV pol antigen

<400>  7

Met Pro Leu Ser Tyr Gln His Phe Arg Lys Leu Leu Leu Leu Asp Asp 
1               5                   10                  15      


Glu Ala Gly Pro Leu Glu Glu Glu Leu Pro Arg Leu Ala Asp Glu Gly 
            20                  25                  30          


Leu Asn Arg Arg Val Ala Glu Asp Leu Asn Leu Gly Asn Leu Asn Val 
        35                  40                  45              


Ser Ile Pro Trp Thr His Lys Val Gly Asn Phe Thr Gly Leu Tyr Ser 
    50                  55                  60                  


Ser Thr Val Pro Val Phe Asn Pro Glu Trp Gln Thr Pro Ser Phe Pro 
65                  70                  75                  80  


Asn Ile His Leu Gln Glu Asp Ile Ile Asn Arg Cys Glu Gln Phe Val 
                85                  90                  95      


Gly Pro Leu Thr Val Asn Glu Lys Arg Arg Leu Lys Leu Ile Met Pro 
            100                 105                 110         


Ala Arg Phe Tyr Pro Asn Val Thr Lys Tyr Leu Pro Leu Asp Lys Gly 
        115                 120                 125             


Ile Lys Pro Tyr Tyr Pro Glu His Leu Val Asn His Tyr Phe Gln Thr 
    130                 135                 140                 


Arg His Tyr Leu His Thr Leu Trp Lys Ala Gly Ile Leu Tyr Lys Arg 
145                 150                 155                 160 


Glu Thr Thr Arg Ser Ala Ser Phe Cys Gly Ser Pro Tyr Ser Trp Glu 
                165                 170                 175     


Gln Glu Leu Gln His Gly Arg Leu Val Phe Gln Thr Ser Thr Arg His 
            180                 185                 190         


Gly Asp Glu Ser Phe Cys Gln Gln Ser Ser Gly Ile Leu Ser Arg Ser 
        195                 200                 205             


Pro Val Gly Pro Cys Leu Gln Ser Gln Leu Arg Lys Ser Arg Leu Gly 
    210                 215                 220                 


Leu Gln Pro Gln Gln Gly His Leu Ala Arg Arg Gln Gln Gly Arg Ser 
225                 230                 235                 240 


Gly Ser Ile Arg Ala Arg Val His Pro Thr Thr Arg Arg Pro Phe Gly 
                245                 250                 255     


Val Glu Pro Ser Gly Ser Gly His Thr Thr Asn Thr Ala Ser Ser Ser 
            260                 265                 270         


Ser Ser Cys Leu His Gln Ser Ala Val Arg Lys Ala Ala Tyr Ser His 
        275                 280                 285             


Leu Ser Thr Ser Lys Arg His Ser Ser Ser Gly His Ala Val Glu Leu 
    290                 295                 300                 


His Asn Ile Pro Pro Asn Ser Ala Arg Ser Gln Ser Glu Gly Pro Val 
305                 310                 315                 320 


Phe Ser Cys Trp Trp Leu Gln Phe Arg Asn Ser Lys Pro Cys Ser Asp 
                325                 330                 335     


Tyr Cys Leu Ser His Ile Val Asn Leu Leu Glu Asp Trp Gly Pro Cys 
            340                 345                 350         


Thr Glu His Gly Glu His His Ile Arg Ile Pro Arg Thr Pro Ala Arg 
        355                 360                 365             


Val Thr Gly Gly Val Phe Leu Val Asp Lys Asn Pro His Asn Thr Thr 
    370                 375                 380                 


Glu Ser Arg Leu Val Val Asp Phe Ser Gln Phe Ser Arg Gly Asn Thr 
385                 390                 395                 400 


Arg Val Ser Trp Pro Lys Phe Ala Val Pro Asn Leu Gln Ser Leu Thr 
                405                 410                 415     


Asn Leu Leu Ser Ser Asn Leu Ser Trp Leu Ser Leu Asp Val Ser Ala 
            420                 425                 430         


Ala Phe Tyr His Leu Pro Leu His Pro Ala Ala Met Pro His Leu Leu 
        435                 440                 445             


Val Gly Ser Ser Gly Leu Ser Arg Tyr Val Ala Arg Leu Ser Ser Asn 
    450                 455                 460                 


Ser Arg Ile Ile Asn His Gln His Gly Thr Met Gln Asn Leu His Asp 
465                 470                 475                 480 


Ser Cys Ser Arg Asn Leu Tyr Val Ser Leu Leu Leu Leu Tyr Lys Thr 
                485                 490                 495     


Phe Gly Arg Lys Leu His Leu Tyr Ser His Pro Ile Ile Leu Gly Phe 
            500                 505                 510         


Arg Lys Ile Pro Met Gly Val Gly Leu Ser Pro Phe Leu Leu Ala Gln 
        515                 520                 525             


Phe Thr Ser Ala Ile Cys Ser Val Val Arg Arg Ala Phe Pro His Cys 
    530                 535                 540                 


Leu Ala Phe Ser Tyr Met Asn Asn Val Val Leu Gly Ala Lys Ser Val 
545                 550                 555                 560 


Gln His Leu Glu Ser Leu Phe Thr Ala Val Thr Asn Phe Leu Leu Ser 
                565                 570                 575     


Leu Gly Ile His Leu Asn Pro Asn Lys Thr Lys Arg Trp Gly Tyr Ser 
            580                 585                 590         


Leu Asn Phe Met Gly Tyr Val Ile Gly Ser Trp Gly Thr Leu Pro Gln 
        595                 600                 605             


Glu His Ile Val Gln Lys Ile Lys Glu Cys Phe Arg Lys Leu Pro Val 
    610                 615                 620                 


Asn Arg Pro Ile Asp Trp Lys Val Cys Gln Arg Ile Val Gly Leu Leu 
625                 630                 635                 640 


Gly Phe Ala Ala Pro Phe Thr Gln Cys Gly Tyr Pro Ala Leu Met Pro 
                645                 650                 655     


Leu Tyr Ala Cys Ile Gln Ser Lys Gln Ala Phe Thr Phe Ser Pro Thr 
            660                 665                 670         


Tyr Lys Ala Phe Leu Cys Lys Gln Tyr Leu Asn Leu Tyr Pro Val Ala 
        675                 680                 685             


Arg Gln Arg Pro Gly Leu Cys Gln Val Phe Ala Asn Ala Thr Pro Thr 
    690                 695                 700                 


Gly Trp Gly Leu Ala Ile Gly His Gln Arg Met Arg Gly Thr Phe Val 
705                 710                 715                 720 


Ala Pro Leu Pro Ile His Thr Ala Gln Leu Leu Ala Ala Cys Phe Ala 
                725                 730                 735     


Arg Ser Arg Ser Gly Ala Lys Leu Ile Gly Thr Asp Asn Ser Val Val 
            740                 745                 750         


Leu Ser Arg Lys Tyr Thr Ser Phe Pro Trp Leu Leu Gly Cys Ala Ala 
        755                 760                 765             


Asn Trp Ile Leu Arg Gly Thr Ser Phe Val Tyr Val Pro Ser Ala Leu 
    770                 775                 780                 


Asn Pro Ala Asp Asp Pro Ser Arg Gly Arg Leu Gly Leu Tyr Arg Pro 
785                 790                 795                 800 


Leu Leu Arg Leu Pro Phe Arg Pro Thr Thr Gly Arg Thr Ser Leu Tyr 
                805                 810                 815     


Ala Asp Ser Pro Ser Val Pro Ser His Leu Pro Asp Arg Val His Phe 
            820                 825                 830         


Ala Ser Pro Leu His Val Ala Trp Arg Pro Pro 
        835                 840             


<210>  8
<211>  63
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Cystatin S signal peptide coding sequence

<400>  8
atggctcgac ctctgtgtac cctgctactc ctgatggcta ccctggctgg agctctggcc       60

agc                                                                     63


<210>  9
<211>  21
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Cystatin S signal peptide sequence

<400>  9

Met Ala Arg Pro Leu Cys Thr Leu Leu Leu Leu Met Ala Thr Leu Ala 
1               5                   10                  15      


Gly Ala Leu Ala Ser 
            20      


<210>  10
<211>  378
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  triple enhancer regulatory sequence

<400>  10
ggctcgcatc tctccttcac gcgcccgccg ccctacctga ggccgccatc cacgccggtt       60

gagtcgcgtt ctgccgcctc ccgcctgtgg tgcctcctga actgcgtccg ccgtctaggt      120

aagtttaaag ctcaggtcga gaccgggcct ttgtccggcg ctcccttgga gcctacctag      180

actcagccgg ctctccacgc tttgcctgac cctgcttgct caactctagt tctctcgtta      240

acttaatgag acagatagaa actggtcttg tagaaacaga gtagtcgcct gcttttctgc      300

caggtgctga cttctctccc ctgggctttt ttctttttct caggttgaaa agaagaagac      360

gaagaagacg aagaagac                                                    378


<210>  11
<211>  12
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  linker coding sequence

<400>  11
gccggagctg gc                                                           12


<210>  12
<211>  248
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ApoAI gene fragment

<400>  12
ttggccgtgc tcttcctgac gggtaggtgt cccctaacct agggagccaa ccatcggggg       60

gccttctccc taaatccccg tggcccaccc tcctgggcag aggcagcagg tttctcactg      120

gccccctctc ccccacctcc aagcttggcc tttcggctca gatctcagcc cacagctggc      180

ctgatctggg tctcccctcc caccctcagg gagccaggct cggcatttcg tcgacaagct      240

tagccacc                                                               248


<210>  13
<211>  130
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SV40 polyadenylation signal sequence

<400>  13
aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca       60

aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct      120

tatcatgtct                                                             130


<210>  14
<211>  81
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  immunoglobulin secretion signal coding sequence

<400>  14
atggagttcg gcctgtcttg ggtctttctg gtggcaatcc tgaagggcgt gcagtgtgaa       60

gtgcagctgc tggagtctgg a                                                 81


<210>  15
<211>  27
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  immunoglobulin secretion signal sequence

<400>  15

Met Glu Phe Gly Leu Ser Trp Val Phe Leu Val Ala Ile Leu Lys Gly 
1               5                   10                  15      


Val Gln Cys Glu Val Gln Leu Leu Glu Ser Gly 
            20                  25          


<210>  16
<211>  996
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HBV core-pol fusion antigen sequence

<400>  16

Met Asp Ile Asp Pro Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu 
1               5                   10                  15      


Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Ile Arg Asp Leu Leu Asp 
            20                  25                  30          


Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
        35                  40                  45              


Ser Pro His His Thr Ala Leu Arg Gln Ala Ile Leu Cys Trp Gly Glu 
    50                  55                  60                  


Leu Met Asn Leu Ala Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala 
65                  70                  75                  80  


Ser Arg Glu Leu Val Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys 
                85                  90                  95      


Ile Arg Gln Leu Leu Trp Phe His Ile Ser Cys Leu Thr Phe Gly Arg 
            100                 105                 110         


Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp Ile Arg Thr 
        115                 120                 125             


Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro Ile Leu Ser Thr Leu Pro 
    130                 135                 140                 


Glu Thr Thr Val Val Ala Gly Ala Gly Met Pro Leu Ser Tyr Gln His 
145                 150                 155                 160 


Phe Arg Lys Leu Leu Leu Leu Asp Asp Glu Ala Gly Pro Leu Glu Glu 
                165                 170                 175     


Glu Leu Pro Arg Leu Ala Asp Glu Gly Leu Asn Arg Arg Val Ala Glu 
            180                 185                 190         


Asp Leu Asn Leu Gly Asn Leu Asn Val Ser Ile Pro Trp Thr His Lys 
        195                 200                 205             


Val Gly Asn Phe Thr Gly Leu Tyr Ser Ser Thr Val Pro Val Phe Asn 
    210                 215                 220                 


Pro Glu Trp Gln Thr Pro Ser Phe Pro Asn Ile His Leu Gln Glu Asp 
225                 230                 235                 240 


Ile Ile Asn Arg Cys Glu Gln Phe Val Gly Pro Leu Thr Val Asn Glu 
                245                 250                 255     


Lys Arg Arg Leu Lys Leu Ile Met Pro Ala Arg Phe Tyr Pro Asn Val 
            260                 265                 270         


Thr Lys Tyr Leu Pro Leu Asp Lys Gly Ile Lys Pro Tyr Tyr Pro Glu 
        275                 280                 285             


His Leu Val Asn His Tyr Phe Gln Thr Arg His Tyr Leu His Thr Leu 
    290                 295                 300                 


Trp Lys Ala Gly Ile Leu Tyr Lys Arg Glu Thr Thr Arg Ser Ala Ser 
305                 310                 315                 320 


Phe Cys Gly Ser Pro Tyr Ser Trp Glu Gln Glu Leu Gln His Gly Arg 
                325                 330                 335     


Leu Val Phe Gln Thr Ser Thr Arg His Gly Asp Glu Ser Phe Cys Gln 
            340                 345                 350         


Gln Ser Ser Gly Ile Leu Ser Arg Ser Pro Val Gly Pro Cys Leu Gln 
        355                 360                 365             


Ser Gln Leu Arg Lys Ser Arg Leu Gly Leu Gln Pro Gln Gln Gly His 
    370                 375                 380                 


Leu Ala Arg Arg Gln Gln Gly Arg Ser Gly Ser Ile Arg Ala Arg Val 
385                 390                 395                 400 


His Pro Thr Thr Arg Arg Pro Phe Gly Val Glu Pro Ser Gly Ser Gly 
                405                 410                 415     


His Thr Thr Asn Thr Ala Ser Ser Ser Ser Ser Cys Leu His Gln Ser 
            420                 425                 430         


Ala Val Arg Lys Ala Ala Tyr Ser His Leu Ser Thr Ser Lys Arg His 
        435                 440                 445             


Ser Ser Ser Gly His Ala Val Glu Leu His Asn Ile Pro Pro Asn Ser 
    450                 455                 460                 


Ala Arg Ser Gln Ser Glu Gly Pro Val Phe Ser Cys Trp Trp Leu Gln 
465                 470                 475                 480 


Phe Arg Asn Ser Lys Pro Cys Ser Asp Tyr Cys Leu Ser His Ile Val 
                485                 490                 495     


Asn Leu Leu Glu Asp Trp Gly Pro Cys Thr Glu His Gly Glu His His 
            500                 505                 510         


Ile Arg Ile Pro Arg Thr Pro Ala Arg Val Thr Gly Gly Val Phe Leu 
        515                 520                 525             


Val Asp Lys Asn Pro His Asn Thr Thr Glu Ser Arg Leu Val Val Asp 
    530                 535                 540                 


Phe Ser Gln Phe Ser Arg Gly Asn Thr Arg Val Ser Trp Pro Lys Phe 
545                 550                 555                 560 


Ala Val Pro Asn Leu Gln Ser Leu Thr Asn Leu Leu Ser Ser Asn Leu 
                565                 570                 575     


Ser Trp Leu Ser Leu Asp Val Ser Ala Ala Phe Tyr His Leu Pro Leu 
            580                 585                 590         


His Pro Ala Ala Met Pro His Leu Leu Val Gly Ser Ser Gly Leu Ser 
        595                 600                 605             


Arg Tyr Val Ala Arg Leu Ser Ser Asn Ser Arg Ile Ile Asn His Gln 
    610                 615                 620                 


His Gly Thr Met Gln Asn Leu His Asp Ser Cys Ser Arg Asn Leu Tyr 
625                 630                 635                 640 


Val Ser Leu Leu Leu Leu Tyr Lys Thr Phe Gly Arg Lys Leu His Leu 
                645                 650                 655     


Tyr Ser His Pro Ile Ile Leu Gly Phe Arg Lys Ile Pro Met Gly Val 
            660                 665                 670         


Gly Leu Ser Pro Phe Leu Leu Ala Gln Phe Thr Ser Ala Ile Cys Ser 
        675                 680                 685             


Val Val Arg Arg Ala Phe Pro His Cys Leu Ala Phe Ser Tyr Met Asn 
    690                 695                 700                 


Asn Val Val Leu Gly Ala Lys Ser Val Gln His Leu Glu Ser Leu Phe 
705                 710                 715                 720 


Thr Ala Val Thr Asn Phe Leu Leu Ser Leu Gly Ile His Leu Asn Pro 
                725                 730                 735     


Asn Lys Thr Lys Arg Trp Gly Tyr Ser Leu Asn Phe Met Gly Tyr Val 
            740                 745                 750         


Ile Gly Ser Trp Gly Thr Leu Pro Gln Glu His Ile Val Gln Lys Ile 
        755                 760                 765             


Lys Glu Cys Phe Arg Lys Leu Pro Val Asn Arg Pro Ile Asp Trp Lys 
    770                 775                 780                 


Val Cys Gln Arg Ile Val Gly Leu Leu Gly Phe Ala Ala Pro Phe Thr 
785                 790                 795                 800 


Gln Cys Gly Tyr Pro Ala Leu Met Pro Leu Tyr Ala Cys Ile Gln Ser 
                805                 810                 815     


Lys Gln Ala Phe Thr Phe Ser Pro Thr Tyr Lys Ala Phe Leu Cys Lys 
            820                 825                 830         


Gln Tyr Leu Asn Leu Tyr Pro Val Ala Arg Gln Arg Pro Gly Leu Cys 
        835                 840                 845             


Gln Val Phe Ala Asn Ala Thr Pro Thr Gly Trp Gly Leu Ala Ile Gly 
    850                 855                 860                 


His Gln Arg Met Arg Gly Thr Phe Val Ala Pro Leu Pro Ile His Thr 
865                 870                 875                 880 


Ala Gln Leu Leu Ala Ala Cys Phe Ala Arg Ser Arg Ser Gly Ala Lys 
                885                 890                 895     


Leu Ile Gly Thr Asp Asn Ser Val Val Leu Ser Arg Lys Tyr Thr Ser 
            900                 905                 910         


Phe Pro Trp Leu Leu Gly Cys Ala Ala Asn Trp Ile Leu Arg Gly Thr 
        915                 920                 925             


Ser Phe Val Tyr Val Pro Ser Ala Leu Asn Pro Ala Asp Asp Pro Ser 
    930                 935                 940                 


Arg Gly Arg Leu Gly Leu Tyr Arg Pro Leu Leu Arg Leu Pro Phe Arg 
945                 950                 955                 960 


Pro Thr Thr Gly Arg Thr Ser Leu Tyr Ala Asp Ser Pro Ser Val Pro 
                965                 970                 975     


Ser His Leu Pro Asp Arg Val His Phe Ala Ser Pro Leu His Val Ala 
            980                 985                 990         


Trp Arg Pro Pro 
        995     


<210>  17
<211>  1023
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HBV core-pol fusion antigen sequence with Ig signal sequence

<400>  17

Met Glu Phe Gly Leu Ser Trp Val Phe Leu Val Ala Ile Leu Lys Gly 
1               5                   10                  15      


Val Gln Cys Glu Val Gln Leu Leu Glu Ser Gly Met Asp Ile Asp Pro 
            20                  25                  30          


Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu Ser Phe Leu Pro Ser 
        35                  40                  45              


Asp Phe Phe Pro Ser Ile Arg Asp Leu Leu Asp Thr Ala Ser Ala Leu 
    50                  55                  60                  


Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His His Thr 
65                  70                  75                  80  


Ala Leu Arg Gln Ala Ile Leu Cys Trp Gly Glu Leu Met Asn Leu Ala 
                85                  90                  95      


Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala Ser Arg Glu Leu Val 
            100                 105                 110         


Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys Ile Arg Gln Leu Leu 
        115                 120                 125             


Trp Phe His Ile Ser Cys Leu Thr Phe Gly Arg Glu Thr Val Leu Glu 
    130                 135                 140                 


Tyr Leu Val Ser Phe Gly Val Trp Ile Arg Thr Pro Pro Ala Tyr Arg 
145                 150                 155                 160 


Pro Pro Asn Ala Pro Ile Leu Ser Thr Leu Pro Glu Thr Thr Val Val 
                165                 170                 175     


Ala Gly Ala Gly Met Pro Leu Ser Tyr Gln His Phe Arg Lys Leu Leu 
            180                 185                 190         


Leu Leu Asp Asp Glu Ala Gly Pro Leu Glu Glu Glu Leu Pro Arg Leu 
        195                 200                 205             


Ala Asp Glu Gly Leu Asn Arg Arg Val Ala Glu Asp Leu Asn Leu Gly 
    210                 215                 220                 


Asn Leu Asn Val Ser Ile Pro Trp Thr His Lys Val Gly Asn Phe Thr 
225                 230                 235                 240 


Gly Leu Tyr Ser Ser Thr Val Pro Val Phe Asn Pro Glu Trp Gln Thr 
                245                 250                 255     


Pro Ser Phe Pro Asn Ile His Leu Gln Glu Asp Ile Ile Asn Arg Cys 
            260                 265                 270         


Glu Gln Phe Val Gly Pro Leu Thr Val Asn Glu Lys Arg Arg Leu Lys 
        275                 280                 285             


Leu Ile Met Pro Ala Arg Phe Tyr Pro Asn Val Thr Lys Tyr Leu Pro 
    290                 295                 300                 


Leu Asp Lys Gly Ile Lys Pro Tyr Tyr Pro Glu His Leu Val Asn His 
305                 310                 315                 320 


Tyr Phe Gln Thr Arg His Tyr Leu His Thr Leu Trp Lys Ala Gly Ile 
                325                 330                 335     


Leu Tyr Lys Arg Glu Thr Thr Arg Ser Ala Ser Phe Cys Gly Ser Pro 
            340                 345                 350         


Tyr Ser Trp Glu Gln Glu Leu Gln His Gly Arg Leu Val Phe Gln Thr 
        355                 360                 365             


Ser Thr Arg His Gly Asp Glu Ser Phe Cys Gln Gln Ser Ser Gly Ile 
    370                 375                 380                 


Leu Ser Arg Ser Pro Val Gly Pro Cys Leu Gln Ser Gln Leu Arg Lys 
385                 390                 395                 400 


Ser Arg Leu Gly Leu Gln Pro Gln Gln Gly His Leu Ala Arg Arg Gln 
                405                 410                 415     


Gln Gly Arg Ser Gly Ser Ile Arg Ala Arg Val His Pro Thr Thr Arg 
            420                 425                 430         


Arg Pro Phe Gly Val Glu Pro Ser Gly Ser Gly His Thr Thr Asn Thr 
        435                 440                 445             


Ala Ser Ser Ser Ser Ser Cys Leu His Gln Ser Ala Val Arg Lys Ala 
    450                 455                 460                 


Ala Tyr Ser His Leu Ser Thr Ser Lys Arg His Ser Ser Ser Gly His 
465                 470                 475                 480 


Ala Val Glu Leu His Asn Ile Pro Pro Asn Ser Ala Arg Ser Gln Ser 
                485                 490                 495     


Glu Gly Pro Val Phe Ser Cys Trp Trp Leu Gln Phe Arg Asn Ser Lys 
            500                 505                 510         


Pro Cys Ser Asp Tyr Cys Leu Ser His Ile Val Asn Leu Leu Glu Asp 
        515                 520                 525             


Trp Gly Pro Cys Thr Glu His Gly Glu His His Ile Arg Ile Pro Arg 
    530                 535                 540                 


Thr Pro Ala Arg Val Thr Gly Gly Val Phe Leu Val Asp Lys Asn Pro 
545                 550                 555                 560 


His Asn Thr Thr Glu Ser Arg Leu Val Val Asp Phe Ser Gln Phe Ser 
                565                 570                 575     


Arg Gly Asn Thr Arg Val Ser Trp Pro Lys Phe Ala Val Pro Asn Leu 
            580                 585                 590         


Gln Ser Leu Thr Asn Leu Leu Ser Ser Asn Leu Ser Trp Leu Ser Leu 
        595                 600                 605             


Asp Val Ser Ala Ala Phe Tyr His Leu Pro Leu His Pro Ala Ala Met 
    610                 615                 620                 


Pro His Leu Leu Val Gly Ser Ser Gly Leu Ser Arg Tyr Val Ala Arg 
625                 630                 635                 640 


Leu Ser Ser Asn Ser Arg Ile Ile Asn His Gln His Gly Thr Met Gln 
                645                 650                 655     


Asn Leu His Asp Ser Cys Ser Arg Asn Leu Tyr Val Ser Leu Leu Leu 
            660                 665                 670         


Leu Tyr Lys Thr Phe Gly Arg Lys Leu His Leu Tyr Ser His Pro Ile 
        675                 680                 685             


Ile Leu Gly Phe Arg Lys Ile Pro Met Gly Val Gly Leu Ser Pro Phe 
    690                 695                 700                 


Leu Leu Ala Gln Phe Thr Ser Ala Ile Cys Ser Val Val Arg Arg Ala 
705                 710                 715                 720 


Phe Pro His Cys Leu Ala Phe Ser Tyr Met Asn Asn Val Val Leu Gly 
                725                 730                 735     


Ala Lys Ser Val Gln His Leu Glu Ser Leu Phe Thr Ala Val Thr Asn 
            740                 745                 750         


Phe Leu Leu Ser Leu Gly Ile His Leu Asn Pro Asn Lys Thr Lys Arg 
        755                 760                 765             


Trp Gly Tyr Ser Leu Asn Phe Met Gly Tyr Val Ile Gly Ser Trp Gly 
    770                 775                 780                 


Thr Leu Pro Gln Glu His Ile Val Gln Lys Ile Lys Glu Cys Phe Arg 
785                 790                 795                 800 


Lys Leu Pro Val Asn Arg Pro Ile Asp Trp Lys Val Cys Gln Arg Ile 
                805                 810                 815     


Val Gly Leu Leu Gly Phe Ala Ala Pro Phe Thr Gln Cys Gly Tyr Pro 
            820                 825                 830         


Ala Leu Met Pro Leu Tyr Ala Cys Ile Gln Ser Lys Gln Ala Phe Thr 
        835                 840                 845             


Phe Ser Pro Thr Tyr Lys Ala Phe Leu Cys Lys Gln Tyr Leu Asn Leu 
    850                 855                 860                 


Tyr Pro Val Ala Arg Gln Arg Pro Gly Leu Cys Gln Val Phe Ala Asn 
865                 870                 875                 880 


Ala Thr Pro Thr Gly Trp Gly Leu Ala Ile Gly His Gln Arg Met Arg 
                885                 890                 895     


Gly Thr Phe Val Ala Pro Leu Pro Ile His Thr Ala Gln Leu Leu Ala 
            900                 905                 910         


Ala Cys Phe Ala Arg Ser Arg Ser Gly Ala Lys Leu Ile Gly Thr Asp 
        915                 920                 925             


Asn Ser Val Val Leu Ser Arg Lys Tyr Thr Ser Phe Pro Trp Leu Leu 
    930                 935                 940                 


Gly Cys Ala Ala Asn Trp Ile Leu Arg Gly Thr Ser Phe Val Tyr Val 
945                 950                 955                 960 


Pro Ser Ala Leu Asn Pro Ala Asp Asp Pro Ser Arg Gly Arg Leu Gly 
                965                 970                 975     


Leu Tyr Arg Pro Leu Leu Arg Leu Pro Phe Arg Pro Thr Thr Gly Arg 
            980                 985                 990         


Thr Ser Leu Tyr Ala Asp Ser Pro  Ser Val Pro Ser His  Leu Pro Asp 
        995                 1000                 1005             


Arg Val  His Phe Ala Ser Pro  Leu His Val Ala Trp  Arg Pro Pro 
    1010                 1015                 1020             


<210>  18
<211>  584
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  hCMV promoter

<400>  18
tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt agttcatagc       60

ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg ctgaccgccc      120

aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac gccaataggg      180

actttccatt gacgtcaatg ggtggactat ttacggtaaa ctgcccactt ggcagtacat      240

caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc      300

tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta catctacgta      360

ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag      420

cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt      480

tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa      540

atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagc                       584


<210>  19
<211>  684
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  hCMV promoter sequence

<400>  19
accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt       60

agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg      120

ctgaccgccc aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac      180

gccaataggg actttccatt gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt      240

ggcagtacat caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa      300

atggcccgcc tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta      360

catctacgta ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg      420

gcgtggatag cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg      480

gagtttgttt tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc      540

attgacgcaa atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagctcgttt      600

agtgaaccgt cagatcgcct ggagacgcca tccacgctgt tttgacctcc atagaagaca      660

ccgggaccga tccagcctcc gcgg                                             684


<210>  20
<211>  225
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  bGH polyA signal

<400>  20
ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc       60

tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc      120

tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt      180

gggaagacaa tagcaggcat gctggggatg cggtgggctc tatgg                      225


<210>  21
<211>  671
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pUC ORI

<400>  21
cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc       60

ttgcaaacaa aaaaaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact      120

ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt tcttctagtg      180

tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg      240

ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac      300

tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca      360

cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga      420

gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc      480

ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct      540

gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg      600

agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct      660

tttgctcaca t                                                           671


<210>  22
<211>  795
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  KanR coding sequence

<400>  22
atgattgagc aagatggtct tcacgctggc tcgccagctg cgtgggtgga acgcctgttt       60

ggttatgatt gggcgcagca gactattgga tgttccgacg cggctgtatt tcggctgtct      120

gctcagggtc gccccgtgct gtttgtgaag acggatttgt ctggcgcatt aaatgagtta      180

caggacgagg cggctcgtct gagttggttg gccaccaccg gcgtgccctg cgccgcagtg      240

ctggatgtcg tgacagaagc aggccgcgat tggctccttc tcggcgaagt gccgggccag      300

gacctgctca gcagccactt ggcaccggca gaaaaagttt ctatcatggc cgacgccatg      360

cgtcgtcttc acactctcga tccggccacg tgcccctttg accaccaggc caagcatcgt      420

attgaacgtg cgcgtactcg gatggaagca ggtttagtag accaggacga tttggatgag      480

gaacatcaag gcctggcccc ggctgaactg tttgcgcgct taaaagcgtc gatgccagat      540

ggcgaagatt tggtagtcac ccatggagat gcgtgtttgc caaacatcat ggttgaaaat      600

ggccgcttct caggctttat tgactgtggg cgcctgggtg ttgccgaccg ctatcaagat      660

attgcgctcg caactcgtga catcgctgaa gagctgggcg gagaatgggc tgaccgtttc      720

ctggtactgt atggcattgc agcgcccgat tcccaacgca tcgcatttta tcgtctgctg      780

gatgagtttt tctaa                                                       795


<210>  23
<211>  264
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized Kanr

<400>  23

Met Ile Glu Gln Asp Gly Leu His Ala Gly Ser Pro Ala Ala Trp Val 
1               5                   10                  15      


Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gln Gln Thr Ile Gly Cys Ser 
            20                  25                  30          


Asp Ala Ala Val Phe Arg Leu Ser Ala Gln Gly Arg Pro Val Leu Phe 
        35                  40                  45              


Val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gln Asp Glu Ala 
    50                  55                  60                  


Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly Val Pro Cys Ala Ala Val 
65                  70                  75                  80  


Leu Asp Val Val Thr Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu 
                85                  90                  95      


Val Pro Gly Gln Asp Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys 
            100                 105                 110         


Val Ser Ile Met Ala Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro 
        115                 120                 125             


Ala Thr Cys Pro Phe Asp His Gln Ala Lys His Arg Ile Glu Arg Ala 
    130                 135                 140                 


Arg Thr Arg Met Glu Ala Gly Leu Val Asp Gln Asp Asp Leu Asp Glu 
145                 150                 155                 160 


Glu His Gln Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala 
                165                 170                 175     


Ser Met Pro Asp Gly Glu Asp Leu Val Val Thr His Gly Asp Ala Cys 
            180                 185                 190         


Leu Pro Asn Ile Met Val Glu Asn Gly Arg Phe Ser Gly Phe Ile Asp 
        195                 200                 205             


Cys Gly Arg Leu Gly Val Ala Asp Arg Tyr Gln Asp Ile Ala Leu Ala 
    210                 215                 220                 


Thr Arg Asp Ile Ala Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe 
225                 230                 235                 240 


Leu Val Leu Tyr Gly Ile Ala Ala Pro Asp Ser Gln Arg Ile Ala Phe 
                245                 250                 255     


Tyr Arg Leu Leu Asp Glu Phe Phe 
            260                 


<210>  24
<211>  99
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  bla promoter

<400>  24
acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa       60

ccctgataaa tgcttcaata atattgaaaa aggaagagt                              99


<210>  25
<211>  3377
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Viral genome sequence

<400>  25
gcgcaccggg gatcctaggc tttttggatt gcgctttcct ctagatcaac tgggtgtcag       60

gccctatcct acagaaggat gggtcagatt gtgacaatgt ttgaggctct gcctcacatc      120

atcgatgagg tgatcaacat tgtcattatt gtgcttatcg tgatcacggg tatcaaggct      180

gtctacaatt ttgccacctg tgggatattc gcattgatca gtttcctact tctggctggc      240

aggtcctgtg gcatgtacgg tcttaaggga cccgacattt acaaaggagt ttaccaattt      300

aagtcagtgg agtttgatat gtcacatctg aacctgacca tgcccaacgc atgttcagcc      360

aacaactccc accattacat cagtatgggg acttctggac tagaattgac cttcaccaat      420

gattccatca tcagtcacaa cttttgcaat ctgacctctg ccttcaacaa aaagaccttt      480

gaccacacac tcatgagtat agtttcgagc ctacacctca gtatcagagg gaactccaac      540

tataaggcag tatcctgcga cttcaacaat ggcataacca tccaatacaa cttgacattc      600

tcagatgcac aaagtgctca gagccagtgt agaaccttca gaggtagagt cctagatatg      660

tttagaactg ccttcggggg gaaatacatg aggagtggct ggggctggac aggctcagat      720

ggcaagacca cctggtgtag ccagacgagt taccaatacc tgattataca aaatagaacc      780

tgggaaaacc actgcacata tgcaggtcct tttgggatgt ccaggattct cctttcccaa      840

gagaagacta agttcctcac taggagacta gcgggcacat tcacctggac tttgtcagac      900

tcttcagggg tggagaatcc aggtggttat tgcctgacca aatggatgat tcttgctgca      960

gagcttaagt gtttcgggaa cacagcagtt gcgaaatgca atgtaaatca tgatgaagaa     1020

ttctgtgaca tgctgcgact aattgactac aacaaggctg ctttgagtaa gttcaaagag     1080

gacgtagaat ctgccttgca cttattcaaa acaacagtga attctttgat ttcagatcaa     1140

ctactgatga ggaaccactt gagagatctg atgggggtgc catattgcaa ttactcaaag     1200

ttttggtacc tagaacatgc aaagaccggc gaaactagtg tccccaagtg ctggcttgtc     1260

accaatggtt cttacttaaa tgagacccac ttcagtgacc aaatcgaaca ggaagccgat     1320

aacatgatta cagagatgtt gaggaaggat tacataaaga ggcaggggag taccccccta     1380

gcattgatgg accttctgat gttttccaca tctgcatatc tagtcagcat cttcctgcac     1440

cttgtcaaaa taccaacaca caggcacata aaaggtggct catgtccaaa gccacaccga     1500

ttaaccaaca aaggaatttg tagttgtggt gcatttaagg tgcctggtgt aaaaaccgtc     1560

tggaaaagac gctgaagaac agcgcctccc tgactctcca cctcgaaaga ggtggagagt     1620

cagggaggcc cagagggtct tagagtgtca caacatttgg gcctctaaaa attaggtcat     1680

gtggcagaat gttgtgaaca gttttcagat ctgggagcct tgctttggag gcgctttcaa     1740

aaatgatgca gtccatgagt gcacagtgcg gggtgatctc tttcttcttt ttgtccctta     1800

ctattccagt atgcatctta cacaaccagc catatttgtc ccacactttg tcttcatact     1860

ccctcgaagc ttccctggtc atttcaacat cgataagctt aatgtccttc ctattctgtg     1920

agtccagaag ctttctgatg tcatcggagc cttgacagct tagaaccatc ccctgcggaa     1980

gagcacctat aactgacgag gtcaacccgg gttgcgcatt gaagaggtcg gcaagatcca     2040

tgccgtgtga gtacttggaa tcttgcttga attgtttttg atcaacgggt tccctgtaaa     2100

agtgtatgaa ctgcccgttc tgtggttgga aaattgctat ttccactgga tcattaaatc     2160

taccctcaat gtcaatccat gtaggagcgt tggggtcaat tcctcccatg aggtctttta     2220

aaagcattgt ctggctgtag cttaagccca cctgaggtgg acctgctgct ccaggcgctg     2280

gcctgggtga attgactgca ggtttctcgc ttgtgagatc aattgttgtg ttttcccatg     2340

ctctccccac aatcgatgtt ctacaagcta tgtatggcca tccttcacct gaaaggcaaa     2400

ctttatagag gatgttttca taagggttcc tgtccccaac ttggtctgaa acaaacatgt     2460

tgagttttct cttggccccg agaactgcct tcaagaggtc ctcgctgttg cttggcttga     2520

tcaaaattga ctctaacatg ttacccccat ccaacagggc tgcccctgcc ttcacggcag     2580

caccaagact aaagttatag ccagaaatgt tgatgctgga ctgctgttca gtgatgaccc     2640

ccagaactgg gtgcttgtct ttcagccttt caagatcatt aagatttgga tacttgactg     2700

tgtaaagcaa gccaaggtct gtgagcgctt gtacaacgtc attgagcgga gtctgtgact     2760

gtttggccat acaagccata gttagacttg gcattgtgcc aaattgattg ttcaaaagtg     2820

atgagtcttt cacatcccaa actcttacca caccacttgc accctgctga ggctttctca     2880

tcccaactat ctgtaggatc tgagatcttt ggtctagttg ctgtgttgtt aagttcccca     2940

tatatacccc tgaagcctgg ggcctttcag acctcatgat cttggccttc agcttctcaa     3000

ggtcagccgc aagagacatc agttcttctg cactgagcct ccccactttc aaaacattct     3060

tctttgatgt tgactttaaa tccacaagag aatgtacagt ctggttgaga cttctgagtc     3120

tctgtaggtc tttgtcatct ctcttttcct tcctcatgat cctctgaaca ttgctgacct     3180

cagagaagtc caacccattc agaaggttgg ttgcatcctt aatgacagca gccttcacat     3240

ctgatgtgaa gctctgcaat tctcttctca atgcttgcgt ccattggaag ctcttaactt     3300

ccttagacaa ggacatcttg ttgctcaatg gtttctcaag acaaatgcgc aatcaaatgc     3360

ctaggatcca ctgtgcg                                                    3377


<210>  26
<211>  7229
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Viral genome sequence

<400>  26
gcgcaccggg gatcctaggc gtttagttgc gctgtttggt tgcacaactt tcttcgtgag       60

gctgtcagaa gtggacctgg ctgatagcga tgggtcaagg caagtccaga gaggagaaag      120

gcaccaatag tacaaacagg gccgaaatcc taccagatac cacctatctt ggccctttaa      180

gctgcaaatc ttgctggcag aaatttgaca gcttggtaag atgccatgac cactaccttt      240

gcaggcactg tttaaacctt ctgctgtcag tatccgacag gtgtcctctt tgtaaatatc      300

cattaccaac cagattgaag atatcaacag ccccaagctc tccacctccc tacgaagagt      360

aacaccgtcc ggccccggcc ccgacaaaca gcccagcaca agggaaccgc acgtcaccca      420

acgcacacag acacagcacc caacacagaa cacgcacaca cacacacaca cacacccaca      480

cgcacgcgcc cccaccaccg gggggcgccc ccccccgggg ggcggccccc cgggagcccg      540

ggcggagccc cacggagatg cccatcagtc gatgtcctcg gccaccgacc cgcccagcca      600

atcgtcgcag gacctcccct tgagtctaaa cctgcccccc actgtttcat acatcaaagt      660

gctcctagat ttgctaaaac aaagtctgca atccttaaag gcgaaccagt ctggcaaaag      720

cgacagtgga atcagcagaa tagatctgtc tatacatagt tcctggagga ttacacttat      780

ctctgaaccc aacaaatgtt caccagttct gaatcgatgc aggaagaggt tcccaaggac      840

atcactaatc ttttcatagc cctcaagtcc tgctagaaag actttcatgt ccttggtctc      900

cagcttcaca atgatatttt ggacaaggtt tcttccttca aaaagggcac ccatctttac      960

agtcagtggc acaggctccc actcaggtcc aactctctca aagtcaatag atctaatccc     1020

atccagtatt cttttggagc ccaacaactc aagctcaaga gaatcaccaa gtatcaaggg     1080

atcttccatg taatcctcaa actcttcaga tctgatatca aagacaccat cgttcacctt     1140

gaagacagag tctgtcctca gtaagtggag gcattcatcc aacattcttc tatctatctc     1200

acccttaaag aggtgagagc atgataaaag ttcagccaca cctggattct gtaattggca     1260

cctaaccaag aatatcaatg aaaatttcct taaacagtca gtattattct gattgtgcgt     1320

aaagtccact gaaattgaaa actccaatac cccttttgtg tagttgagca tgtagtccca     1380

cagatccttt aaggatttaa atgcctttgg gtttgtcagg ccctgcctaa tcaacatggc     1440

agcattacac acaacatctc ccattcggta agagaaccac ccaaaaccaa actgcaaatc     1500

attcctaaac ataggcctct ccacattttt gttcaccacc tttgagacaa atgattgaaa     1560

ggggcccagt gcctcagcac catcttcaga tggcatcatt tctttatgag ggaaccatga     1620

aaaattgcct aatgtcctgg ttgttgcaac aaattctcga acaaatgatt caaaatacac     1680

ctgttttaag aagttcttgc agacatccct cgtgctaaca acaaattcat caaccagact     1740

ggagtcagat cgctgatgag aattggcaag gtcagaaaac agaacagtgt aatgttcatc     1800

ccttttccac ttaacaacat gagaaatgag tgacaaggat tctgagttaa tatcaattaa     1860

aacacagagg tcaaggaatt taattctggg actccacctc atgttttttg agctcatgtc     1920

agacataaat ggaagaagct gatcctcaaa gatcttggga tatagccgcc tcacagattg     1980

aatcacttgg ttcaaattca ctttgtcctc cagtagcctt gagctctcag gctttcttgc     2040

tacataatca catgggttta agtgcttaag agttaggttc tcactgttat tcttcccttt     2100

ggtcggttct gctaggaccc aaacacccaa ctcaaaagag ttgctcaatg aaatacaaat     2160

gtagtcccaa agaagaggcc ttaaaaggca tatatgatca cggtgggctt ctggatgaga     2220

ctgtttgtca caaatgtaca gcgttatacc atcccgattg caaactcttg tcacatgatc     2280

atctgtggtt agatcctcaa gcagcttttt gatatacaga ttttccctat ttttgtttct     2340

cacacacctg cttcctagag ttttgcaaag gcctataaag ccagatgaga tacaactctg     2400

gaaagctgac ttgttgattg cttctgacag cagcttctgt gcaccccttg tgaatttact     2460

acaaagtttg ttctggagtg tcttgatcaa tgatgggatt ctttcctctt ggaaagtcat     2520

cactgatgga taaaccacct tttgtcttaa aaccatcctt aatgggaaca tttcattcaa     2580

attcaaccag ttaacatctg ctaactgatt cagatcttct tcaagaccga ggaggtctcc     2640

caattgaaga atggcctcct ttttatctct gttaaatagg tctaagaaaa attcttcatt     2700

aaattcacca tttttgagct tatgatgcag tttccttaca agctttctta caacctttgt     2760

ttcattagga cacagttcct caatgagtct ttgtattctg taacctctag aaccatccag     2820

ccaatctttc acatcagtgt tggtattcag tagaaatgga tccaaaggga aattggcata     2880

ctttaggagg tccagtgttc tcctttggat actattaact agggagactg ggacgccatt     2940

tgcgatggct tgatctgcaa ttgtatctat tgtttcacaa agttgatgtg gctctttaca     3000

cttgacattg tgtagcgctg cagatacaaa ctttgtgaga agagggactt cctcccccca     3060

tacatagaat ctagatttaa attctgcagc gaacctccca gccacacttt ttgggctgat     3120

aaatttgttt aacaagccgc tcagatgaga ttggaattcc aacaggacaa ggacttcctc     3180

cggatcactt acaaccaggt cactcagcct cctatcaaat aaagtgatct gatcatcact     3240

tgatgtgtaa gcctctggtc tttcgccaaa gataacacca atgcagtagt tgatgaacct     3300

ctcgctaagc aaaccataga agtcagaagc attatgcaag attccctgcc ccatatcaat     3360

aaggctggat atatgggatg gcactatccc catttcaaaa tattgtctga aaattctctc     3420

agtaacagtt gtttctgaac ccctgagaag ttttagcttc gacttgacat atgatttcat     3480

cattgcattc acaacaggaa aggggacctc gacaagctta tgcatgtgcc aagttaacaa     3540

agtgctaaca tgatctttcc cggaacgcac atactggtca tcacctagtt tgagattttg     3600

tagaaacatt aagaacaaaa atgggcacat cattggtccc catttgctgt gatccatact     3660

atagtttaag aacccttccc gcacattgat agtcattgac aagattgcat tttcaaattc     3720

cttatcattg tttaaacagg agcctgaaaa gaaacttgaa aaagactcaa aataatcttc     3780

tattaacctt gtgaacattt ttgtcctcaa atctccaata tagagttctc tatttccccc     3840

aacctgctct ttataagata gtgcaaattt cagccttcca gagtcaggac ctactgaggt     3900

gtatgatgtt ggtgattctt ctgagtagaa gcacagattt ttcaaagcag cactcataca     3960

ttgtgtcaac gacagagctt tactaaggga ctcagaatta ctttccctct cactgattct     4020

cacgtcttct tccagtttgt cccagtcaaa tttgaaattc aagccttgcc tttgcatatg     4080

cctgtatttc cctgagtacg catttgcatt catttgcaac agaatcatct tcatgcaaga     4140

aaaccaatca ttctcagaaa agaactttct acaaaggttt tttgccatct catcgaggcc     4200

acactgatct ttaatgactg aggtgaaata caaaggtgac agctctgtgg aaccctcaac     4260

agcctcacag ataaatttca tgtcatcatt ggttagacat gatgggtcaa agtcttctac     4320

taaatggaaa gatatttctg acaagataac ttttcttaag tgagccatct tccctgttag     4380

aataagctgt aaatgatgta gtccttttgt atttgtaagt ttttctccat ctcctttgtc     4440

attggccctc ctacctcttc tgtaccgtgc tattgtggtg ttgacctttt cttcgagact     4500

tttgaagaag cttgtctctt cttctccatc aaaacatatt tctgccaggt tgtcttccga     4560

tctccctgtc tcttctccct tggaaccgat gaccaatcta gagactaact tggaaacttt     4620

atattcatag tctgagtggc tcaacttata cttttgtttt cttacgaaac tctccgtaat     4680

ttgactcaca gcactaacaa gcaatttgtt aaagtcatat tccagaagtc gttctccatt     4740

tagatgctta ttaaccacca cacttttgtt actagcaaga tctaatgctg tcgcacatcc     4800

agagttagtc atgggatcta ggctgtttag cttcttctct cctttgaaaa ttaaagtgcc     4860

gttgttaaat gaagacacca ttaggctaaa ggcttccaga ttaacacctg gagttgtatg     4920

ctgacagtca atttctttac tagtgaatct cttcatttgc tcatagaaca cacattcttc     4980

ctcaggagtg attgcttcct tggggttgac aaaaaaacca aattgacttt tgggctcaaa     5040

gaacttttca aaacatttta tctgatctgt tagcctgtca ggggtctcct ttgtgatcaa     5100

atgacacagg tatgacacat tcaacataaa tttaaatttt gcactcaaca acaccttctc     5160

accagtacca aaaatagttt ttattaggaa tctaagcagc ttatacacca ccttctcagc     5220

aggtgtgatc agatcctccc tcaacttatc cattaatgat gtagatgaaa aatctgacac     5280

tattgccatc accaaatatc tgacactctg tacctgcttt tgatttctct ttgttgggtt     5340

ggtgagcatt agcaacaata gggtcctcag tgcaacctca atgtcggtga gacagtcttt     5400

caaatcagga catgatctaa tccatgaaat catgatgtct atcatattgt ataagacctc     5460

atctgaaaaa attggtaaaa agaacctttt aggatctgca tagaaggaaa ttaaatgacc     5520

atccgggcct tgtatggagt agcaccttga agattctcca gtcttctggt ataataggtg     5580

gtattcttca gagtccagtt ttattacttg gcaaaacact tctttgcatt ctaccacttg     5640

atatctcaca gaccctattt gattttgcct tagtctagca actgagctag ttttcatact     5700

gtttgttaag gccagacaaa cagatgataa tcttctcagg ctctgtatgt tcttcagctg     5760

ctctgtgctg ggttggaaat tgtaatcttc aaacttcgta taatacatta tcgggtgagc     5820

tccaattttc ataaagttct caaattcagt gaatggtatg tggcattctt gctcaaggtg     5880

ttcagacagt ccgtaatgct cgaaactcag tcccaccact aacaggcatt tttgaatttt     5940

tgcaatgaac tcactaatag atgccctaaa caattcctca aaagacacct ttctaaacac     6000

ctttgacttt tttctattcc tcaaaagtct aatgaactcc tctttagtgc tgtgaaagct     6060

taccagccta tcattcacac tactatagca acaacccacc cagtgtttat cattttttaa     6120

ccctttgaat ttcgactgtt ttatcaatga ggaaagacac aaaacatcca gatttaacaa     6180

ctgtctcctt ctagtattca acagtttcaa actcttgact ttgtttaaca tagagaggag     6240

cctctcatat tcagtgctag tctcacttcc cctttcgtgc ccatgggtct ctgcagttat     6300

gaatctcatc aaaggacagg attcgactgc ctccctgctt aatgttaaga tatcatcact     6360

atcagcaagg ttttcataga gctcagagaa ttccttgatc aagccttcag ggtttacttt     6420

ctgaaagttt ctctttaatt tcccactttc taaatctctt ctaaacctgc tgaaaagaga     6480

gtttattcca aaaaccacat catcacagct catgttgggg ttgatgcctt cgtggcacat     6540

cctcataatt tcatcattgt gagttgacct cgcatctttc agaattttca tagagtccat     6600

accggagcgc ttgtcgatag tagtcttcag ggactcacag agtctaaaat attcagactc     6660

ttcaaagact ttctcatttt ggttagaata ctccaaaagt ttgaataaaa ggtctctaaa     6720

tttgaagttt gcccactctg gcataaaact attatcataa tcacaacgac catctactat     6780

tggaactaat gtgacacccg caacagcaag gtcttccctg atgcatgcca atttgttagt     6840

gtcctctata aatttcttct caaaactggc tggagtgctc ctaacaaaac actcaagaag     6900

aatgagagaa ttgtctatca gcttgtaacc atcaggaatg ataagtggta gtcctgggca     6960

tacaattcca gactccacca aaattgtttc cacagactta tcgtcgtggt tgtgtgtgca     7020

gccactcttg tctgcactgt ctatttcaat gcagcgtgac agcaacttga gtccctcaat     7080

cagaaccatt ctgggttccc tttgtcccag aaagttgagt ttctgccttg acaacctctc     7140

atcctgttct atatagttta aacataactc tctcaattct gagatgattt catccattgc     7200

gcatcaaaaa gcctaggatc ctcggtgcg                                       7229


<210>  27
<211>  7205
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Viral genome sequence

<400>  27
gcgcaccggg gatcctaggc atttttgttg cgcattttgt tgtgttattt gttgcacagc       60

ccttcatcgt gggaccttca caaacaaacc aaaccaccag ccatgggcca aggcaagtcc      120

aaagagggaa gggatgccag caatacgagc agagctgaaa ttctgccaga caccacctat      180

ctcggacctc tgaactgcaa gtcatgctgg cagagatttg acagtttagt cagatgccat      240

gaccactatc tctgcagaca ctgcctgaac ctcctgctgt cagtctccga caggtgccct      300

ctctgcaaac atccattgcc aaccaaactg aaaatatcca cggccccaag ctctccaccc      360

ccttacgagg agtgacgccc cgagccccaa caccgacaca aggaggccac caacacaacg      420

cccaacacgg aacacacaca cacacaccca cacacacatc cacacacacg cgcccccaca      480

acgggggcgc ccccccgggg gtggcccccc gggtgctcgg gcggagcccc acggagaggc      540

caattagtcg atctcctcga ccaccgactt ggtcagccag tcatcacagg acttgccctt      600

aagtctgtac ttgcccacaa ctgtttcata catcaccgtg ttctttgact tactgaaaca      660

tagcctacag tctttgaaag tgaaccagtc aggcacaagt gacagcggta ccagtagaat      720

ggatctatct atacacaact cttggagaat tgtgctaatt tccgacccct gtagatgctc      780

accagttctg aatcgatgta gaagaaggct cccaaggacg tcatcaaaat ttccataacc      840

ctcgagctct gccaagaaaa ctctcatatc cttggtctcc agtttcacaa cgatgttctg      900

aacaaggctt cttccctcaa aaagagcacc cattctcaca gtcaagggca caggctccca      960

ttcaggccca atcctctcaa aatcaaggga tctgatcccg tccagtattt tccttgagcc     1020

tatcagctca agctcaagag agtcaccgag tatcaggggg tcctccatat agtcctcaaa     1080

ctcttcagac ctaatgtcaa aaacaccatc gttcaccttg aagatagagt ctgatctcaa     1140

caggtggagg cattcgtcca agaaccttct gtccacctca cctttaaaga ggtgagagca     1200

tgataggaac tcagctacac ctggaccttg taactggcac ttcactaaaa agatcaatga     1260

aaacttcctc aaacaatcag tgttattctg gttgtgagtg aaatctactg taattgagaa     1320

ctctagcact ccctctgtat tatttatcat gtaatcccac aagtttctca aagacttgaa     1380

tgcctttgga tttgtcaagc cttgtttgat tagcatggca gcattgcaca caatatctcc     1440

caatcggtaa gagaaccatc caaatccaaa ttgcaagtca ttcctaaaca tgggcctctc     1500

catatttttg ttcactactt ttaagatgaa tgattggaaa ggccccaatg cttcagcgcc     1560

atcttcagat ggcatcatgt ctttatgagg gaaccatgaa aaacttccta gagttctgct     1620

tgttgctaca aattctcgta caaatgactc aaaatacact tgttttaaaa agtttttgca     1680

gacatccctt gtactaacga caaattcatc aacaaggctt gagtcagagc gctgatggga     1740

atttacaaga tcagaaaata gaacagtgta gtgttcgtcc ctcttccact taactacatg     1800

agaaatgagc gataaagatt ctgaattgat atcgatcaat acgcaaaggt caaggaattt     1860

gattctggga ctccatctca tgttttttga gctcatatca gacatgaagg gaagcagctg     1920

atcttcatag attttagggt acaatcgcct cacagattgg attacatggt ttaaacttat     1980

cttgtcctcc agtagccttg aactctcagg cttccttgct acataatcac atgggttcaa     2040

gtgcttgagg cttgagcttc cctcattctt ccctttcaca ggttcagcta agacccaaac     2100

acccaactca aaggaattac tcagtgagat gcaaatatag tcccaaagga ggggcctcaa     2160

gagactgatg tggtcgcagt gagcttctgg atgactttgc ctgtcacaaa tgtacaacat     2220

tatgccatca tgtctgtgga ttgctgtcac atgcgcatcc atagctagat cctcaagcac     2280

ttttctaatg tatagattgt ccctattttt atttctcaca catctacttc ccaaagtttt     2340

gcaaagacct ataaagcctg atgagatgca actttgaaag gctgacttat tgattgcttc     2400

tgacagcaac ttctgtgcac ctcttgtgaa cttactgcag agcttgttct ggagtgtctt     2460

gattaatgat gggattcttt cctcttggaa agtcattact gatggataaa ccactttctg     2520

cctcaagacc attcttaatg ggaacaactc attcaaattc agccaattta tgtttgccaa     2580

ttgacttaga tcctcttcga ggccaaggat gtttcccaac tgaagaatgg cttccttttt     2640

atccctattg aagaggtcta agaagaattc ttcattgaac tcaccattct tgagcttatg     2700

atgtagtctc cttacaagcc ttctcatgac cttcgtttca ctaggacaca attcttcaat     2760

aagcctttgg attctgtaac ctctagagcc atccaaccaa tccttgacat cagtattagt     2820

gttaagcaaa aatgggtcca agggaaagtt ggcatatttt aagaggtcta atgttctctt     2880

ctggatgcag tttaccaatg aaactggaac accatttgca acagcttgat cggcaattgt     2940

atctattgtt tcacagagtt ggtgtggctc tttacactta acgttgtgta atgctgctga     3000

cacaaatttt gttaaaagtg ggacctcttc cccccacaca taaaatctgg atttaaattc     3060

tgcagcaaat cgccccacca cacttttcgg actgatgaac ttgttaagca agccactcaa     3120

atgagaatga aattccagca atacaaggac ttcctcaggg tcactatcaa ccagttcact     3180

caatctccta tcaaataagg tgatctgatc atcacttgat gtgtaagatt ctggtctctc     3240

accaaaaatg acaccgatac aataattaat gaatctctca ctgattaagc cgtaaaagtc     3300

agaggcatta tgtaagattc cctgtcccat gtcaatgaga ctgcttatat gggaaggcac     3360

tattcctaat tcaaaatatt ctcgaaagat tctttcagtc acagttgtct ctgaacccct     3420

aagaagtttc agctttgatt tgatatatga tttcatcatt gcattcacaa caggaaaagg     3480

gacctcaaca agtttgtgca tgtgccaagt taataaggtg ctgatatgat cctttccgga     3540

acgcacatac tggtcatcac ccagtttgag attttgaagg agcattaaaa acaaaaatgg     3600

gcacatcatt ggcccccatt tgctatgatc catactgtag ttcaacaacc cctctcgcac     3660

attgatggtc attgatagaa ttgcattttc aaattctttg tcattgttta agcatgaacc     3720

tgagaagaag ctagaaaaag actcaaaata atcctctatc aatcttgtaa acatttttgt     3780

tctcaaatcc ccaatataaa gttctctgtt tcctccaacc tgctctttgt atgataacgc     3840

aaacttcaac cttccggaat caggaccaac tgaagtgtat gacgttggtg actcctctga     3900

gtaaaaacat aaattcttta aagcagcact catgcatttt gtcaatgata gagccttact     3960

tagagactca gaattacttt ccctttcact aattctaaca tcttcttcta gtttgtccca     4020

gtcaaacttg aaattcagac cttgtctttg catgtgcctg tatttccctg agtatgcatt     4080

tgcattcatt tgcagtagaa tcattttcat acacgaaaac caatcaccct ctgaaaaaaa     4140

cttcctgcag aggttttttg ccatttcatc cagaccacat tgttctttga cagctgaagt     4200

gaaatacaat ggtgacagtt ctgtagaagt ttcaatagcc tcacagataa atttcatgtc     4260

atcattggtg agacaagatg ggtcaaaatc ttccacaaga tgaaaagaaa tttctgataa     4320

gatgaccttc cttaaatatg ccattttacc tgacaatata gtctgaaggt gatgcaatcc     4380

ttttgtattt tcaaacccca cctcattttc cccttcattg gtcttcttgc ttctttcata     4440

ccgctttatt gtggagttga ccttatcttc taaattcttg aagaaacttg tctcttcttc     4500

cccatcaaag catatgtctg ctgagtcacc ttctagtttc ccagcttctg tttctttaga     4560

gccgataacc aatctagaga ccaactttga aaccttgtac tcgtaatctg agtggttcaa     4620

tttgtacttc tgctttctca tgaagctctc tgtgatctga ctcacagcac taacaagcaa     4680

tttgttaaaa tcatactcta ggagccgttc cccatttaaa tgtttgttaa caaccacact     4740

tttgttgctg gcaaggtcta atgctgttgc acacccagag ttagtcatgg gatccaagct     4800

attgagcctc ttctcccctt tgaaaatcaa agtgccattg ttgaatgagg acaccatcat     4860

gctaaaggcc tccagattga cacctggggt tgtgcgctga cagtcaactt ctttcccagt     4920

gaacttcttc atttggtcat aaaaaacaca ctcttcctca ggggtgattg actctttagg     4980

gttaacaaag aagccaaact cacttttagg ctcaaagaat ttctcaaagc atttaatttg     5040

atctgtcagc ctatcagggg tttcctttgt gattaaatga cacaggtatg acacattcaa     5100

catgaacttg aactttgcgc tcaacagtac cttttcacca gtcccaaaaa cagttttgat     5160

caaaaatctg agcaatttgt acactacttt ctcagcaggt gtgatcaaat cctccttcaa     5220

cttgtccatc aatgatgtgg atgagaagtc tgagacaatg gccatcacta aatacctaat     5280

gttttgaacc tgtttttgat tcctctttgt tgggttggtg agcatgagta ataatagggt     5340

tctcaatgca atctcaacat catcaatgct gtccttcaag tcaggacatg atctgatcca     5400

tgagatcatg gtgtcaatca tgttgtgcaa cacttcatct gagaagattg gtaaaaagaa     5460

cctttttggg tctgcataaa aagagattag atggccattg ggaccttgta tagaataaca     5520

ccttgaggat tctccagtct tttgatacag caggtgatat tcctcagagt ccaattttat     5580

cacttggcaa aatacctctt tacattccac cacttgatac cttacagagc ccaattggtt     5640

ttgtcttaat ctagcaactg aacttgtttt catactgttt gtcaaagcta gacagacaga     5700

tgacaatctt ttcaaactat gcatgttcct taattgttcc gtattaggct ggaaatcata     5760

atcttcaaac tttgtataat acattatagg atgagttccg gacctcatga aattctcaaa     5820

ctcaataaat ggtatgtggc actcatgctc aagatgttca gacagaccat agtgcccaaa     5880

actaagtccc accactgaca agcacctttg aacttttaaa atgaactcat ttatggatgt     5940

tctaaacaaa tcctcaagag atacctttct atacgccttt gactttctcc tgttccttag     6000

aagtctgatg aactcttcct tggtgctatg aaagctcacc aacctatcat tcacactccc     6060

atagcaacaa ccaacccagt gcttatcatt ttttgaccct ttgagtttag actgtttgat     6120

caacgaagag agacacaaga catccaaatt cagtaactgt ctccttctgg tgttcaataa     6180

ttttaaactt ttaactttgt tcaacataga gaggagcctc tcatactcag tgctagtctc     6240

acttcctctc tcataaccat gggtatctgc tgtgataaat ctcatcaaag gacaggattc     6300

aactgcctcc ttgcttagtg ctgaaatgtc atcactgtca gcaagagtct cataaagctc     6360

agagaattcc ttaattaaat ttccggggtt gattttctga aaactcctct tgagcttccc     6420

agtttccaag tctcttctaa acctgctgta aagggagttt atgccaagaa ccacatcatc     6480

gcagttcatg tttgggttga caccatcatg gcacattttc ataatttcat cattgtgaaa     6540

tgatcttgca tctttcaaga ttttcataga gtctataccg gaacgcttat caacagtggt     6600

cttgagagat tcgcaaagtc tgaagtactc agattcctca aagactttct catcttggct     6660

agaatactct aaaagtttaa acagaaggtc tctgaacttg aaattcaccc actctggcat     6720

aaagctgtta tcataatcac accgaccatc cactattggg accaatgtga tacccgcaat     6780

ggcaaggtct tctttgatac aggctagttt attggtgtcc tctataaatt tcttctcaaa     6840

actagctggt gtgcttctaa cgaagcactc aagaagaatg agggaattgt caatcagttt     6900

ataaccatca ggaatgatca aaggcagtcc cgggcacaca atcccagact ctattagaat     6960

tgcctcaaca gatttatcat catggttgtg tatgcagccg ctcttgtcag cactgtctat     7020

ctctatacaa cgcgacaaaa gtttgagtcc ctctatcaat accattctgg gttctctttg     7080

ccctaaaaag ttgagcttct gccttgacaa cctctcatct tgttctatgt ggtttaagca     7140

caactctctc aactccgaaa tagcctcatc cattgcgcat caaaaagcct aggatcctcg     7200

gtgcg                                                                 7205


<210>  28
<211>  3359
<212>  DNA
<213>  Artificial sequence

<220>
<223>  viral genome sequence

<400>  28
cgcaccgggg atcctaggct ttttggattg cgctttcctc agctccgtct tgtgggagaa       60

tgggtcaaat tgtgacgatg tttgaggctc tgcctcacat cattgatgag gtcattaaca      120

ttgtcattat cgtgcttatt atcatcacga gcatcaaagc tgtgtacaat ttcgccacct      180

gcgggatact tgcattgatc agctttcttt ttctggctgg caggtcctgt ggaatgtatg      240

gtcttgatgg gcctgacatt tacaaagggg tttaccgatt caagtcagtg gagtttgaca      300

tgtcttacct taacctgacg atgcccaatg catgttcggc aaacaactcc catcattata      360

taagtatggg gacttctgga ttggagttaa ccttcacaaa tgactccatc atcacccaca      420

acttttgtaa tctgacttcc gccctcaaca agaggacttt tgaccacaca cttatgagta      480

tagtctcaag tctgcacctc agcattagag gggtccccag ctacaaagca gtgtcctgtg      540

attttaacaa tggcatcact attcaataca acctgtcatt ttctaatgca cagagcgctc      600

tgagtcaatg taagaccttc agggggagag tcctggatat gttcagaact gcttttggag      660

gaaagtacat gaggagtggc tggggctgga caggttcaga tggcaagact acttggtgca      720

gccagacaaa ctaccaatat ctgattatac aaaacaggac ttgggaaaac cactgcaggt      780

acgcaggccc tttcggaatg tctagaattc tcttcgctca agaaaagaca aggtttctaa      840

ctagaaggct tgcaggcaca ttcacttgga ctttatcaga ctcatcagga gtggagaatc      900

caggtggtta ctgcttgacc aagtggatga tcctcgctgc agagctcaag tgttttggga      960

acacagctgt tgcaaagtgc aatgtaaatc atgatgaaga gttctgtgat atgctacgac     1020

tgattgatta caacaaggct gctttgagta aattcaaaga agatgtagaa tccgctctac     1080

atctgttcaa gacaacagtg aattctttga tttctgatca gcttttgatg agaaatcacc     1140

taagagactt gatgggagtg ccatactgca attactcgaa attctggtat ctagagcatg     1200

caaagactgg tgagactagt gtccccaagt gctggcttgt cagcaatggt tcttatttga     1260

atgaaaccca tttcagcgac caaattgagc aggaagcaga taatatgatc acagaaatgc     1320

tgagaaagga ctacataaaa aggcaaggga gtacccctct agccttgatg gatctattga     1380

tgttttctac atcagcatat ttgatcagca tctttctgca tcttgtgagg ataccaacac     1440

acagacacat aaagggcggc tcatgcccaa aaccacatcg gttaaccagc aagggaatct     1500

gtagttgtgg tgcatttaaa gtaccaggtg tggaaaccac ctggaaaaga cgctgaacag     1560

cagcgcctcc ctgactcacc acctcgaaag aggtggtgag tcagggaggc ccagagggtc     1620

ttagagtgtt acgacatttg gacctctgaa gattaggtca tgtggtagga tattgtggac     1680

agttttcagg tcggggagcc ttgccttgga ggcgctttca aagatgatac agtccatgag     1740

tgcacagtgt ggggtgacct ctttcttttt cttgtccctc actattccag tgtgcatctt     1800

gcatagccag ccatatttgt cccagacttt gtcctcatat tctcttgaag cttctttagt     1860

catctcaaca tcgatgagct taatgtctct tctgttttgt gaatctagga gtttcctgat     1920

gtcatcagat ccctgacaac ttaggaccat tccctgtgga agagcaccta ttactgaaga     1980

tgtcagccca ggttgtgcat tgaagaggtc agcaaggtcc atgccatgtg agtatttgga     2040

gtcctgcttg aattgttttt gatcagtggg ttctctatag aaatgtatgt actgcccatt     2100

ctgtggctga aatattgcta tttctaccgg gtcattaaat ctgccctcaa tgtcaatcca     2160

tgtaggagcg ttagggtcaa tacctcccat gaggtccttc agcaacattg tttggctgta     2220

gcttaagccc acctgaggtg ggcccgctgc cccaggcgct ggtttgggtg agttggccat     2280

aggcctctca tttgtcagat caattgttgt gttctcccat gctctcccta caactgatgt     2340

tctacaagct atgtatggcc acccctcccc tgaaagacag actttgtaga ggatgttctc     2400

gtaaggattc ctgtctccaa cctgatcaga aacaaacatg ttgagtttct tcttggcccc     2460

aagaactgct ttcaggagat cctcactgtt gcttggctta attaagatgg attccaacat     2520

gttaccccca tctaacaagg ctgcccctgc tttcacagca gcaccgagac tgaaattgta     2580

gccagatatg ttgatgctag actgctgctc agtgatgact cccaagactg ggtgcttgtc     2640

tttcagcctt tcaaggtcac ttaggttcgg gtacttgact gtgtaaagca gcccaaggtc     2700

tgtgagtgct tgcacaacgt cattgagtga ggtttgtgat tgtttggcca tacaagccat     2760

tgttaagctt ggcattgtgc cgaattgatt gttcagaagt gatgagtcct tcacatccca     2820

gaccctcacc acaccatttg cactctgctg aggtctcctc attccaacca tttgcagaat     2880

ctgagatctt tggtcaagct gttgtgctgt taagttcccc atgtagactc cagaagttag     2940

aggcctttca gacctcatga ttttagcctt cagtttttca aggtcagctg caagggacat     3000

cagttcttct gcactaagcc tccctacttt tagaacattc ttttttgatg ttgactttag     3060

gtccacaagg gaatacacag tttggttgag gcttctgagt ctctgtaaat ctttgtcatc     3120

cctcttctct ttcctcatga tcctctgaac attgctcacc tcagagaagt ctaatccatt     3180

cagaaggctg gtggcatcct tgatcacagc agctttcaca tctgatgtga agccttgaag     3240

ctctctcctc aatgcctggg tccattgaaa gcttttaact tctttggaca gagacatttt     3300

gtcactcagt ggatttccaa gtcaaatgcg caatcaaaat gcctaggatc cactgtgcg      3359


<210>  29
<211>  3376
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Viral genome sequence

<400>  29
cgcaccgggg atcctaggct ttttggattg cgctttcctc tagatcaact gggtgtcagg       60

ccctatccta cagaaggatg ggtcagattg tgacaatgtt tgaggctctg cctcacatca      120

tcgatgaggt gatcaacatt gtcattattg tgcttatcgt gatcacgggt atcaaggctg      180

tctacaattt tgccacctgt gggatattcg cattgatcag tttcctactt ctggctggca      240

ggtcctgtgg catgtacggt cttaagggac ccgacattta caaaggagtt taccaattta      300

agtcagtgga gtttgatatg tcacatctga acctgaccat gcccaacgca tgttcagcca      360

acaactccca ccattacatc agtatgggga cttctggact agaattgacc ttcaccaatg      420

attccatcat cagtcacaac ttttgcaatc tgacctctgc cttcaacaaa aagacctttg      480

accacacact catgagtata gtttcgagcc tacacctcag tatcagaggg aactccaact      540

ataaggcagt atcctgcgac ttcaacaatg gcataaccat ccaatacaac ttgacattct      600

cagatcgaca aagtgctcag agccagtgta gaaccttcag aggtagagtc ctagatatgt      660

ttagaactgc cttcgggggg aaatacatga ggagtggctg gggctggaca ggctcagatg      720

gcaagaccac ctggtgtagc cagacgagtt accaatacct gattatacaa aatagaacct      780

gggaaaacca ctgcacatat gcaggtcctt ttgggatgtc caggattctc ctttcccaag      840

agaagactaa gttcttcact aggagactag cgggcacatt cacctggact ttgtcagact      900

cttcaggggt ggagaatcca ggtggttatt gcctgaccaa atggatgatt cttgctgcag      960

agcttaagtg tttcgggaac acagcagttg cgaaatgcaa tgtaaatcat gatgccgaat     1020

tctgtgacat gctgcgacta attgactaca acaaggctgc tttgagtaag ttcaaagagg     1080

acgtagaatc tgccttgcac ttattcaaaa caacagtgaa ttctttgatt tcagatcaac     1140

tactgatgag gaaccacttg agagatctga tgggggtgcc atattgcaat tactcaaagt     1200

tttggtacct agaacatgca aagaccggcg aaactagtgt ccccaagtgc tggcttgtca     1260

ccaatggttc ttacttaaat gagacccact tcagtgatca aatcgaacag gaagccgata     1320

acatgattac agagatgttg aggaaggatt acataaagag gcaggggagt acccccctag     1380

cattgatgga ccttctgatg ttttccacat ctgcatatct agtcagcatc ttcctgcacc     1440

ttgtcaaaat accaacacac aggcacataa aaggtggctc atgtccaaag ccacaccgat     1500

taaccaacaa aggaatttgt agttgtggtg catttaaggt gcctggtgta aaaaccgtct     1560

ggaaaagacg ctgaagaaca gcgcctccct gactctccac ctcgaaagag gtggagagtc     1620

agggaggccc agagggtctt agagtgtcac aacatttggg cctctaaaaa ttaggtcatg     1680

tggcagaatg ttgtgaacag ttttcagatc tgggagcctt gctttggagg cgctttcaaa     1740

aatgatgcag tccatgagtg cacagtgcgg ggtgatctct ttcttctttt tgtcccttac     1800

tattccagta tgcatcttac acaaccagcc atatttgtcc cacactttgt cttcatactc     1860

cctcgaagct tccctggtca tttcaacatc gataagctta atgtccttcc tattctgtga     1920

gtccagaagc tttctgatgt catcggagcc ttgacagctt agaaccatcc cctgcggaag     1980

agcacctata actgacgagg tcaacccggg ttgcgcattg aagaggtcgg caagatccat     2040

gccgtgtgag tacttggaat cttgcttgaa ttgtttttga tcaacgggtt ccctgtaaaa     2100

gtgtatgaac tgcccgttct gtggttggaa aattgctatt tccactggat cattaaatct     2160

accctcaatg tcaatccatg taggagcgtt ggggtcaatt cctcccatga ggtcttttaa     2220

aagcattgtc tggctgtagc ttaagcccac ctgaggtgga cctgctgctc caggcgctgg     2280

cctgggtgaa ttgactgcag gtttctcgct tgtgagatca attgttgtgt tttcccatgc     2340

tctccccaca atcgatgttc tacaagctat gtatggccat ccttcacctg aaaggcaaac     2400

tttatagagg atgttttcat aagggttcct gtccccaact tggtctgaaa caaacatgtt     2460

gagttttctc ttggccccga gaactgcctt caagaggtcc tcgctgttgc ttggcttgat     2520

caaaattgac tctaacatgt tacccccatc caacagggct gcccctgcct tcacggcagc     2580

accaagacta aagttatagc cagaaatgtt gatgctggac tgctgttcag tgatgacccc     2640

cagaactggg tgcttgtctt tcagcctttc aagatcatta agatttggat acttgactgt     2700

gtaaagcaag ccaaggtctg tgagcgcttg tacaacgtca ttgagcggag tctgtgactg     2760

tttggccata caagccatag ttagacttgg cattgtgcca aattgattgt tcaaaagtga     2820

tgagtctttc acatcccaaa ctcttaccac accacttgca ccctgctgag gctttctcat     2880

cccaactatc tgtaggatct gagatctttg gtctagttgc tgtgttgtta agttccccat     2940

atatacccct gaagcctggg gcctttcaga cctcatgatc ttggccttca gcttctcaag     3000

gtcagccgca agagacatca gttcttctgc actgagcctc cccactttca aaacattctt     3060

ctttgatgtt gactttaaat ccacaagaga atgtacagtc tggttgagac ttctgagtct     3120

ctgtaggtct ttgtcatctc tcttttcctt cctcatgatc ctctgaacat tgctgacctc     3180

agagaagtcc aacccattca gaaggttggt tgcatcctta atgacagcag ccttcacatc     3240

tgatgtgaag ctctgcaatt ctcttctcaa tgcttgcgtc cattggaagc tcttaacttc     3300

cttagacaag gacatcttgt tgctcaatgg tttctcaaga caaatgcgca atcaaatgcc     3360

taggatccac tgtgcg                                                     3376


