                         SEQUENCE LISTING

<110>  Autolus Limited
 
<120>  Plasmid system

<130>  P113886PCT

<150>  GB 1720948.7
<151>  2017-12-15

<160>  26

<170>  PatentIn version 3.5

<210>  1
<211>  5217
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Gag and Pol polyproteins codon-shuffled variant

<400>  1
atgggccaaa ccgtgaccac cccgctgtcg ctgactctgg ggcattggaa ggatgtggaa       60

cgcatcgccc acaaccagag cgtggacgtg aagaagcgcc gctgggtgac cttctgctcc      120

gcagaatggc ctacctttaa cgtggggtgg cctcgggacg gcaccttcaa tcgggacctg      180

atcacccagg tgaaaatcaa ggtgttcagc ccgggtccgc acggccatcc agatcaagtc      240

ccgtacatcg tgacttggga agccctggcg ttcgaccccc caccgtgggt caaaccattc      300

gtccacccga agccaccgcc acccctgccg ccgtcggcgc cctcactgcc gctggaacct      360

ccgagatcga ctcctccgag atcatcgctc tacccggcgc tcactccgag cctgggcgca      420

aagccaaagc cgcaagtgct gtccgattcg ggaggacctc tcatcgacct gctcaccgag      480

gaccctccac cctacagaga tccgcgccct cccccgagcg acagggacgg gaacggcggg      540

gaggccaccc cggcaggaga agccccggac ccaagcccta tggcgtcaag actcagaggc      600

agaagagaac ctccggtggc agactcgact acttcgcagg cattcccact gcgcgccggg      660

ggaaatggcc agctgcagta ctggccgttc agctcatcgg acctctacaa ttggaagaac      720

aacaatccct cgttctcgga ggaccctggt aaactaaccg ctttgatcga atcggtcctg      780

attacccacc agccgacctg ggacgactgc cagcagctcc tgggcactct gctgaccgga      840

gaggaaaaac aaagagtgct gctggaagca cggaaggcag tgcgcgggga tgatggcagg      900

ccgacccagc tcccgaacga ggtggacgct gccttcccac tggaacgccc agattgggac      960

tacaccaccc aagctggaag aaaccacctg gtccattacc gccaactgct gctggcagga     1020

ctccaaaacg caggacggtc ccctactaac ctggccaagg tgaaagggat tactcaaggc     1080

ccgaacgagt cgccgagcgc gttcctagag cgcctaaaag aggcctaccg gcgctacacc     1140

ccatatgacc cagaggaccc aggacaggaa accaatgtga gcatgtcatt catctggcag     1200

tcagcccccg acatcggacg caagctggaa cgcctggaag acctgaagaa taaaacgctc     1260

ggcgatctgg tgcgggaagc agagaagatt ttcaataaac gggaaacccc ggaagagcgg     1320

gaggaacgca tccggcgcga gaccgaagaa aaggaggaac gcagacgcac cgaggatgaa     1380

cagaaggaga aggagagaga ccgccgccgg caccgcgaaa tgtcgaaact gctggccacg     1440

gtggtcagcg gtcagaagca ggatcgccaa ggaggcgagc gcagaagatc gcaactggat     1500

cgcgaccagt gcgcctactg caaggagaag gggcactggg cgaaagattg tcccaagaaa     1560

ccacgaggac ctcggggacc aagaccccag acctccctcc tgaccctaga tgactaggga     1620

ggtcagggtc aggagccccc ccctgaaccc aggataaccc tcaaagtcgg ggggcaaccc     1680

gtcaccttcc tggtggacac cggcgcgcag cacagcgtgc tgacccaaaa cccgggacct     1740

ctgtcagaca agtccgcctg ggtgcagggc gcaactggag ggaagcggta tcggtggacc     1800

actgatcgca aagtgcacct ggcaacggga aaagtgaccc attcatttct gcacgtgccg     1860

gactgcccgt acccgcttct gggacgcgac ctcctgacta agctcaaggc acagatccac     1920

ttcgagggat caggagcgca ggtcatggga cctatgggac aaccattgca ggtcctgacc     1980

ttgaacatcg aagacgagta caggctgcac gagactagca aggaacctga cgtgtcgctg     2040

gggagcacct ggctgtcgga ctttccccaa gcctgggcag agaccggagg aatggggctc     2100

gcggtcagac aggcaccact catcatccca ctcaaggcca cctccacccc ggtctcaatt     2160

aagcaatacc cgatgtcgca ggaagcccgc ctcggaatca agccgcatat tcaacgcctc     2220

ctggaccaag ggattctggt gccgtgccag tcgccgtgga acaccccact attgccggtc     2280

aagaagcctg gaactaacga ttacaggccg gtgcaggacc tgcgggaagt gaacaaacgg     2340

gtggaggaca tccacccgac cgtgccgaat ccgtacaacc ttctgtccgg actccctccc     2400

tcacatcagt ggtacactgt gctcgacctt aaggacgcgt tcttctgcct gcgcctgcat     2460

ccgacgtcac agccgttgtt cgctttcgag tggcgcgatc ccgaaatggg tatctcgggc     2520

caactgactt ggactcggct gccacaagga ttcaagaact cgccaactct gtttgatgaa     2580

gctctacacc gcgacctggc cgacttcaga atccaacacc cggacctgat cctgcttcaa     2640

tacgtggatg acctgctgct cgccgcgact tccgagctgg actgtcagca gggcactaga     2700

gcactgctac agaccttggg taatctggga tacagagcaa gcgccaagaa agctcagatt     2760

tgccaaaagc aagtgaagta cctgggctac cttctcaaag aaggccagag atggctgacc     2820

gaagccagaa aggagaccgt gatgggacaa ccgaccccta aaacccctcg gcagctgcgc     2880

gagttcctgg gaaccgcagg cttctgccgc ctgtggattc ccggattcgc agagatggcc     2940

gccccgctat accctctgac caagaccgga accctgttta attggggacc tgaccagcag     3000

aaggcgtacc aagagatcaa gcaagccctg ctgaccgccc ctgccctcgg actgccggac     3060

ctgactaagc cctttgagct gttcgtggac gagaagcaag gatacgcaaa gggcgtcctg     3120

actcagaagc tgggaccgtg gagaagaccg gtcgcgtacc tgtccaagaa gctggacccg     3180

gtggccgctg gatggccacc gtgcctgcgg atggtggctg ccattgctgt gctcaccaag     3240

gacgcaggca agctgactat gggacagcca ctggtgatcc tcgcaccgca cgccgtggag     3300

gctctggtga aacagcctcc tgaccggtgg ctgtccaatg cgcgcatgac tcattaccag     3360

gccctgctcc tagacaccga tcgggtgcag ttcggaccag tggtggcact gaacccagca     3420

actctgctgc cgctgccgga agaggggttg cagcacgact gcctggacat cctcgcagaa     3480

gctcacggaa cgcggtccga ccttaccgac caaccactgc ccgatgctga tcacacttgg     3540

tacactgatg ggtcatcatt cctgcaagaa ggccagcgca aagcaggggc tgcagtgact     3600

accgaaactg aagtcatttg ggctcgggca ctgccggcgg ggacgtcggc acagcgggcg     3660

gaactcatcg cactcaccca ggcgctgaag atggccgagg gcaaaaagct gaacgtgtac     3720

accgactcaa gatacgcgtt cgcaactgca catatccacg gggagattta cagacggcgc     3780

ggtctgctga cttcggaggg caaggaaatc aaaaacaagg acgagatcct ggcgctcctg     3840

aaagccctgt tcctgccaaa gcggctgtca atcatccact gccctggcca tcagaagggt     3900

aactccgctg aagccagggg aaaccgcatg gccgatcaag ccgcgcgcga ggtcgctacc     3960

agagagaccc ccggaacttc gacgctgctt atcgagaact ccacgccata cacccacgag     4020

cactttcact acactgtcac cgacactaag gatctaacta agctgggtgc cacttatgat     4080

agcgcaaaga agtactgggt gtaccagggg aagcctgtga tgcccgatca gttcaccttc     4140

gagctgctgg atttcctgca tcaactgacg cacctgagct tctcaaagac caaggctctg     4200

ctggaacgca gcccttcgcc gtactatatg ttgaataggg atcgcaccct gaagaatatc     4260

accgaaacct gcaaggcctg cgcccaggtg aatgcttcca agtccgccgt gaagcagggc     4320

acccgcgtcc gcggacaccg ccctggaact cactgggaga tcgacttcac tgaggtgaaa     4380

ccgggccttt acggctacaa atacctgctg gtgttcgtgg acactttctc gggatggatc     4440

gaggccttcc cgactaaaaa ggaaactgca aaagtggtga ctaagaagct gctggaggag     4500

attttccccc gctttggcat gccgcaggta ttgggaactg acaatgggcc tgccttcgtc     4560

tccaaggtga gtcagacagt ggccgatctg ttggggattg attggaaatt acattgtgca     4620

tacagacccc aaagctcagg tcaggtagaa agaatgaata ggaccatcaa ggagacttta     4680

actaaattaa cgcttgcaac tggctctaga gactgggtgc tcctactccc cttagccctg     4740

taccgagccc gcaacacgcc gggcccccat ggcctcaccc catatgagat cttatatggg     4800

gcacccccgc cccttgtaaa cttccctgac cctgacatga ccagagttac taacagcccc     4860

tctctccaag ctcacttaca ggctctctac ttagtccagc acgaagtttg gagaccactg     4920

gcggcagctt accaagaaca actggaccgg ccggtggtgc ctcaccctta ccgggtcggc     4980

gacacagtgt gggtccgccg acatcaaacc aagaacctag aacctcgctg gaaaggacct     5040

tacacagtcc tgctgaccac ccccaccgcc ctcaaagtag acggtatcgc agcttggata     5100

cacgcagccc acgtaaaggc ggccgacacc gagagtggac catcctctgg acggacatgg     5160

cgcgttcaac gctctcaaaa ccccctcaag ataagattaa cccgtggaag cccttag        5217


<210>  2
<211>  5217
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Gag and Pol polyproteins codon-shuffled variant

<400>  2
atgggtcaga cggtgactac tccgctttca ctcacactcg gtcattggaa agacgttgag       60

cgaatcgcgc acaatcagag tgtggacgta aaaaagcgcc gctgggttac gttctgttct      120

gctgagtggc ctacgtttaa tgtagggtgg ccaagggatg gcactttcaa cagggatctg      180

ataacacagg taaagataaa agtttttagt ccaggcccac acggtcatcc tgatcaggtg      240

ccttacattg taacatggga agcactggcg tttgatcctc cgccgtgggt taaacccttt      300

gtacacccca aaccccctcc accactccca ccctctgccc catcattgcc gttggaaccg      360

cccaggtcta cgcccccccg ctcatccctt taccctgctc tgacacctag ccttggtgcc      420

aaacccaagc cacaagtgct ctcagacagc ggcggccctc tgatagattt gctgactgaa      480

gatccgcctc cttatcgcga cccgcggcct ccaccgtcag atagggatgg caatggcggc      540

gaagccacac ccgcaggtga ggcccctgat ccaagtccca tggcttctcg acttcgaggc      600

cgacgggagc cgcctgtcgc tgatagtacg acttcacaag cattcccttt gagagcgggg      660

gggaatgggc aattgcaata ttggcccttt agcagcagtg acctgtacaa ttggaaaaac      720

aataaccctt cttttagtga ggatcctggt aagcttacgg ctttgataga atccgtgctt      780

attacacatc agccgacatg ggatgactgc caacaactct tgggtacatt gctgacgggt      840

gaagaaaaac agcgcgtgct cttggaagcc aggaaagctg tacgcggcga cgacggtcgg      900

cccacacagc ttcctaacga agtcgacgcc gcttttcctc tcgagcggcc agattgggat      960

tacacaaccc aggccggccg gaaccatttg gtacattacc ggcaactctt gttggcaggg     1020

ttgcaaaacg ctggtcggag ccccacgaac ttggcgaaag tgaagggtat cacccaaggc     1080

ccaaacgagt caccttcagc ttttctcgaa cgacttaaag aagcctacag acgatacact     1140

ccgtacgatc cagaggaccc gggccaggaa accaacgtat ctatgtcttt catttggcag     1200

agcgctccag acatcgggcg aaaactggaa cgcctcgaag acctgaagaa taaaactctc     1260

ggtgacctcg ttcgcgaagc cgagaaaatt tttaataaaa gagaaactcc ggaagagcgc     1320

gaggaaagaa ttaggcgcga gacggaggaa aaagaagaac ggaggagaac cgaggacgaa     1380

caaaaggaga aagagcgaga ccgacggcgc cacagagaaa tgagcaaact gcttgccacc     1440

gtggtgagcg gtcaaaagca agaccgacag ggaggggagc ggagacgaag tcagctcgac     1500

agggaccagt gtgcttattg taaagaaaag ggccactggg ctaaagactg ccccaaaaaa     1560

ccgagaggcc ccaggggtcc gagaccgcag acctctttgt tgactttgga tgattaaggc     1620

ggacagggtc aagagcctcc accggaacca cgcataactc tcaaagtggg aggccagcca     1680

gtaacgtttc tcgtcgacac aggagcacaa cattcagttc ttactcaaaa cccagggccg     1740

ctgagtgaca agtctgcttg ggtgcaggga gctactggag ggaagcggta ccggtggacg     1800

acggaccgga aagtgcatct ggcgacgggt aaagtaacac actctttctt gcatgtaccg     1860

gattgcccct acccacttct cggccgcgac ttgcttacaa aacttaaagc tcagatccat     1920

ttcgagggaa gcggggctca ggtaatgggc ccgatggggc agcctcttca ggtcctgacc     1980

ttgaatatcg aagacgagta tcgcttgcat gaaacctcta aggaacctga tgtgtctctg     2040

gggtcaacgt ggctgtccga ctttcctcag gcatgggctg aaaccggagg catgggtttg     2100

gcggtcagac aggcaccgct tattattccc cttaaggcga cgtctacgcc cgtctcaata     2160

aaacaatacc caatgtctca agaagcccgg ctgggaatca agcctcacat tcaaagactg     2220

ctcgatcagg gcatcctcgt cccttgccag agcccgtgga atacgcctct gttgccggtg     2280

aagaagcccg gcacgaatga ctatcggcct gtccaggacc tccgggaagt gaacaagaga     2340

gtggaggaca tacaccctac agtgcccaat ccctataatc tgctgtccgg tctccctcct     2400

tcccatcaat ggtatacggt cctcgatctg aaggatgcct ttttttgtct taggcttcac     2460

cctacgtctc aacccctctt cgccttcgag tggcgcgatc ccgaaatggg gatcagcgga     2520

caacttactt ggactaggct tccccagggg ttcaaaaata gtcccacact gttcgatgag     2580

gctctgcaca gggacttggc ggatttccgg atacaacacc ctgacctcat tttgcttcaa     2640

tatgtcgacg atcttctcct ggcggccaca tctgaactcg attgccaaca aggaactagg     2700

gctcttctgc aaactctcgg aaacttgggt tatcgggcta gtgcaaaaaa ggctcagata     2760

tgccagaaac aagtaaagta cctcggctat ctcctgaaag aagggcaacg gtggctcaca     2820

gaagcaagga aggaaacggt gatgggccag ccaactccga aaacgccccg acagttgaga     2880

gagttcctgg gtacagcggg gttttgccga ctctggatcc cgggctttgc ggaaatggcc     2940

gccccactgt atccgcttac caagacggga acgcttttta actgggggcc tgaccaacaa     3000

aaggcatacc aggaaatcaa gcaagcactg ctcacagctc cagcgctcgg tctcccggac     3060

ttgactaaac cctttgaact ttttgttgat gagaagcaag gctatgcaaa gggcgtgctt     3120

acacagaagt tgggtccatg gagaaggccg gttgcctatt tgtccaaaaa actggaccct     3180

gtggcagctg gctggccccc atgcttgagg atggtagctg ccatagctgt gctgaccaag     3240

gacgcaggga aacttaccat gggccaacct cttgtgatac ttgcaccgca tgctgttgaa     3300

gccctggtca agcaaccgcc ggaccgctgg ctctctaacg cgaggatgac gcactaccaa     3360

gctttgctcc tcgacacgga ccgggtccaa ttcggtcctg tcgtcgcgct caatcccgcg     3420

acactcctcc cccttcctga ggaagggctg caacatgact gtctcgacat acttgcagaa     3480

gcacacggca cgcggtcaga cttgacagac cagcctctcc ctgatgccga ccacacttgg     3540

tataccgatg gcagtagttt tttgcaggaa ggtcagcgaa aggctggcgc cgcagtcacc     3600

acagaaactg aggtaatttg ggcgagggct ctcccagctg ggacatctgc tcaacgcgcg     3660

gaactcattg cactcaccca agccctgaag atggcagaag gaaaaaaatt gaatgtctac     3720

actgattccc ggtatgcttt tgccacggcg catatccatg gggagatata tcgacgccga     3780

ggtctgctta cgtctgaagg taaggagatt aaaaacaaag acgagatcct cgcccttctg     3840

aaggcactgt tcttgccaaa aagactgagt atcatacact gtcctggaca ccagaaaggt     3900

aattcagccg aagcgagggg taaccggatg gcagatcaag cagcacggga agtcgctacc     3960

cgagaaaccc ccggaacctc cacccttttg atcgagaaca gtactcctta cactcacgag     4020

catttccatt atacagtgac ggacacgaaa gatttgacga aactgggtgc aacgtacgat     4080

agtgcaaaaa aatactgggt atatcagggc aaacccgtga tgcctgacca gttcacgttc     4140

gagcttctgg atttcctcca ccagcttacg catttgtctt tttccaagac gaaagcgctt     4200

ctggaacggt ctccgtcccc atattatatg ttgaatagag ataggacctt gaaaaatata     4260

acagaaacct gcaaggcttg tgctcaagtg aatgcttcca agagcgcagt caaacaaggt     4320

acgagggtca gaggccacag gccaggaacc cattgggaga tcgacttcac tgaggtgaaa     4380

ccaggccttt acggctacaa gtaccttctt gtttttgttg atacgttctc cggctggatc     4440

gaggcctttc caactaagaa ggagactgcg aaagtggtca caaagaaact cctggaagaa     4500

atcttcccgc gctttgggat gcctcaggtc cttgggaccg ataacgggcc tgcttttgta     4560

tccaaagtca gccaaacagt cgccgacctc ttgggaatcg attggaaact gcactgtgcc     4620

tatcgccccc agtcaagcgg ccaagtagaa aggatgaaca ggacaatcaa agaaactctc     4680

accaagctga ctttggcaac tgggtcacgc gactgggtct tgcttttgcc acttgctctt     4740

taccgcgctc gcaacacacc cggtccccac ggtctcactc catatgagat tttgtatggc     4800

gcaccacccc ctctcgtgaa ttttcccgat cctgacatga cgagggtcac caactctccc     4860

tctttgcagg ctcatcttca ggcgctttat cttgtgcagc acgaggtttg gagacctctt     4920

gcagctgcat accaagaaca gcttgacagg cctgtcgtgc cacatccgta ccgggtcgga     4980

gatacggtat gggtaaggag acaccaaact aaaaacctgg agccaagatg gaaagggcct     5040

tatactgttc tcctgactac gcctactgct ctcaaggttg atggcatagc agcctggatt     5100

catgcggccc atgttaaggc tgcagataca gaatccggtc cctcatccgg aaggacatgg     5160

cgggttcaaa ggtcccaaaa ccccctcaaa attcgactca cacgcggctc cccgtaa        5217


<210>  3
<211>  5217
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Gag and Pol polyproteins codon-shuffled variant

<400>  3
atgggccaga ccgtgaccac ccccctgagc ctgaccctgg gccactggaa ggacgtggag       60

cgcatcgccc acaaccagag cgtggacgtg aagaagcgcc gctgggtgac cttctgcagc      120

gccgagtggc ccaccttcaa cgtgggctgg ccccgcgacg gcaccttcaa ccgcgacctg      180

atcacccagg tgaagatcaa ggtgttcagc cccggccccc acggccaccc cgaccaggtg      240

ccctacatcg tgacctggga ggccctggcc ttcgaccccc ccccctgggt gaagcccttc      300

gtgcacccca agcccccccc ccccctgccc cccagcgccc ccagcctgcc cctggagccc      360

ccccgcagca cccccccccg cagcagcctg taccccgccc tgacccccag cctgggcgcc      420

aagcccaagc cccaggtgct gagcgacagc ggcggccccc tgatcgacct gctgaccgag      480

gacccccccc cctaccgcga cccccgcccc ccccccagcg accgcgacgg caacggcggc      540

gaggccaccc ccgccggcga ggcccccgac cccagcccca tggccagccg cctgcgcggc      600

cgccgcgagc cccccgtggc cgacagcacc accagccagg ccttccccct gcgcgccggc      660

ggcaacggcc agctgcagta ctggcccttc agcagcagcg acctgtacaa ctggaagaac      720

aacaacccca gcttcagcga ggaccccggc aagctgaccg ccctgatcga gagcgtgctg      780

atcacccacc agcccacctg ggacgactgc cagcagctgc tgggcaccct gctgaccggc      840

gaggagaagc agcgcgtgct gctggaggcc cgcaaggccg tgcgcggcga cgacggccgc      900

cccacccagc tgcccaacga ggtggacgcc gccttccccc tggagcgccc cgactgggac      960

tacaccaccc aggccggccg caaccacctg gtgcactacc gccagctgct gctggccggc     1020

ctgcagaacg ccggccgcag ccccaccaac ctggccaagg tgaagggcat cacccagggc     1080

cccaacgaga gccccagcgc cttcctggag cgcctgaagg aggcctaccg ccgctacacc     1140

ccctacgacc ccgaggaccc cggccaggag accaacgtga gcatgagctt catctggcag     1200

agcgcccccg acatcggccg caagctggag cgcctggagg acctgaagaa caagaccctg     1260

ggcgacctgg tgcgcgaggc cgagaagatc ttcaacaagc gcgagacccc cgaggagcgc     1320

gaggagcgca tccgccgcga gaccgaggag aaggaggagc gccgccgcac cgaggacgag     1380

cagaaggaga aggagcgcga ccgccgccgc caccgcgaga tgagcaagct gctggccacc     1440

gtggtgagcg gccagaagca ggaccgccag ggcggcgagc gccgccgcag ccagctggac     1500

cgcgaccagt gcgcctactg caaggagaag ggccactggg ccaaggactg ccccaagaag     1560

ccccgcggcc cccgcggccc ccgcccccag accagcctgc tgaccctgga cgactaaggc     1620

ggccagggcc aggagccccc ccccgagccc cgcatcaccc tgaaggtggg cggccagccc     1680

gtgaccttcc tggtggacac cggcgcccag cacagcgtgc tgacccagaa ccccggcccc     1740

ctgagcgaca agagcgcctg ggtgcagggc gccaccggcg gcaagcgcta ccgctggacc     1800

accgaccgca aggtgcacct ggccaccggc aaggtgaccc acagcttcct gcacgtgccc     1860

gactgcccct accccctgct gggccgcgac ctgctgacca agctgaaggc ccagatccac     1920

ttcgagggca gcggcgccca ggtgatgggc cccatgggcc agcccctgca ggtgctgacc     1980

ctgaacatcg aggacgagta ccgcctgcac gagaccagca aggagcccga cgtgagcctg     2040

ggcagcacct ggctgagcga cttcccccag gcctgggccg agaccggcgg catgggcctg     2100

gccgtgcgcc aggcccccct gatcatcccc ctgaaggcca ccagcacccc cgtgagcatc     2160

aagcagtacc ccatgagcca ggaggcccgc ctgggcatca agccccacat ccagcgcctg     2220

ctggaccagg gcatcctggt gccctgccag agcccctgga acacccccct gctgcccgtg     2280

aagaagcccg gcaccaacga ctaccgcccc gtgcaggacc tgcgcgaggt gaacaagcgc     2340

gtggaggaca tccaccccac cgtgcccaac ccctacaacc tgctgagcgg cctgcccccc     2400

agccaccagt ggtacaccgt gctggacctg aaggacgcct tcttctgcct gcgcctgcac     2460

cccaccagcc agcccctgtt cgccttcgag tggcgcgacc ccgagatggg catcagcggc     2520

cagctgacct ggacccgcct gccccagggc ttcaagaaca gccccaccct gttcgacgag     2580

gccctgcacc gcgacctggc cgacttccgc atccagcacc ccgacctgat cctgctgcag     2640

tacgtggacg acctgctgct ggccgccacc agcgagctgg actgccagca gggcacccgc     2700

gccctgctgc agaccctggg caacctgggc taccgcgcca gcgccaagaa ggcccagatc     2760

tgccagaagc aggtgaagta cctgggctac ctgctgaagg agggccagcg ctggctgacc     2820

gaggcccgca aggagaccgt gatgggccag cccaccccca agaccccccg ccagctgcgc     2880

gagttcctgg gcaccgccgg cttctgccgc ctgtggatcc ccggcttcgc cgagatggcc     2940

gcccccctgt accccctgac caagaccggc accctgttca actggggccc cgaccagcag     3000

aaggcctacc aggagatcaa gcaggccctg ctgaccgccc ccgccctggg cctgcccgac     3060

ctgaccaagc ccttcgagct gttcgtggac gagaagcagg gctacgccaa gggcgtgctg     3120

acccagaagc tgggcccctg gcgccgcccc gtggcctacc tgagcaagaa gctggacccc     3180

gtggccgccg gctggccccc ctgcctgcgc atggtggccg ccatcgccgt gctgaccaag     3240

gacgccggca agctgaccat gggccagccc ctggtgatcc tggcccccca cgccgtggag     3300

gccctggtga agcagccccc cgaccgctgg ctgagcaacg cccgcatgac ccactaccag     3360

gccctgctgc tggacaccga ccgcgtgcag ttcggccccg tggtggccct gaaccccgcc     3420

accctgctgc ccctgcccga ggagggcctg cagcacgact gcctggacat cctggccgag     3480

gcccacggca cccgcagcga cctgaccgac cagcccctgc ccgacgccga ccacacctgg     3540

tacaccgacg gcagcagctt cctgcaggag ggccagcgca aggccggcgc cgccgtgacc     3600

accgagaccg aggtgatctg ggcccgcgcc ctgcccgccg gcaccagcgc ccagcgcgcc     3660

gagctgatcg ccctgaccca ggccctgaag atggccgagg gcaagaagct gaacgtgtac     3720

accgacagcc gctacgcctt cgccaccgcc cacatccacg gcgagatcta ccgccgccgc     3780

ggcctgctga ccagcgaggg caaggagatc aagaacaagg acgagatcct ggccctgctg     3840

aaggccctgt tcctgcccaa gcgcctgagc atcatccact gccccggcca ccagaagggc     3900

aacagcgccg aggcccgcgg caaccgcatg gccgaccagg ccgcccgcga ggtggccacc     3960

cgcgagaccc ccggcaccag caccctgctg atcgagaaca gcacccccta cacccacgag     4020

cacttccact acaccgtgac cgacaccaag gacctgacca agctgggcgc cacctacgac     4080

agcgccaaga agtactgggt gtaccagggc aagcccgtga tgcccgacca gttcaccttc     4140

gagctgctgg acttcctgca ccagctgacc cacctgagct tcagcaagac caaggccctg     4200

ctggagcgca gccccagccc ctactacatg ctgaaccgcg accgcaccct gaagaacatc     4260

accgagacct gcaaggcctg cgcccaggtg aacgccagca agagcgccgt gaagcagggc     4320

acccgcgtgc gcggccaccg ccccggcacc cactgggaga tcgacttcac cgaggtgaag     4380

cccggcctgt acggctacaa gtacctgctg gtgttcgtgg acaccttcag cggctggatc     4440

gaggccttcc ccaccaagaa ggagaccgcc aaggtggtga ccaagaagct gctggaggag     4500

atcttccccc gcttcggcat gccccaggtg ctgggcaccg acaacggccc cgccttcgtg     4560

agcaaggtga gccagaccgt ggccgacctg ctgggcatcg actggaagct gcactgcgcc     4620

taccgccccc agagcagcgg ccaggtggag cgcatgaacc gcaccatcaa ggagaccctg     4680

accaagctga ccctggccac cggcagccgc gactgggtgc tgctgctgcc cctggccctg     4740

taccgcgccc gcaacacccc cggcccccac ggcctgaccc cctacgagat cctgtacggc     4800

gccccccccc ccctggtgaa cttccccgac cccgacatga cccgcgtgac caacagcccc     4860

agcctgcagg cccacctgca ggccctgtac ctggtgcagc acgaggtgtg gcgccccctg     4920

gccgccgcct accaggagca gctggaccgc cccgtggtgc cccaccccta ccgcgtgggc     4980

gacaccgtgt gggtgcgccg ccaccagacc aagaacctgg agccccgctg gaagggcccc     5040

tacaccgtgc tgctgaccac ccccaccgcc ctgaaggtgg acggcatcgc cgcctggatc     5100

cacgccgccc acgtgaaggc cgccgacacc gagagcggcc ccagcagcgg ccgcacctgg     5160

cgcgtgcagc gcagccagaa ccccctgaag atccgcctga cccgcggcag cccctaa        5217


<210>  4
<211>  5217
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Wildtype Gag and Pol polyproteins

<400>  4
atgggccaga ctgttaccac tcccttaagt ttgaccttag gtcactggaa agatgtcgag       60

cggatcgctc acaaccagtc ggtagatgtc aagaagagac gttgggttac cttctgctct      120

gcagaatggc caacctttaa cgtcggatgg ccgcgagacg gcacctttaa ccgagacctc      180

atcacccagg ttaagatcaa ggtcttttca cctggcccgc atggacaccc agaccaggtc      240

ccctacatcg tgacctggga agccttggct tttgaccccc ctccctgggt caagcccttt      300

gtacacccta agcctccgcc tcctcttcct ccatccgccc cgtctctccc ccttgaacct      360

cctcgttcga ccccgcctcg atcctccctt tatccagccc tcactccttc tctaggcgcc      420

aaacctaaac ctcaagttct ttctgacagt ggggggccgc tcatcgacct acttacagaa      480

gaccccccgc cttataggga cccaagacca cccccttccg acagggacgg aaatggtgga      540

gaagcgaccc ctgcgggaga ggcaccggac ccctccccaa tggcatctcg cctacgtggg      600

agacgggagc cccctgtggc cgactccact acctcgcagg cattccccct ccgcgcagga      660

ggaaacggac agcttcaata ctggccgttc tcctcttctg acctttacaa ctggaaaaat      720

aataaccctt ctttttctga agatccaggt aaactgacag ctctgatcga gtctgttctc      780

atcacccatc agcccacctg ggacgactgt cagcagctgt tggggactct gctgaccgga      840

gaagaaaaac aacgggtgct cttagaggct agaaaggcgg tgcggggcga tgatgggcgc      900

cccactcaac tgcccaatga agtcgatgcc gcttttcccc tcgagcgccc agactgggat      960

tacaccaccc aggcaggtag gaaccaccta gtccactatc gccagttgct cctagcgggt     1020

ctccaaaacg cgggcagaag ccccaccaat ttggccaagg taaaaggaat aacacaaggg     1080

cccaatgagt ctccctcggc cttcctagag agacttaagg aagcctatcg caggtacact     1140

ccttatgacc ctgaggaccc agggcaagaa actaatgtgt ctatgtcttt catttggcag     1200

tctgccccag acattgggag aaagttagag aggttagaag atttaaaaaa caagacgctt     1260

ggagatttgg ttagagaggc agaaaagatc tttaataaac gagaaacccc ggaagaaaga     1320

gaggaacgta tcaggagaga aacagaggaa aaagaagaac gccgtaggac agaggatgag     1380

cagaaagaga aagaaagaga tcgtaggaga catagagaga tgagcaagct attggccact     1440

gtcgttagtg gacagaaaca ggatagacag ggaggagaac gaaggaggtc ccaactcgat     1500

cgcgaccagt gtgcctactg caaagaaaag gggcactggg ctaaagattg tcccaagaaa     1560

ccacgaggac ctcggggacc aagaccccag acctccctcc tgaccctaga tgactaggga     1620

ggtcagggtc aggagccccc ccctgaaccc aggataaccc tcaaagtcgg ggggcaaccc     1680

gtcaccttcc tggtagatac tggggcccaa cactccgtgc tgacccaaaa tcctggaccc     1740

ctaagtgata agtctgcctg ggtccaaggg gctactggag gaaagcggta tcgctggacc     1800

acggatcgca aagtacatct agctaccggt aaggtcaccc actctttcct ccatgtacca     1860

gactgtccct atcctctgtt aggaagagat ttgctgacta aactaaaagc ccaaatccac     1920

tttgagggat caggagctca ggttatggga ccaatggggc agcccctgca agtgttgacc     1980

ctaaatatag aagatgagta tcggctacat gagacctcaa aagagccaga tgtttctcta     2040

gggtccacat ggctgtctga ttttcctcag gcctgggcgg aaaccggggg catgggactg     2100

gcagttcgcc aagctcctct gatcatacct ctgaaagcaa cctctacccc cgtgtccata     2160

aaacaatacc ccatgtcaca agaagccaga ctggggatca agccccacat acagagactg     2220

ttggaccagg gaatactggt accctgccag tccccctgga acacgcccct gctacccgtt     2280

aagaaaccag ggactaatga ttataggcct gtccaggatc tgagagaagt caacaagcgg     2340

gtggaagaca tccaccccac cgtgcccaac ccttacaacc tcttgagcgg gctcccaccg     2400

tcccaccagt ggtacactgt gcttgattta aaggatgcct ttttctgcct gagactccac     2460

cccaccagtc agcctctctt cgcctttgag tggagagatc cagagatggg aatctcagga     2520

caattgacct ggaccagact cccacagggt ttcaaaaaca gtcccaccct gtttgatgag     2580

gcactgcaca gagacctagc agacttccgg atccagcacc cagacttgat cctgctacag     2640

tacgtggatg acttactgct ggccgccact tctgagctag actgccaaca aggtactcgg     2700

gccctgttac aaaccctagg gaacctcggg tatcgggcct cggccaagaa agcccaaatt     2760

tgccagaaac aggtcaagta tctggggtat cttctaaaag agggtcagag atggctgact     2820

gaggccagaa aagagactgt gatggggcag cctactccga agacccctcg acaactaagg     2880

gagttcctag ggacggcagg cttctgtcgc ctctggatcc ctgggtttgc agaaatggca     2940

gcccccttgt accctctcac caaaacgggg actctgttta attggggccc agaccaacaa     3000

aaggcctatc aagaaatcaa gcaagctctt ctaactgccc cagccctggg gttgccagat     3060

ttgactaagc cctttgaact ctttgtcgac gagaagcagg gctacgccaa aggcgtccta     3120

acgcaaaagc tgggaccttg gcgtcggccg gtggcctacc tgtctaaaaa gctagaccca     3180

gtggcagctg gctggccccc ctgcctacgg atggtggcag ccattgcagt tctgacaaaa     3240

gatgctggca agctcactat gggacagccg ttggtcattc tggcccccca tgccgtagag     3300

gcactagtta agcaaccccc tgatcgctgg ctctccaatg cccggatgac ccattaccaa     3360

gccctgctcc tggacacgga ccgggtccag ttcgggccag tagtggccct aaatccagct     3420

acgctgctcc ctctgcctga ggaggggctg caacatgact gccttgacat cttggctgaa     3480

gcccacggaa ctagatcaga tcttacggac cagcccctcc cagacgccga ccacacctgg     3540

tacacggatg ggagcagctt cctgcaagaa gggcagcgta aggccggagc agcggtgacc     3600

actgagactg aggtaatctg ggccagggca ttgccagccg ggacatcggc ccaaagagct     3660

gaactgatag cgctcaccca agccctaaag atggcagaag gtaagaagct aaatgtttat     3720

actgatagcc gttacgcttt tgccaccgcc catattcatg gagaaatata cagaaggcgc     3780

gggttgctca catcagaagg aaaagagatc aagaacaagg acgagatctt agccctacta     3840

aaggctctct tcttgcccaa aagacttagc ataattcatt gcccgggaca tcaaaaagga     3900

aacagcgcag aggccagggg caaccggatg gccgaccaag cggcccgaga agtagccact     3960

agagaaactc caggaacttc cacacttctg atagaaaact caacccccta tacccatgaa     4020

cactttcact atacagtaac tgacacaaag gatttgacca aactaggagc cacttatgac     4080

agtgcgaaga aatattgggt ctatcaagga aagcctgtta tgcctgatca attcaccttt     4140

gagttactag actttcttca ccaattgacc cacctcagct tctcaaaaac aaaggctctc     4200

ctagagagaa gccccagtcc ctactacatg ctgaaccggg atcgaacact caaaaatatc     4260

actgagacct gcaaagcttg tgcacaagtc aatgccagca agtctgccgt taagcaagga     4320

actagggtcc gcgggcatcg gcctggcaca cactgggaga tcgatttcac cgaggtaaaa     4380

cctggattgt atggctataa gtatctttta gtttttgtag atactttttc tggctggata     4440

gaagctttcc caactaagaa agaaaccgcc aaggtcgtga ccaagaaact gctagaagag     4500

atcttcccta ggttcggcat gccgcaggta ttgggaactg acaatgggcc tgccttcgtc     4560

tccaaggtga gtcagacagt ggccgatctg ttggggattg attggaaatt acattgtgca     4620

tacagacccc aaagctcagg tcaggtagaa agaatgaata ggaccatcaa ggagacttta     4680

actaaattaa cgcttgcaac tggctctaga gactgggtgc tcctactccc cttagccctg     4740

taccgagccc gcaacacgcc gggcccccat ggcctcaccc catatgagat cttatatggg     4800

gcacccccgc cccttgtaaa cttccctgac cctgacatga ccagagttac taacagcccc     4860

tctctccaag ctcacttaca ggctctctac ttagtccagc acgaagtttg gagaccactg     4920

gcggcagctt accaagaaca actggaccgg ccggtggtgc ctcaccctta ccgggtcggc     4980

gacacagtgt gggtccgccg acatcaaacc aagaacctag aacctcgctg gaaaggacct     5040

tacacagtcc tgctgaccac ccccaccgcc ctcaaagtag acggtatcgc agcttggata     5100

cacgcagccc acgtaaaggc ggccgacacc gagagtggac catcctctgg acggacatgg     5160

cgcgttcaac gctctcaaaa ccccctcaag ataagattaa cccgtggaag cccttaa        5217


<210>  5
<211>  1698
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  RD114 ENV protein codon-optimised variant

<400>  5
atgaagctgc cgacgggaat ggtgatcctg tgcagcctga ttatcgtgcg cgcggggttc       60

gacgacccga gaaaggctat cgctatcgtg caaaagcagc acgggaaacc atgcgaatgc      120

agcggtggcc aggtgtcaga ggccccaccg aactcaatcc agcaggtcac ctgccccggt      180

aaaaccgcat acctgatgac caatcaaaag tggaagtgcc gggtgacccc gaagaatctg      240

actccaagcg ggggagaact gcagaactgc ccctgcaata ctttccagga ttcaatgcac      300

tcctcctgtt acaccgaata ccgccagtgc agggcaaata acaaaacgta ctacactgcg      360

accctgctga agatccgctc cggctcccta aatgaagtgc agatcctgca gaatccaaac      420

caactgttgc agagcccgtg cagaggcagc atcaatcagc cggtctgctg gagcgccacc      480

gcacctatcc acatctcaga cggaggggga ccgctcgata ccaagcgcgt gtggaccgtg      540

caaaagcggc tagagcagat ccacaaagct atgcacccgg aactgcaata ccacccgctg      600

gcgcttccaa aggtccgcga cgatctgtcg ctggacgcgc ggaccttcga catcttgaat      660

actaccttcc gcctgctgca gatgtcgaat ttcagcctgg cacaggattg ttggctgtgc      720

ctgaagctgg gtactccgac cccgctggcc atccccaccc cgtcactgac ttactcactc      780

gcagactcgt tggcaaacgc ctcctgccag attatcccac ctctgctggt gcagccgatg      840

cagttctcga actccagctg cctgtcatca ccattcatca acgacactga acagattgat      900

ctgggagcag tgactttcac caactgcact tcagtggcca acgtctcctc gccactgtgc      960

gctctgaacg ggtccgtgtt cctgtgtgga aacaatatgg cgtacactta cctgccgcaa     1020

aactggactg gcctgtgcgt gcaagcgtca ctgctgcctg acatcgacat tatcccagga     1080

gacgagcccg tcccgatccc ggcaatcgac cactacattc accgcccgaa acgggcagtc     1140

cagttcatcc cgctcctggc tggactgggg atcaccgctg ctttcactac cggagccact     1200

ggcttgggtg tctccgtgac ccagtacacg aagctgtccc accaactgat ttcggacgtc     1260

caagtcctat cgggaaccat ccaggacctc caggatcagg tcgattccct cgcagaggtg     1320

gtgctccaga accgcagagg actggatctg ctgaccgctg aacagggagg catctgcctt     1380

gcactccagg agaagtgctg cttctacgcc aataagtcgg ggatcgtgcg gaacaaaatc     1440

agaactctgc aggaagaact gcagaagcgc cgggaaagcc tcgccagcaa tccgctgtgg     1500

accggactcc aaggatttct cccgtatctt ctcccgctgc tggggcctct gctcactctg     1560

ctgctgatcc tgaccatcgg accgtgcgtc tttagcagac tgatggcatt tatcaacgac     1620

agactgaacg tggtgcatgc aatggtcctg gcacagcagt accaggccct gaaggccgag     1680

gaggaagcac aggactag                                                   1698


<210>  6
<211>  1698
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  RD114 ENV protein codon-optimised variant

<400>  6
atgaagctgc ccaccggcat ggtgatcctg tgcagcctga tcatcgtgcg cgccggcttc       60

gacgaccccc gcaaggccat cgccctggtg cagaagcagc acggcaagcc ctgcgagtgc      120

agcggcggcc aggtgagcga ggcccccccc aacagcatcc agcaggtgac ctgccccggc      180

aagaccgcct acctgatgac caaccagaag tggaagtgcc gcgtgacccc caagaacctg      240

acccccagcg gcggcgagct gcagaactgc ccctgcaaca ccttccagga cagcatgcac      300

agcagctgct acaccgagta ccgccagtgc cgcgccaaca acaagaccta ctacaccgcc      360

accctgctga agatccgcag cggcagcctg aacgaggtgc agatcctgca gaaccccaac      420

cagctgctgc agagcccctg ccgcggcagc atcaaccagc ccgtgtgctg gagcgccacc      480

gcccccatcc acatcagcga cggcggcggc cccctggaca ccaagcgcgt gtggaccgtg      540

cagaagcgcc tggagcagat ccacaaggcc atgcaccccg agctgcagta ccaccccctg      600

gccctgccca aggtgcgcga cgacctgagc ctggacgccc gcaccttcga catcctgaac      660

accaccttcc gcctgctgca gatgagcaac ttcagcctgg cccaggactg ctggctgtgc      720

ctgaagctgg gcacccccac ccccctggcc atccccaccc ccagcctgac ctacagcctg      780

gccgacagcc tggccaacgc cagctgccag atcatccccc ccctgctggt gcagcccatg      840

cagttcagca acagcagctg cctgagcagc cccttcatca acgacaccga gcagatcgac      900

ctgggcgccg tgaccttcac caactgcacc agcgtggcca acgtgagcag ccccctgtgc      960

gccctgaacg gcagcgtgtt cctgtgcggc aacaacatgg cctacaccta cctgccccag     1020

aactggaccg gcctgtgcgt gcaggccagc ctgctgcccg acatcgacat catccccggc     1080

gacgagcccg tgcccatccc cgccatcgac cactacatcc accgccccaa gcgcgccgtg     1140

cagttcatcc ccctgctggc cggcctgggc atcaccgccg ccttcaccac cggcgccacc     1200

ggcctgggcg tgagcgtgac ccagtacacc aagctgagcc accagctgat cagcgacgtg     1260

caggtgctga gcggcaccat ccaggacctg caggaccagg tggacagcct ggccgaggtg     1320

gtgctgcaga accgccgcgg cctggacctg ctgaccgccg agcagggcgg catctgcctg     1380

gccctgcagg agaagtgctg cttctacgcc aacaagagcg gcatcgtgcg caacaagatc     1440

cgcaccctgc aggaggagct gcagaagcgc cgcgagagcc tggccagcaa ccccctgtgg     1500

accggcctgc agggcttcct gccctacctg ctgcccctgc tgggccccct gctgaccctg     1560

ctgctgatcc tgaccatcgg cccctgcgtg ttcagccgcc tgatggcctt catcaacgac     1620

cgcctgaacg tggtgcacgc catggtgctg gcccagcagt accaggccct gaaggccgag     1680

gaggaggccc aggactaa                                                   1698


<210>  7
<211>  1698
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  RD114 ENV protein codon-optimised variant

<400>  7
atgaaacttc ctacgggcat ggtcattctg tgtagtttga taatagtccg ggccgggttt       60

gatgatccta ggaaggccat cgcattggtt cagaaacagc acgggaagcc ctgtgagtgc      120

agtggtgggc aagttagtga agccccgcct aacagcattc agcaagtcac ttgtccgggt      180

aaaactgcat acctgatgac taaccagaaa tggaaatgta gagttactcc taaaaatttg      240

acaccttcag gcggagagct ccaaaactgc ccttgtaata cttttcagga ctctatgcat      300

agctcctgtt acacagagta caggcaatgc agagcgaata acaagactta ctatactgcg      360

acccttctga agatccggtc aggctcactc aacgaagtgc aaattctgca gaacccaaac      420

caactgctcc aaagtccatg tcggggcagt atcaatcaac cagtatgctg gtcagccacg      480

gcacctattc acatatctga tggcggcgga cccttggaca caaagcgagt ctggaccgtt      540

caaaagcgac ttgagcaaat acacaaagcc atgcatcctg aactccagta tcaccccttg      600

gcattgccaa aagtacggga cgatctcagt cttgatgcaa ggacctttga catacttaac      660

actacattca gactgctcca gatgagtaat ttcagcctcg cacaggactg ttggctttgt      720

ctcaagctgg gcacccccac cccgctcgcg atcccgacac cgagtctgac atactcactc      780

gccgactcat tggcaaatgc aagttgccag ataatcccgc ccttgctcgt ccagccgatg      840

cagttcagta actcatcctg tctctcaagt ccgttcatta acgacacaga acaaatcgac      900

ttgggcgcag tcaccttcac caactgcaca agtgtggcaa atgtcagtag cccactttgc      960

gccctgaacg ggagcgtatt tctctgtgga aataatatgg cgtacacgta tttgccgcaa     1020

aactggaccg gcctttgtgt tcaagcctca ctcctgccgg atatcgacat aatccctggc     1080

gacgaacctg taccaatccc cgcaatcgac cactacattc acagaccaaa gagagcagtc     1140

cagtttatcc cccttcttgc gggccttggt atcactgctg cattcactac gggcgcaacg     1200

gggcttgggg tatctgtaac acaatataca aagctttctc atcagctcat ttctgacgta     1260

caggtgcttt ctggaactat ccaagatttg caagatcaag tagattccct cgcagaagtg     1320

gtcctccaga accggagggg tctcgatctt ctgactgccg aacaaggggg tatctgcctt     1380

gcactccaag agaaatgctg cttttacgca aacaaaagtg gtattgtacg caacaagata     1440

cgcacgctgc aagaggagct tcagaagcga cgggagagct tggctagtaa ccccctttgg     1500

accggacttc aaggtttctt gccctacctt cttcctcttt tgggcccact cctgactttg     1560

ttgctgattc tcacaatagg tccctgtgtt ttctctcgcc ttatggcttt catcaacgac     1620

aggttgaatg tcgtgcatgc tatggttttg gcacagcaat accaagccct taaagcagaa     1680

gaggaagcac aggactga                                                   1698


<210>  8
<211>  1698
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Wild type RD114 ENV protein

<400>  8
atgaaactcc caacaggaat ggtcatttta tgtagcctaa taatagttcg ggcagggttt       60

gacgaccccc gcaaggctat cgcattagta caaaaacaac atggtaaacc atgcgaatgc      120

agcggagggc aggtatccga ggccccaccg aactccatcc aacaggtaac ttgcccaggc      180

aagacggcct acttaatgac caaccaaaaa tggaaatgca gagtcactcc aaaaaatctc      240

acccctagcg ggggagaact ccagaactgc ccctgtaaca ctttccagga ctcgatgcac      300

agttcttgtt atactgaata ccggcaatgc agggcgaata ataagacata ctacacggcc      360

accttgctta aaatacggtc tgggagcctc aacgaggtac agatattaca aaaccccaat      420

cagctcctac agtccccttg taggggctct ataaatcagc ccgtttgctg gagtgccaca      480

gcccccatcc atatctccga tggtggagga cccctcgata ctaagagagt gtggacagtc      540

caaaaaaggc tagaacaaat tcataaggct atgcatcctg aacttcaata ccacccctta      600

gccctgccca aagtcagaga tgaccttagc cttgatgcac ggacttttga tatcctgaat      660

accactttta ggttactcca gatgtccaat tttagccttg cccaagattg ttggctctgt      720

ttaaaactag gtacccctac ccctcttgcg atacccactc cctctttaac ctactcccta      780

gcagactccc tagcgaatgc ctcctgtcag attatacctc ccctcttggt tcaaccgatg      840

cagttctcca actcgtcctg tttatcttcc cctttcatta acgatacgga acaaatagac      900

ttaggtgcag tcacctttac taactgcacc tctgtagcca atgtcagtag tcctttatgt      960

gccctaaacg ggtcagtctt cctctgtgga aataacatgg catacaccta tttaccccaa     1020

aactggacag gactttgcgt ccaagcctcc ctcctccccg acattgacat catcccgggg     1080

gatgagccag tccccattcc tgccattgat cattatatac atagacctaa acgagctgta     1140

cagttcatcc ctttactagc tggactggga atcaccgcag cattcaccac cggagctaca     1200

ggcctaggtg tctccgtcac ccagtataca aaattatccc atcagttaat atctgatgtc     1260

caagtcttat ccggtaccat acaagattta caagaccagg tagactcgtt agctgaagta     1320

gttctccaaa ataggagggg actggaccta ctaacggcag aacaaggagg aatttgttta     1380

gccttacaag aaaaatgctg tttttatgct aacaagtcag gaattgtgag aaacaaaata     1440

agaaccctac aagaagaatt acaaaaacgc agggaaagcc tggcatccaa ccctctctgg     1500

accgggctgc agggctttct tccgtacctc ctacctctcc tgggacccct actcaccctc     1560

ctactcatac taaccattgg gccatgcgtt ttcagtcgcc tcatggcctt cattaatgat     1620

agacttaatg ttgtacatgc catggtgctg gcccagcaat accaagcact caaagctgag     1680

gaagaagctc aggattga                                                   1698


<210>  9
<211>  2004
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GALV ENV protein codon-optimised variant

<400>  9
atggtgttgc ttcctgggtc tatgctgctg acatctaatc tccaccatct cagacaccag       60

atgtcacccg gcagttggaa gcggctgatc atactgttga gctgcgtatt cggcggaggg      120

ggcacctctc tccagaacaa aaatcctcat caaccgatga cgcttacgtg gcaggtattg      180

tcccagacgg gtgacgtggt atgggacact aaggctgttc aaccgccttg gacgtggtgg      240

ccgacgctga agccagatgt ctgtgccctg gcggcgtccc ttgagagctg ggatatcccg      300

gggaccgacg tatcctccag caagagagtt cgcccccctg actcagacta cacagccgcc      360

tataagcaaa tcacttgggg cgcgattggg tgttcatatc cccgagcacg caccagaatg      420

gcaagctcta cattctatgt ttgtccccgc gatggccgga cgctgtccga agcgaggcga      480

tgcggaggtc tcgaaagcct ctactgcaag gaatgggact gtgagactac gggcacgggt      540

tattggcttt ctaaatcaag caaagacttg atcactctta agtgggacca gaactcagag      600

tggacacaaa agtttcaaca atgccaccag actggatggt gcaacccctt gaagatagac      660

tttactgaca aaggtaagct gagcaaggac tggataacag ggaaaacttg ggggttgcgc      720

ttttatgtct caggccatcc gggggtacaa tttacgattc gcctcaaaat cacgaacatg      780

ccggcggtcg ctgtaggtcc ggacttggtt ttggtagaac aaggccctcc tcggactagc      840

ctcgcactgc ctcccccact cccgcctcga gaggcaccac cgccgagcct gccggattcc      900

aattcaacgg ctctggccac ctccgcacaa acaccaacag tgcggaagac tatcgtgacc      960

ctcaacactc cgcccccgac cacgggcgac agattgtttg acctggttca aggggccttc     1020

ttgacgctca atgcaacgaa ccctggagca acagagtctt gttggctttg tctggccatg     1080

ggtccccctt attatgaagc catcgcgtca tctggtgaag tggcttactc aaccgacctc     1140

gatcgctgta ggtggggcac gcaaggaaag cttactttga ccgaggtctc aggtcatggg     1200

ttgtgcattg ggaaggtccc ctttacacac caacatcttt gtaaccagac tctgagtata     1260

aattcttctg gagatcatca gtatttgctg ccgagtaacc attcatggtg ggcgtgctcc     1320

acgggactca ccccttgcct ttcaacttcc gtttttaatc aaacgagaga tttctgtatc     1380

caagtgcaac tcattccgag gatctactac tatccggaag aagtactcct gcaggcgtat     1440

gacaattccc accctaggac caaacgcgaa gcagtgagcc tgacccttgc agtattgttg     1500

ggtttgggga ttactgcggg tatcggcact ggttccaccg cgctgattaa gggaccgatc     1560

gatttgcaac aaggattgac ttcactccag atagccatag acgccgacct tcgcgcgttg     1620

caggattctg tgtctaagct ggaggatagt ttgacaagcc tctcagaggt ggtgctgcaa     1680

aacagacgag gccttgatct cttgtttctt aaggagggag gcctttgcgc tgctctgaag     1740

gaagagtgtt gtttctacat cgatcatagc ggagcggtca gagattctat gaagaagctt     1800

aaggagaagc ttgacaagcg acagctcgaa cgccaaaaga gccagaattg gtacgaagga     1860

tggtttaata attctccatg gttcactaca ctgctttcca ccatcgctgg tccgctgctg     1920

ctcctgctgc tcctgttgat actcggtccg tgcataatta ataagctcgt tcaattcata     1980

aacgaccgga tctctgcgtg ctaa                                            2004


<210>  10
<211>  2004
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GALV ENV protein codon-optimised variant

<400>  10
atggtgcttc tccctggtag catgcttttg acctcaaacc tccatcatct gcgacaccag       60

atgtcacctg gctcttggaa acgccttatt atattgctga gctgtgtttt tggaggcgga      120

ggtacatcat tgcagaacaa aaaccctcat cagccaatga cgttgacctg gcaagtattg      180

tcccagaccg gagatgtcgt ttgggacacg aaagcggtac aacctccctg gacttggtgg      240

ccgaccctca agcccgacgt ttgcgctctt gcggcgtctt tggagtcttg ggacataccg      300

gggacggatg tctcatcttc aaagagggtt cgaccgccgg attcagacta caccgctgca      360

tataagcaga ttacgtgggg agccattggc tgtagttatc cgcgggcgag gacgcggatg      420

gcttccagta ctttttatgt gtgtccgaga gacggccgca ccctgtctga ggctcggcgc      480

tgcggggggc tcgaaagcct gtactgcaaa gaatgggatt gtgagactac agggactggt      540

tattggctct caaaatctag caaagatctg attacgctca aatgggatca aaattcagaa      600

tggacccaaa agttccagca atgtcatcag accgggtggt gtaatccgct gaagatagac      660

tttacagaca aaggcaaact gtcaaaagac tggattacgg gtaagacttg gggcctccgc      720

ttttacgtaa gcggtcatcc tggggtacag tttactataa ggctgaaaat aacgaacatg      780

ccggcggtcg ctgtcgggcc ggatttggtg ctcgtggaac aagggccacc taggacctct      840

ctcgctcttc ccccgccatt gccaccacgg gaagcaccgc caccaagtct tccagattcc      900

aactctaccg cactggctac gagtgcgcag acaccaacgg ttagaaaaac cattgtcacg      960

cttaacaccc cccctccgac aaccggagat cgccttttcg atctcgtaca gggcgcgttt     1020

cttacgctta acgccacaaa tcctggggcc actgagagct gttggctttg ccttgctatg     1080

ggcccaccat actatgaggc catcgcctcc tccggcgaag tagcctactc cacggacctt     1140

gaccgatgca ggtggggaac gcaaggcaaa ttgactttga ctgaggtgag cgggcatggt     1200

ctctgcatcg gaaaagttcc gttcactcat cagcaccttt gtaaccagac cctcagcatt     1260

aattcttccg gggatcatca gtacctcctg ccgtcaaacc actcttggtg ggcctgctcc     1320

acaggtctta ctccctgctt gagcacatcc gtatttaatc agacccgaga cttctgtatc     1380

caggtacaat tgataccgag aatttattac taccccgagg aagtgttgct ccaagcatac     1440

gataactcac accctagaac gaagagagaa gcagtctccc tgacgttggc cgtccttctg     1500

ggactgggaa tcaccgcggg tataggcact ggatctacgg cactgatcaa ggggcctata     1560

gatttgcagc aggggcttac ttcacttcaa attgccatag acgcggatct tcgggcgctc     1620

caggactccg tttccaagtt ggaagactct ctgactagcc tgtccgaagt tgtgttgcag     1680

aacagacgag gacttgactt gttgtttctc aaggaagggg gtctctgtgc tgcgcttaag     1740

gaggaatgtt gcttctatat agatcattcc ggcgcggtac gggactccat gaaaaaactt     1800

aaagaaaagt tggacaagag acagttggag aggcaaaagt cccagaactg gtatgagggc     1860

tggtttaata actccccatg gtttacaacc cttttgtcta ccattgctgg gccgctcctt     1920

cttcttctgt tgctgctcat attggggcct tgtattatta acaagcttgt gcaattcatt     1980

aatgaccgaa tttctgcatg ctaa                                            2004


<210>  11
<211>  2004
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GALV ENV protein codon-optimised variant

<400>  11
atggtgctgc tgcccggcag catgctgctg accagcaacc tgcaccacct gcgccaccag       60

atgagccccg gcagctggaa gcgcctgatc atcctgctga gctgcgtgtt cggcggcggc      120

ggcaccagcc tgcagaacaa gaacccccac cagcccatga ccctgacctg gcaggtgctg      180

agccagaccg gcgacgtggt gtgggacacc aaggccgtgc agcccccctg gacctggtgg      240

cccaccctga agcccgacgt gtgcgccctg gccgccagcc tggagagctg ggacatcccc      300

ggcaccgacg tgagcagcag caagcgcgtg cgcccccccg acagcgacta caccgccgcc      360

tacaagcaga tcacctgggg cgccatcggc tgcagctacc cccgcgcccg cacccgcatg      420

gccagcagca ccttctacgt gtgcccccgc gacggccgca ccctgagcga ggcccgccgc      480

tgcggcggcc tggagagcct gtactgcaag gagtgggact gcgagaccac cggcaccggc      540

tactggctga gcaagagcag caaggacctg atcaccctga agtgggacca gaacagcgag      600

tggacccaga agttccagca gtgccaccag accggctggt gcaaccccct gaagatcgac      660

ttcaccgaca agggcaagct gagcaaggac tggatcaccg gcaagacctg gggcctgcgc      720

ttctacgtga gcggccaccc cggcgtgcag ttcaccatcc gcctgaagat caccaacatg      780

cccgccgtgg ccgtgggccc cgacctggtg ctggtggagc agggcccccc ccgcaccagc      840

ctggccctgc ccccccccct gcccccccgc gaggcccccc cccccagcct gcccgacagc      900

aacagcaccg ccctggccac cagcgcccag acccccaccg tgcgcaagac catcgtgacc      960

ctgaacaccc ccccccccac caccggcgac cgcctgttcg acctggtgca gggcgccttc     1020

ctgaccctga acgccaccaa ccccggcgcc accgagagct gctggctgtg cctggccatg     1080

ggccccccct actacgaggc catcgccagc agcggcgagg tggcctacag caccgacctg     1140

gaccgctgcc gctggggcac ccagggcaag ctgaccctga ccgaggtgag cggccacggc     1200

ctgtgcatcg gcaaggtgcc cttcacccac cagcacctgt gcaaccagac cctgagcatc     1260

aacagcagcg gcgaccacca gtacctgctg cccagcaacc acagctggtg ggcctgcagc     1320

accggcctga ccccctgcct gagcaccagc gtgttcaacc agacccgcga cttctgcatc     1380

caggtgcagc tgatcccccg catctactac taccccgagg aggtgctgct gcaggcctac     1440

gacaacagcc acccccgcac caagcgcgag gccgtgagcc tgaccctggc cgtgctgctg     1500

ggcctgggca tcaccgccgg catcggcacc ggcagcaccg ccctgatcaa gggccccatc     1560

gacctgcagc agggcctgac cagcctgcag atcgccatcg acgccgacct gcgcgccctg     1620

caggacagcg tgagcaagct ggaggacagc ctgaccagcc tgagcgaggt ggtgctgcag     1680

aaccgccgcg gcctggacct gctgttcctg aaggagggcg gcctgtgcgc cgccctgaag     1740

gaggagtgct gcttctacat cgaccacagc ggcgccgtgc gcgacagcat gaagaagctg     1800

aaggagaagc tggacaagcg ccagctggag cgccagaaga gccagaactg gtacgagggc     1860

tggttcaaca acagcccctg gttcaccacc ctgctgagca ccatcgccgg ccccctgctg     1920

ctgctgctgc tgctgctgat cctgggcccc tgcatcatca acaagctggt gcagttcatc     1980

aacgaccgca tcagcgcctg ctaa                                            2004


<210>  12
<211>  2004
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Wild type GALV ENV protein

<400>  12
atggtattgc tgcctgggtc catgcttctc acctcaaacc tgcaccacct tcggcaccag       60

atgagtcctg ggagctggaa aagactgatc atcctcctaa gctgcgtatt cggcggcggc      120

ggtaccagtc tgcaaaataa gaacccccac cagcccatga ccctcacttg gcaggtactg      180

tcccaaactg gagacgttgt ctgggataca aaggcagtcc agcccccttg gacttggtgg      240

cccacactta aacctgatgt atgtgccttg gcggctagtc ttgagtcctg ggatatcccg      300

ggaaccgatg tctcgtcctc taaacgagtc agacctccgg actcagacta tactgccgct      360

tataagcaaa tcacctgggg agccataggg tgcagctacc ctcgggctag gactagaatg      420

gcaagctcta ccttctacgt atgtccccgg gatggccgga ccctttcaga agctagaagg      480

tgcggggggc tagaatccct atactgtaaa gaatgggatt gtgagaccac ggggaccggt      540

tattggctat ctaaatcctc aaaagacctc ataactctta agtgggacca aaatagcgaa      600

tggactcaaa aatttcaaca gtgtcaccag accggctggt gtaaccccct taaaatagat      660

ttcacagaca aaggaaaatt atccaaggac tggataacgg gaaaaacctg gggattaaga      720

ttctatgtgt ctggacatcc aggcgtacag ttcaccattc gcttaaaaat caccaacatg      780

ccagctgtgg cagtaggtcc tgacctcgtc cttgtggaac aaggacctcc tagaacgtcc      840

ctcgctctcc cacctcctct tcccccaagg gaagcgccac cgccatctct ccccgactct      900

aactccacag ccctggcgac tagtgcacaa actcccacgg tgagaaaaac aattgttacc      960

ctaaacactc cgcctcccac cacaggcgac agactttttg atcttgtgca gggggccttc     1020

ctaaccttaa atgctaccaa cccaggggcc actgagtctt gctggctttg tttggccatg     1080

ggcccccctt attatgaagc aatagcctca tcaggagagg tcgcctactc caccgacctt     1140

gaccggtgcc gctgggggac ccaaggaaag ctcaccctca ctgaggtctc aggacacggg     1200

ttgtgcatag gaaaggtgcc ctttacccat cagcatctct gcaatcagac cctatccatc     1260

aattcctccg gagaccatca gtatctgctc ccctccaacc atagctggtg ggcttgcagc     1320

actggcctca ccccttgcct ctccacctca gtttttaatc agactagaga tttctgtatc     1380

caggtccagc tgattcctcg catctattac tatcctgaag aagttttgtt acaggcctat     1440

gacaattctc accccaggac taaaagagag gctgtctcac ttaccctagc tgttttactg     1500

gggttgggaa tcacggcggg aataggtact ggttcaactg ccttaattaa aggacctata     1560

gacctccagc aaggcctgac aagcctccag atcgccatag atgctgacct ccgggccctc     1620

caagactcag tcagcaagtt agaggactca ctgacttccc tgtccgaggt agtgctccaa     1680

aataggagag gccttgactt gctgtttcta aaagaaggtg gcctctgtgc ggccctaaag     1740

gaagagtgct gtttttacat agaccactca ggtgcagtac gggactccat gaaaaaactc     1800

aaagaaaaac tggataaaag acagttagag cgccagaaaa gccaaaactg gtatgaagga     1860

tggttcaata actccccttg gttcactacc ctgctatcaa ccatcgctgg gcccctatta     1920

ctcctccttc tgttgctcat cctcgggcca tgcatcatca ataagttagt tcaattcatc     1980

aatgatagga taagtgcatg ttaa                                            2004


<210>  13
<211>  538
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Gag Polyprotein Amino Acid Sequence

<400>  13

Met Gly Gln Thr Val Thr Thr Pro Leu Ser Leu Thr Leu Gly His Trp 
1               5                   10                  15      


Lys Asp Val Glu Arg Ile Ala His Asn Gln Ser Val Asp Val Lys Lys 
            20                  25                  30          


Arg Arg Trp Val Thr Phe Cys Ser Ala Glu Trp Pro Thr Phe Asn Val 
        35                  40                  45              


Gly Trp Pro Arg Asp Gly Thr Phe Asn Arg Asp Leu Ile Thr Gln Val 
    50                  55                  60                  


Lys Ile Lys Val Phe Ser Pro Gly Pro His Gly His Pro Asp Gln Val 
65                  70                  75                  80  


Pro Tyr Ile Val Thr Trp Glu Ala Leu Ala Phe Asp Pro Pro Pro Trp 
                85                  90                  95      


Val Lys Pro Phe Val His Pro Lys Pro Pro Pro Pro Leu Pro Pro Ser 
            100                 105                 110         


Ala Pro Ser Leu Pro Leu Glu Pro Pro Arg Ser Thr Pro Pro Arg Ser 
        115                 120                 125             


Ser Leu Tyr Pro Ala Leu Thr Pro Ser Leu Gly Ala Lys Pro Lys Pro 
    130                 135                 140                 


Gln Val Leu Ser Asp Ser Gly Gly Pro Leu Ile Asp Leu Leu Thr Glu 
145                 150                 155                 160 


Asp Pro Pro Pro Tyr Arg Asp Pro Arg Pro Pro Pro Ser Asp Arg Asp 
                165                 170                 175     


Gly Asn Gly Gly Glu Ala Thr Pro Ala Gly Glu Ala Pro Asp Pro Ser 
            180                 185                 190         


Pro Met Ala Ser Arg Leu Arg Gly Arg Arg Glu Pro Pro Val Ala Asp 
        195                 200                 205             


Ser Thr Thr Ser Gln Ala Phe Pro Leu Arg Ala Gly Gly Asn Gly Gln 
    210                 215                 220                 


Leu Gln Tyr Trp Pro Phe Ser Ser Ser Asp Leu Tyr Asn Trp Lys Asn 
225                 230                 235                 240 


Asn Asn Pro Ser Phe Ser Glu Asp Pro Gly Lys Leu Thr Ala Leu Ile 
                245                 250                 255     


Glu Ser Val Leu Ile Thr His Gln Pro Thr Trp Asp Asp Cys Gln Gln 
            260                 265                 270         


Leu Leu Gly Thr Leu Leu Thr Gly Glu Glu Lys Gln Arg Val Leu Leu 
        275                 280                 285             


Glu Ala Arg Lys Ala Val Arg Gly Asp Asp Gly Arg Pro Thr Gln Leu 
    290                 295                 300                 


Pro Asn Glu Val Asp Ala Ala Phe Pro Leu Glu Arg Pro Asp Trp Asp 
305                 310                 315                 320 


Tyr Thr Thr Gln Ala Gly Arg Asn His Leu Val His Tyr Arg Gln Leu 
                325                 330                 335     


Leu Leu Ala Gly Leu Gln Asn Ala Gly Arg Ser Pro Thr Asn Leu Ala 
            340                 345                 350         


Lys Val Lys Gly Ile Thr Gln Gly Pro Asn Glu Ser Pro Ser Ala Phe 
        355                 360                 365             


Leu Glu Arg Leu Lys Glu Ala Tyr Arg Arg Tyr Thr Pro Tyr Asp Pro 
    370                 375                 380                 


Glu Asp Pro Gly Gln Glu Thr Asn Val Ser Met Ser Phe Ile Trp Gln 
385                 390                 395                 400 


Ser Ala Pro Asp Ile Gly Arg Lys Leu Glu Arg Leu Glu Asp Leu Lys 
                405                 410                 415     


Asn Lys Thr Leu Gly Asp Leu Val Arg Glu Ala Glu Lys Ile Phe Asn 
            420                 425                 430         


Lys Arg Glu Thr Pro Glu Glu Arg Glu Glu Arg Ile Arg Arg Glu Thr 
        435                 440                 445             


Glu Glu Lys Glu Glu Arg Arg Arg Thr Glu Asp Glu Gln Lys Glu Lys 
    450                 455                 460                 


Glu Arg Asp Arg Arg Arg His Arg Glu Met Ser Lys Leu Leu Ala Thr 
465                 470                 475                 480 


Val Val Ser Gly Gln Lys Gln Asp Arg Gln Gly Gly Glu Arg Arg Arg 
                485                 490                 495     


Ser Gln Leu Asp Arg Asp Gln Cys Ala Tyr Cys Lys Glu Lys Gly His 
            500                 505                 510         


Trp Ala Lys Asp Cys Pro Lys Lys Pro Arg Gly Pro Arg Gly Pro Arg 
        515                 520                 525             


Pro Gln Thr Ser Leu Leu Thr Leu Asp Asp 
    530                 535             


<210>  14
<211>  1090
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Pol Polyprotein Amino Acid Sequence

<400>  14

Met Gly Pro Met Gly Gln Pro Leu Gln Val Leu Thr Leu Asn Ile Glu 
1               5                   10                  15      


Asp Glu Tyr Arg Leu His Glu Thr Ser Lys Glu Pro Asp Val Ser Leu 
            20                  25                  30          


Gly Ser Thr Trp Leu Ser Asp Phe Pro Gln Ala Trp Ala Glu Thr Gly 
        35                  40                  45              


Gly Met Gly Leu Ala Val Arg Gln Ala Pro Leu Ile Ile Pro Leu Lys 
    50                  55                  60                  


Ala Thr Ser Thr Pro Val Ser Ile Lys Gln Tyr Pro Met Ser Gln Glu 
65                  70                  75                  80  


Ala Arg Leu Gly Ile Lys Pro His Ile Gln Arg Leu Leu Asp Gln Gly 
                85                  90                  95      


Ile Leu Val Pro Cys Gln Ser Pro Trp Asn Thr Pro Leu Leu Pro Val 
            100                 105                 110         


Lys Lys Pro Gly Thr Asn Asp Tyr Arg Pro Val Gln Asp Leu Arg Glu 
        115                 120                 125             


Val Asn Lys Arg Val Glu Asp Ile His Pro Thr Val Pro Asn Pro Tyr 
    130                 135                 140                 


Asn Leu Leu Ser Gly Leu Pro Pro Ser His Gln Trp Tyr Thr Val Leu 
145                 150                 155                 160 


Asp Leu Lys Asp Ala Phe Phe Cys Leu Arg Leu His Pro Thr Ser Gln 
                165                 170                 175     


Pro Leu Phe Ala Phe Glu Trp Arg Asp Pro Glu Met Gly Ile Ser Gly 
            180                 185                 190         


Gln Leu Thr Trp Thr Arg Leu Pro Gln Gly Phe Lys Asn Ser Pro Thr 
        195                 200                 205             


Leu Phe Asp Glu Ala Leu His Arg Asp Leu Ala Asp Phe Arg Ile Gln 
    210                 215                 220                 


His Pro Asp Leu Ile Leu Leu Gln Tyr Val Asp Asp Leu Leu Leu Ala 
225                 230                 235                 240 


Ala Thr Ser Glu Leu Asp Cys Gln Gln Gly Thr Arg Ala Leu Leu Gln 
                245                 250                 255     


Thr Leu Gly Asn Leu Gly Tyr Arg Ala Ser Ala Lys Lys Ala Gln Ile 
            260                 265                 270         


Cys Gln Lys Gln Val Lys Tyr Leu Gly Tyr Leu Leu Lys Glu Gly Gln 
        275                 280                 285             


Arg Trp Leu Thr Glu Ala Arg Lys Glu Thr Val Met Gly Gln Pro Thr 
    290                 295                 300                 


Pro Lys Thr Pro Arg Gln Leu Arg Glu Phe Leu Gly Thr Ala Gly Phe 
305                 310                 315                 320 


Cys Arg Leu Trp Ile Pro Gly Phe Ala Glu Met Ala Ala Pro Leu Tyr 
                325                 330                 335     


Pro Leu Thr Lys Thr Gly Thr Leu Phe Asn Trp Gly Pro Asp Gln Gln 
            340                 345                 350         


Lys Ala Tyr Gln Glu Ile Lys Gln Ala Leu Leu Thr Ala Pro Ala Leu 
        355                 360                 365             


Gly Leu Pro Asp Leu Thr Lys Pro Phe Glu Leu Phe Val Asp Glu Lys 
    370                 375                 380                 


Gln Gly Tyr Ala Lys Gly Val Leu Thr Gln Lys Leu Gly Pro Trp Arg 
385                 390                 395                 400 


Arg Pro Val Ala Tyr Leu Ser Lys Lys Leu Asp Pro Val Ala Ala Gly 
                405                 410                 415     


Trp Pro Pro Cys Leu Arg Met Val Ala Ala Ile Ala Val Leu Thr Lys 
            420                 425                 430         


Asp Ala Gly Lys Leu Thr Met Gly Gln Pro Leu Val Ile Leu Ala Pro 
        435                 440                 445             


His Ala Val Glu Ala Leu Val Lys Gln Pro Pro Asp Arg Trp Leu Ser 
    450                 455                 460                 


Asn Ala Arg Met Thr His Tyr Gln Ala Leu Leu Leu Asp Thr Asp Arg 
465                 470                 475                 480 


Val Gln Phe Gly Pro Val Val Ala Leu Asn Pro Ala Thr Leu Leu Pro 
                485                 490                 495     


Leu Pro Glu Glu Gly Leu Gln His Asp Cys Leu Asp Ile Leu Ala Glu 
            500                 505                 510         


Ala His Gly Thr Arg Ser Asp Leu Thr Asp Gln Pro Leu Pro Asp Ala 
        515                 520                 525             


Asp His Thr Trp Tyr Thr Asp Gly Ser Ser Phe Leu Gln Glu Gly Gln 
    530                 535                 540                 


Arg Lys Ala Gly Ala Ala Val Thr Thr Glu Thr Glu Val Ile Trp Ala 
545                 550                 555                 560 


Arg Ala Leu Pro Ala Gly Thr Ser Ala Gln Arg Ala Glu Leu Ile Ala 
                565                 570                 575     


Leu Thr Gln Ala Leu Lys Met Ala Glu Gly Lys Lys Leu Asn Val Tyr 
            580                 585                 590         


Thr Asp Ser Arg Tyr Ala Phe Ala Thr Ala His Ile His Gly Glu Ile 
        595                 600                 605             


Tyr Arg Arg Arg Gly Leu Leu Thr Ser Glu Gly Lys Glu Ile Lys Asn 
    610                 615                 620                 


Lys Asp Glu Ile Leu Ala Leu Leu Lys Ala Leu Phe Leu Pro Lys Arg 
625                 630                 635                 640 


Leu Ser Ile Ile His Cys Pro Gly His Gln Lys Gly Asn Ser Ala Glu 
                645                 650                 655     


Ala Arg Gly Asn Arg Met Ala Asp Gln Ala Ala Arg Glu Val Ala Thr 
            660                 665                 670         


Arg Glu Thr Pro Gly Thr Ser Thr Leu Leu Ile Glu Asn Ser Thr Pro 
        675                 680                 685             


Tyr Thr His Glu His Phe His Tyr Thr Val Thr Asp Thr Lys Asp Leu 
    690                 695                 700                 


Thr Lys Leu Gly Ala Thr Tyr Asp Ser Ala Lys Lys Tyr Trp Val Tyr 
705                 710                 715                 720 


Gln Gly Lys Pro Val Met Pro Asp Gln Phe Thr Phe Glu Leu Leu Asp 
                725                 730                 735     


Phe Leu His Gln Leu Thr His Leu Ser Phe Ser Lys Thr Lys Ala Leu 
            740                 745                 750         


Leu Glu Arg Ser Pro Ser Pro Tyr Tyr Met Leu Asn Arg Asp Arg Thr 
        755                 760                 765             


Leu Lys Asn Ile Thr Glu Thr Cys Lys Ala Cys Ala Gln Val Asn Ala 
    770                 775                 780                 


Ser Lys Ser Ala Val Lys Gln Gly Thr Arg Val Arg Gly His Arg Pro 
785                 790                 795                 800 


Gly Thr His Trp Glu Ile Asp Phe Thr Glu Val Lys Pro Gly Leu Tyr 
                805                 810                 815     


Gly Tyr Lys Tyr Leu Leu Val Phe Val Asp Thr Phe Ser Gly Trp Ile 
            820                 825                 830         


Glu Ala Phe Pro Thr Lys Lys Glu Thr Ala Lys Val Val Thr Lys Lys 
        835                 840                 845             


Leu Leu Glu Glu Ile Phe Pro Arg Phe Gly Met Pro Gln Val Leu Gly 
    850                 855                 860                 


Thr Asp Asn Gly Pro Ala Phe Val Ser Lys Val Ser Gln Thr Val Ala 
865                 870                 875                 880 


Asp Leu Leu Gly Ile Asp Trp Lys Leu His Cys Ala Tyr Arg Pro Gln 
                885                 890                 895     


Ser Ser Gly Gln Val Glu Arg Met Asn Arg Thr Ile Lys Glu Thr Leu 
            900                 905                 910         


Thr Lys Leu Thr Leu Ala Thr Gly Ser Arg Asp Trp Val Leu Leu Leu 
        915                 920                 925             


Pro Leu Ala Leu Tyr Arg Ala Arg Asn Thr Pro Gly Pro His Gly Leu 
    930                 935                 940                 


Thr Pro Tyr Glu Ile Leu Tyr Gly Ala Pro Pro Pro Leu Val Asn Phe 
945                 950                 955                 960 


Pro Asp Pro Asp Met Thr Arg Val Thr Asn Ser Pro Ser Leu Gln Ala 
                965                 970                 975     


His Leu Gln Ala Leu Tyr Leu Val Gln His Glu Val Trp Arg Pro Leu 
            980                 985                 990         


Ala Ala Ala Tyr Gln Glu Gln Leu  Asp Arg Pro Val Val  Pro His Pro 
        995                 1000                 1005             


Tyr Arg  Val Gly Asp Thr Val  Trp Val Arg Arg His  Gln Thr Lys 
    1010                 1015                 1020             


Asn Leu  Glu Pro Arg Trp Lys  Gly Pro Tyr Thr Val  Leu Leu Thr 
    1025                 1030                 1035             


Thr Pro  Thr Ala Leu Lys Val  Asp Gly Ile Ala Ala  Trp Ile His 
    1040                 1045                 1050             


Ala Ala  His Val Lys Ala Ala  Asp Thr Glu Ser Gly  Pro Ser Ser 
    1055                 1060                 1065             


Gly Arg  Thr Trp Arg Val Gln  Arg Ser Gln Asn Pro  Leu Lys Ile 
    1070                 1075                 1080             


Arg Leu  Thr Arg Gly Ser Pro  
    1085                 1090 


<210>  15
<211>  565
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  RD114 Envelope Amino Acid Sequence

<400>  15

Met Lys Leu Pro Thr Gly Met Val Ile Leu Cys Ser Leu Ile Ile Val 
1               5                   10                  15      


Arg Ala Gly Phe Asp Asp Pro Arg Lys Ala Ile Ala Leu Val Gln Lys 
            20                  25                  30          


Gln His Gly Lys Pro Cys Glu Cys Ser Gly Gly Gln Val Ser Glu Ala 
        35                  40                  45              


Pro Pro Asn Ser Ile Gln Gln Val Thr Cys Pro Gly Lys Thr Ala Tyr 
    50                  55                  60                  


Leu Met Thr Asn Gln Lys Trp Lys Cys Arg Val Thr Pro Lys Asn Leu 
65                  70                  75                  80  


Thr Pro Ser Gly Gly Glu Leu Gln Asn Cys Pro Cys Asn Thr Phe Gln 
                85                  90                  95      


Asp Ser Met His Ser Ser Cys Tyr Thr Glu Tyr Arg Gln Cys Arg Ala 
            100                 105                 110         


Asn Asn Lys Thr Tyr Tyr Thr Ala Thr Leu Leu Lys Ile Arg Ser Gly 
        115                 120                 125             


Ser Leu Asn Glu Val Gln Ile Leu Gln Asn Pro Asn Gln Leu Leu Gln 
    130                 135                 140                 


Ser Pro Cys Arg Gly Ser Ile Asn Gln Pro Val Cys Trp Ser Ala Thr 
145                 150                 155                 160 


Ala Pro Ile His Ile Ser Asp Gly Gly Gly Pro Leu Asp Thr Lys Arg 
                165                 170                 175     


Val Trp Thr Val Gln Lys Arg Leu Glu Gln Ile His Lys Ala Met His 
            180                 185                 190         


Pro Glu Leu Gln Tyr His Pro Leu Ala Leu Pro Lys Val Arg Asp Asp 
        195                 200                 205             


Leu Ser Leu Asp Ala Arg Thr Phe Asp Ile Leu Asn Thr Thr Phe Arg 
    210                 215                 220                 


Leu Leu Gln Met Ser Asn Phe Ser Leu Ala Gln Asp Cys Trp Leu Cys 
225                 230                 235                 240 


Leu Lys Leu Gly Thr Pro Thr Pro Leu Ala Ile Pro Thr Pro Ser Leu 
                245                 250                 255     


Thr Tyr Ser Leu Ala Asp Ser Leu Ala Asn Ala Ser Cys Gln Ile Ile 
            260                 265                 270         


Pro Pro Leu Leu Val Gln Pro Met Gln Phe Ser Asn Ser Ser Cys Leu 
        275                 280                 285             


Ser Ser Pro Phe Ile Asn Asp Thr Glu Gln Ile Asp Leu Gly Ala Val 
    290                 295                 300                 


Thr Phe Thr Asn Cys Thr Ser Val Ala Asn Val Ser Ser Pro Leu Cys 
305                 310                 315                 320 


Ala Leu Asn Gly Ser Val Phe Leu Cys Gly Asn Asn Met Ala Tyr Thr 
                325                 330                 335     


Tyr Leu Pro Gln Asn Trp Thr Gly Leu Cys Val Gln Ala Ser Leu Leu 
            340                 345                 350         


Pro Asp Ile Asp Ile Ile Pro Gly Asp Glu Pro Val Pro Ile Pro Ala 
        355                 360                 365             


Ile Asp His Tyr Ile His Arg Pro Lys Arg Ala Val Gln Phe Ile Pro 
    370                 375                 380                 


Leu Leu Ala Gly Leu Gly Ile Thr Ala Ala Phe Thr Thr Gly Ala Thr 
385                 390                 395                 400 


Gly Leu Gly Val Ser Val Thr Gln Tyr Thr Lys Leu Ser His Gln Leu 
                405                 410                 415     


Ile Ser Asp Val Gln Val Leu Ser Gly Thr Ile Gln Asp Leu Gln Asp 
            420                 425                 430         


Gln Val Asp Ser Leu Ala Glu Val Val Leu Gln Asn Arg Arg Gly Leu 
        435                 440                 445             


Asp Leu Leu Thr Ala Glu Gln Gly Gly Ile Cys Leu Ala Leu Gln Glu 
    450                 455                 460                 


Lys Cys Cys Phe Tyr Ala Asn Lys Ser Gly Ile Val Arg Asn Lys Ile 
465                 470                 475                 480 


Arg Thr Leu Gln Glu Glu Leu Gln Lys Arg Arg Glu Ser Leu Ala Ser 
                485                 490                 495     


Asn Pro Leu Trp Thr Gly Leu Gln Gly Phe Leu Pro Tyr Leu Leu Pro 
            500                 505                 510         


Leu Leu Gly Pro Leu Leu Thr Leu Leu Leu Ile Leu Thr Ile Gly Pro 
        515                 520                 525             


Cys Val Phe Ser Arg Leu Met Ala Phe Ile Asn Asp Arg Leu Asn Val 
    530                 535                 540                 


Val His Ala Met Val Leu Ala Gln Gln Tyr Gln Ala Leu Lys Ala Glu 
545                 550                 555                 560 


Glu Glu Ala Gln Asp 
                565 


<210>  16
<211>  667
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GaLV Envelope Amino Acid Sequence

<400>  16

Met Val Leu Leu Pro Gly Ser Met Leu Leu Thr Ser Asn Leu His His 
1               5                   10                  15      


Leu Arg His Gln Met Ser Pro Gly Ser Trp Lys Arg Leu Ile Ile Leu 
            20                  25                  30          


Leu Ser Cys Val Phe Gly Gly Gly Gly Thr Ser Leu Gln Asn Lys Asn 
        35                  40                  45              


Pro His Gln Pro Met Thr Leu Thr Trp Gln Val Leu Ser Gln Thr Gly 
    50                  55                  60                  


Asp Val Val Trp Asp Thr Lys Ala Val Gln Pro Pro Trp Thr Trp Trp 
65                  70                  75                  80  


Pro Thr Leu Lys Pro Asp Val Cys Ala Leu Ala Ala Ser Leu Glu Ser 
                85                  90                  95      


Trp Asp Ile Pro Gly Thr Asp Val Ser Ser Ser Lys Arg Val Arg Pro 
            100                 105                 110         


Pro Asp Ser Asp Tyr Thr Ala Ala Tyr Lys Gln Ile Thr Trp Gly Ala 
        115                 120                 125             


Ile Gly Cys Ser Tyr Pro Arg Ala Arg Thr Arg Met Ala Ser Ser Thr 
    130                 135                 140                 


Phe Tyr Val Cys Pro Arg Asp Gly Arg Thr Leu Ser Glu Ala Arg Arg 
145                 150                 155                 160 


Cys Gly Gly Leu Glu Ser Leu Tyr Cys Lys Glu Trp Asp Cys Glu Thr 
                165                 170                 175     


Thr Gly Thr Gly Tyr Trp Leu Ser Lys Ser Ser Lys Asp Leu Ile Thr 
            180                 185                 190         


Leu Lys Trp Asp Gln Asn Ser Glu Trp Thr Gln Lys Phe Gln Gln Cys 
        195                 200                 205             


His Gln Thr Gly Trp Cys Asn Pro Leu Lys Ile Asp Phe Thr Asp Lys 
    210                 215                 220                 


Gly Lys Leu Ser Lys Asp Trp Ile Thr Gly Lys Thr Trp Gly Leu Arg 
225                 230                 235                 240 


Phe Tyr Val Ser Gly His Pro Gly Val Gln Phe Thr Ile Arg Leu Lys 
                245                 250                 255     


Ile Thr Asn Met Pro Ala Val Ala Val Gly Pro Asp Leu Val Leu Val 
            260                 265                 270         


Glu Gln Gly Pro Pro Arg Thr Ser Leu Ala Leu Pro Pro Pro Leu Pro 
        275                 280                 285             


Pro Arg Glu Ala Pro Pro Pro Ser Leu Pro Asp Ser Asn Ser Thr Ala 
    290                 295                 300                 


Leu Ala Thr Ser Ala Gln Thr Pro Thr Val Arg Lys Thr Ile Val Thr 
305                 310                 315                 320 


Leu Asn Thr Pro Pro Pro Thr Thr Gly Asp Arg Leu Phe Asp Leu Val 
                325                 330                 335     


Gln Gly Ala Phe Leu Thr Leu Asn Ala Thr Asn Pro Gly Ala Thr Glu 
            340                 345                 350         


Ser Cys Trp Leu Cys Leu Ala Met Gly Pro Pro Tyr Tyr Glu Ala Ile 
        355                 360                 365             


Ala Ser Ser Gly Glu Val Ala Tyr Ser Thr Asp Leu Asp Arg Cys Arg 
    370                 375                 380                 


Trp Gly Thr Gln Gly Lys Leu Thr Leu Thr Glu Val Ser Gly His Gly 
385                 390                 395                 400 


Leu Cys Ile Gly Lys Val Pro Phe Thr His Gln His Leu Cys Asn Gln 
                405                 410                 415     


Thr Leu Ser Ile Asn Ser Ser Gly Asp His Gln Tyr Leu Leu Pro Ser 
            420                 425                 430         


Asn His Ser Trp Trp Ala Cys Ser Thr Gly Leu Thr Pro Cys Leu Ser 
        435                 440                 445             


Thr Ser Val Phe Asn Gln Thr Arg Asp Phe Cys Ile Gln Val Gln Leu 
    450                 455                 460                 


Ile Pro Arg Ile Tyr Tyr Tyr Pro Glu Glu Val Leu Leu Gln Ala Tyr 
465                 470                 475                 480 


Asp Asn Ser His Pro Arg Thr Lys Arg Glu Ala Val Ser Leu Thr Leu 
                485                 490                 495     


Ala Val Leu Leu Gly Leu Gly Ile Thr Ala Gly Ile Gly Thr Gly Ser 
            500                 505                 510         


Thr Ala Leu Ile Lys Gly Pro Ile Asp Leu Gln Gln Gly Leu Thr Ser 
        515                 520                 525             


Leu Gln Ile Ala Ile Asp Ala Asp Leu Arg Ala Leu Gln Asp Ser Val 
    530                 535                 540                 


Ser Lys Leu Glu Asp Ser Leu Thr Ser Leu Ser Glu Val Val Leu Gln 
545                 550                 555                 560 


Asn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly Gly Leu Cys 
                565                 570                 575     


Ala Ala Leu Lys Glu Glu Cys Cys Phe Tyr Ile Asp His Ser Gly Ala 
            580                 585                 590         


Val Arg Asp Ser Met Lys Lys Leu Lys Glu Lys Leu Asp Lys Arg Gln 
        595                 600                 605             


Leu Glu Arg Gln Lys Ser Gln Asn Trp Tyr Glu Gly Trp Phe Asn Asn 
    610                 615                 620                 


Ser Pro Trp Phe Thr Thr Leu Leu Ser Thr Ile Ala Gly Pro Leu Leu 
625                 630                 635                 640 


Leu Leu Leu Leu Leu Leu Ile Leu Gly Pro Cys Ile Ile Asn Lys Leu 
                645                 650                 655     


Val Gln Phe Ile Asn Asp Arg Ile Ser Ala Cys 
            660                 665         


<210>  17
<211>  1019
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Nucleotide sequence encoding the packaging signal in the genome 
       plasmid

<400>  17
aagctggcca gcaacttatc tgtgtctgtc cgattgtcta gtgtctatga ctgattttat       60

gcgcctgcgt cggtactagt tagctaacta gctctgtatc tggcggaccc gtggtggaac      120

tgacgagttc ggaacacccg gccgcaaccc tgggagacgt cccagggact tcgggggccg      180

tttttgtggc ccgacctgag tcctaaaatc ccgatcgttt aggactcttt ggtgcacccc      240

ccttagagga gggatatgtg gttctggtag gagacgagaa cctaaaacag ttcccgcctc      300

cgtctgaatt tttgctttcg gtttgggacc gaagccgcgc cgcgcgtctt gtctgctgca      360

gcatcgttct gtgttgtctc tgtctgactg tgtttctgta tttgtctgaa aatatgggcc      420

cgggctagcc tgttaccact cccttaagtt tgaccttagg tcactggaaa gatgtcgagc      480

ggatcgctca caaccagtcg gtagatgtca agaagagacg ttgggttacc ttctgctctg      540

cagaatggcc aacctttaac gtcggatggc cgcgagacgg cacctttaac cgagacctca      600

tcacccaggt taagatcaag gtcttttcac ctggcccgca tggacaccca gaccaggtcc      660

cctacatcgt gacctgggaa gccttggctt ttgacccccc tccctgggtc aagccctttg      720

tacaccctaa gcctccgcct cctcttcctc catccgcccc gtctctcccc cttgaacctc      780

ctcgttcgac cccgcctcga tcctcccttt atccagccct cactccttct ctaggcgccc      840

ccatatggcc atatgagatc ttatatgggg cacccccgcc ccttgtaaac ttccctgacc      900

ctgacatgac aagagttact aacagcccct ctctccaagc tcacttacag gctctctact      960

tagtccagca cgaagtctgg agacctctgg cggcagccta ccaagaacaa ctggaccga      1019


<210>  18
<211>  573
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Nucleotide sequence of CMV promoter

<400>  18
agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac       60

ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa      120

tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt      180

atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc      240

ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat      300

gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc atgctgatgc      360

ggttttggca gtacatcaat gggcgtggat agcggtttga ctcacgggga tttccaagtc      420

tccaccccat tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa      480

aatgtcgtaa caactccgcc ccattgacgc aaatgggcgg taggcgtgta cggtgggagg      540

tctatataag cagagctggt ttagtgaacc gtc                                   573


<210>  19
<211>  1691
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Nucleotide sequence of CAG promoter

<400>  19
atagtaatca attacggggt cattagttca tagcccatat atggagttcc gcgttacata       60

acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat tgacgtcaat      120

aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc aatgggtgga      180

ctatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc caagtacgcc      240

ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt acatgacctt      300

atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta ccatgcgtcg      360

aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca cccccaattt      420

tgtatttatt tattttttaa ttattttatg cagcgatggg ggcggggggg gggggggcgc      480

gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag aggtgcggcg      540

gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg gcggcggcgg      600

cggccctata aaaagcgaag cgcgcggcgg gcgggagtcg ctgcgttgcc ttcgccccgt      660

gccccgctcc gcgccgcctc gcgccgcccg ccccggctct gactgaccgc gttactccca      720

caggtgagcg ggcgggacgg cccttctccc tccgggctgt aattagcgct tggtttaatg      780

acggctcgtt tcttttctgt ggctgcgtga aagccttaaa gggctccggg agggcctttg      840

tgcggggggg agcggctcgg ggggtgcgtg cgtgtgtgtg tgcgtgggga gcgccgcgtg      900

cggcccgcgc tgcccggcgg ctgtgagcgc tgcgggcgcg gcgcggggct ttgtgcgctc      960

cgcgtgtgcg cgaggggagc gcgggccggg ggcggtgccc cgcggtgcgg gggggctgcg     1020

aggggaacaa aggctgcgtg cggggtgtgt gcgtgggggg gtgagcaggg ggtgtgggcg     1080

cggcggtcgg gctgtaaccc ccccctggca cccccctccc cgagttgctg agcacggccc     1140

ggcttcgggt gcggggctcc gtgcggggcg tggcgcgggg ctcgccgtgc cgggcggggg     1200

gtggcggcag gtgggggtgc cgggcggggc ggggccgcct cgggccgggg agggctcggg     1260

ggaggggcgc ggcggccccg gagcgccggc ggctgtcgag gcgcggcgag ccgcagccat     1320

tgccttttat ggtaatcgtg cgagagggcg cagggacttc ctttgtccca aatctggcgg     1380

agccgaaatc tgggaggcgc cgccgcaccc cctctagcgg gcgcgggcga agcggtgcgg     1440

cgccggcagg aaggaaatgg gcggggaggg ccttcgtgcg tcgccgcgcc gccgtcccct     1500

tctccatctc cagcctcggg gctgccgcag ggggacggct gccttcgggg gggacggggc     1560

agggcggggt tcggcttctg gcgtgtgacc ggcggcttta gagcctctgc taaccatgtt     1620

catgccttct tctttttcct acagctcctg ggcaacgtgc tggttgttgt gctgtctcat     1680

cattttggca a                                                          1691


<210>  20
<211>  167
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Nucleotide sequence of Rabbit beta-globin polyA

<400>  20
ctggtgtggc caatgccctg gctcacaaat accactgacg atctttttcc ctctgccaaa       60

aattatgggg acatcatgaa gccccttgag catctgactt ctggctaata aaggaaattt      120

attttcattg caatagtgtg ttggaatttt ttgtgtctct cactcgg                    167


<210>  21
<211>  642
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Nucleotide sequence of BGIntron

<400>  21
agatcgcctg gagacgccat ccacgctgtt ttgacctcca tagaagacac cgggaccgat       60

ccagcctccc ctcgaagctt acatgtggta ccgagctcgg atcctgagaa cttcagggtg      120

agtctatggg acccttgatg ttttctttcc ccttcttttc tatggttaag ttcatgtcat      180

aggaagggga gaagtaacag ggtacacata ttgaccaaat cagggtaatt ttgcatttgt      240

aattttaaaa aatgctttct tcttttaata tacttttttg tttatcttat ttctaatact      300

ttccctaatc tctttctttc agggcaataa tgatacaatg tatcatgcct ctttgcacca      360

ttctaaagaa taacagtgat aatttctggg ttaaggcaat agcaatattt ctgcatataa      420

atatttctgc atataaattg taactgatgt aagaggtttc atattgctaa tagcagctac      480

aatccagcta ccattctgct tttattttat ggttgggata aggctggatt attctgagtc      540

caagctaggc ccttttgcta atcatgttca tacctcttat cttcctccca cagctcctgg      600

gcaacgtgct ggtctgtgtg ctggcccatc actttggcaa ag                         642


<210>  22
<211>  1047
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Nucleotide sequence of Ferritin promoter

<400>  22
agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac       60

ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa      120

tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt      180

atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc      240

ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat      300

gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc atgctgatgc      360

ggttttggca gtacatcaat gggcgtggat agcggtttga ctcacgggga tttccaagtc      420

tccaccccat tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa      480

aatgtcgtaa caactccgcc ccattgacgc aaatgggcgg taggcgtgta cggtgggagg      540

tctatataag cagagctggt ttagtgaacc ggatcccccg ggctgcagga atttatgaaa      600

tcctttatgg gggacccccc cctttgtcaa ccttgctcaa ttccttctcc ccctccgatc      660

ctaagactga tttacaagcc cgactaaaag ggctgcaagc ggtgcaggcc caaatctgga      720

cacccctggc cgaattgtac cggccaggac atccacaaac tagccaccca tttcaggtgg      780

gagactccgt gtacgtccgg cggcacgcct ctcaaggatt ggagcctcgt tggaagggac      840

cttacatcgt cctgctgacc acgcccaccg ccataaaggt tgacgggatc gccgcctgga      900

ttcacgcatc gcacgccaag gcagccccaa aaacccctgg accagaaact cccaaaacct      960

ggaagctccg ccgttcggag aaccctctta agataagact ctcccgtgtc tgactgctaa     1020

tccaccttgt ccctgtacta acccaaa                                         1047


<210>  23
<211>  191
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Nucleotide sequence of CMV-RD114UTR

<400>  23
actagttccg ccagagcgcg cgagggcctc cagcggccgc ccctccccca cagcaggggc       60

ggggtcccgc gcccaccgga aggagcgggc tcggggcggg cggcgctgat tggccggggc      120

gggcctgacg ccgacgcggc tataagagac cacaagcgac ccgcagggcc agacgttctt      180

cgccgaagct t                                                           191


<210>  24
<211>  192
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SV40 PolyA

<400>  24
cagacatgat aagatacatt gatgagtttg gacaaaccac aactagaatg cagtgaaaaa       60

aatgctttat ttgtgaaatt tgtgatgcta ttgctttatt tgtaaccatt ataagctgca      120

ataaacaagt taacaacaac aattgcattc attttatgtt tcaggttcag ggggaggtgt      180

gggaggtttt tt                                                          192


<210>  25
<211>  476
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  RD114 Intron

<400>  25
gatcccccgg gctgcaggaa tttatgaaat cctttatggg ggaccccccc ctttgtcaac       60

cttgctcaat tccttctccc cctccgatcc taagactgat ttacaagccc gactaaaagg      120

gctgcaagcg gtgcaggccc aaatctggac acccctggcc gaattgtacc ggccaggaca      180

tccacaaact agccacccat ttcaggtggg agactccgtg tacgtccggc ggcaccgctc      240

tcaaggattg gagcctcgtt ggaagggacc ttacatcgtc ctgctgacca cgcccaccgc      300

cataaaggtt gacgggatcg ccgcctggat tcacgcatcg cacgccaagg cagccccaaa      360

aacccctgga ccagaaactc ccaaaacctg gaagctccgc cgttcggaga accctcttaa      420

gataagactc tcccgtgtct gactgctaat ccaccttgtc cctgtactaa cccaaa          476


<210>  26
<211>  999
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  MEF1 Intron

<400>  26
gccgtcagaa cgcaggtgag gggcgggtgt ggcttccgcg ggccgccgag ctggaggtcc       60

tgctccgagc gggccgggcc ccgctgtcgt cggcggggat tagctgcgag cattcccgct      120

tcgagttgcg ggcggcgcgg gaggcagagt gcgaggccta gcggcaaccc cgtagcctcg      180

cctcgtgtcc ggcttgaggc ctagcgtggt gtccgcgccg ccgccgcgtg ctactccggc      240

cgcactctgg tctttttttt ttttgttgtt gttgccctgc tgccttcgat tgccgttcag      300

caataggggc taacaaaggg agggtgcggg gcttgctcgc ccggagcccg gagaggtcat      360

ggttggggag gaatggaggg acaggagtgg cggctggggc ccgcccgcct tcggagcaca      420

tgtccgacgc cacctggatg gggcgaggcc tggggttttt cccgaagcaa ccaggctggg      480

gttagcgtgc cgaggccatg tggccccagc acccggcacg atctggcttg gcggcgccgc      540

gttgccctgc ctccctaact agggtgaggc catcccgtcc ggcaccagtt gcgtgcgtgg      600

aaagatggcc gctcccgggc cctgttgcaa ggagctcaaa atggaggacg cggcagcccg      660

gtggagcggg cgggtgagtc acccacacaa aggaagaggg cctggtccct caccggctgc      720

tgcttcctgt gaccccgtgg tcctatcggc cgcaatagtc acctcgggct tttgagcacg      780

gctagtcgcg gcggggggag gggatgtaat ggcgttggag tttgttcaca tttggtgggt      840

ggagactagt caggccagcc tggcgctgga agtcattttt ggaatttgtc cccttgagtt      900

ttgagcggag ctaattctcg ggcttcttag cggttcaaag gtatctttta aacccttttt      960

taggtgttgt gaaaaccacc gctaattcaa agcaaccgg                             999


