                         SEQUENCE LISTING

<110>  bluebird bio, Inc.
       Goss, Kendrick
       Parsons, Geoffrey
 
<120>  GENE THERAPY FOR MUCOPOLYSACCHARIDOSIS, TYPE I

<130>  BLBD-081/01WO

<150>  US 62/430,795
<151>  2016-12-06

<160>  17    

<170>  PatentIn version 3.5

<210>  1
<211>  7833
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthesized lentiviral vector encoding an alpha-L iduronidase 
       (IDUA) polypeptide

<400>  1
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatcatat gccagcctat ggtgacattg attattgact agttattaat agtaatcaat      240

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa      300

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt      360

tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt atttacggta      420

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc ctattgacgt      480

caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat gggactttcc      540

tacttggcag tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca      600

gtacatcaat gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat      660

tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa aatgtcgtaa      720

caactccgcc ccattgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag      780

cagagctcgt ttagtgaacc gggtctctct ggttagacca gatctgagcc tgggagctct      840

ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgctcaaag      900

tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt      960

cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga aagtaaagcc     1020

agaggagatc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa gaggcgaggg     1080

gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg agagagtagg     1140

gtgcgagagc gtcggtatta agcgggggag aattagataa atgggaaaaa attcggttaa     1200

ggccaggggg aaagaaacaa tataaactaa aacatatagt tagggcaagc agggagctag     1260

aacgattcgc agttaatcct ggccttttag agacatcaga aggctgtaga caaatactgg     1320

gacagctaca accatccctt cagacaggat cagaagaact tagatcatta tataatacaa     1380

tagcagtcct ctattgtgtg catcaaagga tagatgtaaa agacaccaag gaagccttag     1440

ataagataga ggaagagcaa aacaaaagta agaaaaaggc acagcaagca gcagctgaca     1500

caggaaacaa cagccaggtc agccaaaatt accctatagt gcagaacctc caggggcaaa     1560

tggtacatca ggccatatca cctagaactt taaattaaga cagcagtaca aatggcagta     1620

ttcatccaca attttaaaag aaaagggggg attggggggt acagtgcagg ggaaagaata     1680

gtagacataa tagcaacaga catacaaact aaagaattac aaaaacaaat tacaaaaatt     1740

caaaattttc gggtttatta cagggacagc agagatccag tttggaaagg accagcaaag     1800

ctcctctgga aaggtgaagg ggcagtagta atacaagata atagtgacat aaaagtagtg     1860

ccaagaagaa aagcaaagat catcagggat tatggaaaac agatggcagg tgatgattgt     1920

gtggcaagta gacaggatga ggattaacac atggaaaaga ttagtaaaac accatagctc     1980

tagagcgatc ccgatcttca gacctggagg aggagatatg agggacaatt ggagaagtga     2040

attatataaa tataaagtag taaaaattga accattagga gtagcaccca ccaaggcaaa     2100

gagaagagtg gtgcagagag aaaaaagagc agtgggaata ggagctttgt tccttgggtt     2160

cttgggagca gcaggaagca ctatgggcgc agcgtcaatg acgctgacgg tacaggccag     2220

acaattattg tctggtatag tgcagcagca gaacaatttg ctgagggcta ttgaggcgca     2280

acagcatctg ttgcaactca cagtctgggg catcaagcag ctccaggcaa gaatcctggc     2340

tgtggaaaga tacctaaagg atcaacagct cctggggatt tggggttgct ctggaaaact     2400

catttgcacc actgctgtgc cttggaatgc tagttggagt aataaatctc tggaacagat     2460

ttggaatcac acgacctgga tggagtggga cagagaaatt aacaattaca caagcttggt     2520

aggtttaaga atagtttttg ctgtactttc tatagtgaat agagttaggc agggatattc     2580

accattatcg tttcagaccc acctcccaac cccgagggga cccgacaggc ccgaaggaat     2640

agaagaagaa ggtggagaga gagacagaga cagatccatt cgattagtga acggatccat     2700

ctcgacggaa tgaaagaccc cacctgtagg tttggcaagc taggatcaag gttaggaaca     2760

gagagacagc agaatatggg ccaaacagga tatctgtggt aagcagttcc tgccccggct     2820

cagggccaag aacagttgga acagcagaat atgggccaaa caggatatct gtggtaagca     2880

gttcctgccc cggctcaggg ccaagaacag atggtcccca gatgcggtcc cgccctcagc     2940

agtttctaga gaaccatcag atgtttccag ggtgccccaa ggacctgaaa tgaccctgtg     3000

ccttatttga actaaccaat cagttcgctt ctcgcttctg ttcgcgcgct tctgctcccc     3060

gagctcaata aaagagccca caacccctca ctcggcgcga ttcacctgac gcgtctacgc     3120

caccatgcgg cccctgaggc ccagggcggc gctcctggcc ctccttgcct ccctgttggc     3180

ggccccccct gtggcccccg cggaggcccc ccacctcgtg cacgtggatg ccgccagggc     3240

tctgtggcca ctccggcggt tctggcggag cacaggtttc tgcccaccat tgccgcactc     3300

ccaagctgat cagtacgtgc tgagctggga ccagcagctg aacctggctt acgtgggagc     3360

cgtgccgcac cggggcatca aacaagtccg gactcactgg ctcctggaac tcgtgactac     3420

ccgggggtca accggtcgcg gcttgtcgta caactttacc cacctggatg gctacctgga     3480

tcttctccgc gaaaaccagt tgctgccggg atttgagctc atggggtcgg cctccggcca     3540

cttcactgac ttcgaggaca agcaacaagt gttcgagtgg aaggacctgg tgtcctccct     3600

ggcccggaga tacatcggcc gctacggact ggcccacgtg tccaagtgga acttcgaaac     3660

ctggaatgag ccagaccacc acgacttcga caacgtgtcg atgaccatgc agggattcct     3720

gaactactac gacgcctgca gcgaagggtt gcgggccgca tcccccgccc ttcggcttgg     3780

cgggcccgga gactcctttc acaccccgcc gcggagcccg ctcagctggg gactgctgag     3840

acactgtcac gacggaacca acttcttcac tggcgaagcc ggagtcaggc tggactacat     3900

ttcgctgcat cgcaaggggg cgcggtcgtc catttcgatt ctggagcagg agaaggtcgt     3960

ggcacagcag atccgccagc tgttcccgaa gttcgctgat accccaatct acaacgacga     4020

agccgatccg cttgtcggct ggagcctgcc tcagccgtgg cgcgccgacg tgacctacgc     4080

ggctatggtg gtcaaggtca tcgcacagca ccagaacctc ctgctggcga acactacttc     4140

ggccttccct tacgcccttc tgtccaacga taacgccttc ctgtcctacc atccacatcc     4200

gttcgcccaa agaaccctga ctgcgcggtt ccaagtcaac aatacccgac cgcctcacgt     4260

gcaacttctg cgcaagcctg tgctcaccgc tatgggcctc ttggccctgc tggacgagga     4320

gcaactgtgg gccgaggtgt cccaggccgg gacggtgttg gactcaaacc acaccgtggg     4380

cgtgctggcc agcgcgcaca gaccccaggg acccgctgat gcatggcgcg cggccgtgct     4440

tatctacgca tctgacgaca ctagggccca tcccaaccgc tccgtcgccg tgaccctgag     4500

actgagagga gtgccacccg gtcctggcct cgtctatgtg acccgctacc tcgacaatgg     4560

actctgttcc cccgatggag aatggcgcag gctcgggcgg ccggtgttcc ctaccgccga     4620

acagtttaga agaatgcgcg ccgcggaaga tccggtggcc gcagcgcctc ggccgctgcc     4680

ggctggcgga cggctgaccc tgcgccctgc cctgcgactg ccgtcactcc tgctggtcca     4740

tgtctgcgcc cggcctgaga agccgccagg acaggtcacc cggctgcgcg ccctgccgct     4800

gacccaggga cagctcgtgc tcgtgtggtc cgacgagcac gtcggctcca agtgcctctg     4860

gacctatgaa atccagttca gccaggacgg gaaagcctac accccggtgt cgaggaagcc     4920

atccactttc aacctgttcg tgttctcacc tgacacgggt gccgtgtcag ggagctacag     4980

agtgcgggcc ctggactact gggcacggcc gggccccttc tccgacccgg tgccctacct     5040

ggaagtgcca gtgccgcgcg gaccgcctag ccccggcaac ccttagtaat gacaggtacc     5100

tttaagacca atgacttaca aggcagctgt agatcttagc cactttttaa aagaaaaggg     5160

gggactggaa gggctaattc actcccaaag aagacaagat ctgctttttg cctgtactgg     5220

gtctctctgg ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact     5280

gcttaagcct caataaagct tgccttgagt gcttcaatgt gtgtgttggt tttttgtgtg     5340

tcgaaattct agcgattcta gcttggcgta atcatggtca tagctgtttc ctgtgtgaaa     5400

ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt gtaaagcctg     5460

gggtgcctaa tgagtgagct aactcacatt aattgcgttg cgctcactgc ccgctttcca     5520

gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg ggagaggcgg     5580

tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg     5640

gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg     5700

ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa     5760

ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg     5820

acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc     5880

tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc     5940

ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc     6000

ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg     6060

ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc     6120

actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga     6180

gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg gtatctgcgc     6240

tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac     6300

caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg     6360

atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc     6420

acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa     6480

ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta     6540

ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt     6600

tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag     6660

tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag caataaacca     6720

gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc     6780

tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt     6840

tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag     6900

ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt     6960

tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat     7020

ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt     7080

gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc     7140

ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa aagtgctcat     7200

cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag     7260

ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt tcaccagcgt     7320

ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg     7380

gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt atcagggtta     7440

ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc     7500

gcgcacattt ccccgaaaag tgccacctgg gactagcttt ttgcaaaagc ctaggcctcc     7560

aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct cggcctctgc     7620

ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg gcggagttag     7680

gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga gatgagcttg     7740

catgccgaca ttgattattg actagtccct aagaaaccat tcttatcatg acattaacct     7800

ataaaaatag gcgtatcacg aggccctttc gtc                                  7833


<210>  2
<211>  7967
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthesized lentiviral vector encoding an IDUA polypeptide

<400>  2
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatcatat gccagcctat ggtgacattg attattgact agttattaat agtaatcaat      240

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa      300

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt      360

tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt atttacggta      420

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc ctattgacgt      480

caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat gggactttcc      540

tacttggcag tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca      600

gtacatcaat gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat      660

tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa aatgtcgtaa      720

caactccgcc ccattgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag      780

cagagctcgt ttagtgaacc gggtctctct ggttagacca gatctgagcc tgggagctct      840

ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgctcaaag      900

tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt      960

cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga aagtaaagcc     1020

agaggagatc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa gaggcgaggg     1080

gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg agagagtagg     1140

gtgcgagagc gtcggtatta agcgggggag aattagataa atgggaaaaa attcggttaa     1200

ggccaggggg aaagaaacaa tataaactaa aacatatagt tagggcaagc agggagctag     1260

aacgattcgc agttaatcct ggccttttag agacatcaga aggctgtaga caaatactgg     1320

gacagctaca accatccctt cagacaggat cagaagaact tagatcatta tataatacaa     1380

tagcagtcct ctattgtgtg catcaaagga tagatgtaaa agacaccaag gaagccttag     1440

ataagataga ggaagagcaa aacaaaagta agaaaaaggc acagcaagca gcagctgaca     1500

caggaaacaa cagccaggtc agccaaaatt accctatagt gcagaacctc caggggcaaa     1560

tggtacatca ggccatatca cctagaactt taaattaaga cagcagtaca aatggcagta     1620

ttcatccaca attttaaaag aaaagggggg attggggggt acagtgcagg ggaaagaata     1680

gtagacataa tagcaacaga catacaaact aaagaattac aaaaacaaat tacaaaaatt     1740

caaaattttc gggtttatta cagggacagc agagatccag tttggaaagg accagcaaag     1800

ctcctctgga aaggtgaagg ggcagtagta atacaagata atagtgacat aaaagtagtg     1860

ccaagaagaa aagcaaagat catcagggat tatggaaaac agatggcagg tgatgattgt     1920

gtggcaagta gacaggatga ggattaacac atggaaaaga ttagtaaaac accatagctc     1980

tagagcgatc ccgatcttca gacctggagg aggagatatg agggacaatt ggagaagtga     2040

attatataaa tataaagtag taaaaattga accattagga gtagcaccca ccaaggcaaa     2100

gagaagagtg gtgcagagag aaaaaagagc agtgggaata ggagctttgt tccttgggtt     2160

cttgggagca gcaggaagca ctatgggcgc agcgtcaatg acgctgacgg tacaggccag     2220

acaattattg tctggtatag tgcagcagca gaacaatttg ctgagggcta ttgaggcgca     2280

acagcatctg ttgcaactca cagtctgggg catcaagcag ctccaggcaa gaatcctggc     2340

tgtggaaaga tacctaaagg atcaacagct cctggggatt tggggttgct ctggaaaact     2400

catttgcacc actgctgtgc cttggaatgc tagttggagt aataaatctc tggaacagat     2460

ttggaatcac acgacctgga tggagtggga cagagaaatt aacaattaca caagcttggt     2520

aggtttaaga atagtttttg ctgtactttc tatagtgaat agagttaggc agggatattc     2580

accattatcg tttcagaccc acctcccaac cccgagggga cccgacaggc ccgaaggaat     2640

agaagaagaa ggtggagaga gagacagaga cagatccatt cgattagtga acggatccaa     2700

ggatctgcga tcgctccggt gcccgtcagt gggcagagcg cacatcgccc acagtccccg     2760

agaagttggg gggaggggtc ggcaattgaa cgggtgccta gagaaggtgg cgcggggtaa     2820

actgggaaag tgatgtcgtg tactggctcc gcctttttcc cgagggtggg ggagaaccgt     2880

atataagtgc agtagtcgcc gtgaacgttc tttttcgcaa cgggtttgcc gccagaacac     2940

agctgaagct tcgaggggct cgcatctctc cttcacgcgc ccgccgccct acctgaggcc     3000

gccatccacg ccggttgagt cgcgttctgc cgcctcccgc ctgtggtgcc tcctgaactg     3060

cgtccgccgt ctaggtaagt ttaaagctca ggtcgagacc gggcctttgt ccggcgctcc     3120

cttggagcct acctagactc agccggctct ccacgctttg cctgaccctg cttgctcaac     3180

tctacgtctt tgtttcgttt tctgttctgc gccgttacag atccaagctg tgaccggcgc     3240

ctacgcgtct acgccaccat gcggcccctg aggcccaggg cggcgctcct ggccctcctt     3300

gcctccctgt tggcggcccc ccctgtggcc cccgcggagg ccccccacct cgtgcacgtg     3360

gatgccgcca gggctctgtg gccactccgg cggttctggc ggagcacagg tttctgccca     3420

ccattgccgc actcccaagc tgatcagtac gtgctgagct gggaccagca gctgaacctg     3480

gcttacgtgg gagccgtgcc gcaccggggc atcaaacaag tccggactca ctggctcctg     3540

gaactcgtga ctacccgggg gtcaaccggt cgcggcttgt cgtacaactt tacccacctg     3600

gatggctacc tggatcttct ccgcgaaaac cagttgctgc cgggatttga gctcatgggg     3660

tcggcctccg gccacttcac tgacttcgag gacaagcaac aagtgttcga gtggaaggac     3720

ctggtgtcct ccctggcccg gagatacatc ggccgctacg gactggccca cgtgtccaag     3780

tggaacttcg aaacctggaa tgagccagac caccacgact tcgacaacgt gtcgatgacc     3840

atgcagggat tcctgaacta ctacgacgcc tgcagcgaag ggttgcgggc cgcatccccc     3900

gcccttcggc ttggcgggcc cggagactcc tttcacaccc cgccgcggag cccgctcagc     3960

tggggactgc tgagacactg tcacgacgga accaacttct tcactggcga agccggagtc     4020

aggctggact acatttcgct gcatcgcaag ggggcgcggt cgtccatttc gattctggag     4080

caggagaagg tcgtggcaca gcagatccgc cagctgttcc cgaagttcgc tgatacccca     4140

atctacaacg acgaagccga tccgcttgtc ggctggagcc tgcctcagcc gtggcgcgcc     4200

gacgtgacct acgcggctat ggtggtcaag gtcatcgcac agcaccagaa cctcctgctg     4260

gcgaacacta cttcggcctt cccttacgcc cttctgtcca acgataacgc cttcctgtcc     4320

taccatccac atccgttcgc ccaaagaacc ctgactgcgc ggttccaagt caacaatacc     4380

cgaccgcctc acgtgcaact tctgcgcaag cctgtgctca ccgctatggg cctcttggcc     4440

ctgctggacg aggagcaact gtgggccgag gtgtcccagg ccgggacggt gttggactca     4500

aaccacaccg tgggcgtgct ggccagcgcg cacagacccc agggacccgc tgatgcatgg     4560

cgcgcggccg tgcttatcta cgcatctgac gacactaggg cccatcccaa ccgctccgtc     4620

gccgtgaccc tgagactgag aggagtgcca cccggtcctg gcctcgtcta tgtgacccgc     4680

tacctcgaca atggactctg ttcccccgat ggagaatggc gcaggctcgg gcggccggtg     4740

ttccctaccg ccgaacagtt tagaagaatg cgcgccgcgg aagatccggt ggccgcagcg     4800

cctcggccgc tgccggctgg cggacggctg accctgcgcc ctgccctgcg actgccgtca     4860

ctcctgctgg tccatgtctg cgcccggcct gagaagccgc caggacaggt cacccggctg     4920

cgcgccctgc cgctgaccca gggacagctc gtgctcgtgt ggtccgacga gcacgtcggc     4980

tccaagtgcc tctggaccta tgaaatccag ttcagccagg acgggaaagc ctacaccccg     5040

gtgtcgagga agccatccac tttcaacctg ttcgtgttct cacctgacac gggtgccgtg     5100

tcagggagct acagagtgcg ggccctggac tactgggcac ggccgggccc cttctccgac     5160

ccggtgccct acctggaagt gccagtgccg cgcggaccgc ctagccccgg caacccttag     5220

taatgacagg tacctttaag accaatgact tacaaggcag ctgtagatct tagccacttt     5280

ttaaaagaaa aggggggact ggaagggcta attcactccc aaagaagaca agatctgctt     5340

tttgcctgta ctgggtctct ctggttagac cagatctgag cctgggagct ctctggctaa     5400

ctagggaacc cactgcttaa gcctcaataa agcttgcctt gagtgcttca atgtgtgtgt     5460

tggttttttg tgtgtcgaaa ttctagcgat tctagcttgg cgtaatcatg gtcatagctg     5520

tttcctgtgt gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata     5580

aagtgtaaag cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca     5640

ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc     5700

gcggggagag gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg     5760

cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta     5820

tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc     5880

aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag     5940

catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac     6000

caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc     6060

ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt     6120

aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc     6180

gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga     6240

cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta     6300

ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag aagaacagta     6360

tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga     6420

tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg     6480

cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag     6540

tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc     6600

tagatccttt taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact     6660

tggtctgaca gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt     6720

cgttcatcca tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta     6780

ccatctggcc ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta     6840

tcagcaataa accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc     6900

gcctccatcc agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat     6960

agtttgcgca acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt     7020

atggcttcat tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg     7080

tgcaaaaaag cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca     7140

gtgttatcac tcatggttat ggcagcactg cataattctc ttactgtcat gccatccgta     7200

agatgctttt ctgtgactgg tgagtactca accaagtcat tctgagaata gtgtatgcgg     7260

cgaccgagtt gctcttgccc ggcgtcaata cgggataata ccgcgccaca tagcagaact     7320

ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa aactctcaag gatcttaccg     7380

ctgttgagat ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt     7440

actttcacca gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga     7500

ataagggcga cacggaaatg ttgaatactc atactcttcc tttttcaata ttattgaagc     7560

atttatcagg gttattgtct catgagcgga tacatatttg aatgtattta gaaaaataaa     7620

caaatagggg ttccgcgcac atttccccga aaagtgccac ctgggactag ctttttgcaa     7680

aagcctaggc ctccaaaaaa gcctcctcac tacttctgga atagctcaga ggccgaggcg     7740

gcctcggcct ctgcataaat aaaaaaaatt agtcagccat ggggcggaga atgggcggaa     7800

ctgggcggag ttaggggcgg gatgggcgga gttaggggcg ggactatggt tgctgactaa     7860

ttgagatgag cttgcatgcc gacattgatt attgactagt ccctaagaaa ccattcttat     7920

catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtc                   7967


<210>  3
<211>  3
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  3

Gly Gly Gly 
1           


<210>  4
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  4

Asp Gly Gly Gly Ser 
1               5   


<210>  5
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  5

Thr Gly Glu Lys Pro 
1               5   


<210>  6
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  6

Gly Gly Arg Arg 
1               


<210>  7
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  7

Gly Gly Gly Gly Ser 
1               5   


<210>  8
<211>  14
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  8

Glu Gly Lys Ser Ser Gly Ser Gly Ser Glu Ser Lys Val Asp 
1               5                   10                  


<210>  9
<211>  18
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  9

Lys Glu Ser Gly Ser Val Ser Ser Glu Gln Leu Ala Gln Phe Arg Ser 
1               5                   10                  15      


Leu Asp 
        


<210>  10
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  10

Gly Gly Arg Arg Gly Gly Gly Ser 
1               5               


<210>  11
<211>  9
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  11

Leu Arg Gln Arg Asp Gly Glu Arg Pro 
1               5                   


<210>  12
<211>  12
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  12

Leu Arg Gln Lys Asp Gly Gly Gly Ser Glu Arg Pro 
1               5                   10          


<210>  13
<211>  16
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  13

Leu Arg Gln Lys Asp Gly Gly Gly Ser Gly Gly Gly Ser Glu Arg Pro 
1               5                   10                  15      


<210>  14
<211>  7
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Cleavage sequence by TEV protease


<220>
<221>  misc_feature
<222>  (2)..(3)
<223>  Xaa is any amino acid

<220>
<221>  misc_feature
<222>  (5)..(5)
<223>  Xaa is any amino acid

<220>
<221>  MISC_FEATURE
<222>  (7)..(7)
<223>  Xaa = Gly or Ser

<400>  14

Glu Xaa Xaa Tyr Xaa Gln Xaa 
1               5           


<210>  15
<211>  7
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Cleavage sequence by TEV protease

<400>  15

Glu Asn Leu Tyr Phe Gln Gly 
1               5           


<210>  16
<211>  7
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Cleavage sequence by TEV protease

<400>  16

Glu Asn Leu Tyr Phe Gln Ser 
1               5           


<210>  17
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Consensus Kozak sequence

<400>  17
gccrccatgg                                                              10


