                         SEQUENCE LISTING

<110>  The Trustees of the University of Pennsylvania
       University of Southern California
       BRAVERMAN, Nancy
       ARGYRIOU, Catherine
 
<120>  GENE THERAPY FOR TREATING PEROXISOMAL DISORDERS

<130>  17000-001-PCT

<150>  US62513156
<151>  2017-05-31

<160>  13    

<170>  PatentIn version 3.5

<210>  1
<211>  3852
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized human PEX1

<400>  1
atgtggggaa gcgacagact ggccggagct ggagggggag gagcagccgt caccgtggcg       60

ttcactaacg cgcgggactg ctttctccat ctgccgcgga ggctggtcgc ccagctgcac      120

ctcctgcaga accaggccat cgaggtggtg tggtcccacc aaccggcctt tttgagctgg      180

gtcgagggaa ggcacttttc ggaccaggga gaaaatgtgg cggagatcaa ccgccaggtc      240

ggccagaagc tgggactgtc caacggcgga caggtgttcc tcaagccgtg cagccacgtg      300

gtgtcctgcc aacaggtgga agtggagccg ctctccgccg acgactggga gatcctcgaa      360

ttgcatgccg tgagcctcga acagcatctg ttggaccaga ttcgcattgt gttcccgaag      420

gccatattcc ccgtgtgggt cgatcagcag acctatatct tcatccagat tgtggccctc      480

atcccggccg cctcatacgg acggctggaa actgacacca agctgctgat tcaacctaag      540

acccggaggg ccaaagaaaa caccttctcc aaggccgacg ctgagtacaa gaagctccac      600

tcctacggac gggaccagaa ggggatgatg aaggagctgc aaaccaagca gctccagagc      660

aacaccgtgg ggatcaccga gtccaatgaa aacgagtcgg aaatcccagt cgattcatct      720

tccgtggcca gcctgtggac tatgatcggt tccattttct cgttccaatc tgagaagaag      780

caggaaacta gctgggggct gactgagatc aacgccttca agaacatgca gtccaaagtg      840

gtgcctctgg ataacatctt tcgcgtgtgc aagtcccaac cgccctcaat ctacaacgcg      900

tccgctacct ccgtgtttca taagcactgt gccatccacg tgttcccatg ggatcaggaa      960

tacttcgatg tcgaaccttc cttcaccgtg acttacggga agcttgtcaa gctcctcagc     1020

cccaagcagc agcaatcgaa aactaagcag aacgtgcttt ccccggagaa ggagaagcaa     1080

atgtcagaac cactcgacca gaagaaaatc agatcggatc ataacgaaga ggacgagaag     1140

gcctgcgtcc ttcaggtggt ctggaacggc ctggaggagc tgaacaacgc gattaagtac     1200

accaagaacg tcgaggtcct tcacctggga aaggtgtgga ttccggatga tctgaggaaa     1260

cgcctcaaca tcgaaatgca cgctgtggtg cggattaccc cggtcgaggt caccccaaag     1320

atccctcgct ccttgaagct gcagccgcga gaaaacttgc ccaaggacat ttctgaagag     1380

gatatcaaga ctgtgttcta ctcctggctg caacagagca ctaccaccat gctccctctg     1440

gtcatttcgg aggaagaatt catcaaactg gaaaccaagg acggactgaa agaattctcc     1500

ctgtccatcg tgcactcctg ggaaaaggag aaggacaaga atatcttcct gctgtccccc     1560

aatctgctgc aaaagaccac gatccaggtg ctgctcgacc ccatggtgaa ggaggaaaac     1620

tcagaagaga tcgacttcat cctgccgttc cttaagctga gttcactggg aggcgtgaac     1680

tcccttggcg tgtcctcgct ggagcacatc actcactcac tgctgggccg gcctctgagc     1740

agacagctta tgagcttggt cgccggactc agaaacggtg ccctcctgct caccggcggc     1800

aagggatcgg gaaagtccac cctcgctaag gccatttgca aagaggcatt cgataagctg     1860

gacgcccatg tggagcgggt ggactgtaag gccctccgcg gaaagcgatt ggaaaatatt     1920

caaaagactc tcgaagtcgc cttttccgaa gccgtctgga tgcagccctc ggtcgtcctg     1980

ctcgacgatc tggacctcat cgctgggctg ccggccgtgc cggagcatga acactcccct     2040

gacgcggtcc agtcgcaacg gctcgcccac gccctgaacg atatgattaa ggaattcatc     2100

tcaatgggat cactggtggc cctgatcgcg acttcccaga gccagcagtc cctgcaccct     2160

ctgctggtgt cggcccaggg cgtgcacatt tttcagtgtg tgcaacacat ccagccgccc     2220

aaccaggagc agcggtgcga aatcctgtgc aacgtgatta agaacaagct ggactgcgat     2280

atcaacaagt ttaccgacct tgatctccaa catgtggcta aggagactgg gggcttcgtg     2340

gctcgggact tcacagtgtt ggtggaccgg gcaattcact ccagactgtc ccgccagagc     2400

atttccaccc gcgaaaaact ggtcctgacc accctcgact tccagaaggc cctcagaggc     2460

ttccttcctg cgagcctcag atccgtcaac cttcacaagc cgcgggacct tggctgggac     2520

aagatcggtg ggctccacga ggtgcggcag atcctcatgg acaccattca gctgcctgca     2580

aagtaccccg agctgttcgc caacttgccg attcgccagc gcacgggaat cctgctctac     2640

ggccccccgg gcaccggaaa gaccctgctg gccggtgtga tcgcccggga atcgaggatg     2700

aacttcatct ccgtgaaggg acccgaactc ctgtccaagt acatcggtgc ctccgaacag     2760

gccgtgcgcg atatattcat tagggcccag gccgcgaagc cctgcattct gttcttcgac     2820

gagtttgaat cgatcgcgcc ccggaggggc cacgacaaca cgggagtgac cgaccgggtg     2880

gtgaaccagc tgctcaccca actggatggc gtggaaggcc ttcagggagt gtacgtgctg     2940

gcggctacct ccagaccgga cctgatcgat ccggccctgc tgcgccccgg gagactggac     3000

aagtgcgtgt attgccctcc ccctgaccag gtgtcaaggt tggaaatcct caacgtgctc     3060

tcggactccc tgccactggc agatgatgtg gacctccagc atgtggcctc cgtgactgac     3120

agcttcacag gagccgatct gaaggccctg ctttacaacg cccagttgga ggcgctgcac     3180

ggtatgctgc tgtcctccgg tctgcaggat ggctcctcct cttccgatag cgacctgtcg     3240

ctgagcagca tggtgttcct gaaccattcc agcggctccg atgacagcgc gggcgacgga     3300

gaatgtggac tggatcaatc cctggtgtcc ctggagatga gcgagattct gccagacgag     3360

tccaagttca acatgtacag gctgtacttc ggcagcagct acgagtccga gctgggaaat     3420

ggtacctcgt ccgacctgtc aagccagtgc ctgtccgcgc cttcctccat gacccaggac     3480

ctccctggag tgccagggaa ggatcagctg ttcagccagc ctcccgtgct gcgcactgcg     3540

agccaggaag ggtgccagga attgacccaa gagcagcggg accaactgcg cgcggacatt     3600

tcgatcatca aaggcagata ccgctcccaa tccggggagg acgaaagcat gaaccagccc     3660

gggcctatca agactagact ggcaatctcc caaagccacc tgatgaccgc actgggacac     3720

acccggccct cgatctcgga ggacgactgg aagaacttcg ctgagctgta cgaatccttc     3780

cagaatccga agcggagaaa gaaccagagc ggaactatgt tccggcccgg acagaaggtg     3840

accctggcct ga                                                         3852


<210>  2
<211>  4390
<212>  DNA
<213>  Homo sapiens

<400>  2
cgatcgatct cctccggctc cgacgtcctc ggcctgccgg gtcccgggtc ctttgcggcg       60

ctagggtggg cgaacccaga gcgacgctcc gggacgatgt ggggcagcga tcgcctggcg      120

ggtgctgggg gaggcggggc ggcagtgact gtggccttca ccaacgctcg cgactgcttc      180

ctccacctgc cgcggcgtct cgtggcccag ctgcatctgc tgcagaatca agctatagaa      240

gtggtctgga gtcaccagcc tgcattcttg agctgggtgg aaggcaggca ttttagtgat      300

caaggtgaaa atgtggctga aattaacaga caagttggtc aaaaacttgg actctcaaat      360

gggggacagg tatttctcaa gccatgttcc catgtggtat cttgtcaaca agttgaggtg      420

gaacccctct cagcagatga ttgggagata ctggagctgc atgctgtttc ccttgaacaa      480

catcttctag atcaaattcg aatagttttt ccaaaagcca tttttcctgt ttgggttgat      540

caacaaacgt acatatttat ccaaattgtt gcactaatac cagctgcctc ttatggaagg      600

ctggaaactg acaccaaact ccttattcag ccaaagacac gccgagccaa agagaataca      660

ttttcaaaag ctgatgctga atataaaaaa cttcatagtt atggaagaga ccagaaagga      720

atgatgaaag aacttcaaac caagcaactt cagtcaaata ctgtgggaat cactgaatct      780

aatgaaaacg agtcagagat tccagttgac tcatcatcag tagcaagttt atggactatg      840

ataggaagca ttttttcctt tcaatctgag aagaaacaag agacatcttg gggtttaact      900

gaaatcaatg cattcaaaaa tatgcagtca aaggttgttc ctctagacaa tattttcaga      960

gtatgcaaat ctcaacctcc tagtatatat aacgcgtcag caacctctgt ttttcataaa     1020

cactgtgcca ttcatgtatt tccatgggac caggaatatt ttgatgtaga gcccagcttt     1080

actgtgacat atggaaagct agttaagcta ctttctccaa agcaacagca aagtaaaaca     1140

aaacaaaatg tgttatcacc tgaaaaagag aagcagatgt cagagccact agatcaaaaa     1200

aaaattaggt cagatcataa tgaagaagat gagaaggcct gtgtgctaca agtagtctgg     1260

aatggacttg aagaattgaa caatgccatc aaatatacca aaaatgtaga agttctccat     1320

cttgggaaag tctggattcc agatgacctg aggaagagac taaatataga aatgcatgcc     1380

gtagtcagga taactccagt ggaagttacc cctaaaattc caagatctct aaagttacaa     1440

cctagagaga atttacctaa agacataagt gaagaagaca taaaaactgt attttattca     1500

tggctacagc agtctactac caccatgctt cctttggtaa tatcagagga agaatttatt     1560

aagctggaaa ctaaagatgg actgaaggaa ttttctctga gtatagttca ttcttgggaa     1620

aaagaaaaag ataaaaatat ttttctgttg agtcccaatt tgctgcagaa gactacaata     1680

caagtccttc tagatcctat ggtaaaagaa gaaaacagtg aggaaattga ctttattctt     1740

ccttttttaa agctgagctc tttgggagga gtgaattcct taggcgtatc ctccttggag     1800

cacatcactc acagcctcct gggacgccct ttgtctcggc agctgatgtc tcttgttgca     1860

ggacttagga atggagctct tttactcaca ggaggaaagg gaagtggaaa atcaacttta     1920

gccaaagcaa tctgtaaaga agcatttgac aaactggatg cccatgtgga gagagttgac     1980

tgtaaagctt tacgaggaaa aaggcttgaa aacatacaaa aaaccctaga ggtggctttc     2040

tcagaggcag tgtggatgca gccatctgtt gtcctgctgg atgaccttga cctcattgct     2100

ggactgcctg ctgtcccgga acatgagcac agtcctgatg cggtgcagag ccagcggctt     2160

gctcatgctt tgaatgatat gataaaagag tttatctcca tgggaagttt ggttgcactg     2220

attgccacaa gtcagtctca gcaatctcta catcctttac ttgtttctgc tcaaggagtt     2280

cacatatttc agtgcgtcca acacattcag cctcctaatc aggaacaaag atgtgaaatt     2340

ctgtgtaatg taataaaaaa taaattggac tgtgatataa acaagttcac cgatcttgac     2400

ctgcagcatg tagctaaaga aactggcggg tttgtggcta gagattttac agtacttgtg     2460

gatcgagcca tacattctcg actctctcgt cagagtatat ccaccagaga aaaattagtt     2520

ttaacaacat tggacttcca aaaggctctc cgcggatttc ttcctgcgtc tttgcgaagt     2580

gtcaacctgc ataaacctag agacctgggt tgggacaaga ttggtgggtt acatgaagtt     2640

aggcagatac tcatggatac tatccagtta cctgccaagt atccagaatt atttgcaaac     2700

ttgcccatac gacaaagaac aggaatactg ttgtatggtc cgcctggaac aggaaaaacc     2760

ttactagctg gggtaattgc acgagagagt agaatgaatt ttataagtgt caaggggcca     2820

gagttactca gcaaatacat tggagcaagt gaacaagctg ttcgggatat ttttattaga     2880

gcacaggctg caaagccctg cattcttttc tttgatgaat ttgaatccat tgctcctcgg     2940

cggggtcatg ataatacagg agttacagac cgagtagtta accagttgct gactcagttg     3000

gatggagtag aaggcttaca gggtgtttat gtattggctg ctactagtcg ccctgacttg     3060

attgaccctg ccctgcttag gcctggtcga ctagataaat gtgtatactg tcctcctcct     3120

gatcaggtgt cacgtcttga aattttaaat gtcctcagtg actctctacc tctggcagat     3180

gatgttgacc ttcagcatgt agcatcagta actgactcct ttactggagc tgatctgaaa     3240

gctttacttt acaatgccca attggaggcc ttacatggaa tgctgctctc gagtggactc     3300

caggatggaa gttccagctc tgatagtgac ctaagtctgt cttcaatggt ctttcttaac     3360

catagcagtg gctctgacga ttcagctgga gatggagaat gtggcttaga tcagtccctt     3420

gtttctttag agatgtccga gatccttcca gatgaatcaa aattcaatat gtaccggctc     3480

tactttggaa gctcttatga atcagaactt ggaaatggaa cctcttctga tttgagctca     3540

caatgtctct ctgcaccaag ctccatgact caggatttgc ctggagttcc tgggaaagac     3600

cagttgtttt cacagcctcc agtgttaagg acagcttcac aagagggttg ccaagaactt     3660

acacaagaac aaagagatca actgagggca gatatcagta ttatcaaagg cagataccgg     3720

agccaaagtg gagaggacga atccatgaac caaccaggac caatcaaaac cagactggct     3780

attagtcagt cacatttaat gactgcactt ggtcacacaa gaccatccat tagtgaagat     3840

gactggaaga attttgctga gctatatgaa agctttcaaa atccaaagag gagaaaaaat     3900

caaagtggaa caatgtttcg acctggacag aaagtaactt tagcataaaa tatacttctt     3960

tttgatttgg ttctgttaag ttttttgatg gcttttccat atgttgtaac aggaaaaaaa     4020

tggtgtctat gaatttcttc ttaatttaac aaatttggtt aatttataaa atcacagatt     4080

ggtaaatgct ataattatgt aatgatcagg attgagatta atactgtagt ataaattggg     4140

acattataac agattccata ttttatttcc taaaatctaa attcagtctt taatgaaata     4200

atattagcca aatggtggaa ctaatttatt tcttttgagg aaaagataat aaagaatgta     4260

attaaattta aatttcttgg aattcccagt tgtatattca tcacctttgt agcatttgac     4320

aaattttatg cttagcagct tcttcactgt tttgaaataa aatatcctat tacctactga     4380

taaaaaaaaa                                                            4390


<210>  3
<211>  4221
<212>  DNA
<213>  Homo sapiens

<400>  3
cgatcgatct cctccggctc cgacgtcctc ggcctgccgg gtcccgggtc ctttgcggcg       60

ctagggtggg cgaacccaga gcgacgctcc gggacgatgt ggggcagcga tcgcctggcg      120

ggtgctgggg gaggcggggc ggcagtgact gtggccttca ccaacgctcg cgactgcttc      180

ctccacctgc cgcggcgtct cgtggcccag ctgcatctgc tgcagaatca agctatagaa      240

gtggtctgga gtcaccagcc tgcattcttg agctgggtgg aaggcaggca ttttagtgat      300

caaggtgaaa atgtggctga aattaacaga caagttggtc aaaaacttgg actctcaaat      360

gggggacagg tatttctcaa gccatgttcc catgtggtat cttgtcaaca agttgaggtg      420

gaacccctct cagcagatga ttgggagata ctggagctgc atgctgtttc ccttgaacaa      480

catcttctag atcaaattcg aatagttttt ccaaaagcca tttttcctgt ttgggttgat      540

caacaaacgt acatatttat ccaaattgtt gcactaatac cagctgcctc ttatggaagg      600

ctggaaactg acaccaaact ccttattcag ccaaagacac gccgagccaa agagaataca      660

ttttcaaaag ctgatgctga atataaaaaa cttcatagtt atggaagaga ccagaaagga      720

atgatgaaag aacttcaaac caagcaactt cagtcaaata ctgtgggaat cactgaatct      780

aatgaaaacg agtcagagat tccagttgac tcatcatcag tagcaagttt atggactatg      840

ataggaagca ttttttcctt tcaatctgag aagaaacaag agacatcttg gggtttaact      900

gaaatcaatg cattcaaaaa tatgcagtca aaggttgttc ctctagacaa tattttcaga      960

gtatgcaaat ctcaacctcc tagtatatat aacgcgtcag caacctctgt ttttcataaa     1020

cactgtgcca ttcatgtatt tccatgggac caggaatatt ttgatgtaga gcccagcttt     1080

actgtgacat atggaaagct agttaagcta ctttctccaa agcaacagca aagtaaaaca     1140

aaacaaaatg tgttatcacc tgaaaaagag aagcagatgt cagagccact agatcaaaaa     1200

aaaattaggt cagatcataa tgaagaagat gagaaggcct gtgtgctaca agtagtctgg     1260

aatggacttg aagaattgaa caatgccatc aaatatacca aaaatgtaga agttctccat     1320

cttgggaaag tctggattcc agatgacctg aggaagagac taaatataga aatgcatgcc     1380

gtagtcagga taactccagt ggaagttacc cctaaaattc caagatctct aaagttacaa     1440

cctagagaga atttacctaa agacataagt gaagaagaca taaaaactgt attttattca     1500

tggctacagc agtctactac caccatgctt cctttggtaa tatcagagga agaatttatt     1560

aagctggaaa ctaaagatgg actgaaggaa ttttctctga gtatagttca ttcttgggaa     1620

aaagaaaaag ataaaaatat ttttctgttg agtcccaatt tgctgcagaa gactacaata     1680

caagtccttc tagatcctat ggtaaaagaa gaaaacagtg aggaaattga ctttattctt     1740

ccttttttaa agctgagctc tttgggagga gtgaattcct taggcgtatc ctccttggag     1800

cacatcactc acagcctcct gggacgccct ttgtctcggc agctgatgtc tcttgttgca     1860

ggacttagga atggagctct tttactcaca ggaggaaagg gaagtggaaa atcaacttta     1920

gccaaagcaa tctgtaaaga agcatttgac aaactggatg cccatgtgga gagagttgac     1980

tgtaaagctt tacgagcttt gaatgatatg ataaaagagt ttatctccat gggaagtttg     2040

gttgcactga ttgccacaag tcagtctcag caatctctac atcctttact tgtttctgct     2100

caaggagttc acatatttca gtgcgtccaa cacattcagc ctcctaatca ggaacaaaga     2160

tgtgaaattc tgtgtaatgt aataaaaaat aaattggact gtgatataaa caagttcacc     2220

gatcttgacc tgcagcatgt agctaaagaa actggcgggt ttgtggctag agattttaca     2280

gtacttgtgg atcgagccat acattctcga ctctctcgtc agagtatatc caccagagaa     2340

aaattagttt taacaacatt ggacttccaa aaggctctcc gcggatttct tcctgcgtct     2400

ttgcgaagtg tcaacctgca taaacctaga gacctgggtt gggacaagat tggtgggtta     2460

catgaagtta ggcagatact catggatact atccagttac ctgccaagta tccagaatta     2520

tttgcaaact tgcccatacg acaaagaaca ggaatactgt tgtatggtcc gcctggaaca     2580

ggaaaaacct tactagctgg ggtaattgca cgagagagta gaatgaattt tataagtgtc     2640

aaggggccag agttactcag caaatacatt ggagcaagtg aacaagctgt tcgggatatt     2700

tttattagag cacaggctgc aaagccctgc attcttttct ttgatgaatt tgaatccatt     2760

gctcctcggc ggggtcatga taatacagga gttacagacc gagtagttaa ccagttgctg     2820

actcagttgg atggagtaga aggcttacag ggtgtttatg tattggctgc tactagtcgc     2880

cctgacttga ttgaccctgc cctgcttagg cctggtcgac tagataaatg tgtatactgt     2940

cctcctcctg atcaggtgtc acgtcttgaa attttaaatg tcctcagtga ctctctacct     3000

ctggcagatg atgttgacct tcagcatgta gcatcagtaa ctgactcctt tactggagct     3060

gatctgaaag ctttacttta caatgcccaa ttggaggcct tacatggaat gctgctctcg     3120

agtggactcc aggatggaag ttccagctct gatagtgacc taagtctgtc ttcaatggtc     3180

tttcttaacc atagcagtgg ctctgacgat tcagctggag atggagaatg tggcttagat     3240

cagtcccttg tttctttaga gatgtccgag atccttccag atgaatcaaa attcaatatg     3300

taccggctct actttggaag ctcttatgaa tcagaacttg gaaatggaac ctcttctgat     3360

ttgagctcac aatgtctctc tgcaccaagc tccatgactc aggatttgcc tggagttcct     3420

gggaaagacc agttgttttc acagcctcca gtgttaagga cagcttcaca agagggttgc     3480

caagaactta cacaagaaca aagagatcaa ctgagggcag atatcagtat tatcaaaggc     3540

agataccgga gccaaagtgg agaggacgaa tccatgaacc aaccaggacc aatcaaaacc     3600

agactggcta ttagtcagtc acatttaatg actgcacttg gtcacacaag accatccatt     3660

agtgaagatg actggaagaa ttttgctgag ctatatgaaa gctttcaaaa tccaaagagg     3720

agaaaaaatc aaagtggaac aatgtttcga cctggacaga aagtaacttt agcataaaat     3780

atacttcttt ttgatttggt tctgttaagt tttttgatgg cttttccata tgttgtaaca     3840

ggaaaaaaat ggtgtctatg aatttcttct taatttaaca aatttggtta atttataaaa     3900

tcacagattg gtaaatgcta taattatgta atgatcagga ttgagattaa tactgtagta     3960

taaattggga cattataaca gattccatat tttatttcct aaaatctaaa ttcagtcttt     4020

aatgaaataa tattagccaa atggtggaac taatttattt cttttgagga aaagataata     4080

aagaatgtaa ttaaatttaa atttcttgga attcccagtt gtatattcat cacctttgta     4140

gcatttgaca aattttatgc ttagcagctt cttcactgtt ttgaaataaa atatcctatt     4200

acctactgat aaaaaaaaaa a                                               4221


<210>  4
<211>  4427
<212>  DNA
<213>  Homo sapiens

<400>  4
cgatcgatct cctccggctc cgacgtcctc ggcctgccgg gtcccgggtc ctttgcggcg       60

ctagggtggg cgaacccaga gcgacgctcc gggacgatgt ggggcagcga tcgcctggcg      120

ggtgctgggg gaggcggggc ggcagtgact gtggccttca ccaacgctcg cgactgcttc      180

ctccacctgc cgcggcgtct cgtggcccag ctgcatctgc tgcagaatca agctatagaa      240

gtggtctgga gtcaccagcc tgcattcttg agctgggtgg aaggcaggca ttttagtgat      300

caaggtgaaa atgtggctga aattaacaga caagttggtc aaaaacttgg actctcaaat      360

gggggacagg tatttctcaa gccatgttcc catgtggtat cttgtcaaca agttgaggtg      420

gaacccctct cagcagatga ttgggagata ctggtaaaga aaaccaaata agaactatct      480

catttaagga gctgcatgct gtttcccttg aacaacatct tctagatcaa attcgaatag      540

tttttccaaa agccattttt cctgtttggg ttgatcaaca aacgtacata tttatccaaa      600

ttgttgcact aataccagct gcctcttatg gaaggctgga aactgacacc aaactcctta      660

ttcagccaaa gacacgccga gccaaagaga atacattttc aaaagctgat gctgaatata      720

aaaaacttca tagttatgga agagaccaga aaggaatgat gaaagaactt caaaccaagc      780

aacttcagtc aaatactgtg ggaatcactg aatctaatga aaacgagtca gagattccag      840

ttgactcatc atcagtagca agtttatgga ctatgatagg aagcattttt tcctttcaat      900

ctgagaagaa acaagagaca tcttggggtt taactgaaat caatgcattc aaaaatatgc      960

agtcaaaggt tgttcctcta gacaatattt tcagagtatg caaatctcaa cctcctagta     1020

tatataacgc gtcagcaacc tctgtttttc ataaacactg tgccattcat gtatttccat     1080

gggaccagga atattttgat gtagagccca gctttactgt gacatatgga aagctagtta     1140

agctactttc tccaaagcaa cagcaaagta aaacaaaaca aaatgtgtta tcacctgaaa     1200

aagagaagca gatgtcagag ccactagatc aaaaaaaaat taggtcagat cataatgaag     1260

aagatgagaa ggcctgtgtg ctacaagtag tctggaatgg acttgaagaa ttgaacaatg     1320

ccatcaaata taccaaaaat gtagaagttc tccatcttgg gaaagtctgg attccagatg     1380

acctgaggaa gagactaaat atagaaatgc atgccgtagt caggataact ccagtggaag     1440

ttacccctaa aattccaaga tctctaaagt tacaacctag agagaattta cctaaagaca     1500

taagtgaaga agacataaaa actgtatttt attcatggct acagcagtct actaccacca     1560

tgcttccttt ggtaatatca gaggaagaat ttattaagct ggaaactaaa gatggactga     1620

aggaattttc tctgagtata gttcattctt gggaaaaaga aaaagataaa aatatttttc     1680

tgttgagtcc caatttgctg cagaagacta caatacaagt ccttctagat cctatggtaa     1740

aagaagaaaa cagtgaggaa attgacttta ttcttccttt tttaaagctg agctctttgg     1800

gaggagtgaa ttccttaggc gtatcctcct tggagcacat cactcacagc ctcctgggac     1860

gccctttgtc tcggcagctg atgtctcttg ttgcaggact taggaatgga gctcttttac     1920

tcacaggagg aaagggaagt ggaaaatcaa ctttagccaa agcaatctgt aaagaagcat     1980

ttgacaaact ggatgcccat gtggagagag ttgactgtaa agctttacga ggaaaaaggc     2040

ttgaaaacat acaaaaaacc ctagaggtgg ctttctcaga ggcagtgtgg atgcagccat     2100

ctgttgtcct gctggatgac cttgacctca ttgctggact gcctgctgtc ccggaacatg     2160

agcacagtcc tgatgcggtg cagagccagc ggcttgctca tgctttgaat gatatgataa     2220

aagagtttat ctccatggga agtttggttg cactgattgc cacaagtcag tctcagcaat     2280

ctctacatcc tttacttgtt tctgctcaag gagttcacat atttcagtgc gtccaacaca     2340

ttcagcctcc taatcaggaa caaagatgtg aaattctgtg taatgtaata aaaaataaat     2400

tggactgtga tataaacaag ttcaccgatc ttgacctgca gcatgtagct aaagaaactg     2460

gcgggtttgt ggctagagat tttacagtac ttgtggatcg agccatacat tctcgactct     2520

ctcgtcagag tatatccacc agagaaaaat tagttttaac aacattggac ttccaaaagg     2580

ctctccgcgg atttcttcct gcgtctttgc gaagtgtcaa cctgcataaa cctagagacc     2640

tgggttggga caagattggt gggttacatg aagttaggca gatactcatg gatactatcc     2700

agttacctgc caagtatcca gaattatttg caaacttgcc catacgacaa agaacaggaa     2760

tactgttgta tggtccgcct ggaacaggaa aaaccttact agctggggta attgcacgag     2820

agagtagaat gaattttata agtgtcaagg ggccagagtt actcagcaaa tacattggag     2880

caagtgaaca agctgttcgg gatattttta ttagagcaca ggctgcaaag ccctgcattc     2940

ttttctttga tgaatttgaa tccattgctc ctcggcgggg tcatgataat acaggagtta     3000

cagaccgagt agttaaccag ttgctgactc agttggatgg agtagaaggc ttacagggtg     3060

tttatgtatt ggctgctact agtcgccctg acttgattga ccctgccctg cttaggcctg     3120

gtcgactaga taaatgtgta tactgtcctc ctcctgatca ggtgtcacgt cttgaaattt     3180

taaatgtcct cagtgactct ctacctctgg cagatgatgt tgaccttcag catgtagcat     3240

cagtaactga ctcctttact ggagctgatc tgaaagcttt actttacaat gcccaattgg     3300

aggccttaca tggaatgctg ctctcgagtg gactccagga tggaagttcc agctctgata     3360

gtgacctaag tctgtcttca atggtctttc ttaaccatag cagtggctct gacgattcag     3420

ctggagatgg agaatgtggc ttagatcagt cccttgtttc tttagagatg tccgagatcc     3480

ttccagatga atcaaaattc aatatgtacc ggctctactt tggaagctct tatgaatcag     3540

aacttggaaa tggaacctct tctgatttga gctcacaatg tctctctgca ccaagctcca     3600

tgactcagga tttgcctgga gttcctggga aagaccagtt gttttcacag cctccagtgt     3660

taaggacagc ttcacaagag ggttgccaag aacttacaca agaacaaaga gatcaactga     3720

gggcagatat cagtattatc aaaggcagat accggagcca aagtggagag gacgaatcca     3780

tgaaccaacc aggaccaatc aaaaccagac tggctattag tcagtcacat ttaatgactg     3840

cacttggtca cacaagacca tccattagtg aagatgactg gaagaatttt gctgagctat     3900

atgaaagctt tcaaaatcca aagaggagaa aaaatcaaag tggaacaatg tttcgacctg     3960

gacagaaagt aactttagca taaaatatac ttctttttga tttggttctg ttaagttttt     4020

tgatggcttt tccatatgtt gtaacaggaa aaaaatggtg tctatgaatt tcttcttaat     4080

ttaacaaatt tggttaattt ataaaatcac agattggtaa atgctataat tatgtaatga     4140

tcaggattga gattaatact gtagtataaa ttgggacatt ataacagatt ccatatttta     4200

tttcctaaaa tctaaattca gtctttaatg aaataatatt agccaaatgg tggaactaat     4260

ttatttcttt tgaggaaaag ataataaaga atgtaattaa atttaaattt cttggaattc     4320

ccagttgtat attcatcacc tttgtagcat ttgacaaatt ttatgcttag cagcttcttc     4380

actgttttga aataaaatat cctattacct actgataaaa aaaaaaa                   4427


<210>  5
<211>  3192
<212>  DNA
<213>  Homo sapiens

<400>  5
cagcaacctc tgtttttcat aaacactgtg ccattcatgt atttccatgg gaccaggaat       60

attttgatgt agagcccagc tttactgtga catatggaaa gctagttaag ctactttctc      120

caaagcaaca gcaaagtaaa acaaaacaaa atgtgttatc acctgaaaaa gagaagcaga      180

tgtcagagcc actagatcaa aaaaaaatta ggtcagatca taatgaagaa gatgagaagg      240

cctgtgtgct acaagtagtc tggaatggac ttgaagaatt gaacaatgcc atcaaatata      300

ccaaaaatgt agaagttctc catcttggga aagtctggat tccagatgac ctgaggaaga      360

gactaaatat agaaatgcat gccgtagtca ggataactcc agtggaagtt acccctaaaa      420

ttccaagatc tctaaagtta caacctagag agaatttacc taaagacata agtgaagaag      480

acataaaaac tgtattttat tcatggctac agcagtctac taccaccatg cttcctttgg      540

taatatcaga ggaagaattt attaagctgg aaactaaaga tggactgaag gaattttctc      600

tgagtatagt tcattcttgg gaaaaagaaa aagataaaaa tatttttctg ttgagtccca      660

atttgctgca gaagactaca atacaaagga gtgaattcct taggcgtatc ctccttggag      720

cacatcactc acagcctcct gggacgccct ttgtctcggc agctgatgtc tcttgttgca      780

ggacttagga atggagctct tttactcaca ggaggaaagg gaagtggaaa atcaacttta      840

gccaaagcaa tctgtaaaga agcatttgac aaactggatg cccatgtgga gagagttgac      900

tgtaaagctt tacgaggaaa aaggcttgaa aacatacaaa aaaccctaga ggtggctttc      960

tcagaggcag tgtggatgca gccatctgtt gtcctgctgg atgaccttga cctcattgct     1020

ggactgcctg ctgtcccgga acatgagcac agtcctgatg cggtgcagag ccagcggctt     1080

gctcatgctt tgaatgatat gataaaagag tttatctcca tgggaagttt ggttgcactg     1140

attgccacaa gtcagtctca gcaatctcta catcctttac ttgtttctgc tcaaggagtt     1200

cacatatttc agtgcgtcca acacattcag cctcctaatc aggaacaaag atgtgaaatt     1260

ctgtgtaatg taataaaaaa taaattggac tgtgatataa acaagttcac cgatcttgac     1320

ctgcagcatg tagctaaaga aactggcggg tttgtggcta gagattttac agtacttgtg     1380

gatcgagcca tacattctcg actctctcgt cagagtatat ccaccagaga aaaattagtt     1440

ttaacaacat tggacttcca aaaggctctc cgcggatttc ttcctgcgtc tttgcgaagt     1500

gtcaacctgc ataaacctag agacctgggt tgggacaaga ttggtgggtt acatgaagtt     1560

aggcagatac tcatggatac tatccagtta cctgccaagt atccagaatt atttgcaaac     1620

ttgcccatac gacaaagaac aggaatactg ttgtatggtc cgcctggaac aggaaaaacc     1680

ttactagctg gggtaattgc acgagagagt agaatgaatt ttataagtgt caaggggcca     1740

gagttactca gcaaatacat tggagcaagt gaacaagctg ttcgggatat ttttattaga     1800

gcacaggctg caaagccctg cattcttttc tttgatgaat ttgaatccat tgctcctcgg     1860

cggggtcatg ataatacagg agttacagac cgagtagtta accagttgct gactcagttg     1920

gatggagtag aaggcttaca gggtgtttat gtattggctg ctactagtcg ccctgacttg     1980

attgaccctg ccctgcttag gcctggtcga ctagataaat gtgtatactg tcctcctcct     2040

gatcaggtgt cacgtcttga aattttaaat gtcctcagtg actctctacc tctggcagat     2100

gatgttgacc ttcagcatgt agcatcagta actgactcct ttactggagc tgatctgaaa     2160

gctttacttt acaatgccca attggaggcc ttacatggaa tgctgctctc gagtggactc     2220

caggatggaa gttccagctc tgatagtgac ctaagtctgt cttcaatggt ctttcttaac     2280

catagcagtg gctctgacga ttcagctgga gatggagaat gtggcttaga tcagtccctt     2340

gtttctttag agatgtccga gatccttcca gatgaatcaa aattcaatat gtaccggctc     2400

tactttggaa gctcttatga atcagaactt ggaaatggaa cctcttctga tttgagctca     2460

caatgtctct ctgcaccaag ctccatgact caggatttgc ctggagttcc tgggaaagac     2520

cagttgtttt cacagcctcc agtgttaagg acagcttcac aagagggttg ccaagaactt     2580

acacaagaac aaagagatca actgagggca gatatcagta ttatcaaagg cagataccgg     2640

agccaaagtg gagaggacga atccatgaac caaccaggac caatcaaaac cagactggct     2700

attagtcagt cacatttaat gactgcactt ggtcacacaa gaccatccat tagtgaagat     2760

gactggaaga attttgctga gctatatgaa agctttcaaa atccaaagag gagaaaaaat     2820

caaagtggaa caatgtttcg acctggacag aaagtaactt tagcataaaa tatacttctt     2880

tttgatttgg ttctgttaag ttttttgatg gcttttccat atgttgtaac aggaaaaaaa     2940

tggtgtctat gaatttcttc ttaatttaac aaatttggtt aatttataaa atcacagatt     3000

ggtaaatgct ataattatgt aatgatcagg attgagatta atactgtagt ataaattggg     3060

acattataac agattccata ttttatttcc taaaatctaa attcagtctt taatgaaata     3120

atattagcca aatggtggaa ctaatttatt tcttttgagg aaaagataat aaagaatgta     3180

attaaattta aa                                                         3192


<210>  6
<211>  4983
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CMV.hPEX1


<220>
<221>  repeat_region
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  misc_feature
<222>  (113)..(130)
<223>  ITR D Segment

<220>
<221>  enhancer
<222>  (181)..(484)
<223>  human cytomegalovirus (CMV) immediate early enhancer

<220>
<221>  promoter
<222>  (485)..(688)
<223>  human cytomegalovirus (CMV) immediate early promoter

<220>
<221>  misc_feature
<222>  (706)..(714)
<223>  Kozak

<220>
<221>  CDS
<222>  (715)..(4566)
<223>  codon-optimized human PEX1

<220>
<221>  polyA_signal
<222>  (4597)..(4804)
<223>  bovine growth hormone polyadenylation signal (bGH poly(A) signal)

<220>
<221>  repeat_region
<222>  (4854)..(4983)
<223>  3' ITR

<220>
<221>  repeat_region
<222>  (4854)..(4871)
<223>  ITR D Segment

<400>  6
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      240

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      300

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      360

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      420

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      480

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      540

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      600

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      660

acggtgggag gtctatataa gcagagcttg tacactagcg gccgcgccgc cacc atg        717
                                                            Met           
                                                            1             

tgg gga agc gac aga ctg gcc gga gct gga ggg gga gga gca gcc gtc        765
Trp Gly Ser Asp Arg Leu Ala Gly Ala Gly Gly Gly Gly Ala Ala Val           
            5                   10                  15                    

acc gtg gcg ttc act aac gcg cgg gac tgc ttt ctc cat ctg ccg cgg        813
Thr Val Ala Phe Thr Asn Ala Arg Asp Cys Phe Leu His Leu Pro Arg           
        20                  25                  30                        

agg ctg gtc gcc cag ctg cac ctc ctg cag aac cag gcc atc gag gtg        861
Arg Leu Val Ala Gln Leu His Leu Leu Gln Asn Gln Ala Ile Glu Val           
    35                  40                  45                            

gtg tgg tcc cac caa ccg gcc ttt ttg agc tgg gtc gag gga agg cac        909
Val Trp Ser His Gln Pro Ala Phe Leu Ser Trp Val Glu Gly Arg His           
50                  55                  60                  65            

ttt tcg gac cag gga gaa aat gtg gcg gag atc aac cgc cag gtc ggc        957
Phe Ser Asp Gln Gly Glu Asn Val Ala Glu Ile Asn Arg Gln Val Gly           
                70                  75                  80                

cag aag ctg gga ctg tcc aac ggc gga cag gtg ttc ctc aag ccg tgc       1005
Gln Lys Leu Gly Leu Ser Asn Gly Gly Gln Val Phe Leu Lys Pro Cys           
            85                  90                  95                    

agc cac gtg gtg tcc tgc caa cag gtg gaa gtg gag ccg ctc tcc gcc       1053
Ser His Val Val Ser Cys Gln Gln Val Glu Val Glu Pro Leu Ser Ala           
        100                 105                 110                       

gac gac tgg gag atc ctc gaa ttg cat gcc gtg agc ctc gaa cag cat       1101
Asp Asp Trp Glu Ile Leu Glu Leu His Ala Val Ser Leu Glu Gln His           
    115                 120                 125                           

ctg ttg gac cag att cgc att gtg ttc ccg aag gcc ata ttc ccc gtg       1149
Leu Leu Asp Gln Ile Arg Ile Val Phe Pro Lys Ala Ile Phe Pro Val           
130                 135                 140                 145           

tgg gtc gat cag cag acc tat atc ttc atc cag att gtg gcc ctc atc       1197
Trp Val Asp Gln Gln Thr Tyr Ile Phe Ile Gln Ile Val Ala Leu Ile           
                150                 155                 160               

ccg gcc gcc tca tac gga cgg ctg gaa act gac acc aag ctg ctg att       1245
Pro Ala Ala Ser Tyr Gly Arg Leu Glu Thr Asp Thr Lys Leu Leu Ile           
            165                 170                 175                   

caa cct aag acc cgg agg gcc aaa gaa aac acc ttc tcc aag gcc gac       1293
Gln Pro Lys Thr Arg Arg Ala Lys Glu Asn Thr Phe Ser Lys Ala Asp           
        180                 185                 190                       

gct gag tac aag aag ctc cac tcc tac gga cgg gac cag aag ggg atg       1341
Ala Glu Tyr Lys Lys Leu His Ser Tyr Gly Arg Asp Gln Lys Gly Met           
    195                 200                 205                           

atg aag gag ctg caa acc aag cag ctc cag agc aac acc gtg ggg atc       1389
Met Lys Glu Leu Gln Thr Lys Gln Leu Gln Ser Asn Thr Val Gly Ile           
210                 215                 220                 225           

acc gag tcc aat gaa aac gag tcg gaa atc cca gtc gat tca tct tcc       1437
Thr Glu Ser Asn Glu Asn Glu Ser Glu Ile Pro Val Asp Ser Ser Ser           
                230                 235                 240               

gtg gcc agc ctg tgg act atg atc ggt tcc att ttc tcg ttc caa tct       1485
Val Ala Ser Leu Trp Thr Met Ile Gly Ser Ile Phe Ser Phe Gln Ser           
            245                 250                 255                   

gag aag aag cag gaa act agc tgg ggg ctg act gag atc aac gcc ttc       1533
Glu Lys Lys Gln Glu Thr Ser Trp Gly Leu Thr Glu Ile Asn Ala Phe           
        260                 265                 270                       

aag aac atg cag tcc aaa gtg gtg cct ctg gat aac atc ttt cgc gtg       1581
Lys Asn Met Gln Ser Lys Val Val Pro Leu Asp Asn Ile Phe Arg Val           
    275                 280                 285                           

tgc aag tcc caa ccg ccc tca atc tac aac gcg tcc gct acc tcc gtg       1629
Cys Lys Ser Gln Pro Pro Ser Ile Tyr Asn Ala Ser Ala Thr Ser Val           
290                 295                 300                 305           

ttt cat aag cac tgt gcc atc cac gtg ttc cca tgg gat cag gaa tac       1677
Phe His Lys His Cys Ala Ile His Val Phe Pro Trp Asp Gln Glu Tyr           
                310                 315                 320               

ttc gat gtc gaa cct tcc ttc acc gtg act tac ggg aag ctt gtc aag       1725
Phe Asp Val Glu Pro Ser Phe Thr Val Thr Tyr Gly Lys Leu Val Lys           
            325                 330                 335                   

ctc ctc agc ccc aag cag cag caa tcg aaa act aag cag aac gtg ctt       1773
Leu Leu Ser Pro Lys Gln Gln Gln Ser Lys Thr Lys Gln Asn Val Leu           
        340                 345                 350                       

tcc ccg gag aag gag aag caa atg tca gaa cca ctc gac cag aag aaa       1821
Ser Pro Glu Lys Glu Lys Gln Met Ser Glu Pro Leu Asp Gln Lys Lys           
    355                 360                 365                           

atc aga tcg gat cat aac gaa gag gac gag aag gcc tgc gtc ctt cag       1869
Ile Arg Ser Asp His Asn Glu Glu Asp Glu Lys Ala Cys Val Leu Gln           
370                 375                 380                 385           

gtg gtc tgg aac ggc ctg gag gag ctg aac aac gcg att aag tac acc       1917
Val Val Trp Asn Gly Leu Glu Glu Leu Asn Asn Ala Ile Lys Tyr Thr           
                390                 395                 400               

aag aac gtc gag gtc ctt cac ctg gga aag gtg tgg att ccg gat gat       1965
Lys Asn Val Glu Val Leu His Leu Gly Lys Val Trp Ile Pro Asp Asp           
            405                 410                 415                   

ctg agg aaa cgc ctc aac atc gaa atg cac gct gtg gtg cgg att acc       2013
Leu Arg Lys Arg Leu Asn Ile Glu Met His Ala Val Val Arg Ile Thr           
        420                 425                 430                       

ccg gtc gag gtc acc cca aag atc cct cgc tcc ttg aag ctg cag ccg       2061
Pro Val Glu Val Thr Pro Lys Ile Pro Arg Ser Leu Lys Leu Gln Pro           
    435                 440                 445                           

cga gaa aac ttg ccc aag gac att tct gaa gag gat atc aag act gtg       2109
Arg Glu Asn Leu Pro Lys Asp Ile Ser Glu Glu Asp Ile Lys Thr Val           
450                 455                 460                 465           

ttc tac tcc tgg ctg caa cag agc act acc acc atg ctc cct ctg gtc       2157
Phe Tyr Ser Trp Leu Gln Gln Ser Thr Thr Thr Met Leu Pro Leu Val           
                470                 475                 480               

att tcg gag gaa gaa ttc atc aaa ctg gaa acc aag gac gga ctg aaa       2205
Ile Ser Glu Glu Glu Phe Ile Lys Leu Glu Thr Lys Asp Gly Leu Lys           
            485                 490                 495                   

gaa ttc tcc ctg tcc atc gtg cac tcc tgg gaa aag gag aag gac aag       2253
Glu Phe Ser Leu Ser Ile Val His Ser Trp Glu Lys Glu Lys Asp Lys           
        500                 505                 510                       

aat atc ttc ctg ctg tcc ccc aat ctg ctg caa aag acc acg atc cag       2301
Asn Ile Phe Leu Leu Ser Pro Asn Leu Leu Gln Lys Thr Thr Ile Gln           
    515                 520                 525                           

gtg ctg ctc gac ccc atg gtg aag gag gaa aac tca gaa gag atc gac       2349
Val Leu Leu Asp Pro Met Val Lys Glu Glu Asn Ser Glu Glu Ile Asp           
530                 535                 540                 545           

ttc atc ctg ccg ttc ctt aag ctg agt tca ctg gga ggc gtg aac tcc       2397
Phe Ile Leu Pro Phe Leu Lys Leu Ser Ser Leu Gly Gly Val Asn Ser           
                550                 555                 560               

ctt ggc gtg tcc tcg ctg gag cac atc act cac tca ctg ctg ggc cgg       2445
Leu Gly Val Ser Ser Leu Glu His Ile Thr His Ser Leu Leu Gly Arg           
            565                 570                 575                   

cct ctg agc aga cag ctt atg agc ttg gtc gcc gga ctc aga aac ggt       2493
Pro Leu Ser Arg Gln Leu Met Ser Leu Val Ala Gly Leu Arg Asn Gly           
        580                 585                 590                       

gcc ctc ctg ctc acc ggc ggc aag gga tcg gga aag tcc acc ctc gct       2541
Ala Leu Leu Leu Thr Gly Gly Lys Gly Ser Gly Lys Ser Thr Leu Ala           
    595                 600                 605                           

aag gcc att tgc aaa gag gca ttc gat aag ctg gac gcc cat gtg gag       2589
Lys Ala Ile Cys Lys Glu Ala Phe Asp Lys Leu Asp Ala His Val Glu           
610                 615                 620                 625           

cgg gtg gac tgt aag gcc ctc cgc gga aag cga ttg gaa aat att caa       2637
Arg Val Asp Cys Lys Ala Leu Arg Gly Lys Arg Leu Glu Asn Ile Gln           
                630                 635                 640               

aag act ctc gaa gtc gcc ttt tcc gaa gcc gtc tgg atg cag ccc tcg       2685
Lys Thr Leu Glu Val Ala Phe Ser Glu Ala Val Trp Met Gln Pro Ser           
            645                 650                 655                   

gtc gtc ctg ctc gac gat ctg gac ctc atc gct ggg ctg ccg gcc gtg       2733
Val Val Leu Leu Asp Asp Leu Asp Leu Ile Ala Gly Leu Pro Ala Val           
        660                 665                 670                       

ccg gag cat gaa cac tcc cct gac gcg gtc cag tcg caa cgg ctc gcc       2781
Pro Glu His Glu His Ser Pro Asp Ala Val Gln Ser Gln Arg Leu Ala           
    675                 680                 685                           

cac gcc ctg aac gat atg att aag gaa ttc atc tca atg gga tca ctg       2829
His Ala Leu Asn Asp Met Ile Lys Glu Phe Ile Ser Met Gly Ser Leu           
690                 695                 700                 705           

gtg gcc ctg atc gcg act tcc cag agc cag cag tcc ctg cac cct ctg       2877
Val Ala Leu Ile Ala Thr Ser Gln Ser Gln Gln Ser Leu His Pro Leu           
                710                 715                 720               

ctg gtg tcg gcc cag ggc gtg cac att ttt cag tgt gtg caa cac atc       2925
Leu Val Ser Ala Gln Gly Val His Ile Phe Gln Cys Val Gln His Ile           
            725                 730                 735                   

cag ccg ccc aac cag gag cag cgg tgc gaa atc ctg tgc aac gtg att       2973
Gln Pro Pro Asn Gln Glu Gln Arg Cys Glu Ile Leu Cys Asn Val Ile           
        740                 745                 750                       

aag aac aag ctg gac tgc gat atc aac aag ttt acc gac ctt gat ctc       3021
Lys Asn Lys Leu Asp Cys Asp Ile Asn Lys Phe Thr Asp Leu Asp Leu           
    755                 760                 765                           

caa cat gtg gct aag gag act ggg ggc ttc gtg gct cgg gac ttc aca       3069
Gln His Val Ala Lys Glu Thr Gly Gly Phe Val Ala Arg Asp Phe Thr           
770                 775                 780                 785           

gtg ttg gtg gac cgg gca att cac tcc aga ctg tcc cgc cag agc att       3117
Val Leu Val Asp Arg Ala Ile His Ser Arg Leu Ser Arg Gln Ser Ile           
                790                 795                 800               

tcc acc cgc gaa aaa ctg gtc ctg acc acc ctc gac ttc cag aag gcc       3165
Ser Thr Arg Glu Lys Leu Val Leu Thr Thr Leu Asp Phe Gln Lys Ala           
            805                 810                 815                   

ctc aga ggc ttc ctt cct gcg agc ctc aga tcc gtc aac ctt cac aag       3213
Leu Arg Gly Phe Leu Pro Ala Ser Leu Arg Ser Val Asn Leu His Lys           
        820                 825                 830                       

ccg cgg gac ctt ggc tgg gac aag atc ggt ggg ctc cac gag gtg cgg       3261
Pro Arg Asp Leu Gly Trp Asp Lys Ile Gly Gly Leu His Glu Val Arg           
    835                 840                 845                           

cag atc ctc atg gac acc att cag ctg cct gca aag tac ccc gag ctg       3309
Gln Ile Leu Met Asp Thr Ile Gln Leu Pro Ala Lys Tyr Pro Glu Leu           
850                 855                 860                 865           

ttc gcc aac ttg ccg att cgc cag cgc acg gga atc ctg ctc tac ggc       3357
Phe Ala Asn Leu Pro Ile Arg Gln Arg Thr Gly Ile Leu Leu Tyr Gly           
                870                 875                 880               

ccc ccg ggc acc gga aag acc ctg ctg gcc ggt gtg atc gcc cgg gaa       3405
Pro Pro Gly Thr Gly Lys Thr Leu Leu Ala Gly Val Ile Ala Arg Glu           
            885                 890                 895                   

tcg agg atg aac ttc atc tcc gtg aag gga ccc gaa ctc ctg tcc aag       3453
Ser Arg Met Asn Phe Ile Ser Val Lys Gly Pro Glu Leu Leu Ser Lys           
        900                 905                 910                       

tac atc ggt gcc tcc gaa cag gcc gtg cgc gat ata ttc att agg gcc       3501
Tyr Ile Gly Ala Ser Glu Gln Ala Val Arg Asp Ile Phe Ile Arg Ala           
    915                 920                 925                           

cag gcc gcg aag ccc tgc att ctg ttc ttc gac gag ttt gaa tcg atc       3549
Gln Ala Ala Lys Pro Cys Ile Leu Phe Phe Asp Glu Phe Glu Ser Ile           
930                 935                 940                 945           

gcg ccc cgg agg ggc cac gac aac acg gga gtg acc gac cgg gtg gtg       3597
Ala Pro Arg Arg Gly His Asp Asn Thr Gly Val Thr Asp Arg Val Val           
                950                 955                 960               

aac cag ctg ctc acc caa ctg gat ggc gtg gaa ggc ctt cag gga gtg       3645
Asn Gln Leu Leu Thr Gln Leu Asp Gly Val Glu Gly Leu Gln Gly Val           
            965                 970                 975                   

tac gtg ctg gcg gct acc tcc aga ccg gac ctg atc gat ccg gcc ctg       3693
Tyr Val Leu Ala Ala Thr Ser Arg Pro Asp Leu Ile Asp Pro Ala Leu           
        980                 985                 990                       

ctg cgc ccc ggg aga ctg gac  aag tgc gtg tat tgc  cct ccc cct gac     3741
Leu Arg Pro Gly Arg Leu Asp  Lys Cys Val Tyr Cys  Pro Pro Pro Asp         
    995                 1000                 1005                         

cag  gtg tca agg ttg gaa  atc ctc aac gtg ctc  tcg gac tcc ctg        3786
Gln  Val Ser Arg Leu Glu  Ile Leu Asn Val Leu  Ser Asp Ser Leu            
1010                 1015                 1020                            

cca  ctg gca gat gat gtg  gac ctc cag cat gtg  gcc tcc gtg act        3831
Pro  Leu Ala Asp Asp Val  Asp Leu Gln His Val  Ala Ser Val Thr            
1025                 1030                 1035                            

gac  agc ttc aca gga gcc  gat ctg aag gcc ctg  ctt tac aac gcc        3876
Asp  Ser Phe Thr Gly Ala  Asp Leu Lys Ala Leu  Leu Tyr Asn Ala            
1040                 1045                 1050                            

cag  ttg gag gcg ctg cac  ggt atg ctg ctg tcc  tcc ggt ctg cag        3921
Gln  Leu Glu Ala Leu His  Gly Met Leu Leu Ser  Ser Gly Leu Gln            
1055                 1060                 1065                            

gat  ggc tcc tcc tct tcc  gat agc gac ctg tcg  ctg agc agc atg        3966
Asp  Gly Ser Ser Ser Ser  Asp Ser Asp Leu Ser  Leu Ser Ser Met            
1070                 1075                 1080                            

gtg  ttc ctg aac cat tcc  agc ggc tcc gat gac  agc gcg ggc gac        4011
Val  Phe Leu Asn His Ser  Ser Gly Ser Asp Asp  Ser Ala Gly Asp            
1085                 1090                 1095                            

gga  gaa tgt gga ctg gat  caa tcc ctg gtg tcc  ctg gag atg agc        4056
Gly  Glu Cys Gly Leu Asp  Gln Ser Leu Val Ser  Leu Glu Met Ser            
1100                 1105                 1110                            

gag  att ctg cca gac gag  tcc aag ttc aac atg  tac agg ctg tac        4101
Glu  Ile Leu Pro Asp Glu  Ser Lys Phe Asn Met  Tyr Arg Leu Tyr            
1115                 1120                 1125                            

ttc  ggc agc agc tac gag  tcc gag ctg gga aat  ggt acc tcg tcc        4146
Phe  Gly Ser Ser Tyr Glu  Ser Glu Leu Gly Asn  Gly Thr Ser Ser            
1130                 1135                 1140                            

gac  ctg tca agc cag tgc  ctg tcc gcg cct tcc  tcc atg acc cag        4191
Asp  Leu Ser Ser Gln Cys  Leu Ser Ala Pro Ser  Ser Met Thr Gln            
1145                 1150                 1155                            

gac  ctc cct gga gtg cca  ggg aag gat cag ctg  ttc agc cag cct        4236
Asp  Leu Pro Gly Val Pro  Gly Lys Asp Gln Leu  Phe Ser Gln Pro            
1160                 1165                 1170                            

ccc  gtg ctg cgc act gcg  agc cag gaa ggg tgc  cag gaa ttg acc        4281
Pro  Val Leu Arg Thr Ala  Ser Gln Glu Gly Cys  Gln Glu Leu Thr            
1175                 1180                 1185                            

caa  gag cag cgg gac caa  ctg cgc gcg gac att  tcg atc atc aaa        4326
Gln  Glu Gln Arg Asp Gln  Leu Arg Ala Asp Ile  Ser Ile Ile Lys            
1190                 1195                 1200                            

ggc  aga tac cgc tcc caa  tcc ggg gag gac gaa  agc atg aac cag        4371
Gly  Arg Tyr Arg Ser Gln  Ser Gly Glu Asp Glu  Ser Met Asn Gln            
1205                 1210                 1215                            

ccc  ggg cct atc aag act  aga ctg gca atc tcc  caa agc cac ctg        4416
Pro  Gly Pro Ile Lys Thr  Arg Leu Ala Ile Ser  Gln Ser His Leu            
1220                 1225                 1230                            

atg  acc gca ctg gga cac  acc cgg ccc tcg atc  tcg gag gac gac        4461
Met  Thr Ala Leu Gly His  Thr Arg Pro Ser Ile  Ser Glu Asp Asp            
1235                 1240                 1245                            

tgg  aag aac ttc gct gag  ctg tac gaa tcc ttc  cag aat ccg aag        4506
Trp  Lys Asn Phe Ala Glu  Leu Tyr Glu Ser Phe  Gln Asn Pro Lys            
1250                 1255                 1260                            

cgg  aga aag aac cag agc  gga act atg ttc cgg  ccc gga cag aag        4551
Arg  Arg Lys Asn Gln Ser  Gly Thr Met Phe Arg  Pro Gly Gln Lys            
1265                 1270                 1275                            

gtg  acc ctg gcc tga agtactgcgg atcctgcaga tctgcctcga ctgtgccttc      4606
Val  Thr Leu Ala                                                          
1280                                                                      

tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc     4666

cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg     4726

tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt gggaagacaa     4786

tagcaggcat gctggggact cgagttctac gtagataagt agcatggcgg gttaatcatt     4846

aactacaagg aacccctagt gatggagttg gccactccct ctctgcgcgc tcgctcgctc     4906

actgaggccg ggcgaccaaa ggtcgcccga cgcccgggct ttgcccgggc ggcctcagtg     4966

agcgagcgag cgcgcag                                                    4983


<210>  7
<211>  1283
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  7

Met Trp Gly Ser Asp Arg Leu Ala Gly Ala Gly Gly Gly Gly Ala Ala 
1               5                   10                  15      


Val Thr Val Ala Phe Thr Asn Ala Arg Asp Cys Phe Leu His Leu Pro 
            20                  25                  30          


Arg Arg Leu Val Ala Gln Leu His Leu Leu Gln Asn Gln Ala Ile Glu 
        35                  40                  45              


Val Val Trp Ser His Gln Pro Ala Phe Leu Ser Trp Val Glu Gly Arg 
    50                  55                  60                  


His Phe Ser Asp Gln Gly Glu Asn Val Ala Glu Ile Asn Arg Gln Val 
65                  70                  75                  80  


Gly Gln Lys Leu Gly Leu Ser Asn Gly Gly Gln Val Phe Leu Lys Pro 
                85                  90                  95      


Cys Ser His Val Val Ser Cys Gln Gln Val Glu Val Glu Pro Leu Ser 
            100                 105                 110         


Ala Asp Asp Trp Glu Ile Leu Glu Leu His Ala Val Ser Leu Glu Gln 
        115                 120                 125             


His Leu Leu Asp Gln Ile Arg Ile Val Phe Pro Lys Ala Ile Phe Pro 
    130                 135                 140                 


Val Trp Val Asp Gln Gln Thr Tyr Ile Phe Ile Gln Ile Val Ala Leu 
145                 150                 155                 160 


Ile Pro Ala Ala Ser Tyr Gly Arg Leu Glu Thr Asp Thr Lys Leu Leu 
                165                 170                 175     


Ile Gln Pro Lys Thr Arg Arg Ala Lys Glu Asn Thr Phe Ser Lys Ala 
            180                 185                 190         


Asp Ala Glu Tyr Lys Lys Leu His Ser Tyr Gly Arg Asp Gln Lys Gly 
        195                 200                 205             


Met Met Lys Glu Leu Gln Thr Lys Gln Leu Gln Ser Asn Thr Val Gly 
    210                 215                 220                 


Ile Thr Glu Ser Asn Glu Asn Glu Ser Glu Ile Pro Val Asp Ser Ser 
225                 230                 235                 240 


Ser Val Ala Ser Leu Trp Thr Met Ile Gly Ser Ile Phe Ser Phe Gln 
                245                 250                 255     


Ser Glu Lys Lys Gln Glu Thr Ser Trp Gly Leu Thr Glu Ile Asn Ala 
            260                 265                 270         


Phe Lys Asn Met Gln Ser Lys Val Val Pro Leu Asp Asn Ile Phe Arg 
        275                 280                 285             


Val Cys Lys Ser Gln Pro Pro Ser Ile Tyr Asn Ala Ser Ala Thr Ser 
    290                 295                 300                 


Val Phe His Lys His Cys Ala Ile His Val Phe Pro Trp Asp Gln Glu 
305                 310                 315                 320 


Tyr Phe Asp Val Glu Pro Ser Phe Thr Val Thr Tyr Gly Lys Leu Val 
                325                 330                 335     


Lys Leu Leu Ser Pro Lys Gln Gln Gln Ser Lys Thr Lys Gln Asn Val 
            340                 345                 350         


Leu Ser Pro Glu Lys Glu Lys Gln Met Ser Glu Pro Leu Asp Gln Lys 
        355                 360                 365             


Lys Ile Arg Ser Asp His Asn Glu Glu Asp Glu Lys Ala Cys Val Leu 
    370                 375                 380                 


Gln Val Val Trp Asn Gly Leu Glu Glu Leu Asn Asn Ala Ile Lys Tyr 
385                 390                 395                 400 


Thr Lys Asn Val Glu Val Leu His Leu Gly Lys Val Trp Ile Pro Asp 
                405                 410                 415     


Asp Leu Arg Lys Arg Leu Asn Ile Glu Met His Ala Val Val Arg Ile 
            420                 425                 430         


Thr Pro Val Glu Val Thr Pro Lys Ile Pro Arg Ser Leu Lys Leu Gln 
        435                 440                 445             


Pro Arg Glu Asn Leu Pro Lys Asp Ile Ser Glu Glu Asp Ile Lys Thr 
    450                 455                 460                 


Val Phe Tyr Ser Trp Leu Gln Gln Ser Thr Thr Thr Met Leu Pro Leu 
465                 470                 475                 480 


Val Ile Ser Glu Glu Glu Phe Ile Lys Leu Glu Thr Lys Asp Gly Leu 
                485                 490                 495     


Lys Glu Phe Ser Leu Ser Ile Val His Ser Trp Glu Lys Glu Lys Asp 
            500                 505                 510         


Lys Asn Ile Phe Leu Leu Ser Pro Asn Leu Leu Gln Lys Thr Thr Ile 
        515                 520                 525             


Gln Val Leu Leu Asp Pro Met Val Lys Glu Glu Asn Ser Glu Glu Ile 
    530                 535                 540                 


Asp Phe Ile Leu Pro Phe Leu Lys Leu Ser Ser Leu Gly Gly Val Asn 
545                 550                 555                 560 


Ser Leu Gly Val Ser Ser Leu Glu His Ile Thr His Ser Leu Leu Gly 
                565                 570                 575     


Arg Pro Leu Ser Arg Gln Leu Met Ser Leu Val Ala Gly Leu Arg Asn 
            580                 585                 590         


Gly Ala Leu Leu Leu Thr Gly Gly Lys Gly Ser Gly Lys Ser Thr Leu 
        595                 600                 605             


Ala Lys Ala Ile Cys Lys Glu Ala Phe Asp Lys Leu Asp Ala His Val 
    610                 615                 620                 


Glu Arg Val Asp Cys Lys Ala Leu Arg Gly Lys Arg Leu Glu Asn Ile 
625                 630                 635                 640 


Gln Lys Thr Leu Glu Val Ala Phe Ser Glu Ala Val Trp Met Gln Pro 
                645                 650                 655     


Ser Val Val Leu Leu Asp Asp Leu Asp Leu Ile Ala Gly Leu Pro Ala 
            660                 665                 670         


Val Pro Glu His Glu His Ser Pro Asp Ala Val Gln Ser Gln Arg Leu 
        675                 680                 685             


Ala His Ala Leu Asn Asp Met Ile Lys Glu Phe Ile Ser Met Gly Ser 
    690                 695                 700                 


Leu Val Ala Leu Ile Ala Thr Ser Gln Ser Gln Gln Ser Leu His Pro 
705                 710                 715                 720 


Leu Leu Val Ser Ala Gln Gly Val His Ile Phe Gln Cys Val Gln His 
                725                 730                 735     


Ile Gln Pro Pro Asn Gln Glu Gln Arg Cys Glu Ile Leu Cys Asn Val 
            740                 745                 750         


Ile Lys Asn Lys Leu Asp Cys Asp Ile Asn Lys Phe Thr Asp Leu Asp 
        755                 760                 765             


Leu Gln His Val Ala Lys Glu Thr Gly Gly Phe Val Ala Arg Asp Phe 
    770                 775                 780                 


Thr Val Leu Val Asp Arg Ala Ile His Ser Arg Leu Ser Arg Gln Ser 
785                 790                 795                 800 


Ile Ser Thr Arg Glu Lys Leu Val Leu Thr Thr Leu Asp Phe Gln Lys 
                805                 810                 815     


Ala Leu Arg Gly Phe Leu Pro Ala Ser Leu Arg Ser Val Asn Leu His 
            820                 825                 830         


Lys Pro Arg Asp Leu Gly Trp Asp Lys Ile Gly Gly Leu His Glu Val 
        835                 840                 845             


Arg Gln Ile Leu Met Asp Thr Ile Gln Leu Pro Ala Lys Tyr Pro Glu 
    850                 855                 860                 


Leu Phe Ala Asn Leu Pro Ile Arg Gln Arg Thr Gly Ile Leu Leu Tyr 
865                 870                 875                 880 


Gly Pro Pro Gly Thr Gly Lys Thr Leu Leu Ala Gly Val Ile Ala Arg 
                885                 890                 895     


Glu Ser Arg Met Asn Phe Ile Ser Val Lys Gly Pro Glu Leu Leu Ser 
            900                 905                 910         


Lys Tyr Ile Gly Ala Ser Glu Gln Ala Val Arg Asp Ile Phe Ile Arg 
        915                 920                 925             


Ala Gln Ala Ala Lys Pro Cys Ile Leu Phe Phe Asp Glu Phe Glu Ser 
    930                 935                 940                 


Ile Ala Pro Arg Arg Gly His Asp Asn Thr Gly Val Thr Asp Arg Val 
945                 950                 955                 960 


Val Asn Gln Leu Leu Thr Gln Leu Asp Gly Val Glu Gly Leu Gln Gly 
                965                 970                 975     


Val Tyr Val Leu Ala Ala Thr Ser Arg Pro Asp Leu Ile Asp Pro Ala 
            980                 985                 990         


Leu Leu Arg Pro Gly Arg Leu Asp  Lys Cys Val Tyr Cys  Pro Pro Pro 
        995                 1000                 1005             


Asp Gln  Val Ser Arg Leu Glu  Ile Leu Asn Val Leu  Ser Asp Ser 
    1010                 1015                 1020             


Leu Pro  Leu Ala Asp Asp Val  Asp Leu Gln His Val  Ala Ser Val 
    1025                 1030                 1035             


Thr Asp  Ser Phe Thr Gly Ala  Asp Leu Lys Ala Leu  Leu Tyr Asn 
    1040                 1045                 1050             


Ala Gln  Leu Glu Ala Leu His  Gly Met Leu Leu Ser  Ser Gly Leu 
    1055                 1060                 1065             


Gln Asp  Gly Ser Ser Ser Ser  Asp Ser Asp Leu Ser  Leu Ser Ser 
    1070                 1075                 1080             


Met Val  Phe Leu Asn His Ser  Ser Gly Ser Asp Asp  Ser Ala Gly 
    1085                 1090                 1095             


Asp Gly  Glu Cys Gly Leu Asp  Gln Ser Leu Val Ser  Leu Glu Met 
    1100                 1105                 1110             


Ser Glu  Ile Leu Pro Asp Glu  Ser Lys Phe Asn Met  Tyr Arg Leu 
    1115                 1120                 1125             


Tyr Phe  Gly Ser Ser Tyr Glu  Ser Glu Leu Gly Asn  Gly Thr Ser 
    1130                 1135                 1140             


Ser Asp  Leu Ser Ser Gln Cys  Leu Ser Ala Pro Ser  Ser Met Thr 
    1145                 1150                 1155             


Gln Asp  Leu Pro Gly Val Pro  Gly Lys Asp Gln Leu  Phe Ser Gln 
    1160                 1165                 1170             


Pro Pro  Val Leu Arg Thr Ala  Ser Gln Glu Gly Cys  Gln Glu Leu 
    1175                 1180                 1185             


Thr Gln  Glu Gln Arg Asp Gln  Leu Arg Ala Asp Ile  Ser Ile Ile 
    1190                 1195                 1200             


Lys Gly  Arg Tyr Arg Ser Gln  Ser Gly Glu Asp Glu  Ser Met Asn 
    1205                 1210                 1215             


Gln Pro  Gly Pro Ile Lys Thr  Arg Leu Ala Ile Ser  Gln Ser His 
    1220                 1225                 1230             


Leu Met  Thr Ala Leu Gly His  Thr Arg Pro Ser Ile  Ser Glu Asp 
    1235                 1240                 1245             


Asp Trp  Lys Asn Phe Ala Glu  Leu Tyr Glu Ser Phe  Gln Asn Pro 
    1250                 1255                 1260             


Lys Arg  Arg Lys Asn Gln Ser  Gly Thr Met Phe Arg  Pro Gly Gln 
    1265                 1270                 1275             


Lys Val  Thr Leu Ala 
    1280             


<210>  8
<211>  4947
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  hRK1.hPEX1


<220>
<221>  repeat_region
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  misc_feature
<222>  (113)..(130)
<223>  ITR D Segment

<220>
<221>  promoter
<222>  (175)..(684)
<223>  hRK1 promoter

<220>
<221>  misc_feature
<222>  (691)..(699)
<223>  Kozak

<220>
<221>  misc_feature
<222>  (700)..(4551)
<223>  codon-optimized human PEX1

<220>
<221>  polyA_signal
<222>  (4573)..(4684)
<223>  bovine growth hormone (bGH) polyadenylation (poly(A)) signal

<220>
<221>  repeat_region
<222>  (4818)..(4947)
<223>  3' ITR

<220>
<221>  misc_feature
<222>  (4818)..(4835)
<223>  ITR D segment

<400>  8
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

aagatccaag ctcagatctc gatcgagttg ggccccagaa gcctggtggt tgtttgtcct      240

tctcagggga aaagtgaggc ggccccttgg aggaaggggc cgggcagaat gatctaatcg      300

gattccaagc agctcagggg attgtctttt tctagcacct tcttgccact cctaagcgtc      360

ctccgtgacc ccggctggga tttagcctgg tgctgtgtca gccccggtct cccaggggct      420

tcccagtggt ccccaggaac cctcgacagg gcccggtctc tctcgtccag caagggcagg      480

gacgggccac aggccaaggg ccctcgatcg aggaactgaa aaaccagaaa gttaactggt      540

aagtttagtc tttttgtctt ttatttcagg tcccggatcc ggtggtggtg caaatcaaag      600

aactgctcct cagtggatgt tgcctttact tctaggcctg tacggaagtg ttacttctgc      660

tctaaaagct gcggaattgt acccgcggcc gccgccacca tgtggggaag cgacagactg      720

gccggagctg gagggggagg agcagccgtc accgtggcgt tcactaacgc gcgggactgc      780

tttctccatc tgccgcggag gctggtcgcc cagctgcacc tcctgcagaa ccaggccatc      840

gaggtggtgt ggtcccacca accggccttt ttgagctggg tcgagggaag gcacttttcg      900

gaccagggag aaaatgtggc ggagatcaac cgccaggtcg gccagaagct gggactgtcc      960

aacggcggac aggtgttcct caagccgtgc agccacgtgg tgtcctgcca acaggtggaa     1020

gtggagccgc tctccgccga cgactgggag atcctcgaat tgcatgccgt gagcctcgaa     1080

cagcatctgt tggaccagat tcgcattgtg ttcccgaagg ccatattccc cgtgtgggtc     1140

gatcagcaga cctatatctt catccagatt gtggccctca tcccggccgc ctcatacgga     1200

cggctggaaa ctgacaccaa gctgctgatt caacctaaga cccggagggc caaagaaaac     1260

accttctcca aggccgacgc tgagtacaag aagctccact cctacggacg ggaccagaag     1320

gggatgatga aggagctgca aaccaagcag ctccagagca acaccgtggg gatcaccgag     1380

tccaatgaaa acgagtcgga aatcccagtc gattcatctt ccgtggccag cctgtggact     1440

atgatcggtt ccattttctc gttccaatct gagaagaagc aggaaactag ctgggggctg     1500

actgagatca acgccttcaa gaacatgcag tccaaagtgg tgcctctgga taacatcttt     1560

cgcgtgtgca agtcccaacc gccctcaatc tacaacgcgt ccgctacctc cgtgtttcat     1620

aagcactgtg ccatccacgt gttcccatgg gatcaggaat acttcgatgt cgaaccttcc     1680

ttcaccgtga cttacgggaa gcttgtcaag ctcctcagcc ccaagcagca gcaatcgaaa     1740

actaagcaga acgtgctttc cccggagaag gagaagcaaa tgtcagaacc actcgaccag     1800

aagaaaatca gatcggatca taacgaagag gacgagaagg cctgcgtcct tcaggtggtc     1860

tggaacggcc tggaggagct gaacaacgcg attaagtaca ccaagaacgt cgaggtcctt     1920

cacctgggaa aggtgtggat tccggatgat ctgaggaaac gcctcaacat cgaaatgcac     1980

gctgtggtgc ggattacccc ggtcgaggtc accccaaaga tccctcgctc cttgaagctg     2040

cagccgcgag aaaacttgcc caaggacatt tctgaagagg atatcaagac tgtgttctac     2100

tcctggctgc aacagagcac taccaccatg ctccctctgg tcatttcgga ggaagaattc     2160

atcaaactgg aaaccaagga cggactgaaa gaattctccc tgtccatcgt gcactcctgg     2220

gaaaaggaga aggacaagaa tatcttcctg ctgtccccca atctgctgca aaagaccacg     2280

atccaggtgc tgctcgaccc catggtgaag gaggaaaact cagaagagat cgacttcatc     2340

ctgccgttcc ttaagctgag ttcactggga ggcgtgaact cccttggcgt gtcctcgctg     2400

gagcacatca ctcactcact gctgggccgg cctctgagca gacagcttat gagcttggtc     2460

gccggactca gaaacggtgc cctcctgctc accggcggca agggatcggg aaagtccacc     2520

ctcgctaagg ccatttgcaa agaggcattc gataagctgg acgcccatgt ggagcgggtg     2580

gactgtaagg ccctccgcgg aaagcgattg gaaaatattc aaaagactct cgaagtcgcc     2640

ttttccgaag ccgtctggat gcagccctcg gtcgtcctgc tcgacgatct ggacctcatc     2700

gctgggctgc cggccgtgcc ggagcatgaa cactcccctg acgcggtcca gtcgcaacgg     2760

ctcgcccacg ccctgaacga tatgattaag gaattcatct caatgggatc actggtggcc     2820

ctgatcgcga cttcccagag ccagcagtcc ctgcaccctc tgctggtgtc ggcccagggc     2880

gtgcacattt ttcagtgtgt gcaacacatc cagccgccca accaggagca gcggtgcgaa     2940

atcctgtgca acgtgattaa gaacaagctg gactgcgata tcaacaagtt taccgacctt     3000

gatctccaac atgtggctaa ggagactggg ggcttcgtgg ctcgggactt cacagtgttg     3060

gtggaccggg caattcactc cagactgtcc cgccagagca tttccacccg cgaaaaactg     3120

gtcctgacca ccctcgactt ccagaaggcc ctcagaggct tccttcctgc gagcctcaga     3180

tccgtcaacc ttcacaagcc gcgggacctt ggctgggaca agatcggtgg gctccacgag     3240

gtgcggcaga tcctcatgga caccattcag ctgcctgcaa agtaccccga gctgttcgcc     3300

aacttgccga ttcgccagcg cacgggaatc ctgctctacg gccccccggg caccggaaag     3360

accctgctgg ccggtgtgat cgcccgggaa tcgaggatga acttcatctc cgtgaaggga     3420

cccgaactcc tgtccaagta catcggtgcc tccgaacagg ccgtgcgcga tatattcatt     3480

agggcccagg ccgcgaagcc ctgcattctg ttcttcgacg agtttgaatc gatcgcgccc     3540

cggaggggcc acgacaacac gggagtgacc gaccgggtgg tgaaccagct gctcacccaa     3600

ctggatggcg tggaaggcct tcagggagtg tacgtgctgg cggctacctc cagaccggac     3660

ctgatcgatc cggccctgct gcgccccggg agactggaca agtgcgtgta ttgccctccc     3720

cctgaccagg tgtcaaggtt ggaaatcctc aacgtgctct cggactccct gccactggca     3780

gatgatgtgg acctccagca tgtggcctcc gtgactgaca gcttcacagg agccgatctg     3840

aaggccctgc tttacaacgc ccagttggag gcgctgcacg gtatgctgct gtcctccggt     3900

ctgcaggatg gctcctcctc ttccgatagc gacctgtcgc tgagcagcat ggtgttcctg     3960

aaccattcca gcggctccga tgacagcgcg ggcgacggag aatgtggact ggatcaatcc     4020

ctggtgtccc tggagatgag cgagattctg ccagacgagt ccaagttcaa catgtacagg     4080

ctgtacttcg gcagcagcta cgagtccgag ctgggaaatg gtacctcgtc cgacctgtca     4140

agccagtgcc tgtccgcgcc ttcctccatg acccaggacc tccctggagt gccagggaag     4200

gatcagctgt tcagccagcc tcccgtgctg cgcactgcga gccaggaagg gtgccaggaa     4260

ttgacccaag agcagcggga ccaactgcgc gcggacattt cgatcatcaa aggcagatac     4320

cgctcccaat ccggggagga cgaaagcatg aaccagcccg ggcctatcaa gactagactg     4380

gcaatctccc aaagccacct gatgaccgca ctgggacaca cccggccctc gatctcggag     4440

gacgactgga agaacttcgc tgagctgtac gaatccttcc agaatccgaa gcggagaaag     4500

aaccagagcg gaactatgtt ccggcccgga cagaaggtga ccctggcctg atgtacaagt     4560

aataagcctc gactgtgcct tctagttgcc agccatctgt tgtttgcccc tcccccgtgc     4620

cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat gaggaaattg     4680

catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg caggacagca     4740

agggggagga ttgggaagac aatagcaggt cgagttctac gtagataagt agcatggcgg     4800

gttaatcatt aactacaagg aacccctagt gatggagttg gccactccct ctctgcgcgc     4860

tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga cgcccgggct ttgcccgggc     4920

ggcctcagtg agcgagcgag cgcgcag                                         4947


<210>  9
<211>  13990
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAAV.CAG.copt.hPEX1


<220>
<221>  misc_feature
<222>  (1253)..(7390)
<223>  AAV expression cassette

<220>
<221>  misc_feature
<222>  (1253)..(1382)
<223>  5' ITR

<220>
<221>  enhancer
<222>  (1493)..(1796)
<223>  CMV enhancer

<220>
<221>  promoter
<222>  (1798)..(2075)
<223>  CBA promoter

<220>
<221>  Intron
<222>  (2076)..(3104)
<223>  Chimeric intron

<220>
<221>  misc_feature
<222>  (3113)..(3121)
<223>  Kozak

<220>
<221>  misc_feature
<222>  (3122)..(6973)
<223>  codon optimized hPEX1

<220>
<221>  polyA_signal
<222>  (7004)..(7211)
<223>  bGH poly(A)

<220>
<221>  misc_feature
<222>  (7261)..(7390)
<223>  3' ITR

<400>  9
tagaaaaact catcgagcat caaatgaaac tgcaatttat tcatatcagg attatcaata       60

ccatattttt gaaaaagccg tttctgtaat gaaggagaaa actcaccgag gcagttccat      120

aggatggcaa gatcctggta tcggtctgcg attccgactc gtccaacatc aatacaacct      180

attaatttcc cctcgtcaaa aataaggtta tcaagtgaga aatcaccatg agtgacgact      240

gaatccggtg agaatggcaa aagtttatgc atttctttcc agacttgttc aacaggccag      300

ccattacgct cgtcatcaaa atcactcgca tcaaccaaac cgttattcat tcgtgattgc      360

gcctgagcga ggcgaaatac gcgatcgctg ttaaaaggac aattacaaac aggaatcgag      420

tgcaaccggc gcaggaacac tgccagcgca tcaacaatat tttcacctga atcaggatat      480

tcttctaata cctggaacgc tgtttttccg gggatcgcag tggtgagtaa ccatgcatca      540

tcaggagtac ggataaaatg cttgatggtc ggaagtggca taaattccgt cagccagttt      600

agtctgacca tctcatctgt aacatcattg gcaacgctac ctttgccatg tttcagaaac      660

aactctggcg catcgggctt cccatacaag cgatagattg tcgcacctga ttgcccgaca      720

ttatcgcgag cccatttata cccatataaa tcagcatcca tgttggaatt taatcgcggc      780

ctcgacgttt cccgttgaat atggctcata ttcttccttt ttcaatatta ttgaagcatt      840

tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa      900

ataggggtca gtgttacaac caattaacca attctgaaca ttatcgcgag cccatttata      960

cctgaatatg gctcataaca ccccttgttt gcctggcggc agtagcgcgg tggtcccacc     1020

tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg tggggactcc     1080

ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag tcgaaagact     1140

gggcctttcg cccgggctaa ttagggggtg tcgcccttat tcgactctat agtgaagttc     1200

ctattctcta gaaagtatag gaacttctga agtggggtcg acttaattaa ggctgcgcgc     1260

tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc     1320

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc     1380

cttgtagtta atgattaacc cgccatgcta cttatctacg tagcaagcta gctagttatt     1440

aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc cgcgttacat     1500

aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca ttgacgtcaa     1560

taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt caatgggtgg     1620

agtatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg ccaagtacgc     1680

cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag tacatgacct     1740

tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt aacatggtcg     1800

aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca cccccaattt     1860

tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg gggggggggc     1920

gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg agaggtgcgg     1980

cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg cggcggcggc     2040

ggcggcccta taaaaagcga agcgcgcggc gggcggggag tcgctgcgac gctgccttcg     2100

ccccgtgccc cgctccgccg ccgcctcgcg ccgcccgccc cggctctgac tgaccgcgtt     2160

actcccacag gtgagcgggc gggacggccc ttctcctccg ggctgtaatt agcgcttggt     2220

ttaatgacgg cttgtttctt ttctgtggct gcgtgaaagc cttgaggggc tccgggaggg     2280

ccctttgtgc ggggggagcg gctcgggggg tgcgtgcgtg tgtgtgtgcg tggggagcgc     2340

cgcgtgcggc tccgcgctgc ccggcggctg tgagcgctgc gggcgcggcg cggggctttg     2400

tgcgctccgc agtgtgcgcg aggggagcgc ggccgggggc ggtgccccgc ggtgcggggg     2460

gggctgcgag gggaacaaag gctgcgtgcg gggtgtgtgc gtgggggggt gagcaggggg     2520

tgtgggcgcg tcggtcgggc tgcaaccccc cctgcacccc cctccccgag ttgctgagca     2580

cggcccggct tcgggtgcgg ggctccgtac ggggcgtggc gcggggctcg ccgtgccggg     2640

cggggggtgg cggcaggtgg gggtgccggg cggggcgggg ccgcctcggg ccggggaggg     2700

ctcgggggag gggcgcggcg gcccccggag cgccggcggc tgtcgaggcg cggcgagccg     2760

cagccattgc cttttatggt aatcgtgcga gagggcgcag ggacttcctt tgtcccaaat     2820

ctgtgcggag ccgaaatctg ggaggcgccg ccgcaccccc tctagcgggc gcggggcgaa     2880

gcggtgcggc gccggcagga aggaaatggg cggggagggc cttcgtgcgt cgccgcgccg     2940

ccgtcccctt ctccctctcc agcctcgggg ctgtccgcgg ggggacggct gccttcgggg     3000

gggacggggc agggcggggt tcggcttctg gcgtgtgacc ggcggctcta gacaattgta     3060

ctaaccttct tctctttcct ctcctgacag gttggtgtac actagcggcc gcgccgccac     3120

catgtgggga agcgacagac tggccggagc tggaggggga ggagcagccg tcaccgtggc     3180

gttcactaac gcgcgggact gctttctcca tctgccgcgg aggctggtcg cccagctgca     3240

cctcctgcag aaccaggcca tcgaggtggt gtggtcccac caaccggcct ttttgagctg     3300

ggtcgaggga aggcactttt cggaccaggg agaaaatgtg gcggagatca accgccaggt     3360

cggccagaag ctgggactgt ccaacggcgg acaggtgttc ctcaagccgt gcagccacgt     3420

ggtgtcctgc caacaggtgg aagtggagcc gctctccgcc gacgactggg agatcctcga     3480

attgcatgcc gtgagcctcg aacagcatct gttggaccag attcgcattg tgttcccgaa     3540

ggccatattc cccgtgtggg tcgatcagca gacctatatc ttcatccaga ttgtggccct     3600

catcccggcc gcctcatacg gacggctgga aactgacacc aagctgctga ttcaacctaa     3660

gacccggagg gccaaagaaa acaccttctc caaggccgac gctgagtaca agaagctcca     3720

ctcctacgga cgggaccaga aggggatgat gaaggagctg caaaccaagc agctccagag     3780

caacaccgtg gggatcaccg agtccaatga aaacgagtcg gaaatcccag tcgattcatc     3840

ttccgtggcc agcctgtgga ctatgatcgg ttccattttc tcgttccaat ctgagaagaa     3900

gcaggaaact agctgggggc tgactgagat caacgccttc aagaacatgc agtccaaagt     3960

ggtgcctctg gataacatct ttcgcgtgtg caagtcccaa ccgccctcaa tctacaacgc     4020

gtccgctacc tccgtgtttc ataagcactg tgccatccac gtgttcccat gggatcagga     4080

atacttcgat gtcgaacctt ccttcaccgt gacttacggg aagcttgtca agctcctcag     4140

ccccaagcag cagcaatcga aaactaagca gaacgtgctt tccccggaga aggagaagca     4200

aatgtcagaa ccactcgacc agaagaaaat cagatcggat cataacgaag aggacgagaa     4260

ggcctgcgtc cttcaggtgg tctggaacgg cctggaggag ctgaacaacg cgattaagta     4320

caccaagaac gtcgaggtcc ttcacctggg aaaggtgtgg attccggatg atctgaggaa     4380

acgcctcaac atcgaaatgc acgctgtggt gcggattacc ccggtcgagg tcaccccaaa     4440

gatccctcgc tccttgaagc tgcagccgcg agaaaacttg cccaaggaca tttctgaaga     4500

ggatatcaag actgtgttct actcctggct gcaacagagc actaccacca tgctccctct     4560

ggtcatttcg gaggaagaat tcatcaaact ggaaaccaag gacggactga aagaattctc     4620

cctgtccatc gtgcactcct gggaaaagga gaaggacaag aatatcttcc tgctgtcccc     4680

caatctgctg caaaagacca cgatccaggt gctgctcgac cccatggtga aggaggaaaa     4740

ctcagaagag atcgacttca tcctgccgtt ccttaagctg agttcactgg gaggcgtgaa     4800

ctcccttggc gtgtcctcgc tggagcacat cactcactca ctgctgggcc ggcctctgag     4860

cagacagctt atgagcttgg tcgccggact cagaaacggt gccctcctgc tcaccggcgg     4920

caagggatcg ggaaagtcca ccctcgctaa ggccatttgc aaagaggcat tcgataagct     4980

ggacgcccat gtggagcggg tggactgtaa ggccctccgc ggaaagcgat tggaaaatat     5040

tcaaaagact ctcgaagtcg ccttttccga agccgtctgg atgcagccct cggtcgtcct     5100

gctcgacgat ctggacctca tcgctgggct gccggccgtg ccggagcatg aacactcccc     5160

tgacgcggtc cagtcgcaac ggctcgccca cgccctgaac gatatgatta aggaattcat     5220

ctcaatggga tcactggtgg ccctgatcgc gacttcccag agccagcagt ccctgcaccc     5280

tctgctggtg tcggcccagg gcgtgcacat ttttcagtgt gtgcaacaca tccagccgcc     5340

caaccaggag cagcggtgcg aaatcctgtg caacgtgatt aagaacaagc tggactgcga     5400

tatcaacaag tttaccgacc ttgatctcca acatgtggct aaggagactg ggggcttcgt     5460

ggctcgggac ttcacagtgt tggtggaccg ggcaattcac tccagactgt cccgccagag     5520

catttccacc cgcgaaaaac tggtcctgac caccctcgac ttccagaagg ccctcagagg     5580

cttccttcct gcgagcctca gatccgtcaa ccttcacaag ccgcgggacc ttggctggga     5640

caagatcggt gggctccacg aggtgcggca gatcctcatg gacaccattc agctgcctgc     5700

aaagtacccc gagctgttcg ccaacttgcc gattcgccag cgcacgggaa tcctgctcta     5760

cggccccccg ggcaccggaa agaccctgct ggccggtgtg atcgcccggg aatcgaggat     5820

gaacttcatc tccgtgaagg gacccgaact cctgtccaag tacatcggtg cctccgaaca     5880

ggccgtgcgc gatatattca ttagggccca ggccgcgaag ccctgcattc tgttcttcga     5940

cgagtttgaa tcgatcgcgc cccggagggg ccacgacaac acgggagtga ccgaccgggt     6000

ggtgaaccag ctgctcaccc aactggatgg cgtggaaggc cttcagggag tgtacgtgct     6060

ggcggctacc tccagaccgg acctgatcga tccggccctg ctgcgccccg ggagactgga     6120

caagtgcgtg tattgccctc cccctgacca ggtgtcaagg ttggaaatcc tcaacgtgct     6180

ctcggactcc ctgccactgg cagatgatgt ggacctccag catgtggcct ccgtgactga     6240

cagcttcaca ggagccgatc tgaaggccct gctttacaac gcccagttgg aggcgctgca     6300

cggtatgctg ctgtcctccg gtctgcagga tggctcctcc tcttccgata gcgacctgtc     6360

gctgagcagc atggtgttcc tgaaccattc cagcggctcc gatgacagcg cgggcgacgg     6420

agaatgtgga ctggatcaat ccctggtgtc cctggagatg agcgagattc tgccagacga     6480

gtccaagttc aacatgtaca ggctgtactt cggcagcagc tacgagtccg agctgggaaa     6540

tggtacctcg tccgacctgt caagccagtg cctgtccgcg ccttcctcca tgacccagga     6600

cctccctgga gtgccaggga aggatcagct gttcagccag cctcccgtgc tgcgcactgc     6660

gagccaggaa gggtgccagg aattgaccca agagcagcgg gaccaactgc gcgcggacat     6720

ttcgatcatc aaaggcagat accgctccca atccggggag gacgaaagca tgaaccagcc     6780

cgggcctatc aagactagac tggcaatctc ccaaagccac ctgatgaccg cactgggaca     6840

cacccggccc tcgatctcgg aggacgactg gaagaacttc gctgagctgt acgaatcctt     6900

ccagaatccg aagcggagaa agaaccagag cggaactatg ttccggcccg gacagaaggt     6960

gaccctggcc tgaagtactg cggatcctgc agatctgcct cgactgtgcc ttctagttgc     7020

cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc     7080

actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct     7140

attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg     7200

catgctgggg actcgagttc tacgtagata agtagcatgg cgggttaatc attaactaca     7260

aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg     7320

ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc     7380

gagcgcgcag ccttaattaa cctaaggaaa atgaagtgaa gttcctatac tttctagaga     7440

ataggaactt ctatagtgag tcgaataagg gcgacacaaa atttattcta aatgcataat     7500

aaatactgat aacatcttat agtttgtatt atattttgta ttatcgttga catgtataat     7560

tttgatatca aaaactgatt ttccctttat tattttcgag atttattttc ttaattctct     7620

ttaacaaact agaaatattg tatatacaaa aaatcataaa taatagatga atagtttaat     7680

tataggtgtt catcaatcga aaaagcaacg tatcttattt aaagtgcgtt gcttttttct     7740

catttataag gttaaataat tctcatatat caagcaaagt gacaggcgcc cttaaatatt     7800

ctgacaaatg ctctttccct aaactccccc cataaaaaaa cccgccgaag cgggttttta     7860

cgttatttgc ggattaacga ttactcgtta tcagaaccgc ccagggggcc cgagcttaac     7920

ctttttattt gggggagagg gaagtcatga aaaaactaac ctttgaaatt cgatctccag     7980

cacatcagca aaacgctatt cacgcagtac agcaaatcct tccagaccca accaaaccaa     8040

tcgtagtaac cattcaggaa cgcaaccgca gcttagacca aaacaggaag ctatgggcct     8100

gcttaggtga cgtctctcgt caggttgaat ggcatggtcg ctggctggat gcagaaagct     8160

ggaagtgtgt gtttaccgca gcattaaagc agcaggatgt tgttcctaac cttgccggga     8220

atggctttgt ggtaataggc cagtcaacca gcaggatgcg tgtaggcgaa tttgcggagc     8280

tattagagct tatacaggca ttcggtacag agcgtggcgt taagtggtca gacgaagcga     8340

gactggctct ggagtggaaa gcgagatggg gagacagggc tgcatgataa atgtcgttag     8400

tttctccggt ggcaggacgt cagcatattt gctctggcta atggagcaaa agcgacgggc     8460

aggtaaagac gtgcattacg ttttcatgga tacaggttgt gaacatccaa tgacatatcg     8520

gtttgtcagg gaagttgtga agttctggga tataccgctc accgtattgc aggttgatat     8580

caacccggag cttggacagc caaatggtta tacggtatgg gaaccaaagg atattcagac     8640

gcgaatgcct gttctgaagc catttatcga tatggtaaag aaatatggca ctccatacgt     8700

cggcggcgcg ttctgcactg acagattaaa actcgttccc ttcaccaaat actgtgatga     8760

ccatttcggg cgagggaatt acaccacgtg gattggcatc agagctgatg aaccgaagcg     8820

gctaaagcca aagcctggaa tcagatatct tgctgaactg tcagactttg agaaggaaga     8880

tatcctcgca tggtggaagc aacaaccatt cgatttgcaa ataccggaac atctcggtaa     8940

ctgcatattc tgcattaaaa aatcaacgca aaaaatcgga cttgcctgca aagatgagga     9000

gggattgcag cgtgttttta atgaggtcat cacgggatcc catgtgcgtg acggacatcg     9060

ggaaacgcca aaggagatta tgtaccgagg aagaatgtcg ctggacggta tcgcgaaaat     9120

gtattcagaa aatgattatc aagccctgta tcaggacatg gtacgagcta aaagattcga     9180

taccggctct tgttctgagt catgcgaaat atttggaggg cagcttgatt tcgacttcgg     9240

gagggaagct gcatgatgcg atgttatcgg tgcggtgaat gcaaagaaga taaccgcttc     9300

cgaccaaatc aaccttactg gaatcgatgg tgtctccggt gtgaaagaac accaacaggg     9360

gtgttaccac taccgcagga aaaggaggac gtgtggcgag acagcgacga agtatcaccg     9420

acataatctg cgaaaactgc aaataccttc caacgaaacg caccagaaat aaacccaagc     9480

caatcccaaa agaatctgac gtaaaaacct tcaactacac ggctcacctg tgggatatcc     9540

ggtggctaag acgtcgtgcg aggaaaacaa ggtgattgac caaaatcgaa gttacgaaca     9600

agaaagcgtc gagcgagctt taacgtgcgc taactgcggt cagaagctgc atgtgctgga     9660

agttcacgtg tgtgagcact gctgcgcaga actgatgagc gatccgaata gctcgatgca     9720

cgaggaagaa gatgatggct aaaccagcgc gaagacgatg taaaaacgat gaatgccggg     9780

aatggtttca ccctgcattc gctaatcagt ggtggtgctc tccagagtgt ggaaccaaga     9840

tagcactcga acgacgaagt aaagaacgcg aaaaagcgga aaaagcagca gagaagaaac     9900

gacgacgaga ggagcagaaa cagaaagata aacttaagat tcgaaaactc gccttaaagc     9960

cccgcagtta ctggattaaa caagcccaac aagccgtaaa cgccttcatc agagaaagag    10020

accgcgactt accatgtatc tcgtgcggaa cgctcacgtc tgctcagtgg gatgccggac    10080

attaccggac aactgctgcg gcacctcaac tccgatttaa tgaacgcaat attcacaagc    10140

aatgcgtggt gtgcaaccag cacaaaagcg gaaatctcgt tccgtatcgc gtcgaactga    10200

ttagccgcat cgggcaggaa gcagtagacg aaatcgaatc aaaccataac cgccatcgct    10260

ggactatcga agagtgcaag gcgatcaagg cagagtacca acagaaactc aaagacctgc    10320

gaaatagcag aagtgaggcc gcatgacgtt ctcagtaaaa accattccag acatgctcgt    10380

tgaagcatac ggaaatcaga cagaagtagc acgcagactg aaatgtagtc gcggtacggt    10440

cagaaaatac gttgatgata aagacgggaa aatgcacgcc atcgtcaacg acgttctcat    10500

ggttcatcgc ggatggagtg aaagagatgc gctattacga aaaaattgat ggcagcaaat    10560

accgaaatat ttgggtagtt ggcgatctgc acggatgcta cacgaacctg atgaacaaac    10620

tggatacgat tggattcgac aacaaaaaag acctgcttat ctcggtgggc gatttggttg    10680

atcgtggtgc agagaacgtt gaatgcctgg aattaatcac attcccctgg ttcagagctg    10740

tacgtggaaa ccatgagcaa atgatgattg atggcttatc agagcgtgga aacgttaatc    10800

actggctgct taatggcggt ggctggttct ttaatctcga ttacgacaaa gaaattctgg    10860

ctaaagctct tgcccataaa gcagatgaac ttccgttaat catcgaactg gtgagcaaag    10920

ataaaaaata tgttatctgc cacgccgatt atccctttga cgaatacgag tttggaaagc    10980

cagttgatca tcagcaggta atctggaacc gcgaacgaat cagcaactca caaaacggga    11040

tcgtgaaaga aatcaaaggc gcggacacgt tcatctttgg tcatacgcca gcagtgaaac    11100

cactcaagtt tgccaaccaa atgtatatcg ataccggcgc agtgttctgc ggaaacctaa    11160

cattgattca ggtacaggga gaaggcgcat gagactcgaa agcgtagcta aatttcattc    11220

gccaaaaagc ccgatgatga gcgactcacc acgggccacg gcttctgact ctctttccgg    11280

tactgatgtg atggctgcta tggggatggc gcaatcacaa gccggattcg gtatggctgc    11340

attctgcggt aagcacgaac tcagccagaa cgacaaacaa aaggctatca actatctgat    11400

gcaatttgca cacaaggtat cggggaaata ccgtggtgtg gcaaagcttg aaggaaatac    11460

taaggcaaag gtactgcaag tgctcgcaac attcgcttat gcggattatt gccgtagtgc    11520

cgcgacgccg ggggcaagat gcagagattg ccatggtaca ggccgtgcgg ttgatattgc    11580

caaaacagag ctgtggggga gagttgtcga gaaagagtgc ggaagatgca aaggcgtcgg    11640

ctattcaagg atgccagcaa gcgcagcata tcgcgctgtg acgatgctaa tcccaaacct    11700

tacccaaccc acctggtcac gcactgttaa gccgctgtat gacgctctgg tggtgcaatg    11760

ccacaaagaa gagtcaatcg cagacaacat tttgaatgcg gtcacacgtt agcagcatga    11820

ttgccacgga tggcaacata ttaacggcat gatattgact tattgaataa aattgggtaa    11880

atttgactca acgatgggtt aattcgctcg ttgtggtagt gagatgaaaa gaggcggcgc    11940

ttactaccga ttccgcctag ttggtcactt cgacgtatcg tctggaactc caaccatcgc    12000

aggcagagag gtctgcaaaa tgcaatcccg aaacagttcg caggtaatag ttagagcctg    12060

cataacggtt tcgggatttt ttatatctgc acaacaggta agagcattga gtcgataatc    12120

gtgaagagtc ggcgagcctg gttagccagt gctctttccg ttgtgctgaa ttaagcgaat    12180

accggaagca gaaccggatc accaaatgcg tacaggcgtc atcgccgccc agcaacagca    12240

caacccaaac tgagccgtag ccactgtctg tcctgaattc attagtaata gttacgctgc    12300

ggccttttac acatgacctt cgtgaaagcg ggtggcagga ggtcgcgcta acaacctcct    12360

gccgttttgc ccgtgcatat cggtcacgaa caaatctgat tactaaacac agtagcctgg    12420

atttgttcta tcagtaatcg accttattcc taattaaata gagcaaatcc ccttattggg    12480

ggtaagacat gaagatgcca gaaaaacatg acctgttggc cgccattctc gcggcaaagg    12540

aacaaggcat cggggcaatc cttgcgtttg caatggcgta ccttcgcggc agatataatg    12600

gcggtgcgtt tacaaaaaca gtaatcgacg caacgatgtg cgccattatc gcctggttca    12660

ttcgtgacct tctcgacttc gccggactaa gtagcaatct cgcttatata acgagcgtgt    12720

ttatcggcta catcggtact gactcgattg gttcgcttat caaacgcttc gctgctaaaa    12780

aagccggagt agaagatggt agaaatcaat aatcaacgta aggcgttcct cgatatgctg    12840

gcgtggtcgg agggaactga taacggacgt cagaaaacca gaaatcatgg ttatgacgtc    12900

attgtaggcg gagagctatt tactgattac tccgatcacc ctcgcaaact tgtcacgcta    12960

aacccaaaac tcaaatcaac aggcgcttaa gactggccgt cgttttacaa cacagaaaga    13020

gtttgtagaa acgcaaaaag gccatccgtc aggggccttc tgcttagttt gatgcctggc    13080

agttccctac tctcgccttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg    13140

ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg    13200

gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag    13260

gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga    13320

cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct    13380

ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc    13440

tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg    13500

gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc    13560

tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca    13620

ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag    13680

ttcttgaagt ggtgggctaa ctacggctac actagaagaa cagtatttgg tatctgcgct    13740

ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc    13800

accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga    13860

tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgacgcgcgc    13920

gtaactcacg ttaagggatt ttggtcatga gcttgcgccg tcccgtcaag tcagcgtaat    13980

gctctgcttt                                                           13990


<210>  10
<211>  12560
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAAV.EF1ac.copt.hPEX1


<220>
<221>  misc_feature
<222>  (1253)..(5960)
<223>  AAV expression cassette

<220>
<221>  misc_feature
<222>  (1253)..(1382)
<223>  5' ITR

<220>
<221>  promoter
<222>  (1463)..(1674)
<223>  EF1a core promoter

<220>
<221>  misc_feature
<222>  (1683)..(1691)
<223>  Kozak

<220>
<221>  misc_feature
<222>  (1692)..(5543)
<223>  Codon optimized hPEX1

<220>
<221>  polyA_signal
<222>  (5574)..(5781)
<223>  bGH Poly(A)

<220>
<221>  misc_feature
<222>  (5831)..(5960)
<223>  3' ITR

<400>  10
tagaaaaact catcgagcat caaatgaaac tgcaatttat tcatatcagg attatcaata       60

ccatattttt gaaaaagccg tttctgtaat gaaggagaaa actcaccgag gcagttccat      120

aggatggcaa gatcctggta tcggtctgcg attccgactc gtccaacatc aatacaacct      180

attaatttcc cctcgtcaaa aataaggtta tcaagtgaga aatcaccatg agtgacgact      240

gaatccggtg agaatggcaa aagtttatgc atttctttcc agacttgttc aacaggccag      300

ccattacgct cgtcatcaaa atcactcgca tcaaccaaac cgttattcat tcgtgattgc      360

gcctgagcga ggcgaaatac gcgatcgctg ttaaaaggac aattacaaac aggaatcgag      420

tgcaaccggc gcaggaacac tgccagcgca tcaacaatat tttcacctga atcaggatat      480

tcttctaata cctggaacgc tgtttttccg gggatcgcag tggtgagtaa ccatgcatca      540

tcaggagtac ggataaaatg cttgatggtc ggaagtggca taaattccgt cagccagttt      600

agtctgacca tctcatctgt aacatcattg gcaacgctac ctttgccatg tttcagaaac      660

aactctggcg catcgggctt cccatacaag cgatagattg tcgcacctga ttgcccgaca      720

ttatcgcgag cccatttata cccatataaa tcagcatcca tgttggaatt taatcgcggc      780

ctcgacgttt cccgttgaat atggctcata ttcttccttt ttcaatatta ttgaagcatt      840

tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa      900

ataggggtca gtgttacaac caattaacca attctgaaca ttatcgcgag cccatttata      960

cctgaatatg gctcataaca ccccttgttt gcctggcggc agtagcgcgg tggtcccacc     1020

tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg tggggactcc     1080

ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag tcgaaagact     1140

gggcctttcg cccgggctaa ttagggggtg tcgcccttat tcgactctat agtgaagttc     1200

ctattctcta gaaagtatag gaacttctga agtggggtcg acttaattaa ggctgcgcgc     1260

tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc     1320

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc     1380

cttgtagtta atgattaacc cgccatgcta cttatctacg tagcaagcta gcgagtggga     1440

attggctccg gtgcccgtca gtgggcagag cgcacatcgc ccacagtccc cgagaagttg     1500

gggggagggg tcggcaattg atccggtgcc tagagaaggt ggcgcggggt aaactgggaa     1560

agtgatgtcg tgtactggct ccgccttttt cccgagggtg ggggagaacc gtatataagt     1620

gcagtagtcg ccgtgaacgt tctttttcgc aacgggtttg ccgccagaac acaggcggcc     1680

gcgccgccac catgtgggga agcgacagac tggccggagc tggaggggga ggagcagccg     1740

tcaccgtggc gttcactaac gcgcgggact gctttctcca tctgccgcgg aggctggtcg     1800

cccagctgca cctcctgcag aaccaggcca tcgaggtggt gtggtcccac caaccggcct     1860

ttttgagctg ggtcgaggga aggcactttt cggaccaggg agaaaatgtg gcggagatca     1920

accgccaggt cggccagaag ctgggactgt ccaacggcgg acaggtgttc ctcaagccgt     1980

gcagccacgt ggtgtcctgc caacaggtgg aagtggagcc gctctccgcc gacgactggg     2040

agatcctcga attgcatgcc gtgagcctcg aacagcatct gttggaccag attcgcattg     2100

tgttcccgaa ggccatattc cccgtgtggg tcgatcagca gacctatatc ttcatccaga     2160

ttgtggccct catcccggcc gcctcatacg gacggctgga aactgacacc aagctgctga     2220

ttcaacctaa gacccggagg gccaaagaaa acaccttctc caaggccgac gctgagtaca     2280

agaagctcca ctcctacgga cgggaccaga aggggatgat gaaggagctg caaaccaagc     2340

agctccagag caacaccgtg gggatcaccg agtccaatga aaacgagtcg gaaatcccag     2400

tcgattcatc ttccgtggcc agcctgtgga ctatgatcgg ttccattttc tcgttccaat     2460

ctgagaagaa gcaggaaact agctgggggc tgactgagat caacgccttc aagaacatgc     2520

agtccaaagt ggtgcctctg gataacatct ttcgcgtgtg caagtcccaa ccgccctcaa     2580

tctacaacgc gtccgctacc tccgtgtttc ataagcactg tgccatccac gtgttcccat     2640

gggatcagga atacttcgat gtcgaacctt ccttcaccgt gacttacggg aagcttgtca     2700

agctcctcag ccccaagcag cagcaatcga aaactaagca gaacgtgctt tccccggaga     2760

aggagaagca aatgtcagaa ccactcgacc agaagaaaat cagatcggat cataacgaag     2820

aggacgagaa ggcctgcgtc cttcaggtgg tctggaacgg cctggaggag ctgaacaacg     2880

cgattaagta caccaagaac gtcgaggtcc ttcacctggg aaaggtgtgg attccggatg     2940

atctgaggaa acgcctcaac atcgaaatgc acgctgtggt gcggattacc ccggtcgagg     3000

tcaccccaaa gatccctcgc tccttgaagc tgcagccgcg agaaaacttg cccaaggaca     3060

tttctgaaga ggatatcaag actgtgttct actcctggct gcaacagagc actaccacca     3120

tgctccctct ggtcatttcg gaggaagaat tcatcaaact ggaaaccaag gacggactga     3180

aagaattctc cctgtccatc gtgcactcct gggaaaagga gaaggacaag aatatcttcc     3240

tgctgtcccc caatctgctg caaaagacca cgatccaggt gctgctcgac cccatggtga     3300

aggaggaaaa ctcagaagag atcgacttca tcctgccgtt ccttaagctg agttcactgg     3360

gaggcgtgaa ctcccttggc gtgtcctcgc tggagcacat cactcactca ctgctgggcc     3420

ggcctctgag cagacagctt atgagcttgg tcgccggact cagaaacggt gccctcctgc     3480

tcaccggcgg caagggatcg ggaaagtcca ccctcgctaa ggccatttgc aaagaggcat     3540

tcgataagct ggacgcccat gtggagcggg tggactgtaa ggccctccgc ggaaagcgat     3600

tggaaaatat tcaaaagact ctcgaagtcg ccttttccga agccgtctgg atgcagccct     3660

cggtcgtcct gctcgacgat ctggacctca tcgctgggct gccggccgtg ccggagcatg     3720

aacactcccc tgacgcggtc cagtcgcaac ggctcgccca cgccctgaac gatatgatta     3780

aggaattcat ctcaatggga tcactggtgg ccctgatcgc gacttcccag agccagcagt     3840

ccctgcaccc tctgctggtg tcggcccagg gcgtgcacat ttttcagtgt gtgcaacaca     3900

tccagccgcc caaccaggag cagcggtgcg aaatcctgtg caacgtgatt aagaacaagc     3960

tggactgcga tatcaacaag tttaccgacc ttgatctcca acatgtggct aaggagactg     4020

ggggcttcgt ggctcgggac ttcacagtgt tggtggaccg ggcaattcac tccagactgt     4080

cccgccagag catttccacc cgcgaaaaac tggtcctgac caccctcgac ttccagaagg     4140

ccctcagagg cttccttcct gcgagcctca gatccgtcaa ccttcacaag ccgcgggacc     4200

ttggctggga caagatcggt gggctccacg aggtgcggca gatcctcatg gacaccattc     4260

agctgcctgc aaagtacccc gagctgttcg ccaacttgcc gattcgccag cgcacgggaa     4320

tcctgctcta cggccccccg ggcaccggaa agaccctgct ggccggtgtg atcgcccggg     4380

aatcgaggat gaacttcatc tccgtgaagg gacccgaact cctgtccaag tacatcggtg     4440

cctccgaaca ggccgtgcgc gatatattca ttagggccca ggccgcgaag ccctgcattc     4500

tgttcttcga cgagtttgaa tcgatcgcgc cccggagggg ccacgacaac acgggagtga     4560

ccgaccgggt ggtgaaccag ctgctcaccc aactggatgg cgtggaaggc cttcagggag     4620

tgtacgtgct ggcggctacc tccagaccgg acctgatcga tccggccctg ctgcgccccg     4680

ggagactgga caagtgcgtg tattgccctc cccctgacca ggtgtcaagg ttggaaatcc     4740

tcaacgtgct ctcggactcc ctgccactgg cagatgatgt ggacctccag catgtggcct     4800

ccgtgactga cagcttcaca ggagccgatc tgaaggccct gctttacaac gcccagttgg     4860

aggcgctgca cggtatgctg ctgtcctccg gtctgcagga tggctcctcc tcttccgata     4920

gcgacctgtc gctgagcagc atggtgttcc tgaaccattc cagcggctcc gatgacagcg     4980

cgggcgacgg agaatgtgga ctggatcaat ccctggtgtc cctggagatg agcgagattc     5040

tgccagacga gtccaagttc aacatgtaca ggctgtactt cggcagcagc tacgagtccg     5100

agctgggaaa tggtacctcg tccgacctgt caagccagtg cctgtccgcg ccttcctcca     5160

tgacccagga cctccctgga gtgccaggga aggatcagct gttcagccag cctcccgtgc     5220

tgcgcactgc gagccaggaa gggtgccagg aattgaccca agagcagcgg gaccaactgc     5280

gcgcggacat ttcgatcatc aaaggcagat accgctccca atccggggag gacgaaagca     5340

tgaaccagcc cgggcctatc aagactagac tggcaatctc ccaaagccac ctgatgaccg     5400

cactgggaca cacccggccc tcgatctcgg aggacgactg gaagaacttc gctgagctgt     5460

acgaatcctt ccagaatccg aagcggagaa agaaccagag cggaactatg ttccggcccg     5520

gacagaaggt gaccctggcc tgaagtactg cggatcctgc agatctgcct cgactgtgcc     5580

ttctagttgc cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg     5640

tgccactccc actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag     5700

gtgtcattct attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga     5760

caatagcagg catgctgggg actcgagttc tacgtagata agtagcatgg cgggttaatc     5820

attaactaca aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg     5880

ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca     5940

gtgagcgagc gagcgcgcag ccttaattaa cctaaggaaa atgaagtgaa gttcctatac     6000

tttctagaga ataggaactt ctatagtgag tcgaataagg gcgacacaaa atttattcta     6060

aatgcataat aaatactgat aacatcttat agtttgtatt atattttgta ttatcgttga     6120

catgtataat tttgatatca aaaactgatt ttccctttat tattttcgag atttattttc     6180

ttaattctct ttaacaaact agaaatattg tatatacaaa aaatcataaa taatagatga     6240

atagtttaat tataggtgtt catcaatcga aaaagcaacg tatcttattt aaagtgcgtt     6300

gcttttttct catttataag gttaaataat tctcatatat caagcaaagt gacaggcgcc     6360

cttaaatatt ctgacaaatg ctctttccct aaactccccc cataaaaaaa cccgccgaag     6420

cgggttttta cgttatttgc ggattaacga ttactcgtta tcagaaccgc ccagggggcc     6480

cgagcttaac ctttttattt gggggagagg gaagtcatga aaaaactaac ctttgaaatt     6540

cgatctccag cacatcagca aaacgctatt cacgcagtac agcaaatcct tccagaccca     6600

accaaaccaa tcgtagtaac cattcaggaa cgcaaccgca gcttagacca aaacaggaag     6660

ctatgggcct gcttaggtga cgtctctcgt caggttgaat ggcatggtcg ctggctggat     6720

gcagaaagct ggaagtgtgt gtttaccgca gcattaaagc agcaggatgt tgttcctaac     6780

cttgccggga atggctttgt ggtaataggc cagtcaacca gcaggatgcg tgtaggcgaa     6840

tttgcggagc tattagagct tatacaggca ttcggtacag agcgtggcgt taagtggtca     6900

gacgaagcga gactggctct ggagtggaaa gcgagatggg gagacagggc tgcatgataa     6960

atgtcgttag tttctccggt ggcaggacgt cagcatattt gctctggcta atggagcaaa     7020

agcgacgggc aggtaaagac gtgcattacg ttttcatgga tacaggttgt gaacatccaa     7080

tgacatatcg gtttgtcagg gaagttgtga agttctggga tataccgctc accgtattgc     7140

aggttgatat caacccggag cttggacagc caaatggtta tacggtatgg gaaccaaagg     7200

atattcagac gcgaatgcct gttctgaagc catttatcga tatggtaaag aaatatggca     7260

ctccatacgt cggcggcgcg ttctgcactg acagattaaa actcgttccc ttcaccaaat     7320

actgtgatga ccatttcggg cgagggaatt acaccacgtg gattggcatc agagctgatg     7380

aaccgaagcg gctaaagcca aagcctggaa tcagatatct tgctgaactg tcagactttg     7440

agaaggaaga tatcctcgca tggtggaagc aacaaccatt cgatttgcaa ataccggaac     7500

atctcggtaa ctgcatattc tgcattaaaa aatcaacgca aaaaatcgga cttgcctgca     7560

aagatgagga gggattgcag cgtgttttta atgaggtcat cacgggatcc catgtgcgtg     7620

acggacatcg ggaaacgcca aaggagatta tgtaccgagg aagaatgtcg ctggacggta     7680

tcgcgaaaat gtattcagaa aatgattatc aagccctgta tcaggacatg gtacgagcta     7740

aaagattcga taccggctct tgttctgagt catgcgaaat atttggaggg cagcttgatt     7800

tcgacttcgg gagggaagct gcatgatgcg atgttatcgg tgcggtgaat gcaaagaaga     7860

taaccgcttc cgaccaaatc aaccttactg gaatcgatgg tgtctccggt gtgaaagaac     7920

accaacaggg gtgttaccac taccgcagga aaaggaggac gtgtggcgag acagcgacga     7980

agtatcaccg acataatctg cgaaaactgc aaataccttc caacgaaacg caccagaaat     8040

aaacccaagc caatcccaaa agaatctgac gtaaaaacct tcaactacac ggctcacctg     8100

tgggatatcc ggtggctaag acgtcgtgcg aggaaaacaa ggtgattgac caaaatcgaa     8160

gttacgaaca agaaagcgtc gagcgagctt taacgtgcgc taactgcggt cagaagctgc     8220

atgtgctgga agttcacgtg tgtgagcact gctgcgcaga actgatgagc gatccgaata     8280

gctcgatgca cgaggaagaa gatgatggct aaaccagcgc gaagacgatg taaaaacgat     8340

gaatgccggg aatggtttca ccctgcattc gctaatcagt ggtggtgctc tccagagtgt     8400

ggaaccaaga tagcactcga acgacgaagt aaagaacgcg aaaaagcgga aaaagcagca     8460

gagaagaaac gacgacgaga ggagcagaaa cagaaagata aacttaagat tcgaaaactc     8520

gccttaaagc cccgcagtta ctggattaaa caagcccaac aagccgtaaa cgccttcatc     8580

agagaaagag accgcgactt accatgtatc tcgtgcggaa cgctcacgtc tgctcagtgg     8640

gatgccggac attaccggac aactgctgcg gcacctcaac tccgatttaa tgaacgcaat     8700

attcacaagc aatgcgtggt gtgcaaccag cacaaaagcg gaaatctcgt tccgtatcgc     8760

gtcgaactga ttagccgcat cgggcaggaa gcagtagacg aaatcgaatc aaaccataac     8820

cgccatcgct ggactatcga agagtgcaag gcgatcaagg cagagtacca acagaaactc     8880

aaagacctgc gaaatagcag aagtgaggcc gcatgacgtt ctcagtaaaa accattccag     8940

acatgctcgt tgaagcatac ggaaatcaga cagaagtagc acgcagactg aaatgtagtc     9000

gcggtacggt cagaaaatac gttgatgata aagacgggaa aatgcacgcc atcgtcaacg     9060

acgttctcat ggttcatcgc ggatggagtg aaagagatgc gctattacga aaaaattgat     9120

ggcagcaaat accgaaatat ttgggtagtt ggcgatctgc acggatgcta cacgaacctg     9180

atgaacaaac tggatacgat tggattcgac aacaaaaaag acctgcttat ctcggtgggc     9240

gatttggttg atcgtggtgc agagaacgtt gaatgcctgg aattaatcac attcccctgg     9300

ttcagagctg tacgtggaaa ccatgagcaa atgatgattg atggcttatc agagcgtgga     9360

aacgttaatc actggctgct taatggcggt ggctggttct ttaatctcga ttacgacaaa     9420

gaaattctgg ctaaagctct tgcccataaa gcagatgaac ttccgttaat catcgaactg     9480

gtgagcaaag ataaaaaata tgttatctgc cacgccgatt atccctttga cgaatacgag     9540

tttggaaagc cagttgatca tcagcaggta atctggaacc gcgaacgaat cagcaactca     9600

caaaacggga tcgtgaaaga aatcaaaggc gcggacacgt tcatctttgg tcatacgcca     9660

gcagtgaaac cactcaagtt tgccaaccaa atgtatatcg ataccggcgc agtgttctgc     9720

ggaaacctaa cattgattca ggtacaggga gaaggcgcat gagactcgaa agcgtagcta     9780

aatttcattc gccaaaaagc ccgatgatga gcgactcacc acgggccacg gcttctgact     9840

ctctttccgg tactgatgtg atggctgcta tggggatggc gcaatcacaa gccggattcg     9900

gtatggctgc attctgcggt aagcacgaac tcagccagaa cgacaaacaa aaggctatca     9960

actatctgat gcaatttgca cacaaggtat cggggaaata ccgtggtgtg gcaaagcttg    10020

aaggaaatac taaggcaaag gtactgcaag tgctcgcaac attcgcttat gcggattatt    10080

gccgtagtgc cgcgacgccg ggggcaagat gcagagattg ccatggtaca ggccgtgcgg    10140

ttgatattgc caaaacagag ctgtggggga gagttgtcga gaaagagtgc ggaagatgca    10200

aaggcgtcgg ctattcaagg atgccagcaa gcgcagcata tcgcgctgtg acgatgctaa    10260

tcccaaacct tacccaaccc acctggtcac gcactgttaa gccgctgtat gacgctctgg    10320

tggtgcaatg ccacaaagaa gagtcaatcg cagacaacat tttgaatgcg gtcacacgtt    10380

agcagcatga ttgccacgga tggcaacata ttaacggcat gatattgact tattgaataa    10440

aattgggtaa atttgactca acgatgggtt aattcgctcg ttgtggtagt gagatgaaaa    10500

gaggcggcgc ttactaccga ttccgcctag ttggtcactt cgacgtatcg tctggaactc    10560

caaccatcgc aggcagagag gtctgcaaaa tgcaatcccg aaacagttcg caggtaatag    10620

ttagagcctg cataacggtt tcgggatttt ttatatctgc acaacaggta agagcattga    10680

gtcgataatc gtgaagagtc ggcgagcctg gttagccagt gctctttccg ttgtgctgaa    10740

ttaagcgaat accggaagca gaaccggatc accaaatgcg tacaggcgtc atcgccgccc    10800

agcaacagca caacccaaac tgagccgtag ccactgtctg tcctgaattc attagtaata    10860

gttacgctgc ggccttttac acatgacctt cgtgaaagcg ggtggcagga ggtcgcgcta    10920

acaacctcct gccgttttgc ccgtgcatat cggtcacgaa caaatctgat tactaaacac    10980

agtagcctgg atttgttcta tcagtaatcg accttattcc taattaaata gagcaaatcc    11040

ccttattggg ggtaagacat gaagatgcca gaaaaacatg acctgttggc cgccattctc    11100

gcggcaaagg aacaaggcat cggggcaatc cttgcgtttg caatggcgta ccttcgcggc    11160

agatataatg gcggtgcgtt tacaaaaaca gtaatcgacg caacgatgtg cgccattatc    11220

gcctggttca ttcgtgacct tctcgacttc gccggactaa gtagcaatct cgcttatata    11280

acgagcgtgt ttatcggcta catcggtact gactcgattg gttcgcttat caaacgcttc    11340

gctgctaaaa aagccggagt agaagatggt agaaatcaat aatcaacgta aggcgttcct    11400

cgatatgctg gcgtggtcgg agggaactga taacggacgt cagaaaacca gaaatcatgg    11460

ttatgacgtc attgtaggcg gagagctatt tactgattac tccgatcacc ctcgcaaact    11520

tgtcacgcta aacccaaaac tcaaatcaac aggcgcttaa gactggccgt cgttttacaa    11580

cacagaaaga gtttgtagaa acgcaaaaag gccatccgtc aggggccttc tgcttagttt    11640

gatgcctggc agttccctac tctcgccttc cgcttcctcg ctcactgact cgctgcgctc    11700

ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac    11760

agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa    11820

ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca    11880

caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc    11940

gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata    12000

cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta    12060

tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca    12120

gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga    12180

cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg    12240

tgctacagag ttcttgaagt ggtgggctaa ctacggctac actagaagaa cagtatttgg    12300

tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg    12360

caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag    12420

aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa    12480

cgacgcgcgc gtaactcacg ttaagggatt ttggtcatga gcttgcgccg tcccgtcaag    12540

tcagcgtaat gctctgcttt                                                12560


<210>  11
<211>  12796
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAAV.GRK1.copt.hPEX1


<220>
<221>  misc_feature
<222>  (1253)..(6196)
<223>  AAV expression cassette

<220>
<221>  misc_feature
<222>  (1253)..(1382)
<223>  5' ITR

<220>
<221>  promoter
<222>  (1427)..(1790)
<223>  GRK1 promoter

<220>
<221>  Intron
<222>  (1791)..(1887)
<223>  SV40 intron

<220>
<221>  misc_feature
<222>  (1940)..(1948)
<223>  Kozak

<220>
<221>  misc_feature
<222>  (1949)..(5800)
<223>  codon optimized hPEX1

<220>
<221>  polyA_signal
<222>  (5822)..(5933)
<223>  bGH Poly(A)

<220>
<221>  misc_feature
<222>  (6067)..(6196)
<223>  3' ITR

<400>  11
tagaaaaact catcgagcat caaatgaaac tgcaatttat tcatatcagg attatcaata       60

ccatattttt gaaaaagccg tttctgtaat gaaggagaaa actcaccgag gcagttccat      120

aggatggcaa gatcctggta tcggtctgcg attccgactc gtccaacatc aatacaacct      180

attaatttcc cctcgtcaaa aataaggtta tcaagtgaga aatcaccatg agtgacgact      240

gaatccggtg agaatggcaa aagtttatgc atttctttcc agacttgttc aacaggccag      300

ccattacgct cgtcatcaaa atcactcgca tcaaccaaac cgttattcat tcgtgattgc      360

gcctgagcga ggcgaaatac gcgatcgctg ttaaaaggac aattacaaac aggaatcgag      420

tgcaaccggc gcaggaacac tgccagcgca tcaacaatat tttcacctga atcaggatat      480

tcttctaata cctggaacgc tgtttttccg gggatcgcag tggtgagtaa ccatgcatca      540

tcaggagtac ggataaaatg cttgatggtc ggaagtggca taaattccgt cagccagttt      600

agtctgacca tctcatctgt aacatcattg gcaacgctac ctttgccatg tttcagaaac      660

aactctggcg catcgggctt cccatacaag cgatagattg tcgcacctga ttgcccgaca      720

ttatcgcgag cccatttata cccatataaa tcagcatcca tgttggaatt taatcgcggc      780

ctcgacgttt cccgttgaat atggctcata ttcttccttt ttcaatatta ttgaagcatt      840

tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa      900

ataggggtca gtgttacaac caattaacca attctgaaca ttatcgcgag cccatttata      960

cctgaatatg gctcataaca ccccttgttt gcctggcggc agtagcgcgg tggtcccacc     1020

tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg tggggactcc     1080

ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag tcgaaagact     1140

gggcctttcg cccgggctaa ttagggggtg tcgcccttat tcgactctat agtgaagttc     1200

ctattctcta gaaagtatag gaacttctga agtggggtcg acttaattaa ggctgcgcgc     1260

tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc     1320

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc     1380

cttgtagtta atgattaacc cgccatgcta cttatctacg tagcaagcta gcaagatcca     1440

agctcagatc tcgatcgagt tgggccccag aagcctggtg gttgtttgtc cttctcaggg     1500

gaaaagtgag gcggcccctt ggaggaaggg gccgggcaga atgatctaat cggattccaa     1560

gcagctcagg ggattgtctt tttctagcac cttcttgcca ctcctaagcg tcctccgtga     1620

ccccggctgg gatttagcct ggtgctgtgt cagccccggt ctcccagggg cttcccagtg     1680

gtccccagga accctcgaca gggcccggtc tctctcgtcc agcaagggca gggacgggcc     1740

acaggccaag ggccctcgat cgaggaactg aaaaaccaga aagttaactg gtaagtttag     1800

tctttttgtc ttttatttca ggtcccggat ccggtggtgg tgcaaatcaa agaactgctc     1860

ctcagtggat gttgccttta cttctaggcc tgtacggaag tgttacttct gctctaaaag     1920

ctgcggaatt gtacccgcgg ccgccaccat gtggggaagc gacagactgg ccggagctgg     1980

agggggagga gcagccgtca ccgtggcgtt cactaacgcg cgggactgct ttctccatct     2040

gccgcggagg ctggtcgccc agctgcacct cctgcagaac caggccatcg aggtggtgtg     2100

gtcccaccaa ccggcctttt tgagctgggt cgagggaagg cacttttcgg accagggaga     2160

aaatgtggcg gagatcaacc gccaggtcgg ccagaagctg ggactgtcca acggcggaca     2220

ggtgttcctc aagccgtgca gccacgtggt gtcctgccaa caggtggaag tggagccgct     2280

ctccgccgac gactgggaga tcctcgaatt gcatgccgtg agcctcgaac agcatctgtt     2340

ggaccagatt cgcattgtgt tcccgaaggc catattcccc gtgtgggtcg atcagcagac     2400

ctatatcttc atccagattg tggccctcat cccggccgcc tcatacggac ggctggaaac     2460

tgacaccaag ctgctgattc aacctaagac ccggagggcc aaagaaaaca ccttctccaa     2520

ggccgacgct gagtacaaga agctccactc ctacggacgg gaccagaagg ggatgatgaa     2580

ggagctgcaa accaagcagc tccagagcaa caccgtgggg atcaccgagt ccaatgaaaa     2640

cgagtcggaa atcccagtcg attcatcttc cgtggccagc ctgtggacta tgatcggttc     2700

cattttctcg ttccaatctg agaagaagca ggaaactagc tgggggctga ctgagatcaa     2760

cgccttcaag aacatgcagt ccaaagtggt gcctctggat aacatctttc gcgtgtgcaa     2820

gtcccaaccg ccctcaatct acaacgcgtc cgctacctcc gtgtttcata agcactgtgc     2880

catccacgtg ttcccatggg atcaggaata cttcgatgtc gaaccttcct tcaccgtgac     2940

ttacgggaag cttgtcaagc tcctcagccc caagcagcag caatcgaaaa ctaagcagaa     3000

cgtgctttcc ccggagaagg agaagcaaat gtcagaacca ctcgaccaga agaaaatcag     3060

atcggatcat aacgaagagg acgagaaggc ctgcgtcctt caggtggtct ggaacggcct     3120

ggaggagctg aacaacgcga ttaagtacac caagaacgtc gaggtccttc acctgggaaa     3180

ggtgtggatt ccggatgatc tgaggaaacg cctcaacatc gaaatgcacg ctgtggtgcg     3240

gattaccccg gtcgaggtca ccccaaagat ccctcgctcc ttgaagctgc agccgcgaga     3300

aaacttgccc aaggacattt ctgaagagga tatcaagact gtgttctact cctggctgca     3360

acagagcact accaccatgc tccctctggt catttcggag gaagaattca tcaaactgga     3420

aaccaaggac ggactgaaag aattctccct gtccatcgtg cactcctggg aaaaggagaa     3480

ggacaagaat atcttcctgc tgtcccccaa tctgctgcaa aagaccacga tccaggtgct     3540

gctcgacccc atggtgaagg aggaaaactc agaagagatc gacttcatcc tgccgttcct     3600

taagctgagt tcactgggag gcgtgaactc ccttggcgtg tcctcgctgg agcacatcac     3660

tcactcactg ctgggccggc ctctgagcag acagcttatg agcttggtcg ccggactcag     3720

aaacggtgcc ctcctgctca ccggcggcaa gggatcggga aagtccaccc tcgctaaggc     3780

catttgcaaa gaggcattcg ataagctgga cgcccatgtg gagcgggtgg actgtaaggc     3840

cctccgcgga aagcgattgg aaaatattca aaagactctc gaagtcgcct tttccgaagc     3900

cgtctggatg cagccctcgg tcgtcctgct cgacgatctg gacctcatcg ctgggctgcc     3960

ggccgtgccg gagcatgaac actcccctga cgcggtccag tcgcaacggc tcgcccacgc     4020

cctgaacgat atgattaagg aattcatctc aatgggatca ctggtggccc tgatcgcgac     4080

ttcccagagc cagcagtccc tgcaccctct gctggtgtcg gcccagggcg tgcacatttt     4140

tcagtgtgtg caacacatcc agccgcccaa ccaggagcag cggtgcgaaa tcctgtgcaa     4200

cgtgattaag aacaagctgg actgcgatat caacaagttt accgaccttg atctccaaca     4260

tgtggctaag gagactgggg gcttcgtggc tcgggacttc acagtgttgg tggaccgggc     4320

aattcactcc agactgtccc gccagagcat ttccacccgc gaaaaactgg tcctgaccac     4380

cctcgacttc cagaaggccc tcagaggctt ccttcctgcg agcctcagat ccgtcaacct     4440

tcacaagccg cgggaccttg gctgggacaa gatcggtggg ctccacgagg tgcggcagat     4500

cctcatggac accattcagc tgcctgcaaa gtaccccgag ctgttcgcca acttgccgat     4560

tcgccagcgc acgggaatcc tgctctacgg ccccccgggc accggaaaga ccctgctggc     4620

cggtgtgatc gcccgggaat cgaggatgaa cttcatctcc gtgaagggac ccgaactcct     4680

gtccaagtac atcggtgcct ccgaacaggc cgtgcgcgat atattcatta gggcccaggc     4740

cgcgaagccc tgcattctgt tcttcgacga gtttgaatcg atcgcgcccc ggaggggcca     4800

cgacaacacg ggagtgaccg accgggtggt gaaccagctg ctcacccaac tggatggcgt     4860

ggaaggcctt cagggagtgt acgtgctggc ggctacctcc agaccggacc tgatcgatcc     4920

ggccctgctg cgccccggga gactggacaa gtgcgtgtat tgccctcccc ctgaccaggt     4980

gtcaaggttg gaaatcctca acgtgctctc ggactccctg ccactggcag atgatgtgga     5040

cctccagcat gtggcctccg tgactgacag cttcacagga gccgatctga aggccctgct     5100

ttacaacgcc cagttggagg cgctgcacgg tatgctgctg tcctccggtc tgcaggatgg     5160

ctcctcctct tccgatagcg acctgtcgct gagcagcatg gtgttcctga accattccag     5220

cggctccgat gacagcgcgg gcgacggaga atgtggactg gatcaatccc tggtgtccct     5280

ggagatgagc gagattctgc cagacgagtc caagttcaac atgtacaggc tgtacttcgg     5340

cagcagctac gagtccgagc tgggaaatgg tacctcgtcc gacctgtcaa gccagtgcct     5400

gtccgcgcct tcctccatga cccaggacct ccctggagtg ccagggaagg atcagctgtt     5460

cagccagcct cccgtgctgc gcactgcgag ccaggaaggg tgccaggaat tgacccaaga     5520

gcagcgggac caactgcgcg cggacatttc gatcatcaaa ggcagatacc gctcccaatc     5580

cggggaggac gaaagcatga accagcccgg gcctatcaag actagactgg caatctccca     5640

aagccacctg atgaccgcac tgggacacac ccggccctcg atctcggagg acgactggaa     5700

gaacttcgct gagctgtacg aatccttcca gaatccgaag cggagaaaga accagagcgg     5760

aactatgttc cggcccggac agaaggtgac cctggcctga tgtacaagta ataagcctcg     5820

actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc     5880

ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt     5940

ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat     6000

tgggaagaca atagcaggtc gagttctacg tagataagta gcatggcggg ttaatcatta     6060

actacaagga acccctagtg atggagttgg ccactccctc tctgcgcgct cgctcgctca     6120

ctgaggccgg gcgaccaaag gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga     6180

gcgagcgagc gcgcagcctt aattaaccta aggaaaatga agtgaagttc ctatactttc     6240

tagagaatag gaacttctat agtgagtcga ataagggcga cacaaaattt attctaaatg     6300

cataataaat actgataaca tcttatagtt tgtattatat tttgtattat cgttgacatg     6360

tataattttg atatcaaaaa ctgattttcc ctttattatt ttcgagattt attttcttaa     6420

ttctctttaa caaactagaa atattgtata tacaaaaaat cataaataat agatgaatag     6480

tttaattata ggtgttcatc aatcgaaaaa gcaacgtatc ttatttaaag tgcgttgctt     6540

ttttctcatt tataaggtta aataattctc atatatcaag caaagtgaca ggcgccctta     6600

aatattctga caaatgctct ttccctaaac tccccccata aaaaaacccg ccgaagcggg     6660

tttttacgtt atttgcggat taacgattac tcgttatcag aaccgcccag ggggcccgag     6720

cttaaccttt ttatttgggg gagagggaag tcatgaaaaa actaaccttt gaaattcgat     6780

ctccagcaca tcagcaaaac gctattcacg cagtacagca aatccttcca gacccaacca     6840

aaccaatcgt agtaaccatt caggaacgca accgcagctt agaccaaaac aggaagctat     6900

gggcctgctt aggtgacgtc tctcgtcagg ttgaatggca tggtcgctgg ctggatgcag     6960

aaagctggaa gtgtgtgttt accgcagcat taaagcagca ggatgttgtt cctaaccttg     7020

ccgggaatgg ctttgtggta ataggccagt caaccagcag gatgcgtgta ggcgaatttg     7080

cggagctatt agagcttata caggcattcg gtacagagcg tggcgttaag tggtcagacg     7140

aagcgagact ggctctggag tggaaagcga gatggggaga cagggctgca tgataaatgt     7200

cgttagtttc tccggtggca ggacgtcagc atatttgctc tggctaatgg agcaaaagcg     7260

acgggcaggt aaagacgtgc attacgtttt catggataca ggttgtgaac atccaatgac     7320

atatcggttt gtcagggaag ttgtgaagtt ctgggatata ccgctcaccg tattgcaggt     7380

tgatatcaac ccggagcttg gacagccaaa tggttatacg gtatgggaac caaaggatat     7440

tcagacgcga atgcctgttc tgaagccatt tatcgatatg gtaaagaaat atggcactcc     7500

atacgtcggc ggcgcgttct gcactgacag attaaaactc gttcccttca ccaaatactg     7560

tgatgaccat ttcgggcgag ggaattacac cacgtggatt ggcatcagag ctgatgaacc     7620

gaagcggcta aagccaaagc ctggaatcag atatcttgct gaactgtcag actttgagaa     7680

ggaagatatc ctcgcatggt ggaagcaaca accattcgat ttgcaaatac cggaacatct     7740

cggtaactgc atattctgca ttaaaaaatc aacgcaaaaa atcggacttg cctgcaaaga     7800

tgaggaggga ttgcagcgtg tttttaatga ggtcatcacg ggatcccatg tgcgtgacgg     7860

acatcgggaa acgccaaagg agattatgta ccgaggaaga atgtcgctgg acggtatcgc     7920

gaaaatgtat tcagaaaatg attatcaagc cctgtatcag gacatggtac gagctaaaag     7980

attcgatacc ggctcttgtt ctgagtcatg cgaaatattt ggagggcagc ttgatttcga     8040

cttcgggagg gaagctgcat gatgcgatgt tatcggtgcg gtgaatgcaa agaagataac     8100

cgcttccgac caaatcaacc ttactggaat cgatggtgtc tccggtgtga aagaacacca     8160

acaggggtgt taccactacc gcaggaaaag gaggacgtgt ggcgagacag cgacgaagta     8220

tcaccgacat aatctgcgaa aactgcaaat accttccaac gaaacgcacc agaaataaac     8280

ccaagccaat cccaaaagaa tctgacgtaa aaaccttcaa ctacacggct cacctgtggg     8340

atatccggtg gctaagacgt cgtgcgagga aaacaaggtg attgaccaaa atcgaagtta     8400

cgaacaagaa agcgtcgagc gagctttaac gtgcgctaac tgcggtcaga agctgcatgt     8460

gctggaagtt cacgtgtgtg agcactgctg cgcagaactg atgagcgatc cgaatagctc     8520

gatgcacgag gaagaagatg atggctaaac cagcgcgaag acgatgtaaa aacgatgaat     8580

gccgggaatg gtttcaccct gcattcgcta atcagtggtg gtgctctcca gagtgtggaa     8640

ccaagatagc actcgaacga cgaagtaaag aacgcgaaaa agcggaaaaa gcagcagaga     8700

agaaacgacg acgagaggag cagaaacaga aagataaact taagattcga aaactcgcct     8760

taaagccccg cagttactgg attaaacaag cccaacaagc cgtaaacgcc ttcatcagag     8820

aaagagaccg cgacttacca tgtatctcgt gcggaacgct cacgtctgct cagtgggatg     8880

ccggacatta ccggacaact gctgcggcac ctcaactccg atttaatgaa cgcaatattc     8940

acaagcaatg cgtggtgtgc aaccagcaca aaagcggaaa tctcgttccg tatcgcgtcg     9000

aactgattag ccgcatcggg caggaagcag tagacgaaat cgaatcaaac cataaccgcc     9060

atcgctggac tatcgaagag tgcaaggcga tcaaggcaga gtaccaacag aaactcaaag     9120

acctgcgaaa tagcagaagt gaggccgcat gacgttctca gtaaaaacca ttccagacat     9180

gctcgttgaa gcatacggaa atcagacaga agtagcacgc agactgaaat gtagtcgcgg     9240

tacggtcaga aaatacgttg atgataaaga cgggaaaatg cacgccatcg tcaacgacgt     9300

tctcatggtt catcgcggat ggagtgaaag agatgcgcta ttacgaaaaa attgatggca     9360

gcaaataccg aaatatttgg gtagttggcg atctgcacgg atgctacacg aacctgatga     9420

acaaactgga tacgattgga ttcgacaaca aaaaagacct gcttatctcg gtgggcgatt     9480

tggttgatcg tggtgcagag aacgttgaat gcctggaatt aatcacattc ccctggttca     9540

gagctgtacg tggaaaccat gagcaaatga tgattgatgg cttatcagag cgtggaaacg     9600

ttaatcactg gctgcttaat ggcggtggct ggttctttaa tctcgattac gacaaagaaa     9660

ttctggctaa agctcttgcc cataaagcag atgaacttcc gttaatcatc gaactggtga     9720

gcaaagataa aaaatatgtt atctgccacg ccgattatcc ctttgacgaa tacgagtttg     9780

gaaagccagt tgatcatcag caggtaatct ggaaccgcga acgaatcagc aactcacaaa     9840

acgggatcgt gaaagaaatc aaaggcgcgg acacgttcat ctttggtcat acgccagcag     9900

tgaaaccact caagtttgcc aaccaaatgt atatcgatac cggcgcagtg ttctgcggaa     9960

acctaacatt gattcaggta cagggagaag gcgcatgaga ctcgaaagcg tagctaaatt    10020

tcattcgcca aaaagcccga tgatgagcga ctcaccacgg gccacggctt ctgactctct    10080

ttccggtact gatgtgatgg ctgctatggg gatggcgcaa tcacaagccg gattcggtat    10140

ggctgcattc tgcggtaagc acgaactcag ccagaacgac aaacaaaagg ctatcaacta    10200

tctgatgcaa tttgcacaca aggtatcggg gaaataccgt ggtgtggcaa agcttgaagg    10260

aaatactaag gcaaaggtac tgcaagtgct cgcaacattc gcttatgcgg attattgccg    10320

tagtgccgcg acgccggggg caagatgcag agattgccat ggtacaggcc gtgcggttga    10380

tattgccaaa acagagctgt gggggagagt tgtcgagaaa gagtgcggaa gatgcaaagg    10440

cgtcggctat tcaaggatgc cagcaagcgc agcatatcgc gctgtgacga tgctaatccc    10500

aaaccttacc caacccacct ggtcacgcac tgttaagccg ctgtatgacg ctctggtggt    10560

gcaatgccac aaagaagagt caatcgcaga caacattttg aatgcggtca cacgttagca    10620

gcatgattgc cacggatggc aacatattaa cggcatgata ttgacttatt gaataaaatt    10680

gggtaaattt gactcaacga tgggttaatt cgctcgttgt ggtagtgaga tgaaaagagg    10740

cggcgcttac taccgattcc gcctagttgg tcacttcgac gtatcgtctg gaactccaac    10800

catcgcaggc agagaggtct gcaaaatgca atcccgaaac agttcgcagg taatagttag    10860

agcctgcata acggtttcgg gattttttat atctgcacaa caggtaagag cattgagtcg    10920

ataatcgtga agagtcggcg agcctggtta gccagtgctc tttccgttgt gctgaattaa    10980

gcgaataccg gaagcagaac cggatcacca aatgcgtaca ggcgtcatcg ccgcccagca    11040

acagcacaac ccaaactgag ccgtagccac tgtctgtcct gaattcatta gtaatagtta    11100

cgctgcggcc ttttacacat gaccttcgtg aaagcgggtg gcaggaggtc gcgctaacaa    11160

cctcctgccg ttttgcccgt gcatatcggt cacgaacaaa tctgattact aaacacagta    11220

gcctggattt gttctatcag taatcgacct tattcctaat taaatagagc aaatcccctt    11280

attgggggta agacatgaag atgccagaaa aacatgacct gttggccgcc attctcgcgg    11340

caaaggaaca aggcatcggg gcaatccttg cgtttgcaat ggcgtacctt cgcggcagat    11400

ataatggcgg tgcgtttaca aaaacagtaa tcgacgcaac gatgtgcgcc attatcgcct    11460

ggttcattcg tgaccttctc gacttcgccg gactaagtag caatctcgct tatataacga    11520

gcgtgtttat cggctacatc ggtactgact cgattggttc gcttatcaaa cgcttcgctg    11580

ctaaaaaagc cggagtagaa gatggtagaa atcaataatc aacgtaaggc gttcctcgat    11640

atgctggcgt ggtcggaggg aactgataac ggacgtcaga aaaccagaaa tcatggttat    11700

gacgtcattg taggcggaga gctatttact gattactccg atcaccctcg caaacttgtc    11760

acgctaaacc caaaactcaa atcaacaggc gcttaagact ggccgtcgtt ttacaacaca    11820

gaaagagttt gtagaaacgc aaaaaggcca tccgtcaggg gccttctgct tagtttgatg    11880

cctggcagtt ccctactctc gccttccgct tcctcgctca ctgactcgct gcgctcggtc    11940

gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa    12000

tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt    12060

aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa    12120

aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt    12180

ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg    12240

tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc    12300

agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc    12360

gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta    12420

tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct    12480

acagagttct tgaagtggtg ggctaactac ggctacacta gaagaacagt atttggtatc    12540

tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa    12600

caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa    12660

aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgac    12720

gcgcgcgtaa ctcacgttaa gggattttgg tcatgagctt gcgccgtccc gtcaagtcag    12780

cgtaatgctc tgcttt                                                    12796


<210>  12
<211>  12551
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAAV.MECP2.copt.hPEX1


<220>
<221>  misc_feature
<222>  (1253)..(5951)
<223>  AAV expression cassette

<220>
<221>  misc_feature
<222>  (1253)..(1382)
<223>  5' ITR

<220>
<221>  promoter
<222>  (1437)..(1665)
<223>  MECP2 promoter

<220>
<221>  misc_feature
<222>  (1674)..(1682)
<223>  Kozak

<220>
<221>  misc_feature
<222>  (1683)..(5534)
<223>  Codon optimized hPEX1

<220>
<221>  polyA_signal
<222>  (5565)..(5772)
<223>  bGH Poly(A)

<220>
<221>  misc_feature
<222>  (5822)..(5951)
<223>  3' ITR

<400>  12
tagaaaaact catcgagcat caaatgaaac tgcaatttat tcatatcagg attatcaata       60

ccatattttt gaaaaagccg tttctgtaat gaaggagaaa actcaccgag gcagttccat      120

aggatggcaa gatcctggta tcggtctgcg attccgactc gtccaacatc aatacaacct      180

attaatttcc cctcgtcaaa aataaggtta tcaagtgaga aatcaccatg agtgacgact      240

gaatccggtg agaatggcaa aagtttatgc atttctttcc agacttgttc aacaggccag      300

ccattacgct cgtcatcaaa atcactcgca tcaaccaaac cgttattcat tcgtgattgc      360

gcctgagcga ggcgaaatac gcgatcgctg ttaaaaggac aattacaaac aggaatcgag      420

tgcaaccggc gcaggaacac tgccagcgca tcaacaatat tttcacctga atcaggatat      480

tcttctaata cctggaacgc tgtttttccg gggatcgcag tggtgagtaa ccatgcatca      540

tcaggagtac ggataaaatg cttgatggtc ggaagtggca taaattccgt cagccagttt      600

agtctgacca tctcatctgt aacatcattg gcaacgctac ctttgccatg tttcagaaac      660

aactctggcg catcgggctt cccatacaag cgatagattg tcgcacctga ttgcccgaca      720

ttatcgcgag cccatttata cccatataaa tcagcatcca tgttggaatt taatcgcggc      780

ctcgacgttt cccgttgaat atggctcata ttcttccttt ttcaatatta ttgaagcatt      840

tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa      900

ataggggtca gtgttacaac caattaacca attctgaaca ttatcgcgag cccatttata      960

cctgaatatg gctcataaca ccccttgttt gcctggcggc agtagcgcgg tggtcccacc     1020

tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg tggggactcc     1080

ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag tcgaaagact     1140

gggcctttcg cccgggctaa ttagggggtg tcgcccttat tcgactctat agtgaagttc     1200

ctattctcta gaaagtatag gaacttctga agtggggtcg acttaattaa ggctgcgcgc     1260

tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc     1320

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc     1380

cttgtagtta atgattaacc cgccatgcta cttatctacg tagcaagcta gcgcttagct     1440

gaatggggtc cgcctctttt ccctgcctaa acagacagga actcctgcca attgagggcg     1500

tcaccgctaa ggctccgccc cagcctgggc tccacaacca atgaagggta atctcgacaa     1560

agagcaaggg gtggggcgcg ggcgcgcagg tgcagcagca cacaggctgg tcgggagggc     1620

ggggcgcgac gtctgccgtg cggggtcccg gcatcggttg cgcgcgcggc cgcgccgcca     1680

ccatgtgggg aagcgacaga ctggccggag ctggaggggg aggagcagcc gtcaccgtgg     1740

cgttcactaa cgcgcgggac tgctttctcc atctgccgcg gaggctggtc gcccagctgc     1800

acctcctgca gaaccaggcc atcgaggtgg tgtggtccca ccaaccggcc tttttgagct     1860

gggtcgaggg aaggcacttt tcggaccagg gagaaaatgt ggcggagatc aaccgccagg     1920

tcggccagaa gctgggactg tccaacggcg gacaggtgtt cctcaagccg tgcagccacg     1980

tggtgtcctg ccaacaggtg gaagtggagc cgctctccgc cgacgactgg gagatcctcg     2040

aattgcatgc cgtgagcctc gaacagcatc tgttggacca gattcgcatt gtgttcccga     2100

aggccatatt ccccgtgtgg gtcgatcagc agacctatat cttcatccag attgtggccc     2160

tcatcccggc cgcctcatac ggacggctgg aaactgacac caagctgctg attcaaccta     2220

agacccggag ggccaaagaa aacaccttct ccaaggccga cgctgagtac aagaagctcc     2280

actcctacgg acgggaccag aaggggatga tgaaggagct gcaaaccaag cagctccaga     2340

gcaacaccgt ggggatcacc gagtccaatg aaaacgagtc ggaaatccca gtcgattcat     2400

cttccgtggc cagcctgtgg actatgatcg gttccatttt ctcgttccaa tctgagaaga     2460

agcaggaaac tagctggggg ctgactgaga tcaacgcctt caagaacatg cagtccaaag     2520

tggtgcctct ggataacatc tttcgcgtgt gcaagtccca accgccctca atctacaacg     2580

cgtccgctac ctccgtgttt cataagcact gtgccatcca cgtgttccca tgggatcagg     2640

aatacttcga tgtcgaacct tccttcaccg tgacttacgg gaagcttgtc aagctcctca     2700

gccccaagca gcagcaatcg aaaactaagc agaacgtgct ttccccggag aaggagaagc     2760

aaatgtcaga accactcgac cagaagaaaa tcagatcgga tcataacgaa gaggacgaga     2820

aggcctgcgt ccttcaggtg gtctggaacg gcctggagga gctgaacaac gcgattaagt     2880

acaccaagaa cgtcgaggtc cttcacctgg gaaaggtgtg gattccggat gatctgagga     2940

aacgcctcaa catcgaaatg cacgctgtgg tgcggattac cccggtcgag gtcaccccaa     3000

agatccctcg ctccttgaag ctgcagccgc gagaaaactt gcccaaggac atttctgaag     3060

aggatatcaa gactgtgttc tactcctggc tgcaacagag cactaccacc atgctccctc     3120

tggtcatttc ggaggaagaa ttcatcaaac tggaaaccaa ggacggactg aaagaattct     3180

ccctgtccat cgtgcactcc tgggaaaagg agaaggacaa gaatatcttc ctgctgtccc     3240

ccaatctgct gcaaaagacc acgatccagg tgctgctcga ccccatggtg aaggaggaaa     3300

actcagaaga gatcgacttc atcctgccgt tccttaagct gagttcactg ggaggcgtga     3360

actcccttgg cgtgtcctcg ctggagcaca tcactcactc actgctgggc cggcctctga     3420

gcagacagct tatgagcttg gtcgccggac tcagaaacgg tgccctcctg ctcaccggcg     3480

gcaagggatc gggaaagtcc accctcgcta aggccatttg caaagaggca ttcgataagc     3540

tggacgccca tgtggagcgg gtggactgta aggccctccg cggaaagcga ttggaaaata     3600

ttcaaaagac tctcgaagtc gccttttccg aagccgtctg gatgcagccc tcggtcgtcc     3660

tgctcgacga tctggacctc atcgctgggc tgccggccgt gccggagcat gaacactccc     3720

ctgacgcggt ccagtcgcaa cggctcgccc acgccctgaa cgatatgatt aaggaattca     3780

tctcaatggg atcactggtg gccctgatcg cgacttccca gagccagcag tccctgcacc     3840

ctctgctggt gtcggcccag ggcgtgcaca tttttcagtg tgtgcaacac atccagccgc     3900

ccaaccagga gcagcggtgc gaaatcctgt gcaacgtgat taagaacaag ctggactgcg     3960

atatcaacaa gtttaccgac cttgatctcc aacatgtggc taaggagact gggggcttcg     4020

tggctcggga cttcacagtg ttggtggacc gggcaattca ctccagactg tcccgccaga     4080

gcatttccac ccgcgaaaaa ctggtcctga ccaccctcga cttccagaag gccctcagag     4140

gcttccttcc tgcgagcctc agatccgtca accttcacaa gccgcgggac cttggctggg     4200

acaagatcgg tgggctccac gaggtgcggc agatcctcat ggacaccatt cagctgcctg     4260

caaagtaccc cgagctgttc gccaacttgc cgattcgcca gcgcacggga atcctgctct     4320

acggcccccc gggcaccgga aagaccctgc tggccggtgt gatcgcccgg gaatcgagga     4380

tgaacttcat ctccgtgaag ggacccgaac tcctgtccaa gtacatcggt gcctccgaac     4440

aggccgtgcg cgatatattc attagggccc aggccgcgaa gccctgcatt ctgttcttcg     4500

acgagtttga atcgatcgcg ccccggaggg gccacgacaa cacgggagtg accgaccggg     4560

tggtgaacca gctgctcacc caactggatg gcgtggaagg ccttcaggga gtgtacgtgc     4620

tggcggctac ctccagaccg gacctgatcg atccggccct gctgcgcccc gggagactgg     4680

acaagtgcgt gtattgccct ccccctgacc aggtgtcaag gttggaaatc ctcaacgtgc     4740

tctcggactc cctgccactg gcagatgatg tggacctcca gcatgtggcc tccgtgactg     4800

acagcttcac aggagccgat ctgaaggccc tgctttacaa cgcccagttg gaggcgctgc     4860

acggtatgct gctgtcctcc ggtctgcagg atggctcctc ctcttccgat agcgacctgt     4920

cgctgagcag catggtgttc ctgaaccatt ccagcggctc cgatgacagc gcgggcgacg     4980

gagaatgtgg actggatcaa tccctggtgt ccctggagat gagcgagatt ctgccagacg     5040

agtccaagtt caacatgtac aggctgtact tcggcagcag ctacgagtcc gagctgggaa     5100

atggtacctc gtccgacctg tcaagccagt gcctgtccgc gccttcctcc atgacccagg     5160

acctccctgg agtgccaggg aaggatcagc tgttcagcca gcctcccgtg ctgcgcactg     5220

cgagccagga agggtgccag gaattgaccc aagagcagcg ggaccaactg cgcgcggaca     5280

tttcgatcat caaaggcaga taccgctccc aatccgggga ggacgaaagc atgaaccagc     5340

ccgggcctat caagactaga ctggcaatct cccaaagcca cctgatgacc gcactgggac     5400

acacccggcc ctcgatctcg gaggacgact ggaagaactt cgctgagctg tacgaatcct     5460

tccagaatcc gaagcggaga aagaaccaga gcggaactat gttccggccc ggacagaagg     5520

tgaccctggc ctgaagtact gcggatcctg cagatctgcc tcgactgtgc cttctagttg     5580

ccagccatct gttgtttgcc cctcccccgt gccttccttg accctggaag gtgccactcc     5640

cactgtcctt tcctaataaa atgaggaaat tgcatcgcat tgtctgagta ggtgtcattc     5700

tattctgggg ggtggggtgg ggcaggacag caagggggag gattgggaag acaatagcag     5760

gcatgctggg gactcgagtt ctacgtagat aagtagcatg gcgggttaat cattaactac     5820

aaggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag     5880

gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag     5940

cgagcgcgca gccttaatta acctaaggaa aatgaagtga agttcctata ctttctagag     6000

aataggaact tctatagtga gtcgaataag ggcgacacaa aatttattct aaatgcataa     6060

taaatactga taacatctta tagtttgtat tatattttgt attatcgttg acatgtataa     6120

ttttgatatc aaaaactgat tttcccttta ttattttcga gatttatttt cttaattctc     6180

tttaacaaac tagaaatatt gtatatacaa aaaatcataa ataatagatg aatagtttaa     6240

ttataggtgt tcatcaatcg aaaaagcaac gtatcttatt taaagtgcgt tgcttttttc     6300

tcatttataa ggttaaataa ttctcatata tcaagcaaag tgacaggcgc ccttaaatat     6360

tctgacaaat gctctttccc taaactcccc ccataaaaaa acccgccgaa gcgggttttt     6420

acgttatttg cggattaacg attactcgtt atcagaaccg cccagggggc ccgagcttaa     6480

cctttttatt tgggggagag ggaagtcatg aaaaaactaa cctttgaaat tcgatctcca     6540

gcacatcagc aaaacgctat tcacgcagta cagcaaatcc ttccagaccc aaccaaacca     6600

atcgtagtaa ccattcagga acgcaaccgc agcttagacc aaaacaggaa gctatgggcc     6660

tgcttaggtg acgtctctcg tcaggttgaa tggcatggtc gctggctgga tgcagaaagc     6720

tggaagtgtg tgtttaccgc agcattaaag cagcaggatg ttgttcctaa ccttgccggg     6780

aatggctttg tggtaatagg ccagtcaacc agcaggatgc gtgtaggcga atttgcggag     6840

ctattagagc ttatacaggc attcggtaca gagcgtggcg ttaagtggtc agacgaagcg     6900

agactggctc tggagtggaa agcgagatgg ggagacaggg ctgcatgata aatgtcgtta     6960

gtttctccgg tggcaggacg tcagcatatt tgctctggct aatggagcaa aagcgacggg     7020

caggtaaaga cgtgcattac gttttcatgg atacaggttg tgaacatcca atgacatatc     7080

ggtttgtcag ggaagttgtg aagttctggg atataccgct caccgtattg caggttgata     7140

tcaacccgga gcttggacag ccaaatggtt atacggtatg ggaaccaaag gatattcaga     7200

cgcgaatgcc tgttctgaag ccatttatcg atatggtaaa gaaatatggc actccatacg     7260

tcggcggcgc gttctgcact gacagattaa aactcgttcc cttcaccaaa tactgtgatg     7320

accatttcgg gcgagggaat tacaccacgt ggattggcat cagagctgat gaaccgaagc     7380

ggctaaagcc aaagcctgga atcagatatc ttgctgaact gtcagacttt gagaaggaag     7440

atatcctcgc atggtggaag caacaaccat tcgatttgca aataccggaa catctcggta     7500

actgcatatt ctgcattaaa aaatcaacgc aaaaaatcgg acttgcctgc aaagatgagg     7560

agggattgca gcgtgttttt aatgaggtca tcacgggatc ccatgtgcgt gacggacatc     7620

gggaaacgcc aaaggagatt atgtaccgag gaagaatgtc gctggacggt atcgcgaaaa     7680

tgtattcaga aaatgattat caagccctgt atcaggacat ggtacgagct aaaagattcg     7740

ataccggctc ttgttctgag tcatgcgaaa tatttggagg gcagcttgat ttcgacttcg     7800

ggagggaagc tgcatgatgc gatgttatcg gtgcggtgaa tgcaaagaag ataaccgctt     7860

ccgaccaaat caaccttact ggaatcgatg gtgtctccgg tgtgaaagaa caccaacagg     7920

ggtgttacca ctaccgcagg aaaaggagga cgtgtggcga gacagcgacg aagtatcacc     7980

gacataatct gcgaaaactg caaatacctt ccaacgaaac gcaccagaaa taaacccaag     8040

ccaatcccaa aagaatctga cgtaaaaacc ttcaactaca cggctcacct gtgggatatc     8100

cggtggctaa gacgtcgtgc gaggaaaaca aggtgattga ccaaaatcga agttacgaac     8160

aagaaagcgt cgagcgagct ttaacgtgcg ctaactgcgg tcagaagctg catgtgctgg     8220

aagttcacgt gtgtgagcac tgctgcgcag aactgatgag cgatccgaat agctcgatgc     8280

acgaggaaga agatgatggc taaaccagcg cgaagacgat gtaaaaacga tgaatgccgg     8340

gaatggtttc accctgcatt cgctaatcag tggtggtgct ctccagagtg tggaaccaag     8400

atagcactcg aacgacgaag taaagaacgc gaaaaagcgg aaaaagcagc agagaagaaa     8460

cgacgacgag aggagcagaa acagaaagat aaacttaaga ttcgaaaact cgccttaaag     8520

ccccgcagtt actggattaa acaagcccaa caagccgtaa acgccttcat cagagaaaga     8580

gaccgcgact taccatgtat ctcgtgcgga acgctcacgt ctgctcagtg ggatgccgga     8640

cattaccgga caactgctgc ggcacctcaa ctccgattta atgaacgcaa tattcacaag     8700

caatgcgtgg tgtgcaacca gcacaaaagc ggaaatctcg ttccgtatcg cgtcgaactg     8760

attagccgca tcgggcagga agcagtagac gaaatcgaat caaaccataa ccgccatcgc     8820

tggactatcg aagagtgcaa ggcgatcaag gcagagtacc aacagaaact caaagacctg     8880

cgaaatagca gaagtgaggc cgcatgacgt tctcagtaaa aaccattcca gacatgctcg     8940

ttgaagcata cggaaatcag acagaagtag cacgcagact gaaatgtagt cgcggtacgg     9000

tcagaaaata cgttgatgat aaagacggga aaatgcacgc catcgtcaac gacgttctca     9060

tggttcatcg cggatggagt gaaagagatg cgctattacg aaaaaattga tggcagcaaa     9120

taccgaaata tttgggtagt tggcgatctg cacggatgct acacgaacct gatgaacaaa     9180

ctggatacga ttggattcga caacaaaaaa gacctgctta tctcggtggg cgatttggtt     9240

gatcgtggtg cagagaacgt tgaatgcctg gaattaatca cattcccctg gttcagagct     9300

gtacgtggaa accatgagca aatgatgatt gatggcttat cagagcgtgg aaacgttaat     9360

cactggctgc ttaatggcgg tggctggttc tttaatctcg attacgacaa agaaattctg     9420

gctaaagctc ttgcccataa agcagatgaa cttccgttaa tcatcgaact ggtgagcaaa     9480

gataaaaaat atgttatctg ccacgccgat tatccctttg acgaatacga gtttggaaag     9540

ccagttgatc atcagcaggt aatctggaac cgcgaacgaa tcagcaactc acaaaacggg     9600

atcgtgaaag aaatcaaagg cgcggacacg ttcatctttg gtcatacgcc agcagtgaaa     9660

ccactcaagt ttgccaacca aatgtatatc gataccggcg cagtgttctg cggaaaccta     9720

acattgattc aggtacaggg agaaggcgca tgagactcga aagcgtagct aaatttcatt     9780

cgccaaaaag cccgatgatg agcgactcac cacgggccac ggcttctgac tctctttccg     9840

gtactgatgt gatggctgct atggggatgg cgcaatcaca agccggattc ggtatggctg     9900

cattctgcgg taagcacgaa ctcagccaga acgacaaaca aaaggctatc aactatctga     9960

tgcaatttgc acacaaggta tcggggaaat accgtggtgt ggcaaagctt gaaggaaata    10020

ctaaggcaaa ggtactgcaa gtgctcgcaa cattcgctta tgcggattat tgccgtagtg    10080

ccgcgacgcc gggggcaaga tgcagagatt gccatggtac aggccgtgcg gttgatattg    10140

ccaaaacaga gctgtggggg agagttgtcg agaaagagtg cggaagatgc aaaggcgtcg    10200

gctattcaag gatgccagca agcgcagcat atcgcgctgt gacgatgcta atcccaaacc    10260

ttacccaacc cacctggtca cgcactgtta agccgctgta tgacgctctg gtggtgcaat    10320

gccacaaaga agagtcaatc gcagacaaca ttttgaatgc ggtcacacgt tagcagcatg    10380

attgccacgg atggcaacat attaacggca tgatattgac ttattgaata aaattgggta    10440

aatttgactc aacgatgggt taattcgctc gttgtggtag tgagatgaaa agaggcggcg    10500

cttactaccg attccgccta gttggtcact tcgacgtatc gtctggaact ccaaccatcg    10560

caggcagaga ggtctgcaaa atgcaatccc gaaacagttc gcaggtaata gttagagcct    10620

gcataacggt ttcgggattt tttatatctg cacaacaggt aagagcattg agtcgataat    10680

cgtgaagagt cggcgagcct ggttagccag tgctctttcc gttgtgctga attaagcgaa    10740

taccggaagc agaaccggat caccaaatgc gtacaggcgt catcgccgcc cagcaacagc    10800

acaacccaaa ctgagccgta gccactgtct gtcctgaatt cattagtaat agttacgctg    10860

cggcctttta cacatgacct tcgtgaaagc gggtggcagg aggtcgcgct aacaacctcc    10920

tgccgttttg cccgtgcata tcggtcacga acaaatctga ttactaaaca cagtagcctg    10980

gatttgttct atcagtaatc gaccttattc ctaattaaat agagcaaatc cccttattgg    11040

gggtaagaca tgaagatgcc agaaaaacat gacctgttgg ccgccattct cgcggcaaag    11100

gaacaaggca tcggggcaat ccttgcgttt gcaatggcgt accttcgcgg cagatataat    11160

ggcggtgcgt ttacaaaaac agtaatcgac gcaacgatgt gcgccattat cgcctggttc    11220

attcgtgacc ttctcgactt cgccggacta agtagcaatc tcgcttatat aacgagcgtg    11280

tttatcggct acatcggtac tgactcgatt ggttcgctta tcaaacgctt cgctgctaaa    11340

aaagccggag tagaagatgg tagaaatcaa taatcaacgt aaggcgttcc tcgatatgct    11400

ggcgtggtcg gagggaactg ataacggacg tcagaaaacc agaaatcatg gttatgacgt    11460

cattgtaggc ggagagctat ttactgatta ctccgatcac cctcgcaaac ttgtcacgct    11520

aaacccaaaa ctcaaatcaa caggcgctta agactggccg tcgttttaca acacagaaag    11580

agtttgtaga aacgcaaaaa ggccatccgt caggggcctt ctgcttagtt tgatgcctgg    11640

cagttcccta ctctcgcctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg    11700

gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg    11760

ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa    11820

ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg    11880

acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc    11940

tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc    12000

ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc    12060

ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg    12120

ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc    12180

actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga    12240

gttcttgaag tggtgggcta actacggcta cactagaaga acagtatttg gtatctgcgc    12300

tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac    12360

caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg    12420

atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgacgcgcg    12480

cgtaactcac gttaagggat tttggtcatg agcttgcgcc gtcccgtcaa gtcagcgtaa    12540

tgctctgctt t                                                         12551


<210>  13
<211>  12835
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pAAV.CMV.hPEX1


<220>
<221>  misc_feature
<222>  (1253)..(6235)
<223>  AAV expression cassette

<220>
<221>  misc_feature
<222>  (1253)..(1382)
<223>  5' ITR

<220>
<221>  enhancer
<222>  (1433)..(1736)
<223>  CMV enhancer

<220>
<221>  promoter
<222>  (1737)..(1940)
<223>  CMV promoter

<220>
<221>  misc_feature
<222>  (1958)..(1966)
<223>  Kozak

<220>
<221>  misc_feature
<222>  (1967)..(5818)
<223>  Cdodon optimized hPEX1

<220>
<221>  polyA_signal
<222>  (5849)..(6056)
<223>  bGH Poly(A)

<220>
<221>  misc_feature
<222>  (6106)..(6235)
<223>  3' ITR

<400>  13
tagaaaaact catcgagcat caaatgaaac tgcaatttat tcatatcagg attatcaata       60

ccatattttt gaaaaagccg tttctgtaat gaaggagaaa actcaccgag gcagttccat      120

aggatggcaa gatcctggta tcggtctgcg attccgactc gtccaacatc aatacaacct      180

attaatttcc cctcgtcaaa aataaggtta tcaagtgaga aatcaccatg agtgacgact      240

gaatccggtg agaatggcaa aagtttatgc atttctttcc agacttgttc aacaggccag      300

ccattacgct cgtcatcaaa atcactcgca tcaaccaaac cgttattcat tcgtgattgc      360

gcctgagcga ggcgaaatac gcgatcgctg ttaaaaggac aattacaaac aggaatcgag      420

tgcaaccggc gcaggaacac tgccagcgca tcaacaatat tttcacctga atcaggatat      480

tcttctaata cctggaacgc tgtttttccg gggatcgcag tggtgagtaa ccatgcatca      540

tcaggagtac ggataaaatg cttgatggtc ggaagtggca taaattccgt cagccagttt      600

agtctgacca tctcatctgt aacatcattg gcaacgctac ctttgccatg tttcagaaac      660

aactctggcg catcgggctt cccatacaag cgatagattg tcgcacctga ttgcccgaca      720

ttatcgcgag cccatttata cccatataaa tcagcatcca tgttggaatt taatcgcggc      780

ctcgacgttt cccgttgaat atggctcata ttcttccttt ttcaatatta ttgaagcatt      840

tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa      900

ataggggtca gtgttacaac caattaacca attctgaaca ttatcgcgag cccatttata      960

cctgaatatg gctcataaca ccccttgttt gcctggcggc agtagcgcgg tggtcccacc     1020

tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg tggggactcc     1080

ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag tcgaaagact     1140

gggcctttcg cccgggctaa ttagggggtg tcgcccttat tcgactctat agtgaagttc     1200

ctattctcta gaaagtatag gaacttctga agtggggtcg acttaattaa ggctgcgcgc     1260

tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc     1320

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc     1380

cttgtagtta atgattaacc cgccatgcta cttatctacg tagcaagcta gccgttacat     1440

aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca ttgacgtcaa     1500

taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt caatgggtgg     1560

agtatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg ccaagtacgc     1620

cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag tacatgacct     1680

tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt accatggtga     1740

tgcggttttg gcagtacatc aatgggcgtg gatagcggtt tgactcacgg ggatttccaa     1800

gtctccaccc cattgacgtc aatgggagtt tgttttggca ccaaaatcaa cgggactttc     1860

caaaatgtcg taacaactcc gccccattga cgcaaatggg cggtaggcgt gtacggtggg     1920

aggtctatat aagcagagct tgtacactag cggccgcgcc gccaccatgt ggggaagcga     1980

cagactggcc ggagctggag ggggaggagc agccgtcacc gtggcgttca ctaacgcgcg     2040

ggactgcttt ctccatctgc cgcggaggct ggtcgcccag ctgcacctcc tgcagaacca     2100

ggccatcgag gtggtgtggt cccaccaacc ggcctttttg agctgggtcg agggaaggca     2160

cttttcggac cagggagaaa atgtggcgga gatcaaccgc caggtcggcc agaagctggg     2220

actgtccaac ggcggacagg tgttcctcaa gccgtgcagc cacgtggtgt cctgccaaca     2280

ggtggaagtg gagccgctct ccgccgacga ctgggagatc ctcgaattgc atgccgtgag     2340

cctcgaacag catctgttgg accagattcg cattgtgttc ccgaaggcca tattccccgt     2400

gtgggtcgat cagcagacct atatcttcat ccagattgtg gccctcatcc cggccgcctc     2460

atacggacgg ctggaaactg acaccaagct gctgattcaa cctaagaccc ggagggccaa     2520

agaaaacacc ttctccaagg ccgacgctga gtacaagaag ctccactcct acggacggga     2580

ccagaagggg atgatgaagg agctgcaaac caagcagctc cagagcaaca ccgtggggat     2640

caccgagtcc aatgaaaacg agtcggaaat cccagtcgat tcatcttccg tggccagcct     2700

gtggactatg atcggttcca ttttctcgtt ccaatctgag aagaagcagg aaactagctg     2760

ggggctgact gagatcaacg ccttcaagaa catgcagtcc aaagtggtgc ctctggataa     2820

catctttcgc gtgtgcaagt cccaaccgcc ctcaatctac aacgcgtccg ctacctccgt     2880

gtttcataag cactgtgcca tccacgtgtt cccatgggat caggaatact tcgatgtcga     2940

accttccttc accgtgactt acgggaagct tgtcaagctc ctcagcccca agcagcagca     3000

atcgaaaact aagcagaacg tgctttcccc ggagaaggag aagcaaatgt cagaaccact     3060

cgaccagaag aaaatcagat cggatcataa cgaagaggac gagaaggcct gcgtccttca     3120

ggtggtctgg aacggcctgg aggagctgaa caacgcgatt aagtacacca agaacgtcga     3180

ggtccttcac ctgggaaagg tgtggattcc ggatgatctg aggaaacgcc tcaacatcga     3240

aatgcacgct gtggtgcgga ttaccccggt cgaggtcacc ccaaagatcc ctcgctcctt     3300

gaagctgcag ccgcgagaaa acttgcccaa ggacatttct gaagaggata tcaagactgt     3360

gttctactcc tggctgcaac agagcactac caccatgctc cctctggtca tttcggagga     3420

agaattcatc aaactggaaa ccaaggacgg actgaaagaa ttctccctgt ccatcgtgca     3480

ctcctgggaa aaggagaagg acaagaatat cttcctgctg tcccccaatc tgctgcaaaa     3540

gaccacgatc caggtgctgc tcgaccccat ggtgaaggag gaaaactcag aagagatcga     3600

cttcatcctg ccgttcctta agctgagttc actgggaggc gtgaactccc ttggcgtgtc     3660

ctcgctggag cacatcactc actcactgct gggccggcct ctgagcagac agcttatgag     3720

cttggtcgcc ggactcagaa acggtgccct cctgctcacc ggcggcaagg gatcgggaaa     3780

gtccaccctc gctaaggcca tttgcaaaga ggcattcgat aagctggacg cccatgtgga     3840

gcgggtggac tgtaaggccc tccgcggaaa gcgattggaa aatattcaaa agactctcga     3900

agtcgccttt tccgaagccg tctggatgca gccctcggtc gtcctgctcg acgatctgga     3960

cctcatcgct gggctgccgg ccgtgccgga gcatgaacac tcccctgacg cggtccagtc     4020

gcaacggctc gcccacgccc tgaacgatat gattaaggaa ttcatctcaa tgggatcact     4080

ggtggccctg atcgcgactt cccagagcca gcagtccctg caccctctgc tggtgtcggc     4140

ccagggcgtg cacatttttc agtgtgtgca acacatccag ccgcccaacc aggagcagcg     4200

gtgcgaaatc ctgtgcaacg tgattaagaa caagctggac tgcgatatca acaagtttac     4260

cgaccttgat ctccaacatg tggctaagga gactgggggc ttcgtggctc gggacttcac     4320

agtgttggtg gaccgggcaa ttcactccag actgtcccgc cagagcattt ccacccgcga     4380

aaaactggtc ctgaccaccc tcgacttcca gaaggccctc agaggcttcc ttcctgcgag     4440

cctcagatcc gtcaaccttc acaagccgcg ggaccttggc tgggacaaga tcggtgggct     4500

ccacgaggtg cggcagatcc tcatggacac cattcagctg cctgcaaagt accccgagct     4560

gttcgccaac ttgccgattc gccagcgcac gggaatcctg ctctacggcc ccccgggcac     4620

cggaaagacc ctgctggccg gtgtgatcgc ccgggaatcg aggatgaact tcatctccgt     4680

gaagggaccc gaactcctgt ccaagtacat cggtgcctcc gaacaggccg tgcgcgatat     4740

attcattagg gcccaggccg cgaagccctg cattctgttc ttcgacgagt ttgaatcgat     4800

cgcgccccgg aggggccacg acaacacggg agtgaccgac cgggtggtga accagctgct     4860

cacccaactg gatggcgtgg aaggccttca gggagtgtac gtgctggcgg ctacctccag     4920

accggacctg atcgatccgg ccctgctgcg ccccgggaga ctggacaagt gcgtgtattg     4980

ccctccccct gaccaggtgt caaggttgga aatcctcaac gtgctctcgg actccctgcc     5040

actggcagat gatgtggacc tccagcatgt ggcctccgtg actgacagct tcacaggagc     5100

cgatctgaag gccctgcttt acaacgccca gttggaggcg ctgcacggta tgctgctgtc     5160

ctccggtctg caggatggct cctcctcttc cgatagcgac ctgtcgctga gcagcatggt     5220

gttcctgaac cattccagcg gctccgatga cagcgcgggc gacggagaat gtggactgga     5280

tcaatccctg gtgtccctgg agatgagcga gattctgcca gacgagtcca agttcaacat     5340

gtacaggctg tacttcggca gcagctacga gtccgagctg ggaaatggta cctcgtccga     5400

cctgtcaagc cagtgcctgt ccgcgccttc ctccatgacc caggacctcc ctggagtgcc     5460

agggaaggat cagctgttca gccagcctcc cgtgctgcgc actgcgagcc aggaagggtg     5520

ccaggaattg acccaagagc agcgggacca actgcgcgcg gacatttcga tcatcaaagg     5580

cagataccgc tcccaatccg gggaggacga aagcatgaac cagcccgggc ctatcaagac     5640

tagactggca atctcccaaa gccacctgat gaccgcactg ggacacaccc ggccctcgat     5700

ctcggaggac gactggaaga acttcgctga gctgtacgaa tccttccaga atccgaagcg     5760

gagaaagaac cagagcggaa ctatgttccg gcccggacag aaggtgaccc tggcctgaag     5820

tactgcggat cctgcagatc tgcctcgact gtgccttcta gttgccagcc atctgttgtt     5880

tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt cctttcctaa     5940

taaaatgagg aaattgcatc gcattgtctg agtaggtgtc attctattct ggggggtggg     6000

gtggggcagg acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggactcg     6060

agttctacgt agataagtag catggcgggt taatcattaa ctacaaggaa cccctagtga     6120

tggagttggc cactccctct ctgcgcgctc gctcgctcac tgaggccggg cgaccaaagg     6180

tcgcccgacg cccgggcttt gcccgggcgg cctcagtgag cgagcgagcg cgcagcctta     6240

attaacctaa ggaaaatgaa gtgaagttcc tatactttct agagaatagg aacttctata     6300

gtgagtcgaa taagggcgac acaaaattta ttctaaatgc ataataaata ctgataacat     6360

cttatagttt gtattatatt ttgtattatc gttgacatgt ataattttga tatcaaaaac     6420

tgattttccc tttattattt tcgagattta ttttcttaat tctctttaac aaactagaaa     6480

tattgtatat acaaaaaatc ataaataata gatgaatagt ttaattatag gtgttcatca     6540

atcgaaaaag caacgtatct tatttaaagt gcgttgcttt tttctcattt ataaggttaa     6600

ataattctca tatatcaagc aaagtgacag gcgcccttaa atattctgac aaatgctctt     6660

tccctaaact ccccccataa aaaaacccgc cgaagcgggt ttttacgtta tttgcggatt     6720

aacgattact cgttatcaga accgcccagg gggcccgagc ttaacctttt tatttggggg     6780

agagggaagt catgaaaaaa ctaacctttg aaattcgatc tccagcacat cagcaaaacg     6840

ctattcacgc agtacagcaa atccttccag acccaaccaa accaatcgta gtaaccattc     6900

aggaacgcaa ccgcagctta gaccaaaaca ggaagctatg ggcctgctta ggtgacgtct     6960

ctcgtcaggt tgaatggcat ggtcgctggc tggatgcaga aagctggaag tgtgtgttta     7020

ccgcagcatt aaagcagcag gatgttgttc ctaaccttgc cgggaatggc tttgtggtaa     7080

taggccagtc aaccagcagg atgcgtgtag gcgaatttgc ggagctatta gagcttatac     7140

aggcattcgg tacagagcgt ggcgttaagt ggtcagacga agcgagactg gctctggagt     7200

ggaaagcgag atggggagac agggctgcat gataaatgtc gttagtttct ccggtggcag     7260

gacgtcagca tatttgctct ggctaatgga gcaaaagcga cgggcaggta aagacgtgca     7320

ttacgttttc atggatacag gttgtgaaca tccaatgaca tatcggtttg tcagggaagt     7380

tgtgaagttc tgggatatac cgctcaccgt attgcaggtt gatatcaacc cggagcttgg     7440

acagccaaat ggttatacgg tatgggaacc aaaggatatt cagacgcgaa tgcctgttct     7500

gaagccattt atcgatatgg taaagaaata tggcactcca tacgtcggcg gcgcgttctg     7560

cactgacaga ttaaaactcg ttcccttcac caaatactgt gatgaccatt tcgggcgagg     7620

gaattacacc acgtggattg gcatcagagc tgatgaaccg aagcggctaa agccaaagcc     7680

tggaatcaga tatcttgctg aactgtcaga ctttgagaag gaagatatcc tcgcatggtg     7740

gaagcaacaa ccattcgatt tgcaaatacc ggaacatctc ggtaactgca tattctgcat     7800

taaaaaatca acgcaaaaaa tcggacttgc ctgcaaagat gaggagggat tgcagcgtgt     7860

ttttaatgag gtcatcacgg gatcccatgt gcgtgacgga catcgggaaa cgccaaagga     7920

gattatgtac cgaggaagaa tgtcgctgga cggtatcgcg aaaatgtatt cagaaaatga     7980

ttatcaagcc ctgtatcagg acatggtacg agctaaaaga ttcgataccg gctcttgttc     8040

tgagtcatgc gaaatatttg gagggcagct tgatttcgac ttcgggaggg aagctgcatg     8100

atgcgatgtt atcggtgcgg tgaatgcaaa gaagataacc gcttccgacc aaatcaacct     8160

tactggaatc gatggtgtct ccggtgtgaa agaacaccaa caggggtgtt accactaccg     8220

caggaaaagg aggacgtgtg gcgagacagc gacgaagtat caccgacata atctgcgaaa     8280

actgcaaata ccttccaacg aaacgcacca gaaataaacc caagccaatc ccaaaagaat     8340

ctgacgtaaa aaccttcaac tacacggctc acctgtggga tatccggtgg ctaagacgtc     8400

gtgcgaggaa aacaaggtga ttgaccaaaa tcgaagttac gaacaagaaa gcgtcgagcg     8460

agctttaacg tgcgctaact gcggtcagaa gctgcatgtg ctggaagttc acgtgtgtga     8520

gcactgctgc gcagaactga tgagcgatcc gaatagctcg atgcacgagg aagaagatga     8580

tggctaaacc agcgcgaaga cgatgtaaaa acgatgaatg ccgggaatgg tttcaccctg     8640

cattcgctaa tcagtggtgg tgctctccag agtgtggaac caagatagca ctcgaacgac     8700

gaagtaaaga acgcgaaaaa gcggaaaaag cagcagagaa gaaacgacga cgagaggagc     8760

agaaacagaa agataaactt aagattcgaa aactcgcctt aaagccccgc agttactgga     8820

ttaaacaagc ccaacaagcc gtaaacgcct tcatcagaga aagagaccgc gacttaccat     8880

gtatctcgtg cggaacgctc acgtctgctc agtgggatgc cggacattac cggacaactg     8940

ctgcggcacc tcaactccga tttaatgaac gcaatattca caagcaatgc gtggtgtgca     9000

accagcacaa aagcggaaat ctcgttccgt atcgcgtcga actgattagc cgcatcgggc     9060

aggaagcagt agacgaaatc gaatcaaacc ataaccgcca tcgctggact atcgaagagt     9120

gcaaggcgat caaggcagag taccaacaga aactcaaaga cctgcgaaat agcagaagtg     9180

aggccgcatg acgttctcag taaaaaccat tccagacatg ctcgttgaag catacggaaa     9240

tcagacagaa gtagcacgca gactgaaatg tagtcgcggt acggtcagaa aatacgttga     9300

tgataaagac gggaaaatgc acgccatcgt caacgacgtt ctcatggttc atcgcggatg     9360

gagtgaaaga gatgcgctat tacgaaaaaa ttgatggcag caaataccga aatatttggg     9420

tagttggcga tctgcacgga tgctacacga acctgatgaa caaactggat acgattggat     9480

tcgacaacaa aaaagacctg cttatctcgg tgggcgattt ggttgatcgt ggtgcagaga     9540

acgttgaatg cctggaatta atcacattcc cctggttcag agctgtacgt ggaaaccatg     9600

agcaaatgat gattgatggc ttatcagagc gtggaaacgt taatcactgg ctgcttaatg     9660

gcggtggctg gttctttaat ctcgattacg acaaagaaat tctggctaaa gctcttgccc     9720

ataaagcaga tgaacttccg ttaatcatcg aactggtgag caaagataaa aaatatgtta     9780

tctgccacgc cgattatccc tttgacgaat acgagtttgg aaagccagtt gatcatcagc     9840

aggtaatctg gaaccgcgaa cgaatcagca actcacaaaa cgggatcgtg aaagaaatca     9900

aaggcgcgga cacgttcatc tttggtcata cgccagcagt gaaaccactc aagtttgcca     9960

accaaatgta tatcgatacc ggcgcagtgt tctgcggaaa cctaacattg attcaggtac    10020

agggagaagg cgcatgagac tcgaaagcgt agctaaattt cattcgccaa aaagcccgat    10080

gatgagcgac tcaccacggg ccacggcttc tgactctctt tccggtactg atgtgatggc    10140

tgctatgggg atggcgcaat cacaagccgg attcggtatg gctgcattct gcggtaagca    10200

cgaactcagc cagaacgaca aacaaaaggc tatcaactat ctgatgcaat ttgcacacaa    10260

ggtatcgggg aaataccgtg gtgtggcaaa gcttgaagga aatactaagg caaaggtact    10320

gcaagtgctc gcaacattcg cttatgcgga ttattgccgt agtgccgcga cgccgggggc    10380

aagatgcaga gattgccatg gtacaggccg tgcggttgat attgccaaaa cagagctgtg    10440

ggggagagtt gtcgagaaag agtgcggaag atgcaaaggc gtcggctatt caaggatgcc    10500

agcaagcgca gcatatcgcg ctgtgacgat gctaatccca aaccttaccc aacccacctg    10560

gtcacgcact gttaagccgc tgtatgacgc tctggtggtg caatgccaca aagaagagtc    10620

aatcgcagac aacattttga atgcggtcac acgttagcag catgattgcc acggatggca    10680

acatattaac ggcatgatat tgacttattg aataaaattg ggtaaatttg actcaacgat    10740

gggttaattc gctcgttgtg gtagtgagat gaaaagaggc ggcgcttact accgattccg    10800

cctagttggt cacttcgacg tatcgtctgg aactccaacc atcgcaggca gagaggtctg    10860

caaaatgcaa tcccgaaaca gttcgcaggt aatagttaga gcctgcataa cggtttcggg    10920

attttttata tctgcacaac aggtaagagc attgagtcga taatcgtgaa gagtcggcga    10980

gcctggttag ccagtgctct ttccgttgtg ctgaattaag cgaataccgg aagcagaacc    11040

ggatcaccaa atgcgtacag gcgtcatcgc cgcccagcaa cagcacaacc caaactgagc    11100

cgtagccact gtctgtcctg aattcattag taatagttac gctgcggcct tttacacatg    11160

accttcgtga aagcgggtgg caggaggtcg cgctaacaac ctcctgccgt tttgcccgtg    11220

catatcggtc acgaacaaat ctgattacta aacacagtag cctggatttg ttctatcagt    11280

aatcgacctt attcctaatt aaatagagca aatcccctta ttgggggtaa gacatgaaga    11340

tgccagaaaa acatgacctg ttggccgcca ttctcgcggc aaaggaacaa ggcatcgggg    11400

caatccttgc gtttgcaatg gcgtaccttc gcggcagata taatggcggt gcgtttacaa    11460

aaacagtaat cgacgcaacg atgtgcgcca ttatcgcctg gttcattcgt gaccttctcg    11520

acttcgccgg actaagtagc aatctcgctt atataacgag cgtgtttatc ggctacatcg    11580

gtactgactc gattggttcg cttatcaaac gcttcgctgc taaaaaagcc ggagtagaag    11640

atggtagaaa tcaataatca acgtaaggcg ttcctcgata tgctggcgtg gtcggaggga    11700

actgataacg gacgtcagaa aaccagaaat catggttatg acgtcattgt aggcggagag    11760

ctatttactg attactccga tcaccctcgc aaacttgtca cgctaaaccc aaaactcaaa    11820

tcaacaggcg cttaagactg gccgtcgttt tacaacacag aaagagtttg tagaaacgca    11880

aaaaggccat ccgtcagggg ccttctgctt agtttgatgc ctggcagttc cctactctcg    11940

ccttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta    12000

tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag    12060

aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg    12120

tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg    12180

tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg    12240

cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga    12300

agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc    12360

tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt    12420

aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact    12480

ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg    12540

gctaactacg gctacactag aagaacagta tttggtatct gcgctctgct gaagccagtt    12600

accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt    12660

ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct    12720

ttgatctttt ctacggggtc tgacgctcag tggaacgacg cgcgcgtaac tcacgttaag    12780

ggattttggt catgagcttg cgccgtcccg tcaagtcagc gtaatgctct gcttt         12835


