                         SEQUENCE LISTING

<110>  ULTRAGENYX PHARMACEUTICAL INC.
 
<120>  GENE THERAPY FOR TREATING PROPIONIC ACIDEMIA

<130>  ULP-009WO

<150>  63/002,541
<151>  2020-03-31

<160>  31    

<170>  PatentIn version 3.5

<210>  1
<211>  572
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(572)
<223>  IVS2 INTRON

<400>  1
agcttacttg tggtaccgag ctcggatcct gagaacttca gggtgagtct atgggaccct       60

tgatgttttc tttccccttc ttttctatgg ttaagttcat gtcataggaa ggggagaagt      120

aacagggtac acatattgac caaatcaggg taattttgca tttgtaattt taaaaaatgc      180

tttcttcttt taatatactt ttttgtttat cttatttcta atactttccc taatctcttt      240

ctttcagggc aataatgata caatgtatca tgcctctttg caccattcta aagaataaca      300

gtgataattt ctgggttaag gcaatagcaa tatttctgca tataaatatt tctgcatata      360

aattgtaact gatgtaagag gtttcatatt gctaatagca gctacaatcc agctaccatt      420

ctgcttttat tttatggttg ggataaggct ggattattct gagtccaagc taggcccttt      480

tgctaatcat gttcatacct cttatcttcc tcccacagct cctgggcaac gtgctggtct      540

gtgtgctggc ccatcacttt ggcaaagaat tg                                    572


<210>  2
<211>  2184
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(2184)
<223>  PCCA Coding Sequence - Wild-Type

<400>  2
atggcggggt tctgggtcgg gacagcaccg ctggtcgctg ccggacggcg tgggcggtgg       60

ccgccgcagc agctgatgct gagcgcggcg ctgcggaccc tgaagcatgt tctgtactat      120

tcaagacagt gcttaatggt gtcccgtaat cttggttcag tgggatatga tcctaatgaa      180

aaaacttttg ataaaattct tgttgctaat agaggagaaa ttgcatgtcg ggttattaga      240

acttgcaaga agatgggcat taagacagtt gccatccaca gtgatgttga tgctagttct      300

gttcatgtga aaatggcgga tgaggctgtc tgtgttggcc cagctcccac cagtaaaagc      360

tacctcaaca tggatgccat catggaagcc attaagaaaa ccagggccca agctgtacat      420

ccaggttatg gattcctttc agaaaacaaa gaatttgcca gatgtttggc agcagaagat      480

gtcgttttca ttggacctga cacacatgct attcaagcca tgggcgacaa gattgaaagc      540

aaattattag ctaagaaagc agaggttaat acaatccctg gctttgatgg agtagtcaag      600

gatgcagaag aagctgtcag aattgcaagg gaaattggct accctgtcat gatcaaggcc      660

tcagcaggtg gtggtgggaa aggcatgcgc attgcttggg atgatgaaga gaccagggat      720

ggttttagat tgtcatctca agaagctgct tctagttttg gcgatgatag actactaata      780

gaaaaattta ttgataatcc tcgtcatata gaaatccagg ttctaggtga taaacatggg      840

aatgctttat ggcttaatga aagagagtgc tcaattcaga gaagaaatca gaaggtggtg      900

gaggaagcac caagcatttt tttggatgcg gagactcgaa gagcgatggg agaacaagct      960

gtagctcttg ccagagcagt aaaatattcc tctgctggga ccgtggagtt ccttgtggac     1020

tctaagaaga atttttattt cttggaaatg aatacaagac tccaggttga gcatcctgtc     1080

acagaatgca ttactggcct ggacctagtc caggaaatga tccgtgttgc taagggctac     1140

cctctcaggc acaaacaagc tgatattcgc atcaacggct gggcagttga atgtcgggtt     1200

tatgctgagg acccctacaa gtcttttggt ttaccatcta ttgggagatt gtctcagtac     1260

caagaaccgt tacatctacc tggtgtccga gtggacagtg gcatccaacc aggaagtgat     1320

attagcattt attatgatcc tatgatttca aaactaatca catatggctc tgatagaact     1380

gaggcactga agagaatggc agatgcactg gataactatg ttattcgagg tgttacacat     1440

aatattgcat tacttcgaga ggtgataatc aactcacgct ttgtaaaagg agacatcagc     1500

actaaatttc tctccgatgt gtatcctgat ggcttcaaag gacacatgct aaccaagagt     1560

gagaagaacc agttattggc aatagcatca tcattgtttg tggcattcca gttaagagca     1620

caacattttc aagaaaattc aagaatgcct gttattaaac cagacatagc caactgggag     1680

ctctcagtaa aattgcatga taaagttcat accgtagtag catcaaacaa tgggtcagtg     1740

ttctcggtgg aagttgatgg gtcgaaacta aatgtgacca gcacgtggaa cctggcttcg     1800

cccttattgt ctgtcagcgt tgatggcact cagaggactg tccagtgtct ttctcgagaa     1860

gcaggtggaa acatgagcat tcagtttctt ggtacagtgt acaaggtgaa tatcttaacc     1920

agacttgccg cagaattgaa caaatttatg ctggaaaaag tgactgagga cacaagcagt     1980

gttctgcgtt ccccgatgcc cggagtggtg gtggccgtct ctgtcaagcc tggagacgcg     2040

gtagcagaag gtcaagaaat ttgtgtgatt gaagccatga aaatgcagaa tagtatgaca     2100

gctgggaaaa ctggcacggt gaaatctgtg cactgtcaag ctggagacac agttggagaa     2160

ggggatctgc tcgtggagct ggaa                                            2184


<210>  3
<211>  2184
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: PCCA Coding Sequence- 
       Codon-Optimized

<400>  3
atggctggat tttgggtcgg aacggcccct ctcgtggccg ccggccgccg gggtcggtgg       60

ccgccgcaac agctgatgtt gtcggccgcg ctgcgcaccc ttaagcatgt gctgtactac      120

tcccggcaat gccttatggt gtccagaaac ctgggtagcg tgggctatga cccgaacgag      180

aaaaccttcg acaagattct ggtggccaac cggggggaaa ttgcctgccg ggtcatcagg      240

acttgcaaga agatgggcat caagaccgtc gccattcact ccgacgtgga cgcctcctcc      300

gtgcacgtga agatggcaga tgaagccgtc tgcgtgggcc ccgccccgac ctccaagtcc      360

taccttaaca tggacgcgat catggaagcc atcaaaaaga ccagagccca ggcagtgcac      420

ccgggatacg gctttctctc cgaaaacaag gagttcgcgc ggtgcctggc cgctgaagat      480

gtcgtgttca tcggccctga tacccacgcg atccaggcta tgggagacaa gatcgaatcc      540

aagctgctcg ccaagaaagc cgaagtcaac accatacctg ggtttgacgg cgtggtcaag      600

gacgcagaag aagccgtcag gattgcccgc gagatcggat accccgtgat gatcaaggca      660

tccgccgggg ggggaggaaa gggaatgcgc atcgcctggg atgacgaaga aacccgggac      720

ggcttcagac tctcgtcaca agaggccgcg tcctcattcg gggatgaccg gctcctgatt      780

gagaagttca ttgacaatcc tcggcacatc gagattcagg tcctgggcga taagcatgga      840

aacgccctgt ggctgaacga acgcgaatgc agcatccaga ggcggaacca gaaagtggtg      900

gaagaggccc catccatctt tctcgacgcc gagactcgga gagcgatggg tgaacaggcc      960

gtggccctgg cccgagccgt gaagtactcc agcgcgggga ctgtcgagtt cctggtggac     1020

agcaagaaga atttctactt cctggagatg aatactcggc tccaagtgga acaccccgtg     1080

accgaatgca ttaccggtct ggacctcgtc caagaaatga tccgcgtcgc caagggctac     1140

ccattgagac acaaacaggc cgacattcgg atcaacggat gggccgtcga gtgtcgcgtg     1200

tacgcggaag atccgtataa gtcgttcgga ctgccgtcca ttggtagact ctcgcagtac     1260

caagagccac tgcacctccc cggagtgcgc gtggactcag gcatccagcc cggaagcgac     1320

atctctatct actacgaccc catgatttcc aagttgatca cctacgggtc cgataggacc     1380

gaggcactga agcgcatggc tgacgcactt gacaactacg tgatccgcgg ggtcactcac     1440

aacattgccc tgctccgcga agtgatcatc aactcgcgct tcgtgaaggg cgacatctcc     1500

actaagttcc tgtccgacgt gtaccctgac ggtttcaagg gccatatgct gaccaagtcc     1560

gagaagaacc agctcctggc tatcgcctcc tccctgtttg tggcgttcca gctgagggcg     1620

cagcacttcc aggagaacag ccggatgccc gtgatcaagc ctgacatcgc caattgggag     1680

ctgtccgtga agctgcacga taaggtccat accgtggtgg catccaacaa cggatcggtg     1740

ttcagcgtgg aagtggacgg gtccaagctg aacgtgacca gcacatggaa cctggcgtcc     1800

cccctgttgt ctgtgtcggt cgatggcacg cagcgcactg tgcagtgcct ctcccgggaa     1860

gctggcggaa acatgagcat ccagttcctg ggtactgtgt acaaggtcaa cattctgact     1920

cggctggccg ccgagctgaa caagttcatg ttggaaaaag tcaccgaaga tacatcgtca     1980

gtcctgcgga gcccaatgcc tggagtcgtg gtggcggtgt cagtgaagcc cggcgatgct     2040

gtggccgaag gccaagagat ctgcgtgatc gaggccatga agatgcagaa ctcgatgacc     2100

gccggaaaga ccggtaccgt gaagtccgtg cattgtcaag cgggcgacac tgtgggagag     2160

ggagatctgc tcgtggagct ggag                                            2184


<210>  4
<211>  2184
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: PCCA Coding Sequence - 
       Codon-Optimized

<400>  4
atggccggct tctgggtcgg caccgcccct ctggtcgcgg ccggacgacg cggacgctgg       60

ccaccccagc aactgatgct gagcgcggcc ttgaggactc tgaagcacgt gctctactac      120

tcgcggcagt gcctgatggt gtcccggaat ctggggtccg tgggatacga ccctaacgaa      180

aagaccttcg ataagatcct cgtggcaaat cggggagaga tcgcgtgtcg cgtgatccgc      240

acgtgcaaga agatggggat caagactgtg gcaatccata gcgatgtgga tgcatcctcg      300

gtccacgtga agatggccga cgaagctgtg tgcgtgggac cggcgccgac ttcgaaatcg      360

tacctgaaca tggacgctat tatggaggcg atcaagaaaa cgcgcgccca agcggtccat      420

cccggttacg gattcctgag cgagaacaag gaatttgcac ggtgcctcgc tgccgaggac      480

gtggtgttta tcggtcccga cacccacgcc atccaagcta tgggggacaa gattgagtcc      540

aagctcctgg cgaaaaaggc agaggtcaac acaattcctg gtttcgacgg cgtcgtgaag      600

gacgccgaag aagccgtgcg catcgcgagg gaaatcggtt accctgtgat gattaaggcc      660

tccgccggcg gcggtggaaa gggaatgaga attgcctggg acgatgaaga aacccgcgac      720

ggattccgcc tgtcgagcca ggaagccgcc tcttccttcg gcgatgacag actgctgatc      780

gaaaagttca tcgataaccc cagacacatt gagatccaag tgctcgggga taagcacggc      840

aacgcccttt ggctgaacga gagagagtgc tccattcaac gccgcaatca gaaggtcgtg      900

gaggaagccc cgtcgatatt cctggatgcc gaaacccggc gggccatggg agagcaggct      960

gtcgcgttgg cgcgggccgt caagtacagc tcggccggga ccgtggaatt tctggtcgat     1020

tccaagaaga acttctattt cctggagatg aacaccagac tccaggtcga gcacccggtc     1080

actgagtgta tcaccgggct cgatctggtg caagagatga ttcgggtggc gaagggatat     1140

ccccttcggc ataaacaagc cgacatcagg atcaacggtt gggccgtgga atgcagggtc     1200

tacgccgagg acccctacaa gagcttcggc ctgcccagca tcggccgcct gtcacagtat     1260

caggaaccgc tgcatcttcc gggcgtgcgg gtcgacagcg gaattcagcc tggctcagat     1320

atctccatct actacgatcc aatgatctca aagctgatta cttatggatc cgaccggacc     1380

gaagccctta agcgaatggc cgacgccctg gacaactacg tgatccgggg agtgacccac     1440

aacatcgcct tgctgcggga agtgatcatt aacagcagat tcgtgaaggg agacatcagc     1500

accaagttcc tgtcggatgt ctacccggac gggttcaaag ggcacatgct tactaagtcc     1560

gagaagaatc agctgctcgc cattgcgtca agcttgttcg tggcctttca actccgggcc     1620

cagcacttcc aggaaaactc ccgcatgcca gtcattaagc cggacatcgc caactgggaa     1680

ctcagcgtga agctccatga caaagtgcat accgtggtgg ccagcaacaa cggtagcgtg     1740

ttctcagtcg aggtcgatgg ctcgaagctc aacgtcactt ccacttggaa cttggccagc     1800

ccgctgctgt ccgtgtccgt ggacggaacc cagaggaccg tgcagtgtct gtcgagagaa     1860

gccggcggca acatgtcaat ccagttcctg ggaaccgtgt acaaggtcaa catcctgacc     1920

agactggccg ccgaactgaa caagtttatg ctcgagaaag tgaccgagga cactagctcc     1980

gtgctgcgct cccctatgcc cggagtggtc gtggcagtgt ccgtgaagcc gggcgacgcc     2040

gtggccgagg gacaggaaat ctgtgtgatc gaagcgatga agatgcagaa ttcaatgacc     2100

gcgggaaaga ctgggaccgt gaagtctgtg cactgccagg ctggcgatac cgtgggggag     2160

ggcgaccttc tggtggaact cgag                                            2184


<210>  5
<211>  2184
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: PCCA Coding Sequence - 
       Codon-Optimized

<400>  5
atggctgggt tttgggtggg gacagctcct ctggtggctg ctgggaggag ggggaggtgg       60

cctcctcagc agctgatgct gtctgctgct ctgaggacac tgaagcatgt gctgtattat      120

tctaggcagt gtctgatggt gtctaggaat ctggggtctg tggggtatga tcctaatgag      180

aagacatttg ataagattct ggtggctaat aggggggaga ttgcttgtag ggtgattagg      240

acatgtaaga agatggggat taagacagtg gctattcatt ctgatgtgga tgcttcttct      300

gtgcatgtga agatggctga tgaggctgtg tgtgtggggc ctgctcctac atctaagtct      360

tatctgaata tggatgctat tatggaggct attaagaaga caagggctca ggctgtgcat      420

cctgggtatg ggtttctgtc tgagaataag gagtttgcta ggtgtctggc tgctgaggat      480

gtggtgttta ttgggcctga tacacatgct attcaggcta tgggggataa gattgagtct      540

aagctgctgg ctaagaaggc tgaggtgaat acaattcctg ggtttgatgg ggtggtgaag      600

gatgctgagg aggctgtgag gattgctagg gagattgggt atcctgtgat gattaaggct      660

tctgctgggg ggggggggaa ggggatgagg attgcttggg atgatgagga gacaagggat      720

gggtttaggc tgtcttctca ggaggctgct tcttcttttg gggatgatag gctgctgatt      780

gagaagttta ttgataatcc taggcatatt gagattcagg tgctggggga taagcatggg      840

aatgctctgt ggctgaatga gagggagtgt tctattcaga ggaggaatca gaaggtggtg      900

gaggaggctc cttctatttt tctggatgct gagacaagga gggctatggg ggagcaggct      960

gtggctctgg ctagggctgt gaagtattct tctgctggga cagtggagtt tctggtggat     1020

tctaagaaga atttttattt tctggagatg aatacaaggc tgcaggtgga gcatcctgtg     1080

acagagtgta ttacagggct ggatctggtg caggagatga ttagggtggc taaggggtat     1140

cctctgaggc ataagcaggc tgatattagg attaatgggt gggctgtgga gtgtagggtg     1200

tatgctgagg atccttataa gtcttttggg ctgccttcta ttgggaggct gtctcagtat     1260

caggagcctc tgcatctgcc tggggtgagg gtggattctg ggattcagcc tgggtctgat     1320

atttctattt attatgatcc tatgatttct aagctgatta catatgggtc tgataggaca     1380

gaggctctga agaggatggc tgatgctctg gataattatg tgattagggg ggtgacacat     1440

aatattgctc tgctgaggga ggtgattatt aattctaggt ttgtgaaggg ggatatttct     1500

acaaagtttc tgtctgatgt gtatcctgat gggtttaagg ggcatatgct gacaaagtct     1560

gagaagaatc agctgctggc tattgcttct tctctgtttg tggcttttca gctgagggct     1620

cagcattttc aggagaattc taggatgcct gtgattaagc ctgatattgc taattgggag     1680

ctgtctgtga agctgcatga taaggtgcat acagtggtgg cttctaataa tgggtctgtg     1740

ttttctgtgg aggtggatgg gtctaagctg aatgtgacat ctacatggaa tctggcttct     1800

cctctgctgt ctgtgtctgt ggatgggaca cagaggacag tgcagtgtct gtctagggag     1860

gctgggggga atatgtctat tcagtttctg gggacagtgt ataaggtgaa tattctgaca     1920

aggctggctg ctgagctgaa taagtttatg ctggagaagg tgacagagga tacatcttct     1980

gtgctgaggt ctcctatgcc tggggtggtg gtggctgtgt ctgtgaagcc tggggatgct     2040

gtggctgagg ggcaggagat ttgtgtgatt gaggctatga agatgcagaa ttctatgaca     2100

gctgggaaga cagggacagt gaagtctgtg cattgtcagg ctggggatac agtgggggag     2160

ggggatctgc tggtggagct ggag                                            2184


<210>  6
<211>  2184
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: PCCA Coding Sequence - 
       Codon-Optimized

<400>  6
atggcagggt tttgggtggg gacagcacca ctggtggcag cagggaggag ggggaggtgg       60

ccaccacagc agctgatgct gtcagcagca ctgaggacac tgaagcatgt gctgtattat      120

tcaaggcagt gtctgatggt gtcaaggaat ctggggtcag tggggtatga tccaaatgag      180

aagacatttg ataagattct ggtggcaaat aggggggaga ttgcatgtag ggtgattagg      240

acatgtaaga agatggggat taagacagtg gcaattcatt cagatgtgga tgcatcatca      300

gtgcatgtga agatggcaga tgaggcagtg tgtgtggggc cagcaccaac atcaaagtca      360

tatctgaata tggatgcaat tatggaggca attaagaaga caagggcaca ggcagtgcat      420

ccagggtatg ggtttctgtc agagaataag gagtttgcaa ggtgtctggc agcagaggat      480

gtggtgttta ttgggccaga tacacatgca attcaggcaa tgggggataa gattgagtca      540

aagctgctgg caaagaaggc agaggtgaat acaattccag ggtttgatgg ggtggtgaag      600

gatgcagagg aggcagtgag gattgcaagg gagattgggt atccagtgat gattaaggca      660

tcagcagggg ggggggggaa ggggatgagg attgcatggg atgatgagga gacaagggat      720

gggtttaggc tgtcatcaca ggaggcagca tcatcatttg gggatgatag gctgctgatt      780

gagaagttta ttgataatcc aaggcatatt gagattcagg tgctggggga taagcatggg      840

aatgcactgt ggctgaatga gagggagtgt tcaattcaga ggaggaatca gaaggtggtg      900

gaggaggcac catcaatttt tctggatgca gagacaagga gggcaatggg ggagcaggca      960

gtggcactgg caagggcagt gaagtattca tcagcaggga cagtggagtt tctggtggat     1020

tcaaagaaga atttttattt tctggagatg aatacaaggc tgcaggtgga gcatccagtg     1080

acagagtgta ttacagggct ggatctggtg caggagatga ttagggtggc aaaggggtat     1140

ccactgaggc ataagcaggc agatattagg attaatgggt gggcagtgga gtgtagggtg     1200

tatgcagagg atccatataa gtcatttggg ctgccatcaa ttgggaggct gtcacagtat     1260

caggagccac tgcatctgcc aggggtgagg gtggattcag ggattcagcc agggtcagat     1320

atttcaattt attatgatcc aatgatttca aagctgatta catatgggtc agataggaca     1380

gaggcactga agaggatggc agatgcactg gataattatg tgattagggg ggtgacacat     1440

aatattgcac tgctgaggga ggtgattatt aattcaaggt ttgtgaaggg ggatatttca     1500

acaaagtttc tgtcagatgt gtatccagat gggtttaagg ggcatatgct gacaaagtca     1560

gagaagaatc agctgctggc aattgcatca tcactgtttg tggcatttca gctgagggca     1620

cagcattttc aggagaattc aaggatgcca gtgattaagc cagatattgc aaattgggag     1680

ctgtcagtga agctgcatga taaggtgcat acagtggtgg catcaaataa tgggtcagtg     1740

ttttcagtgg aggtggatgg gtcaaagctg aatgtgacat caacatggaa tctggcatca     1800

ccactgctgt cagtgtcagt ggatgggaca cagaggacag tgcagtgtct gtcaagggag     1860

gcagggggga atatgtcaat tcagtttctg gggacagtgt ataaggtgaa tattctgaca     1920

aggctggcag cagagctgaa taagtttatg ctggagaagg tgacagagga tacatcatca     1980

gtgctgaggt caccaatgcc aggggtggtg gtggcagtgt cagtgaagcc aggggatgca     2040

gtggcagagg ggcaggagat ttgtgtgatt gaggcaatga agatgcagaa ttcaatgaca     2100

gcagggaaga cagggacagt gaagtcagtg cattgtcagg caggggatac agtgggggag     2160

ggggatctgc tggtggagct ggag                                            2184


<210>  7
<211>  2184
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: PCCA Coding Sequence - 
       Codon-Optimized

<400>  7
atggctggtt tttgggtagg tacagcacca ctagtagcag caggtaggag gggtaggtgg       60

ccaccacaac aactaatgct atcagcagca ctaaggacac taaagcatgt actatattat      120

tcaaggcaat gtctaatggt atcaaggaat ctggggtctg tggggtatga tcctaatgag      180

aagacatttg ataagattct ggtggctaat aggggggaga ttgcttgtag ggtgattagg      240

acatgtaaga agatggggat taagacagtg gctattcatt ctgatgtgga tgcttcttct      300

gtgcatgtga agatggctga tgaggctgtg tgtgtggggc ctgctcctac atctaagtct      360

tatctgaata tggatgctat tatggaggct attaagaaga caagggctca ggctgtgcat      420

cctgggtatg ggtttctgtc tgagaataag gagtttgcta ggtgtctggc tgctgaggat      480

gtggtgttta ttgggcctga tacacatgct attcaggcta tgggggataa gattgagtct      540

aagctgctgg ctaagaaggc tgaggtgaat acaattcctg ggtttgatgg ggtggtgaag      600

gatgctgagg aggctgtgag gattgctagg gagattgggt atcctgtgat gattaaggct      660

tctgctgggg ggggggggaa ggggatgagg attgcttggg atgatgagga gacaagggat      720

gggtttaggc tgtcttctca ggaggctgct tcttcttttg gggatgatag gctgctgatt      780

gagaagttta ttgataatcc taggcatatt gagattcagg tgctggggga taagcatggg      840

aatgctctgt ggctgaatga gagggagtgt tctattcaga ggaggaatca gaaggtggtg      900

gaggaggctc cttctatttt tctggatgct gagacaagga gggctatggg ggagcaggct      960

gtggctctgg ctagggctgt gaagtattct tctgctggga cagtggagtt tctggtggat     1020

tctaagaaga atttttattt tctggagatg aatacaaggc tgcaggtgga gcatcctgtg     1080

acagagtgta ttacagggct ggatctggtg caggagatga ttagggtggc taaggggtat     1140

cctctgaggc ataagcaggc tgatattagg attaatgggt gggctgtgga gtgtagggtg     1200

tatgctgagg atccttataa gtcttttggg ctgccttcta ttgggaggct gtctcagtat     1260

caggagcctc tgcatctgcc tggggtgagg gtggattctg ggattcagcc tgggtctgat     1320

atttctattt attatgatcc tatgatttct aagctgatta catatgggtc tgataggaca     1380

gaggctctga agaggatggc tgatgctctg gataattatg tgattagggg ggtgacacat     1440

aatattgctc tgctgaggga ggtgattatt aattctaggt ttgtgaaggg ggatatttct     1500

acaaagtttc tgtctgatgt gtatcctgat gggtttaagg ggcatatgct gacaaagtct     1560

gagaagaatc agctgctggc tattgcttct tctctgtttg tggcttttca gctgagggct     1620

cagcattttc aggagaattc taggatgcct gtgattaagc ctgatattgc taattgggag     1680

ctgtctgtga agctgcatga taaggtgcat acagtggtgg cttctaataa tgggtctgtg     1740

ttttctgtgg aggtggatgg gtctaagctg aatgtgacat ctacatggaa tctggcttct     1800

cctctgctgt ctgtgtctgt ggatgggaca cagaggacag tgcagtgtct gtctagggag     1860

gctgggggga atatgtctat tcagtttctg gggacagtgt ataaggtgaa tattctgaca     1920

aggctggctg ctgagctgaa taagtttatg ctggagaagg tgacagagga tacatcttct     1980

gtgctgaggt ctcctatgcc tggggtggtg gtggctgtgt ctgtgaagcc tggggatgct     2040

gtggctgagg ggcaggagat ttgtgtgatt gaggctatga agatgcagaa ttctatgaca     2100

gctgggaaga cagggacagt gaagtctgtg cattgtcagg ctggggatac agtgggggag     2160

ggggatctgc tggtggagct ggag                                            2184


<210>  8
<211>  1617
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(1617)
<223>  PCCB Coding Sequence - Wild-Type

<400>  8
atggcggcgg cattacgggt ggcggcggtc ggggcaaggc tcagcgttct ggcgagcggt       60

ctccgcgccg cggtccgcag cctttgcagc caggccacct ctgttaacga acgcatcgaa      120

aacaagcgcc ggaccgcgct gctgggaggg ggccaacgcc gtattgacgc gcagcacaag      180

cgaggaaagc taacagccag ggagaggatc agtctcttgc tggaccctgg cagctttgtt      240

gagagcgaca tgtttgtgga acacagatgt gcagattttg gaatggctgc tgataagaat      300

aagtttcctg gagacagcgt ggtcactgga cgaggccgaa tcaatggaag attggtttat      360

gtcttcagtc aggattttac agtttttgga ggcagtctgt caggagcaca tgcccaaaag      420

atctgcaaaa tcatggacca ggccataacg gtgggggctc cagtgattgg gctgaatgac      480

tctgggggag cacggatcca agaaggagtg gagtctttgg ctggctatgc agacatcttt      540

ctgaggaatg ttacggcatc cggagtcatc cctcagattt ctctgatcat gggcccatgt      600

gctggtgggg ccgtctactc cccagcccta acagacttca cgttcatggt aaaggacacc      660

tcctacctgt tcatcactgg ccctgatgtt gtgaagtctg tcaccaatga ggatgttacc      720

caggaggagc tcggtggtgc caagacccac accaccatgt caggtgtggc ccacagagct      780

tttgaaaatg atgttgatgc cttgtgtaat ctccgggatt tcttcaacta cctgcccctg      840

agcagtcagg acccggctcc cgtccgtgag tgccacgatc ccagtgaccg tctggttcct      900

gagcttgaca caattgtccc tttggaatca accaaagcct acaacatggt ggacatcata      960

cactctgttg ttgatgagcg tgaatttttt gagatcatgc ccaattatgc caagaacatc     1020

attgttggtt ttgcaagaat gaatgggagg actgttggaa ttgttggcaa ccaacctaag     1080

gtggcctcag gatgcttgga tattaattca tctgtgaaag gggctcgttt tgtcagattc     1140

tgtgatgcat tcaatattcc actcatcact tttgttgatg tccctggctt tctacctggc     1200

acagcacagg aatacggggg catcatccgg catggtgcca agcttctcta cgcatttgct     1260

gaggcaactg tacccaaagt cacagtcatc accaggaagg cctatggagg tgcctatgat     1320

gtcatgagct ctaagcacct ttgtggtgat accaactatg cctggcccac cgcagagatt     1380

gcagtcatgg gagcaaaggg cgctgtggag atcatcttca aagggcatga gaatgtggaa     1440

gctgctcagg cagagtacat cgagaagttt gccaaccctt tccctgcagc agtgcgaggg     1500

tttgtggatg acatcatcca accttcttcc acacgtgccc gaatctgctg tgacctggat     1560

gtcttggcca gcaagaaggt acaacgtcct tggagaaaac atgcaaatat tccattg        1617


<210>  9
<211>  1617
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: PCCB Coding Sequence - 
       Codon-Optimized

<400>  9
atggctgccg ccctgcgcgt ggcggccgtg ggagcaagac tgtccgtgct ggcgtcgggc       60

ttgagagcgg ccgtgcggag cctgtgctca caagcaacct cggtgaacga acgcatcgag      120

aacaagcgca ggactgcgct gctgggcggg ggccagcgca ggatcgacgc acagcataag      180

cgcggaaagc tgaccgcccg cgagcggatt tccctgctcc tggatcctgg aagcttcgtg      240

gagtccgaca tgttcgtgga gcaccgctgc gccgacttcg ggatggctgc cgacaagaac      300

aagttccccg gggactcagt ggtcactggt cgcggaagaa tcaatggccg gctcgtctac      360

gtgttctcac aagactttac tgtgttcggc ggctccctgt cgggagccca cgcgcaaaag      420

atctgcaaga ttatggatca ggccatcact gtgggagcgc ctgtgattgg actcaacgac      480

tccgggggag caagaatcca ggaaggagtg gaaagccttg ccggctacgc tgacatcttc      540

ctccggaacg tgaccgcctc tggagtgatt ccgcaaatct ccctgatcat gggaccatgt      600

gccgggggcg ccgtgtactc cccggcgctg actgacttca ctttcatggt caaggacaca      660

tcctacctgt tcatcaccgg tcccgacgtc gtgaagtccg tgaccaacga ggatgtgacc      720

caggaagaac tggggggggc caagacgcat accaccatgt cgggagtggc ccaccgggcc      780

ttcgagaacg atgtggacgc cttgtgcaac cttcgggact tcttcaatta tctcccgctg      840

agcagccagg atccggcccc agtgcgggaa tgccacgacc cttcggatcg gttggtgcct      900

gagctggata ccatcgtgcc cctcgaatcc accaaggctt acaacatggt cgacatcatt      960

cactccgtgg tggacgagag ggaattcttc gagattatgc cgaactacgc caagaacatc     1020

attgtcggat tcgcccgcat gaacggtcga actgtgggca ttgtcggaaa ccagcctaaa     1080

gtggcctccg gttgcctgga catcaactca agcgtgaagg gtgccagatt tgtgcggttt     1140

tgtgacgcgt tcaatattcc gctgatcacc ttcgtcgacg tcccgggctt cctgcctggg     1200

accgcccagg aatacggcgg catcatcaga cacggcgcga agctcctcta cgcgttcgcg     1260

gaagccaccg tgcccaaggt caccgtgatc actcgcaagg catacggcgg cgcatacgat     1320

gtgatgtcct ccaagcacct gtgtggcgac accaactacg cctggcccac cgccgagatc     1380

gccgtgatgg gtgccaaggg tgctgtcgag atcatcttca agggacatga aaacgtggaa     1440

gctgcccagg ccgagtacat tgaaaagttc gctaacccct tccctgccgc cgtgcgggga     1500

tttgtggatg acattatcca gccgagctcg accagggcca gaatctgctg cgatcttgat     1560

gtgttggcca gcaaaaaggt ccagcggccc tggcggaaac acgccaacat tccactg        1617


<210>  10
<211>  1617
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: PCCB Coding Sequence - 
       Codon-Optimized

<400>  10
atggccgcgg cgcttagagt ggccgctgtg ggagccaggc tgagcgtgct ggccagcggt       60

ctgcgcgccg cagtgcgctc gctgtgtagc caggctacct ccgtgaatga gcggatcgaa      120

aacaagcggc gcaccgccct gttgggcggc ggacagcggc gaattgacgc ccaacacaag      180

cggggaaagc tcactgcgag ggaaagaatc tcactgctgc tcgaccccgg gtcgttcgtg      240

gaatcggata tgtttgtcga acatagatgc gcagatttcg gaatggccgc tgacaagaac      300

aagttcccgg gagattccgt cgtgaccgga agggggcgca ttaacgggag acttgtgtac      360

gtgttcagcc aggatttcac ggtgttcggc ggatcactga gcggtgcaca tgcacagaag      420

atctgcaaga tcatggacca ggccattacc gtcggggcac ctgtgatcgg cctgaatgat      480

tcgggcggag cccggattca agagggcgtg gagtcactcg cgggttacgc cgacattttc      540

ctgcggaacg tcaccgcctc cggcgtgatc cctcaaatca gcctcattat gggcccctgc      600

gcgggcggtg ccgtctactc acccgctctg accgatttta ccttcatggt caaggacacc      660

tcctatctgt ttatcactgg accagatgtg gtcaagtccg tgaccaacga ggacgtcact      720

caggaagaac tcggtggagc aaagacccac actactatgt ccggggtcgc gcatagagct      780

ttcgaaaacg acgtcgatgc tctctgtaac ctgagggatt tcttcaacta ccttccactg      840

tcgtcgcaag acccagcccc cgtgcgcgag tgccacgatc cctccgaccg cctggtgccg      900

gaactcgaca ctattgtccc tctggagtca accaaggcct acaacatggt ggacatcatc      960

catagcgtcg tggatgaacg ggagttcttc gaaatcatgc ccaactatgc gaaaaatatc     1020

atcgtgggct ttgcgcggat gaacggccgc accgtgggca tagtgggcaa ccagccgaag     1080

gtcgcgtcgg gatgcctcga tatcaacagc tctgtgaagg gagcgcggtt cgtgcgcttc     1140

tgcgacgcct tcaacatccc cttgatcacc ttcgtggatg tgcctgggtt cttgcctgga     1200

accgcccagg aatacggggg gatcattcgg cacggagcaa aactgctgta cgccttcgcc     1260

gaggccactg tgccgaaagt gacagtgatt acccggaagg cctacggggg tgcctacgac     1320

gtgatgagct ccaagcacct gtgcggagac accaattacg cgtggcctac tgctgaaatt     1380

gctgtcatgg gagccaaggg cgccgtggaa atcattttca agggccacga aaacgtcgag     1440

gccgcccaag ctgagtacat cgagaagttt gccaacccgt ttcctgcggc tgtgcgcggc     1500

ttcgtcgacg atatcattca gccctcgtcc actcgcgccc gcatttgttg tgacctcgac     1560

gtgctggcgt ccaagaaagt gcaaagaccg tggagaaagc atgcaaacat cccgctc        1617


<210>  11
<211>  1617
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: PCCB Coding Sequence - 
       Codon-Optimized

<400>  11
atggctgctg ctctgagggt ggctgctgtg ggggctaggc tgtctgtgct ggcttctggg       60

ctgagggctg ctgtgaggtc tctgtgttct caggctacat ctgtgaatga gaggattgag      120

aataagagga ggacagctct gctggggggg gggcagagga ggattgatgc tcagcataag      180

agggggaagc tgacagctag ggagaggatt tctctgctgc tggatcctgg gtcttttgtg      240

gagtctgata tgtttgtgga gcataggtgt gctgattttg ggatggctgc tgataagaat      300

aagtttcctg gggattctgt ggtgacaggg agggggagga ttaatgggag gctggtgtat      360

gtgttttctc aggattttac agtgtttggg gggtctctgt ctggggctca tgctcagaag      420

atttgtaaga ttatggatca ggctattaca gtgggggctc ctgtgattgg gctgaatgat      480

tctggggggg ctaggattca ggagggggtg gagtctctgg ctgggtatgc tgatattttt      540

ctgaggaatg tgacagcttc tggggtgatt cctcagattt ctctgattat ggggccttgt      600

gctggggggg ctgtgtattc tcctgctctg acagatttta catttatggt gaaggataca      660

tcttatctgt ttattacagg gcctgatgtg gtgaagtctg tgacaaatga ggatgtgaca      720

caggaggagc tggggggggc taagacacat acaacaatgt ctggggtggc tcatagggct      780

tttgagaatg atgtggatgc tctgtgtaat ctgagggatt tttttaatta tctgcctctg      840

tcttctcagg atcctgctcc tgtgagggag tgtcatgatc cttctgatag gctggtgcct      900

gagctggata caattgtgcc tctggagtct acaaaggctt ataatatggt ggatattatt      960

cattctgtgg tggatgagag ggagtttttt gagattatgc ctaattatgc taagaatatt     1020

attgtggggt ttgctaggat gaatgggagg acagtgggga ttgtggggaa tcagcctaag     1080

gtggcttctg ggtgtctgga tattaattct tctgtgaagg gggctaggtt tgtgaggttt     1140

tgtgatgctt ttaatattcc tctgattaca tttgtggatg tgcctgggtt tctgcctggg     1200

acagctcagg agtatggggg gattattagg catggggcta agctgctgta tgcttttgct     1260

gaggctacag tgcctaaggt gacagtgatt acaaggaagg cttatggggg ggcttatgat     1320

gtgatgtctt ctaagcatct gtgtggggat acaaattatg cttggcctac agctgagatt     1380

gctgtgatgg gggctaaggg ggctgtggag attattttta aggggcatga gaatgtggag     1440

gctgctcagg ctgagtatat tgagaagttt gctaatcctt ttcctgctgc tgtgaggggg     1500

tttgtggatg atattattca gccttcttct acaagggcta ggatttgttg tgatctggat     1560

gtgctggctt ctaagaaggt gcagaggcct tggaggaagc atgctaatat tcctctg        1617


<210>  12
<211>  1617
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: PCCB Coding Sequence - 
       Codon-Optimized

<400>  12
atggcagcag cactgagggt ggcagcagtg ggggcaaggc tgtcagtgct ggcatcaggg       60

ctgagggcag cagtgaggtc actgtgttca caggcaacat cagtgaatga gaggattgag      120

aataagagga ggacagcact gctggggggg gggcagagga ggattgatgc acagcataag      180

agggggaagc tgacagcaag ggagaggatt tcactgctgc tggatccagg gtcatttgtg      240

gagtcagata tgtttgtgga gcataggtgt gcagattttg ggatggcagc agataagaat      300

aagtttccag gggattcagt ggtgacaggg agggggagga ttaatgggag gctggtgtat      360

gtgttttcac aggattttac agtgtttggg gggtcactgt caggggcaca tgcacagaag      420

atttgtaaga ttatggatca ggcaattaca gtgggggcac cagtgattgg gctgaatgat      480

tcaggggggg caaggattca ggagggggtg gagtcactgg cagggtatgc agatattttt      540

ctgaggaatg tgacagcatc aggggtgatt ccacagattt cactgattat ggggccatgt      600

gcaggggggg cagtgtattc accagcactg acagatttta catttatggt gaaggataca      660

tcatatctgt ttattacagg gccagatgtg gtgaagtcag tgacaaatga ggatgtgaca      720

caggaggagc tggggggggc aaagacacat acaacaatgt caggggtggc acatagggca      780

tttgagaatg atgtggatgc actgtgtaat ctgagggatt tttttaatta tctgccactg      840

tcatcacagg atccagcacc agtgagggag tgtcatgatc catcagatag gctggtgcca      900

gagctggata caattgtgcc actggagtca acaaaggcat ataatatggt ggatattatt      960

cattcagtgg tggatgagag ggagtttttt gagattatgc caaattatgc aaagaatatt     1020

attgtggggt ttgcaaggat gaatgggagg acagtgggga ttgtggggaa tcagccaaag     1080

gtggcatcag ggtgtctgga tattaattca tcagtgaagg gggcaaggtt tgtgaggttt     1140

tgtgatgcat ttaatattcc actgattaca tttgtggatg tgccagggtt tctgccaggg     1200

acagcacagg agtatggggg gattattagg catggggcaa agctgctgta tgcatttgca     1260

gaggcaacag tgccaaaggt gacagtgatt acaaggaagg catatggggg ggcatatgat     1320

gtgatgtcat caaagcatct gtgtggggat acaaattatg catggccaac agcagagatt     1380

gcagtgatgg gggcaaaggg ggcagtggag attattttta aggggcatga gaatgtggag     1440

gcagcacagg cagagtatat tgagaagttt gcaaatccat ttccagcagc agtgaggggg     1500

tttgtggatg atattattca gccatcatca acaagggcaa ggatttgttg tgatctggat     1560

gtgctggcat caaagaaggt gcagaggcca tggaggaagc atgcaaatat tccactg        1617


<210>  13
<211>  1617
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: PCCB Coding Sequence - 
       Codon-Optimized

<400>  13
atggctgcag cactaagggt agcagcagta ggtgcaaggc tatcagtact agcatcaggt       60

ctaagggcag cagtaaggtc actatgttca caagcaacat cagtaaatga aaggatagaa      120

aataagagga ggacagcact actaggtggt gggcagagga ggattgatgc tcagcataag      180

agggggaagc tgacagctag ggagaggatt tctctgctgc tggatcctgg gtcttttgtg      240

gagtctgata tgtttgtgga gcataggtgt gctgattttg ggatggctgc tgataagaat      300

aagtttcctg gggattctgt ggtgacaggg agggggagga ttaatgggag gctggtgtat      360

gtgttttctc aggattttac agtgtttggg gggtctctgt ctggggctca tgctcagaag      420

atttgtaaga ttatggatca ggctattaca gtgggggctc ctgtgattgg gctgaatgat      480

tctggggggg ctaggattca ggagggggtg gagtctctgg ctgggtatgc tgatattttt      540

ctgaggaatg tgacagcttc tggggtgatt cctcagattt ctctgattat ggggccttgt      600

gctggggggg ctgtgtattc tcctgctctg acagatttta catttatggt gaaggataca      660

tcttatctgt ttattacagg gcctgatgtg gtgaagtctg tgacaaatga ggatgtgaca      720

caggaggagc tggggggggc taagacacat acaacaatgt ctggggtggc tcatagggct      780

tttgagaatg atgtggatgc tctgtgtaat ctgagggatt tttttaatta tctgcctctg      840

tcttctcagg atcctgctcc tgtgagggag tgtcatgatc cttctgatag gctggtgcct      900

gagctggata caattgtgcc tctggagtct acaaaggctt ataatatggt ggatattatt      960

cattctgtgg tggatgagag ggagtttttt gagattatgc ctaattatgc taagaatatt     1020

attgtggggt ttgctaggat gaatgggagg acagtgggga ttgtggggaa tcagcctaag     1080

gtggcttctg ggtgtctgga tattaattct tctgtgaagg gggctaggtt tgtgaggttt     1140

tgtgatgctt ttaatattcc tctgattaca tttgtggatg tgcctgggtt tctgcctggg     1200

acagctcagg agtatggggg gattattagg catggggcta agctgctgta tgcttttgct     1260

gaggctacag tgcctaaggt gacagtgatt acaaggaagg cttatggggg ggcttatgat     1320

gtgatgtctt ctaagcatct gtgtggggat acaaattatg cttggcctac agctgagatt     1380

gctgtgatgg gggctaaggg ggctgtggag attattttta aggggcatga gaatgtggag     1440

gctgctcagg ctgagtatat tgagaagttt gctaatcctt ttcctgctgc tgtgaggggg     1500

tttgtggatg atattattca gccttcttct acaagggcta ggatttgttg tgatctggat     1560

gtgctggctt ctaagaaggt gcagaggcct tggaggaagc atgctaatat tcctctg        1617


<210>  14
<211>  2217
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: AAV8 VP1 Nucleic Acid 
       Sequence

<400>  14
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagttc cagaccctca acctctcgga gaacctccag cagcgccctc tggtgtggga      600

cctaatacaa tggctgcagg cggtggcgca ccaatggcag acaataacga aggcgccgac      660

ggagtgggta gttcctcggg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgaac ctgggccctg cccacctaca acaaccacct ctacaagcaa      780

atctccaacg ggacatcggg aggagccacc aacgacaaca cctacttcgg ctacagcacc      840

ccctgggggt attttgactt taacagattc cactgccact tttcaccacg tgactggcag      900

cgactcatca acaacaactg gggattccgg cccaagagac tcagcttcaa gctcttcaac      960

atccaggtca aggaggtcac gcagaatgaa ggcaccaaga ccatcgccaa taacctcacc     1020

agcaccatcc aggtgtttac ggactcggag taccagctgc cgtacgttct cggctctgcc     1080

caccagggct gcctgcctcc gttcccggcg gacgtgttca tgattcccca gtacggctac     1140

ctaacactca acaacggtag tcaggccgtg ggacgctcct ccttctactg cctggaatac     1200

tttccttcgc agatgctgag aaccggcaac aacttccagt ttacttacac cttcgaggac     1260

gtgcctttcc acagcagcta cgcccacagc cagagcttgg accggctgat gaatcctctg     1320

attgaccagt acctgtacta cttgtctcgg actcaaacaa caggaggcac ggcaaatacg     1380

cagactctgg gcttcagcca aggtgggcct aatacaatgg ccaatcaggc aaagaactgg     1440

ctgccaggac cctgttaccg ccaacaacgc gtctcaacga caaccgggca aaacaacaat     1500

agcaactttg cctggactgc tgggaccaaa taccatctga atggaagaaa ttcattggct     1560

aatcctggca tcgctatggc aacacacaaa gacgacgagg agcgtttttt tcccagtaac     1620

gggatcctga tttttggcaa acaaaatgct gccagagaca atgcggatta cagcgatgtc     1680

atgctcacca gcgaggaaga aatcaaaacc actaaccctg tggctacaga ggaatacggt     1740

atcgtggcag ataacttgca gcagcaaaac acggctcctc aaattggaac tgtcaacagc     1800

cagggggcct tacccggtat ggtctggcag aaccgggacg tgtacctgca gggtcccatc     1860

tgggccaaga ttcctcacac ggacggcaac ttccacccgt ctccgctgat gggcggcttt     1920

ggcctgaaac atcctccgcc tcagatcctg atcaagaaca cgcctgtacc tgcggatcct     1980

ccgaccacct tcaaccagtc aaagctgaac tctttcatca cgcaatacag caccggacag     2040

gtcagcgtgg aaattgaatg ggagctgcag aaggaaaaca gcaagcgctg gaaccccgag     2100

atccagtaca cctccaacta ctacaaatct acaagtgtgg actttgctgt taatacagaa     2160

ggcgtgtact ctgaaccccg ccccattggc acccgttacc tcacccgtaa tctgtaa        2217


<210>  15
<211>  738
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: AAV8 VP1 Amino Acid Sequence

<400>  15

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 
            180                 185                 190         


Pro Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ala Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Thr Tyr 
                405                 410                 415     


Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Thr Thr Gly Gly Thr Ala Asn Thr Gln Thr Leu Gly 
    450                 455                 460                 


Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asn Ser Leu Ala Asn Pro Gly Ile Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Asn Gly Ile Leu Ile 
    530                 535                 540                 


Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala 
            580                 585                 590         


Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 
        595                 600                 605             


Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 
    610                 615                 620                 


Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625                 630                 635                 640 


Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 
                645                 650                 655     


Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe 
            660                 665                 670         


Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 
        675                 680                 685             


Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 
    690                 695                 700                 


Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu 
705                 710                 715                 720 


Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 
                725                 730                 735     


Asn Leu 
        


<210>  16
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: AAV9 VP1 Nucleic Acid 
       Sequence

<400>  16
atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga aggaattcgc       60

gagtggtggg ctttgaaacc tggagcccct caacccaagg caaatcaaca acatcaagac      120

aacgctcgag gtcttgtgct tccgggttac aaataccttg gacccggcaa cggactcgac      180

aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc      480

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag      540

tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct      600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga      660

gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcagcga      900

ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct cttcaacatt      960

caggtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac     1080

gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatctg     1140

acgcttaatg atggaagcca ggccgtgggt cgttcgtcct tttactgcct ggaatatttc     1200

ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta     1260

cctttccata gcagctacgc tcacagccaa agcctggacc gactaatgaa tccactcatc     1320

gaccaatact tgtactatct ctcaaagact attaacggtt ctggacagaa tcaacaaacg     1380

ctaaaattca gtgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct     1440

ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa     1500

tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct     1560

ggacctgcta tggccagcca caaagaagga gaggaccgtt tctttccttt gtctggatct     1620

ttaatttttg gcaaacaagg aactggaaga gacaacgtgg atgcggacaa agtcatgata     1680

accaacgaag aagaaattaa aactactaac ccggtagcaa cggagtccta tggacaagtg     1740

gccacaaacc accagagtgc ccaagcacag gcgcagaccg gctgggttca aaaccaagga     1800

atacttccgg gtatggtttg gcaggacaga gatgtgtacc tgcaaggacc catttgggcc     1860

aaaattcctc acacggacgg caactttcac ccttctccgc tgatgggagg gtttggaatg     1920

aagcacccgc ctcctcagat cctcatcaaa aacacacctg tacctgcgga tcctccaacg     1980

gccttcaaca aggacaagct gaactctttc atcacccagt attctactgg ccaagtcagc     2040

gtggagatcg agtgggagct gcagaaggaa aacagcaagc gctggaaccc ggagatccag     2100

tacacttcca actattacaa gtctaataat gttgaatttg ctgttaatac tgaaggtgta     2160

tatagtgaac cccgccccat tggcaccaga tacctgactc gtaatctgta a              2211


<210>  17
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: AAV9 VP1 Amino Acid Sequence

<400>  17

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  18
<211>  145
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: AAV2 ITR

<400>  18
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc       60

cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg      120

gccaactcca tcactagggg ttcct                                            145


<210>  19
<211>  728
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(728)
<223>  PCCA Amino Acid Sequence

<400>  19

Met Ala Gly Phe Trp Val Gly Thr Ala Pro Leu Val Ala Ala Gly Arg 
1               5                   10                  15      


Arg Gly Arg Trp Pro Pro Gln Gln Leu Met Leu Ser Ala Ala Leu Arg 
            20                  25                  30          


Thr Leu Lys His Val Leu Tyr Tyr Ser Arg Gln Cys Leu Met Val Ser 
        35                  40                  45              


Arg Asn Leu Gly Ser Val Gly Tyr Asp Pro Asn Glu Lys Thr Phe Asp 
    50                  55                  60                  


Lys Ile Leu Val Ala Asn Arg Gly Glu Ile Ala Cys Arg Val Ile Arg 
65                  70                  75                  80  


Thr Cys Lys Lys Met Gly Ile Lys Thr Val Ala Ile His Ser Asp Val 
                85                  90                  95      


Asp Ala Ser Ser Val His Val Lys Met Ala Asp Glu Ala Val Cys Val 
            100                 105                 110         


Gly Pro Ala Pro Thr Ser Lys Ser Tyr Leu Asn Met Asp Ala Ile Met 
        115                 120                 125             


Glu Ala Ile Lys Lys Thr Arg Ala Gln Ala Val His Pro Gly Tyr Gly 
    130                 135                 140                 


Phe Leu Ser Glu Asn Lys Glu Phe Ala Arg Cys Leu Ala Ala Glu Asp 
145                 150                 155                 160 


Val Val Phe Ile Gly Pro Asp Thr His Ala Ile Gln Ala Met Gly Asp 
                165                 170                 175     


Lys Ile Glu Ser Lys Leu Leu Ala Lys Lys Ala Glu Val Asn Thr Ile 
            180                 185                 190         


Pro Gly Phe Asp Gly Val Val Lys Asp Ala Glu Glu Ala Val Arg Ile 
        195                 200                 205             


Ala Arg Glu Ile Gly Tyr Pro Val Met Ile Lys Ala Ser Ala Gly Gly 
    210                 215                 220                 


Gly Gly Lys Gly Met Arg Ile Ala Trp Asp Asp Glu Glu Thr Arg Asp 
225                 230                 235                 240 


Gly Phe Arg Leu Ser Ser Gln Glu Ala Ala Ser Ser Phe Gly Asp Asp 
                245                 250                 255     


Arg Leu Leu Ile Glu Lys Phe Ile Asp Asn Pro Arg His Ile Glu Ile 
            260                 265                 270         


Gln Val Leu Gly Asp Lys His Gly Asn Ala Leu Trp Leu Asn Glu Arg 
        275                 280                 285             


Glu Cys Ser Ile Gln Arg Arg Asn Gln Lys Val Val Glu Glu Ala Pro 
    290                 295                 300                 


Ser Ile Phe Leu Asp Ala Glu Thr Arg Arg Ala Met Gly Glu Gln Ala 
305                 310                 315                 320 


Val Ala Leu Ala Arg Ala Val Lys Tyr Ser Ser Ala Gly Thr Val Glu 
                325                 330                 335     


Phe Leu Val Asp Ser Lys Lys Asn Phe Tyr Phe Leu Glu Met Asn Thr 
            340                 345                 350         


Arg Leu Gln Val Glu His Pro Val Thr Glu Cys Ile Thr Gly Leu Asp 
        355                 360                 365             


Leu Val Gln Glu Met Ile Arg Val Ala Lys Gly Tyr Pro Leu Arg His 
    370                 375                 380                 


Lys Gln Ala Asp Ile Arg Ile Asn Gly Trp Ala Val Glu Cys Arg Val 
385                 390                 395                 400 


Tyr Ala Glu Asp Pro Tyr Lys Ser Phe Gly Leu Pro Ser Ile Gly Arg 
                405                 410                 415     


Leu Ser Gln Tyr Gln Glu Pro Leu His Leu Pro Gly Val Arg Val Asp 
            420                 425                 430         


Ser Gly Ile Gln Pro Gly Ser Asp Ile Ser Ile Tyr Tyr Asp Pro Met 
        435                 440                 445             


Ile Ser Lys Leu Ile Thr Tyr Gly Ser Asp Arg Thr Glu Ala Leu Lys 
    450                 455                 460                 


Arg Met Ala Asp Ala Leu Asp Asn Tyr Val Ile Arg Gly Val Thr His 
465                 470                 475                 480 


Asn Ile Ala Leu Leu Arg Glu Val Ile Ile Asn Ser Arg Phe Val Lys 
                485                 490                 495     


Gly Asp Ile Ser Thr Lys Phe Leu Ser Asp Val Tyr Pro Asp Gly Phe 
            500                 505                 510         


Lys Gly His Met Leu Thr Lys Ser Glu Lys Asn Gln Leu Leu Ala Ile 
        515                 520                 525             


Ala Ser Ser Leu Phe Val Ala Phe Gln Leu Arg Ala Gln His Phe Gln 
    530                 535                 540                 


Glu Asn Ser Arg Met Pro Val Ile Lys Pro Asp Ile Ala Asn Trp Glu 
545                 550                 555                 560 


Leu Ser Val Lys Leu His Asp Lys Val His Thr Val Val Ala Ser Asn 
                565                 570                 575     


Asn Gly Ser Val Phe Ser Val Glu Val Asp Gly Ser Lys Leu Asn Val 
            580                 585                 590         


Thr Ser Thr Trp Asn Leu Ala Ser Pro Leu Leu Ser Val Ser Val Asp 
        595                 600                 605             


Gly Thr Gln Arg Thr Val Gln Cys Leu Ser Arg Glu Ala Gly Gly Asn 
    610                 615                 620                 


Met Ser Ile Gln Phe Leu Gly Thr Val Tyr Lys Val Asn Ile Leu Thr 
625                 630                 635                 640 


Arg Leu Ala Ala Glu Leu Asn Lys Phe Met Leu Glu Lys Val Thr Glu 
                645                 650                 655     


Asp Thr Ser Ser Val Leu Arg Ser Pro Met Pro Gly Val Val Val Ala 
            660                 665                 670         


Val Ser Val Lys Pro Gly Asp Ala Val Ala Glu Gly Gln Glu Ile Cys 
        675                 680                 685             


Val Ile Glu Ala Met Lys Met Gln Asn Ser Met Thr Ala Gly Lys Thr 
    690                 695                 700                 


Gly Thr Val Lys Ser Val His Cys Gln Ala Gly Asp Thr Val Gly Glu 
705                 710                 715                 720 


Gly Asp Leu Leu Val Glu Leu Glu 
                725             


<210>  20
<211>  539
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(539)
<223>  PCCB Amino Acid Sequence

<400>  20

Met Ala Ala Ala Leu Arg Val Ala Ala Val Gly Ala Arg Leu Ser Val 
1               5                   10                  15      


Leu Ala Ser Gly Leu Arg Ala Ala Val Arg Ser Leu Cys Ser Gln Ala 
            20                  25                  30          


Thr Ser Val Asn Glu Arg Ile Glu Asn Lys Arg Arg Thr Ala Leu Leu 
        35                  40                  45              


Gly Gly Gly Gln Arg Arg Ile Asp Ala Gln His Lys Arg Gly Lys Leu 
    50                  55                  60                  


Thr Ala Arg Glu Arg Ile Ser Leu Leu Leu Asp Pro Gly Ser Phe Val 
65                  70                  75                  80  


Glu Ser Asp Met Phe Val Glu His Arg Cys Ala Asp Phe Gly Met Ala 
                85                  90                  95      


Ala Asp Lys Asn Lys Phe Pro Gly Asp Ser Val Val Thr Gly Arg Gly 
            100                 105                 110         


Arg Ile Asn Gly Arg Leu Val Tyr Val Phe Ser Gln Asp Phe Thr Val 
        115                 120                 125             


Phe Gly Gly Ser Leu Ser Gly Ala His Ala Gln Lys Ile Cys Lys Ile 
    130                 135                 140                 


Met Asp Gln Ala Ile Thr Val Gly Ala Pro Val Ile Gly Leu Asn Asp 
145                 150                 155                 160 


Ser Gly Gly Ala Arg Ile Gln Glu Gly Val Glu Ser Leu Ala Gly Tyr 
                165                 170                 175     


Ala Asp Ile Phe Leu Arg Asn Val Thr Ala Ser Gly Val Ile Pro Gln 
            180                 185                 190         


Ile Ser Leu Ile Met Gly Pro Cys Ala Gly Gly Ala Val Tyr Ser Pro 
        195                 200                 205             


Ala Leu Thr Asp Phe Thr Phe Met Val Lys Asp Thr Ser Tyr Leu Phe 
    210                 215                 220                 


Ile Thr Gly Pro Asp Val Val Lys Ser Val Thr Asn Glu Asp Val Thr 
225                 230                 235                 240 


Gln Glu Glu Leu Gly Gly Ala Lys Thr His Thr Thr Met Ser Gly Val 
                245                 250                 255     


Ala His Arg Ala Phe Glu Asn Asp Val Asp Ala Leu Cys Asn Leu Arg 
            260                 265                 270         


Asp Phe Phe Asn Tyr Leu Pro Leu Ser Ser Gln Asp Pro Ala Pro Val 
        275                 280                 285             


Arg Glu Cys His Asp Pro Ser Asp Arg Leu Val Pro Glu Leu Asp Thr 
    290                 295                 300                 


Ile Val Pro Leu Glu Ser Thr Lys Ala Tyr Asn Met Val Asp Ile Ile 
305                 310                 315                 320 


His Ser Val Val Asp Glu Arg Glu Phe Phe Glu Ile Met Pro Asn Tyr 
                325                 330                 335     


Ala Lys Asn Ile Ile Val Gly Phe Ala Arg Met Asn Gly Arg Thr Val 
            340                 345                 350         


Gly Ile Val Gly Asn Gln Pro Lys Val Ala Ser Gly Cys Leu Asp Ile 
        355                 360                 365             


Asn Ser Ser Val Lys Gly Ala Arg Phe Val Arg Phe Cys Asp Ala Phe 
    370                 375                 380                 


Asn Ile Pro Leu Ile Thr Phe Val Asp Val Pro Gly Phe Leu Pro Gly 
385                 390                 395                 400 


Thr Ala Gln Glu Tyr Gly Gly Ile Ile Arg His Gly Ala Lys Leu Leu 
                405                 410                 415     


Tyr Ala Phe Ala Glu Ala Thr Val Pro Lys Val Thr Val Ile Thr Arg 
            420                 425                 430         


Lys Ala Tyr Gly Gly Ala Tyr Asp Val Met Ser Ser Lys His Leu Cys 
        435                 440                 445             


Gly Asp Thr Asn Tyr Ala Trp Pro Thr Ala Glu Ile Ala Val Met Gly 
    450                 455                 460                 


Ala Lys Gly Ala Val Glu Ile Ile Phe Lys Gly His Glu Asn Val Glu 
465                 470                 475                 480 


Ala Ala Gln Ala Glu Tyr Ile Glu Lys Phe Ala Asn Pro Phe Pro Ala 
                485                 490                 495     


Ala Val Arg Gly Phe Val Asp Asp Ile Ile Gln Pro Ser Ser Thr Arg 
            500                 505                 510         


Ala Arg Ile Cys Cys Asp Leu Asp Val Leu Ala Ser Lys Lys Val Gln 
        515                 520                 525             


Arg Pro Trp Arg Lys His Ala Asn Ile Pro Leu 
    530                 535                 


<210>  21
<211>  276
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: CBA Promoter

<400>  21
tcgaggtgag ccccacgttc tgcttcactc tccccatctc ccccccctcc ccacccccaa       60

ttttgtattt atttattttt taattatttt atgcagcgat gggggcgggg gggggggggg      120

cgcgcgccag gcggggcggg gcggggcgag gggcggggcg gggcgaggcg gagaggtgcg      180

gcggcagcca atcagagcgg cgcgctccga aagtttcctt ttatggcgag gcggcggcgg      240

cggcggccct ataaaaagcg aagcgcgcgg cgggcg                                276


<210>  22
<211>  304
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: CMV Enhancer

<400>  22
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catg                                                                   304


<210>  23
<211>  197
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: SV40 Late polyadenylation 
       signal sequence

<400>  23
atccagacat gataagatac attgatgagt ttggacaaac cacaactaga atgcagtgaa       60

aaaaatgctt tatttgtgaa atttgtgatg ctattgcttt atttgtaacc attataagct      120

gcaataaaca agttaacaac aacaattgca ttcattttat gtttcaggtt cagggggagg      180

tgtgggaggt tttttag                                                     197


<210>  24
<211>  6
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: Consensus Kozak Sequence

<400>  24
gccgcc                                                                   6


<210>  25
<211>  728
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(728)
<223>  UniProtKB-Swiss-Prot Accession No. P05165-1 Amino Acid Sequence

<400>  25

Met Ala Gly Phe Trp Val Gly Thr Ala Pro Leu Val Ala Ala Gly Arg 
1               5                   10                  15      


Arg Gly Arg Trp Pro Pro Gln Gln Leu Met Leu Ser Ala Ala Leu Arg 
            20                  25                  30          


Thr Leu Lys His Val Leu Tyr Tyr Ser Arg Gln Cys Leu Met Val Ser 
        35                  40                  45              


Arg Asn Leu Gly Ser Val Gly Tyr Asp Pro Asn Glu Lys Thr Phe Asp 
    50                  55                  60                  


Lys Ile Leu Val Ala Asn Arg Gly Glu Ile Ala Cys Arg Val Ile Arg 
65                  70                  75                  80  


Thr Cys Lys Lys Met Gly Ile Lys Thr Val Ala Ile His Ser Asp Val 
                85                  90                  95      


Asp Ala Ser Ser Val His Val Lys Met Ala Asp Glu Ala Val Cys Val 
            100                 105                 110         


Gly Pro Ala Pro Thr Ser Lys Ser Tyr Leu Asn Met Asp Ala Ile Met 
        115                 120                 125             


Glu Ala Ile Lys Lys Thr Arg Ala Gln Ala Val His Pro Gly Tyr Gly 
    130                 135                 140                 


Phe Leu Ser Glu Asn Lys Glu Phe Ala Arg Cys Leu Ala Ala Glu Asp 
145                 150                 155                 160 


Val Val Phe Ile Gly Pro Asp Thr His Ala Ile Gln Ala Met Gly Asp 
                165                 170                 175     


Lys Ile Glu Ser Lys Leu Leu Ala Lys Lys Ala Glu Val Asn Thr Ile 
            180                 185                 190         


Pro Gly Phe Asp Gly Val Val Lys Asp Ala Glu Glu Ala Val Arg Ile 
        195                 200                 205             


Ala Arg Glu Ile Gly Tyr Pro Val Met Ile Lys Ala Ser Ala Gly Gly 
    210                 215                 220                 


Gly Gly Lys Gly Met Arg Ile Ala Trp Asp Asp Glu Glu Thr Arg Asp 
225                 230                 235                 240 


Gly Phe Arg Leu Ser Ser Gln Glu Ala Ala Ser Ser Phe Gly Asp Asp 
                245                 250                 255     


Arg Leu Leu Ile Glu Lys Phe Ile Asp Asn Pro Arg His Ile Glu Ile 
            260                 265                 270         


Gln Val Leu Gly Asp Lys His Gly Asn Ala Leu Trp Leu Asn Glu Arg 
        275                 280                 285             


Glu Cys Ser Ile Gln Arg Arg Asn Gln Lys Val Val Glu Glu Ala Pro 
    290                 295                 300                 


Ser Ile Phe Leu Asp Ala Glu Thr Arg Arg Ala Met Gly Glu Gln Ala 
305                 310                 315                 320 


Val Ala Leu Ala Arg Ala Val Lys Tyr Ser Ser Ala Gly Thr Val Glu 
                325                 330                 335     


Phe Leu Val Asp Ser Lys Lys Asn Phe Tyr Phe Leu Glu Met Asn Thr 
            340                 345                 350         


Arg Leu Gln Val Glu His Pro Val Thr Glu Cys Ile Thr Gly Leu Asp 
        355                 360                 365             


Leu Val Gln Glu Met Ile Arg Val Ala Lys Gly Tyr Pro Leu Arg His 
    370                 375                 380                 


Lys Gln Ala Asp Ile Arg Ile Asn Gly Trp Ala Val Glu Cys Arg Val 
385                 390                 395                 400 


Tyr Ala Glu Asp Pro Tyr Lys Ser Phe Gly Leu Pro Ser Ile Gly Arg 
                405                 410                 415     


Leu Ser Gln Tyr Gln Glu Pro Leu His Leu Pro Gly Val Arg Val Asp 
            420                 425                 430         


Ser Gly Ile Gln Pro Gly Ser Asp Ile Ser Ile Tyr Tyr Asp Pro Met 
        435                 440                 445             


Ile Ser Lys Leu Ile Thr Tyr Gly Ser Asp Arg Thr Glu Ala Leu Lys 
    450                 455                 460                 


Arg Met Ala Asp Ala Leu Asp Asn Tyr Val Ile Arg Gly Val Thr His 
465                 470                 475                 480 


Asn Ile Ala Leu Leu Arg Glu Val Ile Ile Asn Ser Arg Phe Val Lys 
                485                 490                 495     


Gly Asp Ile Ser Thr Lys Phe Leu Ser Asp Val Tyr Pro Asp Gly Phe 
            500                 505                 510         


Lys Gly His Met Leu Thr Lys Ser Glu Lys Asn Gln Leu Leu Ala Ile 
        515                 520                 525             


Ala Ser Ser Leu Phe Val Ala Phe Gln Leu Arg Ala Gln His Phe Gln 
    530                 535                 540                 


Glu Asn Ser Arg Met Pro Val Ile Lys Pro Asp Ile Ala Asn Trp Glu 
545                 550                 555                 560 


Leu Ser Val Lys Leu His Asp Lys Val His Thr Val Val Ala Ser Asn 
                565                 570                 575     


Asn Gly Ser Val Phe Ser Val Glu Val Asp Gly Ser Lys Leu Asn Val 
            580                 585                 590         


Thr Ser Thr Trp Asn Leu Ala Ser Pro Leu Leu Ser Val Ser Val Asp 
        595                 600                 605             


Gly Thr Gln Arg Thr Val Gln Cys Leu Ser Arg Glu Ala Gly Gly Asn 
    610                 615                 620                 


Met Ser Ile Gln Phe Leu Gly Thr Val Tyr Lys Val Asn Ile Leu Thr 
625                 630                 635                 640 


Arg Leu Ala Ala Glu Leu Asn Lys Phe Met Leu Glu Lys Val Thr Glu 
                645                 650                 655     


Asp Thr Ser Ser Val Leu Arg Ser Pro Met Pro Gly Val Val Val Ala 
            660                 665                 670         


Val Ser Val Lys Pro Gly Asp Ala Val Ala Glu Gly Gln Glu Ile Cys 
        675                 680                 685             


Val Ile Glu Ala Met Lys Met Gln Asn Ser Met Thr Ala Gly Lys Thr 
    690                 695                 700                 


Gly Thr Val Lys Ser Val His Cys Gln Ala Gly Asp Thr Val Gly Glu 
705                 710                 715                 720 


Gly Asp Leu Leu Val Glu Leu Glu 
                725             


<210>  26
<211>  702
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(702)
<223>  UniProtKB-Swiss-Prot Accession No. P05165-2 Amino Acid Sequence

<400>  26

Met Ala Gly Phe Trp Val Gly Thr Ala Pro Leu Val Ala Ala Gly Arg 
1               5                   10                  15      


Arg Gly Arg Trp Pro Pro Gln Gln Leu Met Leu Ser Ala Ala Leu Arg 
            20                  25                  30          


Thr Leu Lys Thr Phe Asp Lys Ile Leu Val Ala Asn Arg Gly Glu Ile 
        35                  40                  45              


Ala Cys Arg Val Ile Arg Thr Cys Lys Lys Met Gly Ile Lys Thr Val 
    50                  55                  60                  


Ala Ile His Ser Asp Val Asp Ala Ser Ser Val His Val Lys Met Ala 
65                  70                  75                  80  


Asp Glu Ala Val Cys Val Gly Pro Ala Pro Thr Ser Lys Ser Tyr Leu 
                85                  90                  95      


Asn Met Asp Ala Ile Met Glu Ala Ile Lys Lys Thr Arg Ala Gln Ala 
            100                 105                 110         


Val His Pro Gly Tyr Gly Phe Leu Ser Glu Asn Lys Glu Phe Ala Arg 
        115                 120                 125             


Cys Leu Ala Ala Glu Asp Val Val Phe Ile Gly Pro Asp Thr His Ala 
    130                 135                 140                 


Ile Gln Ala Met Gly Asp Lys Ile Glu Ser Lys Leu Leu Ala Lys Lys 
145                 150                 155                 160 


Ala Glu Val Asn Thr Ile Pro Gly Phe Asp Gly Val Val Lys Asp Ala 
                165                 170                 175     


Glu Glu Ala Val Arg Ile Ala Arg Glu Ile Gly Tyr Pro Val Met Ile 
            180                 185                 190         


Lys Ala Ser Ala Gly Gly Gly Gly Lys Gly Met Arg Ile Ala Trp Asp 
        195                 200                 205             


Asp Glu Glu Thr Arg Asp Gly Phe Arg Leu Ser Ser Gln Glu Ala Ala 
    210                 215                 220                 


Ser Ser Phe Gly Asp Asp Arg Leu Leu Ile Glu Lys Phe Ile Asp Asn 
225                 230                 235                 240 


Pro Arg His Ile Glu Ile Gln Val Leu Gly Asp Lys His Gly Asn Ala 
                245                 250                 255     


Leu Trp Leu Asn Glu Arg Glu Cys Ser Ile Gln Arg Arg Asn Gln Lys 
            260                 265                 270         


Val Val Glu Glu Ala Pro Ser Ile Phe Leu Asp Ala Glu Thr Arg Arg 
        275                 280                 285             


Ala Met Gly Glu Gln Ala Val Ala Leu Ala Arg Ala Val Lys Tyr Ser 
    290                 295                 300                 


Ser Ala Gly Thr Val Glu Phe Leu Val Asp Ser Lys Lys Asn Phe Tyr 
305                 310                 315                 320 


Phe Leu Glu Met Asn Thr Arg Leu Gln Val Glu His Pro Val Thr Glu 
                325                 330                 335     


Cys Ile Thr Gly Leu Asp Leu Val Gln Glu Met Ile Arg Val Ala Lys 
            340                 345                 350         


Gly Tyr Pro Leu Arg His Lys Gln Ala Asp Ile Arg Ile Asn Gly Trp 
        355                 360                 365             


Ala Val Glu Cys Arg Val Tyr Ala Glu Asp Pro Tyr Lys Ser Phe Gly 
    370                 375                 380                 


Leu Pro Ser Ile Gly Arg Leu Ser Gln Tyr Gln Glu Pro Leu His Leu 
385                 390                 395                 400 


Pro Gly Val Arg Val Asp Ser Gly Ile Gln Pro Gly Ser Asp Ile Ser 
                405                 410                 415     


Ile Tyr Tyr Asp Pro Met Ile Ser Lys Leu Ile Thr Tyr Gly Ser Asp 
            420                 425                 430         


Arg Thr Glu Ala Leu Lys Arg Met Ala Asp Ala Leu Asp Asn Tyr Val 
        435                 440                 445             


Ile Arg Gly Val Thr His Asn Ile Ala Leu Leu Arg Glu Val Ile Ile 
    450                 455                 460                 


Asn Ser Arg Phe Val Lys Gly Asp Ile Ser Thr Lys Phe Leu Ser Asp 
465                 470                 475                 480 


Val Tyr Pro Asp Gly Phe Lys Gly His Met Leu Thr Lys Ser Glu Lys 
                485                 490                 495     


Asn Gln Leu Leu Ala Ile Ala Ser Ser Leu Phe Val Ala Phe Gln Leu 
            500                 505                 510         


Arg Ala Gln His Phe Gln Glu Asn Ser Arg Met Pro Val Ile Lys Pro 
        515                 520                 525             


Asp Ile Ala Asn Trp Glu Leu Ser Val Lys Leu His Asp Lys Val His 
    530                 535                 540                 


Thr Val Val Ala Ser Asn Asn Gly Ser Val Phe Ser Val Glu Val Asp 
545                 550                 555                 560 


Gly Ser Lys Leu Asn Val Thr Ser Thr Trp Asn Leu Ala Ser Pro Leu 
                565                 570                 575     


Leu Ser Val Ser Val Asp Gly Thr Gln Arg Thr Val Gln Cys Leu Ser 
            580                 585                 590         


Arg Glu Ala Gly Gly Asn Met Ser Ile Gln Phe Leu Gly Thr Val Tyr 
        595                 600                 605             


Lys Val Asn Ile Leu Thr Arg Leu Ala Ala Glu Leu Asn Lys Phe Met 
    610                 615                 620                 


Leu Glu Lys Val Thr Glu Asp Thr Ser Ser Val Leu Arg Ser Pro Met 
625                 630                 635                 640 


Pro Gly Val Val Val Ala Val Ser Val Lys Pro Gly Asp Ala Val Ala 
                645                 650                 655     


Glu Gly Gln Glu Ile Cys Val Ile Glu Ala Met Lys Met Gln Asn Ser 
            660                 665                 670         


Met Thr Ala Gly Lys Thr Gly Thr Val Lys Ser Val His Cys Gln Ala 
        675                 680                 685             


Gly Asp Thr Val Gly Glu Gly Asp Leu Leu Val Glu Leu Glu 
    690                 695                 700         


<210>  27
<211>  681
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(681)
<223>  UniProtKB-Swiss-Prot Accession No. P05165-3 Amino Acid Sequence

<400>  27

Met Ala Gly Phe Trp Val Gly Thr Ala Pro Leu Val Ala Ala Gly Arg 
1               5                   10                  15      


Arg Gly Arg Trp Pro Pro Gln Gln Leu Met Leu Ser Ala Ala Leu Arg 
            20                  25                  30          


Thr Leu Lys His Val Leu Tyr Tyr Ser Arg Gln Cys Leu Met Val Ser 
        35                  40                  45              


Arg Asn Leu Gly Ser Val Gly Tyr Asp Pro Asn Glu Lys Thr Phe Asp 
    50                  55                  60                  


Lys Ile Leu Val Ala Asn Arg Gly Glu Ile Ala Cys Arg Val Ile Arg 
65                  70                  75                  80  


Thr Cys Lys Lys Met Gly Ile Lys Thr Val Ala Ile His Ser Asp Val 
                85                  90                  95      


Asp Ala Ser Ser Val His Val Lys Met Ala Asp Glu Ala Val Cys Val 
            100                 105                 110         


Gly Pro Ala Pro Thr Ser Lys Ser Tyr Leu Asn Met Asp Ala Ile Met 
        115                 120                 125             


Glu Ala Ile Lys Lys Thr Arg Ala Gln Ala Val His Pro Gly Tyr Gly 
    130                 135                 140                 


Phe Leu Ser Glu Asn Lys Glu Phe Ala Arg Cys Leu Ala Ala Glu Asp 
145                 150                 155                 160 


Val Val Phe Ile Gly Pro Asp Thr His Ala Ile Gln Ala Met Gly Asp 
                165                 170                 175     


Lys Ile Glu Ser Lys Leu Leu Ala Lys Lys Ala Glu Val Asn Thr Ile 
            180                 185                 190         


Pro Gly Phe Asp Gly Val Val Lys Asp Ala Glu Glu Ala Val Arg Ile 
        195                 200                 205             


Ala Arg Glu Ile Gly Tyr Pro Val Met Ile Lys Ala Ser Ala Gly Gly 
    210                 215                 220                 


Gly Gly Lys Gly Met Arg Ile Ala Trp Asp Asp Glu Glu Thr Arg Asp 
225                 230                 235                 240 


Gly Phe Arg Leu Ser Ser Gln Glu Ala Ala Ser Ser Phe Gly Asp Asp 
                245                 250                 255     


Arg Leu Leu Ile Glu Lys Phe Ile Asp Asn Pro Arg His Ile Glu Ile 
            260                 265                 270         


Gln Val Leu Gly Asp Lys His Gly Asn Ala Leu Trp Leu Asn Glu Arg 
        275                 280                 285             


Glu Cys Ser Ile Gln Arg Arg Asn Gln Lys Val Val Glu Glu Ala Pro 
    290                 295                 300                 


Ser Ile Phe Leu Asp Ala Glu Thr Arg Arg Ala Met Gly Glu Gln Ala 
305                 310                 315                 320 


Val Ala Leu Ala Arg Ala Val Lys Tyr Ser Ser Ala Gly Thr Val Glu 
                325                 330                 335     


Phe Leu Val Asp Ser Lys Lys Asn Phe Tyr Phe Leu Glu Met Asn Thr 
            340                 345                 350         


Arg Leu Gln Val Glu His Pro Val Thr Glu Cys Ile Thr Gly Leu Asp 
        355                 360                 365             


Leu Val Gln Glu Met Ile Arg Val Ala Lys Gly Tyr Pro Leu Arg His 
    370                 375                 380                 


Lys Gln Ala Asp Ile Arg Ile Asn Gly Trp Ala Val Glu Cys Arg Val 
385                 390                 395                 400 


Tyr Ala Glu Asp Pro Tyr Lys Ser Phe Gly Leu Pro Ser Ile Gly Arg 
                405                 410                 415     


Leu Ser Gln Tyr Gln Glu Pro Leu His Leu Pro Gly Val Arg Val Asp 
            420                 425                 430         


Ser Gly Ile Gln Pro Gly Ser Asp Ile Ser Ile Tyr Tyr Asp Pro Met 
        435                 440                 445             


Ile Ser Lys Leu Ile Thr Tyr Gly Ser Asp Arg Thr Glu Ala Leu Lys 
    450                 455                 460                 


Arg Met Ala Asp Ala Leu Asp Asn Tyr Val Ile Arg Gly Val Thr His 
465                 470                 475                 480 


Asn Ile Ala Leu Leu Arg Glu Val Ile Ile Asn Ser Arg Phe Val Lys 
                485                 490                 495     


Gly Asp Ile Ser Thr Lys Phe Leu Ser Asp Val Tyr Pro Asp Gly Phe 
            500                 505                 510         


Lys Gly His Met Leu Thr Lys Ser Glu Lys Asn Gln Leu Leu Ala Ile 
        515                 520                 525             


Ala Ser Ser Leu Phe Val Ala Phe Gln Leu Arg Ala Gln His Phe Gln 
    530                 535                 540                 


Glu Asn Ser Arg Met Pro Val Ile Lys Pro Asp Ile Ala Asn Trp Glu 
545                 550                 555                 560 


Leu Ser Val Lys Leu His Asp Lys Val His Thr Val Val Ala Ser Asn 
                565                 570                 575     


Asn Gly Ser Val Phe Ser Val Glu Val Asp Gly Ser Lys Leu Asn Val 
            580                 585                 590         


Thr Ser Thr Trp Asn Leu Ala Ser Pro Leu Leu Ser Val Ser Val Asp 
        595                 600                 605             


Gly Thr Gln Arg Thr Val Gln Cys Leu Ser Arg Glu Ala Gly Gly Asn 
    610                 615                 620                 


Met Ser Ile Gln Phe Leu Gly Thr Val Val Ala Glu Gly Gln Glu Ile 
625                 630                 635                 640 


Cys Val Ile Glu Ala Met Lys Met Gln Asn Ser Met Thr Ala Gly Lys 
                645                 650                 655     


Thr Gly Thr Val Lys Ser Val His Cys Gln Ala Gly Asp Thr Val Gly 
            660                 665                 670         


Glu Gly Asp Leu Leu Val Glu Leu Glu 
        675                 680     


<210>  28
<211>  539
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(539)
<223>  UniProtKB-Swiss-Prot Accession No. P05166-1 Amino Acid Sequence

<400>  28

Met Ala Ala Ala Leu Arg Val Ala Ala Val Gly Ala Arg Leu Ser Val 
1               5                   10                  15      


Leu Ala Ser Gly Leu Arg Ala Ala Val Arg Ser Leu Cys Ser Gln Ala 
            20                  25                  30          


Thr Ser Val Asn Glu Arg Ile Glu Asn Lys Arg Arg Thr Ala Leu Leu 
        35                  40                  45              


Gly Gly Gly Gln Arg Arg Ile Asp Ala Gln His Lys Arg Gly Lys Leu 
    50                  55                  60                  


Thr Ala Arg Glu Arg Ile Ser Leu Leu Leu Asp Pro Gly Ser Phe Val 
65                  70                  75                  80  


Glu Ser Asp Met Phe Val Glu His Arg Cys Ala Asp Phe Gly Met Ala 
                85                  90                  95      


Ala Asp Lys Asn Lys Phe Pro Gly Asp Ser Val Val Thr Gly Arg Gly 
            100                 105                 110         


Arg Ile Asn Gly Arg Leu Val Tyr Val Phe Ser Gln Asp Phe Thr Val 
        115                 120                 125             


Phe Gly Gly Ser Leu Ser Gly Ala His Ala Gln Lys Ile Cys Lys Ile 
    130                 135                 140                 


Met Asp Gln Ala Ile Thr Val Gly Ala Pro Val Ile Gly Leu Asn Asp 
145                 150                 155                 160 


Ser Gly Gly Ala Arg Ile Gln Glu Gly Val Glu Ser Leu Ala Gly Tyr 
                165                 170                 175     


Ala Asp Ile Phe Leu Arg Asn Val Thr Ala Ser Gly Val Ile Pro Gln 
            180                 185                 190         


Ile Ser Leu Ile Met Gly Pro Cys Ala Gly Gly Ala Val Tyr Ser Pro 
        195                 200                 205             


Ala Leu Thr Asp Phe Thr Phe Met Val Lys Asp Thr Ser Tyr Leu Phe 
    210                 215                 220                 


Ile Thr Gly Pro Asp Val Val Lys Ser Val Thr Asn Glu Asp Val Thr 
225                 230                 235                 240 


Gln Glu Glu Leu Gly Gly Ala Lys Thr His Thr Thr Met Ser Gly Val 
                245                 250                 255     


Ala His Arg Ala Phe Glu Asn Asp Val Asp Ala Leu Cys Asn Leu Arg 
            260                 265                 270         


Asp Phe Phe Asn Tyr Leu Pro Leu Ser Ser Gln Asp Pro Ala Pro Val 
        275                 280                 285             


Arg Glu Cys His Asp Pro Ser Asp Arg Leu Val Pro Glu Leu Asp Thr 
    290                 295                 300                 


Ile Val Pro Leu Glu Ser Thr Lys Ala Tyr Asn Met Val Asp Ile Ile 
305                 310                 315                 320 


His Ser Val Val Asp Glu Arg Glu Phe Phe Glu Ile Met Pro Asn Tyr 
                325                 330                 335     


Ala Lys Asn Ile Ile Val Gly Phe Ala Arg Met Asn Gly Arg Thr Val 
            340                 345                 350         


Gly Ile Val Gly Asn Gln Pro Lys Val Ala Ser Gly Cys Leu Asp Ile 
        355                 360                 365             


Asn Ser Ser Val Lys Gly Ala Arg Phe Val Arg Phe Cys Asp Ala Phe 
    370                 375                 380                 


Asn Ile Pro Leu Ile Thr Phe Val Asp Val Pro Gly Phe Leu Pro Gly 
385                 390                 395                 400 


Thr Ala Gln Glu Tyr Gly Gly Ile Ile Arg His Gly Ala Lys Leu Leu 
                405                 410                 415     


Tyr Ala Phe Ala Glu Ala Thr Val Pro Lys Val Thr Val Ile Thr Arg 
            420                 425                 430         


Lys Ala Tyr Gly Gly Ala Tyr Asp Val Met Ser Ser Lys His Leu Cys 
        435                 440                 445             


Gly Asp Thr Asn Tyr Ala Trp Pro Thr Ala Glu Ile Ala Val Met Gly 
    450                 455                 460                 


Ala Lys Gly Ala Val Glu Ile Ile Phe Lys Gly His Glu Asn Val Glu 
465                 470                 475                 480 


Ala Ala Gln Ala Glu Tyr Ile Glu Lys Phe Ala Asn Pro Phe Pro Ala 
                485                 490                 495     


Ala Val Arg Gly Phe Val Asp Asp Ile Ile Gln Pro Ser Ser Thr Arg 
            500                 505                 510         


Ala Arg Ile Cys Cys Asp Leu Asp Val Leu Ala Ser Lys Lys Val Gln 
        515                 520                 525             


Arg Pro Trp Arg Lys His Ala Asn Ile Pro Leu 
    530                 535                 


<210>  29
<211>  559
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(559)
<223>  UniProtKB-Swiss-Prot Accession No. P05166-2 Amino Acid Sequence

<400>  29

Met Ala Ala Ala Leu Arg Val Ala Ala Val Gly Ala Arg Leu Ser Val 
1               5                   10                  15      


Leu Ala Ser Gly Leu Arg Ala Ala Val Arg Ser Leu Cys Ser Gln Ala 
            20                  25                  30          


Thr Ser Val Asn Glu Arg Ile Glu Asn Lys Arg Arg Thr Ala Leu Leu 
        35                  40                  45              


Gly Gly Gly Gln Arg Arg Ile Asp Ala Gln His Lys Arg Gly Lys Leu 
    50                  55                  60                  


Thr Ala Arg Glu Arg Ile Ser Leu Leu Leu Asp Pro Gly Ser Phe Val 
65                  70                  75                  80  


Glu Ser Asp Met Phe Val Glu His Arg Cys Ala Asp Phe Gly Met Ala 
                85                  90                  95      


Ala Asp Lys Asn Lys Phe Pro Gly Asp Ser Val Val Thr Gly Arg Gly 
            100                 105                 110         


Arg Ile Asn Gly Arg Leu Val Tyr Val Phe Ser Gln Gln Ile Ile Gly 
        115                 120                 125             


Trp Ala Gln Trp Leu Pro Leu Val Ile Ser Ala Leu Trp Glu Ala Glu 
    130                 135                 140                 


Asp Phe Thr Val Phe Gly Gly Ser Leu Ser Gly Ala His Ala Gln Lys 
145                 150                 155                 160 


Ile Cys Lys Ile Met Asp Gln Ala Ile Thr Val Gly Ala Pro Val Ile 
                165                 170                 175     


Gly Leu Asn Asp Ser Gly Gly Ala Arg Ile Gln Glu Gly Val Glu Ser 
            180                 185                 190         


Leu Ala Gly Tyr Ala Asp Ile Phe Leu Arg Asn Val Thr Ala Ser Gly 
        195                 200                 205             


Val Ile Pro Gln Ile Ser Leu Ile Met Gly Pro Cys Ala Gly Gly Ala 
    210                 215                 220                 


Val Tyr Ser Pro Ala Leu Thr Asp Phe Thr Phe Met Val Lys Asp Thr 
225                 230                 235                 240 


Ser Tyr Leu Phe Ile Thr Gly Pro Asp Val Val Lys Ser Val Thr Asn 
                245                 250                 255     


Glu Asp Val Thr Gln Glu Glu Leu Gly Gly Ala Lys Thr His Thr Thr 
            260                 265                 270         


Met Ser Gly Val Ala His Arg Ala Phe Glu Asn Asp Val Asp Ala Leu 
        275                 280                 285             


Cys Asn Leu Arg Asp Phe Phe Asn Tyr Leu Pro Leu Ser Ser Gln Asp 
    290                 295                 300                 


Pro Ala Pro Val Arg Glu Cys His Asp Pro Ser Asp Arg Leu Val Pro 
305                 310                 315                 320 


Glu Leu Asp Thr Ile Val Pro Leu Glu Ser Thr Lys Ala Tyr Asn Met 
                325                 330                 335     


Val Asp Ile Ile His Ser Val Val Asp Glu Arg Glu Phe Phe Glu Ile 
            340                 345                 350         


Met Pro Asn Tyr Ala Lys Asn Ile Ile Val Gly Phe Ala Arg Met Asn 
        355                 360                 365             


Gly Arg Thr Val Gly Ile Val Gly Asn Gln Pro Lys Val Ala Ser Gly 
    370                 375                 380                 


Cys Leu Asp Ile Asn Ser Ser Val Lys Gly Ala Arg Phe Val Arg Phe 
385                 390                 395                 400 


Cys Asp Ala Phe Asn Ile Pro Leu Ile Thr Phe Val Asp Val Pro Gly 
                405                 410                 415     


Phe Leu Pro Gly Thr Ala Gln Glu Tyr Gly Gly Ile Ile Arg His Gly 
            420                 425                 430         


Ala Lys Leu Leu Tyr Ala Phe Ala Glu Ala Thr Val Pro Lys Val Thr 
        435                 440                 445             


Val Ile Thr Arg Lys Ala Tyr Gly Gly Ala Tyr Asp Val Met Ser Ser 
    450                 455                 460                 


Lys His Leu Cys Gly Asp Thr Asn Tyr Ala Trp Pro Thr Ala Glu Ile 
465                 470                 475                 480 


Ala Val Met Gly Ala Lys Gly Ala Val Glu Ile Ile Phe Lys Gly His 
                485                 490                 495     


Glu Asn Val Glu Ala Ala Gln Ala Glu Tyr Ile Glu Lys Phe Ala Asn 
            500                 505                 510         


Pro Phe Pro Ala Ala Val Arg Gly Phe Val Asp Asp Ile Ile Gln Pro 
        515                 520                 525             


Ser Ser Thr Arg Ala Arg Ile Cys Cys Asp Leu Asp Val Leu Ala Ser 
    530                 535                 540                 


Lys Lys Val Gln Arg Pro Trp Arg Lys His Ala Asn Ile Pro Leu 
545                 550                 555                 


<210>  30
<211>  3843
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: DTC430

<400>  30
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc       60

cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg      120

gccaactcca tcactagggg ttcctcgtta cataacttac ggtaaatggc ccgcctggct      180

gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc      240

caatagggac tttccattga cgtcaatggg tggactattt acggtaaact gcccacttgg      300

cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat      360

ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca      420

tctacgtatt agtcatcgct attaccatgc gtcgaggtga gccccacgtt ctgcttcact      480

ctccccatct cccccccctc cccaccccca attttgtatt tatttatttt ttaattattt      540

tatgcagcga tgggggcggg gggggggggg gcgcgcgcca ggcggggcgg ggcggggcga      600

ggggcggggc ggggcgaggc ggagaggtgc ggcggcagcc aatcagagcg gcgcgctccg      660

aaagtttcct tttatggcga ggcggcggcg gcggcggccc tataaaaagc gaagcgcgcg      720

gcgggcgatc agcttacttg tggtaccgag ctcggatcct gagaacttca gggtgagtct      780

atgggaccct tgatgttttc tttccccttc ttttctatgg ttaagttcat gtcataggaa      840

ggggagaagt aacagggtac acatattgac caaatcaggg taattttgca tttgtaattt      900

taaaaaatgc tttcttcttt taatatactt ttttgtttat cttatttcta atactttccc      960

taatctcttt ctttcagggc aataatgata caatgtatca tgcctctttg caccattcta     1020

aagaataaca gtgataattt ctgggttaag gcaatagcaa tatttctgca tataaatatt     1080

tctgcatata aattgtaact gatgtaagag gtttcatatt gctaatagca gctacaatcc     1140

agctaccatt ctgcttttat tttatggttg ggataaggct ggattattct gagtccaagc     1200

taggcccttt tgctaatcat gttcatacct cttatcttcc tcccacagct cctgggcaac     1260

gtgctggtct gtgtgctggc ccatcacttt ggcaaagaat tgatcgccgc caccatggcg     1320

gggttctggg tcgggacagc accgctggtc gctgccggac ggcgtgggcg gtggccgccg     1380

cagcagctga tgctgagcgc ggcgctgcgg accctgaagc atgttctgta ctattcaaga     1440

cagtgcttaa tggtgtcccg taatcttggt tcagtgggat atgatcctaa tgaaaaaact     1500

tttgataaaa ttcttgttgc taatagagga gaaattgcat gtcgggttat tagaacttgc     1560

aagaagatgg gcattaagac agttgccatc cacagtgatg ttgatgctag ttctgttcat     1620

gtgaaaatgg cggatgaggc tgtctgtgtt ggcccagctc ccaccagtaa aagctacctc     1680

aacatggatg ccatcatgga agccattaag aaaaccaggg cccaagctgt acatccaggt     1740

tatggattcc tttcagaaaa caaagaattt gccagatgtt tggcagcaga agatgtcgtt     1800

ttcattggac ctgacacaca tgctattcaa gccatgggcg acaagattga aagcaaatta     1860

ttagctaaga aagcagaggt taatacaatc cctggctttg atggagtagt caaggatgca     1920

gaagaagctg tcagaattgc aagggaaatt ggctaccctg tcatgatcaa ggcctcagca     1980

ggtggtggtg ggaaaggcat gcgcattgct tgggatgatg aagagaccag ggatggtttt     2040

agattgtcat ctcaagaagc tgcttctagt tttggcgatg atagactact aatagaaaaa     2100

tttattgata atcctcgtca tatagaaatc caggttctag gtgataaaca tgggaatgct     2160

ttatggctta atgaaagaga gtgctcaatt cagagaagaa atcagaaggt ggtggaggaa     2220

gcaccaagca tttttttgga tgcggagact cgaagagcga tgggagaaca agctgtagct     2280

cttgccagag cagtaaaata ttcctctgct gggaccgtgg agttccttgt ggactctaag     2340

aagaattttt atttcttgga aatgaataca agactccagg ttgagcatcc tgtcacagaa     2400

tgcattactg gcctggacct agtccaggaa atgatccgtg ttgctaaggg ctaccctctc     2460

aggcacaaac aagctgatat tcgcatcaac ggctgggcag ttgaatgtcg ggtttatgct     2520

gaggacccct acaagtcttt tggtttacca tctattggga gattgtctca gtaccaagaa     2580

ccgttacatc tacctggtgt ccgagtggac agtggcatcc aaccaggaag tgatattagc     2640

atttattatg atcctatgat ttcaaaacta atcacatatg gctctgatag aactgaggca     2700

ctgaagagaa tggcagatgc actggataac tatgttattc gaggtgttac acataatatt     2760

gcattacttc gagaggtgat aatcaactca cgctttgtaa aaggagacat cagcactaaa     2820

tttctctccg atgtgtatcc tgatggcttc aaaggacaca tgctaaccaa gagtgagaag     2880

aaccagttat tggcaatagc atcatcattg tttgtggcat tccagttaag agcacaacat     2940

tttcaagaaa attcaagaat gcctgttatt aaaccagaca tagccaactg ggagctctca     3000

gtaaaattgc atgataaagt tcataccgta gtagcatcaa acaatgggtc agtgttctcg     3060

gtggaagttg atgggtcgaa actaaatgtg accagcacgt ggaacctggc ttcgccctta     3120

ttgtctgtca gcgttgatgg cactcagagg actgtccagt gtctttctcg agaagcaggt     3180

ggaaacatga gcattcagtt tcttggtaca gtgtacaagg tgaatatctt aaccagactt     3240

gccgcagaat tgaacaaatt tatgctggaa aaagtgactg aggacacaag cagtgttctg     3300

cgttccccga tgcccggagt ggtggtggcc gtctctgtca agcctggaga cgcggtagca     3360

gaaggtcaag aaatttgtgt gattgaagcc atgaaaatgc agaatagtat gacagctggg     3420

aaaactggca cggtgaaatc tgtgcactgt caagctggag acacagttgg agaaggggat     3480

ctgctcgtgg agctggaatg aatccagaca tgataagata cattgatgag tttggacaaa     3540

ccacaactag aatgcagtga aaaaaatgct ttatttgtga aatttgtgat gctattgctt     3600

tatttgtaac cattataagc tgcaataaac aagttaacaa caacaattgc attcatttta     3660

tgtttcaggt tcagggggag gtgtgggagg ttttttagag gaacccctag tgatggagtt     3720

ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gcccgggcaa agcccgggcg     3780

tcgggcgacc tttggtcgcc cggcctcagt gagcgagcga gcgcgcagag agggagtggc     3840

caa                                                                   3843


<210>  31
<211>  3276
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Description of Artificial Sequence: DTC504

<400>  31
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc       60

cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg      120

gccaactcca tcactagggg ttcctcgtta cataacttac ggtaaatggc ccgcctggct      180

gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc      240

caatagggac tttccattga cgtcaatggg tggactattt acggtaaact gcccacttgg      300

cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat      360

ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca      420

tctacgtatt agtcatcgct attaccatgc gtcgaggtga gccccacgtt ctgcttcact      480

ctccccatct cccccccctc cccaccccca attttgtatt tatttatttt ttaattattt      540

tatgcagcga tgggggcggg gggggggggg gcgcgcgcca ggcggggcgg ggcggggcga      600

ggggcggggc ggggcgaggc ggagaggtgc ggcggcagcc aatcagagcg gcgcgctccg      660

aaagtttcct tttatggcga ggcggcggcg gcggcggccc tataaaaagc gaagcgcgcg      720

gcgggcgatc agcttacttg tggtaccgag ctcggatcct gagaacttca gggtgagtct      780

atgggaccct tgatgttttc tttccccttc ttttctatgg ttaagttcat gtcataggaa      840

ggggagaagt aacagggtac acatattgac caaatcaggg taattttgca tttgtaattt      900

taaaaaatgc tttcttcttt taatatactt ttttgtttat cttatttcta atactttccc      960

taatctcttt ctttcagggc aataatgata caatgtatca tgcctctttg caccattcta     1020

aagaataaca gtgataattt ctgggttaag gcaatagcaa tatttctgca tataaatatt     1080

tctgcatata aattgtaact gatgtaagag gtttcatatt gctaatagca gctacaatcc     1140

agctaccatt ctgcttttat tttatggttg ggataaggct ggattattct gagtccaagc     1200

taggcccttt tgctaatcat gttcatacct cttatcttcc tcccacagct cctgggcaac     1260

gtgctggtct gtgtgctggc ccatcacttt ggcaaagaat tgatcgccgc caccatggcg     1320

gcggcattac gggtggcggc ggtcggggca aggctcagcg ttctggcgag cggtctccgc     1380

gccgcggtcc gcagcctttg cagccaggcc acctctgtta acgaacgcat cgaaaacaag     1440

cgccggaccg cgctgctggg agggggccaa cgccgtattg acgcgcagca caagcgagga     1500

aagctaacag ccagggagag gatcagtctc ttgctggacc ctggcagctt tgttgagagc     1560

gacatgtttg tggaacacag atgtgcagat tttggaatgg ctgctgataa gaataagttt     1620

cctggagaca gcgtggtcac tggacgaggc cgaatcaatg gaagattggt ttatgtcttc     1680

agtcaggatt ttacagtttt tggaggcagt ctgtcaggag cacatgccca aaagatctgc     1740

aaaatcatgg accaggccat aacggtgggg gctccagtga ttgggctgaa tgactctggg     1800

ggagcacgga tccaagaagg agtggagtct ttggctggct atgcagacat ctttctgagg     1860

aatgttacgg catccggagt catccctcag atttctctga tcatgggccc atgtgctggt     1920

ggggccgtct actccccagc cctaacagac ttcacgttca tggtaaagga cacctcctac     1980

ctgttcatca ctggccctga tgttgtgaag tctgtcacca atgaggatgt tacccaggag     2040

gagctcggtg gtgccaagac ccacaccacc atgtcaggtg tggcccacag agcttttgaa     2100

aatgatgttg atgccttgtg taatctccgg gatttcttca actacctgcc cctgagcagt     2160

caggacccgg ctcccgtccg tgagtgccac gatcccagtg accgtctggt tcctgagctt     2220

gacacaattg tccctttgga atcaaccaaa gcctacaaca tggtggacat catacactct     2280

gttgttgatg agcgtgaatt ttttgagatc atgcccaatt atgccaagaa catcattgtt     2340

ggttttgcaa gaatgaatgg gaggactgtt ggaattgttg gcaaccaacc taaggtggcc     2400

tcaggatgct tggatattaa ttcatctgtg aaaggggctc gttttgtcag attctgtgat     2460

gcattcaata ttccactcat cacttttgtt gatgtccctg gctttctacc tggcacagca     2520

caggaatacg ggggcatcat ccggcatggt gccaagcttc tctacgcatt tgctgaggca     2580

actgtaccca aagtcacagt catcaccagg aaggcctatg gaggtgccta tgatgtcatg     2640

agctctaagc acctttgtgg tgataccaac tatgcctggc ccaccgcaga gattgcagtc     2700

atgggagcaa agggcgctgt ggagatcatc ttcaaagggc atgagaatgt ggaagctgct     2760

caggcagagt acatcgagaa gtttgccaac cctttccctg cagcagtgcg agggtttgtg     2820

gatgacatca tccaaccttc ttccacacgt gcccgaatct gctgtgacct ggatgtcttg     2880

gccagcaaga aggtacaacg tccttggaga aaacatgcaa atattccatt gtgaatccag     2940

acatgataag atacattgat gagtttggac aaaccacaac tagaatgcag tgaaaaaaat     3000

gctttatttg tgaaatttgt gatgctattg ctttatttgt aaccattata agctgcaata     3060

aacaagttaa caacaacaat tgcattcatt ttatgtttca ggttcagggg gaggtgtggg     3120

aggtttttta gaggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc     3180

gctcactgag gccgcccggg caaagcccgg gcgtcgggcg acctttggtc gcccggcctc     3240

agtgagcgag cgagcgcgca gagagggagt ggccaa                               3276


