                         SEQUENCE LISTING

<110>  The Trustees of the University of Pennsylvania
 
<120>  Compositions Useful in Treatment of Metachromatic Leukodystrophy

<130>  UPN-18-8585PCT

<150>  US 62/843,091
<151>  2019-05-03

<160>  24    

<170>  PatentIn version 3.5

<210>  1
<211>  1521
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Engineered hARSA coding sequence


<220>
<221>  CDS
<222>  (1)..(1521)
<223>  Engineered hARSA coding sequence

<400>  1
atg gga gcc cct aga tct ctg ctg ctg gct ctg gct gct gga ctg gca         48
Met Gly Ala Pro Arg Ser Leu Leu Leu Ala Leu Ala Ala Gly Leu Ala           
1               5                   10                  15                

gtt gcc aga cct cct aac atc gtg ctg atc ttc gcc gac gat ctc ggc         96
Val Ala Arg Pro Pro Asn Ile Val Leu Ile Phe Ala Asp Asp Leu Gly           
            20                  25                  30                    

tac ggc gat ctg ggc tgt tac gga cac ccc agc agc acc aca cct aac        144
Tyr Gly Asp Leu Gly Cys Tyr Gly His Pro Ser Ser Thr Thr Pro Asn           
        35                  40                  45                        

ctg gat caa ctt gcc gct ggc ggc ctg aga ttc acc gat ttc tac gtg        192
Leu Asp Gln Leu Ala Ala Gly Gly Leu Arg Phe Thr Asp Phe Tyr Val           
    50                  55                  60                            

ccc gtg tct ctg tgc acc cct tct aga gct gct ctg ctg aca ggc aga        240
Pro Val Ser Leu Cys Thr Pro Ser Arg Ala Ala Leu Leu Thr Gly Arg           
65                  70                  75                  80            

ctc cct gtg cgg atg gga atg tat cct ggc gtg ctg gtg cct agc tct        288
Leu Pro Val Arg Met Gly Met Tyr Pro Gly Val Leu Val Pro Ser Ser           
                85                  90                  95                

aga ggc gga ctg cct ctg gaa gaa gtg aca gtt gcc gaa gtg ctg gcc        336
Arg Gly Gly Leu Pro Leu Glu Glu Val Thr Val Ala Glu Val Leu Ala           
            100                 105                 110                   

gcc aga gga tat ctg act ggc atg gcc gga aag tgg cac ctc gga gtt        384
Ala Arg Gly Tyr Leu Thr Gly Met Ala Gly Lys Trp His Leu Gly Val           
        115                 120                 125                       

gga cca gaa ggc gct ttt ctg cct cct cac cag ggc ttc cac cgg ttt        432
Gly Pro Glu Gly Ala Phe Leu Pro Pro His Gln Gly Phe His Arg Phe           
    130                 135                 140                           

ctg ggc atc cct tac tct cac gat cag ggc ccc tgc cag aac ctg acc        480
Leu Gly Ile Pro Tyr Ser His Asp Gln Gly Pro Cys Gln Asn Leu Thr           
145                 150                 155                 160           

tgt ttt cct cct gcc aca cct tgc gac ggc ggc tgt gat caa gga ctg        528
Cys Phe Pro Pro Ala Thr Pro Cys Asp Gly Gly Cys Asp Gln Gly Leu           
                165                 170                 175               

gtg cca att cct ctg ctg gcc aac ctg agc gtg gaa gct caa cct cct        576
Val Pro Ile Pro Leu Leu Ala Asn Leu Ser Val Glu Ala Gln Pro Pro           
            180                 185                 190                   

tgg ctg cca gga ctg gaa gcc cgg tat atg gcc ttc gct cac gac ctg        624
Trp Leu Pro Gly Leu Glu Ala Arg Tyr Met Ala Phe Ala His Asp Leu           
        195                 200                 205                       

atg gcc gac gct cag aga cag gac aga cca ttc ttc ctg tac tac gcc        672
Met Ala Asp Ala Gln Arg Gln Asp Arg Pro Phe Phe Leu Tyr Tyr Ala           
    210                 215                 220                           

agc cac cac aca cac tac cct cag ttt agc ggc cag agc ttc gcc gag        720
Ser His His Thr His Tyr Pro Gln Phe Ser Gly Gln Ser Phe Ala Glu           
225                 230                 235                 240           

aga tct ggc aga gga cct ttc ggc gac agc ctg atg gaa ctg gat gcc        768
Arg Ser Gly Arg Gly Pro Phe Gly Asp Ser Leu Met Glu Leu Asp Ala           
                245                 250                 255               

gct gtg ggc aca ctg atg aca gcc atc gga gat ctg gga ctg ctg gaa        816
Ala Val Gly Thr Leu Met Thr Ala Ile Gly Asp Leu Gly Leu Leu Glu           
            260                 265                 270                   

gag aca ctg gtc atc ttc acc gcc gac aac ggc ccc gag aca atg aga        864
Glu Thr Leu Val Ile Phe Thr Ala Asp Asn Gly Pro Glu Thr Met Arg           
        275                 280                 285                       

atg agc aga ggc ggc tgt agc ggc ctg ctg aga tgt ggc aag ggc acc        912
Met Ser Arg Gly Gly Cys Ser Gly Leu Leu Arg Cys Gly Lys Gly Thr           
    290                 295                 300                           

aca tat gaa ggc ggc gtg aga gaa cct gct ctg gcc ttt tgg cct ggc        960
Thr Tyr Glu Gly Gly Val Arg Glu Pro Ala Leu Ala Phe Trp Pro Gly           
305                 310                 315                 320           

cat att gct cca ggc gtg aca cac gag ctg gcc tct tct ctg gat ctg       1008
His Ile Ala Pro Gly Val Thr His Glu Leu Ala Ser Ser Leu Asp Leu           
                325                 330                 335               

ctg cct aca ctg gca gct ctt gct ggt gct ccc ctg cct aat gtg acc       1056
Leu Pro Thr Leu Ala Ala Leu Ala Gly Ala Pro Leu Pro Asn Val Thr           
            340                 345                 350                   

ctg gat ggc ttc gat ctg agc cca ctg ctg ctc ggc aca ggc aag tct       1104
Leu Asp Gly Phe Asp Leu Ser Pro Leu Leu Leu Gly Thr Gly Lys Ser           
        355                 360                 365                       

cca aga cag agc ctg ttc ttc tac cct agc tac ccc gac gaa gtg cgg       1152
Pro Arg Gln Ser Leu Phe Phe Tyr Pro Ser Tyr Pro Asp Glu Val Arg           
    370                 375                 380                           

gga gtg ttt gcc gtg cgg acc gga aag tat aag gcc cac ttc ttc acc       1200
Gly Val Phe Ala Val Arg Thr Gly Lys Tyr Lys Ala His Phe Phe Thr           
385                 390                 395                 400           

caa ggc agc gcc cac tct gac acc aca gct gat cct gct tgt cac gcc       1248
Gln Gly Ser Ala His Ser Asp Thr Thr Ala Asp Pro Ala Cys His Ala           
                405                 410                 415               

agc tct agc ctg aca gcc cat gaa cct cca ctg ctg tac gac ctg agc       1296
Ser Ser Ser Leu Thr Ala His Glu Pro Pro Leu Leu Tyr Asp Leu Ser           
            420                 425                 430                   

aag gac ccc ggc gag aac tac aat ctg ctt ggc gga gtt gcc ggc gct       1344
Lys Asp Pro Gly Glu Asn Tyr Asn Leu Leu Gly Gly Val Ala Gly Ala           
        435                 440                 445                       

aca cct gaa gtt ctg cag gcc ctg aaa cag ctc cag ctg ctg aaa gcc       1392
Thr Pro Glu Val Leu Gln Ala Leu Lys Gln Leu Gln Leu Leu Lys Ala           
    450                 455                 460                           

cag ctg gac gct gcc gtg aca ttt gga cct agt cag gtg gcc aga ggc       1440
Gln Leu Asp Ala Ala Val Thr Phe Gly Pro Ser Gln Val Ala Arg Gly           
465                 470                 475                 480           

gag gat cct gct ctg cag atc tgt tgt cac cct ggc tgc aca ccc aga       1488
Glu Asp Pro Ala Leu Gln Ile Cys Cys His Pro Gly Cys Thr Pro Arg           
                485                 490                 495               

cct gcc tgc tgt cat tgt cct gat cca cac gcc                           1521
Pro Ala Cys Cys His Cys Pro Asp Pro His Ala                               
            500                 505                                       


<210>  2
<211>  507
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  2

Met Gly Ala Pro Arg Ser Leu Leu Leu Ala Leu Ala Ala Gly Leu Ala 
1               5                   10                  15      


Val Ala Arg Pro Pro Asn Ile Val Leu Ile Phe Ala Asp Asp Leu Gly 
            20                  25                  30          


Tyr Gly Asp Leu Gly Cys Tyr Gly His Pro Ser Ser Thr Thr Pro Asn 
        35                  40                  45              


Leu Asp Gln Leu Ala Ala Gly Gly Leu Arg Phe Thr Asp Phe Tyr Val 
    50                  55                  60                  


Pro Val Ser Leu Cys Thr Pro Ser Arg Ala Ala Leu Leu Thr Gly Arg 
65                  70                  75                  80  


Leu Pro Val Arg Met Gly Met Tyr Pro Gly Val Leu Val Pro Ser Ser 
                85                  90                  95      


Arg Gly Gly Leu Pro Leu Glu Glu Val Thr Val Ala Glu Val Leu Ala 
            100                 105                 110         


Ala Arg Gly Tyr Leu Thr Gly Met Ala Gly Lys Trp His Leu Gly Val 
        115                 120                 125             


Gly Pro Glu Gly Ala Phe Leu Pro Pro His Gln Gly Phe His Arg Phe 
    130                 135                 140                 


Leu Gly Ile Pro Tyr Ser His Asp Gln Gly Pro Cys Gln Asn Leu Thr 
145                 150                 155                 160 


Cys Phe Pro Pro Ala Thr Pro Cys Asp Gly Gly Cys Asp Gln Gly Leu 
                165                 170                 175     


Val Pro Ile Pro Leu Leu Ala Asn Leu Ser Val Glu Ala Gln Pro Pro 
            180                 185                 190         


Trp Leu Pro Gly Leu Glu Ala Arg Tyr Met Ala Phe Ala His Asp Leu 
        195                 200                 205             


Met Ala Asp Ala Gln Arg Gln Asp Arg Pro Phe Phe Leu Tyr Tyr Ala 
    210                 215                 220                 


Ser His His Thr His Tyr Pro Gln Phe Ser Gly Gln Ser Phe Ala Glu 
225                 230                 235                 240 


Arg Ser Gly Arg Gly Pro Phe Gly Asp Ser Leu Met Glu Leu Asp Ala 
                245                 250                 255     


Ala Val Gly Thr Leu Met Thr Ala Ile Gly Asp Leu Gly Leu Leu Glu 
            260                 265                 270         


Glu Thr Leu Val Ile Phe Thr Ala Asp Asn Gly Pro Glu Thr Met Arg 
        275                 280                 285             


Met Ser Arg Gly Gly Cys Ser Gly Leu Leu Arg Cys Gly Lys Gly Thr 
    290                 295                 300                 


Thr Tyr Glu Gly Gly Val Arg Glu Pro Ala Leu Ala Phe Trp Pro Gly 
305                 310                 315                 320 


His Ile Ala Pro Gly Val Thr His Glu Leu Ala Ser Ser Leu Asp Leu 
                325                 330                 335     


Leu Pro Thr Leu Ala Ala Leu Ala Gly Ala Pro Leu Pro Asn Val Thr 
            340                 345                 350         


Leu Asp Gly Phe Asp Leu Ser Pro Leu Leu Leu Gly Thr Gly Lys Ser 
        355                 360                 365             


Pro Arg Gln Ser Leu Phe Phe Tyr Pro Ser Tyr Pro Asp Glu Val Arg 
    370                 375                 380                 


Gly Val Phe Ala Val Arg Thr Gly Lys Tyr Lys Ala His Phe Phe Thr 
385                 390                 395                 400 


Gln Gly Ser Ala His Ser Asp Thr Thr Ala Asp Pro Ala Cys His Ala 
                405                 410                 415     


Ser Ser Ser Leu Thr Ala His Glu Pro Pro Leu Leu Tyr Asp Leu Ser 
            420                 425                 430         


Lys Asp Pro Gly Glu Asn Tyr Asn Leu Leu Gly Gly Val Ala Gly Ala 
        435                 440                 445             


Thr Pro Glu Val Leu Gln Ala Leu Lys Gln Leu Gln Leu Leu Lys Ala 
    450                 455                 460                 


Gln Leu Asp Ala Ala Val Thr Phe Gly Pro Ser Gln Val Ala Arg Gly 
465                 470                 475                 480 


Glu Asp Pro Ala Leu Gln Ile Cys Cys His Pro Gly Cys Thr Pro Arg 
                485                 490                 495     


Pro Ala Cys Cys His Cys Pro Asp Pro His Ala 
            500                 505         


<210>  3
<211>  1527
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Engineered hARSA coding sequence


<220>
<221>  CDS
<222>  (1)..(1527)
<223>  Engineered hARSA coding sequence

<400>  3
atg tct atg gga gcc cct aga tct ctg ctg ctg gct ctg gct gct gga         48
Met Ser Met Gly Ala Pro Arg Ser Leu Leu Leu Ala Leu Ala Ala Gly           
1               5                   10                  15                

ctg gca gtt gcc aga cct cct aac atc gtg ctg atc ttc gcc gac gat         96
Leu Ala Val Ala Arg Pro Pro Asn Ile Val Leu Ile Phe Ala Asp Asp           
            20                  25                  30                    

ctc ggc tac ggc gat ctg ggc tgt tac gga cac ccc agc agc acc aca        144
Leu Gly Tyr Gly Asp Leu Gly Cys Tyr Gly His Pro Ser Ser Thr Thr           
        35                  40                  45                        

cct aac ctg gat caa ctt gcc gct ggc ggc ctg aga ttc acc gat ttc        192
Pro Asn Leu Asp Gln Leu Ala Ala Gly Gly Leu Arg Phe Thr Asp Phe           
    50                  55                  60                            

tac gtg ccc gtg tct ctg tgc acc cct tct aga gct gct ctg ctg aca        240
Tyr Val Pro Val Ser Leu Cys Thr Pro Ser Arg Ala Ala Leu Leu Thr           
65                  70                  75                  80            

ggc aga ctc cct gtg cgg atg gga atg tat cct ggc gtg ctg gtg cct        288
Gly Arg Leu Pro Val Arg Met Gly Met Tyr Pro Gly Val Leu Val Pro           
                85                  90                  95                

agc tct aga ggc gga ctg cct ctg gaa gaa gtg aca gtt gcc gaa gtg        336
Ser Ser Arg Gly Gly Leu Pro Leu Glu Glu Val Thr Val Ala Glu Val           
            100                 105                 110                   

ctg gcc gcc aga gga tat ctg act ggc atg gcc gga aag tgg cac ctc        384
Leu Ala Ala Arg Gly Tyr Leu Thr Gly Met Ala Gly Lys Trp His Leu           
        115                 120                 125                       

gga gtt gga cca gaa ggc gct ttt ctg cct cct cac cag ggc ttc cac        432
Gly Val Gly Pro Glu Gly Ala Phe Leu Pro Pro His Gln Gly Phe His           
    130                 135                 140                           

cgg ttt ctg ggc atc cct tac tct cac gat cag ggc ccc tgc cag aac        480
Arg Phe Leu Gly Ile Pro Tyr Ser His Asp Gln Gly Pro Cys Gln Asn           
145                 150                 155                 160           

ctg acc tgt ttt cct cct gcc aca cct tgc gac ggc ggc tgt gat caa        528
Leu Thr Cys Phe Pro Pro Ala Thr Pro Cys Asp Gly Gly Cys Asp Gln           
                165                 170                 175               

gga ctg gtg cca att cct ctg ctg gcc aac ctg agc gtg gaa gct caa        576
Gly Leu Val Pro Ile Pro Leu Leu Ala Asn Leu Ser Val Glu Ala Gln           
            180                 185                 190                   

cct cct tgg ctg cca gga ctg gaa gcc cgg tat atg gcc ttc gct cac        624
Pro Pro Trp Leu Pro Gly Leu Glu Ala Arg Tyr Met Ala Phe Ala His           
        195                 200                 205                       

gac ctg atg gcc gac gct cag aga cag gac aga cca ttc ttc ctg tac        672
Asp Leu Met Ala Asp Ala Gln Arg Gln Asp Arg Pro Phe Phe Leu Tyr           
    210                 215                 220                           

tac gcc agc cac cac aca cac tac cct cag ttt agc ggc cag agc ttc        720
Tyr Ala Ser His His Thr His Tyr Pro Gln Phe Ser Gly Gln Ser Phe           
225                 230                 235                 240           

gcc gag aga tct ggc aga gga cct ttc ggc gac agc ctg atg gaa ctg        768
Ala Glu Arg Ser Gly Arg Gly Pro Phe Gly Asp Ser Leu Met Glu Leu           
                245                 250                 255               

gat gcc gct gtg ggc aca ctg atg aca gcc atc gga gat ctg gga ctg        816
Asp Ala Ala Val Gly Thr Leu Met Thr Ala Ile Gly Asp Leu Gly Leu           
            260                 265                 270                   

ctg gaa gag aca ctg gtc atc ttc acc gcc gac aac ggc ccc gag aca        864
Leu Glu Glu Thr Leu Val Ile Phe Thr Ala Asp Asn Gly Pro Glu Thr           
        275                 280                 285                       

atg aga atg agc aga ggc ggc tgt agc ggc ctg ctg aga tgt ggc aag        912
Met Arg Met Ser Arg Gly Gly Cys Ser Gly Leu Leu Arg Cys Gly Lys           
    290                 295                 300                           

ggc acc aca tat gaa ggc ggc gtg aga gaa cct gct ctg gcc ttt tgg        960
Gly Thr Thr Tyr Glu Gly Gly Val Arg Glu Pro Ala Leu Ala Phe Trp           
305                 310                 315                 320           

cct ggc cat att gct cca ggc gtg aca cac gag ctg gcc tct tct ctg       1008
Pro Gly His Ile Ala Pro Gly Val Thr His Glu Leu Ala Ser Ser Leu           
                325                 330                 335               

gat ctg ctg cct aca ctg gca gct ctt gct ggt gct ccc ctg cct aat       1056
Asp Leu Leu Pro Thr Leu Ala Ala Leu Ala Gly Ala Pro Leu Pro Asn           
            340                 345                 350                   

gtg acc ctg gat ggc ttc gat ctg agc cca ctg ctg ctc ggc aca ggc       1104
Val Thr Leu Asp Gly Phe Asp Leu Ser Pro Leu Leu Leu Gly Thr Gly           
        355                 360                 365                       

aag tct cca aga cag agc ctg ttc ttc tac cct agc tac ccc gac gaa       1152
Lys Ser Pro Arg Gln Ser Leu Phe Phe Tyr Pro Ser Tyr Pro Asp Glu           
    370                 375                 380                           

gtg cgg gga gtg ttt gcc gtg cgg acc gga aag tat aag gcc cac ttc       1200
Val Arg Gly Val Phe Ala Val Arg Thr Gly Lys Tyr Lys Ala His Phe           
385                 390                 395                 400           

ttc acc caa ggc agc gcc cac tct gac acc aca gct gat cct gct tgt       1248
Phe Thr Gln Gly Ser Ala His Ser Asp Thr Thr Ala Asp Pro Ala Cys           
                405                 410                 415               

cac gcc agc tct agc ctg aca gcc cat gaa cct cca ctg ctg tac gac       1296
His Ala Ser Ser Ser Leu Thr Ala His Glu Pro Pro Leu Leu Tyr Asp           
            420                 425                 430                   

ctg agc aag gac ccc ggc gag aac tac aat ctg ctt ggc gga gtt gcc       1344
Leu Ser Lys Asp Pro Gly Glu Asn Tyr Asn Leu Leu Gly Gly Val Ala           
        435                 440                 445                       

ggc gct aca cct gaa gtt ctg cag gcc ctg aaa cag ctc cag ctg ctg       1392
Gly Ala Thr Pro Glu Val Leu Gln Ala Leu Lys Gln Leu Gln Leu Leu           
    450                 455                 460                           

aaa gcc cag ctg gac gct gcc gtg aca ttt gga cct agt cag gtg gcc       1440
Lys Ala Gln Leu Asp Ala Ala Val Thr Phe Gly Pro Ser Gln Val Ala           
465                 470                 475                 480           

aga ggc gag gat cct gct ctg cag atc tgt tgt cac cct ggc tgc aca       1488
Arg Gly Glu Asp Pro Ala Leu Gln Ile Cys Cys His Pro Gly Cys Thr           
                485                 490                 495               

ccc aga cct gcc tgc tgt cat tgt cct gat cca cac gcc                   1527
Pro Arg Pro Ala Cys Cys His Cys Pro Asp Pro His Ala                       
            500                 505                                       


<210>  4
<211>  509
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  4

Met Ser Met Gly Ala Pro Arg Ser Leu Leu Leu Ala Leu Ala Ala Gly 
1               5                   10                  15      


Leu Ala Val Ala Arg Pro Pro Asn Ile Val Leu Ile Phe Ala Asp Asp 
            20                  25                  30          


Leu Gly Tyr Gly Asp Leu Gly Cys Tyr Gly His Pro Ser Ser Thr Thr 
        35                  40                  45              


Pro Asn Leu Asp Gln Leu Ala Ala Gly Gly Leu Arg Phe Thr Asp Phe 
    50                  55                  60                  


Tyr Val Pro Val Ser Leu Cys Thr Pro Ser Arg Ala Ala Leu Leu Thr 
65                  70                  75                  80  


Gly Arg Leu Pro Val Arg Met Gly Met Tyr Pro Gly Val Leu Val Pro 
                85                  90                  95      


Ser Ser Arg Gly Gly Leu Pro Leu Glu Glu Val Thr Val Ala Glu Val 
            100                 105                 110         


Leu Ala Ala Arg Gly Tyr Leu Thr Gly Met Ala Gly Lys Trp His Leu 
        115                 120                 125             


Gly Val Gly Pro Glu Gly Ala Phe Leu Pro Pro His Gln Gly Phe His 
    130                 135                 140                 


Arg Phe Leu Gly Ile Pro Tyr Ser His Asp Gln Gly Pro Cys Gln Asn 
145                 150                 155                 160 


Leu Thr Cys Phe Pro Pro Ala Thr Pro Cys Asp Gly Gly Cys Asp Gln 
                165                 170                 175     


Gly Leu Val Pro Ile Pro Leu Leu Ala Asn Leu Ser Val Glu Ala Gln 
            180                 185                 190         


Pro Pro Trp Leu Pro Gly Leu Glu Ala Arg Tyr Met Ala Phe Ala His 
        195                 200                 205             


Asp Leu Met Ala Asp Ala Gln Arg Gln Asp Arg Pro Phe Phe Leu Tyr 
    210                 215                 220                 


Tyr Ala Ser His His Thr His Tyr Pro Gln Phe Ser Gly Gln Ser Phe 
225                 230                 235                 240 


Ala Glu Arg Ser Gly Arg Gly Pro Phe Gly Asp Ser Leu Met Glu Leu 
                245                 250                 255     


Asp Ala Ala Val Gly Thr Leu Met Thr Ala Ile Gly Asp Leu Gly Leu 
            260                 265                 270         


Leu Glu Glu Thr Leu Val Ile Phe Thr Ala Asp Asn Gly Pro Glu Thr 
        275                 280                 285             


Met Arg Met Ser Arg Gly Gly Cys Ser Gly Leu Leu Arg Cys Gly Lys 
    290                 295                 300                 


Gly Thr Thr Tyr Glu Gly Gly Val Arg Glu Pro Ala Leu Ala Phe Trp 
305                 310                 315                 320 


Pro Gly His Ile Ala Pro Gly Val Thr His Glu Leu Ala Ser Ser Leu 
                325                 330                 335     


Asp Leu Leu Pro Thr Leu Ala Ala Leu Ala Gly Ala Pro Leu Pro Asn 
            340                 345                 350         


Val Thr Leu Asp Gly Phe Asp Leu Ser Pro Leu Leu Leu Gly Thr Gly 
        355                 360                 365             


Lys Ser Pro Arg Gln Ser Leu Phe Phe Tyr Pro Ser Tyr Pro Asp Glu 
    370                 375                 380                 


Val Arg Gly Val Phe Ala Val Arg Thr Gly Lys Tyr Lys Ala His Phe 
385                 390                 395                 400 


Phe Thr Gln Gly Ser Ala His Ser Asp Thr Thr Ala Asp Pro Ala Cys 
                405                 410                 415     


His Ala Ser Ser Ser Leu Thr Ala His Glu Pro Pro Leu Leu Tyr Asp 
            420                 425                 430         


Leu Ser Lys Asp Pro Gly Glu Asn Tyr Asn Leu Leu Gly Gly Val Ala 
        435                 440                 445             


Gly Ala Thr Pro Glu Val Leu Gln Ala Leu Lys Gln Leu Gln Leu Leu 
    450                 455                 460                 


Lys Ala Gln Leu Asp Ala Ala Val Thr Phe Gly Pro Ser Gln Val Ala 
465                 470                 475                 480 


Arg Gly Glu Asp Pro Ala Leu Gln Ile Cys Cys His Pro Gly Cys Thr 
                485                 490                 495     


Pro Arg Pro Ala Cys Cys His Cys Pro Asp Pro His Ala 
            500                 505                 


<210>  5
<211>  7141
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Production plasmid for AAV. CB7,CI.hARSAco.RBG


<220>
<221>  repeat_region
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  promoter
<222>  (198)..(579)
<223>  CMV IE promoter

<220>
<221>  promoter
<222>  (582)..(862)
<223>  CB promoter

<220>
<221>  TATA_signal
<222>  (836)..(839)
<223>  TATA

<220>
<221>  Intron
<222>  (956)..(1928)
<223>  chicken beta-actin intron

<220>
<221>  misc_feature
<222>  (1935)..(3506)
<223>  Engineered ARSA coding sequence (hARSAco)

<220>
<221>  polyA_signal
<222>  (3539)..(3665)
<223>  Rabbit globin poly A

<220>
<221>  repeat_region
<222>  (3754)..(3883)
<223>  3' ITR

<220>
<221>  rep_origin
<222>  (4060)..(4498)
<223>  f1 ori

<220>
<221>  rep_origin
<222>  (4527)..(5169)
<223>  pUC origin of replication

<220>
<221>  misc_feature
<222>  (5844)..(6649)
<223>  Kan-r

<400>  5
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctaccag ggtaatgggg      180

atcctctaga actatagcta gtcgacattg attattgact agttattaat agtaatcaat      240

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa      300

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt      360

tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt atttacggta      420

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc ctattgacgt      480

caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat gggactttcc      540

tacttggcag tacatctacg tattagtcat cgctattacc atggtcgagg tgagccccac      600

gttctgcttc actctcccca tctccccccc ctccccaccc ccaattttgt atttatttat      660

tttttaatta ttttgtgcag cgatgggggc gggggggggg ggggggcgcg cgccaggcgg      720

ggcggggcgg ggcgaggggc ggggcggggc gaggcggaga ggtgcggcgg cagccaatca      780

gagcggcgcg ctccgaaagt ttccttttat ggcgaggcgg cggcggcggc ggccctataa      840

aaagcgaagc gcgcggcggg cgggagtcgc tgcgcgctgc cttcgccccg tgccccgctc      900

cgccgccgcc tcgcgccgcc cgccccggct ctgactgacc gcgttactcc cacaggtgag      960

cgggcgggac ggcccttctc ctccgggctg taattagcgc ttggtttaat gacggcttgt     1020

ttcttttctg tggctgcgtg aaagccttga ggggctccgg gagggccctt tgtgcggggg     1080

gagcggctcg gggggtgcgt gcgtgtgtgt gtgcgtgggg agcgccgcgt gcggctccgc     1140

gctgcccggc ggctgtgagc gctgcgggcg cggcgcgggg ctttgtgcgc tccgcagtgt     1200

gcgcgagggg agcgcggccg ggggcggtgc cccgcggtgc ggggggggct gcgaggggaa     1260

caaaggctgc gtgcggggtg tgtgcgtggg ggggtgagca gggggtgtgg gcgcgtcggt     1320

cgggctgcaa ccccccctgc acccccctcc ccgagttgct gagcacggcc cggcttcggg     1380

tgcggggctc cgtacggggc gtggcgcggg gctcgccgtg ccgggcgggg ggtggcggca     1440

ggtgggggtg ccgggcgggg cggggccgcc tcgggccggg gagggctcgg gggaggggcg     1500

cggcggcccc cggagcgccg gcggctgtcg aggcgcggcg agccgcagcc attgcctttt     1560

atggtaatcg tgcgagaggg cgcagggact tcctttgtcc caaatctgtg cggagccgaa     1620

atctgggagg cgccgccgca ccccctctag cgggcgcggg gcgaagcggt gcggcgccgg     1680

caggaaggaa atgggcgggg agggccttcg tgcgtcgccg cgccgccgtc cccttctccc     1740

tctccagcct cggggctgtc cgcgggggga cggctgcctt cgggggggac ggggcagggc     1800

ggggttcggc ttctggcgtg tgaccggcgg ctctagagcc tctgctaacc atgttcatgc     1860

cttcttcttt ttcctacagc tcctgggcaa cgtgctggtt attgtgctgt ctcatcattt     1920

tggcaaagaa ttcacgcgtg aattcggtac cacaggccac catgtctatg ggagccccta     1980

gatctctgct gctggctctg gctgctggac tggcagttgc cagacctcct aacatcgtgc     2040

tgatcttcgc cgacgatctc ggctacggcg atctgggctg ttacggacac cccagcagca     2100

ccacacctaa cctggatcaa cttgccgctg gcggcctgag attcaccgat ttctacgtgc     2160

ccgtgtctct gtgcacccct tctagagctg ctctgctgac aggcagactc cctgtgcgga     2220

tgggaatgta tcctggcgtg ctggtgccta gctctagagg cggactgcct ctggaagaag     2280

tgacagttgc cgaagtgctg gccgccagag gatatctgac tggcatggcc ggaaagtggc     2340

acctcggagt tggaccagaa ggcgcttttc tgcctcctca ccagggcttc caccggtttc     2400

tgggcatccc ttactctcac gatcagggcc cctgccagaa cctgacctgt tttcctcctg     2460

ccacaccttg cgacggcggc tgtgatcaag gactggtgcc aattcctctg ctggccaacc     2520

tgagcgtgga agctcaacct ccttggctgc caggactgga agcccggtat atggccttcg     2580

ctcacgacct gatggccgac gctcagagac aggacagacc attcttcctg tactacgcca     2640

gccaccacac acactaccct cagtttagcg gccagagctt cgccgagaga tctggcagag     2700

gacctttcgg cgacagcctg atggaactgg atgccgctgt gggcacactg atgacagcca     2760

tcggagatct gggactgctg gaagagacac tggtcatctt caccgccgac aacggccccg     2820

agacaatgag aatgagcaga ggcggctgta gcggcctgct gagatgtggc aagggcacca     2880

catatgaagg cggcgtgaga gaacctgctc tggccttttg gcctggccat attgctccag     2940

gcgtgacaca cgagctggcc tcttctctgg atctgctgcc tacactggca gctcttgctg     3000

gtgctcccct gcctaatgtg accctggatg gcttcgatct gagcccactg ctgctcggca     3060

caggcaagtc tccaagacag agcctgttct tctaccctag ctaccccgac gaagtgcggg     3120

gagtgtttgc cgtgcggacc ggaaagtata aggcccactt cttcacccaa ggcagcgccc     3180

actctgacac cacagctgat cctgcttgtc acgccagctc tagcctgaca gcccatgaac     3240

ctccactgct gtacgacctg agcaaggacc ccggcgagaa ctacaatctg cttggcggag     3300

ttgccggcgc tacacctgaa gttctgcagg ccctgaaaca gctccagctg ctgaaagccc     3360

agctggacgc tgccgtgaca tttggaccta gtcaggtggc cagaggcgag gatcctgctc     3420

tgcagatctg ttgtcaccct ggctgcacac ccagacctgc ctgctgtcat tgtcctgatc     3480

cacacgcctg atgaacagcc tgaggctcga ggacggggtg aactacgcct gaggatccga     3540

tctttttccc tctgccaaaa attatgggga catcatgaag ccccttgagc atctgacttc     3600

tggctaataa aggaaattta ttttcattgc aatagtgtgt tggaattttt tgtgtctctc     3660

actcggaagc aattcgttga tctgaatttc gaccacccat aatacccatt accctggtag     3720

ataagtagca tggcgggtta atcattaact acaaggaacc cctagtgatg gagttggcca     3780

ctccctctct gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc     3840

cgggctttgc ccgggcggcc tcagtgagcg agcgagcgcg cagccttaat taacctaatt     3900

cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc     3960

gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc     4020

gcccttccca acagttgcgc agcctgaatg gcgaatggga cgcgccctgt agcggcgcat     4080

taagcgcggc gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgccctag     4140

cgcccgctcc tttcgctttc ttcccttcct ttctcgccac gttcgccggc tttccccgtc     4200

aagctctaaa tcgggggctc cctttagggt tccgatttag tgctttacgg cacctcgacc     4260

ccaaaaaact tgattagggt gatggttcac gtagtgggcc atcgccctga tagacggttt     4320

ttcgcccttt gacgttggag tccacgttct ttaatagtgg actcttgttc caaactggaa     4380

caacactcaa ccctatctcg gtctattctt ttgatttata agggattttg ccgatttcgg     4440

cctattggtt aaaaaatgag ctgatttaac aaaaatttaa cgcgaatttt aacaaaatca     4500

tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt     4560

tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc     4620

gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct     4680

ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg     4740

tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca     4800

agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact     4860

atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta     4920

acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta     4980

actacggcta cactagaaga acagtatttg gtatctgcgc tctgctgaag ccagttacct     5040

tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt     5100

tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga     5160

tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca     5220

tgagattatc aaaaaggatc ttcacctaga tccttttgat cctccggcgt tcagcctgtg     5280

ccacagccga caggatggtg accaccattt gccccatatc accgtcggta ctgatcccgt     5340

cgtcaataaa ccgaaccgct acaccctgag catcaaactc ttttatcagt tggatcatgt     5400

cggcggtgtc gcggccaaga cggtcgagct tcttcaccag aatgacatca ccttcctcca     5460

ccttcatcct cagcaaatcc agcccttccc gatctgttga actgccggat gccttgtcgg     5520

taaagatgcg gttagctttt acccctgcat ctttgagcgc tgaggtctgc ctcgtgaaga     5580

aggtgttgct gactcatacc aggcctgaat cgccccatca tccagccaga aagtgaggga     5640

gccacggttg atgagagctt tgttgtaggt ggaccagttg gtgattttga acttttgctt     5700

tgccacggaa cggtctgcgt tgtcgggaag atgcgtgatc tgatccttca actcagcaaa     5760

agttcgattt attcaacaaa gccgccgtcc cgtcaagtca gcgtaatgct ctgccagtgt     5820

tacaaccaat taaccaattc tgattagaaa aactcatcga gcatcaaatg aaactgcaat     5880

ttattcatat caggattatc aataccatat ttttgaaaaa gccgtttctg taatgaagga     5940

gaaaactcac cgaggcagtt ccataggatg gcaagatcct ggtatcggtc tgcgattccg     6000

actcgtccaa catcaataca acctattaat ttcccctcgt caaaaataag gttatcaagt     6060

gagaaatcac catgagtgac gactgaatcc ggtgagaatg gcaaaagctt atgcatttct     6120

ttccagactt gttcaacagg ccagccatta cgctcgtcat caaaatcact cgcatcaacc     6180

aaaccgttat tcattcgtga ttgcgcctga gcgagacgaa atacgcgatc gctgttaaaa     6240

ggacaattac aaacaggaat cgaatgcaac cggcgcagga acactgccag cgcatcaaca     6300

atattttcac ctgaatcagg atattcttct aatacctgga atgctgtttt cccggggatc     6360

gcagtggtga gtaaccatgc atcatcagga gtacggataa aatgcttgat ggtcggaaga     6420

ggcataaatt ccgtcagcca gtttagtctg accatctcat ctgtaacatc attggcaacg     6480

ctacctttgc catgtttcag aaacaactct ggcgcatcgg gcttcccata caatcgatag     6540

attgtcgcac ctgattgccc gacattatcg cgagcccatt tatacccata taaatcagca     6600

tccatgttgg aatttaatcg cggcctcgag caagacgttt cccgttgaat atggctcata     6660

acaccccttg tattactgtt tatgtaagca gacagtttta ttgttcatga tgatatattt     6720

ttatcttgtg caatgtaaca tcagagattt tgagacacca tgttctttcc tgcgttatcc     6780

cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc     6840

cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa     6900

ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac     6960

tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt agctcactca ttaggcaccc     7020

caggctttac actttatgct tccggctcgt atgttgtgtg gaattgtgag cggataacaa     7080

tttcacacag gaaacagcta tgaccatgat tacgccagat ttaattaagg ccttaattag     7140

g                                                                     7141


<210>  6
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAVhu68 vp1


<220>
<221>  CDS
<222>  (1)..(2211)

<400>  6
atg gct gcc gat ggt tat ctt cca gat tgg ctc gag gac aac ctc agt         48
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser           
1               5                   10                  15                

gaa ggc att cgc gag tgg tgg gct ttg aaa cct gga gcc cct caa ccc         96
Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro           
            20                  25                  30                    

aag gca aat caa caa cat caa gac aac gct cgg ggt ctt gtg ctt ccg        144
Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro           
        35                  40                  45                        

ggt tac aaa tac ctt gga ccc ggc aac gga ctc gac aag ggg gag ccg        192
Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro           
    50                  55                  60                            

gtc aac gaa gca gac gcg gcg gcc ctc gag cac gac aag gcc tac gac        240
Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp           
65                  70                  75                  80            

cag cag ctc aag gcc gga gac aac ccg tac ctc aag tac aac cac gcc        288
Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala           
                85                  90                  95                

gac gcc gag ttc cag gag cgg ctc aaa gaa gat acg tct ttt ggg ggc        336
Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly           
            100                 105                 110                   

aac ctc ggg cga gca gtc ttc cag gcc aaa aag agg ctt ctt gaa cct        384
Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro           
        115                 120                 125                       

ctt ggt ctg gtt gag gaa gcg gct aag acg gct cct gga aag aag agg        432
Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg           
    130                 135                 140                           

cct gta gag cag tct cct cag gaa ccg gac tcc tcc gtg ggt att ggc        480
Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Val Gly Ile Gly           
145                 150                 155                 160           

aaa tcg ggt gca cag ccc gct aaa aag aga ctc aat ttc ggt cag act        528
Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr           
                165                 170                 175               

ggc gac aca gag tca gtc ccc gac cct caa cca atc gga gaa cct ccc        576
Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro           
            180                 185                 190                   

gca gcc ccc tca ggt gtg gga tct ctt aca atg gct tca ggt ggt ggc        624
Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly           
        195                 200                 205                       

gca cca gtg gca gac aat aac gaa ggt gcc gat gga gtg ggt agt tcc        672
Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser           
    210                 215                 220                           

tcg gga aat tgg cat tgc gat tcc caa tgg ctg ggg gac aga gtc atc        720
Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile           
225                 230                 235                 240           

acc acc agc acc cga acc tgg gcc ctg ccc acc tac aac aat cac ctc        768
Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu           
                245                 250                 255               

tac aag caa atc tcc aac agc aca tct gga gga tct tca aat gac aac        816
Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn           
            260                 265                 270                   

gcc tac ttc ggc tac agc acc ccc tgg ggg tat ttt gac ttc aac aga        864
Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg           
        275                 280                 285                       

ttc cac tgc cac ttc tca cca cgt gac tgg caa aga ctc atc aac aac        912
Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn           
    290                 295                 300                           

aac tgg gga ttc cgg cct aag cga ctc aac ttc aag ctc ttc aac att        960
Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile           
305                 310                 315                 320           

cag gtc aaa gag gtt acg gac aac aat gga gtc aag acc atc gct aat       1008
Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn           
                325                 330                 335               

aac ctt acc agc acg gtc cag gtc ttc acg gac tca gac tat cag ctc       1056
Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu           
            340                 345                 350                   

ccg tac gtg ctc ggg tcg gct cac gag ggc tgc ctc ccg ccg ttc cca       1104
Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro           
        355                 360                 365                       

gcg gac gtt ttc atg att cct cag tac ggg tat cta acg ctt aat gat       1152
Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp           
    370                 375                 380                           

gga agc caa gcc gtg ggt cgt tcg tcc ttt tac tgc ctg gaa tat ttc       1200
Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe           
385                 390                 395                 400           

ccg tcg caa atg cta aga acg ggt aac aac ttc cag ttc agc tac gag       1248
Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu           
                405                 410                 415               

ttt gag aac gta cct ttc cat agc agc tat gct cac agc caa agc ctg       1296
Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu           
            420                 425                 430                   

gac cga ctc atg aat cca ctc atc gac caa tac ttg tac tat ctc tca       1344
Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser           
        435                 440                 445                       

aag act att aac ggt tct gga cag aat caa caa acg cta aaa ttc agt       1392
Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser           
    450                 455                 460                           

gtg gcc gga ccc agc aac atg gct gtc cag gga aga aac tac ata cct       1440
Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro           
465                 470                 475                 480           

gga ccc agc tac cga caa caa cgt gtc tca acc act gtg act caa aac       1488
Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn           
                485                 490                 495               

aac aac agc gaa ttt gct tgg cct gga gct tct tct tgg gct ctc aat       1536
Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn           
            500                 505                 510                   

gga cgt aat agc ttg atg aat cct gga cct gct atg gcc agc cac aaa       1584
Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys           
        515                 520                 525                       

gaa gga gag gac cgt ttc ttt cct ttg tct gga tct tta att ttt ggc       1632
Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly           
    530                 535                 540                           

aaa caa gga act gga aga gac aac gtg gat gcg gac aaa gtc atg ata       1680
Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile           
545                 550                 555                 560           

acc aac gaa gaa gaa att aaa act acc aac cca gta gca acg gag tcc       1728
Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser           
                565                 570                 575               

tat gga caa gtg gcc aca aac cac cag agt gcc caa gca cag gcg cag       1776
Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln           
            580                 585                 590                   

acc ggc tgg gtt caa aac caa gga ata ctt ccg ggt atg gtt tgg cag       1824
Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln           
        595                 600                 605                       

gac aga gat gtg tac ctg caa gga ccc att tgg gcc aaa att cct cac       1872
Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His           
    610                 615                 620                           

acg gac ggc aac ttt cac cct tct ccg ctg atg gga ggg ttt gga atg       1920
Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met           
625                 630                 635                 640           

aag cac ccg cct cct cag atc ctc atc aaa aac aca cct gta cct gcg       1968
Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala           
                645                 650                 655               

gat cct cca acg gct ttc aac aag gac aag ctg aac tct ttc atc acc       2016
Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr           
            660                 665                 670                   

cag tat tct act ggc caa gtc agc gtg gag att gag tgg gag ctg cag       2064
Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln           
        675                 680                 685                       

aag gaa aac agc aag cgc tgg aac ccg gag atc cag tac act tcc aac       2112
Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn           
    690                 695                 700                           

tat tac aag tct aat aat gtt gaa ttt gct gtt aat act gaa ggt gtt       2160
Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val           
705                 710                 715                 720           

tat tct gaa ccc cgc ccc att ggc acc aga tac ctg act cgt aat ctg       2208
Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu           
                725                 730                 735               

taa                                                                   2211


<210>  7
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  7

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Val Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  8
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  modified hu68vp1


<220>
<221>  MISC_FEATURE
<222>  (23)..(23)
<223>  Xaa may be W (Trp, tryptophan), or oxidated W.

<220>
<221>  MISC_FEATURE
<222>  (35)..(35)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (57)..(57)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (66)..(66)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (94)..(94)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (97)..(97)
<223>  Xaa may be D (asp, aspartic acid), or isomerized D.

<220>
<221>  MISC_FEATURE
<222>  (107)..(107)
<223>  Xaa may be D (asp, aspartic acid), or isomerized D.

<220>
<221>  misc_feature
<222>  (113)..(113)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  MISC_FEATURE
<222>  (149)..(149)
<223>  Xaa may be S (Ser, serine), or Phosphorilated S

<220>
<221>  MISC_FEATURE
<222>  (149)..(149)
<223>  Xaa may be S (Ser, serine), or Phosphorylated S

<220>
<221>  MISC_FEATURE
<222>  (247)..(247)
<223>  Xaa may be W (Trp, tryptophan), or oxidated W (e.g., kynurenine).

<220>
<221>  MISC_FEATURE
<222>  (253)..(253)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (259)..(259)
<223>  Xaa represents Q, or Q deamidated to glutamic acid 
       (alpha-glutamic acid), gamma-glutamic acid (Glu), or a blend of 
       alpha- and gamma-glutamic acid

<220>
<221>  MISC_FEATURE
<222>  (270)..(270)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (297)..(297)
<223>  Xaa represents D (Asp, aspartic acid) or amindated D to N (Asn, 
       asparagine)

<220>
<221>  MISC_FEATURE
<222>  (304)..(304)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (306)..(306)
<223>  Xaa may be W (Trp, tryptophan), or oxidated W (e.g., kynurenine).

<220>
<221>  MISC_FEATURE
<222>  (314)..(314)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (319)..(319)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (329)..(329)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (332)..(332)
<223>  Xaa may be K (lys, lysine), or acetylated K

<220>
<221>  MISC_FEATURE
<222>  (336)..(336)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (384)..(384)
<223>  Xaa may be D (asp, aspartic acid), or isomerized D.

<220>
<221>  MISC_FEATURE
<222>  (404)..(404)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (409)..(409)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (436)..(436)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (452)..(452)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (477)..(477)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (499)..(499)
<223>  Xaa may be S (Ser, serine), or Phosphorylated S

<220>
<221>  MISC_FEATURE
<222>  (512)..(512)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (515)..(515)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (518)..(518)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (524)..(524)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (559)..(559)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (569)..(569)
<223>  Xaa may be T (Thr, threonine), or Phosphorylated T

<220>
<221>  MISC_FEATURE
<222>  (586)..(586)
<223>  Xaa may be S (Ser, serine), or Phosphorylated S

<220>
<221>  MISC_FEATURE
<222>  (599)..(599)
<223>  Xaa represents Q, or Q deamidated to glutamic acid 
       (alpha-glutamic acid), gamma-glutamic acid (Glu), or a blend of 
       alpha- and gamma-glutamic acid

<220>
<221>  MISC_FEATURE
<222>  (605)..(605)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (619)..(619)
<223>  Xaa may be W (Trp, tryptophan), or oxidated W (e.g., kynurenine).

<220>
<221>  MISC_FEATURE
<222>  (628)..(628)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (640)..(640)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (651)..(651)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (663)..(663)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (666)..(666)
<223>  Xaa may be K (lys, lysine), or acetylated K

<220>
<221>  MISC_FEATURE
<222>  (689)..(689)
<223>  Xaa may be K (lys, lysine), or acetylated K

<220>
<221>  MISC_FEATURE
<222>  (693)..(693)
<223>  Xaa may be K (lys, lysine), or acetylated K

<220>
<221>  MISC_FEATURE
<222>  (695)..(695)
<223>  Xaa may be W (Trp, tryptophan), or oxidated W.

<220>
<221>  MISC_FEATURE
<222>  (709)..(709)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (735)..(735)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<400>  8

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Xaa Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Xaa Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Xaa Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Xaa Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Xaa His Ala 
                85                  90                  95      


Xaa Ala Glu Phe Gln Glu Arg Leu Lys Glu Xaa Thr Ser Phe Gly Gly 
            100                 105                 110         


Xaa Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Xaa Pro Gln Glu Pro Asp Ser Ser Val Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Xaa Ala Leu Pro Thr Tyr Xaa Asn His Leu 
                245                 250                 255     


Tyr Lys Xaa Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Xaa Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Xaa Trp Gln Arg Leu Ile Asn Xaa 
    290                 295                 300                 


Asn Xaa Gly Phe Arg Pro Lys Arg Leu Xaa Phe Lys Leu Phe Xaa Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Xaa Gly Val Xaa Thr Ile Ala Xaa 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Xaa 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Xaa Leu Arg Thr Gly Xaa Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Xaa Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Xaa Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Xaa Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Xaa Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Xaa 
            500                 505                 510         


Gly Arg Xaa Ser Leu Xaa Asn Pro Gly Pro Ala Xaa Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Xaa Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Xaa Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Xaa Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Xaa Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Xaa Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Xaa Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Xaa 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Xaa Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Xaa Lys Asp Xaa Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Xaa Glu Asn Ser Xaa Arg Xaa Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Xaa Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Xaa Leu 
                725                 730                 735     


<210>  9
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV9 vp1 coding sequence

<400>  9
atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga aggaattcgc       60

gagtggtggg ctttgaaacc tggagcccct caacccaagg caaatcaaca acatcaagac      120

aacgctcgag gtcttgtgct tccgggttac aaataccttg gacccggcaa cggactcgac      180

aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc      480

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag      540

tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct      600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga      660

gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcagcga      900

ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct cttcaacatt      960

caggtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac     1080

gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatctg     1140

acgcttaatg atggaagcca ggccgtgggt cgttcgtcct tttactgcct ggaatatttc     1200

ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta     1260

cctttccata gcagctacgc tcacagccaa agcctggacc gactaatgaa tccactcatc     1320

gaccaatact tgtactatct ctcaaagact attaacggtt ctggacagaa tcaacaaacg     1380

ctaaaattca gtgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct     1440

ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa     1500

tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct     1560

ggacctgcta tggccagcca caaagaagga gaggaccgtt tctttccttt gtctggatct     1620

ttaatttttg gcaaacaagg aactggaaga gacaacgtgg atgcggacaa agtcatgata     1680

accaacgaag aagaaattaa aactactaac ccggtagcaa cggagtccta tggacaagtg     1740

gccacaaacc accagagtgc ccaagcacag gcgcagaccg gctgggttca aaaccaagga     1800

atacttccgg gtatggtttg gcaggacaga gatgtgtacc tgcaaggacc catttgggcc     1860

aaaattcctc acacggacgg caactttcac ccttctccgc tgatgggagg gtttggaatg     1920

aagcacccgc ctcctcagat cctcatcaaa aacacacctg tacctgcgga tcctccaacg     1980

gccttcaaca aggacaagct gaactctttc atcacccagt attctactgg ccaagtcagc     2040

gtggagatcg agtgggagct gcagaaggaa aacagcaagc gctggaaccc ggagatccag     2100

tacacttcca actattacaa gtctaataat gttgaatttg ctgttaatac tgaaggtgta     2160

tatagtgaac cccgccccat tggcaccaga tacctgactc gtaatctgta a              2211


<210>  10
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Encoded AAV9 vp1 amino acid sequence

<400>  10

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  11
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAVhu31 vp1 coding sequence

<400>  11
atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga aggaattcgc       60

gagtggtggg ctttgaaacc tggagcccct caacccaagg caaatcaaca acatcaagac      120

aacgctcgag gtcttgtgct tccgggttac aaataccttg gacccggcaa cggactcgac      180

aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc      480

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag      540

tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct      600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga      660

gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcagcga      900

ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct cttcaacatt      960

caggtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac     1080

gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatctg     1140

acgcttaatg atggaagcca ggccgtgggt cgttcgtcct tttactgcct ggaatatttc     1200

ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta     1260

cctttccata gcagctacgc tcacagccaa agcctggacc gactaatgaa tccactcatc     1320

gaccaatact tgtactatct ctcaaagact attaacggtt ctggacagaa tcaacaaacg     1380

ctaaaattca gtgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct     1440

ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa     1500

tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct     1560

ggacctgcta tggccagcca caaagaagga gaggaccgtt tctttccttt gtctggatct     1620

ttaatttttg gcaaacaagg aactggaaga gacaacgtgg atgcggacaa agtcatgata     1680

accaacgaag aagaaattaa aactactaac ccggtagcaa cggagtccta tggacaagtg     1740

gccacaaacc accagagtgc ccaagcacag gcgcagaccg gctgggttca aaaccaagga     1800

atacttccgg gtatggtttg gcaggacaga gatgtgtacc tgcaaggacc catttgggcc     1860

aaaattcctc acacggacgg caactttcac ccttctccgc tgatgggagg gtttggaatg     1920

aagcacccgc ctcctcagat cctcatcaaa aacacacctg tacctgcgga tcctccaacg     1980

gccttcaaca aggacaagct gaactctttc atcacccagt attctactgg ccaagtcagc     2040

gtggagatcg agtgggagct gcagaaggaa aacagcaagc gctggaaccc ggagatccag     2100

tacacttcca actattacaa gtctaataat gttgaatttg ctgttaatac tgaaggtgta     2160

tatagtgaac cccgccccat tggcaccaga tacctgactc gtaatctgta a              2211


<210>  12
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Encoded AAVhu31 vp1 amino acid sequence

<400>  12

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
            20                  25                  30          


Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ser Gln Pro Ala Lys Lys Lys Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Gly Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Ser Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  13
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAVhu32 vp1 coding sequence

<400>  13
atggctgccg atggttatct tccagattgg ctcgaggaca ctctctctga aggaataaga       60

cagtggtgga agctcaaacc tggcccacca ccaccaaagc ccgcagagcg gcataaggac      120

gacagcaggg gtcttgtgct tcctgggtac aagtacctcg gacccggcaa cggactcgac      180

aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc      480

aaatcgggtt cacagcccgc taaaaagaaa ctcaatttcg gtcagactgg cgacacagag      540

tcagtccccg accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct      600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga      660

gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcagcga      900

ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct cttcaacatt      960

caggtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac     1080

gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatctg     1140

acgcttaatg atgggagcca ggccgtgggt cgttcgtcct tttactgcct ggaatatttc     1200

ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta     1260

cctttccata gcagctacgc tcacagccaa agcctggacc gactaatgaa tccactcatc     1320

gaccaatact tgtactatct ctcaaagact attaacggtt ctggacagaa tcaacaaacg     1380

ctaaaattca gcgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct     1440

ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa     1500

tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct     1560

ggacctgcta tggccagcca caaagaagga gaggaccgtt tctttccttt gtctggatct     1620

ttaatttttg gcaaacaagg aactggaaga gacaacgtgg atgcggacaa agtcatgata     1680

accaacgaag aagaaattaa aactactaac ccggtagcaa cggagtccta tggacaagtg     1740

gccacaaacc accagagtgc ccaagcacag gcgcagaccg gctgggttca aaaccaagga     1800

atacttccgg gtatggtttg gcaggacaga gatgtgtacc tgcaaggacc catttgggcc     1860

aaaattcctc acacggacgg caactttcac ccttctccgc taatgggagg gtttggaatg     1920

aagcacccgc ctcctcagat cctcatcaaa aacacacctg tacctgcgga tcctccaacg     1980

gctttcaata aggacaagct gaactctttc atcacccagt attctactgg ccaagtcagc     2040

gtggagattg agtgggagct gcagaaggaa aacagcaagc gctggaaccc ggagatccag     2100

tacacttcca actattacaa gtctaataat gttgaatttg ctgttaatac tgaaggtgta     2160

tatagtgaac cccgccccat tggcaccaga tacctgactc gtaatctgta a              2211


<210>  14
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Encoded AAVhu32 vp1 amino acid sequence

<400>  14

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
            20                  25                  30          


Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ser Gln Pro Ala Lys Lys Lys Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  15
<211>  423
<212>  PRT
<213>  Homo sapiens

<400>  15

Met Gly Met Tyr Pro Gly Val Leu Val Pro Ser Ser Arg Gly Gly Leu 
1               5                   10                  15      


Pro Leu Glu Glu Val Thr Val Ala Glu Val Leu Ala Ala Arg Gly Tyr 
            20                  25                  30          


Leu Thr Gly Met Ala Gly Lys Trp His Leu Gly Val Gly Pro Glu Gly 
        35                  40                  45              


Ala Phe Leu Pro Pro His Gln Gly Phe His Arg Phe Leu Gly Ile Pro 
    50                  55                  60                  


Tyr Ser His Asp Gln Gly Pro Cys Gln Asn Leu Thr Cys Phe Pro Pro 
65                  70                  75                  80  


Ala Thr Pro Cys Asp Gly Gly Cys Asp Gln Gly Leu Val Pro Ile Pro 
                85                  90                  95      


Leu Leu Ala Asn Leu Ser Val Glu Ala Gln Pro Pro Trp Leu Pro Gly 
            100                 105                 110         


Leu Glu Ala Arg Tyr Met Ala Phe Ala His Asp Leu Met Ala Asp Ala 
        115                 120                 125             


Gln Arg Gln Asp Arg Pro Phe Phe Leu Tyr Tyr Ala Ser His His Thr 
    130                 135                 140                 


His Tyr Pro Gln Phe Ser Gly Gln Ser Phe Ala Glu Arg Ser Gly Arg 
145                 150                 155                 160 


Gly Pro Phe Gly Asp Ser Leu Met Glu Leu Asp Ala Ala Val Gly Thr 
                165                 170                 175     


Leu Met Thr Ala Ile Gly Asp Leu Gly Leu Leu Glu Glu Thr Leu Val 
            180                 185                 190         


Ile Phe Thr Ala Asp Asn Gly Pro Glu Thr Met Arg Met Ser Arg Gly 
        195                 200                 205             


Gly Cys Ser Gly Leu Leu Arg Cys Gly Lys Gly Thr Thr Tyr Glu Gly 
    210                 215                 220                 


Gly Val Arg Glu Pro Ala Leu Ala Phe Trp Pro Gly His Ile Ala Pro 
225                 230                 235                 240 


Gly Val Thr His Glu Leu Ala Ser Ser Leu Asp Leu Leu Pro Thr Leu 
                245                 250                 255     


Ala Ala Leu Ala Gly Ala Pro Leu Pro Asn Val Thr Leu Asp Gly Phe 
            260                 265                 270         


Asp Leu Ser Pro Leu Leu Leu Gly Thr Gly Lys Ser Pro Arg Gln Ser 
        275                 280                 285             


Leu Phe Phe Tyr Pro Ser Tyr Pro Asp Glu Val Arg Gly Val Phe Ala 
    290                 295                 300                 


Val Arg Thr Gly Lys Tyr Lys Ala His Phe Phe Thr Gln Gly Ser Ala 
305                 310                 315                 320 


His Ser Asp Thr Thr Ala Asp Pro Ala Cys His Ala Ser Ser Ser Leu 
                325                 330                 335     


Thr Ala His Glu Pro Pro Leu Leu Tyr Asp Leu Ser Lys Asp Pro Gly 
            340                 345                 350         


Glu Asn Tyr Asn Leu Leu Gly Gly Val Ala Gly Ala Thr Pro Glu Val 
        355                 360                 365             


Leu Gln Ala Leu Lys Gln Leu Gln Leu Leu Lys Ala Gln Leu Asp Ala 
    370                 375                 380                 


Ala Val Thr Phe Gly Pro Ser Gln Val Ala Arg Gly Glu Asp Pro Ala 
385                 390                 395                 400 


Leu Gln Ile Cys Cys His Pro Gly Cys Thr Pro Arg Pro Ala Cys Cys 
                405                 410                 415     


His Cys Pro Asp Pro His Ala 
            420             


<210>  16
<211>  666
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  chicken beta actin promoter with a cytomegalovirus enhancer (CB7)

<400>  16
ctagtcgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattagttc       60

atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac      120

cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa      180

tagggacttt ccattgacgt caatgggtgg actatttacg gtaaactgcc cacttggcag      240

tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac ggtaaatggc      300

ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct      360

acgtattagt catcgctatt accatggtcg aggtgagccc cacgttctgc ttcactctcc      420

ccatctcccc cccctcccca cccccaattt tgtatttatt tattttttaa ttattttgtg      480

cagcgatggg ggcggggggg gggggggggc gcgcgccagg cggggcgggg cggggcgagg      540

ggcggggcgg ggcgaggcgg agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa      600

agtttccttt tatggcgagg cggcggcggc ggcggcccta taaaaagcga agcgcgcggc      660

gggcgg                                                                 666


<210>  17
<211>  973
<212>  DNA
<213>  Artificial sequence

<220>
<223>  chicken beta-actin intron

<400>  17
gtgagcgggc gggacggccc ttctcctccg ggctgtaatt agcgcttggt ttaatgacgg       60

cttgtttctt ttctgtggct gcgtgaaagc cttgaggggc tccgggaggg ccctttgtgc      120

ggggggagcg gctcgggggg tgcgtgcgtg tgtgtgtgcg tggggagcgc cgcgtgcggc      180

tccgcgctgc ccggcggctg tgagcgctgc gggcgcggcg cggggctttg tgcgctccgc      240

agtgtgcgcg aggggagcgc ggccgggggc ggtgccccgc ggtgcggggg gggctgcgag      300

gggaacaaag gctgcgtgcg gggtgtgtgc gtgggggggt gagcaggggg tgtgggcgcg      360

tcggtcgggc tgcaaccccc cctgcacccc cctccccgag ttgctgagca cggcccggct      420

tcgggtgcgg ggctccgtac ggggcgtggc gcggggctcg ccgtgccggg cggggggtgg      480

cggcaggtgg gggtgccggg cggggcgggg ccgcctcggg ccggggaggg ctcgggggag      540

gggcgcggcg gcccccggag cgccggcggc tgtcgaggcg cggcgagccg cagccattgc      600

cttttatggt aatcgtgcga gagggcgcag ggacttcctt tgtcccaaat ctgtgcggag      660

ccgaaatctg ggaggcgccg ccgcaccccc tctagcgggc gcggggcgaa gcggtgcggc      720

gccggcagga aggaaatggg cggggagggc cttcgtgcgt cgccgcgccg ccgtcccctt      780

ctccctctcc agcctcgggg ctgtccgcgg ggggacggct gccttcgggg gggacggggc      840

agggcggggt tcggcttctg gcgtgtgacc ggcggctcta gagcctctgc taaccatgtt      900

catgccttct tctttttcct acagctcctg ggcaacgtgc tggttattgt gctgtctcat      960

cattttggca aag                                                         973


<210>  18
<211>  282
<212>  DNA
<213>  Artificial sequence

<220>
<223>  CB promoter

<400>  18
tggtcgaggt gagccccacg ttctgcttca ctctccccat ctcccccccc tccccacccc       60

caattttgta tttatttatt ttttaattat tttgtgcagc gatgggggcg gggggggggg      120

gggggcgcgc gccaggcggg gcggggcggg gcgaggggcg gggcggggcg aggcggagag      180

gtgcggcggc agccaatcag agcggcgcgc tccgaaagtt tccttttatg gcgaggcggc      240

ggcggcggcg gccctataaa aagcgaagcg cgcggcgggc gg                         282


<210>  19
<211>  382
<212>  DNA
<213>  Artificial sequence

<220>
<223>  CMV Immediate early Promoter

<400>  19
ctagtcgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattagttc       60

atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac      120

cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa      180

tagggacttt ccattgacgt caatgggtgg actatttacg gtaaactgcc cacttggcag      240

tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac ggtaaatggc      300

ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct      360

acgtattagt catcgctatt ac                                               382


<210>  20
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  miR183

<400>  20
agtgaattct accagtgcca ta                                                22


<210>  21
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  miRNA target sequence

<400>  21
agcaaaaatg tgctagtgcc aaa                                               23


<210>  22
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  miRNA target sequence

<400>  22
agtgtgagtt ctaccattgc caaa                                              24


<210>  23
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  miRNA target sequence

<400>  23
agggattcct gggaaaactg gac                                               23


<210>  24
<211>  16
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Spacer

<400>  24
atgacttaaa ccaggt                                                       16


