                         SEQUENCE LISTING

<110>  The Trustees of The University of Pennsylvania
 
<120>  RECOMBINANT ADENO-ASSOCIATED VIRUSES FOR LESCH-NYHAN DISORDERS 
       AND USES THEREOF

<130>  UPN-21-9749.PCT

<150>  US 63/208,280
<151>  2021-06-08

<150>  US 63/341,699
<151>  2022-05-13

<160>  22    

<170>  PatentIn version 3.5

<210>  1
<211>  3006
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  vector genome CB7.CI.HPRT.RBG


<220>
<221>  repeat_region
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  promoter
<222>  (198)..(863)
<223>  CB7 hybrid promoter

<220>
<221>  misc_feature
<222>  (198)..(579)
<223>  CMV IE enhancer

<220>
<221>  misc_feature
<222>  (582)..(863)
<223>  CB promoter

<220>
<221>  TATA_signal
<222>  (836)..(839)
<223>  TATA

<220>
<221>  Intron
<222>  (958)..(1930)
<223>  chicken beta-actin intron

<220>
<221>  CDS
<222>  (1942)..(2601)
<223>  HPRT

<220>
<221>  polyA_signal
<222>  (2662)..(2788)
<223>  Rabbit beta globin polyA

<220>
<221>  repeat_region
<222>  (2877)..(3006)
<223>  3' ITR

<400>  1
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctaccag ggtaatgggg      180

atcctctaga actatagcta gtcgacattg attattgact agttattaat agtaatcaat      240

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa      300

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt      360

tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt atttacggta      420

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc ctattgacgt      480

caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat gggactttcc      540

tacttggcag tacatctacg tattagtcat cgctattacc atggtcgagg tgagccccac      600

gttctgcttc actctcccca tctccccccc ctccccaccc ccaattttgt atttatttat      660

tttttaatta ttttgtgcag cgatgggggc gggggggggg ggggggcgcg cgccaggcgg      720

ggcggggcgg ggcgaggggc ggggcggggc gaggcggaga ggtgcggcgg cagccaatca      780

gagcggcgcg ctccgaaagt ttccttttat ggcgaggcgg cggcggcggc ggccctataa      840

aaagcgaagc gcgcggcggg cggggagtcg ctgcgacgct gccttcgccc cgtgccccgc      900

tccgccgccg cctcgcgccg cccgccccgg ctctgactga ccgcgttact cccacaggtg      960

agcgggcggg acggcccttc tcctccgggc tgtaattagc gcttggttta atgacggctt     1020

gtttcttttc tgtggctgcg tgaaagcctt gaggggctcc gggagggccc tttgtgcggg     1080

gggagcggct cggggggtgc gtgcgtgtgt gtgtgcgtgg ggagcgccgc gtgcggctcc     1140

gcgctgcccg gcggctgtga gcgctgcggg cgcggcgcgg ggctttgtgc gctccgcagt     1200

gtgcgcgagg ggagcgcggc cgggggcggt gccccgcggt gcgggggggg ctgcgagggg     1260

aacaaaggct gcgtgcgggg tgtgtgcgtg ggggggtgag cagggggtgt gggcgcgtcg     1320

gtcgggctgc aaccccccct gcacccccct ccccgagttg ctgagcacgg cccggcttcg     1380

ggtgcggggc tccgtacggg gcgtggcgcg gggctcgccg tgccgggcgg ggggtggcgg     1440

caggtggggg tgccgggcgg ggcggggccg cctcgggccg gggagggctc gggggagggg     1500

cgcggcggcc cccggagcgc cggcggctgt cgaggcgcgg cgagccgcag ccattgcctt     1560

ttatggtaat cgtgcgagag ggcgcaggga cttcctttgt cccaaatctg tgcggagccg     1620

aaatctggga ggcgccgccg caccccctct agcgggcgcg gggcgaagcg gtgcggcgcc     1680

ggcaggaagg aaatgggcgg ggagggcctt cgtgcgtcgc cgcgccgccg tccccttctc     1740

cctctccagc ctcggggctg tccgcggggg gacggctgcc ttcggggggg acggggcagg     1800

gcggggttcg gcttctggcg tgtgaccggc ggctctagag cctctgctaa ccatgttcat     1860

gccttcttct ttttcctaca gctcctgggc aacgtgctgg ttattgtgct gtctcatcat     1920

tttggcaaag aattcgccac c atg gcc aca aga tct ccc ggc gtg gtc atc       1971
                        Met Ala Thr Arg Ser Pro Gly Val Val Ile           
                        1               5                   10            

agc gac gac gag cct ggc tac gac ctg gac ctg ttc tgc atc ccc aat       2019
Ser Asp Asp Glu Pro Gly Tyr Asp Leu Asp Leu Phe Cys Ile Pro Asn           
                15                  20                  25                

cac tac gcc gag gac ctg gaa cgg gtg ttc att cct cac ggc ctg atc       2067
His Tyr Ala Glu Asp Leu Glu Arg Val Phe Ile Pro His Gly Leu Ile           
            30                  35                  40                    

atg gac cgg acc gaa aga ctg gcc cgg gac gtg atg aag gaa atg ggc       2115
Met Asp Arg Thr Glu Arg Leu Ala Arg Asp Val Met Lys Glu Met Gly           
        45                  50                  55                        

gga cac cac atc gtg gcc ctg tgt gtt ctg aaa ggc ggc tac aag ttc       2163
Gly His His Ile Val Ala Leu Cys Val Leu Lys Gly Gly Tyr Lys Phe           
    60                  65                  70                            

ttc gcc gac ctg ctg gac tac atc aag gcc ctg aac cgg aac agc gat       2211
Phe Ala Asp Leu Leu Asp Tyr Ile Lys Ala Leu Asn Arg Asn Ser Asp           
75                  80                  85                  90            

cgg agc atc cct atg acc gtg gac ttc atc agg ctg aag tcc tac tgc       2259
Arg Ser Ile Pro Met Thr Val Asp Phe Ile Arg Leu Lys Ser Tyr Cys           
                95                  100                 105               

aac gac cag agc acc ggc gac atc aaa gtg atc ggc ggc gac gat ctg       2307
Asn Asp Gln Ser Thr Gly Asp Ile Lys Val Ile Gly Gly Asp Asp Leu           
            110                 115                 120                   

agc acc ctg aca ggc aag aac gtg ctg atc gtg gaa gat atc atc gac       2355
Ser Thr Leu Thr Gly Lys Asn Val Leu Ile Val Glu Asp Ile Ile Asp           
        125                 130                 135                       

acc ggc aag acc atg cag acc ctg ctg tct ctc gtg cgg cag tac aac       2403
Thr Gly Lys Thr Met Gln Thr Leu Leu Ser Leu Val Arg Gln Tyr Asn           
    140                 145                 150                           

ccc aag atg gtc aag gtg gcc agc ctg ctg gtc aag aga acc cct aga       2451
Pro Lys Met Val Lys Val Ala Ser Leu Leu Val Lys Arg Thr Pro Arg           
155                 160                 165                 170           

agc gtg ggc tac aag ccc gac ttc gtg ggc ttc gag atc ccc gac aag       2499
Ser Val Gly Tyr Lys Pro Asp Phe Val Gly Phe Glu Ile Pro Asp Lys           
                175                 180                 185               

ttc gtc gtg ggc tac gcc ctg gat tac aac gag tac ttc cgg gac ctg       2547
Phe Val Val Gly Tyr Ala Leu Asp Tyr Asn Glu Tyr Phe Arg Asp Leu           
            190                 195                 200                   

aac cac gtg tgc gtg atc agc gaa aca ggc aag gcc aag tac aag gcc       2595
Asn His Val Cys Val Ile Ser Glu Thr Gly Lys Ala Lys Tyr Lys Ala           
        205                 210                 215                       

tga tga ggtacctcta gagtcgaccc gggcggcctc gaggacgggg tgaactacgc        2651

ctgaggatcc gatctttttc cctctgccaa aaattatggg gacatcatga agccccttga     2711

gcatctgact tctggctaat aaaggaaatt tattttcatt gcaatagtgt gttggaattt     2771

tttgtgtctc tcactcggaa gcaattcgtt gatctgaatt tcgaccaccc ataataccca     2831

ttaccctggt agataagtag catggcgggt taatcattaa ctacaaggaa cccctagtga     2891

tggagttggc cactccctct ctgcgcgctc gctcgctcac tgaggccggg cgaccaaagg     2951

tcgcccgacg cccgggcttt gcccgggcgg cctcagtgag cgagcgagcg cgcag          3006


<210>  2
<211>  218
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  2

Met Ala Thr Arg Ser Pro Gly Val Val Ile Ser Asp Asp Glu Pro Gly 
1               5                   10                  15      


Tyr Asp Leu Asp Leu Phe Cys Ile Pro Asn His Tyr Ala Glu Asp Leu 
            20                  25                  30          


Glu Arg Val Phe Ile Pro His Gly Leu Ile Met Asp Arg Thr Glu Arg 
        35                  40                  45              


Leu Ala Arg Asp Val Met Lys Glu Met Gly Gly His His Ile Val Ala 
    50                  55                  60                  


Leu Cys Val Leu Lys Gly Gly Tyr Lys Phe Phe Ala Asp Leu Leu Asp 
65                  70                  75                  80  


Tyr Ile Lys Ala Leu Asn Arg Asn Ser Asp Arg Ser Ile Pro Met Thr 
                85                  90                  95      


Val Asp Phe Ile Arg Leu Lys Ser Tyr Cys Asn Asp Gln Ser Thr Gly 
            100                 105                 110         


Asp Ile Lys Val Ile Gly Gly Asp Asp Leu Ser Thr Leu Thr Gly Lys 
        115                 120                 125             


Asn Val Leu Ile Val Glu Asp Ile Ile Asp Thr Gly Lys Thr Met Gln 
    130                 135                 140                 


Thr Leu Leu Ser Leu Val Arg Gln Tyr Asn Pro Lys Met Val Lys Val 
145                 150                 155                 160 


Ala Ser Leu Leu Val Lys Arg Thr Pro Arg Ser Val Gly Tyr Lys Pro 
                165                 170                 175     


Asp Phe Val Gly Phe Glu Ile Pro Asp Lys Phe Val Val Gly Tyr Ala 
            180                 185                 190         


Leu Asp Tyr Asn Glu Tyr Phe Arg Asp Leu Asn His Val Cys Val Ile 
        195                 200                 205             


Ser Glu Thr Gly Lys Ala Lys Tyr Lys Ala 
    210                 215             


<210>  3
<211>  660
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  engineered nucleic acid sequence HPRT

<400>  3
atggccacaa gatctcccgg cgtggtcatc agcgacgacg agcctggcta cgacctggac       60

ctgttctgca tccccaatca ctacgccgag gacctggaac gggtgttcat tcctcacggc      120

ctgatcatgg accggaccga aagactggcc cgggacgtga tgaaggaaat gggcggacac      180

cacatcgtgg ccctgtgtgt tctgaaaggc ggctacaagt tcttcgccga cctgctggac      240

tacatcaagg ccctgaaccg gaacagcgat cggagcatcc ctatgaccgt ggacttcatc      300

aggctgaagt cctactgcaa cgaccagagc accggcgaca tcaaagtgat cggcggcgac      360

gatctgagca ccctgacagg caagaacgtg ctgatcgtgg aagatatcat cgacaccggc      420

aagaccatgc agaccctgct gtctctcgtg cggcagtaca accccaagat ggtcaaggtg      480

gccagcctgc tggtcaagag aacccctaga agcgtgggct acaagcccga cttcgtgggc      540

ttcgagatcc ccgacaagtt cgtcgtgggc tacgccctgg attacaacga gtacttccgg      600

gacctgaacc acgtgtgcgt gatcagcgaa acaggcaagg ccaagtacaa ggcctgatga      660


<210>  4
<211>  218
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence HPRT

<400>  4

Met Ala Thr Arg Ser Pro Gly Val Val Ile Ser Asp Asp Glu Pro Gly 
1               5                   10                  15      


Tyr Asp Leu Asp Leu Phe Cys Ile Pro Asn His Tyr Ala Glu Asp Leu 
            20                  25                  30          


Glu Arg Val Phe Ile Pro His Gly Leu Ile Met Asp Arg Thr Glu Arg 
        35                  40                  45              


Leu Ala Arg Asp Val Met Lys Glu Met Gly Gly His His Ile Val Ala 
    50                  55                  60                  


Leu Cys Val Leu Lys Gly Gly Tyr Lys Phe Phe Ala Asp Leu Leu Asp 
65                  70                  75                  80  


Tyr Ile Lys Ala Leu Asn Arg Asn Ser Asp Arg Ser Ile Pro Met Thr 
                85                  90                  95      


Val Asp Phe Ile Arg Leu Lys Ser Tyr Cys Asn Asp Gln Ser Thr Gly 
            100                 105                 110         


Asp Ile Lys Val Ile Gly Gly Asp Asp Leu Ser Thr Leu Thr Gly Lys 
        115                 120                 125             


Asn Val Leu Ile Val Glu Asp Ile Ile Asp Thr Gly Lys Thr Met Gln 
    130                 135                 140                 


Thr Leu Leu Ser Leu Val Arg Gln Tyr Asn Pro Lys Met Val Lys Val 
145                 150                 155                 160 


Ala Ser Leu Leu Val Lys Arg Thr Pro Arg Ser Val Gly Tyr Lys Pro 
                165                 170                 175     


Asp Phe Val Gly Phe Glu Ile Pro Asp Lys Phe Val Val Gly Tyr Ala 
            180                 185                 190         


Leu Asp Tyr Asn Glu Tyr Phe Arg Asp Leu Asn His Val Cys Val Ile 
        195                 200                 205             


Ser Glu Thr Gly Lys Ala Lys Tyr Lys Ala 
    210                 215             


<210>  5
<211>  2229
<212>  DNA
<213>  adeno-associated virus PHP.eB

<400>  5
atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga aggaattcgc       60

gagtggtggg ctttgaaacc tggagcccct caacccaagg caaatcaaca acatcaagac      120

aacgctagag gtcttgtgct tccgggttac aaataccttg gacccggcaa cggactcgac      180

aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa agcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc      480

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag      540

tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct      600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga      660

gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcagcga      900

ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct ctttaacatt      960

caggtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac     1080

gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatctg     1140

acgcttaatg atggaagcca ggccgtgggt cgttcgtcct tttactgcct ggaatatttc     1200

ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta     1260

cctttccata gcagctacgc tcacagccaa agcctggacc gactaatgaa tccactcatc     1320

gaccaatact tgtactatct ctctagaact attaacggtt ctggacagaa tcaacaaacg     1380

ctaaaattca gtgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct     1440

ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa     1500

tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct     1560

ggacctgcta tggcctctca caaagaagga gaggaccgtt tctttccttt gtctggatct     1620

ttaatttttg gcaaacaagg tactggcaga gacaacgtgg atgcggacaa agtcatgata     1680

accaacgaag aagaaattaa aactactaac ccggtagcaa cggagtccta tggacaagtg     1740

gccacaaacc accagagtga tgggactttg gcggtgcctt ttaaggcaca ggcgcagacc     1800

ggttgggttc aaaaccaagg aatacttccg ggtatggttt ggcaggacag agatgtgtac     1860

ctgcaaggac ccatttgggc caaaattcct cacacggacg gcaactttca cccttctccg     1920

ctgatgggag ggtttggaat gaagcacccg cctcctcaga tcctcatcaa aaacacacct     1980

gtacctgcgg atcctccaac ggccttcaac aaggacaagc tgaactcttt catcacccag     2040

tattctactg gtcaagtcag cgtggagatc gagtgggagc tgcagaagga aaacagcaag     2100

cgctggaacc cggagatcca gtacacttcc aactattaca agtctaataa tgttgaattt     2160

gctgttaata ctgaaggtgt atatagtgaa ccccgcccca ttggcaccag atacctgact     2220

cgtaatctg                                                             2229


<210>  6
<211>  743
<212>  PRT
<213>  adeno-associated virus PHP.eB

<400>  6

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Arg Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Asp Gly Thr Leu Ala Val 
            580                 585                 590         


Pro Phe Lys Ala Gln Ala Gln Thr Gly Trp Val Gln Asn Gln Gly Ile 
        595                 600                 605             


Leu Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro 
    610                 615                 620                 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
625                 630                 635                 640 


Leu Met Gly Gly Phe Gly Met Lys His Pro Pro Pro Gln Ile Leu Ile 
                645                 650                 655     


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Ala Phe Asn Lys Asp 
            660                 665                 670         


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
        675                 680                 685             


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
    690                 695                 700                 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Asn Asn Val Glu Phe 
705                 710                 715                 720 


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr 
                725                 730                 735     


Arg Tyr Leu Thr Arg Asn Leu 
            740             


<210>  7
<211>  2211
<212>  DNA
<213>  adeno-associated virus hu68

<400>  7
atggctgccg atggttatct tccagattgg ctcgaggaca acctcagtga aggcattcgc       60

gagtggtggg ctttgaaacc tggagcccct caacccaagg caaatcaaca acatcaagac      120

aacgctcggg gtcttgtgct tccgggttac aaataccttg gacccggcaa cggactcgac      180

aagggggagc cggtcaacga agcagacgcg gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgt gggtattggc      480

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag      540

tcagtccccg accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct      600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga      660

gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct cttcaacatt      960

caggtcaaag aggttacgga caacaatgga gtcaagacca tcgctaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac     1080

gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatcta     1140

acgcttaatg atggaagcca agccgtgggt cgttcgtcct tttactgcct ggaatatttc     1200

ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta     1260

cctttccata gcagctatgc tcacagccaa agcctggacc gactcatgaa tccactcatc     1320

gaccaatact tgtactatct ctcaaagact attaacggtt ctggacagaa tcaacaaacg     1380

ctaaaattca gtgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct     1440

ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa     1500

tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct     1560

ggacctgcta tggccagcca caaagaagga gaggaccgtt tctttccttt gtctggatct     1620

ttaatttttg gcaaacaagg aactggaaga gacaacgtgg atgcggacaa agtcatgata     1680

accaacgaag aagaaattaa aactaccaac ccagtagcaa cggagtccta tggacaagtg     1740

gccacaaacc accagagtgc ccaagcacag gcgcagaccg gctgggttca aaaccaagga     1800

atacttccgg gtatggtttg gcaggacaga gatgtgtacc tgcaaggacc catttgggcc     1860

aaaattcctc acacggacgg caactttcac ccttctccgc tgatgggagg gtttggaatg     1920

aagcacccgc ctcctcagat cctcatcaaa aacacacctg tacctgcgga tcctccaacg     1980

gctttcaaca aggacaagct gaactctttc atcacccagt attctactgg ccaagtcagc     2040

gtggagattg agtgggagct gcagaaggaa aacagcaagc gctggaaccc ggagatccag     2100

tacacttcca actattacaa gtctaataat gttgaatttg ctgttaatac tgaaggtgtt     2160

tattctgaac cccgccccat tggcaccaga tacctgactc gtaatctgta a              2211


<210>  8
<211>  736
<212>  PRT
<213>  adeno-associated virus hu68

<400>  8

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Val Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  9
<211>  2211
<212>  DNA
<213>  adeno-associated virus hu68

<400>  9
atggctgccg atggttatct tccagattgg ctcgaggaca acctcagtga aggcattcgc       60

gagtggtggg ctttgaaacc tggagcccct caacccaagg caaatcaaca acatcaagac      120

aacgctcggg gtcttgtgct tccgggttac aaataccttg gacccggcaa cggactcgac      180

aagggggagc cggtcaacga agcagacgcg gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgt gggtattggc      480

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag      540

tcagtccccg accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct      600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga      660

gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct cttcaacatt      960

caggtcaaag aggttacgga caacaatgga gtcaagacca tcgctaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac     1080

gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta tggatacctc     1140

accctgaacg acggcagtca ggcggtgggc cgctcatcct tctactgcct ggagtacttc     1200

ccttcgcaga tgctgaggac tggcaacaac ttccagttca gctacgagtt cgagaacgtc     1260

cctttccaca gcagctacgc ccacagccag agtttggacc gcttgatgaa ccctctgatc     1320

gaccagtacc tgtactacct gtcaaagacg atcaacggtt ctggccagaa ccagcagacg     1380

ctgaagttca gcgtggccgg gcctagcaac atggccgtcc agggcagaaa ctacatccct     1440

gggcccagct accggcagca gagagtctca accactgtga ctcagaacaa caacagtgag     1500

ttcgcctggc ctggcgccag ctcttgggcc ctcaacggcc gcaactcgct gatgaaccca     1560

ggcccagcca tggccagtca caaggagggc gaggaccgtt tcttcccttt gtctggctct     1620

ctgatcttcg gcaagcaggg gaccggcaga gacaacgtgg acgcggacaa ggtcatgatc     1680

acgaacgagg aggagatcaa gaccaccaac cctgtggcaa ccgagtccta cggccaggtg     1740

gcaaccaacc accagagcgc ccaggcacag gcgcagactg gctgggtcca gaaccagggg     1800

atcctgcctg gcatggtgtg gcaggaccgt gacgtgtacc tgcagggccc tatctgggca     1860

aagatccctc acacggacgg caacttccac ccttctcctc tgatgggcgg cttcggcatg     1920

aagcacccgc ctcctcagat cctcatcaag aacactccgg tcccggcaga ccctccgacg     1980

gccttcaaca aggacaagct gaactcattc atcactcagt actccactgg ccaggtcagc     2040

gtggagatcg agtgggagct gcagaaggag aacagcaagc gttggaaccc agagatccag     2100

tacacttcca actactacaa gtctaacaac gtggagttcg ccgtcaacac tgagggtgtg     2160

tacagtgagc ctcgccctat cggcacccgg tacctcaccc gaaacttgtg a              2211


<210>  10
<211>  666
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CB7 hybrid promoter

<400>  10
ctagtcgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattagttc       60

atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac      120

cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa      180

tagggacttt ccattgacgt caatgggtgg agtatttacg gtaaactgcc cacttggcag      240

tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac ggtaaatggc      300

ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct      360

acgtattagt catcgctatt accatggtcg aggtgagccc cacgttctgc ttcactctcc      420

ccatctcccc cccctcccca cccccaattt tgtatttatt tattttttaa ttattttgtg      480

cagcgatggg ggcggggggg gggggggggc gcgcgccagg cggggcgggg cggggcgagg      540

ggcggggcgg ggcgaggcgg agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa      600

agtttccttt tatggcgagg cggcggcggc ggcggcccta taaaaagcga agcgcgcggc      660

gggcgg                                                                 666


<210>  11
<211>  973
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  chicken beta actin intron

<400>  11
gtgagcgggc gggacggccc ttctcctccg ggctgtaatt agcgcttggt ttaatgacgg       60

cttgtttctt ttctgtggct gcgtgaaagc cttgaggggc tccgggaggg ccctttgtgc      120

ggggggagcg gctcgggggg tgcgtgcgtg tgtgtgtgcg tggggagcgc cgcgtgcggc      180

tccgcgctgc ccggcggctg tgagcgctgc gggcgcggcg cggggctttg tgcgctccgc      240

agtgtgcgcg aggggagcgc ggccgggggc ggtgccccgc ggtgcggggg gggctgcgag      300

gggaacaaag gctgcgtgcg gggtgtgtgc gtgggggggt gagcaggggg tgtgggcgcg      360

tcggtcgggc tgcaaccccc cctgcacccc cctccccgag ttgctgagca cggcccggct      420

tcgggtgcgg ggctccgtac ggggcgtggc gcggggctcg ccgtgccggg cggggggtgg      480

cggcaggtgg gggtgccggg cggggcgggg ccgcctcggg ccggggaggg ctcgggggag      540

gggcgcggcg gcccccggag cgccggcggc tgtcgaggcg cggcgagccg cagccattgc      600

cttttatggt aatcgtgcga gagggcgcag ggacttcctt tgtcccaaat ctgtgcggag      660

ccgaaatctg ggaggcgccg ccgcaccccc tctagcgggc gcggggcgaa gcggtgcggc      720

gccggcagga aggaaatggg cggggagggc cttcgtgcgt cgccgcgccg ccgtcccctt      780

ctccctctcc agcctcgggg ctgtccgcgg ggggacggct gccttcgggg gggacggggc      840

agggcggggt tcggcttctg gcgtgtgacc ggcggctcta gagcctctgc taaccatgtt      900

catgccttct tctttttcct acagctcctg ggcaacgtgc tggttattgt gctgtctcat      960

cattttggca aag                                                         973


<210>  12
<211>  127
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  rabbit beta-globin polyA

<400>  12
gatctttttc cctctgccaa aaattatggg gacatcatga agccccttga gcatctgact       60

tctggctaat aaaggaaatt tattttcatt gcaatagtgt gttggaattt tttgtgtctc      120

tcactcg                                                                127


<210>  13
<211>  542
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  WPRE element (mut)

<400>  13
aatcaacctc tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct       60

ccttttacgc tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt      120

atggctttca ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg      180

tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact      240

ggttggggca ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct      300

attgccacgg cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg      360

ttgggcactg acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc      420

gcctgtgttg ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc      480

aatccagcgg accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt      540

cg                                                                     542


<210>  14
<211>  2591
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  expression cassette CB7.CI.HPRT.RBG


<220>
<221>  enhancer
<222>  (1)..(382)
<223>  CMV IE enhancer

<220>
<221>  promoter
<222>  (1)..(666)
<223>  CB7 hybrid promoter

<220>
<221>  promoter
<222>  (385)..(666)
<223>  CB promoter

<220>
<221>  TATA_signal
<222>  (639)..(642)
<223>  TATA signal

<220>
<221>  Intron
<222>  (761)..(1733)
<223>  chicken beta actin intron

<220>
<221>  misc_feature
<222>  (1745)..(2404)
<223>  HPRT

<220>
<221>  polyA_signal
<222>  (2465)..(2591)
<223>  Rabbit beta globin polyA

<400>  14
ctagtcgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattagttc       60

atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac      120

cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa      180

tagggacttt ccattgacgt caatgggtgg agtatttacg gtaaactgcc cacttggcag      240

tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac ggtaaatggc      300

ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct      360

acgtattagt catcgctatt accatggtcg aggtgagccc cacgttctgc ttcactctcc      420

ccatctcccc cccctcccca cccccaattt tgtatttatt tattttttaa ttattttgtg      480

cagcgatggg ggcggggggg gggggggggc gcgcgccagg cggggcgggg cggggcgagg      540

ggcggggcgg ggcgaggcgg agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa      600

agtttccttt tatggcgagg cggcggcggc ggcggcccta taaaaagcga agcgcgcggc      660

gggcggggag tcgctgcgac gctgccttcg ccccgtgccc cgctccgccg ccgcctcgcg      720

ccgcccgccc cggctctgac tgaccgcgtt actcccacag gtgagcgggc gggacggccc      780

ttctcctccg ggctgtaatt agcgcttggt ttaatgacgg cttgtttctt ttctgtggct      840

gcgtgaaagc cttgaggggc tccgggaggg ccctttgtgc ggggggagcg gctcgggggg      900

tgcgtgcgtg tgtgtgtgcg tggggagcgc cgcgtgcggc tccgcgctgc ccggcggctg      960

tgagcgctgc gggcgcggcg cggggctttg tgcgctccgc agtgtgcgcg aggggagcgc     1020

ggccgggggc ggtgccccgc ggtgcggggg gggctgcgag gggaacaaag gctgcgtgcg     1080

gggtgtgtgc gtgggggggt gagcaggggg tgtgggcgcg tcggtcgggc tgcaaccccc     1140

cctgcacccc cctccccgag ttgctgagca cggcccggct tcgggtgcgg ggctccgtac     1200

ggggcgtggc gcggggctcg ccgtgccggg cggggggtgg cggcaggtgg gggtgccggg     1260

cggggcgggg ccgcctcggg ccggggaggg ctcgggggag gggcgcggcg gcccccggag     1320

cgccggcggc tgtcgaggcg cggcgagccg cagccattgc cttttatggt aatcgtgcga     1380

gagggcgcag ggacttcctt tgtcccaaat ctgtgcggag ccgaaatctg ggaggcgccg     1440

ccgcaccccc tctagcgggc gcggggcgaa gcggtgcggc gccggcagga aggaaatggg     1500

cggggagggc cttcgtgcgt cgccgcgccg ccgtcccctt ctccctctcc agcctcgggg     1560

ctgtccgcgg ggggacggct gccttcgggg gggacggggc agggcggggt tcggcttctg     1620

gcgtgtgacc ggcggctcta gagcctctgc taaccatgtt catgccttct tctttttcct     1680

acagctcctg ggcaacgtgc tggttattgt gctgtctcat cattttggca aagaattcgc     1740

caccatggcc acaagatctc ccggcgtggt catcagcgac gacgagcctg gctacgacct     1800

ggacctgttc tgcatcccca atcactacgc cgaggacctg gaacgggtgt tcattcctca     1860

cggcctgatc atggaccgga ccgaaagact ggcccgggac gtgatgaagg aaatgggcgg     1920

acaccacatc gtggccctgt gtgttctgaa aggcggctac aagttcttcg ccgacctgct     1980

ggactacatc aaggccctga accggaacag cgatcggagc atccctatga ccgtggactt     2040

catcaggctg aagtcctact gcaacgacca gagcaccggc gacatcaaag tgatcggcgg     2100

cgacgatctg agcaccctga caggcaagaa cgtgctgatc gtggaagata tcatcgacac     2160

cggcaagacc atgcagaccc tgctgtctct cgtgcggcag tacaacccca agatggtcaa     2220

ggtggccagc ctgctggtca agagaacccc tagaagcgtg ggctacaagc ccgacttcgt     2280

gggcttcgag atccccgaca agttcgtcgt gggctacgcc ctggattaca acgagtactt     2340

ccgggacctg aaccacgtgt gcgtgatcag cgaaacaggc aaggccaagt acaaggcctg     2400

atgaggtacc tctagagtcg acccgggcgg cctcgaggac ggggtgaact acgcctgagg     2460

atccgatctt tttccctctg ccaaaaatta tggggacatc atgaagcccc ttgagcatct     2520

gacttctggc taataaagga aatttatttt cattgcaata gtgtgttgga attttttgtg     2580

tctctcactc g                                                          2591


<210>  15
<211>  382
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  cytomegalovirus immediate early (CMV IE) enhancer

<400>  15
ctagtcgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattagttc       60

atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac      120

cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa      180

tagggacttt ccattgacgt caatgggtgg agtatttacg gtaaactgcc cacttggcag      240

tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac ggtaaatggc      300

ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct      360

acgtattagt catcgctatt ac                                               382


<210>  16
<211>  282
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  chicken beta-actin promoter

<400>  16
tggtcgaggt gagccccacg ttctgcttca ctctccccat ctcccccccc tccccacccc       60

caattttgta tttatttatt ttttaattat tttgtgcagc gatgggggcg gggggggggg      120

gggggcgcgc gccaggcggg gcggggcggg gcgaggggcg gggcggggcg aggcggagag      180

gtgcggcggc agccaatcag agcggcgcgc tccgaaagtt tccttttatg gcgaggcggc      240

ggcggcggcg gccctataaa aagcgaagcg cgcggcgggc gg                         282


<210>  17
<211>  599
<212>  PRT
<213>  adeno-associated virus hu68 VP2

<400>  17

Thr Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Val Gly Ile Gly Lys Ser Gly Ala Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Thr Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Ile Gly Glu Pro Pro Ala Ala Pro Ser Gly Val Gly Ser Leu 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Val Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Ser Ser Ser Gly Asn Trp His Cys Asp Ser Gln 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Ser Thr Ser 
        115                 120                 125             


Gly Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Glu 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Gln Phe Ser Tyr Glu Phe Glu Asn Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Val Ala Gly Pro Ser Asn Met Ala Val 
                325                 330                 335     


Gln Gly Arg Asn Tyr Ile Pro Gly Pro Ser Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Val Thr Gln Asn Asn Asn Ser Glu Phe Ala Trp Pro Gly 
        355                 360                 365             


Ala Ser Ser Trp Ala Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Gly Thr Gly Arg Asp Asn Val 
                405                 410                 415     


Asp Ala Asp Lys Val Met Ile Thr Asn Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Ser Tyr Gly Gln Val Ala Thr Asn His Gln 
        435                 440                 445             


Ser Ala Gln Ala Gln Ala Gln Thr Gly Trp Val Gln Asn Gln Gly Ile 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Met Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Ala Phe Asn Lys Asp 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Asn Asn Val Glu Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Asn Leu 
        595                 


<210>  18
<211>  534
<212>  PRT
<213>  adeno-associated virus hu68 VP3

<400>  18

Met Ala Ser Gly Gly Gly Ala Pro Val Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Ser Ser Ser Gly Asn Trp His Cys Asp Ser Gln Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly 
    50                  55                  60                  


Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
65                  70                  75                  80  


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
                85                  90                  95      


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
            100                 105                 110         


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
        115                 120                 125             


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
    130                 135                 140                 


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Glu Gly 
145                 150                 155                 160 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
                165                 170                 175     


Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
            180                 185                 190         


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
        195                 200                 205             


Phe Gln Phe Ser Tyr Glu Phe Glu Asn Val Pro Phe His Ser Ser Tyr 
    210                 215                 220                 


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
225                 230                 235                 240 


Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln 
                245                 250                 255     


Gln Thr Leu Lys Phe Ser Val Ala Gly Pro Ser Asn Met Ala Val Gln 
            260                 265                 270         


Gly Arg Asn Tyr Ile Pro Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Thr Thr Val Thr Gln Asn Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala 
    290                 295                 300                 


Ser Ser Trp Ala Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
                325                 330                 335     


Gly Ser Leu Ile Phe Gly Lys Gln Gly Thr Gly Arg Asp Asn Val Asp 
            340                 345                 350         


Ala Asp Lys Val Met Ile Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Ser Tyr Gly Gln Val Ala Thr Asn His Gln Ser 
    370                 375                 380                 


Ala Gln Ala Gln Ala Gln Thr Gly Trp Val Gln Asn Gln Gly Ile Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Met Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys 
    450                 455                 460                 


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala 
            500                 505                 510         


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Asn Leu 
    530                 


<210>  19
<211>  137
<212>  PRT
<213>  adeno-associated virus hu68 VP1 fragment

<400>  19

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys 
    130                 135         


<210>  20
<211>  202
<212>  PRT
<213>  adeno-associated virus hu68 VP2 fragment

<400>  20

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Val Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr 
        195                 200         


<210>  21
<211>  1800
<212>  DNA
<213>  adeno-associated virus hu68 VP2

<400>  21
acggctcctg gaaagaagag gcctgtagag cagtctcctc aggaaccgga ctcctccgtg       60

ggtattggca aatcgggtgc acagcccgct aaaaagagac tcaatttcgg tcagactggc      120

gacacagagt cagtccccga ccctcaacca atcggagaac ctcccgcagc cccctcaggt      180

gtgggatctc ttacaatggc ttcaggtggt ggcgcaccag tggcagacaa taacgaaggt      240

gccgatggag tgggtagttc ctcgggaaat tggcattgcg attcccaatg gctgggggac      300

agagtcatca ccaccagcac ccgaacctgg gccctgccca cctacaacaa tcacctctac      360

aagcaaatct ccaacagcac atctggagga tcttcaaatg acaacgccta cttcggctac      420

agcaccccct gggggtattt tgacttcaac agattccact gccacttctc accacgtgac      480

tggcaaagac tcatcaacaa caactgggga ttccggccta agcgactcaa cttcaagctc      540

ttcaacattc aggtcaaaga ggttacggac aacaatggag tcaagaccat cgctaataac      600

cttaccagca cggtccaggt cttcacggac tcagactatc agctcccgta cgtgctcggg      660

tcggctcacg agggctgcct cccgccgttc ccagcggacg ttttcatgat tcctcagtac      720

gggtatctaa cgcttaatga tggaagccaa gccgtgggtc gttcgtcctt ttactgcctg      780

gaatatttcc cgtcgcaaat gctaagaacg ggtaacaact tccagttcag ctacgagttt      840

gagaacgtac ctttccatag cagctatgct cacagccaaa gcctggaccg actcatgaat      900

ccactcatcg accaatactt gtactatctc tcaaagacta ttaacggttc tggacagaat      960

caacaaacgc taaaattcag tgtggccgga cccagcaaca tggctgtcca gggaagaaac     1020

tacatacctg gacccagcta ccgacaacaa cgtgtctcaa ccactgtgac tcaaaacaac     1080

aacagcgaat ttgcttggcc tggagcttct tcttgggctc tcaatggacg taatagcttg     1140

atgaatcctg gacctgctat ggccagccac aaagaaggag aggaccgttt ctttcctttg     1200

tctggatctt taatttttgg caaacaagga actggaagag acaacgtgga tgcggacaaa     1260

gtcatgataa ccaacgaaga agaaattaaa actaccaacc cagtagcaac ggagtcctat     1320

ggacaagtgg ccacaaacca ccagagtgcc caagcacagg cgcagaccgg ctgggttcaa     1380

aaccaaggaa tacttccggg tatggtttgg caggacagag atgtgtacct gcaaggaccc     1440

atttgggcca aaattcctca cacggacggc aactttcacc cttctccgct gatgggaggg     1500

tttggaatga agcacccgcc tcctcagatc ctcatcaaaa acacacctgt acctgcggat     1560

cctccaacgg ctttcaacaa ggacaagctg aactctttca tcacccagta ttctactggc     1620

caagtcagcg tggagattga gtgggagctg cagaaggaaa acagcaagcg ctggaacccg     1680

gagatccagt acacttccaa ctattacaag tctaataatg ttgaatttgc tgttaatact     1740

gaaggtgttt attctgaacc ccgccccatt ggcaccagat acctgactcg taatctgtaa     1800


<210>  22
<211>  1605
<212>  DNA
<213>  adeno-associated virus hu68 VP3

<400>  22
atggcttcag gtggtggcgc accagtggca gacaataacg aaggtgccga tggagtgggt       60

agttcctcgg gaaattggca ttgcgattcc caatggctgg gggacagagt catcaccacc      120

agcacccgaa cctgggccct gcccacctac aacaatcacc tctacaagca aatctccaac      180

agcacatctg gaggatcttc aaatgacaac gcctacttcg gctacagcac cccctggggg      240

tattttgact tcaacagatt ccactgccac ttctcaccac gtgactggca aagactcatc      300

aacaacaact ggggattccg gcctaagcga ctcaacttca agctcttcaa cattcaggtc      360

aaagaggtta cggacaacaa tggagtcaag accatcgcta ataaccttac cagcacggtc      420

caggtcttca cggactcaga ctatcagctc ccgtacgtgc tcgggtcggc tcacgagggc      480

tgcctcccgc cgttcccagc ggacgttttc atgattcctc agtacgggta tctaacgctt      540

aatgatggaa gccaagccgt gggtcgttcg tccttttact gcctggaata tttcccgtcg      600

caaatgctaa gaacgggtaa caacttccag ttcagctacg agtttgagaa cgtacctttc      660

catagcagct atgctcacag ccaaagcctg gaccgactca tgaatccact catcgaccaa      720

tacttgtact atctctcaaa gactattaac ggttctggac agaatcaaca aacgctaaaa      780

ttcagtgtgg ccggacccag caacatggct gtccagggaa gaaactacat acctggaccc      840

agctaccgac aacaacgtgt ctcaaccact gtgactcaaa acaacaacag cgaatttgct      900

tggcctggag cttcttcttg ggctctcaat ggacgtaata gcttgatgaa tcctggacct      960

gctatggcca gccacaaaga aggagaggac cgtttctttc ctttgtctgg atctttaatt     1020

tttggcaaac aaggaactgg aagagacaac gtggatgcgg acaaagtcat gataaccaac     1080

gaagaagaaa ttaaaactac caacccagta gcaacggagt cctatggaca agtggccaca     1140

aaccaccaga gtgcccaagc acaggcgcag accggctggg ttcaaaacca aggaatactt     1200

ccgggtatgg tttggcagga cagagatgtg tacctgcaag gacccatttg ggccaaaatt     1260

cctcacacgg acggcaactt tcacccttct ccgctgatgg gagggtttgg aatgaagcac     1320

ccgcctcctc agatcctcat caaaaacaca cctgtacctg cggatcctcc aacggctttc     1380

aacaaggaca agctgaactc tttcatcacc cagtattcta ctggccaagt cagcgtggag     1440

attgagtggg agctgcagaa ggaaaacagc aagcgctgga acccggagat ccagtacact     1500

tccaactatt acaagtctaa taatgttgaa tttgctgtta atactgaagg tgtttattct     1560

gaaccccgcc ccattggcac cagatacctg actcgtaatc tgtaa                     1605


