                         SEQUENCE LISTING

<110>  Abeona Therapeutics, Inc.
 
<120>  RECOMBINANT ADENO-ASSOCIATED VIRAL VECTOR FOR GENE DELIVERY

<130>  ABEO-002/04WO 337067-2028

<150>  US 62/914,856
<151>  2019-10-14

<150>  US 62/863,126
<151>  2019-06-18

<150>  US 62/801,195
<151>  2019-02-05

<150>  US 62/775,871
<151>  2018-12-05

<160>  163   

<170>  PatentIn version 3.5

<210>  1
<211>  737
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV110 VPl

<400>  1

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 
            180                 185                 190         


Pro Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
    210                 215                 220                 


Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn 
            260                 265                 270         


His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn 
        435                 440                 445             


Arg Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe 
    450                 455                 460                 


Ser Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu 
465                 470                 475                 480 


Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp 
                485                 490                 495     


Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu 
            500                 505                 510         


Asn Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His 
        515                 520                 525             


Lys Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe 
    530                 535                 540                 


Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met 
545                 550                 555                 560 


Ile Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu 
                565                 570                 575     


Arg Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro 
            580                 585                 590         


Ala Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp 
        595                 600                 605             


Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro 
    610                 615                 620                 


His Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly 
625                 630                 635                 640 


Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro 
                645                 650                 655     


Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile 
            660                 665                 670         


Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu 
        675                 680                 685             


Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser 
    690                 695                 700                 


Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly 
705                 710                 715                 720 


Leu Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro 
                725                 730                 735     


Leu 
    


<210>  2
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV204 VP1

<400>  2

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
            260                 265                 270         


Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
        275                 280                 285             


His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 
    290                 295                 300                 


Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 
305                 310                 315                 320 


Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 
                325                 330                 335     


Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 
            340                 345                 350         


Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 
        355                 360                 365             


Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
    370                 375                 380                 


Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385                 390                 395                 400 


Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
                405                 410                 415     


Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 
            420                 425                 430         


Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 
        435                 440                 445             


Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser 
    450                 455                 460                 


Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 
            500                 505                 510         


Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 
        515                 520                 525             


Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 
    530                 535                 540                 


Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 
545                 550                 555                 560 


Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 
                565                 570                 575     


Phe Gly Thr Val Ala Val Asn Leu Gln Asn Ser Ser Thr Asp Pro Ala 
            580                 585                 590         


Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705                 710                 715                 720 


Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  3
<211>  735
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214 VPl

<400>  3

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
            260                 265                 270         


Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
        275                 280                 285             


His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 
    290                 295                 300                 


Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 
305                 310                 315                 320 


Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn Asn 
                325                 330                 335     


Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu Pro 
            340                 345                 350         


Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 
        355                 360                 365             


Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp Gly 
    370                 375                 380                 


Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385                 390                 395                 400 


Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
                405                 410                 415     


Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 
            420                 425                 430         


Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Lys 
        435                 440                 445             


Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser Gln 
    450                 455                 460                 


Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro Gly 
465                 470                 475                 480 


Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn Asn 
                485                 490                 495     


Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn Gly 
            500                 505                 510         


Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys Glu 
        515                 520                 525             


Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly Lys 
    530                 535                 540                 


Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu Thr 
545                 550                 555                 560 


Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr 
                565                 570                 575     


Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln Ile 
            580                 585                 590         


Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln Asn 
        595                 600                 605             


Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr 
    610                 615                 620                 


Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625                 630                 635                 640 


His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asp 
                645                 650                 655     


Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr Gln 
            660                 665                 670         


Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln Lys 
        675                 680                 685             


Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr 
    690                 695                 700                 


Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val Tyr 
705                 710                 715                 720 


Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735 


<210>  4
<211>  4287
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTRdeltaR

<400>  4
atgcagagaa gccccctgga gaaggcctct gtggtgagca agctgttctt cagctggacc       60

agacccatcc tgagaaaggg ctacagacag agactggagc tgtctgacat ctaccagatc      120

ccctctgtgg actctgctga caacctgtct gagaagctgg agagagagtg ggacagagag      180

ctggccagca agaagaaccc caagctgatc aatgccctga gaagatgctt cttctggaga      240

ttcatgttct atggcatctt cctgtacctg ggggaggtga ccaaggctgt gcagcccctg      300

ctgctgggca gaatcattgc cagctatgac cctgacaaca aggaggagag aagcattgcc      360

atctacctgg gcattggcct gtgcctgctg ttcattgtga gaaccctgct gctgcaccct      420

gccatctttg gcctgcacca cattggcatg cagatgagaa ttgccatgtt cagcctgatc      480

tacaagaaga ccctgaagct gagcagcaga gtgctggaca agatcagcat tggccagctg      540

gtgagcctgc tgagcaacaa cctgaacaag tttgatgagg gcctggccct ggcccacttt      600

gtgtggattg cccccctgca ggtggccctg ctgatgggcc tgatctggga gctgctgcag      660

gcctctgcct tctgtggcct gggcttcctg attgtgctgg ccctgttcca ggctggcctg      720

ggcagaatga tgatgaagta cagagaccag agagctggca agatctctga gagactggtg      780

atcacctctg agatgattga gaacatccag tctgtgaagg cctactgctg ggaggaggcc      840

atggagaaga tgattgagaa cctgagacag acagagctga agctgaccag aaaggctgcc      900

tatgtgagat acttcaacag ctctgccttc ttcttctctg gcttctttgt ggtgttcctg      960

tctgtgctgc cctatgccct gatcaagggc atcatcctga gaaagatctt caccaccatc     1020

agcttctgca ttgtgctgag aatggctgtg accagacagt tcccctgggc tgtgcagacc     1080

tggtatgaca gcctgggggc catcaacaag atccaggact tcctgcagaa gcaggagtac     1140

aagaccctgg agtacaacct gaccaccaca gaggtggtga tggagaatgt gacagccttc     1200

tgggaggagg gctttgggga gctgtttgag aaggccaagc agaacaacaa caacagaaag     1260

accagcaatg gggatgacag cctgttcttc agcaacttca gcctgctggg cacccctgtg     1320

ctgaaggaca tcaacttcaa gattgagaga ggccagctgc tggctgtggc tggcagcaca     1380

ggggctggca agaccagcct gctgatgatg atcatggggg agctggagcc ctctgagggc     1440

aagatcaagc actctggcag aatcagcttc tgcagccagt tcagctggat catgcctggc     1500

accatcaagg agaacatcat ctttggggtg agctatgatg agtacagata cagatctgtg     1560

atcaaggcct gccagctgga ggaggacatc agcaagtttg ctgagaagga caacattgtg     1620

ctgggggagg ggggcatcac cctgtctggg ggccagagag ccagaatcag cctggccaga     1680

gctgtgtaca aggatgctga cctgtacctg ctggacagcc cctttggcta cctggatgtg     1740

ctgacagaga aggagatctt tgagagctgt gtgtgcaagc tgatggccaa caagaccaga     1800

atcctggtga ccagcaagat ggagcacctg aagaaggctg acaagatcct gatcctgcat     1860

gagggcagca gctacttcta tggcaccttc tctgagctgc agaacctgca gcctgacttc     1920

agcagcaagc tgatgggctg tgacagcttt gaccagttct ctgctgagag aagaaacagc     1980

atcctgacag agaccctgca cagattcagc ctggaggggg atgcccctgt gagctggaca     2040

gagaccaaga agcagagctt caagcagaca ggggagtttg gggagaagag aaagaacagc     2100

atcctgaacc ccatcaacag caccctgcag gccagaagaa gacagtctgt gctgaacctg     2160

atgacccact ctgtgaacca gggccagaac atccacagaa agaccacagc cagcaccaga     2220

aaggtgagcc tggcccccca ggccaacctg acagagctgg acatctacag cagaagactg     2280

agccaggaga caggcctgga gatctctgag gagatcaatg aggaggacct gaaggagtgc     2340

ttctttgatg acatggagag catccctgct gtgaccacct ggaacaccta cctgagatac     2400

atcacagtgc acaagagcct gatctttgtg ctgatctggt gcctggtgat cttcctggct     2460

gaggtggctg ccagcctggt ggtgctgtgg ctgctgggca acacccccct gcaggacaag     2520

ggcaacagca cccacagcag aaacaacagc tatgctgtga tcatcaccag caccagcagc     2580

tactatgtgt tctacatcta tgtgggggtg gctgacaccc tgctggccat gggcttcttc     2640

agaggcctgc ccctggtgca caccctgatc acagtgagca agatcctgca ccacaagatg     2700

ctgcactctg tgctgcaggc ccccatgagc accctgaaca ccctgaaggc tgggggcatc     2760

ctgaacagat tcagcaagga cattgccatc ctggatgacc tgctgcccct gaccatcttt     2820

gacttcatcc agctgctgct gattgtgatt ggggccattg ctgtggtggc tgtgctgcag     2880

ccctacatct ttgtggccac agtgcctgtg attgtggcct tcatcatgct gagagcctac     2940

ttcctgcaga ccagccagca gctgaagcag ctggagtctg agggcagaag ccccatcttc     3000

acccacctgg tgaccagcct gaagggcctg tggaccctga gagcctttgg cagacagccc     3060

tactttgaga ccctgttcca caaggccctg aacctgcaca cagccaactg gttcctgtac     3120

ctgagcaccc tgagatggtt ccagatgaga attgagatga tctttgtgat cttcttcatt     3180

gctgtgacct tcatcagcat cctgaccaca ggggaggggg agggcagagt gggcatcatc     3240

ctgaccctgg ccatgaacat catgagcacc ctgcagtggg ctgtgaacag cagcattgat     3300

gtggacagcc tgatgagatc tgtgagcaga gtgttcaagt tcattgacat gcccacagag     3360

ggcaagccca ccaagagcac caagccctac aagaatggcc agctgagcaa ggtgatgatc     3420

attgagaaca gccatgtgaa gaaggatgac atctggccct ctgggggcca gatgacagtg     3480

aaggacctga cagccaagta cacagagggg ggcaatgcca tcctggagaa catcagcttc     3540

agcatcagcc ctggccagag agtgggcctg ctgggcagaa caggctctgg caagagcacc     3600

ctgctgtctg ccttcctgag actgctgaac acagaggggg agatccagat tgatggggtg     3660

agctgggaca gcatcaccct gcagcagtgg agaaaggcct ttggggtgat cccccagaag     3720

gtgttcatct tctctggcac cttcagaaag aacctggacc cctatgagca gtggtctgac     3780

caggagatct ggaaggtggc tgatgaggtg ggcctgagat ctgtgattga gcagttccct     3840

ggcaagctgg actttgtgct ggtggatggg ggctgtgtgc tgagccatgg ccacaagcag     3900

ctgatgtgcc tggccagatc tgtgctgagc aaggccaaga tcctgctgct ggatgagccc     3960

tctgcccacc tggaccctgt gacctaccag atcatcagaa gaaccctgaa gcaggccttt     4020

gctgactgca cagtgatcct gtgtgagcac agaattgagg ccatgctgga gtgccagcag     4080

ttcctggtga ttgaggagaa caaggtgaga cagtatgaca gcatccagaa gctgctgaat     4140

gagagaagcc tgttcagaca ggccatcagc ccctctgaca gagtgaagct gttcccccac     4200

agaaacagca gcaagtgcaa gagcaagccc cagattgctg ccctgaagga ggagaccgag     4260

gaggaggtgc aggacaccag actgtaa                                         4287


<210>  5
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GAA

<400>  5
atgggagtcc gccacccgcc ctgctcacat cgcctgcttg ctgtctgtgc cctcgtgtca       60

cttgctaccg ccgcgctgct tggtcacatt ctgctgcacg actttttact agttccgagg      120

gaactgtcgg gatccagccc cgtgctcgag gaaactcacc ccgcgcacca acagggggcg      180

tccaggccgg gaccgcgcga cgcccaggcc cacccgggcc ggcctcgggc cgtgccaact      240

cagtgcgatg tgccgccgaa ctcccgcttc gactgtgcgc ctgacaaggc cataacccag      300

gaacagtgcg aagcacgcgg ctgctgctat attccggcga agcagggctt gcagggtgcc      360

caaatgggtc agccttggtg cttctttccc ccgtcgtacc cctcgtacaa gctggagaac      420

ctgagcagca gcgaaatggg gtacaccgcc actctgaccc ggacgacccc gaccttcttc      480

ccgaaagaca tcctgaccct gcggctggat gtgatgatgg aaactgagaa cagactgcac      540

ttcactatca aggaccccgc gaaccgcaga tatgaggtgc cactggaaac ccctcatgtg      600

cattcccggg ccccatcccc tctgtactcg gtggaattct ccgaagaacc cttcggggtc      660

attgtgcgcc ggcagcttga tggccgggtc ctgctcaaca ccaccgtggc accccttttc      720

ttcgctgacc agttcctcca gctgagcacc tcgctgccga gccagtacat caccggactg      780

gccgagcacc tctcccctct gatgctgtcc actagctgga ctaggatcac tctgtggaac      840

cgggatctgg cccctacccc gggcgcgaac ctgtacggat cgcacccctt ctacctggcc      900

ctcgaggacg gaggctccgc ccacggagtg ttcctgctga actccaacgc tatggacgtg      960

gtgctccagc cgtcccctgc actgtcctgg cggagcacag ggggtattct ggatgtctac     1020

atcttcctcg gcccggagcc aaagtccgtg gtgcaacagt atctggatgt cgtgggttac     1080

ccattcatgc cgccatactg gggccttggc ttccacctgt gccgctgggg atacagctcc     1140

accgccatca ctagacaggt cgtggaaaac atgactagag cccacttccc cctcgatgtc     1200

cagtggaatg acctggacta catggattcc agacgcgact tcactttcaa caaggatgga     1260

ttcagagatt tccccgctat ggtccaagaa ctgcaccagg gtggccggcg gtacatgatg     1320

attgtggacc ccgccatttc aagctccgga ccagcgggct cgtaccggcc ctacgacgaa     1380

ggtttgcgcc gcggcgtgtt catcactaac gaaaccggcc agccactgat tgggaaggtc     1440

tggcctggaa gcaccgcgtt cccggacttc actaacccaa cggccttggc gtggtgggag     1500

gacatggtgg ccgaattcca cgaccaagtc ccattcgacg gaatgtggat cgacatgaac     1560

gagcccagca acttcatccg aggctccgag gacggctgcc ctaacaacga acttgagaac     1620

cctccgtacg tgcctggcgt cgtcggcgga acactgcagg ccgctacgat ctgtgcctca     1680

tcgcatcagt tcctgtcaac ccactacaac ctccataatc tgtacggcct caccgaagcc     1740

atcgcctccc accgggccct ggtcaaggcc cgggggacta ggcccttcgt gattagccgg     1800

agcactttcg ccggacacgg aagatacgcc ggacattgga ccggcgacgt gtggtcatcg     1860

tgggagcagc tcgcctcctc cgtccccgaa atcctgcagt tcaatctcct gggagtcccc     1920

ctcgtgggcg cggacgtgtg cggattcctg ggcaatacct ctgaggagct gtgcgtgaga     1980

tggacccagc tgggggcgtt ctaccccttc atgcggaacc acaactcact gctgtccctg     2040

cctcaagagc cgtactcatt ctccgagccg gcacaacagg ccatgcgaaa ggctctgacc     2100

ctccgctatg cgctcttgcc ccacctctac actctgtttc accaagccca tgtcgcgggc     2160

gaaacagtgg ccagaccact ctttctggaa ttcccaaagg actcctcaac ctggactgtg     2220

gatcatcagc tgctctgggg agaggcactg ctgatcaccc cggtgctcca agccggaaag     2280

gcggaagtga ccggatactt ccctctcggt acttggtacg acctccaaac cgtgccggtc     2340

gaggccctgg gcagcttgcc tccgccgccg gctgccccgc gggagcctgc aatccactcc     2400

gaggggcaat gggtgaccct ccctgcacca ctggacacca tcaacgtgca cctccgggcc     2460

ggctacatca tcccgctgca aggaccgggt ctgactacca ccgaatcccg gcagcagccc     2520

atggcactgg ccgtggccct gaccaaggga ggggaagcac ggggagaact cttttgggac     2580

gatggagaat ccctggaagt gctcgagcgg ggagcctaca ctcaagtcat ctttcttgcc     2640

cgcaacaaca ccatcgtgaa cgaattggtc cgcgtgacct ccgagggggc cggactccag     2700

ctgcaaaaag tgaccgtgct gggggtggca accgccccgc aacaagtgtt gtctaacgga     2760

gtgccggtgt ccaacttcac ctactcccct gataccaaag ttctagatat ttgcgtgagc     2820

ctgctgatgg gagaacagtt cctggtgtcc tggtgctga                            2859


<210>  6
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GAA codon-optimized nucleotide sequence 1 (GAA 15)

<400>  6
atgggagtcc gccacccgcc ctgctcacat cgcctgcttg ctgtctgtgc cctcgtgtca       60

cttgctaccg ccgcgctgct tggtcacatt ctgctgcacg actttttact agttccgagg      120

gaactgtcgg gatccagccc cgtgctcgag gaaactcacc ccgcgcacca acagggggcg      180

tccaggccgg gaccgcgcga cgcccaggcc cacccgggcc ggcctcgggc cgtgccaact      240

cagtgcgatg tgccgccgaa ctcccgcttc gactgtgcgc ctgacaaggc cataacccag      300

gaacagtgcg aagcacgcgg ctgctgctat attccggcga agcagggctt gcagggtgcc      360

caaatgggtc agccttggtg cttctttccc ccgtcgtacc cctcgtacaa gctggagaac      420

ctgagcagca gcgaaatggg gtacaccgcc actctgaccc ggacgacccc gaccttcttc      480

ccgaaagaca tcctgaccct gcggctggat gtgatgatgg aaactgagaa cagactgcac      540

ttcactatca aggaccccgc gaaccgcaga tatgaggtgc cactggaaac ccctcatgtg      600

cattcccggg ccccatcccc tctgtactcg gtggaattct ccgaagaacc cttcggggtc      660

attgtgcgcc ggcagcttga tggccgggtc ctgctcaaca ccaccgtggc accccttttc      720

ttcgctgacc agttcctcca gctgagcacc tcgctgccga gccagtacat caccggactg      780

gccgagcacc tctcccctct gatgctgtcc actagctgga ctaggatcac tctgtggaac      840

cgggatctgg cccctacccc gggcgcgaac ctgtacggat cgcacccctt ctacctggcc      900

ctcgaggacg gaggctccgc ccacggagtg ttcctgctga actccaacgc tatggacgtg      960

gtgctccagc cgtcccctgc actgtcctgg cggagcacag ggggtattct ggatgtctac     1020

atcttcctcg gcccggagcc aaagtccgtg gtgcaacagt atctggatgt cgtgggttac     1080

ccattcatgc cgccatactg gggccttggc ttccacctgt gccgctgggg atacagctcc     1140

accgccatca ctagacaggt cgtggaaaac atgactagag cccacttccc cctcgatgtc     1200

cagtggaatg acctggacta catggattcc agacgcgact tcactttcaa caaggatgga     1260

ttcagagatt tccccgctat ggtccaagaa ctgcaccagg gtggccggcg gtacatgatg     1320

attgtggacc ccgccatttc aagctccgga ccagcgggct cgtaccggcc ctacgacgaa     1380

ggtttgcgcc gcggcgtgtt catcactaac gaaaccggcc agccactgat tgggaaggtc     1440

tggcctggaa gcaccgcgtt cccggacttc actaacccaa cggccttggc gtggtgggag     1500

gacatggtgg ccgaattcca cgaccaagtc ccattcgacg gaatgtggat cgacatgaac     1560

gagcccagca acttcatccg aggctccgag gacggctgcc ctaacaacga acttgagaac     1620

cctccgtacg tgcctggcgt cgtcggcgga acactgcagg ccgctacgat ctgtgcctca     1680

tcgcatcagt tcctgtcaac ccactacaac ctccataatc tgtacggcct caccgaagcc     1740

atcgcctccc accgggccct ggtcaaggcc cgggggacta ggcccttcgt gattagccgg     1800

agcactttcg ccggacacgg aagatacgcc ggacattgga ccggcgacgt gtggtcatcg     1860

tgggagcagc tcgcctcctc cgtccccgaa atcctgcagt tcaatctcct gggagtcccc     1920

ctcgtgggcg cggacgtgtg cggattcctg ggcaatacct ctgaggagct gtgcgtgaga     1980

tggacccagc tgggggcgtt ctaccccttc atgcggaacc acaactcact gctgtccctg     2040

cctcaagagc cgtactcatt ctccgagccg gcacaacagg ccatgcgaaa ggctctgacc     2100

ctccgctatg cgctcttgcc ccacctctac actctgtttc accaagccca tgtcgcgggc     2160

gaaacagtgg ccagaccact ctttctggaa ttcccaaagg actcctcaac ctggactgtg     2220

gatcatcagc tgctctgggg agaggcactg ctgatcaccc cggtgctcca agccggaaag     2280

gcggaagtga ccggatactt ccctctcggt acttggtacg acctccaaac cgtgccggtc     2340

gaggccctgg gcagcttgcc tccgccgccg gctgccccgc gggagcctgc aatccactcc     2400

gaggggcaat gggtgaccct ccctgcacca ctggacacca tcaacgtgca cctccgggcc     2460

ggctacatca tcccgctgca aggaccgggt ctgactacca ccgaatcccg gcagcagccc     2520

atggcactgg ccgtggccct gaccaaggga ggggaagcac ggggagaact cttttgggac     2580

gatggagaat ccctggaagt gctcgagcgg ggagcctaca ctcaagtcat ctttcttgcc     2640

cgcaacaaca ccatcgtgaa cgaattggtc cgcgtgacct ccgagggggc cggactccag     2700

ctgcaaaaag tgaccgtgct gggggtggca accgccccgc aacaagtgtt gtctaacgga     2760

gtgccggtgt ccaacttcac ctactcccct gataccaaag ttctagatat ttgcgtgagc     2820

ctgctgatgg gagaacagtt cctggtgtcc tggtgctga                            2859


<210>  7
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GAA Codon-optimized 2 (GAA21)

<400>  7
atgggagtta gacaccctcc atgtagccac agactgctgg ccgtgtgtgc tctggtgtct       60

ctggctacag ctgccctgct gggacatatc ctgctgcacg acttcttact agttcccaga      120

gagctgtccg gcagcagccc tgtgctggaa gaaacacacc ctgcacatca gcagggcgcc      180

tctagacctg gacctagaga tgctcaggcc catcctggca gacctagagc tgtgcccaca      240

cagtgtgacg tgccacctaa cagcagattc gactgcgccc ctgacaaggc catcacacaa      300

gagcagtgtg aagccagagg ctgctgctac atccctgcca aacaaggact gcagggcgct      360

cagatgggac agccctggtg cttcttccca ccatcttacc ccagctacaa gctggaaaac      420

ctgagcagca gcgagatggg ctacaccgcc acactgacca gaaccacacc tacattcttc      480

ccgaaggaca tcctgacact gcggctggac gtgatgatgg aaaccgagaa ccggctgcac      540

ttcaccatca aggaccccgc caatcggaga tacgaggtgc cactggaaac ccctcacgtg      600

cactctagag ccccatctcc actgtacagc gtggaattca gcgaggaacc cttcggcgtg      660

atcgtgcgga gacagctgga tggaagagtg ctgctgaaca ccacagtggc ccctctgttc      720

ttcgccgacc agtttctgca gctgtccacc agcctgccta gccagtatat cacaggcctg      780

gccgagcacc tgtctccact gatgctgtct accagctgga cccggatcac cctgtggaac      840

agggatcttg ctcctacacc tggcgccaac ctgtacggct ctcacccttt ttatctggcc      900

ctggaagatg gcggatctgc ccacggtgtc tttctgctga actccaacgc catggacgtg      960

gtgctgcagc catctcctgc tctgtcttgg agaagcacag gcggcatcct ggacgtgtac     1020

atctttctgg gccccgagcc taagagcgtg gtgcagcagt atctggacgt cgtgggctac     1080

cccttcatgc ctccttattg gggcctgggc ttccacctgt gcagatgggg atacagcagc     1140

accgccatca ccagacaggt ggtggaaaac atgacccggg ctcacttccc actggatgtg     1200

cagtggaacg acctggacta catggacagc agacgggact tcaccttcaa caaggacggc     1260

ttcagagact tccccgccat ggtgcaagaa ctgcaccaag gcggcagacg gtacatgatg     1320

atcgtggatc cagccatcag ctctagcggc cctgccggct cttacagacc ttacgatgag     1380

ggcctgagaa gaggcgtgtt catcaccaac gagacaggcc agcctctgat cggcaaagtg     1440

tggcctggca gcacagcctt tccagacttc acaaacccca ccgctctggc ttggtgggaa     1500

gatatggtgg ccgagtttca cgatcaggtg cccttcgacg gcatgtggat cgacatgaac     1560

gagcccagca acttcatccg gggcagcgag gatggctgcc ccaacaacga actggaaaat     1620

cctccttacg tgcccggcgt tgtcggcgga acacttcagg ccgctacaat ctgtgccagc     1680

agccaccagt tcctcagcac ccactacaac ctgcacaatc tgtatggcct gaccgaggcc     1740

attgccagcc atagagccct ggttaaggcc aggggcacca gacctttcgt gatcagcaga     1800

agcaccttcg ccggccacgg cagatatgcc ggacattgga caggcgacgt gtggtctagt     1860

tgggagcagc tggctagcag cgtgccagag atcctgcagt tcaatctgct gggcgtgcca     1920

ctcgtgggag ccgatgtttg tggcttcctg ggcaacacct ccgaggaact gtgtgtgcgt     1980

tggacacagc tgggcgcctt ctatcccttc atgagaaacc acaacagcct tctcagcctg     2040

ccacaagagc cctacagctt ctctgagcct gcacagcagg ccatgagaaa ggccctgact     2100

ctgagatacg ctctgctgcc ccacctgtac accctgtttc accaggctca tgtggccggg     2160

gagacagtgg ctagacctct gttcctggaa ttccccaagg acagctccac ctggaccgtg     2220

gatcatcagc tgctgtgggg agaagccctg ctcatcacac ctgttctgca ggccggaaag     2280

gccgaagtga ccggctattt tcctctcggc acttggtacg acctgcagac cgtgcctgtt     2340

gaggctctgg gatctcttcc tccacctcct gccgctccta gagagcctgc cattcactct     2400

gaaggccagt gggttaccct gcctgctcct ctggacacca tcaacgtgca cctgagagct     2460

ggctacatca tccctctgca aggccctggc ctgacaacca ccgaatctag acagcagccc     2520

atggctctgg ccgtggcttt gacaaaaggc ggagaggcta gaggcgagct gttctgggat     2580

gatggcgaga gcctggaagt gctggaacgg ggcgcttata cccaagtgat cttcctggcc     2640

agaaacaaca ccatcgtgaa cgaactcgtg cgcgtgacca gtgaaggtgc tggactgcaa     2700

ctgcagaaag tgaccgtgct cggagtggcc acagcacctc agcaggttct gtctaatggc     2760

gtgcccgtgt ccaacttcac atacagcccc gacaccaagg tcctggacat ctgtgtgtca     2820

ctgctgatgg gcgagcagtt cctggtgtcc tggtgttga                            2859


<210>  8
<211>  952
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(952)
<223>  Acid Alpha-Glucosidase (GAA)

<400>  8

Met Gly Val Arg His Pro Pro Cys Ser His Arg Leu Leu Ala Val Cys 
1               5                   10                  15      


Ala Leu Val Ser Leu Ala Thr Ala Ala Leu Leu Gly His Ile Leu Leu 
            20                  25                  30          


His Asp Phe Leu Leu Val Pro Arg Glu Leu Ser Gly Ser Ser Pro Val 
        35                  40                  45              


Leu Glu Glu Thr His Pro Ala His Gln Gln Gly Ala Ser Arg Pro Gly 
    50                  55                  60                  


Pro Arg Asp Ala Gln Ala His Pro Gly Arg Pro Arg Ala Val Pro Thr 
65                  70                  75                  80  


Gln Cys Asp Val Pro Pro Asn Ser Arg Phe Asp Cys Ala Pro Asp Lys 
                85                  90                  95      


Ala Ile Thr Gln Glu Gln Cys Glu Ala Arg Gly Cys Cys Tyr Ile Pro 
            100                 105                 110         


Ala Lys Gln Gly Leu Gln Gly Ala Gln Met Gly Gln Pro Trp Cys Phe 
        115                 120                 125             


Phe Pro Pro Ser Tyr Pro Ser Tyr Lys Leu Glu Asn Leu Ser Ser Ser 
    130                 135                 140                 


Glu Met Gly Tyr Thr Ala Thr Leu Thr Arg Thr Thr Pro Thr Phe Phe 
145                 150                 155                 160 


Pro Lys Asp Ile Leu Thr Leu Arg Leu Asp Val Met Met Glu Thr Glu 
                165                 170                 175     


Asn Arg Leu His Phe Thr Ile Lys Asp Pro Ala Asn Arg Arg Tyr Glu 
            180                 185                 190         


Val Pro Leu Glu Thr Pro His Val His Ser Arg Ala Pro Ser Pro Leu 
        195                 200                 205             


Tyr Ser Val Glu Phe Ser Glu Glu Pro Phe Gly Val Ile Val Arg Arg 
    210                 215                 220                 


Gln Leu Asp Gly Arg Val Leu Leu Asn Thr Thr Val Ala Pro Leu Phe 
225                 230                 235                 240 


Phe Ala Asp Gln Phe Leu Gln Leu Ser Thr Ser Leu Pro Ser Gln Tyr 
                245                 250                 255     


Ile Thr Gly Leu Ala Glu His Leu Ser Pro Leu Met Leu Ser Thr Ser 
            260                 265                 270         


Trp Thr Arg Ile Thr Leu Trp Asn Arg Asp Leu Ala Pro Thr Pro Gly 
        275                 280                 285             


Ala Asn Leu Tyr Gly Ser His Pro Phe Tyr Leu Ala Leu Glu Asp Gly 
    290                 295                 300                 


Gly Ser Ala His Gly Val Phe Leu Leu Asn Ser Asn Ala Met Asp Val 
305                 310                 315                 320 


Val Leu Gln Pro Ser Pro Ala Leu Ser Trp Arg Ser Thr Gly Gly Ile 
                325                 330                 335     


Leu Asp Val Tyr Ile Phe Leu Gly Pro Glu Pro Lys Ser Val Val Gln 
            340                 345                 350         


Gln Tyr Leu Asp Val Val Gly Tyr Pro Phe Met Pro Pro Tyr Trp Gly 
        355                 360                 365             


Leu Gly Phe His Leu Cys Arg Trp Gly Tyr Ser Ser Thr Ala Ile Thr 
    370                 375                 380                 


Arg Gln Val Val Glu Asn Met Thr Arg Ala His Phe Pro Leu Asp Val 
385                 390                 395                 400 


Gln Trp Asn Asp Leu Asp Tyr Met Asp Ser Arg Arg Asp Phe Thr Phe 
                405                 410                 415     


Asn Lys Asp Gly Phe Arg Asp Phe Pro Ala Met Val Gln Glu Leu His 
            420                 425                 430         


Gln Gly Gly Arg Arg Tyr Met Met Ile Val Asp Pro Ala Ile Ser Ser 
        435                 440                 445             


Ser Gly Pro Ala Gly Ser Tyr Arg Pro Tyr Asp Glu Gly Leu Arg Arg 
    450                 455                 460                 


Gly Val Phe Ile Thr Asn Glu Thr Gly Gln Pro Leu Ile Gly Lys Val 
465                 470                 475                 480 


Trp Pro Gly Ser Thr Ala Phe Pro Asp Phe Thr Asn Pro Thr Ala Leu 
                485                 490                 495     


Ala Trp Trp Glu Asp Met Val Ala Glu Phe His Asp Gln Val Pro Phe 
            500                 505                 510         


Asp Gly Met Trp Ile Asp Met Asn Glu Pro Ser Asn Phe Ile Arg Gly 
        515                 520                 525             


Ser Glu Asp Gly Cys Pro Asn Asn Glu Leu Glu Asn Pro Pro Tyr Val 
    530                 535                 540                 


Pro Gly Val Val Gly Gly Thr Leu Gln Ala Ala Thr Ile Cys Ala Ser 
545                 550                 555                 560 


Ser His Gln Phe Leu Ser Thr His Tyr Asn Leu His Asn Leu Tyr Gly 
                565                 570                 575     


Leu Thr Glu Ala Ile Ala Ser His Arg Ala Leu Val Lys Ala Arg Gly 
            580                 585                 590         


Thr Arg Pro Phe Val Ile Ser Arg Ser Thr Phe Ala Gly His Gly Arg 
        595                 600                 605             


Tyr Ala Gly His Trp Thr Gly Asp Val Trp Ser Ser Trp Glu Gln Leu 
    610                 615                 620                 


Ala Ser Ser Val Pro Glu Ile Leu Gln Phe Asn Leu Leu Gly Val Pro 
625                 630                 635                 640 


Leu Val Gly Ala Asp Val Cys Gly Phe Leu Gly Asn Thr Ser Glu Glu 
                645                 650                 655     


Leu Cys Val Arg Trp Thr Gln Leu Gly Ala Phe Tyr Pro Phe Met Arg 
            660                 665                 670         


Asn His Asn Ser Leu Leu Ser Leu Pro Gln Glu Pro Tyr Ser Phe Ser 
        675                 680                 685             


Glu Pro Ala Gln Gln Ala Met Arg Lys Ala Leu Thr Leu Arg Tyr Ala 
    690                 695                 700                 


Leu Leu Pro His Leu Tyr Thr Leu Phe His Gln Ala His Val Ala Gly 
705                 710                 715                 720 


Glu Thr Val Ala Arg Pro Leu Phe Leu Glu Phe Pro Lys Asp Ser Ser 
                725                 730                 735     


Thr Trp Thr Val Asp His Gln Leu Leu Trp Gly Glu Ala Leu Leu Ile 
            740                 745                 750         


Thr Pro Val Leu Gln Ala Gly Lys Ala Glu Val Thr Gly Tyr Phe Pro 
        755                 760                 765             


Leu Gly Thr Trp Tyr Asp Leu Gln Thr Val Pro Val Glu Ala Leu Gly 
    770                 775                 780                 


Ser Leu Pro Pro Pro Pro Ala Ala Pro Arg Glu Pro Ala Ile His Ser 
785                 790                 795                 800 


Glu Gly Gln Trp Val Thr Leu Pro Ala Pro Leu Asp Thr Ile Asn Val 
                805                 810                 815     


His Leu Arg Ala Gly Tyr Ile Ile Pro Leu Gln Gly Pro Gly Leu Thr 
            820                 825                 830         


Thr Thr Glu Ser Arg Gln Gln Pro Met Ala Leu Ala Val Ala Leu Thr 
        835                 840                 845             


Lys Gly Gly Glu Ala Arg Gly Glu Leu Phe Trp Asp Asp Gly Glu Ser 
    850                 855                 860                 


Leu Glu Val Leu Glu Arg Gly Ala Tyr Thr Gln Val Ile Phe Leu Ala 
865                 870                 875                 880 


Arg Asn Asn Thr Ile Val Asn Glu Leu Val Arg Val Thr Ser Glu Gly 
                885                 890                 895     


Ala Gly Leu Gln Leu Gln Lys Val Thr Val Leu Gly Val Ala Thr Ala 
            900                 905                 910         


Pro Gln Gln Val Leu Ser Asn Gly Val Pro Val Ser Asn Phe Thr Tyr 
        915                 920                 925             


Ser Pro Asp Thr Lys Val Leu Asp Ile Cys Val Ser Leu Leu Met Gly 
    930                 935                 940                 


Glu Gln Phe Leu Val Ser Trp Cys 
945                 950         


<210>  9
<211>  1290
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GLA

<400>  9
atgcagctga ggaacccaga actacatctg ggctgcgcgc ttgcgcttcg cttcctggcc       60

ctcgtttcct gggacatccc tggggctaga gcactggaca atggattggc aaggacgcct      120

accatgggct ggctgcactg ggagcgcttc atgtgcaacc ttgactgcca ggaagagcca      180

gattcctgca tcagtgagaa gctcttcatg gagatggcag agctcatggt ctcagaaggc      240

tggaaggatg caggttatga gtacctctgc attgatgact gttggatggc tccccaaaga      300

gattcagaag gcagacttca ggcagaccct cagcgctttc ctcatgggat tcgccagcta      360

gctaattatg ttcacagcaa aggactgaag ctagggattt atgcagatgt tggaaataaa      420

acctgcgcag gcttccctgg gagttttgga tactacgaca ttgatgccca gacctttgct      480

gactggggag tagatctgct aaaatttgat ggttgttact gtgacagttt ggaaaatttg      540

gcagatggtt ataagcacat gtccttggcc ctgaatagga ctggcagaag cattgtgtac      600

tcctgtgagt ggcctcttta tatgtggccc tttcaaaagc ccaattatac agaaatccga      660

cagtactgca atcactggcg aaattttgct gacattgatg attcctggaa aagtataaag      720

agtatcttgg actggacatc ttttaaccag gagagaattg ttgatgttgc tggaccaggg      780

ggttggaatg acccagatat gttagtgatt ggcaactttg gcctcagctg gaatcagcaa      840

gtaactcaga tggccctctg ggctatcatg gctgctcctt tattcatgtc taatgacctc      900

cgacacatca gccctcaagc caaagctctc cttcaggata aggacgtaat tgccatcaat      960

caggacccct tgggcaagca agggtaccag cttagacagg gagacaactt tgaagtgtgg     1020

gaacgacctc tctcaggctt agcctgggct gtagctatga taaaccggca ggagattggt     1080

ggacctcgct cttataccat cgcagttgct tccctgggta aaggagtggc ctgtaatcct     1140

gcctgcttca tcacacagct cctccctgtg aaaaggaagc tagggttcta tgaatggact     1200

tcaaggttaa gaagtcacat aaatcccaca ggcactgttt tgcttcagct agaaaataca     1260

atgcagatgt cattaaaaga cttactttaa                                      1290


<210>  10
<211>  1290
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GLA codon-optimized

<400>  10
atgcagctga gaaatcctga actgcacctg ggctgtgccc tggctctgag atttctggct       60

ctggtgtcct gggacattcc tggcgctaga gccctggata atggcctggc cagaacacct      120

acaatgggct ggctgcactg ggagagattc atgtgcaacc tggactgcca agaggaaccc      180

gacagctgca tcagcgagaa gctgttcatg gaaatggccg agctgatggt gtccgaaggc      240

tggaaggatg ccggctacga gtacctgtgc atcgacgatt gctggatggc ccctcagaga      300

gattctgagg gcagactgca ggccgatcct cagagatttc ctcacggaat ccggcagctg      360

gccaactacg tgcactctaa gggactgaag ctgggcatct acgccgacgt gggcaacaag      420

acatgtgccg gctttccagg cagcttcggc tactacgata tcgacgccca gacctttgcc      480

gattggggcg tcgacctgct gaagttcgat ggctgctact gcgacagcct ggaaaacctg      540

gccgacggct acaaacacat gtctctggcc ctgaaccgga ccggcagatc tatcgtgtac      600

tcttgcgagt ggcccctgta catgtggccc ttccagaagc ctaactacac cgagatcaga      660

cagtactgca accactggcg gaacttcgcc gacatcgatg acagctggaa gtccatcaag      720

agcatcctgg actggaccag cttcaatcaa gagcggatcg tggatgtggc tggcccaggc      780

ggatggaacg atcctgatat gctggtcatc ggcaacttcg gcctgagctg gaatcagcaa      840

gtgacccaga tggccctgtg ggccattatg gccgctcctc tgttcatgag caacgacctg      900

agacacatca gccctcaggc caaggctctg ctgcaggata aggacgtgat cgccatcaac      960

caggatcctc tgggcaagca gggctatcag ctgagacagg gcgacaattt cgaagtgtgg     1020

gaaagacctc tgagcggcct ggcttgggcc gtcgccatga tcaatagaca agagatcggc     1080

ggaccccggt cctatacaat tgccgtggct tctctcggaa aaggcgtggc ctgcaatcct     1140

gcctgcttta tcacacagct gctccccgtg aagagaaagc tgggctttta cgagtggacc     1200

agcagactga gatcccacat caaccccaca ggcactgttc tgctgcaact ggaaaacaca     1260

atgcagatga gcctgaagga cctgctgtag                                      1290


<210>  11
<211>  429
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GLA

<400>  11

Met Gln Leu Arg Asn Pro Glu Leu His Leu Gly Cys Ala Leu Ala Leu 
1               5                   10                  15      


Arg Phe Leu Ala Leu Val Ser Trp Asp Ile Pro Gly Ala Arg Ala Leu 
            20                  25                  30          


Asp Asn Gly Leu Ala Arg Thr Pro Thr Met Gly Trp Leu His Trp Glu 
        35                  40                  45              


Arg Phe Met Cys Asn Leu Asp Cys Gln Glu Glu Pro Asp Ser Cys Ile 
    50                  55                  60                  


Ser Glu Lys Leu Phe Met Glu Met Ala Glu Leu Met Val Ser Glu Gly 
65                  70                  75                  80  


Trp Lys Asp Ala Gly Tyr Glu Tyr Leu Cys Ile Asp Asp Cys Trp Met 
                85                  90                  95      


Ala Pro Gln Arg Asp Ser Glu Gly Arg Leu Gln Ala Asp Pro Gln Arg 
            100                 105                 110         


Phe Pro His Gly Ile Arg Gln Leu Ala Asn Tyr Val His Ser Lys Gly 
        115                 120                 125             


Leu Lys Leu Gly Ile Tyr Ala Asp Val Gly Asn Lys Thr Cys Ala Gly 
    130                 135                 140                 


Phe Pro Gly Ser Phe Gly Tyr Tyr Asp Ile Asp Ala Gln Thr Phe Ala 
145                 150                 155                 160 


Asp Trp Gly Val Asp Leu Leu Lys Phe Asp Gly Cys Tyr Cys Asp Ser 
                165                 170                 175     


Leu Glu Asn Leu Ala Asp Gly Tyr Lys His Met Ser Leu Ala Leu Asn 
            180                 185                 190         


Arg Thr Gly Arg Ser Ile Val Tyr Ser Cys Glu Trp Pro Leu Tyr Met 
        195                 200                 205             


Trp Pro Phe Gln Lys Pro Asn Tyr Thr Glu Ile Arg Gln Tyr Cys Asn 
    210                 215                 220                 


His Trp Arg Asn Phe Ala Asp Ile Asp Asp Ser Trp Lys Ser Ile Lys 
225                 230                 235                 240 


Ser Ile Leu Asp Trp Thr Ser Phe Asn Gln Glu Arg Ile Val Asp Val 
                245                 250                 255     


Ala Gly Pro Gly Gly Trp Asn Asp Pro Asp Met Leu Val Ile Gly Asn 
            260                 265                 270         


Phe Gly Leu Ser Trp Asn Gln Gln Val Thr Gln Met Ala Leu Trp Ala 
        275                 280                 285             


Ile Met Ala Ala Pro Leu Phe Met Ser Asn Asp Leu Arg His Ile Ser 
    290                 295                 300                 


Pro Gln Ala Lys Ala Leu Leu Gln Asp Lys Asp Val Ile Ala Ile Asn 
305                 310                 315                 320 


Gln Asp Pro Leu Gly Lys Gln Gly Tyr Gln Leu Arg Gln Gly Asp Asn 
                325                 330                 335     


Phe Glu Val Trp Glu Arg Pro Leu Ser Gly Leu Ala Trp Ala Val Ala 
            340                 345                 350         


Met Ile Asn Arg Gln Glu Ile Gly Gly Pro Arg Ser Tyr Thr Ile Ala 
        355                 360                 365             


Val Ala Ser Leu Gly Lys Gly Val Ala Cys Asn Pro Ala Cys Phe Ile 
    370                 375                 380                 


Thr Gln Leu Leu Pro Val Lys Arg Lys Leu Gly Phe Tyr Glu Trp Thr 
385                 390                 395                 400 


Ser Arg Leu Arg Ser His Ile Asn Pro Thr Gly Thr Val Leu Leu Gln 
                405                 410                 415     


Leu Glu Asn Thr Met Gln Met Ser Leu Lys Asp Leu Leu 
            420                 425                 


<210>  12
<211>  1317
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CLN3

<400>  12
atgggaggct gtgcaggctc gcggcggcgc ttttcggatt ccgaggggga ggagaccgtc       60

ccggagcccc ggctccctct gttggaccat cagggcgcgc attggaagaa cgcggtgggc      120

ttctggctgc tgggcctttg caacaacttc tcttatgtgg tgatgctgag tgccgcccac      180

gacatcctta gccacaagag gacatcggga aaccagagcc atgtggaccc aggcccaacg      240

ccgatccccc acaacagctc atcacgattt gactgcaact ctgtctctac ggctgctgtg      300

ctcctggcgg acatcctccc cacactcgtc atcaaattgt tggctcctct tggccttcac      360

ctgctgccct acagcccccg ggttctcgtc agtgggattt gtgctgctgg aagcttcgtc      420

ctggttgcct tttctcattc tgtggggacc agcctgtgtg gtgtggtctt cgctagcatc      480

tcatcaggcc ttggggaggt caccttcctc tccctcactg ccttctaccc cagggccgtg      540

atctcctggt ggtcctcagg gactggggga gctgggctgc tgggggccct gtcctacctg      600

ggcctcaccc aggccggcct ctcccctcag cagaccctgc tgtccatgct gggtatccct      660

gccctgctgc tggccagcta tttcttgttg ctcacatctc ctgaggccca ggaccctgga      720

ggggaagaag aagcagagag cgcagcccgg cagcccctca taagaaccga ggccccggag      780

tcgaagccag gctccagctc cagcctctcc cttcgggaaa ggtggacagt gttcaagggt      840

ctgctgtggt acattgttcc cttggtcgta gtttactttg ccgagtattt cattaaccag      900

ggactttttg aactcctctt tttctggaac acttccctga gtcacgctca gcaataccgc      960

tggtaccaga tgctgtacca ggctggcgtc tttgcctccc gctcttctct ccgctgctgt     1020

cgcatccgtt tcacctgggc cctggccctg ctgcagtgcc tcaacctggt gttcctgctg     1080

gcagacgtgt ggttcggctt tctgccaagc atctacctcg tcttcctgat cattctgtat     1140

gaggggctcc tgggaggcgc agcctacgtg aacaccttcc acaacatcgc cctggagacc     1200

agtgatgagc accgggagtt tgcaatggcg gccacctgca tctctgacac actggggatc     1260

tccctgtcgg ggctcctggc tttgcctctg catgacttcc tctgccagct ctcctga        1317


<210>  13
<211>  1318
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CLN3 codon-optimized

<400>  13
atgggaggat gtgctgggtc aagaagacgg tttagcgatt ccgaaggaga ggagactgtg       60

cctgagccaa gactgcccct gctggatcac cagggagcac actggaagaa cgcagtggga      120

ttctggctgc tgggcctgtg caacaacttc agctacgtgg tcatgctgtc cgccgcccac      180

gacatcctgt cccacaagcg gacctccggc aatcagtctc acgtggaccc cggccctaca      240

ccaatccccc acaacagcag cagccggttc gactgtaatt ccgtgtctac cgcagccgtg      300

ctgctggcag acatcctgcc caccctggtc atcaagctgc tggcaccact gggcctgcac      360

ctgctgcctt attctccaag ggtgctggtg agcggcatct gcgcagcagg cagcttcgtg      420

ctggtggcct ttagccactc cgtgggcacc tctctgtgcg gagtggtgtt tgcaagcatc      480

agctccggcc tgggagaggt gaccttcctg agcctgacag ccttttaccc tcgcgccgtg      540

atctcctggt ggtctagcgg cacaggagga gcaggcctgc tgggcgccct gtcctatctg      600

ggcctgaccc aggcaggcct gtccccacag cagacactgc tgtctatgct gggcatccct      660

gccctgctgc tggcaagcta cttcctgctg ctgacctccc cagaggcaca ggaccccgga      720

ggagaggagg aggccgagag cgccgcaagg cagccactga tcaggaccga ggcaccagag      780

tccaagcctg gctcctctag ctccctgtct ctgcgggaga gatggacagt gttcaagggc      840

ctgctgtggt acatcgtgcc cctggtggtg gtgtacttcg ccgagtactt catcaaccag      900

ggcctgtttg agctgctgtt cttttggaat acctctctga gccacgccca gcagtaccgg      960

tggtatcaga tgctgtatca ggcaggcgtg ttcgcctccc ggtctagcct gagatgctgt     1020

cggatcagat tcacctgggc actggccctg ctgcagtgcc tgaacctggt gttcctgctg     1080

gccgacgtgt ggttcggctt tctgccctct atctacctgg tgtttctgat catcctgtat     1140

gagggcctgc tgggaggagc agcctatgtg aacaccttcc acaatatcgc cctggagaca     1200

tctgacgagc acagagagtt tgctatggcc gccacctgta tcagcgatac actgggcatc     1260

tctctgagcg gactgctggc tctgcctctg catgactttc tgtgccagct gagttaat       1318


<210>  14
<211>  438
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(438)
<223>  Ceroid Lipofuscinosis, Neuronal 3 (CLN3)

<400>  14

Met Gly Gly Cys Ala Gly Ser Arg Arg Arg Phe Ser Asp Ser Glu Gly 
1               5                   10                  15      


Glu Glu Thr Val Pro Glu Pro Arg Leu Pro Leu Leu Asp His Gln Gly 
            20                  25                  30          


Ala His Trp Lys Asn Ala Val Gly Phe Trp Leu Leu Gly Leu Cys Asn 
        35                  40                  45              


Asn Phe Ser Tyr Val Val Met Leu Ser Ala Ala His Asp Ile Leu Ser 
    50                  55                  60                  


His Lys Arg Thr Ser Gly Asn Gln Ser His Val Asp Pro Gly Pro Thr 
65                  70                  75                  80  


Pro Ile Pro His Asn Ser Ser Ser Arg Phe Asp Cys Asn Ser Val Ser 
                85                  90                  95      


Thr Ala Ala Val Leu Leu Ala Asp Ile Leu Pro Thr Leu Val Ile Lys 
            100                 105                 110         


Leu Leu Ala Pro Leu Gly Leu His Leu Leu Pro Tyr Ser Pro Arg Val 
        115                 120                 125             


Leu Val Ser Gly Ile Cys Ala Ala Gly Ser Phe Val Leu Val Ala Phe 
    130                 135                 140                 


Ser His Ser Val Gly Thr Ser Leu Cys Gly Val Val Phe Ala Ser Ile 
145                 150                 155                 160 


Ser Ser Gly Leu Gly Glu Val Thr Phe Leu Ser Leu Thr Ala Phe Tyr 
                165                 170                 175     


Pro Arg Ala Val Ile Ser Trp Trp Ser Ser Gly Thr Gly Gly Ala Gly 
            180                 185                 190         


Leu Leu Gly Ala Leu Ser Tyr Leu Gly Leu Thr Gln Ala Gly Leu Ser 
        195                 200                 205             


Pro Gln Gln Thr Leu Leu Ser Met Leu Gly Ile Pro Ala Leu Leu Leu 
    210                 215                 220                 


Ala Ser Tyr Phe Leu Leu Leu Thr Ser Pro Glu Ala Gln Asp Pro Gly 
225                 230                 235                 240 


Gly Glu Glu Glu Ala Glu Ser Ala Ala Arg Gln Pro Leu Ile Arg Thr 
                245                 250                 255     


Glu Ala Pro Glu Ser Lys Pro Gly Ser Ser Ser Ser Leu Ser Leu Arg 
            260                 265                 270         


Glu Arg Trp Thr Val Phe Lys Gly Leu Leu Trp Tyr Ile Val Pro Leu 
        275                 280                 285             


Val Val Val Tyr Phe Ala Glu Tyr Phe Ile Asn Gln Gly Leu Phe Glu 
    290                 295                 300                 


Leu Leu Phe Phe Trp Asn Thr Ser Leu Ser His Ala Gln Gln Tyr Arg 
305                 310                 315                 320 


Trp Tyr Gln Met Leu Tyr Gln Ala Gly Val Phe Ala Ser Arg Ser Ser 
                325                 330                 335     


Leu Arg Cys Cys Arg Ile Arg Phe Thr Trp Ala Leu Ala Leu Leu Gln 
            340                 345                 350         


Cys Leu Asn Leu Val Phe Leu Leu Ala Asp Val Trp Phe Gly Phe Leu 
        355                 360                 365             


Pro Ser Ile Tyr Leu Val Phe Leu Ile Ile Leu Tyr Glu Gly Leu Leu 
    370                 375                 380                 


Gly Gly Ala Ala Tyr Val Asn Thr Phe His Asn Ile Ala Leu Glu Thr 
385                 390                 395                 400 


Ser Asp Glu His Arg Glu Phe Ala Met Ala Ala Thr Cys Ile Ser Asp 
                405                 410                 415     


Thr Leu Gly Ile Ser Leu Ser Gly Leu Leu Ala Leu Pro Leu His Asp 
            420                 425                 430         


Phe Leu Cys Gln Leu Ser 
        435             


<210>  15
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV204 VP1

<400>  15
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcacca caagagccag actcctcctc gggcatcggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc      900

atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa      960

gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg     1020

gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag     1080

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg     1140

ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca     1200

tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct     1260

ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac     1320

cagtacctgt attacctgaa cagaactcag aatcagtccg gaagtgccca aaacaaggac     1380

ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct     1440

ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac     1500

tttacctgga caggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct     1560

ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc     1620

atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc     1680

acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg     1740

gcagtcaatc tccagaacag cagcacagac cctgcgaccg gagatgtgca tgttatggga     1800

gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc     1860

aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt     1920

aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca     1980

gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc     2040

gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag     2100

tatacatcta actatgcaaa atctgccaac gttgatttca ctgtagacaa caatggactt     2160

tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  16
<211>  1605
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV204 VP3

<400>  16
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgaa catgggcctt gcccacctat aacaaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgatttca acagattcca ctgccatttc tcaccacgtg actggcagcg actcatcaac      300

aacaattggg gattccggcc caagagactc aacttcaagc tcttcaacat ccaagtcaag      360

gaggtcacga cgaatgatgg cgtcacgacc atcgctaata accttaccag cacggttcaa      420

gtcttctcgg actcggagta ccagttgccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

aatggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaataa ctttaccttc agctacacct tcgaggacgt gcctttccac      660

agcagctacg cgcacagcca gagcctggac cggctgatga atcctctcat cgaccagtac      720

ctgtattacc tgaacagaac tcagaatcag tccggaagtg cccaaaacaa ggacttgctg      780

tttagccggg ggtctccagc tggcatgtct gttcagccca aaaactggct acctggaccc      840

tgttaccggc agcagcgcgt ttctaaaaca aaaacagaca acaacaacag caactttacc      900

tggacaggtg cttcaaaata taaccttaat gggcgtgaat ctataatcaa ccctggcact      960

gctatggcct cacacaaaga cgacaaagac aagttctttc ccatgagcgg tgtcatgatt     1020

tttggaaagg agagcgccgg agcttcaaac actgcattgg acaatgtcat gatcacagac     1080

gaagaggaaa tcaaagccac taaccccgtg gccaccgaaa gatttgggac tgtggcagtc     1140

aatctccaga acagcagcac agaccctgcg accggagatg tgcatgttat gggagcctta     1200

cctggaatgg tgtggcaaga cagagacgta tacctgcagg gtcctatttg ggccaaaatt     1260

cctcacacgg atggacactt tcacccgtct cctctcatgg gcggctttgg acttaagcac     1320

ccgcctcctc agatcctcat caaaaacacg cctgttcctg cgaatcctcc ggcagagttt     1380

tcggctacaa agtttgcttc attcatcacc cagtattcca caggacaagt gagcgtggag     1440

attgaatggg agctgcagaa agaaaacagc aaacgctgga atcccgaagt gcagtataca     1500

tctaactatg caaaatctgc caacgttgat ttcactgtag acaacaatgg actttatact     1560

gagcctcgcc ccattggcac ccgttacctc acccgtcccc tgtaa                     1605


<210>  17
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV204 VP3

<400>  17

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Thr Asn Asp Gly Val 
        115                 120                 125             


Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Ser Asp 
    130                 135                 140                 


Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Asn Arg Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn 
                245                 250                 255     


Lys Asp Leu Leu Phe Ser Arg Gly Ser Pro Ala Gly Met Ser Val Gln 
            260                 265                 270         


Pro Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Lys Thr Lys Thr Asp Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala 
    290                 295                 300                 


Ser Lys Tyr Asn Leu Asn Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Asp Asp Lys Asp Lys Phe Phe Pro Met Ser 
                325                 330                 335     


Gly Val Met Ile Phe Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala 
            340                 345                 350         


Leu Asp Asn Val Met Ile Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Arg Phe Gly Thr Val Ala Val Asn Leu Gln Asn 
    370                 375                 380                 


Ser Ser Thr Asp Pro Ala Thr Gly Asp Val His Val Met Gly Ala Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly His Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys 
    450                 455                 460                 


Phe Ala Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Val Gln Tyr Thr Ser Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr 
            500                 505                 510         


Val Asp Asn Asn Gly Leu Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Pro Leu 
    530                 


<210>  18
<211>  2208
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ITB102 214 (AAV214) VP1

<400>  18
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg acttcaacag attccactgc cacttttcac cacgtgactg gcaaagactc      900

atcaacaaca actggggatt ccgacccaag agactcaact tcaagctctt taacattcaa      960

gtcaaagagg ttacggacaa caatggagtc aagaccatcg ccaataacct taccagcacg     1020

gtccaggtct tcacggactc agactatcag ctcccgtacg tcctcggctc tgcgcaccag     1080

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg     1140

ctcaacgacg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca     1200

tcgcagatgc tgagaacggg caacaacttt accttcagct acacctttga ggacgttcct     1260

ttccacagca gctacgctca cagccagagt ctggaccgtc tcatgaatcc tctgattgac     1320

cagtacctgt actacttgtc taagactatc aacggatccg gccagaatca gcagactctg     1380

aagttcagcc aaggtgggcc taatacaatg gccaatcagg caaagaactg gctgccagga     1440

ccctgttacc gccaacaacg cgtctcaacg acaaccgggc aaaacaacaa tagcaacttt     1500

gcctggactg ctgggaccaa ataccatctg aatggaagaa attcattgat gaatcctggc     1560

cccgctatgg catcccacaa agagggcgag gaccgttttt ttcccctgtc cgggtccctg     1620

atttttggca aacaaaatgc tgccagagac aatgcggatt acagcgatgt catgctcacc     1680

agcgaggaag aaatcaaaac cactaaccct gtggctacag aggaatacgg tatcgtggca     1740

gataacttgc agcagcaaaa cacggctcct caaattggaa ctgtcaacag ccagggggcc     1800

ttacccggta tggtctggca gaaccgggac gtgtacctgc agggtcccat ctgggccaag     1860

attcctcaca cggacggcaa cttccacccg tctccgctga tgggcggctt tggcctgaaa     1920

catcctccgc ctcagatcct gatcaagaac acgcctgtac ctgcggatcc tccgaccacc     1980

ttcaaccagt caaagctgaa ctctttcatc acgcaataca gcaccggaca ggtcagcgtg     2040

gaaattgaat gggagctgca gaaggaaaac agcaagcgct ggaaccccga gatccagtac     2100

acctccaact actacaaatc tacaagtgtg gactttgctg ttaatacaga aggcgtgtac     2160

tctgaacccc accccattgg cacccgttac ctcacccgtc ccctgtaa                  2208


<210>  19
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214-A VP1

<400>  19
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccactttt caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaagct ctttaacatt      960

caagtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaacg acggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggacgtt     1260

cctttccaca gcagctacgc tcacagccag agtctggacc gtctcatgaa tcctctgatt     1320

gaccagtacc tgtactactt gtctaagact atcaacggat ccggccagaa tcagcagact     1380

ctgaagttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaaaa tgctgccaga gacaatgcgg attacagcga tgtcatgctc     1680

accagcgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  20
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e VP1

<400>  20
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagtcc ccgacccaca acctctcgga gaacctccag caacccccgc tgctgtggga      600

cctactacaa tggcttcagg cggtggcgca ccaatggcag acaataacga aggcgccgac      660

ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgcac ctgggccttg cccacctaca ataaccacct ctacaagcaa      780

atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccactttt caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaagct ctttaacatt      960

caagtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaacg acggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggacgtt     1260

cctttccaca gcagctacgc tcacagccag agtctggacc gtctcatgaa tcctctgatt     1320

gaccagtacc tgtactactt gtctaagact atcaacggat ccggccagaa tcagcagact     1380

ctgaagttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaaaa tgctgccaga gacaatgcgg attacagcga tgtcatgctc     1680

accagcgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  21
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e8 VP1

<400>  21
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagttc cagaccctca acctctcgga gaacctccag cagcgccctc tggtgtggga      600

cctaatacaa tggcttcagg cggtggcgca ccaatggcgg acaataacga aggcgccgac      660

ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgcac ctgggccttg cccacctaca ataaccacct ctacaagcaa      780

atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccactttt caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaagct ctttaacatt      960

caagtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaacg acggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggacgtt     1260

cctttccaca gcagctacgc tcacagccag agtctggacc gtctcatgaa tcctctgatt     1320

gaccagtacc tgtactactt gtctaagact atcaacggat ccggccagaa tcagcagact     1380

ctgaagttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaaaa tgctgccaga gacaatgcgg attacagcga tgtcatgctc     1680

accagcgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  22
<211>  2208
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e9 VP1

<400>  22
atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga aggaattcgc       60

gagtggtggg ctttgaaacc tggagcccct caacccaagg caaatcaaca acatcaagac      120

aacgctcgag gtcttgtgct tccgggttac aaataccttg gacccggcaa cggactcgac      180

aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctccc caggaaccgg actcctccgc gggtattggc      480

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag      540

tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctctgg tgtgggatct      600

cttacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg acttcaacag attccactgc cacttttcac cacgtgactg gcaaagactc      900

atcaacaaca actggggatt ccgacccaag agactcaact tcaagctctt taacattcaa      960

gtcaaagagg ttacggacaa caatggagtc aagaccatcg ccaataacct taccagcacg     1020

gtccaggtct tcacggactc agactatcag ctcccgtacg tcctcggctc tgcgcaccag     1080

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg     1140

ctcaacgacg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca     1200

tcgcagatgc tgagaacggg caacaacttt accttcagct acacctttga ggacgttcct     1260

ttccacagca gctacgctca cagccagagt ctggaccgtc tcatgaatcc tctgattgac     1320

cagtacctgt actacttgtc taagactatc aacggatccg gccagaatca gcagactctg     1380

aagttcagcc aaggtgggcc taatacaatg gccaatcagg caaagaactg gctgccagga     1440

ccctgttacc gccaacaacg cgtctcaacg acaaccgggc aaaacaacaa tagcaacttt     1500

gcctggactg ctgggaccaa ataccatctg aatggaagaa attcattgat gaatcctggc     1560

cccgctatgg catcccacaa agagggcgag gaccgttttt ttcccctgtc cgggtccctg     1620

atttttggca aacaaaatgc tgccagagac aatgcggatt acagcgatgt catgctcacc     1680

agcgaggaag aaatcaaaac cactaaccct gtggctacag aggaatacgg tatcgtggca     1740

gataacttgc agcagcaaaa cacggctcct caaattggaa ctgtcaacag ccagggggcc     1800

ttacccggta tggtctggca gaaccgggac gtgtacctgc agggtcccat ctgggccaag     1860

attcctcaca cggacggcaa cttccacccg tctccgctga tgggcggctt tggcctgaaa     1920

catcctccgc ctcagatcct gatcaagaac acgcctgtac ctgcggatcc tccgaccacc     1980

ttcaaccagt caaagctgaa ctctttcatc acgcaataca gcaccggaca ggtcagcgtg     2040

gaaattgaat gggagctgca gaaggaaaac agcaagcgct ggaaccccga gatccagtac     2100

acctccaact actacaaatc tacaagtgtg gactttgctg ttaatacaga aggcgtgtac     2160

tctgaacccc accccattgg cacccgttac ctcacccgtc ccctgtaa                  2208


<210>  23
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e10 VP1

<400>  23
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccagcagcc cgcgaaaaag agactcaact ttgggcagac tggcgactca      540

gagtcagtgc ccgaccctca accaatcgga gaaccccccg caggcccctc tggtctggga      600

tctggtacaa tggcttcagg cggtggcgca ccaatggcgg acaataacga aggcgccgac      660

ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgcac ctgggccttg cccacctaca ataaccacct ctacaagcaa      780

atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccactttt caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaagct ctttaacatt      960

caagtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaacg acggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggacgtt     1260

cctttccaca gcagctacgc tcacagccag agtctggacc gtctcatgaa tcctctgatt     1320

gaccagtacc tgtactactt gtctaagact atcaacggat ccggccagaa tcagcagact     1380

ctgaagttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaaaa tgctgccaga gacaatgcgg attacagcga tgtcatgctc     1680

accagcgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  24
<211>  1602
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ITB102 214 (AAV214) VP3

<400>  24
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

gacggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cgtctcatga atcctctgat tgaccagtac      720

ctgtactact tgtctaagac tatcaacgga tccggccaga atcagcagac tctgaagttc      780

agccaaggtg ggcctaatac aatggccaat caggcaaaga actggctgcc aggaccctgt      840

taccgccaac aacgcgtctc aacgacaacc gggcaaaaca acaatagcaa ctttgcctgg      900

actgctggga ccaaatacca tctgaatgga agaaattcat tgatgaatcc tggccccgct      960

atggcatccc acaaagaggg cgaggaccgt ttttttcccc tgtccgggtc cctgattttt     1020

ggcaaacaaa atgctgccag agacaatgcg gattacagcg atgtcatgct caccagcgag     1080

gaagaaatca aaaccactaa ccctgtggct acagaggaat acggtatcgt ggcagataac     1140

ttgcagcagc aaaacacggc tcctcaaatt ggaactgtca acagccaggg ggccttaccc     1200

ggtatggtct ggcagaaccg ggacgtgtac ctgcagggtc ccatctgggc caagattcct     1260

cacacggacg gcaacttcca cccgtctccg ctgatgggcg gctttggcct gaaacatcct     1320

ccgcctcaga tcctgatcaa gaacacgcct gtacctgcgg atcctccgac caccttcaac     1380

cagtcaaagc tgaactcttt catcacgcaa tacagcaccg gacaggtcag cgtggaaatt     1440

gaatgggagc tgcagaagga aaacagcaag cgctggaacc ccgagatcca gtacacctcc     1500

aactactaca aatctacaag tgtggacttt gctgttaata cagaaggcgt gtactctgaa     1560

ccccacccca ttggcacccg ttacctcacc cgtcccctgt aa                        1602


<210>  25
<211>  1605
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214-A VP3

<400>  25
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccaac      180

agcacatctg gaggatcttc aaatgacaac gcctacttcg gctacagcac cccctggggg      240

tattttgact tcaacagatt ccactgccac ttttcaccac gtgactggca aagactcatc      300

aacaacaact ggggattccg acccaagaga ctcaacttca agctctttaa cattcaagtc      360

aaagaggtta cggacaacaa tggagtcaag accatcgcca ataaccttac cagcacggtc      420

caggtcttca cggactcaga ctatcagctc ccgtacgtcc tcggctctgc gcaccagggc      480

tgcctccctc cgttcccggc ggacgtgttc atgattccgc agtacggcta cctaacgctc      540

aacgacggca gccaggcagt gggacggtca tccttttact gcctggaata tttcccatcg      600

cagatgctga gaacgggcaa caactttacc ttcagctaca cctttgagga cgttcctttc      660

cacagcagct acgctcacag ccagagtctg gaccgtctca tgaatcctct gattgaccag      720

tacctgtact acttgtctaa gactatcaac ggatccggcc agaatcagca gactctgaag      780

ttcagccaag gtgggcctaa tacaatggcc aatcaggcaa agaactggct gccaggaccc      840

tgttaccgcc aacaacgcgt ctcaacgaca accgggcaaa acaacaatag caactttgcc      900

tggactgctg ggaccaaata ccatctgaat ggaagaaatt cattgatgaa tcctggcccc      960

gctatggcat cccacaaaga gggcgaggac cgtttttttc ccctgtccgg gtccctgatt     1020

tttggcaaac aaaatgctgc cagagacaat gcggattaca gcgatgtcat gctcaccagc     1080

gaggaagaaa tcaaaaccac taaccctgtg gctacagagg aatacggtat cgtggcagat     1140

aacttgcagc agcaaaacac ggctcctcaa attggaactg tcaacagcca gggggcctta     1200

cccggtatgg tctggcagaa ccgggacgtg tacctgcagg gtcccatctg ggccaagatt     1260

cctcacacgg acggcaactt ccacccgtct ccgctgatgg gcggctttgg cctgaaacat     1320

cctccgcctc agatcctgat caagaacacg cctgtacctg cggatcctcc gaccaccttc     1380

aaccagtcaa agctgaactc tttcatcacg caatacagca ccggacaggt cagcgtggaa     1440

attgaatggg agctgcagaa ggaaaacagc aagcgctgga accccgagat ccagtacacc     1500

tccaactact acaaatctac aagtgtggac tttgctgtta atacagaagg cgtgtactct     1560

gaaccccacc ccattggcac ccgttacctc acccgtcccc tgtaa                     1605


<210>  26
<211>  1602
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e VP3

<400>  26
atggcttcag gcggtggcgc accaatggca gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

gacggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cgtctcatga atcctctgat tgaccagtac      720

ctgtactact tgtctaagac tatcaacgga tccggccaga atcagcagac tctgaagttc      780

agccaaggtg ggcctaatac aatggccaat caggcaaaga actggctgcc aggaccctgt      840

taccgccaac aacgcgtctc aacgacaacc gggcaaaaca acaatagcaa ctttgcctgg      900

actgctggga ccaaatacca tctgaatgga agaaattcat tgatgaatcc tggccccgct      960

atggcatccc acaaagaggg cgaggaccgt ttttttcccc tgtccgggtc cctgattttt     1020

ggcaaacaaa atgctgccag agacaatgcg gattacagcg atgtcatgct caccagcgag     1080

gaagaaatca aaaccactaa ccctgtggct acagaggaat acggtatcgt ggcagataac     1140

ttgcagcagc aaaacacggc tcctcaaatt ggaactgtca acagccaggg ggccttaccc     1200

ggtatggtct ggcagaaccg ggacgtgtac ctgcagggtc ccatctgggc caagattcct     1260

cacacggacg gcaacttcca cccgtctccg ctgatgggcg gctttggcct gaaacatcct     1320

ccgcctcaga tcctgatcaa gaacacgcct gtacctgcgg atcctccgac caccttcaac     1380

cagtcaaagc tgaactcttt catcacgcaa tacagcaccg gacaggtcag cgtggaaatt     1440

gaatgggagc tgcagaagga aaacagcaag cgctggaacc ccgagatcca gtacacctcc     1500

aactactaca aatctacaag tgtggacttt gctgttaata cagaaggcgt gtactctgaa     1560

ccccacccca ttggcacccg ttacctcacc cgtcccctgt aa                        1602


<210>  27
<211>  1602
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e8 VP3

<400>  27
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

gacggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cgtctcatga atcctctgat tgaccagtac      720

ctgtactact tgtctaagac tatcaacgga tccggccaga atcagcagac tctgaagttc      780

agccaaggtg ggcctaatac aatggccaat caggcaaaga actggctgcc aggaccctgt      840

taccgccaac aacgcgtctc aacgacaacc gggcaaaaca acaatagcaa ctttgcctgg      900

actgctggga ccaaatacca tctgaatgga agaaattcat tgatgaatcc tggccccgct      960

atggcatccc acaaagaggg cgaggaccgt ttttttcccc tgtccgggtc cctgattttt     1020

ggcaaacaaa atgctgccag agacaatgcg gattacagcg atgtcatgct caccagcgag     1080

gaagaaatca aaaccactaa ccctgtggct acagaggaat acggtatcgt ggcagataac     1140

ttgcagcagc aaaacacggc tcctcaaatt ggaactgtca acagccaggg ggccttaccc     1200

ggtatggtct ggcagaaccg ggacgtgtac ctgcagggtc ccatctgggc caagattcct     1260

cacacggacg gcaacttcca cccgtctccg ctgatgggcg gctttggcct gaaacatcct     1320

ccgcctcaga tcctgatcaa gaacacgcct gtacctgcgg atcctccgac caccttcaac     1380

cagtcaaagc tgaactcttt catcacgcaa tacagcaccg gacaggtcag cgtggaaatt     1440

gaatgggagc tgcagaagga aaacagcaag cgctggaacc ccgagatcca gtacacctcc     1500

aactactaca aatctacaag tgtggacttt gctgttaata cagaaggcgt gtactctgaa     1560

ccccacccca ttggcacccg ttacctcacc cgtcccctgt aa                        1602


<210>  28
<211>  1602
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e9 VP3

<400>  28
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

gacggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cgtctcatga atcctctgat tgaccagtac      720

ctgtactact tgtctaagac tatcaacgga tccggccaga atcagcagac tctgaagttc      780

agccaaggtg ggcctaatac aatggccaat caggcaaaga actggctgcc aggaccctgt      840

taccgccaac aacgcgtctc aacgacaacc gggcaaaaca acaatagcaa ctttgcctgg      900

actgctggga ccaaatacca tctgaatgga agaaattcat tgatgaatcc tggccccgct      960

atggcatccc acaaagaggg cgaggaccgt ttttttcccc tgtccgggtc cctgattttt     1020

ggcaaacaaa atgctgccag agacaatgcg gattacagcg atgtcatgct caccagcgag     1080

gaagaaatca aaaccactaa ccctgtggct acagaggaat acggtatcgt ggcagataac     1140

ttgcagcagc aaaacacggc tcctcaaatt ggaactgtca acagccaggg ggccttaccc     1200

ggtatggtct ggcagaaccg ggacgtgtac ctgcagggtc ccatctgggc caagattcct     1260

cacacggacg gcaacttcca cccgtctccg ctgatgggcg gctttggcct gaaacatcct     1320

ccgcctcaga tcctgatcaa gaacacgcct gtacctgcgg atcctccgac caccttcaac     1380

cagtcaaagc tgaactcttt catcacgcaa tacagcaccg gacaggtcag cgtggaaatt     1440

gaatgggagc tgcagaagga aaacagcaag cgctggaacc ccgagatcca gtacacctcc     1500

aactactaca aatctacaag tgtggacttt gctgttaata cagaaggcgt gtactctgaa     1560

ccccacccca ttggcacccg ttacctcacc cgtcccctgt aa                        1602


<210>  29
<211>  1602
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e10 VP3

<400>  29
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

gacggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cgtctcatga atcctctgat tgaccagtac      720

ctgtactact tgtctaagac tatcaacgga tccggccaga atcagcagac tctgaagttc      780

agccaaggtg ggcctaatac aatggccaat caggcaaaga actggctgcc aggaccctgt      840

taccgccaac aacgcgtctc aacgacaacc gggcaaaaca acaatagcaa ctttgcctgg      900

actgctggga ccaaatacca tctgaatgga agaaattcat tgatgaatcc tggccccgct      960

atggcatccc acaaagaggg cgaggaccgt ttttttcccc tgtccgggtc cctgattttt     1020

ggcaaacaaa atgctgccag agacaatgcg gattacagcg atgtcatgct caccagcgag     1080

gaagaaatca aaaccactaa ccctgtggct acagaggaat acggtatcgt ggcagataac     1140

ttgcagcagc aaaacacggc tcctcaaatt ggaactgtca acagccaggg ggccttaccc     1200

ggtatggtct ggcagaaccg ggacgtgtac ctgcagggtc ccatctgggc caagattcct     1260

cacacggacg gcaacttcca cccgtctccg ctgatgggcg gctttggcct gaaacatcct     1320

ccgcctcaga tcctgatcaa gaacacgcct gtacctgcgg atcctccgac caccttcaac     1380

cagtcaaagc tgaactcttt catcacgcaa tacagcaccg gacaggtcag cgtggaaatt     1440

gaatgggagc tgcagaagga aaacagcaag cgctggaacc ccgagatcca gtacacctcc     1500

aactactaca aatctacaag tgtggacttt gctgttaata cagaaggcgt gtactctgaa     1560

ccccacccca ttggcacccg ttacctcacc cgtcccctgt aa                        1602


<210>  30
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214A VP1

<400>  30

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu 
545                 550                 555                 560 


Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  31
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e VP1

<400>  31

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 
            180                 185                 190         


Pro Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
    210                 215                 220                 


Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn 
            260                 265                 270         


His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu 
545                 550                 555                 560 


Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  32
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e8 VP1

<400>  32

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 
            180                 185                 190         


Pro Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ser Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
    210                 215                 220                 


Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn 
            260                 265                 270         


His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu 
545                 550                 555                 560 


Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  33
<211>  735
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e9 VP1

<400>  33

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
            260                 265                 270         


Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
        275                 280                 285             


His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 
    290                 295                 300                 


Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 
305                 310                 315                 320 


Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn Asn 
                325                 330                 335     


Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu Pro 
            340                 345                 350         


Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 
        355                 360                 365             


Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp Gly 
    370                 375                 380                 


Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385                 390                 395                 400 


Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
                405                 410                 415     


Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 
            420                 425                 430         


Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Lys 
        435                 440                 445             


Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser Gln 
    450                 455                 460                 


Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro Gly 
465                 470                 475                 480 


Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn Asn 
                485                 490                 495     


Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn Gly 
            500                 505                 510         


Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys Glu 
        515                 520                 525             


Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly Lys 
    530                 535                 540                 


Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu Thr 
545                 550                 555                 560 


Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr 
                565                 570                 575     


Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln Ile 
            580                 585                 590         


Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln Asn 
        595                 600                 605             


Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr 
    610                 615                 620                 


Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625                 630                 635                 640 


His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asp 
                645                 650                 655     


Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr Gln 
            660                 665                 670         


Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln Lys 
        675                 680                 685             


Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr 
    690                 695                 700                 


Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val Tyr 
705                 710                 715                 720 


Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735 


<210>  34
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e10 VP1

<400>  34

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro 
            180                 185                 190         


Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ser Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
    210                 215                 220                 


Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn 
            260                 265                 270         


His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu 
545                 550                 555                 560 


Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  35
<211>  598
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214 VP2

<400>  35

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ser Gly Ile Gly Lys Thr Gly Gln Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr 
        115                 120                 125             


Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
    130                 135                 140                 


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
145                 150                 155                 160 


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
                165                 170                 175     


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
            180                 185                 190         


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
        195                 200                 205             


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
    210                 215                 220                 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
225                 230                 235                 240 


Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
                245                 250                 255     


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
            260                 265                 270         


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
        275                 280                 285             


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
    290                 295                 300                 


Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln 
305                 310                 315                 320 


Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln 
                325                 330                 335     


Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
            340                 345                 350         


Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly 
        355                 360                 365             


Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
    370                 375                 380                 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
385                 390                 395                 400 


Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp 
                405                 410                 415     


Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn 
            420                 425                 430         


Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln 
        435                 440                 445             


Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu 
    450                 455                 460                 


Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile 
465                 470                 475                 480 


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
                485                 490                 495     


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
            500                 505                 510         


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys 
        515                 520                 525             


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
    530                 535                 540                 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
545                 550                 555                 560 


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala 
                565                 570                 575     


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg 
            580                 585                 590         


Tyr Leu Thr Arg Pro Leu 
        595             


<210>  36
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214A VP2

<400>  36

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ser Gly Ile Gly Lys Thr Gly Gln Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Ser Thr Ser 
        115                 120                 125             


Gly Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
                405                 410                 415     


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  37
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e VP2

<400>  37

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Pro Ser Pro Gln Arg Ser 
1               5                   10                  15      


Pro Asp Ser Ser Thr Gly Ile Gly Lys Lys Gly Gln Gln Pro Ala Arg 
            20                  25                  30          


Lys Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp 
        35                  40                  45              


Pro Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro 
    50                  55                  60                  


Thr Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu 
65                  70                  75                  80  


Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser 
                85                  90                  95      


Thr Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala 
            100                 105                 110         


Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser 
        115                 120                 125             


Thr Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
                405                 410                 415     


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  38
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e8 VP2

<400>  38

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Pro Ser Pro Gln Arg Ser 
1               5                   10                  15      


Pro Asp Ser Ser Thr Gly Ile Gly Lys Lys Gly Gln Gln Pro Ala Arg 
            20                  25                  30          


Lys Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp 
        35                  40                  45              


Pro Gln Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser Gly Val Gly Pro 
    50                  55                  60                  


Asn Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu 
65                  70                  75                  80  


Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser 
                85                  90                  95      


Thr Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala 
            100                 105                 110         


Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser 
        115                 120                 125             


Thr Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
                405                 410                 415     


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  39
<211>  598
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e9 VP2

<400>  39

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ala Gly Ile Gly Lys Ser Gly Ala Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Thr Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Ile Gly Glu Pro Pro Ala Ala Pro Ser Gly Val Gly Ser Leu 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr 
        115                 120                 125             


Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
    130                 135                 140                 


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
145                 150                 155                 160 


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
                165                 170                 175     


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
            180                 185                 190         


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
        195                 200                 205             


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
    210                 215                 220                 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
225                 230                 235                 240 


Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
                245                 250                 255     


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
            260                 265                 270         


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
        275                 280                 285             


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
    290                 295                 300                 


Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln 
305                 310                 315                 320 


Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln 
                325                 330                 335     


Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
            340                 345                 350         


Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly 
        355                 360                 365             


Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
    370                 375                 380                 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
385                 390                 395                 400 


Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp 
                405                 410                 415     


Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn 
            420                 425                 430         


Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln 
        435                 440                 445             


Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu 
    450                 455                 460                 


Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile 
465                 470                 475                 480 


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
                485                 490                 495     


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
            500                 505                 510         


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys 
        515                 520                 525             


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
    530                 535                 540                 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
545                 550                 555                 560 


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala 
                565                 570                 575     


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg 
            580                 585                 590         


Tyr Leu Thr Arg Pro Leu 
        595             


<210>  40
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e10 VP2

<400>  40

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Pro Ser Pro Gln Arg Ser 
1               5                   10                  15      


Pro Asp Ser Ser Thr Gly Ile Gly Lys Lys Gly Gln Gln Pro Ala Lys 
            20                  25                  30          


Lys Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp 
        35                  40                  45              


Pro Gln Pro Ile Gly Glu Pro Pro Ala Gly Pro Ser Gly Leu Gly Ser 
    50                  55                  60                  


Gly Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu 
65                  70                  75                  80  


Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser 
                85                  90                  95      


Thr Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala 
            100                 105                 110         


Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser 
        115                 120                 125             


Thr Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
                405                 410                 415     


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  41
<211>  533
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214 VP3

<400>  41

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln 
                245                 250                 255     


Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala 
            260                 265                 270         


Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr 
        275                 280                 285             


Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr 
    290                 295                 300                 


Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala 
305                 310                 315                 320 


Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
                325                 330                 335     


Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr 
            340                 345                 350         


Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro 
        355                 360                 365             


Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln 
    370                 375                 380                 


Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro 
385                 390                 395                 400 


Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     


Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met 
            420                 425                 430         


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn 
        435                 440                 445             


Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu 
    450                 455                 460                 


Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile 
465                 470                 475                 480 


Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile 
                485                 490                 495     


Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val 
            500                 505                 510         


Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr 
        515                 520                 525             


Leu Thr Arg Pro Leu 
    530             


<210>  42
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214A VP3

<400>  42

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly 
    50                  55                  60                  


Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
65                  70                  75                  80  


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
                85                  90                  95      


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
            100                 105                 110         


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
        115                 120                 125             


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
    130                 135                 140                 


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
145                 150                 155                 160 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
                165                 170                 175     


Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
            180                 185                 190         


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
        195                 200                 205             


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
    210                 215                 220                 


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
225                 230                 235                 240 


Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln 
                245                 250                 255     


Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln 
            260                 265                 270         


Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly 
    290                 295                 300                 


Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
                325                 330                 335     


Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp 
            340                 345                 350         


Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln 
    370                 375                 380                 


Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys 
    450                 455                 460                 


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala 
            500                 505                 510         


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Pro Leu 
    530                 


<210>  43
<211>  533
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214E VP3

<400>  43

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln 
                245                 250                 255     


Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala 
            260                 265                 270         


Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr 
        275                 280                 285             


Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr 
    290                 295                 300                 


Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala 
305                 310                 315                 320 


Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
                325                 330                 335     


Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr 
            340                 345                 350         


Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro 
        355                 360                 365             


Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln 
    370                 375                 380                 


Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro 
385                 390                 395                 400 


Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     


Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met 
            420                 425                 430         


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn 
        435                 440                 445             


Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu 
    450                 455                 460                 


Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile 
465                 470                 475                 480 


Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile 
                485                 490                 495     


Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val 
            500                 505                 510         


Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr 
        515                 520                 525             


Leu Thr Arg Pro Leu 
    530             


<210>  44
<211>  533
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214E8 VP3

<400>  44

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln 
                245                 250                 255     


Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala 
            260                 265                 270         


Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr 
        275                 280                 285             


Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr 
    290                 295                 300                 


Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala 
305                 310                 315                 320 


Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
                325                 330                 335     


Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr 
            340                 345                 350         


Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro 
        355                 360                 365             


Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln 
    370                 375                 380                 


Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro 
385                 390                 395                 400 


Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     


Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met 
            420                 425                 430         


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn 
        435                 440                 445             


Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu 
    450                 455                 460                 


Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile 
465                 470                 475                 480 


Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile 
                485                 490                 495     


Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val 
            500                 505                 510         


Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr 
        515                 520                 525             


Leu Thr Arg Pro Leu 
    530             


<210>  45
<211>  533
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214E9 VP3

<400>  45

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln 
                245                 250                 255     


Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala 
            260                 265                 270         


Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr 
        275                 280                 285             


Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr 
    290                 295                 300                 


Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala 
305                 310                 315                 320 


Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
                325                 330                 335     


Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr 
            340                 345                 350         


Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro 
        355                 360                 365             


Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln 
    370                 375                 380                 


Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro 
385                 390                 395                 400 


Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     


Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met 
            420                 425                 430         


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn 
        435                 440                 445             


Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu 
    450                 455                 460                 


Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile 
465                 470                 475                 480 


Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile 
                485                 490                 495     


Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val 
            500                 505                 510         


Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr 
        515                 520                 525             


Leu Thr Arg Pro Leu 
    530             


<210>  46
<211>  533
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214E10 VP3

<400>  46

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln 
                245                 250                 255     


Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala 
            260                 265                 270         


Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr 
        275                 280                 285             


Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr 
    290                 295                 300                 


Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala 
305                 310                 315                 320 


Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
                325                 330                 335     


Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr 
            340                 345                 350         


Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro 
        355                 360                 365             


Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln 
    370                 375                 380                 


Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro 
385                 390                 395                 400 


Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     


Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met 
            420                 425                 430         


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn 
        435                 440                 445             


Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu 
    450                 455                 460                 


Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile 
465                 470                 475                 480 


Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile 
                485                 490                 495     


Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val 
            500                 505                 510         


Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr 
        515                 520                 525             


Leu Thr Arg Pro Leu 
    530             


<210>  47
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ITB102 45 VP1

<400>  47
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg acttcaacag attccactgc cacttttcac cacgtgactg gcaaagactc      900

atcaacaaca actggggatt ccgacccaag agactcaact tcaagctctt taacattcaa      960

gtcaaagagg ttacggacaa caatggagtc aagaccatcg ccaataacct taccagcacg     1020

gtccaggtct tcacggactc agactatcag ctcccgtacg tgctcgggtc ggctcacgag     1080

ggctgcctcc cgccgttccc agcggacgtt ttcatgattc ctcagtacgg ctacctaacg     1140

ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca     1200

tcgcagatgc tgagaacggg caacaacttt accttcagct acacctttga ggacgttcct     1260

ttccacagca gctacgctca cagccagagt ctggaccggc tgatgaatcc tctgattgac     1320

cagtacctgt actacttgtc tcggactcaa acaacaggag gcacggcaaa tacgcagact     1380

ctgggcttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaagg cactggcaga gacaatgtgg atgccgacaa agtcatgatc     1680

accaacgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  48
<211>  1605
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ITB102 45 VP3

<400>  48
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtgctcg ggtcggctca cgagggctgc      480

ctcccgccgt tcccagcgga cgttttcatg attcctcagt acggctacct aacgctcaac      540

aatggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cggctgatga atcctctgat tgaccagtac      720

ctgtactact tgtctcggac tcaaacaaca ggaggcacgg caaatacgca gactctgggc      780

ttcagccaag gtgggcctaa tacaatggcc aatcaggcaa agaactggct gccaggaccc      840

tgttaccgcc aacaacgcgt ctcaacgaca accgggcaaa acaacaatag caactttgcc      900

tggactgctg ggaccaaata ccatctgaat ggaagaaatt cattgatgaa tcctggcccc      960

gctatggcat cccacaaaga gggcgaggac cgtttttttc ccctgtccgg gtccctgatt     1020

tttggcaaac aaggcactgg cagagacaat gtggatgccg acaaagtcat gatcaccaac     1080

gaggaagaaa tcaaaaccac taaccctgtg gctacagagg aatacggtat cgtggcagat     1140

aacttgcagc agcaaaacac ggctcctcaa attggaactg tcaacagcca gggggcctta     1200

cccggtatgg tctggcagaa ccgggacgtg tacctgcagg gtcccatctg ggccaagatt     1260

cctcacacgg acggcaactt ccacccgtct ccgctgatgg gcggctttgg cctgaaacat     1320

cctccgcctc agatcctgat caagaacacg cctgtacctg cggatcctcc gaccaccttc     1380

aaccagtcaa agctgaactc tttcatcacg caatacagca ccggacaggt cagcgtggaa     1440

attgaatggg agctgcagaa ggaaaacagc aagcgctgga accccgagat ccagtacacc     1500

tccaactact acaaatctac aagtgtggac tttgctgtta atacagaagg cgtgtactct     1560

gaaccccacc ccattggcac ccgttacctc acccgtcccc tgtaa                     1605


<210>  49
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ITB102 45 VP1

<400>  49

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
            260                 265                 270         


Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
        275                 280                 285             


His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 
    290                 295                 300                 


Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 
305                 310                 315                 320 


Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn Asn 
                325                 330                 335     


Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu Pro 
            340                 345                 350         


Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro Ala 
        355                 360                 365             


Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
    370                 375                 380                 


Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385                 390                 395                 400 


Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
                405                 410                 415     


Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 
            420                 425                 430         


Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Arg 
        435                 440                 445             


Thr Gln Thr Thr Gly Gly Thr Ala Asn Thr Gln Thr Leu Gly Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  50
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ITB102 45 VP2

<400>  50

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ser Gly Ile Gly Lys Thr Gly Gln Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr 
        115                 120                 125             


Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
    130                 135                 140                 


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
145                 150                 155                 160 


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
                165                 170                 175     


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
            180                 185                 190         


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
        195                 200                 205             


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Glu Gly 
    210                 215                 220                 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
225                 230                 235                 240 


Tyr Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
                245                 250                 255     


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
            260                 265                 270         


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
        275                 280                 285             


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
    290                 295                 300                 


Tyr Leu Tyr Tyr Leu Ser Arg Thr Gln Thr Thr Gly Gly Thr Ala Asn 
305                 310                 315                 320 


Thr Gln Thr Leu Gly Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Gly Thr Gly Arg Asp Asn Val 
                405                 410                 415     


Asp Ala Asp Lys Val Met Ile Thr Asn Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  51
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ITB102 45 VP3

<400>  51

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Arg Thr Gln Thr Thr Gly Gly Thr Ala Asn Thr 
                245                 250                 255     


Gln Thr Leu Gly Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln 
            260                 265                 270         


Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly 
    290                 295                 300                 


Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
                325                 330                 335     


Gly Ser Leu Ile Phe Gly Lys Gln Gly Thr Gly Arg Asp Asn Val Asp 
            340                 345                 350         


Ala Asp Lys Val Met Ile Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln 
    370                 375                 380                 


Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys 
    450                 455                 460                 


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala 
            500                 505                 510         


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Pro Leu 
    530                 


<210>  52
<211>  7
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-I

<400>  52

Ser Ala Ser Thr Gly Ala Ser 
1               5           


<210>  53
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-I

<400>  53

Asn Ser Thr Ser Gly Gly Ser Ser 
1               5               


<210>  54
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-II

<400>  54

Asp Asn Asn Gly Val Lys 
1               5       


<210>  55
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-III

<400>  55

Asn Asp Gly Ser 
1               


<210>  56
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-IV

<400>  56

Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr 
1               5                   10  


<210>  57
<211>  18
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-V

<400>  57

Arg Val Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp 
1               5                   10                  15      


Thr Ala 
        


<210>  58
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-VI

<400>  58

His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
1               5                   10              


<210>  59
<211>  14
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-VII

<400>  59

Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val 
1               5                   10                  


<210>  60
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-VIII

<400>  60

Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln Ile 
1               5                   10              


<210>  61
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-IX

<400>  61

Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
1               5                   10  


<210>  62
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV6 VP1

<400>  62
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaaga gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcattggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc      900

atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa      960

gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg     1020

gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag     1080

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg     1140

ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca     1200

tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct     1260

ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac     1320

cagtacctgt attacctgaa cagaactcag aatcagtccg gaagtgccca aaacaaggac     1380

ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct     1440

ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac     1500

tttacctgga ctggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct     1560

ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc     1620

atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc     1680

acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg     1740

gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga     1800

gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc     1860

aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt     1920

aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca     1980

gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc     2040

gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag     2100

tatacatcta actatgcaaa atctgccaac gttgatttca ctgtggacaa caatggactt     2160

tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  63
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV6 VP1

<400>  63

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
            260                 265                 270         


Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
        275                 280                 285             


His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 
    290                 295                 300                 


Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 
305                 310                 315                 320 


Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 
                325                 330                 335     


Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 
            340                 345                 350         


Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 
        355                 360                 365             


Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
    370                 375                 380                 


Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385                 390                 395                 400 


Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
                405                 410                 415     


Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 
            420                 425                 430         


Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 
        435                 440                 445             


Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser 
    450                 455                 460                 


Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 
            500                 505                 510         


Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 
        515                 520                 525             


Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 
    530                 535                 540                 


Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 
545                 550                 555                 560 


Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 
                565                 570                 575     


Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala 
            580                 585                 590         


Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705                 710                 715                 720 


Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  64
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV6 VP2

<400>  64

Thr Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ser Gly Ile Gly Lys Thr Gly Gln Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr 
        115                 120                 125             


Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
    130                 135                 140                 


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
145                 150                 155                 160 


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
                165                 170                 175     


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Thr Asn Asp Gly 
            180                 185                 190         


Val Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Ser 
        195                 200                 205             


Asp Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
    210                 215                 220                 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
225                 230                 235                 240 


Tyr Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
                245                 250                 255     


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
            260                 265                 270         


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
        275                 280                 285             


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
    290                 295                 300                 


Tyr Leu Tyr Tyr Leu Asn Arg Thr Gln Asn Gln Ser Gly Ser Ala Gln 
305                 310                 315                 320 


Asn Lys Asp Leu Leu Phe Ser Arg Gly Ser Pro Ala Gly Met Ser Val 
                325                 330                 335     


Gln Pro Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Lys Thr Lys Thr Asp Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly 
        355                 360                 365             


Ala Ser Lys Tyr Asn Leu Asn Gly Arg Glu Ser Ile Ile Asn Pro Gly 
    370                 375                 380                 


Thr Ala Met Ala Ser His Lys Asp Asp Lys Asp Lys Phe Phe Pro Met 
385                 390                 395                 400 


Ser Gly Val Met Ile Phe Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr 
                405                 410                 415     


Ala Leu Asp Asn Val Met Ile Thr Asp Glu Glu Glu Ile Lys Ala Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Arg Phe Gly Thr Val Ala Val Asn Leu Gln 
        435                 440                 445             


Ser Ser Ser Thr Asp Pro Ala Thr Gly Asp Val His Val Met Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly His Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr 
        515                 520                 525             


Lys Phe Ala Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Val Gln Tyr Thr Ser Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe 
                565                 570                 575     


Thr Val Asp Asn Asn Gly Leu Tyr Thr Glu Pro Arg Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  65
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV6 VP3

<400>  65

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Thr Asn Asp Gly Val 
        115                 120                 125             


Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Ser Asp 
    130                 135                 140                 


Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Asn Arg Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn 
                245                 250                 255     


Lys Asp Leu Leu Phe Ser Arg Gly Ser Pro Ala Gly Met Ser Val Gln 
            260                 265                 270         


Pro Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Lys Thr Lys Thr Asp Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala 
    290                 295                 300                 


Ser Lys Tyr Asn Leu Asn Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Asp Asp Lys Asp Lys Phe Phe Pro Met Ser 
                325                 330                 335     


Gly Val Met Ile Phe Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala 
            340                 345                 350         


Leu Asp Asn Val Met Ile Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Arg Phe Gly Thr Val Ala Val Asn Leu Gln Ser 
    370                 375                 380                 


Ser Ser Thr Asp Pro Ala Thr Gly Asp Val His Val Met Gly Ala Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly His Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys 
    450                 455                 460                 


Phe Ala Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Val Gln Tyr Thr Ser Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr 
            500                 505                 510         


Val Asp Asn Asn Gly Leu Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Pro Leu 
    530                 


<210>  66
<211>  2217
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV8 VP1

<400>  66
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagttc cagaccctca acctctcgga gaacctccag cagcgccctc tggtgtggga      600

cctaatacaa tggctgcagg cggtggcgca ccaatggcag acaataacga aggcgccgac      660

ggagtgggta gttcctcggg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgaac ctgggccctg cccacctaca acaaccacct ctacaagcaa      780

atctccaacg ggacatcggg aggagccacc aacgacaaca cctacttcgg ctacagcacc      840

ccctgggggt attttgactt taacagattc cactgccact tttcaccacg tgactggcag      900

cgactcatca acaacaactg gggattccgg cccaagagac tcagcttcaa gctcttcaac      960

atccaggtca aggaggtcac gcagaatgaa ggcaccaaga ccatcgccaa taacctcacc     1020

agcaccatcc aggtgtttac ggactcggag taccagctgc cgtacgttct cggctctgcc     1080

caccagggct gcctgcctcc gttcccggcg gacgtgttca tgattcccca gtacggctac     1140

ctaacactca acaacggtag tcaggccgtg ggacgctcct ccttctactg cctggaatac     1200

tttccttcgc agatgctgag aaccggcaac aacttccagt ttacttacac cttcgaggac     1260

gtgcctttcc acagcagcta cgcccacagc cagagcttgg accggctgat gaatcctctg     1320

attgaccagt acctgtacta cttgtctcgg actcaaacaa caggaggcac ggcaaatacg     1380

cagactctgg gcttcagcca aggtgggcct aatacaatgg ccaatcaggc aaagaactgg     1440

ctgccaggac cctgttaccg ccaacaacgc gtctcaacga caaccgggca aaacaacaat     1500

agcaactttg cctggactgc tgggaccaaa taccatctga atggaagaaa ttcattggct     1560

aatcctggca tcgctatggc aacacacaaa gacgacgagg agcgtttttt tcccagtaac     1620

gggatcctga tttttggcaa acaaaatgct gccagagaca atgcggatta cagcgatgtc     1680

atgctcacca gcgaggaaga aatcaaaacc actaaccctg tggctacaga ggaatacggt     1740

atcgtggcag ataacttgca gcagcaaaac acggctcctc aaattggaac tgtcaacagc     1800

cagggggcct tacccggtat ggtctggcag aaccgggacg tgtacctgca gggtcccatc     1860

tgggccaaga ttcctcacac ggacggcaac ttccacccgt ctccgctgat gggcggcttt     1920

ggcctgaaac atcctccgcc tcagatcctg atcaagaaca cgcctgtacc tgcggatcct     1980

ccgaccacct tcaaccagtc aaagctgaac tctttcatca cgcaatacag caccggacag     2040

gtcagcgtgg aaattgaatg ggagctgcag aaggaaaaca gcaagcgctg gaaccccgag     2100

atccagtaca cctccaacta ctacaaatct acaagtgtgg actttgctgt taatacagaa     2160

ggcgtgtact ctgaaccccg ccccattggc acccgttacc tcacccgtaa tctgtaa        2217


<210>  67
<211>  738
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV8 VP1

<400>  67

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 
            180                 185                 190         


Pro Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ala Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Thr Tyr 
                405                 410                 415     


Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Thr Thr Gly Gly Thr Ala Asn Thr Gln Thr Leu Gly 
    450                 455                 460                 


Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asn Ser Leu Ala Asn Pro Gly Ile Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Asn Gly Ile Leu Ile 
    530                 535                 540                 


Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala 
            580                 585                 590         


Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 
        595                 600                 605             


Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 
    610                 615                 620                 


Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625                 630                 635                 640 


Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 
                645                 650                 655     


Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe 
            660                 665                 670         


Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 
        675                 680                 685             


Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 
    690                 695                 700                 


Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu 
705                 710                 715                 720 


Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 
                725                 730                 735     


Asn Leu 
        


<210>  68
<211>  601
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV8 VP2

<400>  68

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Pro Ser Pro Gln Arg Ser 
1               5                   10                  15      


Pro Asp Ser Ser Thr Gly Ile Gly Lys Lys Gly Gln Gln Pro Ala Arg 
            20                  25                  30          


Lys Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp 
        35                  40                  45              


Pro Gln Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser Gly Val Gly Pro 
    50                  55                  60                  


Asn Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu 
65                  70                  75                  80  


Gly Ala Asp Gly Val Gly Ser Ser Ser Gly Asn Trp His Cys Asp Ser 
                85                  90                  95      


Thr Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala 
            100                 105                 110         


Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Gly Thr 
        115                 120                 125             


Ser Gly Gly Ala Thr Asn Asp Asn Thr Tyr Phe Gly Tyr Ser Thr Pro 
    130                 135                 140                 


Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg 
145                 150                 155                 160 


Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg 
                165                 170                 175     


Leu Ser Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Gln Asn 
            180                 185                 190         


Glu Gly Thr Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Ile Gln Val 
        195                 200                 205             


Phe Thr Asp Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His 
    210                 215                 220                 


Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln 
225                 230                 235                 240 


Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser 
                245                 250                 255     


Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly 
            260                 265                 270         


Asn Asn Phe Gln Phe Thr Tyr Thr Phe Glu Asp Val Pro Phe His Ser 
        275                 280                 285             


Ser Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile 
    290                 295                 300                 


Asp Gln Tyr Leu Tyr Tyr Leu Ser Arg Thr Gln Thr Thr Gly Gly Thr 
305                 310                 315                 320 


Ala Asn Thr Gln Thr Leu Gly Phe Ser Gln Gly Gly Pro Asn Thr Met 
                325                 330                 335     


Ala Asn Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln 
            340                 345                 350         


Arg Val Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp 
        355                 360                 365             


Thr Ala Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Ala Asn 
    370                 375                 380                 


Pro Gly Ile Ala Met Ala Thr His Lys Asp Asp Glu Glu Arg Phe Phe 
385                 390                 395                 400 


Pro Ser Asn Gly Ile Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp 
                405                 410                 415     


Asn Ala Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys 
            420                 425                 430         


Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn 
        435                 440                 445             


Leu Gln Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln 
    450                 455                 460                 


Gly Ala Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln 
465                 470                 475                 480 


Gly Pro Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro 
                485                 490                 495     


Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile 
            500                 505                 510         


Leu Ile Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn 
        515                 520                 525             


Gln Ser Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val 
    530                 535                 540                 


Ser Val Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp 
545                 550                 555                 560 


Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val 
                565                 570                 575     


Asp Phe Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile 
            580                 585                 590         


Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
        595                 600     


<210>  69
<211>  535
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV8 VP3

<400>  69

Met Ala Ala Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Ser Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly 
    50                  55                  60                  


Gly Ala Thr Asn Asp Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
65                  70                  75                  80  


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
                85                  90                  95      


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Ser 
            100                 105                 110         


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly 
        115                 120                 125             


Thr Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr 
    130                 135                 140                 


Asp Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
145                 150                 155                 160 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
                165                 170                 175     


Tyr Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
            180                 185                 190         


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
        195                 200                 205             


Phe Gln Phe Thr Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
    210                 215                 220                 


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
225                 230                 235                 240 


Tyr Leu Tyr Tyr Leu Ser Arg Thr Gln Thr Thr Gly Gly Thr Ala Asn 
                245                 250                 255     


Thr Gln Thr Leu Gly Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
            260                 265                 270         


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
        275                 280                 285             


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
    290                 295                 300                 


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Ala Asn Pro Gly 
305                 310                 315                 320 


Ile Ala Met Ala Thr His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser 
                325                 330                 335     


Asn Gly Ile Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
            340                 345                 350         


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
        355                 360                 365             


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
    370                 375                 380                 


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
385                 390                 395                 400 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
                405                 410                 415     


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
            420                 425                 430         


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
        435                 440                 445             


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
    450                 455                 460                 


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
465                 470                 475                 480 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
                485                 490                 495     


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
            500                 505                 510         


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr 
        515                 520                 525             


Arg Tyr Leu Thr Arg Asn Leu 
    530                 535 


<210>  70
<211>  2214
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV9 VP1

<400>  70
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacggcaa ggcctacgac      240

cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagttc cagaccctca acctctcgga gaacctccag cagcgccctc tggtgtggga      600

cctaatacaa tggctgcagg cggtggcgca ccaatggcag acaataacga aggcgccgac      660

ggagtgggta attcctcggg aaattggcat tgcgattcca catggctggg ggacagagtc      720

atcaccacca gcacccgaac ctgggcattg cccacctaca acaaccacct ctacaagcaa      780

atctccaatg gaacatcggg aggaagcacc aacgacaaca cctactttgg ctacagcacc      840

ccctgggggt attttgactt caacagattc cactgccact tctcaccacg tgactggcag      900

cgactcatca acaacaactg gggattccgg ccaaagagac tcaacttcaa gctgttcaac      960

atccaggtca aggaggttac gacgaacgaa ggcaccaaga ccatcgccaa taaccttacc     1020

agcaccgtcc aggtctttac ggactcggag taccagctac cgtacgtcct aggctctgcc     1080

caccaaggat gcctgccacc gtttcctgca gacgtcttca tggttcctca gtacggctac     1140

ctgacgctca acaatggaag tcaagcgtta ggacgttctt ctttctactg tctggaatac     1200

ttcccttctc agatgctgag aaccggcaac aactttcagt tcagctacac tttcgaggac     1260

gtgcctttcc acagcagcta cgcacacagc cagagtctag atcgactgat gaaccccctc     1320

atcgaccagt acctatacta cctggtcaga acacagacaa ctggaactgg gggaactcaa     1380

actttggcat tcagccaagc aggccctagc tcaatggcca atcaggctag aaactgggta     1440

cccgggcctt gctaccgtca gcagcgcgtc tccacaacca ccaaccaaaa taacaacagc     1500

aactttgcgt ggacgggagc tgctaaattc aagctgaacg ggagagactc gctaatgaat     1560

cctggcgtgg ctatggcatc gcacaaagac gacgaggacc gcttctttcc atcaagtggc     1620

gttctcatat ttggcaagca aggagccggg aacgatggag tcgactacag ccaggtgctg     1680

attacagatg aggaagaaat taaagccacc aaccctgtag ccacagagga atacggagca     1740

gtggccatca acaaccaggc cgctaacacg caggcgcaaa ctggacttgt gcataaccag     1800

ggagttattc ctggtatggt ctggcagaac cgggacgtgt acctgcaggg ccctatttgg     1860

gctaaaatac ctcacacaga tggcaacttt cacccgtctc ctctgatggg tggatttgga     1920

ctgaaacacc cacctccaca gattctaatt aaaaatacac cagtgccggc agatcctcct     1980

cttaccttca atcaagccaa gctgaactct ttcatcacgc agtacagcac gggacaagtc     2040

agcgtggaaa tcgagtggga gctgcagaaa gaaaacagca agcgctggaa tccagagatc     2100

cagtatactt caaactacta caaatctaca aatgtggact ttgctgtcaa taccaaaggt     2160

gtttactctg agcctcgccc cattggtact cgttacctca cccgtaattt gtaa           2214


<210>  71
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV9 VP1

<400>  71

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  72
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV9 VP2

<400>  72

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ala Gly Ile Gly Lys Ser Gly Ala Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Thr Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Ile Gly Glu Pro Pro Ala Ala Pro Ser Gly Val Gly Ser Leu 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Val Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Ser Ser Ser Gly Asn Trp His Cys Asp Ser Gln 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Ser Thr Ser 
        115                 120                 125             


Gly Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Glu 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Gln Phe Ser Tyr Glu Phe Glu Asn Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Val Ala Gly Pro Ser Asn Met Ala Val 
                325                 330                 335     


Gln Gly Arg Asn Tyr Ile Pro Gly Pro Ser Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Val Thr Gln Asn Asn Asn Ser Glu Phe Ala Trp Pro Gly 
        355                 360                 365             


Ala Ser Ser Trp Ala Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Gly Thr Gly Arg Asp Asn Val 
                405                 410                 415     


Asp Ala Asp Lys Val Met Ile Thr Asn Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Ser Tyr Gly Gln Val Ala Thr Asn His Gln 
        435                 440                 445             


Ser Ala Gln Ala Gln Ala Gln Thr Gly Trp Val Gln Asn Gln Gly Ile 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Met Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Ala Phe Asn Lys Asp 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Asn Asn Val Glu Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Asn Leu 
        595                 


<210>  73
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV9 VP3

<400>  73

Met Ala Ser Gly Gly Gly Ala Pro Val Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Ser Ser Ser Gly Asn Trp His Cys Asp Ser Gln Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly 
    50                  55                  60                  


Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
65                  70                  75                  80  


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
                85                  90                  95      


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
            100                 105                 110         


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
        115                 120                 125             


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
    130                 135                 140                 


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Glu Gly 
145                 150                 155                 160 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
                165                 170                 175     


Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
            180                 185                 190         


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
        195                 200                 205             


Phe Gln Phe Ser Tyr Glu Phe Glu Asn Val Pro Phe His Ser Ser Tyr 
    210                 215                 220                 


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
225                 230                 235                 240 


Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln 
                245                 250                 255     


Gln Thr Leu Lys Phe Ser Val Ala Gly Pro Ser Asn Met Ala Val Gln 
            260                 265                 270         


Gly Arg Asn Tyr Ile Pro Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Thr Thr Val Thr Gln Asn Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala 
    290                 295                 300                 


Ser Ser Trp Ala Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
                325                 330                 335     


Gly Ser Leu Ile Phe Gly Lys Gln Gly Thr Gly Arg Asp Asn Val Asp 
            340                 345                 350         


Ala Asp Lys Val Met Ile Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Ser Tyr Gly Gln Val Ala Thr Asn His Gln Ser 
    370                 375                 380                 


Ala Gln Ala Gln Ala Gln Thr Gly Trp Val Gln Asn Gln Gly Ile Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Met Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys 
    450                 455                 460                 


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala 
            500                 505                 510         


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Asn Leu 
    530                 


<210>  74
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRII-204/AAV6

<400>  74

Thr Asn Asp Gly Val Lys 
1               5       


<210>  75
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRIII-204/AAV6

<400>  75

Asn Asn Gly Ser 
1               


<210>  76
<211>  11
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRIV-204/AAV6

<400>  76

Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp 
1               5                   10      


<210>  77
<211>  18
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRV-204/AAV6

<400>  77

Arg Val Ser Lys Thr Lys Thr Asp Asn Asn Asn Ser Asn Phe Thr Trp 
1               5                   10                  15      


Thr Gly 
        


<210>  78
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRVI-204/AAV6

<400>  78

His Lys Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly 
1               5                   10              


<210>  79
<211>  14
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRVII-204/AAV6

<400>  79

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val 
1               5                   10                  


<210>  80
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRVIII-204/AAV6

<400>  80

Ala Val Asn Leu Gln Asn Ser Ser Thr Asp Pro Ala Thr 
1               5                   10              


<210>  81
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRIX-204/AAV6

<400>  81

Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe 
1               5                   10  


<210>  82
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214-AB VP1

<400>  82
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc      780

tccagcagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccactttt caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaagct ctttaacatt      960

caagtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaacg acggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggacgtt     1260

cctttccaca gcagctacgc tcacagccag agtctggacc gtctcatgaa tcctctgatt     1320

gaccagtacc tgtactactt gtctaagact atcaacggat ccggccagaa tcagcagact     1380

ctgaagttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaaaa tgctgccaga gacaatgcgg attacagcga tgtcatgctc     1680

accagcgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  83
<211>  1605
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214-AB VP3

<400>  83
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagc      180

agcacatctg gaggatcttc aaatgacaac gcctacttcg gctacagcac cccctggggg      240

tattttgact tcaacagatt ccactgccac ttttcaccac gtgactggca aagactcatc      300

aacaacaact ggggattccg acccaagaga ctcaacttca agctctttaa cattcaagtc      360

aaagaggtta cggacaacaa tggagtcaag accatcgcca ataaccttac cagcacggtc      420

caggtcttca cggactcaga ctatcagctc ccgtacgtcc tcggctctgc gcaccagggc      480

tgcctccctc cgttcccggc ggacgtgttc atgattccgc agtacggcta cctaacgctc      540

aacgacggca gccaggcagt gggacggtca tccttttact gcctggaata tttcccatcg      600

cagatgctga gaacgggcaa caactttacc ttcagctaca cctttgagga cgttcctttc      660

cacagcagct acgctcacag ccagagtctg gaccgtctca tgaatcctct gattgaccag      720

tacctgtact acttgtctaa gactatcaac ggatccggcc agaatcagca gactctgaag      780

ttcagccaag gtgggcctaa tacaatggcc aatcaggcaa agaactggct gccaggaccc      840

tgttaccgcc aacaacgcgt ctcaacgaca accgggcaaa acaacaatag caactttgcc      900

tggactgctg ggaccaaata ccatctgaat ggaagaaatt cattgatgaa tcctggcccc      960

gctatggcat cccacaaaga gggcgaggac cgtttttttc ccctgtccgg gtccctgatt     1020

tttggcaaac aaaatgctgc cagagacaat gcggattaca gcgatgtcat gctcaccagc     1080

gaggaagaaa tcaaaaccac taaccctgtg gctacagagg aatacggtat cgtggcagat     1140

aacttgcagc agcaaaacac ggctcctcaa attggaactg tcaacagcca gggggcctta     1200

cccggtatgg tctggcagaa ccgggacgtg tacctgcagg gtcccatctg ggccaagatt     1260

cctcacacgg acggcaactt ccacccgtct ccgctgatgg gcggctttgg cctgaaacat     1320

cctccgcctc agatcctgat caagaacacg cctgtacctg cggatcctcc gaccaccttc     1380

aaccagtcaa agctgaactc tttcatcacg caatacagca ccggacaggt cagcgtggaa     1440

attgaatggg agctgcagaa ggaaaacagc aagcgctgga accccgagat ccagtacacc     1500

tccaactact acaaatctac aagtgtggac tttgctgtta atacagaagg cgtgtactct     1560

gaaccccacc ccattggcac ccgttacctc acccgtcccc tgtaa                     1605


<210>  84
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214AB VP1

<400>  84

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu 
545                 550                 555                 560 


Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  85
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214AB VP2

<400>  85

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ser Gly Ile Gly Lys Thr Gly Gln Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ser Thr Ser 
        115                 120                 125             


Gly Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
                405                 410                 415     


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  86
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214AB VP3

<400>  86

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ser Thr Ser Gly 
    50                  55                  60                  


Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
65                  70                  75                  80  


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
                85                  90                  95      


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
            100                 105                 110         


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
        115                 120                 125             


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
    130                 135                 140                 


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
145                 150                 155                 160 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
                165                 170                 175     


Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
            180                 185                 190         


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
        195                 200                 205             


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
    210                 215                 220                 


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
225                 230                 235                 240 


Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln 
                245                 250                 255     


Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln 
            260                 265                 270         


Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly 
    290                 295                 300                 


Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
                325                 330                 335     


Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp 
            340                 345                 350         


Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln 
    370                 375                 380                 


Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys 
    450                 455                 460                 


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala 
            500                 505                 510         


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Pro Leu 
    530                 


<210>  87
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214AB VR-1 amino acid

<400>  87

Ser Ser Thr Ser Gly Gly Ser Ser 
1               5               


<210>  88
<211>  6719
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pA-CF1

<400>  88
tcctgcaggc agctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt       60

cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc      120

aactccatca ctaggggttc ctgcggccgc atggaggcgg tactatgtag atgagaattc      180

aggagcaaac tgggaaaagc aactgcttcc aaatatttgt gatttttaca gtgtagtttt      240

ggaaaaactc ttagcctacc aattcttcta agtgttttaa aatgtgggag ccagtacaca      300

tgaagttata gagtgtttta atgaggctta aatatttacc gtaactatga aatgctacgc      360

atatcatgct gttcaggctc cgtggccacg caactcatac cggtagtact cgccaccatg      420

cagagaagcc ccctggagaa ggcctctgtg gtgagcaagc tgttcttcag ctggaccaga      480

cccatcctga gaaagggcta cagacagaga ctggagctgt ctgacatcta ccagatcccc      540

tctgtggact ctgctgacaa cctgtctgag aagctggaga gagagtggga cagagagctg      600

gccagcaaga agaaccccaa gctgatcaat gccctgagaa gatgcttctt ctggagattc      660

atgttctatg gcatcttcct gtacctgggg gaggtgacca aggctgtgca gcccctgctg      720

ctgggcagaa tcattgccag ctatgaccct gacaacaagg aggagagaag cattgccatc      780

tacctgggca ttggcctgtg cctgctgttc attgtgagaa ccctgctgct gcaccctgcc      840

atctttggcc tgcaccacat tggcatgcag atgagaattg ccatgttcag cctgatctac      900

aagaagaccc tgaagctgag cagcagagtg ctggacaaga tcagcattgg ccagctggtg      960

agcctgctga gcaacaacct gaacaagttt gatgagggcc tggccctggc ccactttgtg     1020

tggattgccc ccctgcaggt ggccctgctg atgggcctga tctgggagct gctgcaggcc     1080

tctgccttct gtggcctggg cttcctgatt gtgctggccc tgttccaggc tggcctgggc     1140

agaatgatga tgaagtacag agaccagaga gctggcaaga tctctgagag actggtgatc     1200

acctctgaga tgattgagaa catccagtct gtgaaggcct actgctggga ggaggccatg     1260

gagaagatga ttgagaacct gagacagaca gagctgaagc tgaccagaaa ggctgcctat     1320

gtgagatact tcaacagctc tgccttcttc ttctctggct tctttgtggt gttcctgtct     1380

gtgctgccct atgccctgat caagggcatc atcctgagaa agatcttcac caccatcagc     1440

ttctgcattg tgctgagaat ggctgtgacc agacagttcc cctgggctgt gcagacctgg     1500

tatgacagcc tgggggccat caacaagatc caggacttcc tgcagaagca ggagtacaag     1560

accctggagt acaacctgac caccacagag gtggtgatgg agaatgtgac agccttctgg     1620

gaggagggct ttggggagct gtttgagaag gccaagcaga acaacaacaa cagaaagacc     1680

agcaatgggg atgacagcct gttcttcagc aacttcagcc tgctgggcac ccctgtgctg     1740

aaggacatca acttcaagat tgagagaggc cagctgctgg ctgtggctgg cagcacaggg     1800

gctggcaaga ccagcctgct gatgatgatc atgggggagc tggagccctc tgagggcaag     1860

atcaagcact ctggcagaat cagcttctgc agccagttca gctggatcat gcctggcacc     1920

atcaaggaga acatcatctt tggggtgagc tatgatgagt acagatacag atctgtgatc     1980

aaggcctgcc agctggagga ggacatcagc aagtttgctg agaaggacaa cattgtgctg     2040

ggggaggggg gcatcaccct gtctgggggc cagagagcca gaatcagcct ggccagagct     2100

gtgtacaagg atgctgacct gtacctgctg gacagcccct ttggctacct ggatgtgctg     2160

acagagaagg agatctttga gagctgtgtg tgcaagctga tggccaacaa gaccagaatc     2220

ctggtgacca gcaagatgga gcacctgaag aaggctgaca agatcctgat cctgcatgag     2280

ggcagcagct acttctatgg caccttctct gagctgcaga acctgcagcc tgacttcagc     2340

agcaagctga tgggctgtga cagctttgac cagttctctg ctgagagaag aaacagcatc     2400

ctgacagaga ccctgcacag attcagcctg gagggggatg cccctgtgag ctggacagag     2460

accaagaagc agagcttcaa gcagacaggg gagtttgggg agaagagaaa gaacagcatc     2520

ctgaacccca tcaacagcac cctgcaggcc agaagaagac agtctgtgct gaacctgatg     2580

acccactctg tgaaccaggg ccagaacatc cacagaaaga ccacagccag caccagaaag     2640

gtgagcctgg ccccccaggc caacctgaca gagctggaca tctacagcag aagactgagc     2700

caggagacag gcctggagat ctctgaggag atcaatgagg aggacctgaa ggagtgcttc     2760

tttgatgaca tggagagcat ccctgctgtg accacctgga acacctacct gagatacatc     2820

acagtgcaca agagcctgat ctttgtgctg atctggtgcc tggtgatctt cctggctgag     2880

gtggctgcca gcctggtggt gctgtggctg ctgggcaaca cccccctgca ggacaagggc     2940

aacagcaccc acagcagaaa caacagctat gctgtgatca tcaccagcac cagcagctac     3000

tatgtgttct acatctatgt gggggtggct gacaccctgc tggccatggg cttcttcaga     3060

ggcctgcccc tggtgcacac cctgatcaca gtgagcaaga tcctgcacca caagatgctg     3120

cactctgtgc tgcaggcccc catgagcacc ctgaacaccc tgaaggctgg gggcatcctg     3180

aacagattca gcaaggacat tgccatcctg gatgacctgc tgcccctgac catctttgac     3240

ttcatccagc tgctgctgat tgtgattggg gccattgctg tggtggctgt gctgcagccc     3300

tacatctttg tggccacagt gcctgtgatt gtggccttca tcatgctgag agcctacttc     3360

ctgcagacca gccagcagct gaagcagctg gagtctgagg gcagaagccc catcttcacc     3420

cacctggtga ccagcctgaa gggcctgtgg accctgagag cctttggcag acagccctac     3480

tttgagaccc tgttccacaa ggccctgaac ctgcacacag ccaactggtt cctgtacctg     3540

agcaccctga gatggttcca gatgagaatt gagatgatct ttgtgatctt cttcattgct     3600

gtgaccttca tcagcatcct gaccacaggg gagggggagg gcagagtggg catcatcctg     3660

accctggcca tgaacatcat gagcaccctg cagtgggctg tgaacagcag cattgatgtg     3720

gacagcctga tgagatctgt gagcagagtg ttcaagttca ttgacatgcc cacagagggc     3780

aagcccacca agagcaccaa gccctacaag aatggccagc tgagcaaggt gatgatcatt     3840

gagaacagcc atgtgaagaa ggatgacatc tggccctctg ggggccagat gacagtgaag     3900

gacctgacag ccaagtacac agaggggggc aatgccatcc tggagaacat cagcttcagc     3960

atcagccctg gccagagagt gggcctgctg ggcagaacag gctctggcaa gagcaccctg     4020

ctgtctgcct tcctgagact gctgaacaca gagggggaga tccagattga tggggtgagc     4080

tgggacagca tcaccctgca gcagtggaga aaggcctttg gggtgatccc ccagaaggtg     4140

ttcatcttct ctggcacctt cagaaagaac ctggacccct atgagcagtg gtctgaccag     4200

gagatctgga aggtggctga tgaggtgggc ctgagatctg tgattgagca gttccctggc     4260

aagctggact ttgtgctggt ggatgggggc tgtgtgctga gccatggcca caagcagctg     4320

atgtgcctgg ccagatctgt gctgagcaag gccaagatcc tgctgctgga tgagccctct     4380

gcccacctgg accctgtgac ctaccagatc atcagaagaa ccctgaagca ggcctttgct     4440

gactgcacag tgatcctgtg tgagcacaga attgaggcca tgctggagtg ccagcagttc     4500

ctggtgattg aggagaacaa ggtgagacag tatgacagca tccagaagct gctgaatgag     4560

agaagcctgt tcagacaggc catcagcccc tctgacagag tgaagctgtt cccccacaga     4620

aacagcagca agtgcaagag caagccccag attgctgccc tgaaggagga gaccgaggag     4680

gaggtgcagg acaccagact gtaaataaaa tacgaaatgg atctgaggaa cccctagtga     4740

tggagttggc cactccctct ctgcgcgctc gctcgctcac tgaggccggg cgaccaaagg     4800

tcgcccgacg cccgggcttt gcccgggcgg cctcagtgag cgagcgagcg cgcagagagg     4860

gagtggccaa ttaattaagg cgatgaacgg taatcgtaaa actagcatgt caatcatatg     4920

taccccggtt gataatcaga aaagccccaa aaacaggaag attgtataag cattaattaa     4980

tttaaataca tggacatgtc agaattggtt aattggttgt aacactgacc cctatttgtt     5040

tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc     5100

ttcaataata ttgaaaaagg aagaatatga gccatattca acgggaaacg tcgaggccgc     5160

gattaaattc caacatggat gctgatttat atgggtataa atgggctcgc gataatgtcg     5220

ggcaatcagg tgcgacaatc tatcgcttgt atgggaagcc cgatgcgcca gagttgtttc     5280

tgaaacatgg caaaggtagc gttgccaatg atgttacaga tgagatggtc agactaaact     5340

ggctgacgga atttatgcca cttccgacca tcaagcattt tatccgtact cctgatgatg     5400

catggttact caccactgcg atccccggaa aaacagcgtt ccaggtatta gaagaatatc     5460

ctgattcagg tgaaaatatt gttgatgcgc tggcagtgtt cctgcgccgg ttgcactcga     5520

ttcctgtttg taattgtcct tttaacagcg atcgcgtatt tcgcctcgct caggcgcaat     5580

cacgaatgaa taacggtttg gttgatgcga gtgattttga tgacgagcgt aatggctggc     5640

ctgttgaaca agtctggaaa gaaatgcata aacttttgcc attctcaccg gattcagtcg     5700

tcactcatgg tgatttctca cttgataacc ttatttttga cgaggggaaa ttaataggtt     5760

gtattgatgt tggacgagtc ggaatcgcag accgatacca ggatcttgcc atcctatgga     5820

actgcctcgg tgagttttct ccttcattac agaaacggct ttttcaaaaa tatggtattg     5880

ataatcctga tatgaataaa ttgcagtttc atttgatgct cgatgagttt ttctaaaagc     5940

agagcattac gctgacttga cgggacggcg caagctcatg accaaaatcc cttaacgtga     6000

gttacgcgcg cgtcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct     6060

tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca     6120

gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc     6180

agcagagcgc agataccaaa tactgttctt ctagtgtagc cgtagttagc ccaccacttc     6240

aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct     6300

gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag     6360

gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc     6420

tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg     6480

agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag     6540

cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt     6600

gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac     6660

gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt taaaccatg      6719


<210>  89
<211>  6751
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pA-CF3

<400>  89
tcctgcaggc agctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt       60

cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc      120

aactccatca ctaggggttc ctgcggccgc atggaggcgg tactatgtag atgagaattc      180

aggagcaaac tgggaaaagc aactgcttcc aaatatttgt gatttttaca gtgtagtttt      240

ggaaaaactc ttagcctacc aattcttcta agtgttttaa aatgtgggag ccagtacaca      300

tgaagttata gagtgtttta atgaggctta aatatttacc gtaactatga aatgctacgc      360

atatcatgct gttcaggctc cgtggccacg caactcatac cggtagtact cgccaccatg      420

cagagaagcc ccctggagaa ggcctctgtg gtgagcaagc tgttcttcag ctggaccaga      480

cccatcctga gaaagggcta cagacagaga ctggagctgt ctgacatcta ccagatcccc      540

tctgtggact ctgctgacaa cctgtctgag aagctggaga gagagtggga cagagagctg      600

gccagcaaga agaaccccaa gctgatcaat gccctgagaa gatgcttctt ctggagattc      660

atgttctatg gcatcttcct gtacctgggg gaggtgacca aggctgtgca gcccctgctg      720

ctgggcagaa tcattgccag ctatgaccct gacaacaagg aggagagaag cattgccatc      780

tacctgggca ttggcctgtg cctgctgttc attgtgagaa ccctgctgct gcaccctgcc      840

atctttggcc tgcaccacat tggcatgcag atgagaattg ccatgttcag cctgatctac      900

aagaagaccc tgaagctgag cagcagagtg ctggacaaga tcagcattgg ccagctggtg      960

agcctgctga gcaacaacct gaacaagttt gatgagggcc tggccctggc ccactttgtg     1020

tggattgccc ccctgcaggt ggccctgctg atgggcctga tctgggagct gctgcaggcc     1080

tctgccttct gtggcctggg cttcctgatt gtgctggccc tgttccaggc tggcctgggc     1140

agaatgatga tgaagtacag agaccagaga gctggcaaga tctctgagag actggtgatc     1200

acctctgaga tgattgagaa catccagtct gtgaaggcct actgctggga ggaggccatg     1260

gagaagatga ttgagaacct gagacagaca gagctgaagc tgaccagaaa ggctgcctat     1320

gtgagatact tcaacagctc tgccttcttc ttctctggct tctttgtggt gttcctgtct     1380

gtgctgccct atgccctgat caagggcatc atcctgagaa agatcttcac caccatcagc     1440

ttctgcattg tgctgagaat ggctgtgacc agacagttcc cctgggctgt gcagacctgg     1500

tatgacagcc tgggggccat caacaagatc caggacttcc tgcagaagca ggagtacaag     1560

accctggagt acaacctgac caccacagag gtggtgatgg agaatgtgac agccttctgg     1620

gaggagggct ttggggagct gtttgagaag gccaagcaga acaacaacaa cagaaagacc     1680

agcaatgggg atgacagcct gttcttcagc aacttcagcc tgctgggcac ccctgtgctg     1740

aaggacatca acttcaagat tgagagaggc cagctgctgg ctgtggctgg cagcacaggg     1800

gctggcaaga ccagcctgct gatgatgatc atgggggagc tggagccctc tgagggcaag     1860

atcaagcact ctggcagaat cagcttctgc agccagttca gctggatcat gcctggcacc     1920

atcaaggaga acatcatctt tggggtgagc tatgatgagt acagatacag atctgtgatc     1980

aaggcctgcc agctggagga ggacatcagc aagtttgctg agaaggacaa cattgtgctg     2040

ggggaggggg gcatcaccct gtctgggggc cagagagcca gaatcagcct ggccagagct     2100

gtgtacaagg atgctgacct gtacctgctg gacagcccct ttggctacct ggatgtgctg     2160

acagagaagg agatctttga gagctgtgtg tgcaagctga tggccaacaa gaccagaatc     2220

ctggtgacca gcaagatgga gcacctgaag aaggctgaca agatcctgat cctgcatgag     2280

ggcagcagct acttctatgg caccttctct gagctgcaga acctgcagcc tgacttcagc     2340

agcaagctga tgggctgtga cagctttgac cagttctctg ctgagagaag aaacagcatc     2400

ctgacagaga ccctgcacag attcagcctg gagggggatg cccctgtgag ctggacagag     2460

accaagaagc agagcttcaa gcagacaggg gagtttgggg agaagagaaa gaacagcatc     2520

ctgaacccca tcaacagcac cctgcaggcc agaagaagac agtctgtgct gaacctgatg     2580

acccactctg tgaaccaggg ccagaacatc cacagaaaga ccacagccag caccagaaag     2640

gtgagcctgg ccccccaggc caacctgaca gagctggaca tctacagcag aagactgagc     2700

caggagacag gcctggagat ctctgaggag atcaatgagg aggacctgaa ggagtgcttc     2760

tttgatgaca tggagagcat ccctgctgtg accacctgga acacctacct gagatacatc     2820

acagtgcaca agagcctgat ctttgtgctg atctggtgcc tggtgatctt cctggctgag     2880

gtggctgcca gcctggtggt gctgtggctg ctgggcaaca cccccctgca ggacaagggc     2940

aacagcaccc acagcagaaa caacagctat gctgtgatca tcaccagcac cagcagctac     3000

tatgtgttct acatctatgt gggggtggct gacaccctgc tggccatggg cttcttcaga     3060

ggcctgcccc tggtgcacac cctgatcaca gtgagcaaga tcctgcacca caagatgctg     3120

cactctgtgc tgcaggcccc catgagcacc ctgaacaccc tgaaggctgg gggcatcctg     3180

aacagattca gcaaggacat tgccatcctg gatgacctgc tgcccctgac catctttgac     3240

ttcatccagc tgctgctgat tgtgattggg gccattgctg tggtggctgt gctgcagccc     3300

tacatctttg tggccacagt gcctgtgatt gtggccttca tcatgctgag agcctacttc     3360

ctgcagacca gccagcagct gaagcagctg gagtctgagg gcagaagccc catcttcacc     3420

cacctggtga ccagcctgaa gggcctgtgg accctgagag cctttggcag acagccctac     3480

tttgagaccc tgttccacaa ggccctgaac ctgcacacag ccaactggtt cctgtacctg     3540

agcaccctga gatggttcca gatgagaatt gagatgatct ttgtgatctt cttcattgct     3600

gtgaccttca tcagcatcct gaccacaggg gagggggagg gcagagtggg catcatcctg     3660

accctggcca tgaacatcat gagcaccctg cagtgggctg tgaacagcag cattgatgtg     3720

gacagcctga tgagatctgt gagcagagtg ttcaagttca ttgacatgcc cacagagggc     3780

aagcccacca agagcaccaa gccctacaag aatggccagc tgagcaaggt gatgatcatt     3840

gagaacagcc atgtgaagaa ggatgacatc tggccctctg ggggccagat gacagtgaag     3900

gacctgacag ccaagtacac agaggggggc aatgccatcc tggagaacat cagcttcagc     3960

atcagccctg gccagagagt gggcctgctg ggcagaacag gctctggcaa gagcaccctg     4020

ctgtctgcct tcctgagact gctgaacaca gagggggaga tccagattga tggggtgagc     4080

tgggacagca tcaccctgca gcagtggaga aaggcctttg gggtgatccc ccagaaggtg     4140

ttcatcttct ctggcacctt cagaaagaac ctggacccct atgagcagtg gtctgaccag     4200

gagatctgga aggtggctga tgaggtgggc ctgagatctg tgattgagca gttccctggc     4260

aagctggact ttgtgctggt ggatgggggc tgtgtgctga gccatggcca caagcagctg     4320

atgtgcctgg ccagatctgt gctgagcaag gccaagatcc tgctgctgga tgagccctct     4380

gcccacctgg accctgtgac ctaccagatc atcagaagaa ccctgaagca ggcctttgct     4440

gactgcacag tgatcctgtg tgagcacaga attgaggcca tgctggagtg ccagcagttc     4500

ctggtgattg aggagaacaa ggtgagacag tatgacagca tccagaagct gctgaatgag     4560

agaagcctgt tcagacaggc catcagcccc tctgacagag tgaagctgtt cccccacaga     4620

aacagcagca agtgcaagag caagccccag attgctgccc tgaaggagga gaccgaggag     4680

gaggtgcagg acaccagact gtaaataaat atctttattt tcattacatc tgtgtgttgg     4740

ttttttgtgt ggatctgagg aacccctagt gatggagttg gccactccct ctctgcgcgc     4800

tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga cgcccgggct ttgcccgggc     4860

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aattaattaa ggcgatgaac     4920

ggtaatcgta aaactagcat gtcaatcata tgtaccccgg ttgataatca gaaaagcccc     4980

aaaaacagga agattgtata agcattaatt aatttaaata catggacatg tcagaattgg     5040

ttaattggtt gtaacactga cccctatttg tttatttttc taaatacatt caaatatgta     5100

tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagaatat     5160

gagccatatt caacgggaaa cgtcgaggcc gcgattaaat tccaacatgg atgctgattt     5220

atatgggtat aaatgggctc gcgataatgt cgggcaatca ggtgcgacaa tctatcgctt     5280

gtatgggaag cccgatgcgc cagagttgtt tctgaaacat ggcaaaggta gcgttgccaa     5340

tgatgttaca gatgagatgg tcagactaaa ctggctgacg gaatttatgc cacttccgac     5400

catcaagcat tttatccgta ctcctgatga tgcatggtta ctcaccactg cgatccccgg     5460

aaaaacagcg ttccaggtat tagaagaata tcctgattca ggtgaaaata ttgttgatgc     5520

gctggcagtg ttcctgcgcc ggttgcactc gattcctgtt tgtaattgtc cttttaacag     5580

cgatcgcgta tttcgcctcg ctcaggcgca atcacgaatg aataacggtt tggttgatgc     5640

gagtgatttt gatgacgagc gtaatggctg gcctgttgaa caagtctgga aagaaatgca     5700

taaacttttg ccattctcac cggattcagt cgtcactcat ggtgatttct cacttgataa     5760

ccttattttt gacgagggga aattaatagg ttgtattgat gttggacgag tcggaatcgc     5820

agaccgatac caggatcttg ccatcctatg gaactgcctc ggtgagtttt ctccttcatt     5880

acagaaacgg ctttttcaaa aatatggtat tgataatcct gatatgaata aattgcagtt     5940

tcatttgatg ctcgatgagt ttttctaaaa gcagagcatt acgctgactt gacgggacgg     6000

cgcaagctca tgaccaaaat cccttaacgt gagttacgcg cgcgtcgttc cactgagcgt     6060

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct     6120

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc     6180

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc     6240

ttctagtgta gccgtagtta gcccaccact tcaagaactc tgtagcaccg cctacatacc     6300

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg     6360

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt     6420

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg     6480

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg     6540

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt     6600

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag     6660

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt     6720

gctggccttt tgctcacatg tttaaaccat g                                    6751


<210>  90
<211>  6603
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pA-CF5

<400>  90
tcctgcaggc agctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt       60

cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc      120

aactccatca ctaggggttc ctgcggccgc aatatttgca tgtcgctatg tgttctggga      180

aatcaccata aacgtgaaat gtctttggat ttgggaatct tcgaagttct gtatgagacc      240

acagatctcc accggtagta ctcgccacca tgcagagaag ccccctggag aaggcctctg      300

tggtgagcaa gctgttcttc agctggacca gacccatcct gagaaagggc tacagacaga      360

gactggagct gtctgacatc taccagatcc cctctgtgga ctctgctgac aacctgtctg      420

agaagctgga gagagagtgg gacagagagc tggccagcaa gaagaacccc aagctgatca      480

atgccctgag aagatgcttc ttctggagat tcatgttcta tggcatcttc ctgtacctgg      540

gggaggtgac caaggctgtg cagcccctgc tgctgggcag aatcattgcc agctatgacc      600

ctgacaacaa ggaggagaga agcattgcca tctacctggg cattggcctg tgcctgctgt      660

tcattgtgag aaccctgctg ctgcaccctg ccatctttgg cctgcaccac attggcatgc      720

agatgagaat tgccatgttc agcctgatct acaagaagac cctgaagctg agcagcagag      780

tgctggacaa gatcagcatt ggccagctgg tgagcctgct gagcaacaac ctgaacaagt      840

ttgatgaggg cctggccctg gcccactttg tgtggattgc ccccctgcag gtggccctgc      900

tgatgggcct gatctgggag ctgctgcagg cctctgcctt ctgtggcctg ggcttcctga      960

ttgtgctggc cctgttccag gctggcctgg gcagaatgat gatgaagtac agagaccaga     1020

gagctggcaa gatctctgag agactggtga tcacctctga gatgattgag aacatccagt     1080

ctgtgaaggc ctactgctgg gaggaggcca tggagaagat gattgagaac ctgagacaga     1140

cagagctgaa gctgaccaga aaggctgcct atgtgagata cttcaacagc tctgccttct     1200

tcttctctgg cttctttgtg gtgttcctgt ctgtgctgcc ctatgccctg atcaagggca     1260

tcatcctgag aaagatcttc accaccatca gcttctgcat tgtgctgaga atggctgtga     1320

ccagacagtt cccctgggct gtgcagacct ggtatgacag cctgggggcc atcaacaaga     1380

tccaggactt cctgcagaag caggagtaca agaccctgga gtacaacctg accaccacag     1440

aggtggtgat ggagaatgtg acagccttct gggaggaggg ctttggggag ctgtttgaga     1500

aggccaagca gaacaacaac aacagaaaga ccagcaatgg ggatgacagc ctgttcttca     1560

gcaacttcag cctgctgggc acccctgtgc tgaaggacat caacttcaag attgagagag     1620

gccagctgct ggctgtggct ggcagcacag gggctggcaa gaccagcctg ctgatgatga     1680

tcatggggga gctggagccc tctgagggca agatcaagca ctctggcaga atcagcttct     1740

gcagccagtt cagctggatc atgcctggca ccatcaagga gaacatcatc tttggggtga     1800

gctatgatga gtacagatac agatctgtga tcaaggcctg ccagctggag gaggacatca     1860

gcaagtttgc tgagaaggac aacattgtgc tgggggaggg gggcatcacc ctgtctgggg     1920

gccagagagc cagaatcagc ctggccagag ctgtgtacaa ggatgctgac ctgtacctgc     1980

tggacagccc ctttggctac ctggatgtgc tgacagagaa ggagatcttt gagagctgtg     2040

tgtgcaagct gatggccaac aagaccagaa tcctggtgac cagcaagatg gagcacctga     2100

agaaggctga caagatcctg atcctgcatg agggcagcag ctacttctat ggcaccttct     2160

ctgagctgca gaacctgcag cctgacttca gcagcaagct gatgggctgt gacagctttg     2220

accagttctc tgctgagaga agaaacagca tcctgacaga gaccctgcac agattcagcc     2280

tggaggggga tgcccctgtg agctggacag agaccaagaa gcagagcttc aagcagacag     2340

gggagtttgg ggagaagaga aagaacagca tcctgaaccc catcaacagc accctgcagg     2400

ccagaagaag acagtctgtg ctgaacctga tgacccactc tgtgaaccag ggccagaaca     2460

tccacagaaa gaccacagcc agcaccagaa aggtgagcct ggccccccag gccaacctga     2520

cagagctgga catctacagc agaagactga gccaggagac aggcctggag atctctgagg     2580

agatcaatga ggaggacctg aaggagtgct tctttgatga catggagagc atccctgctg     2640

tgaccacctg gaacacctac ctgagataca tcacagtgca caagagcctg atctttgtgc     2700

tgatctggtg cctggtgatc ttcctggctg aggtggctgc cagcctggtg gtgctgtggc     2760

tgctgggcaa cacccccctg caggacaagg gcaacagcac ccacagcaga aacaacagct     2820

atgctgtgat catcaccagc accagcagct actatgtgtt ctacatctat gtgggggtgg     2880

ctgacaccct gctggccatg ggcttcttca gaggcctgcc cctggtgcac accctgatca     2940

cagtgagcaa gatcctgcac cacaagatgc tgcactctgt gctgcaggcc cccatgagca     3000

ccctgaacac cctgaaggct gggggcatcc tgaacagatt cagcaaggac attgccatcc     3060

tggatgacct gctgcccctg accatctttg acttcatcca gctgctgctg attgtgattg     3120

gggccattgc tgtggtggct gtgctgcagc cctacatctt tgtggccaca gtgcctgtga     3180

ttgtggcctt catcatgctg agagcctact tcctgcagac cagccagcag ctgaagcagc     3240

tggagtctga gggcagaagc cccatcttca cccacctggt gaccagcctg aagggcctgt     3300

ggaccctgag agcctttggc agacagccct actttgagac cctgttccac aaggccctga     3360

acctgcacac agccaactgg ttcctgtacc tgagcaccct gagatggttc cagatgagaa     3420

ttgagatgat ctttgtgatc ttcttcattg ctgtgacctt catcagcatc ctgaccacag     3480

gggaggggga gggcagagtg ggcatcatcc tgaccctggc catgaacatc atgagcaccc     3540

tgcagtgggc tgtgaacagc agcattgatg tggacagcct gatgagatct gtgagcagag     3600

tgttcaagtt cattgacatg cccacagagg gcaagcccac caagagcacc aagccctaca     3660

agaatggcca gctgagcaag gtgatgatca ttgagaacag ccatgtgaag aaggatgaca     3720

tctggccctc tgggggccag atgacagtga aggacctgac agccaagtac acagaggggg     3780

gcaatgccat cctggagaac atcagcttca gcatcagccc tggccagaga gtgggcctgc     3840

tgggcagaac aggctctggc aagagcaccc tgctgtctgc cttcctgaga ctgctgaaca     3900

cagaggggga gatccagatt gatggggtga gctgggacag catcaccctg cagcagtgga     3960

gaaaggcctt tggggtgatc ccccagaagg tgttcatctt ctctggcacc ttcagaaaga     4020

acctggaccc ctatgagcag tggtctgacc aggagatctg gaaggtggct gatgaggtgg     4080

gcctgagatc tgtgattgag cagttccctg gcaagctgga ctttgtgctg gtggatgggg     4140

gctgtgtgct gagccatggc cacaagcagc tgatgtgcct ggccagatct gtgctgagca     4200

aggccaagat cctgctgctg gatgagccct ctgcccacct ggaccctgtg acctaccaga     4260

tcatcagaag aaccctgaag caggcctttg ctgactgcac agtgatcctg tgtgagcaca     4320

gaattgaggc catgctggag tgccagcagt tcctggtgat tgaggagaac aaggtgagac     4380

agtatgacag catccagaag ctgctgaatg agagaagcct gttcagacag gccatcagcc     4440

cctctgacag agtgaagctg ttcccccaca gaaacagcag caagtgcaag agcaagcccc     4500

agattgctgc cctgaaggag gagaccgagg aggaggtgca ggacaccaga ctgtaaataa     4560

atatctttat tttcattaca tctgtgtgtt ggttttttgt gtggatctga ggaaccccta     4620

gtgatggagt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca     4680

aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg agcgcgcaga     4740

gagggagtgg ccaattaatt aaggcgatga acggtaatcg taaaactagc atgtcaatca     4800

tatgtacccc ggttgataat cagaaaagcc ccaaaaacag gaagattgta taagcattaa     4860

ttaatttaaa tacatggaca tgtcagaatt ggttaattgg ttgtaacact gacccctatt     4920

tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa     4980

atgcttcaat aatattgaaa aaggaagaat atgagccata ttcaacggga aacgtcgagg     5040

ccgcgattaa attccaacat ggatgctgat ttatatgggt ataaatgggc tcgcgataat     5100

gtcgggcaat caggtgcgac aatctatcgc ttgtatggga agcccgatgc gccagagttg     5160

tttctgaaac atggcaaagg tagcgttgcc aatgatgtta cagatgagat ggtcagacta     5220

aactggctga cggaatttat gccacttccg accatcaagc attttatccg tactcctgat     5280

gatgcatggt tactcaccac tgcgatcccc ggaaaaacag cgttccaggt attagaagaa     5340

tatcctgatt caggtgaaaa tattgttgat gcgctggcag tgttcctgcg ccggttgcac     5400

tcgattcctg tttgtaattg tccttttaac agcgatcgcg tatttcgcct cgctcaggcg     5460

caatcacgaa tgaataacgg tttggttgat gcgagtgatt ttgatgacga gcgtaatggc     5520

tggcctgttg aacaagtctg gaaagaaatg cataaacttt tgccattctc accggattca     5580

gtcgtcactc atggtgattt ctcacttgat aaccttattt ttgacgaggg gaaattaata     5640

ggttgtattg atgttggacg agtcggaatc gcagaccgat accaggatct tgccatccta     5700

tggaactgcc tcggtgagtt ttctccttca ttacagaaac ggctttttca aaaatatggt     5760

attgataatc ctgatatgaa taaattgcag tttcatttga tgctcgatga gtttttctaa     5820

aagcagagca ttacgctgac ttgacgggac ggcgcaagct catgaccaaa atcccttaac     5880

gtgagttacg cgcgcgtcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc     5940

ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct     6000

accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg     6060

cttcagcaga gcgcagatac caaatactgt tcttctagtg tagccgtagt tagcccacca     6120

cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc     6180

tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga     6240

taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac     6300

gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga     6360

agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag     6420

ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg     6480

acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag     6540

caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgtttaaacc     6600

atg                                                                   6603


<210>  91
<211>  7519
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pA-CF7

<400>  91
tcctgcaggc agctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt       60

cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc      120

aactccatca ctaggggttc ctgcggccgc aatatttgca tgtcgctatg tgttctggga      180

aatcaccata aacgtgaaat gtctttggat ttgggaatct tcgaagttct gtatgagacc      240

acagatctcc accggtagta ctcgccacca tgcagagaag ccccctggag aaggcctctg      300

tggtgagcaa gctgttcttc ccccctggag aaggcctctg tggtgagcaa gctgttcttc      360

agctggacca gacccatcct gagaaagggc tacagacaga gactggagct gtctgacatc      420

taccagatcc cctctgtgga ctctgctgac aacctgtctg agaagctgga gagagagtgg      480

gacagagagc tggccagcaa gaagaacccc aagctgatca atgccctgag aagatgcttc      540

ttctggagat tcatgttcta tggcatcttc ctgtacctgg gggaggtgac caaggctgtg      600

cagcccctgc tgctgggcag aatcattgcc agctatgacc cagcccctgc tgctgggcag      660

aatcattgcc agctatgacc ctgacaacaa ggaggagaga agcattgcca tctacctggg      720

cattggcctg tgcctgctgt tcattgtgag aaccctgctg ctgcaccctg ccatctttgg      780

cctgcaccac attggcatgc agatgagaat tgccatgttc agcctgatct acaagaagac      840

cctgaagctg agcagcagag tgctggacaa gatcagcatt ggccagctgg tgagcctgct      900

gagcaacaac ctgaacaagt ttgatgaggg cctggccctg gcccactttg tgtggattgc      960

ttgatgaggg cctggccctg gcccactttg tgtggattgc ccccctgcag gtggccctgc     1020

tgatgggcct gatctgggag ctgctgcagg cctctgcctt ctgtggcctg ggcttcctga     1080

ttgtgctggc cctgttccag gctggcctgg gcagaatgat gatgaagtac agagaccaga     1140

gagctggcaa gatctctgag agactggtga tcacctctga gatgattgag aacatccagt     1200

ctgtgaaggc ctactgctgg gaggaggcca tggagaagat gattgagaac ctgagacaga     1260

cagagctgaa gctgaccaga aaggctgcct atgtgagata cttcaacagc tctgccttct     1320

tcttctctgg cttctttgtg gtgttcctgt ctgtgctgcc tctgccttct tcttctctgg     1380

cttctttgtg gtgttcctgt ctgtgctgcc ctatgccctg atcaagggca tcatcctgag     1440

aaagatcttc accaccatca gcttctgcat tgtgctgaga atggctgtga ccagacagtt     1500

cccctgggct gtgcagacct ggtatgacag cctgggggcc atcaacaaga tccaggactt     1560

cctgcagaag caggagtaca agaccctgga gtacaacctg accaccacag aggtggtgat     1620

ggagaatgtg acagccttct gggaggaggg ctttggggag ctgtttgaga aggccaagca     1680

gaacaacaac aacagaaaga ccagcaatgg ggatgacagc ctgttcttca gcaacttcag     1740

cctgctgggc acccctgtgc ggatgacagc ctgttcttca gcaacttcag cctgctgggc     1800

acccctgtgc tgaaggacat caacttcaag attgagagag gccagctgct ggctgtggct     1860

ggcagcacag gggctggcaa gaccagcctg ctgatgatga tcatggggga gctggagccc     1920

tctgagggca agatcaagca ctctggcaga atcagcttct gcagccagtt cagctggatc     1980

atgcctggca ccatcaagga gaacatcatc tttggggtga gctatgatga gtacagatac     2040

agatctgtga tcaaggcctg ccagctggag gaggacatca agatctgtga tcaaggcctg     2100

ccagctggag gaggacatca gcaagtttgc tgagaaggac aacattgtgc tgggggaggg     2160

gggcatcacc ctgtctgggg gccagagagc cagaatcagc ctggccagag ctgtgtacaa     2220

ggatgctgac ctgtacctgc tggacagccc ctttggctac ctggatgtgc tgacagagaa     2280

ggagatcttt gagagctgtg tgtgcaagct gatggccaac aagaccagaa tcctggtgac     2340

cagcaagatg gagcacctga agaaggctga caagatcctg atcctgcatg agggcagcag     2400

agaaggctga caagatcctg atcctgcatg agggcagcag ctacttctat ggcaccttct     2460

ctgagctgca gaacctgcag cctgacttca gcagcaagct gatgggctgt gacagctttg     2520

accagttctc tgctgagaga agaaacagca tcctgacaga gaccctgcac agattcagcc     2580

tggaggggga tgcccctgtg agctggacag agaccaagaa gcagagcttc aagcagacag     2640

gggagtttgg ggagaagaga aagaacagca tcctgaaccc catcaacagc atcagaaagt     2700

tcagcattgt gcagaagacc catcaacagc atcagaaagt tcagcattgt gcagaagacc     2760

cccctgcaga tgaatggcat tgaggaggac tctgatgagc ccctggagag aagactgagc     2820

ctggtgcctg actctgagca gggggaggcc atcctgccca gaatctctgt gatcagcaca     2880

ggccccaccc tgcaggccag aagaagacag tctgtgctga acctgatgac ccactctgtg     2940

aaccagggcc agaacatcca ccactctgtg aaccagggcc agaacatcca cagaaagacc     3000

acagccagca ccagaaaggt gagcctggcc ccccaggcca acctgacaga gctggacatc     3060

tacagcagaa gactgagcca ggagacaggc ctggagatct ctgaggagat caatgaggag     3120

gacctgaagg agtgcttctt tgatgacatg gagagcatcc ctgctgtgac cacctggaac     3180

acctacctga gatacatcac agtgcacaag agcctgatct ttgtgctgat ctggtgcctg     3240

gtgatcttcc tggctgaggt ggctgccagc ctggtggtgc gtgatcttcc tggctgaggt     3300

ggctgccagc ctggtggtgc tgtggctgct gggcaacacc cccctgcagg acaagggcaa     3360

cagcacccac agcagaaaca acagctatgc tgtgatcatc accagcacca gcagctacta     3420

tgtgttctac atctatgtgg gggtggctga caccctgctg gccatgggct tcttcagagg     3480

cctgcccctg gtgcacaccc tgatcacagt gagcaagatc ctgcaccaca agatgctgca     3540

ctctgtgctg caggccccca tgagcaccct gaacaccctg aaggctgggg gcatcctgaa     3600

tgagcaccct gaacaccctg aaggctgggg gcatcctgaa cagattcagc aaggacattg     3660

ccatcctgga tgacctgctg cccctgacca tctttgactt catccagctg ctgctgattg     3720

tgattggggc cattgctgtg gtggctgtgc tgcagcccta catctttgtg gccacagtgc     3780

ctgtgattgt ggccttcatc atgctgagag cctacttcct gcagaccagc cagcagctga     3840

agcagctgga gtctgagggc agaagcccca tcttcaccca cctggtgacc agcctgaagg     3900

gcctgtggac cctgagagcc cctggtgacc agcctgaagg gcctgtggac cctgagagcc     3960

tttggcagac agccctactt tgagaccctg ttccacaagg ccctgaacct gcacacagcc     4020

aactggttcc tgtacctgag caccctgaga tggttccaga tgagaattga gatgatcttt     4080

gtgatcttct tcattgctgt gaccttcatc agcatcctga ccacagggga gggggagggc     4140

agagtgggca tcatcctgac cctggccatg aacatcatga gcaccctgca gtgggctgtg     4200

aacagcagca ttgatgtgga cagcctgatg agatctgtga gcagagtgtt caagttcatt     4260

gacatgccca cagagggcaa gcccaccaag agcaccaagc cctacaagaa tggccagctg     4320

cagagggcaa gcccaccaag agcaccaagc cctacaagaa tggccagctg agcaaggtga     4380

tgatcattga gaacagccat gtgaagaagg atgacatctg gccctctggg ggccagatga     4440

cagtgaagga cctgacagcc aagtacacag aggggggcaa tgccatcctg gagaacatca     4500

gcttcagcat cagccctggc cagagagtgg gcctgctggg cagaacaggc tctggcaaga     4560

gcaccctgct gtctgccttc ctgagactgc tgaacacaga gggggagatc cagattgatg     4620

gggtgagctg ggacagcatc accctgcagc agtggagaaa ggcctttggg gtgatccccc     4680

agaaggtgtt catcttctct ggcaccttca gaaagaacct gtgatccccc agaaggtgtt     4740

catcttctct ggcaccttca gaaagaacct ggacccctat gagcagtggt ctgaccagga     4800

gatctggaag gtggctgatg aggtgggcct gagatctgtg attgagcagt tccctggcaa     4860

gctggacttt gtgctggtgg atgggggctg tgtgctgagc catggccaca agcagctgat     4920

gtgcctggcc agatctgtgc tgagcaaggc caagatcctg ctgctggatg agccctctgc     4980

ccacctggac cctgtgacct accagatcat cagaagaacc ctgaagcagg cctttgctga     5040

accagatcat cagaagaacc ctgaagcagg cctttgctga ctgcacagtg atcctgtgtg     5100

agcacagaat tgaggccatg ctggagtgcc agcagttcct ggtgattgag gagaacaagg     5160

tgagacagta tgacagcatc cagaagctgc tgaatgagag aagcctgttc agacaggcca     5220

tcagcccctc tgacagagtg aagctgttcc cccacagaaa cagcagcaag tgcaagagca     5280

agccccagat tgctgccctg aaggaggaga ccgaggagga ggtgcaggac accagactgt     5340

aaataaatat ctttattttc attacatctg tgtgttggtt ttttgtgtgg atctgaggaa     5400

cccctagtga tggagttggc cactccctct ctgcgcgctc atctgaggaa cccctagtga     5460

tggagttggc cactccctct ctgcgcgctc gctcgctcac tgaggccggg cgaccaaagg     5520

tcgcccgacg cccgggcttt gcccgggcgg cctcagtgag cgagcgagcg cgcagagagg     5580

gagtggccaa ttaattaagg cgatgaacgg taatcgtaaa actagcatgt caatcatatg     5640

taccccggtt gataatcaga aaagccccaa aaacaggaag attgtataag cattaattaa     5700

tttaaataca tggacatgtc agaattggtt aattggttgt aacactgacc cctatttgtt     5760

tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc     5820

ttcaataata ttgaaaaagg aagaatatga gccatattca acgggaaacg tcgaggccgc     5880

gattaaattc caacatggat gctgatttat atgggtataa atgggctcgc gataatgtcg     5940

ggcaatcagg tgcgacaatc tatcgcttgt atgggaagcc cgatgcgcca gagttgtttc     6000

gataatgtcg ggcaatcagg tgcgacaatc tatcgcttgt atgggaagcc cgatgcgcca     6060

gagttgtttc tgaaacatgg caaaggtagc gttgccaatg atgttacaga tgagatggtc     6120

agactaaact ggctgacgga atttatgcca cttccgacca tcaagcattt tatccgtact     6180

cctgatgatg catggttact caccactgcg atccccggaa aaacagcgtt ccaggtatta     6240

gaagaatatc ctgattcagg tgaaaatatt gttgatgcgc tggcagtgtt cctgcgccgg     6300

ttgcactcga ttcctgtttg taattgtcct tttaacagcg atcgcgtatt tcgcctcgct     6360

caggcgcaat cacgaatgaa taacggtttg gttgatgcga gtgattttga tgacgagcgt     6420

aatggctggc ctgttgaaca agtctggaaa gaaatgcata aacttttgcc attctcaccg     6480

gattcagtcg tcactcatgg tgatttctca cttgataacc ttatttttga cgaggggaaa     6540

ttaataggtt gtattgatgt tggacgagtc ggaatcgcag accgatacca ggatcttgcc     6600

atcctatgga actgcctcgg tgagttttct ccttcattac agaaacggct ttttcaaaaa     6660

tatggtattg ataatcctga tatgaataaa ttgcagtttc atttgatgct cgatgagttt     6720

ttctaaaagc agagcattac gctgacttga cgggacggcg caagctcatg accaaaatcc     6780

cttaacgtga gttacgcgcg cgtcgttcca ctgagcgtca gaccccgtag aaaagatcaa     6840

aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc     6900

accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt     6960

aactggcttc agcagagcgc agataccaaa tactgttctt ctagtgtagc cgtagttagc     7020

ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc     7080

agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt     7140

accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga     7200

ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa     7260

gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa     7320

caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg     7380

ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc     7440

tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg     7500

ctcacatgtt taaaccatg                                                  7519


<210>  92
<211>  11577
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pHELPK plasmid DNA

<400>  92
ggtacccaac tccatgctta acagtcccca ggtacagccc accctgcgtc gcaaccagga       60

acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat      120

taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctaggagaca      180

ctttcaataa aggcaaatgt ttttatttgt acactctcgg gtgattattt accccccacc      240

cttgccgtct gcgccgttta aaaatcaaag gggttctgcc gcgcatcgct atgcgccact      300

ggcagggaca cgttgcgata ctggtgttta gtgctccact taaactcagg cacaaccatc      360

cgcggcagct cggtgaagtt ttcactccac aggctgcgca ccatcaccaa cgcgtttagc      420

aggtcgggcg ccgatatctt gaagtcgcag ttggggcctc cgccctgcgc gcgcgagttg      480

cgatacacag ggttgcagca ctggaacact atcagcgccg ggtggtgcac gctggccagc      540

acgctcttgt cggagatcag atccgcgtcc aggtcctccg cgttgctcag ggcgaacgga      600

gtcaactttg gtagctgcct tcccaaaaag ggtgcatgcc caggctttga gttgcactcg      660

caccgtagtg gcatcagaag gtgaccgtgc ccggtctggg cgttaggata cagcgcctgc      720

atgaaagcct tgatctgctt aaaagccacc tgagcctttg cgccttcaga gaagaacatg      780

ccgcaagact tgccggaaaa ctgattggcc ggacaggccg cgtcatgcac gcagcacctt      840

gcgtcggtgt tggagatctg caccacattt cggccccacc ggttcttcac gatcttggcc      900

ttgctagact gctccttcag cgcgcgctgc ccgttttcgc tcgtcacatc catttcaatc      960

acgtgctcct tatttatcat aatgctcccg tgtagacact taagctcgcc ttcgatctca     1020

gcgcagcggt gcagccacaa cgcgcagccc gtgggctcgt ggtgcttgta ggttacctct     1080

gcaaacgact gcaggtacgc ctgcaggaat cgccccatca tcgtcacaaa ggtcttgttg     1140

ctggtgaagg tcagctgcaa cccgcggtgc tcctcgttta gccaggtctt gcatacggcc     1200

gccagagctt ccacttggtc aggcagtagc ttgaagtttg cctttagatc gttatccacg     1260

tggtacttgt ccatcaacgc gcgcgcagcc tccatgccct tctcccacgc agacacgatc     1320

ggcaggctca gcgggtttat caccgtgctt tcactttccg cttcactgga ctcttccttt     1380

tcctcttgcg tccgcatacc ccgcgccact gggtcgtctt cattcagccg ccgcaccgtg     1440

cgcttacctc ccttgccgtg cttgattagc accggtgggt tgctgaaacc caccatttgt     1500

agcgccacat cttctctttc ttcctcgctg tccacgatca cctctgggga tggcgggcgc     1560

tcgggcttgg gagaggggcg cttctttttc tttttggacg caatggccaa atccgccgtc     1620

gaggtcgatg gccgcgggct gggtgtgcgc ggcaccagcg catcttgtga cgagtcttct     1680

tcgtcctcgg actcgagacg ccgcctcagc cgcttttttg ggggcgcgcg gggaggcggc     1740

ggcgacggcg acggggacga cacgtcctcc atggttggtg gacgtcgcgc cgcaccgcgt     1800

ccgcgctcgg gggtggtttc gcgctgctcc tcttcccgac tggccatttc cttctcctat     1860

aggcagaaaa agatcatgga gtcagtcgag aaggaggaca gcctaaccgc cccctttgag     1920

ttcgccacca ccgcctccac cgatgccgcc aacgcgccta ccaccttccc cgtcgaggca     1980

cccccgcttg aggaggagga agtgattatc gagcaggacc caggttttgt aagcgaagac     2040

gacgaggatc gctcagtacc aacagaggat aaaaagcaag accaggacga cgcagaggca     2100

aacgaggaac aagtcgggcg gggggaccaa aggcatggcg actacctaga tgtgggagac     2160

gacgtgctgt tgaagcatct gcagcgccag tgcgccatta tctgcgacgc gttgcaagag     2220

cgcagcgatg tgcccctcgc catagcggat gtcagccttg cctacgaacg ccacctgttc     2280

tcaccgcgcg taccccccaa acgccaagaa aacggcacat gcgagcccaa cccgcgcctc     2340

aacttctacc ccgtatttgc cgtgccagag gtgcttgcca cctatcacat ctttttccaa     2400

aactgcaaga tacccctatc ctgccgtgcc aaccgcagcc gagcggacaa gcagctggcc     2460

ttgcggcagg gcgctgtcat acctgatatc gcctcgctcg acgaagtgcc aaaaatcttt     2520

gagggtcttg gacgcgacga gaaacgcgcg gcaaacgctc tgcaacaaga aaacagcgaa     2580

aatgaaagtc actgtggagt gctggtggaa cttgagggtg acaacgcgcg cctagccgtg     2640

ctgaaacgca gcatcgaggt cacccacttt gcctacccgg cacttaacct accccccaag     2700

gttatgagca cagtcatgag cgagctgatc gtgcgccgtg cacgacccct ggagagggat     2760

gcaaacttgc aagaacaaac cgaggagggc ctacccgcag ttggcgatga gcagctggcg     2820

cgctggcttg agacgcgcga gcctgccgac ttggaggagc gacgcaagct aatgatggcc     2880

gcagtgcttg ttaccgtgga gcttgagtgc atgcagcggt tctttgctga cccggagatg     2940

cagcgcaagc tagaggaaac gttgcactac acctttcgcc agggctacgt gcgccaggcc     3000

tgcaaaattt ccaacgtgga gctctgcaac ctggtctcct accttggaat tttgcacgaa     3060

aaccgcctcg ggcaaaacgt gcttcattcc acgctcaagg gcgaggcgcg ccgcgactac     3120

gtccgcgact gcgtttactt atttctgtgc tacacctggc aaacggccat gggcgtgtgg     3180

cagcaatgcc tggaggagcg caacctaaag gagctgcaga agctgctaaa gcaaaacttg     3240

aaggacctat ggacggcctt caacgagcgc tccgtggccg cgcacctggc ggacattatc     3300

ttccccgaac gcctgcttaa aaccctgcaa cagggtctgc cagacttcac cagtcaaagc     3360

atgttgcaaa actttaggaa ctttatccta gagcgttcag gaattctgcc cgccacctgc     3420

tgtgcgcttc ctagcgactt tgtgcccatt aagtaccgtg aatgccctcc gccgctttgg     3480

ggtcactgct accttctgca gctagccaac taccttgcct accactccga catcatggaa     3540

gacgtgagcg gtgacggcct actggagtgt cactgtcgct gcaacctatg caccccgcac     3600

cgctccctgg tctgcaattc gcaactgctt agcgaaagtc aaattatcgg tacctttgag     3660

ctgcagggtc cctcgcctga cgaaaagtcc gcggctccgg ggttgaaact cactccgggg     3720

ctgtggacgt cggcttacct tcgcaaattt gtacctgagg actaccacgc ccacgagatt     3780

aggttctacg aagaccaatc ccgcccgcca aatgcggagc ttaccgcctg cgtcattacc     3840

cagggccaca tccttggcca attgcaagcc atcaacaaag cccgccaaga gtttctgcta     3900

cgaaagggac ggggggttta cctggacccc cagtccggcg aggagctcaa cccaatcccc     3960

ccgccgccgc agccctatca gcagccgcgg gcccttgctt cccaggatgg cacccaaaaa     4020

gaagctgcag ctgccgccgc cgccacccac ggacgaggag gaatactggg acagtcaggc     4080

agaggaggtt ttggacgagg aggaggagat gatggaagac tgggacagcc tagacgaagc     4140

ttccgaggcc gaagaggtgt cagacgaaac accgtcaccc tcggtcgcat tcccctcgcc     4200

ggcgccccag aaattggcaa ccgttcccag catcgctaca acctccgctc ctcaggcgcc     4260

gccggcactg cctgttcgcc gacccaaccg tagatgggac accactggaa ccagggccgg     4320

taagtctaag cagccgccgc cgttagccca agagcaacaa cagcgccaag gctaccgctc     4380

gtggcgcggg cacaagaacg ccatagttgc ttgcttgcaa gactgtgggg gcaacatctc     4440

cttcgcccgc cgctttcttc tctaccatca cggcgtggcc ttcccccgta acatcctgca     4500

ttactaccgt catctctaca gcccctactg caccggcggc agcggcagcg gcagcaacag     4560

cagcggtcac acagaagcaa aggcgaccgg atagcaagac tctgacaaag cccaagaaat     4620

ccacagcggc ggcagcagca ggaggaggag cgctgcgtct ggcgcccaac gaacccgtat     4680

cgacccgcga gcttagaaat aggatttttc ccactctgta tgctatattt caacaaagca     4740

ggggccaaga acaagagctg aaaataaaaa acaggtctct gcgctccctc acccgcagct     4800

gcctgtatca caaaagcgaa gatcagcttc ggcgcacgct ggaagacgcg gaggctctct     4860

tcagcaaata ctgcgcgctg actcttaagg actagtttcg cgccctttct caaatttaag     4920

cgcgaaaact acgtcatctc cagcggccac acccggcgcc agcacctgtc gtcagcgcca     4980

ttatgagcaa ggaaattccc acgccctaca tgtggagtta ccagccacaa atgggacttg     5040

cggctggagc tgcccaagac tactcaaccc gaataaacta catgagcgcg ggaccccaca     5100

tgatatcccg ggtcaacgga atccgcgccc accgaaaccg aattctcctc gaacaggcgg     5160

ctattaccac cacacctcgt aataacctta atccccgtag ttggcccgct gccctggtgt     5220

accaggaaag tcccgctccc accactgtgg tacttcccag agacgcccag gccgaagttc     5280

agatgactaa ctcaggggcg cagcttgcgg gcggctttcg tcacagggtg cggtcgcccg     5340

ggcgttttag ggcggagtaa cttgcatgta ttgggaattg tagttttttt aaaatgggaa     5400

gtgacgtatc gtgggaaaac ggaagtgaag atttgaggaa gttgtgggtt ttttggcttt     5460

cgtttctggg cgtaggttcg cgtgcggttt tctgggtgtt ttttgtggac tttaaccgtt     5520

acgtcatttt ttagtcctat atatactcgc tctgtacttg gcccttttta cactgtgact     5580

gattgagctg gtgccgtgtc gagtggtgtt ttttaatagg tttttttact ggtaaggctg     5640

actgttatgg ctgccgctgt ggaagcgctg tatgttgttc tggagcggga gggtgctatt     5700

ttgcctaggc aggagggttt ttcaggtgtt tatgtgtttt tctctcctat taattttgtt     5760

atacctccta tgggggctgt aatgttgtct ctacgcctgc gggtatgtat tcccccgggc     5820

tatttcggtc gctttttagc actgaccgat gttaaccaac ctgatgtgtt taccgagtct     5880

tacattatga ctccggacat gaccgaggaa ctgtcggtgg tgctttttaa tcacggtgac     5940

cagttttttt acggtcacgc cggcatggcc gtagtccgtc ttatgcttat aagggttgtt     6000

tttcctgttg taagacaggc ttctaatgtt taaatgtttt tttttttgtt attttatttt     6060

gtgtttaatg caggaacccg cagacatgtt tgagagaaaa atggtgtctt tttctgtggt     6120

ggttccggaa cttacctgcc tttatctgca tgagcatgac tacgatgtgc ttgctttttt     6180

gcgcgaggct ttgcctgatt ttttgagcag caccttgcat tttatatcgc cgcccatgca     6240

acaagcttac ataggggcta cgctggttag catagctccg agtatgcgtg tcataatcag     6300

tgtgggttct tttgtcatgg ttcctggcgg ggaagtggcc gcgctggtcc gtgcagacct     6360

gcacgattat gttcagctgg ccctgcgaag ggacctacgg gatcgcggta tttttgttaa     6420

tgttccgctt ttgaatctta tacaggtctg tgaggaacct gaatttttgc aatcatgatt     6480

cgctgcttga ggctgaaggt ggagggcgct ctggagcaga tttttacaat ggccggactt     6540

aatattcggg atttgcttag agacatattg ataaggtggc gagatgaaaa ttatttgggc     6600

atggttgaag gtgctggaat gtttatagag gagattcacc ctgaagggtt tagcctttac     6660

gtccacttgg acgtgagggc agtttgcctt ttggaagcca ttgtgcaaca tcttacaaat     6720

gccattatct gttctttggc tgtagagttt gaccacgcca ccggagggga gcgcgttcac     6780

ttaatagatc ttcattttga ggttttggat aatcttttgg aataaaaaaa aaaaaacatg     6840

gttcttccag ctcttcccgc tcctcccgtg tgtgactcgc agaacgaatg tgtaggttgg     6900

ctgggtgtgg cttattctgc ggtggtggat gttatcaggg cagcggcgca tgaaggagtt     6960

tacatagaac ccgaagccag ggggcgcctg gatgctttga gagagtggat atactacaac     7020

tactacacag agcgagctaa gcgacgagac cggagacgca gatctgtttg tcacgcccgc     7080

acctggtttt gcttcaggaa atatgactac gtccggcgtt ccatttggca tgacactacg     7140

accaacacga tctcggttgt ctcggcgcac tccgtacagt agggatcgcc tacctccttt     7200

tgagacagag acccgcgcta ccatactgga ggatcatccg ctgctgcccg aatgtaacac     7260

tttgacaatg cacaacgtga gttacgtgcg aggtcttccc tgcagtgtgg gatttacgct     7320

gattcaggaa tgggttgttc cctgggatat ggttctgacg cgggaggagc ttgtaatcct     7380

gaggaagtgt atgcacgtgt gcctgtgttg tgccaacatt gatatcatga cgagcatgat     7440

gatccatggt tacgagtcct gggctctcca ctgtcattgt tccagtcccg gttccctgca     7500

gtgcatagcc ggcgggcagg ttttggccag ctggtttagg atggtggtgg atggcgccat     7560

gtttaatcag aggtttatat ggtaccggga ggtggtgaat tacaacatgc caaaagaggt     7620

aatgtttatg tccagcgtgt ttatgagggg tcgccactta atctacctgc gcttgtggta     7680

tgatggccac gtgggttctg tggtccccgc catgagcttt ggatacagcg ccttgcactg     7740

tgggattttg aacaatattg tggtgctgtg ctgcagttac tgtgctgatt taagtgagat     7800

cagggtgcgc tgctgtgccc ggaggacaag gcgtctcatg ctgcgggcgg tgcgaatcat     7860

cgctgaggag accactgcca tgttgtattc ctgcaggacg gagcggcggc ggcagcagtt     7920

tattcgcgcg ctgctgcagc accaccgccc tatcctgatg cacgattatg actctacccc     7980

catgtaggcg tggacttccc cttcgccgcc cgttgagcaa ccgcaagttg gacagcagcc     8040

tgtggctcag cagctggaca gcgacatgaa cttaagcgag ctgcccgggg agtttattaa     8100

tatcactgat gagcgtttgg ctcgacagga aaccgtgtgg aatataacac ctaagaatat     8160

gtctgttacc catgatatga tgctttttaa ggccagccgg ggagaaagga ctgtgtactc     8220

tgtgtgttgg gagggaggtg gcaggttgaa tactagggtt ctgtgagttt gattaaggta     8280

cggtgatcaa tataagctat gtggtggtgg ggctatacta ctgaatgaaa aatgacttga     8340

aattttctgc aattgaaaaa taaacacgtt gaaacataac atgcaacagg ttcacgattc     8400

tttattcctg ggcaatgtag gagaaggtgt aagagttggt agcaaaagtt tcagtggtgt     8460

attttccact ttcccaggac catgtaaaag acatagagta agtgcttacc tcgctagttt     8520

ctgtggattc actagaatcg atgtaggatg ttgcccctcc tgacgcggta ggagaagggg     8580

agggtgccct gcatgtctgc cgctgctctt gctcttgccg ctgctgagga ggggggcgca     8640

tctgccgcag caccggatgc atctgggaaa agcaaaaaag gggctcgtcc ctgtttccgg     8700

aggaatttgc aagcggggtc ttgcatgacg gggaggcaaa cccccgttcg ccgcagtccg     8760

gccggcccga gactcgaacc gggggtcctg cgactcaacc cttggaaaat aaccctccgg     8820

ctacagggag cgagccactt aatgctttcg ctttccagcc taaccgctta cgccgcgcgc     8880

ggccagtggc caaaaaagct agcgcagcag ccgccgcgcc tggaaggaag ccaaaaggag     8940

cgctcccccg ttgtctgacg tcgcacacct gggttcgaca cgcgggcggt aaccgcatgg     9000

atcacggcgg acggccggat ccggggttcg aaccccggtc gtccgccatg atacccttgc     9060

gaatttatcc accagaccac ggaagagtgc ccgcttacag gctctccttt tgcacggtct     9120

agagcgtcaa cgactgcgca cgcctcaccg gccagagcgt cccgaccatg gagcactttt     9180

tgccgctgcg caacatctgg aaccgcgtcc gcgactttcc gcgcgcctcc accaccgccg     9240

ccggcatcac ctggatgtcc aggtacatct acggattacg tcgacgttta aaccatatga     9300

tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag     9360

aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg     9420

tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg     9480

tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg     9540

cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga     9600

agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc     9660

tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt     9720

aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact     9780

ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg     9840

cctaactacg gctacactag aagaacagta tttggtatct gcgctctgct gaagccagtt     9900

accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt     9960

ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct    10020

ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg    10080

gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt    10140

aaatcaatct aaagtatata tgagtaaact tggtctgaca gttagaaaaa ctcatcgagc    10200

atcaaatgaa actgcaattt attcatatca ggattatcaa taccatattt ttgaaaaagc    10260

cgtttctgta atgaaggaga aaactcaccg aggcagttcc ataggatggc aagatcctgg    10320

tatcggtctg cgattccgac tcgtccaaca tcaatacaac ctattaattt cccctcgtca    10380

aaaataaggt tatcaagtga gaaatcacca tgagtgacga ctgaatccgg tgagaatggc    10440

aaaagtttat gcatttcttt ccagacttgt tcaacaggcc agccattacg ctcgtcatca    10500

aaatcactcg catcaaccaa accgttattc attcgtgatt gcgcctgagc gagacgaaat    10560

acgcgatcgc tgttaaaagg acaattacaa acaggaatcg aatgcaaccg gcgcaggaac    10620

actgccagcg catcaacaat attttcacct gaatcaggat attcttctaa tacctggaat    10680

gctgttttcc cagggatcgc agtggtgagt aaccatgcat catcaggagt acggataaaa    10740

tgcttgatgg tcggaagagg cataaattcc gtcagccagt ttagtctgac catctcatct    10800

gtaacatcat tggcaacgct acctttgcca tgtttcagaa acaactctgg cgcatcgggc    10860

ttcccataca atcgatagat tgtcgcacct gattgcccga cattatcgcg agcccattta    10920

tacccatata aatcagcatc catgttggaa tttaatcgcg gcctagagca agacgtttcc    10980

cgttgaatat ggctcatact cttccttttt caatattatt gaagcattta tcagggttat    11040

tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg    11100

cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat catgacatta    11160

acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg tgatgacggt    11220

gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc    11280

gggagcagac aacaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg    11340

tgaaccatca ccctaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa    11400

ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa    11460

ggaagggaag aaagcgaaag gagcgggcgc tagggcgctg gcaagtgtag cggtcacgct    11520

gcgcgtaacc accacacccg ccgcgcttaa tgcgccgcta cagggcgcga tggatcc       11577


<210>  93
<211>  4443
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTR

<400>  93
atgcagagaa gccccctgga gaaggcctct gtggtgagca agctgttctt cagctggacc       60

agacccatcc tgagaaaggg ctacagacag agactggagc tgtctgacat ctaccagatc      120

ccctctgtgg actctgctga caacctgtct gagaagctgg agagagagtg ggacagagag      180

ctggccagca agaagaaccc caagctgatc aatgccctga gaagatgctt cttctggaga      240

ttcatgttct atggcatctt cctgtacctg ggggaggtga ccaaggctgt gcagcccctg      300

ctgctgggca gaatcattgc cagctatgac cctgacaaca aggaggagag aagcattgcc      360

atctacctgg gcattggcct gtgcctgctg ttcattgtga gaaccctgct gctgcaccct      420

gccatctttg gcctgcacca cattggcatg cagatgagaa ttgccatgtt cagcctgatc      480

tacaagaaga ccctgaagct gagcagcaga gtgctggaca agatcagcat tggccagctg      540

gtgagcctgc tgagcaacaa cctgaacaag tttgatgagg gcctggccct ggcccacttt      600

gtgtggattg cccccctgca ggtggccctg ctgatgggcc tgatctggga gctgctgcag      660

gcctctgcct tctgtggcct gggcttcctg attgtgctgg ccctgttcca ggctggcctg      720

ggcagaatga tgatgaagta cagagaccag agagctggca agatctctga gagactggtg      780

atcacctctg agatgattga gaacatccag tctgtgaagg cctactgctg ggaggaggcc      840

atggagaaga tgattgagaa cctgagacag acagagctga agctgaccag aaaggctgcc      900

tatgtgagat acttcaacag ctctgccttc ttcttctctg gcttctttgt ggtgttcctg      960

tctgtgctgc cctatgccct gatcaagggc atcatcctga gaaagatctt caccaccatc     1020

agcttctgca ttgtgctgag aatggctgtg accagacagt tcccctgggc tgtgcagacc     1080

tggtatgaca gcctgggggc catcaacaag atccaggact tcctgcagaa gcaggagtac     1140

aagaccctgg agtacaacct gaccaccaca gaggtggtga tggagaatgt gacagccttc     1200

tgggaggagg gctttgggga gctgtttgag aaggccaagc agaacaacaa caacagaaag     1260

accagcaatg gggatgacag cctgttcttc agcaacttca gcctgctggg cacccctgtg     1320

ctgaaggaca tcaacttcaa gattgagaga ggccagctgc tggctgtggc tggcagcaca     1380

ggggctggca agaccagcct gctgatgatg atcatggggg agctggagcc ctctgagggc     1440

aagatcaagc actctggcag aatcagcttc tgcagccagt tcagctggat catgcctggc     1500

accatcaagg agaacatcat ctttggggtg agctatgatg agtacagata cagatctgtg     1560

atcaaggcct gccagctgga ggaggacatc agcaagtttg ctgagaagga caacattgtg     1620

ctgggggagg ggggcatcac cctgtctggg ggccagagag ccagaatcag cctggccaga     1680

gctgtgtaca aggatgctga cctgtacctg ctggacagcc cctttggcta cctggatgtg     1740

ctgacagaga aggagatctt tgagagctgt gtgtgcaagc tgatggccaa caagaccaga     1800

atcctggtga ccagcaagat ggagcacctg aagaaggctg acaagatcct gatcctgcat     1860

gagggcagca gctacttcta tggcaccttc tctgagctgc agaacctgca gcctgacttc     1920

agcagcaagc tgatgggctg tgacagcttt gaccagttct ctgctgagag aagaaacagc     1980

atcctgacag agaccctgca cagattcagc ctggaggggg atgcccctgt gagctggaca     2040

gagaccaaga agcagagctt caagcagaca ggggagtttg gggagaagag aaagaacagc     2100

atcctgaacc ccatcaacag catcagaaag ttcagcattg tgcagaagac ccccctgcag     2160

atgaatggca ttgaggagga ctctgatgag cccctggaga gaagactgag cctggtgcct     2220

gactctgagc agggggaggc catcctgccc agaatctctg tgatcagcac aggccccacc     2280

ctgcaggcca gaagaagaca gtctgtgctg aacctgatga cccactctgt gaaccagggc     2340

cagaacatcc acagaaagac cacagccagc accagaaagg tgagcctggc cccccaggcc     2400

aacctgacag agctggacat ctacagcaga agactgagcc aggagacagg cctggagatc     2460

tctgaggaga tcaatgagga ggacctgaag gagtgcttct ttgatgacat ggagagcatc     2520

cctgctgtga ccacctggaa cacctacctg agatacatca cagtgcacaa gagcctgatc     2580

tttgtgctga tctggtgcct ggtgatcttc ctggctgagg tggctgccag cctggtggtg     2640

ctgtggctgc tgggcaacac ccccctgcag gacaagggca acagcaccca cagcagaaac     2700

aacagctatg ctgtgatcat caccagcacc agcagctact atgtgttcta catctatgtg     2760

ggggtggctg acaccctgct ggccatgggc ttcttcagag gcctgcccct ggtgcacacc     2820

ctgatcacag tgagcaagat cctgcaccac aagatgctgc actctgtgct gcaggccccc     2880

atgagcaccc tgaacaccct gaaggctggg ggcatcctga acagattcag caaggacatt     2940

gccatcctgg atgacctgct gcccctgacc atctttgact tcatccagct gctgctgatt     3000

gtgattgggg ccattgctgt ggtggctgtg ctgcagccct acatctttgt ggccacagtg     3060

cctgtgattg tggccttcat catgctgaga gcctacttcc tgcagaccag ccagcagctg     3120

aagcagctgg agtctgaggg cagaagcccc atcttcaccc acctggtgac cagcctgaag     3180

ggcctgtgga ccctgagagc ctttggcaga cagccctact ttgagaccct gttccacaag     3240

gccctgaacc tgcacacagc caactggttc ctgtacctga gcaccctgag atggttccag     3300

atgagaattg agatgatctt tgtgatcttc ttcattgctg tgaccttcat cagcatcctg     3360

accacagggg agggggaggg cagagtgggc atcatcctga ccctggccat gaacatcatg     3420

agcaccctgc agtgggctgt gaacagcagc attgatgtgg acagcctgat gagatctgtg     3480

agcagagtgt tcaagttcat tgacatgccc acagagggca agcccaccaa gagcaccaag     3540

ccctacaaga atggccagct gagcaaggtg atgatcattg agaacagcca tgtgaagaag     3600

gatgacatct ggccctctgg gggccagatg acagtgaagg acctgacagc caagtacaca     3660

gaggggggca atgccatcct ggagaacatc agcttcagca tcagccctgg ccagagagtg     3720

ggcctgctgg gcagaacagg ctctggcaag agcaccctgc tgtctgcctt cctgagactg     3780

ctgaacacag agggggagat ccagattgat ggggtgagct gggacagcat caccctgcag     3840

cagtggagaa aggcctttgg ggtgatcccc cagaaggtgt tcatcttctc tggcaccttc     3900

agaaagaacc tggaccccta tgagcagtgg tctgaccagg agatctggaa ggtggctgat     3960

gaggtgggcc tgagatctgt gattgagcag ttccctggca agctggactt tgtgctggtg     4020

gatgggggct gtgtgctgag ccatggccac aagcagctga tgtgcctggc cagatctgtg     4080

ctgagcaagg ccaagatcct gctgctggat gagccctctg cccacctgga ccctgtgacc     4140

taccagatca tcagaagaac cctgaagcag gcctttgctg actgcacagt gatcctgtgt     4200

gagcacagaa ttgaggccat gctggagtgc cagcagttcc tggtgattga ggagaacaag     4260

gtgagacagt atgacagcat ccagaagctg ctgaatgaga gaagcctgtt cagacaggcc     4320

atcagcccct ctgacagagt gaagctgttc ccccacagaa acagcagcaa gtgcaagagc     4380

aagccccaga ttgctgccct gaaggaggag accgaggagg aggtgcagga caccagactg     4440

taa                                                                   4443


<210>  94
<211>  1480
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  CFTR protein

<400>  94

Met Gln Arg Ser Pro Leu Glu Lys Ala Ser Val Val Ser Lys Leu Phe 
1               5                   10                  15      


Phe Ser Trp Thr Arg Pro Ile Leu Arg Lys Gly Tyr Arg Gln Arg Leu 
            20                  25                  30          


Glu Leu Ser Asp Ile Tyr Gln Ile Pro Ser Val Asp Ser Ala Asp Asn 
        35                  40                  45              


Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys 
    50                  55                  60                  


Lys Asn Pro Lys Leu Ile Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg 
65                  70                  75                  80  


Phe Met Phe Tyr Gly Ile Phe Leu Tyr Leu Gly Glu Val Thr Lys Ala 
                85                  90                  95      


Val Gln Pro Leu Leu Leu Gly Arg Ile Ile Ala Ser Tyr Asp Pro Asp 
            100                 105                 110         


Asn Lys Glu Glu Arg Ser Ile Ala Ile Tyr Leu Gly Ile Gly Leu Cys 
        115                 120                 125             


Leu Leu Phe Ile Val Arg Thr Leu Leu Leu His Pro Ala Ile Phe Gly 
    130                 135                 140                 


Leu His His Ile Gly Met Gln Met Arg Ile Ala Met Phe Ser Leu Ile 
145                 150                 155                 160 


Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys Ile Ser 
                165                 170                 175     


Ile Gly Gln Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp 
            180                 185                 190         


Glu Gly Leu Ala Leu Ala His Phe Val Trp Ile Ala Pro Leu Gln Val 
        195                 200                 205             


Ala Leu Leu Met Gly Leu Ile Trp Glu Leu Leu Gln Ala Ser Ala Phe 
    210                 215                 220                 


Cys Gly Leu Gly Phe Leu Ile Val Leu Ala Leu Phe Gln Ala Gly Leu 
225                 230                 235                 240 


Gly Arg Met Met Met Lys Tyr Arg Asp Gln Arg Ala Gly Lys Ile Ser 
                245                 250                 255     


Glu Arg Leu Val Ile Thr Ser Glu Met Ile Glu Asn Ile Gln Ser Val 
            260                 265                 270         


Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met Ile Glu Asn Leu 
        275                 280                 285             


Arg Gln Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr 
    290                 295                 300                 


Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu 
305                 310                 315                 320 


Ser Val Leu Pro Tyr Ala Leu Ile Lys Gly Ile Ile Leu Arg Lys Ile 
                325                 330                 335     


Phe Thr Thr Ile Ser Phe Cys Ile Val Leu Arg Met Ala Val Thr Arg 
            340                 345                 350         


Gln Phe Pro Trp Ala Val Gln Thr Trp Tyr Asp Ser Leu Gly Ala Ile 
        355                 360                 365             


Asn Lys Ile Gln Asp Phe Leu Gln Lys Gln Glu Tyr Lys Thr Leu Glu 
    370                 375                 380                 


Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe 
385                 390                 395                 400 


Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gln Asn Asn 
                405                 410                 415     


Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn 
            420                 425                 430         


Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp Ile Asn Phe Lys Ile 
        435                 440                 445             


Glu Arg Gly Gln Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys 
    450                 455                 460                 


Thr Ser Leu Leu Met Met Ile Met Gly Glu Leu Glu Pro Ser Glu Gly 
465                 470                 475                 480 


Lys Ile Lys His Ser Gly Arg Ile Ser Phe Cys Ser Gln Phe Ser Trp 
                485                 490                 495     


Ile Met Pro Gly Thr Ile Lys Glu Asn Ile Ile Phe Gly Val Ser Tyr 
            500                 505                 510         


Asp Glu Tyr Arg Tyr Arg Ser Val Ile Lys Ala Cys Gln Leu Glu Glu 
        515                 520                 525             


Asp Ile Ser Lys Phe Ala Glu Lys Asp Asn Ile Val Leu Gly Glu Gly 
    530                 535                 540                 


Gly Ile Thr Leu Ser Gly Gly Gln Arg Ala Arg Ile Ser Leu Ala Arg 
545                 550                 555                 560 


Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly 
                565                 570                 575     


Tyr Leu Asp Val Leu Thr Glu Lys Glu Ile Phe Glu Ser Cys Val Cys 
            580                 585                 590         


Lys Leu Met Ala Asn Lys Thr Arg Ile Leu Val Thr Ser Lys Met Glu 
        595                 600                 605             


His Leu Lys Lys Ala Asp Lys Ile Leu Ile Leu His Glu Gly Ser Ser 
    610                 615                 620                 


Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gln Asn Leu Gln Pro Asp Phe 
625                 630                 635                 640 


Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gln Phe Ser Ala Glu 
                645                 650                 655     


Arg Arg Asn Ser Ile Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu 
            660                 665                 670         


Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gln Ser Phe Lys 
        675                 680                 685             


Gln Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser Ile Leu Asn Pro 
    690                 695                 700                 


Ile Asn Ser Ile Arg Lys Phe Ser Ile Val Gln Lys Thr Pro Leu Gln 
705                 710                 715                 720 


Met Asn Gly Ile Glu Glu Asp Ser Asp Glu Pro Leu Glu Arg Arg Leu 
                725                 730                 735     


Ser Leu Val Pro Asp Ser Glu Gln Gly Glu Ala Ile Leu Pro Arg Ile 
            740                 745                 750         


Ser Val Ile Ser Thr Gly Pro Thr Leu Gln Ala Arg Arg Arg Gln Ser 
        755                 760                 765             


Val Leu Asn Leu Met Thr His Ser Val Asn Gln Gly Gln Asn Ile His 
    770                 775                 780                 


Arg Lys Thr Thr Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gln Ala 
785                 790                 795                 800 


Asn Leu Thr Glu Leu Asp Ile Tyr Ser Arg Arg Leu Ser Gln Glu Thr 
                805                 810                 815     


Gly Leu Glu Ile Ser Glu Glu Ile Asn Glu Glu Asp Leu Lys Glu Cys 
            820                 825                 830         


Phe Phe Asp Asp Met Glu Ser Ile Pro Ala Val Thr Thr Trp Asn Thr 
        835                 840                 845             


Tyr Leu Arg Tyr Ile Thr Val His Lys Ser Leu Ile Phe Val Leu Ile 
    850                 855                 860                 


Trp Cys Leu Val Ile Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val 
865                 870                 875                 880 


Leu Trp Leu Leu Gly Asn Thr Pro Leu Gln Asp Lys Gly Asn Ser Thr 
                885                 890                 895     


His Ser Arg Asn Asn Ser Tyr Ala Val Ile Ile Thr Ser Thr Ser Ser 
            900                 905                 910         


Tyr Tyr Val Phe Tyr Ile Tyr Val Gly Val Ala Asp Thr Leu Leu Ala 
        915                 920                 925             


Met Gly Phe Phe Arg Gly Leu Pro Leu Val His Thr Leu Ile Thr Val 
    930                 935                 940                 


Ser Lys Ile Leu His His Lys Met Leu His Ser Val Leu Gln Ala Pro 
945                 950                 955                 960 


Met Ser Thr Leu Asn Thr Leu Lys Ala Gly Gly Ile Leu Asn Arg Phe 
                965                 970                 975     


Ser Lys Asp Ile Ala Ile Leu Asp Asp Leu Leu Pro Leu Thr Ile Phe 
            980                 985                 990         


Asp Phe Ile Gln Leu Leu Leu Ile  Val Ile Gly Ala Ile  Ala Val Val 
        995                 1000                 1005             


Ala Val  Leu Gln Pro Tyr Ile  Phe Val Ala Thr Val  Pro Val Ile 
    1010                 1015                 1020             


Val Ala  Phe Ile Met Leu Arg  Ala Tyr Phe Leu Gln  Thr Ser Gln 
    1025                 1030                 1035             


Gln Leu  Lys Gln Leu Glu Ser  Glu Gly Arg Ser Pro  Ile Phe Thr 
    1040                 1045                 1050             


His Leu  Val Thr Ser Leu Lys  Gly Leu Trp Thr Leu  Arg Ala Phe 
    1055                 1060                 1065             


Gly Arg  Gln Pro Tyr Phe Glu  Thr Leu Phe His Lys  Ala Leu Asn 
    1070                 1075                 1080             


Leu His  Thr Ala Asn Trp Phe  Leu Tyr Leu Ser Thr  Leu Arg Trp 
    1085                 1090                 1095             


Phe Gln  Met Arg Ile Glu Met  Ile Phe Val Ile Phe  Phe Ile Ala 
    1100                 1105                 1110             


Val Thr  Phe Ile Ser Ile Leu  Thr Thr Gly Glu Gly  Glu Gly Arg 
    1115                 1120                 1125             


Val Gly  Ile Ile Leu Thr Leu  Ala Met Asn Ile Met  Ser Thr Leu 
    1130                 1135                 1140             


Gln Trp  Ala Val Asn Ser Ser  Ile Asp Val Asp Ser  Leu Met Arg 
    1145                 1150                 1155             


Ser Val  Ser Arg Val Phe Lys  Phe Ile Asp Met Pro  Thr Glu Gly 
    1160                 1165                 1170             


Lys Pro  Thr Lys Ser Thr Lys  Pro Tyr Lys Asn Gly  Gln Leu Ser 
    1175                 1180                 1185             


Lys Val  Met Ile Ile Glu Asn  Ser His Val Lys Lys  Asp Asp Ile 
    1190                 1195                 1200             


Trp Pro  Ser Gly Gly Gln Met  Thr Val Lys Asp Leu  Thr Ala Lys 
    1205                 1210                 1215             


Tyr Thr  Glu Gly Gly Asn Ala  Ile Leu Glu Asn Ile  Ser Phe Ser 
    1220                 1225                 1230             


Ile Ser  Pro Gly Gln Arg Val  Gly Leu Leu Gly Arg  Thr Gly Ser 
    1235                 1240                 1245             


Gly Lys  Ser Thr Leu Leu Ser  Ala Phe Leu Arg Leu  Leu Asn Thr 
    1250                 1255                 1260             


Glu Gly  Glu Ile Gln Ile Asp  Gly Val Ser Trp Asp  Ser Ile Thr 
    1265                 1270                 1275             


Leu Gln  Gln Trp Arg Lys Ala  Phe Gly Val Ile Pro  Gln Lys Val 
    1280                 1285                 1290             


Phe Ile  Phe Ser Gly Thr Phe  Arg Lys Asn Leu Asp  Pro Tyr Glu 
    1295                 1300                 1305             


Gln Trp  Ser Asp Gln Glu Ile  Trp Lys Val Ala Asp  Glu Val Gly 
    1310                 1315                 1320             


Leu Arg  Ser Val Ile Glu Gln  Phe Pro Gly Lys Leu  Asp Phe Val 
    1325                 1330                 1335             


Leu Val  Asp Gly Gly Cys Val  Leu Ser His Gly His  Lys Gln Leu 
    1340                 1345                 1350             


Met Cys  Leu Ala Arg Ser Val  Leu Ser Lys Ala Lys  Ile Leu Leu 
    1355                 1360                 1365             


Leu Asp  Glu Pro Ser Ala His  Leu Asp Pro Val Thr  Tyr Gln Ile 
    1370                 1375                 1380             


Ile Arg  Arg Thr Leu Lys Gln  Ala Phe Ala Asp Cys  Thr Val Ile 
    1385                 1390                 1395             


Leu Cys  Glu His Arg Ile Glu  Ala Met Leu Glu Cys  Gln Gln Phe 
    1400                 1405                 1410             


Leu Val  Ile Glu Glu Asn Lys  Val Arg Gln Tyr Asp  Ser Ile Gln 
    1415                 1420                 1425             


Lys Leu  Leu Asn Glu Arg Ser  Leu Phe Arg Gln Ala  Ile Ser Pro 
    1430                 1435                 1440             


Ser Asp  Arg Val Lys Leu Phe  Pro His Arg Asn Ser  Ser Lys Cys 
    1445                 1450                 1455             


Lys Ser  Lys Pro Gln Ile Ala  Ala Leu Lys Glu Glu  Thr Glu Glu 
    1460                 1465                 1470             


Glu Val  Gln Asp Thr Arg Leu  
    1475                 1480 


<210>  95
<211>  1428
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  CFTRdeltaR protein

<400>  95

Met Gln Arg Ser Pro Leu Glu Lys Ala Ser Val Val Ser Lys Leu Phe 
1               5                   10                  15      


Phe Ser Trp Thr Arg Pro Ile Leu Arg Lys Gly Tyr Arg Gln Arg Leu 
            20                  25                  30          


Glu Leu Ser Asp Ile Tyr Gln Ile Pro Ser Val Asp Ser Ala Asp Asn 
        35                  40                  45              


Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys 
    50                  55                  60                  


Lys Asn Pro Lys Leu Ile Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg 
65                  70                  75                  80  


Phe Met Phe Tyr Gly Ile Phe Leu Tyr Leu Gly Glu Val Thr Lys Ala 
                85                  90                  95      


Val Gln Pro Leu Leu Leu Gly Arg Ile Ile Ala Ser Tyr Asp Pro Asp 
            100                 105                 110         


Asn Lys Glu Glu Arg Ser Ile Ala Ile Tyr Leu Gly Ile Gly Leu Cys 
        115                 120                 125             


Leu Leu Phe Ile Val Arg Thr Leu Leu Leu His Pro Ala Ile Phe Gly 
    130                 135                 140                 


Leu His His Ile Gly Met Gln Met Arg Ile Ala Met Phe Ser Leu Ile 
145                 150                 155                 160 


Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys Ile Ser 
                165                 170                 175     


Ile Gly Gln Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp 
            180                 185                 190         


Glu Gly Leu Ala Leu Ala His Phe Val Trp Ile Ala Pro Leu Gln Val 
        195                 200                 205             


Ala Leu Leu Met Gly Leu Ile Trp Glu Leu Leu Gln Ala Ser Ala Phe 
    210                 215                 220                 


Cys Gly Leu Gly Phe Leu Ile Val Leu Ala Leu Phe Gln Ala Gly Leu 
225                 230                 235                 240 


Gly Arg Met Met Met Lys Tyr Arg Asp Gln Arg Ala Gly Lys Ile Ser 
                245                 250                 255     


Glu Arg Leu Val Ile Thr Ser Glu Met Ile Glu Asn Ile Gln Ser Val 
            260                 265                 270         


Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met Ile Glu Asn Leu 
        275                 280                 285             


Arg Gln Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr 
    290                 295                 300                 


Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu 
305                 310                 315                 320 


Ser Val Leu Pro Tyr Ala Leu Ile Lys Gly Ile Ile Leu Arg Lys Ile 
                325                 330                 335     


Phe Thr Thr Ile Ser Phe Cys Ile Val Leu Arg Met Ala Val Thr Arg 
            340                 345                 350         


Gln Phe Pro Trp Ala Val Gln Thr Trp Tyr Asp Ser Leu Gly Ala Ile 
        355                 360                 365             


Asn Lys Ile Gln Asp Phe Leu Gln Lys Gln Glu Tyr Lys Thr Leu Glu 
    370                 375                 380                 


Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe 
385                 390                 395                 400 


Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gln Asn Asn 
                405                 410                 415     


Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn 
            420                 425                 430         


Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp Ile Asn Phe Lys Ile 
        435                 440                 445             


Glu Arg Gly Gln Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys 
    450                 455                 460                 


Thr Ser Leu Leu Met Met Ile Met Gly Glu Leu Glu Pro Ser Glu Gly 
465                 470                 475                 480 


Lys Ile Lys His Ser Gly Arg Ile Ser Phe Cys Ser Gln Phe Ser Trp 
                485                 490                 495     


Ile Met Pro Gly Thr Ile Lys Glu Asn Ile Ile Phe Gly Val Ser Tyr 
            500                 505                 510         


Asp Glu Tyr Arg Tyr Arg Ser Val Ile Lys Ala Cys Gln Leu Glu Glu 
        515                 520                 525             


Asp Ile Ser Lys Phe Ala Glu Lys Asp Asn Ile Val Leu Gly Glu Gly 
    530                 535                 540                 


Gly Ile Thr Leu Ser Gly Gly Gln Arg Ala Arg Ile Ser Leu Ala Arg 
545                 550                 555                 560 


Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly 
                565                 570                 575     


Tyr Leu Asp Val Leu Thr Glu Lys Glu Ile Phe Glu Ser Cys Val Cys 
            580                 585                 590         


Lys Leu Met Ala Asn Lys Thr Arg Ile Leu Val Thr Ser Lys Met Glu 
        595                 600                 605             


His Leu Lys Lys Ala Asp Lys Ile Leu Ile Leu His Glu Gly Ser Ser 
    610                 615                 620                 


Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gln Asn Leu Gln Pro Asp Phe 
625                 630                 635                 640 


Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gln Phe Ser Ala Glu 
                645                 650                 655     


Arg Arg Asn Ser Ile Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu 
            660                 665                 670         


Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gln Ser Phe Lys 
        675                 680                 685             


Gln Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser Ile Leu Asn Pro 
    690                 695                 700                 


Ile Asn Ser Thr Leu Gln Ala Arg Arg Arg Gln Ser Val Leu Asn Leu 
705                 710                 715                 720 


Met Thr His Ser Val Asn Gln Gly Gln Asn Ile His Arg Lys Thr Thr 
                725                 730                 735     


Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gln Ala Asn Leu Thr Glu 
            740                 745                 750         


Leu Asp Ile Tyr Ser Arg Arg Leu Ser Gln Glu Thr Gly Leu Glu Ile 
        755                 760                 765             


Ser Glu Glu Ile Asn Glu Glu Asp Leu Lys Glu Cys Phe Phe Asp Asp 
    770                 775                 780                 


Met Glu Ser Ile Pro Ala Val Thr Thr Trp Asn Thr Tyr Leu Arg Tyr 
785                 790                 795                 800 


Ile Thr Val His Lys Ser Leu Ile Phe Val Leu Ile Trp Cys Leu Val 
                805                 810                 815     


Ile Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val Leu Trp Leu Leu 
            820                 825                 830         


Gly Asn Thr Pro Leu Gln Asp Lys Gly Asn Ser Thr His Ser Arg Asn 
        835                 840                 845             


Asn Ser Tyr Ala Val Ile Ile Thr Ser Thr Ser Ser Tyr Tyr Val Phe 
    850                 855                 860                 


Tyr Ile Tyr Val Gly Val Ala Asp Thr Leu Leu Ala Met Gly Phe Phe 
865                 870                 875                 880 


Arg Gly Leu Pro Leu Val His Thr Leu Ile Thr Val Ser Lys Ile Leu 
                885                 890                 895     


His His Lys Met Leu His Ser Val Leu Gln Ala Pro Met Ser Thr Leu 
            900                 905                 910         


Asn Thr Leu Lys Ala Gly Gly Ile Leu Asn Arg Phe Ser Lys Asp Ile 
        915                 920                 925             


Ala Ile Leu Asp Asp Leu Leu Pro Leu Thr Ile Phe Asp Phe Ile Gln 
    930                 935                 940                 


Leu Leu Leu Ile Val Ile Gly Ala Ile Ala Val Val Ala Val Leu Gln 
945                 950                 955                 960 


Pro Tyr Ile Phe Val Ala Thr Val Pro Val Ile Val Ala Phe Ile Met 
                965                 970                 975     


Leu Arg Ala Tyr Phe Leu Gln Thr Ser Gln Gln Leu Lys Gln Leu Glu 
            980                 985                 990         


Ser Glu Gly Arg Ser Pro Ile Phe  Thr His Leu Val Thr  Ser Leu Lys 
        995                 1000                 1005             


Gly Leu  Trp Thr Leu Arg Ala  Phe Gly Arg Gln Pro  Tyr Phe Glu 
    1010                 1015                 1020             


Thr Leu  Phe His Lys Ala Leu  Asn Leu His Thr Ala  Asn Trp Phe 
    1025                 1030                 1035             


Leu Tyr  Leu Ser Thr Leu Arg  Trp Phe Gln Met Arg  Ile Glu Met 
    1040                 1045                 1050             


Ile Phe  Val Ile Phe Phe Ile  Ala Val Thr Phe Ile  Ser Ile Leu 
    1055                 1060                 1065             


Thr Thr  Gly Glu Gly Glu Gly  Arg Val Gly Ile Ile  Leu Thr Leu 
    1070                 1075                 1080             


Ala Met  Asn Ile Met Ser Thr  Leu Gln Trp Ala Val  Asn Ser Ser 
    1085                 1090                 1095             


Ile Asp  Val Asp Ser Leu Met  Arg Ser Val Ser Arg  Val Phe Lys 
    1100                 1105                 1110             


Phe Ile  Asp Met Pro Thr Glu  Gly Lys Pro Thr Lys  Ser Thr Lys 
    1115                 1120                 1125             


Pro Tyr  Lys Asn Gly Gln Leu  Ser Lys Val Met Ile  Ile Glu Asn 
    1130                 1135                 1140             


Ser His  Val Lys Lys Asp Asp  Ile Trp Pro Ser Gly  Gly Gln Met 
    1145                 1150                 1155             


Thr Val  Lys Asp Leu Thr Ala  Lys Tyr Thr Glu Gly  Gly Asn Ala 
    1160                 1165                 1170             


Ile Leu  Glu Asn Ile Ser Phe  Ser Ile Ser Pro Gly  Gln Arg Val 
    1175                 1180                 1185             


Gly Leu  Leu Gly Arg Thr Gly  Ser Gly Lys Ser Thr  Leu Leu Ser 
    1190                 1195                 1200             


Ala Phe  Leu Arg Leu Leu Asn  Thr Glu Gly Glu Ile  Gln Ile Asp 
    1205                 1210                 1215             


Gly Val  Ser Trp Asp Ser Ile  Thr Leu Gln Gln Trp  Arg Lys Ala 
    1220                 1225                 1230             


Phe Gly  Val Ile Pro Gln Lys  Val Phe Ile Phe Ser  Gly Thr Phe 
    1235                 1240                 1245             


Arg Lys  Asn Leu Asp Pro Tyr  Glu Gln Trp Ser Asp  Gln Glu Ile 
    1250                 1255                 1260             


Trp Lys  Val Ala Asp Glu Val  Gly Leu Arg Ser Val  Ile Glu Gln 
    1265                 1270                 1275             


Phe Pro  Gly Lys Leu Asp Phe  Val Leu Val Asp Gly  Gly Cys Val 
    1280                 1285                 1290             


Leu Ser  His Gly His Lys Gln  Leu Met Cys Leu Ala  Arg Ser Val 
    1295                 1300                 1305             


Leu Ser  Lys Ala Lys Ile Leu  Leu Leu Asp Glu Pro  Ser Ala His 
    1310                 1315                 1320             


Leu Asp  Pro Val Thr Tyr Gln  Ile Ile Arg Arg Thr  Leu Lys Gln 
    1325                 1330                 1335             


Ala Phe  Ala Asp Cys Thr Val  Ile Leu Cys Glu His  Arg Ile Glu 
    1340                 1345                 1350             


Ala Met  Leu Glu Cys Gln Gln  Phe Leu Val Ile Glu  Glu Asn Lys 
    1355                 1360                 1365             


Val Arg  Gln Tyr Asp Ser Ile  Gln Lys Leu Leu Asn  Glu Arg Ser 
    1370                 1375                 1380             


Leu Phe  Arg Gln Ala Ile Ser  Pro Ser Asp Arg Val  Lys Leu Phe 
    1385                 1390                 1395             


Pro His  Arg Asn Ser Ser Lys  Cys Lys Ser Lys Pro  Gln Ile Ala 
    1400                 1405                 1410             


Ala Leu  Lys Glu Glu Thr Glu  Glu Glu Val Gln Asp  Thr Arg Leu 
    1415                 1420                 1425             


<210>  96
<211>  250
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Mouse U1a promoter sequence

<400>  96
atggaggcgg tactatgtag atgagaattc aggagcaaac tgggaaaagc aactgcttcc       60

aaatatttgt gatttttaca gtgtagtttt ggaaaaactc ttagcctacc aattcttcta      120

agtgttttaa aatgtgggag ccagtacaca tgaagttata gagtgtttta atgaggctta      180

aatatttacc gtaactatga aatgctacgc atatcatgct gttcaggctc cgtggccacg      240

caactcatac                                                             250


<210>  97
<211>  101
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polymerase III H1 mutant promoter sequence

<400>  97
aatatttgca tgtcgctatg tgttctggga aatcaccata aacgtgaaat gtctttggat       60

ttgggaatct tcgaagttct gtatgagacc acagatctcc a                          101


<210>  98
<211>  2214
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV110 DNA

<400>  98
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagtcc ccgacccaca acctctcgga gaacctccag caacccccgc tgctgtggga      600

cctactacaa tggcttcagg cggtggcgca ccaatggcag acaataacga aggcgccgac      660

ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgaac atgggccttg cccacctata acaaccacct ctacaagcaa      780

atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcaccccc      840

tgggggtatt ttgatttcaa cagattccac tgccatttct caccacgtga ctggcagcga      900

ctcatcaaca acaattgggg attccggccc aagagactca acttcaagct cttcaacatc      960

caagtcaagg aggtcacgac gaatgatggc gtcacgacca tcgctaataa ccttaccagc     1020

acggttcaag tcttctcgga ctcggagtac cagttgccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaaca atggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaataac tttaccttca gctacacctt cgaggacgtg     1260

cctttccaca gcagctacgc gcacagccag agcctggacc ggctgatgaa tcctctcatc     1320

gaccagtacc tgtattacct gaacagaact cagaatcagt ccggaagtgc ccaaaacaag     1380

gacttgctgt ttagccgggg gtctccagct ggcatgtctg ttcagcccaa aaactggcta     1440

cctggaccct gttaccggca gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc     1500

aactttacct ggactggtgc ttcaaaatat aaccttaatg ggcgtgaatc tataatcaac     1560

cctggcactg ctatggcctc acacaaagac gacaaagaca agttctttcc catgagcggt     1620

gtcatgattt ttggaaagga gagcgccgga gcttcaaaca ctgcattgga caatgtcatg     1680

atcacagacg aagaggaaat caaagccact aaccccgtgg ccaccgaaag atttgggact     1740

gtggcagtca atctccagag cagcagcaca gaccctgcga ccggagatgt gcatgttatg     1800

ggagccttac ctggaatggt gtggcaagac agagacgtat acctgcaggg tcctatttgg     1860

gccaaaattc ctcacacgga tggacacttt cacccgtctc ctctcatggg cggctttgga     1920

cttaagcacc cgcctcctca gatcctcatc aaaaacacgc ctgttcctgc gaatcctccg     1980

gcagagtttt cggctacaaa gtttgcttca ttcatcaccc agtattccac aggacaagtg     2040

agcgtggaga ttgaatggga gctgcagaaa gaaaacagca aacgctggaa tcccgaagtg     2100

cagtatacat ctaactatgc aaaatctgcc aacgttgatt tcactgtgga caacaatgga     2160

ctttatactg agcctcgccc cattggcacc cgttacctca cccgtcccct gtaa           2214


<210>  99
<211>  1509
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(1509)
<223>  Sulfoglucosamine sulfohydrolase (SGSH)

<400>  99
atgagctgcc ccgtgcccgc ctgctgcgcg ctgctgctag tcctggggct ctgccgggcg       60

cgtccccgga acgcactgct gctcctcgcg gatgacggag gctttgagag tggcgcgtac      120

aacaacagcg ccatcgccac cccgcacctg gacgccttgg cccgccgcag cctcctcttt      180

cgcaatgcct tcacctcggt cagcagctgc tctcccagcc gcgccagcct cctcactggc      240

ctgccccagc atcagaatgg gatgtacggg ctgcaccagg acgtgcacca cttcaactcc      300

ttcgacaagg tgcggagcct gccgctgctg ctcagccaag ctggtgtgcg cacaggcatc      360

atcgggaaga agcacgtggg gccggagacc gtgtacccgt ttgactttgc gtacacggag      420

gagaatggct ccgtcctcca ggtggggcgg aacatcacta gaattaagct gctcgtccgg      480

aaattcctgc agactcagga tgaccagcct ttcttcctct acgtcgcctt ccacgacccc      540

caccgctgtg ggcactccca gccccagtac ggaaccttct gtgagaagtt tggcaacgga      600

gagagcggca tgggtcgtat cccagactgg accccccagg cctacgaccc actggacgtg      660

ctggtgcctt acttcgtccc caacaccccg gcagcccgag ccgacctggc cgctcagtac      720

accaccgtcg gccgcatgga ccaaggagtt ggactggtgc tccaggagct gcgtgacgcc      780

ggtgtcctga acgacacact ggtgatcttc acgtccgaca acgggatccc cttccccagc      840

ggcaggacca acctgtactg gccgggcact gctgaaccct tactggtgtc atccccggag      900

cacccaaaac gctggggcca agtcagcgag gcctacgtga gcctcctaga cctcacgccc      960

accatcttgg attggttctc gatcccgtac cccagctacg ccatctttgg ctcgaagacc     1020

atccacctca ctggccggtc cctcctgccg gcgctggagg ccgagcccct ctgggccacc     1080

gtctttggca gccagagcca ccacgaggtc accatgtcct accccatgcg ctccgtgcag     1140

caccggcact tccgcctcgt gcacaacctc aacttcaaga tgccctttcc catcgaccag     1200

gacttctacg tctcacccac cttccaggac ctcctgaacc gcaccacagc tggtcagccc     1260

acgggctggt acaaggacct ccgtcattac tactaccggg cgcgctggga gctctacgac     1320

cggagccggg acccccacga gacccagaac ctggccaccg acccgcgctt tgctcagctt     1380

ctggagatgc ttcgggacca gctggccaag tggcagtggg agacccacga cccctgggtg     1440

tgcgcccccg acggcgtcct ggaggagaag ctctctcccc agtgccagcc cctccacaat     1500

gagctgtga                                                             1509


<210>  100
<211>  1509
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-SGSH

<400>  100
atgagctgtc ctgttccagc ctgttgtgcc ctgctgctgg ttctgggact gtgcagagcc       60

agacctagga acgctctgct gctgctcgct gacgatggcg gatttgagag cggcgcctac      120

aacaacagcg ccattgccac acctcacctg gatgccctgg ccagaagaag cctgctgttc      180

agaaacgcct tcaccagcgt gtccagctgc agcccttcta gagctagcct gctgacagga      240

ctgccccagc accagaatgg gatgtatggc ctgcaccagg acgtgcacca cttcaacagc      300

ttcgacaaag tgcggagcct gcctctgctt ctgtctcaag ccggcgtcag aacaggcatc      360

atcggcaaga aacacgtggg ccccgagaca gtgtacccct tcgatttcgc ctacaccgaa      420

gagaacggca gcgtgctgca agtgggcaga aacatcaccc ggatcaagct gctcgtgcgg      480

aagttcctgc agacccagga cgaccagcct ttcttcctgt acgtggcctt ccacgatcct      540

cacagatgcg gccatagcca gcctcagtac ggcaccttct gcgagaagtt tggcaacggc      600

gagagcggca tgggcagaat ccctgattgg acccctcagg cctacgatcc cctggatgtg      660

ctggtgcctt acttcgtgcc taacacacca gccgccagag ccgatctggc cgctcagtat      720

acaaccgtgg gaagaatgga ccaaggcgtc ggcctggttc tgcaagagct tagagatgcc      780

ggcgtgctga acgacaccct ggtcatcttt accagcgaca acggcatccc ctttccatct      840

ggccggacca atctgtactg gcctggaaca gctgagcccc tgctggtgtc tagccctgag      900

caccctaaga gatggggcca agtgtctgag gcctacgtgt ccctgctgga tctgacccct      960

accatcctgg actggttcag catcccctat cctagctacg ccatcttcgg cagcaagacc     1020

atccacctga ccggcagatc tctgctgcca gctctggaag ctgaacctct gtgggccaca     1080

gtgtttggca gccagtctca ccacgaagtg acaatgagct accccatgcg gagcgtgcag     1140

cacagacact tcagactggt gcacaacctg aacttcaaga tgccctttcc aatcgaccag     1200

gacttctatg tgtccccaac cttccaggac ctgctgaaca gaaccacagc cggccaacct     1260

accggctggt acaaggacct gcggcactac tactatagag ccagatggga gctgtacgac     1320

cggtccagag atccccacga gacacagaac ctggccaccg atcctagatt cgcccagctg     1380

ctggaaatgc tgagagatca gctggccaag tggcagtggg agacacacga tccttgggtc     1440

tgcgctcctg atggcgtgct ggaagagaag ctgtcccctc agtgtcagcc cctgcacaac     1500

gagctttaa                                                             1509


<210>  101
<211>  1596
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized + GET CO1-SGSH-GET

<400>  101
atgagctgtc ctgttccagc ctgttgtgcc ctgctgctgg ttctgggact gtgcagagcc       60

agacctagga acgctctgct gctgctcgct gacgatggcg gatttgagag cggcgcctac      120

aacaacagcg ccattgccac acctcacctg gatgccctgg ccagaagaag cctgctgttc      180

agaaacgcct tcaccagcgt gtccagctgc agcccttcta gagctagcct gctgacagga      240

ctgccccagc accagaatgg gatgtatggc ctgcaccagg acgtgcacca cttcaacagc      300

ttcgacaaag tgcggagcct gcctctgctt ctgtctcaag ccggcgtcag aacaggcatc      360

atcggcaaga aacacgtggg ccccgagaca gtgtacccct tcgatttcgc ctacaccgaa      420

gagaacggca gcgtgctgca agtgggcaga aacatcaccc ggatcaagct gctcgtgcgg      480

aagttcctgc agacccagga cgaccagcct ttcttcctgt acgtggcctt ccacgatcct      540

cacagatgcg gccatagcca gcctcagtac ggcaccttct gcgagaagtt tggcaacggc      600

gagagcggca tgggcagaat ccctgattgg acccctcagg cctacgatcc cctggatgtg      660

ctggtgcctt acttcgtgcc taacacacca gccgccagag ccgatctggc cgctcagtat      720

acaaccgtgg gaagaatgga ccaaggcgtc ggcctggttc tgcaagagct tagagatgcc      780

ggcgtgctga acgacaccct ggtcatcttt accagcgaca acggcatccc ctttccatct      840

ggccggacca atctgtactg gcctggaaca gctgagcccc tgctggtgtc tagccctgag      900

caccctaaga gatggggcca agtgtctgag gcctacgtgt ccctgctgga tctgacccct      960

accatcctgg actggttcag catcccctat cctagctacg ccatcttcgg cagcaagacc     1020

atccacctga ccggcagatc tctgctgcca gctctggaag ctgaacctct gtgggccaca     1080

gtgtttggca gccagtctca ccacgaagtg acaatgagct accccatgcg gagcgtgcag     1140

cacagacact tcagactggt gcacaacctg aacttcaaga tgccctttcc aatcgaccag     1200

gacttctatg tgtccccaac cttccaggac ctgctgaaca gaaccacagc cggccaacct     1260

accggctggt acaaggacct gcggcactac tactatagag ccagatggga gctgtacgac     1320

cggtccagag atccccacga gacacagaac ctggccaccg atcctagatt cgcccagctg     1380

ctggaaatgc tgagagatca gctggccaag tggcagtggg agacacacga tccttgggtc     1440

tgcgctcctg atggcgtgct ggaagagaag ctgtcccctc agtgtcagcc cctgcacaac     1500

gagctgcggc gtcgtcggcg aagaagaaga aagcgcaaga aaaaaggcaa aggcctgggc     1560

aagaagcggg acccctgtct gagaaagtac aaataa                               1596


<210>  102
<211>  1509
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO2-SGSH

<400>  102
atgagctgcc ctgtgcctgc ctgctgtgcc ctgctgctgg tgctgggcct gtgcagagcc       60

agacctagga atgccctgct gctgctggct gatgatgggg gctttgagag tggggcctac      120

aacaacagtg ccattgccac cccccacctg gatgccctgg ccagaagaag cctgctgttc      180

agaaatgcct tcaccagtgt gagcagctgc agccccagca gagccagcct gctgacaggc      240

ctgccccagc accagaatgg catgtatggc ctgcaccagg atgtgcacca cttcaacagc      300

tttgacaagg tgagaagcct gcccctgctg ctgagccagg ctggggtgag aacaggcatc      360

attggcaaga agcatgtggg ccctgagaca gtgtacccct ttgactttgc ctacacagag      420

gagaatggca gtgtgctgca ggtgggcaga aacatcacca gaatcaagct gctggtgaga      480

aagttcctgc agacccagga tgaccagccc ttcttcctgt atgtggcctt ccatgacccc      540

cacagatgtg gccacagcca gccccagtat ggcaccttct gtgagaagtt tggcaatggg      600

gagagtggca tgggcagaat ccctgactgg accccccagg cctatgaccc cctggatgtg      660

ctggtgccct actttgtgcc caacacccct gctgccagag ctgacctggc tgcccagtac      720

accacagtgg gcagaatgga ccagggggtg ggcctggtgc tgcaggagct gagagatgct      780

ggggtgctga atgacaccct ggtgatcttc accagtgaca atggcatccc cttccccagt      840

ggcagaacca acctgtactg gcctggcaca gctgagcccc tgctggtgag cagccctgag      900

caccccaaga gatggggcca ggtgagtgag gcctatgtga gcctgctgga cctgaccccc      960

accatcctgg actggttcag catcccctac cccagctatg ccatctttgg cagcaagacc     1020

atccacctga caggcagaag cctgctgcct gccctggagg ctgagcccct gtgggccaca     1080

gtgtttggca gccagagcca ccatgaggtg accatgagct accccatgag aagtgtgcag     1140

cacagacact tcagactggt gcacaacctg aacttcaaga tgcccttccc cattgaccag     1200

gacttctatg tgagccccac cttccaggac ctgctgaaca gaaccacagc tggccagccc     1260

acaggctggt acaaggacct gagacactac tactacagag ccagatggga gctgtatgac     1320

agaagcagag acccccatga gacccagaac ctggccacag accccagatt tgcccagctg     1380

ctggagatgc tgagagacca gctggccaag tggcagtggg agacccatga cccctgggtg     1440

tgtgcccctg atggggtgct ggaggagaag ctgagccccc agtgccagcc cctgcacaat     1500

gagctgtga                                                             1509


<210>  103
<211>  921
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized Ceroid Lipofuscinosis, Neuronal, 1 (CLN1)

<400>  103
atggcttctc cggggtgtct gtggctgctg gcagtggcac tccttccctg gacttgcgcc       60

agccgggctc tgcagcacct cgaccctcca gcccctcttc cactggtgat ttggcacgga      120

atgggtgatt cctgctgtaa tcccctgtca atgggagcca tcaagaagat ggtggagaag      180

aagatccctg gaatctacgt gctgtcactg gagattggaa agaccctgat ggaggacgtc      240

gagaactcct tcttcctcaa tgtcaactct caagtgacca ccgtctgcca ggccctggcc      300

aaggacccga agctgcagca ggggtataat gctatggggt tcagccaggg aggacagttc      360

cttcgggctg tggcccaacg ctgccctagc ccacccatga tcaacctgat ctcagtgggt      420

ggccagcatc agggcgtgtt cggacttccc cggtgtcccg gggaatcctc tcatatctgc      480

gacttcatcc gcaaaactct caatgcaggc gcttattcaa aggtcgtcca agagaggctg      540

gtgcaagccg agtactggca cgatcccatt aaggaggacg tgtacagaaa tcactcaatc      600

tttctggccg acattaacca ggagagggga attaacgaat catataagaa gaatctcatg      660

gccctcaaaa agttcgtcat ggtgaagttc cttaacgata gcattgtgga cccagtggac      720

agcgaatggt tcggatttta ccgctcaggc caggcaaaag aaaccatccc tctccaagag      780

acttctcttt acacccaaga cagacttggg cttaaggaaa tggataacgc tggtcagctg      840

gtgttcctcg ccaccgaagg tgaccatctg cagctcagcg aagagtggtt ctacgctcat      900

atcatcccgt ttcttggttg a                                                921


<210>  104
<211>  885
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(885)
<223>  Survival Motor Neuron 1 (SMN1)

<400>  104
atggcgatga gcagcggcgg cagtggtggc ggcgtcccgg agcaggagga ttccgtgctg       60

ttccggcgcg gcacaggcca gagcgatgat tctgacattt gggatgatac agcactgata      120

aaagcatatg ataaagctgt ggcttcattt aagcatgctc taaagaatgg tgacatttgt      180

gaaacttcgg gtaaaccaaa aaccacacct aaaagaaaac ctgctaagaa gaataaaagc      240

caaaagaaga atactgcagc ttccttacaa cagtggaaag ttggggacaa atgttctgcc      300

atttggtcag aagacggttg catttaccca gctaccattg cttcaattga ttttaagaga      360

gaaacctgtg ttgtggttta cactggatat ggaaatagag aggagcaaaa tctgtccgat      420

ctactttccc caatctgtga agtagctaat aatatagaac agaatgctca agagaatgaa      480

aatgaaagcc aagtttcaac agatgaaagt gagaactcca ggtctcctgg aaataaatca      540

gataacatca agcccaaatc tgctccatgg aactcttttc tccctccacc accccccatg      600

ccagggccaa gactgggacc aggaaagcca ggtctaaaat tcaatggccc accaccgcca      660

ccgccaccac caccacccca cttactatca tgctggctgc ctccatttcc ttctggacca      720

ccaataattc ccccaccacc tcccatatgt ccagattctc ttgatgatgc tgatgctttg      780

ggaagtatgt taatttcatg gtacatgagt ggctatcata ctggctatta tatgggtttt      840

agacaaaatc aaaaagaagg aaggtgctca cattccttaa attaa                      885


<210>  105
<211>  885
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-SMN1

<400>  105
atggcgatgt ctagtggtgg atctggtggc ggcgtgcccg agcaagaaga tagcgtcctg       60

ttcagaagag gcaccggcca gagcgacgac agcgacatct gggatgatac agccctgatc      120

aaggcctacg acaaggccgt ggccagcttt aagcacgccc tgaagaacgg cgatatctgc      180

gagacaagcg gcaagcccaa gaccacacct aagagaaagc ccgccaagaa gaacaagagc      240

cagaagaaga ataccgccgc cagcctgcag cagtggaaag tgggcgataa gtgcagcgcc      300

atttggagcg aggacggctg tatctaccct gccacaatcg ccagcatcga cttcaagcgg      360

gaaacctgcg tggtggtgta cacaggctac ggcaacagag aggaacagaa cctgagcgac      420

ctgctgtccc caatttgcga ggtggccaac aacatcgagc agaacgccca agagaacgag      480

aacgagtccc aggtgtccac cgacgagagc gagaatagca gaagccccgg caacaagagc      540

gacaacatca agcctaagag cgccccttgg aacagcttcc tgcctcctcc tccaccaatg      600

cctggaccta gactcggacc tggaaagccc ggcctgaagt tcaatggacc tccaccaccg      660

ccaccacctc cgcctccaca tcttctgtct tgttggctgc ctccatttcc tagcggccct      720

ccaatcatcc cgccacctcc acctatctgc cccgacagtc tggatgatgc tgatgccctg      780

ggctccatgc tgatctcttg gtacatgagc ggctaccaca ccggctacta catgggcttc      840

agacagaacc agaaagaggg ccgttgcagc cacagcctga actga                      885


<210>  106
<211>  885
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO2-SMN1

<400>  106
atggccatga gcagtggggg cagtggagga ggggtgcctg agcaggagga cagtgtgctg       60

ttcagaagag gcacaggcca gagtgatgac agtgacatct gggatgacac agccctgatc      120

aaggcctatg acaaggctgt ggccagcttc aagcatgccc tgaagaatgg ggacatctgt      180

gagaccagtg gcaagcccaa gaccaccccc aagagaaagc ctgccaagaa gaacaagagc      240

cagaagaaga acacagctgc cagcctgcag cagtggaagg tgggagacaa gtgcagtgcc      300

atctggagtg aggatggctg catctaccct gccaccattg ccagcattga cttcaagaga      360

gagacctgtg tggtggtgta cacaggctat ggcaacagag aggagcagaa cctgagtgac      420

ctgctgagcc ccatctgtga ggtggccaac aacattgagc agaatgccca ggagaatgag      480

aatgagagcc aggtgagcac agatgagagt gagaacagca gaagccctgg caacaagagt      540

gacaacatca agcccaagag tgccccttgg aacagcttcc tgccaccccc accacccatg      600

cctggcccca gactgggccc tggcaagcct ggcctgaagt tcaatggccc accaccccct      660

cctccaccac cccctcccca cctgctgagc tgctggctgc cccccttccc cagtggccca      720

cccatcatcc cacctccccc acccatctgc cctgacagcc tggatgatgc tgatgccctg      780

ggcagcatgc tgatcagctg gtacatgagt ggctaccaca caggctacta catgggcttc      840

agacagaacc agaaggaggg cagatgcagc cacagcctga actga                      885


<210>  107
<211>  1548
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(1548)
<223>  Tissue Non-specific Alkaline Phosphatase (TNALP)

<400>  107
atgatttcac cattcttagt actggccatt ggcacctgcc ttactaactc actagtgcca       60

gagaaagaga aagaccccaa gtactggcga gaccaagcgc aagagacact gaaatatgcc      120

ctggagcttc agaagctcaa caccaacgtg gctaagaatg tcatcatgtt cctgggagat      180

gggatgggtg tctccacagt gacggctgcc cgcatcctca agggtcagct ccaccacaac      240

cctggggagg agaccaggct ggagatggac aagttcccct tcgtggccct ctccaagacg      300

tacaacacca atgcccaggt ccctgacagc gccggcaccg ccaccgccta cctgtgtggg      360

gtgaaggcca atgagggcac cgtgggggta agcgcagcca ctgagcgttc ccggtgcaac      420

accacccagg ggaacgaggt cacctccatc ctgcgctggg ccaaggacgc tgggaaatct      480

gtgggcattg tgaccaccac gagagtgaac catgccaccc ccagcgccgc ctacgcccac      540

tcggctgacc gggactggta ctcagacaac gagatgcccc ctgaggcctt gagccagggc      600

tgtaaggaca tcgcctacca gctcatgcat aacatcaggg acattgacgt gatcatgggg      660

ggtggccgga aatacatgta ccccaagaat aaaactgatg tggagtatga gagtgacgag      720

aaagccaggg gcacgaggct ggacggcctg gacctcgttg acacctggaa gagcttcaaa      780

ccgagataca agcactccca cttcatctgg aaccgcacgg aactcctgac ccttgacccc      840

cacaatgtgg actacctatt gggtctcttc gagccagggg acatgcagta cgagctgaac      900

aggaacaacg tgacggaccc gtcactctcc gagatggtgg tggtggccat ccagatcctg      960

cggaagaacc ccaaaggctt cttcttgctg gtggaaggag gcagaattga ccacgggcac     1020

catgaaggaa aagccaagca ggccctgcat gaggcggtgg agatggaccg ggccatcggg     1080

caggcaggca gcttgacctc ctcggaagac actctgaccg tggtcactgc ggaccattcc     1140

cacgtcttca catttggtgg atacaccccc cgtggcaact ctatctttgg tctggccccc     1200

atgctgagtg acacagacaa gaagcccttc actgccatcc tgtatggcaa tgggcctggc     1260

tacaaggtgg tgggcggtga acgagagaat gtctccatgg tggactatgc tcacaacaac     1320

taccaggcgc agtctgctgt gcccctgcgc cacgagaccc acggcgggga ggacgtggcc     1380

gtcttctcca agggccccat ggcgcacctg ctgcacggcg tccacgagca gaactacgtc     1440

ccccacgtga tggcgtatgc agcctgcatc ggggccaacc tcggccactg tgctcctgcc     1500

agctcggcag gatccgatga tgacgacgac gatgacgatg atgattga                  1548


<210>  108
<211>  1548
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized, CO1-TNALP contains D10 tag at C end

<400>  108
atgatctctc catttctggt gctggccatc ggcacctgtc tgaccaactc actagtgccc       60

gagaaagaga aggaccccaa gtactggcgc gatcaggccc aagagacact gaagtacgcc      120

ctggaactgc agaaactgaa caccaacgtg gccaagaacg tgatcatgtt cctcggcgac      180

ggcatgggcg tgtccacagt tacagccgcc agaatcctga agggccagct gcaccataat      240

cctggcgaag agacacggct ggaaatggac aagttcccat tcgtggccct gagcaagacc      300

tacaacacca atgctcaggt gcccgattct gccggaacag ccacagctta tctgtgcggc      360

gtgaaggcca atgagggcac cgttggagtg tctgccgcca ccgaaagatc ccggtgcaat      420

accacacagg gcaacgaagt gaccagcatc ctgagatggg ccaaagacgc cggcaagtct      480

gtgggcatcg tgaccaccac cagagtgaac cacgccacac ctagcgccgc ctatgctcac      540

tctgccgaca gagactggta cagcgacaac gagatgcctc ctgaggctct gtctcagggc      600

tgcaaggata tcgcctacca gctgatgcac aacatccggg acattgatgt gatcatgggc      660

ggaggccgga agtacatgta tcccaagaac aagaccgacg tcgagtacga gagcgacgag      720

aaggccagag gcacaagact ggatggcctg gacctggtgg atacctggaa gtccttcaag      780

ccccggtaca agcacagcca cttcatctgg aaccggaccg agctgctgac actggaccct      840

cacaatgtgg actacctgct gggcctgttc gagcccggcg atatgcagta cgagctgaac      900

cggaacaacg tgacagaccc cagcctgagc gagatggtgg ttgtggccat tcagatcctg      960

cggaagaacc ccaagggatt cttcctgctg gtggaaggcg gcaggatcga tcacggacac     1020

catgagggaa aagccaagca ggccctgcac gaggccgtcg aaatggatag agccattggc     1080

caggccggca gcctgacaag ctctgaggat acactgaccg tggtcaccgc cgatcacagc     1140

cacgtgttca cattcggcgg ctacacccct agaggcaaca gcatctttgg actggcccct     1200

atgctgagcg acaccgacaa gaagcctttc accgccatcc tgtacggcaa cggccctggc     1260

tataaggttg tcggaggcga gagggaaaac gtgtccatgg tggattacgc ccacaacaac     1320

taccaggctc agagcgccgt gcctctgaga cacgaaacac acggcggaga agatgtggcc     1380

gtgttcagca agggccccat ggctcatctg ctgcatggcg tgcacgagca gaattacgtg     1440

ccacacgtga tggcctacgc cgcctgtatt ggagccaatc tgggacattg tgcccctgcc     1500

agtagcgccg gatccgacga tgatgacgac gacgatgacg atgactga                  1548


<210>  109
<211>  1548
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized, CO2-TNALP contains D10 tag at C end

<400>  109
atgatcagcc ccttcctggt gctggccatt ggcacctgcc tgaccaacag cctggtgcct       60

gagaaggaga aggaccccaa gtactggaga gaccaggccc aggagaccct gaagtatgcc      120

ctggagctgc agaagctgaa caccaatgtg gccaagaatg tgatcatgtt cctgggggat      180

ggcatggggg tgagcacagt gacagctgcc agaatcctga agggccagct gcaccacaac      240

cctggggagg agaccagact ggagatggac aagttcccct ttgtggccct gagcaagacc      300

tacaacacca atgcccaggt gcctgacagt gctggcacag ccacagccta cctgtgtggg      360

gtgaaggcca atgagggcac agtgggggtg agtgctgcca cagagagaag cagatgcaac      420

accacccagg gcaatgaggt gaccagcatc ctgagatggg ccaaggatgc tggcaagagt      480

gtgggcattg tgaccaccac cagagtgaac catgccaccc ccagtgctgc ctatgcccac      540

agtgctgaca gagactggta cagtgacaat gagatgcccc ctgaggccct gagccagggc      600

tgcaaggaca ttgcctacca gctgatgcac aacatcagag acattgatgt gatcatgggg      660

gggggcagaa agtacatgta ccccaagaac aagacagatg tggagtatga gagtgatgag      720

aaggccagag gcaccagact ggatggcctg gacctggtgg acacctggaa gagcttcaag      780

cccagataca agcacagcca cttcatctgg aacagaacag agctgctgac cctggacccc      840

cacaatgtgg actacctgct gggcctgttt gagcctgggg acatgcagta tgagctgaac      900

agaaacaatg tgacagaccc cagcctgagt gagatggtgg tggtggccat ccagatcctg      960

agaaagaacc ccaagggctt cttcctgctg gtggaggggg gcagaattga ccatggccac     1020

catgagggca aggccaagca ggccctgcat gaggctgtgg agatggacag agccattggc     1080

caggctggca gcctgaccag cagtgaggac accctgacag tggtgacagc tgaccacagc     1140

catgtgttca cctttggggg ctacaccccc agaggcaaca gcatctttgg cctggccccc     1200

atgctgagtg acacagacaa gaagcccttc acagccatcc tgtatggcaa tggccctggc     1260

tacaaggtgg tgggggggga gagagagaat gtgagcatgg tggactatgc ccacaacaac     1320

taccaggccc agagtgctgt gcccctgaga catgagaccc atggggggga ggatgtggct     1380

gtgttcagca agggccccat ggcccacctg ctgcatgggg tgcatgagca gaactatgtg     1440

ccccatgtga tggcctatgc tgcctgcatt ggggccaacc tgggccactg tgcccctgcc     1500

agcagtgctg gatccgatga tgatgatgat gatgatgatg atgactga                  1548


<210>  110
<211>  636
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(636)
<223>  Glial Cell Derived Neurotrophic Factor (GDNF)

<400>  110
atgaagttat gggatgtcgt ggctgtctgc ctggtgctgc tccacaccgc gtccgccttc       60

ccgctgcccg ccggcaagag gcctcccgag gcgcccgccg aagaccgctc cctcggccgc      120

cgccgcgcgc ccttcgcgct gagcagtgac tcaaatatgc cagaggatta tcctgatcag      180

ttcgatgatg tcatggattt tattcaagcc accattaaaa gactgaaaag gtcaccagat      240

aaacaaatgg cagtgcttcc tagaagagag cggaatcggc aggctgcagc tgccaaccca      300

gagaattcca gaggaaaagg tcggagaggc cagaggggca aaaaccgggg ttgtgtctta      360

actgcaatac atttaaatgt cactgacttg ggtctgggct atgaaaccaa ggaggaactg      420

atttttaggt actgcagcgg ctcttgcgat gcagctgaga caacgtacga caaaatattg      480

aaaaacttat ccagaaatag aaggctggtg agtgacaaag tagggcaggc atgttgcaga      540

cccatcgcct ttgatgatga cctgtcgttt ttagatgata acctggttta ccatattcta      600

agaaagcatt ccgctaaaag gtgtggatgt atctaa                                636


<210>  111
<211>  1611
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(1611)
<223>  Tissue Glucosyl Ceramidase beta (GBA1)

<400>  111
atggagtttt caagtccttc cagagaggaa tgtcccaagc ctttgagtag ggtaagcatc       60

atggctggca gcctcacagg attgcttcta cttcaggcag tgtcgtgggc atcaggtgcc      120

cgcccctgca tccctaaaag cttcggctac agctcggtgg tgtgtgtctg caatgccaca      180

tactgtgact cctttgaccc cccgaccttt cctgcccttg gtaccttcag ccgctatgag      240

agtacacgca gtgggcgacg gatggagctg agtatggggc ccatccaggc taatcacacg      300

ggcacaggcc tgctactgac cctgcagcca gaacagaagt tccagaaagt gaagggattt      360

ggaggggcca tgacagatgc tgctgctctc aacatccttg ccctgtcacc ccctgcccaa      420

aatttgctac ttaaatcgta cttctctgaa gaaggaatcg gatataacat catccgggta      480

ccaatggcca gctgtgactt ctccatccgc acctacacct atgcagacac ccctgatgat      540

ttccagttgc acaacttcag cctcccagag gaagatacca agctcaagat acccctgatt      600

caccgagccc tgcagttggc ccagcgtccc gtttcactcc ttgccagccc ctggacatca      660

cccacttggc tcaagaccaa tggagcggtg aatgggaagg ggtcactcaa gggacagccc      720

ggagacatct accaccagac ctgggccaga tactttgtga agttcctgga tgcctatgct      780

gagcacaagt tacagttctg ggcagtgaca gctgaaaatg agccttctgc tgggctgttg      840

agtggatacc ccttccagtg cctgggcttc acccctgaac atcagcgaga cttcattgcc      900

cgtgacctag gtcctaccct cgccaacagt actcaccaca atgtccgcct actcatgctg      960

gatgaccaac gcttgctgct gccccactgg gcaaaggtgg tactgacaga cccagaagca     1020

gctaaatatg ttcatggcat tgctgtacat tggtacctgg actttctggc tccagccaaa     1080

gccaccctag gggagacaca ccgcctgttc cccaacacca tgctctttgc ctcagaggcc     1140

tgtgtgggct ccaagttctg ggagcagagt gtgcggctag gctcctggga tcgagggatg     1200

cagtacagcc acagcatcat cacgaacctc ctgtaccatg tggtcggctg gaccgactgg     1260

aaccttgccc tgaaccccga aggaggaccc aattgggtgc gtaactttgt cgacagtccc     1320

atcattgtag acatcaccaa ggacacgttt tacaaacagc ccatgttcta ccaccttggc     1380

cacttcagca agttcattcc tgagggctcc cagagagtgg ggctggttgc cagtcagaag     1440

aacgacctgg acgcagtggc actgatgcat cccgatggct ctgctgttgt ggtcgtgcta     1500

aaccgctcct ctaaggatgt gcctcttacc atcaaggatc ctgctgtggg cttcctggag     1560

acaatctcac ctggctactc cattcacacc tacctgtggc gtcgccagtg a              1611


<210>  112
<211>  1611
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-GBA1

<400>  112
atggagttca gcagccccag cagagaggag tgccccaagc ccctgagcag agtgagcatc       60

atggctggca gcctgacagg cctgctgctg ctgcaggctg tgagctgggc cagtggggcc      120

agaccctgca tccccaagag ctttggctac agcagtgtgg tgtgtgtgtg caatgccacc      180

tactgtgaca gctttgaccc ccccaccttc cctgccctgg gcaccttcag cagatatgag      240

agcaccagaa gtggcagaag aatggagctg agcatgggcc ccatccaggc caaccacaca      300

ggcacaggcc tgctgctgac cctgcagcct gagcagaagt tccagaaggt gaagggcttt      360

gggggggcca tgacagatgc tgctgccctg aacatcctgg ccctgagccc ccctgcccag      420

aacctgctgc tgaagagcta cttcagtgag gagggcattg gctacaacat catcagagtg      480

ccaatggcca gctgtgactt cagcatcaga acctacacct atgctgacac ccctgatgac      540

ttccagctgc acaacttcag cctgcctgag gaggacacca agctgaagat ccccctgatc      600

cacagagccc tgcagctggc ccagagacct gtgagcctgc tggccagccc ctggaccagc      660

cccacctggc tgaagaccaa tggggctgtg aatggcaagg gcagcctgaa gggccagcct      720

ggggacatct accaccagac ctgggccaga tactttgtga agttcctgga tgcctatgct      780

gagcacaagc tgcagttctg ggctgtgaca gctgagaatg agcccagtgc tggcctgctg      840

agtggctacc ccttccagtg cctgggcttc acccctgagc accagagaga cttcattgcc      900

agagacctgg gccccaccct ggccaacagc acccaccaca atgtgagact gctgatgctg      960

gatgaccaga gactgctgct gccccactgg gccaaggtgg tgctgacaga ccctgaggct     1020

gccaagtatg tgcatggcat tgctgtgcac tggtacctgg acttcctggc ccctgccaag     1080

gccaccctgg gggagaccca cagactgttc cccaacacca tgctgtttgc cagtgaggcc     1140

tgtgtgggca gcaagttctg ggagcagagt gtgagactgg gcagctggga cagaggcatg     1200

cagtacagcc acagcatcat caccaacctg ctgtaccatg tggtgggctg gacagactgg     1260

aacctggccc tgaaccctga ggggggcccc aactgggtga gaaactttgt ggacagcccc     1320

atcattgtgg acatcaccaa ggacaccttc tacaagcagc ccatgttcta ccacctgggc     1380

cacttcagca agttcatccc tgagggcagc cagagagtgg gcctggtggc cagccagaag     1440

aatgacctgg atgctgtggc cctgatgcac cctgatggca gtgctgtggt ggtggtgctg     1500

aacagaagca gcaaggatgt gcccctgacc atcaaggacc ctgctgtggg cttcctggag     1560

accatcagcc ctggctacag catccacacc tacctgtgga gaagacagtg a              1611


<210>  113
<211>  1611
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO2-GBA1

<400>  113
atggagttta gcagccctag cagagaggaa tgccccaagc ctctgagccg ggtgtcaatc       60

atggccggat ctctgacagg actgctgctg cttcaggccg tgtcttgggc ttctggcgct      120

agaccttgca tccccaagag cttcggctac agcagcgtcg tgtgcgtgtg caatgccacc      180

tactgcgaca gcttcgaccc tcctaccttt cctgctctgg gcaccttcag cagatacgag      240

agcaccagat ccggcagacg gatggaactg agcatgggac ccatccaggc caatcacaca      300

ggcactggcc tgctgctgac actgcagcct gagcagaaat tccagaaagt gaaaggcttc      360

ggcggagcca tgacagatgc cgccgctctg aatatcctgg ctctgtctcc accagctcag      420

aacctgctgc tcaagagcta cttcagcgag gaaggcatcg gctacaacat catccgggtg      480

ccaatggcca gctgcgactt cagcatccgg acctacacct acgccgacac acccgacgat      540

ttccagctgc acaacttcag cctgcctgaa gaggacacca agctgaagat ccctctgatc      600

cacagagccc tgcagctggc acaaagaccc gtttctctgc tggctagccc ctggacatct      660

cccacctggc tgaaaacaaa tggcgccgtg aatggcaagg gcagcctgaa aggccaacct      720

ggcgatatct accaccagac ctgggccaga tacttcgtga agttcctgga cgcctatgcc      780

gagcacaagc tgcagttttg ggccgtgaca gccgagaacg aaccttctgc tggactgctg      840

agcggctacc cctttcagtg cctgggcttt acacccgagc accagcggga ctttatcgcc      900

agagatctgg gacccacact ggccaatagc acccaccata atgtgcggct gctgatgctg      960

gacgaccaga gactgcttct gccccactgg gctaaagtgg tgctgacaga tcctgaggcc     1020

gccaaatacg tgcacggaat cgccgtgcac tggtatctgg actttctggc ccctgccaag     1080

gccacactgg gagagacaca cagactgttc cccaacacca tgctgttcgc cagcgaagcc     1140

tgtgtgggca gcaagttttg ggaacagagc gtgcggctcg gcagctggga tagaggcatg     1200

cagtacagcc acagcatcat caccaacctg ctgtaccacg tcgtcggctg gaccgactgg     1260

aatctggccc tgaatcctga aggcggccct aactgggtcc gaaacttcgt ggacagcccc     1320

atcatcgtgg acatcaccaa ggacaccttc tacaagcagc ccatgttcta ccacctggga     1380

cacttcagca agttcatccc cgagggctct cagcgcgttg gactggtggc cagccagaag     1440

aatgatctgg acgccgtggc tctgatgcac cctgatggat ctgctgtggt ggtggtcctg     1500

aaccgcagca gcaaagatgt gcccctgacc atcaaggatc ccgccgtggg attcctggaa     1560

acaatcagcc ctggctactc catccacacc tacctgtggc ggagacagtg a              1611


<210>  114
<211>  1962
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(1962)
<223>  Iduronidase alpha-L- (IDUA)

<400>  114
atgcgtcccc tgcgcccccg cgccgcgctg ctggcgctcc tggcctcgct cctggccgcg       60

cccccggtgg ccccggccga ggccccgcac ctggtgcatg tggacgcggc ccgcgcgctg      120

tggcccctgc ggcgcttctg gaggagcaca ggcttctgcc ccccgctgcc acacagccag      180

gctgaccagt acgtcctcag ctgggaccag cagctcaacc tcgcctatgt gggcgccgtc      240

cctcaccgcg gcatcaagca ggtccggacc cactggctgc tggagcttgt caccaccagg      300

gggtccactg gacggggcct gagctacaac ttcacccacc tggacgggta cctggacctt      360

ctcagggaga accagctcct cccagggttt gagctgatgg gcagcgcctc gggccacttc      420

actgactttg aggacaagca gcaggtgttt gagtggaagg acttggtctc cagcctggcc      480

aggagataca tcggtaggta cggactggcg catgtttcca agtggaactt cgagacgtgg      540

aatgagccag accaccacga ctttgacaac gtctccatga ccatgcaagg cttcctgaac      600

tactacgatg cctgctcgga gggtctgcgc gccgccagcc ccgccctgcg gctgggaggc      660

cccggcgact ccttccacac cccaccgcga tccccgctga gctggggcct cctgcgccac      720

tgccacgacg gtaccaactt cttcactggg gaggcgggcg tgcggctgga ctacatctcc      780

ctccacagga agggtgcgcg cagctccatc tccatcctgg agcaggagaa ggtcgtcgcg      840

cagcagatcc ggcagctctt ccccaagttc gcggacaccc ccatttacaa cgacgaggcg      900

gacccgctgg tgggctggtc cctgccacag ccgtggaggg cggacgtgac ctacgcggcc      960

atggtggtga aggtcatcgc gcagcatcag aacctgctac tggccaacac cacctccgcc     1020

ttcccctacg cgctcctgag caacgacaat gccttcctga gctaccaccc gcaccccttc     1080

gcgcagcgca cgctcaccgc gcgcttccag gtcaacaaca cccgcccgcc gcacgtgcag     1140

ctgttgcgca agccggtgct cacggccatg gggctgctgg cgctgctgga tgaggagcag     1200

ctctgggccg aagtgtcgca ggccgggacc gtcctggaca gcaaccacac ggtgggcgtc     1260

ctggccagcg cccaccgccc ccagggcccg gccgacgcct ggcgcgccgc ggtgctgatc     1320

tacgcgagcg acgacacccg cgcccacccc aaccgcagcg tcgcggtgac cctgcggctg     1380

cgcggggtgc cccccggccc gggcctggtc tacgtcacgc gctacctgga caacgggctc     1440

tgcagccccg acggcgagtg gcggcgcctg ggccggcccg tcttccccac ggcagagcag     1500

ttccggcgca tgcgcgcggc tgaggacccg gtggccgcgg cgccccgccc cttacccgcc     1560

ggcggccgcc tgaccctgcg ccccgcgctg cggctgccgt cgcttttgct ggtgcacgtg     1620

tgtgcgcgcc ccgagaagcc gcccgggcag gtcacgcggc tccgcgccct gcccctgacc     1680

caagggcagc tggttctggt ctggtcggat gaacacgtgg gctccaagtg cctgtggaca     1740

tacgagatcc agttctctca ggacggtaag gcgtacaccc cggtcagcag gaagccatcg     1800

accttcaacc tctttgtgtt cagcccagac acaggtgctg tctctggctc ctaccgagtt     1860

cgagccctgg actactgggc ccgaccaggc cccttctcgg accctgtgcc gtacctggag     1920

gtccctgtgc caagagggcc cccatccccg ggcaatccat ga                        1962


<210>  115
<211>  1962
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-IDUA

<400>  115
atgagacccc tgagacccag agctgccctg ctggccctgc tggccagcct gctggctgcc       60

ccccctgtgg cccctgctga ggccccccac cttgtacatg tggatgctgc cagagccctg      120

tggcccctga gaagattctg gagaagcaca ggcttctgcc cccccctgcc ccacagccag      180

gctgaccagt atgtgctgag ctgggaccag cagctgaacc tggcctatgt gggggctgtg      240

ccccacagag gcatcaagca ggtgagaacc cactggctgc tggagctggt gaccaccaga      300

ggcagcacag gcagaggcct gagctacaac ttcacccacc tggatggcta cctggacctg      360

ctgagagaga accagctgct gcctggcttt gagctgatgg gcagtgccag tggccacttc      420

acagactttg aggacaagca gcaggtgttt gagtggaagg acctggtgag cagcctggcc      480

agaagataca ttggcagata tggcctggcc catgtgagca agtggaactt tgagacctgg      540

aatgagcctg accaccatga ctttgacaat gtgagcatga ccatgcaggg cttcctgaac      600

tactatgatg cctgcagtga gggcctgaga gctgccagcc ctgccctgag actggggggc      660

cctggggaca gcttccacac cccccccaga agccccctga gctggggcct gctgagacac      720

tgccatgatg gcaccaactt cttcacaggg gaggctgggg tgagactgga ctacatcagc      780

ctgcacagaa agggggccag aagcagcatc agcatcctgg agcaggagaa ggtggtggcc      840

cagcagatca gacagctgtt ccccaagttt gctgacaccc ccatctacaa tgatgaggct      900

gaccccctgg tgggctggag cctgccccag ccctggagag ctgatgtgac ctatgctgcc      960

atggtggtga aggtgattgc ccagcaccag aacctgctgc tggccaacac caccagtgcc     1020

ttcccctatg ccctgctgag caatgacaat gccttcctga gctaccaccc ccaccccttt     1080

gcccagagaa ccctgacagc cagattccag gtgaacaaca ccagaccccc ccatgtgcag     1140

ctgctgagaa agcctgtgct gacagccatg ggcctgctgg ccctgctgga tgaggagcag     1200

ctgtgggctg aggtgagcca ggctggcaca gtgctggaca gcaaccacac agtgggggtg     1260

ctggccagtg cccacagacc ccagggccct gctgatgcct ggagagctgc tgtgctgatc     1320

tatgccagtg atgacaccag agcccacccc aacagaagtg tggctgtgac cctgagactg     1380

agaggggtgc cccctggccc tggcctggtg tatgtgacca gatacctgga caatggcctg     1440

tgcagccctg atggggagtg gagaagactg ggcagacctg tgttccccac agctgagcag     1500

ttcagaagaa tgagagctgc tgaggaccct gtggctgctg cccccagacc cctgcctgct     1560

gggggcagac tgaccctgag acctgccctg agactgccca gcctgctgct ggtgcatgtg     1620

tgtgccagac ctgagaagcc ccctggccag gtgaccagac tgagagccct gcccctgacc     1680

cagggccagc tggtgctggt gtggagtgat gagcatgtgg gcagcaagtg cctgtggacc     1740

tatgagatcc agttcagcca ggatggcaag gcctacaccc ctgtgagcag aaagcccagc     1800

accttcaacc tgtttgtgtt cagccctgac acaggggctg tgagtggcag ctacagagtg     1860

agagccctgg actactgggc cagacctggc cccttcagtg accctgtgcc ctacctggag     1920

gtgcctgtgc ccagaggccc ccccagccct ggcaacccct ga                        1962


<210>  116
<211>  1578
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(1578)
<223>  Cytochrome P450 family 4 subfamily V member 2 (CYP4V2)

<400>  116
atggcggggc tctggctggg gctcgtgtgg cagaagctgc tgctgtgggg cgcggcgagt       60

gccctttccc tggccggcgc cagtctggtc ctgagcctgc tgcagagggt ggcgagctac      120

gcgcggaaat ggcagcagat gcggcccatc cccacggtgg cccgcgccta cccactggtg      180

ggccacgcgc tgctgatgaa gccggacggg cgagaatttt ttcagcagat cattgagtac      240

acagaggaat accgccacat gccgctgctg aagctctggg tcgggccagt gcccatggtg      300

gccctttata atgcagaaaa tgtggaggta attttaacta gttcaaagca aattgacaaa      360

tcctctatgt acaagttttt agaaccatgg cttggcctag gacttcttac aagtactgga      420

aacaaatggc gctccaggag aaagatgtta acacccactt tccattttac cattctggaa      480

gatttcttag atatcatgaa tgaacaagca aatatattgg ttaagaaact tgaaaaacac      540

attaaccaag aagcatttaa ctgctttttt tacatcactc tttgtgcctt agatatcatc      600

tgtgaaacag ctatggggaa gaatattggt gctcaaagta atgatgattc cgagtatgtc      660

cgtgcagttt atagaatgag tgagatgata tttcgaagaa taaagatgcc ctggctttgg      720

cttgatctct ggtatcttat gtttaaagaa ggatgggaac acaaaaagag ccttcagatc      780

ctacatactt ttaccaacag tgtcatcgct gaacgggcca atgaaatgaa cgccaatgaa      840

gactgtagag gtgatggcag gggctctgcc ccctccaaaa ataaacgcag ggcctttctt      900

gacttgcttt taagtgtgac tgatgacgaa gggaacaggc taagtcatga agatattcga      960

gaagaagttg acaccttcat gtttgagggg cacgatacaa ctgcagctgc aataaactgg     1020

tccttatacc tgttgggttc taacccagaa gtccagaaaa aagtggatca tgaattggat     1080

gacgtgtttg ggaagtctga ccgtcccgct acagtagaag acctgaagaa acttcggtat     1140

ctggaatgtg ttattaagga gacccttcgc ctttttcctt ctgttccttt atttgcccgt     1200

agtgttagtg aagattgtga agtggcaggt tacagagttc taaaaggcac tgaagccgtc     1260

atcattccct atgcattgca cagagatccg agatacttcc ccaaccccga ggagttccag     1320

cctgagcggt tcttccccga gaatgcacaa gggcgccatc catatgccta cgtgcccttc     1380

tctgctggcc ccaggaactg tataggtcaa aagtttgctg tgatggaaga aaagaccatt     1440

ctttcgtgca tcctgaggca cttttggata gaatccaacc agaaaagaga agagcttggt     1500

ctagaaggac agttgattct tcgtccaagt aatggcatct ggatcaagtt gaagaggaga     1560

aatgcagatg aacgctaa                                                   1578


<210>  117
<211>  711
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(711)
<223>  Retinoschisin 1 (RS1)

<400>  117
atgagccgca agatagaagg ctttttgtta ttacttctct ttggctatga agccacattg       60

ggattatcgt ctaccgagga tgaaggcgag gacccctggt atcaaaaagc atgcgatgaa      120

ggcgaggacc cctggtatca aaaagcatgc aagtgcgatt gccaaggagg acccaatgct      180

ctgtggtctg caggtgccac ctccttggac tgtataccag aatgcccata tcacaagcct      240

ctgggtttcg agtcagggga ggtcacaccg gaccagatca cctgctctaa cccggagcag      300

tatgtgggct ggtattcttc gtggactgca aacaaggccc ggctcaacag tcaaggcttt      360

gggtgtgcct ggctctccaa gttccaggac agtagccagt ggttacagat agatctgaag      420

gagatcaaag tgatttcagg gatcctcacc caggggcgct gtgacatcga tgagtggatg      480

accaagtaca gcgtgcagta caggaccgat gagcgcctga actggattta ctacaaggac      540

cagactggaa acaaccgggt cttctatggc aactcggacc gcacctccac ggttcagaac      600

ctgctgcggc cccccatcat ctcccgcttc atccgcctca tcccgctggg ctggcacgtc      660

cgcattgcca tccggatgga gctgctggag tgcgtcagca agtgtgcctg a               711


<210>  118
<211>  2565
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(2565)
<223>  Phosphodiesterase 6B (PDE6B)

<400>  118
atgagcctca gtgaggagca ggcccggagc tttctggacc agaaccccga ttttgcccgc       60

cagtactttg ggaagaaact gagccctgag aatgtggccg cggcctgcga ggacgggtgc      120

ccgccggact gcgacagcct ccgggacctc tgccaggtgg aggagagcac ggcgctgctg      180

gagctggtgc aggatatgca ggagagcatc aacatggagc gcgtggtctt caaggtcctg      240

cggcgcctct gcaccctgct gcaggccgac cgctgcagcc tcttcatgta ccgccagcgc      300

aacggcgtgg ccgagctggc caccaggctt ttcagcgtgc agccggacag cgtcctggag      360

gactgcctgg tgccccccga ctccgagatc gtcttcccac tggacatcgg ggtcgtgggc      420

cacgtggctc agaccaaaaa gatggtgaac gtcgaggacg tggccgagtg ccctcacttc      480

agctcatttg ctgacgagct cactgactac aagacaaaga atatgctggc cacacccatc      540

atgaatggca aagacgtcgt ggcggtgatc atggcagtga acaagctcaa cggcccattc      600

ttcaccagcg aagacgaaga tgtgttcttg aagtacctga attttgccac gttgtacctg      660

aaaatctatc acctgagcta cctccacaac tgcgagacgc gccgcggcca ggtgctgctg      720

tggtcggcca acaaggtgtt tgaggagctg acggacatcg agaggcagtt ccacaaggcc      780

ttctacacgg tgcgggccta cctcaactgc gagcggtact ccgtgggcct cctggacatg      840

accaaggaga aggaattttt tgacgtgtgg tctgtgctga tgggagagtc ccagccgtac      900

tcgggcccac gcacgcctga tggccgggaa attgtcttct acaaagtgat cgactacatc      960

ctccacggca aggaggagat caaggtcatt cccacaccct cagccgatca ctgggccctg     1020

gccagcggcc ttccaagcta cgtggcagaa agcggcttta tttgtaacat catgaatgct     1080

tccgctgacg aaatgttcaa atttcaggaa ggggccctgg acgactccgg gtggctcatc     1140

aagaatgtgc tgtccatgcc catcgtcaac aagaaggagg agattgtggg agtcgccaca     1200

ttttacaaca ggaaagacgg gaagcccttt gacgaacagg acgaggttct catggagtcc     1260

ctgacacagt tcctgggctg gtcagtgatg aacaccgaca cctacgacaa gatgaacaag     1320

ctggagaacc gcaaggacat cgcacaggac atggtccttt accacgtgaa gtgcgacagg     1380

gacgagatcc agctcatcct gccaaccaga gcgcgcctgg ggaaggagcc tgctgactgc     1440

gatgaggacg agctgggcga aatcctgaag gaggagctgc cagggcccac cacatttgac     1500

atctacgaat tccacttctc tgacctggag tgcaccgaac tggacctggt caaatgtggc     1560

atccagatgt actacgagct gggcgtggtc cgaaagttcc agatccccca ggaggtcctg     1620

gtgcggttcc tgttctccat cagcaaaggc taccggagaa tcacctacca caactggcgc     1680

cacggcttca acgtggccca gacgatgttc acgctgctca tgaccggcaa actgaagagc     1740

tactacacgg acctggaggc cttcgccatg gtgacagccg gcctgtgcca tgacatcgac     1800

caccgcggca ccaacaacct gtaccagatg aagtcccaga accccttggc taaactccac     1860

ggctcctcga ttttggagcg gcaccacctg gagtttggga agttcctgct ctcggaggag     1920

accctgaaca tctaccagaa cctgaaccgg cggcagcacg agcacgtgat ccacctgatg     1980

gacatcgcca tcatcgccac ggacctggcc ctgtacttca agaagagagc gatgtttcag     2040

aagatcgtgg atgagtccaa gaactaccag gacaagaaga gctgggtgga gtacctgtcc     2100

ctggagacga cccggaagga gatcgtcatg gccatgatga tgacagcctg cgacctgtct     2160

gccatcacca agccctggga agtccagagc aaggtcgcac ttctcgtggc tgctgagttc     2220

tgggagcaag gtgacttgga aaggacagtc ttggatcagc agcccattcc tatgatggac     2280

cggaacaagg cggccgagct ccccaagctg caagtgggct tcatcgactt cgtgtgcaca     2340

ttcgtgtaca aggagttctc tcgtttccac gaagagatcc tgcccatgtt cgaccgactg     2400

cagaacaata ggaaagagtg gaaggcgctg gctgatgagt atgaggccaa agtgaaggct     2460

ctggaggaga aggaggagga ggagagggtg gcagccaaga aagtaggcac agaaatttgc     2520

aatggcggcc cagcacccaa gtcttcaacc tgctgtatcc tgtga                     2565


<210>  119
<211>  1497
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(1497)
<223>  Methyl-CpG Binding Protein (MeCP2)

<400>  119
atggccgccg ccgccgccgc cgcgccgagc ggaggaggag gaggaggcga ggaggagaga       60

ctggaagaaa agtcagaaga ccaggacctc cagggcctca aggacaaacc cctcaagttt      120

aaaaaggtga agaaagataa gaaagaagag aaagagggca agcatgagcc cgtgcagcca      180

tcagcccacc actctgctga gcccgcagag gcaggcaaag cagagacatc agaagggtca      240

ggctccgccc cggctgtgcc ggaagcttct gcctccccca aacagcggcg ctccatcatc      300

cgtgaccggg gacccatgta tgatgacccc accctgcctg aaggctggac acggaagctt      360

aagcaaagga aatctggccg ctctgctggg aagtatgatg tgtatttgat caatccccag      420

ggaaaagcct ttcgctctaa agtggagttg attgcgtact tcgaaaaggt aggcgacaca      480

tccctggacc ctaatgattt tgacttcacg gtaactggga gagggagccc ctcccggcga      540

gagcagaaac cacctaagaa gcccaaatct cccaaagctc caggaactgg cagaggccgg      600

ggacgcccca aagggagcgg caccacgaga cccaaggcgg ccacgtcaga gggtgtgcag      660

gtgaaaaggg tcctggagaa aagtcctggg aagctccttg tcaagatgcc ttttcaaact      720

tcgccagggg gcaaggctga ggggggtggg gccaccacat ccacccaggt catggtgatc      780

aaacgccccg gcaggaagcg aaaagctgag gccgaccctc aggccattcc caagaaacgg      840

ggccgaaagc cggggagtgt ggtggcagcc gctgccgccg aggccaaaaa gaaagccgtg      900

aaggagtctt ctatccgatc tgtgcaggag accgtactcc ccatcaagaa gcgcaagacc      960

cgggagacgg tcagcatcga ggtcaaggaa gtggtgaagc ccctgctggt gtccaccctc     1020

ggtgagaaga gcgggaaagg actgaagacc tgtaagagcc ctgggcggaa aagcaaggag     1080

agcagcccca aggggcgcag cagcagcgcc tcctcacccc ccaagaagga gcaccaccac     1140

catcaccacc actcagagtc cccaaaggcc cccgtgccac tgctcccacc cctgccccca     1200

cctccacctg agcccgagag ctccgaggac cccaccagcc cccctgagcc ccaggacttg     1260

agcagcagcg tctgcaaaga ggagaagatg cccagaggag gctcactgga gagcgacggc     1320

tgccccaagg agccagctaa gactcagccc gcggttgcca ccgccgccac ggccgcagaa     1380

aagtacaaac accgagggga gggagagcgc aaagacattg tttcatcctc catgccaagg     1440

ccaaacagag aggagcctgt ggacagccgg acgcccgtga ccgagagagt tagctag        1497


<210>  120
<211>  2232
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(2232)
<223>  N-acetyl-alpha-glucosaminidase (NAGLU)

<400>  120
atggaggcgg tggcggtggc cgcggcggtg ggggtccttc tcctggccgg ggccgggggc       60

gcggcaggcg acgaggcccg ggaggcggcg gccgtgcggg cgctcgtggc ccggctgctg      120

gggccaggcc ccgcggccga cttctccgtg tcggtggagc gcgctctggc tgccaagccg      180

ggcttggaca cctacagcct gggcggcggc ggcgcggcgc gcgtgcgggt gcgcggctcc      240

acgggcgtgg cggccgccgc ggggctgcac cgctacctgc gcgacttctg tggctgccac      300

gtggcctggt ccggctctca gctgcgcctg ccgcggccac tgccagccgt gccgggggag      360

ctgaccgagg ccacgcccaa caggtaccgc tattaccaga atgtgtgcac gcaaagctac      420

tccttcgtgt ggtgggactg ggcccgctgg gagcgagaga tagactggat ggcgctgaat      480

ggcatcaacc tggcactggc ctggagcggc caggaggcca tctggcagcg ggtgtacctg      540

gccttgggcc tgacccaggc agagatcaat gagttcttta ctggtcctgc cttcctggcc      600

tgggggcgaa tgggcaacct gcacacctgg gatggccccc tgcccccctc ctggcacatc      660

aagcagcttt acctgcagca ccgggtcctg gaccagatgc gctccttcgg catgacccca      720

gtgctgcctg cattcgcggg gcatgttccc gaggctgtca ccagggtgtt ccctcaggtc      780

aatgtcacga agatgggcag ttggggccac tttaactgtt cctactcctg ctccttcctt      840

ctggctccgg aagaccccat attccccatc atcgggagcc tcttcctgcg agagctgatc      900

aaagagtttg gcacagacca catctatggg gccgacactt tcaatgagat gcagccacct      960

tcctcagagc cctcctacct tgccgcagcc accactgccg tctatgaggc catgactgca     1020

gtggatactg aggctgtgtg gctgctccaa ggctggctct tccagcacca gccgcagttc     1080

tgggggcccg cccagatcag ggctgtgctg ggagctgtgc cccgtggccg cctcctggtt     1140

ctggacctgt ttgctgagag ccagcctgtg tatacccgca ctgcctcctt ccagggccag     1200

cccttcatct ggtgcatgct gcacaacttt gggggaaacc atggtctttt tggagcccta     1260

gaggctgtga acggaggccc agaagctgcc cgcctcttcc ccaactccac catggtaggc     1320

acgggcatgg cccccgaggg catcagccag aacgaagtgg tctattccct catggctgag     1380

ctgggctggc gaaaggaccc agtgccagat ttggcagcct gggtgaccag ctttgccgcc     1440

cggcggtatg gggtctccca cccggacgca ggggcagcgt ggaggctact gctccggagt     1500

gtgtacaact gctccgggga ggcctgcagg ggccacaatc gtagcccgct ggtcaggcgg     1560

ccgtccctac agatgaatac cagcatctgg tacaaccgat ctgatgtgtt tgaggcctgg     1620

cggctgctgc tcacatctgc tccctccctg gccaccagcc ccgccttccg ctacgacctg     1680

ctggacctca ctcggcaggc agtgcaggag ctggtcagct tgtactatga ggaggcaaga     1740

agcgcctacc tgagcaagga gctggcctcc ctgttgaggg ctggaggcgt cctggcctat     1800

gagctgctgc cggcactgga cgaggtgctg gctagtgaca gccgcttctt gctgggcagc     1860

tggctagagc aggcccgagc agcggcagtc agtgaggccg aggccgattt ctacgagcag     1920

aacagccgct accagctgac cttgtggggg ccagaaggca acatcctgga ctatgccaac     1980

aagcagctgg cggggttggt ggccaactac tacacccctc gctggcggct tttcctggag     2040

gcgctggttg acagtgtggc ccagggcatc cctttccaac agcaccagtt tgacaaaaat     2100

gtcttccaac tggagcaggc cttcgttctc agcaagcaga ggtaccccag ccagccgcga     2160

ggagacactg tggacctggc caagaagatc ttcctcaaat attaccccgg ctgggtggcc     2220

ggctcttggt ga                                                         2232


<210>  121
<211>  1317
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(1317)
<223>  Ceroid Lipofuscinosis, Neuronal 3 (CLN3)

<400>  121
atgggaggct gtgcaggctc gcggcggcgc ttttcggatt ccgaggggga ggagaccgtc       60

ccggagcccc ggctccctct gttggaccat cagggcgcgc attggaagaa cgcggtgggc      120

ttctggctgc tgggcctttg caacaacttc tcttatgtgg tgatgctgag tgccgcccac      180

gacatcctta gccacaagag gacatcggga aaccagagcc atgtggaccc aggcccaacg      240

ccgatccccc acaacagctc atcacgattt gactgcaact ctgtctctac ggctgctgtg      300

ctcctggcgg acatcctccc cacactcgtc atcaaattgt tggctcctct tggccttcac      360

ctgctgccct acagcccccg ggttctcgtc agtgggattt gtgctgctgg aagcttcgtc      420

ctggttgcct tttctcattc tgtggggacc agcctgtgtg gtgtggtctt cgctagcatc      480

tcatcaggcc ttggggaggt caccttcctc tccctcactg ccttctaccc cagggccgtg      540

atctcctggt ggtcctcagg gactggggga gctgggctgc tgggggccct gtcctacctg      600

ggcctcaccc aggccggcct ctcccctcag cagaccctgc tgtccatgct gggtatccct      660

gccctgctgc tggccagcta tttcttgttg ctcacatctc ctgaggccca ggaccctgga      720

ggggaagaag aagcagagag cgcagcccgg cagcccctca taagaaccga ggccccggag      780

tcgaagccag gctccagctc cagcctctcc cttcgggaaa ggtggacagt gttcaagggt      840

ctgctgtggt acattgttcc cttggtcgta gtttactttg ccgagtattt cattaaccag      900

ggactttttg aactcctctt tttctggaac acttccctga gtcacgctca gcaataccgc      960

tggtaccaga tgctgtacca ggctggcgtc tttgcctccc gctcttctct ccgctgctgt     1020

cgcatccgtt tcacctgggc cctggccctg ctgcagtgcc tcaacctggt gttcctgctg     1080

gcagacgtgt ggttcggctt tctgccaagc atctacctcg tcttcctgat cattctgtat     1140

gaggggctcc tgggaggcgc agcctacgtg aacaccttcc acaacatcgc cctggagacc     1200

agtgatgagc accgggagtt tgcaatggcg gccacctgca tctctgacac actggggatc     1260

tccctgtcgg ggctcctggc tttgcctctg catgacttcc tctgccagct ctcctga        1317


<210>  122
<211>  1317
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-CLN3

<400>  122
atgggaggat gtgctgggtc aagaagacgg tttagcgatt ccgaaggaga ggagactgtg       60

cctgagccaa gactgcccct gctggatcac cagggagcac actggaagaa cgcagtggga      120

ttctggctgc tgggcctgtg caacaacttc agctacgtgg tcatgctgtc cgccgcccac      180

gacatcctgt cccacaagcg gacctccggc aatcagtctc acgtggaccc cggccctaca      240

ccaatccccc acaacagcag cagccggttc gactgtaatt ccgtgtctac cgcagccgtg      300

ctgctggcag acatcctgcc caccctggtc atcaagctgc tggcaccact gggcctgcac      360

ctgctgcctt attctccaag ggtgctggtg agcggcatct gcgcagcagg cagcttcgtg      420

ctggtggcct ttagccactc cgtgggcacc tctctgtgcg gagtggtgtt tgcaagcatc      480

agctccggcc tgggagaggt gaccttcctg agcctgacag ccttttaccc tcgcgccgtg      540

atctcctggt ggtctagcgg cacaggagga gcaggcctgc tgggcgccct gtcctatctg      600

ggcctgaccc aggcaggcct gtccccacag cagacactgc tgtctatgct gggcatccct      660

gccctgctgc tggcaagcta cttcctgctg ctgacctccc cagaggcaca ggaccccgga      720

ggagaggagg aggccgagag cgccgcaagg cagccactga tcaggaccga ggcaccagag      780

tccaagcctg gctcctctag ctccctgtct ctgcgggaga gatggacagt gttcaagggc      840

ctgctgtggt acatcgtgcc cctggtggtg gtgtacttcg ccgagtactt catcaaccag      900

ggcctgtttg agctgctgtt cttttggaat acctctctga gccacgccca gcagtaccgg      960

tggtatcaga tgctgtatca ggcaggcgtg ttcgcctccc ggtctagcct gagatgctgt     1020

cggatcagat tcacctgggc actggccctg ctgcagtgcc tgaacctggt gttcctgctg     1080

gccgacgtgt ggttcggctt tctgccctct atctacctgg tgtttctgat catcctgtat     1140

gagggcctgc tgggaggagc agcctatgtg aacaccttcc acaatatcgc cctggagaca     1200

tctgacgagc acagagagtt tgctatggcc gccacctgta tcagcgatac actgggcatc     1260

tctctgagcg gactgctggc tctgcctctg catgactttc tgtgccagct gagttaa        1317


<210>  123
<211>  2859
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(2859)
<223>  Acid Alpha-Glucosidase (GAA)

<400>  123
atgggagtga ggcacccgcc ctgctcccac cggctcctgg ccgtctgcgc cctcgtgtcc       60

ttggcaaccg ctgcactcct ggggcacatc ctactccatg atttcctgct ggttccccga      120

gagctgagtg gctcctcccc agtcctggag gagactcacc cagctcacca gcagggagcc      180

agcagaccag ggccccggga tgcccaggca caccccggcc gtcccagagc agtgcccaca      240

cagtgcgacg tcccccccaa cagccgcttc gattgcgccc ctgacaaggc catcacccag      300

gaacagtgcg aggcccgcgg ctgttgctac atccctgcaa agcaggggct gcagggagcc      360

cagatggggc agccctggtg cttcttccca cccagctacc ccagctacaa gctggagaac      420

ctgagctcct ctgaaatggg ctacacggcc accctgaccc gtaccacccc caccttcttc      480

cccaaggaca tcctgaccct gcggctggac gtgatgatgg agactgagaa ccgcctccac      540

ttcacgatca aagatccagc taacaggcgc tacgaggtgc ccttggagac cccgcatgtc      600

cacagccggg caccgtcccc actctacagc gtggagttct ccgaggagcc cttcggggtg      660

atcgtgcgcc ggcagctgga cggccgcgtg ctgctgaaca cgacggtggc gcccctgttc      720

tttgcggacc agttccttca gctgtccacc tcgctgccct cgcagtatat cacaggcctc      780

gccgagcacc tcagtcccct gatgctcagc accagctgga ccaggatcac cctgtggaac      840

cgggaccttg cgcccacgcc cggtgcgaac ctctacgggt ctcacccttt ctacctggcg      900

ctggaggacg gcgggtcggc acacggggtg ttcctgctaa acagcaatgc catggatgtg      960

gtcctgcagc cgagccctgc ccttagctgg aggtcgacag gtgggatcct ggatgtctac     1020

atcttcctgg gcccagagcc caagagcgtg gtgcagcagt acctggacgt tgtgggatac     1080

ccgttcatgc cgccatactg gggcctgggc ttccacctgt gccgctgggg ctactcctcc     1140

accgctatca cccgccaggt ggtggagaac atgaccaggg cccacttccc cctggacgtc     1200

cagtggaacg acctggacta catggactcc cggagggact tcacgttcaa caaggatggc     1260

ttccgggact tcccggccat ggtgcaggag ctgcaccagg gcggccggcg ctacatgatg     1320

atcgtggatc ctgccatcag cagctcgggc cctgccggga gctacaggcc ctacgacgag     1380

ggtctgcgga ggggggtttt catcaccaac gagaccggcc agccgctgat tgggaaggta     1440

tggcccgggt ccactgcctt ccccgacttc accaacccca cagccctggc ctggtgggag     1500

gacatggtgg ctgagttcca tgaccaggtg cccttcgacg gcatgtggat tgacatgaac     1560

gagccttcca acttcatcag gggctctgag gacggctgcc ccaacaatga gctggagaac     1620

ccaccctacg tgcctggggt ggttgggggg accctccagg cggccaccat ctgtgcctcc     1680

agccaccagt ttctctccac acactacaac ctgcacaacc tctacggcct gaccgaagcc     1740

atcgcctccc acagggcgct ggtgaaggct cgggggacac gcccatttgt gatctcccgc     1800

tcgacctttg ctggccacgg ccgatacgcc ggccactgga cgggggacgt gtggagctcc     1860

tgggagcagc tcgcctcctc cgtgccagaa atcctgcagt ttaacctgct gggggtgcct     1920

ctggtcgggg ccgacgtctg cggcttcctg ggcaacacct cagaggagct gtgtgtgcgc     1980

tggacccagc tgggggcctt ctaccccttc atgcggaacc acaacagcct gctcagtctg     2040

ccccaggagc cgtacagctt cagcgagccg gcccagcagg ccatgaggaa ggccctcacc     2100

ctgcgctacg cactcctccc ccacctctac acactgttcc accaggccca cgtcgcgggg     2160

gagaccgtgg cccggcccct cttcctggag ttccccaagg actctagcac ctggactgtg     2220

gaccaccagc tcctgtgggg ggaggccctg ctcatcaccc cagtgctcca ggccgggaag     2280

gccgaagtga ctggctactt ccccttgggc acatggtacg acctgcagac ggtgccagta     2340

gaggcccttg gcagcctccc acccccacct gcagctcccc gtgagccagc catccacagc     2400

gaggggcagt gggtgacgct gccggccccc ctggacacca tcaacgtcca cctccgggct     2460

gggtacatca tccccctgca gggccctggc ctcacaacca cagagtcccg ccagcagccc     2520

atggccctgg ctgtggccct gaccaagggt ggggaggccc gaggggagct gttctgggac     2580

gatggagaga gcctggaagt gctggagcga ggggcctaca cacaggtcat cttcctggcc     2640

aggaataaca cgatcgtgaa tgagctggta cgtgtgacca gtgagggagc tggcctgcag     2700

ctgcagaagg tgactgtcct gggcgtggcc acggcgcccc agcaggtcct ctccaacggt     2760

gtccctgtct ccaacttcac ctacagcccc gacaccaagg tcctggacat ctgtgtctcg     2820

ctgttgatgg gagagcagtt tctcgtcagc tggtgttag                            2859


<210>  124
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-GAA

<400>  124
atgggagtcc gccacccgcc ctgctcacat cgcctgcttg ctgtctgtgc cctcgtgtca       60

cttgctaccg ccgcgctgct tggtcacatt ctgctgcacg actttttact agttccgagg      120

gaactgtcgg gatccagccc cgtgctcgag gaaactcacc ccgcgcacca acagggggcg      180

tccaggccgg gaccgcgcga cgcccaggcc cacccgggcc ggcctcgggc cgtgccaact      240

cagtgcgatg tgccgccgaa ctcccgcttc gactgtgcgc ctgacaaggc cataacccag      300

gaacagtgcg aagcacgcgg ctgctgctat attccggcga agcagggctt gcagggtgcc      360

caaatgggtc agccttggtg cttctttccc ccgtcgtacc cctcgtacaa gctggagaac      420

ctgagcagca gcgaaatggg gtacaccgcc actctgaccc ggacgacccc gaccttcttc      480

ccgaaagaca tcctgaccct gcggctggat gtgatgatgg aaactgagaa cagactgcac      540

ttcactatca aggaccccgc gaaccgcaga tatgaggtgc cactggaaac ccctcatgtg      600

cattcccggg ccccatcccc tctgtactcg gtggaattct ccgaagaacc cttcggggtc      660

attgtgcgcc ggcagcttga tggccgggtc ctgctcaaca ccaccgtggc accccttttc      720

ttcgctgacc agttcctcca gctgagcacc tcgctgccga gccagtacat caccggactg      780

gccgagcacc tctcccctct gatgctgtcc actagctgga ctaggatcac tctgtggaac      840

cgggatctgg cccctacccc gggcgcgaac ctgtacggat cgcacccctt ctacctggcc      900

ctcgaggacg gaggctccgc ccacggagtg ttcctgctga actccaacgc tatggacgtg      960

gtgctccagc cgtcccctgc actgtcctgg cggagcacag ggggtattct ggatgtctac     1020

atcttcctcg gcccggagcc aaagtccgtg gtgcaacagt atctggatgt cgtgggttac     1080

ccattcatgc cgccatactg gggccttggc ttccacctgt gccgctgggg atacagctcc     1140

accgccatca ctagacaggt cgtggaaaac atgactagag cccacttccc cctcgatgtc     1200

cagtggaatg acctggacta catggattcc agacgcgact tcactttcaa caaggatgga     1260

ttcagagatt tccccgctat ggtccaagaa ctgcaccagg gtggccggcg gtacatgatg     1320

attgtggacc ccgccatttc aagctccgga ccagcgggct cgtaccggcc ctacgacgaa     1380

ggtttgcgcc gcggcgtgtt catcactaac gaaaccggcc agccactgat tgggaaggtc     1440

tggcctggaa gcaccgcgtt cccggacttc actaacccaa cggccttggc gtggtgggag     1500

gacatggtgg ccgaattcca cgaccaagtc ccattcgacg gaatgtggat cgacatgaac     1560

gagcccagca acttcatccg aggctccgag gacggctgcc ctaacaacga acttgagaac     1620

cctccgtacg tgcctggcgt cgtcggcgga acactgcagg ccgctacgat ctgtgcctca     1680

tcgcatcagt tcctgtcaac ccactacaac ctccataatc tgtacggcct caccgaagcc     1740

atcgcctccc accgggccct ggtcaaggcc cgggggacta ggcccttcgt gattagccgg     1800

agcactttcg ccggacacgg aagatacgcc ggacattgga ccggcgacgt gtggtcatcg     1860

tgggagcagc tcgcctcctc cgtccccgaa atcctgcagt tcaatctcct gggagtcccc     1920

ctcgtgggcg cggacgtgtg cggattcctg ggcaatacct ctgaggagct gtgcgtgaga     1980

tggacccagc tgggggcgtt ctaccccttc atgcggaacc acaactcact gctgtccctg     2040

cctcaagagc cgtactcatt ctccgagccg gcacaacagg ccatgcgaaa ggctctgacc     2100

ctccgctatg cgctcttgcc ccacctctac actctgtttc accaagccca tgtcgcgggc     2160

gaaacagtgg ccagaccact ctttctggaa ttcccaaagg actcctcaac ctggactgtg     2220

gatcatcagc tgctctgggg agaggcactg ctgatcaccc cggtgctcca agccggaaag     2280

gcggaagtga ccggatactt ccctctcggt acttggtacg acctccaaac cgtgccggtc     2340

gaggccctgg gcagcttgcc tccgccgccg gctgccccgc gggagcctgc aatccactcc     2400

gaggggcaat gggtgaccct ccctgcacca ctggacacca tcaacgtgca cctccgggcc     2460

ggctacatca tcccgctgca aggaccgggt ctgactacca ccgaatcccg gcagcagccc     2520

atggcactgg ccgtggccct gaccaaggga ggggaagcac ggggagaact cttttgggac     2580

gatggagaat ccctggaagt gctcgagcgg ggagcctaca ctcaagtcat ctttcttgcc     2640

cgcaacaaca ccatcgtgaa cgaattggtc cgcgtgacct ccgagggggc cggactccag     2700

ctgcaaaaag tgaccgtgct gggggtggca accgccccgc aacaagtgtt gtctaacgga     2760

gtgccggtgt ccaacttcac ctactcccct gataccaaag ttctagatat ttgcgtgagc     2820

ctgctgatgg gagaacagtt cctggtgtcc tggtgctga                            2859


<210>  125
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO2-GAA

<400>  125
atgggagtta gacaccctcc atgtagccac agactgctgg ccgtgtgtgc tctggtgtct       60

ctggctacag ctgccctgct gggacatatc ctgctgcacg acttcttact agttcccaga      120

gagctgtccg gcagcagccc tgtgctggaa gaaacacacc ctgcacatca gcagggcgcc      180

tctagacctg gacctagaga tgctcaggcc catcctggca gacctagagc tgtgcccaca      240

cagtgtgacg tgccacctaa cagcagattc gactgcgccc ctgacaaggc catcacacaa      300

gagcagtgtg aagccagagg ctgctgctac atccctgcca aacaaggact gcagggcgct      360

cagatgggac agccctggtg cttcttccca ccatcttacc ccagctacaa gctggaaaac      420

ctgagcagca gcgagatggg ctacaccgcc acactgacca gaaccacacc tacattcttc      480

ccgaaggaca tcctgacact gcggctggac gtgatgatgg aaaccgagaa ccggctgcac      540

ttcaccatca aggaccccgc caatcggaga tacgaggtgc cactggaaac ccctcacgtg      600

cactctagag ccccatctcc actgtacagc gtggaattca gcgaggaacc cttcggcgtg      660

atcgtgcgga gacagctgga tggaagagtg ctgctgaaca ccacagtggc ccctctgttc      720

ttcgccgacc agtttctgca gctgtccacc agcctgccta gccagtatat cacaggcctg      780

gccgagcacc tgtctccact gatgctgtct accagctgga cccggatcac cctgtggaac      840

agggatcttg ctcctacacc tggcgccaac ctgtacggct ctcacccttt ttatctggcc      900

ctggaagatg gcggatctgc ccacggtgtc tttctgctga actccaacgc catggacgtg      960

gtgctgcagc catctcctgc tctgtcttgg agaagcacag gcggcatcct ggacgtgtac     1020

atctttctgg gccccgagcc taagagcgtg gtgcagcagt atctggacgt cgtgggctac     1080

cccttcatgc ctccttattg gggcctgggc ttccacctgt gcagatgggg atacagcagc     1140

accgccatca ccagacaggt ggtggaaaac atgacccggg ctcacttccc actggatgtg     1200

cagtggaacg acctggacta catggacagc agacgggact tcaccttcaa caaggacggc     1260

ttcagagact tccccgccat ggtgcaagaa ctgcaccaag gcggcagacg gtacatgatg     1320

atcgtggatc cagccatcag ctctagcggc cctgccggct cttacagacc ttacgatgag     1380

ggcctgagaa gaggcgtgtt catcaccaac gagacaggcc agcctctgat cggcaaagtg     1440

tggcctggca gcacagcctt tccagacttc acaaacccca ccgctctggc ttggtgggaa     1500

gatatggtgg ccgagtttca cgatcaggtg cccttcgacg gcatgtggat cgacatgaac     1560

gagcccagca acttcatccg gggcagcgag gatggctgcc ccaacaacga actggaaaat     1620

cctccttacg tgcccggcgt tgtcggcgga acacttcagg ccgctacaat ctgtgccagc     1680

agccaccagt tcctcagcac ccactacaac ctgcacaatc tgtatggcct gaccgaggcc     1740

attgccagcc atagagccct ggttaaggcc aggggcacca gacctttcgt gatcagcaga     1800

agcaccttcg ccggccacgg cagatatgcc ggacattgga caggcgacgt gtggtctagt     1860

tgggagcagc tggctagcag cgtgccagag atcctgcagt tcaatctgct gggcgtgcca     1920

ctcgtgggag ccgatgtttg tggcttcctg ggcaacacct ccgaggaact gtgtgtgcgt     1980

tggacacagc tgggcgcctt ctatcccttc atgagaaacc acaacagcct tctcagcctg     2040

ccacaagagc cctacagctt ctctgagcct gcacagcagg ccatgagaaa ggccctgact     2100

ctgagatacg ctctgctgcc ccacctgtac accctgtttc accaggctca tgtggccggg     2160

gagacagtgg ctagacctct gttcctggaa ttccccaagg acagctccac ctggaccgtg     2220

gatcatcagc tgctgtgggg agaagccctg ctcatcacac ctgttctgca ggccggaaag     2280

gccgaagtga ccggctattt tcctctcggc acttggtacg acctgcagac cgtgcctgtt     2340

gaggctctgg gatctcttcc tccacctcct gccgctccta gagagcctgc cattcactct     2400

gaaggccagt gggttaccct gcctgctcct ctggacacca tcaacgtgca cctgagagct     2460

ggctacatca tccctctgca aggccctggc ctgacaacca ccgaatctag acagcagccc     2520

atggctctgg ccgtggcttt gacaaaaggc ggagaggcta gaggcgagct gttctgggat     2580

gatggcgaga gcctggaagt gctggaacgg ggcgcttata cccaagtgat cttcctggcc     2640

agaaacaaca ccatcgtgaa cgaactcgtg cgcgtgacca gtgaaggtgc tggactgcaa     2700

ctgcagaaag tgaccgtgct cggagtggcc acagcacctc agcaggttct gtctaatggc     2760

gtgcccgtgt ccaacttcac atacagcccc gacaccaagg tcctggacat ctgtgtgtca     2820

ctgctgatgg gcgagcagtt cctggtgtcc tggtgttga                            2859


<210>  126
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO3-GAA

<400>  126
atgggggtga gacacccccc ctgcagccac agactgctgg ctgtgtgtgc cctggtgagc       60

ctggccacag ctgccctgct gggccacatc ctgctgcatg acttcctact agtgcccaga      120

gagctgagtg gcagcagccc tgtgctggag gagacccacc ctgcccacca gcagggggcc      180

agcagacctg gccccagaga tgcccaggcc caccctggca gacccagagc tgtgcccacc      240

cagtgtgatg tgccccccaa cagcagattt gactgtgccc ctgacaaggc catcacccag      300

gagcagtgtg aggccagagg ctgctgctac atccctgcca agcagggcct gcagggggcc      360

cagatgggcc agccctggtg cttcttcccc cccagctacc ccagctacaa gctggagaac      420

ctgagcagca gtgagatggg ctacacagcc accctgacca gaaccacccc caccttcttc      480

cccaaggaca tcctgaccct gagactggat gtgatgatgg agacagagaa cagactgcac      540

ttcaccatca aggaccctgc caacagaaga tatgaggtgc ccctggagac cccccatgtg      600

cacagcagag cccccagccc cctgtacagt gtggagttca gtgaggagcc ctttggggtg      660

attgtgagaa gacagctgga tggcagagtg ctgctgaaca ccacagtggc ccccctgttc      720

tttgctgacc agttcctgca gctgagcacc agcctgccca gccagtacat cacaggcctg      780

gctgagcacc tgagccccct gatgctgagc accagctgga ccagaatcac cctgtggaac      840

agagacctgg cccccacccc tggggccaac ctgtatggca gccacccctt ctacctggcc      900

ctggaggatg ggggcagtgc ccatggggtg ttcctgctga acagcaatgc catggatgtg      960

gtgctgcagc ccagccctgc cctgagctgg agaagcacag ggggcatcct ggatgtgtac     1020

atcttcctgg gccctgagcc caagagtgtg gtgcagcagt acctggatgt ggtgggctac     1080

cccttcatgc ccccctactg gggcctgggc ttccacctgt gcagatgggg ctacagcagc     1140

acagccatca ccagacaggt ggtggagaac atgaccagag cccacttccc cctggatgtg     1200

cagtggaatg acctggacta catggacagc agaagagact tcaccttcaa caaggatggc     1260

ttcagagact tccctgccat ggtgcaggag ctgcaccagg ggggcagaag atacatgatg     1320

attgtggacc ctgccatcag cagcagtggc cctgctggca gctacagacc ctatgatgag     1380

ggcctgagaa gaggggtgtt catcaccaat gagacaggcc agcccctgat tggcaaggtg     1440

tggcctggca gcacagcctt ccctgacttc accaacccca cagccctggc ctggtgggag     1500

gacatggtgg ctgagttcca tgaccaggtg ccctttgatg gcatgtggat tgacatgaat     1560

gagcccagca acttcatcag aggcagtgag gatggctgcc ccaacaatga gctggagaac     1620

cccccctatg tgcctggggt ggtggggggc accctgcagg ctgccaccat ctgtgccagc     1680

agccaccagt tcctgagcac ccactacaac ctgcacaacc tgtatggcct gacagaggcc     1740

attgccagcc acagagccct ggtgaaggcc agaggcacca gaccctttgt gatcagcaga     1800

agcacctttg ctggccatgg cagatatgct ggccactgga caggggatgt gtggagcagc     1860

tgggagcagc tggccagcag tgtgcctgag atcctgcagt tcaacctgct gggggtgccc     1920

ctggtggggg ctgatgtgtg tggcttcctg ggcaacacca gtgaggagct gtgtgtgaga     1980

tggacccagc tgggggcctt ctaccccttc atgagaaacc acaacagcct gctgagcctg     2040

ccccaggagc cctacagctt cagtgagcct gcccagcagg ccatgagaaa ggccctgacc     2100

ctgagatatg ccctgctgcc ccacctgtac accctgttcc accaggccca tgtggctggg     2160

gagacagtgg ccagacccct gttcctggag ttccccaagg acagcagcac ctggacagtg     2220

gaccaccagc tgctgtgggg ggaggccctg ctgatcaccc ctgtgctgca ggctggcaag     2280

gctgaggtga caggctactt ccccctgggc acctggtatg acctgcagac agtgcctgtg     2340

gaggccctgg gcagcctgcc ccccccccct gctgccccca gagagcctgc catccacagt     2400

gagggccagt gggtgaccct gcctgccccc ctggacacca tcaatgtgca cctgagagct     2460

ggctacatca tccccctgca gggccctggc ctgaccacca cagagagcag acagcagccc     2520

atggccctgg ctgtggccct gaccaagggg ggggaggcca gaggggagct gttctgggat     2580

gatggggaga gcctggaggt gctggagaga ggggcctaca cccaggtgat cttcctggcc     2640

agaaacaaca ccattgtgaa tgagctggtg agagtgacca gtgagggggc tggcctgcag     2700

ctgcagaagg tgacagtgct gggggtggcc acagcccccc agcaggtgct gagcaatggg     2760

gtgcctgtga gcaacttcac ctacagccct gacaccaagg tgctggacat ctgtgtgagc     2820

ctgctgatgg gggagcagtt cctggtgagc tggtgctga                            2859


<210>  127
<211>  1290
<212>  DNA
<213>  Homo sapiens

<220>
<221>  misc_feature
<222>  (1)..(1290)
<223>  Alpha-Galactosidase A (GLA)

<400>  127
atgcagctga ggaacccaga actacatctg ggctgcgcgc ttgcgcttcg cttcctggcc       60

ctcgtttcct gggacatccc tggggctaga gcactggaca atggattggc aaggacgcct      120

accatgggct ggctgcactg ggagcgcttc atgtgcaacc ttgactgcca ggaagagcca      180

gattcctgca tcagtgagaa gctcttcatg gagatggcag agctcatggt ctcagaaggc      240

tggaaggatg caggttatga gtacctctgc attgatgact gttggatggc tccccaaaga      300

gattcagaag gcagacttca ggcagaccct cagcgctttc ctcatgggat tcgccagcta      360

gctaattatg ttcacagcaa aggactgaag ctagggattt atgcagatgt tggaaataaa      420

acctgcgcag gcttccctgg gagttttgga tactacgaca ttgatgccca gacctttgct      480

gactggggag tagatctgct aaaatttgat ggttgttact gtgacagttt ggaaaatttg      540

gcagatggtt ataagcacat gtccttggcc ctgaatagga ctggcagaag cattgtgtac      600

tcctgtgagt ggcctcttta tatgtggccc tttcaaaagc ccaattatac agaaatccga      660

cagtactgca atcactggcg aaattttgct gacattgatg attcctggaa aagtataaag      720

agtatcttgg actggacatc ttttaaccag gagagaattg ttgatgttgc tggaccaggg      780

ggttggaatg acccagatat gttagtgatt ggcaactttg gcctcagctg gaatcagcaa      840

gtaactcaga tggccctctg ggctatcatg gctgctcctt tattcatgtc taatgacctc      900

cgacacatca gccctcaagc caaagctctc cttcaggata aggacgtaat tgccatcaat      960

caggacccct tgggcaagca agggtaccag cttagacagg gagacaactt tgaagtgtgg     1020

gaacgacctc tctcaggctt agcctgggct gtagctatga taaaccggca ggagattggt     1080

ggacctcgct cttataccat cgcagttgct tccctgggta aaggagtggc ctgtaatcct     1140

gcctgcttca tcacacagct cctccctgtg aaaaggaagc tagggttcta tgaatggact     1200

tcaaggttaa gaagtcacat aaatcccaca ggcactgttt tgcttcagct agaaaataca     1260

atgcagatgt cattaaaaga cttactttaa                                      1290


<210>  128
<211>  1290
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-GLA

<400>  128
atgcagctga gaaatcctga actgcacctg ggctgtgccc tggctctgag atttctggct       60

ctggtgtcct gggacattcc tggcgctaga gccctggata atggcctggc cagaacacct      120

acaatgggct ggctgcactg ggagagattc atgtgcaacc tggactgcca agaggaaccc      180

gacagctgca tcagcgagaa gctgttcatg gaaatggccg agctgatggt gtccgaaggc      240

tggaaggatg ccggctacga gtacctgtgc atcgacgatt gctggatggc ccctcagaga      300

gattctgagg gcagactgca ggccgatcct cagagatttc ctcacggaat ccggcagctg      360

gccaactacg tgcactctaa gggactgaag ctgggcatct acgccgacgt gggcaacaag      420

acatgtgccg gctttccagg cagcttcggc tactacgata tcgacgccca gacctttgcc      480

gattggggcg tcgacctgct gaagttcgat ggctgctact gcgacagcct ggaaaacctg      540

gccgacggct acaaacacat gtctctggcc ctgaaccgga ccggcagatc tatcgtgtac      600

tcttgcgagt ggcccctgta catgtggccc ttccagaagc ctaactacac cgagatcaga      660

cagtactgca accactggcg gaacttcgcc gacatcgatg acagctggaa gtccatcaag      720

agcatcctgg actggaccag cttcaatcaa gagcggatcg tggatgtggc tggcccaggc      780

ggatggaacg atcctgatat gctggtcatc ggcaacttcg gcctgagctg gaatcagcaa      840

gtgacccaga tggccctgtg ggccattatg gccgctcctc tgttcatgag caacgacctg      900

agacacatca gccctcaggc caaggctctg ctgcaggata aggacgtgat cgccatcaac      960

caggatcctc tgggcaagca gggctatcag ctgagacagg gcgacaattt cgaagtgtgg     1020

gaaagacctc tgagcggcct ggcttgggcc gtcgccatga tcaatagaca agagatcggc     1080

ggaccccggt cctatacaat tgccgtggct tctctcggaa aaggcgtggc ctgcaatcct     1140

gcctgcttta tcacacagct gctccccgtg aagagaaagc tgggctttta cgagtggacc     1200

agcagactga gatcccacat caaccccaca ggcactgttc tgctgcaact ggaaaacaca     1260

atgcagatga gcctgaagga cctgctgtag                                      1290


<210>  129
<211>  1377
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized + GET, CO1-GLA-GET

<400>  129
atgcagctga gaaatcctga actgcacctg ggctgtgccc tggctctgag atttctggct       60

ctggtgtcct gggacattcc tggcgctaga gccctggata atggcctggc cagaacacct      120

acaatgggct ggctgcactg ggagagattc atgtgcaacc tggactgcca agaggaaccc      180

gacagctgca tcagcgagaa gctgttcatg gaaatggccg agctgatggt gtccgaaggc      240

tggaaggatg ccggctacga gtacctgtgc atcgacgatt gctggatggc ccctcagaga      300

gattctgagg gcagactgca ggccgatcct cagagatttc ctcacggaat ccggcagctg      360

gccaactacg tgcactctaa gggactgaag ctgggcatct acgccgacgt gggcaacaag      420

acatgtgccg gctttccagg cagcttcggc tactacgata tcgacgccca gacctttgcc      480

gattggggcg tcgacctgct gaagttcgat ggctgctact gcgacagcct ggaaaacctg      540

gccgacggct acaaacacat gtctctggcc ctgaaccgga ccggcagatc tatcgtgtac      600

tcttgcgagt ggcccctgta catgtggccc ttccagaagc ctaactacac cgagatcaga      660

cagtactgca accactggcg gaacttcgcc gacatcgatg acagctggaa gtccatcaag      720

agcatcctgg actggaccag cttcaatcaa gagcggatcg tggatgtggc tggcccaggc      780

ggatggaacg atcctgatat gctggtcatc ggcaacttcg gcctgagctg gaatcagcaa      840

gtgacccaga tggccctgtg ggccattatg gccgctcctc tgttcatgag caacgacctg      900

agacacatca gccctcaggc caaggctctg ctgcaggata aggacgtgat cgccatcaac      960

caggatcctc tgggcaagca gggctatcag ctgagacagg gcgacaattt cgaagtgtgg     1020

gaaagacctc tgagcggcct ggcttgggcc gtcgccatga tcaatagaca agagatcggc     1080

ggaccccggt cctatacaat tgccgtggct tctctcggaa aaggcgtggc ctgcaatcct     1140

gcctgcttta tcacacagct gctccccgtg aagagaaagc tgggctttta cgagtggacc     1200

agcagactga gatcccacat caaccccaca ggcactgttc tgctgcaact ggaaaacaca     1260

atgcagatga gcctgaagga cctgctgcgg agaagaagaa ggcgcagacg caagcgcaag     1320

aagaaaggca aaggcctcgg caagaagcgg gacccctgtc tgagaaagta caagtaa        1377


<210>  130
<211>  1290
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO2-GLA

<400>  130
atgcagctga gaaaccctga gctgcacctg ggctgtgccc tggccctgag attcctggcc       60

ctggtgagct gggacatccc tggggccaga gccctggaca atgggctagc cagaaccccc      120

accatgggct ggctgcactg ggagagattc atgtgcaacc tggactgcca ggaggagcct      180

gacagctgca tcagtgagaa gctgttcatg gagatggctg agctgatggt gagtgagggc      240

tggaaggatg ctggctatga gtacctgtgc attgatgact gctggatggc cccccagaga      300

gacagtgagg gcagactgca ggctgacccc cagagattcc cccatggcat cagacagctg      360

gccaactatg tgcacagcaa gggcctgaag ctgggcatct atgctgatgt gggcaacaag      420

acctgtgctg gcttccctgg cagctttggc tactatgaca ttgatgccca gacctttgct      480

gactgggggg tggacctgct gaagtttgat ggctgctact gtgacagcct ggagaacctg      540

gctgatggct acaagcacat gagcctggcc ctgaacagaa caggcagaag cattgtgtac      600

agctgtgagt ggcccctgta catgtggccc ttccagaagc ccaactacac agagatcaga      660

cagtactgca accactggag aaactttgct gacattgatg acagctggaa gagcatcaag      720

agcatcctgg actggaccag cttcaaccag gagagaattg tggatgtggc tggccctggg      780

ggctggaatg accctgacat gctggtgatt ggcaactttg gcctgagctg gaaccagcag      840

gtgacccaga tggccctgtg ggccatcatg gctgcccccc tgttcatgag caatgacctg      900

agacacatca gcccccaggc caaggccctg ctgcaggaca aggatgtgat tgccatcaac      960

caggaccccc tgggcaagca gggctaccag ctgagacagg gggacaactt tgaggtgtgg     1020

gagagacccc tgagtggcct ggcctgggct gtggccatga tcaacagaca ggagattggg     1080

ggccccagaa gctacaccat tgctgtggcc agcctgggca agggggtggc ctgcaaccct     1140

gcctgcttca tcacccagct gctgcctgtg aagagaaagc tgggcttcta tgagtggacc     1200

agcagactga gaagccacat caaccccaca ggcacagtgc tgctgcagct ggagaacacc     1260

atgcagatga gcctgaagga cctgctgtga                                      1290


<210>  131
<211>  1290
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO3-GLA

<400>  131
atgcagctga gaaaccctga gctgcacctg ggctgtgccc tggccctgag attcctggcc       60

ctggtgagct gggacatccc tggggccaga gccctggaca atgggctagc cagaaccccc      120

accatgggct ggctgcactg ggagagattc atgtgcaacc tggactgcca ggaggagcct      180

gacagctgca tcagtgagaa gctgttcatg gagatggctg agctgatggt gagtgagggc      240

tggaaggatg ctggctatga gtacctgtgc attgatgact gctggatggc cccccagaga      300

gacagtgagg gcagactgca ggctgacccc cagagattcc cccatggcat cagacagctg      360

gccaactatg tgcacagcaa gggcctgaag ctgggcatct atgctgatgt gggcaacaag      420

acctgtgctg gcttccctgg cagctttggc tactatgaca ttgatgccca gacctttgct      480

gactgggggg tggacctgct gaagtttgat ggctgctact gtgacagcct ggagaacctg      540

gctgatggct acaagcacat gagcctggcc ctgaacagaa caggcagaag cattgtgtac      600

agctgtgagt ggcccctgta catgtggccc ttccagaagc ccaactacac agagatcaga      660

cagtactgca accactggag aaactttgct gacattgatg acagctggaa gagcatcaag      720

agcatcctgg actggaccag cttcaaccag gagagaattg tggatgtggc tggccctggg      780

ggctggaatg accctgacat gctggtgatt ggcaactttg gcctgagctg gaaccagcag      840

gtgacccaga tggccctgtg ggccatcatg gctgcccccc tgttcatgag caatgacctg      900

agacacatca gcccccaggc caaggccctg ctgcaggaca aggatgtgat tgccatcaac      960

caggaccccc tgggcaagca gggctaccag ctgagacagg gggacaactt tgaggtgtgg     1020

gagagacccc tgagtggcct ggcctgggct gtggccatga tcaacagaca ggagattggg     1080

ggccccagaa gctacaccat tgctgtggct tccctgggta aaggagtggc ctgtaatcct     1140

gcctgcttca tcacacagct cctccctgtg aaaaggaagc tagggttcta tgaatggact     1200

tcaaggttaa gaagtcacat aaatcccaca ggcactgttt tgcttcagct agaaaataca     1260

atgcagatgt cattaaaaga cttactttaa                                      1290


<210>  132
<211>  4287
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized, Cystic Fibrosis Transmembrane Regulator deltaR 
       (CFTRdeltaR) contains R domain deletion

<400>  132
atgcagagaa gccccctgga gaaggcctct gtggtgagca agctgttctt cagctggacc       60

agacccatcc tgagaaaggg ctacagacag agactggagc tgtctgacat ctaccagatc      120

ccctctgtgg actctgctga caacctgtct gagaagctgg agagagagtg ggacagagag      180

ctggccagca agaagaaccc caagctgatc aatgccctga gaagatgctt cttctggaga      240

ttcatgttct atggcatctt cctgtacctg ggggaggtga ccaaggctgt gcagcccctg      300

ctgctgggca gaatcattgc cagctatgac cctgacaaca aggaggagag aagcattgcc      360

atctacctgg gcattggcct gtgcctgctg ttcattgtga gaaccctgct gctgcaccct      420

gccatctttg gcctgcacca cattggcatg cagatgagaa ttgccatgtt cagcctgatc      480

tacaagaaga ccctgaagct gagcagcaga gtgctggaca agatcagcat tggccagctg      540

gtgagcctgc tgagcaacaa cctgaacaag tttgatgagg gcctggccct ggcccacttt      600

gtgtggattg cccccctgca ggtggccctg ctgatgggcc tgatctggga gctgctgcag      660

gcctctgcct tctgtggcct gggcttcctg attgtgctgg ccctgttcca ggctggcctg      720

ggcagaatga tgatgaagta cagagaccag agagctggca agatctctga gagactggtg      780

atcacctctg agatgattga gaacatccag tctgtgaagg cctactgctg ggaggaggcc      840

atggagaaga tgattgagaa cctgagacag acagagctga agctgaccag aaaggctgcc      900

tatgtgagat acttcaacag ctctgccttc ttcttctctg gcttctttgt ggtgttcctg      960

tctgtgctgc cctatgccct gatcaagggc atcatcctga gaaagatctt caccaccatc     1020

agcttctgca ttgtgctgag aatggctgtg accagacagt tcccctgggc tgtgcagacc     1080

tggtatgaca gcctgggggc catcaacaag atccaggact tcctgcagaa gcaggagtac     1140

aagaccctgg agtacaacct gaccaccaca gaggtggtga tggagaatgt gacagccttc     1200

tgggaggagg gctttgggga gctgtttgag aaggccaagc agaacaacaa caacagaaag     1260

accagcaatg gggatgacag cctgttcttc agcaacttca gcctgctggg cacccctgtg     1320

ctgaaggaca tcaacttcaa gattgagaga ggccagctgc tggctgtggc tggcagcaca     1380

ggggctggca agaccagcct gctgatgatg atcatggggg agctggagcc ctctgagggc     1440

aagatcaagc actctggcag aatcagcttc tgcagccagt tcagctggat catgcctggc     1500

accatcaagg agaacatcat ctttggggtg agctatgatg agtacagata cagatctgtg     1560

atcaaggcct gccagctgga ggaggacatc agcaagtttg ctgagaagga caacattgtg     1620

ctgggggagg ggggcatcac cctgtctggg ggccagagag ccagaatcag cctggccaga     1680

gctgtgtaca aggatgctga cctgtacctg ctggacagcc cctttggcta cctggatgtg     1740

ctgacagaga aggagatctt tgagagctgt gtgtgcaagc tgatggccaa caagaccaga     1800

atcctggtga ccagcaagat ggagcacctg aagaaggctg acaagatcct gatcctgcat     1860

gagggcagca gctacttcta tggcaccttc tctgagctgc agaacctgca gcctgacttc     1920

agcagcaagc tgatgggctg tgacagcttt gaccagttct ctgctgagag aagaaacagc     1980

atcctgacag agaccctgca cagattcagc ctggaggggg atgcccctgt gagctggaca     2040

gagaccaaga agcagagctt caagcagaca ggggagtttg gggagaagag aaagaacagc     2100

atcctgaacc ccatcaacag caccctgcag gccagaagaa gacagtctgt gctgaacctg     2160

atgacccact ctgtgaacca gggccagaac atccacagaa agaccacagc cagcaccaga     2220

aaggtgagcc tggcccccca ggccaacctg acagagctgg acatctacag cagaagactg     2280

agccaggaga caggcctgga gatctctgag gagatcaatg aggaggacct gaaggagtgc     2340

ttctttgatg acatggagag catccctgct gtgaccacct ggaacaccta cctgagatac     2400

atcacagtgc acaagagcct gatctttgtg ctgatctggt gcctggtgat cttcctggct     2460

gaggtggctg ccagcctggt ggtgctgtgg ctgctgggca acacccccct gcaggacaag     2520

ggcaacagca cccacagcag aaacaacagc tatgctgtga tcatcaccag caccagcagc     2580

tactatgtgt tctacatcta tgtgggggtg gctgacaccc tgctggccat gggcttcttc     2640

agaggcctgc ccctggtgca caccctgatc acagtgagca agatcctgca ccacaagatg     2700

ctgcactctg tgctgcaggc ccccatgagc accctgaaca ccctgaaggc tgggggcatc     2760

ctgaacagat tcagcaagga cattgccatc ctggatgacc tgctgcccct gaccatcttt     2820

gacttcatcc agctgctgct gattgtgatt ggggccattg ctgtggtggc tgtgctgcag     2880

ccctacatct ttgtggccac agtgcctgtg attgtggcct tcatcatgct gagagcctac     2940

ttcctgcaga ccagccagca gctgaagcag ctggagtctg agggcagaag ccccatcttc     3000

acccacctgg tgaccagcct gaagggcctg tggaccctga gagcctttgg cagacagccc     3060

tactttgaga ccctgttcca caaggccctg aacctgcaca cagccaactg gttcctgtac     3120

ctgagcaccc tgagatggtt ccagatgaga attgagatga tctttgtgat cttcttcatt     3180

gctgtgacct tcatcagcat cctgaccaca ggggaggggg agggcagagt gggcatcatc     3240

ctgaccctgg ccatgaacat catgagcacc ctgcagtggg ctgtgaacag cagcattgat     3300

gtggacagcc tgatgagatc tgtgagcaga gtgttcaagt tcattgacat gcccacagag     3360

ggcaagccca ccaagagcac caagccctac aagaatggcc agctgagcaa ggtgatgatc     3420

attgagaaca gccatgtgaa gaaggatgac atctggccct ctgggggcca gatgacagtg     3480

aaggacctga cagccaagta cacagagggg ggcaatgcca tcctggagaa catcagcttc     3540

agcatcagcc ctggccagag agtgggcctg ctgggcagaa caggctctgg caagagcacc     3600

ctgctgtctg ccttcctgag actgctgaac acagaggggg agatccagat tgatggggtg     3660

agctgggaca gcatcaccct gcagcagtgg agaaaggcct ttggggtgat cccccagaag     3720

gtgttcatct tctctggcac cttcagaaag aacctggacc cctatgagca gtggtctgac     3780

caggagatct ggaaggtggc tgatgaggtg ggcctgagat ctgtgattga gcagttccct     3840

ggcaagctgg actttgtgct ggtggatggg ggctgtgtgc tgagccatgg ccacaagcag     3900

ctgatgtgcc tggccagatc tgtgctgagc aaggccaaga tcctgctgct ggatgagccc     3960

tctgcccacc tggaccctgt gacctaccag atcatcagaa gaaccctgaa gcaggccttt     4020

gctgactgca cagtgatcct gtgtgagcac agaattgagg ccatgctgga gtgccagcag     4080

ttcctggtga ttgaggagaa caaggtgaga cagtatgaca gcatccagaa gctgctgaat     4140

gagagaagcc tgttcagaca ggccatcagc ccctctgaca gagtgaagct gttcccccac     4200

agaaacagca gcaagtgcaa gagcaagccc cagattgctg ccctgaagga ggagaccgag     4260

gaggaggtgc aggacaccag actgtaa                                         4287


<210>  133
<211>  4443
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized, full length Cystic Fibrosis Transmembrane 
       Regulator (CFTR)

<400>  133
atgcagagaa gccccctgga gaaggcctct gtggtgagca agctgttctt cagctggacc       60

agacccatcc tgagaaaggg ctacagacag agactggagc tgtctgacat ctaccagatc      120

ccctctgtgg actctgctga caacctgtct gagaagctgg agagagagtg ggacagagag      180

ctggccagca agaagaaccc caagctgatc aatgccctga gaagatgctt cttctggaga      240

ttcatgttct atggcatctt cctgtacctg ggggaggtga ccaaggctgt gcagcccctg      300

ctgctgggca gaatcattgc cagctatgac cctgacaaca aggaggagag aagcattgcc      360

atctacctgg gcattggcct gtgcctgctg ttcattgtga gaaccctgct gctgcaccct      420

gccatctttg gcctgcacca cattggcatg cagatgagaa ttgccatgtt cagcctgatc      480

tacaagaaga ccctgaagct gagcagcaga gtgctggaca agatcagcat tggccagctg      540

gtgagcctgc tgagcaacaa cctgaacaag tttgatgagg gcctggccct ggcccacttt      600

gtgtggattg cccccctgca ggtggccctg ctgatgggcc tgatctggga gctgctgcag      660

gcctctgcct tctgtggcct gggcttcctg attgtgctgg ccctgttcca ggctggcctg      720

ggcagaatga tgatgaagta cagagaccag agagctggca agatctctga gagactggtg      780

atcacctctg agatgattga gaacatccag tctgtgaagg cctactgctg ggaggaggcc      840

atggagaaga tgattgagaa cctgagacag acagagctga agctgaccag aaaggctgcc      900

tatgtgagat acttcaacag ctctgccttc ttcttctctg gcttctttgt ggtgttcctg      960

tctgtgctgc cctatgccct gatcaagggc atcatcctga gaaagatctt caccaccatc     1020

agcttctgca ttgtgctgag aatggctgtg accagacagt tcccctgggc tgtgcagacc     1080

tggtatgaca gcctgggggc catcaacaag atccaggact tcctgcagaa gcaggagtac     1140

aagaccctgg agtacaacct gaccaccaca gaggtggtga tggagaatgt gacagccttc     1200

tgggaggagg gctttgggga gctgtttgag aaggccaagc agaacaacaa caacagaaag     1260

accagcaatg gggatgacag cctgttcttc agcaacttca gcctgctggg cacccctgtg     1320

ctgaaggaca tcaacttcaa gattgagaga ggccagctgc tggctgtggc tggcagcaca     1380

ggggctggca agaccagcct gctgatgatg atcatggggg agctggagcc ctctgagggc     1440

aagatcaagc actctggcag aatcagcttc tgcagccagt tcagctggat catgcctggc     1500

accatcaagg agaacatcat ctttggggtg agctatgatg agtacagata cagatctgtg     1560

atcaaggcct gccagctgga ggaggacatc agcaagtttg ctgagaagga caacattgtg     1620

ctgggggagg ggggcatcac cctgtctggg ggccagagag ccagaatcag cctggccaga     1680

gctgtgtaca aggatgctga cctgtacctg ctggacagcc cctttggcta cctggatgtg     1740

ctgacagaga aggagatctt tgagagctgt gtgtgcaagc tgatggccaa caagaccaga     1800

atcctggtga ccagcaagat ggagcacctg aagaaggctg acaagatcct gatcctgcat     1860

gagggcagca gctacttcta tggcaccttc tctgagctgc agaacctgca gcctgacttc     1920

agcagcaagc tgatgggctg tgacagcttt gaccagttct ctgctgagag aagaaacagc     1980

atcctgacag agaccctgca cagattcagc ctggaggggg atgcccctgt gagctggaca     2040

gagaccaaga agcagagctt caagcagaca ggggagtttg gggagaagag aaagaacagc     2100

atcctgaacc ccatcaacag catcagaaag ttcagcattg tgcagaagac ccccctgcag     2160

atgaatggca ttgaggagga ctctgatgag cccctggaga gaagactgag cctggtgcct     2220

gactctgagc agggggaggc catcctgccc agaatctctg tgatcagcac aggccccacc     2280

ctgcaggcca gaagaagaca gtctgtgctg aacctgatga cccactctgt gaaccagggc     2340

cagaacatcc acagaaagac cacagccagc accagaaagg tgagcctggc cccccaggcc     2400

aacctgacag agctggacat ctacagcaga agactgagcc aggagacagg cctggagatc     2460

tctgaggaga tcaatgagga ggacctgaag gagtgcttct ttgatgacat ggagagcatc     2520

cctgctgtga ccacctggaa cacctacctg agatacatca cagtgcacaa gagcctgatc     2580

tttgtgctga tctggtgcct ggtgatcttc ctggctgagg tggctgccag cctggtggtg     2640

ctgtggctgc tgggcaacac ccccctgcag gacaagggca acagcaccca cagcagaaac     2700

aacagctatg ctgtgatcat caccagcacc agcagctact atgtgttcta catctatgtg     2760

ggggtggctg acaccctgct ggccatgggc ttcttcagag gcctgcccct ggtgcacacc     2820

ctgatcacag tgagcaagat cctgcaccac aagatgctgc actctgtgct gcaggccccc     2880

atgagcaccc tgaacaccct gaaggctggg ggcatcctga acagattcag caaggacatt     2940

gccatcctgg atgacctgct gcccctgacc atctttgact tcatccagct gctgctgatt     3000

gtgattgggg ccattgctgt ggtggctgtg ctgcagccct acatctttgt ggccacagtg     3060

cctgtgattg tggccttcat catgctgaga gcctacttcc tgcagaccag ccagcagctg     3120

aagcagctgg agtctgaggg cagaagcccc atcttcaccc acctggtgac cagcctgaag     3180

ggcctgtgga ccctgagagc ctttggcaga cagccctact ttgagaccct gttccacaag     3240

gccctgaacc tgcacacagc caactggttc ctgtacctga gcaccctgag atggttccag     3300

atgagaattg agatgatctt tgtgatcttc ttcattgctg tgaccttcat cagcatcctg     3360

accacagggg agggggaggg cagagtgggc atcatcctga ccctggccat gaacatcatg     3420

agcaccctgc agtgggctgt gaacagcagc attgatgtgg acagcctgat gagatctgtg     3480

agcagagtgt tcaagttcat tgacatgccc acagagggca agcccaccaa gagcaccaag     3540

ccctacaaga atggccagct gagcaaggtg atgatcattg agaacagcca tgtgaagaag     3600

gatgacatct ggccctctgg gggccagatg acagtgaagg acctgacagc caagtacaca     3660

gaggggggca atgccatcct ggagaacatc agcttcagca tcagccctgg ccagagagtg     3720

ggcctgctgg gcagaacagg ctctggcaag agcaccctgc tgtctgcctt cctgagactg     3780

ctgaacacag agggggagat ccagattgat ggggtgagct gggacagcat caccctgcag     3840

cagtggagaa aggcctttgg ggtgatcccc cagaaggtgt tcatcttctc tggcaccttc     3900

agaaagaacc tggaccccta tgagcagtgg tctgaccagg agatctggaa ggtggctgat     3960

gaggtgggcc tgagatctgt gattgagcag ttccctggca agctggactt tgtgctggtg     4020

gatgggggct gtgtgctgag ccatggccac aagcagctga tgtgcctggc cagatctgtg     4080

ctgagcaagg ccaagatcct gctgctggat gagccctctg cccacctgga ccctgtgacc     4140

taccagatca tcagaagaac cctgaagcag gcctttgctg actgcacagt gatcctgtgt     4200

gagcacagaa ttgaggccat gctggagtgc cagcagttcc tggtgattga ggagaacaag     4260

gtgagacagt atgacagcat ccagaagctg ctgaatgaga gaagcctgtt cagacaggcc     4320

atcagcccct ctgacagagt gaagctgttc ccccacagaa acagcagcaa gtgcaagagc     4380

aagccccaga ttgctgccct gaaggaggag accgaggagg aggtgcagga caccagactg     4440

taa                                                                   4443


<210>  134
<211>  502
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(502)
<223>  Sulfoglucosamine sulfohydrolase (SGSH)

<400>  134

Met Ser Cys Pro Val Pro Ala Cys Cys Ala Leu Leu Leu Val Leu Gly 
1               5                   10                  15      


Leu Cys Arg Ala Arg Pro Arg Asn Ala Leu Leu Leu Leu Ala Asp Asp 
            20                  25                  30          


Gly Gly Phe Glu Ser Gly Ala Tyr Asn Asn Ser Ala Ile Ala Thr Pro 
        35                  40                  45              


His Leu Asp Ala Leu Ala Arg Arg Ser Leu Leu Phe Arg Asn Ala Phe 
    50                  55                  60                  


Thr Ser Val Ser Ser Cys Ser Pro Ser Arg Ala Ser Leu Leu Thr Gly 
65                  70                  75                  80  


Leu Pro Gln His Gln Asn Gly Met Tyr Gly Leu His Gln Asp Val His 
                85                  90                  95      


His Phe Asn Ser Phe Asp Lys Val Arg Ser Leu Pro Leu Leu Leu Ser 
            100                 105                 110         


Gln Ala Gly Val Arg Thr Gly Ile Ile Gly Lys Lys His Val Gly Pro 
        115                 120                 125             


Glu Thr Val Tyr Pro Phe Asp Phe Ala Tyr Thr Glu Glu Asn Gly Ser 
    130                 135                 140                 


Val Leu Gln Val Gly Arg Asn Ile Thr Arg Ile Lys Leu Leu Val Arg 
145                 150                 155                 160 


Lys Phe Leu Gln Thr Gln Asp Asp Gln Pro Phe Phe Leu Tyr Val Ala 
                165                 170                 175     


Phe His Asp Pro His Arg Cys Gly His Ser Gln Pro Gln Tyr Gly Thr 
            180                 185                 190         


Phe Cys Glu Lys Phe Gly Asn Gly Glu Ser Gly Met Gly Arg Ile Pro 
        195                 200                 205             


Asp Trp Thr Pro Gln Ala Tyr Asp Pro Leu Asp Val Leu Val Pro Tyr 
    210                 215                 220                 


Phe Val Pro Asn Thr Pro Ala Ala Arg Ala Asp Leu Ala Ala Gln Tyr 
225                 230                 235                 240 


Thr Thr Val Gly Arg Met Asp Gln Gly Val Gly Leu Val Leu Gln Glu 
                245                 250                 255     


Leu Arg Asp Ala Gly Val Leu Asn Asp Thr Leu Val Ile Phe Thr Ser 
            260                 265                 270         


Asp Asn Gly Ile Pro Phe Pro Ser Gly Arg Thr Asn Leu Tyr Trp Pro 
        275                 280                 285             


Gly Thr Ala Glu Pro Leu Leu Val Ser Ser Pro Glu His Pro Lys Arg 
    290                 295                 300                 


Trp Gly Gln Val Ser Glu Ala Tyr Val Ser Leu Leu Asp Leu Thr Pro 
305                 310                 315                 320 


Thr Ile Leu Asp Trp Phe Ser Ile Pro Tyr Pro Ser Tyr Ala Ile Phe 
                325                 330                 335     


Gly Ser Lys Thr Ile His Leu Thr Gly Arg Ser Leu Leu Pro Ala Leu 
            340                 345                 350         


Glu Ala Glu Pro Leu Trp Ala Thr Val Phe Gly Ser Gln Ser His His 
        355                 360                 365             


Glu Val Thr Met Ser Tyr Pro Met Arg Ser Val Gln His Arg His Phe 
    370                 375                 380                 


Arg Leu Val His Asn Leu Asn Phe Lys Met Pro Phe Pro Ile Asp Gln 
385                 390                 395                 400 


Asp Phe Tyr Val Ser Pro Thr Phe Gln Asp Leu Leu Asn Arg Thr Thr 
                405                 410                 415     


Ala Gly Gln Pro Thr Gly Trp Tyr Lys Asp Leu Arg His Tyr Tyr Tyr 
            420                 425                 430         


Arg Ala Arg Trp Glu Leu Tyr Asp Arg Ser Arg Asp Pro His Glu Thr 
        435                 440                 445             


Gln Asn Leu Ala Thr Asp Pro Arg Phe Ala Gln Leu Leu Glu Met Leu 
    450                 455                 460                 


Arg Asp Gln Leu Ala Lys Trp Gln Trp Glu Thr His Asp Pro Trp Val 
465                 470                 475                 480 


Cys Ala Pro Asp Gly Val Leu Glu Glu Lys Leu Ser Pro Gln Cys Gln 
                485                 490                 495     


Pro Leu His Asn Glu Leu 
            500         


<210>  135
<211>  531
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized + GET CO1-SGSH-GET

<400>  135

Met Ser Cys Pro Val Pro Ala Cys Cys Ala Leu Leu Leu Val Leu Gly 
1               5                   10                  15      


Leu Cys Arg Ala Arg Pro Arg Asn Ala Leu Leu Leu Leu Ala Asp Asp 
            20                  25                  30          


Gly Gly Phe Glu Ser Gly Ala Tyr Asn Asn Ser Ala Ile Ala Thr Pro 
        35                  40                  45              


His Leu Asp Ala Leu Ala Arg Arg Ser Leu Leu Phe Arg Asn Ala Phe 
    50                  55                  60                  


Thr Ser Val Ser Ser Cys Ser Pro Ser Arg Ala Ser Leu Leu Thr Gly 
65                  70                  75                  80  


Leu Pro Gln His Gln Asn Gly Met Tyr Gly Leu His Gln Asp Val His 
                85                  90                  95      


His Phe Asn Ser Phe Asp Lys Val Arg Ser Leu Pro Leu Leu Leu Ser 
            100                 105                 110         


Gln Ala Gly Val Arg Thr Gly Ile Ile Gly Lys Lys His Val Gly Pro 
        115                 120                 125             


Glu Thr Val Tyr Pro Phe Asp Phe Ala Tyr Thr Glu Glu Asn Gly Ser 
    130                 135                 140                 


Val Leu Gln Val Gly Arg Asn Ile Thr Arg Ile Lys Leu Leu Val Arg 
145                 150                 155                 160 


Lys Phe Leu Gln Thr Gln Asp Asp Gln Pro Phe Phe Leu Tyr Val Ala 
                165                 170                 175     


Phe His Asp Pro His Arg Cys Gly His Ser Gln Pro Gln Tyr Gly Thr 
            180                 185                 190         


Phe Cys Glu Lys Phe Gly Asn Gly Glu Ser Gly Met Gly Arg Ile Pro 
        195                 200                 205             


Asp Trp Thr Pro Gln Ala Tyr Asp Pro Leu Asp Val Leu Val Pro Tyr 
    210                 215                 220                 


Phe Val Pro Asn Thr Pro Ala Ala Arg Ala Asp Leu Ala Ala Gln Tyr 
225                 230                 235                 240 


Thr Thr Val Gly Arg Met Asp Gln Gly Val Gly Leu Val Leu Gln Glu 
                245                 250                 255     


Leu Arg Asp Ala Gly Val Leu Asn Asp Thr Leu Val Ile Phe Thr Ser 
            260                 265                 270         


Asp Asn Gly Ile Pro Phe Pro Ser Gly Arg Thr Asn Leu Tyr Trp Pro 
        275                 280                 285             


Gly Thr Ala Glu Pro Leu Leu Val Ser Ser Pro Glu His Pro Lys Arg 
    290                 295                 300                 


Trp Gly Gln Val Ser Glu Ala Tyr Val Ser Leu Leu Asp Leu Thr Pro 
305                 310                 315                 320 


Thr Ile Leu Asp Trp Phe Ser Ile Pro Tyr Pro Ser Tyr Ala Ile Phe 
                325                 330                 335     


Gly Ser Lys Thr Ile His Leu Thr Gly Arg Ser Leu Leu Pro Ala Leu 
            340                 345                 350         


Glu Ala Glu Pro Leu Trp Ala Thr Val Phe Gly Ser Gln Ser His His 
        355                 360                 365             


Glu Val Thr Met Ser Tyr Pro Met Arg Ser Val Gln His Arg His Phe 
    370                 375                 380                 


Arg Leu Val His Asn Leu Asn Phe Lys Met Pro Phe Pro Ile Asp Gln 
385                 390                 395                 400 


Asp Phe Tyr Val Ser Pro Thr Phe Gln Asp Leu Leu Asn Arg Thr Thr 
                405                 410                 415     


Ala Gly Gln Pro Thr Gly Trp Tyr Lys Asp Leu Arg His Tyr Tyr Tyr 
            420                 425                 430         


Arg Ala Arg Trp Glu Leu Tyr Asp Arg Ser Arg Asp Pro His Glu Thr 
        435                 440                 445             


Gln Asn Leu Ala Thr Asp Pro Arg Phe Ala Gln Leu Leu Glu Met Leu 
    450                 455                 460                 


Arg Asp Gln Leu Ala Lys Trp Gln Trp Glu Thr His Asp Pro Trp Val 
465                 470                 475                 480 


Cys Ala Pro Asp Gly Val Leu Glu Glu Lys Leu Ser Pro Gln Cys Gln 
                485                 490                 495     


Pro Leu His Asn Glu Leu Arg Arg Arg Arg Arg Arg Arg Arg Lys Arg 
            500                 505                 510         


Lys Lys Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg 
        515                 520                 525             


Lys Tyr Lys 
    530     


<210>  136
<211>  306
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized Ceroid Lipofuscinosis, Neuronal, 1 (CLN1)

<400>  136

Met Ala Ser Pro Gly Cys Leu Trp Leu Leu Ala Val Ala Leu Leu Pro 
1               5                   10                  15      


Trp Thr Cys Ala Ser Arg Ala Leu Gln His Leu Asp Pro Pro Ala Pro 
            20                  25                  30          


Leu Pro Leu Val Ile Trp His Gly Met Gly Asp Ser Cys Cys Asn Pro 
        35                  40                  45              


Leu Ser Met Gly Ala Ile Lys Lys Met Val Glu Lys Lys Ile Pro Gly 
    50                  55                  60                  


Ile Tyr Val Leu Ser Leu Glu Ile Gly Lys Thr Leu Met Glu Asp Val 
65                  70                  75                  80  


Glu Asn Ser Phe Phe Leu Asn Val Asn Ser Gln Val Thr Thr Val Cys 
                85                  90                  95      


Gln Ala Leu Ala Lys Asp Pro Lys Leu Gln Gln Gly Tyr Asn Ala Met 
            100                 105                 110         


Gly Phe Ser Gln Gly Gly Gln Phe Leu Arg Ala Val Ala Gln Arg Cys 
        115                 120                 125             


Pro Ser Pro Pro Met Ile Asn Leu Ile Ser Val Gly Gly Gln His Gln 
    130                 135                 140                 


Gly Val Phe Gly Leu Pro Arg Cys Pro Gly Glu Ser Ser His Ile Cys 
145                 150                 155                 160 


Asp Phe Ile Arg Lys Thr Leu Asn Ala Gly Ala Tyr Ser Lys Val Val 
                165                 170                 175     


Gln Glu Arg Leu Val Gln Ala Glu Tyr Trp His Asp Pro Ile Lys Glu 
            180                 185                 190         


Asp Val Tyr Arg Asn His Ser Ile Phe Leu Ala Asp Ile Asn Gln Glu 
        195                 200                 205             


Arg Gly Ile Asn Glu Ser Tyr Lys Lys Asn Leu Met Ala Leu Lys Lys 
    210                 215                 220                 


Phe Val Met Val Lys Phe Leu Asn Asp Ser Ile Val Asp Pro Val Asp 
225                 230                 235                 240 


Ser Glu Trp Phe Gly Phe Tyr Arg Ser Gly Gln Ala Lys Glu Thr Ile 
                245                 250                 255     


Pro Leu Gln Glu Thr Ser Leu Tyr Thr Gln Asp Arg Leu Gly Leu Lys 
            260                 265                 270         


Glu Met Asp Asn Ala Gly Gln Leu Val Phe Leu Ala Thr Glu Gly Asp 
        275                 280                 285             


His Leu Gln Leu Ser Glu Glu Trp Phe Tyr Ala His Ile Ile Pro Phe 
    290                 295                 300                 


Leu Gly 
305     


<210>  137
<211>  294
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(294)
<223>  Survival Motor Neuron 1 (SMN1)

<400>  137

Met Ala Met Ser Ser Gly Gly Ser Gly Gly Gly Val Pro Glu Gln Glu 
1               5                   10                  15      


Asp Ser Val Leu Phe Arg Arg Gly Thr Gly Gln Ser Asp Asp Ser Asp 
            20                  25                  30          


Ile Trp Asp Asp Thr Ala Leu Ile Lys Ala Tyr Asp Lys Ala Val Ala 
        35                  40                  45              


Ser Phe Lys His Ala Leu Lys Asn Gly Asp Ile Cys Glu Thr Ser Gly 
    50                  55                  60                  


Lys Pro Lys Thr Thr Pro Lys Arg Lys Pro Ala Lys Lys Asn Lys Ser 
65                  70                  75                  80  


Gln Lys Lys Asn Thr Ala Ala Ser Leu Gln Gln Trp Lys Val Gly Asp 
                85                  90                  95      


Lys Cys Ser Ala Ile Trp Ser Glu Asp Gly Cys Ile Tyr Pro Ala Thr 
            100                 105                 110         


Ile Ala Ser Ile Asp Phe Lys Arg Glu Thr Cys Val Val Val Tyr Thr 
        115                 120                 125             


Gly Tyr Gly Asn Arg Glu Glu Gln Asn Leu Ser Asp Leu Leu Ser Pro 
    130                 135                 140                 


Ile Cys Glu Val Ala Asn Asn Ile Glu Gln Asn Ala Gln Glu Asn Glu 
145                 150                 155                 160 


Asn Glu Ser Gln Val Ser Thr Asp Glu Ser Glu Asn Ser Arg Ser Pro 
                165                 170                 175     


Gly Asn Lys Ser Asp Asn Ile Lys Pro Lys Ser Ala Pro Trp Asn Ser 
            180                 185                 190         


Phe Leu Pro Pro Pro Pro Pro Met Pro Gly Pro Arg Leu Gly Pro Gly 
        195                 200                 205             


Lys Pro Gly Leu Lys Phe Asn Gly Pro Pro Pro Pro Pro Pro Pro Pro 
    210                 215                 220                 


Pro Pro His Leu Leu Ser Cys Trp Leu Pro Pro Phe Pro Ser Gly Pro 
225                 230                 235                 240 


Pro Ile Ile Pro Pro Pro Pro Pro Ile Cys Pro Asp Ser Leu Asp Asp 
                245                 250                 255     


Ala Asp Ala Leu Gly Ser Met Leu Ile Ser Trp Tyr Met Ser Gly Tyr 
            260                 265                 270         


His Thr Gly Tyr Tyr Met Gly Phe Arg Gln Asn Gln Lys Glu Gly Arg 
        275                 280                 285             


Cys Ser His Ser Leu Asn 
    290                 


<210>  138
<211>  515
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(515)
<223>  Tissue Non-specific Alkaline Phosphatase (TNALP)

<400>  138

Met Ile Ser Pro Phe Leu Val Leu Ala Ile Gly Thr Cys Leu Thr Asn 
1               5                   10                  15      


Ser Leu Val Pro Glu Lys Glu Lys Asp Pro Lys Tyr Trp Arg Asp Gln 
            20                  25                  30          


Ala Gln Glu Thr Leu Lys Tyr Ala Leu Glu Leu Gln Lys Leu Asn Thr 
        35                  40                  45              


Asn Val Ala Lys Asn Val Ile Met Phe Leu Gly Asp Gly Met Gly Val 
    50                  55                  60                  


Ser Thr Val Thr Ala Ala Arg Ile Leu Lys Gly Gln Leu His His Asn 
65                  70                  75                  80  


Pro Gly Glu Glu Thr Arg Leu Glu Met Asp Lys Phe Pro Phe Val Ala 
                85                  90                  95      


Leu Ser Lys Thr Tyr Asn Thr Asn Ala Gln Val Pro Asp Ser Ala Gly 
            100                 105                 110         


Thr Ala Thr Ala Tyr Leu Cys Gly Val Lys Ala Asn Glu Gly Thr Val 
        115                 120                 125             


Gly Val Ser Ala Ala Thr Glu Arg Ser Arg Cys Asn Thr Thr Gln Gly 
    130                 135                 140                 


Asn Glu Val Thr Ser Ile Leu Arg Trp Ala Lys Asp Ala Gly Lys Ser 
145                 150                 155                 160 


Val Gly Ile Val Thr Thr Thr Arg Val Asn His Ala Thr Pro Ser Ala 
                165                 170                 175     


Ala Tyr Ala His Ser Ala Asp Arg Asp Trp Tyr Ser Asp Asn Glu Met 
            180                 185                 190         


Pro Pro Glu Ala Leu Ser Gln Gly Cys Lys Asp Ile Ala Tyr Gln Leu 
        195                 200                 205             


Met His Asn Ile Arg Asp Ile Asp Val Ile Met Gly Gly Gly Arg Lys 
    210                 215                 220                 


Tyr Met Tyr Pro Lys Asn Lys Thr Asp Val Glu Tyr Glu Ser Asp Glu 
225                 230                 235                 240 


Lys Ala Arg Gly Thr Arg Leu Asp Gly Leu Asp Leu Val Asp Thr Trp 
                245                 250                 255     


Lys Ser Phe Lys Pro Arg Tyr Lys His Ser His Phe Ile Trp Asn Arg 
            260                 265                 270         


Thr Glu Leu Leu Thr Leu Asp Pro His Asn Val Asp Tyr Leu Leu Gly 
        275                 280                 285             


Leu Phe Glu Pro Gly Asp Met Gln Tyr Glu Leu Asn Arg Asn Asn Val 
    290                 295                 300                 


Thr Asp Pro Ser Leu Ser Glu Met Val Val Val Ala Ile Gln Ile Leu 
305                 310                 315                 320 


Arg Lys Asn Pro Lys Gly Phe Phe Leu Leu Val Glu Gly Gly Arg Ile 
                325                 330                 335     


Asp His Gly His His Glu Gly Lys Ala Lys Gln Ala Leu His Glu Ala 
            340                 345                 350         


Val Glu Met Asp Arg Ala Ile Gly Gln Ala Gly Ser Leu Thr Ser Ser 
        355                 360                 365             


Glu Asp Thr Leu Thr Val Val Thr Ala Asp His Ser His Val Phe Thr 
    370                 375                 380                 


Phe Gly Gly Tyr Thr Pro Arg Gly Asn Ser Ile Phe Gly Leu Ala Pro 
385                 390                 395                 400 


Met Leu Ser Asp Thr Asp Lys Lys Pro Phe Thr Ala Ile Leu Tyr Gly 
                405                 410                 415     


Asn Gly Pro Gly Tyr Lys Val Val Gly Gly Glu Arg Glu Asn Val Ser 
            420                 425                 430         


Met Val Asp Tyr Ala His Asn Asn Tyr Gln Ala Gln Ser Ala Val Pro 
        435                 440                 445             


Leu Arg His Glu Thr His Gly Gly Glu Asp Val Ala Val Phe Ser Lys 
    450                 455                 460                 


Gly Pro Met Ala His Leu Leu His Gly Val His Glu Gln Asn Tyr Val 
465                 470                 475                 480 


Pro His Val Met Ala Tyr Ala Ala Cys Ile Gly Ala Asn Leu Gly His 
                485                 490                 495     


Cys Ala Pro Ala Ser Ser Ala Gly Ser Asp Asp Asp Asp Asp Asp Asp 
            500                 505                 510         


Asp Asp Asp 
        515 


<210>  139
<211>  211
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(211)
<223>  Glial Cell Derived Neurotrophic Factor (GDNF)

<400>  139

Met Lys Leu Trp Asp Val Val Ala Val Cys Leu Val Leu Leu His Thr 
1               5                   10                  15      


Ala Ser Ala Phe Pro Leu Pro Ala Gly Lys Arg Pro Pro Glu Ala Pro 
            20                  25                  30          


Ala Glu Asp Arg Ser Leu Gly Arg Arg Arg Ala Pro Phe Ala Leu Ser 
        35                  40                  45              


Ser Asp Ser Asn Met Pro Glu Asp Tyr Pro Asp Gln Phe Asp Asp Val 
    50                  55                  60                  


Met Asp Phe Ile Gln Ala Thr Ile Lys Arg Leu Lys Arg Ser Pro Asp 
65                  70                  75                  80  


Lys Gln Met Ala Val Leu Pro Arg Arg Glu Arg Asn Arg Gln Ala Ala 
                85                  90                  95      


Ala Ala Asn Pro Glu Asn Ser Arg Gly Lys Gly Arg Arg Gly Gln Arg 
            100                 105                 110         


Gly Lys Asn Arg Gly Cys Val Leu Thr Ala Ile His Leu Asn Val Thr 
        115                 120                 125             


Asp Leu Gly Leu Gly Tyr Glu Thr Lys Glu Glu Leu Ile Phe Arg Tyr 
    130                 135                 140                 


Cys Ser Gly Ser Cys Asp Ala Ala Glu Thr Thr Tyr Asp Lys Ile Leu 
145                 150                 155                 160 


Lys Asn Leu Ser Arg Asn Arg Arg Leu Val Ser Asp Lys Val Gly Gln 
                165                 170                 175     


Ala Cys Cys Arg Pro Ile Ala Phe Asp Asp Asp Leu Ser Phe Leu Asp 
            180                 185                 190         


Asp Asn Leu Val Tyr His Ile Leu Arg Lys His Ser Ala Lys Arg Cys 
        195                 200                 205             


Gly Cys Ile 
    210     


<210>  140
<211>  536
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(536)
<223>  Tissue Glucosyl Ceramidase beta (GBA1)

<400>  140

Met Glu Phe Ser Ser Pro Ser Arg Glu Glu Cys Pro Lys Pro Leu Ser 
1               5                   10                  15      


Arg Val Ser Ile Met Ala Gly Ser Leu Thr Gly Leu Leu Leu Leu Gln 
            20                  25                  30          


Ala Val Ser Trp Ala Ser Gly Ala Arg Pro Cys Ile Pro Lys Ser Phe 
        35                  40                  45              


Gly Tyr Ser Ser Val Val Cys Val Cys Asn Ala Thr Tyr Cys Asp Ser 
    50                  55                  60                  


Phe Asp Pro Pro Thr Phe Pro Ala Leu Gly Thr Phe Ser Arg Tyr Glu 
65                  70                  75                  80  


Ser Thr Arg Ser Gly Arg Arg Met Glu Leu Ser Met Gly Pro Ile Gln 
                85                  90                  95      


Ala Asn His Thr Gly Thr Gly Leu Leu Leu Thr Leu Gln Pro Glu Gln 
            100                 105                 110         


Lys Phe Gln Lys Val Lys Gly Phe Gly Gly Ala Met Thr Asp Ala Ala 
        115                 120                 125             


Ala Leu Asn Ile Leu Ala Leu Ser Pro Pro Ala Gln Asn Leu Leu Leu 
    130                 135                 140                 


Lys Ser Tyr Phe Ser Glu Glu Gly Ile Gly Tyr Asn Ile Ile Arg Val 
145                 150                 155                 160 


Pro Met Ala Ser Cys Asp Phe Ser Ile Arg Thr Tyr Thr Tyr Ala Asp 
                165                 170                 175     


Thr Pro Asp Asp Phe Gln Leu His Asn Phe Ser Leu Pro Glu Glu Asp 
            180                 185                 190         


Thr Lys Leu Lys Ile Pro Leu Ile His Arg Ala Leu Gln Leu Ala Gln 
        195                 200                 205             


Arg Pro Val Ser Leu Leu Ala Ser Pro Trp Thr Ser Pro Thr Trp Leu 
    210                 215                 220                 


Lys Thr Asn Gly Ala Val Asn Gly Lys Gly Ser Leu Lys Gly Gln Pro 
225                 230                 235                 240 


Gly Asp Ile Tyr His Gln Thr Trp Ala Arg Tyr Phe Val Lys Phe Leu 
                245                 250                 255     


Asp Ala Tyr Ala Glu His Lys Leu Gln Phe Trp Ala Val Thr Ala Glu 
            260                 265                 270         


Asn Glu Pro Ser Ala Gly Leu Leu Ser Gly Tyr Pro Phe Gln Cys Leu 
        275                 280                 285             


Gly Phe Thr Pro Glu His Gln Arg Asp Phe Ile Ala Arg Asp Leu Gly 
    290                 295                 300                 


Pro Thr Leu Ala Asn Ser Thr His His Asn Val Arg Leu Leu Met Leu 
305                 310                 315                 320 


Asp Asp Gln Arg Leu Leu Leu Pro His Trp Ala Lys Val Val Leu Thr 
                325                 330                 335     


Asp Pro Glu Ala Ala Lys Tyr Val His Gly Ile Ala Val His Trp Tyr 
            340                 345                 350         


Leu Asp Phe Leu Ala Pro Ala Lys Ala Thr Leu Gly Glu Thr His Arg 
        355                 360                 365             


Leu Phe Pro Asn Thr Met Leu Phe Ala Ser Glu Ala Cys Val Gly Ser 
    370                 375                 380                 


Lys Phe Trp Glu Gln Ser Val Arg Leu Gly Ser Trp Asp Arg Gly Met 
385                 390                 395                 400 


Gln Tyr Ser His Ser Ile Ile Thr Asn Leu Leu Tyr His Val Val Gly 
                405                 410                 415     


Trp Thr Asp Trp Asn Leu Ala Leu Asn Pro Glu Gly Gly Pro Asn Trp 
            420                 425                 430         


Val Arg Asn Phe Val Asp Ser Pro Ile Ile Val Asp Ile Thr Lys Asp 
        435                 440                 445             


Thr Phe Tyr Lys Gln Pro Met Phe Tyr His Leu Gly His Phe Ser Lys 
    450                 455                 460                 


Phe Ile Pro Glu Gly Ser Gln Arg Val Gly Leu Val Ala Ser Gln Lys 
465                 470                 475                 480 


Asn Asp Leu Asp Ala Val Ala Leu Met His Pro Asp Gly Ser Ala Val 
                485                 490                 495     


Val Val Val Leu Asn Arg Ser Ser Lys Asp Val Pro Leu Thr Ile Lys 
            500                 505                 510         


Asp Pro Ala Val Gly Phe Leu Glu Thr Ile Ser Pro Gly Tyr Ser Ile 
        515                 520                 525             


His Thr Tyr Leu Trp Arg Arg Gln 
    530                 535     


<210>  141
<211>  653
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(653)
<223>  Iduronidase alpha-L- (IDUA)

<400>  141

Met Arg Pro Leu Arg Pro Arg Ala Ala Leu Leu Ala Leu Leu Ala Ser 
1               5                   10                  15      


Leu Leu Ala Ala Pro Pro Val Ala Pro Ala Glu Ala Pro His Leu Val 
            20                  25                  30          


His Val Asp Ala Ala Arg Ala Leu Trp Pro Leu Arg Arg Phe Trp Arg 
        35                  40                  45              


Ser Thr Gly Phe Cys Pro Pro Leu Pro His Ser Gln Ala Asp Gln Tyr 
    50                  55                  60                  


Val Leu Ser Trp Asp Gln Gln Leu Asn Leu Ala Tyr Val Gly Ala Val 
65                  70                  75                  80  


Pro His Arg Gly Ile Lys Gln Val Arg Thr His Trp Leu Leu Glu Leu 
                85                  90                  95      


Val Thr Thr Arg Gly Ser Thr Gly Arg Gly Leu Ser Tyr Asn Phe Thr 
            100                 105                 110         


His Leu Asp Gly Tyr Leu Asp Leu Leu Arg Glu Asn Gln Leu Leu Pro 
        115                 120                 125             


Gly Phe Glu Leu Met Gly Ser Ala Ser Gly His Phe Thr Asp Phe Glu 
    130                 135                 140                 


Asp Lys Gln Gln Val Phe Glu Trp Lys Asp Leu Val Ser Ser Leu Ala 
145                 150                 155                 160 


Arg Arg Tyr Ile Gly Arg Tyr Gly Leu Ala His Val Ser Lys Trp Asn 
                165                 170                 175     


Phe Glu Thr Trp Asn Glu Pro Asp His His Asp Phe Asp Asn Val Ser 
            180                 185                 190         


Met Thr Met Gln Gly Phe Leu Asn Tyr Tyr Asp Ala Cys Ser Glu Gly 
        195                 200                 205             


Leu Arg Ala Ala Ser Pro Ala Leu Arg Leu Gly Gly Pro Gly Asp Ser 
    210                 215                 220                 


Phe His Thr Pro Pro Arg Ser Pro Leu Ser Trp Gly Leu Leu Arg His 
225                 230                 235                 240 


Cys His Asp Gly Thr Asn Phe Phe Thr Gly Glu Ala Gly Val Arg Leu 
                245                 250                 255     


Asp Tyr Ile Ser Leu His Arg Lys Gly Ala Arg Ser Ser Ile Ser Ile 
            260                 265                 270         


Leu Glu Gln Glu Lys Val Val Ala Gln Gln Ile Arg Gln Leu Phe Pro 
        275                 280                 285             


Lys Phe Ala Asp Thr Pro Ile Tyr Asn Asp Glu Ala Asp Pro Leu Val 
    290                 295                 300                 


Gly Trp Ser Leu Pro Gln Pro Trp Arg Ala Asp Val Thr Tyr Ala Ala 
305                 310                 315                 320 


Met Val Val Lys Val Ile Ala Gln His Gln Asn Leu Leu Leu Ala Asn 
                325                 330                 335     


Thr Thr Ser Ala Phe Pro Tyr Ala Leu Leu Ser Asn Asp Asn Ala Phe 
            340                 345                 350         


Leu Ser Tyr His Pro His Pro Phe Ala Gln Arg Thr Leu Thr Ala Arg 
        355                 360                 365             


Phe Gln Val Asn Asn Thr Arg Pro Pro His Val Gln Leu Leu Arg Lys 
    370                 375                 380                 


Pro Val Leu Thr Ala Met Gly Leu Leu Ala Leu Leu Asp Glu Glu Gln 
385                 390                 395                 400 


Leu Trp Ala Glu Val Ser Gln Ala Gly Thr Val Leu Asp Ser Asn His 
                405                 410                 415     


Thr Val Gly Val Leu Ala Ser Ala His Arg Pro Gln Gly Pro Ala Asp 
            420                 425                 430         


Ala Trp Arg Ala Ala Val Leu Ile Tyr Ala Ser Asp Asp Thr Arg Ala 
        435                 440                 445             


His Pro Asn Arg Ser Val Ala Val Thr Leu Arg Leu Arg Gly Val Pro 
    450                 455                 460                 


Pro Gly Pro Gly Leu Val Tyr Val Thr Arg Tyr Leu Asp Asn Gly Leu 
465                 470                 475                 480 


Cys Ser Pro Asp Gly Glu Trp Arg Arg Leu Gly Arg Pro Val Phe Pro 
                485                 490                 495     


Thr Ala Glu Gln Phe Arg Arg Met Arg Ala Ala Glu Asp Pro Val Ala 
            500                 505                 510         


Ala Ala Pro Arg Pro Leu Pro Ala Gly Gly Arg Leu Thr Leu Arg Pro 
        515                 520                 525             


Ala Leu Arg Leu Pro Ser Leu Leu Leu Val His Val Cys Ala Arg Pro 
    530                 535                 540                 


Glu Lys Pro Pro Gly Gln Val Thr Arg Leu Arg Ala Leu Pro Leu Thr 
545                 550                 555                 560 


Gln Gly Gln Leu Val Leu Val Trp Ser Asp Glu His Val Gly Ser Lys 
                565                 570                 575     


Cys Leu Trp Thr Tyr Glu Ile Gln Phe Ser Gln Asp Gly Lys Ala Tyr 
            580                 585                 590         


Thr Pro Val Ser Arg Lys Pro Ser Thr Phe Asn Leu Phe Val Phe Ser 
        595                 600                 605             


Pro Asp Thr Gly Ala Val Ser Gly Ser Tyr Arg Val Arg Ala Leu Asp 
    610                 615                 620                 


Tyr Trp Ala Arg Pro Gly Pro Phe Ser Asp Pro Val Pro Tyr Leu Glu 
625                 630                 635                 640 


Val Pro Val Pro Arg Gly Pro Pro Ser Pro Gly Asn Pro 
                645                 650             


<210>  142
<211>  525
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(525)
<223>  Cytochrome P450 family 4 subfamily V member 2 (CYP4V2)

<400>  142

Met Ala Gly Leu Trp Leu Gly Leu Val Trp Gln Lys Leu Leu Leu Trp 
1               5                   10                  15      


Gly Ala Ala Ser Ala Leu Ser Leu Ala Gly Ala Ser Leu Val Leu Ser 
            20                  25                  30          


Leu Leu Gln Arg Val Ala Ser Tyr Ala Arg Lys Trp Gln Gln Met Arg 
        35                  40                  45              


Pro Ile Pro Thr Val Ala Arg Ala Tyr Pro Leu Val Gly His Ala Leu 
    50                  55                  60                  


Leu Met Lys Pro Asp Gly Arg Glu Phe Phe Gln Gln Ile Ile Glu Tyr 
65                  70                  75                  80  


Thr Glu Glu Tyr Arg His Met Pro Leu Leu Lys Leu Trp Val Gly Pro 
                85                  90                  95      


Val Pro Met Val Ala Leu Tyr Asn Ala Glu Asn Val Glu Val Ile Leu 
            100                 105                 110         


Thr Ser Ser Lys Gln Ile Asp Lys Ser Ser Met Tyr Lys Phe Leu Glu 
        115                 120                 125             


Pro Trp Leu Gly Leu Gly Leu Leu Thr Ser Thr Gly Asn Lys Trp Arg 
    130                 135                 140                 


Ser Arg Arg Lys Met Leu Thr Pro Thr Phe His Phe Thr Ile Leu Glu 
145                 150                 155                 160 


Asp Phe Leu Asp Ile Met Asn Glu Gln Ala Asn Ile Leu Val Lys Lys 
                165                 170                 175     


Leu Glu Lys His Ile Asn Gln Glu Ala Phe Asn Cys Phe Phe Tyr Ile 
            180                 185                 190         


Thr Leu Cys Ala Leu Asp Ile Ile Cys Glu Thr Ala Met Gly Lys Asn 
        195                 200                 205             


Ile Gly Ala Gln Ser Asn Asp Asp Ser Glu Tyr Val Arg Ala Val Tyr 
    210                 215                 220                 


Arg Met Ser Glu Met Ile Phe Arg Arg Ile Lys Met Pro Trp Leu Trp 
225                 230                 235                 240 


Leu Asp Leu Trp Tyr Leu Met Phe Lys Glu Gly Trp Glu His Lys Lys 
                245                 250                 255     


Ser Leu Gln Ile Leu His Thr Phe Thr Asn Ser Val Ile Ala Glu Arg 
            260                 265                 270         


Ala Asn Glu Met Asn Ala Asn Glu Asp Cys Arg Gly Asp Gly Arg Gly 
        275                 280                 285             


Ser Ala Pro Ser Lys Asn Lys Arg Arg Ala Phe Leu Asp Leu Leu Leu 
    290                 295                 300                 


Ser Val Thr Asp Asp Glu Gly Asn Arg Leu Ser His Glu Asp Ile Arg 
305                 310                 315                 320 


Glu Glu Val Asp Thr Phe Met Phe Glu Gly His Asp Thr Thr Ala Ala 
                325                 330                 335     


Ala Ile Asn Trp Ser Leu Tyr Leu Leu Gly Ser Asn Pro Glu Val Gln 
            340                 345                 350         


Lys Lys Val Asp His Glu Leu Asp Asp Val Phe Gly Lys Ser Asp Arg 
        355                 360                 365             


Pro Ala Thr Val Glu Asp Leu Lys Lys Leu Arg Tyr Leu Glu Cys Val 
    370                 375                 380                 


Ile Lys Glu Thr Leu Arg Leu Phe Pro Ser Val Pro Leu Phe Ala Arg 
385                 390                 395                 400 


Ser Val Ser Glu Asp Cys Glu Val Ala Gly Tyr Arg Val Leu Lys Gly 
                405                 410                 415     


Thr Glu Ala Val Ile Ile Pro Tyr Ala Leu His Arg Asp Pro Arg Tyr 
            420                 425                 430         


Phe Pro Asn Pro Glu Glu Phe Gln Pro Glu Arg Phe Phe Pro Glu Asn 
        435                 440                 445             


Ala Gln Gly Arg His Pro Tyr Ala Tyr Val Pro Phe Ser Ala Gly Pro 
    450                 455                 460                 


Arg Asn Cys Ile Gly Gln Lys Phe Ala Val Met Glu Glu Lys Thr Ile 
465                 470                 475                 480 


Leu Ser Cys Ile Leu Arg His Phe Trp Ile Glu Ser Asn Gln Lys Arg 
                485                 490                 495     


Glu Glu Leu Gly Leu Glu Gly Gln Leu Ile Leu Arg Pro Ser Asn Gly 
            500                 505                 510         


Ile Trp Ile Lys Leu Lys Arg Arg Asn Ala Asp Glu Arg 
        515                 520                 525 


<210>  143
<211>  236
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(236)
<223>  Retinoschisin 1 (RS1)

<400>  143

Met Ser Arg Lys Ile Glu Gly Phe Leu Leu Leu Leu Leu Phe Gly Tyr 
1               5                   10                  15      


Glu Ala Thr Leu Gly Leu Ser Ser Thr Glu Asp Glu Gly Glu Asp Pro 
            20                  25                  30          


Trp Tyr Gln Lys Ala Cys Asp Glu Gly Glu Asp Pro Trp Tyr Gln Lys 
        35                  40                  45              


Ala Cys Lys Cys Asp Cys Gln Gly Gly Pro Asn Ala Leu Trp Ser Ala 
    50                  55                  60                  


Gly Ala Thr Ser Leu Asp Cys Ile Pro Glu Cys Pro Tyr His Lys Pro 
65                  70                  75                  80  


Leu Gly Phe Glu Ser Gly Glu Val Thr Pro Asp Gln Ile Thr Cys Ser 
                85                  90                  95      


Asn Pro Glu Gln Tyr Val Gly Trp Tyr Ser Ser Trp Thr Ala Asn Lys 
            100                 105                 110         


Ala Arg Leu Asn Ser Gln Gly Phe Gly Cys Ala Trp Leu Ser Lys Phe 
        115                 120                 125             


Gln Asp Ser Ser Gln Trp Leu Gln Ile Asp Leu Lys Glu Ile Lys Val 
    130                 135                 140                 


Ile Ser Gly Ile Leu Thr Gln Gly Arg Cys Asp Ile Asp Glu Trp Met 
145                 150                 155                 160 


Thr Lys Tyr Ser Val Gln Tyr Arg Thr Asp Glu Arg Leu Asn Trp Ile 
                165                 170                 175     


Tyr Tyr Lys Asp Gln Thr Gly Asn Asn Arg Val Phe Tyr Gly Asn Ser 
            180                 185                 190         


Asp Arg Thr Ser Thr Val Gln Asn Leu Leu Arg Pro Pro Ile Ile Ser 
        195                 200                 205             


Arg Phe Ile Arg Leu Ile Pro Leu Gly Trp His Val Arg Ile Ala Ile 
    210                 215                 220                 


Arg Met Glu Leu Leu Glu Cys Val Ser Lys Cys Ala 
225                 230                 235     


<210>  144
<211>  854
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(854)
<223>  Phosphodiesterase 6B (PDE6B)

<400>  144

Met Ser Leu Ser Glu Glu Gln Ala Arg Ser Phe Leu Asp Gln Asn Pro 
1               5                   10                  15      


Asp Phe Ala Arg Gln Tyr Phe Gly Lys Lys Leu Ser Pro Glu Asn Val 
            20                  25                  30          


Ala Ala Ala Cys Glu Asp Gly Cys Pro Pro Asp Cys Asp Ser Leu Arg 
        35                  40                  45              


Asp Leu Cys Gln Val Glu Glu Ser Thr Ala Leu Leu Glu Leu Val Gln 
    50                  55                  60                  


Asp Met Gln Glu Ser Ile Asn Met Glu Arg Val Val Phe Lys Val Leu 
65                  70                  75                  80  


Arg Arg Leu Cys Thr Leu Leu Gln Ala Asp Arg Cys Ser Leu Phe Met 
                85                  90                  95      


Tyr Arg Gln Arg Asn Gly Val Ala Glu Leu Ala Thr Arg Leu Phe Ser 
            100                 105                 110         


Val Gln Pro Asp Ser Val Leu Glu Asp Cys Leu Val Pro Pro Asp Ser 
        115                 120                 125             


Glu Ile Val Phe Pro Leu Asp Ile Gly Val Val Gly His Val Ala Gln 
    130                 135                 140                 


Thr Lys Lys Met Val Asn Val Glu Asp Val Ala Glu Cys Pro His Phe 
145                 150                 155                 160 


Ser Ser Phe Ala Asp Glu Leu Thr Asp Tyr Lys Thr Lys Asn Met Leu 
                165                 170                 175     


Ala Thr Pro Ile Met Asn Gly Lys Asp Val Val Ala Val Ile Met Ala 
            180                 185                 190         


Val Asn Lys Leu Asn Gly Pro Phe Phe Thr Ser Glu Asp Glu Asp Val 
        195                 200                 205             


Phe Leu Lys Tyr Leu Asn Phe Ala Thr Leu Tyr Leu Lys Ile Tyr His 
    210                 215                 220                 


Leu Ser Tyr Leu His Asn Cys Glu Thr Arg Arg Gly Gln Val Leu Leu 
225                 230                 235                 240 


Trp Ser Ala Asn Lys Val Phe Glu Glu Leu Thr Asp Ile Glu Arg Gln 
                245                 250                 255     


Phe His Lys Ala Phe Tyr Thr Val Arg Ala Tyr Leu Asn Cys Glu Arg 
            260                 265                 270         


Tyr Ser Val Gly Leu Leu Asp Met Thr Lys Glu Lys Glu Phe Phe Asp 
        275                 280                 285             


Val Trp Ser Val Leu Met Gly Glu Ser Gln Pro Tyr Ser Gly Pro Arg 
    290                 295                 300                 


Thr Pro Asp Gly Arg Glu Ile Val Phe Tyr Lys Val Ile Asp Tyr Ile 
305                 310                 315                 320 


Leu His Gly Lys Glu Glu Ile Lys Val Ile Pro Thr Pro Ser Ala Asp 
                325                 330                 335     


His Trp Ala Leu Ala Ser Gly Leu Pro Ser Tyr Val Ala Glu Ser Gly 
            340                 345                 350         


Phe Ile Cys Asn Ile Met Asn Ala Ser Ala Asp Glu Met Phe Lys Phe 
        355                 360                 365             


Gln Glu Gly Ala Leu Asp Asp Ser Gly Trp Leu Ile Lys Asn Val Leu 
    370                 375                 380                 


Ser Met Pro Ile Val Asn Lys Lys Glu Glu Ile Val Gly Val Ala Thr 
385                 390                 395                 400 


Phe Tyr Asn Arg Lys Asp Gly Lys Pro Phe Asp Glu Gln Asp Glu Val 
                405                 410                 415     


Leu Met Glu Ser Leu Thr Gln Phe Leu Gly Trp Ser Val Met Asn Thr 
            420                 425                 430         


Asp Thr Tyr Asp Lys Met Asn Lys Leu Glu Asn Arg Lys Asp Ile Ala 
        435                 440                 445             


Gln Asp Met Val Leu Tyr His Val Lys Cys Asp Arg Asp Glu Ile Gln 
    450                 455                 460                 


Leu Ile Leu Pro Thr Arg Ala Arg Leu Gly Lys Glu Pro Ala Asp Cys 
465                 470                 475                 480 


Asp Glu Asp Glu Leu Gly Glu Ile Leu Lys Glu Glu Leu Pro Gly Pro 
                485                 490                 495     


Thr Thr Phe Asp Ile Tyr Glu Phe His Phe Ser Asp Leu Glu Cys Thr 
            500                 505                 510         


Glu Leu Asp Leu Val Lys Cys Gly Ile Gln Met Tyr Tyr Glu Leu Gly 
        515                 520                 525             


Val Val Arg Lys Phe Gln Ile Pro Gln Glu Val Leu Val Arg Phe Leu 
    530                 535                 540                 


Phe Ser Ile Ser Lys Gly Tyr Arg Arg Ile Thr Tyr His Asn Trp Arg 
545                 550                 555                 560 


His Gly Phe Asn Val Ala Gln Thr Met Phe Thr Leu Leu Met Thr Gly 
                565                 570                 575     


Lys Leu Lys Ser Tyr Tyr Thr Asp Leu Glu Ala Phe Ala Met Val Thr 
            580                 585                 590         


Ala Gly Leu Cys His Asp Ile Asp His Arg Gly Thr Asn Asn Leu Tyr 
        595                 600                 605             


Gln Met Lys Ser Gln Asn Pro Leu Ala Lys Leu His Gly Ser Ser Ile 
    610                 615                 620                 


Leu Glu Arg His His Leu Glu Phe Gly Lys Phe Leu Leu Ser Glu Glu 
625                 630                 635                 640 


Thr Leu Asn Ile Tyr Gln Asn Leu Asn Arg Arg Gln His Glu His Val 
                645                 650                 655     


Ile His Leu Met Asp Ile Ala Ile Ile Ala Thr Asp Leu Ala Leu Tyr 
            660                 665                 670         


Phe Lys Lys Arg Ala Met Phe Gln Lys Ile Val Asp Glu Ser Lys Asn 
        675                 680                 685             


Tyr Gln Asp Lys Lys Ser Trp Val Glu Tyr Leu Ser Leu Glu Thr Thr 
    690                 695                 700                 


Arg Lys Glu Ile Val Met Ala Met Met Met Thr Ala Cys Asp Leu Ser 
705                 710                 715                 720 


Ala Ile Thr Lys Pro Trp Glu Val Gln Ser Lys Val Ala Leu Leu Val 
                725                 730                 735     


Ala Ala Glu Phe Trp Glu Gln Gly Asp Leu Glu Arg Thr Val Leu Asp 
            740                 745                 750         


Gln Gln Pro Ile Pro Met Met Asp Arg Asn Lys Ala Ala Glu Leu Pro 
        755                 760                 765             


Lys Leu Gln Val Gly Phe Ile Asp Phe Val Cys Thr Phe Val Tyr Lys 
    770                 775                 780                 


Glu Phe Ser Arg Phe His Glu Glu Ile Leu Pro Met Phe Asp Arg Leu 
785                 790                 795                 800 


Gln Asn Asn Arg Lys Glu Trp Lys Ala Leu Ala Asp Glu Tyr Glu Ala 
                805                 810                 815     


Lys Val Lys Ala Leu Glu Glu Lys Glu Glu Glu Glu Arg Val Ala Ala 
            820                 825                 830         


Lys Lys Val Gly Thr Glu Ile Cys Asn Gly Gly Pro Ala Pro Lys Ser 
        835                 840                 845             


Ser Thr Cys Cys Ile Leu 
    850                 


<210>  145
<211>  498
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(498)
<223>  Methyl-CpG Binding Protein (MeCP2)

<400>  145

Met Ala Ala Ala Ala Ala Ala Ala Pro Ser Gly Gly Gly Gly Gly Gly 
1               5                   10                  15      


Glu Glu Glu Arg Leu Glu Glu Lys Ser Glu Asp Gln Asp Leu Gln Gly 
            20                  25                  30          


Leu Lys Asp Lys Pro Leu Lys Phe Lys Lys Val Lys Lys Asp Lys Lys 
        35                  40                  45              


Glu Glu Lys Glu Gly Lys His Glu Pro Val Gln Pro Ser Ala His His 
    50                  55                  60                  


Ser Ala Glu Pro Ala Glu Ala Gly Lys Ala Glu Thr Ser Glu Gly Ser 
65                  70                  75                  80  


Gly Ser Ala Pro Ala Val Pro Glu Ala Ser Ala Ser Pro Lys Gln Arg 
                85                  90                  95      


Arg Ser Ile Ile Arg Asp Arg Gly Pro Met Tyr Asp Asp Pro Thr Leu 
            100                 105                 110         


Pro Glu Gly Trp Thr Arg Lys Leu Lys Gln Arg Lys Ser Gly Arg Ser 
        115                 120                 125             


Ala Gly Lys Tyr Asp Val Tyr Leu Ile Asn Pro Gln Gly Lys Ala Phe 
    130                 135                 140                 


Arg Ser Lys Val Glu Leu Ile Ala Tyr Phe Glu Lys Val Gly Asp Thr 
145                 150                 155                 160 


Ser Leu Asp Pro Asn Asp Phe Asp Phe Thr Val Thr Gly Arg Gly Ser 
                165                 170                 175     


Pro Ser Arg Arg Glu Gln Lys Pro Pro Lys Lys Pro Lys Ser Pro Lys 
            180                 185                 190         


Ala Pro Gly Thr Gly Arg Gly Arg Gly Arg Pro Lys Gly Ser Gly Thr 
        195                 200                 205             


Thr Arg Pro Lys Ala Ala Thr Ser Glu Gly Val Gln Val Lys Arg Val 
    210                 215                 220                 


Leu Glu Lys Ser Pro Gly Lys Leu Leu Val Lys Met Pro Phe Gln Thr 
225                 230                 235                 240 


Ser Pro Gly Gly Lys Ala Glu Gly Gly Gly Ala Thr Thr Ser Thr Gln 
                245                 250                 255     


Val Met Val Ile Lys Arg Pro Gly Arg Lys Arg Lys Ala Glu Ala Asp 
            260                 265                 270         


Pro Gln Ala Ile Pro Lys Lys Arg Gly Arg Lys Pro Gly Ser Val Val 
        275                 280                 285             


Ala Ala Ala Ala Ala Glu Ala Lys Lys Lys Ala Val Lys Glu Ser Ser 
    290                 295                 300                 


Ile Arg Ser Val Gln Glu Thr Val Leu Pro Ile Lys Lys Arg Lys Thr 
305                 310                 315                 320 


Arg Glu Thr Val Ser Ile Glu Val Lys Glu Val Val Lys Pro Leu Leu 
                325                 330                 335     


Val Ser Thr Leu Gly Glu Lys Ser Gly Lys Gly Leu Lys Thr Cys Lys 
            340                 345                 350         


Ser Pro Gly Arg Lys Ser Lys Glu Ser Ser Pro Lys Gly Arg Ser Ser 
        355                 360                 365             


Ser Ala Ser Ser Pro Pro Lys Lys Glu His His His His His His His 
    370                 375                 380                 


Ser Glu Ser Pro Lys Ala Pro Val Pro Leu Leu Pro Pro Leu Pro Pro 
385                 390                 395                 400 


Pro Pro Pro Glu Pro Glu Ser Ser Glu Asp Pro Thr Ser Pro Pro Glu 
                405                 410                 415     


Pro Gln Asp Leu Ser Ser Ser Val Cys Lys Glu Glu Lys Met Pro Arg 
            420                 425                 430         


Gly Gly Ser Leu Glu Ser Asp Gly Cys Pro Lys Glu Pro Ala Lys Thr 
        435                 440                 445             


Gln Pro Ala Val Ala Thr Ala Ala Thr Ala Ala Glu Lys Tyr Lys His 
    450                 455                 460                 


Arg Gly Glu Gly Glu Arg Lys Asp Ile Val Ser Ser Ser Met Pro Arg 
465                 470                 475                 480 


Pro Asn Arg Glu Glu Pro Val Asp Ser Arg Thr Pro Val Thr Glu Arg 
                485                 490                 495     


Val Ser 
        


<210>  146
<211>  743
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(743)
<223>  N-acetyl-alpha-glucosaminidase (NAGLU)

<400>  146

Met Glu Ala Val Ala Val Ala Ala Ala Val Gly Val Leu Leu Leu Ala 
1               5                   10                  15      


Gly Ala Gly Gly Ala Ala Gly Asp Glu Ala Arg Glu Ala Ala Ala Val 
            20                  25                  30          


Arg Ala Leu Val Ala Arg Leu Leu Gly Pro Gly Pro Ala Ala Asp Phe 
        35                  40                  45              


Ser Val Ser Val Glu Arg Ala Leu Ala Ala Lys Pro Gly Leu Asp Thr 
    50                  55                  60                  


Tyr Ser Leu Gly Gly Gly Gly Ala Ala Arg Val Arg Val Arg Gly Ser 
65                  70                  75                  80  


Thr Gly Val Ala Ala Ala Ala Gly Leu His Arg Tyr Leu Arg Asp Phe 
                85                  90                  95      


Cys Gly Cys His Val Ala Trp Ser Gly Ser Gln Leu Arg Leu Pro Arg 
            100                 105                 110         


Pro Leu Pro Ala Val Pro Gly Glu Leu Thr Glu Ala Thr Pro Asn Arg 
        115                 120                 125             


Tyr Arg Tyr Tyr Gln Asn Val Cys Thr Gln Ser Tyr Ser Phe Val Trp 
    130                 135                 140                 


Trp Asp Trp Ala Arg Trp Glu Arg Glu Ile Asp Trp Met Ala Leu Asn 
145                 150                 155                 160 


Gly Ile Asn Leu Ala Leu Ala Trp Ser Gly Gln Glu Ala Ile Trp Gln 
                165                 170                 175     


Arg Val Tyr Leu Ala Leu Gly Leu Thr Gln Ala Glu Ile Asn Glu Phe 
            180                 185                 190         


Phe Thr Gly Pro Ala Phe Leu Ala Trp Gly Arg Met Gly Asn Leu His 
        195                 200                 205             


Thr Trp Asp Gly Pro Leu Pro Pro Ser Trp His Ile Lys Gln Leu Tyr 
    210                 215                 220                 


Leu Gln His Arg Val Leu Asp Gln Met Arg Ser Phe Gly Met Thr Pro 
225                 230                 235                 240 


Val Leu Pro Ala Phe Ala Gly His Val Pro Glu Ala Val Thr Arg Val 
                245                 250                 255     


Phe Pro Gln Val Asn Val Thr Lys Met Gly Ser Trp Gly His Phe Asn 
            260                 265                 270         


Cys Ser Tyr Ser Cys Ser Phe Leu Leu Ala Pro Glu Asp Pro Ile Phe 
        275                 280                 285             


Pro Ile Ile Gly Ser Leu Phe Leu Arg Glu Leu Ile Lys Glu Phe Gly 
    290                 295                 300                 


Thr Asp His Ile Tyr Gly Ala Asp Thr Phe Asn Glu Met Gln Pro Pro 
305                 310                 315                 320 


Ser Ser Glu Pro Ser Tyr Leu Ala Ala Ala Thr Thr Ala Val Tyr Glu 
                325                 330                 335     


Ala Met Thr Ala Val Asp Thr Glu Ala Val Trp Leu Leu Gln Gly Trp 
            340                 345                 350         


Leu Phe Gln His Gln Pro Gln Phe Trp Gly Pro Ala Gln Ile Arg Ala 
        355                 360                 365             


Val Leu Gly Ala Val Pro Arg Gly Arg Leu Leu Val Leu Asp Leu Phe 
    370                 375                 380                 


Ala Glu Ser Gln Pro Val Tyr Thr Arg Thr Ala Ser Phe Gln Gly Gln 
385                 390                 395                 400 


Pro Phe Ile Trp Cys Met Leu His Asn Phe Gly Gly Asn His Gly Leu 
                405                 410                 415     


Phe Gly Ala Leu Glu Ala Val Asn Gly Gly Pro Glu Ala Ala Arg Leu 
            420                 425                 430         


Phe Pro Asn Ser Thr Met Val Gly Thr Gly Met Ala Pro Glu Gly Ile 
        435                 440                 445             


Ser Gln Asn Glu Val Val Tyr Ser Leu Met Ala Glu Leu Gly Trp Arg 
    450                 455                 460                 


Lys Asp Pro Val Pro Asp Leu Ala Ala Trp Val Thr Ser Phe Ala Ala 
465                 470                 475                 480 


Arg Arg Tyr Gly Val Ser His Pro Asp Ala Gly Ala Ala Trp Arg Leu 
                485                 490                 495     


Leu Leu Arg Ser Val Tyr Asn Cys Ser Gly Glu Ala Cys Arg Gly His 
            500                 505                 510         


Asn Arg Ser Pro Leu Val Arg Arg Pro Ser Leu Gln Met Asn Thr Ser 
        515                 520                 525             


Ile Trp Tyr Asn Arg Ser Asp Val Phe Glu Ala Trp Arg Leu Leu Leu 
    530                 535                 540                 


Thr Ser Ala Pro Ser Leu Ala Thr Ser Pro Ala Phe Arg Tyr Asp Leu 
545                 550                 555                 560 


Leu Asp Leu Thr Arg Gln Ala Val Gln Glu Leu Val Ser Leu Tyr Tyr 
                565                 570                 575     


Glu Glu Ala Arg Ser Ala Tyr Leu Ser Lys Glu Leu Ala Ser Leu Leu 
            580                 585                 590         


Arg Ala Gly Gly Val Leu Ala Tyr Glu Leu Leu Pro Ala Leu Asp Glu 
        595                 600                 605             


Val Leu Ala Ser Asp Ser Arg Phe Leu Leu Gly Ser Trp Leu Glu Gln 
    610                 615                 620                 


Ala Arg Ala Ala Ala Val Ser Glu Ala Glu Ala Asp Phe Tyr Glu Gln 
625                 630                 635                 640 


Asn Ser Arg Tyr Gln Leu Thr Leu Trp Gly Pro Glu Gly Asn Ile Leu 
                645                 650                 655     


Asp Tyr Ala Asn Lys Gln Leu Ala Gly Leu Val Ala Asn Tyr Tyr Thr 
            660                 665                 670         


Pro Arg Trp Arg Leu Phe Leu Glu Ala Leu Val Asp Ser Val Ala Gln 
        675                 680                 685             


Gly Ile Pro Phe Gln Gln His Gln Phe Asp Lys Asn Val Phe Gln Leu 
    690                 695                 700                 


Glu Gln Ala Phe Val Leu Ser Lys Gln Arg Tyr Pro Ser Gln Pro Arg 
705                 710                 715                 720 


Gly Asp Thr Val Asp Leu Ala Lys Lys Ile Phe Leu Lys Tyr Tyr Pro 
                725                 730                 735     


Gly Trp Val Ala Gly Ser Trp 
            740             


<210>  147

<400>  147
000

<210>  148
<211>  429
<212>  PRT
<213>  Homo sapiens

<220>
<221>  MISC_FEATURE
<222>  (1)..(429)
<223>  Alpha-Galactosidase A (GLA)

<400>  148

Met Gln Leu Arg Asn Pro Glu Leu His Leu Gly Cys Ala Leu Ala Leu 
1               5                   10                  15      


Arg Phe Leu Ala Leu Val Ser Trp Asp Ile Pro Gly Ala Arg Ala Leu 
            20                  25                  30          


Asp Asn Gly Leu Ala Arg Thr Pro Thr Met Gly Trp Leu His Trp Glu 
        35                  40                  45              


Arg Phe Met Cys Asn Leu Asp Cys Gln Glu Glu Pro Asp Ser Cys Ile 
    50                  55                  60                  


Ser Glu Lys Leu Phe Met Glu Met Ala Glu Leu Met Val Ser Glu Gly 
65                  70                  75                  80  


Trp Lys Asp Ala Gly Tyr Glu Tyr Leu Cys Ile Asp Asp Cys Trp Met 
                85                  90                  95      


Ala Pro Gln Arg Asp Ser Glu Gly Arg Leu Gln Ala Asp Pro Gln Arg 
            100                 105                 110         


Phe Pro His Gly Ile Arg Gln Leu Ala Asn Tyr Val His Ser Lys Gly 
        115                 120                 125             


Leu Lys Leu Gly Ile Tyr Ala Asp Val Gly Asn Lys Thr Cys Ala Gly 
    130                 135                 140                 


Phe Pro Gly Ser Phe Gly Tyr Tyr Asp Ile Asp Ala Gln Thr Phe Ala 
145                 150                 155                 160 


Asp Trp Gly Val Asp Leu Leu Lys Phe Asp Gly Cys Tyr Cys Asp Ser 
                165                 170                 175     


Leu Glu Asn Leu Ala Asp Gly Tyr Lys His Met Ser Leu Ala Leu Asn 
            180                 185                 190         


Arg Thr Gly Arg Ser Ile Val Tyr Ser Cys Glu Trp Pro Leu Tyr Met 
        195                 200                 205             


Trp Pro Phe Gln Lys Pro Asn Tyr Thr Glu Ile Arg Gln Tyr Cys Asn 
    210                 215                 220                 


His Trp Arg Asn Phe Ala Asp Ile Asp Asp Ser Trp Lys Ser Ile Lys 
225                 230                 235                 240 


Ser Ile Leu Asp Trp Thr Ser Phe Asn Gln Glu Arg Ile Val Asp Val 
                245                 250                 255     


Ala Gly Pro Gly Gly Trp Asn Asp Pro Asp Met Leu Val Ile Gly Asn 
            260                 265                 270         


Phe Gly Leu Ser Trp Asn Gln Gln Val Thr Gln Met Ala Leu Trp Ala 
        275                 280                 285             


Ile Met Ala Ala Pro Leu Phe Met Ser Asn Asp Leu Arg His Ile Ser 
    290                 295                 300                 


Pro Gln Ala Lys Ala Leu Leu Gln Asp Lys Asp Val Ile Ala Ile Asn 
305                 310                 315                 320 


Gln Asp Pro Leu Gly Lys Gln Gly Tyr Gln Leu Arg Gln Gly Asp Asn 
                325                 330                 335     


Phe Glu Val Trp Glu Arg Pro Leu Ser Gly Leu Ala Trp Ala Val Ala 
            340                 345                 350         


Met Ile Asn Arg Gln Glu Ile Gly Gly Pro Arg Ser Tyr Thr Ile Ala 
        355                 360                 365             


Val Ala Ser Leu Gly Lys Gly Val Ala Cys Asn Pro Ala Cys Phe Ile 
    370                 375                 380                 


Thr Gln Leu Leu Pro Val Lys Arg Lys Leu Gly Phe Tyr Glu Trp Thr 
385                 390                 395                 400 


Ser Arg Leu Arg Ser His Ile Asn Pro Thr Gly Thr Val Leu Leu Gln 
                405                 410                 415     


Leu Glu Asn Thr Met Gln Met Ser Leu Lys Asp Leu Leu 
            420                 425                 


<210>  149
<211>  458
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized + GET, CO1-GLA-GET

<400>  149

Met Gln Leu Arg Asn Pro Glu Leu His Leu Gly Cys Ala Leu Ala Leu 
1               5                   10                  15      


Arg Phe Leu Ala Leu Val Ser Trp Asp Ile Pro Gly Ala Arg Ala Leu 
            20                  25                  30          


Asp Asn Gly Leu Ala Arg Thr Pro Thr Met Gly Trp Leu His Trp Glu 
        35                  40                  45              


Arg Phe Met Cys Asn Leu Asp Cys Gln Glu Glu Pro Asp Ser Cys Ile 
    50                  55                  60                  


Ser Glu Lys Leu Phe Met Glu Met Ala Glu Leu Met Val Ser Glu Gly 
65                  70                  75                  80  


Trp Lys Asp Ala Gly Tyr Glu Tyr Leu Cys Ile Asp Asp Cys Trp Met 
                85                  90                  95      


Ala Pro Gln Arg Asp Ser Glu Gly Arg Leu Gln Ala Asp Pro Gln Arg 
            100                 105                 110         


Phe Pro His Gly Ile Arg Gln Leu Ala Asn Tyr Val His Ser Lys Gly 
        115                 120                 125             


Leu Lys Leu Gly Ile Tyr Ala Asp Val Gly Asn Lys Thr Cys Ala Gly 
    130                 135                 140                 


Phe Pro Gly Ser Phe Gly Tyr Tyr Asp Ile Asp Ala Gln Thr Phe Ala 
145                 150                 155                 160 


Asp Trp Gly Val Asp Leu Leu Lys Phe Asp Gly Cys Tyr Cys Asp Ser 
                165                 170                 175     


Leu Glu Asn Leu Ala Asp Gly Tyr Lys His Met Ser Leu Ala Leu Asn 
            180                 185                 190         


Arg Thr Gly Arg Ser Ile Val Tyr Ser Cys Glu Trp Pro Leu Tyr Met 
        195                 200                 205             


Trp Pro Phe Gln Lys Pro Asn Tyr Thr Glu Ile Arg Gln Tyr Cys Asn 
    210                 215                 220                 


His Trp Arg Asn Phe Ala Asp Ile Asp Asp Ser Trp Lys Ser Ile Lys 
225                 230                 235                 240 


Ser Ile Leu Asp Trp Thr Ser Phe Asn Gln Glu Arg Ile Val Asp Val 
                245                 250                 255     


Ala Gly Pro Gly Gly Trp Asn Asp Pro Asp Met Leu Val Ile Gly Asn 
            260                 265                 270         


Phe Gly Leu Ser Trp Asn Gln Gln Val Thr Gln Met Ala Leu Trp Ala 
        275                 280                 285             


Ile Met Ala Ala Pro Leu Phe Met Ser Asn Asp Leu Arg His Ile Ser 
    290                 295                 300                 


Pro Gln Ala Lys Ala Leu Leu Gln Asp Lys Asp Val Ile Ala Ile Asn 
305                 310                 315                 320 


Gln Asp Pro Leu Gly Lys Gln Gly Tyr Gln Leu Arg Gln Gly Asp Asn 
                325                 330                 335     


Phe Glu Val Trp Glu Arg Pro Leu Ser Gly Leu Ala Trp Ala Val Ala 
            340                 345                 350         


Met Ile Asn Arg Gln Glu Ile Gly Gly Pro Arg Ser Tyr Thr Ile Ala 
        355                 360                 365             


Val Ala Ser Leu Gly Lys Gly Val Ala Cys Asn Pro Ala Cys Phe Ile 
    370                 375                 380                 


Thr Gln Leu Leu Pro Val Lys Arg Lys Leu Gly Phe Tyr Glu Trp Thr 
385                 390                 395                 400 


Ser Arg Leu Arg Ser His Ile Asn Pro Thr Gly Thr Val Leu Leu Gln 
                405                 410                 415     


Leu Glu Asn Thr Met Gln Met Ser Leu Lys Asp Leu Leu Arg Arg Arg 
            420                 425                 430         


Arg Arg Arg Arg Arg Lys Arg Lys Lys Lys Gly Lys Gly Leu Gly Lys 
        435                 440                 445             


Lys Arg Asp Pro Cys Leu Arg Lys Tyr Lys 
    450                 455             


<210>  150
<211>  1428
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized, Cystic Fibrosis Transmembrane Regulator deltaR 
       (CFTRdeltaR) contains R domain deletion

<400>  150

Met Gln Arg Ser Pro Leu Glu Lys Ala Ser Val Val Ser Lys Leu Phe 
1               5                   10                  15      


Phe Ser Trp Thr Arg Pro Ile Leu Arg Lys Gly Tyr Arg Gln Arg Leu 
            20                  25                  30          


Glu Leu Ser Asp Ile Tyr Gln Ile Pro Ser Val Asp Ser Ala Asp Asn 
        35                  40                  45              


Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys 
    50                  55                  60                  


Lys Asn Pro Lys Leu Ile Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg 
65                  70                  75                  80  


Phe Met Phe Tyr Gly Ile Phe Leu Tyr Leu Gly Glu Val Thr Lys Ala 
                85                  90                  95      


Val Gln Pro Leu Leu Leu Gly Arg Ile Ile Ala Ser Tyr Asp Pro Asp 
            100                 105                 110         


Asn Lys Glu Glu Arg Ser Ile Ala Ile Tyr Leu Gly Ile Gly Leu Cys 
        115                 120                 125             


Leu Leu Phe Ile Val Arg Thr Leu Leu Leu His Pro Ala Ile Phe Gly 
    130                 135                 140                 


Leu His His Ile Gly Met Gln Met Arg Ile Ala Met Phe Ser Leu Ile 
145                 150                 155                 160 


Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys Ile Ser 
                165                 170                 175     


Ile Gly Gln Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp 
            180                 185                 190         


Glu Gly Leu Ala Leu Ala His Phe Val Trp Ile Ala Pro Leu Gln Val 
        195                 200                 205             


Ala Leu Leu Met Gly Leu Ile Trp Glu Leu Leu Gln Ala Ser Ala Phe 
    210                 215                 220                 


Cys Gly Leu Gly Phe Leu Ile Val Leu Ala Leu Phe Gln Ala Gly Leu 
225                 230                 235                 240 


Gly Arg Met Met Met Lys Tyr Arg Asp Gln Arg Ala Gly Lys Ile Ser 
                245                 250                 255     


Glu Arg Leu Val Ile Thr Ser Glu Met Ile Glu Asn Ile Gln Ser Val 
            260                 265                 270         


Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met Ile Glu Asn Leu 
        275                 280                 285             


Arg Gln Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr 
    290                 295                 300                 


Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu 
305                 310                 315                 320 


Ser Val Leu Pro Tyr Ala Leu Ile Lys Gly Ile Ile Leu Arg Lys Ile 
                325                 330                 335     


Phe Thr Thr Ile Ser Phe Cys Ile Val Leu Arg Met Ala Val Thr Arg 
            340                 345                 350         


Gln Phe Pro Trp Ala Val Gln Thr Trp Tyr Asp Ser Leu Gly Ala Ile 
        355                 360                 365             


Asn Lys Ile Gln Asp Phe Leu Gln Lys Gln Glu Tyr Lys Thr Leu Glu 
    370                 375                 380                 


Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe 
385                 390                 395                 400 


Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gln Asn Asn 
                405                 410                 415     


Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn 
            420                 425                 430         


Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp Ile Asn Phe Lys Ile 
        435                 440                 445             


Glu Arg Gly Gln Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys 
    450                 455                 460                 


Thr Ser Leu Leu Met Met Ile Met Gly Glu Leu Glu Pro Ser Glu Gly 
465                 470                 475                 480 


Lys Ile Lys His Ser Gly Arg Ile Ser Phe Cys Ser Gln Phe Ser Trp 
                485                 490                 495     


Ile Met Pro Gly Thr Ile Lys Glu Asn Ile Ile Phe Gly Val Ser Tyr 
            500                 505                 510         


Asp Glu Tyr Arg Tyr Arg Ser Val Ile Lys Ala Cys Gln Leu Glu Glu 
        515                 520                 525             


Asp Ile Ser Lys Phe Ala Glu Lys Asp Asn Ile Val Leu Gly Glu Gly 
    530                 535                 540                 


Gly Ile Thr Leu Ser Gly Gly Gln Arg Ala Arg Ile Ser Leu Ala Arg 
545                 550                 555                 560 


Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly 
                565                 570                 575     


Tyr Leu Asp Val Leu Thr Glu Lys Glu Ile Phe Glu Ser Cys Val Cys 
            580                 585                 590         


Lys Leu Met Ala Asn Lys Thr Arg Ile Leu Val Thr Ser Lys Met Glu 
        595                 600                 605             


His Leu Lys Lys Ala Asp Lys Ile Leu Ile Leu His Glu Gly Ser Ser 
    610                 615                 620                 


Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gln Asn Leu Gln Pro Asp Phe 
625                 630                 635                 640 


Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gln Phe Ser Ala Glu 
                645                 650                 655     


Arg Arg Asn Ser Ile Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu 
            660                 665                 670         


Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gln Ser Phe Lys 
        675                 680                 685             


Gln Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser Ile Leu Asn Pro 
    690                 695                 700                 


Ile Asn Ser Thr Leu Gln Ala Arg Arg Arg Gln Ser Val Leu Asn Leu 
705                 710                 715                 720 


Met Thr His Ser Val Asn Gln Gly Gln Asn Ile His Arg Lys Thr Thr 
                725                 730                 735     


Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gln Ala Asn Leu Thr Glu 
            740                 745                 750         


Leu Asp Ile Tyr Ser Arg Arg Leu Ser Gln Glu Thr Gly Leu Glu Ile 
        755                 760                 765             


Ser Glu Glu Ile Asn Glu Glu Asp Leu Lys Glu Cys Phe Phe Asp Asp 
    770                 775                 780                 


Met Glu Ser Ile Pro Ala Val Thr Thr Trp Asn Thr Tyr Leu Arg Tyr 
785                 790                 795                 800 


Ile Thr Val His Lys Ser Leu Ile Phe Val Leu Ile Trp Cys Leu Val 
                805                 810                 815     


Ile Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val Leu Trp Leu Leu 
            820                 825                 830         


Gly Asn Thr Pro Leu Gln Asp Lys Gly Asn Ser Thr His Ser Arg Asn 
        835                 840                 845             


Asn Ser Tyr Ala Val Ile Ile Thr Ser Thr Ser Ser Tyr Tyr Val Phe 
    850                 855                 860                 


Tyr Ile Tyr Val Gly Val Ala Asp Thr Leu Leu Ala Met Gly Phe Phe 
865                 870                 875                 880 


Arg Gly Leu Pro Leu Val His Thr Leu Ile Thr Val Ser Lys Ile Leu 
                885                 890                 895     


His His Lys Met Leu His Ser Val Leu Gln Ala Pro Met Ser Thr Leu 
            900                 905                 910         


Asn Thr Leu Lys Ala Gly Gly Ile Leu Asn Arg Phe Ser Lys Asp Ile 
        915                 920                 925             


Ala Ile Leu Asp Asp Leu Leu Pro Leu Thr Ile Phe Asp Phe Ile Gln 
    930                 935                 940                 


Leu Leu Leu Ile Val Ile Gly Ala Ile Ala Val Val Ala Val Leu Gln 
945                 950                 955                 960 


Pro Tyr Ile Phe Val Ala Thr Val Pro Val Ile Val Ala Phe Ile Met 
                965                 970                 975     


Leu Arg Ala Tyr Phe Leu Gln Thr Ser Gln Gln Leu Lys Gln Leu Glu 
            980                 985                 990         


Ser Glu Gly Arg Ser Pro Ile Phe  Thr His Leu Val Thr  Ser Leu Lys 
        995                 1000                 1005             


Gly Leu  Trp Thr Leu Arg Ala  Phe Gly Arg Gln Pro  Tyr Phe Glu 
    1010                 1015                 1020             


Thr Leu  Phe His Lys Ala Leu  Asn Leu His Thr Ala  Asn Trp Phe 
    1025                 1030                 1035             


Leu Tyr  Leu Ser Thr Leu Arg  Trp Phe Gln Met Arg  Ile Glu Met 
    1040                 1045                 1050             


Ile Phe  Val Ile Phe Phe Ile  Ala Val Thr Phe Ile  Ser Ile Leu 
    1055                 1060                 1065             


Thr Thr  Gly Glu Gly Glu Gly  Arg Val Gly Ile Ile  Leu Thr Leu 
    1070                 1075                 1080             


Ala Met  Asn Ile Met Ser Thr  Leu Gln Trp Ala Val  Asn Ser Ser 
    1085                 1090                 1095             


Ile Asp  Val Asp Ser Leu Met  Arg Ser Val Ser Arg  Val Phe Lys 
    1100                 1105                 1110             


Phe Ile  Asp Met Pro Thr Glu  Gly Lys Pro Thr Lys  Ser Thr Lys 
    1115                 1120                 1125             


Pro Tyr  Lys Asn Gly Gln Leu  Ser Lys Val Met Ile  Ile Glu Asn 
    1130                 1135                 1140             


Ser His  Val Lys Lys Asp Asp  Ile Trp Pro Ser Gly  Gly Gln Met 
    1145                 1150                 1155             


Thr Val  Lys Asp Leu Thr Ala  Lys Tyr Thr Glu Gly  Gly Asn Ala 
    1160                 1165                 1170             


Ile Leu  Glu Asn Ile Ser Phe  Ser Ile Ser Pro Gly  Gln Arg Val 
    1175                 1180                 1185             


Gly Leu  Leu Gly Arg Thr Gly  Ser Gly Lys Ser Thr  Leu Leu Ser 
    1190                 1195                 1200             


Ala Phe  Leu Arg Leu Leu Asn  Thr Glu Gly Glu Ile  Gln Ile Asp 
    1205                 1210                 1215             


Gly Val  Ser Trp Asp Ser Ile  Thr Leu Gln Gln Trp  Arg Lys Ala 
    1220                 1225                 1230             


Phe Gly  Val Ile Pro Gln Lys  Val Phe Ile Phe Ser  Gly Thr Phe 
    1235                 1240                 1245             


Arg Lys  Asn Leu Asp Pro Tyr  Glu Gln Trp Ser Asp  Gln Glu Ile 
    1250                 1255                 1260             


Trp Lys  Val Ala Asp Glu Val  Gly Leu Arg Ser Val  Ile Glu Gln 
    1265                 1270                 1275             


Phe Pro  Gly Lys Leu Asp Phe  Val Leu Val Asp Gly  Gly Cys Val 
    1280                 1285                 1290             


Leu Ser  His Gly His Lys Gln  Leu Met Cys Leu Ala  Arg Ser Val 
    1295                 1300                 1305             


Leu Ser  Lys Ala Lys Ile Leu  Leu Leu Asp Glu Pro  Ser Ala His 
    1310                 1315                 1320             


Leu Asp  Pro Val Thr Tyr Gln  Ile Ile Arg Arg Thr  Leu Lys Gln 
    1325                 1330                 1335             


Ala Phe  Ala Asp Cys Thr Val  Ile Leu Cys Glu His  Arg Ile Glu 
    1340                 1345                 1350             


Ala Met  Leu Glu Cys Gln Gln  Phe Leu Val Ile Glu  Glu Asn Lys 
    1355                 1360                 1365             


Val Arg  Gln Tyr Asp Ser Ile  Gln Lys Leu Leu Asn  Glu Arg Ser 
    1370                 1375                 1380             


Leu Phe  Arg Gln Ala Ile Ser  Pro Ser Asp Arg Val  Lys Leu Phe 
    1385                 1390                 1395             


Pro His  Arg Asn Ser Ser Lys  Cys Lys Ser Lys Pro  Gln Ile Ala 
    1400                 1405                 1410             


Ala Leu  Lys Glu Glu Thr Glu  Glu Glu Val Gln Asp  Thr Arg Leu 
    1415                 1420                 1425             


<210>  151
<211>  1480
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized, full length Cystic Fibrosis Transmembrane 
       Regulator (CFTR)

<400>  151

Met Gln Arg Ser Pro Leu Glu Lys Ala Ser Val Val Ser Lys Leu Phe 
1               5                   10                  15      


Phe Ser Trp Thr Arg Pro Ile Leu Arg Lys Gly Tyr Arg Gln Arg Leu 
            20                  25                  30          


Glu Leu Ser Asp Ile Tyr Gln Ile Pro Ser Val Asp Ser Ala Asp Asn 
        35                  40                  45              


Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys 
    50                  55                  60                  


Lys Asn Pro Lys Leu Ile Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg 
65                  70                  75                  80  


Phe Met Phe Tyr Gly Ile Phe Leu Tyr Leu Gly Glu Val Thr Lys Ala 
                85                  90                  95      


Val Gln Pro Leu Leu Leu Gly Arg Ile Ile Ala Ser Tyr Asp Pro Asp 
            100                 105                 110         


Asn Lys Glu Glu Arg Ser Ile Ala Ile Tyr Leu Gly Ile Gly Leu Cys 
        115                 120                 125             


Leu Leu Phe Ile Val Arg Thr Leu Leu Leu His Pro Ala Ile Phe Gly 
    130                 135                 140                 


Leu His His Ile Gly Met Gln Met Arg Ile Ala Met Phe Ser Leu Ile 
145                 150                 155                 160 


Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys Ile Ser 
                165                 170                 175     


Ile Gly Gln Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp 
            180                 185                 190         


Glu Gly Leu Ala Leu Ala His Phe Val Trp Ile Ala Pro Leu Gln Val 
        195                 200                 205             


Ala Leu Leu Met Gly Leu Ile Trp Glu Leu Leu Gln Ala Ser Ala Phe 
    210                 215                 220                 


Cys Gly Leu Gly Phe Leu Ile Val Leu Ala Leu Phe Gln Ala Gly Leu 
225                 230                 235                 240 


Gly Arg Met Met Met Lys Tyr Arg Asp Gln Arg Ala Gly Lys Ile Ser 
                245                 250                 255     


Glu Arg Leu Val Ile Thr Ser Glu Met Ile Glu Asn Ile Gln Ser Val 
            260                 265                 270         


Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met Ile Glu Asn Leu 
        275                 280                 285             


Arg Gln Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr 
    290                 295                 300                 


Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu 
305                 310                 315                 320 


Ser Val Leu Pro Tyr Ala Leu Ile Lys Gly Ile Ile Leu Arg Lys Ile 
                325                 330                 335     


Phe Thr Thr Ile Ser Phe Cys Ile Val Leu Arg Met Ala Val Thr Arg 
            340                 345                 350         


Gln Phe Pro Trp Ala Val Gln Thr Trp Tyr Asp Ser Leu Gly Ala Ile 
        355                 360                 365             


Asn Lys Ile Gln Asp Phe Leu Gln Lys Gln Glu Tyr Lys Thr Leu Glu 
    370                 375                 380                 


Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe 
385                 390                 395                 400 


Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gln Asn Asn 
                405                 410                 415     


Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn 
            420                 425                 430         


Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp Ile Asn Phe Lys Ile 
        435                 440                 445             


Glu Arg Gly Gln Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys 
    450                 455                 460                 


Thr Ser Leu Leu Met Met Ile Met Gly Glu Leu Glu Pro Ser Glu Gly 
465                 470                 475                 480 


Lys Ile Lys His Ser Gly Arg Ile Ser Phe Cys Ser Gln Phe Ser Trp 
                485                 490                 495     


Ile Met Pro Gly Thr Ile Lys Glu Asn Ile Ile Phe Gly Val Ser Tyr 
            500                 505                 510         


Asp Glu Tyr Arg Tyr Arg Ser Val Ile Lys Ala Cys Gln Leu Glu Glu 
        515                 520                 525             


Asp Ile Ser Lys Phe Ala Glu Lys Asp Asn Ile Val Leu Gly Glu Gly 
    530                 535                 540                 


Gly Ile Thr Leu Ser Gly Gly Gln Arg Ala Arg Ile Ser Leu Ala Arg 
545                 550                 555                 560 


Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly 
                565                 570                 575     


Tyr Leu Asp Val Leu Thr Glu Lys Glu Ile Phe Glu Ser Cys Val Cys 
            580                 585                 590         


Lys Leu Met Ala Asn Lys Thr Arg Ile Leu Val Thr Ser Lys Met Glu 
        595                 600                 605             


His Leu Lys Lys Ala Asp Lys Ile Leu Ile Leu His Glu Gly Ser Ser 
    610                 615                 620                 


Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gln Asn Leu Gln Pro Asp Phe 
625                 630                 635                 640 


Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gln Phe Ser Ala Glu 
                645                 650                 655     


Arg Arg Asn Ser Ile Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu 
            660                 665                 670         


Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gln Ser Phe Lys 
        675                 680                 685             


Gln Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser Ile Leu Asn Pro 
    690                 695                 700                 


Ile Asn Ser Ile Arg Lys Phe Ser Ile Val Gln Lys Thr Pro Leu Gln 
705                 710                 715                 720 


Met Asn Gly Ile Glu Glu Asp Ser Asp Glu Pro Leu Glu Arg Arg Leu 
                725                 730                 735     


Ser Leu Val Pro Asp Ser Glu Gln Gly Glu Ala Ile Leu Pro Arg Ile 
            740                 745                 750         


Ser Val Ile Ser Thr Gly Pro Thr Leu Gln Ala Arg Arg Arg Gln Ser 
        755                 760                 765             


Val Leu Asn Leu Met Thr His Ser Val Asn Gln Gly Gln Asn Ile His 
    770                 775                 780                 


Arg Lys Thr Thr Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gln Ala 
785                 790                 795                 800 


Asn Leu Thr Glu Leu Asp Ile Tyr Ser Arg Arg Leu Ser Gln Glu Thr 
                805                 810                 815     


Gly Leu Glu Ile Ser Glu Glu Ile Asn Glu Glu Asp Leu Lys Glu Cys 
            820                 825                 830         


Phe Phe Asp Asp Met Glu Ser Ile Pro Ala Val Thr Thr Trp Asn Thr 
        835                 840                 845             


Tyr Leu Arg Tyr Ile Thr Val His Lys Ser Leu Ile Phe Val Leu Ile 
    850                 855                 860                 


Trp Cys Leu Val Ile Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val 
865                 870                 875                 880 


Leu Trp Leu Leu Gly Asn Thr Pro Leu Gln Asp Lys Gly Asn Ser Thr 
                885                 890                 895     


His Ser Arg Asn Asn Ser Tyr Ala Val Ile Ile Thr Ser Thr Ser Ser 
            900                 905                 910         


Tyr Tyr Val Phe Tyr Ile Tyr Val Gly Val Ala Asp Thr Leu Leu Ala 
        915                 920                 925             


Met Gly Phe Phe Arg Gly Leu Pro Leu Val His Thr Leu Ile Thr Val 
    930                 935                 940                 


Ser Lys Ile Leu His His Lys Met Leu His Ser Val Leu Gln Ala Pro 
945                 950                 955                 960 


Met Ser Thr Leu Asn Thr Leu Lys Ala Gly Gly Ile Leu Asn Arg Phe 
                965                 970                 975     


Ser Lys Asp Ile Ala Ile Leu Asp Asp Leu Leu Pro Leu Thr Ile Phe 
            980                 985                 990         


Asp Phe Ile Gln Leu Leu Leu Ile  Val Ile Gly Ala Ile  Ala Val Val 
        995                 1000                 1005             


Ala Val  Leu Gln Pro Tyr Ile  Phe Val Ala Thr Val  Pro Val Ile 
    1010                 1015                 1020             


Val Ala  Phe Ile Met Leu Arg  Ala Tyr Phe Leu Gln  Thr Ser Gln 
    1025                 1030                 1035             


Gln Leu  Lys Gln Leu Glu Ser  Glu Gly Arg Ser Pro  Ile Phe Thr 
    1040                 1045                 1050             


His Leu  Val Thr Ser Leu Lys  Gly Leu Trp Thr Leu  Arg Ala Phe 
    1055                 1060                 1065             


Gly Arg  Gln Pro Tyr Phe Glu  Thr Leu Phe His Lys  Ala Leu Asn 
    1070                 1075                 1080             


Leu His  Thr Ala Asn Trp Phe  Leu Tyr Leu Ser Thr  Leu Arg Trp 
    1085                 1090                 1095             


Phe Gln  Met Arg Ile Glu Met  Ile Phe Val Ile Phe  Phe Ile Ala 
    1100                 1105                 1110             


Val Thr  Phe Ile Ser Ile Leu  Thr Thr Gly Glu Gly  Glu Gly Arg 
    1115                 1120                 1125             


Val Gly  Ile Ile Leu Thr Leu  Ala Met Asn Ile Met  Ser Thr Leu 
    1130                 1135                 1140             


Gln Trp  Ala Val Asn Ser Ser  Ile Asp Val Asp Ser  Leu Met Arg 
    1145                 1150                 1155             


Ser Val  Ser Arg Val Phe Lys  Phe Ile Asp Met Pro  Thr Glu Gly 
    1160                 1165                 1170             


Lys Pro  Thr Lys Ser Thr Lys  Pro Tyr Lys Asn Gly  Gln Leu Ser 
    1175                 1180                 1185             


Lys Val  Met Ile Ile Glu Asn  Ser His Val Lys Lys  Asp Asp Ile 
    1190                 1195                 1200             


Trp Pro  Ser Gly Gly Gln Met  Thr Val Lys Asp Leu  Thr Ala Lys 
    1205                 1210                 1215             


Tyr Thr  Glu Gly Gly Asn Ala  Ile Leu Glu Asn Ile  Ser Phe Ser 
    1220                 1225                 1230             


Ile Ser  Pro Gly Gln Arg Val  Gly Leu Leu Gly Arg  Thr Gly Ser 
    1235                 1240                 1245             


Gly Lys  Ser Thr Leu Leu Ser  Ala Phe Leu Arg Leu  Leu Asn Thr 
    1250                 1255                 1260             


Glu Gly  Glu Ile Gln Ile Asp  Gly Val Ser Trp Asp  Ser Ile Thr 
    1265                 1270                 1275             


Leu Gln  Gln Trp Arg Lys Ala  Phe Gly Val Ile Pro  Gln Lys Val 
    1280                 1285                 1290             


Phe Ile  Phe Ser Gly Thr Phe  Arg Lys Asn Leu Asp  Pro Tyr Glu 
    1295                 1300                 1305             


Gln Trp  Ser Asp Gln Glu Ile  Trp Lys Val Ala Asp  Glu Val Gly 
    1310                 1315                 1320             


Leu Arg  Ser Val Ile Glu Gln  Phe Pro Gly Lys Leu  Asp Phe Val 
    1325                 1330                 1335             


Leu Val  Asp Gly Gly Cys Val  Leu Ser His Gly His  Lys Gln Leu 
    1340                 1345                 1350             


Met Cys  Leu Ala Arg Ser Val  Leu Ser Lys Ala Lys  Ile Leu Leu 
    1355                 1360                 1365             


Leu Asp  Glu Pro Ser Ala His  Leu Asp Pro Val Thr  Tyr Gln Ile 
    1370                 1375                 1380             


Ile Arg  Arg Thr Leu Lys Gln  Ala Phe Ala Asp Cys  Thr Val Ile 
    1385                 1390                 1395             


Leu Cys  Glu His Arg Ile Glu  Ala Met Leu Glu Cys  Gln Gln Phe 
    1400                 1405                 1410             


Leu Val  Ile Glu Glu Asn Lys  Val Arg Gln Tyr Asp  Ser Ile Gln 
    1415                 1420                 1425             


Lys Leu  Leu Asn Glu Arg Ser  Leu Phe Arg Gln Ala  Ile Ser Pro 
    1430                 1435                 1440             


Ser Asp  Arg Val Lys Leu Phe  Pro His Arg Asn Ser  Ser Lys Cys 
    1445                 1450                 1455             


Lys Ser  Lys Pro Gln Ile Ala  Ala Leu Lys Glu Glu  Thr Glu Glu 
    1460                 1465                 1470             


Glu Val  Gln Asp Thr Arg Leu  
    1475                 1480 


<210>  152
<211>  250
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Mouse U1a promoter

<400>  152
atggaggcgg tactatgtag atgagaattc aggagcaaac tgggaaaagc aactgcttcc       60

aaatatttgt gatttttaca gtgtagtttt ggaaaaactc ttagcctacc aattcttcta      120

agtgttttaa aatgtgggag ccagtacaca tgaagttata gagtgtttta atgaggctta      180

aatatttacc gtaactatga aatgctacgc atatcatgct gttcaggctc cgtggccacg      240

caactcatac                                                             250


<210>  153
<211>  101
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polymerase III H1 mutant promoter

<400>  153
aatatttgca tgtcgctatg tgttctggga aatcaccata aacgtgaaat gtctttggat       60

ttgggaatct tcgaagttct gtatgagacc acagatctcc a                          101


<210>  154
<211>  701
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Chicken beta-actin hybrid promoter CBh (CBh promoter consists 
       of CMV enhancer, CBA promoter, first CBA exon and partial intron)

<400>  154
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata gtaacgccaa tagggacttt ccattgacgt caatgggtgg agtatttacg      120

gtaaactgcc cacttggcag tacatcaagt gtatcatatg ccaagtacgc cccctattga      180

cgtcaatgac ggtaaatggc ccgcctggca ttgtgcccag tacatgacct tatgggactt      240

tcctacttgg cagtacatct acgtattagt catcgctatt accatggtcg aggtgagccc      300

cacgttctgc ttcactctcc ccatctcccc cccctcccca cccccaattt tgtatttatt      360

tattttttaa ttattttgtg cagcgatggg ggcggggggg gggggggggc gcgcgccagg      420

cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg agaggtgcgg cggcagccaa      480

tcagagcggc gcgctccgaa agtttccttt tatggcgagg cggcggcggc ggcggcccta      540

taaaaagcga agcgcgcggc gggcgggagt cgctgcgcgc tgccttcgcc ccgtgccccg      600

ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg accgcgttac tcccacaggt      660

gagcgggcgg gacggccctt ctcctccggg ctgtaattag c                          701


<210>  155
<211>  229
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  MeCP2 min promoter sequence

<400>  155
agctgaatgg ggtccgcctc ttttccctgc ctaaacagac aggaactcct gccaattgag       60

ggcgtcaccg ctaaggctcc gccccagcct gggctccaca accaatgaag ggtaatctcg      120

acaaagagca aggggtgggg cgcgggcgcg caggtgcagc agcacacagg ctggtcggga      180

gggcggggcg cgacgtctgc cgtgcggggt cccggcatcg gttgcgcgc                  229


<210>  156
<211>  737
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  MeCP2 promoter sequence

<400>  156
tcaaaccatc tgattcaaca atgcacgacc gatctcttat gggcttggca cacaccatct       60

gcccattata aacgtctgca aagaccaagg tttgatatgt tgattttact gtcagcctta      120

agagtgcgac atctgctaat ttagtgtaat aatacaatca gtagaccctt taaaacaagt      180

cccttggctt ggaacaacgc caggctcctc aacaggcaac tttgctactt ctacagaaaa      240

tgataataaa gaaatgctgg tgaagtcaaa tgcttatcac aatggtgaac tactcagcag      300

ggaggctcta ataggcgcca agagcctaga cttccttaag cgccagagtc cacaagggcc      360

cagttaatcc tcaacattca aatgctgccc acaaaaccag cccctctgtg ccctagccgc      420

ctcttttttc caagtgacag tagaactcca ccaatccgca gctgaatggg gtccgcctct      480

tttccctgcc taaacagaca ggaactcctg ccaattgagg gcgtcaccgc taaggctccg      540

ccccagcctg ggctccacaa ccaatgaagg gtaatctcga caaagagcaa ggggtggggc      600

gcgggcgcgc aggtgcagca gcacacaggc tggtcgggag ggcggggcgc gacgtctgcc      660

gtgcggggtc ccggcatcgg ttgcgcgcgc gctccctcct ctcggagaga gggctgtggt      720

aaaacccgtc cggaaaa                                                     737


<210>  157
<211>  418
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  MeCP418 promoter sequence

<400>  157
ataggcgcca agagcctaga cttccttaag cgccagagtc cacaagggcc cagttaatcc       60

tcaacattca aatgctgccc acaaaaccag cccctctgtg ccctagccgc ctcttttttc      120

caagtgacag tagaactcca ccaatccgca gctgaatggg gtccgcctct tttccctgcc      180

taaacagaca ggaactcctg ccaattgagg gcgtcaccgc taaggctccg ccccagcctg      240

ggctccacaa ccaatgaagg gtaatctcga caaagagcaa ggggtggggc gcgggcgcgc      300

aggtgcagca gcacacaggc tggtcgggag ggcggggcgc gacgtctgcc gtgcggggtc      360

ccggcatcgg ttgcgcgcgc gctccctcct ctcggagaga gggctgtggt aaaacccg        418


<210>  158
<211>  426
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  MeCP426 promoter sequence

<400>  158
ataggcgcca agagcctaga cttccttaag cgccagagtc cacaagggcc cagttaatcc       60

tcaacattca aatgctgccc acaaaaccag cccctctgtg ccctagccgc ctcttttttc      120

caagtgacag tagaactcca ccaatccgca gctgaatggg gtccgcctct tttccctgcc      180

taaacagaca ggaactcctg ccaattgagg gcgtcaccgc taaggctccg ccccagcctg      240

ggctccacaa ccaatgaagg gtaatctcga caaagagcaa ggggtggggc gcgggcgcgc      300

aggtgcagca gcacacaggc tggtcgggag ggcggggcgc gacgtctgcc gtgcggggtc      360

ccggcatcgg ttgcgcgcgc gctccctcct ctcggagaga gggctgtggt aaaacccgtc      420

cggaaa                                                                 426


<210>  159
<211>  400
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  VMD2 promoter

<400>  159
aattctgtca ttttactagg gtgatgaaat tcccaagcaa caccatcctt ttcagataag       60

ggcactgagg ctgagagagg agctgaaacc tacccggggt caccacacac aggtggcaag      120

gctgggacca gaaaccagga ctgttgactc tggattttag ggccatggta gagggggtgt      180

tgccctaaat tccagccctg gtctcagccc aacaccctcc aagaagaaat tagaggggcc      240

atggccaggc tgtgctagcc gttgcttctg agcagattac aagaagggac taagacaagg      300

actcctttgt ggaggtcctg gcttagggag tcaagtgacg gcggctcagc actcacgtgg      360

gcagtgccag cctctaagag tgggcagggg cactggccac                            400


<210>  160
<211>  136
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  PDE6b promoter

<400>  160
cccatttgta ggagtgagtc agctgacccg cccccggggt tcctaatctc actaagaaag       60

actttgctga tgacagggtt tcctgggagt ccatgcgtgc ctggagcagc agcgtctcca      120

gggacaggca gccacc                                                      136


<210>  161
<211>  2035
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mRho promoter

<400>  161
gcgccaatca gccgatgact tctaacaata ctcttaactc acacagagct tgtctcactg       60

agccaacacc ctgtaccctc agctcagtga cggctttcaa cctgtggggc tgcctctgtt      120

acccaagtga gagagggcca gtgctcccag aggtgacctt gtttgcccat tctctccctg      180

ggtcagccag tgtttatctg ttgtataccc agtccaccct gcaggctcac atcagagcct      240

aggagatggc tagtgtcccc gcggagacca cgatgaagct tcccagctgt ctcaagcaca      300

agctggctgc agaggctgct gaggcactgc tagctgggga tgggggcagg gtagatctgg      360

ggctgaccac cagggtcaga atcagaacct ccaccttgac ctcattaacg ctggtcttaa      420

tcaccaagcc aagctcctta aactgctagt ggccaactcc caggccctga cacacatacc      480

tgccctgtgt tcccaaacaa gacacctgca tggaaggaag ggggttgctt ttctaagcaa      540

acatctagga atcccgggtg cagtgtgagg agactaggcg agggagtact ttaagggcct      600

caaggctcag agaggaatac ttcttccctg gttagcctcg tgcctaggct ccagggtctt      660

tgtcctgcct ggatacctat gtggcaaggg gcatagcatt tcccccacca tcagctctta      720

gctcaacctt atcttctcgg aaagactgcg cagtgtaaca acacagcaga gacttttctt      780

ttgtcccctg tctacccctg taactgctac tcagaagcat ctttctcaca gggtactggc      840

ttcttgcatc cagagttttt tgtctccctc gggcccccag aatcaaattc ttcctctggg      900

actcagtgga tgtttcacac acgtatcggc ctgacagtca tcctggagca tcctacacag      960

gggccatcac agctgcatgt cagaaatgct ggcctcacat cctcagacac caggcctagt     1020

gctggtcttc ctcagactgg cgtccccagc aggccagtag gatcatcttt tagcctacag     1080

agttctgaag cctcagagcc ccaggtccct ggtcatcttc tctgcccctg agatttttcc     1140

aagttgtatg ccttctaggt aaggcaaaac ttcttacgcc cctcctcgtg gcctccaggc     1200

cccacatgct cacctgaata acctggcagc ctgctccctc atgcagggac cacgtcctgc     1260

tgcacccagc aggccatccc gtctccatag cccatggtca tccctccctg gacaggaatg     1320

tgtctcctcc ccgggctgag tcttgctcaa gctagaagca ctccgaacag ggttatgggc     1380

gcctcctcca tctcccaagt ggctggctta tgaatgttta atgtacatgt gagtgaacaa     1440

attccaattg aacgcaacaa atagttatcg agccgctgag ccggggggcg gggggtgtga     1500

gactggaggc gatggacgga gctgacggca cacacagctc agatctgtca agtgagccat     1560

tgtcagggct tggggactgg ataagtcagg gggtctcctg ggaagagatg ggataggtga     1620

gttcaggagg agacattgtc aactggagcc atgtggagaa gtgaatttag ggcccaaagg     1680

ttccagtcgc agcctgaggc caccagactg acatggggag gaattcccag aggactctgg     1740

ggcagacaag atgagacacc ctttcctttc tttacctaag ggcctccacc cgatgtcacc     1800

ttggcccctc tgcaagccaa ttaggccccg gtggcagcag tgggattagc gttagtatga     1860

tatctcgcgg atgctgaatc agcctctggc ttagggagag aaggtcactt tataagggtc     1920

tggggggggt cagtgcctgg agttgcgctg tgggagccgt cagtggctga gctcgccaag     1980

cagccttggt ctctgtctac gaagagcccg tggggcagcc tcgagagccg cagcc          2035


<210>  162
<211>  511
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CMV promoter

<400>  162
ccgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat       60

tgacgtcaat agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac      120

ggtaaactgc ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg      180

acgtcaatga cggtaaatgg cccgcctggc attgtgccca gtacatgacc ttatgggact      240

ttcctacttg gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt      300

ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc      360

ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc      420

gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata      480

taagcagagc tcgtttagtg aaccgtcaga t                                     511


<210>  163
<211>  334
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  UbC promoter

<400>  163
ggcctccgcg ccgggttttg gcgcctcccg cgggcgcccc cctcctcacg gcgagcgctg       60

ccacgtcaga cgaagggcgc agcgagcgtc ctgatccttc cgcccggacg ctcaggacag      120

cggcccgctg ctcataagac tcggccttag aaccccagta tcagcagaag gacattttag      180

gacgggactt gggtgactct agggcactgg ttttctttcc agagagcgga acaggcgagg      240

aaaagtagtc ccttctcggc gattctgcgg agggatctcc gtggggcggt gaacgccgat      300

gattatataa ggacgcgccg ggtgtggcac agct                                  334


