                         SEQUENCE LISTING

<110>  Abeona Therapeutics Inc.
 
<120>  RECOMBINANT ADENO-ASSOCIATED VIRAL VECTORS FOR MULTIPARTITE GENE 
       DELIVERY

<130>  ABEO-007/02WO 337067-2069

<150>  63/179,612
<151>  2021-04-26

<150>  63/051,721
<151>  2020-07-14

<160>  207   

<170>  PatentIn version 3.5

<210>  1
<211>  737
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV110 VP1

<400>  1

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 
            180                 185                 190         


Pro Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
    210                 215                 220                 


Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn 
            260                 265                 270         


His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn 
        435                 440                 445             


Arg Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe 
    450                 455                 460                 


Ser Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu 
465                 470                 475                 480 


Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp 
                485                 490                 495     


Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu 
            500                 505                 510         


Asn Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His 
        515                 520                 525             


Lys Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe 
    530                 535                 540                 


Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met 
545                 550                 555                 560 


Ile Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu 
                565                 570                 575     


Arg Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro 
            580                 585                 590         


Ala Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp 
        595                 600                 605             


Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro 
    610                 615                 620                 


His Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly 
625                 630                 635                 640 


Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro 
                645                 650                 655     


Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile 
            660                 665                 670         


Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu 
        675                 680                 685             


Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser 
    690                 695                 700                 


Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly 
705                 710                 715                 720 


Leu Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro 
                725                 730                 735     


Leu 
    


<210>  2
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV204 VP1

<400>  2

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
            260                 265                 270         


Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
        275                 280                 285             


His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 
    290                 295                 300                 


Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 
305                 310                 315                 320 


Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 
                325                 330                 335     


Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 
            340                 345                 350         


Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 
        355                 360                 365             


Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
    370                 375                 380                 


Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385                 390                 395                 400 


Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
                405                 410                 415     


Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 
            420                 425                 430         


Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 
        435                 440                 445             


Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser 
    450                 455                 460                 


Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 
            500                 505                 510         


Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 
        515                 520                 525             


Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 
    530                 535                 540                 


Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 
545                 550                 555                 560 


Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 
                565                 570                 575     


Phe Gly Thr Val Ala Val Asn Leu Gln Asn Ser Ser Thr Asp Pro Ala 
            580                 585                 590         


Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705                 710                 715                 720 


Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  3
<211>  735
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214 VP1

<400>  3

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
            260                 265                 270         


Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
        275                 280                 285             


His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 
    290                 295                 300                 


Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 
305                 310                 315                 320 


Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn Asn 
                325                 330                 335     


Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu Pro 
            340                 345                 350         


Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 
        355                 360                 365             


Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp Gly 
    370                 375                 380                 


Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385                 390                 395                 400 


Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
                405                 410                 415     


Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 
            420                 425                 430         


Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Lys 
        435                 440                 445             


Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser Gln 
    450                 455                 460                 


Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro Gly 
465                 470                 475                 480 


Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn Asn 
                485                 490                 495     


Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn Gly 
            500                 505                 510         


Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys Glu 
        515                 520                 525             


Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly Lys 
    530                 535                 540                 


Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu Thr 
545                 550                 555                 560 


Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr 
                565                 570                 575     


Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln Ile 
            580                 585                 590         


Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln Asn 
        595                 600                 605             


Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr 
    610                 615                 620                 


Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625                 630                 635                 640 


His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asp 
                645                 650                 655     


Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr Gln 
            660                 665                 670         


Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln Lys 
        675                 680                 685             


Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr 
    690                 695                 700                 


Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val Tyr 
705                 710                 715                 720 


Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735 


<210>  4
<211>  4287
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTRdeltaR

<400>  4
atgcagagaa gccccctgga gaaggcctct gtggtgagca agctgttctt cagctggacc       60

agacccatcc tgagaaaggg ctacagacag agactggagc tgtctgacat ctaccagatc      120

ccctctgtgg actctgctga caacctgtct gagaagctgg agagagagtg ggacagagag      180

ctggccagca agaagaaccc caagctgatc aatgccctga gaagatgctt cttctggaga      240

ttcatgttct atggcatctt cctgtacctg ggggaggtga ccaaggctgt gcagcccctg      300

ctgctgggca gaatcattgc cagctatgac cctgacaaca aggaggagag aagcattgcc      360

atctacctgg gcattggcct gtgcctgctg ttcattgtga gaaccctgct gctgcaccct      420

gccatctttg gcctgcacca cattggcatg cagatgagaa ttgccatgtt cagcctgatc      480

tacaagaaga ccctgaagct gagcagcaga gtgctggaca agatcagcat tggccagctg      540

gtgagcctgc tgagcaacaa cctgaacaag tttgatgagg gcctggccct ggcccacttt      600

gtgtggattg cccccctgca ggtggccctg ctgatgggcc tgatctggga gctgctgcag      660

gcctctgcct tctgtggcct gggcttcctg attgtgctgg ccctgttcca ggctggcctg      720

ggcagaatga tgatgaagta cagagaccag agagctggca agatctctga gagactggtg      780

atcacctctg agatgattga gaacatccag tctgtgaagg cctactgctg ggaggaggcc      840

atggagaaga tgattgagaa cctgagacag acagagctga agctgaccag aaaggctgcc      900

tatgtgagat acttcaacag ctctgccttc ttcttctctg gcttctttgt ggtgttcctg      960

tctgtgctgc cctatgccct gatcaagggc atcatcctga gaaagatctt caccaccatc     1020

agcttctgca ttgtgctgag aatggctgtg accagacagt tcccctgggc tgtgcagacc     1080

tggtatgaca gcctgggggc catcaacaag atccaggact tcctgcagaa gcaggagtac     1140

aagaccctgg agtacaacct gaccaccaca gaggtggtga tggagaatgt gacagccttc     1200

tgggaggagg gctttgggga gctgtttgag aaggccaagc agaacaacaa caacagaaag     1260

accagcaatg gggatgacag cctgttcttc agcaacttca gcctgctggg cacccctgtg     1320

ctgaaggaca tcaacttcaa gattgagaga ggccagctgc tggctgtggc tggcagcaca     1380

ggggctggca agaccagcct gctgatgatg atcatggggg agctggagcc ctctgagggc     1440

aagatcaagc actctggcag aatcagcttc tgcagccagt tcagctggat catgcctggc     1500

accatcaagg agaacatcat ctttggggtg agctatgatg agtacagata cagatctgtg     1560

atcaaggcct gccagctgga ggaggacatc agcaagtttg ctgagaagga caacattgtg     1620

ctgggggagg ggggcatcac cctgtctggg ggccagagag ccagaatcag cctggccaga     1680

gctgtgtaca aggatgctga cctgtacctg ctggacagcc cctttggcta cctggatgtg     1740

ctgacagaga aggagatctt tgagagctgt gtgtgcaagc tgatggccaa caagaccaga     1800

atcctggtga ccagcaagat ggagcacctg aagaaggctg acaagatcct gatcctgcat     1860

gagggcagca gctacttcta tggcaccttc tctgagctgc agaacctgca gcctgacttc     1920

agcagcaagc tgatgggctg tgacagcttt gaccagttct ctgctgagag aagaaacagc     1980

atcctgacag agaccctgca cagattcagc ctggaggggg atgcccctgt gagctggaca     2040

gagaccaaga agcagagctt caagcagaca ggggagtttg gggagaagag aaagaacagc     2100

atcctgaacc ccatcaacag caccctgcag gccagaagaa gacagtctgt gctgaacctg     2160

atgacccact ctgtgaacca gggccagaac atccacagaa agaccacagc cagcaccaga     2220

aaggtgagcc tggcccccca ggccaacctg acagagctgg acatctacag cagaagactg     2280

agccaggaga caggcctgga gatctctgag gagatcaatg aggaggacct gaaggagtgc     2340

ttctttgatg acatggagag catccctgct gtgaccacct ggaacaccta cctgagatac     2400

atcacagtgc acaagagcct gatctttgtg ctgatctggt gcctggtgat cttcctggct     2460

gaggtggctg ccagcctggt ggtgctgtgg ctgctgggca acacccccct gcaggacaag     2520

ggcaacagca cccacagcag aaacaacagc tatgctgtga tcatcaccag caccagcagc     2580

tactatgtgt tctacatcta tgtgggggtg gctgacaccc tgctggccat gggcttcttc     2640

agaggcctgc ccctggtgca caccctgatc acagtgagca agatcctgca ccacaagatg     2700

ctgcactctg tgctgcaggc ccccatgagc accctgaaca ccctgaaggc tgggggcatc     2760

ctgaacagat tcagcaagga cattgccatc ctggatgacc tgctgcccct gaccatcttt     2820

gacttcatcc agctgctgct gattgtgatt ggggccattg ctgtggtggc tgtgctgcag     2880

ccctacatct ttgtggccac agtgcctgtg attgtggcct tcatcatgct gagagcctac     2940

ttcctgcaga ccagccagca gctgaagcag ctggagtctg agggcagaag ccccatcttc     3000

acccacctgg tgaccagcct gaagggcctg tggaccctga gagcctttgg cagacagccc     3060

tactttgaga ccctgttcca caaggccctg aacctgcaca cagccaactg gttcctgtac     3120

ctgagcaccc tgagatggtt ccagatgaga attgagatga tctttgtgat cttcttcatt     3180

gctgtgacct tcatcagcat cctgaccaca ggggaggggg agggcagagt gggcatcatc     3240

ctgaccctgg ccatgaacat catgagcacc ctgcagtggg ctgtgaacag cagcattgat     3300

gtggacagcc tgatgagatc tgtgagcaga gtgttcaagt tcattgacat gcccacagag     3360

ggcaagccca ccaagagcac caagccctac aagaatggcc agctgagcaa ggtgatgatc     3420

attgagaaca gccatgtgaa gaaggatgac atctggccct ctgggggcca gatgacagtg     3480

aaggacctga cagccaagta cacagagggg ggcaatgcca tcctggagaa catcagcttc     3540

agcatcagcc ctggccagag agtgggcctg ctgggcagaa caggctctgg caagagcacc     3600

ctgctgtctg ccttcctgag actgctgaac acagaggggg agatccagat tgatggggtg     3660

agctgggaca gcatcaccct gcagcagtgg agaaaggcct ttggggtgat cccccagaag     3720

gtgttcatct tctctggcac cttcagaaag aacctggacc cctatgagca gtggtctgac     3780

caggagatct ggaaggtggc tgatgaggtg ggcctgagat ctgtgattga gcagttccct     3840

ggcaagctgg actttgtgct ggtggatggg ggctgtgtgc tgagccatgg ccacaagcag     3900

ctgatgtgcc tggccagatc tgtgctgagc aaggccaaga tcctgctgct ggatgagccc     3960

tctgcccacc tggaccctgt gacctaccag atcatcagaa gaaccctgaa gcaggccttt     4020

gctgactgca cagtgatcct gtgtgagcac agaattgagg ccatgctgga gtgccagcag     4080

ttcctggtga ttgaggagaa caaggtgaga cagtatgaca gcatccagaa gctgctgaat     4140

gagagaagcc tgttcagaca ggccatcagc ccctctgaca gagtgaagct gttcccccac     4200

agaaacagca gcaagtgcaa gagcaagccc cagattgctg ccctgaagga ggagaccgag     4260

gaggaggtgc aggacaccag actgtaa                                         4287


<210>  5
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GAA

<400>  5
atgggagtcc gccacccgcc ctgctcacat cgcctgcttg ctgtctgtgc cctcgtgtca       60

cttgctaccg ccgcgctgct tggtcacatt ctgctgcacg actttttact agttccgagg      120

gaactgtcgg gatccagccc cgtgctcgag gaaactcacc ccgcgcacca acagggggcg      180

tccaggccgg gaccgcgcga cgcccaggcc cacccgggcc ggcctcgggc cgtgccaact      240

cagtgcgatg tgccgccgaa ctcccgcttc gactgtgcgc ctgacaaggc cataacccag      300

gaacagtgcg aagcacgcgg ctgctgctat attccggcga agcagggctt gcagggtgcc      360

caaatgggtc agccttggtg cttctttccc ccgtcgtacc cctcgtacaa gctggagaac      420

ctgagcagca gcgaaatggg gtacaccgcc actctgaccc ggacgacccc gaccttcttc      480

ccgaaagaca tcctgaccct gcggctggat gtgatgatgg aaactgagaa cagactgcac      540

ttcactatca aggaccccgc gaaccgcaga tatgaggtgc cactggaaac ccctcatgtg      600

cattcccggg ccccatcccc tctgtactcg gtggaattct ccgaagaacc cttcggggtc      660

attgtgcgcc ggcagcttga tggccgggtc ctgctcaaca ccaccgtggc accccttttc      720

ttcgctgacc agttcctcca gctgagcacc tcgctgccga gccagtacat caccggactg      780

gccgagcacc tctcccctct gatgctgtcc actagctgga ctaggatcac tctgtggaac      840

cgggatctgg cccctacccc gggcgcgaac ctgtacggat cgcacccctt ctacctggcc      900

ctcgaggacg gaggctccgc ccacggagtg ttcctgctga actccaacgc tatggacgtg      960

gtgctccagc cgtcccctgc actgtcctgg cggagcacag ggggtattct ggatgtctac     1020

atcttcctcg gcccggagcc aaagtccgtg gtgcaacagt atctggatgt cgtgggttac     1080

ccattcatgc cgccatactg gggccttggc ttccacctgt gccgctgggg atacagctcc     1140

accgccatca ctagacaggt cgtggaaaac atgactagag cccacttccc cctcgatgtc     1200

cagtggaatg acctggacta catggattcc agacgcgact tcactttcaa caaggatgga     1260

ttcagagatt tccccgctat ggtccaagaa ctgcaccagg gtggccggcg gtacatgatg     1320

attgtggacc ccgccatttc aagctccgga ccagcgggct cgtaccggcc ctacgacgaa     1380

ggtttgcgcc gcggcgtgtt catcactaac gaaaccggcc agccactgat tgggaaggtc     1440

tggcctggaa gcaccgcgtt cccggacttc actaacccaa cggccttggc gtggtgggag     1500

gacatggtgg ccgaattcca cgaccaagtc ccattcgacg gaatgtggat cgacatgaac     1560

gagcccagca acttcatccg aggctccgag gacggctgcc ctaacaacga acttgagaac     1620

cctccgtacg tgcctggcgt cgtcggcgga acactgcagg ccgctacgat ctgtgcctca     1680

tcgcatcagt tcctgtcaac ccactacaac ctccataatc tgtacggcct caccgaagcc     1740

atcgcctccc accgggccct ggtcaaggcc cgggggacta ggcccttcgt gattagccgg     1800

agcactttcg ccggacacgg aagatacgcc ggacattgga ccggcgacgt gtggtcatcg     1860

tgggagcagc tcgcctcctc cgtccccgaa atcctgcagt tcaatctcct gggagtcccc     1920

ctcgtgggcg cggacgtgtg cggattcctg ggcaatacct ctgaggagct gtgcgtgaga     1980

tggacccagc tgggggcgtt ctaccccttc atgcggaacc acaactcact gctgtccctg     2040

cctcaagagc cgtactcatt ctccgagccg gcacaacagg ccatgcgaaa ggctctgacc     2100

ctccgctatg cgctcttgcc ccacctctac actctgtttc accaagccca tgtcgcgggc     2160

gaaacagtgg ccagaccact ctttctggaa ttcccaaagg actcctcaac ctggactgtg     2220

gatcatcagc tgctctgggg agaggcactg ctgatcaccc cggtgctcca agccggaaag     2280

gcggaagtga ccggatactt ccctctcggt acttggtacg acctccaaac cgtgccggtc     2340

gaggccctgg gcagcttgcc tccgccgccg gctgccccgc gggagcctgc aatccactcc     2400

gaggggcaat gggtgaccct ccctgcacca ctggacacca tcaacgtgca cctccgggcc     2460

ggctacatca tcccgctgca aggaccgggt ctgactacca ccgaatcccg gcagcagccc     2520

atggcactgg ccgtggccct gaccaaggga ggggaagcac ggggagaact cttttgggac     2580

gatggagaat ccctggaagt gctcgagcgg ggagcctaca ctcaagtcat ctttcttgcc     2640

cgcaacaaca ccatcgtgaa cgaattggtc cgcgtgacct ccgagggggc cggactccag     2700

ctgcaaaaag tgaccgtgct gggggtggca accgccccgc aacaagtgtt gtctaacgga     2760

gtgccggtgt ccaacttcac ctactcccct gataccaaag ttctagatat ttgcgtgagc     2820

ctgctgatgg gagaacagtt cctggtgtcc tggtgctga                            2859


<210>  6
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GAA codon-optimized nucleotide sequence 1 (GAA 15)

<400>  6
atgggagtcc gccacccgcc ctgctcacat cgcctgcttg ctgtctgtgc cctcgtgtca       60

cttgctaccg ccgcgctgct tggtcacatt ctgctgcacg actttttact agttccgagg      120

gaactgtcgg gatccagccc cgtgctcgag gaaactcacc ccgcgcacca acagggggcg      180

tccaggccgg gaccgcgcga cgcccaggcc cacccgggcc ggcctcgggc cgtgccaact      240

cagtgcgatg tgccgccgaa ctcccgcttc gactgtgcgc ctgacaaggc cataacccag      300

gaacagtgcg aagcacgcgg ctgctgctat attccggcga agcagggctt gcagggtgcc      360

caaatgggtc agccttggtg cttctttccc ccgtcgtacc cctcgtacaa gctggagaac      420

ctgagcagca gcgaaatggg gtacaccgcc actctgaccc ggacgacccc gaccttcttc      480

ccgaaagaca tcctgaccct gcggctggat gtgatgatgg aaactgagaa cagactgcac      540

ttcactatca aggaccccgc gaaccgcaga tatgaggtgc cactggaaac ccctcatgtg      600

cattcccggg ccccatcccc tctgtactcg gtggaattct ccgaagaacc cttcggggtc      660

attgtgcgcc ggcagcttga tggccgggtc ctgctcaaca ccaccgtggc accccttttc      720

ttcgctgacc agttcctcca gctgagcacc tcgctgccga gccagtacat caccggactg      780

gccgagcacc tctcccctct gatgctgtcc actagctgga ctaggatcac tctgtggaac      840

cgggatctgg cccctacccc gggcgcgaac ctgtacggat cgcacccctt ctacctggcc      900

ctcgaggacg gaggctccgc ccacggagtg ttcctgctga actccaacgc tatggacgtg      960

gtgctccagc cgtcccctgc actgtcctgg cggagcacag ggggtattct ggatgtctac     1020

atcttcctcg gcccggagcc aaagtccgtg gtgcaacagt atctggatgt cgtgggttac     1080

ccattcatgc cgccatactg gggccttggc ttccacctgt gccgctgggg atacagctcc     1140

accgccatca ctagacaggt cgtggaaaac atgactagag cccacttccc cctcgatgtc     1200

cagtggaatg acctggacta catggattcc agacgcgact tcactttcaa caaggatgga     1260

ttcagagatt tccccgctat ggtccaagaa ctgcaccagg gtggccggcg gtacatgatg     1320

attgtggacc ccgccatttc aagctccgga ccagcgggct cgtaccggcc ctacgacgaa     1380

ggtttgcgcc gcggcgtgtt catcactaac gaaaccggcc agccactgat tgggaaggtc     1440

tggcctggaa gcaccgcgtt cccggacttc actaacccaa cggccttggc gtggtgggag     1500

gacatggtgg ccgaattcca cgaccaagtc ccattcgacg gaatgtggat cgacatgaac     1560

gagcccagca acttcatccg aggctccgag gacggctgcc ctaacaacga acttgagaac     1620

cctccgtacg tgcctggcgt cgtcggcgga acactgcagg ccgctacgat ctgtgcctca     1680

tcgcatcagt tcctgtcaac ccactacaac ctccataatc tgtacggcct caccgaagcc     1740

atcgcctccc accgggccct ggtcaaggcc cgggggacta ggcccttcgt gattagccgg     1800

agcactttcg ccggacacgg aagatacgcc ggacattgga ccggcgacgt gtggtcatcg     1860

tgggagcagc tcgcctcctc cgtccccgaa atcctgcagt tcaatctcct gggagtcccc     1920

ctcgtgggcg cggacgtgtg cggattcctg ggcaatacct ctgaggagct gtgcgtgaga     1980

tggacccagc tgggggcgtt ctaccccttc atgcggaacc acaactcact gctgtccctg     2040

cctcaagagc cgtactcatt ctccgagccg gcacaacagg ccatgcgaaa ggctctgacc     2100

ctccgctatg cgctcttgcc ccacctctac actctgtttc accaagccca tgtcgcgggc     2160

gaaacagtgg ccagaccact ctttctggaa ttcccaaagg actcctcaac ctggactgtg     2220

gatcatcagc tgctctgggg agaggcactg ctgatcaccc cggtgctcca agccggaaag     2280

gcggaagtga ccggatactt ccctctcggt acttggtacg acctccaaac cgtgccggtc     2340

gaggccctgg gcagcttgcc tccgccgccg gctgccccgc gggagcctgc aatccactcc     2400

gaggggcaat gggtgaccct ccctgcacca ctggacacca tcaacgtgca cctccgggcc     2460

ggctacatca tcccgctgca aggaccgggt ctgactacca ccgaatcccg gcagcagccc     2520

atggcactgg ccgtggccct gaccaaggga ggggaagcac ggggagaact cttttgggac     2580

gatggagaat ccctggaagt gctcgagcgg ggagcctaca ctcaagtcat ctttcttgcc     2640

cgcaacaaca ccatcgtgaa cgaattggtc cgcgtgacct ccgagggggc cggactccag     2700

ctgcaaaaag tgaccgtgct gggggtggca accgccccgc aacaagtgtt gtctaacgga     2760

gtgccggtgt ccaacttcac ctactcccct gataccaaag ttctagatat ttgcgtgagc     2820

ctgctgatgg gagaacagtt cctggtgtcc tggtgctga                            2859


<210>  7
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GAA Codon-optimized 2 (GAA21)

<400>  7
atgggagtta gacaccctcc atgtagccac agactgctgg ccgtgtgtgc tctggtgtct       60

ctggctacag ctgccctgct gggacatatc ctgctgcacg acttcttact agttcccaga      120

gagctgtccg gcagcagccc tgtgctggaa gaaacacacc ctgcacatca gcagggcgcc      180

tctagacctg gacctagaga tgctcaggcc catcctggca gacctagagc tgtgcccaca      240

cagtgtgacg tgccacctaa cagcagattc gactgcgccc ctgacaaggc catcacacaa      300

gagcagtgtg aagccagagg ctgctgctac atccctgcca aacaaggact gcagggcgct      360

cagatgggac agccctggtg cttcttccca ccatcttacc ccagctacaa gctggaaaac      420

ctgagcagca gcgagatggg ctacaccgcc acactgacca gaaccacacc tacattcttc      480

ccgaaggaca tcctgacact gcggctggac gtgatgatgg aaaccgagaa ccggctgcac      540

ttcaccatca aggaccccgc caatcggaga tacgaggtgc cactggaaac ccctcacgtg      600

cactctagag ccccatctcc actgtacagc gtggaattca gcgaggaacc cttcggcgtg      660

atcgtgcgga gacagctgga tggaagagtg ctgctgaaca ccacagtggc ccctctgttc      720

ttcgccgacc agtttctgca gctgtccacc agcctgccta gccagtatat cacaggcctg      780

gccgagcacc tgtctccact gatgctgtct accagctgga cccggatcac cctgtggaac      840

agggatcttg ctcctacacc tggcgccaac ctgtacggct ctcacccttt ttatctggcc      900

ctggaagatg gcggatctgc ccacggtgtc tttctgctga actccaacgc catggacgtg      960

gtgctgcagc catctcctgc tctgtcttgg agaagcacag gcggcatcct ggacgtgtac     1020

atctttctgg gccccgagcc taagagcgtg gtgcagcagt atctggacgt cgtgggctac     1080

cccttcatgc ctccttattg gggcctgggc ttccacctgt gcagatgggg atacagcagc     1140

accgccatca ccagacaggt ggtggaaaac atgacccggg ctcacttccc actggatgtg     1200

cagtggaacg acctggacta catggacagc agacgggact tcaccttcaa caaggacggc     1260

ttcagagact tccccgccat ggtgcaagaa ctgcaccaag gcggcagacg gtacatgatg     1320

atcgtggatc cagccatcag ctctagcggc cctgccggct cttacagacc ttacgatgag     1380

ggcctgagaa gaggcgtgtt catcaccaac gagacaggcc agcctctgat cggcaaagtg     1440

tggcctggca gcacagcctt tccagacttc acaaacccca ccgctctggc ttggtgggaa     1500

gatatggtgg ccgagtttca cgatcaggtg cccttcgacg gcatgtggat cgacatgaac     1560

gagcccagca acttcatccg gggcagcgag gatggctgcc ccaacaacga actggaaaat     1620

cctccttacg tgcccggcgt tgtcggcgga acacttcagg ccgctacaat ctgtgccagc     1680

agccaccagt tcctcagcac ccactacaac ctgcacaatc tgtatggcct gaccgaggcc     1740

attgccagcc atagagccct ggttaaggcc aggggcacca gacctttcgt gatcagcaga     1800

agcaccttcg ccggccacgg cagatatgcc ggacattgga caggcgacgt gtggtctagt     1860

tgggagcagc tggctagcag cgtgccagag atcctgcagt tcaatctgct gggcgtgcca     1920

ctcgtgggag ccgatgtttg tggcttcctg ggcaacacct ccgaggaact gtgtgtgcgt     1980

tggacacagc tgggcgcctt ctatcccttc atgagaaacc acaacagcct tctcagcctg     2040

ccacaagagc cctacagctt ctctgagcct gcacagcagg ccatgagaaa ggccctgact     2100

ctgagatacg ctctgctgcc ccacctgtac accctgtttc accaggctca tgtggccggg     2160

gagacagtgg ctagacctct gttcctggaa ttccccaagg acagctccac ctggaccgtg     2220

gatcatcagc tgctgtgggg agaagccctg ctcatcacac ctgttctgca ggccggaaag     2280

gccgaagtga ccggctattt tcctctcggc acttggtacg acctgcagac cgtgcctgtt     2340

gaggctctgg gatctcttcc tccacctcct gccgctccta gagagcctgc cattcactct     2400

gaaggccagt gggttaccct gcctgctcct ctggacacca tcaacgtgca cctgagagct     2460

ggctacatca tccctctgca aggccctggc ctgacaacca ccgaatctag acagcagccc     2520

atggctctgg ccgtggcttt gacaaaaggc ggagaggcta gaggcgagct gttctgggat     2580

gatggcgaga gcctggaagt gctggaacgg ggcgcttata cccaagtgat cttcctggcc     2640

agaaacaaca ccatcgtgaa cgaactcgtg cgcgtgacca gtgaaggtgc tggactgcaa     2700

ctgcagaaag tgaccgtgct cggagtggcc acagcacctc agcaggttct gtctaatggc     2760

gtgcccgtgt ccaacttcac atacagcccc gacaccaagg tcctggacat ctgtgtgtca     2820

ctgctgatgg gcgagcagtt cctggtgtcc tggtgttga                            2859


<210>  8
<211>  952
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(952)
<223>  Acid Alpha-Glucosidase (GAA)

<400>  8

Met Gly Val Arg His Pro Pro Cys Ser His Arg Leu Leu Ala Val Cys 
1               5                   10                  15      


Ala Leu Val Ser Leu Ala Thr Ala Ala Leu Leu Gly His Ile Leu Leu 
            20                  25                  30          


His Asp Phe Leu Leu Val Pro Arg Glu Leu Ser Gly Ser Ser Pro Val 
        35                  40                  45              


Leu Glu Glu Thr His Pro Ala His Gln Gln Gly Ala Ser Arg Pro Gly 
    50                  55                  60                  


Pro Arg Asp Ala Gln Ala His Pro Gly Arg Pro Arg Ala Val Pro Thr 
65                  70                  75                  80  


Gln Cys Asp Val Pro Pro Asn Ser Arg Phe Asp Cys Ala Pro Asp Lys 
                85                  90                  95      


Ala Ile Thr Gln Glu Gln Cys Glu Ala Arg Gly Cys Cys Tyr Ile Pro 
            100                 105                 110         


Ala Lys Gln Gly Leu Gln Gly Ala Gln Met Gly Gln Pro Trp Cys Phe 
        115                 120                 125             


Phe Pro Pro Ser Tyr Pro Ser Tyr Lys Leu Glu Asn Leu Ser Ser Ser 
    130                 135                 140                 


Glu Met Gly Tyr Thr Ala Thr Leu Thr Arg Thr Thr Pro Thr Phe Phe 
145                 150                 155                 160 


Pro Lys Asp Ile Leu Thr Leu Arg Leu Asp Val Met Met Glu Thr Glu 
                165                 170                 175     


Asn Arg Leu His Phe Thr Ile Lys Asp Pro Ala Asn Arg Arg Tyr Glu 
            180                 185                 190         


Val Pro Leu Glu Thr Pro His Val His Ser Arg Ala Pro Ser Pro Leu 
        195                 200                 205             


Tyr Ser Val Glu Phe Ser Glu Glu Pro Phe Gly Val Ile Val Arg Arg 
    210                 215                 220                 


Gln Leu Asp Gly Arg Val Leu Leu Asn Thr Thr Val Ala Pro Leu Phe 
225                 230                 235                 240 


Phe Ala Asp Gln Phe Leu Gln Leu Ser Thr Ser Leu Pro Ser Gln Tyr 
                245                 250                 255     


Ile Thr Gly Leu Ala Glu His Leu Ser Pro Leu Met Leu Ser Thr Ser 
            260                 265                 270         


Trp Thr Arg Ile Thr Leu Trp Asn Arg Asp Leu Ala Pro Thr Pro Gly 
        275                 280                 285             


Ala Asn Leu Tyr Gly Ser His Pro Phe Tyr Leu Ala Leu Glu Asp Gly 
    290                 295                 300                 


Gly Ser Ala His Gly Val Phe Leu Leu Asn Ser Asn Ala Met Asp Val 
305                 310                 315                 320 


Val Leu Gln Pro Ser Pro Ala Leu Ser Trp Arg Ser Thr Gly Gly Ile 
                325                 330                 335     


Leu Asp Val Tyr Ile Phe Leu Gly Pro Glu Pro Lys Ser Val Val Gln 
            340                 345                 350         


Gln Tyr Leu Asp Val Val Gly Tyr Pro Phe Met Pro Pro Tyr Trp Gly 
        355                 360                 365             


Leu Gly Phe His Leu Cys Arg Trp Gly Tyr Ser Ser Thr Ala Ile Thr 
    370                 375                 380                 


Arg Gln Val Val Glu Asn Met Thr Arg Ala His Phe Pro Leu Asp Val 
385                 390                 395                 400 


Gln Trp Asn Asp Leu Asp Tyr Met Asp Ser Arg Arg Asp Phe Thr Phe 
                405                 410                 415     


Asn Lys Asp Gly Phe Arg Asp Phe Pro Ala Met Val Gln Glu Leu His 
            420                 425                 430         


Gln Gly Gly Arg Arg Tyr Met Met Ile Val Asp Pro Ala Ile Ser Ser 
        435                 440                 445             


Ser Gly Pro Ala Gly Ser Tyr Arg Pro Tyr Asp Glu Gly Leu Arg Arg 
    450                 455                 460                 


Gly Val Phe Ile Thr Asn Glu Thr Gly Gln Pro Leu Ile Gly Lys Val 
465                 470                 475                 480 


Trp Pro Gly Ser Thr Ala Phe Pro Asp Phe Thr Asn Pro Thr Ala Leu 
                485                 490                 495     


Ala Trp Trp Glu Asp Met Val Ala Glu Phe His Asp Gln Val Pro Phe 
            500                 505                 510         


Asp Gly Met Trp Ile Asp Met Asn Glu Pro Ser Asn Phe Ile Arg Gly 
        515                 520                 525             


Ser Glu Asp Gly Cys Pro Asn Asn Glu Leu Glu Asn Pro Pro Tyr Val 
    530                 535                 540                 


Pro Gly Val Val Gly Gly Thr Leu Gln Ala Ala Thr Ile Cys Ala Ser 
545                 550                 555                 560 


Ser His Gln Phe Leu Ser Thr His Tyr Asn Leu His Asn Leu Tyr Gly 
                565                 570                 575     


Leu Thr Glu Ala Ile Ala Ser His Arg Ala Leu Val Lys Ala Arg Gly 
            580                 585                 590         


Thr Arg Pro Phe Val Ile Ser Arg Ser Thr Phe Ala Gly His Gly Arg 
        595                 600                 605             


Tyr Ala Gly His Trp Thr Gly Asp Val Trp Ser Ser Trp Glu Gln Leu 
    610                 615                 620                 


Ala Ser Ser Val Pro Glu Ile Leu Gln Phe Asn Leu Leu Gly Val Pro 
625                 630                 635                 640 


Leu Val Gly Ala Asp Val Cys Gly Phe Leu Gly Asn Thr Ser Glu Glu 
                645                 650                 655     


Leu Cys Val Arg Trp Thr Gln Leu Gly Ala Phe Tyr Pro Phe Met Arg 
            660                 665                 670         


Asn His Asn Ser Leu Leu Ser Leu Pro Gln Glu Pro Tyr Ser Phe Ser 
        675                 680                 685             


Glu Pro Ala Gln Gln Ala Met Arg Lys Ala Leu Thr Leu Arg Tyr Ala 
    690                 695                 700                 


Leu Leu Pro His Leu Tyr Thr Leu Phe His Gln Ala His Val Ala Gly 
705                 710                 715                 720 


Glu Thr Val Ala Arg Pro Leu Phe Leu Glu Phe Pro Lys Asp Ser Ser 
                725                 730                 735     


Thr Trp Thr Val Asp His Gln Leu Leu Trp Gly Glu Ala Leu Leu Ile 
            740                 745                 750         


Thr Pro Val Leu Gln Ala Gly Lys Ala Glu Val Thr Gly Tyr Phe Pro 
        755                 760                 765             


Leu Gly Thr Trp Tyr Asp Leu Gln Thr Val Pro Val Glu Ala Leu Gly 
    770                 775                 780                 


Ser Leu Pro Pro Pro Pro Ala Ala Pro Arg Glu Pro Ala Ile His Ser 
785                 790                 795                 800 


Glu Gly Gln Trp Val Thr Leu Pro Ala Pro Leu Asp Thr Ile Asn Val 
                805                 810                 815     


His Leu Arg Ala Gly Tyr Ile Ile Pro Leu Gln Gly Pro Gly Leu Thr 
            820                 825                 830         


Thr Thr Glu Ser Arg Gln Gln Pro Met Ala Leu Ala Val Ala Leu Thr 
        835                 840                 845             


Lys Gly Gly Glu Ala Arg Gly Glu Leu Phe Trp Asp Asp Gly Glu Ser 
    850                 855                 860                 


Leu Glu Val Leu Glu Arg Gly Ala Tyr Thr Gln Val Ile Phe Leu Ala 
865                 870                 875                 880 


Arg Asn Asn Thr Ile Val Asn Glu Leu Val Arg Val Thr Ser Glu Gly 
                885                 890                 895     


Ala Gly Leu Gln Leu Gln Lys Val Thr Val Leu Gly Val Ala Thr Ala 
            900                 905                 910         


Pro Gln Gln Val Leu Ser Asn Gly Val Pro Val Ser Asn Phe Thr Tyr 
        915                 920                 925             


Ser Pro Asp Thr Lys Val Leu Asp Ile Cys Val Ser Leu Leu Met Gly 
    930                 935                 940                 


Glu Gln Phe Leu Val Ser Trp Cys 
945                 950         


<210>  9
<211>  1290
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GLA

<400>  9
atgcagctga ggaacccaga actacatctg ggctgcgcgc ttgcgcttcg cttcctggcc       60

ctcgtttcct gggacatccc tggggctaga gcactggaca atggattggc aaggacgcct      120

accatgggct ggctgcactg ggagcgcttc atgtgcaacc ttgactgcca ggaagagcca      180

gattcctgca tcagtgagaa gctcttcatg gagatggcag agctcatggt ctcagaaggc      240

tggaaggatg caggttatga gtacctctgc attgatgact gttggatggc tccccaaaga      300

gattcagaag gcagacttca ggcagaccct cagcgctttc ctcatgggat tcgccagcta      360

gctaattatg ttcacagcaa aggactgaag ctagggattt atgcagatgt tggaaataaa      420

acctgcgcag gcttccctgg gagttttgga tactacgaca ttgatgccca gacctttgct      480

gactggggag tagatctgct aaaatttgat ggttgttact gtgacagttt ggaaaatttg      540

gcagatggtt ataagcacat gtccttggcc ctgaatagga ctggcagaag cattgtgtac      600

tcctgtgagt ggcctcttta tatgtggccc tttcaaaagc ccaattatac agaaatccga      660

cagtactgca atcactggcg aaattttgct gacattgatg attcctggaa aagtataaag      720

agtatcttgg actggacatc ttttaaccag gagagaattg ttgatgttgc tggaccaggg      780

ggttggaatg acccagatat gttagtgatt ggcaactttg gcctcagctg gaatcagcaa      840

gtaactcaga tggccctctg ggctatcatg gctgctcctt tattcatgtc taatgacctc      900

cgacacatca gccctcaagc caaagctctc cttcaggata aggacgtaat tgccatcaat      960

caggacccct tgggcaagca agggtaccag cttagacagg gagacaactt tgaagtgtgg     1020

gaacgacctc tctcaggctt agcctgggct gtagctatga taaaccggca ggagattggt     1080

ggacctcgct cttataccat cgcagttgct tccctgggta aaggagtggc ctgtaatcct     1140

gcctgcttca tcacacagct cctccctgtg aaaaggaagc tagggttcta tgaatggact     1200

tcaaggttaa gaagtcacat aaatcccaca ggcactgttt tgcttcagct agaaaataca     1260

atgcagatgt cattaaaaga cttactttaa                                      1290


<210>  10
<211>  1290
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GLA codon-optimized

<400>  10
atgcagctga gaaatcctga actgcacctg ggctgtgccc tggctctgag atttctggct       60

ctggtgtcct gggacattcc tggcgctaga gccctggata atggcctggc cagaacacct      120

acaatgggct ggctgcactg ggagagattc atgtgcaacc tggactgcca agaggaaccc      180

gacagctgca tcagcgagaa gctgttcatg gaaatggccg agctgatggt gtccgaaggc      240

tggaaggatg ccggctacga gtacctgtgc atcgacgatt gctggatggc ccctcagaga      300

gattctgagg gcagactgca ggccgatcct cagagatttc ctcacggaat ccggcagctg      360

gccaactacg tgcactctaa gggactgaag ctgggcatct acgccgacgt gggcaacaag      420

acatgtgccg gctttccagg cagcttcggc tactacgata tcgacgccca gacctttgcc      480

gattggggcg tcgacctgct gaagttcgat ggctgctact gcgacagcct ggaaaacctg      540

gccgacggct acaaacacat gtctctggcc ctgaaccgga ccggcagatc tatcgtgtac      600

tcttgcgagt ggcccctgta catgtggccc ttccagaagc ctaactacac cgagatcaga      660

cagtactgca accactggcg gaacttcgcc gacatcgatg acagctggaa gtccatcaag      720

agcatcctgg actggaccag cttcaatcaa gagcggatcg tggatgtggc tggcccaggc      780

ggatggaacg atcctgatat gctggtcatc ggcaacttcg gcctgagctg gaatcagcaa      840

gtgacccaga tggccctgtg ggccattatg gccgctcctc tgttcatgag caacgacctg      900

agacacatca gccctcaggc caaggctctg ctgcaggata aggacgtgat cgccatcaac      960

caggatcctc tgggcaagca gggctatcag ctgagacagg gcgacaattt cgaagtgtgg     1020

gaaagacctc tgagcggcct ggcttgggcc gtcgccatga tcaatagaca agagatcggc     1080

ggaccccggt cctatacaat tgccgtggct tctctcggaa aaggcgtggc ctgcaatcct     1140

gcctgcttta tcacacagct gctccccgtg aagagaaagc tgggctttta cgagtggacc     1200

agcagactga gatcccacat caaccccaca ggcactgttc tgctgcaact ggaaaacaca     1260

atgcagatga gcctgaagga cctgctgtag                                      1290


<210>  11
<211>  429
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GLA

<400>  11

Met Gln Leu Arg Asn Pro Glu Leu His Leu Gly Cys Ala Leu Ala Leu 
1               5                   10                  15      


Arg Phe Leu Ala Leu Val Ser Trp Asp Ile Pro Gly Ala Arg Ala Leu 
            20                  25                  30          


Asp Asn Gly Leu Ala Arg Thr Pro Thr Met Gly Trp Leu His Trp Glu 
        35                  40                  45              


Arg Phe Met Cys Asn Leu Asp Cys Gln Glu Glu Pro Asp Ser Cys Ile 
    50                  55                  60                  


Ser Glu Lys Leu Phe Met Glu Met Ala Glu Leu Met Val Ser Glu Gly 
65                  70                  75                  80  


Trp Lys Asp Ala Gly Tyr Glu Tyr Leu Cys Ile Asp Asp Cys Trp Met 
                85                  90                  95      


Ala Pro Gln Arg Asp Ser Glu Gly Arg Leu Gln Ala Asp Pro Gln Arg 
            100                 105                 110         


Phe Pro His Gly Ile Arg Gln Leu Ala Asn Tyr Val His Ser Lys Gly 
        115                 120                 125             


Leu Lys Leu Gly Ile Tyr Ala Asp Val Gly Asn Lys Thr Cys Ala Gly 
    130                 135                 140                 


Phe Pro Gly Ser Phe Gly Tyr Tyr Asp Ile Asp Ala Gln Thr Phe Ala 
145                 150                 155                 160 


Asp Trp Gly Val Asp Leu Leu Lys Phe Asp Gly Cys Tyr Cys Asp Ser 
                165                 170                 175     


Leu Glu Asn Leu Ala Asp Gly Tyr Lys His Met Ser Leu Ala Leu Asn 
            180                 185                 190         


Arg Thr Gly Arg Ser Ile Val Tyr Ser Cys Glu Trp Pro Leu Tyr Met 
        195                 200                 205             


Trp Pro Phe Gln Lys Pro Asn Tyr Thr Glu Ile Arg Gln Tyr Cys Asn 
    210                 215                 220                 


His Trp Arg Asn Phe Ala Asp Ile Asp Asp Ser Trp Lys Ser Ile Lys 
225                 230                 235                 240 


Ser Ile Leu Asp Trp Thr Ser Phe Asn Gln Glu Arg Ile Val Asp Val 
                245                 250                 255     


Ala Gly Pro Gly Gly Trp Asn Asp Pro Asp Met Leu Val Ile Gly Asn 
            260                 265                 270         


Phe Gly Leu Ser Trp Asn Gln Gln Val Thr Gln Met Ala Leu Trp Ala 
        275                 280                 285             


Ile Met Ala Ala Pro Leu Phe Met Ser Asn Asp Leu Arg His Ile Ser 
    290                 295                 300                 


Pro Gln Ala Lys Ala Leu Leu Gln Asp Lys Asp Val Ile Ala Ile Asn 
305                 310                 315                 320 


Gln Asp Pro Leu Gly Lys Gln Gly Tyr Gln Leu Arg Gln Gly Asp Asn 
                325                 330                 335     


Phe Glu Val Trp Glu Arg Pro Leu Ser Gly Leu Ala Trp Ala Val Ala 
            340                 345                 350         


Met Ile Asn Arg Gln Glu Ile Gly Gly Pro Arg Ser Tyr Thr Ile Ala 
        355                 360                 365             


Val Ala Ser Leu Gly Lys Gly Val Ala Cys Asn Pro Ala Cys Phe Ile 
    370                 375                 380                 


Thr Gln Leu Leu Pro Val Lys Arg Lys Leu Gly Phe Tyr Glu Trp Thr 
385                 390                 395                 400 


Ser Arg Leu Arg Ser His Ile Asn Pro Thr Gly Thr Val Leu Leu Gln 
                405                 410                 415     


Leu Glu Asn Thr Met Gln Met Ser Leu Lys Asp Leu Leu 
            420                 425                 


<210>  12
<211>  1317
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CLN3

<400>  12
atgggaggct gtgcaggctc gcggcggcgc ttttcggatt ccgaggggga ggagaccgtc       60

ccggagcccc ggctccctct gttggaccat cagggcgcgc attggaagaa cgcggtgggc      120

ttctggctgc tgggcctttg caacaacttc tcttatgtgg tgatgctgag tgccgcccac      180

gacatcctta gccacaagag gacatcggga aaccagagcc atgtggaccc aggcccaacg      240

ccgatccccc acaacagctc atcacgattt gactgcaact ctgtctctac ggctgctgtg      300

ctcctggcgg acatcctccc cacactcgtc atcaaattgt tggctcctct tggccttcac      360

ctgctgccct acagcccccg ggttctcgtc agtgggattt gtgctgctgg aagcttcgtc      420

ctggttgcct tttctcattc tgtggggacc agcctgtgtg gtgtggtctt cgctagcatc      480

tcatcaggcc ttggggaggt caccttcctc tccctcactg ccttctaccc cagggccgtg      540

atctcctggt ggtcctcagg gactggggga gctgggctgc tgggggccct gtcctacctg      600

ggcctcaccc aggccggcct ctcccctcag cagaccctgc tgtccatgct gggtatccct      660

gccctgctgc tggccagcta tttcttgttg ctcacatctc ctgaggccca ggaccctgga      720

ggggaagaag aagcagagag cgcagcccgg cagcccctca taagaaccga ggccccggag      780

tcgaagccag gctccagctc cagcctctcc cttcgggaaa ggtggacagt gttcaagggt      840

ctgctgtggt acattgttcc cttggtcgta gtttactttg ccgagtattt cattaaccag      900

ggactttttg aactcctctt tttctggaac acttccctga gtcacgctca gcaataccgc      960

tggtaccaga tgctgtacca ggctggcgtc tttgcctccc gctcttctct ccgctgctgt     1020

cgcatccgtt tcacctgggc cctggccctg ctgcagtgcc tcaacctggt gttcctgctg     1080

gcagacgtgt ggttcggctt tctgccaagc atctacctcg tcttcctgat cattctgtat     1140

gaggggctcc tgggaggcgc agcctacgtg aacaccttcc acaacatcgc cctggagacc     1200

agtgatgagc accgggagtt tgcaatggcg gccacctgca tctctgacac actggggatc     1260

tccctgtcgg ggctcctggc tttgcctctg catgacttcc tctgccagct ctcctga        1317


<210>  13
<211>  1317
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CLN3 codon-optimized

<400>  13
atgggaggat gtgctgggtc aagaagacgg tttagcgatt ccgaaggaga ggagactgtg       60

cctgagccaa gactgcccct gctggatcac cagggagcac actggaagaa cgcagtggga      120

ttctggctgc tgggcctgtg caacaacttc agctacgtgg tcatgctgtc cgccgcccac      180

gacatcctgt cccacaagcg gacctccggc aatcagtctc acgtggaccc cggccctaca      240

ccaatccccc acaacagcag cagccggttc gactgtaatt ccgtgtctac cgcagccgtg      300

ctgctggcag acatcctgcc caccctggtc atcaagctgc tggcaccact gggcctgcac      360

ctgctgcctt attctccaag ggtgctggtg agcggcatct gcgcagcagg cagcttcgtg      420

ctggtggcct ttagccactc cgtgggcacc tctctgtgcg gagtggtgtt tgcaagcatc      480

agctccggcc tgggagaggt gaccttcctg agcctgacag ccttttaccc tcgcgccgtg      540

atctcctggt ggtctagcgg cacaggagga gcaggcctgc tgggcgccct gtcctatctg      600

ggcctgaccc aggcaggcct gtccccacag cagacactgc tgtctatgct gggcatccct      660

gccctgctgc tggcaagcta cttcctgctg ctgacctccc cagaggcaca ggaccccgga      720

ggagaggagg aggccgagag cgccgcaagg cagccactga tcaggaccga ggcaccagag      780

tccaagcctg gctcctctag ctccctgtct ctgcgggaga gatggacagt gttcaagggc      840

ctgctgtggt acatcgtgcc cctggtggtg gtgtacttcg ccgagtactt catcaaccag      900

ggcctgtttg agctgctgtt cttttggaat acctctctga gccacgccca gcagtaccgg      960

tggtatcaga tgctgtatca ggcaggcgtg ttcgcctccc ggtctagcct gagatgctgt     1020

cggatcagat tcacctgggc actggccctg ctgcagtgcc tgaacctggt gttcctgctg     1080

gccgacgtgt ggttcggctt tctgccctct atctacctgg tgtttctgat catcctgtat     1140

gagggcctgc tgggaggagc agcctatgtg aacaccttcc acaatatcgc cctggagaca     1200

tctgacgagc acagagagtt tgctatggcc gccacctgta tcagcgatac actgggcatc     1260

tctctgagcg gactgctggc tctgcctctg catgactttc tgtgccagct gagttaa        1317


<210>  14
<211>  438
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(438)
<223>  Ceroid Lipofuscinosis, Neuronal 3 (CLN3)

<400>  14

Met Gly Gly Cys Ala Gly Ser Arg Arg Arg Phe Ser Asp Ser Glu Gly 
1               5                   10                  15      


Glu Glu Thr Val Pro Glu Pro Arg Leu Pro Leu Leu Asp His Gln Gly 
            20                  25                  30          


Ala His Trp Lys Asn Ala Val Gly Phe Trp Leu Leu Gly Leu Cys Asn 
        35                  40                  45              


Asn Phe Ser Tyr Val Val Met Leu Ser Ala Ala His Asp Ile Leu Ser 
    50                  55                  60                  


His Lys Arg Thr Ser Gly Asn Gln Ser His Val Asp Pro Gly Pro Thr 
65                  70                  75                  80  


Pro Ile Pro His Asn Ser Ser Ser Arg Phe Asp Cys Asn Ser Val Ser 
                85                  90                  95      


Thr Ala Ala Val Leu Leu Ala Asp Ile Leu Pro Thr Leu Val Ile Lys 
            100                 105                 110         


Leu Leu Ala Pro Leu Gly Leu His Leu Leu Pro Tyr Ser Pro Arg Val 
        115                 120                 125             


Leu Val Ser Gly Ile Cys Ala Ala Gly Ser Phe Val Leu Val Ala Phe 
    130                 135                 140                 


Ser His Ser Val Gly Thr Ser Leu Cys Gly Val Val Phe Ala Ser Ile 
145                 150                 155                 160 


Ser Ser Gly Leu Gly Glu Val Thr Phe Leu Ser Leu Thr Ala Phe Tyr 
                165                 170                 175     


Pro Arg Ala Val Ile Ser Trp Trp Ser Ser Gly Thr Gly Gly Ala Gly 
            180                 185                 190         


Leu Leu Gly Ala Leu Ser Tyr Leu Gly Leu Thr Gln Ala Gly Leu Ser 
        195                 200                 205             


Pro Gln Gln Thr Leu Leu Ser Met Leu Gly Ile Pro Ala Leu Leu Leu 
    210                 215                 220                 


Ala Ser Tyr Phe Leu Leu Leu Thr Ser Pro Glu Ala Gln Asp Pro Gly 
225                 230                 235                 240 


Gly Glu Glu Glu Ala Glu Ser Ala Ala Arg Gln Pro Leu Ile Arg Thr 
                245                 250                 255     


Glu Ala Pro Glu Ser Lys Pro Gly Ser Ser Ser Ser Leu Ser Leu Arg 
            260                 265                 270         


Glu Arg Trp Thr Val Phe Lys Gly Leu Leu Trp Tyr Ile Val Pro Leu 
        275                 280                 285             


Val Val Val Tyr Phe Ala Glu Tyr Phe Ile Asn Gln Gly Leu Phe Glu 
    290                 295                 300                 


Leu Leu Phe Phe Trp Asn Thr Ser Leu Ser His Ala Gln Gln Tyr Arg 
305                 310                 315                 320 


Trp Tyr Gln Met Leu Tyr Gln Ala Gly Val Phe Ala Ser Arg Ser Ser 
                325                 330                 335     


Leu Arg Cys Cys Arg Ile Arg Phe Thr Trp Ala Leu Ala Leu Leu Gln 
            340                 345                 350         


Cys Leu Asn Leu Val Phe Leu Leu Ala Asp Val Trp Phe Gly Phe Leu 
        355                 360                 365             


Pro Ser Ile Tyr Leu Val Phe Leu Ile Ile Leu Tyr Glu Gly Leu Leu 
    370                 375                 380                 


Gly Gly Ala Ala Tyr Val Asn Thr Phe His Asn Ile Ala Leu Glu Thr 
385                 390                 395                 400 


Ser Asp Glu His Arg Glu Phe Ala Met Ala Ala Thr Cys Ile Ser Asp 
                405                 410                 415     


Thr Leu Gly Ile Ser Leu Ser Gly Leu Leu Ala Leu Pro Leu His Asp 
            420                 425                 430         


Phe Leu Cys Gln Leu Ser 
        435             


<210>  15
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV204 VP1

<400>  15
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcacca caagagccag actcctcctc gggcatcggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc      900

atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa      960

gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg     1020

gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag     1080

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg     1140

ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca     1200

tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct     1260

ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac     1320

cagtacctgt attacctgaa cagaactcag aatcagtccg gaagtgccca aaacaaggac     1380

ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct     1440

ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac     1500

tttacctgga caggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct     1560

ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc     1620

atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc     1680

acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg     1740

gcagtcaatc tccagaacag cagcacagac cctgcgaccg gagatgtgca tgttatggga     1800

gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc     1860

aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt     1920

aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca     1980

gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc     2040

gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag     2100

tatacatcta actatgcaaa atctgccaac gttgatttca ctgtagacaa caatggactt     2160

tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  16
<211>  1605
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV204 VP3

<400>  16
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgaa catgggcctt gcccacctat aacaaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgatttca acagattcca ctgccatttc tcaccacgtg actggcagcg actcatcaac      300

aacaattggg gattccggcc caagagactc aacttcaagc tcttcaacat ccaagtcaag      360

gaggtcacga cgaatgatgg cgtcacgacc atcgctaata accttaccag cacggttcaa      420

gtcttctcgg actcggagta ccagttgccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

aatggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaataa ctttaccttc agctacacct tcgaggacgt gcctttccac      660

agcagctacg cgcacagcca gagcctggac cggctgatga atcctctcat cgaccagtac      720

ctgtattacc tgaacagaac tcagaatcag tccggaagtg cccaaaacaa ggacttgctg      780

tttagccggg ggtctccagc tggcatgtct gttcagccca aaaactggct acctggaccc      840

tgttaccggc agcagcgcgt ttctaaaaca aaaacagaca acaacaacag caactttacc      900

tggacaggtg cttcaaaata taaccttaat gggcgtgaat ctataatcaa ccctggcact      960

gctatggcct cacacaaaga cgacaaagac aagttctttc ccatgagcgg tgtcatgatt     1020

tttggaaagg agagcgccgg agcttcaaac actgcattgg acaatgtcat gatcacagac     1080

gaagaggaaa tcaaagccac taaccccgtg gccaccgaaa gatttgggac tgtggcagtc     1140

aatctccaga acagcagcac agaccctgcg accggagatg tgcatgttat gggagcctta     1200

cctggaatgg tgtggcaaga cagagacgta tacctgcagg gtcctatttg ggccaaaatt     1260

cctcacacgg atggacactt tcacccgtct cctctcatgg gcggctttgg acttaagcac     1320

ccgcctcctc agatcctcat caaaaacacg cctgttcctg cgaatcctcc ggcagagttt     1380

tcggctacaa agtttgcttc attcatcacc cagtattcca caggacaagt gagcgtggag     1440

attgaatggg agctgcagaa agaaaacagc aaacgctgga atcccgaagt gcagtataca     1500

tctaactatg caaaatctgc caacgttgat ttcactgtag acaacaatgg actttatact     1560

gagcctcgcc ccattggcac ccgttacctc acccgtcccc tgtaa                     1605


<210>  17
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV204 VP3

<400>  17

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Thr Asn Asp Gly Val 
        115                 120                 125             


Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Ser Asp 
    130                 135                 140                 


Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Asn Arg Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn 
                245                 250                 255     


Lys Asp Leu Leu Phe Ser Arg Gly Ser Pro Ala Gly Met Ser Val Gln 
            260                 265                 270         


Pro Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Lys Thr Lys Thr Asp Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala 
    290                 295                 300                 


Ser Lys Tyr Asn Leu Asn Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Asp Asp Lys Asp Lys Phe Phe Pro Met Ser 
                325                 330                 335     


Gly Val Met Ile Phe Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala 
            340                 345                 350         


Leu Asp Asn Val Met Ile Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Arg Phe Gly Thr Val Ala Val Asn Leu Gln Asn 
    370                 375                 380                 


Ser Ser Thr Asp Pro Ala Thr Gly Asp Val His Val Met Gly Ala Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly His Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys 
    450                 455                 460                 


Phe Ala Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Val Gln Tyr Thr Ser Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr 
            500                 505                 510         


Val Asp Asn Asn Gly Leu Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Pro Leu 
    530                 


<210>  18
<211>  2208
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ITB102 214 (AAV214) VP1

<400>  18
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg acttcaacag attccactgc cacttttcac cacgtgactg gcaaagactc      900

atcaacaaca actggggatt ccgacccaag agactcaact tcaagctctt taacattcaa      960

gtcaaagagg ttacggacaa caatggagtc aagaccatcg ccaataacct taccagcacg     1020

gtccaggtct tcacggactc agactatcag ctcccgtacg tcctcggctc tgcgcaccag     1080

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg     1140

ctcaacgacg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca     1200

tcgcagatgc tgagaacggg caacaacttt accttcagct acacctttga ggacgttcct     1260

ttccacagca gctacgctca cagccagagt ctggaccgtc tcatgaatcc tctgattgac     1320

cagtacctgt actacttgtc taagactatc aacggatccg gccagaatca gcagactctg     1380

aagttcagcc aaggtgggcc taatacaatg gccaatcagg caaagaactg gctgccagga     1440

ccctgttacc gccaacaacg cgtctcaacg acaaccgggc aaaacaacaa tagcaacttt     1500

gcctggactg ctgggaccaa ataccatctg aatggaagaa attcattgat gaatcctggc     1560

cccgctatgg catcccacaa agagggcgag gaccgttttt ttcccctgtc cgggtccctg     1620

atttttggca aacaaaatgc tgccagagac aatgcggatt acagcgatgt catgctcacc     1680

agcgaggaag aaatcaaaac cactaaccct gtggctacag aggaatacgg tatcgtggca     1740

gataacttgc agcagcaaaa cacggctcct caaattggaa ctgtcaacag ccagggggcc     1800

ttacccggta tggtctggca gaaccgggac gtgtacctgc agggtcccat ctgggccaag     1860

attcctcaca cggacggcaa cttccacccg tctccgctga tgggcggctt tggcctgaaa     1920

catcctccgc ctcagatcct gatcaagaac acgcctgtac ctgcggatcc tccgaccacc     1980

ttcaaccagt caaagctgaa ctctttcatc acgcaataca gcaccggaca ggtcagcgtg     2040

gaaattgaat gggagctgca gaaggaaaac agcaagcgct ggaaccccga gatccagtac     2100

acctccaact actacaaatc tacaagtgtg gactttgctg ttaatacaga aggcgtgtac     2160

tctgaacccc accccattgg cacccgttac ctcacccgtc ccctgtaa                  2208


<210>  19
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214-A VP1

<400>  19
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccactttt caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaagct ctttaacatt      960

caagtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaacg acggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggacgtt     1260

cctttccaca gcagctacgc tcacagccag agtctggacc gtctcatgaa tcctctgatt     1320

gaccagtacc tgtactactt gtctaagact atcaacggat ccggccagaa tcagcagact     1380

ctgaagttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaaaa tgctgccaga gacaatgcgg attacagcga tgtcatgctc     1680

accagcgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  20
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e VP1

<400>  20
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagtcc ccgacccaca acctctcgga gaacctccag caacccccgc tgctgtggga      600

cctactacaa tggcttcagg cggtggcgca ccaatggcag acaataacga aggcgccgac      660

ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgcac ctgggccttg cccacctaca ataaccacct ctacaagcaa      780

atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccactttt caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaagct ctttaacatt      960

caagtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaacg acggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggacgtt     1260

cctttccaca gcagctacgc tcacagccag agtctggacc gtctcatgaa tcctctgatt     1320

gaccagtacc tgtactactt gtctaagact atcaacggat ccggccagaa tcagcagact     1380

ctgaagttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaaaa tgctgccaga gacaatgcgg attacagcga tgtcatgctc     1680

accagcgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  21
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e8 VP1

<400>  21
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagttc cagaccctca acctctcgga gaacctccag cagcgccctc tggtgtggga      600

cctaatacaa tggcttcagg cggtggcgca ccaatggcgg acaataacga aggcgccgac      660

ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgcac ctgggccttg cccacctaca ataaccacct ctacaagcaa      780

atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccactttt caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaagct ctttaacatt      960

caagtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaacg acggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggacgtt     1260

cctttccaca gcagctacgc tcacagccag agtctggacc gtctcatgaa tcctctgatt     1320

gaccagtacc tgtactactt gtctaagact atcaacggat ccggccagaa tcagcagact     1380

ctgaagttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaaaa tgctgccaga gacaatgcgg attacagcga tgtcatgctc     1680

accagcgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  22
<211>  2208
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e9 VP1

<400>  22
atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga aggaattcgc       60

gagtggtggg ctttgaaacc tggagcccct caacccaagg caaatcaaca acatcaagac      120

aacgctcgag gtcttgtgct tccgggttac aaataccttg gacccggcaa cggactcgac      180

aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctccc caggaaccgg actcctccgc gggtattggc      480

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag      540

tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctctgg tgtgggatct      600

cttacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg acttcaacag attccactgc cacttttcac cacgtgactg gcaaagactc      900

atcaacaaca actggggatt ccgacccaag agactcaact tcaagctctt taacattcaa      960

gtcaaagagg ttacggacaa caatggagtc aagaccatcg ccaataacct taccagcacg     1020

gtccaggtct tcacggactc agactatcag ctcccgtacg tcctcggctc tgcgcaccag     1080

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg     1140

ctcaacgacg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca     1200

tcgcagatgc tgagaacggg caacaacttt accttcagct acacctttga ggacgttcct     1260

ttccacagca gctacgctca cagccagagt ctggaccgtc tcatgaatcc tctgattgac     1320

cagtacctgt actacttgtc taagactatc aacggatccg gccagaatca gcagactctg     1380

aagttcagcc aaggtgggcc taatacaatg gccaatcagg caaagaactg gctgccagga     1440

ccctgttacc gccaacaacg cgtctcaacg acaaccgggc aaaacaacaa tagcaacttt     1500

gcctggactg ctgggaccaa ataccatctg aatggaagaa attcattgat gaatcctggc     1560

cccgctatgg catcccacaa agagggcgag gaccgttttt ttcccctgtc cgggtccctg     1620

atttttggca aacaaaatgc tgccagagac aatgcggatt acagcgatgt catgctcacc     1680

agcgaggaag aaatcaaaac cactaaccct gtggctacag aggaatacgg tatcgtggca     1740

gataacttgc agcagcaaaa cacggctcct caaattggaa ctgtcaacag ccagggggcc     1800

ttacccggta tggtctggca gaaccgggac gtgtacctgc agggtcccat ctgggccaag     1860

attcctcaca cggacggcaa cttccacccg tctccgctga tgggcggctt tggcctgaaa     1920

catcctccgc ctcagatcct gatcaagaac acgcctgtac ctgcggatcc tccgaccacc     1980

ttcaaccagt caaagctgaa ctctttcatc acgcaataca gcaccggaca ggtcagcgtg     2040

gaaattgaat gggagctgca gaaggaaaac agcaagcgct ggaaccccga gatccagtac     2100

acctccaact actacaaatc tacaagtgtg gactttgctg ttaatacaga aggcgtgtac     2160

tctgaacccc accccattgg cacccgttac ctcacccgtc ccctgtaa                  2208


<210>  23
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e10 VP1

<400>  23
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccagcagcc cgcgaaaaag agactcaact ttgggcagac tggcgactca      540

gagtcagtgc ccgaccctca accaatcgga gaaccccccg caggcccctc tggtctggga      600

tctggtacaa tggcttcagg cggtggcgca ccaatggcgg acaataacga aggcgccgac      660

ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgcac ctgggccttg cccacctaca ataaccacct ctacaagcaa      780

atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccactttt caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaagct ctttaacatt      960

caagtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaacg acggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggacgtt     1260

cctttccaca gcagctacgc tcacagccag agtctggacc gtctcatgaa tcctctgatt     1320

gaccagtacc tgtactactt gtctaagact atcaacggat ccggccagaa tcagcagact     1380

ctgaagttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaaaa tgctgccaga gacaatgcgg attacagcga tgtcatgctc     1680

accagcgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  24
<211>  1602
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ITB102 214 (AAV214) VP3

<400>  24
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

gacggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cgtctcatga atcctctgat tgaccagtac      720

ctgtactact tgtctaagac tatcaacgga tccggccaga atcagcagac tctgaagttc      780

agccaaggtg ggcctaatac aatggccaat caggcaaaga actggctgcc aggaccctgt      840

taccgccaac aacgcgtctc aacgacaacc gggcaaaaca acaatagcaa ctttgcctgg      900

actgctggga ccaaatacca tctgaatgga agaaattcat tgatgaatcc tggccccgct      960

atggcatccc acaaagaggg cgaggaccgt ttttttcccc tgtccgggtc cctgattttt     1020

ggcaaacaaa atgctgccag agacaatgcg gattacagcg atgtcatgct caccagcgag     1080

gaagaaatca aaaccactaa ccctgtggct acagaggaat acggtatcgt ggcagataac     1140

ttgcagcagc aaaacacggc tcctcaaatt ggaactgtca acagccaggg ggccttaccc     1200

ggtatggtct ggcagaaccg ggacgtgtac ctgcagggtc ccatctgggc caagattcct     1260

cacacggacg gcaacttcca cccgtctccg ctgatgggcg gctttggcct gaaacatcct     1320

ccgcctcaga tcctgatcaa gaacacgcct gtacctgcgg atcctccgac caccttcaac     1380

cagtcaaagc tgaactcttt catcacgcaa tacagcaccg gacaggtcag cgtggaaatt     1440

gaatgggagc tgcagaagga aaacagcaag cgctggaacc ccgagatcca gtacacctcc     1500

aactactaca aatctacaag tgtggacttt gctgttaata cagaaggcgt gtactctgaa     1560

ccccacccca ttggcacccg ttacctcacc cgtcccctgt aa                        1602


<210>  25
<211>  1605
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214-A VP3

<400>  25
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccaac      180

agcacatctg gaggatcttc aaatgacaac gcctacttcg gctacagcac cccctggggg      240

tattttgact tcaacagatt ccactgccac ttttcaccac gtgactggca aagactcatc      300

aacaacaact ggggattccg acccaagaga ctcaacttca agctctttaa cattcaagtc      360

aaagaggtta cggacaacaa tggagtcaag accatcgcca ataaccttac cagcacggtc      420

caggtcttca cggactcaga ctatcagctc ccgtacgtcc tcggctctgc gcaccagggc      480

tgcctccctc cgttcccggc ggacgtgttc atgattccgc agtacggcta cctaacgctc      540

aacgacggca gccaggcagt gggacggtca tccttttact gcctggaata tttcccatcg      600

cagatgctga gaacgggcaa caactttacc ttcagctaca cctttgagga cgttcctttc      660

cacagcagct acgctcacag ccagagtctg gaccgtctca tgaatcctct gattgaccag      720

tacctgtact acttgtctaa gactatcaac ggatccggcc agaatcagca gactctgaag      780

ttcagccaag gtgggcctaa tacaatggcc aatcaggcaa agaactggct gccaggaccc      840

tgttaccgcc aacaacgcgt ctcaacgaca accgggcaaa acaacaatag caactttgcc      900

tggactgctg ggaccaaata ccatctgaat ggaagaaatt cattgatgaa tcctggcccc      960

gctatggcat cccacaaaga gggcgaggac cgtttttttc ccctgtccgg gtccctgatt     1020

tttggcaaac aaaatgctgc cagagacaat gcggattaca gcgatgtcat gctcaccagc     1080

gaggaagaaa tcaaaaccac taaccctgtg gctacagagg aatacggtat cgtggcagat     1140

aacttgcagc agcaaaacac ggctcctcaa attggaactg tcaacagcca gggggcctta     1200

cccggtatgg tctggcagaa ccgggacgtg tacctgcagg gtcccatctg ggccaagatt     1260

cctcacacgg acggcaactt ccacccgtct ccgctgatgg gcggctttgg cctgaaacat     1320

cctccgcctc agatcctgat caagaacacg cctgtacctg cggatcctcc gaccaccttc     1380

aaccagtcaa agctgaactc tttcatcacg caatacagca ccggacaggt cagcgtggaa     1440

attgaatggg agctgcagaa ggaaaacagc aagcgctgga accccgagat ccagtacacc     1500

tccaactact acaaatctac aagtgtggac tttgctgtta atacagaagg cgtgtactct     1560

gaaccccacc ccattggcac ccgttacctc acccgtcccc tgtaa                     1605


<210>  26
<211>  1602
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e VP3

<400>  26
atggcttcag gcggtggcgc accaatggca gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

gacggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cgtctcatga atcctctgat tgaccagtac      720

ctgtactact tgtctaagac tatcaacgga tccggccaga atcagcagac tctgaagttc      780

agccaaggtg ggcctaatac aatggccaat caggcaaaga actggctgcc aggaccctgt      840

taccgccaac aacgcgtctc aacgacaacc gggcaaaaca acaatagcaa ctttgcctgg      900

actgctggga ccaaatacca tctgaatgga agaaattcat tgatgaatcc tggccccgct      960

atggcatccc acaaagaggg cgaggaccgt ttttttcccc tgtccgggtc cctgattttt     1020

ggcaaacaaa atgctgccag agacaatgcg gattacagcg atgtcatgct caccagcgag     1080

gaagaaatca aaaccactaa ccctgtggct acagaggaat acggtatcgt ggcagataac     1140

ttgcagcagc aaaacacggc tcctcaaatt ggaactgtca acagccaggg ggccttaccc     1200

ggtatggtct ggcagaaccg ggacgtgtac ctgcagggtc ccatctgggc caagattcct     1260

cacacggacg gcaacttcca cccgtctccg ctgatgggcg gctttggcct gaaacatcct     1320

ccgcctcaga tcctgatcaa gaacacgcct gtacctgcgg atcctccgac caccttcaac     1380

cagtcaaagc tgaactcttt catcacgcaa tacagcaccg gacaggtcag cgtggaaatt     1440

gaatgggagc tgcagaagga aaacagcaag cgctggaacc ccgagatcca gtacacctcc     1500

aactactaca aatctacaag tgtggacttt gctgttaata cagaaggcgt gtactctgaa     1560

ccccacccca ttggcacccg ttacctcacc cgtcccctgt aa                        1602


<210>  27
<211>  1602
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e8 VP3

<400>  27
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

gacggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cgtctcatga atcctctgat tgaccagtac      720

ctgtactact tgtctaagac tatcaacgga tccggccaga atcagcagac tctgaagttc      780

agccaaggtg ggcctaatac aatggccaat caggcaaaga actggctgcc aggaccctgt      840

taccgccaac aacgcgtctc aacgacaacc gggcaaaaca acaatagcaa ctttgcctgg      900

actgctggga ccaaatacca tctgaatgga agaaattcat tgatgaatcc tggccccgct      960

atggcatccc acaaagaggg cgaggaccgt ttttttcccc tgtccgggtc cctgattttt     1020

ggcaaacaaa atgctgccag agacaatgcg gattacagcg atgtcatgct caccagcgag     1080

gaagaaatca aaaccactaa ccctgtggct acagaggaat acggtatcgt ggcagataac     1140

ttgcagcagc aaaacacggc tcctcaaatt ggaactgtca acagccaggg ggccttaccc     1200

ggtatggtct ggcagaaccg ggacgtgtac ctgcagggtc ccatctgggc caagattcct     1260

cacacggacg gcaacttcca cccgtctccg ctgatgggcg gctttggcct gaaacatcct     1320

ccgcctcaga tcctgatcaa gaacacgcct gtacctgcgg atcctccgac caccttcaac     1380

cagtcaaagc tgaactcttt catcacgcaa tacagcaccg gacaggtcag cgtggaaatt     1440

gaatgggagc tgcagaagga aaacagcaag cgctggaacc ccgagatcca gtacacctcc     1500

aactactaca aatctacaag tgtggacttt gctgttaata cagaaggcgt gtactctgaa     1560

ccccacccca ttggcacccg ttacctcacc cgtcccctgt aa                        1602


<210>  28
<211>  1602
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e9 VP3

<400>  28
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

gacggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cgtctcatga atcctctgat tgaccagtac      720

ctgtactact tgtctaagac tatcaacgga tccggccaga atcagcagac tctgaagttc      780

agccaaggtg ggcctaatac aatggccaat caggcaaaga actggctgcc aggaccctgt      840

taccgccaac aacgcgtctc aacgacaacc gggcaaaaca acaatagcaa ctttgcctgg      900

actgctggga ccaaatacca tctgaatgga agaaattcat tgatgaatcc tggccccgct      960

atggcatccc acaaagaggg cgaggaccgt ttttttcccc tgtccgggtc cctgattttt     1020

ggcaaacaaa atgctgccag agacaatgcg gattacagcg atgtcatgct caccagcgag     1080

gaagaaatca aaaccactaa ccctgtggct acagaggaat acggtatcgt ggcagataac     1140

ttgcagcagc aaaacacggc tcctcaaatt ggaactgtca acagccaggg ggccttaccc     1200

ggtatggtct ggcagaaccg ggacgtgtac ctgcagggtc ccatctgggc caagattcct     1260

cacacggacg gcaacttcca cccgtctccg ctgatgggcg gctttggcct gaaacatcct     1320

ccgcctcaga tcctgatcaa gaacacgcct gtacctgcgg atcctccgac caccttcaac     1380

cagtcaaagc tgaactcttt catcacgcaa tacagcaccg gacaggtcag cgtggaaatt     1440

gaatgggagc tgcagaagga aaacagcaag cgctggaacc ccgagatcca gtacacctcc     1500

aactactaca aatctacaag tgtggacttt gctgttaata cagaaggcgt gtactctgaa     1560

ccccacccca ttggcacccg ttacctcacc cgtcccctgt aa                        1602


<210>  29
<211>  1602
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214e10 VP3

<400>  29
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtcctcg gctctgcgca ccagggctgc      480

ctccctccgt tcccggcgga cgtgttcatg attccgcagt acggctacct aacgctcaac      540

gacggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cgtctcatga atcctctgat tgaccagtac      720

ctgtactact tgtctaagac tatcaacgga tccggccaga atcagcagac tctgaagttc      780

agccaaggtg ggcctaatac aatggccaat caggcaaaga actggctgcc aggaccctgt      840

taccgccaac aacgcgtctc aacgacaacc gggcaaaaca acaatagcaa ctttgcctgg      900

actgctggga ccaaatacca tctgaatgga agaaattcat tgatgaatcc tggccccgct      960

atggcatccc acaaagaggg cgaggaccgt ttttttcccc tgtccgggtc cctgattttt     1020

ggcaaacaaa atgctgccag agacaatgcg gattacagcg atgtcatgct caccagcgag     1080

gaagaaatca aaaccactaa ccctgtggct acagaggaat acggtatcgt ggcagataac     1140

ttgcagcagc aaaacacggc tcctcaaatt ggaactgtca acagccaggg ggccttaccc     1200

ggtatggtct ggcagaaccg ggacgtgtac ctgcagggtc ccatctgggc caagattcct     1260

cacacggacg gcaacttcca cccgtctccg ctgatgggcg gctttggcct gaaacatcct     1320

ccgcctcaga tcctgatcaa gaacacgcct gtacctgcgg atcctccgac caccttcaac     1380

cagtcaaagc tgaactcttt catcacgcaa tacagcaccg gacaggtcag cgtggaaatt     1440

gaatgggagc tgcagaagga aaacagcaag cgctggaacc ccgagatcca gtacacctcc     1500

aactactaca aatctacaag tgtggacttt gctgttaata cagaaggcgt gtactctgaa     1560

ccccacccca ttggcacccg ttacctcacc cgtcccctgt aa                        1602


<210>  30
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214A VP1

<400>  30

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu 
545                 550                 555                 560 


Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  31
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e VP1

<400>  31

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 
            180                 185                 190         


Pro Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
    210                 215                 220                 


Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn 
            260                 265                 270         


His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu 
545                 550                 555                 560 


Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  32
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e8 VP1

<400>  32

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 
            180                 185                 190         


Pro Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ser Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
    210                 215                 220                 


Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn 
            260                 265                 270         


His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu 
545                 550                 555                 560 


Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  33
<211>  735
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e9 VP1

<400>  33

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
            260                 265                 270         


Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
        275                 280                 285             


His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 
    290                 295                 300                 


Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 
305                 310                 315                 320 


Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn Asn 
                325                 330                 335     


Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu Pro 
            340                 345                 350         


Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 
        355                 360                 365             


Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp Gly 
    370                 375                 380                 


Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385                 390                 395                 400 


Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
                405                 410                 415     


Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 
            420                 425                 430         


Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Lys 
        435                 440                 445             


Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser Gln 
    450                 455                 460                 


Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro Gly 
465                 470                 475                 480 


Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn Asn 
                485                 490                 495     


Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn Gly 
            500                 505                 510         


Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys Glu 
        515                 520                 525             


Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly Lys 
    530                 535                 540                 


Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu Thr 
545                 550                 555                 560 


Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr 
                565                 570                 575     


Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln Ile 
            580                 585                 590         


Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln Asn 
        595                 600                 605             


Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr 
    610                 615                 620                 


Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625                 630                 635                 640 


His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asp 
                645                 650                 655     


Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr Gln 
            660                 665                 670         


Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln Lys 
        675                 680                 685             


Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr 
    690                 695                 700                 


Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val Tyr 
705                 710                 715                 720 


Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735 


<210>  34
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e10 VP1

<400>  34

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro 
            180                 185                 190         


Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ser Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
    210                 215                 220                 


Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn 
            260                 265                 270         


His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu 
545                 550                 555                 560 


Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  35
<211>  598
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214 VP2

<400>  35

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ser Gly Ile Gly Lys Thr Gly Gln Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr 
        115                 120                 125             


Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
    130                 135                 140                 


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
145                 150                 155                 160 


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
                165                 170                 175     


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
            180                 185                 190         


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
        195                 200                 205             


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
    210                 215                 220                 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
225                 230                 235                 240 


Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
                245                 250                 255     


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
            260                 265                 270         


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
        275                 280                 285             


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
    290                 295                 300                 


Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln 
305                 310                 315                 320 


Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln 
                325                 330                 335     


Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
            340                 345                 350         


Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly 
        355                 360                 365             


Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
    370                 375                 380                 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
385                 390                 395                 400 


Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp 
                405                 410                 415     


Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn 
            420                 425                 430         


Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln 
        435                 440                 445             


Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu 
    450                 455                 460                 


Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile 
465                 470                 475                 480 


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
                485                 490                 495     


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
            500                 505                 510         


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys 
        515                 520                 525             


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
    530                 535                 540                 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
545                 550                 555                 560 


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala 
                565                 570                 575     


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg 
            580                 585                 590         


Tyr Leu Thr Arg Pro Leu 
        595             


<210>  36
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214A VP2

<400>  36

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ser Gly Ile Gly Lys Thr Gly Gln Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Ser Thr Ser 
        115                 120                 125             


Gly Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
                405                 410                 415     


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  37
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e VP2

<400>  37

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Pro Ser Pro Gln Arg Ser 
1               5                   10                  15      


Pro Asp Ser Ser Thr Gly Ile Gly Lys Lys Gly Gln Gln Pro Ala Arg 
            20                  25                  30          


Lys Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp 
        35                  40                  45              


Pro Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro 
    50                  55                  60                  


Thr Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu 
65                  70                  75                  80  


Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser 
                85                  90                  95      


Thr Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala 
            100                 105                 110         


Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser 
        115                 120                 125             


Thr Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
                405                 410                 415     


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  38
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e8 VP2

<400>  38

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Pro Ser Pro Gln Arg Ser 
1               5                   10                  15      


Pro Asp Ser Ser Thr Gly Ile Gly Lys Lys Gly Gln Gln Pro Ala Arg 
            20                  25                  30          


Lys Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp 
        35                  40                  45              


Pro Gln Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser Gly Val Gly Pro 
    50                  55                  60                  


Asn Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu 
65                  70                  75                  80  


Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser 
                85                  90                  95      


Thr Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala 
            100                 105                 110         


Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser 
        115                 120                 125             


Thr Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
                405                 410                 415     


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  39
<211>  598
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e9 VP2

<400>  39

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ala Gly Ile Gly Lys Ser Gly Ala Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Thr Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Ile Gly Glu Pro Pro Ala Ala Pro Ser Gly Val Gly Ser Leu 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr 
        115                 120                 125             


Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
    130                 135                 140                 


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
145                 150                 155                 160 


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
                165                 170                 175     


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
            180                 185                 190         


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
        195                 200                 205             


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
    210                 215                 220                 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
225                 230                 235                 240 


Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
                245                 250                 255     


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
            260                 265                 270         


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
        275                 280                 285             


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
    290                 295                 300                 


Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln 
305                 310                 315                 320 


Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln 
                325                 330                 335     


Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
            340                 345                 350         


Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly 
        355                 360                 365             


Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
    370                 375                 380                 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
385                 390                 395                 400 


Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp 
                405                 410                 415     


Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn 
            420                 425                 430         


Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln 
        435                 440                 445             


Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu 
    450                 455                 460                 


Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile 
465                 470                 475                 480 


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
                485                 490                 495     


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
            500                 505                 510         


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys 
        515                 520                 525             


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
    530                 535                 540                 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
545                 550                 555                 560 


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala 
                565                 570                 575     


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg 
            580                 585                 590         


Tyr Leu Thr Arg Pro Leu 
        595             


<210>  40
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214e10 VP2

<400>  40

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Pro Ser Pro Gln Arg Ser 
1               5                   10                  15      


Pro Asp Ser Ser Thr Gly Ile Gly Lys Lys Gly Gln Gln Pro Ala Lys 
            20                  25                  30          


Lys Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp 
        35                  40                  45              


Pro Gln Pro Ile Gly Glu Pro Pro Ala Gly Pro Ser Gly Leu Gly Ser 
    50                  55                  60                  


Gly Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu 
65                  70                  75                  80  


Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser 
                85                  90                  95      


Thr Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala 
            100                 105                 110         


Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser 
        115                 120                 125             


Thr Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
                405                 410                 415     


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  41
<211>  533
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214 VP3

<400>  41

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln 
                245                 250                 255     


Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala 
            260                 265                 270         


Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr 
        275                 280                 285             


Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr 
    290                 295                 300                 


Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala 
305                 310                 315                 320 


Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
                325                 330                 335     


Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr 
            340                 345                 350         


Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro 
        355                 360                 365             


Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln 
    370                 375                 380                 


Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro 
385                 390                 395                 400 


Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     


Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met 
            420                 425                 430         


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn 
        435                 440                 445             


Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu 
    450                 455                 460                 


Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile 
465                 470                 475                 480 


Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile 
                485                 490                 495     


Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val 
            500                 505                 510         


Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr 
        515                 520                 525             


Leu Thr Arg Pro Leu 
    530             


<210>  42
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214A VP3

<400>  42

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly 
    50                  55                  60                  


Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
65                  70                  75                  80  


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
                85                  90                  95      


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
            100                 105                 110         


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
        115                 120                 125             


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
    130                 135                 140                 


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
145                 150                 155                 160 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
                165                 170                 175     


Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
            180                 185                 190         


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
        195                 200                 205             


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
    210                 215                 220                 


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
225                 230                 235                 240 


Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln 
                245                 250                 255     


Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln 
            260                 265                 270         


Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly 
    290                 295                 300                 


Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
                325                 330                 335     


Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp 
            340                 345                 350         


Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln 
    370                 375                 380                 


Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys 
    450                 455                 460                 


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala 
            500                 505                 510         


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Pro Leu 
    530                 


<210>  43
<211>  533
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214E VP3

<400>  43

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln 
                245                 250                 255     


Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala 
            260                 265                 270         


Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr 
        275                 280                 285             


Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr 
    290                 295                 300                 


Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala 
305                 310                 315                 320 


Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
                325                 330                 335     


Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr 
            340                 345                 350         


Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro 
        355                 360                 365             


Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln 
    370                 375                 380                 


Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro 
385                 390                 395                 400 


Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     


Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met 
            420                 425                 430         


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn 
        435                 440                 445             


Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu 
    450                 455                 460                 


Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile 
465                 470                 475                 480 


Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile 
                485                 490                 495     


Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val 
            500                 505                 510         


Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr 
        515                 520                 525             


Leu Thr Arg Pro Leu 
    530             


<210>  44
<211>  533
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214E8 VP3

<400>  44

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln 
                245                 250                 255     


Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala 
            260                 265                 270         


Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr 
        275                 280                 285             


Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr 
    290                 295                 300                 


Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala 
305                 310                 315                 320 


Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
                325                 330                 335     


Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr 
            340                 345                 350         


Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro 
        355                 360                 365             


Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln 
    370                 375                 380                 


Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro 
385                 390                 395                 400 


Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     


Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met 
            420                 425                 430         


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn 
        435                 440                 445             


Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu 
    450                 455                 460                 


Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile 
465                 470                 475                 480 


Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile 
                485                 490                 495     


Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val 
            500                 505                 510         


Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr 
        515                 520                 525             


Leu Thr Arg Pro Leu 
    530             


<210>  45
<211>  533
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214E9 VP3

<400>  45

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln 
                245                 250                 255     


Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala 
            260                 265                 270         


Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr 
        275                 280                 285             


Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr 
    290                 295                 300                 


Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala 
305                 310                 315                 320 


Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
                325                 330                 335     


Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr 
            340                 345                 350         


Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro 
        355                 360                 365             


Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln 
    370                 375                 380                 


Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro 
385                 390                 395                 400 


Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     


Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met 
            420                 425                 430         


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn 
        435                 440                 445             


Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu 
    450                 455                 460                 


Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile 
465                 470                 475                 480 


Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile 
                485                 490                 495     


Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val 
            500                 505                 510         


Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr 
        515                 520                 525             


Leu Thr Arg Pro Leu 
    530             


<210>  46
<211>  533
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214E10 VP3

<400>  46

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln 
                245                 250                 255     


Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala 
            260                 265                 270         


Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr 
        275                 280                 285             


Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr 
    290                 295                 300                 


Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala 
305                 310                 315                 320 


Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
                325                 330                 335     


Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr 
            340                 345                 350         


Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro 
        355                 360                 365             


Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln 
    370                 375                 380                 


Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro 
385                 390                 395                 400 


Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     


Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met 
            420                 425                 430         


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn 
        435                 440                 445             


Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu 
    450                 455                 460                 


Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile 
465                 470                 475                 480 


Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile 
                485                 490                 495     


Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val 
            500                 505                 510         


Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr 
        515                 520                 525             


Leu Thr Arg Pro Leu 
    530             


<210>  47
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ITB102 45 VP1

<400>  47
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg acttcaacag attccactgc cacttttcac cacgtgactg gcaaagactc      900

atcaacaaca actggggatt ccgacccaag agactcaact tcaagctctt taacattcaa      960

gtcaaagagg ttacggacaa caatggagtc aagaccatcg ccaataacct taccagcacg     1020

gtccaggtct tcacggactc agactatcag ctcccgtacg tgctcgggtc ggctcacgag     1080

ggctgcctcc cgccgttccc agcggacgtt ttcatgattc ctcagtacgg ctacctaacg     1140

ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca     1200

tcgcagatgc tgagaacggg caacaacttt accttcagct acacctttga ggacgttcct     1260

ttccacagca gctacgctca cagccagagt ctggaccggc tgatgaatcc tctgattgac     1320

cagtacctgt actacttgtc tcggactcaa acaacaggag gcacggcaaa tacgcagact     1380

ctgggcttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaagg cactggcaga gacaatgtgg atgccgacaa agtcatgatc     1680

accaacgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  48
<211>  1605
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ITB102 45 VP3

<400>  48
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagt      180

gcttcaacgg gggccagcaa cgacaaccac tacttcggct acagcacccc ctgggggtat      240

tttgacttca acagattcca ctgccacttt tcaccacgtg actggcaaag actcatcaac      300

aacaactggg gattccgacc caagagactc aacttcaagc tctttaacat tcaagtcaaa      360

gaggttacgg acaacaatgg agtcaagacc atcgccaata accttaccag cacggtccag      420

gtcttcacgg actcagacta tcagctcccg tacgtgctcg ggtcggctca cgagggctgc      480

ctcccgccgt tcccagcgga cgttttcatg attcctcagt acggctacct aacgctcaac      540

aatggcagcc aggcagtggg acggtcatcc ttttactgcc tggaatattt cccatcgcag      600

atgctgagaa cgggcaacaa ctttaccttc agctacacct ttgaggacgt tcctttccac      660

agcagctacg ctcacagcca gagtctggac cggctgatga atcctctgat tgaccagtac      720

ctgtactact tgtctcggac tcaaacaaca ggaggcacgg caaatacgca gactctgggc      780

ttcagccaag gtgggcctaa tacaatggcc aatcaggcaa agaactggct gccaggaccc      840

tgttaccgcc aacaacgcgt ctcaacgaca accgggcaaa acaacaatag caactttgcc      900

tggactgctg ggaccaaata ccatctgaat ggaagaaatt cattgatgaa tcctggcccc      960

gctatggcat cccacaaaga gggcgaggac cgtttttttc ccctgtccgg gtccctgatt     1020

tttggcaaac aaggcactgg cagagacaat gtggatgccg acaaagtcat gatcaccaac     1080

gaggaagaaa tcaaaaccac taaccctgtg gctacagagg aatacggtat cgtggcagat     1140

aacttgcagc agcaaaacac ggctcctcaa attggaactg tcaacagcca gggggcctta     1200

cccggtatgg tctggcagaa ccgggacgtg tacctgcagg gtcccatctg ggccaagatt     1260

cctcacacgg acggcaactt ccacccgtct ccgctgatgg gcggctttgg cctgaaacat     1320

cctccgcctc agatcctgat caagaacacg cctgtacctg cggatcctcc gaccaccttc     1380

aaccagtcaa agctgaactc tttcatcacg caatacagca ccggacaggt cagcgtggaa     1440

attgaatggg agctgcagaa ggaaaacagc aagcgctgga accccgagat ccagtacacc     1500

tccaactact acaaatctac aagtgtggac tttgctgtta atacagaagg cgtgtactct     1560

gaaccccacc ccattggcac ccgttacctc acccgtcccc tgtaa                     1605


<210>  49
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ITB102 45 VP1

<400>  49

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
            260                 265                 270         


Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
        275                 280                 285             


His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 
    290                 295                 300                 


Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 
305                 310                 315                 320 


Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn Asn 
                325                 330                 335     


Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu Pro 
            340                 345                 350         


Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro Ala 
        355                 360                 365             


Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
    370                 375                 380                 


Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385                 390                 395                 400 


Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
                405                 410                 415     


Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 
            420                 425                 430         


Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Arg 
        435                 440                 445             


Thr Gln Thr Thr Gly Gly Thr Ala Asn Thr Gln Thr Leu Gly Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  50
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ITB102 45 VP2

<400>  50

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ser Gly Ile Gly Lys Thr Gly Gln Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr 
        115                 120                 125             


Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
    130                 135                 140                 


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
145                 150                 155                 160 


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
                165                 170                 175     


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
            180                 185                 190         


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
        195                 200                 205             


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Glu Gly 
    210                 215                 220                 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
225                 230                 235                 240 


Tyr Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
                245                 250                 255     


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
            260                 265                 270         


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
        275                 280                 285             


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
    290                 295                 300                 


Tyr Leu Tyr Tyr Leu Ser Arg Thr Gln Thr Thr Gly Gly Thr Ala Asn 
305                 310                 315                 320 


Thr Gln Thr Leu Gly Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Gly Thr Gly Arg Asp Asn Val 
                405                 410                 415     


Asp Ala Asp Lys Val Met Ile Thr Asn Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  51
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ITB102 45 VP3

<400>  51

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val 
        115                 120                 125             


Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
    130                 135                 140                 


Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Ser Arg Thr Gln Thr Thr Gly Gly Thr Ala Asn Thr 
                245                 250                 255     


Gln Thr Leu Gly Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln 
            260                 265                 270         


Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly 
    290                 295                 300                 


Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
                325                 330                 335     


Gly Ser Leu Ile Phe Gly Lys Gln Gly Thr Gly Arg Asp Asn Val Asp 
            340                 345                 350         


Ala Asp Lys Val Met Ile Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln 
    370                 375                 380                 


Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys 
    450                 455                 460                 


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala 
            500                 505                 510         


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Pro Leu 
    530                 


<210>  52
<211>  7
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-I

<400>  52

Ser Ala Ser Thr Gly Ala Ser 
1               5           


<210>  53
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-I

<400>  53

Asn Ser Thr Ser Gly Gly Ser Ser 
1               5               


<210>  54
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-II

<400>  54

Asp Asn Asn Gly Val Lys 
1               5       


<210>  55
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-III

<400>  55

Asn Asp Gly Ser 
1               


<210>  56
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-IV

<400>  56

Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr 
1               5                   10  


<210>  57
<211>  18
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-V

<400>  57

Arg Val Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp 
1               5                   10                  15      


Thr Ala 
        


<210>  58
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-VI

<400>  58

His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly 
1               5                   10              


<210>  59
<211>  14
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-VII

<400>  59

Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val 
1               5                   10                  


<210>  60
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-VIII

<400>  60

Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln Ile 
1               5                   10              


<210>  61
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VR-IX

<400>  61

Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
1               5                   10  


<210>  62
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV6 VP1

<400>  62
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaaga gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcattggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc      900

atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa      960

gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg     1020

gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag     1080

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg     1140

ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca     1200

tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct     1260

ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac     1320

cagtacctgt attacctgaa cagaactcag aatcagtccg gaagtgccca aaacaaggac     1380

ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct     1440

ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac     1500

tttacctgga ctggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct     1560

ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc     1620

atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc     1680

acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg     1740

gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga     1800

gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc     1860

aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt     1920

aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca     1980

gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc     2040

gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag     2100

tatacatcta actatgcaaa atctgccaac gttgatttca ctgtggacaa caatggactt     2160

tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  63
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV6 VP1

<400>  63

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
            260                 265                 270         


Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
        275                 280                 285             


His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 
    290                 295                 300                 


Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 
305                 310                 315                 320 


Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 
                325                 330                 335     


Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 
            340                 345                 350         


Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 
        355                 360                 365             


Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
    370                 375                 380                 


Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385                 390                 395                 400 


Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
                405                 410                 415     


Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 
            420                 425                 430         


Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 
        435                 440                 445             


Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser 
    450                 455                 460                 


Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 
            500                 505                 510         


Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 
        515                 520                 525             


Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 
    530                 535                 540                 


Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 
545                 550                 555                 560 


Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 
                565                 570                 575     


Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala 
            580                 585                 590         


Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705                 710                 715                 720 


Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  64
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV6 VP2

<400>  64

Thr Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ser Gly Ile Gly Lys Thr Gly Gln Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr 
        115                 120                 125             


Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
    130                 135                 140                 


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
145                 150                 155                 160 


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
                165                 170                 175     


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Thr Asn Asp Gly 
            180                 185                 190         


Val Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Ser 
        195                 200                 205             


Asp Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
    210                 215                 220                 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
225                 230                 235                 240 


Tyr Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
                245                 250                 255     


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
            260                 265                 270         


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
        275                 280                 285             


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
    290                 295                 300                 


Tyr Leu Tyr Tyr Leu Asn Arg Thr Gln Asn Gln Ser Gly Ser Ala Gln 
305                 310                 315                 320 


Asn Lys Asp Leu Leu Phe Ser Arg Gly Ser Pro Ala Gly Met Ser Val 
                325                 330                 335     


Gln Pro Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Lys Thr Lys Thr Asp Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly 
        355                 360                 365             


Ala Ser Lys Tyr Asn Leu Asn Gly Arg Glu Ser Ile Ile Asn Pro Gly 
    370                 375                 380                 


Thr Ala Met Ala Ser His Lys Asp Asp Lys Asp Lys Phe Phe Pro Met 
385                 390                 395                 400 


Ser Gly Val Met Ile Phe Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr 
                405                 410                 415     


Ala Leu Asp Asn Val Met Ile Thr Asp Glu Glu Glu Ile Lys Ala Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Arg Phe Gly Thr Val Ala Val Asn Leu Gln 
        435                 440                 445             


Ser Ser Ser Thr Asp Pro Ala Thr Gly Asp Val His Val Met Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly His Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr 
        515                 520                 525             


Lys Phe Ala Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Val Gln Tyr Thr Ser Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe 
                565                 570                 575     


Thr Val Asp Asn Asn Gly Leu Tyr Thr Glu Pro Arg Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  65
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV6 VP3

<400>  65

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly 
    50                  55                  60                  


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65                  70                  75                  80  


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
                85                  90                  95      


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
            100                 105                 110         


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Thr Asn Asp Gly Val 
        115                 120                 125             


Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Ser Asp 
    130                 135                 140                 


Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
145                 150                 155                 160 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr 
                165                 170                 175     


Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
            180                 185                 190         


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
        195                 200                 205             


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
    210                 215                 220                 


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
225                 230                 235                 240 


Leu Tyr Tyr Leu Asn Arg Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn 
                245                 250                 255     


Lys Asp Leu Leu Phe Ser Arg Gly Ser Pro Ala Gly Met Ser Val Gln 
            260                 265                 270         


Pro Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Lys Thr Lys Thr Asp Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala 
    290                 295                 300                 


Ser Lys Tyr Asn Leu Asn Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Asp Asp Lys Asp Lys Phe Phe Pro Met Ser 
                325                 330                 335     


Gly Val Met Ile Phe Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala 
            340                 345                 350         


Leu Asp Asn Val Met Ile Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Arg Phe Gly Thr Val Ala Val Asn Leu Gln Ser 
    370                 375                 380                 


Ser Ser Thr Asp Pro Ala Thr Gly Asp Val His Val Met Gly Ala Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly His Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys 
    450                 455                 460                 


Phe Ala Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Val Gln Tyr Thr Ser Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr 
            500                 505                 510         


Val Asp Asn Asn Gly Leu Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Pro Leu 
    530                 


<210>  66
<211>  2217
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV8 VP1

<400>  66
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagttc cagaccctca acctctcgga gaacctccag cagcgccctc tggtgtggga      600

cctaatacaa tggctgcagg cggtggcgca ccaatggcag acaataacga aggcgccgac      660

ggagtgggta gttcctcggg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgaac ctgggccctg cccacctaca acaaccacct ctacaagcaa      780

atctccaacg ggacatcggg aggagccacc aacgacaaca cctacttcgg ctacagcacc      840

ccctgggggt attttgactt taacagattc cactgccact tttcaccacg tgactggcag      900

cgactcatca acaacaactg gggattccgg cccaagagac tcagcttcaa gctcttcaac      960

atccaggtca aggaggtcac gcagaatgaa ggcaccaaga ccatcgccaa taacctcacc     1020

agcaccatcc aggtgtttac ggactcggag taccagctgc cgtacgttct cggctctgcc     1080

caccagggct gcctgcctcc gttcccggcg gacgtgttca tgattcccca gtacggctac     1140

ctaacactca acaacggtag tcaggccgtg ggacgctcct ccttctactg cctggaatac     1200

tttccttcgc agatgctgag aaccggcaac aacttccagt ttacttacac cttcgaggac     1260

gtgcctttcc acagcagcta cgcccacagc cagagcttgg accggctgat gaatcctctg     1320

attgaccagt acctgtacta cttgtctcgg actcaaacaa caggaggcac ggcaaatacg     1380

cagactctgg gcttcagcca aggtgggcct aatacaatgg ccaatcaggc aaagaactgg     1440

ctgccaggac cctgttaccg ccaacaacgc gtctcaacga caaccgggca aaacaacaat     1500

agcaactttg cctggactgc tgggaccaaa taccatctga atggaagaaa ttcattggct     1560

aatcctggca tcgctatggc aacacacaaa gacgacgagg agcgtttttt tcccagtaac     1620

gggatcctga tttttggcaa acaaaatgct gccagagaca atgcggatta cagcgatgtc     1680

atgctcacca gcgaggaaga aatcaaaacc actaaccctg tggctacaga ggaatacggt     1740

atcgtggcag ataacttgca gcagcaaaac acggctcctc aaattggaac tgtcaacagc     1800

cagggggcct tacccggtat ggtctggcag aaccgggacg tgtacctgca gggtcccatc     1860

tgggccaaga ttcctcacac ggacggcaac ttccacccgt ctccgctgat gggcggcttt     1920

ggcctgaaac atcctccgcc tcagatcctg atcaagaaca cgcctgtacc tgcggatcct     1980

ccgaccacct tcaaccagtc aaagctgaac tctttcatca cgcaatacag caccggacag     2040

gtcagcgtgg aaattgaatg ggagctgcag aaggaaaaca gcaagcgctg gaaccccgag     2100

atccagtaca cctccaacta ctacaaatct acaagtgtgg actttgctgt taatacagaa     2160

ggcgtgtact ctgaaccccg ccccattggc acccgttacc tcacccgtaa tctgtaa        2217


<210>  67
<211>  738
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV8 VP1

<400>  67

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 
            180                 185                 190         


Pro Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ala Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Thr Tyr 
                405                 410                 415     


Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Thr Thr Gly Gly Thr Ala Asn Thr Gln Thr Leu Gly 
    450                 455                 460                 


Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asn Ser Leu Ala Asn Pro Gly Ile Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Asn Gly Ile Leu Ile 
    530                 535                 540                 


Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala 
            580                 585                 590         


Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 
        595                 600                 605             


Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 
    610                 615                 620                 


Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625                 630                 635                 640 


Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 
                645                 650                 655     


Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe 
            660                 665                 670         


Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 
        675                 680                 685             


Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 
    690                 695                 700                 


Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu 
705                 710                 715                 720 


Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 
                725                 730                 735     


Asn Leu 
        


<210>  68
<211>  601
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV8 VP2

<400>  68

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Pro Ser Pro Gln Arg Ser 
1               5                   10                  15      


Pro Asp Ser Ser Thr Gly Ile Gly Lys Lys Gly Gln Gln Pro Ala Arg 
            20                  25                  30          


Lys Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp 
        35                  40                  45              


Pro Gln Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser Gly Val Gly Pro 
    50                  55                  60                  


Asn Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu 
65                  70                  75                  80  


Gly Ala Asp Gly Val Gly Ser Ser Ser Gly Asn Trp His Cys Asp Ser 
                85                  90                  95      


Thr Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala 
            100                 105                 110         


Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Gly Thr 
        115                 120                 125             


Ser Gly Gly Ala Thr Asn Asp Asn Thr Tyr Phe Gly Tyr Ser Thr Pro 
    130                 135                 140                 


Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg 
145                 150                 155                 160 


Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg 
                165                 170                 175     


Leu Ser Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Gln Asn 
            180                 185                 190         


Glu Gly Thr Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Ile Gln Val 
        195                 200                 205             


Phe Thr Asp Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His 
    210                 215                 220                 


Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln 
225                 230                 235                 240 


Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser 
                245                 250                 255     


Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly 
            260                 265                 270         


Asn Asn Phe Gln Phe Thr Tyr Thr Phe Glu Asp Val Pro Phe His Ser 
        275                 280                 285             


Ser Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile 
    290                 295                 300                 


Asp Gln Tyr Leu Tyr Tyr Leu Ser Arg Thr Gln Thr Thr Gly Gly Thr 
305                 310                 315                 320 


Ala Asn Thr Gln Thr Leu Gly Phe Ser Gln Gly Gly Pro Asn Thr Met 
                325                 330                 335     


Ala Asn Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln 
            340                 345                 350         


Arg Val Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp 
        355                 360                 365             


Thr Ala Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Ala Asn 
    370                 375                 380                 


Pro Gly Ile Ala Met Ala Thr His Lys Asp Asp Glu Glu Arg Phe Phe 
385                 390                 395                 400 


Pro Ser Asn Gly Ile Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp 
                405                 410                 415     


Asn Ala Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys 
            420                 425                 430         


Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn 
        435                 440                 445             


Leu Gln Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln 
    450                 455                 460                 


Gly Ala Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln 
465                 470                 475                 480 


Gly Pro Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro 
                485                 490                 495     


Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile 
            500                 505                 510         


Leu Ile Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn 
        515                 520                 525             


Gln Ser Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val 
    530                 535                 540                 


Ser Val Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp 
545                 550                 555                 560 


Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val 
                565                 570                 575     


Asp Phe Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile 
            580                 585                 590         


Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
        595                 600     


<210>  69
<211>  535
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV8 VP3

<400>  69

Met Ala Ala Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Ser Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly 
    50                  55                  60                  


Gly Ala Thr Asn Asp Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
65                  70                  75                  80  


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
                85                  90                  95      


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Ser 
            100                 105                 110         


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly 
        115                 120                 125             


Thr Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr 
    130                 135                 140                 


Asp Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
145                 150                 155                 160 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
                165                 170                 175     


Tyr Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
            180                 185                 190         


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
        195                 200                 205             


Phe Gln Phe Thr Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
    210                 215                 220                 


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
225                 230                 235                 240 


Tyr Leu Tyr Tyr Leu Ser Arg Thr Gln Thr Thr Gly Gly Thr Ala Asn 
                245                 250                 255     


Thr Gln Thr Leu Gly Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
            260                 265                 270         


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
        275                 280                 285             


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
    290                 295                 300                 


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Ala Asn Pro Gly 
305                 310                 315                 320 


Ile Ala Met Ala Thr His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser 
                325                 330                 335     


Asn Gly Ile Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
            340                 345                 350         


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
        355                 360                 365             


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
    370                 375                 380                 


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
385                 390                 395                 400 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
                405                 410                 415     


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
            420                 425                 430         


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
        435                 440                 445             


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
    450                 455                 460                 


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
465                 470                 475                 480 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
                485                 490                 495     


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
            500                 505                 510         


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr 
        515                 520                 525             


Arg Tyr Leu Thr Arg Asn Leu 
    530                 535 


<210>  70
<211>  2214
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV9 VP1

<400>  70
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacggcaa ggcctacgac      240

cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagttc cagaccctca acctctcgga gaacctccag cagcgccctc tggtgtggga      600

cctaatacaa tggctgcagg cggtggcgca ccaatggcag acaataacga aggcgccgac      660

ggagtgggta attcctcggg aaattggcat tgcgattcca catggctggg ggacagagtc      720

atcaccacca gcacccgaac ctgggcattg cccacctaca acaaccacct ctacaagcaa      780

atctccaatg gaacatcggg aggaagcacc aacgacaaca cctactttgg ctacagcacc      840

ccctgggggt attttgactt caacagattc cactgccact tctcaccacg tgactggcag      900

cgactcatca acaacaactg gggattccgg ccaaagagac tcaacttcaa gctgttcaac      960

atccaggtca aggaggttac gacgaacgaa ggcaccaaga ccatcgccaa taaccttacc     1020

agcaccgtcc aggtctttac ggactcggag taccagctac cgtacgtcct aggctctgcc     1080

caccaaggat gcctgccacc gtttcctgca gacgtcttca tggttcctca gtacggctac     1140

ctgacgctca acaatggaag tcaagcgtta ggacgttctt ctttctactg tctggaatac     1200

ttcccttctc agatgctgag aaccggcaac aactttcagt tcagctacac tttcgaggac     1260

gtgcctttcc acagcagcta cgcacacagc cagagtctag atcgactgat gaaccccctc     1320

atcgaccagt acctatacta cctggtcaga acacagacaa ctggaactgg gggaactcaa     1380

actttggcat tcagccaagc aggccctagc tcaatggcca atcaggctag aaactgggta     1440

cccgggcctt gctaccgtca gcagcgcgtc tccacaacca ccaaccaaaa taacaacagc     1500

aactttgcgt ggacgggagc tgctaaattc aagctgaacg ggagagactc gctaatgaat     1560

cctggcgtgg ctatggcatc gcacaaagac gacgaggacc gcttctttcc atcaagtggc     1620

gttctcatat ttggcaagca aggagccggg aacgatggag tcgactacag ccaggtgctg     1680

attacagatg aggaagaaat taaagccacc aaccctgtag ccacagagga atacggagca     1740

gtggccatca acaaccaggc cgctaacacg caggcgcaaa ctggacttgt gcataaccag     1800

ggagttattc ctggtatggt ctggcagaac cgggacgtgt acctgcaggg ccctatttgg     1860

gctaaaatac ctcacacaga tggcaacttt cacccgtctc ctctgatggg tggatttgga     1920

ctgaaacacc cacctccaca gattctaatt aaaaatacac cagtgccggc agatcctcct     1980

cttaccttca atcaagccaa gctgaactct ttcatcacgc agtacagcac gggacaagtc     2040

agcgtggaaa tcgagtggga gctgcagaaa gaaaacagca agcgctggaa tccagagatc     2100

cagtatactt caaactacta caaatctaca aatgtggact ttgctgtcaa taccaaaggt     2160

gtttactctg agcctcgccc cattggtact cgttacctca cccgtaattt gtaa           2214


<210>  71
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV9 VP1

<400>  71

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  72
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV9 VP2

<400>  72

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ala Gly Ile Gly Lys Ser Gly Ala Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Thr Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Ile Gly Glu Pro Pro Ala Ala Pro Ser Gly Val Gly Ser Leu 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Val Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Ser Ser Ser Gly Asn Trp His Cys Asp Ser Gln 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Ser Thr Ser 
        115                 120                 125             


Gly Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Glu 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Gln Phe Ser Tyr Glu Phe Glu Asn Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Val Ala Gly Pro Ser Asn Met Ala Val 
                325                 330                 335     


Gln Gly Arg Asn Tyr Ile Pro Gly Pro Ser Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Val Thr Gln Asn Asn Asn Ser Glu Phe Ala Trp Pro Gly 
        355                 360                 365             


Ala Ser Ser Trp Ala Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Gly Thr Gly Arg Asp Asn Val 
                405                 410                 415     


Asp Ala Asp Lys Val Met Ile Thr Asn Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Ser Tyr Gly Gln Val Ala Thr Asn His Gln 
        435                 440                 445             


Ser Ala Gln Ala Gln Ala Gln Thr Gly Trp Val Gln Asn Gln Gly Ile 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Met Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Ala Phe Asn Lys Asp 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Asn Asn Val Glu Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Asn Leu 
        595                 


<210>  73
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV9 VP3

<400>  73

Met Ala Ser Gly Gly Gly Ala Pro Val Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Ser Ser Ser Gly Asn Trp His Cys Asp Ser Gln Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly 
    50                  55                  60                  


Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
65                  70                  75                  80  


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
                85                  90                  95      


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
            100                 105                 110         


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
        115                 120                 125             


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
    130                 135                 140                 


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Glu Gly 
145                 150                 155                 160 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
                165                 170                 175     


Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
            180                 185                 190         


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
        195                 200                 205             


Phe Gln Phe Ser Tyr Glu Phe Glu Asn Val Pro Phe His Ser Ser Tyr 
    210                 215                 220                 


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
225                 230                 235                 240 


Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln 
                245                 250                 255     


Gln Thr Leu Lys Phe Ser Val Ala Gly Pro Ser Asn Met Ala Val Gln 
            260                 265                 270         


Gly Arg Asn Tyr Ile Pro Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Thr Thr Val Thr Gln Asn Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala 
    290                 295                 300                 


Ser Ser Trp Ala Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
                325                 330                 335     


Gly Ser Leu Ile Phe Gly Lys Gln Gly Thr Gly Arg Asp Asn Val Asp 
            340                 345                 350         


Ala Asp Lys Val Met Ile Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Ser Tyr Gly Gln Val Ala Thr Asn His Gln Ser 
    370                 375                 380                 


Ala Gln Ala Gln Ala Gln Thr Gly Trp Val Gln Asn Gln Gly Ile Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Met Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys 
    450                 455                 460                 


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala 
            500                 505                 510         


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Asn Leu 
    530                 


<210>  74
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRII-204/AAV6

<400>  74

Thr Asn Asp Gly Val Lys 
1               5       


<210>  75
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRIII-204/AAV6

<400>  75

Asn Asn Gly Ser 
1               


<210>  76
<211>  11
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRIV-204/AAV6

<400>  76

Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp 
1               5                   10      


<210>  77
<211>  18
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRV-204/AAV6

<400>  77

Arg Val Ser Lys Thr Lys Thr Asp Asn Asn Asn Ser Asn Phe Thr Trp 
1               5                   10                  15      


Thr Gly 
        


<210>  78
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRVI-204/AAV6

<400>  78

His Lys Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly 
1               5                   10              


<210>  79
<211>  14
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRVII-204/AAV6

<400>  79

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val 
1               5                   10                  


<210>  80
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRVIII-204/AAV6

<400>  80

Ala Val Asn Leu Gln Asn Ser Ser Thr Asp Pro Ala Thr 
1               5                   10              


<210>  81
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  VRIX-204/AAV6

<400>  81

Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe 
1               5                   10  


<210>  82
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214-AB VP1

<400>  82
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcggaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc      780

tccagcagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccactttt caccacgtga ctggcaaaga      900

ctcatcaaca acaactgggg attccgaccc aagagactca acttcaagct ctttaacatt      960

caagtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaacg acggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggacgtt     1260

cctttccaca gcagctacgc tcacagccag agtctggacc gtctcatgaa tcctctgatt     1320

gaccagtacc tgtactactt gtctaagact atcaacggat ccggccagaa tcagcagact     1380

ctgaagttca gccaaggtgg gcctaataca atggccaatc aggcaaagaa ctggctgcca     1440

ggaccctgtt accgccaaca acgcgtctca acgacaaccg ggcaaaacaa caatagcaac     1500

tttgcctgga ctgctgggac caaataccat ctgaatggaa gaaattcatt gatgaatcct     1560

ggccccgcta tggcatccca caaagagggc gaggaccgtt tttttcccct gtccgggtcc     1620

ctgatttttg gcaaacaaaa tgctgccaga gacaatgcgg attacagcga tgtcatgctc     1680

accagcgagg aagaaatcaa aaccactaac cctgtggcta cagaggaata cggtatcgtg     1740

gcagataact tgcagcagca aaacacggct cctcaaattg gaactgtcaa cagccagggg     1800

gccttacccg gtatggtctg gcagaaccgg gacgtgtacc tgcagggtcc catctgggcc     1860

aagattcctc acacggacgg caacttccac ccgtctccgc tgatgggcgg ctttggcctg     1920

aaacatcctc cgcctcagat cctgatcaag aacacgcctg tacctgcgga tcctccgacc     1980

accttcaacc agtcaaagct gaactctttc atcacgcaat acagcaccgg acaggtcagc     2040

gtggaaattg aatgggagct gcagaaggaa aacagcaagc gctggaaccc cgagatccag     2100

tacacctcca actactacaa atctacaagt gtggactttg ctgttaatac agaaggcgtg     2160

tactctgaac cccaccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210>  83
<211>  1605
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV214-AB VP3

<400>  83
atggcttcag gcggtggcgc accaatggcg gacaataacg aaggcgccga cggagtgggt       60

aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc      120

agcacccgca cctgggcctt gcccacctac aataaccacc tctacaagca aatctccagc      180

agcacatctg gaggatcttc aaatgacaac gcctacttcg gctacagcac cccctggggg      240

tattttgact tcaacagatt ccactgccac ttttcaccac gtgactggca aagactcatc      300

aacaacaact ggggattccg acccaagaga ctcaacttca agctctttaa cattcaagtc      360

aaagaggtta cggacaacaa tggagtcaag accatcgcca ataaccttac cagcacggtc      420

caggtcttca cggactcaga ctatcagctc ccgtacgtcc tcggctctgc gcaccagggc      480

tgcctccctc cgttcccggc ggacgtgttc atgattccgc agtacggcta cctaacgctc      540

aacgacggca gccaggcagt gggacggtca tccttttact gcctggaata tttcccatcg      600

cagatgctga gaacgggcaa caactttacc ttcagctaca cctttgagga cgttcctttc      660

cacagcagct acgctcacag ccagagtctg gaccgtctca tgaatcctct gattgaccag      720

tacctgtact acttgtctaa gactatcaac ggatccggcc agaatcagca gactctgaag      780

ttcagccaag gtgggcctaa tacaatggcc aatcaggcaa agaactggct gccaggaccc      840

tgttaccgcc aacaacgcgt ctcaacgaca accgggcaaa acaacaatag caactttgcc      900

tggactgctg ggaccaaata ccatctgaat ggaagaaatt cattgatgaa tcctggcccc      960

gctatggcat cccacaaaga gggcgaggac cgtttttttc ccctgtccgg gtccctgatt     1020

tttggcaaac aaaatgctgc cagagacaat gcggattaca gcgatgtcat gctcaccagc     1080

gaggaagaaa tcaaaaccac taaccctgtg gctacagagg aatacggtat cgtggcagat     1140

aacttgcagc agcaaaacac ggctcctcaa attggaactg tcaacagcca gggggcctta     1200

cccggtatgg tctggcagaa ccgggacgtg tacctgcagg gtcccatctg ggccaagatt     1260

cctcacacgg acggcaactt ccacccgtct ccgctgatgg gcggctttgg cctgaaacat     1320

cctccgcctc agatcctgat caagaacacg cctgtacctg cggatcctcc gaccaccttc     1380

aaccagtcaa agctgaactc tttcatcacg caatacagca ccggacaggt cagcgtggaa     1440

attgaatggg agctgcagaa ggaaaacagc aagcgctgga accccgagat ccagtacacc     1500

tccaactact acaaatctac aagtgtggac tttgctgtta atacagaagg cgtgtactct     1560

gaaccccacc ccattggcac ccgttacctc acccgtcccc tgtaa                     1605


<210>  84
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214AB VP1

<400>  84

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val Met Leu 
545                 550                 555                 560 


Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala Pro Gln 
            580                 585                 590         


Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro His Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
                725                 730                 735     


<210>  85
<211>  599
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214AB VP2

<400>  85

Met Ala Pro Gly Lys Lys Arg Pro Val Glu Gln Ser Pro Gln Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ser Gly Ile Gly Lys Thr Gly Gln Gln Pro Ala Lys Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
    50                  55                  60                  


Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ser Thr Ser 
        115                 120                 125             


Gly Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp 
    130                 135                 140                 


Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp 
145                 150                 155                 160 


Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu 
                165                 170                 175     


Asn Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn 
            180                 185                 190         


Gly Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe 
        195                 200                 205             


Thr Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln 
    210                 215                 220                 


Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr 
225                 230                 235                 240 


Gly Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser 
                245                 250                 255     


Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 
            260                 265                 270         


Asn Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser 
        275                 280                 285             


Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp 
    290                 295                 300                 


Gln Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn 
305                 310                 315                 320 


Gln Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn 
                325                 330                 335     


Gln Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            340                 345                 350         


Ser Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala 
        355                 360                 365             


Gly Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly 
    370                 375                 380                 


Pro Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu 
385                 390                 395                 400 


Ser Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala 
                405                 410                 415     


Asp Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr 
            420                 425                 430         


Asn Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln 
        435                 440                 445             


Gln Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala 
    450                 455                 460                 


Leu Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro 
465                 470                 475                 480 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro 
                485                 490                 495     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            500                 505                 510         


Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser 
        515                 520                 525             


Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    530                 535                 540                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545                 550                 555                 560 


Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe 
                565                 570                 575     


Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr 
            580                 585                 590         


Arg Tyr Leu Thr Arg Pro Leu 
        595                 


<210>  86
<211>  534
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214AB VP3

<400>  86

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Ser Thr Ser Gly 
    50                  55                  60                  


Gly Ser Ser Asn Asp Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
65                  70                  75                  80  


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
                85                  90                  95      


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
            100                 105                 110         


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Asp Asn Asn Gly 
        115                 120                 125             


Val Lys Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
    130                 135                 140                 


Asp Ser Asp Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
145                 150                 155                 160 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly 
                165                 170                 175     


Tyr Leu Thr Leu Asn Asp Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
            180                 185                 190         


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
        195                 200                 205             


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
    210                 215                 220                 


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
225                 230                 235                 240 


Tyr Leu Tyr Tyr Leu Ser Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln 
                245                 250                 255     


Gln Thr Leu Lys Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln 
            260                 265                 270         


Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
        275                 280                 285             


Thr Thr Thr Gly Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly 
    290                 295                 300                 


Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Met Asn Pro Gly Pro 
305                 310                 315                 320 


Ala Met Ala Ser His Lys Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser 
                325                 330                 335     


Gly Ser Leu Ile Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp 
            340                 345                 350         


Tyr Ser Asp Val Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn 
        355                 360                 365             


Pro Val Ala Thr Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln 
    370                 375                 380                 


Gln Asn Thr Ala Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu 
385                 390                 395                 400 


Pro Gly Met Val Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile 
                405                 410                 415     


Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu 
            420                 425                 430         


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
        435                 440                 445             


Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys 
    450                 455                 460                 


Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
465                 470                 475                 480 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
                485                 490                 495     


Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala 
            500                 505                 510         


Val Asn Thr Glu Gly Val Tyr Ser Glu Pro His Pro Ile Gly Thr Arg 
        515                 520                 525             


Tyr Leu Thr Arg Pro Leu 
    530                 


<210>  87
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV214AB VR-1 amino acid

<400>  87

Ser Ser Thr Ser Gly Gly Ser Ser 
1               5               


<210>  88
<211>  6719
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pA-CF1

<400>  88
tcctgcaggc agctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt       60

cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc      120

aactccatca ctaggggttc ctgcggccgc atggaggcgg tactatgtag atgagaattc      180

aggagcaaac tgggaaaagc aactgcttcc aaatatttgt gatttttaca gtgtagtttt      240

ggaaaaactc ttagcctacc aattcttcta agtgttttaa aatgtgggag ccagtacaca      300

tgaagttata gagtgtttta atgaggctta aatatttacc gtaactatga aatgctacgc      360

atatcatgct gttcaggctc cgtggccacg caactcatac cggtagtact cgccaccatg      420

cagagaagcc ccctggagaa ggcctctgtg gtgagcaagc tgttcttcag ctggaccaga      480

cccatcctga gaaagggcta cagacagaga ctggagctgt ctgacatcta ccagatcccc      540

tctgtggact ctgctgacaa cctgtctgag aagctggaga gagagtggga cagagagctg      600

gccagcaaga agaaccccaa gctgatcaat gccctgagaa gatgcttctt ctggagattc      660

atgttctatg gcatcttcct gtacctgggg gaggtgacca aggctgtgca gcccctgctg      720

ctgggcagaa tcattgccag ctatgaccct gacaacaagg aggagagaag cattgccatc      780

tacctgggca ttggcctgtg cctgctgttc attgtgagaa ccctgctgct gcaccctgcc      840

atctttggcc tgcaccacat tggcatgcag atgagaattg ccatgttcag cctgatctac      900

aagaagaccc tgaagctgag cagcagagtg ctggacaaga tcagcattgg ccagctggtg      960

agcctgctga gcaacaacct gaacaagttt gatgagggcc tggccctggc ccactttgtg     1020

tggattgccc ccctgcaggt ggccctgctg atgggcctga tctgggagct gctgcaggcc     1080

tctgccttct gtggcctggg cttcctgatt gtgctggccc tgttccaggc tggcctgggc     1140

agaatgatga tgaagtacag agaccagaga gctggcaaga tctctgagag actggtgatc     1200

acctctgaga tgattgagaa catccagtct gtgaaggcct actgctggga ggaggccatg     1260

gagaagatga ttgagaacct gagacagaca gagctgaagc tgaccagaaa ggctgcctat     1320

gtgagatact tcaacagctc tgccttcttc ttctctggct tctttgtggt gttcctgtct     1380

gtgctgccct atgccctgat caagggcatc atcctgagaa agatcttcac caccatcagc     1440

ttctgcattg tgctgagaat ggctgtgacc agacagttcc cctgggctgt gcagacctgg     1500

tatgacagcc tgggggccat caacaagatc caggacttcc tgcagaagca ggagtacaag     1560

accctggagt acaacctgac caccacagag gtggtgatgg agaatgtgac agccttctgg     1620

gaggagggct ttggggagct gtttgagaag gccaagcaga acaacaacaa cagaaagacc     1680

agcaatgggg atgacagcct gttcttcagc aacttcagcc tgctgggcac ccctgtgctg     1740

aaggacatca acttcaagat tgagagaggc cagctgctgg ctgtggctgg cagcacaggg     1800

gctggcaaga ccagcctgct gatgatgatc atgggggagc tggagccctc tgagggcaag     1860

atcaagcact ctggcagaat cagcttctgc agccagttca gctggatcat gcctggcacc     1920

atcaaggaga acatcatctt tggggtgagc tatgatgagt acagatacag atctgtgatc     1980

aaggcctgcc agctggagga ggacatcagc aagtttgctg agaaggacaa cattgtgctg     2040

ggggaggggg gcatcaccct gtctgggggc cagagagcca gaatcagcct ggccagagct     2100

gtgtacaagg atgctgacct gtacctgctg gacagcccct ttggctacct ggatgtgctg     2160

acagagaagg agatctttga gagctgtgtg tgcaagctga tggccaacaa gaccagaatc     2220

ctggtgacca gcaagatgga gcacctgaag aaggctgaca agatcctgat cctgcatgag     2280

ggcagcagct acttctatgg caccttctct gagctgcaga acctgcagcc tgacttcagc     2340

agcaagctga tgggctgtga cagctttgac cagttctctg ctgagagaag aaacagcatc     2400

ctgacagaga ccctgcacag attcagcctg gagggggatg cccctgtgag ctggacagag     2460

accaagaagc agagcttcaa gcagacaggg gagtttgggg agaagagaaa gaacagcatc     2520

ctgaacccca tcaacagcac cctgcaggcc agaagaagac agtctgtgct gaacctgatg     2580

acccactctg tgaaccaggg ccagaacatc cacagaaaga ccacagccag caccagaaag     2640

gtgagcctgg ccccccaggc caacctgaca gagctggaca tctacagcag aagactgagc     2700

caggagacag gcctggagat ctctgaggag atcaatgagg aggacctgaa ggagtgcttc     2760

tttgatgaca tggagagcat ccctgctgtg accacctgga acacctacct gagatacatc     2820

acagtgcaca agagcctgat ctttgtgctg atctggtgcc tggtgatctt cctggctgag     2880

gtggctgcca gcctggtggt gctgtggctg ctgggcaaca cccccctgca ggacaagggc     2940

aacagcaccc acagcagaaa caacagctat gctgtgatca tcaccagcac cagcagctac     3000

tatgtgttct acatctatgt gggggtggct gacaccctgc tggccatggg cttcttcaga     3060

ggcctgcccc tggtgcacac cctgatcaca gtgagcaaga tcctgcacca caagatgctg     3120

cactctgtgc tgcaggcccc catgagcacc ctgaacaccc tgaaggctgg gggcatcctg     3180

aacagattca gcaaggacat tgccatcctg gatgacctgc tgcccctgac catctttgac     3240

ttcatccagc tgctgctgat tgtgattggg gccattgctg tggtggctgt gctgcagccc     3300

tacatctttg tggccacagt gcctgtgatt gtggccttca tcatgctgag agcctacttc     3360

ctgcagacca gccagcagct gaagcagctg gagtctgagg gcagaagccc catcttcacc     3420

cacctggtga ccagcctgaa gggcctgtgg accctgagag cctttggcag acagccctac     3480

tttgagaccc tgttccacaa ggccctgaac ctgcacacag ccaactggtt cctgtacctg     3540

agcaccctga gatggttcca gatgagaatt gagatgatct ttgtgatctt cttcattgct     3600

gtgaccttca tcagcatcct gaccacaggg gagggggagg gcagagtggg catcatcctg     3660

accctggcca tgaacatcat gagcaccctg cagtgggctg tgaacagcag cattgatgtg     3720

gacagcctga tgagatctgt gagcagagtg ttcaagttca ttgacatgcc cacagagggc     3780

aagcccacca agagcaccaa gccctacaag aatggccagc tgagcaaggt gatgatcatt     3840

gagaacagcc atgtgaagaa ggatgacatc tggccctctg ggggccagat gacagtgaag     3900

gacctgacag ccaagtacac agaggggggc aatgccatcc tggagaacat cagcttcagc     3960

atcagccctg gccagagagt gggcctgctg ggcagaacag gctctggcaa gagcaccctg     4020

ctgtctgcct tcctgagact gctgaacaca gagggggaga tccagattga tggggtgagc     4080

tgggacagca tcaccctgca gcagtggaga aaggcctttg gggtgatccc ccagaaggtg     4140

ttcatcttct ctggcacctt cagaaagaac ctggacccct atgagcagtg gtctgaccag     4200

gagatctgga aggtggctga tgaggtgggc ctgagatctg tgattgagca gttccctggc     4260

aagctggact ttgtgctggt ggatgggggc tgtgtgctga gccatggcca caagcagctg     4320

atgtgcctgg ccagatctgt gctgagcaag gccaagatcc tgctgctgga tgagccctct     4380

gcccacctgg accctgtgac ctaccagatc atcagaagaa ccctgaagca ggcctttgct     4440

gactgcacag tgatcctgtg tgagcacaga attgaggcca tgctggagtg ccagcagttc     4500

ctggtgattg aggagaacaa ggtgagacag tatgacagca tccagaagct gctgaatgag     4560

agaagcctgt tcagacaggc catcagcccc tctgacagag tgaagctgtt cccccacaga     4620

aacagcagca agtgcaagag caagccccag attgctgccc tgaaggagga gaccgaggag     4680

gaggtgcagg acaccagact gtaaataaaa tacgaaatgg atctgaggaa cccctagtga     4740

tggagttggc cactccctct ctgcgcgctc gctcgctcac tgaggccggg cgaccaaagg     4800

tcgcccgacg cccgggcttt gcccgggcgg cctcagtgag cgagcgagcg cgcagagagg     4860

gagtggccaa ttaattaagg cgatgaacgg taatcgtaaa actagcatgt caatcatatg     4920

taccccggtt gataatcaga aaagccccaa aaacaggaag attgtataag cattaattaa     4980

tttaaataca tggacatgtc agaattggtt aattggttgt aacactgacc cctatttgtt     5040

tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc     5100

ttcaataata ttgaaaaagg aagaatatga gccatattca acgggaaacg tcgaggccgc     5160

gattaaattc caacatggat gctgatttat atgggtataa atgggctcgc gataatgtcg     5220

ggcaatcagg tgcgacaatc tatcgcttgt atgggaagcc cgatgcgcca gagttgtttc     5280

tgaaacatgg caaaggtagc gttgccaatg atgttacaga tgagatggtc agactaaact     5340

ggctgacgga atttatgcca cttccgacca tcaagcattt tatccgtact cctgatgatg     5400

catggttact caccactgcg atccccggaa aaacagcgtt ccaggtatta gaagaatatc     5460

ctgattcagg tgaaaatatt gttgatgcgc tggcagtgtt cctgcgccgg ttgcactcga     5520

ttcctgtttg taattgtcct tttaacagcg atcgcgtatt tcgcctcgct caggcgcaat     5580

cacgaatgaa taacggtttg gttgatgcga gtgattttga tgacgagcgt aatggctggc     5640

ctgttgaaca agtctggaaa gaaatgcata aacttttgcc attctcaccg gattcagtcg     5700

tcactcatgg tgatttctca cttgataacc ttatttttga cgaggggaaa ttaataggtt     5760

gtattgatgt tggacgagtc ggaatcgcag accgatacca ggatcttgcc atcctatgga     5820

actgcctcgg tgagttttct ccttcattac agaaacggct ttttcaaaaa tatggtattg     5880

ataatcctga tatgaataaa ttgcagtttc atttgatgct cgatgagttt ttctaaaagc     5940

agagcattac gctgacttga cgggacggcg caagctcatg accaaaatcc cttaacgtga     6000

gttacgcgcg cgtcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct     6060

tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca     6120

gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc     6180

agcagagcgc agataccaaa tactgttctt ctagtgtagc cgtagttagc ccaccacttc     6240

aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct     6300

gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag     6360

gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc     6420

tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg     6480

agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag     6540

cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt     6600

gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac     6660

gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt taaaccatg      6719


<210>  89
<211>  6751
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pA-CF3

<400>  89
tcctgcaggc agctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt       60

cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc      120

aactccatca ctaggggttc ctgcggccgc atggaggcgg tactatgtag atgagaattc      180

aggagcaaac tgggaaaagc aactgcttcc aaatatttgt gatttttaca gtgtagtttt      240

ggaaaaactc ttagcctacc aattcttcta agtgttttaa aatgtgggag ccagtacaca      300

tgaagttata gagtgtttta atgaggctta aatatttacc gtaactatga aatgctacgc      360

atatcatgct gttcaggctc cgtggccacg caactcatac cggtagtact cgccaccatg      420

cagagaagcc ccctggagaa ggcctctgtg gtgagcaagc tgttcttcag ctggaccaga      480

cccatcctga gaaagggcta cagacagaga ctggagctgt ctgacatcta ccagatcccc      540

tctgtggact ctgctgacaa cctgtctgag aagctggaga gagagtggga cagagagctg      600

gccagcaaga agaaccccaa gctgatcaat gccctgagaa gatgcttctt ctggagattc      660

atgttctatg gcatcttcct gtacctgggg gaggtgacca aggctgtgca gcccctgctg      720

ctgggcagaa tcattgccag ctatgaccct gacaacaagg aggagagaag cattgccatc      780

tacctgggca ttggcctgtg cctgctgttc attgtgagaa ccctgctgct gcaccctgcc      840

atctttggcc tgcaccacat tggcatgcag atgagaattg ccatgttcag cctgatctac      900

aagaagaccc tgaagctgag cagcagagtg ctggacaaga tcagcattgg ccagctggtg      960

agcctgctga gcaacaacct gaacaagttt gatgagggcc tggccctggc ccactttgtg     1020

tggattgccc ccctgcaggt ggccctgctg atgggcctga tctgggagct gctgcaggcc     1080

tctgccttct gtggcctggg cttcctgatt gtgctggccc tgttccaggc tggcctgggc     1140

agaatgatga tgaagtacag agaccagaga gctggcaaga tctctgagag actggtgatc     1200

acctctgaga tgattgagaa catccagtct gtgaaggcct actgctggga ggaggccatg     1260

gagaagatga ttgagaacct gagacagaca gagctgaagc tgaccagaaa ggctgcctat     1320

gtgagatact tcaacagctc tgccttcttc ttctctggct tctttgtggt gttcctgtct     1380

gtgctgccct atgccctgat caagggcatc atcctgagaa agatcttcac caccatcagc     1440

ttctgcattg tgctgagaat ggctgtgacc agacagttcc cctgggctgt gcagacctgg     1500

tatgacagcc tgggggccat caacaagatc caggacttcc tgcagaagca ggagtacaag     1560

accctggagt acaacctgac caccacagag gtggtgatgg agaatgtgac agccttctgg     1620

gaggagggct ttggggagct gtttgagaag gccaagcaga acaacaacaa cagaaagacc     1680

agcaatgggg atgacagcct gttcttcagc aacttcagcc tgctgggcac ccctgtgctg     1740

aaggacatca acttcaagat tgagagaggc cagctgctgg ctgtggctgg cagcacaggg     1800

gctggcaaga ccagcctgct gatgatgatc atgggggagc tggagccctc tgagggcaag     1860

atcaagcact ctggcagaat cagcttctgc agccagttca gctggatcat gcctggcacc     1920

atcaaggaga acatcatctt tggggtgagc tatgatgagt acagatacag atctgtgatc     1980

aaggcctgcc agctggagga ggacatcagc aagtttgctg agaaggacaa cattgtgctg     2040

ggggaggggg gcatcaccct gtctgggggc cagagagcca gaatcagcct ggccagagct     2100

gtgtacaagg atgctgacct gtacctgctg gacagcccct ttggctacct ggatgtgctg     2160

acagagaagg agatctttga gagctgtgtg tgcaagctga tggccaacaa gaccagaatc     2220

ctggtgacca gcaagatgga gcacctgaag aaggctgaca agatcctgat cctgcatgag     2280

ggcagcagct acttctatgg caccttctct gagctgcaga acctgcagcc tgacttcagc     2340

agcaagctga tgggctgtga cagctttgac cagttctctg ctgagagaag aaacagcatc     2400

ctgacagaga ccctgcacag attcagcctg gagggggatg cccctgtgag ctggacagag     2460

accaagaagc agagcttcaa gcagacaggg gagtttgggg agaagagaaa gaacagcatc     2520

ctgaacccca tcaacagcac cctgcaggcc agaagaagac agtctgtgct gaacctgatg     2580

acccactctg tgaaccaggg ccagaacatc cacagaaaga ccacagccag caccagaaag     2640

gtgagcctgg ccccccaggc caacctgaca gagctggaca tctacagcag aagactgagc     2700

caggagacag gcctggagat ctctgaggag atcaatgagg aggacctgaa ggagtgcttc     2760

tttgatgaca tggagagcat ccctgctgtg accacctgga acacctacct gagatacatc     2820

acagtgcaca agagcctgat ctttgtgctg atctggtgcc tggtgatctt cctggctgag     2880

gtggctgcca gcctggtggt gctgtggctg ctgggcaaca cccccctgca ggacaagggc     2940

aacagcaccc acagcagaaa caacagctat gctgtgatca tcaccagcac cagcagctac     3000

tatgtgttct acatctatgt gggggtggct gacaccctgc tggccatggg cttcttcaga     3060

ggcctgcccc tggtgcacac cctgatcaca gtgagcaaga tcctgcacca caagatgctg     3120

cactctgtgc tgcaggcccc catgagcacc ctgaacaccc tgaaggctgg gggcatcctg     3180

aacagattca gcaaggacat tgccatcctg gatgacctgc tgcccctgac catctttgac     3240

ttcatccagc tgctgctgat tgtgattggg gccattgctg tggtggctgt gctgcagccc     3300

tacatctttg tggccacagt gcctgtgatt gtggccttca tcatgctgag agcctacttc     3360

ctgcagacca gccagcagct gaagcagctg gagtctgagg gcagaagccc catcttcacc     3420

cacctggtga ccagcctgaa gggcctgtgg accctgagag cctttggcag acagccctac     3480

tttgagaccc tgttccacaa ggccctgaac ctgcacacag ccaactggtt cctgtacctg     3540

agcaccctga gatggttcca gatgagaatt gagatgatct ttgtgatctt cttcattgct     3600

gtgaccttca tcagcatcct gaccacaggg gagggggagg gcagagtggg catcatcctg     3660

accctggcca tgaacatcat gagcaccctg cagtgggctg tgaacagcag cattgatgtg     3720

gacagcctga tgagatctgt gagcagagtg ttcaagttca ttgacatgcc cacagagggc     3780

aagcccacca agagcaccaa gccctacaag aatggccagc tgagcaaggt gatgatcatt     3840

gagaacagcc atgtgaagaa ggatgacatc tggccctctg ggggccagat gacagtgaag     3900

gacctgacag ccaagtacac agaggggggc aatgccatcc tggagaacat cagcttcagc     3960

atcagccctg gccagagagt gggcctgctg ggcagaacag gctctggcaa gagcaccctg     4020

ctgtctgcct tcctgagact gctgaacaca gagggggaga tccagattga tggggtgagc     4080

tgggacagca tcaccctgca gcagtggaga aaggcctttg gggtgatccc ccagaaggtg     4140

ttcatcttct ctggcacctt cagaaagaac ctggacccct atgagcagtg gtctgaccag     4200

gagatctgga aggtggctga tgaggtgggc ctgagatctg tgattgagca gttccctggc     4260

aagctggact ttgtgctggt ggatgggggc tgtgtgctga gccatggcca caagcagctg     4320

atgtgcctgg ccagatctgt gctgagcaag gccaagatcc tgctgctgga tgagccctct     4380

gcccacctgg accctgtgac ctaccagatc atcagaagaa ccctgaagca ggcctttgct     4440

gactgcacag tgatcctgtg tgagcacaga attgaggcca tgctggagtg ccagcagttc     4500

ctggtgattg aggagaacaa ggtgagacag tatgacagca tccagaagct gctgaatgag     4560

agaagcctgt tcagacaggc catcagcccc tctgacagag tgaagctgtt cccccacaga     4620

aacagcagca agtgcaagag caagccccag attgctgccc tgaaggagga gaccgaggag     4680

gaggtgcagg acaccagact gtaaataaat atctttattt tcattacatc tgtgtgttgg     4740

ttttttgtgt ggatctgagg aacccctagt gatggagttg gccactccct ctctgcgcgc     4800

tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga cgcccgggct ttgcccgggc     4860

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aattaattaa ggcgatgaac     4920

ggtaatcgta aaactagcat gtcaatcata tgtaccccgg ttgataatca gaaaagcccc     4980

aaaaacagga agattgtata agcattaatt aatttaaata catggacatg tcagaattgg     5040

ttaattggtt gtaacactga cccctatttg tttatttttc taaatacatt caaatatgta     5100

tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagaatat     5160

gagccatatt caacgggaaa cgtcgaggcc gcgattaaat tccaacatgg atgctgattt     5220

atatgggtat aaatgggctc gcgataatgt cgggcaatca ggtgcgacaa tctatcgctt     5280

gtatgggaag cccgatgcgc cagagttgtt tctgaaacat ggcaaaggta gcgttgccaa     5340

tgatgttaca gatgagatgg tcagactaaa ctggctgacg gaatttatgc cacttccgac     5400

catcaagcat tttatccgta ctcctgatga tgcatggtta ctcaccactg cgatccccgg     5460

aaaaacagcg ttccaggtat tagaagaata tcctgattca ggtgaaaata ttgttgatgc     5520

gctggcagtg ttcctgcgcc ggttgcactc gattcctgtt tgtaattgtc cttttaacag     5580

cgatcgcgta tttcgcctcg ctcaggcgca atcacgaatg aataacggtt tggttgatgc     5640

gagtgatttt gatgacgagc gtaatggctg gcctgttgaa caagtctgga aagaaatgca     5700

taaacttttg ccattctcac cggattcagt cgtcactcat ggtgatttct cacttgataa     5760

ccttattttt gacgagggga aattaatagg ttgtattgat gttggacgag tcggaatcgc     5820

agaccgatac caggatcttg ccatcctatg gaactgcctc ggtgagtttt ctccttcatt     5880

acagaaacgg ctttttcaaa aatatggtat tgataatcct gatatgaata aattgcagtt     5940

tcatttgatg ctcgatgagt ttttctaaaa gcagagcatt acgctgactt gacgggacgg     6000

cgcaagctca tgaccaaaat cccttaacgt gagttacgcg cgcgtcgttc cactgagcgt     6060

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct     6120

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc     6180

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc     6240

ttctagtgta gccgtagtta gcccaccact tcaagaactc tgtagcaccg cctacatacc     6300

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg     6360

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt     6420

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg     6480

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg     6540

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt     6600

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag     6660

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt     6720

gctggccttt tgctcacatg tttaaaccat g                                    6751


<210>  90
<211>  6603
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pA-CF5

<400>  90
tcctgcaggc agctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt       60

cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc      120

aactccatca ctaggggttc ctgcggccgc aatatttgca tgtcgctatg tgttctggga      180

aatcaccata aacgtgaaat gtctttggat ttgggaatct tcgaagttct gtatgagacc      240

acagatctcc accggtagta ctcgccacca tgcagagaag ccccctggag aaggcctctg      300

tggtgagcaa gctgttcttc agctggacca gacccatcct gagaaagggc tacagacaga      360

gactggagct gtctgacatc taccagatcc cctctgtgga ctctgctgac aacctgtctg      420

agaagctgga gagagagtgg gacagagagc tggccagcaa gaagaacccc aagctgatca      480

atgccctgag aagatgcttc ttctggagat tcatgttcta tggcatcttc ctgtacctgg      540

gggaggtgac caaggctgtg cagcccctgc tgctgggcag aatcattgcc agctatgacc      600

ctgacaacaa ggaggagaga agcattgcca tctacctggg cattggcctg tgcctgctgt      660

tcattgtgag aaccctgctg ctgcaccctg ccatctttgg cctgcaccac attggcatgc      720

agatgagaat tgccatgttc agcctgatct acaagaagac cctgaagctg agcagcagag      780

tgctggacaa gatcagcatt ggccagctgg tgagcctgct gagcaacaac ctgaacaagt      840

ttgatgaggg cctggccctg gcccactttg tgtggattgc ccccctgcag gtggccctgc      900

tgatgggcct gatctgggag ctgctgcagg cctctgcctt ctgtggcctg ggcttcctga      960

ttgtgctggc cctgttccag gctggcctgg gcagaatgat gatgaagtac agagaccaga     1020

gagctggcaa gatctctgag agactggtga tcacctctga gatgattgag aacatccagt     1080

ctgtgaaggc ctactgctgg gaggaggcca tggagaagat gattgagaac ctgagacaga     1140

cagagctgaa gctgaccaga aaggctgcct atgtgagata cttcaacagc tctgccttct     1200

tcttctctgg cttctttgtg gtgttcctgt ctgtgctgcc ctatgccctg atcaagggca     1260

tcatcctgag aaagatcttc accaccatca gcttctgcat tgtgctgaga atggctgtga     1320

ccagacagtt cccctgggct gtgcagacct ggtatgacag cctgggggcc atcaacaaga     1380

tccaggactt cctgcagaag caggagtaca agaccctgga gtacaacctg accaccacag     1440

aggtggtgat ggagaatgtg acagccttct gggaggaggg ctttggggag ctgtttgaga     1500

aggccaagca gaacaacaac aacagaaaga ccagcaatgg ggatgacagc ctgttcttca     1560

gcaacttcag cctgctgggc acccctgtgc tgaaggacat caacttcaag attgagagag     1620

gccagctgct ggctgtggct ggcagcacag gggctggcaa gaccagcctg ctgatgatga     1680

tcatggggga gctggagccc tctgagggca agatcaagca ctctggcaga atcagcttct     1740

gcagccagtt cagctggatc atgcctggca ccatcaagga gaacatcatc tttggggtga     1800

gctatgatga gtacagatac agatctgtga tcaaggcctg ccagctggag gaggacatca     1860

gcaagtttgc tgagaaggac aacattgtgc tgggggaggg gggcatcacc ctgtctgggg     1920

gccagagagc cagaatcagc ctggccagag ctgtgtacaa ggatgctgac ctgtacctgc     1980

tggacagccc ctttggctac ctggatgtgc tgacagagaa ggagatcttt gagagctgtg     2040

tgtgcaagct gatggccaac aagaccagaa tcctggtgac cagcaagatg gagcacctga     2100

agaaggctga caagatcctg atcctgcatg agggcagcag ctacttctat ggcaccttct     2160

ctgagctgca gaacctgcag cctgacttca gcagcaagct gatgggctgt gacagctttg     2220

accagttctc tgctgagaga agaaacagca tcctgacaga gaccctgcac agattcagcc     2280

tggaggggga tgcccctgtg agctggacag agaccaagaa gcagagcttc aagcagacag     2340

gggagtttgg ggagaagaga aagaacagca tcctgaaccc catcaacagc accctgcagg     2400

ccagaagaag acagtctgtg ctgaacctga tgacccactc tgtgaaccag ggccagaaca     2460

tccacagaaa gaccacagcc agcaccagaa aggtgagcct ggccccccag gccaacctga     2520

cagagctgga catctacagc agaagactga gccaggagac aggcctggag atctctgagg     2580

agatcaatga ggaggacctg aaggagtgct tctttgatga catggagagc atccctgctg     2640

tgaccacctg gaacacctac ctgagataca tcacagtgca caagagcctg atctttgtgc     2700

tgatctggtg cctggtgatc ttcctggctg aggtggctgc cagcctggtg gtgctgtggc     2760

tgctgggcaa cacccccctg caggacaagg gcaacagcac ccacagcaga aacaacagct     2820

atgctgtgat catcaccagc accagcagct actatgtgtt ctacatctat gtgggggtgg     2880

ctgacaccct gctggccatg ggcttcttca gaggcctgcc cctggtgcac accctgatca     2940

cagtgagcaa gatcctgcac cacaagatgc tgcactctgt gctgcaggcc cccatgagca     3000

ccctgaacac cctgaaggct gggggcatcc tgaacagatt cagcaaggac attgccatcc     3060

tggatgacct gctgcccctg accatctttg acttcatcca gctgctgctg attgtgattg     3120

gggccattgc tgtggtggct gtgctgcagc cctacatctt tgtggccaca gtgcctgtga     3180

ttgtggcctt catcatgctg agagcctact tcctgcagac cagccagcag ctgaagcagc     3240

tggagtctga gggcagaagc cccatcttca cccacctggt gaccagcctg aagggcctgt     3300

ggaccctgag agcctttggc agacagccct actttgagac cctgttccac aaggccctga     3360

acctgcacac agccaactgg ttcctgtacc tgagcaccct gagatggttc cagatgagaa     3420

ttgagatgat ctttgtgatc ttcttcattg ctgtgacctt catcagcatc ctgaccacag     3480

gggaggggga gggcagagtg ggcatcatcc tgaccctggc catgaacatc atgagcaccc     3540

tgcagtgggc tgtgaacagc agcattgatg tggacagcct gatgagatct gtgagcagag     3600

tgttcaagtt cattgacatg cccacagagg gcaagcccac caagagcacc aagccctaca     3660

agaatggcca gctgagcaag gtgatgatca ttgagaacag ccatgtgaag aaggatgaca     3720

tctggccctc tgggggccag atgacagtga aggacctgac agccaagtac acagaggggg     3780

gcaatgccat cctggagaac atcagcttca gcatcagccc tggccagaga gtgggcctgc     3840

tgggcagaac aggctctggc aagagcaccc tgctgtctgc cttcctgaga ctgctgaaca     3900

cagaggggga gatccagatt gatggggtga gctgggacag catcaccctg cagcagtgga     3960

gaaaggcctt tggggtgatc ccccagaagg tgttcatctt ctctggcacc ttcagaaaga     4020

acctggaccc ctatgagcag tggtctgacc aggagatctg gaaggtggct gatgaggtgg     4080

gcctgagatc tgtgattgag cagttccctg gcaagctgga ctttgtgctg gtggatgggg     4140

gctgtgtgct gagccatggc cacaagcagc tgatgtgcct ggccagatct gtgctgagca     4200

aggccaagat cctgctgctg gatgagccct ctgcccacct ggaccctgtg acctaccaga     4260

tcatcagaag aaccctgaag caggcctttg ctgactgcac agtgatcctg tgtgagcaca     4320

gaattgaggc catgctggag tgccagcagt tcctggtgat tgaggagaac aaggtgagac     4380

agtatgacag catccagaag ctgctgaatg agagaagcct gttcagacag gccatcagcc     4440

cctctgacag agtgaagctg ttcccccaca gaaacagcag caagtgcaag agcaagcccc     4500

agattgctgc cctgaaggag gagaccgagg aggaggtgca ggacaccaga ctgtaaataa     4560

atatctttat tttcattaca tctgtgtgtt ggttttttgt gtggatctga ggaaccccta     4620

gtgatggagt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca     4680

aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg agcgcgcaga     4740

gagggagtgg ccaattaatt aaggcgatga acggtaatcg taaaactagc atgtcaatca     4800

tatgtacccc ggttgataat cagaaaagcc ccaaaaacag gaagattgta taagcattaa     4860

ttaatttaaa tacatggaca tgtcagaatt ggttaattgg ttgtaacact gacccctatt     4920

tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa     4980

atgcttcaat aatattgaaa aaggaagaat atgagccata ttcaacggga aacgtcgagg     5040

ccgcgattaa attccaacat ggatgctgat ttatatgggt ataaatgggc tcgcgataat     5100

gtcgggcaat caggtgcgac aatctatcgc ttgtatggga agcccgatgc gccagagttg     5160

tttctgaaac atggcaaagg tagcgttgcc aatgatgtta cagatgagat ggtcagacta     5220

aactggctga cggaatttat gccacttccg accatcaagc attttatccg tactcctgat     5280

gatgcatggt tactcaccac tgcgatcccc ggaaaaacag cgttccaggt attagaagaa     5340

tatcctgatt caggtgaaaa tattgttgat gcgctggcag tgttcctgcg ccggttgcac     5400

tcgattcctg tttgtaattg tccttttaac agcgatcgcg tatttcgcct cgctcaggcg     5460

caatcacgaa tgaataacgg tttggttgat gcgagtgatt ttgatgacga gcgtaatggc     5520

tggcctgttg aacaagtctg gaaagaaatg cataaacttt tgccattctc accggattca     5580

gtcgtcactc atggtgattt ctcacttgat aaccttattt ttgacgaggg gaaattaata     5640

ggttgtattg atgttggacg agtcggaatc gcagaccgat accaggatct tgccatccta     5700

tggaactgcc tcggtgagtt ttctccttca ttacagaaac ggctttttca aaaatatggt     5760

attgataatc ctgatatgaa taaattgcag tttcatttga tgctcgatga gtttttctaa     5820

aagcagagca ttacgctgac ttgacgggac ggcgcaagct catgaccaaa atcccttaac     5880

gtgagttacg cgcgcgtcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc     5940

ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct     6000

accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg     6060

cttcagcaga gcgcagatac caaatactgt tcttctagtg tagccgtagt tagcccacca     6120

cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc     6180

tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga     6240

taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac     6300

gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga     6360

agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag     6420

ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg     6480

acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag     6540

caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgtttaaacc     6600

atg                                                                   6603


<210>  91
<211>  7519
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pA-CF7

<400>  91
tcctgcaggc agctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt       60

cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc      120

aactccatca ctaggggttc ctgcggccgc aatatttgca tgtcgctatg tgttctggga      180

aatcaccata aacgtgaaat gtctttggat ttgggaatct tcgaagttct gtatgagacc      240

acagatctcc accggtagta ctcgccacca tgcagagaag ccccctggag aaggcctctg      300

tggtgagcaa gctgttcttc ccccctggag aaggcctctg tggtgagcaa gctgttcttc      360

agctggacca gacccatcct gagaaagggc tacagacaga gactggagct gtctgacatc      420

taccagatcc cctctgtgga ctctgctgac aacctgtctg agaagctgga gagagagtgg      480

gacagagagc tggccagcaa gaagaacccc aagctgatca atgccctgag aagatgcttc      540

ttctggagat tcatgttcta tggcatcttc ctgtacctgg gggaggtgac caaggctgtg      600

cagcccctgc tgctgggcag aatcattgcc agctatgacc cagcccctgc tgctgggcag      660

aatcattgcc agctatgacc ctgacaacaa ggaggagaga agcattgcca tctacctggg      720

cattggcctg tgcctgctgt tcattgtgag aaccctgctg ctgcaccctg ccatctttgg      780

cctgcaccac attggcatgc agatgagaat tgccatgttc agcctgatct acaagaagac      840

cctgaagctg agcagcagag tgctggacaa gatcagcatt ggccagctgg tgagcctgct      900

gagcaacaac ctgaacaagt ttgatgaggg cctggccctg gcccactttg tgtggattgc      960

ttgatgaggg cctggccctg gcccactttg tgtggattgc ccccctgcag gtggccctgc     1020

tgatgggcct gatctgggag ctgctgcagg cctctgcctt ctgtggcctg ggcttcctga     1080

ttgtgctggc cctgttccag gctggcctgg gcagaatgat gatgaagtac agagaccaga     1140

gagctggcaa gatctctgag agactggtga tcacctctga gatgattgag aacatccagt     1200

ctgtgaaggc ctactgctgg gaggaggcca tggagaagat gattgagaac ctgagacaga     1260

cagagctgaa gctgaccaga aaggctgcct atgtgagata cttcaacagc tctgccttct     1320

tcttctctgg cttctttgtg gtgttcctgt ctgtgctgcc tctgccttct tcttctctgg     1380

cttctttgtg gtgttcctgt ctgtgctgcc ctatgccctg atcaagggca tcatcctgag     1440

aaagatcttc accaccatca gcttctgcat tgtgctgaga atggctgtga ccagacagtt     1500

cccctgggct gtgcagacct ggtatgacag cctgggggcc atcaacaaga tccaggactt     1560

cctgcagaag caggagtaca agaccctgga gtacaacctg accaccacag aggtggtgat     1620

ggagaatgtg acagccttct gggaggaggg ctttggggag ctgtttgaga aggccaagca     1680

gaacaacaac aacagaaaga ccagcaatgg ggatgacagc ctgttcttca gcaacttcag     1740

cctgctgggc acccctgtgc ggatgacagc ctgttcttca gcaacttcag cctgctgggc     1800

acccctgtgc tgaaggacat caacttcaag attgagagag gccagctgct ggctgtggct     1860

ggcagcacag gggctggcaa gaccagcctg ctgatgatga tcatggggga gctggagccc     1920

tctgagggca agatcaagca ctctggcaga atcagcttct gcagccagtt cagctggatc     1980

atgcctggca ccatcaagga gaacatcatc tttggggtga gctatgatga gtacagatac     2040

agatctgtga tcaaggcctg ccagctggag gaggacatca agatctgtga tcaaggcctg     2100

ccagctggag gaggacatca gcaagtttgc tgagaaggac aacattgtgc tgggggaggg     2160

gggcatcacc ctgtctgggg gccagagagc cagaatcagc ctggccagag ctgtgtacaa     2220

ggatgctgac ctgtacctgc tggacagccc ctttggctac ctggatgtgc tgacagagaa     2280

ggagatcttt gagagctgtg tgtgcaagct gatggccaac aagaccagaa tcctggtgac     2340

cagcaagatg gagcacctga agaaggctga caagatcctg atcctgcatg agggcagcag     2400

agaaggctga caagatcctg atcctgcatg agggcagcag ctacttctat ggcaccttct     2460

ctgagctgca gaacctgcag cctgacttca gcagcaagct gatgggctgt gacagctttg     2520

accagttctc tgctgagaga agaaacagca tcctgacaga gaccctgcac agattcagcc     2580

tggaggggga tgcccctgtg agctggacag agaccaagaa gcagagcttc aagcagacag     2640

gggagtttgg ggagaagaga aagaacagca tcctgaaccc catcaacagc atcagaaagt     2700

tcagcattgt gcagaagacc catcaacagc atcagaaagt tcagcattgt gcagaagacc     2760

cccctgcaga tgaatggcat tgaggaggac tctgatgagc ccctggagag aagactgagc     2820

ctggtgcctg actctgagca gggggaggcc atcctgccca gaatctctgt gatcagcaca     2880

ggccccaccc tgcaggccag aagaagacag tctgtgctga acctgatgac ccactctgtg     2940

aaccagggcc agaacatcca ccactctgtg aaccagggcc agaacatcca cagaaagacc     3000

acagccagca ccagaaaggt gagcctggcc ccccaggcca acctgacaga gctggacatc     3060

tacagcagaa gactgagcca ggagacaggc ctggagatct ctgaggagat caatgaggag     3120

gacctgaagg agtgcttctt tgatgacatg gagagcatcc ctgctgtgac cacctggaac     3180

acctacctga gatacatcac agtgcacaag agcctgatct ttgtgctgat ctggtgcctg     3240

gtgatcttcc tggctgaggt ggctgccagc ctggtggtgc gtgatcttcc tggctgaggt     3300

ggctgccagc ctggtggtgc tgtggctgct gggcaacacc cccctgcagg acaagggcaa     3360

cagcacccac agcagaaaca acagctatgc tgtgatcatc accagcacca gcagctacta     3420

tgtgttctac atctatgtgg gggtggctga caccctgctg gccatgggct tcttcagagg     3480

cctgcccctg gtgcacaccc tgatcacagt gagcaagatc ctgcaccaca agatgctgca     3540

ctctgtgctg caggccccca tgagcaccct gaacaccctg aaggctgggg gcatcctgaa     3600

tgagcaccct gaacaccctg aaggctgggg gcatcctgaa cagattcagc aaggacattg     3660

ccatcctgga tgacctgctg cccctgacca tctttgactt catccagctg ctgctgattg     3720

tgattggggc cattgctgtg gtggctgtgc tgcagcccta catctttgtg gccacagtgc     3780

ctgtgattgt ggccttcatc atgctgagag cctacttcct gcagaccagc cagcagctga     3840

agcagctgga gtctgagggc agaagcccca tcttcaccca cctggtgacc agcctgaagg     3900

gcctgtggac cctgagagcc cctggtgacc agcctgaagg gcctgtggac cctgagagcc     3960

tttggcagac agccctactt tgagaccctg ttccacaagg ccctgaacct gcacacagcc     4020

aactggttcc tgtacctgag caccctgaga tggttccaga tgagaattga gatgatcttt     4080

gtgatcttct tcattgctgt gaccttcatc agcatcctga ccacagggga gggggagggc     4140

agagtgggca tcatcctgac cctggccatg aacatcatga gcaccctgca gtgggctgtg     4200

aacagcagca ttgatgtgga cagcctgatg agatctgtga gcagagtgtt caagttcatt     4260

gacatgccca cagagggcaa gcccaccaag agcaccaagc cctacaagaa tggccagctg     4320

cagagggcaa gcccaccaag agcaccaagc cctacaagaa tggccagctg agcaaggtga     4380

tgatcattga gaacagccat gtgaagaagg atgacatctg gccctctggg ggccagatga     4440

cagtgaagga cctgacagcc aagtacacag aggggggcaa tgccatcctg gagaacatca     4500

gcttcagcat cagccctggc cagagagtgg gcctgctggg cagaacaggc tctggcaaga     4560

gcaccctgct gtctgccttc ctgagactgc tgaacacaga gggggagatc cagattgatg     4620

gggtgagctg ggacagcatc accctgcagc agtggagaaa ggcctttggg gtgatccccc     4680

agaaggtgtt catcttctct ggcaccttca gaaagaacct gtgatccccc agaaggtgtt     4740

catcttctct ggcaccttca gaaagaacct ggacccctat gagcagtggt ctgaccagga     4800

gatctggaag gtggctgatg aggtgggcct gagatctgtg attgagcagt tccctggcaa     4860

gctggacttt gtgctggtgg atgggggctg tgtgctgagc catggccaca agcagctgat     4920

gtgcctggcc agatctgtgc tgagcaaggc caagatcctg ctgctggatg agccctctgc     4980

ccacctggac cctgtgacct accagatcat cagaagaacc ctgaagcagg cctttgctga     5040

accagatcat cagaagaacc ctgaagcagg cctttgctga ctgcacagtg atcctgtgtg     5100

agcacagaat tgaggccatg ctggagtgcc agcagttcct ggtgattgag gagaacaagg     5160

tgagacagta tgacagcatc cagaagctgc tgaatgagag aagcctgttc agacaggcca     5220

tcagcccctc tgacagagtg aagctgttcc cccacagaaa cagcagcaag tgcaagagca     5280

agccccagat tgctgccctg aaggaggaga ccgaggagga ggtgcaggac accagactgt     5340

aaataaatat ctttattttc attacatctg tgtgttggtt ttttgtgtgg atctgaggaa     5400

cccctagtga tggagttggc cactccctct ctgcgcgctc atctgaggaa cccctagtga     5460

tggagttggc cactccctct ctgcgcgctc gctcgctcac tgaggccggg cgaccaaagg     5520

tcgcccgacg cccgggcttt gcccgggcgg cctcagtgag cgagcgagcg cgcagagagg     5580

gagtggccaa ttaattaagg cgatgaacgg taatcgtaaa actagcatgt caatcatatg     5640

taccccggtt gataatcaga aaagccccaa aaacaggaag attgtataag cattaattaa     5700

tttaaataca tggacatgtc agaattggtt aattggttgt aacactgacc cctatttgtt     5760

tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc     5820

ttcaataata ttgaaaaagg aagaatatga gccatattca acgggaaacg tcgaggccgc     5880

gattaaattc caacatggat gctgatttat atgggtataa atgggctcgc gataatgtcg     5940

ggcaatcagg tgcgacaatc tatcgcttgt atgggaagcc cgatgcgcca gagttgtttc     6000

gataatgtcg ggcaatcagg tgcgacaatc tatcgcttgt atgggaagcc cgatgcgcca     6060

gagttgtttc tgaaacatgg caaaggtagc gttgccaatg atgttacaga tgagatggtc     6120

agactaaact ggctgacgga atttatgcca cttccgacca tcaagcattt tatccgtact     6180

cctgatgatg catggttact caccactgcg atccccggaa aaacagcgtt ccaggtatta     6240

gaagaatatc ctgattcagg tgaaaatatt gttgatgcgc tggcagtgtt cctgcgccgg     6300

ttgcactcga ttcctgtttg taattgtcct tttaacagcg atcgcgtatt tcgcctcgct     6360

caggcgcaat cacgaatgaa taacggtttg gttgatgcga gtgattttga tgacgagcgt     6420

aatggctggc ctgttgaaca agtctggaaa gaaatgcata aacttttgcc attctcaccg     6480

gattcagtcg tcactcatgg tgatttctca cttgataacc ttatttttga cgaggggaaa     6540

ttaataggtt gtattgatgt tggacgagtc ggaatcgcag accgatacca ggatcttgcc     6600

atcctatgga actgcctcgg tgagttttct ccttcattac agaaacggct ttttcaaaaa     6660

tatggtattg ataatcctga tatgaataaa ttgcagtttc atttgatgct cgatgagttt     6720

ttctaaaagc agagcattac gctgacttga cgggacggcg caagctcatg accaaaatcc     6780

cttaacgtga gttacgcgcg cgtcgttcca ctgagcgtca gaccccgtag aaaagatcaa     6840

aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc     6900

accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt     6960

aactggcttc agcagagcgc agataccaaa tactgttctt ctagtgtagc cgtagttagc     7020

ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc     7080

agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt     7140

accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga     7200

ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa     7260

gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa     7320

caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg     7380

ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc     7440

tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg     7500

ctcacatgtt taaaccatg                                                  7519


<210>  92
<211>  11577
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pHELPK plasmid DNA

<400>  92
ggtacccaac tccatgctta acagtcccca ggtacagccc accctgcgtc gcaaccagga       60

acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat      120

taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctaggagaca      180

ctttcaataa aggcaaatgt ttttatttgt acactctcgg gtgattattt accccccacc      240

cttgccgtct gcgccgttta aaaatcaaag gggttctgcc gcgcatcgct atgcgccact      300

ggcagggaca cgttgcgata ctggtgttta gtgctccact taaactcagg cacaaccatc      360

cgcggcagct cggtgaagtt ttcactccac aggctgcgca ccatcaccaa cgcgtttagc      420

aggtcgggcg ccgatatctt gaagtcgcag ttggggcctc cgccctgcgc gcgcgagttg      480

cgatacacag ggttgcagca ctggaacact atcagcgccg ggtggtgcac gctggccagc      540

acgctcttgt cggagatcag atccgcgtcc aggtcctccg cgttgctcag ggcgaacgga      600

gtcaactttg gtagctgcct tcccaaaaag ggtgcatgcc caggctttga gttgcactcg      660

caccgtagtg gcatcagaag gtgaccgtgc ccggtctggg cgttaggata cagcgcctgc      720

atgaaagcct tgatctgctt aaaagccacc tgagcctttg cgccttcaga gaagaacatg      780

ccgcaagact tgccggaaaa ctgattggcc ggacaggccg cgtcatgcac gcagcacctt      840

gcgtcggtgt tggagatctg caccacattt cggccccacc ggttcttcac gatcttggcc      900

ttgctagact gctccttcag cgcgcgctgc ccgttttcgc tcgtcacatc catttcaatc      960

acgtgctcct tatttatcat aatgctcccg tgtagacact taagctcgcc ttcgatctca     1020

gcgcagcggt gcagccacaa cgcgcagccc gtgggctcgt ggtgcttgta ggttacctct     1080

gcaaacgact gcaggtacgc ctgcaggaat cgccccatca tcgtcacaaa ggtcttgttg     1140

ctggtgaagg tcagctgcaa cccgcggtgc tcctcgttta gccaggtctt gcatacggcc     1200

gccagagctt ccacttggtc aggcagtagc ttgaagtttg cctttagatc gttatccacg     1260

tggtacttgt ccatcaacgc gcgcgcagcc tccatgccct tctcccacgc agacacgatc     1320

ggcaggctca gcgggtttat caccgtgctt tcactttccg cttcactgga ctcttccttt     1380

tcctcttgcg tccgcatacc ccgcgccact gggtcgtctt cattcagccg ccgcaccgtg     1440

cgcttacctc ccttgccgtg cttgattagc accggtgggt tgctgaaacc caccatttgt     1500

agcgccacat cttctctttc ttcctcgctg tccacgatca cctctgggga tggcgggcgc     1560

tcgggcttgg gagaggggcg cttctttttc tttttggacg caatggccaa atccgccgtc     1620

gaggtcgatg gccgcgggct gggtgtgcgc ggcaccagcg catcttgtga cgagtcttct     1680

tcgtcctcgg actcgagacg ccgcctcagc cgcttttttg ggggcgcgcg gggaggcggc     1740

ggcgacggcg acggggacga cacgtcctcc atggttggtg gacgtcgcgc cgcaccgcgt     1800

ccgcgctcgg gggtggtttc gcgctgctcc tcttcccgac tggccatttc cttctcctat     1860

aggcagaaaa agatcatgga gtcagtcgag aaggaggaca gcctaaccgc cccctttgag     1920

ttcgccacca ccgcctccac cgatgccgcc aacgcgccta ccaccttccc cgtcgaggca     1980

cccccgcttg aggaggagga agtgattatc gagcaggacc caggttttgt aagcgaagac     2040

gacgaggatc gctcagtacc aacagaggat aaaaagcaag accaggacga cgcagaggca     2100

aacgaggaac aagtcgggcg gggggaccaa aggcatggcg actacctaga tgtgggagac     2160

gacgtgctgt tgaagcatct gcagcgccag tgcgccatta tctgcgacgc gttgcaagag     2220

cgcagcgatg tgcccctcgc catagcggat gtcagccttg cctacgaacg ccacctgttc     2280

tcaccgcgcg taccccccaa acgccaagaa aacggcacat gcgagcccaa cccgcgcctc     2340

aacttctacc ccgtatttgc cgtgccagag gtgcttgcca cctatcacat ctttttccaa     2400

aactgcaaga tacccctatc ctgccgtgcc aaccgcagcc gagcggacaa gcagctggcc     2460

ttgcggcagg gcgctgtcat acctgatatc gcctcgctcg acgaagtgcc aaaaatcttt     2520

gagggtcttg gacgcgacga gaaacgcgcg gcaaacgctc tgcaacaaga aaacagcgaa     2580

aatgaaagtc actgtggagt gctggtggaa cttgagggtg acaacgcgcg cctagccgtg     2640

ctgaaacgca gcatcgaggt cacccacttt gcctacccgg cacttaacct accccccaag     2700

gttatgagca cagtcatgag cgagctgatc gtgcgccgtg cacgacccct ggagagggat     2760

gcaaacttgc aagaacaaac cgaggagggc ctacccgcag ttggcgatga gcagctggcg     2820

cgctggcttg agacgcgcga gcctgccgac ttggaggagc gacgcaagct aatgatggcc     2880

gcagtgcttg ttaccgtgga gcttgagtgc atgcagcggt tctttgctga cccggagatg     2940

cagcgcaagc tagaggaaac gttgcactac acctttcgcc agggctacgt gcgccaggcc     3000

tgcaaaattt ccaacgtgga gctctgcaac ctggtctcct accttggaat tttgcacgaa     3060

aaccgcctcg ggcaaaacgt gcttcattcc acgctcaagg gcgaggcgcg ccgcgactac     3120

gtccgcgact gcgtttactt atttctgtgc tacacctggc aaacggccat gggcgtgtgg     3180

cagcaatgcc tggaggagcg caacctaaag gagctgcaga agctgctaaa gcaaaacttg     3240

aaggacctat ggacggcctt caacgagcgc tccgtggccg cgcacctggc ggacattatc     3300

ttccccgaac gcctgcttaa aaccctgcaa cagggtctgc cagacttcac cagtcaaagc     3360

atgttgcaaa actttaggaa ctttatccta gagcgttcag gaattctgcc cgccacctgc     3420

tgtgcgcttc ctagcgactt tgtgcccatt aagtaccgtg aatgccctcc gccgctttgg     3480

ggtcactgct accttctgca gctagccaac taccttgcct accactccga catcatggaa     3540

gacgtgagcg gtgacggcct actggagtgt cactgtcgct gcaacctatg caccccgcac     3600

cgctccctgg tctgcaattc gcaactgctt agcgaaagtc aaattatcgg tacctttgag     3660

ctgcagggtc cctcgcctga cgaaaagtcc gcggctccgg ggttgaaact cactccgggg     3720

ctgtggacgt cggcttacct tcgcaaattt gtacctgagg actaccacgc ccacgagatt     3780

aggttctacg aagaccaatc ccgcccgcca aatgcggagc ttaccgcctg cgtcattacc     3840

cagggccaca tccttggcca attgcaagcc atcaacaaag cccgccaaga gtttctgcta     3900

cgaaagggac ggggggttta cctggacccc cagtccggcg aggagctcaa cccaatcccc     3960

ccgccgccgc agccctatca gcagccgcgg gcccttgctt cccaggatgg cacccaaaaa     4020

gaagctgcag ctgccgccgc cgccacccac ggacgaggag gaatactggg acagtcaggc     4080

agaggaggtt ttggacgagg aggaggagat gatggaagac tgggacagcc tagacgaagc     4140

ttccgaggcc gaagaggtgt cagacgaaac accgtcaccc tcggtcgcat tcccctcgcc     4200

ggcgccccag aaattggcaa ccgttcccag catcgctaca acctccgctc ctcaggcgcc     4260

gccggcactg cctgttcgcc gacccaaccg tagatgggac accactggaa ccagggccgg     4320

taagtctaag cagccgccgc cgttagccca agagcaacaa cagcgccaag gctaccgctc     4380

gtggcgcggg cacaagaacg ccatagttgc ttgcttgcaa gactgtgggg gcaacatctc     4440

cttcgcccgc cgctttcttc tctaccatca cggcgtggcc ttcccccgta acatcctgca     4500

ttactaccgt catctctaca gcccctactg caccggcggc agcggcagcg gcagcaacag     4560

cagcggtcac acagaagcaa aggcgaccgg atagcaagac tctgacaaag cccaagaaat     4620

ccacagcggc ggcagcagca ggaggaggag cgctgcgtct ggcgcccaac gaacccgtat     4680

cgacccgcga gcttagaaat aggatttttc ccactctgta tgctatattt caacaaagca     4740

ggggccaaga acaagagctg aaaataaaaa acaggtctct gcgctccctc acccgcagct     4800

gcctgtatca caaaagcgaa gatcagcttc ggcgcacgct ggaagacgcg gaggctctct     4860

tcagcaaata ctgcgcgctg actcttaagg actagtttcg cgccctttct caaatttaag     4920

cgcgaaaact acgtcatctc cagcggccac acccggcgcc agcacctgtc gtcagcgcca     4980

ttatgagcaa ggaaattccc acgccctaca tgtggagtta ccagccacaa atgggacttg     5040

cggctggagc tgcccaagac tactcaaccc gaataaacta catgagcgcg ggaccccaca     5100

tgatatcccg ggtcaacgga atccgcgccc accgaaaccg aattctcctc gaacaggcgg     5160

ctattaccac cacacctcgt aataacctta atccccgtag ttggcccgct gccctggtgt     5220

accaggaaag tcccgctccc accactgtgg tacttcccag agacgcccag gccgaagttc     5280

agatgactaa ctcaggggcg cagcttgcgg gcggctttcg tcacagggtg cggtcgcccg     5340

ggcgttttag ggcggagtaa cttgcatgta ttgggaattg tagttttttt aaaatgggaa     5400

gtgacgtatc gtgggaaaac ggaagtgaag atttgaggaa gttgtgggtt ttttggcttt     5460

cgtttctggg cgtaggttcg cgtgcggttt tctgggtgtt ttttgtggac tttaaccgtt     5520

acgtcatttt ttagtcctat atatactcgc tctgtacttg gcccttttta cactgtgact     5580

gattgagctg gtgccgtgtc gagtggtgtt ttttaatagg tttttttact ggtaaggctg     5640

actgttatgg ctgccgctgt ggaagcgctg tatgttgttc tggagcggga gggtgctatt     5700

ttgcctaggc aggagggttt ttcaggtgtt tatgtgtttt tctctcctat taattttgtt     5760

atacctccta tgggggctgt aatgttgtct ctacgcctgc gggtatgtat tcccccgggc     5820

tatttcggtc gctttttagc actgaccgat gttaaccaac ctgatgtgtt taccgagtct     5880

tacattatga ctccggacat gaccgaggaa ctgtcggtgg tgctttttaa tcacggtgac     5940

cagttttttt acggtcacgc cggcatggcc gtagtccgtc ttatgcttat aagggttgtt     6000

tttcctgttg taagacaggc ttctaatgtt taaatgtttt tttttttgtt attttatttt     6060

gtgtttaatg caggaacccg cagacatgtt tgagagaaaa atggtgtctt tttctgtggt     6120

ggttccggaa cttacctgcc tttatctgca tgagcatgac tacgatgtgc ttgctttttt     6180

gcgcgaggct ttgcctgatt ttttgagcag caccttgcat tttatatcgc cgcccatgca     6240

acaagcttac ataggggcta cgctggttag catagctccg agtatgcgtg tcataatcag     6300

tgtgggttct tttgtcatgg ttcctggcgg ggaagtggcc gcgctggtcc gtgcagacct     6360

gcacgattat gttcagctgg ccctgcgaag ggacctacgg gatcgcggta tttttgttaa     6420

tgttccgctt ttgaatctta tacaggtctg tgaggaacct gaatttttgc aatcatgatt     6480

cgctgcttga ggctgaaggt ggagggcgct ctggagcaga tttttacaat ggccggactt     6540

aatattcggg atttgcttag agacatattg ataaggtggc gagatgaaaa ttatttgggc     6600

atggttgaag gtgctggaat gtttatagag gagattcacc ctgaagggtt tagcctttac     6660

gtccacttgg acgtgagggc agtttgcctt ttggaagcca ttgtgcaaca tcttacaaat     6720

gccattatct gttctttggc tgtagagttt gaccacgcca ccggagggga gcgcgttcac     6780

ttaatagatc ttcattttga ggttttggat aatcttttgg aataaaaaaa aaaaaacatg     6840

gttcttccag ctcttcccgc tcctcccgtg tgtgactcgc agaacgaatg tgtaggttgg     6900

ctgggtgtgg cttattctgc ggtggtggat gttatcaggg cagcggcgca tgaaggagtt     6960

tacatagaac ccgaagccag ggggcgcctg gatgctttga gagagtggat atactacaac     7020

tactacacag agcgagctaa gcgacgagac cggagacgca gatctgtttg tcacgcccgc     7080

acctggtttt gcttcaggaa atatgactac gtccggcgtt ccatttggca tgacactacg     7140

accaacacga tctcggttgt ctcggcgcac tccgtacagt agggatcgcc tacctccttt     7200

tgagacagag acccgcgcta ccatactgga ggatcatccg ctgctgcccg aatgtaacac     7260

tttgacaatg cacaacgtga gttacgtgcg aggtcttccc tgcagtgtgg gatttacgct     7320

gattcaggaa tgggttgttc cctgggatat ggttctgacg cgggaggagc ttgtaatcct     7380

gaggaagtgt atgcacgtgt gcctgtgttg tgccaacatt gatatcatga cgagcatgat     7440

gatccatggt tacgagtcct gggctctcca ctgtcattgt tccagtcccg gttccctgca     7500

gtgcatagcc ggcgggcagg ttttggccag ctggtttagg atggtggtgg atggcgccat     7560

gtttaatcag aggtttatat ggtaccggga ggtggtgaat tacaacatgc caaaagaggt     7620

aatgtttatg tccagcgtgt ttatgagggg tcgccactta atctacctgc gcttgtggta     7680

tgatggccac gtgggttctg tggtccccgc catgagcttt ggatacagcg ccttgcactg     7740

tgggattttg aacaatattg tggtgctgtg ctgcagttac tgtgctgatt taagtgagat     7800

cagggtgcgc tgctgtgccc ggaggacaag gcgtctcatg ctgcgggcgg tgcgaatcat     7860

cgctgaggag accactgcca tgttgtattc ctgcaggacg gagcggcggc ggcagcagtt     7920

tattcgcgcg ctgctgcagc accaccgccc tatcctgatg cacgattatg actctacccc     7980

catgtaggcg tggacttccc cttcgccgcc cgttgagcaa ccgcaagttg gacagcagcc     8040

tgtggctcag cagctggaca gcgacatgaa cttaagcgag ctgcccgggg agtttattaa     8100

tatcactgat gagcgtttgg ctcgacagga aaccgtgtgg aatataacac ctaagaatat     8160

gtctgttacc catgatatga tgctttttaa ggccagccgg ggagaaagga ctgtgtactc     8220

tgtgtgttgg gagggaggtg gcaggttgaa tactagggtt ctgtgagttt gattaaggta     8280

cggtgatcaa tataagctat gtggtggtgg ggctatacta ctgaatgaaa aatgacttga     8340

aattttctgc aattgaaaaa taaacacgtt gaaacataac atgcaacagg ttcacgattc     8400

tttattcctg ggcaatgtag gagaaggtgt aagagttggt agcaaaagtt tcagtggtgt     8460

attttccact ttcccaggac catgtaaaag acatagagta agtgcttacc tcgctagttt     8520

ctgtggattc actagaatcg atgtaggatg ttgcccctcc tgacgcggta ggagaagggg     8580

agggtgccct gcatgtctgc cgctgctctt gctcttgccg ctgctgagga ggggggcgca     8640

tctgccgcag caccggatgc atctgggaaa agcaaaaaag gggctcgtcc ctgtttccgg     8700

aggaatttgc aagcggggtc ttgcatgacg gggaggcaaa cccccgttcg ccgcagtccg     8760

gccggcccga gactcgaacc gggggtcctg cgactcaacc cttggaaaat aaccctccgg     8820

ctacagggag cgagccactt aatgctttcg ctttccagcc taaccgctta cgccgcgcgc     8880

ggccagtggc caaaaaagct agcgcagcag ccgccgcgcc tggaaggaag ccaaaaggag     8940

cgctcccccg ttgtctgacg tcgcacacct gggttcgaca cgcgggcggt aaccgcatgg     9000

atcacggcgg acggccggat ccggggttcg aaccccggtc gtccgccatg atacccttgc     9060

gaatttatcc accagaccac ggaagagtgc ccgcttacag gctctccttt tgcacggtct     9120

agagcgtcaa cgactgcgca cgcctcaccg gccagagcgt cccgaccatg gagcactttt     9180

tgccgctgcg caacatctgg aaccgcgtcc gcgactttcc gcgcgcctcc accaccgccg     9240

ccggcatcac ctggatgtcc aggtacatct acggattacg tcgacgttta aaccatatga     9300

tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag     9360

aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg     9420

tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg     9480

tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg     9540

cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga     9600

agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc     9660

tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt     9720

aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact     9780

ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg     9840

cctaactacg gctacactag aagaacagta tttggtatct gcgctctgct gaagccagtt     9900

accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt     9960

ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct    10020

ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg    10080

gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt    10140

aaatcaatct aaagtatata tgagtaaact tggtctgaca gttagaaaaa ctcatcgagc    10200

atcaaatgaa actgcaattt attcatatca ggattatcaa taccatattt ttgaaaaagc    10260

cgtttctgta atgaaggaga aaactcaccg aggcagttcc ataggatggc aagatcctgg    10320

tatcggtctg cgattccgac tcgtccaaca tcaatacaac ctattaattt cccctcgtca    10380

aaaataaggt tatcaagtga gaaatcacca tgagtgacga ctgaatccgg tgagaatggc    10440

aaaagtttat gcatttcttt ccagacttgt tcaacaggcc agccattacg ctcgtcatca    10500

aaatcactcg catcaaccaa accgttattc attcgtgatt gcgcctgagc gagacgaaat    10560

acgcgatcgc tgttaaaagg acaattacaa acaggaatcg aatgcaaccg gcgcaggaac    10620

actgccagcg catcaacaat attttcacct gaatcaggat attcttctaa tacctggaat    10680

gctgttttcc cagggatcgc agtggtgagt aaccatgcat catcaggagt acggataaaa    10740

tgcttgatgg tcggaagagg cataaattcc gtcagccagt ttagtctgac catctcatct    10800

gtaacatcat tggcaacgct acctttgcca tgtttcagaa acaactctgg cgcatcgggc    10860

ttcccataca atcgatagat tgtcgcacct gattgcccga cattatcgcg agcccattta    10920

tacccatata aatcagcatc catgttggaa tttaatcgcg gcctagagca agacgtttcc    10980

cgttgaatat ggctcatact cttccttttt caatattatt gaagcattta tcagggttat    11040

tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg    11100

cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat catgacatta    11160

acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg tgatgacggt    11220

gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc    11280

gggagcagac aacaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg    11340

tgaaccatca ccctaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa    11400

ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa    11460

ggaagggaag aaagcgaaag gagcgggcgc tagggcgctg gcaagtgtag cggtcacgct    11520

gcgcgtaacc accacacccg ccgcgcttaa tgcgccgcta cagggcgcga tggatcc       11577


<210>  93
<211>  4443
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTR

<400>  93
atgcagagaa gccccctgga gaaggcctct gtggtgagca agctgttctt cagctggacc       60

agacccatcc tgagaaaggg ctacagacag agactggagc tgtctgacat ctaccagatc      120

ccctctgtgg actctgctga caacctgtct gagaagctgg agagagagtg ggacagagag      180

ctggccagca agaagaaccc caagctgatc aatgccctga gaagatgctt cttctggaga      240

ttcatgttct atggcatctt cctgtacctg ggggaggtga ccaaggctgt gcagcccctg      300

ctgctgggca gaatcattgc cagctatgac cctgacaaca aggaggagag aagcattgcc      360

atctacctgg gcattggcct gtgcctgctg ttcattgtga gaaccctgct gctgcaccct      420

gccatctttg gcctgcacca cattggcatg cagatgagaa ttgccatgtt cagcctgatc      480

tacaagaaga ccctgaagct gagcagcaga gtgctggaca agatcagcat tggccagctg      540

gtgagcctgc tgagcaacaa cctgaacaag tttgatgagg gcctggccct ggcccacttt      600

gtgtggattg cccccctgca ggtggccctg ctgatgggcc tgatctggga gctgctgcag      660

gcctctgcct tctgtggcct gggcttcctg attgtgctgg ccctgttcca ggctggcctg      720

ggcagaatga tgatgaagta cagagaccag agagctggca agatctctga gagactggtg      780

atcacctctg agatgattga gaacatccag tctgtgaagg cctactgctg ggaggaggcc      840

atggagaaga tgattgagaa cctgagacag acagagctga agctgaccag aaaggctgcc      900

tatgtgagat acttcaacag ctctgccttc ttcttctctg gcttctttgt ggtgttcctg      960

tctgtgctgc cctatgccct gatcaagggc atcatcctga gaaagatctt caccaccatc     1020

agcttctgca ttgtgctgag aatggctgtg accagacagt tcccctgggc tgtgcagacc     1080

tggtatgaca gcctgggggc catcaacaag atccaggact tcctgcagaa gcaggagtac     1140

aagaccctgg agtacaacct gaccaccaca gaggtggtga tggagaatgt gacagccttc     1200

tgggaggagg gctttgggga gctgtttgag aaggccaagc agaacaacaa caacagaaag     1260

accagcaatg gggatgacag cctgttcttc agcaacttca gcctgctggg cacccctgtg     1320

ctgaaggaca tcaacttcaa gattgagaga ggccagctgc tggctgtggc tggcagcaca     1380

ggggctggca agaccagcct gctgatgatg atcatggggg agctggagcc ctctgagggc     1440

aagatcaagc actctggcag aatcagcttc tgcagccagt tcagctggat catgcctggc     1500

accatcaagg agaacatcat ctttggggtg agctatgatg agtacagata cagatctgtg     1560

atcaaggcct gccagctgga ggaggacatc agcaagtttg ctgagaagga caacattgtg     1620

ctgggggagg ggggcatcac cctgtctggg ggccagagag ccagaatcag cctggccaga     1680

gctgtgtaca aggatgctga cctgtacctg ctggacagcc cctttggcta cctggatgtg     1740

ctgacagaga aggagatctt tgagagctgt gtgtgcaagc tgatggccaa caagaccaga     1800

atcctggtga ccagcaagat ggagcacctg aagaaggctg acaagatcct gatcctgcat     1860

gagggcagca gctacttcta tggcaccttc tctgagctgc agaacctgca gcctgacttc     1920

agcagcaagc tgatgggctg tgacagcttt gaccagttct ctgctgagag aagaaacagc     1980

atcctgacag agaccctgca cagattcagc ctggaggggg atgcccctgt gagctggaca     2040

gagaccaaga agcagagctt caagcagaca ggggagtttg gggagaagag aaagaacagc     2100

atcctgaacc ccatcaacag catcagaaag ttcagcattg tgcagaagac ccccctgcag     2160

atgaatggca ttgaggagga ctctgatgag cccctggaga gaagactgag cctggtgcct     2220

gactctgagc agggggaggc catcctgccc agaatctctg tgatcagcac aggccccacc     2280

ctgcaggcca gaagaagaca gtctgtgctg aacctgatga cccactctgt gaaccagggc     2340

cagaacatcc acagaaagac cacagccagc accagaaagg tgagcctggc cccccaggcc     2400

aacctgacag agctggacat ctacagcaga agactgagcc aggagacagg cctggagatc     2460

tctgaggaga tcaatgagga ggacctgaag gagtgcttct ttgatgacat ggagagcatc     2520

cctgctgtga ccacctggaa cacctacctg agatacatca cagtgcacaa gagcctgatc     2580

tttgtgctga tctggtgcct ggtgatcttc ctggctgagg tggctgccag cctggtggtg     2640

ctgtggctgc tgggcaacac ccccctgcag gacaagggca acagcaccca cagcagaaac     2700

aacagctatg ctgtgatcat caccagcacc agcagctact atgtgttcta catctatgtg     2760

ggggtggctg acaccctgct ggccatgggc ttcttcagag gcctgcccct ggtgcacacc     2820

ctgatcacag tgagcaagat cctgcaccac aagatgctgc actctgtgct gcaggccccc     2880

atgagcaccc tgaacaccct gaaggctggg ggcatcctga acagattcag caaggacatt     2940

gccatcctgg atgacctgct gcccctgacc atctttgact tcatccagct gctgctgatt     3000

gtgattgggg ccattgctgt ggtggctgtg ctgcagccct acatctttgt ggccacagtg     3060

cctgtgattg tggccttcat catgctgaga gcctacttcc tgcagaccag ccagcagctg     3120

aagcagctgg agtctgaggg cagaagcccc atcttcaccc acctggtgac cagcctgaag     3180

ggcctgtgga ccctgagagc ctttggcaga cagccctact ttgagaccct gttccacaag     3240

gccctgaacc tgcacacagc caactggttc ctgtacctga gcaccctgag atggttccag     3300

atgagaattg agatgatctt tgtgatcttc ttcattgctg tgaccttcat cagcatcctg     3360

accacagggg agggggaggg cagagtgggc atcatcctga ccctggccat gaacatcatg     3420

agcaccctgc agtgggctgt gaacagcagc attgatgtgg acagcctgat gagatctgtg     3480

agcagagtgt tcaagttcat tgacatgccc acagagggca agcccaccaa gagcaccaag     3540

ccctacaaga atggccagct gagcaaggtg atgatcattg agaacagcca tgtgaagaag     3600

gatgacatct ggccctctgg gggccagatg acagtgaagg acctgacagc caagtacaca     3660

gaggggggca atgccatcct ggagaacatc agcttcagca tcagccctgg ccagagagtg     3720

ggcctgctgg gcagaacagg ctctggcaag agcaccctgc tgtctgcctt cctgagactg     3780

ctgaacacag agggggagat ccagattgat ggggtgagct gggacagcat caccctgcag     3840

cagtggagaa aggcctttgg ggtgatcccc cagaaggtgt tcatcttctc tggcaccttc     3900

agaaagaacc tggaccccta tgagcagtgg tctgaccagg agatctggaa ggtggctgat     3960

gaggtgggcc tgagatctgt gattgagcag ttccctggca agctggactt tgtgctggtg     4020

gatgggggct gtgtgctgag ccatggccac aagcagctga tgtgcctggc cagatctgtg     4080

ctgagcaagg ccaagatcct gctgctggat gagccctctg cccacctgga ccctgtgacc     4140

taccagatca tcagaagaac cctgaagcag gcctttgctg actgcacagt gatcctgtgt     4200

gagcacagaa ttgaggccat gctggagtgc cagcagttcc tggtgattga ggagaacaag     4260

gtgagacagt atgacagcat ccagaagctg ctgaatgaga gaagcctgtt cagacaggcc     4320

atcagcccct ctgacagagt gaagctgttc ccccacagaa acagcagcaa gtgcaagagc     4380

aagccccaga ttgctgccct gaaggaggag accgaggagg aggtgcagga caccagactg     4440

taa                                                                   4443


<210>  94
<211>  1480
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  CFTR protein

<400>  94

Met Gln Arg Ser Pro Leu Glu Lys Ala Ser Val Val Ser Lys Leu Phe 
1               5                   10                  15      


Phe Ser Trp Thr Arg Pro Ile Leu Arg Lys Gly Tyr Arg Gln Arg Leu 
            20                  25                  30          


Glu Leu Ser Asp Ile Tyr Gln Ile Pro Ser Val Asp Ser Ala Asp Asn 
        35                  40                  45              


Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys 
    50                  55                  60                  


Lys Asn Pro Lys Leu Ile Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg 
65                  70                  75                  80  


Phe Met Phe Tyr Gly Ile Phe Leu Tyr Leu Gly Glu Val Thr Lys Ala 
                85                  90                  95      


Val Gln Pro Leu Leu Leu Gly Arg Ile Ile Ala Ser Tyr Asp Pro Asp 
            100                 105                 110         


Asn Lys Glu Glu Arg Ser Ile Ala Ile Tyr Leu Gly Ile Gly Leu Cys 
        115                 120                 125             


Leu Leu Phe Ile Val Arg Thr Leu Leu Leu His Pro Ala Ile Phe Gly 
    130                 135                 140                 


Leu His His Ile Gly Met Gln Met Arg Ile Ala Met Phe Ser Leu Ile 
145                 150                 155                 160 


Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys Ile Ser 
                165                 170                 175     


Ile Gly Gln Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp 
            180                 185                 190         


Glu Gly Leu Ala Leu Ala His Phe Val Trp Ile Ala Pro Leu Gln Val 
        195                 200                 205             


Ala Leu Leu Met Gly Leu Ile Trp Glu Leu Leu Gln Ala Ser Ala Phe 
    210                 215                 220                 


Cys Gly Leu Gly Phe Leu Ile Val Leu Ala Leu Phe Gln Ala Gly Leu 
225                 230                 235                 240 


Gly Arg Met Met Met Lys Tyr Arg Asp Gln Arg Ala Gly Lys Ile Ser 
                245                 250                 255     


Glu Arg Leu Val Ile Thr Ser Glu Met Ile Glu Asn Ile Gln Ser Val 
            260                 265                 270         


Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met Ile Glu Asn Leu 
        275                 280                 285             


Arg Gln Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr 
    290                 295                 300                 


Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu 
305                 310                 315                 320 


Ser Val Leu Pro Tyr Ala Leu Ile Lys Gly Ile Ile Leu Arg Lys Ile 
                325                 330                 335     


Phe Thr Thr Ile Ser Phe Cys Ile Val Leu Arg Met Ala Val Thr Arg 
            340                 345                 350         


Gln Phe Pro Trp Ala Val Gln Thr Trp Tyr Asp Ser Leu Gly Ala Ile 
        355                 360                 365             


Asn Lys Ile Gln Asp Phe Leu Gln Lys Gln Glu Tyr Lys Thr Leu Glu 
    370                 375                 380                 


Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe 
385                 390                 395                 400 


Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gln Asn Asn 
                405                 410                 415     


Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn 
            420                 425                 430         


Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp Ile Asn Phe Lys Ile 
        435                 440                 445             


Glu Arg Gly Gln Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys 
    450                 455                 460                 


Thr Ser Leu Leu Met Met Ile Met Gly Glu Leu Glu Pro Ser Glu Gly 
465                 470                 475                 480 


Lys Ile Lys His Ser Gly Arg Ile Ser Phe Cys Ser Gln Phe Ser Trp 
                485                 490                 495     


Ile Met Pro Gly Thr Ile Lys Glu Asn Ile Ile Phe Gly Val Ser Tyr 
            500                 505                 510         


Asp Glu Tyr Arg Tyr Arg Ser Val Ile Lys Ala Cys Gln Leu Glu Glu 
        515                 520                 525             


Asp Ile Ser Lys Phe Ala Glu Lys Asp Asn Ile Val Leu Gly Glu Gly 
    530                 535                 540                 


Gly Ile Thr Leu Ser Gly Gly Gln Arg Ala Arg Ile Ser Leu Ala Arg 
545                 550                 555                 560 


Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly 
                565                 570                 575     


Tyr Leu Asp Val Leu Thr Glu Lys Glu Ile Phe Glu Ser Cys Val Cys 
            580                 585                 590         


Lys Leu Met Ala Asn Lys Thr Arg Ile Leu Val Thr Ser Lys Met Glu 
        595                 600                 605             


His Leu Lys Lys Ala Asp Lys Ile Leu Ile Leu His Glu Gly Ser Ser 
    610                 615                 620                 


Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gln Asn Leu Gln Pro Asp Phe 
625                 630                 635                 640 


Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gln Phe Ser Ala Glu 
                645                 650                 655     


Arg Arg Asn Ser Ile Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu 
            660                 665                 670         


Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gln Ser Phe Lys 
        675                 680                 685             


Gln Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser Ile Leu Asn Pro 
    690                 695                 700                 


Ile Asn Ser Ile Arg Lys Phe Ser Ile Val Gln Lys Thr Pro Leu Gln 
705                 710                 715                 720 


Met Asn Gly Ile Glu Glu Asp Ser Asp Glu Pro Leu Glu Arg Arg Leu 
                725                 730                 735     


Ser Leu Val Pro Asp Ser Glu Gln Gly Glu Ala Ile Leu Pro Arg Ile 
            740                 745                 750         


Ser Val Ile Ser Thr Gly Pro Thr Leu Gln Ala Arg Arg Arg Gln Ser 
        755                 760                 765             


Val Leu Asn Leu Met Thr His Ser Val Asn Gln Gly Gln Asn Ile His 
    770                 775                 780                 


Arg Lys Thr Thr Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gln Ala 
785                 790                 795                 800 


Asn Leu Thr Glu Leu Asp Ile Tyr Ser Arg Arg Leu Ser Gln Glu Thr 
                805                 810                 815     


Gly Leu Glu Ile Ser Glu Glu Ile Asn Glu Glu Asp Leu Lys Glu Cys 
            820                 825                 830         


Phe Phe Asp Asp Met Glu Ser Ile Pro Ala Val Thr Thr Trp Asn Thr 
        835                 840                 845             


Tyr Leu Arg Tyr Ile Thr Val His Lys Ser Leu Ile Phe Val Leu Ile 
    850                 855                 860                 


Trp Cys Leu Val Ile Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val 
865                 870                 875                 880 


Leu Trp Leu Leu Gly Asn Thr Pro Leu Gln Asp Lys Gly Asn Ser Thr 
                885                 890                 895     


His Ser Arg Asn Asn Ser Tyr Ala Val Ile Ile Thr Ser Thr Ser Ser 
            900                 905                 910         


Tyr Tyr Val Phe Tyr Ile Tyr Val Gly Val Ala Asp Thr Leu Leu Ala 
        915                 920                 925             


Met Gly Phe Phe Arg Gly Leu Pro Leu Val His Thr Leu Ile Thr Val 
    930                 935                 940                 


Ser Lys Ile Leu His His Lys Met Leu His Ser Val Leu Gln Ala Pro 
945                 950                 955                 960 


Met Ser Thr Leu Asn Thr Leu Lys Ala Gly Gly Ile Leu Asn Arg Phe 
                965                 970                 975     


Ser Lys Asp Ile Ala Ile Leu Asp Asp Leu Leu Pro Leu Thr Ile Phe 
            980                 985                 990         


Asp Phe Ile Gln Leu Leu Leu Ile  Val Ile Gly Ala Ile  Ala Val Val 
        995                 1000                 1005             


Ala Val  Leu Gln Pro Tyr Ile  Phe Val Ala Thr Val  Pro Val Ile 
    1010                 1015                 1020             


Val Ala  Phe Ile Met Leu Arg  Ala Tyr Phe Leu Gln  Thr Ser Gln 
    1025                 1030                 1035             


Gln Leu  Lys Gln Leu Glu Ser  Glu Gly Arg Ser Pro  Ile Phe Thr 
    1040                 1045                 1050             


His Leu  Val Thr Ser Leu Lys  Gly Leu Trp Thr Leu  Arg Ala Phe 
    1055                 1060                 1065             


Gly Arg  Gln Pro Tyr Phe Glu  Thr Leu Phe His Lys  Ala Leu Asn 
    1070                 1075                 1080             


Leu His  Thr Ala Asn Trp Phe  Leu Tyr Leu Ser Thr  Leu Arg Trp 
    1085                 1090                 1095             


Phe Gln  Met Arg Ile Glu Met  Ile Phe Val Ile Phe  Phe Ile Ala 
    1100                 1105                 1110             


Val Thr  Phe Ile Ser Ile Leu  Thr Thr Gly Glu Gly  Glu Gly Arg 
    1115                 1120                 1125             


Val Gly  Ile Ile Leu Thr Leu  Ala Met Asn Ile Met  Ser Thr Leu 
    1130                 1135                 1140             


Gln Trp  Ala Val Asn Ser Ser  Ile Asp Val Asp Ser  Leu Met Arg 
    1145                 1150                 1155             


Ser Val  Ser Arg Val Phe Lys  Phe Ile Asp Met Pro  Thr Glu Gly 
    1160                 1165                 1170             


Lys Pro  Thr Lys Ser Thr Lys  Pro Tyr Lys Asn Gly  Gln Leu Ser 
    1175                 1180                 1185             


Lys Val  Met Ile Ile Glu Asn  Ser His Val Lys Lys  Asp Asp Ile 
    1190                 1195                 1200             


Trp Pro  Ser Gly Gly Gln Met  Thr Val Lys Asp Leu  Thr Ala Lys 
    1205                 1210                 1215             


Tyr Thr  Glu Gly Gly Asn Ala  Ile Leu Glu Asn Ile  Ser Phe Ser 
    1220                 1225                 1230             


Ile Ser  Pro Gly Gln Arg Val  Gly Leu Leu Gly Arg  Thr Gly Ser 
    1235                 1240                 1245             


Gly Lys  Ser Thr Leu Leu Ser  Ala Phe Leu Arg Leu  Leu Asn Thr 
    1250                 1255                 1260             


Glu Gly  Glu Ile Gln Ile Asp  Gly Val Ser Trp Asp  Ser Ile Thr 
    1265                 1270                 1275             


Leu Gln  Gln Trp Arg Lys Ala  Phe Gly Val Ile Pro  Gln Lys Val 
    1280                 1285                 1290             


Phe Ile  Phe Ser Gly Thr Phe  Arg Lys Asn Leu Asp  Pro Tyr Glu 
    1295                 1300                 1305             


Gln Trp  Ser Asp Gln Glu Ile  Trp Lys Val Ala Asp  Glu Val Gly 
    1310                 1315                 1320             


Leu Arg  Ser Val Ile Glu Gln  Phe Pro Gly Lys Leu  Asp Phe Val 
    1325                 1330                 1335             


Leu Val  Asp Gly Gly Cys Val  Leu Ser His Gly His  Lys Gln Leu 
    1340                 1345                 1350             


Met Cys  Leu Ala Arg Ser Val  Leu Ser Lys Ala Lys  Ile Leu Leu 
    1355                 1360                 1365             


Leu Asp  Glu Pro Ser Ala His  Leu Asp Pro Val Thr  Tyr Gln Ile 
    1370                 1375                 1380             


Ile Arg  Arg Thr Leu Lys Gln  Ala Phe Ala Asp Cys  Thr Val Ile 
    1385                 1390                 1395             


Leu Cys  Glu His Arg Ile Glu  Ala Met Leu Glu Cys  Gln Gln Phe 
    1400                 1405                 1410             


Leu Val  Ile Glu Glu Asn Lys  Val Arg Gln Tyr Asp  Ser Ile Gln 
    1415                 1420                 1425             


Lys Leu  Leu Asn Glu Arg Ser  Leu Phe Arg Gln Ala  Ile Ser Pro 
    1430                 1435                 1440             


Ser Asp  Arg Val Lys Leu Phe  Pro His Arg Asn Ser  Ser Lys Cys 
    1445                 1450                 1455             


Lys Ser  Lys Pro Gln Ile Ala  Ala Leu Lys Glu Glu  Thr Glu Glu 
    1460                 1465                 1470             


Glu Val  Gln Asp Thr Arg Leu  
    1475                 1480 


<210>  95
<211>  1428
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  CFTRdeltaR protein

<400>  95

Met Gln Arg Ser Pro Leu Glu Lys Ala Ser Val Val Ser Lys Leu Phe 
1               5                   10                  15      


Phe Ser Trp Thr Arg Pro Ile Leu Arg Lys Gly Tyr Arg Gln Arg Leu 
            20                  25                  30          


Glu Leu Ser Asp Ile Tyr Gln Ile Pro Ser Val Asp Ser Ala Asp Asn 
        35                  40                  45              


Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys 
    50                  55                  60                  


Lys Asn Pro Lys Leu Ile Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg 
65                  70                  75                  80  


Phe Met Phe Tyr Gly Ile Phe Leu Tyr Leu Gly Glu Val Thr Lys Ala 
                85                  90                  95      


Val Gln Pro Leu Leu Leu Gly Arg Ile Ile Ala Ser Tyr Asp Pro Asp 
            100                 105                 110         


Asn Lys Glu Glu Arg Ser Ile Ala Ile Tyr Leu Gly Ile Gly Leu Cys 
        115                 120                 125             


Leu Leu Phe Ile Val Arg Thr Leu Leu Leu His Pro Ala Ile Phe Gly 
    130                 135                 140                 


Leu His His Ile Gly Met Gln Met Arg Ile Ala Met Phe Ser Leu Ile 
145                 150                 155                 160 


Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys Ile Ser 
                165                 170                 175     


Ile Gly Gln Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp 
            180                 185                 190         


Glu Gly Leu Ala Leu Ala His Phe Val Trp Ile Ala Pro Leu Gln Val 
        195                 200                 205             


Ala Leu Leu Met Gly Leu Ile Trp Glu Leu Leu Gln Ala Ser Ala Phe 
    210                 215                 220                 


Cys Gly Leu Gly Phe Leu Ile Val Leu Ala Leu Phe Gln Ala Gly Leu 
225                 230                 235                 240 


Gly Arg Met Met Met Lys Tyr Arg Asp Gln Arg Ala Gly Lys Ile Ser 
                245                 250                 255     


Glu Arg Leu Val Ile Thr Ser Glu Met Ile Glu Asn Ile Gln Ser Val 
            260                 265                 270         


Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met Ile Glu Asn Leu 
        275                 280                 285             


Arg Gln Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr 
    290                 295                 300                 


Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu 
305                 310                 315                 320 


Ser Val Leu Pro Tyr Ala Leu Ile Lys Gly Ile Ile Leu Arg Lys Ile 
                325                 330                 335     


Phe Thr Thr Ile Ser Phe Cys Ile Val Leu Arg Met Ala Val Thr Arg 
            340                 345                 350         


Gln Phe Pro Trp Ala Val Gln Thr Trp Tyr Asp Ser Leu Gly Ala Ile 
        355                 360                 365             


Asn Lys Ile Gln Asp Phe Leu Gln Lys Gln Glu Tyr Lys Thr Leu Glu 
    370                 375                 380                 


Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe 
385                 390                 395                 400 


Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gln Asn Asn 
                405                 410                 415     


Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn 
            420                 425                 430         


Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp Ile Asn Phe Lys Ile 
        435                 440                 445             


Glu Arg Gly Gln Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys 
    450                 455                 460                 


Thr Ser Leu Leu Met Met Ile Met Gly Glu Leu Glu Pro Ser Glu Gly 
465                 470                 475                 480 


Lys Ile Lys His Ser Gly Arg Ile Ser Phe Cys Ser Gln Phe Ser Trp 
                485                 490                 495     


Ile Met Pro Gly Thr Ile Lys Glu Asn Ile Ile Phe Gly Val Ser Tyr 
            500                 505                 510         


Asp Glu Tyr Arg Tyr Arg Ser Val Ile Lys Ala Cys Gln Leu Glu Glu 
        515                 520                 525             


Asp Ile Ser Lys Phe Ala Glu Lys Asp Asn Ile Val Leu Gly Glu Gly 
    530                 535                 540                 


Gly Ile Thr Leu Ser Gly Gly Gln Arg Ala Arg Ile Ser Leu Ala Arg 
545                 550                 555                 560 


Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly 
                565                 570                 575     


Tyr Leu Asp Val Leu Thr Glu Lys Glu Ile Phe Glu Ser Cys Val Cys 
            580                 585                 590         


Lys Leu Met Ala Asn Lys Thr Arg Ile Leu Val Thr Ser Lys Met Glu 
        595                 600                 605             


His Leu Lys Lys Ala Asp Lys Ile Leu Ile Leu His Glu Gly Ser Ser 
    610                 615                 620                 


Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gln Asn Leu Gln Pro Asp Phe 
625                 630                 635                 640 


Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gln Phe Ser Ala Glu 
                645                 650                 655     


Arg Arg Asn Ser Ile Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu 
            660                 665                 670         


Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gln Ser Phe Lys 
        675                 680                 685             


Gln Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser Ile Leu Asn Pro 
    690                 695                 700                 


Ile Asn Ser Thr Leu Gln Ala Arg Arg Arg Gln Ser Val Leu Asn Leu 
705                 710                 715                 720 


Met Thr His Ser Val Asn Gln Gly Gln Asn Ile His Arg Lys Thr Thr 
                725                 730                 735     


Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gln Ala Asn Leu Thr Glu 
            740                 745                 750         


Leu Asp Ile Tyr Ser Arg Arg Leu Ser Gln Glu Thr Gly Leu Glu Ile 
        755                 760                 765             


Ser Glu Glu Ile Asn Glu Glu Asp Leu Lys Glu Cys Phe Phe Asp Asp 
    770                 775                 780                 


Met Glu Ser Ile Pro Ala Val Thr Thr Trp Asn Thr Tyr Leu Arg Tyr 
785                 790                 795                 800 


Ile Thr Val His Lys Ser Leu Ile Phe Val Leu Ile Trp Cys Leu Val 
                805                 810                 815     


Ile Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val Leu Trp Leu Leu 
            820                 825                 830         


Gly Asn Thr Pro Leu Gln Asp Lys Gly Asn Ser Thr His Ser Arg Asn 
        835                 840                 845             


Asn Ser Tyr Ala Val Ile Ile Thr Ser Thr Ser Ser Tyr Tyr Val Phe 
    850                 855                 860                 


Tyr Ile Tyr Val Gly Val Ala Asp Thr Leu Leu Ala Met Gly Phe Phe 
865                 870                 875                 880 


Arg Gly Leu Pro Leu Val His Thr Leu Ile Thr Val Ser Lys Ile Leu 
                885                 890                 895     


His His Lys Met Leu His Ser Val Leu Gln Ala Pro Met Ser Thr Leu 
            900                 905                 910         


Asn Thr Leu Lys Ala Gly Gly Ile Leu Asn Arg Phe Ser Lys Asp Ile 
        915                 920                 925             


Ala Ile Leu Asp Asp Leu Leu Pro Leu Thr Ile Phe Asp Phe Ile Gln 
    930                 935                 940                 


Leu Leu Leu Ile Val Ile Gly Ala Ile Ala Val Val Ala Val Leu Gln 
945                 950                 955                 960 


Pro Tyr Ile Phe Val Ala Thr Val Pro Val Ile Val Ala Phe Ile Met 
                965                 970                 975     


Leu Arg Ala Tyr Phe Leu Gln Thr Ser Gln Gln Leu Lys Gln Leu Glu 
            980                 985                 990         


Ser Glu Gly Arg Ser Pro Ile Phe  Thr His Leu Val Thr  Ser Leu Lys 
        995                 1000                 1005             


Gly Leu  Trp Thr Leu Arg Ala  Phe Gly Arg Gln Pro  Tyr Phe Glu 
    1010                 1015                 1020             


Thr Leu  Phe His Lys Ala Leu  Asn Leu His Thr Ala  Asn Trp Phe 
    1025                 1030                 1035             


Leu Tyr  Leu Ser Thr Leu Arg  Trp Phe Gln Met Arg  Ile Glu Met 
    1040                 1045                 1050             


Ile Phe  Val Ile Phe Phe Ile  Ala Val Thr Phe Ile  Ser Ile Leu 
    1055                 1060                 1065             


Thr Thr  Gly Glu Gly Glu Gly  Arg Val Gly Ile Ile  Leu Thr Leu 
    1070                 1075                 1080             


Ala Met  Asn Ile Met Ser Thr  Leu Gln Trp Ala Val  Asn Ser Ser 
    1085                 1090                 1095             


Ile Asp  Val Asp Ser Leu Met  Arg Ser Val Ser Arg  Val Phe Lys 
    1100                 1105                 1110             


Phe Ile  Asp Met Pro Thr Glu  Gly Lys Pro Thr Lys  Ser Thr Lys 
    1115                 1120                 1125             


Pro Tyr  Lys Asn Gly Gln Leu  Ser Lys Val Met Ile  Ile Glu Asn 
    1130                 1135                 1140             


Ser His  Val Lys Lys Asp Asp  Ile Trp Pro Ser Gly  Gly Gln Met 
    1145                 1150                 1155             


Thr Val  Lys Asp Leu Thr Ala  Lys Tyr Thr Glu Gly  Gly Asn Ala 
    1160                 1165                 1170             


Ile Leu  Glu Asn Ile Ser Phe  Ser Ile Ser Pro Gly  Gln Arg Val 
    1175                 1180                 1185             


Gly Leu  Leu Gly Arg Thr Gly  Ser Gly Lys Ser Thr  Leu Leu Ser 
    1190                 1195                 1200             


Ala Phe  Leu Arg Leu Leu Asn  Thr Glu Gly Glu Ile  Gln Ile Asp 
    1205                 1210                 1215             


Gly Val  Ser Trp Asp Ser Ile  Thr Leu Gln Gln Trp  Arg Lys Ala 
    1220                 1225                 1230             


Phe Gly  Val Ile Pro Gln Lys  Val Phe Ile Phe Ser  Gly Thr Phe 
    1235                 1240                 1245             


Arg Lys  Asn Leu Asp Pro Tyr  Glu Gln Trp Ser Asp  Gln Glu Ile 
    1250                 1255                 1260             


Trp Lys  Val Ala Asp Glu Val  Gly Leu Arg Ser Val  Ile Glu Gln 
    1265                 1270                 1275             


Phe Pro  Gly Lys Leu Asp Phe  Val Leu Val Asp Gly  Gly Cys Val 
    1280                 1285                 1290             


Leu Ser  His Gly His Lys Gln  Leu Met Cys Leu Ala  Arg Ser Val 
    1295                 1300                 1305             


Leu Ser  Lys Ala Lys Ile Leu  Leu Leu Asp Glu Pro  Ser Ala His 
    1310                 1315                 1320             


Leu Asp  Pro Val Thr Tyr Gln  Ile Ile Arg Arg Thr  Leu Lys Gln 
    1325                 1330                 1335             


Ala Phe  Ala Asp Cys Thr Val  Ile Leu Cys Glu His  Arg Ile Glu 
    1340                 1345                 1350             


Ala Met  Leu Glu Cys Gln Gln  Phe Leu Val Ile Glu  Glu Asn Lys 
    1355                 1360                 1365             


Val Arg  Gln Tyr Asp Ser Ile  Gln Lys Leu Leu Asn  Glu Arg Ser 
    1370                 1375                 1380             


Leu Phe  Arg Gln Ala Ile Ser  Pro Ser Asp Arg Val  Lys Leu Phe 
    1385                 1390                 1395             


Pro His  Arg Asn Ser Ser Lys  Cys Lys Ser Lys Pro  Gln Ile Ala 
    1400                 1405                 1410             


Ala Leu  Lys Glu Glu Thr Glu  Glu Glu Val Gln Asp  Thr Arg Leu 
    1415                 1420                 1425             


<210>  96
<211>  250
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Mouse U1a promoter sequence

<400>  96
atggaggcgg tactatgtag atgagaattc aggagcaaac tgggaaaagc aactgcttcc       60

aaatatttgt gatttttaca gtgtagtttt ggaaaaactc ttagcctacc aattcttcta      120

agtgttttaa aatgtgggag ccagtacaca tgaagttata gagtgtttta atgaggctta      180

aatatttacc gtaactatga aatgctacgc atatcatgct gttcaggctc cgtggccacg      240

caactcatac                                                             250


<210>  97
<211>  101
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polymerase III H1 mutant promoter sequence

<400>  97
aatatttgca tgtcgctatg tgttctggga aatcaccata aacgtgaaat gtctttggat       60

ttgggaatct tcgaagttct gtatgagacc acagatctcc a                          101


<210>  98
<211>  2214
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV110 DNA

<400>  98
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagtcc ccgacccaca acctctcgga gaacctccag caacccccgc tgctgtggga      600

cctactacaa tggcttcagg cggtggcgca ccaatggcag acaataacga aggcgccgac      660

ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgaac atgggccttg cccacctata acaaccacct ctacaagcaa      780

atctccagtg cttcaacggg ggccagcaac gacaaccact acttcggcta cagcaccccc      840

tgggggtatt ttgatttcaa cagattccac tgccatttct caccacgtga ctggcagcga      900

ctcatcaaca acaattgggg attccggccc aagagactca acttcaagct cttcaacatc      960

caagtcaagg aggtcacgac gaatgatggc gtcacgacca tcgctaataa ccttaccagc     1020

acggttcaag tcttctcgga ctcggagtac cagttgccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcagta cggctaccta     1140

acgctcaaca atggcagcca ggcagtggga cggtcatcct tttactgcct ggaatatttc     1200

ccatcgcaga tgctgagaac gggcaataac tttaccttca gctacacctt cgaggacgtg     1260

cctttccaca gcagctacgc gcacagccag agcctggacc ggctgatgaa tcctctcatc     1320

gaccagtacc tgtattacct gaacagaact cagaatcagt ccggaagtgc ccaaaacaag     1380

gacttgctgt ttagccgggg gtctccagct ggcatgtctg ttcagcccaa aaactggcta     1440

cctggaccct gttaccggca gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc     1500

aactttacct ggactggtgc ttcaaaatat aaccttaatg ggcgtgaatc tataatcaac     1560

cctggcactg ctatggcctc acacaaagac gacaaagaca agttctttcc catgagcggt     1620

gtcatgattt ttggaaagga gagcgccgga gcttcaaaca ctgcattgga caatgtcatg     1680

atcacagacg aagaggaaat caaagccact aaccccgtgg ccaccgaaag atttgggact     1740

gtggcagtca atctccagag cagcagcaca gaccctgcga ccggagatgt gcatgttatg     1800

ggagccttac ctggaatggt gtggcaagac agagacgtat acctgcaggg tcctatttgg     1860

gccaaaattc ctcacacgga tggacacttt cacccgtctc ctctcatggg cggctttgga     1920

cttaagcacc cgcctcctca gatcctcatc aaaaacacgc ctgttcctgc gaatcctccg     1980

gcagagtttt cggctacaaa gtttgcttca ttcatcaccc agtattccac aggacaagtg     2040

agcgtggaga ttgaatggga gctgcagaaa gaaaacagca aacgctggaa tcccgaagtg     2100

cagtatacat ctaactatgc aaaatctgcc aacgttgatt tcactgtgga caacaatgga     2160

ctttatactg agcctcgccc cattggcacc cgttacctca cccgtcccct gtaa           2214


<210>  99
<211>  1509
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(1509)
<223>  Sulfoglucosamine sulfohydrolase (SGSH)

<400>  99
atgagctgcc ccgtgcccgc ctgctgcgcg ctgctgctag tcctggggct ctgccgggcg       60

cgtccccgga acgcactgct gctcctcgcg gatgacggag gctttgagag tggcgcgtac      120

aacaacagcg ccatcgccac cccgcacctg gacgccttgg cccgccgcag cctcctcttt      180

cgcaatgcct tcacctcggt cagcagctgc tctcccagcc gcgccagcct cctcactggc      240

ctgccccagc atcagaatgg gatgtacggg ctgcaccagg acgtgcacca cttcaactcc      300

ttcgacaagg tgcggagcct gccgctgctg ctcagccaag ctggtgtgcg cacaggcatc      360

atcgggaaga agcacgtggg gccggagacc gtgtacccgt ttgactttgc gtacacggag      420

gagaatggct ccgtcctcca ggtggggcgg aacatcacta gaattaagct gctcgtccgg      480

aaattcctgc agactcagga tgaccagcct ttcttcctct acgtcgcctt ccacgacccc      540

caccgctgtg ggcactccca gccccagtac ggaaccttct gtgagaagtt tggcaacgga      600

gagagcggca tgggtcgtat cccagactgg accccccagg cctacgaccc actggacgtg      660

ctggtgcctt acttcgtccc caacaccccg gcagcccgag ccgacctggc cgctcagtac      720

accaccgtcg gccgcatgga ccaaggagtt ggactggtgc tccaggagct gcgtgacgcc      780

ggtgtcctga acgacacact ggtgatcttc acgtccgaca acgggatccc cttccccagc      840

ggcaggacca acctgtactg gccgggcact gctgaaccct tactggtgtc atccccggag      900

cacccaaaac gctggggcca agtcagcgag gcctacgtga gcctcctaga cctcacgccc      960

accatcttgg attggttctc gatcccgtac cccagctacg ccatctttgg ctcgaagacc     1020

atccacctca ctggccggtc cctcctgccg gcgctggagg ccgagcccct ctgggccacc     1080

gtctttggca gccagagcca ccacgaggtc accatgtcct accccatgcg ctccgtgcag     1140

caccggcact tccgcctcgt gcacaacctc aacttcaaga tgccctttcc catcgaccag     1200

gacttctacg tctcacccac cttccaggac ctcctgaacc gcaccacagc tggtcagccc     1260

acgggctggt acaaggacct ccgtcattac tactaccggg cgcgctggga gctctacgac     1320

cggagccggg acccccacga gacccagaac ctggccaccg acccgcgctt tgctcagctt     1380

ctggagatgc ttcgggacca gctggccaag tggcagtggg agacccacga cccctgggtg     1440

tgcgcccccg acggcgtcct ggaggagaag ctctctcccc agtgccagcc cctccacaat     1500

gagctgtga                                                             1509


<210>  100
<211>  1509
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-SGSH

<400>  100
atgagctgtc ctgttccagc ctgttgtgcc ctgctgctgg ttctgggact gtgcagagcc       60

agacctagga acgctctgct gctgctcgct gacgatggcg gatttgagag cggcgcctac      120

aacaacagcg ccattgccac acctcacctg gatgccctgg ccagaagaag cctgctgttc      180

agaaacgcct tcaccagcgt gtccagctgc agcccttcta gagctagcct gctgacagga      240

ctgccccagc accagaatgg gatgtatggc ctgcaccagg acgtgcacca cttcaacagc      300

ttcgacaaag tgcggagcct gcctctgctt ctgtctcaag ccggcgtcag aacaggcatc      360

atcggcaaga aacacgtggg ccccgagaca gtgtacccct tcgatttcgc ctacaccgaa      420

gagaacggca gcgtgctgca agtgggcaga aacatcaccc ggatcaagct gctcgtgcgg      480

aagttcctgc agacccagga cgaccagcct ttcttcctgt acgtggcctt ccacgatcct      540

cacagatgcg gccatagcca gcctcagtac ggcaccttct gcgagaagtt tggcaacggc      600

gagagcggca tgggcagaat ccctgattgg acccctcagg cctacgatcc cctggatgtg      660

ctggtgcctt acttcgtgcc taacacacca gccgccagag ccgatctggc cgctcagtat      720

acaaccgtgg gaagaatgga ccaaggcgtc ggcctggttc tgcaagagct tagagatgcc      780

ggcgtgctga acgacaccct ggtcatcttt accagcgaca acggcatccc ctttccatct      840

ggccggacca atctgtactg gcctggaaca gctgagcccc tgctggtgtc tagccctgag      900

caccctaaga gatggggcca agtgtctgag gcctacgtgt ccctgctgga tctgacccct      960

accatcctgg actggttcag catcccctat cctagctacg ccatcttcgg cagcaagacc     1020

atccacctga ccggcagatc tctgctgcca gctctggaag ctgaacctct gtgggccaca     1080

gtgtttggca gccagtctca ccacgaagtg acaatgagct accccatgcg gagcgtgcag     1140

cacagacact tcagactggt gcacaacctg aacttcaaga tgccctttcc aatcgaccag     1200

gacttctatg tgtccccaac cttccaggac ctgctgaaca gaaccacagc cggccaacct     1260

accggctggt acaaggacct gcggcactac tactatagag ccagatggga gctgtacgac     1320

cggtccagag atccccacga gacacagaac ctggccaccg atcctagatt cgcccagctg     1380

ctggaaatgc tgagagatca gctggccaag tggcagtggg agacacacga tccttgggtc     1440

tgcgctcctg atggcgtgct ggaagagaag ctgtcccctc agtgtcagcc cctgcacaac     1500

gagctttaa                                                             1509


<210>  101
<211>  1596
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized + GET CO1-SGSH-GET

<400>  101
atgagctgtc ctgttccagc ctgttgtgcc ctgctgctgg ttctgggact gtgcagagcc       60

agacctagga acgctctgct gctgctcgct gacgatggcg gatttgagag cggcgcctac      120

aacaacagcg ccattgccac acctcacctg gatgccctgg ccagaagaag cctgctgttc      180

agaaacgcct tcaccagcgt gtccagctgc agcccttcta gagctagcct gctgacagga      240

ctgccccagc accagaatgg gatgtatggc ctgcaccagg acgtgcacca cttcaacagc      300

ttcgacaaag tgcggagcct gcctctgctt ctgtctcaag ccggcgtcag aacaggcatc      360

atcggcaaga aacacgtggg ccccgagaca gtgtacccct tcgatttcgc ctacaccgaa      420

gagaacggca gcgtgctgca agtgggcaga aacatcaccc ggatcaagct gctcgtgcgg      480

aagttcctgc agacccagga cgaccagcct ttcttcctgt acgtggcctt ccacgatcct      540

cacagatgcg gccatagcca gcctcagtac ggcaccttct gcgagaagtt tggcaacggc      600

gagagcggca tgggcagaat ccctgattgg acccctcagg cctacgatcc cctggatgtg      660

ctggtgcctt acttcgtgcc taacacacca gccgccagag ccgatctggc cgctcagtat      720

acaaccgtgg gaagaatgga ccaaggcgtc ggcctggttc tgcaagagct tagagatgcc      780

ggcgtgctga acgacaccct ggtcatcttt accagcgaca acggcatccc ctttccatct      840

ggccggacca atctgtactg gcctggaaca gctgagcccc tgctggtgtc tagccctgag      900

caccctaaga gatggggcca agtgtctgag gcctacgtgt ccctgctgga tctgacccct      960

accatcctgg actggttcag catcccctat cctagctacg ccatcttcgg cagcaagacc     1020

atccacctga ccggcagatc tctgctgcca gctctggaag ctgaacctct gtgggccaca     1080

gtgtttggca gccagtctca ccacgaagtg acaatgagct accccatgcg gagcgtgcag     1140

cacagacact tcagactggt gcacaacctg aacttcaaga tgccctttcc aatcgaccag     1200

gacttctatg tgtccccaac cttccaggac ctgctgaaca gaaccacagc cggccaacct     1260

accggctggt acaaggacct gcggcactac tactatagag ccagatggga gctgtacgac     1320

cggtccagag atccccacga gacacagaac ctggccaccg atcctagatt cgcccagctg     1380

ctggaaatgc tgagagatca gctggccaag tggcagtggg agacacacga tccttgggtc     1440

tgcgctcctg atggcgtgct ggaagagaag ctgtcccctc agtgtcagcc cctgcacaac     1500

gagctgcggc gtcgtcggcg aagaagaaga aagcgcaaga aaaaaggcaa aggcctgggc     1560

aagaagcggg acccctgtct gagaaagtac aaataa                               1596


<210>  102
<211>  1509
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO2-SGSH

<400>  102
atgagctgcc ctgtgcctgc ctgctgtgcc ctgctgctgg tgctgggcct gtgcagagcc       60

agacctagga atgccctgct gctgctggct gatgatgggg gctttgagag tggggcctac      120

aacaacagtg ccattgccac cccccacctg gatgccctgg ccagaagaag cctgctgttc      180

agaaatgcct tcaccagtgt gagcagctgc agccccagca gagccagcct gctgacaggc      240

ctgccccagc accagaatgg catgtatggc ctgcaccagg atgtgcacca cttcaacagc      300

tttgacaagg tgagaagcct gcccctgctg ctgagccagg ctggggtgag aacaggcatc      360

attggcaaga agcatgtggg ccctgagaca gtgtacccct ttgactttgc ctacacagag      420

gagaatggca gtgtgctgca ggtgggcaga aacatcacca gaatcaagct gctggtgaga      480

aagttcctgc agacccagga tgaccagccc ttcttcctgt atgtggcctt ccatgacccc      540

cacagatgtg gccacagcca gccccagtat ggcaccttct gtgagaagtt tggcaatggg      600

gagagtggca tgggcagaat ccctgactgg accccccagg cctatgaccc cctggatgtg      660

ctggtgccct actttgtgcc caacacccct gctgccagag ctgacctggc tgcccagtac      720

accacagtgg gcagaatgga ccagggggtg ggcctggtgc tgcaggagct gagagatgct      780

ggggtgctga atgacaccct ggtgatcttc accagtgaca atggcatccc cttccccagt      840

ggcagaacca acctgtactg gcctggcaca gctgagcccc tgctggtgag cagccctgag      900

caccccaaga gatggggcca ggtgagtgag gcctatgtga gcctgctgga cctgaccccc      960

accatcctgg actggttcag catcccctac cccagctatg ccatctttgg cagcaagacc     1020

atccacctga caggcagaag cctgctgcct gccctggagg ctgagcccct gtgggccaca     1080

gtgtttggca gccagagcca ccatgaggtg accatgagct accccatgag aagtgtgcag     1140

cacagacact tcagactggt gcacaacctg aacttcaaga tgcccttccc cattgaccag     1200

gacttctatg tgagccccac cttccaggac ctgctgaaca gaaccacagc tggccagccc     1260

acaggctggt acaaggacct gagacactac tactacagag ccagatggga gctgtatgac     1320

agaagcagag acccccatga gacccagaac ctggccacag accccagatt tgcccagctg     1380

ctggagatgc tgagagacca gctggccaag tggcagtggg agacccatga cccctgggtg     1440

tgtgcccctg atggggtgct ggaggagaag ctgagccccc agtgccagcc cctgcacaat     1500

gagctgtga                                                             1509


<210>  103
<211>  921
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized Ceroid Lipofuscinosis, Neuronal, 1 (CLN1)

<400>  103
atggcttctc cggggtgtct gtggctgctg gcagtggcac tccttccctg gacttgcgcc       60

agccgggctc tgcagcacct cgaccctcca gcccctcttc cactggtgat ttggcacgga      120

atgggtgatt cctgctgtaa tcccctgtca atgggagcca tcaagaagat ggtggagaag      180

aagatccctg gaatctacgt gctgtcactg gagattggaa agaccctgat ggaggacgtc      240

gagaactcct tcttcctcaa tgtcaactct caagtgacca ccgtctgcca ggccctggcc      300

aaggacccga agctgcagca ggggtataat gctatggggt tcagccaggg aggacagttc      360

cttcgggctg tggcccaacg ctgccctagc ccacccatga tcaacctgat ctcagtgggt      420

ggccagcatc agggcgtgtt cggacttccc cggtgtcccg gggaatcctc tcatatctgc      480

gacttcatcc gcaaaactct caatgcaggc gcttattcaa aggtcgtcca agagaggctg      540

gtgcaagccg agtactggca cgatcccatt aaggaggacg tgtacagaaa tcactcaatc      600

tttctggccg acattaacca ggagagggga attaacgaat catataagaa gaatctcatg      660

gccctcaaaa agttcgtcat ggtgaagttc cttaacgata gcattgtgga cccagtggac      720

agcgaatggt tcggatttta ccgctcaggc caggcaaaag aaaccatccc tctccaagag      780

acttctcttt acacccaaga cagacttggg cttaaggaaa tggataacgc tggtcagctg      840

gtgttcctcg ccaccgaagg tgaccatctg cagctcagcg aagagtggtt ctacgctcat      900

atcatcccgt ttcttggttg a                                                921


<210>  104
<211>  885
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(885)
<223>  Survival Motor Neuron 1 (SMN1)

<400>  104
atggcgatga gcagcggcgg cagtggtggc ggcgtcccgg agcaggagga ttccgtgctg       60

ttccggcgcg gcacaggcca gagcgatgat tctgacattt gggatgatac agcactgata      120

aaagcatatg ataaagctgt ggcttcattt aagcatgctc taaagaatgg tgacatttgt      180

gaaacttcgg gtaaaccaaa aaccacacct aaaagaaaac ctgctaagaa gaataaaagc      240

caaaagaaga atactgcagc ttccttacaa cagtggaaag ttggggacaa atgttctgcc      300

atttggtcag aagacggttg catttaccca gctaccattg cttcaattga ttttaagaga      360

gaaacctgtg ttgtggttta cactggatat ggaaatagag aggagcaaaa tctgtccgat      420

ctactttccc caatctgtga agtagctaat aatatagaac agaatgctca agagaatgaa      480

aatgaaagcc aagtttcaac agatgaaagt gagaactcca ggtctcctgg aaataaatca      540

gataacatca agcccaaatc tgctccatgg aactcttttc tccctccacc accccccatg      600

ccagggccaa gactgggacc aggaaagcca ggtctaaaat tcaatggccc accaccgcca      660

ccgccaccac caccacccca cttactatca tgctggctgc ctccatttcc ttctggacca      720

ccaataattc ccccaccacc tcccatatgt ccagattctc ttgatgatgc tgatgctttg      780

ggaagtatgt taatttcatg gtacatgagt ggctatcata ctggctatta tatgggtttt      840

agacaaaatc aaaaagaagg aaggtgctca cattccttaa attaa                      885


<210>  105
<211>  885
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-SMN1

<400>  105
atggcgatgt ctagtggtgg atctggtggc ggcgtgcccg agcaagaaga tagcgtcctg       60

ttcagaagag gcaccggcca gagcgacgac agcgacatct gggatgatac agccctgatc      120

aaggcctacg acaaggccgt ggccagcttt aagcacgccc tgaagaacgg cgatatctgc      180

gagacaagcg gcaagcccaa gaccacacct aagagaaagc ccgccaagaa gaacaagagc      240

cagaagaaga ataccgccgc cagcctgcag cagtggaaag tgggcgataa gtgcagcgcc      300

atttggagcg aggacggctg tatctaccct gccacaatcg ccagcatcga cttcaagcgg      360

gaaacctgcg tggtggtgta cacaggctac ggcaacagag aggaacagaa cctgagcgac      420

ctgctgtccc caatttgcga ggtggccaac aacatcgagc agaacgccca agagaacgag      480

aacgagtccc aggtgtccac cgacgagagc gagaatagca gaagccccgg caacaagagc      540

gacaacatca agcctaagag cgccccttgg aacagcttcc tgcctcctcc tccaccaatg      600

cctggaccta gactcggacc tggaaagccc ggcctgaagt tcaatggacc tccaccaccg      660

ccaccacctc cgcctccaca tcttctgtct tgttggctgc ctccatttcc tagcggccct      720

ccaatcatcc cgccacctcc acctatctgc cccgacagtc tggatgatgc tgatgccctg      780

ggctccatgc tgatctcttg gtacatgagc ggctaccaca ccggctacta catgggcttc      840

agacagaacc agaaagaggg ccgttgcagc cacagcctga actga                      885


<210>  106
<211>  885
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO2-SMN1

<400>  106
atggccatga gcagtggggg cagtggagga ggggtgcctg agcaggagga cagtgtgctg       60

ttcagaagag gcacaggcca gagtgatgac agtgacatct gggatgacac agccctgatc      120

aaggcctatg acaaggctgt ggccagcttc aagcatgccc tgaagaatgg ggacatctgt      180

gagaccagtg gcaagcccaa gaccaccccc aagagaaagc ctgccaagaa gaacaagagc      240

cagaagaaga acacagctgc cagcctgcag cagtggaagg tgggagacaa gtgcagtgcc      300

atctggagtg aggatggctg catctaccct gccaccattg ccagcattga cttcaagaga      360

gagacctgtg tggtggtgta cacaggctat ggcaacagag aggagcagaa cctgagtgac      420

ctgctgagcc ccatctgtga ggtggccaac aacattgagc agaatgccca ggagaatgag      480

aatgagagcc aggtgagcac agatgagagt gagaacagca gaagccctgg caacaagagt      540

gacaacatca agcccaagag tgccccttgg aacagcttcc tgccaccccc accacccatg      600

cctggcccca gactgggccc tggcaagcct ggcctgaagt tcaatggccc accaccccct      660

cctccaccac cccctcccca cctgctgagc tgctggctgc cccccttccc cagtggccca      720

cccatcatcc cacctccccc acccatctgc cctgacagcc tggatgatgc tgatgccctg      780

ggcagcatgc tgatcagctg gtacatgagt ggctaccaca caggctacta catgggcttc      840

agacagaacc agaaggaggg cagatgcagc cacagcctga actga                      885


<210>  107
<211>  1548
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(1548)
<223>  Tissue Non-specific Alkaline Phosphatase (TNALP)

<400>  107
atgatttcac cattcttagt actggccatt ggcacctgcc ttactaactc actagtgcca       60

gagaaagaga aagaccccaa gtactggcga gaccaagcgc aagagacact gaaatatgcc      120

ctggagcttc agaagctcaa caccaacgtg gctaagaatg tcatcatgtt cctgggagat      180

gggatgggtg tctccacagt gacggctgcc cgcatcctca agggtcagct ccaccacaac      240

cctggggagg agaccaggct ggagatggac aagttcccct tcgtggccct ctccaagacg      300

tacaacacca atgcccaggt ccctgacagc gccggcaccg ccaccgccta cctgtgtggg      360

gtgaaggcca atgagggcac cgtgggggta agcgcagcca ctgagcgttc ccggtgcaac      420

accacccagg ggaacgaggt cacctccatc ctgcgctggg ccaaggacgc tgggaaatct      480

gtgggcattg tgaccaccac gagagtgaac catgccaccc ccagcgccgc ctacgcccac      540

tcggctgacc gggactggta ctcagacaac gagatgcccc ctgaggcctt gagccagggc      600

tgtaaggaca tcgcctacca gctcatgcat aacatcaggg acattgacgt gatcatgggg      660

ggtggccgga aatacatgta ccccaagaat aaaactgatg tggagtatga gagtgacgag      720

aaagccaggg gcacgaggct ggacggcctg gacctcgttg acacctggaa gagcttcaaa      780

ccgagataca agcactccca cttcatctgg aaccgcacgg aactcctgac ccttgacccc      840

cacaatgtgg actacctatt gggtctcttc gagccagggg acatgcagta cgagctgaac      900

aggaacaacg tgacggaccc gtcactctcc gagatggtgg tggtggccat ccagatcctg      960

cggaagaacc ccaaaggctt cttcttgctg gtggaaggag gcagaattga ccacgggcac     1020

catgaaggaa aagccaagca ggccctgcat gaggcggtgg agatggaccg ggccatcggg     1080

caggcaggca gcttgacctc ctcggaagac actctgaccg tggtcactgc ggaccattcc     1140

cacgtcttca catttggtgg atacaccccc cgtggcaact ctatctttgg tctggccccc     1200

atgctgagtg acacagacaa gaagcccttc actgccatcc tgtatggcaa tgggcctggc     1260

tacaaggtgg tgggcggtga acgagagaat gtctccatgg tggactatgc tcacaacaac     1320

taccaggcgc agtctgctgt gcccctgcgc cacgagaccc acggcgggga ggacgtggcc     1380

gtcttctcca agggccccat ggcgcacctg ctgcacggcg tccacgagca gaactacgtc     1440

ccccacgtga tggcgtatgc agcctgcatc ggggccaacc tcggccactg tgctcctgcc     1500

agctcggcag gatccgatga tgacgacgac gatgacgatg atgattga                  1548


<210>  108
<211>  1548
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-TNALP; contains D10 tag at C-terminus

<400>  108
atgatctctc catttctggt gctggccatc ggcacctgtc tgaccaactc actagtgccc       60

gagaaagaga aggaccccaa gtactggcgc gatcaggccc aagagacact gaagtacgcc      120

ctggaactgc agaaactgaa caccaacgtg gccaagaacg tgatcatgtt cctcggcgac      180

ggcatgggcg tgtccacagt tacagccgcc agaatcctga agggccagct gcaccataat      240

cctggcgaag agacacggct ggaaatggac aagttcccat tcgtggccct gagcaagacc      300

tacaacacca atgctcaggt gcccgattct gccggaacag ccacagctta tctgtgcggc      360

gtgaaggcca atgagggcac cgttggagtg tctgccgcca ccgaaagatc ccggtgcaat      420

accacacagg gcaacgaagt gaccagcatc ctgagatggg ccaaagacgc cggcaagtct      480

gtgggcatcg tgaccaccac cagagtgaac cacgccacac ctagcgccgc ctatgctcac      540

tctgccgaca gagactggta cagcgacaac gagatgcctc ctgaggctct gtctcagggc      600

tgcaaggata tcgcctacca gctgatgcac aacatccggg acattgatgt gatcatgggc      660

ggaggccgga agtacatgta tcccaagaac aagaccgacg tcgagtacga gagcgacgag      720

aaggccagag gcacaagact ggatggcctg gacctggtgg atacctggaa gtccttcaag      780

ccccggtaca agcacagcca cttcatctgg aaccggaccg agctgctgac actggaccct      840

cacaatgtgg actacctgct gggcctgttc gagcccggcg atatgcagta cgagctgaac      900

cggaacaacg tgacagaccc cagcctgagc gagatggtgg ttgtggccat tcagatcctg      960

cggaagaacc ccaagggatt cttcctgctg gtggaaggcg gcaggatcga tcacggacac     1020

catgagggaa aagccaagca ggccctgcac gaggccgtcg aaatggatag agccattggc     1080

caggccggca gcctgacaag ctctgaggat acactgaccg tggtcaccgc cgatcacagc     1140

cacgtgttca cattcggcgg ctacacccct agaggcaaca gcatctttgg actggcccct     1200

atgctgagcg acaccgacaa gaagcctttc accgccatcc tgtacggcaa cggccctggc     1260

tataaggttg tcggaggcga gagggaaaac gtgtccatgg tggattacgc ccacaacaac     1320

taccaggctc agagcgccgt gcctctgaga cacgaaacac acggcggaga agatgtggcc     1380

gtgttcagca agggccccat ggctcatctg ctgcatggcg tgcacgagca gaattacgtg     1440

ccacacgtga tggcctacgc cgcctgtatt ggagccaatc tgggacattg tgcccctgcc     1500

agtagcgccg gatccgacga tgatgacgac gacgatgacg atgactga                  1548


<210>  109
<211>  1548
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO2-TNALP; contains D10 tag at C-terminus

<400>  109
atgatcagcc ccttcctggt gctggccatt ggcacctgcc tgaccaacag cctggtgcct       60

gagaaggaga aggaccccaa gtactggaga gaccaggccc aggagaccct gaagtatgcc      120

ctggagctgc agaagctgaa caccaatgtg gccaagaatg tgatcatgtt cctgggggat      180

ggcatggggg tgagcacagt gacagctgcc agaatcctga agggccagct gcaccacaac      240

cctggggagg agaccagact ggagatggac aagttcccct ttgtggccct gagcaagacc      300

tacaacacca atgcccaggt gcctgacagt gctggcacag ccacagccta cctgtgtggg      360

gtgaaggcca atgagggcac agtgggggtg agtgctgcca cagagagaag cagatgcaac      420

accacccagg gcaatgaggt gaccagcatc ctgagatggg ccaaggatgc tggcaagagt      480

gtgggcattg tgaccaccac cagagtgaac catgccaccc ccagtgctgc ctatgcccac      540

agtgctgaca gagactggta cagtgacaat gagatgcccc ctgaggccct gagccagggc      600

tgcaaggaca ttgcctacca gctgatgcac aacatcagag acattgatgt gatcatgggg      660

gggggcagaa agtacatgta ccccaagaac aagacagatg tggagtatga gagtgatgag      720

aaggccagag gcaccagact ggatggcctg gacctggtgg acacctggaa gagcttcaag      780

cccagataca agcacagcca cttcatctgg aacagaacag agctgctgac cctggacccc      840

cacaatgtgg actacctgct gggcctgttt gagcctgggg acatgcagta tgagctgaac      900

agaaacaatg tgacagaccc cagcctgagt gagatggtgg tggtggccat ccagatcctg      960

agaaagaacc ccaagggctt cttcctgctg gtggaggggg gcagaattga ccatggccac     1020

catgagggca aggccaagca ggccctgcat gaggctgtgg agatggacag agccattggc     1080

caggctggca gcctgaccag cagtgaggac accctgacag tggtgacagc tgaccacagc     1140

catgtgttca cctttggggg ctacaccccc agaggcaaca gcatctttgg cctggccccc     1200

atgctgagtg acacagacaa gaagcccttc acagccatcc tgtatggcaa tggccctggc     1260

tacaaggtgg tgggggggga gagagagaat gtgagcatgg tggactatgc ccacaacaac     1320

taccaggccc agagtgctgt gcccctgaga catgagaccc atggggggga ggatgtggct     1380

gtgttcagca agggccccat ggcccacctg ctgcatgggg tgcatgagca gaactatgtg     1440

ccccatgtga tggcctatgc tgcctgcatt ggggccaacc tgggccactg tgcccctgcc     1500

agcagtgctg gatccgatga tgatgatgat gatgatgatg atgactga                  1548


<210>  110
<211>  636
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(636)
<223>  Glial Cell Derived Neurotrophic Factor (GDNF)

<400>  110
atgaagttat gggatgtcgt ggctgtctgc ctggtgctgc tccacaccgc gtccgccttc       60

ccgctgcccg ccggcaagag gcctcccgag gcgcccgccg aagaccgctc cctcggccgc      120

cgccgcgcgc ccttcgcgct gagcagtgac tcaaatatgc cagaggatta tcctgatcag      180

ttcgatgatg tcatggattt tattcaagcc accattaaaa gactgaaaag gtcaccagat      240

aaacaaatgg cagtgcttcc tagaagagag cggaatcggc aggctgcagc tgccaaccca      300

gagaattcca gaggaaaagg tcggagaggc cagaggggca aaaaccgggg ttgtgtctta      360

actgcaatac atttaaatgt cactgacttg ggtctgggct atgaaaccaa ggaggaactg      420

atttttaggt actgcagcgg ctcttgcgat gcagctgaga caacgtacga caaaatattg      480

aaaaacttat ccagaaatag aaggctggtg agtgacaaag tagggcaggc atgttgcaga      540

cccatcgcct ttgatgatga cctgtcgttt ttagatgata acctggttta ccatattcta      600

agaaagcatt ccgctaaaag gtgtggatgt atctaa                                636


<210>  111
<211>  1611
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(1611)
<223>  Glucosylceramidase beta (GBA1)

<400>  111
atggagtttt caagtccttc cagagaggaa tgtcccaagc ctttgagtag ggtaagcatc       60

atggctggca gcctcacagg attgcttcta cttcaggcag tgtcgtgggc atcaggtgcc      120

cgcccctgca tccctaaaag cttcggctac agctcggtgg tgtgtgtctg caatgccaca      180

tactgtgact cctttgaccc cccgaccttt cctgcccttg gtaccttcag ccgctatgag      240

agtacacgca gtgggcgacg gatggagctg agtatggggc ccatccaggc taatcacacg      300

ggcacaggcc tgctactgac cctgcagcca gaacagaagt tccagaaagt gaagggattt      360

ggaggggcca tgacagatgc tgctgctctc aacatccttg ccctgtcacc ccctgcccaa      420

aatttgctac ttaaatcgta cttctctgaa gaaggaatcg gatataacat catccgggta      480

ccaatggcca gctgtgactt ctccatccgc acctacacct atgcagacac ccctgatgat      540

ttccagttgc acaacttcag cctcccagag gaagatacca agctcaagat acccctgatt      600

caccgagccc tgcagttggc ccagcgtccc gtttcactcc ttgccagccc ctggacatca      660

cccacttggc tcaagaccaa tggagcggtg aatgggaagg ggtcactcaa gggacagccc      720

ggagacatct accaccagac ctgggccaga tactttgtga agttcctgga tgcctatgct      780

gagcacaagt tacagttctg ggcagtgaca gctgaaaatg agccttctgc tgggctgttg      840

agtggatacc ccttccagtg cctgggcttc acccctgaac atcagcgaga cttcattgcc      900

cgtgacctag gtcctaccct cgccaacagt actcaccaca atgtccgcct actcatgctg      960

gatgaccaac gcttgctgct gccccactgg gcaaaggtgg tactgacaga cccagaagca     1020

gctaaatatg ttcatggcat tgctgtacat tggtacctgg actttctggc tccagccaaa     1080

gccaccctag gggagacaca ccgcctgttc cccaacacca tgctctttgc ctcagaggcc     1140

tgtgtgggct ccaagttctg ggagcagagt gtgcggctag gctcctggga tcgagggatg     1200

cagtacagcc acagcatcat cacgaacctc ctgtaccatg tggtcggctg gaccgactgg     1260

aaccttgccc tgaaccccga aggaggaccc aattgggtgc gtaactttgt cgacagtccc     1320

atcattgtag acatcaccaa ggacacgttt tacaaacagc ccatgttcta ccaccttggc     1380

cacttcagca agttcattcc tgagggctcc cagagagtgg ggctggttgc cagtcagaag     1440

aacgacctgg acgcagtggc actgatgcat cccgatggct ctgctgttgt ggtcgtgcta     1500

aaccgctcct ctaaggatgt gcctcttacc atcaaggatc ctgctgtggg cttcctggag     1560

acaatctcac ctggctactc cattcacacc tacctgtggc gtcgccagtg a              1611


<210>  112
<211>  1611
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-GBA1

<400>  112
atggagttca gcagccccag cagagaggag tgccccaagc ccctgagcag agtgagcatc       60

atggctggca gcctgacagg cctgctgctg ctgcaggctg tgagctgggc cagtggggcc      120

agaccctgca tccccaagag ctttggctac agcagtgtgg tgtgtgtgtg caatgccacc      180

tactgtgaca gctttgaccc ccccaccttc cctgccctgg gcaccttcag cagatatgag      240

agcaccagaa gtggcagaag aatggagctg agcatgggcc ccatccaggc caaccacaca      300

ggcacaggcc tgctgctgac cctgcagcct gagcagaagt tccagaaggt gaagggcttt      360

gggggggcca tgacagatgc tgctgccctg aacatcctgg ccctgagccc ccctgcccag      420

aacctgctgc tgaagagcta cttcagtgag gagggcattg gctacaacat catcagagtg      480

ccaatggcca gctgtgactt cagcatcaga acctacacct atgctgacac ccctgatgac      540

ttccagctgc acaacttcag cctgcctgag gaggacacca agctgaagat ccccctgatc      600

cacagagccc tgcagctggc ccagagacct gtgagcctgc tggccagccc ctggaccagc      660

cccacctggc tgaagaccaa tggggctgtg aatggcaagg gcagcctgaa gggccagcct      720

ggggacatct accaccagac ctgggccaga tactttgtga agttcctgga tgcctatgct      780

gagcacaagc tgcagttctg ggctgtgaca gctgagaatg agcccagtgc tggcctgctg      840

agtggctacc ccttccagtg cctgggcttc acccctgagc accagagaga cttcattgcc      900

agagacctgg gccccaccct ggccaacagc acccaccaca atgtgagact gctgatgctg      960

gatgaccaga gactgctgct gccccactgg gccaaggtgg tgctgacaga ccctgaggct     1020

gccaagtatg tgcatggcat tgctgtgcac tggtacctgg acttcctggc ccctgccaag     1080

gccaccctgg gggagaccca cagactgttc cccaacacca tgctgtttgc cagtgaggcc     1140

tgtgtgggca gcaagttctg ggagcagagt gtgagactgg gcagctggga cagaggcatg     1200

cagtacagcc acagcatcat caccaacctg ctgtaccatg tggtgggctg gacagactgg     1260

aacctggccc tgaaccctga ggggggcccc aactgggtga gaaactttgt ggacagcccc     1320

atcattgtgg acatcaccaa ggacaccttc tacaagcagc ccatgttcta ccacctgggc     1380

cacttcagca agttcatccc tgagggcagc cagagagtgg gcctggtggc cagccagaag     1440

aatgacctgg atgctgtggc cctgatgcac cctgatggca gtgctgtggt ggtggtgctg     1500

aacagaagca gcaaggatgt gcccctgacc atcaaggacc ctgctgtggg cttcctggag     1560

accatcagcc ctggctacag catccacacc tacctgtgga gaagacagtg a              1611


<210>  113
<211>  1611
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO2-GBA1

<400>  113
atggagttta gcagccctag cagagaggaa tgccccaagc ctctgagccg ggtgtcaatc       60

atggccggat ctctgacagg actgctgctg cttcaggccg tgtcttgggc ttctggcgct      120

agaccttgca tccccaagag cttcggctac agcagcgtcg tgtgcgtgtg caatgccacc      180

tactgcgaca gcttcgaccc tcctaccttt cctgctctgg gcaccttcag cagatacgag      240

agcaccagat ccggcagacg gatggaactg agcatgggac ccatccaggc caatcacaca      300

ggcactggcc tgctgctgac actgcagcct gagcagaaat tccagaaagt gaaaggcttc      360

ggcggagcca tgacagatgc cgccgctctg aatatcctgg ctctgtctcc accagctcag      420

aacctgctgc tcaagagcta cttcagcgag gaaggcatcg gctacaacat catccgggtg      480

ccaatggcca gctgcgactt cagcatccgg acctacacct acgccgacac acccgacgat      540

ttccagctgc acaacttcag cctgcctgaa gaggacacca agctgaagat ccctctgatc      600

cacagagccc tgcagctggc acaaagaccc gtttctctgc tggctagccc ctggacatct      660

cccacctggc tgaaaacaaa tggcgccgtg aatggcaagg gcagcctgaa aggccaacct      720

ggcgatatct accaccagac ctgggccaga tacttcgtga agttcctgga cgcctatgcc      780

gagcacaagc tgcagttttg ggccgtgaca gccgagaacg aaccttctgc tggactgctg      840

agcggctacc cctttcagtg cctgggcttt acacccgagc accagcggga ctttatcgcc      900

agagatctgg gacccacact ggccaatagc acccaccata atgtgcggct gctgatgctg      960

gacgaccaga gactgcttct gccccactgg gctaaagtgg tgctgacaga tcctgaggcc     1020

gccaaatacg tgcacggaat cgccgtgcac tggtatctgg actttctggc ccctgccaag     1080

gccacactgg gagagacaca cagactgttc cccaacacca tgctgttcgc cagcgaagcc     1140

tgtgtgggca gcaagttttg ggaacagagc gtgcggctcg gcagctggga tagaggcatg     1200

cagtacagcc acagcatcat caccaacctg ctgtaccacg tcgtcggctg gaccgactgg     1260

aatctggccc tgaatcctga aggcggccct aactgggtcc gaaacttcgt ggacagcccc     1320

atcatcgtgg acatcaccaa ggacaccttc tacaagcagc ccatgttcta ccacctggga     1380

cacttcagca agttcatccc cgagggctct cagcgcgttg gactggtggc cagccagaag     1440

aatgatctgg acgccgtggc tctgatgcac cctgatggat ctgctgtggt ggtggtcctg     1500

aaccgcagca gcaaagatgt gcccctgacc atcaaggatc ccgccgtggg attcctggaa     1560

acaatcagcc ctggctactc catccacacc tacctgtggc ggagacagtg a              1611


<210>  114
<211>  1962
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(1962)
<223>  Iduronidase alpha-L- (IDUA)

<400>  114
atgcgtcccc tgcgcccccg cgccgcgctg ctggcgctcc tggcctcgct cctggccgcg       60

cccccggtgg ccccggccga ggccccgcac ctggtgcatg tggacgcggc ccgcgcgctg      120

tggcccctgc ggcgcttctg gaggagcaca ggcttctgcc ccccgctgcc acacagccag      180

gctgaccagt acgtcctcag ctgggaccag cagctcaacc tcgcctatgt gggcgccgtc      240

cctcaccgcg gcatcaagca ggtccggacc cactggctgc tggagcttgt caccaccagg      300

gggtccactg gacggggcct gagctacaac ttcacccacc tggacgggta cctggacctt      360

ctcagggaga accagctcct cccagggttt gagctgatgg gcagcgcctc gggccacttc      420

actgactttg aggacaagca gcaggtgttt gagtggaagg acttggtctc cagcctggcc      480

aggagataca tcggtaggta cggactggcg catgtttcca agtggaactt cgagacgtgg      540

aatgagccag accaccacga ctttgacaac gtctccatga ccatgcaagg cttcctgaac      600

tactacgatg cctgctcgga gggtctgcgc gccgccagcc ccgccctgcg gctgggaggc      660

cccggcgact ccttccacac cccaccgcga tccccgctga gctggggcct cctgcgccac      720

tgccacgacg gtaccaactt cttcactggg gaggcgggcg tgcggctgga ctacatctcc      780

ctccacagga agggtgcgcg cagctccatc tccatcctgg agcaggagaa ggtcgtcgcg      840

cagcagatcc ggcagctctt ccccaagttc gcggacaccc ccatttacaa cgacgaggcg      900

gacccgctgg tgggctggtc cctgccacag ccgtggaggg cggacgtgac ctacgcggcc      960

atggtggtga aggtcatcgc gcagcatcag aacctgctac tggccaacac cacctccgcc     1020

ttcccctacg cgctcctgag caacgacaat gccttcctga gctaccaccc gcaccccttc     1080

gcgcagcgca cgctcaccgc gcgcttccag gtcaacaaca cccgcccgcc gcacgtgcag     1140

ctgttgcgca agccggtgct cacggccatg gggctgctgg cgctgctgga tgaggagcag     1200

ctctgggccg aagtgtcgca ggccgggacc gtcctggaca gcaaccacac ggtgggcgtc     1260

ctggccagcg cccaccgccc ccagggcccg gccgacgcct ggcgcgccgc ggtgctgatc     1320

tacgcgagcg acgacacccg cgcccacccc aaccgcagcg tcgcggtgac cctgcggctg     1380

cgcggggtgc cccccggccc gggcctggtc tacgtcacgc gctacctgga caacgggctc     1440

tgcagccccg acggcgagtg gcggcgcctg ggccggcccg tcttccccac ggcagagcag     1500

ttccggcgca tgcgcgcggc tgaggacccg gtggccgcgg cgccccgccc cttacccgcc     1560

ggcggccgcc tgaccctgcg ccccgcgctg cggctgccgt cgcttttgct ggtgcacgtg     1620

tgtgcgcgcc ccgagaagcc gcccgggcag gtcacgcggc tccgcgccct gcccctgacc     1680

caagggcagc tggttctggt ctggtcggat gaacacgtgg gctccaagtg cctgtggaca     1740

tacgagatcc agttctctca ggacggtaag gcgtacaccc cggtcagcag gaagccatcg     1800

accttcaacc tctttgtgtt cagcccagac acaggtgctg tctctggctc ctaccgagtt     1860

cgagccctgg actactgggc ccgaccaggc cccttctcgg accctgtgcc gtacctggag     1920

gtccctgtgc caagagggcc cccatccccg ggcaatccat ga                        1962


<210>  115
<211>  1962
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-IDUA

<400>  115
atgagacccc tgagacccag agctgccctg ctggccctgc tggccagcct gctggctgcc       60

ccccctgtgg cccctgctga ggccccccac cttgtacatg tggatgctgc cagagccctg      120

tggcccctga gaagattctg gagaagcaca ggcttctgcc cccccctgcc ccacagccag      180

gctgaccagt atgtgctgag ctgggaccag cagctgaacc tggcctatgt gggggctgtg      240

ccccacagag gcatcaagca ggtgagaacc cactggctgc tggagctggt gaccaccaga      300

ggcagcacag gcagaggcct gagctacaac ttcacccacc tggatggcta cctggacctg      360

ctgagagaga accagctgct gcctggcttt gagctgatgg gcagtgccag tggccacttc      420

acagactttg aggacaagca gcaggtgttt gagtggaagg acctggtgag cagcctggcc      480

agaagataca ttggcagata tggcctggcc catgtgagca agtggaactt tgagacctgg      540

aatgagcctg accaccatga ctttgacaat gtgagcatga ccatgcaggg cttcctgaac      600

tactatgatg cctgcagtga gggcctgaga gctgccagcc ctgccctgag actggggggc      660

cctggggaca gcttccacac cccccccaga agccccctga gctggggcct gctgagacac      720

tgccatgatg gcaccaactt cttcacaggg gaggctgggg tgagactgga ctacatcagc      780

ctgcacagaa agggggccag aagcagcatc agcatcctgg agcaggagaa ggtggtggcc      840

cagcagatca gacagctgtt ccccaagttt gctgacaccc ccatctacaa tgatgaggct      900

gaccccctgg tgggctggag cctgccccag ccctggagag ctgatgtgac ctatgctgcc      960

atggtggtga aggtgattgc ccagcaccag aacctgctgc tggccaacac caccagtgcc     1020

ttcccctatg ccctgctgag caatgacaat gccttcctga gctaccaccc ccaccccttt     1080

gcccagagaa ccctgacagc cagattccag gtgaacaaca ccagaccccc ccatgtgcag     1140

ctgctgagaa agcctgtgct gacagccatg ggcctgctgg ccctgctgga tgaggagcag     1200

ctgtgggctg aggtgagcca ggctggcaca gtgctggaca gcaaccacac agtgggggtg     1260

ctggccagtg cccacagacc ccagggccct gctgatgcct ggagagctgc tgtgctgatc     1320

tatgccagtg atgacaccag agcccacccc aacagaagtg tggctgtgac cctgagactg     1380

agaggggtgc cccctggccc tggcctggtg tatgtgacca gatacctgga caatggcctg     1440

tgcagccctg atggggagtg gagaagactg ggcagacctg tgttccccac agctgagcag     1500

ttcagaagaa tgagagctgc tgaggaccct gtggctgctg cccccagacc cctgcctgct     1560

gggggcagac tgaccctgag acctgccctg agactgccca gcctgctgct ggtgcatgtg     1620

tgtgccagac ctgagaagcc ccctggccag gtgaccagac tgagagccct gcccctgacc     1680

cagggccagc tggtgctggt gtggagtgat gagcatgtgg gcagcaagtg cctgtggacc     1740

tatgagatcc agttcagcca ggatggcaag gcctacaccc ctgtgagcag aaagcccagc     1800

accttcaacc tgtttgtgtt cagccctgac acaggggctg tgagtggcag ctacagagtg     1860

agagccctgg actactgggc cagacctggc cccttcagtg accctgtgcc ctacctggag     1920

gtgcctgtgc ccagaggccc ccccagccct ggcaacccct ga                        1962


<210>  116
<211>  1578
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(1578)
<223>  Cytochrome P450 family 4 subfamily V member 2 (CYP4V2)

<400>  116
atggcggggc tctggctggg gctcgtgtgg cagaagctgc tgctgtgggg cgcggcgagt       60

gccctttccc tggccggcgc cagtctggtc ctgagcctgc tgcagagggt ggcgagctac      120

gcgcggaaat ggcagcagat gcggcccatc cccacggtgg cccgcgccta cccactggtg      180

ggccacgcgc tgctgatgaa gccggacggg cgagaatttt ttcagcagat cattgagtac      240

acagaggaat accgccacat gccgctgctg aagctctggg tcgggccagt gcccatggtg      300

gccctttata atgcagaaaa tgtggaggta attttaacta gttcaaagca aattgacaaa      360

tcctctatgt acaagttttt agaaccatgg cttggcctag gacttcttac aagtactgga      420

aacaaatggc gctccaggag aaagatgtta acacccactt tccattttac cattctggaa      480

gatttcttag atatcatgaa tgaacaagca aatatattgg ttaagaaact tgaaaaacac      540

attaaccaag aagcatttaa ctgctttttt tacatcactc tttgtgcctt agatatcatc      600

tgtgaaacag ctatggggaa gaatattggt gctcaaagta atgatgattc cgagtatgtc      660

cgtgcagttt atagaatgag tgagatgata tttcgaagaa taaagatgcc ctggctttgg      720

cttgatctct ggtatcttat gtttaaagaa ggatgggaac acaaaaagag ccttcagatc      780

ctacatactt ttaccaacag tgtcatcgct gaacgggcca atgaaatgaa cgccaatgaa      840

gactgtagag gtgatggcag gggctctgcc ccctccaaaa ataaacgcag ggcctttctt      900

gacttgcttt taagtgtgac tgatgacgaa gggaacaggc taagtcatga agatattcga      960

gaagaagttg acaccttcat gtttgagggg cacgatacaa ctgcagctgc aataaactgg     1020

tccttatacc tgttgggttc taacccagaa gtccagaaaa aagtggatca tgaattggat     1080

gacgtgtttg ggaagtctga ccgtcccgct acagtagaag acctgaagaa acttcggtat     1140

ctggaatgtg ttattaagga gacccttcgc ctttttcctt ctgttccttt atttgcccgt     1200

agtgttagtg aagattgtga agtggcaggt tacagagttc taaaaggcac tgaagccgtc     1260

atcattccct atgcattgca cagagatccg agatacttcc ccaaccccga ggagttccag     1320

cctgagcggt tcttccccga gaatgcacaa gggcgccatc catatgccta cgtgcccttc     1380

tctgctggcc ccaggaactg tataggtcaa aagtttgctg tgatggaaga aaagaccatt     1440

ctttcgtgca tcctgaggca cttttggata gaatccaacc agaaaagaga agagcttggt     1500

ctagaaggac agttgattct tcgtccaagt aatggcatct ggatcaagtt gaagaggaga     1560

aatgcagatg aacgctaa                                                   1578


<210>  117
<211>  711
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(711)
<223>  Retinoschisin 1 (RS1)

<400>  117
atgagccgca agatagaagg ctttttgtta ttacttctct ttggctatga agccacattg       60

ggattatcgt ctaccgagga tgaaggcgag gacccctggt atcaaaaagc atgcgatgaa      120

ggcgaggacc cctggtatca aaaagcatgc aagtgcgatt gccaaggagg acccaatgct      180

ctgtggtctg caggtgccac ctccttggac tgtataccag aatgcccata tcacaagcct      240

ctgggtttcg agtcagggga ggtcacaccg gaccagatca cctgctctaa cccggagcag      300

tatgtgggct ggtattcttc gtggactgca aacaaggccc ggctcaacag tcaaggcttt      360

gggtgtgcct ggctctccaa gttccaggac agtagccagt ggttacagat agatctgaag      420

gagatcaaag tgatttcagg gatcctcacc caggggcgct gtgacatcga tgagtggatg      480

accaagtaca gcgtgcagta caggaccgat gagcgcctga actggattta ctacaaggac      540

cagactggaa acaaccgggt cttctatggc aactcggacc gcacctccac ggttcagaac      600

ctgctgcggc cccccatcat ctcccgcttc atccgcctca tcccgctggg ctggcacgtc      660

cgcattgcca tccggatgga gctgctggag tgcgtcagca agtgtgcctg a               711


<210>  118
<211>  2565
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(2565)
<223>  Phosphodiesterase 6B (PDE6B)

<400>  118
atgagcctca gtgaggagca ggcccggagc tttctggacc agaaccccga ttttgcccgc       60

cagtactttg ggaagaaact gagccctgag aatgtggccg cggcctgcga ggacgggtgc      120

ccgccggact gcgacagcct ccgggacctc tgccaggtgg aggagagcac ggcgctgctg      180

gagctggtgc aggatatgca ggagagcatc aacatggagc gcgtggtctt caaggtcctg      240

cggcgcctct gcaccctgct gcaggccgac cgctgcagcc tcttcatgta ccgccagcgc      300

aacggcgtgg ccgagctggc caccaggctt ttcagcgtgc agccggacag cgtcctggag      360

gactgcctgg tgccccccga ctccgagatc gtcttcccac tggacatcgg ggtcgtgggc      420

cacgtggctc agaccaaaaa gatggtgaac gtcgaggacg tggccgagtg ccctcacttc      480

agctcatttg ctgacgagct cactgactac aagacaaaga atatgctggc cacacccatc      540

atgaatggca aagacgtcgt ggcggtgatc atggcagtga acaagctcaa cggcccattc      600

ttcaccagcg aagacgaaga tgtgttcttg aagtacctga attttgccac gttgtacctg      660

aaaatctatc acctgagcta cctccacaac tgcgagacgc gccgcggcca ggtgctgctg      720

tggtcggcca acaaggtgtt tgaggagctg acggacatcg agaggcagtt ccacaaggcc      780

ttctacacgg tgcgggccta cctcaactgc gagcggtact ccgtgggcct cctggacatg      840

accaaggaga aggaattttt tgacgtgtgg tctgtgctga tgggagagtc ccagccgtac      900

tcgggcccac gcacgcctga tggccgggaa attgtcttct acaaagtgat cgactacatc      960

ctccacggca aggaggagat caaggtcatt cccacaccct cagccgatca ctgggccctg     1020

gccagcggcc ttccaagcta cgtggcagaa agcggcttta tttgtaacat catgaatgct     1080

tccgctgacg aaatgttcaa atttcaggaa ggggccctgg acgactccgg gtggctcatc     1140

aagaatgtgc tgtccatgcc catcgtcaac aagaaggagg agattgtggg agtcgccaca     1200

ttttacaaca ggaaagacgg gaagcccttt gacgaacagg acgaggttct catggagtcc     1260

ctgacacagt tcctgggctg gtcagtgatg aacaccgaca cctacgacaa gatgaacaag     1320

ctggagaacc gcaaggacat cgcacaggac atggtccttt accacgtgaa gtgcgacagg     1380

gacgagatcc agctcatcct gccaaccaga gcgcgcctgg ggaaggagcc tgctgactgc     1440

gatgaggacg agctgggcga aatcctgaag gaggagctgc cagggcccac cacatttgac     1500

atctacgaat tccacttctc tgacctggag tgcaccgaac tggacctggt caaatgtggc     1560

atccagatgt actacgagct gggcgtggtc cgaaagttcc agatccccca ggaggtcctg     1620

gtgcggttcc tgttctccat cagcaaaggc taccggagaa tcacctacca caactggcgc     1680

cacggcttca acgtggccca gacgatgttc acgctgctca tgaccggcaa actgaagagc     1740

tactacacgg acctggaggc cttcgccatg gtgacagccg gcctgtgcca tgacatcgac     1800

caccgcggca ccaacaacct gtaccagatg aagtcccaga accccttggc taaactccac     1860

ggctcctcga ttttggagcg gcaccacctg gagtttggga agttcctgct ctcggaggag     1920

accctgaaca tctaccagaa cctgaaccgg cggcagcacg agcacgtgat ccacctgatg     1980

gacatcgcca tcatcgccac ggacctggcc ctgtacttca agaagagagc gatgtttcag     2040

aagatcgtgg atgagtccaa gaactaccag gacaagaaga gctgggtgga gtacctgtcc     2100

ctggagacga cccggaagga gatcgtcatg gccatgatga tgacagcctg cgacctgtct     2160

gccatcacca agccctggga agtccagagc aaggtcgcac ttctcgtggc tgctgagttc     2220

tgggagcaag gtgacttgga aaggacagtc ttggatcagc agcccattcc tatgatggac     2280

cggaacaagg cggccgagct ccccaagctg caagtgggct tcatcgactt cgtgtgcaca     2340

ttcgtgtaca aggagttctc tcgtttccac gaagagatcc tgcccatgtt cgaccgactg     2400

cagaacaata ggaaagagtg gaaggcgctg gctgatgagt atgaggccaa agtgaaggct     2460

ctggaggaga aggaggagga ggagagggtg gcagccaaga aagtaggcac agaaatttgc     2520

aatggcggcc cagcacccaa gtcttcaacc tgctgtatcc tgtga                     2565


<210>  119
<211>  1497
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(1497)
<223>  Methyl-CpG Binding Protein 2 (MeCP2)

<400>  119
atggccgccg ccgccgccgc cgcgccgagc ggaggaggag gaggaggcga ggaggagaga       60

ctggaagaaa agtcagaaga ccaggacctc cagggcctca aggacaaacc cctcaagttt      120

aaaaaggtga agaaagataa gaaagaagag aaagagggca agcatgagcc cgtgcagcca      180

tcagcccacc actctgctga gcccgcagag gcaggcaaag cagagacatc agaagggtca      240

ggctccgccc cggctgtgcc ggaagcttct gcctccccca aacagcggcg ctccatcatc      300

cgtgaccggg gacccatgta tgatgacccc accctgcctg aaggctggac acggaagctt      360

aagcaaagga aatctggccg ctctgctggg aagtatgatg tgtatttgat caatccccag      420

ggaaaagcct ttcgctctaa agtggagttg attgcgtact tcgaaaaggt aggcgacaca      480

tccctggacc ctaatgattt tgacttcacg gtaactggga gagggagccc ctcccggcga      540

gagcagaaac cacctaagaa gcccaaatct cccaaagctc caggaactgg cagaggccgg      600

ggacgcccca aagggagcgg caccacgaga cccaaggcgg ccacgtcaga gggtgtgcag      660

gtgaaaaggg tcctggagaa aagtcctggg aagctccttg tcaagatgcc ttttcaaact      720

tcgccagggg gcaaggctga ggggggtggg gccaccacat ccacccaggt catggtgatc      780

aaacgccccg gcaggaagcg aaaagctgag gccgaccctc aggccattcc caagaaacgg      840

ggccgaaagc cggggagtgt ggtggcagcc gctgccgccg aggccaaaaa gaaagccgtg      900

aaggagtctt ctatccgatc tgtgcaggag accgtactcc ccatcaagaa gcgcaagacc      960

cgggagacgg tcagcatcga ggtcaaggaa gtggtgaagc ccctgctggt gtccaccctc     1020

ggtgagaaga gcgggaaagg actgaagacc tgtaagagcc ctgggcggaa aagcaaggag     1080

agcagcccca aggggcgcag cagcagcgcc tcctcacccc ccaagaagga gcaccaccac     1140

catcaccacc actcagagtc cccaaaggcc cccgtgccac tgctcccacc cctgccccca     1200

cctccacctg agcccgagag ctccgaggac cccaccagcc cccctgagcc ccaggacttg     1260

agcagcagcg tctgcaaaga ggagaagatg cccagaggag gctcactgga gagcgacggc     1320

tgccccaagg agccagctaa gactcagccc gcggttgcca ccgccgccac ggccgcagaa     1380

aagtacaaac accgagggga gggagagcgc aaagacattg tttcatcctc catgccaagg     1440

ccaaacagag aggagcctgt ggacagccgg acgcccgtga ccgagagagt tagctag        1497


<210>  120
<211>  2232
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(2232)
<223>  N-acetyl-alpha-glucosaminidase (NAGLU)

<400>  120
atggaggcgg tggcggtggc cgcggcggtg ggggtccttc tcctggccgg ggccgggggc       60

gcggcaggcg acgaggcccg ggaggcggcg gccgtgcggg cgctcgtggc ccggctgctg      120

gggccaggcc ccgcggccga cttctccgtg tcggtggagc gcgctctggc tgccaagccg      180

ggcttggaca cctacagcct gggcggcggc ggcgcggcgc gcgtgcgggt gcgcggctcc      240

acgggcgtgg cggccgccgc ggggctgcac cgctacctgc gcgacttctg tggctgccac      300

gtggcctggt ccggctctca gctgcgcctg ccgcggccac tgccagccgt gccgggggag      360

ctgaccgagg ccacgcccaa caggtaccgc tattaccaga atgtgtgcac gcaaagctac      420

tccttcgtgt ggtgggactg ggcccgctgg gagcgagaga tagactggat ggcgctgaat      480

ggcatcaacc tggcactggc ctggagcggc caggaggcca tctggcagcg ggtgtacctg      540

gccttgggcc tgacccaggc agagatcaat gagttcttta ctggtcctgc cttcctggcc      600

tgggggcgaa tgggcaacct gcacacctgg gatggccccc tgcccccctc ctggcacatc      660

aagcagcttt acctgcagca ccgggtcctg gaccagatgc gctccttcgg catgacccca      720

gtgctgcctg cattcgcggg gcatgttccc gaggctgtca ccagggtgtt ccctcaggtc      780

aatgtcacga agatgggcag ttggggccac tttaactgtt cctactcctg ctccttcctt      840

ctggctccgg aagaccccat attccccatc atcgggagcc tcttcctgcg agagctgatc      900

aaagagtttg gcacagacca catctatggg gccgacactt tcaatgagat gcagccacct      960

tcctcagagc cctcctacct tgccgcagcc accactgccg tctatgaggc catgactgca     1020

gtggatactg aggctgtgtg gctgctccaa ggctggctct tccagcacca gccgcagttc     1080

tgggggcccg cccagatcag ggctgtgctg ggagctgtgc cccgtggccg cctcctggtt     1140

ctggacctgt ttgctgagag ccagcctgtg tatacccgca ctgcctcctt ccagggccag     1200

cccttcatct ggtgcatgct gcacaacttt gggggaaacc atggtctttt tggagcccta     1260

gaggctgtga acggaggccc agaagctgcc cgcctcttcc ccaactccac catggtaggc     1320

acgggcatgg cccccgaggg catcagccag aacgaagtgg tctattccct catggctgag     1380

ctgggctggc gaaaggaccc agtgccagat ttggcagcct gggtgaccag ctttgccgcc     1440

cggcggtatg gggtctccca cccggacgca ggggcagcgt ggaggctact gctccggagt     1500

gtgtacaact gctccgggga ggcctgcagg ggccacaatc gtagcccgct ggtcaggcgg     1560

ccgtccctac agatgaatac cagcatctgg tacaaccgat ctgatgtgtt tgaggcctgg     1620

cggctgctgc tcacatctgc tccctccctg gccaccagcc ccgccttccg ctacgacctg     1680

ctggacctca ctcggcaggc agtgcaggag ctggtcagct tgtactatga ggaggcaaga     1740

agcgcctacc tgagcaagga gctggcctcc ctgttgaggg ctggaggcgt cctggcctat     1800

gagctgctgc cggcactgga cgaggtgctg gctagtgaca gccgcttctt gctgggcagc     1860

tggctagagc aggcccgagc agcggcagtc agtgaggccg aggccgattt ctacgagcag     1920

aacagccgct accagctgac cttgtggggg ccagaaggca acatcctgga ctatgccaac     1980

aagcagctgg cggggttggt ggccaactac tacacccctc gctggcggct tttcctggag     2040

gcgctggttg acagtgtggc ccagggcatc cctttccaac agcaccagtt tgacaaaaat     2100

gtcttccaac tggagcaggc cttcgttctc agcaagcaga ggtaccccag ccagccgcga     2160

ggagacactg tggacctggc caagaagatc ttcctcaaat attaccccgg ctgggtggcc     2220

ggctcttggt ga                                                         2232


<210>  121
<211>  1317
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(1317)
<223>  Ceroid Lipofuscinosis, Neuronal 3 (CLN3)

<400>  121
atgggaggct gtgcaggctc gcggcggcgc ttttcggatt ccgaggggga ggagaccgtc       60

ccggagcccc ggctccctct gttggaccat cagggcgcgc attggaagaa cgcggtgggc      120

ttctggctgc tgggcctttg caacaacttc tcttatgtgg tgatgctgag tgccgcccac      180

gacatcctta gccacaagag gacatcggga aaccagagcc atgtggaccc aggcccaacg      240

ccgatccccc acaacagctc atcacgattt gactgcaact ctgtctctac ggctgctgtg      300

ctcctggcgg acatcctccc cacactcgtc atcaaattgt tggctcctct tggccttcac      360

ctgctgccct acagcccccg ggttctcgtc agtgggattt gtgctgctgg aagcttcgtc      420

ctggttgcct tttctcattc tgtggggacc agcctgtgtg gtgtggtctt cgctagcatc      480

tcatcaggcc ttggggaggt caccttcctc tccctcactg ccttctaccc cagggccgtg      540

atctcctggt ggtcctcagg gactggggga gctgggctgc tgggggccct gtcctacctg      600

ggcctcaccc aggccggcct ctcccctcag cagaccctgc tgtccatgct gggtatccct      660

gccctgctgc tggccagcta tttcttgttg ctcacatctc ctgaggccca ggaccctgga      720

ggggaagaag aagcagagag cgcagcccgg cagcccctca taagaaccga ggccccggag      780

tcgaagccag gctccagctc cagcctctcc cttcgggaaa ggtggacagt gttcaagggt      840

ctgctgtggt acattgttcc cttggtcgta gtttactttg ccgagtattt cattaaccag      900

ggactttttg aactcctctt tttctggaac acttccctga gtcacgctca gcaataccgc      960

tggtaccaga tgctgtacca ggctggcgtc tttgcctccc gctcttctct ccgctgctgt     1020

cgcatccgtt tcacctgggc cctggccctg ctgcagtgcc tcaacctggt gttcctgctg     1080

gcagacgtgt ggttcggctt tctgccaagc atctacctcg tcttcctgat cattctgtat     1140

gaggggctcc tgggaggcgc agcctacgtg aacaccttcc acaacatcgc cctggagacc     1200

agtgatgagc accgggagtt tgcaatggcg gccacctgca tctctgacac actggggatc     1260

tccctgtcgg ggctcctggc tttgcctctg catgacttcc tctgccagct ctcctga        1317


<210>  122
<211>  1317
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-CLN3

<400>  122
atgggaggat gtgctgggtc aagaagacgg tttagcgatt ccgaaggaga ggagactgtg       60

cctgagccaa gactgcccct gctggatcac cagggagcac actggaagaa cgcagtggga      120

ttctggctgc tgggcctgtg caacaacttc agctacgtgg tcatgctgtc cgccgcccac      180

gacatcctgt cccacaagcg gacctccggc aatcagtctc acgtggaccc cggccctaca      240

ccaatccccc acaacagcag cagccggttc gactgtaatt ccgtgtctac cgcagccgtg      300

ctgctggcag acatcctgcc caccctggtc atcaagctgc tggcaccact gggcctgcac      360

ctgctgcctt attctccaag ggtgctggtg agcggcatct gcgcagcagg cagcttcgtg      420

ctggtggcct ttagccactc cgtgggcacc tctctgtgcg gagtggtgtt tgcaagcatc      480

agctccggcc tgggagaggt gaccttcctg agcctgacag ccttttaccc tcgcgccgtg      540

atctcctggt ggtctagcgg cacaggagga gcaggcctgc tgggcgccct gtcctatctg      600

ggcctgaccc aggcaggcct gtccccacag cagacactgc tgtctatgct gggcatccct      660

gccctgctgc tggcaagcta cttcctgctg ctgacctccc cagaggcaca ggaccccgga      720

ggagaggagg aggccgagag cgccgcaagg cagccactga tcaggaccga ggcaccagag      780

tccaagcctg gctcctctag ctccctgtct ctgcgggaga gatggacagt gttcaagggc      840

ctgctgtggt acatcgtgcc cctggtggtg gtgtacttcg ccgagtactt catcaaccag      900

ggcctgtttg agctgctgtt cttttggaat acctctctga gccacgccca gcagtaccgg      960

tggtatcaga tgctgtatca ggcaggcgtg ttcgcctccc ggtctagcct gagatgctgt     1020

cggatcagat tcacctgggc actggccctg ctgcagtgcc tgaacctggt gttcctgctg     1080

gccgacgtgt ggttcggctt tctgccctct atctacctgg tgtttctgat catcctgtat     1140

gagggcctgc tgggaggagc agcctatgtg aacaccttcc acaatatcgc cctggagaca     1200

tctgacgagc acagagagtt tgctatggcc gccacctgta tcagcgatac actgggcatc     1260

tctctgagcg gactgctggc tctgcctctg catgactttc tgtgccagct gagttaa        1317


<210>  123
<211>  2859
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(2859)
<223>  Acid Alpha-Glucosidase (GAA)

<400>  123
atgggagtga ggcacccgcc ctgctcccac cggctcctgg ccgtctgcgc cctcgtgtcc       60

ttggcaaccg ctgcactcct ggggcacatc ctactccatg atttcctgct ggttccccga      120

gagctgagtg gctcctcccc agtcctggag gagactcacc cagctcacca gcagggagcc      180

agcagaccag ggccccggga tgcccaggca caccccggcc gtcccagagc agtgcccaca      240

cagtgcgacg tcccccccaa cagccgcttc gattgcgccc ctgacaaggc catcacccag      300

gaacagtgcg aggcccgcgg ctgttgctac atccctgcaa agcaggggct gcagggagcc      360

cagatggggc agccctggtg cttcttccca cccagctacc ccagctacaa gctggagaac      420

ctgagctcct ctgaaatggg ctacacggcc accctgaccc gtaccacccc caccttcttc      480

cccaaggaca tcctgaccct gcggctggac gtgatgatgg agactgagaa ccgcctccac      540

ttcacgatca aagatccagc taacaggcgc tacgaggtgc ccttggagac cccgcatgtc      600

cacagccggg caccgtcccc actctacagc gtggagttct ccgaggagcc cttcggggtg      660

atcgtgcgcc ggcagctgga cggccgcgtg ctgctgaaca cgacggtggc gcccctgttc      720

tttgcggacc agttccttca gctgtccacc tcgctgccct cgcagtatat cacaggcctc      780

gccgagcacc tcagtcccct gatgctcagc accagctgga ccaggatcac cctgtggaac      840

cgggaccttg cgcccacgcc cggtgcgaac ctctacgggt ctcacccttt ctacctggcg      900

ctggaggacg gcgggtcggc acacggggtg ttcctgctaa acagcaatgc catggatgtg      960

gtcctgcagc cgagccctgc ccttagctgg aggtcgacag gtgggatcct ggatgtctac     1020

atcttcctgg gcccagagcc caagagcgtg gtgcagcagt acctggacgt tgtgggatac     1080

ccgttcatgc cgccatactg gggcctgggc ttccacctgt gccgctgggg ctactcctcc     1140

accgctatca cccgccaggt ggtggagaac atgaccaggg cccacttccc cctggacgtc     1200

cagtggaacg acctggacta catggactcc cggagggact tcacgttcaa caaggatggc     1260

ttccgggact tcccggccat ggtgcaggag ctgcaccagg gcggccggcg ctacatgatg     1320

atcgtggatc ctgccatcag cagctcgggc cctgccggga gctacaggcc ctacgacgag     1380

ggtctgcgga ggggggtttt catcaccaac gagaccggcc agccgctgat tgggaaggta     1440

tggcccgggt ccactgcctt ccccgacttc accaacccca cagccctggc ctggtgggag     1500

gacatggtgg ctgagttcca tgaccaggtg cccttcgacg gcatgtggat tgacatgaac     1560

gagccttcca acttcatcag gggctctgag gacggctgcc ccaacaatga gctggagaac     1620

ccaccctacg tgcctggggt ggttgggggg accctccagg cggccaccat ctgtgcctcc     1680

agccaccagt ttctctccac acactacaac ctgcacaacc tctacggcct gaccgaagcc     1740

atcgcctccc acagggcgct ggtgaaggct cgggggacac gcccatttgt gatctcccgc     1800

tcgacctttg ctggccacgg ccgatacgcc ggccactgga cgggggacgt gtggagctcc     1860

tgggagcagc tcgcctcctc cgtgccagaa atcctgcagt ttaacctgct gggggtgcct     1920

ctggtcgggg ccgacgtctg cggcttcctg ggcaacacct cagaggagct gtgtgtgcgc     1980

tggacccagc tgggggcctt ctaccccttc atgcggaacc acaacagcct gctcagtctg     2040

ccccaggagc cgtacagctt cagcgagccg gcccagcagg ccatgaggaa ggccctcacc     2100

ctgcgctacg cactcctccc ccacctctac acactgttcc accaggccca cgtcgcgggg     2160

gagaccgtgg cccggcccct cttcctggag ttccccaagg actctagcac ctggactgtg     2220

gaccaccagc tcctgtgggg ggaggccctg ctcatcaccc cagtgctcca ggccgggaag     2280

gccgaagtga ctggctactt ccccttgggc acatggtacg acctgcagac ggtgccagta     2340

gaggcccttg gcagcctccc acccccacct gcagctcccc gtgagccagc catccacagc     2400

gaggggcagt gggtgacgct gccggccccc ctggacacca tcaacgtcca cctccgggct     2460

gggtacatca tccccctgca gggccctggc ctcacaacca cagagtcccg ccagcagccc     2520

atggccctgg ctgtggccct gaccaagggt ggggaggccc gaggggagct gttctgggac     2580

gatggagaga gcctggaagt gctggagcga ggggcctaca cacaggtcat cttcctggcc     2640

aggaataaca cgatcgtgaa tgagctggta cgtgtgacca gtgagggagc tggcctgcag     2700

ctgcagaagg tgactgtcct gggcgtggcc acggcgcccc agcaggtcct ctccaacggt     2760

gtccctgtct ccaacttcac ctacagcccc gacaccaagg tcctggacat ctgtgtctcg     2820

ctgttgatgg gagagcagtt tctcgtcagc tggtgttag                            2859


<210>  124
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-GAA

<400>  124
atgggagtcc gccacccgcc ctgctcacat cgcctgcttg ctgtctgtgc cctcgtgtca       60

cttgctaccg ccgcgctgct tggtcacatt ctgctgcacg actttttact agttccgagg      120

gaactgtcgg gatccagccc cgtgctcgag gaaactcacc ccgcgcacca acagggggcg      180

tccaggccgg gaccgcgcga cgcccaggcc cacccgggcc ggcctcgggc cgtgccaact      240

cagtgcgatg tgccgccgaa ctcccgcttc gactgtgcgc ctgacaaggc cataacccag      300

gaacagtgcg aagcacgcgg ctgctgctat attccggcga agcagggctt gcagggtgcc      360

caaatgggtc agccttggtg cttctttccc ccgtcgtacc cctcgtacaa gctggagaac      420

ctgagcagca gcgaaatggg gtacaccgcc actctgaccc ggacgacccc gaccttcttc      480

ccgaaagaca tcctgaccct gcggctggat gtgatgatgg aaactgagaa cagactgcac      540

ttcactatca aggaccccgc gaaccgcaga tatgaggtgc cactggaaac ccctcatgtg      600

cattcccggg ccccatcccc tctgtactcg gtggaattct ccgaagaacc cttcggggtc      660

attgtgcgcc ggcagcttga tggccgggtc ctgctcaaca ccaccgtggc accccttttc      720

ttcgctgacc agttcctcca gctgagcacc tcgctgccga gccagtacat caccggactg      780

gccgagcacc tctcccctct gatgctgtcc actagctgga ctaggatcac tctgtggaac      840

cgggatctgg cccctacccc gggcgcgaac ctgtacggat cgcacccctt ctacctggcc      900

ctcgaggacg gaggctccgc ccacggagtg ttcctgctga actccaacgc tatggacgtg      960

gtgctccagc cgtcccctgc actgtcctgg cggagcacag ggggtattct ggatgtctac     1020

atcttcctcg gcccggagcc aaagtccgtg gtgcaacagt atctggatgt cgtgggttac     1080

ccattcatgc cgccatactg gggccttggc ttccacctgt gccgctgggg atacagctcc     1140

accgccatca ctagacaggt cgtggaaaac atgactagag cccacttccc cctcgatgtc     1200

cagtggaatg acctggacta catggattcc agacgcgact tcactttcaa caaggatgga     1260

ttcagagatt tccccgctat ggtccaagaa ctgcaccagg gtggccggcg gtacatgatg     1320

attgtggacc ccgccatttc aagctccgga ccagcgggct cgtaccggcc ctacgacgaa     1380

ggtttgcgcc gcggcgtgtt catcactaac gaaaccggcc agccactgat tgggaaggtc     1440

tggcctggaa gcaccgcgtt cccggacttc actaacccaa cggccttggc gtggtgggag     1500

gacatggtgg ccgaattcca cgaccaagtc ccattcgacg gaatgtggat cgacatgaac     1560

gagcccagca acttcatccg aggctccgag gacggctgcc ctaacaacga acttgagaac     1620

cctccgtacg tgcctggcgt cgtcggcgga acactgcagg ccgctacgat ctgtgcctca     1680

tcgcatcagt tcctgtcaac ccactacaac ctccataatc tgtacggcct caccgaagcc     1740

atcgcctccc accgggccct ggtcaaggcc cgggggacta ggcccttcgt gattagccgg     1800

agcactttcg ccggacacgg aagatacgcc ggacattgga ccggcgacgt gtggtcatcg     1860

tgggagcagc tcgcctcctc cgtccccgaa atcctgcagt tcaatctcct gggagtcccc     1920

ctcgtgggcg cggacgtgtg cggattcctg ggcaatacct ctgaggagct gtgcgtgaga     1980

tggacccagc tgggggcgtt ctaccccttc atgcggaacc acaactcact gctgtccctg     2040

cctcaagagc cgtactcatt ctccgagccg gcacaacagg ccatgcgaaa ggctctgacc     2100

ctccgctatg cgctcttgcc ccacctctac actctgtttc accaagccca tgtcgcgggc     2160

gaaacagtgg ccagaccact ctttctggaa ttcccaaagg actcctcaac ctggactgtg     2220

gatcatcagc tgctctgggg agaggcactg ctgatcaccc cggtgctcca agccggaaag     2280

gcggaagtga ccggatactt ccctctcggt acttggtacg acctccaaac cgtgccggtc     2340

gaggccctgg gcagcttgcc tccgccgccg gctgccccgc gggagcctgc aatccactcc     2400

gaggggcaat gggtgaccct ccctgcacca ctggacacca tcaacgtgca cctccgggcc     2460

ggctacatca tcccgctgca aggaccgggt ctgactacca ccgaatcccg gcagcagccc     2520

atggcactgg ccgtggccct gaccaaggga ggggaagcac ggggagaact cttttgggac     2580

gatggagaat ccctggaagt gctcgagcgg ggagcctaca ctcaagtcat ctttcttgcc     2640

cgcaacaaca ccatcgtgaa cgaattggtc cgcgtgacct ccgagggggc cggactccag     2700

ctgcaaaaag tgaccgtgct gggggtggca accgccccgc aacaagtgtt gtctaacgga     2760

gtgccggtgt ccaacttcac ctactcccct gataccaaag ttctagatat ttgcgtgagc     2820

ctgctgatgg gagaacagtt cctggtgtcc tggtgctga                            2859


<210>  125
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO2-GAA

<400>  125
atgggagtta gacaccctcc atgtagccac agactgctgg ccgtgtgtgc tctggtgtct       60

ctggctacag ctgccctgct gggacatatc ctgctgcacg acttcttact agttcccaga      120

gagctgtccg gcagcagccc tgtgctggaa gaaacacacc ctgcacatca gcagggcgcc      180

tctagacctg gacctagaga tgctcaggcc catcctggca gacctagagc tgtgcccaca      240

cagtgtgacg tgccacctaa cagcagattc gactgcgccc ctgacaaggc catcacacaa      300

gagcagtgtg aagccagagg ctgctgctac atccctgcca aacaaggact gcagggcgct      360

cagatgggac agccctggtg cttcttccca ccatcttacc ccagctacaa gctggaaaac      420

ctgagcagca gcgagatggg ctacaccgcc acactgacca gaaccacacc tacattcttc      480

ccgaaggaca tcctgacact gcggctggac gtgatgatgg aaaccgagaa ccggctgcac      540

ttcaccatca aggaccccgc caatcggaga tacgaggtgc cactggaaac ccctcacgtg      600

cactctagag ccccatctcc actgtacagc gtggaattca gcgaggaacc cttcggcgtg      660

atcgtgcgga gacagctgga tggaagagtg ctgctgaaca ccacagtggc ccctctgttc      720

ttcgccgacc agtttctgca gctgtccacc agcctgccta gccagtatat cacaggcctg      780

gccgagcacc tgtctccact gatgctgtct accagctgga cccggatcac cctgtggaac      840

agggatcttg ctcctacacc tggcgccaac ctgtacggct ctcacccttt ttatctggcc      900

ctggaagatg gcggatctgc ccacggtgtc tttctgctga actccaacgc catggacgtg      960

gtgctgcagc catctcctgc tctgtcttgg agaagcacag gcggcatcct ggacgtgtac     1020

atctttctgg gccccgagcc taagagcgtg gtgcagcagt atctggacgt cgtgggctac     1080

cccttcatgc ctccttattg gggcctgggc ttccacctgt gcagatgggg atacagcagc     1140

accgccatca ccagacaggt ggtggaaaac atgacccggg ctcacttccc actggatgtg     1200

cagtggaacg acctggacta catggacagc agacgggact tcaccttcaa caaggacggc     1260

ttcagagact tccccgccat ggtgcaagaa ctgcaccaag gcggcagacg gtacatgatg     1320

atcgtggatc cagccatcag ctctagcggc cctgccggct cttacagacc ttacgatgag     1380

ggcctgagaa gaggcgtgtt catcaccaac gagacaggcc agcctctgat cggcaaagtg     1440

tggcctggca gcacagcctt tccagacttc acaaacccca ccgctctggc ttggtgggaa     1500

gatatggtgg ccgagtttca cgatcaggtg cccttcgacg gcatgtggat cgacatgaac     1560

gagcccagca acttcatccg gggcagcgag gatggctgcc ccaacaacga actggaaaat     1620

cctccttacg tgcccggcgt tgtcggcgga acacttcagg ccgctacaat ctgtgccagc     1680

agccaccagt tcctcagcac ccactacaac ctgcacaatc tgtatggcct gaccgaggcc     1740

attgccagcc atagagccct ggttaaggcc aggggcacca gacctttcgt gatcagcaga     1800

agcaccttcg ccggccacgg cagatatgcc ggacattgga caggcgacgt gtggtctagt     1860

tgggagcagc tggctagcag cgtgccagag atcctgcagt tcaatctgct gggcgtgcca     1920

ctcgtgggag ccgatgtttg tggcttcctg ggcaacacct ccgaggaact gtgtgtgcgt     1980

tggacacagc tgggcgcctt ctatcccttc atgagaaacc acaacagcct tctcagcctg     2040

ccacaagagc cctacagctt ctctgagcct gcacagcagg ccatgagaaa ggccctgact     2100

ctgagatacg ctctgctgcc ccacctgtac accctgtttc accaggctca tgtggccggg     2160

gagacagtgg ctagacctct gttcctggaa ttccccaagg acagctccac ctggaccgtg     2220

gatcatcagc tgctgtgggg agaagccctg ctcatcacac ctgttctgca ggccggaaag     2280

gccgaagtga ccggctattt tcctctcggc acttggtacg acctgcagac cgtgcctgtt     2340

gaggctctgg gatctcttcc tccacctcct gccgctccta gagagcctgc cattcactct     2400

gaaggccagt gggttaccct gcctgctcct ctggacacca tcaacgtgca cctgagagct     2460

ggctacatca tccctctgca aggccctggc ctgacaacca ccgaatctag acagcagccc     2520

atggctctgg ccgtggcttt gacaaaaggc ggagaggcta gaggcgagct gttctgggat     2580

gatggcgaga gcctggaagt gctggaacgg ggcgcttata cccaagtgat cttcctggcc     2640

agaaacaaca ccatcgtgaa cgaactcgtg cgcgtgacca gtgaaggtgc tggactgcaa     2700

ctgcagaaag tgaccgtgct cggagtggcc acagcacctc agcaggttct gtctaatggc     2760

gtgcccgtgt ccaacttcac atacagcccc gacaccaagg tcctggacat ctgtgtgtca     2820

ctgctgatgg gcgagcagtt cctggtgtcc tggtgttga                            2859


<210>  126
<211>  2859
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO3-GAA

<400>  126
atgggggtga gacacccccc ctgcagccac agactgctgg ctgtgtgtgc cctggtgagc       60

ctggccacag ctgccctgct gggccacatc ctgctgcatg acttcctact agtgcccaga      120

gagctgagtg gcagcagccc tgtgctggag gagacccacc ctgcccacca gcagggggcc      180

agcagacctg gccccagaga tgcccaggcc caccctggca gacccagagc tgtgcccacc      240

cagtgtgatg tgccccccaa cagcagattt gactgtgccc ctgacaaggc catcacccag      300

gagcagtgtg aggccagagg ctgctgctac atccctgcca agcagggcct gcagggggcc      360

cagatgggcc agccctggtg cttcttcccc cccagctacc ccagctacaa gctggagaac      420

ctgagcagca gtgagatggg ctacacagcc accctgacca gaaccacccc caccttcttc      480

cccaaggaca tcctgaccct gagactggat gtgatgatgg agacagagaa cagactgcac      540

ttcaccatca aggaccctgc caacagaaga tatgaggtgc ccctggagac cccccatgtg      600

cacagcagag cccccagccc cctgtacagt gtggagttca gtgaggagcc ctttggggtg      660

attgtgagaa gacagctgga tggcagagtg ctgctgaaca ccacagtggc ccccctgttc      720

tttgctgacc agttcctgca gctgagcacc agcctgccca gccagtacat cacaggcctg      780

gctgagcacc tgagccccct gatgctgagc accagctgga ccagaatcac cctgtggaac      840

agagacctgg cccccacccc tggggccaac ctgtatggca gccacccctt ctacctggcc      900

ctggaggatg ggggcagtgc ccatggggtg ttcctgctga acagcaatgc catggatgtg      960

gtgctgcagc ccagccctgc cctgagctgg agaagcacag ggggcatcct ggatgtgtac     1020

atcttcctgg gccctgagcc caagagtgtg gtgcagcagt acctggatgt ggtgggctac     1080

cccttcatgc ccccctactg gggcctgggc ttccacctgt gcagatgggg ctacagcagc     1140

acagccatca ccagacaggt ggtggagaac atgaccagag cccacttccc cctggatgtg     1200

cagtggaatg acctggacta catggacagc agaagagact tcaccttcaa caaggatggc     1260

ttcagagact tccctgccat ggtgcaggag ctgcaccagg ggggcagaag atacatgatg     1320

attgtggacc ctgccatcag cagcagtggc cctgctggca gctacagacc ctatgatgag     1380

ggcctgagaa gaggggtgtt catcaccaat gagacaggcc agcccctgat tggcaaggtg     1440

tggcctggca gcacagcctt ccctgacttc accaacccca cagccctggc ctggtgggag     1500

gacatggtgg ctgagttcca tgaccaggtg ccctttgatg gcatgtggat tgacatgaat     1560

gagcccagca acttcatcag aggcagtgag gatggctgcc ccaacaatga gctggagaac     1620

cccccctatg tgcctggggt ggtggggggc accctgcagg ctgccaccat ctgtgccagc     1680

agccaccagt tcctgagcac ccactacaac ctgcacaacc tgtatggcct gacagaggcc     1740

attgccagcc acagagccct ggtgaaggcc agaggcacca gaccctttgt gatcagcaga     1800

agcacctttg ctggccatgg cagatatgct ggccactgga caggggatgt gtggagcagc     1860

tgggagcagc tggccagcag tgtgcctgag atcctgcagt tcaacctgct gggggtgccc     1920

ctggtggggg ctgatgtgtg tggcttcctg ggcaacacca gtgaggagct gtgtgtgaga     1980

tggacccagc tgggggcctt ctaccccttc atgagaaacc acaacagcct gctgagcctg     2040

ccccaggagc cctacagctt cagtgagcct gcccagcagg ccatgagaaa ggccctgacc     2100

ctgagatatg ccctgctgcc ccacctgtac accctgttcc accaggccca tgtggctggg     2160

gagacagtgg ccagacccct gttcctggag ttccccaagg acagcagcac ctggacagtg     2220

gaccaccagc tgctgtgggg ggaggccctg ctgatcaccc ctgtgctgca ggctggcaag     2280

gctgaggtga caggctactt ccccctgggc acctggtatg acctgcagac agtgcctgtg     2340

gaggccctgg gcagcctgcc ccccccccct gctgccccca gagagcctgc catccacagt     2400

gagggccagt gggtgaccct gcctgccccc ctggacacca tcaatgtgca cctgagagct     2460

ggctacatca tccccctgca gggccctggc ctgaccacca cagagagcag acagcagccc     2520

atggccctgg ctgtggccct gaccaagggg ggggaggcca gaggggagct gttctgggat     2580

gatggggaga gcctggaggt gctggagaga ggggcctaca cccaggtgat cttcctggcc     2640

agaaacaaca ccattgtgaa tgagctggtg agagtgacca gtgagggggc tggcctgcag     2700

ctgcagaagg tgacagtgct gggggtggcc acagcccccc agcaggtgct gagcaatggg     2760

gtgcctgtga gcaacttcac ctacagccct gacaccaagg tgctggacat ctgtgtgagc     2820

ctgctgatgg gggagcagtt cctggtgagc tggtgctga                            2859


<210>  127
<211>  1290
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<222>  (1)..(1290)
<223>  Alpha-Galactosidase A (GLA)

<400>  127
atgcagctga ggaacccaga actacatctg ggctgcgcgc ttgcgcttcg cttcctggcc       60

ctcgtttcct gggacatccc tggggctaga gcactggaca atggattggc aaggacgcct      120

accatgggct ggctgcactg ggagcgcttc atgtgcaacc ttgactgcca ggaagagcca      180

gattcctgca tcagtgagaa gctcttcatg gagatggcag agctcatggt ctcagaaggc      240

tggaaggatg caggttatga gtacctctgc attgatgact gttggatggc tccccaaaga      300

gattcagaag gcagacttca ggcagaccct cagcgctttc ctcatgggat tcgccagcta      360

gctaattatg ttcacagcaa aggactgaag ctagggattt atgcagatgt tggaaataaa      420

acctgcgcag gcttccctgg gagttttgga tactacgaca ttgatgccca gacctttgct      480

gactggggag tagatctgct aaaatttgat ggttgttact gtgacagttt ggaaaatttg      540

gcagatggtt ataagcacat gtccttggcc ctgaatagga ctggcagaag cattgtgtac      600

tcctgtgagt ggcctcttta tatgtggccc tttcaaaagc ccaattatac agaaatccga      660

cagtactgca atcactggcg aaattttgct gacattgatg attcctggaa aagtataaag      720

agtatcttgg actggacatc ttttaaccag gagagaattg ttgatgttgc tggaccaggg      780

ggttggaatg acccagatat gttagtgatt ggcaactttg gcctcagctg gaatcagcaa      840

gtaactcaga tggccctctg ggctatcatg gctgctcctt tattcatgtc taatgacctc      900

cgacacatca gccctcaagc caaagctctc cttcaggata aggacgtaat tgccatcaat      960

caggacccct tgggcaagca agggtaccag cttagacagg gagacaactt tgaagtgtgg     1020

gaacgacctc tctcaggctt agcctgggct gtagctatga taaaccggca ggagattggt     1080

ggacctcgct cttataccat cgcagttgct tccctgggta aaggagtggc ctgtaatcct     1140

gcctgcttca tcacacagct cctccctgtg aaaaggaagc tagggttcta tgaatggact     1200

tcaaggttaa gaagtcacat aaatcccaca ggcactgttt tgcttcagct agaaaataca     1260

atgcagatgt cattaaaaga cttactttaa                                      1290


<210>  128
<211>  1290
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO1-GLA

<400>  128
atgcagctga gaaatcctga actgcacctg ggctgtgccc tggctctgag atttctggct       60

ctggtgtcct gggacattcc tggcgctaga gccctggata atggcctggc cagaacacct      120

acaatgggct ggctgcactg ggagagattc atgtgcaacc tggactgcca agaggaaccc      180

gacagctgca tcagcgagaa gctgttcatg gaaatggccg agctgatggt gtccgaaggc      240

tggaaggatg ccggctacga gtacctgtgc atcgacgatt gctggatggc ccctcagaga      300

gattctgagg gcagactgca ggccgatcct cagagatttc ctcacggaat ccggcagctg      360

gccaactacg tgcactctaa gggactgaag ctgggcatct acgccgacgt gggcaacaag      420

acatgtgccg gctttccagg cagcttcggc tactacgata tcgacgccca gacctttgcc      480

gattggggcg tcgacctgct gaagttcgat ggctgctact gcgacagcct ggaaaacctg      540

gccgacggct acaaacacat gtctctggcc ctgaaccgga ccggcagatc tatcgtgtac      600

tcttgcgagt ggcccctgta catgtggccc ttccagaagc ctaactacac cgagatcaga      660

cagtactgca accactggcg gaacttcgcc gacatcgatg acagctggaa gtccatcaag      720

agcatcctgg actggaccag cttcaatcaa gagcggatcg tggatgtggc tggcccaggc      780

ggatggaacg atcctgatat gctggtcatc ggcaacttcg gcctgagctg gaatcagcaa      840

gtgacccaga tggccctgtg ggccattatg gccgctcctc tgttcatgag caacgacctg      900

agacacatca gccctcaggc caaggctctg ctgcaggata aggacgtgat cgccatcaac      960

caggatcctc tgggcaagca gggctatcag ctgagacagg gcgacaattt cgaagtgtgg     1020

gaaagacctc tgagcggcct ggcttgggcc gtcgccatga tcaatagaca agagatcggc     1080

ggaccccggt cctatacaat tgccgtggct tctctcggaa aaggcgtggc ctgcaatcct     1140

gcctgcttta tcacacagct gctccccgtg aagagaaagc tgggctttta cgagtggacc     1200

agcagactga gatcccacat caaccccaca ggcactgttc tgctgcaact ggaaaacaca     1260

atgcagatga gcctgaagga cctgctgtag                                      1290


<210>  129
<211>  1377
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized + GET, CO1-GLA-GET

<400>  129
atgcagctga gaaatcctga actgcacctg ggctgtgccc tggctctgag atttctggct       60

ctggtgtcct gggacattcc tggcgctaga gccctggata atggcctggc cagaacacct      120

acaatgggct ggctgcactg ggagagattc atgtgcaacc tggactgcca agaggaaccc      180

gacagctgca tcagcgagaa gctgttcatg gaaatggccg agctgatggt gtccgaaggc      240

tggaaggatg ccggctacga gtacctgtgc atcgacgatt gctggatggc ccctcagaga      300

gattctgagg gcagactgca ggccgatcct cagagatttc ctcacggaat ccggcagctg      360

gccaactacg tgcactctaa gggactgaag ctgggcatct acgccgacgt gggcaacaag      420

acatgtgccg gctttccagg cagcttcggc tactacgata tcgacgccca gacctttgcc      480

gattggggcg tcgacctgct gaagttcgat ggctgctact gcgacagcct ggaaaacctg      540

gccgacggct acaaacacat gtctctggcc ctgaaccgga ccggcagatc tatcgtgtac      600

tcttgcgagt ggcccctgta catgtggccc ttccagaagc ctaactacac cgagatcaga      660

cagtactgca accactggcg gaacttcgcc gacatcgatg acagctggaa gtccatcaag      720

agcatcctgg actggaccag cttcaatcaa gagcggatcg tggatgtggc tggcccaggc      780

ggatggaacg atcctgatat gctggtcatc ggcaacttcg gcctgagctg gaatcagcaa      840

gtgacccaga tggccctgtg ggccattatg gccgctcctc tgttcatgag caacgacctg      900

agacacatca gccctcaggc caaggctctg ctgcaggata aggacgtgat cgccatcaac      960

caggatcctc tgggcaagca gggctatcag ctgagacagg gcgacaattt cgaagtgtgg     1020

gaaagacctc tgagcggcct ggcttgggcc gtcgccatga tcaatagaca agagatcggc     1080

ggaccccggt cctatacaat tgccgtggct tctctcggaa aaggcgtggc ctgcaatcct     1140

gcctgcttta tcacacagct gctccccgtg aagagaaagc tgggctttta cgagtggacc     1200

agcagactga gatcccacat caaccccaca ggcactgttc tgctgcaact ggaaaacaca     1260

atgcagatga gcctgaagga cctgctgcgg agaagaagaa ggcgcagacg caagcgcaag     1320

aagaaaggca aaggcctcgg caagaagcgg gacccctgtc tgagaaagta caagtaa        1377


<210>  130
<211>  1290
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO2-GLA

<400>  130
atgcagctga gaaaccctga gctgcacctg ggctgtgccc tggccctgag attcctggcc       60

ctggtgagct gggacatccc tggggccaga gccctggaca atgggctagc cagaaccccc      120

accatgggct ggctgcactg ggagagattc atgtgcaacc tggactgcca ggaggagcct      180

gacagctgca tcagtgagaa gctgttcatg gagatggctg agctgatggt gagtgagggc      240

tggaaggatg ctggctatga gtacctgtgc attgatgact gctggatggc cccccagaga      300

gacagtgagg gcagactgca ggctgacccc cagagattcc cccatggcat cagacagctg      360

gccaactatg tgcacagcaa gggcctgaag ctgggcatct atgctgatgt gggcaacaag      420

acctgtgctg gcttccctgg cagctttggc tactatgaca ttgatgccca gacctttgct      480

gactgggggg tggacctgct gaagtttgat ggctgctact gtgacagcct ggagaacctg      540

gctgatggct acaagcacat gagcctggcc ctgaacagaa caggcagaag cattgtgtac      600

agctgtgagt ggcccctgta catgtggccc ttccagaagc ccaactacac agagatcaga      660

cagtactgca accactggag aaactttgct gacattgatg acagctggaa gagcatcaag      720

agcatcctgg actggaccag cttcaaccag gagagaattg tggatgtggc tggccctggg      780

ggctggaatg accctgacat gctggtgatt ggcaactttg gcctgagctg gaaccagcag      840

gtgacccaga tggccctgtg ggccatcatg gctgcccccc tgttcatgag caatgacctg      900

agacacatca gcccccaggc caaggccctg ctgcaggaca aggatgtgat tgccatcaac      960

caggaccccc tgggcaagca gggctaccag ctgagacagg gggacaactt tgaggtgtgg     1020

gagagacccc tgagtggcct ggcctgggct gtggccatga tcaacagaca ggagattggg     1080

ggccccagaa gctacaccat tgctgtggcc agcctgggca agggggtggc ctgcaaccct     1140

gcctgcttca tcacccagct gctgcctgtg aagagaaagc tgggcttcta tgagtggacc     1200

agcagactga gaagccacat caaccccaca ggcacagtgc tgctgcagct ggagaacacc     1260

atgcagatga gcctgaagga cctgctgtga                                      1290


<210>  131
<211>  1290
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized CO3-GLA

<400>  131
atgcagctga gaaaccctga gctgcacctg ggctgtgccc tggccctgag attcctggcc       60

ctggtgagct gggacatccc tggggccaga gccctggaca atgggctagc cagaaccccc      120

accatgggct ggctgcactg ggagagattc atgtgcaacc tggactgcca ggaggagcct      180

gacagctgca tcagtgagaa gctgttcatg gagatggctg agctgatggt gagtgagggc      240

tggaaggatg ctggctatga gtacctgtgc attgatgact gctggatggc cccccagaga      300

gacagtgagg gcagactgca ggctgacccc cagagattcc cccatggcat cagacagctg      360

gccaactatg tgcacagcaa gggcctgaag ctgggcatct atgctgatgt gggcaacaag      420

acctgtgctg gcttccctgg cagctttggc tactatgaca ttgatgccca gacctttgct      480

gactgggggg tggacctgct gaagtttgat ggctgctact gtgacagcct ggagaacctg      540

gctgatggct acaagcacat gagcctggcc ctgaacagaa caggcagaag cattgtgtac      600

agctgtgagt ggcccctgta catgtggccc ttccagaagc ccaactacac agagatcaga      660

cagtactgca accactggag aaactttgct gacattgatg acagctggaa gagcatcaag      720

agcatcctgg actggaccag cttcaaccag gagagaattg tggatgtggc tggccctggg      780

ggctggaatg accctgacat gctggtgatt ggcaactttg gcctgagctg gaaccagcag      840

gtgacccaga tggccctgtg ggccatcatg gctgcccccc tgttcatgag caatgacctg      900

agacacatca gcccccaggc caaggccctg ctgcaggaca aggatgtgat tgccatcaac      960

caggaccccc tgggcaagca gggctaccag ctgagacagg gggacaactt tgaggtgtgg     1020

gagagacccc tgagtggcct ggcctgggct gtggccatga tcaacagaca ggagattggg     1080

ggccccagaa gctacaccat tgctgtggct tccctgggta aaggagtggc ctgtaatcct     1140

gcctgcttca tcacacagct cctccctgtg aaaaggaagc tagggttcta tgaatggact     1200

tcaaggttaa gaagtcacat aaatcccaca ggcactgttt tgcttcagct agaaaataca     1260

atgcagatgt cattaaaaga cttactttaa                                      1290


<210>  132
<211>  4287
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized, Cystic Fibrosis Transmembrane Conductance 
       Regulator deltaR (CFTRdeltaR), contains R domain deletion

<400>  132
atgcagagaa gccccctgga gaaggcctct gtggtgagca agctgttctt cagctggacc       60

agacccatcc tgagaaaggg ctacagacag agactggagc tgtctgacat ctaccagatc      120

ccctctgtgg actctgctga caacctgtct gagaagctgg agagagagtg ggacagagag      180

ctggccagca agaagaaccc caagctgatc aatgccctga gaagatgctt cttctggaga      240

ttcatgttct atggcatctt cctgtacctg ggggaggtga ccaaggctgt gcagcccctg      300

ctgctgggca gaatcattgc cagctatgac cctgacaaca aggaggagag aagcattgcc      360

atctacctgg gcattggcct gtgcctgctg ttcattgtga gaaccctgct gctgcaccct      420

gccatctttg gcctgcacca cattggcatg cagatgagaa ttgccatgtt cagcctgatc      480

tacaagaaga ccctgaagct gagcagcaga gtgctggaca agatcagcat tggccagctg      540

gtgagcctgc tgagcaacaa cctgaacaag tttgatgagg gcctggccct ggcccacttt      600

gtgtggattg cccccctgca ggtggccctg ctgatgggcc tgatctggga gctgctgcag      660

gcctctgcct tctgtggcct gggcttcctg attgtgctgg ccctgttcca ggctggcctg      720

ggcagaatga tgatgaagta cagagaccag agagctggca agatctctga gagactggtg      780

atcacctctg agatgattga gaacatccag tctgtgaagg cctactgctg ggaggaggcc      840

atggagaaga tgattgagaa cctgagacag acagagctga agctgaccag aaaggctgcc      900

tatgtgagat acttcaacag ctctgccttc ttcttctctg gcttctttgt ggtgttcctg      960

tctgtgctgc cctatgccct gatcaagggc atcatcctga gaaagatctt caccaccatc     1020

agcttctgca ttgtgctgag aatggctgtg accagacagt tcccctgggc tgtgcagacc     1080

tggtatgaca gcctgggggc catcaacaag atccaggact tcctgcagaa gcaggagtac     1140

aagaccctgg agtacaacct gaccaccaca gaggtggtga tggagaatgt gacagccttc     1200

tgggaggagg gctttgggga gctgtttgag aaggccaagc agaacaacaa caacagaaag     1260

accagcaatg gggatgacag cctgttcttc agcaacttca gcctgctggg cacccctgtg     1320

ctgaaggaca tcaacttcaa gattgagaga ggccagctgc tggctgtggc tggcagcaca     1380

ggggctggca agaccagcct gctgatgatg atcatggggg agctggagcc ctctgagggc     1440

aagatcaagc actctggcag aatcagcttc tgcagccagt tcagctggat catgcctggc     1500

accatcaagg agaacatcat ctttggggtg agctatgatg agtacagata cagatctgtg     1560

atcaaggcct gccagctgga ggaggacatc agcaagtttg ctgagaagga caacattgtg     1620

ctgggggagg ggggcatcac cctgtctggg ggccagagag ccagaatcag cctggccaga     1680

gctgtgtaca aggatgctga cctgtacctg ctggacagcc cctttggcta cctggatgtg     1740

ctgacagaga aggagatctt tgagagctgt gtgtgcaagc tgatggccaa caagaccaga     1800

atcctggtga ccagcaagat ggagcacctg aagaaggctg acaagatcct gatcctgcat     1860

gagggcagca gctacttcta tggcaccttc tctgagctgc agaacctgca gcctgacttc     1920

agcagcaagc tgatgggctg tgacagcttt gaccagttct ctgctgagag aagaaacagc     1980

atcctgacag agaccctgca cagattcagc ctggaggggg atgcccctgt gagctggaca     2040

gagaccaaga agcagagctt caagcagaca ggggagtttg gggagaagag aaagaacagc     2100

atcctgaacc ccatcaacag caccctgcag gccagaagaa gacagtctgt gctgaacctg     2160

atgacccact ctgtgaacca gggccagaac atccacagaa agaccacagc cagcaccaga     2220

aaggtgagcc tggcccccca ggccaacctg acagagctgg acatctacag cagaagactg     2280

agccaggaga caggcctgga gatctctgag gagatcaatg aggaggacct gaaggagtgc     2340

ttctttgatg acatggagag catccctgct gtgaccacct ggaacaccta cctgagatac     2400

atcacagtgc acaagagcct gatctttgtg ctgatctggt gcctggtgat cttcctggct     2460

gaggtggctg ccagcctggt ggtgctgtgg ctgctgggca acacccccct gcaggacaag     2520

ggcaacagca cccacagcag aaacaacagc tatgctgtga tcatcaccag caccagcagc     2580

tactatgtgt tctacatcta tgtgggggtg gctgacaccc tgctggccat gggcttcttc     2640

agaggcctgc ccctggtgca caccctgatc acagtgagca agatcctgca ccacaagatg     2700

ctgcactctg tgctgcaggc ccccatgagc accctgaaca ccctgaaggc tgggggcatc     2760

ctgaacagat tcagcaagga cattgccatc ctggatgacc tgctgcccct gaccatcttt     2820

gacttcatcc agctgctgct gattgtgatt ggggccattg ctgtggtggc tgtgctgcag     2880

ccctacatct ttgtggccac agtgcctgtg attgtggcct tcatcatgct gagagcctac     2940

ttcctgcaga ccagccagca gctgaagcag ctggagtctg agggcagaag ccccatcttc     3000

acccacctgg tgaccagcct gaagggcctg tggaccctga gagcctttgg cagacagccc     3060

tactttgaga ccctgttcca caaggccctg aacctgcaca cagccaactg gttcctgtac     3120

ctgagcaccc tgagatggtt ccagatgaga attgagatga tctttgtgat cttcttcatt     3180

gctgtgacct tcatcagcat cctgaccaca ggggaggggg agggcagagt gggcatcatc     3240

ctgaccctgg ccatgaacat catgagcacc ctgcagtggg ctgtgaacag cagcattgat     3300

gtggacagcc tgatgagatc tgtgagcaga gtgttcaagt tcattgacat gcccacagag     3360

ggcaagccca ccaagagcac caagccctac aagaatggcc agctgagcaa ggtgatgatc     3420

attgagaaca gccatgtgaa gaaggatgac atctggccct ctgggggcca gatgacagtg     3480

aaggacctga cagccaagta cacagagggg ggcaatgcca tcctggagaa catcagcttc     3540

agcatcagcc ctggccagag agtgggcctg ctgggcagaa caggctctgg caagagcacc     3600

ctgctgtctg ccttcctgag actgctgaac acagaggggg agatccagat tgatggggtg     3660

agctgggaca gcatcaccct gcagcagtgg agaaaggcct ttggggtgat cccccagaag     3720

gtgttcatct tctctggcac cttcagaaag aacctggacc cctatgagca gtggtctgac     3780

caggagatct ggaaggtggc tgatgaggtg ggcctgagat ctgtgattga gcagttccct     3840

ggcaagctgg actttgtgct ggtggatggg ggctgtgtgc tgagccatgg ccacaagcag     3900

ctgatgtgcc tggccagatc tgtgctgagc aaggccaaga tcctgctgct ggatgagccc     3960

tctgcccacc tggaccctgt gacctaccag atcatcagaa gaaccctgaa gcaggccttt     4020

gctgactgca cagtgatcct gtgtgagcac agaattgagg ccatgctgga gtgccagcag     4080

ttcctggtga ttgaggagaa caaggtgaga cagtatgaca gcatccagaa gctgctgaat     4140

gagagaagcc tgttcagaca ggccatcagc ccctctgaca gagtgaagct gttcccccac     4200

agaaacagca gcaagtgcaa gagcaagccc cagattgctg ccctgaagga ggagaccgag     4260

gaggaggtgc aggacaccag actgtaa                                         4287


<210>  133
<211>  4443
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized, full length Cystic Fibrosis Transmembrane 
       Conductance Regulator (CFTR)

<400>  133
atgcagagaa gccccctgga gaaggcctct gtggtgagca agctgttctt cagctggacc       60

agacccatcc tgagaaaggg ctacagacag agactggagc tgtctgacat ctaccagatc      120

ccctctgtgg actctgctga caacctgtct gagaagctgg agagagagtg ggacagagag      180

ctggccagca agaagaaccc caagctgatc aatgccctga gaagatgctt cttctggaga      240

ttcatgttct atggcatctt cctgtacctg ggggaggtga ccaaggctgt gcagcccctg      300

ctgctgggca gaatcattgc cagctatgac cctgacaaca aggaggagag aagcattgcc      360

atctacctgg gcattggcct gtgcctgctg ttcattgtga gaaccctgct gctgcaccct      420

gccatctttg gcctgcacca cattggcatg cagatgagaa ttgccatgtt cagcctgatc      480

tacaagaaga ccctgaagct gagcagcaga gtgctggaca agatcagcat tggccagctg      540

gtgagcctgc tgagcaacaa cctgaacaag tttgatgagg gcctggccct ggcccacttt      600

gtgtggattg cccccctgca ggtggccctg ctgatgggcc tgatctggga gctgctgcag      660

gcctctgcct tctgtggcct gggcttcctg attgtgctgg ccctgttcca ggctggcctg      720

ggcagaatga tgatgaagta cagagaccag agagctggca agatctctga gagactggtg      780

atcacctctg agatgattga gaacatccag tctgtgaagg cctactgctg ggaggaggcc      840

atggagaaga tgattgagaa cctgagacag acagagctga agctgaccag aaaggctgcc      900

tatgtgagat acttcaacag ctctgccttc ttcttctctg gcttctttgt ggtgttcctg      960

tctgtgctgc cctatgccct gatcaagggc atcatcctga gaaagatctt caccaccatc     1020

agcttctgca ttgtgctgag aatggctgtg accagacagt tcccctgggc tgtgcagacc     1080

tggtatgaca gcctgggggc catcaacaag atccaggact tcctgcagaa gcaggagtac     1140

aagaccctgg agtacaacct gaccaccaca gaggtggtga tggagaatgt gacagccttc     1200

tgggaggagg gctttgggga gctgtttgag aaggccaagc agaacaacaa caacagaaag     1260

accagcaatg gggatgacag cctgttcttc agcaacttca gcctgctggg cacccctgtg     1320

ctgaaggaca tcaacttcaa gattgagaga ggccagctgc tggctgtggc tggcagcaca     1380

ggggctggca agaccagcct gctgatgatg atcatggggg agctggagcc ctctgagggc     1440

aagatcaagc actctggcag aatcagcttc tgcagccagt tcagctggat catgcctggc     1500

accatcaagg agaacatcat ctttggggtg agctatgatg agtacagata cagatctgtg     1560

atcaaggcct gccagctgga ggaggacatc agcaagtttg ctgagaagga caacattgtg     1620

ctgggggagg ggggcatcac cctgtctggg ggccagagag ccagaatcag cctggccaga     1680

gctgtgtaca aggatgctga cctgtacctg ctggacagcc cctttggcta cctggatgtg     1740

ctgacagaga aggagatctt tgagagctgt gtgtgcaagc tgatggccaa caagaccaga     1800

atcctggtga ccagcaagat ggagcacctg aagaaggctg acaagatcct gatcctgcat     1860

gagggcagca gctacttcta tggcaccttc tctgagctgc agaacctgca gcctgacttc     1920

agcagcaagc tgatgggctg tgacagcttt gaccagttct ctgctgagag aagaaacagc     1980

atcctgacag agaccctgca cagattcagc ctggaggggg atgcccctgt gagctggaca     2040

gagaccaaga agcagagctt caagcagaca ggggagtttg gggagaagag aaagaacagc     2100

atcctgaacc ccatcaacag catcagaaag ttcagcattg tgcagaagac ccccctgcag     2160

atgaatggca ttgaggagga ctctgatgag cccctggaga gaagactgag cctggtgcct     2220

gactctgagc agggggaggc catcctgccc agaatctctg tgatcagcac aggccccacc     2280

ctgcaggcca gaagaagaca gtctgtgctg aacctgatga cccactctgt gaaccagggc     2340

cagaacatcc acagaaagac cacagccagc accagaaagg tgagcctggc cccccaggcc     2400

aacctgacag agctggacat ctacagcaga agactgagcc aggagacagg cctggagatc     2460

tctgaggaga tcaatgagga ggacctgaag gagtgcttct ttgatgacat ggagagcatc     2520

cctgctgtga ccacctggaa cacctacctg agatacatca cagtgcacaa gagcctgatc     2580

tttgtgctga tctggtgcct ggtgatcttc ctggctgagg tggctgccag cctggtggtg     2640

ctgtggctgc tgggcaacac ccccctgcag gacaagggca acagcaccca cagcagaaac     2700

aacagctatg ctgtgatcat caccagcacc agcagctact atgtgttcta catctatgtg     2760

ggggtggctg acaccctgct ggccatgggc ttcttcagag gcctgcccct ggtgcacacc     2820

ctgatcacag tgagcaagat cctgcaccac aagatgctgc actctgtgct gcaggccccc     2880

atgagcaccc tgaacaccct gaaggctggg ggcatcctga acagattcag caaggacatt     2940

gccatcctgg atgacctgct gcccctgacc atctttgact tcatccagct gctgctgatt     3000

gtgattgggg ccattgctgt ggtggctgtg ctgcagccct acatctttgt ggccacagtg     3060

cctgtgattg tggccttcat catgctgaga gcctacttcc tgcagaccag ccagcagctg     3120

aagcagctgg agtctgaggg cagaagcccc atcttcaccc acctggtgac cagcctgaag     3180

ggcctgtgga ccctgagagc ctttggcaga cagccctact ttgagaccct gttccacaag     3240

gccctgaacc tgcacacagc caactggttc ctgtacctga gcaccctgag atggttccag     3300

atgagaattg agatgatctt tgtgatcttc ttcattgctg tgaccttcat cagcatcctg     3360

accacagggg agggggaggg cagagtgggc atcatcctga ccctggccat gaacatcatg     3420

agcaccctgc agtgggctgt gaacagcagc attgatgtgg acagcctgat gagatctgtg     3480

agcagagtgt tcaagttcat tgacatgccc acagagggca agcccaccaa gagcaccaag     3540

ccctacaaga atggccagct gagcaaggtg atgatcattg agaacagcca tgtgaagaag     3600

gatgacatct ggccctctgg gggccagatg acagtgaagg acctgacagc caagtacaca     3660

gaggggggca atgccatcct ggagaacatc agcttcagca tcagccctgg ccagagagtg     3720

ggcctgctgg gcagaacagg ctctggcaag agcaccctgc tgtctgcctt cctgagactg     3780

ctgaacacag agggggagat ccagattgat ggggtgagct gggacagcat caccctgcag     3840

cagtggagaa aggcctttgg ggtgatcccc cagaaggtgt tcatcttctc tggcaccttc     3900

agaaagaacc tggaccccta tgagcagtgg tctgaccagg agatctggaa ggtggctgat     3960

gaggtgggcc tgagatctgt gattgagcag ttccctggca agctggactt tgtgctggtg     4020

gatgggggct gtgtgctgag ccatggccac aagcagctga tgtgcctggc cagatctgtg     4080

ctgagcaagg ccaagatcct gctgctggat gagccctctg cccacctgga ccctgtgacc     4140

taccagatca tcagaagaac cctgaagcag gcctttgctg actgcacagt gatcctgtgt     4200

gagcacagaa ttgaggccat gctggagtgc cagcagttcc tggtgattga ggagaacaag     4260

gtgagacagt atgacagcat ccagaagctg ctgaatgaga gaagcctgtt cagacaggcc     4320

atcagcccct ctgacagagt gaagctgttc ccccacagaa acagcagcaa gtgcaagagc     4380

aagccccaga ttgctgccct gaaggaggag accgaggagg aggtgcagga caccagactg     4440

taa                                                                   4443


<210>  134
<211>  502
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(502)
<223>  Sulfoglucosamine sulfohydrolase (SGSH)

<400>  134

Met Ser Cys Pro Val Pro Ala Cys Cys Ala Leu Leu Leu Val Leu Gly 
1               5                   10                  15      


Leu Cys Arg Ala Arg Pro Arg Asn Ala Leu Leu Leu Leu Ala Asp Asp 
            20                  25                  30          


Gly Gly Phe Glu Ser Gly Ala Tyr Asn Asn Ser Ala Ile Ala Thr Pro 
        35                  40                  45              


His Leu Asp Ala Leu Ala Arg Arg Ser Leu Leu Phe Arg Asn Ala Phe 
    50                  55                  60                  


Thr Ser Val Ser Ser Cys Ser Pro Ser Arg Ala Ser Leu Leu Thr Gly 
65                  70                  75                  80  


Leu Pro Gln His Gln Asn Gly Met Tyr Gly Leu His Gln Asp Val His 
                85                  90                  95      


His Phe Asn Ser Phe Asp Lys Val Arg Ser Leu Pro Leu Leu Leu Ser 
            100                 105                 110         


Gln Ala Gly Val Arg Thr Gly Ile Ile Gly Lys Lys His Val Gly Pro 
        115                 120                 125             


Glu Thr Val Tyr Pro Phe Asp Phe Ala Tyr Thr Glu Glu Asn Gly Ser 
    130                 135                 140                 


Val Leu Gln Val Gly Arg Asn Ile Thr Arg Ile Lys Leu Leu Val Arg 
145                 150                 155                 160 


Lys Phe Leu Gln Thr Gln Asp Asp Gln Pro Phe Phe Leu Tyr Val Ala 
                165                 170                 175     


Phe His Asp Pro His Arg Cys Gly His Ser Gln Pro Gln Tyr Gly Thr 
            180                 185                 190         


Phe Cys Glu Lys Phe Gly Asn Gly Glu Ser Gly Met Gly Arg Ile Pro 
        195                 200                 205             


Asp Trp Thr Pro Gln Ala Tyr Asp Pro Leu Asp Val Leu Val Pro Tyr 
    210                 215                 220                 


Phe Val Pro Asn Thr Pro Ala Ala Arg Ala Asp Leu Ala Ala Gln Tyr 
225                 230                 235                 240 


Thr Thr Val Gly Arg Met Asp Gln Gly Val Gly Leu Val Leu Gln Glu 
                245                 250                 255     


Leu Arg Asp Ala Gly Val Leu Asn Asp Thr Leu Val Ile Phe Thr Ser 
            260                 265                 270         


Asp Asn Gly Ile Pro Phe Pro Ser Gly Arg Thr Asn Leu Tyr Trp Pro 
        275                 280                 285             


Gly Thr Ala Glu Pro Leu Leu Val Ser Ser Pro Glu His Pro Lys Arg 
    290                 295                 300                 


Trp Gly Gln Val Ser Glu Ala Tyr Val Ser Leu Leu Asp Leu Thr Pro 
305                 310                 315                 320 


Thr Ile Leu Asp Trp Phe Ser Ile Pro Tyr Pro Ser Tyr Ala Ile Phe 
                325                 330                 335     


Gly Ser Lys Thr Ile His Leu Thr Gly Arg Ser Leu Leu Pro Ala Leu 
            340                 345                 350         


Glu Ala Glu Pro Leu Trp Ala Thr Val Phe Gly Ser Gln Ser His His 
        355                 360                 365             


Glu Val Thr Met Ser Tyr Pro Met Arg Ser Val Gln His Arg His Phe 
    370                 375                 380                 


Arg Leu Val His Asn Leu Asn Phe Lys Met Pro Phe Pro Ile Asp Gln 
385                 390                 395                 400 


Asp Phe Tyr Val Ser Pro Thr Phe Gln Asp Leu Leu Asn Arg Thr Thr 
                405                 410                 415     


Ala Gly Gln Pro Thr Gly Trp Tyr Lys Asp Leu Arg His Tyr Tyr Tyr 
            420                 425                 430         


Arg Ala Arg Trp Glu Leu Tyr Asp Arg Ser Arg Asp Pro His Glu Thr 
        435                 440                 445             


Gln Asn Leu Ala Thr Asp Pro Arg Phe Ala Gln Leu Leu Glu Met Leu 
    450                 455                 460                 


Arg Asp Gln Leu Ala Lys Trp Gln Trp Glu Thr His Asp Pro Trp Val 
465                 470                 475                 480 


Cys Ala Pro Asp Gly Val Leu Glu Glu Lys Leu Ser Pro Gln Cys Gln 
                485                 490                 495     


Pro Leu His Asn Glu Leu 
            500         


<210>  135
<211>  531
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized + GET CO1-SGSH-GET

<400>  135

Met Ser Cys Pro Val Pro Ala Cys Cys Ala Leu Leu Leu Val Leu Gly 
1               5                   10                  15      


Leu Cys Arg Ala Arg Pro Arg Asn Ala Leu Leu Leu Leu Ala Asp Asp 
            20                  25                  30          


Gly Gly Phe Glu Ser Gly Ala Tyr Asn Asn Ser Ala Ile Ala Thr Pro 
        35                  40                  45              


His Leu Asp Ala Leu Ala Arg Arg Ser Leu Leu Phe Arg Asn Ala Phe 
    50                  55                  60                  


Thr Ser Val Ser Ser Cys Ser Pro Ser Arg Ala Ser Leu Leu Thr Gly 
65                  70                  75                  80  


Leu Pro Gln His Gln Asn Gly Met Tyr Gly Leu His Gln Asp Val His 
                85                  90                  95      


His Phe Asn Ser Phe Asp Lys Val Arg Ser Leu Pro Leu Leu Leu Ser 
            100                 105                 110         


Gln Ala Gly Val Arg Thr Gly Ile Ile Gly Lys Lys His Val Gly Pro 
        115                 120                 125             


Glu Thr Val Tyr Pro Phe Asp Phe Ala Tyr Thr Glu Glu Asn Gly Ser 
    130                 135                 140                 


Val Leu Gln Val Gly Arg Asn Ile Thr Arg Ile Lys Leu Leu Val Arg 
145                 150                 155                 160 


Lys Phe Leu Gln Thr Gln Asp Asp Gln Pro Phe Phe Leu Tyr Val Ala 
                165                 170                 175     


Phe His Asp Pro His Arg Cys Gly His Ser Gln Pro Gln Tyr Gly Thr 
            180                 185                 190         


Phe Cys Glu Lys Phe Gly Asn Gly Glu Ser Gly Met Gly Arg Ile Pro 
        195                 200                 205             


Asp Trp Thr Pro Gln Ala Tyr Asp Pro Leu Asp Val Leu Val Pro Tyr 
    210                 215                 220                 


Phe Val Pro Asn Thr Pro Ala Ala Arg Ala Asp Leu Ala Ala Gln Tyr 
225                 230                 235                 240 


Thr Thr Val Gly Arg Met Asp Gln Gly Val Gly Leu Val Leu Gln Glu 
                245                 250                 255     


Leu Arg Asp Ala Gly Val Leu Asn Asp Thr Leu Val Ile Phe Thr Ser 
            260                 265                 270         


Asp Asn Gly Ile Pro Phe Pro Ser Gly Arg Thr Asn Leu Tyr Trp Pro 
        275                 280                 285             


Gly Thr Ala Glu Pro Leu Leu Val Ser Ser Pro Glu His Pro Lys Arg 
    290                 295                 300                 


Trp Gly Gln Val Ser Glu Ala Tyr Val Ser Leu Leu Asp Leu Thr Pro 
305                 310                 315                 320 


Thr Ile Leu Asp Trp Phe Ser Ile Pro Tyr Pro Ser Tyr Ala Ile Phe 
                325                 330                 335     


Gly Ser Lys Thr Ile His Leu Thr Gly Arg Ser Leu Leu Pro Ala Leu 
            340                 345                 350         


Glu Ala Glu Pro Leu Trp Ala Thr Val Phe Gly Ser Gln Ser His His 
        355                 360                 365             


Glu Val Thr Met Ser Tyr Pro Met Arg Ser Val Gln His Arg His Phe 
    370                 375                 380                 


Arg Leu Val His Asn Leu Asn Phe Lys Met Pro Phe Pro Ile Asp Gln 
385                 390                 395                 400 


Asp Phe Tyr Val Ser Pro Thr Phe Gln Asp Leu Leu Asn Arg Thr Thr 
                405                 410                 415     


Ala Gly Gln Pro Thr Gly Trp Tyr Lys Asp Leu Arg His Tyr Tyr Tyr 
            420                 425                 430         


Arg Ala Arg Trp Glu Leu Tyr Asp Arg Ser Arg Asp Pro His Glu Thr 
        435                 440                 445             


Gln Asn Leu Ala Thr Asp Pro Arg Phe Ala Gln Leu Leu Glu Met Leu 
    450                 455                 460                 


Arg Asp Gln Leu Ala Lys Trp Gln Trp Glu Thr His Asp Pro Trp Val 
465                 470                 475                 480 


Cys Ala Pro Asp Gly Val Leu Glu Glu Lys Leu Ser Pro Gln Cys Gln 
                485                 490                 495     


Pro Leu His Asn Glu Leu Arg Arg Arg Arg Arg Arg Arg Arg Lys Arg 
            500                 505                 510         


Lys Lys Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg 
        515                 520                 525             


Lys Tyr Lys 
    530     


<210>  136
<211>  306
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized Ceroid Lipofuscinosis, Neuronal, 1 (CLN1)

<400>  136

Met Ala Ser Pro Gly Cys Leu Trp Leu Leu Ala Val Ala Leu Leu Pro 
1               5                   10                  15      


Trp Thr Cys Ala Ser Arg Ala Leu Gln His Leu Asp Pro Pro Ala Pro 
            20                  25                  30          


Leu Pro Leu Val Ile Trp His Gly Met Gly Asp Ser Cys Cys Asn Pro 
        35                  40                  45              


Leu Ser Met Gly Ala Ile Lys Lys Met Val Glu Lys Lys Ile Pro Gly 
    50                  55                  60                  


Ile Tyr Val Leu Ser Leu Glu Ile Gly Lys Thr Leu Met Glu Asp Val 
65                  70                  75                  80  


Glu Asn Ser Phe Phe Leu Asn Val Asn Ser Gln Val Thr Thr Val Cys 
                85                  90                  95      


Gln Ala Leu Ala Lys Asp Pro Lys Leu Gln Gln Gly Tyr Asn Ala Met 
            100                 105                 110         


Gly Phe Ser Gln Gly Gly Gln Phe Leu Arg Ala Val Ala Gln Arg Cys 
        115                 120                 125             


Pro Ser Pro Pro Met Ile Asn Leu Ile Ser Val Gly Gly Gln His Gln 
    130                 135                 140                 


Gly Val Phe Gly Leu Pro Arg Cys Pro Gly Glu Ser Ser His Ile Cys 
145                 150                 155                 160 


Asp Phe Ile Arg Lys Thr Leu Asn Ala Gly Ala Tyr Ser Lys Val Val 
                165                 170                 175     


Gln Glu Arg Leu Val Gln Ala Glu Tyr Trp His Asp Pro Ile Lys Glu 
            180                 185                 190         


Asp Val Tyr Arg Asn His Ser Ile Phe Leu Ala Asp Ile Asn Gln Glu 
        195                 200                 205             


Arg Gly Ile Asn Glu Ser Tyr Lys Lys Asn Leu Met Ala Leu Lys Lys 
    210                 215                 220                 


Phe Val Met Val Lys Phe Leu Asn Asp Ser Ile Val Asp Pro Val Asp 
225                 230                 235                 240 


Ser Glu Trp Phe Gly Phe Tyr Arg Ser Gly Gln Ala Lys Glu Thr Ile 
                245                 250                 255     


Pro Leu Gln Glu Thr Ser Leu Tyr Thr Gln Asp Arg Leu Gly Leu Lys 
            260                 265                 270         


Glu Met Asp Asn Ala Gly Gln Leu Val Phe Leu Ala Thr Glu Gly Asp 
        275                 280                 285             


His Leu Gln Leu Ser Glu Glu Trp Phe Tyr Ala His Ile Ile Pro Phe 
    290                 295                 300                 


Leu Gly 
305     


<210>  137
<211>  294
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(294)
<223>  Survival Motor Neuron 1 (SMN1)

<400>  137

Met Ala Met Ser Ser Gly Gly Ser Gly Gly Gly Val Pro Glu Gln Glu 
1               5                   10                  15      


Asp Ser Val Leu Phe Arg Arg Gly Thr Gly Gln Ser Asp Asp Ser Asp 
            20                  25                  30          


Ile Trp Asp Asp Thr Ala Leu Ile Lys Ala Tyr Asp Lys Ala Val Ala 
        35                  40                  45              


Ser Phe Lys His Ala Leu Lys Asn Gly Asp Ile Cys Glu Thr Ser Gly 
    50                  55                  60                  


Lys Pro Lys Thr Thr Pro Lys Arg Lys Pro Ala Lys Lys Asn Lys Ser 
65                  70                  75                  80  


Gln Lys Lys Asn Thr Ala Ala Ser Leu Gln Gln Trp Lys Val Gly Asp 
                85                  90                  95      


Lys Cys Ser Ala Ile Trp Ser Glu Asp Gly Cys Ile Tyr Pro Ala Thr 
            100                 105                 110         


Ile Ala Ser Ile Asp Phe Lys Arg Glu Thr Cys Val Val Val Tyr Thr 
        115                 120                 125             


Gly Tyr Gly Asn Arg Glu Glu Gln Asn Leu Ser Asp Leu Leu Ser Pro 
    130                 135                 140                 


Ile Cys Glu Val Ala Asn Asn Ile Glu Gln Asn Ala Gln Glu Asn Glu 
145                 150                 155                 160 


Asn Glu Ser Gln Val Ser Thr Asp Glu Ser Glu Asn Ser Arg Ser Pro 
                165                 170                 175     


Gly Asn Lys Ser Asp Asn Ile Lys Pro Lys Ser Ala Pro Trp Asn Ser 
            180                 185                 190         


Phe Leu Pro Pro Pro Pro Pro Met Pro Gly Pro Arg Leu Gly Pro Gly 
        195                 200                 205             


Lys Pro Gly Leu Lys Phe Asn Gly Pro Pro Pro Pro Pro Pro Pro Pro 
    210                 215                 220                 


Pro Pro His Leu Leu Ser Cys Trp Leu Pro Pro Phe Pro Ser Gly Pro 
225                 230                 235                 240 


Pro Ile Ile Pro Pro Pro Pro Pro Ile Cys Pro Asp Ser Leu Asp Asp 
                245                 250                 255     


Ala Asp Ala Leu Gly Ser Met Leu Ile Ser Trp Tyr Met Ser Gly Tyr 
            260                 265                 270         


His Thr Gly Tyr Tyr Met Gly Phe Arg Gln Asn Gln Lys Glu Gly Arg 
        275                 280                 285             


Cys Ser His Ser Leu Asn 
    290                 


<210>  138
<211>  515
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(515)
<223>  Tissue Non-specific Alkaline Phosphatase (TNALP)

<400>  138

Met Ile Ser Pro Phe Leu Val Leu Ala Ile Gly Thr Cys Leu Thr Asn 
1               5                   10                  15      


Ser Leu Val Pro Glu Lys Glu Lys Asp Pro Lys Tyr Trp Arg Asp Gln 
            20                  25                  30          


Ala Gln Glu Thr Leu Lys Tyr Ala Leu Glu Leu Gln Lys Leu Asn Thr 
        35                  40                  45              


Asn Val Ala Lys Asn Val Ile Met Phe Leu Gly Asp Gly Met Gly Val 
    50                  55                  60                  


Ser Thr Val Thr Ala Ala Arg Ile Leu Lys Gly Gln Leu His His Asn 
65                  70                  75                  80  


Pro Gly Glu Glu Thr Arg Leu Glu Met Asp Lys Phe Pro Phe Val Ala 
                85                  90                  95      


Leu Ser Lys Thr Tyr Asn Thr Asn Ala Gln Val Pro Asp Ser Ala Gly 
            100                 105                 110         


Thr Ala Thr Ala Tyr Leu Cys Gly Val Lys Ala Asn Glu Gly Thr Val 
        115                 120                 125             


Gly Val Ser Ala Ala Thr Glu Arg Ser Arg Cys Asn Thr Thr Gln Gly 
    130                 135                 140                 


Asn Glu Val Thr Ser Ile Leu Arg Trp Ala Lys Asp Ala Gly Lys Ser 
145                 150                 155                 160 


Val Gly Ile Val Thr Thr Thr Arg Val Asn His Ala Thr Pro Ser Ala 
                165                 170                 175     


Ala Tyr Ala His Ser Ala Asp Arg Asp Trp Tyr Ser Asp Asn Glu Met 
            180                 185                 190         


Pro Pro Glu Ala Leu Ser Gln Gly Cys Lys Asp Ile Ala Tyr Gln Leu 
        195                 200                 205             


Met His Asn Ile Arg Asp Ile Asp Val Ile Met Gly Gly Gly Arg Lys 
    210                 215                 220                 


Tyr Met Tyr Pro Lys Asn Lys Thr Asp Val Glu Tyr Glu Ser Asp Glu 
225                 230                 235                 240 


Lys Ala Arg Gly Thr Arg Leu Asp Gly Leu Asp Leu Val Asp Thr Trp 
                245                 250                 255     


Lys Ser Phe Lys Pro Arg Tyr Lys His Ser His Phe Ile Trp Asn Arg 
            260                 265                 270         


Thr Glu Leu Leu Thr Leu Asp Pro His Asn Val Asp Tyr Leu Leu Gly 
        275                 280                 285             


Leu Phe Glu Pro Gly Asp Met Gln Tyr Glu Leu Asn Arg Asn Asn Val 
    290                 295                 300                 


Thr Asp Pro Ser Leu Ser Glu Met Val Val Val Ala Ile Gln Ile Leu 
305                 310                 315                 320 


Arg Lys Asn Pro Lys Gly Phe Phe Leu Leu Val Glu Gly Gly Arg Ile 
                325                 330                 335     


Asp His Gly His His Glu Gly Lys Ala Lys Gln Ala Leu His Glu Ala 
            340                 345                 350         


Val Glu Met Asp Arg Ala Ile Gly Gln Ala Gly Ser Leu Thr Ser Ser 
        355                 360                 365             


Glu Asp Thr Leu Thr Val Val Thr Ala Asp His Ser His Val Phe Thr 
    370                 375                 380                 


Phe Gly Gly Tyr Thr Pro Arg Gly Asn Ser Ile Phe Gly Leu Ala Pro 
385                 390                 395                 400 


Met Leu Ser Asp Thr Asp Lys Lys Pro Phe Thr Ala Ile Leu Tyr Gly 
                405                 410                 415     


Asn Gly Pro Gly Tyr Lys Val Val Gly Gly Glu Arg Glu Asn Val Ser 
            420                 425                 430         


Met Val Asp Tyr Ala His Asn Asn Tyr Gln Ala Gln Ser Ala Val Pro 
        435                 440                 445             


Leu Arg His Glu Thr His Gly Gly Glu Asp Val Ala Val Phe Ser Lys 
    450                 455                 460                 


Gly Pro Met Ala His Leu Leu His Gly Val His Glu Gln Asn Tyr Val 
465                 470                 475                 480 


Pro His Val Met Ala Tyr Ala Ala Cys Ile Gly Ala Asn Leu Gly His 
                485                 490                 495     


Cys Ala Pro Ala Ser Ser Ala Gly Ser Asp Asp Asp Asp Asp Asp Asp 
            500                 505                 510         


Asp Asp Asp 
        515 


<210>  139
<211>  211
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(211)
<223>  Glial Cell Derived Neurotrophic Factor (GDNF)

<400>  139

Met Lys Leu Trp Asp Val Val Ala Val Cys Leu Val Leu Leu His Thr 
1               5                   10                  15      


Ala Ser Ala Phe Pro Leu Pro Ala Gly Lys Arg Pro Pro Glu Ala Pro 
            20                  25                  30          


Ala Glu Asp Arg Ser Leu Gly Arg Arg Arg Ala Pro Phe Ala Leu Ser 
        35                  40                  45              


Ser Asp Ser Asn Met Pro Glu Asp Tyr Pro Asp Gln Phe Asp Asp Val 
    50                  55                  60                  


Met Asp Phe Ile Gln Ala Thr Ile Lys Arg Leu Lys Arg Ser Pro Asp 
65                  70                  75                  80  


Lys Gln Met Ala Val Leu Pro Arg Arg Glu Arg Asn Arg Gln Ala Ala 
                85                  90                  95      


Ala Ala Asn Pro Glu Asn Ser Arg Gly Lys Gly Arg Arg Gly Gln Arg 
            100                 105                 110         


Gly Lys Asn Arg Gly Cys Val Leu Thr Ala Ile His Leu Asn Val Thr 
        115                 120                 125             


Asp Leu Gly Leu Gly Tyr Glu Thr Lys Glu Glu Leu Ile Phe Arg Tyr 
    130                 135                 140                 


Cys Ser Gly Ser Cys Asp Ala Ala Glu Thr Thr Tyr Asp Lys Ile Leu 
145                 150                 155                 160 


Lys Asn Leu Ser Arg Asn Arg Arg Leu Val Ser Asp Lys Val Gly Gln 
                165                 170                 175     


Ala Cys Cys Arg Pro Ile Ala Phe Asp Asp Asp Leu Ser Phe Leu Asp 
            180                 185                 190         


Asp Asn Leu Val Tyr His Ile Leu Arg Lys His Ser Ala Lys Arg Cys 
        195                 200                 205             


Gly Cys Ile 
    210     


<210>  140
<211>  536
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(536)
<223>  Glucosylceramidase beta (GBA1)

<400>  140

Met Glu Phe Ser Ser Pro Ser Arg Glu Glu Cys Pro Lys Pro Leu Ser 
1               5                   10                  15      


Arg Val Ser Ile Met Ala Gly Ser Leu Thr Gly Leu Leu Leu Leu Gln 
            20                  25                  30          


Ala Val Ser Trp Ala Ser Gly Ala Arg Pro Cys Ile Pro Lys Ser Phe 
        35                  40                  45              


Gly Tyr Ser Ser Val Val Cys Val Cys Asn Ala Thr Tyr Cys Asp Ser 
    50                  55                  60                  


Phe Asp Pro Pro Thr Phe Pro Ala Leu Gly Thr Phe Ser Arg Tyr Glu 
65                  70                  75                  80  


Ser Thr Arg Ser Gly Arg Arg Met Glu Leu Ser Met Gly Pro Ile Gln 
                85                  90                  95      


Ala Asn His Thr Gly Thr Gly Leu Leu Leu Thr Leu Gln Pro Glu Gln 
            100                 105                 110         


Lys Phe Gln Lys Val Lys Gly Phe Gly Gly Ala Met Thr Asp Ala Ala 
        115                 120                 125             


Ala Leu Asn Ile Leu Ala Leu Ser Pro Pro Ala Gln Asn Leu Leu Leu 
    130                 135                 140                 


Lys Ser Tyr Phe Ser Glu Glu Gly Ile Gly Tyr Asn Ile Ile Arg Val 
145                 150                 155                 160 


Pro Met Ala Ser Cys Asp Phe Ser Ile Arg Thr Tyr Thr Tyr Ala Asp 
                165                 170                 175     


Thr Pro Asp Asp Phe Gln Leu His Asn Phe Ser Leu Pro Glu Glu Asp 
            180                 185                 190         


Thr Lys Leu Lys Ile Pro Leu Ile His Arg Ala Leu Gln Leu Ala Gln 
        195                 200                 205             


Arg Pro Val Ser Leu Leu Ala Ser Pro Trp Thr Ser Pro Thr Trp Leu 
    210                 215                 220                 


Lys Thr Asn Gly Ala Val Asn Gly Lys Gly Ser Leu Lys Gly Gln Pro 
225                 230                 235                 240 


Gly Asp Ile Tyr His Gln Thr Trp Ala Arg Tyr Phe Val Lys Phe Leu 
                245                 250                 255     


Asp Ala Tyr Ala Glu His Lys Leu Gln Phe Trp Ala Val Thr Ala Glu 
            260                 265                 270         


Asn Glu Pro Ser Ala Gly Leu Leu Ser Gly Tyr Pro Phe Gln Cys Leu 
        275                 280                 285             


Gly Phe Thr Pro Glu His Gln Arg Asp Phe Ile Ala Arg Asp Leu Gly 
    290                 295                 300                 


Pro Thr Leu Ala Asn Ser Thr His His Asn Val Arg Leu Leu Met Leu 
305                 310                 315                 320 


Asp Asp Gln Arg Leu Leu Leu Pro His Trp Ala Lys Val Val Leu Thr 
                325                 330                 335     


Asp Pro Glu Ala Ala Lys Tyr Val His Gly Ile Ala Val His Trp Tyr 
            340                 345                 350         


Leu Asp Phe Leu Ala Pro Ala Lys Ala Thr Leu Gly Glu Thr His Arg 
        355                 360                 365             


Leu Phe Pro Asn Thr Met Leu Phe Ala Ser Glu Ala Cys Val Gly Ser 
    370                 375                 380                 


Lys Phe Trp Glu Gln Ser Val Arg Leu Gly Ser Trp Asp Arg Gly Met 
385                 390                 395                 400 


Gln Tyr Ser His Ser Ile Ile Thr Asn Leu Leu Tyr His Val Val Gly 
                405                 410                 415     


Trp Thr Asp Trp Asn Leu Ala Leu Asn Pro Glu Gly Gly Pro Asn Trp 
            420                 425                 430         


Val Arg Asn Phe Val Asp Ser Pro Ile Ile Val Asp Ile Thr Lys Asp 
        435                 440                 445             


Thr Phe Tyr Lys Gln Pro Met Phe Tyr His Leu Gly His Phe Ser Lys 
    450                 455                 460                 


Phe Ile Pro Glu Gly Ser Gln Arg Val Gly Leu Val Ala Ser Gln Lys 
465                 470                 475                 480 


Asn Asp Leu Asp Ala Val Ala Leu Met His Pro Asp Gly Ser Ala Val 
                485                 490                 495     


Val Val Val Leu Asn Arg Ser Ser Lys Asp Val Pro Leu Thr Ile Lys 
            500                 505                 510         


Asp Pro Ala Val Gly Phe Leu Glu Thr Ile Ser Pro Gly Tyr Ser Ile 
        515                 520                 525             


His Thr Tyr Leu Trp Arg Arg Gln 
    530                 535     


<210>  141
<211>  653
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(653)
<223>  Iduronidase alpha-L- (IDUA)

<400>  141

Met Arg Pro Leu Arg Pro Arg Ala Ala Leu Leu Ala Leu Leu Ala Ser 
1               5                   10                  15      


Leu Leu Ala Ala Pro Pro Val Ala Pro Ala Glu Ala Pro His Leu Val 
            20                  25                  30          


His Val Asp Ala Ala Arg Ala Leu Trp Pro Leu Arg Arg Phe Trp Arg 
        35                  40                  45              


Ser Thr Gly Phe Cys Pro Pro Leu Pro His Ser Gln Ala Asp Gln Tyr 
    50                  55                  60                  


Val Leu Ser Trp Asp Gln Gln Leu Asn Leu Ala Tyr Val Gly Ala Val 
65                  70                  75                  80  


Pro His Arg Gly Ile Lys Gln Val Arg Thr His Trp Leu Leu Glu Leu 
                85                  90                  95      


Val Thr Thr Arg Gly Ser Thr Gly Arg Gly Leu Ser Tyr Asn Phe Thr 
            100                 105                 110         


His Leu Asp Gly Tyr Leu Asp Leu Leu Arg Glu Asn Gln Leu Leu Pro 
        115                 120                 125             


Gly Phe Glu Leu Met Gly Ser Ala Ser Gly His Phe Thr Asp Phe Glu 
    130                 135                 140                 


Asp Lys Gln Gln Val Phe Glu Trp Lys Asp Leu Val Ser Ser Leu Ala 
145                 150                 155                 160 


Arg Arg Tyr Ile Gly Arg Tyr Gly Leu Ala His Val Ser Lys Trp Asn 
                165                 170                 175     


Phe Glu Thr Trp Asn Glu Pro Asp His His Asp Phe Asp Asn Val Ser 
            180                 185                 190         


Met Thr Met Gln Gly Phe Leu Asn Tyr Tyr Asp Ala Cys Ser Glu Gly 
        195                 200                 205             


Leu Arg Ala Ala Ser Pro Ala Leu Arg Leu Gly Gly Pro Gly Asp Ser 
    210                 215                 220                 


Phe His Thr Pro Pro Arg Ser Pro Leu Ser Trp Gly Leu Leu Arg His 
225                 230                 235                 240 


Cys His Asp Gly Thr Asn Phe Phe Thr Gly Glu Ala Gly Val Arg Leu 
                245                 250                 255     


Asp Tyr Ile Ser Leu His Arg Lys Gly Ala Arg Ser Ser Ile Ser Ile 
            260                 265                 270         


Leu Glu Gln Glu Lys Val Val Ala Gln Gln Ile Arg Gln Leu Phe Pro 
        275                 280                 285             


Lys Phe Ala Asp Thr Pro Ile Tyr Asn Asp Glu Ala Asp Pro Leu Val 
    290                 295                 300                 


Gly Trp Ser Leu Pro Gln Pro Trp Arg Ala Asp Val Thr Tyr Ala Ala 
305                 310                 315                 320 


Met Val Val Lys Val Ile Ala Gln His Gln Asn Leu Leu Leu Ala Asn 
                325                 330                 335     


Thr Thr Ser Ala Phe Pro Tyr Ala Leu Leu Ser Asn Asp Asn Ala Phe 
            340                 345                 350         


Leu Ser Tyr His Pro His Pro Phe Ala Gln Arg Thr Leu Thr Ala Arg 
        355                 360                 365             


Phe Gln Val Asn Asn Thr Arg Pro Pro His Val Gln Leu Leu Arg Lys 
    370                 375                 380                 


Pro Val Leu Thr Ala Met Gly Leu Leu Ala Leu Leu Asp Glu Glu Gln 
385                 390                 395                 400 


Leu Trp Ala Glu Val Ser Gln Ala Gly Thr Val Leu Asp Ser Asn His 
                405                 410                 415     


Thr Val Gly Val Leu Ala Ser Ala His Arg Pro Gln Gly Pro Ala Asp 
            420                 425                 430         


Ala Trp Arg Ala Ala Val Leu Ile Tyr Ala Ser Asp Asp Thr Arg Ala 
        435                 440                 445             


His Pro Asn Arg Ser Val Ala Val Thr Leu Arg Leu Arg Gly Val Pro 
    450                 455                 460                 


Pro Gly Pro Gly Leu Val Tyr Val Thr Arg Tyr Leu Asp Asn Gly Leu 
465                 470                 475                 480 


Cys Ser Pro Asp Gly Glu Trp Arg Arg Leu Gly Arg Pro Val Phe Pro 
                485                 490                 495     


Thr Ala Glu Gln Phe Arg Arg Met Arg Ala Ala Glu Asp Pro Val Ala 
            500                 505                 510         


Ala Ala Pro Arg Pro Leu Pro Ala Gly Gly Arg Leu Thr Leu Arg Pro 
        515                 520                 525             


Ala Leu Arg Leu Pro Ser Leu Leu Leu Val His Val Cys Ala Arg Pro 
    530                 535                 540                 


Glu Lys Pro Pro Gly Gln Val Thr Arg Leu Arg Ala Leu Pro Leu Thr 
545                 550                 555                 560 


Gln Gly Gln Leu Val Leu Val Trp Ser Asp Glu His Val Gly Ser Lys 
                565                 570                 575     


Cys Leu Trp Thr Tyr Glu Ile Gln Phe Ser Gln Asp Gly Lys Ala Tyr 
            580                 585                 590         


Thr Pro Val Ser Arg Lys Pro Ser Thr Phe Asn Leu Phe Val Phe Ser 
        595                 600                 605             


Pro Asp Thr Gly Ala Val Ser Gly Ser Tyr Arg Val Arg Ala Leu Asp 
    610                 615                 620                 


Tyr Trp Ala Arg Pro Gly Pro Phe Ser Asp Pro Val Pro Tyr Leu Glu 
625                 630                 635                 640 


Val Pro Val Pro Arg Gly Pro Pro Ser Pro Gly Asn Pro 
                645                 650             


<210>  142
<211>  525
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(525)
<223>  Cytochrome P450 family 4 subfamily V member 2 (CYP4V2)

<400>  142

Met Ala Gly Leu Trp Leu Gly Leu Val Trp Gln Lys Leu Leu Leu Trp 
1               5                   10                  15      


Gly Ala Ala Ser Ala Leu Ser Leu Ala Gly Ala Ser Leu Val Leu Ser 
            20                  25                  30          


Leu Leu Gln Arg Val Ala Ser Tyr Ala Arg Lys Trp Gln Gln Met Arg 
        35                  40                  45              


Pro Ile Pro Thr Val Ala Arg Ala Tyr Pro Leu Val Gly His Ala Leu 
    50                  55                  60                  


Leu Met Lys Pro Asp Gly Arg Glu Phe Phe Gln Gln Ile Ile Glu Tyr 
65                  70                  75                  80  


Thr Glu Glu Tyr Arg His Met Pro Leu Leu Lys Leu Trp Val Gly Pro 
                85                  90                  95      


Val Pro Met Val Ala Leu Tyr Asn Ala Glu Asn Val Glu Val Ile Leu 
            100                 105                 110         


Thr Ser Ser Lys Gln Ile Asp Lys Ser Ser Met Tyr Lys Phe Leu Glu 
        115                 120                 125             


Pro Trp Leu Gly Leu Gly Leu Leu Thr Ser Thr Gly Asn Lys Trp Arg 
    130                 135                 140                 


Ser Arg Arg Lys Met Leu Thr Pro Thr Phe His Phe Thr Ile Leu Glu 
145                 150                 155                 160 


Asp Phe Leu Asp Ile Met Asn Glu Gln Ala Asn Ile Leu Val Lys Lys 
                165                 170                 175     


Leu Glu Lys His Ile Asn Gln Glu Ala Phe Asn Cys Phe Phe Tyr Ile 
            180                 185                 190         


Thr Leu Cys Ala Leu Asp Ile Ile Cys Glu Thr Ala Met Gly Lys Asn 
        195                 200                 205             


Ile Gly Ala Gln Ser Asn Asp Asp Ser Glu Tyr Val Arg Ala Val Tyr 
    210                 215                 220                 


Arg Met Ser Glu Met Ile Phe Arg Arg Ile Lys Met Pro Trp Leu Trp 
225                 230                 235                 240 


Leu Asp Leu Trp Tyr Leu Met Phe Lys Glu Gly Trp Glu His Lys Lys 
                245                 250                 255     


Ser Leu Gln Ile Leu His Thr Phe Thr Asn Ser Val Ile Ala Glu Arg 
            260                 265                 270         


Ala Asn Glu Met Asn Ala Asn Glu Asp Cys Arg Gly Asp Gly Arg Gly 
        275                 280                 285             


Ser Ala Pro Ser Lys Asn Lys Arg Arg Ala Phe Leu Asp Leu Leu Leu 
    290                 295                 300                 


Ser Val Thr Asp Asp Glu Gly Asn Arg Leu Ser His Glu Asp Ile Arg 
305                 310                 315                 320 


Glu Glu Val Asp Thr Phe Met Phe Glu Gly His Asp Thr Thr Ala Ala 
                325                 330                 335     


Ala Ile Asn Trp Ser Leu Tyr Leu Leu Gly Ser Asn Pro Glu Val Gln 
            340                 345                 350         


Lys Lys Val Asp His Glu Leu Asp Asp Val Phe Gly Lys Ser Asp Arg 
        355                 360                 365             


Pro Ala Thr Val Glu Asp Leu Lys Lys Leu Arg Tyr Leu Glu Cys Val 
    370                 375                 380                 


Ile Lys Glu Thr Leu Arg Leu Phe Pro Ser Val Pro Leu Phe Ala Arg 
385                 390                 395                 400 


Ser Val Ser Glu Asp Cys Glu Val Ala Gly Tyr Arg Val Leu Lys Gly 
                405                 410                 415     


Thr Glu Ala Val Ile Ile Pro Tyr Ala Leu His Arg Asp Pro Arg Tyr 
            420                 425                 430         


Phe Pro Asn Pro Glu Glu Phe Gln Pro Glu Arg Phe Phe Pro Glu Asn 
        435                 440                 445             


Ala Gln Gly Arg His Pro Tyr Ala Tyr Val Pro Phe Ser Ala Gly Pro 
    450                 455                 460                 


Arg Asn Cys Ile Gly Gln Lys Phe Ala Val Met Glu Glu Lys Thr Ile 
465                 470                 475                 480 


Leu Ser Cys Ile Leu Arg His Phe Trp Ile Glu Ser Asn Gln Lys Arg 
                485                 490                 495     


Glu Glu Leu Gly Leu Glu Gly Gln Leu Ile Leu Arg Pro Ser Asn Gly 
            500                 505                 510         


Ile Trp Ile Lys Leu Lys Arg Arg Asn Ala Asp Glu Arg 
        515                 520                 525 


<210>  143
<211>  236
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(236)
<223>  Retinoschisin 1 (RS1)

<400>  143

Met Ser Arg Lys Ile Glu Gly Phe Leu Leu Leu Leu Leu Phe Gly Tyr 
1               5                   10                  15      


Glu Ala Thr Leu Gly Leu Ser Ser Thr Glu Asp Glu Gly Glu Asp Pro 
            20                  25                  30          


Trp Tyr Gln Lys Ala Cys Asp Glu Gly Glu Asp Pro Trp Tyr Gln Lys 
        35                  40                  45              


Ala Cys Lys Cys Asp Cys Gln Gly Gly Pro Asn Ala Leu Trp Ser Ala 
    50                  55                  60                  


Gly Ala Thr Ser Leu Asp Cys Ile Pro Glu Cys Pro Tyr His Lys Pro 
65                  70                  75                  80  


Leu Gly Phe Glu Ser Gly Glu Val Thr Pro Asp Gln Ile Thr Cys Ser 
                85                  90                  95      


Asn Pro Glu Gln Tyr Val Gly Trp Tyr Ser Ser Trp Thr Ala Asn Lys 
            100                 105                 110         


Ala Arg Leu Asn Ser Gln Gly Phe Gly Cys Ala Trp Leu Ser Lys Phe 
        115                 120                 125             


Gln Asp Ser Ser Gln Trp Leu Gln Ile Asp Leu Lys Glu Ile Lys Val 
    130                 135                 140                 


Ile Ser Gly Ile Leu Thr Gln Gly Arg Cys Asp Ile Asp Glu Trp Met 
145                 150                 155                 160 


Thr Lys Tyr Ser Val Gln Tyr Arg Thr Asp Glu Arg Leu Asn Trp Ile 
                165                 170                 175     


Tyr Tyr Lys Asp Gln Thr Gly Asn Asn Arg Val Phe Tyr Gly Asn Ser 
            180                 185                 190         


Asp Arg Thr Ser Thr Val Gln Asn Leu Leu Arg Pro Pro Ile Ile Ser 
        195                 200                 205             


Arg Phe Ile Arg Leu Ile Pro Leu Gly Trp His Val Arg Ile Ala Ile 
    210                 215                 220                 


Arg Met Glu Leu Leu Glu Cys Val Ser Lys Cys Ala 
225                 230                 235     


<210>  144
<211>  854
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(854)
<223>  Phosphodiesterase 6B (PDE6B)

<400>  144

Met Ser Leu Ser Glu Glu Gln Ala Arg Ser Phe Leu Asp Gln Asn Pro 
1               5                   10                  15      


Asp Phe Ala Arg Gln Tyr Phe Gly Lys Lys Leu Ser Pro Glu Asn Val 
            20                  25                  30          


Ala Ala Ala Cys Glu Asp Gly Cys Pro Pro Asp Cys Asp Ser Leu Arg 
        35                  40                  45              


Asp Leu Cys Gln Val Glu Glu Ser Thr Ala Leu Leu Glu Leu Val Gln 
    50                  55                  60                  


Asp Met Gln Glu Ser Ile Asn Met Glu Arg Val Val Phe Lys Val Leu 
65                  70                  75                  80  


Arg Arg Leu Cys Thr Leu Leu Gln Ala Asp Arg Cys Ser Leu Phe Met 
                85                  90                  95      


Tyr Arg Gln Arg Asn Gly Val Ala Glu Leu Ala Thr Arg Leu Phe Ser 
            100                 105                 110         


Val Gln Pro Asp Ser Val Leu Glu Asp Cys Leu Val Pro Pro Asp Ser 
        115                 120                 125             


Glu Ile Val Phe Pro Leu Asp Ile Gly Val Val Gly His Val Ala Gln 
    130                 135                 140                 


Thr Lys Lys Met Val Asn Val Glu Asp Val Ala Glu Cys Pro His Phe 
145                 150                 155                 160 


Ser Ser Phe Ala Asp Glu Leu Thr Asp Tyr Lys Thr Lys Asn Met Leu 
                165                 170                 175     


Ala Thr Pro Ile Met Asn Gly Lys Asp Val Val Ala Val Ile Met Ala 
            180                 185                 190         


Val Asn Lys Leu Asn Gly Pro Phe Phe Thr Ser Glu Asp Glu Asp Val 
        195                 200                 205             


Phe Leu Lys Tyr Leu Asn Phe Ala Thr Leu Tyr Leu Lys Ile Tyr His 
    210                 215                 220                 


Leu Ser Tyr Leu His Asn Cys Glu Thr Arg Arg Gly Gln Val Leu Leu 
225                 230                 235                 240 


Trp Ser Ala Asn Lys Val Phe Glu Glu Leu Thr Asp Ile Glu Arg Gln 
                245                 250                 255     


Phe His Lys Ala Phe Tyr Thr Val Arg Ala Tyr Leu Asn Cys Glu Arg 
            260                 265                 270         


Tyr Ser Val Gly Leu Leu Asp Met Thr Lys Glu Lys Glu Phe Phe Asp 
        275                 280                 285             


Val Trp Ser Val Leu Met Gly Glu Ser Gln Pro Tyr Ser Gly Pro Arg 
    290                 295                 300                 


Thr Pro Asp Gly Arg Glu Ile Val Phe Tyr Lys Val Ile Asp Tyr Ile 
305                 310                 315                 320 


Leu His Gly Lys Glu Glu Ile Lys Val Ile Pro Thr Pro Ser Ala Asp 
                325                 330                 335     


His Trp Ala Leu Ala Ser Gly Leu Pro Ser Tyr Val Ala Glu Ser Gly 
            340                 345                 350         


Phe Ile Cys Asn Ile Met Asn Ala Ser Ala Asp Glu Met Phe Lys Phe 
        355                 360                 365             


Gln Glu Gly Ala Leu Asp Asp Ser Gly Trp Leu Ile Lys Asn Val Leu 
    370                 375                 380                 


Ser Met Pro Ile Val Asn Lys Lys Glu Glu Ile Val Gly Val Ala Thr 
385                 390                 395                 400 


Phe Tyr Asn Arg Lys Asp Gly Lys Pro Phe Asp Glu Gln Asp Glu Val 
                405                 410                 415     


Leu Met Glu Ser Leu Thr Gln Phe Leu Gly Trp Ser Val Met Asn Thr 
            420                 425                 430         


Asp Thr Tyr Asp Lys Met Asn Lys Leu Glu Asn Arg Lys Asp Ile Ala 
        435                 440                 445             


Gln Asp Met Val Leu Tyr His Val Lys Cys Asp Arg Asp Glu Ile Gln 
    450                 455                 460                 


Leu Ile Leu Pro Thr Arg Ala Arg Leu Gly Lys Glu Pro Ala Asp Cys 
465                 470                 475                 480 


Asp Glu Asp Glu Leu Gly Glu Ile Leu Lys Glu Glu Leu Pro Gly Pro 
                485                 490                 495     


Thr Thr Phe Asp Ile Tyr Glu Phe His Phe Ser Asp Leu Glu Cys Thr 
            500                 505                 510         


Glu Leu Asp Leu Val Lys Cys Gly Ile Gln Met Tyr Tyr Glu Leu Gly 
        515                 520                 525             


Val Val Arg Lys Phe Gln Ile Pro Gln Glu Val Leu Val Arg Phe Leu 
    530                 535                 540                 


Phe Ser Ile Ser Lys Gly Tyr Arg Arg Ile Thr Tyr His Asn Trp Arg 
545                 550                 555                 560 


His Gly Phe Asn Val Ala Gln Thr Met Phe Thr Leu Leu Met Thr Gly 
                565                 570                 575     


Lys Leu Lys Ser Tyr Tyr Thr Asp Leu Glu Ala Phe Ala Met Val Thr 
            580                 585                 590         


Ala Gly Leu Cys His Asp Ile Asp His Arg Gly Thr Asn Asn Leu Tyr 
        595                 600                 605             


Gln Met Lys Ser Gln Asn Pro Leu Ala Lys Leu His Gly Ser Ser Ile 
    610                 615                 620                 


Leu Glu Arg His His Leu Glu Phe Gly Lys Phe Leu Leu Ser Glu Glu 
625                 630                 635                 640 


Thr Leu Asn Ile Tyr Gln Asn Leu Asn Arg Arg Gln His Glu His Val 
                645                 650                 655     


Ile His Leu Met Asp Ile Ala Ile Ile Ala Thr Asp Leu Ala Leu Tyr 
            660                 665                 670         


Phe Lys Lys Arg Ala Met Phe Gln Lys Ile Val Asp Glu Ser Lys Asn 
        675                 680                 685             


Tyr Gln Asp Lys Lys Ser Trp Val Glu Tyr Leu Ser Leu Glu Thr Thr 
    690                 695                 700                 


Arg Lys Glu Ile Val Met Ala Met Met Met Thr Ala Cys Asp Leu Ser 
705                 710                 715                 720 


Ala Ile Thr Lys Pro Trp Glu Val Gln Ser Lys Val Ala Leu Leu Val 
                725                 730                 735     


Ala Ala Glu Phe Trp Glu Gln Gly Asp Leu Glu Arg Thr Val Leu Asp 
            740                 745                 750         


Gln Gln Pro Ile Pro Met Met Asp Arg Asn Lys Ala Ala Glu Leu Pro 
        755                 760                 765             


Lys Leu Gln Val Gly Phe Ile Asp Phe Val Cys Thr Phe Val Tyr Lys 
    770                 775                 780                 


Glu Phe Ser Arg Phe His Glu Glu Ile Leu Pro Met Phe Asp Arg Leu 
785                 790                 795                 800 


Gln Asn Asn Arg Lys Glu Trp Lys Ala Leu Ala Asp Glu Tyr Glu Ala 
                805                 810                 815     


Lys Val Lys Ala Leu Glu Glu Lys Glu Glu Glu Glu Arg Val Ala Ala 
            820                 825                 830         


Lys Lys Val Gly Thr Glu Ile Cys Asn Gly Gly Pro Ala Pro Lys Ser 
        835                 840                 845             


Ser Thr Cys Cys Ile Leu 
    850                 


<210>  145
<211>  498
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(498)
<223>  Methyl-CpG Binding Protein 2 (MeCP2)

<400>  145

Met Ala Ala Ala Ala Ala Ala Ala Pro Ser Gly Gly Gly Gly Gly Gly 
1               5                   10                  15      


Glu Glu Glu Arg Leu Glu Glu Lys Ser Glu Asp Gln Asp Leu Gln Gly 
            20                  25                  30          


Leu Lys Asp Lys Pro Leu Lys Phe Lys Lys Val Lys Lys Asp Lys Lys 
        35                  40                  45              


Glu Glu Lys Glu Gly Lys His Glu Pro Val Gln Pro Ser Ala His His 
    50                  55                  60                  


Ser Ala Glu Pro Ala Glu Ala Gly Lys Ala Glu Thr Ser Glu Gly Ser 
65                  70                  75                  80  


Gly Ser Ala Pro Ala Val Pro Glu Ala Ser Ala Ser Pro Lys Gln Arg 
                85                  90                  95      


Arg Ser Ile Ile Arg Asp Arg Gly Pro Met Tyr Asp Asp Pro Thr Leu 
            100                 105                 110         


Pro Glu Gly Trp Thr Arg Lys Leu Lys Gln Arg Lys Ser Gly Arg Ser 
        115                 120                 125             


Ala Gly Lys Tyr Asp Val Tyr Leu Ile Asn Pro Gln Gly Lys Ala Phe 
    130                 135                 140                 


Arg Ser Lys Val Glu Leu Ile Ala Tyr Phe Glu Lys Val Gly Asp Thr 
145                 150                 155                 160 


Ser Leu Asp Pro Asn Asp Phe Asp Phe Thr Val Thr Gly Arg Gly Ser 
                165                 170                 175     


Pro Ser Arg Arg Glu Gln Lys Pro Pro Lys Lys Pro Lys Ser Pro Lys 
            180                 185                 190         


Ala Pro Gly Thr Gly Arg Gly Arg Gly Arg Pro Lys Gly Ser Gly Thr 
        195                 200                 205             


Thr Arg Pro Lys Ala Ala Thr Ser Glu Gly Val Gln Val Lys Arg Val 
    210                 215                 220                 


Leu Glu Lys Ser Pro Gly Lys Leu Leu Val Lys Met Pro Phe Gln Thr 
225                 230                 235                 240 


Ser Pro Gly Gly Lys Ala Glu Gly Gly Gly Ala Thr Thr Ser Thr Gln 
                245                 250                 255     


Val Met Val Ile Lys Arg Pro Gly Arg Lys Arg Lys Ala Glu Ala Asp 
            260                 265                 270         


Pro Gln Ala Ile Pro Lys Lys Arg Gly Arg Lys Pro Gly Ser Val Val 
        275                 280                 285             


Ala Ala Ala Ala Ala Glu Ala Lys Lys Lys Ala Val Lys Glu Ser Ser 
    290                 295                 300                 


Ile Arg Ser Val Gln Glu Thr Val Leu Pro Ile Lys Lys Arg Lys Thr 
305                 310                 315                 320 


Arg Glu Thr Val Ser Ile Glu Val Lys Glu Val Val Lys Pro Leu Leu 
                325                 330                 335     


Val Ser Thr Leu Gly Glu Lys Ser Gly Lys Gly Leu Lys Thr Cys Lys 
            340                 345                 350         


Ser Pro Gly Arg Lys Ser Lys Glu Ser Ser Pro Lys Gly Arg Ser Ser 
        355                 360                 365             


Ser Ala Ser Ser Pro Pro Lys Lys Glu His His His His His His His 
    370                 375                 380                 


Ser Glu Ser Pro Lys Ala Pro Val Pro Leu Leu Pro Pro Leu Pro Pro 
385                 390                 395                 400 


Pro Pro Pro Glu Pro Glu Ser Ser Glu Asp Pro Thr Ser Pro Pro Glu 
                405                 410                 415     


Pro Gln Asp Leu Ser Ser Ser Val Cys Lys Glu Glu Lys Met Pro Arg 
            420                 425                 430         


Gly Gly Ser Leu Glu Ser Asp Gly Cys Pro Lys Glu Pro Ala Lys Thr 
        435                 440                 445             


Gln Pro Ala Val Ala Thr Ala Ala Thr Ala Ala Glu Lys Tyr Lys His 
    450                 455                 460                 


Arg Gly Glu Gly Glu Arg Lys Asp Ile Val Ser Ser Ser Met Pro Arg 
465                 470                 475                 480 


Pro Asn Arg Glu Glu Pro Val Asp Ser Arg Thr Pro Val Thr Glu Arg 
                485                 490                 495     


Val Ser 
        


<210>  146
<211>  743
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(743)
<223>  N-acetyl-alpha-glucosaminidase (NAGLU)

<400>  146

Met Glu Ala Val Ala Val Ala Ala Ala Val Gly Val Leu Leu Leu Ala 
1               5                   10                  15      


Gly Ala Gly Gly Ala Ala Gly Asp Glu Ala Arg Glu Ala Ala Ala Val 
            20                  25                  30          


Arg Ala Leu Val Ala Arg Leu Leu Gly Pro Gly Pro Ala Ala Asp Phe 
        35                  40                  45              


Ser Val Ser Val Glu Arg Ala Leu Ala Ala Lys Pro Gly Leu Asp Thr 
    50                  55                  60                  


Tyr Ser Leu Gly Gly Gly Gly Ala Ala Arg Val Arg Val Arg Gly Ser 
65                  70                  75                  80  


Thr Gly Val Ala Ala Ala Ala Gly Leu His Arg Tyr Leu Arg Asp Phe 
                85                  90                  95      


Cys Gly Cys His Val Ala Trp Ser Gly Ser Gln Leu Arg Leu Pro Arg 
            100                 105                 110         


Pro Leu Pro Ala Val Pro Gly Glu Leu Thr Glu Ala Thr Pro Asn Arg 
        115                 120                 125             


Tyr Arg Tyr Tyr Gln Asn Val Cys Thr Gln Ser Tyr Ser Phe Val Trp 
    130                 135                 140                 


Trp Asp Trp Ala Arg Trp Glu Arg Glu Ile Asp Trp Met Ala Leu Asn 
145                 150                 155                 160 


Gly Ile Asn Leu Ala Leu Ala Trp Ser Gly Gln Glu Ala Ile Trp Gln 
                165                 170                 175     


Arg Val Tyr Leu Ala Leu Gly Leu Thr Gln Ala Glu Ile Asn Glu Phe 
            180                 185                 190         


Phe Thr Gly Pro Ala Phe Leu Ala Trp Gly Arg Met Gly Asn Leu His 
        195                 200                 205             


Thr Trp Asp Gly Pro Leu Pro Pro Ser Trp His Ile Lys Gln Leu Tyr 
    210                 215                 220                 


Leu Gln His Arg Val Leu Asp Gln Met Arg Ser Phe Gly Met Thr Pro 
225                 230                 235                 240 


Val Leu Pro Ala Phe Ala Gly His Val Pro Glu Ala Val Thr Arg Val 
                245                 250                 255     


Phe Pro Gln Val Asn Val Thr Lys Met Gly Ser Trp Gly His Phe Asn 
            260                 265                 270         


Cys Ser Tyr Ser Cys Ser Phe Leu Leu Ala Pro Glu Asp Pro Ile Phe 
        275                 280                 285             


Pro Ile Ile Gly Ser Leu Phe Leu Arg Glu Leu Ile Lys Glu Phe Gly 
    290                 295                 300                 


Thr Asp His Ile Tyr Gly Ala Asp Thr Phe Asn Glu Met Gln Pro Pro 
305                 310                 315                 320 


Ser Ser Glu Pro Ser Tyr Leu Ala Ala Ala Thr Thr Ala Val Tyr Glu 
                325                 330                 335     


Ala Met Thr Ala Val Asp Thr Glu Ala Val Trp Leu Leu Gln Gly Trp 
            340                 345                 350         


Leu Phe Gln His Gln Pro Gln Phe Trp Gly Pro Ala Gln Ile Arg Ala 
        355                 360                 365             


Val Leu Gly Ala Val Pro Arg Gly Arg Leu Leu Val Leu Asp Leu Phe 
    370                 375                 380                 


Ala Glu Ser Gln Pro Val Tyr Thr Arg Thr Ala Ser Phe Gln Gly Gln 
385                 390                 395                 400 


Pro Phe Ile Trp Cys Met Leu His Asn Phe Gly Gly Asn His Gly Leu 
                405                 410                 415     


Phe Gly Ala Leu Glu Ala Val Asn Gly Gly Pro Glu Ala Ala Arg Leu 
            420                 425                 430         


Phe Pro Asn Ser Thr Met Val Gly Thr Gly Met Ala Pro Glu Gly Ile 
        435                 440                 445             


Ser Gln Asn Glu Val Val Tyr Ser Leu Met Ala Glu Leu Gly Trp Arg 
    450                 455                 460                 


Lys Asp Pro Val Pro Asp Leu Ala Ala Trp Val Thr Ser Phe Ala Ala 
465                 470                 475                 480 


Arg Arg Tyr Gly Val Ser His Pro Asp Ala Gly Ala Ala Trp Arg Leu 
                485                 490                 495     


Leu Leu Arg Ser Val Tyr Asn Cys Ser Gly Glu Ala Cys Arg Gly His 
            500                 505                 510         


Asn Arg Ser Pro Leu Val Arg Arg Pro Ser Leu Gln Met Asn Thr Ser 
        515                 520                 525             


Ile Trp Tyr Asn Arg Ser Asp Val Phe Glu Ala Trp Arg Leu Leu Leu 
    530                 535                 540                 


Thr Ser Ala Pro Ser Leu Ala Thr Ser Pro Ala Phe Arg Tyr Asp Leu 
545                 550                 555                 560 


Leu Asp Leu Thr Arg Gln Ala Val Gln Glu Leu Val Ser Leu Tyr Tyr 
                565                 570                 575     


Glu Glu Ala Arg Ser Ala Tyr Leu Ser Lys Glu Leu Ala Ser Leu Leu 
            580                 585                 590         


Arg Ala Gly Gly Val Leu Ala Tyr Glu Leu Leu Pro Ala Leu Asp Glu 
        595                 600                 605             


Val Leu Ala Ser Asp Ser Arg Phe Leu Leu Gly Ser Trp Leu Glu Gln 
    610                 615                 620                 


Ala Arg Ala Ala Ala Val Ser Glu Ala Glu Ala Asp Phe Tyr Glu Gln 
625                 630                 635                 640 


Asn Ser Arg Tyr Gln Leu Thr Leu Trp Gly Pro Glu Gly Asn Ile Leu 
                645                 650                 655     


Asp Tyr Ala Asn Lys Gln Leu Ala Gly Leu Val Ala Asn Tyr Tyr Thr 
            660                 665                 670         


Pro Arg Trp Arg Leu Phe Leu Glu Ala Leu Val Asp Ser Val Ala Gln 
        675                 680                 685             


Gly Ile Pro Phe Gln Gln His Gln Phe Asp Lys Asn Val Phe Gln Leu 
    690                 695                 700                 


Glu Gln Ala Phe Val Leu Ser Lys Gln Arg Tyr Pro Ser Gln Pro Arg 
705                 710                 715                 720 


Gly Asp Thr Val Asp Leu Ala Lys Lys Ile Phe Leu Lys Tyr Tyr Pro 
                725                 730                 735     


Gly Trp Val Ala Gly Ser Trp 
            740             


<210>  147

<400>  147
000

<210>  148
<211>  429
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<222>  (1)..(429)
<223>  Alpha-Galactosidase A (GLA)

<400>  148

Met Gln Leu Arg Asn Pro Glu Leu His Leu Gly Cys Ala Leu Ala Leu 
1               5                   10                  15      


Arg Phe Leu Ala Leu Val Ser Trp Asp Ile Pro Gly Ala Arg Ala Leu 
            20                  25                  30          


Asp Asn Gly Leu Ala Arg Thr Pro Thr Met Gly Trp Leu His Trp Glu 
        35                  40                  45              


Arg Phe Met Cys Asn Leu Asp Cys Gln Glu Glu Pro Asp Ser Cys Ile 
    50                  55                  60                  


Ser Glu Lys Leu Phe Met Glu Met Ala Glu Leu Met Val Ser Glu Gly 
65                  70                  75                  80  


Trp Lys Asp Ala Gly Tyr Glu Tyr Leu Cys Ile Asp Asp Cys Trp Met 
                85                  90                  95      


Ala Pro Gln Arg Asp Ser Glu Gly Arg Leu Gln Ala Asp Pro Gln Arg 
            100                 105                 110         


Phe Pro His Gly Ile Arg Gln Leu Ala Asn Tyr Val His Ser Lys Gly 
        115                 120                 125             


Leu Lys Leu Gly Ile Tyr Ala Asp Val Gly Asn Lys Thr Cys Ala Gly 
    130                 135                 140                 


Phe Pro Gly Ser Phe Gly Tyr Tyr Asp Ile Asp Ala Gln Thr Phe Ala 
145                 150                 155                 160 


Asp Trp Gly Val Asp Leu Leu Lys Phe Asp Gly Cys Tyr Cys Asp Ser 
                165                 170                 175     


Leu Glu Asn Leu Ala Asp Gly Tyr Lys His Met Ser Leu Ala Leu Asn 
            180                 185                 190         


Arg Thr Gly Arg Ser Ile Val Tyr Ser Cys Glu Trp Pro Leu Tyr Met 
        195                 200                 205             


Trp Pro Phe Gln Lys Pro Asn Tyr Thr Glu Ile Arg Gln Tyr Cys Asn 
    210                 215                 220                 


His Trp Arg Asn Phe Ala Asp Ile Asp Asp Ser Trp Lys Ser Ile Lys 
225                 230                 235                 240 


Ser Ile Leu Asp Trp Thr Ser Phe Asn Gln Glu Arg Ile Val Asp Val 
                245                 250                 255     


Ala Gly Pro Gly Gly Trp Asn Asp Pro Asp Met Leu Val Ile Gly Asn 
            260                 265                 270         


Phe Gly Leu Ser Trp Asn Gln Gln Val Thr Gln Met Ala Leu Trp Ala 
        275                 280                 285             


Ile Met Ala Ala Pro Leu Phe Met Ser Asn Asp Leu Arg His Ile Ser 
    290                 295                 300                 


Pro Gln Ala Lys Ala Leu Leu Gln Asp Lys Asp Val Ile Ala Ile Asn 
305                 310                 315                 320 


Gln Asp Pro Leu Gly Lys Gln Gly Tyr Gln Leu Arg Gln Gly Asp Asn 
                325                 330                 335     


Phe Glu Val Trp Glu Arg Pro Leu Ser Gly Leu Ala Trp Ala Val Ala 
            340                 345                 350         


Met Ile Asn Arg Gln Glu Ile Gly Gly Pro Arg Ser Tyr Thr Ile Ala 
        355                 360                 365             


Val Ala Ser Leu Gly Lys Gly Val Ala Cys Asn Pro Ala Cys Phe Ile 
    370                 375                 380                 


Thr Gln Leu Leu Pro Val Lys Arg Lys Leu Gly Phe Tyr Glu Trp Thr 
385                 390                 395                 400 


Ser Arg Leu Arg Ser His Ile Asn Pro Thr Gly Thr Val Leu Leu Gln 
                405                 410                 415     


Leu Glu Asn Thr Met Gln Met Ser Leu Lys Asp Leu Leu 
            420                 425                 


<210>  149
<211>  458
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized + GET, CO1-GLA-GET

<400>  149

Met Gln Leu Arg Asn Pro Glu Leu His Leu Gly Cys Ala Leu Ala Leu 
1               5                   10                  15      


Arg Phe Leu Ala Leu Val Ser Trp Asp Ile Pro Gly Ala Arg Ala Leu 
            20                  25                  30          


Asp Asn Gly Leu Ala Arg Thr Pro Thr Met Gly Trp Leu His Trp Glu 
        35                  40                  45              


Arg Phe Met Cys Asn Leu Asp Cys Gln Glu Glu Pro Asp Ser Cys Ile 
    50                  55                  60                  


Ser Glu Lys Leu Phe Met Glu Met Ala Glu Leu Met Val Ser Glu Gly 
65                  70                  75                  80  


Trp Lys Asp Ala Gly Tyr Glu Tyr Leu Cys Ile Asp Asp Cys Trp Met 
                85                  90                  95      


Ala Pro Gln Arg Asp Ser Glu Gly Arg Leu Gln Ala Asp Pro Gln Arg 
            100                 105                 110         


Phe Pro His Gly Ile Arg Gln Leu Ala Asn Tyr Val His Ser Lys Gly 
        115                 120                 125             


Leu Lys Leu Gly Ile Tyr Ala Asp Val Gly Asn Lys Thr Cys Ala Gly 
    130                 135                 140                 


Phe Pro Gly Ser Phe Gly Tyr Tyr Asp Ile Asp Ala Gln Thr Phe Ala 
145                 150                 155                 160 


Asp Trp Gly Val Asp Leu Leu Lys Phe Asp Gly Cys Tyr Cys Asp Ser 
                165                 170                 175     


Leu Glu Asn Leu Ala Asp Gly Tyr Lys His Met Ser Leu Ala Leu Asn 
            180                 185                 190         


Arg Thr Gly Arg Ser Ile Val Tyr Ser Cys Glu Trp Pro Leu Tyr Met 
        195                 200                 205             


Trp Pro Phe Gln Lys Pro Asn Tyr Thr Glu Ile Arg Gln Tyr Cys Asn 
    210                 215                 220                 


His Trp Arg Asn Phe Ala Asp Ile Asp Asp Ser Trp Lys Ser Ile Lys 
225                 230                 235                 240 


Ser Ile Leu Asp Trp Thr Ser Phe Asn Gln Glu Arg Ile Val Asp Val 
                245                 250                 255     


Ala Gly Pro Gly Gly Trp Asn Asp Pro Asp Met Leu Val Ile Gly Asn 
            260                 265                 270         


Phe Gly Leu Ser Trp Asn Gln Gln Val Thr Gln Met Ala Leu Trp Ala 
        275                 280                 285             


Ile Met Ala Ala Pro Leu Phe Met Ser Asn Asp Leu Arg His Ile Ser 
    290                 295                 300                 


Pro Gln Ala Lys Ala Leu Leu Gln Asp Lys Asp Val Ile Ala Ile Asn 
305                 310                 315                 320 


Gln Asp Pro Leu Gly Lys Gln Gly Tyr Gln Leu Arg Gln Gly Asp Asn 
                325                 330                 335     


Phe Glu Val Trp Glu Arg Pro Leu Ser Gly Leu Ala Trp Ala Val Ala 
            340                 345                 350         


Met Ile Asn Arg Gln Glu Ile Gly Gly Pro Arg Ser Tyr Thr Ile Ala 
        355                 360                 365             


Val Ala Ser Leu Gly Lys Gly Val Ala Cys Asn Pro Ala Cys Phe Ile 
    370                 375                 380                 


Thr Gln Leu Leu Pro Val Lys Arg Lys Leu Gly Phe Tyr Glu Trp Thr 
385                 390                 395                 400 


Ser Arg Leu Arg Ser His Ile Asn Pro Thr Gly Thr Val Leu Leu Gln 
                405                 410                 415     


Leu Glu Asn Thr Met Gln Met Ser Leu Lys Asp Leu Leu Arg Arg Arg 
            420                 425                 430         


Arg Arg Arg Arg Arg Lys Arg Lys Lys Lys Gly Lys Gly Leu Gly Lys 
        435                 440                 445             


Lys Arg Asp Pro Cys Leu Arg Lys Tyr Lys 
    450                 455             


<210>  150
<211>  1428
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized, Cystic Fibrosis Transmembrane Regulator deltaR 
       (CFTRdeltaR) contains R domain deletion

<400>  150

Met Gln Arg Ser Pro Leu Glu Lys Ala Ser Val Val Ser Lys Leu Phe 
1               5                   10                  15      


Phe Ser Trp Thr Arg Pro Ile Leu Arg Lys Gly Tyr Arg Gln Arg Leu 
            20                  25                  30          


Glu Leu Ser Asp Ile Tyr Gln Ile Pro Ser Val Asp Ser Ala Asp Asn 
        35                  40                  45              


Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys 
    50                  55                  60                  


Lys Asn Pro Lys Leu Ile Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg 
65                  70                  75                  80  


Phe Met Phe Tyr Gly Ile Phe Leu Tyr Leu Gly Glu Val Thr Lys Ala 
                85                  90                  95      


Val Gln Pro Leu Leu Leu Gly Arg Ile Ile Ala Ser Tyr Asp Pro Asp 
            100                 105                 110         


Asn Lys Glu Glu Arg Ser Ile Ala Ile Tyr Leu Gly Ile Gly Leu Cys 
        115                 120                 125             


Leu Leu Phe Ile Val Arg Thr Leu Leu Leu His Pro Ala Ile Phe Gly 
    130                 135                 140                 


Leu His His Ile Gly Met Gln Met Arg Ile Ala Met Phe Ser Leu Ile 
145                 150                 155                 160 


Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys Ile Ser 
                165                 170                 175     


Ile Gly Gln Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp 
            180                 185                 190         


Glu Gly Leu Ala Leu Ala His Phe Val Trp Ile Ala Pro Leu Gln Val 
        195                 200                 205             


Ala Leu Leu Met Gly Leu Ile Trp Glu Leu Leu Gln Ala Ser Ala Phe 
    210                 215                 220                 


Cys Gly Leu Gly Phe Leu Ile Val Leu Ala Leu Phe Gln Ala Gly Leu 
225                 230                 235                 240 


Gly Arg Met Met Met Lys Tyr Arg Asp Gln Arg Ala Gly Lys Ile Ser 
                245                 250                 255     


Glu Arg Leu Val Ile Thr Ser Glu Met Ile Glu Asn Ile Gln Ser Val 
            260                 265                 270         


Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met Ile Glu Asn Leu 
        275                 280                 285             


Arg Gln Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr 
    290                 295                 300                 


Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu 
305                 310                 315                 320 


Ser Val Leu Pro Tyr Ala Leu Ile Lys Gly Ile Ile Leu Arg Lys Ile 
                325                 330                 335     


Phe Thr Thr Ile Ser Phe Cys Ile Val Leu Arg Met Ala Val Thr Arg 
            340                 345                 350         


Gln Phe Pro Trp Ala Val Gln Thr Trp Tyr Asp Ser Leu Gly Ala Ile 
        355                 360                 365             


Asn Lys Ile Gln Asp Phe Leu Gln Lys Gln Glu Tyr Lys Thr Leu Glu 
    370                 375                 380                 


Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe 
385                 390                 395                 400 


Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gln Asn Asn 
                405                 410                 415     


Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn 
            420                 425                 430         


Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp Ile Asn Phe Lys Ile 
        435                 440                 445             


Glu Arg Gly Gln Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys 
    450                 455                 460                 


Thr Ser Leu Leu Met Met Ile Met Gly Glu Leu Glu Pro Ser Glu Gly 
465                 470                 475                 480 


Lys Ile Lys His Ser Gly Arg Ile Ser Phe Cys Ser Gln Phe Ser Trp 
                485                 490                 495     


Ile Met Pro Gly Thr Ile Lys Glu Asn Ile Ile Phe Gly Val Ser Tyr 
            500                 505                 510         


Asp Glu Tyr Arg Tyr Arg Ser Val Ile Lys Ala Cys Gln Leu Glu Glu 
        515                 520                 525             


Asp Ile Ser Lys Phe Ala Glu Lys Asp Asn Ile Val Leu Gly Glu Gly 
    530                 535                 540                 


Gly Ile Thr Leu Ser Gly Gly Gln Arg Ala Arg Ile Ser Leu Ala Arg 
545                 550                 555                 560 


Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly 
                565                 570                 575     


Tyr Leu Asp Val Leu Thr Glu Lys Glu Ile Phe Glu Ser Cys Val Cys 
            580                 585                 590         


Lys Leu Met Ala Asn Lys Thr Arg Ile Leu Val Thr Ser Lys Met Glu 
        595                 600                 605             


His Leu Lys Lys Ala Asp Lys Ile Leu Ile Leu His Glu Gly Ser Ser 
    610                 615                 620                 


Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gln Asn Leu Gln Pro Asp Phe 
625                 630                 635                 640 


Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gln Phe Ser Ala Glu 
                645                 650                 655     


Arg Arg Asn Ser Ile Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu 
            660                 665                 670         


Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gln Ser Phe Lys 
        675                 680                 685             


Gln Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser Ile Leu Asn Pro 
    690                 695                 700                 


Ile Asn Ser Thr Leu Gln Ala Arg Arg Arg Gln Ser Val Leu Asn Leu 
705                 710                 715                 720 


Met Thr His Ser Val Asn Gln Gly Gln Asn Ile His Arg Lys Thr Thr 
                725                 730                 735     


Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gln Ala Asn Leu Thr Glu 
            740                 745                 750         


Leu Asp Ile Tyr Ser Arg Arg Leu Ser Gln Glu Thr Gly Leu Glu Ile 
        755                 760                 765             


Ser Glu Glu Ile Asn Glu Glu Asp Leu Lys Glu Cys Phe Phe Asp Asp 
    770                 775                 780                 


Met Glu Ser Ile Pro Ala Val Thr Thr Trp Asn Thr Tyr Leu Arg Tyr 
785                 790                 795                 800 


Ile Thr Val His Lys Ser Leu Ile Phe Val Leu Ile Trp Cys Leu Val 
                805                 810                 815     


Ile Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val Leu Trp Leu Leu 
            820                 825                 830         


Gly Asn Thr Pro Leu Gln Asp Lys Gly Asn Ser Thr His Ser Arg Asn 
        835                 840                 845             


Asn Ser Tyr Ala Val Ile Ile Thr Ser Thr Ser Ser Tyr Tyr Val Phe 
    850                 855                 860                 


Tyr Ile Tyr Val Gly Val Ala Asp Thr Leu Leu Ala Met Gly Phe Phe 
865                 870                 875                 880 


Arg Gly Leu Pro Leu Val His Thr Leu Ile Thr Val Ser Lys Ile Leu 
                885                 890                 895     


His His Lys Met Leu His Ser Val Leu Gln Ala Pro Met Ser Thr Leu 
            900                 905                 910         


Asn Thr Leu Lys Ala Gly Gly Ile Leu Asn Arg Phe Ser Lys Asp Ile 
        915                 920                 925             


Ala Ile Leu Asp Asp Leu Leu Pro Leu Thr Ile Phe Asp Phe Ile Gln 
    930                 935                 940                 


Leu Leu Leu Ile Val Ile Gly Ala Ile Ala Val Val Ala Val Leu Gln 
945                 950                 955                 960 


Pro Tyr Ile Phe Val Ala Thr Val Pro Val Ile Val Ala Phe Ile Met 
                965                 970                 975     


Leu Arg Ala Tyr Phe Leu Gln Thr Ser Gln Gln Leu Lys Gln Leu Glu 
            980                 985                 990         


Ser Glu Gly Arg Ser Pro Ile Phe  Thr His Leu Val Thr  Ser Leu Lys 
        995                 1000                 1005             


Gly Leu  Trp Thr Leu Arg Ala  Phe Gly Arg Gln Pro  Tyr Phe Glu 
    1010                 1015                 1020             


Thr Leu  Phe His Lys Ala Leu  Asn Leu His Thr Ala  Asn Trp Phe 
    1025                 1030                 1035             


Leu Tyr  Leu Ser Thr Leu Arg  Trp Phe Gln Met Arg  Ile Glu Met 
    1040                 1045                 1050             


Ile Phe  Val Ile Phe Phe Ile  Ala Val Thr Phe Ile  Ser Ile Leu 
    1055                 1060                 1065             


Thr Thr  Gly Glu Gly Glu Gly  Arg Val Gly Ile Ile  Leu Thr Leu 
    1070                 1075                 1080             


Ala Met  Asn Ile Met Ser Thr  Leu Gln Trp Ala Val  Asn Ser Ser 
    1085                 1090                 1095             


Ile Asp  Val Asp Ser Leu Met  Arg Ser Val Ser Arg  Val Phe Lys 
    1100                 1105                 1110             


Phe Ile  Asp Met Pro Thr Glu  Gly Lys Pro Thr Lys  Ser Thr Lys 
    1115                 1120                 1125             


Pro Tyr  Lys Asn Gly Gln Leu  Ser Lys Val Met Ile  Ile Glu Asn 
    1130                 1135                 1140             


Ser His  Val Lys Lys Asp Asp  Ile Trp Pro Ser Gly  Gly Gln Met 
    1145                 1150                 1155             


Thr Val  Lys Asp Leu Thr Ala  Lys Tyr Thr Glu Gly  Gly Asn Ala 
    1160                 1165                 1170             


Ile Leu  Glu Asn Ile Ser Phe  Ser Ile Ser Pro Gly  Gln Arg Val 
    1175                 1180                 1185             


Gly Leu  Leu Gly Arg Thr Gly  Ser Gly Lys Ser Thr  Leu Leu Ser 
    1190                 1195                 1200             


Ala Phe  Leu Arg Leu Leu Asn  Thr Glu Gly Glu Ile  Gln Ile Asp 
    1205                 1210                 1215             


Gly Val  Ser Trp Asp Ser Ile  Thr Leu Gln Gln Trp  Arg Lys Ala 
    1220                 1225                 1230             


Phe Gly  Val Ile Pro Gln Lys  Val Phe Ile Phe Ser  Gly Thr Phe 
    1235                 1240                 1245             


Arg Lys  Asn Leu Asp Pro Tyr  Glu Gln Trp Ser Asp  Gln Glu Ile 
    1250                 1255                 1260             


Trp Lys  Val Ala Asp Glu Val  Gly Leu Arg Ser Val  Ile Glu Gln 
    1265                 1270                 1275             


Phe Pro  Gly Lys Leu Asp Phe  Val Leu Val Asp Gly  Gly Cys Val 
    1280                 1285                 1290             


Leu Ser  His Gly His Lys Gln  Leu Met Cys Leu Ala  Arg Ser Val 
    1295                 1300                 1305             


Leu Ser  Lys Ala Lys Ile Leu  Leu Leu Asp Glu Pro  Ser Ala His 
    1310                 1315                 1320             


Leu Asp  Pro Val Thr Tyr Gln  Ile Ile Arg Arg Thr  Leu Lys Gln 
    1325                 1330                 1335             


Ala Phe  Ala Asp Cys Thr Val  Ile Leu Cys Glu His  Arg Ile Glu 
    1340                 1345                 1350             


Ala Met  Leu Glu Cys Gln Gln  Phe Leu Val Ile Glu  Glu Asn Lys 
    1355                 1360                 1365             


Val Arg  Gln Tyr Asp Ser Ile  Gln Lys Leu Leu Asn  Glu Arg Ser 
    1370                 1375                 1380             


Leu Phe  Arg Gln Ala Ile Ser  Pro Ser Asp Arg Val  Lys Leu Phe 
    1385                 1390                 1395             


Pro His  Arg Asn Ser Ser Lys  Cys Lys Ser Lys Pro  Gln Ile Ala 
    1400                 1405                 1410             


Ala Leu  Lys Glu Glu Thr Glu  Glu Glu Val Gln Asp  Thr Arg Leu 
    1415                 1420                 1425             


<210>  151
<211>  1480
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized, full length Cystic Fibrosis Transmembrane 
       Regulator (CFTR)

<400>  151

Met Gln Arg Ser Pro Leu Glu Lys Ala Ser Val Val Ser Lys Leu Phe 
1               5                   10                  15      


Phe Ser Trp Thr Arg Pro Ile Leu Arg Lys Gly Tyr Arg Gln Arg Leu 
            20                  25                  30          


Glu Leu Ser Asp Ile Tyr Gln Ile Pro Ser Val Asp Ser Ala Asp Asn 
        35                  40                  45              


Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys 
    50                  55                  60                  


Lys Asn Pro Lys Leu Ile Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg 
65                  70                  75                  80  


Phe Met Phe Tyr Gly Ile Phe Leu Tyr Leu Gly Glu Val Thr Lys Ala 
                85                  90                  95      


Val Gln Pro Leu Leu Leu Gly Arg Ile Ile Ala Ser Tyr Asp Pro Asp 
            100                 105                 110         


Asn Lys Glu Glu Arg Ser Ile Ala Ile Tyr Leu Gly Ile Gly Leu Cys 
        115                 120                 125             


Leu Leu Phe Ile Val Arg Thr Leu Leu Leu His Pro Ala Ile Phe Gly 
    130                 135                 140                 


Leu His His Ile Gly Met Gln Met Arg Ile Ala Met Phe Ser Leu Ile 
145                 150                 155                 160 


Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys Ile Ser 
                165                 170                 175     


Ile Gly Gln Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp 
            180                 185                 190         


Glu Gly Leu Ala Leu Ala His Phe Val Trp Ile Ala Pro Leu Gln Val 
        195                 200                 205             


Ala Leu Leu Met Gly Leu Ile Trp Glu Leu Leu Gln Ala Ser Ala Phe 
    210                 215                 220                 


Cys Gly Leu Gly Phe Leu Ile Val Leu Ala Leu Phe Gln Ala Gly Leu 
225                 230                 235                 240 


Gly Arg Met Met Met Lys Tyr Arg Asp Gln Arg Ala Gly Lys Ile Ser 
                245                 250                 255     


Glu Arg Leu Val Ile Thr Ser Glu Met Ile Glu Asn Ile Gln Ser Val 
            260                 265                 270         


Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met Ile Glu Asn Leu 
        275                 280                 285             


Arg Gln Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr 
    290                 295                 300                 


Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu 
305                 310                 315                 320 


Ser Val Leu Pro Tyr Ala Leu Ile Lys Gly Ile Ile Leu Arg Lys Ile 
                325                 330                 335     


Phe Thr Thr Ile Ser Phe Cys Ile Val Leu Arg Met Ala Val Thr Arg 
            340                 345                 350         


Gln Phe Pro Trp Ala Val Gln Thr Trp Tyr Asp Ser Leu Gly Ala Ile 
        355                 360                 365             


Asn Lys Ile Gln Asp Phe Leu Gln Lys Gln Glu Tyr Lys Thr Leu Glu 
    370                 375                 380                 


Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe 
385                 390                 395                 400 


Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gln Asn Asn 
                405                 410                 415     


Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn 
            420                 425                 430         


Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp Ile Asn Phe Lys Ile 
        435                 440                 445             


Glu Arg Gly Gln Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys 
    450                 455                 460                 


Thr Ser Leu Leu Met Met Ile Met Gly Glu Leu Glu Pro Ser Glu Gly 
465                 470                 475                 480 


Lys Ile Lys His Ser Gly Arg Ile Ser Phe Cys Ser Gln Phe Ser Trp 
                485                 490                 495     


Ile Met Pro Gly Thr Ile Lys Glu Asn Ile Ile Phe Gly Val Ser Tyr 
            500                 505                 510         


Asp Glu Tyr Arg Tyr Arg Ser Val Ile Lys Ala Cys Gln Leu Glu Glu 
        515                 520                 525             


Asp Ile Ser Lys Phe Ala Glu Lys Asp Asn Ile Val Leu Gly Glu Gly 
    530                 535                 540                 


Gly Ile Thr Leu Ser Gly Gly Gln Arg Ala Arg Ile Ser Leu Ala Arg 
545                 550                 555                 560 


Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly 
                565                 570                 575     


Tyr Leu Asp Val Leu Thr Glu Lys Glu Ile Phe Glu Ser Cys Val Cys 
            580                 585                 590         


Lys Leu Met Ala Asn Lys Thr Arg Ile Leu Val Thr Ser Lys Met Glu 
        595                 600                 605             


His Leu Lys Lys Ala Asp Lys Ile Leu Ile Leu His Glu Gly Ser Ser 
    610                 615                 620                 


Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gln Asn Leu Gln Pro Asp Phe 
625                 630                 635                 640 


Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gln Phe Ser Ala Glu 
                645                 650                 655     


Arg Arg Asn Ser Ile Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu 
            660                 665                 670         


Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gln Ser Phe Lys 
        675                 680                 685             


Gln Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser Ile Leu Asn Pro 
    690                 695                 700                 


Ile Asn Ser Ile Arg Lys Phe Ser Ile Val Gln Lys Thr Pro Leu Gln 
705                 710                 715                 720 


Met Asn Gly Ile Glu Glu Asp Ser Asp Glu Pro Leu Glu Arg Arg Leu 
                725                 730                 735     


Ser Leu Val Pro Asp Ser Glu Gln Gly Glu Ala Ile Leu Pro Arg Ile 
            740                 745                 750         


Ser Val Ile Ser Thr Gly Pro Thr Leu Gln Ala Arg Arg Arg Gln Ser 
        755                 760                 765             


Val Leu Asn Leu Met Thr His Ser Val Asn Gln Gly Gln Asn Ile His 
    770                 775                 780                 


Arg Lys Thr Thr Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gln Ala 
785                 790                 795                 800 


Asn Leu Thr Glu Leu Asp Ile Tyr Ser Arg Arg Leu Ser Gln Glu Thr 
                805                 810                 815     


Gly Leu Glu Ile Ser Glu Glu Ile Asn Glu Glu Asp Leu Lys Glu Cys 
            820                 825                 830         


Phe Phe Asp Asp Met Glu Ser Ile Pro Ala Val Thr Thr Trp Asn Thr 
        835                 840                 845             


Tyr Leu Arg Tyr Ile Thr Val His Lys Ser Leu Ile Phe Val Leu Ile 
    850                 855                 860                 


Trp Cys Leu Val Ile Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val 
865                 870                 875                 880 


Leu Trp Leu Leu Gly Asn Thr Pro Leu Gln Asp Lys Gly Asn Ser Thr 
                885                 890                 895     


His Ser Arg Asn Asn Ser Tyr Ala Val Ile Ile Thr Ser Thr Ser Ser 
            900                 905                 910         


Tyr Tyr Val Phe Tyr Ile Tyr Val Gly Val Ala Asp Thr Leu Leu Ala 
        915                 920                 925             


Met Gly Phe Phe Arg Gly Leu Pro Leu Val His Thr Leu Ile Thr Val 
    930                 935                 940                 


Ser Lys Ile Leu His His Lys Met Leu His Ser Val Leu Gln Ala Pro 
945                 950                 955                 960 


Met Ser Thr Leu Asn Thr Leu Lys Ala Gly Gly Ile Leu Asn Arg Phe 
                965                 970                 975     


Ser Lys Asp Ile Ala Ile Leu Asp Asp Leu Leu Pro Leu Thr Ile Phe 
            980                 985                 990         


Asp Phe Ile Gln Leu Leu Leu Ile  Val Ile Gly Ala Ile  Ala Val Val 
        995                 1000                 1005             


Ala Val  Leu Gln Pro Tyr Ile  Phe Val Ala Thr Val  Pro Val Ile 
    1010                 1015                 1020             


Val Ala  Phe Ile Met Leu Arg  Ala Tyr Phe Leu Gln  Thr Ser Gln 
    1025                 1030                 1035             


Gln Leu  Lys Gln Leu Glu Ser  Glu Gly Arg Ser Pro  Ile Phe Thr 
    1040                 1045                 1050             


His Leu  Val Thr Ser Leu Lys  Gly Leu Trp Thr Leu  Arg Ala Phe 
    1055                 1060                 1065             


Gly Arg  Gln Pro Tyr Phe Glu  Thr Leu Phe His Lys  Ala Leu Asn 
    1070                 1075                 1080             


Leu His  Thr Ala Asn Trp Phe  Leu Tyr Leu Ser Thr  Leu Arg Trp 
    1085                 1090                 1095             


Phe Gln  Met Arg Ile Glu Met  Ile Phe Val Ile Phe  Phe Ile Ala 
    1100                 1105                 1110             


Val Thr  Phe Ile Ser Ile Leu  Thr Thr Gly Glu Gly  Glu Gly Arg 
    1115                 1120                 1125             


Val Gly  Ile Ile Leu Thr Leu  Ala Met Asn Ile Met  Ser Thr Leu 
    1130                 1135                 1140             


Gln Trp  Ala Val Asn Ser Ser  Ile Asp Val Asp Ser  Leu Met Arg 
    1145                 1150                 1155             


Ser Val  Ser Arg Val Phe Lys  Phe Ile Asp Met Pro  Thr Glu Gly 
    1160                 1165                 1170             


Lys Pro  Thr Lys Ser Thr Lys  Pro Tyr Lys Asn Gly  Gln Leu Ser 
    1175                 1180                 1185             


Lys Val  Met Ile Ile Glu Asn  Ser His Val Lys Lys  Asp Asp Ile 
    1190                 1195                 1200             


Trp Pro  Ser Gly Gly Gln Met  Thr Val Lys Asp Leu  Thr Ala Lys 
    1205                 1210                 1215             


Tyr Thr  Glu Gly Gly Asn Ala  Ile Leu Glu Asn Ile  Ser Phe Ser 
    1220                 1225                 1230             


Ile Ser  Pro Gly Gln Arg Val  Gly Leu Leu Gly Arg  Thr Gly Ser 
    1235                 1240                 1245             


Gly Lys  Ser Thr Leu Leu Ser  Ala Phe Leu Arg Leu  Leu Asn Thr 
    1250                 1255                 1260             


Glu Gly  Glu Ile Gln Ile Asp  Gly Val Ser Trp Asp  Ser Ile Thr 
    1265                 1270                 1275             


Leu Gln  Gln Trp Arg Lys Ala  Phe Gly Val Ile Pro  Gln Lys Val 
    1280                 1285                 1290             


Phe Ile  Phe Ser Gly Thr Phe  Arg Lys Asn Leu Asp  Pro Tyr Glu 
    1295                 1300                 1305             


Gln Trp  Ser Asp Gln Glu Ile  Trp Lys Val Ala Asp  Glu Val Gly 
    1310                 1315                 1320             


Leu Arg  Ser Val Ile Glu Gln  Phe Pro Gly Lys Leu  Asp Phe Val 
    1325                 1330                 1335             


Leu Val  Asp Gly Gly Cys Val  Leu Ser His Gly His  Lys Gln Leu 
    1340                 1345                 1350             


Met Cys  Leu Ala Arg Ser Val  Leu Ser Lys Ala Lys  Ile Leu Leu 
    1355                 1360                 1365             


Leu Asp  Glu Pro Ser Ala His  Leu Asp Pro Val Thr  Tyr Gln Ile 
    1370                 1375                 1380             


Ile Arg  Arg Thr Leu Lys Gln  Ala Phe Ala Asp Cys  Thr Val Ile 
    1385                 1390                 1395             


Leu Cys  Glu His Arg Ile Glu  Ala Met Leu Glu Cys  Gln Gln Phe 
    1400                 1405                 1410             


Leu Val  Ile Glu Glu Asn Lys  Val Arg Gln Tyr Asp  Ser Ile Gln 
    1415                 1420                 1425             


Lys Leu  Leu Asn Glu Arg Ser  Leu Phe Arg Gln Ala  Ile Ser Pro 
    1430                 1435                 1440             


Ser Asp  Arg Val Lys Leu Phe  Pro His Arg Asn Ser  Ser Lys Cys 
    1445                 1450                 1455             


Lys Ser  Lys Pro Gln Ile Ala  Ala Leu Lys Glu Glu  Thr Glu Glu 
    1460                 1465                 1470             


Glu Val  Gln Asp Thr Arg Leu  
    1475                 1480 


<210>  152
<211>  250
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Mouse U1a promoter

<400>  152
atggaggcgg tactatgtag atgagaattc aggagcaaac tgggaaaagc aactgcttcc       60

aaatatttgt gatttttaca gtgtagtttt ggaaaaactc ttagcctacc aattcttcta      120

agtgttttaa aatgtgggag ccagtacaca tgaagttata gagtgtttta atgaggctta      180

aatatttacc gtaactatga aatgctacgc atatcatgct gttcaggctc cgtggccacg      240

caactcatac                                                             250


<210>  153
<211>  101
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polymerase III H1 mutant promoter

<400>  153
aatatttgca tgtcgctatg tgttctggga aatcaccata aacgtgaaat gtctttggat       60

ttgggaatct tcgaagttct gtatgagacc acagatctcc a                          101


<210>  154
<211>  701
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Chicken beta-actin hybrid promoter CBh (CBh promoter consists of 
       CMV enhancer, CBA promoter, first CBA exon and partial intron)

<400>  154
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata gtaacgccaa tagggacttt ccattgacgt caatgggtgg agtatttacg      120

gtaaactgcc cacttggcag tacatcaagt gtatcatatg ccaagtacgc cccctattga      180

cgtcaatgac ggtaaatggc ccgcctggca ttgtgcccag tacatgacct tatgggactt      240

tcctacttgg cagtacatct acgtattagt catcgctatt accatggtcg aggtgagccc      300

cacgttctgc ttcactctcc ccatctcccc cccctcccca cccccaattt tgtatttatt      360

tattttttaa ttattttgtg cagcgatggg ggcggggggg gggggggggc gcgcgccagg      420

cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg agaggtgcgg cggcagccaa      480

tcagagcggc gcgctccgaa agtttccttt tatggcgagg cggcggcggc ggcggcccta      540

taaaaagcga agcgcgcggc gggcgggagt cgctgcgcgc tgccttcgcc ccgtgccccg      600

ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg accgcgttac tcccacaggt      660

gagcgggcgg gacggccctt ctcctccggg ctgtaattag c                          701


<210>  155
<211>  229
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  MeCP2 min promoter sequence

<400>  155
agctgaatgg ggtccgcctc ttttccctgc ctaaacagac aggaactcct gccaattgag       60

ggcgtcaccg ctaaggctcc gccccagcct gggctccaca accaatgaag ggtaatctcg      120

acaaagagca aggggtgggg cgcgggcgcg caggtgcagc agcacacagg ctggtcggga      180

gggcggggcg cgacgtctgc cgtgcggggt cccggcatcg gttgcgcgc                  229


<210>  156
<211>  737
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  MeCP2 promoter sequence

<400>  156
tcaaaccatc tgattcaaca atgcacgacc gatctcttat gggcttggca cacaccatct       60

gcccattata aacgtctgca aagaccaagg tttgatatgt tgattttact gtcagcctta      120

agagtgcgac atctgctaat ttagtgtaat aatacaatca gtagaccctt taaaacaagt      180

cccttggctt ggaacaacgc caggctcctc aacaggcaac tttgctactt ctacagaaaa      240

tgataataaa gaaatgctgg tgaagtcaaa tgcttatcac aatggtgaac tactcagcag      300

ggaggctcta ataggcgcca agagcctaga cttccttaag cgccagagtc cacaagggcc      360

cagttaatcc tcaacattca aatgctgccc acaaaaccag cccctctgtg ccctagccgc      420

ctcttttttc caagtgacag tagaactcca ccaatccgca gctgaatggg gtccgcctct      480

tttccctgcc taaacagaca ggaactcctg ccaattgagg gcgtcaccgc taaggctccg      540

ccccagcctg ggctccacaa ccaatgaagg gtaatctcga caaagagcaa ggggtggggc      600

gcgggcgcgc aggtgcagca gcacacaggc tggtcgggag ggcggggcgc gacgtctgcc      660

gtgcggggtc ccggcatcgg ttgcgcgcgc gctccctcct ctcggagaga gggctgtggt      720

aaaacccgtc cggaaaa                                                     737


<210>  157
<211>  418
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  MeCP418 promoter sequence

<400>  157
ataggcgcca agagcctaga cttccttaag cgccagagtc cacaagggcc cagttaatcc       60

tcaacattca aatgctgccc acaaaaccag cccctctgtg ccctagccgc ctcttttttc      120

caagtgacag tagaactcca ccaatccgca gctgaatggg gtccgcctct tttccctgcc      180

taaacagaca ggaactcctg ccaattgagg gcgtcaccgc taaggctccg ccccagcctg      240

ggctccacaa ccaatgaagg gtaatctcga caaagagcaa ggggtggggc gcgggcgcgc      300

aggtgcagca gcacacaggc tggtcgggag ggcggggcgc gacgtctgcc gtgcggggtc      360

ccggcatcgg ttgcgcgcgc gctccctcct ctcggagaga gggctgtggt aaaacccg        418


<210>  158
<211>  426
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  MeCP426 promoter sequence

<400>  158
ataggcgcca agagcctaga cttccttaag cgccagagtc cacaagggcc cagttaatcc       60

tcaacattca aatgctgccc acaaaaccag cccctctgtg ccctagccgc ctcttttttc      120

caagtgacag tagaactcca ccaatccgca gctgaatggg gtccgcctct tttccctgcc      180

taaacagaca ggaactcctg ccaattgagg gcgtcaccgc taaggctccg ccccagcctg      240

ggctccacaa ccaatgaagg gtaatctcga caaagagcaa ggggtggggc gcgggcgcgc      300

aggtgcagca gcacacaggc tggtcgggag ggcggggcgc gacgtctgcc gtgcggggtc      360

ccggcatcgg ttgcgcgcgc gctccctcct ctcggagaga gggctgtggt aaaacccgtc      420

cggaaa                                                                 426


<210>  159
<211>  400
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  VMD2 promoter

<400>  159
aattctgtca ttttactagg gtgatgaaat tcccaagcaa caccatcctt ttcagataag       60

ggcactgagg ctgagagagg agctgaaacc tacccggggt caccacacac aggtggcaag      120

gctgggacca gaaaccagga ctgttgactc tggattttag ggccatggta gagggggtgt      180

tgccctaaat tccagccctg gtctcagccc aacaccctcc aagaagaaat tagaggggcc      240

atggccaggc tgtgctagcc gttgcttctg agcagattac aagaagggac taagacaagg      300

actcctttgt ggaggtcctg gcttagggag tcaagtgacg gcggctcagc actcacgtgg      360

gcagtgccag cctctaagag tgggcagggg cactggccac                            400


<210>  160
<211>  136
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  PDE6b promoter

<400>  160
cccatttgta ggagtgagtc agctgacccg cccccggggt tcctaatctc actaagaaag       60

actttgctga tgacagggtt tcctgggagt ccatgcgtgc ctggagcagc agcgtctcca      120

gggacaggca gccacc                                                      136


<210>  161
<211>  2035
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mRho promoter

<400>  161
gcgccaatca gccgatgact tctaacaata ctcttaactc acacagagct tgtctcactg       60

agccaacacc ctgtaccctc agctcagtga cggctttcaa cctgtggggc tgcctctgtt      120

acccaagtga gagagggcca gtgctcccag aggtgacctt gtttgcccat tctctccctg      180

ggtcagccag tgtttatctg ttgtataccc agtccaccct gcaggctcac atcagagcct      240

aggagatggc tagtgtcccc gcggagacca cgatgaagct tcccagctgt ctcaagcaca      300

agctggctgc agaggctgct gaggcactgc tagctgggga tgggggcagg gtagatctgg      360

ggctgaccac cagggtcaga atcagaacct ccaccttgac ctcattaacg ctggtcttaa      420

tcaccaagcc aagctcctta aactgctagt ggccaactcc caggccctga cacacatacc      480

tgccctgtgt tcccaaacaa gacacctgca tggaaggaag ggggttgctt ttctaagcaa      540

acatctagga atcccgggtg cagtgtgagg agactaggcg agggagtact ttaagggcct      600

caaggctcag agaggaatac ttcttccctg gttagcctcg tgcctaggct ccagggtctt      660

tgtcctgcct ggatacctat gtggcaaggg gcatagcatt tcccccacca tcagctctta      720

gctcaacctt atcttctcgg aaagactgcg cagtgtaaca acacagcaga gacttttctt      780

ttgtcccctg tctacccctg taactgctac tcagaagcat ctttctcaca gggtactggc      840

ttcttgcatc cagagttttt tgtctccctc gggcccccag aatcaaattc ttcctctggg      900

actcagtgga tgtttcacac acgtatcggc ctgacagtca tcctggagca tcctacacag      960

gggccatcac agctgcatgt cagaaatgct ggcctcacat cctcagacac caggcctagt     1020

gctggtcttc ctcagactgg cgtccccagc aggccagtag gatcatcttt tagcctacag     1080

agttctgaag cctcagagcc ccaggtccct ggtcatcttc tctgcccctg agatttttcc     1140

aagttgtatg ccttctaggt aaggcaaaac ttcttacgcc cctcctcgtg gcctccaggc     1200

cccacatgct cacctgaata acctggcagc ctgctccctc atgcagggac cacgtcctgc     1260

tgcacccagc aggccatccc gtctccatag cccatggtca tccctccctg gacaggaatg     1320

tgtctcctcc ccgggctgag tcttgctcaa gctagaagca ctccgaacag ggttatgggc     1380

gcctcctcca tctcccaagt ggctggctta tgaatgttta atgtacatgt gagtgaacaa     1440

attccaattg aacgcaacaa atagttatcg agccgctgag ccggggggcg gggggtgtga     1500

gactggaggc gatggacgga gctgacggca cacacagctc agatctgtca agtgagccat     1560

tgtcagggct tggggactgg ataagtcagg gggtctcctg ggaagagatg ggataggtga     1620

gttcaggagg agacattgtc aactggagcc atgtggagaa gtgaatttag ggcccaaagg     1680

ttccagtcgc agcctgaggc caccagactg acatggggag gaattcccag aggactctgg     1740

ggcagacaag atgagacacc ctttcctttc tttacctaag ggcctccacc cgatgtcacc     1800

ttggcccctc tgcaagccaa ttaggccccg gtggcagcag tgggattagc gttagtatga     1860

tatctcgcgg atgctgaatc agcctctggc ttagggagag aaggtcactt tataagggtc     1920

tggggggggt cagtgcctgg agttgcgctg tgggagccgt cagtggctga gctcgccaag     1980

cagccttggt ctctgtctac gaagagcccg tggggcagcc tcgagagccg cagcc          2035


<210>  162
<211>  511
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CMV promoter

<400>  162
ccgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat       60

tgacgtcaat agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac      120

ggtaaactgc ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg      180

acgtcaatga cggtaaatgg cccgcctggc attgtgccca gtacatgacc ttatgggact      240

ttcctacttg gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt      300

ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc      360

ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc      420

gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata      480

taagcagagc tcgtttagtg aaccgtcaga t                                     511


<210>  163
<211>  334
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  UbC promoter

<400>  163
ggcctccgcg ccgggttttg gcgcctcccg cgggcgcccc cctcctcacg gcgagcgctg       60

ccacgtcaga cgaagggcgc agcgagcgtc ctgatccttc cgcccggacg ctcaggacag      120

cggcccgctg ctcataagac tcggccttag aaccccagta tcagcagaag gacattttag      180

gacgggactt gggtgactct agggcactgg ttttctttcc agagagcgga acaggcgagg      240

aaaagtagtc ccttctcggc gattctgcgg agggatctcc gtggggcggt gaacgccgat      300

gattatataa ggacgcgccg ggtgtggcac agct                                  334


<210>  164
<211>  342
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Cre recombinase

<400>  164

Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp 
1               5                   10                  15      


Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp 
            20                  25                  30          


Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Gln Ser Val Cys 
        35                  40                  45              


Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro 
    50                  55                  60                  


Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg 
65                  70                  75                  80  


Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met 
                85                  90                  95      


Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val 
            100                 105                 110         


Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu 
        115                 120                 125             


Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val 
    130                 135                 140                 


Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu 
145                 150                 155                 160 


Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile 
                165                 170                 175     


Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met 
            180                 185                 190         


Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val 
        195                 200                 205             


Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile 
    210                 215                 220                 


Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg 
225                 230                 235                 240 


Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser 
                245                 250                 255     


Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr 
            260                 265                 270         


Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His 
        275                 280                 285             


Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser 
    290                 295                 300                 


Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val 
305                 310                 315                 320 


Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg 
                325                 330                 335     


Leu Leu Glu Asp Gly Asp 
            340         


<210>  165
<211>  7
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NLS

<400>  165

Pro Lys Lys Lys Arg Lys Val 
1               5           


<210>  166
<211>  349
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Cre with NLS

<400>  166

Pro Lys Lys Lys Arg Lys Val Ser Asn Leu Leu Thr Val His Gln Asn 
1               5                   10                  15      


Leu Pro Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn 
            20                  25                  30          


Leu Met Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp 
        35                  40                  45              


Lys Met Leu Gln Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu 
    50                  55                  60                  


Asn Asn Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr 
65                  70                  75                  80  


Leu Leu Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln 
                85                  90                  95      


His Leu Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg 
            100                 105                 110         


Pro Ser Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys 
        115                 120                 125             


Glu Asn Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu 
    130                 135                 140                 


Arg Thr Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg 
145                 150                 155                 160 


Cys Gln Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr 
                165                 170                 175     


Leu Leu Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser 
            180                 185                 190         


Arg Thr Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr 
        195                 200                 205             


Leu Val Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr 
    210                 215                 220                 


Lys Leu Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro 
225                 230                 235                 240 


Asn Asn Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro 
                245                 250                 255     


Ser Ala Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu 
            260                 265                 270         


Ala Thr His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg 
        275                 280                 285             


Tyr Leu Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp 
    290                 295                 300                 


Met Ala Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly 
305                 310                 315                 320 


Trp Thr Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser 
                325                 330                 335     


Glu Thr Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp 
            340                 345                 


<210>  167
<211>  1050
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Cre with NLS

<400>  167
cccaagaaga agaggaaggt gtccaattta ctgaccgtac accaaaattt gcctgcactg       60

ccggtcgatg caacgagtga tgaggttcgc aagaacctga tggacatgtt cagggatcgc      120

caggcgtttt ctgagcatac ctggaaaatg cttcagtccg tttgccggtc gtgggcggca      180

tggtgcaagt tgaataaccg gaaatggttt cccgcagaac ctgaagatgt tcgcgattat      240

cttctatatc ttcaggcgcg cggtctggca gtaaaaacta tccagcaaca tttgggccag      300

ctaaacatgc ttcatcgtcg gtccgggctg ccacgaccaa gtgacagcaa tgctgtttca      360

ctggttatgc ggcggatccg aaaagaaaac gttgatgccg gtgaacgtgc aaaacaggct      420

ctagcgttcg aacgcactga tttcgaccag gttcgttcac tcatggaaaa tagcgatcgc      480

tgccaggata tacgtaatct ggcatttctg gggattgctt ataacaccct gttacgtata      540

gccgaaattg ccaggatcag ggttaaagat atctcacgta ctgacggtgg gagaatgtta      600

atccatattg gcagaacgaa aacgctggtt agcaccgcag gtgtagagaa ggcacttagc      660

ctaggggtaa ctaaactggt cgagcgatgg atttccgtct ctggtgtagc tgatgatccg      720

aataactacc tgttttgccg ggtcagaaaa aatggtgttg ccgcgccatc tgccaccagc      780

cagctatcaa ctcgcgccct ggaagggatt tttgaagcaa ctcatcgatt gatttacggc      840

gctaaggatg actctggtca gagatacctg gcctggtctg gacacagtgc ccgtgtcgga      900

gccgcgcgag atatggcccg cgctggagtt tcaataccgg agatcatgca agctggtggc      960

tggaccaatg taaatattgt catgaactat atccgtaacc tggatagtga aacaggggca     1020

atggtgcgcc tgctcgagga tggcgattaa                                      1050


<210>  168
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  LoxP wildtype

<400>  168
ataacttcgt ataatgtatg ctatacgaag ttat                                   34


<210>  169
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lox 511

<400>  169
ataacttcgt ataatgtata ctatacgaag ttat                                   34


<210>  170
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lox 5171

<400>  170
ataacttcgt ataatgtgta ctatacgaag ttat                                   34


<210>  171
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lox 2272

<400>  171
ataacttcgt ataaagtatc ctatacgaag ttat                                   34


<210>  172
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  M2

<400>  172
ataacttcgt ataagaaacc atatacgaag ttat                                   34


<210>  173
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  M3

<400>  173
ataacttcgt atataatacc atatacgaag ttat                                   34


<210>  174
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  M7

<400>  174
ataacttcgt ataagataga atatacgaag ttat                                   34


<210>  175
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  M11

<400>  175
ataacttcgt atacgatacc atatacgaag ttat                                   34


<210>  176
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lox 71


<220>
<221>  misc_feature
<222>  (14)..(16)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (19)..(21)
<223>  n is a, c, g, or t

<400>  176
taccgttcgt atannntann ntatacgaag ttat                                   34


<210>  177
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lox 66


<220>
<221>  misc_feature
<222>  (14)..(16)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (19)..(21)
<223>  n is a, c, g, or t

<400>  177
ataacttcgt atannntann ntatacgaac ggta                                   34


<210>  178
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lox 71

<400>  178
taccgttcgt ataatgtatg ctatacgaag ttat                                   34


<210>  179
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lox 66

<400>  179
ataacttcgt ataatgtatg ctatacgaac ggta                                   34


<210>  180
<211>  18
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  T2A

<400>  180

Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro 
1               5                   10                  15      


Gly Pro 
        


<210>  181
<211>  19
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  P2A

<400>  181

Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn 
1               5                   10                  15      


Pro Gly Pro 
            


<210>  182
<211>  20
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  E2A

<400>  182

Gln Cys Thr Asn Tyr Ala Leu Leu Lys Leu Ala Gly Asp Val Glu Ser 
1               5                   10                  15      


Asn Pro Gly Pro 
            20  


<210>  183
<211>  22
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  F2A

<400>  183

Val Lys Gln Thr Leu Asn Phe Asp Leu Leu Lys Leu Ala Gly Asp Val 
1               5                   10                  15      


Glu Ser Asn Pro Gly Pro 
            20          


<210>  184
<211>  22
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  P2A

<400>  184

Gly Ser Gly Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val 
1               5                   10                  15      


Glu Glu Asn Pro Gly Pro 
            20          


<210>  185
<211>  66
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  P2A

<400>  185
ggaagcggag ctactaactt cagcctgctg aagcaggctg gagacgtgga ggagaaccct       60

ggaccc                                                                  66


<210>  186
<211>  82
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Splice Donor

<400>  186
gtaagtatca aggttacaag acaggtttaa ggagaccaat agaaactggg cttgtcgaga       60

cagagaagac tcttgcgttt ct                                                82


<210>  187
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Splice Acceptor

<400>  187
gataggcacc tattggtctt actgacatcc actttgcctt tctctccaca g                51


<210>  188
<211>  295
<212>  DNA
<213>  Homo sapiens

<400>  188
gggccccaga agcctggtgg ttgtttgtcc ttctcagggg aaaagtgagg cggccccttg       60

gaggaagggg ccgggcagaa tgatctaatc ggattccaag cagctcaggg gattgtcttt      120

ttctagcacc ttcttgccac tcctaagcgt cctccgtgac cccggctggg atttagcctg      180

gtgctgtgtc agccccgggc tcccaggggc ttcccagtgg tccccaggaa ccctcgacag      240

ggccagggcg tctctctcgt ccagcaaggg cagggacggg ccacaggcca agggc           295


<210>  189
<211>  49
<212>  DNA
<213>  Homo sapiens

<400>  189
aaataaatat ctttattttc attacatctg tgtgttggtt ttttgtgtg                   49


<210>  190
<211>  2273
<212>  PRT
<213>  Homo sapiens

<400>  190

Met Gly Phe Val Arg Gln Ile Gln Leu Leu Leu Trp Lys Asn Trp Thr 
1               5                   10                  15      


Leu Arg Lys Arg Gln Lys Ile Arg Phe Val Val Glu Leu Val Trp Pro 
            20                  25                  30          


Leu Ser Leu Phe Leu Val Leu Ile Trp Leu Arg Asn Ala Asn Pro Leu 
        35                  40                  45              


Tyr Ser His His Glu Cys His Phe Pro Asn Lys Ala Met Pro Ser Ala 
    50                  55                  60                  


Gly Met Leu Pro Trp Leu Gln Gly Ile Phe Cys Asn Val Asn Asn Pro 
65                  70                  75                  80  


Cys Phe Gln Ser Pro Thr Pro Gly Glu Ser Pro Gly Ile Val Ser Asn 
                85                  90                  95      


Tyr Asn Asn Ser Ile Leu Ala Arg Val Tyr Arg Asp Phe Gln Glu Leu 
            100                 105                 110         


Leu Met Asn Ala Pro Glu Ser Gln His Leu Gly Arg Ile Trp Thr Glu 
        115                 120                 125             


Leu His Ile Leu Ser Gln Phe Met Asp Thr Leu Arg Thr His Pro Glu 
    130                 135                 140                 


Arg Ile Ala Gly Arg Gly Ile Arg Ile Arg Asp Ile Leu Lys Asp Glu 
145                 150                 155                 160 


Glu Thr Leu Thr Leu Phe Leu Ile Lys Asn Ile Gly Leu Ser Asp Ser 
                165                 170                 175     


Val Val Tyr Leu Leu Ile Asn Ser Gln Val Arg Pro Glu Gln Phe Ala 
            180                 185                 190         


His Gly Val Pro Asp Leu Ala Leu Lys Asp Ile Ala Cys Ser Glu Ala 
        195                 200                 205             


Leu Leu Glu Arg Phe Ile Ile Phe Ser Gln Arg Arg Gly Ala Lys Thr 
    210                 215                 220                 


Val Arg Tyr Ala Leu Cys Ser Leu Ser Gln Gly Thr Leu Gln Trp Ile 
225                 230                 235                 240 


Glu Asp Thr Leu Tyr Ala Asn Val Asp Phe Phe Lys Leu Phe Arg Val 
                245                 250                 255     


Leu Pro Thr Leu Leu Asp Ser Arg Ser Gln Gly Ile Asn Leu Arg Ser 
            260                 265                 270         


Trp Gly Gly Ile Leu Ser Asp Met Ser Pro Arg Ile Gln Glu Phe Ile 
        275                 280                 285             


His Arg Pro Ser Met Gln Asp Leu Leu Trp Val Thr Arg Pro Leu Met 
    290                 295                 300                 


Gln Asn Gly Gly Pro Glu Thr Phe Thr Lys Leu Met Gly Ile Leu Ser 
305                 310                 315                 320 


Asp Leu Leu Cys Gly Tyr Pro Glu Gly Gly Gly Ser Arg Val Leu Ser 
                325                 330                 335     


Phe Asn Trp Tyr Glu Asp Asn Asn Tyr Lys Ala Phe Leu Gly Ile Asp 
            340                 345                 350         


Ser Thr Arg Lys Asp Pro Ile Tyr Ser Tyr Asp Arg Arg Thr Thr Ser 
        355                 360                 365             


Phe Cys Asn Ala Leu Ile Gln Ser Leu Glu Ser Asn Pro Leu Thr Lys 
    370                 375                 380                 


Ile Ala Trp Arg Ala Ala Lys Pro Leu Leu Met Gly Lys Ile Leu Tyr 
385                 390                 395                 400 


Thr Pro Asp Ser Pro Ala Ala Arg Arg Ile Leu Lys Asn Ala Asn Ser 
                405                 410                 415     


Thr Phe Glu Glu Leu Glu His Val Arg Lys Leu Val Lys Ala Trp Glu 
            420                 425                 430         


Glu Val Gly Pro Gln Ile Trp Tyr Phe Phe Asp Asn Ser Thr Gln Met 
        435                 440                 445             


Asn Met Ile Arg Asp Thr Leu Gly Asn Pro Thr Val Lys Asp Phe Leu 
    450                 455                 460                 


Asn Arg Gln Leu Gly Glu Glu Gly Ile Thr Ala Glu Ala Ile Leu Asn 
465                 470                 475                 480 


Phe Leu Tyr Lys Gly Pro Arg Glu Ser Gln Ala Asp Asp Met Ala Asn 
                485                 490                 495     


Phe Asp Trp Arg Asp Ile Phe Asn Ile Thr Asp Arg Thr Leu Arg Leu 
            500                 505                 510         


Val Asn Gln Tyr Leu Glu Cys Leu Val Leu Asp Lys Phe Glu Ser Tyr 
        515                 520                 525             


Asn Asp Glu Thr Gln Leu Thr Gln Arg Ala Leu Ser Leu Leu Glu Glu 
    530                 535                 540                 


Asn Met Phe Trp Ala Gly Val Val Phe Pro Asp Met Tyr Pro Trp Thr 
545                 550                 555                 560 


Ser Ser Leu Pro Pro His Val Lys Tyr Lys Ile Arg Met Asp Ile Asp 
                565                 570                 575     


Val Val Glu Lys Thr Asn Lys Ile Lys Asp Arg Tyr Trp Asp Ser Gly 
            580                 585                 590         


Pro Arg Ala Asp Pro Val Glu Asp Phe Arg Tyr Ile Trp Gly Gly Phe 
        595                 600                 605             


Ala Tyr Leu Gln Asp Met Val Glu Gln Gly Ile Thr Arg Ser Gln Val 
    610                 615                 620                 


Gln Ala Glu Ala Pro Val Gly Ile Tyr Leu Gln Gln Met Pro Tyr Pro 
625                 630                 635                 640 


Cys Phe Val Asp Asp Ser Phe Met Ile Ile Leu Asn Arg Cys Phe Pro 
                645                 650                 655     


Ile Phe Met Val Leu Ala Trp Ile Tyr Ser Val Ser Met Thr Val Lys 
            660                 665                 670         


Ser Ile Val Leu Glu Lys Glu Leu Arg Leu Lys Glu Thr Leu Lys Asn 
        675                 680                 685             


Gln Gly Val Ser Asn Ala Val Ile Trp Cys Thr Trp Phe Leu Asp Ser 
    690                 695                 700                 


Phe Ser Ile Met Ser Met Ser Ile Phe Leu Leu Thr Ile Phe Ile Met 
705                 710                 715                 720 


His Gly Arg Ile Leu His Tyr Ser Asp Pro Phe Ile Leu Phe Leu Phe 
                725                 730                 735     


Leu Leu Ala Phe Ser Thr Ala Thr Ile Met Leu Cys Phe Leu Leu Ser 
            740                 745                 750         


Thr Phe Phe Ser Lys Ala Ser Leu Ala Ala Ala Cys Ser Gly Val Ile 
        755                 760                 765             


Tyr Phe Thr Leu Tyr Leu Pro His Ile Leu Cys Phe Ala Trp Gln Asp 
    770                 775                 780                 


Arg Met Thr Ala Glu Leu Lys Lys Ala Val Ser Leu Leu Ser Pro Val 
785                 790                 795                 800 


Ala Phe Gly Phe Gly Thr Glu Tyr Leu Val Arg Phe Glu Glu Gln Gly 
                805                 810                 815     


Leu Gly Leu Gln Trp Ser Asn Ile Gly Asn Ser Pro Thr Glu Gly Asp 
            820                 825                 830         


Glu Phe Ser Phe Leu Leu Ser Met Gln Met Met Leu Leu Asp Ala Ala 
        835                 840                 845             


Val Tyr Gly Leu Leu Ala Trp Tyr Leu Asp Gln Val Phe Pro Gly Asp 
    850                 855                 860                 


Tyr Gly Thr Pro Leu Pro Trp Tyr Phe Leu Leu Gln Glu Ser Tyr Trp 
865                 870                 875                 880 


Leu Gly Gly Glu Gly Cys Ser Thr Arg Glu Glu Arg Ala Leu Glu Lys 
                885                 890                 895     


Thr Glu Pro Leu Thr Glu Glu Thr Glu Asp Pro Glu His Pro Glu Gly 
            900                 905                 910         


Ile His Asp Ser Phe Phe Glu Arg Glu His Pro Gly Trp Val Pro Gly 
        915                 920                 925             


Val Cys Val Lys Asn Leu Val Lys Ile Phe Glu Pro Cys Gly Arg Pro 
    930                 935                 940                 


Ala Val Asp Arg Leu Asn Ile Thr Phe Tyr Glu Asn Gln Ile Thr Ala 
945                 950                 955                 960 


Phe Leu Gly His Asn Gly Ala Gly Lys Thr Thr Thr Leu Ser Ile Leu 
                965                 970                 975     


Thr Gly Leu Leu Pro Pro Thr Ser Gly Thr Val Leu Val Gly Gly Arg 
            980                 985                 990         


Asp Ile Glu Thr Ser Leu Asp Ala  Val Arg Gln Ser Leu  Gly Met Cys 
        995                 1000                 1005             


Pro Gln  His Asn Ile Leu Phe  His His Leu Thr Val  Ala Glu His 
    1010                 1015                 1020             


Met Leu  Phe Tyr Ala Gln Leu  Lys Gly Lys Ser Gln  Glu Glu Ala 
    1025                 1030                 1035             


Gln Leu  Glu Met Glu Ala Met  Leu Glu Asp Thr Gly  Leu His His 
    1040                 1045                 1050             


Lys Arg  Asn Glu Glu Ala Gln  Asp Leu Ser Gly Gly  Met Gln Arg 
    1055                 1060                 1065             


Lys Leu  Ser Val Ala Ile Ala  Phe Val Gly Asp Ala  Lys Val Val 
    1070                 1075                 1080             


Ile Leu  Asp Glu Pro Thr Ser  Gly Val Asp Pro Tyr  Ser Arg Arg 
    1085                 1090                 1095             


Ser Ile  Trp Asp Leu Leu Leu  Lys Tyr Arg Ser Gly  Arg Thr Ile 
    1100                 1105                 1110             


Ile Met  Ser Thr His His Met  Asp Glu Ala Asp Leu  Leu Gly Asp 
    1115                 1120                 1125             


Arg Ile  Ala Ile Ile Ala Gln  Gly Arg Leu Tyr Cys  Ser Gly Thr 
    1130                 1135                 1140             


Pro Leu  Phe Leu Lys Asn Cys  Phe Gly Thr Gly Leu  Tyr Leu Thr 
    1145                 1150                 1155             


Leu Val  Arg Lys Met Lys Asn  Ile Gln Ser Gln Arg  Lys Gly Ser 
    1160                 1165                 1170             


Glu Gly  Thr Cys Ser Cys Ser  Ser Lys Gly Phe Ser  Thr Thr Cys 
    1175                 1180                 1185             


Pro Ala  His Val Asp Asp Leu  Thr Pro Glu Gln Val  Leu Asp Gly 
    1190                 1195                 1200             


Asp Val  Asn Glu Leu Met Asp  Val Val Leu His His  Val Pro Glu 
    1205                 1210                 1215             


Ala Lys  Leu Val Glu Cys Ile  Gly Gln Glu Leu Ile  Phe Leu Leu 
    1220                 1225                 1230             


Pro Asn  Lys Asn Phe Lys His  Arg Ala Tyr Ala Ser  Leu Phe Arg 
    1235                 1240                 1245             


Glu Leu  Glu Glu Thr Leu Ala  Asp Leu Gly Leu Ser  Ser Phe Gly 
    1250                 1255                 1260             


Ile Ser  Asp Thr Pro Leu Glu  Glu Ile Phe Leu Lys  Val Thr Glu 
    1265                 1270                 1275             


Asp Ser  Asp Ser Gly Pro Leu  Phe Ala Gly Gly Ala  Gln Gln Lys 
    1280                 1285                 1290             


Arg Glu  Asn Val Asn Pro Arg  His Pro Cys Leu Gly  Pro Arg Glu 
    1295                 1300                 1305             


Lys Ala  Gly Gln Thr Pro Gln  Asp Ser Asn Val Cys  Ser Pro Gly 
    1310                 1315                 1320             


Ala Pro  Ala Ala His Pro Glu  Gly Gln Pro Pro Pro  Glu Pro Glu 
    1325                 1330                 1335             


Cys Pro  Gly Pro Gln Leu Asn  Thr Gly Thr Gln Leu  Val Leu Gln 
    1340                 1345                 1350             


His Val  Gln Ala Leu Leu Val  Lys Arg Phe Gln His  Thr Ile Arg 
    1355                 1360                 1365             


Ser His  Lys Asp Phe Leu Ala  Gln Ile Val Leu Pro  Ala Thr Phe 
    1370                 1375                 1380             


Val Phe  Leu Ala Leu Met Leu  Ser Ile Val Ile Pro  Pro Phe Gly 
    1385                 1390                 1395             


Glu Tyr  Pro Ala Leu Thr Leu  His Pro Trp Ile Tyr  Gly Gln Gln 
    1400                 1405                 1410             


Tyr Thr  Phe Phe Ser Met Asp  Glu Pro Gly Ser Glu  Gln Phe Thr 
    1415                 1420                 1425             


Val Leu  Ala Asp Val Leu Leu  Asn Lys Pro Gly Phe  Gly Asn Arg 
    1430                 1435                 1440             


Cys Leu  Lys Glu Gly Trp Leu  Pro Glu Tyr Pro Cys  Gly Asn Ser 
    1445                 1450                 1455             


Thr Pro  Trp Lys Thr Pro Ser  Val Ser Pro Asn Ile  Thr Gln Leu 
    1460                 1465                 1470             


Phe Gln  Lys Gln Lys Trp Thr  Gln Val Asn Pro Ser  Pro Ser Cys 
    1475                 1480                 1485             


Arg Cys  Ser Thr Arg Glu Lys  Leu Thr Met Leu Pro  Glu Cys Pro 
    1490                 1495                 1500             


Glu Gly  Ala Gly Gly Leu Pro  Pro Pro Gln Arg Thr  Gln Arg Ser 
    1505                 1510                 1515             


Thr Glu  Ile Leu Gln Asp Leu  Thr Asp Arg Asn Ile  Ser Asp Phe 
    1520                 1525                 1530             


Leu Val  Lys Thr Tyr Pro Ala  Leu Ile Arg Ser Ser  Leu Lys Ser 
    1535                 1540                 1545             


Lys Phe  Trp Val Asn Glu Gln  Arg Tyr Gly Gly Ile  Ser Ile Gly 
    1550                 1555                 1560             


Gly Lys  Leu Pro Val Val Pro  Ile Thr Gly Glu Ala  Leu Val Gly 
    1565                 1570                 1575             


Phe Leu  Ser Asp Leu Gly Arg  Ile Met Asn Val Ser  Gly Gly Pro 
    1580                 1585                 1590             


Ile Thr  Arg Glu Ala Ser Lys  Glu Ile Pro Asp Phe  Leu Lys His 
    1595                 1600                 1605             


Leu Glu  Thr Glu Asp Asn Ile  Lys Val Trp Phe Asn  Asn Lys Gly 
    1610                 1615                 1620             


Trp His  Ala Leu Val Ser Phe  Leu Asn Val Ala His  Asn Ala Ile 
    1625                 1630                 1635             


Leu Arg  Ala Ser Leu Pro Lys  Asp Arg Ser Pro Glu  Glu Tyr Gly 
    1640                 1645                 1650             


Ile Thr  Val Ile Ser Gln Pro  Leu Asn Leu Thr Lys  Glu Gln Leu 
    1655                 1660                 1665             


Ser Glu  Ile Thr Val Leu Thr  Thr Ser Val Asp Ala  Val Val Ala 
    1670                 1675                 1680             


Ile Cys  Val Ile Phe Ser Met  Ser Phe Val Pro Ala  Ser Phe Val 
    1685                 1690                 1695             


Leu Tyr  Leu Ile Gln Glu Arg  Val Asn Lys Ser Lys  His Leu Gln 
    1700                 1705                 1710             


Phe Ile  Ser Gly Val Ser Pro  Thr Thr Tyr Trp Val  Thr Asn Phe 
    1715                 1720                 1725             


Leu Trp  Asp Ile Met Asn Tyr  Ser Val Ser Ala Gly  Leu Val Val 
    1730                 1735                 1740             


Gly Ile  Phe Ile Gly Phe Gln  Lys Lys Ala Tyr Thr  Ser Pro Glu 
    1745                 1750                 1755             


Asn Leu  Pro Ala Leu Val Ala  Leu Leu Leu Leu Tyr  Gly Trp Ala 
    1760                 1765                 1770             


Val Ile  Pro Met Met Tyr Pro  Ala Ser Phe Leu Phe  Asp Val Pro 
    1775                 1780                 1785             


Ser Thr  Ala Tyr Val Ala Leu  Ser Cys Ala Asn Leu  Phe Ile Gly 
    1790                 1795                 1800             


Ile Asn  Ser Ser Ala Ile Thr  Phe Ile Leu Glu Leu  Phe Glu Asn 
    1805                 1810                 1815             


Asn Arg  Thr Leu Leu Arg Phe  Asn Ala Val Leu Arg  Lys Leu Leu 
    1820                 1825                 1830             


Ile Val  Phe Pro His Phe Cys  Leu Gly Arg Gly Leu  Ile Asp Leu 
    1835                 1840                 1845             


Ala Leu  Ser Gln Ala Val Thr  Asp Val Tyr Ala Arg  Phe Gly Glu 
    1850                 1855                 1860             


Glu His  Ser Ala Asn Pro Phe  His Trp Asp Leu Ile  Gly Lys Asn 
    1865                 1870                 1875             


Leu Phe  Ala Met Val Val Glu  Gly Val Val Tyr Phe  Leu Leu Thr 
    1880                 1885                 1890             


Leu Leu  Val Gln Arg His Phe  Phe Leu Ser Gln Trp  Ile Ala Glu 
    1895                 1900                 1905             


Pro Thr  Lys Glu Pro Ile Val  Asp Glu Asp Asp Asp  Val Ala Glu 
    1910                 1915                 1920             


Glu Arg  Gln Arg Ile Ile Thr  Gly Gly Asn Lys Thr  Asp Ile Leu 
    1925                 1930                 1935             


Arg Leu  His Glu Leu Thr Lys  Ile Tyr Pro Gly Thr  Ser Ser Pro 
    1940                 1945                 1950             


Ala Val  Asp Arg Leu Cys Val  Gly Val Arg Pro Gly  Glu Cys Phe 
    1955                 1960                 1965             


Gly Leu  Leu Gly Val Asn Gly  Ala Gly Lys Thr Thr  Thr Phe Lys 
    1970                 1975                 1980             


Met Leu  Thr Gly Asp Thr Thr  Val Thr Ser Gly Asp  Ala Thr Val 
    1985                 1990                 1995             


Ala Gly  Lys Ser Ile Leu Thr  Asn Ile Ser Glu Val  His Gln Asn 
    2000                 2005                 2010             


Met Gly  Tyr Cys Pro Gln Phe  Asp Ala Ile Asp Glu  Leu Leu Thr 
    2015                 2020                 2025             


Gly Arg  Glu His Leu Tyr Leu  Tyr Ala Arg Leu Arg  Gly Val Pro 
    2030                 2035                 2040             


Ala Glu  Glu Ile Glu Lys Val  Ala Asn Trp Ser Ile  Lys Ser Leu 
    2045                 2050                 2055             


Gly Leu  Thr Val Tyr Ala Asp  Cys Leu Ala Gly Thr  Tyr Ser Gly 
    2060                 2065                 2070             


Gly Asn  Lys Arg Lys Leu Ser  Thr Ala Ile Ala Leu  Ile Gly Cys 
    2075                 2080                 2085             


Pro Pro  Leu Val Leu Leu Asp  Glu Pro Thr Thr Gly  Met Asp Pro 
    2090                 2095                 2100             


Gln Ala  Arg Arg Met Leu Trp  Asn Val Ile Val Ser  Ile Ile Arg 
    2105                 2110                 2115             


Glu Gly  Arg Ala Val Val Leu  Thr Ser His Ser Met  Glu Glu Cys 
    2120                 2125                 2130             


Glu Ala  Leu Cys Thr Arg Leu  Ala Ile Met Val Lys  Gly Ala Phe 
    2135                 2140                 2145             


Arg Cys  Met Gly Thr Ile Gln  His Leu Lys Ser Lys  Phe Gly Asp 
    2150                 2155                 2160             


Gly Tyr  Ile Val Thr Met Lys  Ile Lys Ser Pro Lys  Asp Asp Leu 
    2165                 2170                 2175             


Leu Pro  Asp Leu Asn Pro Val  Glu Gln Phe Phe Gln  Gly Asn Phe 
    2180                 2185                 2190             


Pro Gly  Ser Val Gln Arg Glu  Arg His Tyr Asn Met  Leu Gln Phe 
    2195                 2200                 2205             


Gln Val  Ser Ser Ser Ser Leu  Ala Arg Ile Phe Gln  Leu Leu Leu 
    2210                 2215                 2220             


Ser His  Lys Asp Ser Leu Leu  Ile Glu Glu Tyr Ser  Val Thr Gln 
    2225                 2230                 2235             


Thr Thr  Leu Asp Gln Val Phe  Val Asn Phe Ala Lys  Gln Gln Thr 
    2240                 2245                 2250             


Glu Ser  His Asp Leu Pro Leu  His Pro Arg Ala Ala  Gly Ala Ser 
    2255                 2260                 2265             


Arg Gln  Ala Gln Asp 
    2270             


<210>  191
<211>  6822
<212>  DNA
<213>  Homo sapiens

<400>  191
atgggcttcg tgagacagat acagcttttg ctctggaaga actggaccct gcggaaaagg       60

caaaagattc gctttgtggt ggaactcgtg tggcctttat ctttatttct ggtcttgatc      120

tggttaagga atgccaaccc actctacagc catcatgaat gccatttccc caacaaggcg      180

atgccctcag caggaatgct gccgtggctc caggggatct tctgcaatgt gaacaatccc      240

tgttttcaaa gccccacccc aggagaatct cctggaattg tgtcaaacta taacaactcc      300

atcttggcaa gggtatatcg agattttcaa gaactcctca tgaatgcacc agagagccag      360

caccttggcc gtatttggac agagctacac atcttgtccc aattcatgga caccctccgg      420

actcacccgg agagaattgc aggaagagga atacgaataa gggatatctt gaaagatgaa      480

gaaacactga cactatttct cattaaaaac atcggcctgt ctgactcagt ggtctacctt      540

ctgatcaact ctcaagtccg tccagagcag ttcgctcatg gagtcccgga cctggcgctg      600

aaggacatcg cctgcagcga ggccctcctg gagcgcttca tcatcttcag ccagagacgc      660

ggggcaaaga cggtgcgcta tgccctgtgc tccctctccc agggcaccct acagtggata      720

gaagacactc tgtatgccaa cgtggacttc ttcaagctct tccgtgtgct tcccacactc      780

ctagacagcc gttctcaagg tatcaatctg agatcttggg gaggaatatt atctgatatg      840

tcaccaagaa ttcaagagtt tatccatcgg ccgagtatgc aggacttgct gtgggtgacc      900

aggcccctca tgcagaatgg tggtccagag acctttacaa agctgatggg catcctgtct      960

gacctcctgt gtggctaccc cgagggaggt ggctctcggg tgctctcctt caactggtat     1020

gaagacaata actataaggc ctttctgggg attgactcca caaggaagga tcctatctat     1080

tcttatgaca gaagaacaac atccttttgt aatgcattga tccagagcct ggagtcaaat     1140

cctttaacca aaatcgcttg gagggcggca aagcctttgc tgatgggaaa aatcctgtac     1200

actcctgatt cacctgcagc acgaaggata ctgaagaatg ccaactcaac ttttgaagaa     1260

ctggaacacg ttaggaagtt ggtcaaagcc tgggaagaag tagggcccca gatctggtac     1320

ttctttgaca acagcacaca gatgaacatg atcagagata ccctggggaa cccaacagta     1380

aaagactttt tgaataggca gcttggtgaa gaaggtatta ctgctgaagc catcctaaac     1440

ttcctctaca agggccctcg ggaaagccag gctgacgaca tggccaactt cgactggagg     1500

gacatattta acatcactga tcgcaccctc cgcctggtca atcaatacct ggagtgcttg     1560

gtcctggata agtttgaaag ctacaatgat gaaactcagc tcacccaacg tgccctctct     1620

ctactggagg aaaacatgtt ctgggccgga gtggtattcc ctgacatgta tccctggacc     1680

agctctctac caccccacgt gaagtataag atccgaatgg acatagacgt ggtggagaaa     1740

accaataaga ttaaagacag gtattgggat tctggtccca gagctgatcc cgtggaagat     1800

ttccggtaca tctggggcgg gtttgcctat ctgcaggaca tggttgaaca ggggatcaca     1860

aggagccagg tgcaggcgga ggctccagtt ggaatctacc tccagcagat gccctacccc     1920

tgcttcgtgg acgattcttt catgatcatc ctgaaccgct gtttccctat cttcatggtg     1980

ctggcatgga tctactctgt ctccatgact gtgaagagca tcgtcttgga gaaggagttg     2040

cgactgaagg agaccttgaa aaatcagggt gtctccaatg cagtgatttg gtgtacctgg     2100

ttcctggaca gcttctccat catgtcgatg agcatcttcc tcctgacgat attcatcatg     2160

catggaagaa tcctacatta cagcgaccca ttcatcctct tcctgttctt gttggctttc     2220

tccactgcca ccatcatgct gtgctttctg ctcagcacct tcttctccaa ggccagtctg     2280

gcagcagcct gtagtggtgt catctatttc accctctacc tgccacacat cctgtgcttc     2340

gcctggcagg accgcatgac cgctgagctg aagaaggctg tgagcttact gtctccggtg     2400

gcatttggat ttggcactga gtacctggtt cgctttgaag agcaaggcct ggggctgcag     2460

tggagcaaca tcgggaacag tcccacggaa ggggacgaat tcagcttcct gctgtccatg     2520

cagatgatgc tccttgatgc tgctgtctat ggcttactcg cttggtacct tgatcaggtg     2580

tttccaggag actatggaac cccacttcct tggtactttc ttctacaaga gtcgtattgg     2640

cttggcggtg aaggttgttc aaccagagaa gaaagagccc tggaaaagac cgagccccta     2700

acagaggaaa cggaggatcc agagcaccca gaaggaatac acgactcctt ctttgaacgt     2760

gagcatccag ggtgggttcc tggggtatgc gtgaagaatc tggtaaagat ttttgagccc     2820

tgtggccggc cagctgtgga ccgtctgaac atcaccttct acgagaacca gatcaccgca     2880

ttcctgggcc acaatggagc tgggaaaacc accaccttgt ccatcctgac gggtctgttg     2940

ccaccaacct ctgggactgt gctcgttggg ggaagggaca ttgaaaccag cctggatgca     3000

gtccggcaga gccttggcat gtgtccacag cacaacatcc tgttccacca cctcacggtg     3060

gctgagcaca tgctgttcta tgcccagctg aaaggaaagt cccaggagga ggcccagctg     3120

gagatggaag ccatgttgga ggacacaggc ctccaccaca agcggaatga agaggctcag     3180

gacctatcag gtggcatgca gagaaagctg tcggttgcca ttgcctttgt gggagatgcc     3240

aaggtggtga ttctggacga acccacctct ggggtggacc cttactcgag acgctcaatc     3300

tgggatctgc tcctgaagta tcgctcaggc agaaccatca tcatgtccac tcaccacatg     3360

gacgaggccg acctccttgg ggaccgcatt gccatcattg cccagggaag gctctactgc     3420

tcaggcaccc cactcttcct gaagaactgc tttggcacag gcttgtactt aaccttggtg     3480

cgcaagatga aaaacatcca gagccaaagg aaaggcagtg aggggacctg cagctgctcg     3540

tctaagggtt tctccaccac gtgtccagcc cacgtcgatg acctaactcc agaacaagtc     3600

ctggatgggg atgtaaatga gctgatggat gtagttctcc accatgttcc agaggcaaag     3660

ctggtggagt gcattggtca agaacttatc ttccttcttc caaataagaa cttcaagcac     3720

agagcatatg ccagcctttt cagagagctg gaggagacgc tggctgacct tggtctcagc     3780

agttttggaa tttctgacac tcccctggaa gagatttttc tgaaggtcac ggaggattct     3840

gattcaggac ctctgtttgc gggtggcgct cagcagaaaa gagaaaacgt caacccccga     3900

cacccctgct tgggtcccag agagaaggct ggacagacac cccaggactc caatgtctgc     3960

tccccagggg cgccggctgc tcacccagag ggccagcctc ccccagagcc agagtgccca     4020

ggcccgcagc tcaacacggg gacacagctg gtcctccagc atgtgcaggc gctgctggtc     4080

aagagattcc aacacaccat ccgcagccac aaggacttcc tggcgcagat cgtgctcccg     4140

gctacctttg tgtttttggc tctgatgctt tctattgtta tccctccttt tggcgaatac     4200

cccgctttga cccttcaccc ctggatatat gggcagcagt acaccttctt cagcatggat     4260

gaaccaggca gtgagcagtt cacggtactt gcagacgtcc tcctgaataa gccaggcttt     4320

ggcaaccgct gcctgaagga agggtggctt ccggagtacc cctgtggcaa ctcaacaccc     4380

tggaagactc cttctgtgtc cccaaacatc acccagctgt tccagaagca gaaatggaca     4440

caggtcaacc cttcaccatc ctgcaggtgc agcaccaggg agaagctcac catgctgcca     4500

gagtgccccg agggtgccgg gggcctcccg cccccccaga gaacacagcg cagcacggaa     4560

attctacaag acctgacgga caggaacatc tccgacttct tggtaaaaac gtatcctgct     4620

cttataagaa gcagcttaaa gagcaaattc tgggtcaatg aacagaggta tggaggaatt     4680

tccattggag gaaagctccc agtcgtcccc atcacggggg aagcacttgt tgggttttta     4740

agcgaccttg gccggatcat gaatgtgagc gggggcccta tcactagaga ggcctctaaa     4800

gaaatacctg atttccttaa acatctagaa actgaagaca acattaaggt gtggtttaat     4860

aacaaaggct ggcatgccct ggtcagcttt ctcaatgtgg cccacaacgc catcttacgg     4920

gccagcctgc ctaaggacag gagccccgag gagtatggaa tcaccgtcat tagccaaccc     4980

ctgaacctga ccaaggagca gctctcagag attacagtgc tgaccacttc agtggatgct     5040

gtggttgcca tctgcgtgat tttctccatg tccttcgtcc cagccagctt tgtcctttat     5100

ttgatccagg agcgggtgaa caaatccaag cacctccagt ttatcagtgg agtgagcccc     5160

accacctact gggtgaccaa cttcctctgg gacatcatga attattccgt gagtgctggg     5220

ctggtggtgg gcatcttcat cgggtttcag aagaaagcct acacttctcc agaaaacctt     5280

cctgcccttg tggcactgct cctgctgtat ggatgggcgg tcattcccat gatgtaccca     5340

gcatccttcc tgtttgatgt ccccagcaca gcctatgtgg ctttatcttg tgctaatctg     5400

ttcatcggca tcaacagcag tgctattacc ttcatcttgg aattatttga gaataaccgg     5460

acgctgctca ggttcaacgc cgtgctgagg aagctgctca ttgtcttccc ccacttctgc     5520

ctgggccggg gcctcattga ccttgcactg agccaggctg tgacagatgt ctatgcccgg     5580

tttggtgagg agcactctgc aaatccgttc cactgggacc tgattgggaa gaacctgttt     5640

gccatggtgg tggaaggggt ggtgtacttc ctcctgaccc tgctggtcca gcgccacttc     5700

ttcctctccc aatggattgc cgagcccact aaggagccca ttgttgatga agatgatgat     5760

gtggctgaag aaagacaaag aattattact ggtggaaata aaactgacat cttaaggcta     5820

catgaactaa ccaagattta tccaggcacc tccagcccag cagtggacag gctgtgtgtc     5880

ggagttcgcc ctggagagtg ctttggcctc ctgggagtga atggtgccgg caaaacaacc     5940

acattcaaga tgctcactgg ggacaccaca gtgacctcag gggatgccac cgtagcaggc     6000

aagagtattt taaccaatat ttctgaagtc catcaaaata tgggctactg tcctcagttt     6060

gatgcaattg atgagctgct cacaggacga gaacatcttt acctttatgc ccggcttcga     6120

ggtgtaccag cagaagaaat cgaaaaggtt gcaaactgga gtattaagag cctgggcctg     6180

actgtctacg ccgactgcct ggctggcacg tacagtgggg gcaacaagcg gaaactctcc     6240

acagccatcg cactcattgg ctgcccaccg ctggtgctgc tggatgagcc caccacaggg     6300

atggaccccc aggcacgccg catgctgtgg aacgtcatcg tgagcatcat cagagaaggg     6360

agggctgtgg tcctcacatc ccacagcatg gaagaatgtg aggcactgtg tacccggctg     6420

gccatcatgg taaagggcgc ctttcgatgt atgggcacca ttcagcatct caagtccaaa     6480

tttggagatg gctatatcgt cacaatgaag atcaaatccc cgaaggacga cctgcttcct     6540

gacctgaacc ctgtggagca gttcttccag gggaacttcc caggcagtgt gcagagggag     6600

aggcactaca acatgctcca gttccaggtc tcctcctcct ccctggcgag gatcttccag     6660

ctcctcctct cccacaagga cagcctgctc atcgaggagt actcagtcac acagaccaca     6720

ctggaccagg tgtttgtaaa ttttgctaaa cagcagactg aaagccatga cctccctctg     6780

caccctcgag ctgctggagc cagtcgacaa gcccaggact aa                        6822


<210>  192
<211>  2653
<212>  DNA
<213>  Homo sapiens

<400>  192
atgggcttcg tgagacagat acagcttttg ctctggaaga actggaccct gcggaaaagg       60

caaaagattc gctttgtggt ggaactcgtg tggcctttat ctttatttct ggtcttgatc      120

tggttaagga atgccaaccc actctacagc catcatgaat gccatttccc caacaaggcg      180

atgccctcag caggaatgct gccgtggctc caggggatct tctgcaatgt gaacaatccc      240

tgttttcaaa gccccacccc aggagaatct cctggaattg tgtcaaacta taacaactcc      300

atcttggcaa gggtatatcg agattttcaa gaactcctca tgaatgcacc agagagccag      360

caccttggcc gtatttggac agagctacac atcttgtccc aattcatgga caccctccgg      420

actcacccgg agagaattgc aggaagagga atacgaataa gggatatctt gaaagatgaa      480

gaaacactga cactatttct cattaaaaac atcggcctgt ctgactcagt ggtctacctt      540

ctgatcaact ctcaagtccg tccagagcag ttcgctcatg gagtcccgga cctggcgctg      600

aaggacatcg cctgcagcga ggccctcctg gagcgcttca tcatcttcag ccagagacgc      660

ggggcaaaga cggtgcgcta tgccctgtgc tccctctccc agggcaccct acagtggata      720

gaagacactc tgtatgccaa cgtggacttc ttcaagctct tccgtgtgct tcccacactc      780

ctagacagcc gttctcaagg tatcaatctg agatcttggg gaggaatatt atctgatatg      840

tcaccaagaa ttcaagagtt tatccatcgg ccgagtatgc aggacttgct gtgggtgacc      900

aggcccctca tgcagaatgg tggtccagag acctttacaa agctgatggg catcctgtct      960

gacctcctgt gtggctaccc cgagggaggt ggctctcggg tgctctcctt caactggtat     1020

gaagacaata actataaggc ctttctgggg attgactcca caaggaagga tcctatctat     1080

tcttatgaca gaagaacaac atccttttgt aatgcattga tccagagcct ggagtcaaat     1140

cctttaacca aaatcgcttg gagggcggca aagcctttgc tgatgggaaa aatcctgtac     1200

actcctgatt cacctgcagc acgaaggata ctgaagaatg ccaactcaac ttttgaagaa     1260

ctggaacacg ttaggaagtt ggtcaaagcc tgggaagaag tagggcccca gatctggtac     1320

ttctttgaca acagcacaca gatgaacatg atcagagata ccctggggaa cccaacagta     1380

aaagactttt tgaataggca gcttggtgaa gaaggtatta ctgctgaagc catcctaaac     1440

ttcctctaca agggccctcg ggaaagccag gctgacgaca tggccaactt cgactggagg     1500

gacatattta acatcactga tcgcaccctc cgcctggtca atcaatacct ggagtgcttg     1560

gtcctggata agtttgaaag ctacaatgat gaaactcagc tcacccaacg tgccctctct     1620

ctactggagg aaaacatgtt ctgggccgga gtggtattcc ctgacatgta tccctggacc     1680

agctctctac caccccacgt gaagtataag atccgaatgg acatagacgt ggtggagaaa     1740

accaataaga ttaaagacag gtattgggat tctggtccca gagctgatcc cgtggaagat     1800

ttccggtaca tctggggcgg gtttgcctat ctgcaggaca tggttgaaca ggggatcaca     1860

aggagccagg tgcaggcgga ggctccagtt ggaatctacc tccagcagat gccctacccc     1920

tgcttcgtgg acgattcttt catgatcatc ctgaaccgct gtttccctat cttcatggtg     1980

ctggcatgga tctactctgt ctccatgact gtgaagagca tcgtcttgga gaaggagttg     2040

cgactgaagg agaccttgaa aaatcagggt gtctccaatg cagtgatttg gtgtacctgg     2100

ttcctggaca gcttctccat catgtcgatg agcatcttcc tcctgacgat attcatcatg     2160

catggaagaa tcctacatta cagcgaccca ttcatcctct tcctgttctt gttggctttc     2220

tccactgcca ccatcatgct gtgctttctg ctcagcacct tcttctccaa ggccagtctg     2280

gcagcagcct gtagtggtgt catctatttc accctctacc tgccacacat cctgtgcttc     2340

gcctggcagg accgcatgac cgctgagctg aagaaggctg tgagcttact gtctccggtg     2400

gcatttggat ttggcactga gtacctggtt cgctttgaag agcaaggcct ggggctgcag     2460

tggagcaaca tcgggaacag tcccacggaa ggggacgaat tcagcttcct gctgtccatg     2520

cagatgatgc tccttgatgc tgctgtctat ggcttactcg cttggtacct tgatcaggtg     2580

tttccaggag actatggaac cccacttcct tggtactttc ttctacaaga gtcgtattgg     2640

cttggcggtg aag                                                        2653


<210>  193
<211>  4169
<212>  DNA
<213>  Homo sapiens

<400>  193
gttgttcaac cagagaagaa agagccctgg aaaagaccga gcccctaaca gaggaaacgg       60

aggatccaga gcacccagaa ggaatacacg actccttctt tgaacgtgag catccagggt      120

gggttcctgg ggtatgcgtg aagaatctgg taaagatttt tgagccctgt ggccggccag      180

ctgtggaccg tctgaacatc accttctacg agaaccagat caccgcattc ctgggccaca      240

atggagctgg gaaaaccacc accttgtcca tcctgacggg tctgttgcca ccaacctctg      300

ggactgtgct cgttggggga agggacattg aaaccagcct ggatgcagtc cggcagagcc      360

ttggcatgtg tccacagcac aacatcctgt tccaccacct cacggtggct gagcacatgc      420

tgttctatgc ccagctgaaa ggaaagtccc aggaggaggc ccagctggag atggaagcca      480

tgttggagga cacaggcctc caccacaagc ggaatgaaga ggctcaggac ctatcaggtg      540

gcatgcagag aaagctgtcg gttgccattg cctttgtggg agatgccaag gtggtgattc      600

tggacgaacc cacctctggg gtggaccctt actcgagacg ctcaatctgg gatctgctcc      660

tgaagtatcg ctcaggcaga accatcatca tgtccactca ccacatggac gaggccgacc      720

tccttgggga ccgcattgcc atcattgccc agggaaggct ctactgctca ggcaccccac      780

tcttcctgaa gaactgcttt ggcacaggct tgtacttaac cttggtgcgc aagatgaaaa      840

acatccagag ccaaaggaaa ggcagtgagg ggacctgcag ctgctcgtct aagggtttct      900

ccaccacgtg tccagcccac gtcgatgacc taactccaga acaagtcctg gatggggatg      960

taaatgagct gatggatgta gttctccacc atgttccaga ggcaaagctg gtggagtgca     1020

ttggtcaaga acttatcttc cttcttccaa ataagaactt caagcacaga gcatatgcca     1080

gccttttcag agagctggag gagacgctgg ctgaccttgg tctcagcagt tttggaattt     1140

ctgacactcc cctggaagag atttttctga aggtcacgga ggattctgat tcaggacctc     1200

tgtttgcggg tggcgctcag cagaaaagag aaaacgtcaa cccccgacac ccctgcttgg     1260

gtcccagaga gaaggctgga cagacacccc aggactccaa tgtctgctcc ccaggggcgc     1320

cggctgctca cccagagggc cagcctcccc cagagccaga gtgcccaggc ccgcagctca     1380

acacggggac acagctggtc ctccagcatg tgcaggcgct gctggtcaag agattccaac     1440

acaccatccg cagccacaag gacttcctgg cgcagatcgt gctcccggct acctttgtgt     1500

ttttggctct gatgctttct attgttatcc ctccttttgg cgaatacccc gctttgaccc     1560

ttcacccctg gatatatggg cagcagtaca ccttcttcag catggatgaa ccaggcagtg     1620

agcagttcac ggtacttgca gacgtcctcc tgaataagcc aggctttggc aaccgctgcc     1680

tgaaggaagg gtggcttccg gagtacccct gtggcaactc aacaccctgg aagactcctt     1740

ctgtgtcccc aaacatcacc cagctgttcc agaagcagaa atggacacag gtcaaccctt     1800

caccatcctg caggtgcagc accagggaga agctcaccat gctgccagag tgccccgagg     1860

gtgccggggg cctcccgccc ccccagagaa cacagcgcag cacggaaatt ctacaagacc     1920

tgacggacag gaacatctcc gacttcttgg taaaaacgta tcctgctctt ataagaagca     1980

gcttaaagag caaattctgg gtcaatgaac agaggtatgg aggaatttcc attggaggaa     2040

agctcccagt cgtccccatc acgggggaag cacttgttgg gtttttaagc gaccttggcc     2100

ggatcatgaa tgtgagcggg ggccctatca ctagagaggc ctctaaagaa atacctgatt     2160

tccttaaaca tctagaaact gaagacaaca ttaaggtgtg gtttaataac aaaggctggc     2220

atgccctggt cagctttctc aatgtggccc acaacgccat cttacgggcc agcctgccta     2280

aggacaggag ccccgaggag tatggaatca ccgtcattag ccaacccctg aacctgacca     2340

aggagcagct ctcagagatt acagtgctga ccacttcagt ggatgctgtg gttgccatct     2400

gcgtgatttt ctccatgtcc ttcgtcccag ccagctttgt cctttatttg atccaggagc     2460

gggtgaacaa atccaagcac ctccagttta tcagtggagt gagccccacc acctactggg     2520

tgaccaactt cctctgggac atcatgaatt attccgtgag tgctgggctg gtggtgggca     2580

tcttcatcgg gtttcagaag aaagcctaca cttctccaga aaaccttcct gcccttgtgg     2640

cactgctcct gctgtatgga tgggcggtca ttcccatgat gtacccagca tccttcctgt     2700

ttgatgtccc cagcacagcc tatgtggctt tatcttgtgc taatctgttc atcggcatca     2760

acagcagtgc tattaccttc atcttggaat tatttgagaa taaccggacg ctgctcaggt     2820

tcaacgccgt gctgaggaag ctgctcattg tcttccccca cttctgcctg ggccggggcc     2880

tcattgacct tgcactgagc caggctgtga cagatgtcta tgcccggttt ggtgaggagc     2940

actctgcaaa tccgttccac tgggacctga ttgggaagaa cctgtttgcc atggtggtgg     3000

aaggggtggt gtacttcctc ctgaccctgc tggtccagcg ccacttcttc ctctcccaat     3060

ggattgccga gcccactaag gagcccattg ttgatgaaga tgatgatgtg gctgaagaaa     3120

gacaaagaat tattactggt ggaaataaaa ctgacatctt aaggctacat gaactaacca     3180

agatttatcc aggcacctcc agcccagcag tggacaggct gtgtgtcgga gttcgccctg     3240

gagagtgctt tggcctcctg ggagtgaatg gtgccggcaa aacaaccaca ttcaagatgc     3300

tcactgggga caccacagtg acctcagggg atgccaccgt agcaggcaag agtattttaa     3360

ccaatatttc tgaagtccat caaaatatgg gctactgtcc tcagtttgat gcaattgatg     3420

agctgctcac aggacgagaa catctttacc tttatgcccg gcttcgaggt gtaccagcag     3480

aagaaatcga aaaggttgca aactggagta ttaagagcct gggcctgact gtctacgccg     3540

actgcctggc tggcacgtac agtgggggca acaagcggaa actctccaca gccatcgcac     3600

tcattggctg cccaccgctg gtgctgctgg atgagcccac cacagggatg gacccccagg     3660

cacgccgcat gctgtggaac gtcatcgtga gcatcatcag agaagggagg gctgtggtcc     3720

tcacatccca cagcatggaa gaatgtgagg cactgtgtac ccggctggcc atcatggtaa     3780

agggcgcctt tcgatgtatg ggcaccattc agcatctcaa gtccaaattt ggagatggct     3840

atatcgtcac aatgaagatc aaatccccga aggacgacct gcttcctgac ctgaaccctg     3900

tggagcagtt cttccagggg aacttcccag gcagtgtgca gagggagagg cactacaaca     3960

tgctccagtt ccaggtctcc tcctcctccc tggcgaggat cttccagctc ctcctctccc     4020

acaaggacag cctgctcatc gaggagtact cagtcacaca gaccacactg gaccaggtgt     4080

ttgtaaattt tgctaaacag cagactgaaa gccatgacct ccctctgcac cctcgagctg     4140

ctggagccag tcgacaagcc caggactaa                                       4169


<210>  194
<211>  2317
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ABCA4 with FLAG tag

<400>  194

Met Gly Phe Val Arg Gln Ile Gln Leu Leu Leu Trp Lys Asn Trp Thr 
1               5                   10                  15      


Leu Arg Lys Arg Gln Lys Ile Arg Phe Val Val Glu Leu Val Trp Pro 
            20                  25                  30          


Leu Ser Leu Phe Leu Val Leu Ile Trp Leu Arg Asn Ala Asn Pro Leu 
        35                  40                  45              


Tyr Ser His His Glu Cys His Phe Pro Asn Lys Ala Met Pro Ser Ala 
    50                  55                  60                  


Gly Met Leu Pro Trp Leu Gln Gly Ile Phe Cys Asn Val Asn Asn Pro 
65                  70                  75                  80  


Cys Phe Gln Ser Pro Thr Pro Gly Glu Ser Pro Gly Ile Val Ser Asn 
                85                  90                  95      


Tyr Asn Asn Ser Ile Leu Ala Arg Val Tyr Arg Asp Phe Gln Glu Leu 
            100                 105                 110         


Leu Met Asn Ala Pro Glu Ser Gln His Leu Gly Arg Ile Trp Thr Glu 
        115                 120                 125             


Leu His Ile Leu Ser Gln Phe Met Asp Thr Leu Arg Thr His Pro Glu 
    130                 135                 140                 


Arg Ile Ala Gly Arg Gly Ile Arg Ile Arg Asp Ile Leu Lys Asp Glu 
145                 150                 155                 160 


Glu Thr Leu Thr Leu Phe Leu Ile Lys Asn Ile Gly Leu Ser Asp Ser 
                165                 170                 175     


Val Val Tyr Leu Leu Ile Asn Ser Gln Val Arg Pro Glu Gln Phe Ala 
            180                 185                 190         


His Gly Val Pro Asp Leu Ala Leu Lys Asp Ile Ala Cys Ser Glu Ala 
        195                 200                 205             


Leu Leu Glu Arg Phe Ile Ile Phe Ser Gln Arg Arg Gly Ala Lys Thr 
    210                 215                 220                 


Val Arg Tyr Ala Leu Cys Ser Leu Ser Gln Gly Thr Leu Gln Trp Ile 
225                 230                 235                 240 


Glu Asp Thr Leu Tyr Ala Asn Val Asp Phe Phe Lys Leu Phe Arg Val 
                245                 250                 255     


Leu Pro Thr Leu Leu Asp Ser Arg Ser Gln Gly Ile Asn Leu Arg Ser 
            260                 265                 270         


Trp Gly Gly Ile Leu Ser Asp Met Ser Pro Arg Ile Gln Glu Phe Ile 
        275                 280                 285             


His Arg Pro Ser Met Gln Asp Leu Leu Trp Val Thr Arg Pro Leu Met 
    290                 295                 300                 


Gln Asn Gly Gly Pro Glu Thr Phe Thr Lys Leu Met Gly Ile Leu Ser 
305                 310                 315                 320 


Asp Leu Leu Cys Gly Tyr Pro Glu Gly Gly Gly Ser Arg Val Leu Ser 
                325                 330                 335     


Phe Asn Trp Tyr Glu Asp Asn Asn Tyr Lys Ala Phe Leu Gly Ile Asp 
            340                 345                 350         


Ser Thr Arg Lys Asp Pro Ile Tyr Ser Tyr Asp Arg Arg Thr Thr Ser 
        355                 360                 365             


Phe Cys Asn Ala Leu Ile Gln Ser Leu Glu Ser Asn Pro Leu Thr Lys 
    370                 375                 380                 


Ile Ala Trp Arg Ala Ala Lys Pro Leu Leu Met Gly Lys Ile Leu Tyr 
385                 390                 395                 400 


Thr Pro Asp Ser Pro Ala Ala Arg Arg Ile Leu Lys Asn Ala Asn Ser 
                405                 410                 415     


Thr Phe Glu Glu Leu Glu His Val Arg Lys Leu Val Lys Ala Trp Glu 
            420                 425                 430         


Glu Val Gly Pro Gln Ile Trp Tyr Phe Phe Asp Asn Ser Thr Gln Met 
        435                 440                 445             


Asn Met Ile Arg Asp Thr Leu Gly Asn Pro Thr Val Lys Asp Phe Leu 
    450                 455                 460                 


Asn Arg Gln Leu Gly Glu Glu Gly Ile Thr Ala Glu Ala Ile Leu Asn 
465                 470                 475                 480 


Phe Leu Tyr Lys Gly Pro Arg Glu Ser Gln Ala Asp Asp Met Ala Asn 
                485                 490                 495     


Phe Asp Trp Arg Asp Ile Phe Asn Ile Thr Asp Arg Thr Leu Arg Leu 
            500                 505                 510         


Val Asn Gln Tyr Leu Glu Cys Leu Val Leu Asp Lys Phe Glu Ser Tyr 
        515                 520                 525             


Asn Asp Glu Thr Gln Leu Thr Gln Arg Ala Leu Ser Leu Leu Glu Glu 
    530                 535                 540                 


Asn Met Phe Trp Ala Gly Val Val Phe Pro Asp Met Tyr Pro Trp Thr 
545                 550                 555                 560 


Ser Ser Leu Pro Pro His Val Lys Tyr Lys Ile Arg Met Asp Ile Asp 
                565                 570                 575     


Val Val Glu Lys Thr Asn Lys Ile Lys Asp Arg Tyr Trp Asp Asp Tyr 
            580                 585                 590         


Lys Asp His Asp Gly Asp Tyr Lys Asp His Asp Ile Asp Tyr Lys Asp 
        595                 600                 605             


Asp Asp Asp Lys Ser Gly Pro Arg Ala Asp Pro Val Glu Asp Phe Arg 
    610                 615                 620                 


Tyr Ile Trp Gly Gly Phe Ala Tyr Leu Gln Asp Met Val Glu Gln Gly 
625                 630                 635                 640 


Ile Thr Arg Ser Gln Val Gln Ala Glu Ala Pro Val Gly Ile Tyr Leu 
                645                 650                 655     


Gln Gln Met Pro Tyr Pro Cys Phe Val Asp Asp Ser Phe Met Ile Ile 
            660                 665                 670         


Leu Asn Arg Cys Phe Pro Ile Phe Met Val Leu Ala Trp Ile Tyr Ser 
        675                 680                 685             


Val Ser Met Thr Val Lys Ser Ile Val Leu Glu Lys Glu Leu Arg Leu 
    690                 695                 700                 


Lys Glu Thr Leu Lys Asn Gln Gly Val Ser Asn Ala Val Ile Trp Cys 
705                 710                 715                 720 


Thr Trp Phe Leu Asp Ser Phe Ser Ile Met Ser Met Ser Ile Phe Leu 
                725                 730                 735     


Leu Thr Ile Phe Ile Met His Gly Arg Ile Leu His Tyr Ser Asp Pro 
            740                 745                 750         


Phe Ile Leu Phe Leu Phe Leu Leu Ala Phe Ser Thr Ala Thr Ile Met 
        755                 760                 765             


Leu Cys Phe Leu Leu Ser Thr Phe Phe Ser Lys Ala Ser Leu Ala Ala 
    770                 775                 780                 


Ala Cys Ser Gly Val Ile Tyr Phe Thr Leu Tyr Leu Pro His Ile Leu 
785                 790                 795                 800 


Cys Phe Ala Trp Gln Asp Arg Met Thr Ala Glu Leu Lys Lys Ala Val 
                805                 810                 815     


Ser Leu Leu Ser Pro Val Ala Phe Gly Phe Gly Thr Glu Tyr Leu Val 
            820                 825                 830         


Arg Phe Glu Glu Gln Gly Leu Gly Leu Gln Trp Ser Asn Ile Gly Asn 
        835                 840                 845             


Ser Pro Thr Glu Gly Asp Glu Phe Ser Phe Leu Leu Ser Met Gln Met 
    850                 855                 860                 


Met Leu Leu Asp Ala Ala Val Tyr Gly Leu Leu Ala Trp Tyr Leu Asp 
865                 870                 875                 880 


Gln Val Phe Pro Gly Asp Tyr Gly Thr Pro Leu Pro Trp Tyr Phe Leu 
                885                 890                 895     


Leu Gln Glu Ser Tyr Trp Leu Gly Gly Glu Gly Cys Ser Thr Arg Glu 
            900                 905                 910         


Glu Arg Ala Leu Glu Lys Thr Glu Pro Leu Thr Glu Glu Thr Glu Asp 
        915                 920                 925             


Pro Glu His Pro Glu Gly Ile His Asp Ser Phe Phe Glu Arg Glu His 
    930                 935                 940                 


Pro Gly Trp Val Pro Gly Val Cys Val Lys Asn Leu Val Lys Ile Phe 
945                 950                 955                 960 


Glu Pro Cys Gly Arg Pro Ala Val Asp Arg Leu Asn Ile Thr Phe Tyr 
                965                 970                 975     


Glu Asn Gln Ile Thr Ala Phe Leu Gly His Asn Gly Ala Gly Lys Thr 
            980                 985                 990         


Thr Thr Leu Ser Ile Leu Thr Gly  Leu Leu Pro Pro Thr  Ser Gly Thr 
        995                 1000                 1005             


Val Leu  Val Gly Gly Arg Asp  Ile Glu Thr Ser Leu  Asp Ala Val 
    1010                 1015                 1020             


Arg Gln  Ser Leu Gly Met Cys  Pro Gln His Asn Ile  Leu Phe His 
    1025                 1030                 1035             


His Leu  Thr Val Ala Glu His  Met Leu Phe Tyr Ala  Gln Leu Lys 
    1040                 1045                 1050             


Gly Lys  Ser Gln Glu Glu Ala  Gln Leu Glu Met Glu  Ala Met Leu 
    1055                 1060                 1065             


Glu Asp  Thr Gly Leu His His  Lys Arg Asn Glu Glu  Ala Gln Asp 
    1070                 1075                 1080             


Leu Ser  Gly Gly Met Gln Arg  Lys Leu Ser Val Ala  Ile Ala Phe 
    1085                 1090                 1095             


Val Gly  Asp Ala Lys Val Val  Ile Leu Asp Glu Pro  Thr Ser Gly 
    1100                 1105                 1110             


Val Asp  Pro Tyr Ser Arg Arg  Ser Ile Trp Asp Leu  Leu Leu Lys 
    1115                 1120                 1125             


Tyr Arg  Ser Gly Arg Thr Ile  Ile Met Ser Thr His  His Met Asp 
    1130                 1135                 1140             


Glu Ala  Asp Leu Leu Gly Asp  Arg Ile Ala Ile Ile  Ala Gln Gly 
    1145                 1150                 1155             


Arg Leu  Tyr Cys Ser Gly Thr  Pro Leu Phe Leu Lys  Asn Cys Phe 
    1160                 1165                 1170             


Gly Thr  Gly Leu Tyr Leu Thr  Leu Val Arg Lys Met  Lys Asn Ile 
    1175                 1180                 1185             


Gln Ser  Gln Arg Lys Gly Ser  Glu Gly Thr Cys Ser  Cys Ser Ser 
    1190                 1195                 1200             


Lys Gly  Phe Ser Thr Thr Cys  Pro Ala His Val Asp  Asp Leu Thr 
    1205                 1210                 1215             


Pro Glu  Gln Val Leu Asp Gly  Asp Val Asn Glu Leu  Met Asp Val 
    1220                 1225                 1230             


Val Leu  His His Val Pro Glu  Ala Lys Leu Val Glu  Cys Ile Gly 
    1235                 1240                 1245             


Gln Glu  Leu Ile Phe Leu Leu  Pro Asn Lys Asn Phe  Lys His Arg 
    1250                 1255                 1260             


Ala Tyr  Ala Ser Leu Phe Arg  Glu Leu Glu Glu Thr  Leu Ala Asp 
    1265                 1270                 1275             


Leu Gly  Leu Ser Ser Phe Gly  Ile Ser Asp Thr Pro  Leu Glu Glu 
    1280                 1285                 1290             


Ile Phe  Leu Lys Val Thr Glu  Asp Ser Asp Ser Gly  Pro Leu Phe 
    1295                 1300                 1305             


Ala Gly  Gly Ala Gln Gln Lys  Arg Glu Asn Val Asn  Pro Arg His 
    1310                 1315                 1320             


Pro Cys  Leu Gly Pro Arg Glu  Lys Ala Gly Gln Thr  Pro Gln Asp 
    1325                 1330                 1335             


Ser Asn  Val Cys Ser Pro Gly  Ala Pro Ala Ala His  Pro Glu Gly 
    1340                 1345                 1350             


Gln Pro  Pro Pro Glu Pro Glu  Cys Pro Gly Pro Gln  Leu Asn Thr 
    1355                 1360                 1365             


Gly Thr  Gln Leu Val Leu Gln  His Val Gln Ala Leu  Leu Val Lys 
    1370                 1375                 1380             


Arg Phe  Gln His Thr Ile Arg  Ser His Lys Asp Phe  Leu Ala Gln 
    1385                 1390                 1395             


Ile Val  Leu Pro Ala Thr Phe  Val Phe Leu Ala Leu  Met Leu Ser 
    1400                 1405                 1410             


Ile Val  Ile Pro Pro Phe Gly  Glu Tyr Pro Ala Leu  Thr Leu His 
    1415                 1420                 1425             


Pro Trp  Ile Tyr Gly Gln Gln  Tyr Thr Phe Phe Ser  Met Asp Glu 
    1430                 1435                 1440             


Pro Gly  Ser Glu Gln Phe Thr  Val Leu Ala Asp Val  Leu Leu Asn 
    1445                 1450                 1455             


Lys Pro  Gly Phe Gly Asn Arg  Cys Leu Lys Glu Gly  Trp Leu Pro 
    1460                 1465                 1470             


Glu Tyr  Pro Cys Gly Asn Ser  Thr Pro Trp Lys Thr  Pro Ser Val 
    1475                 1480                 1485             


Ser Pro  Asn Ile Thr Gln Leu  Phe Gln Lys Gln Lys  Trp Thr Gln 
    1490                 1495                 1500             


Val Asn  Pro Ser Pro Ser Cys  Arg Cys Ser Thr Arg  Glu Lys Leu 
    1505                 1510                 1515             


Thr Met  Leu Pro Glu Cys Pro  Glu Gly Ala Gly Gly  Leu Pro Pro 
    1520                 1525                 1530             


Pro Gln  Arg Thr Gln Arg Ser  Thr Glu Ile Leu Gln  Asp Leu Thr 
    1535                 1540                 1545             


Asp Arg  Asn Ile Ser Asp Phe  Leu Val Lys Thr Tyr  Pro Ala Leu 
    1550                 1555                 1560             


Ile Arg  Ser Ser Leu Lys Ser  Lys Phe Trp Val Asn  Glu Gln Arg 
    1565                 1570                 1575             


Tyr Gly  Gly Ile Ser Ile Gly  Gly Lys Leu Pro Val  Val Pro Ile 
    1580                 1585                 1590             


Thr Gly  Glu Ala Leu Val Gly  Phe Leu Ser Asp Leu  Gly Arg Ile 
    1595                 1600                 1605             


Met Asn  Val Ser Gly Gly Pro  Ile Thr Arg Glu Ala  Ser Lys Glu 
    1610                 1615                 1620             


Ile Pro  Asp Phe Leu Lys His  Leu Glu Thr Glu Asp  Asn Ile Lys 
    1625                 1630                 1635             


Val Trp  Phe Asn Asn Lys Gly  Trp His Ala Leu Val  Ser Phe Leu 
    1640                 1645                 1650             


Asn Val  Ala His Asn Ala Ile  Leu Arg Ala Ser Leu  Pro Lys Asp 
    1655                 1660                 1665             


Arg Ser  Pro Glu Glu Tyr Gly  Ile Thr Val Ile Ser  Gln Pro Leu 
    1670                 1675                 1680             


Asn Leu  Thr Lys Glu Gln Leu  Ser Glu Ile Thr Val  Leu Thr Thr 
    1685                 1690                 1695             


Ser Val  Asp Ala Val Val Ala  Ile Cys Val Ile Phe  Ser Met Ser 
    1700                 1705                 1710             


Phe Val  Pro Ala Ser Phe Val  Leu Tyr Leu Ile Gln  Glu Arg Val 
    1715                 1720                 1725             


Asn Lys  Ser Lys His Leu Gln  Phe Ile Ser Gly Val  Ser Pro Thr 
    1730                 1735                 1740             


Thr Tyr  Trp Val Thr Asn Phe  Leu Trp Asp Ile Met  Asn Tyr Ser 
    1745                 1750                 1755             


Val Ser  Ala Gly Leu Val Val  Gly Ile Phe Ile Gly  Phe Gln Lys 
    1760                 1765                 1770             


Lys Ala  Tyr Thr Ser Pro Glu  Asn Leu Pro Ala Leu  Val Ala Leu 
    1775                 1780                 1785             


Leu Leu  Leu Tyr Gly Trp Ala  Val Ile Pro Met Met  Tyr Pro Ala 
    1790                 1795                 1800             


Ser Phe  Leu Phe Asp Val Pro  Ser Thr Ala Tyr Val  Ala Leu Ser 
    1805                 1810                 1815             


Cys Ala  Asn Leu Phe Ile Gly  Ile Asn Ser Ser Ala  Ile Thr Phe 
    1820                 1825                 1830             


Ile Leu  Glu Leu Phe Glu Asn  Asn Arg Thr Leu Leu  Arg Phe Asn 
    1835                 1840                 1845             


Ala Val  Leu Arg Lys Leu Leu  Ile Val Phe Pro His  Phe Cys Leu 
    1850                 1855                 1860             


Gly Arg  Gly Leu Ile Asp Leu  Ala Leu Ser Gln Ala  Val Thr Asp 
    1865                 1870                 1875             


Val Tyr  Ala Arg Phe Gly Glu  Glu His Ser Ala Asn  Pro Phe His 
    1880                 1885                 1890             


Trp Asp  Leu Ile Gly Lys Asn  Leu Phe Ala Met Val  Val Glu Gly 
    1895                 1900                 1905             


Val Val  Tyr Phe Leu Leu Thr  Leu Leu Val Gln Arg  His Phe Phe 
    1910                 1915                 1920             


Leu Ser  Gln Trp Ile Ala Glu  Pro Thr Lys Glu Pro  Ile Val Asp 
    1925                 1930                 1935             


Glu Asp  Asp Asp Val Ala Glu  Glu Arg Gln Arg Ile  Ile Thr Gly 
    1940                 1945                 1950             


Gly Asn  Lys Thr Asp Ile Leu  Arg Leu His Glu Leu  Thr Lys Ile 
    1955                 1960                 1965             


Tyr Pro  Gly Thr Ser Ser Pro  Ala Val Asp Arg Leu  Cys Val Gly 
    1970                 1975                 1980             


Val Arg  Pro Gly Glu Cys Phe  Gly Leu Leu Gly Val  Asn Gly Ala 
    1985                 1990                 1995             


Gly Lys  Thr Thr Thr Phe Lys  Met Leu Thr Gly Asp  Thr Thr Val 
    2000                 2005                 2010             


Thr Ser  Gly Asp Ala Thr Val  Ala Gly Lys Ser Ile  Leu Thr Asn 
    2015                 2020                 2025             


Ile Ser  Glu Val His Gln Asn  Met Gly Tyr Cys Pro  Gln Phe Asp 
    2030                 2035                 2040             


Ala Ile  Asp Glu Leu Leu Thr  Gly Arg Glu His Leu  Tyr Leu Tyr 
    2045                 2050                 2055             


Ala Arg  Leu Arg Gly Val Pro  Ala Glu Glu Ile Glu  Lys Val Ala 
    2060                 2065                 2070             


Asn Trp  Ser Ile Lys Ser Leu  Gly Leu Thr Val Tyr  Ala Asp Cys 
    2075                 2080                 2085             


Leu Ala  Gly Thr Tyr Ser Gly  Gly Asn Lys Arg Lys  Leu Ser Thr 
    2090                 2095                 2100             


Ala Ile  Ala Leu Ile Gly Cys  Pro Pro Leu Val Leu  Leu Asp Glu 
    2105                 2110                 2115             


Pro Thr  Thr Gly Met Asp Pro  Gln Ala Arg Arg Met  Leu Trp Asn 
    2120                 2125                 2130             


Val Ile  Val Ser Ile Ile Arg  Glu Gly Arg Ala Val  Val Leu Thr 
    2135                 2140                 2145             


Ser His  Ser Met Glu Glu Cys  Glu Ala Leu Cys Thr  Arg Leu Ala 
    2150                 2155                 2160             


Ile Met  Val Lys Gly Ala Phe  Arg Cys Met Gly Thr  Ile Gln His 
    2165                 2170                 2175             


Leu Lys  Ser Lys Phe Gly Asp  Gly Tyr Ile Val Thr  Met Lys Ile 
    2180                 2185                 2190             


Lys Ser  Pro Lys Asp Asp Leu  Leu Pro Asp Leu Asn  Pro Val Glu 
    2195                 2200                 2205             


Gln Phe  Phe Gln Gly Asn Phe  Pro Gly Ser Val Gln  Arg Glu Arg 
    2210                 2215                 2220             


His Tyr  Asn Met Leu Gln Phe  Gln Val Ser Ser Ser  Ser Leu Ala 
    2225                 2230                 2235             


Arg Ile  Phe Gln Leu Leu Leu  Ser His Lys Asp Ser  Leu Leu Ile 
    2240                 2245                 2250             


Glu Glu  Tyr Ser Val Thr Gln  Thr Thr Leu Asp Gln  Val Phe Val 
    2255                 2260                 2265             


Asn Phe  Ala Lys Gln Gln Thr  Glu Ser His Asp Leu  Pro Leu His 
    2270                 2275                 2280             


Pro Arg  Ala Ala Gly Ala Ser  Arg Gln Ala Gln Asp  Asp Tyr Lys 
    2285                 2290                 2295             


Asp His  Asp Gly Asp Tyr Lys  Asp His Asp Ile Asp  Tyr Lys Asp 
    2300                 2305                 2310             


Asp Asp  Asp Lys 
    2315         


<210>  195
<211>  6954
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ABCA4 with FLAG tag

<400>  195
atgggcttcg tgagacagat acagcttttg ctctggaaga actggaccct gcggaaaagg       60

caaaagattc gctttgtggt ggaactcgtg tggcctttat ctttatttct ggtcttgatc      120

tggttaagga atgccaaccc actctacagc catcatgaat gccatttccc caacaaggcg      180

atgccctcag caggaatgct gccgtggctc caggggatct tctgcaatgt gaacaatccc      240

tgttttcaaa gccccacccc aggagaatct cctggaattg tgtcaaacta taacaactcc      300

atcttggcaa gggtatatcg agattttcaa gaactcctca tgaatgcacc agagagccag      360

caccttggcc gtatttggac agagctacac atcttgtccc aattcatgga caccctccgg      420

actcacccgg agagaattgc aggaagagga atacgaataa gggatatctt gaaagatgaa      480

gaaacactga cactatttct cattaaaaac atcggcctgt ctgactcagt ggtctacctt      540

ctgatcaact ctcaagtccg tccagagcag ttcgctcatg gagtcccgga cctggcgctg      600

aaggacatcg cctgcagcga ggccctcctg gagcgcttca tcatcttcag ccagagacgc      660

ggggcaaaga cggtgcgcta tgccctgtgc tccctctccc agggcaccct acagtggata      720

gaagacactc tgtatgccaa cgtggacttc ttcaagctct tccgtgtgct tcccacactc      780

ctagacagcc gttctcaagg tatcaatctg agatcttggg gaggaatatt atctgatatg      840

tcaccaagaa ttcaagagtt tatccatcgg ccgagtatgc aggacttgct gtgggtgacc      900

aggcccctca tgcagaatgg tggtccagag acctttacaa agctgatggg catcctgtct      960

gacctcctgt gtggctaccc cgagggaggt ggctctcggg tgctctcctt caactggtat     1020

gaagacaata actataaggc ctttctgggg attgactcca caaggaagga tcctatctat     1080

tcttatgaca gaagaacaac atccttttgt aatgcattga tccagagcct ggagtcaaat     1140

cctttaacca aaatcgcttg gagggcggca aagcctttgc tgatgggaaa aatcctgtac     1200

actcctgatt cacctgcagc acgaaggata ctgaagaatg ccaactcaac ttttgaagaa     1260

ctggaacacg ttaggaagtt ggtcaaagcc tgggaagaag tagggcccca gatctggtac     1320

ttctttgaca acagcacaca gatgaacatg atcagagata ccctggggaa cccaacagta     1380

aaagactttt tgaataggca gcttggtgaa gaaggtatta ctgctgaagc catcctaaac     1440

ttcctctaca agggccctcg ggaaagccag gctgacgaca tggccaactt cgactggagg     1500

gacatattta acatcactga tcgcaccctc cgcctggtca atcaatacct ggagtgcttg     1560

gtcctggata agtttgaaag ctacaatgat gaaactcagc tcacccaacg tgccctctct     1620

ctactggagg aaaacatgtt ctgggccgga gtggtattcc ctgacatgta tccctggacc     1680

agctctctac caccccacgt gaagtataag atccgaatgg acatagacgt ggtggagaaa     1740

accaataaga ttaaagacag gtattgggat gactacaaag accatgacgg tgattataaa     1800

gatcatgaca tcgattacaa ggatgacgat gacaagtctg gtcccagagc tgatcccgtg     1860

gaagatttcc ggtacatctg gggcgggttt gcctatctgc aggacatggt tgaacagggg     1920

atcacaagga gccaggtgca ggcggaggct ccagttggaa tctacctcca gcagatgccc     1980

tacccctgct tcgtggacga ttctttcatg atcatcctga accgctgttt ccctatcttc     2040

atggtgctgg catggatcta ctctgtctcc atgactgtga agagcatcgt cttggagaag     2100

gagttgcgac tgaaggagac cttgaaaaat cagggtgtct ccaatgcagt gatttggtgt     2160

acctggttcc tggacagctt ctccatcatg tcgatgagca tcttcctcct gacgatattc     2220

atcatgcatg gaagaatcct acattacagc gacccattca tcctcttcct gttcttgttg     2280

gctttctcca ctgccaccat catgctgtgc tttctgctca gcaccttctt ctccaaggca     2340

agcttggcag cagcctgtag tggtgtcatc tatttcaccc tctacctgcc acacatcctg     2400

tgcttcgcct ggcaggaccg catgaccgct gagctgaaga aggctgtgag cttactgtct     2460

ccggtggcat ttggatttgg cactgagtac ctggttcgct ttgaagagca aggcctgggg     2520

ctgcagtgga gcaacatcgg gaacagtccc acggaagggg acgaattcag cttcctgctg     2580

tccatgcaga tgatgctcct tgatgctgct gtctatggct tactcgcttg gtatcttgat     2640

caggtgtttc caggagacta tggaacccca cttccttggt actttcttct acaagagtcg     2700

tattggcttg gcggtgaagg ttgttcaacc agagaagaaa gagccctgga aaagaccgag     2760

cccctaacag aggaaacgga ggatccagag cacccagaag gaatacacga ctccttcttt     2820

gaacgtgagc atccagggtg ggttcctggg gtatgcgtga agaatctggt aaagattttt     2880

gagccctgtg gccggccagc tgtggaccgt ctgaacatca ccttctacga gaaccagatc     2940

accgcattcc tgggccacaa tggagctggg aaaaccacca ccttgtccat cctgacgggt     3000

ctgttgccac caacctctgg gactgtgctc gttgggggaa gggacattga aaccagcctg     3060

gatgcagtcc ggcagagcct tggcatgtgt ccacagcaca acatcctgtt ccaccacctc     3120

acggtggctg agcacatgct gttctatgcc cagctgaaag gaaagtccca ggaggaggcc     3180

cagctggaga tggaagccat gttggaggac acaggcctcc accacaagcg gaatgaagag     3240

gctcaggacc tatcaggtgg catgcagaga aagctgtcgg ttgccattgc ctttgtggga     3300

gatgccaagg tggtgattct ggacgaaccc acctctgggg tggaccctta ctcgagacgc     3360

tcaatctggg atctgctcct gaagtatcgc tcaggcagaa ccatcatcat gtccactcac     3420

cacatggacg aggccgacct ccttggggac cgcattgcca tcattgccca gggaaggctc     3480

tactgctcag gcaccccact cttcctgaag aactgctttg gcacaggctt gtacttaacc     3540

ttggtgcgca agatgaaaaa catccagagc caaaggaaag gcagtgaggg gacctgcagc     3600

tgctcgtcta agggtttctc caccacgtgt ccagcccacg tcgatgacct aactccagaa     3660

caagtcctgg atggggatgt aaatgagctg atggatgtag ttctccacca tgttccagag     3720

gcaaagctgg tggagtgcat tggtcaagaa cttatcttcc ttcttccaaa taagaacttc     3780

aagcacagag catatgccag ccttttcaga gagctggagg agacgctggc tgaccttggt     3840

ctcagcagtt ttggaatttc tgacactccc ctggaagaga tttttctgaa ggtcacggag     3900

gattctgatt caggacctct gtttgcgggt ggcgctcagc agaaaagaga aaacgtcaac     3960

ccccgacacc cctgcttggg tcccagagag aaggctggac agacacccca ggactccaat     4020

gtctgctccc caggggcgcc ggctgctcac ccagagggcc agcctccccc agagccagag     4080

tgcccaggcc cgcagctcaa cacggggaca cagctggtcc tccagcatgt gcaggcgctg     4140

ctggtcaaga gattccaaca caccatccgc agccacaagg acttcctggc gcagatcgtg     4200

ctcccggcta cctttgtgtt tttggctctg atgctttcta ttgttatccc tccttttggc     4260

gaataccccg ctttgaccct tcacccctgg atatatgggc agcagtacac cttcttcagc     4320

atggatgaac caggcagtga gcagttcacg gtacttgcag acgtcctcct gaataagcca     4380

ggctttggca accgctgcct gaaggaaggg tggcttccgg agtacccctg tggcaactca     4440

acaccctgga agactccttc tgtgtcccca aacatcaccc agctgttcca gaagcagaaa     4500

tggacacagg tcaacccttc accatcctgc aggtgcagca ccagggagaa gctcaccatg     4560

ctgccagagt gccccgaggg tgccgggggc ctcccgcccc cccagagaac acagcgcagc     4620

acggaaattc tacaagacct gacggacagg aacatctccg acttcttggt aaaaacgtat     4680

cctgctctta taagaagcag cttaaagagc aaattctggg tcaatgaaca gaggtatgga     4740

ggaatttcca ttggaggaaa gctcccagtc gtccccatca cgggggaagc acttgttggg     4800

tttttaagcg accttggccg gatcatgaat gtgagcgggg gccctatcac tagagaggcc     4860

tctaaagaaa tacctgattt ccttaaacat ctagaaactg aagacaacat taaggtgtgg     4920

tttaataaca aaggctggca tgccctggtc agctttctca atgtggccca caacgccatc     4980

ttacgggcca gcctgcctaa ggacaggagc cccgaggagt atggaatcac cgtcattagc     5040

caacccctga acctgaccaa ggagcagctc tcagagatta cagtgctgac cacttcagtg     5100

gatgctgtgg ttgccatctg cgtgattttc tccatgtcct tcgtcccagc cagctttgtc     5160

ctttatttga tccaggagcg ggtgaacaaa tccaagcacc tccagtttat cagtggagtg     5220

agccccacca cctactgggt gaccaacttc ctctgggaca tcatgaatta ttccgtgagt     5280

gctgggctgg tggtgggcat cttcatcggg tttcagaaga aagcctacac ttctccagaa     5340

aaccttcctg cccttgtggc actgctcctg ctgtatggat gggcggtcat tcccatgatg     5400

tacccagcat ccttcctgtt tgatgtcccc agcacagcct atgtggcttt atcttgtgct     5460

aatctgttca tcggcatcaa cagcagtgct attaccttca tcttggaatt atttgagaat     5520

aaccggacgc tgctcaggtt caacgccgtg ctgaggaagc tgctcattgt cttcccccac     5580

ttctgcctgg gccggggcct cattgacctt gcactgagcc aggctgtgac agatgtctat     5640

gcccggtttg gtgaggagca ctctgcaaat ccgttccact gggacctgat tgggaagaac     5700

ctgtttgcca tggtggtgga aggggtggtg tacttcctcc tgaccctgct ggtccagcgc     5760

cacttcttcc tctcccaatg gattgccgag cccactaagg agcccattgt tgatgaagat     5820

gatgatgtgg ctgaagaaag acaaagaatt attactggtg gaaataaaac tgacatctta     5880

aggctacatg aactaaccaa gatttatcca ggcacctcca gcccagcagt ggacaggctg     5940

tgtgtcggag ttcgccctgg agagtgcttt ggcctcctgg gagtgaatgg tgccggcaaa     6000

acaaccacat tcaagatgct cactggggac accacagtga cctcagggga tgccaccgta     6060

gcaggcaaga gtattttaac caatatttct gaagtccatc aaaatatggg ctactgtcct     6120

cagtttgatg caattgatga gctgctcaca ggacgagaac atctttacct ttatgcccgg     6180

cttcgaggtg taccagcaga agaaatcgaa aaggttgcaa actggagtat taagagcctg     6240

ggcctgactg tctacgccga ctgcctggct ggcacgtaca gtgggggcaa caagcggaaa     6300

ctctccacag ccatcgcact cattggctgc ccaccgctgg tgctgctgga tgagcccacc     6360

acagggatgg acccccaggc acgccgcatg ctgtggaacg tcatcgtgag catcatcaga     6420

gaagggaggg ctgtggtcct cacatcccac agcatggaag aatgtgaggc actgtgtacc     6480

cggctggcca tcatggtaaa gggcgccttt cgatgtatgg gcaccattca gcatctcaag     6540

tccaaatttg gagatggcta tatcgtcaca atgaagatca aatccccgaa ggacgacctg     6600

cttcctgacc tgaaccctgt ggagcagttc ttccagggga acttcccagg cagtgtgcag     6660

agggagaggc actacaacat gctccagttc caggtctcct cctcctccct ggcgaggatc     6720

ttccagctcc tcctctccca caaggacagc ctgctcatcg aggagtactc agtcacacag     6780

accacactgg accaggtgtt tgtaaatttt gctaaacagc agactgaaag ccatgacctc     6840

cctctgcacc ctcgagctgc tggagccagt cgacaagccc aggacgacta caaagaccat     6900

gacggtgatt ataaagatca tgacatcgat tacaaggatg acgatgacaa gtaa           6954


<210>  196
<211>  735
<212>  PRT
<213>  Adeno-associated virus

<400>  196

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
            20                  25                  30          


Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Arg Gln Leu Asp Ser Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu His Ser Pro Val Glu Pro Asp Ser Ser Ser Gly Thr Gly 
145                 150                 155                 160 


Lys Ala Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ala Asp Ser Val Pro Asp Pro Gln Pro Leu Gly Gln Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Leu Gly Thr Asn Thr Met Ala Thr Gly Ser Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His Tyr 
            260                 265                 270         


Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
        275                 280                 285             


Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp 
    290                 295                 300                 


Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln Val 
305                 310                 315                 320 


Lys Glu Val Thr Gln Asn Asp Gly Thr Thr Thr Ile Ala Asn Asn Leu 
                325                 330                 335     


Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu Pro Tyr 
            340                 345                 350         


Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp 
        355                 360                 365             


Val Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
    370                 375                 380                 


Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385                 390                 395                 400 


Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
                405                 410                 415     


Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp Arg 
            420                 425                 430         


Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Arg Thr 
        435                 440                 445             


Asn Thr Pro Ser Gly Thr Thr Thr Gln Ser Arg Leu Gln Phe Ser Gln 
    450                 455                 460                 


Ala Gly Ala Ser Asp Ile Arg Asp Gln Ser Arg Asn Trp Leu Pro Gly 
465                 470                 475                 480 


Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Ser Ala Asp Asn Asn 
                485                 490                 495     


Asn Ser Glu Tyr Ser Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly 
            500                 505                 510         


Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys Asp 
        515                 520                 525             


Asp Glu Glu Lys Phe Phe Pro Gln Ser Gly Val Leu Ile Phe Gly Lys 
    530                 535                 540                 


Gln Gly Ser Glu Lys Thr Asn Val Asp Ile Glu Lys Val Met Ile Thr 
545                 550                 555                 560 


Asp Glu Glu Glu Ile Arg Thr Thr Asn Pro Val Ala Thr Glu Gln Tyr 
                565                 570                 575     


Gly Ser Val Ser Thr Asn Leu Gln Arg Gly Asn Arg Gln Ala Ala Thr 
            580                 585                 590         


Ala Asp Val Asn Thr Gln Gly Val Leu Pro Gly Met Val Trp Gln Asp 
        595                 600                 605             


Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr 
    610                 615                 620                 


Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625                 630                 635                 640 


His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asn 
                645                 650                 655     


Pro Ser Thr Thr Phe Ser Ala Ala Lys Phe Ala Ser Phe Ile Thr Gln 
            660                 665                 670         


Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln Lys 
        675                 680                 685             


Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr 
    690                 695                 700                 


Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val Tyr 
705                 710                 715                 720 


Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735 


<210>  197
<211>  2208
<212>  DNA
<213>  Adeno-associated virus

<400>  197
atggctgccg atggttatct tccagattgg ctcgaggaca ctctctctga aggaataaga       60

cagtggtgga agctcaaacc tggcccacca ccaccaaagc ccgcagagcg gcataaggac      120

gacagcaggg gtcttgtgct tcctgggtac aagtacctcg gacccttcaa cggactcgac      180

aagggagagc cggtcaacga ggcagacgcc gcggccctcg agcacgacaa agcctacgac      240

cggcagctcg acagcggaga caacccgtac ctcaagtaca accacgccga cgcggagttt      300

caggagcgcc ttaaagaaga tacgtctttt gggggcaacc tcggacgagc agtcttccag      360

gcgaaaaaga gggttcttga acctctgggc ctggttgagg aacctgttaa gacggctccg      420

ggaaaaaaga ggccggtaga gcactctcct gtggagccag actcctcctc gggaaccgga      480

aaggcgggcc agcagcctgc aagaaaaaga ttgaattttg gtcagactgg agacgcagac      540

tcagtacctg acccccagcc tctcggacag ccaccagcag ccccctctgg tctgggaact      600

aatacgatgg ctacaggcag tggcgcacca atggcagaca ataacgaggg cgccgacgga      660

gtgggtaatt cctcgggaaa ttggcattgc gattccacat ggatgggcga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca accacctcta caaacaaatt      780

tccagccaat caggagcctc gaacgacaat cactactttg gctacagcac cccttggggg      840

tattttgact tcaacagatt ccactgccac ttttcaccac gtgactggca aagactcatc      900

aacaacaact ggggattccg acccaagaga ctcaacttca agctctttaa cattcaagtc      960

aaagaggtca cgcagaatga cggtacgacg acgattgcca ataaccttac cagcacggtt     1020

caggtgttta ctgactcgga gtaccagctc ccgtacgtcc tcggctcggc gcatcaagga     1080

tgcctcccgc cgttcccagc agacgtcttc atggtgccac agtatggata cctcaccctg     1140

aacaacggga gtcaggcagt aggacgctct tcattttact gcctggagta ctttccttct     1200

cagatgctgc gtaccggaaa caactttacc ttcagctaca cttttgagga cgttcctttc     1260

cacagcagct acgctcacag ccagagtctg gaccgtctca tgaatcctct catcgaccag     1320

tacctgtatt acttgagcag aacaaacact ccaagtggaa ccaccacgca gtcaaggctt     1380

cagttttctc aggccggagc gagtgacatt cgggaccagt ctaggaactg gcttcctgga     1440

ccctgttacc gccagcagcg agtatcaaag acatctgcgg ataacaacaa cagtgaatac     1500

tcgtggactg gagctaccaa gtaccacctc aatggcagag actctctggt gaatccgggc     1560

ccggccatgg caagccacaa ggacgatgaa gaaaagtttt ttcctcagag cggggttctc     1620

atctttggga agcaaggctc agagaaaaca aatgtggaca ttgaaaaggt catgattaca     1680

gacgaagagg aaatcaggac aaccaatccc gtggctacgg agcagtatgg ttctgtatct     1740

accaacctcc agagaggcaa cagacaagca gctaccgcag atgtcaacac acaaggcgtt     1800

cttccaggca tggtctggca ggacagagat gtgtaccttc aggggcccat ctgggcaaag     1860

attccacaca cggacggaca ttttcacccc tctcccctca tgggtggatt cggacttaaa     1920

caccctcctc cacagattct catcaagaac accccggtac ctgcgaatcc ttcgaccacc     1980

ttcagtgcgg caaagtttgc ttccttcatc acacagtact ccacgggaca ggtcagcgtg     2040

gagatcgagt gggagctgca gaaggaaaac agcaaacgct ggaatcccga aattcagtac     2100

acttccaact acaacaagtc tgttaatgtg gactttactg tggacactaa tggcgtgtat     2160

tcagagcctc gccccattgg caccagatac ctgactcgta atctgtaa                  2208


<210>  198
<211>  2818
<212>  DNA
<213>  Homo sapiens

<400>  198
gctccttcct gtactgccca gctccgcttg ctccctgacc atccctgcag cagccctgat       60

gtgtcattgt ccccctctta acctgcgctg cagtgctgca gggctgggct ctggagctgg      120

gtctggtcat ttctccttag atatgtagag gcccaggaaa ggtttggagc ctaagaagcc      180

ctaggactcc aggtctccag ggcagcccca gcctcttgga atgactttcc ctaataccac      240

aggggtgttc taatcccagg cagacccaag ctgcccctca ccaactccta cgtcctcaac      300

ttcctttcat aacttctagg atggaaacac ctaatcctcc agcaatactg aggcttttct      360

ccttattctg ttttcccttt tgaagaagcc aaggctcaga gcagtcgagt cacctaatca      420

tggtctcatg tcgcctgatc aaggtctcat gtcaccttat caagatctca cccactcacc      480

tattcagttc tcaccagttc agttcaggat ggcttctaag ctaccctgca cagctctgcc      540

cacaggacat ttgtataagt gagggggtgc aggccttcca gccccctcca actccaaaac      600

tcagccccca agatcaagtg gactctctga acccaccctg gccctacagt tgtcagggtc      660

tggatgggaa gatgtagagc tctcggcttt cactctgggg acttacccag aacatattct      720

cctcatgagc taaggaggct ggctgccatc ttcctacatc cccccacggc ctgggggcaa      780

ggacaccctg gccccctgga gtctggagaa ctctgaggac agaacttgct cttccacctg      840

cttgggcctt acccacagga gaagcactgc ttctctaccc atgccccatc caactcaggc      900

accccaggga cttgcaacag tctgattttt tctcacgtcc ttcttaaggc tctgggctag      960

ccacacaaat caaatcccag tgataggtcc agacaatcct atcctgaaac tacatcttag     1020

taagactcca gggaatcctt tccccaaaga cagtcttact cctgttctcc ccccaagcct     1080

ttctgggcca gaagctttgc ctggactcaa gcaatggcag acaagtgccc tctgaggaca     1140

cggaagtgca tgctcagaac tgtgattctc caagtggagg cagaggagaa ggcccaggct     1200

tcccagcagg gctaaggata tgcaaggagt gcattcatcc ggaggtgttg gcagcatccc     1260

agccccaccc cattctcatc gtaaatcagg ctcacttcca ttggctgcat acggtggagt     1320

gatgtgacca tatgtcactt gagcattaca caaatcctaa tgagctaaaa atatgtttgt     1380

tttagctaat tgacctcttt ggccttcata aagcagttgg taaacatcct cagataatga     1440

tttccaaaga gcagattgtg ggtctcagct gtgcagagaa agcccacgtc cctgagacca     1500

ccttctccag ctgcctactg aggcacacag gggcgcctgc ctgctgcccg ctcagccaag     1560

gcggtgttgc tggagccagc ttgggacagc tctcccaacg ctctgccctg gccttgcgac     1620

cactctctgg gccgtagttg tctgtctgtt aagtgaggaa agtgcccatc tccagaggca     1680

ttcagcggca aagcagggct tccaggttcc gaccccatag caggacttct tggatttcta     1740

cagccagtca gttgcaagca gcacccatat tatttctata agaagtggca ggagctggga     1800

tctgaagagt tcagcagtct acctttccct gtttcttgtg ctttatgcag tcaggaggaa     1860

tgatctggat tccatgtgaa gcctgggacc acggagaccc aagacttcct gcttgattct     1920

ccctgcgaac tgcaggctgt gggctgagcc ttcaagaagc aggagtcccc tctagccatt     1980

aactctcaga gctaacctca tttgaatggg aacactagtc ctgtgatgtc tggaaggtgg     2040

gcgcctctac actccacacc ctacatggtg gtccagacac atcattccca gcattagaaa     2100

gctgtagggg gacccgttct gttccctgga ggcattaaag ggacatagaa ataaatctca     2160

agctctgagg ctgatgccag cctcagactc agcctctgca ctgtatgggc caattgtagc     2220

cccaaggact tcttcttgct gcacccccta tctgtccaca cctaaaacga tgggcttcta     2280

tttagttaca gaactctctg gcctgttttg ttttgctttg ctttgttttg ttttgttttt     2340

ttgttttttt gttttttagc tatgaaacag aggtaatatc taatacagat aacttaccag     2400

taatgagtgc ttcctactta ctgggtactg ggaagaagtg ctttacacat attttctcat     2460

ttaatctaca caataagtaa ttaagacatt tccctgaggc cacgggagag acagtggcag     2520

aacagttctc caaggaggac ttgcaagtta ataactggac tttgcaaggc tctggtggaa     2580

actgtcagct tgtaaaggat ggagcacagt gtctggcatg tagcaggaac taaaataatg     2640

gcagtgatta atgttatgat atgcagacac aacacagcaa gataagatgc aatgtacctt     2700

ctgggtcaaa ccaccctggc cactcctccc cgatacccag ggttgatgtg cttgaattag     2760

acaggattaa aggcttactg gagctggaag ccttgcccca actcaggagt ttagcccc       2818


<210>  199
<211>  2100
<212>  DNA
<213>  Homo sapiens

<400>  199
ccctgctaat ttttgtattt tttgtagaga cagggtctca ctatgttgcc caggctagtc       60

tcgaactcct gggctcaagt gatcctctca cctcggcctc ccaaagaact gggattacag      120

gcatgagcta ccatgtccag cccaattgcc cattgatggg cacgggttgc ttccatgttt      180

cagctgttgt gaatcacgct gctgtgaaca tgcgtgtgca aacagccctt ccagaccctg      240

ccttccattc ctctgggcct atacccagca gtgcggttgc tgggtcctat gggaattcta      300

cgtttaactt ttggaggagc tgccaaactg ttttccacag tggctgcgcc atcacaatcc      360

aattttagga catttttatc acccataaag cactccctgt acccattaag aagtcatcct      420

ccatttccct ccctcccctg tcctggcacc cattcctctg ctttgtgtgt ctctggattg      480

ccctatctaa gcatttcaca gagatggagc catgcgctcc gtggtctttt gtgtctggct      540

tcgctcactg agcatgctgt tctccaggtc catccacgtt gtagcgtagg tcagcccttc      600

attccctgtt atggccaaat gatactccat tgtacagaca ggccactttt tacccactca      660

tctgcttctg gacttttggg ttgcttctac catgtggcct gttgggaaca gtgctctgtt      720

tgtattcatg tacgggtttt tgtgtggaca cacattttca agtctcttgg gtacacaggt      780

gtagaagtgc cagagttggg gaaaaagctc acctttctag gctgtgaatg ggccctggca      840

agtctgtggc caggactcgt cttctcttcc acatggggcc cctagcttgg cacctagcac      900

gtggcaggca gcgacagatg ttaaaagcca ttcttgctat gggtagccag gctggggctc      960

catgcagccc tggccttcag cttggcagcc agggccccct tgtgcctgca gcagaagcca     1020

tgctgccagg agtgtaagtg tgagccagga atgctggaga atcgtggctc tgagaacagg     1080

gacaagaggc cacaagctca cgccttggct ttcctaagct taaggaataa acccaaaagg     1140

aggtacctgg aaggagctgg atttggggac tgaggagctg ggagctgatg gaagccgtga     1200

aaggggatgt gctcctgggg aggcgctggg gcgggtgggc cgtggagggg acagggcccg     1260

ttggttggaa actgaggcga ggctacggag ttgggcacta acaggtcatc cgtgcccctg     1320

cgaagcgtgg ggacacaggg acagcagaga tggcctgtct ggacactctg tcgacggggg     1380

gcctgtggtt ggtgaagccc aaggcaaggc tgtgaactca gggcaaggga gacgtgagca     1440

ggcgctgccg tgggctgatg tgggcactgc atgtgcaccc tggcggccaa aggacctaca     1500

gctcatgggg ggcaaggggg aggagggaag ccaacagcag gatgtgcgca gtcagtctgc     1560

cccccctaca ctggaggagg agccccccgg cacaaatctc gcccgtttgg gcccacggac     1620

atggctggcc tcgcaaggag gatccggttc caggcctcgg ccctaaatag tctccctggg     1680

ctttcaagag aaccacatga gaaaggagga ttcgggctct gagcagtttc accacccacc     1740

ccccagtctg caaatcctga cccgtgggtc cacctgcccc aaaggcggac gcaggacagt     1800

agaagggaac agagaacaca taaacacaga gagggccaca gcggctccca cagtcaccgc     1860

caccttcctg gcggggatgg gtggggcgtc tgagtttggt tcccagcaaa tccctctgag     1920

ccgcccttgc gggctcgcct caggagcagg ggagcaagag gtgggaggag gaggtctaag     1980

tcccaggccc aattaagaga tcaggtagtg tagggtttgg gagcttttaa ggtgaagagg     2040

cccgggctga tcccacaggc cagtataaag cgccgtgacc ctcaggtgat gcgccagggc     2100


<210>  200
<211>  136
<212>  DNA
<213>  Homo sapiens

<400>  200
cccatttgta ggagtgagtc agctgacccg cccccggggt tcctaatctc actaagaaag       60

actttgctga tgacagggtt tcctgggagt ccatgcgtgc ctggagcagc agcgtctcca      120

gggacaggca gccacc                                                      136


<210>  201
<211>  2215
<212>  PRT
<213>  Homo sapiens

<400>  201

Met Val Ile Leu Gln Gln Gly Asp His Val Trp Met Asp Leu Arg Leu 
1               5                   10                  15      


Gly Gln Glu Phe Asp Val Pro Ile Gly Ala Val Val Lys Leu Cys Asp 
            20                  25                  30          


Ser Gly Gln Val Gln Val Val Asp Asp Glu Asp Asn Glu His Trp Ile 
        35                  40                  45              


Ser Pro Gln Asn Ala Thr His Ile Lys Pro Met His Pro Thr Ser Val 
    50                  55                  60                  


His Gly Val Glu Asp Met Ile Arg Leu Gly Asp Leu Asn Glu Ala Gly 
65                  70                  75                  80  


Ile Leu Arg Asn Leu Leu Ile Arg Tyr Arg Asp His Leu Ile Tyr Thr 
                85                  90                  95      


Tyr Thr Gly Ser Ile Leu Val Ala Val Asn Pro Tyr Gln Leu Leu Ser 
            100                 105                 110         


Ile Tyr Ser Pro Glu His Ile Arg Gln Tyr Thr Asn Lys Lys Ile Gly 
        115                 120                 125             


Glu Met Pro Pro His Ile Phe Ala Ile Ala Asp Asn Cys Tyr Phe Asn 
    130                 135                 140                 


Met Lys Arg Asn Ser Arg Asp Gln Cys Cys Ile Ile Ser Gly Glu Ser 
145                 150                 155                 160 


Gly Ala Gly Lys Thr Glu Ser Thr Lys Leu Ile Leu Gln Phe Leu Ala 
                165                 170                 175     


Ala Ile Ser Gly Gln His Ser Trp Ile Glu Gln Gln Val Leu Glu Ala 
            180                 185                 190         


Thr Pro Ile Leu Glu Ala Phe Gly Asn Ala Lys Thr Ile Arg Asn Asp 
        195                 200                 205             


Asn Ser Ser Arg Phe Gly Lys Tyr Ile Asp Ile His Phe Asn Lys Arg 
    210                 215                 220                 


Gly Ala Ile Glu Gly Ala Lys Ile Glu Gln Tyr Leu Leu Glu Lys Ser 
225                 230                 235                 240 


Arg Val Cys Arg Gln Ala Leu Asp Glu Arg Asn Tyr His Val Phe Tyr 
                245                 250                 255     


Cys Met Leu Glu Gly Met Ser Glu Asp Gln Lys Lys Lys Leu Gly Leu 
            260                 265                 270         


Gly Gln Ala Ser Asp Tyr Asn Tyr Leu Ala Met Gly Asn Cys Ile Thr 
        275                 280                 285             


Cys Glu Gly Arg Val Asp Ser Gln Glu Tyr Ala Asn Ile Arg Ser Ala 
    290                 295                 300                 


Met Lys Val Leu Met Phe Thr Asp Thr Glu Asn Trp Glu Ile Ser Lys 
305                 310                 315                 320 


Leu Leu Ala Ala Ile Leu His Leu Gly Asn Leu Gln Tyr Glu Ala Arg 
                325                 330                 335     


Thr Phe Glu Asn Leu Asp Ala Cys Glu Val Leu Phe Ser Pro Ser Leu 
            340                 345                 350         


Ala Thr Ala Ala Ser Leu Leu Glu Val Asn Pro Pro Asp Leu Met Ser 
        355                 360                 365             


Cys Leu Thr Ser Arg Thr Leu Ile Thr Arg Gly Glu Thr Val Ser Thr 
    370                 375                 380                 


Pro Leu Ser Arg Glu Gln Ala Leu Asp Val Arg Asp Ala Phe Val Lys 
385                 390                 395                 400 


Gly Ile Tyr Gly Arg Leu Phe Val Trp Ile Val Asp Lys Ile Asn Ala 
                405                 410                 415     


Ala Ile Tyr Lys Pro Pro Ser Gln Asp Val Lys Asn Ser Arg Arg Ser 
            420                 425                 430         


Ile Gly Leu Leu Asp Ile Phe Gly Phe Glu Asn Phe Ala Val Asn Ser 
        435                 440                 445             


Phe Glu Gln Leu Cys Ile Asn Phe Ala Asn Glu His Leu Gln Gln Phe 
    450                 455                 460                 


Phe Val Arg His Val Phe Lys Leu Glu Gln Glu Glu Tyr Asp Leu Glu 
465                 470                 475                 480 


Ser Ile Asp Trp Leu His Ile Glu Phe Thr Asp Asn Gln Asp Ala Leu 
                485                 490                 495     


Asp Met Ile Ala Asn Lys Pro Met Asn Ile Ile Ser Leu Ile Asp Glu 
            500                 505                 510         


Glu Ser Lys Phe Pro Lys Gly Thr Asp Thr Thr Met Leu His Lys Leu 
        515                 520                 525             


Asn Ser Gln His Lys Leu Asn Ala Asn Tyr Ile Pro Pro Lys Asn Asn 
    530                 535                 540                 


His Glu Thr Gln Phe Gly Ile Asn His Phe Ala Gly Ile Val Tyr Tyr 
545                 550                 555                 560 


Glu Thr Gln Gly Phe Leu Glu Lys Asn Arg Asp Thr Leu His Gly Asp 
                565                 570                 575     


Ile Ile Gln Leu Val His Ser Ser Arg Asn Lys Phe Ile Lys Gln Ile 
            580                 585                 590         


Phe Gln Ala Asp Val Ala Met Gly Ala Glu Thr Arg Lys Arg Ser Pro 
        595                 600                 605             


Thr Leu Ser Ser Gln Phe Lys Arg Ser Leu Glu Leu Leu Met Arg Thr 
    610                 615                 620                 


Leu Gly Ala Cys Gln Pro Phe Phe Val Arg Cys Ile Lys Pro Asn Glu 
625                 630                 635                 640 


Phe Lys Lys Pro Met Leu Phe Asp Arg His Leu Cys Val Arg Gln Leu 
                645                 650                 655     


Arg Tyr Ser Gly Met Met Glu Thr Ile Arg Ile Arg Arg Ala Gly Tyr 
            660                 665                 670         


Pro Ile Arg Tyr Ser Phe Val Glu Phe Val Glu Arg Tyr Arg Val Leu 
        675                 680                 685             


Leu Pro Gly Val Lys Pro Ala Tyr Lys Gln Gly Asp Leu Arg Gly Thr 
    690                 695                 700                 


Cys Gln Arg Met Ala Glu Ala Val Leu Gly Thr His Asp Asp Trp Gln 
705                 710                 715                 720 


Ile Gly Lys Thr Lys Ile Phe Leu Lys Asp His His Asp Met Leu Leu 
                725                 730                 735     


Glu Val Glu Arg Asp Lys Ala Ile Thr Asp Arg Val Ile Leu Leu Gln 
            740                 745                 750         


Lys Val Ile Arg Gly Phe Lys Asp Arg Ser Asn Phe Leu Lys Leu Lys 
        755                 760                 765             


Asn Ala Ala Thr Leu Ile Gln Arg His Trp Arg Gly His Asn Cys Arg 
    770                 775                 780                 


Lys Asn Tyr Gly Leu Met Arg Leu Gly Phe Leu Arg Leu Gln Ala Leu 
785                 790                 795                 800 


His Arg Ser Arg Lys Leu His Gln Gln Tyr Arg Leu Ala Arg Gln Arg 
                805                 810                 815     


Ile Ile Gln Phe Gln Ala Arg Cys Arg Ala Tyr Leu Val Arg Lys Ala 
            820                 825                 830         


Phe Arg His Arg Leu Trp Ala Val Leu Thr Val Gln Ala Tyr Ala Arg 
        835                 840                 845             


Gly Met Ile Ala Arg Arg Leu His Gln Arg Leu Arg Ala Glu Tyr Leu 
    850                 855                 860                 


Trp Arg Leu Glu Ala Glu Lys Met Arg Leu Ala Glu Glu Glu Lys Leu 
865                 870                 875                 880 


Arg Lys Glu Met Ser Ala Lys Lys Ala Lys Glu Glu Ala Glu Arg Lys 
                885                 890                 895     


His Gln Glu Arg Leu Ala Gln Leu Ala Arg Glu Asp Ala Glu Arg Glu 
            900                 905                 910         


Leu Lys Glu Lys Glu Ala Ala Arg Arg Lys Lys Glu Leu Leu Glu Gln 
        915                 920                 925             


Met Glu Arg Ala Arg His Glu Pro Val Asn His Ser Asp Met Val Asp 
    930                 935                 940                 


Lys Met Phe Gly Phe Leu Gly Thr Ser Gly Gly Leu Pro Gly Gln Glu 
945                 950                 955                 960 


Gly Gln Ala Pro Ser Gly Phe Glu Asp Leu Glu Arg Gly Arg Arg Glu 
                965                 970                 975     


Met Val Glu Glu Asp Leu Asp Ala Ala Leu Pro Leu Pro Asp Glu Asp 
            980                 985                 990         


Glu Glu Asp Leu Ser Glu Tyr Lys  Phe Ala Lys Phe Ala  Ala Thr Tyr 
        995                 1000                 1005             


Phe Gln  Gly Thr Thr Thr His  Ser Tyr Thr Arg Arg  Pro Leu Lys 
    1010                 1015                 1020             


Gln Pro  Leu Leu Tyr His Asp  Asp Glu Gly Asp Gln  Leu Ala Ala 
    1025                 1030                 1035             


Leu Ala  Val Trp Ile Thr Ile  Leu Arg Phe Met Gly  Asp Leu Pro 
    1040                 1045                 1050             


Glu Pro  Lys Tyr His Thr Ala  Met Ser Asp Gly Ser  Glu Lys Ile 
    1055                 1060                 1065             


Pro Val  Met Thr Lys Ile Tyr  Glu Thr Leu Gly Lys  Lys Thr Tyr 
    1070                 1075                 1080             


Lys Arg  Glu Leu Gln Ala Leu  Gln Gly Glu Gly Glu  Ala Gln Leu 
    1085                 1090                 1095             


Pro Glu  Gly Gln Lys Lys Ser  Ser Val Arg His Lys  Leu Val His 
    1100                 1105                 1110             


Leu Thr  Leu Lys Lys Lys Ser  Lys Leu Thr Glu Glu  Val Thr Lys 
    1115                 1120                 1125             


Arg Leu  His Asp Gly Glu Ser  Thr Val Gln Gly Asn  Ser Met Leu 
    1130                 1135                 1140             


Glu Asp  Arg Pro Thr Ser Asn  Leu Glu Lys Leu His  Phe Ile Ile 
    1145                 1150                 1155             


Gly Asn  Gly Ile Leu Arg Pro  Ala Leu Arg Asp Glu  Ile Tyr Cys 
    1160                 1165                 1170             


Gln Ile  Ser Lys Gln Leu Thr  His Asn Pro Ser Lys  Ser Ser Tyr 
    1175                 1180                 1185             


Ala Arg  Gly Trp Ile Leu Val  Ser Leu Cys Val Gly  Cys Phe Ala 
    1190                 1195                 1200             


Pro Ser  Glu Lys Phe Val Lys  Tyr Leu Arg Asn Phe  Ile His Gly 
    1205                 1210                 1215             


Gly Pro  Pro Gly Tyr Ala Pro  Tyr Cys Glu Glu Arg  Leu Arg Arg 
    1220                 1225                 1230             


Thr Phe  Val Asn Gly Thr Arg  Thr Gln Pro Pro Ser  Trp Leu Glu 
    1235                 1240                 1245             


Leu Gln  Ala Thr Lys Ser Lys  Lys Pro Ile Met Leu  Pro Val Thr 
    1250                 1255                 1260             


Phe Met  Asp Gly Thr Thr Lys  Thr Leu Leu Thr Asp  Ser Ala Thr 
    1265                 1270                 1275             


Thr Ala  Lys Glu Leu Cys Asn  Ala Leu Ala Asp Lys  Ile Ser Leu 
    1280                 1285                 1290             


Lys Asp  Arg Phe Gly Phe Ser  Leu Tyr Ile Ala Leu  Phe Asp Lys 
    1295                 1300                 1305             


Val Ser  Ser Leu Gly Ser Gly  Ser Asp His Val Met  Asp Ala Ile 
    1310                 1315                 1320             


Ser Gln  Cys Glu Gln Tyr Ala  Lys Glu Gln Gly Ala  Gln Glu Arg 
    1325                 1330                 1335             


Asn Ala  Pro Trp Arg Leu Phe  Phe Arg Lys Glu Val  Phe Thr Pro 
    1340                 1345                 1350             


Trp His  Ser Pro Ser Glu Asp  Asn Val Ala Thr Asn  Leu Ile Tyr 
    1355                 1360                 1365             


Gln Gln  Val Val Arg Gly Val  Lys Phe Gly Glu Tyr  Arg Cys Glu 
    1370                 1375                 1380             


Lys Glu  Asp Asp Leu Ala Glu  Leu Ala Ser Gln Gln  Tyr Phe Val 
    1385                 1390                 1395             


Asp Tyr  Gly Ser Glu Met Ile  Leu Glu Arg Leu Leu  Asn Leu Val 
    1400                 1405                 1410             


Pro Thr  Tyr Ile Pro Asp Arg  Glu Ile Thr Pro Leu  Lys Thr Leu 
    1415                 1420                 1425             


Glu Lys  Trp Ala Gln Leu Ala  Ile Ala Ala His Lys  Lys Gly Ile 
    1430                 1435                 1440             


Tyr Ala  Gln Arg Arg Thr Asp  Ala Gln Lys Val Lys  Glu Asp Val 
    1445                 1450                 1455             


Val Ser  Tyr Ala Arg Phe Lys  Trp Pro Leu Leu Phe  Ser Arg Phe 
    1460                 1465                 1470             


Tyr Glu  Ala Tyr Lys Phe Ser  Gly Pro Ser Leu Pro  Lys Asn Asp 
    1475                 1480                 1485             


Val Ile  Val Ala Val Asn Trp  Thr Gly Val Tyr Phe  Val Asp Glu 
    1490                 1495                 1500             


Gln Glu  Gln Val Leu Leu Glu  Leu Ser Phe Pro Glu  Ile Met Ala 
    1505                 1510                 1515             


Val Ser  Ser Ser Arg Glu Cys  Arg Val Trp Leu Ser  Leu Gly Cys 
    1520                 1525                 1530             


Ser Asp  Leu Gly Cys Ala Ala  Pro His Ser Gly Trp  Ala Gly Leu 
    1535                 1540                 1545             


Thr Pro  Ala Gly Pro Cys Ser  Pro Cys Trp Ser Cys  Arg Gly Ala 
    1550                 1555                 1560             


Lys Thr  Thr Ala Pro Ser Phe  Thr Leu Ala Thr Ile  Lys Gly Asp 
    1565                 1570                 1575             


Glu Tyr  Thr Phe Thr Ser Ser  Asn Ala Glu Asp Ile  Arg Asp Leu 
    1580                 1585                 1590             


Val Val  Thr Phe Leu Glu Gly  Leu Arg Lys Arg Ser  Lys Tyr Val 
    1595                 1600                 1605             


Val Ala  Leu Gln Asp Asn Pro  Asn Pro Ala Gly Glu  Glu Ser Gly 
    1610                 1615                 1620             


Phe Leu  Ser Phe Ala Lys Gly  Asp Leu Ile Ile Leu  Asp His Asp 
    1625                 1630                 1635             


Thr Gly  Glu Gln Val Met Asn  Ser Gly Trp Ala Asn  Gly Ile Asn 
    1640                 1645                 1650             


Glu Arg  Thr Lys Gln Arg Gly  Asp Phe Pro Thr Asp  Ser Val Tyr 
    1655                 1660                 1665             


Val Met  Pro Thr Val Thr Met  Pro Pro Arg Glu Ile  Val Ala Leu 
    1670                 1675                 1680             


Val Thr  Met Thr Pro Asp Gln  Arg Gln Asp Val Val  Arg Leu Leu 
    1685                 1690                 1695             


Gln Leu  Arg Thr Ala Glu Pro  Glu Val Arg Ala Lys  Pro Tyr Thr 
    1700                 1705                 1710             


Leu Glu  Glu Phe Ser Tyr Asp  Tyr Phe Arg Pro Pro  Pro Lys His 
    1715                 1720                 1725             


Thr Leu  Ser Arg Val Met Val  Ser Lys Ala Arg Gly  Lys Asp Arg 
    1730                 1735                 1740             


Leu Trp  Ser His Thr Arg Glu  Pro Leu Lys Gln Ala  Leu Leu Lys 
    1745                 1750                 1755             


Lys Leu  Leu Gly Ser Glu Glu  Leu Ser Gln Glu Ala  Cys Leu Ala 
    1760                 1765                 1770             


Phe Ile  Ala Val Leu Lys Tyr  Met Gly Asp Tyr Pro  Ser Lys Arg 
    1775                 1780                 1785             


Thr Arg  Ser Val Asn Glu Leu  Thr Asp Gln Ile Phe  Glu Gly Pro 
    1790                 1795                 1800             


Leu Lys  Ala Glu Pro Leu Lys  Asp Glu Ala Tyr Val  Gln Ile Leu 
    1805                 1810                 1815             


Lys Gln  Leu Thr Asp Asn His  Ile Arg Tyr Ser Glu  Glu Arg Gly 
    1820                 1825                 1830             


Trp Glu  Leu Leu Trp Leu Cys  Thr Gly Leu Phe Pro  Pro Ser Asn 
    1835                 1840                 1845             


Ile Leu  Leu Pro His Val Gln  Arg Phe Leu Gln Ser  Arg Lys His 
    1850                 1855                 1860             


Cys Pro  Leu Ala Ile Asp Cys  Leu Gln Arg Leu Gln  Lys Ala Leu 
    1865                 1870                 1875             


Arg Asn  Gly Ser Arg Lys Tyr  Pro Pro His Leu Val  Glu Val Glu 
    1880                 1885                 1890             


Ala Ile  Gln His Lys Thr Thr  Gln Ile Phe His Lys  Val Tyr Phe 
    1895                 1900                 1905             


Pro Asp  Asp Thr Asp Glu Ala  Phe Glu Val Glu Ser  Ser Thr Lys 
    1910                 1915                 1920             


Ala Lys  Asp Phe Cys Gln Asn  Ile Ala Thr Arg Leu  Leu Leu Lys 
    1925                 1930                 1935             


Ser Ser  Glu Gly Phe Ser Leu  Phe Val Lys Ile Ala  Asp Lys Val 
    1940                 1945                 1950             


Leu Ser  Val Pro Glu Asn Asp  Phe Phe Phe Asp Phe  Val Arg His 
    1955                 1960                 1965             


Leu Thr  Asp Trp Ile Lys Lys  Ala Arg Pro Ile Lys  Asp Gly Ile 
    1970                 1975                 1980             


Val Pro  Ser Leu Thr Tyr Gln  Val Phe Phe Met Lys  Lys Leu Trp 
    1985                 1990                 1995             


Thr Thr  Thr Val Pro Gly Lys  Asp Pro Met Ala Asp  Ser Ile Phe 
    2000                 2005                 2010             


His Tyr  Tyr Gln Glu Leu Pro  Lys Tyr Leu Arg Gly  Tyr His Lys 
    2015                 2020                 2025             


Cys Thr  Arg Glu Glu Val Leu  Gln Leu Gly Ala Leu  Ile Tyr Arg 
    2030                 2035                 2040             


Val Lys  Phe Glu Glu Asp Lys  Ser Tyr Phe Pro Ser  Ile Pro Lys 
    2045                 2050                 2055             


Leu Leu  Arg Glu Leu Val Pro  Gln Asp Leu Ile Arg  Gln Val Ser 
    2060                 2065                 2070             


Pro Asp  Asp Trp Lys Arg Ser  Ile Val Ala Tyr Phe  Asn Lys His 
    2075                 2080                 2085             


Ala Gly  Lys Ser Lys Glu Glu  Ala Lys Leu Ala Phe  Leu Lys Leu 
    2090                 2095                 2100             


Ile Phe  Lys Trp Pro Thr Phe  Gly Ser Ala Phe Phe  Glu Val Lys 
    2105                 2110                 2115             


Gln Thr  Thr Glu Pro Asn Phe  Pro Glu Ile Leu Leu  Ile Ala Ile 
    2120                 2125                 2130             


Asn Lys  Tyr Gly Val Ser Leu  Ile Asp Pro Lys Thr  Lys Asp Ile 
    2135                 2140                 2145             


Leu Thr  Thr His Pro Phe Thr  Lys Ile Ser Asn Trp  Ser Ser Gly 
    2150                 2155                 2160             


Asn Thr  Tyr Phe His Ile Thr  Ile Gly Asn Leu Val  Arg Gly Ser 
    2165                 2170                 2175             


Lys Leu  Leu Cys Glu Thr Ser  Leu Gly Tyr Lys Met  Asp Asp Leu 
    2180                 2185                 2190             


Leu Thr  Ser Tyr Ile Ser Gln  Met Leu Thr Ala Met  Ser Lys Gln 
    2195                 2200                 2205             


Arg Gly  Ser Arg Ser Gly Lys  
    2210                 2215 


<210>  202
<211>  6648
<212>  DNA
<213>  Homo sapiens

<400>  202
atggtgattc ttcagcaggg ggaccatgtg tggatggacc tgagattggg gcaggagttc       60

gacgtgccca tcggggcggt ggtgaagctc tgcgactctg ggcaggtcca ggtggtggat      120

gatgaagaca atgaacactg gatctctccg cagaacgcaa cgcacatcaa gcctatgcac      180

cccacgtcgg tccacggcgt ggaggacatg atccgcctgg gggacctcaa cgaggcgggc      240

atcttgcgca acctgcttat ccgctaccgg gaccacctca tctacacgta tacgggctcc      300

atcctggtgg ctgtgaaccc ctaccagctg ctctccatct actcgccaga gcacatccgc      360

cagtatacca acaagaagat tggggagatg cccccccaca tctttgccat tgctgacaac      420

tgctacttca acatgaaacg caacagccga gaccagtgct gcatcatcag tggggaatct      480

ggggccggga agacggagag cacaaagctg atcctgcagt tcctggcagc catcagtggg      540

cagcactcgt ggattgagca gcaggtcttg gaggccaccc ccattctgga agcatttggg      600

aatgccaaga ccatccgcaa tgacaactca agccgtttcg gaaagtacat cgacatccac      660

ttcaacaagc ggggcgccat cgagggcgcg aagattgagc agtacctgct ggaaaagtca      720

cgtgtctgtc gccaggccct ggatgaaagg aactaccacg tgttctactg catgctggag      780

ggtatgagtg aggatcagaa gaagaagctg ggcttgggcc aggcctctga ctacaactac      840

ttggccatgg gtaactgcat aacctgtgag ggccgggtgg acagccagga gtacgccaac      900

atccgctccg ccatgaaggt gctcatgttc actgacaccg agaactggga gatctcgaag      960

ctcctggctg ccatcctgca cctgggcaac ctgcagtatg aggcacgcac atttgaaaac     1020

ctggatgcct gtgaggttct cttctcccca tcgctggcca cagctgcatc cctgcttgag     1080

gtgaaccccc cagacctgat gagctgcctg actagccgca ccctcatcac ccgcggggag     1140

acggtgtcca ccccactgag cagggaacag gcactggacg tgcgcgacgc cttcgtaaag     1200

gggatctacg ggcggctgtt cgtgtggatt gtggacaaga tcaacgcagc aatttacaag     1260

cctccctccc aggatgtgaa gaactctcgc aggtccatcg gcctcctgga catctttggg     1320

tttgagaact ttgctgtgaa cagctttgag cagctctgca tcaacttcgc caatgagcac     1380

ctgcagcagt tctttgtgcg gcacgtgttc aagctggagc aggaggaata tgacctggag     1440

agcattgact ggctgcacat cgagttcact gacaaccagg atgccctgga catgattgcc     1500

aacaagccca tgaacatcat ctccctcatc gatgaggaga gcaagttccc caagggcaca     1560

gacaccacca tgttacacaa gctgaactcc cagcacaagc tcaacgccaa ctacatcccc     1620

cccaagaaca accatgagac ccagtttggc atcaaccatt ttgcaggcat cgtctactat     1680

gagacccaag gcttcctgga gaagaaccga gacaccctgc atggggacat tatccagctg     1740

gtccactcct ccaggaacaa gttcatcaag cagatcttcc aggccgatgt cgccatgggc     1800

gccgagacca ggaagcgctc gcccacactt agcagccagt tcaagcggtc actggagctg     1860

ctgatgcgca cgctgggtgc ctgccagccc ttctttgtgc gatgcatcaa gcccaatgag     1920

ttcaagaagc ccatgctgtt cgaccggcac ctgtgcgtgc gccagctgcg gtactcagga     1980

atgatggaga ccatccgaat ccgccgagct ggctacccca tccgctacag cttcgtagag     2040

tttgtggagc ggtaccgtgt gctgctgcca ggtgtgaagc cggcctacaa gcagggcgac     2100

ctccgcggga cttgccagcg catggctgag gctgtgctgg gcacccacga tgactggcag     2160

ataggcaaaa ccaagatctt tctgaaggac caccatgaca tgctgctgga agtggagcgg     2220

gacaaagcca tcaccgacag agtcatcctc cttcagaaag tcatccgggg attcaaagac     2280

aggtctaact ttctgaagct gaagaacgct gccacactga tccagaggca ctggcggggt     2340

cacaactgta ggaagaacta cgggctgatg cgtctgggct tcctgcggct gcaggccctg     2400

caccgctccc ggaagctgca ccagcagtac cgcctggccc gccagcgcat catccagttc     2460

caggcccgct gccgcgccta tctggtgcgc aaggccttcc gccaccgcct ctgggctgtg     2520

ctcaccgtgc aggcctatgc ccggggcatg atcgcccgca ggctgcacca acgcctcagg     2580

gctgagtatc tgtggcgcct cgaggctgag aaaatgcggc tggcggagga agagaagctt     2640

cggaaggaga tgagcgccaa gaaggccaag gaggaggccg agcgcaagca tcaggagcgc     2700

ctggcccagc tggctcgtga ggacgctgag cgggagctga aggagaagga ggccgctcgg     2760

cggaagaagg agctcctgga gcagatggaa agggcccgcc atgagcctgt caatcactca     2820

gacatggtgg acaagatgtt tggcttcctg gggacttcag gtggcctgcc aggccaggag     2880

ggccaggcac ctagtggctt tgaggacctg gagcgagggc ggagggagat ggtggaggag     2940

gacctggatg cagccctgcc cctgcctgac gaggatgagg aggacctctc tgagtataaa     3000

tttgccaagt tcgcggccac ctacttccag gggacaacca cgcactccta cacccggcgg     3060

ccactcaaac agccactgct ctaccatgac gacgagggtg accagctggc agccctggcg     3120

gtctggatca ccatcctccg cttcatgggg gacctccctg agcccaagta ccacacagcc     3180

atgagtgatg gcagtgagaa gatccctgtg atgaccaaga tttatgagac cctgggcaag     3240

aagacgtaca agagggagct gcaggccctg cagggcgagg gcgaggccca gctccccgag     3300

ggccagaaga agagcagtgt gaggcacaag ctggtgcatt tgactctgaa aaagaagtcc     3360

aagctcacag aggaggtgac caagaggctg catgacgggg agtccacagt gcagggcaac     3420

agcatgctgg aggaccggcc cacctccaac ctggagaagc tgcacttcat catcggcaat     3480

ggcatcctgc ggccagcact ccgggacgag atctactgcc agatcagcaa gcagctgacc     3540

cacaacccct ccaagagcag ctatgcccgg ggctggattc tcgtgtctct ctgcgtgggc     3600

tgtttcgccc cctccgagaa gtttgtcaag tacctgcgga acttcatcca cgggggcccg     3660

cccggctacg ccccgtactg tgaggagcgc ctgagaagga cctttgtcaa tgggacacgg     3720

acacagccgc ccagctggct ggagctgcag gccaccaagt ccaagaagcc aatcatgttg     3780

cccgtgacat tcatggatgg gaccaccaag accctgctga cggactcggc aaccacggcc     3840

aaggagctct gcaacgcgct ggccgacaag atctctctca aggaccggtt cgggttctcc     3900

ctctacattg ccctgtttga caaggtgtcc tccctgggca gcggcagtga ccacgtcatg     3960

gacgccatct cccagtgcga gcagtacgcc aaggagcagg gcgcccagga gcgcaacgcc     4020

ccctggaggc tcttcttccg caaagaggtc ttcacgccct ggcacagccc ctccgaggac     4080

aacgtggcca ccaacctcat ctaccagcag gtggtgcgag gagtcaagtt tggggagtac     4140

aggtgtgaga aggaggacga cctggctgag ctggcctccc agcagtactt tgtagactat     4200

ggctctgaga tgatcctgga gcgcctcctg aacctcgtgc ccacctacat ccccgaccgc     4260

gagatcacgc ccctgaagac gctggagaag tgggcccagc tggccatcgc cgcccacaag     4320

aaggggattt atgcccagag gagaactgat gcccagaagg tcaaagagga tgtggtcagt     4380

tatgcccgct tcaagtggcc cttgctcttc tccaggtttt atgaagccta caaattctca     4440

ggccccagtc tccccaagaa cgacgtcatc gtggccgtca actggacggg tgtgtacttt     4500

gtggatgagc aggagcaggt acttctggag ctgtccttcc cagagatcat ggccgtgtcc     4560

agcagcaggg agtgccgtgt ctggctctca ctgggctgct ctgatcttgg ctgtgctgcg     4620

cctcactcag gctgggcagg actgaccccg gcggggccct gttctccgtg ttggtcctgc     4680

aggggagcga aaacgacggc ccccagcttc acgctggcca ccatcaaggg ggacgaatac     4740

accttcacct ccagcaatgc tgaggacatt cgtgacctgg tggtcacctt cctagagggg     4800

ctccggaaga gatctaagta tgttgtggcc ctgcaggata accccaaccc cgcaggcgag     4860

gagtcaggct tcctcagctt tgccaaggga gacctcatca tcctggacca tgacacgggc     4920

gagcaggtca tgaactcggg ctgggccaac ggcatcaatg agaggaccaa gcagcgtggg     4980

gacttcccca ccgacagtgt gtacgtcatg cccactgtca ccatgccacc gcgggagatt     5040

gtggccctgg tcaccatgac tcccgatcag aggcaggacg ttgtccggct cttgcagctg     5100

cgaacggcgg agcccgaggt gcgtgccaag ccctacacgc tggaggagtt ttcctatgac     5160

tacttcaggc ccccacccaa gcacacgctg agccgtgtca tggtgtccaa ggcccgaggc     5220

aaggaccggc tgtggagcca cacgcgggaa ccgctcaagc aggcgctgct caagaagctc     5280

ctgggcagtg aggagctctc gcaggaggcc tgcctggcct tcattgctgt gctcaagtac     5340

atgggcgact acccgtccaa gaggacacgc tccgtcaacg agctcaccga ccagatcttt     5400

gagggtcccc tgaaagccga gcccctgaag gacgaggcat atgtgcagat cctgaagcag     5460

ctgaccgaca accacatcag gtacagcgag gagcggggtt gggagctgct ctggctgtgc     5520

acgggccttt tcccacccag caacatcctc ctgccccacg tgcagcgctt cctgcagtcc     5580

cgaaagcact gcccactcgc catcgactgc ctgcaacggc tccagaaagc cctgagaaac     5640

gggtcccgga agtaccctcc gcacctggtg gaggtggagg ccatccagca caagaccacc     5700

cagattttcc acaaagtcta cttccctgat gacactgacg aggccttcga agtggagtcc     5760

agcaccaagg ccaaggactt ctgccagaac atcgccacca ggctgctcct caagtcctca     5820

gagggattca gcctctttgt caaaattgca gacaaggtcc tcagcgttcc tgagaatgac     5880

ttcttctttg actttgttcg acacttgaca gactggataa agaaagctcg gcccatcaag     5940

gacggaattg tgccctcact cacctaccag gtgttcttca tgaagaagct gtggaccacc     6000

acggtgccag ggaaggatcc catggccgat tccatcttcc actattacca ggagttgccc     6060

aagtatctcc gaggctacca caagtgcacg cgggaggagg tgctgcagct gggggcgctg     6120

atctacaggg tcaagttcga ggaggacaag tcctacttcc ccagcatccc caagctgctg     6180

cgggagctgg tgccccagga ccttatccgg caggtctcac ctgatgactg gaagcggtcc     6240

atcgtcgcct acttcaacaa gcacgcaggg aagtccaagg aggaggccaa gctggccttc     6300

ctgaagctca tcttcaagtg gcccaccttt ggctcagcct tcttcgaggt gaagcaaact     6360

acggagccaa acttccctga gatcctccta attgccatca acaagtatgg ggtcagcctc     6420

atcgatccca aaacgaagga tatcctcacc actcatccct tcaccaagat ctccaactgg     6480

agcagcggca acacctactt ccacatcacc attgggaact tggtgcgcgg gagcaaactg     6540

ctctgcgaga cgtcactggg ctacaagatg gatgacctcc tgacttccta cattagccag     6600

atgctcacag ccatgagcaa acagcggggc tccaggagcg gcaagtga                  6648


<210>  203
<211>  2479
<212>  PRT
<213>  Homo sapiens

<400>  203

Met Pro Pro Asn Ile Asn Trp Lys Glu Ile Met Lys Val Asp Pro Asp 
1               5                   10                  15      


Asp Leu Pro Arg Gln Glu Glu Leu Ala Asp Asn Leu Leu Ile Ser Leu 
            20                  25                  30          


Ser Lys Val Glu Val Asn Glu Leu Lys Ser Glu Lys Gln Glu Asn Val 
        35                  40                  45              


Ile His Leu Phe Arg Ile Thr Gln Ser Leu Met Lys Met Lys Ala Gln 
    50                  55                  60                  


Glu Val Glu Leu Ala Leu Glu Glu Val Glu Lys Ala Gly Glu Glu Gln 
65                  70                  75                  80  


Ala Lys Phe Glu Asn Gln Leu Lys Thr Lys Val Met Lys Leu Glu Asn 
                85                  90                  95      


Glu Leu Glu Met Ala Gln Gln Ser Ala Gly Gly Arg Asp Thr Arg Phe 
            100                 105                 110         


Leu Arg Asn Glu Ile Cys Gln Leu Glu Lys Gln Leu Glu Gln Lys Asp 
        115                 120                 125             


Arg Glu Leu Glu Asp Met Glu Lys Glu Leu Glu Lys Glu Lys Lys Val 
    130                 135                 140                 


Asn Glu Gln Leu Ala Leu Arg Asn Glu Glu Ala Glu Asn Glu Asn Ser 
145                 150                 155                 160 


Lys Leu Arg Arg Glu Asn Lys Arg Leu Lys Lys Lys Asn Glu Gln Leu 
                165                 170                 175     


Cys Gln Asp Ile Ile Asp Tyr Gln Lys Gln Ile Asp Ser Gln Lys Glu 
            180                 185                 190         


Thr Leu Leu Ser Arg Arg Gly Glu Asp Ser Asp Tyr Arg Ser Gln Leu 
        195                 200                 205             


Ser Lys Lys Asn Tyr Glu Leu Ile Gln Tyr Leu Asp Glu Ile Gln Thr 
    210                 215                 220                 


Leu Thr Glu Ala Asn Glu Lys Ile Glu Val Gln Asn Gln Glu Met Arg 
225                 230                 235                 240 


Lys Asn Leu Glu Glu Ser Val Gln Glu Met Glu Lys Met Thr Asp Glu 
                245                 250                 255     


Tyr Asn Arg Met Lys Ala Ile Val His Gln Thr Asp Asn Val Ile Asp 
            260                 265                 270         


Gln Leu Lys Lys Glu Asn Asp His Tyr Gln Leu Gln Val Gln Glu Leu 
        275                 280                 285             


Thr Asp Leu Leu Lys Ser Lys Asn Glu Glu Asp Asp Pro Ile Met Val 
    290                 295                 300                 


Ala Val Asn Ala Lys Val Glu Glu Trp Lys Leu Ile Leu Ser Ser Lys 
305                 310                 315                 320 


Asp Asp Glu Ile Ile Glu Tyr Gln Gln Met Leu His Asn Leu Arg Glu 
                325                 330                 335     


Lys Leu Lys Asn Ala Gln Leu Asp Ala Asp Lys Ser Asn Val Met Ala 
            340                 345                 350         


Leu Gln Gln Gly Ile Gln Glu Arg Asp Ser Gln Ile Lys Met Leu Thr 
        355                 360                 365             


Glu Gln Val Glu Gln Tyr Thr Lys Glu Met Glu Lys Asn Thr Cys Ile 
    370                 375                 380                 


Ile Glu Asp Leu Lys Asn Glu Leu Gln Arg Asn Lys Gly Ala Ser Thr 
385                 390                 395                 400 


Leu Ser Gln Gln Thr His Met Lys Ile Gln Ser Thr Leu Asp Ile Leu 
                405                 410                 415     


Lys Glu Lys Thr Lys Glu Ala Glu Arg Thr Ala Glu Leu Ala Glu Ala 
            420                 425                 430         


Asp Ala Arg Glu Lys Asp Lys Glu Leu Val Glu Ala Leu Lys Arg Leu 
        435                 440                 445             


Lys Asp Tyr Glu Ser Gly Val Tyr Gly Leu Glu Asp Ala Val Val Glu 
    450                 455                 460                 


Ile Lys Asn Cys Lys Asn Gln Ile Lys Ile Arg Asp Arg Glu Ile Glu 
465                 470                 475                 480 


Ile Leu Thr Lys Glu Ile Asn Lys Leu Glu Leu Lys Ile Ser Asp Phe 
                485                 490                 495     


Leu Asp Glu Asn Glu Ala Leu Arg Glu Arg Val Gly Leu Glu Pro Lys 
            500                 505                 510         


Thr Met Ile Asp Leu Thr Glu Phe Arg Asn Ser Lys His Leu Lys Gln 
        515                 520                 525             


Gln Gln Tyr Arg Ala Glu Asn Gln Ile Leu Leu Lys Glu Ile Glu Ser 
    530                 535                 540                 


Leu Glu Glu Glu Arg Leu Asp Leu Lys Lys Lys Ile Arg Gln Met Ala 
545                 550                 555                 560 


Gln Glu Arg Gly Lys Arg Ser Ala Thr Ser Gly Leu Thr Thr Glu Asp 
                565                 570                 575     


Leu Asn Leu Thr Glu Asn Ile Ser Gln Gly Asp Arg Ile Ser Glu Arg 
            580                 585                 590         


Lys Leu Asp Leu Leu Ser Leu Lys Asn Met Ser Glu Ala Gln Ser Lys 
        595                 600                 605             


Asn Glu Phe Leu Ser Arg Glu Leu Ile Glu Lys Glu Arg Asp Leu Glu 
    610                 615                 620                 


Arg Ser Arg Thr Val Ile Ala Lys Phe Gln Asn Lys Leu Lys Glu Leu 
625                 630                 635                 640 


Val Glu Glu Asn Lys Gln Leu Glu Glu Gly Met Lys Glu Ile Leu Gln 
                645                 650                 655     


Ala Ile Lys Glu Met Gln Lys Asp Pro Asp Val Lys Gly Gly Glu Thr 
            660                 665                 670         


Ser Leu Ile Ile Pro Ser Leu Glu Arg Leu Val Asn Ala Ile Glu Ser 
        675                 680                 685             


Lys Asn Ala Glu Gly Ile Phe Asp Ala Ser Leu His Leu Lys Ala Gln 
    690                 695                 700                 


Val Asp Gln Leu Thr Gly Arg Asn Glu Glu Leu Arg Gln Glu Leu Arg 
705                 710                 715                 720 


Glu Ser Arg Lys Glu Ala Ile Asn Tyr Ser Gln Gln Leu Ala Lys Ala 
                725                 730                 735     


Asn Leu Lys Ile Asp His Leu Glu Lys Glu Thr Ser Leu Leu Arg Gln 
            740                 745                 750         


Ser Glu Gly Ser Asn Val Val Phe Lys Gly Ile Asp Leu Pro Asp Gly 
        755                 760                 765             


Ile Ala Pro Ser Ser Ala Ser Ile Ile Asn Ser Gln Asn Glu Tyr Leu 
    770                 775                 780                 


Ile His Leu Leu Gln Glu Leu Glu Asn Lys Glu Lys Lys Leu Lys Asn 
785                 790                 795                 800 


Leu Glu Asp Ser Leu Glu Asp Tyr Asn Arg Lys Phe Ala Val Ile Arg 
                805                 810                 815     


His Gln Gln Ser Leu Leu Tyr Lys Glu Tyr Leu Ser Glu Lys Glu Thr 
            820                 825                 830         


Trp Lys Thr Glu Ser Lys Thr Ile Lys Glu Glu Lys Arg Lys Leu Glu 
        835                 840                 845             


Asp Gln Val Gln Gln Asp Ala Ile Lys Val Lys Glu Tyr Asn Asn Leu 
    850                 855                 860                 


Leu Asn Ala Leu Gln Met Asp Ser Asp Glu Met Lys Lys Ile Leu Ala 
865                 870                 875                 880 


Glu Asn Ser Arg Lys Ile Thr Val Leu Gln Val Asn Glu Lys Ser Leu 
                885                 890                 895     


Ile Arg Gln Tyr Thr Thr Leu Val Glu Leu Glu Arg Gln Leu Arg Lys 
            900                 905                 910         


Glu Asn Glu Lys Gln Lys Asn Glu Leu Leu Ser Met Glu Ala Glu Val 
        915                 920                 925             


Cys Glu Lys Ile Gly Cys Leu Gln Arg Phe Lys Glu Met Ala Ile Phe 
    930                 935                 940                 


Lys Ile Ala Ala Leu Gln Lys Val Val Asp Asn Ser Val Ser Leu Ser 
945                 950                 955                 960 


Glu Leu Glu Leu Ala Asn Lys Gln Tyr Asn Glu Leu Thr Ala Lys Tyr 
                965                 970                 975     


Arg Asp Ile Leu Gln Lys Asp Asn Met Leu Val Gln Arg Thr Ser Asn 
            980                 985                 990         


Leu Glu His Leu Glu Cys Glu Asn  Ile Ser Leu Lys Glu  Gln Val Glu 
        995                 1000                 1005             


Ser Ile  Asn Lys Glu Leu Glu  Ile Thr Lys Glu Lys  Leu His Thr 
    1010                 1015                 1020             


Ile Glu  Gln Ala Trp Glu Gln  Glu Thr Lys Leu Gly  Asn Glu Ser 
    1025                 1030                 1035             


Ser Met  Asp Lys Ala Lys Lys  Ser Ile Thr Asn Ser  Asp Ile Val 
    1040                 1045                 1050             


Ser Ile  Ser Lys Lys Ile Thr  Met Leu Glu Met Lys  Glu Leu Asn 
    1055                 1060                 1065             


Glu Arg  Gln Arg Ala Glu His  Cys Gln Lys Met Tyr  Glu His Leu 
    1070                 1075                 1080             


Arg Thr  Ser Leu Lys Gln Met  Glu Glu Arg Asn Phe  Glu Leu Glu 
    1085                 1090                 1095             


Thr Lys  Phe Ala Glu Leu Thr  Lys Ile Asn Leu Asp  Ala Gln Lys 
    1100                 1105                 1110             


Val Glu  Gln Met Leu Arg Asp  Glu Leu Ala Asp Ser  Val Ser Lys 
    1115                 1120                 1125             


Ala Val  Ser Asp Ala Asp Arg  Gln Arg Ile Leu Glu  Leu Glu Lys 
    1130                 1135                 1140             


Asn Glu  Met Glu Leu Lys Val  Glu Val Ser Lys Leu  Arg Glu Ile 
    1145                 1150                 1155             


Ser Asp  Ile Ala Arg Arg Gln  Val Glu Ile Leu Asn  Ala Gln Gln 
    1160                 1165                 1170             


Gln Ser  Arg Asp Lys Glu Val  Glu Ser Leu Arg Met  Gln Leu Leu 
    1175                 1180                 1185             


Asp Tyr  Gln Ala Gln Ser Asp  Glu Lys Ser Leu Ile  Ala Lys Leu 
    1190                 1195                 1200             


His Gln  His Asn Val Ser Leu  Gln Leu Ser Glu Ala  Thr Ala Leu 
    1205                 1210                 1215             


Gly Lys  Leu Glu Ser Ile Thr  Ser Lys Leu Gln Lys  Met Glu Ala 
    1220                 1225                 1230             


Tyr Asn  Leu Arg Leu Glu Gln  Lys Leu Asp Glu Lys  Glu Gln Ala 
    1235                 1240                 1245             


Leu Tyr  Tyr Ala Arg Leu Glu  Gly Arg Asn Arg Ala  Lys His Leu 
    1250                 1255                 1260             


Arg Gln  Thr Ile Gln Ser Leu  Arg Arg Gln Phe Ser  Gly Ala Leu 
    1265                 1270                 1275             


Pro Leu  Ala Gln Gln Glu Lys  Phe Ser Lys Thr Met  Ile Gln Leu 
    1280                 1285                 1290             


Gln Asn  Asp Lys Leu Lys Ile  Met Gln Glu Met Lys  Asn Ser Gln 
    1295                 1300                 1305             


Gln Glu  His Arg Asn Met Glu  Asn Lys Thr Leu Glu  Met Glu Leu 
    1310                 1315                 1320             


Lys Leu  Lys Gly Leu Glu Glu  Leu Ile Ser Thr Leu  Lys Asp Thr 
    1325                 1330                 1335             


Lys Gly  Ala Gln Lys Val Ile  Asn Trp His Met Lys  Ile Glu Glu 
    1340                 1345                 1350             


Leu Arg  Leu Gln Glu Leu Lys  Leu Asn Arg Glu Leu  Val Lys Asp 
    1355                 1360                 1365             


Lys Glu  Glu Ile Lys Tyr Leu  Asn Asn Ile Ile Ser  Glu Tyr Glu 
    1370                 1375                 1380             


Arg Thr  Ile Ser Ser Leu Glu  Glu Glu Ile Val Gln  Gln Asn Lys 
    1385                 1390                 1395             


Phe His  Glu Glu Arg Gln Met  Ala Trp Asp Gln Arg  Glu Val Asp 
    1400                 1405                 1410             


Leu Glu  Arg Gln Leu Asp Ile  Phe Asp Arg Gln Gln  Asn Glu Ile 
    1415                 1420                 1425             


Leu Asn  Ala Ala Gln Lys Phe  Glu Glu Ala Thr Gly  Ser Ile Pro 
    1430                 1435                 1440             


Asp Pro  Ser Leu Pro Leu Pro  Asn Gln Leu Glu Ile  Ala Leu Arg 
    1445                 1450                 1455             


Lys Ile  Lys Glu Asn Ile Arg  Ile Ile Leu Glu Thr  Arg Ala Thr 
    1460                 1465                 1470             


Cys Lys  Ser Leu Glu Glu Lys  Leu Lys Glu Lys Glu  Ser Ala Leu 
    1475                 1480                 1485             


Arg Leu  Ala Glu Gln Asn Ile  Leu Ser Arg Asp Lys  Val Ile Asn 
    1490                 1495                 1500             


Glu Leu  Arg Leu Arg Leu Pro  Ala Thr Ala Glu Arg  Glu Lys Leu 
    1505                 1510                 1515             


Ile Ala  Glu Leu Gly Arg Lys  Glu Met Glu Pro Lys  Ser His His 
    1520                 1525                 1530             


Thr Leu  Lys Ile Ala His Gln  Thr Ile Ala Asn Met  Gln Ala Arg 
    1535                 1540                 1545             


Leu Asn  Gln Lys Glu Glu Val  Leu Lys Lys Tyr Gln  Arg Leu Leu 
    1550                 1555                 1560             


Glu Lys  Ala Arg Glu Glu Gln  Arg Glu Ile Val Lys  Lys His Glu 
    1565                 1570                 1575             


Glu Asp  Leu His Ile Leu His  His Arg Leu Glu Leu  Gln Ala Asp 
    1580                 1585                 1590             


Ser Ser  Leu Asn Lys Phe Lys  Gln Thr Ala Trp Asp  Leu Met Lys 
    1595                 1600                 1605             


Gln Ser  Pro Thr Pro Val Pro  Thr Asn Lys His Phe  Ile Arg Leu 
    1610                 1615                 1620             


Ala Glu  Met Glu Gln Thr Val  Ala Glu Gln Asp Asp  Ser Leu Ser 
    1625                 1630                 1635             


Ser Leu  Leu Val Lys Leu Lys  Lys Val Ser Gln Asp  Leu Glu Arg 
    1640                 1645                 1650             


Gln Arg  Glu Ile Thr Glu Leu  Lys Val Lys Glu Phe  Glu Asn Ile 
    1655                 1660                 1665             


Lys Leu  Gln Leu Gln Glu Asn  His Glu Asp Glu Val  Lys Lys Val 
    1670                 1675                 1680             


Lys Ala  Glu Val Glu Asp Leu  Lys Tyr Leu Leu Asp  Gln Ser Gln 
    1685                 1690                 1695             


Lys Glu  Ser Gln Cys Leu Lys  Ser Glu Leu Gln Ala  Gln Lys Glu 
    1700                 1705                 1710             


Ala Asn  Ser Arg Ala Pro Thr  Thr Thr Met Arg Asn  Leu Val Glu 
    1715                 1720                 1725             


Arg Leu  Lys Ser Gln Leu Ala  Leu Lys Glu Lys Gln  Gln Lys Ala 
    1730                 1735                 1740             


Leu Ser  Arg Ala Leu Leu Glu  Leu Arg Ala Glu Met  Thr Ala Ala 
    1745                 1750                 1755             


Ala Glu  Glu Arg Ile Ile Ser  Ala Thr Ser Gln Lys  Glu Ala His 
    1760                 1765                 1770             


Leu Asn  Val Gln Gln Ile Val  Asp Arg His Thr Arg  Glu Leu Lys 
    1775                 1780                 1785             


Thr Gln  Val Glu Asp Leu Asn  Glu Asn Leu Leu Lys  Leu Lys Glu 
    1790                 1795                 1800             


Ala Leu  Lys Thr Ser Lys Asn  Arg Glu Asn Ser Leu  Thr Asp Asn 
    1805                 1810                 1815             


Leu Asn  Asp Leu Asn Asn Glu  Leu Gln Lys Lys Gln  Lys Ala Tyr 
    1820                 1825                 1830             


Asn Lys  Ile Leu Arg Glu Lys  Glu Glu Ile Asp Gln  Glu Asn Asp 
    1835                 1840                 1845             


Glu Leu  Lys Arg Gln Ile Lys  Arg Leu Thr Ser Gly  Leu Gln Gly 
    1850                 1855                 1860             


Lys Pro  Leu Thr Asp Asn Lys  Gln Ser Leu Ile Glu  Glu Leu Gln 
    1865                 1870                 1875             


Arg Lys  Val Lys Lys Leu Glu  Asn Gln Leu Glu Gly  Lys Val Glu 
    1880                 1885                 1890             


Glu Val  Asp Leu Lys Pro Met  Lys Glu Lys Asn Ala  Lys Glu Glu 
    1895                 1900                 1905             


Leu Ile  Arg Trp Glu Glu Gly  Lys Lys Trp Gln Ala  Lys Ile Glu 
    1910                 1915                 1920             


Gly Ile  Arg Asn Lys Leu Lys  Glu Lys Glu Gly Glu  Val Phe Thr 
    1925                 1930                 1935             


Leu Thr  Lys Gln Leu Asn Thr  Leu Lys Asp Leu Phe  Ala Lys Ala 
    1940                 1945                 1950             


Asp Lys  Glu Lys Leu Thr Leu  Gln Arg Lys Leu Lys  Thr Thr Gly 
    1955                 1960                 1965             


Met Thr  Val Asp Gln Val Leu  Gly Ile Arg Ala Leu  Glu Ser Glu 
    1970                 1975                 1980             


Lys Glu  Leu Glu Glu Leu Lys  Lys Arg Asn Leu Asp  Leu Glu Asn 
    1985                 1990                 1995             


Asp Ile  Leu Tyr Met Arg Ala  His Gln Ala Leu Pro  Arg Asp Ser 
    2000                 2005                 2010             


Val Val  Glu Asp Leu His Leu  Gln Asn Arg Tyr Leu  Gln Glu Lys 
    2015                 2020                 2025             


Leu His  Ala Leu Glu Lys Gln  Phe Ser Lys Asp Thr  Tyr Ser Lys 
    2030                 2035                 2040             


Pro Ser  Ile Ser Gly Ile Glu  Ser Asp Asp His Cys  Gln Arg Glu 
    2045                 2050                 2055             


Gln Glu  Leu Gln Lys Glu Asn  Leu Lys Leu Ser Ser  Glu Asn Ile 
    2060                 2065                 2070             


Glu Leu  Lys Phe Gln Leu Glu  Gln Ala Asn Lys Asp  Leu Pro Arg 
    2075                 2080                 2085             


Leu Lys  Asn Gln Val Arg Asp  Leu Lys Glu Met Cys  Glu Phe Leu 
    2090                 2095                 2100             


Lys Lys  Glu Lys Ala Glu Val  Gln Arg Lys Leu Gly  His Val Arg 
    2105                 2110                 2115             


Gly Ser  Gly Arg Ser Gly Lys  Thr Ile Pro Glu Leu  Glu Lys Thr 
    2120                 2125                 2130             


Ile Gly  Leu Met Lys Lys Val  Val Glu Lys Val Gln  Arg Glu Asn 
    2135                 2140                 2145             


Glu Gln  Leu Lys Lys Ala Ser  Gly Ile Leu Thr Ser  Glu Lys Met 
    2150                 2155                 2160             


Ala Asn  Ile Glu Gln Glu Asn  Glu Lys Leu Lys Ala  Glu Leu Glu 
    2165                 2170                 2175             


Lys Leu  Lys Ala His Leu Gly  His Gln Leu Ser Met  His Tyr Glu 
    2180                 2185                 2190             


Ser Lys  Thr Lys Gly Thr Glu  Lys Ile Ile Ala Glu  Asn Glu Arg 
    2195                 2200                 2205             


Leu Arg  Lys Glu Leu Lys Lys  Glu Thr Asp Ala Ala  Glu Lys Leu 
    2210                 2215                 2220             


Arg Ile  Ala Lys Asn Asn Leu  Glu Ile Leu Asn Glu  Lys Met Thr 
    2225                 2230                 2235             


Val Gln  Leu Glu Glu Thr Gly  Lys Arg Leu Gln Phe  Ala Glu Ser 
    2240                 2245                 2250             


Arg Gly  Pro Gln Leu Glu Gly  Ala Asp Ser Lys Ser  Trp Lys Ser 
    2255                 2260                 2265             


Ile Val  Val Thr Arg Met Tyr  Glu Thr Lys Leu Lys  Glu Leu Glu 
    2270                 2275                 2280             


Thr Asp  Ile Ala Lys Lys Asn  Gln Ser Ile Thr Asp  Leu Lys Gln 
    2285                 2290                 2295             


Leu Val  Lys Glu Ala Thr Glu  Arg Glu Gln Lys Val  Asn Lys Tyr 
    2300                 2305                 2310             


Asn Glu  Asp Leu Glu Gln Gln  Ile Lys Ile Leu Lys  His Val Pro 
    2315                 2320                 2325             


Glu Gly  Ala Glu Thr Glu Gln  Gly Leu Lys Arg Glu  Leu Gln Val 
    2330                 2335                 2340             


Leu Arg  Leu Ala Asn His Gln  Leu Asp Lys Glu Lys  Ala Glu Leu 
    2345                 2350                 2355             


Ile His  Gln Ile Glu Ala Asn  Lys Asp Gln Ser Gly  Ala Glu Ser 
    2360                 2365                 2370             


Thr Ile  Pro Asp Ala Asp Gln  Leu Lys Glu Lys Ile  Lys Asp Leu 
    2375                 2380                 2385             


Glu Thr  Gln Leu Lys Met Ser  Asp Leu Glu Lys Gln  His Leu Lys 
    2390                 2395                 2400             


Glu Glu  Ile Lys Lys Leu Lys  Lys Glu Leu Glu Asn  Phe Asp Pro 
    2405                 2410                 2415             


Ser Phe  Phe Glu Glu Ile Glu  Asp Leu Lys Tyr Asn  Tyr Lys Glu 
    2420                 2425                 2430             


Glu Val  Lys Lys Asn Ile Leu  Leu Glu Glu Lys Val  Lys Lys Leu 
    2435                 2440                 2445             


Ser Glu  Gln Leu Gly Val Glu  Leu Thr Ser Pro Val  Ala Ala Ser 
    2450                 2455                 2460             


Glu Glu  Phe Glu Asp Glu Glu  Glu Ser Pro Val Asn  Phe Pro Ile 
    2465                 2470                 2475             


Tyr 
    


<210>  204
<211>  7440
<212>  DNA
<213>  Homo sapiens

<400>  204
atgccaccta atataaactg gaaagaaata atgaaagttg acccagatga cctgccccgt       60

caagaagaac tggcagataa tttattgatt tccttatcca aggtggaagt aaatgagcta      120

aaaagtgaaa agcaagaaaa tgtgatacac cttttcagaa ttactcagtc actaatgaag      180

atgaaagctc aagaagtgga gctggctttg gaagaagtag aaaaagctgg agaagaacaa      240

gcaaaatttg aaaatcaatt aaaaactaaa gtaatgaaac tggaaaatga actggagatg      300

gctcagcagt ctgcaggtgg acgagatact cggtttttac gtaatgaaat ttgccaactt      360

gaaaaacaat tagaacaaaa agatagagaa ttggaggaca tggaaaagga gttggagaaa      420

gagaagaaag ttaatgagca attggctctt cgaaatgagg aggcagaaaa tgaaaacagc      480

aaattaagaa gagagaacaa acgtctaaag aaaaagaatg aacaactttg tcaggatatt      540

attgactacc agaaacaaat agattcacag aaagaaacac ttttatcaag aagaggggaa      600

gacagtgact accgatcaca gttgtctaaa aaaaactatg agcttatcca atatcttgat      660

gaaattcaga ctttaacaga agctaatgag aaaattgaag ttcagaatca agaaatgaga      720

aaaaatttag aagagtctgt acaggaaatg gagaagatga ctgatgaata taatagaatg      780

aaagctattg tgcatcagac agataatgta atagatcagt taaaaaaaga aaacgatcat      840

tatcaacttc aagtgcagga gcttacagat cttctgaaat caaaaaatga agaagatgat      900

ccaattatgg tagctgtcaa tgcaaaagta gaagaatgga agctaatttt gtcttctaaa      960

gatgatgaaa ttattgagta tcagcaaatg ttacataacc taagggagaa acttaagaat     1020

gctcagcttg atgctgataa aagtaatgtt atggctctac agcagggtat acaggaacga     1080

gacagtcaaa ttaagatgct caccgaacaa gtagaacaat atacaaaaga aatggaaaag     1140

aatacttgta ttattgaaga tttgaaaaat gagctccaaa gaaacaaagg tgcttcaacc     1200

ctttctcaac agactcatat gaaaattcag tcaacgttag acattttaaa agagaaaact     1260

aaagaggctg agagaacagc tgaactggct gaggctgatg ctagggaaaa ggataaagaa     1320

ttagttgagg ctctgaagag gttaaaagat tatgaatcgg gagtatatgg tttagaagat     1380

gctgtcgttg aaataaagaa ttgtaaaaac caaattaaaa taagagatcg agagattgaa     1440

atattaacaa aggaaatcaa taaacttgaa ttgaagatca gtgatttcct tgatgaaaat     1500

gaggcactta gagagcgtgt gggccttgaa ccaaagacaa tgattgattt aactgaattt     1560

agaaatagca aacacttaaa acagcagcag tacagagctg aaaaccagat tcttttgaaa     1620

gagattgaaa gtctagagga agaacgactt gatctgaaaa aaaaaattcg tcaaatggct     1680

caagaaagag gaaaaagaag tgcaacttca ggattaacca ctgaggacct gaacctaact     1740

gaaaacattt ctcaaggaga tagaataagt gaaagaaaat tggatttatt gagcctcaaa     1800

aatatgagtg aagcacaatc aaagaatgaa tttctttcaa gagaactaat tgaaaaagaa     1860

agagatttag aaaggagtag gacagtgata gccaaatttc agaataaatt aaaagaatta     1920

gttgaagaaa ataagcaact tgaagaaggt atgaaagaaa tattgcaagc aattaaggaa     1980

atgcagaaag atcctgatgt taaaggagga gaaacatctc taattatccc tagccttgaa     2040

agactagtta atgctataga atcaaagaat gcagaaggaa tctttgatgc gagtctgcat     2100

ttgaaagccc aagttgatca gcttaccgga agaaatgaag aattaagaca ggagctcagg     2160

gaatctcgga aagaggctat aaattattca cagcagttgg caaaagctaa tttaaagata     2220

gaccatcttg aaaaagaaac tagtctttta cgacaatcag aaggatcaaa tgttgttttt     2280

aaaggaattg acttacctga tgggatagca ccatctagtg ccagtatcat taattctcag     2340

aatgaatatt taatacattt gttacaggaa ctagaaaata aagaaaaaaa gttaaagaat     2400

ttagaagatt ctcttgaaga ttacaacaga aaatttgctg taattcgtca tcaacaaagt     2460

ttgttgtata aagaatacct aagtgaaaag gagacctgga aaacagaatc taaaacaata     2520

aaagaggaaa agagaaaact tgaggatcaa gtccaacaag atgctataaa agtaaaagaa     2580

tataataatt tgctcaatgc tcttcagatg gattcggatg aaatgaaaaa aatacttgca     2640

gaaaatagta ggaaaattac tgttttgcaa gtgaatgaaa aatcacttat aaggcaatat     2700

acaaccttag tagaattgga gcgacaactt agaaaagaaa atgagaagca aaagaatgaa     2760

ttgttgtcaa tggaggctga agtttgtgaa aaaattgggt gtttgcaaag atttaaggaa     2820

atggccattt tcaagattgc agctctccaa aaagttgtag ataatagtgt ttctttgtct     2880

gaactagaac tggctaataa acagtacaat gaactgactg ctaagtacag ggacatcttg     2940

caaaaagata atatgcttgt tcaaagaaca agtaacttgg aacacctgga gtgtgaaaac     3000

atctccttaa aagaacaagt ggagtctata aataaagaac tggagattac caaggaaaaa     3060

cttcacacta ttgaacaagc ctgggaacag gaaactaaat taggtaatga atctagcatg     3120

gataaggcaa agaaatcaat aaccaacagt gacattgttt ccatttcaaa aaaaataact     3180

atgctggaaa tgaaggaatt aaatgaaagg cagcgggctg aacattgtca aaaaatgtat     3240

gaacacttac ggacttcgtt aaagcaaatg gaggaacgta attttgaatt ggaaaccaaa     3300

tttgctgagc ttaccaaaat caatttggat gcacagaagg tggaacagat gttaagagat     3360

gaattagctg atagtgtgag caaggcagta agtgatgctg ataggcaacg gattctagaa     3420

ttagagaaga atgaaatgga actaaaagtt gaagtgtcaa aactgagaga gatttctgat     3480

attgccagaa gacaagttga aattttgaat gcacaacaac aatctaggga caaggaagta     3540

gagtccctca gaatgcaact gctagactat caggcacagt ctgatgaaaa gtcgctcatt     3600

gccaagttgc accaacataa tgtctctctt caactgagtg aggctactgc tcttggtaag     3660

ttggagtcaa ttacatctaa actgcagaag atggaggcct acaacttgcg cttagagcag     3720

aaacttgatg aaaaagaaca ggctctctat tatgctcgtt tggagggaag aaacagagca     3780

aaacatctgc gccaaacaat tcagtctcta cgacgacagt ttagtggagc tttacccttg     3840

gcacaacagg aaaagttctc caaaacaatg attcaactac aaaatgacaa acttaagata     3900

atgcaagaaa tgaaaaattc tcaacaagaa catagaaata tggagaacaa aacattggag     3960

atggaattaa aattaaaggg cctggaagag ttaataagca ctttaaagga taccaaagga     4020

gcccaaaagg taatcaactg gcatatgaaa atagaagaac ttcgtcttca agaacttaaa     4080

ctaaatcggg aattagtcaa ggataaagaa gaaataaaat atttgaataa cataatttct     4140

gaatatgaac gtacaatcag cagtcttgaa gaagaaattg tgcaacagaa caagtttcat     4200

gaagaaagac aaatggcctg ggatcaaaga gaagttgacc tggaacgcca actagacatt     4260

tttgaccgtc agcaaaatga aatactaaat gcggcacaaa agtttgaaga agctacagga     4320

tcaatccctg accctagttt gccccttcca aatcaacttg agatcgctct aaggaaaatt     4380

aaggagaaca ttcgaataat tctagaaaca cgggcaactt gcaaatcact agaagagaaa     4440

ctaaaagaga aagaatctgc tttaaggtta gcagaacaaa atatactgtc aagagacaaa     4500

gtaatcaatg aactgaggct tcgattgcct gccactgcag aaagagaaaa gctcatagct     4560

gagctaggca gaaaagagat ggaaccaaaa tctcaccaca cattgaaaat tgctcatcaa     4620

accattgcaa acatgcaagc aaggttaaat caaaaagaag aagtattaaa gaagtatcaa     4680

cgtcttctag aaaaagccag agaggagcaa agagaaattg tgaagaaaca tgaggaagac     4740

cttcatattc ttcatcacag attagaacta caggctgata gttcactaaa taaattcaaa     4800

caaacggctt gggatttaat gaaacagtct cccactccag ttcctaccaa caagcatttt     4860

attcgtctgg ctgagatgga acagacagta gcagaacaag atgactctct ttcctcactc     4920

ttggtcaaac taaagaaagt atcacaagat ttggagagac aaagagaaat cactgaatta     4980

aaagtaaaag aatttgaaaa tatcaaatta cagcttcaag aaaaccatga agatgaagtg     5040

aaaaaagtaa aagcggaagt agaggattta aagtatcttc tggaccagtc acaaaaggag     5100

tcacagtgtt taaaatctga acttcaggct caaaaagaag caaattcaag agctccaaca     5160

actacaatga gaaatctagt agaacggcta aagagccaat tagccttgaa ggagaaacaa     5220

cagaaagcac ttagtcgggc acttttagaa ctccgggcag aaatgacagc agctgctgaa     5280

gaacgtatta tttctgcaac ttctcaaaaa gaggcccatc tcaatgttca acaaatcgtt     5340

gatcgacata ctagagagct aaagacacaa gttgaagatt taaatgaaaa tcttttaaaa     5400

ttgaaagaag cacttaaaac aagtaaaaac agagaaaact cactaactga taatttgaat     5460

gacttaaata atgaactgca aaagaaacaa aaagcctata ataaaatact tagagagaaa     5520

gaggaaattg atcaagagaa tgatgaactg aaaaggcaaa ttaaaagact aaccagtgga     5580

ttacagggca aacccctgac agataataaa caaagtctaa ttgaagaact ccaaaggaaa     5640

gttaaaaaac tagagaacca attagaggga aaggtggagg aagtagacct aaaacctatg     5700

aaagaaaaga atgctaaaga agaattaatt aggtgggaag aaggtaaaaa gtggcaagcc     5760

aaaatagaag gaattcgaaa caagttaaaa gagaaagagg gggaagtctt tactttaaca     5820

aagcagttga atactttgaa ggatcttttt gccaaagccg ataaagagaa acttactttg     5880

cagaggaaac taaaaacaac tggcatgact gttgatcagg ttttgggaat acgagctttg     5940

gagtcagaaa aagaattgga agaattaaaa aagagaaatc ttgacttaga aaatgatata     6000

ttgtatatga gggcccacca agctcttcct cgagattctg ttgtagaaga tttacattta     6060

caaaatagat acctccaaga aaaacttcat gctttagaaa aacagttttc aaaggataca     6120

tattctaagc cttcaatttc aggaatagag tcagatgatc attgtcagag agaacaggag     6180

cttcagaagg aaaacttgaa gttgtcatct gaaaatattg aactgaaatt tcagcttgaa     6240

caagcaaata aagatttgcc aagattaaag aatcaagtca gagatttgaa ggaaatgtgt     6300

gaatttctta agaaagaaaa agcagaagtt cagcggaaac ttggccatgt tagagggtct     6360

ggtagaagtg gaaagacaat cccagaactg gaaaaaacca ttggtttaat gaaaaaagta     6420

gttgaaaaag tccagagaga aaatgaacag ttgaaaaaag catcaggaat attgactagt     6480

gaaaaaatgg ctaatattga gcaggaaaat gaaaaattga aggctgaatt agaaaaactt     6540

aaagctcatc ttgggcatca gttgagcatg cactatgaat ccaagaccaa aggcacagaa     6600

aaaattattg ctgaaaatga aaggcttcgt aaagaactta aaaaagaaac tgatgctgca     6660

gagaaattac ggatagcaaa gaataattta gagatattaa atgagaagat gacagttcaa     6720

ctagaagaga ctggtaagag attgcagttt gcagaaagca gaggtccaca gcttgaaggt     6780

gctgacagta agagctggaa atccattgtg gttacaagaa tgtatgaaac caagttaaaa     6840

gaattggaaa ctgatattgc caaaaaaaat caaagcatta ctgaccttaa acagcttgta     6900

aaagaagcaa cagagagaga acaaaaagtt aacaaataca atgaagacct tgaacaacag     6960

attaagattc ttaaacatgt tcctgaaggt gctgagacag agcaaggcct taaacgggag     7020

cttcaagttc ttagattagc taatcatcag ctggataaag agaaagcaga attaatccat     7080

cagatagaag ctaacaagga ccaaagtgga gctgaaagca ccatacctga tgctgatcaa     7140

ctaaaggaaa aaataaaaga tctagagaca cagctcaaaa tgtcagatct agaaaagcag     7200

catttgaagg aggaaataaa gaagctgaaa aaagaactgg aaaattttga tccttcattt     7260

tttgaagaaa ttgaagatct taagtataat tacaaggaag aagtgaagaa gaatattctc     7320

ttagaagaga aggtaaaaaa actttcagaa caattgggag ttgaattaac tagccctgtt     7380

gctgcttctg aagagtttga agatgaagaa gaaagtcctg ttaatttccc catttactaa     7440


<210>  205
<211>  724
<212>  PRT
<213>  Adeno-associated virus

<400>  205

Met Ser Phe Val Asp His Pro Pro Asp Trp Leu Glu Glu Val Gly Glu 
1               5                   10                  15      


Gly Leu Arg Glu Phe Leu Gly Leu Glu Ala Gly Pro Pro Lys Pro Lys 
            20                  25                  30          


Pro Asn Gln Gln His Gln Asp Gln Ala Arg Gly Leu Val Leu Pro Gly 
        35                  40                  45              


Tyr Asn Tyr Leu Gly Pro Gly Asn Gly Leu Asp Arg Gly Glu Pro Val 
    50                  55                  60                  


Asn Arg Ala Asp Glu Val Ala Arg Glu His Asp Ile Ser Tyr Asn Glu 
65                  70                  75                  80  


Gln Leu Glu Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 
                85                  90                  95      


Ala Glu Phe Gln Glu Lys Leu Ala Asp Asp Thr Ser Phe Gly Gly Asn 
            100                 105                 110         


Leu Gly Lys Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro Phe 
        115                 120                 125             


Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Thr Gly Lys Arg Ile 
    130                 135                 140                 


Asp Asp His Phe Pro Lys Arg Lys Lys Ala Arg Thr Glu Glu Asp Ser 
145                 150                 155                 160 


Lys Pro Ser Thr Ser Ser Asp Ala Glu Ala Gly Pro Ser Gly Ser Gln 
                165                 170                 175     


Gln Leu Gln Ile Pro Ala Gln Pro Ala Ser Ser Leu Gly Ala Asp Thr 
            180                 185                 190         


Met Ser Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gln Gly Ala 
        195                 200                 205             


Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys Asp Ser Thr Trp 
    210                 215                 220                 


Met Gly Asp Arg Val Val Thr Lys Ser Thr Arg Thr Trp Val Leu Pro 
225                 230                 235                 240 


Ser Tyr Asn Asn His Gln Tyr Arg Glu Ile Lys Ser Gly Ser Val Asp 
                245                 250                 255     


Gly Ser Asn Ala Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
            260                 265                 270         


Phe Asp Phe Asn Arg Phe His Ser His Trp Ser Pro Arg Asp Trp Gln 
        275                 280                 285             


Arg Leu Ile Asn Asn Tyr Trp Gly Phe Arg Pro Arg Ser Leu Arg Val 
    290                 295                 300                 


Lys Ile Phe Asn Ile Gln Val Lys Glu Val Thr Val Gln Asp Ser Thr 
305                 310                 315                 320 


Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
                325                 330                 335     


Asp Asp Tyr Gln Leu Pro Tyr Val Val Gly Asn Gly Thr Glu Gly Cys 
            340                 345                 350         


Leu Pro Ala Phe Pro Pro Gln Val Phe Thr Leu Pro Gln Tyr Gly Tyr 
        355                 360                 365             


Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arg Ser Ser 
    370                 375                 380                 


Phe Phe Cys Leu Glu Tyr Phe Pro Ser Lys Met Leu Arg Thr Gly Asn 
385                 390                 395                 400 


Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu Val Pro Phe His Ser Ser 
                405                 410                 415     


Phe Ala Pro Ser Gln Asn Leu Phe Lys Leu Ala Asn Pro Leu Val Asp 
            420                 425                 430         


Gln Tyr Leu Tyr Arg Phe Val Ser Thr Asn Asn Thr Gly Gly Val Gln 
        435                 440                 445             


Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 
    450                 455                 460                 


Phe Pro Gly Pro Met Gly Arg Thr Gln Gly Trp Asn Leu Gly Ser Gly 
465                 470                 475                 480 


Val Asn Arg Ala Ser Val Ser Ala Phe Ala Thr Thr Asn Arg Met Glu 
                485                 490                 495     


Leu Glu Gly Ala Ser Tyr Gln Val Pro Pro Gln Pro Asn Gly Met Thr 
            500                 505                 510         


Asn Asn Leu Gln Gly Ser Asn Thr Tyr Ala Leu Glu Asn Thr Met Ile 
        515                 520                 525             


Phe Asn Ser Gln Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 
    530                 535                 540                 


Gly Asn Met Leu Ile Thr Ser Glu Ser Glu Thr Gln Pro Val Asn Arg 
545                 550                 555                 560 


Val Ala Tyr Asn Val Gly Gly Gln Met Ala Thr Asn Asn Gln Ser Ser 
                565                 570                 575     


Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gln Glu Ile Val Pro 
            580                 585                 590         


Gly Ser Val Trp Met Glu Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
        595                 600                 605             


Ala Lys Ile Pro Glu Thr Gly Ala His Phe His Pro Ser Pro Ala Met 
    610                 615                 620                 


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Met Met Leu Ile Lys Asn 
625                 630                 635                 640 


Thr Pro Val Pro Gly Asn Ile Thr Ser Phe Ser Asp Val Pro Val Ser 
                645                 650                 655     


Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Thr Val Glu Met Glu 
            660                 665                 670         


Trp Glu Leu Lys Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln 
        675                 680                 685             


Tyr Thr Asn Asn Tyr Asn Asp Pro Gln Phe Val Asp Phe Ala Pro Asp 
    690                 695                 700                 


Ser Thr Gly Glu Tyr Arg Thr Thr Arg Pro Ile Gly Thr Arg Tyr Leu 
705                 710                 715                 720 


Thr Arg Pro Leu 
                


<210>  206
<211>  737
<212>  PRT
<213>  Adeno-associated virus

<400>  206

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asn Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Ala Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 
            180                 185                 190         


Pro Ala Ala Pro Ser Ser Val Gly Ser Gly Thr Val Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
    210                 215                 220                 


Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Ser Glu Thr Ala Gly Ser Thr Asn Asp Asn 
            260                 265                 270         


Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Ile Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn 
    370                 375                 380                 


Gly Ser Gln Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ala 
        435                 440                 445             


Arg Thr Gln Ser Asn Pro Gly Gly Thr Ala Gly Asn Arg Glu Leu Gln 
    450                 455                 460                 


Phe Tyr Gln Gly Gly Pro Ser Thr Met Ala Glu Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Phe Arg Gln Gln Arg Val Ser Lys Thr Leu Asp 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Asp Arg Phe Phe Pro Ser Ser Gly Val Leu Ile 
    530                 535                 540                 


Phe Gly Lys Thr Gly Ala Thr Asn Lys Thr Thr Leu Glu Asn Val Leu 
545                 550                 555                 560 


Met Thr Asn Glu Glu Glu Ile Arg Pro Thr Asn Pro Val Ala Thr Glu 
                565                 570                 575     


Glu Tyr Gly Ile Val Ser Ser Asn Leu Gln Ala Ala Asn Thr Ala Ala 
            580                 585                 590         


Gln Thr Gln Val Val Asn Asn Gln Gly Ala Leu Pro Gly Met Val Trp 
        595                 600                 605             


Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro 
    610                 615                 620                 


His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly 
625                 630                 635                 640 


Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro 
                645                 650                 655     


Ala Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe Ile 
            660                 665                 670         


Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu 
        675                 680                 685             


Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser 
    690                 695                 700                 


Asn Phe Glu Lys Gln Thr Gly Val Asp Phe Ala Val Asp Ser Gln Gly 
705                 710                 715                 720 


Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn 
                725                 730                 735     


Leu 
    


<210>  207
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  LoxN

<400>  207
ataacttcgt atagtatacc ttatacgaag ttat                                   34


