                         SEQUENCE LISTING

<110>  HANGZHOU JIAYIN BIOTECH LTD.
       HANGZHOU EXEGENESIS BIO LTD.
 
<120>  NUCLEIC ACID CONSTRUCTS AND USES THEREOF FOR TREATING SPINAL 
       MUSCULAR ATROPHY

<130>  14652-017-228

<140>
<141>

<150>  PCT/CN2020/138056
<151>  2020-12-21

<150>  PCT/CN2020/107173
<151>  2020-08-05

<160>  53    

<170>  PatentIn version 3.5

<210>  1
<211>  71
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  hsa-mir-1-5p

<400>  1
ugggaaacau acuucuuuau augcccauau ggaccugcua agcuauggaa uguaaagaag       60

uauguaucuc a                                                            71


<210>  2
<211>  71
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  hsa-mir-208a-5p

<400>  2
ugacgggcga gcuuuuggcc cggguuauac cugaugcuca cguauaagac gagcaaaaag       60

cuuguugguc a                                                            71


<210>  3
<211>  77
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  hsa-mir-208b-5p

<400>  3
ccucucaggg aagcuuuuug cucgaauuau guuucugauc cgaauauaag acgaacaaaa       60

gguuugucug agggcag                                                      77


<210>  4
<211>  85
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  hsa-mir-122

<400>  4
ccuuagcaga gcuguggagu gugacaaugg uguuuguguc uaaacuauca aacgccauua       60

ucacacuaaa uagcuacugc uaggc                                             85


<210>  5
<211>  88
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  hsa-mir-133a-1

<400>  5
acaaugcuuu gcuagagcug guaaaaugga accaaaucgc cucuucaaug gauuuggucc       60

ccuucaacca gcuguagcua ugcauuga                                          88


<210>  6
<211>  83
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  hsa-mir-488-5p

<400>  6
gagaaucauc ucucccagau aauggcacuc ucaaacaagu uuccaaauug uuugaaaggc       60

uauuucuugg ucagaugacu cuc                                               83


<210>  7
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  target segment of hsa-mir-1-5p

<400>  7
atgggcatat aaagaagtat gt                                                22


<210>  8
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  target segment of hsa-mir-208a-5p

<400>  8
gtataacccg ggccaaaagc tc                                                22


<210>  9
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  target segment of hsa-mir-208b-5p

<400>  9
acataattcg agcaaaaagc t                                                 21


<210>  10
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  target segment of hsa-mir-122

<400>  10
caaacaccat tgtcacactc ca                                                22


<210>  11
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  target segment of hsa-mir-133a-1

<400>  11
cagctggttg aaggggacca aa                                                22


<210>  12
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  target segment of hsa-mir-488-5p

<400>  12
ttgagagtgc cattatctgg g                                                 21


<210>  13
<211>  6
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG-Link01

<400>  13
cttgac                                                                   6


<210>  14
<211>  6
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG-Link02

<400>  14
ccatag                                                                   6


<210>  15
<211>  6
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG-Link03

<400>  15
tttcta                                                                   6


<210>  16
<211>  6
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG-Link04

<400>  16
caagct                                                                   6


<210>  17
<211>  6
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG-Link05

<400>  17
gatcta                                                                   6


<210>  18
<211>  486
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  miRNA Sponge Region-1x hsa-mir-1; 2x hsa-mir-208a; 2x 
       hsa-mir-208b; 3x hsa-mir-122 (as in EXG202)

<400>  18
ttgaatgagg cttcagtact ttacagaatc gttgcctgca catcttggaa acacttgctg       60

ggattacttc ttcaggttaa cccaacagaa ggctcgagaa ggtatattgc tgttgacagt      120

gagcgctaca tacttcttta tatgcccatg tgaagccaca gatgatgggc atataaagaa      180

gtatgtattg cctactgcct cggaattcaa ggggctactt taggagcaat tatcttgttt      240

actaaaactg aataccttgc tatctctttg atacattttt acaaagctga attaaaatgg      300

tataaattaa atcacttttt tctagtataa cccgggccaa aagctcagta taacccgggc      360

caaaagctcg acataattcg agcaaaaagc taacataatt cgagcaaaaa gctcttgaca      420

aacaccattg tcacactcca acaaacacca ttgtcacact ccaacaaaca ccattgtcac      480

actcca                                                                 486


<210>  19
<211>  237
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  miRNA Sponge Region-2x hsa-mir-208a; 2x hsa-mir-208b; 3x 
       hsa-mir-122; 3x hsa-mir-133a (as in EXG204)

<400>  19
gtataacccg ggccaaaagc tcagtataac ccgggccaaa agctcgacat aattcgagca       60

aaaagctaac ataattcgag caaaaagctc ttgacaaaca ccattgtcac actccaacaa      120

acaccattgt cacactccaa caaacaccat tgtcacactc caccatagac agctggttga      180

aggggaccaa aacagctggt tgaaggggac caaaacagct ggttgaaggg gaccaaa         237


<210>  20
<211>  258
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  miRNA Sponge Region-2x hsa-mir-208a; 2x hsa-mir-208b; 3x 
       hsa-mir-122; 2x hsa-mir-488; 2x hsa-mir-1 (as in EXG205)

<400>  20
gtataacccg ggccaaaagc tcagtataac ccgggccaaa agctcgacat aattcgagca       60

aaaagctaac ataattcgag caaaaagctc ttgacaaaca ccattgtcac actccaacaa      120

acaccattgt cacactccaa caaacaccat tgtcacactc caccatagat tgagagtgcc      180

attatctggg attgagagtg ccattatctg ggaatgggca tataaagaag tatgtaatgg      240

gcatataaag aagtatgt                                                    258


<210>  21
<211>  486
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  miRNA Sponge Region-1x hsa-mir-133a; 2x hsa-mir-208a; 2x 
       hsa-mir-208b; 3x hsa-mir-122 (as in EXG206)

<400>  21
ttgaatgagg cttcagtact ttacagaatc gttgcctgca catcttggaa acacttgctg       60

ggattacttc ttcaggttaa cccaacagaa ggctcgagaa ggtatattgc tgttgacagt      120

gagcgctttt ggtccccttc aaccagctgg tgaagccaca gatgcagctg gttgaagggg      180

accaaaattg cctactgcct cggaattcaa ggggctactt taggagcaat tatcttgttt      240

actaaaactg aataccttgc tatctctttg atacattttt acaaagctga attaaaatgg      300

tataaattaa atcacttttt tctagtataa cccgggccaa aagctcagta taacccgggc      360

caaaagctcg acataattcg agcaaaaagc taacataatt cgagcaaaaa gctcttgaca      420

aacaccattg tcacactcca acaaacacca ttgtcacact ccaacaaaca ccattgtcac      480

actcca                                                                 486


<210>  22
<211>  2596
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG204-LmiR122-HmiR133

<400>  22
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatg cacgcgtgga      120

tctgagttca attcacgcgt ggtacccgtt acataactta cggtaaatgg cccgcctggc      180

tgaccgccca acgacccccg cccattgacg tcaatagtaa cgccaatagg gactttccat      240

tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca tcaagtgtat      300

catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc ctggcattgt      360

gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt attagtcatc      420

gctattacca tggtcgaggt gagccccacg ttctgcttca ctctccccat ctcccccccc      480

tccccacccc caattttgta tttatttatt ttttaattat tttgtgcagc gatgggggcg      540

gggggggggg gggggcgcgc gccaggcggg gcggggcggg gcgaggggcg gggcggggcg      600

aggcggagag gtgcggcggc agccaatcag agcggcgcgc tccgaaagtt tccttttatg      660

gcgaggcggc ggcggcggcg gccctataaa aagcgaagcg cgcggcgggc gggagtcgct      720

gcgacgctgc cttcgccccg tgccccgctc cgccgccgcc tcgcgccgcc cgccccggct      780

ctgactgacc gcgttactcc cacaggtgag cgggcgggac ggcccttctc ctccgggctg      840

taattagctg agcaagaggt aagggtttaa gggatggttg gttggtgggg tattaatgtt      900

taattacctg gagcacctgc ctgaaatcac tttttttcag gaattcccgg gatatcgtcg      960

acccacgcgt ccgggcccca cgctgcgcac ccgcgggttt gctatggcga tgagcagcgg     1020

cggcagtggt ggcggcgtcc cggagcagga ggattccgtg ctgttccggc gcggcacagg     1080

ccagagcgat gattctgaca tttgggatga tacagcactg ataaaagcat atgataaagc     1140

tgtggcttca tttaagcatg ctctaaagaa tggtgacatt tgtgaaactt cgggtaaacc     1200

aaaaaccaca cctaaaagaa aacctgctaa gaagaataaa agccaaaaga agaatactgc     1260

agcttcctta caacagtgga aagttgggga caaatgttct gccatttggt cagaagacgg     1320

ttgcatttac ccagctacca ttgcttcaat tgattttaag agagaaacct gtgttgtggt     1380

ttacactgga tatggaaata gagaggagca aaatctgtcc gatctacttt ccccaatctg     1440

tgaagtagct aataatatag aacagaatgc tcaagagaat gaaaatgaaa gccaagtttc     1500

aacagatgaa agtgagaact ccaggtctcc tggaaataaa tcagataaca tcaagcccaa     1560

atctgctcca tggaactctt ttctccctcc accacccccc atgccagggc caagactggg     1620

accaggaaag ccaggtctaa aattcaatgg cccaccaccg ccaccgccac caccaccacc     1680

ccacttacta tcatgctggc tgcctccatt tccttctgga ccaccaataa ttcccccacc     1740

acctcccata tgtccagatt ctcttgatga tgctgatgct ttgggaagta tgttaatttc     1800

atggtacatg agtggctatc atactggcta ttatatgggt tttagacaaa atcaaaaaga     1860

aggaaggtgc tcacattcct taaattaagg agaaatgctg gcatagagca gcactaaatg     1920

acaccactaa agaaacgatc agacagatct agtataaccc gggccaaaag ctcagtataa     1980

cccgggccaa aagctcgaca taattcgagc aaaaagctaa cataattcga gcaaaaagct     2040

cttgacaaac accattgtca cactccaaca aacaccattg tcacactcca acaaacacca     2100

ttgtcacact ccaccataga cagctggttg aaggggacca aaacagctgg ttgaagggga     2160

ccaaaacagc tggttgaagg ggaccaaaca agcttatcga taccgtcgac tagagctcgc     2220

tgatcagcct cgactgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg     2280

ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt     2340

gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc     2400

aagggggagg attgggaagt ctagagcagg catgctgggg agagatcgat ctgaggaacc     2460

cctagtgatg gagttggcca ctccctctct gcgcgctcgc tcgctcactg aggccgggcg     2520

accaaaggtc gcccgacgcc cgggctttgc ccgggcggcc tcagtgagcg agcgagcgcg     2580

cagagaggga gtggcc                                                     2596


<210>  23
<211>  2595
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG207-LmiR122-HmiR133

<400>  23
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatg cacgcgtgga      120

tctgagttca attcacgcgt ggtacctctg gtcgttacat aacttacggt aaatggcccg      180

cctggctgac cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata      240

gtaacgccaa tagggacttt ccattgacgt caatgggtgg agtatttacg gtaaactgcc      300

cacttggcag tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac      360

ggtaaatggc ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg      420

cagtacatct actcgaggcc acgttctgct tcactctccc catctccccc ccctccccac      480

ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg      540

ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga      600

gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc      660

ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagcg ggatcagcca      720

ccgcggtggc ggcctagagt cgacgaggaa ctgaaaaacc agaaagttaa ctggtaagtt      780

tagtcttttt gtcttttatt tcaggtcccg gatccggtgg tggtgcaaat caaagaactg      840

ctcctcagtg gatgttgcct ttacttctag gcctgtacgg aagtgttact tctgctctaa      900

aagctgcgga attgtacccg cggccgatcc accggtccgg aattcccggg atatcgtcga      960

cccacgcgtc cgggccccac gctgcgcacc cgcgggtttg ctatggccat gagcagcgga     1020

ggaagcggag gaggagtgcc cgagcaagag gacagcgtgc tgtttaggag aggaaccgga     1080

cagagcgatg actccgatat ctgggacgac accgctctga tcaaggccta tgacaaagcc     1140

gtggcctcct tcaagcacgc tctgaagaat ggcgatatct gtgagacctc cggcaaacct     1200

aagaccaccc ccaagaggaa gcccgccaag aagaacaagt cccagaagaa gaataccgcc     1260

gctagcctcc agcagtggaa agtgggcgat aagtgcagcg ccatttggag cgaggatgga     1320

tgcatctacc ccgccaccat tgccagcatc gacttcaaga gggagacatg cgtggtggtg     1380

tataccggat acggaaatag agaggagcag aatctgagcg atctgctgtc ccccatctgc     1440

gaggtggcca ataatatcga gcagaacgcc caagagaacg agaacgaaag ccaagtgtcc     1500

accgatgaga gcgagaactc cagaagcccc ggaaacaagt ccgacaacat caaacccaag     1560

agcgcccctt ggaacagctt tctgcctcct ccccccccca tgcccggccc tagactggga     1620

cccggcaagc ccggactgaa gttcaacgga cccccccctc ctcctccccc ccctcctcct     1680

catctgctga gctgctggct cccccctttc cctagcggcc cccccattat ccccccccct     1740

ccccctatct gtcccgacag cctcgatgac gctgacgccc tcggaagcat gctgatcagc     1800

tggtacatga gcggctacca caccggatac tacatgggct tcagacagaa ccagaaggag     1860

ggcagatgct cccactctct gaactgagga gaaatgctgg catagagcag cactaaatga     1920

caccactaaa gaaacgatca gacagatcta gtataacccg ggccaaaagc tcagtataac     1980

ccgggccaaa agctcgacat aattcgagca aaaagctaac ataattcgag caaaaagctc     2040

ttgacaaaca ccattgtcac actccaacaa acaccattgt cacactccaa caaacaccat     2100

tgtcacactc caccatagac agctggttga aggggaccaa aacagctggt tgaaggggac     2160

caaaacagct ggttgaaggg gaccaaacaa gcttatcgat accgtcgact agagctcgct     2220

gatcagcctc gactgtgcct tctagttgcc agccatctgt tgtttgcccc tcccccgtgc     2280

cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat gaggaaattg     2340

catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg caggacagca     2400

agggggagga ttgggaagtc tagagcaggc atgctgggga gagatcgatc tgaggaaccc     2460

ctagtgatgg agttggccac tccctctctg cgcgctcgct cgctcactga ggccgggcga     2520

ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct cagtgagcga gcgagcgcgc     2580

agagagggag tggcc                                                      2595


<210>  24
<211>  2595
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG209-LmiR122-HmiR133

<400>  24
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatt cacgcgtgga      120

tctgaattca attcacgcgt ggtacctctg gtcgttacat aacttacggt aaatggcccg      180

cctggctgac cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata      240

gtaacgccaa tagggacttt ccattgacgt caatgggtgg agtatttacg gtaaactgcc      300

cacttggcag tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac      360

ggtaaatggc ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg      420

cagtacatct actcgaggcc acgttctgct tcactctccc catctccccc ccctccccac      480

ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg      540

ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga      600

gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc      660

ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagcg ggatcagcca      720

ccgcggtggc ggcctagagt cgacgaggaa ctgaaaaacc agaaagttaa ctggtaagtt      780

tagtcttttt gtcttttatt tcaggtcccg gatccggtgg tggtgcaaat caaagaactg      840

ctcctcagtg gatgttgcct ttacttctag gcctgtacgg aagtgttact tctgctctaa      900

aagctgcgga attgtacccg cggccgatcc accggtccgg aattcccggg atatcgtcga      960

cccacgcgtc cgggccccac gctgcgcacc cgcgggtttg ctatggcgat gagcagcggc     1020

ggcagtggtg gcggcgtccc ggagcaggag gattccgtgc tgttccggcg cggcacaggc     1080

cagagcgatg attctgacat ttgggatgat acagcactga taaaagcata tgataaagct     1140

gtggcttcat ttaagcatgc tctaaagaat ggtgacattt gtgaaacttc gggtaaacca     1200

aaaaccacac ctaaaagaaa acctgctaag aagaataaaa gccaaaagaa gaatactgca     1260

gcttccttac aacagtggaa agttggggac aaatgttctg ccatttggtc agaagacggt     1320

tgcatttacc cagctaccat tgcttcaatt gattttaaga gagaaacctg tgttgtggtt     1380

tacactggat atggaaatag agaggagcaa aatctgtccg atctactttc cccaatctgt     1440

gaagtagcta ataatataga acagaatgct caagagaatg aaaatgaaag ccaagtttca     1500

acagatgaaa gtgagaactc caggtctcct ggaaataaat cagataacat caagcccaaa     1560

tctgctccat ggaactcttt tctccctcca ccacccccca tgccagggcc aagactggga     1620

ccaggaaagc caggtctaaa attcaatggc ccaccaccgc caccgccacc accaccaccc     1680

cacttactat catgctggct gcctccattt ccttctggac caccaataat tcccccacca     1740

cctcccatat gtccagattc tcttgatgat gctgatgctt tgggaagtat gttaatttca     1800

tggtacatga gtggctatca tactggctat tatatgggtt ttagacaaaa tcaaaaagaa     1860

ggaaggtgct cacattcctt aaattaagga gaaatgctgg catagagcag cactaaatga     1920

caccactaaa gaaacgatca gacagatcta gtataacccg ggccaaaagc tcagtataac     1980

ccgggccaaa agctcgacat aattcgagca aaaagctaac ataattcgag caaaaagctc     2040

ttgacaaaca ccattgtcac actccaacaa acaccattgt cacactccaa caaacaccat     2100

tgtcacactc caccatagac agctggttga aggggaccaa aacagctggt tgaaggggac     2160

caaaacagct ggttgaaggg gaccaaacaa gcttatcgat accgtcgact agagctcgct     2220

gatcagcctc gactgtgcct tctagttgcc agccatctgt tgtttgcccc tcccccgtgc     2280

cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat gaggaaattg     2340

catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg caggacagca     2400

agggggagga ttgggaagtc tagagcaggc atgctgggga gagatcgatc tgaggaaccc     2460

ctagtgatgg agttggccac tccctctctg cgcgctcgct cgctcactga ggccgggcga     2520

ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct cagtgagcga gcgagcgcgc     2580

agagagggag tggcc                                                      2595


<210>  25
<211>  2594
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG211-LmiR122-HmiR133

<400>  25
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatg cacgcgtgga      120

tctgagttca attcacgcgt ggtacccgtt acataactta cggtaaatgg cccgcctggc      180

tgaccgccca acgacccccg cccattgacg tcaatagtaa cgccaatagg gactttccat      240

tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca tcaagtgtat      300

catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc ctggcattgt      360

gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt attagtcatc      420

gctattacca tggtcgaggt gagccccacg ttctgcttca ctctccccat ctcccccccc      480

tccccacccc caattttgta tttatttatt ttttaattat tttgtgcagc gatgggggcg      540

gggggggggg gggggcgcgc gccaggcggg gcggggcggg gcgaggggcg gggcggggcg      600

aggcggagag gtgcggcggc agccaatcag agcggcgcgc tccgaaagtt tccttttatg      660

gcgaggcggc ggcggcggcg gccctataaa aagcgaagcg cgcggcgggc gggagtcgct      720

gcgacgctgc cttcgccccg tgccccgctc cgccgccgcc tcgcgccgcc cgccccggct      780

ctgactgacc gcgttactcc cacaggtgag cgggcgggac ggcccttctc ctccgggctg      840

taattagctg agcaagaggt aagggtttaa gggatggttg gttggtgggg tattaatgtt      900

taattacctg gagcacctgc ctgaaatcac tttttttcag gaattcccgg gatatcgtcg      960

acccacgcgt ccgggcccca cgctgcgcac ccggggccac catggccatg agcagcggag     1020

gaagcggagg aggagtgccc gagcaagagg acagcgtgct gtttaggaga ggaaccggac     1080

agagcgatga ctccgatatc tgggacgaca ccgctctgat caaggcctat gacaaagccg     1140

tggcctcctt caagcacgct ctgaagaatg gcgatatctg tgagacctcc ggcaaaccta     1200

agaccacccc caagaggaag cccgccaaga agaacaagtc ccagaagaag aataccgccg     1260

ctagcctcca gcagtggaaa gtgggcgata agtgcagcgc catttggagc gaggatggat     1320

gcatctaccc cgccaccatt gccagcatcg acttcaagag ggagacatgc gtggtggtgt     1380

ataccggata cggaaataga gaggagcaga atctgagcga tctgctgtcc cccatctgcg     1440

aggtggccaa taatatcgag cagaacgccc aagagaacga gaacgaaagc caagtgtcca     1500

ccgatgagag cgagaactcc agaagccccg gaaacaagtc cgacaacatc aaacccaaga     1560

gcgccccttg gaacagcttt ctgcctcctc ccccccccat gcccggccct agactgggac     1620

ccggcaagcc cggactgaag ttcaacggac ccccccctcc tcctcccccc cctcctcctc     1680

atctgctgag ctgctggctc ccccctttcc ctagcggccc ccccattatc cccccccctc     1740

cccctatctg tcccgacagc ctcgatgacg ctgacgccct cggaagcatg ctgatcagct     1800

ggtacatgag cggctaccac accggatact acatgggctt cagacagaac cagaaggagg     1860

gcagatgctc ccactctctg aactgaggag aaatgctggc atagagcagc actaaatgac     1920

accactaaag aaacgatcag acagatctag tataacccgg gccaaaagct cagtataacc     1980

cgggccaaaa gctcgacata attcgagcaa aaagctaaca taattcgagc aaaaagctct     2040

tgacaaacac cattgtcaca ctccaacaaa caccattgtc acactccaac aaacaccatt     2100

gtcacactcc accatagaca gctggttgaa ggggaccaaa acagctggtt gaaggggacc     2160

aaaacagctg gttgaagggg accaaacaag cttatcgata ccgtcgacta gagctcgctg     2220

atcagcctcg actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc     2280

ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc     2340

atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa     2400

gggggaggat tgggaagtct agagcaggca tgctggggag agatcgatct gaggaacccc     2460

tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag gccgggcgac     2520

caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag cgagcgcgca     2580

gagagggagt ggcc                                                       2594


<210>  26
<211>  735
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV2 (UniProt: P03135-1)

<400>  26

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
            20                  25                  30          


Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Arg Gln Leu Asp Ser Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu His Ser Pro Val Glu Pro Asp Ser Ser Ser Gly Thr Gly 
145                 150                 155                 160 


Lys Ala Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ala Asp Ser Val Pro Asp Pro Gln Pro Leu Gly Gln Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Leu Gly Thr Asn Thr Met Ala Thr Gly Ser Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His Tyr 
            260                 265                 270         


Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
        275                 280                 285             


Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp 
    290                 295                 300                 


Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln Val 
305                 310                 315                 320 


Lys Glu Val Thr Gln Asn Asp Gly Thr Thr Thr Ile Ala Asn Asn Leu 
                325                 330                 335     


Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu Pro Tyr 
            340                 345                 350         


Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp 
        355                 360                 365             


Val Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
    370                 375                 380                 


Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385                 390                 395                 400 


Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
                405                 410                 415     


Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp Arg 
            420                 425                 430         


Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Arg Thr 
        435                 440                 445             


Asn Thr Pro Ser Gly Thr Thr Thr Gln Ser Arg Leu Gln Phe Ser Gln 
    450                 455                 460                 


Ala Gly Ala Ser Asp Ile Arg Asp Gln Ser Arg Asn Trp Leu Pro Gly 
465                 470                 475                 480 


Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Ser Ala Asp Asn Asn 
                485                 490                 495     


Asn Ser Glu Tyr Ser Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly 
            500                 505                 510         


Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys Asp 
        515                 520                 525             


Asp Glu Glu Lys Phe Phe Pro Gln Ser Gly Val Leu Ile Phe Gly Lys 
    530                 535                 540                 


Gln Gly Ser Glu Lys Thr Asn Val Asp Ile Glu Lys Val Met Ile Thr 
545                 550                 555                 560 


Asp Glu Glu Glu Ile Arg Thr Thr Asn Pro Val Ala Thr Glu Gln Tyr 
                565                 570                 575     


Gly Ser Val Ser Thr Asn Leu Gln Arg Gly Asn Arg Gln Ala Ala Thr 
            580                 585                 590         


Ala Asp Val Asn Thr Gln Gly Val Leu Pro Gly Met Val Trp Gln Asp 
        595                 600                 605             


Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr 
    610                 615                 620                 


Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625                 630                 635                 640 


His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asn 
                645                 650                 655     


Pro Ser Thr Thr Phe Ser Ala Ala Lys Phe Ala Ser Phe Ile Thr Gln 
            660                 665                 670         


Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln Lys 
        675                 680                 685             


Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr 
    690                 695                 700                 


Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val Tyr 
705                 710                 715                 720 


Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735 


<210>  27
<211>  738
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV8 (UniProt: Q8JQF8_9VIRU)

<400>  27

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 
            180                 185                 190         


Pro Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ala Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Thr Tyr 
                405                 410                 415     


Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Thr Thr Gly Gly Thr Ala Asn Thr Gln Thr Leu Gly 
    450                 455                 460                 


Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asn Ser Leu Ala Asn Pro Gly Ile Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Asn Gly Ile Leu Ile 
    530                 535                 540                 


Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala 
            580                 585                 590         


Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 
        595                 600                 605             


Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 
    610                 615                 620                 


Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625                 630                 635                 640 


Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 
                645                 650                 655     


Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe 
            660                 665                 670         


Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 
        675                 680                 685             


Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 
    690                 695                 700                 


Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu 
705                 710                 715                 720 


Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 
                725                 730                 735     


Asn Leu 
        


<210>  28
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAVrh8 (Uniprot Q808Y3_9VIRU)

<400>  28

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 
            260                 265                 270         


Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn 
    370                 375                 380                 


Gly Ser Gln Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Val 
        435                 440                 445             


Arg Thr Gln Thr Thr Gly Thr Gly Gly Thr Gln Thr Leu Ala Phe Ser 
    450                 455                 460                 


Gln Ala Gly Pro Ser Ser Met Ala Asn Gln Ala Arg Asn Trp Val Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Asn Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn 
            500                 505                 510         


Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
        515                 520                 525             


Asp Asp Asp Asp Arg Phe Phe Pro Ser Ser Gly Val Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gln Val Leu Ile 
545                 550                 555                 560 


Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ala Val Ala Ile Asn Asn Gln Ala Ala Asn Thr Gln Ala Gln 
            580                 585                 590         


Thr Gly Leu Val His Asn Gln Gly Val Ile Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Leu Thr Phe Asn Gln Ala Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  29
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV9 (Uniprot Q6JC40_9VIRU)

<400>  29

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  30
<211>  738
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAVrh10 (Uniprot Q808W5_9VIRU)

<400>  30

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro 
            180                 185                 190         


Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
                405                 410                 415     


Gln Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Ser Thr Gly Gly Thr Ala Gly Thr Gln Gln Leu Leu 
    450                 455                 460                 


Phe Ser Gln Ala Gly Pro Asn Asn Met Ser Ala Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Leu Ser 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
    530                 535                 540                 


Phe Gly Lys Gln Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Gln Tyr Gly Val Val Ala Asp Asn Leu Gln Gln Gln Asn Ala Ala 
            580                 585                 590         


Pro Ile Val Gly Ala Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 
        595                 600                 605             


Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 
    610                 615                 620                 


Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625                 630                 635                 640 


Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 
                645                 650                 655     


Pro Ala Asp Pro Pro Thr Thr Phe Ser Gln Ala Lys Leu Ala Ser Phe 
            660                 665                 670         


Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 
        675                 680                 685             


Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 
    690                 695                 700                 


Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705                 710                 715                 720 


Gly Thr Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 
                725                 730                 735     


Asn Leu 
        


<210>  31
<211>  735
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV2v

<400>  31

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
            20                  25                  30          


Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Arg Gln Leu Asp Ser Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu His Ser Pro Val Glu Pro Asp Ser Ser Ser Gly Thr Gly 
145                 150                 155                 160 


Lys Ala Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ala Asp Ser Val Pro Asp Pro Gln Pro Leu Gly Gln Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Leu Gly Thr Asn Thr Met Ala Thr Gly Ser Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His Tyr 
            260                 265                 270         


Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
        275                 280                 285             


Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp 
    290                 295                 300                 


Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln Val 
305                 310                 315                 320 


Lys Glu Val Thr Gln Asn Asp Gly Thr Thr Thr Ile Ala Asn Asn Leu 
                325                 330                 335     


Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu Pro Tyr 
            340                 345                 350         


Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp 
        355                 360                 365             


Val Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
    370                 375                 380                 


Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385                 390                 395                 400 


Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
                405                 410                 415     


Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp Arg 
            420                 425                 430         


Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Phe Leu Ser Arg Thr 
        435                 440                 445             


Asn Thr Pro Ser Gly Thr Thr Thr Gln Ser Arg Leu Gln Phe Ser Gln 
    450                 455                 460                 


Ala Gly Ala Ser Asp Ile Arg Asp Gln Ser Arg Asn Trp Leu Pro Gly 
465                 470                 475                 480 


Pro Cys Tyr Arg Gln Gln Gly Val Ser Lys Val Ser Ala Asp Asn Asn 
                485                 490                 495     


Asn Ser Glu Phe Ser Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly 
            500                 505                 510         


Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys Asp 
        515                 520                 525             


Asp Glu Glu Lys Phe Phe Pro Gln Ser Gly Val Leu Ile Phe Gly Lys 
    530                 535                 540                 


Gln Gly Ser Glu Lys Thr Asn Val Asp Ile Glu Lys Val Met Ile Thr 
545                 550                 555                 560 


Asp Glu Glu Glu Ile Arg Thr Thr Asn Pro Val Ala Thr Glu Gln Tyr 
                565                 570                 575     


Gly Ser Val Ser Thr Asn Leu Gln Ser Gly Asn Thr Gln Ala Ala Thr 
            580                 585                 590         


Ala Asp Val Asn Thr Gln Gly Val Leu Pro Gly Met Val Trp Gln Asp 
        595                 600                 605             


Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr 
    610                 615                 620                 


Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625                 630                 635                 640 


His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asn 
                645                 650                 655     


Pro Ser Thr Thr Phe Ser Ala Ala Lys Phe Ala Ser Phe Ile Thr Gln 
            660                 665                 670         


Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln Lys 
        675                 680                 685             


Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr 
    690                 695                 700                 


Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val Tyr 
705                 710                 715                 720 


Ser Glu Pro Arg Pro Ile Gly Thr Arg Phe Leu Thr Arg Asn Leu 
                725                 730                 735 


<210>  32
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  AAV44-9

<400>  32

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 
            260                 265                 270         


Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn 
    370                 375                 380                 


Gly Ser Gln Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Thr 
                405                 410                 415     


Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Val 
        435                 440                 445             


Arg Thr Gln Thr Thr Gly Thr Gly Gly Thr Gln Thr Leu Ala Phe Ser 
    450                 455                 460                 


Gln Ala Gly Pro Ser Asn Met Ala Ser Gln Ala Arg Asn Trp Val Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Asn Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn 
            500                 505                 510         


Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
        515                 520                 525             


Asp Asp Glu Asp Arg Phe Phe Pro Ser Ser Gly Val Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gln Val Leu Ile 
545                 550                 555                 560 


Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Ala Val Ala Ile Asn Asn Gln Ala Ala Asn Thr Gln Ala Gln 
            580                 585                 590         


Thr Gly Leu Val His Asn Gln Gly Val Ile Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Leu Thr Phe Asn Gln Ala Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  33
<211>  294
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SMN protein (wt) amino acid sequence

<400>  33

Met Ala Met Ser Ser Gly Gly Ser Gly Gly Gly Val Pro Glu Gln Glu 
1               5                   10                  15      


Asp Ser Val Leu Phe Arg Arg Gly Thr Gly Gln Ser Asp Asp Ser Asp 
            20                  25                  30          


Ile Trp Asp Asp Thr Ala Leu Ile Lys Ala Tyr Asp Lys Ala Val Ala 
        35                  40                  45              


Ser Phe Lys His Ala Leu Lys Asn Gly Asp Ile Cys Glu Thr Ser Gly 
    50                  55                  60                  


Lys Pro Lys Thr Thr Pro Lys Arg Lys Pro Ala Lys Lys Asn Lys Ser 
65                  70                  75                  80  


Gln Lys Lys Asn Thr Ala Ala Ser Leu Gln Gln Trp Lys Val Gly Asp 
                85                  90                  95      


Lys Cys Ser Ala Ile Trp Ser Glu Asp Gly Cys Ile Tyr Pro Ala Thr 
            100                 105                 110         


Ile Ala Ser Ile Asp Phe Lys Arg Glu Thr Cys Val Val Val Tyr Thr 
        115                 120                 125             


Gly Tyr Gly Asn Arg Glu Glu Gln Asn Leu Ser Asp Leu Leu Ser Pro 
    130                 135                 140                 


Ile Cys Glu Val Ala Asn Asn Ile Glu Gln Asn Ala Gln Glu Asn Glu 
145                 150                 155                 160 


Asn Glu Ser Gln Val Ser Thr Asp Glu Ser Glu Asn Ser Arg Ser Pro 
                165                 170                 175     


Gly Asn Lys Ser Asp Asn Ile Lys Pro Lys Ser Ala Pro Trp Asn Ser 
            180                 185                 190         


Phe Leu Pro Pro Pro Pro Pro Met Pro Gly Pro Arg Leu Gly Pro Gly 
        195                 200                 205             


Lys Pro Gly Leu Lys Phe Asn Gly Pro Pro Pro Pro Pro Pro Pro Pro 
    210                 215                 220                 


Pro Pro His Leu Leu Ser Cys Trp Leu Pro Pro Phe Pro Ser Gly Pro 
225                 230                 235                 240 


Pro Ile Ile Pro Pro Pro Pro Pro Ile Cys Pro Asp Ser Leu Asp Asp 
                245                 250                 255     


Ala Asp Ala Leu Gly Ser Met Leu Ile Ser Trp Tyr Met Ser Gly Tyr 
            260                 265                 270         


His Thr Gly Tyr Tyr Met Gly Phe Arg Gln Asn Gln Lys Glu Gly Arg 
        275                 280                 285             


Cys Ser His Ser Leu Asn 
    290                 


<210>  34
<211>  882
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SMN nucleic acid sequence (wt)

<400>  34
atggcgatga gcagcggcgg cagtggtggc ggcgtcccgg agcaggagga ttccgtgctg       60

ttccggcgcg gcacaggcca gagcgatgat tctgacattt gggatgatac agcactgata      120

aaagcatatg ataaagctgt ggcttcattt aagcatgctc taaagaatgg tgacatttgt      180

gaaacttcgg gtaaaccaaa aaccacacct aaaagaaaac ctgctaagaa gaataaaagc      240

caaaagaaga atactgcagc ttccttacaa cagtggaaag ttggggacaa atgttctgcc      300

atttggtcag aagacggttg catttaccca gctaccattg cttcaattga ttttaagaga      360

gaaacctgtg ttgtggttta cactggatat ggaaatagag aggagcaaaa tctgtccgat      420

ctactttccc caatctgtga agtagctaat aatatagaac agaatgctca agagaatgaa      480

aatgaaagcc aagtttcaac agatgaaagt gagaactcca ggtctcctgg aaataaatca      540

gataacatca agcccaaatc tgctccatgg aactcttttc tccctccacc accccccatg      600

ccagggccaa gactgggacc aggaaagcca ggtctaaaat tcaatggccc accaccgcca      660

ccgccaccac caccacccca cttactatca tgctggctgc ctccatttcc ttctggacca      720

ccaataattc ccccaccacc tcccatatgt ccagattctc ttgatgatgc tgatgctttg      780

ggaagtatgt taatttcatg gtacatgagt ggctatcata ctggctatta tatgggtttt      840

agacaaaatc aaaaagaagg aaggtgctca cattccttaa at                         882


<210>  35
<211>  882
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized SMN nucleic acid sequence

<400>  35
atggccatga gcagcggagg aagcggagga ggagtgcccg agcaagagga cagcgtgctg       60

tttaggagag gaaccggaca gagcgatgac tccgatatct gggacgacac cgctctgatc      120

aaggcctatg acaaagccgt ggcctccttc aagcacgctc tgaagaatgg cgatatctgt      180

gagacctccg gcaaacctaa gaccaccccc aagaggaagc ccgccaagaa gaacaagtcc      240

cagaagaaga ataccgccgc tagcctccag cagtggaaag tgggcgataa gtgcagcgcc      300

atttggagcg aggatggatg catctacccc gccaccattg ccagcatcga cttcaagagg      360

gagacatgcg tggtggtgta taccggatac ggaaatagag aggagcagaa tctgagcgat      420

ctgctgtccc ccatctgcga ggtggccaat aatatcgagc agaacgccca agagaacgag      480

aacgaaagcc aagtgtccac cgatgagagc gagaactcca gaagccccgg aaacaagtcc      540

gacaacatca aacccaagag cgccccttgg aacagctttc tgcctcctcc cccccccatg      600

cccggcccta gactgggacc cggcaagccc ggactgaagt tcaacggacc cccccctcct      660

cctccccccc ctcctcctca tctgctgagc tgctggctcc cccctttccc tagcggcccc      720

cccattatcc ccccccctcc ccctatctgt cccgacagcc tcgatgacgc tgacgccctc      780

ggaagcatgc tgatcagctg gtacatgagc ggctaccaca ccggatacta catgggcttc      840

agacagaacc agaaggaggg cagatgctcc cactctctga ac                         882


<210>  36
<211>  850
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Exemplary Promoter Sequence (1)

<400>  36
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac tcgaggccac gttctgcttc      300

actctcccca tctccccccc ctccccaccc ccaattttgt atttatttat tttttaatta      360

ttttgtgcag cgatgggggc gggggggggg ggggggcgcg cgccaggcgg ggcggggcgg      420

ggcgaggggc ggggcggggc gaggcggaga ggtgcggcgg cagccaatca gagcggcgcg      480

ctccgaaagt ttccttttat ggcgaggcgg cggcggcggc ggccctataa aaagcgaagc      540

gcgcggcggg cgggagcggg atcagccacc gcggtggcgg cctagagtcg acgaggaact      600

gaaaaaccag aaagttaact ggtaagttta gtctttttgt cttttatttc aggtcccgga      660

tccggtggtg gtgcaaatca aagaactgct cctcagtgga tgttgccttt acttctaggc      720

ctgtacggaa gtgttacttc tgctctaaaa gctgcggaat tgtacccgcg gccgatccac      780

cggtccggaa ttcccgggat atcgtcgacc cacgcgtccg ggccccacgc tgcgcacccg      840

cgggtttgct                                                             850


<210>  37
<211>  796
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Exemplary Promoter Sequence (2)

<400>  37
ccgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat       60

tgacgtcaat agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac      120

ggtaaactgc ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg      180

acgtcaatga cggtaaatgg cccgcctggc attgtgccca gtacatgacc ttatgggact      240

ttcctacttg gcagtacatc tacgtattag tcatcgctat taccatggtc gaggtgagcc      300

ccacgttctg cttcactctc cccatctccc ccccctcccc acccccaatt ttgtatttat      360

ttatttttta attattttgt gcagcgatgg gggcgggggg gggggggggg cgcgcgccag      420

gcggggcggg gcggggcgag gggcggggcg gggcgaggcg gagaggtgcg gcggcagcca      480

atcagagcgg cgcgctccga aagtttcctt ttatggcgag gcggcggcgg cggcggccct      540

ataaaaagcg aagcgcgcgg cgggcgggag tcgctgcgac gctgccttcg ccccgtgccc      600

cgctccgccg ccgcctcgcg ccgcccgccc cggctctgac tgaccgcgtt actcccacag      660

gtgagcgggc gggacggccc ttctcctccg ggctgtaatt agctgagcaa gaggtaaggg      720

tttaagggat ggttggttgg tggggtatta atgtttaatt acctggagca cctgcctgaa      780

atcacttttt ttcagg                                                      796


<210>  38
<211>  448
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Core promoter (hSyn) sequences

<400>  38
agtgcaagtg ggttttagga ccaggatgag gcggggtggg ggtgcctacc tgacgaccga       60

ccccgaccca ctggacaagc acccaacccc cattccccaa attgcgcatc ccctatcaga      120

gagggggagg ggaaacagga tgcggcgagg cgcgtgcgca ctgccagctt cagcaccgcg      180

gacagtgcct tcgcccccgc ctggcggcgc gcgccaccgc cgcctcagca ctgaaggcgc      240

gctgacgtca ctcgccggtc ccccgcaaac tccccttccc ggccaccttg gtcgcgtccg      300

cgccgccgcc ggcccagccg gaccgcacca cgcgaggcgc gagatagggg ggcacgggcg      360

cgaccatctg cgctgcggcg ccggcgactc agcgctgcct cagtctgcgg tgggcagcgg      420

aggagtcgtg tcgtgcctga gagcgcag                                         448


<210>  39
<211>  304
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CMV enhancer

<400>  39
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catg                                                                   304


<210>  40
<211>  420
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ProC3 enhancer

<400>  40
gatgaattcc gccggaaact aggtccggag gactgccgga aacacctgta atcaagccgc       60

cggaaacctg ttgtggccgt atgccggaaa cgtcttaatt ggacgtgccg gaaactcttt      120

taatgagttc gccggaaacc agaccagccg agctgccgga aaccggttat atagaacggc      180

cggaaacggt ccacaggaaa aagccggaaa cacccaaacg gttagcgccg gaaacgactg      240

gggaggacgt gccggaaacg tactatctga agatgccgga aacacttgaa agctccaagc      300

cggaaacggg cccgtgcgga tagccggaaa ctgacggtac acggccgccg gaaacactac      360

ttgtatggta gccggaaact tgggcgtggc tggggccgga aacgctcgag atctgcgatc      420


<210>  41
<211>  450
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ProA5 enhancer

<400>  41
cctggaggcc ttcctggaag aagagatcct ggcaccgcac aaagagaagc acaggctttc       60

cagggctgag gagagggagg tcaagtgagg cccaggtgcc cctgcctgag cctgtgtccc      120

cagaaacctc ctctccctct catcaccccc acatcctccc tgccactccc cgcagctccc      180

tgtggccaag tgcactgcag cactcggctc tgctccacaa acggtctgct ccactccagg      240

aaggccacct cctccccccc cccccacctc cggctgtcac cactcaccgc tctagcctcc      300

agggggtggg gaccccagag ctggacacac cccatcgaag ccccacagct cagccagccg      360

gacagactca cggtcggact caagaccccg gagccctgag gtgggcagcg cgccagggtt      420

cctcgcagcc tcttcaaggt cagtgcaagt                                       450


<210>  42
<211>  472
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ProB15 enhancer

<400>  42
cagcccccgg gccctcctcc tccctctgcc tttttaaggg acgccctcca gggcgacccc       60

ggagggcgga cttgccaagc tgaagagaat cagtcaaaaa tccgcccaca ggggacacat      120

catttaaata aatgtgtttc tttgcccgaa cagaagttca gataggctcg attatcatta      180

attctgggtt tcacgtaacg agaggaaaca caggttgcaa taaaaataaa aaaatggttt      240

gaaatcaatt ttaactcatt ttgaacgtcc tcacacgttt gacaaaccga tttgtttcag      300

gagacttgct aatatctaaa tcggtgacag ggtgtttgct gtgagtgtgg ctctggaaaa      360

gttattaagc gttataaaaa aaatgatgta atgaaattct aattaatggg agggaagtgc      420

caacaaatca ctccttaaaa tattaacgct atcaaagaac agctggagaa gg              472


<210>  43
<211>  675
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ProC3 promoter

<400>  43
gaaacagctg agggtgccca gccggaaact cgaaatcaac gtaggccgga aactattcga       60

tgaattccgc cggaaactag gtccggagga ctgccggaaa cacctgtaat caagccgccg      120

gaaacctgtt gtggccgtat gccggaaacg tcttaattgg acgtgccgga aactctttta      180

atgagttcgc cggaaaccag accagccgag ctgccggaaa ccggttatat agaacggccg      240

gaaacggtcc acaggaaaaa gccggaaaca cccaaacggt tagcgccgga aacgactggg      300

gaggacgtgc cggaaacgta ctatctgaag atgccggaaa cacttgaaag ctccaagccg      360

gaaacgggcc cgtgcggata gccggaaact gacggtacac ggccgccgga aacactactt      420

gtatggtagc cggaaacttg ggcgtggctg gggccggaaa cgctcgagat ctgcgatctg      480

catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc gcccctaact      540

ccgcccagtt ccgcccattc tccgccccat cgctgactaa ttttttttat ttatgcagag      600

gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc      660

ctaggctttt gcaaa                                                       675


<210>  44
<211>  1178
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EF1a promoter

<400>  44
ggctccggtg cccgtcagtg ggcagagcgc acatcgccca cagtccccga gaagttgggg       60

ggaggggtcg gcaattgaac cggtgcctag agaaggtggc gcggggtaaa ctgggaaagt      120

gatgtcgtgt actggctccg cctttttccc gagggtgggg gagaaccgta tataagtgca      180

gtagtcgccg tgaacgttct ttttcgcaac gggtttgccg ccagaacaca ggtaagtgcc      240

gtgtgtggtt cccgcgggcc tggcctcttt acgggttatg gcccttgcgt gccttgaatt      300

acttccactg gctgcagtac gtgattcttg atcccgagct tcgggttgga agtgggtggg      360

agagttcgag gccttgcgct taaggagccc cttcgcctcg tgcttgagtt gaggcctggc      420

ctgggcgctg gggccgccgc gtgcgaatct ggtggcacct tcgcgcctgt ctcgctgctt      480

tcgataagtc tctagccatt taaaattttt gatgacctgc tgcgacgctt tttttctggc      540

aagatagtct tgtaaatgcg ggccaagatc tgcacactgg tatttcggtt tttggggccg      600

cgggcggcga cggggcccgt gcgtcccagc gcacatgttc ggcgaggcgg ggcctgcgag      660

cgcggccacc gagaatcgga cgggggtagt ctcaagctgg ccggcctgct ctggtgcctg      720

gcctcgcgcc gccgtgtatc gccccgccct gggcggcaag gctggcccgg tcggcaccag      780

ttgcgtgagc ggaaagatgg ccgcttcccg gccctgctgc agggagctca aaatggagga      840

cgcggcgctc gggagagcgg gcgggtgagt cacccacaca aaggaaaagg gcctttccgt      900

cctcagccgt cgcttcatgt gactccacgg agtaccgggc gccgtccagg cacctcgatt      960

agttctcgag cttttggagt acgtcgtctt taggttgggg ggaggggttt tatgcgatgg     1020

agtttcccca cactgagtgg gtggagactg aagttaggcc agcttggcac ttgatgtaat     1080

tctccttgga atttgccctt tttgagtttg gatcttggtt cattctcaag cctcagacag     1140

tggttcaaag tttttttctt ccatttcagg tgtcgtga                             1178


<210>  45
<211>  1984
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG301

<400>  45
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatg cacgcgtgga      120

tctgagttca attcacgcgt gctgcgtatg gcgtatagtg caagtgggtt ttaggaccag      180

gatgaggcgg ggtgggggtg cctacctgac gaccgacccc gacccactgg acaagcaccc      240

aacccccatt ccccaaattg cgcatcccct atcagagagg gggaggggaa acaggatgcg      300

gcgaggcgcg tgcgcactgc cagcttcagc accgcggaca gtgccttcgc ccccgcctgg      360

cggcgcgcgc caccgccgcc tcagcactga aggcgcgctg acgtcactcg ccggtccccc      420

gcaaactccc cttcccggcc accttggtcg cgtccgcgcc gccgccggcc cagccggacc      480

gcaccacgcg aggcgcgaga taggggggca cgggcgcgac catctgcgct gcggcgccgg      540

cgactcagcg ctgcctcagt ctgcggtggg cagcggagga gtcgtgtcgt gcctgagagc      600

gcagtcgaat tcaagctgct agccaccatg gcgatgagca gcggcggcag tggtggcggc      660

gtcccggagc aggaggattc cgtgctgttc cggcgcggca caggccagag cgatgattct      720

gacatttggg atgatacagc actgataaaa gcatatgata aagctgtggc ttcatttaag      780

catgctctaa agaatggtga catttgtgaa acttcgggta aaccaaaaac cacacctaaa      840

agaaaacctg ctaagaagaa taaaagccaa aagaagaata ctgcagcttc cttacaacag      900

tggaaagttg gggacaaatg ttctgccatt tggtcagaag acggttgcat ttacccagct      960

accattgctt caattgattt taagagagaa acctgtgttg tggtttacac tggatatgga     1020

aatagagagg agcaaaatct gtccgatcta ctttccccaa tctgtgaagt agctaataat     1080

atagaacaga atgctcaaga gaatgaaaat gaaagccaag tttcaacaga tgaaagtgag     1140

aactccaggt ctcctggaaa taaatcagat aacatcaagc ccaaatctgc tccatggaac     1200

tcttttctcc ctccaccacc ccccatgcca gggccaagac tgggaccagg aaagccaggt     1260

ctaaaattca atggcccacc accgccaccg ccaccaccac caccccactt actatcatgc     1320

tggctgcctc catttccttc tggaccacca ataattcccc caccacctcc catatgtcca     1380

gattctcttg atgatgctga tgctttggga agtatgttaa tttcatggta catgagtggc     1440

tatcatactg gctattatat gggttttaga caaaatcaaa aagaaggaag gtgctcacat     1500

tccttaaatt aaggagaaat gctggcatag agcagcacta aatgacacca ctaaagaaac     1560

gatcagacag atctacaaag cttatcgata ccgtcgacta gagctcgctg atcagcctcg     1620

actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc     1680

ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt     1740

ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat     1800

tgggaagtct agagcaggca tgctggggag agatcgatct gaggaacccc tagtgatgga     1860

gttggccact ccctctctgc gcgctcgctc gctcactgag gccgggcgac caaaggtcgc     1920

ccgacgcccg ggctttgccc gggcggcctc agtgagcgag cgagcgcgca gagagggagt     1980

ggcc                                                                  1984


<210>  46
<211>  2199
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG302

<400>  46
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatg cacgcgtgga      120

tctgagttca acgcgtaaag ctggaactgg ggccggaaac agctgagggt gcccagccgg      180

aaactcgaaa tcaacgtagg ccggaaacta ttcgatgaat tccgccggaa actaggtccg      240

gaggactgcc ggaaacacct gtaatcaagc cgccggaaac ctgttgtggc cgtatgccgg      300

aaacgtctta attggacgtg ccggaaactc ttttaatgag ttcgccggaa accagaccag      360

ccgagctgcc ggaaaccggt tatatagaac ggccggaaac ggtccacagg aaaaagccgg      420

aaacacccaa acggttagcg ccggaaacga ctggggagga cgtgccggaa acgtactatc      480

tgaagatgcc ggaaacactt gaaagctcca agccggaaac gggcccgtgc ggatagccgg      540

aaactgacgg tacacggccg ccggaaacac tacttgtatg gtagccggaa acttgggcgt      600

ggctggggcc ggaaacgctc gagatctgcg atctgcatct caattagtca gcaaccatag      660

tcccgcccct aactccgccc atcccgcccc taactccgcc cagttccgcc cattctccgc      720

cccatcgctg actaattttt tttatttatg cagaggccga ggccgcctcg gcctctgagc      780

tattccagaa gtagtgagga ggcttttttg gaggcctagg cttttgcaaa ggatccgcca      840

ccatggcgat gagcagcggc ggcagtggtg gcggcgtccc ggagcaggag gattccgtgc      900

tgttccggcg cggcacaggc cagagcgatg attctgacat ttgggatgat acagcactga      960

taaaagcata tgataaagct gtggcttcat ttaagcatgc tctaaagaat ggtgacattt     1020

gtgaaacttc gggtaaacca aaaaccacac ctaaaagaaa acctgctaag aagaataaaa     1080

gccaaaagaa gaatactgca gcttccttac aacagtggaa agttggggac aaatgttctg     1140

ccatttggtc agaagacggt tgcatttacc cagctaccat tgcttcaatt gattttaaga     1200

gagaaacctg tgttgtggtt tacactggat atggaaatag agaggagcaa aatctgtccg     1260

atctactttc cccaatctgt gaagtagcta ataatataga acagaatgct caagagaatg     1320

aaaatgaaag ccaagtttca acagatgaaa gtgagaactc caggtctcct ggaaataaat     1380

cagataacat caagcccaaa tctgctccat ggaactcttt tctccctcca ccacccccca     1440

tgccagggcc aagactggga ccaggaaagc caggtctaaa attcaatggc ccaccaccgc     1500

caccgccacc accaccaccc cacttactat catgctggct gcctccattt ccttctggac     1560

caccaataat tcccccacca cctcccatat gtccagattc tcttgatgat gctgatgctt     1620

tgggaagtat gttaatttca tggtacatga gtggctatca tactggctat tatatgggtt     1680

ttagacaaaa tcaaaaagaa ggaaggtgct cacattcctt aaattaagga gaaatgctgg     1740

catagagcag cactaaatga caccactaaa gaaacgatca gacagatcta caaagcttat     1800

cgataccgtc gactagagct cgctgatcag cctcgactgt gccttctagt tgccagccat     1860

ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc     1920

tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg     1980

ggggtggggt ggggcaggac agcaaggggg aggattggga agtctagagc aggcatgctg     2040

gggagagatc gatctgagga acccctagtg atggagttgg ccactccctc tctgcgcgct     2100

cgctcgctca ctgaggccgg gcgaccaaag gtcgcccgac gcccgggctt tgcccgggcg     2160

gcctcagtga gcgagcgagc gcgcagagag ggagtggcc                            2199


<210>  47
<211>  2685
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG303

<400>  47
cctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc gggcgacctt       60

tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggaat gcacgcgtgg      120

atctgagttc aattcacgcg tgtggctccg gtgcccgtca gtgggcagag cgcacatcgc      180

ccacagtccc cgagaagttg gggggagggg tcggcaattg aaccggtgcc tagagaaggt      240

ggcgcggggt aaactgggaa agtgatgtcg tgtactggct ccgccttttt cccgagggtg      300

ggggagaacc gtatataagt gcagtagtcg ccgtgaacgt tctttttcgc aacgggtttg      360

ccgccagaac acaggtaagt gccgtgtgtg gttcccgcgg gcctggcctc tttacgggtt      420

atggcccttg cgtgccttga attacttcca ctggctgcag tacgtgattc ttgatcccga      480

gcttcgggtt ggaagtgggt gggagagttc gaggccttgc gcttaaggag ccccttcgcc      540

tcgtgcttga gttgaggcct ggcctgggcg ctggggccgc cgcgtgcgaa tctggtggca      600

ccttcgcgcc tgtctcgctg ctttcgataa gtctctagcc atttaaaatt tttgatgacc      660

tgctgcgacg ctttttttct ggcaagatag tcttgtaaat gcgggccaag atctgcacac      720

tggtatttcg gtttttgggg ccgcgggcgg cgacggggcc cgtgcgtccc agcgcacatg      780

ttcggcgagg cggggcctgc gagcgcggcc accgagaatc ggacgggggt agtctcaagc      840

tggccggcct gctctggtgc ctggcctcgc gccgccgtgt atcgccccgc cctgggcggc      900

aaggctggcc cggtcggcac cagttgcgtg agcggaaaga tggccgcttc ccggccctgc      960

tgcagggagc tcaaaatgga ggacgcggcg ctcgggagag cgggcgggtg agtcacccac     1020

acaaaggaaa agggcctttc cgtcctcagc cgtcgcttca tgtgactcca cggagtaccg     1080

ggcgccgtcc aggcacctcg attagttctc gagcttttgg agtacgtcgt ctttaggttg     1140

gggggagggg ttttatgcga tggagtttcc ccacactgag tgggtggaga ctgaagttag     1200

gccagcttgg cacttgatgt aattctcctt ggaatttgcc ctttttgagt ttggatcttg     1260

gttcattctc aagcctcaga cagtggttca aagttttttt cttccatttc aggtgtcgtg     1320

acgccaccat ggcgatgagc agcggcggca gtggtggcgg cgtcccggag caggaggatt     1380

ccgtgctgtt ccggcgcggc acaggccaga gcgatgattc tgacatttgg gatgatacag     1440

cactgataaa agcatatgat aaagctgtgg cttcatttaa gcatgctcta aagaatggtg     1500

acatttgtga aacttcgggt aaaccaaaaa ccacacctaa aagaaaacct gctaagaaga     1560

ataaaagcca aaagaagaat actgcagctt ccttacaaca gtggaaagtt ggggacaaat     1620

gttctgccat ttggtcagaa gacggttgca tttacccagc taccattgct tcaattgatt     1680

ttaagagaga aacctgtgtt gtggtttaca ctggatatgg aaatagagag gagcaaaatc     1740

tgtccgatct actttcccca atctgtgaag tagctaataa tatagaacag aatgctcaag     1800

agaatgaaaa tgaaagccaa gtttcaacag atgaaagtga gaactccagg tctcctggaa     1860

ataaatcaga taacatcaag cccaaatctg ctccatggaa ctcttttctc cctccaccac     1920

cccccatgcc agggccaaga ctgggaccag gaaagccagg tctaaaattc aatggcccac     1980

caccgccacc gccaccacca ccaccccact tactatcatg ctggctgcct ccatttcctt     2040

ctggaccacc aataattccc ccaccacctc ccatatgtcc agattctctt gatgatgctg     2100

atgctttggg aagtatgtta atttcatggt acatgagtgg ctatcatact ggctattata     2160

tgggttttag acaaaatcaa aaagaaggaa ggtgctcaca ttccttaaat taaggagaaa     2220

tgctggcata gagcagcact aaatgacacc actaaagaaa cgatcagaca gatctacaaa     2280

gcttatcgat accgtcgact agagctcgct gatcagcctc gactgtgcct tctagttgcc     2340

agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt gccactccca     2400

ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg tgtcattcta     2460

ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagtc tagagcaggc     2520

atgctgggga gagatcgatc tgaggaaccc ctagtgatgg agttggccac tccctctctg     2580

cgcgctcgct cgctcactga ggccgggcga ccaaaggtcg cccgacgccc gggctttgcc     2640

cgggcggcct cagtgagcga gcgagcgcgc agagagggag tggcc                     2685


<210>  48
<211>  2452
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG304

<400>  48
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatg cacgcgtgga      120

tctgagttca acgcgtaaag ctggaactgg ggccggaaac agctgagggt gcccagccgg      180

aaactcgaaa tcaacgtagg ccggaaacta ttcgatgaat tccgccggaa actaggtccg      240

gaggactgcc ggaaacacct gtaatcaagc cgccggaaac ctgttgtggc cgtatgccgg      300

aaacgtctta attggacgtg ccggaaactc ttttaatgag ttcgccggaa accagaccag      360

ccgagctgcc ggaaaccggt tatatagaac ggccggaaac ggtccacagg aaaaagccgg      420

aaacacccaa acggttagcg ccggaaacga ctggggagga cgtgccggaa acgtactatc      480

tgaagatgcc ggaaacactt gaaagctcca agccggaaac gggcccgtgc ggatagccgg      540

aaactgacgg tacacggccg ccggaaacac tacttgtatg gtagccggaa acttgggcgt      600

ggctggggcc ggaaacgctc gagatctgcg atcagtgcaa gtgggtttta ggaccaggat      660

gaggcggggt gggggtgcct acctgacgac cgaccccgac ccactggaca agcacccaac      720

ccccattccc caaattgcgc atcccctatc agagaggggg aggggaaaca ggatgcggcg      780

aggcgcgtgc gcactgccag cttcagcacc gcggacagtg ccttcgcccc cgcctggcgg      840

cgcgcgccac cgccgcctca gcactgaagg cgcgctgacg tcactcgccg gtcccccgca      900

aactcccctt cccggccacc ttggtcgcgt ccgcgccgcc gccggcccag ccggaccgca      960

ccacgcgagg cgcgagatag gggggcacgg gcgcgaccat ctgcgctgcg gcgccggcga     1020

ctcagcgctg cctcagtctg cggtgggcag cggaggagtc gtgtcgtgcc tgagagcgca     1080

gggatacacg ccaccatggc gatgagcagc ggcggcagtg gtggcggcgt cccggagcag     1140

gaggattccg tgctgttccg gcgcggcaca ggccagagcg atgattctga catttgggat     1200

gatacagcac tgataaaagc atatgataaa gctgtggctt catttaagca tgctctaaag     1260

aatggtgaca tttgtgaaac ttcgggtaaa ccaaaaacca cacctaaaag aaaacctgct     1320

aagaagaata aaagccaaaa gaagaatact gcagcttcct tacaacagtg gaaagttggg     1380

gacaaatgtt ctgccatttg gtcagaagac ggttgcattt acccagctac cattgcttca     1440

attgatttta agagagaaac ctgtgttgtg gtttacactg gatatggaaa tagagaggag     1500

caaaatctgt ccgatctact ttccccaatc tgtgaagtag ctaataatat agaacagaat     1560

gctcaagaga atgaaaatga aagccaagtt tcaacagatg aaagtgagaa ctccaggtct     1620

cctggaaata aatcagataa catcaagccc aaatctgctc catggaactc ttttctccct     1680

ccaccacccc ccatgccagg gccaagactg ggaccaggaa agccaggtct aaaattcaat     1740

ggcccaccac cgccaccgcc accaccacca ccccacttac tatcatgctg gctgcctcca     1800

tttccttctg gaccaccaat aattccccca ccacctccca tatgtccaga ttctcttgat     1860

gatgctgatg ctttgggaag tatgttaatt tcatggtaca tgagtggcta tcatactggc     1920

tattatatgg gttttagaca aaatcaaaaa gaaggaaggt gctcacattc cttaaattaa     1980

ggagaaatgc tggcatagag cagcactaaa tgacaccact aaagaaacga tcagacagat     2040

ctacaaagct tatcgatacc gtcgactaga gctcgctgat cagcctcgac tgtgccttct     2100

agttgccagc catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc     2160

actcccactg tcctttccta ataaaatgag gaaattgcat cgcattgtct gagtaggtgt     2220

cattctattc tggggggtgg ggtggggcag gacagcaagg gggaggattg ggaagtctag     2280

agcaggcatg ctggggagag atcgatctga ggaaccccta gtgatggagt tggccactcc     2340

ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca aaggtcgccc gacgcccggg     2400

ctttgcccgg gcggcctcag tgagcgagcg agcgcgcaga gagggagtgg cc             2452


<210>  49
<211>  2451
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG305

<400>  49
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatg cacgcgtgga      120

tctgagttcg cgtgcccctg cctgcgcgag ggcgggaaga cagcccccgg gccctcctcc      180

tccctctgcc tttttaaggg acgccctcca gggcgacccc ggagggcgga cttgccaagc      240

tgaagagaat cagtcaaaaa tccgcccaca ggggacacat catttaaata aatgtgtttc      300

tttgcccgaa cagaagttca gataggctcg attatcatta attctgggtt tcacgtaacg      360

agaggaaaca caggttgcaa taaaaataaa aaaatggttt gaaatcaatt ttaactcatt      420

ttgaacgtcc tcacacgttt gacaaaccga tttgtttcag gagacttgct aatatctaaa      480

tcggtgacag ggtgtttgct gtgagtgtgg ctctggaaaa gttattaagc gttataaaaa      540

aaatgatgta atgaaattct aattaatggg agggaagtgc caacaaatca ctccttaaaa      600

tattaacgct atcaaagaac agctggagaa ggagtgcaag tgggttttag gaccaggatg      660

aggcggggtg ggggtgccta cctgacgacc gaccccgacc cactggacaa gcacccaacc      720

cccattcccc aaattgcgca tcccctatca gagaggggga ggggaaacag gatgcggcga      780

ggcgcgtgcg cactgccagc ttcagcaccg cggacagtgc cttcgccccc gcctggcggc      840

gcgcgccacc gccgcctcag cactgaaggc gcgctgacgt cactcgccgg tcccccgcaa      900

actccccttc ccggccacct tggtcgcgtc cgcgccgccg ccggcccagc cggaccgcac      960

cacgcgaggc gcgagatagg ggggcacggg cgcgaccatc tgcgctgcgg cgccggcgac     1020

tcagcgctgc ctcagtctgc ggtgggcagc ggaggagtcg tgtcgtgcct gagagcgcag     1080

ggatacacgc caccatggcg atgagcagcg gcggcagtgg tggcggcgtc ccggagcagg     1140

aggattccgt gctgttccgg cgcggcacag gccagagcga tgattctgac atttgggatg     1200

atacagcact gataaaagca tatgataaag ctgtggcttc atttaagcat gctctaaaga     1260

atggtgacat ttgtgaaact tcgggtaaac caaaaaccac acctaaaaga aaacctgcta     1320

agaagaataa aagccaaaag aagaatactg cagcttcctt acaacagtgg aaagttgggg     1380

acaaatgttc tgccatttgg tcagaagacg gttgcattta cccagctacc attgcttcaa     1440

ttgattttaa gagagaaacc tgtgttgtgg tttacactgg atatggaaat agagaggagc     1500

aaaatctgtc cgatctactt tccccaatct gtgaagtagc taataatata gaacagaatg     1560

ctcaagagaa tgaaaatgaa agccaagttt caacagatga aagtgagaac tccaggtctc     1620

ctggaaataa atcagataac atcaagccca aatctgctcc atggaactct tttctccctc     1680

caccaccccc catgccaggg ccaagactgg gaccaggaaa gccaggtcta aaattcaatg     1740

gcccaccacc gccaccgcca ccaccaccac cccacttact atcatgctgg ctgcctccat     1800

ttccttctgg accaccaata attcccccac cacctcccat atgtccagat tctcttgatg     1860

atgctgatgc tttgggaagt atgttaattt catggtacat gagtggctat catactggct     1920

attatatggg ttttagacaa aatcaaaaag aaggaaggtg ctcacattcc ttaaattaag     1980

gagaaatgct ggcatagagc agcactaaat gacaccacta aagaaacgat cagacagatc     2040

tacaaagctt atcgataccg tcgactagag ctcgctgatc agcctcgact gtgccttcta     2100

gttgccagcc atctgttgtt tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca     2160

ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg agtaggtgtc     2220

attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg gaagtctaga     2280

gcaggcatgc tggggagaga tcgatctgag gaacccctag tgatggagtt ggccactccc     2340

tctctgcgcg ctcgctcgct cactgaggcc gggcgaccaa aggtcgcccg acgcccgggc     2400

tttgcccggg cggcctcagt gagcgagcga gcgcgcagag agggagtggc c              2451


<210>  50
<211>  2450
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG306

<400>  50
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatg cacgcgtgga      120

tctgagttcg cgtggtgccc aggcagtggg agcagggctg accagagttc tgcagagatt      180

gcctggaggc cttcctggaa gaagagatcc tggcaccgca caaagagaag cacaggcttt      240

ccagggctga ggagagggag gtcaagtgag gcccaggtgc ccctgcctga gcctgtgtcc      300

ccagaaacct cctctccctc tcatcacccc cacatcctcc ctgccactcc ccgcagctcc      360

ctgtggccaa gtgcactgca gcactcggct ctgctccaca aacggtctgc tccactccag      420

gaaggccacc tcctcccccc ccccccacct ccggctgtca ccactcaccg ctctagcctc      480

cagggggtgg ggaccccaga gctggacaca ccccatcgaa gccccacagc tcagccagcc      540

ggacagactc acggtcggac tcaagacccc ggagccctga ggtgggcagc gcgccagggt      600

tcctcgcagc ctcttcaagg tcagtgcaag tagtgcaagt gggttttagg accaggatga      660

ggcggggtgg gggtgcctac ctgacgaccg accccgaccc actggacaag cacccaaccc      720

ccattcccca aattgcgcat cccctatcag agagggggag gggaaacagg atgcggcgag      780

gcgcgtgcgc actgccagct tcagcaccgc ggacagtgcc ttcgcccccg cctggcggcg      840

cgcgccaccg ccgcctcagc actgaaggcg cgctgacgtc actcgccggt cccccgcaaa      900

ctccccttcc cggccacctt ggtcgcgtcc gcgccgccgc cggcccagcc ggaccgcacc      960

acgcgaggcg cgagataggg gggcacgggc gcgaccatct gcgctgcggc gccggcgact     1020

cagcgctgcc tcagtctgcg gtgggcagcg gaggagtcgt gtcgtgcctg agagcgcagg     1080

gatacacgcc accatggcga tgagcagcgg cggcagtggt ggcggcgtcc cggagcagga     1140

ggattccgtg ctgttccggc gcggcacagg ccagagcgat gattctgaca tttgggatga     1200

tacagcactg ataaaagcat atgataaagc tgtggcttca tttaagcatg ctctaaagaa     1260

tggtgacatt tgtgaaactt cgggtaaacc aaaaaccaca cctaaaagaa aacctgctaa     1320

gaagaataaa agccaaaaga agaatactgc agcttcctta caacagtgga aagttgggga     1380

caaatgttct gccatttggt cagaagacgg ttgcatttac ccagctacca ttgcttcaat     1440

tgattttaag agagaaacct gtgttgtggt ttacactgga tatggaaata gagaggagca     1500

aaatctgtcc gatctacttt ccccaatctg tgaagtagct aataatatag aacagaatgc     1560

tcaagagaat gaaaatgaaa gccaagtttc aacagatgaa agtgagaact ccaggtctcc     1620

tggaaataaa tcagataaca tcaagcccaa atctgctcca tggaactctt ttctccctcc     1680

accacccccc atgccagggc caagactggg accaggaaag ccaggtctaa aattcaatgg     1740

cccaccaccg ccaccgccac caccaccacc ccacttacta tcatgctggc tgcctccatt     1800

tccttctgga ccaccaataa ttcccccacc acctcccata tgtccagatt ctcttgatga     1860

tgctgatgct ttgggaagta tgttaatttc atggtacatg agtggctatc atactggcta     1920

ttatatgggt tttagacaaa atcaaaaaga aggaaggtgc tcacattcct taaattaagg     1980

agaaatgctg gcatagagca gcactaaatg acaccactaa agaaacgatc agacagatct     2040

acaaagctta tcgataccgt cgactagagc tcgctgatca gcctcgactg tgccttctag     2100

ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac     2160

tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga gtaggtgtca     2220

ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg aagtctagag     2280

caggcatgct ggggagagat cgatctgagg aacccctagt gatggagttg gccactccct     2340

ctctgcgcgc tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga cgcccgggct     2400

ttgcccgggc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc                2450


<210>  51
<211>  2256
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG307

<400>  51
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatg cacgcgtgga      120

tctgagttcg cgtcgttaca taacttacgg taaatggccc gcctggctga ccgcccaacg      180

acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca atagggactt      240

tccattgacg tcaatgggtg gagtatttac ggtaaactgc ccacttggca gtacatcaag      300

tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg cccgcctggc      360

attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc tacgtattag      420

tcatcgctat taccatgagt gcaagtgggt tttaggacca ggatgaggcg gggtgggggt      480

gcctacctga cgaccgaccc cgacccactg gacaagcacc caacccccat tccccaaatt      540

gcgcatcccc tatcagagag ggggagggga aacaggatgc ggcgaggcgc gtgcgcactg      600

ccagcttcag caccgcggac agtgccttcg cccccgcctg gcggcgcgcg ccaccgccgc      660

ctcagcactg aaggcgcgct gacgtcactc gccggtcccc cgcaaactcc ccttcccggc      720

caccttggtc gcgtccgcgc cgccgccggc ccagccggac cgcaccacgc gaggcgcgag      780

ataggggggc acgggcgcga ccatctgcgc tgcggcgccg gcgactcagc gctgcctcag      840

tctgcggtgg gcagcggagg agtcgtgtcg tgcctgagag cgcagggata cacgccacca      900

tggcgatgag cagcggcggc agtggtggcg gcgtcccgga gcaggaggat tccgtgctgt      960

tccggcgcgg cacaggccag agcgatgatt ctgacatttg ggatgataca gcactgataa     1020

aagcatatga taaagctgtg gcttcattta agcatgctct aaagaatggt gacatttgtg     1080

aaacttcggg taaaccaaaa accacaccta aaagaaaacc tgctaagaag aataaaagcc     1140

aaaagaagaa tactgcagct tccttacaac agtggaaagt tggggacaaa tgttctgcca     1200

tttggtcaga agacggttgc atttacccag ctaccattgc ttcaattgat tttaagagag     1260

aaacctgtgt tgtggtttac actggatatg gaaatagaga ggagcaaaat ctgtccgatc     1320

tactttcccc aatctgtgaa gtagctaata atatagaaca gaatgctcaa gagaatgaaa     1380

atgaaagcca agtttcaaca gatgaaagtg agaactccag gtctcctgga aataaatcag     1440

ataacatcaa gcccaaatct gctccatgga actcttttct ccctccacca ccccccatgc     1500

cagggccaag actgggacca ggaaagccag gtctaaaatt caatggccca ccaccgccac     1560

cgccaccacc accaccccac ttactatcat gctggctgcc tccatttcct tctggaccac     1620

caataattcc cccaccacct cccatatgtc cagattctct tgatgatgct gatgctttgg     1680

gaagtatgtt aatttcatgg tacatgagtg gctatcatac tggctattat atgggtttta     1740

gacaaaatca aaaagaagga aggtgctcac attccttaaa ttaaggagaa atgctggcat     1800

agagcagcac taaatgacac cactaaagaa acgatcagac agatctacaa agcttatcga     1860

taccgtcgac tagagctcgc tgatcagcct cgactgtgcc ttctagttgc cagccatctg     1920

ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt     1980

cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg     2040

gtggggtggg gcaggacagc aagggggagg attgggaagt ctagagcagg catgctgggg     2100

agagatcgat ctgaggaacc cctagtgatg gagttggcca ctccctctct gcgcgctcgc     2160

tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc ccgggcggcc     2220

tcagtgagcg agcgagcgcg cagagaggga gtggcc                               2256


<210>  52
<211>  2256
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG340

<400>  52
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatg cacgcgtgga      120

tctgagttcg cgtcgttaca taacttacgg taaatggccc gcctggctga ccgcccaacg      180

acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca atagggactt      240

tccattgacg tcaatgggtg gagtatttac ggtaaactgc ccacttggca gtacatcaag      300

tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg cccgcctggc      360

attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc tacgtattag      420

tcatcgctat taccatgagt gcaagtgggt tttaggacca ggatgaggcg gggtgggggt      480

gcctacctga cgaccgaccc cgacccactg gacaagcacc caacccccat tccccaaatt      540

gcgcatcccc tatcagagag ggggagggga aacaggatgc ggcgaggcgc gtgcgcactg      600

ccagcttcag caccgcggac agtgccttcg cccccgcctg gcggcgcgcg ccaccgccgc      660

ctcagcactg aaggcgcgct gacgtcactc gccggtcccc cgcaaactcc ccttcccggc      720

caccttggtc gcgtccgcgc cgccgccggc ccagccggac cgcaccacgc gaggcgcgag      780

ataggggggc acgggcgcga ccatctgcgc tgcggcgccg gcgactcagc gctgcctcag      840

tctgcggtgg gcagcggagg agtcgtgtcg tgcctgagag cgcagggata cacgccacca      900

tggccatgag cagcggagga agcggaggag gagtgcccga gcaagaggac agcgtgctgt      960

ttaggagagg aaccggacag agcgatgact ccgatatctg ggacgacacc gctctgatca     1020

aggcctatga caaagccgtg gcctccttca agcacgctct gaagaatggc gatatctgtg     1080

agacctccgg caaacctaag accaccccca agaggaagcc cgccaagaag aacaagtccc     1140

agaagaagaa taccgccgct agcctccagc agtggaaagt gggcgataag tgcagcgcca     1200

tttggagcga ggatggatgc atctaccccg ccaccattgc cagcatcgac ttcaagaggg     1260

agacatgcgt ggtggtgtat accggatacg gaaatagaga ggagcagaat ctgagcgatc     1320

tgctgtcccc catctgcgag gtggccaata atatcgagca gaacgcccaa gagaacgaga     1380

acgaaagcca agtgtccacc gatgagagcg agaactccag aagccccgga aacaagtccg     1440

acaacatcaa acccaagagc gccccttgga acagctttct gcctcctccc ccccccatgc     1500

ccggccctag actgggaccc ggcaagcccg gactgaagtt caacggaccc ccccctcctc     1560

ctcccccccc tcctcctcat ctgctgagct gctggctccc ccctttccct agcggccccc     1620

ccattatccc cccccctccc cctatctgtc ccgacagcct cgatgacgct gacgccctcg     1680

gaagcatgct gatcagctgg tacatgagcg gctaccacac cggatactac atgggcttca     1740

gacagaacca gaaggagggc agatgctccc actctctgaa ctgaggagaa atgctggcat     1800

agagcagcac taaatgacac cactaaagaa acgatcagac agatctacaa agcttatcga     1860

taccgtcgac tagagctcgc tgatcagcct cgactgtgcc ttctagttgc cagccatctg     1920

ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt     1980

cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg     2040

gtggggtggg gcaggacagc aagggggagg attgggaagt ctagagcagg catgctgggg     2100

agagatcgat ctgaggaacc cctagtgatg gagttggcca ctccctctct gcgcgctcgc     2160

tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc ccgggcggcc     2220

tcagtgagcg agcgagcgcg cagagaggga gtggcc                               2256


<210>  53
<211>  2503
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EXG341

<400>  53
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatg cacgcgtgga      120

tctgagttcg cgtcgttaca taacttacgg taaatggccc gcctggctga ccgcccaacg      180

acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca atagggactt      240

tccattgacg tcaatgggtg gagtatttac ggtaaactgc ccacttggca gtacatcaag      300

tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg cccgcctggc      360

attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc tacgtattag      420

tcatcgctat taccatgagt gcaagtgggt tttaggacca ggatgaggcg gggtgggggt      480

gcctacctga cgaccgaccc cgacccactg gacaagcacc caacccccat tccccaaatt      540

gcgcatcccc tatcagagag ggggagggga aacaggatgc ggcgaggcgc gtgcgcactg      600

ccagcttcag caccgcggac agtgccttcg cccccgcctg gcggcgcgcg ccaccgccgc      660

ctcagcactg aaggcgcgct gacgtcactc gccggtcccc cgcaaactcc ccttcccggc      720

caccttggtc gcgtccgcgc cgccgccggc ccagccggac cgcaccacgc gaggcgcgag      780

ataggggggc acgggcgcga ccatctgcgc tgcggcgccg gcgactcagc gctgcctcag      840

tctgcggtgg gcagcggagg agtcgtgtcg tgcctgagag cgcagggata cacgccacca      900

tggccatgag cagcggagga agcggaggag gagtgcccga gcaagaggac agcgtgctgt      960

ttaggagagg aaccggacag agcgatgact ccgatatctg ggacgacacc gctctgatca     1020

aggcctatga caaagccgtg gcctccttca agcacgctct gaagaatggc gatatctgtg     1080

agacctccgg caaacctaag accaccccca agaggaagcc cgccaagaag aacaagtccc     1140

agaagaagaa taccgccgct agcctccagc agtggaaagt gggcgataag tgcagcgcca     1200

tttggagcga ggatggatgc atctaccccg ccaccattgc cagcatcgac ttcaagaggg     1260

agacatgcgt ggtggtgtat accggatacg gaaatagaga ggagcagaat ctgagcgatc     1320

tgctgtcccc catctgcgag gtggccaata atatcgagca gaacgcccaa gagaacgaga     1380

acgaaagcca agtgtccacc gatgagagcg agaactccag aagccccgga aacaagtccg     1440

acaacatcaa acccaagagc gccccttgga acagctttct gcctcctccc ccccccatgc     1500

ccggccctag actgggaccc ggcaagcccg gactgaagtt caacggaccc ccccctcctc     1560

ctcccccccc tcctcctcat ctgctgagct gctggctccc ccctttccct agcggccccc     1620

ccattatccc cccccctccc cctatctgtc ccgacagcct cgatgacgct gacgccctcg     1680

gaagcatgct gatcagctgg tacatgagcg gctaccacac cggatactac atgggcttca     1740

gacagaacca gaaggagggc agatgctccc actctctgaa ctgaggagaa atgctggcat     1800

agagcagcac taaatgacac cactaaagaa acgatcagac agatctataa tcaacctctg     1860

gattacaaaa tttgtgaaag attgactggt attcttaact atgttgctcc ttttacgcta     1920

tgtggatacg ctgctttaat gcctttgtat catgctattg cttcccgtat ggctttcatt     1980

ttctcctcct tgtataaatc ctggttagtt cttgccacgg cggaactcat cgccgcctgc     2040

cttgcccgct gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttaagc     2100

ttatcgatac cgtcgactag agctcgctga tcagcctcga ctgtgccttc tagttgccag     2160

ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact     2220

gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt     2280

ctggggggtg gggtggggca ggacagcaag ggggaggatt gggaagtcta gagcaggcat     2340

gctggggaga gatcgatctg aggaacccct agtgatggag ttggccactc cctctctgcg     2400

cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg     2460

ggcggcctca gtgagcgagc gagcgcgcag agagggagtg gcc                       2503


