                         SEQUENCE LISTING

<110>  The Trustees of the University of Pennsylvania
 
<120>  GENE THERAPY FOR TREATING HEMOPHILIA A

<130>  UPN-16-7798PCT

<150>  US 62/323,336
<151>  2016-04-15

<150>  US 62/331,807
<151>  2016-05-04

<150>  US 62/428,866
<151>  2016-12-01

<160>  20    

<170>  PatentIn version 3.5

<210>  1
<211>  4371
<212>  DNA
<213>  Homo sapiens

<400>  1
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc       60

accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc      120

ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac      180

acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc      240

gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat      300

gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt      360

ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg      420

gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg      480

aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat      540

gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa      600

gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta      660

tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat      720

gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct      780

ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc      840

accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat      900

cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg      960

gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa     1020

gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa     1080

gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat     1140

gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact     1200

tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc     1260

cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg     1320

aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct     1380

attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg     1440

ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact     1500

gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt     1560

ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca     1620

actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga     1680

gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa     1740

agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag     1800

aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg     1860

cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt     1920

tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc     1980

attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa     2040

atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg     2100

atggaaaacc caggtctatg gattctgggg tgccacaact cagactttcg gaacagaggc     2160

atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac     2220

agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc     2280

ttctcccaga atccaccagt cttgaaacgc catcaacggg aaataactcg tactactctt     2340

cagtcagatc aagaggaaat tgactatgat gataccatat cagttgaaat gaagaaggaa     2400

gattttgaca tttatgatga ggatgaaaat cagagccccc gcagctttca aaagaaaaca     2460

cgacactatt ttattgctgc agtggagagg ctctgggatt atgggatgag tagctcccca     2520

catgttctaa gaaacagggc tcagagtggc agtgtccctc agttcaagaa agttgttttc     2580

caggaattta ctgatggctc ctttactcag cccttatacc gtggagaact aaatgaacat     2640

ttgggactcc tggggccata tataagagca gaagttgaag ataatatcat ggtaactttc     2700

agaaatcagg cctctcgtcc ctattccttc tattctagcc ttatttctta tgaggaagat     2760

cagaggcaag gagcagaacc tagaaaaaac tttgtcaagc ctaatgaaac caaaacttac     2820

ttttggaaag tgcaacatca tatggcaccc actaaagatg agtttgactg caaagcctgg     2880

gcttatttct ctgatgttga cctggaaaaa gatgtgcact caggcctgat tggacccctt     2940

ctggtctgcc acactaacac actgaaccct gctcatggga gacaagtgac agtacaggaa     3000

tttgctctgt ttttcaccat ctttgatgag accaaaagct ggtacttcac tgaaaatatg     3060

gaaagaaact gcagggctcc ctgcaatatc cagatggaag atcccacttt taaagagaat     3120

tatcgcttcc atgcaatcaa tggctacata atggatacac tacctggctt agtaatggct     3180

caggatcaaa ggattcgatg gtatctgctc agcatgggca gcaatgaaaa catccattct     3240

attcatttca gtggacatgt gttcactgta cgaaaaaaag aggagtataa aatggcactg     3300

tacaatctct atccaggtgt ttttgagaca gtggaaatgt taccatccaa agctggaatt     3360

tggcgggtgg aatgccttat tggcgagcat ctacatgctg ggatgagcac actttttctg     3420

gtgtacagca ataagtgtca gactcccctg ggaatggctt ctggacacat tagagatttt     3480

cagattacag cttcaggaca atatggacag tgggccccaa agctggccag acttcattat     3540

tccggatcaa tcaatgcctg gagcaccaag gagccctttt cttggatcaa ggtggatctg     3600

ttggcaccaa tgattattca cggcatcaag acccagggtg cccgtcagaa gttctccagc     3660

ctctacatct ctcagtttat catcatgtat agtcttgatg ggaagaagtg gcagacttat     3720

cgaggaaatt ccactggaac cttaatggtc ttctttggca atgtggattc atctgggata     3780

aaacacaata tttttaaccc tccaattatt gctcgataca tccgtttgca cccaactcat     3840

tatagcattc gcagcactct tcgcatggag ttgatgggct gtgatttaaa tagttgcagc     3900

atgccattgg gaatggagag taaagcaata tcagatgcac agattactgc ttcatcctac     3960

tttaccaata tgtttgccac ctggtctcct tcaaaagctc gacttcacct ccaagggagg     4020

agtaatgcct ggagacctca ggtgaataat ccaaaagagt ggctgcaagt ggacttccag     4080

aagacaatga aagtcacagg agtaactact cagggagtaa aatctctgct taccagcatg     4140

tatgtgaagg agttcctcat ctccagcagt caagatggcc atcagtggac tctctttttt     4200

cagaatggca aagtaaaggt ttttcaggga aatcaagact ccttcacacc tgtggtgaac     4260

tctctagacc caccgttact gactcgctac cttcgaattc acccccagag ttgggtgcac     4320

cagattgccc tgaggatgga ggttctgggc tgcgaggcac aggacctcta c              4371


<210>  2
<211>  4374
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  2
atgcagatcg agctgagcac ctgcttcttc ctgtgcctgc tgcggttctg cttctccgcc       60

acccggcggt actacctggg agccgtggag ctgagctggg attacatgca gagcgatctg      120

ggagagctgc cagtggatgc ccggttccca ccacgggtgc caaagagctt cccattcaac      180

accagcgtgg tgtacaagaa gaccctgttc gtggagttca ccgatcacct gttcaacatc      240

gccaagccac ggccaccctg gatgggactg ctgggaccaa ccatccaggc cgaggtgtac      300

gataccgtgg tgatcaccct gaagaacatg gcctctcatc ctgtgtccct gcacgccgtg      360

ggagtgagct actggaaggc cagcgaggga gccgagtacg atgatcagac cagccagcgg      420

gagaaggagg atgataaggt gttcccagga ggaagccaca cctacgtgtg gcaggtgctg      480

aaggagaacg gaccaatggc cagcgatcca ctgtgcctga cctacagcta cctgagccac      540

gtggatctgg tgaaggatct gaacagcgga ctgatcggag ccctgctggt gtgccgggag      600

ggaagcctgg ccaaggagaa gacccagacc ctgcacaagt tcatcctgct gttcgccgtg      660

ttcgatgagg gaaagagctg gcacagcgag accaagaaca gcctgatgca ggatcgggat      720

gccgccagcg cccgggcctg gccaaagatg cacaccgtga acggatacgt gaaccggagc      780

ctgccaggac tgatcggatg ccaccggaag agcgtgtact ggcacgtgat cggaatggga      840

accaccccag aggtgcactc tatcttcctg gagggacaca cctttctggt gcggaaccac      900

cggcaggcca gcctggagat cagcccaatc accttcctga ccgcccagac cctgctgatg      960

gatctgggac agttcctgct gttctgccat atcagcagcc accagcacga tggaatggag     1020

gcctacgtga aggtggatag ctgcccagag gagccacagc tgcggatgaa gaacaacgag     1080

gaggccgagg attacgatga tgatctgacc gatagcgaga tggatgtggt gcggttcgat     1140

gatgataaca gcccaagctt catccagatc cggagcgtgg ccaagaagca cccaaagacc     1200

tgggtgcact acatcgccgc cgaggaggag gattgggatt acgccccact ggtgctggcc     1260

cctgatgatc ggagctacaa gagccagtac ctgaacaacg gaccacagcg gatcggacgg     1320

aagtacaaaa aagtgcggtt catggcctac accgatgaga ccttcaagac ccgggaggcc     1380

atccagcacg agagcggaat cctgggacca ctgctgtacg gagaggtggg agataccctg     1440

ctgatcatct tcaagaacca ggccagccgg ccatacaaca tctacccaca cggaatcacc     1500

gatgtgcggc cactgtacag ccggcggctg ccaaagggag tgaagcacct gaaggatttc     1560

ccaatcctgc caggagagat cttcaagtac aagtggacag tgacagtgga ggatggacca     1620

accaagtctg atccaagatg cctgaccaga tactacagca gctttgtgaa catggagaga     1680

gacctggcct ctggactgat tggaccactg ctgatctgct acaaggagtc tgtggatcag     1740

agaggaaacc agatcatgtc tgataagaga aatgtgatcc tgttctctgt gtttgatgag     1800

aacagaagct ggtacctgac agagaacatc cagagattcc tgccaaaccc agccggagtg     1860

cagctggagg atccagagtt ccaggccagc aacatcatgc acagcatcaa cggatacgtg     1920

ttcgatagcc tgcagctgag cgtgtgcctg cacgaggtgg cctattggta tatcctgagc     1980

atcggagccc agaccgattt cctgagcgtg ttcttcagcg gatacacctt caagcacaag     2040

atggtgtacg aggataccct gaccctgttc ccattctccg gagagaccgt gttcatgagc     2100

atggagaacc caggactgtg gatcctggga tgccacaact ctgatttcag aaacagagga     2160

atgactgccc tgctgaaagt gtccagctgt gataagaaca ctggagatta ctatgaggat     2220

agctatgagg atatctctgc ctacctgctg agcaagaaca atgccattga gccaagaagc     2280

ttcagccaga acccaccagt gctgaagaga caccagagag agatcaccag aaccaccctg     2340

cagtctgatc aggaggagat tgattatgat gataccatct ctgtggagat gaagaaggag     2400

gattttgata tctatgatga ggatgagaac cagagcccaa gaagcttcca gaagaagacc     2460

agacactact tcatcgctgc agtggagaga ctgtgggatt atggaatgag cagcagccca     2520

cacgtgctga gaaacagagc ccagagcgga tctgtgccac agttcaagaa ggtggtgttc     2580

caggagttca ccgatggaag cttcacccag ccactgtacc ggggagagct gaacgagcac     2640

ctgggactgc tgggaccata catccgggcc gaggtggagg ataacatcat ggtgaccttc     2700

cggaaccagg ccagccggcc atacagcttc tacagcagcc tgatcagcta cgaggaggat     2760

cagcggcagg gagccgagcc acggaagaac ttcgtgaagc caaacgagac caagacctac     2820

ttctggaagg tgcagcacca catggcccca accaaggatg agttcgattg caaggcctgg     2880

gcctacttca gcgatgtgga tctggagaag gatgtgcaca gcggactgat cggaccactg     2940

ctggtgtgcc acaccaacac cctgaaccca gcccacggac ggcaggtgac cgtgcaggag     3000

ttcgccctgt tcttcaccat cttcgatgag accaagagct ggtacttcac cgagaacatg     3060

gagcggaact gccgggcccc ttgcaacatc cagatggagg atccaacctt caaggagaac     3120

taccggttcc acgccatcaa cggatacatc atggataccc tgccaggact ggtgatggcc     3180

caggatcagc ggatccggtg gtacctgctg agcatgggaa gcaacgagaa catccacagc     3240

atccacttca gcggacacgt gttcaccgtg cggaagaagg aggagtacaa gatggccctg     3300

tacaacctgt acccaggagt gttcgagacc gtggagatgc tgccaagcaa ggccggaatc     3360

tggcgggtgg agtgcctgat cggagagcac ctgcacgccg gaatgagcac cctgttcctg     3420

gtgtacagca acaagtgcca gaccccactg ggaatggcca gcggacacat ccgggatttc     3480

cagatcaccg ccagcggaca gtacggacag tgggccccaa agctggcccg gctgcactac     3540

agcggaagca tcaacgcctg gagcaccaag gagccattca gctggatcaa agtggatctg     3600

ctggccccaa tgatcatcca cggaatcaag acccagggag cccggcagaa gttcagcagc     3660

ctgtacatca gccagttcat catcatgtac agcctggatg gaaagaagtg gcagacctac     3720

cggggaaaca gcaccggaac cctgatggtg ttcttcggaa acgtggatag cagcggaatc     3780

aagcacaaca tcttcaaccc accaatcatc gcccgataca tccggctgca cccaacccac     3840

tacagcatca gaagcaccct gcggatggag ctgatgggat gtgatctgaa cagctgctcc     3900

atgccactgg gaatggagag caaggccatc agcgatgccc agatcaccgc cagcagctac     3960

ttcaccaaca tgttcgccac ctggagccca agcaaggccc ggctgcacct gcagggacgg     4020

agcaacgcct ggcggccaca ggtgaataac ccaaaggagt ggctgcaggt ggatttccag     4080

aagaccatga aggtgaccgg agtgaccacc cagggagtga agagcctgct gactagcatg     4140

tatgtgaagg agttcctgat cagcagcagc caggatggac accagtggac cctgttcttc     4200

cagaacggaa aggtgaaggt gttccaggga aaccaggata gcttcacccc agtggtgaac     4260

agcctggatc caccactgct gacccgatac ctgcggatcc acccacagag ctgggtgcac     4320

cagatcgccc tgagaatgga ggtgctggga tgcgaggccc aggatctgta ctga           4374


<210>  3
<211>  1457
<212>  PRT
<213>  Homo sapiens

<400>  3

Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe 
1               5                   10                  15      


Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 
            20                  25                  30          


Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg 
        35                  40                  45              


Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val 
    50                  55                  60                  


Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile 
65                  70                  75                  80  


Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln 
                85                  90                  95      


Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser 
            100                 105                 110         


His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser 
        115                 120                 125             


Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp 
    130                 135                 140                 


Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu 
145                 150                 155                 160 


Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser 
                165                 170                 175     


Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile 
            180                 185                 190         


Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr 
        195                 200                 205             


Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly 
    210                 215                 220                 


Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp 
225                 230                 235                 240 


Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr 
                245                 250                 255     


Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val 
            260                 265                 270         


Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile 
        275                 280                 285             


Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser 
    290                 295                 300                 


Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met 
305                 310                 315                 320 


Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His 
                325                 330                 335     


Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro 
            340                 345                 350         


Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp 
        355                 360                 365             


Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser 
    370                 375                 380                 


Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr 
385                 390                 395                 400 


Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro 
                405                 410                 415     


Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn 
            420                 425                 430         


Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met 
        435                 440                 445             


Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu 
    450                 455                 460                 


Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu 
465                 470                 475                 480 


Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro 
                485                 490                 495     


His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys 
            500                 505                 510         


Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe 
        515                 520                 525             


Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 
    530                 535                 540                 


Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg 
545                 550                 555                 560 


Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu 
                565                 570                 575     


Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val 
            580                 585                 590         


Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu 
        595                 600                 605             


Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp 
    610                 615                 620                 


Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val 
625                 630                 635                 640 


Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp 
                645                 650                 655     


Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe 
            660                 665                 670         


Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr 
        675                 680                 685             


Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro 
    690                 695                 700                 


Gly Leu Trp Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly 
705                 710                 715                 720 


Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp 
                725                 730                 735     


Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys 
            740                 745                 750         


Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Leu 
        755                 760                 765             


Lys Arg His Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln 
    770                 775                 780                 


Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu 
785                 790                 795                 800 


Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe 
                805                 810                 815     


Gln Lys Lys Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp 
            820                 825                 830         


Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln 
        835                 840                 845             


Ser Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr 
    850                 855                 860                 


Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His 
865                 870                 875                 880 


Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile 
                885                 890                 895     


Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser 
            900                 905                 910         


Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg 
        915                 920                 925             


Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val 
    930                 935                 940                 


Gln His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp 
945                 950                 955                 960 


Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu 
                965                 970                 975     


Ile Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His 
            980                 985                 990         


Gly Arg Gln Val Thr Val Gln Glu  Phe Ala Leu Phe Phe  Thr Ile Phe 
        995                 1000                 1005             


Asp Glu  Thr Lys Ser Trp Tyr  Phe Thr Glu Asn Met  Glu Arg Asn 
    1010                 1015                 1020             


Cys Arg  Ala Pro Cys Asn Ile  Gln Met Glu Asp Pro  Thr Phe Lys 
    1025                 1030                 1035             


Glu Asn  Tyr Arg Phe His Ala  Ile Asn Gly Tyr Ile  Met Asp Thr 
    1040                 1045                 1050             


Leu Pro  Gly Leu Val Met Ala  Gln Asp Gln Arg Ile  Arg Trp Tyr 
    1055                 1060                 1065             


Leu Leu  Ser Met Gly Ser Asn  Glu Asn Ile His Ser  Ile His Phe 
    1070                 1075                 1080             


Ser Gly  His Val Phe Thr Val  Arg Lys Lys Glu Glu  Tyr Lys Met 
    1085                 1090                 1095             


Ala Leu  Tyr Asn Leu Tyr Pro  Gly Val Phe Glu Thr  Val Glu Met 
    1100                 1105                 1110             


Leu Pro  Ser Lys Ala Gly Ile  Trp Arg Val Glu Cys  Leu Ile Gly 
    1115                 1120                 1125             


Glu His  Leu His Ala Gly Met  Ser Thr Leu Phe Leu  Val Tyr Ser 
    1130                 1135                 1140             


Asn Lys  Cys Gln Thr Pro Leu  Gly Met Ala Ser Gly  His Ile Arg 
    1145                 1150                 1155             


Asp Phe  Gln Ile Thr Ala Ser  Gly Gln Tyr Gly Gln  Trp Ala Pro 
    1160                 1165                 1170             


Lys Leu  Ala Arg Leu His Tyr  Ser Gly Ser Ile Asn  Ala Trp Ser 
    1175                 1180                 1185             


Thr Lys  Glu Pro Phe Ser Trp  Ile Lys Val Asp Leu  Leu Ala Pro 
    1190                 1195                 1200             


Met Ile  Ile His Gly Ile Lys  Thr Gln Gly Ala Arg  Gln Lys Phe 
    1205                 1210                 1215             


Ser Ser  Leu Tyr Ile Ser Gln  Phe Ile Ile Met Tyr  Ser Leu Asp 
    1220                 1225                 1230             


Gly Lys  Lys Trp Gln Thr Tyr  Arg Gly Asn Ser Thr  Gly Thr Leu 
    1235                 1240                 1245             


Met Val  Phe Phe Gly Asn Val  Asp Ser Ser Gly Ile  Lys His Asn 
    1250                 1255                 1260             


Ile Phe  Asn Pro Pro Ile Ile  Ala Arg Tyr Ile Arg  Leu His Pro 
    1265                 1270                 1275             


Thr His  Tyr Ser Ile Arg Ser  Thr Leu Arg Met Glu  Leu Met Gly 
    1280                 1285                 1290             


Cys Asp  Leu Asn Ser Cys Ser  Met Pro Leu Gly Met  Glu Ser Lys 
    1295                 1300                 1305             


Ala Ile  Ser Asp Ala Gln Ile  Thr Ala Ser Ser Tyr  Phe Thr Asn 
    1310                 1315                 1320             


Met Phe  Ala Thr Trp Ser Pro  Ser Lys Ala Arg Leu  His Leu Gln 
    1325                 1330                 1335             


Gly Arg  Ser Asn Ala Trp Arg  Pro Gln Val Asn Asn  Pro Lys Glu 
    1340                 1345                 1350             


Trp Leu  Gln Val Asp Phe Gln  Lys Thr Met Lys Val  Thr Gly Val 
    1355                 1360                 1365             


Thr Thr  Gln Gly Val Lys Ser  Leu Leu Thr Ser Met  Tyr Val Lys 
    1370                 1375                 1380             


Glu Phe  Leu Ile Ser Ser Ser  Gln Asp Gly His Gln  Trp Thr Leu 
    1385                 1390                 1395             


Phe Phe  Gln Asn Gly Lys Val  Lys Val Phe Gln Gly  Asn Gln Asp 
    1400                 1405                 1410             


Ser Phe  Thr Pro Val Val Asn  Ser Leu Asp Pro Pro  Leu Leu Thr 
    1415                 1420                 1425             


Arg Tyr  Leu Arg Ile His Pro  Gln Ser Trp Val His  Gln Ile Ala 
    1430                 1435                 1440             


Leu Arg  Met Glu Val Leu Gly  Cys Glu Ala Gln Asp  Leu Tyr 
    1445                 1450                 1455         


<210>  4
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  4
tgtttgctgc ttgcaatgtt tgcccatttt aggg                                   34


<210>  5
<211>  100
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  5
ctacctcgtg atcgcccggc ccctgttcaa acatgtccta atactctgtc tctgcaaggg       60

tcatcagtag ttttccatct tactcaacat cctcccagtg                            100


<210>  6
<211>  42
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  6
aggttaattt ttaaactgtt tgctctggtt aataatctca gg                          42


<210>  7
<211>  190
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  7
atttcataga acgaatgttc cgatgctcta atctctctag acaaggttca tatttgtatg       60

ggttacttat tctctctttg ttgactaagt caataatcag aatcagcagg tttgcagtca      120

gattggcagg gataagcagc ctagctcagg agaagtgagt ataaaagccc caggctggga      180

gcagccatca                                                             190


<210>  8
<211>  176
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  8
actcaaagtt caaaccttat cattttttgc tttgttcctc ttggccttgg ttttgtacat       60

cagctttgaa aataccatcc cagggttaat gctggggtta atttataact aagagtgctc      120

tagttttgca atacaggaca tgctataaaa atggaaagat gttgctttct gagaga          176


<210>  9
<211>  218
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  9
tggacacagg acgctgtggt ttctgagcca gggggcgact cagatcccag ccagtggact       60

tagcccctgt ttgctcctcc gataactggg gtgaccttgg ttaatattca ccagcagcct      120

cccccgttgc ccctctggat ccactgctta aatacggacg aggacagggc cctgtctcct      180

cagcttcagg caccaccact gacctgggac agtgaata                              218


<210>  10
<211>  75
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  10
aataaagtct gagtgggcgg cagcctgtgt gtgcctgggt tctctctgtc ccggaatgtg       60

caaacaatgg aggtg                                                        75


<210>  11
<211>  168
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  11
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctact                   168


<210>  12
<211>  164
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  12
gataagtagc atggcgggtt aatcattaac tacaaggaac ccctagtgat ggagttggcc       60

actccctctc tgcgcgctcg ctcgctcact gaggccgggc gaccaaaggt cgcccgacgc      120

ccgggctttg cccgggcggc ctcagtgagc gagcgagcgc gcag                       164


<210>  13
<211>  7920
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  13
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctactta agctacctcg      180

tgatcgcccg gcccctgttc aaacatgtcc taatactctg tctctgcaag ggtcatcagt      240

agttttccat cttactcaac atcctcccag tggaattcat ttcatagaac gaatgttccg      300

atgctctaat ctctctagac aaggttcata tttgtatggg ttacttattc tctctttgtt      360

gactaagtca ataatcagaa tcagcaggtt tgcagtcaga ttggcaggga taagcagcct      420

agctcaggag aagtgagtat aaaagcccca ggctgggagc agccatcagc ggccgccacc      480

atgcagatcg agctgagcac ctgcttcttc ctgtgcctgc tgcggttctg cttctccgcc      540

acccggcggt actacctggg agccgtggag ctgagctggg attacatgca gagcgatctg      600

ggagagctgc cagtggatgc ccggttccca ccacgggtgc caaagagctt cccattcaac      660

accagcgtgg tgtacaagaa gaccctgttc gtggagttca ccgatcacct gttcaacatc      720

gccaagccac ggccaccctg gatgggactg ctgggaccaa ccatccaggc cgaggtgtac      780

gataccgtgg tgatcaccct gaagaacatg gcctctcatc ctgtgtccct gcacgccgtg      840

ggagtgagct actggaaggc cagcgaggga gccgagtacg atgatcagac cagccagcgg      900

gagaaggagg atgataaggt gttcccagga ggaagccaca cctacgtgtg gcaggtgctg      960

aaggagaacg gaccaatggc cagcgatcca ctgtgcctga cctacagcta cctgagccac     1020

gtggatctgg tgaaggatct gaacagcgga ctgatcggag ccctgctggt gtgccgggag     1080

ggaagcctgg ccaaggagaa gacccagacc ctgcacaagt tcatcctgct gttcgccgtg     1140

ttcgatgagg gaaagagctg gcacagcgag accaagaaca gcctgatgca ggatcgggat     1200

gccgccagcg cccgggcctg gccaaagatg cacaccgtga acggatacgt gaaccggagc     1260

ctgccaggac tgatcggatg ccaccggaag agcgtgtact ggcacgtgat cggaatggga     1320

accaccccag aggtgcactc tatcttcctg gagggacaca cctttctggt gcggaaccac     1380

cggcaggcca gcctggagat cagcccaatc accttcctga ccgcccagac cctgctgatg     1440

gatctgggac agttcctgct gttctgccat atcagcagcc accagcacga tggaatggag     1500

gcctacgtga aggtggatag ctgcccagag gagccacagc tgcggatgaa gaacaacgag     1560

gaggccgagg attacgatga tgatctgacc gatagcgaga tggatgtggt gcggttcgat     1620

gatgataaca gcccaagctt catccagatc cggagcgtgg ccaagaagca cccaaagacc     1680

tgggtgcact acatcgccgc cgaggaggag gattgggatt acgccccact ggtgctggcc     1740

cctgatgatc ggagctacaa gagccagtac ctgaacaacg gaccacagcg gatcggacgg     1800

aagtacaaaa aagtgcggtt catggcctac accgatgaga ccttcaagac ccgggaggcc     1860

atccagcacg agagcggaat cctgggacca ctgctgtacg gagaggtggg agataccctg     1920

ctgatcatct tcaagaacca ggccagccgg ccatacaaca tctacccaca cggaatcacc     1980

gatgtgcggc cactgtacag ccggcggctg ccaaagggag tgaagcacct gaaggatttc     2040

ccaatcctgc caggagagat cttcaagtac aagtggacag tgacagtgga ggatggacca     2100

accaagtctg atccaagatg cctgaccaga tactacagca gctttgtgaa catggagaga     2160

gacctggcct ctggactgat tggaccactg ctgatctgct acaaggagtc tgtggatcag     2220

agaggaaacc agatcatgtc tgataagaga aatgtgatcc tgttctctgt gtttgatgag     2280

aacagaagct ggtacctgac agagaacatc cagagattcc tgccaaaccc agccggagtg     2340

cagctggagg atccagagtt ccaggccagc aacatcatgc acagcatcaa cggatacgtg     2400

ttcgatagcc tgcagctgag cgtgtgcctg cacgaggtgg cctattggta tatcctgagc     2460

atcggagccc agaccgattt cctgagcgtg ttcttcagcg gatacacctt caagcacaag     2520

atggtgtacg aggataccct gaccctgttc ccattctccg gagagaccgt gttcatgagc     2580

atggagaacc caggactgtg gatcctggga tgccacaact ctgatttcag aaacagagga     2640

atgactgccc tgctgaaagt gtccagctgt gataagaaca ctggagatta ctatgaggat     2700

agctatgagg atatctctgc ctacctgctg agcaagaaca atgccattga gccaagaagc     2760

ttcagccaga acccaccagt gctgaagaga caccagagag agatcaccag aaccaccctg     2820

cagtctgatc aggaggagat tgattatgat gataccatct ctgtggagat gaagaaggag     2880

gattttgata tctatgatga ggatgagaac cagagcccaa gaagcttcca gaagaagacc     2940

agacactact tcatcgctgc agtggagaga ctgtgggatt atggaatgag cagcagccca     3000

cacgtgctga gaaacagagc ccagagcgga tctgtgccac agttcaagaa ggtggtgttc     3060

caggagttca ccgatggaag cttcacccag ccactgtacc ggggagagct gaacgagcac     3120

ctgggactgc tgggaccata catccgggcc gaggtggagg ataacatcat ggtgaccttc     3180

cggaaccagg ccagccggcc atacagcttc tacagcagcc tgatcagcta cgaggaggat     3240

cagcggcagg gagccgagcc acggaagaac ttcgtgaagc caaacgagac caagacctac     3300

ttctggaagg tgcagcacca catggcccca accaaggatg agttcgattg caaggcctgg     3360

gcctacttca gcgatgtgga tctggagaag gatgtgcaca gcggactgat cggaccactg     3420

ctggtgtgcc acaccaacac cctgaaccca gcccacggac ggcaggtgac cgtgcaggag     3480

ttcgccctgt tcttcaccat cttcgatgag accaagagct ggtacttcac cgagaacatg     3540

gagcggaact gccgggcccc ttgcaacatc cagatggagg atccaacctt caaggagaac     3600

taccggttcc acgccatcaa cggatacatc atggataccc tgccaggact ggtgatggcc     3660

caggatcagc ggatccggtg gtacctgctg agcatgggaa gcaacgagaa catccacagc     3720

atccacttca gcggacacgt gttcaccgtg cggaagaagg aggagtacaa gatggccctg     3780

tacaacctgt acccaggagt gttcgagacc gtggagatgc tgccaagcaa ggccggaatc     3840

tggcgggtgg agtgcctgat cggagagcac ctgcacgccg gaatgagcac cctgttcctg     3900

gtgtacagca acaagtgcca gaccccactg ggaatggcca gcggacacat ccgggatttc     3960

cagatcaccg ccagcggaca gtacggacag tgggccccaa agctggcccg gctgcactac     4020

agcggaagca tcaacgcctg gagcaccaag gagccattca gctggatcaa agtggatctg     4080

ctggccccaa tgatcatcca cggaatcaag acccagggag cccggcagaa gttcagcagc     4140

ctgtacatca gccagttcat catcatgtac agcctggatg gaaagaagtg gcagacctac     4200

cggggaaaca gcaccggaac cctgatggtg ttcttcggaa acgtggatag cagcggaatc     4260

aagcacaaca tcttcaaccc accaatcatc gcccgataca tccggctgca cccaacccac     4320

tacagcatca gaagcaccct gcggatggag ctgatgggat gtgatctgaa cagctgctcc     4380

atgccactgg gaatggagag caaggccatc agcgatgccc agatcaccgc cagcagctac     4440

ttcaccaaca tgttcgccac ctggagccca agcaaggccc ggctgcacct gcagggacgg     4500

agcaacgcct ggcggccaca ggtgaataac ccaaaggagt ggctgcaggt ggatttccag     4560

aagaccatga aggtgaccgg agtgaccacc cagggagtga agagcctgct gactagcatg     4620

tatgtgaagg agttcctgat cagcagcagc caggatggac accagtggac cctgttcttc     4680

cagaacggaa aggtgaaggt gttccaggga aaccaggata gcttcacccc agtggtgaac     4740

agcctggatc caccactgct gacccgatac ctgcggatcc acccacagag ctgggtgcac     4800

cagatcgccc tgagaatgga ggtgctggga tgcgaggccc aggatctgta ctgatgagca     4860

tgcaataaag tctgagtggg cggcagcctg tgtgtgcctg ggttctctct gtcccggaat     4920

gtgcaaacaa tggaggtgct cgagtagata agtagcatgg cgggttaatc attaactaca     4980

aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg     5040

ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc     5100

gagcgcgcag ccttaattaa cctaattcac tggccgtcgt tttacaacgt cgtgactggg     5160

aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc     5220

gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg     5280

aatgggacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg     5340

tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc     5400

tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc     5460

gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta     5520

gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta     5580

atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg     5640

atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa     5700

aatttaacgc gaattttaac aaaatattaa cgcttacaat ttaggtggca cttttcgggg     5760

aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct     5820

catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat     5880

tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc     5940

tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg     6000

ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg     6060

ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga     6120

cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta     6180

ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc     6240

tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc     6300

gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg     6360

ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc     6420

aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca     6480

acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct     6540

tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat     6600

cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg     6660

gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat     6720

taagcattgg taactgtcag accaagttta ctcatatata ctttagattg atttaaaact     6780

tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat     6840

cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc     6900

ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct     6960

accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg     7020

cttcagcaga gcgcagatac caaatactgt tcttctagtg tagccgtagt taggccacca     7080

cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc     7140

tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga     7200

taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac     7260

gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga     7320

agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag     7380

ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg     7440

acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag     7500

caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc     7560

tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc     7620

tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc     7680

aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag     7740

gtttcccgac tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt agctcactca     7800

ttaggcaccc caggctttac actttatgct tccggctcgt atgttgtgtg gaattgtgag     7860

cggataacaa tttcacacag gaaacagcta tgaccatgat tacgccagat ttaattaagg     7920


<210>  14
<211>  8004
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  14
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctactta agctacctcg      180

tgatcgcccg gcccctgttc aaacatgtcc taatactctg tctctgcaag ggtcatcagt      240

agttttccat cttactcaac atcctcccag tgaggttaat ttttaaactg tttgctctgg      300

ttaataatct caggaggtta atttttaaac tgtttgctct ggttaataat ctcagggaat      360

tcatttcata gaacgaatgt tccgatgctc taatctctct agacaaggtt catatttgta      420

tgggttactt attctctctt tgttgactaa gtcaataatc agaatcagca ggtttgcagt      480

cagattggca gggataagca gcctagctca ggagaagtga gtataaaagc cccaggctgg      540

gagcagccat cagcggccgc caccatgcag atcgagctga gcacctgctt cttcctgtgc      600

ctgctgcggt tctgcttctc cgccacccgg cggtactacc tgggagccgt ggagctgagc      660

tgggattaca tgcagagcga tctgggagag ctgccagtgg atgcccggtt cccaccacgg      720

gtgccaaaga gcttcccatt caacaccagc gtggtgtaca agaagaccct gttcgtggag      780

ttcaccgatc acctgttcaa catcgccaag ccacggccac cctggatggg actgctggga      840

ccaaccatcc aggccgaggt gtacgatacc gtggtgatca ccctgaagaa catggcctct      900

catcctgtgt ccctgcacgc cgtgggagtg agctactgga aggccagcga gggagccgag      960

tacgatgatc agaccagcca gcgggagaag gaggatgata aggtgttccc aggaggaagc     1020

cacacctacg tgtggcaggt gctgaaggag aacggaccaa tggccagcga tccactgtgc     1080

ctgacctaca gctacctgag ccacgtggat ctggtgaagg atctgaacag cggactgatc     1140

ggagccctgc tggtgtgccg ggagggaagc ctggccaagg agaagaccca gaccctgcac     1200

aagttcatcc tgctgttcgc cgtgttcgat gagggaaaga gctggcacag cgagaccaag     1260

aacagcctga tgcaggatcg ggatgccgcc agcgcccggg cctggccaaa gatgcacacc     1320

gtgaacggat acgtgaaccg gagcctgcca ggactgatcg gatgccaccg gaagagcgtg     1380

tactggcacg tgatcggaat gggaaccacc ccagaggtgc actctatctt cctggaggga     1440

cacacctttc tggtgcggaa ccaccggcag gccagcctgg agatcagccc aatcaccttc     1500

ctgaccgccc agaccctgct gatggatctg ggacagttcc tgctgttctg ccatatcagc     1560

agccaccagc acgatggaat ggaggcctac gtgaaggtgg atagctgccc agaggagcca     1620

cagctgcgga tgaagaacaa cgaggaggcc gaggattacg atgatgatct gaccgatagc     1680

gagatggatg tggtgcggtt cgatgatgat aacagcccaa gcttcatcca gatccggagc     1740

gtggccaaga agcacccaaa gacctgggtg cactacatcg ccgccgagga ggaggattgg     1800

gattacgccc cactggtgct ggcccctgat gatcggagct acaagagcca gtacctgaac     1860

aacggaccac agcggatcgg acggaagtac aaaaaagtgc ggttcatggc ctacaccgat     1920

gagaccttca agacccggga ggccatccag cacgagagcg gaatcctggg accactgctg     1980

tacggagagg tgggagatac cctgctgatc atcttcaaga accaggccag ccggccatac     2040

aacatctacc cacacggaat caccgatgtg cggccactgt acagccggcg gctgccaaag     2100

ggagtgaagc acctgaagga tttcccaatc ctgccaggag agatcttcaa gtacaagtgg     2160

acagtgacag tggaggatgg accaaccaag tctgatccaa gatgcctgac cagatactac     2220

agcagctttg tgaacatgga gagagacctg gcctctggac tgattggacc actgctgatc     2280

tgctacaagg agtctgtgga tcagagagga aaccagatca tgtctgataa gagaaatgtg     2340

atcctgttct ctgtgtttga tgagaacaga agctggtacc tgacagagaa catccagaga     2400

ttcctgccaa acccagccgg agtgcagctg gaggatccag agttccaggc cagcaacatc     2460

atgcacagca tcaacggata cgtgttcgat agcctgcagc tgagcgtgtg cctgcacgag     2520

gtggcctatt ggtatatcct gagcatcgga gcccagaccg atttcctgag cgtgttcttc     2580

agcggataca ccttcaagca caagatggtg tacgaggata ccctgaccct gttcccattc     2640

tccggagaga ccgtgttcat gagcatggag aacccaggac tgtggatcct gggatgccac     2700

aactctgatt tcagaaacag aggaatgact gccctgctga aagtgtccag ctgtgataag     2760

aacactggag attactatga ggatagctat gaggatatct ctgcctacct gctgagcaag     2820

aacaatgcca ttgagccaag aagcttcagc cagaacccac cagtgctgaa gagacaccag     2880

agagagatca ccagaaccac cctgcagtct gatcaggagg agattgatta tgatgatacc     2940

atctctgtgg agatgaagaa ggaggatttt gatatctatg atgaggatga gaaccagagc     3000

ccaagaagct tccagaagaa gaccagacac tacttcatcg ctgcagtgga gagactgtgg     3060

gattatggaa tgagcagcag cccacacgtg ctgagaaaca gagcccagag cggatctgtg     3120

ccacagttca agaaggtggt gttccaggag ttcaccgatg gaagcttcac ccagccactg     3180

taccggggag agctgaacga gcacctggga ctgctgggac catacatccg ggccgaggtg     3240

gaggataaca tcatggtgac cttccggaac caggccagcc ggccatacag cttctacagc     3300

agcctgatca gctacgagga ggatcagcgg cagggagccg agccacggaa gaacttcgtg     3360

aagccaaacg agaccaagac ctacttctgg aaggtgcagc accacatggc cccaaccaag     3420

gatgagttcg attgcaaggc ctgggcctac ttcagcgatg tggatctgga gaaggatgtg     3480

cacagcggac tgatcggacc actgctggtg tgccacacca acaccctgaa cccagcccac     3540

ggacggcagg tgaccgtgca ggagttcgcc ctgttcttca ccatcttcga tgagaccaag     3600

agctggtact tcaccgagaa catggagcgg aactgccggg ccccttgcaa catccagatg     3660

gaggatccaa ccttcaagga gaactaccgg ttccacgcca tcaacggata catcatggat     3720

accctgccag gactggtgat ggcccaggat cagcggatcc ggtggtacct gctgagcatg     3780

ggaagcaacg agaacatcca cagcatccac ttcagcggac acgtgttcac cgtgcggaag     3840

aaggaggagt acaagatggc cctgtacaac ctgtacccag gagtgttcga gaccgtggag     3900

atgctgccaa gcaaggccgg aatctggcgg gtggagtgcc tgatcggaga gcacctgcac     3960

gccggaatga gcaccctgtt cctggtgtac agcaacaagt gccagacccc actgggaatg     4020

gccagcggac acatccggga tttccagatc accgccagcg gacagtacgg acagtgggcc     4080

ccaaagctgg cccggctgca ctacagcgga agcatcaacg cctggagcac caaggagcca     4140

ttcagctgga tcaaagtgga tctgctggcc ccaatgatca tccacggaat caagacccag     4200

ggagcccggc agaagttcag cagcctgtac atcagccagt tcatcatcat gtacagcctg     4260

gatggaaaga agtggcagac ctaccgggga aacagcaccg gaaccctgat ggtgttcttc     4320

ggaaacgtgg atagcagcgg aatcaagcac aacatcttca acccaccaat catcgcccga     4380

tacatccggc tgcacccaac ccactacagc atcagaagca ccctgcggat ggagctgatg     4440

ggatgtgatc tgaacagctg ctccatgcca ctgggaatgg agagcaaggc catcagcgat     4500

gcccagatca ccgccagcag ctacttcacc aacatgttcg ccacctggag cccaagcaag     4560

gcccggctgc acctgcaggg acggagcaac gcctggcggc cacaggtgaa taacccaaag     4620

gagtggctgc aggtggattt ccagaagacc atgaaggtga ccggagtgac cacccaggga     4680

gtgaagagcc tgctgactag catgtatgtg aaggagttcc tgatcagcag cagccaggat     4740

ggacaccagt ggaccctgtt cttccagaac ggaaaggtga aggtgttcca gggaaaccag     4800

gatagcttca ccccagtggt gaacagcctg gatccaccac tgctgacccg atacctgcgg     4860

atccacccac agagctgggt gcaccagatc gccctgagaa tggaggtgct gggatgcgag     4920

gcccaggatc tgtactgatg agcatgcaat aaagtctgag tgggcggcag cctgtgtgtg     4980

cctgggttct ctctgtcccg gaatgtgcaa acaatggagg tgctcgagta gataagtagc     5040

atggcgggtt aatcattaac tacaaggaac ccctagtgat ggagttggcc actccctctc     5100

tgcgcgctcg ctcgctcact gaggccgggc gaccaaaggt cgcccgacgc ccgggctttg     5160

cccgggcggc ctcagtgagc gagcgagcgc gcagccttaa ttaacctaat tcactggccg     5220

tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag     5280

cacatccccc tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc     5340

aacagttgcg cagcctgaat ggcgaatggg acgcgccctg tagcggcgca ttaagcgcgg     5400

cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc     5460

ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa     5520

atcgggggct ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac     5580

ttgattaggg tgatggttca cgtagtgggc catcgccctg atagacggtt tttcgccctt     5640

tgacgttgga gtccacgttc tttaatagtg gactcttgtt ccaaactgga acaacactca     5700

accctatctc ggtctattct tttgatttat aagggatttt gccgatttcg gcctattggt     5760

taaaaaatga gctgatttaa caaaaattta acgcgaattt taacaaaata ttaacgctta     5820

caatttaggt ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta     5880

aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata     5940

ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc     6000

ggcattttgc cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga     6060

agatcagttg ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct     6120

tgagagtttt cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg     6180

tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta     6240

ttctcagaat gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat     6300

gacagtaaga gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt     6360

acttctgaca acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga     6420

tcatgtaact cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga     6480

gcgtgacacc acgatgcctg tagcaatggc aacaacgttg cgcaaactat taactggcga     6540

actacttact ctagcttccc ggcaacaatt aatagactgg atggaggcgg ataaagttgc     6600

aggaccactt ctgcgctcgg cccttccggc tggctggttt attgctgata aatctggagc     6660

cggtgagcgt gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg     6720

tatcgtagtt atctacacga cggggagtca ggcaactatg gatgaacgaa atagacagat     6780

cgctgagata ggtgcctcac tgattaagca ttggtaactg tcagaccaag tttactcata     6840

tatactttag attgatttaa aacttcattt ttaatttaaa aggatctagg tgaagatcct     6900

ttttgataat ctcatgacca aaatccctta acgtgagttt tcgttccact gagcgtcaga     6960

ccccgtagaa aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg     7020

cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc     7080

aactcttttt ccgaaggtaa ctggcttcag cagagcgcag ataccaaata ctgttcttct     7140

agtgtagccg tagttaggcc accacttcaa gaactctgta gcaccgccta catacctcgc     7200

tctgctaatc ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt     7260

ggactcaaga cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg     7320

cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac agcgtgagct     7380

atgagaaagc gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag     7440

ggtcggaaca ggagagcgca cgagggagct tccaggggga aacgcctggt atctttatag     7500

tcctgtcggg tttcgccacc tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg     7560

gcggagccta tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg     7620

gccttttgct cacatgttct ttcctgcgtt atcccctgat tctgtggata accgtattac     7680

cgcctttgag tgagctgata ccgctcgccg cagccgaacg accgagcgca gcgagtcagt     7740

gagcgaggaa gcggaagagc gcccaatacg caaaccgcct ctccccgcgc gttggccgat     7800

tcattaatgc agctggcacg acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc     7860

aattaatgtg agttagctca ctcattaggc accccaggct ttacacttta tgcttccggc     7920

tcgtatgttg tgtggaattg tgagcggata acaatttcac acaggaaaca gctatgacca     7980

tgattacgcc agatttaatt aagg                                            8004


<210>  15
<211>  7948
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  15
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctactta agctacctcg      180

tgatcgcccg gcccctgttc aaacatgtcc taatactctg tctctgcaag ggtcatcagt      240

agttttccat cttactcaac atcctcccag tggaattctg gacacaggac gctgtggttt      300

ctgagccagg gggcgactca gatcccagcc agtggactta gcccctgttt gctcctccga      360

taactggggt gaccttggtt aatattcacc agcagcctcc cccgttgccc ctctggatcc      420

actgcttaaa tacggacgag gacagggccc tgtctcctca gcttcaggca ccaccactga      480

cctgggacag tgaatagcgg ccgccaccat gcagatcgag ctgagcacct gcttcttcct      540

gtgcctgctg cggttctgct tctccgccac ccggcggtac tacctgggag ccgtggagct      600

gagctgggat tacatgcaga gcgatctggg agagctgcca gtggatgccc ggttcccacc      660

acgggtgcca aagagcttcc cattcaacac cagcgtggtg tacaagaaga ccctgttcgt      720

ggagttcacc gatcacctgt tcaacatcgc caagccacgg ccaccctgga tgggactgct      780

gggaccaacc atccaggccg aggtgtacga taccgtggtg atcaccctga agaacatggc      840

ctctcatcct gtgtccctgc acgccgtggg agtgagctac tggaaggcca gcgagggagc      900

cgagtacgat gatcagacca gccagcggga gaaggaggat gataaggtgt tcccaggagg      960

aagccacacc tacgtgtggc aggtgctgaa ggagaacgga ccaatggcca gcgatccact     1020

gtgcctgacc tacagctacc tgagccacgt ggatctggtg aaggatctga acagcggact     1080

gatcggagcc ctgctggtgt gccgggaggg aagcctggcc aaggagaaga cccagaccct     1140

gcacaagttc atcctgctgt tcgccgtgtt cgatgaggga aagagctggc acagcgagac     1200

caagaacagc ctgatgcagg atcgggatgc cgccagcgcc cgggcctggc caaagatgca     1260

caccgtgaac ggatacgtga accggagcct gccaggactg atcggatgcc accggaagag     1320

cgtgtactgg cacgtgatcg gaatgggaac caccccagag gtgcactcta tcttcctgga     1380

gggacacacc tttctggtgc ggaaccaccg gcaggccagc ctggagatca gcccaatcac     1440

cttcctgacc gcccagaccc tgctgatgga tctgggacag ttcctgctgt tctgccatat     1500

cagcagccac cagcacgatg gaatggaggc ctacgtgaag gtggatagct gcccagagga     1560

gccacagctg cggatgaaga acaacgagga ggccgaggat tacgatgatg atctgaccga     1620

tagcgagatg gatgtggtgc ggttcgatga tgataacagc ccaagcttca tccagatccg     1680

gagcgtggcc aagaagcacc caaagacctg ggtgcactac atcgccgccg aggaggagga     1740

ttgggattac gccccactgg tgctggcccc tgatgatcgg agctacaaga gccagtacct     1800

gaacaacgga ccacagcgga tcggacggaa gtacaaaaaa gtgcggttca tggcctacac     1860

cgatgagacc ttcaagaccc gggaggccat ccagcacgag agcggaatcc tgggaccact     1920

gctgtacgga gaggtgggag ataccctgct gatcatcttc aagaaccagg ccagccggcc     1980

atacaacatc tacccacacg gaatcaccga tgtgcggcca ctgtacagcc ggcggctgcc     2040

aaagggagtg aagcacctga aggatttccc aatcctgcca ggagagatct tcaagtacaa     2100

gtggacagtg acagtggagg atggaccaac caagtctgat ccaagatgcc tgaccagata     2160

ctacagcagc tttgtgaaca tggagagaga cctggcctct ggactgattg gaccactgct     2220

gatctgctac aaggagtctg tggatcagag aggaaaccag atcatgtctg ataagagaaa     2280

tgtgatcctg ttctctgtgt ttgatgagaa cagaagctgg tacctgacag agaacatcca     2340

gagattcctg ccaaacccag ccggagtgca gctggaggat ccagagttcc aggccagcaa     2400

catcatgcac agcatcaacg gatacgtgtt cgatagcctg cagctgagcg tgtgcctgca     2460

cgaggtggcc tattggtata tcctgagcat cggagcccag accgatttcc tgagcgtgtt     2520

cttcagcgga tacaccttca agcacaagat ggtgtacgag gataccctga ccctgttccc     2580

attctccgga gagaccgtgt tcatgagcat ggagaaccca ggactgtgga tcctgggatg     2640

ccacaactct gatttcagaa acagaggaat gactgccctg ctgaaagtgt ccagctgtga     2700

taagaacact ggagattact atgaggatag ctatgaggat atctctgcct acctgctgag     2760

caagaacaat gccattgagc caagaagctt cagccagaac ccaccagtgc tgaagagaca     2820

ccagagagag atcaccagaa ccaccctgca gtctgatcag gaggagattg attatgatga     2880

taccatctct gtggagatga agaaggagga ttttgatatc tatgatgagg atgagaacca     2940

gagcccaaga agcttccaga agaagaccag acactacttc atcgctgcag tggagagact     3000

gtgggattat ggaatgagca gcagcccaca cgtgctgaga aacagagccc agagcggatc     3060

tgtgccacag ttcaagaagg tggtgttcca ggagttcacc gatggaagct tcacccagcc     3120

actgtaccgg ggagagctga acgagcacct gggactgctg ggaccataca tccgggccga     3180

ggtggaggat aacatcatgg tgaccttccg gaaccaggcc agccggccat acagcttcta     3240

cagcagcctg atcagctacg aggaggatca gcggcaggga gccgagccac ggaagaactt     3300

cgtgaagcca aacgagacca agacctactt ctggaaggtg cagcaccaca tggccccaac     3360

caaggatgag ttcgattgca aggcctgggc ctacttcagc gatgtggatc tggagaagga     3420

tgtgcacagc ggactgatcg gaccactgct ggtgtgccac accaacaccc tgaacccagc     3480

ccacggacgg caggtgaccg tgcaggagtt cgccctgttc ttcaccatct tcgatgagac     3540

caagagctgg tacttcaccg agaacatgga gcggaactgc cgggcccctt gcaacatcca     3600

gatggaggat ccaaccttca aggagaacta ccggttccac gccatcaacg gatacatcat     3660

ggataccctg ccaggactgg tgatggccca ggatcagcgg atccggtggt acctgctgag     3720

catgggaagc aacgagaaca tccacagcat ccacttcagc ggacacgtgt tcaccgtgcg     3780

gaagaaggag gagtacaaga tggccctgta caacctgtac ccaggagtgt tcgagaccgt     3840

ggagatgctg ccaagcaagg ccggaatctg gcgggtggag tgcctgatcg gagagcacct     3900

gcacgccgga atgagcaccc tgttcctggt gtacagcaac aagtgccaga ccccactggg     3960

aatggccagc ggacacatcc gggatttcca gatcaccgcc agcggacagt acggacagtg     4020

ggccccaaag ctggcccggc tgcactacag cggaagcatc aacgcctgga gcaccaagga     4080

gccattcagc tggatcaaag tggatctgct ggccccaatg atcatccacg gaatcaagac     4140

ccagggagcc cggcagaagt tcagcagcct gtacatcagc cagttcatca tcatgtacag     4200

cctggatgga aagaagtggc agacctaccg gggaaacagc accggaaccc tgatggtgtt     4260

cttcggaaac gtggatagca gcggaatcaa gcacaacatc ttcaacccac caatcatcgc     4320

ccgatacatc cggctgcacc caacccacta cagcatcaga agcaccctgc ggatggagct     4380

gatgggatgt gatctgaaca gctgctccat gccactggga atggagagca aggccatcag     4440

cgatgcccag atcaccgcca gcagctactt caccaacatg ttcgccacct ggagcccaag     4500

caaggcccgg ctgcacctgc agggacggag caacgcctgg cggccacagg tgaataaccc     4560

aaaggagtgg ctgcaggtgg atttccagaa gaccatgaag gtgaccggag tgaccaccca     4620

gggagtgaag agcctgctga ctagcatgta tgtgaaggag ttcctgatca gcagcagcca     4680

ggatggacac cagtggaccc tgttcttcca gaacggaaag gtgaaggtgt tccagggaaa     4740

ccaggatagc ttcaccccag tggtgaacag cctggatcca ccactgctga cccgatacct     4800

gcggatccac ccacagagct gggtgcacca gatcgccctg agaatggagg tgctgggatg     4860

cgaggcccag gatctgtact gatgagcatg caataaagtc tgagtgggcg gcagcctgtg     4920

tgtgcctggg ttctctctgt cccggaatgt gcaaacaatg gaggtgctcg agtagataag     4980

tagcatggcg ggttaatcat taactacaag gaacccctag tgatggagtt ggccactccc     5040

tctctgcgcg ctcgctcgct cactgaggcc gggcgaccaa aggtcgcccg acgcccgggc     5100

tttgcccggg cggcctcagt gagcgagcga gcgcgcagcc ttaattaacc taattcactg     5160

gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt     5220

gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct     5280

tcccaacagt tgcgcagcct gaatggcgaa tgggacgcgc cctgtagcgg cgcattaagc     5340

gcggcgggtg tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc     5400

gctcctttcg ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct     5460

ctaaatcggg ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa     5520

aaacttgatt agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc     5580

cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca     5640

ctcaacccta tctcggtcta ttcttttgat ttataaggga ttttgccgat ttcggcctat     5700

tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg     5760

cttacaattt aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt     5820

tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat     5880

aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt     5940

ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg     6000

ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga     6060

tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc     6120

tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac     6180

actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg     6240

gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca     6300

acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg     6360

gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg     6420

acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg     6480

gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag     6540

ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg     6600

gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct     6660

cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac     6720

agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact     6780

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga     6840

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt     6900

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct     6960

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc     7020

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc     7080

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc     7140

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg     7200

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt     7260

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg     7320

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg     7380

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt     7440

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag     7500

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt     7560

gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta     7620

ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt     7680

cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc     7740

cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca     7800

acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc     7860

cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg     7920

accatgatta cgccagattt aattaagg                                        7948


<210>  16
<211>  8032
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  16
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctactta agctacctcg      180

tgatcgcccg gcccctgttc aaacatgtcc taatactctg tctctgcaag ggtcatcagt      240

agttttccat cttactcaac atcctcccag tgaggttaat ttttaaactg tttgctctgg      300

ttaataatct caggaggtta atttttaaac tgtttgctct ggttaataat ctcagggaat      360

tctggacaca ggacgctgtg gtttctgagc cagggggcga ctcagatccc agccagtgga      420

cttagcccct gtttgctcct ccgataactg gggtgacctt ggttaatatt caccagcagc      480

ctcccccgtt gcccctctgg atccactgct taaatacgga cgaggacagg gccctgtctc      540

ctcagcttca ggcaccacca ctgacctggg acagtgaata gcggccgcca ccatgcagat      600

cgagctgagc acctgcttct tcctgtgcct gctgcggttc tgcttctccg ccacccggcg      660

gtactacctg ggagccgtgg agctgagctg ggattacatg cagagcgatc tgggagagct      720

gccagtggat gcccggttcc caccacgggt gccaaagagc ttcccattca acaccagcgt      780

ggtgtacaag aagaccctgt tcgtggagtt caccgatcac ctgttcaaca tcgccaagcc      840

acggccaccc tggatgggac tgctgggacc aaccatccag gccgaggtgt acgataccgt      900

ggtgatcacc ctgaagaaca tggcctctca tcctgtgtcc ctgcacgccg tgggagtgag      960

ctactggaag gccagcgagg gagccgagta cgatgatcag accagccagc gggagaagga     1020

ggatgataag gtgttcccag gaggaagcca cacctacgtg tggcaggtgc tgaaggagaa     1080

cggaccaatg gccagcgatc cactgtgcct gacctacagc tacctgagcc acgtggatct     1140

ggtgaaggat ctgaacagcg gactgatcgg agccctgctg gtgtgccggg agggaagcct     1200

ggccaaggag aagacccaga ccctgcacaa gttcatcctg ctgttcgccg tgttcgatga     1260

gggaaagagc tggcacagcg agaccaagaa cagcctgatg caggatcggg atgccgccag     1320

cgcccgggcc tggccaaaga tgcacaccgt gaacggatac gtgaaccgga gcctgccagg     1380

actgatcgga tgccaccgga agagcgtgta ctggcacgtg atcggaatgg gaaccacccc     1440

agaggtgcac tctatcttcc tggagggaca cacctttctg gtgcggaacc accggcaggc     1500

cagcctggag atcagcccaa tcaccttcct gaccgcccag accctgctga tggatctggg     1560

acagttcctg ctgttctgcc atatcagcag ccaccagcac gatggaatgg aggcctacgt     1620

gaaggtggat agctgcccag aggagccaca gctgcggatg aagaacaacg aggaggccga     1680

ggattacgat gatgatctga ccgatagcga gatggatgtg gtgcggttcg atgatgataa     1740

cagcccaagc ttcatccaga tccggagcgt ggccaagaag cacccaaaga cctgggtgca     1800

ctacatcgcc gccgaggagg aggattggga ttacgcccca ctggtgctgg cccctgatga     1860

tcggagctac aagagccagt acctgaacaa cggaccacag cggatcggac ggaagtacaa     1920

aaaagtgcgg ttcatggcct acaccgatga gaccttcaag acccgggagg ccatccagca     1980

cgagagcgga atcctgggac cactgctgta cggagaggtg ggagataccc tgctgatcat     2040

cttcaagaac caggccagcc ggccatacaa catctaccca cacggaatca ccgatgtgcg     2100

gccactgtac agccggcggc tgccaaaggg agtgaagcac ctgaaggatt tcccaatcct     2160

gccaggagag atcttcaagt acaagtggac agtgacagtg gaggatggac caaccaagtc     2220

tgatccaaga tgcctgacca gatactacag cagctttgtg aacatggaga gagacctggc     2280

ctctggactg attggaccac tgctgatctg ctacaaggag tctgtggatc agagaggaaa     2340

ccagatcatg tctgataaga gaaatgtgat cctgttctct gtgtttgatg agaacagaag     2400

ctggtacctg acagagaaca tccagagatt cctgccaaac ccagccggag tgcagctgga     2460

ggatccagag ttccaggcca gcaacatcat gcacagcatc aacggatacg tgttcgatag     2520

cctgcagctg agcgtgtgcc tgcacgaggt ggcctattgg tatatcctga gcatcggagc     2580

ccagaccgat ttcctgagcg tgttcttcag cggatacacc ttcaagcaca agatggtgta     2640

cgaggatacc ctgaccctgt tcccattctc cggagagacc gtgttcatga gcatggagaa     2700

cccaggactg tggatcctgg gatgccacaa ctctgatttc agaaacagag gaatgactgc     2760

cctgctgaaa gtgtccagct gtgataagaa cactggagat tactatgagg atagctatga     2820

ggatatctct gcctacctgc tgagcaagaa caatgccatt gagccaagaa gcttcagcca     2880

gaacccacca gtgctgaaga gacaccagag agagatcacc agaaccaccc tgcagtctga     2940

tcaggaggag attgattatg atgataccat ctctgtggag atgaagaagg aggattttga     3000

tatctatgat gaggatgaga accagagccc aagaagcttc cagaagaaga ccagacacta     3060

cttcatcgct gcagtggaga gactgtggga ttatggaatg agcagcagcc cacacgtgct     3120

gagaaacaga gcccagagcg gatctgtgcc acagttcaag aaggtggtgt tccaggagtt     3180

caccgatgga agcttcaccc agccactgta ccggggagag ctgaacgagc acctgggact     3240

gctgggacca tacatccggg ccgaggtgga ggataacatc atggtgacct tccggaacca     3300

ggccagccgg ccatacagct tctacagcag cctgatcagc tacgaggagg atcagcggca     3360

gggagccgag ccacggaaga acttcgtgaa gccaaacgag accaagacct acttctggaa     3420

ggtgcagcac cacatggccc caaccaagga tgagttcgat tgcaaggcct gggcctactt     3480

cagcgatgtg gatctggaga aggatgtgca cagcggactg atcggaccac tgctggtgtg     3540

ccacaccaac accctgaacc cagcccacgg acggcaggtg accgtgcagg agttcgccct     3600

gttcttcacc atcttcgatg agaccaagag ctggtacttc accgagaaca tggagcggaa     3660

ctgccgggcc ccttgcaaca tccagatgga ggatccaacc ttcaaggaga actaccggtt     3720

ccacgccatc aacggataca tcatggatac cctgccagga ctggtgatgg cccaggatca     3780

gcggatccgg tggtacctgc tgagcatggg aagcaacgag aacatccaca gcatccactt     3840

cagcggacac gtgttcaccg tgcggaagaa ggaggagtac aagatggccc tgtacaacct     3900

gtacccagga gtgttcgaga ccgtggagat gctgccaagc aaggccggaa tctggcgggt     3960

ggagtgcctg atcggagagc acctgcacgc cggaatgagc accctgttcc tggtgtacag     4020

caacaagtgc cagaccccac tgggaatggc cagcggacac atccgggatt tccagatcac     4080

cgccagcgga cagtacggac agtgggcccc aaagctggcc cggctgcact acagcggaag     4140

catcaacgcc tggagcacca aggagccatt cagctggatc aaagtggatc tgctggcccc     4200

aatgatcatc cacggaatca agacccaggg agcccggcag aagttcagca gcctgtacat     4260

cagccagttc atcatcatgt acagcctgga tggaaagaag tggcagacct accggggaaa     4320

cagcaccgga accctgatgg tgttcttcgg aaacgtggat agcagcggaa tcaagcacaa     4380

catcttcaac ccaccaatca tcgcccgata catccggctg cacccaaccc actacagcat     4440

cagaagcacc ctgcggatgg agctgatggg atgtgatctg aacagctgct ccatgccact     4500

gggaatggag agcaaggcca tcagcgatgc ccagatcacc gccagcagct acttcaccaa     4560

catgttcgcc acctggagcc caagcaaggc ccggctgcac ctgcagggac ggagcaacgc     4620

ctggcggcca caggtgaata acccaaagga gtggctgcag gtggatttcc agaagaccat     4680

gaaggtgacc ggagtgacca cccagggagt gaagagcctg ctgactagca tgtatgtgaa     4740

ggagttcctg atcagcagca gccaggatgg acaccagtgg accctgttct tccagaacgg     4800

aaaggtgaag gtgttccagg gaaaccagga tagcttcacc ccagtggtga acagcctgga     4860

tccaccactg ctgacccgat acctgcggat ccacccacag agctgggtgc accagatcgc     4920

cctgagaatg gaggtgctgg gatgcgaggc ccaggatctg tactgatgag catgcaataa     4980

agtctgagtg ggcggcagcc tgtgtgtgcc tgggttctct ctgtcccgga atgtgcaaac     5040

aatggaggtg ctcgagtaga taagtagcat ggcgggttaa tcattaacta caaggaaccc     5100

ctagtgatgg agttggccac tccctctctg cgcgctcgct cgctcactga ggccgggcga     5160

ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct cagtgagcga gcgagcgcgc     5220

agccttaatt aacctaattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct     5280

ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc     5340

gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatgggac     5400

gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct     5460

acacttgcca gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg     5520

ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt     5580

gctttacggc acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca     5640

tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga     5700

ctcttgttcc aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa     5760

gggattttgc cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac     5820

gcgaatttta acaaaatatt aacgcttaca atttaggtgg cacttttcgg ggaaatgtgc     5880

gcggaacccc tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac     5940

aataaccctg ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt     6000

tccgtgtcgc ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag     6060

aaacgctggt gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg     6120

aactggatct caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa     6180

tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc     6240

aagagcaact cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag     6300

tcacagaaaa gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa     6360

ccatgagtga taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc     6420

taaccgcttt tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg     6480

agctgaatga agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa     6540

caacgttgcg caaactatta actggcgaac tacttactct agcttcccgg caacaattaa     6600

tagactggat ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg     6660

gctggtttat tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag     6720

cactggggcc agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg     6780

caactatgga tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt     6840

ggtaactgtc agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt     6900

aatttaaaag gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac     6960

gtgagttttc gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag     7020

atcctttttt tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg     7080

tggtttgttt gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca     7140

gagcgcagat accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga     7200

actctgtagc accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca     7260

gtggcgataa gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc     7320

agcggtcggg ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca     7380

ccgaactgag atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa     7440

aggcggacag gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc     7500

cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc     7560

gtcgattttt gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg     7620

cctttttacg gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat     7680

cccctgattc tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca     7740

gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca     7800

aaccgcctct ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg     7860

actggaaagc gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac     7920

cccaggcttt acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac     7980

aatttcacac aggaaacagc tatgaccatg attacgccag atttaattaa gg             8032


<210>  17
<211>  738
<212>  PRT
<213>  Unknown

<220>
<223>  AAVhu.37 capsid

<400>  17

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro 
            180                 185                 190         


Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
                405                 410                 415     


Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Ser Thr Gly Gly Thr Gln Gly Thr Gln Gln Leu Leu 
    450                 455                 460                 


Phe Ser Gln Ala Gly Pro Ala Asn Met Ser Ala Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Leu Ser 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
    530                 535                 540                 


Phe Gly Lys Gln Gly Ala Gly Arg Asp Asn Val Asp Tyr Ser Ser Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Gln Tyr Gly Val Val Ala Asp Asn Leu Gln Gln Thr Asn Thr Gly 
            580                 585                 590         


Pro Ile Val Gly Asn Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 
        595                 600                 605             


Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 
    610                 615                 620                 


Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625                 630                 635                 640 


Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 
                645                 650                 655     


Pro Ala Asp Pro Pro Thr Thr Phe Ser Gln Ala Lys Leu Ala Ser Phe 
            660                 665                 670         


Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 
        675                 680                 685             


Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 
    690                 695                 700                 


Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705                 710                 715                 720 


Gly Thr Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 
                725                 730                 735     


Asn Leu 
        


<210>  18
<211>  738
<212>  PRT
<213>  Unknown

<220>
<223>  AAVrh.10 capsid

<400>  18

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro 
            180                 185                 190         


Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
                405                 410                 415     


Gln Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Ser Thr Gly Gly Thr Ala Gly Thr Gln Gln Leu Leu 
    450                 455                 460                 


Phe Ser Gln Ala Gly Pro Asn Asn Met Ser Ala Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Leu Ser 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
    530                 535                 540                 


Phe Gly Lys Gln Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Gln Tyr Gly Val Val Ala Asp Asn Leu Gln Gln Gln Asn Ala Ala 
            580                 585                 590         


Pro Ile Val Gly Ala Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 
        595                 600                 605             


Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 
    610                 615                 620                 


Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625                 630                 635                 640 


Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 
                645                 650                 655     


Pro Ala Asp Pro Pro Thr Thr Phe Ser Gln Ala Lys Leu Ala Ser Phe 
            660                 665                 670         


Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 
        675                 680                 685             


Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 
    690                 695                 700                 


Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705                 710                 715                 720 


Gly Thr Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 
                725                 730                 735     


Asn Leu 
        


<210>  19
<211>  4371
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  19
atgcagatcg agctgtctac ctgcttcttc ctgtgcctgc tgcggttctg cttcagcgcc       60

accagacggt actatctggg cgccgtggaa ctgagctggg actacatgca gagcgacctg      120

ggcgagctgc ccgtggacgc cagattccct ccaagagtgc ccaagagctt ccccttcaac      180

acctccgtgg tgtacaagaa aaccctgttc gtggaattca ccgaccacct gttcaatatc      240

gccaagccca gacccccctg gatgggcctg ctgggaccta caattcaggc cgaggtgtac      300

gacaccgtcg tgatcaccct gaagaacatg gccagccacc ccgtgtctct gcacgccgtg      360

ggagtgtcct actggaaggc ctctgagggc gccgagtacg acgatcagac cagccagcgc      420

gagaaagagg acgacaaggt gttccctggc ggcagccaca cctacgtgtg gcaggtgctg      480

aaagaaaacg gccccatggc ctccgaccct ctgtgcctga catacagcta cctgagccac      540

gtggacctcg tgaaggacct gaacagcggc ctgatcggag ccctgctcgt gtgtagagag      600

ggcagcctgg ccaaagagaa aacccagacc ctgcacaagt tcatcctgct gttcgccgtg      660

ttcgacgagg gcaagagctg gcacagcgag acaaagaaca gcctgatgca ggaccgggac      720

gccgcctctg ctagagcctg gcccaaaatg cacaccgtga acggctacgt gaacagaagc      780

ctgcccggac tgatcggctg ccaccggaag tctgtgtact ggcacgtgat cggcatgggc      840

accacccctg aggtgcacag catctttctg gaaggacaca cctttctcgt gcggaaccac      900

cggcaggcca gcctggaaat cagccctatc accttcctga ccgcccagac actgctgatg      960

gacctgggcc agtttctgct gttctgccac atcagctccc accagcacga cggcatggaa     1020

gcctacgtga aggtggacag ctgccccgag gaaccccagc tgcggatgaa gaacaacgag     1080

gaagccgagg actacgacga cgacctgacc gacagcgaga tggacgtggt gcgcttcgac     1140

gacgataaca gccccagctt catccagatc agaagcgtgg ccaagaagca ccccaagacc     1200

tgggtgcact atatcgccgc cgaggaagag gactgggatt acgcccctct ggtgctggcc     1260

cccgacgaca gaagctacaa gagccagtac ctgaacaacg gcccccagcg gatcggccgg     1320

aagtataaga aagtgcggtt catggcctac accgacgaga cattcaagac cagagaggcc     1380

atccagcacg agagcggcat cctgggccct ctgctgtatg gcgaagtggg cgacaccctg     1440

ctgatcatct tcaagaacca ggccagcaga ccctacaaca tctaccctca cggcatcacc     1500

gacgtgcggc ccctgtactc tagaaggctg cccaagggcg tgaaacacct gaaggacttc     1560

cccatcctgc ccggcgagat cttcaagtac aagtggaccg tgaccgtgga agatggcccc     1620

accaagagcg accccagatg cctgacacgg tactatagca gcttcgtgaa catggaacgg     1680

gacctggcct ccggcctgat tggcccactg ctgatctgct acaaagaaag cgtggaccag     1740

cggggcaacc agatcatgag cgacaagcgg aacgtgatcc tgtttagcgt gttcgatgag     1800

aaccggtcct ggtatctgac cgagaatatc cagcggttcc tgcccaaccc tgccggcgtg     1860

cagctggaag atcctgagtt ccaggcctcc aacatcatgc actccatcaa tggctatgtg     1920

ttcgacagcc tgcagctgag cgtgtgcctg cacgaggtgg cctactggta catcctgagc     1980

atcggggccc agaccgactt cctgtccgtg ttcttctccg gctacacctt caagcacaag     2040

atggtgtacg aggataccct gaccctgttc ccctttagcg gcgaaaccgt gttcatgagc     2100

atggaaaacc ccggcctgtg gatcctgggc tgccacaaca gcgacttccg gaacagaggc     2160

atgaccgccc tgctgaaggt gtccagctgc gacaagaaca ccggcgacta ctacgaggac     2220

agctatgagg acatcagcgc ctacctgctg agcaagaaca acgccatcga gcccagaagc     2280

ttcagccaga acccccccgt gctgaagcgg caccagagag agatcacccg gaccaccctg     2340

cagtccgacc aggaagagat cgattacgac gacaccatca gcgtggaaat gaagaaagaa     2400

gatttcgaca tctacgacga ggacgagaac cagagccccc ggtcctttca gaaaaagacc     2460

cggcactact tcattgccgc tgtggaacgg ctgtgggact acggcatgag cagcagccct     2520

cacgtgctga gaaacagggc ccagagcggc agcgtgcccc agttcaagaa agtggtgttc     2580

caggaattca cagacggcag cttcacccag cctctgtacc gcggcgagct gaacgagcac     2640

ctgggactgc tgggccccta tatcagagcc gaagtggaag ataacatcat ggtcaccttc     2700

cggaatcagg cctcccggcc ctacagcttc tacagctccc tgatcagcta cgaagaggac     2760

cagagacagg gcgctgagcc ccggaagaac ttcgtgaagc ccaacgagac taagacctac     2820

ttttggaagg tgcagcacca catggcccct acaaaggacg agttcgactg caaggcctgg     2880

gcctacttct ccgacgtgga cctggaaaag gacgtgcact ctgggctgat cggccccctg     2940

ctcgtgtgcc acaccaacac cctgaatccc gcccacggca gacaggtgac agtgcaggaa     3000

ttcgccctgt tcttcaccat cttcgacgaa acaaagagct ggtacttcac cgaaaacatg     3060

gaaagaaact gccgggctcc ctgcaacatc cagatggaag atcccacctt caaagagaac     3120

taccggttcc acgccatcaa cggctacatc atggacacac tgcccggcct cgtgatggct     3180

caggatcagc ggatccggtg gtatctgctg tccatgggct ccaacgagaa catccacagc     3240

atccacttca gcggccacgt gttcaccgtg cggaaaaaag aagagtacaa aatggccctg     3300

tacaacctgt accctggggt gttcgagaca gtggaaatgc tgcccagcaa ggccggcatc     3360

tggcgggtgg agtgtctgat cggcgagcac ctgcacgctg ggatgagcac actgtttctg     3420

gtgtacagca acaagtgcca gacacctctg ggcatggcct ctggccacat ccgggacttt     3480

cagatcacag ccagcggcca gtacggccag tgggccccaa aactggccag actgcactac     3540

agcggcagca tcaacgcctg gtccaccaaa gagcccttca gctggatcaa ggtggacctg     3600

ctggctccca tgatcatcca cggaatcaag acccagggcg ccagacagaa gttcagcagc     3660

ctgtacatca gccagttcat catcatgtac agcctggacg gcaagaagtg gcagacctac     3720

cggggcaata gcaccggcac cctgatggtg ttcttcggca acgtggactc cagcggcatt     3780

aagcacaaca tcttcaaccc ccccatcatt gcccggtaca tccggctgca ccccacccac     3840

tacagcatcc ggtccaccct gagaatggaa ctgatgggct gcgacctgaa ctcctgctcc     3900

atgcccctgg ggatggaaag caaggccatc tccgacgccc agatcaccgc ctccagctac     3960

ttcaccaaca tgttcgccac ctggtcccca tccaaggccc ggctgcacct gcagggcaga     4020

agcaatgctt ggaggcctca ggtgaacaac cccaaagagt ggctgcaggt ggacttccag     4080

aaaaccatga aagtgaccgg cgtgaccacc cagggcgtga agtctctgct gacctctatg     4140

tacgtgaaag agttcctgat ctccagcagc caggacggcc accagtggac cctgtttttc     4200

cagaacggca aagtgaaagt gtttcagggg aaccaggact ccttcacccc cgtcgtgaat     4260

agcctggacc ctccactgct gaccagatac ctgcggatcc accctcagag ttgggtgcac     4320

cagattgctc tgcggatgga agtgctggga tgcgaggccc aggacctgta c              4371


<210>  20
<211>  736
<212>  PRT
<213>  Unknown

<220>
<223>  AAV3B capsid

<400>  20

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Val Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Arg Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Ile Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Asp Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Val Gly 
145                 150                 155                 160 


Lys Ser Gly Lys Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Thr Ser Leu Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His Tyr 
            260                 265                 270         


Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
        275                 280                 285             


Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp 
    290                 295                 300                 


Gly Phe Arg Pro Lys Lys Leu Ser Phe Lys Leu Phe Asn Ile Gln Val 
305                 310                 315                 320 


Lys Glu Val Thr Gln Asn Asp Gly Thr Thr Thr Ile Ala Asn Asn Leu 
                325                 330                 335     


Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu Pro Tyr 
            340                 345                 350         


Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp 
        355                 360                 365             


Val Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
    370                 375                 380                 


Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385                 390                 395                 400 


Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Thr Phe Glu 
                405                 410                 415     


Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp Arg 
            420                 425                 430         


Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg Thr 
        435                 440                 445             


Gln Gly Thr Thr Ser Gly Thr Thr Asn Gln Ser Arg Leu Leu Phe Ser 
    450                 455                 460                 


Gln Ala Gly Pro Gln Ser Met Ser Leu Gln Ala Arg Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Leu Ser Lys Thr Ala Asn Asp Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Pro Trp Thr Ala Ala Ser Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Asp Asp Glu Glu Lys Phe Phe Pro Met His Gly Asn Leu Ile Phe Gly 
    530                 535                 540                 


Lys Glu Gly Thr Thr Ala Ser Asn Ala Glu Leu Asp Asn Val Met Ile 
545                 550                 555                 560 


Thr Asp Glu Glu Glu Ile Arg Thr Thr Asn Pro Val Ala Thr Glu Gln 
                565                 570                 575     


Tyr Gly Thr Val Ala Asn Asn Leu Gln Ser Ser Asn Thr Ala Pro Thr 
            580                 585                 590         


Thr Arg Thr Val Asn Asp Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Met Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asn Pro Pro Thr Thr Phe Ser Pro Ala Lys Phe Ala Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


