                               SEQUENCE LISTING

<110> WILLIAM MARSH RICE UNIVERSITY
 
<120> LIGHT-CONTROLLED GENE DELIVERY WITH VIRUS VECTORS THROUGH 
      INCORPORATION OF OPTOGENETIC PROTEINS AND GENETIC INSERTION OF 
      NON-CONFORMATIONALLY CONSTRAINED PEPTIDES

<130> 15-21018-WO

<140>
<141>

<150> 62/222,047
<151> 2015-09-22

<150> 62/221,754
<151> 2015-09-22

<160> 177   

<170> PatentIn version 3.5

<210> 1
<211> 8
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 1
Ile Pro Val Ser Leu Arg Ser Gly 
1               5               


<210> 2
<211> 8
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 2
Ile Pro Glu Ser Leu Arg Ala Gly 
1               5               


<210> 3
<211> 5
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 3
Asp Asp Asp Asp Lys 
1               5   


<210> 4
<211> 4
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 4
Gly Gly Gly Ser 
1               


<210> 5
<211> 7
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 5
Pro Lys Lys Lys Arg Lys Val 
1               5           


<210> 6
<211> 21
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 6
Thr Arg Pro Gln Arg Asp Cys Pro Thr Pro Thr Trp Gln Pro Gln Pro 
1               5                   10                  15      


Arg Arg Lys Ser Trp 
            20      


<210> 7
<211> 11
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 7
Leu Gln Leu Pro Pro Leu Glu Arg Leu Thr Leu 
1               5                   10      


<210> 8
<211> 9
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 8
Leu Pro Pro Leu Glu Arg Leu Thr Leu 
1               5                   


<210> 9
<211> 18
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 9
Pro Ser Thr Arg Ile Gln Gln Gln Leu Gly Gln Leu Thr Leu Glu Asn 
1               5                   10                  15      


Leu Gln 
        


<210> 10
<211> 11
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 10
Met Leu Ala Leu Lys Leu Ala Gly Leu Asp Ile 
1               5                   10      


<210> 11
<211> 21
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 11
cccaagaaaa agcggaaggt g                                                 21


<210> 12
<211> 65
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 12
acgaggccgc aaagagactg cccgacgcca acctggcagc cgcagccaag aagaaaaagc       60

tggac                                                                   65


<210> 13
<211> 33
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 13
cttcaacttc ctcctcttga gagacttact ctt                                    33


<210> 14
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 14
cttcctcctc ttgagagact tactctt                                           27


<210> 15
<211> 54
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 15
cccagcaccc ggatccagca gcagctgggc cagctgaccc tggagaacct gcag             54


<210> 16
<211> 33
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 16
atgttagcct tgaaattagc aggtcttgat atc                                    33


<210> 17
<211> 6
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 17
Pro Leu Gly Leu Ala Arg 
1               5       


<210> 18
<211> 8
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 18
Val Pro Met Ser Met Arg Gly Gly 
1               5               


<210> 19
<211> 6
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide


<220>
<221> MOD_RES
<222> (6)..(6)
<223> Gln or Gly

<400> 19
Glu Asn Leu Tyr Phe Xaa 
1               5       


<210> 20
<211> 4
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 20
Asp Asp Asp Lys 
1               


<210> 21
<211> 21
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 21
tcacggggat ttccaagtct c                                                 21


<210> 22
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 22
aatggggcgg agttgttacg ac                                                22


<210> 23
<211> 6
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      6xHis tag

<400> 23
His His His His His His 
1               5       


<210> 24
<211> 37
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 24
gcattaggtc tctaatggta tctggtgttg gtggttc                                37


<210> 25
<211> 57
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 25
atgatgatga tgatgatgac caccaccacc tactgcaaga gcttgttgta attctgg          57


<210> 26
<211> 41
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 26
gctaatggtc tcttttaatg atgatgatga tgatgaccac c                           41


<210> 27
<211> 15
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 27
Ala Gly Pro Leu Gly Leu Ala Arg Gly Asp Asp Asp Lys Gly Ala 
1               5                   10                  15  


<210> 28
<211> 16
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 28
Ala Gly Asp Asp Asp Asp Lys Gly Pro Leu Gly Leu Ala Arg Gly Ala 
1               5                   10                  15      


<210> 29
<211> 4
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 29
Asp Asp Asp Asp 
1               


<210> 30
<211> 18
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide


<220>
<221> MOD_RES
<222> (3)..(10)
<223> Any amino acid

<220>
<221> MOD_RES
<222> (12)..(16)
<223> Any amino acid

<400> 30
Ala Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa 
1               5                   10                  15      


Gly Ala 
        


<210> 31
<211> 1950
<212> DNA
<213> Arabidopsis thaliana

<400> 31
atggtttccg gagtcggggg tagtggcggt ggccgtggcg gtggccgtgg cggagaagaa       60

gaaccgtcgt caagtcacac tcctaataac cgaagaggag gagaacaagc tcaatcgtcg      120

ggaacgaaat ctctcagacc aagaagcaac actgaatcaa tgagcaaagc aattcaacag      180

tacaccgtcg acgcaagact ccacgccgtt ttcgaacaat ccggcgaatc agggaaatca      240

ttcgactact cacaatcact caaaacgacg acgtacggtt cctctgtacc tgagcaacag      300

atcacagctt atctctctcg aatccagcga ggtggttaca ttcagccttt cggatgtatg      360

atcgccgtcg atgaatccag tttccggatc atcggttaca gtgaaaacgc cagagaaatg      420

ttagggatta tgcctcaatc tgttcctact cttgagaaac ctgagattct agctatggga      480

actgatgtga gatctttgtt cacttcttcg agctcgattc tactcgagcg tgctttcgtt      540

gctcgagaga ttaccttgtt aaatccggtt tggatccatt ccaagaatac tggtaaaccg      600

ttttacgcca ttcttcatag gattgatgtt ggtgttgtta ttgatttaga gccagctaga      660

actgaagatc ctgcgctttc tattgctggt gctgttcaat cgcagaaact cgcggttcgt      720

gcgatttctc agttacaggc tcttcctggt ggagatatta agcttttgtg tgacactgtc      780

gtggaaagtg tgagggactt gactggttat gatcgtgtta tggtttataa gtttcatgaa      840

gatgagcatg gagaagttgt agctgagagt aaacgagatg atttagagcc ttatattgga      900

ctgcattatc ctgctactga tattcctcaa gcgtcaaggt tcttgtttaa gcagaaccgt      960

gtccgaatga tagtagattg caatgccaca cctgttcttg tggtccagga cgataggcta     1020

actcagtcta tgtgcttggt tggttctact cttagggctc ctcatggttg tcactctcag     1080

tatatggcta acatgggatc tattgcgtct ttagcaatgg cggttataat caatggaaat     1140

gaagatgatg ggagcaatgt agctagtgga agaagctcga tgaggctttg gggtttggtt     1200

gtttgccatc acacttcttc tcgctgcata ccgtttccgc taaggtatgc ttgtgagttt     1260

ttgatgcagg ctttcggttt acagttaaac atggaattgc agttagcttt gcaaatgtca     1320

gagaaacgcg ttttgagaac gcagacactg ttatgtgata tgcttctgcg tgactcgcct     1380

gctggaattg ttacacagag tcccagtatc atggacttag tgaaatgtga cggtgcagca     1440

tttctttacc acgggaagta ttacccgttg ggtgttgctc ctagtgaagt tcagataaaa     1500

gatgttgtgg agtggttgct tgcgaatcat gcggattcaa ccggattaag cactgatagt     1560

ttaggcgatg cggggtatcc cggtgcagct gcgttagggg atgctgtgtg cggtatggca     1620

gttgcatata tcacaaaaag agactttctt ttttggtttc gatctcacac tgcgaaagaa     1680

atcaaatggg gaggcgctaa gcatcatccg gaggataaag atgatgggca acgaatgcat     1740

cctcgttcgt cctttcaggc ttttcttgaa gttgttaaga gccggagtca gccatgggaa     1800

actgcggaaa tggatgcgat tcactcgctc cagcttattc tgagagactc ttttaaagaa     1860

tctgaggcgg ctatgaactc taaagttgtg gatggtgtgg ttcagccatg tagggatatg     1920

gcgggggaac aggggattga tgagttaggt                                      1950


<210> 32
<211> 650
<212> PRT
<213> Arabidopsis thaliana

<400> 32
Met Val Ser Gly Val Gly Gly Ser Gly Gly Gly Arg Gly Gly Gly Arg 
1               5                   10                  15      


Gly Gly Glu Glu Glu Pro Ser Ser Ser His Thr Pro Asn Asn Arg Arg 
            20                  25                  30          


Gly Gly Glu Gln Ala Gln Ser Ser Gly Thr Lys Ser Leu Arg Pro Arg 
        35                  40                  45              


Ser Asn Thr Glu Ser Met Ser Lys Ala Ile Gln Gln Tyr Thr Val Asp 
    50                  55                  60                  


Ala Arg Leu His Ala Val Phe Glu Gln Ser Gly Glu Ser Gly Lys Ser 
65                  70                  75                  80  


Phe Asp Tyr Ser Gln Ser Leu Lys Thr Thr Thr Tyr Gly Ser Ser Val 
                85                  90                  95      


Pro Glu Gln Gln Ile Thr Ala Tyr Leu Ser Arg Ile Gln Arg Gly Gly 
            100                 105                 110         


Tyr Ile Gln Pro Phe Gly Cys Met Ile Ala Val Asp Glu Ser Ser Phe 
        115                 120                 125             


Arg Ile Ile Gly Tyr Ser Glu Asn Ala Arg Glu Met Leu Gly Ile Met 
    130                 135                 140                 


Pro Gln Ser Val Pro Thr Leu Glu Lys Pro Glu Ile Leu Ala Met Gly 
145                 150                 155                 160 


Thr Asp Val Arg Ser Leu Phe Thr Ser Ser Ser Ser Ile Leu Leu Glu 
                165                 170                 175     


Arg Ala Phe Val Ala Arg Glu Ile Thr Leu Leu Asn Pro Val Trp Ile 
            180                 185                 190         


His Ser Lys Asn Thr Gly Lys Pro Phe Tyr Ala Ile Leu His Arg Ile 
        195                 200                 205             


Asp Val Gly Val Val Ile Asp Leu Glu Pro Ala Arg Thr Glu Asp Pro 
    210                 215                 220                 


Ala Leu Ser Ile Ala Gly Ala Val Gln Ser Gln Lys Leu Ala Val Arg 
225                 230                 235                 240 


Ala Ile Ser Gln Leu Gln Ala Leu Pro Gly Gly Asp Ile Lys Leu Leu 
                245                 250                 255     


Cys Asp Thr Val Val Glu Ser Val Arg Asp Leu Thr Gly Tyr Asp Arg 
            260                 265                 270         


Val Met Val Tyr Lys Phe His Glu Asp Glu His Gly Glu Val Val Ala 
        275                 280                 285             


Glu Ser Lys Arg Asp Asp Leu Glu Pro Tyr Ile Gly Leu His Tyr Pro 
    290                 295                 300                 


Ala Thr Asp Ile Pro Gln Ala Ser Arg Phe Leu Phe Lys Gln Asn Arg 
305                 310                 315                 320 


Val Arg Met Ile Val Asp Cys Asn Ala Thr Pro Val Leu Val Val Gln 
                325                 330                 335     


Asp Asp Arg Leu Thr Gln Ser Met Cys Leu Val Gly Ser Thr Leu Arg 
            340                 345                 350         


Ala Pro His Gly Cys His Ser Gln Tyr Met Ala Asn Met Gly Ser Ile 
        355                 360                 365             


Ala Ser Leu Ala Met Ala Val Ile Ile Asn Gly Asn Glu Asp Asp Gly 
    370                 375                 380                 


Ser Asn Val Ala Ser Gly Arg Ser Ser Met Arg Leu Trp Gly Leu Val 
385                 390                 395                 400 


Val Cys His His Thr Ser Ser Arg Cys Ile Pro Phe Pro Leu Arg Tyr 
                405                 410                 415     


Ala Cys Glu Phe Leu Met Gln Ala Phe Gly Leu Gln Leu Asn Met Glu 
            420                 425                 430         


Leu Gln Leu Ala Leu Gln Met Ser Glu Lys Arg Val Leu Arg Thr Gln 
        435                 440                 445             


Thr Leu Leu Cys Asp Met Leu Leu Arg Asp Ser Pro Ala Gly Ile Val 
    450                 455                 460                 


Thr Gln Ser Pro Ser Ile Met Asp Leu Val Lys Cys Asp Gly Ala Ala 
465                 470                 475                 480 


Phe Leu Tyr His Gly Lys Tyr Tyr Pro Leu Gly Val Ala Pro Ser Glu 
                485                 490                 495     


Val Gln Ile Lys Asp Val Val Glu Trp Leu Leu Ala Asn His Ala Asp 
            500                 505                 510         


Ser Thr Gly Leu Ser Thr Asp Ser Leu Gly Asp Ala Gly Tyr Pro Gly 
        515                 520                 525             


Ala Ala Ala Leu Gly Asp Ala Val Cys Gly Met Ala Val Ala Tyr Ile 
    530                 535                 540                 


Thr Lys Arg Asp Phe Leu Phe Trp Phe Arg Ser His Thr Ala Lys Glu 
545                 550                 555                 560 


Ile Lys Trp Gly Gly Ala Lys His His Pro Glu Asp Lys Asp Asp Gly 
                565                 570                 575     


Gln Arg Met His Pro Arg Ser Ser Phe Gln Ala Phe Leu Glu Val Val 
            580                 585                 590         


Lys Ser Arg Ser Gln Pro Trp Glu Thr Ala Glu Met Asp Ala Ile His 
        595                 600                 605             


Ser Leu Gln Leu Ile Leu Arg Asp Ser Phe Lys Glu Ser Glu Ala Ala 
    610                 615                 620                 


Met Asn Ser Lys Val Val Asp Gly Val Val Gln Pro Cys Arg Asp Met 
625                 630                 635                 640 


Ala Gly Glu Gln Gly Ile Asp Glu Leu Gly 
                645                 650 


<210> 33
<211> 2394
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 33
atggtttccg gagtcggggg tagtggcggt ggccgtggcg gtggccgtgg cggagaagaa       60

gaaccgtcgt caagtcacac tcctaataac cgaagaggag gagaacaagc tcaatcgtcg      120

ggaacgaaat ctctcagacc aagaagcaac actgaatcaa tgagcaaagc aattcaacag      180

tacaccgtcg acgcaagact ccacgccgtt ttcgaacaat ccggcgaatc agggaaatca      240

ttcgactact cacaatcact caaaacgacg acgtacggtt cctctgtacc tgagcaacag      300

atcacagctt atctctctcg aatccagcga ggtggttaca ttcagccttt cggatgtatg      360

atcgccgtcg atgaatccag tttccggatc atcggttaca gtgaaaacgc cagagaaatg      420

ttagggatta tgcctcaatc tgttcctact cttgagaaac ctgagattct agctatggga      480

actgatgtga gatctttgtt cacttcttcg agctcgattc tactcgagcg tgctttcgtt      540

gctcgagaga ttaccttgtt aaatccggtt tggatccatt ccaagaatac tggtaaaccg      600

ttttacgcca ttcttcatag gattgatgtt ggtgttgtta ttgatttaga gccagctaga      660

actgaagatc ctgcgctttc tattgctggt gctgttcaat cgcagaaact cgcggttcgt      720

gcgatttctc agttacaggc tcttcctggt ggagatatta agcttttgtg tgacactgtc      780

gtggaaagtg tgagggactt gactggttat gatcgtgtta tggtttataa gtttcatgaa      840

gatgagcatg gagaagttgt agctgagagt aaacgagatg atttagagcc ttatattgga      900

ctgcattatc ctgctactga tattcctcaa gcgtcaaggt tcttgtttaa gcagaaccgt      960

gtccgaatga tagtagattg caatgccaca cctgttcttg tggtccagga cgataggcta     1020

actcagtcta tgtgcttggt tggttctact cttagggctc ctcatggttg tcactctcag     1080

tatatggcta acatgggatc tattgcgtct ttagcaatgg cggttataat caatggaaat     1140

gaagatgatg ggagcaatgt agctagtgga agaagctcga tgaggctttg gggtttggtt     1200

gtttgccatc acacttcttc tcgctgcata ccgtttccgc taaggtatgc ttgtgagttt     1260

ttgatgcagg ctttcggttt acagttaaac atggaattgc agttagcttt gcaaatgtca     1320

gagaaacgcg ttttgagaac gcagacactg ttatgtgata tgcttctgcg tgactcgcct     1380

gctggaattg ttacacagag tcccagtatc atggacttag tgaaatgtga cggtgcagca     1440

tttctttacc acgggaagta ttacccgttg ggtgttgctc ctagtgaagt tcagataaaa     1500

gatgttgtgg agtggttgct tgcgaatcat gcggattcaa ccggattaag cactgatagt     1560

ttaggcgatg cggggtatcc cggtgcagct gcgttagggg atgctgtgtg cggtatggca     1620

gttgcatata tcacaaaaag agactttctt ttttggtttc gatctcacac tgcgaaagaa     1680

atcaaatggg gaggcgctaa gcatcatccg gaggataaag atgatgggca acgaatgcat     1740

cctcgttcgt cctttcaggc ttttcttgaa gttgttaaga gccggagtca gccatgggaa     1800

actgcggaaa tggatgcgat tcactcgctc cagcttattc tgagagactc ttttaaagaa     1860

tctgaggcgg ctatgaactc taaagttgtg gatggtgtgg ttcagccatg tagggatatg     1920

gcgggggaac aggggattga tgagttaggt gaattcgata gtgctggtag tgctggtagt     1980

gctggttccg cgtacagccg cgcgcgtacg aaaaacaatt acgggtctac catcgagggc     2040

ctgctcgatc tcccggacga cgacgccccc gaagaggcgg ggctggcggc tccgcgcctg     2100

tcctttctcc ccgcgggaca cacgcgcaga ctgtcgacgg cccccccgac cgatgtcagc     2160

ctgggggacg agctccactt agacggcgag gacgtggcga tggcgcatgc cgacgcgcta     2220

gacgatttcg atctggacat gttgggggac ggggattccc cgggtccggg atttaccccc     2280

cacgactccg ccccctacgg cgctctggat atggccgact tcgagtttga gcagatgttt     2340

accgatgccc ttggaattga cgagtacggt gggcccaaga aaaagcggaa ggtg           2394


<210> 34
<211> 798
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 34
Met Val Ser Gly Val Gly Gly Ser Gly Gly Gly Arg Gly Gly Gly Arg 
1               5                   10                  15      


Gly Gly Glu Glu Glu Pro Ser Ser Ser His Thr Pro Asn Asn Arg Arg 
            20                  25                  30          


Gly Gly Glu Gln Ala Gln Ser Ser Gly Thr Lys Ser Leu Arg Pro Arg 
        35                  40                  45              


Ser Asn Thr Glu Ser Met Ser Lys Ala Ile Gln Gln Tyr Thr Val Asp 
    50                  55                  60                  


Ala Arg Leu His Ala Val Phe Glu Gln Ser Gly Glu Ser Gly Lys Ser 
65                  70                  75                  80  


Phe Asp Tyr Ser Gln Ser Leu Lys Thr Thr Thr Tyr Gly Ser Ser Val 
                85                  90                  95      


Pro Glu Gln Gln Ile Thr Ala Tyr Leu Ser Arg Ile Gln Arg Gly Gly 
            100                 105                 110         


Tyr Ile Gln Pro Phe Gly Cys Met Ile Ala Val Asp Glu Ser Ser Phe 
        115                 120                 125             


Arg Ile Ile Gly Tyr Ser Glu Asn Ala Arg Glu Met Leu Gly Ile Met 
    130                 135                 140                 


Pro Gln Ser Val Pro Thr Leu Glu Lys Pro Glu Ile Leu Ala Met Gly 
145                 150                 155                 160 


Thr Asp Val Arg Ser Leu Phe Thr Ser Ser Ser Ser Ile Leu Leu Glu 
                165                 170                 175     


Arg Ala Phe Val Ala Arg Glu Ile Thr Leu Leu Asn Pro Val Trp Ile 
            180                 185                 190         


His Ser Lys Asn Thr Gly Lys Pro Phe Tyr Ala Ile Leu His Arg Ile 
        195                 200                 205             


Asp Val Gly Val Val Ile Asp Leu Glu Pro Ala Arg Thr Glu Asp Pro 
    210                 215                 220                 


Ala Leu Ser Ile Ala Gly Ala Val Gln Ser Gln Lys Leu Ala Val Arg 
225                 230                 235                 240 


Ala Ile Ser Gln Leu Gln Ala Leu Pro Gly Gly Asp Ile Lys Leu Leu 
                245                 250                 255     


Cys Asp Thr Val Val Glu Ser Val Arg Asp Leu Thr Gly Tyr Asp Arg 
            260                 265                 270         


Val Met Val Tyr Lys Phe His Glu Asp Glu His Gly Glu Val Val Ala 
        275                 280                 285             


Glu Ser Lys Arg Asp Asp Leu Glu Pro Tyr Ile Gly Leu His Tyr Pro 
    290                 295                 300                 


Ala Thr Asp Ile Pro Gln Ala Ser Arg Phe Leu Phe Lys Gln Asn Arg 
305                 310                 315                 320 


Val Arg Met Ile Val Asp Cys Asn Ala Thr Pro Val Leu Val Val Gln 
                325                 330                 335     


Asp Asp Arg Leu Thr Gln Ser Met Cys Leu Val Gly Ser Thr Leu Arg 
            340                 345                 350         


Ala Pro His Gly Cys His Ser Gln Tyr Met Ala Asn Met Gly Ser Ile 
        355                 360                 365             


Ala Ser Leu Ala Met Ala Val Ile Ile Asn Gly Asn Glu Asp Asp Gly 
    370                 375                 380                 


Ser Asn Val Ala Ser Gly Arg Ser Ser Met Arg Leu Trp Gly Leu Val 
385                 390                 395                 400 


Val Cys His His Thr Ser Ser Arg Cys Ile Pro Phe Pro Leu Arg Tyr 
                405                 410                 415     


Ala Cys Glu Phe Leu Met Gln Ala Phe Gly Leu Gln Leu Asn Met Glu 
            420                 425                 430         


Leu Gln Leu Ala Leu Gln Met Ser Glu Lys Arg Val Leu Arg Thr Gln 
        435                 440                 445             


Thr Leu Leu Cys Asp Met Leu Leu Arg Asp Ser Pro Ala Gly Ile Val 
    450                 455                 460                 


Thr Gln Ser Pro Ser Ile Met Asp Leu Val Lys Cys Asp Gly Ala Ala 
465                 470                 475                 480 


Phe Leu Tyr His Gly Lys Tyr Tyr Pro Leu Gly Val Ala Pro Ser Glu 
                485                 490                 495     


Val Gln Ile Lys Asp Val Val Glu Trp Leu Leu Ala Asn His Ala Asp 
            500                 505                 510         


Ser Thr Gly Leu Ser Thr Asp Ser Leu Gly Asp Ala Gly Tyr Pro Gly 
        515                 520                 525             


Ala Ala Ala Leu Gly Asp Ala Val Cys Gly Met Ala Val Ala Tyr Ile 
    530                 535                 540                 


Thr Lys Arg Asp Phe Leu Phe Trp Phe Arg Ser His Thr Ala Lys Glu 
545                 550                 555                 560 


Ile Lys Trp Gly Gly Ala Lys His His Pro Glu Asp Lys Asp Asp Gly 
                565                 570                 575     


Gln Arg Met His Pro Arg Ser Ser Phe Gln Ala Phe Leu Glu Val Val 
            580                 585                 590         


Lys Ser Arg Ser Gln Pro Trp Glu Thr Ala Glu Met Asp Ala Ile His 
        595                 600                 605             


Ser Leu Gln Leu Ile Leu Arg Asp Ser Phe Lys Glu Ser Glu Ala Ala 
    610                 615                 620                 


Met Asn Ser Lys Val Val Asp Gly Val Val Gln Pro Cys Arg Asp Met 
625                 630                 635                 640 


Ala Gly Glu Gln Gly Ile Asp Glu Leu Gly Glu Phe Asp Ser Ala Gly 
                645                 650                 655     


Ser Ala Gly Ser Ala Gly Ser Ala Tyr Ser Arg Ala Arg Thr Lys Asn 
            660                 665                 670         


Asn Tyr Gly Ser Thr Ile Glu Gly Leu Leu Asp Leu Pro Asp Asp Asp 
        675                 680                 685             


Ala Pro Glu Glu Ala Gly Leu Ala Ala Pro Arg Leu Ser Phe Leu Pro 
    690                 695                 700                 


Ala Gly His Thr Arg Arg Leu Ser Thr Ala Pro Pro Thr Asp Val Ser 
705                 710                 715                 720 


Leu Gly Asp Glu Leu His Leu Asp Gly Glu Asp Val Ala Met Ala His 
                725                 730                 735     


Ala Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu Gly Asp Gly Asp 
            740                 745                 750         


Ser Pro Gly Pro Gly Phe Thr Pro His Asp Ser Ala Pro Tyr Gly Ala 
        755                 760                 765             


Leu Asp Met Ala Asp Phe Glu Phe Glu Gln Met Phe Thr Asp Ala Leu 
    770                 775                 780                 


Gly Ile Asp Glu Tyr Gly Gly Pro Lys Lys Lys Arg Lys Val 
785                 790                 795             


<210> 35
<211> 2748
<212> DNA
<213> Arabidopsis thaliana

<400> 35
gtatctggtg ttggtggttc tggtggtgga agaggtggag gtagaggagg tgaagaagaa       60

ccatcaagta gtcatacacc taacaatcgt agaggtggtg agcaagctca atcatcaggt      120

acaaaatcat tacgtccaag aagtaatact gaatcaatgt caaaagcaat tcaacaatac      180

acagtagatg ctagattaca cgccgtattc gaacaatctg gagaaagtgg taagagtttt      240

gattactcac aatcattgaa aacaaccact tatggtagtt cagttccaga acaacaaatc      300

actgcatatc ttagtagaat acaacgtggt ggttacattc aaccatttgg ttgtatgatt      360

gcagttgatg aatcttcttt tagaatcatt ggttattcag aaaatgcaag agaaatgttg      420

ggtatcatgc cacaatcagt accaacctta gaaaaaccag aaattcttgc aatgggtaca      480

gatgttagaa gtttgtttac atcatcatca tcaattcttt tggagagagc ttttgttgca      540

cgtgaaatca ctttacttaa tccagtatgg attcatagta agaatactgg aaagccattc      600

tatgcaattc ttcatagaat agatgtagga gttgttattg atcttgagcc agcaagaaca      660

gaagatccag cattatctat tgctggtgca gtacaatcac aaaaacttgc tgttagagca      720

attagtcaat tacaagcctt gccaggtggt gatataaaac ttctttgtga tacagttgtt      780

gaatcagttc gtgatcttac cggttatgat agagttatgg tatacaaatt ccatgaggat      840

gaacatggtg aagttgttgc agaaagtaaa agagatgatc ttgaaccata cattggtttg      900

cattatccag ctactgatat tccacaagca tcaagatttc ttttcaaaca aaatcgtgtt      960

agaatgattg tagattgtaa tgccacccca gtattagttg ttcaagatga tagattgaca     1020

caaagtatgt gtttagtagg ttcaacatta agagcacctc atggatgtca ttcacaatat     1080

atggccaata tgggttcaat agcatcatta gctatggcag taatcatcaa tggaaatgaa     1140

gatgatggtt caaatgttgc atcaggtaga agttcaatgc gtttatgggg tttagtagtt     1200

tgtcatcata caagttctcg ttgtatccca tttcctttac gttatgcatg tgaatttctt     1260

atgcaagcat ttggtttaca attgaatatg gaacttcaat tagcattaca aatgagtgaa     1320

aagagagttt tacgtacaca aacattgtta tgcgatatgt tattgagaga ttctccagct     1380

ggtattgtta ctcaatcacc atctatcatg gatcttgtaa agtgtgatgg tgcagcattc     1440

ttataccacg gaaagtacta tccattaggt gttgcaccat ctgaagttca aatcaaagat     1500

gttgtagaat ggttattggc taatcacgca gattctactg gtttatcaac tgattctctt     1560

ggtgatgctg gttatcctgg tgccgcagcc ttaggagatg ctgtatgtgg tatggccgtt     1620

gcttacatta caaaaagaga tttcttgttt tggtttcgtt ctcatacagc taaagagatc     1680

aaatggggtg gtgcaaaaca tcatccagaa gataaggatg atggtcaaag aatgcatcca     1740

agatcatcat ttcaagcatt cttagaagta gttaagtcaa gaagtcaacc ttgggaaaca     1800

gcagaaatgg atgcaataca ttcattacaa ttgatacttc gtgattcatt caaagaatca     1860

gaagcagcaa tgaatagtaa agttgttgat ggtgttgttc aaccatgtag agatatggcc     1920

ggtgaacaag gtattgatga attaggtgct gtagctagag aaatggttag attgatagaa     1980

actgccactg ttccaatctt cgctgttgat gctggtggat gcataaacgg ttggaatgct     2040

aagatcgcag aattgaccgg tttgtcagtt gaagaagcta tgggtaaaag tttagtttca     2100

gatttgatct ataaggaaaa tgaagcaacc gttaacaaat tgttatcaag agcattgaga     2160

ggagatgagg aaaagaatgt agaagttaag ttaaagacat tttcaccaga gttacaaggt     2220

aaagcagttt ttgttgtagt taatgcttgt tcatcaaaag attacttgaa taacattgta     2280

ggtgtttgtt ttgttggtca agatgtaact tcacaaaaga ttgttatgga taagtttatc     2340

aatatccaag gtgattacaa agctattgtt cattctccaa atccattgat tccaccaatc     2400

tttgcagctg atgagaatac atgttgttta gaatggaata tggcaatgga aaagttaact     2460

ggttggtcac gttcagaagt aattggtaag atgattgttg gagaggtttt tggtagttgt     2520

tgtatgctta aaggtccaga tgctttaact aagtttatga ttgttttgca taatgcaatt     2580

ggtggtcaag atacagataa gttcccattc cctttcttcg atagaaatgg aaagtttgtt     2640

caagcattac ttactgctaa caaaagagta tcattagaag gtaaagtaat aggagctttt     2700

tgtttcttac aaattccttc accagaatta caacaagctc ttgcagta                  2748


<210> 36
<211> 916
<212> PRT
<213> Arabidopsis thaliana

<400> 36
Val Ser Gly Val Gly Gly Ser Gly Gly Gly Arg Gly Gly Gly Arg Gly 
1               5                   10                  15      


Gly Glu Glu Glu Pro Ser Ser Ser His Thr Pro Asn Asn Arg Arg Gly 
            20                  25                  30          


Gly Glu Gln Ala Gln Ser Ser Gly Thr Lys Ser Leu Arg Pro Arg Ser 
        35                  40                  45              


Asn Thr Glu Ser Met Ser Lys Ala Ile Gln Gln Tyr Thr Val Asp Ala 
    50                  55                  60                  


Arg Leu His Ala Val Phe Glu Gln Ser Gly Glu Ser Gly Lys Ser Phe 
65                  70                  75                  80  


Asp Tyr Ser Gln Ser Leu Lys Thr Thr Thr Tyr Gly Ser Ser Val Pro 
                85                  90                  95      


Glu Gln Gln Ile Thr Ala Tyr Leu Ser Arg Ile Gln Arg Gly Gly Tyr 
            100                 105                 110         


Ile Gln Pro Phe Gly Cys Met Ile Ala Val Asp Glu Ser Ser Phe Arg 
        115                 120                 125             


Ile Ile Gly Tyr Ser Glu Asn Ala Arg Glu Met Leu Gly Ile Met Pro 
    130                 135                 140                 


Gln Ser Val Pro Thr Leu Glu Lys Pro Glu Ile Leu Ala Met Gly Thr 
145                 150                 155                 160 


Asp Val Arg Ser Leu Phe Thr Ser Ser Ser Ser Ile Leu Leu Glu Arg 
                165                 170                 175     


Ala Phe Val Ala Arg Glu Ile Thr Leu Leu Asn Pro Val Trp Ile His 
            180                 185                 190         


Ser Lys Asn Thr Gly Lys Pro Phe Tyr Ala Ile Leu His Arg Ile Asp 
        195                 200                 205             


Val Gly Val Val Ile Asp Leu Glu Pro Ala Arg Thr Glu Asp Pro Ala 
    210                 215                 220                 


Leu Ser Ile Ala Gly Ala Val Gln Ser Gln Lys Leu Ala Val Arg Ala 
225                 230                 235                 240 


Ile Ser Gln Leu Gln Ala Leu Pro Gly Gly Asp Ile Lys Leu Leu Cys 
                245                 250                 255     


Asp Thr Val Val Glu Ser Val Arg Asp Leu Thr Gly Tyr Asp Arg Val 
            260                 265                 270         


Met Val Tyr Lys Phe His Glu Asp Glu His Gly Glu Val Val Ala Glu 
        275                 280                 285             


Ser Lys Arg Asp Asp Leu Glu Pro Tyr Ile Gly Leu His Tyr Pro Ala 
    290                 295                 300                 


Thr Asp Ile Pro Gln Ala Ser Arg Phe Leu Phe Lys Gln Asn Arg Val 
305                 310                 315                 320 


Arg Met Ile Val Asp Cys Asn Ala Thr Pro Val Leu Val Val Gln Asp 
                325                 330                 335     


Asp Arg Leu Thr Gln Ser Met Cys Leu Val Gly Ser Thr Leu Arg Ala 
            340                 345                 350         


Pro His Gly Cys His Ser Gln Tyr Met Ala Asn Met Gly Ser Ile Ala 
        355                 360                 365             


Ser Leu Ala Met Ala Val Ile Ile Asn Gly Asn Glu Asp Asp Gly Ser 
    370                 375                 380                 


Asn Val Ala Ser Gly Arg Ser Ser Met Arg Leu Trp Gly Leu Val Val 
385                 390                 395                 400 


Cys His His Thr Ser Ser Arg Cys Ile Pro Phe Pro Leu Arg Tyr Ala 
                405                 410                 415     


Cys Glu Phe Leu Met Gln Ala Phe Gly Leu Gln Leu Asn Met Glu Leu 
            420                 425                 430         


Gln Leu Ala Leu Gln Met Ser Glu Lys Arg Val Leu Arg Thr Gln Thr 
        435                 440                 445             


Leu Leu Cys Asp Met Leu Leu Arg Asp Ser Pro Ala Gly Ile Val Thr 
    450                 455                 460                 


Gln Ser Pro Ser Ile Met Asp Leu Val Lys Cys Asp Gly Ala Ala Phe 
465                 470                 475                 480 


Leu Tyr His Gly Lys Tyr Tyr Pro Leu Gly Val Ala Pro Ser Glu Val 
                485                 490                 495     


Gln Ile Lys Asp Val Val Glu Trp Leu Leu Ala Asn His Ala Asp Ser 
            500                 505                 510         


Thr Gly Leu Ser Thr Asp Ser Leu Gly Asp Ala Gly Tyr Pro Gly Ala 
        515                 520                 525             


Ala Ala Leu Gly Asp Ala Val Cys Gly Met Ala Val Ala Tyr Ile Thr 
    530                 535                 540                 


Lys Arg Asp Phe Leu Phe Trp Phe Arg Ser His Thr Ala Lys Glu Ile 
545                 550                 555                 560 


Lys Trp Gly Gly Ala Lys His His Pro Glu Asp Lys Asp Asp Gly Gln 
                565                 570                 575     


Arg Met His Pro Arg Ser Ser Phe Gln Ala Phe Leu Glu Val Val Lys 
            580                 585                 590         


Ser Arg Ser Gln Pro Trp Glu Thr Ala Glu Met Asp Ala Ile His Ser 
        595                 600                 605             


Leu Gln Leu Ile Leu Arg Asp Ser Phe Lys Glu Ser Glu Ala Ala Met 
    610                 615                 620                 


Asn Ser Lys Val Val Asp Gly Val Val Gln Pro Cys Arg Asp Met Ala 
625                 630                 635                 640 


Gly Glu Gln Gly Ile Asp Glu Leu Gly Ala Val Ala Arg Glu Met Val 
                645                 650                 655     


Arg Leu Ile Glu Thr Ala Thr Val Pro Ile Phe Ala Val Asp Ala Gly 
            660                 665                 670         


Gly Cys Ile Asn Gly Trp Asn Ala Lys Ile Ala Glu Leu Thr Gly Leu 
        675                 680                 685             


Ser Val Glu Glu Ala Met Gly Lys Ser Leu Val Ser Asp Leu Ile Tyr 
    690                 695                 700                 


Lys Glu Asn Glu Ala Thr Val Asn Lys Leu Leu Ser Arg Ala Leu Arg 
705                 710                 715                 720 


Gly Asp Glu Glu Lys Asn Val Glu Val Lys Leu Lys Thr Phe Ser Pro 
                725                 730                 735     


Glu Leu Gln Gly Lys Ala Val Phe Val Val Val Asn Ala Cys Ser Ser 
            740                 745                 750         


Lys Asp Tyr Leu Asn Asn Ile Val Gly Val Cys Phe Val Gly Gln Asp 
        755                 760                 765             


Val Thr Ser Gln Lys Ile Val Met Asp Lys Phe Ile Asn Ile Gln Gly 
    770                 775                 780                 


Asp Tyr Lys Ala Ile Val His Ser Pro Asn Pro Leu Ile Pro Pro Ile 
785                 790                 795                 800 


Phe Ala Ala Asp Glu Asn Thr Cys Cys Leu Glu Trp Asn Met Ala Met 
                805                 810                 815     


Glu Lys Leu Thr Gly Trp Ser Arg Ser Glu Val Ile Gly Lys Met Ile 
            820                 825                 830         


Val Gly Glu Val Phe Gly Ser Cys Cys Met Leu Lys Gly Pro Asp Ala 
        835                 840                 845             


Leu Thr Lys Phe Met Ile Val Leu His Asn Ala Ile Gly Gly Gln Asp 
    850                 855                 860                 


Thr Asp Lys Phe Pro Phe Pro Phe Phe Asp Arg Asn Gly Lys Phe Val 
865                 870                 875                 880 


Gln Ala Leu Leu Thr Ala Asn Lys Arg Val Ser Leu Glu Gly Lys Val 
                885                 890                 895     


Ile Gly Ala Phe Cys Phe Leu Gln Ile Pro Ser Pro Glu Leu Gln Gln 
            900                 905                 910         


Ala Leu Ala Val 
        915     


<210> 37
<211> 2778
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 37
gtatctggtg ttggtggttc tggtggtgga agaggtggag gtagaggagg tgaagaagaa       60

ccatcaagta gtcatacacc taacaatcgt agaggtggtg agcaagctca atcatcaggt      120

acaaaatcat tacgtccaag aagtaatact gaatcaatgt caaaagcaat tcaacaatac      180

acagtagatg ctagattaca cgccgtattc gaacaatctg gagaaagtgg taagagtttt      240

gattactcac aatcattgaa aacaaccact tatggtagtt cagttccaga acaacaaatc      300

actgcatatc ttagtagaat acaacgtggt ggttacattc aaccatttgg ttgtatgatt      360

gcagttgatg aatcttcttt tagaatcatt ggttattcag aaaatgcaag agaaatgttg      420

ggtatcatgc cacaatcagt accaacctta gaaaaaccag aaattcttgc aatgggtaca      480

gatgttagaa gtttgtttac atcatcatca tcaattcttt tggagagagc ttttgttgca      540

cgtgaaatca ctttacttaa tccagtatgg attcatagta agaatactgg aaagccattc      600

tatgcaattc ttcatagaat agatgtagga gttgttattg atcttgagcc agcaagaaca      660

gaagatccag cattatctat tgctggtgca gtacaatcac aaaaacttgc tgttagagca      720

attagtcaat tacaagcctt gccaggtggt gatataaaac ttctttgtga tacagttgtt      780

gaatcagttc gtgatcttac cggttatgat agagttatgg tatacaaatt ccatgaggat      840

gaacatggtg aagttgttgc agaaagtaaa agagatgatc ttgaaccata cattggtttg      900

cattatccag ctactgatat tccacaagca tcaagatttc ttttcaaaca aaatcgtgtt      960

agaatgattg tagattgtaa tgccacccca gtattagttg ttcaagatga tagattgaca     1020

caaagtatgt gtttagtagg ttcaacatta agagcacctc atggatgtca ttcacaatat     1080

atggccaata tgggttcaat agcatcatta gctatggcag taatcatcaa tggaaatgaa     1140

gatgatggtt caaatgttgc atcaggtaga agttcaatgc gtttatgggg tttagtagtt     1200

tgtcatcata caagttctcg ttgtatccca tttcctttac gttatgcatg tgaatttctt     1260

atgcaagcat ttggtttaca attgaatatg gaacttcaat tagcattaca aatgagtgaa     1320

aagagagttt tacgtacaca aacattgtta tgcgatatgt tattgagaga ttctccagct     1380

ggtattgtta ctcaatcacc atctatcatg gatcttgtaa agtgtgatgg tgcagcattc     1440

ttataccacg gaaagtacta tccattaggt gttgcaccat ctgaagttca aatcaaagat     1500

gttgtagaat ggttattggc taatcacgca gattctactg gtttatcaac tgattctctt     1560

ggtgatgctg gttatcctgg tgccgcagcc ttaggagatg ctgtatgtgg tatggccgtt     1620

gcttacatta caaaaagaga tttcttgttt tggtttcgtt ctcatacagc taaagagatc     1680

aaatggggtg gtgcaaaaca tcatccagaa gataaggatg atggtcaaag aatgcatcca     1740

agatcatcat ttcaagcatt cttagaagta gttaagtcaa gaagtcaacc ttgggaaaca     1800

gcagaaatgg atgcaataca ttcattacaa ttgatacttc gtgattcatt caaagaatca     1860

gaagcagcaa tgaatagtaa agttgttgat ggtgttgttc aaccatgtag agatatggcc     1920

ggtgaacaag gtattgatga attaggtgct gtagctagag aaatggttag attgatagaa     1980

actgccactg ttccaatctt cgctgttgat gctggtggat gcataaacgg ttggaatgct     2040

aagatcgcag aattgaccgg tttgtcagtt gaagaagcta tgggtaaaag tttagtttca     2100

gatttgatct ataaggaaaa tgaagcaacc gttaacaaat tgttatcaag agcattgaga     2160

ggagatgagg aaaagaatgt agaagttaag ttaaagacat tttcaccaga gttacaaggt     2220

aaagcagttt ttgttgtagt taatgcttgt tcatcaaaag attacttgaa taacattgta     2280

ggtgtttgtt ttgttggtca agatgtaact tcacaaaaga ttgttatgga taagtttatc     2340

aatatccaag gtgattacaa agctattgtt cattctccaa atccattgat tccaccaatc     2400

tttgcagctg atgagaatac atgttgttta gaatggaata tggcaatgga aaagttaact     2460

ggttggtcac gttcagaagt aattggtaag atgattgttg gagaggtttt tggtagttgt     2520

tgtatgctta aaggtccaga tgctttaact aagtttatga ttgttttgca taatgcaatt     2580

ggtggtcaag atacagataa gttcccattc cctttcttcg atagaaatgg aaagtttgtt     2640

caagcattac ttactgctaa caaaagagta tcattagaag gtaaagtaat aggagctttt     2700

tgtttcttac aaattccttc accagaatta caacaagctc ttgcagtagg tgcttcaggt     2760

catcatcatc atcatcat                                                   2778


<210> 38
<211> 926
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 38
Val Ser Gly Val Gly Gly Ser Gly Gly Gly Arg Gly Gly Gly Arg Gly 
1               5                   10                  15      


Gly Glu Glu Glu Pro Ser Ser Ser His Thr Pro Asn Asn Arg Arg Gly 
            20                  25                  30          


Gly Glu Gln Ala Gln Ser Ser Gly Thr Lys Ser Leu Arg Pro Arg Ser 
        35                  40                  45              


Asn Thr Glu Ser Met Ser Lys Ala Ile Gln Gln Tyr Thr Val Asp Ala 
    50                  55                  60                  


Arg Leu His Ala Val Phe Glu Gln Ser Gly Glu Ser Gly Lys Ser Phe 
65                  70                  75                  80  


Asp Tyr Ser Gln Ser Leu Lys Thr Thr Thr Tyr Gly Ser Ser Val Pro 
                85                  90                  95      


Glu Gln Gln Ile Thr Ala Tyr Leu Ser Arg Ile Gln Arg Gly Gly Tyr 
            100                 105                 110         


Ile Gln Pro Phe Gly Cys Met Ile Ala Val Asp Glu Ser Ser Phe Arg 
        115                 120                 125             


Ile Ile Gly Tyr Ser Glu Asn Ala Arg Glu Met Leu Gly Ile Met Pro 
    130                 135                 140                 


Gln Ser Val Pro Thr Leu Glu Lys Pro Glu Ile Leu Ala Met Gly Thr 
145                 150                 155                 160 


Asp Val Arg Ser Leu Phe Thr Ser Ser Ser Ser Ile Leu Leu Glu Arg 
                165                 170                 175     


Ala Phe Val Ala Arg Glu Ile Thr Leu Leu Asn Pro Val Trp Ile His 
            180                 185                 190         


Ser Lys Asn Thr Gly Lys Pro Phe Tyr Ala Ile Leu His Arg Ile Asp 
        195                 200                 205             


Val Gly Val Val Ile Asp Leu Glu Pro Ala Arg Thr Glu Asp Pro Ala 
    210                 215                 220                 


Leu Ser Ile Ala Gly Ala Val Gln Ser Gln Lys Leu Ala Val Arg Ala 
225                 230                 235                 240 


Ile Ser Gln Leu Gln Ala Leu Pro Gly Gly Asp Ile Lys Leu Leu Cys 
                245                 250                 255     


Asp Thr Val Val Glu Ser Val Arg Asp Leu Thr Gly Tyr Asp Arg Val 
            260                 265                 270         


Met Val Tyr Lys Phe His Glu Asp Glu His Gly Glu Val Val Ala Glu 
        275                 280                 285             


Ser Lys Arg Asp Asp Leu Glu Pro Tyr Ile Gly Leu His Tyr Pro Ala 
    290                 295                 300                 


Thr Asp Ile Pro Gln Ala Ser Arg Phe Leu Phe Lys Gln Asn Arg Val 
305                 310                 315                 320 


Arg Met Ile Val Asp Cys Asn Ala Thr Pro Val Leu Val Val Gln Asp 
                325                 330                 335     


Asp Arg Leu Thr Gln Ser Met Cys Leu Val Gly Ser Thr Leu Arg Ala 
            340                 345                 350         


Pro His Gly Cys His Ser Gln Tyr Met Ala Asn Met Gly Ser Ile Ala 
        355                 360                 365             


Ser Leu Ala Met Ala Val Ile Ile Asn Gly Asn Glu Asp Asp Gly Ser 
    370                 375                 380                 


Asn Val Ala Ser Gly Arg Ser Ser Met Arg Leu Trp Gly Leu Val Val 
385                 390                 395                 400 


Cys His His Thr Ser Ser Arg Cys Ile Pro Phe Pro Leu Arg Tyr Ala 
                405                 410                 415     


Cys Glu Phe Leu Met Gln Ala Phe Gly Leu Gln Leu Asn Met Glu Leu 
            420                 425                 430         


Gln Leu Ala Leu Gln Met Ser Glu Lys Arg Val Leu Arg Thr Gln Thr 
        435                 440                 445             


Leu Leu Cys Asp Met Leu Leu Arg Asp Ser Pro Ala Gly Ile Val Thr 
    450                 455                 460                 


Gln Ser Pro Ser Ile Met Asp Leu Val Lys Cys Asp Gly Ala Ala Phe 
465                 470                 475                 480 


Leu Tyr His Gly Lys Tyr Tyr Pro Leu Gly Val Ala Pro Ser Glu Val 
                485                 490                 495     


Gln Ile Lys Asp Val Val Glu Trp Leu Leu Ala Asn His Ala Asp Ser 
            500                 505                 510         


Thr Gly Leu Ser Thr Asp Ser Leu Gly Asp Ala Gly Tyr Pro Gly Ala 
        515                 520                 525             


Ala Ala Leu Gly Asp Ala Val Cys Gly Met Ala Val Ala Tyr Ile Thr 
    530                 535                 540                 


Lys Arg Asp Phe Leu Phe Trp Phe Arg Ser His Thr Ala Lys Glu Ile 
545                 550                 555                 560 


Lys Trp Gly Gly Ala Lys His His Pro Glu Asp Lys Asp Asp Gly Gln 
                565                 570                 575     


Arg Met His Pro Arg Ser Ser Phe Gln Ala Phe Leu Glu Val Val Lys 
            580                 585                 590         


Ser Arg Ser Gln Pro Trp Glu Thr Ala Glu Met Asp Ala Ile His Ser 
        595                 600                 605             


Leu Gln Leu Ile Leu Arg Asp Ser Phe Lys Glu Ser Glu Ala Ala Met 
    610                 615                 620                 


Asn Ser Lys Val Val Asp Gly Val Val Gln Pro Cys Arg Asp Met Ala 
625                 630                 635                 640 


Gly Glu Gln Gly Ile Asp Glu Leu Gly Ala Val Ala Arg Glu Met Val 
                645                 650                 655     


Arg Leu Ile Glu Thr Ala Thr Val Pro Ile Phe Ala Val Asp Ala Gly 
            660                 665                 670         


Gly Cys Ile Asn Gly Trp Asn Ala Lys Ile Ala Glu Leu Thr Gly Leu 
        675                 680                 685             


Ser Val Glu Glu Ala Met Gly Lys Ser Leu Val Ser Asp Leu Ile Tyr 
    690                 695                 700                 


Lys Glu Asn Glu Ala Thr Val Asn Lys Leu Leu Ser Arg Ala Leu Arg 
705                 710                 715                 720 


Gly Asp Glu Glu Lys Asn Val Glu Val Lys Leu Lys Thr Phe Ser Pro 
                725                 730                 735     


Glu Leu Gln Gly Lys Ala Val Phe Val Val Val Asn Ala Cys Ser Ser 
            740                 745                 750         


Lys Asp Tyr Leu Asn Asn Ile Val Gly Val Cys Phe Val Gly Gln Asp 
        755                 760                 765             


Val Thr Ser Gln Lys Ile Val Met Asp Lys Phe Ile Asn Ile Gln Gly 
    770                 775                 780                 


Asp Tyr Lys Ala Ile Val His Ser Pro Asn Pro Leu Ile Pro Pro Ile 
785                 790                 795                 800 


Phe Ala Ala Asp Glu Asn Thr Cys Cys Leu Glu Trp Asn Met Ala Met 
                805                 810                 815     


Glu Lys Leu Thr Gly Trp Ser Arg Ser Glu Val Ile Gly Lys Met Ile 
            820                 825                 830         


Val Gly Glu Val Phe Gly Ser Cys Cys Met Leu Lys Gly Pro Asp Ala 
        835                 840                 845             


Leu Thr Lys Phe Met Ile Val Leu His Asn Ala Ile Gly Gly Gln Asp 
    850                 855                 860                 


Thr Asp Lys Phe Pro Phe Pro Phe Phe Asp Arg Asn Gly Lys Phe Val 
865                 870                 875                 880 


Gln Ala Leu Leu Thr Ala Asn Lys Arg Val Ser Leu Glu Gly Lys Val 
                885                 890                 895     


Ile Gly Ala Phe Cys Phe Leu Gln Ile Pro Ser Pro Glu Leu Gln Gln 
            900                 905                 910         


Ala Leu Ala Val Gly Ala Ser Gly His His His His His His 
        915                 920                 925     


<210> 39
<211> 2772
<212> DNA
<213> Arabidopsis thaliana

<400> 39
atgggtgctt caggtgtatc tggtgttggt ggttctggtg gtggaagagg tggaggtaga       60

ggaggtgaag aagaaccatc aagtagtcat acacctaaca atcgtagagg tggtgagcaa      120

gctcaatcat caggtacaaa atcattacgt ccaagaagta atactgaatc aatgtcaaaa      180

gcaattcaac aatacacagt agatgctaga ttacacgccg tattcgaaca atctggagaa      240

agtggtaaga gttttgatta ctcacaatca ttgaaaacaa ccacttatgg tagttcagtt      300

ccagaacaac aaatcactgc atatcttagt agaatacaac gtggtggtta cattcaacca      360

tttggttgta tgattgcagt tgatgaatct tcttttagaa tcattggtta ttcagaaaat      420

gcaagagaaa tgttgggtat catgccacaa tcagtaccaa ccttagaaaa accagaaatt      480

cttgcaatgg gtacagatgt tagaagtttg tttacatcat catcatcaat tcttttggag      540

agagcttttg ttgcacgtga aatcacttta cttaatccag tatggattca tagtaagaat      600

actggaaagc cattctatgc aattcttcat agaatagatg taggagttgt tattgatctt      660

gagccagcaa gaacagaaga tccagcatta tctattgctg gtgcagtaca atcacaaaaa      720

cttgctgtta gagcaattag tcaattacaa gccttgccag gtggtgatat aaaacttctt      780

tgtgatacag ttgttgaatc agttcgtgat cttaccggtt atgatagagt tatggtatac      840

aaattccatg aggatgaaca tggtgaagtt gttgcagaaa gtaaaagaga tgatcttgaa      900

ccatacattg gtttgcatta tccagctact gatattccac aagcatcaag atttcttttc      960

aaacaaaatc gtgttagaat gattgtagat tgtaatgcca ccccagtatt agttgttcaa     1020

gatgatagat tgacacaaag tatgtgttta gtaggttcaa cattaagagc acctcatgga     1080

tgtcattcac aatatatggc caatatgggt tcaatagcat cattagctat ggcagtaatc     1140

atcaatggaa atgaagatga tggttcaaat gttgcatcag gtagaagttc aatgcgttta     1200

tggggtttag tagtttgtca tcatacaagt tctcgttgta tcccatttcc tttacgttat     1260

gcatgtgaat ttcttatgca agcatttggt ttacaattga atatggaact tcaattagca     1320

ttacaaatga gtgaaaagag agttttacgt acacaaacat tgttatgcga tatgttattg     1380

agagattctc cagctggtat tgttactcaa tcaccatcta tcatggatct tgtaaagtgt     1440

gatggtgcag cattcttata ccacggaaag tactatccat taggtgttgc accatctgaa     1500

gttcaaatca aagatgttgt agaatggtta ttggctaatc acgcagattc tactggttta     1560

tcaactgatt ctcttggtga tgctggttat cctggtgccg cagccttagg agatgctgta     1620

tgtggtatgg ccgttgctta cattacaaaa agagatttct tgttttggtt tcgttctcat     1680

acagctaaag agatcaaatg gggtggtgca aaacatcatc cagaagataa ggatgatggt     1740

caaagaatgc atccaagatc atcatttcaa gcattcttag aagtagttaa gtcaagaagt     1800

caaccttggg aaacagcaga aatggatgca atacattcat tacaattgat acttcgtgat     1860

tcattcaaag aatcagaagc agcaatgaat agtaaagttg ttgatggtgt tgttcaacca     1920

tgtagagata tggccggtga acaaggtatt gatgaattag gtgctgtagc tagagaaatg     1980

gttagattga tagaaactgc cactgttcca atcttcgctg ttgatgctgg tggatgcata     2040

aacggttgga atgctaagat cgcagaattg accggtttgt cagttgaaga agctatgggt     2100

aaaagtttag tttcagattt gatctataag gaaaatgaag caaccgttaa caaattgtta     2160

tcaagagcat tgagaggaga tgaggaaaag aatgtagaag ttaagttaaa gacattttca     2220

ccagagttac aaggtaaagc agtttttgtt gtagttaatg cttgttcatc aaaagattac     2280

ttgaataaca ttgtaggtgt ttgttttgtt ggtcaagatg taacttcaca aaagattgtt     2340

atggataagt ttatcaatat ccaaggtgat tacaaagcta ttgttcattc tccaaatcca     2400

ttgattccac caatctttgc agctgatgag aatacatgtt gtttagaatg gaatatggca     2460

atggaaaagt taactggttg gtcacgttca gaagtaattg gtaagatgat tgttggagag     2520

gtttttggta gttgttgtat gcttaaaggt ccagatgctt taactaagtt tatgattgtt     2580

ttgcataatg caattggtgg tcaagataca gataagttcc cattcccttt cttcgataga     2640

aatggaaagt ttgttcaagc attacttact gctaacaaaa gagtatcatt agaaggtaaa     2700

gtaataggag ctttttgttt cttacaaatt ccttcaccag aattacaaca agctcttgca     2760

gtaggtggta gt                                                         2772


<210> 40
<211> 924
<212> PRT
<213> Arabidopsis thaliana

<400> 40
Met Gly Ala Ser Gly Val Ser Gly Val Gly Gly Ser Gly Gly Gly Arg 
1               5                   10                  15      


Gly Gly Gly Arg Gly Gly Glu Glu Glu Pro Ser Ser Ser His Thr Pro 
            20                  25                  30          


Asn Asn Arg Arg Gly Gly Glu Gln Ala Gln Ser Ser Gly Thr Lys Ser 
        35                  40                  45              


Leu Arg Pro Arg Ser Asn Thr Glu Ser Met Ser Lys Ala Ile Gln Gln 
    50                  55                  60                  


Tyr Thr Val Asp Ala Arg Leu His Ala Val Phe Glu Gln Ser Gly Glu 
65                  70                  75                  80  


Ser Gly Lys Ser Phe Asp Tyr Ser Gln Ser Leu Lys Thr Thr Thr Tyr 
                85                  90                  95      


Gly Ser Ser Val Pro Glu Gln Gln Ile Thr Ala Tyr Leu Ser Arg Ile 
            100                 105                 110         


Gln Arg Gly Gly Tyr Ile Gln Pro Phe Gly Cys Met Ile Ala Val Asp 
        115                 120                 125             


Glu Ser Ser Phe Arg Ile Ile Gly Tyr Ser Glu Asn Ala Arg Glu Met 
    130                 135                 140                 


Leu Gly Ile Met Pro Gln Ser Val Pro Thr Leu Glu Lys Pro Glu Ile 
145                 150                 155                 160 


Leu Ala Met Gly Thr Asp Val Arg Ser Leu Phe Thr Ser Ser Ser Ser 
                165                 170                 175     


Ile Leu Leu Glu Arg Ala Phe Val Ala Arg Glu Ile Thr Leu Leu Asn 
            180                 185                 190         


Pro Val Trp Ile His Ser Lys Asn Thr Gly Lys Pro Phe Tyr Ala Ile 
        195                 200                 205             


Leu His Arg Ile Asp Val Gly Val Val Ile Asp Leu Glu Pro Ala Arg 
    210                 215                 220                 


Thr Glu Asp Pro Ala Leu Ser Ile Ala Gly Ala Val Gln Ser Gln Lys 
225                 230                 235                 240 


Leu Ala Val Arg Ala Ile Ser Gln Leu Gln Ala Leu Pro Gly Gly Asp 
                245                 250                 255     


Ile Lys Leu Leu Cys Asp Thr Val Val Glu Ser Val Arg Asp Leu Thr 
            260                 265                 270         


Gly Tyr Asp Arg Val Met Val Tyr Lys Phe His Glu Asp Glu His Gly 
        275                 280                 285             


Glu Val Val Ala Glu Ser Lys Arg Asp Asp Leu Glu Pro Tyr Ile Gly 
    290                 295                 300                 


Leu His Tyr Pro Ala Thr Asp Ile Pro Gln Ala Ser Arg Phe Leu Phe 
305                 310                 315                 320 


Lys Gln Asn Arg Val Arg Met Ile Val Asp Cys Asn Ala Thr Pro Val 
                325                 330                 335     


Leu Val Val Gln Asp Asp Arg Leu Thr Gln Ser Met Cys Leu Val Gly 
            340                 345                 350         


Ser Thr Leu Arg Ala Pro His Gly Cys His Ser Gln Tyr Met Ala Asn 
        355                 360                 365             


Met Gly Ser Ile Ala Ser Leu Ala Met Ala Val Ile Ile Asn Gly Asn 
    370                 375                 380                 


Glu Asp Asp Gly Ser Asn Val Ala Ser Gly Arg Ser Ser Met Arg Leu 
385                 390                 395                 400 


Trp Gly Leu Val Val Cys His His Thr Ser Ser Arg Cys Ile Pro Phe 
                405                 410                 415     


Pro Leu Arg Tyr Ala Cys Glu Phe Leu Met Gln Ala Phe Gly Leu Gln 
            420                 425                 430         


Leu Asn Met Glu Leu Gln Leu Ala Leu Gln Met Ser Glu Lys Arg Val 
        435                 440                 445             


Leu Arg Thr Gln Thr Leu Leu Cys Asp Met Leu Leu Arg Asp Ser Pro 
    450                 455                 460                 


Ala Gly Ile Val Thr Gln Ser Pro Ser Ile Met Asp Leu Val Lys Cys 
465                 470                 475                 480 


Asp Gly Ala Ala Phe Leu Tyr His Gly Lys Tyr Tyr Pro Leu Gly Val 
                485                 490                 495     


Ala Pro Ser Glu Val Gln Ile Lys Asp Val Val Glu Trp Leu Leu Ala 
            500                 505                 510         


Asn His Ala Asp Ser Thr Gly Leu Ser Thr Asp Ser Leu Gly Asp Ala 
        515                 520                 525             


Gly Tyr Pro Gly Ala Ala Ala Leu Gly Asp Ala Val Cys Gly Met Ala 
    530                 535                 540                 


Val Ala Tyr Ile Thr Lys Arg Asp Phe Leu Phe Trp Phe Arg Ser His 
545                 550                 555                 560 


Thr Ala Lys Glu Ile Lys Trp Gly Gly Ala Lys His His Pro Glu Asp 
                565                 570                 575     


Lys Asp Asp Gly Gln Arg Met His Pro Arg Ser Ser Phe Gln Ala Phe 
            580                 585                 590         


Leu Glu Val Val Lys Ser Arg Ser Gln Pro Trp Glu Thr Ala Glu Met 
        595                 600                 605             


Asp Ala Ile His Ser Leu Gln Leu Ile Leu Arg Asp Ser Phe Lys Glu 
    610                 615                 620                 


Ser Glu Ala Ala Met Asn Ser Lys Val Val Asp Gly Val Val Gln Pro 
625                 630                 635                 640 


Cys Arg Asp Met Ala Gly Glu Gln Gly Ile Asp Glu Leu Gly Ala Val 
                645                 650                 655     


Ala Arg Glu Met Val Arg Leu Ile Glu Thr Ala Thr Val Pro Ile Phe 
            660                 665                 670         


Ala Val Asp Ala Gly Gly Cys Ile Asn Gly Trp Asn Ala Lys Ile Ala 
        675                 680                 685             


Glu Leu Thr Gly Leu Ser Val Glu Glu Ala Met Gly Lys Ser Leu Val 
    690                 695                 700                 


Ser Asp Leu Ile Tyr Lys Glu Asn Glu Ala Thr Val Asn Lys Leu Leu 
705                 710                 715                 720 


Ser Arg Ala Leu Arg Gly Asp Glu Glu Lys Asn Val Glu Val Lys Leu 
                725                 730                 735     


Lys Thr Phe Ser Pro Glu Leu Gln Gly Lys Ala Val Phe Val Val Val 
            740                 745                 750         


Asn Ala Cys Ser Ser Lys Asp Tyr Leu Asn Asn Ile Val Gly Val Cys 
        755                 760                 765             


Phe Val Gly Gln Asp Val Thr Ser Gln Lys Ile Val Met Asp Lys Phe 
    770                 775                 780                 


Ile Asn Ile Gln Gly Asp Tyr Lys Ala Ile Val His Ser Pro Asn Pro 
785                 790                 795                 800 


Leu Ile Pro Pro Ile Phe Ala Ala Asp Glu Asn Thr Cys Cys Leu Glu 
                805                 810                 815     


Trp Asn Met Ala Met Glu Lys Leu Thr Gly Trp Ser Arg Ser Glu Val 
            820                 825                 830         


Ile Gly Lys Met Ile Val Gly Glu Val Phe Gly Ser Cys Cys Met Leu 
        835                 840                 845             


Lys Gly Pro Asp Ala Leu Thr Lys Phe Met Ile Val Leu His Asn Ala 
    850                 855                 860                 


Ile Gly Gly Gln Asp Thr Asp Lys Phe Pro Phe Pro Phe Phe Asp Arg 
865                 870                 875                 880 


Asn Gly Lys Phe Val Gln Ala Leu Leu Thr Ala Asn Lys Arg Val Ser 
                885                 890                 895     


Leu Glu Gly Lys Val Ile Gly Ala Phe Cys Phe Leu Gln Ile Pro Ser 
            900                 905                 910         


Pro Glu Leu Gln Gln Ala Leu Ala Val Gly Gly Ser 
        915                 920                 


<210> 41
<211> 2793
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 41
atgggtgctt caggtgtatc tggtgttggt ggttctggtg gtggaagagg tggaggtaga       60

ggaggtgaag aagaaccatc aagtagtcat acacctaaca atcgtagagg tggtgagcaa      120

gctcaatcat caggtacaaa atcattacgt ccaagaagta atactgaatc aatgtcaaaa      180

gcaattcaac aatacacagt agatgctaga ttacacgccg tattcgaaca atctggagaa      240

agtggtaaga gttttgatta ctcacaatca ttgaaaacaa ccacttatgg tagttcagtt      300

ccagaacaac aaatcactgc atatcttagt agaatacaac gtggtggtta cattcaacca      360

tttggttgta tgattgcagt tgatgaatct tcttttagaa tcattggtta ttcagaaaat      420

gcaagagaaa tgttgggtat catgccacaa tcagtaccaa ccttagaaaa accagaaatt      480

cttgcaatgg gtacagatgt tagaagtttg tttacatcat catcatcaat tcttttggag      540

agagcttttg ttgcacgtga aatcacttta cttaatccag tatggattca tagtaagaat      600

actggaaagc cattctatgc aattcttcat agaatagatg taggagttgt tattgatctt      660

gagccagcaa gaacagaaga tccagcatta tctattgctg gtgcagtaca atcacaaaaa      720

cttgctgtta gagcaattag tcaattacaa gccttgccag gtggtgatat aaaacttctt      780

tgtgatacag ttgttgaatc agttcgtgat cttaccggtt atgatagagt tatggtatac      840

aaattccatg aggatgaaca tggtgaagtt gttgcagaaa gtaaaagaga tgatcttgaa      900

ccatacattg gtttgcatta tccagctact gatattccac aagcatcaag atttcttttc      960

aaacaaaatc gtgttagaat gattgtagat tgtaatgcca ccccagtatt agttgttcaa     1020

gatgatagat tgacacaaag tatgtgttta gtaggttcaa cattaagagc acctcatgga     1080

tgtcattcac aatatatggc caatatgggt tcaatagcat cattagctat ggcagtaatc     1140

atcaatggaa atgaagatga tggttcaaat gttgcatcag gtagaagttc aatgcgttta     1200

tggggtttag tagtttgtca tcatacaagt tctcgttgta tcccatttcc tttacgttat     1260

gcatgtgaat ttcttatgca agcatttggt ttacaattga atatggaact tcaattagca     1320

ttacaaatga gtgaaaagag agttttacgt acacaaacat tgttatgcga tatgttattg     1380

agagattctc cagctggtat tgttactcaa tcaccatcta tcatggatct tgtaaagtgt     1440

gatggtgcag cattcttata ccacggaaag tactatccat taggtgttgc accatctgaa     1500

gttcaaatca aagatgttgt agaatggtta ttggctaatc acgcagattc tactggttta     1560

tcaactgatt ctcttggtga tgctggttat cctggtgccg cagccttagg agatgctgta     1620

tgtggtatgg ccgttgctta cattacaaaa agagatttct tgttttggtt tcgttctcat     1680

acagctaaag agatcaaatg gggtggtgca aaacatcatc cagaagataa ggatgatggt     1740

caaagaatgc atccaagatc atcatttcaa gcattcttag aagtagttaa gtcaagaagt     1800

caaccttggg aaacagcaga aatggatgca atacattcat tacaattgat acttcgtgat     1860

tcattcaaag aatcagaagc agcaatgaat agtaaagttg ttgatggtgt tgttcaacca     1920

tgtagagata tggccggtga acaaggtatt gatgaattag gtgctgtagc tagagaaatg     1980

gttagattga tagaaactgc cactgttcca atcttcgctg ttgatgctgg tggatgcata     2040

aacggttgga atgctaagat cgcagaattg accggtttgt cagttgaaga agctatgggt     2100

aaaagtttag tttcagattt gatctataag gaaaatgaag caaccgttaa caaattgtta     2160

tcaagagcat tgagaggaga tgaggaaaag aatgtagaag ttaagttaaa gacattttca     2220

ccagagttac aaggtaaagc agtttttgtt gtagttaatg cttgttcatc aaaagattac     2280

ttgaataaca ttgtaggtgt ttgttttgtt ggtcaagatg taacttcaca aaagattgtt     2340

atggataagt ttatcaatat ccaaggtgat tacaaagcta ttgttcattc tccaaatcca     2400

ttgattccac caatctttgc agctgatgag aatacatgtt gtttagaatg gaatatggca     2460

atggaaaagt taactggttg gtcacgttca gaagtaattg gtaagatgat tgttggagag     2520

gtttttggta gttgttgtat gcttaaaggt ccagatgctt taactaagtt tatgattgtt     2580

ttgcataatg caattggtgg tcaagataca gataagttcc cattcccttt cttcgataga     2640

aatggaaagt ttgttcaagc attacttact gctaacaaaa gagtatcatt agaaggtaaa     2700

gtaataggag ctttttgttt cttacaaatt ccttcaccag aattacaaca agctcttgca     2760

gtaggtggta gtcatcatca tcatcatcat taa                                  2793


<210> 42
<211> 930
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 42
Met Gly Ala Ser Gly Val Ser Gly Val Gly Gly Ser Gly Gly Gly Arg 
1               5                   10                  15      


Gly Gly Gly Arg Gly Gly Glu Glu Glu Pro Ser Ser Ser His Thr Pro 
            20                  25                  30          


Asn Asn Arg Arg Gly Gly Glu Gln Ala Gln Ser Ser Gly Thr Lys Ser 
        35                  40                  45              


Leu Arg Pro Arg Ser Asn Thr Glu Ser Met Ser Lys Ala Ile Gln Gln 
    50                  55                  60                  


Tyr Thr Val Asp Ala Arg Leu His Ala Val Phe Glu Gln Ser Gly Glu 
65                  70                  75                  80  


Ser Gly Lys Ser Phe Asp Tyr Ser Gln Ser Leu Lys Thr Thr Thr Tyr 
                85                  90                  95      


Gly Ser Ser Val Pro Glu Gln Gln Ile Thr Ala Tyr Leu Ser Arg Ile 
            100                 105                 110         


Gln Arg Gly Gly Tyr Ile Gln Pro Phe Gly Cys Met Ile Ala Val Asp 
        115                 120                 125             


Glu Ser Ser Phe Arg Ile Ile Gly Tyr Ser Glu Asn Ala Arg Glu Met 
    130                 135                 140                 


Leu Gly Ile Met Pro Gln Ser Val Pro Thr Leu Glu Lys Pro Glu Ile 
145                 150                 155                 160 


Leu Ala Met Gly Thr Asp Val Arg Ser Leu Phe Thr Ser Ser Ser Ser 
                165                 170                 175     


Ile Leu Leu Glu Arg Ala Phe Val Ala Arg Glu Ile Thr Leu Leu Asn 
            180                 185                 190         


Pro Val Trp Ile His Ser Lys Asn Thr Gly Lys Pro Phe Tyr Ala Ile 
        195                 200                 205             


Leu His Arg Ile Asp Val Gly Val Val Ile Asp Leu Glu Pro Ala Arg 
    210                 215                 220                 


Thr Glu Asp Pro Ala Leu Ser Ile Ala Gly Ala Val Gln Ser Gln Lys 
225                 230                 235                 240 


Leu Ala Val Arg Ala Ile Ser Gln Leu Gln Ala Leu Pro Gly Gly Asp 
                245                 250                 255     


Ile Lys Leu Leu Cys Asp Thr Val Val Glu Ser Val Arg Asp Leu Thr 
            260                 265                 270         


Gly Tyr Asp Arg Val Met Val Tyr Lys Phe His Glu Asp Glu His Gly 
        275                 280                 285             


Glu Val Val Ala Glu Ser Lys Arg Asp Asp Leu Glu Pro Tyr Ile Gly 
    290                 295                 300                 


Leu His Tyr Pro Ala Thr Asp Ile Pro Gln Ala Ser Arg Phe Leu Phe 
305                 310                 315                 320 


Lys Gln Asn Arg Val Arg Met Ile Val Asp Cys Asn Ala Thr Pro Val 
                325                 330                 335     


Leu Val Val Gln Asp Asp Arg Leu Thr Gln Ser Met Cys Leu Val Gly 
            340                 345                 350         


Ser Thr Leu Arg Ala Pro His Gly Cys His Ser Gln Tyr Met Ala Asn 
        355                 360                 365             


Met Gly Ser Ile Ala Ser Leu Ala Met Ala Val Ile Ile Asn Gly Asn 
    370                 375                 380                 


Glu Asp Asp Gly Ser Asn Val Ala Ser Gly Arg Ser Ser Met Arg Leu 
385                 390                 395                 400 


Trp Gly Leu Val Val Cys His His Thr Ser Ser Arg Cys Ile Pro Phe 
                405                 410                 415     


Pro Leu Arg Tyr Ala Cys Glu Phe Leu Met Gln Ala Phe Gly Leu Gln 
            420                 425                 430         


Leu Asn Met Glu Leu Gln Leu Ala Leu Gln Met Ser Glu Lys Arg Val 
        435                 440                 445             


Leu Arg Thr Gln Thr Leu Leu Cys Asp Met Leu Leu Arg Asp Ser Pro 
    450                 455                 460                 


Ala Gly Ile Val Thr Gln Ser Pro Ser Ile Met Asp Leu Val Lys Cys 
465                 470                 475                 480 


Asp Gly Ala Ala Phe Leu Tyr His Gly Lys Tyr Tyr Pro Leu Gly Val 
                485                 490                 495     


Ala Pro Ser Glu Val Gln Ile Lys Asp Val Val Glu Trp Leu Leu Ala 
            500                 505                 510         


Asn His Ala Asp Ser Thr Gly Leu Ser Thr Asp Ser Leu Gly Asp Ala 
        515                 520                 525             


Gly Tyr Pro Gly Ala Ala Ala Leu Gly Asp Ala Val Cys Gly Met Ala 
    530                 535                 540                 


Val Ala Tyr Ile Thr Lys Arg Asp Phe Leu Phe Trp Phe Arg Ser His 
545                 550                 555                 560 


Thr Ala Lys Glu Ile Lys Trp Gly Gly Ala Lys His His Pro Glu Asp 
                565                 570                 575     


Lys Asp Asp Gly Gln Arg Met His Pro Arg Ser Ser Phe Gln Ala Phe 
            580                 585                 590         


Leu Glu Val Val Lys Ser Arg Ser Gln Pro Trp Glu Thr Ala Glu Met 
        595                 600                 605             


Asp Ala Ile His Ser Leu Gln Leu Ile Leu Arg Asp Ser Phe Lys Glu 
    610                 615                 620                 


Ser Glu Ala Ala Met Asn Ser Lys Val Val Asp Gly Val Val Gln Pro 
625                 630                 635                 640 


Cys Arg Asp Met Ala Gly Glu Gln Gly Ile Asp Glu Leu Gly Ala Val 
                645                 650                 655     


Ala Arg Glu Met Val Arg Leu Ile Glu Thr Ala Thr Val Pro Ile Phe 
            660                 665                 670         


Ala Val Asp Ala Gly Gly Cys Ile Asn Gly Trp Asn Ala Lys Ile Ala 
        675                 680                 685             


Glu Leu Thr Gly Leu Ser Val Glu Glu Ala Met Gly Lys Ser Leu Val 
    690                 695                 700                 


Ser Asp Leu Ile Tyr Lys Glu Asn Glu Ala Thr Val Asn Lys Leu Leu 
705                 710                 715                 720 


Ser Arg Ala Leu Arg Gly Asp Glu Glu Lys Asn Val Glu Val Lys Leu 
                725                 730                 735     


Lys Thr Phe Ser Pro Glu Leu Gln Gly Lys Ala Val Phe Val Val Val 
            740                 745                 750         


Asn Ala Cys Ser Ser Lys Asp Tyr Leu Asn Asn Ile Val Gly Val Cys 
        755                 760                 765             


Phe Val Gly Gln Asp Val Thr Ser Gln Lys Ile Val Met Asp Lys Phe 
    770                 775                 780                 


Ile Asn Ile Gln Gly Asp Tyr Lys Ala Ile Val His Ser Pro Asn Pro 
785                 790                 795                 800 


Leu Ile Pro Pro Ile Phe Ala Ala Asp Glu Asn Thr Cys Cys Leu Glu 
                805                 810                 815     


Trp Asn Met Ala Met Glu Lys Leu Thr Gly Trp Ser Arg Ser Glu Val 
            820                 825                 830         


Ile Gly Lys Met Ile Val Gly Glu Val Phe Gly Ser Cys Cys Met Leu 
        835                 840                 845             


Lys Gly Pro Asp Ala Leu Thr Lys Phe Met Ile Val Leu His Asn Ala 
    850                 855                 860                 


Ile Gly Gly Gln Asp Thr Asp Lys Phe Pro Phe Pro Phe Phe Asp Arg 
865                 870                 875                 880 


Asn Gly Lys Phe Val Gln Ala Leu Leu Thr Ala Asn Lys Arg Val Ser 
                885                 890                 895     


Leu Glu Gly Lys Val Ile Gly Ala Phe Cys Phe Leu Gln Ile Pro Ser 
            900                 905                 910         


Pro Glu Leu Gln Gln Ala Leu Ala Val Gly Gly Ser His His His His 
        915                 920                 925             


His His 
    930 


<210> 43
<211> 2520
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 43
atggctgccg atggttatct tccagattgg ctcgaggaca ctctctctga aggaataaga       60

cagtggtgga agctcaaacc tggcccacca ccaccaaagc ccgcagagcg gcataaggac      120

gacagcaggg gtcttgtgct tcctgggtac aagtacctcg gacccttcaa cggactcgac      180

aagggagagc cggtcaacga ggcagacgcc gcggccctcg agcacgacaa agcctacgac      240

cggcagctcg acagcggaga caacccgtac ctcaagtaca accacgccga cgcggagttt      300

caggagcgcc ttaaagaaga tacgtctttt gggggcaacc tcggacgagc agtcttccag      360

gcgaaaaaga gggttcttga acctctgggc ctggttgagg aacctgttaa gatggccggc      420

atgatgttcc ttcctactga ttattgttgc agactgagcg accaggaata catggaactc      480

gtcttcgaga acggacagat actcgcaaaa ggccagaggt caaatgttag tctccataat      540

cagcggacga aaagcatcat ggatctgtat gaggccgaat acaacgaaga ttttatgaaa      600

agtattatcc atggaggggg tggcgctatt accaacctgg gagataccca agtggtccca      660

cagtcccacg tagcagccgc tcacgagacc aatatgctgg agtccaacaa acacgtagac      720

ggcgccgctc cgggaaaaaa gaggccggta gagcactctc ctgtggagcc agactcctcc      780

tcgggaaccg gaaaggcggg ccagcagcct gcaagaaaaa gattgaattt tggtcagact      840

ggagacgcag actcagtacc tgacccccag cctctcggac agccaccagc agccccctct      900

ggtctgggaa ctaatacgct ggctacaggc agtggcgcac cactggcaga caataacgag      960

ggcgccgacg gagtgggtaa ttcctcggga aattggcatt gcgattccac atggctgggc     1020

gacagagtca tcaccaccag cacccgaacc tgggccctgc ccacctacaa caaccacctc     1080

tacaaacaaa tttccagcca atcaggagcc tcgaacgaca atcactactt tggctacagc     1140

accccttggg ggtattttga cttcaacaga ttccactgcc acttttcacc acgtgactgg     1200

caaagactca tcaacaacaa ctggggattc cgacccaaga gactcaactt caagctcttt     1260

aacattcaag tcaaagaggt cacgcagaat gacggtacga cgacgattgc caataacctt     1320

accagcacgg ttcaggtgtt tactgactcg gagtaccagc tcccgtacgt cctcggctcg     1380

gcgcatcaag gatgcctccc gccgttccca gcagacgtct tcatggtgcc acagtatgga     1440

tacctcaccc tgaacaacgg gagtcaggca gtaggacgct cttcatttta ctgcctggag     1500

tactttcctt ctcagatgct gcgtaccgga aacaacttta ccttcagcta cacttttgag     1560

gacgttcctt tccacagcag ctacgctcac agccagagtc tggaccgtct catgaatcct     1620

ctcatcgacc agtacctgta ttacttgagc agaacaaaca ctccaagtgg aaccaccacg     1680

cagtcaaggc ttcagttttc tcaggccgga gcgagtgaca ttcgggacca gtctaggaac     1740

tggcttcctg gaccctgtta ccgccagcag cgagtatcaa agacatctgc ggataacaac     1800

aacagtgaat actcgtggac tggagctacc aagtaccacc tcaatggcag agactctctg     1860

gtgaatccgg gcccggccat ggcaagccac aaggacgatg aagaaaagtt ttttcctcag     1920

agcggggttc tcatctttgg gaagcaaggc tcagagaaaa caaatgtgga cattgaaaag     1980

gtcatgatta cagacgaaga ggaaatcagg acaaccaatc ccgtggctac ggagcagtat     2040

ggttctgtat ctaccaacct ccagagaggc aacagacaag cagctaccgc agatgtcaac     2100

acacaaggcg ttcttccagg catggtctgg caggacagag atgtgtacct tcaggggccc     2160

atctgggcaa agattccaca cacggacgga cattttcacc cctctcccct catgggtgga     2220

ttcggactta aacaccctcc tccacagatt ctcatcaaga acaccccggt acctgcgaat     2280

ccttcgacca ccttcagtgc ggcaaagttt gcttccttca tcacacagta ctccacggga     2340

caggtcagcg tggagatcga gtgggagctg cagaaggaaa acagcaaacg ctggaatccc     2400

gaaattcagt acacttccaa ctacaacaag tctgttaatg tggactttac tgtggacact     2460

aatggcgtgt attcagagcc tcgccccatt ggcaccagat acctgactcg taatctgtaa     2520


<210> 44
<211> 839
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 44
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
            20                  25                  30          


Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Arg Gln Leu Asp Ser Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Pro Val Lys Met Ala Gly Met Met Phe Leu 
    130                 135                 140                 


Pro Thr Asp Tyr Cys Cys Arg Leu Ser Asp Gln Glu Tyr Met Glu Leu 
145                 150                 155                 160 


Val Phe Glu Asn Gly Gln Ile Leu Ala Lys Gly Gln Arg Ser Asn Val 
                165                 170                 175     


Ser Leu His Asn Gln Arg Thr Lys Ser Ile Met Asp Leu Tyr Glu Ala 
            180                 185                 190         


Glu Tyr Asn Glu Asp Phe Met Lys Ser Ile Ile His Gly Gly Gly Gly 
        195                 200                 205             


Ala Ile Thr Asn Leu Gly Asp Thr Gln Val Val Pro Gln Ser His Val 
    210                 215                 220                 


Ala Ala Ala His Glu Thr Asn Met Leu Glu Ser Asn Lys His Val Asp 
225                 230                 235                 240 


Gly Ala Ala Pro Gly Lys Lys Arg Pro Val Glu His Ser Pro Val Glu 
                245                 250                 255     


Pro Asp Ser Ser Ser Gly Thr Gly Lys Ala Gly Gln Gln Pro Ala Arg 
            260                 265                 270         


Lys Arg Leu Asn Phe Gly Gln Thr Gly Asp Ala Asp Ser Val Pro Asp 
        275                 280                 285             


Pro Gln Pro Leu Gly Gln Pro Pro Ala Ala Pro Ser Gly Leu Gly Thr 
    290                 295                 300                 


Asn Thr Leu Ala Thr Gly Ser Gly Ala Pro Leu Ala Asp Asn Asn Glu 
305                 310                 315                 320 


Gly Ala Asp Gly Val Gly Asn Ser Ser Gly Asn Trp His Cys Asp Ser 
                325                 330                 335     


Thr Trp Leu Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala 
            340                 345                 350         


Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Gln Ser 
        355                 360                 365             


Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 
    370                 375                 380                 


Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 
385                 390                 395                 400 


Gln Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
                405                 410                 415     


Phe Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Gln Asn Asp Gly 
            420                 425                 430         


Thr Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr 
        435                 440                 445             


Asp Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly 
    450                 455                 460                 


Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met Val Pro Gln Tyr Gly 
465                 470                 475                 480 


Tyr Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe 
                485                 490                 495     


Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn 
            500                 505                 510         


Phe Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr 
        515                 520                 525             


Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln 
    530                 535                 540                 


Tyr Leu Tyr Tyr Leu Ser Arg Thr Asn Thr Pro Ser Gly Thr Thr Thr 
545                 550                 555                 560 


Gln Ser Arg Leu Gln Phe Ser Gln Ala Gly Ala Ser Asp Ile Arg Asp 
                565                 570                 575     


Gln Ser Arg Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val 
            580                 585                 590         


Ser Lys Thr Ser Ala Asp Asn Asn Asn Ser Glu Tyr Ser Trp Thr Gly 
        595                 600                 605             


Ala Thr Lys Tyr His Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly 
    610                 615                 620                 


Pro Ala Met Ala Ser His Lys Asp Asp Glu Glu Lys Phe Phe Pro Gln 
625                 630                 635                 640 


Ser Gly Val Leu Ile Phe Gly Lys Gln Gly Ser Glu Lys Thr Asn Val 
                645                 650                 655     


Asp Ile Glu Lys Val Met Ile Thr Asp Glu Glu Glu Ile Arg Thr Thr 
            660                 665                 670         


Asn Pro Val Ala Thr Glu Gln Tyr Gly Ser Val Ser Thr Asn Leu Gln 
        675                 680                 685             


Arg Gly Asn Arg Gln Ala Ala Thr Ala Asp Val Asn Thr Gln Gly Val 
    690                 695                 700                 


Leu Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro 
705                 710                 715                 720 


Ile Trp Ala Lys Ile Pro His Thr Asp Gly His Phe His Pro Ser Pro 
                725                 730                 735     


Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile 
            740                 745                 750         


Lys Asn Thr Pro Val Pro Ala Asn Pro Ser Thr Thr Phe Ser Ala Ala 
        755                 760                 765             


Lys Phe Ala Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val 
    770                 775                 780                 


Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro 
785                 790                 795                 800 


Glu Ile Gln Tyr Thr Ser Asn Tyr Asn Lys Ser Val Asn Val Asp Phe 
                805                 810                 815     


Thr Val Asp Thr Asn Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr 
            820                 825                 830         


Arg Tyr Leu Thr Arg Asn Leu 
        835                 


<210> 45
<211> 2109
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 45
atggccggca tgatgttcct tcctactgat tattgttgca gactgagcga ccaggaatac       60

atggaactcg tcttcgagaa cggacagata ctcgcaaaag gccagaggtc aaatgttagt      120

ctccataatc agcggacgaa aagcatcatg gatctgtatg aggccgaata caacgaagat      180

tttatgaaaa gtattatcca tggagggggt ggcgctatta ccaacctggg agatacccaa      240

gtggtcccac agtcccacgt agcagccgct cacgagacca atatgctgga gtccaacaaa      300

cacgtagacg gcgccgctcc gggaaaaaag aggccggtag agcactctcc tgtggagcca      360

gactcctcct cgggaaccgg aaaggcgggc cagcagcctg caagaaaaag attgaatttt      420

ggtcagactg gagacgcaga ctcagtacct gacccccagc ctctcggaca gccaccagca      480

gccccctctg gtctgggaac taatacgctg gctacaggca gtggcgcacc actggcagac      540

aataacgagg gcgccgacgg agtgggtaat tcctcgggaa attggcattg cgattccaca      600

tggctgggcg acagagtcat caccaccagc acccgaacct gggccctgcc cacctacaac      660

aaccacctct acaaacaaat ttccagccaa tcaggagcct cgaacgacaa tcactacttt      720

ggctacagca ccccttgggg gtattttgac ttcaacagat tccactgcca cttttcacca      780

cgtgactggc aaagactcat caacaacaac tggggattcc gacccaagag actcaacttc      840

aagctcttta acattcaagt caaagaggtc acgcagaatg acggtacgac gacgattgcc      900

aataacctta ccagcacggt tcaggtgttt actgactcgg agtaccagct cccgtacgtc      960

ctcggctcgg cgcatcaagg atgcctcccg ccgttcccag cagacgtctt catggtgcca     1020

cagtatggat acctcaccct gaacaacggg agtcaggcag taggacgctc ttcattttac     1080

tgcctggagt actttccttc tcagatgctg cgtaccggaa acaactttac cttcagctac     1140

acttttgagg acgttccttt ccacagcagc tacgctcaca gccagagtct ggaccgtctc     1200

atgaatcctc tcatcgacca gtacctgtat tacttgagca gaacaaacac tccaagtgga     1260

accaccacgc agtcaaggct tcagttttct caggccggag cgagtgacat tcgggaccag     1320

tctaggaact ggcttcctgg accctgttac cgccagcagc gagtatcaaa gacatctgcg     1380

gataacaaca acagtgaata ctcgtggact ggagctacca agtaccacct caatggcaga     1440

gactctctgg tgaatccggg cccggccatg gcaagccaca aggacgatga agaaaagttt     1500

tttcctcaga gcggggttct catctttggg aagcaaggct cagagaaaac aaatgtggac     1560

attgaaaagg tcatgattac agacgaagag gaaatcagga caaccaatcc cgtggctacg     1620

gagcagtatg gttctgtatc taccaacctc cagagaggca acagacaagc agctaccgca     1680

gatgtcaaca cacaaggcgt tcttccaggc atggtctggc aggacagaga tgtgtacctt     1740

caggggccca tctgggcaaa gattccacac acggacggac attttcaccc ctctcccctc     1800

atgggtggat tcggacttaa acaccctcct ccacagattc tcatcaagaa caccccggta     1860

cctgcgaatc cttcgaccac cttcagtgcg gcaaagtttg cttccttcat cacacagtac     1920

tccacgggac aggtcagcgt ggagatcgag tgggagctgc agaaggaaaa cagcaaacgc     1980

tggaatcccg aaattcagta cacttccaac tacaacaagt ctgttaatgt ggactttact     2040

gtggacacta atggcgtgta ttcagagcct cgccccattg gcaccagata cctgactcgt     2100

aatctgtaa                                                             2109


<210> 46
<211> 702
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 46
Met Ala Gly Met Met Phe Leu Pro Thr Asp Tyr Cys Cys Arg Leu Ser 
1               5                   10                  15      


Asp Gln Glu Tyr Met Glu Leu Val Phe Glu Asn Gly Gln Ile Leu Ala 
            20                  25                  30          


Lys Gly Gln Arg Ser Asn Val Ser Leu His Asn Gln Arg Thr Lys Ser 
        35                  40                  45              


Ile Met Asp Leu Tyr Glu Ala Glu Tyr Asn Glu Asp Phe Met Lys Ser 
    50                  55                  60                  


Ile Ile His Gly Gly Gly Gly Ala Ile Thr Asn Leu Gly Asp Thr Gln 
65                  70                  75                  80  


Val Val Pro Gln Ser His Val Ala Ala Ala His Glu Thr Asn Met Leu 
                85                  90                  95      


Glu Ser Asn Lys His Val Asp Gly Ala Ala Pro Gly Lys Lys Arg Pro 
            100                 105                 110         


Val Glu His Ser Pro Val Glu Pro Asp Ser Ser Ser Gly Thr Gly Lys 
        115                 120                 125             


Ala Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr Gly 
    130                 135                 140                 


Asp Ala Asp Ser Val Pro Asp Pro Gln Pro Leu Gly Gln Pro Pro Ala 
145                 150                 155                 160 


Ala Pro Ser Gly Leu Gly Thr Asn Thr Leu Ala Thr Gly Ser Gly Ala 
                165                 170                 175     


Pro Leu Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser Ser 
            180                 185                 190         


Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile Thr 
        195                 200                 205             


Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr 
    210                 215                 220                 


Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His Tyr Phe 
225                 230                 235                 240 


Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys 
                245                 250                 255     


His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp Gly 
            260                 265                 270         


Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln Val Lys 
        275                 280                 285             


Glu Val Thr Gln Asn Asp Gly Thr Thr Thr Ile Ala Asn Asn Leu Thr 
    290                 295                 300                 


Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu Pro Tyr Val 
305                 310                 315                 320 


Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp Val 
                325                 330                 335     


Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gln 
            340                 345                 350         


Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln 
        355                 360                 365             


Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu Asp 
    370                 375                 380                 


Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp Arg Leu 
385                 390                 395                 400 


Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Arg Thr Asn 
                405                 410                 415     


Thr Pro Ser Gly Thr Thr Thr Gln Ser Arg Leu Gln Phe Ser Gln Ala 
            420                 425                 430         


Gly Ala Ser Asp Ile Arg Asp Gln Ser Arg Asn Trp Leu Pro Gly Pro 
        435                 440                 445             


Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Ser Ala Asp Asn Asn Asn 
    450                 455                 460                 


Ser Glu Tyr Ser Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg 
465                 470                 475                 480 


Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys Asp Asp 
                485                 490                 495     


Glu Glu Lys Phe Phe Pro Gln Ser Gly Val Leu Ile Phe Gly Lys Gln 
            500                 505                 510         


Gly Ser Glu Lys Thr Asn Val Asp Ile Glu Lys Val Met Ile Thr Asp 
        515                 520                 525             


Glu Glu Glu Ile Arg Thr Thr Asn Pro Val Ala Thr Glu Gln Tyr Gly 
    530                 535                 540                 


Ser Val Ser Thr Asn Leu Gln Arg Gly Asn Arg Gln Ala Ala Thr Ala 
545                 550                 555                 560 


Asp Val Asn Thr Gln Gly Val Leu Pro Gly Met Val Trp Gln Asp Arg 
                565                 570                 575     


Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr Asp 
            580                 585                 590         


Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His 
        595                 600                 605             


Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asn Pro 
    610                 615                 620                 


Ser Thr Thr Phe Ser Ala Ala Lys Phe Ala Ser Phe Ile Thr Gln Tyr 
625                 630                 635                 640 


Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln Lys Glu 
                645                 650                 655     


Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr Asn 
            660                 665                 670         


Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val Tyr Ser 
        675                 680                 685             


Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
    690                 695                 700         


<210> 47
<211> 312
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 47
gccggcatga tgttccttcc tactgattat tgttgcagac tgagcgacca ggaatacatg       60

gaactcgtct tcgagaacgg acagatactc gcaaaaggcc agaggtcaaa tgttagtctc      120

cataatcagc ggacgaaaag catcatggat ctgtatgagg ccgaatacaa cgaagatttt      180

atgaaaagta ttatccatgg agggggtggc gctattacca acctgggaga tacccaagtg      240

gtcccacagt cccacgtagc agccgctcac gagaccaata tgctggagtc caacaaacac      300

gtagacggcg cc                                                          312


<210> 48
<211> 104
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 48
Ala Gly Met Met Phe Leu Pro Thr Asp Tyr Cys Cys Arg Leu Ser Asp 
1               5                   10                  15      


Gln Glu Tyr Met Glu Leu Val Phe Glu Asn Gly Gln Ile Leu Ala Lys 
            20                  25                  30          


Gly Gln Arg Ser Asn Val Ser Leu His Asn Gln Arg Thr Lys Ser Ile 
        35                  40                  45              


Met Asp Leu Tyr Glu Ala Glu Tyr Asn Glu Asp Phe Met Lys Ser Ile 
    50                  55                  60                  


Ile His Gly Gly Gly Gly Ala Ile Thr Asn Leu Gly Asp Thr Gln Val 
65                  70                  75                  80  


Val Pro Gln Ser His Val Ala Ala Ala His Glu Thr Asn Met Leu Glu 
                85                  90                  95      


Ser Asn Lys His Val Asp Gly Ala 
            100                 


<210> 49
<211> 2208
<212> DNA
<213> Adeno-associated virus 2

<400> 49
atggctgccg atggttatct tccagattgg ctcgaggaca ctctctctga aggaataaga       60

cagtggtgga agctcaaacc tggcccacca ccaccaaagc ccgcagagcg gcataaggac      120

gacagcaggg gtcttgtgct tcctgggtac aagtacctcg gacccttcaa cggactcgac      180

aagggagagc cggtcaacga ggcagacgcc gcggccctcg agcacgacaa agcctacgac      240

cggcagctcg acagcggaga caacccgtac ctcaagtaca accacgccga cgcggagttt      300

caggagcgcc ttaaagaaga tacgtctttt gggggcaacc tcggacgagc agtcttccag      360

gcgaaaaaga gggttcttga acctctgggc ctggttgagg aacctgttaa gaaggctccg      420

ggaaaaaaga ggccggtaga gcactctcct gtggagccag actcctcctc gggaaccgga      480

aaggcgggcc agcagcctgc aagaaaaaga ttgaattttg gtcagactgg agacgcagac      540

tcagtacctg acccccagcc tctcggacag ccaccagcag ccccctctgg tctgggaact      600

aataccatgg ctacaggcag tggcgcacca atggcagaca ataacgaggg tgccgacgga      660

gtgggtaatt cctcgggaaa ttggcattgc gattccacat ggatgggcga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca accacctcta caaacaaatt      780

tccagccaat caggagcctc gaacgacaat cactactttg gctacagcac cccttggggg      840

tattttgact tcaacagatt ccactgccac ttttcaccac gtgactggca aagactcatc      900

aacaacaact ggggattccg acccaagaga ctcaacttca agctctttaa cattcaagtc      960

aaagaggtca cgcagaatga cggtacgacg acgattgcca ataaccttac cagcacggtt     1020

caggtgttta ctgactcgga gtaccagctc ccgtacgtcc tcggctcggc gcatcaagga     1080

tgcctcccgc cgttcccagc agacgtcttc atggtgccac agtatggata cctcaccctg     1140

aacaacggga gtcaggcagt aggacgctct tcattttact gcctggagta ctttccttct     1200

cagatgctgc gtaccggaaa caactttacc ttcagctaca cttttgagga cgttcctttc     1260

cacagcagct acgctcacag ccagagtctg gaccgtctca tgaatcctct catcgaccag     1320

tacctgtatt acttgagcag aacaaacact ccaagtggaa ccaccacgca gtcaaggctt     1380

cagttttctc aggccggagc gagtgacatt cgggaccagt ctaggaactg gcttcctgga     1440

ccctgttacc gccagcagcg agtatcaaag acatctgcgg ataacaacaa cagtgaatac     1500

tcgtggactg gagctaccaa gtaccacctc aatggcagag actctctggt gaatccgggc     1560

ccggctatgg caagccacaa ggacgatgaa gaaaagtttt ttcctcagag cggggttctc     1620

atctttggga agcaaggctc agagaaaaca aatgtggaca ttgaaaaggt catgattaca     1680

gacgaagagg aaatcaggac aaccaatccc gtggctacgg agcagtatgg ttctgtatct     1740

accaacctcc agagaggcaa cagacaagca gctaccgcag atgtcaacac acaaggcgtt     1800

cttccaggca tggtctggca ggacagagat gtgtaccttc aggggcccat ctgggcaaag     1860

attccacaca cggacggaca ttttcacccc tctcccctca tgggtggatt cggacttaaa     1920

caccctcctc cacagattct catcaagaac accccggtgc ctgcgaatcc ttcgaccacc     1980

ttcagtgcgg caaagtttgc ttccttcatc acacagtact ccacgggaca ggtcagcgtg     2040

gagatcgagt gggagctgca gaaggaaaac agcaaacgct ggaatcccga aattcagtac     2100

acttccaact acaacaagtc tgttaatgtg gactttactg tggacactaa tggcgtgtat     2160

tcagagcctc gccccattgg caccagatac ctgactcgta atctgtaa                  2208


<210> 50
<211> 735
<212> PRT
<213> Adeno-associated virus 2

<400> 50
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
            20                  25                  30          


Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Arg Gln Leu Asp Ser Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Pro Val Lys Lys Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu His Ser Pro Val Glu Pro Asp Ser Ser Ser Gly Thr Gly 
145                 150                 155                 160 


Lys Ala Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ala Asp Ser Val Pro Asp Pro Gln Pro Leu Gly Gln Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Leu Gly Thr Asn Thr Met Ala Thr Gly Ser Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His Tyr 
            260                 265                 270         


Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
        275                 280                 285             


Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp 
    290                 295                 300                 


Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln Val 
305                 310                 315                 320 


Lys Glu Val Thr Gln Asn Asp Gly Thr Thr Thr Ile Ala Asn Asn Leu 
                325                 330                 335     


Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu Pro Tyr 
            340                 345                 350         


Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp 
        355                 360                 365             


Val Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
    370                 375                 380                 


Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385                 390                 395                 400 


Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
                405                 410                 415     


Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp Arg 
            420                 425                 430         


Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Arg Thr 
        435                 440                 445             


Asn Thr Pro Ser Gly Thr Thr Thr Gln Ser Arg Leu Gln Phe Ser Gln 
    450                 455                 460                 


Ala Gly Ala Ser Asp Ile Arg Asp Gln Ser Arg Asn Trp Leu Pro Gly 
465                 470                 475                 480 


Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Ser Ala Asp Asn Asn 
                485                 490                 495     


Asn Ser Glu Tyr Ser Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly 
            500                 505                 510         


Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys Asp 
        515                 520                 525             


Asp Glu Glu Lys Phe Phe Pro Gln Ser Gly Val Leu Ile Phe Gly Lys 
    530                 535                 540                 


Gln Gly Ser Glu Lys Thr Asn Val Asp Ile Glu Lys Val Met Ile Thr 
545                 550                 555                 560 


Asp Glu Glu Glu Ile Arg Thr Thr Asn Pro Val Ala Thr Glu Gln Tyr 
                565                 570                 575     


Gly Ser Val Ser Thr Asn Leu Gln Arg Gly Asn Arg Gln Ala Ala Thr 
            580                 585                 590         


Ala Asp Val Asn Thr Gln Gly Val Leu Pro Gly Met Val Trp Gln Asp 
        595                 600                 605             


Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr 
    610                 615                 620                 


Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625                 630                 635                 640 


His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asn 
                645                 650                 655     


Pro Ser Thr Thr Phe Ser Ala Ala Lys Phe Ala Ser Phe Ile Thr Gln 
            660                 665                 670         


Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln Lys 
        675                 680                 685             


Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr 
    690                 695                 700                 


Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val Tyr 
705                 710                 715                 720 


Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735 


<210> 51
<211> 1797
<212> DNA
<213> Adeno-associated virus 2

<400> 51
aaggctccgg gaaaaaagag gccggtagag cactctcctg tggagccaga ctcctcctcg       60

ggaaccggaa aggcgggcca gcagcctgca agaaaaagat tgaattttgg tcagactgga      120

gacgcagact cagtacctga cccccagcct ctcggacagc caccagcagc cccctctggt      180

ctgggaacta ataccatggc tacaggcagt ggcgcaccaa tggcagacaa taacgagggt      240

gccgacggag tgggtaattc ctcgggaaat tggcattgcg attccacatg gatgggcgac      300

agagtcatca ccaccagcac ccgaacctgg gccctgccca cctacaacaa ccacctctac      360

aaacaaattt ccagccaatc aggagcctcg aacgacaatc actactttgg ctacagcacc      420

ccttgggggt attttgactt caacagattc cactgccact tttcaccacg tgactggcaa      480

agactcatca acaacaactg gggattccga cccaagagac tcaacttcaa gctctttaac      540

attcaagtca aagaggtcac gcagaatgac ggtacgacga cgattgccaa taaccttacc      600

agcacggttc aggtgtttac tgactcggag taccagctcc cgtacgtcct cggctcggcg      660

catcaaggat gcctcccgcc gttcccagca gacgtcttca tggtgccaca gtatggatac      720

ctcaccctga acaacgggag tcaggcagta ggacgctctt cattttactg cctggagtac      780

tttccttctc agatgctgcg taccggaaac aactttacct tcagctacac ttttgaggac      840

gttcctttcc acagcagcta cgctcacagc cagagtctgg accgtctcat gaatcctctc      900

atcgaccagt acctgtatta cttgagcaga acaaacactc caagtggaac caccacgcag      960

tcaaggcttc agttttctca ggccggagcg agtgacattc gggaccagtc taggaactgg     1020

cttcctggac cctgttaccg ccagcagcga gtatcaaaga catctgcgga taacaacaac     1080

agtgaatact cgtggactgg agctaccaag taccacctca atggcagaga ctctctggtg     1140

aatccgggcc cggctatggc aagccacaag gacgatgaag aaaagttttt tcctcagagc     1200

ggggttctca tctttgggaa gcaaggctca gagaaaacaa atgtggacat tgaaaaggtc     1260

atgattacag acgaagagga aatcaggaca accaatcccg tggctacgga gcagtatggt     1320

tctgtatcta ccaacctcca gagaggcaac agacaagcag ctaccgcaga tgtcaacaca     1380

caaggcgttc ttccaggcat ggtctggcag gacagagatg tgtaccttca ggggcccatc     1440

tgggcaaaga ttccacacac ggacggacat tttcacccct ctcccctcat gggtggattc     1500

ggacttaaac accctcctcc acagattctc atcaagaaca ccccggtgcc tgcgaatcct     1560

tcgaccacct tcagtgcggc aaagtttgct tccttcatca cacagtactc cacgggacag     1620

gtcagcgtgg agatcgagtg ggagctgcag aaggaaaaca gcaaacgctg gaatcccgaa     1680

attcagtaca cttccaacta caacaagtct gttaatgtgg actttactgt ggacactaat     1740

ggcgtgtatt cagagcctcg ccccattggc accagatacc tgactcgtaa tctgtaa        1797


<210> 52
<211> 598
<212> PRT
<213> Adeno-associated virus 2

<400> 52
Lys Ala Pro Gly Lys Lys Arg Pro Val Glu His Ser Pro Val Glu Pro 
1               5                   10                  15      


Asp Ser Ser Ser Gly Thr Gly Lys Ala Gly Gln Gln Pro Ala Arg Lys 
            20                  25                  30          


Arg Leu Asn Phe Gly Gln Thr Gly Asp Ala Asp Ser Val Pro Asp Pro 
        35                  40                  45              


Gln Pro Leu Gly Gln Pro Pro Ala Ala Pro Ser Gly Leu Gly Thr Asn 
    50                  55                  60                  


Thr Met Ala Thr Gly Ser Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65                  70                  75                  80  


Ala Asp Gly Val Gly Asn Ser Ser Gly Asn Trp His Cys Asp Ser Thr 
                85                  90                  95      


Trp Met Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu 
            100                 105                 110         


Pro Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Gln Ser Gly 
        115                 120                 125             


Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
    130                 135                 140                 


Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 
145                 150                 155                 160 


Arg Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
                165                 170                 175     


Lys Leu Phe Asn Ile Gln Val Lys Glu Val Thr Gln Asn Asp Gly Thr 
            180                 185                 190         


Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 
        195                 200                 205             


Ser Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys 
    210                 215                 220                 


Leu Pro Pro Phe Pro Ala Asp Val Phe Met Val Pro Gln Tyr Gly Tyr 
225                 230                 235                 240 


Leu Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr 
                245                 250                 255     


Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe 
            260                 265                 270         


Thr Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala 
        275                 280                 285             


His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr 
    290                 295                 300                 


Leu Tyr Tyr Leu Ser Arg Thr Asn Thr Pro Ser Gly Thr Thr Thr Gln 
305                 310                 315                 320 


Ser Arg Leu Gln Phe Ser Gln Ala Gly Ala Ser Asp Ile Arg Asp Gln 
                325                 330                 335     


Ser Arg Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser 
            340                 345                 350         


Lys Thr Ser Ala Asp Asn Asn Asn Ser Glu Tyr Ser Trp Thr Gly Ala 
        355                 360                 365             


Thr Lys Tyr His Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Pro 
    370                 375                 380                 


Ala Met Ala Ser His Lys Asp Asp Glu Glu Lys Phe Phe Pro Gln Ser 
385                 390                 395                 400 


Gly Val Leu Ile Phe Gly Lys Gln Gly Ser Glu Lys Thr Asn Val Asp 
                405                 410                 415     


Ile Glu Lys Val Met Ile Thr Asp Glu Glu Glu Ile Arg Thr Thr Asn 
            420                 425                 430         


Pro Val Ala Thr Glu Gln Tyr Gly Ser Val Ser Thr Asn Leu Gln Arg 
        435                 440                 445             


Gly Asn Arg Gln Ala Ala Thr Ala Asp Val Asn Thr Gln Gly Val Leu 
    450                 455                 460                 


Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile 
465                 470                 475                 480 


Trp Ala Lys Ile Pro His Thr Asp Gly His Phe His Pro Ser Pro Leu 
                485                 490                 495     


Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys 
            500                 505                 510         


Asn Thr Pro Val Pro Ala Asn Pro Ser Thr Thr Phe Ser Ala Ala Lys 
        515                 520                 525             


Phe Ala Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu 
    530                 535                 540                 


Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
545                 550                 555                 560 


Ile Gln Tyr Thr Ser Asn Tyr Asn Lys Ser Val Asn Val Asp Phe Thr 
                565                 570                 575     


Val Asp Thr Asn Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg 
            580                 585                 590         


Tyr Leu Thr Arg Asn Leu 
        595             


<210> 53
<211> 1602
<212> DNA
<213> Adeno-associated virus 2

<400> 53
atggctacag gcagtggcgc accaatggca gacaataacg agggtgccga cggagtgggt       60

aattcctcgg gaaattggca ttgcgattcc acatggatgg gcgacagagt catcaccacc      120

agcacccgaa cctgggccct gcccacctac aacaaccacc tctacaaaca aatttccagc      180

caatcaggag cctcgaacga caatcactac tttggctaca gcaccccttg ggggtatttt      240

gacttcaaca gattccactg ccacttttca ccacgtgact ggcaaagact catcaacaac      300

aactggggat tccgacccaa gagactcaac ttcaagctct ttaacattca agtcaaagag      360

gtcacgcaga atgacggtac gacgacgatt gccaataacc ttaccagcac ggttcaggtg      420

tttactgact cggagtacca gctcccgtac gtcctcggct cggcgcatca aggatgcctc      480

ccgccgttcc cagcagacgt cttcatggtg ccacagtatg gatacctcac cctgaacaac      540

gggagtcagg cagtaggacg ctcttcattt tactgcctgg agtactttcc ttctcagatg      600

ctgcgtaccg gaaacaactt taccttcagc tacacttttg aggacgttcc tttccacagc      660

agctacgctc acagccagag tctggaccgt ctcatgaatc ctctcatcga ccagtacctg      720

tattacttga gcagaacaaa cactccaagt ggaaccacca cgcagtcaag gcttcagttt      780

tctcaggccg gagcgagtga cattcgggac cagtctagga actggcttcc tggaccctgt      840

taccgccagc agcgagtatc aaagacatct gcggataaca acaacagtga atactcgtgg      900

actggagcta ccaagtacca cctcaatggc agagactctc tggtgaatcc gggcccggct      960

atggcaagcc acaaggacga tgaagaaaag ttttttcctc agagcggggt tctcatcttt     1020

gggaagcaag gctcagagaa aacaaatgtg gacattgaaa aggtcatgat tacagacgaa     1080

gaggaaatca ggacaaccaa tcccgtggct acggagcagt atggttctgt atctaccaac     1140

ctccagagag gcaacagaca agcagctacc gcagatgtca acacacaagg cgttcttcca     1200

ggcatggtct ggcaggacag agatgtgtac cttcaggggc ccatctgggc aaagattcca     1260

cacacggacg gacattttca cccctctccc ctcatgggtg gattcggact taaacaccct     1320

cctccacaga ttctcatcaa gaacaccccg gtgcctgcga atccttcgac caccttcagt     1380

gcggcaaagt ttgcttcctt catcacacag tactccacgg gacaggtcag cgtggagatc     1440

gagtgggagc tgcagaagga aaacagcaaa cgctggaatc ccgaaattca gtacacttcc     1500

aactacaaca agtctgttaa tgtggacttt actgtggaca ctaatggcgt gtattcagag     1560

cctcgcccca ttggcaccag atacctgact cgtaatctgt aa                        1602


<210> 54
<211> 533
<212> PRT
<213> Adeno-associated virus 2

<400> 54
Met Ala Thr Gly Ser Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
1               5                   10                  15      


Asp Gly Val Gly Asn Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
            20                  25                  30          


Met Gly Asp Arg Val Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
        35                  40                  45              


Thr Tyr Asn Asn His Leu Tyr Lys Gln Ile Ser Ser Gln Ser Gly Ala 
    50                  55                  60                  


Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe 
65                  70                  75                  80  


Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg 
                85                  90                  95      


Leu Ile Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys 
            100                 105                 110         


Leu Phe Asn Ile Gln Val Lys Glu Val Thr Gln Asn Asp Gly Thr Thr 
        115                 120                 125             


Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser 
    130                 135                 140                 


Glu Tyr Gln Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu 
145                 150                 155                 160 


Pro Pro Phe Pro Ala Asp Val Phe Met Val Pro Gln Tyr Gly Tyr Leu 
                165                 170                 175     


Thr Leu Asn Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys 
            180                 185                 190         


Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr 
        195                 200                 205             


Phe Ser Tyr Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His 
    210                 215                 220                 


Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu 
225                 230                 235                 240 


Tyr Tyr Leu Ser Arg Thr Asn Thr Pro Ser Gly Thr Thr Thr Gln Ser 
                245                 250                 255     


Arg Leu Gln Phe Ser Gln Ala Gly Ala Ser Asp Ile Arg Asp Gln Ser 
            260                 265                 270         


Arg Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys 
        275                 280                 285             


Thr Ser Ala Asp Asn Asn Asn Ser Glu Tyr Ser Trp Thr Gly Ala Thr 
    290                 295                 300                 


Lys Tyr His Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Pro Ala 
305                 310                 315                 320 


Met Ala Ser His Lys Asp Asp Glu Glu Lys Phe Phe Pro Gln Ser Gly 
                325                 330                 335     


Val Leu Ile Phe Gly Lys Gln Gly Ser Glu Lys Thr Asn Val Asp Ile 
            340                 345                 350         


Glu Lys Val Met Ile Thr Asp Glu Glu Glu Ile Arg Thr Thr Asn Pro 
        355                 360                 365             


Val Ala Thr Glu Gln Tyr Gly Ser Val Ser Thr Asn Leu Gln Arg Gly 
    370                 375                 380                 


Asn Arg Gln Ala Ala Thr Ala Asp Val Asn Thr Gln Gly Val Leu Pro 
385                 390                 395                 400 


Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 
                405                 410                 415     


Ala Lys Ile Pro His Thr Asp Gly His Phe His Pro Ser Pro Leu Met 
            420                 425                 430         


Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn 
        435                 440                 445             


Thr Pro Val Pro Ala Asn Pro Ser Thr Thr Phe Ser Ala Ala Lys Phe 
    450                 455                 460                 


Ala Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile 
465                 470                 475                 480 


Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile 
                485                 490                 495     


Gln Tyr Thr Ser Asn Tyr Asn Lys Ser Val Asn Val Asp Phe Thr Val 
            500                 505                 510         


Asp Thr Asn Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr 
        515                 520                 525             


Leu Thr Arg Asn Leu 
    530             


<210> 55
<211> 21
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 55
cccaagaaaa agcggaaggt g                                                 21


<210> 56
<211> 7
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 56
Pro Lys Lys Lys Arg Lys Val 
1               5           


<210> 57
<211> 65
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 57
acgaggccgc aaagagactg cccgacgcca acctggcagc cgcagccaag aagaaaaagc       60

tggac                                                                   65


<210> 58
<211> 21
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 58
Thr Arg Pro Gln Arg Asp Cys Pro Thr Pro Thr Trp Gln Pro Gln Pro 
1               5                   10                  15      


Arg Arg Lys Ser Trp 
            20      


<210> 59
<211> 33
<212> DNA
<213> Human immunodeficiency virus

<400> 59
cttcaacttc ctcctcttga gagacttact ctt                                    33


<210> 60
<211> 11
<212> PRT
<213> Human immunodeficiency virus

<400> 60
Leu Gln Leu Pro Pro Leu Glu Arg Leu Thr Leu 
1               5                   10      


<210> 61
<211> 27
<212> DNA
<213> Human immunodeficiency virus

<400> 61
cttcctcctc ttgagagact tactctt                                           27


<210> 62
<211> 9
<212> PRT
<213> Human immunodeficiency virus

<400> 62
Leu Pro Pro Leu Glu Arg Leu Thr Leu 
1               5                   


<210> 63
<211> 54
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 63
cccagcaccc ggatccagca gcagctgggc cagctgaccc tggagaacct gcag             54


<210> 64
<211> 18
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 64
Pro Ser Thr Arg Ile Gln Gln Gln Leu Gly Gln Leu Thr Leu Glu Asn 
1               5                   10                  15      


Leu Gln 
        


<210> 65
<211> 33
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 65
atgttagcct tgaaattagc aggtcttgat atc                                    33


<210> 66
<211> 11
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 66
Met Leu Ala Leu Lys Leu Ala Gly Leu Asp Ile 
1               5                   10      


<210> 67
<211> 408
<212> DNA
<213> Avena sativa

<400> 67
ttggctacta cacttgaacg tattgagaag aactttgtca ttactgaccc aagattgcca       60

gataatccca ttatattcgc gtccgatagt ttcttgcagt tgacagaata tagccgtgaa      120

gaaattttgg gaagaaactg caggtttcta caaggtcctg aaactgatcg cgcgacagtg      180

agaaaaatta gagatgccat agataaccaa acagaggtca ctgttcagct gattaattat      240

acaaagagtg gtaaaaagtt ctggaacctc tttcacttgc agcctatgcg agatcagaag      300

ggagatgtcc agtactttat tggggttcag ttggatggaa ctgagcatgt ccgagatgct      360

gccgagagag agggagtcat gctgattaag aaaactgcag aaaatatt                   408


<210> 68
<211> 136
<212> PRT
<213> Avena sativa

<400> 68
Leu Ala Thr Thr Leu Glu Arg Ile Glu Lys Asn Phe Val Ile Thr Asp 
1               5                   10                  15      


Pro Arg Leu Pro Asp Asn Pro Ile Ile Phe Ala Ser Asp Ser Phe Leu 
            20                  25                  30          


Gln Leu Thr Glu Tyr Ser Arg Glu Glu Ile Leu Gly Arg Asn Cys Arg 
        35                  40                  45              


Phe Leu Gln Gly Pro Glu Thr Asp Arg Ala Thr Val Arg Lys Ile Arg 
    50                  55                  60                  


Asp Ala Ile Asp Asn Gln Thr Glu Val Thr Val Gln Leu Ile Asn Tyr 
65                  70                  75                  80  


Thr Lys Ser Gly Lys Lys Phe Trp Asn Leu Phe His Leu Gln Pro Met 
                85                  90                  95      


Arg Asp Gln Lys Gly Asp Val Gln Tyr Phe Ile Gly Val Gln Leu Asp 
            100                 105                 110         


Gly Thr Glu His Val Arg Asp Ala Ala Glu Arg Glu Gly Val Met Leu 
        115                 120                 125             


Ile Lys Lys Thr Ala Glu Asn Ile 
    130                 135     


<210> 69
<211> 18930
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 69
tcttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta       60

tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag      120

aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg      180

tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg      240

tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg      300

cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga      360

agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc      420

tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt      480

aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact      540

ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg      600

cctaactacg gctacactag aagaacagta tttggtatct gcgctctgct gaagccagtt      660

accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt      720

ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct      780

ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg      840

gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt      900

aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt      960

gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc     1020

gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg     1080

cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc     1140

gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg     1200

gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca     1260

ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga     1320

tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct     1380

ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg     1440

cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca     1500

accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata     1560

cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct     1620

tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact     1680

cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa     1740

acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc     1800

atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga     1860

tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga     1920

aaagtgccac ctgacgtcta agaaaccatt attatcatga cattaaccta taaaaatagg     1980

cgtatcacga ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac     2040

atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag cagacaagcc     2100

cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta tgcggcatca     2160

gagcagattg tactgagagt gcaccataaa attgtaaacg ttaatatttt gttaaaattc     2220

gcgttaaatt tttgttaaat cagctcattt tttaaccaat aggccgaaat cggcaaaatc     2280

ccttataaat caaaagaata gcccgagata gggttgagtg ttgttccagt ttggaacaag     2340

agtccactat taaagaacgt ggactccaac gtcaaagggc gaaaaaccgt ctatcagggc     2400

gatggcccac tacgtgaacc atcacccaaa tcaagttttt tggggtcgag gtgccgtaaa     2460

gcactaaatc ggaaccctaa agggagcccc cgatttagag cttgacgggg aaagccggcg     2520

aacgtggcga gaaaggaagg gaagaaagcg aaaggagcgg gcgctagggc gctggcaagt     2580

gtagcggtca cgctgcgcgt aaccaccaca cccgccgcgc ttaatgcgcc gctacagggc     2640

gcgtactatg gttgctttga cgtatgcggt gtgaaatacc gcacagatgc gtaaggagaa     2700

aataccgcat caggcgccat tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg     2760

tgcgggcctc ttcgctatta cgccagctgg cgaaaggggg atgtgctgca aggcgattaa     2820

gttgggtaac gccagggttt tcccagtcac gacgttgtaa aacgacggcc agtgccaagc     2880

ttaaggtgca cggcccacgt ggccactagt acttctcgac agaagcacca tgtccttggg     2940

tccggcctgc tgaatgcgca ggcggtcggc catgccccag gcttcgtttt gacatcggcg     3000

caggtctttg tagtagtctt gcatgagcct ttctaccggc acttcttctt ctccttcctc     3060

ttgtcctgca tctcttgcat ctatcgctgc ggcggcggcg gagtttggcc gtaggtggcg     3120

ccctcttcct cccatgcgtg tgaccccgaa gcccctcatc ggctgaagca gggctaggtc     3180

ggcgacaacg cgctcggcta atatggcctg ctgcacctgc gtgagggtag actggaagtc     3240

atccatgtcc acaaagcggt ggtatgcgcc cgtgttgatg gtgtaagtgc agttggccat     3300

aacggaccag ttaacggtct ggtgacccgg ctgcgagagc tcggtgtacc tgagacgcga     3360

gtaagccctc gagtcaaata cgtagtcgtt gcaagtccgc accaggtact ggtatcccac     3420

caaaaagtgc ggcggcggct ggcggtagag gggccagcgt agggtggccg gggctccggg     3480

ggcgagatct tccaacataa ggcgatgata tccgtagatg tacctggaca tccaggtgat     3540

gccggcggcg gtggtggagg cgcgcggaaa gtcgcggacg cggttccaga tgttgcgcag     3600

cggcaaaaag tgctccatgg tcgggacgct ctggccggtc aggcgcgcgc aatcgttgac     3660

gctctaccgt gcaaaaggag agcctgtaag cgggcactct tccgtggtct ggtggataaa     3720

ttcgcaaggg tatcatggcg gacgaccggg gttcgagccc cgtatccggc cgtccgccgt     3780

gatccatgcg gttaccgccc gcgtgtcgaa cccaggtgtg cgacgtcaga caacggggga     3840

gtgctccttt tggcttcctt ccaggcgcgg cggctgctgc gctagctttt ttggccactg     3900

gccgcgcgca gcgtaagcgg ttaggctgga aagcgaaagc attaagtggc tcgctccctg     3960

tagccggagg gttattttcc aagggttgag tcgcgggacc cccggttcga gtctcggacc     4020

ggccggactg cggcgaacgg gggtttgcct ccccgtcatg caagaccccg cttgcaaatt     4080

cctccggaaa cagggacgag cccctttttt gcttttccca gatgcatccg gtgctgcggc     4140

agatgcgccc ccctcctcag cagcggcaag agcaagagca gcggcagaca tgcagggcac     4200

cctcccctcc tcctaccgcg tcaggagggg cgacatccgc ggttgacgcg gcagcagatg     4260

gtgattacga acccccgcgg cgccgggccc ggcactacct ggacttggag gagggcgagg     4320

gcctggcgcg gctaggagcg ccctctcctg agcggtaccc aagggtgcag ctgaagcgtg     4380

atacgcgtga ggcgtacgtg ccgcggcaga acctgtttcg cgaccgcgag ggagaggagc     4440

ccgaggagat gcgggatcga aagttccacg cagggcgcga gctgcggcat ggcctgaatc     4500

gcgagcggtt gctgcgcgag gaggactttg agcccgacgc gcgaaccggg attagtcccg     4560

cgcgcgcaca cgtggcggcc gccgacctgg taaccgcata cgagcagacg gtgaaccagg     4620

agattaactt tcaaaaaagc tttaacaacc acgtgcgtac gcttgtggcg cgcgaggagg     4680

tggctatagg actgatgcat ctgtgggact ttgtaagcgc gctggagcaa aacccaaata     4740

gcaagccgct catggcgcag ctgttcctta tagtgcagca cagcagggac aacgaggcat     4800

tcagggatgc gctgctaaac atagtagagc ccgagggccg ctggctgctc gatttgataa     4860

acatcctgca gagcatagtg gtgcaggagc gcagcttgag cctggctgac aaggtggccg     4920

ccatcaacta ttccatgctt agcctgggca agttttacgc ccgcaagata taccataccc     4980

cttacgttcc catagacaag gaggtaaaga tcgaggggtt ctacatgcgc atggcgctga     5040

aggtgcttac cttgagcgac gacctgggcg tttatcgcaa cgagcgcatc cacaaggccg     5100

tgagcgtgag ccggcggcgc gagctcagcg accgcgagct gatgcacagc ctgcaaaggg     5160

ccctggctgg cacgggcagc ggcgatagag aggccgagtc ctactttgac gcgggcgctg     5220

acctgcgctg ggccccaagc cgacgcgccc tggaggcagc tggggccgga cctgggctgg     5280

cggtggcacc cgcgcgcgct ggcaacgtcg gcggcgtgga ggaatatgac gaggacgatg     5340

agtacgagcc agaggacggc gagtactaag cggtgatgtt tctgatcaga tgatgcaaga     5400

cgcaacggac ccggcggtgc gggcggcgct gcagagccag ccgtccggcc ttaactccac     5460

ggacgactgg cgccaggtca tggaccgcat catgtcgctg actgcgcgca atcctgacgc     5520

gttccggcag cagccgcagg ccaaccggct ctccgcaatt ctggaagcgg tggtcccggc     5580

gcgcgcaaac cccacgcacg agaaggtgct ggcgatcgta aacgcgctgg ccgaaaacag     5640

ggccatccgg cccgacgagg ccggcctggt ctacgacgcg ctgcttcagc gcgtggctcg     5700

ttacaacagc ggcaacgtgc agaccaacct ggaccggctg gtgggggatg tgcgcgaggc     5760

cgtggcgcag cgtgagcgcg cgcagcagca gggcaacctg ggctccatgg ttgcactaaa     5820

cgccttcctg agtacacagc ccgccaacgt gccgcgggga caggaggact acaccaactt     5880

tgtgagcgca ctgcggctaa tggtgactga gacaccgcaa agtgaggtgt accagtctgg     5940

gccagactat tttttccaga ccagtagaca aggcctgcag accgtaaacc tgagccaggc     6000

tttcaaaaac ttgcaggggc tgtggggggt gcgggctccc acaggcgacc gcgcgaccgt     6060

gtctagcttg ctgacgccca actcgcgcct gttgctgctg ctaatagcgc ccttcacgga     6120

cagtggcagc gtgtcccggg acacatacct aggtcacttg ctgacactgt accgcgaggc     6180

cataggtcag gcgcatgtgg acgagcatac tttccaggag attacaagtg tcagccgcgc     6240

gctggggcag gaggacacgg gcagcctgga ggcaacccta aactacctgc tgaccaaccg     6300

gcggcagaag atcccctcgt tgcacagttt cgcacccttt ggcgcatccc attctccagt     6360

aactttatgt ccatgggcgc actcacagac ctgggccaaa accttctcta cgccaactcc     6420

gcccacgcgc tagacatgac ttttgaggtg gatcccatgg acgagcccac ccttctttat     6480

gttttgtttg aagtctttga cgtggtccgt gtgcaccggc cgcaccgcgg cgtcatcgaa     6540

accgtgtacc tgcgcacgcc cttctcggcc ggcaacgcca caacataaag aagcaagcaa     6600

catcaacaac agctgccgcc atgggctcca gtgagcagga actgaaagcc attgtcaaag     6660

atcttggttg tgggccatat tttttgggca cctatgacaa gcgctttcca ggctttgttt     6720

ctccacacaa gctcgcctgc gccatagtca atacggccgg tcgcgagact gggggcgtac     6780

actggatggc ctttgcctgg aacccgcact caaaaacatg ctacctcttt gagccctttg     6840

gcttttctga ccagcgactc aagcaggttt accagtttga gtacgagtca ctcctgcgcc     6900

gtagcgccat tgcttcttcc cccgaccgct gtataacgct ggaaaagtcc acccaaagcg     6960

tacaggggcc caactcggcc gcctgtggac tattctgctg catgtttctc cacgcctttg     7020

ccaactggcc ccaaactccc atggatcaca accccaccat gaaccttatt accggggtac     7080

ccaactccat gctcaacagt ccccaggtac agcccaccct gcgtcgcaac caggaacagc     7140

tctacagctt cctggagcgc cactcgccct acttccgcag ccacagtgcg cagattagga     7200

gcgccacttc tttttgtcac ttgaaaaaca tgtaaaaata atgtactaga gacactttca     7260

ataaaggcaa atgcttttat ttgtacactc tcgggtgatt atttaccccc acccttgccg     7320

tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc actggcaggg     7380

acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc atccgcggca     7440

gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt agcaggtcgg     7500

gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag ttgcgataca     7560

cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc agcacgctct     7620

tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac ggagtcaact     7680

ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac tcgcaccgta     7740

gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc tgcataaaag     7800

ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac atgccgcaag     7860

acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac cttgcgtcgg     7920

tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg gccttgctag     7980

actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca atcacgtgct     8040

ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc tcagcgcagc     8100

ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc tctgcaaacg     8160

actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg ttgctggtga     8220

aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg gccgccagag     8280

cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc acgtggtact     8340

tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg atcggcacac     8400

tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc tcttcctctt     8460

gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact gtgcgcttac     8520

ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt tgtagcgcca     8580

catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg cgctcgggct     8640

tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc gccgaggtcg     8700

atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct tcctcgtcct     8760

cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc ggcggcgacg     8820

gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg cgtccgcgct     8880

cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc tataggcaga     8940

aaaagatcat ggagtcagtc gagaagaagg acagcctaac cgccccctct gagttcgcca     9000

ccaccgcctc caccgatgcc gccaacgcgc ctaccacctt ccccgtcgag gcacccccgc     9060

ttgaggagga ggaagtgatt atcgagcagg acccaggttt tgtaagcgaa gacgacgagg     9120

accgctcagt accaacagag gataaaaagc aagaccagga caacgcagag gcaaacgagg     9180

aacaagtcgg gcggggggac gaaaggcatg gcgactacct agatgtggga gacgacgtgc     9240

tgttgaagca tctgcagcgc cagtgcgcca ttatctgcga cgcgttgcaa gagcgcagcg     9300

atgtgcccct cgccatagcg gatgtcagcc ttgcctacga acgccaccta ttctcaccgc     9360

gcgtaccccc caaacgccaa gaaaacggca catgcgagcc caacccgcgc ctcaacttct     9420

accccgtatt tgccgtgcca gaggtgcttg ccacctatca catctttttc caaaactgca     9480

agatacccct atcctgccgt gccaaccgca gccgagcgga caagcagctg gccttgcggc     9540

agggcgctgt catacctgat atcgcctcgc tcaacgaagt gccaaaaatc tttgagggtc     9600

ttggacgcga cgagaagcgc gcggcaaacg ctctgcaaca ggaaaacagc gaaaatgaaa     9660

gtcactctgg agtgttggtg gaactcgagg gtgacaacgc gcgcctagcc gtactaaaac     9720

gcagcatcga ggtcacccac tttgcctacc cggcacttaa cctacccccc aaggtcatga     9780

gcacagtcat gagtgagctg atcgtgcgcc gtgcgcagcc cctggagagg gatgcaaatt     9840

tgcaagaaca aacagaggag ggcctacccg cagttggcga cgagcagcta gcgcgctggc     9900

ttcaaacgcg cgagcctgcc gacttggagg agcgacgcaa actaatgatg gccgcagtgc     9960

tcgttaccgt ggagcttgag tgcatgcagc ggttctttgc tgacccggag atgcagcgca    10020

agctagagga aacattgcac tacacctttc gacagggcta cgtacgccag gcctgcaaga    10080

tctccaacgt ggagctctgc aacctggtct cctaccttgg aattttgcac gaaaaccgcc    10140

ttgggcaaaa cgtgcttcat tccacgctca agggcgaggc gcgccgcgac tacgtccgcg    10200

actgcgttta cttatttcta tgctacacct ggcagacggc catgggcgtt tggcagcagt    10260

gcttggagga gtgcaacctc aaggagctgc agaaactgct aaagcaaaac ttgaaggacc    10320

tatggacggc cttcaacgag cgctccgtgg ccgcgcacct ggcggacatc attttccccg    10380

aacgcctgct taaaaccctg caacagggtc tgccagactt caccagtcaa agcatgttgc    10440

agaactttag gaactttatc ctagagcgct caggaatctt gcccgccacc tgctgtgcac    10500

ttcctagcga ctttgtgccc attaagtacc gcgaatgccc tccgccgctt tggggccact    10560

gctaccttct gcagctagcc aactaccttg cctaccactc tgacataatg gaagacgtga    10620

gcggtgacgg tctactggag tgtcactgtc gctgcaacct atgcaccccg caccgctccc    10680

tggtttgcaa ttcgcagctg cttaacgaaa gtcaaattat cggtaccttt gagctgcagg    10740

gtccctcgcc tgacgaaaag tccgcggctc cggggttgaa actcactccg gggctgtgga    10800

cgtcggctta ccttcgcaaa tttgtacctg aggactacca cgcccacgag attaggttct    10860

acgaagacca atcccgcccg ccaaatgcgg agcttaccgc ctgcgtcatt acccagggcc    10920

acattcttgg ccaattgcaa gccatcaaca aagcccgcca agagtttctg ctacgaaagg    10980

gacggggggt ttacttggac ccccagtccg gcgaggagct caacccaatc cccccgccgc    11040

cgcagcccta tcagcagcag ccgcgggccc ttgcttccca ggatggcacc caaaaagaag    11100

ctgcagctgc cgccgccacc cacggacgag gaggaatact gggacagtca ggcagaggag    11160

gttttggacg aggaggagga ggacatgatg gaagactggg agagcctaga cgaggaagct    11220

tccgaggtcg aagaggtgtc agacgaaaca ccgtcaccct cggtcgcatt cccctcgccg    11280

gcgccccaga aatcggcaac cggttccagc atggctacaa cctccgctcc tcaggcgccg    11340

ccggcactgc ccgttcgccg acccaaccgt agatgggaca ccactggaac cagggccggt    11400

aagtccaagc agccgccgcc gttagcccaa gagcaacaac agcgccaagg ctaccgctca    11460

tggcgcgggc acaagaacgc catagttgct tgcttgcaag actgtggggg caacatctcc    11520

ttcgcccgcc gctttcttct ctaccatcac ggcgtggcct tcccccgtaa catcctgcat    11580

tactaccgtc atctctacag cccatactgc accggcggca gcggcagcgg cagcaacagc    11640

agcggccaca cagaagcaaa ggcgaccgga tagcaagact ctgacaaagc ccaagaaatc    11700

cacagcggcg gcagcagcag gaggaggagc gctgcgtctg gcgcccaacg aacccgtatc    11760

gacccgcgag cttagaaaca ggatttttcc cactctgtat gctatatttc aacagagcag    11820

gggccaagaa caagagctga aaataaaaaa caggtctctg cgatccctca cccgcagctg    11880

cctgtatcac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg aggctctctt    11940

cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc aaatttaagc    12000

gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgtcg tcagcgccat    12060

tatgagcaag gaaattccca cgccctacat gtggagttac cagccacaaa tgggacttgc    12120

ggctggagct gcccaagact actcaacccg aataaactac atgagcgcgg gaccccacat    12180

gatatcccgg gtcaacggaa tccgcgccca ccgaaaccga attctcttgg aacaggcggc    12240

tattaccacc acacctcgta ataaccttaa tccccgtagt tggcccgctg ccctggtgta    12300

ccaggaaagt cccgctccca ccactgtggt acttcccaga gacgcccagg ccgaagttca    12360

gatgactaac tcaggggcgc agcttgcggg cggctttcgt cacagggtgc ggtcgcccgg    12420

gcagggtata actcacctga caatcagagg gcgaggtatt cagctcaacg acgagtcggt    12480

gagctcctcg cttggtctcc gtccggacgg gacatttcag atcggcggcg ccggccgtcc    12540

ttcattcacg cctcgtcagg caatcctaac tctgcagacc tcgtcctctg agccgcgctc    12600

tggaggcatt ggaactctgc aatttattga ggagtttgtg ccatcggtct actttaaccc    12660

cttctcggga cctcccggcc actatccgga tcaatttatt cctaactttg acgcggtaaa    12720

ggactcggcg gacggctacg actgaatgtt aagtggagag gcagagcaac tgcgcctgaa    12780

acacctggtc cactgtcgcc gccacaagtg ctttgcccgc gactccggtg agttttgcta    12840

ctttgaattg cccgaggatc atatcgaggg cccggcgcac ggcgtccggc ttaccgccca    12900

gggagagctt gcccgtagcc tgattcggga gtttacccag cgccccctgc tagttgagcg    12960

ggacagggga ccctgtgttc tcactgtgat ttgcaactgt cctaaccttg gattacatca    13020

agatcctcta gttaattaac tagagtaccc ggggatctta ttccctttaa ctaataaaaa    13080

aaaataataa agcatcactt acttaaaatc agttagcaaa tttctgtcca gtttattcag    13140

cagcacctcc ttgccctcct cccagctctg gtattgcagc ttcctcctgg ctgcaaactt    13200

tctccacaat ctaaatggaa tgtcagtttc ctcctgttcc tgtccatccg cacccactat    13260

cttcatgttg ttgcagatga agcgcgcaag accgtctgaa gataccttca accccgtgta    13320

tccatatgac acggaaaccg gtcctccaac tgtgcctttt cttactcctc cctttgtatc    13380

ccccaatggg tttcaagaga gtccccctgg ggtactctct ttgcgcctat ccgaacctct    13440

agttacctcc aatggcatgc ttgcgctcaa aatgggcaac ggcctctctc tggacgaggc    13500

cggcaacctt acctcccaaa atgtaaccac tgtgagccca cctctcaaaa aaaccaagtc    13560

aaacataaac ctggaaatat ctgcacccct cacagttacc tcagaagccc taactgtggc    13620

tgccgccgca cctctaatgg tcgcgggcaa cacactcacc atgcaatcac aggccccgct    13680

aaccgtgcac gactccaaac ttagcattgc cacccaagga cccctcacag tgtcagaagg    13740

aaagctagcc ctgcaaacat caggccccct caccaccacc gatagcagta cccttactat    13800

cactgcctca ccccctctaa ctactgccac tggtagcttg ggcattgact tgaaagagcc    13860

catttataca caaaatggaa aactaggact aaagtacggg gctcctttgc atgtaacaga    13920

cgacctaaac actttgaccg tagcaactgg tccaggtgtg actattaata atacttcctt    13980

gcaaactaaa gttactggag ccttgggttt tgattcacaa ggcaatatgc aacttaatgt    14040

agcaggagga ctaaggattg attctcaaaa cagacgcctt atacttgatg ttagttatcc    14100

gtttgatgct caaaaccaac taaatctaag actaggacag ggccctcttt ttataaactc    14160

agcccacaac ttggatatta actacaacaa aggcctttac ttgtttacag cttcaaacaa    14220

ttccaaaaag cttgaggtta acctaagcac tgccaagggg ttgatgtttg acgctacagc    14280

catagccatt aatgcaggag atgggcttga atttggttca cctaatgcac caaacacaaa    14340

tcccctcaaa acaaaaattg gccatggcct agaatttgat tcaaacaagg ctatggttcc    14400

taaactagga actggcctta gttttgacag cacaggtgcc attacagtag gaaacaaaaa    14460

taatgataag ctaactttgt ggaccacacc agctccatct cctaactgta gactaaatgc    14520

agagaaagat gctaaactca ctttggtctt aacaaaatgt ggcagtcaaa tacttgctac    14580

agtttcagtt ttggctgtta aaggcagttt ggctccaata tctggaacag ttcaaagtgc    14640

tcatcttatt ataagatttg acgaaaatgg agtgctacta aacaattcct tcctggaccc    14700

agaatattgg aactttagaa atggagatct tactgaaggc acagcctata caaacgctgt    14760

tggatttatg cctaacctat cagcttatcc aaaatctcac ggtaaaactg ccaaaagtaa    14820

cattgtcagt caagtttact taaacggaga caaaactaaa cctgtaacac taaccattac    14880

actaaacggt acacaggaaa caggagacac aactccaagt gcatactcta tgtcattttc    14940

atgggactgg tctggccaca actacattaa tgaaatattt gccacatcct cttacacttt    15000

ttcatacatt gcccaagaat aaagaatcgt ttgtgttatg tttcaacgtg tttatttttc    15060

aattgcagaa aatttcaagt catttttcat tcagtagtat agccccacca ccacatagct    15120

tatacagatc accgtacctt aatcaaactc acagaaccct agtattcaac ctgccacctc    15180

cctcccaaca cacagagtac acagtccttt ctccccggct ggccttaaaa agcatcatat    15240

catgggtaac agacatattc ttaggtgtta tattccacac ggtttcctgt cgagccaaac    15300

gctcatcagt gatattaata aactccccgg gcagctcact taagttcatg tcgctgtcca    15360

gctgctgagc cacaggctgc tgtccaactt gcggttgctt aacgggcggc gaaggagaag    15420

tccacgccta catgggggta gagtcataat cgtgcatcag gatagggcgg tggtgctgca    15480

gcagcgcgcg aataaactgc tgccgccgcc gctccgtcct gcaggaatac aacatggcag    15540

tggtctcctc agcgatgatt cgcaccgccc gcagcataag gcgccttgtc ctccgggcac    15600

agcagcgcac cctgatctca cttaaatcag cacagtaact gcagcacagc accacaatat    15660

tgttcaaaat cccacagtgc aaggcgctgt atccaaagct catggcgggg accacagaac    15720

ccacgtggcc atcataccac aagcgcaggt agattaagtg gcgacccctc ataaacacgc    15780

tggacataaa cattacctct tttggcatgt tgtaattcac cacctcccgg taccatataa    15840

acctctgatt aaacatggcg ccatccacca ccatcctaaa ccagctggcc aaaacctgcc    15900

cgccggctat acactgcagg gaaccgggac tggaacaatg acagtggaga gcccaggact    15960

cgtaaccatg gatcatcatg ctcgtcatga tatcaatgtt ggcacaacac aggcacacgt    16020

gcatacactt cctcaggatt acaagctcct cccgcgttag aaccatatcc cagggaacaa    16080

cccattcctg aatcagcgta aatcccacac tgcagggaag acctcgcacg taactcacgt    16140

tgtgcattgt caaagtgtta cattcgggca gcagcggatg atcctccagt atggtagcgc    16200

gggtttctgt ctcaaaagga ggtagacgat ccctactgta cggagtgcgc cgagacaacc    16260

gagatcgtgt tggtcgtagt gtcatgccaa atggaacgcc ggacgtagtc atatttcctg    16320

aagcaaaacc aggtgcgggc gtgacaaaca gatctgcgtc tccggtctcg ccgcttagat    16380

cgctctgtgt agtagttgta gtatatccac tctctcaaag catccaggcg ccccctggct    16440

tcgggttcta tgtaaactcc ttcatgcgcc gctgccctga taacatccac caccgcagaa    16500

taagccacac ccagccaacc tacacattcg ttctgcgagt cacacacggg aggagcggga    16560

agagctggaa gaaccatgtt ttttttttta ttccaaaaga ttatccaaaa cctcaaaatg    16620

aagatctatt aagtgaacgc gctcccctcc ggtggcgtgg tcaaactcta cagccaaaga    16680

acagataatg gcatttgtaa gatgttgcac aatggcttcc aaaaggcaaa cggccctcac    16740

gtccaagtgg acgtaaaggc taaacccttc agggtgaatc tcctctataa acattccagc    16800

accttcaacc atgcccaaat aattctcatc tcgccacctt ctcaatatat ctctaagcaa    16860

atcccgaata ttaagtccgg ccattgtaaa aatctgctcc agagcgccct ccaccttcag    16920

cctcaagcag cgaatcatga ttgcaaaaat tcaggttcct cacagacctg tataagattc    16980

aaaagcggaa cattaacaaa aataccgcga tcccgtaggt cccttcgcag ggccagctga    17040

acataatcgt gcaggtctgc acggaccagc gcggccactt ccccgccagg aaccttgaca    17100

aaagaaccca cactgattat gacacgcata ctcggagcta tgctaaccag cgtagccccg    17160

atgtaagctt tgttgcatgg gcggcgatat aaaatgcaag gtgctgctca aaaaatcagg    17220

caaagcctcg cgcaaaaaag aaagcacatc gtagtcatgc tcatgcagat aaaggcaggt    17280

aagctccgga accaccacag aaaaagacac catttttctc tcaaacatgt ctgcgggttt    17340

ctgcataaac acaaaataaa ataacaaaaa aacatttaaa cattagaagc ctgtcttaca    17400

acaggaaaaa caacccttat aagcataaga cggactacgg ccatgccggc gtgaccgtaa    17460

aaaaactggt caccgtgatt aaaaagcacc accgacagct cctcggtcat gtccggagtc    17520

ataatgtaag actcggtaaa cacatcaggt tgattcatcg gtcagtgcta aaaagcgacc    17580

gaaatagccc gggggaatac atacccgcag gcgtagagac aacattacag cccccatagg    17640

aggtataaca aaattaatag gagagaaaaa cacataaaca cctgaaaaac cctcctgcct    17700

aggcaaaata gcaccctccc gctccagaac aacatacagc gcttcacagc ggcagcctaa    17760

cagtcagcct taccagtaaa aaagaaaacc tattaaaaaa acaccactcg acacggcacc    17820

agctcaatca gtcacagtgt aaaaaagggc caagtgcaga gcgagtatat ataggactaa    17880

aaaatgacgt aacggttaaa gtccacaaaa aacacccaga aaaccgcacg cgaacctacg    17940

cccagaaacg aaagccaaaa aacccacaac ttcctcaaat cgtcacttcc gttttcccac    18000

gttacgtaac ttcccatttt aagaaaacta caattcccaa cacatacaag ttactccgcc    18060

ctaaaaccta cgtcacccgc cccgttccca cgccccgcgc cacgtcacaa actccacccc    18120

ctcattatca tattggcttc aatccaaaat aaggtatatt attgatgatt tattttggat    18180

tgaagccaat atgataatga gggggtggag tttgtgacgt ggcgcggggc gtgggaacgg    18240

ggcgggtgac gtagtagtgt ggcggaagtg tgatgttgca agtgtggcgg aacacatgta    18300

agcgacggat gtggcaaaag tgacgttttt ggtgtgcgcc ggatccacag gacgggtgtg    18360

gtcgccatga tcgcgtagtc gatagtggct ccaagtagcg aagcgagcag gactgggcgg    18420

cggccaaagc ggtcggacag tgctccgaga acgggtgcgc atagaaattg catcaacgca    18480

tatagcgcta gcagcacgcc atagtgactg gcgatgctgt cggaatggac gatatcccgc    18540

aagaggcccg gcagtaccgg cataaccaag cctatgccta cagcatccag ggtgacggtg    18600

ccgaggatga cgatgagcgc attgttagat ttcatacacg gtgcctgact gcgttagcaa    18660

tttaactgtg ataaactacc gcattaaagc ttatcgaatt cgtaatcatg gtcatagctg    18720

tttcctgtgt gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata    18780

aagtgtaaag cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca    18840

ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc    18900

gcggggagag gcggtttgcg tattgggcgc                                     18930


<210> 70
<211> 8376
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 70
aattcccatc atcaataata taccttattt tggattgaag ccaatatgat aatgaggggg       60

tggagtttgt gacgtggcgc ggggcgtggg aacggggcgg gtgacgtagt agtctctaga      120

gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat gtggtcacgc      180

tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga ggtttgaacg      240

cgcagccacc acgccggggt tttacgagat tgtgattaag gtccccagcg accttgacgg      300

gcatctgccc ggcatttctg acagctttgt gaactgggtg gccgagaagg aatgggagtt      360

gccgccagat tctgacatgg atctgaatct gattgagcag gcacccctga ccgtggccga      420

gaagctgcag cgcgactttc tgacggaatg gcgccgtgtg agtaaggccc cggaggccct      480

tttctttgtg caatttgaga agggagagag ctacttccac atgcacgtgc tcgtggaaac      540

caccggggtg aaatccatgg ttttgggacg tttcctgagt cagattcgcg aaaaactgat      600

tcagagaatt taccgcggga tcgagccgac tttgccaaac tggttcgcgg tcacaaagac      660

cagaaatggc gccggaggcg ggaacaaggt ggtggatgag tgctacatcc ccaattactt      720

gctccccaaa acccagcctg agctccagtg ggcgtggact aatatggaac agtatttaag      780

cgcctgtttg aatctcacgg agcgtaaacg gttggtggcg cagcatctga cgcacgtgtc      840

gcagacgcag gagcagaaca aagagaatca gaatcccaat tctgatgcgc cggtgatcag      900

atcaaaaact tcagccaggt acatggagct ggtcgggtgg ctcgtggaca aggggattac      960

ctcggagaag cagtggatcc aggaggacca ggcctcatac atctccttca atgcggcctc     1020

caactcgcgg tcccaaatca aggctgcctt ggacaatgcg ggaaagatta tgagcctgac     1080

taaaaccgcc cccgactacc tggtgggcca gcagcccgtg gaggacattt ccagcaatcg     1140

gatttataaa attttggaac taaacgggta cgatccccaa tatgcggctt ccgtctttct     1200

gggatgggcc acgaaaaagt tcggcaagag gaacaccatc tggctgtttg ggcctgcaac     1260

taccgggaag accaacatcg cggaggccat agcccacact gtgcccttct acgggtgcgt     1320

aaactggacc aatgagaact ttcccttcaa cgactgtgtc gacaagatgg tgatctggtg     1380

ggaggagggg aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc tcggaggaag     1440

caaggtgcgc gtggaccaga aatgcaagtc ctcggcccag atagacccga ctcccgtgat     1500

cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg aactcaacga ccttcgaaca     1560

ccagcagccg ttgcaagacc ggatgttcaa atttgaactc acccgccgtc tggatcatga     1620

ctttgggaag gtcaccaagc aggaagtcaa agactttttc cggtgggcaa aggatcacgt     1680

ggttgaggtg gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa gacccgcccc     1740

cagtgacgca gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc agccatcgac     1800

gtcagacgcg gaagcttcga tcaactacgc agacaggtac caaaacaaat gttctcgtca     1860

cgtgggcatg aatctgatgc tgtttccctg cagacaatgc gagagaatga atcagaattc     1920

aaatatctgc ttcactcacg gacagaaaga ctgtttagag tgctttcccg tgtcagaatc     1980

tcaacccgtt tctgtcgtca aaaaggcgta tcagaaactg tgctacattc atcatatcat     2040

gggaaaggtg ccagacgctt gcactgcctg cgatctggtc aatgtggatt tggatgactg     2100

catctttgaa caataaatga tttaaatcag gtatggctgc cgatggttat cttccagatt     2160

ggctcgagga cactctctct gaaggaataa gacagtggtg gaagctcaaa cctggcccac     2220

caccaccaaa gcccgcagag cggcataagg acgacagcag gggtcttgtg cttcctgggt     2280

acaagtacct cggacccttc aacggactcg acaagggaga gccggtcaac gaggcagacg     2340

ccgcggccct cgagcacgac aaagcctacg accggcagct cgacagcgga gacaacccgt     2400

acctcaagta caaccacgcc gacgcggagt ttcaggagcg ccttaaagaa gatacgtctt     2460

ttgggggcaa cctcggacga gcagtcttcc aggcgaaaaa gagggttctt gaacctctgg     2520

gcctggttga ggaacctgtt aagacggctc cgggaaaaaa gaggccggta gagcactctc     2580

ctgtggagcc agactcctcc tcgggaaccg gaaaggcggg ccagcagcct gcaagaaaaa     2640

gattgaattt tggtcagact ggagacgcag actcagtacc tgacccccag cctctcggac     2700

agccaccagc agccccctct ggtctgggaa ctaatacgat ggctacaggc agtggcgcac     2760

caatggcaga caataacgag ggcgccgacg gagtgggtaa ttcctcggga aattggcatt     2820

gcgattccac atggatgggc gacagagtca tcaccaccag cacccgaacc tgggccctgc     2880

ccacctacaa caaccacctc tacaaacaaa tttccagcca atcaggagcc tcgaacgaca     2940

atcactactt tggctacagc accccttggg ggtattttga cttcaacaga ttccactgcc     3000

acttttcacc acgtgactgg caaagactca tcaacaacaa ctggggattc cgacccaaga     3060

gactcaactt caagctcttt aacattcaag tcaaagaggt cacgcagaat gacggtacga     3120

cgacgattgc caataacctt accagcacgg ttcaggtgtt tactgactcg gagtaccagc     3180

tcccgtacgt cctcggctcg gcgcatcaag gatgcctccc gccgttccca gcagacgtct     3240

tcatggtgcc acagtatgga tacctcaccc tgaacaacgg gagtcaggca gtaggacgct     3300

cttcatttta ctgcctggag tactttcctt ctcagatgct gcgtaccgga aacaacttta     3360

ccttcagcta cacttttgag gacgttcctt tccacagcag ctacgctcac agccagagtc     3420

tggaccgtct catgaatcct ctcatcgacc agtacctgta ttacttgagc agaacaaaca     3480

ctccaagtgg aaccaccacg cagtcaaggc ttcagttttc tcaggccgga gcgagtgaca     3540

ttcgggacca gtctaggaac tggcttcctg gaccctgtta ccgccagcag cgagtatcaa     3600

agacatctgc ggataacaac aacagtgaat actcgtggac tggagctacc aagtaccacc     3660

tcaatggcag agactctctg gtgaatccgg gcccggccat ggcaagccac aaggacgatg     3720

aagaaaagtt ttttcctcag agcggggttc tcatctttgg gaagcaaggc tcagagaaaa     3780

caaatgtgga cattgaaaag gtcatgatta cagacgaaga ggaaatcagg acaaccaatc     3840

ccgtggctac ggagcagtat ggttctgtat ctaccaacct ccagagaggc aacagacaag     3900

cagctaccgc agatgtcaac acacaaggcg ttcttccagg catggtctgg caggacagag     3960

atgtgtacct tcaggggccc atctgggcaa agattccaca cacggacgga cattttcacc     4020

cctctcccct catgggtgga ttcggactta aacaccctcc tccacagatt ctcatcaaga     4080

acaccccggt acctgcgaat ccttcgacca ccttcagtgc ggcaaagttt gcttccttca     4140

tcacacagta ctccacggga caggtcagcg tggagatcga gtgggagctg cagaaggaaa     4200

acagcaaacg ctggaatccc gaaattcagt acacttccaa ctacaacaag tctgttaatg     4260

tggactttac tgtggacact aatggcgtgt attcagagcc tcgccccatt ggcaccagat     4320

acctgactcg taatctgtaa ttgcttgtta atcaataaac cgtttaattc gtttcagttg     4380

aactttggtc tctgcgtatt tctttcttat ctagtttcca tgctctagag tcctgtatta     4440

gaggtcacgt gagtgttttg cgacattttg cgacaccatg tggtcacgct gggtatttaa     4500

gcccgagtga gcacgcaggg tctccatttt gaagcgggag gtttgaacgc gcagccacca     4560

cggcggggtt ttacgagatt gtgattaagg tccccagcga ccttgacggg catctgcccg     4620

gcatttctga cagctttgtg aactgggtgg ccgagaagga atgggagttg ccgccagatt     4680

ctgacatgga tctgaatctg attgagcagg cacccctgac cgtggccgag aagctgcatc     4740

gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga     4800

atggcgaatg gaattccaga cgattgagcg tcaaaatgta ggtatttcca tgagcgtttt     4860

tcctgttgca atggctggcg gtaatattgt tctggatatt accagcaagg ccgatagttt     4920

gagttcttct actcaggcaa gtgatgttat tactaatcaa agaagtattg cgacaacggt     4980

taatttgcgt gatggacaga ctcttttact cggtggcctc actgattata aaaacacttc     5040

tcaggattct ggcgtaccgt tcctgtctaa aatcccttta atcggcctcc tgtttagctc     5100

ccgctctgat tctaacgagg aaagcacgtt atacgtgctc gtcaaagcaa ccatagtacg     5160

cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta     5220

cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt     5280

tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg     5340

ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat     5400

cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac     5460

tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag     5520

ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg     5580

cgaattttaa caaaatatta acgtttacaa tttaaatatt tgcttataca atcttcctgt     5640

ttttggggct tttctgatta tcaaccgggg tacatatgat tgacatgcta gttttacgat     5700

taccgttcat cgattctctt gtttgctcca gactctcagg caatgacctg atagcctttg     5760

tagagacctc tcaaaaatag ctaccctctc cggcatgaat ttatcagcta gaacggttga     5820

atatcatatt gatggtgatt tgactgtctc cggcctttct cacccgtttg aatctttacc     5880

tacacattac tcaggcattg catttaaaat atatgagggt tctaaaaatt tttatccttg     5940

cgttgaaata aaggcttctc ccgcaaaagt attacagggt cataatgttt ttggtacaac     6000

cgatttagct ttatgctctg aggctttatt gcttaatttt gctaattctt tgccttgcct     6060

gtatgattta ttggatgttg gaattcctga tgcggtattt tctccttacg catctgtgcg     6120

gtatttcaca ccgcatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa     6180

gccagccccg acacccgcca acacccgctg acgcgccctg acgggcttgt ctgctcccgg     6240

catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag aggttttcac     6300

cgtcatcacc gaaacgcgcg agacgaaagg gcctcgtgat acgcctattt ttataggtta     6360

atgtcatgat aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg     6420

gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat     6480

aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc     6540

gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa     6600

cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac     6660

tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga     6720

tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag     6780

agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca     6840

cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca     6900

tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa     6960

ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc     7020

tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa     7080

cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag     7140

actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct     7200

ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac     7260

tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa     7320

ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt     7380

aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat     7440

ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg     7500

agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc     7560

ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg     7620

tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag     7680

cgcagatacc aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact     7740

ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg     7800

gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc     7860

ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg     7920

aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg     7980

cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag     8040

ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc     8100

gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct     8160

ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc     8220

ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc     8280

gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac     8340

cgcctctccc cgcgcgttgg ccgattcatt aatgca                               8376


<210> 71
<211> 621
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 71
Thr Pro Gly Phe Tyr Glu Ile Val Ile Lys Val Pro Ser Asp Leu Asp 
1               5                   10                  15      


Gly His Leu Pro Gly Ile Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
            20                  25                  30          


Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu Ile 
        35                  40                  45              


Glu Gln Ala Pro Leu Thr Val Ala Glu Lys Leu Gln Arg Asp Phe Leu 
    50                  55                  60                  


Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65                  70                  75                  80  


Gln Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu 
                85                  90                  95      


Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gln Ile 
            100                 105                 110         


Arg Glu Lys Leu Ile Gln Arg Ile Tyr Arg Gly Ile Glu Pro Thr Leu 
        115                 120                 125             


Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
    130                 135                 140                 


Asn Lys Val Val Asp Glu Cys Tyr Ile Pro Asn Tyr Leu Leu Pro Lys 
145                 150                 155                 160 


Thr Gln Pro Glu Leu Gln Trp Ala Trp Thr Asn Met Glu Gln Tyr Leu 
                165                 170                 175     


Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gln His 
            180                 185                 190         


Leu Thr His Val Ser Gln Thr Gln Glu Gln Asn Lys Glu Asn Gln Asn 
        195                 200                 205             


Pro Asn Ser Asp Ala Pro Val Ile Arg Ser Lys Thr Ser Ala Arg Tyr 
    210                 215                 220                 


Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly Ile Thr Ser Glu Lys 
225                 230                 235                 240 


Gln Trp Ile Gln Glu Asp Gln Ala Ser Tyr Ile Ser Phe Asn Ala Ala 
                245                 250                 255     


Ser Asn Ser Arg Ser Gln Ile Lys Ala Ala Leu Asp Asn Ala Gly Lys 
            260                 265                 270         


Ile Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gln Gln 
        275                 280                 285             


Pro Val Glu Asp Ile Ser Ser Asn Arg Ile Tyr Lys Ile Leu Glu Leu 
    290                 295                 300                 


Asn Gly Tyr Asp Pro Gln Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 
305                 310                 315                 320 


Thr Lys Lys Phe Gly Lys Arg Asn Thr Ile Trp Leu Phe Gly Pro Ala 
                325                 330                 335     


Thr Thr Gly Lys Thr Asn Ile Ala Glu Ala Ile Ala His Thr Val Pro 
            340                 345                 350         


Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
        355                 360                 365             


Cys Val Asp Lys Met Val Ile Trp Trp Glu Glu Gly Lys Met Thr Ala 
    370                 375                 380                 


Lys Val Val Glu Ser Ala Lys Ala Ile Leu Gly Gly Ser Lys Val Arg 
385                 390                 395                 400 


Val Asp Gln Lys Cys Lys Ser Ser Ala Gln Ile Asp Pro Thr Pro Val 
                405                 410                 415     


Ile Val Thr Ser Asn Thr Asn Met Cys Ala Val Ile Asp Gly Asn Ser 
            420                 425                 430         


Thr Thr Phe Glu His Gln Gln Pro Leu Gln Asp Arg Met Phe Lys Phe 
        435                 440                 445             


Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gln 
    450                 455                 460                 


Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val 
465                 470                 475                 480 


Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala 
                485                 490                 495     


Pro Ser Asp Ala Asp Ile Ser Glu Pro Lys Arg Val Arg Glu Ser Val 
            500                 505                 510         


Ala Gln Pro Ser Thr Ser Asp Ala Glu Ala Ser Ile Asn Tyr Ala Asp 
        515                 520                 525             


Arg Tyr Gln Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu 
    530                 535                 540                 


Phe Pro Cys Arg Gln Cys Glu Arg Met Asn Gln Asn Ser Asn Ile Cys 
545                 550                 555                 560 


Phe Thr His Gly Gln Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu 
                565                 570                 575     


Ser Gln Pro Val Ser Val Val Lys Lys Ala Tyr Gln Lys Leu Cys Tyr 
            580                 585                 590         


Ile His His Ile Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp 
        595                 600                 605             


Leu Val Asn Val Asp Leu Asp Asp Cys Ile Phe Glu Gln 
    610                 615                 620     


<210> 72
<211> 735
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 72
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
            20                  25                  30          


Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Arg Gln Leu Asp Ser Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu His Ser Pro Val Glu Pro Asp Ser Ser Ser Gly Thr Gly 
145                 150                 155                 160 


Lys Ala Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ala Asp Ser Val Pro Asp Pro Gln Pro Leu Gly Gln Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Leu Gly Thr Asn Thr Met Ala Thr Gly Ser Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His Tyr 
            260                 265                 270         


Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
        275                 280                 285             


Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp 
    290                 295                 300                 


Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln Val 
305                 310                 315                 320 


Lys Glu Val Thr Gln Asn Asp Gly Thr Thr Thr Ile Ala Asn Asn Leu 
                325                 330                 335     


Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu Pro Tyr 
            340                 345                 350         


Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp 
        355                 360                 365             


Val Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
    370                 375                 380                 


Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385                 390                 395                 400 


Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
                405                 410                 415     


Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp Arg 
            420                 425                 430         


Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Arg Thr 
        435                 440                 445             


Asn Thr Pro Ser Gly Thr Thr Thr Gln Ser Arg Leu Gln Phe Ser Gln 
    450                 455                 460                 


Ala Gly Ala Ser Asp Ile Arg Asp Gln Ser Arg Asn Trp Leu Pro Gly 
465                 470                 475                 480 


Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Ser Ala Asp Asn Asn 
                485                 490                 495     


Asn Ser Glu Tyr Ser Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly 
            500                 505                 510         


Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys Asp 
        515                 520                 525             


Asp Glu Glu Lys Phe Phe Pro Gln Ser Gly Val Leu Ile Phe Gly Lys 
    530                 535                 540                 


Gln Gly Ser Glu Lys Thr Asn Val Asp Ile Glu Lys Val Met Ile Thr 
545                 550                 555                 560 


Asp Glu Glu Glu Ile Arg Thr Thr Asn Pro Val Ala Thr Glu Gln Tyr 
                565                 570                 575     


Gly Ser Val Ser Thr Asn Leu Gln Arg Gly Asn Arg Gln Ala Ala Thr 
            580                 585                 590         


Ala Asp Val Asn Thr Gln Gly Val Leu Pro Gly Met Val Trp Gln Asp 
        595                 600                 605             


Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr 
    610                 615                 620                 


Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625                 630                 635                 640 


His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asn 
                645                 650                 655     


Pro Ser Thr Thr Phe Ser Ala Ala Lys Phe Ala Ser Phe Ile Thr Gln 
            660                 665                 670         


Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln Lys 
        675                 680                 685             


Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr 
    690                 695                 700                 


Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val Tyr 
705                 710                 715                 720 


Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735 


<210> 73
<211> 7582
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 73
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tgggtaccgg      660

gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcctgc agcccggggg      720

atccactagt tctagagtcc tgtattagag gtcacgtgag tgttttgcga cattttgcga      780

caccatgtgg tcacgctggg tatttaagcc cgagtgagca cgcagggtct ccattttgaa      840

gcgggaggtt tgaacgcgca gccgccatgc cggggtttta cgagattgtg attaaggtcc      900

ccagcgacct tgacgagcat ctgcccggca tttctgacag ctttgtgaac tgggtggccg      960

agaaggaatg ggagttgccg ccagattctg acatggatct gaatctgatt gagcaggcac     1020

ccctgaccgt ggccgagaag ctgcagcgcg actttctgac ggaatggcgc cgtgtgagta     1080

aggccccgga ggcccttttc tttgtgcaat ttgagaaggg agagagctac ttccacatgc     1140

acgtgctcgt ggaaaccacc ggggtgaaat ccatggtttt gggacgtttc ctgagtcaga     1200

ttcgcgaaaa actgattcag agaatttacc gcgggatcga gccgactttg ccaaactggt     1260

tcgcggtcac aaagaccaga aatggcgccg gaggcgggaa caaggtggtg gatgagtgct     1320

acatccccaa ttacttgctc cccaaaaccc agcctgagct ccagtgggcg tggactaata     1380

tggaacagta tttaagcgcc tgtttgaatc tcacggagcg taaacggttg gtggcgcagc     1440

atctgacgca cgtgtcgcag acgcaggagc agaacaaaga gaatcagaat cccaattctg     1500

atgcgccggt gatcagatca aaaacttcag ccaggtacat ggagctggtc gggtggctcg     1560

tggacaaggg gattacctcg gagaagcagt ggatccagga ggaccaggcc tcatacatct     1620

ccttcaatgc ggcctccaac tcgcggtccc aaatcaaggc tgccttggac aatgcgggaa     1680

agattatgag cctgactaaa accgcccccg actacctggt gggccagcag cccgtggagg     1740

acatttccag caatcggatt tataaaattt tggaactaaa cgggtacgat ccccaatatg     1800

cggcttccgt ctttctggga tgggccacga aaaagttcgg caagaggaac accatctggc     1860

tgtttgggcc tgcaactacc gggaagacca acatcgcgga ggccatagcc cacactgtgc     1920

ccttctacgg gtgcgtaaac tggaccaatg agaactttcc cttcaacgac tgtgtcgaca     1980

agatggtgat ctggtgggag gaggggaaga tgaccgccaa ggtcgtggag tcggccaaag     2040

ccattctcgg aggaagcaag gtgcgcgtgg accagaaatg caagtcctcg gcccagatag     2100

acccgactcc cgtgatcgtc acctccaaca ccaacatgtg cgccgtgatt gacgggaact     2160

caacgacctt cgaacaccag cagccgttgc aagaccggat gttcaaattt gaactcaccc     2220

gccgtctgga tcatgacttt gggaaggtca ccaagcagga agtcaaagac tttttccggt     2280

gggcaaagga tcacgtggtt gaggtggagc atgaattcta cgtcaaaaag ggtggagcca     2340

agaaaagacc cgcccccagt gacgcagata taagtgagcc caaacgggtg cgcgagtcag     2400

ttgcgcagcc atcgacgtca gacgcggaag cttcgatcaa ctacgcagac aggtaccaaa     2460

acaaatgttc tcgtcacgtg ggcatgaatc tgatgctgtt tccctgcaga caatgcgaga     2520

gaatgaatca gaattcaaat atctgcttca ctcacggaca gaaagactgt ttagagtgct     2580

ttcccgtgtc agaatctcaa cccgtttctg tcgtcaaaaa ggcgtatcag aaactgtgct     2640

acattcatca tatcatggga aaggtgccag acgcttgcac tgcctgcgat ctggtcaatg     2700

tggatttgga tgactgcatc tttgaacaat aaatgattta aatcaggtat ggctgccgat     2760

ggttatcttc cagattggct cgaggacact ctctctgaag gaataagaca gtggtggaag     2820

ctcaaacctg gcccaccacc accaaagccc gcagagcggc ataaggacga cagcaggggt     2880

cttgtgcttc ctgggtacaa gtacctcgga cccttcaacg gactcgacaa gggagagccg     2940

gtcaacgagg cagacgccgc ggccctcgag cacgacaaag cctacgaccg gcagctcgac     3000

agcggagaca acccgtacct caagtacaac cacgccgacg cggagtttca ggagcgcctt     3060

aaagaagata cgtcttttgg gggcaacctc ggacgagcag tcttccaggc gaaaaagagg     3120

gttcttgaac ctctgggcct ggttgaggaa cctgttaaga tggccggcat gatgttcctt     3180

cctactgatt attgttgcag actgagcgac caggaataca tggaactcgt cttcgagaac     3240

ggacagatac tcgcaaaagg ccagaggtca aatgttagtc tccataatca gcggacgaaa     3300

agcatcatgg atctgtatga ggccgaatac aacgaagatt ttatgaaaag tattatccat     3360

ggagggggtg gcgctattac caacctggga gatacccaag tggtcccaca gtcccacgta     3420

gcagccgctc acgagaccaa tatgctggag tccaacaaac acgtagacgg cgccgctccg     3480

ggaaaaaaga ggccggtaga gcactctcct gtggagccag actcctcctc gggaaccgga     3540

aaggcgggcc agcagcctgc aagaaaaaga ttgaattttg gtcagactgg agacgcagac     3600

tcagtacctg acccccagcc tctcggacag ccaccagcag ccccctctgg tctgggaact     3660

aatacgctgg ctacaggcag tggcgcacca ctggcagaca ataacgaggg cgccgacgga     3720

gtgggtaatt cctcgggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc     3780

accaccagca cccgaacctg ggccctgccc acctacaaca accacctcta caaacaaatt     3840

tccagccaat caggagcctc gaacgacaat cactactttg gctacagcac cccttggggg     3900

tattttgact tcaacagatt ccactgccac ttttcaccac gtgactggca aagactcatc     3960

aacaacaact ggggattccg acccaagaga ctcaacttca agctctttaa cattcaagtc     4020

aaagaggtca cgcagaatga cggtacgacg acgattgcca ataaccttac cagcacggtt     4080

caggtgttta ctgactcgga gtaccagctc ccgtacgtcc tcggctcggc gcatcaagga     4140

tgcctcccgc cgttcccagc agacgtcttc atggtgccac agtatggata cctcaccctg     4200

aacaacggga gtcaggcagt aggacgctct tcattttact gcctggagta ctttccttct     4260

cagatgctgc gtaccggaaa caactttacc ttcagctaca cttttgagga cgttcctttc     4320

cacagcagct acgctcacag ccagagtctg gaccgtctca tgaatcctct catcgaccag     4380

tacctgtatt acttgagcag aacaaacact ccaagtggaa ccaccacgca gtcaaggctt     4440

cagttttctc aggccggagc gagtgacatt cgggaccagt ctaggaactg gcttcctgga     4500

ccctgttacc gccagcagcg agtatcaaag acatctgcgg ataacaacaa cagtgaatac     4560

tcgtggactg gagctaccaa gtaccacctc aatggcagag actctctggt gaatccgggc     4620

ccggccatgg caagccacaa ggacgatgaa gaaaagtttt ttcctcagag cggggttctc     4680

atctttggga agcaaggctc agagaaaaca aatgtggaca ttgaaaaggt catgattaca     4740

gacgaagagg aaatcaggac aaccaatccc gtggctacgg agcagtatgg ttctgtatct     4800

accaacctcc agagaggcaa cagacaagca gctaccgcag atgtcaacac acaaggcgtt     4860

cttccaggca tggtctggca ggacagagat gtgtaccttc aggggcccat ctgggcaaag     4920

attccacaca cggacggaca ttttcacccc tctcccctca tgggtggatt cggacttaaa     4980

caccctcctc cacagattct catcaagaac accccggtac ctgcgaatcc ttcgaccacc     5040

ttcagtgcgg caaagtttgc ttccttcatc acacagtact ccacgggaca ggtcagcgtg     5100

gagatcgagt gggagctgca gaaggaaaac agcaaacgct ggaatcccga aattcagtac     5160

acttccaact acaacaagtc tgttaatgtg gactttactg tggacactaa tggcgtgtat     5220

tcagagcctc gccccattgg caccagatac ctgactcgta atctgtaatt gcttgttaat     5280

caataaaccg tttaattcgt ttcagttgaa ctttggtctc tgcgtatttc tttcttatct     5340

agtttccatg ctctagagcg gccgccaccg cggtggagct ccagcttttg ttccctttag     5400

tgagggttaa ttgcgcgctt ggcgtaatca tggtcatagc tgtttcctgt gtgaaattgt     5460

tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa agcctggggt     5520

gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc tttccagtcg     5580

ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag aggcggtttg     5640

cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg     5700

cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat     5760

aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc     5820

gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc     5880

tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga     5940

agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt     6000

ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg     6060

taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc     6120

gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg     6180

gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc     6240

ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg     6300

ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc     6360

gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct     6420

caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt     6480

taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa     6540

aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa     6600

tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc     6660

tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct     6720

gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca     6780

gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt     6840

aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt     6900

gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc     6960

ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc     7020

tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt     7080

atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact     7140

ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc     7200

ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt     7260

ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg     7320

atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct     7380

gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa     7440

tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt     7500

ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc     7560

acatttcccc gaaaagtgcc ac                                              7582


<210> 74
<211> 7270
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 74
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tgggtaccgg      660

gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcctgc agcccggggg      720

atccactagt tctagagtcc tgtattagag gtcacgtgag tgttttgcga cattttgcga      780

caccatgtgg tcacgctggg tatttaagcc cgagtgagca cgcagggtct ccattttgaa      840

gcgggaggtt tgaacgcgca gccgccatgc cggggtttta cgagattgtg attaaggtcc      900

ccagcgacct tgacgagcat ctgcccggca tttctgacag ctttgtgaac tgggtggccg      960

agaaggaatg ggagttgccg ccagattctg acatggatct gaatctgatt gagcaggcac     1020

ccctgaccgt ggccgagaag ctgcagcgcg actttctgac ggaatggcgc cgtgtgagta     1080

aggccccgga ggcccttttc tttgtgcaat ttgagaaggg agagagctac ttccacatgc     1140

acgtgctcgt ggaaaccacc ggggtgaaat ccatggtttt gggacgtttc ctgagtcaga     1200

ttcgcgaaaa actgattcag agaatttacc gcgggatcga gccgactttg ccaaactggt     1260

tcgcggtcac aaagaccaga aatggcgccg gaggcgggaa caaggtggtg gatgagtgct     1320

acatccccaa ttacttgctc cccaaaaccc agcctgagct ccagtgggcg tggactaata     1380

tggaacagta tttaagcgcc tgtttgaatc tcacggagcg taaacggttg gtggcgcagc     1440

atctgacgca cgtgtcgcag acgcaggagc agaacaaaga gaatcagaat cccaattctg     1500

atgcgccggt gatcagatca aaaacttcag ccaggtacat ggagctggtc gggtggctcg     1560

tggacaaggg gattacctcg gagaagcagt ggatccagga ggaccaggcc tcatacatct     1620

ccttcaatgc ggcctccaac tcgcggtccc aaatcaaggc tgccttggac aatgcgggaa     1680

agattatgag cctgactaaa accgcccccg actacctggt gggccagcag cccgtggagg     1740

acatttccag caatcggatt tataaaattt tggaactaaa cgggtacgat ccccaatatg     1800

cggcttccgt ctttctggga tgggccacga aaaagttcgg caagaggaac accatctggc     1860

tgtttgggcc tgcaactacc gggaagacca acatcgcgga ggccatagcc cacactgtgc     1920

ccttctacgg gtgcgtaaac tggaccaatg agaactttcc cttcaacgac tgtgtcgaca     1980

agatggtgat ctggtgggag gaggggaaga tgaccgccaa ggtcgtggag tcggccaaag     2040

ccattctcgg aggaagcaag gtgcgcgtgg accagaaatg caagtcctcg gcccagatag     2100

acccgactcc cgtgatcgtc acctccaaca ccaacatgtg cgccgtgatt gacgggaact     2160

caacgacctt cgaacaccag cagccgttgc aagaccggat gttcaaattt gaactcaccc     2220

gccgtctgga tcatgacttt gggaaggtca ccaagcagga agtcaaagac tttttccggt     2280

gggcaaagga tcacgtggtt gaggtggagc atgaattcta cgtcaaaaag ggtggagcca     2340

agaaaagacc cgcccccagt gacgcagata taagtgagcc caaacgggtg cgcgagtcag     2400

ttgcgcagcc atcgacgtca gacgcggaag cttcgatcaa ctacgcagac aggtaccaaa     2460

acaaatgttc tcgtcacgtg ggcatgaatc tgatgctgtt tccctgcaga caatgcgaga     2520

gaatgaatca gaattcaaat atctgcttca ctcacggaca gaaagactgt ttagagtgct     2580

ttcccgtgtc agaatctcaa cccgtttctg tcgtcaaaaa ggcgtatcag aaactgtgct     2640

acattcatca tatcatggga aaggtgccag acgcttgcac tgcctgcgat ctggtcaatg     2700

tggatttgga tgactgcatc tttgaacaat aaatgattta aatcaggtat ggctgccgat     2760

ggttatcttc cagattggct cgaggacact ctctctgaag gaataagaca gtggtggaag     2820

ctcaaacctg gcccaccacc accaaagccc gcagagcggc ataaggacga cagcaggggt     2880

cttgtgcttc ctgggtacaa gtacctcgga cccttcaacg gactcgacaa gggagagccg     2940

gtcaacgagg cagacgccgc ggccctcgag cacgacaaag cctacgaccg gcagctcgac     3000

agcggagaca acccgtacct caagtacaac cacgccgacg cggagtttca ggagcgcctt     3060

aaagaagata cgtcttttgg gggcaacctc ggacgagcag tcttccaggc gaaaaagagg     3120

gttcttgaac ctctgggcct ggttgaggaa cctgttaaga tggctccggg aaaaaagagg     3180

ccggtagagc actctcctgt ggagccagac tcctcctcgg gaaccggaaa ggcgggccag     3240

cagcctgcaa gaaaaagatt gaattttggt cagactggag acgcagactc agtacctgac     3300

ccccagcctc tcggacagcc accagcagcc ccctctggtc tgggaactaa tacgctggct     3360

acaggcagtg gcgcaccact ggcagacaat aacgagggcg ccgacggagt gggtaattcc     3420

tcgggaaatt ggcattgcga ttccacatgg ctgggcgaca gagtcatcac caccagcacc     3480

cgaacctggg ccctgcccac ctacaacaac cacctctaca aacaaatttc cagccaatca     3540

ggagcctcga acgacaatca ctactttggc tacagcaccc cttgggggta ttttgacttc     3600

aacagattcc actgccactt ttcaccacgt gactggcaaa gactcatcaa caacaactgg     3660

ggattccgac ccaagagact caacttcaag ctctttaaca ttcaagtcaa agaggtcacg     3720

cagaatgacg gtacgacgac gattgccaat aaccttacca gcacggttca ggtgtttact     3780

gactcggagt accagctccc gtacgtcctc ggctcggcgc atcaaggatg cctcccgccg     3840

ttcccagcag acgtcttcat ggtgccacag tatggatacc tcaccctgaa caacgggagt     3900

caggcagtag gacgctcttc attttactgc ctggagtact ttccttctca gatgctgcgt     3960

accggaaaca actttacctt cagctacact tttgaggacg ttcctttcca cagcagctac     4020

gctcacagcc agagtctgga ccgtctcatg aatcctctca tcgaccagta cctgtattac     4080

ttgagcagaa caaacactcc aagtggaacc accacgcagt caaggcttca gttttctcag     4140

gccggagcga gtgacattcg ggaccagtct aggaactggc ttcctggacc ctgttaccgc     4200

cagcagcgag tatcaaagac atctgcggat aacaacaaca gtgaatactc gtggactgga     4260

gctaccaagt accacctcaa tggcagagac tctctggtga atccgggccc ggccatggca     4320

agccacaagg acgatgaaga aaagtttttt cctcagagcg gggttctcat ctttgggaag     4380

caaggctcag agaaaacaaa tgtggacatt gaaaaggtca tgattacaga cgaagaggaa     4440

atcaggacaa ccaatcccgt ggctacggag cagtatggtt ctgtatctac caacctccag     4500

agaggcaaca gacaagcagc taccgcagat gtcaacacac aaggcgttct tccaggcatg     4560

gtctggcagg acagagatgt gtaccttcag gggcccatct gggcaaagat tccacacacg     4620

gacggacatt ttcacccctc tcccctcatg ggtggattcg gacttaaaca ccctcctcca     4680

cagattctca tcaagaacac cccggtacct gcgaatcctt cgaccacctt cagtgcggca     4740

aagtttgctt ccttcatcac acagtactcc acgggacagg tcagcgtgga gatcgagtgg     4800

gagctgcaga aggaaaacag caaacgctgg aatcccgaaa ttcagtacac ttccaactac     4860

aacaagtctg ttaatgtgga ctttactgtg gacactaatg gcgtgtattc agagcctcgc     4920

cccattggca ccagatacct gactcgtaat ctgtaattgc ttgttaatca ataaaccgtt     4980

taattcgttt cagttgaact ttggtctctg cgtatttctt tcttatctag tttccatgct     5040

ctagagcggc cgccaccgcg gtggagctcc agcttttgtt ccctttagtg agggttaatt     5100

gcgcgcttgg cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta tccgctcaca     5160

attccacaca acatacgagc cggaagcata aagtgtaaag cctggggtgc ctaatgagtg     5220

agctaactca cattaattgc gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg     5280

tgccagctgc attaatgaat cggccaacgc gcggggagag gcggtttgcg tattgggcgc     5340

tcttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta     5400

tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag     5460

aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg     5520

tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg     5580

tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg     5640

cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga     5700

agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc     5760

tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt     5820

aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact     5880

ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg     5940

cctaactacg gctacactag aaggacagta tttggtatct gcgctctgct gaagccagtt     6000

accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt     6060

ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct     6120

ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg     6180

gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt     6240

aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt     6300

gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc     6360

gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg     6420

cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc     6480

gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg     6540

gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca     6600

ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga     6660

tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct     6720

ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg     6780

cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca     6840

accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata     6900

cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct     6960

tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact     7020

cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa     7080

acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc     7140

atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga     7200

tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga     7260

aaagtgccac                                                            7270


<210> 75
<211> 3860
<212> DNA
<213> Unknown

<220>
<223> Description of Unknown: 
      enterokinase sequence

<400> 75
agatttgttg tttgacaaaa ctttgaaaac tggagagttt ctgctcttca actgctgcaa       60

gcttctgtgc tcttccagag tccttagggt agcaaacctt caaaaaccaa aaatggggtc      120

aaagcgaagt gtaccatcaa ggcaccgttc tctcaccacc tatgaagtca tgtttgccgt      180

tctctttgtc atattggtgg cgctctgtgc tggattaatt gccgtgtcct ggctgtcaat      240

ccagggatca gtaaaagatg cagcatttgg aaaaagtcat gaagccagag ggacattgaa      300

aataatatcc ggagctactt ataatcctca tttgcaagac aaactctcag tggacttcaa      360

agttcttgct tttgacattc agcaaatgat agatgatatc tttcaatcaa gtaatctgaa      420

aaatgaatat aaaaactcaa gagttttaca atttgaaaat ggcagcatta tagtcatatt      480

tgaccttctc tttgaccagt gggtgtcaga taaaaatgta aaagaagaac tgattcaagg      540

cattgaagca aataaatcca gccaactggt cactttccac attgacttga acagcattga      600

tatcacagcc tctttggaga atttctctac gataagtcct gcaacaacgt cagaaaagct      660

aacaaccagc attcctctgg caaccccagg aaatgtctca atagagtgcc cacctgattc      720

aaggctgtgt gctgatgctc taaagtgcat agcaattgat ttattttgtg atggagaatt      780

aaactgtcca gatggctctg atgaagacaa taaaacttgt gccacagctt gtgatggaag      840

atttttgttg actggatctt ctgggtcctt tgaggctctg cattatccca agccttctaa      900

taatacaagc gctgtttgtc ggtggattat acgtgtaaac caaggacttt ccattcaact      960

gaacttcgat tattttaata catattatgc agatgtatta aatatttatg aaggaatggg     1020

ttcaagcaag attttaagag cttctctctg gtcaaataat cctggcataa ttaggatttt     1080

ttccaatcaa gttactgcca cttttcttat acagtctgat gaaagtgatt atattggctt     1140

caaagtaaca tacactgcat ttaacagcaa agagcttaat aattatgaga aaatcaactg     1200

taattttgaa gatggcttct gtttctggat ccaggatcta aatgatgaca atgagtggga     1260

aaggactcag ggaagcacct ttcctccatc tactggacca acttttgacc acacttttgg     1320

caatgagtca ggattttaca tttccacccc aactggacca ggaggaagac gagaaagagt     1380

aggactttta actctccctt tagatcccac tcctgaacaa gcctgcctta gtttctggta     1440

ttatatgtat ggtgaaaatg tttacaaact aagcattaat atcagcagtg accaaaacat     1500

ggagaagaca attttccaaa aagaaggaaa ttatggacaa aattggaact atggacaagt     1560

aacattaaat gaaacagtgg aatttaaggt ttctttctat gggtttaaaa accagatcct     1620

gagtgatata gcattggatg acattagcct aacatatggg atttgtaatg tgagtgtcta     1680

tccagaacca actttagtcc caactcctcc accagaactt cccacggact gtggagggcc     1740

tcatgacctg tgggagccaa atacaacatt cacgtctata aacttcccaa acagctaccc     1800

taatcaggct ttctgtattt ggaatttaaa tgcacaaaag ggaaaaaata ttcagctcca     1860

ctttcaagaa tttgacctgg aaaatattgc agatgtagtt gaaatcagag atggtgaagg     1920

agatgattcc ttgttcttag ctgtgtacac aggccctggt ccagtaaacg atgtgttctc     1980

aaccaccaac cgaatgactg tgctttttat cactgataat atgctggcaa aacagggatt     2040

taaagcaaat ttcactactg gctatggctt ggggattcca gaaccctgca aggaagacaa     2100

ttttcagtgc aaggatgggg agtgtattcc gctggtgaat ctctgtgacg gttttccaca     2160

ctgtaaggat ggctcagatg aagcacactg tgtgcgtctc ttcaatggca cgacagacag     2220

cagtggtttg gtgcagttca ggatccaaag catatggcat gtagcctgtg ccgagaactg     2280

gacaacccag atctcagatg atgtgtgtca gctgctggga ctagggactg gaaactcatc     2340

cgtgccaacc ttttctactg gaggtggacc atatgtaaat ttaaacacag cacctaatgg     2400

cagcttaata ctaacgccaa gccaacagtg cttagaggat tcactgattc tgctacaatg     2460

taactacaaa tcatgtggga aaaaactggt gactcaagaa gttagcccga agattgtcgg     2520

aggaagtgac tccagagaag gagcctggcc ttgggtcgtt gctctgtatt tcgacgatca     2580

acaggtctgc ggagcttctc tggtgagcag ggattggctg gtgtcggccg cccactgcgt     2640

gtacgggaga aatatggagc cgtctaagtg gaaagcagtg ctaggcctgc atatggcatc     2700

aaatctgact tctcctcaga tagaaactag gttgattgac caaattgtca taaacccaca     2760

ctacaataaa cggagaaaga acaatgacat tgccatgatg catcttgaaa tgaaagtgaa     2820

ctacacagat tatatacagc ctatttgttt accagaagaa aatcaagttt ttcccccagg     2880

aagaatttgt tctattgctg gctggggggc acttatatat caaggttcta ctgcagacgt     2940

actgcaagaa gctgacgttc cccttctatc aaatgagaaa tgtcaacaac agatgccaga     3000

atataacatt acggaaaata tggtgtgtgc aggctatgaa gcaggagggg tagattcttg     3060

tcagggggat tcaggcggac cactcatgtg ccaagaaaac aacagatggc tcctggctgg     3120

cgtgacgtca tttggatatc aatgtgcact gcctaatcgc ccaggggtgt atgcccgggt     3180

cccaaggttc acagagtgga tacaaagttt tctacattag agtgtttcca gaaacaaaga     3240

tgaaaatcag gcagttttcc catttcactt taagaagcat ggaaattgag agttaaaaaa     3300

ataataattt ataaaagtct tgattcttac ctaaggcact gaaatgctac agaaaaaaaa     3360

aagcaaaaac taatctttac aatacaaagt aactataaaa taataaattc tgattttatt     3420

gtcaacagtt actctttcac agacatcatt atttcctttg ttcttaatca ttatttttat     3480

cgtattctta tttaaagaaa ttatatttta aatcatgtaa tataatgttt aagcaaagtt     3540

aggaagagac atgaaataaa cttttacaca aagtagggta ttgtttgaaa tagattgtta     3600

taagttatct aattccagga taggtcacta ttatcagcat ctcaatcatt ttgctgtttt     3660

tctatccaaa tgcattttca atccatcttg agcacatcct taatattttc cccataataa     3720

aatatattta ttgtaagctc atgtcacaag cctggactaa actgattgta caatcctttc     3780

aaataagcta gttaaacaga aaactagcac aagtctatat attgcccttg catcaaataa     3840

agctaaaata attaacattg                                                 3860


<210> 76
<211> 1035
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      enterokinase sequence

<400> 76
Met Gly Ser Lys Arg Ser Val Pro Ser Arg His Arg Ser Leu Thr Thr 
1               5                   10                  15      


Tyr Glu Val Met Phe Ala Val Leu Phe Val Ile Leu Val Ala Leu Cys 
            20                  25                  30          


Ala Gly Leu Ile Ala Val Ser Trp Leu Ser Ile Gln Gly Ser Val Lys 
        35                  40                  45              


Asp Ala Ala Phe Gly Lys Ser His Glu Ala Arg Gly Thr Leu Lys Ile 
    50                  55                  60                  


Ile Ser Gly Ala Thr Tyr Asn Pro His Leu Gln Asp Lys Leu Ser Val 
65                  70                  75                  80  


Asp Phe Lys Val Leu Ala Phe Asp Ile Gln Gln Met Ile Asp Asp Ile 
                85                  90                  95      


Phe Gln Ser Ser Asn Leu Lys Asn Glu Tyr Lys Asn Ser Arg Val Leu 
            100                 105                 110         


Gln Phe Glu Asn Gly Ser Ile Ile Val Ile Phe Asp Leu Leu Phe Asp 
        115                 120                 125             


Gln Trp Val Ser Asp Lys Asn Val Lys Glu Glu Leu Ile Gln Gly Ile 
    130                 135                 140                 


Glu Ala Asn Lys Ser Ser Gln Leu Val Thr Phe His Ile Asp Leu Asn 
145                 150                 155                 160 


Ser Ile Asp Ile Thr Ala Ser Leu Glu Asn Phe Ser Thr Ile Ser Pro 
                165                 170                 175     


Ala Thr Thr Ser Glu Lys Leu Thr Thr Ser Ile Pro Leu Ala Thr Pro 
            180                 185                 190         


Gly Asn Val Ser Ile Glu Cys Pro Pro Asp Ser Arg Leu Cys Ala Asp 
        195                 200                 205             


Ala Leu Lys Cys Ile Ala Ile Asp Leu Phe Cys Asp Gly Glu Leu Asn 
    210                 215                 220                 


Cys Pro Asp Gly Ser Asp Glu Asp Asn Lys Thr Cys Ala Thr Ala Cys 
225                 230                 235                 240 


Asp Gly Arg Phe Leu Leu Thr Gly Ser Ser Gly Ser Phe Glu Ala Leu 
                245                 250                 255     


His Tyr Pro Lys Pro Ser Asn Asn Thr Ser Ala Val Cys Arg Trp Ile 
            260                 265                 270         


Ile Arg Val Asn Gln Gly Leu Ser Ile Gln Leu Asn Phe Asp Tyr Phe 
        275                 280                 285             


Asn Thr Tyr Tyr Ala Asp Val Leu Asn Ile Tyr Glu Gly Met Gly Ser 
    290                 295                 300                 


Ser Lys Ile Leu Arg Ala Ser Leu Trp Ser Asn Asn Pro Gly Ile Ile 
305                 310                 315                 320 


Arg Ile Phe Ser Asn Gln Val Thr Ala Thr Phe Leu Ile Gln Ser Asp 
                325                 330                 335     


Glu Ser Asp Tyr Ile Gly Phe Lys Val Thr Tyr Thr Ala Phe Asn Ser 
            340                 345                 350         


Lys Glu Leu Asn Asn Tyr Glu Lys Ile Asn Cys Asn Phe Glu Asp Gly 
        355                 360                 365             


Phe Cys Phe Trp Ile Gln Asp Leu Asn Asp Asp Asn Glu Trp Glu Arg 
    370                 375                 380                 


Thr Gln Gly Ser Thr Phe Pro Pro Ser Thr Gly Pro Thr Phe Asp His 
385                 390                 395                 400 


Thr Phe Gly Asn Glu Ser Gly Phe Tyr Ile Ser Thr Pro Thr Gly Pro 
                405                 410                 415     


Gly Gly Arg Arg Glu Arg Val Gly Leu Leu Thr Leu Pro Leu Asp Pro 
            420                 425                 430         


Thr Pro Glu Gln Ala Cys Leu Ser Phe Trp Tyr Tyr Met Tyr Gly Glu 
        435                 440                 445             


Asn Val Tyr Lys Leu Ser Ile Asn Ile Ser Ser Asp Gln Asn Met Glu 
    450                 455                 460                 


Lys Thr Ile Phe Gln Lys Glu Gly Asn Tyr Gly Gln Asn Trp Asn Tyr 
465                 470                 475                 480 


Gly Gln Val Thr Leu Asn Glu Thr Val Glu Phe Lys Val Ser Phe Tyr 
                485                 490                 495     


Gly Phe Lys Asn Gln Ile Leu Ser Asp Ile Ala Leu Asp Asp Ile Ser 
            500                 505                 510         


Leu Thr Tyr Gly Ile Cys Asn Val Ser Val Tyr Pro Glu Pro Thr Leu 
        515                 520                 525             


Val Pro Thr Pro Pro Pro Glu Leu Pro Thr Asp Cys Gly Gly Pro His 
    530                 535                 540                 


Asp Leu Trp Glu Pro Asn Thr Thr Phe Thr Ser Ile Asn Phe Pro Asn 
545                 550                 555                 560 


Ser Tyr Pro Asn Gln Ala Phe Cys Ile Trp Asn Leu Asn Ala Gln Lys 
                565                 570                 575     


Gly Lys Asn Ile Gln Leu His Phe Gln Glu Phe Asp Leu Glu Asn Ile 
            580                 585                 590         


Ala Asp Val Val Glu Ile Arg Asp Gly Glu Gly Asp Asp Ser Leu Phe 
        595                 600                 605             


Leu Ala Val Tyr Thr Gly Pro Gly Pro Val Asn Asp Val Phe Ser Thr 
    610                 615                 620                 


Thr Asn Arg Met Thr Val Leu Phe Ile Thr Asp Asn Met Leu Ala Lys 
625                 630                 635                 640 


Gln Gly Phe Lys Ala Asn Phe Thr Thr Gly Tyr Gly Leu Gly Ile Pro 
                645                 650                 655     


Glu Pro Cys Lys Glu Asp Asn Phe Gln Cys Lys Asp Gly Glu Cys Ile 
            660                 665                 670         


Pro Leu Val Asn Leu Cys Asp Gly Phe Pro His Cys Lys Asp Gly Ser 
        675                 680                 685             


Asp Glu Ala His Cys Val Arg Leu Phe Asn Gly Thr Thr Asp Ser Ser 
    690                 695                 700                 


Gly Leu Val Gln Phe Arg Ile Gln Ser Ile Trp His Val Ala Cys Ala 
705                 710                 715                 720 


Glu Asn Trp Thr Thr Gln Ile Ser Asp Asp Val Cys Gln Leu Leu Gly 
                725                 730                 735     


Leu Gly Thr Gly Asn Ser Ser Val Pro Thr Phe Ser Thr Gly Gly Gly 
            740                 745                 750         


Pro Tyr Val Asn Leu Asn Thr Ala Pro Asn Gly Ser Leu Ile Leu Thr 
        755                 760                 765             


Pro Ser Gln Gln Cys Leu Glu Asp Ser Leu Ile Leu Leu Gln Cys Asn 
    770                 775                 780                 


Tyr Lys Ser Cys Gly Lys Lys Leu Val Thr Gln Glu Val Ser Pro Lys 
785                 790                 795                 800 


Ile Val Gly Gly Ser Asp Ser Arg Glu Gly Ala Trp Pro Trp Val Val 
                805                 810                 815     


Ala Leu Tyr Phe Asp Asp Gln Gln Val Cys Gly Ala Ser Leu Val Ser 
            820                 825                 830         


Arg Asp Trp Leu Val Ser Ala Ala His Cys Val Tyr Gly Arg Asn Met 
        835                 840                 845             


Glu Pro Ser Lys Trp Lys Ala Val Leu Gly Leu His Met Ala Ser Asn 
    850                 855                 860                 


Leu Thr Ser Pro Gln Ile Glu Thr Arg Leu Ile Asp Gln Ile Val Ile 
865                 870                 875                 880 


Asn Pro His Tyr Asn Lys Arg Arg Lys Asn Asn Asp Ile Ala Met Met 
                885                 890                 895     


His Leu Glu Met Lys Val Asn Tyr Thr Asp Tyr Ile Gln Pro Ile Cys 
            900                 905                 910         


Leu Pro Glu Glu Asn Gln Val Phe Pro Pro Gly Arg Ile Cys Ser Ile 
        915                 920                 925             


Ala Gly Trp Gly Ala Leu Ile Tyr Gln Gly Ser Thr Ala Asp Val Leu 
    930                 935                 940                 


Gln Glu Ala Asp Val Pro Leu Leu Ser Asn Glu Lys Cys Gln Gln Gln 
945                 950                 955                 960 


Met Pro Glu Tyr Asn Ile Thr Glu Asn Met Val Cys Ala Gly Tyr Glu 
                965                 970                 975     


Ala Gly Gly Val Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Leu Met 
            980                 985                 990         


Cys Gln Glu Asn Asn Arg Trp Leu  Leu Ala Gly Val Thr  Ser Phe Gly 
        995                 1000                 1005             


Tyr Gln  Cys Ala Leu Pro Asn  Arg Pro Gly Val Tyr  Ala Arg Val 
    1010                 1015                 1020             


Pro Arg  Phe Thr Glu Trp Ile  Gln Ser Phe Leu His  
    1025                 1030                 1035 


<210> 77
<211> 7271
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 77
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tgggtaccgg      660

gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcctgc agcccggggg      720

atccactagt tctagaggtc ctgtattaga ggtcacgtga gtgttttgcg acattttgcg      780

acaccatgtg gtcacgctgg gtatttaagc ccgagtgagc acgcagggtc tccattttga      840

agcgggaggt ttgaacgcgc agccgccatg ccggggtttt acgagattgt gattaaggtc      900

cccagcgacc ttgacgagca tctgcccggc atttctgaca gctttgtgaa ctgggtggcc      960

gagaaggaat gggagttgcc gccagattct gacatggatc tgaatctgat tgagcaggca     1020

cccctgaccg tggccgagaa gctgcagcgc gactttctga cggaatggcg ccgtgtgagt     1080

aaggccccgg aggccctttt ctttgtgcaa tttgagaagg gagagagcta cttccacatg     1140

cacgtgctcg tggaaaccac cggggtgaaa tccatggttt tgggacgttt cctgagtcag     1200

attcgcgaaa aactgattca gagaatttac cgcgggatcg agccgacttt gccaaactgg     1260

ttcgcggtca caaagaccag aaatggcgcc ggaggcggga acaaggtggt ggatgagtgc     1320

tacatcccca attacttgct ccccaaaacc cagcctgagc tccagtgggc gtggactaat     1380

atggaacagt atttaagcgc ctgtttgaat ctcacggagc gtaaacggtt ggtggcgcag     1440

catctgacgc acgtgtcgca gacgcaggag cagaacaaag agaatcagaa tcccaattct     1500

gatgcgccgg tgatcagatc aaaaacttca gccaggtaca tggagctggt cgggtggctc     1560

gtggacaagg ggattacctc ggagaagcag tggatacagg aggaccaggc ctcatacatc     1620

tccttcaatg cggcctccaa ctcgcggtcc caaatcaagg ctgccttgga caatgcggga     1680

aagattatga gcctgactaa aaccgccccc gactacctgg tgggccagca gcccgtggag     1740

gacatttcca gcaatcggat ttataaaatt ttggaactaa acgggtacga tccccaatat     1800

gcggcttccg tctttctggg atgggccacg aaaaagttcg gcaagaggaa caccatctgg     1860

ctgtttgggc ctgcaactac cgggaagacc aacatcgcgg aggccatagc ccacactgtg     1920

cccttctacg ggtgcgtaaa ctggaccaat gagaactttc ccttcaacga ctgtgtcgac     1980

aagatggtga tctggtggga ggaggggaag atgaccgcca aggtcgtgga gtcggccaaa     2040

gccattctcg gaggaagcaa ggtgcgcgtg gaccagaaat gcaagtcctc ggcccagata     2100

gacccgactc ccgtgatcgt cacctccaac accaacatgt gcgccgtgat tgacgggaac     2160

tcaacgacct tcgaacacca gcagccgttg caagaccgga tgttcaaatt tgaactcacc     2220

cgccgtctgg atcatgactt tgggaaggtc accaagcagg aagtcaaaga ctttttccgg     2280

tgggcaaagg atcacgtggt tgaggtggag catgaattct acgtcaaaaa gggtggagcc     2340

aagaaaagac ccgcccccag tgacgcagat ataagtgagc ccaaacgggt gcgcgagtca     2400

gttgcgcagc catcgacgtc agacgcggaa gcttcgatca actacgcaga caggtaccaa     2460

aacaaatgtt ctcgtcacgt gggcatgaat ctgatgctgt ttccctgcag acaatgcgag     2520

agaatgaatc agaattcaaa tatctgcttc actcacggac agaaagactg tttagagtgc     2580

tttcccgtgt cagaatctca acccgtttct gtcgtcaaaa aggcgtatca gaaactgtgc     2640

tacattcatc atatcatggg aaaggtgcca gacgcttgca ctgcctgcga tctggtcaat     2700

gtggatttgg atgactgcat ctttgaacaa taaatgattt aaatcaggta tggctgccga     2760

tggttatctt ccagattggc tcgaggacac tctctctgaa ggaataagac agtggtggaa     2820

gctcaaacct ggcccaccac caccaaagcc cgcagagcgg cataaggacg acagcagggg     2880

tcttgtgctt cctgggtaca agtacctcgg acccttcaac ggactcgaca agggagagcc     2940

ggtcaacgag gcagacgccg cggccctcga gcacgacaaa gcctacgacc ggcagctcga     3000

cagcggagac aacccgtacc tcaagtacaa ccacgccgac gcggagtttc aggagcgcct     3060

taaagaagat acgtcttttg ggggcaacct cggacgagca gtcttccagg cgaaaaagag     3120

ggttcttgaa cctctgggcc tggttgagga acctgttaag aaggctccgg gaaaaaagag     3180

gccggtagag cactctcctg tggagccaga ctcctcctcg ggaaccggaa aggcgggcca     3240

gcagcctgca agaaaaagat tgaattttgg tcagactgga gacgcagact cagtacctga     3300

cccccagcct ctcggacagc caccagcagc cccctctggt ctgggaacta ataccatggc     3360

tacaggcagt ggcgcaccaa tggcagacaa taacgagggt gccgacggag tgggtaattc     3420

ctcgggaaat tggcattgcg attccacatg gatgggcgac agagtcatca ccaccagcac     3480

ccgaacctgg gccctgccca cctacaacaa ccacctctac aaacaaattt ccagccaatc     3540

aggagcctcg aacgacaatc actactttgg ctacagcacc ccttgggggt attttgactt     3600

caacagattc cactgccact tttcaccacg tgactggcaa agactcatca acaacaactg     3660

gggattccga cccaagagac tcaacttcaa gctctttaac attcaagtca aagaggtcac     3720

gcagaatgac ggtacgacga cgattgccaa taaccttacc agcacggttc aggtgtttac     3780

tgactcggag taccagctcc cgtacgtcct cggctcggcg catcaaggat gcctcccgcc     3840

gttcccagca gacgtcttca tggtgccaca gtatggatac ctcaccctga acaacgggag     3900

tcaggcagta ggacgctctt cattttactg cctggagtac tttccttctc agatgctgcg     3960

taccggaaac aactttacct tcagctacac ttttgaggac gttcctttcc acagcagcta     4020

cgctcacagc cagagtctgg accgtctcat gaatcctctc atcgaccagt acctgtatta     4080

cttgagcaga acaaacactc caagtggaac caccacgcag tcaaggcttc agttttctca     4140

ggccggagcg agtgacattc gggaccagtc taggaactgg cttcctggac cctgttaccg     4200

ccagcagcga gtatcaaaga catctgcgga taacaacaac agtgaatact cgtggactgg     4260

agctaccaag taccacctca atggcagaga ctctctggtg aatccgggcc cggccatggc     4320

aagccacaag gacgatgaag aaaagttttt tcctcagagc ggggttctca tctttgggaa     4380

gcaaggctca gagaaaacaa atgtggacat tgaaaaggtc atgattacag acgaagagga     4440

aatcaggaca accaatcccg tggctacgga gcagtatggt tctgtatcta ccaacctcca     4500

gagaggcaac agacaagcag ctaccgcaga tgtcaacaca caaggcgttc ttccaggcat     4560

ggtctggcag gacagagatg tgtaccttca ggggcccatc tgggcaaaga ttccacacac     4620

ggacggacat tttcacccct ctcccctcat gggtggattc ggacttaaac accctcctcc     4680

acagattctc atcaagaaca ccccggtacc tgcgaatcct tcgaccacct tcagtgcggc     4740

aaagtttgct tccttcatca cacagtactc cacgggacag gtcagcgtgg agatcgagtg     4800

ggagctgcag aaggaaaaca gcaaacgctg gaatcccgaa attcagtaca cttccaacta     4860

caacaagtct gttaatgtgg actttactgt ggacactaat ggcgtgtatt cagagcctcg     4920

ccccattggc accagatacc tgactcgtaa tctgtaattg cttgttaatc aataaaccgt     4980

ttaattcgtt tcagttgaac tttggtctct gcgtatttct ttcttatcta gtttccatgc     5040

tctagagcgg ccgccaccgc ggtggagctc cagcttttgt tccctttagt gagggttaat     5100

tgcgcgcttg gcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt atccgctcac     5160

aattccacac aacatacgag ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt     5220

gagctaactc acattaattg cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc     5280

gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg     5340

ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt     5400

atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa     5460

gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc     5520

gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag     5580

gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt     5640

gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg     5700

aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg     5760

ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg     5820

taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac     5880

tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg     5940

gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt     6000

taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg     6060

tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc     6120

tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt     6180

ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt     6240

taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag     6300

tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt     6360

cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg caatgatacc     6420

gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag ccggaagggc     6480

cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta attgttgccg     6540

ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg ccattgctac     6600

aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg gttcccaacg     6660

atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc     6720

tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta tggcagcact     6780

gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg gtgagtactc     6840

aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat     6900

acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc     6960

ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac     7020

tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg ggtgagcaaa     7080

aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact     7140

catactcttc ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg     7200

atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg     7260

aaaagtgcca c                                                          7271


<210> 78
<211> 6957
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 78
cagcagctgc gcgctcgctc gctcactgag gccgcccggg caaagcccgg gcgtcgggcg       60

acctttggtc gcccggcctc agtgagcgag cgagcgcgca gagagggagt ggccaactcc      120

atcactaggg gttccttgta gttaatgatt aacccgccat gctacttatc tacgtagcca      180

tgctctagag gatccggcct cggcctctgc ataaataaaa aaaattagtc agccatgagc      240

ttggcccatt gcatacgttg tatccatatc ataatatgta catttatatt ggctcatgtc      300

caacattacc gccatgttga cattgattat tgactagtta ttaatagtaa tcaattacgg      360

ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg gtaaatggcc      420

cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg tatgttccca      480

tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta cggtaaactg      540

cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt gacgtcaatg      600

acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac tttcctactt      660

ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt tggcagtaca      720

tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac cccattgacg      780

tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt cgtaacaact      840

ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat ataagcagag      900

ctcgtttagt gaaccgtcag atcgcctgga gacgccatcc acgctgtttt gacctccata      960

gaagacaccg ggaccgatcc agcctcccct cgaagcttac atgtggtacc gagctcggat     1020

cctgagaact tcagggtgag tctatgggac ccttgatgtt ttctttcccc ttcttttcta     1080

tggttaagtt catgtcatag gaaggggaga agtaacaggg tacacatatt gaccaaatca     1140

gggtaatttt gcatttgtaa ttttaaaaaa tgctttcttc ttttaatata cttttttgtt     1200

tatcttattt ctaatacttt ccctaatctc tttctttcag ggcaataatg atacaatgta     1260

tcatgcctct ttgcaccatt ctaaagaata acagtgataa tttctgggtt aaggcaatag     1320

caatatttct gcatataaat atttctgcat ataaattgta actgatgtaa gaggtttcat     1380

attgctaata gcagctacaa tccagctacc attctgcttt tattttatgg ttgggataag     1440

gctggattat tctgagtcca agctaggccc ttttgctaat catgttcata cctcttatct     1500

tcctcccaca gctcctgggc aacgtgctgg tctgtgtgct ggcccatcac tttggcaaag     1560

cacgctaccg gtcgccacca tggtgagcaa gggcgaggag ctgttcaccg gggtggtgcc     1620

catcctggtc gagctggacg gcgacgtaaa cggccacaag ttcagcgtgt ccggcgaggg     1680

cgagggcgat gccacctacg gcaagctgac cctgaagttc atctgcacca ccggcaagct     1740

gcccgtgccc tggcccaccc tcgtgaccac cctgacctac ggcgtgcagt gcttcagccg     1800

ctaccccgac cacatgaagc agcacgactt cttcaagtcc gccatgcccg aaggctacgt     1860

ccaggagcgc accatcttct tcaaggacga cggcaactac aagacccgcg ccgaggtgaa     1920

gttcgagggc gacaccctgg tgaaccgcat cgagctgaag ggcatcgact tcaaggagga     1980

cggcaacatc ctggggcaca agctggagta caactacaac agccacaacg tctatatcat     2040

ggccgacaag cagaagaacg gcatcaaggt gaacttcaag atccgccaca acatcgagga     2100

cggcagcgtg cagctcgccg accactacca gcagaacacc cccatcggcg acggccccgt     2160

gctgctgccc gacaaccact acctgagcac ccagtccgcc ctgagcaaag accccaacga     2220

gaagcgcgat cacatggtcc tgctggagtt cgtgaccgcc gccgggatca ctctcggcat     2280

ggacgagctg tacaagtaaa gcggccgctc tagaggatcc aagcttatcg ataccgtcga     2340

cctcgagggc ccagatctaa ttcaccccac cagtgcaggc tgcctatcag aaagtggtgg     2400

ctggtgtggc taatgccctg gcccacaagt atcactaagc tcgctttctt gctgtccaat     2460

ttctattaaa ggttcctttg ttccctaagt ccaactacta aactggggga tattatgaag     2520

ggccttgagc atctggattc tgcctaataa aaaacattta ttttcattgc aatgatgtat     2580

ttaaattatt tctgaatatt ttactaaaaa gggaatgtgg gaggtcagtg catttaaaac     2640

ataaagaaat gaagagctag ttcaaacctt gggaaaatac actatatctt aaactccatg     2700

aaagaaggtg aggctgcaaa cagctaatgc acattggcaa cagcccctga tgcctatgcc     2760

ttattcatcc ctcagaaaag gattcaagta gaggcttgat ttggaggtta aagttttgct     2820

atgctgtatt ttacattact tattgtttta gctgtcctca tgaatgtctt ttcactaccc     2880

atttgcttat cctgcatctc tcagccttga ctccactcag ttctcttgct tagagatacc     2940

acctttcccc tgaagtgttc cttccatgtt ttacggcgag atggtttctc ctcgcctggc     3000

cactcagcct tagttgtctc tgttgtctta tagaggtcta cttgaagaag gaaaaacagg     3060

gggcatggtt tgactgtcct gtgagccctt cttccctgcc tcccccactc acagtgaccc     3120

ggaatccctc gacatggcat cctagagcat ggctacgtag ataagtagca tggcgggtta     3180

atcattaact acaaggaacc cctagtgatg gagttggcca ctccctctct gcgcgctcgc     3240

tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc ccgggcggcc     3300

tcagtgagcg agcgagcgcg ccagctggcg taatagcgaa gaggcccgca ccgatcgccc     3360

ttcccaacag ttgcgcagcc tgaatggcga atggaattcc agacgattga gcgtcaaaat     3420

gtaggtattt ccatgagcgt ttttcctgtt gcaatggctg gcggtaatat tgttctggat     3480

attaccagca aggccgatag tttgagttct tctactcagg caagtgatgt tattactaat     3540

caaagaagta ttgcgacaac ggttaatttg cgtgatggac agactctttt actcggtggc     3600

ctcactgatt ataaaaacac ttctcaggat tctggcgtac cgttcctgtc taaaatccct     3660

ttaatcggcc tcctgtttag ctcccgctct gattctaacg aggaaagcac gttatacgtg     3720

ctcgtcaaag caaccatagt acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt     3780

ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt     3840

cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa atcgggggct     3900

ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac ttgattaggg     3960

tgatggttca cgtagtgggc catcgccctg atagacggtt tttcgccctt tgacgttgga     4020

gtccacgttc tttaatagtg gactcttgtt ccaaactgga acaacactca accctatctc     4080

ggtctattct tttgatttat aagggatttt gccgatttcg gcctattggt taaaaaatga     4140

gctgatttaa caaaaattta acgcgaattt taacaaaata ttaacgttta caatttaaat     4200

atttgcttat acaatcttcc tgtttttggg gcttttctga ttatcaaccg gggtacatat     4260

gattgacatg ctagttttac gattaccgtt catcgattct cttgtttgct ccagactctc     4320

aggcaatgac ctgatagcct ttgtagagac ctctcaaaaa tagctaccct ctccggcatg     4380

aatttatcag ctagaacggt tgaatatcat attgatggtg atttgactgt ctccggcctt     4440

tctcacccgt ttgaatcttt acctacacat tactcaggca ttgcatttaa aatatatgag     4500

ggttctaaaa atttttatcc ttgcgttgaa ataaaggctt ctcccgcaaa agtattacag     4560

ggtcataatg tttttggtac aaccgattta gctttatgct ctgaggcttt attgcttaat     4620

tttgctaatt ctttgccttg cctgtatgat ttattggatg ttggaattcc tgatgcggta     4680

ttttctcctt acgcatctgt gcggtatttc acaccgcata tggtgcactc tcagtacaat     4740

ctgctctgat gccgcatagt taagccagcc ccgacacccg ccaacacccg ctgacgcgcc     4800

ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag     4860

ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgagacgaa agggcctcgt     4920

gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga cgtcaggtgg     4980

cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa     5040

tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa     5100

gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct     5160

tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg     5220

tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg     5280

ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt     5340

atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga     5400

cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga     5460

attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac     5520

gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg     5580

ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac     5640

gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct     5700

agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct     5760

gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg     5820

gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat     5880

ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg     5940

tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat     6000

tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct     6060

catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa     6120

gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa     6180

aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc     6240

gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta     6300

gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct     6360

gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg     6420

atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag     6480

cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc     6540

cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg     6600

agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt     6660

tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg     6720

gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca     6780

catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg     6840

agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc     6900

ggaagagcgc ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatg        6957


<210> 79
<211> 4718
<212> DNA
<213> Adeno-associated virus 1

<400> 79
ttgcccactc cctctctgcg cgctcgctcg ctcggtgggg cctgcggacc aaaggtccgc       60

agacggcaga gctctgctct gccggcccca ccgagcgagc gagcgcgcag agagggagtg      120

ggcaactcca tcactagggg taatcgcgaa gcgcctccca cgctgccgcg tcagcgctga      180

cgtaaattac gtcatagggg agtggtcctg tattagctgt cacgtgagtg cttttgcgac      240

attttgcgac accacgtggc catttagggt atatatggcc gagtgagcga gcaggatctc      300

cattttgacc gcgaaatttg aacgagcagc agccatgccg ggcttctacg agatcgtgat      360

caaggtgccg agcgacctgg acgagcacct gccgggcatt tctgactcgt ttgtgagctg      420

ggtggccgag aaggaatggg agctgccccc ggattctgac atggatctga atctgattga      480

gcaggcaccc ctgaccgtgg ccgagaagct gcagcgcgac ttcctggtcc aatggcgccg      540

cgtgagtaag gccccggagg ccctcttctt tgttcagttc gagaagggcg agtcctactt      600

ccacctccat attctggtgg agaccacggg ggtcaaatcc atggtgctgg gccgcttcct      660

gagtcagatt agggacaagc tggtgcagac catctaccgc gggatcgagc cgaccctgcc      720

caactggttc gcggtgacca agacgcgtaa tggcgccgga ggggggaaca aggtggtgga      780

cgagtgctac atccccaact acctcctgcc caagactcag cccgagctgc agtgggcgtg      840

gactaacatg gaggagtata taagcgcctg tttgaacctg gccgagcgca aacggctcgt      900

ggcgcagcac ctgacccacg tcagccagac ccaggagcag aacaaggaga atctgaaccc      960

caattctgac gcgcctgtca tccggtcaaa aacctccgcg cgctacatgg agctggtcgg     1020

gtggctggtg gaccggggca tcacctccga gaagcagtgg atccaggagg accaggcctc     1080

gtacatctcc ttcaacgccg cttccaactc gcggtcccag atcaaggccg ctctggacaa     1140

tgccggcaag atcatggcgc tgaccaaatc cgcgcccgac tacctggtag gccccgctcc     1200

gcccgcggac attaaaacca accgcatcta ccgcatcctg gagctgaacg gctacgaacc     1260

tgcctacgcc ggctccgtct ttctcggctg ggcccagaaa aggttcggga agcgcaacac     1320

catctggctg tttgggccgg ccaccacggg caagaccaac atcgcggaag ccatcgccca     1380

cgccgtgccc ttctacggct gcgtcaactg gaccaatgag aactttccct tcaatgattg     1440

cgtcgacaag atggtgatct ggtgggagga gggcaagatg acggccaagg tcgtggagtc     1500

cgccaaggcc attctcggcg gcagcaaggt gcgcgtggac caaaagtgca agtcgtccgc     1560

ccagatcgac cccacccccg tgatcgtcac ctccaacacc aacatgtgcg ccgtgattga     1620

cgggaacagc accaccttcg agcaccagca gccgttgcag gaccggatgt tcaaatttga     1680

actcacccgc cgtctggagc atgactttgg caaggtgaca aagcaggaag tcaaagagtt     1740

cttccgctgg gcgcaggatc acgtgaccga ggtggcgcat gagttctacg tcagaaaggg     1800

tggagccaac aaaagacccg cccccgatga cgcggataaa agcgagccca agcgggcctg     1860

cccctcagtc gcggatccat cgacgtcaga cgcggaagga gctccggtgg actttgccga     1920

caggtaccaa aacaaatgtt ctcgtcacgc gggcatgctt cagatgctgt ttccctgcaa     1980

gacatgcgag agaatgaatc agaatttcaa catttgcttc acgcacggga cgagagactg     2040

ttcagagtgc ttccccggcg tgtcagaatc tcaaccggtc gtcagaaaga ggacgtatcg     2100

gaaactctgt gccattcatc atctgctggg gcgggctccc gagattgctt gctcggcctg     2160

cgatctggtc aacgtggacc tggatgactg tgtttctgag caataaatga cttaaaccag     2220

gtatggctgc cgatggttat cttccagatt ggctcgagga caacctctct gagggcattc     2280

gcgagtggtg ggacttgaaa cctggagccc cgaagcccaa agccaaccag caaaagcagg     2340

acgacggccg gggtctggtg cttcctggct acaagtacct cggacccttc aacggactcg     2400

acaaggggga gcccgtcaac gcggcggacg cagcggccct cgagcacgac aaggcctacg     2460

accagcagct caaagcgggt gacaatccgt acctgcggta taaccacgcc gacgccgagt     2520

ttcaggagcg tctgcaagaa gatacgtctt ttgggggcaa cctcgggcga gcagtcttcc     2580

aggccaagaa gcgggttctc gaacctctcg gtctggttga ggaaggcgct aagacggctc     2640

ctggaaagaa acgtccggta gagcagtcgc cacaagagcc agactcctcc tcgggcatcg     2700

gcaagacagg ccagcagccc gctaaaaaga gactcaattt tggtcagact ggcgactcag     2760

agtcagtccc cgatccacaa cctctcggag aacctccagc aacccccgct gctgtgggac     2820

ctactacaat ggcttcaggc ggtggcgcac caatggcaga caataacgaa ggcgccgacg     2880

gagtgggtaa tgcctcagga aattggcatt gcgattccac atggctgggc gacagagtca     2940

tcaccaccag cacccgcacc tgggccttgc ccacctacaa taaccacctc tacaagcaaa     3000

tctccagtgc ttcaacgggg gccagcaacg acaaccacta cttcggctac agcaccccct     3060

gggggtattt tgatttcaac agattccact gccacttttc accacgtgac tggcagcgac     3120

tcatcaacaa caattgggga ttccggccca agagactcaa cttcaaactc ttcaacatcc     3180

aagtcaagga ggtcacgacg aatgatggcg tcacaaccat cgctaataac cttaccagca     3240

cggttcaagt cttctcggac tcggagtacc agcttccgta cgtcctcggc tctgcgcacc     3300

agggctgcct ccctccgttc ccggcggacg tgttcatgat tccgcaatac ggctacctga     3360

cgctcaacaa tggcagccaa gccgtgggac gttcatcctt ttactgcctg gaatatttcc     3420

cttctcagat gctgagaacg ggcaacaact ttaccttcag ctacaccttt gaggaagtgc     3480

ctttccacag cagctacgcg cacagccaga gcctggaccg gctgatgaat cctctcatcg     3540

accaatacct gtattacctg aacagaactc aaaatcagtc cggaagtgcc caaaacaagg     3600

acttgctgtt tagccgtggg tctccagctg gcatgtctgt tcagcccaaa aactggctac     3660

ctggaccctg ttatcggcag cagcgcgttt ctaaaacaaa aacagacaac aacaacagca     3720

attttacctg gactggtgct tcaaaatata acctcaatgg gcgtgaatcc atcatcaacc     3780

ctggcactgc tatggcctca cacaaagacg acgaagacaa gttctttccc atgagcggtg     3840

tcatgatttt tggaaaagag agcgccggag cttcaaacac tgcattggac aatgtcatga     3900

ttacagacga agaggaaatt aaagccacta accctgtggc caccgaaaga tttgggaccg     3960

tggcagtcaa tttccagagc agcagcacag accctgcgac cggagatgtg catgctatgg     4020

gagcattacc tggcatggtg tggcaagata gagacgtgta cctgcagggt cccatttggg     4080

ccaaaattcc tcacacagat ggacactttc acccgtctcc tcttatgggc ggctttggac     4140

tcaagaaccc gcctcctcag atcctcatca aaaacacgcc tgttcctgcg aatcctccgg     4200

cggagttttc agctacaaag tttgcttcat tcatcaccca atactccaca ggacaagtga     4260

gtgtggaaat tgaatgggag ctgcagaaag aaaacagcaa gcgctggaat cccgaagtgc     4320

agtacacatc caattatgca aaatctgcca acgttgattt tactgtggac aacaatggac     4380

tttatactga gcctcgcccc attggcaccc gttaccttac ccgtcccctg taattacgtg     4440

ttaatcaata aaccggttga ttcgtttcag ttgaactttg gtctcctgtc cttcttatct     4500

tatcggttac catggttata gcttacacat taactgcttg gttgcgcttc gcgataaaag     4560

acttacgtca tcgggttacc cctagtgatg gagttgccca ctccctctct gcgcgctcgc     4620

tcgctcggtg gggcctgcgg accaaaggtc cgcagacggc agagctctgc tctgccggcc     4680

ccaccgagcg agcgagcgcg cagagaggga gtgggcaa                             4718


<210> 80
<211> 1872
<212> DNA
<213> Adeno-associated virus 1

<400> 80
atgccgggct tctacgagat cgtgatcaag gtgccgagcg acctggacga gcacctgccg       60

ggcatttctg actcgtttgt gagctgggtg gccgagaagg aatgggagct gcccccggat      120

tctgacatgg atctgaatct gattgagcag gcacccctga ccgtggccga gaagctgcag      180

cgcgacttcc tggtccaatg gcgccgcgtg agtaaggccc cggaggccct cttctttgtt      240

cagttcgaga agggcgagtc ctacttccac ctccatattc tggtggagac cacgggggtc      300

aaatccatgg tgctgggccg cttcctgagt cagattaggg acaagctggt gcagaccatc      360

taccgcggga tcgagccgac cctgcccaac tggttcgcgg tgaccaagac gcgtaatggc      420

gccggagggg ggaacaaggt ggtggacgag tgctacatcc ccaactacct cctgcccaag      480

actcagcccg agctgcagtg ggcgtggact aacatggagg agtatataag cgcctgtttg      540

aacctggccg agcgcaaacg gctcgtggcg cagcacctga cccacgtcag ccagacccag      600

gagcagaaca aggagaatct gaaccccaat tctgacgcgc ctgtcatccg gtcaaaaacc      660

tccgcgcgct acatggagct ggtcgggtgg ctggtggacc ggggcatcac ctccgagaag      720

cagtggatcc aggaggacca ggcctcgtac atctccttca acgccgcttc caactcgcgg      780

tcccagatca aggccgctct ggacaatgcc ggcaagatca tggcgctgac caaatccgcg      840

cccgactacc tggtaggccc cgctccgccc gcggacatta aaaccaaccg catctaccgc      900

atcctggagc tgaacggcta cgaacctgcc tacgccggct ccgtctttct cggctgggcc      960

cagaaaaggt tcgggaagcg caacaccatc tggctgtttg ggccggccac cacgggcaag     1020

accaacatcg cggaagccat cgcccacgcc gtgcccttct acggctgcgt caactggacc     1080

aatgagaact ttcccttcaa tgattgcgtc gacaagatgg tgatctggtg ggaggagggc     1140

aagatgacgg ccaaggtcgt ggagtccgcc aaggccattc tcggcggcag caaggtgcgc     1200

gtggaccaaa agtgcaagtc gtccgcccag atcgacccca cccccgtgat cgtcacctcc     1260

aacaccaaca tgtgcgccgt gattgacggg aacagcacca ccttcgagca ccagcagccg     1320

ttgcaggacc ggatgttcaa atttgaactc acccgccgtc tggagcatga ctttggcaag     1380

gtgacaaagc aggaagtcaa agagttcttc cgctgggcgc aggatcacgt gaccgaggtg     1440

gcgcatgagt tctacgtcag aaagggtgga gccaacaaaa gacccgcccc cgatgacgcg     1500

gataaaagcg agcccaagcg ggcctgcccc tcagtcgcgg atccatcgac gtcagacgcg     1560

gaaggagctc cggtggactt tgccgacagg taccaaaaca aatgttctcg tcacgcgggc     1620

atgcttcaga tgctgtttcc ctgcaagaca tgcgagagaa tgaatcagaa tttcaacatt     1680

tgcttcacgc acgggacgag agactgttca gagtgcttcc ccggcgtgtc agaatctcaa     1740

ccggtcgtca gaaagaggac gtatcggaaa ctctgtgcca ttcatcatct gctggggcgg     1800

gctcccgaga ttgcttgctc ggcctgcgat ctggtcaacg tggacctgga tgactgtgtt     1860

tctgagcaat aa                                                         1872


<210> 81
<211> 2211
<212> DNA
<213> Adeno-associated virus 1

<400> 81
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg atccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg atttcaacag attccactgc cacttttcac cacgtgactg gcagcgactc      900

atcaacaaca attggggatt ccggcccaag agactcaact tcaaactctt caacatccaa      960

gtcaaggagg tcacgacgaa tgatggcgtc acaaccatcg ctaataacct taccagcacg     1020

gttcaagtct tctcggactc ggagtaccag cttccgtacg tcctcggctc tgcgcaccag     1080

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg ctacctgacg     1140

ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgcctgga atatttccct     1200

tctcagatgc tgagaacggg caacaacttt accttcagct acacctttga ggaagtgcct     1260

ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac     1320

caatacctgt attacctgaa cagaactcaa aatcagtccg gaagtgccca aaacaaggac     1380

ttgctgttta gccgtgggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct     1440

ggaccctgtt atcggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaat     1500

tttacctgga ctggtgcttc aaaatataac ctcaatgggc gtgaatccat catcaaccct     1560

ggcactgcta tggcctcaca caaagacgac gaagacaagt tctttcccat gagcggtgtc     1620

atgatttttg gaaaagagag cgccggagct tcaaacactg cattggacaa tgtcatgatt     1680

acagacgaag aggaaattaa agccactaac cctgtggcca ccgaaagatt tgggaccgtg     1740

gcagtcaatt tccagagcag cagcacagac cctgcgaccg gagatgtgca tgctatggga     1800

gcattacctg gcatggtgtg gcaagataga gacgtgtacc tgcagggtcc catttgggcc     1860

aaaattcctc acacagatgg acactttcac ccgtctcctc ttatgggcgg ctttggactc     1920

aagaacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggcg     1980

gagttttcag ctacaaagtt tgcttcattc atcacccaat actccacagg acaagtgagt     2040

gtggaaattg aatgggagct gcagaaagaa aacagcaagc gctggaatcc cgaagtgcag     2100

tacacatcca attatgcaaa atctgccaac gttgatttta ctgtggacaa caatggactt     2160

tatactgagc ctcgccccat tggcacccgt taccttaccc gtcccctgta a              2211


<210> 82
<211> 8376
<212> DNA
<213> Adeno-associated virus 2

<400> 82
aattcccatc atcaataata taccttattt tggattgaag ccaatatgat aatgaggggg       60

tggagtttgt gacgtggcgc ggggcgtggg aacggggcgg gtgacgtagt agtctctaga      120

gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat gtggtcacgc      180

tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga ggtttgaacg      240

cgcagccacc acgccggggt tttacgagat tgtgattaag gtccccagcg accttgacgg      300

gcatctgccc ggcatttctg acagctttgt gaactgggtg gccgagaagg aatgggagtt      360

gccgccagat tctgacatgg atctgaatct gattgagcag gcacccctga ccgtggccga      420

gaagctgcag cgcgactttc tgacggaatg gcgccgtgtg agtaaggccc cggaggccct      480

tttctttgtg caatttgaga agggagagag ctacttccac atgcacgtgc tcgtggaaac      540

caccggggtg aaatccatgg ttttgggacg tttcctgagt cagattcgcg aaaaactgat      600

tcagagaatt taccgcggga tcgagccgac tttgccaaac tggttcgcgg tcacaaagac      660

cagaaatggc gccggaggcg ggaacaaggt ggtggatgag tgctacatcc ccaattactt      720

gctccccaaa acccagcctg agctccagtg ggcgtggact aatatggaac agtatttaag      780

cgcctgtttg aatctcacgg agcgtaaacg gttggtggcg cagcatctga cgcacgtgtc      840

gcagacgcag gagcagaaca aagagaatca gaatcccaat tctgatgcgc cggtgatcag      900

atcaaaaact tcagccaggt acatggagct ggtcgggtgg ctcgtggaca aggggattac      960

ctcggagaag cagtggatcc aggaggacca ggcctcatac atctccttca atgcggcctc     1020

caactcgcgg tcccaaatca aggctgcctt ggacaatgcg ggaaagatta tgagcctgac     1080

taaaaccgcc cccgactacc tggtgggcca gcagcccgtg gaggacattt ccagcaatcg     1140

gatttataaa attttggaac taaacgggta cgatccccaa tatgcggctt ccgtctttct     1200

gggatgggcc acgaaaaagt tcggcaagag gaacaccatc tggctgtttg ggcctgcaac     1260

taccgggaag accaacatcg cggaggccat agcccacact gtgcccttct acgggtgcgt     1320

aaactggacc aatgagaact ttcccttcaa cgactgtgtc gacaagatgg tgatctggtg     1380

ggaggagggg aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc tcggaggaag     1440

caaggtgcgc gtggaccaga aatgcaagtc ctcggcccag atagacccga ctcccgtgat     1500

cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg aactcaacga ccttcgaaca     1560

ccagcagccg ttgcaagacc ggatgttcaa atttgaactc acccgccgtc tggatcatga     1620

ctttgggaag gtcaccaagc aggaagtcaa agactttttc cggtgggcaa aggatcacgt     1680

ggttgaggtg gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa gacccgcccc     1740

cagtgacgca gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc agccatcgac     1800

gtcagacgcg gaagcttcga tcaactacgc agacaggtac caaaacaaat gttctcgtca     1860

cgtgggcatg aatctgatgc tgtttccctg cagacaatgc gagagaatga atcagaattc     1920

aaatatctgc ttcactcacg gacagaaaga ctgtttagag tgctttcccg tgtcagaatc     1980

tcaacccgtt tctgtcgtca aaaaggcgta tcagaaactg tgctacattc atcatatcat     2040

gggaaaggtg ccagacgctt gcactgcctg cgatctggtc aatgtggatt tggatgactg     2100

catctttgaa caataaatga tttaaatcag gtatggctgc cgatggttat cttccagatt     2160

ggctcgagga cactctctct gaaggaataa gacagtggtg gaagctcaaa cctggcccac     2220

caccaccaaa gcccgcagag cggcataagg acgacagcag gggtcttgtg cttcctgggt     2280

acaagtacct cggacccttc aacggactcg acaagggaga gccggtcaac gaggcagacg     2340

ccgcggccct cgagcacgac aaagcctacg accggcagct cgacagcgga gacaacccgt     2400

acctcaagta caaccacgcc gacgcggagt ttcaggagcg ccttaaagaa gatacgtctt     2460

ttgggggcaa cctcggacga gcagtcttcc aggcgaaaaa gagggttctt gaacctctgg     2520

gcctggttga ggaacctgtt aagacggctc cgggaaaaaa gaggccggta gagcactctc     2580

ctgtggagcc agactcctcc tcgggaaccg gaaaggcggg ccagcagcct gcaagaaaaa     2640

gattgaattt tggtcagact ggagacgcag actcagtacc tgacccccag cctctcggac     2700

agccaccagc agccccctct ggtctgggaa ctaatacgat ggctacaggc agtggcgcac     2760

caatggcaga caataacgag ggcgccgacg gagtgggtaa ttcctcggga aattggcatt     2820

gcgattccac atggatgggc gacagagtca tcaccaccag cacccgaacc tgggccctgc     2880

ccacctacaa caaccacctc tacaaacaaa tttccagcca atcaggagcc tcgaacgaca     2940

atcactactt tggctacagc accccttggg ggtattttga cttcaacaga ttccactgcc     3000

acttttcacc acgtgactgg caaagactca tcaacaacaa ctggggattc cgacccaaga     3060

gactcaactt caagctcttt aacattcaag tcaaagaggt cacgcagaat gacggtacga     3120

cgacgattgc caataacctt accagcacgg ttcaggtgtt tactgactcg gagtaccagc     3180

tcccgtacgt cctcggctcg gcgcatcaag gatgcctccc gccgttccca gcagacgtct     3240

tcatggtgcc acagtatgga tacctcaccc tgaacaacgg gagtcaggca gtaggacgct     3300

cttcatttta ctgcctggag tactttcctt ctcagatgct gcgtaccgga aacaacttta     3360

ccttcagcta cacttttgag gacgttcctt tccacagcag ctacgctcac agccagagtc     3420

tggaccgtct catgaatcct ctcatcgacc agtacctgta ttacttgagc agaacaaaca     3480

ctccaagtgg aaccaccacg cagtcaaggc ttcagttttc tcaggccgga gcgagtgaca     3540

ttcgggacca gtctaggaac tggcttcctg gaccctgtta ccgccagcag cgagtatcaa     3600

agacatctgc ggataacaac aacagtgaat actcgtggac tggagctacc aagtaccacc     3660

tcaatggcag agactctctg gtgaatccgg gcccggccat ggcaagccac aaggacgatg     3720

aagaaaagtt ttttcctcag agcggggttc tcatctttgg gaagcaaggc tcagagaaaa     3780

caaatgtgga cattgaaaag gtcatgatta cagacgaaga ggaaatcagg acaaccaatc     3840

ccgtggctac ggagcagtat ggttctgtat ctaccaacct ccagagaggc aacagacaag     3900

cagctaccgc agatgtcaac acacaaggcg ttcttccagg catggtctgg caggacagag     3960

atgtgtacct tcaggggccc atctgggcaa agattccaca cacggacgga cattttcacc     4020

cctctcccct catgggtgga ttcggactta aacaccctcc tccacagatt ctcatcaaga     4080

acaccccggt acctgcgaat ccttcgacca ccttcagtgc ggcaaagttt gcttccttca     4140

tcacacagta ctccacggga caggtcagcg tggagatcga gtgggagctg cagaaggaaa     4200

acagcaaacg ctggaatccc gaaattcagt acacttccaa ctacaacaag tctgttaatg     4260

tggactttac tgtggacact aatggcgtgt attcagagcc tcgccccatt ggcaccagat     4320

acctgactcg taatctgtaa ttgcttgtta atcaataaac cgtttaattc gtttcagttg     4380

aactttggtc tctgcgtatt tctttcttat ctagtttcca tgctctagag tcctgtatta     4440

gaggtcacgt gagtgttttg cgacattttg cgacaccatg tggtcacgct gggtatttaa     4500

gcccgagtga gcacgcaggg tctccatttt gaagcgggag gtttgaacgc gcagccacca     4560

cggcggggtt ttacgagatt gtgattaagg tccccagcga ccttgacggg catctgcccg     4620

gcatttctga cagctttgtg aactgggtgg ccgagaagga atgggagttg ccgccagatt     4680

ctgacatgga tctgaatctg attgagcagg cacccctgac cgtggccgag aagctgcatc     4740

gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga     4800

atggcgaatg gaattccaga cgattgagcg tcaaaatgta ggtatttcca tgagcgtttt     4860

tcctgttgca atggctggcg gtaatattgt tctggatatt accagcaagg ccgatagttt     4920

gagttcttct actcaggcaa gtgatgttat tactaatcaa agaagtattg cgacaacggt     4980

taatttgcgt gatggacaga ctcttttact cggtggcctc actgattata aaaacacttc     5040

tcaggattct ggcgtaccgt tcctgtctaa aatcccttta atcggcctcc tgtttagctc     5100

ccgctctgat tctaacgagg aaagcacgtt atacgtgctc gtcaaagcaa ccatagtacg     5160

cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta     5220

cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt     5280

tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg     5340

ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat     5400

cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac     5460

tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag     5520

ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg     5580

cgaattttaa caaaatatta acgtttacaa tttaaatatt tgcttataca atcttcctgt     5640

ttttggggct tttctgatta tcaaccgggg tacatatgat tgacatgcta gttttacgat     5700

taccgttcat cgattctctt gtttgctcca gactctcagg caatgacctg atagcctttg     5760

tagagacctc tcaaaaatag ctaccctctc cggcatgaat ttatcagcta gaacggttga     5820

atatcatatt gatggtgatt tgactgtctc cggcctttct cacccgtttg aatctttacc     5880

tacacattac tcaggcattg catttaaaat atatgagggt tctaaaaatt tttatccttg     5940

cgttgaaata aaggcttctc ccgcaaaagt attacagggt cataatgttt ttggtacaac     6000

cgatttagct ttatgctctg aggctttatt gcttaatttt gctaattctt tgccttgcct     6060

gtatgattta ttggatgttg gaattcctga tgcggtattt tctccttacg catctgtgcg     6120

gtatttcaca ccgcatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa     6180

gccagccccg acacccgcca acacccgctg acgcgccctg acgggcttgt ctgctcccgg     6240

catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag aggttttcac     6300

cgtcatcacc gaaacgcgcg agacgaaagg gcctcgtgat acgcctattt ttataggtta     6360

atgtcatgat aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg     6420

gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat     6480

aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc     6540

gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa     6600

cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac     6660

tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga     6720

tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag     6780

agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca     6840

cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca     6900

tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa     6960

ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc     7020

tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa     7080

cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag     7140

actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct     7200

ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac     7260

tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa     7320

ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt     7380

aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat     7440

ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg     7500

agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc     7560

ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg     7620

tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag     7680

cgcagatacc aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact     7740

ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg     7800

gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc     7860

ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg     7920

aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg     7980

cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag     8040

ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc     8100

gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct     8160

ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc     8220

ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc     8280

gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac     8340

cgcctctccc cgcgcgttgg ccgattcatt aatgca                               8376


<210> 83
<211> 1882
<212> DNA
<213> Adeno-associated virus 2

<400> 83
acgccggggt tttacgagat tgtgattaag gtccccagcg accttgacgg gcatctgccc       60

ggcatttctg acagctttgt gaactgggtg gccgagaagg aatgggagtt gccgccagat      120

tctgacatgg atctgaatct gattgagcag gcacccctga ccgtggccga gaagctgcag      180

cgcgactttc tgacggaatg gcgccgtgtg agtaaggccc cggaggccct tttctttgtg      240

caatttgaga agggagagag ctacttccac atgcacgtgc tcgtggaaac caccggggtg      300

aaatccatgg ttttgggacg tttcctgagt cagattcgcg aaaaactgat tcagagaatt      360

taccgcggga tcgagccgac tttgccaaac tggttcgcgg tcacaaagac cagaaatggc      420

gccggaggcg ggaacaaggt ggtggatgag tgctacatcc ccaattactt gctccccaaa      480

acccagcctg agctccagtg ggcgtggact aatatggaac agtatttaag cgcctgtttg      540

aatctcacgg agcgtaaacg gttggtggcg cagcatctga cgcacgtgtc gcagacgcag      600

gagcagaaca aagagaatca gaatcccaat tctgatgcgc cggtgatcag atcaaaaact      660

tcagccaggt acatggagct ggtcgggtgg ctcgtggaca aggggattac ctcggagaag      720

cagtggatcc aggaggacca ggcctcatac atctccttca atgcggcctc caactcgcgg      780

tcccaaatca aggctgcctt ggacaatgcg ggaaagatta tgagcctgac taaaaccgcc      840

cccgactacc tggtgggcca gcagcccgtg gaggacattt ccagcaatcg gatttataaa      900

attttggaac taaacgggta cgatccccaa tatgcggctt ccgtctttct gggatgggcc      960

acgaaaaagt tcggcaagag gaacaccatc tggctgtttg ggcctgcaac taccgggaag     1020

accaacatcg cggaggccat agcccacact gtgcccttct acgggtgcgt aaactggacc     1080

aatgagaact ttcccttcaa cgactgtgtc gacaagatgg tgatctggtg ggaggagggg     1140

aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc tcggaggaag caaggtgcgc     1200

gtggaccaga aatgcaagtc ctcggcccag atagacccga ctcccgtgat cgtcacctcc     1260

aacaccaaca tgtgcgccgt gattgacggg aactcaacga ccttcgaaca ccagcagccg     1320

ttgcaagacc ggatgttcaa atttgaactc acccgccgtc tggatcatga ctttgggaag     1380

gtcaccaagc aggaagtcaa agactttttc cggtgggcaa aggatcacgt ggttgaggtg     1440

gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa gacccgcccc cagtgacgca     1500

gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc agccatcgac gtcagacgcg     1560

gaagcttcga tcaactacgc agacaggtac caaaacaaat gttctcgtca cgtgggcatg     1620

aatctgatgc tgtttccctg cagacaatgc gagagaatga atcagaattc aaatatctgc     1680

ttcactcacg gacagaaaga ctgtttagag tgctttcccg tgtcagaatc tcaacccgtt     1740

tctgtcgtca aaaaggcgta tcagaaactg tgctacattc atcatatcat gggaaaggtg     1800

ccagacgctt gcactgcctg cgatctggtc aatgtggatt tggatgactg catctttgaa     1860

caataaatga tttaaatcag gt                                              1882


<210> 84
<211> 2208
<212> DNA
<213> Adeno-associated virus 2

<400> 84
atggctgccg atggttatct tccagattgg ctcgaggaca ctctctctga aggaataaga       60

cagtggtgga agctcaaacc tggcccacca ccaccaaagc ccgcagagcg gcataaggac      120

gacagcaggg gtcttgtgct tcctgggtac aagtacctcg gacccttcaa cggactcgac      180

aagggagagc cggtcaacga ggcagacgcc gcggccctcg agcacgacaa agcctacgac      240

cggcagctcg acagcggaga caacccgtac ctcaagtaca accacgccga cgcggagttt      300

caggagcgcc ttaaagaaga tacgtctttt gggggcaacc tcggacgagc agtcttccag      360

gcgaaaaaga gggttcttga acctctgggc ctggttgagg aacctgttaa gacggctccg      420

ggaaaaaaga ggccggtaga gcactctcct gtggagccag actcctcctc gggaaccgga      480

aaggcgggcc agcagcctgc aagaaaaaga ttgaattttg gtcagactgg agacgcagac      540

tcagtacctg acccccagcc tctcggacag ccaccagcag ccccctctgg tctgggaact      600

aatacgatgg ctacaggcag tggcgcacca atggcagaca ataacgaggg cgccgacgga      660

gtgggtaatt cctcgggaaa ttggcattgc gattccacat ggatgggcga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca accacctcta caaacaaatt      780

tccagccaat caggagcctc gaacgacaat cactactttg gctacagcac cccttggggg      840

tattttgact tcaacagatt ccactgccac ttttcaccac gtgactggca aagactcatc      900

aacaacaact ggggattccg acccaagaga ctcaacttca agctctttaa cattcaagtc      960

aaagaggtca cgcagaatga cggtacgacg acgattgcca ataaccttac cagcacggtt     1020

caggtgttta ctgactcgga gtaccagctc ccgtacgtcc tcggctcggc gcatcaagga     1080

tgcctcccgc cgttcccagc agacgtcttc atggtgccac agtatggata cctcaccctg     1140

aacaacggga gtcaggcagt aggacgctct tcattttact gcctggagta ctttccttct     1200

cagatgctgc gtaccggaaa caactttacc ttcagctaca cttttgagga cgttcctttc     1260

cacagcagct acgctcacag ccagagtctg gaccgtctca tgaatcctct catcgaccag     1320

tacctgtatt acttgagcag aacaaacact ccaagtggaa ccaccacgca gtcaaggctt     1380

cagttttctc aggccggagc gagtgacatt cgggaccagt ctaggaactg gcttcctgga     1440

ccctgttacc gccagcagcg agtatcaaag acatctgcgg ataacaacaa cagtgaatac     1500

tcgtggactg gagctaccaa gtaccacctc aatggcagag actctctggt gaatccgggc     1560

ccggccatgg caagccacaa ggacgatgaa gaaaagtttt ttcctcagag cggggttctc     1620

atctttggga agcaaggctc agagaaaaca aatgtggaca ttgaaaaggt catgattaca     1680

gacgaagagg aaatcaggac aaccaatccc gtggctacgg agcagtatgg ttctgtatct     1740

accaacctcc agagaggcaa cagacaagca gctaccgcag atgtcaacac acaaggcgtt     1800

cttccaggca tggtctggca ggacagagat gtgtaccttc aggggcccat ctgggcaaag     1860

attccacaca cggacggaca ttttcacccc tctcccctca tgggtggatt cggacttaaa     1920

caccctcctc cacagattct catcaagaac accccggtac ctgcgaatcc ttcgaccacc     1980

ttcagtgcgg caaagtttgc ttccttcatc acacagtact ccacgggaca ggtcagcgtg     2040

gagatcgagt gggagctgca gaaggaaaac agcaaacgct ggaatcccga aattcagtac     2100

acttccaact acaacaagtc tgttaatgtg gactttactg tggacactaa tggcgtgtat     2160

tcagagcctc gccccattgg caccagatac ctgactcgta atctgtaa                  2208


<210> 85
<211> 4726
<212> DNA
<213> Adeno-associated virus 3

<400> 85
ttggccactc cctctatgcg cactcgctcg ctcggtgggg cctggcgacc aaaggtcgcc       60

agacggacgt gctttgcacg tccggcccca ccgagcgagc gagtgcgcat agagggagtg      120

gccaactcca tcactagagg tatggcagtg acgtaacgcg aagcgcgcga agcgagacca      180

cgcctaccag ctgcgtcagc agtcaggtga cccttttgcg acagtttgcg acaccacgtg      240

gccgctgagg gtatatattc tcgagtgagc gaaccaggag ctccattttg accgcgaaat      300

ttgaacgagc agcagccatg ccggggttct acgagattgt cctgaaggtc ccgagtgacc      360

tggacgagcg cctgccgggc atttctaact cgtttgttaa ctgggtggcc gagaaggaat      420

gggacgtgcc gccggattct gacatggatc cgaatctgat tgagcaggca cccctgaccg      480

tggccgaaaa gcttcagcgc gagttcctgg tggagtggcg ccgcgtgagt aaggccccgg      540

aggccctctt ttttgtccag ttcgaaaagg gggagaccta cttccacctg cacgtgctga      600

ttgagaccat cggggtcaaa tccatggtgg tcggccgcta cgtgagccag attaaagaga      660

agctggtgac ccgcatctac cgcggggtcg agccgcagct tccgaactgg ttcgcggtga      720

ccaaaacgcg aaatggcgcc gggggcggga acaaggtggt ggacgactgc tacatcccca      780

actacctgct ccccaagacc cagcccgagc tccagtgggc gtggactaac atggaccagt      840

atttaagcgc ctgtttgaat ctcgcggagc gtaaacggct ggtggcgcag catctgacgc      900

acgtgtcgca gacgcaggag cagaacaaag agaatcagaa ccccaattct gacgcgccgg      960

tcatcaggtc aaaaacctca gccaggtaca tggagctggt cgggtggctg gtggaccgcg     1020

ggatcacgtc agaaaagcaa tggattcagg aggaccaggc ctcgtacatc tccttcaacg     1080

ccgcctccaa ctcgcggtcc cagatcaagg ccgcgctgga caatgcctcc aagatcatga     1140

gcctgacaaa gacggctccg gactacctgg tgggcagcaa cccgccggag gacattacca     1200

aaaatcggat ctaccaaatc ctggagctga acgggtacga tccgcagtac gcggcctccg     1260

tcttcctggg ctgggcgcaa aagaagttcg ggaagaggaa caccatctgg ctctttgggc     1320

cggccacgac gggtaaaacc aacatcgcgg aagccatcgc ccacgccgtg cccttctacg     1380

gctgcgtaaa ctggaccaat gagaactttc ccttcaacga ttgcgtcgac aagatggtga     1440

tctggtggga ggagggcaag atgacggcca aggtcgtgga gagcgccaag gccattctgg     1500

gcggaagcaa ggtgcgcgtg gaccaaaagt gcaagtcatc ggcccagatc gaacccactc     1560

ccgtgatcgt cacctccaac accaacatgt gcgccgtgat tgacgggaac agcaccacct     1620

tcgagcatca gcagccgctg caggaccgga tgtttgaatt tgaacttacc cgccgtttgg     1680

accatgactt tgggaaggtc accaaacagg aagtaaagga ctttttccgg tgggcttccg     1740

atcacgtgac tgacgtggct catgagttct acgtcagaaa gggtggagct aagaaacgcc     1800

ccgcctccaa tgacgcggat gtaagcgagc caaaacggga gtgcacgtca cttgcgcagc     1860

cgacaacgtc agacgcggaa gcaccggcgg actacgcgga caggtaccaa aacaaatgtt     1920

ctcgtcacgt gggcatgaat ctgatgcttt ttccctgtaa aacatgcgag agaatgaatc     1980

aaatttccaa tgtctgtttt acgcatggtc aaagagactg tggggaatgc ttccctggaa     2040

tgtcagaatc tcaacccgtt tctgtcgtca aaaagaagac ttatcagaaa ctgtgtccaa     2100

ttcatcatat cctgggaagg gcacccgaga ttgcctgttc ggcctgcgat ttggccaatg     2160

tggacttgga tgactgtgtt tctgagcaat aaatgactta aaccaggtat ggctgctgac     2220

ggttatcttc cagattggct cgaggacaac ctttctgaag gcattcgtga gtggtgggct     2280

ctgaaacctg gagtccctca acccaaagcg aaccaacaac accaggacaa ccgtcggggt     2340

cttgtgcttc cgggttacaa atacctcgga cccggtaacg gactcgacaa aggagagccg     2400

gtcaacgagg cggacgcggc agccctcgaa cacgacaaag cttacgacca gcagctcaag     2460

gccggtgaca acccgtacct caagtacaac cacgccgacg ccgagtttca ggagcgtctt     2520

caagaagata cgtcttttgg gggcaacctt ggcagagcag tcttccaggc caaaaagagg     2580

atccttgagc ctcttggtct ggttgaggaa gcagctaaaa cggctcctgg aaagaagggg     2640

gctgtagatc agtctcctca ggaaccggac tcatcatctg gtgttggcaa atcgggcaaa     2700

cagcctgcca gaaaaagact aaatttcggt cagactggag actcagagtc agtcccagac     2760

cctcaacctc tcggagaacc accagcagcc cccacaagtt tgggatctaa tacaatggct     2820

tcaggcggtg gcgcaccaat ggcagacaat aacgagggtg ccgatggagt gggtaattcc     2880

tcaggaaatt ggcattgcga ttcccaatgg ctgggcgaca gagtcatcac caccagcacc     2940

agaacctggg ccctgcccac ttacaacaac catctctaca agcaaatctc cagccaatca     3000

ggagcttcaa acgacaacca ctactttggc tacagcaccc cttgggggta ttttgacttt     3060

aacagattcc actgccactt ctcaccacgt gactggcagc gactcattaa caacaactgg     3120

ggattccggc ccaagaaact cagcttcaag ctcttcaaca tccaagttag aggggtcacg     3180

cagaacgatg gcacgacgac tattgccaat aaccttacca gcacggttca agtgtttacg     3240

gactcggagt atcagctccc gtacgtgctc gggtcggcgc accaaggctg tctcccgccg     3300

tttccagcgg acgtcttcat ggtccctcag tatggatacc tcaccctgaa caacggaagt     3360

caagcggtgg gacgctcatc cttttactgc ctggagtact tcccttcgca gatgctaagg     3420

actggaaata acttccaatt cagctatacc ttcgaggatg taccttttca cagcagctac     3480

gctcacagcc agagtttgga tcgcttgatg aatcctctta ttgatcagta tctgtactac     3540

ctgaacagaa cgcaaggaac aacctctgga acaaccaacc aatcacggct gctttttagc     3600

caggctgggc ctcagtctat gtctttgcag gccagaaatt ggctacctgg gccctgctac     3660

cggcaacaga gactttcaaa gactgctaac gacaacaaca acagtaactt tccttggaca     3720

gcggccagca aatatcatct caatggccgc gactcgctgg tgaatccagg accagctatg     3780

gccagtcaca aggacgatga agaaaaattt ttccctatgc acggcaatct aatatttggc     3840

aaagaaggga caacggcaag taacgcagaa ttagataatg taatgattac ggatgaagaa     3900

gagattcgta ccaccaatcc tgtggcaaca gagcagtatg gaactgtggc aaataacttg     3960

cagagctcaa atacagctcc cacgactgga actgtcaatc atcagggggc cttacctggc     4020

atggtgtggc aagatcgtga cgtgtacctt caaggaccta tctgggcaaa gattcctcac     4080

acggatggac actttcatcc ttctcctctg atgggaggct ttggactgaa acatccgcct     4140

cctcaaatca tgatcaaaaa tactccggta ccggcaaatc ctccgacgac tttcagcccg     4200

gccaagtttg cttcatttat cactcagtac tccactggac aggtcagcgt ggaaattgag     4260

tgggagctac agaaagaaaa cagcaaacgt tggaatccag agattcagta cacttccaac     4320

tacaacaagt ctgttaatgt ggactttact gtagacacta atggtgttta tagtgaacct     4380

cgccctattg gaacccggta tctcacacga aacttgtgaa tcctggttaa tcaataaacc     4440

gtttaattcg tttcagttga actttggctc ttgtgcactt ctttatcttt atcttgtttc     4500

catggctact gcgtagataa gcagcggcct gcggcgcttg cgcttcgcgg tttacaactg     4560

ctggttaata tttaactctc gccatacctc tagtgatgga gttggccact ccctctatgc     4620

gcactcgctc gctcggtggg gcctggcgac caaaggtcgc cagacggacg tgctttgcac     4680

gtccggcccc accgagcgag cgagtgcgca tagagggagt ggccaa                    4726


<210> 86
<211> 1812
<212> DNA
<213> Adeno-associated virus 3

<400> 86
atgccggggt tctacgagat tgtcctgaag gtcccgagtg acctggacga gcgcctgccg       60

ggcatttcta actcgtttgt taactgggtg gccgagaagg aatgggacgt gccgccggat      120

tctgacatgg atccgaatct gattgagcag gcacccctga ccgtggccga aaagcttcag      180

cgcgagttcc tggtggagtg gcgccgcgtg agtaaggccc cggaggccct cttttttgtc      240

cagttcgaaa agggggagac ctacttccac ctgcacgtgc tgattgagac catcggggtc      300

aaatccatgg tggtcggccg ctacgtgagc cagattaaag agaagctggt gacccgcatc      360

taccgcgggg tcgagccgca gcttccgaac tggttcgcgg tgaccaaaac gcgaaatggc      420

gccgggggcg ggaacaaggt ggtggacgac tgctacatcc ccaactacct gctccccaag      480

acccagcccg agctccagtg ggcgtggact aacatggacc agtatttaag cgcctgtttg      540

aatctcgcgg agcgtaaacg gctggtggcg cagcatctga cgcacgtgtc gcagacgcag      600

gagcagaaca aagagaatca gaaccccaat tctgacgcgc cggtcatcag gtcaaaaacc      660

tcagccaggt acatggagct ggtcgggtgg ctggtggacc gcgggatcac gtcagaaaag      720

caatggattc aggaggacca ggcctcgtac atctccttca acgccgcctc caactcgcgg      780

tcccagatca aggccgcgct ggacaatgcc tccaagatca tgagcctgac aaagacggct      840

ccggactacc tggtgggcag caacccgccg gaggacatta ccaaaaatcg gatctaccaa      900

atcctggagc tgaacgggta cgatccgcag tacgcggcct ccgtcttcct gggctgggcg      960

caaaagaagt tcgggaagag gaacaccatc tggctctttg ggccggccac gacgggtaaa     1020

accaacatcg cggaagccat cgcccacgcc gtgcccttct acggctgcgt aaactggacc     1080

aatgagaact ttcccttcaa cgattgcgtc gacaagatgg tgatctggtg ggaggagggc     1140

aagatgacgg ccaaggtcgt ggagagcgcc aaggccattc tgggcggaag caaggtgcgc     1200

gtggaccaaa agtgcaagtc atcggcccag atcgaaccca ctcccgtgat cgtcacctcc     1260

aacaccaaca tgtgcgccgt gattgacggg aacagcacca ccttcgagca tcagcagccg     1320

ctgcaggacc ggatgtttga atttgaactt acccgccgtt tggaccatga ctttgggaag     1380

gtcaccaaac aggaagtaaa ggactttttc cggtgggctt ccgatcacgt gactgacgtg     1440

gctcatgagt tctacgtcag aaagggtgga gctaagaaac gccccgcctc caatgacgcg     1500

gatgtaagcg agccaaaacg ggagtgcacg tcacttgcgc agccgacaac gtcagacgcg     1560

gaagcaccgg cggactacgc ggacaggtac caaaacaaat gttctcgtca cgtgggcatg     1620

aatctgatgc tttttccctg taaaacatgc gagagaatga atcaaatttc caatgtctgt     1680

tttacgcatg gtcaaagaga ctgtggggaa tgcttccctg gaatgtcaga atctcaaccc     1740

gtttctgtcg tcaaaaagaa gacttatcag aaactgtgtc caattcatca tatcctggga     1800

agggcacccg ag                                                         1812


<210> 87
<211> 2211
<212> DNA
<213> Adeno-associated virus 3

<400> 87
atggctgctg acggttatct tccagattgg ctcgaggaca acctttctga aggcattcgt       60

gagtggtggg ctctgaaacc tggagtccct caacccaaag cgaaccaaca acaccaggac      120

aaccgtcggg gtcttgtgct tccgggttac aaatacctcg gacccggtaa cggactcgac      180

aaaggagagc cggtcaacga ggcggacgcg gcagccctcg aacacgacaa agcttacgac      240

cagcagctca aggccggtga caacccgtac ctcaagtaca accacgccga cgccgagttt      300

caggagcgtc ttcaagaaga tacgtctttt gggggcaacc ttggcagagc agtcttccag      360

gccaaaaaga ggatccttga gcctcttggt ctggttgagg aagcagctaa aacggctcct      420

ggaaagaagg gggctgtaga tcagtctcct caggaaccgg actcatcatc tggtgttggc      480

aaatcgggca aacagcctgc cagaaaaaga ctaaatttcg gtcagactgg agactcagag      540

tcagtcccag accctcaacc tctcggagaa ccaccagcag cccccacaag tttgggatct      600

aatacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaggg tgccgatgga      660

gtgggtaatt cctcaggaaa ttggcattgc gattcccaat ggctgggcga cagagtcatc      720

accaccagca ccagaacctg ggccctgccc acttacaaca accatctcta caagcaaatc      780

tccagccaat caggagcttc aaacgacaac cactactttg gctacagcac cccttggggg      840

tattttgact ttaacagatt ccactgccac ttctcaccac gtgactggca gcgactcatt      900

aacaacaact ggggattccg gcccaagaaa ctcagcttca agctcttcaa catccaagtt      960

agaggggtca cgcagaacga tggcacgacg actattgcca ataaccttac cagcacggtt     1020

caagtgttta cggactcgga gtatcagctc ccgtacgtgc tcgggtcggc gcaccaaggc     1080

tgtctcccgc cgtttccagc ggacgtcttc atggtccctc agtatggata cctcaccctg     1140

aacaacggaa gtcaagcggt gggacgctca tccttttact gcctggagta cttcccttcg     1200

cagatgctaa ggactggaaa taacttccaa ttcagctata ccttcgagga tgtacctttt     1260

cacagcagct acgctcacag ccagagtttg gatcgcttga tgaatcctct tattgatcag     1320

tatctgtact acctgaacag aacgcaagga acaacctctg gaacaaccaa ccaatcacgg     1380

ctgcttttta gccaggctgg gcctcagtct atgtctttgc aggccagaaa ttggctacct     1440

gggccctgct accggcaaca gagactttca aagactgcta acgacaacaa caacagtaac     1500

tttccttgga cagcggccag caaatatcat ctcaatggcc gcgactcgct ggtgaatcca     1560

ggaccagcta tggccagtca caaggacgat gaagaaaaat ttttccctat gcacggcaat     1620

ctaatatttg gcaaagaagg gacaacggca agtaacgcag aattagataa tgtaatgatt     1680

acggatgaag aagagattcg taccaccaat cctgtggcaa cagagcagta tggaactgtg     1740

gcaaataact tgcagagctc aaatacagct cccacgactg gaactgtcaa tcatcagggg     1800

gccttacctg gcatggtgtg gcaagatcgt gacgtgtacc ttcaaggacc tatctgggca     1860

aagattcctc acacggatgg acactttcat ccttctcctc tgatgggagg ctttggactg     1920

aaacatccgc ctcctcaaat catgatcaaa aatactccgg taccggcaaa tcctccgacg     1980

actttcagcc cggccaagtt tgcttcattt atcactcagt actccactgg acaggtcagc     2040

gtggaaattg agtgggagct acagaaagaa aacagcaaac gttggaatcc agagattcag     2100

tacacttcca actacaacaa gtctgttaat gtggacttta ctgtagacac taatggtgtt     2160

tatagtgaac ctcgccctat tggaacccgg tatctcacac gaaacttgtg a              2211


<210> 88
<211> 4767
<212> DNA
<213> Adeno-associated virus 4

<400> 88
ttggccactc cctctatgcg cgctcgctca ctcactcggc cctggagacc aaaggtctcc       60

agactgccgg cctctggccg gcagggccga gtgagtgagc gagcgcgcat agagggagtg      120

gccaactcca tcatctaggt ttgcccactg acgtcaatgt gacgtcctag ggttagggag      180

gtccctgtat tagcagtcac gtgagtgtcg tatttcgcgg agcgtagcgg agcgcatacc      240

aagctgccac gtcacagcca cgtggtccgt ttgcgacagt ttgcgacacc atgtggtcag      300

gagggtatat aaccgcgagt gagccagcga ggagctccat tttgcccgcg aattttgaac      360

gagcagcagc catgccgggg ttctacgaga tcgtgctgaa ggtgcccagc gacctggacg      420

agcacctgcc cggcatttct gactcttttg tgagctgggt ggccgagaag gaatgggagc      480

tgccgccgga ttctgacatg gacttgaatc tgattgagca ggcacccctg accgtggccg      540

aaaagctgca acgcgagttc ctggtcgagt ggcgccgcgt gagtaaggcc ccggaggccc      600

tcttctttgt ccagttcgag aagggggaca gctacttcca cctgcacatc ctggtggaga      660

ccgtgggcgt caaatccatg gtggtgggcc gctacgtgag ccagattaaa gagaagctgg      720

tgacccgcat ctaccgcggg gtcgagccgc agcttccgaa ctggttcgcg gtgaccaaga      780

cgcgtaatgg cgccggaggc gggaacaagg tggtggacga ctgctacatc cccaactacc      840

tgctccccaa gacccagccc gagctccagt gggcgtggac taacatggac cagtatataa      900

gcgcctgttt gaatctcgcg gagcgtaaac ggctggtggc gcagcatctg acgcacgtgt      960

cgcagacgca ggagcagaac aaggaaaacc agaaccccaa ttctgacgcg ccggtcatca     1020

ggtcaaaaac ctccgccagg tacatggagc tggtcgggtg gctggtggac cgcgggatca     1080

cgtcagaaaa gcaatggatc caggaggacc aggcgtccta catctccttc aacgccgcct     1140

ccaactcgcg gtcacaaatc aaggccgcgc tggacaatgc ctccaaaatc atgagcctga     1200

caaagacggc tccggactac ctggtgggcc agaacccgcc ggaggacatt tccagcaacc     1260

gcatctaccg aatcctcgag atgaacgggt acgatccgca gtacgcggcc tccgtcttcc     1320

tgggctgggc gcaaaagaag ttcgggaaga ggaacaccat ctggctcttt gggccggcca     1380

cgacgggtaa aaccaacatc gcggaagcca tcgcccacgc cgtgcccttc tacggctgcg     1440

tgaactggac caatgagaac tttccgttca acgattgcgt cgacaagatg gtgatctggt     1500

gggaggaggg caagatgacg gccaaggtcg tagagagcgc caaggccatc ctgggcggaa     1560

gcaaggtgcg cgtggaccaa aagtgcaagt catcggccca gatcgaccca actcccgtga     1620

tcgtcacctc caacaccaac atgtgcgcgg tcatcgacgg aaactcgacc accttcgagc     1680

accaacaacc actccaggac cggatgttca agttcgagct caccaagcgc ctggagcacg     1740

actttggcaa ggtcaccaag caggaagtca aagacttttt ccggtgggcg tcagatcacg     1800

tgaccgaggt gactcacgag ttttacgtca gaaagggtgg agctagaaag aggcccgccc     1860

ccaatgacgc agatataagt gagcccaagc gggcctgtcc gtcagttgcg cagccatcga     1920

cgtcagacgc ggaagctccg gtggactacg cggacaggta ccaaaacaaa tgttctcgtc     1980

acgtgggtat gaatctgatg ctttttccct gccggcaatg cgagagaatg aatcagaatg     2040

tggacatttg cttcacgcac ggggtcatgg actgtgccga gtgcttcccc gtgtcagaat     2100

ctcaacccgt gtctgtcgtc agaaagcgga cgtatcagaa actgtgtccg attcatcaca     2160

tcatggggag ggcgcccgag gtggcctgct cggcctgcga actggccaat gtggacttgg     2220

atgactgtga catggaacaa taaatgactc aaaccagata tgactgacgg ttaccttcca     2280

gattggctag aggacaacct ctctgaaggc gttcgagagt ggtgggcgct gcaacctgga     2340

gcccctaaac ccaaggcaaa tcaacaacat caggacaacg ctcggggtct tgtgcttccg     2400

ggttacaaat acctcggacc cggcaacgga ctcgacaagg gggaacccgt caacgcagcg     2460

gacgcggcag ccctcgagca cgacaaggcc tacgaccagc agctcaaggc cggtgacaac     2520

ccctacctca agtacaacca cgccgacgcg gagttccagc agcggcttca gggcgacaca     2580

tcgtttgggg gcaacctcgg cagagcagtc ttccaggcca aaaagagggt tcttgaacct     2640

cttggtctgg ttgagcaagc gggtgagacg gctcctggaa agaagagacc gttgattgaa     2700

tccccccagc agcccgactc ctccacgggt atcggcaaaa aaggcaagca gccggctaaa     2760

aagaagctcg ttttcgaaga cgaaactgga gcaggcgacg gaccccctga gggatcaact     2820

tccggagcca tgtctgatga cagtgagatg cgtgcagcag ctggcggagc tgcagtcgag     2880

ggcggacaag gtgccgatgg agtgggtaat gcctcgggtg attggcattg cgattccacc     2940

tggtctgagg gccacgtcac gaccaccagc accagaacct gggtcttgcc cacctacaac     3000

aaccacctct acaagcgact cggagagagc ctgcagtcca acacctacaa cggattctcc     3060

accccctggg gatactttga cttcaaccgc ttccactgcc acttctcacc acgtgactgg     3120

cagcgactca tcaacaacaa ctggggcatg cgacccaaag ccatgcgggt caaaatcttc     3180

aacatccagg tcaaggaggt cacgacgtcg aacggcgaga caacggtggc taataacctt     3240

accagcacgg ttcagatctt tgcggactcg tcgtacgaac tgccgtacgt gatggatgcg     3300

ggtcaagagg gcagcctgcc tccttttccc aacgacgtct ttatggtgcc ccagtacggc     3360

tactgtggac tggtgaccgg caacacttcg cagcaacaga ctgacagaaa tgccttctac     3420

tgcctggagt actttccttc gcagatgctg cggactggca acaactttga aattacgtac     3480

agttttgaga aggtgccttt ccactcgatg tacgcgcaca gccagagcct ggaccggctg     3540

atgaaccctc tcatcgacca gtacctgtgg ggactgcaat cgaccaccac cggaaccacc     3600

ctgaatgccg ggactgccac caccaacttt accaagctgc ggcctaccaa cttttccaac     3660

tttaaaaaga actggctgcc cgggccttca atcaagcagc agggcttctc aaagactgcc     3720

aatcaaaact acaagatccc tgccaccggg tcagacagtc tcatcaaata cgagacgcac     3780

agcactctgg acggaagatg gagtgccctg acccccggac ctccaatggc cacggctgga     3840

cctgcggaca gcaagttcag caacagccag ctcatctttg cggggcctaa acagaacggc     3900

aacacggcca ccgtacccgg gactctgatc ttcacctctg aggaggagct ggcagccacc     3960

aacgccaccg atacggacat gtggggcaac ctacctggcg gtgaccagag caacagcaac     4020

ctgccgaccg tggacagact gacagccttg ggagccgtgc ctggaatggt ctggcaaaac     4080

agagacattt actaccaggg tcccatttgg gccaagattc ctcataccga tggacacttt     4140

cacccctcac cgctgattgg tgggtttggg ctgaaacacc cgcctcctca aatttttatc     4200

aagaacaccc cggtacctgc gaatcctgca acgaccttca gctctactcc ggtaaactcc     4260

ttcattactc agtacagcac tggccaggtg tcggtgcaga ttgactggga gatccagaag     4320

gagcggtcca aacgctggaa ccccgaggtc cagtttacct ccaactacgg acagcaaaac     4380

tctctgttgt gggctcccga tgcggctggg aaatacactg agcctagggc tatcggtacc     4440

cgctacctca cccaccacct gtaataacct gttaatcaat aaaccggttt attcgtttca     4500

gttgaacttt ggtctccgtg tccttcttat cttatctcgt ttccatggct actgcgtaca     4560

taagcagcgg cctgcggcgc ttgcgcttcg cggtttacaa ctgccggtta atcagtaact     4620

tctggcaaac cagatgatgg agttggccac attagctatg cgcgctcgct cactcactcg     4680

gccctggaga ccaaaggtct ccagactgcc ggcctctggc cggcagggcc gagtgagtga     4740

gcgagcgcgc atagagggag tggccaa                                         4767


<210> 89
<211> 1872
<212> DNA
<213> Adeno-associated virus 4

<400> 89
atgccggggt tctacgagat cgtgctgaag gtgcccagcg acctggacga gcacctgccc       60

ggcatttctg actcttttgt gagctgggtg gccgagaagg aatgggagct gccgccggat      120

tctgacatgg acttgaatct gattgagcag gcacccctga ccgtggccga aaagctgcaa      180

cgcgagttcc tggtcgagtg gcgccgcgtg agtaaggccc cggaggccct cttctttgtc      240

cagttcgaga agggggacag ctacttccac ctgcacatcc tggtggagac cgtgggcgtc      300

aaatccatgg tggtgggccg ctacgtgagc cagattaaag agaagctggt gacccgcatc      360

taccgcgggg tcgagccgca gcttccgaac tggttcgcgg tgaccaagac gcgtaatggc      420

gccggaggcg ggaacaaggt ggtggacgac tgctacatcc ccaactacct gctccccaag      480

acccagcccg agctccagtg ggcgtggact aacatggacc agtatataag cgcctgtttg      540

aatctcgcgg agcgtaaacg gctggtggcg cagcatctga cgcacgtgtc gcagacgcag      600

gagcagaaca aggaaaacca gaaccccaat tctgacgcgc cggtcatcag gtcaaaaacc      660

tccgccaggt acatggagct ggtcgggtgg ctggtggacc gcgggatcac gtcagaaaag      720

caatggatcc aggaggacca ggcgtcctac atctccttca acgccgcctc caactcgcgg      780

tcacaaatca aggccgcgct ggacaatgcc tccaaaatca tgagcctgac aaagacggct      840

ccggactacc tggtgggcca gaacccgccg gaggacattt ccagcaaccg catctaccga      900

atcctcgaga tgaacgggta cgatccgcag tacgcggcct ccgtcttcct gggctgggcg      960

caaaagaagt tcgggaagag gaacaccatc tggctctttg ggccggccac gacgggtaaa     1020

accaacatcg cggaagccat cgcccacgcc gtgcccttct acggctgcgt gaactggacc     1080

aatgagaact ttccgttcaa cgattgcgtc gacaagatgg tgatctggtg ggaggagggc     1140

aagatgacgg ccaaggtcgt agagagcgcc aaggccatcc tgggcggaag caaggtgcgc     1200

gtggaccaaa agtgcaagtc atcggcccag atcgacccaa ctcccgtgat cgtcacctcc     1260

aacaccaaca tgtgcgcggt catcgacgga aactcgacca ccttcgagca ccaacaacca     1320

ctccaggacc ggatgttcaa gttcgagctc accaagcgcc tggagcacga ctttggcaag     1380

gtcaccaagc aggaagtcaa agactttttc cggtgggcgt cagatcacgt gaccgaggtg     1440

actcacgagt tttacgtcag aaagggtgga gctagaaaga ggcccgcccc caatgacgca     1500

gatataagtg agcccaagcg ggcctgtccg tcagttgcgc agccatcgac gtcagacgcg     1560

gaagctccgg tggactacgc ggacaggtac caaaacaaat gttctcgtca cgtgggtatg     1620

aatctgatgc tttttccctg ccggcaatgc gagagaatga atcagaatgt ggacatttgc     1680

ttcacgcacg gggtcatgga ctgtgccgag tgcttccccg tgtcagaatc tcaacccgtg     1740

tctgtcgtca gaaagcggac gtatcagaaa ctgtgtccga ttcatcacat catggggagg     1800

gcgcccgagg tggcctgctc ggcctgcgaa ctggccaatg tggacttgga tgactgtgac     1860

atggaacaat aa                                                         1872


<210> 90
<211> 2205
<212> DNA
<213> Adeno-associated virus 4

<400> 90
atgactgacg gttaccttcc agattggcta gaggacaacc tctctgaagg cgttcgagag       60

tggtgggcgc tgcaacctgg agcccctaaa cccaaggcaa atcaacaaca tcaggacaac      120

gctcggggtc ttgtgcttcc gggttacaaa tacctcggac ccggcaacgg actcgacaag      180

ggggaacccg tcaacgcagc ggacgcggca gccctcgagc acgacaaggc ctacgaccag      240

cagctcaagg ccggtgacaa cccctacctc aagtacaacc acgccgacgc ggagttccag      300

cagcggcttc agggcgacac atcgtttggg ggcaacctcg gcagagcagt cttccaggcc      360

aaaaagaggg ttcttgaacc tcttggtctg gttgagcaag cgggtgagac ggctcctgga      420

aagaagagac cgttgattga atccccccag cagcccgact cctccacggg tatcggcaaa      480

aaaggcaagc agccggctaa aaagaagctc gttttcgaag acgaaactgg agcaggcgac      540

ggaccccctg agggatcaac ttccggagcc atgtctgatg acagtgagat gcgtgcagca      600

gctggcggag ctgcagtcga gggcggacaa ggtgccgatg gagtgggtaa tgcctcgggt      660

gattggcatt gcgattccac ctggtctgag ggccacgtca cgaccaccag caccagaacc      720

tgggtcttgc ccacctacaa caaccacctc tacaagcgac tcggagagag cctgcagtcc      780

aacacctaca acggattctc caccccctgg ggatactttg acttcaaccg cttccactgc      840

cacttctcac cacgtgactg gcagcgactc atcaacaaca actggggcat gcgacccaaa      900

gccatgcggg tcaaaatctt caacatccag gtcaaggagg tcacgacgtc gaacggcgag      960

acaacggtgg ctaataacct taccagcacg gttcagatct ttgcggactc gtcgtacgaa     1020

ctgccgtacg tgatggatgc gggtcaagag ggcagcctgc ctccttttcc caacgacgtc     1080

tttatggtgc cccagtacgg ctactgtgga ctggtgaccg gcaacacttc gcagcaacag     1140

actgacagaa atgccttcta ctgcctggag tactttcctt cgcagatgct gcggactggc     1200

aacaactttg aaattacgta cagttttgag aaggtgcctt tccactcgat gtacgcgcac     1260

agccagagcc tggaccggct gatgaaccct ctcatcgacc agtacctgtg gggactgcaa     1320

tcgaccacca ccggaaccac cctgaatgcc gggactgcca ccaccaactt taccaagctg     1380

cggcctacca acttttccaa ctttaaaaag aactggctgc ccgggccttc aatcaagcag     1440

cagggcttct caaagactgc caatcaaaac tacaagatcc ctgccaccgg gtcagacagt     1500

ctcatcaaat acgagacgca cagcactctg gacggaagat ggagtgccct gacccccgga     1560

cctccaatgg ccacggctgg acctgcggac agcaagttca gcaacagcca gctcatcttt     1620

gcggggccta aacagaacgg caacacggcc accgtacccg ggactctgat cttcacctct     1680

gaggaggagc tggcagccac caacgccacc gatacggaca tgtggggcaa cctacctggc     1740

ggtgaccaga gcaacagcaa cctgccgacc gtggacagac tgacagcctt gggagccgtg     1800

cctggaatgg tctggcaaaa cagagacatt tactaccagg gtcccatttg ggccaagatt     1860

cctcataccg atggacactt tcacccctca ccgctgattg gtgggtttgg gctgaaacac     1920

ccgcctcctc aaatttttat caagaacacc ccggtacctg cgaatcctgc aacgaccttc     1980

agctctactc cggtaaactc cttcattact cagtacagca ctggccaggt gtcggtgcag     2040

attgactggg agatccagaa ggagcggtcc aaacgctgga accccgaggt ccagtttacc     2100

tccaactacg gacagcaaaa ctctctgttg tgggctcccg atgcggctgg gaaatacact     2160

gagcctaggg ctatcggtac ccgctacctc acccaccacc tgtaa                     2205


<210> 91
<211> 4642
<212> DNA
<213> Adeno-associated virus 5

<400> 91
ctctcccccc tgtcgcgttc gctcgctcgc tggctcgttt gggggggtgg cagctcaaag       60

agctgccaga cgacggccct ctggccgtcg cccccccaaa cgagccagcg agcgagcgaa      120

cgcgacaggg gggagagtgc cacactctca agcaaggggg ttttgtaagc agtgatgtca      180

taatgatgta atgcttattg tcacgcgata gttaatgatt aacagtcatg tgatgtgttt      240

tatccaatag gaagaaagcg cgcgtatgag ttctcgcgag acttccgggg tataaaagac      300

cgagtgaacg agcccgccgc cattctttgc tctggactgc tagaggaccc tcgctgccat      360

ggctaccttc tatgaagtca ttgttcgcgt cccatttgac gtggaggaac atctgcctgg      420

aatttctgac agctttgtgg actgggtaac tggtcaaatt tgggagctgc ctccagagtc      480

agatttaaat ttgactctgg ttgaacagcc tcagttgacg gtggctgata gaattcgccg      540

cgtgttcctg tacgagtgga acaaattttc caagcaggag tccaaattct ttgtgcagtt      600

tgaaaaggga tctgaatatt ttcatctgca cacgcttgtg gagacctccg gcatctcttc      660

catggtcctc ggccgctacg tgagtcagat tcgcgcccag ctggtgaaag tggtcttcca      720

gggaattgaa ccccagatca acgactgggt cgccatcacc aaggtaaaga agggcggagc      780

caataaggtg gtggattctg ggtatattcc cgcctacctg ctgccgaagg tccaaccgga      840

gcttcagtgg gcgtggacaa acctggacga gtataaattg gccgccctga atctggagga      900

gcgcaaacgg ctcgtcgcgc agtttctggc agaatcctcg cagcgctcgc aggaggcggc      960

ttcgcagcgt gagttctcgg ctgacccggt catcaaaagc aagacttccc agaaatacat     1020

ggcgctcgtc aactggctcg tggagcacgg catcacttcc gagaagcagt ggatccagga     1080

aaatcaggag agctacctct ccttcaactc caccggcaac tctcggagcc agatcaaggc     1140

cgcgctcgac aacgcgacca aaattatgag tctgacaaaa agcgcggtgg actacctcgt     1200

ggggagctcc gttcccgagg acatttcaaa aaacagaatc tggcaaattt ttgagatgaa     1260

tggctacgac ccggcctacg cgggatccat cctctacggc tggtgtcagc gctccttcaa     1320

caagaggaac accgtctggc tctacggacc cgccacgacc ggcaagacca acatcgcgga     1380

ggccatcgcc cacactgtgc ccttttacgg ctgcgtgaac tggaccaatg aaaactttcc     1440

ctttaatgac tgtgtggaca aaatgctcat ttggtgggag gagggaaaga tgaccaacaa     1500

ggtggttgaa tccgccaagg ccatcctggg gggctcaaag gtgcgggtcg atcagaaatg     1560

taaatcctct gttcaaattg attctacccc tgtcattgta acttccaata caaacatgtg     1620

tgtggtggtg gatgggaatt ccacgacctt tgaacaccag cagccgctgg aggaccgcat     1680

gttcaaattt gaactgacta agcggctccc gccagatttt ggcaagatta ctaagcagga     1740

agtcaaggac ttttttgctt gggcaaaggt caatcaggtg ccggtgactc acgagtttaa     1800

agttcccagg gaattggcgg gaactaaagg ggcggagaaa tctctaaaac gcccactggg     1860

tgacgtcacc aatactagct ataaaagtct ggagaagcgg gccaggctct catttgttcc     1920

cgagacgcct cgcagttcag acgtgactgt tgatcccgct cctctgcgac cgctcaattg     1980

gaattcaagg tatgattgca aatgtgacta tcatgctcaa tttgacaaca tttctaacaa     2040

atgtgatgaa tgtgaatatt tgaatcgggg caaaaatgga tgtatctgtc acaatgtaac     2100

tcactgtcaa atttgtcatg ggattccccc ctgggaaaag gaaaacttgt cagattttgg     2160

ggattttgac gatgccaata aagaacagta aataaagcga gtagtcatgt cttttgttga     2220

tcaccctcca gattggttgg aagaagttgg tgaaggtctt cgcgagtttt tgggccttga     2280

agcgggccca ccgaaaccaa aacccaatca gcagcatcaa gatcaagccc gtggtcttgt     2340

gctgcctggt tataactatc tcggacccgg aaacggtctc gatcgaggag agcctgtcaa     2400

cagggcagac gaggtcgcgc gagagcacga catctcgtac aacgagcagc ttgaggcggg     2460

agacaacccc tacctcaagt acaaccacgc ggacgccgag tttcaggaga agctcgccga     2520

cgacacatcc ttcgggggaa acctcggaaa ggcagtcttt caggccaaga aaagggttct     2580

cgaacctttt ggcctggttg aagagggtgc taagacggcc cctaccggaa agcggataga     2640

cgaccacttt ccaaaaagaa agaaggctcg gaccgaagag gactccaagc cttccacctc     2700

gtcagacgcc gaagctggac ccagcggatc ccagcagctg caaatcccag cccaaccagc     2760

ctcaagtttg ggagctgata caatgtctgc gggaggtggc ggcccattgg gcgacaataa     2820

ccaaggtgcc gatggagtgg gcaatgcctc gggagattgg cattgcgatt ccacgtggat     2880

gggggacaga gtcgtcacca agtccacccg aacctgggtg ctgcccagct acaacaacca     2940

ccagtaccga gagatcaaaa gcggctccgt cgacggaagc aacgccaacg cctactttgg     3000

atacagcacc ccctgggggt actttgactt taaccgcttc cacagccact ggagcccccg     3060

agactggcaa agactcatca acaactactg gggcttcaga ccccggtccc tcagagtcaa     3120

aatcttcaac attcaagtca aagaggtcac ggtgcaggac tccaccacca ccatcgccaa     3180

caacctcacc tccaccgtcc aagtgtttac ggacgacgac taccagctgc cctacgtcgt     3240

cggcaacggg accgagggat gcctgccggc cttccctccg caggtcttta cgctgccgca     3300

gtacggttac gcgacgctga accgcgacaa cacagaaaat cccaccgaga ggagcagctt     3360

cttctgccta gagtactttc ccagcaagat gctgagaacg ggcaacaact ttgagtttac     3420

ctacaacttt gaggaggtgc ccttccactc cagcttcgct cccagtcaga acctgttcaa     3480

gctggccaac ccgctggtgg accagtactt gtaccgcttc gtgagcacaa ataacactgg     3540

cggagtccag ttcaacaaga acctggccgg gagatacgcc aacacctaca aaaactggtt     3600

cccggggccc atgggccgaa cccagggctg gaacctgggc tccggggtca accgcgccag     3660

tgtcagcgcc ttcgccacga ccaataggat ggagctcgag ggcgcgagtt accaggtgcc     3720

cccgcagccg aacggcatga ccaacaacct ccagggcagc aacacctatg ccctggagaa     3780

cactatgatc ttcaacagcc agccggcgaa cccgggcacc accgccacgt acctcgaggg     3840

caacatgctc atcaccagcg agagcgagac gcagccggtg aaccgcgtgg cgtacaacgt     3900

cggcgggcag atggccacca acaaccagag ctccaccact gcccccgcga ccggcacgta     3960

caacctccag gaaatcgtgc ccggcagcgt gtggatggag agggacgtgt acctccaagg     4020

acccatctgg gccaagatcc cagagacggg ggcgcacttt cacccctctc cggccatggg     4080

cggattcgga ctcaaacacc caccgcccat gatgctcatc aagaacacgc ctgtgcccgg     4140

aaatatcacc agcttctcgg acgtgcccgt cagcagcttc atcacccagt acagcaccgg     4200

gcaggtcacc gtggagatgg agtgggagct caagaaggaa aactccaaga ggtggaaccc     4260

agagatccag tacacaaaca actacaacga cccccagttt gtggactttg ccccggacag     4320

caccggggaa tacagaacca ccagacctat cggaacccga taccttaccc gaccccttta     4380

acccattcat gtcgcatacc ctcaataaac cgtgtattcg tgtcagtaaa atactgcctc     4440

ttgtggtcat tcaatgaata acagcttaca acatctacaa aacctccttg cttgagagtg     4500

tggcactctc ccccctgtcg cgttcgctcg ctcgctggct cgtttggggg ggtggcagct     4560

caaagagctg ccagacgacg gccctctggc cgtcgccccc ccaaacgagc cagcgagcga     4620

gcgaacgcga caggggggag ag                                              4642


<210> 92
<211> 1833
<212> DNA
<213> Adeno-associated virus 5

<400> 92
atggctacct tctatgaagt cattgttcgc gtcccatttg acgtggagga acatctgcct       60

ggaatttctg acagctttgt ggactgggta actggtcaaa tttgggagct gcctccagag      120

tcagatttaa atttgactct ggttgaacag cctcagttga cggtggctga tagaattcgc      180

cgcgtgttcc tgtacgagtg gaacaaattt tccaagcagg agtccaaatt ctttgtgcag      240

tttgaaaagg gatctgaata ttttcatctg cacacgcttg tggagacctc cggcatctct      300

tccatggtcc tcggccgcta cgtgagtcag attcgcgccc agctggtgaa agtggtcttc      360

cagggaattg aaccccagat caacgactgg gtcgccatca ccaaggtaaa gaagggcgga      420

gccaataagg tggtggattc tgggtatatt cccgcctacc tgctgccgaa ggtccaaccg      480

gagcttcagt gggcgtggac aaacctggac gagtataaat tggccgccct gaatctggag      540

gagcgcaaac ggctcgtcgc gcagtttctg gcagaatcct cgcagcgctc gcaggaggcg      600

gcttcgcagc gtgagttctc ggctgacccg gtcatcaaaa gcaagacttc ccagaaatac      660

atggcgctcg tcaactggct cgtggagcac ggcatcactt ccgagaagca gtggatccag      720

gaaaatcagg agagctacct ctccttcaac tccaccggca actctcggag ccagatcaag      780

gccgcgctcg acaacgcgac caaaattatg agtctgacaa aaagcgcggt ggactacctc      840

gtggggagct ccgttcccga ggacatttca aaaaacagaa tctggcaaat ttttgagatg      900

aatggctacg acccggccta cgcgggatcc atcctctacg gctggtgtca gcgctccttc      960

aacaagagga acaccgtctg gctctacgga cccgccacga ccggcaagac caacatcgcg     1020

gaggccatcg cccacactgt gcccttttac ggctgcgtga actggaccaa tgaaaacttt     1080

ccctttaatg actgtgtgga caaaatgctc atttggtggg aggagggaaa gatgaccaac     1140

aaggtggttg aatccgccaa ggccatcctg gggggctcaa aggtgcgggt cgatcagaaa     1200

tgtaaatcct ctgttcaaat tgattctacc cctgtcattg taacttccaa tacaaacatg     1260

tgtgtggtgg tggatgggaa ttccacgacc tttgaacacc agcagccgct ggaggaccgc     1320

atgttcaaat ttgaactgac taagcggctc ccgccagatt ttggcaagat tactaagcag     1380

gaagtcaagg acttttttgc ttgggcaaag gtcaatcagg tgccggtgac tcacgagttt     1440

aaagttccca gggaattggc gggaactaaa ggggcggaga aatctctaaa acgcccactg     1500

ggtgacgtca ccaatactag ctataaaagt ctggagaagc gggccaggct ctcatttgtt     1560

cccgagacgc ctcgcagttc agacgtgact gttgatcccg ctcctctgcg accgctcaat     1620

tggaattcaa ggtatgattg caaatgtgac tatcatgctc aatttgacaa catttctaac     1680

aaatgtgatg aatgtgaata tttgaatcgg ggcaaaaatg gatgtatctg tcacaatgta     1740

actcactgtc aaatttgtca tgggattccc ccctgggaaa aggaaaactt gtcagatttt     1800

ggggattttg acgatgccaa taaagaacag taa                                  1833


<210> 93
<211> 2175
<212> DNA
<213> Adeno-associated virus 5

<400> 93
atgtcttttg ttgatcaccc tccagattgg ttggaagaag ttggtgaagg tcttcgcgag       60

tttttgggcc ttgaagcggg cccaccgaaa ccaaaaccca atcagcagca tcaagatcaa      120

gcccgtggtc ttgtgctgcc tggttataac tatctcggac ccggaaacgg tctcgatcga      180

ggagagcctg tcaacagggc agacgaggtc gcgcgagagc acgacatctc gtacaacgag      240

cagcttgagg cgggagacaa cccctacctc aagtacaacc acgcggacgc cgagtttcag      300

gagaagctcg ccgacgacac atccttcggg ggaaacctcg gaaaggcagt ctttcaggcc      360

aagaaaaggg ttctcgaacc ttttggcctg gttgaagagg gtgctaagac ggcccctacc      420

ggaaagcgga tagacgacca ctttccaaaa agaaagaagg ctcggaccga agaggactcc      480

aagccttcca cctcgtcaga cgccgaagct ggacccagcg gatcccagca gctgcaaatc      540

ccagcccaac cagcctcaag tttgggagct gatacaatgt ctgcgggagg tggcggccca      600

ttgggcgaca ataaccaagg tgccgatgga gtgggcaatg cctcgggaga ttggcattgc      660

gattccacgt ggatggggga cagagtcgtc accaagtcca cccgaacctg ggtgctgccc      720

agctacaaca accaccagta ccgagagatc aaaagcggct ccgtcgacgg aagcaacgcc      780

aacgcctact ttggatacag caccccctgg gggtactttg actttaaccg cttccacagc      840

cactggagcc cccgagactg gcaaagactc atcaacaact actggggctt cagaccccgg      900

tccctcagag tcaaaatctt caacattcaa gtcaaagagg tcacggtgca ggactccacc      960

accaccatcg ccaacaacct cacctccacc gtccaagtgt ttacggacga cgactaccag     1020

ctgccctacg tcgtcggcaa cgggaccgag ggatgcctgc cggccttccc tccgcaggtc     1080

tttacgctgc cgcagtacgg ttacgcgacg ctgaaccgcg acaacacaga aaatcccacc     1140

gagaggagca gcttcttctg cctagagtac tttcccagca agatgctgag aacgggcaac     1200

aactttgagt ttacctacaa ctttgaggag gtgcccttcc actccagctt cgctcccagt     1260

cagaacctgt tcaagctggc caacccgctg gtggaccagt acttgtaccg cttcgtgagc     1320

acaaataaca ctggcggagt ccagttcaac aagaacctgg ccgggagata cgccaacacc     1380

tacaaaaact ggttcccggg gcccatgggc cgaacccagg gctggaacct gggctccggg     1440

gtcaaccgcg ccagtgtcag cgccttcgcc acgaccaata ggatggagct cgagggcgcg     1500

agttaccagg tgcccccgca gccgaacggc atgaccaaca acctccaggg cagcaacacc     1560

tatgccctgg agaacactat gatcttcaac agccagccgg cgaacccggg caccaccgcc     1620

acgtacctcg agggcaacat gctcatcacc agcgagagcg agacgcagcc ggtgaaccgc     1680

gtggcgtaca acgtcggcgg gcagatggcc accaacaacc agagctccac cactgccccc     1740

gcgaccggca cgtacaacct ccaggaaatc gtgcccggca gcgtgtggat ggagagggac     1800

gtgtacctcc aaggacccat ctgggccaag atcccagaga cgggggcgca ctttcacccc     1860

tctccggcca tgggcggatt cggactcaaa cacccaccgc ccatgatgct catcaagaac     1920

acgcctgtgc ccggaaatat caccagcttc tcggacgtgc ccgtcagcag cttcatcacc     1980

cagtacagca ccgggcaggt caccgtggag atggagtggg agctcaagaa ggaaaactcc     2040

aagaggtgga acccagagat ccagtacaca aacaactaca acgaccccca gtttgtggac     2100

tttgccccgg acagcaccgg ggaatacaga accaccagac ctatcggaac ccgatacctt     2160

acccgacccc tttaa                                                      2175


<210> 94
<211> 4683
<212> DNA
<213> Adeno-associated virus 6

<400> 94
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc       60

cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg      120

gccaactcca tcactagggg ttcctggagg ggtggagtcg tgacgtgaat tacgtcatag      180

ggttagggag gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat      240

gtggtcacgc tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga      300

ggtttgaacg cgcagcgcca tgccggggtt ttacgagatt gtgattaagg tccccagcga      360

ccttgacgag catctgcccg gcatttctga cagctttgtg aactgggtgg ccgagaagga      420

atgggagttg ccgccagatt ctgacatgga tctgaatctg attgagcagg cacccctgac      480

cgtggccgag aagctgcagc gcgacttcct ggtccagtgg cgccgcgtga gtaaggcccc      540

ggaggccctc ttctttgttc agttcgagaa gggcgagtcc tacttccacc tccatattct      600

ggtggagacc acgggggtca aatccatggt gctgggccgc ttcctgagtc agattaggga      660

caagctggtg cagaccatct accgcgggat cgagccgacc ctgcccaact ggttcgcggt      720

gaccaagacg cgtaatggcg ccggaggggg gaacaaggtg gtggacgagt gctacatccc      780

caactacctc ctgcccaaga ctcagcccga gctgcagtgg gcgtggacta acatggagga      840

gtatataagc gcgtgtttaa acctggccga gcgcaaacgg ctcgtggcgc acgacctgac      900

ccacgtcagc cagacccagg agcagaacaa ggagaatctg aaccccaatt ctgacgcgcc      960

tgtcatccgg tcaaaaacct ccgcacgcta catggagctg gtcgggtggc tggtggaccg     1020

gggcatcacc tccgagaagc agtggatcca ggaggaccag gcctcgtaca tctccttcaa     1080

cgccgcctcc aactcgcggt cccagatcaa ggccgctctg gacaatgccg gcaagatcat     1140

ggcgctgacc aaatccgcgc ccgactacct ggtaggcccc gctccgcccg ccgacattaa     1200

aaccaaccgc atttaccgca tcctggagct gaacggctac gaccctgcct acgccggctc     1260

cgtctttctc ggctgggccc agaaaaggtt cggaaaacgc aacaccatct ggctgtttgg     1320

gccggccacc acgggcaaga ccaacatcgc ggaagccatc gcccacgccg tgcccttcta     1380

cggctgcgtc aactggacca atgagaactt tcccttcaac gattgcgtcg acaagatggt     1440

gatctggtgg gaggagggca agatgacggc caaggtcgtg gagtccgcca aggccattct     1500

cggcggcagc aaggtgcgcg tggaccaaaa gtgcaagtcg tccgcccaga tcgatcccac     1560

ccccgtgatc gtcacctcca acaccaacat gtgcgccgtg attgacggga acagcaccac     1620

cttcgagcac cagcagccgt tgcaggaccg gatgttcaaa tttgaactca cccgccgtct     1680

ggagcatgac tttggcaagg tgacaaagca ggaagtcaaa gagttcttcc gctgggcgca     1740

ggatcacgtg accgaggtgg cgcatgagtt ctacgtcaga aagggtggag ccaacaagag     1800

acccgccccc gatgacgcgg ataaaagcga gcccaagcgg gcctgcccct cagtcgcgga     1860

tccatcgacg tcagacgcgg aaggagctcc ggtggacttt gccgacaggt accaaaacaa     1920

atgttctcgt cacgcgggca tgcttcagat gctgtttccc tgcaaaacat gcgagagaat     1980

gaatcagaat ttcaacattt gcttcacgca cgggaccaga gactgttcag aatgtttccc     2040

cggcgtgtca gaatctcaac cggtcgtcag aaagaggacg tatcggaaac tctgtgccat     2100

tcatcatctg ctggggcggg ctcccgagat tgcttgctcg gcctgcgatc tggtcaacgt     2160

ggatctggat gactgtgttt ctgagcaata aatgacttaa accaggtatg gctgccgatg     2220

gttatcttcc agattggctc gaggacaacc tctctgaggg cattcgcgag tggtgggact     2280

tgaaacctgg agccccgaaa cccaaagcca accagcaaaa gcaggacgac ggccggggtc     2340

tggtgcttcc tggctacaag tacctcggac ccttcaacgg actcgacaag ggggagcccg     2400

tcaacgcggc ggatgcagcg gccctcgagc acgacaaggc ctacgaccag cagctcaaag     2460

cgggtgacaa tccgtacctg cggtataacc acgccgacgc cgagtttcag gagcgtctgc     2520

aagaagatac gtcttttggg ggcaacctcg ggcgagcagt cttccaggcc aagaagaggg     2580

ttctcgaacc ttttggtctg gttgaggaag gtgctaagac ggctcctgga aagaaacgtc     2640

cggtagagca gtcgccacaa gagccagact cctcctcggg cattggcaag acaggccagc     2700

agcccgctaa aaagagactc aattttggtc agactggcga ctcagagtca gtccccgacc     2760

cacaacctct cggagaacct ccagcaaccc ccgctgctgt gggacctact acaatggctt     2820

caggcggtgg cgcaccaatg gcagacaata acgaaggcgc cgacggagtg ggtaatgcct     2880

caggaaattg gcattgcgat tccacatggc tgggcgacag agtcatcacc accagcaccc     2940

gaacatgggc cttgcccacc tataacaacc acctctacaa gcaaatctcc agtgcttcaa     3000

cgggggccag caacgacaac cactacttcg gctacagcac cccctggggg tattttgatt     3060

tcaacagatt ccactgccat ttctcaccac gtgactggca gcgactcatc aacaacaatt     3120

ggggattccg gcccaagaga ctcaacttca agctcttcaa catccaagtc aaggaggtca     3180

cgacgaatga tggcgtcacg accatcgcta ataaccttac cagcacggtt caagtcttct     3240

cggactcgga gtaccagttg ccgtacgtcc tcggctctgc gcaccagggc tgcctccctc     3300

cgttcccggc ggacgtgttc atgattccgc agtacggcta cctaacgctc aacaatggca     3360

gccaggcagt gggacggtca tccttttact gcctggaata tttcccatcg cagatgctga     3420

gaacgggcaa taactttacc ttcagctaca ccttcgagga cgtgcctttc cacagcagct     3480

acgcgcacag ccagagcctg gaccggctga tgaatcctct catcgaccag tacctgtatt     3540

acctgaacag aactcagaat cagtccggaa gtgcccaaaa caaggacttg ctgtttagcc     3600

gggggtctcc agctggcatg tctgttcagc ccaaaaactg gctacctgga ccctgttacc     3660

ggcagcagcg cgtttctaaa acaaaaacag acaacaacaa cagcaacttt acctggactg     3720

gtgcttcaaa atataacctt aatgggcgtg aatctataat caaccctggc actgctatgg     3780

cctcacacaa agacgacaaa gacaagttct ttcccatgag cggtgtcatg atttttggaa     3840

aggagagcgc cggagcttca aacactgcat tggacaatgt catgatcaca gacgaagagg     3900

aaatcaaagc cactaacccc gtggccaccg aaagatttgg gactgtggca gtcaatctcc     3960

agagcagcag cacagaccct gcgaccggag atgtgcatgt tatgggagcc ttacctggaa     4020

tggtgtggca agacagagac gtatacctgc agggtcctat ttgggccaaa attcctcaca     4080

cggatggaca ctttcacccg tctcctctca tgggcggctt tggacttaag cacccgcctc     4140

ctcagatcct catcaaaaac acgcctgttc ctgcgaatcc tccggcagag ttttcggcta     4200

caaagtttgc ttcattcatc acccagtatt ccacaggaca agtgagcgtg gagattgaat     4260

gggagctgca gaaagaaaac agcaaacgct ggaatcccga agtgcagtat acatctaact     4320

atgcaaaatc tgccaacgtt gatttcactg tggacaacaa tggactttat actgagcctc     4380

gccccattgg cacccgttac ctcacccgtc ccctgtaatt gtgtgttaat caataaaccg     4440

gttaattcgt gtcagttgaa ctttggtctc atgtcgttat tatcttatct ggtcaccata     4500

gcaaccggtt acacattaac tgcttagttg cgcttcgcga atacccctag tgatggagtt     4560

gcccactccc tctatgcgcg ctcgctcgct cggtggggcc ggcagagcag agctctgccg     4620

tctgcggacc tttggtccgc aggccccacc gagcgagcga gcgcgcatag agggagtggg     4680

caa                                                                   4683


<210> 95
<211> 1872
<212> DNA
<213> Adeno-associated virus 6

<400> 95
atgccggggt tttacgagat tgtgattaag gtccccagcg accttgacga gcatctgccc       60

ggcatttctg acagctttgt gaactgggtg gccgagaagg aatgggagtt gccgccagat      120

tctgacatgg atctgaatct gattgagcag gcacccctga ccgtggccga gaagctgcag      180

cgcgacttcc tggtccagtg gcgccgcgtg agtaaggccc cggaggccct cttctttgtt      240

cagttcgaga agggcgagtc ctacttccac ctccatattc tggtggagac cacgggggtc      300

aaatccatgg tgctgggccg cttcctgagt cagattaggg acaagctggt gcagaccatc      360

taccgcggga tcgagccgac cctgcccaac tggttcgcgg tgaccaagac gcgtaatggc      420

gccggagggg ggaacaaggt ggtggacgag tgctacatcc ccaactacct cctgcccaag      480

actcagcccg agctgcagtg ggcgtggact aacatggagg agtatataag cgcgtgttta      540

aacctggccg agcgcaaacg gctcgtggcg cacgacctga cccacgtcag ccagacccag      600

gagcagaaca aggagaatct gaaccccaat tctgacgcgc ctgtcatccg gtcaaaaacc      660

tccgcacgct acatggagct ggtcgggtgg ctggtggacc ggggcatcac ctccgagaag      720

cagtggatcc aggaggacca ggcctcgtac atctccttca acgccgcctc caactcgcgg      780

tcccagatca aggccgctct ggacaatgcc ggcaagatca tggcgctgac caaatccgcg      840

cccgactacc tggtaggccc cgctccgccc gccgacatta aaaccaaccg catttaccgc      900

atcctggagc tgaacggcta cgaccctgcc tacgccggct ccgtctttct cggctgggcc      960

cagaaaaggt tcggaaaacg caacaccatc tggctgtttg ggccggccac cacgggcaag     1020

accaacatcg cggaagccat cgcccacgcc gtgcccttct acggctgcgt caactggacc     1080

aatgagaact ttcccttcaa cgattgcgtc gacaagatgg tgatctggtg ggaggagggc     1140

aagatgacgg ccaaggtcgt ggagtccgcc aaggccattc tcggcggcag caaggtgcgc     1200

gtggaccaaa agtgcaagtc gtccgcccag atcgatccca cccccgtgat cgtcacctcc     1260

aacaccaaca tgtgcgccgt gattgacggg aacagcacca ccttcgagca ccagcagccg     1320

ttgcaggacc ggatgttcaa atttgaactc acccgccgtc tggagcatga ctttggcaag     1380

gtgacaaagc aggaagtcaa agagttcttc cgctgggcgc aggatcacgt gaccgaggtg     1440

gcgcatgagt tctacgtcag aaagggtgga gccaacaaga gacccgcccc cgatgacgcg     1500

gataaaagcg agcccaagcg ggcctgcccc tcagtcgcgg atccatcgac gtcagacgcg     1560

gaaggagctc cggtggactt tgccgacagg taccaaaaca aatgttctcg tcacgcgggc     1620

atgcttcaga tgctgtttcc ctgcaaaaca tgcgagagaa tgaatcagaa tttcaacatt     1680

tgcttcacgc acgggaccag agactgttca gaatgtttcc ccggcgtgtc agaatctcaa     1740

ccggtcgtca gaaagaggac gtatcggaaa ctctgtgcca ttcatcatct gctggggcgg     1800

gctcccgaga ttgcttgctc ggcctgcgat ctggtcaacg tggatctgga tgactgtgtt     1860

tctgagcaat aa                                                         1872


<210> 96
<211> 2211
<212> DNA
<213> Adeno-associated virus 6

<400> 96
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaaga gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct      420

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcattggc      480

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag      540

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct      600

actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga      660

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc      720

accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc      780

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg      840

gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc      900

atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa      960

gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg     1020

gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag     1080

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg     1140

ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca     1200

tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct     1260

ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac     1320

cagtacctgt attacctgaa cagaactcag aatcagtccg gaagtgccca aaacaaggac     1380

ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct     1440

ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac     1500

tttacctgga ctggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct     1560

ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc     1620

atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc     1680

acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg     1740

gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga     1800

gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc     1860

aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt     1920

aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca     1980

gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc     2040

gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag     2100

tatacatcta actatgcaaa atctgccaac gttgatttca ctgtggacaa caatggactt     2160

tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a              2211


<210> 97
<211> 4721
<212> DNA
<213> Adeno-associated virus 7

<400> 97
ttggccactc cctctatgcg cgctcgctcg ctcggtgggg cctgcggacc aaaggtccgc       60

agacggcaga gctctgctct gccggcccca ccgagcgagc gagcgcgcat agagggagtg      120

gccaactcca tcactagggg taccgcgaag cgcctcccac gctgccgcgt cagcgctgac      180

gtaaatcacg tcatagggga gtggtcctgt attagctgtc acgtgagtgc ttttgcgaca      240

ttttgcgaca ccacgtggcc atttgaggta tatatggccg agtgagcgag caggatctcc      300

attttgaccg cgaaatttga acgagcagca gccatgccgg gtttctacga gatcgtgatc      360

aaggtgccga gcgacctgga cgagcacctg ccgggcattt ctgactcgtt tgtgaactgg      420

gtggccgaga aggaatggga gctgcccccg gattctgaca tggatctgaa tctgatcgag      480

caggcacccc tgaccgtggc cgagaagctg cagcgcgact tcctggtcca atggcgccgc      540

gtgagtaagg ccccggaggc cctgttcttt gttcagttcg agaagggcga gagctacttc      600

caccttcacg ttctggtgga gaccacgggg gtcaagtcca tggtgctagg ccgcttcctg      660

agtcagattc gggagaagct ggtccagacc atctaccgcg gggtcgagcc cacgctgccc      720

aactggttcg cggtgaccaa gacgcgtaat ggcgccggcg gggggaacaa ggtggtggac      780

gagtgctaca tccccaacta cctcctgccc aagacccagc ccgagctgca gtgggcgtgg      840

actaacatgg aggagtatat aagcgcgtgt ttgaacctgg ccgaacgcaa acggctcgtg      900

gcgcagcacc tgacccacgt cagccagacg caggagcaga acaaggagaa tctgaacccc      960

aattctgacg cgcccgtgat caggtcaaaa acctccgcgc gctacatgga gctggtcggg     1020

tggctggtgg accggggcat cacctccgag aagcagtgga tccaggagga ccaggcctcg     1080

tacatctcct tcaacgccgc ctccaactcg cggtcccaga tcaaggccgc gctggacaat     1140

gccggcaaga tcatggcgct gaccaaatcc gcgcccgact acctggtggg gccctcgctg     1200

cccgcggaca ttaaaaccaa ccgcatctac cgcatcctgg agctgaacgg gtacgatcct     1260

gcctacgccg gctccgtctt tctcggctgg gcccagaaaa agttcgggaa gcgcaacacc     1320

atctggctgt ttgggcccgc caccaccggc aagaccaaca ttgcggaagc catcgcccac     1380

gccgtgccct tctacggctg cgtcaactgg accaatgaga actttccctt caacgattgc     1440

gtcgacaaga tggtgatctg gtgggaggag ggcaagatga cggccaaggt cgtggagtcc     1500

gccaaggcca ttctcggcgg cagcaaggtg cgcgtggacc aaaagtgcaa gtcgtccgcc     1560

cagatcgacc ccacccccgt gatcgtcacc tccaacacca acatgtgcgc cgtgattgac     1620

gggaacagca ccaccttcga gcaccagcag ccgttgcagg accggatgtt caaatttgaa     1680

ctcacccgcc gtctggagca cgactttggc aaggtgacga agcaggaagt caaagagttc     1740

ttccgctggg ccagtgatca cgtgaccgag gtggcgcatg agttctacgt cagaaagggc     1800

ggagccagca aaagacccgc ccccgatgac gcggatataa gcgagcccaa gcgggcctgc     1860

ccctcagtcg cggatccatc gacgtcagac gcggaaggag ctccggtgga ctttgccgac     1920

aggtaccaaa acaaatgttc tcgtcacgcg ggcatgattc agatgctgtt tccctgcaaa     1980

acgtgcgaga gaatgaatca gaatttcaac atttgcttca cacacggggt cagagactgt     2040

ttagagtgtt tccccggcgt gtcagaatct caaccggtcg tcagaaaaaa gacgtatcgg     2100

aaactctgcg cgattcatca tctgctgggg cgggcgcccg agattgcttg ctcggcctgc     2160

gacctggtca acgtggacct ggacgactgc gtttctgagc aataaatgac ttaaaccagg     2220

tatggctgcc gatggttatc ttccagattg gctcgaggac aacctctctg agggcattcg     2280

cgagtggtgg gacctgaaac ctggagcccc gaaacccaaa gccaaccagc aaaagcagga     2340

caacggccgg ggtctggtgc ttcctggcta caagtacctc ggacccttca acggactcga     2400

caagggggag cccgtcaacg cggcggacgc agcggccctc gagcacgaca aggcctacga     2460

ccagcagctc aaagcgggtg acaatccgta cctgcggtat aaccacgccg acgccgagtt     2520

tcaggagcgt ctgcaagaag atacgtcatt tgggggcaac ctcgggcgag cagtcttcca     2580

ggccaagaag cgggttctcg aacctctcgg tctggttgag gaaggcgcta agacggctcc     2640

tgcaaagaag agaccggtag agccgtcacc tcagcgttcc cccgactcct ccacgggcat     2700

cggcaagaaa ggccagcagc ccgccagaaa gagactcaat ttcggtcaga ctggcgactc     2760

agagtcagtc cccgaccctc aacctctcgg agaacctcca gcagcgccct ctagtgtggg     2820

atctggtaca gtggctgcag gcggtggcgc accaatggca gacaataacg aaggtgccga     2880

cggagtgggt aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt     2940

cattaccacc agcacccgaa cctgggccct gcccacctac aacaaccacc tctacaagca     3000

aatctccagt gaaactgcag gtagtaccaa cgacaacacc tacttcggct acagcacccc     3060

ctgggggtat tttgacttta acagattcca ctgccacttc tcaccacgtg actggcagcg     3120

actcatcaac aacaactggg gattccggcc caagaagctg cggttcaagc tcttcaacat     3180

ccaggtcaag gaggtcacga cgaatgacgg cgttacgacc atcgctaata accttaccag     3240

cacgattcag gtattctcgg actcggaata ccagctgccg tacgtcctcg gctctgcgca     3300

ccagggctgc ctgcctccgt tcccggcgga cgtcttcatg attcctcagt acggctacct     3360

gactctcaac aatggcagtc agtctgtggg acgttcctcc ttctactgcc tggagtactt     3420

cccctctcag atgctgagaa cgggcaacaa ctttgagttc agctacagct tcgaggacgt     3480

gcctttccac agcagctacg cacacagcca gagcctggac cggctgatga atcccctcat     3540

cgaccagtac ttgtactacc tggccagaac acagagtaac ccaggaggca cagctggcaa     3600

tcgggaactg cagttttacc agggcgggcc ttcaactatg gccgaacaag ccaagaattg     3660

gttacctgga ccttgcttcc ggcaacaaag agtctccaaa acgctggatc aaaacaacaa     3720

cagcaacttt gcttggactg gtgccaccaa atatcacctg aacggcagaa actcgttggt     3780

taatcccggc gtcgccatgg caactcacaa ggacgacgag gaccgctttt tcccatccag     3840

cggagtcctg atttttggaa aaactggagc aactaacaaa actacattgg aaaatgtgtt     3900

aatgacaaat gaagaagaaa ttcgtcctac taatcctgta gccacggaag aatacgggat     3960

agtcagcagc aacttacaag cggctaatac tgcagcccag acacaagttg tcaacaacca     4020

gggagcctta cctggcatgg tctggcagaa ccgggacgtg tacctgcagg gtcccatctg     4080

ggccaagatt cctcacacgg atggcaactt tcacccgtct cctttgatgg gcggctttgg     4140

acttaaacat ccgcctcctc agatcctgat caagaacact cccgttcccg ctaatcctcc     4200

ggaggtgttt actcctgcca agtttgcttc gttcatcaca cagtacagca ccggacaagt     4260

cagcgtggaa atcgagtggg agctgcagaa ggaaaacagc aagcgctgga acccggagat     4320

tcagtacacc tccaactttg aaaagcagac tggtgtggac tttgccgttg acagccaggg     4380

tgtttactct gagcctcgcc ctattggcac tcgttacctc acccgtaatc tgtaattgca     4440

tgttaatcaa taaaccggtt gattcgtttc agttgaactt tggtctcctg tgcttcttat     4500

cttatcggtt tccatagcaa ctggttacac attaactgct tgggtgcgct tcacgataag     4560

aacactgacg tcaccgcggt acccctagtg atggagttgg ccactccctc tatgcgcgct     4620

cgctcgctcg gtggggcctg cggaccaaag gtccgcagac ggcagagctc tgctctgccg     4680

gccccaccga gcgagcgagc gcgcatagag ggagtggcca a                         4721


<210> 98
<211> 1872
<212> DNA
<213> Adeno-associated virus 7

<400> 98
atgccgggtt tctacgagat cgtgatcaag gtgccgagcg acctggacga gcacctgccg       60

ggcatttctg actcgtttgt gaactgggtg gccgagaagg aatgggagct gcccccggat      120

tctgacatgg atctgaatct gatcgagcag gcacccctga ccgtggccga gaagctgcag      180

cgcgacttcc tggtccaatg gcgccgcgtg agtaaggccc cggaggccct gttctttgtt      240

cagttcgaga agggcgagag ctacttccac cttcacgttc tggtggagac cacgggggtc      300

aagtccatgg tgctaggccg cttcctgagt cagattcggg agaagctggt ccagaccatc      360

taccgcgggg tcgagcccac gctgcccaac tggttcgcgg tgaccaagac gcgtaatggc      420

gccggcgggg ggaacaaggt ggtggacgag tgctacatcc ccaactacct cctgcccaag      480

acccagcccg agctgcagtg ggcgtggact aacatggagg agtatataag cgcgtgtttg      540

aacctggccg aacgcaaacg gctcgtggcg cagcacctga cccacgtcag ccagacgcag      600

gagcagaaca aggagaatct gaaccccaat tctgacgcgc ccgtgatcag gtcaaaaacc      660

tccgcgcgct acatggagct ggtcgggtgg ctggtggacc ggggcatcac ctccgagaag      720

cagtggatcc aggaggacca ggcctcgtac atctccttca acgccgcctc caactcgcgg      780

tcccagatca aggccgcgct ggacaatgcc ggcaagatca tggcgctgac caaatccgcg      840

cccgactacc tggtggggcc ctcgctgccc gcggacatta aaaccaaccg catctaccgc      900

atcctggagc tgaacgggta cgatcctgcc tacgccggct ccgtctttct cggctgggcc      960

cagaaaaagt tcgggaagcg caacaccatc tggctgtttg ggcccgccac caccggcaag     1020

accaacattg cggaagccat cgcccacgcc gtgcccttct acggctgcgt caactggacc     1080

aatgagaact ttcccttcaa cgattgcgtc gacaagatgg tgatctggtg ggaggagggc     1140

aagatgacgg ccaaggtcgt ggagtccgcc aaggccattc tcggcggcag caaggtgcgc     1200

gtggaccaaa agtgcaagtc gtccgcccag atcgacccca cccccgtgat cgtcacctcc     1260

aacaccaaca tgtgcgccgt gattgacggg aacagcacca ccttcgagca ccagcagccg     1320

ttgcaggacc ggatgttcaa atttgaactc acccgccgtc tggagcacga ctttggcaag     1380

gtgacgaagc aggaagtcaa agagttcttc cgctgggcca gtgatcacgt gaccgaggtg     1440

gcgcatgagt tctacgtcag aaagggcgga gccagcaaaa gacccgcccc cgatgacgcg     1500

gatataagcg agcccaagcg ggcctgcccc tcagtcgcgg atccatcgac gtcagacgcg     1560

gaaggagctc cggtggactt tgccgacagg taccaaaaca aatgttctcg tcacgcgggc     1620

atgattcaga tgctgtttcc ctgcaaaacg tgcgagagaa tgaatcagaa tttcaacatt     1680

tgcttcacac acggggtcag agactgttta gagtgtttcc ccggcgtgtc agaatctcaa     1740

ccggtcgtca gaaaaaagac gtatcggaaa ctctgcgcga ttcatcatct gctggggcgg     1800

gcgcccgaga ttgcttgctc ggcctgcgac ctggtcaacg tggacctgga cgactgcgtt     1860

tctgagcaat aa                                                         1872


<210> 99
<211> 2214
<212> DNA
<213> Adeno-associated virus 7

<400> 99
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acctgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac      120

aacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtcattt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

gcaaagaaga gaccggtaga gccgtcacct cagcgttccc ccgactcctc cacgggcatc      480

ggcaagaaag gccagcagcc cgccagaaag agactcaatt tcggtcagac tggcgactca      540

gagtcagtcc ccgaccctca acctctcgga gaacctccag cagcgccctc tagtgtggga      600

tctggtacag tggctgcagg cggtggcgca ccaatggcag acaataacga aggtgccgac      660

ggagtgggta atgcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc      720

attaccacca gcacccgaac ctgggccctg cccacctaca acaaccacct ctacaagcaa      780

atctccagtg aaactgcagg tagtaccaac gacaacacct acttcggcta cagcaccccc      840

tgggggtatt ttgactttaa cagattccac tgccacttct caccacgtga ctggcagcga      900

ctcatcaaca acaactgggg attccggccc aagaagctgc ggttcaagct cttcaacatc      960

caggtcaagg aggtcacgac gaatgacggc gttacgacca tcgctaataa ccttaccagc     1020

acgattcagg tattctcgga ctcggaatac cagctgccgt acgtcctcgg ctctgcgcac     1080

cagggctgcc tgcctccgtt cccggcggac gtcttcatga ttcctcagta cggctacctg     1140

actctcaaca atggcagtca gtctgtggga cgttcctcct tctactgcct ggagtacttc     1200

ccctctcaga tgctgagaac gggcaacaac tttgagttca gctacagctt cgaggacgtg     1260

cctttccaca gcagctacgc acacagccag agcctggacc ggctgatgaa tcccctcatc     1320

gaccagtact tgtactacct ggccagaaca cagagtaacc caggaggcac agctggcaat     1380

cgggaactgc agttttacca gggcgggcct tcaactatgg ccgaacaagc caagaattgg     1440

ttacctggac cttgcttccg gcaacaaaga gtctccaaaa cgctggatca aaacaacaac     1500

agcaactttg cttggactgg tgccaccaaa tatcacctga acggcagaaa ctcgttggtt     1560

aatcccggcg tcgccatggc aactcacaag gacgacgagg accgcttttt cccatccagc     1620

ggagtcctga tttttggaaa aactggagca actaacaaaa ctacattgga aaatgtgtta     1680

atgacaaatg aagaagaaat tcgtcctact aatcctgtag ccacggaaga atacgggata     1740

gtcagcagca acttacaagc ggctaatact gcagcccaga cacaagttgt caacaaccag     1800

ggagccttac ctggcatggt ctggcagaac cgggacgtgt acctgcaggg tcccatctgg     1860

gccaagattc ctcacacgga tggcaacttt cacccgtctc ctttgatggg cggctttgga     1920

cttaaacatc cgcctcctca gatcctgatc aagaacactc ccgttcccgc taatcctccg     1980

gaggtgttta ctcctgccaa gtttgcttcg ttcatcacac agtacagcac cggacaagtc     2040

agcgtggaaa tcgagtggga gctgcagaag gaaaacagca agcgctggaa cccggagatt     2100

cagtacacct ccaactttga aaagcagact ggtgtggact ttgccgttga cagccagggt     2160

gtttactctg agcctcgccc tattggcact cgttacctca cccgtaatct gtaa           2214


<210> 100
<211> 4393
<212> DNA
<213> Adeno-associated virus 8

<400> 100
cagagaggga gtggccaact ccatcactag gggtagcgcg aagcgcctcc cacgctgccg       60

cgtcagcgct gacgtaaatt acgtcatagg ggagtggtcc tgtattagct gtcacgtgag      120

tgcttttgcg gcattttgcg acaccacgtg gccatttgag gtatatatgg ccgagtgagc      180

gagcaggatc tccattttga ccgcgaaatt tgaacgagca gcagccatgc cgggcttcta      240

cgagatcgtg atcaaggtgc cgagcgacct ggacgagcac ctgccgggca tttctgactc      300

gtttgtgaac tgggtggccg agaaggaatg ggagctgccc ccggattctg acatggatcg      360

gaatctgatc gagcaggcac ccctgaccgt ggccgagaag ctgcagcgcg acttcctggt      420

ccaatggcgc cgcgtgagta aggccccgga ggccctcttc tttgttcagt tcgagaaggg      480

cgagagctac tttcacctgc acgttctggt cgagaccacg ggggtcaagt ccatggtgct      540

aggccgcttc ctgagtcaga ttcgggaaaa gcttggtcca gaccatctac ccgcggggtc      600

gagccccacc ttgcccaact ggttcgcggt gaccaaagac gcggtaatgg cgccggcggg      660

ggggaacaag gtggtggacg agtgctacat ccccaactac ctcctgccca agactcagcc      720

cgagctgcag tgggcgtgga ctaacatgga ggagtatata agcgcgtgct tgaacctggc      780

cgagcgcaaa cggctcgtgg cgcagcacct gacccacgtc agccagacgc aggagcagaa      840

caaggagaat ctgaacccca attctgacgc gcccgtgatc aggtcaaaaa cctccgcgcg      900

ctatatggag ctggtcgggt ggctggtgga ccggggcatc acctccgaga agcagtggat      960

ccaggaggac caggcctcgt acatctcctt caacgccgcc tccaactcgc ggtcccagat     1020

caaggccgcg ctggacaatg ccggcaagat catggcgctg accaaatccg cgcccgacta     1080

cctggtgggg ccctcgctgc ccgcggacat tacccagaac cgcatctacc gcatcctcgc     1140

tctcaacggc tacgaccctg cctacgccgg ctccgtcttt ctcggctggg ctcagaaaaa     1200

gttcgggaaa cgcaacacca tctggctgtt tggacccgcc accaccggca agaccaacat     1260

tgcggaagcc atcgcccacg ccgtgccctt ctacggctgc gtcaactgga ccaatgagaa     1320

ctttcccttc aatgattgcg tcgacaagat ggtgatctgg tgggaggagg gcaagatgac     1380

ggccaaggtc gtggagtccg ccaaggccat tctcggcggc agcaaggtgc gcgtggacca     1440

aaagtgcaag tcgtccgccc agatcgaccc cacccccgtg atcgtcacct ccaacaccaa     1500

catgtgcgcc gtgattgacg ggaacagcac caccttcgag caccagcagc ctctccagga     1560

ccggatgttt aagttcgaac tcacccgccg tctggagcac gactttggca aggtgacaaa     1620

gcaggaagtc aaagagttct tccgctgggc cagtgatcac gtgaccgagg tggcgcatga     1680

gttttacgtc agaaagggcg gagccagcaa aagacccgcc cccgatgacg cggataaaag     1740

cgagcccaag cgggcctgcc cctcagtcgc ggatccatcg acgtcagacg cggaaggagc     1800

tccggtggac tttgccgaca ggtaccaaaa caaatgttct cgtcacgcgg gcatgcttca     1860

gatgctgttt ccctgcaaaa cgtgcgagag aatgaatcag aatttcaaca tttgcttcac     1920

acacggggtc agagactgct cagagtgttt ccccggcgtg tcagaatctc aaccggtcgt     1980

cagaaagagg acgtatcgga aactctgtgc gattcatcat ctgctggggc gggctcccga     2040

gattgcttgc tcggcctgcg atctggtcaa cgtggacctg gatgactgtg tttctgagca     2100

ataaatgact taaaccaggt atggctgccg atggttatct tccagattgg ctcgaggaca     2160

acctctctga gggcattcgc gagtggtggg cgctgaaacc tggagccccg aagcccaaag     2220

ccaaccagca aaagcaggac gacggccggg gtctggtgct tcctggctac aagtacctcg     2280

gacccttcaa cggactcgac aagggggagc ccgtcaacgc ggcggacgca gcggccctcg     2340

agcacgacaa ggcctacgac cagcagctgc aggcgggtga caatccgtac ctgcggtata     2400

accacgccga cgccgagttt caggagcgtc tgcaagaaga tacgtctttt gggggcaacc     2460

tcgggcgagc agtcttccag gccaagaagc gggttctcga acctctcggt ctggttgagg     2520

aaggcgctaa gacggctcct ggaaagaaga gaccggtaga gccatcaccc cagcgttctc     2580

cagactcctc tacgggcatc ggcaagaaag gccaacagcc cgccagaaaa agactcaatt     2640

ttggtcagac tggcgactca gagtcagttc cagaccctca acctctcgga gaacctccag     2700

cagcgccctc tggtgtggga cctaatacaa tggctgcagg cggtggcgca ccaatggcag     2760

acaataacga aggcgccgac ggagtgggta gttcctcggg aaattggcat tgcgattcca     2820

catggctggg cgacagagtc atcaccacca gcacccgaac ctgggccctg cccacctaca     2880

acaaccacct ctacaagcaa atctccaacg ggacatcggg aggagccacc aacgacaaca     2940

cctacttcgg ctacagcacc ccctgggggt attttgactt taacagattc cactgccact     3000

tttcaccacg tgactggcag cgactcatca acaacaactg gggattccgg cccaagagac     3060

tcagcttcaa gctcttcaac atccaggtca aggaggtcac gcagaatgaa ggcaccaaga     3120

ccatcgccaa taacctcacc agcaccatcc aggtgtttac ggactcggag taccagctgc     3180

cgtacgttct cggctctgcc caccagggct gcctgcctcc gttcccggcg gacgtgttca     3240

tgattcccca gtacggctac ctaacactca acaacggtag tcaggccgtg ggacgctcct     3300

ccttctactg cctggaatac tttccttcgc agatgctgag aaccggcaac aacttccagt     3360

ttacttacac cttcgaggac gtgcctttcc acagcagcta cgcccacagc cagagcttgg     3420

accggctgat gaatcctctg attgaccagt acctgtacta cttgtctcgg actcaaacaa     3480

caggaggcac ggcaaatacg cagactctgg gcttcagcca aggtgggcct aatacaatgg     3540

ccaatcaggc aaagaactgg ctgccaggac cctgttaccg ccaacaacgc gtctcaacga     3600

caaccgggca aaacaacaat agcaactttg cctggactgc tgggaccaaa taccatctga     3660

atggaagaaa ttcattggct aatcctggca tcgctatggc aacacacaaa gacgacgagg     3720

agcgtttttt tcccagtaac gggatcctga tttttggcaa acaaaatgct gccagagaca     3780

atgcggatta cagcgatgtc atgctcacca gcgaggaaga aatcaaaacc actaaccctg     3840

tggctacaga ggaatacggt atcgtggcag ataacttgca gcagcaaaac acggctcctc     3900

aaattggaac tgtcaacagc cagggggcct tacccggtat ggtctggcag aaccgggacg     3960

tgtacctgca gggtcccatc tgggccaaga ttcctcacac ggacggcaac ttccacccgt     4020

ctccgctgat gggcggcttt ggcctgaaac atcctccgcc tcagatcctg atcaagaaca     4080

cgcctgtacc tgcggatcct ccgaccacct tcaaccagtc aaagctgaac tctttcatca     4140

cgcaatacag caccggacag gtcagcgtgg aaattgaatg ggagctgcag aaggaaaaca     4200

gcaagcgctg gaaccccgag atccagtaca cctccaacta ctacaaatct acaagtgtgg     4260

actttgctgt taatacagaa ggcgtgtact ctgaaccccg ccccattggc acccgttacc     4320

tcacccgtaa tctgtaattg cctgttaatc aataaaccgg ttgattcgtt tcagttgaac     4380

tttggtctct gcg                                                        4393


<210> 101
<211> 1878
<212> DNA
<213> Adeno-associated virus 8

<400> 101
atgccgggct tctacgagat cgtgatcaag gtgccgagcg acctggacga gcacctgccg       60

ggcatttctg actcgtttgt gaactgggtg gccgagaagg aatgggagct gcccccggat      120

tctgacatgg atcggaatct gatcgagcag gcacccctga ccgtggccga gaagctgcag      180

cgcgacttcc tggtccaatg gcgccgcgtg agtaaggccc cggaggccct cttctttgtt      240

cagttcgaga agggcgagag ctactttcac ctgcacgttc tggtcgagac cacgggggtc      300

aagtccatgg tgctaggccg cttcctgagt cagattcggg aaaagcttgg tccagaccat      360

ctacccgcgg ggtcgagccc caccttgccc aactggttcg cggtgaccaa agacgcggta      420

atggcgccgg cgggggggaa caaggtggtg gacgagtgct acatccccaa ctacctcctg      480

cccaagactc agcccgagct gcagtgggcg tggactaaca tggaggagta tataagcgcg      540

tgcttgaacc tggccgagcg caaacggctc gtggcgcagc acctgaccca cgtcagccag      600

acgcaggagc agaacaagga gaatctgaac cccaattctg acgcgcccgt gatcaggtca      660

aaaacctccg cgcgctatat ggagctggtc gggtggctgg tggaccgggg catcacctcc      720

gagaagcagt ggatccagga ggaccaggcc tcgtacatct ccttcaacgc cgcctccaac      780

tcgcggtccc agatcaaggc cgcgctggac aatgccggca agatcatggc gctgaccaaa      840

tccgcgcccg actacctggt ggggccctcg ctgcccgcgg acattaccca gaaccgcatc      900

taccgcatcc tcgctctcaa cggctacgac cctgcctacg ccggctccgt ctttctcggc      960

tgggctcaga aaaagttcgg gaaacgcaac accatctggc tgtttggacc cgccaccacc     1020

ggcaagacca acattgcgga agccatcgcc cacgccgtgc ccttctacgg ctgcgtcaac     1080

tggaccaatg agaactttcc cttcaatgat tgcgtcgaca agatggtgat ctggtgggag     1140

gagggcaaga tgacggccaa ggtcgtggag tccgccaagg ccattctcgg cggcagcaag     1200

gtgcgcgtgg accaaaagtg caagtcgtcc gcccagatcg accccacccc cgtgatcgtc     1260

acctccaaca ccaacatgtg cgccgtgatt gacgggaaca gcaccacctt cgagcaccag     1320

cagcctctcc aggaccggat gtttaagttc gaactcaccc gccgtctgga gcacgacttt     1380

ggcaaggtga caaagcagga agtcaaagag ttcttccgct gggccagtga tcacgtgacc     1440

gaggtggcgc atgagtttta cgtcagaaag ggcggagcca gcaaaagacc cgcccccgat     1500

gacgcggata aaagcgagcc caagcgggcc tgcccctcag tcgcggatcc atcgacgtca     1560

gacgcggaag gagctccggt ggactttgcc gacaggtacc aaaacaaatg ttctcgtcac     1620

gcgggcatgc ttcagatgct gtttccctgc aaaacgtgcg agagaatgaa tcagaatttc     1680

aacatttgct tcacacacgg ggtcagagac tgctcagagt gtttccccgg cgtgtcagaa     1740

tctcaaccgg tcgtcagaaa gaggacgtat cggaaactct gtgcgattca tcatctgctg     1800

gggcgggctc ccgagattgc ttgctcggcc tgcgatctgg tcaacgtgga cctggatgac     1860

tgtgtttctg agcaataa                                                   1878


<210> 102
<211> 2217
<212> DNA
<213> Adeno-associated virus 8

<400> 102
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg cgctgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctgc aggcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc      480

ggcaagaaag gccaacagcc cgccagaaaa agactcaatt ttggtcagac tggcgactca      540

gagtcagttc cagaccctca acctctcgga gaacctccag cagcgccctc tggtgtggga      600

cctaatacaa tggctgcagg cggtggcgca ccaatggcag acaataacga aggcgccgac      660

ggagtgggta gttcctcggg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgaac ctgggccctg cccacctaca acaaccacct ctacaagcaa      780

atctccaacg ggacatcggg aggagccacc aacgacaaca cctacttcgg ctacagcacc      840

ccctgggggt attttgactt taacagattc cactgccact tttcaccacg tgactggcag      900

cgactcatca acaacaactg gggattccgg cccaagagac tcagcttcaa gctcttcaac      960

atccaggtca aggaggtcac gcagaatgaa ggcaccaaga ccatcgccaa taacctcacc     1020

agcaccatcc aggtgtttac ggactcggag taccagctgc cgtacgttct cggctctgcc     1080

caccagggct gcctgcctcc gttcccggcg gacgtgttca tgattcccca gtacggctac     1140

ctaacactca acaacggtag tcaggccgtg ggacgctcct ccttctactg cctggaatac     1200

tttccttcgc agatgctgag aaccggcaac aacttccagt ttacttacac cttcgaggac     1260

gtgcctttcc acagcagcta cgcccacagc cagagcttgg accggctgat gaatcctctg     1320

attgaccagt acctgtacta cttgtctcgg actcaaacaa caggaggcac ggcaaatacg     1380

cagactctgg gcttcagcca aggtgggcct aatacaatgg ccaatcaggc aaagaactgg     1440

ctgccaggac cctgttaccg ccaacaacgc gtctcaacga caaccgggca aaacaacaat     1500

agcaactttg cctggactgc tgggaccaaa taccatctga atggaagaaa ttcattggct     1560

aatcctggca tcgctatggc aacacacaaa gacgacgagg agcgtttttt tcccagtaac     1620

gggatcctga tttttggcaa acaaaatgct gccagagaca atgcggatta cagcgatgtc     1680

atgctcacca gcgaggaaga aatcaaaacc actaaccctg tggctacaga ggaatacggt     1740

atcgtggcag ataacttgca gcagcaaaac acggctcctc aaattggaac tgtcaacagc     1800

cagggggcct tacccggtat ggtctggcag aaccgggacg tgtacctgca gggtcccatc     1860

tgggccaaga ttcctcacac ggacggcaac ttccacccgt ctccgctgat gggcggcttt     1920

ggcctgaaac atcctccgcc tcagatcctg atcaagaaca cgcctgtacc tgcggatcct     1980

ccgaccacct tcaaccagtc aaagctgaac tctttcatca cgcaatacag caccggacag     2040

gtcagcgtgg aaattgaatg ggagctgcag aaggaaaaca gcaagcgctg gaaccccgag     2100

atccagtaca cctccaacta ctacaaatct acaagtgtgg actttgctgt taatacagaa     2160

ggcgtgtact ctgaaccccg ccccattggc acccgttacc tcacccgtaa tctgtaa        2217


<210> 103
<211> 6042
<212> DNA
<213> Adeno-associated virus 9

<400> 103
gcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcgta       60

atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat      120

ggcgattccg ttgcaatggc tggcggtaat attgttctgg atattaccag caaggccgat      180

agtttgagtt cttctactca ggcaagtgat gttattacta atcaaagaag tattgcgaca      240

acggttaatt tgcgtgatgg acagactctt ttactcggtg gcctcactga ttataaaaac      300

acttctcagg attctggcgt accgttcctg tctaaaatcc ctttaatcgg cctcctgttt      360

agctcccgct ctgattctaa cgaggaaagc acgttatacg tgctcgtcaa agcaaccata      420

gtacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac      480

cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc      540

cacgttcgcc ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt      600

tagtgcttta cggcacctcg accccaaaaa acttgattag ggtgatggtt cacgtagtgg      660

gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag      720

tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt      780

ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt      840

taacgcgaat tttaacaaaa tattaacgct tacaatttaa atatttgctt atacaatctt      900

cctgtttttg gggcttttct gattatcaac cggggtacat atgattgaca tgctagtttt      960

acgattaccg ttcatcgccc tgcgcgctcg ctcgctcact gaggccgccc gggcaaagcc     1020

cgggcgtcgg gcgacctttg gtcgcccggc ctcagtgagc gagcgagcgc gcagagaggg     1080

agtggaattc acgcgtggat ctgaattcaa ttcacgcgtg gtacctctgg tcgttacata     1140

acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat tgacgtcaat     1200

aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc aatgggtgga     1260

gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc caagtacgcc     1320

ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt acatgacctt     1380

atgggacttt cctacttggc agtacatcta ctcgaggcca cgttctgctt cactctcccc     1440

atctcccccc cctccccacc cccaattttg tatttattta ttttttaatt attttgtgca     1500

gcgatggggg cggggggggg gggggggcgc gcgccaggcg gggcggggcg gggcgagggg     1560

cggggcgggg cgaggcggag aggtgcggcg gcagccaatc agagcggcgc gctccgaaag     1620

tttcctttta tggcgaggcg gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg     1680

gcgggagcgg gatcagccac cgcggtggcg gcctagagtc gacgaggaac tgaaaaacca     1740

gaaagttaac tggtaagttt agtctttttg tcttttattt caggtcccgg atccggtggt     1800

ggtgcaaatc aaagaactgc tcctcagtgg atgttgcctt tacttctagg cctgtacgga     1860

agtgttactt ctgctctaaa agctgcggaa ttgtacccgc ggccgatcca ccggtccgga     1920

attcccggga tatcgtcgac ccacgcgtcc gggccccacg ctgcgcaccc gcgggtttgc     1980

tatggcgatg agcagcggcg gcagtggtgg cggcgtcccg gagcaggagg attccgtgct     2040

gttccggcgc ggcacaggcc agagcgatga ttctgacatt tgggatgata cagcactgat     2100

aaaagcatat gataaagctg tggcttcatt taagcatgct ctaaagaatg gtgacatttg     2160

tgaaacttcg ggtaaaccaa aaaccacacc taaaagaaaa cctgctaaga agaataaaag     2220

ccaaaagaag aatactgcag cttccttaca acagtggaaa gttggggaca aatgttctgc     2280

catttggtca gaagacggtt gcatttaccc agctaccatt gcttcaattg attttaagag     2340

agaaacctgt gttgtggttt acactggata tggaaataga gaggagcaaa atctgtccga     2400

tctactttcc ccaatctgtg aagtagctaa taatatagaa cagaatgctc aagagaatga     2460

aaatgaaagc caagtttcaa cagatgaaag tgagaactcc aggtctcctg gaaataaatc     2520

agataacatc aagcccaaat ctgctccatg gaactctttt ctccctccac caccccccat     2580

gccagggcca agactgggac caggaaagcc aggtctaaaa ttcaatggcc caccaccgcc     2640

accgccacca ccaccacccc acttactatc atgctggctg cctccatttc cttctggacc     2700

accaataatt cccccaccac ctcccatatg tccagattct cttgatgatg ctgatgcttt     2760

gggaagtatg ttaatttcat ggtacatgag tggctatcat actggctatt atatgggttt     2820

tagacaaaat caaaaagaag gaaggtgctc acattcctta aattaaggag aaatgctggc     2880

atagagcagc actaaatgac accactaaag aaacgatcag acagatctag aaagcttatc     2940

gataccgtcg actagagctc gctgatcagc ctcgactgtg ccttctagtt gccagccatc     3000

tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct     3060

ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg     3120

gggtggggtg gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg     3180

ggagagatcg atctgaggaa cccctagtga tggagttggc cactccctct ctgcgcgctc     3240

gctcgctcac tgaggccggg cgaccaaagg tcgcccgacg cccgggcttt gcccgggcgg     3300

cctcagtgag cgagcgagcg cgcagagagg gagtggcccc cccccccccc cccccggcga     3360

ttctcttgtt tgctccagac tctcaggcaa tgacctgata gcctttgtag agacctctca     3420

aaaatagcta ccctctccgg catgaattta tcagctagaa cggttgaata tcatattgat     3480

ggtgatttga ctgtctccgg cctttctcac ccgtttgaat ctttacctac acattactca     3540

ggcattgcat ttaaaatata tgagggttct aaaaattttt atccttgcgt tgaaataaag     3600

gcttctcccg caaaagtatt acagggtcat aatgtttttg gtacaaccga tttagcttta     3660

tgctctgagg ctttattgct taattttgct aattctttgc cttgcctgta tgatttattg     3720

gatgttggaa tcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc     3780

gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc cagccccgac     3840

acccgccaac actatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc     3900

agccccgaca cccgccaaca cccgctgacg cgccctgacg ggcttgtctg ctcccggcat     3960

ccgcttacag acaagctgtg accgtctccg ggagctgcat gtgtcagagg ttttcaccgt     4020

catcaccgaa acgcgcgaga cgaaagggcc tcgtgatacg cctattttta taggttaatg     4080

tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa     4140

cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac     4200

cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg     4260

tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc     4320

tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg     4380

atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga     4440

gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc     4500

aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag     4560

aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga     4620

gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg     4680

cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga     4740

atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt     4800

tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact     4860

ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt     4920

ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg     4980

ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta     5040

tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac     5100

tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta     5160

aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt     5220

tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt     5280

tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt     5340

gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc     5400

agataccaaa tactgttctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg     5460

tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg     5520

ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt     5580

cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac     5640

tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg     5700

acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg     5760

gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat     5820

ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt     5880

tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg     5940

attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa     6000

cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gc                        6042


<210> 104
<211> 4102
<212> DNA
<213> Adeno-associated virus 10

<400> 104
atgccgggct tctacgagat cgtgatcaag gtgccgagcg acctggacga gcacctgccg       60

ggcatttctg actcgtttgt gaactgggtg gccgagaagg aatgggagct gcccccggat      120

tctgacatgg atcggaatct gatcgagcag gcacccctga ccgtggccga gaagctgcag      180

cgcgacttcc tggtccactg gcgccgcgtg agtaaggccc cggaggccct cttctttgtt      240

cagttcgaga agggcgagtc ctactttcac ctgcacgttc tggtcgagac cacgggggtc      300

aagtccatgg tcctgggccg cttcctgagt cagatcagag acaggctggt gcagaccatc      360

taccgcgggg tagagcccac gctgcccaac tggttcgcgg tgaccaagac gcgaaatggc      420

gccggcgggg ggaacaaggt ggtggacgag tgctacatcc ccaactacct cctgcccaag      480

acgcagcccg agctgcagtg ggcgtggact aacatggagg agtatataag cgcgtgtctg      540

aacctcgcgg agcgtaaacg gctcgtggcg cagcacctga cccacgtcag ccagacgcag      600

gagcagaaca aggagaatct gaacccgaat tctgacgcgc ccgtgatcag gtcaaaaacc      660

tccgcgcgct acatggagct ggtcgggtgg ctggtggacc ggggcatcac ctccgagaag      720

cagtggatcc aggaggacca ggcctcgtac atctccttca acgccgcctc caactcgcgg      780

tcccagatca aggccgcgct ggacaatgcc ggaaagatca tggcgctgac caaatccgcg      840

cccgactacc tggtaggccc gtccttaccc gcggacatta aggccaaccg catctaccgc      900

atcctggagc tcaacggcta cgaccccgcc tacgccggct ccgtcttcct gggctgggcg      960

cagaaaaagt tcggtaaaag gaatacaatt tggctgttcg ggcccgccac caccggcaag     1020

accaacatcg cggaagccat cgcccacgcc gtgcccttct acggctgcgt caactggacc     1080

aatgagaact ttcccttcaa cgattgcgtc gacaagatgg tgatctggtg ggaggagggc     1140

aagatgaccg ccaaggtcgt ggagtccgcc aaggccattc tgggcggaag caaggtgcgc     1200

gtcgaccaaa agtgcaagtc ctcggcccag atcgacccca cgcccgtgat cgtcacctcc     1260

aacaccaaca tgtgcgccgt gatcgacggg aacagcacca ccttcgagca ccagcagccc     1320

ctgcaggacc gcatgttcaa gttcgagctc acccgccgtc tggagcacga ctttggcaag     1380

gtgaccaagc aggaagtcaa agagttcttc cgctgggctc aggatcacgt gactgaggtg     1440

acgcatgagt tctacgtcag aaagggcgga gccaccaaaa gacccgcccc cagtgacgcg     1500

gatataagcg agcccaagcg ggcctgcccc tcagttgcgg agccatcgac gtcagacgcg     1560

gaagcaccgg tggactttgc ggacaggtac caaaacaaat gttctcgtca cgcgggcatg     1620

cttcagatgc tgtttccctg caagacatgc gagagaatga atcagaattt caacgtctgc     1680

ttcacgcacg gggtcagaga ctgctcagag tgcttccccg gcgcgtcaga atctcaacct     1740

gtcgtcagaa aaaagacgta tcagaaactg tgcgcgattc atcatctgct ggggcgggca     1800

cccgagattg cgtgttcggc ctgcgatctc gtcaacgtgg acttggatga ctgtgtttct     1860

gagcaataaa tgacttaaac caggtatggc tgctgacggt tatcttccag attggctcga     1920

ggacaacctc tctgagggca ttcgcgagtg gtgggacctg aaacctggag cccccaagcc     1980

caaggccaac cagcagaagc aggacgacgg ccggggtctg gtgcttcctg gctacaagta     2040

cctcggaccc ttcaacggac tcgacaaggg ggagcccgtc aacgcggcgg acgcagcggc     2100

cctcgagcac gacaaggcct acgaccagca gctcaaagcg ggtgacaatc cgtacctgcg     2160

gtataaccac gccgacgccg agtttcagga gcgtctgcaa gaagatacgt cttttggggg     2220

caacctcggg cgagcagtct tccaggccaa gaagcgggtt ctcgaacctc tcggtctggt     2280

tgaggaagct gctaagacgg ctcctggaaa gaagagaccg gtagaaccgt cacctcagcg     2340

ttcccccgac tcctccacgg gcatcggcaa gaaaggccag cagcccgcta aaaagagact     2400

gaactttggg cagactggcg agtcagagtc agtccccgac cctcaaccaa tcggagaacc     2460

accagcaggc ccctctggtc tgggatctgg tacaatggct gcaggcggtg gcgctccaat     2520

ggcagacaat aacgaaggcg ccgacggagt gggtagttcc tcaggaaatt ggcattgcga     2580

ttccacatgg ctgggcgaca gagtcatcac caccagcacc cgaacctggg ccctgcccac     2640

ctacaacaac cacctctaca agcaaatctc caacgggaca tcgggaggaa gcaccaacga     2700

caacacctac ttcggctaca gcaccccctg ggggtatttt gacttcaaca gattccactg     2760

ccacttctca ccacgtgact ggcagcgact catcaacaac aactggggat tccggccaaa     2820

aagactcagc ttcaagctct tcaacatcca ggtcaaggag gtcacgcaga atgaaggcac     2880

caagaccatc gccaataacc ttaccagcac gattcaggta tttacggact cggaatacca     2940

gctgccgtac gtcctcggct ccgcgcacca gggctgcctg cctccgttcc cggcggatgt     3000

cttcatgatt ccccagtacg gctacctgac actgaacaat ggaagtcaag ccgtaggccg     3060

ttcctccttc tactgcctgg aatattttcc atctcaaatg ctgcgaactg gaaacaattt     3120

tgaattcagc tacaccttcg aggacgtgcc tttccacagc agctacgcac acagccagag     3180

cttggaccga ctgatgaatc ctctcattga ccagtacctg tactacttat ccagaactca     3240

gtccacagga ggaactcaag gtacccagca attgttattt tctcaagctg ggcctgcaaa     3300

catgtcggct caggccaaga actggctgcc tggaccttgc taccggcagc agcgagtctc     3360

cacgacactg tcgcaaaaca acaacagcaa ctttgcttgg actggtgcca ccaaatatca     3420

cctgaacgga agagactctc tggtgaatcc cggtgtcgcc atggcaaccc acaaggacga     3480

cgaggaacgc ttcttcccgt cgagcggagt cctgatgttt ggaaaacagg gtgctggaag     3540

agacaatgtg gactacagca gcgttatgct aacaagcgaa gaagaaatta aaaccactaa     3600

ccctgtagcc acagaacaat acggcgtggt ggctgacaac ttgcagcaag ccaatacagg     3660

gcctattgtg ggaaatgtca acagccaagg agccttacct ggcatggtct ggcagaaccg     3720

agacgtgtac ctgcagggtc ccatctgggc caagattcct cacacggacg gcaactttca     3780

cccgtctcct ctgatgggcg gctttggact taaacacccg cctccacaga tcctgatcaa     3840

gaacacgccg gtacctgcgg atcctccaac aacgttcagc caggcgaaat tggcttcctt     3900

catcacgcag tacagcaccg gacaggtcag cgtggaaatc gagtgggagc tgcagaagga     3960

gaacagcaaa cgctggaacc cagagattca gtacacttca aactactaca aatctacaaa     4020

tgtggacttt gctgtcaata cagagggaac ttattctgag cctcgcccca ttggtactcg     4080

ttatctgaca cgtaatctgt aa                                              4102


<210> 105
<211> 2217
<212> DNA
<213> Adeno-associated virus 10

<400> 105
atggctgctg acggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acctgaaacc tggagccccc aagcccaagg ccaaccagca gaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaagc gggttctcga acctctcggt ctggttgagg aagctgctaa gacggctcct      420

ggaaagaaga gaccggtaga accgtcacct cagcgttccc ccgactcctc cacgggcatc      480

ggcaagaaag gccagcagcc cgctaaaaag agactgaact ttgggcagac tggcgagtca      540

gagtcagtcc ccgaccctca accaatcgga gaaccaccag caggcccctc tggtctggga      600

tctggtacaa tggctgcagg cggtggcgct ccaatggcag acaataacga aggcgccgac      660

ggagtgggta gttcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgaac ctgggccctg cccacctaca acaaccacct ctacaagcaa      780

atctccaacg ggacatcggg aggaagcacc aacgacaaca cctacttcgg ctacagcacc      840

ccctgggggt attttgactt caacagattc cactgccact tctcaccacg tgactggcag      900

cgactcatca acaacaactg gggattccgg ccaaaaagac tcagcttcaa gctcttcaac      960

atccaggtca aggaggtcac gcagaatgaa ggcaccaaga ccatcgccaa taaccttacc     1020

agcacgattc aggtatttac ggactcggaa taccagctgc cgtacgtcct cggctccgcg     1080

caccagggct gcctgcctcc gttcccggcg gatgtcttca tgattcccca gtacggctac     1140

ctgacactga acaatggaag tcaagccgta ggccgttcct ccttctactg cctggaatat     1200

tttccatctc aaatgctgcg aactggaaac aattttgaat tcagctacac cttcgaggac     1260

gtgcctttcc acagcagcta cgcacacagc cagagcttgg accgactgat gaatcctctc     1320

attgaccagt acctgtacta cttatccaga actcagtcca caggaggaac tcaaggtacc     1380

cagcaattgt tattttctca agctgggcct gcaaacatgt cggctcaggc caagaactgg     1440

ctgcctggac cttgctaccg gcagcagcga gtctccacga cactgtcgca aaacaacaac     1500

agcaactttg cttggactgg tgccaccaaa tatcacctga acggaagaga ctctctggtg     1560

aatcccggtg tcgccatggc aacccacaag gacgacgagg aacgcttctt cccgtcgagc     1620

ggagtcctga tgtttggaaa acagggtgct ggaagagaca atgtggacta cagcagcgtt     1680

atgctaacaa gcgaagaaga aattaaaacc actaaccctg tagccacaga acaatacggc     1740

gtggtggctg acaacttgca gcaagccaat acagggccta ttgtgggaaa tgtcaacagc     1800

caaggagcct tacctggcat ggtctggcag aaccgagacg tgtacctgca gggtcccatc     1860

tgggccaaga ttcctcacac ggacggcaac tttcacccgt ctcctctgat gggcggcttt     1920

ggacttaaac acccgcctcc acagatcctg atcaagaaca cgccggtacc tgcggatcct     1980

ccaacaacgt tcagccaggc gaaattggct tccttcatca cgcagtacag caccggacag     2040

gtcagcgtgg aaatcgagtg ggagctgcag aaggagaaca gcaaacgctg gaacccagag     2100

attcagtaca cttcaaacta ctacaaatct acaaatgtgg actttgctgt caatacagag     2160

ggaacttatt ctgagcctcg ccccattggt actcgttatc tgacacgtaa tctgtaa        2217


<210> 106
<211> 4087
<212> DNA
<213> Adeno-associated virus 11

<400> 106
atgccgggct tctacgagat cgtgatcaag gtgccgagcg acctggacga gcacctgccg       60

ggcatttctg actcgtttgt gaactgggtg gccgagaagg aatgggagct gcccccggat      120

tctgacatgg atcggaatct gatcgagcag gcacccctga ccgtggccga gaagctgcag      180

cgcgacttcc tggtccactg gcgccgcgtg agtaaggccc cggaggccct cttctttgtt      240

cagttcgaga agggcgagtc ctacttccac ctccacgttc tcgtcgagac cacgggggtc      300

aagtccatgg tcctgggccg cttcctgagt cagatcagag acaggctggt gcagaccatc      360

taccgcgggg tcgagcccac gctgcccaac tggttcgcgg tgaccaagac gcgaaatggc      420

gccggcgggg ggaacaaggt ggtggacgag tgctacatcc ccaactacct cctgcccaag      480

acccagcccg agctgcagtg ggcgtggact aacatggagg agtatataag cgcgtgtcta      540

aacctcgcgg agcgtaaacg gctcgtggcg cagcacctga cccacgtcag ccagacgcag      600

gagcagaaca aggagaatct gaacccgaat tctgacgcgc ccgtgatcag gtcaaaaacc      660

tccgcgcgct acatggagct ggtcgggtgg ctggtggacc ggggcatcac ctccgagaag      720

cagtggatcc aggaggacca ggcctcgtac atctccttca acgccgcctc caactcgcgg      780

tcccagatca aggccgcgct ggacaatgcc ggaaagatca tggcgctgac caaatccgcg      840

cccgactacc tggtaggccc gtccttaccc gcggacatta aggccaaccg catctaccgc      900

atcctggagc tcaacggcta cgaccccgcc tacgccggct ccgtcttcct gggctgggcg      960

cagaaaaagt tcggtaaacg caacaccatc tggctgtttg ggcccgccac caccggcaag     1020

accaacatcg cggaagccat agcccacgcc gtgcccttct acggctgcgt gaactggacc     1080

aatgagaact ttcccttcaa cgattgcgtc gacaagatgg tgatctggtg ggaggagggc     1140

aagatgaccg ccaaggtcgt ggagtccgcc aaggccattc tgggcggaag caaggtgcgc     1200

gtggaccaaa agtgcaagtc ctcggcccag atcgacccca cgcccgtgat cgtcacctcc     1260

aacaccaaca tgtgcgccgt gatcgacggg aacagcacca ccttcgagca ccagcagccg     1320

ctgcaggacc gcatgttcaa gttcgagctc acccgccgtc tggagcacga ctttggcaag     1380

gtgaccaagc aggaagtcaa agagttcttc cgctgggctc aggatcacgt gactgaggtg     1440

gcgcatgagt tctacgtcag aaagggcgga gccaccaaaa gacccgcccc cagtgacgcg     1500

gatataagcg agcccaagcg ggcctgcccc tcagttccgg agccatcgac gtcagacgcg     1560

gaagcaccgg tggactttgc ggacaggtac caaaacaaat gttctcgtca cgcgggcatg     1620

cttcagatgc tgtttccctg caagacatgc gagagaatga atcagaattt caacgtctgc     1680

ttcacgcacg gggtcagaga ctgctcagag tgcttccccg gcgcgtcaga atctcaaccc     1740

gtcgtcagaa aaaagacgta tcagaaactg tgcgcgattc atcatctgct ggggcgggca     1800

cccgagattg cgtgttcggc ctgcgatctc gtcaacgtgg acttggatga ctgtgtttct     1860

gagcaataaa tgacttaaac caggtatggc tgctgacggt tatcttccag attggctcga     1920

ggacaacctc tctgagggca ttcgcgagtg gtgggacctg aaacctggag ccccgaagcc     1980

caaggccaac cagcagaagc aggacgacgg ccggggtctg gtgcttcctg gctacaagta     2040

cctcggaccc ttcaacggac tcgacaaggg ggagcccgtc aacgcggcgg acgcagcggc     2100

cctcgagcac gacaaggcct acgaccagca gctcaaagcg ggtgacaatc cgtacctgcg     2160

gtataaccac gccgacgccg agtttcagga gcgtctgcaa gaagatacgt cttttggggg     2220

caacctcggg cgagcagtct tccaggccaa gaagagggta ctcgaacctc tgggcctggt     2280

tgaagaaggt gctaaaacgg ctcctggaaa gaagagaccg ttagagtcac cacaagagcc     2340

cgactcctcc tcgggcatcg gcaaaaaagg caaacaacca gccagaaaga ggctcaactt     2400

tgaagaggac actggagccg gagacggacc ccctgaagga tcagatacca gcgccatgtc     2460

ttcagacatt gaaatgcgtg cagcaccggg cggaaatgct gtcgatgcgg gacaaggttc     2520

cgatggagtg ggtaatgcct cgggtgattg gcattgcgat tccacctggt ctgagggcaa     2580

ggtcacaaca acctcgacca gaacctgggt cttgcccacc tacaacaacc acttgtacct     2640

gcgtctcgga acaacatcaa gcagcaacac ctacaacgga ttctccaccc cctggggata     2700

ttttgacttc aacagattcc actgtcactt ctcaccacgt gactggcaaa gactcatcaa     2760

caacaactgg ggactacgac caaaagccat gcgcgttaaa atcttcaata tccaagttaa     2820

ggaggtcaca acgtcgaacg gcgagactac ggtcgctaat aaccttacca gcacggttca     2880

gatatttgcg gactcgtcgt atgagctccc gtacgtgatg gacgctggac aagaggggag     2940

cctgcctcct ttccccaatg acgtgttcat ggtgcctcaa tatggctact gtggcatcgt     3000

gactggcgag aatcagaacc aaacggacag aaacgctttc tactgcctgg agtattttcc     3060

ttcgcaaatg ttgagaactg gcaacaactt tgaaatggct tacaactttg agaaggtgcc     3120

gttccactca atgtatgctc acagccagag cctggacaga ctgatgaatc ccctcctgga     3180

ccagtacctg tggcacttac agtcgactac ctctggagag actctgaatc aaggcaatgc     3240

agcaaccaca tttggaaaaa tcaggagtgg agactttgcc ttttacagaa agaactggct     3300

gcctgggcct tgtgttaaac agcagagatt ctcaaaaact gccagtcaaa attacaagat     3360

tcctgccagc gggggcaacg ctctgttaaa gtatgacacc cactatacct taaacaaccg     3420

ctggagcaac atcgcgcccg gacctccaat ggccacagcc ggaccttcgg atggggactt     3480

cagtaacgcc cagcttatat tccctggacc atctgttacc ggaaatacaa caacttcagc     3540

caacaatctg ttgtttacat cagaagaaga aattgctgcc accaacccaa gagacacgga     3600

catgtttggc cagattgctg acaataatca gaatgctaca actgctccca taaccggcaa     3660

cgtgactgct atgggagtgc tgcctggcat ggtgtggcaa aacagagaca tttactacca     3720

agggccaatt tgggccaaga tcccacacgc ggacggacat tttcatcctt caccgctgat     3780

tggtgggttt ggactgaaac acccgcctcc ccagatattc atcaagaaca ctcccgtacc     3840

tgccaatcct gcgacaacct tcactgcagc cagagtggac tctttcatca cacaatacag     3900

caccggccag gtcgctgttc agattgaatg ggaaattgaa aaggaacgct ccaaacgctg     3960

gaatcctgaa gtgcagttta cttcaaacta tgggaaccag tcttctatgt tgtgggctcc     4020

tgatacaact gggaagtata cagagccgcg ggttattggc tctcgttatt tgactaatca     4080

tttgtaa                                                               4087


<210> 107
<211> 2202
<212> DNA
<213> Adeno-associated virus 11

<400> 107
atggctgctg acggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acctgaaacc tggagccccg aagcccaagg ccaaccagca gaagcaggac      120

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaagaaga gggtactcga acctctgggc ctggttgaag aaggtgctaa aacggctcct      420

ggaaagaaga gaccgttaga gtcaccacaa gagcccgact cctcctcggg catcggcaaa      480

aaaggcaaac aaccagccag aaagaggctc aactttgaag aggacactgg agccggagac      540

ggaccccctg aaggatcaga taccagcgcc atgtcttcag acattgaaat gcgtgcagca      600

ccgggcggaa atgctgtcga tgcgggacaa ggttccgatg gagtgggtaa tgcctcgggt      660

gattggcatt gcgattccac ctggtctgag ggcaaggtca caacaacctc gaccagaacc      720

tgggtcttgc ccacctacaa caaccacttg tacctgcgtc tcggaacaac atcaagcagc      780

aacacctaca acggattctc caccccctgg ggatattttg acttcaacag attccactgt      840

cacttctcac cacgtgactg gcaaagactc atcaacaaca actggggact acgaccaaaa      900

gccatgcgcg ttaaaatctt caatatccaa gttaaggagg tcacaacgtc gaacggcgag      960

actacggtcg ctaataacct taccagcacg gttcagatat ttgcggactc gtcgtatgag     1020

ctcccgtacg tgatggacgc tggacaagag gggagcctgc ctcctttccc caatgacgtg     1080

ttcatggtgc ctcaatatgg ctactgtggc atcgtgactg gcgagaatca gaaccaaacg     1140

gacagaaacg ctttctactg cctggagtat tttccttcgc aaatgttgag aactggcaac     1200

aactttgaaa tggcttacaa ctttgagaag gtgccgttcc actcaatgta tgctcacagc     1260

cagagcctgg acagactgat gaatcccctc ctggaccagt acctgtggca cttacagtcg     1320

actacctctg gagagactct gaatcaaggc aatgcagcaa ccacatttgg aaaaatcagg     1380

agtggagact ttgcctttta cagaaagaac tggctgcctg ggccttgtgt taaacagcag     1440

agattctcaa aaactgccag tcaaaattac aagattcctg ccagcggggg caacgctctg     1500

ttaaagtatg acacccacta taccttaaac aaccgctgga gcaacatcgc gcccggacct     1560

ccaatggcca cagccggacc ttcggatggg gacttcagta acgcccagct tatattccct     1620

ggaccatctg ttaccggaaa tacaacaact tcagccaaca atctgttgtt tacatcagaa     1680

gaagaaattg ctgccaccaa cccaagagac acggacatgt ttggccagat tgctgacaat     1740

aatcagaatg ctacaactgc tcccataacc ggcaacgtga ctgctatggg agtgctgcct     1800

ggcatggtgt ggcaaaacag agacatttac taccaagggc caatttgggc caagatccca     1860

cacgcggacg gacattttca tccttcaccg ctgattggtg ggtttggact gaaacacccg     1920

cctccccaga tattcatcaa gaacactccc gtacctgcca atcctgcgac aaccttcact     1980

gcagccagag tggactcttt catcacacaa tacagcaccg gccaggtcgc tgttcagatt     2040

gaatgggaaa ttgaaaagga acgctccaaa cgctggaatc ctgaagtgca gtttacttca     2100

aactatggga accagtcttc tatgttgtgg gctcctgata caactgggaa gtatacagag     2160

ccgcgggtta ttggctctcg ttatttgact aatcatttgt aa                        2202


<210> 108
<211> 4213
<212> DNA
<213> Adeno-associated virus 12

<400> 108
ttgcgacagt ttgcgacacc atgtggtcac aagaggtata taaccgcgag tgagccagcg       60

aggagctcca ttttgcccgc gaagtttgaa cgagcagcag ccatgccggg gttctacgag      120

gtggtgatca aggtgcccag cgacctggac gagcacctgc ccggcatttc tgactccttt      180

gtgaactggg tggccgagaa ggaatgggag ttgcccccgg attctgacat ggatcagaat      240

ctgattgagc aggcacccct gaccgtggcc gagaagctgc agcgcgagtt cctggtggaa      300

tggcgccgag tgagtaaatt tctggaggcc aagttttttg tgcagtttga aaagggggac      360

tcgtactttc atttgcatat tctgattgaa attaccggcg tgaaatccat ggtggtgggc      420

cgctacgtga gtcagattag ggataaactg atccagcgca tctaccgcgg ggtcgagccc      480

cagctgccca actggttcgc ggtcacaaag acccgaaatg gcgccggagg cgggaacaag      540

gtggtggacg agtgctacat ccccaactac ctgctcccca aggtccagcc cgagcttcag      600

tgggcgtgga ctaacatgga ggagtatata agcgcctgtt tgaacctcgc ggagcgtaaa      660

cggctcgtgg cgcagcacct gacgcacgtc tcccagaccc aggagggcga caaggagaat      720

ctgaacccga attctgacgc gccggtgatc cggtcaaaaa cctccgccag gtacatggag      780

ctggtcgggt ggctggtgga caagggcatc acgtccgaga agcagtggat ccaggaggac      840

caggcctcgt acatctcctt caacgcggcc tccaactccc ggtcgcagat caaggcggcc      900

ctggacaatg cctccaaaat catgagcctc accaaaacgg ctccggacta tctcatcggg      960

cagcagcccg tgggggacat taccaccaac cggatctaca aaatcctgga actgaacggg     1020

tacgaccccc agtacgccgc ctccgtcttt ctcggctggg cccagaaaaa gtttggaaag     1080

cgcaacacca tctggctgtt tgggcccgcc accaccggca agaccaacat cgcggaagcc     1140

atcgcccacg cggtcccctt ctacggctgc gtcaactgga ccaatgagaa ctttcccttc     1200

aacgactgcg tcgacaaaat ggtgatttgg tgggaggagg gcaagatgac cgccaaggtc     1260

gtagagtccg ccaaggccat tctgggcggc agcaaggtgc gcgtggacca aaaatgcaag     1320

gcctctgcgc agatcgaccc cacccccgtg atcgtcacct ccaacaccaa catgtgcgcc     1380

gtgattgacg ggaacagcac caccttcgag caccagcagc ccctgcagga ccggatgttc     1440

aagtttgaac tcacccgccg cctcgaccac gactttggca aggtcaccaa gcaggaagtc     1500

aaggactttt tccggtgggc ggctgatcac gtgactgacg tggctcatga gttttacgtc     1560

acaaagggtg gagctaagaa aaggcccgcc ccctctgacg aggatataag cgagcccaag     1620

cggccgcgcg tgtcatttgc gcagccggag acgtcagacg cggaagctcc cggagacttc     1680

gccgacaggt accaaaacaa atgttctcgt cacgcgggta tgctgcagat gctctttccc     1740

tgcaagacgt gcgagagaat gaatcagaat tccaacgtct gcttcacgca cggtcagaaa     1800

gattgcgggg agtgctttcc cgggtcagaa tctcaaccgg tttctgtcgt cagaaaaacg     1860

tatcagaaac tgtgcatcct tcatcagctc cggggggcac ccgagatcgc ctgctctgct     1920

tgcgaccaac tcaaccccga tttggacgat tgccaatttg agcaataaat gactgaaatc     1980

aggtatggct gctgacggtt atcttccaga ttggctcgag gacaacctct ctgaaggcat     2040

tcgcgagtgg tgggcgctga aacctggagc tccacaaccc aaggccaacc aacagcatca     2100

ggacaacggc aggggtcttg tgcttcctgg gtacaagtac ctcggaccct tcaacggact     2160

cgacaaggga gagccggtca acgaggcaga cgccgcggcc ctcgagcacg acaaggccta     2220

cgacaagcag ctcgagcagg gggacaaccc gtatctcaag tacaaccacg ccgacgccga     2280

gttccagcag cgcttggcga ccgacacctc ttttgggggc aacctcgggc gagcagtctt     2340

ccaggccaaa aagaggattc tcgagcctct gggtctggtt gaagagggcg ttaaaacggc     2400

tcctggaaag aaacgcccat tagaaaagac tccaaatcgg ccgaccaacc cggactctgg     2460

gaaggccccg gccaagaaaa agcaaaaaga cggcgaacca gccgactctg ctagaaggac     2520

actcgacttt gaagactctg gagcaggaga cggaccccct gagggatcat cttccggaga     2580

aatgtctcat gatgctgaga tgcgtgcggc gccaggcgga aatgctgtcg aggcgggaca     2640

aggtgccgat ggagtgggta atgcctccgg tgattggcat tgcgattcca cctggtcaga     2700

gggccgagtc accaccacca gcacccgaac ctgggtccta cccacgtaca acaaccacct     2760

gtacctgcga atcggaacaa cggccaacag caacacctac aacggattct ccaccccctg     2820

gggatacttt gactttaacc gcttccactg ccacttttcc ccacgcgact ggcagcgact     2880

catcaacaac aactggggac tcaggccgaa atcgatgcgt gttaaaatct tcaacataca     2940

ggtcaaggag gtcacgacgt caaacggcga gactacggtc gctaataacc ttaccagcac     3000

ggttcagatc tttgcggatt cgacgtatga actcccatac gtgatggacg ccggtcagga     3060

ggggagcttt cctccgtttc ccaacgacgt ctttatggtt ccccaatacg gatactgcgg     3120

agttgtcact ggaaaaaacc agaaccagac agacagaaat gccttttact gcctggaata     3180

ctttccatcc caaatgctaa gaactggcaa caattttgaa gtcagttacc aatttgaaaa     3240

agttcctttc cattcaatgt acgcgcacag ccagagcctg gacagaatga tgaatccttt     3300

actggatcag tacctgtggc atctgcaatc gaccactacc ggaaattccc ttaatcaagg     3360

aacagctacc accacgtacg ggaaaattac cactggagac tttgcctact acaggaaaaa     3420

ctggttgcct ggagcctgca ttaaacaaca aaaattttca aagaatgcca atcaaaacta     3480

caagattccc gccagcgggg gagacgccct tttaaagtat gacacgcata ccactctaaa     3540

tgggcgatgg agtaacatgg ctcctggacc tccaatggca accgcaggtg ccggggactc     3600

ggattttagc aacagccagc tgatctttgc cggacccaat ccgagcggta acacgaccac     3660

atcttcaaac aatttgttgt ttacctcaga agaggagatt gccacaacaa acccacgaga     3720

cacggacatg tttggacaga ttgcagataa taatcaaaat gccaccaccg cccctcacat     3780

cgctaacctg gacgctatgg gaattgttcc cggaatggtc tggcaaaaca gagacatcta     3840

ctaccagggc cctatttggg ccaaggtccc tcacacggac ggacactttc acccttcgcc     3900

gctgatggga ggatttggac tgaaacaccc gcctccacag attttcatca aaaacacccc     3960

cgtacccgcc aatcccaata ctacctttag cgctgcaagg attaattctt ttctgacgca     4020

gtacagcacc ggacaagttg ccgttcagat cgactgggaa attcagaagg agcattccaa     4080

acgctggaat cccgaagttc aatttacttc aaactacggc actcaaaatt ctatgctgtg     4140

ggctcccgac aatgctggca actaccacga actccgggct attgggtccc gtttcctcac     4200

ccaccacttg taa                                                        4213


<210> 109
<211> 1866
<212> DNA
<213> Adeno-associated virus 12

<400> 109
atgccggggt tctacgaggt ggtgatcaag gtgcccagcg acctggacga gcacctgccc       60

ggcatttctg actcctttgt gaactgggtg gccgagaagg aatgggagtt gcccccggat      120

tctgacatgg atcagaatct gattgagcag gcacccctga ccgtggccga gaagctgcag      180

cgcgagttcc tggtggaatg gcgccgagtg agtaaatttc tggaggccaa gttttttgtg      240

cagtttgaaa agggggactc gtactttcat ttgcatattc tgattgaaat taccggcgtg      300

aaatccatgg tggtgggccg ctacgtgagt cagattaggg ataaactgat ccagcgcatc      360

taccgcgggg tcgagcccca gctgcccaac tggttcgcgg tcacaaagac ccgaaatggc      420

gccggaggcg ggaacaaggt ggtggacgag tgctacatcc ccaactacct gctccccaag      480

gtccagcccg agcttcagtg ggcgtggact aacatggagg agtatataag cgcctgtttg      540

aacctcgcgg agcgtaaacg gctcgtggcg cagcacctga cgcacgtctc ccagacccag      600

gagggcgaca aggagaatct gaacccgaat tctgacgcgc cggtgatccg gtcaaaaacc      660

tccgccaggt acatggagct ggtcgggtgg ctggtggaca agggcatcac gtccgagaag      720

cagtggatcc aggaggacca ggcctcgtac atctccttca acgcggcctc caactcccgg      780

tcgcagatca aggcggccct ggacaatgcc tccaaaatca tgagcctcac caaaacggct      840

ccggactatc tcatcgggca gcagcccgtg ggggacatta ccaccaaccg gatctacaaa      900

atcctggaac tgaacgggta cgacccccag tacgccgcct ccgtctttct cggctgggcc      960

cagaaaaagt ttggaaagcg caacaccatc tggctgtttg ggcccgccac caccggcaag     1020

accaacatcg cggaagccat cgcccacgcg gtccccttct acggctgcgt caactggacc     1080

aatgagaact ttcccttcaa cgactgcgtc gacaaaatgg tgatttggtg ggaggagggc     1140

aagatgaccg ccaaggtcgt agagtccgcc aaggccattc tgggcggcag caaggtgcgc     1200

gtggaccaaa aatgcaaggc ctctgcgcag atcgacccca cccccgtgat cgtcacctcc     1260

aacaccaaca tgtgcgccgt gattgacggg aacagcacca ccttcgagca ccagcagccc     1320

ctgcaggacc ggatgttcaa gtttgaactc acccgccgcc tcgaccacga ctttggcaag     1380

gtcaccaagc aggaagtcaa ggactttttc cggtgggcgg ctgatcacgt gactgacgtg     1440

gctcatgagt tttacgtcac aaagggtgga gctaagaaaa ggcccgcccc ctctgacgag     1500

gatataagcg agcccaagcg gccgcgcgtg tcatttgcgc agccggagac gtcagacgcg     1560

gaagctcccg gagacttcgc cgacaggtac caaaacaaat gttctcgtca cgcgggtatg     1620

ctgcagatgc tctttccctg caagacgtgc gagagaatga atcagaattc caacgtctgc     1680

ttcacgcacg gtcagaaaga ttgcggggag tgctttcccg ggtcagaatc tcaaccggtt     1740

tctgtcgtca gaaaaacgta tcagaaactg tgcatccttc atcagctccg gggggcaccc     1800

gagatcgcct gctctgcttg cgaccaactc aaccccgatt tggacgattg ccaatttgag     1860

caataa                                                                1866


<210> 110
<211> 2229
<212> DNA
<213> Adeno-associated virus 12

<400> 110
atggctgctg acggttatct tccagattgg ctcgaggaca acctctctga aggcattcgc       60

gagtggtggg cgctgaaacc tggagctcca caacccaagg ccaaccaaca gcatcaggac      120

aacggcaggg gtcttgtgct tcctgggtac aagtacctcg gacccttcaa cggactcgac      180

aagggagagc cggtcaacga ggcagacgcc gcggccctcg agcacgacaa ggcctacgac      240

aagcagctcg agcaggggga caacccgtat ctcaagtaca accacgccga cgccgagttc      300

cagcagcgct tggcgaccga cacctctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggattctcga gcctctgggt ctggttgaag agggcgttaa aacggctcct      420

ggaaagaaac gcccattaga aaagactcca aatcggccga ccaacccgga ctctgggaag      480

gccccggcca agaaaaagca aaaagacggc gaaccagccg actctgctag aaggacactc      540

gactttgaag actctggagc aggagacgga ccccctgagg gatcatcttc cggagaaatg      600

tctcatgatg ctgagatgcg tgcggcgcca ggcggaaatg ctgtcgaggc gggacaaggt      660

gccgatggag tgggtaatgc ctccggtgat tggcattgcg attccacctg gtcagagggc      720

cgagtcacca ccaccagcac ccgaacctgg gtcctaccca cgtacaacaa ccacctgtac      780

ctgcgaatcg gaacaacggc caacagcaac acctacaacg gattctccac cccctgggga      840

tactttgact ttaaccgctt ccactgccac ttttccccac gcgactggca gcgactcatc      900

aacaacaact ggggactcag gccgaaatcg atgcgtgtta aaatcttcaa catacaggtc      960

aaggaggtca cgacgtcaaa cggcgagact acggtcgcta ataaccttac cagcacggtt     1020

cagatctttg cggattcgac gtatgaactc ccatacgtga tggacgccgg tcaggagggg     1080

agctttcctc cgtttcccaa cgacgtcttt atggttcccc aatacggata ctgcggagtt     1140

gtcactggaa aaaaccagaa ccagacagac agaaatgcct tttactgcct ggaatacttt     1200

ccatcccaaa tgctaagaac tggcaacaat tttgaagtca gttaccaatt tgaaaaagtt     1260

cctttccatt caatgtacgc gcacagccag agcctggaca gaatgatgaa tcctttactg     1320

gatcagtacc tgtggcatct gcaatcgacc actaccggaa attcccttaa tcaaggaaca     1380

gctaccacca cgtacgggaa aattaccact ggagactttg cctactacag gaaaaactgg     1440

ttgcctggag cctgcattaa acaacaaaaa ttttcaaaga atgccaatca aaactacaag     1500

attcccgcca gcgggggaga cgccctttta aagtatgaca cgcataccac tctaaatggg     1560

cgatggagta acatggctcc tggacctcca atggcaaccg caggtgccgg ggactcggat     1620

tttagcaaca gccagctgat ctttgccgga cccaatccga gcggtaacac gaccacatct     1680

tcaaacaatt tgttgtttac ctcagaagag gagattgcca caacaaaccc acgagacacg     1740

gacatgtttg gacagattgc agataataat caaaatgcca ccaccgcccc tcacatcgct     1800

aacctggacg ctatgggaat tgttcccgga atggtctggc aaaacagaga catctactac     1860

cagggcccta tttgggccaa ggtccctcac acggacggac actttcaccc ttcgccgctg     1920

atgggaggat ttggactgaa acacccgcct ccacagattt tcatcaaaaa cacccccgta     1980

cccgccaatc ccaatactac ctttagcgct gcaaggatta attcttttct gacgcagtac     2040

agcaccggac aagttgccgt tcagatcgac tgggaaattc agaaggagca ttccaaacgc     2100

tggaatcccg aagttcaatt tacttcaaac tacggcactc aaaattctat gctgtgggct     2160

cccgacaatg ctggcaacta ccacgaactc cgggctattg ggtcccgttt cctcacccac     2220

cacttgtaa                                                             2229


<210> 111
<211> 675
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 111
atgagtgtga ttaaaccaga catgaagatc aagctgcgta tggaaggcgc tgtaaatgga       60

cacccgttcg cgattgaagg agttggcctt gggaagcctt tcgagggaaa acagagtatg      120

gaccttaaag tcaaagaagg cggacctctg cctttcgcct atgacatctt gacaactgtg      180

ttctgttacg gcaacagggt attcgccaaa tacccagaaa atatagtaga ctatttcaag      240

cagtcgtttc ctgagggcta ctcttgggaa cgaagcatga attacgaaga cgggggcatt      300

tgtaacgcga caaacgacat aaccctggat ggtgactgtt atatctatga aattcgattt      360

gatggtgtga actttcctgc caatggtcca gttatgcaga agaggactgt gaaatgggag      420

ccatccactg agaaattgta tgtgcgtgat ggagtgctga agggtgatgt taacatggct      480

ctgtcgcttg aaggaggtgg ccattaccga tgtgacttca aaactactta taaagctaag      540

aaggttgtcc agttgccaga ctatcacttt gtggaccacc acattgagat taaaagccac      600

gacaaagatt acagtaatgt taatctgcat gagcatgccg aagcgcattc tgagctgccg      660

aggcaggcca agtaa                                                       675


<210> 112
<211> 224
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 112
Met Ser Val Ile Lys Pro Asp Met Lys Ile Lys Leu Arg Met Glu Gly 
1               5                   10                  15      


Ala Val Asn Gly His Pro Phe Ala Ile Glu Gly Val Gly Leu Gly Lys 
            20                  25                  30          


Pro Phe Glu Gly Lys Gln Ser Met Asp Leu Lys Val Lys Glu Gly Gly 
        35                  40                  45              


Pro Leu Pro Phe Ala Tyr Asp Ile Leu Thr Thr Val Phe Cys Tyr Gly 
    50                  55                  60                  


Asn Arg Val Phe Ala Lys Tyr Pro Glu Asn Ile Val Asp Tyr Phe Lys 
65                  70                  75                  80  


Gln Ser Phe Pro Glu Gly Tyr Ser Trp Glu Arg Ser Met Asn Tyr Glu 
                85                  90                  95      


Asp Gly Gly Ile Cys Asn Ala Thr Asn Asp Ile Thr Leu Asp Gly Asp 
            100                 105                 110         


Cys Tyr Ile Tyr Glu Ile Arg Phe Asp Gly Val Asn Phe Pro Ala Asn 
        115                 120                 125             


Gly Pro Val Met Gln Lys Arg Thr Val Lys Trp Glu Pro Ser Thr Glu 
    130                 135                 140                 


Lys Leu Tyr Val Arg Asp Gly Val Leu Lys Gly Asp Val Asn Met Ala 
145                 150                 155                 160 


Leu Ser Leu Glu Gly Gly Gly His Tyr Arg Cys Asp Phe Lys Thr Thr 
                165                 170                 175     


Tyr Lys Ala Lys Lys Val Val Gln Leu Pro Asp Tyr His Phe Val Asp 
            180                 185                 190         


His His Ile Glu Ile Lys Ser His Asp Lys Asp Tyr Ser Asn Val Asn 
        195                 200                 205             


Leu His Glu His Ala Glu Ala His Ser Glu Leu Pro Arg Gln Ala Lys 
    210                 215                 220                 


<210> 113
<211> 1528
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 113
gtgggatctc tgtgcaaagc tacaatggag atctattgta tgaaccgtgg gagatatact       60

gcgaaaaggg caaacctttt acgagtttca attcttactg gaagaaatgc ttagatatgt      120

cgattgaatc cgttatgctt cctcctcctt ggcggttgat gccaataact gcaggtaaaa      180

ccttaaagag tattacttaa aagctaaaac gtttttgatt tcttcaggac ataagcggta      240

gtaaaagttt atggcttttt ctttgttagc ggctgaagcg atttgggcgt gttcgattga      300

agaactaggg ctggagaatg aggccgagaa accgagcaat gcgttgttaa ctagagcttg      360

gtctccagga tggagcaatg ctgataagtt actaaatgag ttcatcgaga agcagttgat      420

agattatgca aagaacagca agaaagttgt tgggaattct acttcactac tttctccgta      480

tctccatttc ggggaaataa gcgtcagaca cgttttccag tgtgcccgga tgaaacaaat      540

tatatgggca agagataaga acagtgaagg agaagaaagt gcagatcttt ttcttagggg      600

aatcggttta agagagtatt ctcggtatat atgtttcaac ttcccgttta ctcacgagca      660

atcgttgttg agtcatcttc ggtttttccc ttgggatgct gatgttgata agttcaaggc      720

ctggagacaa ggcaggaccg gttatccgtt ggtggatgcc ggaatgagag agctttgggc      780

taccggatgg atgcataaca gaataagagt gattgtttca agctttgctg tgaagtttct      840

tctccttcca tggaaatggg gaatgaagta tttctgggat acacttttgg atgctgattt      900

ggaatgtgac atccttggct ggcagtatat ctctgggagt atccccgatg gccacgagct      960

tgatcgcttg gacaatcccg cggtaaacta caaaacttgt cttatagttt agaattcaaa     1020

gcttaatacc agtttttgct atgcattcgt tttttatttt atttttcagc ttatttggtt     1080

ttggttgatt tagttctgaa gtctatgaaa actctgtttt tatttcagtt acaaggcgcc     1140

aaatatgacc cagaaggtga gtacataagg caatggcttc ccgagcttgc gagattgcca     1200

actgaatgga tccatcatcc atgggacgct cctttaaccg tactcaaagc ttctggtgtg     1260

gaactcggaa caaactatgc gaaacccatt gtagacatcg acacagctcg tgagctacta     1320

gctaaagcta tttcaagaac ccgtgaagca cagatcatga tcggagcagc acctgatgag     1380

attgtagcag atagcttcga ggccttaggg gctaatacca ttaaagaacc tggtctttgc     1440

ccatctgtgt cttctaatga ccaacaagta ccttcggctg ttcgttacaa cgggtcaaag     1500

agagtgaaac ctgaggaaga agaagaga                                        1528


<210> 114
<211> 7582
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 114
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tgggtaccgg      660

gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcctgc agcccggggg      720

atccactagt tctagagtcc tgtattagag gtcacgtgag tgttttgcga cattttgcga      780

caccatgtgg tcacgctggg tatttaagcc cgagtgagca cgcagggtct ccattttgaa      840

gcgggaggtt tgaacgcgca gccgccatgc cggggtttta cgagattgtg attaaggtcc      900

ccagcgacct tgacgagcat ctgcccggca tttctgacag ctttgtgaac tgggtggccg      960

agaaggaatg ggagttgccg ccagattctg acatggatct gaatctgatt gagcaggcac     1020

ccctgaccgt ggccgagaag ctgcagcgcg actttctgac ggaatggcgc cgtgtgagta     1080

aggccccgga ggcccttttc tttgtgcaat ttgagaaggg agagagctac ttccacatgc     1140

acgtgctcgt ggaaaccacc ggggtgaaat ccatggtttt gggacgtttc ctgagtcaga     1200

ttcgcgaaaa actgattcag agaatttacc gcgggatcga gccgactttg ccaaactggt     1260

tcgcggtcac aaagaccaga aatggcgccg gaggcgggaa caaggtggtg gatgagtgct     1320

acatccccaa ttacttgctc cccaaaaccc agcctgagct ccagtgggcg tggactaata     1380

tggaacagta tttaagcgcc tgtttgaatc tcacggagcg taaacggttg gtggcgcagc     1440

atctgacgca cgtgtcgcag acgcaggagc agaacaaaga gaatcagaat cccaattctg     1500

atgcgccggt gatcagatca aaaacttcag ccaggtacat ggagctggtc gggtggctcg     1560

tggacaaggg gattacctcg gagaagcagt ggatccagga ggaccaggcc tcatacatct     1620

ccttcaatgc ggcctccaac tcgcggtccc aaatcaaggc tgccttggac aatgcgggaa     1680

agattatgag cctgactaaa accgcccccg actacctggt gggccagcag cccgtggagg     1740

acatttccag caatcggatt tataaaattt tggaactaaa cgggtacgat ccccaatatg     1800

cggcttccgt ctttctggga tgggccacga aaaagttcgg caagaggaac accatctggc     1860

tgtttgggcc tgcaactacc gggaagacca acatcgcgga ggccatagcc cacactgtgc     1920

ccttctacgg gtgcgtaaac tggaccaatg agaactttcc cttcaacgac tgtgtcgaca     1980

agatggtgat ctggtgggag gaggggaaga tgaccgccaa ggtcgtggag tcggccaaag     2040

ccattctcgg aggaagcaag gtgcgcgtgg accagaaatg caagtcctcg gcccagatag     2100

acccgactcc cgtgatcgtc acctccaaca ccaacatgtg cgccgtgatt gacgggaact     2160

caacgacctt cgaacaccag cagccgttgc aagaccggat gttcaaattt gaactcaccc     2220

gccgtctgga tcatgacttt gggaaggtca ccaagcagga agtcaaagac tttttccggt     2280

gggcaaagga tcacgtggtt gaggtggagc atgaattcta cgtcaaaaag ggtggagcca     2340

agaaaagacc cgcccccagt gacgcagata taagtgagcc caaacgggtg cgcgagtcag     2400

ttgcgcagcc atcgacgtca gacgcggaag cttcgatcaa ctacgcagac aggtaccaaa     2460

acaaatgttc tcgtcacgtg ggcatgaatc tgatgctgtt tccctgcaga caatgcgaga     2520

gaatgaatca gaattcaaat atctgcttca ctcacggaca gaaagactgt ttagagtgct     2580

ttcccgtgtc agaatctcaa cccgtttctg tcgtcaaaaa ggcgtatcag aaactgtgct     2640

acattcatca tatcatggga aaggtgccag acgcttgcac tgcctgcgat ctggtcaatg     2700

tggatttgga tgactgcatc tttgaacaat aaatgattta aatcaggtat ggctgccgat     2760

ggttatcttc cagattggct cgaggacact ctctctgaag gaataagaca gtggtggaag     2820

ctcaaacctg gcccaccacc accaaagccc gcagagcggc ataaggacga cagcaggggt     2880

cttgtgcttc ctgggtacaa gtacctcgga cccttcaacg gactcgacaa gggagagccg     2940

gtcaacgagg cagacgccgc ggccctcgag cacgacaaag cctacgaccg gcagctcgac     3000

agcggagaca acccgtacct caagtacaac cacgccgacg cggagtttca ggagcgcctt     3060

aaagaagata cgtcttttgg gggcaacctc ggacgagcag tcttccaggc gaaaaagagg     3120

gttcttgaac ctctgggcct ggttgaggaa cctgttaaga tgcggccgat gatgttcctt     3180

cctactgatt attgttgcag actgagcgac caggaataca tggaactcgt cttcgagaac     3240

ggacagatac tcgcaaaagg ccagaggtca aatgttagtc tccataatca gcggacgaaa     3300

agcatcatgg atctgtatga ggccgaatac aacgaagatt ttatgaaaag tattatccat     3360

ggagggggtg gcgctattac caacctggga gatacccaag tggtcccaca gtcccacgta     3420

gcagccgctc acgagaccaa tatgctggag tccaacaaac acgtagacac gcgtgctccg     3480

ggaaaaaaga ggccggtaga gcactctcct gtggagccag actcctcctc gggaaccgga     3540

aaggcgggcc agcagcctgc aagaaaaaga ttgaattttg gtcagactgg agacgcagac     3600

tcagtacctg acccccagcc tctcggacag ccaccagcag ccccctctgg tctgggaact     3660

aatacgctgg ctacaggcag tggcgcacca ctggcagaca ataacgaggg cgccgacgga     3720

gtgggtaatt cctcgggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc     3780

accaccagca cccgaacctg ggccctgccc acctacaaca accacctcta caaacaaatt     3840

tccagccaat caggagcctc gaacgacaat cactactttg gctacagcac cccttggggg     3900

tattttgact tcaacagatt ccactgccac ttttcaccac gtgactggca aagactcatc     3960

aacaacaact ggggattccg acccaagaga ctcaacttca agctctttaa cattcaagtc     4020

aaagaggtca cgcagaatga cggtacgacg acgattgcca ataaccttac cagcacggtt     4080

caggtgttta ctgactcgga gtaccagctc ccgtacgtcc tcggctcggc gcatcaagga     4140

tgcctcccgc cgttcccagc agacgtcttc atggtgccac agtatggata cctcaccctg     4200

aacaacggga gtcaggcagt aggacgctct tcattttact gcctggagta ctttccttct     4260

cagatgctgc gtaccggaaa caactttacc ttcagctaca cttttgagga cgttcctttc     4320

cacagcagct acgctcacag ccagagtctg gaccgtctca tgaatcctct catcgaccag     4380

tacctgtatt acttgagcag aacaaacact ccaagtggaa ccaccacgca gtcaaggctt     4440

cagttttctc aggccggagc gagtgacatt cgggaccagt ctaggaactg gcttcctgga     4500

ccctgttacc gccagcagcg agtatcaaag acatctgcgg ataacaacaa cagtgaatac     4560

tcgtggactg gagctaccaa gtaccacctc aatggcagag actctctggt gaatccgggc     4620

ccggccatgg caagccacaa ggacgatgaa gaaaagtttt ttcctcagag cggggttctc     4680

atctttggga agcaaggctc agagaaaaca aatgtggaca ttgaaaaggt catgattaca     4740

gacgaagagg aaatcaggac aaccaatccc gtggctacgg agcagtatgg ttctgtatct     4800

accaacctcc agagaggcaa cagacaagca gctaccgcag atgtcaacac acaaggcgtt     4860

cttccaggca tggtctggca ggacagagat gtgtaccttc aggggcccat ctgggcaaag     4920

attccacaca cggacggaca ttttcacccc tctcccctca tgggtggatt cggacttaaa     4980

caccctcctc cacagattct catcaagaac accccggtac ctgcgaatcc ttcgaccacc     5040

ttcagtgcgg caaagtttgc ttccttcatc acacagtact ccacgggaca ggtcagcgtg     5100

gagatcgagt gggagctgca gaaggaaaac agcaaacgct ggaatcccga aattcagtac     5160

acttccaact acaacaagtc tgttaatgtg gactttactg tggacactaa tggcgtgtat     5220

tcagagcctc gccccattgg caccagatac ctgactcgta atctgtaatt gcttgttaat     5280

caataaaccg tttaattcgt ttcagttgaa ctttggtctc tgcgtatttc tttcttatct     5340

agtttccatg ctctagagcg gccgccaccg cggtggagct ccagcttttg ttccctttag     5400

tgagggttaa ttgcgcgctt ggcgtaatca tggtcatagc tgtttcctgt gtgaaattgt     5460

tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa agcctggggt     5520

gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc tttccagtcg     5580

ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag aggcggtttg     5640

cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg     5700

cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat     5760

aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc     5820

gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc     5880

tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga     5940

agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt     6000

ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg     6060

taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc     6120

gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg     6180

gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc     6240

ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg     6300

ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc     6360

gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct     6420

caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt     6480

taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa     6540

aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa     6600

tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc     6660

tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct     6720

gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca     6780

gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt     6840

aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt     6900

gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc     6960

ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc     7020

tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt     7080

atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact     7140

ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc     7200

ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt     7260

ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg     7320

atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct     7380

gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa     7440

tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt     7500

ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc     7560

acatttcccc gaaaagtgcc ac                                              7582


<210> 115
<211> 2793
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 115
atgggtgctt caggtgtatc tggtgttggt ggttctggtg gtggaagagg tggaggtaga       60

ggaggtgaag aagaaccatc aagtagtcat acacctaaca atcgtagagg tggtgagcaa      120

gctcaatcat caggtacaaa atcattacgt ccaagaagta atactgaatc aatgtcaaaa      180

gcaattcaac aatacacagt agatgctaga ttacacgccg tattcgaaca atctggagaa      240

agtggtaaga gttttgatta ctcacaatca ttgaaaacaa ccacttatgg tagttcagtt      300

ccagaacaac aaatcactgc atatcttagt agaatacaac gtggtggtta cattcaacca      360

tttggttgta tgattgcagt tgatgaatct tcttttagaa tcattggtta ttcagaaaat      420

gcaagagaaa tgttgggtat catgccacaa tcagtaccaa ccttagaaaa accagaaatt      480

cttgcaatgg gtacagatgt tagaagtttg tttacatcat catcatcaat tcttttggag      540

agagcttttg ttgcacgtga aatcacttta cttaatccag tatggattca tagtaagaat      600

actggaaagc cattctatgc aattcttcat agaatagatg taggagttgt tattgatctt      660

gagccagcaa gaacagaaga tccagcatta tctattgctg gtgcagtaca atcacaaaaa      720

cttgctgtta gagcaattag tcaattacaa gccttgccag gtggtgatat aaaacttctt      780

tgtgatacag ttgttgaatc agttcgtgat cttaccggtt atgatagagt tatggtatac      840

aaattccatg aggatgaaca tggtgaagtt gttgcagaaa gtaaaagaga tgatcttgaa      900

ccatacattg gtttgcatta tccagctact gatattccac aagcatcaag atttcttttc      960

aaacaaaatc gtgttagaat gattgtagat tgtaatgcca ccccagtatt agttgttcaa     1020

gatgatagat tgacacaaag tatgtgttta gtaggttcaa cattaagagc acctcatgga     1080

tgtcattcac aatatatggc caatatgggt tcaatagcat cattagctat ggcagtaatc     1140

atcaatggaa atgaagatga tggttcaaat gttgcatcag gtagaagttc aatgcgttta     1200

tggggtttag tagtttgtca tcatacaagt tctcgttgta tcccatttcc tttacgttat     1260

gcatgtgaat ttcttatgca agcatttggt ttacaattga atatggaact tcaattagca     1320

ttacaaatga gtgaaaagag agttttacgt acacaaacat tgttatgcga tatgttattg     1380

agagattctc cagctggtat tgttactcaa tcaccatcta tcatggatct tgtaaagtgt     1440

gatggtgcag cattcttata ccacggaaag tactatccat taggtgttgc accatctgaa     1500

gttcaaatca aagatgttgt agaatggtta ttggctaatc acgcagattc tactggttta     1560

tcaactgatt ctcttggtga tgctggttat cctggtgccg cagccttagg agatgctgta     1620

tgtggtatgg ccgttgctta cattacaaaa agagatttct tgttttggtt tcgttctcat     1680

acagctaaag agatcaaatg gggtggtgca aaacatcatc cagaagataa ggatgatggt     1740

caaagaatgc atccaagatc atcatttcaa gcattcttag aagtagttaa gtcaagaagt     1800

caaccttggg aaacagcaga aatggatgca atacattcat tacaattgat acttcgtgat     1860

tcattcaaag aatcagaagc agcaatgaat agtaaagttg ttgatggtgt tgttcaacca     1920

tgtagagata tggccggtga acaaggtatt gatgaattag gtgctgtagc tagagaaatg     1980

gttagattga tagaaactgc cactgttcca atcttcgctg ttgatgctgg tggatgcata     2040

aacggttgga atgctaagat cgcagaattg accggtttgt cagttgaaga agctatgggt     2100

aaaagtttag tttcagattt gatctataag gaaaatgaag caaccgttaa caaattgtta     2160

tcaagagcat tgagaggaga tgaggaaaag aatgtagaag ttaagttaaa gacattttca     2220

ccagagttac aaggtaaagc agtttttgtt gtagttaatg cttgttcatc aaaagattac     2280

ttgaataaca ttgtaggtgt ttgttttgtt ggtcaagatg taacttcaca aaagattgtt     2340

atggataagt ttatcaatat ccaaggtgat tacaaagcta ttgttcattc tccaaatcca     2400

ttgattccac caatctttgc agctgatgag aatacatgtt gtttagaatg gaatatggca     2460

atggaaaagt taactggttg gtcacgttca gaagtaattg gtaagatgat tgttggagag     2520

gtttttggta gttgttgtat gcttaaaggt ccagatgctt taactaagtt tatgattgtt     2580

ttgcataatg caattggtgg tcaagataca gataagttcc cattcccttt cttcgataga     2640

aatggaaagt ttgttcaagc attacttact gctaacaaaa gagtatcatt agaaggtaaa     2700

gtaataggag ctttttgttt cttacaaatt ccttcaccag aattacaaca agctcttgca     2760

gtaggtggta gtcatcatca tcatcatcat taa                                  2793


<210> 116
<211> 930
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 116
Met Gly Ala Ser Gly Val Ser Gly Val Gly Gly Ser Gly Gly Gly Arg 
1               5                   10                  15      


Gly Gly Gly Arg Gly Gly Glu Glu Glu Pro Ser Ser Ser His Thr Pro 
            20                  25                  30          


Asn Asn Arg Arg Gly Gly Glu Gln Ala Gln Ser Ser Gly Thr Lys Ser 
        35                  40                  45              


Leu Arg Pro Arg Ser Asn Thr Glu Ser Met Ser Lys Ala Ile Gln Gln 
    50                  55                  60                  


Tyr Thr Val Asp Ala Arg Leu His Ala Val Phe Glu Gln Ser Gly Glu 
65                  70                  75                  80  


Ser Gly Lys Ser Phe Asp Tyr Ser Gln Ser Leu Lys Thr Thr Thr Tyr 
                85                  90                  95      


Gly Ser Ser Val Pro Glu Gln Gln Ile Thr Ala Tyr Leu Ser Arg Ile 
            100                 105                 110         


Gln Arg Gly Gly Tyr Ile Gln Pro Phe Gly Cys Met Ile Ala Val Asp 
        115                 120                 125             


Glu Ser Ser Phe Arg Ile Ile Gly Tyr Ser Glu Asn Ala Arg Glu Met 
    130                 135                 140                 


Leu Gly Ile Met Pro Gln Ser Val Pro Thr Leu Glu Lys Pro Glu Ile 
145                 150                 155                 160 


Leu Ala Met Gly Thr Asp Val Arg Ser Leu Phe Thr Ser Ser Ser Ser 
                165                 170                 175     


Ile Leu Leu Glu Arg Ala Phe Val Ala Arg Glu Ile Thr Leu Leu Asn 
            180                 185                 190         


Pro Val Trp Ile His Ser Lys Asn Thr Gly Lys Pro Phe Tyr Ala Ile 
        195                 200                 205             


Leu His Arg Ile Asp Val Gly Val Val Ile Asp Leu Glu Pro Ala Arg 
    210                 215                 220                 


Thr Glu Asp Pro Ala Leu Ser Ile Ala Gly Ala Val Gln Ser Gln Lys 
225                 230                 235                 240 


Leu Ala Val Arg Ala Ile Ser Gln Leu Gln Ala Leu Pro Gly Gly Asp 
                245                 250                 255     


Ile Lys Leu Leu Cys Asp Thr Val Val Glu Ser Val Arg Asp Leu Thr 
            260                 265                 270         


Gly Tyr Asp Arg Val Met Val Tyr Lys Phe His Glu Asp Glu His Gly 
        275                 280                 285             


Glu Val Val Ala Glu Ser Lys Arg Asp Asp Leu Glu Pro Tyr Ile Gly 
    290                 295                 300                 


Leu His Tyr Pro Ala Thr Asp Ile Pro Gln Ala Ser Arg Phe Leu Phe 
305                 310                 315                 320 


Lys Gln Asn Arg Val Arg Met Ile Val Asp Cys Asn Ala Thr Pro Val 
                325                 330                 335     


Leu Val Val Gln Asp Asp Arg Leu Thr Gln Ser Met Cys Leu Val Gly 
            340                 345                 350         


Ser Thr Leu Arg Ala Pro His Gly Cys His Ser Gln Tyr Met Ala Asn 
        355                 360                 365             


Met Gly Ser Ile Ala Ser Leu Ala Met Ala Val Ile Ile Asn Gly Asn 
    370                 375                 380                 


Glu Asp Asp Gly Ser Asn Val Ala Ser Gly Arg Ser Ser Met Arg Leu 
385                 390                 395                 400 


Trp Gly Leu Val Val Cys His His Thr Ser Ser Arg Cys Ile Pro Phe 
                405                 410                 415     


Pro Leu Arg Tyr Ala Cys Glu Phe Leu Met Gln Ala Phe Gly Leu Gln 
            420                 425                 430         


Leu Asn Met Glu Leu Gln Leu Ala Leu Gln Met Ser Glu Lys Arg Val 
        435                 440                 445             


Leu Arg Thr Gln Thr Leu Leu Cys Asp Met Leu Leu Arg Asp Ser Pro 
    450                 455                 460                 


Ala Gly Ile Val Thr Gln Ser Pro Ser Ile Met Asp Leu Val Lys Cys 
465                 470                 475                 480 


Asp Gly Ala Ala Phe Leu Tyr His Gly Lys Tyr Tyr Pro Leu Gly Val 
                485                 490                 495     


Ala Pro Ser Glu Val Gln Ile Lys Asp Val Val Glu Trp Leu Leu Ala 
            500                 505                 510         


Asn His Ala Asp Ser Thr Gly Leu Ser Thr Asp Ser Leu Gly Asp Ala 
        515                 520                 525             


Gly Tyr Pro Gly Ala Ala Ala Leu Gly Asp Ala Val Cys Gly Met Ala 
    530                 535                 540                 


Val Ala Tyr Ile Thr Lys Arg Asp Phe Leu Phe Trp Phe Arg Ser His 
545                 550                 555                 560 


Thr Ala Lys Glu Ile Lys Trp Gly Gly Ala Lys His His Pro Glu Asp 
                565                 570                 575     


Lys Asp Asp Gly Gln Arg Met His Pro Arg Ser Ser Phe Gln Ala Phe 
            580                 585                 590         


Leu Glu Val Val Lys Ser Arg Ser Gln Pro Trp Glu Thr Ala Glu Met 
        595                 600                 605             


Asp Ala Ile His Ser Leu Gln Leu Ile Leu Arg Asp Ser Phe Lys Glu 
    610                 615                 620                 


Ser Glu Ala Ala Met Asn Ser Lys Val Val Asp Gly Val Val Gln Pro 
625                 630                 635                 640 


Cys Arg Asp Met Ala Gly Glu Gln Gly Ile Asp Glu Leu Gly Ala Val 
                645                 650                 655     


Ala Arg Glu Met Val Arg Leu Ile Glu Thr Ala Thr Val Pro Ile Phe 
            660                 665                 670         


Ala Val Asp Ala Gly Gly Cys Ile Asn Gly Trp Asn Ala Lys Ile Ala 
        675                 680                 685             


Glu Leu Thr Gly Leu Ser Val Glu Glu Ala Met Gly Lys Ser Leu Val 
    690                 695                 700                 


Ser Asp Leu Ile Tyr Lys Glu Asn Glu Ala Thr Val Asn Lys Leu Leu 
705                 710                 715                 720 


Ser Arg Ala Leu Arg Gly Asp Glu Glu Lys Asn Val Glu Val Lys Leu 
                725                 730                 735     


Lys Thr Phe Ser Pro Glu Leu Gln Gly Lys Ala Val Phe Val Val Val 
            740                 745                 750         


Asn Ala Cys Ser Ser Lys Asp Tyr Leu Asn Asn Ile Val Gly Val Cys 
        755                 760                 765             


Phe Val Gly Gln Asp Val Thr Ser Gln Lys Ile Val Met Asp Lys Phe 
    770                 775                 780                 


Ile Asn Ile Gln Gly Asp Tyr Lys Ala Ile Val His Ser Pro Asn Pro 
785                 790                 795                 800 


Leu Ile Pro Pro Ile Phe Ala Ala Asp Glu Asn Thr Cys Cys Leu Glu 
                805                 810                 815     


Trp Asn Met Ala Met Glu Lys Leu Thr Gly Trp Ser Arg Ser Glu Val 
            820                 825                 830         


Ile Gly Lys Met Ile Val Gly Glu Val Phe Gly Ser Cys Cys Met Leu 
        835                 840                 845             


Lys Gly Pro Asp Ala Leu Thr Lys Phe Met Ile Val Leu His Asn Ala 
    850                 855                 860                 


Ile Gly Gly Gln Asp Thr Asp Lys Phe Pro Phe Pro Phe Phe Asp Arg 
865                 870                 875                 880 


Asn Gly Lys Phe Val Gln Ala Leu Leu Thr Ala Asn Lys Arg Val Ser 
                885                 890                 895     


Leu Glu Gly Lys Val Ile Gly Ala Phe Cys Phe Leu Gln Ile Pro Ser 
            900                 905                 910         


Pro Glu Leu Gln Gln Ala Leu Ala Val Gly Gly Ser His His His His 
        915                 920                 925             


His His 
    930 


<210> 117
<211> 6006
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 117
agctagcttc tgtggaatgt gtgtcagtta gggtgtggaa agtccccagg ctccccagca       60

ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa ccaggtgtgg aaagtcccca      120

ggctccccag caggcagaag tatgcaaagc atgcatctca attagtcagc aaccatagtc      180

ccgcccctaa ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc      240

catggctgac taattttttt tatttatgca gaggccgagg ccgcctcggc ctctgagcta      300

ttccagaagt agtgaggagg cttttttgga ggcctaggct tttgcaaaaa gctccctcga      360

ggaactggaa aaccagaaag ttaactggta agtttagtct ttttgtcttt tatttcaggt      420

cccggatcga attgcggccg cccaccatgg tttccggagt cgggggtagt ggcggtggcc      480

gtggcggtgg ccgtggcgga gaagaagaac cgtcgtcaag tcacactcct aataaccgaa      540

gaggaggaga acaagctcaa tcgtcgggaa cgaaatctct cagaccaaga agcaacactg      600

aatcaatgag caaagcaatt caacagtaca ccgtcgacgc aagactccac gccgttttcg      660

aacaatccgg cgaatcaggg aaatcattcg actactcaca atcactcaaa acgacgacgt      720

acggttcctc tgtacctgag caacagatca cagcttatct ctctcgaatc cagcgaggtg      780

gttacattca gcctttcgga tgtatgatcg ccgtcgatga atccagtttc cggatcatcg      840

gttacagtga aaacgccaga gaaatgttag ggattatgcc tcaatctgtt cctactcttg      900

agaaacctga gattctagct atgggaactg atgtgagatc tttgttcact tcttcgagct      960

cgattctact cgagcgtgct ttcgttgctc gagagattac cttgttaaat ccggtttgga     1020

tccattccaa gaatactggt aaaccgtttt acgccattct tcataggatt gatgttggtg     1080

ttgttattga tttagagcca gctagaactg aagatcctgc gctttctatt gctggtgctg     1140

ttcaatcgca gaaactcgcg gttcgtgcga tttctcagtt acaggctctt cctggtggag     1200

atattaagct tttgtgtgac actgtcgtgg aaagtgtgag ggacttgact ggttatgatc     1260

gtgttatggt ttataagttt catgaagatg agcatggaga agttgtagct gagagtaaac     1320

gagatgattt agagccttat attggactgc attatcctgc tactgatatt cctcaagcgt     1380

caaggttctt gtttaagcag aaccgtgtcc gaatgatagt agattgcaat gccacacctg     1440

ttcttgtggt ccaggacgat aggctaactc agtctatgtg cttggttggt tctactctta     1500

gggctcctca tggttgtcac tctcagtata tggctaacat gggatctatt gcgtctttag     1560

caatggcggt tataatcaat ggaaatgaag atgatgggag caatgtagct agtggaagaa     1620

gctcgatgag gctttggggt ttggttgttt gccatcacac ttcttctcgc tgcataccgt     1680

ttccgctaag gtatgcttgt gagtttttga tgcaggcttt cggtttacag ttaaacatgg     1740

aattgcagtt agctttgcaa atgtcagaga aacgcgtttt gagaacgcag acactgttat     1800

gtgatatgct tctgcgtgac tcgcctgctg gaattgttac acagagtccc agtatcatgg     1860

acttagtgaa atgtgacggt gcagcatttc tttaccacgg gaagtattac ccgttgggtg     1920

ttgctcctag tgaagttcag ataaaagatg ttgtggagtg gttgcttgcg aatcatgcgg     1980

attcaaccgg attaagcact gatagtttag gcgatgcggg gtatcccggt gcagctgcgt     2040

taggggatgc tgtgtgcggt atggcagttg catatatcac aaaaagagac tttctttttt     2100

ggtttcgatc tcacactgcg aaagaaatca aatggggagg cgctaagcat catccggagg     2160

ataaagatga tgggcaacga atgcatcctc gttcgtcctt tcaggctttt cttgaagttg     2220

ttaagagccg gagtcagcca tgggaaactg cggaaatgga tgcgattcac tcgctccagc     2280

ttattctgag agactctttt aaagaatctg aggcggctat gaactctaaa gttgtggatg     2340

gtgtggttca gccatgtagg gatatggcgg gggaacaggg gattgatgag ttaggtgcag     2400

ttgcaagaga gatggttagg ctcattgaga ctgcaactgt tcctatattc gctgtggatg     2460

ccggaggctg catcaatgga tggaacgcta agattgcaga gttgacaggt ctctcagttg     2520

aagaagctat ggggaagtct ctggtttctg atttaatata caaagagaat gaagcaactg     2580

tcaataagct tctttctcgt gctttgagag gggacgagga aaagaatgtg gaggttaagc     2640

tgaaaacttt cagccccgaa ctacaaggga aagcagtttt tgtggttgtg aatgcttgtt     2700

ccagcaagga ctacttgaac aacattgtcg gcgtttgttt tgttggacaa gacgttacta     2760

gtcagaaaat cgtaatggat aagttcatca acatacaagg agattacaag gctattgtac     2820

atagcccaaa ccctctaatc ccgccaattt ttgctgctga cgagaacacg tgctgcctgg     2880

aatggaacat ggcgatggaa aagcttacgg gttggtctcg cagtgaagtg attgggaaaa     2940

tgattgtcgg ggaagtgttt gggagctgtt gcatgctaaa gggtcctgat gctttaacca     3000

agttcatgat tgtattgcat aatgcgattg gtggccaaga tacggataag ttccctttcc     3060

cattctttga ccgcaatggg aagtttgttc aggctctatt gactgcaaac aagcgggtta     3120

gcctcgaggg aaaggttatt ggggctttct gtttcttgca aatcccgagc gaattcgata     3180

gtgctggtag tgctggtagt gctggttccg cgtacagccg cgcgcgtacg aaaaacaatt     3240

acgggtctac catcgagggc ctgctcgatc tcccggacga cgacgccccc gaagaggcgg     3300

ggctggcggc tccgcgcctg tcctttctcc ccgcgggaca cacgcgcaga ctgtcgacgg     3360

cccccccgac cgatgtcagc ctgggggacg agctccactt agacggcgag gacgtggcga     3420

tggcgcatgc cgacgcgcta gacgatttcg atctggacat gttgggggac ggggattccc     3480

cgggtccggg atttaccccc cacgactccg ccccctacgg cgctctggat atggccgact     3540

tcgagtttga gcagatgttt accgatgccc ttggaattga cgagtacggt gggtaggggg     3600

cgcgaggatc ctctagagtc gacctgcagc ccaagcttcg atccagacat gataagatac     3660

attgatgagt ttggacaaac cacaactaga atgcagtgaa aaaaatgctt tatttgtgaa     3720

atttgtgatg ctattgcttt atttgtaacc attataagct gcaataaaca agttaacaac     3780

aacaattgca ttcattttat gtttcaggtt cagggggagg tgtgggaggt tttttaaagc     3840

aagtaaaacc tctacaaatg tggtatggct gattatgatc ctgcctcgcg cgtttcggtg     3900

atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag     3960

cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg     4020

gcgcagccat gacccagtca cgtagcgata gcggagtgta tactggctta actatgcggc     4080

atcagagcag attgtactga gagtgcacca tatgtcgggc cgcgttgctg gcgtttttcc     4140

ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa     4200

acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc     4260

ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg     4320

cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc     4380

tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc     4440

gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca     4500

ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact     4560

acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg     4620

gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt     4680

ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct     4740

tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga     4800

gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa     4860

tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac     4920

ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga     4980

taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc     5040

cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca     5100

gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta     5160

gagtaagtag ttcgccagtt aatagtgcgc aacgttgttg ccattgctac aggcatcgtg     5220

gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga     5280

gttacatgat cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt     5340

gtcagaagta agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct     5400

cttactgtca tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca     5460

ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaac acgggataat     5520

accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga     5580

aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc     5640

aactgatctt cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg     5700

caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc     5760

ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt     5820

gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca     5880

cctgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag gcgtatcacg     5940

aggccctttc gtcttcaaga attggtcgat cgaccaattc tcatgtttga cagcttatca     6000

tcgata                                                                6006


<210> 118
<211> 6012
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 118
agcttctgtg gaatgtgtgt cagttagggt gtggaaagtc cccaggctcc ccagcaggca       60

gaagtatgca aagcatgcat ctcaattagt cagcaaccag gtgtggaaag tccccaggct      120

ccccagcagg cagaagtatg caaagcatgc atctcaatta gtcagcaacc atagtcccgc      180

ccctaactcc gcccatcccg cccctaactc cgcccagttc cgcccattct ccgccccatg      240

gctgactaat tttttttatt tatgcagagg ccgaggccgc ctcggcctct gagctattcc      300

agaagtagtg aggaggcttt tttggaggcc taggcttttg caaaaagctc cctcgaggaa      360

ctggaaaacc agaaagttaa ctggtaagtt tagtcttttt gtcttttatt tcaggtcccg      420

gatcgaattg cggccgccca ccatggtttc cggagtcggg ggtagtggcg gtggccgtgg      480

cggtggccgt ggcggagaag aagaaccgtc gtcaagtcac actcctaata accgaagagg      540

aggagaacaa gctcaatcgt cgggaacgaa atctctcaga ccaagaagca acactgaatc      600

aatgagcaaa gcaattcaac agtacaccgt cgacgcaaga ctccacgccg ttttcgaaca      660

atccggcgaa tcagggaaat cattcgacta ctcacaatca ctcaaaacga cgacgtacgg      720

ttcctctgta cctgagcaac agatcacagc ttatctctct cgaatccagc gaggtggtta      780

cattcagcct ttcggatgta tgatcgccgt cgatgaatcc agtttccgga tcatcggtta      840

cagtgaaaac gccagagaaa tgttagggat tatgcctcaa tctgttccta ctcttgagaa      900

acctgagatt ctagctatgg gaactgatgt gagatctttg ttcacttctt cgagctcgat      960

tctactcgag cgtgctttcg ttgctcgaga gattaccttg ttaaatccgg tttggatcca     1020

ttccaagaat actggtaaac cgttttacgc cattcttcat aggattgatg ttggtgttgt     1080

tattgattta gagccagcta gaactgaaga tcctgcgctt tctattgctg gtgctgttca     1140

atcgcagaaa ctcgcggttc gtgcgatttc tcagttacag gctcttcctg gtggagatat     1200

taagcttttg tgtgacactg tcgtggaaag tgtgagggac ttgactggtt atgatcgtgt     1260

tatggtttat aagtttcatg aagatgagca tggagaagtt gtagctgaga gtaaacgaga     1320

tgatttagag ccttatattg gactgcatta tcctgctact gatattcctc aagcgtcaag     1380

gttcttgttt aagcagaacc gtgtccgaat gatagtagat tgcaatgcca cacctgttct     1440

tgtggtccag gacgataggc taactcagtc tatgtgcttg gttggttcta ctcttagggc     1500

tcctcatggt tgtcactctc agtatatggc taacatggga tctattgcgt ctttagcaat     1560

ggcggttata atcaatggaa atgaagatga tgggagcaat gtagctagtg gaagaagctc     1620

gatgaggctt tggggtttgg ttgtttgcca tcacacttct tctcgctgca taccgtttcc     1680

gctaaggtat gcttgtgagt ttttgatgca ggctttcggt ttacagttaa acatggaatt     1740

gcagttagct ttgcaaatgt cagagaaacg cgttttgaga acgcagacac tgttatgtga     1800

tatgcttctg cgtgactcgc ctgctggaat tgttacacag agtcccagta tcatggactt     1860

agtgaaatgt gacggtgcag catttcttta ccacgggaag tattacccgt tgggtgttgc     1920

tcctagtgaa gttcagataa aagatgttgt ggagtggttg cttgcgaatc atgcggattc     1980

aaccggatta agcactgata gtttaggcga tgcggggtat cccggtgcag ctgcgttagg     2040

ggatgctgtg tgcggtatgg cagttgcata tatcacaaaa agagactttc ttttttggtt     2100

tcgatctcac actgcgaaag aaatcaaatg gggaggcgct aagcatcatc cggaggataa     2160

agatgatggg caacgaatgc atcctcgttc gtcctttcag gcttttcttg aagttgttaa     2220

gagccggagt cagccatggg aaactgcgga aatggatgcg attcactcgc tccagcttat     2280

tctgagagac tcttttaaag aatctgaggc ggctatgaac tctaaagttg tggatggtgt     2340

ggttcagcca tgtagggata tggcggggga acaggggatt gatgagttag gtgcagttgc     2400

aagagagatg gttaggctca ttgagactgc aactgttcct atattcgctg tggatgccgg     2460

aggctgcatc aatggatgga acgctaagat tgcagagttg acaggtctct cagttgaaga     2520

agctatgggg aagtctctgg tttctgattt aatatacaaa gagaatgaag caactgtcaa     2580

taagcttctt tctcgtgctt tgagagggga cgaggaaaag aatgtggagg ttaagctgaa     2640

aactttcagc cccgaactac aagggaaagc agtttttgtg gttgtgaatg cttgttccag     2700

caaggactac ttgaacaaca ttgtcggcgt ttgttttgtt ggacaagacg ttactagtca     2760

gaaaatcgta atggataagt tcatcaacat acaaggagat tacaaggcta ttgtacatag     2820

cccaaaccct ctaatcccgc caatttttgc tgctgacgag aacacgtgct gcctggaatg     2880

gaacatggcg atggaaaagc ttacgggttg gtctcgcagt gaagtgattg ggaaaatgat     2940

tgtcggggaa gtgtttggga gctgttgcat gctaaagggt cctgatgctt taaccaagtt     3000

catgattgta ttgcataatg cgattggtgg ccaagatacg gataagttcc ctttcccatt     3060

ctttgaccgc aatgggaagt ttgttcaggc tctattgact gcaaacaagc gggttagcct     3120

cgagggaaag gttattgggg ctttctgttt cttgcaaatc ccgagcgaat tcgatagtgc     3180

tggtagtgct ggtagtgctg gttccgcgta cagccgcgcg cgtacgaaaa acaattacgg     3240

gtctaccatc gagggcctgc tcgatctccc ggacgacgac gcccccgaag aggcggggct     3300

ggcggctccg cgcctgtcct ttctccccgc gggacacacg cgcagactgt cgacggcccc     3360

cccgaccgat gtcagcctgg gggacgagct ccacttagac ggcgaggacg tggcgatggc     3420

gcatgccgac gcgctagacg atttcgatct ggacatgttg ggggacgggg attccccggg     3480

tccgggattt accccccacg actccgcccc ctacggcgct ctggatatgg ccgacttcga     3540

gtttgagcag atgtttaccg atgcccttgg aattgacgag tacggtgggc ccaagaaaaa     3600

gcggaaggtg tgatctagag tcgacctgca gcccaagctt cgatccagac atgataagat     3660

acattgatga gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg     3720

aaatttgtga tgctattgct ttatttgtaa ccattataag ctgcaataaa caagttaaca     3780

acaacaattg cattcatttt atgtttcagg ttcaggggga ggtgtgggag gttttttaaa     3840

gcaagtaaaa cctctacaaa tgtggtatgg ctgattatga tcctgcctcg cgcgtttcgg     3900

tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta     3960

agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg     4020

gggcgcagcc atgacccagt cacgtagcga tagcggagtg tatactggct taactatgcg     4080

gcatcagagc agattgtact gagagtgcac catatgtcgg gccgcgttgc tggcgttttt     4140

ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg     4200

aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc     4260

tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt     4320

ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa     4380

gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta     4440

tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa     4500

caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa     4560

ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt     4620

cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt     4680

ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat     4740

cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat     4800

gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc     4860

aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc     4920

acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta     4980

gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga     5040

cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg     5100

cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc     5160

tagagtaagt agttcgccag ttaatagtgc gcaacgttgt tgccattgct acaggcatcg     5220

tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc     5280

gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg     5340

ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt     5400

ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt     5460

cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca acacgggata     5520

ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc     5580

gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac     5640

ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa     5700

ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct     5760

tcctttttca atattattga agcatttatc agggttattg tctcatgagc ggatacatat     5820

ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc     5880

cacctgacgt ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca     5940

cgaggccctt tcgtcttcaa gaattggtcg atcgaccaat tctcatgttt gacagcttat     6000

catcgataag ct                                                         6012


<210> 119
<211> 5238
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 119
agcttctgtg gaatgtgtgt cagttagggt gtggaaagtc cccaggctcc ccagcaggca       60

gaagtatgca aagcatgcat ctcaattagt cagcaaccag gtgtggaaag tccccaggct      120

ccccagcagg cagaagtatg caaagcatgc atctcaatta gtcagcaacc atagtcccgc      180

ccctaactcc gcccatcccg cccctaactc cgcccagttc cgcccattct ccgccccatg      240

gctgactaat tttttttatt tatgcagagg ccgaggccgc ctcggcctct gagctattcc      300

agaagtagtg aggaggcttt tttggaggcc taggcttttg caaaaagctc cctcgaggaa      360

ctggaaaacc agaaagttaa ctggtaagtt tagtcttttt gtcttttatt tcaggtcccg      420

gatcgaattg cggccgccca ccatggtttc cggagtcggg ggtagtggcg gtggccgtgg      480

cggtggccgt ggcggagaag aagaaccgtc gtcaagtcac actcctaata accgaagagg      540

aggagaacaa gctcaatcgt cgggaacgaa atctctcaga ccaagaagca acactgaatc      600

aatgagcaaa gcaattcaac agtacaccgt cgacgcaaga ctccacgccg ttttcgaaca      660

atccggcgaa tcagggaaat cattcgacta ctcacaatca ctcaaaacga cgacgtacgg      720

ttcctctgta cctgagcaac agatcacagc ttatctctct cgaatccagc gaggtggtta      780

cattcagcct ttcggatgta tgatcgccgt cgatgaatcc agtttccgga tcatcggtta      840

cagtgaaaac gccagagaaa tgttagggat tatgcctcaa tctgttccta ctcttgagaa      900

acctgagatt ctagctatgg gaactgatgt gagatctttg ttcacttctt cgagctcgat      960

tctactcgag cgtgctttcg ttgctcgaga gattaccttg ttaaatccgg tttggatcca     1020

ttccaagaat actggtaaac cgttttacgc cattcttcat aggattgatg ttggtgttgt     1080

tattgattta gagccagcta gaactgaaga tcctgcgctt tctattgctg gtgctgttca     1140

atcgcagaaa ctcgcggttc gtgcgatttc tcagttacag gctcttcctg gtggagatat     1200

taagcttttg tgtgacactg tcgtggaaag tgtgagggac ttgactggtt atgatcgtgt     1260

tatggtttat aagtttcatg aagatgagca tggagaagtt gtagctgaga gtaaacgaga     1320

tgatttagag ccttatattg gactgcatta tcctgctact gatattcctc aagcgtcaag     1380

gttcttgttt aagcagaacc gtgtccgaat gatagtagat tgcaatgcca cacctgttct     1440

tgtggtccag gacgataggc taactcagtc tatgtgcttg gttggttcta ctcttagggc     1500

tcctcatggt tgtcactctc agtatatggc taacatggga tctattgcgt ctttagcaat     1560

ggcggttata atcaatggaa atgaagatga tgggagcaat gtagctagtg gaagaagctc     1620

gatgaggctt tggggtttgg ttgtttgcca tcacacttct tctcgctgca taccgtttcc     1680

gctaaggtat gcttgtgagt ttttgatgca ggctttcggt ttacagttaa acatggaatt     1740

gcagttagct ttgcaaatgt cagagaaacg cgttttgaga acgcagacac tgttatgtga     1800

tatgcttctg cgtgactcgc ctgctggaat tgttacacag agtcccagta tcatggactt     1860

agtgaaatgt gacggtgcag catttcttta ccacgggaag tattacccgt tgggtgttgc     1920

tcctagtgaa gttcagataa aagatgttgt ggagtggttg cttgcgaatc atgcggattc     1980

aaccggatta agcactgata gtttaggcga tgcggggtat cccggtgcag ctgcgttagg     2040

ggatgctgtg tgcggtatgg cagttgcata tatcacaaaa agagactttc ttttttggtt     2100

tcgatctcac actgcgaaag aaatcaaatg gggaggcgct aagcatcatc cggaggataa     2160

agatgatggg caacgaatgc atcctcgttc gtcctttcag gcttttcttg aagttgttaa     2220

gagccggagt cagccatggg aaactgcgga aatggatgcg attcactcgc tccagcttat     2280

tctgagagac tcttttaaag aatctgaggc ggctatgaac tctaaagttg tggatggtgt     2340

ggttcagcca tgtagggata tggcggggga acaggggatt gatgagttag gtgaattcga     2400

tagtgctggt agtgctggta gtgctggttc cgcgtacagc cgcgcgcgta cgaaaaacaa     2460

ttacgggtct accatcgagg gcctgctcga tctcccggac gacgacgccc ccgaagaggc     2520

ggggctggcg gctccgcgcc tgtcctttct ccccgcggga cacacgcgca gactgtcgac     2580

ggcccccccg accgatgtca gcctggggga cgagctccac ttagacggcg aggacgtggc     2640

gatggcgcat gccgacgcgc tagacgattt cgatctggac atgttggggg acggggattc     2700

cccgggtccg ggatttaccc cccacgactc cgccccctac ggcgctctgg atatggccga     2760

cttcgagttt gagcagatgt ttaccgatgc ccttggaatt gacgagtacg gtgggcccaa     2820

gaaaaagcgg aaggtgtgat ctagagtcga cctgcagccc aagcttcgat ccagacatga     2880

taagatacat tgatgagttt ggacaaacca caactagaat gcagtgaaaa aaatgcttta     2940

tttgtgaaat ttgtgatgct attgctttat ttgtaaccat tataagctgc aataaacaag     3000

ttaacaacaa caattgcatt cattttatgt ttcaggttca gggggaggtg tgggaggttt     3060

tttaaagcaa gtaaaacctc tacaaatgtg gtatggctga ttatgatcct gcctcgcgcg     3120

tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg     3180

tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg     3240

gtgtcggggc gcagccatga cccagtcacg tagcgatagc ggagtgtata ctggcttaac     3300

tatgcggcat cagagcagat tgtactgaga gtgcaccata tgtcgggccg cgttgctggc     3360

gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag     3420

gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt     3480

gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg     3540

aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg     3600

ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg     3660

taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac     3720

tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg     3780

gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt     3840

taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg     3900

tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc     3960

tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt     4020

ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt     4080

taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag     4140

tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt     4200

cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg caatgatacc     4260

gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag ccggaagggc     4320

cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta attgttgccg     4380

ggaagctaga gtaagtagtt cgccagttaa tagtgcgcaa cgttgttgcc attgctacag     4440

gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat     4500

caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc     4560

cgatcgttgt cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc     4620

ataattctct tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa     4680

ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaacac     4740

gggataatac cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt     4800

cggggcgaaa actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc     4860

gtgcacccaa ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa     4920

caggaaggca aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca     4980

tactcttcct ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat     5040

acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa     5100

aagtgccacc tgacgtctaa gaaaccatta ttatcatgac attaacctat aaaaataggc     5160

gtatcacgag gccctttcgt cttcaagaat tggtcgatcg accaattctc atgtttgaca     5220

gcttatcatc gataagct                                                   5238


<210> 120
<211> 304
<212> DNA
<213> Arabidopsis thaliana

<400> 120
gccgatgatg ttccttccta ctgattattg ttgcagactg agcgaccagg aatacatgga       60

actcgtcttc gagaacggac agatactcgc aaaaggccag aggtcaaatg ttagtctcca      120

taatcagcgg acgaaaagca tcatggatct gtatgaggcc gaatacaacg aagattttat      180

gaaaagtatt atccatggag ggggtggcgc tattaccaac ctgggagata cccaagtggt      240

cccacagtcc cacgtagcag ccgctcacga gaccaatatg ctggagtcca acaaacacgt      300

agac                                                                   304


<210> 121
<211> 100
<212> PRT
<213> Arabidopsis thaliana

<400> 121
Met Met Phe Leu Pro Thr Asp Tyr Cys Cys Arg Leu Ser Asp Gln Glu 
1               5                   10                  15      


Tyr Met Glu Leu Val Phe Glu Asn Gly Gln Ile Leu Ala Lys Gly Gln 
            20                  25                  30          


Arg Ser Asn Val Ser Leu His Asn Gln Arg Thr Lys Ser Ile Met Asp 
        35                  40                  45              


Leu Tyr Glu Ala Glu Tyr Asn Glu Asp Phe Met Lys Ser Ile Ile His 
    50                  55                  60                  


Gly Gly Gly Gly Ala Ile Thr Asn Leu Gly Asp Thr Gln Val Val Pro 
65                  70                  75                  80  


Gln Ser His Val Ala Ala Ala His Glu Thr Asn Met Leu Glu Ser Asn 
                85                  90                  95      


Lys His Val Asp 
            100 


<210> 122
<211> 2793
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 122
atgggtgctt caggtgtatc tggtgttggt ggttctggtg gtggaagagg tggaggtaga       60

ggaggtgaag aagaaccatc aagtagtcat acacctaaca atcgtagagg tggtgagcaa      120

gctcaatcat caggtacaaa atcattacgt ccaagaagta atactgaatc aatgtcaaaa      180

gcaattcaac aatacacagt agatgctaga ttacacgccg tattcgaaca atctggagaa      240

agtggtaaga gttttgatta ctcacaatca ttgaaaacaa ccacttatgg tagttcagtt      300

ccagaacaac aaatcactgc atatcttagt agaatacaac gtggtggtta cattcaacca      360

tttggttgta tgattgcagt tgatgaatct tcttttagaa tcattggtta ttcagaaaat      420

gcaagagaaa tgttgggtat catgccacaa tcagtaccaa ccttagaaaa accagaaatt      480

cttgcaatgg gtacagatgt tagaagtttg tttacatcat catcatcaat tcttttggag      540

agagcttttg ttgcacgtga aatcacttta cttaatccag tatggattca tagtaagaat      600

actggaaagc cattctatgc aattcttcat agaatagatg taggagttgt tattgatctt      660

gagccagcaa gaacagaaga tccagcatta tctattgctg gtgcagtaca atcacaaaaa      720

cttgctgtta gagcaattag tcaattacaa gccttgccag gtggtgatat aaaacttctt      780

tgtgatacag ttgttgaatc agttcgtgat cttaccggtt atgatagagt tatggtacac      840

aaattccatg aggatgaaca tggtgaagtt gttgcagaaa gtaaaagaga tgatcttgaa      900

ccatacattg gtttgcatta tccagctact gatattccac aagcatcaag atttcttttc      960

aaacaaaatc gtgttagaat gattgtagat tgtaatgcca ccccagtatt agttgttcaa     1020

gatgatagat tgacacaaag tatgtgttta gtaggttcaa cattaagagc acctcatgga     1080

tgtcattcac aatatatggc caatatgggt tcaatagcat cattagctat ggcagtaatc     1140

atcaatggaa atgaagatga tggttcaaat gttgcatcag gtagaagttc aatgcgttta     1200

tggggtttag tagtttgtca tcatacaagt tctcgttgta tcccatttcc tttacgttat     1260

gcatgtgaat ttcttatgca agcatttggt ttacaattga atatggaact tcaattagca     1320

ttacaaatga gtgaaaagag agttttacgt acacaaacat tgttatgcga tatgttattg     1380

agagattctc cagctggtat tgttactcaa tcaccatcta tcatggatct tgtaaagtgt     1440

gatggtgcag cattcttata ccacggaaag tactatccat taggtgttgc accatctgaa     1500

gttcaaatca aagatgttgt agaatggtta ttggctaatc acgcagattc tactggttta     1560

tcaactgatt ctcttggtga tgctggttat cctggtgccg cagccttagg agatgctgta     1620

tgtggtatgg ccgttgctta cattacaaaa agagatttct tgttttggtt tcgttctcat     1680

acagctaaag agatcaaatg gggtggtgca aaacatcatc cagaagataa ggatgatggt     1740

caaagaatgc atccaagatc atcatttcaa gcattcttag aagtagttaa gtcaagaagt     1800

caaccttggg aaacagcaga aatggatgca atacattcat tacaattgat acttcgtgat     1860

tcattcaaag aatcagaagc agcaatgaat agtaaagttg ttgatggtgt tgttcaacca     1920

tgtagagata tggccggtga acaaggtatt gatgaattag gtgctgtagc tagagaaatg     1980

gttagattga tagaaactgc cactgttcca atcttcgctg ttgatgctgg tggatgcata     2040

aacggttgga atgctaagat cgcagaattg accggtttgt cagttgaaga agctatgggt     2100

aaaagtttag tttcagattt gatctataag gaaaatgaag caaccgttaa caaattgtta     2160

tcaagagcat tgagaggaga tgaggaaaag aatgtagaag ttaagttaaa gacattttca     2220

ccagagttac aaggtaaagc agtttttgtt gtagttaatg cttgttcatc aaaagattac     2280

ttgaataaca ttgtaggtgt ttgttttgtt ggtcaagatg taacttcaca aaagattgtt     2340

atggataagt ttatcaatat ccaaggtgat tacaaagcta ttgttcattc tccaaatcca     2400

ttgattccac caatctttgc agctgatgag aatacatgtt gtttagaatg gaatatggca     2460

atggaaaagt taactggttg gtcacgttca gaagtaattg gtaagatgat tgttggagag     2520

gtttttggta gttgttgtat gcttaaaggt ccagatgctt taactaagtt tatgattgtt     2580

ttgcataatg caattggtgg tcaagataca gataagttcc cattcccttt cttcgataga     2640

aatggaaagt ttgttcaagc attacttact gctaacaaaa gagtatcatt agaaggtaaa     2700

gtaataggag ctttttgttt cttacaaatt ccttcaccag aattacaaca agctcttgca     2760

gtaggtggta gtcatcatca tcatcatcat taa                                  2793


<210> 123
<211> 930
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 123
Met Gly Ala Ser Gly Val Ser Gly Val Gly Gly Ser Gly Gly Gly Arg 
1               5                   10                  15      


Gly Gly Gly Arg Gly Gly Glu Glu Glu Pro Ser Ser Ser His Thr Pro 
            20                  25                  30          


Asn Asn Arg Arg Gly Gly Glu Gln Ala Gln Ser Ser Gly Thr Lys Ser 
        35                  40                  45              


Leu Arg Pro Arg Ser Asn Thr Glu Ser Met Ser Lys Ala Ile Gln Gln 
    50                  55                  60                  


Tyr Thr Val Asp Ala Arg Leu His Ala Val Phe Glu Gln Ser Gly Glu 
65                  70                  75                  80  


Ser Gly Lys Ser Phe Asp Tyr Ser Gln Ser Leu Lys Thr Thr Thr Tyr 
                85                  90                  95      


Gly Ser Ser Val Pro Glu Gln Gln Ile Thr Ala Tyr Leu Ser Arg Ile 
            100                 105                 110         


Gln Arg Gly Gly Tyr Ile Gln Pro Phe Gly Cys Met Ile Ala Val Asp 
        115                 120                 125             


Glu Ser Ser Phe Arg Ile Ile Gly Tyr Ser Glu Asn Ala Arg Glu Met 
    130                 135                 140                 


Leu Gly Ile Met Pro Gln Ser Val Pro Thr Leu Glu Lys Pro Glu Ile 
145                 150                 155                 160 


Leu Ala Met Gly Thr Asp Val Arg Ser Leu Phe Thr Ser Ser Ser Ser 
                165                 170                 175     


Ile Leu Leu Glu Arg Ala Phe Val Ala Arg Glu Ile Thr Leu Leu Asn 
            180                 185                 190         


Pro Val Trp Ile His Ser Lys Asn Thr Gly Lys Pro Phe Tyr Ala Ile 
        195                 200                 205             


Leu His Arg Ile Asp Val Gly Val Val Ile Asp Leu Glu Pro Ala Arg 
    210                 215                 220                 


Thr Glu Asp Pro Ala Leu Ser Ile Ala Gly Ala Val Gln Ser Gln Lys 
225                 230                 235                 240 


Leu Ala Val Arg Ala Ile Ser Gln Leu Gln Ala Leu Pro Gly Gly Asp 
                245                 250                 255     


Ile Lys Leu Leu Cys Asp Thr Val Val Glu Ser Val Arg Asp Leu Thr 
            260                 265                 270         


Gly Tyr Asp Arg Val Met Val Tyr Lys Phe His Glu Asp Glu His Gly 
        275                 280                 285             


Glu Val Val Ala Glu Ser Lys Arg Asp Asp Leu Glu Pro Tyr Ile Gly 
    290                 295                 300                 


Leu His Tyr Pro Ala Thr Asp Ile Pro Gln Ala Ser Arg Phe Leu Phe 
305                 310                 315                 320 


Lys Gln Asn Arg Val Arg Met Ile Val Asp Cys Asn Ala Thr Pro Val 
                325                 330                 335     


Leu Val Val Gln Asp Asp Arg Leu Thr Gln Ser Met Cys Leu Val Gly 
            340                 345                 350         


Ser Thr Leu Arg Ala Pro His Gly Cys His Ser Gln Tyr Met Ala Asn 
        355                 360                 365             


Met Gly Ser Ile Ala Ser Leu Ala Met Ala Val Ile Ile Asn Gly Asn 
    370                 375                 380                 


Glu Asp Asp Gly Ser Asn Val Ala Ser Gly Arg Ser Ser Met Arg Leu 
385                 390                 395                 400 


Trp Gly Leu Val Val Cys His His Thr Ser Ser Arg Cys Ile Pro Phe 
                405                 410                 415     


Pro Leu Arg Tyr Ala Cys Glu Phe Leu Met Gln Ala Phe Gly Leu Gln 
            420                 425                 430         


Leu Asn Met Glu Leu Gln Leu Ala Leu Gln Met Ser Glu Lys Arg Val 
        435                 440                 445             


Leu Arg Thr Gln Thr Leu Leu Cys Asp Met Leu Leu Arg Asp Ser Pro 
    450                 455                 460                 


Ala Gly Ile Val Thr Gln Ser Pro Ser Ile Met Asp Leu Val Lys Cys 
465                 470                 475                 480 


Asp Gly Ala Ala Phe Leu Tyr His Gly Lys Tyr Tyr Pro Leu Gly Val 
                485                 490                 495     


Ala Pro Ser Glu Val Gln Ile Lys Asp Val Val Glu Trp Leu Leu Ala 
            500                 505                 510         


Asn His Ala Asp Ser Thr Gly Leu Ser Thr Asp Ser Leu Gly Asp Ala 
        515                 520                 525             


Gly Tyr Pro Gly Ala Ala Ala Leu Gly Asp Ala Val Cys Gly Met Ala 
    530                 535                 540                 


Val Ala Tyr Ile Thr Lys Arg Asp Phe Leu Phe Trp Phe Arg Ser His 
545                 550                 555                 560 


Thr Ala Lys Glu Ile Lys Trp Gly Gly Ala Lys His His Pro Glu Asp 
                565                 570                 575     


Lys Asp Asp Gly Gln Arg Met His Pro Arg Ser Ser Phe Gln Ala Phe 
            580                 585                 590         


Leu Glu Val Val Lys Ser Arg Ser Gln Pro Trp Glu Thr Ala Glu Met 
        595                 600                 605             


Asp Ala Ile His Ser Leu Gln Leu Ile Leu Arg Asp Ser Phe Lys Glu 
    610                 615                 620                 


Ser Glu Ala Ala Met Asn Ser Lys Val Val Asp Gly Val Val Gln Pro 
625                 630                 635                 640 


Cys Arg Asp Met Ala Gly Glu Gln Gly Ile Asp Glu Leu Gly Ala Val 
                645                 650                 655     


Ala Arg Glu Met Val Arg Leu Ile Glu Thr Ala Thr Val Pro Ile Phe 
            660                 665                 670         


Ala Val Asp Ala Gly Gly Cys Ile Asn Gly Trp Asn Ala Lys Ile Ala 
        675                 680                 685             


Glu Leu Thr Gly Leu Ser Val Glu Glu Ala Met Gly Lys Ser Leu Val 
    690                 695                 700                 


Ser Asp Leu Ile Tyr Lys Glu Asn Glu Ala Thr Val Asn Lys Leu Leu 
705                 710                 715                 720 


Ser Arg Ala Leu Arg Gly Asp Glu Glu Lys Asn Val Glu Val Lys Leu 
                725                 730                 735     


Lys Thr Phe Ser Pro Glu Leu Gln Gly Lys Ala Val Phe Val Val Val 
            740                 745                 750         


Asn Ala Cys Ser Ser Lys Asp Tyr Leu Asn Asn Ile Val Gly Val Cys 
        755                 760                 765             


Phe Val Gly Gln Asp Val Thr Ser Gln Lys Ile Val Met Asp Lys Phe 
    770                 775                 780                 


Ile Asn Ile Gln Gly Asp Tyr Lys Ala Ile Val His Ser Pro Asn Pro 
785                 790                 795                 800 


Leu Ile Pro Pro Ile Phe Ala Ala Asp Glu Asn Thr Cys Cys Leu Glu 
                805                 810                 815     


Trp Asn Met Ala Met Glu Lys Leu Thr Gly Trp Ser Arg Ser Glu Val 
            820                 825                 830         


Ile Gly Lys Met Ile Val Gly Glu Val Phe Gly Ser Cys Cys Met Leu 
        835                 840                 845             


Lys Gly Pro Asp Ala Leu Thr Lys Phe Met Ile Val Leu His Asn Ala 
    850                 855                 860                 


Ile Gly Gly Gln Asp Thr Asp Lys Phe Pro Phe Pro Phe Phe Asp Arg 
865                 870                 875                 880 


Asn Gly Lys Phe Val Gln Ala Leu Leu Thr Ala Asn Lys Arg Val Ser 
                885                 890                 895     


Leu Glu Gly Lys Val Ile Gly Ala Phe Cys Phe Leu Gln Ile Pro Ser 
            900                 905                 910         


Pro Glu Leu Gln Gln Ala Leu Ala Val Gly Gly Ser His His His His 
        915                 920                 925             


His His 
    930 


<210> 124
<211> 9597
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 124
actcgagact agagctagat aaaaaaaatt tttatttatt tttatttatt ttgaattaaa       60

tagattacaa attaattaat cccatcaaat ctttaaaaaa aaatggttta aaaaaacttg      120

ggttggttaa ttattatttg aaaattttaa aacccaaatt aaaaaaaaaa aatgggattc      180

aaaaattttt tttttttttt tttttttttt tttttttttt tttttttttt cagattgcat      240

aaaaagattt tttttttttt tttttcttat ttcttaaaac aaataaatta aattaaaaaa      300

taaaaaatgg tatctggtgt tggtggttct ggtggtggaa gaggtggagg tagaggaggt      360

gaagaagaac catcaagtag tcatacacct aacaatcgta gaggtggtga gcaagctcaa      420

tcatcaggta caaaatcatt acgtccaaga agtaatactg aatcaatgtc aaaagcaatt      480

caacaataca cagtagatgc tagattacac gccgtattcg aacaatctgg agaaagtggt      540

aagagttttg attactcaca atcattgaaa acaaccactt atggtagttc agttccagaa      600

caacaaatca ctgcatatct tagtagaata caacgtggtg gttacattca accatttggt      660

tgtatgattg cagttgatga atcttctttt agaatcattg gttattcaga aaatgcaaga      720

gaaatgttgg gtatcatgcc acaatcagta ccaaccttag aaaaaccaga aattcttgca      780

atgggtacag atgttagaag tttgtttaca tcatcatcat caattctttt ggagagagct      840

tttgttgcac gtgaaatcac tttacttaat ccagtatgga ttcatagtaa gaatactgga      900

aagccattct atgcaattct tcatagaata gatgtaggag ttgttattga tcttgagcca      960

gcaagaacag aagatccagc attatctatt gctggtgcag tacaatcaca aaaacttgct     1020

gttagagcaa ttagtcaatt acaagccttg ccaggtggtg atataaaact tctttgtgat     1080

acagttgttg aatcagttcg tgatcttacc ggttatgata gagttatggt atacaaattc     1140

catgaggatg aacatggtga agttgttgca gaaagtaaaa gagatgatct tgaaccatac     1200

attggtttgc attatccagc tactgatatt ccacaagcat caagatttct tttcaaacaa     1260

aatcgtgtta gaatgattgt agattgtaat gccaccccag tattagttgt tcaagatgat     1320

agattgacac aaagtatgtg tttagtaggt tcaacattaa gagcacctca tggatgtcat     1380

tcacaatata tggccaatat gggttcaata gcatcattag ctatggcagt aatcatcaat     1440

ggaaatgaag atgatggttc aaatgttgca tcaggtagaa gttcaatgcg tttatggggt     1500

ttagtagttt gtcatcatac aagttctcgt tgtatcccat ttcctttacg ttatgcatgt     1560

gaatttctta tgcaagcatt tggtttacaa ttgaatatgg aacttcaatt agcattacaa     1620

atgagtgaaa agagagtttt acgtacacaa acattgttat gcgatatgtt attgagagat     1680

tctccagctg gtattgttac tcaatcacca tctatcatgg atcttgtaaa gtgtgatggt     1740

gcagcattct tataccacgg aaagtactat ccattaggtg ttgcaccatc tgaagttcaa     1800

atcaaagatg ttgtagaatg gttattggct aatcacgcag attctactgg tttatcaact     1860

gattctcttg gtgatgctgg ttatcctggt gccgcagcct taggagatgc tgtatgtggt     1920

atggccgttg cttacattac aaaaagagat ttcttgtttt ggtttcgttc tcatacagct     1980

aaagagatca aatggggtgg tgcaaaacat catccagaag ataaggatga tggtcaaaga     2040

atgcatccaa gatcatcatt tcaagcattc ttagaagtag ttaagtcaag aagtcaacct     2100

tgggaaacag cagaaatgga tgcaatacat tcattacaat tgatacttcg tgattcattc     2160

aaagaatcag aagcagcaat gaatagtaaa gttgttgatg gtgttgttca accatgtaga     2220

gatatggccg gtgaacaagg tattgatgaa ttaggtgctg tagctagaga aatggttaga     2280

ttgatagaaa ctgccactgt tccaatcttc gctgttgatg ctggtggatg cataaacggt     2340

tggaatgcta agatcgcaga attgaccggt ttgtcagttg aagaagctat gggtaaaagt     2400

ttagtttcag atttgatcta taaggaaaat gaagcaaccg ttaacaaatt gttatcaaga     2460

gcattgagag gagatgagga aaagaatgta gaagttaagt taaagacatt ttcaccagag     2520

ttacaaggta aagcagtttt tgttgtagtt aatgcttgtt catcaaaaga ttacttgaat     2580

aacattgtag gtgtttgttt tgttggtcaa gatgtaactt cacaaaagat tgttatggat     2640

aagtttatca atatccaagg tgattacaaa gctattgttc attctccaaa tccattgatt     2700

ccaccaatct ttgcagctga tgagaataca tgttgtttag aatggaatat ggcaatggaa     2760

aagttaactg gttggtcacg ttcagaagta attggtaaga tgattgttgg agaggttttt     2820

ggtagttgtt gtatgcttaa aggtccagat gctttaacta agtttatgat tgttttgcat     2880

aatgcaattg gtggtcaaga tacagataag ttcccattcc ctttcttcga tagaaatgga     2940

aagtttgttc aagcattact tactgctaac aaaagagtat cattagaagg taaagtaata     3000

ggagcttttt gtttcttaca aattccttca ccagaattac aacaagctct tgcagtaggt     3060

gcttcaggtc atcatcatca tcatcattaa attatttaat aaataataaa aaaacaaatt     3120

gttgtaataa tctaatattt tctttttttt ttaatttttt ttttttaaat cttaataatt     3180

attaagttat tttaattttt tttttttttt tttttttttt tttttttttt tttctatcaa     3240

aaaaatcaaa tatatttaaa aaatttatta tttacagata cattttgaat ggtgaagata     3300

aatatatgca ttagatgtaa aacagccaaa gagtatgaaa atcaaaaaga taaagcttat     3360

cgatttcgaa aaagtaaata gcaattatta caaaattcaa tccgaatcta cccaaataaa     3420

ttccaatgaa attgccgatt taaaaaagtt tattaaagaa gaagtcaata aaacttcttc     3480

caaaattgat ttctttttag tttcttcaac agatgccctt tcaaatccag aaaattattc     3540

tctcttagaa gtaaagtgta ttaattgtca ttctttgtgt caaggaaaaa atttatatat     3600

ttcatgtaca agagatggat gtcaaaacaa tatttgctat aattgtttag gaataaacat     3660

aaacatatat aatgttgtta ttaattctaa actttgccct ccatgtttca atgattcggt     3720

aatcaacaag aagtgtgcca tgtgtagtaa gaacggaact aaatgtaatt tgaaccaaga     3780

atgtaaactt catctttgtg cacagtgttc taaaaagtgt ctatacattc tgagagtcaa     3840

aactaattaa ataaaatata aacttaattt ctaaataaac tcatttaaaa atatttaaat     3900

aatatgaatt tataactgta attattgtat taaaaaatta tataattatt taatgttaaa     3960

aatgtattaa aataattata aaaaaatata acaaaaattt tcgtaaaaat aatttgtaaa     4020

aaagctatta aaaatattat gaaaaaaaaa ttaaaaaaat tattaaattg tttttgtaat     4080

taagctatta aaataattat aaaaaaaaaa tttttaaaat tttaaaaata ttttttgtaa     4140

aaaagtatta aaataattat gaaaaaaaaa ttttctaaaa aattaaaaaa aaaattaaaa     4200

tatattttat gttaaaaacg tattaaaata actattaaaa aaattatatt taaaaaagta     4260

ttaacttttt tttaggtgtg gttgtggggt ggggtttaat atattataat aaaaaattat     4320

tttttgttca tttattattt tcattgtata taatgtactc aacaacgtta ttattttttc     4380

tttttttttt tattgtatca aaatcttctg ttcttcaaaa tgatcagatt gaagtaaaat     4440

attttcaact tcttattgtt atgtatcaaa aagaaaactg tgttgaaaag tcaatgacag     4500

gcgccgtaat ttatgatgaa tgtaatattc atggaagagt tgaaacaaat agtactcatg     4560

cgctttttta tgatgacatt gaaacaaata attcaagatg taacaatttt cgtaatttaa     4620

caaacttaat taaacttaat gaatgtatta atgacgagtt tggagagtct attctttata     4680

aagaatataa tgaaactgat gatggttatt tgtttagagt ggaagacagc tttgttgaaa     4740

ttacttctct ttcaatggat tgtacaaaaa atagtaaaac aattattgaa aaattcaaca     4800

tttgttcaaa atttgaaaat gtatatcata ttacaaacat tacacaagag aaatccaata     4860

gatttacatg tacagatcca ttgtgccact attgtaagaa tgaaaacatt caaaacaatc     4920

ttgattttaa aacaacaaag tgtactccaa agtatggtgc atctgattct gaatttttat     4980

caacaattta caatccaaag ctcgatggct caaataacgg tatggaaaag tcagtaactc     5040

aagaaaaaaa catttcaaat aatttaaaaa ttaatatata tttaattttc tttttaatta     5100

tttttttaat taaataaagt tttattattt tttaagagta attattgctc ttttttcatt     5160

tgaaacacca gaagctaaac gtaattgttg ttgactgaaa ttttttattt tttttggggt     5220

aataggattt ccttttttat gaagattaat atctttgact cgtgaaacat tctttttaac     5280

ttttgttttt tctgttggtt tatcatttgt tttttcacta atttcaatac catcttgacg     5340

ttcattcata acttcatctt ttttttttcc tgtttctgta tcttcttcta tttttttttc     5400

tttatctttt tctttatctt cttcttgttc ttcctcttct ttttcttctt ctgatactgc     5460

aggtgtttct tcttcttctt cttccgatat tgtcggtttt tctacttctt cttcttgttc     5520

ttcttcttct tcttcttctt cttcttcttc ttcttcttct tcttcttctt ccggtaattt     5580

attaattata tttctttttt tatatgaatt acgtttggtt tgtgcagtaa tttccttaca     5640

tagagtgcag ctttcaagaa aaatttcaat ttcttcgttt gttgcataat aaccactgtc     5700

tttgatatga ttaaacattt ttgattttct taaatgcttt ccttctttaa tatgaaaatt     5760

atcgaattct aattcattaa gaacaataag ctcccctaat ttaaaaaatt agttaaaata     5820

aattaaaatg aacatgtata aagatggatt ttaccatttt ttgaaattct aaataacttt     5880

tcttcatctc caatcttttt gactgaaaaa cgatttttaa ttgaagttat tgttctgtga     5940

gtgttttgaa tcgcccattt ctctaaatca gtttgagata gtgttttata atctgaattg     6000

ttatacacaa cttttgctct attaaccaaa tatttaaaga tttcatcatc aactgaatat     6060

tttgacttta cgattcttgt ccaaaaaaca atttctacta ctatcatttt ttatttataa     6120

aataatttaa atacaaaaat gaattttttt ttttttaaaa aaaaaaaaat ttgaaaaaaa     6180

aaaaaaaaaa attttaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaat caaataaaaa     6240

gtaaaaaata aaaaccgaaa aacattcatt gtaatttcaa atgtcgaggc cggcagaggc     6300

ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt     6360

cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca     6420

ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa     6480

aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat     6540

cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc     6600

cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc     6660

gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt     6720

tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac     6780

cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg     6840

ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca     6900

gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc     6960

gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa     7020

accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa     7080

ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac     7140

tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta     7200

aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt     7260

taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata     7320

gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc     7380

agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac     7440

cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag     7500

tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac     7560

gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc     7620

agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg     7680

gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc     7740

atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct     7800

gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc     7860

tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc     7920

atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc     7980

agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc     8040

gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca     8100

cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt     8160

tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt     8220

ccgcgcacat ttccccgaaa agtgccacct gacgcgccct gtagcgggat ccattttatt     8280

taatatacta aataataaaa aagttaaaaa atgatcattg gataaatttt ttataattat     8340

aaataaagat aataattttt tttttaacaa aactaaaaat aaaaataata aaataattgt     8400

taaaataggt tttttttttt tttttttttt tttaataaat ggtatttatt aatttatttg     8460

ttgtgtgtgt tttttttttt ataatatttt tttttttagc attgaattaa gaagaaatca     8520

aattgattct agttcagaag aactcgtcaa gaaggcgata gaaggcgatg cgctgcgaat     8580

cgggagcggc gataccgtaa agcacgagga agcggtcagc ccattcgccg ccaagctctt     8640

cagcaatatc acgggtagcc aacgctatgt cctgatagcg gtccgccaca cccagccgtc     8700

cacagtcgat gaatccagaa aagcggccat tttccaccat gatattcggc aagcaggcat     8760

cgccatgggt cacgacgaga tcctcgccgt cgggcatgcg cgccttgagc ctggcgaaca     8820

gttcggctgg cgcgagcccc tgatgctctt cgtccagatc atcctgatcg acaagaccgg     8880

cttccatccg agtacgtgct cgctcgatgc gatgtttcgc ttggtggtcg aatgggcagg     8940

tagccggatc aagcgtatgc agccgccgca ttgcatcagc catgatggat actttctcgg     9000

caggagcaag gtgagatgac aggagatcct gccccggcac ttcgcccaat agcagccagt     9060

cccttcccgc ttcagtgaca acgtcgagca cagctgcgca aggaacgccc gtcgtggcca     9120

gccacgatag ccgcgctgcc tcgtcctgca gttcattcag ggcaccggac aggtcggtct     9180

tgacaaaaag aaccgggcgc ccctgcgctg acagccggaa cacggcggca tcagagcagc     9240

cgattgtctg ttgtgcccag tcatagccga atagcctctc cacccaagcg gccggagaac     9300

ctgcgtgcaa tccatcttgt tcaatcatgc gaaacgatcc agcttgaaca tcttcaccat     9360

ccattttgga tcttttatat tatatttatt tattgattat ttttttgaat taattaaaaa     9420

aaaaaaaaat ttcattttat aatctcagaa acctcaaaaa aaaaaaaata aaaaataaaa     9480

aatataaaaa aataaaaata aaatcccaat tttaaagcga aaaaccaccc atggtttgaa     9540

aatttcaatc aatttcaaat aactttactt aaaaaaaacc cattttttat ttaaaaa        9597


<210> 125
<211> 9597
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 125
actcgagact agagctagat aaaaaaaatt tttatttatt tttatttatt ttgaattaaa       60

tagattacaa attaattaat cccatcaaat ctttaaaaaa aaatggttta aaaaaacttg      120

ggttggttaa ttattatttg aaaattttaa aacccaaatt aaaaaaaaaa aatgggattc      180

aaaaattttt tttttttttt tttttttttt tttttttttt tttttttttt cagattgcat      240

aaaaagattt tttttttttt tttttcttat ttcttaaaac aaataaatta aattaaaaaa      300

taaaaaatgg tatctggtgt tggtggttct ggtggtggaa gaggtggagg tagaggaggt      360

gaagaagaac catcaagtag tcatacacct aacaatcgta gaggtggtga gcaagctcaa      420

tcatcaggta caaaatcatt acgtccaaga agtaatactg aatcaatgtc aaaagcaatt      480

caacaataca cagtagatgc tagattacac gccgtattcg aacaatctgg agaaagtggt      540

aagagttttg attactcaca atcattgaaa acaaccactt atggtagttc agttccagaa      600

caacaaatca ctgcatatct tagtagaata caacgtggtg gttacattca accatttggt      660

tgtatgattg cagttgatga atcttctttt agaatcattg gttattcaga aaatgcaaga      720

gaaatgttgg gtatcatgcc acaatcagta ccaaccttag aaaaaccaga aattcttgca      780

atgggtacag atgttagaag tttgtttaca tcatcatcat caattctttt ggagagagct      840

tttgttgcac gtgaaatcac tttacttaat ccagtatgga ttcatagtaa gaatactgga      900

aagccattct atgcaattct tcatagaata gatgtaggag ttgttattga tcttgagcca      960

gcaagaacag aagatccagc attatctatt gctggtgcag tacaatcaca aaaacttgct     1020

gttagagcaa ttagtcaatt acaagccttg ccaggtggtg atataaaact tctttgtgat     1080

acagttgttg aatcagttcg tgatcttacc ggttatgata gagttatggt acacaaattc     1140

catgaggatg aacatggtga agttgttgca gaaagtaaaa gagatgatct tgaaccatac     1200

attggtttgc attatccagc tactgatatt ccacaagcat caagatttct tttcaaacaa     1260

aatcgtgtta gaatgattgt agattgtaat gccaccccag tattagttgt tcaagatgat     1320

agattgacac aaagtatgtg tttagtaggt tcaacattaa gagcacctca tggatgtcat     1380

tcacaatata tggccaatat gggttcaata gcatcattag ctatggcagt aatcatcaat     1440

ggaaatgaag atgatggttc aaatgttgca tcaggtagaa gttcaatgcg tttatggggt     1500

ttagtagttt gtcatcatac aagttctcgt tgtatcccat ttcctttacg ttatgcatgt     1560

gaatttctta tgcaagcatt tggtttacaa ttgaatatgg aacttcaatt agcattacaa     1620

atgagtgaaa agagagtttt acgtacacaa acattgttat gcgatatgtt attgagagat     1680

tctccagctg gtattgttac tcaatcacca tctatcatgg atcttgtaaa gtgtgatggt     1740

gcagcattct tataccacgg aaagtactat ccattaggtg ttgcaccatc tgaagttcaa     1800

atcaaagatg ttgtagaatg gttattggct aatcacgcag attctactgg tttatcaact     1860

gattctcttg gtgatgctgg ttatcctggt gccgcagcct taggagatgc tgtatgtggt     1920

atggccgttg cttacattac aaaaagagat ttcttgtttt ggtttcgttc tcatacagct     1980

aaagagatca aatggggtgg tgcaaaacat catccagaag ataaggatga tggtcaaaga     2040

atgcatccaa gatcatcatt tcaagcattc ttagaagtag ttaagtcaag aagtcaacct     2100

tgggaaacag cagaaatgga tgcaatacat tcattacaat tgatacttcg tgattcattc     2160

aaagaatcag aagcagcaat gaatagtaaa gttgttgatg gtgttgttca accatgtaga     2220

gatatggccg gtgaacaagg tattgatgaa ttaggtgctg tagctagaga aatggttaga     2280

ttgatagaaa ctgccactgt tccaatcttc gctgttgatg ctggtggatg cataaacggt     2340

tggaatgcta agatcgcaga attgaccggt ttgtcagttg aagaagctat gggtaaaagt     2400

ttagtttcag atttgatcta taaggaaaat gaagcaaccg ttaacaaatt gttatcaaga     2460

gcattgagag gagatgagga aaagaatgta gaagttaagt taaagacatt ttcaccagag     2520

ttacaaggta aagcagtttt tgttgtagtt aatgcttgtt catcaaaaga ttacttgaat     2580

aacattgtag gtgtttgttt tgttggtcaa gatgtaactt cacaaaagat tgttatggat     2640

aagtttatca atatccaagg tgattacaaa gctattgttc attctccaaa tccattgatt     2700

ccaccaatct ttgcagctga tgagaataca tgttgtttag aatggaatat ggcaatggaa     2760

aagttaactg gttggtcacg ttcagaagta attggtaaga tgattgttgg agaggttttt     2820

ggtagttgtt gtatgcttaa aggtccagat gctttaacta agtttatgat tgttttgcat     2880

aatgcaattg gtggtcaaga tacagataag ttcccattcc ctttcttcga tagaaatgga     2940

aagtttgttc aagcattact tactgctaac aaaagagtat cattagaagg taaagtaata     3000

ggagcttttt gtttcttaca aattccttca ccagaattac aacaagctct tgcagtaggt     3060

gcttcaggtc atcatcatca tcatcattaa attatttaat aaataataaa aaaacaaatt     3120

gttgtaataa tctaatattt tctttttttt ttaatttttt ttttttaaat cttaataatt     3180

attaagttat tttaattttt tttttttttt tttttttttt tttttttttt tttctatcaa     3240

aaaaatcaaa tatatttaaa aaatttatta tttacagata cattttgaat ggtgaagata     3300

aatatatgca ttagatgtaa aacagccaaa gagtatgaaa atcaaaaaga taaagcttat     3360

cgatttcgaa aaagtaaata gcaattatta caaaattcaa tccgaatcta cccaaataaa     3420

ttccaatgaa attgccgatt taaaaaagtt tattaaagaa gaagtcaata aaacttcttc     3480

caaaattgat ttctttttag tttcttcaac agatgccctt tcaaatccag aaaattattc     3540

tctcttagaa gtaaagtgta ttaattgtca ttctttgtgt caaggaaaaa atttatatat     3600

ttcatgtaca agagatggat gtcaaaacaa tatttgctat aattgtttag gaataaacat     3660

aaacatatat aatgttgtta ttaattctaa actttgccct ccatgtttca atgattcggt     3720

aatcaacaag aagtgtgcca tgtgtagtaa gaacggaact aaatgtaatt tgaaccaaga     3780

atgtaaactt catctttgtg cacagtgttc taaaaagtgt ctatacattc tgagagtcaa     3840

aactaattaa ataaaatata aacttaattt ctaaataaac tcatttaaaa atatttaaat     3900

aatatgaatt tataactgta attattgtat taaaaaatta tataattatt taatgttaaa     3960

aatgtattaa aataattata aaaaaatata acaaaaattt tcgtaaaaat aatttgtaaa     4020

aaagctatta aaaatattat gaaaaaaaaa ttaaaaaaat tattaaattg tttttgtaat     4080

taagctatta aaataattat aaaaaaaaaa tttttaaaat tttaaaaata ttttttgtaa     4140

aaaagtatta aaataattat gaaaaaaaaa ttttctaaaa aattaaaaaa aaaattaaaa     4200

tatattttat gttaaaaacg tattaaaata actattaaaa aaattatatt taaaaaagta     4260

ttaacttttt tttaggtgtg gttgtggggt ggggtttaat atattataat aaaaaattat     4320

tttttgttca tttattattt tcattgtata taatgtactc aacaacgtta ttattttttc     4380

tttttttttt tattgtatca aaatcttctg ttcttcaaaa tgatcagatt gaagtaaaat     4440

attttcaact tcttattgtt atgtatcaaa aagaaaactg tgttgaaaag tcaatgacag     4500

gcgccgtaat ttatgatgaa tgtaatattc atggaagagt tgaaacaaat agtactcatg     4560

cgctttttta tgatgacatt gaaacaaata attcaagatg taacaatttt cgtaatttaa     4620

caaacttaat taaacttaat gaatgtatta atgacgagtt tggagagtct attctttata     4680

aagaatataa tgaaactgat gatggttatt tgtttagagt ggaagacagc tttgttgaaa     4740

ttacttctct ttcaatggat tgtacaaaaa atagtaaaac aattattgaa aaattcaaca     4800

tttgttcaaa atttgaaaat gtatatcata ttacaaacat tacacaagag aaatccaata     4860

gatttacatg tacagatcca ttgtgccact attgtaagaa tgaaaacatt caaaacaatc     4920

ttgattttaa aacaacaaag tgtactccaa agtatggtgc atctgattct gaatttttat     4980

caacaattta caatccaaag ctcgatggct caaataacgg tatggaaaag tcagtaactc     5040

aagaaaaaaa catttcaaat aatttaaaaa ttaatatata tttaattttc tttttaatta     5100

tttttttaat taaataaagt tttattattt tttaagagta attattgctc ttttttcatt     5160

tgaaacacca gaagctaaac gtaattgttg ttgactgaaa ttttttattt tttttggggt     5220

aataggattt ccttttttat gaagattaat atctttgact cgtgaaacat tctttttaac     5280

ttttgttttt tctgttggtt tatcatttgt tttttcacta atttcaatac catcttgacg     5340

ttcattcata acttcatctt ttttttttcc tgtttctgta tcttcttcta tttttttttc     5400

tttatctttt tctttatctt cttcttgttc ttcctcttct ttttcttctt ctgatactgc     5460

aggtgtttct tcttcttctt cttccgatat tgtcggtttt tctacttctt cttcttgttc     5520

ttcttcttct tcttcttctt cttcttcttc ttcttcttct tcttcttctt ccggtaattt     5580

attaattata tttctttttt tatatgaatt acgtttggtt tgtgcagtaa tttccttaca     5640

tagagtgcag ctttcaagaa aaatttcaat ttcttcgttt gttgcataat aaccactgtc     5700

tttgatatga ttaaacattt ttgattttct taaatgcttt ccttctttaa tatgaaaatt     5760

atcgaattct aattcattaa gaacaataag ctcccctaat ttaaaaaatt agttaaaata     5820

aattaaaatg aacatgtata aagatggatt ttaccatttt ttgaaattct aaataacttt     5880

tcttcatctc caatcttttt gactgaaaaa cgatttttaa ttgaagttat tgttctgtga     5940

gtgttttgaa tcgcccattt ctctaaatca gtttgagata gtgttttata atctgaattg     6000

ttatacacaa cttttgctct attaaccaaa tatttaaaga tttcatcatc aactgaatat     6060

tttgacttta cgattcttgt ccaaaaaaca atttctacta ctatcatttt ttatttataa     6120

aataatttaa atacaaaaat gaattttttt ttttttaaaa aaaaaaaaat ttgaaaaaaa     6180

aaaaaaaaaa attttaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaat caaataaaaa     6240

gtaaaaaata aaaaccgaaa aacattcatt gtaatttcaa atgtcgaggc cggcagaggc     6300

ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt     6360

cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca     6420

ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa     6480

aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat     6540

cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc     6600

cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc     6660

gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt     6720

tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac     6780

cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg     6840

ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca     6900

gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc     6960

gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa     7020

accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa     7080

ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac     7140

tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta     7200

aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt     7260

taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata     7320

gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc     7380

agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac     7440

cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag     7500

tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac     7560

gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc     7620

agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg     7680

gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc     7740

atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct     7800

gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc     7860

tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc     7920

atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc     7980

agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc     8040

gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca     8100

cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt     8160

tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt     8220

ccgcgcacat ttccccgaaa agtgccacct gacgcgccct gtagcgggat ccattttatt     8280

taatatacta aataataaaa aagttaaaaa atgatcattg gataaatttt ttataattat     8340

aaataaagat aataattttt tttttaacaa aactaaaaat aaaaataata aaataattgt     8400

taaaataggt tttttttttt tttttttttt tttaataaat ggtatttatt aatttatttg     8460

ttgtgtgtgt tttttttttt ataatatttt tttttttagc attgaattaa gaagaaatca     8520

aattgattct agttcagaag aactcgtcaa gaaggcgata gaaggcgatg cgctgcgaat     8580

cgggagcggc gataccgtaa agcacgagga agcggtcagc ccattcgccg ccaagctctt     8640

cagcaatatc acgggtagcc aacgctatgt cctgatagcg gtccgccaca cccagccgtc     8700

cacagtcgat gaatccagaa aagcggccat tttccaccat gatattcggc aagcaggcat     8760

cgccatgggt cacgacgaga tcctcgccgt cgggcatgcg cgccttgagc ctggcgaaca     8820

gttcggctgg cgcgagcccc tgatgctctt cgtccagatc atcctgatcg acaagaccgg     8880

cttccatccg agtacgtgct cgctcgatgc gatgtttcgc ttggtggtcg aatgggcagg     8940

tagccggatc aagcgtatgc agccgccgca ttgcatcagc catgatggat actttctcgg     9000

caggagcaag gtgagatgac aggagatcct gccccggcac ttcgcccaat agcagccagt     9060

cccttcccgc ttcagtgaca acgtcgagca cagctgcgca aggaacgccc gtcgtggcca     9120

gccacgatag ccgcgctgcc tcgtcctgca gttcattcag ggcaccggac aggtcggtct     9180

tgacaaaaag aaccgggcgc ccctgcgctg acagccggaa cacggcggca tcagagcagc     9240

cgattgtctg ttgtgcccag tcatagccga atagcctctc cacccaagcg gccggagaac     9300

ctgcgtgcaa tccatcttgt tcaatcatgc gaaacgatcc agcttgaaca tcttcaccat     9360

ccattttgga tcttttatat tatatttatt tattgattat ttttttgaat taattaaaaa     9420

aaaaaaaaat ttcattttat aatctcagaa acctcaaaaa aaaaaaaata aaaaataaaa     9480

aatataaaaa aataaaaata aaatcccaat tttaaagcga aaaaccaccc atggtttgaa     9540

aatttcaatc aatttcaaat aactttactt aaaaaaaacc cattttttat ttaaaaa        9597


<210> 126
<211> 1172
<212> PRT
<213> Arabidopsis thaliana

<400> 126
Met Val Ser Gly Val Gly Gly Ser Gly Gly Gly Arg Gly Gly Gly Arg 
1               5                   10                  15      


Gly Gly Glu Glu Glu Pro Ser Ser Ser His Thr Pro Asn Asn Arg Arg 
            20                  25                  30          


Gly Gly Glu Gln Ala Gln Ser Ser Gly Thr Lys Ser Leu Arg Pro Arg 
        35                  40                  45              


Ser Asn Thr Glu Ser Met Ser Lys Ala Ile Gln Gln Tyr Thr Val Asp 
    50                  55                  60                  


Ala Arg Leu His Ala Val Phe Glu Gln Ser Gly Glu Ser Gly Lys Ser 
65                  70                  75                  80  


Phe Asp Tyr Ser Gln Ser Leu Lys Thr Thr Thr Tyr Gly Ser Ser Val 
                85                  90                  95      


Pro Glu Gln Gln Ile Thr Ala Tyr Leu Ser Arg Ile Gln Arg Gly Gly 
            100                 105                 110         


Tyr Ile Gln Pro Phe Gly Cys Met Ile Ala Val Asp Glu Ser Ser Phe 
        115                 120                 125             


Arg Ile Ile Gly Tyr Ser Glu Asn Ala Arg Glu Met Leu Gly Ile Met 
    130                 135                 140                 


Pro Gln Ser Val Pro Thr Leu Glu Lys Pro Glu Ile Leu Ala Met Gly 
145                 150                 155                 160 


Thr Asp Val Arg Ser Leu Phe Thr Ser Ser Ser Ser Ile Leu Leu Glu 
                165                 170                 175     


Arg Ala Phe Val Ala Arg Glu Ile Thr Leu Leu Asn Pro Val Trp Ile 
            180                 185                 190         


His Ser Lys Asn Thr Gly Lys Pro Phe Tyr Ala Ile Leu His Arg Ile 
        195                 200                 205             


Asp Val Gly Val Val Ile Asp Leu Glu Pro Ala Arg Thr Glu Asp Pro 
    210                 215                 220                 


Ala Leu Ser Ile Ala Gly Ala Val Gln Ser Gln Lys Leu Ala Val Arg 
225                 230                 235                 240 


Ala Ile Ser Gln Leu Gln Ala Leu Pro Gly Gly Asp Ile Lys Leu Leu 
                245                 250                 255     


Cys Asp Thr Val Val Glu Ser Val Arg Asp Leu Thr Gly Tyr Asp Arg 
            260                 265                 270         


Val Met Val Tyr Lys Phe His Glu Asp Glu His Gly Glu Val Val Ala 
        275                 280                 285             


Glu Ser Lys Arg Asp Asp Leu Glu Pro Tyr Ile Gly Leu His Tyr Pro 
    290                 295                 300                 


Ala Thr Asp Ile Pro Gln Ala Ser Arg Phe Leu Phe Lys Gln Asn Arg 
305                 310                 315                 320 


Val Arg Met Ile Val Asp Cys Asn Ala Thr Pro Val Leu Val Val Gln 
                325                 330                 335     


Asp Asp Arg Leu Thr Gln Ser Met Cys Leu Val Gly Ser Thr Leu Arg 
            340                 345                 350         


Ala Pro His Gly Cys His Ser Gln Tyr Met Ala Asn Met Gly Ser Ile 
        355                 360                 365             


Ala Ser Leu Ala Met Ala Val Ile Ile Asn Gly Asn Glu Asp Asp Gly 
    370                 375                 380                 


Ser Asn Val Ala Ser Gly Arg Ser Ser Met Arg Leu Trp Gly Leu Val 
385                 390                 395                 400 


Val Cys His His Thr Ser Ser Arg Cys Ile Pro Phe Pro Leu Arg Tyr 
                405                 410                 415     


Ala Cys Glu Phe Leu Met Gln Ala Phe Gly Leu Gln Leu Asn Met Glu 
            420                 425                 430         


Leu Gln Leu Ala Leu Gln Met Ser Glu Lys Arg Val Leu Arg Thr Gln 
        435                 440                 445             


Thr Leu Leu Cys Asp Met Leu Leu Arg Asp Ser Pro Ala Gly Ile Val 
    450                 455                 460                 


Thr Gln Ser Pro Ser Ile Met Asp Leu Val Lys Cys Asp Gly Ala Ala 
465                 470                 475                 480 


Phe Leu Tyr His Gly Lys Tyr Tyr Pro Leu Gly Val Ala Pro Ser Glu 
                485                 490                 495     


Val Gln Ile Lys Asp Val Val Glu Trp Leu Leu Ala Asn His Ala Asp 
            500                 505                 510         


Ser Thr Gly Leu Ser Thr Asp Ser Leu Gly Asp Ala Gly Tyr Pro Gly 
        515                 520                 525             


Ala Ala Ala Leu Gly Asp Ala Val Cys Gly Met Ala Val Ala Tyr Ile 
    530                 535                 540                 


Thr Lys Arg Asp Phe Leu Phe Trp Phe Arg Ser His Thr Ala Lys Glu 
545                 550                 555                 560 


Ile Lys Trp Gly Gly Ala Lys His His Pro Glu Asp Lys Asp Asp Gly 
                565                 570                 575     


Gln Arg Met His Pro Arg Ser Ser Phe Gln Ala Phe Leu Glu Val Val 
            580                 585                 590         


Lys Ser Arg Ser Gln Pro Trp Glu Thr Ala Glu Met Asp Ala Ile His 
        595                 600                 605             


Ser Leu Gln Leu Ile Leu Arg Asp Ser Phe Lys Glu Ser Glu Ala Ala 
    610                 615                 620                 


Met Asn Ser Lys Val Val Asp Gly Val Val Gln Pro Cys Arg Asp Met 
625                 630                 635                 640 


Ala Gly Glu Gln Gly Ile Asp Glu Leu Gly Ala Val Ala Arg Glu Met 
                645                 650                 655     


Val Arg Leu Ile Glu Thr Ala Thr Val Pro Ile Phe Ala Val Asp Ala 
            660                 665                 670         


Gly Gly Cys Ile Asn Gly Trp Asn Ala Lys Ile Ala Glu Leu Thr Gly 
        675                 680                 685             


Leu Ser Val Glu Glu Ala Met Gly Lys Ser Leu Val Ser Asp Leu Ile 
    690                 695                 700                 


Tyr Lys Glu Asn Glu Ala Thr Val Asn Lys Leu Leu Ser Arg Ala Leu 
705                 710                 715                 720 


Arg Gly Asp Glu Glu Lys Asn Val Glu Val Lys Leu Lys Thr Phe Ser 
                725                 730                 735     


Pro Glu Leu Gln Gly Lys Ala Val Phe Val Val Val Asn Ala Cys Ser 
            740                 745                 750         


Ser Lys Asp Tyr Leu Asn Asn Ile Val Gly Val Cys Phe Val Gly Gln 
        755                 760                 765             


Asp Val Thr Ser Gln Lys Ile Val Met Asp Lys Phe Ile Asn Ile Gln 
    770                 775                 780                 


Gly Asp Tyr Lys Ala Ile Val His Ser Pro Asn Pro Leu Ile Pro Pro 
785                 790                 795                 800 


Ile Phe Ala Ala Asp Glu Asn Thr Cys Cys Leu Glu Trp Asn Met Ala 
                805                 810                 815     


Met Glu Lys Leu Thr Gly Trp Ser Arg Ser Glu Val Ile Gly Lys Met 
            820                 825                 830         


Ile Val Gly Glu Val Phe Gly Ser Cys Cys Met Leu Lys Gly Pro Asp 
        835                 840                 845             


Ala Leu Thr Lys Phe Met Ile Val Leu His Asn Ala Ile Gly Gly Gln 
    850                 855                 860                 


Asp Thr Asp Lys Phe Pro Phe Pro Phe Phe Asp Arg Asn Gly Lys Phe 
865                 870                 875                 880 


Val Gln Ala Leu Leu Thr Ala Asn Lys Arg Val Ser Leu Glu Gly Lys 
                885                 890                 895     


Val Ile Gly Ala Phe Cys Phe Leu Gln Ile Pro Ser Pro Glu Leu Gln 
            900                 905                 910         


Gln Ala Leu Ala Val Gln Arg Arg Gln Asp Thr Glu Cys Phe Thr Lys 
        915                 920                 925             


Ala Lys Glu Leu Ala Tyr Ile Cys Gln Val Ile Lys Asn Pro Leu Ser 
    930                 935                 940                 


Gly Met Arg Phe Ala Asn Ser Leu Leu Glu Ala Thr Asp Leu Asn Glu 
945                 950                 955                 960 


Asp Gln Lys Gln Leu Leu Glu Thr Ser Val Ser Cys Glu Lys Gln Ile 
                965                 970                 975     


Ser Arg Ile Val Gly Asp Met Asp Leu Glu Ser Ile Glu Asp Gly Ser 
            980                 985                 990         


Phe Val Leu Lys Arg Glu Glu Phe  Phe Leu Gly Ser Val  Ile Asn Ala 
        995                 1000                 1005             


Ile Val  Ser Gln Ala Met Phe  Leu Leu Arg Asp Arg  Gly Leu Gln 
    1010                 1015                 1020             


Leu Ile  Arg Asp Ile Pro Glu  Glu Ile Lys Ser Ile  Glu Val Phe 
    1025                 1030                 1035             


Gly Asp  Gln Ile Arg Ile Gln  Gln Leu Leu Ala Glu  Phe Leu Leu 
    1040                 1045                 1050             


Ser Ile  Ile Arg Tyr Ala Pro  Ser Gln Glu Trp Val  Glu Ile His 
    1055                 1060                 1065             


Leu Ser  Gln Leu Ser Lys Gln  Met Ala Asp Gly Phe  Ala Ala Ile 
    1070                 1075                 1080             


Arg Thr  Glu Phe Arg Met Ala  Cys Pro Gly Glu Gly  Leu Pro Pro 
    1085                 1090                 1095             


Glu Leu  Val Arg Asp Met Phe  His Ser Ser Arg Trp  Thr Ser Pro 
    1100                 1105                 1110             


Glu Gly  Leu Gly Leu Ser Val  Cys Arg Lys Ile Leu  Lys Leu Met 
    1115                 1120                 1125             


Asn Gly  Glu Val Gln Tyr Ile  Arg Glu Ser Glu Arg  Ser Tyr Phe 
    1130                 1135                 1140             


Leu Ile  Ile Leu Glu Leu Pro  Val Pro Arg Lys Arg  Pro Leu Ser 
    1145                 1150                 1155             


Thr Ala  Ser Gly Ser Gly Asp  Met Met Leu Met Met  Pro Tyr 
    1160                 1165                 1170         


<210> 127
<211> 3152
<212> DNA
<213> Homo sapiens

<400> 127
tgtaaacaac ttttggacac atctgggcag ttgctaaggg ctcttgccaa gcgtctagca       60

atacctgaac accttctatg gctgccccaa ggagagctgc aacctgtttg tgctgaagga      120

cacactaaag aagatgcaga agttctttgg actgccccag acaggtgatc ttgaccagaa      180

taccatcgag accatgcgga agccacgctg cggcaaccca gatgtggcca actacaactt      240

cttccctcgc aagcccaagt gggacaagaa ccagatcaca tacaggatca ttggctacac      300

acctgatctg gacccagaga cagtggatga tgcctttgct cgtgccttcc aagtctggag      360

cgatgtgacc ccactgcggt tttctcgaat ccatgatgga gaggcagaca tcatgatcaa      420

ctttggccgc tgggagcatg gcgatggata cccctttgac ggtaaggacg gactcctggc      480

tcatgccttc gccccaggca ctggtgttgg gggagactcc cattttgatg acgatgagct      540

atggaccttg ggagaaggcc aagtggtccg tgtgaagtat gggaacgccg atggggagta      600

ctgcaagttc cccttcttgt tcaatggcaa ggagtacaac agctgcactg ataccggccg      660

cagcgatggc ttcctctggt gctccaccac ctacaacttt gagaaggatg gcaagtacgg      720

cttctgtccc catgaagccc tgttcaccat gggcggcaac gctgaaggac agccctgcaa      780

gtttccattc cgcttccagg gcacatccta tgacagctgc accactgagg gccgcacgga      840

tggctaccgc tggtgcggca ccactgagga ctacgaccgc gacaagaagt atggcttctg      900

ccctgagacc gccatgtcca ctgttggtgg gaactcagaa ggtgccccct gtgtcttccc      960

cttcactttc ctgggcaaca aatatgagag ctgcaccagc gccggccgca gtgacggaaa     1020

gatgtggtgt gcgaccacag ccaactacga tgatgaccgc aagtggggct tctgccctga     1080

ccaagggtac agcctgttcc tcgtggcagc ccacgagttt ggccacgcca tggggctgga     1140

gcactcccaa gaccctgggg ccctgatggc acccatttac acctacacca agaacttccg     1200

tctgtcccag gatgacatca agggcattca ggagctctat ggggcctctc ctgacattga     1260

ccttggcacc ggccccaccc ccacgctggg ccctgtcact cctgagatct gcaaacagga     1320

cattgtattt gatggcatcg ctcagatccg tggtgagatc ttcttcttca aggaccggtt     1380

catttggcgg actgtgacgc cacgtgacaa gcccatgggg cccctgctgg tggccacatt     1440

ctggcctgag ctcccggaaa agattgatgc ggtatacgag gccccacagg aggagaaggc     1500

tgtgttcttt gcagggaatg aatactggat ctactcagcc agcaccctgg agcgagggta     1560

ccccaagcca ctgaccagcc tgggactgcc ccctgatgtc cagcgagtgg atgccgcctt     1620

taactggagc aaaaacaaga agacatacat ctttgctgga gacaaattct ggagatacaa     1680

tgaggtgaag aagaaaatgg atcctggctt ccccaagctc atcgcagatg cctggaatgc     1740

catccccgat aacctggatg ccgtcgtgga cctgcagggc ggcggtcaca gctacttctt     1800

caagggtgcc tattacctga agctggagaa ccaaagtctg aagagcgtga agtttggaag     1860

catcaaatcc gactggctag gctgctgagc tggccctggc tcccacaggc ccttcctctc     1920

cactgccttc gatacaccgg gcctggagaa ctagagaagg acccggaggg gcctggcagc     1980

cgtgccttca gctctacagc taatcagcat tctcactcct acctggtaat ttaagattcc     2040

agagagtggc tcctcccggt gcccaagaat agatgctgac tgtactcctc ccaggcgccc     2100

cttccccctc caatcccacc aaccctcaga gccaccccta aagagatact ttgatatttt     2160

caacgcagcc ctgctttggg ctgccctggt gctgccacac ttcaggctct tctcctttca     2220

caaccttctg tggctcacag aacccttgga gccaatggag actgtctcaa gagggcactg     2280

gtggcccgac agcctggcac agggcagtgg gacagggcat ggccaggtgg ccactccaga     2340

cccctggctt ttcactgctg gctgccttag aacctttctt acattagcag tttgctttgt     2400

atgcactttg tttttttctt tgggtcttgt tttttttttc cacttagaaa ttgcatttcc     2460

tgacagaagg actcaggttg tctgaagtca ctgcacagtg catctcagcc cacatagtga     2520

tggttcccct gttcactcta cttagcatgt ccctaccgag tctcttctcc actggatgga     2580

ggaaaaccaa gccgtggctt cccgctcagc cctccctgcc cctcccttca accattcccc     2640

atgggaaatg tcaacaagta tgaataaaga cacctactga gtggccgtgt ttgccatctg     2700

ttttagcaga gcctagacaa gggccacaga cccagccaga agcggaaact taaaaagtcc     2760

gaatctctgc tccctgcagg gcacaggtga tggtgtctgc tggaaaggtc agagcttcca     2820

aagtaaacag caagagaacc tcagggagag taagctctag tccctctgtc ctgtagaaag     2880

agccctgaag aatcagcaat tttgttgctt tattgtggca tctgttcgag gtttgcttcc     2940

tctttaagtc tgtttcttca ttagcaatca tatcagtttt aatgctacta ctaacaatga     3000

acagtaacaa taatatcccc ctcaattaat agagtgcttt ctatgtgcaa ggcacttttc     3060

acgtgtcacc tattttaacc tttccaacca cataaataaa aaaggccatt attagttgaa     3120

tcttattgat gaagagaaaa aaaaaaaaaa aa                                   3152


<210> 128
<211> 3159
<212> DNA
<213> Homo sapiens

<400> 128
agatgttgtc ttgtgagcgt gcgcgcgcct ggctggaggg gcactgagcc tggccgcagt       60

gttgccaata cctgaacacc ttctatggct gccccaagga gagctgcaac ctgtttgtgc      120

tgaaggacac actaaagaag atgcagaagt tctttggact gccccagaca ggtgatcttg      180

accagaatac catcgagacc atgcggaagc cacgctgcgg caacccagat gtggccaact      240

acaacttctt ccctcgcaag cccaagtggg acaagaacca gatcacatac aggatcattg      300

gctacacacc tgatctggac ccagagacag tggatgatgc ctttgctcgt gccttccaag      360

tctggagcga tgtgacccca ctgcggtttt ctcgaatcca tgatggagag gcagacatca      420

tgatcaactt tggccgctgg gagcatggcg atggataccc ctttgacggt aaggacggac      480

tcctggctca tgccttcgcc ccaggcactg gtgttggggg agactcccat tttgatgacg      540

atgagctatg gaccttggga gaaggccaag tggtccgtgt gaagtatggg aacgccgatg      600

gggagtactg caagttcccc ttcttgttca atggcaagga gtacaacagc tgcactgata      660

ccggccgcag cgatggcttc ctctggtgct ccaccaccta caactttgag aaggatggca      720

agtacggctt ctgtccccat gaagccctgt tcaccatggg cggcaacgct gaaggacagc      780

cctgcaagtt tccattccgc ttccagggca catcctatga cagctgcacc actgagggcc      840

gcacggatgg ctaccgctgg tgcggcacca ctgaggacta cgaccgcgac aagaagtatg      900

gcttctgccc tgagaccgcc atgtccactg ttggtgggaa ctcagaaggt gccccctgtg      960

tcttcccctt cactttcctg ggcaacaaat atgagagctg caccagcgcc ggccgcagtg     1020

acggaaagat gtggtgtgcg accacagcca actacgatga tgaccgcaag tggggcttct     1080

gccctgacca agggtacagc ctgttcctcg tggcagccca cgagtttggc cacgccatgg     1140

ggctggagca ctcccaagac cctggggccc tgatggcacc catttacacc tacaccaaga     1200

acttccgtct gtcccaggat gacatcaagg gcattcagga gctctatggg gcctctcctg     1260

acattgacct tggcaccggc cccaccccca cgctgggccc tgtcactcct gagatctgca     1320

aacaggacat tgtatttgat ggcatcgctc agatccgtgg tgagatcttc ttcttcaagg     1380

accggttcat ttggcggact gtgacgccac gtgacaagcc catggggccc ctgctggtgg     1440

ccacattctg gcctgagctc ccggaaaaga ttgatgcggt atacgaggcc ccacaggagg     1500

agaaggctgt gttctttgca gggaatgaat actggatcta ctcagccagc accctggagc     1560

gagggtaccc caagccactg accagcctgg gactgccccc tgatgtccag cgagtggatg     1620

ccgcctttaa ctggagcaaa aacaagaaga catacatctt tgctggagac aaattctgga     1680

gatacaatga ggtgaagaag aaaatggatc ctggcttccc caagctcatc gcagatgcct     1740

ggaatgccat ccccgataac ctggatgccg tcgtggacct gcagggcggc ggtcacagct     1800

acttcttcaa gggtgcctat tacctgaagc tggagaacca aagtctgaag agcgtgaagt     1860

ttggaagcat caaatccgac tggctaggct gctgagctgg ccctggctcc cacaggccct     1920

tcctctccac tgccttcgat acaccgggcc tggagaacta gagaaggacc cggaggggcc     1980

tggcagccgt gccttcagct ctacagctaa tcagcattct cactcctacc tggtaattta     2040

agattccaga gagtggctcc tcccggtgcc caagaataga tgctgactgt actcctccca     2100

ggcgcccctt ccccctccaa tcccaccaac cctcagagcc acccctaaag agatactttg     2160

atattttcaa cgcagccctg ctttgggctg ccctggtgct gccacacttc aggctcttct     2220

cctttcacaa ccttctgtgg ctcacagaac ccttggagcc aatggagact gtctcaagag     2280

ggcactggtg gcccgacagc ctggcacagg gcagtgggac agggcatggc caggtggcca     2340

ctccagaccc ctggcttttc actgctggct gccttagaac ctttcttaca ttagcagttt     2400

gctttgtatg cactttgttt ttttctttgg gtcttgtttt ttttttccac ttagaaattg     2460

catttcctga cagaaggact caggttgtct gaagtcactg cacagtgcat ctcagcccac     2520

atagtgatgg ttcccctgtt cactctactt agcatgtccc taccgagtct cttctccact     2580

ggatggagga aaaccaagcc gtggcttccc gctcagccct ccctgcccct cccttcaacc     2640

attccccatg ggaaatgtca acaagtatga ataaagacac ctactgagtg gccgtgtttg     2700

ccatctgttt tagcagagcc tagacaaggg ccacagaccc agccagaagc ggaaacttaa     2760

aaagtccgaa tctctgctcc ctgcagggca caggtgatgg tgtctgctgg aaaggtcaga     2820

gcttccaaag taaacagcaa gagaacctca gggagagtaa gctctagtcc ctctgtcctg     2880

tagaaagagc cctgaagaat cagcaatttt gttgctttat tgtggcatct gttcgaggtt     2940

tgcttcctct ttaagtctgt ttcttcatta gcaatcatat cagttttaat gctactacta     3000

acaatgaaca gtaacaataa tatccccctc aattaataga gtgctttcta tgtgcaaggc     3060

acttttcacg tgtcacctat tttaaccttt ccaaccacat aaataaaaaa ggccattatt     3120

agttgaatct tattgatgaa gagaaaaaaa aaaaaaaaa                            3159


<210> 129
<211> 3230
<212> DNA
<213> Homo sapiens

<400> 129
gtgcagggtg tcctagccaa gccggcgtcc ctcctagtag taccgctgct ctctaacctc       60

aggacgtcaa gggcctagag cgacagatgt ttcccagcag ggggttctga ggctgtgcgc      120

ccagatcgcg agagagcaat acctgaacac cttctatggc tgccccaagg agagctgcaa      180

cctgtttgtg ctgaaggaca cactaaagaa gatgcagaag ttctttggac tgccccagac      240

aggtgatctt gaccagaata ccatcgagac catgcggaag ccacgctgcg gcaacccaga      300

tgtggccaac tacaacttct tccctcgcaa gcccaagtgg gacaagaacc agatcacata      360

caggatcatt ggctacacac ctgatctgga cccagagaca gtggatgatg cctttgctcg      420

tgccttccaa gtctggagcg atgtgacccc actgcggttt tctcgaatcc atgatggaga      480

ggcagacatc atgatcaact ttggccgctg ggagcatggc gatggatacc cctttgacgg      540

taaggacgga ctcctggctc atgccttcgc cccaggcact ggtgttgggg gagactccca      600

ttttgatgac gatgagctat ggaccttggg agaaggccaa gtggtccgtg tgaagtatgg      660

gaacgccgat ggggagtact gcaagttccc cttcttgttc aatggcaagg agtacaacag      720

ctgcactgat accggccgca gcgatggctt cctctggtgc tccaccacct acaactttga      780

gaaggatggc aagtacggct tctgtcccca tgaagccctg ttcaccatgg gcggcaacgc      840

tgaaggacag ccctgcaagt ttccattccg cttccagggc acatcctatg acagctgcac      900

cactgagggc cgcacggatg gctaccgctg gtgcggcacc actgaggact acgaccgcga      960

caagaagtat ggcttctgcc ctgagaccgc catgtccact gttggtggga actcagaagg     1020

tgccccctgt gtcttcccct tcactttcct gggcaacaaa tatgagagct gcaccagcgc     1080

cggccgcagt gacggaaaga tgtggtgtgc gaccacagcc aactacgatg atgaccgcaa     1140

gtggggcttc tgccctgacc aagggtacag cctgttcctc gtggcagccc acgagtttgg     1200

ccacgccatg gggctggagc actcccaaga ccctggggcc ctgatggcac ccatttacac     1260

ctacaccaag aacttccgtc tgtcccagga tgacatcaag ggcattcagg agctctatgg     1320

ggcctctcct gacattgacc ttggcaccgg ccccaccccc acgctgggcc ctgtcactcc     1380

tgagatctgc aaacaggaca ttgtatttga tggcatcgct cagatccgtg gtgagatctt     1440

cttcttcaag gaccggttca tttggcggac tgtgacgcca cgtgacaagc ccatggggcc     1500

cctgctggtg gccacattct ggcctgagct cccggaaaag attgatgcgg tatacgaggc     1560

cccacaggag gagaaggctg tgttctttgc agggaatgaa tactggatct actcagccag     1620

caccctggag cgagggtacc ccaagccact gaccagcctg ggactgcccc ctgatgtcca     1680

gcgagtggat gccgccttta actggagcaa aaacaagaag acatacatct ttgctggaga     1740

caaattctgg agatacaatg aggtgaagaa gaaaatggat cctggcttcc ccaagctcat     1800

cgcagatgcc tggaatgcca tccccgataa cctggatgcc gtcgtggacc tgcagggcgg     1860

cggtcacagc tacttcttca agggtgccta ttacctgaag ctggagaacc aaagtctgaa     1920

gagcgtgaag tttggaagca tcaaatccga ctggctaggc tgctgagctg gccctggctc     1980

ccacaggccc ttcctctcca ctgccttcga tacaccgggc ctggagaact agagaaggac     2040

ccggaggggc ctggcagccg tgccttcagc tctacagcta atcagcattc tcactcctac     2100

ctggtaattt aagattccag agagtggctc ctcccggtgc ccaagaatag atgctgactg     2160

tactcctccc aggcgcccct tccccctcca atcccaccaa ccctcagagc cacccctaaa     2220

gagatacttt gatattttca acgcagccct gctttgggct gccctggtgc tgccacactt     2280

caggctcttc tcctttcaca accttctgtg gctcacagaa cccttggagc caatggagac     2340

tgtctcaaga gggcactggt ggcccgacag cctggcacag ggcagtggga cagggcatgg     2400

ccaggtggcc actccagacc cctggctttt cactgctggc tgccttagaa cctttcttac     2460

attagcagtt tgctttgtat gcactttgtt tttttctttg ggtcttgttt tttttttcca     2520

cttagaaatt gcatttcctg acagaaggac tcaggttgtc tgaagtcact gcacagtgca     2580

tctcagccca catagtgatg gttcccctgt tcactctact tagcatgtcc ctaccgagtc     2640

tcttctccac tggatggagg aaaaccaagc cgtggcttcc cgctcagccc tccctgcccc     2700

tcccttcaac cattccccat gggaaatgtc aacaagtatg aataaagaca cctactgagt     2760

ggccgtgttt gccatctgtt ttagcagagc ctagacaagg gccacagacc cagccagaag     2820

cggaaactta aaaagtccga atctctgctc cctgcagggc acaggtgatg gtgtctgctg     2880

gaaaggtcag agcttccaaa gtaaacagca agagaacctc agggagagta agctctagtc     2940

cctctgtcct gtagaaagag ccctgaagaa tcagcaattt tgttgcttta ttgtggcatc     3000

tgttcgaggt ttgcttcctc tttaagtctg tttcttcatt agcaatcata tcagttttaa     3060

tgctactact aacaatgaac agtaacaata atatccccct caattaatag agtgctttct     3120

atgtgcaagg cacttttcac gtgtcaccta ttttaacctt tccaaccaca taaataaaaa     3180

aggccattat tagttgaatc ttattgatga agagaaaaaa aaaaaaaaaa                3230


<210> 130
<211> 3416
<212> DNA
<213> Homo sapiens

<400> 130
aatgcatgcc tgccctcctg ggaatgaagc acagcaggtc tcagcctcat cttacccagc       60

cccccactca agatggaggt gcctggtttg aacacctctg acaaatggaa gtctgtgttg      120

tccagaggca atgcagtggg ggcttaagaa gataactctg gacttagacc gcttggcttc      180

aaatcaaaga gtgcatgaac caaccagctg gcctagtgat gatgttaggc aagtgacttc      240

tcagtttctt catctgcaaa ctgggaaatt tcctatctca gggttaaaag agaggtaatc      300

ttaggtgctt acctagcaca tgcaatacct gaacaccttc tatggctgcc ccaaggagag      360

ctgcaacctg tttgtgctga aggacacact aaagaagatg cagaagttct ttggactgcc      420

ccagacaggt gatcttgacc agaataccat cgagaccatg cggaagccac gctgcggcaa      480

cccagatgtg gccaactaca acttcttccc tcgcaagccc aagtgggaca agaaccagat      540

cacatacagg atcattggct acacacctga tctggaccca gagacagtgg atgatgcctt      600

tgctcgtgcc ttccaagtct ggagcgatgt gaccccactg cggttttctc gaatccatga      660

tggagaggca gacatcatga tcaactttgg ccgctgggag catggcgatg gatacccctt      720

tgacggtaag gacggactcc tggctcatgc cttcgcccca ggcactggtg ttgggggaga      780

ctcccatttt gatgacgatg agctatggac cttgggagaa ggccaagtgg tccgtgtgaa      840

gtatgggaac gccgatgggg agtactgcaa gttccccttc ttgttcaatg gcaaggagta      900

caacagctgc actgataccg gccgcagcga tggcttcctc tggtgctcca ccacctacaa      960

ctttgagaag gatggcaagt acggcttctg tccccatgaa gccctgttca ccatgggcgg     1020

caacgctgaa ggacagccct gcaagtttcc attccgcttc cagggcacat cctatgacag     1080

ctgcaccact gagggccgca cggatggcta ccgctggtgc ggcaccactg aggactacga     1140

ccgcgacaag aagtatggct tctgccctga gaccgccatg tccactgttg gtgggaactc     1200

agaaggtgcc ccctgtgtct tccccttcac tttcctgggc aacaaatatg agagctgcac     1260

cagcgccggc cgcagtgacg gaaagatgtg gtgtgcgacc acagccaact acgatgatga     1320

ccgcaagtgg ggcttctgcc ctgaccaagg gtacagcctg ttcctcgtgg cagcccacga     1380

gtttggccac gccatggggc tggagcactc ccaagaccct ggggccctga tggcacccat     1440

ttacacctac accaagaact tccgtctgtc ccaggatgac atcaagggca ttcaggagct     1500

ctatggggcc tctcctgaca ttgaccttgg caccggcccc acccccacgc tgggccctgt     1560

cactcctgag atctgcaaac aggacattgt atttgatggc atcgctcaga tccgtggtga     1620

gatcttcttc ttcaaggacc ggttcatttg gcggactgtg acgccacgtg acaagcccat     1680

ggggcccctg ctggtggcca cattctggcc tgagctcccg gaaaagattg atgcggtata     1740

cgaggcccca caggaggaga aggctgtgtt ctttgcaggg aatgaatact ggatctactc     1800

agccagcacc ctggagcgag ggtaccccaa gccactgacc agcctgggac tgccccctga     1860

tgtccagcga gtggatgccg cctttaactg gagcaaaaac aagaagacat acatctttgc     1920

tggagacaaa ttctggagat acaatgaggt gaagaagaaa atggatcctg gcttccccaa     1980

gctcatcgca gatgcctgga atgccatccc cgataacctg gatgccgtcg tggacctgca     2040

gggcggcggt cacagctact tcttcaaggg tgcctattac ctgaagctgg agaaccaaag     2100

tctgaagagc gtgaagtttg gaagcatcaa atccgactgg ctaggctgct gagctggccc     2160

tggctcccac aggcccttcc tctccactgc cttcgataca ccgggcctgg agaactagag     2220

aaggacccgg aggggcctgg cagccgtgcc ttcagctcta cagctaatca gcattctcac     2280

tcctacctgg taatttaaga ttccagagag tggctcctcc cggtgcccaa gaatagatgc     2340

tgactgtact cctcccaggc gccccttccc cctccaatcc caccaaccct cagagccacc     2400

cctaaagaga tactttgata ttttcaacgc agccctgctt tgggctgccc tggtgctgcc     2460

acacttcagg ctcttctcct ttcacaacct tctgtggctc acagaaccct tggagccaat     2520

ggagactgtc tcaagagggc actggtggcc cgacagcctg gcacagggca gtgggacagg     2580

gcatggccag gtggccactc cagacccctg gcttttcact gctggctgcc ttagaacctt     2640

tcttacatta gcagtttgct ttgtatgcac tttgtttttt tctttgggtc ttgttttttt     2700

tttccactta gaaattgcat ttcctgacag aaggactcag gttgtctgaa gtcactgcac     2760

agtgcatctc agcccacata gtgatggttc ccctgttcac tctacttagc atgtccctac     2820

cgagtctctt ctccactgga tggaggaaaa ccaagccgtg gcttcccgct cagccctccc     2880

tgcccctccc ttcaaccatt ccccatggga aatgtcaaca agtatgaata aagacaccta     2940

ctgagtggcc gtgtttgcca tctgttttag cagagcctag acaagggcca cagacccagc     3000

cagaagcgga aacttaaaaa gtccgaatct ctgctccctg cagggcacag gtgatggtgt     3060

ctgctggaaa ggtcagagct tccaaagtaa acagcaagag aacctcaggg agagtaagct     3120

ctagtccctc tgtcctgtag aaagagccct gaagaatcag caattttgtt gctttattgt     3180

ggcatctgtt cgaggtttgc ttcctcttta agtctgtttc ttcattagca atcatatcag     3240

ttttaatgct actactaaca atgaacagta acaataatat ccccctcaat taatagagtg     3300

ctttctatgt gcaaggcact tttcacgtgt cacctatttt aacctttcca accacataaa     3360

taaaaaaggc cattattagt tgaatcttat tgatgaagag aaaaaaaaaa aaaaaa         3416


<210> 131
<211> 3558
<212> DNA
<213> Homo sapiens

<400> 131
acatctggcg gctgccctcc cttgtttccg ctgcatccag acttcctcag gcggtggctg       60

gaggctgcgc atctggggct ttaaacatac aaagggattg ccaggacctg cggcggcggc      120

ggcggcggcg ggggctgggg cgcgggggcc ggaccatgag ccgctgagcc gggcaaaccc      180

caggccaccg agccagcgga ccctcggagc gcagccctgc gccgcggagc aggctccaac      240

caggcggcga ggcggccaca cgcaccgagc cagcgacccc cgggcgacgc gcggggccag      300

ggagcgctac gatggaggcg ctaatggccc ggggcgcgct cacgggtccc ctgagggcgc      360

tctgtctcct gggctgcctg ctgagccacg ccgccgccgc gccgtcgccc atcatcaagt      420

tccccggcga tgtcgccccc aaaacggaca aagagttggc agtgcaatac ctgaacacct      480

tctatggctg ccccaaggag agctgcaacc tgtttgtgct gaaggacaca ctaaagaaga      540

tgcagaagtt ctttggactg ccccagacag gtgatcttga ccagaatacc atcgagacca      600

tgcggaagcc acgctgcggc aacccagatg tggccaacta caacttcttc cctcgcaagc      660

ccaagtggga caagaaccag atcacataca ggatcattgg ctacacacct gatctggacc      720

cagagacagt ggatgatgcc tttgctcgtg ccttccaagt ctggagcgat gtgaccccac      780

tgcggttttc tcgaatccat gatggagagg cagacatcat gatcaacttt ggccgctggg      840

agcatggcga tggatacccc tttgacggta aggacggact cctggctcat gccttcgccc      900

caggcactgg tgttggggga gactcccatt ttgatgacga tgagctatgg accttgggag      960

aaggccaagt ggtccgtgtg aagtatggga acgccgatgg ggagtactgc aagttcccct     1020

tcttgttcaa tggcaaggag tacaacagct gcactgatac cggccgcagc gatggcttcc     1080

tctggtgctc caccacctac aactttgaga aggatggcaa gtacggcttc tgtccccatg     1140

aagccctgtt caccatgggc ggcaacgctg aaggacagcc ctgcaagttt ccattccgct     1200

tccagggcac atcctatgac agctgcacca ctgagggccg cacggatggc taccgctggt     1260

gcggcaccac tgaggactac gaccgcgaca agaagtatgg cttctgccct gagaccgcca     1320

tgtccactgt tggtgggaac tcagaaggtg ccccctgtgt cttccccttc actttcctgg     1380

gcaacaaata tgagagctgc accagcgccg gccgcagtga cggaaagatg tggtgtgcga     1440

ccacagccaa ctacgatgat gaccgcaagt ggggcttctg ccctgaccaa gggtacagcc     1500

tgttcctcgt ggcagcccac gagtttggcc acgccatggg gctggagcac tcccaagacc     1560

ctggggccct gatggcaccc atttacacct acaccaagaa cttccgtctg tcccaggatg     1620

acatcaaggg cattcaggag ctctatgggg cctctcctga cattgacctt ggcaccggcc     1680

ccacccccac gctgggccct gtcactcctg agatctgcaa acaggacatt gtatttgatg     1740

gcatcgctca gatccgtggt gagatcttct tcttcaagga ccggttcatt tggcggactg     1800

tgacgccacg tgacaagccc atggggcccc tgctggtggc cacattctgg cctgagctcc     1860

cggaaaagat tgatgcggta tacgaggccc cacaggagga gaaggctgtg ttctttgcag     1920

ggaatgaata ctggatctac tcagccagca ccctggagcg agggtacccc aagccactga     1980

ccagcctggg actgccccct gatgtccagc gagtggatgc cgcctttaac tggagcaaaa     2040

acaagaagac atacatcttt gctggagaca aattctggag atacaatgag gtgaagaaga     2100

aaatggatcc tggcttcccc aagctcatcg cagatgcctg gaatgccatc cccgataacc     2160

tggatgccgt cgtggacctg cagggcggcg gtcacagcta cttcttcaag ggtgcctatt     2220

acctgaagct ggagaaccaa agtctgaaga gcgtgaagtt tggaagcatc aaatccgact     2280

ggctaggctg ctgagctggc cctggctccc acaggccctt cctctccact gccttcgata     2340

caccgggcct ggagaactag agaaggaccc ggaggggcct ggcagccgtg ccttcagctc     2400

tacagctaat cagcattctc actcctacct ggtaatttaa gattccagag agtggctcct     2460

cccggtgccc aagaatagat gctgactgta ctcctcccag gcgccccttc cccctccaat     2520

cccaccaacc ctcagagcca cccctaaaga gatactttga tattttcaac gcagccctgc     2580

tttgggctgc cctggtgctg ccacacttca ggctcttctc ctttcacaac cttctgtggc     2640

tcacagaacc cttggagcca atggagactg tctcaagagg gcactggtgg cccgacagcc     2700

tggcacaggg cagtgggaca gggcatggcc aggtggccac tccagacccc tggcttttca     2760

ctgctggctg ccttagaacc tttcttacat tagcagtttg ctttgtatgc actttgtttt     2820

tttctttggg tcttgttttt tttttccact tagaaattgc atttcctgac agaaggactc     2880

aggttgtctg aagtcactgc acagtgcatc tcagcccaca tagtgatggt tcccctgttc     2940

actctactta gcatgtccct accgagtctc ttctccactg gatggaggaa aaccaagccg     3000

tggcttcccg ctcagccctc cctgcccctc ccttcaacca ttccccatgg gaaatgtcaa     3060

caagtatgaa taaagacacc tactgagtgg ccgtgtttgc catctgtttt agcagagcct     3120

agacaagggc cacagaccca gccagaagcg gaaacttaaa aagtccgaat ctctgctccc     3180

tgcagggcac aggtgatggt gtctgctgga aaggtcagag cttccaaagt aaacagcaag     3240

agaacctcag ggagagtaag ctctagtccc tctgtcctgt agaaagagcc ctgaagaatc     3300

agcaattttg ttgctttatt gtggcatctg ttcgaggttt gcttcctctt taagtctgtt     3360

tcttcattag caatcatatc agttttaatg ctactactaa caatgaacag taacaataat     3420

atccccctca attaatagag tgctttctat gtgcaaggca cttttcacgt gtcacctatt     3480

ttaacctttc caaccacata aataaaaaag gccattatta gttgaatctt attgatgaag     3540

agaaaaaaaa aaaaaaaa                                                   3558


<210> 132
<211> 2350
<212> DNA
<213> Bos taurus

<400> 132
ggcacgaggc gggctggggg ccgggccatg ctctgctgag ccgggcaaag ccgaggagac       60

cgaatagaat agcccctcgg agcgcagcgc cgcgcggggg agcaggcgcc agccaggcgg      120

cgacgcggcc acacgcaccg agcctgccac ccccgggcga cgcgcggggc ccgggagcgc      180

aatgaccgag gcgcgagtgt cccggggcgc gctggccgcc cttctgcggg cgctctgcgc      240

cctgggctgc ctgttgggcc gtgccgccgc cgcgccgtcg cccatcatca aatttcccgg      300

cgatgtcgcc cccaaaacgg acaaagagtt ggctgtgcaa tacctaaaca ccttctacgg      360

ctgccccaag gagagctgta acttgtttgt gctgaaggac accctgaaga agatgcagaa      420

gttcttcggg ttaccccaga caggtgaact ggaccagagc accattgaga ccatgcggaa      480

gccgcgctgt ggcaaccccg acgtggccaa ctacaacttc ttcccccgaa agcccaagtg      540

ggacaagaac cagatcacat acaggatcat tggctacaca cctgatctgg acccccagac      600

agtggatgat gccttcgctc gtgccttcca agtctggagc gatgtgactc cgctacggtt      660

ttctcggatc catgatggag aggctgacat catgatcaac tttggccgct gggagcatgg      720

agatgggtac ccttttgatg gcaaagacgg gctcctggct catgccttcg ccccgggccc      780

tggagttggg ggagattccc actttgatga cgatgagctg cggaccctgg gagaaggaca      840

agtggtccgt gtgaagtacg ggaatgctga cggggaatat tgcaagttcc ccttccggtt      900

caacggcaag gagtacacca gctgcacaga cacaggccgc agcgatggct tcctctggtg      960

ttccaccaca tacaactttg acaaggacgg caagtatggc ttctgccccc atgaagccct     1020

gttcaccatg ggcggcaacg ccgacggaca gccctgcaag ttcccgttcc gcttccaggg     1080

cacgtcttac gacagttgca ccacggaggg ccgcacggac ggctaccgct ggtgtggcac     1140

caccgaggac tacgaccgcg acaaggagta cggcttctgc ccggagaccg ccatgtccac     1200

tgtgggcggg aactcggaag gtgccccatg tgtcctcccc ttcaccttcc tgggcaacaa     1260

gcacgagagc tgcaccagcg ctggccgcag tgatgggaag ttgtggtgtg cgaccacctc     1320

caactacgat gatgaccgca agtggggctt ctgccccgac caagggtaca gcctgttcct     1380

ggtggcagcc catgagtttg gccatgcaat ggggctggag cactcacagg accctggagc     1440

cctgatggcg cccatttata cctacaccaa gaacttccgc ctgtcccatg atgacatcca     1500

gggcatccaa gaactctatg gggcctcccc tgacattgat actggcaccg gccccacccc     1560

aaccctgggc cccgtcactc ctgagctctg caaacaggac atcgtcttcg acggcatctc     1620

tcagatccgt ggggagatct tcttcttcaa ggaccgattc atctggcgaa cagtgacacc     1680

acgtgacaag cccacagggc ccctgctggt agccacattc tggcctgagc tgccggaaaa     1740

gatcgatgct gtgtacgaag acccacagga ggagaaggct gtgttctttg cagggaacga     1800

atactgggtc tattcagcca gcaccctgga gcgagggtac cccaagccac tgaccagcct     1860

ggggctcccc cctggtgtcc agaaggtgga tgctgccttt aactggagca agaacaagaa     1920

gacgtacatc ttcgccggag acaaattctg gagatacaat gaggtgaaga agaaaatgga     1980

tcctggcttc cccaagctca tcgccgatgc ctggaacgcc atccctgata acctggatgc     2040

tgtggtggac ctgcagggcg ggggtcacag ctacttcttc aagggcgcct attacctgaa     2100

gttggagaac caaagtctga agagcgtgaa gttcggaagc atcaaatccg attggctggg     2160

ctgctgagct ggctccgcct cccccagggc ctgcccctcc atcacctgct gcacaccagg     2220

gcctgagcac cagggaagga cccgggtggg cgtggcagcc ctcagttctg taattaatca     2280

gcattctcac ccccacctgg taatttaaga aaccctagag tggctctgcc ctgtgctcaa     2340

gtaaaggtga                                                            2350


<210> 133
<211> 1153
<212> DNA
<213> Homo sapiens

<400> 133
gaaaacacca aatcaaccat aggtccaaga acaattgtct ctggacggca gctatgcgac       60

tcaccgtgct gtgtgctgtg tgcctgctgc ctggcagcct ggccctgccg ctgcctcagg      120

aggcgggagg catgagtgag ctacagtggg aacaggctca ggactatctc aagagatttt      180

atctctatga ctcagaaaca aaaaatgcca acagtttaga agccaaactc aaggagatgc      240

aaaaattctt tggcctacct ataactggaa tgttaaactc ccgcgtcata gaaataatgc      300

agaagcccag atgtggagtg ccagatgttg cagaatactc actatttcca aatagcccaa      360

aatggacttc caaagtggtc acctacagga tcgtatcata tactcgagac ttaccgcata      420

ttacagtgga tcgattagtg tcaaaggctt taaacatgtg gggcaaagag atccccctgc      480

atttcaggaa agttgtatgg ggaactgctg acatcatgat tggctttgcg cgaggagctc      540

atggggactc ctacccattt gatgggccag gaaacacgct ggctcatgcc tttgcgcctg      600

ggacaggtct cggaggagat gctcacttcg atgaggatga acgctggacg gatggtagca      660

gtctagggat taacttcctg tatgctgcaa ctcatgaact tggccattct ttgggtatgg      720

gacattcctc tgatcctaat gcagtgatgt atccaaccta tggaaatgga gatccccaaa      780

attttaaact ttcccaggat gatattaaag gcattcagaa actatatgga aagagaagta      840

attcaagaaa gaaatagaaa cttcaggcag aacatccatt cattcattca ttggattgta      900

tatcattgtt gcacaatcag aattgataag cactgttcct ccactccatt tagcaattat      960

gtcacccttt tttattgcag ttggtttttg aatgtctttc actcctttta aggataaact     1020

cctttatggt gtgactgtgt cttattcatc tatacttgca gtgggtagat gtcaataaat     1080

gttacataca caaataaata aaatgtttat tccatggtaa atttaaaaaa aaaaaaaaaa     1140

aaaaaaaaaa aaa                                                        1153


<210> 134
<211> 2350
<212> DNA
<213> Bos taurus

<400> 134
ctcaccatga gccccctgca gcccttggtc ctggcgctcc tggtgctggc ttgctgctct       60

gctgtcccca gacgacgcca gcccaccgtt gtggtctttc caggagaacc acgaaccaac      120

ctcaccaaca ggcagctggc agaggaatac ctgtaccgct atggctacac tcctggggca      180

gagctgagcg aggacggtca gtccctgcag cgagctctgc tgcgcttcca gcggcgcctg      240

tccctgcccg agactggcga gctggacagc accaccctga acgccatgcg agccccgcgc      300

tgcggcgtcc cagacgtggg cagattccag acctttgagg gcgaactcaa gtggcaccac      360

cacaacatca cctactggat ccaaaattac tcggaagacc tgccgcgcgc cgtgatcgac      420

gacgcctttg cccgcgcttt cgcgctctgg agcgctgtga cgccgctcac cttcactcga      480

gtgtacggcc ccgaagctga cattgtcatc cagtttggtg ttagagagca cggagatggg      540

tatcccttcg atgggaagaa cgggctcctg gcacacgcct ttccgcctgg caaaggcatt      600

cagggagatg cccacttcga cgatgaagag ttgtggtctc tgggcaaagg cgttgtgatc      660

ccgacctact tcggaaacgc gaagggcgcc gcctgccact tccccttcac ctttgagggt      720

cgctcctact ccgcctgcac cacggacggc cgttccgacg acatgctctg gtgcagcacc      780

accgccgact acgacgccga ccgccagttc ggcttctgcc ccagcgagag actctacacc      840

caggacggca atgcggacgg caagccctgc gtcttcccgt tcaccttcca gggccgcacc      900

tactccgcct gtacctccga tggtcgctcc gacggctacc gctggtgcgc caccaccgcc      960

aactacgacc aggacaagct ctacggcttc tgcccgaccc gagtcgatgc aacggtgacc     1020

gggggcaacg cggcggggga gctgtgcgtc ttccccttca ccttcctggg caaggaatac     1080

tcggcctgca ccagagaggg tcgcaatgat gggcacctct ggtgcgccac cacctccaac     1140

ttcgacaaag acaagaagtg gggcttctgc ccggatcaag gatacagcct gttccttgtg     1200

gccgcacacg agtttggcca cgcgctgggc ttagatcaca cctccgtgcc agaggcgctc     1260

atgtacccca tgtacagatt cacagaggag caccccctgc atagggacga tgttcagggc     1320

atccagcatc tgtatggtcc tcgccctgag cctgaaccac ggcctccgac cactaccacc     1380

actaccacca ccgaacccca gcccaccgct ccccccacgg tctgcgtcac ggggcctccc     1440

accgcccgcc cctcagaggg tcccactact ggccccacag ggcccccggc agctggccct     1500

acgggtcctc ccacggctgg cccttctgcg gccccgacgg agtccccgga tccagcggag     1560

gacgtctgca acgtggacat cttcgacgcc atcgcggaga ttaggaaccg cttgcatttc     1620

ttcaaggctg ggaagtactg gagactttct gagggagggg gccgccgggt gcagggtccc     1680

ttccttgtca agagcaagtg gcctgcgctg ccccgcaagc tggactccgc cttcgaggat     1740

ccgctcacca agaagatttt cttcttctct gggcgccaag tatgggtgta caccggcgcg     1800

tcgttgctag gcccgaggcg tctggacaag ttgggcctgg gcccggaagt ggcccaggtc     1860

accggggccc tcccgcgccc tgagggtaag gtgctgctgt tcagcgggca gagcttctgg     1920

aggttcgacg tgaagacaca gaaggtggat ccccagagcg tcacccccgt ggaccagatg     1980

ttccccgggg tgcccattag cacgcacgac atctttcagt accaagagaa agcttacttc     2040

tgccaggatc acttctactg gcgcgtgagt tcccagaatg aggtgaatca ggtggactat     2100

gtgggctacg tgaccttcga cctcctgaag tgccctgagg actagggctc ccaagcctgc     2160

ttcagcactg cagcgggggc cccctggggg accctgccaa tagggaatga gccagtctgc     2220

cggatcccaa ctagtggatc tgttctgaag gacgaggagg aggggaggtg ggctgggccc     2280

tctcttccca ccttcctttc ttattagaat gtatttaata aatgtggatt ctttaacctt     2340

aaaaaaaaaa                                                            2350


<210> 135
<211> 2124
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 135
atgagcctct ggcagcccct ggtcctggtg ctcctggtgc tgggctgctg ctttgctgcc       60

cccagacagc gccagtccac ccttgtgctc ttccctggag acctgagaac caatctcacc      120

gacaggcagc tggcagagga atacctgtac cgctatggtt acactcgggt ggcagagatg      180

cgtggagagt cgaaatctct ggggcctgcg ctgctgcttc tccagaagca actgtccctg      240

cccgagaccg gtgagctgga tagcgccacg ctgaaggcca tgcgaacccc acggtgcggg      300

gtcccagacc tgggcagatt ccaaaccttt gagggcgacc tcaagtggca ccaccacaac      360

atcacctatt ggatccaaaa ctactcggaa gacttgccgc gggcggtgat tgacgacgcc      420

tttgcccgcg ccttcgcact gtggagcgcg gtgacgccgc tcaccttcac tcgcgtgtac      480

agccgggacg cagacatcgt catccagttt ggtgtcgcgg agcacggaga cgggtatccc      540

ttcgacggga aggacgggct cctggcacac gcctttcctc ctggccccgg cattcaggga      600

gacgcccatt tcgacgatga cgagttgtgg tccctgggca agggcgtcgt ggttccaact      660

cggtttggaa acgcagatgg cgcggcctgc cacttcccct tcatcttcga gggccgctcc      720

tactctgcct gcaccaccga cggtcgctcc gacggcttgc cctggtgcag taccacggcc      780

aactacgaca ccgacgaccg gtttggcttc tgccccagcg agagactcta cacccgggac      840

ggcaatgctg atgggaaacc ctgccagttt ccattcatct tccaaggcca atcctactcc      900

gcctgcacca cggacggtcg ctccgacggc taccgctggt gcgccaccac cgccaactac      960

gaccgggaca agctcttcgg cttctgcccg acccgagctg actcgacggt gatggggggc     1020

aactcggcgg gggagctgtg cgtcttcccc ttcactttcc tgggtaagga gtactcgacc     1080

tgtaccagcg agggccgcgg agatgggcgc ctctggtgcg ctaccacctc gaactttgac     1140

agcgacaaga agtggggctt ctgcccggac caaggataca gtttgttcct cgtggcggcg     1200

catgagttcg gccacgcgct gggcttagat cattcctcag tgccggaggc gctcatgtac     1260

cctatgtacc gcttcactga ggggcccccc ttgcataagg acgacgtgaa tggcatccgg     1320

cacctctatg gtcctcgccc tgaacctgag ccacggcctc caaccaccac cacaccgcag     1380

cccacggctc ccccgacggt ctgccccacc ggacccccca ctgtccaccc ctcagagcga     1440

cccacagctg gccccacagg tcccccctca gctggcccca caggtccccc cactgctggc     1500

ccttctacgg ccactactgt gcctttgagt ccggtggacg atgcctgcaa cgtgaacatc     1560

ttcgacgcca tcgcggagat tgggaaccag ctgtatttgt tcaaggatgg gaagtactgg     1620

cgattctctg agggcagggg gagccggccg cagggcccct tccttatcgc cgacaagtgg     1680

cccgcgctgc cccgcaagct ggactcggtc tttgaggagc cgctctccaa gaagcttttc     1740

ttcttctctg ggcgccaggt gtgggtgtac acaggcgcgt cggtgctggg cccgaggcgt     1800

ctggacaagc tgggcctggg agccgacgtg gcccaggtga ccggggccct ccggagtggc     1860

agggggaaga tgctgctgtt cagcgggcgg cgcctctgga ggttcgacgt gaaggcgcag     1920

atggtggatc cccggagcgc cagcgaggtg gaccggatgt tccccggggt gcctttggac     1980

acgcacgacg tcttccagta ccgagagaaa gcctatttct gccaggaccg cttctactgg     2040

cgcgtgagtt cccggagtga gttgaaccag gtggaccaag tgggctacgt gacctatgac     2100

atcctgcagt gccctgagga ctag                                            2124


<210> 136
<211> 1848
<212> DNA
<213> Arabidopsis thaliana

<400> 136
atgcatcatt ttgtccctga cttcgatacc gatgatgatt atgtcaacaa ccataattct       60

tctttgaatc atcttcctag aaaatccatt actactatgg gtgaagatga tgatcttatg      120

gagcttttat ggcagaacgg tcaagttgtt gttcaaaacc agagacttca caccaagaaa      180

ccttcttctt ctccaccgaa gcttcttcct tctatggatc ctcagcagca accttcttca      240

gatcagaatc tttttattca agaagatgaa atgacttctt ggcttcatta tcctctccgt      300

gacgatgatt tctgctcaga tcttctcttc tccgccgcac ctactgcgac ggctaccgcg      360

acggtgagtc aagtcaccgc cgcgagaccg ccagtatctt cgacgaatga gtcgaggccg      420

ccggtgagga acttcatgaa tttctcgagg ctgagagggg attttaataa cggtagaggt      480

ggtgaatctg gaccgttgct ttcgaaggcg gttgtgagag aatctacgca ggtaagtcct      540

agcgcaacac cgtcggcggc ggcgagtgaa tccggtttaa cacggcggac ggatggtact      600

gacagttccg ccgtagctgg aggcggcgcg tataatcgga agggaaaagc agtggctatg      660

acggcgccgg cgatcgagat aaccggtaca tcgtcatctg tagtgtcaaa gagcgaaatc      720

gaaccggaga agacgaacgt cgatgatagg aaacgaaaag agagagaagc caccactact      780

gatgaaactg aatcccgtag cgaggaaaca aaacaagcac gtgtatcaac aacatctacc      840

aagagatctc gtgctgctga agttcataat ctctctgaaa gaaaacggag agataggatc      900

aatgagagaa tgaaagcttt gcaagaactt atacctcgct gcaacaagtc agataaagct      960

tcgatgctag atgaagctat tgagtacatg aaatctcttc agcttcaaat acagatgatg     1020

tcaatgggat gtggaatgat gccaatgatg tatccgggca tgcaacagta catgcctcat     1080

atggcgatgg gtatgggtat gaaccagcct attcctcctc cttccttcat gccattcccc     1140

aacatgttag ccgctcaaag acctttgcct acacaaactc acatggccgg gtcaggaccg     1200

caataccctg ttcatgcttc tgacccgtca agagtctttg taccgaacca gcagtatgat     1260

ccaacctcgg gccagcctca gtatccagct ggttacacgg atccatatca gcagttccgc     1320

ggtctccacc cgacccaacc acctcagttt cagaatcaag caacatcgta cccaagttcg     1380

agcagggtga gtagtagtaa ggaatctgag gatcacggaa accacacaac aggttaataa     1440

tgtccatgga gcaacaagaa gatctgtttt cacaagcaaa cacaatttgt tatccgaccc     1500

gacccaacca cctcagtttc agaatcaagc aacatcgtat ccaagttcga gcagggtgag     1560

tagtagtaag gaatctgagg atcacggaaa ccacacaaca ggttaataat gtccatggag     1620

caacaagaag atctgttttc acaagcaaac acaattttga gaaattgaca gagagaccta     1680

acatgtatat atatcgccat ctgtttcttg tttttctttg gtttgttttg tcctctcttc     1740

tcaggttgta tacttagaga gcggtacatg taatgatcca gagatctagg aatcaataca     1800

tagaggttgc agagtcataa aaaaaaaaaa aaaaaaaaaa aaaaaaaa                  1848


<210> 137
<211> 2348
<212> DNA
<213> Arabidopsis thaliana

<400> 137
gtcaagttaa agataatttt ggtatatatg agaaaggtat cgacaaaaac cataacgcta       60

tagatgattg tgatttgaca aaaacaccct caaatcattg ttttcagagt ttttttagat      120

aaggtacaga taagaaacca cctctaaaaa tcaagcaata gatctcatcg cttaaaagaa      180

gagagagatc ttcacttgta tgtgtcccac tgattccaac acaatgtccc agaacttgcc      240

acgtgtcgtt catttcaaaa gattgcagta ctgttgtccc tagagaatca ttatctccct      300

cgctgtaata tctttatgct cctgtcactt tctgtctgta cccaaaagaa gtaatgaacc      360

tctctcatct tcttcttctc tgtttctttc atgttttgtg agttgtttct caacaatttt      420

ctggtctctt agagtgagag gagagagata gagagttgtg ttgggcgtgg aacttggact      480

agttccacat atcaggttat atagatcttc tctttcaact tctgattcgt ccagaagctt      540

tcctaatctg gtcagtagta ctctttttat acgggttttt ggttttataa gatgtggcta      600

tatttggaaa taactatttt gcaagctttc ctagattgcc agaatataaa aaaagatgtt      660

taacaagaga acggactcat ggacttgctt taaattttaa ttattttaaa atcattctat      720

aatgattaga gtaaataaac tattaggact ctgaattata aaattcgatt ttatatatgc      780

tcctccttag atctgacatg gaacaccaag gttggagttt tgaggagaat tatagtttgt      840

ccactaatag aagatctatc aggccacaag atgaactagt ggagttatta tggcgagatg      900

gacaagtggt tctgcagagc caaactcata gagaacaaac ccaaacccag aaacaagatc      960

atcatgaaga agccctaaga tccagcacct ttcttgaaga tcaagaaact gtctcttgga     1020

tccaataccc tccagatgaa gacccattcg aacccgacga cttctcctcc cacttcttct     1080

caaccatgga tcccctccag agaccaacct cagagacggt taagcctaag tccagtcctg     1140

aacctcctca agtcatggtt aagcctaagg cctgtcctga ccctcctcct caagtcatgc     1200

ctcctccaaa atttaggtta acaaattcat catcggggat tagggaaaca gaaatggaac     1260

agtactcggt aacgaccgtt ggacctagcc attgcggaag caacccatca cagaacgatc     1320

tcgatgtctc aatgagtcat gatcgaagca aaaacataga agaaaagctt aatccgaacg     1380

caagttcctc atcaggtggc tcctctggtt gcagctttgg caaagatatc aaagaaatgg     1440

ctagtggaag atgcatcaca accgaccgta agagaaaacg tataaatcac actgacgaat     1500

ctgtatctct atcagatgca atcggtaaca agtcgaacca acgatcagga tcaaaccgaa     1560

ggagtcgagc agctgaagtt cataatctct ccgaaaggag gaggagagat aggatcaatg     1620

agagaatgaa ggctttgcaa gaactaatac ctcactgcag taaaactgat aaagcttcga     1680

ttttagacga agccatagat tatttgaaat cacttcagtt acagcttcaa gtgatgtgga     1740

tggggagtgg aatggcggcg gcggcggctt cggctccgat gatgttcccc ggagttcaac     1800

ctcagcagtt catacgtcag atacagagcc cggtacagtt acctcgattt ccggttatgg     1860

atcagtctgc aattcagaac aatcccggtt tagtttgcca aaacccggta caaaaccaga     1920

tcatctccga ccggtttgct agatacatcg gtgggttccc acacatgcag gccgcgactc     1980

agatgcagcc gatggagatg ttgagattta gttcaccggc gggacagcaa agtcaacaac     2040

cgtcgtctgt gccgacgaag accaccgacg gttctcgttt ggaccactag gttggtgagc     2100

cactttttta cttccttatt tttggtatgt ttctttttta tatctatctt tctgaacata     2160

cttaaaacgt tcaaggatgt attattatag agtaaacgtg caacttcatt acgttatttt     2220

ctgtatatgt gagtttatgt atgtcaaaat gacatgatga gattttttgt aaacaacatc     2280

ttaaaaacag gacatgtgat ttttgtaatc gtaaaaactt tgggatgcag tttattttct     2340

aatcaaaa                                                              2348


<210> 138
<211> 1575
<212> DNA
<213> Arabidopsis thaliana

<400> 138
atgcctctgt ttgagctttt caggctcacc aaagctaagc ttgaatctgc tcaagacagg       60

aacccttctc cacctgtaga tgaagttgtg gagctggtgt gggaaaatgg tcagatatca      120

actcaaagtc agtcaagtag atcgaggaac attcctccac cacaagcaaa ctcttccaga      180

gctagagaga ttggaaatgg ctcaaagacg actatggtgg acgagatccc tatgtcagtg      240

ccatcactaa tgacgggttt gagtcaagac gatgactttg ttccatggtt gaatcatcat      300

ccctcccttg atggatattg ctctgatttc ttgcgtgatg tgtcgtctcc tgttactgtc      360

aacgagcaag agagtgatat ggcggtaaac caaactgctt tcccgttgtt tcagagaaga      420

aaggatggca atgaatcagc tcctgctgct tcttcgtcgc agtataacgg tttccaatcg      480

cattctctgt atggaagtga tagagctaga gatcttccca gccaacaaac caatccggat      540

cggtttactc agacgcagga accactaatt actagtaaca agcctagttt ggtcaacttt      600

tcacatttct tacgccctgc aacttttgcg aagactacta ataataacct tcatgacact      660

aaagaaaaga gtcctcaaag cccgccaaat gtgtttcaga ccagagttct tggagctaaa      720

gactctgaag ataaggttct taacgagtct gttgcttctg ctacgcctaa agataaccaa      780

aaggcttgcc taatatcaga ggactcatgt agaaaagacc aagagagtga aaaagcagtt      840

gtatgttctt ctgttggctc gggtaatagt ctcgatggcc catccgaaag tccttcactt      900

tctttaaaga gaaagcattc gaatattcaa gacattgact gtcatagtga agatgtggaa      960

gaagaatcag gagatggaag aaaggaagca ggtccatctc gaacgggttt gggttcaaag     1020

agaagccgct ctgcagaagt gcataatctg tctgaaagga gacggcgtga taggatcaac     1080

gagaagatgc gtgccctgca agaactcatt ccaaactgta acaaggtgga caaagcttcg     1140

atgctagatg aagccatcga gtatctcaag tcactccaac ttcaagtgca gatcatgtca     1200

atggcgtctg gttactatct gccaccggcg gttatgttcc caccgggtat ggggcattac     1260

ccggcagcag ctgctgcaat ggcaatgggt atgggaatgc cttatgcaat gggcttgcct     1320

gatttgagcc gtggtggttc atcggttaac cacggaccac agttccaagt ctcggggatg     1380

caacaacaac cagtggcgat gggtattcca cgtgtctctg gtggtggtat ctttgccggt     1440

tcttcgacga ttggcaatgg ctcgactaga gatttatctg gttctaaaga tcaaacaacg     1500

acgaataaca acagtaactt gaaaccaata aagagaaaac aggggtcttc tgatcagttt     1560

tgtggatcgt cgtga                                                      1575


<210> 139
<211> 1544
<212> DNA
<213> Arabidopsis thaliana

<400> 139
atggaacaag tgtttgctga ttggaatttt gaagataatt ttcacatgtc cactaataaa       60

agatcaatca gaccagaaga tgaattagtg gagctattgt ggagagatgg tcaagtggtt      120

ttacaaagcc aagctcgtag agaaccgtca gtccaagtcc aaacccacaa acaagaaacc      180

ctaagaaaac ccaacaatat ttttcttgac aaccaagaaa cagtacaaaa gcctaactac      240

gctgctctag atgatcaaga aaccgtctcc tggatacaat accctccgga tgacgtcatc      300

gaccctttcg aatccgagtt ctcctctcat ttcttctctt cgatcgatca cctcggaggt      360

cctgagaagc cacgaatgat cgaagagaca gttaagcatg aggctcaagc catggctcct      420

cctaagttta gatcctcggt tataacagtc ggaccgagtc attgcggcag caaccagtca      480

acaaatattc atcaggccac tacacttccg gtttctatga gtgatagaag caagaacgtc      540

gaagaaagac ttgacacctc gtcaggtggc tcctccggtt gcagctatgg aaggaacaac      600

aaagaaaccg ttagtggaac aagtgtaacc attgaccgta aaagaaaaca tgttatggat      660

gctgatcaag aatctgtgtc tcaatcagat atagggttga cctcaaccga tgatcaaacc      720

atgggcaaca aatcgagcca acggtcagga tctactcgaa gaagccgtgc agctgaagtt      780

cataatctct cagaaaggag gaggagagat cggatcaatg aaagaatgaa ggctcttcaa      840

gaactcatac ctcactgcag cagaacagat aaagcttcga tattggatga agcaattgat      900

tacttaaaat cacttcaaat gcaactccaa gtgatgtgga tgggaagtgg aatggcggcg      960

gcggcagcag cagcagcaag tccgatgatg tttcccgggg tacaatcatc tccatacatt     1020

aatcagatgg ctatgcaaag tcagatgcaa ttgtctcaat tcccggttat gaaccggtcc     1080

gctccgcaga accatcccgg tttagtatgt ctaaacccgg tacagttgca gctccaagca     1140

cagaaccaaa tcttatcgga gcagctcgct aggtacatgg gcgggattcc ccagatgccg     1200

ccggcgggaa atcagaccgt gcaacaacaa ccagcggaca tgttgggatt tggatctccg     1260

gcgggaccgc aaagtcaact gtcggcaccg gcgaccaccg acagtcttca tatgggtaaa     1320

ataggctgac ttggcatata gttttcctcc gaaattattc ttcttacagt tggtgattgt     1380

tatttatttt tggtcgccta agcaagcata aaagctaagt caaatgtatt atagagatct     1440

aataagttag tctcatactt ataacttatt tttaaacagt tgaattatag tatcaatcaa     1500

gtgttgggac ccgtaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa                      1544


<210> 140
<211> 1189
<212> DNA
<213> Arabidopsis thaliana

<400> 140
gaagaaataa cttttggaac attcaacaag acaacaaaat atgacttccc catcatccac       60

cttcagacca aattaagttc ttcaatcttg tttccctgtt tcacacacat atatatatat      120

atatatatat atatatatat atgtgtgtgt ttgtgtgcag acgatgatgt tcttaccaac      180

cgattattgt tgcaggttaa gcgatcaaga gtatatggag cttgtgtttg agaatggcca      240

gattcttgca aagggccaaa gatccaacgt ttctctgcat aatcaacgta ccaaatcgat      300

catggatttg tatgaggcag agtataacga ggatttcatg aagagtatca tccatggtgg      360

tggtggtgcc atcacaaatc tcggggacac gcaggttgtt ccacaaagtc atgttgctgc      420

tgcccatgaa acaaacatgt tggaaagcaa taaacatgtt gacgattctg agactttgaa      480

agcttcttca tcaaagagga tgatggttga ttatcataac cgaaagaaga tcaagtttat      540

acctcctgat gagcaatccg tggttgctga taggtcgttc aaattgggct ttgacacttc      600

ctccgtaggt ttcactgaag acagtgaagg atcgatgtat ctaagcagta gtctagatga      660

cgagtcagat gatgcgaggc cacaagttcc tgcaagaaca agaaaagctt tggtcaaaag      720

aaaacgaaat gcagaagcgt ataattcacc tgagagagac gacaacgaat cgatgttgga      780

tgaagcaatc aattatatga caaaccttca acttcaagtt cagatgatga cgatgggtaa      840

cagatttgtt acaccatcaa tgatgatgcc tttggggccg aactactctc agatgggtct      900

agcaatgggt gtgggaatgc aaatgggcga acaacagttt ctgcctgcac atgttctagg      960

agctggcttg cctgggatta atgattcagc agatatgcta aggtttctta accatcctgg     1020

actaatgcca atgcaaaact ctgcaccttt cattccaacg gaaaattgtt ccccacaatc     1080

tgtccctcca tcgtgcgctg ctttccctaa ccaaatacca aatcccaact ctttgtcaaa     1140

tttagatggt gcaaccttac acaagaaatc aaggaaaact aacagatga                 1189


<210> 141
<211> 561
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 141
ttggctacta cacttgaacg tattgagaag aactttgtca ttactgaccc aagattgcca       60

gataatccca ttatattcgc gtccgatagt ttcttgcagt tgacagaata tagccgtgaa      120

gaaattttgg gaagaaactg caggtttcta caaggtcctg aaactgatcg cgcgacagtg      180

agaaaaatta gttgggaaga aacgccaggt ttctacaagg tcctgaaact gatcgcagat      240

gccatagata accaaacaga ggtcactgtt cagctgatta attatacaaa gagtggtaaa      300

aagttctgga acctctttca cttgcagcct atgcgagatc agaagggaga tgtccagtac      360

tttattgggg ttcagttgga tggaactgag catgtccgag atgctgccga gagagaggga      420

gtcatgctga ttaagaaaac tgcagaaaat attgacgagg ccgcaaagag actgcccgac      480

gccaacctgg cagccgcagc caagaagaaa aagctggacg gaggttcaga tgacgatgac      540

aagggtggat ctggtggatc t                                                561


<210> 142
<211> 555
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 142
atgttagcct tgaaattagc aggtcttgat atcggaagtt tggctactac acttgaacgt       60

attgagaaga actttgtcat tactgaccca agattgccag ataatcccat tatattcgcg      120

tccgatagtt tcttgcagtt gacagaatat agccgtgaag aaattttggg aagaaactgc      180

aggtttctac aaggtcctga aactgatcgc gcgacagtga gaaaaattag agatgccata      240

gataaccaaa cagaggtcac tgttcagctg attaattata caaagagtgg taaaaagttc      300

tggaacctct ttcacttgca gcctatgcga gatcagaagg gagatgtcca gtactttatt      360

ggggttcagt tggatggaac tgagcatgtc cgagatgctg ccgagagaga gggagtcatg      420

ctgattaaga aaactgcaga aaatattgac gaggccgcaa agagactgcc cgacgccaac      480

ctggcagccg cagccaagaa gaaaaagctg gacggaggtt cagatgacga tgacaagggt      540

ggatctggtg gatct                                                       555


<210> 143
<211> 16
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS1 sequence

<400> 143
Lys Arg Leu Pro Asp Ala Asn Leu Ala Ala Pro Lys Thr Lys Arg Lys 
1               5                   10                  15      


<210> 144
<211> 16
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS2 sequence

<400> 144
Lys Arg Leu Pro Asp Ala Asn Leu Ala Ala Ala Ala Lys Lys Lys Lys 
1               5                   10                  15      


<210> 145
<211> 21
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS3 sequence

<400> 145
Lys Lys Thr Ala Glu Asn Ile Asp Glu Ala Ala Lys Glu Leu Pro Ala 
1               5                   10                  15      


Ala Lys Lys Lys Lys 
            20      


<210> 146
<211> 15
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS4 sequence

<400> 146
Lys Lys Thr Ala Glu Asn Ile Asp Pro Ala Ala Lys Lys Lys Lys 
1               5                   10                  15  


<210> 147
<211> 15
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS5 sequence

<400> 147
Lys Lys Thr Ala Glu Asn Ile Asp Pro Ala Ala Lys Lys Lys Lys 
1               5                   10                  15  


<210> 148
<211> 15
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS6 sequence

<400> 148
Lys Arg Leu Pro Asp Ala Asn Leu Ala Ala Ala Lys Lys Lys Lys 
1               5                   10                  15  


<210> 149
<211> 14
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS7 sequence

<400> 149
Lys Arg Leu Pro Asp Ala Asn Leu Ala Ala Lys Lys Lys Lys 
1               5                   10                  


<210> 150
<211> 13
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS8 sequence

<400> 150
Lys Arg Leu Pro Asp Ala Asn Leu Ala Lys Lys Lys Lys 
1               5                   10              


<210> 151
<211> 18
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS9 sequence

<400> 151
Lys Arg Leu Pro Asp Ala Asn Leu Ala Ala Ala Ala Ala Ala Lys Lys 
1               5                   10                  15      


Lys Lys 
        


<210> 152
<211> 20
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS10 sequence

<400> 152
Lys Arg Leu Pro Asp Ala Asn Leu Ala Ala Ala Ala Ala Ala Ala Ala 
1               5                   10                  15      


Lys Lys Lys Lys 
            20  


<210> 153
<211> 18
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS11 sequence

<400> 153
Lys Arg Leu Pro Asp Ala Asn Leu Ala Ala Ala Ala Lys Thr Lys Arg 
1               5                   10                  15      


Lys Lys 
        


<210> 154
<211> 16
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS12 sequence

<400> 154
Lys Arg Leu Pro Asp Ala Asn Leu Ala Ala Ala Ala Lys Lys Lys Lys 
1               5                   10                  15      


<210> 155
<211> 16
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS13 sequence

<400> 155
Lys Arg Leu Pro Asp Ala Asn Leu Ala Ala Ala Ala Lys Lys Lys Lys 
1               5                   10                  15      


<210> 156
<211> 16
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS14 sequence

<400> 156
Lys Arg Leu Pro Asp Ala Asn Leu Ala Ala Ala Ala Lys Lys Lys Lys 
1               5                   10                  15      


<210> 157
<211> 16
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS15 sequence

<400> 157
Arg Lys Glu Leu Pro Asp Ala Asn Leu Ala Ala Ala Lys Lys Lys Lys 
1               5                   10                  15      


<210> 158
<211> 16
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS16 sequence

<400> 158
Lys Lys Glu Leu Pro Asp Ala Asn Leu Ala Ala Ala Lys Lys Lys Lys 
1               5                   10                  15      


<210> 159
<211> 20
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS17 sequence

<400> 159
Arg Lys Glu Leu Pro Asp Ala Asn Leu Ala Ala Ala Arg Lys Thr Lys 
1               5                   10                  15      


Lys Lys Ile Lys 
            20  


<210> 160
<211> 15
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS18 sequence

<400> 160
Lys Lys Glu Leu Pro Asp Ala Asn Leu Ala Ala Ala Arg Arg Arg 
1               5                   10                  15  


<210> 161
<211> 17
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS19 sequence

<400> 161
Lys Lys Thr Ala Glu Asn Ile Asp Glu Ala Ala Lys Glu Leu Arg Arg 
1               5                   10                  15      


Arg 
    


<210> 162
<211> 22
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS20 sequence

<400> 162
Lys Lys Thr Ala Glu Asn Ile Asp Glu Ala Ala Lys Glu Leu Pro Asp 
1               5                   10                  15      


Ala Asn Leu Arg Arg Arg 
            20          


<210> 163
<211> 17
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS21 sequence

<400> 163
Lys Lys Thr Ala Glu Asn Ile Asp Glu Ala Ala Lys Glu Leu Arg Arg 
1               5                   10                  15      


Arg 
    


<210> 164
<211> 19
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS22 sequence

<400> 164
Lys Arg Leu Pro Asp Ala Asn Leu Ala Ala Ala Ala Ala Ala Ala Lys 
1               5                   10                  15      


Lys Lys Lys 
            


<210> 165
<211> 18
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS23 sequence

<400> 165
Lys Lys Thr Ala Glu Asn Ile Asp Glu Ala Ala Lys Glu Leu Lys Lys 
1               5                   10                  15      


Lys Lys 
        


<210> 166
<211> 19
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS24 sequence

<400> 166
Lys Lys Thr Ala Glu Asn Ile Asp Glu Ala Ala Lys Glu Leu Pro Lys 
1               5                   10                  15      


Lys Lys Lys 
            


<210> 167
<211> 21
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS25 sequence

<400> 167
Lys Lys Thr Ala Glu Asn Ile Asp Glu Ala Ala Lys Glu Leu Pro Asp 
1               5                   10                  15      


Ala Lys Lys Lys Lys 
            20      


<210> 168
<211> 23
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS26 sequence

<400> 168
Lys Lys Thr Ala Glu Asn Ile Asp Glu Ala Ala Lys Glu Leu Pro Asp 
1               5                   10                  15      


Ala Asn Leu Lys Lys Lys Lys 
            20              


<210> 169
<211> 29
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS27 sequence

<400> 169
Lys Lys Thr Ala Glu Asn Ile Asp Glu Ala Ala Lys Glu Leu Pro Asp 
1               5                   10                  15      


Ala Asn Leu Ala Ala Ala Ala Ala Ala Lys Lys Lys Lys 
            20                  25                  


<210> 170
<211> 17
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS28 sequence

<400> 170
Arg Lys Glu Leu Pro Asp Ala Asn Leu Ala Ala Ala Ala Lys Lys Lys 
1               5                   10                  15      


Lys 
    


<210> 171
<211> 17
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS29 sequence

<400> 171
Lys Lys Glu Leu Pro Asp Ala Asn Leu Ala Ala Ala Ala Lys Lys Lys 
1               5                   10                  15      


Lys 
    


<210> 172
<211> 19
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      biLINuS30 sequence

<400> 172
Arg Lys Glu Leu Pro Asp Ala Asn Leu Ala Ala Ala Ala Ala Ala Lys 
1               5                   10                  15      


Lys Lys Lys 
            


<210> 173
<211> 18
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 173
Pro Ser Thr Arg Ile Gln Gln Gln Leu Gly Gln Leu Thr Leu Glu Asn 
1               5                   10                  15      


Leu Gln 
        


<210> 174
<211> 18
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 174
Asn Leu Val Asp Leu Gln Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu 
1               5                   10                  15      


Gln Gln 
        


<210> 175
<211> 26
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 175
Leu Ala Leu Lys Leu Ala Gly Leu Asp Ile Gly Gly Ser Gly Gly Ser 
1               5                   10                  15      


Leu Ala Leu Lys Leu Ala Gly Leu Asp Ile 
            20                  25      


<210> 176
<211> 7
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 176
Glu Asn Leu Tyr Phe Gln Gly 
1               5           


<210> 177
<211> 5343
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 177
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg       60

cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc      120

ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg      180

gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc      240

acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt      300

ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc      360

ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta      420

acaaaaattt aacgcgaatt ttaacaaact agtaacgttt acaatttcag gtggcacttt      480

tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta      540

tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat      600

tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa      660

actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc      720

gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga      780

aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc      840

agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac      900

cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac      960

aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatgt     1020

tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag     1080

tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca     1140

taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac     1200

ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg     1260

tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca     1320

tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac     1380

cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa     1440

cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga     1500

gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg     1560

gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc     1620

agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag     1680

aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc     1740

agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg     1800

cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac     1860

accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga     1920

aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt     1980

ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag     2040

cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg     2100

gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta     2160

tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc     2220

agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg     2280

tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta     2340

caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg     2400

ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct     2460

gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag     2520

gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc     2580

gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag     2640

aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt     2700

ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa     2760

acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg     2820

ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg     2880

tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc     2940

tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta     3000

cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca     3060

gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc     3120

ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc     3180

catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa     3240

ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc     3300

gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac     3360

gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca     3420

ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta     3480

atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa     3540

cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat     3600

tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca     3660

ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa     3720

aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt     3780

atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg     3840

cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca     3900

gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta     3960

tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg     4020

agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat     4080

gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct     4140

ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg     4200

catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat     4260

tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc     4320

tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca     4380

gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg     4440

ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt     4500

tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg     4560

catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct     4620

cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga     4680

tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg     4740

ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc     4800

ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg     4860

cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg     4920

gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga     4980

aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa     5040

ttttgtttaa ctttaagaag gagatatacc atgggttctt ctcaccatca ccatcaccat     5100

gaaaacctgt acttccaatc caatattgga agtggataac ggatccgaat tcgagcgccg     5160

tcgacaagct tgcggccgca ctcgagcacc accaccacca ccactgagat ccggctgcta     5220

acaaagcccg aaaggaagct gagttggctg ctgccaccgc tgagcaataa ctagcataac     5280

cccttggggc ctctaaacgg gtcttgaggg gttttttgct gaaaggagga actatatccg     5340

gat                                                                   5343



