                         SEQUENCE LISTING

<110>  The Broad Institute, Inc.
       Massachusetts Institute of Technology
       Zhang, Feng
       Altae-Tran, Han
 
<120>  NOVEL CRISPR-CAS SYSTEMS AND USES THEREOF

<130>  BROD-4230WP

<150>  US 62/850,516
<151>  2019-05-20

<150>  US 63/000,293
<151>  2020-03-26

<160>  52    

<170>  PatentIn version 3.5

<210>  1
<211>  1369
<212>  PRT
<213>  Streptococcus pyogenes

<400>  1

Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val 
1               5                   10                  15      


Gly Trp Ala Val Ile Thr Asp Asp Tyr Lys Val Pro Ser Lys Lys Phe 
            20                  25                  30          


Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 
        35                  40                  45              


Gly Ala Leu Leu Phe Gly Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 
    50                  55                  60                  


Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 
65                  70                  75                  80  


Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 
                85                  90                  95      


Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys 
            100                 105                 110         


His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr 
        115                 120                 125             


His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Ala Asp 
    130                 135                 140                 


Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 
145                 150                 155                 160 


Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 
                165                 170                 175     


Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Ile Tyr 
            180                 185                 190         


Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Arg Val Asp Ala 
        195                 200                 205             


Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 
    210                 215                 220                 


Leu Ile Ala Gln Leu Pro Gly Glu Lys Arg Asn Gly Leu Phe Gly Asn 
225                 230                 235                 240 


Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 
                245                 250                 255     


Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 
            260                 265                 270         


Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 
        275                 280                 285             


Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 
    290                 295                 300                 


Ile Leu Arg Val Asn Ser Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 
305                 310                 315                 320 


Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 
                325                 330                 335     


Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 
            340                 345                 350         


Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 
        355                 360                 365             


Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 
    370                 375                 380                 


Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg 
385                 390                 395                 400 


Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 
                405                 410                 415     


Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe 
            420                 425                 430         


Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 
        435                 440                 445             


Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 
    450                 455                 460                 


Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 
465                 470                 475                 480 


Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 
                485                 490                 495     


Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 
            500                 505                 510         


Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 
        515                 520                 525             


Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 
    530                 535                 540                 


Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 
545                 550                 555                 560 


Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 
                565                 570                 575     


Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 
            580                 585                 590         


Ala Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 
        595                 600                 605             


Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 
    610                 615                 620                 


Leu Phe Glu Asp Arg Gly Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 
625                 630                 635                 640 


His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 
                645                 650                 655     


Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 
            660                 665                 670         


Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 
        675                 680                 685             


Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 
    690                 695                 700                 


Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly His Ser Leu 
705                 710                 715                 720 


His Glu Gln Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly 
                725                 730                 735     


Ile Leu Gln Thr Val Lys Ile Val Asp Glu Leu Val Lys Val Ile Val 
            740                 745                 750         


Ile Gly His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn 
        755                 760                 765             


Gln Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg 
    770                 775                 780                 


Ile Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His 
785                 790                 795                 800 


Pro Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr 
                805                 810                 815     


Leu Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn 
            820                 825                 830         


Arg Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Ile 
        835                 840                 845             


Lys Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn 
    850                 855                 860                 


Arg Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met 
865                 870                 875                 880 


Lys Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg 
                885                 890                 895     


Lys Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu 
            900                 905                 910         


Asp Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile 
        915                 920                 925             


Thr Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr 
    930                 935                 940                 


Asp Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys 
945                 950                 955                 960 


Ser Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val 
                965                 970                 975     


Arg Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala 
            980                 985                 990         


Val Val Gly Thr Ala Leu Ile Lys  Lys Tyr Pro Lys Leu  Glu Ser Glu 
        995                 1000                 1005             


Phe Val  Tyr Gly Asp Tyr Lys  Val Tyr Asp Val Arg  Lys Met Ile 
    1010                 1015                 1020             


Ala Lys  Ser Glu Gln Glu Ile  Gly Lys Ala Thr Ala  Lys Tyr Phe 
    1025                 1030                 1035             


Phe Tyr  Ser Asn Ile Met Asn  Phe Phe Lys Thr Glu  Ile Thr Leu 
    1040                 1045                 1050             


Ala Asn  Gly Glu Ile Arg Lys  Arg Pro Leu Ile Glu  Thr Asn Gly 
    1055                 1060                 1065             


Glu Thr  Gly Glu Ile Val Trp  Asp Lys Gly Arg Asp  Phe Ala Thr 
    1070                 1075                 1080             


Val Arg  Lys Val Leu Ser Met  Pro Gln Val Asn Ile  Val Lys Lys 
    1085                 1090                 1095             


Thr Glu  Val Gln Thr Gly Gly  Phe Ser Lys Glu Ser  Ile Leu Pro 
    1100                 1105                 1110             


Lys Arg  Asn Ser Asp Lys Leu  Ile Ala Arg Lys Lys  Asp Trp Asp 
    1115                 1120                 1125             


Pro Lys  Lys Tyr Gly Gly Phe  Asp Ser Pro Thr Val  Ala Tyr Ser 
    1130                 1135                 1140             


Val Leu  Val Val Ala Lys Val  Glu Lys Gly Lys Ser  Lys Lys Leu 
    1145                 1150                 1155             


Lys Ser  Val Lys Glu Leu Leu  Gly Ile Thr Ile Met  Glu Arg Ser 
    1160                 1165                 1170             


Ser Phe  Glu Lys Asn Pro Ile  Asp Phe Leu Glu Ala  Lys Gly Tyr 
    1175                 1180                 1185             


Lys Glu  Val Lys Lys Asp Leu  Ile Ile Lys Leu Pro  Lys Tyr Ser 
    1190                 1195                 1200             


Leu Phe  Glu Leu Glu Asn Gly  Arg Lys Arg Met Leu  Ala Ser Ala 
    1205                 1210                 1215             


Gly Glu  Leu Gln Lys Gly Asn  Glu Leu Ala Leu Pro  Ser Lys Tyr 
    1220                 1225                 1230             


Val Asn  Phe Leu Tyr Leu Ala  Ser His Tyr Glu Lys  Leu Lys Gly 
    1235                 1240                 1245             


Ser Pro  Glu Asp Asn Glu Gln  Lys Gln Leu Phe Val  Glu Gln His 
    1250                 1255                 1260             


Lys His  Tyr Leu Asp Glu Ile  Ile Glu Gln Ile Ser  Glu Phe Ser 
    1265                 1270                 1275             


Lys Arg  Val Ile Leu Ala Asp  Ala Asn Leu Asp Lys  Val Leu Ser 
    1280                 1285                 1290             


Ala Tyr  Asn Lys His Arg Asp  Lys Pro Ile Arg Glu  Gln Ala Glu 
    1295                 1300                 1305             


Asn Ile  Ile His Leu Phe Thr  Leu Thr Asn Leu Gly  Ala Pro Ala 
    1310                 1315                 1320             


Ala Phe  Lys Tyr Phe Asp Thr  Thr Ile Asp Arg Lys  Arg Tyr Thr 
    1325                 1330                 1335             


Ser Thr  Lys Glu Val Leu Asp  Ala Thr Leu Ile His  Gln Ser Ile 
    1340                 1345                 1350             


Thr Gly  Leu Tyr Glu Thr Arg  Ile Asp Leu Ser Gln  Leu Gly Gly 
    1355                 1360                 1365             


Asp 
    


<210>  2
<211>  2945
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  2
acttcgccct cacccgcccg gcctagctga tcaaaacgcc tccgcccagc cacgggagtg       60

cagaaggcag ctactccgct gccgcaacaa aggcgcgaag gcttcggcct ctcggaagga      120

attgaccatt tcctgatgaa tatggtatta taagaagcca tagtaatccc aaagggaagg      180

aggcaacgca tgaggagctt atgtcacggt ccgcccgtcg cgccctgccg actctgaggt      240

cagcaacatg accgactgca acgcttgctt gccgctgcac aattggtgga tacagttgag      300

tttcactctg aggtcagcaa catgaccgac tgcaacggtc gagcgcaacg cccgcagtca      360

ggccaccaac gggtttcact ctgaggtcag caacatgacc gactgcaacc cgggtagcag      420

catgataccg ctaccggcgc atgacggttt cactctgagg tcagcaacat gaccgactgc      480

aaccgaccat tctacggggt tctgatgccc cccaggggcg ttggcgcagc agcgcccctc      540

gcaaattgtg actctgcgcc aaactgccgc ccatttgcga aacagcgctt tgctgtacct      600

cagagtggaa tcagccccag cgagcgactc aatcgagtcc acggagtatc gaacccaata      660

ggcatagact ccgtcaccag ctacgtttgc cggaatgctc aagtagcacc tgtggatgtt      720

gctccagtcc gcagccctgc aagtctctgt ggtaggacaa ctcgcccggt cgctgctctg      780

tgcggcggca gggaaggtaa taccccaccg cgaggtactg aagtgggtaa cgtcccagag      840

acatgtgcta ccggcaaaca tgggcgagga gcaaactcac ccgtcttcgg acggagtagg      900

cggtcgcctt cgggcagact gccgaacccc cgtaaggggg aggacatata atgcgcgtct      960

tcgtgcaaaa cgccgatgcg actccgctga tgccctgcca tccggccagg gcgcgcaagc     1020

tcttgcgcaa ggaccgcgcg caggtcgtca atatgcaccc atttgtcatt cgcctgactg     1080

agcaaatcca ggacccaggg atgcagaccg tggagctcgg agtggatgac ggagctaaga     1140

atgtgggcct cgctgttgtt cagcggcgta gcaagcgccc cgatgtcgtg atctttgagg     1200

gcgtaatcga actccgcaca gacatgaaga aaggcttgga cgagagacgt gccatgcgcc     1260

gaggacgccg cagccgtatc cgccaccgcc aaccgcgctt tgacaaccga ccgcgtgcca     1320

aatgcaaggt ctgcggccgc aacacccctg agggtcaggc cctgtgccgc ccccacgccg     1380

ctgagggcca tcacaagtac gcacacctcg agaagaagcc gggatggata cctcccagca     1440

tcaaggcccg gaaggaccag accctccgca ccgttcgcca actgctacgc tggttgccca     1500

tttccacagc ccatctggag gttgccttct ttgacaccca ggccttgagt gagcccaccc     1560

tcactggcga gcagtatcaa tatggcccga acttcggaca tcgcaatcgc aaggctgcag     1620

tcctcttcct ctacaagcac acctgtcagt attgtggcgc aaccgagggc cgcatggaga     1680

tagaccacat agttccacgc ggcgccggcg gcaccgacac catcaccaac ttgacctgtg     1740

cttgtgtcga gtgcaatcgc aagaagggga accgcactcc cgaagcagcg gggatgaagc     1800

tgcgcagatc gccaagagct atcgctctgc gcttgcgcga tgcggccgtt gttcaggcgg     1860

ggaaaagcta tctggagtat cacctccgcg acatgattcc ggaggtgcgg ctggtgctgg     1920

gctggatgac gaactggtgg atgaagaaga tgaatctgcc caagcacgag agcgatggga     1980

agacgaagct gcactacacg gatgccgtgg ctatggtctt acgccaacgc cgagctacaa     2040

cagccagaat gtccgcagtg gtgtatcgca tcgaggcccg ccgccgccag acccgacaga     2100

tgtttaagac ggaaccgtat tcgttcaagc gtaagccccc aatggccgac tgcgtactgc     2160

cggctcgcaa gggaggtaag cgcaggctgc tcaagaccac cccgaatgac caccttctcg     2220

cctgggtgga cgatgcgggc aaacgcaaca agcaggtagt accgaacagg cgatatcccg     2280

acgccgacat gcctgtgctg ccggcaatgg cgcaagcagt gctacggttc gacagaaacg     2340

atattgtccg cgtcaaaggg cgtctcgctc gtgtatccgc ggttttcacc aacggctctc     2400

tgaaggttca acccaccgac gcgaagcagc ctctgtcggt gtccccctgg acggcaagac     2460

ttctcgccaa ggcgcggccc gtcacgttcc tgccctgtcc agttcccccc gtatccagcg     2520

cctggtagcc agcgaggagg ccggacacaa tgaaatgcag cattgacggc tgccccggcg     2580

agtacgagga acgcaagatc gtccacaccg tccgacatca cgggcaggta gtggtcattg     2640

atggcgtgcc ggcagaggtc tgttcggtgt gcagtgatgt cctactgagg ccggagacgg     2700

tccggcgcat tgaagagttg ctgcagagca agacggcccc gacgagcact gccccgctct     2760

accagtacgt atgaccacgg cttgcatttg ccgcaccccg acgccccggc tgctgccgcg     2820

gcatattcca tctcccgtga tcggacaggc gagacgccta tcctaccttc ttacgtcttc     2880

tcactatcct gctctatcgt cgcggcgagg cgggggatct cggggttggt gctgacgcgg     2940

cgcgt                                                                 2945


<210>  3
<211>  525
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  3

Met Arg Val Phe Val Gln Asn Ala Asp Ala Thr Pro Leu Met Pro Cys 
1               5                   10                  15      


His Pro Ala Arg Ala Arg Lys Leu Leu Arg Lys Asp Arg Ala Gln Val 
            20                  25                  30          


Val Asn Met His Pro Phe Val Ile Arg Leu Thr Glu Gln Ile Gln Asp 
        35                  40                  45              


Pro Gly Met Gln Thr Val Glu Leu Gly Val Asp Asp Gly Ala Lys Asn 
    50                  55                  60                  


Val Gly Leu Ala Val Val Gln Arg Arg Ser Lys Arg Pro Asp Val Val 
65                  70                  75                  80  


Ile Phe Glu Gly Val Ile Glu Leu Arg Thr Asp Met Lys Lys Gly Leu 
                85                  90                  95      


Asp Glu Arg Arg Ala Met Arg Arg Gly Arg Arg Ser Arg Ile Arg His 
            100                 105                 110         


Arg Gln Pro Arg Phe Asp Asn Arg Pro Arg Ala Lys Cys Lys Val Cys 
        115                 120                 125             


Gly Arg Asn Thr Pro Glu Gly Gln Ala Leu Cys Arg Pro His Ala Ala 
    130                 135                 140                 


Glu Gly His His Lys Tyr Ala His Leu Glu Lys Lys Pro Gly Trp Ile 
145                 150                 155                 160 


Pro Pro Ser Ile Lys Ala Arg Lys Asp Gln Thr Leu Arg Thr Val Arg 
                165                 170                 175     


Gln Leu Leu Arg Trp Leu Pro Ile Ser Thr Ala His Leu Glu Val Ala 
            180                 185                 190         


Phe Phe Asp Thr Gln Ala Leu Ser Glu Pro Thr Leu Thr Gly Glu Gln 
        195                 200                 205             


Tyr Gln Tyr Gly Pro Asn Phe Gly His Arg Asn Arg Lys Ala Ala Val 
    210                 215                 220                 


Leu Phe Leu Tyr Lys His Thr Cys Gln Tyr Cys Gly Ala Thr Glu Gly 
225                 230                 235                 240 


Arg Met Glu Ile Asp His Ile Val Pro Arg Gly Ala Gly Gly Thr Asp 
                245                 250                 255     


Thr Ile Thr Asn Leu Thr Cys Ala Cys Val Glu Cys Asn Arg Lys Lys 
            260                 265                 270         


Gly Asn Arg Thr Pro Glu Ala Ala Gly Met Lys Leu Arg Arg Ser Pro 
        275                 280                 285             


Arg Ala Ile Ala Leu Arg Leu Arg Asp Ala Ala Val Val Gln Ala Gly 
    290                 295                 300                 


Lys Ser Tyr Leu Glu Tyr His Leu Arg Asp Met Ile Pro Glu Val Arg 
305                 310                 315                 320 


Leu Val Leu Gly Trp Met Thr Asn Trp Trp Met Lys Lys Met Asn Leu 
                325                 330                 335     


Pro Lys His Glu Ser Asp Gly Lys Thr Lys Leu His Tyr Thr Asp Ala 
            340                 345                 350         


Val Ala Met Val Leu Arg Gln Arg Arg Ala Thr Thr Ala Arg Met Ser 
        355                 360                 365             


Ala Val Val Tyr Arg Ile Glu Ala Arg Arg Arg Gln Thr Arg Gln Met 
    370                 375                 380                 


Phe Lys Thr Glu Pro Tyr Ser Phe Lys Arg Lys Pro Pro Met Ala Asp 
385                 390                 395                 400 


Cys Val Leu Pro Ala Arg Lys Gly Gly Lys Arg Arg Leu Leu Lys Thr 
                405                 410                 415     


Thr Pro Asn Asp His Leu Leu Ala Trp Val Asp Asp Ala Gly Lys Arg 
            420                 425                 430         


Asn Lys Gln Val Val Pro Asn Arg Arg Tyr Pro Asp Ala Asp Met Pro 
        435                 440                 445             


Val Leu Pro Ala Met Ala Gln Ala Val Leu Arg Phe Asp Arg Asn Asp 
    450                 455                 460                 


Ile Val Arg Val Lys Gly Arg Leu Ala Arg Val Ser Ala Val Phe Thr 
465                 470                 475                 480 


Asn Gly Ser Leu Lys Val Gln Pro Thr Asp Ala Lys Gln Pro Leu Ser 
                485                 490                 495     


Val Ser Pro Trp Thr Ala Arg Leu Leu Ala Lys Ala Arg Pro Val Thr 
            500                 505                 510         


Phe Leu Pro Cys Pro Val Pro Pro Val Ser Ser Ala Trp 
        515                 520                 525 


<210>  4
<211>  3636
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  4
gatctacggg attacaagcc tcctgtatgg gaatgatagc agcgggcaga tggcactgcc       60

gggtaacctg gcggacctcc atgtgccgca gatcgcaaaa atcatttgcc ggtgcatttt      120

ttatagagca ccggtcccgg attacctggt atcatgggct atagccaatg caagcaagcc      180

gcacatttat aaaaaacgat accactggag gaacgtgctt tcccttgcat gtgctttcct      240

aaaatacaac ctggcactgc aaaagccagg ggaatcggag gttttgccat tggacatcat      300

gaagggctgc aaagacaggg attacttata cggcaggttg cttgccgtgg ctgaccggat      360

cgaataccgg acttttaata acgagcatga cagcaggaga ctgacgaacg caaaacgcta      420

tatgcagaga ttttcagtcc agccatgtca aacctggcag ctgatcgaac agaggatcca      480

gccatacctg cagaaactga aaatgcctga acgtctccat tacctgaaac tcatagacag      540

catcatgggt gaaatggatc ccgctgattt cagtgataac cgcccattga aagggcttta      600

cctggtcgga ttccaccacc agtcttacag tttttatgat aatgcaggga cgaatgtgac      660

agacaaggaa acaggggagg attaattcat gtctaagtta aaagggaaaa ttgatttcac      720

cttatttgta accgtaaact atgcaaaccc gaatggggat ccgctgaatg ggaaccagcc      780

aaggactaac ctgaaaggct acggcgagat ctctgacgta tgcatcaaaa ggaaaatacg      840

gaaccgtatg caggatttag ggtacaaggt ctttgtccag tcggacgaca gggcggacga      900

cgggcatacc agcttaaagg aaagggctgt cagctgccag gaactcaaaa aagagctcgg      960

caaaaaaaga ggggctgacc ccaaaatttg tgcggaaatc gcatgcaggg agtggctgga     1020

tgtacgttcc ttcgggcagg tgttcgcatt taaaggcagc aacgcatccc tggggatccg     1080

tggacctgtg acaatccaga cagccattag cctttcccct gtggaaattg aaagcatgca     1140

gatcacgaag agtgcgaata atgaaccggt ggacgggcgg tcatccgata ctatggggat     1200

gaaacatttc gtgacattcg gcgtatacaa aattgtaggc agcgtgacta cccagcttgc     1260

agaaaagact gaattttcac aggaagacgc cgcaatcctg aaagagtgtt tacggacgct     1320

ttttgtaaat gatgcatcct cagcacgccc tgatgggagc atggatgttg ccaggatgta     1380

ctggtggcag catacggaaa ataccccagt tgtaaccagc cataggatcc aggatgcctt     1440

ccattatgaa aaaccggcag accctgaaag atttgatgat taccttgtgt attgggaacc     1500

gattgacgga tgtattactc cagagatctt tgagtcaata actcctgact aagtcaggag     1560

cttgcttcgg tgagttcctg tcccgccaag cgggttattg agcagaacca agacctgccg     1620

ttcactccgg ggtaacgcca agccccggac actggcacag gcaggccaag gttatggcaa     1680

cacaacaggg gtatacccct gacttacagt atgaaaggaa tgttttatgg tttatgtatt     1740

ggacatggaa ggtaagccat tgatgcccac tacccggcac ggatgggtcc gcagggcctt     1800

aaagtccggg agggcgaaag cagtacagac cttaccgttt acgatccggc tgcagtatag     1860

cctggatgat tctgcattac aggatattac cctggggatc gaccccgggc gcaccaatat     1920

cggggttgcc gcagtccggg aagacggtac ctgcctgtat gccgcccact gtgaaacccg     1980

caacaggcag gtccggaaac agatggatga ccgccgtatg caccgccagg cttcgcgccg     2040

gggcgaacgg ctccgcagga agcgccgggc aaagcggaac ggtacattaa agcaggtgac     2100

attttccatg acagcaatgg gtccccgccc caacatcaat aacaggggcg aatttttcag     2160

gatgctgccg ggatgcaagg aagtttctgt ttacaaggat atccggaata cagaagcccg     2220

gttccagaac cgggcacggc cggaaggctg gcttagcccg actgcggggc acctgctccg     2280

cacgcatctg aacctggtac gcaagataca gcggatcctg cctgtttcca gggtttcgtt     2340

ggaactgaac cggttttcct ttatggagat ggaagccggt gggaagctcc cgcattgggc     2400

atatcagtgc ggcccgctgt atggaaaggg gggcgtacag gacgccattt ctgagatgca     2460

ggacgggaaa tgcctgttgt gtgggaaatc ccctatcgat cactgccacc accttaccca     2520

gcgcgcatgg ggcggaacgg accggctggc gaacctggtt gggctttgca gcggatgcca     2580

taaaaagatc catacggata tggccgccag caggaaactg gaggcaaagg ccggcaggcg     2640

gaataaaggc ttccgggcgc tgtccaccct gaaccagatc atcccatccc ttgcagacag     2700

cctggaggga atgtttggaa accggtttta tattgtcaat ggatgggaca caaaacagtt     2760

ccgtgaggat catgggatag aaaagaccca tgaacaggat gcatactgca tagcgtgttc     2820

tactatgcct ggggtgcgga atgtttcccc ggtagtggca acattccagg tccggcagta     2880

ccgccgccat gaccgtgccc ttgtcaaaag acagacagaa cggtgctatt acctggggaa     2940

gacgaaggtt gctgtcaacc gcaggaagcg gatggaccag aaaacagatt ccctggaaga     3000

ctggtaccag gacatgcgga cggaatatgg ggataagacc gctgacggga tgcgttcccg     3060

gctgaaagta aagaaaagcc agaggtccta caataaccca gggaggttgc tccctggcgc     3120

aaaattccat tatggtggga aaacctatgt catggaatcg cagatgacaa acggacaata     3180

ctaccgtgct gtaggacagg gcaagaaaaa tttccctgca gcaaaatcca ggatcctatg     3240

caggaaccag gggcttgtga ctgttggggt cagctattga cccgcattca tctcccggtt     3300

ctaccgggag ttttctgctt aagtgtttaa ataaattgac cgtgtcaata tgacaccctc     3360

tataaatcgc atcttatata ttaggtgtgc gagttgaaac gtgacatatt aatagaagat     3420

aaaaaatatc gtatcttaat ataggtgcgc gagttaaaat gcaaaaaacg caaaattatc     3480

aattggtttt tatcgatcgc atcttaatat aggtgcgcga gttgaagcgt ggaagccatg     3540

tgttaaatgg gaagaagcat ctgtcaggga aacctgatgg gtgctttttt ctgcaaaaat     3600

tgctatggaa aaagaaatat gtgaaaaaaa tctgtg                               3636


<210>  5
<211>  517
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  5

Met Val Tyr Val Leu Asp Met Glu Gly Lys Pro Leu Met Pro Thr Thr 
1               5                   10                  15      


Arg His Gly Trp Val Arg Arg Ala Leu Lys Ser Gly Arg Ala Lys Ala 
            20                  25                  30          


Val Gln Thr Leu Pro Phe Thr Ile Arg Leu Gln Tyr Ser Leu Asp Asp 
        35                  40                  45              


Ser Ala Leu Gln Asp Ile Thr Leu Gly Ile Asp Pro Gly Arg Thr Asn 
    50                  55                  60                  


Ile Gly Val Ala Ala Val Arg Glu Asp Gly Thr Cys Leu Tyr Ala Ala 
65                  70                  75                  80  


His Cys Glu Thr Arg Asn Arg Gln Val Arg Lys Gln Met Asp Asp Arg 
                85                  90                  95      


Arg Met His Arg Gln Ala Ser Arg Arg Gly Glu Arg Leu Arg Arg Lys 
            100                 105                 110         


Arg Arg Ala Lys Arg Asn Gly Thr Leu Lys Gln Val Thr Phe Ser Met 
        115                 120                 125             


Thr Ala Met Gly Pro Arg Pro Asn Ile Asn Asn Arg Gly Glu Phe Phe 
    130                 135                 140                 


Arg Met Leu Pro Gly Cys Lys Glu Val Ser Val Tyr Lys Asp Ile Arg 
145                 150                 155                 160 


Asn Thr Glu Ala Arg Phe Gln Asn Arg Ala Arg Pro Glu Gly Trp Leu 
                165                 170                 175     


Ser Pro Thr Ala Gly His Leu Leu Arg Thr His Leu Asn Leu Val Arg 
            180                 185                 190         


Lys Ile Gln Arg Ile Leu Pro Val Ser Arg Val Ser Leu Glu Leu Asn 
        195                 200                 205             


Arg Phe Ser Phe Met Glu Met Glu Ala Gly Gly Lys Leu Pro His Trp 
    210                 215                 220                 


Ala Tyr Gln Cys Gly Pro Leu Tyr Gly Lys Gly Gly Val Gln Asp Ala 
225                 230                 235                 240 


Ile Ser Glu Met Gln Asp Gly Lys Cys Leu Leu Cys Gly Lys Ser Pro 
                245                 250                 255     


Ile Asp His Cys His His Leu Thr Gln Arg Ala Trp Gly Gly Thr Asp 
            260                 265                 270         


Arg Leu Ala Asn Leu Val Gly Leu Cys Ser Gly Cys His Lys Lys Ile 
        275                 280                 285             


His Thr Asp Met Ala Ala Ser Arg Lys Leu Glu Ala Lys Ala Gly Arg 
    290                 295                 300                 


Arg Asn Lys Gly Phe Arg Ala Leu Ser Thr Leu Asn Gln Ile Ile Pro 
305                 310                 315                 320 


Ser Leu Ala Asp Ser Leu Glu Gly Met Phe Gly Asn Arg Phe Tyr Ile 
                325                 330                 335     


Val Asn Gly Trp Asp Thr Lys Gln Phe Arg Glu Asp His Gly Ile Glu 
            340                 345                 350         


Lys Thr His Glu Gln Asp Ala Tyr Cys Ile Ala Cys Ser Thr Met Pro 
        355                 360                 365             


Gly Val Arg Asn Val Ser Pro Val Val Ala Thr Phe Gln Val Arg Gln 
    370                 375                 380                 


Tyr Arg Arg His Asp Arg Ala Leu Val Lys Arg Gln Thr Glu Arg Cys 
385                 390                 395                 400 


Tyr Tyr Leu Gly Lys Thr Lys Val Ala Val Asn Arg Arg Lys Arg Met 
                405                 410                 415     


Asp Gln Lys Thr Asp Ser Leu Glu Asp Trp Tyr Gln Asp Met Arg Thr 
            420                 425                 430         


Glu Tyr Gly Asp Lys Thr Ala Asp Gly Met Arg Ser Arg Leu Lys Val 
        435                 440                 445             


Lys Lys Ser Gln Arg Ser Tyr Asn Asn Pro Gly Arg Leu Leu Pro Gly 
    450                 455                 460                 


Ala Lys Phe His Tyr Gly Gly Lys Thr Tyr Val Met Glu Ser Gln Met 
465                 470                 475                 480 


Thr Asn Gly Gln Tyr Tyr Arg Ala Val Gly Gln Gly Lys Lys Asn Phe 
                485                 490                 495     


Pro Ala Ala Lys Ser Arg Ile Leu Cys Arg Asn Gln Gly Leu Val Thr 
            500                 505                 510         


Val Gly Val Ser Tyr 
        515         


<210>  6
<211>  6487
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  6
aaacgataca taaccaagtg ttgcaccccg cacggggtgc attagattga aatcatcggg       60

tcacagataa cccggacgtg cgctgtgctg tcgcaccccg cacggggtgc gttgattgaa      120

atgattgaaa tcgcaacgtc ggaatcagca gggcacttgt cttgcgttgc accccgcatg      180

gggtgcaact ggtgtaggta taatatccta gcacgaacga aataaatcaa tatgtgttaa      240

gatatcttgc atcaaaagca gtctttatga gaaatcgtaa agactgcttt ttgtttgtgt      300

gatgcaaacc gttacaaaag acggaaactt atagcacagc ccacaagtgg gcagaaagga      360

aaaatatgct aacagaaaac atcaaacgaa aaattaagga cgttgtaaaa gaaatcaata      420

attacaacga aatcaaaatc atgccggacg acacaatcat tgatatggac cagacagtac      480

gttggaaccg tgataccgtt gctcgtctaa atgctaccgt tggttcgcgt caggcagaaa      540

tcggaacaaa agtggtagca aagatagttg accttattaa cacgttcctt ttagaacaaa      600

agtgcgaaca tatcgagcag gcagcaaaaa acatcatctt tgagtctatg cacgaagaaa      660

tcggtgatgt cttcctccac gaccacgacg acaactacga ttacctgcgc agttgtgatg      720

actatagttt ggaagggcaa atcaccaaaa ttgtcaacaa aaacgatgct tttgtggata      780

tgctcaacag catcgaaaga gagcgcaagt ctctaaatac gcaaaaagta acgcaggatg      840

caccttggat gaatattcgc gctccgttca cctttacatt tgctggaatc gaatattcaa      900

gcatttttgc cgctattatc tgctacaaat atggcgacag gttgaacaaa gttggctgca      960

atacgtggaa agcatccaat attctctctt atatccggaa agaaaaaatc aaaacgagtc     1020

ctgtttggaa aaacatcccg gctcgccgca aaaaaatcga ggatatcatc acggctgcag     1080

cttgtgctgg gctgtacgag aaactctttg aaacaggaaa caagaggctg attgcggatg     1140

acagtatcat cggagaatac ggaagcatcc tgacaaaggt acgagatgat gcatataagc     1200

aaatgcgaat cgactttatt gatgctaaag ggtatttgca aaaataggtg gtttgaataa     1260

aactatctgt ttttacaagt atcacaggca tttggatgtt tgttgcgcct attgattttc     1320

ccagttgata gctccgcacg agtggcattg tctcgtacaa taccgctcac agcaaacacc     1380

cagccaaggg aaacacaacc tcctgtttca acaggagaga cttacagtaa aaggaggtga     1440

cggatgtgtc cactatttat gtactcaaca aagacggtaa acctttgatg cctacgactc     1500

gctgtggtca tgtacgtcgt ctgcttaaag aacaaaaagc acgagccgta gcatcaaaac     1560

cgtttaccat tcaactgttg tatgaaactg acaatatagt acagccactt tacttgggca     1620

tcgaccccgg tagaaccaat atcggcgttg ctgtagtcaa agcagacggc acggcagttt     1680

ttaccgcgca tttggaaact cgtaacaagg aaattccgaa actaatgaaa aagcgtaaag     1740

attcccgccg cgcaagacgt accaacggca gacgatgccg ccgtcaacgg agagccaaag     1800

caaatggcac catttctaaa aagtgcgtaa agcaagccac tgctcaaaat ggcagtgcca     1860

gcaagcgtgc aaagaaaatt ggtgtcatca agcgccatct tccggattgt gagaaagaag     1920

tcctttgcat cggcatcaaa aataaagaag caaagttcaa caatcgcgca agaccggaag     1980

gctggctcac gcctaccgca aatcagttgc tacagaccca cgtcaatttg gtaaagaaaa     2040

ttcaaaagtt tcttcccatc agtgatatcg tgcttgaagt caacaaattt gcgttcatgc     2100

agttggataa tcctaacatt cagaaatggc agtatcagca aggtccgctc tatcaaaaag     2160

caaaccttga agaagccgtc tctgaaatgc aggaacatca ttgcttgttt tgcaagaagc     2220

cgattgagca ttaccaccat gtagtgcctc agcacaaaaa cggcagcaat acaatcagca     2280

acatcgttgg cttatgcaca aagcatcacg accttgtaca taaggatact gaatggcatg     2340

aaaaacttac taaaaagaaa acaggtctca acaaaaagta cggtgcgtta agtgtattga     2400

atcaaatcat accggcacta acaaaagaat tgagctctat tttcccaaaa cacttttttg     2460

tagcaacagg aaaaagcaca tacgattatc gtgcagcgca cggcgtaagt aaagaccact     2520

ggctcgatgc ctattgcatt gcttgctccg ttttgcctaa cgatgtttgc gacagcagca     2580

tcaataatcg cgtgccgtat gagcttaaac agtttcgtcg tcacgataga agagcactgc     2640

acaaagaaaa tatgagccgt gtgtacacgc tcaacggtaa aaaggtggca acaaatcgcc     2700

acaaagccat tgagcagact actgacagtt tggaagagtt tcgccaaaac aacctagata     2760

atgtatgtaa actcaaggta aaagagcatc atccggaata tcgaaatcct aaacgcaact     2820

tccccggctg tgtgtttctt gttggtaaac aaactcatgt aatgcaagga accagcggct     2880

cacacaacgg taaagcagat ggatattacg acacaaacgg caactcgtat tcatctggta     2940

aatgtaagtt tgttgccaaa aacgaaggaa ttatattcac ataaattagt agaccaccta     3000

ttttcaaata gaaaatcacc taattttgca aataccaatg ctaaaaaagg aagcaacact     3060

ctataacaga tattaagcaa aacgccgtcc acctcttggt gggcggtttt tttattgcct     3120

taaagcggat gcagagagac acaacatatg ttataataga gttatattat tgagagttcg     3180

agtctctctg ggcatattta gctatggaga gaatgtcaaa aaggtacagg tccatatttt     3240

gtcacaagtg caataaggag tcacaaatga gtaagcacat actaggctcg gaccgcattc     3300

tacacgaagg tgctggctac cgtaacaagt acacacttaa aaaccataag cctccggttg     3360

gcagcagaga aaacccgtcc aacccaaagc aagaggggac agacgcagtg tacatcccag     3420

ataccgccaa atggtgtagc aaaaagtaaa ccacaagttg attgacccac cacgcagatg     3480

tggtataata taatcagaac gaaacgaaag gagacaaccg aagatgctgt gcaagactgt     3540

taatgctatg tcgtttgctg agtatagtta tgaatctgaa ttcgagtcct acgaatccag     3600

ctttgtttcc tataccaatc gacaagcaaa aaacagacca tgtacagatg cggtgcgtct     3660

ctaaacgata actgcatttt cacacgctgc ttgtcgagtc atttcggcag gcagcgtttt     3720

tttgttgcct gtagtacaga aaggcagcaa gaaaatgaac gttccaacaa tcgatatcca     3780

gcagacaggt gccaacatca aggcactccg aaaagcggca ggcatcaagg tcaaggatgt     3840

ggcagacacg ctcggtgtat ccacacaggc agttgccaaa tggcaggcag gcactgcact     3900

tcctaccatc gacaaccttg tgattctcgc cgcgatgctc gatacgaaaa tcgatgacat     3960

cctcgtcata gcataaaccc tcgccgcagg attgcggcta tatggccgaa tagacgaatt     4020

ggttaagtcg caagcccttc aagcttgaga gtatgggttc aagccccatt tcggtcacca     4080

tctgcttctg tagctcagtt ggtagagcag taggttgaag ccctatgtgt cgctggttcg     4140

attccagccg ggagcaccac gaggcttaat gcctccttat atgtgccggt atgcaagtgg     4200

ttaaagtacg cggtctgtaa aaccgttccg ttacggttcg ctggttcgaa tccagcccgg     4260

cacaccataa ggccccttcg acaagttggt ctaagtcacc acactctcaa tgtggagtca     4320

gcagttcgag tctgctaggg gtcaccaacg caccctgcat agggtgttta catgcagagg     4380

tcgcctaacg gtatggcaac ggaccgctaa tccgtcgcga ggcaaaacgg cactcactac     4440

gaagtgccaa tcaatccctc gcctgcgagt tcgaatctcg ctctctgcgc catatgcatg     4500

tgtgtccgag tggctgatgg aactggtcca gaaaaccagc ggtcagaaat ggcccgtagg     4560

ttcgaatcct accacatgcg ccaaagagcc tatgcttggt gtttatcaag cataggctct     4620

tttgttttgg actaaataaa aaaatcgaag aagcagcatc tatcatgcgt agtaaaaaaa     4680

cggataatgc tgtttgcttg atggatattc ccgatgattt tgggaaagat gacccggatt     4740

tgtacgaaaa aacatatgac gcagcaatcg cttttgcaaa agaatttcca gacacattat     4800

ggtgctatgg aaatcatgac ttgagttatg tatgggggaa gctagaaaca ggatacaatc     4860

cagaacttcg agatttggta tgtcaaaaga tagaggagct gaaagaggtt cttccatcgc     4920

caactcagct tgcatatatc caccgaattg acaaaactct gtttatgcat ggcggtctct     4980

caaacttcta cgtacatcgt tgggtaacgc caagcagtca aaaagcgatt ggtcgtacca     5040

tcaaagaaat caataatatg tatgggtttt gcacaagcaa acagtttaca ggaggccatg     5100

acgattgcgc tcgactatac gcaatctcaa atgacagaaa aaaacgtcaa gaaaactatc     5160

agagataatc tgtattatat cccttcggtt cgcacagggt acacccacta aaaataataa     5220

atatccatcc aactaaagga gctgcctacc gtttggtggg cggctctttt tattgccgca     5280

aaagcggatg cagagtaaca cagtatatgc tataataaag tcatcaaagt ggtgcgaacc     5340

ttaagctaac ataaaattgc tgggagcttc gcaccaaaaa gtaccagata gagggtggtt     5400

ccaatgcagt aaataggcga gtgcttcagt gtgtgaagca cataattagc aaaactgtac     5460

atcctacgtg ctaccaagat acgcttgaat tgtatagttt tgctgtcgca ccccgcacgg     5520

ggtgtattag attgaaatga ttgaaataaa cggcgtgtcc ttatcggtac acactgcgcc     5580

gtaagcgatg ggcgcaaaac ggttgcctgc atattccaca acgccccagc ccacgatcgc     5640

atagccgggg tcgatgccca aaacccgcat agtatcccct ttccctgtgt ttgaatctat     5700

tcaggaaacg ccgcaccggc aaaagggtgc actatcccca tttaactgta tcagtatacc     5760

acaaagccgt gtatgcggca aggaaaagcc cttccgcatt tgcaaatttc ttttgcggtg     5820

gtgcgttcta tttcatgtat tttctgaatt ttttcatttt tttcgaaaaa agggcttgct     5880

ttttctgctt ggatttggta tagtatacaa gtcgcaagga catgcgcggt tagctcagct     5940

ggtagagcat ctgcttgacg tgcaggaggt cacaggttcg agtcctgtac cgcgcaccat     6000

aaccggacac catttttgat acaatacgta tcttgactgg tgtccggttt tttattaagg     6060

tttagattct tcagggcctt taccctttat tatgatttaa cgctaactct tattgcaaaa     6120

atgaaatcta tttttcacag agcgccttcc ggttacttta cttttcggca acagaggttt     6180

ttatgcaccc gcgggtgttt gtgatatttc ggggtgagat atccggcccg ttgtggggca     6240

tcgcccctct actttgggca aggcacatcg cccattttta ttgttcgctc cgggtgggtt     6300

tcggcggggt cgggtgaccc cggccctacc agccatttta tggttttccg cccatcatca     6360

aatgctctgt ggggaacggt attgaccgtt ccgaaagttt gcaaagatcc aacatctcaa     6420

aatcaaatca cttcgtctat tccctatccc accagccaaa gaattcccat tcggggcact     6480

accacaa                                                               6487


<210>  7
<211>  498
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  7

Met Pro Thr Thr Arg Cys Gly His Val Arg Arg Leu Leu Lys Glu Gln 
1               5                   10                  15      


Lys Ala Arg Ala Val Ala Ser Lys Pro Phe Thr Ile Gln Leu Leu Tyr 
            20                  25                  30          


Glu Thr Asp Asn Ile Val Gln Pro Leu Tyr Leu Gly Ile Asp Pro Gly 
        35                  40                  45              


Arg Thr Asn Ile Gly Val Ala Val Val Lys Ala Asp Gly Thr Ala Val 
    50                  55                  60                  


Phe Thr Ala His Leu Glu Thr Arg Asn Lys Glu Ile Pro Lys Leu Met 
65                  70                  75                  80  


Lys Lys Arg Lys Asp Ser Arg Arg Ala Arg Arg Thr Asn Gly Arg Arg 
                85                  90                  95      


Cys Arg Arg Gln Arg Arg Ala Lys Ala Asn Gly Thr Ile Ser Lys Lys 
            100                 105                 110         


Cys Val Lys Gln Ala Thr Ala Gln Asn Gly Ser Ala Ser Lys Arg Ala 
        115                 120                 125             


Lys Lys Ile Gly Val Ile Lys Arg His Leu Pro Asp Cys Glu Lys Glu 
    130                 135                 140                 


Val Leu Cys Ile Gly Ile Lys Asn Lys Glu Ala Lys Phe Asn Asn Arg 
145                 150                 155                 160 


Ala Arg Pro Glu Gly Trp Leu Thr Pro Thr Ala Asn Gln Leu Leu Gln 
                165                 170                 175     


Thr His Val Asn Leu Val Lys Lys Ile Gln Lys Phe Leu Pro Ile Ser 
            180                 185                 190         


Asp Ile Val Leu Glu Val Asn Lys Phe Ala Phe Met Gln Leu Asp Asn 
        195                 200                 205             


Pro Asn Ile Gln Lys Trp Gln Tyr Gln Gln Gly Pro Leu Tyr Gln Lys 
    210                 215                 220                 


Ala Asn Leu Glu Glu Ala Val Ser Glu Met Gln Glu His His Cys Leu 
225                 230                 235                 240 


Phe Cys Lys Lys Pro Ile Glu His Tyr His His Val Val Pro Gln His 
                245                 250                 255     


Lys Asn Gly Ser Asn Thr Ile Ser Asn Ile Val Gly Leu Cys Thr Lys 
            260                 265                 270         


His His Asp Leu Val His Lys Asp Thr Glu Trp His Glu Lys Leu Thr 
        275                 280                 285             


Lys Lys Lys Thr Gly Leu Asn Lys Lys Tyr Gly Ala Leu Ser Val Leu 
    290                 295                 300                 


Asn Gln Ile Ile Pro Ala Leu Thr Lys Glu Leu Ser Ser Ile Phe Pro 
305                 310                 315                 320 


Lys His Phe Phe Val Ala Thr Gly Lys Ser Thr Tyr Asp Tyr Arg Ala 
                325                 330                 335     


Ala His Gly Val Ser Lys Asp His Trp Leu Asp Ala Tyr Cys Ile Ala 
            340                 345                 350         


Cys Ser Val Leu Pro Asn Asp Val Cys Asp Ser Ser Ile Asn Asn Arg 
        355                 360                 365             


Val Pro Tyr Glu Leu Lys Gln Phe Arg Arg His Asp Arg Arg Ala Leu 
    370                 375                 380                 


His Lys Glu Asn Met Ser Arg Val Tyr Thr Leu Asn Gly Lys Lys Val 
385                 390                 395                 400 


Ala Thr Asn Arg His Lys Ala Ile Glu Gln Thr Thr Asp Ser Leu Glu 
                405                 410                 415     


Glu Phe Arg Gln Asn Asn Leu Asp Asn Val Cys Lys Leu Lys Val Lys 
            420                 425                 430         


Glu His His Pro Glu Tyr Arg Asn Pro Lys Arg Asn Phe Pro Gly Cys 
        435                 440                 445             


Val Phe Leu Val Gly Lys Gln Thr His Val Met Gln Gly Thr Ser Gly 
    450                 455                 460                 


Ser His Asn Gly Lys Ala Asp Gly Tyr Tyr Asp Thr Asn Gly Asn Ser 
465                 470                 475                 480 


Tyr Ser Ser Gly Lys Cys Lys Phe Val Ala Lys Asn Glu Gly Ile Ile 
                485                 490                 495     


Phe Thr 
        


<210>  8
<211>  8092
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  8
tacgagatac tgatcggtct cgtgggctcg gagatgtgta taagagacag tgactcattt       60

gttcaactat gacgtgagaa ataaaaaaag ttgattattc tgtcaaagat gaataaatat      120

gtatggcata aataaaaaat ctgtagtttc cttattctgg aggtgctatg gatcatgcga      180

acaaaacagg aatttgaatt tttcgggatc cgattaaaac aactcagaaa aagtaaggga      240

atcacgcaag aagaattggc gtacagaatc aactgttcta caacatatat cagcaaactg      300

gaaaacggga aatctatttg cagtatggaa cggttgtttg aaatctcgga cgttttgcaa      360

tgcgatgtat cggaattatt gttgggaaca aatcgcaaca gccaaatcta tttacatact      420

gaattttctg aacagtttca gaaactgtct tcacatgaca aagaaattat ttgtatgctc      480

atgaaatgta tgctggaaaa gaccgaggag ggtatccctg aatatgccgt tccgtgattt      540

ctcatctgac gaaagggata tgacaagata cttcgagtat tcagctatac cattatacaa      600

atacgcattg aagctctcag acgatccaca aatagcggaa gatatcgtcc aaaatacatt      660

catgaaactg atccggtact ttgataatgt tcgtggattt tcaaatgagc ggttcttcct      720

ttatgcaaaa cgtgtactgg tccatgaatt cggcgccgaa ctgaaaataa aaagcacaat      780

atctttgatc acggatatgg atatgatcca agataaagaa gaaaatatcg atttcataga      840

tgcgatcttg acaagagacg aaataaaacg atgtataaat caacttaatc caaaatacaa      900

aatcatcgta tatctccgat actttgaaaa taaaaccagt acccagatcg gacagatatt      960

gaatatcccg gataatcatg tcagaaccat acttagccga gccaggaatg aactgaaaag     1020

aaaatataag gaaataaagg aaggaaatac cgggcatgac aaataaagat ctggatattt     1080

tgaaaaataa aaaggcagaa caagaattct ttgatatggc cgcaagagaa attgataagg     1140

aattggacca agtatttata gaagcccaaa acctgccgga cccggacccg gaaatcttac     1200

aaaggatcca aaaaaactta aacgcagaaa tgtataaaca gactgtggct tggtatatca     1260

aaaaatgggc gagcggaatc ggaaaagtcg cagcatgtat cgttttcgtg tcaggagtaa     1320

tattcacagg cgcttattta catgtcgagg cgaccaggaa ccaagttaat aatttctttc     1380

tggaattgtt tgatgaatac gctgtgatcc atcacggcga agatgatcgg gaatctggtg     1440

tcacgatgcc tgcatcgtgg tcgggtccgt ttagtgttgg ctgggtacca caacggttta     1500

cggatgtctt tgcacaggat atgacatata gctgggcgct atattacgat gatttgaata     1560

gcgataacga tctggcaatt tatgtctggg atcctgatat tgcaccaacg atcaatacag     1620

aaggggaaaa taaaatcagt gaacaggaaa ataaaatgac ggaatactat cataatcctt     1680

caagaaatac ctattctgtt gtatctgtac aaaacggata tatgatttat gtgaaaaatg     1740

ccaataccca agaagaagca gaaaaaatat tgaaaaatat ttctttctga tgttgcgatt     1800

tacgttgttt ttcgttctta ttatttggtt cgatccaata tttttgaacc agaaaggacg     1860

ataacaatga aaaaactgat ttgtgtgtgg ctggctgcag ttatgatgtt ctgcttctct     1920

acctccgtat ttgcagcaga tacagcagag tcgatctcga attcacacat cgaggcccga     1980

tacaatgctt tcgacacggt ttgggtatac ttgacggaaa cgcaaccggg gatccttcat     2040

gtggaaggcg gagccgggac atccagcagc agcaattacg tcgagatccg tgttgtgatc     2100

tgccagtacc agaactcgca acaagggatg gtcccggtag atggatttga atggacatct     2160

gcatctcagt ttggaacatc tctgcaggcg aatcgctcgg tttcacgcgg cagttaccag     2220

gcgatcatct atgcaaaatg ctaccagaac ggggtcctgg tcgatgacgt tgaggtagag     2280

agcagcatcg taaacgttgt gagctgatac ggttcctaaa cgaccctgta tggaggaatc     2340

aatgaataga tggtcctgta cttaccagta agacaaaatc tatatcatgt ggatttgggg     2400

aattatgtct cttatggatt gaaagcgctt tctctgtcga ttttttctat ccgatccatt     2460

accttcgtgt cggatgtgac acccgatagg acccggatct ttcatttggc atggctttgt     2520

tccataggac aacttgaccc aaatcaattg ttggatgtga tcgaagattt catctgacaa     2580

atcttaattt tcacttcact attaaaattg gatcgaacca gcggtgtcag aagattttct     2640

ggcgccgctt ttttattgca ctttgttgca aagtatttat actggattca taagataaat     2700

caaattgcta caggcacttt agttttatct tcataatccg caaaggtcat ctcaaacaga     2760

gatggccttt tatcttttgc aagaaaggaa atttgtatga acgtaaatca atctctctat     2820

gagaaaatga ccgctgaaca gaataagttc cgggattggt tgaaaggcca aaatccgcag     2880

gagatcctgg atcatgccta cgagtacacg atccgggaag acatcctgat ggcgatggag     2940

gaacttgacc tgcctgaaaa ccaggccgct gcattgctgg cgtcaccgtc tccgttggcc     3000

gatgtctaca aggagttttc caaccgggaa acgccttaca tggatgtggt acgggacagt     3060

atcgaacagc gagccgaggc tgcaatggac gctcaacgca aattgccgat ttaccagcac     3120

aatgctgctt atgctcgtga acagggtgaa ctggatttgt accgggagtc ccgccgcgtc     3180

aacatcgcct gtaagaaggc tatcgaagca tccatctcga aatatcacca tggcaaagga     3240

ctgatcaaag aatatgtgct tgatgtgatc aaacagttcg gctactcccg taccctctat     3300

gtgctggcaa acaccgcaca gcagaaagat tgggatgggc gaatttgtaa ggaaaataag     3360

gaatgggcta agtccgtaaa aatcccggag aatccggatt gctttggcag tgatcgaaac     3420

cgggattttg tgttagacag tcatcccgct ctggttgacc tgtttttgac ccaggccttg     3480

gcgctgacgc tccacgatgt ctgcgacttt taataaggag gaaaaataaa atgtcggcta     3540

aaaatcagta cacgaaaatg atttcccgtg atgggaaaat tgtgggtgaa atcaaaaacc     3600

tgcacagtcg cccttgccct atggctgggt gtaagggaca tcgtatccat gtcctttggc     3660

cggatgggaa aagcacctat ccgtgctcaa acggatgcaa ggaaattgat cccaatacct     3720

tgcaaatttt gtaaaaaaga gagggtggct ttattttggc cgccaacatt tttagaaaaa     3780

catagaaagg atgtgaacgg cgcattcctc ctatgacttc agtcttggga ggaatgccac     3840

gaatttatga aaaacataag tttaaccgag aaagaaaaaa aacgtctctc ttatacaaaa     3900

ccgtatggtc tgtggaaagt cactacagaa ggagattgtg aaggtcgctc atcgagaaat     3960

ctgggtatct ttgaaggata ccttgacgac atcgcatttt atctggcaga caaagcgtat     4020

tatactttgg aattcgaaaa aatagatatt ctccgtattt cgcataaaaa agtgaatgcg     4080

gaacgaagcg aagttaatgt ttcgcttgac atatcttcag gaacatgggg tatgtcgagt     4140

gaagagcgcg tcctggaatt caaaaagctg ctatccggga gacacgttcg cgtcacggaa     4200

ggcgatactt atgcttccgt aaagttgtgc aaggacggaa tttaacagaa atttgccgtg     4260

gaaaaaagta cagaaaggat gtgagcggtg cagaataccc gtgacttcgt cacggagaaa     4320

atgcaccaag tttatgaaac ataacaaaat tgtccgattg atttttgttg cagttctttg     4380

cagcatcgtt atcggttctc tatctggctg tgcgcagttg gaaaacttac taaacactgc     4440

cagggaaaaa ttggtcggtt ccgatttcac gatcacacag tatgaccata tgggcaaccc     4500

aaccatgaaa atccacgggg acagtgtttc cgttggcctg cttgaaaacc aatcgaacct     4560

tgatattgag actacagggt tcgaatcaga agttctcgaa ctgaccgttg acggaaatca     4620

agttctaact gttggcgata cctgtgttat tgccgaagag ggtctcgata tgatcacgga     4680

tttctccgac atcaatacag acatcgatac ggctgacggt ctgcctgctt ttattgctgg     4740

tgatcggttg gtaaatgatt ttcgaaactc tatcggtaaa aatatggtcg tcgtaattaa     4800

atcccagatg ggtatgccaa tcggaattta tcagggcgac gaggtctatg tgactgtgcc     4860

tgatagtctt ccaaaaacaa cgcagctaac tattgacgga aaacagctgt acatccatcg     4920

tgccaactat acgattatcg aaggggacat gttggataac gctgcttgat ttgcaaaagg     4980

accctgcata tgagtcagga gcaatcctga caagggtgta gggtacaggc atcatgggca     5040

ttgctcatga ggaacgtacg taaccttgca ggtcagctgc gaggggatgc actgccgggt     5100

ttttccagct cggtagggtc agggaatcat ctgcatagcg tacgctcagc caggggaaac     5160

attaccttcc gcaaggaaga gtcttattga agggagtagc gtcatggcta cagtttatgt     5220

attgtcaaag actggtaaac ctttgatgcc cacaactcgc tgcgaccatg tgcgcatact     5280

tcttaaacag aagaaagcac gggtcgtgaa tctcaaaccg tttaccatcc aactgttgta     5340

tgactgtaag gaaggtactc agcccattgt gctgggtatt gaccctggtc gtaccaacat     5400

cgggctttct gctgttcgaa aagatacagg tgaacctgta ttcactgcgc agatggagac     5460

ccgcaacaaa gaaattccta aactgatgag agaccgaaag gctttttgtc aaaaacatcg     5520

gtgcttcggc cgccgcaaag tacgtcaacg cagggcatct gcacataaca ccaattcgtc     5580

aaagtgcgca aaacaagagg ttgcacaaaa cggtggcgtt agcaaacagg ctcaaaaagt     5640

tggcgtaatc aaacgacaac ttccgggttg tgaaaaaccg gttctttgta tcggaattaa     5700

aaacaaggag gctcgtttta ataaccgcct acgtccgaaa ggatggctaa cgcctacagc     5760

aaatcatttg ttccagactc acataaacgt tgttaataag gcaaagaaat ttcttccaat     5820

aacagatgtg gttttggaag tcaacaaatt tgtattcatg gcgttggaca atcctcacat     5880

tcagaaatgg atgtatcagc gtggaccgct caaaggctac ggcagcgttg aagaggcggt     5940

ttctgtacag caaggtggtc attgcatctt ctgtaaaaag gagattacaa actaccacca     6000

cattgtccca caaggtaaac gcgggtctaa tactattagc aacattattg gcctttgctc     6060

tatgcaccac gacttagttc ataaggacag tacgtgggaa cagaaactca aaaccaaaaa     6120

gcagggcatg aataaaaagt acggcgcttt aagtgttttg aatcaaatta ttcctaaact     6180

ttgtgattct cttagtgccg agtttagtga gcatttctat gtgacggatg gaagaagtac     6240

caaggctttt cgtgatgcct acaacatcaa gaaagaccat tatcttgatg ctttctgcat     6300

tgcctgtagt attctttcgg tagaagatgt taaggttcct tgcgaaagca atgtgttcct     6360

tatccttcag ttccgtcgtc atgacaggcg cgcttgtcac caggagcgag ttgaccggaa     6420

gtattatctt gacggcaaac gcgttgccac taatcgtcac aaagctattg aacagatgaa     6480

cgatagcctt gaagaatatg taaccaatgg tggctgcgtc gataaactga ctgcgccaaa     6540

acatccgccg ttatacagac ggaaaagccg cattatgccg ggaacagtat ttttggtcgg     6600

caacaaaact aaggtgatga ttgcatcgca aggcacgcat aatggcgttc ccaactacta     6660

tcgtttcacc aatggtttaa gagctacacc aaaaaactgt aaaccaattt atcaaaacac     6720

cggcatcgtg tttgtttgat tttgtatcaa tcacggtgct gtaagggaga aaaatatgac     6780

gaacaagaac acatggacag agacagactc tgactgctgt cagtatgtcc attattttga     6840

tgagatcctt ggaccgactg ggactctttt tgagttcgtt caaatcacag gattgccgaa     6900

tggtcagtac gggatctctc acgctgtcat tgatatcgaa tgctatgagc agaaagatat     6960

ccttgatgcg cttaatcttt acgggtataa atccatggac gatttcgtcc aggaaatctc     7020

tccgtacaag attgaaaaga agaaagacgg aacgcttgat ccggaatccg aacactacat     7080

tatcgacaac gagcaaatcg ctgaaatgct cttcgagatc ggtgctttcg actcccttct     7140

cgataacgta gtctttgaca cgttcgaaga tgctgaacag tatcttacga aatttttcgt     7200

ttgacgtcct cccacccctc acggagtggg attccccaaa ccgaccagga gacagaggtc     7260

ttccattaaa cgagatcctg ccgtgaagca ggtaagacag acaggcgcag ccacaaaatg     7320

gctgcgcctg ttctatttag attcacgctc cccatacggg gagcgacggc atattgctca     7380

aatgtatggc tggctgcgag acatttggat tcacgctccc catacgggga gcgacattga     7440

catcatgatg ctttatggtg cacgctgaat ttagattcac gctccccata cggggagcga     7500

cgttttgttt tgcgatatca ctactgccac gcttaattta gattcacgct ccccatacgg     7560

ggagcgaccg tgtccttccg cacatacttg acaatggaca gatttatatt cacgctcccc     7620

atacggggag cgacccgtgg gttatcggtc aagatggtgg cagccttatt tagattcacg     7680

ctccccatac ggggagcgaa cacctagctt cgggcggaaa ggacacacta tgaaatttag     7740

attcacgctc cccatacggg gagcgaccac tcataatttt gcggcactgc acgacatatt     7800

tatttagatt cacgctcccc atacggggag cgacagcaat cctgcacagt ccattctctg     7860

ctaagggaag caatatttaa cggaaatgca caaaactgca ttgatttcat cgctattata     7920

ccacaaaaac aaaatacagt caatcttaaa gcggtgcgat cggacaatga aaactgtgtt     7980

tgcttgtggt tcgcacaagg aaaattttaa tactttcttt ttgttgcaaa gtaagtatat     8040

taaatcaaaa ttttaatact ttctttttgt tgcaaagtaa gtatattaaa tc             8092


<210>  9
<211>  511
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  9

Met Ala Thr Val Tyr Val Leu Ser Lys Thr Gly Lys Pro Leu Met Pro 
1               5                   10                  15      


Thr Thr Arg Cys Asp His Val Arg Ile Leu Leu Lys Gln Lys Lys Ala 
            20                  25                  30          


Arg Val Val Asn Leu Lys Pro Phe Thr Ile Gln Leu Leu Tyr Asp Cys 
        35                  40                  45              


Lys Glu Gly Thr Gln Pro Ile Val Leu Gly Ile Asp Pro Gly Arg Thr 
    50                  55                  60                  


Asn Ile Gly Leu Ser Ala Val Arg Lys Asp Thr Gly Glu Pro Val Phe 
65                  70                  75                  80  


Thr Ala Gln Met Glu Thr Arg Asn Lys Glu Ile Pro Lys Leu Met Arg 
                85                  90                  95      


Asp Arg Lys Ala Phe Cys Gln Lys His Arg Cys Phe Gly Arg Arg Lys 
            100                 105                 110         


Val Arg Gln Arg Arg Ala Ser Ala His Asn Thr Asn Ser Ser Lys Cys 
        115                 120                 125             


Ala Lys Gln Glu Val Ala Gln Asn Gly Gly Val Ser Lys Gln Ala Gln 
    130                 135                 140                 


Lys Val Gly Val Ile Lys Arg Gln Leu Pro Gly Cys Glu Lys Pro Val 
145                 150                 155                 160 


Leu Cys Ile Gly Ile Lys Asn Lys Glu Ala Arg Phe Asn Asn Arg Leu 
                165                 170                 175     


Arg Pro Lys Gly Trp Leu Thr Pro Thr Ala Asn His Leu Phe Gln Thr 
            180                 185                 190         


His Ile Asn Val Val Asn Lys Ala Lys Lys Phe Leu Pro Ile Thr Asp 
        195                 200                 205             


Val Val Leu Glu Val Asn Lys Phe Val Phe Met Ala Leu Asp Asn Pro 
    210                 215                 220                 


His Ile Gln Lys Trp Met Tyr Gln Arg Gly Pro Leu Lys Gly Tyr Gly 
225                 230                 235                 240 


Ser Val Glu Glu Ala Val Ser Val Gln Gln Gly Gly His Cys Ile Phe 
                245                 250                 255     


Cys Lys Lys Glu Ile Thr Asn Tyr His His Ile Val Pro Gln Gly Lys 
            260                 265                 270         


Arg Gly Ser Asn Thr Ile Ser Asn Ile Ile Gly Leu Cys Ser Met His 
        275                 280                 285             


His Asp Leu Val His Lys Asp Ser Thr Trp Glu Gln Lys Leu Lys Thr 
    290                 295                 300                 


Lys Lys Gln Gly Met Asn Lys Lys Tyr Gly Ala Leu Ser Val Leu Asn 
305                 310                 315                 320 


Gln Ile Ile Pro Lys Leu Cys Asp Ser Leu Ser Ala Glu Phe Ser Glu 
                325                 330                 335     


His Phe Tyr Val Thr Asp Gly Arg Ser Thr Lys Ala Phe Arg Asp Ala 
            340                 345                 350         


Tyr Asn Ile Lys Lys Asp His Tyr Leu Asp Ala Phe Cys Ile Ala Cys 
        355                 360                 365             


Ser Ile Leu Ser Val Glu Asp Val Lys Val Pro Cys Glu Ser Asn Val 
    370                 375                 380                 


Phe Leu Ile Leu Gln Phe Arg Arg His Asp Arg Arg Ala Cys His Gln 
385                 390                 395                 400 


Glu Arg Val Asp Arg Lys Tyr Tyr Leu Asp Gly Lys Arg Val Ala Thr 
                405                 410                 415     


Asn Arg His Lys Ala Ile Glu Gln Met Asn Asp Ser Leu Glu Glu Tyr 
            420                 425                 430         


Val Thr Asn Gly Gly Cys Val Asp Lys Leu Thr Ala Pro Lys His Pro 
        435                 440                 445             


Pro Leu Tyr Arg Arg Lys Ser Arg Ile Met Pro Gly Thr Val Phe Leu 
    450                 455                 460                 


Val Gly Asn Lys Thr Lys Val Met Ile Ala Ser Gln Gly Thr His Asn 
465                 470                 475                 480 


Gly Val Pro Asn Tyr Tyr Arg Phe Thr Asn Gly Leu Arg Ala Thr Pro 
                485                 490                 495     


Lys Asn Cys Lys Pro Ile Tyr Gln Asn Thr Gly Ile Val Phe Val 
            500                 505                 510     


<210>  10
<211>  3062
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  10
atacatcccg gaaatccggg attgtgacga tttcgcgttt cccattatgt cctatattaa       60

attccttaag atgccaggca tcgctcttgg ggtcataaaa tacagataca aacggctaga      120

cggctatgta tatcacatgg caaatatttt catggaccag gatctgaaca tttggattat      180

tgactggaaa cactatccgg gaatcttctg gtcttataat aaaaattgga aggtatatga      240

ggtgattctg tgaaagtcaa cggcctaata cggtcaggtt gaaactttac tgcgtggtcc      300

tgccaggcgt aattatgata ggtcaacggc ctaatacggt caggttgaaa cgaattcgac      360

tatggcatcc atgaaggcaa caaagtcgtc aacggcctaa tacggtcagg ttgaaaccca      420

tgattcagaa aaggaaaaac tcaattaaat taacgctttt gttgtcggct tctcatttcg      480

attgtcgtcg atctccgatg atccaaaaat cccaggggat cgacgatccc gcgccgttgc      540

tgttttcatt gacagatttc aaccaattta cctcaaaggc tctgctcatt tttcggtact      600

tttccccaga tcgacgatgt ttcaatctga ccgtgttgct tgacctgctt gcggatcagg      660

ctacgttgac agctttagca gctttcacgc tgctacgtta tttcagttat cagacccggg      720

gatgcttctc cagttcccgg cactctggcg gcgctgtaaa agtcctggag gcagggacgg      780

tcaaccgcag gacgaccgga cgtttcaggc aagctggaat aacattggcg gggaggaatt      840

ttaccatacg aaagtatgag gtagtttatg ttagtgcatg ttttaagtaa agaaggaaag      900

cctttgatgc caacgcatcc agcgaaagcc aggaagttgc tgaagctagg taaggcaagg      960

cccgcgaagg cgaagacagg ctatttcact gtacaattga cttacgacac agcaagttat     1020

atccagcctg taacggttgg cgtggactta ggaagcaaca cagttcctat agctgccatt     1080

gccaacggta aggttgtcta tgccaaagag aaactactga ggcgtgatat atcagccaag     1140

ctaaaagcta ggggcgaata tagaagacag aggcgcggac gactgcgaca cagaaagccg     1200

cgttttgata acagagttaa gaagaaatgt gcgcggtgcg gtgtcaacaa tgttccgcgt     1260

acctggaaaa agattaagcg ccaaaacggc aagtcaaaga agagagtatt agttggcagg     1320

gctaatttat gtcgcaaatg tcaaggcgag aaaggcttgc accggcagcc gcatctacta     1380

ccaccctcgg tgaaggcgcg ggcagatgcc attttagcgg atattgagaa gttttgcaga     1440

agtctaccgg tcgctaagat agttgtagag atagcttatt tcgatacgca gaaaatggcg     1500

aaccctgata ttgaaggcgt tgaataccaa aaaggcacac ttgaaggcga agagataagg     1560

tcgtatgtgt tcaatgtatt caagcacaaa tgcgcttatt gcaaaggcgc tagtggggat     1620

aaaatacttg aaatagacca tgtgcgcccg aaaagcaaaa aaggcagcga taagttgagt     1680

aacctggtcg ctagttgcag gcaatgtaac atagcaaaag gcagcatgac gttagatcaa     1740

tgggctaaaa ggctacaagc aagtccttgc gagcttgata aaaaacgttt gtcaagcctg     1800

aagcacatca agaaacgcag tgatataaag aagggctttc aatatagtgc cttgactcag     1860

agttacaaga gctatctact tcacgaatta gctcagcgtc ataaggataa gcgtttctct     1920

acaacatacg gctatgccac caagtttgcc agaaaggcaa tggggcttga gaagtcacag     1980

ataaatgatg cgatggttat agcatccgaa ggcagaatgt tcccgacacc caagtattac     2040

ttattagagc gttgtctcaa aaaacgcagg gctgctgagt atataagccc gcataaagaa     2100

ggcacgccgg ttgttaggag gccttggtct aatgcgaagt atgggtttag gctatgggat     2160

aaggtggaag cagaagcgaa gcagggctat gtagctgcct tacgcgagag tggcagtttt     2220

agggttcata ccttgtatgg ggataaaata tttggtggaa agtcttacaa gaagcttagg     2280

ttgataactg agtgcaattc taactatatg cgcgagtgga agatggtaag ccaagagccc     2340

atgcaaatgg aactcacctt tcccgcatga tcataacggg agttcagaat gcaatacgct     2400

aatccaaagt taataggaag caagataata gactgtgtgc cgttgacggg tccggcgggc     2460

gaggtggtcc aggggtacat cacagcccgg cggctaagcg ggagcttccg ggtgggagca     2520

cttgacggtc gggagattac cgggggaaag agctacaaga aactaacgct tctaaagccg     2580

tgtcgcagca actatcaatg tcaagtcaac agcctggcat gtttagattg aaacgggagg     2640

ggaggtgaat cgtatgcact taaaaaagac tgagctgata ttgagtttct gctttggatt     2700

gtatatgact ctcaggtgag taaggatgat aaggatgtcg tggagaattg gctgtgggag     2760

aatccagatt actgcgacct agttgaattg tgaaggaggt gaatagaatg ggtgatataa     2820

tccgatacga gcatcatggt gcagaagtgt gcgttgatga ggacctgaag ggcaaacaca     2880

gaagtcactg cctttgcttc aggtgtgcca agttctgtcc tgagaatagg gaaaagagct     2940

gtccgagggc gaacttgctg tatgcgtact gtgtggcatt tgacatggta acaccggtat     3000

acgaatgtcc tgagtttgag gaggcagaat aatgagaagc ggtctccaca gctttatcgt     3060

ta                                                                    3062


<210>  11
<211>  500
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  11

Met Leu Val His Val Leu Ser Lys Glu Gly Lys Pro Leu Met Pro Thr 
1               5                   10                  15      


His Pro Ala Lys Ala Arg Lys Leu Leu Lys Leu Gly Lys Ala Arg Pro 
            20                  25                  30          


Ala Lys Ala Lys Thr Gly Tyr Phe Thr Val Gln Leu Thr Tyr Asp Thr 
        35                  40                  45              


Ala Ser Tyr Ile Gln Pro Val Thr Val Gly Val Asp Leu Gly Ser Asn 
    50                  55                  60                  


Thr Val Pro Ile Ala Ala Ile Ala Asn Gly Lys Val Val Tyr Ala Lys 
65                  70                  75                  80  


Glu Lys Leu Leu Arg Arg Asp Ile Ser Ala Lys Leu Lys Ala Arg Gly 
                85                  90                  95      


Glu Tyr Arg Arg Gln Arg Arg Gly Arg Leu Arg His Arg Lys Pro Arg 
            100                 105                 110         


Phe Asp Asn Arg Val Lys Lys Lys Cys Ala Arg Cys Gly Val Asn Asn 
        115                 120                 125             


Val Pro Arg Thr Trp Lys Lys Ile Lys Arg Gln Asn Gly Lys Ser Lys 
    130                 135                 140                 


Lys Arg Val Leu Val Gly Arg Ala Asn Leu Cys Arg Lys Cys Gln Gly 
145                 150                 155                 160 


Glu Lys Gly Leu His Arg Gln Pro His Leu Leu Pro Pro Ser Val Lys 
                165                 170                 175     


Ala Arg Ala Asp Ala Ile Leu Ala Asp Ile Glu Lys Phe Cys Arg Ser 
            180                 185                 190         


Leu Pro Val Ala Lys Ile Val Val Glu Ile Ala Tyr Phe Asp Thr Gln 
        195                 200                 205             


Lys Met Ala Asn Pro Asp Ile Glu Gly Val Glu Tyr Gln Lys Gly Thr 
    210                 215                 220                 


Leu Glu Gly Glu Glu Ile Arg Ser Tyr Val Phe Asn Val Phe Lys His 
225                 230                 235                 240 


Lys Cys Ala Tyr Cys Lys Gly Ala Ser Gly Asp Lys Ile Leu Glu Ile 
                245                 250                 255     


Asp His Val Arg Pro Lys Ser Lys Lys Gly Ser Asp Lys Leu Ser Asn 
            260                 265                 270         


Leu Val Ala Ser Cys Arg Gln Cys Asn Ile Ala Lys Gly Ser Met Thr 
        275                 280                 285             


Leu Asp Gln Trp Ala Lys Arg Leu Gln Ala Ser Pro Cys Glu Leu Asp 
    290                 295                 300                 


Lys Lys Arg Leu Ser Ser Leu Lys His Ile Lys Lys Arg Ser Asp Ile 
305                 310                 315                 320 


Lys Lys Gly Phe Gln Tyr Ser Ala Leu Thr Gln Ser Tyr Lys Ser Tyr 
                325                 330                 335     


Leu Leu His Glu Leu Ala Gln Arg His Lys Asp Lys Arg Phe Ser Thr 
            340                 345                 350         


Thr Tyr Gly Tyr Ala Thr Lys Phe Ala Arg Lys Ala Met Gly Leu Glu 
        355                 360                 365             


Lys Ser Gln Ile Asn Asp Ala Met Val Ile Ala Ser Glu Gly Arg Met 
    370                 375                 380                 


Phe Pro Thr Pro Lys Tyr Tyr Leu Leu Glu Arg Cys Leu Lys Lys Arg 
385                 390                 395                 400 


Arg Ala Ala Glu Tyr Ile Ser Pro His Lys Glu Gly Thr Pro Val Val 
                405                 410                 415     


Arg Arg Pro Trp Ser Asn Ala Lys Tyr Gly Phe Arg Leu Trp Asp Lys 
            420                 425                 430         


Val Glu Ala Glu Ala Lys Gln Gly Tyr Val Ala Ala Leu Arg Glu Ser 
        435                 440                 445             


Gly Ser Phe Arg Val His Thr Leu Tyr Gly Asp Lys Ile Phe Gly Gly 
    450                 455                 460                 


Lys Ser Tyr Lys Lys Leu Arg Leu Ile Thr Glu Cys Asn Ser Asn Tyr 
465                 470                 475                 480 


Met Arg Glu Trp Lys Met Val Ser Gln Glu Pro Met Gln Met Glu Leu 
                485                 490                 495     


Thr Phe Pro Ala 
            500 


<210>  12
<211>  3431
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  12
ctaccccgca aggaaggagc cgccgaccga tgtcttgtta ggaaaactaa tccttcggaa       60

attagtcgag aaggggctcg cccccccaaa agccaacctg tcgtggcaat gcatcgaaag      120

cctgaccatc accccgtgga tatgggatcg cttcgccgag cacgggatcg aagtcgcgct      180

cggcgagatg gtcaggcggc ggggcgaaga gaaatcaaaa aaaaatcgaa cggggggttg      240

acttttggat aatagccttc ataatctcgc ccatcgatag ccgagcaacg aagttggatc      300

aggacgtcat tcttcgaatt cgaaaacgcg gggattgctc gaatgggtgg atgatatgtt      360

ggatcaggac gtcacttttc gaattcgaaa acgcaagtcg aagttggcga cttgatcaaa      420

tagttggatc aggacgtcac ttttcgaatt cgaaaactaa agaccgctcg gctgatactt      480

gacggaagtt ggatcaggac gtcacttttc gaattcgaaa accgatctcg cctaagtgat      540

tgattcagcg aagaaattgt agaatcgcag ttgcaaaaaa tgatgtcctg atccaataag      600

gggcgcagcc cccgcaccat ttttccccga tctacggatc ggatcggcgc tgccttcgaa      660

aagccgacat ggaaaacccc cagtttatcg gggacgatgt aaacctgagt cgccacggcg      720

ctcatctcgc ccgctctacg gaccgggctt cgctggaaac tgggagcggt aaccatgaaa      780

caactcacac tcgatgtcgg catcgcttcg atcggctggg cgattgtcag taaaaaaggg      840

aacgtcaaag ccggatcacg catctttccc gacgccaaat ctgggcggga aagcaaccaa      900

tcccgacggg cggctcggct gatgcgccga ggatatcggc gcaaagcaaa gcgccgagcc      960

gacacgctcg ccatcattcg ctccatccac cccggcttcg accccgaagg gcaccctgac     1020

atcgagcgcg aagctctgat caaagccatc attacccccg gcgcccccac cccatcgctc     1080

gaccaactcg cttgcgcgtt ccagcgattc gccaagtctc ggtggccgca atattcgaga     1140

actctcccaa aacgaaccga gcgggaagac cagttcatcc aagtttgggt gatcgccgag     1200

cgaacctatc ccgaccgctt tacccccgac gtcgcaaccc gcttgcttca ggcgatattc     1260

tttcagcgcc ctatcaaaga cggcgaccgc gcgaaatgtc aactattcag gcatcacggt     1320

gacaaagccc cgctcgtcgg ctggacgcac gaacccgaac tccaacgatt cgccatcttg     1380

tccgatctgt ccaatctcac tatcgggatc ggctcgactg ataacctgct ttgcgaatat     1440

cccgacatca tcgaagactt ggagacccga tgcttcgaga ccggcatgag ctggagagag     1500

atagccgaac acgtcaagga agtcatcggc aaaggggtgg tgtttcgagg cattgacggg     1560

cagaaaaaag tcgggcggaa cgggatcggg cctgccaaac tcgaaacaat cgacgaagaa     1620

ggcaactcta ccaagagcac cgcttcgatg tcggtcgaag cggcggtaat gatctatcac     1680

caaatgaaag cggatcgatg ccgagcggca accgctaaaa aaacgctgat cgacgcaggg     1740

gctctttcag cccccctaac cgccaaagat atcaaacgtg gcgatcgcac tctcaccatc     1800

accgaactga tggatatggc gggcaggatt accgacccga ctatcagggc gatctaccac     1860

caagtcgaga tgttggtcaa cgagttgatc gcccgcttcg gtaaacccga acgcatcgtt     1920

atcgaagccc aaaaggaaat cgggcgaagc atcgaagaca tcgagaaagc gatggcaaga     1980

gagcgagaaa agcatatcga acggcaacga gagaatcgag cccgcaacgc cgcaatgggt     2040

accaaagccc gctttgcccg cctgtgcgcc atcaggggtg atcgatgctt tatcagcgac     2100

cgacccgcag ccgaagtcgg ccacctgatc gccgattcga tcgggggcac gcttgaaatg     2160

gctaacctga tcccgatcga ccctgccatc aataaagaga tgggcaaccg cactccctac     2220

gaagcctttc gaaagactga gtattggtcg atcatccagc gcaaactgca agcgcttgaa     2280

gacgaagtga aggctttgaa accgccgaaa gggacaaaag gaacggcgtg gacgatctat     2340

caccgagcca agcatcagtt tgattttttc gcttggcgat ttcaatcgaa tgcgagagaa     2400

acccatcaaa ggaactttcg ccccggctcg cttgacgacc tgcggtggat cgaaaacttg     2460

ctctttctcg gcgttgcccc gatttgcgac aatatccgaa tcgtcagtgg gcgaacaacc     2520

gagcgcattc gccgagagat acttggaatg gacaaagacc gccgagacca tcgccaccat     2580

gcgctcgatg cgcttgctat catgctcgcc aatcctttga agccgtggga tttgaaatcg     2640

agcaattcgc tcggtatccc gctcggccga atcaaacaag ccttcgccga cgctgtcgtc     2700

tcgcaaaagc aagaccactc gcttcgcact gcgttgcata aagagaacgc gatcccgaag     2760

accaagcggg gggctgcata tcgaaaaatc ggaacggggg cgagcgaacg cgtagtcgac     2820

acccaatcaa aggcctattg cgaagtttgg gcgttgccta acgggaagtg ggaagcggtc     2880

gtagtgtcga gtttcgacgc tgcgcaaaag aactaccggc aagggattga ccatcgcccc     2940

catcccgccg ctcgactggt tatgcgcttg ttcaaatccg atttgctcgg tatcgggggc     3000

aagatatacc gggttcagga actactcggt tctggctcga tctacttggt cgatcatcga     3060

ttcgctggca ctattcgaga cgcccgcgca gtttgtaaga cgggcgtcaa tgtcgatttc     3120

tttagcaaag ggggtgattc attgcgcaag gcgggggctc gtcttgtttc gattcgtaag     3180

agttgggtgg gatcgtgagt cacttttcga attcggaaac ttttccgacc gacaagtact     3240

cgatcatcta gttggatcag gacgtcatgc ttcgaattcg aaaacgtcag tcgggtgacc     3300

tacgagcgat agggggttgg atcaggacgt cacttttcga attcgaaaac cattcgatgc     3360

cgatgcctac accgcaaaga gttggatcag gacgtcatgc ttcgaattcg aaaaccttcg     3420

tttcaataac c                                                          3431


<210>  13
<211>  807
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  13

Met Lys Gln Leu Thr Leu Asp Val Gly Ile Ala Ser Ile Gly Trp Ala 
1               5                   10                  15      


Ile Val Ser Lys Lys Gly Asn Val Lys Ala Gly Ser Arg Ile Phe Pro 
            20                  25                  30          


Asp Ala Lys Ser Gly Arg Glu Ser Asn Gln Ser Arg Arg Ala Ala Arg 
        35                  40                  45              


Leu Met Arg Arg Gly Tyr Arg Arg Lys Ala Lys Arg Arg Ala Asp Thr 
    50                  55                  60                  


Leu Ala Ile Ile Arg Ser Ile His Pro Gly Phe Asp Pro Glu Gly His 
65                  70                  75                  80  


Pro Asp Ile Glu Arg Glu Ala Leu Ile Lys Ala Ile Ile Thr Pro Gly 
                85                  90                  95      


Ala Pro Thr Pro Ser Leu Asp Gln Leu Ala Cys Ala Phe Gln Arg Phe 
            100                 105                 110         


Ala Lys Ser Arg Trp Pro Gln Tyr Ser Arg Thr Leu Pro Lys Arg Thr 
        115                 120                 125             


Glu Arg Glu Asp Gln Phe Ile Gln Val Trp Val Ile Ala Glu Arg Thr 
    130                 135                 140                 


Tyr Pro Asp Arg Phe Thr Pro Asp Val Ala Thr Arg Leu Leu Gln Ala 
145                 150                 155                 160 


Ile Phe Phe Gln Arg Pro Ile Lys Asp Gly Asp Arg Ala Lys Cys Gln 
                165                 170                 175     


Leu Phe Arg His His Gly Asp Lys Ala Pro Leu Val Gly Trp Thr His 
            180                 185                 190         


Glu Pro Glu Leu Gln Arg Phe Ala Ile Leu Ser Asp Leu Ser Asn Leu 
        195                 200                 205             


Thr Ile Gly Ile Gly Ser Thr Asp Asn Leu Leu Cys Glu Tyr Pro Asp 
    210                 215                 220                 


Ile Ile Glu Asp Leu Glu Thr Arg Cys Phe Glu Thr Gly Met Ser Trp 
225                 230                 235                 240 


Arg Glu Ile Ala Glu His Val Lys Glu Val Ile Gly Lys Gly Val Val 
                245                 250                 255     


Phe Arg Gly Ile Asp Gly Gln Lys Lys Val Gly Arg Asn Gly Ile Gly 
            260                 265                 270         


Pro Ala Lys Leu Glu Thr Ile Asp Glu Glu Gly Asn Ser Thr Lys Ser 
        275                 280                 285             


Thr Ala Ser Met Ser Val Glu Ala Ala Val Met Ile Tyr His Gln Met 
    290                 295                 300                 


Lys Ala Asp Arg Cys Arg Ala Ala Thr Ala Lys Lys Thr Leu Ile Asp 
305                 310                 315                 320 


Ala Gly Ala Leu Ser Ala Pro Leu Thr Ala Lys Asp Ile Lys Arg Gly 
                325                 330                 335     


Asp Arg Thr Leu Thr Ile Thr Glu Leu Met Asp Met Ala Gly Arg Ile 
            340                 345                 350         


Thr Asp Pro Thr Ile Arg Ala Ile Tyr His Gln Val Glu Met Leu Val 
        355                 360                 365             


Asn Glu Leu Ile Ala Arg Phe Gly Lys Pro Glu Arg Ile Val Ile Glu 
    370                 375                 380                 


Ala Gln Lys Glu Ile Gly Arg Ser Ile Glu Asp Ile Glu Lys Ala Met 
385                 390                 395                 400 


Ala Arg Glu Arg Glu Lys His Ile Glu Arg Gln Arg Glu Asn Arg Ala 
                405                 410                 415     


Arg Asn Ala Ala Met Gly Thr Lys Ala Arg Phe Ala Arg Leu Cys Ala 
            420                 425                 430         


Ile Arg Gly Asp Arg Cys Phe Ile Ser Asp Arg Pro Ala Ala Glu Val 
        435                 440                 445             


Gly His Leu Ile Ala Asp Ser Ile Gly Gly Thr Leu Glu Met Ala Asn 
    450                 455                 460                 


Leu Ile Pro Ile Asp Pro Ala Ile Asn Lys Glu Met Gly Asn Arg Thr 
465                 470                 475                 480 


Pro Tyr Glu Ala Phe Arg Lys Thr Glu Tyr Trp Ser Ile Ile Gln Arg 
                485                 490                 495     


Lys Leu Gln Ala Leu Glu Asp Glu Val Lys Ala Leu Lys Pro Pro Lys 
            500                 505                 510         


Gly Thr Lys Gly Thr Ala Trp Thr Ile Tyr His Arg Ala Lys His Gln 
        515                 520                 525             


Phe Asp Phe Phe Ala Trp Arg Phe Gln Ser Asn Ala Arg Glu Thr His 
    530                 535                 540                 


Gln Arg Asn Phe Arg Pro Gly Ser Leu Asp Asp Leu Arg Trp Ile Glu 
545                 550                 555                 560 


Asn Leu Leu Phe Leu Gly Val Ala Pro Ile Cys Asp Asn Ile Arg Ile 
                565                 570                 575     


Val Ser Gly Arg Thr Thr Glu Arg Ile Arg Arg Glu Ile Leu Gly Met 
            580                 585                 590         


Asp Lys Asp Arg Arg Asp His Arg His His Ala Leu Asp Ala Leu Ala 
        595                 600                 605             


Ile Met Leu Ala Asn Pro Leu Lys Pro Trp Asp Leu Lys Ser Ser Asn 
    610                 615                 620                 


Ser Leu Gly Ile Pro Leu Gly Arg Ile Lys Gln Ala Phe Ala Asp Ala 
625                 630                 635                 640 


Val Val Ser Gln Lys Gln Asp His Ser Leu Arg Thr Ala Leu His Lys 
                645                 650                 655     


Glu Asn Ala Ile Pro Lys Thr Lys Arg Gly Ala Ala Tyr Arg Lys Ile 
            660                 665                 670         


Gly Thr Gly Ala Ser Glu Arg Val Val Asp Thr Gln Ser Lys Ala Tyr 
        675                 680                 685             


Cys Glu Val Trp Ala Leu Pro Asn Gly Lys Trp Glu Ala Val Val Val 
    690                 695                 700                 


Ser Ser Phe Asp Ala Ala Gln Lys Asn Tyr Arg Gln Gly Ile Asp His 
705                 710                 715                 720 


Arg Pro His Pro Ala Ala Arg Leu Val Met Arg Leu Phe Lys Ser Asp 
                725                 730                 735     


Leu Leu Gly Ile Gly Gly Lys Ile Tyr Arg Val Gln Glu Leu Leu Gly 
            740                 745                 750         


Ser Gly Ser Ile Tyr Leu Val Asp His Arg Phe Ala Gly Thr Ile Arg 
        755                 760                 765             


Asp Ala Arg Ala Val Cys Lys Thr Gly Val Asn Val Asp Phe Phe Ser 
    770                 775                 780                 


Lys Gly Gly Asp Ser Leu Arg Lys Ala Gly Ala Arg Leu Val Ser Ile 
785                 790                 795                 800 


Arg Lys Ser Trp Val Gly Ser 
                805         


<210>  14
<211>  10603
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  14
attgcaggat acgatactgt tttttataac tcgtcaaatc tggataaact gattgagatt       60

gcttcaagag aaagtcgcac aattttaacc agaaatagcg gtttcataaa aaaagagaga      120

aaggttttag aggaaagagg tatcccacat cttttattgg catctgacag tgttgaagaa      180

cagcttaaag agacagtaga ttattttaaa cttgacctga aggagagttc ttttacactc      240

tgtgtggatt gtaataagcc gttttttaaa atatcaaaag aggatgcttt gggtaaggtt      300

cctgagtttg tttatcaaaa ttacaatgaa tttgtccaat gcacaaactg taaaaagata      360

ttctgggaag ggacacatag ggaaaggatg gaggaaatgc ttaaaagagt gtattgataa      420

actaatgaaa ggttatattg tttatataac ctatctttcc gttttgtcca agacggtaga      480

aatgccttat atcttttaat aaagaggaga tatcattcct ttttaaataa tttgtttcta      540

aatagtgaca gtagttcatt gccattctgt ttgcatccct gtatctttct ttttcctctg      600

ataataaatc cggatttagt atagaatttt caaacaactg tctctttata ctttttgcag      660

ggactctgcc ggaattaata ctgctgctgt agaaaatggc tgtgatatat ttatcaacct      720

cagcctgaag ctccatctca aagatagtta tctgtctttt atggatggca ttccatactg      780

catatatgaa atggcttatc tcctctgtgg caagaaaaaa atctgaaaga tttattttgc      840

agaaatcttt ttgcagatga atatttttta gattagatag tgtctctctg ccgagatata      900

cagctatttt gagttcatcg tcattctggc ataaaaatag ataacctttt gatgaggggc      960

ttttaatcct acaggatttt tgattgtttg aattattaga aagcgggtca tttatgaggt     1020

aattttctat attaaggaat gtctccactt tatatatttt ttctattatc ttctgtatcc     1080

gggataacat aaaatctatt cgcccgctcc ctctttggga gggagtgggt tggggatgag     1140

agggaataaa atatgcctac tgtatatact tgctgccttt ggatattata ggaataaccc     1200

ctttttcttt caggagttgg gcatccctat cactgtttgt cttgagccac ctttcatata     1260

ttctcaataa atcgctgttt gtgttgatgg atgccttatc gctgacctct gcaattacat     1320

ctatgaattc tttaaatctc tctgatagct cactgaagag ctcactgaaa agctccccat     1380

ttccgtatct cttggtaata tatgccaggt tgctgtaggc tctccctccg ataactatat     1440

aataatcaat atcaataagt tttttcttaa ggctgtcgga gaaaaagccc gatacaaaca     1500

gtgaaaaatc tccaacctgc ttaaatttcc ttatcctctc attgtattct gaatttatgg     1560

cactttctaa aagcattata agaggctcat cagatgcctc ttcagggaat gatttatcaa     1620

tttcaacata ctcagagaga agattcacta aatagaatgc cactgtctca tcagtgtcta     1680

tatcctgatg gtctatagcc gtatccacca gttccttgaa gaattcagca gtgtttttat     1740

gtgttgttat tttggctgcc atatattttc tctaatttaa aattatgcca tactttttaa     1800

gccattcagt ttcgtcaaat ttatccgaaa tatattcttc aaaggcagcc tcattttttg     1860

taagtattct gccgtcaatg taatctcctt ttggttttga ttcatttgga ttatcttcta     1920

atgatttctc cgctgtattt atgaagtcgt ttaccttgat tatcaaggca agcattgcag     1980

atggttctat catattcctt tttttaaggt caaagagaat gcccagcaca tctttagatt     2040

ttaggagcgc ctttgtaatt gcccttgaga gcccatcaag ctctttgttg tctttttcat     2100

tatctctcat tatatcctcc tgattattta tcggtattat ttttaataat ataattaagc     2160

aaaaaatata ccatgattat cacttaattt taattcattg aaaaagaatg agatagctga     2220

tatattgaaa taaataaggc tggaaattat gtcctgattg tgtcaaattt atccagattt     2280

aagaaaaaat gttatgatat tcgccaatga aaaaatacgg tgagaaagca attgttagtg     2340

caaaattaga agcgtgggat aatccctatc cggataggga ttataagatt gaaataagtt     2400

tcccggagtt tacatgcctc tgccccagtt caggctatcc tgattttgct acattcgcca     2460

taaattacat ccccgataag tgtataatag aactcaaatc actgaaacta tatctaaaca     2520

gttttagaaa tcagtatatc tcacatgagg cggtaacaaa taaaatctat gatgctttaa     2580

ataagacttt gaaaccgcgt tttatagagg tagtggggga ttttaatgaa agggggaatg     2640

taaagacagt tataagagtt acatcaaaac agtagtctaa tagtctaaaa gcctaagagt     2700

ctaagagttg tttcctttta gactcttaga ctgttagact gatttatttg gttgactcct     2760

caaggtctaa catcattgta ttgccgacat atttcttact gattaccctt tttatccttt     2820

gtagtttctt tgtgacctct tttattctca ttgcactctg aattgcagta ttaagcatct     2880

cctttatatc ttcattattt gtattcatca gggctaattc aatactgctg acaatgcctg     2940

tcaggggatt atttatctca tggtttaaag ttaccgctaa ctgagaaagt agagattctt     3000

tatccttaga aagtatagta agcgaattaa tagaatttga tgttaataaa tttaattcac     3060

ttgctaattg ttttgacata atgttgacca ccctctaaaa aatttactaa tttatgttgc     3120

acttactatg ccataatcta ttattatgct aacatattga tattataagt gttttgactt     3180

ttgaataccc aaaatctcta ttgtaaatat gtcaaagtta tgacttaatg ggaaaaatat     3240

tccgattaaa tctatcatgc cttaatatct ttatcggatt tatctaagtt tttataaagt     3300

ctttttttag gcgggctatt ctatatagag attcctcaat cctctcttct tttatctcac     3360

cactctctaa tgtccttaat atagaattga acatctctat ctgtttatta tggctatggc     3420

agattaagag tatatcaacc cctgctttaa cagactgaat tgcagaaatt ttatcgctgt     3480

aattatttga gattgccttc atttccaaat catctgtaat tactacatca ttaaacccaa     3540

gtttttttct taaaatctca tttattattt tctctgacat agtagcagga taatcaggag     3600

ctaaagcagg gtaaagaaca tgggctgtca taattgccct taagcccatt ttaaatgtat     3660

taataaaggg tttaagctct atctcctcta atctttctat tttgtgattt acgataggaa     3720

gttcaaggtg ggaatcagag aaagtatctc catgtccggg gaaatgttta cctacaggaa     3780

ttatattttt ttcactaaat gcctttatga catgcccacc gagtctggat acaacatcag     3840

ggtcggaact gaaagacctg tcgccaatga tagggttttt gggatttgta tttacatcaa     3900

gtacaggtga catattcata tttacaccta tttctataag ctcatctgcc attaaaaggg     3960

catttgacct tactgcatct tctgaacctg ctcttcccaa ctcgccataa gaaggatact     4020

gtctgaaagg aggtttgagc ctgaacactc ttccaccttc ctgatcaatt gcgattaata     4080

gagggatttt atggctttta attgataatt cctgaaggga attgcataaa tctttaacct     4140

gtgacggaga ctctatattt cttgaaaata ggattacccc cccgacccca taatcagtta     4200

ttattccttt taactcatct gacatagtag tcccatgaaa tcccaccata aacatctgcc     4260

cgattttttc ttttaaagat attattgaca tggtatcaat ctaaaataac aaggagtgat     4320

tttgcaatca taatcttaca ggatttggta ttgacaagta aatagattta cctataatga     4380

agatgaaatt attgaacctt aataaaacct gacaatgcaa ggacaaaaac ttggtataga     4440

tttaggcggt aagcatgtcg gtcttgctgt tgtaagaaca ccgataaacg aggtggcaca     4500

ttactgcact attgaactca gagaagacat taaggataag atggatgaga ggaggtctct     4560

tcggagggcg aggagaaaca ggctctggca tagggaagcg aggtttgaca ataggcaatt     4620

aagggtgaaa tgcaaatata ttgataaaga tacaggcgaa atctgcggag ctaatactcc     4680

aaagaaatcc aatgtaaaac atcttctact tgagaatata ctcgtcaatc ttaaaatagc     4740

tgatgaatct aaagaggaaa tcagaagaag agggctggac agagacacaa acaaaagtga     4800

attacagaca atccttgaga aattttcaat aaataccttc ctgaaaaaac agattaaaga     4860

catcattctt gaaaaggggg aagggagggc tgtcttttgc agagagcata tcccctttca     4920

ttatgaacag gttgcaacag aggctgagag tttctggctg tcaaattcaa taagggctaa     4980

acaggaccag atactctccc gccttaaaag aatagcaaag gattttaaga tagatgaggt     5040

ggttattgaa agggcgaact ttgatttgca aaagctccag agacctgatg agatagaagc     5100

acctgaagat tacatgaagg gtcctaactt cgggcacaga aacaggtttg aggcattgaa     5160

gcaggaatat ggcaaccgat gctgtttctg cggaaagaag ggtggagatg aagtaaagct     5220

gaagataggg catctctatc cgaaggctaa agatgagata aacaggtggg aaaaccttat     5280

aactatatgt gaaaaatgta atgcgaagca gggtaaaagg acaccagagg aggcagggat     5340

ggaatttgta attgtaaagg agaaggtttt taatcctgca gcaggaaggg taatacccat     5400

aaaaagagaa ctcaagccga agcccataaa tgaatcaaag gttaataaat atatgaccca     5460

tactgatatt ggcataagga ggctcaaaag agaaatccag aatatttttg gaagcatacc     5520

tataagagaa acatacggct atatcacatc gtattttaga aataaatggg agcttgaaaa     5580

agaacattat aatgatgctg tagtcatagc ctctgacaaa gaagatttga atataaaacc     5640

tgtatttaaa gatgcagtcc ctcagacaat taaatcatct atcaagggcg ggaaactctt     5700

tgatacaaat cccctccagt ttagtgatgg aaagttttac cagaacataa cccttatagg     5760

cagaaaggca gggatgcgtt catcaaaaca taaaaggggt cagaggaata tcaggaacta     5820

tggctcaatt tatatggatg agattgaact tataacctca gaatggaaga aaaaggttct     5880

ctgcgaatta agagataaac ttggttatgt aaaaggagat aagaataagt cttttaagcc     5940

tgaggaactg atgaatgcaa atctgccttt caggactgta actattgaca aaaggggtgt     6000

aggagaatct tcaacccgct taatcaataa caatgtattc cgtgcctcag ctgaagtaaa     6060

tacgcatata atggtctatt caaataatga cggtagaatg aaggcatttg cagtaaaaaa     6120

tcctaagata tttaaagatg ccggactccc tcatgatttt caaaaaaaga tattcattgt     6180

aaaaaagggg gatattgtta catggaaaaa aagtgaagat ggaattgccg taacaggcag     6240

ggtgaccaaa tgtttgacaa aaaatggggt aattgatata aaggacatga ataataaaat     6300

acactcaggg aaaaaccctg tgtatattga aaagatagta tctcctgaaa ggggtgctat     6360

ttttgagaga aaatctcttt ctgctctttg aaaattagat attaaagatt gaaaacagcc     6420

tgagtgttga aacagacact aagttgttgg gaacaggtaa agaactacgg cggggtatct     6480

tgaatggtta ccagctccgc cctcttgtag tttttgagta ggaagactcg cccctttggg     6540

gaaggaaaat ggtcggtaag ccactgataa ggccatctac aactcataga catgccctgt     6600

ccgacaacct tggcaaggga aaccattaaa ttatctaata tcatatttta atctaaatac     6660

agtggcatta aaaccagaat tagtaattga gaagatttca atctaacaca gtggcattaa     6720

aactacgacc tatcctaatt attgatacag cgaaacaata tttcaatcta acacagtggc     6780

attaaaaccc ataaaaaggg ctatctttta taatgggtta tcatttgaag ggggagtttt     6840

accctatcaa cctataataa tgacttcctc ctatgcaatg ggaatgttat aattataaaa     6900

atatcaggag attaatatga caaacacatt ggaaatagac agagatatat ataatatact     6960

tattaattcc tttggtgaaa atacgctgag ggaaaaaata gatgatattc tcctgtccgc     7020

gatggatagc ttgctggaaa aatacactcg taacatattg gtatttgaag aaaagtatgg     7080

ggtctctttt aaagaatttg aaaaaatgtg ggatgaaggg aaaattgata ataaacataa     7140

ccacgaaata gaaggggatt ttattgattg ggaaatgtta gagatggaaa aaaaagagtt     7200

gttatcagca ctgtccagac tcaaaggctt taaaaaatga acaaccccaa tgttgatgaa     7260

ttcctatcat cactaaagac ctttctaaag aactatttta cacaatataa aattgatttc     7320

ttgataaaaa caccgaaatc ccttaaagcc aatattcatc ttaatgagaa attttttatt     7380

gcagttcgat ataatgccag aaatgggaga atggactttg ctttaataca ggataacaaa     7440

agaatttttg gatatgataa tttaaaggaa tggcactatc acccttataa aaacccatca     7500

gagcatatct catgcgataa accatctaca gataaaatct tatatgaaat taaaaaagtt     7560

tttgaagatg ctaaatgagg catgaaaaaa cgaaggagaa acctcataga atgaaaaaaa     7620

taataattgc aattacaggt gcgagtggtg ctgtatatgc aaaatacctt tttgatttcc     7680

tttgcaaaaa aggtattgac ctgcatatta tcatctcaga aaatgcaaaa ggaatattaa     7740

aggatgagac agggataggc gagaattatt ttaaaaagaa gaaagtatca atatatgaaa     7800

attcaaatct aaatgtccgg atagcaagcg gctcttttaa atttgatggt atggtcgtca     7860

tccctgcaag tatggggact cttgggagaa ttgcgaatgg ttattccaat aatctaataa     7920

gcagggttgc tgatgttgca ttaaaagaga gaaggaagct gataatagtc ccacgtgaga     7980

cccccttaaa tgatattcat ataaaaaata tgctcacctt aagccgtgcc ggtgcagtaa     8040

tactccctgc atcacctgcc ttctatcata agccgaaagg tattgacgat atagcaaaat     8100

ttatcacagc aagaattctc aatcagcttg atatagataa tgacctcatc cctccatatg     8160

caagagaggg ttaagataat actattttac ctttgcaccc accttgacat ccccatcaaa     8220

tccaataaga accaacctcc cgtctccacc tgctgccaat accatcccct gcgattccgt     8280

gcccatcagt tttgcagact taaggtttgt caccagaaca attttttttc cgattaattc     8340

ttcaggtgta taactttcag caatcccagc aacaacttgt ctttcctctg tccctatatc     8400

tactttgagt tttaggagtt tttttgattt ctcaatcttc tctgcctgtt ttatctcacc     8460

aactctcaaa tcaacctttg caaattcatc aatacttatt aactgcctct cctcaactgc     8520

ccctgccgat tctgttactg ttgctgttac tttatcacca atttccattt tctcctccac     8580

cctcggaaaa agctgtctcc ctattttaat ttcaatgcct gatttaattc caccccattt     8640

tattgattct ttaaaatcat aattttctat tctatcctta atccccaatt gattccatat     8700

ttcctgccct gtctctggca taaatggata tatatagaca gcaataatac gcagactttc     8760

agctaaagta tataaggtat ttgatagtgt cccctcatcc ttttcactcc acggtgcaga     8820

cttttgagca tattcattca tatctccgat gatcttccaa atgtaagaaa gtgcatgagg     8880

aaattcgagt ttatacataa aactgtcaaa actgtttgca gcataactct caaaaccttt     8940

tagagcagga tcgtatttaa aataaccctg aatattctgc tccatacccc tatttttagc     9000

atcaatttca ggtggaatct taccctccct atattttttt atcatattga gagtcctgct     9060

taatagattg cccaagtcat ttgcaaggtc actgtttatc ctcccaatca atgccctctg     9120

ggaaaaatca ccgtcaagtc caaagggaac ttctctcatc aaaaaatacc tgaacgcatc     9180

taccccatat ttatctatca attcatgcgg atttacaaaa ttccctctgg actttgacat     9240

cttctcaccg ttcacagtcc accagccgtg tgcaaatata ttttccggga gtggcaagtc     9300

caatgccttg agcattgttg accagtaaac tgaatgggtg gtgagtatat ctttccccag     9360

aaggtgttgg tctgcgggcc accactcttt gtcattgggg gcaagatatt ttgtagcaga     9420

atagtaatta acgagtgcat caaaccagac ataggtaaca tagttctcat taaagggcag     9480

tggaataccc catgaaagcc tctgctttgg tcttgagata cagaggtctc ccagtatgtt     9540

attttttaga aaaccaagaa cctcattttt acgggcttca ggaagtatat aattagaatt     9600

tttttcaata tattctttta aatcttcttg atactttgac atcagaaaaa agtaattttc     9660

ttcttctatg tgttccacag gacggccgca gtcggggcag ttgccacctt ttatttcctt     9720

ctctgtccag aacctctcat cagggataca ataccatccc aaataactcc ttttttctat     9780

cttcctttca tcaaataact tctgcaaaat atcctgcaca atcttaatat gcccttcatc     9840

agttgtgcgt ataaaggcat tgtttgagat gttgagtctt ttccagagat ttttaaaatt     9900

ctctaccatt aaatccgcat gttcttttgg tttaagaccc ctatctctgg ctgccttatc     9960

taccttctgt ccatgctcat ctgtccctgt cagaaaaaac acctgatagt cgcagagcct    10020

tttatatcgt gccataacat ccgccgctac agtggtatag gcatgcccta tgtgtggaat    10080

atcatttaca tagtagatag gtgtggttat atagaacttt ttcatcgtaa aattataaac    10140

tccaaatccc aaattacaaa taatatccaa tatccaaaat tctaaataat ttggtcattg    10200

tgatttggtt attatttgat tattgatatt tgggtattgg aatctgtttt atcccgtcaa    10260

ctgtaatctc ttaattccct ttgcttgtcc tgaattttca tctatatcaa ttacaacagc    10320

attaaattga ccttcaccgc ccgccatctc aaacttcttt ggcatctttg taagaaactt    10380

ctctaatgca agttcctttc taattcctat gaccgagttg gatggccctg tcattcccac    10440

atcggtaata taagcagtac cgttaggcaa tatcttttca tctgcagtct ggacatgtgt    10500

atgtgtcccg attacagcac ttaccctccc gtccagatac cagcccattg caattttttc    10560

tgatgtcgcc tcggcatgca tatcaactat tgtaaccttt atc                      10603


<210>  15
<211>  658
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  15

Met Gln Gly Gln Lys Leu Gly Ile Asp Leu Gly Gly Lys His Val Gly 
1               5                   10                  15      


Leu Ala Val Val Arg Thr Pro Ile Asn Glu Val Ala His Tyr Cys Thr 
            20                  25                  30          


Ile Glu Leu Arg Glu Asp Ile Lys Asp Lys Met Asp Glu Arg Arg Ser 
        35                  40                  45              


Leu Arg Arg Ala Arg Arg Asn Arg Leu Trp His Arg Glu Ala Arg Phe 
    50                  55                  60                  


Asp Asn Arg Gln Leu Arg Val Lys Cys Lys Tyr Ile Asp Lys Asp Thr 
65                  70                  75                  80  


Gly Glu Ile Cys Gly Ala Asn Thr Pro Lys Lys Ser Asn Val Lys His 
                85                  90                  95      


Leu Leu Leu Glu Asn Ile Leu Val Asn Leu Lys Ile Ala Asp Glu Ser 
            100                 105                 110         


Lys Glu Glu Ile Arg Arg Arg Gly Leu Asp Arg Asp Thr Asn Lys Ser 
        115                 120                 125             


Glu Leu Gln Thr Ile Leu Glu Lys Phe Ser Ile Asn Thr Phe Leu Lys 
    130                 135                 140                 


Lys Gln Ile Lys Asp Ile Ile Leu Glu Lys Gly Glu Gly Arg Ala Val 
145                 150                 155                 160 


Phe Cys Arg Glu His Ile Pro Phe His Tyr Glu Gln Val Ala Thr Glu 
                165                 170                 175     


Ala Glu Ser Phe Trp Leu Ser Asn Ser Ile Arg Ala Lys Gln Asp Gln 
            180                 185                 190         


Ile Leu Ser Arg Leu Lys Arg Ile Ala Lys Asp Phe Lys Ile Asp Glu 
        195                 200                 205             


Val Val Ile Glu Arg Ala Asn Phe Asp Leu Gln Lys Leu Gln Arg Pro 
    210                 215                 220                 


Asp Glu Ile Glu Ala Pro Glu Asp Tyr Met Lys Gly Pro Asn Phe Gly 
225                 230                 235                 240 


His Arg Asn Arg Phe Glu Ala Leu Lys Gln Glu Tyr Gly Asn Arg Cys 
                245                 250                 255     


Cys Phe Cys Gly Lys Lys Gly Gly Asp Glu Val Lys Leu Lys Ile Gly 
            260                 265                 270         


His Leu Tyr Pro Lys Ala Lys Asp Glu Ile Asn Arg Trp Glu Asn Leu 
        275                 280                 285             


Ile Thr Ile Cys Glu Lys Cys Asn Ala Lys Gln Gly Lys Arg Thr Pro 
    290                 295                 300                 


Glu Glu Ala Gly Met Glu Phe Val Ile Val Lys Glu Lys Val Phe Asn 
305                 310                 315                 320 


Pro Ala Ala Gly Arg Val Ile Pro Ile Lys Arg Glu Leu Lys Pro Lys 
                325                 330                 335     


Pro Ile Asn Glu Ser Lys Val Asn Lys Tyr Met Thr His Thr Asp Ile 
            340                 345                 350         


Gly Ile Arg Arg Leu Lys Arg Glu Ile Gln Asn Ile Phe Gly Ser Ile 
        355                 360                 365             


Pro Ile Arg Glu Thr Tyr Gly Tyr Ile Thr Ser Tyr Phe Arg Asn Lys 
    370                 375                 380                 


Trp Glu Leu Glu Lys Glu His Tyr Asn Asp Ala Val Val Ile Ala Ser 
385                 390                 395                 400 


Asp Lys Glu Asp Leu Asn Ile Lys Pro Val Phe Lys Asp Ala Val Pro 
                405                 410                 415     


Gln Thr Ile Lys Ser Ser Ile Lys Gly Gly Lys Leu Phe Asp Thr Asn 
            420                 425                 430         


Pro Leu Gln Phe Ser Asp Gly Lys Phe Tyr Gln Asn Ile Thr Leu Ile 
        435                 440                 445             


Gly Arg Lys Ala Gly Met Arg Ser Ser Lys His Lys Arg Gly Gln Arg 
    450                 455                 460                 


Asn Ile Arg Asn Tyr Gly Ser Ile Tyr Met Asp Glu Ile Glu Leu Ile 
465                 470                 475                 480 


Thr Ser Glu Trp Lys Lys Lys Val Leu Cys Glu Leu Arg Asp Lys Leu 
                485                 490                 495     


Gly Tyr Val Lys Gly Asp Lys Asn Lys Ser Phe Lys Pro Glu Glu Leu 
            500                 505                 510         


Met Asn Ala Asn Leu Pro Phe Arg Thr Val Thr Ile Asp Lys Arg Gly 
        515                 520                 525             


Val Gly Glu Ser Ser Thr Arg Leu Ile Asn Asn Asn Val Phe Arg Ala 
    530                 535                 540                 


Ser Ala Glu Val Asn Thr His Ile Met Val Tyr Ser Asn Asn Asp Gly 
545                 550                 555                 560 


Arg Met Lys Ala Phe Ala Val Lys Asn Pro Lys Ile Phe Lys Asp Ala 
                565                 570                 575     


Gly Leu Pro His Asp Phe Gln Lys Lys Ile Phe Ile Val Lys Lys Gly 
            580                 585                 590         


Asp Ile Val Thr Trp Lys Lys Ser Glu Asp Gly Ile Ala Val Thr Gly 
        595                 600                 605             


Arg Val Thr Lys Cys Leu Thr Lys Asn Gly Val Ile Asp Ile Lys Asp 
    610                 615                 620                 


Met Asn Asn Lys Ile His Ser Gly Lys Asn Pro Val Tyr Ile Glu Lys 
625                 630                 635                 640 


Ile Val Ser Pro Glu Arg Gly Ala Ile Phe Glu Arg Lys Ser Leu Ser 
                645                 650                 655     


Ala Leu 
        


<210>  16
<211>  2960
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  16
cgggggcggt agtttgatca aatcaaggag gaacgagact tccatccact gccggcgggc       60

gatgggggtt gcaacaagga aacggatatg aaagagctta gcgcgccact cttccatcca      120

ctgccggcgg gcgatggggg ttgcaacccc atctatggaa gtcgcctcag ccaccgatcc      180

ttggggatcg acggggctac gttattcaga tgcataccct ggggtggacg actccagccc      240

caggccctat ggggtgaact aaacagcgct gaagtggagg cgcagtggta gcccggtggg      300

actctgtccc acaaaactct gaataacatt ggctgggagt cacctgacgg gttggccgga      360

tacccaggtc ttaccgacat agaaccggcc attctcaacc cttgagaagg gataaaaaca      420

tgggagattt gagaaataaa ccagtttatg tgcaaaatgc agatggtcaa cctttgatgc      480

cgaccacgcc ggcccgggcc aggcgaatgc tggatcaagg caaagccagg attgcgattc      540

gatcaccctt tacgatccgc ttgcttcagc agattgaatc gccgcaattg cagcctgtgc      600

gagttggtct ggatacgggg tccaaggcaa tggggattgc ggccattgcc aatggcaaag      660

ccatatttgt tggggaatta cctttgcgac agtttcaaaa ggggagtgcc gtagctgaca      720

gagccatgca tcgccgggca cgcaggtcac gcttgcgcta tcggaaagcg cggtttttga      780

atcggaccag aaaaaagtgc aaggtttgtg gtggcaatac gccaaagtct gatcgcaaat      840

ccggcgggcg agcggaatta tgccgaaaat gtgccgctga ggggcatcat gcattcgcag      900

gaattgctaa agttccgggt tggatatccc caacattaaa agcgaaaaag gacaaccatg      960

ttcatgccgt caagaacctg gctaaaatcc ttccggtatc agaggtggtt gttgaggtag     1020

ccgattggga tattcaaaag atccggaatc ctgatatttc gggatacgag taccaaaatg     1080

gaccattggc gcattacgag aacctgcgtg catatgttta tgcgcgtgat ggatggacat     1140

gccaatactg taaatccgaa gatgggaatc tgacactgga tcatattatc ccagaatcgc     1200

ggggtggtcc tacgacaccc aacaacctgg ttgcagcgtg ctataactgc aatcgagcaa     1260

aagggaatca aactgccgag gaatggggat accctgacat tcaagaaaga gtcaaaaaga     1320

atgagcttgc ttttaagcac gcggcacacg ttggcagtat caaaaaccac atcatttatg     1380

agttgtcgaa ggaattcccg gtacgaacga catatggttt ttacacgcat atcaagcgac     1440

gggatgagct tggtttagag aagcggcatg ggcatgatgc tgtagctatt gcatgtcgat     1500

ggggtgaaaa ggtcgaggtt gtgagcccca tttaccaggg ccggctgaaa ccttcgcgtc     1560

gccggcaaaa atatcaaatg ttgatgttcc cgcaatatcg atataagcca cgcacgaaaa     1620

aagggaaaaa agatctggac aatcaacttg ctaaattgaa gtacaatagt aaaaacgacc     1680

cagaaaggct taaggctatt gcgcggcaat tgagggccct ggcgcccgag ttttgggaag     1740

gtaaggggta ttttgttccc aaggaaagaa acaagcgcat tgtcgcgaag gatggtacga     1800

tctttaaaaa gagtgattat gttgaagctg tggtaagcgg aaagaaatgc cggggatatg     1860

ttaccgcttt gtattcatct ggtagattga aggttgaaac atcagagggt attaaatctg     1920

caagcccaga tcgttcgcgc aaattgcaat cagcgcggtc aattatgtgg tgggaggaat     1980

aaaatgggaa tctacgtgac gatcctggcc ggcgaccagg agctttacag cggcaaaatt     2040

tcgaacggcc tggcgagcgt gttgatgaac acgttcgatc aggcgcggca acataatttg     2100

ttcggggaaa gtttcctggc gccggcgggc aaggttgagg gggcggaaga tgtggaaagg     2160

gtctttgtca agctgtgccg gcatcttcac aaccctcgct ttttactgcc gcggtgggac     2220

tcgaacgacg agaactttcg ggcaaagcag gcgcgggcag agttgggcaa gatggttgat     2280

gagatgaagg cgctggagct tgcgctaggc agagagaggg agcttgggcg ggagccggcg     2340

gtgaggtggg ggtaatgggg gaagatggaa caattggggt tgccgctaag gattgaatct     2400

gaaatttata gctggcatga ggggtatttc tgtgattgcg atttgcagac ggacgagcct     2460

tcggtttgcg aaattcatgg ccatcgtcgt tatgagcgca ggccggcctg gtggcgctgc     2520

ccgctttgtg aagttgtttt cgtgcctggg tcgaaagagc gtccggtggt gcacaattgt     2580

cagcagaaca ggtaattgca tgggaacgct ttggaaggga gccggcaata tgctgttgac     2640

cccctgcggc actctattgt cacccggttg ccggctcccc tccggggcgt tcccggacat     2700

ggaaggcggc gatgcgcgta cccgaggcat cgcgtgctgg aatgagcccg ggcgggcctg     2760

gggaaacgct cgcccgggaa aaattgcgac ggggggaggt ggtaatcgac caagagagac     2820

atcaactatg acgaactgtc ctctgcatct tgggtgccgg ctttgcgggg catggccggc     2880

acctgggaat agtgtgtaac tgcagttaca cgtggaaaat cgaaggattt gggagatgag     2940

caatagtacg ggtccaaaag                                                 2960


<210>  17
<211>  520
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  17

Met Gly Asp Leu Arg Asn Lys Pro Val Tyr Val Gln Asn Ala Asp Gly 
1               5                   10                  15      


Gln Pro Leu Met Pro Thr Thr Pro Ala Arg Ala Arg Arg Met Leu Asp 
            20                  25                  30          


Gln Gly Lys Ala Arg Ile Ala Ile Arg Ser Pro Phe Thr Ile Arg Leu 
        35                  40                  45              


Leu Gln Gln Ile Glu Ser Pro Gln Leu Gln Pro Val Arg Val Gly Leu 
    50                  55                  60                  


Asp Thr Gly Ser Lys Ala Met Gly Ile Ala Ala Ile Ala Asn Gly Lys 
65                  70                  75                  80  


Ala Ile Phe Val Gly Glu Leu Pro Leu Arg Gln Phe Gln Lys Gly Ser 
                85                  90                  95      


Ala Val Ala Asp Arg Ala Met His Arg Arg Ala Arg Arg Ser Arg Leu 
            100                 105                 110         


Arg Tyr Arg Lys Ala Arg Phe Leu Asn Arg Thr Arg Lys Lys Cys Lys 
        115                 120                 125             


Val Cys Gly Gly Asn Thr Pro Lys Ser Asp Arg Lys Ser Gly Gly Arg 
    130                 135                 140                 


Ala Glu Leu Cys Arg Lys Cys Ala Ala Glu Gly His His Ala Phe Ala 
145                 150                 155                 160 


Gly Ile Ala Lys Val Pro Gly Trp Ile Ser Pro Thr Leu Lys Ala Lys 
                165                 170                 175     


Lys Asp Asn His Val His Ala Val Lys Asn Leu Ala Lys Ile Leu Pro 
            180                 185                 190         


Val Ser Glu Val Val Val Glu Val Ala Asp Trp Asp Ile Gln Lys Ile 
        195                 200                 205             


Arg Asn Pro Asp Ile Ser Gly Tyr Glu Tyr Gln Asn Gly Pro Leu Ala 
    210                 215                 220                 


His Tyr Glu Asn Leu Arg Ala Tyr Val Tyr Ala Arg Asp Gly Trp Thr 
225                 230                 235                 240 


Cys Gln Tyr Cys Lys Ser Glu Asp Gly Asn Leu Thr Leu Asp His Ile 
                245                 250                 255     


Ile Pro Glu Ser Arg Gly Gly Pro Thr Thr Pro Asn Asn Leu Val Ala 
            260                 265                 270         


Ala Cys Tyr Asn Cys Asn Arg Ala Lys Gly Asn Gln Thr Ala Glu Glu 
        275                 280                 285             


Trp Gly Tyr Pro Asp Ile Gln Glu Arg Val Lys Lys Asn Glu Leu Ala 
    290                 295                 300                 


Phe Lys His Ala Ala His Val Gly Ser Ile Lys Asn His Ile Ile Tyr 
305                 310                 315                 320 


Glu Leu Ser Lys Glu Phe Pro Val Arg Thr Thr Tyr Gly Phe Tyr Thr 
                325                 330                 335     


His Ile Lys Arg Arg Asp Glu Leu Gly Leu Glu Lys Arg His Gly His 
            340                 345                 350         


Asp Ala Val Ala Ile Ala Cys Arg Trp Gly Glu Lys Val Glu Val Val 
        355                 360                 365             


Ser Pro Ile Tyr Gln Gly Arg Leu Lys Pro Ser Arg Arg Arg Gln Lys 
    370                 375                 380                 


Tyr Gln Met Leu Met Phe Pro Gln Tyr Arg Tyr Lys Pro Arg Thr Lys 
385                 390                 395                 400 


Lys Gly Lys Lys Asp Leu Asp Asn Gln Leu Ala Lys Leu Lys Tyr Asn 
                405                 410                 415     


Ser Lys Asn Asp Pro Glu Arg Leu Lys Ala Ile Ala Arg Gln Leu Arg 
            420                 425                 430         


Ala Leu Ala Pro Glu Phe Trp Glu Gly Lys Gly Tyr Phe Val Pro Lys 
        435                 440                 445             


Glu Arg Asn Lys Arg Ile Val Ala Lys Asp Gly Thr Ile Phe Lys Lys 
    450                 455                 460                 


Ser Asp Tyr Val Glu Ala Val Val Ser Gly Lys Lys Cys Arg Gly Tyr 
465                 470                 475                 480 


Val Thr Ala Leu Tyr Ser Ser Gly Arg Leu Lys Val Glu Thr Ser Glu 
                485                 490                 495     


Gly Ile Lys Ser Ala Ser Pro Asp Arg Ser Arg Lys Leu Gln Ser Ala 
            500                 505                 510         


Arg Ser Ile Met Trp Trp Glu Glu 
        515                 520 


<210>  18
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  18

Gly Gly Gly Ser 
1               


<210>  19
<211>  12
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  19

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
1               5                   10          


<210>  20
<211>  7
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  20

Ala Glu Ala Ala Ala Lys Ala 
1               5           


<210>  21
<211>  15
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  21

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
1               5                   10                  15  


<210>  22
<211>  30
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  22

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1               5                   10                  15      


Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
            20                  25                  30  


<210>  23
<211>  45
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  23

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1               5                   10                  15      


Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
            20                  25                  30          


Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
        35                  40                  45  


<210>  24
<211>  60
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  24

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1               5                   10                  15      


Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
            20                  25                  30          


Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
        35                  40                  45              


Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
    50                  55                  60  


<210>  25
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  25

Gly Gly Gly Gly Ser 
1               5   


<210>  26
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  26

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
1               5                   10  


<210>  27
<211>  20
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  27

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1               5                   10                  15      


Gly Gly Gly Ser 
            20  


<210>  28
<211>  25
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  28

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1               5                   10                  15      


Gly Gly Gly Ser Gly Gly Gly Gly Ser 
            20                  25  


<210>  29
<211>  35
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  29

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1               5                   10                  15      


Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
            20                  25                  30          


Gly Gly Ser 
        35  


<210>  30
<211>  40
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  30

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1               5                   10                  15      


Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
            20                  25                  30          


Gly Gly Ser Gly Gly Gly Gly Ser 
        35                  40  


<210>  31
<211>  50
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  31

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1               5                   10                  15      


Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
            20                  25                  30          


Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
        35                  40                  45              


Gly Ser 
    50  


<210>  32
<211>  55
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  32

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1               5                   10                  15      


Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
            20                  25                  30          


Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
        35                  40                  45              


Gly Ser Gly Gly Gly Gly Ser 
    50                  55  


<210>  33
<211>  7
<212>  PRT
<213>  Simian virus 40

<400>  33

Pro Lys Lys Lys Arg Lys Val 
1               5           


<210>  34
<211>  16
<212>  PRT
<213>  Simian virus 40

<400>  34

Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys 
1               5                   10                  15      


<210>  35
<211>  9
<212>  PRT
<213>  Homo sapiens

<400>  35

Pro Ala Ala Lys Arg Val Lys Leu Asp 
1               5                   


<210>  36
<211>  11
<212>  PRT
<213>  Homo sapiens

<400>  36

Arg Gln Arg Arg Asn Glu Leu Lys Arg Ser Pro 
1               5                   10      


<210>  37
<211>  38
<212>  PRT
<213>  Homo sapiens

<400>  37

Asn Gln Ser Ser Asn Phe Gly Pro Met Lys Gly Gly Asn Phe Gly Gly 
1               5                   10                  15      


Arg Ser Ser Gly Pro Tyr Gly Gly Gly Gly Gln Tyr Phe Ala Lys Pro 
            20                  25                  30          


Arg Asn Gln Gly Gly Tyr 
        35              


<210>  38
<211>  42
<212>  PRT
<213>  Homo sapiens

<400>  38

Arg Met Arg Ile Glx Phe Lys Asn Lys Gly Lys Asp Thr Ala Glu Leu 
1               5                   10                  15      


Arg Arg Arg Arg Val Glu Val Ser Val Glu Leu Arg Lys Ala Lys Lys 
            20                  25                  30          


Asp Glu Gln Ile Leu Lys Arg Arg Asn Val 
        35                  40          


<210>  39
<211>  8
<212>  PRT
<213>  Homo sapiens

<400>  39

Val Ser Arg Lys Arg Pro Arg Pro 
1               5               


<210>  40
<211>  8
<212>  PRT
<213>  Homo sapiens

<400>  40

Pro Pro Lys Lys Ala Arg Glu Asp 
1               5               


<210>  41
<211>  8
<212>  PRT
<213>  Homo sapiens

<400>  41

Pro Gln Pro Lys Lys Lys Pro Leu 
1               5               


<210>  42
<211>  12
<212>  PRT
<213>  Mus sp.

<400>  42

Ser Ala Leu Ile Lys Lys Lys Lys Lys Met Ala Pro 
1               5                   10          


<210>  43
<211>  5
<212>  PRT
<213>  Influenza virus

<400>  43

Asp Arg Leu Arg Arg 
1               5   


<210>  44
<211>  7
<212>  PRT
<213>  Influenza virus

<400>  44

Pro Lys Gln Lys Lys Arg Lys 
1               5           


<210>  45
<211>  10
<212>  PRT
<213>  Hepatitis virus

<400>  45

Arg Lys Leu Lys Lys Lys Ile Lys Lys Leu 
1               5                   10  


<210>  46
<211>  10
<212>  PRT
<213>  Mus sp.

<400>  46

Arg Glu Lys Lys Lys Phe Leu Lys Arg Arg 
1               5                   10  


<210>  47
<211>  20
<212>  PRT
<213>  Homo sapiens

<400>  47

Lys Arg Lys Gly Asp Glu Val Asp Gly Val Asp Glu Val Ala Lys Lys 
1               5                   10                  15      


Lys Ser Lys Lys 
            20  


<210>  48
<211>  17
<212>  PRT
<213>  Homo sapiens

<400>  48

Arg Lys Cys Leu Gln Ala Gly Met Asn Leu Glu Ala Arg Lys Thr Lys 
1               5                   10                  15      


Lys 
    


<210>  49
<211>  12
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  49
acttgtttaa gt                                                           12


<210>  50
<211>  16
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  50
ggcaccgagt cggtgc                                                       16


<210>  51
<211>  9
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  51

Leu Ala Gly Leu Ile Asp Ala Asp Gly 
1               5                   


<210>  52
<211>  47
<212>  DNA
<213>  Caldithrix sp.

<400>  52
ctatatttgt acccctgtca gcgaaaagtg attgagggtt tcactct                     47


