                         SEQUENCE LISTING

<110>  Yeda Research and Development Co. Ltd.
 
<120>  ARTIFICIAL CELLULOSOMES COMPRISING MULTIPLE SCAFFOLDS AND USES 
       THEREOF IN BIOMASS DEGRADATION

<130>  YEDA/0139 PCT

<150>  US 61/862019
<151>  2013-08-04

<160>  54    

<170>  PatentIn version 3.5

<210>  1
<211>  165
<212>  PRT
<213>  Clostridium thermocellum

<400>  1

Ala Asn Thr Pro Val Ser Gly Asn Leu Lys Val Glu Phe Tyr Asn Ser 
1               5                   10                  15      


Asn Pro Ser Asp Thr Thr Asn Ser Ile Asn Pro Gln Phe Lys Val Thr 
            20                  25                  30          


Asn Thr Gly Ser Ser Ala Ile Asp Leu Ser Lys Leu Thr Leu Arg Tyr 
        35                  40                  45              


Tyr Tyr Thr Val Asp Gly Gln Lys Asp Gln Thr Phe Trp Cys Asp His 
    50                  55                  60                  


Ala Ala Ile Ile Gly Ser Asn Gly Ser Tyr Asn Gly Ile Thr Ser Asn 
65                  70                  75                  80  


Val Lys Gly Thr Phe Val Lys Met Ser Ser Ser Thr Asn Asn Ala Asp 
                85                  90                  95      


Thr Tyr Leu Glu Ile Ser Phe Thr Gly Gly Thr Leu Glu Pro Gly Ala 
            100                 105                 110         


His Val Gln Ile Gln Gly Arg Phe Ala Lys Asn Asp Trp Ser Asn Tyr 
        115                 120                 125             


Thr Gln Ser Asn Asp Tyr Ser Phe Lys Ser Ala Ser Gln Phe Val Glu 
    130                 135                 140                 


Trp Asp Gln Val Thr Ala Tyr Leu Asn Gly Val Leu Val Trp Gly Lys 
145                 150                 155                 160 


Glu Pro Gly Gly Ser 
                165 


<210>  2
<211>  145
<212>  PRT
<213>  Clostridium thermocellum

<400>  2

Ser Asp Gly Val Val Val Glu Ile Gly Lys Val Thr Gly Ser Val Gly 
1               5                   10                  15      


Thr Thr Val Glu Ile Pro Val Tyr Phe Arg Gly Val Pro Ser Lys Gly 
            20                  25                  30          


Ile Ala Asn Cys Asp Phe Val Phe Arg Tyr Asp Pro Asn Val Leu Glu 
        35                  40                  45              


Ile Ile Gly Ile Asp Pro Gly Asp Ile Ile Val Asp Pro Asn Pro Thr 
    50                  55                  60                  


Lys Ser Phe Asp Thr Ala Ile Tyr Pro Asp Arg Lys Ile Ile Val Phe 
65                  70                  75                  80  


Leu Phe Ala Glu Asp Ser Gly Thr Gly Ala Tyr Ala Ile Thr Lys Asp 
                85                  90                  95      


Gly Val Phe Ala Lys Ile Arg Ala Thr Val Lys Ser Ser Ala Pro Gly 
            100                 105                 110         


Tyr Ile Thr Phe Asp Glu Val Gly Gly Phe Ala Asp Asn Asp Leu Val 
        115                 120                 125             


Glu Gln Lys Val Ser Phe Ile Asp Gly Gly Val Asn Val Gly Asn Ala 
    130                 135                 140                 


Thr 
145 


<210>  3
<211>  148
<212>  PRT
<213>  Bacteroides cellulosolvens

<400>  3

Ser Ser Pro Gly Asn Lys Met Lys Ile Gln Ile Gly Asp Val Lys Ala 
1               5                   10                  15      


Asn Gln Gly Asp Thr Val Ile Val Pro Ile Thr Phe Asn Glu Val Pro 
            20                  25                  30          


Val Met Gly Val Asn Asn Cys Asn Phe Thr Leu Ala Tyr Asp Lys Asn 
        35                  40                  45              


Ile Met Glu Phe Ile Ser Ala Asp Ala Gly Asp Ile Val Thr Leu Pro 
    50                  55                  60                  


Met Ala Asn Tyr Ser Tyr Asn Met Pro Ser Asp Gly Leu Val Lys Phe 
65                  70                  75                  80  


Leu Tyr Asn Asp Gln Ala Gln Gly Ala Met Ser Ile Lys Glu Asp Gly 
                85                  90                  95      


Thr Phe Ala Asn Val Lys Phe Lys Ile Lys Gln Ser Ala Ala Phe Gly 
            100                 105                 110         


Lys Tyr Ser Val Gly Ile Lys Ala Ile Gly Ser Ile Ser Ala Leu Ser 
        115                 120                 125             


Asn Ser Lys Leu Ile Pro Ile Glu Ser Ile Phe Lys Asp Gly Ser Ile 
    130                 135                 140                 


Thr Val Thr Asn 
145             


<210>  4
<211>  146
<212>  PRT
<213>  Acetivibrio cellulolyticus

<400>  4

Gly Ser Asp Leu Gln Val Asp Ile Gly Ser Thr Ser Gly Lys Ala Gly 
1               5                   10                  15      


Ser Val Val Ser Val Pro Ile Thr Phe Thr Asn Val Pro Lys Ser Gly 
            20                  25                  30          


Ile Tyr Ala Leu Ser Phe Arg Thr Asn Phe Asp Pro Gln Lys Val Thr 
        35                  40                  45              


Val Ala Ser Ile Asp Ala Gly Ser Leu Ile Glu Asn Ala Ser Asp Phe 
    50                  55                  60                  


Thr Thr Tyr Tyr Asn Asn Glu Asn Gly Phe Ala Ser Met Thr Phe Glu 
65                  70                  75                  80  


Ala Pro Val Asp Arg Ala Arg Ile Ile Asp Ser Asp Gly Val Phe Ala 
                85                  90                  95      


Thr Ile Asn Phe Lys Val Ser Asp Ser Ala Lys Val Gly Glu Leu Tyr 
            100                 105                 110         


Asn Ile Thr Thr Asn Ser Ala Tyr Thr Ser Phe Tyr Tyr Ser Gly Thr 
        115                 120                 125             


Asp Glu Ile Lys Asn Val Val Tyr Asn Asp Gly Lys Ile Glu Val Ile 
    130                 135                 140                 


Ala Ser 
145     


<210>  5
<211>  29
<212>  PRT
<213>  Acetivibrio cellulolyticus

<400>  5

Pro Thr Pro Thr Gln Ser Ala Thr Pro Thr Val Thr Pro Ser Ala Thr 
1               5                   10                  15      


Ala Thr Pro Thr Gln Ser Ala Thr Pro Thr Val Thr Pro 
            20                  25                  


<210>  6
<211>  5
<212>  PRT
<213>  Acetivibrio cellulolyticus

<400>  6

Pro Thr Pro Thr Gln 
1               5   


<210>  7
<211>  27
<212>  PRT
<213>  Bacteroides cellulosolvens

<400>  7

Thr Pro Thr Asn Thr Ile Ser Val Thr Pro Thr Asn Asn Ser Thr Pro 
1               5                   10                  15      


Thr Asn Asn Ser Thr Pro Lys Pro Asn Pro Leu 
            20                  25          


<210>  8
<211>  5
<212>  PRT
<213>  Bacteroides cellulosolvens

<400>  8

Thr Pro Thr Asn Thr 
1               5   


<210>  9
<211>  35
<212>  PRT
<213>  Clostridium thermocellum

<400>  9

Pro Thr Lys Gly Ala Thr Pro Thr Asn Thr Ala Thr Pro Thr Lys Ser 
1               5                   10                  15      


Ala Thr Ala Thr Pro Thr Arg Pro Ser Val Pro Thr Asn Thr Pro Thr 
            20                  25                  30          


Asn Thr Pro 
        35  


<210>  10
<211>  5
<212>  PRT
<213>  Clostridium thermocellum

<400>  10

Pro Thr Lys Gly Ala 
1               5   


<210>  11
<211>  31
<212>  PRT
<213>  Clostridium thermocellum

<400>  11

Val Val Pro Ser Thr Gln Pro Val Thr Thr Pro Pro Ala Thr Thr Lys 
1               5                   10                  15      


Pro Pro Ala Thr Thr Lys Pro Pro Ala Thr Thr Ile Pro Pro Ser 
            20                  25                  30      


<210>  12
<211>  5
<212>  PRT
<213>  Clostridium thermocellum

<400>  12

Val Val Pro Ser Thr 
1               5   


<210>  13
<211>  721
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  13

Met Gly Pro Thr Lys Ala Pro Thr Lys Asp Gly Thr Ser Tyr Lys Asp 
1               5                   10                  15      


Leu Phe Leu Glu Leu Tyr Gly Lys Ile Lys Asp Pro Lys Asn Gly Tyr 
            20                  25                  30          


Phe Ser Pro Asp Glu Gly Ile Pro Tyr His Ser Ile Glu Thr Leu Ile 
        35                  40                  45              


Val Glu Ala Pro Asp Tyr Gly His Val Thr Thr Ser Glu Ala Phe Ser 
    50                  55                  60                  


Tyr Tyr Val Trp Leu Glu Ala Met Tyr Gly Asn Leu Thr Gly Asn Trp 
65                  70                  75                  80  


Ser Gly Val Glu Thr Ala Trp Lys Val Met Glu Asp Trp Ile Ile Pro 
                85                  90                  95      


Asp Ser Thr Glu Gln Pro Gly Met Ser Ser Tyr Asn Pro Asn Ser Pro 
            100                 105                 110         


Ala Thr Tyr Ala Asp Glu Tyr Glu Asp Pro Ser Tyr Tyr Pro Ser Glu 
        115                 120                 125             


Leu Lys Phe Asp Thr Val Arg Val Gly Ser Asp Pro Val His Asn Asp 
    130                 135                 140                 


Leu Val Ser Ala Tyr Gly Pro Asn Met Tyr Leu Met His Trp Leu Met 
145                 150                 155                 160 


Asp Val Asp Asn Trp Tyr Gly Phe Gly Thr Gly Thr Arg Ala Thr Phe 
                165                 170                 175     


Ile Asn Thr Phe Gln Arg Gly Glu Gln Glu Ser Thr Trp Glu Thr Ile 
            180                 185                 190         


Pro His Pro Ser Ile Glu Glu Phe Lys Tyr Gly Gly Pro Asn Gly Phe 
        195                 200                 205             


Leu Asp Leu Phe Thr Lys Asp Arg Ser Tyr Ala Lys Gln Trp Arg Tyr 
    210                 215                 220                 


Thr Asn Ala Pro Asp Ala Glu Gly Arg Ala Ile Gln Ala Val Tyr Trp 
225                 230                 235                 240 


Ala Asn Lys Trp Ala Lys Glu Gln Gly Lys Gly Ser Ala Val Ala Ser 
                245                 250                 255     


Val Val Ser Lys Ala Ala Lys Met Gly Asp Phe Leu Arg Asn Asp Met 
            260                 265                 270         


Phe Asp Lys Tyr Phe Met Lys Ile Gly Ala Gln Asp Lys Thr Pro Ala 
        275                 280                 285             


Thr Gly Tyr Asp Ser Ala His Tyr Leu Met Ala Trp Tyr Thr Ala Trp 
    290                 295                 300                 


Gly Gly Gly Ile Gly Ala Ser Trp Ala Trp Lys Ile Gly Cys Ser His 
305                 310                 315                 320 


Ala His Phe Gly Tyr Gln Asn Pro Phe Gln Gly Trp Val Ser Ala Thr 
                325                 330                 335     


Gln Ser Asp Phe Ala Pro Lys Ser Ser Asn Gly Lys Arg Asp Trp Thr 
            340                 345                 350         


Thr Ser Tyr Lys Arg Gln Leu Glu Phe Tyr Gln Trp Leu Gln Ser Ala 
        355                 360                 365             


Glu Gly Gly Ile Ala Gly Gly Ala Thr Asn Ser Trp Asn Gly Arg Tyr 
    370                 375                 380                 


Glu Lys Tyr Pro Ala Gly Thr Ser Thr Phe Tyr Gly Met Ala Tyr Val 
385                 390                 395                 400 


Pro His Pro Val Tyr Ala Asp Pro Gly Ser Asn Gln Trp Phe Gly Phe 
                405                 410                 415     


Gln Ala Trp Ser Met Gln Arg Val Met Glu Tyr Tyr Leu Glu Thr Gly 
            420                 425                 430         


Asp Ser Ser Val Lys Asn Leu Ile Lys Lys Trp Val Asp Trp Val Met 
        435                 440                 445             


Ser Glu Ile Lys Leu Tyr Asp Asp Gly Thr Phe Ala Ile Pro Ser Asp 
    450                 455                 460                 


Leu Glu Trp Ser Gly Gln Pro Asp Thr Trp Thr Gly Thr Tyr Thr Gly 
465                 470                 475                 480 


Asn Pro Asn Leu His Val Arg Val Thr Ser Tyr Gly Thr Asp Leu Gly 
                485                 490                 495     


Val Ala Gly Ser Leu Ala Asn Ala Leu Ala Thr Tyr Ala Ala Ala Thr 
            500                 505                 510         


Glu Arg Trp Glu Gly Lys Leu Asp Thr Lys Ala Arg Asp Met Ala Ala 
        515                 520                 525             


Glu Leu Val Asn Arg Ala Trp Tyr Asn Phe Tyr Cys Ser Glu Gly Lys 
    530                 535                 540                 


Gly Val Val Thr Glu Glu Ala Arg Ala Asp Tyr Lys Arg Phe Phe Glu 
545                 550                 555                 560 


Gln Glu Val Tyr Val Pro Ala Gly Trp Ser Gly Thr Met Pro Asn Gly 
                565                 570                 575     


Asp Lys Ile Gln Pro Gly Ile Lys Phe Ile Asp Ile Arg Thr Lys Tyr 
            580                 585                 590         


Arg Gln Asp Pro Tyr Tyr Asp Ile Val Tyr Gln Ala Tyr Leu Arg Gly 
        595                 600                 605             


Glu Ala Pro Val Leu Asn Tyr His Arg Phe Trp His Glu Val Asp Leu 
    610                 615                 620                 


Ala Val Ala Met Gly Val Leu Ala Thr Tyr Phe Pro Asp Met Thr Tyr 
625                 630                 635                 640 


Lys Val Pro Gly Thr Pro Ser Thr Lys Leu Tyr Gly Asp Val Asn Asp 
                645                 650                 655     


Asp Gly Lys Val Asn Ser Thr Asp Ala Val Ala Leu Lys Arg Tyr Val 
            660                 665                 670         


Leu Arg Ser Gly Ile Ser Ile Asn Thr Asp Asn Ala Asp Leu Asn Glu 
        675                 680                 685             


Asp Gly Arg Val Asn Ser Thr Asp Leu Gly Ile Leu Lys Arg Tyr Ile 
    690                 695                 700                 


Leu Lys Glu Ile Asp Thr Leu Pro Tyr Lys Asn His His His His His 
705                 710                 715                 720 


His 
    


<210>  14
<211>  2166
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  14
atgggtccta caaaggcacc tacaaaagat gggacatctt ataaggatct tttccttgaa       60

ctctacggaa aaattaaaga tcctaagaac ggatatttca gcccagacga gggaattcct      120

tatcactcaa ttgaaacatt gatcgttgaa gcgccggact acggtcacgt tactaccagt      180

gaggctttca gctattatgt atggcttgaa gcaatgtatg gaaatctcac aggcaactgg      240

tccggagtag aaacagcatg gaaagttatg gaggattgga taattcctga cagcacagag      300

cagccgggta tgtcttctta caatccaaac agccctgcca catatgctga cgaatatgag      360

gatccttcat actatccttc agagttgaag tttgataccg taagagttgg atccgaccct      420

gtacacaacg accttgtatc cgcatacggt cctaacatgt acctcatgca ctggttgatg      480

gacgttgaca actggtacgg ttttggtaca ggaacacggg caacattcat aaacaccttc      540

caaagaggtg aacaggaatc cacatgggaa accattcctc atccgtcaat agaagagttc      600

aaatacggcg gaccgaacgg attccttgat ttgtttacaa aggacagatc atatgcaaaa      660

cagtggcgtt atacaaacgc tcctgacgca gaaggccgtg ctatacaggc tgtttactgg      720

gcaaacaaat gggcaaagga gcagggtaaa ggttctgccg ttgcttccgt tgtatccaag      780

gctgcaaaga tgggtgactt cttgagaaac gacatgttcg acaaatactt catgaagatc      840

ggtgcacagg acaagactcc tgctaccggt tatgacagtg cacactacct tatggcctgg      900

tatactgcat ggggtggtgg aattggtgca tcctgggcat ggaagatcgg atgcagccac      960

gcacacttcg gatatcagaa cccattccag ggatgggtaa gtgcaacaca gagcgacttt     1020

gctcctaaat catccaacgg taagagagac tggacaacaa gctacaagag acagcttgaa     1080

ttctatcagt ggttgcagtc ggctgaaggt ggtattgccg gtggagcaac caactcctgg     1140

aacggtagat atgagaaata tcctgctggt acgtcaacgt tctatggtat ggcatatgtt     1200

ccgcatcctg tatacgctga cccgggtagt aaccagtggt tcggattcca ggcatggtca     1260

atgcagcgtg taatggagta ctacctcgaa acaggagatt catcagttaa gaatttgatt     1320

aagaagtggg tcgactgggt aatgagcgaa attaagctct atgacgatgg aacatttgca     1380

attcctagcg acctcgagtg gtcaggtcag cctgatacat ggaccggaac atacacaggc     1440

aacccgaacc tccatgtaag agtaacttct tacggtactg accttggtgt tgcaggttca     1500

cttgcaaatg ctcttgcaac ttatgccgca gctacagaaa gatgggaagg aaaacttgat     1560

acaaaagcaa gagacatggc tgctgaactg gttaaccgtg catggtacaa cttctactgc     1620

tctgaaggaa aaggtgttgt tactgaggaa gcacgtgctg actacaaacg tttctttgag     1680

caggaagtat acgttccggc aggttggagc ggtactatgc cgaacggtga caagattcag     1740

cctggtatta agttcataga catccgtaca aaatatagac aagatcctta ctacgatata     1800

gtatatcagg catacttgag aggcgaagct cctgtattga attatcaccg cttctggcat     1860

gaagttgacc ttgcagttgc aatgggtgta ttggctacat acttcccgga tatgacatat     1920

aaagtacctg gtactccttc tactaaatta tacggcgacg tcaatgatga cggaaaagtt     1980

aactcaactg acgctgtagc attgaagaga tatgttttga gatcaggtat aagcatcaac     2040

actgacaatg ccgatttgaa tgaagacggc agagttaatt caactgactt aggaattttg     2100

aagagatata ttctcaaaga aatagataca ttgccgtaca agaaccacca tcaccatcac     2160

cattaa                                                                2166


<210>  15
<211>  466
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  15

Gly Val Pro Phe Asn Thr Lys Tyr Pro Tyr Gly Pro Thr Ser Ile Ala 
1               5                   10                  15      


Asp Asn Gln Ser Glu Val Thr Ala Met Leu Lys Ala Glu Trp Glu Asp 
            20                  25                  30          


Trp Lys Ser Lys Arg Ile Thr Ser Asn Gly Ala Gly Gly Tyr Lys Arg 
        35                  40                  45              


Val Gln Arg Asp Ala Ser Thr Asn Tyr Asp Thr Val Ser Glu Gly Met 
    50                  55                  60                  


Gly Tyr Gly Leu Leu Leu Ala Val Cys Phe Asn Glu Gln Ala Leu Phe 
65                  70                  75                  80  


Asp Asp Leu Tyr Arg Tyr Val Lys Ser His Phe Asn Gly Asn Gly Leu 
                85                  90                  95      


Met His Trp His Ile Asp Ala Asn Asn Asn Val Thr Ser His Asp Gly 
            100                 105                 110         


Gly Asp Gly Ala Ala Thr Asp Ala Asp Glu Asp Ile Ala Leu Ala Leu 
        115                 120                 125             


Ile Phe Ala Asp Lys Leu Trp Gly Ser Ser Gly Ala Ile Asn Tyr Gly 
    130                 135                 140                 


Gln Glu Ala Arg Thr Leu Ile Asn Asn Leu Tyr Asn His Cys Val Glu 
145                 150                 155                 160 


His Gly Ser Tyr Val Leu Lys Pro Gly Asp Arg Trp Gly Gly Ser Ser 
                165                 170                 175     


Val Thr Asn Pro Ser Tyr Phe Ala Pro Ala Trp Tyr Lys Val Tyr Ala 
            180                 185                 190         


Gln Tyr Thr Gly Asp Thr Arg Trp Asn Gln Val Ala Asp Lys Cys Tyr 
        195                 200                 205             


Gln Ile Val Glu Glu Val Lys Lys Tyr Asn Asn Gly Thr Gly Leu Val 
    210                 215                 220                 


Pro Asp Trp Cys Thr Ala Ser Gly Thr Pro Ala Ser Gly Gln Ser Tyr 
225                 230                 235                 240 


Asp Tyr Lys Tyr Asp Ala Thr Arg Tyr Gly Trp Arg Thr Ala Val Asp 
                245                 250                 255     


Tyr Ser Trp Phe Gly Asp Gln Arg Ala Lys Ala Asn Cys Asp Met Leu 
            260                 265                 270         


Thr Lys Phe Phe Ala Arg Asp Gly Ala Lys Gly Ile Val Asp Gly Tyr 
        275                 280                 285             


Thr Ile Gln Gly Ser Lys Ile Ser Asn Asn His Asn Ala Ser Phe Ile 
    290                 295                 300                 


Gly Pro Val Ala Ala Ala Ser Met Thr Gly Tyr Asp Leu Asn Phe Ala 
305                 310                 315                 320 


Lys Glu Leu Tyr Arg Glu Thr Val Ala Val Lys Asp Ser Glu Tyr Tyr 
                325                 330                 335     


Gly Tyr Tyr Gly Asn Ser Leu Arg Leu Leu Thr Leu Leu Tyr Ile Thr 
            340                 345                 350         


Gly Asn Phe Pro Asn Pro Leu Ser Asp Leu Ser Gly Gln Pro Thr Pro 
        355                 360                 365             


Pro Ser Asn Pro Thr Pro Ser Leu Val Pro Pro Lys Gly Thr Ala Thr 
    370                 375                 380                 


Val Leu Tyr Gly Asp Val Asp Asn Asp Gly Asn Val Asp Ser Asp Asp 
385                 390                 395                 400 


Tyr Ala Tyr Met Arg Gln Trp Leu Ile Gly Met Ile Ala Asp Phe Pro 
                405                 410                 415     


Gly Gly Asp Ile Gly Leu Ala Asn Ala Asp Val Asp Gly Asp Gly Asn 
            420                 425                 430         


Val Asp Ser Asp Asp Tyr Ala Tyr Met Arg Gln Trp Leu Ile Gly Met 
        435                 440                 445             


Ile Ser Glu Phe Pro Ala Glu Gln Lys Ala Leu Glu His His His His 
    450                 455                 460                 


His His 
465     


<210>  16
<211>  1404
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  16
atgggtgtgc cttttaacac aaaatacccc tatggtccta cttctattgc cgataatcag       60

tcggaagtaa ctgcaatgct caaagcagaa tgggaagact ggaagagcaa gagaattacc      120

tcgaacggtg caggaggata caagagagta cagcgtgatg cttccaccaa ttatgatacg      180

gtatccgaag gtatgggata cggacttctt ttggcggttt gctttaacga acaggctttg      240

tttgacgatt tataccgtta cgtaaaatct catttcaatg gaaacggact tatgcactgg      300

cacattgatg ccaacaacaa tgttacaagt catgacggcg gcgacggtgc ggcaaccgat      360

gctgatgagg atattgcact tgcgctcata tttgcggaca agttatgggg ttcttccggt      420

gcaataaact acgggcagga agcaaggaca ttgataaaca atctttacaa ccattgtgta      480

gagcatggat cctatgtatt aaagcccggt gacagatggg gaggttcatc agtaacaaac      540

ccgtcatatt ttgcgcctgc atggtacaaa gtgtatgctc aatatacagg agacacaagg      600

tggaatcaag tggcggacaa gtgttaccaa attgttgaag aagttaagaa atacaacaac      660

ggaaccggcc ttgttcctga ctggtgtact gcaagcggaa ctccggcaag cggtcagagt      720

tacgactaca aatatgatgc tacacgttac ggctggagaa ctgccgtgga ctattcatgg      780

tttggtgacc agagagcaaa ggcaaactgc gatatgctga ccaaattctt tgccagagac      840

ggggcaaaag gaatcgttga cggatacaca attcaaggtt caaaaattag caacaatcac      900

aacgcatcat ttataggacc tgttgcggca gcaagtatga caggttacga tttgaacttt      960

gcaaaggaac tttataggga gactgttgct gtaaaggaca gtgaatatta cggatattac     1020

ggaaacagct tgagactgct cactttgttg tacataacag gaaacttccc gaatcctttg     1080

agtgaccttt ccggccaacc gacaccaccg tcgaatccga caccttcatt ggtacctcca     1140

aaaggcacag ctacagtatt atatggtgac gttgataatg atggaaatgt tgattcagac     1200

gactatgcat atatgagaca atggttgatc ggtatgattg ctgatttccc tggaggagat     1260

atcggattag ctaatgctga tgttgatgga gacggaaatg tagattcaga tgactatgcg     1320

tacatgagac aatggttaat aggaatgatt tccgagttcc cagcagaaca aaaagcgctc     1380

gagcaccacc accaccacca ctga                                            1404


<210>  17
<211>  878
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  17

Met Gly His His His His His His Leu Glu Asp Lys Ser Pro Lys Leu 
1               5                   10                  15      


Pro Asp Tyr Lys Asn Asp Leu Leu Tyr Glu Arg Thr Phe Asp Glu Gly 
            20                  25                  30          


Leu Cys Phe Pro Trp His Thr Cys Glu Asp Ser Gly Gly Lys Cys Asp 
        35                  40                  45              


Phe Ala Val Val Asp Val Pro Gly Glu Pro Gly Asn Lys Ala Phe Arg 
    50                  55                  60                  


Leu Thr Val Ile Asp Lys Gly Gln Asn Lys Trp Ser Val Gln Met Arg 
65                  70                  75                  80  


His Arg Gly Ile Thr Leu Glu Gln Gly His Thr Tyr Thr Val Arg Phe 
                85                  90                  95      


Thr Ile Trp Ser Asp Lys Ser Cys Arg Val Tyr Ala Lys Ile Gly Gln 
            100                 105                 110         


Met Gly Glu Pro Tyr Thr Glu Tyr Trp Asn Asn Asn Trp Asn Pro Phe 
        115                 120                 125             


Asn Leu Thr Pro Gly Gln Lys Leu Thr Val Glu Gln Asn Phe Thr Met 
    130                 135                 140                 


Asn Tyr Pro Thr Asp Asp Thr Cys Glu Phe Thr Phe His Leu Gly Gly 
145                 150                 155                 160 


Glu Leu Ala Ala Gly Thr Pro Tyr Tyr Val Tyr Leu Asp Asp Val Ser 
                165                 170                 175     


Leu Tyr Asp Pro Arg Phe Val Lys Pro Val Glu Tyr Val Leu Pro Gln 
            180                 185                 190         


Pro Asp Val Arg Val Asn Gln Val Gly Tyr Leu Pro Phe Ala Lys Lys 
        195                 200                 205             


Tyr Ala Thr Val Val Ser Ser Ser Thr Ser Pro Leu Lys Trp Gln Leu 
    210                 215                 220                 


Leu Asn Ser Ala Asn Gln Val Val Leu Glu Gly Asn Thr Ile Pro Lys 
225                 230                 235                 240 


Gly Leu Asp Lys Asp Ser Gln Asp Tyr Val His Trp Ile Asp Phe Ser 
                245                 250                 255     


Asn Phe Lys Thr Glu Gly Lys Gly Tyr Tyr Phe Lys Leu Pro Thr Val 
            260                 265                 270         


Asn Ser Asp Thr Asn Tyr Ser His Pro Phe Asp Ile Ser Ala Asp Ile 
        275                 280                 285             


Tyr Ser Lys Met Lys Phe Asp Ala Leu Ala Phe Phe Tyr His Lys Arg 
    290                 295                 300                 


Ser Gly Ile Pro Ile Glu Met Pro Tyr Ala Gly Gly Glu Gln Trp Thr 
305                 310                 315                 320 


Arg Pro Ala Gly His Ile Gly Val Ala Pro Asn Lys Gly Asp Thr Asn 
                325                 330                 335     


Val Pro Thr Trp Pro Gln Asp Asp Glu Tyr Ala Gly Arg Pro Gln Lys 
            340                 345                 350         


Tyr Tyr Thr Lys Asp Val Thr Gly Gly Trp Tyr Asp Ala Gly Asp His 
        355                 360                 365             


Gly Lys Tyr Val Val Asn Gly Gly Ile Ala Val Trp Thr Leu Met Asn 
    370                 375                 380                 


Met Tyr Glu Arg Ala Lys Ile Arg Gly Ile Ala Asn Gln Gly Ala Tyr 
385                 390                 395                 400 


Lys Asp Gly Gly Met Asn Ile Pro Glu Arg Asn Asn Gly Tyr Pro Asp 
                405                 410                 415     


Ile Leu Asp Glu Ala Arg Trp Glu Ile Glu Phe Phe Lys Lys Met Gln 
            420                 425                 430         


Val Thr Glu Lys Glu Asp Pro Ser Ile Ala Gly Met Val His His Lys 
        435                 440                 445             


Ile His Asp Phe Arg Trp Thr Ala Leu Gly Met Leu Pro His Glu Asp 
    450                 455                 460                 


Pro Gln Pro Arg Tyr Leu Arg Pro Val Ser Thr Ala Ala Thr Leu Asn 
465                 470                 475                 480 


Phe Ala Ala Thr Leu Ala Gln Ser Ala Arg Leu Trp Lys Asp Tyr Asp 
                485                 490                 495     


Pro Thr Phe Ala Ala Asp Cys Leu Glu Lys Ala Glu Ile Ala Trp Gln 
            500                 505                 510         


Ala Ala Leu Lys His Pro Asp Ile Tyr Ala Glu Tyr Thr Pro Gly Ser 
        515                 520                 525             


Gly Gly Pro Gly Gly Gly Pro Tyr Asn Asp Asp Tyr Val Gly Asp Glu 
    530                 535                 540                 


Phe Tyr Trp Ala Ala Cys Glu Leu Tyr Val Thr Thr Gly Lys Asp Glu 
545                 550                 555                 560 


Tyr Lys Asn Tyr Leu Met Asn Ser Pro His Tyr Leu Glu Met Pro Ala 
                565                 570                 575     


Lys Met Gly Glu Asn Gly Gly Ala Asn Gly Glu Asp Asn Gly Leu Trp 
            580                 585                 590         


Gly Cys Phe Thr Trp Gly Thr Thr Gln Gly Leu Gly Thr Ile Thr Leu 
        595                 600                 605             


Ala Leu Val Glu Asn Gly Leu Pro Ser Ala Asp Ile Gln Lys Ala Arg 
    610                 615                 620                 


Asn Asn Ile Ala Lys Ala Ala Asp Lys Trp Leu Glu Asn Ile Glu Glu 
625                 630                 635                 640 


Gln Gly Tyr Arg Leu Pro Ile Lys Gln Ala Glu Asp Glu Arg Gly Gly 
                645                 650                 655     


Tyr Pro Trp Gly Ser Asn Ser Phe Ile Leu Asn Gln Met Ile Val Met 
            660                 665                 670         


Gly Tyr Ala Tyr Asp Phe Thr Gly Asn Ser Lys Tyr Leu Asp Gly Met 
        675                 680                 685             


Gln Asp Gly Met Ser Tyr Leu Leu Gly Arg Asn Gly Leu Asp Gln Ser 
    690                 695                 700                 


Tyr Val Thr Gly Tyr Gly Glu Arg Pro Leu Gln Asn Pro His Asp Arg 
705                 710                 715                 720 


Phe Trp Thr Pro Gln Thr Ser Lys Lys Phe Pro Ala Pro Pro Pro Gly 
                725                 730                 735     


Ile Ile Ala Gly Gly Pro Asn Ser Arg Phe Glu Asp Pro Thr Ile Thr 
            740                 745                 750         


Ala Ala Val Lys Lys Asp Thr Pro Pro Gln Lys Cys Tyr Ile Asp His 
        755                 760                 765             


Thr Asp Ser Trp Ser Thr Asn Glu Ile Thr Ile Asn Trp Asn Ala Pro 
    770                 775                 780                 


Phe Ala Trp Val Thr Ala Tyr Leu Asp Glu Ile Asp Leu Ile Thr Pro 
785                 790                 795                 800 


Pro Gly Thr Lys Phe Ile Tyr Gly Asp Val Asp Gly Asn Gly Ser Val 
                805                 810                 815     


Arg Ile Asn Asp Ala Val Leu Ile Arg Asp Tyr Val Leu Gly Lys Ile 
            820                 825                 830         


Asn Glu Phe Pro Tyr Glu Tyr Gly Met Leu Ala Ala Asp Val Asp Gly 
        835                 840                 845             


Asn Gly Ser Ile Lys Ile Asn Asp Ala Val Leu Val Arg Asp Tyr Val 
    850                 855                 860                 


Leu Gly Lys Ile Phe Leu Phe Pro Val Glu Glu Lys Glu Glu 
865                 870                 875             


<210>  18
<211>  2637
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  18
atgggccatc accatcacca tcacttagaa gacaagtctc caaagttgcc ggattataaa       60

aacgaccttt tgtatgaaag aacattcgac gaaggtcttt gctttccgtg gcatacttgc      120

gaagacagtg gaggaaaatg tgatttcgct gttgttgatg ttccaggaga gcctgggaac      180

aaagctttcc gcttgacagt aattgacaaa ggacaaaaca agtggagtgt ccagatgaga      240

cacagaggta ttaccctcga gcaaggacat acatacacgg taaggtttac gatttggtct      300

gacaaatcct gtagggttta tgctaaaatt ggtcagatgg gtgaacccta tactgaatat      360

tggaacaata actggaatcc attcaacctt acaccaggac agaagcttac agttgaacag      420

aattttacaa tgaactatcc tactgatgac acatgcgagt tcacattcca tttgggtgga      480

gaacttgctg caggtacacc ttactatgtt taccttgatg atgtatctct ctacgatcct      540

aggtttgtaa agcctgttga atatgtactt ccgcagccgg atgtacgtgt taaccaggta      600

ggatacttac cgtttgcaaa gaagtatgct actgttgtat cttcttcaac cagcccgctt      660

aagtggcagc ttctcaattc ggcaaatcag gttgttttgg aaggtaatac aataccaaaa      720

ggacttgaca aagattcaca ggattatgta cattggatag atttctccaa ctttaagact      780

gaaggaaaag gttattactt caagcttccg actgtaaaca gcgatacaaa ttacagccat      840

cctttcgata tcagtgctga tatttactcc aagatgaaat ttgatgcatt ggcattcttc      900

tatcacaaga gaagcggtat tcctattgaa atgccgtatg caggaggaga acagtggacc      960

agacctgcag gacatattgg tgttgctccg aacaaaggag acacaaatgt tcctacatgg     1020

cctcaggatg atgaatatgc aggaagacct caaaaatatt atacaaaaga tgtaaccggt     1080

ggatggtatg atgccggtga ccacggtaaa tatgttgtaa acggcggtat agctgtttgg     1140

acattgatga acatgtatga aagggcaaaa atcagaggca tagctaatca aggtgcttat     1200

aaagacggtg gaatgaacat accggagaga aataacggtt atccggacat tcttgatgaa     1260

gcaagatggg aaattgagtt ctttaagaaa atgcaggtaa ctgaaaaaga ggatccttcc     1320

atagccggaa tggtacacca caaaattcac gacttcagat ggactgcttt gggtatgttg     1380

cctcacgaag atccccagcc acgttactta aggccggtaa gtacggctgc gactttgaac     1440

tttgcggcaa ctttggcaca aagtgcacgt ctttggaaag attatgatcc gacttttgct     1500

gctgactgtt tggaaaaggc tgaaatagca tggcaggcgg cattaaagca tcctgatatt     1560

tatgctgagt atactcccgg tagcggtggt cccggaggcg gaccatacaa tgacgactat     1620

gtcggagacg aattctactg ggcagcctgc gaactttatg taacaacagg aaaagacgaa     1680

tataagaatt acctgatgaa ttcacctcac tatcttgaaa tgcctgcaaa gatgggtgaa     1740

aacggtggag caaacggaga agacaacgga ttgtggggat gcttcacctg gggaactact     1800

caaggattgg gaactattac tcttgcatta gttgaaaacg gattgccgtc tgcagacatt     1860

caaaaggcaa gaaacaatat agctaaagct gcagacaaat ggcttgagaa tattgaagag     1920

caaggttaca gactgccgat caaacaggcg gaggatgaga gaggcggtta tccatggggt     1980

tcaaactcct tcattttgaa ccagatgata gttatgggat acgcatatga ctttacaggc     2040

aacagcaagt atcttgacgg aatgcaggat ggtatgagct acctgttggg aagaaacgga     2100

ctggatcagt cctatgtaac agggtatggt gagcgtccac ttcagaatcc tcatgacaga     2160

ttctggacgc cacagacaag taagaaattc cctgctccac ctccgggtat aattgccggt     2220

ggtccgaact cccgtttcga agacccgaca ataactgcag cagttaagaa ggatacaccg     2280

ccgcagaagt gctacattga ccatacagac tcatggtcaa ccaacgagat aactattaac     2340

tggaatgctc cgtttgcatg ggttacagct tatctcgatg aaattgactt aataacaccg     2400

ccaggtacca aatttatata tggtgatgtt gatggtaatg gaagtgtaag aattaatgat     2460

gctgtcctaa taagagacta tgtattagga aaaatcaatg aattcccata tgaatatggt     2520

atgcttgcag cagatgttga tggtaatgga agtataaaaa ttaatgatgc tgttctagta     2580

agagactacg tgttaggaaa gatattttta ttccctgttg aagagaaaga agaataa        2637


<210>  19
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  19
cagtccatgg gtcctacaaa ggcacctac                                         29


<210>  20
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  20
cgcgaagctt ttaatggtga tggtgatggt gg                                     32


<210>  21
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  21
cagtccatgg gtgtgccttt taacacaaa                                         29


<210>  22
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  22
cacgctcgag ataaggtagg tggggtatgc                                        30


<210>  23
<211>  80
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  23
gtttaacttt aagaaggaga tataccatgg gccatcacca tcaccatcac ttagaagaca       60

agtctccaaa gttgccggat                                                   80


<210>  24
<211>  64
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  24
gagtgcggcc gcaagcttgt cgacggagct cttatttatg tggcaataca tctatctctt       60

taag                                                                    64


<210>  25
<211>  67
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  25
ctcgatgaaa ttgacttaat aacaccgcca ggtaccaaat ttatatatgg tgatgttgat       60

ggtaatg                                                                 67


<210>  26
<211>  68
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  26
gagtgcggcc gcaagcttgt cgacggagct cttattcttc tttctcttca acagggaata       60

aaaatatc                                                                68


<210>  27
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  27
attcaaccat gggtgtgcct tttaacacaa aatac                                  35


<210>  28
<211>  46
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  28
atattgctcg agtaatgtgg taccaatgaa ggtgtcggat tcgacg                      46


<210>  29
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  29
actttaggta cctccaaaag gcacagctac                                        30


<210>  30
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  30
attaatctcg agcgcttttt gttctgctgg                                        30


<210>  31
<211>  866
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  31

Met Ala Asn Thr Pro Val Ser Gly Asn Leu Lys Val Glu Phe Tyr Asn 
1               5                   10                  15      


Ser Asn Pro Ser Asp Thr Thr Asn Ser Ile Asn Pro Gln Phe Lys Val 
            20                  25                  30          


Thr Asn Thr Gly Ser Ser Ala Ile Asp Leu Ser Lys Leu Thr Leu Arg 
        35                  40                  45              


Tyr Tyr Tyr Thr Val Asp Gly Gln Lys Asp Gln Thr Phe Trp Cys Asp 
    50                  55                  60                  


His Ala Ala Ile Ile Gly Ser Asn Gly Ser Tyr Asn Gly Ile Thr Ser 
65                  70                  75                  80  


Asn Val Lys Gly Thr Phe Val Lys Met Ser Ser Ser Thr Asn Asn Ala 
                85                  90                  95      


Asp Thr Tyr Leu Glu Ile Ser Phe Thr Gly Gly Thr Leu Glu Pro Gly 
            100                 105                 110         


Ala His Val Gln Ile Gln Gly Arg Phe Ala Lys Asn Asp Trp Ser Asn 
        115                 120                 125             


Tyr Thr Gln Ser Asn Asp Tyr Ser Phe Lys Ser Ala Ser Gln Phe Val 
    130                 135                 140                 


Glu Trp Asp Gln Val Thr Ala Tyr Leu Asn Gly Val Leu Val Trp Gly 
145                 150                 155                 160 


Lys Glu Pro Gly Gly Ser Val Val Pro Ser Thr Gln Pro Val Thr Thr 
                165                 170                 175     


Pro Pro Ala Thr Thr Lys Pro Pro Ala Thr Thr Lys Pro Pro Ala Thr 
            180                 185                 190         


Thr Ile Pro Pro Ser Gly Ser Asp Leu Gln Val Asp Ile Gly Ser Thr 
        195                 200                 205             


Ser Gly Lys Ala Gly Ser Val Val Ser Val Pro Ile Thr Phe Thr Asn 
    210                 215                 220                 


Val Pro Lys Ser Gly Ile Tyr Ala Leu Ser Phe Arg Thr Asn Phe Asp 
225                 230                 235                 240 


Pro Gln Lys Val Thr Val Ala Ser Ile Asp Ala Gly Ser Leu Ile Glu 
                245                 250                 255     


Asn Ala Ser Asp Phe Thr Thr Tyr Tyr Asn Asn Glu Asn Gly Phe Ala 
            260                 265                 270         


Ser Met Thr Phe Glu Ala Pro Val Asp Arg Ala Arg Ile Ile Asp Ser 
        275                 280                 285             


Asp Gly Val Phe Ala Thr Ile Asn Phe Lys Val Ser Asp Ser Ala Lys 
    290                 295                 300                 


Val Gly Glu Leu Tyr Asn Ile Thr Thr Asn Ser Ala Tyr Thr Ser Phe 
305                 310                 315                 320 


Tyr Tyr Ser Gly Thr Asp Glu Ile Lys Asn Val Val Tyr Asn Asp Gly 
                325                 330                 335     


Lys Ile Glu Val Ile Ala Ser Pro Thr Pro Thr Gln Ser Ala Thr Pro 
            340                 345                 350         


Thr Val Thr Pro Ser Ala Thr Ala Thr Pro Thr Gln Ser Ala Thr Pro 
        355                 360                 365             


Thr Val Thr Pro Ser Ser Pro Gly Asn Lys Met Lys Ile Gln Ile Gly 
    370                 375                 380                 


Asp Val Lys Ala Asn Gln Gly Asp Thr Val Ile Val Pro Ile Thr Phe 
385                 390                 395                 400 


Asn Glu Val Pro Val Met Gly Val Asn Asn Cys Asn Phe Thr Leu Ala 
                405                 410                 415     


Tyr Asp Lys Asn Ile Met Glu Phe Ile Ser Ala Asp Ala Gly Asp Ile 
            420                 425                 430         


Val Thr Leu Pro Met Ala Asn Tyr Ser Tyr Asn Met Pro Ser Asp Gly 
        435                 440                 445             


Leu Val Lys Phe Leu Tyr Asn Asp Gln Ala Gln Gly Ala Met Ser Ile 
    450                 455                 460                 


Lys Glu Asp Gly Thr Phe Ala Asn Val Lys Phe Lys Ile Lys Gln Ser 
465                 470                 475                 480 


Ala Ala Phe Gly Lys Tyr Ser Val Gly Ile Lys Ala Ile Gly Ser Ile 
                485                 490                 495     


Ser Ala Leu Ser Asn Ser Lys Leu Ile Pro Ile Glu Ser Ile Phe Lys 
            500                 505                 510         


Asp Gly Ser Ile Thr Val Thr Asn Thr Pro Thr Asn Thr Ile Ser Val 
        515                 520                 525             


Thr Pro Thr Asn Asn Ser Thr Pro Thr Asn Asn Ser Thr Pro Lys Pro 
    530                 535                 540                 


Asn Pro Leu Ser Asp Gly Val Val Val Glu Ile Gly Lys Val Thr Gly 
545                 550                 555                 560 


Ser Val Gly Thr Thr Val Glu Ile Pro Val Tyr Phe Arg Gly Val Pro 
                565                 570                 575     


Ser Lys Gly Ile Ala Asn Cys Asp Phe Val Phe Arg Tyr Asp Pro Asn 
            580                 585                 590         


Val Leu Glu Ile Ile Gly Ile Asp Pro Gly Asp Ile Ile Val Asp Pro 
        595                 600                 605             


Asn Pro Thr Lys Ser Phe Asp Thr Ala Ile Tyr Pro Asp Arg Lys Ile 
    610                 615                 620                 


Ile Val Phe Leu Phe Ala Glu Asp Ser Gly Thr Gly Ala Tyr Ala Ile 
625                 630                 635                 640 


Thr Lys Asp Gly Val Phe Ala Lys Ile Arg Ala Thr Val Lys Ser Ser 
                645                 650                 655     


Ala Pro Gly Tyr Ile Thr Phe Asp Glu Val Gly Gly Phe Ala Asp Asn 
            660                 665                 670         


Asp Leu Val Glu Gln Lys Val Ser Phe Ile Asp Gly Gly Val Asn Val 
        675                 680                 685             


Gly Asn Ala Thr Arg Ser Thr Asn Lys Pro Val Ile Glu Gly Tyr Lys 
    690                 695                 700                 


Val Ser Gly Tyr Ile Leu Pro Asp Phe Ser Phe Asp Ala Thr Val Ala 
705                 710                 715                 720 


Pro Leu Val Lys Ala Gly Phe Lys Val Glu Ile Val Gly Thr Glu Leu 
                725                 730                 735     


Tyr Ala Val Thr Asp Ala Asn Gly Tyr Phe Glu Ile Thr Gly Val Pro 
            740                 745                 750         


Ala Asn Ala Ser Gly Tyr Thr Leu Lys Ile Ser Arg Ala Thr Tyr Leu 
        755                 760                 765             


Asp Arg Val Ile Ala Asn Val Val Val Thr Gly Asp Thr Ser Val Ser 
    770                 775                 780                 


Thr Ser Gln Ala Pro Ile Met Met Trp Val Gly Asp Ile Val Lys Asp 
785                 790                 795                 800 


Asn Ser Ile Asn Leu Leu Asp Val Ala Glu Val Ile Arg Cys Phe Asn 
                805                 810                 815     


Ala Thr Lys Gly Ser Ala Asn Tyr Val Glu Glu Leu Asp Ile Asn Arg 
            820                 825                 830         


Asn Gly Ala Ile Asn Met Gln Asp Ile Met Ile Val His Lys His Phe 
        835                 840                 845             


Gly Ala Thr Ser Ser Asp Tyr Asp Ala Gln Leu Glu His His His His 
    850                 855                 860                 


His His 
865     


<210>  32
<211>  2601
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  32
atggcaaata caccggtatc aggcaatttg aaggttgaat tctacaacag caatccttca       60

gatactacta actcaatcaa tcctcagttc aaggttacta ataccggaag cagtgcaatt      120

gatttgtcca aactcacatt gagatattat tatacagtag acggacagaa agatcagacc      180

ttctggtgtg accatgctgc aataatcggc agtaacggca gctacaacgg aattacttca      240

aatgtaaaag gaacatttgt aaaaatgagt tcctcaacaa ataacgcaga cacctacctt      300

gaaataagct ttacaggcgg aactcttgaa ccgggtgcac atgttcagat acaaggtaga      360

tttgcaaaga atgactggag taactataca cagtcaaatg actactcatt caagtctgct      420

tcacagtttg ttgaatggga tcaggtaaca gcatacttga acggtgttct tgtatggggt      480

aaagaacccg gtggcagtgt agtaccatca acacagcctg taacaacacc acctgcaaca      540

acaaaaccac ctgcaacaac aaaaccacct gcaacaacaa taccgccgtc aggatccgat      600

ttacaggttg acattggaag tactagtgga aaagcaggta gtgttgttag tgtacctata      660

acatttacta atgtacctaa atcaggtatc tatgctctaa gttttagaac aaatttcgac      720

ccacaaaagg taactgtagc aagtatagat gctggctcac tgattgaaaa tgcttctgat      780

tttactactt attataataa tgaaaatggt tttgcatcaa tgacgtttga agccccagtt      840

gatagagcta gaatcataga tagtgatggt gtatttgcaa ccattaactt taaagttagt      900

gatagtgcca aagtaggtga actttacaat attactacta atagtgcata tacttcattc      960

tattattctg gaactgatga aatcaaaaat gttgtttaca atgatggaaa aattgaggta     1020

attgcaagtc ctaccccgac gcaatcagcc actccaacgg taactccttc agccaccgcg     1080

acgcctaccc agagtgctac gccgactgta acgccaagtt caccaggaaa taaaatgaaa     1140

attcaaattg gtgatgtaaa agctaatcag ggagatacag ttatagtacc tataactttc     1200

aatgaagttc ctgtaatggg tgttaataac tgtaatttca ctttagctta tgacaaaaat     1260

attatggaat ttatctctgc tgatgcaggt gatattgtaa cattgccaat ggctaactat     1320

agctacaata tgccatctga tgggctagta aaatttttat ataatgatca agctcaaggt     1380

gcaatgtcaa taaaagaaga tggtactttt gctaatgtta aatttaaaat taagcagagt     1440

gccgcatttg ggaaatattc agtaggcatc aaagcaattg gttcaatttc cgctttaagc     1500

aatagtaagt taatacctat tgaatcaata tttaaagatg gaagcattac tgtaactaat     1560

acgccgacca atactatcag tgttactccg acaaacaatt cgactcctac gaataacagt     1620

acgccaaagc caaacccgtt atccgacggt gtggtagtag aaattggcaa agttacggga     1680

tctgttggaa ctacagttga aatacctgta tatttcagag gagttccatc caaaggaata     1740

gcaaactgcg actttgtgtt cagatatgat ccgaatgtat tggaaattat agggatagat     1800

cccggagaca taatagttga cccgaatcct accaagagct ttgatactgc aatatatcct     1860

gacagaaaga taatagtatt cctgtttgcg gaagacagcg gaacaggagc gtatgcaata     1920

actaaagacg gagtatttgc aaaaataaga gcaactgtaa aatcaagtgc tccgggctat     1980

attactttcg acgaagtagg tggatttgca gataatgacc tggtagaaca gaaggtatca     2040

tttatagacg gtggtgttaa cgttggcaat gcaacaagat ccactaataa acctgtaata     2100

gaaggatata aagtatccgg atacattttg ccagacttct ccttcgacgc tactgttgca     2160

ccacttgtaa aggccggatt caaagttgaa atagtaggaa cagaattgta tgcagtaaca     2220

gatgcaaacg gatactttga aataaccgga gtacctgcaa atgcaagcgg atatacattg     2280

aagatttcaa gagcaactta cttggacaga gtaattgcaa atgttgtagt aacgggagat     2340

acttcagttt caacttcaca ggctccaata atgatgtggg taggagacat agtgaaagac     2400

aattctatca acctgttgga cgttgcagaa gttatccgtt gcttcaacgc tactaaagga     2460

agcgcaaact acgtagaaga acttgacatt aatagaaacg gcgcaattaa catgcaagac     2520

ataatgattg ttcataagca ctttggagct acatcaagtg attacgacgc acagctcgag     2580

caccaccacc accaccactg a                                               2601


<210>  33
<211>  712
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  33

Met Thr His His His His His His Ala Met Ala Lys Phe Ile Tyr Gly 
1               5                   10                  15      


Asp Val Asp Gly Asn Gly Ser Val Arg Ile Asn Asp Ala Val Leu Ile 
            20                  25                  30          


Arg Asp Tyr Val Leu Gly Lys Ile Asn Glu Phe Pro Tyr Glu Tyr Gly 
        35                  40                  45              


Met Leu Ala Ala Asp Val Asp Gly Asn Gly Ser Ile Lys Ile Asn Asp 
    50                  55                  60                  


Ala Val Leu Val Arg Asp Tyr Val Leu Gly Lys Ile Phe Leu Phe Pro 
65                  70                  75                  80  


Val Glu Glu Lys Glu Glu Val Pro Pro Leu Ala Thr Gly Thr Ala His 
                85                  90                  95      


Ala Glu Pro Ala Phe Asn Tyr Ala Glu Ala Leu Gln Lys Ser Met Phe 
            100                 105                 110         


Phe Tyr Glu Ala Gln Arg Ser Gly Lys Leu Pro Glu Asn Asn Arg Val 
        115                 120                 125             


Ser Trp Arg Gly Asp Ser Gly Leu Asn Asp Gly Ala Asp Val Gly Leu 
    130                 135                 140                 


Asp Leu Thr Gly Gly Trp Tyr Asp Ala Gly Asp His Val Lys Phe Gly 
145                 150                 155                 160 


Phe Pro Met Ala Phe Thr Ala Thr Met Leu Ala Trp Gly Ala Ile Glu 
                165                 170                 175     


Ser Pro Glu Gly Tyr Ile Arg Ser Gly Gln Met Pro Tyr Leu Lys Asp 
            180                 185                 190         


Asn Leu Arg Trp Val Asn Asp Tyr Phe Ile Lys Ala His Pro Ser Pro 
        195                 200                 205             


Asn Val Leu Tyr Val Gln Val Gly Asp Gly Asp Ala Asp His Lys Trp 
    210                 215                 220                 


Trp Gly Pro Ala Glu Val Met Pro Met Glu Arg Pro Ser Phe Lys Val 
225                 230                 235                 240 


Asp Pro Ser Cys Pro Gly Ser Asp Val Ala Ala Glu Thr Ala Ala Ala 
                245                 250                 255     


Met Ala Ala Ser Ser Ile Val Phe Ala Asp Asp Asp Pro Ala Tyr Ala 
            260                 265                 270         


Ala Thr Leu Val Gln His Ala Lys Gln Leu Tyr Thr Phe Ala Asp Thr 
        275                 280                 285             


Tyr Arg Gly Val Tyr Ser Asp Cys Val Pro Ala Gly Ala Phe Tyr Asn 
    290                 295                 300                 


Ser Trp Ser Gly Tyr Gln Asp Glu Leu Val Trp Gly Ala Tyr Trp Leu 
305                 310                 315                 320 


Tyr Lys Ala Thr Gly Asp Asp Ser Tyr Leu Ala Lys Ala Glu Tyr Glu 
                325                 330                 335     


Tyr Asp Phe Leu Ser Thr Glu Gln Gln Thr Asp Leu Arg Ser Tyr Arg 
            340                 345                 350         


Trp Thr Ile Ala Trp Asp Asp Lys Ser Tyr Gly Thr Tyr Val Leu Leu 
        355                 360                 365             


Ala Lys Glu Thr Gly Lys Gln Lys Tyr Ile Asp Asp Ala Asn Arg Trp 
    370                 375                 380                 


Leu Asp Tyr Trp Thr Val Gly Val Asn Gly Gln Arg Val Pro Tyr Ser 
385                 390                 395                 400 


Pro Gly Gly Met Ala Val Leu Asp Thr Trp Gly Ala Leu Arg Tyr Ala 
                405                 410                 415     


Ala Asn Thr Ala Phe Val Ala Leu Val Tyr Ala Lys Val Ile Asp Asp 
            420                 425                 430         


Pro Val Arg Lys Gln Arg Tyr His Asp Phe Ala Val Arg Gln Ile Asn 
        435                 440                 445             


Tyr Ala Leu Gly Asp Asn Pro Arg Asn Ser Ser Tyr Val Val Gly Phe 
    450                 455                 460                 


Gly Asn Asn Pro Pro Arg Asn Pro His His Arg Thr Ala His Gly Ser 
465                 470                 475                 480 


Trp Thr Asp Ser Ile Ala Ser Pro Ala Glu Asn Arg His Val Leu Tyr 
                485                 490                 495     


Gly Ala Leu Val Gly Gly Pro Gly Ser Pro Asn Asp Ala Tyr Thr Asp 
            500                 505                 510         


Asp Arg Gln Asp Tyr Val Ala Asn Glu Val Ala Thr Asp Tyr Asn Ala 
        515                 520                 525             


Gly Phe Ser Ser Ala Leu Ala Met Leu Val Glu Glu Tyr Gly Gly Thr 
    530                 535                 540                 


Pro Leu Ala Asp Phe Pro Pro Thr Glu Glu Pro Asp Gly Pro Glu Ile 
545                 550                 555                 560 


Phe Val Glu Ala Gln Ile Asn Thr Pro Gly Thr Thr Phe Thr Glu Ile 
                565                 570                 575     


Lys Ala Met Ile Arg Asn Gln Ser Gly Trp Pro Ala Arg Met Leu Asp 
            580                 585                 590         


Lys Gly Thr Phe Arg Tyr Trp Phe Thr Leu Asp Glu Gly Val Asp Pro 
        595                 600                 605             


Ala Asp Ile Thr Val Ser Ser Ala Tyr Asn Gln Cys Ala Thr Pro Glu 
    610                 615                 620                 


Asp Val His His Val Ser Gly Asp Leu Tyr Tyr Val Glu Ile Asp Cys 
625                 630                 635                 640 


Thr Gly Glu Lys Ile Phe Pro Gly Gly Gln Ser Glu His Arg Arg Glu 
                645                 650                 655     


Val Gln Phe Arg Ile Ala Gly Gly Pro Gly Trp Asp Pro Ser Asn Asp 
            660                 665                 670         


Trp Ser Phe Gln Gly Ile Gly Asn Glu Leu Ala Pro Ala Pro Tyr Ile 
        675                 680                 685             


Val Leu Tyr Asp Asp Gly Val Pro Val Trp Gly Thr Ala Pro Glu Glu 
    690                 695                 700                 


Gly Glu Glu Pro Gly Gly Gly Glu 
705                 710         


<210>  34
<211>  2148
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  34
atgacccatc accatcacca tcacgccatg gctaaattta tatatggtga tgttgatggt       60

aatggaagtg taagaattaa tgatgctgtc ctaataagag actatgtatt aggaaaaatc      120

aatgaattcc catatgaata tggtatgctt gcagcagatg ttgatggtaa tggaagtata      180

aaaattaatg atgctgttct agtaagagac tacgtgttag gaaagatatt tttattccct      240

gttgaagaga aagaagaggt accccccttg gccacgggaa ccgcccacgc cgaaccggcg      300

ttcaactacg ccgaagccct ccagaagtcg atgttcttct acgaggccca acgctccggg      360

aaactcccgg agaacaaccg ggtctcctgg cgcggcgact ccgggctcaa cgacggcgcg      420

gacgtgggac tcgacctcac cggcggctgg tacgacgccg gcgaccacgt gaaattcggc      480

ttccccatgg ccttcaccgc gaccatgctc gcctggggcg ccatcgaaag cccggaaggc      540

tacatccgct ccggccagat gccctacctc aaggacaacc tgcgctgggt caacgactac      600

ttcatcaaag cccacccctc gcccaacgtg ctgtacgtgc aggtcggcga cggcgacgcc      660

gaccacaagt ggtggggtcc ggccgaagtc atgccgatgg agcggcccag cttcaaagtg      720

gacccctcct gcccgggcag cgacgtcgca gccgaaaccg ccgcggccat ggccgcgtcc      780

tccatcgtgt tcgccgacga cgaccctgcg tacgcggcca ccctcgtgca gcacgccaag      840

cagctctaca cgttcgccga cacctaccgc ggcgtgtact ccgactgcgt gcccgccgga      900

gcgttctaca actcctggtc gggctaccag gacgagctcg tctggggcgc ctactggctg      960

tacaaggcca ccggggacga ctcctacttg gcgaaggccg agtacgagta cgacttcctc     1020

tccaccgagc agcagaccga cctccgcagc taccggtgga ccatcgcctg ggacgacaag     1080

tcctacggca cctacgtgct gctcgccaag gaaaccggca agcaaaaata catcgacgac     1140

gccaaccggt ggctcgacta ctggacggtc ggcgtcaacg gccagcgcgt gccctactcc     1200

cccggcggga tggctgtgct cgacacctgg ggagccctgc gctacgccgc taacaccgcg     1260

ttcgtcgccc tcgtctacgc caaggtgatc gacgaccccg tccgcaagca gcgataccac     1320

gacttcgcgg tgcggcagat caactacgcg ctcggcgaca acccgcggaa ctccagctac     1380

gtggtgggct tcggcaacaa cccgccgcgc aacccccacc accgcaccgc gcacgggtcg     1440

tggaccgaca gcatcgcctc gcccgcggag aaccggcacg tcctctacgg cgccctcgtc     1500

ggcggtcccg gctccccgaa cgacgcctac accgacgacc ggcaggacta cgtcgccaac     1560

gaagtcgcca ccgactacaa cgccggattc tccagcgcgc tggccatgct ggtcgaagag     1620

tacggcggca ccccgctggc ggacttcccg cccaccgagg agcccgacgg accggagatc     1680

ttcgtggaag cccagatcaa cacgccgggc accacgttca ccgagatcaa agccatgatc     1740

cgcaaccagt cgggctggcc ggcccggatg ctggacaagg gcaccttccg gtactggttc     1800

accctcgatg aaggcgtgga ccccgcggac atcacggtga gctccgccta caaccagtgc     1860

gccaccccgg aggacgtcca ccacgtctcc ggcgacctgt actacgtgga gatcgactgc     1920

accggggaga agatcttccc cggcggccag tcggagcacc gccgcgaagt ccagttccgc     1980

atcgccggcg gccccggatg ggacccctcc aacgactggt ccttccaagg catcggcaac     2040

gaactcgccc ccgccccgta catcgtgctc tacgacgacg gtgtaccggt gtggggcacc     2100

gcccccgagg aaggggaaga gcccggcggc ggagaataac tcgagtga                  2148


<210>  35
<211>  749
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  35

Met Ala His His His His His His Pro Lys Gly Thr Ala Thr Val Leu 
1               5                   10                  15      


Tyr Gly Asp Val Asp Asn Asp Gly Asn Val Asp Ser Asp Asp Tyr Ala 
            20                  25                  30          


Tyr Met Arg Gln Trp Leu Ile Gly Met Ile Ala Asp Phe Pro Gly Gly 
        35                  40                  45              


Asp Ile Gly Leu Ala Asn Ala Asp Val Asp Gly Asp Gly Asn Val Asp 
    50                  55                  60                  


Ser Asp Asp Tyr Ala Tyr Met Arg Gln Trp Leu Ile Gly Met Ile Ser 
65                  70                  75                  80  


Glu Phe Pro Ala Glu Gln Lys Ala Val Pro Gly His Asp Ser Ala Glu 
                85                  90                  95      


Val Thr Val Arg Glu Ile Asp Pro Asn Thr Ser Ser Tyr Asp Gln Ala 
            100                 105                 110         


Phe Leu Glu Gln Tyr Glu Lys Ile Lys Asp Pro Ala Ser Gly Tyr Phe 
        115                 120                 125             


Arg Glu Phe Asn Gly Leu Leu Val Pro Tyr His Ser Val Glu Thr Met 
    130                 135                 140                 


Ile Val Glu Ala Pro Asp His Gly His Gln Thr Thr Ser Glu Ala Phe 
145                 150                 155                 160 


Ser Tyr Tyr Leu Trp Leu Glu Ala Tyr Tyr Gly Arg Val Thr Gly Asp 
                165                 170                 175     


Trp Lys Pro Leu His Asp Ala Trp Glu Ser Met Glu Thr Phe Ile Ile 
            180                 185                 190         


Pro Gly Thr Lys Asp Gln Pro Thr Asn Ser Ala Tyr Asn Pro Asn Ser 
        195                 200                 205             


Pro Ala Thr Tyr Ile Pro Glu Gln Pro Asn Ala Asp Gly Tyr Pro Ser 
    210                 215                 220                 


Pro Leu Met Asn Asn Val Pro Val Gly Gln Asp Pro Leu Ala Gln Glu 
225                 230                 235                 240 


Leu Ser Ser Thr Tyr Gly Thr Asn Glu Ile Tyr Gly Met His Trp Leu 
                245                 250                 255     


Leu Asp Val Asp Asn Val Tyr Gly Phe Gly Phe Cys Gly Asp Gly Thr 
            260                 265                 270         


Asp Asp Ala Pro Ala Tyr Ile Asn Thr Tyr Gln Arg Gly Ala Arg Glu 
        275                 280                 285             


Ser Val Trp Glu Thr Ile Pro His Pro Ser Cys Asp Asp Phe Thr His 
    290                 295                 300                 


Gly Gly Pro Asn Gly Tyr Leu Asp Leu Phe Thr Asp Asp Gln Asn Tyr 
305                 310                 315                 320 


Ala Lys Gln Trp Arg Tyr Thr Asn Ala Pro Asp Ala Asp Ala Arg Ala 
                325                 330                 335     


Val Gln Val Met Phe Trp Ala His Glu Trp Ala Lys Glu Gln Gly Lys 
            340                 345                 350         


Glu Asn Glu Ile Ala Gly Leu Met Asp Lys Ala Ser Lys Met Gly Asp 
        355                 360                 365             


Tyr Leu Arg Tyr Ala Met Phe Asp Lys Tyr Phe Lys Lys Ile Gly Asn 
    370                 375                 380                 


Cys Val Gly Ala Thr Ser Cys Pro Gly Gly Gln Gly Lys Asp Ser Ala 
385                 390                 395                 400 


His Tyr Leu Leu Ser Trp Tyr Tyr Ser Trp Gly Gly Ser Leu Asp Thr 
                405                 410                 415     


Ser Ser Ala Trp Ala Trp Arg Ile Gly Ser Ser Ser Ser His Gln Gly 
            420                 425                 430         


Tyr Gln Asn Val Leu Ala Ala Tyr Ala Leu Ser Gln Val Pro Glu Leu 
        435                 440                 445             


Gln Pro Asp Ser Pro Thr Gly Val Gln Asp Trp Ala Thr Ser Phe Asp 
    450                 455                 460                 


Arg Gln Leu Glu Phe Leu Gln Trp Leu Gln Ser Ala Glu Gly Gly Ile 
465                 470                 475                 480 


Ala Gly Gly Ala Thr Asn Ser Trp Lys Gly Ser Tyr Asp Thr Pro Pro 
                485                 490                 495     


Thr Gly Leu Ser Gln Phe Tyr Gly Met Tyr Tyr Asp Trp Gln Pro Val 
            500                 505                 510         


Trp Asn Asp Pro Pro Ser Asn Asn Trp Phe Gly Phe Gln Val Trp Asn 
        515                 520                 525             


Met Glu Arg Val Ala Gln Leu Tyr Tyr Val Thr Gly Asp Ala Arg Ala 
    530                 535                 540                 


Glu Ala Ile Leu Asp Lys Trp Val Pro Trp Ala Ile Gln His Thr Asp 
545                 550                 555                 560 


Val Asp Ala Asp Asn Gly Gly Gln Asn Phe Gln Val Pro Ser Asp Leu 
                565                 570                 575     


Glu Trp Ser Gly Gln Pro Asp Thr Trp Thr Gly Thr Tyr Thr Gly Asn 
            580                 585                 590         


Pro Asn Leu His Val Gln Val Val Ser Tyr Ser Gln Asp Val Gly Val 
        595                 600                 605             


Thr Ala Ala Leu Ala Lys Thr Leu Met Tyr Tyr Ala Lys Arg Ser Gly 
    610                 615                 620                 


Asp Thr Thr Ala Leu Ala Thr Ala Glu Gly Leu Leu Asp Ala Leu Leu 
625                 630                 635                 640 


Ala His Arg Asp Ser Ile Gly Ile Ala Thr Pro Glu Gln Pro Ser Trp 
                645                 650                 655     


Asp Arg Leu Asp Asp Pro Trp Asp Gly Ser Glu Gly Leu Tyr Val Pro 
            660                 665                 670         


Pro Gly Trp Ser Gly Thr Met Pro Asn Gly Asp Arg Ile Glu Pro Gly 
        675                 680                 685             


Ala Thr Phe Leu Ser Ile Arg Ser Phe Tyr Lys Asn Asp Pro Leu Trp 
    690                 695                 700                 


Pro Gln Val Glu Ala His Leu Asn Asp Pro Gln Asn Val Pro Ala Pro 
705                 710                 715                 720 


Ile Val Glu Arg His Arg Phe Trp Ala Gln Val Glu Ile Ala Thr Ala 
                725                 730                 735     


Phe Ala Ala His Asp Glu Leu Phe Gly Ala Gly Ala Pro 
            740                 745                 


<210>  36
<211>  2250
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  36
atggcccacc atcaccatca ccatccaaaa ggcacagcta cagtattata tggtgacgtt       60

gataatgatg gaaatgttga ttcagacgac tatgcatata tgagacaatg gttgatcggt      120

atgattgctg atttccctgg aggagatatc ggattagcta atgctgatgt tgatggagac      180

ggaaatgtag attcagatga ctatgcgtac atgagacaat ggttaatagg aatgatttcc      240

gagttcccag cagaacaaaa agcggtaccc ggccacgact cggccgaggt gacggtccgg      300

gagatcgacc cgaacaccag ctcctacgac caggccttcc tggagcagta cgagaagatc      360

aaggaccccg ccagcggcta cttccgcgaa ttcaacgggc tcctggtccc ctaccactcg      420

gtggagacca tgatcgtcga ggctccggac cacggccacc agaccacgtc cgaggcgttc      480

agctactacc tgtggctgga ggcgtactac ggccgggtca ccggtgactg gaagccgctc      540

cacgacgcct gggagtcgat ggagaccttc atcatccccg gcaccaagga ccagccgacc      600

aactccgcct acaacccgaa ctccccggcg acctacatcc ccgagcagcc caacgctgac      660

ggctacccgt cgcctctcat gaacaacgtc ccggtgggtc aagacccgct cgcccaggag      720

ctgagctcca cctacgggac caacgagatc tacggcatgc actggctgct cgacgtggac      780

aacgtctacg gcttcgggtt ctgcggcgac ggcaccgacg acgcccccgc ctacatcaac      840

acctaccagc gtggtgcgcg cgagtcggtg tgggagacca ttccgcaccc gtcctgcgac      900

gacttcacgc acggcggccc caacggctac ctggacctgt tcaccgacga ccagaactac      960

gccaagcagt ggcgctacac caacgccccc gacgctgacg cgcgggccgt ccaggtgatg     1020

ttctgggcgc acgaatgggc caaggagcag ggcaaggaga acgagatcgc gggcctgatg     1080

gacaaggcgt ccaagatggg cgactacctc cggtacgcga tgttcgacaa gtacttcaag     1140

aagatcggca actgcgtcgg cgccacctcc tgcccgggtg gccaaggcaa ggacagcgcg     1200

cactacctgc tgtcctggta ctactcctgg ggcggctcgc tcgacacctc ctctgcgtgg     1260

gcgtggcgta tcggctccag ctcctcgcac cagggctacc agaacgtgct cgctgcctac     1320

gcgctctcgc aggtgcccga actgcagcct gactccccga ccggtgtcca ggactgggcc     1380

accagcttcg accgccagtt ggagttcctc cagtggctgc agtccgctga aggtggtatc     1440

gccggtggcg ccaccaacag ctggaaggga agctacgaca ccccgccgac cggcctgtcg     1500

cagttctacg gcatgtacta cgactggcag ccggtctgga acgacccgcc gtccaacaac     1560

tggttcggct tccaggtctg gaacatggag cgcgtcgccc agctctacta cgtgaccggc     1620

gacgcccggg ccgaggccat cctcgacaag tgggtgccgt gggccatcca gcacaccgac     1680

gtggacgccg acaacggcgg ccagaacttc caggtcccct ccgacctgga gtggtcgggc     1740

cagcctgaca cctggaccgg cacctacacc ggcaacccga acctgcacgt ccaggtcgtc     1800

tcctacagcc aggacgtcgg tgtgaccgcc gctctggcca agaccctgat gtactacgcg     1860

aagcgttcgg gcgacaccac cgccctcgcc accgcggagg gtctgctgga cgccctgctg     1920

gcccaccggg acagcatcgg tatcgccacc cccgagcagc cgagctggga ccgtctggac     1980

gacccgtggg acggctccga gggcctgtac gtgccgccgg gctggtcggg caccatgccc     2040

aacggtgacc gcatcgagcc gggcgcgacc ttcctgtcca tccgctcgtt ctacaagaac     2100

gacccgctgt ggccgcaggt cgaggcacac ctgaacgacc cgcagaacgt cccggcgccg     2160

atcgtggagc gccaccgctt ctgggctcag gtggaaatcg cgaccgcgtt cgcagcccac     2220

gacgaactgt tcggggccgg agctccctga                                      2250


<210>  37
<211>  384
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  37

Met Val Glu Arg Tyr Gly Lys Val Gln Val Cys Gly Thr Gln Leu Cys 
1               5                   10                  15      


Asp Glu His Gly Asn Pro Val Gln Leu Arg Gly Met Ser Thr His Gly 
            20                  25                  30          


Ile Gln Trp Phe Asp His Cys Leu Thr Asp Ser Ser Leu Asp Ala Leu 
        35                  40                  45              


Ala Tyr Asp Trp Lys Ala Asp Ile Ile Arg Leu Ser Met Tyr Ile Gln 
    50                  55                  60                  


Glu Asp Gly Tyr Glu Thr Asn Pro Arg Gly Phe Thr Asp Arg Met His 
65                  70                  75                  80  


Gln Leu Ile Asp Met Ala Thr Ala Arg Gly Leu Tyr Val Ile Val Asp 
                85                  90                  95      


Trp His Ile Leu Thr Pro Gly Asp Pro His Tyr Asn Leu Asp Arg Ala 
            100                 105                 110         


Lys Thr Phe Phe Ala Glu Ile Ala Gln Arg His Ala Ser Lys Thr Asn 
        115                 120                 125             


Val Leu Tyr Glu Ile Ala Asn Glu Pro Asn Gly Val Ser Trp Ala Ser 
    130                 135                 140                 


Ile Lys Ser Tyr Ala Glu Glu Val Ile Pro Val Ile Arg Gln Arg Asp 
145                 150                 155                 160 


Pro Asp Ser Val Ile Ile Val Gly Thr Arg Gly Trp Ser Ser Leu Gly 
                165                 170                 175     


Val Ser Glu Gly Ser Gly Pro Ala Glu Ile Ala Ala Asn Pro Val Asn 
            180                 185                 190         


Ala Ser Asn Ile Met Tyr Ala Phe His Phe Tyr Ala Ala Ser His Arg 
        195                 200                 205             


Asp Asn Tyr Leu Asn Ala Leu Arg Glu Ala Ser Glu Leu Phe Pro Val 
    210                 215                 220                 


Phe Val Thr Glu Phe Gly Thr Glu Thr Tyr Thr Gly Asp Gly Ala Asn 
225                 230                 235                 240 


Asp Phe Gln Met Ala Asp Arg Tyr Ile Asp Leu Met Ala Glu Arg Lys 
                245                 250                 255     


Ile Gly Trp Thr Lys Trp Asn Tyr Ser Asp Asp Phe Arg Ser Gly Ala 
            260                 265                 270         


Val Phe Gln Pro Gly Thr Cys Ala Ser Gly Gly Pro Trp Ser Gly Ser 
        275                 280                 285             


Ser Leu Lys Ala Ser Gly Gln Trp Val Arg Ser Lys Leu Gln Ser Val 
    290                 295                 300                 


Pro Glu Ser Ser Ser Thr Gly Leu Gly Asp Leu Asn Gly Asp Gly Asn 
305                 310                 315                 320 


Ile Asn Ser Ser Asp Leu Gln Ala Leu Lys Arg His Leu Leu Gly Ile 
                325                 330                 335     


Ser Pro Leu Thr Gly Glu Ala Leu Leu Arg Ala Asp Val Asn Arg Ser 
            340                 345                 350         


Gly Lys Val Asp Ser Thr Asp Tyr Ser Val Leu Lys Arg Tyr Ile Leu 
        355                 360                 365             


Arg Ile Ile Thr Glu Phe Pro Gly Leu Glu His His His His His His 
    370                 375                 380                 


<210>  38
<211>  1155
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  38
atggtcgagc ggtacggcaa agtccaggtc tgcggcaccc agctctgcga cgagcacggc       60

aacccggtcc aactgcgcgg catgagcacc cacggcatcc agtggttcga ccactgcctg      120

accgacagct cgctggacgc cctggcctac gactggaagg ccgacatcat ccgcctgtcc      180

atgtacatcc aggaagacgg ctacgagacc aacccgcgcg gcttcaccga ccggatgcac      240

cagctcatcg acatggccac ggcgcgcggc ctgtacgtga tcgtggactg gcacatcctc      300

accccgggcg atccccacta caacctggac cgggccaaga ccttcttcgc ggaaatcgcc      360

cagcgccacg ccagcaagac caacgtgctc tacgagatcg ccaacgaacc caacggagtg      420

agctgggcct ccatcaagag ctacgccgaa gaggtcatcc cggtgatccg ccagcgcgac      480

cccgactcgg tgatcatcgt gggcacccgc ggctggtcgt cgctcggcgt ctccgaaggc      540

tccggccccg ccgagatcgc ggccaacccg gtcaacgcct ccaacatcat gtacgccttc      600

cacttctacg cggcctcgca ccgcgacaac tacctcaacg cgctgcgtga ggcctccgag      660

ctgttcccgg tcttcgtcac cgagttcggc accgagacct acaccggtga cggcgccaac      720

gacttccaga tggccgaccg ctacatcgac ctgatggcgg aacggaagat cgggtggacc      780

aagtggaact actcggacga cttccgttcc ggcgcggtct tccagccggg cacctgcgcg      840

tccggcggcc cgtggagcgg ttcgtcgctg aaggcgtccg gacagtgggt gcggagcaag      900

ctccagtcgg tacctgaaag cagttccaca ggtctggggg atttaaatgg tgacggaaat      960

attaactcgt cggaccttca ggcgttaaag aggcatttgc tcggtatatc accgcttacg     1020

ggagaggctc ttttaagagc ggatgtaaat aggagcggca aagtggattc tactgactat     1080

tcagtgctga aaagatatat actccgcatt attacagagt tccccggact cgagcaccac     1140

caccaccacc actga                                                      1155


<210>  39
<211>  165
<212>  PRT
<213>  Clostridium cellulolyticum

<400>  39

Asp Ser Leu Lys Val Thr Val Gly Thr Ala Asn Gly Lys Pro Gly Asp 
1               5                   10                  15      


Thr Val Thr Val Pro Val Thr Phe Ala Asp Val Ala Lys Met Lys Asn 
            20                  25                  30          


Val Gly Thr Cys Asn Phe Tyr Leu Gly Tyr Asp Ala Ser Leu Leu Glu 
        35                  40                  45              


Val Val Ser Val Asp Ala Gly Pro Ile Val Lys Asn Ala Ala Val Asn 
    50                  55                  60                  


Phe Ser Ser Ser Ala Ser Asn Gly Thr Ile Ser Phe Leu Phe Leu Asp 
65                  70                  75                  80  


Asn Thr Ile Thr Asp Glu Leu Ile Thr Ala Asp Gly Val Phe Ala Asn 
                85                  90                  95      


Ile Lys Phe Lys Leu Lys Ser Val Thr Ala Lys Thr Thr Thr Pro Val 
            100                 105                 110         


Thr Phe Lys Asp Gly Gly Ala Phe Gly Asp Gly Thr Met Ser Lys Ile 
        115                 120                 125             


Ala Ser Val Thr Lys Thr Asn Gly Ser Val Thr Ile Asp Pro Thr Lys 
    130                 135                 140                 


Gly Ala Thr Pro Thr Asn Thr Ala Thr Pro Thr Lys Ser Ala Thr Ala 
145                 150                 155                 160 


Thr Pro Thr Arg Pro 
                165 


<210>  40
<211>  158
<212>  PRT
<213>  Archaeoglobus fulgidus

<400>  40

Val Pro Pro Lys Thr Thr Ile Ile Ala Gly Ser Ala Glu Ala Pro Gln 
1               5                   10                  15      


Gly Ser Asp Ile Gln Val Pro Val Lys Ile Glu Asn Ala Asp Lys Val 
            20                  25                  30          


Gly Ser Ile Asn Leu Ile Leu Ser Tyr Pro Asn Val Leu Glu Val Glu 
        35                  40                  45              


Asp Val Leu Gln Gly Ser Leu Thr Gln Asn Ser Leu Phe Asp Tyr Asn 
    50                  55                  60                  


Val Glu Gly Asn Gln Ile Lys Val Gly Ile Ala Asp Ser Asn Gly Ile 
65                  70                  75                  80  


Ser Gly Asp Gly Ser Leu Phe Tyr Val Lys Phe Arg Val Thr Gly Asn 
                85                  90                  95      


Glu Lys Ala Glu Gln Ala Glu Asn Val Lys Gly Lys Leu Arg Gly Leu 
            100                 105                 110         


Gly Gln Gln Leu Ser Glu Ile Thr Leu Arg Asn Ser His Ala Leu Thr 
        115                 120                 125             


Leu Gln Gly Ile Glu Ile Tyr Asp Ile Asp Gly Asn Ser Val Lys Val 
    130                 135                 140                 


Ala Thr Ile Asn Gly Thr Phe Arg Ile Val Ser Gln Glu Glu 
145                 150                 155             


<210>  41
<211>  126
<212>  PRT
<213>  Ruminococcus flavefaciens

<400>  41

Gly Thr Val Glu Trp Leu Ile Pro Thr Val Thr Ala Ala Pro Gly Gln 
1               5                   10                  15      


Thr Val Thr Met Pro Val Val Val Lys Ser Ser Ser Leu Ala Val Ala 
            20                  25                  30          


Gly Ala Gln Phe Lys Ile Gln Ala Ala Thr Gly Val Arg Tyr Ser Ser 
        35                  40                  45              


Lys Thr Asp Gly Asp Ala Tyr Gly Ser Gly Ile Val Tyr Asn Asn Ser 
    50                  55                  60                  


Lys Tyr Ala Phe Gly Gln Gly Ala Gly Arg Gly Ile Val Ala Ala Asp 
65                  70                  75                  80  


Asp Ser Val Val Leu Thr Leu Ala Tyr Thr Val Pro Ala Asp Cys Ala 
                85                  90                  95      


Glu Gly Thr Tyr Asp Val Lys Trp Ser Asp Ala Phe Val Ser Asp Thr 
            100                 105                 110         


Asp Gly Gln Asn Ile Thr Ser Lys Val Thr Leu Thr Asp Gly 
        115                 120                 125     


<210>  42
<211>  202
<212>  PRT
<213>  Clostridium thermocellum

<400>  42

Val Ala Leu Glu Leu Asp Lys Thr Lys Val Lys Val Gly Asp Ile Ile 
1               5                   10                  15      


Thr Ala Thr Ile Lys Ile Glu Asn Met Lys Asn Phe Ala Gly Tyr Gln 
            20                  25                  30          


Leu Asn Ile Lys Tyr Asp Pro Thr Met Leu Glu Ala Ile Glu Leu Glu 
        35                  40                  45              


Thr Gly Ser Ala Ile Ala Lys Arg Thr Trp Pro Val Thr Gly Gly Thr 
    50                  55                  60                  


Val Leu Gln Ser Asp Asn Tyr Gly Lys Thr Thr Ala Val Ala Asn Asp 
65                  70                  75                  80  


Val Gly Ala Gly Ile Ile Asn Phe Ala Glu Ala Tyr Ser Asn Leu Thr 
                85                  90                  95      


Lys Tyr Arg Glu Thr Gly Val Ala Glu Glu Thr Gly Ile Ile Gly Lys 
            100                 105                 110         


Ile Gly Phe Arg Val Leu Lys Ala Gly Ser Thr Ala Ile Arg Phe Glu 
        115                 120                 125             


Asp Thr Thr Ala Met Pro Gly Ala Ile Glu Gly Thr Tyr Met Phe Asp 
    130                 135                 140                 


Trp Tyr Gly Glu Asn Ile Lys Gly Tyr Ser Val Val Gln Pro Gly Glu 
145                 150                 155                 160 


Ile Val Val Glu Gly Glu Glu Pro Gly Glu Glu Pro Thr Glu Glu Pro 
                165                 170                 175     


Val Pro Thr Glu Thr Ser Val Asp Pro Thr Pro Thr Val Thr Glu Glu 
            180                 185                 190         


Pro Val Pro Ser Glu Leu Pro Asp Ser Tyr 
        195                 200         


<210>  43
<211>  1190
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  43

Met Gly Val Ala Leu Glu Leu Asp Lys Thr Lys Val Lys Val Gly Asp 
1               5                   10                  15      


Ile Ile Thr Ala Thr Ile Lys Ile Glu Asn Met Lys Asn Phe Ala Gly 
            20                  25                  30          


Tyr Gln Leu Asn Ile Lys Tyr Asp Pro Thr Met Leu Glu Ala Ile Glu 
        35                  40                  45              


Leu Glu Thr Gly Ser Ala Ile Ala Lys Arg Thr Trp Pro Val Thr Gly 
    50                  55                  60                  


Gly Thr Val Leu Gln Ser Asp Asn Tyr Gly Lys Thr Thr Ala Val Ala 
65                  70                  75                  80  


Asn Asp Val Gly Ala Gly Ile Ile Asn Phe Ala Glu Ala Tyr Ser Asn 
                85                  90                  95      


Leu Thr Lys Tyr Arg Glu Thr Gly Val Ala Glu Glu Thr Gly Ile Ile 
            100                 105                 110         


Gly Lys Ile Gly Phe Arg Val Leu Lys Ala Gly Ser Thr Ala Ile Arg 
        115                 120                 125             


Phe Glu Asp Thr Thr Ala Met Pro Gly Ala Ile Glu Gly Thr Tyr Met 
    130                 135                 140                 


Phe Asp Trp Tyr Gly Glu Asn Ile Lys Gly Tyr Ser Val Val Gln Pro 
145                 150                 155                 160 


Gly Glu Ile Val Val Glu Gly Glu Glu Pro Gly Glu Glu Pro Thr Glu 
                165                 170                 175     


Glu Pro Val Pro Thr Glu Thr Ser Val Asp Pro Thr Pro Thr Val Thr 
            180                 185                 190         


Glu Glu Pro Val Pro Ser Glu Leu Pro Asp Ser Tyr Ala Arg Leu Lys 
        195                 200                 205             


Val Thr Val Gly Thr Ala Asn Gly Lys Pro Gly Asp Thr Val Thr Val 
    210                 215                 220                 


Pro Val Thr Phe Ala Asp Val Ala Lys Met Lys Asn Val Gly Thr Cys 
225                 230                 235                 240 


Asn Phe Tyr Leu Gly Tyr Asp Ala Ser Leu Leu Glu Val Val Ser Val 
                245                 250                 255     


Asp Ala Gly Pro Ile Val Lys Asn Ala Ala Val Asn Phe Ser Ser Ser 
            260                 265                 270         


Ala Ser Asn Gly Thr Ile Ser Phe Leu Phe Leu Asp Asn Thr Ile Thr 
        275                 280                 285             


Asp Glu Leu Ile Thr Ala Asp Gly Val Phe Ala Asn Ile Lys Phe Lys 
    290                 295                 300                 


Leu Lys Ser Val Thr Ala Lys Thr Thr Thr Pro Val Thr Phe Lys Asp 
305                 310                 315                 320 


Gly Gly Ala Phe Gly Asp Gly Thr Met Ser Lys Ile Ala Ser Val Thr 
                325                 330                 335     


Lys Thr Asn Gly Ser Val Thr Ile Asp Pro Thr Lys Gly Ala Thr Pro 
            340                 345                 350         


Thr Asn Thr Ala Thr Pro Thr Lys Ser Ala Thr Ala Thr Pro Thr Arg 
        355                 360                 365             


Pro Ser Val Pro Arg Pro His Leu Gln Val Asp Ile Gly Ser Thr Ser 
    370                 375                 380                 


Gly Lys Ala Gly Ser Val Val Ser Val Pro Ile Thr Phe Thr Asn Val 
385                 390                 395                 400 


Pro Lys Ser Gly Ile Tyr Ala Leu Ser Phe Arg Thr Asn Phe Asp Pro 
                405                 410                 415     


Gln Lys Val Thr Val Ala Ser Ile Asp Ala Gly Ser Leu Ile Glu Asn 
            420                 425                 430         


Ala Ser Asp Phe Thr Thr Tyr Tyr Asn Asn Glu Asn Gly Phe Ala Ser 
        435                 440                 445             


Met Thr Phe Glu Ala Pro Val Asp Arg Ala Arg Ile Ile Asp Ser Asp 
    450                 455                 460                 


Gly Val Phe Ala Thr Ile Asn Phe Lys Val Ser Asp Ser Ala Lys Val 
465                 470                 475                 480 


Gly Glu Leu Tyr Asn Ile Thr Thr Asn Ser Ala Tyr Thr Ser Phe Tyr 
                485                 490                 495     


Tyr Ser Gly Thr Asp Glu Ile Lys Asn Val Val Tyr Asn Asp Gly Lys 
            500                 505                 510         


Ile Glu Val Ile Ala Ser Val Pro Thr Asn Thr Pro Thr Asn Thr Pro 
        515                 520                 525             


Ala Asn Thr Pro Val Ser Gly Asn Leu Lys Val Glu Phe Tyr Asn Ser 
    530                 535                 540                 


Asn Pro Ser Asp Thr Thr Asn Ser Ile Asn Pro Gln Phe Lys Val Thr 
545                 550                 555                 560 


Asn Thr Gly Ser Ser Ala Ile Asp Leu Ser Lys Leu Thr Leu Arg Tyr 
                565                 570                 575     


Tyr Tyr Thr Val Asp Gly Gln Lys Asp Gln Thr Phe Trp Cys Asp His 
            580                 585                 590         


Ala Ala Ile Ile Gly Ser Asn Gly Ser Tyr Asn Gly Ile Thr Ser Asn 
        595                 600                 605             


Val Lys Gly Thr Phe Val Lys Met Ser Ser Ser Thr Asn Asn Ala Asp 
    610                 615                 620                 


Thr Tyr Leu Glu Ile Ser Phe Thr Gly Gly Thr Leu Glu Pro Gly Ala 
625                 630                 635                 640 


His Val Gln Ile Gln Gly Arg Phe Ala Lys Asn Asp Trp Ser Asn Tyr 
                645                 650                 655     


Thr Gln Ser Asn Asp Tyr Ser Phe Lys Ser Ala Ser Gln Phe Val Glu 
            660                 665                 670         


Trp Asp Gln Val Thr Ala Tyr Leu Asn Gly Val Leu Val Trp Gly Lys 
        675                 680                 685             


Glu Pro Gly Gly Ser Val Val Pro Ser Thr Gln Pro Val Thr Thr Pro 
    690                 695                 700                 


Pro Ala Thr Thr Lys Pro Pro Ala Thr Thr Ile Pro Pro Ser Asp Asp 
705                 710                 715                 720 


Pro Asn Ala Ile Lys Ile Lys Val Asp Thr Val Asn Ala Lys Pro Gly 
                725                 730                 735     


Asp Thr Val Asn Ile Pro Val Arg Phe Ser Gly Ile Pro Ser Lys Gly 
            740                 745                 750         


Ile Ala Asn Cys Asp Phe Val Tyr Ser Tyr Asp Pro Asn Val Leu Glu 
        755                 760                 765             


Ile Ile Glu Ile Lys Pro Gly Glu Leu Ile Val Asp Pro Asn Pro Asp 
    770                 775                 780                 


Lys Ser Phe Asp Thr Ala Val Tyr Pro Asp Arg Lys Ile Ile Val Phe 
785                 790                 795                 800 


Leu Phe Ala Glu Asp Ser Gly Thr Gly Ala Tyr Ala Ile Thr Lys Asp 
                805                 810                 815     


Gly Val Phe Ala Thr Ile Val Ala Lys Val Lys Ser Gly Ala Pro Asn 
            820                 825                 830         


Gly Leu Ser Val Ile Lys Phe Val Glu Val Gly Gly Phe Ala Asn Asn 
        835                 840                 845             


Asp Leu Val Glu Gln Arg Thr Gln Phe Phe Asp Gly Gly Val Asn Val 
    850                 855                 860                 


Gly Asp Ile Gly Ser Val Pro Pro Lys Thr Thr Ile Ile Ala Gly Ser 
865                 870                 875                 880 


Ala Glu Ala Pro Gln Gly Ser Asp Ile Gln Val Pro Val Lys Ile Glu 
                885                 890                 895     


Asn Ala Asp Lys Val Gly Ser Ile Asn Leu Ile Leu Ser Tyr Pro Asn 
            900                 905                 910         


Val Leu Glu Val Glu Asp Val Leu Gln Gly Ser Leu Thr Gln Asn Ser 
        915                 920                 925             


Leu Phe Asp Tyr Asn Val Glu Gly Asn Gln Ile Lys Val Gly Ile Ala 
    930                 935                 940                 


Asp Ser Asn Gly Ile Ser Gly Asp Gly Ser Leu Phe Tyr Val Lys Phe 
945                 950                 955                 960 


Arg Val Thr Gly Asn Glu Lys Ala Glu Gln Ala Glu Asn Val Lys Gly 
                965                 970                 975     


Lys Leu Arg Gly Leu Gly Gln Gln Leu Ser Glu Ile Thr Leu Arg Asn 
            980                 985                 990         


Ser His Ala Leu Thr Leu Gln Gly  Ile Glu Ile Tyr Asp  Ile Asp Gly 
        995                 1000                 1005             


Asn Ser  Val Lys Val Ala Thr  Ile Asn Gly Thr Phe  Arg Ile Val 
    1010                 1015                 1020             


Ser Gln  Glu Glu Ala Ser Ala  Gly Gly Leu Ser Ala  Val Gln Pro 
    1025                 1030                 1035             


Asn Val  Ser Leu Gly Glu Val  Leu Asp Val Ser Ala  Asn Arg Thr 
    1040                 1045                 1050             


Ala Ala  Asp Gly Thr Val Glu  Trp Leu Ile Pro Thr  Val Thr Ala 
    1055                 1060                 1065             


Ala Pro  Gly Gln Thr Val Thr  Met Pro Val Val Val  Lys Ser Ser 
    1070                 1075                 1080             


Ser Leu  Ala Val Ala Gly Ala  Gln Phe Lys Ile Gln  Ala Ala Thr 
    1085                 1090                 1095             


Gly Val  Arg Tyr Ser Ser Lys  Thr Asp Gly Asp Ala  Tyr Gly Ser 
    1100                 1105                 1110             


Gly Ile  Val Tyr Asn Asn Ser  Lys Tyr Ala Phe Gly  Gln Gly Ala 
    1115                 1120                 1125             


Gly Arg  Gly Ile Val Ala Ala  Asp Asp Ser Val Val  Leu Thr Leu 
    1130                 1135                 1140             


Ala Tyr  Thr Val Pro Ala Asp  Cys Ala Glu Gly Thr  Tyr Asp Val 
    1145                 1150                 1155             


Lys Trp  Ser Asp Ala Phe Val  Ser Asp Thr Asp Gly  Gln Asn Ile 
    1160                 1165                 1170             


Thr Ser  Lys Val Thr Leu Thr  Asp Gly Leu Glu His  His His His 
    1175                 1180                 1185             


His His  
    1190 


<210>  44
<211>  3573
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  44
atgggcgtgg ctctggaact ggataagacg aaggtaaaag taggggacat aataacagcg       60

acgataaaga tagagaacat gaagaatttt gcagggtacc agttgaatat caagtatgac      120

ccgaccatgt tggaggcaat agaactggag acaggaagtg cgatagcgaa gaggacatgg      180

ccggttacag gaggtactgt tctgcaaagt gacaattatg gaaagacgac tgcggtagcg      240

aatgatgtag gagcaggtat aataaacttt gctgaggcat actcgaacct taccaaatac      300

agagagacag gtgtggcaga agagacaggt ataataggaa agataggctt cagagtgctg      360

aaggcaggaa gtacggctat aagatttgag gatacgacag cgatgccggg agcaatagaa      420

ggaacataca tgttcgactg gtatggcgag aacatcaaag ggtatagcgt agtacagcct      480

ggggaaatag tggtagaagg agaagagccg ggtgaagagc cgacagaaga gcctgtaccg      540

acagagacat cggtagatcc cacaccgaca gtgacagaag agcctgtacc ttcagagctt      600

ccagattcct atgctagact taaagttaca gtaggaacag ctaatggtaa gcctggcgat      660

acagtaacag ttcctgttac atttgctgat gtagcaaaga tgaaaaacgt aggaacatgt      720

aatttctatc ttggatatga tgcaagcctg ttagaggtag tatcagtaga tgcaggtcca      780

atagttaaga atgcagcagt taacttctca agcagtgcaa gcaacggaac aatcagcttc      840

ctgttcttgg ataacacaat tacagacgaa ttgataactg cagacggtgt gtttgcaaat      900

attaagttca aattaaagag tgtaacggct aaaactacaa caccagtaac atttaaagat      960

ggtggagctt ttggtgacgg aactatgtca aagatagctt cagttactaa gacaaacggt     1020

agtgtaacga tcgatccgac caagggagca acaccaacaa atacagctac gccgacaaaa     1080

tcagctacgg ctacgcccac caggccatcg gtaccgcggc cgcatttaca ggttgacatt     1140

ggaagtacta gtggaaaagc aggtagtgtt gttagtgtac ctataacatt tactaatgta     1200

cctaaatcag gtatctatgc tctaagtttt agaacaaatt tcgacccaca aaaggtaact     1260

gtagcaagta tagatgctgg ctcactgatt gaaaatgctt ctgattttac tacttattat     1320

aataatgaaa atggttttgc atcaatgacg tttgaagccc cagttgatag agctagaatc     1380

atagatagtg atggtgtatt tgcaaccatt aactttaaag ttagtgatag tgccaaagta     1440

ggtgaacttt acaatattac tactaatagt gcatatactt cattctatta ttctggaact     1500

gatgaaatca aaaatgttgt ttacaatgat ggaaaaattg aggtaattgc atcggtaccg     1560

acaaacacac cgacaaacac accggcaaat acaccggtat caggcaattt gaaggttgaa     1620

ttctacaaca gcaatccttc agatactact aactcaatca atcctcagtt caaggttact     1680

aataccggaa gcagtgcaat tgatttgtcc aaactcacat tgagatatta ttatacagta     1740

gacggacaga aagatcagac cttctggtgt gaccatgctg caataatcgg cagtaacggc     1800

agctacaacg gaattacttc aaatgtaaaa ggaacatttg taaaaatgag ttcctcaaca     1860

aataacgcag acacctacct tgaaataagc tttacaggcg gaactcttga accgggtgca     1920

catgttcaga tacaaggtag atttgcaaag aatgactgga gtaactatac acagtcaaat     1980

gactactcat tcaagtctgc ttcacagttt gttgaatggg atcaggtaac agcatacttg     2040

aacggtgttc ttgtatgggg taaagaaccc ggtggcagtg tagtaccatc aacacagcct     2100

gtaacaacac cacctgcaac aacaaaacca cctgcaacaa caataccgcc gtcagatgat     2160

ccgaatgcaa taaagattaa ggtggacaca gtaaatgcaa aaccgggaga cacagtaaat     2220

atacctgtaa gattcagtgg tataccatcc aagggaatag caaactgtga ctttgtatac     2280

agctatgacc cgaatgtact tgagataata gagataaaac cgggagaatt gatagttgac     2340

ccgaatcctg acaagagctt tgatactgca gtatatcctg acagaaagat aatagtattc     2400

ctgtttgcag aagacagcgg aacaggagcg tatgcaataa ctaaagacgg agtatttgct     2460

acgatagtag cgaaagtaaa atccggagca cctaacggac tcagtgtaat caaatttgta     2520

gaagtaggcg gatttgcgaa caatgacctt gtagaacaga ggacacagtt ctttgacggt     2580

ggagtaaatg ttggagatat aggatccgtt cctccgaaaa ctaccatcat tgccggttct     2640

gccgaagcgc cccaaggaag tgatatccag gtgcctgtta aaatcgaaaa tgctgacaaa     2700

gtgggcagca taaatctcat cctgagctac ccgaatgtgc ttgaggttga ggatgtgctt     2760

cagggctctc taactcagaa ctcacttttc gattacaatg ttgaaggtaa tcaaattaaa     2820

gttggcatcg cggacagtaa cgggattagc ggcgacggtt cgctgttcta cgtaaagttc     2880

agagttacag gcaatgaaaa agcggagcag gcagaaaacg ttaaaggtaa acttaggggc     2940

ttgggccaac agctctccga aatcacactc agaaactctc acgctctcac ccttcaaggg     3000

atcgaaatct acgacattga tgggaattct gtaaaggttg cgacgataaa tgggactttc     3060

aggattgtct ctcaggaaga agctagcgcc ggtggtttat ccgctgtgca gcctaatgtt     3120

agtttaggcg aagtactgga tgtttctgct aacagaaccg ctgctgacgg aacagttgaa     3180

tggcttatcc caacagtaac tgcagctcca ggccagacgg tcactatgcc cgtagtagtc     3240

aagagttcaa gtcttgcagt tgctggtgcg cagttcaaga tccaggcggc gacaggcgta     3300

cgttattcgt ccaagacgga cggtgacgct tacggttcag gcattgtgta caataatagt     3360

aagtatgctt ttggacaggg tgcaggtaga ggaatagttg cagctgatga ttcggttgtg     3420

cttactcttg catatacagt tcccgctgat tgtgctgaag gtacatatga tgtcaagtgg     3480

tctgatgcgt ttgtaagtga tacagacgga cagaatatca caagtaaggt tactcttact     3540

gatggcctcg agcaccacca ccaccaccac tga                                  3573


<210>  45
<211>  623
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  45

Met His His His His His His Thr Ser Pro Gln Val Thr Ser Ser Pro 
1               5                   10                  15      


Ser Arg Glu Glu Pro Arg Ala Gly Thr Ile Arg Asn Pro Val Leu Thr 
            20                  25                  30          


Gly Phe Tyr Pro Asp Pro Ser Ile Leu Arg Val Gly Asp Asp Tyr Tyr 
        35                  40                  45              


Met Ala Thr Ser Thr Phe Glu Trp Tyr Pro Gly Val Thr Leu His His 
    50                  55                  60                  


Ser Arg Asp Leu Val His Trp Arg Pro Leu Gly Gly Ala Leu Thr Glu 
65                  70                  75                  80  


Thr Arg Leu Leu Asp Leu Ala Gly Arg Arg Asp Gly Ala Gly Val Trp 
                85                  90                  95      


Ala Pro Ala Leu Ser Tyr Arg Asp Gly Leu Phe Phe Leu Val Phe Thr 
            100                 105                 110         


Asn Val Ala Ser Tyr Ser Gly Asn Phe Trp Asp Ala Pro Asn Tyr Val 
        115                 120                 125             


Thr Thr Ala Pro Asp Ile Thr Gly Pro Trp Ser Asp Pro Val Pro Leu 
    130                 135                 140                 


His Ser Leu Gly Phe Asp Pro Ser Leu Phe His Asp Asp Asp Gly Arg 
145                 150                 155                 160 


Ser Trp Leu Leu Ser Thr Ser Met Asp Trp Arg Pro Gly Arg Asp Ala 
                165                 170                 175     


Phe Gly Gly Ile Val Ala Gln Glu Phe Ser Val Arg Asp Met Lys Leu 
            180                 185                 190         


Val Gly Glu Pro Val Ile Ile Phe Thr Gly Thr Glu Ala Gly Val Thr 
        195                 200                 205             


Glu Ala Pro His Ile Tyr Lys Arg Asp Gly Trp Tyr Tyr Leu Val Thr 
    210                 215                 220                 


Ala Glu Gly Gly Thr Gln Trp Glu His Gln Val Thr Val Ala Arg Ser 
225                 230                 235                 240 


Arg Ser Val Thr Gly Pro Tyr Glu Val Asp Pro Ala Gly Pro Ala Leu 
                245                 250                 255     


Thr Ser Arg His Val Pro Glu Ala Pro Leu Gln Lys Ala Gly His Ala 
            260                 265                 270         


Ser Met Val Glu Thr Gln His Gly Glu Trp Tyr Phe Ala His Leu Thr 
        275                 280                 285             


Gly Arg Pro Met Pro Pro Ser Gly Arg Cys Val Leu Gly Arg Glu Thr 
    290                 295                 300                 


Ala Leu Gln Lys Ile Glu Trp Ser Ser Asp Gly Trp Pro Arg Val Arg 
305                 310                 315                 320 


Asn Ala Glu Pro Leu Leu Glu Val Pro Gly Pro Arg Gly Leu Ala Pro 
                325                 330                 335     


His Pro Trp Pro Gln Pro Ser Glu Thr Asp His Phe Asp Asp Pro Thr 
            340                 345                 350         


Pro Arg Pro Glu Trp Ser Thr Leu Arg Arg Pro Phe Asp Ser Ser Trp 
        355                 360                 365             


Val Ser Leu Thr Glu Arg Pro Gly Tyr Leu Arg Ile Arg Gly Gly Gln 
    370                 375                 380                 


Ser Pro Ala Gly Leu His Glu Pro Ser Leu Val Ala Arg Arg Leu Gln 
385                 390                 395                 400 


His Arg Ala Cys Ile Phe Glu Ala Cys Leu Glu Phe Lys Pro Glu Asp 
                405                 410                 415     


Phe Arg Gln Met Ala Gly Ile Thr Ala Tyr Tyr Asn Thr Arg Gln Trp 
            420                 425                 430         


His Tyr Leu Arg Ile Asn Arg Asp Asp Arg Gly Gly Val Phe Ala Gly 
        435                 440                 445             


Val Leu Thr Ser Asp Arg Gly Ile Ile Arg Glu Val Gly Arg Arg Ile 
    450                 455                 460                 


Ser Val Thr Asp Trp Pro Lys Val Phe Leu Arg Ala Glu Ile Asp Arg 
465                 470                 475                 480 


Asn Asp Leu Arg Phe Ala Val Ser Ser Asp Gly Ser Thr Trp Ala Asp 
                485                 490                 495     


Met Gly Val Arg Leu Asp Met Ser Ile Leu Ser Asp Glu Tyr Ala Glu 
            500                 505                 510         


Glu Arg Phe Gly Asn Asp Pro Ile Met Trp Gly Phe Thr Gly Ala Phe 
        515                 520                 525             


Leu Gly Leu Trp Ala His Asp Met Thr Gly Ala Gly Leu Pro Ala Asp 
    530                 535                 540                 


Phe Asp Phe Cys Thr Tyr Arg Pro Gln Ser Pro Ser Thr Ser Pro Val 
545                 550                 555                 560 


Ile Val Tyr Gly Asp Tyr Asn Asn Asp Gly Asn Val Asp Ala Leu Asp 
                565                 570                 575     


Phe Ala Gly Leu Lys Lys Tyr Ile Met Ala Ala Asp His Ala Tyr Val 
            580                 585                 590         


Lys Asn Leu Asp Val Asn Leu Asp Asn Glu Val Asn Ala Phe Asp Leu 
        595                 600                 605             


Ala Ile Leu Lys Lys Tyr Leu Leu Gly Met Val Ser Lys Leu Pro 
    610                 615                 620             


<210>  46
<211>  1872
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  46
atgcaccatc accatcacca tacttctccc caagtcacgt cctccccgtc tcgtgaggaa       60

ccgagggcgg gcacgattcg caacccggta ctcaccggct tctaccccga cccttccatc      120

ctgcgagtgg gcgacgacta ctacatggcg acctccacat tcgagtggta tcccggagtg      180

accctgcacc attcccggga cttggtgcac tggcgccccc tgggcggtgc actcaccgag      240

actcgactgc tggacctggc tggacggcgg gacggcgcag gggtgtgggc acccgccctg      300

tcctaccggg acggactgtt cttcctcgtc ttcacgaacg tcgcaagcta cagcggcaac      360

ttctgggacg cgcccaacta cgtcaccacc gctcccgaca tcaccggccc ctggtccgac      420

ccggtgccgc tccactccct cggcttcgac ccgtcgctgt tccacgacga cgacggacgg      480

agctggctgc tcagcacctc catggactgg cggccgggac gggacgcgtt cggtggcatc      540

gtcgcccaag agttctcggt gcgcgacatg aaactcgtcg gtgaaccggt gatcatcttc      600

accggcaccg aagccggcgt gaccgaggcg ccccacatct acaagcgcga cggctggtac      660

tacctggtca ccgccgaagg cggcacccag tgggagcacc aggtcaccgt ggcccgctcc      720

cgctcggtca ccggacccta cgaggtcgac ccggccgggc cagccctcac ctcgcggcac      780

gttcccgaag cgccgctgca gaaggccggg cacgcgagca tggtcgaaac ccagcacggc      840

gaatggtatt tcgcgcacct gaccggacgc ccgatgccgc ccagcggccg gtgcgtcctc      900

ggtcgggaga ccgcgttgca gaagatcgaa tggtcttcag acgggtggcc ccgcgtccgc      960

aacgcggaac cgctgctgga agtgccggga ccgcgcggcc tggccccgca cccgtggccg     1020

cagccgtcgg agaccgacca cttcgacgac cccacgccgc ggcccgagtg gagcacgctg     1080

cgccggccct tcgactcctc ctgggtctcc ctcaccgaac ggcccggcta cctgcggatc     1140

cgcggcgggc agtcgcctgc tggcctgcac gagcccagcc tggtggcacg ccgactgcag     1200

caccgcgcct gcatcttcga agcctgcctg gagttcaagc cggaagactt ccggcagatg     1260

gcaggcatca ccgcctacta caacacccgc caatggcact acctgcggat caaccgcgac     1320

gaccggggcg gcgtgttcgc gggcgtgctc accagcgacc gcggcatcat ccgcgaagtg     1380

ggacggcgga tcagcgtcac cgactggccg aaggtcttcc tgcgcgccga aatcgaccgg     1440

aacgacctgc gcttcgccgt ctcctccgac ggcagcacgt gggctgacat gggggtgcgt     1500

ctggacatga gcatcctgtc cgacgagtac gccgaggaac ggttcggcaa cgaccccatc     1560

atgtggggtt tcacgggggc gttcctcggc ctgtgggccc acgacatgac cggggcaggg     1620

ctccctgccg acttcgactt ctgcacctac cggcctcagt ccccctccac tagtcctgta     1680

attgtatatg gagattataa caatgatgga aatgttgatg cacttgattt tgcaggctta     1740

aagaaatata ttatggctgc tgaccatgct tatgtaaaga atttggatgt taatctcgac     1800

aatgaagtga atgcatttga ccttgctatt ttgaaaaaat atctgcttgg tatggtaagt     1860

aagctacctt aa                                                         1872


<210>  47
<211>  399
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  47

Met Ala Ser Met His His His His His His Ala Val Thr Ser Asn Glu 
1               5                   10                  15      


Thr Gly Tyr His Asp Gly Tyr Phe Tyr Ser Phe Trp Thr Asp Ala Pro 
            20                  25                  30          


Gly Thr Val Ser Met Glu Leu Gly Pro Gly Gly Asn Tyr Ser Thr Ser 
        35                  40                  45              


Trp Arg Asn Thr Gly Asn Phe Val Ala Gly Lys Gly Trp Ala Thr Gly 
    50                  55                  60                  


Gly Arg Arg Thr Val Thr Tyr Ser Ala Ser Phe Asn Pro Ser Gly Asn 
65                  70                  75                  80  


Ala Tyr Leu Thr Leu Tyr Gly Trp Thr Arg Asn Pro Leu Val Glu Tyr 
                85                  90                  95      


Tyr Ile Val Glu Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Met 
            100                 105                 110         


Gly Thr Val Thr Thr Asp Gly Gly Thr Tyr Asp Ile Tyr Lys Thr Thr 
        115                 120                 125             


Arg Tyr Asn Ala Pro Ser Ile Glu Gly Thr Arg Thr Phe Asp Gln Tyr 
    130                 135                 140                 


Trp Ser Val Arg Gln Ser Lys Arg Thr Ser Gly Thr Ile Thr Ala Gly 
145                 150                 155                 160 


Asn His Phe Asp Ala Trp Ala Arg His Gly Met His Leu Gly Thr His 
                165                 170                 175     


Asp Tyr Met Ile Met Ala Thr Glu Gly Tyr Gln Ser Ser Gly Ser Ser 
            180                 185                 190         


Asn Val Thr Leu Gly Thr Ser Gly Gly Gly Asn Pro Gly Gly Gly Asn 
        195                 200                 205             


Pro Pro Gly Gly Gly Asn Pro Pro Gly Gly Gly Gly Cys Thr Ala Thr 
    210                 215                 220                 


Leu Ser Ala Gly Gln Gln Trp Asn Asp Arg Tyr Asn Leu Asn Val Asn 
225                 230                 235                 240 


Val Ser Gly Ser Asn Asn Trp Thr Val Thr Val Asn Val Pro Trp Pro 
                245                 250                 255     


Ala Arg Ile Ile Ala Thr Trp Asn Ile His Ala Ser Tyr Pro Asp Ser 
            260                 265                 270         


Gln Thr Leu Val Ala Arg Pro Asn Gly Asn Gly Asn Asn Trp Gly Met 
        275                 280                 285             


Thr Ile Met His Asn Gly Asn Trp Thr Trp Pro Thr Val Ser Cys Ser 
    290                 295                 300                 


Ala Asn Glu Leu Thr Ala Thr Thr Thr Pro Thr Thr Thr Pro Thr Thr 
305                 310                 315                 320 


Thr Pro Thr Pro Lys Phe Ile Tyr Gly Asp Val Asp Gly Asn Gly Ser 
                325                 330                 335     


Val Arg Ile Asn Asp Ala Val Leu Ile Arg Asp Tyr Val Leu Gly Lys 
            340                 345                 350         


Ile Asn Glu Phe Pro Tyr Glu Tyr Gly Met Leu Ala Ala Asp Val Asp 
        355                 360                 365             


Gly Asn Gly Ser Ile Lys Ile Asn Asp Ala Val Leu Val Arg Asp Tyr 
    370                 375                 380                 


Val Leu Gly Lys Ile Phe Leu Phe Pro Val Glu Glu Lys Glu Glu 
385                 390                 395                 


<210>  48
<211>  1200
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  48
atggctagca tgcaccatca ccatcaccac gccgtgacct ccaacgagac cgggtaccac       60

gacgggtact tctactcgtt ctggaccgac gcgcctggaa cggtctccat ggagctgggc      120

cctggcggaa actacagcac ctcctggcgg aacaccggga acttcgtcgc cggtaaggga      180

tgggccaccg gtggccgccg gaccgtgacc tactccgcca gcttcaaccc gtcgggtaac      240

gcctacctga ccctctacgg gtggacgcgg aacccgctcg tggagtacta catcgtcgaa      300

agctggggca cctaccggcc caccggtacc tacatgggca cggtgaccac cgacggtggt      360

acctacgaca tctacaagac cacgcggtac aacgcgccct ccatcgaagg cacccggacc      420

ttcgaccagt actggagcgt ccgccagtcc aagcggacca gcggtaccat caccgcgggg      480

aaccacttcg acgcgtgggc ccgccacggt atgcacctcg gaacccacga ctacatgatc      540

atggcgaccg agggctacca gagcagcgga tcctccaacg tgacgttggg caccagcggc      600

ggtggaaacc ccggtggggg caaccccccc ggtggcggca acccccccgg tggcggtggc      660

tgcacggcga cgctgtccgc gggccagcag tggaacgacc gctacaacct caacgtcaac      720

gtcagcggct ccaacaactg gaccgtgacc gtgaacgttc cgtggccggc gaggatcatc      780

gccacctgga acatccacgc cagctacccg gactcccaga ccttggttgc ccggcctaac      840

ggcaacggca acaactgggg catgacgatc atgcacaacg gcaactggac gtggcccacg      900

gtgtcctgca gcgccaacga gctcacagca actacaacac caactacaac accaactaca      960

acaccaacgc ctaaatttat atatggtgat gttgatggta atggaagtgt aagaattaat     1020

gatgctgtcc taataagaga ctatgtatta ggaaaaatca atgaattccc atatgaatat     1080

ggtatgcttg cagcagatgt tgatggtaat ggaagtataa aaattaatga tgctgttcta     1140

gtaagagact acgtgttagg aaagatattt ttattccctg ttgaagagaa agaagaataa     1200


<210>  49
<211>  460
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  49

Met Ala Ser His His His His His His Gly Pro Val His Asp His His 
1               5                   10                  15      


Pro Ala Pro His Ser Asn Ala Lys Ser Glu Arg Leu Arg Trp Ala Ala 
            20                  25                  30          


Pro Asp Gly Phe Tyr Ile Gly Ser Ala Val Ala Gly Gly Gly His His 
        35                  40                  45              


Leu Glu Gln Asp Tyr Pro Asp Pro Phe Thr His Asp Gly Lys Tyr Arg 
    50                  55                  60                  


Ser Ile Leu Ala Gln Gln Phe Ser Ser Val Ser Pro Glu Asn Gln Met 
65                  70                  75                  80  


Lys Trp Glu Tyr Ile His Pro Glu Pro Asp Arg Tyr Asp Phe Ala Met 
                85                  90                  95      


Ala Asp Lys Ile Val Asp Phe Ala Glu Arg Asn Asp Gln Lys Val Arg 
            100                 105                 110         


Gly His Thr Leu Leu Trp His Ser Gln Asn Pro Glu Trp Leu Glu Glu 
        115                 120                 125             


Gly Asp Tyr Ser Pro Glu Glu Leu Arg Glu Ile Leu Arg Asp His Ile 
    130                 135                 140                 


Thr Thr Val Val Gly Arg Tyr Ala Gly Arg Ile His Gln Trp Asp Val 
145                 150                 155                 160 


Ala Asn Glu Ile Phe Asp Glu Gln Gly Asn Leu Arg Thr Gln Glu Asn 
                165                 170                 175     


Ile Trp Ile Arg Glu Leu Gly Pro Gly Ile Ile Ala Asp Ala Phe Arg 
            180                 185                 190         


Trp Ala His Glu Ala Asp Pro Asn Ala Glu Leu Phe Phe Asn Asp Tyr 
        195                 200                 205             


Asn Val Glu Gly Ile Asn Pro Lys Ser Asp Ala Tyr Tyr Glu Leu Ile 
    210                 215                 220                 


Gln Glu Leu Leu Asp Asp Gly Val Pro Val His Gly Phe Ser Val Gln 
225                 230                 235                 240 


Gly His Leu Ser Thr Arg Tyr Gly Phe Pro Gly Asp Leu Glu Gln Asn 
                245                 250                 255     


Leu Arg Arg Phe Asp Glu Leu Gly Leu Ala Thr Ala Ile Thr Glu Leu 
            260                 265                 270         


Asp Val Arg Met Asp Leu Pro Ala Ser Gly Lys Pro Thr Pro Lys Gln 
        275                 280                 285             


Leu Glu Gln Gln Ala Asp Tyr Tyr Gln Gln Ala Leu Glu Ala Cys Leu 
    290                 295                 300                 


Ala Val Glu Gly Cys Asp Ser Phe Thr Ile Trp Gly Phe Thr Asp Lys 
305                 310                 315                 320 


Tyr Ser Trp Val Pro Val Phe Phe Pro Asp Glu Gly Ala Ala Thr Ile 
                325                 330                 335     


Met Thr Glu Lys Tyr Glu Arg Lys Pro Ala Phe Phe Ala Leu Gln Gln 
            340                 345                 350         


Thr Leu Arg Glu Ala Arg Cys Ala Asp Ser Pro Lys Pro Gly Pro Gly 
        355                 360                 365             


Lys Pro Lys Pro Gly Lys Gly Pro Lys His Asp His Cys Thr Ser Thr 
    370                 375                 380                 


Tyr Lys Val Pro Gly Thr Pro Ser Thr Lys Leu Tyr Gly Asp Val Asn 
385                 390                 395                 400 


Asp Asp Gly Lys Val Asn Ser Thr Asp Ala Val Ala Leu Lys Arg Tyr 
                405                 410                 415     


Val Leu Arg Ser Gly Ile Ser Ile Asn Thr Asp Asn Ala Asp Leu Asn 
            420                 425                 430         


Glu Asp Gly Arg Val Asn Ser Thr Asp Leu Gly Ile Leu Lys Arg Tyr 
        435                 440                 445             


Ile Leu Lys Glu Ile Asp Thr Leu Pro Tyr Lys Asn 
    450                 455                 460 


<210>  50
<211>  1383
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  50
atggctagcc atcaccatca ccatcacgga ccggtccacg accatcatcc cgctccccac       60

tccaacgcga aatccgagcg gctgcgctgg gctgcccccg acggcttcta catcggcagc      120

gcggtcgcgg gcggcggcca ccacctggag caggactacc ccgacccctt cacccacgac      180

gggaaatacc gcagcatcct ggctcagcag ttcagctcag tctccccgga aaaccagatg      240

aagtgggagt acatccatcc tgagccggac cgctacgact tcgccatggc cgacaagatc      300

gtcgacttcg cggagcgtaa cgaccagaag gtccgcggtc acaccctgct gtggcacagc      360

cagaaccccg agtggctcga agagggcgac tactcccctg aggagctgcg cgagatcctg      420

cgggaccaca tcaccaccgt ggtcggccgc tacgccggac ggatccacca gtgggatgtg      480

gccaacgaga tcttcgacga gcagggcaac ctgcgtactc aggagaacat ctggatccgc      540

gagctcggcc ccggcatcat cgctgacgcg ttccgctggg cgcacgaggc agacccgaac      600

gcggagctgt tcttcaacga ctacaacgtg gagggcatca acccgaagag cgacgcctac      660

tacgaactca tccaggagct gctcgacgac ggggttccgg tccacggctt ctccgtccag      720

gggcacctga gcacccgcta cggcttcccg ggcgacctgg aacagaacct gcgccggttc      780

gacgagctcg gtctggccac ggcgatcacc gagctggacg tgcgcatgga cctgccggcc      840

agcggcaagc cgaccccgaa gcagttggag cagcaggccg actactacca gcaggcgctt      900

gaagcgtgcc tggccgtgga aggctgcgac tccttcacga tctggggctt cacggacaag      960

tactcctggg tgccggtgtt cttccccgac gagggcgcgg cgacgatcat gacggagaag     1020

tacgagcgca agcccgcttt cttcgcgctg cagcagacgc tgcgggaagc ccggtgcgcg     1080

gacagcccca agccgggacc gggcaagccg aagccgggca agggccccaa gcacgatcac     1140

tgtactagta catataaagt acctggtact ccttctacta aattatacgg cgacgtcaat     1200

gatgacggaa aagttaactc aactgacgct gtagcattga agagatatgt tttgagatca     1260

ggtataagca tcaacactga caatgccgat ttgaatgaag acggcagagt taattcaact     1320

gacttaggaa ttttgaagag atatattctc aaagaaatag atacattgcc gtacaagaac     1380

taa                                                                   1383


<210>  51
<211>  378
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  51

Met Ala Asn Asp Ser Pro Phe Tyr Val Asn Pro Asn Met Ser Ser Ala 
1               5                   10                  15      


Glu Trp Val Arg Asn Asn Pro Asn Asp Pro Arg Thr Pro Val Ile Arg 
            20                  25                  30          


Asp Arg Ile Ala Ser Val Pro Gln Gly Thr Trp Phe Ala His His Asn 
        35                  40                  45              


Pro Gly Gln Ile Thr Gly Gln Val Asp Ala Leu Met Ser Ala Ala Gln 
    50                  55                  60                  


Ala Ala Gly Lys Ile Pro Ile Leu Val Val Tyr Asn Ala Pro Gly Arg 
65                  70                  75                  80  


Asp Cys Gly Asn His Ser Ser Gly Gly Ala Pro Ser His Ser Ala Tyr 
                85                  90                  95      


Arg Ser Trp Ile Asp Glu Phe Ala Ala Gly Leu Lys Asn Arg Pro Ala 
            100                 105                 110         


Tyr Ile Ile Val Glu Pro Asp Leu Ile Ser Leu Met Ser Ser Cys Met 
        115                 120                 125             


Gln His Val Gln Gln Glu Val Leu Glu Thr Met Ala Tyr Ala Gly Lys 
    130                 135                 140                 


Ala Leu Lys Ala Gly Ser Ser Gln Ala Arg Ile Tyr Phe Asp Ala Gly 
145                 150                 155                 160 


His Ser Ala Trp His Ser Pro Ala Gln Met Ala Ser Trp Leu Gln Gln 
                165                 170                 175     


Ala Asp Ile Ser Asn Ser Ala His Gly Ile Ala Thr Asn Thr Ser Asn 
            180                 185                 190         


Tyr Arg Trp Thr Ala Asp Glu Val Ala Tyr Ala Lys Ala Val Leu Ser 
        195                 200                 205             


Ala Ile Gly Asn Pro Ser Leu Arg Ala Val Ile Asp Thr Ser Arg Asn 
    210                 215                 220                 


Gly Asn Gly Pro Ala Gly Asn Glu Trp Cys Asp Pro Ser Gly Arg Ala 
225                 230                 235                 240 


Ile Gly Thr Pro Ser Thr Thr Asn Thr Gly Asp Pro Met Ile Asp Ala 
                245                 250                 255     


Phe Leu Trp Ile Lys Leu Pro Gly Glu Ala Asp Gly Cys Ile Ala Gly 
            260                 265                 270         


Ala Gly Gln Phe Val Pro Gln Ala Ala Tyr Glu Met Ala Ile Ala Ala 
        275                 280                 285             


Gly Gly Thr Ala Val Pro Glu Glu Ala Asn Lys Gly Asp Val Asn Gly 
    290                 295                 300                 


Asp Gly Glu Ile Asn Ser Leu Asp Ala Leu Leu Ala Leu Gln Met Ser 
305                 310                 315                 320 


Ile Gly Lys Val Glu Pro Asn Pro Val Ala Asp Met Asp Gly Asp Gly 
                325                 330                 335     


Lys Val Leu Ala Lys Asp Ala Thr Glu Ile Met Lys Met Ala Thr Asp 
            340                 345                 350         


Met Met Ile Arg Arg Thr Ala Glu Ile Ile Ser Gln Asn Gly Leu Leu 
        355                 360                 365             


Gly Lys Leu Glu His His His His His His 
    370                 375             


<210>  52
<211>  1137
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  52
atggccaatg attctccgtt ctacgtcaac cccaacatgt cctccgccga atgggtgcgg       60

aacaacccca acgacccgcg taccccggta atccgcgacc ggatcgccag cgtgccgcag      120

ggcacctggt tcgcccacca caaccccggg cagatcaccg gccaggtcga cgcgctcatg      180

agcgccgccc aggccgccgg caagatcccg atcctggtcg tgtacaacgc cccgggccgc      240

gactgcggca accacagcag cggcggcgcc cccagtcaca gcgcctaccg gtcctggatc      300

gacgaattcg ctgccggact gaagaaccgt cccgcctaca tcatcgtcga accggacctg      360

atctcgctga tgtcgagctg catgcagcac gtccagcagg aagtcctgga gacgatggcg      420

tacgcgggca aggccctcaa ggccgggtcc tcgcaggcgc ggatctactt cgacgccggc      480

cactccgcgt ggcactcgcc cgcacagatg gcttcctggc tccagcaggc cgacatctcc      540

aacagcgcgc acggtatcgc caccaacacc tccaactacc ggtggaccgc tgacgaggtc      600

gcctacgcca aggcggtgct ctcggccatc ggcaacccgt ccctgcgcgc ggtcatcgac      660

accagccgca acggcaacgg ccccgccggt aacgagtggt gcgaccccag cggacgcgcc      720

atcggcacgc ccagcaccac caacaccggc gacccgatga tcgacgcctt cctgtggatc      780

aagctgccgg gtgaggccga cggctgcatc gccggcgccg gccagttcgt cccgcaggcg      840

gcctacgaga tggcgatcgc cgcgggcggc accgcggtac cagaagaagc aaacaaggga      900

gatgtgaatg gagatggaga aataaacagt ctcgacgctc tgcttgcact tcagatgtca      960

atcgggaagg ttgagccgaa ccctgtagca gatatggatg gggatggaaa ggtgcttgcg     1020

aaggatgcca ctgaaatcat gaagatggca acagacatga tgatcagaag aacggcggaa     1080

attataagcc agaatggctt actgggtaag ctcgagcacc accaccacca ccactga        1137


<210>  53
<211>  444
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polypeptide

<400>  53

Met His His His His His His Ser Thr Leu Arg Glu Leu Ala Ala Gln 
1               5                   10                  15      


Asn Gly Gly Arg His Phe Gly Thr Ala Ile Ala Tyr Ser Pro Leu Asn 
            20                  25                  30          


Ser Asp Ala Gln Tyr Arg Asn Ile Ala Ala Thr Gln Phe Ser Ala Ile 
        35                  40                  45              


Thr His Glu Asn Glu Met Lys Trp Glu Ser Leu Glu Pro Gln Arg Gly 
    50                  55                  60                  


Gln Tyr Asn Trp Ser Gln Ala Asp Asn Ile Ile Asn Phe Ala Lys Ala 
65                  70                  75                  80  


Asn Asn Gln Ile Val Arg Gly His Thr Leu Val Trp His Ser Gln Leu 
                85                  90                  95      


Pro Ser Trp Leu Asn Asn Gly Gly Phe Ser Gly Ser Gln Leu Arg Ser 
            100                 105                 110         


Ile Met Glu Asn His Ile Glu Val Val Ala Gly Arg Tyr Arg Gly Asp 
        115                 120                 125             


Val Tyr Ala Trp Asp Val Val Asn Glu Ala Phe Asn Glu Asp Gly Thr 
    130                 135                 140                 


Leu Arg Asp Ser Ile Trp Tyr Arg Gly Met Gly Arg Asp Tyr Ile Ala 
145                 150                 155                 160 


His Ala Phe Arg Lys Ala His Glu Val Asp Pro Asp Ala Lys Leu Tyr 
                165                 170                 175     


Ile Asn Asp Tyr Asn Ile Glu Gly Ile Asn Ala Lys Ser Asn Gly Leu 
            180                 185                 190         


Tyr Asn Leu Val Val Asp Leu Leu Arg Asp Gly Val Pro Ile His Gly 
        195                 200                 205             


Ile Gly Ile Gln Ser His Leu Ile Val Gly Gln Val Pro Ser Thr Phe 
    210                 215                 220                 


Gln Gln Asn Ile Gln Arg Phe Ala Asp Leu Gly Leu Asp Val Ala Ile 
225                 230                 235                 240 


Thr Glu Leu Asp Ile Arg Met Gln Met Pro Ala Asp Gln Tyr Lys Leu 
                245                 250                 255     


Gln Gln Gln Ala Arg Asp Tyr Glu Ala Val Val Asn Ala Cys Leu Ala 
            260                 265                 270         


Val Thr Arg Cys Ile Gly Ile Thr Val Trp Gly Ile Asp Asp Glu Arg 
        275                 280                 285             


Ser Trp Val Pro Tyr Thr Phe Pro Gly Glu Gly Ala Pro Leu Leu Tyr 
    290                 295                 300                 


Asp Gly Gln Tyr Asn Arg Lys Pro Ala Trp Tyr Ala Val Tyr Glu Ala 
305                 310                 315                 320 


Leu Gly Gly Asp Ser Ser Gly Gly Gly Pro Gly Glu Pro Gly Gly Pro 
                325                 330                 335     


Gly Gly Pro Gly Glu Pro Gly Gly Pro Gly Gly Pro Gly Glu Pro Gly 
            340                 345                 350         


Gly Pro Gly Asp Gly Thr Ser Gly Thr Lys Leu Val Pro Thr Trp Gly 
        355                 360                 365             


Asp Thr Asn Cys Asp Gly Val Val Asn Val Ala Asp Val Val Val Leu 
    370                 375                 380                 


Asn Arg Phe Leu Asn Asp Pro Thr Tyr Ser Asn Ile Thr Asp Gln Gly 
385                 390                 395                 400 


Lys Val Asn Ala Asp Val Val Asp Pro Gln Asp Lys Ser Gly Ala Ala 
                405                 410                 415     


Val Asp Pro Ala Gly Val Lys Leu Thr Val Ala Asp Ser Glu Ala Ile 
            420                 425                 430         


Leu Lys Ala Ile Val Glu Leu Ile Thr Leu Pro Gln 
        435                 440                 


<210>  54
<211>  1335
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide

<400>  54
atgcaccatc accatcacca ctcgaccctg cgggaactgg ctgcccagaa cggcggccgc       60

cacttcggta cggctatcgc ctacagcccg ctcaacagtg acgcccagta ccgcaacatc      120

gcggctaccc agttcagcgc catcacccac gaaaacgaga tgaagtggga gtcgctggag      180

ccgcagcggg gccagtacaa ctggagccag gccgacaaca tcatcaactt cgccaaggcc      240

aacaaccaga ttgtgcgcgg ccacaccctg gtctggcaca gccagctgcc gtcctggctg      300

aacaacggcg gcttctccgg cagccagctc cggtccatca tggagaacca catcgaggtg      360

gtggccggac gctaccgggg tgacgtctac gcctgggacg tggtcaacga agcgttcaac      420

gaggacggta cgctccgcga ctcgatctgg taccgcggca tgggtcgcga ctacatcgcc      480

cacgcgttcc gcaaggcgca cgaggtcgac cccgacgcca agctgtacat caacgactac      540

aacatcgaag gcatcaacgc taagagcaac ggcctctaca acctggtggt cgacctgctc      600

cgcgacggtg tgccgatcca cggtatcggt atccagtccc acctgatcgt cggccaggtg      660

ccgtccacgt tccagcagaa catccagcgg ttcgctgacc tcggcctgga cgtggccatc      720

accgagctgg acatccgcat gcagatgccg gccgaccagt acaagctcca gcagcaggcc      780

cgcgactacg aggccgtggt caacgcctgc ctcgcggtga cccgctgcat cggtatcacc      840

gtctggggta tcgacgacga gcgctcctgg gtgccctaca ccttcccggg tgaaggtgct      900

ccgctgctct acgacggcca gtacaaccgc aagcccgcct ggtacgcggt ctacgaggct      960

ctcggcggcg actcctccgg cggcggtccg ggtgagccgg gcggtcctgg cggtccgggt     1020

gagccgggcg gtcctggcgg tccgggtgaa ccgggcggcc ccggtgacgg cactagtggc     1080

acaaagctcg ttcctacatg gggcgataca aactgcgacg gcgttgtaaa tgttgctgac     1140

gtagtagttc ttaacagatt cctcaacgat cctacatatt ctaacattac tgatcagggt     1200

aaggttaacg cagacgttgt tgatcctcag gataagtccg gcgcagcagt tgatcctgca     1260

ggcgtaaagc tcacagtagc tgactctgag gcaatcctca aggctatcgt tgaactcatc     1320

acacttcctc agtaa                                                      1335


