                         SEQUENCE LISTING

<110>  Genzyme Corporation
 
<120>  Polypeptide having beta-hexosaminidase activity, and 
       polynucleotides coding for the same

<130>  PAT19110-WO-PCT

<160>  18    

<170>  PatentIn version 3.5

<210>  1
<211>  523
<212>  PRT
<213>  Canavalia ensiformis

<400>  1

Ala Thr Leu Lys Ser Ile Ile Glu Pro Thr Glu Ser Leu Thr Tyr Leu 
1               5                   10                  15      


Trp Pro Leu Pro Ala Asp Phe Thr Ser Gly Asp Glu Thr Leu Ser Val 
            20                  25                  30          


Asp Pro Ala Leu Thr Leu Ser Val Ala Gly Asn Gly Gly Gly Ser Ser 
        35                  40                  45              


Ile Leu Arg Asp Ala Phe Asp Arg Tyr Arg Gly Ile Ile Phe Lys His 
    50                  55                  60                  


Ser Ser Val Gly Phe Ser Leu Ile Arg Lys Leu Arg Glu Arg Leu Val 
65                  70                  75                  80  


Ser Val Ser Ala Tyr Asp Ile Ala Thr Leu Lys Ile Thr Val His Ser 
                85                  90                  95      


Asp Asn Glu Glu Leu Gln Leu Gly Val Asp Glu Thr Tyr Thr Leu Leu 
            100                 105                 110         


Val Pro Lys Ala Lys Asp Ser Tyr Val Ala Gly Glu Val Thr Ile Glu 
        115                 120                 125             


Ala Asn Thr Val Tyr Gly Ala Leu Arg Gly Leu Glu Thr Phe Ser Gln 
    130                 135                 140                 


Leu Cys Ser Phe Asp Tyr Ser Asp Lys Thr Ile Lys Ile Tyr Lys Ala 
145                 150                 155                 160 


Pro Trp Ser Ile Gln Asp Lys Pro Arg Phe Ser Tyr Arg Gly Leu Leu 
                165                 170                 175     


Leu Asp Thr Ser Arg His Tyr Leu Pro Ile Asn Val Ile Lys Gln Ile 
            180                 185                 190         


Ile Glu Ser Met Ser Tyr Ala Lys Leu Asn Val Leu His Trp His Ile 
        195                 200                 205             


Ile Asp Glu Glu Ser Phe Pro Leu Glu Val Pro Thr Tyr Pro Asn Leu 
    210                 215                 220                 


Trp Lys Gly Ser Tyr Thr Lys Trp Glu Arg Tyr Thr Val Glu Asp Ala 
225                 230                 235                 240 


Tyr Glu Ile Val Asn Phe Ala Lys Met Arg Gly Ile Asn Val Met Ala 
                245                 250                 255     


Glu Val Asp Val Pro Gly His Ala Glu Ser Trp Gly Ala Gly Tyr Pro 
            260                 265                 270         


Asn Leu Trp Pro Ser Pro Ser Cys Arg Glu Pro Leu Asp Val Ser Lys 
        275                 280                 285             


Asn Phe Thr Phe Asp Val Ile Ser Gly Ile Leu Thr Asp Ile Arg Lys 
    290                 295                 300                 


Ile Phe Pro Phe Glu Leu Phe His Leu Gly Gly Asp Glu Val Asn Thr 
305                 310                 315                 320 


Asp Cys Trp Thr Ser Thr Ser His Val Lys Glu Trp Leu Ser Thr Gln 
                325                 330                 335     


Asn Met Thr Ala Lys Asp Ala Tyr Glu Tyr Phe Val Leu Lys Ala Gln 
            340                 345                 350         


Glu Ile Ala Val Ser Lys Asn Trp Ser Pro Val Asn Trp Glu Glu Thr 
        355                 360                 365             


Phe Asn Thr Phe Pro Ala Lys Leu His Lys Lys Thr Val Val His Asn 
    370                 375                 380                 


Trp Leu Gly Pro Gly Val Cys Pro Lys Val Val Ala Lys Gly Phe Arg 
385                 390                 395                 400 


Cys Ile Phe Ser Asn Gln Gly Val Trp Tyr Leu Asp His Leu Asp Val 
                405                 410                 415     


Pro Trp Asp Glu Val Tyr Thr Ala Glu Pro Leu Glu Gly Ile Glu Lys 
            420                 425                 430         


Ser Ser Glu Gln Glu Leu Val Ile Gly Gly Glu Val Cys Met Trp Gly 
        435                 440                 445             


Glu Thr Ala Asp Thr Ser Asn Val Gln Gln Thr Ile Trp Pro Arg Ala 
    450                 455                 460                 


Ala Ala Ala Ala Glu Arg Leu Trp Ser Gln Arg Asp Ser Thr Asn Ile 
465                 470                 475                 480 


Thr Val Thr Ala Leu Pro Arg Leu Gln Asn Phe Arg Cys Leu Leu Asn 
                485                 490                 495     


Lys Arg Gly Val Ala Ala Ala Pro Val Lys Asn Tyr Tyr Ala Arg Arg 
            500                 505                 510         


Ala Pro Ser Gly Pro Gly Ser Cys Tyr Glu Gln 
        515                 520             


<210>  2
<211>  1569
<212>  DNA
<213>  Canavalia ensiformis

<400>  2
gctactttga agtccatcat cgagccaact gagtccttga cttacttgtg gccattgcca       60

gctgacttca cttctggtga cgaaactttg tctgttgacc cagctttgac tttgtccgtt      120

gctggtaatg gtggtggttc ctccattttg agagatgctt tcgacagata cagaggtatt      180

atcttcaagc actcctccgt tggattctct ttgatcagaa agttgagaga gagattggtt      240

tccgtttccg cttacgacat tgctactttg aagatcactg ttcactccga caacgaagag      300

ttgcagttgg gtgttgacga gacttacact ttgttggttc caaaggctaa ggactcctac      360

gttgctggtg aggttactat cgaggctaac actgtttacg gtgctttgag aggtttggag      420

actttctccc agttgtgttc cttcgactac tctgacaaga ctatcaagat ttacaaggct      480

ccttggtcca tccaggacaa gccaagattt tcctacagag gtttgttgtt ggacacttcc      540

agacactact tgccaatcaa cgttatcaag cagatcatcg agtccatgtc ctacgctaag      600

ttgaacgttt tgcactggca catcatcgac gaagagtctt tcccattgga ggttccaact      660

tacccaaact tgtggaaggg ttcctacact aagtgggaga gatacactgt tgaggacgct      720

tacgagatcg ttaacttcgc taagatgaga ggtattaacg ttatggctga ggttgacgtt      780

ccaggtcatg ctgaatcttg gggtgctggt tatccaaatt tgtggccatc tccatcctgt      840

agagagccat tggacgtttc caagaacttc actttcgacg ttatctccgg aatcttgact      900

gacatcagaa agatattccc attcgagttg ttccacttgg gaggtgacga ggttaatact      960

gactgttgga cttccacttc ccacgttaag gaatggttgt ccactcagaa catgactgct     1020

aaggatgctt acgaatactt cgttttgaag gctcaagaga tcgctgtttc taagaactgg     1080

tcccctgtta actgggaaga gactttcaac actttcccag ctaagttgca caagaaaact     1140

gttgttcaca actggttggg tccaggtgtt tgtccaaagg ttgttgctaa gggtttcaga     1200

tgtatcttct ccaaccaggg tgtttggtac ttggaccact tggatgttcc ttgggacgag     1260

gtttacactg ctgaaccatt ggaaggtatc gagaagtcct ctgagcaaga gttggttatc     1320

ggtggtgaag tttgtatgtg gggtgagact gctgacactt ctaacgttca gcagactatc     1380

tggccaagag ccgcagctgc tgctgaaaga ttgtggtccc aaagagactc cactaacatc     1440

actgttactg ctttgccaag attgcagaac ttcagatgtt tgttgaacaa gagaggtgtt     1500

gctgctgctc cagttaagaa ctactacgct agaagagccc catccggtcc aggttcttgt     1560

tacgaacaa                                                             1569


<210>  3
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer JB-01

<400>  3
ctcacctacc tctggcccct tcccgc                                            26


<210>  4
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer JB-07

<400>  4
ttattggtca taacatgacc ctggaccaac agg                                    33


<210>  5
<211>  25
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer JB-02

<400>  5
gaggagcttc aatttggagt ggatg                                             25


<210>  6
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer JB-06

<400>  6
atcagctgtc tcaccccaca tgcaaacttc tc                                     32


<210>  7
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer JB-08

<400>  7
aagtttgcat gtggggtgag ac                                                22


<210>  8
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer JB-09

<400>  8
gcaaacaata tggcctagag ctg                                               23


<210>  9
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CDSIII-short

<400>  9
attctagagg ccgaggcggc cgacatgt                                          28


<210>  10
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  JB-10

<400>  10
aagagtcctt ggctttggga ac                                                22


<210>  11
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Okib57-Adapter

<400>  11
gtaggaattc gggttgtagg gaggtcgaca ttgcc                                  35


<210>  12
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  JB-11

<400>  12
tcaatgtcgc aatgtcatag gc                                                22


<210>  13
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  JB-12

<400>  13
atgagactga acccaacact gc                                                22


<210>  14
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Okib58

<400>  14
ggcaatgtcg acctccctac aac                                               23


<210>  15
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Okib59

<400>  15
ctccctacaa cccgaattcc tac                                               23


<210>  16
<211>  553
<212>  PRT
<213>  Canavalia ensiformis

<400>  16

Met Phe Leu Cys Ile Pro Arg Trp Phe Ser Ser Pro Leu Leu Ile Leu 
1               5                   10                  15      


Phe Val Ile Tyr Cys Ala Leu Phe Ala Pro Gln Ala Ala Ser Ala Thr 
            20                  25                  30          


Leu Lys Ser Ile Ile Glu Pro Thr Glu Ser Leu Thr Tyr Leu Trp Pro 
        35                  40                  45              


Leu Pro Ala Asp Phe Thr Ser Gly Asp Glu Thr Leu Ser Val Asp Pro 
    50                  55                  60                  


Ala Leu Thr Leu Ser Val Ala Gly Asn Gly Gly Gly Ser Ser Ile Leu 
65                  70                  75                  80  


Arg Asp Ala Phe Asp Arg Tyr Arg Gly Ile Ile Phe Lys His Ser Ser 
                85                  90                  95      


Val Gly Phe Ser Leu Ile Arg Lys Leu Arg Glu Arg Leu Val Ser Val 
            100                 105                 110         


Ser Ala Tyr Asp Ile Ala Thr Leu Lys Ile Thr Val His Ser Asp Asn 
        115                 120                 125             


Glu Glu Leu Gln Leu Gly Val Asp Glu Thr Tyr Thr Leu Leu Val Pro 
    130                 135                 140                 


Lys Ala Lys Asp Ser Tyr Val Ala Gly Glu Val Thr Ile Glu Ala Asn 
145                 150                 155                 160 


Thr Val Tyr Gly Ala Leu Arg Gly Leu Glu Thr Phe Ser Gln Leu Cys 
                165                 170                 175     


Ser Phe Asp Tyr Ser Asp Lys Thr Ile Lys Ile Tyr Lys Ala Pro Trp 
            180                 185                 190         


Ser Ile Gln Asp Lys Pro Arg Phe Ser Tyr Arg Gly Leu Leu Leu Asp 
        195                 200                 205             


Thr Ser Arg His Tyr Leu Pro Ile Asn Val Ile Lys Gln Ile Ile Glu 
    210                 215                 220                 


Ser Met Ser Tyr Ala Lys Leu Asn Val Leu His Trp His Ile Ile Asp 
225                 230                 235                 240 


Glu Glu Ser Phe Pro Leu Glu Val Pro Thr Tyr Pro Asn Leu Trp Lys 
                245                 250                 255     


Gly Ser Tyr Thr Lys Trp Glu Arg Tyr Thr Val Glu Asp Ala Tyr Glu 
            260                 265                 270         


Ile Val Asn Phe Ala Lys Met Arg Gly Ile Asn Val Met Ala Glu Val 
        275                 280                 285             


Asp Val Pro Gly His Ala Glu Ser Trp Gly Ala Gly Tyr Pro Asn Leu 
    290                 295                 300                 


Trp Pro Ser Pro Ser Cys Arg Glu Pro Leu Asp Val Ser Lys Asn Phe 
305                 310                 315                 320 


Thr Phe Asp Val Ile Ser Gly Ile Leu Thr Asp Ile Arg Lys Ile Phe 
                325                 330                 335     


Pro Phe Glu Leu Phe His Leu Gly Gly Asp Glu Val Asn Thr Asp Cys 
            340                 345                 350         


Trp Thr Ser Thr Ser His Val Lys Glu Trp Leu Ser Thr Gln Asn Met 
        355                 360                 365             


Thr Ala Lys Asp Ala Tyr Glu Tyr Phe Val Leu Lys Ala Gln Glu Ile 
    370                 375                 380                 


Ala Val Ser Lys Asn Trp Ser Pro Val Asn Trp Glu Glu Thr Phe Asn 
385                 390                 395                 400 


Thr Phe Pro Ala Lys Leu His Lys Lys Thr Val Val His Asn Trp Leu 
                405                 410                 415     


Gly Pro Gly Val Cys Pro Lys Val Val Ala Lys Gly Phe Arg Cys Ile 
            420                 425                 430         


Phe Ser Asn Gln Gly Val Trp Tyr Leu Asp His Leu Asp Val Pro Trp 
        435                 440                 445             


Asp Glu Val Tyr Thr Ala Glu Pro Leu Glu Gly Ile Glu Lys Ser Ser 
    450                 455                 460                 


Glu Gln Glu Leu Val Ile Gly Gly Glu Val Cys Met Trp Gly Glu Thr 
465                 470                 475                 480 


Ala Asp Thr Ser Asn Val Gln Gln Thr Ile Trp Pro Arg Ala Ala Ala 
                485                 490                 495     


Ala Ala Glu Arg Leu Trp Ser Gln Arg Asp Ser Thr Asn Ile Thr Val 
            500                 505                 510         


Thr Ala Leu Pro Arg Leu Gln Asn Phe Arg Cys Leu Leu Asn Lys Arg 
        515                 520                 525             


Gly Val Ala Ala Ala Pro Val Lys Asn Tyr Tyr Ala Arg Arg Ala Pro 
    530                 535                 540                 


Ser Gly Pro Gly Ser Cys Tyr Glu Gln 
545                 550             


<210>  17
<211>  1662
<212>  DNA
<213>  Canavalia ensiformis

<400>  17
atgtttctgt gcatacccag atggttctct tcacctcttc tcattctctt tgtcatttac       60

tgtgccctct ttgctcctca agctgcttct gccacactca aatctatcat tgaacccact      120

gagtccctca catacctttg gcccctcccc gcagacttca cttcaggcga tgaaactctt      180

tccgttgacc ctgcacttac cctctctgtc gccggcaacg gtggtggctc ttccattctc      240

agagatgcat ttgaccgata cagaggaatc atattcaagc acagcagtgt tgggttcagt      300

ctcataagaa agttaaggga aagattggtg tctgtttctg cctatgacat tgcgacattg      360

aagatcactg tccattcaga taacgaggag cttcaacttg gagtggatga aacctatacc      420

ttgctggttc ccaaagccaa ggactcttat gttgctgggg aagtcacaat tgaggcaaac      480

actgtttatg gtgcattgcg cggattagag acattcagcc agttgtgttc tttcgattat      540

tcggataaaa caataaaaat atacaaggca ccttggtcca tccaagataa acctagattt      600

tcctatcgtg ggcttttgtt ggacacatcg aggcactatt taccaattaa cgtaattaag      660

cagattattg aatctatgtc ctatgctaaa cttaatgttc tacattggca catcatagac      720

gaggagtcat ttcctcttga ggtacctaca tatccaaact tgtggaaagg ttcatataca      780

aagtgggaac gttacacggt agaagacgca tatgaaattg tcaacttcgc caaaatgaga      840

ggcataaatg tgatggcaga agtggatgtt cctggtcatg cagaatcatg gggtgctgga      900

tatcccaatc tttggccgtc accttcctgt agggagccac tggatgtttc aaagaatttt      960

acttttgatg tcatttctgg tatcctgaca gatataagaa agattttccc gtttgagcta     1020

tttcacttgg gtggtgatga agttaataca gattgctgga ccagtacttc tcatgtgaag     1080

gaatggcttt cgactcaaaa catgactgct aaagatgcct atgaatattt tgtactgaag     1140

gcccaagaga tagctgtttc aaaaaattgg agtccggtga actgggaaga aaccttcaat     1200

acatttccag caaagctcca taagaaaact gtggtgcata actggttggg ccctggggtt     1260

tgtccaaagg ttgttgcaaa aggtttcagg tgcattttca gtaatcaggg tgtctggtat     1320

cttgaccatc tggatgtacc ttgggatgag gtctatactg ctgagccact agaaggaata     1380

gaaaaatctt ctgaacaaga gcttgtaatt ggaggagaag tttgcatgtg gggtgagaca     1440

gctgatacat ccaatgttca gcaaacaata tggcctagag ctgctgcagc tgcagaacgc     1500

ttatggagtc agagagattc tacaaatatt actgtaactg cgttgccccg gttacaaaac     1560

ttcagatgtc tattgaataa acgtggagtt gcagctgctc ctgtgaaaaa ttattatgct     1620

agaagggctc ctagtggtcc aggctcatgt tatgagcaat aa                        1662


<210>  18
<211>  1572
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized sequence (for human cells)

<400>  18
gccacactga agtccatcat cgagcccacc gagagcctga cctacctgtg gcctctgccc       60

gccgatttca ccagcggcga cgagacactg tccgtggatc ctgccctgac actgagcgtg      120

gccggaaatg gcggcggaag cagcatcctg agagatgcct tcgaccggta cagaggcatc      180

atcttcaagc acagcagcgt gggcttcagc ctgatccgga agctgcgcga gagactggtg      240

tccgtgtccg cctacgatat cgccaccctg aagatcaccg tgcactccga caacgaggaa      300

ctgcagctgg gcgtggacga gacatacacc ctgctggtgc ccaaggccaa ggacagctat      360

gtggccggcg aagtgaccat cgaggccaac acagtgtacg gcgccctgag aggcctggaa      420

accttcagcc agctgtgcag cttcgactac agcgacaaga ccatcaagat ctacaaggcc      480

ccttggagca tccaggacaa gccccggttc agctacagag gcctgctgct ggacaccagc      540

agacactacc tgcccatcaa cgtgatcaag cagatcatcg agagcatgag ctacgccaag      600

ctgaacgtgc tgcactggca catcatcgac gaggaatcct tcccactgga agtgcccacc      660

taccccaacc tgtggaaggg cagctacacc aagtgggagc ggtacaccgt ggaagatgcc      720

tacgagatcg tgaacttcgc caagatgcgg ggcatcaatg tgatggccga ggtggacgtg      780

ccaggccacg ctgaatcttg gggagccggc taccctaatc tgtggcccag ccccagctgt      840

cgcgaacccc tggacgtgtc caagaacttc accttcgacg tgatcagcgg catcctgacc      900

gatatcagaa agatcttccc attcgagctg ttccacctgg gaggcgacga agtgaacacc      960

gactgctgga ccagcaccag ccacgtgaaa gagtggctga gcacccagaa catgaccgcc     1020

aaggacgcct acgagtactt cgtgctgaag gcccaggaaa tcgccgtgtc taagaattgg     1080

agccccgtga actgggagga aacctttaac accttccctg ccaaactgca caagaaaacc     1140

gtggtgcaca attggctggg ccctggcgtg tgccctaagg tggtggccaa gggcttccgc     1200

tgcatattca gcaaccaggg cgtgtggtat ctggaccacc tggatgtgcc ctgggacgag     1260

gtgtacacag ccgagcctct ggaaggcatc gagaagtcct ccgagcagga actcgtgatc     1320

ggcggagaag tgtgcatgtg gggcgagaca gccgacacct ccaacgtgca gcagaccatc     1380

tggcctagag ccgccgctgc cgctgaaaga ctgtggtccc agagagacag caccaacatc     1440

accgtgaccg ccctgccccg gctgcagaac tttagatgcc tgctgaacaa gcggggcgtg     1500

gccgctgccc ccgtgaagaa ttactatgcc agaagggccc ccagcggccc tggcagctgt     1560

tatgaacagt ga                                                         1572


