                                SEQUENCE LISTING

<110> Yang, Jie
      Shaw, Andrew
      Dhawan, Ish Kumar
      Campopiano, Onorato
      Rao, Kripa
      Codexis, Inc.

<120> Improved Endoglucanases
  

<130> 026501-002810PC

<140> WO Not yet assigned 
<141> Not yet assigned  

<150> US 61/165,312       
<151> 2009-03-31  

<160> 16

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 222
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic wild-type Streptomyces avermitilis
      endoglucanase catalytic domain (native SavO-EG
      catD, CatD)

<400> 1
Asp Thr Ser Ile Cys Glu Pro Phe Gly Ser Thr Thr Ile Gln Gly Arg
 1               5                  10                  15      
Tyr Val Val Gln Asn Asn Arg Trp Gly Thr Ser Glu Ala Gln Cys Ile
            20                  25                  30          
Thr Ala Thr Asp Ser Gly Phe Arg Ile Thr Gln Ala Asp Gly Ser Val
        35                  40                  45              
Pro Thr Asn Gly Ala Pro Lys Ser Tyr Pro Ser Val Tyr Asn Gly Cys
    50                  55                  60                  
His Tyr Thr Asn Cys Ser Pro Gly Thr Ser Leu Pro Ala Gln Leu Ser
65                  70                  75                  80  
Thr Val Ser Ser Ala Pro Thr Ser Ile Ser Tyr Ser Tyr Val Ser Asn
                85                  90                  95      
Ala Met Tyr Asp Ala Ala Tyr Asp Ile Trp Leu Asp Pro Thr Pro Arg
            100                 105                 110         
Thr Asp Gly Val Asn Arg Thr Glu Ile Met Val Trp Phe Asn Lys Val
        115                 120                 125             
Gly Ser Val Gln Pro Val Gly Ser Gln Val Gly Thr Ala Thr Val Ala
    130                 135                 140                 
Gly Arg Gln Trp Gln Val Trp Ser Gly Asn Asn Gly Ser Asn Asp Val
145                 150                 155                 160 
Leu Ser Phe Val Ala Pro Ser Ala Ile Thr Ser Trp Ser Phe Asp Val
                165                 170                 175     
Met Asp Phe Val Arg Gln Ala Val Ser Arg Gly Leu Ala Gln Asn Ser
            180                 185                 190         
Trp Tyr Leu Thr Ser Val Gln Ala Gly Phe Glu Pro Trp Gln Asn Gly
        195                 200                 205             
Ala Gly Leu Ala Val Thr Ser Phe Ser Ser Thr Val Asn Thr
    210                 215                 220         


<210> 2
<211> 226
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic wild-type Streptomyces avermitilis
      endoglucanase catalytic domain (native SavO-EG catD,
      CatD) post-translationally modified with N-terminal DTSM
      spacer (CDX-SavOCat, CDX native SavOcat, SavO native 4)

<400> 2
Asp Thr Ser Met Asp Thr Ser Ile Cys Glu Pro Phe Gly Ser Thr Thr
 1               5                  10                  15      
Ile Gln Gly Arg Tyr Val Val Gln Asn Asn Arg Trp Gly Thr Ser Glu
            20                  25                  30          
Ala Gln Cys Ile Thr Ala Thr Asp Ser Gly Phe Arg Ile Thr Gln Ala
        35                  40                  45              
Asp Gly Ser Val Pro Thr Asn Gly Ala Pro Lys Ser Tyr Pro Ser Val
    50                  55                  60                  
Tyr Asn Gly Cys His Tyr Thr Asn Cys Ser Pro Gly Thr Ser Leu Pro
65                  70                  75                  80  
Ala Gln Leu Ser Thr Val Ser Ser Ala Pro Thr Ser Ile Ser Tyr Ser
                85                  90                  95      
Tyr Val Ser Asn Ala Met Tyr Asp Ala Ala Tyr Asp Ile Trp Leu Asp
            100                 105                 110         
Pro Thr Pro Arg Thr Asp Gly Val Asn Arg Thr Glu Ile Met Val Trp
        115                 120                 125             
Phe Asn Lys Val Gly Ser Val Gln Pro Val Gly Ser Gln Val Gly Thr
    130                 135                 140                 
Ala Thr Val Ala Gly Arg Gln Trp Gln Val Trp Ser Gly Asn Asn Gly
145                 150                 155                 160 
Ser Asn Asp Val Leu Ser Phe Val Ala Pro Ser Ala Ile Thr Ser Trp
                165                 170                 175     
Ser Phe Asp Val Met Asp Phe Val Arg Gln Ala Val Ser Arg Gly Leu
            180                 185                 190         
Ala Gln Asn Ser Trp Tyr Leu Thr Ser Val Gln Ala Gly Phe Glu Pro
        195                 200                 205             
Trp Gln Asn Gly Ala Gly Leu Ala Val Thr Ser Phe Ser Ser Thr Val
    210                 215                 220                 
Asn Thr
225     


<210> 3
<211> 375
<212> PRT
<213> Streptomyces avermitilis

<220> 
<223> wild-type Streptomyces avermitilis strain MA-4680
      endo-1,4-beta-glucanase, 1,4-beta-D-glucan
      glucanohydrolase, locus SAV_555, celA1,
      endoglucanase (SavO EG, native SavO)

<400> 3
Met Arg Pro Ser Pro Pro His Ala Arg Ser Ala Arg Gly Leu Phe Gly
 1               5                  10                  15      
Ala Leu Leu Thr Ala Leu Val Ser Leu Ala Ala Leu Leu Thr Thr Ala
            20                  25                  30          
Ser Val Ala Gln Ala Asp Thr Ser Ile Cys Glu Pro Phe Gly Ser Thr
        35                  40                  45              
Thr Ile Gln Gly Arg Tyr Val Val Gln Asn Asn Arg Trp Gly Thr Ser
    50                  55                  60                  
Glu Ala Gln Cys Ile Thr Ala Thr Asp Ser Gly Phe Arg Ile Thr Gln
65                  70                  75                  80  
Ala Asp Gly Ser Val Pro Thr Asn Gly Ala Pro Lys Ser Tyr Pro Ser
                85                  90                  95      
Val Tyr Asn Gly Cys His Tyr Thr Asn Cys Ser Pro Gly Thr Ser Leu
            100                 105                 110         
Pro Ala Gln Leu Ser Thr Val Ser Ser Ala Pro Thr Ser Ile Ser Tyr
        115                 120                 125             
Ser Tyr Val Ser Asn Ala Met Tyr Asp Ala Ala Tyr Asp Ile Trp Leu
    130                 135                 140                 
Asp Pro Thr Pro Arg Thr Asp Gly Val Asn Arg Thr Glu Ile Met Val
145                 150                 155                 160 
Trp Phe Asn Lys Val Gly Ser Val Gln Pro Val Gly Ser Gln Val Gly
                165                 170                 175     
Thr Ala Thr Val Ala Gly Arg Gln Trp Gln Val Trp Ser Gly Asn Asn
            180                 185                 190         
Gly Ser Asn Asp Val Leu Ser Phe Val Ala Pro Ser Ala Ile Thr Ser
        195                 200                 205             
Trp Ser Phe Asp Val Met Asp Phe Val Arg Gln Ala Val Ser Arg Gly
    210                 215                 220                 
Leu Ala Gln Asn Ser Trp Tyr Leu Thr Ser Val Gln Ala Gly Phe Glu
225                 230                 235                 240 
Pro Trp Gln Asn Gly Ala Gly Leu Ala Val Thr Ser Phe Ser Ser Thr
                245                 250                 255     
Val Asn Thr Gly Gly Gly Asn Pro Gly Asp Pro Gly Ser Pro Thr Ala
            260                 265                 270         
Cys Lys Val Ala Tyr Ala Thr Asn Val Trp Gln Gly Gly Phe Thr Ala
        275                 280                 285             
Asp Val Thr Val Thr Asn Thr Gly Ser Ser Pro Val Asn Gly Trp Lys
    290                 295                 300                 
Leu Ala Phe Thr Leu Pro Ala Gly Gln Gln Ile Thr Ser Ser Trp Ser
305                 310                 315                 320 
Ala Gly Val Ser Pro Ser Ser Gly Ala Val Thr Ala Ser Ser Leu Ala
                325                 330                 335     
Tyr Asn Ala Gln Ile Ala Thr Gly Gly Arg Val Ser Phe Gly Phe Gln
            340                 345                 350         
Gly Ser Tyr Ser Gly Thr Phe Ala Ala Pro Ala Gly Phe Ser Leu Asn
        355                 360                 365             
Gly Ala Ala Cys Thr Thr Ala
    370                 375 


<210> 4
<211> 1017
<212> DNA
<213> Artificial Sequence

<220> 
<223> synthetic codon optimized Streptomyces avermitilis
      endoglucanase (EG) including catalytic domain
      (CatD), linker and cellulose binding domain (CBM)

<220> 
<221> misc_feature   
<222> (1)...(666)
<223> catalytic domain (CatD)

<220> 
<221> misc_feature   
<222> (667)...(1017)
<223> linker and cellulose binding domain (CBM)

<400> 4
gatacttcta tttgtgaacc atttggatct actacaatcc aaggacgcta tgtagtacag 60
aataatcgtt ggggcacaag tgaagctcaa tgtataacag caaccgattc aggattccgc 120
attacccaag cggatggttc tgtaccaacg aatggtgctc ctaaatctta tccaagtgtc 180
tataacggat gtcattatac aaattgctct cctgggacgt cgcttccagc ccaattatca 240
acagtttcat ctgctccaac atctattagt tattcttacg tgtcaaatgc catgtatgat 300
gccgcgtacg acatttggtt agatccaaca ccgcgcacag atggtgtaaa tcgaacagaa 360
atcatggtgt ggtttaataa agtaggcagc gtgcagccag taggatctca agtaggtacg 420
gctacggtgg caggccgaca atggcaggtt tggtcaggaa ataacggatc taatgatgtg 480
cttagtttcg tagctccaag tgccattact tcatggtctt ttgatgtaat ggactttgtt 540
cgtcaagccg ttagtcgcgg attagctcaa aactcttggt atttgacatc tgtccaagct 600
ggatttgaac cgtggcagaa tggcgctgga ctagcagtaa cttctttttc gtctacggta 660
aacactggag gcggcaatcc aggagatccg ggatctccta ctgcttgcaa agttgcttat 720
gcaacgaatg tttggcaagg tggatttacg gctgacgtaa ctgtaacgaa tacagggtcc 780
tcacctgtca atggatggaa acttgctttt acgttaccag caggccaaca aattacttcg 840
tcttggtcag caggagtatc tccgtcatct ggagcagtga cagcttctag ccttgcatac 900
aatgcacaaa ttgcaaccgg gggacgtgta tcatttggat ttcaaggtag ttattctggc 960
acatttgcag cacctgcagg tttttcttta aatggggctg cttgcacaac ggcatga    1017

<210> 5
<211> 1029
<212> DNA
<213> Artificial Sequence

<220> 
<223> synthetic codon optimized Streptomyces avermitilis
      endoglucanase (EG) including catalytic domain
      (CatD), linker and cellulose binding domain (CBM)
      optimized with N-terminal DTSM spacer

<220> 
<221> misc_feature   
<222> (1)...(12)
<223> N-terminal DTSM spacer

<220> 
<221> misc_feature   
<222> (13)...(678)
<223> catalytic domain (CatD)

<220> 
<221> misc_feature   
<222> (679)...(1029)
<223> linker and cellulose binding domain (CBM)

<400> 5
gatactagta tggatacttc tatttgtgaa ccatttggat ctactacaat ccaaggacgc 60
tatgtagtac agaataatcg ttggggcaca agtgaagctc aatgtataac agcaaccgat 120
tcaggattcc gcattaccca agcggatggt tctgtaccaa cgaatggtgc tcctaaatct 180
tatccaagtg tctataacgg atgtcattat acaaattgct ctcctgggac gtcgcttcca 240
gcccaattat caacagtttc atctgctcca acatctatta gttattctta cgtgtcaaat 300
gccatgtatg atgccgcgta cgacatttgg ttagatccaa caccgcgcac agatggtgta 360
aatcgaacag aaatcatggt gtggtttaat aaagtaggca gcgtgcagcc agtaggatct 420
caagtaggta cggctacggt ggcaggccga caatggcagg tttggtcagg aaataacgga 480
tctaatgatg tgcttagttt cgtagctcca agtgccatta cttcatggtc ttttgatgta 540
atggactttg ttcgtcaagc cgttagtcgc ggattagctc aaaactcttg gtatttgaca 600
tctgtccaag ctggatttga accgtggcag aatggcgctg gactagcagt aacttctttt 660
tcgtctacgg taaacactgg aggcggcaat ccaggagatc cgggatctcc tactgcttgc 720
aaagttgctt atgcaacgaa tgtttggcaa ggtggattta cggctgacgt aactgtaacg 780
aatacagggt cctcacctgt caatggatgg aaacttgctt ttacgttacc agcaggccaa 840
caaattactt cgtcttggtc agcaggagta tctccgtcat ctggagcagt gacagcttct 900
agccttgcat acaatgcaca aattgcaacc gggggacgtg tatcatttgg atttcaaggt 960
agttattctg gcacatttgc agcacctgca ggtttttctt taaatggggc tgcttgcaca 1020
acggcatga                                                         1029

<210> 6
<211> 222
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic Streptomyces avermitilis endoglucanase
      (EG) variant A29P, A53P, S74P, N191P (SavO variant 1)

<400> 6
Asp Thr Ser Ile Cys Glu Pro Phe Gly Ser Thr Thr Ile Gln Gly Arg
 1               5                  10                  15      
Tyr Val Val Gln Asn Asn Arg Trp Gly Thr Ser Glu Pro Gln Cys Ile
            20                  25                  30          
Thr Ala Thr Asp Ser Gly Phe Arg Ile Thr Gln Ala Asp Gly Ser Val
        35                  40                  45              
Pro Thr Asn Gly Pro Pro Lys Ser Tyr Pro Ser Val Tyr Asn Gly Cys
    50                  55                  60                  
His Tyr Thr Asn Cys Ser Pro Gly Thr Pro Leu Pro Ala Gln Leu Ser
65                  70                  75                  80  
Thr Val Ser Ser Ala Pro Thr Ser Ile Ser Tyr Ser Tyr Val Ser Asn
                85                  90                  95      
Ala Met Tyr Asp Ala Ala Tyr Asp Ile Trp Leu Asp Pro Thr Pro Arg
            100                 105                 110         
Thr Asp Gly Val Asn Arg Thr Glu Ile Met Val Trp Phe Asn Lys Val
        115                 120                 125             
Gly Ser Val Gln Pro Val Gly Ser Gln Val Gly Thr Ala Thr Val Ala
    130                 135                 140                 
Gly Arg Gln Trp Gln Val Trp Ser Gly Asn Asn Gly Ser Asn Asp Val
145                 150                 155                 160 
Leu Ser Phe Val Ala Pro Ser Ala Ile Thr Ser Trp Ser Phe Asp Val
                165                 170                 175     
Met Asp Phe Val Arg Gln Ala Val Ser Arg Gly Leu Ala Gln Pro Ser
            180                 185                 190         
Trp Tyr Leu Thr Ser Val Gln Ala Gly Phe Glu Pro Trp Gln Asn Gly
        195                 200                 205             
Ala Gly Leu Ala Val Thr Ser Phe Ser Ser Thr Val Asn Thr
    210                 215                 220         


<210> 7
<211> 222
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic Streptomyces avermitilis endoglucanase
      (EG) variant S10W, A29P, Q43R, A53P, S74P, V82I,
      M98V, N191P (SavO variant 3)

<400> 7
Asp Thr Ser Ile Cys Glu Pro Phe Gly Trp Thr Thr Ile Gln Gly Arg
 1               5                  10                  15      
Tyr Val Val Gln Asn Asn Arg Trp Gly Thr Ser Glu Pro Gln Cys Ile
            20                  25                  30          
Thr Ala Thr Asp Ser Gly Phe Arg Ile Thr Arg Ala Asp Gly Ser Val
        35                  40                  45              
Pro Thr Asn Gly Pro Pro Lys Ser Tyr Pro Ser Val Tyr Asn Gly Cys
    50                  55                  60                  
His Tyr Thr Asn Cys Ser Pro Gly Thr Pro Leu Pro Ala Gln Leu Ser
65                  70                  75                  80  
Thr Ile Ser Ser Ala Pro Thr Ser Ile Ser Tyr Ser Tyr Val Ser Asn
                85                  90                  95      
Ala Val Tyr Asp Ala Ala Tyr Asp Ile Trp Leu Asp Pro Thr Pro Arg
            100                 105                 110         
Thr Asp Gly Val Asn Arg Thr Glu Ile Met Val Trp Phe Asn Lys Val
        115                 120                 125             
Gly Ser Val Gln Pro Val Gly Ser Gln Val Gly Thr Ala Thr Val Ala
    130                 135                 140                 
Gly Arg Gln Trp Gln Val Trp Ser Gly Asn Asn Gly Ser Asn Asp Val
145                 150                 155                 160 
Leu Ser Phe Val Ala Pro Ser Ala Ile Thr Ser Trp Ser Phe Asp Val
                165                 170                 175     
Met Asp Phe Val Arg Gln Ala Val Ser Arg Gly Leu Ala Gln Pro Ser
            180                 185                 190         
Trp Tyr Leu Thr Ser Val Gln Ala Gly Phe Glu Pro Trp Gln Asn Gly
        195                 200                 205             
Ala Gly Leu Ala Val Thr Ser Phe Ser Ser Thr Val Asn Thr
    210                 215                 220         


<210> 8
<211> 222
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic Streptomyces avermitilis endoglucanase
      (EG) variant S10W, T12V, A29P, Q43R, V48K, A53P,
      N68I, S74P, L79I, T81K, V82I, M98V, S152M, S185Q,
      N191P (SavO variant 5)

<400> 8
Asp Thr Ser Ile Cys Glu Pro Phe Gly Trp Thr Val Ile Gln Gly Arg
 1               5                  10                  15      
Tyr Val Val Gln Asn Asn Arg Trp Gly Thr Ser Glu Pro Gln Cys Ile
            20                  25                  30          
Thr Ala Thr Asp Ser Gly Phe Arg Ile Thr Arg Ala Asp Gly Ser Lys
        35                  40                  45              
Pro Thr Asn Gly Pro Pro Lys Ser Tyr Pro Ser Val Tyr Asn Gly Cys
    50                  55                  60                  
His Tyr Thr Ile Cys Ser Pro Gly Thr Pro Leu Pro Ala Gln Ile Ser
65                  70                  75                  80  
Lys Ile Ser Ser Ala Pro Thr Ser Ile Ser Tyr Ser Tyr Val Ser Asn
                85                  90                  95      
Ala Val Tyr Asp Ala Ala Tyr Asp Ile Trp Leu Asp Pro Thr Pro Arg
            100                 105                 110         
Thr Asp Gly Val Asn Arg Thr Glu Ile Met Val Trp Phe Asn Lys Val
        115                 120                 125             
Gly Ser Val Gln Pro Val Gly Ser Gln Val Gly Thr Ala Thr Val Ala
    130                 135                 140                 
Gly Arg Gln Trp Gln Val Trp Met Gly Asn Asn Gly Ser Asn Asp Val
145                 150                 155                 160 
Leu Ser Phe Val Ala Pro Ser Ala Ile Thr Ser Trp Ser Phe Asp Val
                165                 170                 175     
Met Asp Phe Val Arg Gln Ala Val Gln Arg Gly Leu Ala Gln Pro Ser
            180                 185                 190         
Trp Tyr Leu Thr Ser Val Gln Ala Gly Phe Glu Pro Trp Glu Asn Gly
        195                 200                 205             
Ala Gly Leu Ala Val Thr Ser Phe Ser Ser Thr Val Asn Thr
    210                 215                 220         


<210> 9
<211> 29
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic Bacillus megaterium ORF_2879 signal
      peptide

<400> 9
Met Lys Arg Ile Val Met Val Gly Phe Ile Leu Leu Phe Pro Leu Asn
 1               5                  10                  15      
Met Leu Ala Gly Pro Ile Ser Ser Ile Ala Glu Ala Gln
            20                  25                  


<210> 10
<211> 254
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic construct with Bacillus megaterium
      ORF_2879 signal peptide carboxy terminus,
      introduced dipeptide TS and SavO variant 5 EG
      amino terminus

<400> 10
Met Lys Arg Ile Val Met Val Gly Phe Ile Leu Leu Phe Pro Leu Asn
 1               5                  10                  15      
Met Leu Ala Gly Pro Ile Ser Ser Ile Ala Glu Ala Gln Thr Ser Met
            20                  25                  30          
Asp Thr Ser Ile Cys Glu Pro Phe Gly Trp Thr Val Ile Gln Gly Arg
        35                  40                  45              
Tyr Val Val Gln Asn Asn Arg Trp Gly Thr Ser Glu Pro Gln Cys Ile
    50                  55                  60                  
Thr Ala Thr Asp Ser Gly Phe Arg Ile Thr Arg Ala Asp Gly Ser Lys
65                  70                  75                  80  
Pro Thr Asn Gly Pro Pro Lys Ser Tyr Pro Ser Val Tyr Asn Gly Cys
                85                  90                  95      
His Tyr Thr Ile Cys Ser Pro Gly Thr Pro Leu Pro Ala Gln Ile Ser
            100                 105                 110         
Lys Ile Ser Ser Ala Pro Thr Ser Ile Ser Tyr Ser Tyr Val Ser Asn
        115                 120                 125             
Ala Val Tyr Asp Ala Ala Tyr Asp Ile Trp Leu Asp Pro Thr Pro Arg
    130                 135                 140                 
Thr Asp Gly Val Asn Arg Thr Glu Ile Met Val Trp Phe Asn Lys Val
145                 150                 155                 160 
Gly Ser Val Gln Pro Val Gly Ser Gln Val Gly Thr Ala Thr Val Ala
                165                 170                 175     
Gly Arg Gln Trp Gln Val Trp Met Gly Asn Asn Gly Ser Asn Asp Val
            180                 185                 190         
Leu Ser Phe Val Ala Pro Ser Ala Ile Thr Ser Trp Ser Phe Asp Val
        195                 200                 205             
Met Asp Phe Val Arg Gln Ala Val Gln Arg Gly Leu Ala Gln Pro Ser
    210                 215                 220                 
Trp Tyr Leu Thr Ser Val Gln Ala Gly Phe Glu Pro Trp Glu Asn Gly
225                 230                 235                 240 
Ala Gly Leu Ala Val Thr Ser Phe Ser Ser Thr Val Asn Thr
                245                 250                 


<210> 11
<211> 218
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic Hypocrea schweinitzii endoglucanase (EG)
      Cel12 catalytic domain (CatD)

<400> 11
Gln Thr Ser Cys Asp Gln Tyr Ala Thr Phe Ser Gly Asn Gly Tyr Ile
 1               5                  10                  15      
Val Ser Asn Asn Leu Trp Gly Ala Ser Ala Gly Ser Gly Phe Gly Cys
            20                  25                  30          
Val Thr Ser Val Ser Leu Asn Gly Ala Ala Ser Trp His Ala Asp Trp
        35                  40                  45              
Gln Trp Ser Gly Gly Gln Asn Asn Val Lys Ser Tyr Gln Asn Val Gln
    50                  55                  60                  
Ile Asn Ile Pro Gln Lys Arg Thr Val Asn Ser Ile Gly Ser Met Pro
65                  70                  75                  80  
Thr Thr Ala Ser Trp Ser Tyr Ser Gly Ser Asp Ile Arg Ala Asn Val
                85                  90                  95      
Ala Tyr Asp Leu Phe Thr Ala Ala Asn Pro Asn His Val Thr Tyr Ser
            100                 105                 110         
Gly Asp Tyr Glu Leu Met Ile Trp Leu Gly Lys Tyr Gly Asp Ile Gly
        115                 120                 125             
Pro Ile Gly Ser Ser Gln Gly Thr Val Asn Val Gly Gly Gln Thr Trp
    130                 135                 140                 
Thr Leu Tyr Tyr Gly Tyr Asn Gly Ala Met Gln Val Tyr Ser Phe Val
145                 150                 155                 160 
Ala Gln Ser Asn Thr Thr Ser Tyr Ser Gly Asp Val Lys Asn Phe Phe
                165                 170                 175     
Asn Tyr Leu Arg Asp Asn Lys Gly Tyr Asn Ala Gly Gly Gln Tyr Val
            180                 185                 190         
Leu Ser Tyr Gln Phe Gly Thr Glu Pro Phe Thr Gly Ser Gly Thr Leu
        195                 200                 205             
Asn Val Ala Ser Trp Thr Ala Ser Ile Asn
    210                 215             


<210> 12
<211> 227
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic Rhodothermus marinus endoglucanase (EG)
      Cel12 catalytic domain (CatD)

<400> 12
Thr Val Glu Leu Cys Gly Arg Trp Asp Ala Arg Asp Val Ala Gly Gly
 1               5                  10                  15      
Arg Tyr Arg Val Ile Asn Asn Val Trp Gly Ala Glu Thr Ala Gln Cys
            20                  25                  30          
Ile Glu Val Gly Leu Glu Thr Gly Asn Phe Thr Ile Thr Arg Ala Asp
        35                  40                  45              
His Asp Asn Gly Asn Asn Val Ala Ala Tyr Pro Ala Ile Tyr Phe Gly
    50                  55                  60                  
Cys His Trp Gly Ala Cys Thr Ser Asn Ser Gly Leu Pro Arg Arg Val
65                  70                  75                  80  
Gln Glu Leu Ser Asp Val Arg Thr Ser Trp Thr Leu Thr Pro Ile Thr
                85                  90                  95      
Thr Gly Arg Trp Asn Ala Ala Tyr Asp Ile Trp Phe Ser Pro Val Thr
            100                 105                 110         
Asn Ser Gly Asn Gly Tyr Ser Gly Gly Ala Glu Leu Met Ile Trp Leu
        115                 120                 125             
Asn Trp Asn Gly Gly Val Met Pro Gly Gly Ser Arg Val Ala Thr Val
    130                 135                 140                 
Glu Leu Ala Gly Ala Thr Trp Glu Val Trp Tyr Ala Asp Trp Asp Trp
145                 150                 155                 160 
Asn Tyr Ile Ala Tyr Arg Arg Thr Thr Pro Thr Thr Ser Val Ser Glu
                165                 170                 175     
Leu Asp Leu Lys Ala Phe Ile Asp Asp Ala Val Ala Arg Gly Tyr Ile
            180                 185                 190         
Arg Pro Glu Trp Tyr Leu His Ala Val Glu Thr Gly Phe Glu Leu Trp
        195                 200                 205             
Glu Gly Gly Ala Gly Leu Arg Ser Ala Asp Phe Ser Val Thr Val Gln
    210                 215                 220                 
Lys Leu Ala
225         


<210> 13
<211> 222
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic Streptomyces sp. 11AG8 endoglucanase
      (EG) Cel12 catalytic domain (CatD)

<400> 13
Asn Gln Gln Ile Cys Asp Arg Tyr Gly Thr Thr Thr Ile Gln Asp Arg
 1               5                  10                  15      
Tyr Val Val Gln Asn Asn Arg Trp Gly Thr Ser Ala Thr Gln Cys Ile
            20                  25                  30          
Asn Val Thr Gly Asn Gly Phe Glu Ile Thr Gln Ala Asp Gly Ser Val
        35                  40                  45              
Pro Thr Asn Gly Ala Pro Lys Ser Tyr Pro Ser Val Tyr Asp Gly Cys
    50                  55                  60                  
His Tyr Gly Asn Cys Ala Pro Arg Thr Thr Leu Pro Met Arg Ile Ser
65                  70                  75                  80  
Ser Ile Gly Ser Ala Pro Ser Ser Val Ser Tyr Arg Tyr Thr Gly Asn
                85                  90                  95      
Gly Val Tyr Asn Ala Ala Tyr Asp Ile Trp Leu Asp Pro Thr Pro Arg
            100                 105                 110         
Thr Asn Gly Val Asn Arg Thr Glu Ile Met Ile Trp Phe Asn Arg Val
        115                 120                 125             
Gly Pro Val Gln Pro Ile Gly Ser Pro Val Gly Thr Ala His Val Gly
    130                 135                 140                 
Gly Arg Ser Trp Glu Val Trp Thr Gly Ser Asn Gly Ser Asn Asp Val
145                 150                 155                 160 
Ile Ser Phe Leu Ala Pro Ser Ala Ile Ser Ser Trp Ser Phe Asp Val
                165                 170                 175     
Lys Asp Phe Val Asp Gln Ala Val Ser His Gly Leu Ala Thr Pro Asp
            180                 185                 190         
Trp Tyr Leu Thr Ser Ile Gln Ala Gly Phe Glu Pro Trp Glu Gly Gly
        195                 200                 205             
Thr Gly Leu Ala Val Asn Ser Phe Ser Ser Ala Val Asn Ala
    210                 215                 220         


<210> 14
<211> 222
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic Streptomyces lividans endoglucanase (EG)
      Cel12 catalytic domain (CatD)

<400> 14
Asp Thr Thr Ile Cys Glu Pro Phe Gly Thr Thr Thr Ile Gln Gly Arg
 1               5                  10                  15      
Tyr Val Val Gln Asn Asn Arg Trp Gly Ser Thr Ala Pro Gln Cys Val
            20                  25                  30          
Thr Ala Thr Asp Thr Gly Phe Arg Val Thr Gln Ala Asp Gly Ser Ala
        35                  40                  45              
Pro Thr Asn Gly Ala Pro Lys Ser Tyr Pro Ser Val Phe Asn Gly Cys
    50                  55                  60                  
His Tyr Thr Asn Cys Ser Pro Gly Thr Asp Leu Pro Val Arg Leu Asp
65                  70                  75                  80  
Thr Val Ser Ala Ala Pro Ser Ser Ile Ser Tyr Gly Phe Val Asp Gly
                85                  90                  95      
Ala Val Tyr Asn Ala Ser Tyr Asp Ile Trp Leu Asp Pro Thr Ala Arg
            100                 105                 110         
Thr Asp Gly Val Asn Gln Thr Glu Ile Met Ile Trp Phe Asn Arg Val
        115                 120                 125             
Gly Pro Ile Gln Pro Ile Gly Ser Pro Val Gly Thr Ala Ser Val Gly
    130                 135                 140                 
Gly Arg Thr Trp Glu Val Trp Ser Gly Gly Asn Gly Ser Asn Asp Val
145                 150                 155                 160 
Leu Ser Phe Val Ala Pro Ser Ala Ile Ser Gly Trp Ser Phe Asp Val
                165                 170                 175     
Met Asp Phe Val Arg Ala Thr Val Ala Arg Gly Leu Ala Glu Asn Asp
            180                 185                 190         
Trp Tyr Leu Thr Ser Val Gln Ala Gly Phe Glu Pro Trp Gln Asn Gly
        195                 200                 205             
Ala Gly Leu Ala Val Asn Ser Phe Ser Ser Thr Val Glu Thr
    210                 215                 220         


<210> 15
<211> 218
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic Trichoderma reesei endoglucanase (EG)
      Cel12 catalytic domain (CatD)

<400> 15
Gln Thr Ser Cys Asp Gln Trp Ala Thr Phe Thr Gly Asn Gly Tyr Thr
 1               5                  10                  15      
Val Ser Asn Asn Leu Trp Gly Ala Ser Ala Gly Ser Gly Phe Gly Cys
            20                  25                  30          
Val Thr Ala Val Ser Leu Ser Gly Gly Ala Ser Trp His Ala Asp Trp
        35                  40                  45              
Gln Trp Ser Gly Gly Gln Asn Asn Val Lys Ser Tyr Gln Asn Ser Gln
    50                  55                  60                  
Ile Ala Ile Pro Gln Lys Arg Thr Val Asn Ser Ile Ser Ser Met Pro
65                  70                  75                  80  
Thr Thr Ala Ser Trp Ser Tyr Ser Gly Ser Asn Ile Arg Ala Asn Val
                85                  90                  95      
Ala Tyr Asp Leu Phe Thr Ala Ala Asn Pro Asn His Val Thr Tyr Ser
            100                 105                 110         
Gly Asp Tyr Glu Leu Met Ile Trp Leu Gly Lys Tyr Gly Asp Ile Gly
        115                 120                 125             
Pro Ile Gly Ser Ser Gln Gly Thr Val Asn Val Gly Gly Gln Ser Trp
    130                 135                 140                 
Thr Leu Tyr Tyr Gly Tyr Asn Gly Ala Met Gln Val Tyr Ser Phe Val
145                 150                 155                 160 
Ala Gln Thr Asn Thr Thr Asn Tyr Ser Gly Asp Val Lys Asn Phe Phe
                165                 170                 175     
Asn Tyr Leu Arg Asp Asn Lys Gly Tyr Asn Ala Ala Gly Gln Tyr Val
            180                 185                 190         
Leu Ser Tyr Gln Phe Gly Thr Glu Pro Phe Thr Gly Ser Gly Thr Leu
        195                 200                 205             
Asn Val Ala Ser Trp Thr Ala Ser Ile Asn
    210                 215             


<210> 16
<211> 4
<212> PRT
<213> Artificial Sequence

<220> 
<223> synthetic N-terminal DTSM spacer

<400> 16
Asp Thr Ser Met
 1              



