                         SEQUENCE LISTING

<110>  Sd-Chemie AG
 
<120>  Optimized Cellulase Enzymes

<130>  139 202


<150>  EP10153355
<151>  2010-02-11

<160>  51    

<170>  PatentIn version 3.4

<210>  1
<211>  1509
<212>  DNA
<213>  Artificial

<220>
<223>  Coding Sequence for Talaromyces emersonii CBHI / Trichoderma 
       reesei -CBD fusion (mature CBH-a)

<400>  1
cagcaggccg gcacggcgac ggcagagaac cacccgcccc tgacatggca ggaatgcacc     60

gcccctggga gctgcaccac ccagaacggg gcggtcgttc ttgatgcgaa ctggcgttgg    120

gtgcacgatg tgaacggata caccaactgc tacacgggca atacctggga ccccacgtac    180

tgccctgacg acgaaacctg cgcccagaac tgtgcgctgg acggcgcgga ttacgagggc    240

acctacggcg tgacttcgtc gggcagctcc ttgaaactca atttcgtcac cgggtcgaac    300

gtcggatccc gtctctacct gctgcaggac gactcgacct atcagatctt caagctcctg    360

aaccgcgagt tcagctttga cgtcgatgtc tccaatcttc cgtgcggatt gaacggcgct    420

ctgtactttg tcgccatgga cgccgacggc ggcgtgtcca agtacccgaa caacaaggct    480

ggtgccaagt acggaaccgg gtattgcgac tcccaatgcc cacgggacct caagttcatc    540

gacggcgagg ccaacgtcga gggctggcag ccgtcttcga acaacgccaa caccggaatt    600

ggcgaccacg gctcctgctg tgcggagatg gatgtctggg aagcaaacag catctccaat    660

gcggtcactc cgcacccgtg cgacacgcca ggccagacga tgtgctctgg agatgactgc    720

ggtggcacat actctaacga tcgctacgcg ggaacctgcg atcctgacgg ctgtgacttc    780

aacccttacc gcatgggcaa cacttctttc tacgggcctg gcaagatcat cgataccacc    840

aagcccttca ctgtcgtgac gcagttcctc actgatgatg gtacggatac tggaactctc    900

agcgagatca agcgcttcta catccagaac agcaacgtca ttccgcagcc caactcggac    960

atcagtggcg tgaccggcaa ctcgatcacg acggagttct gcactgctca gaagcaggcc   1020

tttggcgaca cggacgactt ctctcagcac ggtggcctgg ccaagatggg agcggccatg   1080

cagcagggta tggtcctggt gatgagtttg tgggacgact acgccgcgca gatgctgtgg   1140

ttggattccg actacccgac ggatgcggac cccacgaccc ctggtattgc ccgtggaacg   1200

tgtccgacgg actcgggcgt cccatcggat gtcgagtcgc agagccccaa ctcctacgtg   1260

acctactcga acattaagtt tggtccgatc ggtagcacag gtaatccttc aggtggtaat   1320

cctccaggtg gaaacagagg aacaacgaca actagaagac cagctactac aactggttca   1380

agtccaggtc caactcaatc acactacggt caatgtggtg gtataggtta ctctggtccc   1440

actgtttgtg cttctggtac tacttgccaa gttctgaacc cttactactc acagtgtcta   1500

taatgataa                                                           1509


<210>  2
<211>  500
<212>  PRT
<213>  Artificial

<220>
<223>  Mature Sequence of Talaromyces emersonii CBHI / Trichoderma 
       reesei -CBD (mature CBH-a)

<400>  2

Gln Gln Ala Gly Thr Ala Thr Ala Glu Asn His Pro Pro Leu Thr Trp 
1               5                   10                  15      


Gln Glu Cys Thr Ala Pro Gly Ser Cys Thr Thr Gln Asn Gly Ala Val 
            20                  25                  30          


Val Leu Asp Ala Asn Trp Arg Trp Val His Asp Val Asn Gly Tyr Thr 
        35                  40                  45              


Asn Cys Tyr Thr Gly Asn Thr Trp Asp Pro Thr Tyr Cys Pro Asp Asp 
    50                  55                  60                  


Glu Thr Cys Ala Gln Asn Cys Ala Leu Asp Gly Ala Asp Tyr Glu Gly 
65                  70                  75                  80  


Thr Tyr Gly Val Thr Ser Ser Gly Ser Ser Leu Lys Leu Asn Phe Val 
                85                  90                  95      


Thr Gly Ser Asn Val Gly Ser Arg Leu Tyr Leu Leu Gln Asp Asp Ser 
            100                 105                 110         


Thr Tyr Gln Ile Phe Lys Leu Leu Asn Arg Glu Phe Ser Phe Asp Val 
        115                 120                 125             


Asp Val Ser Asn Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val 
    130                 135                 140                 


Ala Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Asn Asn Lys Ala 
145                 150                 155                 160 


Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp 
                165                 170                 175     


Leu Lys Phe Ile Asp Gly Glu Ala Asn Val Glu Gly Trp Gln Pro Ser 
            180                 185                 190         


Ser Asn Asn Ala Asn Thr Gly Ile Gly Asp His Gly Ser Cys Cys Ala 
        195                 200                 205             


Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Asn Ala Val Thr Pro 
    210                 215                 220                 


His Pro Cys Asp Thr Pro Gly Gln Thr Met Cys Ser Gly Asp Asp Cys 
225                 230                 235                 240 


Gly Gly Thr Tyr Ser Asn Asp Arg Tyr Ala Gly Thr Cys Asp Pro Asp 
                245                 250                 255     


Gly Cys Asp Phe Asn Pro Tyr Arg Met Gly Asn Thr Ser Phe Tyr Gly 
            260                 265                 270         


Pro Gly Lys Ile Ile Asp Thr Thr Lys Pro Phe Thr Val Val Thr Gln 
        275                 280                 285             


Phe Leu Thr Asp Asp Gly Thr Asp Thr Gly Thr Leu Ser Glu Ile Lys 
    290                 295                 300                 


Arg Phe Tyr Ile Gln Asn Ser Asn Val Ile Pro Gln Pro Asn Ser Asp 
305                 310                 315                 320 


Ile Ser Gly Val Thr Gly Asn Ser Ile Thr Thr Glu Phe Cys Thr Ala 
                325                 330                 335     


Gln Lys Gln Ala Phe Gly Asp Thr Asp Asp Phe Ser Gln His Gly Gly 
            340                 345                 350         


Leu Ala Lys Met Gly Ala Ala Met Gln Gln Gly Met Val Leu Val Met 
        355                 360                 365             


Ser Leu Trp Asp Asp Tyr Ala Ala Gln Met Leu Trp Leu Asp Ser Asp 
    370                 375                 380                 


Tyr Pro Thr Asp Ala Asp Pro Thr Thr Pro Gly Ile Ala Arg Gly Thr 
385                 390                 395                 400 


Cys Pro Thr Asp Ser Gly Val Pro Ser Asp Val Glu Ser Gln Ser Pro 
                405                 410                 415     


Asn Ser Tyr Val Thr Tyr Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser 
            420                 425                 430         


Thr Gly Asn Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Arg Gly Thr 
        435                 440                 445             


Thr Thr Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro 
    450                 455                 460                 


Thr Gln Ser His Tyr Gly Gln Cys Gly Gly Ile Gly Tyr Ser Gly Pro 
465                 470                 475                 480 


Thr Val Cys Ala Ser Gly Thr Thr Cys Gln Val Leu Asn Pro Tyr Tyr 
                485                 490                 495     


Ser Gln Cys Leu 
            500 


<210>  3
<211>  1581
<212>  DNA
<213>  Artificial

<220>
<223>  Coding sequence of the fusion of CBH-a with Trichoderma reesei 
       CBHI Signal peptide

<400>  3
atgtatcgga agttggccgt catctcggcc ttcttggcca cagctcgtgc tcagcaggcc     60

ggcacggcga cggcagagaa ccacccgccc ctgacatggc aggaatgcac cgcccctggg    120

agctgcacca cccagaacgg ggcggtcgtt cttgatgcga actggcgttg ggtgcacgat    180

gtgaacggat acaccaactg ctacacgggc aatacctggg accccacgta ctgccctgac    240

gacgaaacct gcgcccagaa ctgtgcgctg gacggcgcgg attacgaggg cacctacggc    300

gtgacttcgt cgggcagctc cttgaaactc aatttcgtca ccgggtcgaa cgtcggatcc    360

cgtctctacc tgctgcagga cgactcgacc tatcagatct tcaagctcct gaaccgcgag    420

ttcagctttg acgtcgatgt ctccaatctt ccgtgcggat tgaacggcgc tctgtacttt    480

gtcgccatgg acgccgacgg cggcgtgtcc aagtacccga acaacaaggc tggtgccaag    540

tacggaaccg ggtattgcga ctcccaatgc ccacgggacc tcaagttcat cgacggcgag    600

gccaacgtcg agggctggca gccgtcttcg aacaacgcca acaccggaat tggcgaccac    660

ggctcctgct gtgcggagat ggatgtctgg gaagcaaaca gcatctccaa tgcggtcact    720

ccgcacccgt gcgacacgcc aggccagacg atgtgctctg gagatgactg cggtggcaca    780

tactctaacg atcgctacgc gggaacctgc gatcctgacg gctgtgactt caacccttac    840

cgcatgggca acacttcttt ctacgggcct ggcaagatca tcgataccac caagcccttc    900

actgtcgtga cgcagttcct cactgatgat ggtacggata ctggaactct cagcgagatc    960

aagcgcttct acatccagaa cagcaacgtc attccgcagc ccaactcgga catcagtggc   1020

gtgaccggca actcgatcac gacggagttc tgcactgctc agaagcaggc ctttggcgac   1080

acggacgact tctctcagca cggtggcctg gccaagatgg gagcggccat gcagcagggt   1140

atggtcctgg tgatgagttt gtgggacgac tacgccgcgc agatgctgtg gttggattcc   1200

gactacccga cggatgcgga ccccacgacc cctggtattg cccgtggaac gtgtccgacg   1260

gactcgggcg tcccatcgga tgtcgagtcg cagagcccca actcctacgt gacctactcg   1320

aacattaagt ttggtccgat cggtagcaca ggtaatcctt caggtggtaa tcctccaggt   1380

ggaaacagag gaacaacgac aactagaaga ccagctacta caactggttc aagtccaggt   1440

ccaactcaat cacactacgg tcaatgtggt ggtataggtt actctggtcc cactgtttgt   1500

gcttctggta ctacttgcca agttctgaac ccttactact cacagtgtct agcttctgca   1560

catcatcacc accaccatta a                                             1581


<210>  4
<211>  70
<212>  PRT
<213>  Artificial

<220>
<223>  Trichoderma reesei CBHI cellulose binding domain and linker 
       sequence

<400>  4

Gly Ser Thr Gly Asn Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Arg 
1               5                   10                  15      


Gly Thr Thr Thr Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro 
            20                  25                  30          


Gly Pro Thr Gln Ser His Tyr Gly Gln Cys Gly Gly Ile Gly Tyr Ser 
        35                  40                  45              


Gly Pro Thr Val Cys Ala Ser Gly Thr Thr Cys Gln Val Leu Asn Pro 
    50                  55                  60                  


Tyr Tyr Ser Gln Cys Leu 
65                  70  


<210>  5
<211>  437
<212>  PRT
<213>  Artificial

<220>
<223>  Talaromyces emersonii CBHI sequence (CBH-b)

<400>  5

Gln Gln Ala Gly Thr Ala Thr Ala Glu Asn His Pro Pro Leu Thr Trp 
1               5                   10                  15      


Gln Glu Cys Thr Ala Pro Gly Ser Cys Thr Thr Gln Asn Gly Ala Val 
            20                  25                  30          


Val Leu Asp Ala Asn Trp Arg Trp Val His Asp Val Asn Gly Tyr Thr 
        35                  40                  45              


Asn Cys Tyr Thr Gly Asn Thr Trp Asp Pro Thr Tyr Cys Pro Asp Asp 
    50                  55                  60                  


Glu Thr Cys Ala Gln Asn Cys Ala Leu Asp Gly Ala Asp Tyr Glu Gly 
65                  70                  75                  80  


Thr Tyr Gly Val Thr Ser Ser Gly Ser Ser Leu Lys Leu Asn Phe Val 
                85                  90                  95      


Thr Gly Ser Asn Val Gly Ser Arg Leu Tyr Leu Leu Gln Asp Asp Ser 
            100                 105                 110         


Thr Tyr Gln Ile Phe Lys Leu Leu Asn Arg Glu Phe Ser Phe Asp Val 
        115                 120                 125             


Asp Val Ser Asn Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val 
    130                 135                 140                 


Ala Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Asn Asn Lys Ala 
145                 150                 155                 160 


Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp 
                165                 170                 175     


Leu Lys Phe Ile Asp Gly Glu Ala Asn Val Glu Gly Trp Gln Pro Ser 
            180                 185                 190         


Ser Asn Asn Ala Asn Thr Gly Ile Gly Asp His Gly Ser Cys Cys Ala 
        195                 200                 205             


Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Asn Ala Val Thr Pro 
    210                 215                 220                 


His Pro Cys Asp Thr Pro Gly Gln Thr Met Cys Ser Gly Asp Asp Cys 
225                 230                 235                 240 


Gly Gly Thr Tyr Ser Asn Asp Arg Tyr Ala Gly Thr Cys Asp Pro Asp 
                245                 250                 255     


Gly Cys Asp Phe Asn Pro Tyr Arg Met Gly Asn Thr Ser Phe Tyr Gly 
            260                 265                 270         


Pro Gly Lys Ile Ile Asp Thr Thr Lys Pro Phe Thr Val Val Thr Gln 
        275                 280                 285             


Phe Leu Thr Asp Asp Gly Thr Asp Thr Gly Thr Leu Ser Glu Ile Lys 
    290                 295                 300                 


Arg Phe Tyr Ile Gln Asn Ser Asn Val Ile Pro Gln Pro Asn Ser Asp 
305                 310                 315                 320 


Ile Ser Gly Val Thr Gly Asn Ser Ile Thr Thr Glu Phe Cys Thr Ala 
                325                 330                 335     


Gln Lys Gln Ala Phe Gly Asp Thr Asp Asp Phe Ser Gln His Gly Gly 
            340                 345                 350         


Leu Ala Lys Met Gly Ala Ala Met Gln Gln Gly Met Val Leu Val Met 
        355                 360                 365             


Ser Leu Trp Asp Asp Tyr Ala Ala Gln Met Leu Trp Leu Asp Ser Asp 
    370                 375                 380                 


Tyr Pro Thr Asp Ala Asp Pro Thr Thr Pro Gly Ile Ala Arg Gly Thr 
385                 390                 395                 400 


Cys Pro Thr Asp Ser Gly Val Pro Ser Asp Val Glu Ser Gln Ser Pro 
                405                 410                 415     


Asn Ser Tyr Val Thr Tyr Ser Asn Ile Lys Phe Gly Pro Ile Asn Ser 
            420                 425                 430         


Thr Phe Thr Ala Ser 
        435         


<210>  6
<211>  1590
<212>  DNA
<213>  Artificial

<220>
<223>  Coding sequence of Talaromyces emersonii CBHI fused to the alpha 
       factor signal peptide

<400>  6
atgagatttc cttcaatttt tactgcagtt ttattcgcag catcctccgc attagctgct     60

ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt    120

tacttagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat    180

aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta    240

tctttggata aacgtgaggc ggaagcaccc tctcagcagg ccggcacggc gacggcagag    300

aaccacccgc ccctgacatg gcaggaatgc accgcccctg ggagctgcac cacccagaac    360

ggggcggtcg ttcttgatgc gaactggcgt tgggtgcacg atgtgaacgg atacaccaac    420

tgctacacgg gcaatacctg ggaccccacg tactgccctg acgacgaaac ctgcgcccag    480

aactgtgcgc tggacggcgc ggattacgag ggcacctacg gcgtgacttc gtcgggcagc    540

tccttgaaac tcaatttcgt caccgggtcg aacgtcggat cccgtctcta cctgctgcag    600

gacgactcga cctatcagat cttcaagctt ctgaaccgcg agttcagctt tgacgtcgat    660

gtctccaatc ttccgtgcgg attgaacggc gctctgtact ttgtcgccat ggacgccgac    720

ggcggcgtgt ccaagtaccc gaacaacaag gctggtgcca agtacggaac cgggtattgc    780

gactcccaat gcccacggga cctcaagttc atcgacggcg aggccaacgt cgagggctgg    840

cagccgtctt cgaacaacgc caacaccgga attggcgacc acggctcctg ctgtgcggag    900

atggatgtct gggaagcaaa cagcatctcc aatgcggtca ctccgcaccc gtgcgacacg    960

ccaggccaga cgatgtgctc tggagatgac tgcggtggca catactctaa cgatcgctac   1020

gcgggaacct gcgatcctga cggctgtgac ttcaaccctt accgcatggg caacacttct   1080

ttctacgggc ctggcaagat catcgatacc accaagccct tcactgtcgt gacgcagttc   1140

ctcactgatg atggtacgga tactggaact ctcagcgaga tcaagcgctt ctacatccag   1200

aacagcaacg tcattccgca gcccaactcg gacatcagtg gcgtgaccgg caactcgatc   1260

acgacggagt tctgcactgc tcagaagcag gcctttggcg acacggacga cttctctcag   1320

cacggtggcc tggccaagat gggagcggcc atgcagcagg gtatggtcct ggtgatgagt   1380

ttgtgggacg actacgccgc gcagatgctg tggttggatt ccgactaccc gacggatgcg   1440

gaccccacga cccctggtat tgcccgtgga acgtgtccga cggactcggg cgtcccatcg   1500

gatgtcgagt cgcagagccc caactcctac gtgacctact cgaacattaa gtttggtccg   1560

atcaactcga ccttcaccgc ttcgtgataa                                    1590


<210>  7
<211>  429
<212>  PRT
<213>  Artificial

<220>
<223>  Humicola grisea CBHI (CBH-d)

<400>  7

Gln Gln Ala Gly Thr Ile Thr Ala Glu Asn His Pro Arg Met Thr Trp 
1               5                   10                  15      


Lys Arg Cys Ser Gly Pro Gly Asn Cys Gln Thr Val Gln Gly Glu Val 
            20                  25                  30          


Val Ile Asp Ala Asn Trp Arg Trp Leu His Asn Asn Gly Gln Asn Cys 
        35                  40                  45              


Tyr Glu Gly Asn Lys Trp Thr Ser Gln Cys Ser Ser Ala Thr Asp Cys 
    50                  55                  60                  


Ala Gln Arg Cys Ala Leu Asp Gly Ala Asn Tyr Gln Ser Thr Tyr Gly 
65                  70                  75                  80  


Ala Ser Thr Ser Gly Asp Ser Leu Thr Leu Lys Phe Val Thr Lys His 
                85                  90                  95      


Glu Tyr Gly Thr Asn Ile Gly Ser Arg Phe Tyr Leu Met Ala Asn Gln 
            100                 105                 110         


Asn Lys Tyr Gln Met Phe Thr Leu Met Asn Asn Glu Phe Ala Phe Asp 
        115                 120                 125             


Val Asp Leu Ser Lys Val Glu Cys Gly Ile Asn Ser Ala Leu Tyr Phe 
    130                 135                 140                 


Val Ala Met Glu Glu Asp Gly Gly Met Ala Ser Tyr Pro Ser Asn Arg 
145                 150                 155                 160 


Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ala Gln Cys Ala Arg 
                165                 170                 175     


Asp Leu Lys Phe Ile Gly Gly Lys Ala Asn Ile Glu Gly Trp Arg Pro 
            180                 185                 190         


Ser Thr Asn Asp Pro Asn Ala Gly Val Gly Pro Met Gly Ala Cys Cys 
        195                 200                 205             


Ala Glu Ile Asp Val Trp Glu Ser Asn Ala Tyr Ala Tyr Ala Phe Thr 
    210                 215                 220                 


Pro His Ala Cys Gly Ser Lys Asn Arg Tyr His Ile Cys Glu Thr Asn 
225                 230                 235                 240 


Asn Cys Gly Gly Thr Tyr Ser Asp Asp Arg Phe Ala Gly Tyr Cys Asp 
                245                 250                 255     


Ala Asn Gly Cys Asp Tyr Asn Pro Tyr Arg Met Gly Asn Lys Asp Phe 
            260                 265                 270         


Tyr Gly Lys Gly Lys Thr Val Asp Thr Asn Arg Lys Phe Thr Val Val 
        275                 280                 285             


Ser Arg Phe Glu Arg Asn Arg Leu Ser Gln Phe Phe Val Gln Asp Gly 
    290                 295                 300                 


Arg Lys Ile Glu Val Pro Pro Pro Thr Trp Pro Gly Leu Pro Asn Ser 
305                 310                 315                 320 


Ala Asp Ile Thr Pro Glu Leu Cys Asp Ala Gln Phe Arg Val Phe Asp 
                325                 330                 335     


Asp Arg Asn Arg Phe Ala Glu Thr Gly Gly Phe Asp Ala Leu Asn Glu 
            340                 345                 350         


Ala Leu Thr Ile Pro Met Val Leu Val Met Ser Ile Trp Asp Asp His 
        355                 360                 365             


His Ser Asn Met Leu Trp Leu Asp Ser Ser Tyr Pro Pro Glu Lys Ala 
    370                 375                 380                 


Gly Leu Pro Gly Gly Asp Arg Gly Pro Cys Pro Thr Thr Ser Gly Val 
385                 390                 395                 400 


Pro Ala Glu Val Glu Ala Gln Tyr Pro Asp Ala Gln Val Val Trp Ser 
                405                 410                 415     


Asn Ile Arg Phe Gly Pro Ile Gly Ser Thr Val Asn Val 
            420                 425                 


<210>  8
<211>  1563
<212>  DNA
<213>  Artificial

<220>
<223>  Coding sequence of Humicola grisea CBHI fused to the alpha factor
       signal peptide

<400>  8
atgagatttc cttcaatttt tactgcagtt ttattcgcag catcctccgc attagctgct     60

ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt    120

tacttagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat    180

aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta    240

tctttggata aacgtgaggc ggaagcaccc tctcagcagg ctggtactat tactgctgag    300

aaccacccaa gaatgacctg gaagagatgc tctggtccag gaaactgtca gactgttcag    360

ggcgaggttg tgattgacgc taattggaga tggttgcaca acaacggcca gaactgttac    420

gagggtaaca agtggacctc tcagtgttct tctgctaccg actgtgctca gagatgtgct    480

ttggacggtg ccaactacca gtctacctac ggtgcttcta cctctggtga ctctctgacc    540

ctgaagttcg ttaccaagca cgagtacgga accaacatcg gctctagatt ctacctgatg    600

gccaaccaga acaagtacca gatgttcacc ctgatgaaca acgagttcgc ctttgacgtt    660

gacctgtcta aggtggagtg cggtatcaac tctgccctgt acttcgttgc tatggaagag    720

gacggtggaa tggcttctta cccatctaac agagccggtg ctaagtacgg tactggttac    780

tgtgacgccc agtgtgctag agacctgaag ttcatcggtg gaaaggccaa cattgagggt    840

tggagaccat ctaccaacga cccaaacgct ggtgttggtc caatgggagc ttgttgtgcc    900

gagattgatg tgtgggagtc taacgcttac gcctacgctt ttaccccaca cgcttgcggt    960

tctaagaaca gataccacat ctgcgagacc aacaactgtg gtggaaccta ctctgacgac   1020

agattcgctg gatactgcga cgctaacggt tgtgactaca acccatacag aatgggcaac   1080

aaggacttct acggcaaggg aaagaccgtt gacaccaaca gaaagttcac cgtggtgtcg   1140

agattcgaga gaaacagact gtcgcagttc tttgtgcagg acggcagaaa gattgaggtc   1200

ccaccaccaa cttggccagg attgccaaac tctgccgaca ttaccccaga gttgtgtgac   1260

gctcagttca gagtgttcga cgacagaaac agatttgctg agaccggtgg ttttgacgct   1320

ttgaacgagg ctctgaccat tccaatggtg ctggtgatgt ctatttggga cgaccaccac   1380

tctaacatgt tgtggctgga ctcttcttac ccaccagaga aggctggatt gccaggtggt   1440

gacagaggac catgtccaac tacttcgggt gttccagctg aggttgaggc tcagtaccca   1500

gacgctcagg ttgtgtggtc gaacatcaga ttcggcccaa tcggttctac cgtgaacgtg   1560

taa                                                                 1563


<210>  9
<211>  439
<212>  PRT
<213>  Artificial

<220>
<223>  Thermoascus auratiacus CBHI (CBH-e)

<400>  9

His Glu Ala Gly Thr Val Thr Ala Glu Asn His Pro Ser Leu Thr Trp 
1               5                   10                  15      


Gln Gln Cys Ser Ser Gly Gly Ser Cys Thr Thr Gln Asn Gly Lys Val 
            20                  25                  30          


Val Ile Asp Ala Asn Trp Arg Trp Val His Thr Thr Ser Gly Tyr Thr 
        35                  40                  45              


Asn Cys Tyr Thr Gly Asn Thr Trp Asp Thr Ser Ile Cys Pro Asp Asp 
    50                  55                  60                  


Val Thr Cys Ala Gln Asn Cys Ala Leu Asp Gly Ala Asp Tyr Ser Gly 
65                  70                  75                  80  


Thr Tyr Gly Val Thr Thr Ser Gly Asn Ala Leu Arg Leu Asn Phe Val 
                85                  90                  95      


Thr Gln Ser Ser Gly Lys Asn Ile Gly Ser Arg Leu Tyr Leu Leu Gln 
            100                 105                 110         


Asp Asp Thr Thr Tyr Gln Ile Phe Lys Leu Leu Gly Gln Glu Phe Thr 
        115                 120                 125             


Phe Asp Val Asp Val Ser Asn Leu Pro Cys Gly Leu Asn Gly Ala Leu 
    130                 135                 140                 


Tyr Phe Val Ala Met Asp Ala Asp Gly Asn Leu Ser Lys Tyr Pro Gly 
145                 150                 155                 160 


Asn Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys 
                165                 170                 175     


Pro Arg Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Glu Gly Trp 
            180                 185                 190         


Gln Pro Ser Ala Asn Asp Pro Asn Ala Gly Val Gly Asn His Gly Ser 
        195                 200                 205             


Ser Cys Ala Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Thr Ala 
    210                 215                 220                 


Val Thr Pro His Pro Cys Asp Thr Pro Gly Gln Thr Met Cys Gln Gly 
225                 230                 235                 240 


Asp Asp Cys Gly Gly Thr Tyr Ser Ser Thr Arg Tyr Ala Gly Thr Cys 
                245                 250                 255     


Asp Thr Asp Gly Cys Asp Phe Asn Pro Tyr Gln Pro Gly Asn His Ser 
            260                 265                 270         


Phe Tyr Gly Pro Gly Lys Ile Val Asp Thr Ser Ser Lys Phe Thr Val 
        275                 280                 285             


Val Thr Gln Phe Ile Thr Asp Asp Gly Thr Pro Ser Gly Thr Leu Thr 
    290                 295                 300                 


Glu Ile Lys Arg Phe Tyr Val Gln Asn Gly Lys Val Ile Pro Gln Ser 
305                 310                 315                 320 


Glu Ser Thr Ile Ser Gly Val Thr Gly Asn Ser Ile Thr Thr Glu Tyr 
                325                 330                 335     


Cys Thr Ala Gln Lys Ala Ala Phe Asp Asn Thr Gly Phe Phe Thr His 
            340                 345                 350         


Gly Gly Leu Gln Lys Ile Ser Gln Ala Leu Ala Gln Gly Met Val Leu 
        355                 360                 365             


Val Met Ser Leu Trp Asp Asp His Ala Ala Asn Met Leu Trp Leu Asp 
    370                 375                 380                 


Ser Thr Tyr Pro Thr Asp Ala Asp Pro Asp Thr Pro Gly Val Ala Arg 
385                 390                 395                 400 


Gly Thr Cys Pro Thr Thr Ser Gly Val Pro Ala Asp Val Glu Ser Gln 
                405                 410                 415     


Asn Pro Asn Ser Tyr Val Ile Tyr Ser Asn Ile Lys Val Gly Pro Ile 
            420                 425                 430         


Asn Ser Thr Phe Thr Ala Asn 
        435                 


<210>  10
<211>  1593
<212>  DNA
<213>  Artificial

<220>
<223>  Coding sequence of Thermoascus auratiacus CBHI fused to the alpha
       factor signal peptide

<400>  10
atgagatttc cttcaatttt tactgcagtt ttattcgcag catcctccgc attagctgct     60

ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt    120

tacttagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat    180

aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta    240

tctttggata aacgtgaggc ggaagcaccc tctcacgagg ccggtaccgt aaccgcagag    300

aatcaccctt ccctgacctg gcagcaatgc tccagcggcg gtagttgtac cacgcagaat    360

ggaaaagtcg ttatcgatgc gaactggcgt tgggtccata ccacctctgg atacaccaac    420

tgctacacgg gcaatacgtg ggacaccagt atctgtcccg acgacgtgac ctgcgctcag    480

aattgtgcct tggatggagc ggattacagt ggcacctatg gtgttacgac cagtggcaac    540

gccctgagac tgaactttgt cacccaaagc tcagggaaga acattggctc gcgcctgtac    600

ctgctgcagg acgacaccac ttatcagatc ttcaagctgc tgggtcagga gtttaccttc    660

gatgtcgacg tctccaatct cccttgcggg ctgaacggcg ccctctactt tgtggccatg    720

gacgccgacg gcaatttgtc caaataccct ggcaacaagg caggcgctaa gtatggcact    780

ggttactgcg actctcagtg ccctcgggat ctcaagttca tcaacggtca ggccaacgtt    840

gaaggctggc agccgtctgc caacgaccca aatgccggcg ttggtaacca cggttcctcg    900

tgcgctgaga tggatgtctg ggaagccaac agcatctcta ctgcggtgac gcctcaccca    960

tgcgacaccc ccggccagac catgtgccag ggagacgact gtggtggaac ctactcctcc   1020

actcgatatg ctggtacctg cgacactgat ggctgcgact tcaatcctta ccagccaggc   1080

aaccactcgt tctacggccc cgggaagatc gtcgacacta gctccaaatt caccgtcgtc   1140

acccagttca tcaccgacga cgggacaccc tccggcaccc tgacggagat caaacgcttc   1200

tacgtccaga acggcaaggt gatcccccag tcggagtcga cgatcagcgg cgtcaccggc   1260

aactcaatca ccaccgagta ttgcacggcc cagaaggcag ccttcgacaa caccggcttc   1320

ttcacgcacg gcgggcttca gaagatcagt caggctctgg ctcagggcat ggtcctcgtc   1380

atgagcctgt gggacgatca cgccgccaac atgctctggc tggacagcac ctacccgact   1440

gatgcggacc cggacacccc tggcgtcgcg cgcggtacct gccccacgac ctccggcgtc   1500

ccggccgacg tggagtcgca gaaccccaat tcatatgtta tctactccaa catcaaggtc   1560

ggacccatca actcgacctt caccgccaac taa                                1593


<210>  11
<211>  1794
<212>  DNA
<213>  Artificial

<220>
<223>  Coding sequence for Trichoderma reesei CBHI (CBH-c), including 
       the alpha factor signal peptide and a 6x His Tag

<400>  11
atgagatttc cttcaatttt tactgcagtt ttattcgcag catcctccgc attagctgct     60

ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt    120

tacttagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat    180

aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta    240

tctttggata aacgtgaggc ggaagcaccc tcttcagctt gtacactgca atccgagact    300

catccacctt taacgtggca aaagtgtagt tctggcggaa cttgtactca acagactggt    360

agtgtcgtga tagatgctaa ctggagatgg acacatgcaa cgaactcctc aactaactgc    420

tacgatggta acacctggtc ttctacattg tgtcctgaca acgaaacctg cgctaagaac    480

tgttgtcttg atggagcagc ttacgcaagt acatatggtg tgactacctc tggtaacagc    540

ctttccattg gttttgtaac ccagtcggct cagaagaatg ttggtgctag attgtacctg    600

atggcttcag acaccacata ccaggagttt accttgttgg gaaacgagtt ctctttcgac    660

gtagatgtgt ctcagctacc atgtggattg aatggagcct tgtactttgt ctcaatggat    720

gcagacggag gtgtttcaaa gtaccctact aacacagctg gtgctaagta tggaactgga    780

tactgcgatt ctcaatgccc aagagacctg aagttcatca acggacaagc taacgttgaa    840

ggttgggaac cttctagcaa caacgcaaac actggaattg gtggtcatgg ttcttgctgt    900

tcagagatgg acatttggga agccaactcc atcagtgaag ctttgactcc acatccatgc    960

acaactgttg ggcaagaaat ttgcgaaggt gatggttgtg gtggcactta ctctgataac   1020

agatacggcg gaacatgtga tccagatgga tgtgattgga acccatacag actgggtaac   1080

acttcgtttt acggaccagg ttcttccttc actctagaca ctacgaagaa gttgactgtg   1140

gtcacccaat ttgagacttc tggtgccatt aaccgatact acgtgcagaa cggagttact   1200

ttccaacagc caaacgctga attgggtagt tactcaggca acgagcttaa cgatgactac   1260

tgcactgctg aagaagcaga atttggtgga tcttcctttt cggataaggg tggattgacg   1320

cagttcaaga aagctacctc tggtggaatg gttctagtca tgagtctgtg ggacgattac   1380

tacgctaaca tgctttggct ggactctact taccctacaa acgagacatc ttctactcct   1440

ggtgctgtaa gaggtagctg ttctacatct tctggagttc cagcccaagt tgagagtcaa   1500

agtccaaatg ccaaggtcac cttctccaac atcaagttcg gaccaattgg tagcacaggt   1560

aatccttcag gtggtaatcc tccaggtgga aacagaggaa caacgacaac tagaagacca   1620

gctactacaa ctggttcaag tccaggtcca actcaatcac actacggtca atgtggtggt   1680

ataggttact ctggtcccac tgtttgtgct tctggtacta cttgccaagt tctgaaccct   1740

tactactcac agtgtctagc ttctgcacac catcatcatc atcattaatg ataa         1794


<210>  12
<211>  496
<212>  PRT
<213>  Artificial

<220>
<223>  Trichoderma reesei CBHI (CBH-c)

<400>  12

Gln Ser Ala Cys Thr Leu Gln Ser Glu Thr His Pro Pro Leu Thr Trp 
1               5                   10                  15      


Gln Lys Cys Ser Ser Gly Gly Thr Cys Thr Gln Gln Thr Gly Ser Val 
            20                  25                  30          


Val Ile Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser Thr 
        35                  40                  45              


Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp Asn 
    50                  55                  60                  


Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala Ser 
65                  70                  75                  80  


Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser Ile Gly Phe Val 
                85                  90                  95      


Thr Gln Ser Ala Gln Lys Asn Val Gly Ala Arg Leu Tyr Leu Met Ala 
            100                 105                 110         


Ser Asp Thr Thr Tyr Gln Glu Phe Thr Leu Leu Gly Asn Glu Phe Ser 
        115                 120                 125             


Phe Asp Val Asp Val Ser Gln Leu Pro Cys Gly Leu Asn Gly Ala Leu 
    130                 135                 140                 


Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Thr 
145                 150                 155                 160 


Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys 
                165                 170                 175     


Pro Arg Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Glu Gly Trp 
            180                 185                 190         


Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly Ile Gly Gly His Gly Ser 
        195                 200                 205             


Cys Cys Ser Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Glu Ala 
    210                 215                 220                 


Leu Thr Pro His Pro Cys Thr Thr Val Gly Gln Glu Ile Cys Glu Gly 
225                 230                 235                 240 


Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr Cys 
                245                 250                 255     


Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr Ser 
            260                 265                 270         


Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys Leu 
        275                 280                 285             


Thr Val Val Thr Gln Phe Glu Thr Ser Gly Ala Ile Asn Arg Tyr Tyr 
    290                 295                 300                 


Val Gln Asn Gly Val Thr Phe Gln Gln Pro Asn Ala Glu Leu Gly Ser 
305                 310                 315                 320 


Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu Ala 
                325                 330                 335     


Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gln Phe 
            340                 345                 350         


Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp Asp 
        355                 360                 365             


Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr Asn 
    370                 375                 380                 


Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr Ser 
385                 390                 395                 400 


Ser Gly Val Pro Ala Gln Val Glu Ser Gln Ser Pro Asn Ala Lys Val 
                405                 410                 415     


Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser Thr Gly Asn Pro 
            420                 425                 430         


Ser Gly Gly Asn Pro Pro Gly Gly Asn Arg Gly Thr Thr Thr Thr Arg 
        435                 440                 445             


Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gln Ser His 
    450                 455                 460                 


Tyr Gly Gln Cys Gly Gly Ile Gly Tyr Ser Gly Pro Thr Val Cys Ala 
465                 470                 475                 480 


Ser Gly Thr Thr Cys Gln Val Leu Asn Pro Tyr Tyr Ser Gln Cys Leu 
                485                 490                 495     


<210>  13
<211>  1767
<212>  DNA
<213>  Artificial

<220>
<223>  Coding sequence for Trichoderma viride CBHI, including the alpha 
       factor signal peptide

<400>  13
atgagatttc cttcaatttt tactgcagtt ttattcgcag catcctccgc attagctgct     60

ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt    120

tacttagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat    180

aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta    240

tctttggata aacgtgaggc ggaagcaccc tctcaatctg cttgcacctt gcagtctgaa    300

actcacccac cattgacctg gcagaagtgt tcttctggcg gtacttgtac tcagcagacc    360

ggttctgttg ttatcgacgc caactggaga tggactcacg ctaccaactc ttctaccaac    420

tgctacgacg gtaacacttg gtcgtctacc ttgtgtccag acaacgagac ctgtgccaag    480

aactgttgtt tggacggtgc tgcttacgct tctacctacg gtgttaccac ctctggtaac    540

tcgctgtcta tcggtttcgt tacccagtct gcccagaaaa atgttggtgc cagactgtac    600

ttgatggctt ctgacaccac ctaccaagag tttaccctgc tgggtaacga gttctctttc    660

gacgtggacg tttctcaact gccatgtgga ctgaacggtg ccctgtactt cgtttctatg    720

gacgctgacg gtggtgtttc taagtaccca accaacaccg ctggtgctaa atacggaacc    780

ggttactgcg attctcagtg cccaagagac ctgaagttca tcaacggaca ggctaacgtt    840

gaaggatggg agccatcttc taacaacgcc aacaccggta ttggtggtca cggttcttgc    900

tgttctgaga tggacatctg ggaggccaac tctatttctg aggctttgac cccacaccca    960

tgtactactg tgggtcaaga gatctgtgag ggtgatggtt gtggtggtac ttactcggac   1020

aacagatacg gtggtacttg tgacccagac ggttgtgatt gggacccata cagactgggt   1080

aacacctctt tctacggtcc aggatcttct tttaccctgg acaccaccaa gaagttgacc   1140

gttgttaccc agtttgagac ctctggtgcc atcaacagat actacgtgca gaacggtgtt   1200

actttccagc agccaaacgc tgaactggga tcttactctg gtaacggact gaacgacgac   1260

tactgtactg ctgaggaagc tgagttcggt ggttcttctt tctctgacaa gggtggactg   1320

acccagttta agaaggctac ctctggcgga atggtgctgg ttatgtcttt gtgggacgac   1380

tactacgcta acatgctgtg gcttgactct acctacccaa ctaacgagac ctcttctacc   1440

ccaggtgctg ttagaggatc ttgctctacc tcttctggtg ttccagctca ggttgagtct   1500

cagtctccaa acgccaaggt gaccttctct aacatcaagt tcggtccaat cggttctact   1560

ggtgacccat ctggtggtaa cccaccaggt ggaaacccac ctggtactac cactaccaga   1620

agaccagcta ccaccactgg ttcttctcca ggtccaaccc aatctcacta cggtcagtgt   1680

ggtggtattg gttactctgg tccaaccgtt tgtgcttctg gaaccacctg tcaggttctg   1740

aacccatact actcgcagtg cctgtaa                                       1767


<210>  14
<211>  497
<212>  PRT
<213>  Artificial

<220>
<223>  Trichoderma viride CBHI (CBH-f)

<400>  14

Gln Ser Ala Cys Thr Leu Gln Ser Glu Thr His Pro Pro Leu Thr Trp 
1               5                   10                  15      


Gln Lys Cys Ser Ser Gly Gly Thr Cys Thr Gln Gln Thr Gly Ser Val 
            20                  25                  30          


Val Ile Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser Thr 
        35                  40                  45              


Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp Asn 
    50                  55                  60                  


Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala Ser 
65                  70                  75                  80  


Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser Ile Gly Phe Val 
                85                  90                  95      


Thr Gln Ser Ala Gln Lys Asn Val Gly Ala Arg Leu Tyr Leu Met Ala 
            100                 105                 110         


Ser Asp Thr Thr Tyr Gln Glu Phe Thr Leu Leu Gly Asn Glu Phe Ser 
        115                 120                 125             


Phe Asp Val Asp Val Ser Gln Leu Pro Cys Gly Leu Asn Gly Ala Leu 
    130                 135                 140                 


Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Thr 
145                 150                 155                 160 


Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys 
                165                 170                 175     


Pro Arg Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Glu Gly Trp 
            180                 185                 190         


Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly Ile Gly Gly His Gly Ser 
        195                 200                 205             


Cys Cys Ser Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Glu Ala 
    210                 215                 220                 


Leu Thr Pro His Pro Cys Thr Thr Val Gly Gln Glu Ile Cys Glu Gly 
225                 230                 235                 240 


Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr Cys 
                245                 250                 255     


Asp Pro Asp Gly Cys Asp Trp Asp Pro Tyr Arg Leu Gly Asn Thr Ser 
            260                 265                 270         


Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys Leu 
        275                 280                 285             


Thr Val Val Thr Gln Phe Glu Thr Ser Gly Ala Ile Asn Arg Tyr Tyr 
    290                 295                 300                 


Val Gln Asn Gly Val Thr Phe Gln Gln Pro Asn Ala Glu Leu Gly Ser 
305                 310                 315                 320 


Tyr Ser Gly Asn Gly Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu Ala 
                325                 330                 335     


Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gln Phe 
            340                 345                 350         


Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp Asp 
        355                 360                 365             


Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr Asn 
    370                 375                 380                 


Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr Ser 
385                 390                 395                 400 


Ser Gly Val Pro Ala Gln Val Glu Ser Gln Ser Pro Asn Ala Lys Val 
                405                 410                 415     


Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser Thr Gly Asp Pro 
            420                 425                 430         


Ser Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr Thr 
        435                 440                 445             


Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gln Ser 
    450                 455                 460                 


His Tyr Gly Gln Cys Gly Gly Ile Gly Tyr Ser Gly Pro Thr Val Cys 
465                 470                 475                 480 


Ala Ser Gly Thr Thr Cys Gln Val Leu Asn Pro Tyr Tyr Ser Gln Cys 
                485                 490                 495     


Leu 
    


<210>  15
<211>  1785
<212>  DNA
<213>  Artificial

<220>
<223>  Coding Sequence for Humicola grisea CBHI- Trichoderma reesei CBHI
       cellulose binding domain fusion protein including the alpha 
       factor signal peptide and a 6x His Tag

<400>  15
atgagatttc cttcaatttt tactgcagtt ttattcgcag catcctccgc attagctgct     60

ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt    120

tacttagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat    180

aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta    240

tctttggata aacgtgaggc ggaagcatgc tcgcagcagg ctggtacaat tactgctgag    300

aaccatccaa gaatgacgtg gaagagatgt agtggtccag gaaactgtca gactgttcag    360

ggtgaggtcg tgatagatgc taactggaga tggttgcata acaacggcca gaactgctac    420

gagggtaaca agtggacctc tcagtgttct tctgctaccg actgcgctca gagatgtgct    480

cttgatggag caaactacca gagtacatat ggtgcttcta cctctggtga cagccttacc    540

ctgaagtttg taaccaagca cgagtacgga accaatatcg gttctagatt ctacctgatg    600

gctaaccaga acaagtacca gatgtttacc ttgatgaaca acgagttcgc cttcgacgta    660

gatctgtcta aggtggagtg tggaatcaat tctgccttgt actttgtcgc tatggaagag    720

gacggaggta tggcttctta cccttctaac agagctggtg ctaagtatgg aactggatac    780

tgcgatgccc aatgcgctag agacctgaag ttcatcggtg gaaaggctaa cattgaaggt    840

tggagacctt ctaccaacga cccaaacgct ggagttggtc caatgggtgc ttgctgtgcc    900

gagattgacg tgtgggaatc taacgcttac gcctacgctt ttactccaca tgcttgcggt    960

tctaagaaca gataccacat ttgcgaaacc aacaactgtg gtggcactta ctctgatgac   1020

agattcgctg gatactgtga tgctaacgga tgtgattaca acccatacag aatgggtaac   1080

aaggactttt acggaaaggg taagactgtt gacactaaca gaaagttcac tgtggtctcg   1140

agatttgaga gaaacagact gtcgcagttc tttgtgcagg acggaagaaa gattgaggtc   1200

ccaccaccaa cttggccagg attgccaaac tctgccgaca ttaccccaga gttgtgcgac   1260

gctcagttca gagtgtttga cgacagaaac agatttgctg agaccggtgg atttgacgct   1320

ttgaacgagg ctctgaccat tccaatggtt ctagtcatga gtatttggga cgatcaccac   1380

tctaacatgc tttggctgga ctcttcttac cctccagaga aggctggatt gcctggtggt   1440

gacagaggtc catgtccaac aacttctgga gttccagccg aggttgaggc tcaataccca   1500

gacgcccagg tcgtgtggtc caacatcaga ttcggaccaa ttggaagctt aacaggtaat   1560

ccttcaggtg gtaatcctcc aggtggaaac agaggaacaa cgacaactag aagaccagct   1620

actacaactg gttcaagtcc aggtccaact caatcacact acggtcaatg tggtggtata   1680

ggttactctg gtcccactgt ttgtgcttct ggtactactt gccaagttct gaacccttac   1740

tactcacagt gtctagcttc tgcacaccat catcatcatc attaa                   1785


<210>  16
<211>  503
<212>  PRT
<213>  Artificial

<220>
<223>  Humicola grisea CBHI- Trichoderma reesei CBHI cellulose binding 
       domain fusion protein including a 6x His Tag (CBH-g)

<400>  16

Gln Gln Ala Gly Thr Ile Thr Ala Glu Asn His Pro Arg Met Thr Trp 
1               5                   10                  15      


Lys Arg Cys Ser Gly Pro Gly Asn Cys Gln Thr Val Gln Gly Glu Val 
            20                  25                  30          


Val Ile Asp Ala Asn Trp Arg Trp Leu His Asn Asn Gly Gln Asn Cys 
        35                  40                  45              


Tyr Glu Gly Asn Lys Trp Thr Ser Gln Cys Ser Ser Ala Thr Asp Cys 
    50                  55                  60                  


Ala Gln Arg Cys Ala Leu Asp Gly Ala Asn Tyr Gln Ser Thr Tyr Gly 
65                  70                  75                  80  


Ala Ser Thr Ser Gly Asp Ser Leu Thr Leu Lys Phe Val Thr Lys His 
                85                  90                  95      


Glu Tyr Gly Thr Asn Ile Gly Ser Arg Phe Tyr Leu Met Ala Asn Gln 
            100                 105                 110         


Asn Lys Tyr Gln Met Phe Thr Leu Met Asn Asn Glu Phe Ala Phe Asp 
        115                 120                 125             


Val Asp Leu Ser Lys Val Glu Cys Gly Ile Asn Ser Ala Leu Tyr Phe 
    130                 135                 140                 


Val Ala Met Glu Glu Asp Gly Gly Met Ala Ser Tyr Pro Ser Asn Arg 
145                 150                 155                 160 


Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ala Gln Cys Ala Arg 
                165                 170                 175     


Asp Leu Lys Phe Ile Gly Gly Lys Ala Asn Ile Glu Gly Trp Arg Pro 
            180                 185                 190         


Ser Thr Asn Asp Pro Asn Ala Gly Val Gly Pro Met Gly Ala Cys Cys 
        195                 200                 205             


Ala Glu Ile Asp Val Trp Glu Ser Asn Ala Tyr Ala Tyr Ala Phe Thr 
    210                 215                 220                 


Pro His Ala Cys Gly Ser Lys Asn Arg Tyr His Ile Cys Glu Thr Asn 
225                 230                 235                 240 


Asn Cys Gly Gly Thr Tyr Ser Asp Asp Arg Phe Ala Gly Tyr Cys Asp 
                245                 250                 255     


Ala Asn Gly Cys Asp Tyr Asn Pro Tyr Arg Met Gly Asn Lys Asp Phe 
            260                 265                 270         


Tyr Gly Lys Gly Lys Thr Val Asp Thr Asn Arg Lys Phe Thr Val Val 
        275                 280                 285             


Ser Arg Phe Glu Arg Asn Arg Leu Ser Gln Phe Phe Val Gln Asp Gly 
    290                 295                 300                 


Arg Lys Ile Glu Val Pro Pro Pro Thr Trp Pro Gly Leu Pro Asn Ser 
305                 310                 315                 320 


Ala Asp Ile Thr Pro Glu Leu Cys Asp Ala Gln Phe Arg Val Phe Asp 
                325                 330                 335     


Asp Arg Asn Arg Phe Ala Glu Thr Gly Gly Phe Asp Ala Leu Asn Glu 
            340                 345                 350         


Ala Leu Thr Ile Pro Met Val Leu Val Met Ser Ile Trp Asp Asp His 
        355                 360                 365             


His Ser Asn Met Leu Trp Leu Asp Ser Ser Tyr Pro Pro Glu Lys Ala 
    370                 375                 380                 


Gly Leu Pro Gly Gly Asp Arg Gly Pro Cys Pro Thr Thr Ser Gly Val 
385                 390                 395                 400 


Pro Ala Glu Val Glu Ala Gln Tyr Pro Asp Ala Gln Val Val Trp Ser 
                405                 410                 415     


Asn Ile Arg Phe Gly Pro Ile Gly Ser Leu Thr Gly Asn Pro Ser Gly 
            420                 425                 430         


Gly Asn Pro Pro Gly Gly Asn Arg Gly Thr Thr Thr Thr Arg Arg Pro 
        435                 440                 445             


Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gln Ser His Tyr Gly 
    450                 455                 460                 


Gln Cys Gly Gly Ile Gly Tyr Ser Gly Pro Thr Val Cys Ala Ser Gly 
465                 470                 475                 480 


Thr Thr Cys Gln Val Leu Asn Pro Tyr Tyr Ser Gln Cys Leu Ala Ser 
                485                 490                 495     


Ala His His His His His His 
            500             


<210>  17
<211>  1809
<212>  DNA
<213>  Artificial

<220>
<223>  Coding sequence for Talaromyces emersonii CBHI / Trichoderma 
       reesei -CBD fusion including the alpha factor signal peptide and 
       a 6x His Tag

<400>  17
atgagatttc cttcaatttt tactgcagtt ttattcgcag catcctccgc attagctgct     60

ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt    120

tacttagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat    180

aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta    240

tctttggata aacgtgaggc ggaagcatgc tcgcagcagg ccggcacggc gacggcagag    300

aaccacccgc ccctgacatg gcaggaatgc accgcccctg ggagctgcac cacccagaac    360

ggggcggtcg ttcttgatgc gaactggcgt tgggtgcacg atgtgaacgg atacaccaac    420

tgctacacgg gcaatacctg ggaccccacg tactgccctg acgacgaaac ctgcgcccag    480

aactgtgcgc tggacggcgc ggattacgag ggcacctacg gcgtgacttc gtcgggcagc    540

tccttgaaac tcaatttcgt caccgggtcg aacgtcggat cccgtctcta cctgctgcag    600

gacgactcga cctatcagat cttcaagctc ctgaaccgcg agttcagctt tgacgtcgat    660

gtctccaatc ttccgtgcgg attgaacggc gctctgtact ttgtcgccat ggacgccgac    720

ggcggcgtgt ccaagtaccc gaacaacaag gctggtgcca agtacggaac cgggtattgc    780

gactcccaat gcccacggga cctcaagttc atcgacggcg aggccaacgt cgagggctgg    840

cagccgtctt cgaacaacgc caacaccgga attggcgacc acggctcctg ctgtgcggag    900

atggatgtct gggaagcaaa cagcatctcc aatgcggtca ctccgcaccc gtgcgacacg    960

ccaggccaga cgatgtgctc tggagatgac tgcggtggca catactctaa cgatcgctac   1020

gcgggaacct gcgatcctga cggctgtgac ttcaaccctt accgcatggg caacacttct   1080

ttctacgggc ctggcaagat catcgatacc accaagccct tcactgtcgt gacgcagttc   1140

ctcactgatg atggtacgga tactggaact ctcagcgaga tcaagcgctt ctacatccag   1200

aacagcaacg tcattccgca gcccaactcg gacatcagtg gcgtgaccgg caactcgatc   1260

acgacggagt tctgcactgc tcagaagcag gcctttggcg acacggacga cttctctcag   1320

cacggtggcc tggccaagat gggagcggcc atgcagcagg gtatggtcct ggtgatgagt   1380

ttgtgggacg actacgccgc gcagatgctg tggttggatt ccgactaccc gacggatgcg   1440

gaccccacga cccctggtat tgcccgtgga acgtgtccga cggactcggg cgtcccatcg   1500

gatgtcgagt cgcagagccc caactcctac gtgacctact cgaacattaa gtttggtccg   1560

atcggtagca caggtaatcc ttcaggtggt aatcctccag gtggaaacag aggaacaacg   1620

acaactagaa gaccagctac tacaactggt tcaagtccag gtccaactca atcacactac   1680

ggtcaatgtg gtggtatagg ttactctggt cccactgttt gtgcttctgg tactacttgc   1740

caagttctga acccttacta ctcacagtgt ctagcttctg cacatcatca ccaccaccat   1800

taatgataa                                                           1809


<210>  18
<211>  509
<212>  PRT
<213>  Artificial

<220>
<223>  Mature Sequence of Talaromyces emersonii CBHI / Trichoderma 
       reesei -CBD fusion with 6x-His tag (CBH-ah)

<400>  18

Gln Gln Ala Gly Thr Ala Thr Ala Glu Asn His Pro Pro Leu Thr Trp 
1               5                   10                  15      


Gln Glu Cys Thr Ala Pro Gly Ser Cys Thr Thr Gln Asn Gly Ala Val 
            20                  25                  30          


Val Leu Asp Ala Asn Trp Arg Trp Val His Asp Val Asn Gly Tyr Thr 
        35                  40                  45              


Asn Cys Tyr Thr Gly Asn Thr Trp Asp Pro Thr Tyr Cys Pro Asp Asp 
    50                  55                  60                  


Glu Thr Cys Ala Gln Asn Cys Ala Leu Asp Gly Ala Asp Tyr Glu Gly 
65                  70                  75                  80  


Thr Tyr Gly Val Thr Ser Ser Gly Ser Ser Leu Lys Leu Asn Phe Val 
                85                  90                  95      


Thr Gly Ser Asn Val Gly Ser Arg Leu Tyr Leu Leu Gln Asp Asp Ser 
            100                 105                 110         


Thr Tyr Gln Ile Phe Lys Leu Leu Asn Arg Glu Phe Ser Phe Asp Val 
        115                 120                 125             


Asp Val Ser Asn Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val 
    130                 135                 140                 


Ala Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Asn Asn Lys Ala 
145                 150                 155                 160 


Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp 
                165                 170                 175     


Leu Lys Phe Ile Asp Gly Glu Ala Asn Val Glu Gly Trp Gln Pro Ser 
            180                 185                 190         


Ser Asn Asn Ala Asn Thr Gly Ile Gly Asp His Gly Ser Cys Cys Ala 
        195                 200                 205             


Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Asn Ala Val Thr Pro 
    210                 215                 220                 


His Pro Cys Asp Thr Pro Gly Gln Thr Met Cys Ser Gly Asp Asp Cys 
225                 230                 235                 240 


Gly Gly Thr Tyr Ser Asn Asp Arg Tyr Ala Gly Thr Cys Asp Pro Asp 
                245                 250                 255     


Gly Cys Asp Phe Asn Pro Tyr Arg Met Gly Asn Thr Ser Phe Tyr Gly 
            260                 265                 270         


Pro Gly Lys Ile Ile Asp Thr Thr Lys Pro Phe Thr Val Val Thr Gln 
        275                 280                 285             


Phe Leu Thr Asp Asp Gly Thr Asp Thr Gly Thr Leu Ser Glu Ile Lys 
    290                 295                 300                 


Arg Phe Tyr Ile Gln Asn Ser Asn Val Ile Pro Gln Pro Asn Ser Asp 
305                 310                 315                 320 


Ile Ser Gly Val Thr Gly Asn Ser Ile Thr Thr Glu Phe Cys Thr Ala 
                325                 330                 335     


Gln Lys Gln Ala Phe Gly Asp Thr Asp Asp Phe Ser Gln His Gly Gly 
            340                 345                 350         


Leu Ala Lys Met Gly Ala Ala Met Gln Gln Gly Met Val Leu Val Met 
        355                 360                 365             


Ser Leu Trp Asp Asp Tyr Ala Ala Gln Met Leu Trp Leu Asp Ser Asp 
    370                 375                 380                 


Tyr Pro Thr Asp Ala Asp Pro Thr Thr Pro Gly Ile Ala Arg Gly Thr 
385                 390                 395                 400 


Cys Pro Thr Asp Ser Gly Val Pro Ser Asp Val Glu Ser Gln Ser Pro 
                405                 410                 415     


Asn Ser Tyr Val Thr Tyr Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser 
            420                 425                 430         


Thr Gly Asn Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Arg Gly Thr 
        435                 440                 445             


Thr Thr Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro 
    450                 455                 460                 


Thr Gln Ser His Tyr Gly Gln Cys Gly Gly Ile Gly Tyr Ser Gly Pro 
465                 470                 475                 480 


Thr Val Cys Ala Ser Gly Thr Thr Cys Gln Val Leu Asn Pro Tyr Tyr 
                485                 490                 495     


Ser Gln Cys Leu Ala Ser Ala His His His His His His 
            500                 505                 


<210>  19
<211>  1335
<212>  DNA
<213>  Artificial

<220>
<223>  Alternative coding sequence of Humicola grisea CBHI with signal 
       sequence

<400>  19
atggccagcg atctggcaca gcaggctggt acaattactg ctgagaacca tccaagaatg     60

acgtggaaga gatgtagtgg tccaggaaac tgtcagactg ttcagggtga ggtcgtgata    120

gatgctaact ggagatggtt gcataacaac ggccagaact gctacgaggg taacaagtgg    180

acctctcagt gttcttctgc taccgactgc gctcagagat gtgctcttga tggagcaaac    240

taccagagta catatggtgc ttctacctct ggtgacagcc ttaccctgaa gtttgtaacc    300

aagcacgagt acggaaccaa tatcggttct agattctacc tgatggctaa ccagaacaag    360

taccagatgt ttaccttgat gaacaacgag ttcgccttcg acgtagatct gtctaaggtg    420

gagtgtggaa tcaattctgc cttgtacttt gtcgctatgg aagaggacgg aggtatggct    480

tcttaccctt ctaacagagc tggtgctaag tatggaactg gatactgcga tgcccaatgc    540

gctagagacc tgaagttcat cggtggaaag gctaacattg aaggttggag accttctacc    600

aacgacccaa acgctggagt tggtccaatg ggtgcttgct gtgccgagat tgacgtgtgg    660

gaatctaacg cttacgccta cgcttttact ccacatgctt gcggttctaa gaacagatac    720

cacatttgcg aaaccaacaa ctgtggtggc acttactctg atgacagatt cgctggatac    780

tgtgatgcta acggatgtga ttacaaccca tacagaatgg gtaacaagga cttttacgga    840

aagggtaaga ctgttgacac taacagaaag ttcactgtgg tctcgagatt tgagagaaac    900

agactgtcgc agttctttgt gcaggacgga agaaagattg aggtcccacc accaacttgg    960

ccaggattgc caaactctgc cgacattacc ccagagttgt gcgacgctca gttcagagtg   1020

tttgacgaca gaaacagatt tgctgagacc ggtggatttg acgctttgaa cgaggctctg   1080

accattccaa tggttctagt catgagtatt tgggacgatc accactctaa catgctttgg   1140

ctggactctt cttaccctcc agagaaggct ggattgcctg gtggtgacag aggtccatgt   1200

ccaacaactt ctggagttcc agccgaggtt gaggctcaat acccagacgc ccaggtcgtg   1260

tggtccaaca tcagattcgg accaattggt agcacagtga atgtggcttc tgcacaccat   1320

catcatcatc attga                                                    1335


<210>  20
<211>  41
<212>  DNA
<213>  Artificial

<220>
<223>  Primer forward

<400>  20
gaggcggaag caccctctca atctgcttgc accttgcagt c                         41


<210>  21
<211>  38
<212>  DNA
<213>  Artificial

<220>
<223>  Primer reverse

<400>  21
ggagacgcag agcccttatt acaggcactg cgagtagt                             38


<210>  22
<211>  41
<212>  DNA
<213>  Artificial

<220>
<223>  Primer forward

<400>  22
gaggcggaag caccctctca gcaggctggt actattactg c                         41


<210>  23
<211>  44
<212>  DNA
<213>  Artificial

<220>
<223>  Primer reverse

<400>  23
ggagacgcag agcccttaca cgttcacggt agaaccgatt gggc                      44


<210>  24
<211>  41
<212>  DNA
<213>  Artificial

<220>
<223>  Primer forward

<400>  24
gaggcggaag caccctctca cgaggccggt accgtaaccg c                         41


<210>  25
<211>  41
<212>  DNA
<213>  Artificial

<220>
<223>  Primer reverse

<400>  25
ggagacgcag agcccttatt agttggcggt gaaggtcgag t                         41


<210>  26
<211>  41
<212>  DNA
<213>  Artificial

<220>
<223>  Primer forward

<400>  26
gaggcggaag caccctctca gcaggccggc acggcgacgg c                         41


<210>  27
<211>  41
<212>  DNA
<213>  Artificial

<220>
<223>  Primer reverse

<400>  27
ggagacgcag agcccttatc acgaagcggt gaaggtcgag t                         41


<210>  28
<211>  41
<212>  DNA
<213>  Artificial

<220>
<223>  Primer forward

<400>  28
gaggcggaag caccctctca gcaggccggc acggcgacgg c                         41


<210>  29
<211>  38
<212>  DNA
<213>  Artificial

<220>
<223>  Primer reverse

<400>  29
attacctgtg ctaccgatcg gaccaaactt aatgttcg                             38


<210>  30
<211>  38
<212>  DNA
<213>  Artificial

<220>
<223>  Primer forward

<400>  30
aagtttggtc cgatcggtag cacaggtaat ccttcagg                             38


<210>  31
<211>  44
<212>  DNA
<213>  Artificial

<220>
<223>  Primer reverse

<400>  31
ggagacgcag agcccttatt atagacactg tgagtagtaa gggt                      44


<210>  32
<211>  41
<212>  DNA
<213>  Artificial

<220>
<223>  Primer forward

<400>  32
gaggcggaag caccctctca gcaggccggc acggcgacgg c                         41


<210>  33
<211>  44
<212>  DNA
<213>  Artificial

<220>
<223>  Primer reverse

<400>  33
ggagacgcag agcccttatc attaatggtg gtggtgatga tgag                      44


<210>  34
<211>  40
<212>  DNA
<213>  Artificial

<220>
<223>  Primer forward

<400>  34
aggcggaagc atgctcgcag caggctggta caattactgc                           40


<210>  35
<211>  41
<212>  DNA
<213>  Artificial

<220>
<223>  Primer reverse

<400>  35
ggattacctg ttaagcttcc aattggtccg aatctgatgt t                         41


<210>  36
<211>  42
<212>  DNA
<213>  Artificial

<220>
<223>  Primer forward

<400>  36
accaattgga agcttaacag gtaatccttc aggtggtaat cc                        42


<210>  37
<211>  46
<212>  DNA
<213>  Artificial

<220>
<223>  Primer reverse

<400>  37
atcttgcagg tcgacttatc attaatgatg atgatgatgg tgtgca                    46


<210>  38
<211>  40
<212>  DNA
<213>  Artificial

<220>
<223>  Primer forward

<400>  38
aggcggaagc atgctcgcag caggctggta caattactgc                           40


<210>  39
<211>  46
<212>  DNA
<213>  Artificial

<220>
<223>  Primer reverse

<400>  39
atcttgcagg tcgacttatc attaatgatg atgatgatgg tgtgca                    46


<210>  40
<211>  21
<212>  DNA
<213>  Artificial

<220>
<223>  Oligonucleotide alpha-f

<400>  40
tactattgcc agcattgctg c                                               21


<210>  41
<211>  23
<212>  DNA
<213>  Artificial

<220>
<223>  Oligonucleotide oli740

<400>  41
tcagctattt cacatacaaa tcg                                             23


<210>  42
<211>  521
<212>  PRT
<213>  artificial

<220>
<223>  Talaromyces emersonii CBHI Mutant with Chaetomium thermophilum 
       cellobiohydrolase I CBD with 6x His-TAG

<400>  42

Leu Gln Ala Cys Thr Ala Thr Ala Glu Asn His Pro Pro Leu Thr Trp 
1               5                   10                  15      


Gln Glu Cys Thr Ala Pro Gly Ser Cys Thr Thr Arg Asn Gly Ala Val 
            20                  25                  30          


Val Leu Asp Ala Asn Trp Arg Trp Val His Asp Val Asn Gly Tyr Thr 
        35                  40                  45              


Asn Cys Tyr Thr Gly Asn Thr Trp Asp Pro Thr Tyr Cys Pro Asp Asp 
    50                  55                  60                  


Val Thr Cys Ala Gln Asn Cys Cys Leu Asp Gly Ala Asp Tyr Glu Gly 
65                  70                  75                  80  


Thr Tyr Gly Val Thr Ser Ser Gly Ser Ser Leu Lys Leu Asn Phe Val 
                85                  90                  95      


Thr Gly Ser Asn Val Gly Ser Arg Leu Tyr Leu Leu Gln Asp Asp Ser 
            100                 105                 110         


Thr Tyr Gln Ile Phe Lys Leu Leu Asn Arg Glu Phe Ser Phe Asp Val 
        115                 120                 125             


Asp Val Ser Asn Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val 
    130                 135                 140                 


Ala Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Asn Asn Lys Ala 
145                 150                 155                 160 


Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp 
                165                 170                 175     


Leu Lys Phe Ile Asn Gly Met Ala Asn Val Glu Gly Trp Gln Pro Ser 
            180                 185                 190         


Ser Asn Asn Ala Asn Thr Gly Ile Gly Asp His Gly Ser Cys Cys Ala 
        195                 200                 205             


Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Asn Ala Val Thr Leu 
    210                 215                 220                 


His Pro Cys Asp Thr Pro Gly Gln Thr Met Cys Ser Gly Asp Asp Cys 
225                 230                 235                 240 


Gly Gly Thr Tyr Ser Asn Asp Arg Tyr Ala Gly Thr Cys Asp Pro Asp 
                245                 250                 255     


Gly Cys Asp Phe Asn Pro Tyr Arg Met Gly Asn Thr Ser Phe Tyr Gly 
            260                 265                 270         


Pro Gly Lys Ile Ile Asp Thr Thr Lys Pro Phe Thr Val Val Thr Gln 
        275                 280                 285             


Phe Leu Thr Asp Asp Gly Thr Asp Thr Gly Thr Leu Ser Glu Ile Lys 
    290                 295                 300                 


Arg Phe Tyr Ile Gln Asn Gly Asn Val Ile Pro Gln Pro Asn Ser Ile 
305                 310                 315                 320 


Ile Ser Gly Val Thr Gly Asn Ser Ile Thr Thr Glu Phe Cys Thr Ala 
                325                 330                 335     


Gln Lys Gln Ala Phe Gly Asp Thr Asp Glu Phe Ser Lys His Gly Gly 
            340                 345                 350         


Leu Ala Lys Met Gly Ala Ala Met Gln Gln Gly Met Val Leu Val Met 
        355                 360                 365             


Ser Leu Trp Asp Asp Tyr Ala Ala Gln Met Leu Trp Leu Asp Ser Asp 
    370                 375                 380                 


Tyr Pro Thr Asp Ala Asp Pro Thr Val Pro Gly Ile Ala Arg Gly Thr 
385                 390                 395                 400 


Cys Pro Thr Asp Ser Gly Val Pro Ser Asp Val Glu Ser Gln Ser Pro 
                405                 410                 415     


Asn Ser Tyr Val Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Asn Ser 
            420                 425                 430         


Thr Val Pro Gly Leu Asp Gly Ser Thr Pro Ser Asn Pro Thr Ala Thr 
        435                 440                 445             


Val Ala Pro Pro Thr Ser Thr Thr Thr Ser Val Arg Ser Ser Thr Thr 
    450                 455                 460                 


Gln Ile Ser Thr Pro Thr Ser Gln Pro Gly Gly Cys Thr Thr Gln Lys 
465                 470                 475                 480 


Trp Gly Gln Cys Gly Gly Ile Gly Tyr Thr Gly Cys Thr Asn Cys Val 
                485                 490                 495     


Ala Gly Thr Thr Cys Thr Glu Leu Asn Pro Trp Tyr Ser Gln Cys Leu 
            500                 505                 510         


Ala Ser Ala His His His His His His 
        515                 520     


<210>  43
<211>  1563
<212>  DNA
<213>  artificial

<220>
<223>  Coding Sequence for Talaromyces emersonii CBHI Mutant with 
       Chaetmium thermophilum cellobiohydrolase I CBD with 6x His-TAG

<400>  43
ctgcaggcct gcacggcgac ggcagagaac cacccgcccc tgacatggca ggaatgcacc     60

gcccctggga gctgcaccac caggaacggg gcggtcgttc ttgatgcgaa ctggcgttgg    120

gtgcacgatg tgaacggata caccaactgc tacacgggca atacctggga ccccacgtac    180

tgccctgacg acgtaacctg cgcccagaac tgttgcctgg acggcgcgga ttacgagggc    240

acctacggcg tgacttcgtc gggcagctcc ttgaaactca atttcgtcac cgggtcgaac    300

gtcggatccc gtctctacct gctgcaggac gactcgacct atcagatctt caagctcctg    360

aaccgcgagt tcagctttga cgtcgatgtc tccaatcttc cgtgcggatt gaacggcgct    420

ctgtactttg tcgccatgga cgccgacggc ggcgtgtcca agtacccgaa caacaaggct    480

ggtgccaagt acggaaccgg gtattgcgac tcccaatgcc cacgggacct caagttcatc    540

aacggcatgg ccaacgtcga gggctggcag ccgtcatcga acaacgccaa caccggaatt    600

ggcgaccacg gctcctgctg tgcggagatg gatgtctggg aagcaaacag catctccaat    660

gcggtcactc tgcacccgtg cgacacgcca ggccagacga tgtgctctgg agatgactgc    720

ggtggcacat actctaacga tcgctacgcg ggaacctgcg atcctgacgg ctgtgacttc    780

aacccttacc gcatgggcaa cacttctttc tacgggcctg gcaagatcat cgataccacc    840

aagcccttca ctgtcgtgac gcagttcctc actgatgatg gtacggatac tggaactctc    900

agcgagatca agcgcttcta catccagaac ggcaacgtca ttccgcagcc caactcgatc    960

atcagtggcg tgaccggcaa ctcgatcacg acggagttct gcactgctca gaagcaggcc   1020

tttggcgaca cggacgaatt ctctaagcac ggtggcctgg ccaagatggg agcggccatg   1080

cagcagggta tggtcctggt gatgagtttg tgggacgact acgccgcgca gatgctgtgg   1140

ttggattccg actacccgac ggatgcggac cccacggtcc ctggtattgc ccgtggaacg   1200

tgtccgacgg actcgggcgt cccatcggat gtcgagtcgc agagccccaa ctcctacgtg   1260

accttctcga acattaagtt tggtccgatc aactcgaccg tccctggcct cgacggcagc   1320

acccccagca acccgaccgc caccgttgct cctcccactt ctaccaccac cagcgtgaga   1380

agcagcacta ctcagatttc caccccgact agccagcccg gcggctgcac cacccagaag   1440

tggggccagt gcggtggtat cggctacacc ggctgcacta actgcgttgc tggcactacc   1500

tgcactgagc tcaacccctg gtacagccag tgcctggctt ctgctcatca tcaccatcac   1560

cac                                                                 1563


<210>  44
<211>  514
<212>  PRT
<213>  artificial

<220>
<223>  Talaromyces emersonii CBHI Mutant with Phanerochaete 
       chrysosporium cellobiohydrolase CBD with 6x His-TAG

<400>  44

Leu Gln Ala Cys Thr Ala Thr Ala Glu Asn His Pro Pro Leu Thr Trp 
1               5                   10                  15      


Gln Glu Cys Thr Ala Pro Gly Ser Cys Thr Thr Arg Asn Gly Ala Val 
            20                  25                  30          


Val Leu Asp Ala Asn Trp Arg Trp Val His Asp Val Asn Gly Tyr Thr 
        35                  40                  45              


Asn Cys Tyr Thr Gly Asn Thr Trp Asp Pro Thr Tyr Cys Pro Asp Asp 
    50                  55                  60                  


Val Thr Cys Ala Gln Asn Cys Cys Leu Asp Gly Ala Asp Tyr Glu Gly 
65                  70                  75                  80  


Thr Tyr Gly Val Thr Ser Ser Gly Ser Ser Leu Lys Leu Asn Phe Val 
                85                  90                  95      


Thr Gly Ser Asn Val Gly Ser Arg Leu Tyr Leu Leu Gln Asp Asp Ser 
            100                 105                 110         


Thr Tyr Gln Ile Phe Lys Leu Leu Asn Arg Glu Phe Ser Phe Asp Val 
        115                 120                 125             


Asp Val Ser Asn Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val 
    130                 135                 140                 


Ala Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Asn Asn Lys Ala 
145                 150                 155                 160 


Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp 
                165                 170                 175     


Leu Lys Phe Ile Asn Gly Met Ala Asn Val Glu Gly Trp Gln Pro Ser 
            180                 185                 190         


Ser Asn Asn Ala Asn Thr Gly Ile Gly Asp His Gly Ser Cys Cys Ala 
        195                 200                 205             


Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Asn Ala Val Thr Leu 
    210                 215                 220                 


His Pro Cys Asp Thr Pro Gly Gln Thr Met Cys Ser Gly Asp Asp Cys 
225                 230                 235                 240 


Gly Gly Thr Tyr Ser Asn Asp Arg Tyr Ala Gly Thr Cys Asp Pro Asp 
                245                 250                 255     


Gly Cys Asp Phe Asn Pro Tyr Arg Met Gly Asn Thr Ser Phe Tyr Gly 
            260                 265                 270         


Pro Gly Lys Ile Ile Asp Thr Thr Lys Pro Phe Thr Val Val Thr Gln 
        275                 280                 285             


Phe Leu Thr Asp Asp Gly Thr Asp Thr Gly Thr Leu Ser Glu Ile Lys 
    290                 295                 300                 


Arg Phe Tyr Ile Gln Asn Gly Asn Val Ile Pro Gln Pro Asn Ser Ile 
305                 310                 315                 320 


Ile Ser Gly Val Thr Gly Asn Ser Ile Thr Thr Glu Phe Cys Thr Ala 
                325                 330                 335     


Gln Lys Gln Ala Phe Gly Asp Thr Asp Glu Phe Ser Lys His Gly Gly 
            340                 345                 350         


Leu Ala Lys Met Gly Ala Ala Met Gln Gln Gly Met Val Leu Val Met 
        355                 360                 365             


Ser Leu Trp Asp Asp Tyr Ala Ala Gln Met Leu Trp Leu Asp Ser Asp 
    370                 375                 380                 


Tyr Pro Thr Asp Ala Asp Pro Thr Val Pro Gly Ile Ala Arg Gly Thr 
385                 390                 395                 400 


Cys Pro Thr Asp Ser Gly Val Pro Ser Asp Val Glu Ser Gln Ser Pro 
                405                 410                 415     


Asn Ser Tyr Val Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Asn Ser 
            420                 425                 430         


Thr Tyr Thr Gly Thr Val Ser Ser Ser Ser Val Ser Ser Ser His Ser 
        435                 440                 445             


Ser Thr Ser Thr Ser Ser Ser His Ser Ser Ser Ser Thr Pro Pro Thr 
    450                 455                 460                 


Gln Pro Thr Gly Val Thr Val Pro Gln Trp Gly Gln Cys Gly Gly Ile 
465                 470                 475                 480 


Gly Tyr Thr Gly Ser Thr Thr Cys Ala Ser Pro Tyr Thr Cys His Val 
                485                 490                 495     


Leu Asn Pro Tyr Tyr Ser Gln Cys Tyr Ala Ser Ala His His His His 
            500                 505                 510         


His His 
        


<210>  45
<211>  1545
<212>  DNA
<213>  artificial

<220>
<223>  Coding Sequence for Talaromyces emersonii CBHI Mutant with 
       Phanerochaete chrysosporium cellobiohydrolase CBD with 6x His-TAG

<400>  45
ctgcaggcct gcacggcgac ggcagagaac cacccgcccc tgacatggca ggaatgcacc     60

gcccctggga gctgcaccac caggaacggg gcggtcgttc ttgatgcgaa ctggcgttgg    120

gtgcacgatg tgaacggata caccaactgc tacacgggca atacctggga ccccacgtac    180

tgccctgacg acgtaacctg cgcccagaac tgttgcctgg acggcgcgga ttacgagggc    240

acctacggcg tgacttcgtc gggcagctcc ttgaaactca atttcgtcac cgggtcgaac    300

gtcggatccc gtctctacct gctgcaggac gactcgacct atcagatctt caagctcctg    360

aaccgcgagt tcagctttga cgtcgatgtc tccaatcttc cgtgcggatt gaacggcgct    420

ctgtactttg tcgccatgga cgccgacggc ggcgtgtcca agtacccgaa caacaaggct    480

ggtgccaagt acggaaccgg gtattgcgac tcccaatgcc cacgggacct caagttcatc    540

aacggcatgg ccaacgtcga gggctggcag ccgtcatcga acaacgccaa caccggaatt    600

ggcgaccacg gctcctgctg tgcggagatg gatgtctggg aagcaaacag catctccaat    660

gcggtcactc tgcacccgtg cgacacgcca ggccagacga tgtgctctgg agatgactgc    720

ggtggcacat actctaacga tcgctacgcg ggaacctgcg atcctgacgg ctgtgacttc    780

aacccttacc gcatgggcaa cacttctttc tacgggcctg gcaagatcat cgataccacc    840

aagcccttca ctgtcgtgac gcagttcctc actgatgatg gtacggatac tggaactctc    900

agcgagatca agcgcttcta catccagaac ggcaacgtca ttccgcagcc caactcgatc    960

atcagtggcg tgaccggcaa ctcgatcacg acggagttct gcactgctca gaagcaggcc   1020

tttggcgaca cggacgaatt ctctaagcac ggtggcctgg ccaagatggg agcggccatg   1080

cagcagggta tggtcctggt gatgagtttg tgggacgact acgccgcgca gatgctgtgg   1140

ttggattccg actacccgac ggatgcggac cccacggtcc ctggtattgc ccgtggaacg   1200

tgtccgacgg actcgggcgt cccatcggat gtcgagtcgc agagccccaa ctcctacgtg   1260

accttctcga acattaagtt tggtccgatc aactcgacct acactggaac tgtttcttca   1320

tcctccgttt catcttctca ctcctccact tctacttcat cttcccattc ctcatcttcc   1380

actccaccaa ctcaaccaac tggtgttact gttccacaat ggggacaatg tggtggtatt   1440

ggttacactg gttccactac ttgtgcttcc ccatacactt gtcacgtttt gaacccatac   1500

tactcccaat gttacgcttc tgctcatcat caccatcacc actaa                   1545


<210>  46
<211>  522
<212>  PRT
<213>  artificial

<220>
<223>  Talaromyces emersonii CBHI Mutant with Penicillium janthinellum 
       cellobiohydrolase CBD with 6x His-TAG

<400>  46

Leu Gln Ala Cys Thr Ala Thr Ala Glu Asn His Pro Pro Leu Thr Trp 
1               5                   10                  15      


Gln Glu Cys Thr Ala Pro Gly Ser Cys Thr Thr Arg Asn Gly Ala Val 
            20                  25                  30          


Val Leu Asp Ala Asn Trp Arg Trp Val His Asp Val Asn Gly Tyr Thr 
        35                  40                  45              


Asn Cys Tyr Thr Gly Asn Thr Trp Asp Pro Thr Tyr Cys Pro Asp Asp 
    50                  55                  60                  


Val Thr Cys Ala Gln Asn Cys Cys Leu Asp Gly Ala Asp Tyr Glu Gly 
65                  70                  75                  80  


Thr Tyr Gly Val Thr Ser Ser Gly Ser Ser Leu Lys Leu Asn Phe Val 
                85                  90                  95      


Thr Gly Ser Asn Val Gly Ser Arg Leu Tyr Leu Leu Gln Asp Asp Ser 
            100                 105                 110         


Thr Tyr Gln Ile Phe Lys Leu Leu Asn Arg Glu Phe Ser Phe Asp Val 
        115                 120                 125             


Asp Val Ser Asn Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val 
    130                 135                 140                 


Ala Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Asn Asn Lys Ala 
145                 150                 155                 160 


Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp 
                165                 170                 175     


Leu Lys Phe Ile Asn Gly Met Ala Asn Val Glu Gly Trp Gln Pro Ser 
            180                 185                 190         


Ser Asn Asn Ala Asn Thr Gly Ile Gly Asp His Gly Ser Cys Cys Ala 
        195                 200                 205             


Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Asn Ala Val Thr Leu 
    210                 215                 220                 


His Pro Cys Asp Thr Pro Gly Gln Thr Met Cys Ser Gly Asp Asp Cys 
225                 230                 235                 240 


Gly Gly Thr Tyr Ser Asn Asp Arg Tyr Ala Gly Thr Cys Asp Pro Asp 
                245                 250                 255     


Gly Cys Asp Phe Asn Pro Tyr Arg Met Gly Asn Thr Ser Phe Tyr Gly 
            260                 265                 270         


Pro Gly Lys Ile Ile Asp Thr Thr Lys Pro Phe Thr Val Val Thr Gln 
        275                 280                 285             


Phe Leu Thr Asp Asp Gly Thr Asp Thr Gly Thr Leu Ser Glu Ile Lys 
    290                 295                 300                 


Arg Phe Tyr Ile Gln Asn Gly Asn Val Ile Pro Gln Pro Asn Ser Ile 
305                 310                 315                 320 


Ile Ser Gly Val Thr Gly Asn Ser Ile Thr Thr Glu Phe Cys Thr Ala 
                325                 330                 335     


Gln Lys Gln Ala Phe Gly Asp Thr Asp Glu Phe Ser Lys His Gly Gly 
            340                 345                 350         


Leu Ala Lys Met Gly Ala Ala Met Gln Gln Gly Met Val Leu Val Met 
        355                 360                 365             


Ser Leu Trp Asp Asp Tyr Ala Ala Gln Met Leu Trp Leu Asp Ser Asp 
    370                 375                 380                 


Tyr Pro Thr Asp Ala Asp Pro Thr Val Pro Gly Ile Ala Arg Gly Thr 
385                 390                 395                 400 


Cys Pro Thr Asp Ser Gly Val Pro Ser Asp Val Glu Ser Gln Ser Pro 
                405                 410                 415     


Asn Ser Tyr Val Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Asn Ser 
            420                 425                 430         


Thr Phe Thr Gly Gly Thr Thr Ser Ser Ser Ser Thr Thr Thr Thr Thr 
        435                 440                 445             


Ser Lys Ser Thr Ser Thr Ser Ser Ser Ser Lys Thr Thr Thr Thr Ser 
    450                 455                 460                 


Val Thr Thr Thr Thr Thr Ser Ser Gly Ser Ser Gly Thr Gly Ala Ala 
465                 470                 475                 480 


His Trp Ala Gln Cys Gly Gly Asn Gly Trp Thr Gly Pro Thr Thr Cys 
                485                 490                 495     


Val Ser Pro Tyr Thr Cys Thr Lys Gln Asn Asp Trp Tyr Ser Gln Cys 
            500                 505                 510         


Leu Ala Ser Ala His His His His His His 
        515                 520         


<210>  47
<211>  1566
<212>  DNA
<213>  artificial

<220>
<223>  Coding Sequence for Talaromyces emersonii CBHI Mutant with 
       Penicillium janthinellum cellobiohydrolase CBD with 6x His-TAG

<400>  47
ctgcaggcct gcacggcgac ggcagagaac cacccgcccc tgacatggca ggaatgcacc     60

gcccctggga gctgcaccac caggaacggg gcggtcgttc ttgatgcgaa ctggcgttgg    120

gtgcacgatg tgaacggata caccaactgc tacacgggca atacctggga ccccacgtac    180

tgccctgacg acgtaacctg cgcccagaac tgttgcctgg acggcgcgga ttacgagggc    240

acctacggcg tgacttcgtc gggcagctcc ttgaaactca atttcgtcac cgggtcgaac    300

gtcggatccc gtctctacct gctgcaggac gactcgacct atcagatctt caagctcctg    360

aaccgcgagt tcagctttga cgtcgatgtc tccaatcttc cgtgcggatt gaacggcgct    420

ctgtactttg tcgccatgga cgccgacggc ggcgtgtcca agtacccgaa caacaaggct    480

ggtgccaagt acggaaccgg gtattgcgac tcccaatgcc cacgggacct caagttcatc    540

aacggcatgg ccaacgtcga gggctggcag ccgtcatcga acaacgccaa caccggaatt    600

ggcgaccacg gctcctgctg tgcggagatg gatgtctggg aagcaaacag catctccaat    660

gcggtcactc tgcacccgtg cgacacgcca ggccagacga tgtgctctgg agatgactgc    720

ggtggcacat actctaacga tcgctacgcg ggaacctgcg atcctgacgg ctgtgacttc    780

aacccttacc gcatgggcaa cacttctttc tacgggcctg gcaagatcat cgataccacc    840

aagcccttca ctgtcgtgac gcagttcctc actgatgatg gtacggatac tggaactctc    900

agcgagatca agcgcttcta catccagaac ggcaacgtca ttccgcagcc caactcgatc    960

atcagtggcg tgaccggcaa ctcgatcacg acggagttct gcactgctca gaagcaggcc   1020

tttggcgaca cggacgaatt ctctaagcac ggtggcctgg ccaagatggg agcggccatg   1080

cagcagggta tggtcctggt gatgagtttg tgggacgact acgccgcgca gatgctgtgg   1140

ttggattccg actacccgac ggatgcggac cccacggtcc ctggtattgc ccgtggaacg   1200

tgtccgacgg actcgggcgt cccatcggat gtcgagtcgc agagccccaa ctcctacgtg   1260

accttctcga acattaagtt tggtccgatc aactcgacct tcactggtgg tactacttca   1320

tcctcctcca ctactactac aacttccaag tccacttcca cttcatcttc atccaagact   1380

acaactactt ccgttacaac tactactact tcctctggtt cttctggtac tggtgctgct   1440

cattgggctc aatgtggtgg taatggatgg actggtccaa ctacttgtgt ttccccatac   1500

acttgtacta agcagaacga ctggtactct caatgtttgg cttctgctca tcatcaccat   1560

caccac                                                              1566


<210>  48
<211>  510
<212>  PRT
<213>  artificial

<220>
<223>  Talaromyces emersonii CBHI Mutant with Irpex lacteus 
       cellobiohydrolase CBD with 6x His-TAG

<400>  48

Leu Gln Ala Cys Thr Ala Thr Ala Glu Asn His Pro Pro Leu Thr Trp 
1               5                   10                  15      


Gln Glu Cys Thr Ala Pro Gly Ser Cys Thr Thr Arg Asn Gly Ala Val 
            20                  25                  30          


Val Leu Asp Ala Asn Trp Arg Trp Val His Asp Val Asn Gly Tyr Thr 
        35                  40                  45              


Asn Cys Tyr Thr Gly Asn Thr Trp Asp Pro Thr Tyr Cys Pro Asp Asp 
    50                  55                  60                  


Val Thr Cys Ala Gln Asn Cys Cys Leu Asp Gly Ala Asp Tyr Glu Gly 
65                  70                  75                  80  


Thr Tyr Gly Val Thr Ser Ser Gly Ser Ser Leu Lys Leu Asn Phe Val 
                85                  90                  95      


Thr Gly Ser Asn Val Gly Ser Arg Leu Tyr Leu Leu Gln Asp Asp Ser 
            100                 105                 110         


Thr Tyr Gln Ile Phe Lys Leu Leu Asn Arg Glu Phe Ser Phe Asp Val 
        115                 120                 125             


Asp Val Ser Asn Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val 
    130                 135                 140                 


Ala Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Asn Asn Lys Ala 
145                 150                 155                 160 


Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp 
                165                 170                 175     


Leu Lys Phe Ile Asn Gly Met Ala Asn Val Glu Gly Trp Gln Pro Ser 
            180                 185                 190         


Ser Asn Asn Ala Asn Thr Gly Ile Gly Asp His Gly Ser Cys Cys Ala 
        195                 200                 205             


Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Asn Ala Val Thr Leu 
    210                 215                 220                 


His Pro Cys Asp Thr Pro Gly Gln Thr Met Cys Ser Gly Asp Asp Cys 
225                 230                 235                 240 


Gly Gly Thr Tyr Ser Asn Asp Arg Tyr Ala Gly Thr Cys Asp Pro Asp 
                245                 250                 255     


Gly Cys Asp Phe Asn Pro Tyr Arg Met Gly Asn Thr Ser Phe Tyr Gly 
            260                 265                 270         


Pro Gly Lys Ile Ile Asp Thr Thr Lys Pro Phe Thr Val Val Thr Gln 
        275                 280                 285             


Phe Leu Thr Asp Asp Gly Thr Asp Thr Gly Thr Leu Ser Glu Ile Lys 
    290                 295                 300                 


Arg Phe Tyr Ile Gln Asn Gly Asn Val Ile Pro Gln Pro Asn Ser Ile 
305                 310                 315                 320 


Ile Ser Gly Val Thr Gly Asn Ser Ile Thr Thr Glu Phe Cys Thr Ala 
                325                 330                 335     


Gln Lys Gln Ala Phe Gly Asp Thr Asp Glu Phe Ser Lys His Gly Gly 
            340                 345                 350         


Leu Ala Lys Met Gly Ala Ala Met Gln Gln Gly Met Val Leu Val Met 
        355                 360                 365             


Ser Leu Trp Asp Asp Tyr Ala Ala Gln Met Leu Trp Leu Asp Ser Asp 
    370                 375                 380                 


Tyr Pro Thr Asp Ala Asp Pro Thr Val Pro Gly Ile Ala Arg Gly Thr 
385                 390                 395                 400 


Cys Pro Thr Asp Ser Gly Val Pro Ser Asp Val Glu Ser Gln Ser Pro 
                405                 410                 415     


Asn Ser Tyr Val Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Asn Ser 
            420                 425                 430         


Thr Phe Thr Gly Thr Gly Ser Thr Ser Pro Ser Ser Pro Ala Gly Pro 
        435                 440                 445             


Val Ser Ser Ser Thr Ser Val Ala Ser Gln Pro Thr Gln Pro Ala Gln 
    450                 455                 460                 


Gly Thr Val Ala Gln Trp Gly Gln Cys Gly Gly Thr Gly Phe Thr Gly 
465                 470                 475                 480 


Pro Thr Val Cys Ala Ser Pro Phe Thr Cys His Val Val Asn Pro Tyr 
                485                 490                 495     


Tyr Ser Gln Cys Tyr Ala Ser Ala His His His His His His 
            500                 505                 510 


<210>  49
<211>  1530
<212>  DNA
<213>  artificial

<220>
<223>  Coding Sequence for Talaromyces emersonii CBHI Mutant with Irpex 
       lacteus cellobiohydrolase CBD with 6x His-TAG

<400>  49
ctgcaggcct gcacggcgac ggcagagaac cacccgcccc tgacatggca ggaatgcacc     60

gcccctggga gctgcaccac caggaacggg gcggtcgttc ttgatgcgaa ctggcgttgg    120

gtgcacgatg tgaacggata caccaactgc tacacgggca atacctggga ccccacgtac    180

tgccctgacg acgtaacctg cgcccagaac tgttgcctgg acggcgcgga ttacgagggc    240

acctacggcg tgacttcgtc gggcagctcc ttgaaactca atttcgtcac cgggtcgaac    300

gtcggatccc gtctctacct gctgcaggac gactcgacct atcagatctt caagctcctg    360

aaccgcgagt tcagctttga cgtcgatgtc tccaatcttc cgtgcggatt gaacggcgct    420

ctgtactttg tcgccatgga cgccgacggc ggcgtgtcca agtacccgaa caacaaggct    480

ggtgccaagt acggaaccgg gtattgcgac tcccaatgcc cacgggacct caagttcatc    540

aacggcatgg ccaacgtcga gggctggcag ccgtcatcga acaacgccaa caccggaatt    600

ggcgaccacg gctcctgctg tgcggagatg gatgtctggg aagcaaacag catctccaat    660

gcggtcactc tgcacccgtg cgacacgcca ggccagacga tgtgctctgg agatgactgc    720

ggtggcacat actctaacga tcgctacgcg ggaacctgcg atcctgacgg ctgtgacttc    780

aacccttacc gcatgggcaa cacttctttc tacgggcctg gcaagatcat cgataccacc    840

aagcccttca ctgtcgtgac gcagttcctc actgatgatg gtacggatac tggaactctc    900

agcgagatca agcgcttcta catccagaac ggcaacgtca ttccgcagcc caactcgatc    960

atcagtggcg tgaccggcaa ctcgatcacg acggagttct gcactgctca gaagcaggcc   1020

tttggcgaca cggacgaatt ctctaagcac ggtggcctgg ccaagatggg agcggccatg   1080

cagcagggta tggtcctggt gatgagtttg tgggacgact acgccgcgca gatgctgtgg   1140

ttggattccg actacccgac ggatgcggac cccacggtcc ctggtattgc ccgtggaacg   1200

tgtccgacgg actcgggcgt cccatcggat gtcgagtcgc agagccccaa ctcctacgtg   1260

accttctcga acattaagtt tggtccgatc aactcgacct tcactggtac tggttctact   1320

tctccatctt ctccagctgg tccagtttct tcttccactt ccgttgcttc ccaaccaact   1380

caaccagctc aaggtactgt tgctcaatgg ggacaatgtg gtggtactgg tttcactggt   1440

ccaactgttt gtgcttcccc attcacttgt cacgttgtta acccatacta ctcccagtgt   1500

tacgcttctg ctcatcatca tcaccatcac                                    1530


<210>  50
<211>  509
<212>  PRT
<213>  artificial

<220>
<223>  Talaromyces emersonii CBHI Mutant with mutated Trichoderma reesei
       CBD with 6x His-TAG

<400>  50

Leu Gln Ala Cys Thr Ala Thr Ala Glu Asn His Pro Pro Leu Thr Trp 
1               5                   10                  15      


Gln Glu Cys Thr Ala Pro Gly Ser Cys Thr Thr Arg Asn Gly Ala Val 
            20                  25                  30          


Val Leu Asp Ala Asn Trp Arg Trp Val His Asp Val Asn Gly Tyr Thr 
        35                  40                  45              


Asn Cys Tyr Thr Gly Asn Thr Trp Asp Pro Thr Tyr Cys Pro Asp Asp 
    50                  55                  60                  


Val Thr Cys Ala Gln Asn Cys Cys Leu Asp Gly Ala Asp Tyr Glu Gly 
65                  70                  75                  80  


Thr Tyr Gly Val Thr Ser Ser Gly Ser Ser Leu Lys Leu Asn Phe Val 
                85                  90                  95      


Thr Gly Ser Asn Val Gly Ser Arg Leu Tyr Leu Leu Gln Asp Asp Ser 
            100                 105                 110         


Thr Tyr Gln Ile Phe Lys Leu Leu Asn Arg Glu Phe Ser Phe Asp Val 
        115                 120                 125             


Asp Val Ser Asn Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val 
    130                 135                 140                 


Ala Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Asn Asn Lys Ala 
145                 150                 155                 160 


Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp 
                165                 170                 175     


Leu Lys Phe Ile Asn Gly Met Ala Asn Val Glu Gly Trp Gln Pro Ser 
            180                 185                 190         


Ser Asn Asn Ala Asn Thr Gly Ile Gly Asp His Gly Ser Cys Cys Ala 
        195                 200                 205             


Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Asn Ala Val Thr Leu 
    210                 215                 220                 


His Pro Cys Asp Thr Pro Gly Gln Thr Met Cys Ser Gly Asp Asp Cys 
225                 230                 235                 240 


Gly Gly Thr Tyr Ser Asn Asp Arg Tyr Ala Gly Thr Cys Asp Pro Asp 
                245                 250                 255     


Gly Cys Asp Phe Asn Pro Tyr Arg Met Gly Asn Thr Ser Phe Tyr Gly 
            260                 265                 270         


Pro Gly Lys Ile Ile Asp Thr Thr Lys Pro Phe Thr Val Val Thr Gln 
        275                 280                 285             


Phe Leu Thr Asp Asp Gly Thr Asp Thr Gly Thr Leu Ser Glu Ile Lys 
    290                 295                 300                 


Arg Phe Tyr Ile Gln Asn Gly Asn Val Ile Pro Gln Pro Asn Ser Ile 
305                 310                 315                 320 


Ile Ser Gly Val Thr Gly Asn Ser Ile Thr Thr Glu Phe Cys Thr Ala 
                325                 330                 335     


Gln Lys Gln Ala Phe Gly Asp Thr Asp Glu Phe Ser Lys His Gly Gly 
            340                 345                 350         


Leu Ala Lys Met Gly Ala Ala Met Gln Gln Gly Met Val Leu Val Met 
        355                 360                 365             


Ser Leu Trp Asp Asp Tyr Ala Ala Gln Met Leu Trp Leu Asp Ser Asp 
    370                 375                 380                 


Tyr Pro Thr Asp Ala Asp Pro Thr Val Pro Gly Ile Ala Arg Gly Thr 
385                 390                 395                 400 


Cys Pro Thr Asp Ser Gly Val Pro Ser Asp Val Glu Ser Gln Ser Pro 
                405                 410                 415     


Asn Ser Tyr Val Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser 
            420                 425                 430         


Thr Gly Asn Pro Ser Gly Gly Asn Pro Ser Gly Gly Asp Gly Gly Thr 
        435                 440                 445             


Thr Thr Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro 
    450                 455                 460                 


Thr Gln Ser Leu Tyr Gly Gln Cys Gly Gly Ile Gly Tyr Ser Gly Pro 
465                 470                 475                 480 


Thr Ile Cys Ala Ser Gly Thr Thr Cys Gln Val Leu Asn Pro Tyr Tyr 
                485                 490                 495     


Ser Gln Cys Leu Ala Ser Ala His His His His His His 
            500                 505                 


<210>  51
<211>  1527
<212>  DNA
<213>  artificial

<220>
<223>  Coding Sequence for Talaromyces emersonii CBHI Mutant with 
       mutated Trichoderma reesei CBD with 6x His-TAG

<400>  51
ctgcaggcct gcacggcgac ggcagagaac cacccgcccc tgacatggca ggaatgcacc     60

gcccctggga gctgcaccac caggaacggg gcggtcgttc ttgatgcgaa ctggcgttgg    120

gtgcacgatg tgaacggata caccaactgc tacacgggca atacctggga ccccacgtac    180

tgccctgacg acgtaacctg cgcccagaac tgttgcctgg acggcgcgga ttacgagggc    240

acctacggcg tgacttcgtc gggcagctcc ttgaaactca atttcgtcac cgggtcgaac    300

gtcggatccc gtctctacct gctgcaggac gactcgacct atcagatctt caagctcctg    360

aaccgcgagt tcagctttga cgtcgatgtc tccaatcttc cgtgcggatt gaacggcgct    420

ctgtactttg tcgccatgga cgccgacggc ggcgtgtcca agtacccgaa caacaaggct    480

ggtgccaagt acggaaccgg gtattgcgac tcccaatgcc cacgggacct caagttcatc    540

aacggcatgg ccaacgtcga gggctggcag ccgtcatcga acaacgccaa caccggaatt    600

ggcgaccacg gctcctgctg tgcggagatg gatgtctggg aagcaaacag catctccaat    660

gcggtcactc tgcacccgtg cgacacgcca ggccagacga tgtgctctgg agatgactgc    720

ggtggcacat actctaacga tcgctacgcg ggaacctgcg atcctgacgg ctgtgacttc    780

aacccttacc gcatgggcaa cacttctttc tacgggcctg gcaagatcat cgataccacc    840

aagcccttca ctgtcgtgac gcagttcctc actgatgatg gtacggatac tggaactctc    900

agcgagatca agcgcttcta catccagaac ggcaacgtca ttccgcagcc caactcgatc    960

atcagtggcg tgaccggcaa ctcgatcacg acggagttct gcactgctca gaagcaggcc   1020

tttggcgaca cggacgaatt ctctaagcac ggtggcctgg ccaagatggg agcggccatg   1080

cagcagggta tggtcctggt gatgagtttg tgggacgact acgccgcgca gatgctgtgg   1140

ttggattccg actacccgac ggatgcggac cccacggtcc ctggtattgc ccgtggaacg   1200

tgtccgacgg actcgggcgt cccatcggat gtcgagtcgc agagccccaa ctcctacgtg   1260

accttctcga acattaagtt tggtccgatc ggtagcacag gtaatccttc aggtggtaat   1320

ccttcaggtg gagacggcgg aacaacgaca actagaagac cagctactac aactggttca   1380

agtccaggtc caactcaatc actatacggt caatgtggtg gtataggtta ctctggtccc   1440

actatttgtg cttctggtac tacttgccaa gttctgaacc cttactactc acagtgtcta   1500

gcttctgcac atcatcacca ccaccat                                       1527




