                         SEQUENCE LISTING

<110>  E. I. du Pont de Nemours and Company
       Behabtu, Natnael
 
<120>  ENZYMATICALLY PRODUCED CELLULOSE

<130>  CL6399

<150>  PCT/CN2014/094594
<151>  2014-12-23

<150>  PCT/CN2014/094593
<151>  2014-12-23

<160>  8

<170>  PatentIn version 3.5

<210>  1
<211>  2415
<212>  DNA
<213>  Vibrio ruber


<220>
<221>  misc_feature
<222>  (1)..(2415)
<223>  VruCdp1_wild type

<400>  1
atgatgaaat tcggatattt tgacgataaa aataaagaat atgttgccac aacaccatgt       60
acaccaatca aatggtgtaa ttatgtggga actttaaact ttggtggttt agtcgatagt      120
aacggcggta ttttactgtg taagggtgat cccgcactca atcgtatcac caaatatatt      180
gcccagatgc ccaatgccga ctttaaaggt tcgacactct atttgaaggt tcgcaatcaa      240
aacggggaag tgacaatatt ttctccgttt tatacaccga ctttaaagcc gttagataaa      300
tttgaaaatc acaccggact ttcttatacc accattattg ccgaagctta tggtgtgcgc      360
tgtgagagca ctttctttgt accgaagcag gacccgtttt tactgcaaga tattaaagtg      420
accaatattt ccggtgagga tttacatgtt gatgtgatac ccgtggttga attcacccac      480
tttgatgcat tgaagcaatt ggtgaatgcc gactgggtgc ctcagaccat gattttgcaa      540
gcacatcatc aagatttggg acataccgtt ttggaacagt atgcctttat gaagcgtgat      600
tatgctgtga acttactgac cgctgatcgg ccagcgactt catttgacgg ggatcgccag      660
aagttccttg gtaacctcgg gtatggcagt tgggcggcac cggcagcact caatgatgca      720
gaactgacca acagcgagtg tttgcgcggc gataatatcg gtgcgctgaa cttacgttta      780
ggctggctga aacctcagca aaccgagcgg acggtggtgc agttgactca gatggcgagt      840
ctcgatgcgg cacaaccgat gttagagaaa tatcgcgatc atcaagtggt tgatcaggct      900
tttgccgcac tgggcgagtt ctgggatgac tatttatcgg cgattcaagt ggctacacct      960
gatgcagcaa tgaactcaat gctgaatgtg cacaacccgc gtcagtgtca caccaccaaa     1020
aactggtcgc gttatttatc actgtatcag ctcggctatg gtgcgcgtgg gatcgggttc     1080
cgcgattcat cgcaggatat tctcggtgtg atcagccaca tgcctgaaga agcacgcgaa     1140
tttatcgaac gtttgctgtc agtgcaaaat accgatggtt ccgctatgca tcagttcttc     1200
ccttcaacca tggaagccaa tgccggtgac tcacgtgaag aagaagaccg ccctgactac     1260
tatggtgatg atcacttgtg gatcatttat gccgtcacgc aatatgtgaa agaaaccggt     1320
aatgcagatt ttctcaacca agtgattcct tattatcaaa aagataaaca gggtaatccg     1380
gttgagtcag ggacggtttg ggatcattta tgccgggcga ttgattttac ggcaacgcat     1440
accgggcagc atggcttacc gttgctggga ttcgcggact ggaatgacac agtgaactta     1500
ccgacgggtg ctgagtcgct gatggtcgcc aatatgtacg gtaaagcatt attggatatg     1560
ctcgatttgt gtcagctccg tggtgaggat tcgctcgcac agcgttacca aagccagtac     1620
gaacagatgc agcataccgt caatcagtat ggctgggacg gggaatggtt tgtccgttac     1680
tttgatgaaa agggggcacc gattggttca cataccaatg ctcaggggca aatttatacc     1740
aatggacaaa gctggccggt gatctccggg tttgccacgc ctgagcgcgc catgcaagct     1800
ttggattctg ttcataccaa actcaatacc gcgaacggca ttaagctttc cactcccgga     1860
tataacggat tttcgcctga actgggcggg gtttctactt acccgcccgg agcgaaagag     1920
aatggcggga tcttcttgca cgcgaaccca tggatgatga ttgctgaaac caaagtcggc     1980
aatggtgatc gcgcttatca gtattaccga caaattaatc cggcttctaa gaatgatcag     2040
atcgaggtgt ttgagtctga accctactgt tatccgcaaa atattttggg agatgagcac     2100
ccgcaatttg gtttaggccg taatgcatgg ttgtccggta cgtcatcgtg gacatatgtg     2160
gcaggcacgc agtggatttt aggtgtgcgg cctgaagttg acgggttacg cattgatcct     2220
tgtattccgc gtgactggcc tgaattttcc gtacagcgga aattccgggg agcgacttac     2280
cggattcatg ttgccaatcc gcatcatgtc aaccggggcg tcacagagat gcgcgtcgat     2340
ggggttgtga tccaagggaa taaagcaccg gtatttaccg atggtgaaca tcatatcgag     2400
attactttag gtcaa                                                      2415

<210>  2
<211>  805
<212>  PRT
<213>  Vibrio ruber

<220>
<221>  MISC_FEATURE
<222>  (1)..(805)
<223>  VruCdp1_wild type protein

<400>  2
Met Met Lys Phe Gly Tyr Phe Asp Asp Lys Asn Lys Glu Tyr Val Ala 
1               5                   10                  15      
Thr Thr Pro Cys Thr Pro Ile Lys Trp Cys Asn Tyr Val Gly Thr Leu 
            20                  25                  30          
Asn Phe Gly Gly Leu Val Asp Ser Asn Gly Gly Ile Leu Leu Cys Lys 
        35                  40                  45              
Gly Asp Pro Ala Leu Asn Arg Ile Thr Lys Tyr Ile Ala Gln Met Pro 
    50                  55                  60                  
Asn Ala Asp Phe Lys Gly Ser Thr Leu Tyr Leu Lys Val Arg Asn Gln 
65                  70                  75                  80  
Asn Gly Glu Val Thr Ile Phe Ser Pro Phe Tyr Thr Pro Thr Leu Lys 
                85                  90                  95      
Pro Leu Asp Lys Phe Glu Asn His Thr Gly Leu Ser Tyr Thr Thr Ile 
            100                 105                 110         
Ile Ala Glu Ala Tyr Gly Val Arg Cys Glu Ser Thr Phe Phe Val Pro 
        115                 120                 125             
Lys Gln Asp Pro Phe Leu Leu Gln Asp Ile Lys Val Thr Asn Ile Ser 
    130                 135                 140                 
Gly Glu Asp Leu His Val Asp Val Ile Pro Val Val Glu Phe Thr His 
145                 150                 155                 160 
Phe Asp Ala Leu Lys Gln Leu Val Asn Ala Asp Trp Val Pro Gln Thr 
                165                 170                 175     
Met Ile Leu Gln Ala His His Gln Asp Leu Gly His Thr Val Leu Glu 
            180                 185                 190         
Gln Tyr Ala Phe Met Lys Arg Asp Tyr Ala Val Asn Leu Leu Thr Ala 
        195                 200                 205             
Asp Arg Pro Ala Thr Ser Phe Asp Gly Asp Arg Gln Lys Phe Leu Gly 
    210                 215                 220                 
Asn Leu Gly Tyr Gly Ser Trp Ala Ala Pro Ala Ala Leu Asn Asp Ala 
225                 230                 235                 240 
Glu Leu Thr Asn Ser Glu Cys Leu Arg Gly Asp Asn Ile Gly Ala Leu 
                245                 250                 255     
Asn Leu Arg Leu Gly Trp Leu Lys Pro Gln Gln Thr Glu Arg Thr Val 
            260                 265                 270         
Val Gln Leu Thr Gln Met Ala Ser Leu Asp Ala Ala Gln Pro Met Leu 
        275                 280                 285             
Glu Lys Tyr Arg Asp His Gln Val Val Asp Gln Ala Phe Ala Ala Leu 
    290                 295                 300                 
Gly Glu Phe Trp Asp Asp Tyr Leu Ser Ala Ile Gln Val Ala Thr Pro 
305                 310                 315                 320 
Asp Ala Ala Met Asn Ser Met Leu Asn Val His Asn Pro Arg Gln Cys 
                325                 330                 335     
His Thr Thr Lys Asn Trp Ser Arg Tyr Leu Ser Leu Tyr Gln Leu Gly 
            340                 345                 350         
Tyr Gly Ala Arg Gly Ile Gly Phe Arg Asp Ser Ser Gln Asp Ile Leu 
        355                 360                 365             
Gly Val Ile Ser His Met Pro Glu Glu Ala Arg Glu Phe Ile Glu Arg 
    370                 375                 380                 
Leu Leu Ser Val Gln Asn Thr Asp Gly Ser Ala Met His Gln Phe Phe 
385                 390                 395                 400 
Pro Ser Thr Met Glu Ala Asn Ala Gly Asp Ser Arg Glu Glu Glu Asp 
                405                 410                 415     
Arg Pro Asp Tyr Tyr Gly Asp Asp His Leu Trp Ile Ile Tyr Ala Val 
            420                 425                 430         
Thr Gln Tyr Val Lys Glu Thr Gly Asn Ala Asp Phe Leu Asn Gln Val 
        435                 440                 445             
Ile Pro Tyr Tyr Gln Lys Asp Lys Gln Gly Asn Pro Val Glu Ser Gly 
    450                 455                 460                 
Thr Val Trp Asp His Leu Cys Arg Ala Ile Asp Phe Thr Ala Thr His 
465                 470                 475                 480 
Thr Gly Gln His Gly Leu Pro Leu Leu Gly Phe Ala Asp Trp Asn Asp 
                485                 490                 495     
Thr Val Asn Leu Pro Thr Gly Ala Glu Ser Leu Met Val Ala Asn Met 
            500                 505                 510         
Tyr Gly Lys Ala Leu Leu Asp Met Leu Asp Leu Cys Gln Leu Arg Gly 
        515                 520                 525             
Glu Asp Ser Leu Ala Gln Arg Tyr Gln Ser Gln Tyr Glu Gln Met Gln 
    530                 535                 540                 
His Thr Val Asn Gln Tyr Gly Trp Asp Gly Glu Trp Phe Val Arg Tyr 
545                 550                 555                 560 
Phe Asp Glu Lys Gly Ala Pro Ile Gly Ser His Thr Asn Ala Gln Gly 
                565                 570                 575     
Gln Ile Tyr Thr Asn Gly Gln Ser Trp Pro Val Ile Ser Gly Phe Ala 
            580                 585                 590         
Thr Pro Glu Arg Ala Met Gln Ala Leu Asp Ser Val His Thr Lys Leu 
        595                 600                 605             
Asn Thr Ala Asn Gly Ile Lys Leu Ser Thr Pro Gly Tyr Asn Gly Phe 
    610                 615                 620                 
Ser Pro Glu Leu Gly Gly Val Ser Thr Tyr Pro Pro Gly Ala Lys Glu 
625                 630                 635                 640 
Asn Gly Gly Ile Phe Leu His Ala Asn Pro Trp Met Met Ile Ala Glu 
                645                 650                 655     
Thr Lys Val Gly Asn Gly Asp Arg Ala Tyr Gln Tyr Tyr Arg Gln Ile 
            660                 665                 670         
Asn Pro Ala Ser Lys Asn Asp Gln Ile Glu Val Phe Glu Ser Glu Pro 
        675                 680                 685             
Tyr Cys Tyr Pro Gln Asn Ile Leu Gly Asp Glu His Pro Gln Phe Gly 
    690                 695                 700                 
Leu Gly Arg Asn Ala Trp Leu Ser Gly Thr Ser Ser Trp Thr Tyr Val 
705                 710                 715                 720 
Ala Gly Thr Gln Trp Ile Leu Gly Val Arg Pro Glu Val Asp Gly Leu 
                725                 730                 735     
Arg Ile Asp Pro Cys Ile Pro Arg Asp Trp Pro Glu Phe Ser Val Gln 
            740                 745                 750         
Arg Lys Phe Arg Gly Ala Thr Tyr Arg Ile His Val Ala Asn Pro His 
        755                 760                 765             
His Val Asn Arg Gly Val Thr Glu Met Arg Val Asp Gly Val Val Ile 
    770                 775                 780                 
Gln Gly Asn Lys Ala Pro Val Phe Thr Asp Gly Glu His His Ile Glu 
785                 790                 795                 800 
Ile Thr Leu Gly Gln 
                805 

<210>  3
<211>  2442
<212>  DNA
<213>  Artificial sequence

<220>
<223>  VruCdp1 with added sequences

<400>  3
atgatgaaat tcggctactt cgacgacaaa aacaaagaat atgttgcaac caccccgtgt       60
accccgatta aatggtgtaa ttatgttggc accctgaatt ttggtggtct ggttgatagc      120
aatggtggta ttctgctgtg taaaggtgat ccggcactga atcgtattac caaatatatc      180
gcacagatgc cgaacgccga ttttaaaggt agcaccctgt atctgaaagt gcgtaatcag      240
aatggtgaag tgaccatttt tagcccgttt tataccccga ccctgaaacc gctggataaa      300
tttgaaaatc ataccggtct gagctacacc accattattg ccgaagccta tggtgttcgt      360
tgtgaaagca ccttttttgt tccgaaacag gatccatttc tgctgcagga tatcaaagtt      420
accaatatca gcggtgaaga tctgcatgtt gatgttattc cggttgtgga atttacccat      480
tttgatgcac tgaaacagct ggttaatgca gattgggttc cgcagaccat gattctgcag      540
gcacatcatc aggatctggg tcataccgtt ctggaacagt atgcatttat gaaacgtgat      600
tatgccgtta atctgctgac cgcagatcgt ccggcaacca gctttgatgg tgatcgtcag      660
aaattcctgg gtaatctggg ttatggtagc tgggcagcac cggcagcact gaatgatgca      720
gaactgacca atagcgaatg tctgcgtggt gataatattg gtgccctgaa tctgcgtctg      780
ggttggctga aacctcagca gaccgaacgt accgttgttc agctgacaca gatggcaagc      840
ctggatgcag cacagccgat gctggaaaaa tatcgtgatc atcaggttgt tgatcaggca      900
tttgcagcac tgggcgaatt ttgggatgat tatctgagcg caattcaggt tgcgacaccg      960
gatgcagcca tgaatagcat gctgaatgtt cataatccgc gtcagtgtca taccacaaaa     1020
aattggagcc gttatctgag tctgtatcag ctgggctatg gtgcacgtgg tattggtttt     1080
cgtgatagca gccaggatat tctgggtgtt attagccaca tgccggaaga agcacgcgaa     1140
tttattgaac gtctgctgtc agttcagaat accgatggta gcgcaatgca tcagtttttt     1200
ccgagcacaa tggaagcaaa tgccggtgat agccgtgaag aagaagatcg tcctgattat     1260
tatggtgatg accatctgtg gattatctat gcagttaccc agtatgttaa agaaaccggc     1320
aatgccgatt ttctgaatca ggttattccg tactaccaga aagataaaca gggtaatccg     1380
gttgaaagcg gcaccgtttg ggatcatctg tgccgtgcaa ttgatttcac cgcaacccat     1440
acaggtcagc atggcctgcc gctgctgggt tttgccgatt ggaatgatac cgtgaatctg     1500
ccgacaggtg cagaaagcct gatggttgcc aatatgtatg gtaaagcact gctggatatg     1560
ctggatctgt gccaactgcg tggcgaagat agcctggcac agcgttatca gagccagtat     1620
gagcagatgc agcataccgt taatcagtat ggttgggatg gtgaatggtt tgtgcgttat     1680
tttgatgaaa aaggcgcacc gattggtagc cataccaatg cacagggtca gatttatacc     1740
aatggtcaga gctggccagt tattagcggt tttgcaacac cggaacgtgc aatgcaggca     1800
ctggatagcg ttcataccaa actgaatacc gccaatggta ttaaactgag cacaccgggt     1860
tataatggtt ttagtccgga actgggtggt gttagcacct atccgcctgg tgcaaaagaa     1920
aatggtggca tttttctgca tgcaaatccg tggatgatga ttgcagaaac caaagttggt     1980
aatggcgatc gtgcatatca gtattatcgt cagattaatc cggcaagcaa aaacgatcag     2040
atcgaagttt ttgaaagcga gccgtattgt tatccgcaga acatcctggg tgatgaacat     2100
ccgcagtttg gtctgggtcg taatgcatgg ctgagcggca ccagcagctg gacctatgtt     2160
gcaggcaccc agtggattct gggcgttcgt ccggaagttg atggcctgcg tattgatccg     2220
tgtattccgc gtgattggcc tgaatttagc gttcagcgta aatttcgtgg tgcaacctat     2280
cgtattcatg ttgccaatcc gcatcatgtt aatcgtggtg ttaccgaaat gcgtgttgat     2340
ggtgttgtta ttcagggtaa taaagcaccg gtttttaccg atggcgaaca tcacattgaa     2400
attaccctgg gtcagctcga gcaccaccac caccaccact ga                        2442

<210>  4
<211>  813
<212>  PRT
<213>  Artificial sequence

<220>
<223>  VruCdp1 with added sequences_protein

<400>  4
Met Met Lys Phe Gly Tyr Phe Asp Asp Lys Asn Lys Glu Tyr Val Ala 
1               5                   10                  15      
Thr Thr Pro Cys Thr Pro Ile Lys Trp Cys Asn Tyr Val Gly Thr Leu 
            20                  25                  30          
Asn Phe Gly Gly Leu Val Asp Ser Asn Gly Gly Ile Leu Leu Cys Lys 
        35                  40                  45              
Gly Asp Pro Ala Leu Asn Arg Ile Thr Lys Tyr Ile Ala Gln Met Pro 
    50                  55                  60                  
Asn Ala Asp Phe Lys Gly Ser Thr Leu Tyr Leu Lys Val Arg Asn Gln 
65                  70                  75                  80  
Asn Gly Glu Val Thr Ile Phe Ser Pro Phe Tyr Thr Pro Thr Leu Lys 
                85                  90                  95      
Pro Leu Asp Lys Phe Glu Asn His Thr Gly Leu Ser Tyr Thr Thr Ile 
            100                 105                 110         
Ile Ala Glu Ala Tyr Gly Val Arg Cys Glu Ser Thr Phe Phe Val Pro 
        115                 120                 125             
Lys Gln Asp Pro Phe Leu Leu Gln Asp Ile Lys Val Thr Asn Ile Ser 
    130                 135                 140                 
Gly Glu Asp Leu His Val Asp Val Ile Pro Val Val Glu Phe Thr His 
145                 150                 155                 160 
Phe Asp Ala Leu Lys Gln Leu Val Asn Ala Asp Trp Val Pro Gln Thr 
                165                 170                 175     
Met Ile Leu Gln Ala His His Gln Asp Leu Gly His Thr Val Leu Glu 
            180                 185                 190         
Gln Tyr Ala Phe Met Lys Arg Asp Tyr Ala Val Asn Leu Leu Thr Ala 
        195                 200                 205             
Asp Arg Pro Ala Thr Ser Phe Asp Gly Asp Arg Gln Lys Phe Leu Gly 
    210                 215                 220                 
Asn Leu Gly Tyr Gly Ser Trp Ala Ala Pro Ala Ala Leu Asn Asp Ala 
225                 230                 235                 240 
Glu Leu Thr Asn Ser Glu Cys Leu Arg Gly Asp Asn Ile Gly Ala Leu 
                245                 250                 255     
Asn Leu Arg Leu Gly Trp Leu Lys Pro Gln Gln Thr Glu Arg Thr Val 
            260                 265                 270         
Val Gln Leu Thr Gln Met Ala Ser Leu Asp Ala Ala Gln Pro Met Leu 
        275                 280                 285             
Glu Lys Tyr Arg Asp His Gln Val Val Asp Gln Ala Phe Ala Ala Leu 
    290                 295                 300                 
Gly Glu Phe Trp Asp Asp Tyr Leu Ser Ala Ile Gln Val Ala Thr Pro 
305                 310                 315                 320 
Asp Ala Ala Met Asn Ser Met Leu Asn Val His Asn Pro Arg Gln Cys 
                325                 330                 335     
His Thr Thr Lys Asn Trp Ser Arg Tyr Leu Ser Leu Tyr Gln Leu Gly 
            340                 345                 350         
Tyr Gly Ala Arg Gly Ile Gly Phe Arg Asp Ser Ser Gln Asp Ile Leu 
        355                 360                 365             
Gly Val Ile Ser His Met Pro Glu Glu Ala Arg Glu Phe Ile Glu Arg 
    370                 375                 380                 
Leu Leu Ser Val Gln Asn Thr Asp Gly Ser Ala Met His Gln Phe Phe 
385                 390                 395                 400 
Pro Ser Thr Met Glu Ala Asn Ala Gly Asp Ser Arg Glu Glu Glu Asp 
                405                 410                 415     
Arg Pro Asp Tyr Tyr Gly Asp Asp His Leu Trp Ile Ile Tyr Ala Val 
            420                 425                 430         
Thr Gln Tyr Val Lys Glu Thr Gly Asn Ala Asp Phe Leu Asn Gln Val 
        435                 440                 445             
Ile Pro Tyr Tyr Gln Lys Asp Lys Gln Gly Asn Pro Val Glu Ser Gly 
    450                 455                 460                 
Thr Val Trp Asp His Leu Cys Arg Ala Ile Asp Phe Thr Ala Thr His 
465                 470                 475                 480 
Thr Gly Gln His Gly Leu Pro Leu Leu Gly Phe Ala Asp Trp Asn Asp 
                485                 490                 495     
Thr Val Asn Leu Pro Thr Gly Ala Glu Ser Leu Met Val Ala Asn Met 
            500                 505                 510         
Tyr Gly Lys Ala Leu Leu Asp Met Leu Asp Leu Cys Gln Leu Arg Gly 
        515                 520                 525             
Glu Asp Ser Leu Ala Gln Arg Tyr Gln Ser Gln Tyr Glu Gln Met Gln 
    530                 535                 540                 
His Thr Val Asn Gln Tyr Gly Trp Asp Gly Glu Trp Phe Val Arg Tyr 
545                 550                 555                 560 
Phe Asp Glu Lys Gly Ala Pro Ile Gly Ser His Thr Asn Ala Gln Gly 
                565                 570                 575     
Gln Ile Tyr Thr Asn Gly Gln Ser Trp Pro Val Ile Ser Gly Phe Ala 
            580                 585                 590         
Thr Pro Glu Arg Ala Met Gln Ala Leu Asp Ser Val His Thr Lys Leu 
        595                 600                 605             
Asn Thr Ala Asn Gly Ile Lys Leu Ser Thr Pro Gly Tyr Asn Gly Phe 
    610                 615                 620                 
Ser Pro Glu Leu Gly Gly Val Ser Thr Tyr Pro Pro Gly Ala Lys Glu 
625                 630                 635                 640 
Asn Gly Gly Ile Phe Leu His Ala Asn Pro Trp Met Met Ile Ala Glu 
                645                 650                 655     
Thr Lys Val Gly Asn Gly Asp Arg Ala Tyr Gln Tyr Tyr Arg Gln Ile 
            660                 665                 670         
Asn Pro Ala Ser Lys Asn Asp Gln Ile Glu Val Phe Glu Ser Glu Pro 
        675                 680                 685             
Tyr Cys Tyr Pro Gln Asn Ile Leu Gly Asp Glu His Pro Gln Phe Gly 
    690                 695                 700                 
Leu Gly Arg Asn Ala Trp Leu Ser Gly Thr Ser Ser Trp Thr Tyr Val 
705                 710                 715                 720 
Ala Gly Thr Gln Trp Ile Leu Gly Val Arg Pro Glu Val Asp Gly Leu 
                725                 730                 735     
Arg Ile Asp Pro Cys Ile Pro Arg Asp Trp Pro Glu Phe Ser Val Gln 
            740                 745                 750         
Arg Lys Phe Arg Gly Ala Thr Tyr Arg Ile His Val Ala Asn Pro His 
        755                 760                 765             
His Val Asn Arg Gly Val Thr Glu Met Arg Val Asp Gly Val Val Ile 
    770                 775                 780                 
Gln Gly Asn Lys Ala Pro Val Phe Thr Asp Gly Glu His His Ile Glu 
785                 790                 795                 800 
Ile Thr Leu Gly Gln Leu Glu His His His His His His 
                805                 810             

<210>  5
<211>  2397
<212>  DNA
<213>  Ruminococcus champanellensis

<220>
<221>  misc_feature
<222>  (1)..(2397)
<223>  RchCdp1_wild type

<400>  5
atgcagtacg gttactttga ccttgcaaac aaggaatacg tcatcacaag acctgacacc       60
cctgctccct gggcaaacta cctgggagat ccggaatacg gcgctatgat ctccaacaac      120
gcctgcggct acagctttgt aaagagcggc gcaaacggca gaatttcccg gttccggttc      180
aacagcaata tggcgctgcc cggcagatat atctacatcc gggacaatga cactgcggat      240
tactggtctg catcctggca gccggtgggc aagcccctgg atcagtacaa gagcgtatgc      300
cgccacggta ccgcttacac cattatgact gcggattatg caagcgtgca ttccgagacc      360
acctattatg taccctatca ccagacctat gaggtttggc gcacaaagat caccaacacc      420
tccgacaagc ccagaaagct gtccgtgttc ggctttgtgg aattcaccaa cgacaacaac      480
tacgagcagg atcaggtaaa cctccagtac accctgttca tcacccgcac cagctttgag      540
gaaaaccgca tcatccagca catcaatgaa aacagcggca aggacgcttc cggctccaac      600
cacaaggagc gcttcttcgg catggtgggc gctccggttt ccggctggaa cggcaacctg      660
gacagcttca tcggccccta ccggacctat tccaacccca tcgccgtaga gcagggtaag      720
tgcgacggca gcatgaacta caactccaac gcatgcggcg ccctccagag cgacctggag      780
ctgacacccg gcgaaactgc agagctgatc tacattctcg gtcagcgcaa cagcgcagag      840
gctgctacca tcctggatac ctacaagacg ctgggcaagg tggatgcaga aatcgcagag      900
ctgaagaatt tctggcacaa ggagctgtcc aacttccagg tgaacacccc cagcccggaa      960
ttcaacaata tgatcaacgt atggaacgct taccagtgct tcatcacctt catctggtcc     1020
cgtgcggcat ccttcgtata ctgcggtctg cgcaacggct acggctatcg ggataccgtc     1080
caggatatcc agggcatcat tcacctggat ccggaaatgg cagcagacaa gatccgcttt     1140
atgctctccg cacaggttga caacggcggc ggtctgcccc tggtgaagtt caaccacaat     1200
gcgggtcatg agaacacccc ggacgatccg gagtatgtaa aggaaaccgg tcacccctcc     1260
taccgggcgg acgatgctct gtggctgttc cccaccattg tgaagtacat cggggaaagc     1320
ggcaacaagg cattcctgga cgaggtgatc gtatacgcca acggcggcga ggctacggta     1380
tacgaccacc tgaagaacgc tatccggttc tccatggagc ggctgggggc acacgatatg     1440
cctgccgggc tccatgcgga ctggaacgac tgtctgcgga tgggtgccaa gggtgagtcc     1500
acctttgtgg cattccagct gtactatgcg atgcgcgtga tccgggatat ggcacagcag     1560
cggggcgaca gcgattatgt agcttacatc gacgatatac aggcaaagct gggcgcatcc     1620
ctggaaaagt gctgggatgg ggatcggttc atccggggca tccgggaaga cggagtcgtt     1680
gtgggcgcaa agaaggatcc ggaagcctcc atgtggctca atccccagag ctgggcagtg     1740
atctccggct ttgcaagcaa ggatcaggca gagcagtcca tggaatccgt acaccggatt     1800
ctgaacaccc cctacggcat caagctgctg gatcctccct acagagcgca ttactttgac     1860
ggtgctctga tgcacatctt caatccggac accaaggaaa acggtggtat cttctcccag     1920
tcccagggct gggcgatcct ggcggaaagt ctgctgggtc acggaaaccg tgccttcgag     1980
tactttatgg aaagctcccc ggctgccatg aacgacaggg cggagatccg tgtcatggag     2040
ccgtatgtgc acggtcagtt caccgaaagc accgcttctc cctatgccgg ccgctcccat     2100
gtacactggc tcaccggtac cgcatccacc gttatggtag gctgcgtaga ggggatctgc     2160
ggcatgcgtc ccaatgcgga cggtctggtg atctctccct ccattccctc ctcctgggac     2220
ggcttcacca tcgagaaaaa cttccgtggc aagcatctgt ccatccgggt agagaatcct     2280
agccacgttc agagcggcgt caagtccctg accctcaacg gcaaggagct gtccggcgac     2340
tttgttcccg cagctgagct gaaggatcag aacgaaatca ctgttgtact gggctaa        2397

<210>  6
<211>  798
<212>  PRT
<213>  Ruminococcus champanellensis

<220>
<221>  MISC_FEATURE
<222>  (1)..(798)
<223>  RchCdp1_wild type protein

<400>  6
Met Gln Tyr Gly Tyr Phe Asp Leu Ala Asn Lys Glu Tyr Val Ile Thr 
1               5                   10                  15      
Arg Pro Asp Thr Pro Ala Pro Trp Ala Asn Tyr Leu Gly Asp Pro Glu 
            20                  25                  30          
Tyr Gly Ala Met Ile Ser Asn Asn Ala Cys Gly Tyr Ser Phe Val Lys 
        35                  40                  45              
Ser Gly Ala Asn Gly Arg Ile Ser Arg Phe Arg Phe Asn Ser Asn Met 
    50                  55                  60                  
Ala Leu Pro Gly Arg Tyr Ile Tyr Ile Arg Asp Asn Asp Thr Ala Asp 
65                  70                  75                  80  
Tyr Trp Ser Ala Ser Trp Gln Pro Val Gly Lys Pro Leu Asp Gln Tyr 
                85                  90                  95      
Lys Ser Val Cys Arg His Gly Thr Ala Tyr Thr Ile Met Thr Ala Asp 
            100                 105                 110         
Tyr Ala Ser Val His Ser Glu Thr Thr Tyr Tyr Val Pro Tyr His Gln 
        115                 120                 125             
Thr Tyr Glu Val Trp Arg Thr Lys Ile Thr Asn Thr Ser Asp Lys Pro 
    130                 135                 140                 
Arg Lys Leu Ser Val Phe Gly Phe Val Glu Phe Thr Asn Asp Asn Asn 
145                 150                 155                 160 
Tyr Glu Gln Asp Gln Val Asn Leu Gln Tyr Thr Leu Phe Ile Thr Arg 
                165                 170                 175     
Thr Ser Phe Glu Glu Asn Arg Ile Ile Gln His Ile Asn Glu Asn Ser 
            180                 185                 190         
Gly Lys Asp Ala Ser Gly Ser Asn His Lys Glu Arg Phe Phe Gly Met 
        195                 200                 205             
Val Gly Ala Pro Val Ser Gly Trp Asn Gly Asn Leu Asp Ser Phe Ile 
    210                 215                 220                 
Gly Pro Tyr Arg Thr Tyr Ser Asn Pro Ile Ala Val Glu Gln Gly Lys 
225                 230                 235                 240 
Cys Asp Gly Ser Met Asn Tyr Asn Ser Asn Ala Cys Gly Ala Leu Gln 
                245                 250                 255     
Ser Asp Leu Glu Leu Thr Pro Gly Glu Thr Ala Glu Leu Ile Tyr Ile 
            260                 265                 270         
Leu Gly Gln Arg Asn Ser Ala Glu Ala Ala Thr Ile Leu Asp Thr Tyr 
        275                 280                 285             
Lys Thr Leu Gly Lys Val Asp Ala Glu Ile Ala Glu Leu Lys Asn Phe 
    290                 295                 300                 
Trp His Lys Glu Leu Ser Asn Phe Gln Val Asn Thr Pro Ser Pro Glu 
305                 310                 315                 320 
Phe Asn Asn Met Ile Asn Val Trp Asn Ala Tyr Gln Cys Phe Ile Thr 
                325                 330                 335     
Phe Ile Trp Ser Arg Ala Ala Ser Phe Val Tyr Cys Gly Leu Arg Asn 
            340                 345                 350         
Gly Tyr Gly Tyr Arg Asp Thr Val Gln Asp Ile Gln Gly Ile Ile His 
        355                 360                 365             
Leu Asp Pro Glu Met Ala Ala Asp Lys Ile Arg Phe Met Leu Ser Ala 
    370                 375                 380                 
Gln Val Asp Asn Gly Gly Gly Leu Pro Leu Val Lys Phe Asn His Asn 
385                 390                 395                 400 
Ala Gly His Glu Asn Thr Pro Asp Asp Pro Glu Tyr Val Lys Glu Thr 
                405                 410                 415     
Gly His Pro Ser Tyr Arg Ala Asp Asp Ala Leu Trp Leu Phe Pro Thr 
            420                 425                 430         
Ile Val Lys Tyr Ile Gly Glu Ser Gly Asn Lys Ala Phe Leu Asp Glu 
        435                 440                 445             
Val Ile Val Tyr Ala Asn Gly Gly Glu Ala Thr Val Tyr Asp His Leu 
    450                 455                 460                 
Lys Asn Ala Ile Arg Phe Ser Met Glu Arg Leu Gly Ala His Asp Met 
465                 470                 475                 480 
Pro Ala Gly Leu His Ala Asp Trp Asn Asp Cys Leu Arg Met Gly Ala 
                485                 490                 495     
Lys Gly Glu Ser Thr Phe Val Ala Phe Gln Leu Tyr Tyr Ala Met Arg 
            500                 505                 510         
Val Ile Arg Asp Met Ala Gln Gln Arg Gly Asp Ser Asp Tyr Val Ala 
        515                 520                 525             
Tyr Ile Asp Asp Ile Gln Ala Lys Leu Gly Ala Ser Leu Glu Lys Cys 
    530                 535                 540                 
Trp Asp Gly Asp Arg Phe Ile Arg Gly Ile Arg Glu Asp Gly Val Val 
545                 550                 555                 560 
Val Gly Ala Lys Lys Asp Pro Glu Ala Ser Met Trp Leu Asn Pro Gln 
                565                 570                 575     
Ser Trp Ala Val Ile Ser Gly Phe Ala Ser Lys Asp Gln Ala Glu Gln 
            580                 585                 590         
Ser Met Glu Ser Val His Arg Ile Leu Asn Thr Pro Tyr Gly Ile Lys 
        595                 600                 605             
Leu Leu Asp Pro Pro Tyr Arg Ala His Tyr Phe Asp Gly Ala Leu Met 
    610                 615                 620                 
His Ile Phe Asn Pro Asp Thr Lys Glu Asn Gly Gly Ile Phe Ser Gln 
625                 630                 635                 640 
Ser Gln Gly Trp Ala Ile Leu Ala Glu Ser Leu Leu Gly His Gly Asn 
                645                 650                 655     
Arg Ala Phe Glu Tyr Phe Met Glu Ser Ser Pro Ala Ala Met Asn Asp 
            660                 665                 670         
Arg Ala Glu Ile Arg Val Met Glu Pro Tyr Val His Gly Gln Phe Thr 
        675                 680                 685             
Glu Ser Thr Ala Ser Pro Tyr Ala Gly Arg Ser His Val His Trp Leu 
    690                 695                 700                 
Thr Gly Thr Ala Ser Thr Val Met Val Gly Cys Val Glu Gly Ile Cys 
705                 710                 715                 720 
Gly Met Arg Pro Asn Ala Asp Gly Leu Val Ile Ser Pro Ser Ile Pro 
                725                 730                 735     
Ser Ser Trp Asp Gly Phe Thr Ile Glu Lys Asn Phe Arg Gly Lys His 
            740                 745                 750         
Leu Ser Ile Arg Val Glu Asn Pro Ser His Val Gln Ser Gly Val Lys 
        755                 760                 765             
Ser Leu Thr Leu Asn Gly Lys Glu Leu Ser Gly Asp Phe Val Pro Ala 
    770                 775                 780                 
Ala Glu Leu Lys Asp Gln Asn Glu Ile Thr Val Val Leu Gly 
785                 790                 795             

<210>  7
<211>  2421
<212>  DNA
<213>  Artificial sequence

<220>
<223>  RchCdp1 with added sequences

<400>  7
atgcagtatg gctattttga tctggccaac aaagaatatg ttatcacccg tccggataca       60
ccggcaccgt gggcaaatta tctgggtgat ccggaatatg gtgcaatgat tagcaataat      120
gcatgcggct atagctttgt taaaagcggt gcaaatggtc gtattagccg ttttcgtttt      180
aatagcaata tggcactgcc tggtcgctat atctatattc gtgataatga taccgcagac      240
tattggagcg caagctggca gccggttggt aaaccgctgg atcagtataa aagcgtttgt      300
cgtcatggca ccgcatatac cattatgacc gcagattatg caagcgttca tagcgaaacc      360
acctattatg ttccgtatca tcagacctat gaagtgtggc gtaccaaaat taccaatacc      420
agcgataaac cgcgtaaact gagcgttttt ggttttgtgg aattcaccaa cgataacaac      480
tatgaacagg atcaggtgaa tctgcagtat accctgttta ttacccgtac cagctttgaa      540
gaaaaccgca ttattcagca catcaatgaa aacagcggta aagatgcaag cggcagcaat      600
cataaagaac gcttttttgg tatggttggt gcaccggtta gcggttggaa tggtaatctg      660
gatagcttta ttggtccgta tcgtacctat agcaatccga ttgcagttga acagggtaaa      720
tgtgatggta gcatgaacta taatagtaat gcatgtggtg cactgcagag cgatctggaa      780
ctgacaccgg gtgaaaccgc agaactgatt tatatcctgg gtcagcgtaa tagcgcagaa      840
gcagcaacca ttctggatac ctataaaacc ctgggtaaag tggatgcaga aattgccgaa      900
ctgaaaaact tttggcacaa agaactgagc aactttcagg ttaatacccc gagtccggaa      960
tttaacaata tgattaatgt gtggaacgcc tatcagtgct tcatcacctt tatttggagc     1020
cgtgcagcaa gctttgttta ttgtggtctg cgtaatggtt atggctatcg tgataccgtt     1080
caggatattc agggtattat tcatctggat cctgaaatgg cagccgataa aattcgtttt     1140
atgctgagcg cacaggttga taatggtggt ggtctgccgc tggtgaaatt taaccataat     1200
gcaggtcatg aaaacacacc ggatgatcct gagtatgtta aagaaaccgg tcatccgagc     1260
tatcgtgcag atgatgcact gtggctgttt ccgaccattg tgaaatatat cggtgaaagc     1320
ggtaacaaag cctttctgga tgaagttatt gtgtatgcaa atggcggtga agcaaccgtt     1380
tatgatcatc tgaaaaatgc cattcgcttt agcatggaac gtctgggtgc acatgatatg     1440
cctgcaggtc tgcatgccga ttggaatgat tgtctgcgta tgggtgcaaa aggtgaaagc     1500
acctttgttg catttcagct gtattatgcc atgcgtgtta ttcgcgatat ggcacagcag     1560
cgtggtgata gcgattatgt tgcatatatt gatgacatcc aggcaaaact gggtgcaagc     1620
ctggaaaaat gttgggatgg tgatcgtttt attcgcggta ttcgtgaaga tggtgttgtt     1680
gttggtgcaa aaaaagatcc ggaagcaagc atgtggctga atccgcagag ctgggcagtt     1740
attagcggtt ttgcaagcaa agatcaggca gaacagagca tggaaagcgt gcatcgtatt     1800
ctgaataccc cgtatggtat taaactgctg gacccaccgt atcgtgcaca ttattttgat     1860
ggtgccctga tgcatatctt taacccggat accaaagaaa acggtggtat ttttagccag     1920
agccagggtt gggcaattct ggcagaaagc ctgctgggtc atggtaatcg tgcatttgaa     1980
tactttatgg aaagcagtcc ggcagccatg aatgatcgtg ccgaaattcg tgtgatggaa     2040
ccgtatgttc atggtcagtt taccgaaagc accgcaagcc cgtatgcagg tcgtagccat     2100
gttcattggc tgaccggtac agcaagcacc gttatggtgg gttgtgttga aggtatttgt     2160
ggtatgcgtc cgaatgcaga tggtctggtt attagcccga gcattccgag cagctgggat     2220
ggttttacca ttgaaaaaaa ctttcgcggt aaacatctga gcattcgtgt tgaaaatccg     2280
agtcatgttc agagcggtgt gaaaagcctg accctgaatg gtaaagaact gtcaggtgat     2340
tttgttccgg cagcggaact gaaagatcag aatgaaatta ccgttgtgct gggcctcgag     2400
caccaccacc accaccactg a                                               2421

<210>  8
<211>  806
<212>  PRT
<213>  Artificial sequence

<220>
<223>  RchCdp1 with added sequences_protein

<400>  8
Met Gln Tyr Gly Tyr Phe Asp Leu Ala Asn Lys Glu Tyr Val Ile Thr 
1               5                   10                  15      
Arg Pro Asp Thr Pro Ala Pro Trp Ala Asn Tyr Leu Gly Asp Pro Glu 
            20                  25                  30          
Tyr Gly Ala Met Ile Ser Asn Asn Ala Cys Gly Tyr Ser Phe Val Lys 
        35                  40                  45              
Ser Gly Ala Asn Gly Arg Ile Ser Arg Phe Arg Phe Asn Ser Asn Met 
    50                  55                  60                  
Ala Leu Pro Gly Arg Tyr Ile Tyr Ile Arg Asp Asn Asp Thr Ala Asp 
65                  70                  75                  80  
Tyr Trp Ser Ala Ser Trp Gln Pro Val Gly Lys Pro Leu Asp Gln Tyr 
                85                  90                  95      
Lys Ser Val Cys Arg His Gly Thr Ala Tyr Thr Ile Met Thr Ala Asp 
            100                 105                 110         
Tyr Ala Ser Val His Ser Glu Thr Thr Tyr Tyr Val Pro Tyr His Gln 
        115                 120                 125             
Thr Tyr Glu Val Trp Arg Thr Lys Ile Thr Asn Thr Ser Asp Lys Pro 
    130                 135                 140                 
Arg Lys Leu Ser Val Phe Gly Phe Val Glu Phe Thr Asn Asp Asn Asn 
145                 150                 155                 160 
Tyr Glu Gln Asp Gln Val Asn Leu Gln Tyr Thr Leu Phe Ile Thr Arg 
                165                 170                 175     
Thr Ser Phe Glu Glu Asn Arg Ile Ile Gln His Ile Asn Glu Asn Ser 
            180                 185                 190         
Gly Lys Asp Ala Ser Gly Ser Asn His Lys Glu Arg Phe Phe Gly Met 
        195                 200                 205             
Val Gly Ala Pro Val Ser Gly Trp Asn Gly Asn Leu Asp Ser Phe Ile 
    210                 215                 220                 
Gly Pro Tyr Arg Thr Tyr Ser Asn Pro Ile Ala Val Glu Gln Gly Lys 
225                 230                 235                 240 
Cys Asp Gly Ser Met Asn Tyr Asn Ser Asn Ala Cys Gly Ala Leu Gln 
                245                 250                 255     
Ser Asp Leu Glu Leu Thr Pro Gly Glu Thr Ala Glu Leu Ile Tyr Ile 
            260                 265                 270         
Leu Gly Gln Arg Asn Ser Ala Glu Ala Ala Thr Ile Leu Asp Thr Tyr 
        275                 280                 285             
Lys Thr Leu Gly Lys Val Asp Ala Glu Ile Ala Glu Leu Lys Asn Phe 
    290                 295                 300                 
Trp His Lys Glu Leu Ser Asn Phe Gln Val Asn Thr Pro Ser Pro Glu 
305                 310                 315                 320 
Phe Asn Asn Met Ile Asn Val Trp Asn Ala Tyr Gln Cys Phe Ile Thr 
                325                 330                 335     
Phe Ile Trp Ser Arg Ala Ala Ser Phe Val Tyr Cys Gly Leu Arg Asn 
            340                 345                 350         
Gly Tyr Gly Tyr Arg Asp Thr Val Gln Asp Ile Gln Gly Ile Ile His 
        355                 360                 365             
Leu Asp Pro Glu Met Ala Ala Asp Lys Ile Arg Phe Met Leu Ser Ala 
    370                 375                 380                 
Gln Val Asp Asn Gly Gly Gly Leu Pro Leu Val Lys Phe Asn His Asn 
385                 390                 395                 400 
Ala Gly His Glu Asn Thr Pro Asp Asp Pro Glu Tyr Val Lys Glu Thr 
                405                 410                 415     
Gly His Pro Ser Tyr Arg Ala Asp Asp Ala Leu Trp Leu Phe Pro Thr 
            420                 425                 430         
Ile Val Lys Tyr Ile Gly Glu Ser Gly Asn Lys Ala Phe Leu Asp Glu 
        435                 440                 445             
Val Ile Val Tyr Ala Asn Gly Gly Glu Ala Thr Val Tyr Asp His Leu 
    450                 455                 460                 
Lys Asn Ala Ile Arg Phe Ser Met Glu Arg Leu Gly Ala His Asp Met 
465                 470                 475                 480 
Pro Ala Gly Leu His Ala Asp Trp Asn Asp Cys Leu Arg Met Gly Ala 
                485                 490                 495     
Lys Gly Glu Ser Thr Phe Val Ala Phe Gln Leu Tyr Tyr Ala Met Arg 
            500                 505                 510         
Val Ile Arg Asp Met Ala Gln Gln Arg Gly Asp Ser Asp Tyr Val Ala 
        515                 520                 525             
Tyr Ile Asp Asp Ile Gln Ala Lys Leu Gly Ala Ser Leu Glu Lys Cys 
    530                 535                 540                 
Trp Asp Gly Asp Arg Phe Ile Arg Gly Ile Arg Glu Asp Gly Val Val 
545                 550                 555                 560 
Val Gly Ala Lys Lys Asp Pro Glu Ala Ser Met Trp Leu Asn Pro Gln 
                565                 570                 575     
Ser Trp Ala Val Ile Ser Gly Phe Ala Ser Lys Asp Gln Ala Glu Gln 
            580                 585                 590         
Ser Met Glu Ser Val His Arg Ile Leu Asn Thr Pro Tyr Gly Ile Lys 
        595                 600                 605             
Leu Leu Asp Pro Pro Tyr Arg Ala His Tyr Phe Asp Gly Ala Leu Met 
    610                 615                 620                 
His Ile Phe Asn Pro Asp Thr Lys Glu Asn Gly Gly Ile Phe Ser Gln 
625                 630                 635                 640 
Ser Gln Gly Trp Ala Ile Leu Ala Glu Ser Leu Leu Gly His Gly Asn 
                645                 650                 655     
Arg Ala Phe Glu Tyr Phe Met Glu Ser Ser Pro Ala Ala Met Asn Asp 
            660                 665                 670         
Arg Ala Glu Ile Arg Val Met Glu Pro Tyr Val His Gly Gln Phe Thr 
        675                 680                 685             
Glu Ser Thr Ala Ser Pro Tyr Ala Gly Arg Ser His Val His Trp Leu 
    690                 695                 700                 
Thr Gly Thr Ala Ser Thr Val Met Val Gly Cys Val Glu Gly Ile Cys 
705                 710                 715                 720 
Gly Met Arg Pro Asn Ala Asp Gly Leu Val Ile Ser Pro Ser Ile Pro 
                725                 730                 735     
Ser Ser Trp Asp Gly Phe Thr Ile Glu Lys Asn Phe Arg Gly Lys His 
            740                 745                 750         
Leu Ser Ile Arg Val Glu Asn Pro Ser His Val Gln Ser Gly Val Lys 
        755                 760                 765             
Ser Leu Thr Leu Asn Gly Lys Glu Leu Ser Gly Asp Phe Val Pro Ala 
    770                 775                 780                 
Ala Glu Leu Lys Asp Gln Asn Glu Ile Thr Val Val Leu Gly Leu Glu 
785                 790                 795                 800 
His His His His His His 
                805     

