                         SEQUENCE LISTING

<110>  E. I. du Pont de Nemours and Company
       Yang, Jianjun
       Rice, Barbara
       Qi, Min
       Chen, Zhongqiang
 
<120>  ARABINOSE ISOMERASES FOR YEAST

<130>  CL6357

<150>  US 62/319,945
<151>  2016-04-08

<160>  71    

<170>  PatentIn version 3.5

<210>  1
<211>  495
<212>  PRT
<213>  unknown

<220>
<223>  sequence from Human Microbiome dataset

<400>  1

Met Asn Leu Lys Pro His Thr Phe Trp Phe Val Thr Gly Ser Gln His 
1               5                   10                  15      
Leu Tyr Gly Pro Glu Thr Leu Glu Gln Val Ala Glu His Ser Arg Ile 
            20                  25                  30          
Val Ala Thr Glu Phe Asp Lys Asp Pro Val Phe Thr Tyr Pro Ile Val 
        35                  40                  45              
Phe Lys Pro Ile Val Thr Thr Pro Asp Glu Ile Tyr Lys Leu Ile Leu 
    50                  55                  60                  
Glu Ala Asn Asn Asp Glu Ser Cys Ala Gly Ile Met Thr Trp Met His 
65                  70                  75                  80  
Thr Phe Ser Pro Ala Lys Met Trp Ile Ala Gly Leu Ser Gln Leu Gln 
                85                  90                  95      
Lys Pro Leu Leu His Phe His Thr Gln Phe Asn Arg Asp Ile Pro Trp 
            100                 105                 110         
Glu Thr Ile Asp Met Asp Phe Met Asn Leu Asn Gln Ser Ala His Gly 
        115                 120                 125             
Asp Arg Glu Tyr Gly His Ile Gly Ala Arg Leu Gly Ile Ala Arg Lys 
    130                 135                 140                 
Val Val Val Gly His Trp Glu Asp Gly Glu Val Arg Gly Ser Ile Ala 
145                 150                 155                 160 
Gly Trp Met Arg Thr Ala Ala Ala Tyr Ala Glu Ser Arg Arg Leu Lys 
                165                 170                 175     
Val Ala Arg Phe Gly Asp Asn Met Arg Gln Val Ala Val Thr Glu Gly 
            180                 185                 190         
Asp Lys Val Glu Ala Gln Ile Lys Leu Gly Trp Ser Val Asn Gly Tyr 
        195                 200                 205             
Gly Ile Gly Asp Leu Val Gln Ser Met Asn Glu Val Gly Asp Glu Glu 
    210                 215                 220                 
Val Lys Ala Leu Leu Asn Glu Tyr Ala Glu Ser Tyr Ser Ile Thr Lys 
225                 230                 235                 240 
Glu Gly Leu Ser Asp Gly Pro Val Arg Asp Ser Ile Ala Tyr Gln Ala 
                245                 250                 255     
Arg Ile Glu Ile Ala Leu Arg Arg Phe Leu Glu Glu Gly Gly Phe Gly 
            260                 265                 270         
Ala Phe Thr Thr Thr Phe Glu Asp Leu His Gly Met Lys Gln Leu Pro 
        275                 280                 285             
Gly Leu Ala Val Gln Arg Leu Met Glu Ser Gly Tyr Gly Phe Gly Gly 
    290                 295                 300                 
Glu Gly Asp Trp Lys Thr Ala Ala Leu Thr Arg Val Leu Lys Val Leu 
305                 310                 315                 320 
Ala Asp Asn Lys Ser Thr Ser Phe Met Glu Asp Tyr Thr Tyr His Phe 
                325                 330                 335     
Glu Pro Gly Asn His Met Ile Leu Gly Ser His Met Leu Glu Val Cys 
            340                 345                 350         
Pro Thr Ile Ala Leu Asp Lys Pro Thr Leu Glu Val His Pro Leu Gly 
        355                 360                 365             
Ile Gly Gly Lys Gly Asp Pro Ala Arg Leu Val Phe Asn Gly Gln Asp 
    370                 375                 380                 
Gly Pro Ala Val Asn Ala Ser Leu Ile Asp Leu Gly His Arg Phe Arg 
385                 390                 395                 400 
Leu Leu Val Asn Val Val Asp Gly Val Lys Val Glu Gln Pro Met Pro 
                405                 410                 415     
Lys Leu Pro Val Ala Arg Val Leu Trp Lys Pro Gln Pro Ser Leu Arg 
            420                 425                 430         
Glu Ser Ala Glu Ala Trp Ile Leu Ala Gly Gly Ala His His Thr Val 
        435                 440                 445             
Leu Ser Tyr Ala Met Thr Ala Glu His Leu Ser Asp Trp Ala Glu Met 
    450                 455                 460                 
Thr Gly Ile Glu Ala Val Val Ile Asp Lys Asp Thr Thr Ile Pro Arg 
465                 470                 475                 480 
Phe Lys Asn Glu Leu Arg Trp Ser Glu Ala Ala Tyr Arg Leu Arg 
                485                 490                 495 

<210>  2
<211>  495
<212>  PRT
<213>  unknown

<220>
<223>  sequence from Human Microbiome dataset

<400>  2
Met Lys Leu Lys Pro His Ser Phe Trp Phe Val Thr Gly Ser Gln His 
1               5                   10                  15      
Leu Tyr Gly Pro Glu Thr Leu Glu Glu Val Ala Gly His Ser Arg Ile 
            20                  25                  30          
Ile Ala Glu Gln Leu Asp Lys Asp Pro Ala Ile Gly Phe Pro Val Val 
        35                  40                  45              
Phe Lys Pro Ile Val Thr Thr Pro Asp Glu Ile Tyr Lys Leu Ile Leu 
    50                  55                  60                  
Ala Ala Asn Gly Asp Glu Thr Cys Ala Gly Ile Ile Thr Trp Met His 
65                  70                  75                  80  
Thr Phe Ser Pro Ala Lys Met Trp Ile Ala Gly Leu Ser Gln Leu Gln 
                85                  90                  95      
Lys Pro Leu Leu His Phe His Thr Gln Phe Asn Arg Asp Ile Pro Trp 
            100                 105                 110         
Glu Thr Ile Asp Met Asp Phe Met Asn Leu Asn Gln Ser Ala His Gly 
        115                 120                 125             
Asp Arg Glu Tyr Gly His Ile Gly Ala Arg Leu Gly Ile Asn Arg Lys 
    130                 135                 140                 
Ile Val Val Gly His Trp Glu Asp Glu Glu Val Arg Ala Ser Leu Ala 
145                 150                 155                 160 
Gly Trp Met Arg Thr Ala Val Ala Tyr Ala Glu Ser Arg Gln Leu Lys 
                165                 170                 175     
Val Ala Arg Phe Gly Asp Asn Met Arg Glu Val Ala Val Thr Glu Gly 
            180                 185                 190         
Asp Lys Val Glu Ala Gln Ile Lys Phe Gly Trp Ser Val Asn Gly Tyr 
        195                 200                 205             
Gly Val Gly Asp Leu Val Gln Val Leu Asn Glu Val Thr Asp Ala Glu 
    210                 215                 220                 
Ala Glu Ala Leu Leu Lys Glu Tyr Ala Glu Gln Tyr Thr Ile Thr Gln 
225                 230                 235                 240 
Ala Gly Leu Ser Ser Gly Pro Ile Arg Asp Ser Ile Ala Tyr Gln Ala 
                245                 250                 255     
Lys Leu Glu Ile Ala Met Lys Arg Phe Leu Glu Gln Gly Gly Phe Gly 
            260                 265                 270         
Ala Phe Thr Thr Thr Phe Glu Asp Leu His Gly Leu Lys Gln Leu Pro 
        275                 280                 285             
Gly Leu Ala Val Gln Arg Leu Met Glu Ala Gly Tyr Gly Phe Gly Gly 
    290                 295                 300                 
Glu Gly Asp Trp Lys Thr Ala Ala Leu Thr Arg Val Leu Lys Val Leu 
305                 310                 315                 320 
Ala Asn Asn Lys Ser Thr Ser Phe Met Glu Asp Tyr Thr Tyr His Phe 
                325                 330                 335     
Glu Pro Gly Asn His Met Ile Leu Gly Ala His Met Leu Glu Val Cys 
            340                 345                 350         
Pro Thr Ile Ala Ala Thr Lys Pro Thr Ile Glu Val His Pro Leu Gly 
        355                 360                 365             
Ile Gly Gly Lys Ala Asp Pro Ala Arg Met Val Phe Asp Gly Gln Ala 
    370                 375                 380                 
Gly Pro Ala Val Asn Ala Ser Leu Val Asp Leu Gly His Arg Phe Arg 
385                 390                 395                 400 
Leu Leu Val Asn Val Val Asp Gly Val Lys Val Glu Lys Pro Met Pro 
                405                 410                 415     
Lys Leu Pro Val Ala Arg Val Leu Trp Lys Pro Gln Pro Ser Leu Arg 
            420                 425                 430         
Glu Ser Ala Glu Ala Trp Ile Leu Ala Gly Gly Ala His His Thr Val 
        435                 440                 445             
Leu Ser Tyr Ala Ile Thr Ala Glu Asn Leu Ser Asp Trp Ala Glu Met 
    450                 455                 460                 
Val Gly Ile Glu Ala Val Ile Ile Asp Lys Asp Thr Ser Val Pro Arg 
465                 470                 475                 480 
Phe Lys Asn Glu Leu Arg Trp Ser Asp Ala Ala Tyr Arg Leu Arg 
                485                 490                 495 

<210>  3
<211>  495
<212>  PRT
<213>  unknown

<220>
<223>  sequence from Human Microbiome dataset

<400>  3
Met Gln Arg Thr Pro Tyr Glu Phe Trp Phe Val Thr Gly Ser Gln His 
1               5                   10                  15      
Leu Tyr Gly Ser Glu Ala Leu Ala Glu Val Ser Ser His Ser Arg Gln 
            20                  25                  30          
Ile Thr Gln Ala Phe Asn Glu Ala Asp Ser Ile Ser Phe Pro Ile Val 
        35                  40                  45              
Val Lys Pro Val Val Lys Thr Pro Glu Glu Ile Leu Gln Leu Cys Met 
    50                  55                  60                  
Glu Ala Asn Ser Asp Glu Asn Cys Ala Gly Leu Ile Thr Trp Met His 
65                  70                  75                  80  
Thr Phe Ser Pro Gly Lys Met Trp Ile Gly Gly Leu Ser Gln Leu His 
                85                  90                  95      
Lys Pro Leu Leu His Phe His Thr Gln Phe His Arg Glu Ile Pro Trp 
            100                 105                 110         
Asp Arg Ile Asp Met Asp Phe Met Asn Leu His Gln Ser Ala His Gly 
        115                 120                 125             
Asp Arg Glu Phe Gly Phe Ile Ala Thr Arg Leu Gly Ile Leu Arg Lys 
    130                 135                 140                 
Glu Val Val Gly His Trp Arg Asp Glu Ala Val Gln Lys Arg Leu Ser 
145                 150                 155                 160 
Asp Trp Met Arg Thr Ala Ile Ala Cys Leu Glu Gly Lys Lys Leu Lys 
                165                 170                 175     
Val Ala Arg Phe Gly Asp Asn Met Arg Arg Val Ala Val Thr Glu Gly 
            180                 185                 190         
Asp Lys Val Glu Ala Gln Ile Gln Phe Gly Trp Ser Ile Asn Gly Tyr 
        195                 200                 205             
Gly Val Gly Asp Leu Val Gln Arg Ile Thr Asp Ile Ser Asp Thr Ala 
    210                 215                 220                 
Val His Gln Leu Phe Arg Glu Tyr Gln Glu Arg Tyr Asp Phe Pro Pro 
225                 230                 235                 240 
Glu Ala Arg Glu Ala Gly Pro Ile Arg Asp Ser Ile Leu Glu Gln Ala 
                245                 250                 255     
Arg Ile Glu Leu Gly Leu Lys Leu Phe Leu Arg Glu Gly Gly Tyr Ser 
            260                 265                 270         
Ala Phe Thr Thr Thr Phe Glu Asp Leu His Gly Leu Lys Gln Leu Pro 
        275                 280                 285             
Gly Leu Ala Val Gln Arg Leu Met Ser Glu Gly Tyr Gly Phe Gly Ala 
    290                 295                 300                 
Glu Gly Asp Trp Arg Thr Ala Gly Leu Leu Arg Met Met Lys Ile Met 
305                 310                 315                 320 
Ala Asp Asn Glu Gly Thr Ser Phe Met Glu Asp Tyr Thr Tyr His Leu 
                325                 330                 335     
Glu Pro Gly Asn Glu Met Ile Leu Gly Ala His Met Leu Glu Val Cys 
            340                 345                 350         
Pro Thr Ile Ala Ala Gln Arg Pro Gly Ile Arg Val His Pro Leu Ser 
        355                 360                 365             
Ile Gly Gly Lys Ala Asp Pro Ala Arg Leu Val Phe Asp Gly Arg Pro 
    370                 375                 380                 
Gly Pro Ala Leu Asn Val Ser Leu Ile Asp Leu Gly Asn Arg Phe Arg 
385                 390                 395                 400 
Leu Leu Ile Asn Lys Val Asp Ala Val His Pro Lys Ser Ala Met Pro 
                405                 410                 415     
His Leu Pro Val Ala Arg Val Leu Trp Lys Pro Arg Pro Ser Leu His 
            420                 425                 430         
Asp Ser Ala Glu Ala Trp Met Tyr Ala Gly Gly Ala His His Thr Val 
        435                 440                 445             
Phe Ser Tyr His Val Thr Thr Glu Gln Leu Leu Asp Trp Ala Glu Trp 
    450                 455                 460                 
Val Asp Met Glu Ala Leu Val Ile Asp Glu Gln Thr Ser Leu Ser Ser 
465                 470                 475                 480 
Phe Arg Arg Gln Leu Lys Trp Asn Asp Ala Tyr Tyr Arg Ile Arg 
                485                 490                 495 

<210>  4
<211>  499
<212>  PRT
<213>  unknown

<220>
<223>  sequence from Human Microbiome dataset

<400>  4
Met Leu Lys Thr Lys Asn Tyr Gln Phe Trp Phe Cys Thr Gly Ser Gln 
1               5                   10                  15      
Asp Leu Tyr Gly Asp Glu Cys Leu Ala His Val Ala Glu His Ala Lys 
            20                  25                  30          
Lys Ile Val Glu Ala Leu Asn Ala Ser Gly Asn Leu Pro Tyr Glu Val 
        35                  40                  45              
Val Trp Lys Pro Thr Leu Ile Thr Asn Glu Leu Ile Arg Arg Thr Phe 
    50                  55                  60                  
Asn Glu Ala Asn Thr Asp Glu Asn Cys Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Ser Trp Ile Leu Gly Leu Gln Glu Phe 
                85                  90                  95      
Arg Lys Pro Leu Leu His Leu His Thr Gln Phe Asn Arg Glu Ile Pro 
            100                 105                 110         
Tyr Asp Thr Ile Asp Met Asp Phe Met Asn Glu Asn Gln Ser Ala His 
        115                 120                 125             
Gly Asp Arg Glu Phe Gly His Ile Phe Ser Arg Leu His Met Asn Arg 
    130                 135                 140                 
Lys Val Val Val Gly Tyr Trp Ala Asp Glu Asp Val Gln Lys Gln Ile 
145                 150                 155                 160 
Gly Ser Trp Met Arg Thr Ala Val Gly Val Val Glu Ser Ser His Ile 
                165                 170                 175     
Arg Val Met Arg Ile Ala Asp Asn Met Arg Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Gln Ile Lys Phe Gly Trp Glu Val Asp Ala 
        195                 200                 205             
Tyr Pro Val Asn Glu Ala Val Glu Ala Val Asn Ala Val Ser Gln Ala 
    210                 215                 220                 
Asp Ile Asp Thr Leu Val Glu Glu Tyr Tyr Asp Lys Tyr Glu Ile Leu 
225                 230                 235                 240 
Leu Glu Gly Arg Asp Glu Lys Glu Phe Arg Arg His Val Ala Val Gln 
                245                 250                 255     
Ala Gly Ile Glu Ile Gly Leu Glu Arg Phe Leu Glu Glu Asn Asn Tyr 
            260                 265                 270         
Gln Ala Ile Val Thr His Phe Gly Asp Leu Gly Gly Phe Lys Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Met Gln Arg Leu Met Glu Lys Gly Tyr Gly Phe Gly 
    290                 295                 300                 
Ala Glu Gly Asp Trp Lys Thr Ala Ala Met Val Arg Leu Met Lys Ile 
305                 310                 315                 320 
Met Thr Gly Gly Met Lys Asp Ala Lys Gly Thr Ser Phe Met Glu Asp 
                325                 330                 335     
Tyr Thr Tyr Asn Leu Val Pro Gly Lys Glu Gly Ile Leu Glu Ala His 
            340                 345                 350         
Met Leu Glu Val Cys Pro Thr Ile Ala Asp Gly Lys Ile Ser Ile Lys 
        355                 360                 365             
Glu Gln Pro Leu Ser Met Gly Asp Arg Glu Asp Pro Ala Arg Leu Val 
    370                 375                 380                 
Phe Thr Ala Lys Glu Gly Pro Ala Ile Ala Ala Ser Leu Ile Asp Leu 
385                 390                 395                 400 
Gly Asp Arg Phe Arg Leu Leu Ile Asn Glu Val Glu Cys Lys Lys Thr 
                405                 410                 415     
Glu Lys Pro Met Pro Lys Leu Pro Val Ala Thr Ala Phe Trp Thr Pro 
            420                 425                 430         
Lys Pro Asn Leu Lys Ile Gly Ala Gln Ser Trp Ile Leu Ala Gly Gly 
        435                 440                 445             
Ala His His Thr Ala Phe Ser Tyr Asp Leu Ser Ala Glu Gln Met Gly 
    450                 455                 460                 
Asp Trp Ala Glu Ala Met Gly Ile Glu Ala Val Tyr Ile Asp Ala Asp 
465                 470                 475                 480 
Thr Thr Ile Arg Gln Leu Lys Asn Glu Leu Arg Trp Asn Glu Leu Ala 
                485                 490                 495     
Tyr Arg Arg 
            

<210>  5
<211>  498
<212>  PRT
<213>  unknown

<220>
<223>  sequence from Human Microbiome dataset

<400>  5
Met Lys Thr Gly Arg Asp Tyr Lys Phe Trp Phe Cys Thr Gly Ser Gln 
1               5                   10                  15      
Asp Leu Tyr Gly Glu Glu Cys Leu Arg Lys Val Ala Glu His Ser Ala 
            20                  25                  30          
Lys Ile Val Glu Gly Leu Asn Ala Ser Gly Arg Leu Pro Phe Glu Val 
        35                  40                  45              
Val Leu Lys Pro Thr Leu Ile Asp Pro Ala Thr Ile Arg Arg Thr Leu 
    50                  55                  60                  
Asn Glu Ala Asn Glu Asp Gly Glu Cys Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Met Trp Ile Leu Gly Leu Lys Glu Tyr 
                85                  90                  95      
Arg Lys Pro Leu Cys His Leu His Thr Gln Phe Asn Glu Glu Ile Pro 
            100                 105                 110         
Tyr Asp Thr Ile Asp Met Asp Phe Met Asn Glu Asn Gln Ser Ala His 
        115                 120                 125             
Gly Asp Arg Glu Phe Gly His Met Val Ser Arg Met Gly Met Glu Arg 
    130                 135                 140                 
Lys Ile Ile Val Gly His Trp Ala Asn Ala Glu Val Gln Glu Lys Ile 
145                 150                 155                 160 
Gly Ser Trp Met Arg Thr Ala Ile Gly Ile Met Glu Ser Ser His Ile 
                165                 170                 175     
Arg Val Cys Arg Ile Gly Asp Asn Met Asn Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Glu Val Lys Phe Gly Trp Glu Ile Asp His 
        195                 200                 205             
Tyr Cys Val Asn Asp Ala Val Glu Tyr Val Asn Ala Val Ser Glu Gly 
    210                 215                 220                 
Asp Val Asn Ala Leu Val Glu Glu Tyr Tyr Ser Lys Tyr Gln Ile Leu 
225                 230                 235                 240 
Leu Glu Gly Arg Asp Pro Glu Glu Phe Arg Ala His Val Ala Ala Gln 
                245                 250                 255     
Ala Lys Ile Glu Ile Gly Leu Glu Lys Phe Leu Glu Asp Gly Asp Tyr 
            260                 265                 270         
His Ala Ile Val Thr His Phe Gly Met Leu Gly Gly Leu Gln Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Ile Gln Arg Leu Met Glu Lys Gly Tyr Gly Phe Gly 
    290                 295                 300                 
Gly Glu Gly Asp Trp Lys Thr Ala Ala Met Val Arg Leu Met Lys Ile 
305                 310                 315                 320 
Met Ala Ala Gly Val Pro Gly Ala Lys Gly Thr Ser Phe Met Glu Asp 
                325                 330                 335     
Tyr Thr Tyr Asn Leu Val Pro Gly Lys Glu Gly Ile Leu Gln Ala His 
            340                 345                 350         
Met Leu Glu Val Cys Pro Ser Ile Ala Glu Gly Pro Ile Ser Ile Lys 
        355                 360                 365             
Val Gln Pro Leu Ser Met Gly Asn Arg Glu Asp Pro Ala Arg Leu Val 
    370                 375                 380                 
Phe Thr Ser Lys Thr Gly Pro Ala Val Ala Thr Ser Leu Val Asp Leu 
385                 390                 395                 400 
Gly Asn Arg Phe Arg Leu Ile Ile Asn Ala Val Asp Cys Lys Lys Cys 
                405                 410                 415     
Glu Lys Glu Met Pro Lys Leu Pro Val Ala Thr Ala Phe Trp Thr Pro 
            420                 425                 430         
Gln Pro Asp Leu Ala Thr Gly Ala Gln Ala Trp Ile Leu Ala Gly Gly 
        435                 440                 445             
Ala His His Thr Ala Phe Ser Tyr Asp Leu Thr Val Asp Gln Met Val 
    450                 455                 460                 
Asp Trp Ala Ala Ala Met Gly Ile Glu Ser Val Val Ile Asp Lys Asp 
465                 470                 475                 480 
Thr Thr Ile Arg Asn Phe Lys Asn Glu Leu Arg Trp Asn Ser Ile Tyr 
                485                 490                 495     
Tyr Arg 
        

<210>  6
<211>  498
<212>  PRT
<213>  unknown

<220>
<223>  sequence from cow rumen metagenome dataset

<400>  6
Met Ile Gln Thr Lys Ala Tyr Lys Phe Trp Phe Cys Thr Gly Ser Gln 
1               5                   10                  15      
Asp Leu Tyr Gly Asp Glu Val Leu Arg His Val Ala Asp His Ser Lys 
            20                  25                  30          
Glu Ile Val Glu Glu Leu Asn Lys Ser Gly Ile Leu Pro Tyr Glu Val 
        35                  40                  45              
Val Trp Lys Pro Val Leu Ile Thr Asn Gln Leu Ile Arg Gln Thr Phe 
    50                  55                  60                  
Asn Glu Ala Asn Ala Asp Asp Ser Cys Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Ser Trp Ile Leu Gly Leu Gln Glu Phe 
                85                  90                  95      
Arg Lys Pro Leu Leu His Leu His Thr Gln Tyr Asn Glu Glu Ile Pro 
            100                 105                 110         
Tyr Asp Thr Ile Asp Met Asp Phe Met Asn Glu Asn Gln Ala Ala His 
        115                 120                 125             
Gly Asp Arg Glu Tyr Gly His Ile Val Ser Arg Met Gly Ile Glu Arg 
    130                 135                 140                 
Lys Val Ile Ala Gly Tyr Trp Lys Asp Asn Glu Val Arg Ser Arg Ile 
145                 150                 155                 160 
Ala Ser Trp Met Arg Thr Ala Val Gly Val Met Glu Ser Ser His Ile 
                165                 170                 175     
Arg Val Met Arg Val Ala Asp Asn Met Arg Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Gln Ile Lys Phe Gly Trp Glu Val Asp Thr 
        195                 200                 205             
Tyr Pro Val Asn Glu Ile Ala Asp Ser Val Ala Thr Val Ser Ala Ser 
    210                 215                 220                 
Asp Val Asn Ala Leu Leu Asp Glu Tyr Tyr Asp Lys Tyr Glu Ile Ile 
225                 230                 235                 240 
Leu Asp Gly Arg Asp Pro Asp Glu Phe Lys Lys His Val Ala Val Gln 
                245                 250                 255     
Ala Gln Ile Glu Leu Gly Phe Glu Arg Phe Leu Glu Glu Lys Asn Tyr 
            260                 265                 270         
Gln Ala Ile Val Thr His Phe Gly Asp Leu Gly Ala Leu Gly Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Ile Gln Arg Leu Met Glu Lys Gly Tyr Gly Phe Gly 
    290                 295                 300                 
Ala Glu Gly Asp Trp Lys Val Ala Ala Met Val Arg Leu Met Lys Ile 
305                 310                 315                 320 
Met Thr Ser Gly Met Lys Asp Ala Lys Gly Thr Ser Met Leu Glu Asp 
                325                 330                 335     
Tyr Thr Tyr Asn Leu Val Arg Gly Lys Glu Gly Ile Leu Glu Ala His 
            340                 345                 350         
Met Leu Glu Ile Cys Pro Thr Ile Ala Asp Gly Pro Ile Ser Ile Arg 
        355                 360                 365             
Val Lys Pro Leu Ser Met Gly Asp Arg Glu Asp Pro Ala Arg Leu Val 
    370                 375                 380                 
Phe Thr Ser Lys Glu Gly Lys Gly Val Ala Thr Ser Leu Ile Asp Leu 
385                 390                 395                 400 
Gly Asn Arg Phe Arg Leu Ile Ile Asn Glu Val Glu Cys Lys Lys Thr 
                405                 410                 415     
Glu Lys Pro Met Pro Asn Leu Pro Val Ala Thr Ala Tyr Trp Thr Pro 
            420                 425                 430         
Tyr Pro Asp Leu Tyr Thr Gly Ala Glu Ala Trp Ile Leu Ala Gly Gly 
        435                 440                 445             
Ala His His Thr Ala Phe Ser Tyr Asp Leu Thr Ser Gly Gln Met Ala 
    450                 455                 460                 
Asp Trp Ala Glu Met Met Gly Ile Glu Ala Val Ile Ile Asp Lys Asn 
465                 470                 475                 480 
Thr Thr Ile Pro Ala Phe Lys Lys Glu Leu Lys Leu Gly Asp Val Phe 
                485                 490                 495     
Tyr Arg 
        

<210>  7
<211>  477
<212>  PRT
<213>  unknown

<220>
<223>  sequence from cow rumen metagenome dataset

<400>  7
Met Lys Phe Trp Phe Val Thr Gly Ser Gln Phe Leu Tyr Gly Glu Glu 
1               5                   10                  15      
Thr Leu Arg Gln Val Glu Glu Asp Ser Lys Lys Ile Val Asp Gly Leu 
            20                  25                  30          
Arg Leu Pro Phe Pro Val Glu Tyr Lys Leu Thr Val Lys Thr Glu Ser 
        35                  40                  45              
Glu Ile Glu Arg Ile Val Lys Glu Ala Asn Tyr Asp Asp Glu Cys Ala 
    50                  55                  60                  
Gly Ile Ile Thr Phe Cys His Thr Phe Ser Pro Ser Lys Met Trp Ile 
65                  70                  75                  80  
Asn Gly Leu Ala Leu Leu Gln Lys Pro Trp Leu His Phe His Thr Gln 
                85                  90                  95      
Phe Asn Glu Thr Ile Pro Asn Glu Ala Ile Asp Met Asp Tyr Met Asn 
            100                 105                 110         
Leu His Gln Ser Ala His Gly Asp Arg Glu His Gly Phe Ile Gly Ala 
        115                 120                 125             
Arg Leu Arg Val Pro Arg Ala Val Val Ala Gly Tyr Trp Lys Asp Pro 
    130                 135                 140                 
Ala Val Gln Ala Lys Ile Gly Glu Trp Gln Arg Ala Ala Val Gly Val 
145                 150                 155                 160 
Met Phe Ser Arg Ser Leu Lys Ile Val Arg Phe Gly Asp Asn Met Arg 
                165                 170                 175     
Glu Val Ala Val Thr Glu Gly Asp Lys Ile Glu Ala Gln Leu Arg Leu 
            180                 185                 190         
Gly Trp Gln Val Asn Thr Phe Ala Val Gly Asp Leu Val Glu Tyr Met 
        195                 200                 205             
Asp Ala Val Thr Asp Ala Glu Ile Asp Ala Leu Met Lys Glu Tyr Ala 
    210                 215                 220                 
Glu Leu Tyr Glu Phe Ser Glu Ala Asp Thr Asp Thr Ile Arg Tyr Gln 
225                 230                 235                 240 
Ala Arg Glu Glu Ile Ala Ile Glu Lys Ile Leu Val Arg Glu Gly Ala 
                245                 250                 255     
Lys Ala Phe Ser Asn Thr Phe Glu Asp Leu His Gly Met Lys Gln Leu 
            260                 265                 270         
Pro Gly Leu Ala Thr Gln His Leu Met His Lys Gly Tyr Gly Phe Gly 
        275                 280                 285             
Ala Glu Gly Asp Trp Lys Thr Ala Gly Met Thr Ala Ile Val Lys Ala 
    290                 295                 300                 
Met Tyr Pro Asp Gly Asn Thr Ser Phe Met Glu Asp Tyr Thr Tyr Asp 
305                 310                 315                 320 
Tyr Glu Arg Gln Leu Ile Leu Gly Ser His Met Leu Glu Val Cys Pro 
                325                 330                 335     
Ser Ile Ala Ala Asp Arg Pro Arg Ile Glu Val His Lys Leu Gly Ile 
            340                 345                 350         
Gly Gly Lys Asp Ala Pro Ala Arg Ile Val Phe Glu Gly Arg Ala Gly 
        355                 360                 365             
Ser Ala Lys Val Leu Ser Leu Ile Asp Ile Gly Gly Arg Phe Arg Leu 
    370                 375                 380                 
Ile Gln Gln Asp Ile Glu Cys Glu Lys Pro Phe Gln Ser Met Pro Asn 
385                 390                 395                 400 
Leu Pro Val Ala Arg Thr Met Trp Arg Pro Ala Pro Ser Phe Leu Glu 
                405                 410                 415     
Gly Leu Glu Cys Trp Ile Ile Ala Gly Gly Ala His His Thr Val Leu 
            420                 425                 430         
Ser Tyr Asp Ile Thr Asp Glu Thr Val Arg Asp Phe Ala Arg Ile Met 
        435                 440                 445             
Gly Ile Glu Leu Val Val Ile Asn Lys Asp Thr Thr Lys Glu Lys Leu 
    450                 455                 460                 
Glu Arg Asp Ile Met Ile Gly Asp Val Ile Tyr Gly Arg 
465                 470                 475         

<210>  8
<211>  477
<212>  PRT
<213>  unknown

<220>
<223>  sequence from cow rumen metagenome dataset

<400>  8
Met Lys Phe Trp Phe Ile Thr Gly Ser Gln Phe Leu Tyr Gly Glu Glu 
1               5                   10                  15      
Thr Ile Arg Gln Val Glu Glu Asp Ser Lys Lys Ile Val Asp Gly Leu 
            20                  25                  30          
Lys Leu Pro Phe Pro Val Glu Tyr Lys Leu Thr Val Lys Lys Glu Ser 
        35                  40                  45              
Glu Ile Glu Arg Ile Val Lys Glu Ala Asn Phe Asp Asp Glu Cys Ala 
    50                  55                  60                  
Gly Ile Ile Thr Phe Cys His Thr Phe Ser Pro Ser Lys Met Trp Ile 
65                  70                  75                  80  
Asn Gly Leu Ala Ile Leu Gln Lys Pro Trp Leu His Phe His Thr Gln 
                85                  90                  95      
Phe Asn Glu Thr Ile Pro Asn Glu Ala Ile Asp Met Ala Tyr Met Asn 
            100                 105                 110         
Leu His Gln Ser Ala His Gly Asp Arg Glu His Gly Phe Ile Gly Ala 
        115                 120                 125             
Arg Leu Arg Met Pro Arg Ala Val Val Ala Gly Tyr Trp Lys Asp Pro 
    130                 135                 140                 
Glu Val Gln Ala Lys Ile Ala Glu Trp Gln Arg Ala Ala Val Gly Val 
145                 150                 155                 160 
Met Phe Ser Lys Ser Leu Lys Ile Val Arg Phe Gly Asp Asn Met Arg 
                165                 170                 175     
Glu Val Ala Val Thr Glu Gly Asp Lys Ile Glu Ala Gln Leu Lys Leu 
            180                 185                 190         
Gly Trp Gln Val Asn Thr Phe Ala Val Gly Asp Leu Val Glu Tyr Met 
        195                 200                 205             
Asn Ala Val Thr Asp Ala Glu Ile Asp Val Leu Met Lys Glu Tyr Ala 
    210                 215                 220                 
Glu Leu Tyr Asp Tyr Asp Lys Ala Asp Glu Glu Thr Ile Arg Tyr Gln 
225                 230                 235                 240 
Ala Arg Glu Glu Ile Ala Ile Glu Lys Ile Leu Val Arg Glu Gly Ala 
                245                 250                 255     
Lys Ala Phe Ser Asn Thr Phe Glu Asp Leu His Gly Met Gln Gln Leu 
            260                 265                 270         
Pro Gly Leu Ala Thr Gln His Leu Met His Lys Gly Tyr Gly Phe Gly 
        275                 280                 285             
Ala Glu Gly Asp Trp Lys Thr Ala Gly Met Thr Ala Ile Val Lys Ala 
    290                 295                 300                 
Met Tyr Pro Asp Gly Asn Thr Ser Phe Met Glu Asp Tyr Thr Tyr Asp 
305                 310                 315                 320 
Tyr Glu Arg Lys Leu Ile Leu Gly Ser His Met Leu Glu Val Cys Pro 
                325                 330                 335     
Ser Ile Ala Ala Asp Arg Pro Arg Ile Glu Val His Pro Leu Gly Ile 
            340                 345                 350         
Gly Gly Lys Glu Pro Pro Ala Arg Ile Val Phe Glu Gly Lys Ala Gly 
        355                 360                 365             
Ser Ala Lys Val Leu Ser Leu Ile Asp Ile Gly Gly Arg Leu Arg Leu 
    370                 375                 380                 
Ile Gln Gln Asp Ile Glu Cys Glu Lys Pro Phe Gln Ser Met Pro Asn 
385                 390                 395                 400 
Leu Pro Val Ala Arg Thr Met Trp Arg Pro Ala Pro Ser Phe Leu Glu 
                405                 410                 415     
Gly Leu Glu Cys Trp Ile Ile Ala Gly Gly Ala His His Thr Val Leu 
            420                 425                 430         
Ser Tyr Asp Ile Ser Asp Glu Thr Val Arg Asp Phe Ala Arg Ile Met 
        435                 440                 445             
Gly Ile Glu Leu Val Val Ile Asn Lys Asp Thr Thr Lys Glu Lys Leu 
    450                 455                 460                 
Glu Arg Asp Ile Met Ile Gly Asp Met Ile Tyr Gly Arg 
465                 470                 475         

<210>  9
<211>  498
<212>  PRT
<213>  unknown

<220>
<223>  sequence from cow rumen metagenome dataset

<400>  9
Met Ser Glu Met Lys Lys Tyr Gln Phe Trp Phe Cys Thr Gly Ser Gln 
1               5                   10                  15      
Asp Leu Tyr Gly Asp Glu Cys Leu Ala His Val Ala Ala His Ser Lys 
            20                  25                  30          
Glu Met Val Glu Gly Leu Asn Lys Ser Gly Val Leu Pro Phe Glu Ile 
        35                  40                  45              
Val Trp Lys Pro Thr Leu Ile Thr Asn Glu Leu Ile Arg Lys Thr Phe 
    50                  55                  60                  
Asn Glu Ala Asn Asn Asp Pro Asn Cys Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Ser Trp Ile Leu Gly Leu Gln Glu Phe 
                85                  90                  95      
Arg Lys Pro Leu Leu His Leu His Thr Gln Tyr Asn Glu Glu Ile Pro 
            100                 105                 110         
Tyr Ala Thr Met Asp Met Asp Phe Met Asn Glu Asn Gln Ala Ala His 
        115                 120                 125             
Gly Asp Arg Glu Tyr Ala His Ile Leu Ser Arg Met Arg Ile Glu Arg 
    130                 135                 140                 
Lys Val Val Val Gly Phe Trp Lys Asp Ser Glu Val Gln Lys Lys Ile 
145                 150                 155                 160 
Ala Ser Trp Met Arg Thr Ala Ile Gly Ile Met Glu Ser Ser His Ile 
                165                 170                 175     
Arg Val Cys Arg Val Ala Asp Asn Met Arg Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Gln Leu Lys Phe Gly Trp Glu Ile Asp Ala 
        195                 200                 205             
Tyr Pro Val Asn Glu Ile Ala Glu Ala Val Ala Ala Val Ser Ala Ser 
    210                 215                 220                 
Asp Thr Asn Ala Leu Val Asp Glu Tyr Tyr Ser Lys Tyr Asp Ile Cys 
225                 230                 235                 240 
Leu Glu Gly Arg Asp Pro Glu Glu Phe Lys Lys His Val Ala Val Gln 
                245                 250                 255     
Ala Gln Ile Glu Ile Gly Phe Glu Arg Phe Leu Lys Glu Lys Asn Tyr 
            260                 265                 270         
Gln Ala Ile Val Thr His Phe Gly Asp Leu Gly Ala Leu Lys Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Ile Gln Arg Leu Met Glu Lys Gly Tyr Gly Phe Gly 
    290                 295                 300                 
Ala Glu Gly Asp Trp Lys Val Ala Ala Met Val Arg Leu Met Lys Ile 
305                 310                 315                 320 
Met Ser Ala Gly Met Lys Asp Ala Lys Gly Ser Ser Met Leu Glu Asp 
                325                 330                 335     
Tyr Thr Tyr Asn Leu Val Lys Gly Lys Glu Gly Ile Ile Gln Ala His 
            340                 345                 350         
Met Leu Glu Ile Cys Pro Ser Ile Ser Asp Gly Pro Ile Gln Ile Lys 
        355                 360                 365             
Cys Gln Pro Leu Ser Met Gly Asp Arg Glu Asp Pro Ala Arg Leu Val 
    370                 375                 380                 
Phe Gln Ser Lys Thr Gly Ala Gly Ile Ala Thr Ser Leu Ile Asp Leu 
385                 390                 395                 400 
Gly Asn Arg Phe Arg Leu Ile Ile Gln Asp Val Glu Cys Lys Lys Val 
                405                 410                 415     
Glu Lys Pro Leu Pro Lys Leu Pro Thr Ala Ile Asn Phe Trp Thr Pro 
            420                 425                 430         
Gln Pro Asp Phe Tyr Thr Gly Thr Glu Ala Trp Leu Leu Ala Gly Gly 
        435                 440                 445             
Ala His His Thr Ala Phe Ser Tyr Asp Ile Thr Ala Glu Gln Met Gly 
    450                 455                 460                 
Asp Trp Ala Ala Ala Met Gly Ile Glu Ala Val Phe Ile Asp Lys Asn 
465                 470                 475                 480 
Thr Asn Ile Arg Asp Phe Lys Lys Asp Leu Met Leu Gly Glu Val Phe 
                485                 490                 495     
Tyr Arg 
        

<210>  10
<211>  487
<212>  PRT
<213>  unknown

<220>
<223>  sequence from cow rumen metagenome dataset

<400>  10
Met Gln Arg Glu Phe Trp Phe Ile Val Gly Ser Gln Phe Leu Tyr Gly 
1               5                   10                  15      
Gln Asp Val Leu Asp Thr Val Asp Ala Arg Ala Arg Glu Met Ala Ala 
            20                  25                  30          
Glu Leu Ser Lys Val Leu Pro Tyr Pro Leu Val Tyr Lys Val Thr Ala 
        35                  40                  45              
Lys Thr Asn Lys Glu Ile Ala Asp Thr Val Lys Glu Ala Asn Tyr Arg 
    50                  55                  60                  
Asp Glu Val Met Gly Ile Val Thr Trp Cys His Thr Phe Ser Pro Ser 
65                  70                  75                  80  
Lys Met Trp Ile Asn Gly Leu Val Asn Leu Gln Lys Pro Tyr Cys His 
                85                  90                  95      
Leu Ala Thr Gln Tyr Asn Arg Glu Leu Pro Asn Glu Glu Ile Asp Ile 
            100                 105                 110         
Asp Phe Met Asn Leu Asn Gln Ala Ala His Gly Asp Arg Glu His Gly 
        115                 120                 125             
Phe Ile Ala Ala Arg Leu Arg Met Pro Arg Lys Val Ile Ala Gly Tyr 
    130                 135                 140                 
Trp Gln Asp Glu Lys Val His Lys Arg Leu Ser Asp Trp Met Lys Ala 
145                 150                 155                 160 
Ala Val Gly Val Asp Val Ser Lys His Met Lys Val Met Arg Phe Gly 
                165                 170                 175     
Asp Asn Met Arg Glu Val Ala Val Thr Glu Gly Asp Lys Val Glu Thr 
            180                 185                 190         
Gln Ile Lys Leu Gly Trp Gln Val Asn Thr Trp Ala Val Gly Asp Leu 
        195                 200                 205             
Val Lys Glu Met Asn Asn Val Thr Glu Ala Glu Ile Asp Ala Leu Phe 
    210                 215                 220                 
Ala Glu Tyr Glu Ala Gln Tyr Asp Ile Ala Thr Asp Asn Leu Ala Ala 
225                 230                 235                 240 
Ile Arg Tyr Gln Ala Lys Glu Glu Ile Ala Met Lys Lys Met Leu Asp 
                245                 250                 255     
Arg Glu Gly Cys Lys Ala Phe Ser Asn Thr Phe Gln Asp Leu Tyr Gly 
            260                 265                 270         
Met Glu Gln Leu Pro Gly Leu Ala Ser Gln His Leu Met Ala Gln Gly 
        275                 280                 285             
Tyr Gly Tyr Gly Gly Glu Gly Asp Trp Lys Val Ser Ala Met Thr Ala 
    290                 295                 300                 
Ile Leu Lys Ala Met Gly Glu Asn Gly Asn Gly Ala Ser Ala Phe Met 
305                 310                 315                 320 
Glu Asp Tyr Thr Tyr His Leu Val Glu Gly Gln Glu Tyr Ser Leu Gly 
                325                 330                 335     
Ala His Met Leu Glu Val Cys Pro Ser Leu Ala Ala Asp Lys Pro Arg 
            340                 345                 350         
Ile Glu Thr His His Leu Gly Ile Gly Met Asn Glu Lys Asp Pro Ala 
        355                 360                 365             
Arg Leu Val Phe Glu Gly Lys Ala Gly Lys Gly Ile Val Thr Ser Leu 
    370                 375                 380                 
Ile Asp Met Gly Gly Arg Met Arg Leu Ile Val Gln Asp Ile Glu Ala 
385                 390                 395                 400 
Val Lys Pro Ile Leu Pro Met Pro Asn Leu Pro Val Ala Arg Val Met 
                405                 410                 415     
Trp Arg Ala Met Pro Asp Leu Thr Thr Gly Val Glu Cys Trp Ile Thr 
            420                 425                 430         
Ala Gly Gly Ala His His Thr Val Leu Ser Phe Asp Val Thr Pro Ala 
        435                 440                 445             
Met Leu Arg Asp Trp Ala Arg Met Met Asp Ile Glu Phe Val Tyr Ile 
    450                 455                 460                 
Thr Lys Asp Thr Thr Pro Glu Glu Leu Glu Glu Glu Leu Leu Ile Lys 
465                 470                 475                 480 
Asp Leu Val Trp Lys Leu Lys 
                485         

<210>  11
<211>  499
<212>  PRT
<213>  unknown

<220>
<223>  sequence from Human Microbiome dataset

<400>  11
Met Leu Lys Thr Lys Asn Tyr Gln Phe Trp Phe Cys Thr Gly Ser Gln 
1               5                   10                  15      
Asp Leu Tyr Gly Asp Glu Cys Leu Ala His Val Ala Glu His Ser Lys 
            20                  25                  30          
Ile Ile Val Asp Ala Leu Asn Lys Ser Gly Asn Leu Pro Tyr Glu Val 
        35                  40                  45              
Val Trp Lys Pro Thr Met Ile Thr Asn Glu Val Ile Arg Lys Thr Phe 
    50                  55                  60                  
Asn Glu Ala Asn Thr Asp Glu Asn Cys Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Ser Trp Ile Leu Gly Leu Gln Glu Tyr 
                85                  90                  95      
Arg Lys Pro Leu Leu His Leu His Thr Gln Phe Asn Arg Glu Ile Pro 
            100                 105                 110         
Tyr Asp Thr Ile Asp Met Asp Phe Met Asn Glu Asn Gln Ala Ala His 
        115                 120                 125             
Gly Asp Arg Glu Tyr Gly His Ile Phe Ser Arg Leu Asn Met Glu Arg 
    130                 135                 140                 
Lys Val Val Ala Gly Tyr Trp Glu Asp Glu Asp Val Gln Lys Gln Ile 
145                 150                 155                 160 
Gly Ser Trp Met Arg Thr Ala Val Gly Val Val Glu Ser Ser His Val 
                165                 170                 175     
Arg Val Met Arg Val Ala Asp Asn Met Arg Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Gln Ile Lys Phe Gly Trp Glu Val Asp Ala 
        195                 200                 205             
Tyr Pro Val Asn Glu Val Val Glu Ala Val Asn Ala Val Ser Gln Ala 
    210                 215                 220                 
Asp Ile Asp Thr Leu Val Glu Glu Tyr Tyr Asp Lys Tyr Asp Ile Leu 
225                 230                 235                 240 
Leu Glu Gly Arg Asp Glu Lys Glu Phe Arg Glu His Val Ala Val Gln 
                245                 250                 255     
Ala Gly Ile Glu Leu Gly Phe Glu Arg Phe Leu Asp Glu Asn Asn Tyr 
            260                 265                 270         
Gln Ala Val Val Thr His Phe Gly Asp Leu Gly Gly Leu Lys Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Met Gln Arg Leu Met Glu Lys Gly Tyr Gly Phe Gly 
    290                 295                 300                 
Ala Glu Gly Asp Trp Lys Thr Ala Ala Met Val Arg Val Met Lys Ile 
305                 310                 315                 320 
Met Thr Gln Gly Met Lys Asp Ala Lys Gly Thr Ser Phe Met Glu Asp 
                325                 330                 335     
Tyr Thr Tyr Asn Leu Val Ser Gly Lys Glu Gly Val Leu Glu Ala His 
            340                 345                 350         
Met Leu Glu Val Cys Pro Thr Ile Ala Asp Gly Lys Ile Ser Ile Lys 
        355                 360                 365             
Glu Gln Pro Leu Ser Met Gly Asn Arg Glu Asp Pro Ala Arg Leu Val 
    370                 375                 380                 
Phe Thr Ser Lys Thr Gly Pro Ala Ile Ala Thr Ser Leu Ile Asp Leu 
385                 390                 395                 400 
Gly Asp Arg Phe Arg Leu Ile Ile Asn Asp Val Asp Cys Lys Lys Thr 
                405                 410                 415     
Glu Lys Pro Met Pro Lys Leu Pro Val Ala Thr Ala Phe Trp Thr Pro 
            420                 425                 430         
Gln Pro Asn Leu Lys Val Gly Thr Glu Ala Trp Ile Leu Ala Gly Gly 
        435                 440                 445             
Ala His His Thr Ala Phe Ser Tyr Asp Leu Thr Ala Glu Gln Met Gly 
    450                 455                 460                 
Asp Trp Ala Ala Cys Met Gly Ile Glu Ala Val Tyr Ile Asp Lys Asp 
465                 470                 475                 480 
Thr Thr Ile Arg Gln Phe Lys Asn Glu Leu Leu Trp Asn Ser Val Ala 
                485                 490                 495     
Tyr Arg Lys 
            

<210>  12
<211>  498
<212>  PRT
<213>  unknown

<220>
<223>  sequence from Human Microbiome dataset

<400>  12
Met Thr Gly Val Lys Asn Tyr Lys Phe Trp Phe Cys Thr Gly Ser Gln 
1               5                   10                  15      
Asp Leu Tyr Gly Glu Glu Cys Leu Ala His Val Ala Glu His Ser Arg 
            20                  25                  30          
Ile Ile Val Glu Ser Leu Asn Arg Ser Gly Ile Leu Pro Tyr Glu Val 
        35                  40                  45              
Val Trp Lys Pro Thr Leu Ile Thr Asn Glu Leu Ile Arg Arg Thr Phe 
    50                  55                  60                  
Asn Glu Ala Asn Ala Asp Glu Glu Cys Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Ser Trp Ile Leu Gly Leu Gln Glu Phe 
                85                  90                  95      
Arg Lys Pro Leu Met His Phe His Thr Gln Phe Asn Arg Glu Ile Pro 
            100                 105                 110         
Tyr Asp Thr Ile Asp Met Asp Phe Met Asn Glu Asn Gln Ser Ala His 
        115                 120                 125             
Gly Asp Arg Glu Tyr Gly His Met Val Thr Arg Met Gly Ile Glu Arg 
    130                 135                 140                 
Lys Val Ile Val Gly His Trp Ser Asp Glu Lys Val Val Gly Arg Ile 
145                 150                 155                 160 
Ala Gly Trp Met Arg Thr Ala Val Gly Ile Met Glu Ser Ser His Val 
                165                 170                 175     
Arg Val Val Arg Phe Ala Asp Asn Met Arg Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Gln Val Lys Phe Gly Trp Glu Val Asp Ala 
        195                 200                 205             
Tyr Pro Val Asn Glu Leu Cys Gln Tyr Val Lys Ala Val Pro Lys Gly 
    210                 215                 220                 
Asp Ile Thr Ala Leu Val Asp Glu Tyr Tyr Ser Lys Tyr Thr Ile Leu 
225                 230                 235                 240 
Leu Glu Gly Arg Asp Pro Glu Glu Phe Lys Arg His Val Ala Val Gln 
                245                 250                 255     
Ala Gln Ile Glu Ala Gly Leu Glu Arg Phe Leu Val Glu Lys Asp Tyr 
            260                 265                 270         
His Ala Ile Val Thr His Phe Gly Asp Leu Gly Glu Leu Gln Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Ile Gln Arg Leu Met Glu Lys Gly Tyr Gly Phe Gly 
    290                 295                 300                 
Gly Glu Gly Asp Trp Lys Thr Ala Ala Met Val Arg Leu Met Lys Ile 
305                 310                 315                 320 
Met Ala Gln Gly Val Lys Asn Ala Lys Gly Thr Ser Phe Met Glu Asp 
                325                 330                 335     
Tyr Thr Tyr Asn Leu Val Pro Gly Lys Glu Gly Ile Leu Glu Ala His 
            340                 345                 350         
Met Leu Glu Val Cys Pro Ser Ile Ala Asp Gly Glu Ile Ser Ile Lys 
        355                 360                 365             
Val Asn Pro Leu Ser Met Gly Asp Arg Glu Asp Pro Ala Arg Leu Val 
    370                 375                 380                 
Phe Thr Ser Lys Thr Gly His Gly Ile Ala Thr Ser Leu Val Asp Leu 
385                 390                 395                 400 
Gly Thr Arg Phe Arg Leu Ile Ile Asn Asp Val Glu Cys Arg Lys Thr 
                405                 410                 415     
Glu Lys Ala Met Pro Lys Leu Pro Val Ala Thr Ala Phe Trp Thr Pro 
            420                 425                 430         
Glu Pro Ser Leu Ala Thr Gly Ala Glu Ala Trp Ile Leu Ala Gly Gly 
        435                 440                 445             
Ala His His Thr Ala Phe Ser Tyr Asp Leu Thr Ala Glu Gln Met Gly 
    450                 455                 460                 
Asp Trp Ala Glu Ser Met Gly Ile Glu Val Val Tyr Ile Asp Lys Asp 
465                 470                 475                 480 
Thr Thr Ile Arg Gly Leu Lys Asn Glu Met Arg Trp Asn Gly Ala Val 
                485                 490                 495     
Tyr Arg 
        

<210>  13
<211>  498
<212>  PRT
<213>  unknown

<220>
<223>  sequence from Human Microbiome dataset

<400>  13
Met Ile Ala Val Lys Asn Tyr Lys Phe Trp Phe Cys Thr Gly Ser Gln 
1               5                   10                  15      
Asp Leu Tyr Gly Asp Glu Cys Leu Ala His Val Ala Glu His Ser Gly 
            20                  25                  30          
Ile Ile Val Asp Ser Leu Asn Lys Ser Gly Ile Leu Pro Tyr Glu Val 
        35                  40                  45              
Val Leu Lys Pro Thr Leu Ile Thr Asn Glu Leu Ile Arg Arg Thr Phe 
    50                  55                  60                  
Asn Glu Ala Asn Ala Asp Glu Glu Cys Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Ser Trp Ile Leu Gly Leu Gln Glu Tyr 
                85                  90                  95      
Arg Lys Pro Leu Met His Phe His Thr Gln Phe Asn Gln Glu Ile Pro 
            100                 105                 110         
Tyr Asp Ser Ile Asp Met Asp Phe Met Asn Glu Asn Gln Ser Ala His 
        115                 120                 125             
Gly Asp Arg Glu Tyr Gly His Met Val Thr Arg Met Gly Ile Glu Arg 
    130                 135                 140                 
Lys Val Ile Val Gly His Trp Arg Asp Glu Lys Val Val Gly Arg Ile 
145                 150                 155                 160 
Ala Ala Trp Met Arg Thr Ala Val Gly Ile Met Glu Ser Ser His Val 
                165                 170                 175     
Arg Val Ala Arg Phe Ala Asp Asn Met Arg Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Gln Met Lys Phe Gly Trp Glu Val Asp Ala 
        195                 200                 205             
Tyr Pro Val Asn Glu Leu Ala Glu Tyr Val Lys Ala Val Pro Lys Gly 
    210                 215                 220                 
Asp Ile Thr Ala Leu Val Asp Glu Tyr Tyr Ser Lys Tyr Thr Ile Leu 
225                 230                 235                 240 
Leu Glu Gly Arg Asp Pro Glu Glu Phe Lys Arg His Val Ala Val Gln 
                245                 250                 255     
Ala Gln Ile Glu Ala Gly Leu Glu Lys Phe Leu Leu Glu Lys Asp Tyr 
            260                 265                 270         
His Ala Ile Val Thr His Phe Gly Asp Leu Gly Glu Leu Gln Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Ile Gln Arg Leu Met Glu Lys Gly Tyr Gly Phe Gly 
    290                 295                 300                 
Ala Glu Gly Asp Trp Lys Thr Ala Ala Met Val Arg Leu Met Lys Ile 
305                 310                 315                 320 
Met Thr Gln Gly Met Lys Asp Ala Lys Gly Thr Ser Phe Met Glu Asp 
                325                 330                 335     
Tyr Thr Tyr Asn Leu Val Pro Gly Lys Glu Gly Ile Leu Glu Ala His 
            340                 345                 350         
Met Leu Glu Val Cys Pro Thr Ile Ala Asp Gly Glu Ile Ser Ile Lys 
        355                 360                 365             
Ala Cys Pro Leu Ser Met Gly Asp Arg Glu Asp Pro Ala Arg Leu Val 
    370                 375                 380                 
Phe Thr Ser Lys Thr Gly His Gly Ile Ala Ala Ser Leu Val Asp Leu 
385                 390                 395                 400 
Gly Thr Arg Phe Arg Leu Ile Ile Asn Asp Val Glu Cys Lys Lys Thr 
                405                 410                 415     
Glu Lys Pro Met Pro Lys Leu Pro Val Ala Thr Ala Phe Trp Thr Pro 
            420                 425                 430         
Glu Pro Asn Leu Ala Thr Gly Ala Glu Ser Trp Ile Leu Ala Gly Gly 
        435                 440                 445             
Ala His His Thr Ala Phe Ser Tyr Asp Leu Thr Ala Glu Gln Met Gly 
    450                 455                 460                 
Asp Trp Ala Asp Ala Met Gly Ile Glu Thr Val Tyr Ile Asp Lys Asp 
465                 470                 475                 480 
Thr Thr Ile Arg Gly Leu Lys Asn Glu Leu Arg Trp Asn Ala Ala Ala 
                485                 490                 495     
Tyr Arg 
        

<210>  14
<211>  499
<212>  PRT
<213>  unknown

<220>
<223>  sequence from Human Microbiome dataset

<400>  14
Met Leu Lys Lys Lys Glu Tyr Lys Phe Trp Phe Cys Thr Gly Ser Gln 
1               5                   10                  15      
Asp Leu Tyr Gly Asp Glu Cys Leu Ala His Val Ala Glu His Ala Lys 
            20                  25                  30          
Ile Ile Val Glu Lys Leu Asn Glu Ser Gly Val Leu Pro Tyr Glu Val 
        35                  40                  45              
Val Trp Lys Pro Thr Leu Ile Thr Asn Glu Leu Ile Arg Lys Thr Phe 
    50                  55                  60                  
Asn Glu Ala Asn Ile Asp Asp Glu Cys Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Ser Trp Ile Leu Gly Leu Gln Glu Phe 
                85                  90                  95      
Arg Lys Pro Leu Leu His Leu His Thr Gln Phe Asn Met Glu Ile Pro 
            100                 105                 110         
Tyr Asp Thr Ile Asp Met Asp Phe Met Asn Glu Asn Gln Ser Ala His 
        115                 120                 125             
Gly Gly Arg Glu Phe Gly His Ile Phe Thr Arg Leu Gly Ile Glu Arg 
    130                 135                 140                 
Lys Val Val Val Gly His Trp Ser Asp Glu Lys Val Gln Glu Lys Ile 
145                 150                 155                 160 
Ala Ser Trp Met Arg Thr Ala Val Gly Val Ile Glu Ser Ser His Val 
                165                 170                 175     
Arg Val Met Arg Val Ala Asp Asn Met Arg Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Gln Ile Lys Phe Gly Trp Glu Val Asp Ala 
        195                 200                 205             
Tyr Pro Val Asn Glu Ile Ala Glu Ser Val Asp Ala Val Ser Ala Ala 
    210                 215                 220                 
Asp Val Asn Thr Leu Val Glu Glu Tyr Tyr Asp Lys Tyr Glu Ile Leu 
225                 230                 235                 240 
Leu Glu Gly Arg Asp Pro Glu Glu Phe Arg Lys His Val Ala Val Gln 
                245                 250                 255     
Ala Gln Ile Glu Leu Gly Phe Glu Arg Phe Leu Glu Glu Lys Asn Tyr 
            260                 265                 270         
Gln Ala Ile Val Thr His Phe Gly Asp Leu Gly Val Leu Lys Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Ile Gln Arg Leu Met Gln Lys Gly Tyr Gly Phe Gly 
    290                 295                 300                 
Ala Glu Gly Asp Trp Lys Thr Ala Ala Met Val Arg Ile Met Lys Ile 
305                 310                 315                 320 
Met Thr Glu Gly Met Lys Asp Ala Lys Gly Thr Ser Met Leu Glu Asp 
                325                 330                 335     
Tyr Thr Tyr Asn Phe Val Pro Gly Lys Glu Gly Ile Leu Gln Ala His 
            340                 345                 350         
Met Leu Glu Ile Cys Pro Ser Ile Ala Asp Gly Pro Ile Ser Ile Lys 
        355                 360                 365             
Val Asn Pro Leu Ser Met Gly Asp Arg Glu Asp Pro Ala Arg Leu Val 
    370                 375                 380                 
Phe Thr Ser Lys Glu Gly Lys Gly Ile Ala Thr Ser Leu Ile Asp Leu 
385                 390                 395                 400 
Gly Asp Arg Phe Arg Leu Ile Ile Asn Thr Val Asp Cys Lys Lys Asn 
                405                 410                 415     
Glu Lys Pro Met Pro Lys Leu Pro Val Ala Thr Asn Phe Trp Thr Pro 
            420                 425                 430         
Glu Pro Asp Leu Ala Thr Gly Ala Glu Ala Trp Ile Leu Cys Gly Gly 
        435                 440                 445             
Ala His His Thr Ala Phe Ser Tyr Asp Ile Thr Ala Glu Gln Met Gly 
    450                 455                 460                 
Asp Trp Ala Ala Met Met Gly Ile Glu Ala Val Tyr Ile Asp Lys Asp 
465                 470                 475                 480 
Thr Thr Ile Arg Asn Leu Lys Asn Glu Leu Arg Trp Asn Glu Leu Ala 
                485                 490                 495     
Phe Arg Lys 
            

<210>  15
<211>  498
<212>  PRT
<213>  unknown

<220>
<223>  sequence from Human Microbiome dataset

<400>  15
Met Lys Ala Ala Lys Asp Tyr Lys Phe Trp Phe Cys Thr Gly Ser Gln 
1               5                   10                  15      
Asp Leu Tyr Gly Asp Glu Cys Leu Ala His Val Ala Glu His Ser Arg 
            20                  25                  30          
Ile Ile Val Asp Ala Leu Asn Lys Ser Gly Val Leu Pro Tyr Glu Ile 
        35                  40                  45              
Val Trp Lys Pro Thr Leu Ile Thr Asn Glu Leu Ile Arg Lys Thr Phe 
    50                  55                  60                  
Asn Glu Ala Asn Ala Asp Glu Asn Cys Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Ser Trp Ile Leu Gly Leu Gln Glu Phe 
                85                  90                  95      
Arg Lys Pro Leu Leu His Phe His Thr Gln Phe Asn Arg Glu Ile Pro 
            100                 105                 110         
Tyr Asp Thr Ile Asp Met Asp Phe Met Asn Glu Asn Gln Ala Ala His 
        115                 120                 125             
Gly Asp Arg Glu Tyr Gly His Ile Val Ser Arg Met Gly Ile Glu Arg 
    130                 135                 140                 
Lys Ile Ile Val Gly Tyr Trp Glu Asp Arg Asp Val Gln Glu Lys Ile 
145                 150                 155                 160 
Ala Ser Trp Met Leu Thr Ala Ile Gly Ile Met Glu Ser Ser His Ile 
                165                 170                 175     
Arg Val Cys Arg Ile Ala Asp Asn Met Arg Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Gln Ile Lys Phe Gly Trp Glu Ile Asp Ala 
        195                 200                 205             
Tyr Pro Val Asn Glu Ile Ala Glu Tyr Val Ala Ala Val Pro Gln Gly 
    210                 215                 220                 
Glu Ile Asn Ala Leu Val Glu Glu Tyr Tyr Ser Lys Tyr Asp Ile Ile 
225                 230                 235                 240 
Leu Glu Gly Arg Asp Pro Gln Glu Phe Arg Glu His Val Ala Val Gln 
                245                 250                 255     
Ala Gly Ile Glu Ile Gly Phe Glu Lys Phe Leu Glu Glu Lys Asn Tyr 
            260                 265                 270         
Gln Ala Ile Val Thr His Phe Gly Asp Leu Gly Ser Leu Lys Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Ile Gln Arg Leu Met Glu Lys Gly Tyr Gly Phe Gly 
    290                 295                 300                 
Gly Glu Gly Asp Trp Lys Thr Ala Ala Met Val Arg Leu Met Lys Ile 
305                 310                 315                 320 
Met Thr Ala Gly Val Lys Asn Pro Lys Gly Thr Ser Phe Met Glu Asp 
                325                 330                 335     
Tyr Thr Tyr Asn Leu Val Pro Gly Lys Glu Gly Val Leu Glu Ala His 
            340                 345                 350         
Met Leu Glu Val Cys Pro Ser Val Ala Asp Gly Pro Ile Gly Ile Lys 
        355                 360                 365             
Val Cys Pro Leu Ser Met Gly Asp Arg Glu Asp Pro Ala Arg Leu Val 
    370                 375                 380                 
Tyr Thr Ser Lys Thr Gly Pro Ala Ile Ala Thr Ser Leu Ile Asp Leu 
385                 390                 395                 400 
Gly Asn Arg Phe Arg Leu Ile Ile Asn Glu Val Glu Cys Lys Lys Val 
                405                 410                 415     
Glu Lys Pro Met Pro Lys Leu Pro Val Ala Thr Ala Phe Trp Thr Pro 
            420                 425                 430         
Tyr Pro Asp Leu Lys Thr Gly Ala Glu Ala Trp Ile Leu Ala Gly Gly 
        435                 440                 445             
Ala His His Thr Ala Phe Ser Tyr Asp Leu Thr Ala Glu Gln Met Gly 
    450                 455                 460                 
Asp Trp Ala Ala Ala Met Gly Ile Glu Ala Val Tyr Ile Asp Lys Asp 
465                 470                 475                 480 
Thr Thr Ile Arg Asn Phe Lys Arg Asp Leu Gln Leu Gly Asn Ile Val 
                485                 490                 495     
Tyr Arg 
        

<210>  16
<211>  498
<212>  PRT
<213>  unknown

<220>
<223>  sequence from Human Microbiome dataset

<400>  16
Met Val Thr Gly Arg Asn Tyr Lys Phe Trp Phe Cys Thr Gly Ser Gln 
1               5                   10                  15      
Asp Leu Tyr Gly Asp Glu Cys Leu Arg Lys Val Ala Glu His Ser Arg 
            20                  25                  30          
Ile Ile Val Glu Glu Leu Asn Lys Ser Gly Val Leu Pro Phe Glu Leu 
        35                  40                  45              
Val Trp Lys Pro Thr Leu Ile Thr Asn Glu Leu Ile Arg Lys Thr Phe 
    50                  55                  60                  
Asn Glu Ala Asn Ala Asp Asp Glu Cys Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Ser Trp Ile Leu Gly Leu Lys Glu Tyr 
                85                  90                  95      
Arg Lys Pro Leu Cys His Leu His Thr Gln Phe Asn Gln Glu Ile Pro 
            100                 105                 110         
Tyr Asp Thr Ile Asp Met Asp Phe Met Asn Glu Asn Gln Ser Ala His 
        115                 120                 125             
Gly Gly Arg Glu Tyr Gly His Ile Val Thr Arg Met Gly Ile Glu Arg 
    130                 135                 140                 
Lys Val Ile Val Gly His Trp Ala Asp Lys Lys Val Gln Glu Arg Leu 
145                 150                 155                 160 
Ala Ser Trp Met Arg Thr Ala Val Gly Ile Met Glu Ser Ser His Ile 
                165                 170                 175     
Arg Val Cys Arg Val Ala Asp Asn Met Arg Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Gln Ile Lys Phe Gly Trp Glu Val Asp Ala 
        195                 200                 205             
Tyr Pro Val Asn Glu Val Cys Asp Tyr Val Lys Asp Val Ser Lys Gly 
    210                 215                 220                 
Asp Ile Asp Val Leu Val Glu Glu Tyr Tyr Asn Lys Tyr Asp Ile Leu 
225                 230                 235                 240 
Phe Glu Gly Arg Asp Pro Glu Glu Phe Lys Arg His Val Ala Val Gln 
                245                 250                 255     
Ala Ala Ile Glu Ile Gly Phe Glu Arg Phe Leu Glu Glu Lys Asn Tyr 
            260                 265                 270         
Gln Ala Val Val Thr His Phe Gly Asp Leu Gly Gly Leu Gln Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Met Gln Arg Leu Met Glu Lys Gly Tyr Gly Phe Gly 
    290                 295                 300                 
Ala Glu Gly Asp Trp Lys Thr Ala Ala Met Val Arg Leu Met Lys Ile 
305                 310                 315                 320 
Met Thr Ala Gly Val Lys Asp Ala Lys Gly Thr Ser Phe Met Glu Asp 
                325                 330                 335     
Tyr Thr Tyr Asn Leu Val Pro Gly Lys Glu Gly Ile Leu Gln Ser His 
            340                 345                 350         
Met Leu Glu Val Cys Pro Thr Ile Ala Asp Gly Lys Ile Gly Ile Lys 
        355                 360                 365             
Val Cys Pro Leu Ser Met Gly Asp Arg Glu Asp Pro Ala Arg Leu Phe 
    370                 375                 380                 
Thr Ser Lys Thr Gly Pro Ala Val Ala Thr Ser Leu Val Asp Leu Gly 
385                 390                 395                 400 
Asp Arg Phe Arg Leu Ile Ile Asn Asp Val Asp Cys Lys Lys Val Glu 
                405                 410                 415     
Lys Pro Met Pro Lys Leu Pro Val Gly Ser Ala Phe Trp Thr Pro Gln 
            420                 425                 430         
Pro Asp Leu Ala Thr Gly Ala Glu Ala Trp Ile Leu Ala Gly Gly Ala 
        435                 440                 445             
His His Thr Ala Phe Ser Tyr Asp Leu Thr Ala Glu Gln Met Gly Asp 
    450                 455                 460                 
Trp Ala Ala Ala Met Gly Ile Glu Ala Val Tyr Ile Asp Lys Asp Thr 
465                 470                 475                 480 
Thr Ile Arg Asn Phe Lys Asn Glu Leu Arg Trp Asn Glu Val Ala Phe 
                485                 490                 495     
Arg Lys 
        

<210>  17
<211>  491
<212>  PRT
<213>  unknown

<220>
<223>  sequence from cow rumen metagenome dataset

<400>  17
Met Glu Asp Ile Met Lys Arg Glu Phe Trp Phe Ile Val Gly Ser Gln 
1               5                   10                  15      
Phe Leu Tyr Gly Gln Asp Val Leu Asp Thr Val Asp Ala Arg Ala Lys 
            20                  25                  30          
Glu Met Ala Ala Glu Leu Ser Lys Val Leu Pro Tyr Pro Leu Val Tyr 
        35                  40                  45              
Lys Val Thr Ala Lys Thr Asn Lys Glu Ile Thr Asp Val Ile Lys Glu 
    50                  55                  60                  
Ala Asn Tyr Arg Asp Glu Cys Ala Gly Ile Val Thr Trp Cys His Thr 
65                  70                  75                  80  
Phe Ser Pro Ser Lys Met Trp Ile Asn Gly Leu Ala Asn Leu Gln Lys 
                85                  90                  95      
Pro Tyr Cys His Leu Ala Thr Gln Tyr Asn Lys Glu Ile Pro Asn Asp 
            100                 105                 110         
Glu Ile Asp Met Asp Phe Met Asn Leu Asn Gln Ala Ala His Gly Asp 
        115                 120                 125             
Arg Glu His Gly Phe Ile Ala Ala Arg Leu Arg Leu Pro Arg Lys Val 
    130                 135                 140                 
Ile Ala Gly Phe Trp Gln Asp Glu Lys Ile His Lys Arg Leu Ser Asp 
145                 150                 155                 160 
Trp Met Arg Ala Ala Val Gly Val Ala Val Ser Lys Lys Met Lys Val 
                165                 170                 175     
Met Arg Phe Gly Asp Asn Met Arg Glu Val Ala Val Thr Glu Gly Asp 
            180                 185                 190         
Lys Val Glu Val Gln Thr Lys Leu Gly Trp Gln Val Asn Thr Trp Ala 
        195                 200                 205             
Val Gly Asp Leu Val Lys Glu Met Gly Lys Val Thr Glu Ala Glu Ile 
    210                 215                 220                 
Asp Ala Leu Val Ala Glu Tyr Glu Ala Asn Tyr Asp Ile Ala Thr Asp 
225                 230                 235                 240 
Asn Thr Ala Ala Ile Arg Tyr Gln Ala Arg Glu Glu Ile Ala Met Lys 
                245                 250                 255     
Lys Met Leu Asp Arg Glu Gly Cys Arg Ala Phe Thr Asn Thr Phe Gln 
            260                 265                 270         
Asp Leu Tyr Gly Met Glu Gln Leu Pro Gly Leu Ala Ser Gln His Leu 
        275                 280                 285             
Met Ala Gln Gly Tyr Gly Tyr Gly Gly Glu Gly Asp Trp Lys Val Ser 
    290                 295                 300                 
Ala Met Thr Ala Ile Leu Lys Ala Met Gly Glu Asn Gly Asn Gly Ala 
305                 310                 315                 320 
Ser Gly Phe Met Glu Asp Tyr Thr Tyr His Leu Val Glu Gly Gln Glu 
                325                 330                 335     
Tyr Ser Leu Gly Ala His Met Leu Glu Val Cys Pro Ser Leu Ala Ala 
            340                 345                 350         
Asp Lys Pro Arg Ile Glu Thr His His Leu Gly Ile Gly Met Asn Glu 
        355                 360                 365             
Lys Asp Pro Ala Arg Leu Val Phe Glu Gly Lys Ala Gly Lys Gly Ile 
    370                 375                 380                 
Val Val Ser Leu Ile Asp Met Gly Gly Arg Leu Arg Leu Ile Val Gln 
385                 390                 395                 400 
Asp Ile Glu Ala Val Lys Pro Ile Leu Pro Met Pro Asn Leu Pro Val 
                405                 410                 415     
Ala Arg Val Met Trp Arg Ala Met Pro Asp Leu Thr Thr Gly Val Glu 
            420                 425                 430         
Cys Trp Ile Thr Ala Gly Gly Ala His His Thr Val Leu Ser Tyr Asp 
        435                 440                 445             
Val Thr Ala Glu Gln Met Arg Asp Trp Ala Arg Met Met Asp Ile Glu 
    450                 455                 460                 
Phe Val His Ile Thr Lys Asp Thr Thr Pro Glu Lys Leu Glu Glu Glu 
465                 470                 475                 480 
Leu Leu Val Lys Asp Leu Val Trp Lys Leu Lys 
                485                 490     

<210>  18
<211>  488
<212>  PRT
<213>  unknown

<220>
<223>  sequence from cow rumen metagenome dataset

<400>  18
Met Ser Lys Glu Phe Trp Phe Val Val Gly Ser Gln Asp Leu Tyr Gly 
1               5                   10                  15      
Glu Glu Val Leu Lys Ile Val Ala Glu Arg Ala Ala Glu Met Ala Ala 
            20                  25                  30          
Trp Leu Ser Glu Lys Leu Pro Tyr Pro Leu Ile Tyr Lys Val Thr Ala 
        35                  40                  45              
Met Ser Ser Asn Gln Ile Thr Ser Val Met Lys Glu Ala Asn Phe Asp 
    50                  55                  60                  
Asp Asn Cys Leu Gly Val Val Thr Trp Cys His Thr Phe Ser Pro Ser 
65                  70                  75                  80  
Lys Met Trp Leu Thr Gly Leu Asp Leu Leu Gln Lys Pro Trp Cys His 
                85                  90                  95      
Phe Ala Thr Gln Tyr Asn Leu Glu Ile Pro Asn Glu Glu Ile Asp Met 
            100                 105                 110         
Asp Phe Met Asn Leu Asn Gln Ala Ala His Gly Asp Arg Glu His Gly 
        115                 120                 125             
Phe Ile Gly Ala Arg Leu Arg Lys Ala Arg Lys Val Val Ala Gly Tyr 
    130                 135                 140                 
Trp Lys Asp Glu Lys Val Ile Ala Arg Leu Ala Glu Phe Gln Lys Val 
145                 150                 155                 160 
Ala Val Gly Val Asp Ala Ser Lys His Met Lys Val Met Arg Phe Gly 
                165                 170                 175     
Asp Asn Met Arg Asp Val Ala Val Thr Glu Gly Asp Lys Val Glu Val 
            180                 185                 190         
Gln Lys Lys Leu Gly Trp Glu Val Asn Thr Trp Ala Val Gly Asp Leu 
        195                 200                 205             
Val Lys Glu Met Asn Ala Val Thr Asp Glu Glu Val Glu Ala Leu Phe 
    210                 215                 220                 
Asn Glu Tyr Lys Ala Ser Tyr Asp Ile Asn Thr Asp Asn Ile Tyr Ala 
225                 230                 235                 240 
Ile Lys Tyr Gln Ala Arg Glu Glu Ile Ala Ile Lys Lys Met Met Asp 
                245                 250                 255     
Arg Asn Gly Cys Lys Ala Phe Ser Asn Thr Phe Gln Asp Leu Tyr Gly 
            260                 265                 270         
Met Glu Gln Leu Pro Gly Leu Ala Ser Gln His Leu Met Ser Leu Gly 
        275                 280                 285             
Tyr Gly Tyr Gly Gly Glu Gly Asp Trp Lys Val Ser Ala Met Thr Ala 
    290                 295                 300                 
Ile Leu Lys Ala Met Gly Glu Asn Gly Asn Gly Ala Ser Ala Phe Met 
305                 310                 315                 320 
Glu Asp Tyr Thr Tyr His Leu Val Lys Gly His Glu Tyr Ser Leu Gly 
                325                 330                 335     
Ala His Met Leu Glu Val Cys Pro Ser Leu Ala Ala Asp Lys Pro Arg 
            340                 345                 350         
Ile Glu Thr His His Leu Gly Ile Gly Met Asn Glu Lys Asp Pro Ala 
        355                 360                 365             
Arg Leu Val Phe Glu Gly Lys Glu Gly Arg Gly Ile Val Ala Ser Leu 
    370                 375                 380                 
Ile Asp Met Gly Gly Arg Leu Arg Leu Ile Val Gln Asp Ile Glu Ala 
385                 390                 395                 400 
Val Lys Pro Ile Met Pro Met Pro Asn Leu Pro Val Ala Arg Val Met 
                405                 410                 415     
Trp Arg Ala Leu Pro Asp Leu Thr Asp Gly Val Glu Cys Trp Ile Thr 
            420                 425                 430         
Ala Gly Gly Ala His His Thr Val Leu Ser Tyr Asp Val Thr Pro Glu 
        435                 440                 445             
Met Met Arg Asp Phe Ala Lys Phe Met Asp Ile Glu Phe Val His Ile 
    450                 455                 460                 
Asp Lys Asp Thr Thr Val Glu Lys Leu Glu Asp Glu Leu Met Val Lys 
465                 470                 475                 480 
Asp Leu Val Trp Lys Met Lys Gly 
                485             

<210>  19
<211>  488
<212>  PRT
<213>  unknown

<220>
<223>  sequence from cow rumen metagenome dataset

<400>  19
Met Gly Asn Lys Lys Asn Phe Trp Phe Val Val Gly Ser Gln Phe Leu 
1               5                   10                  15      
Tyr Gly Asn Glu Val Leu Glu Thr Val Ala Ala Arg Ala Gln Glu Met 
            20                  25                  30          
Ala Glu Lys Met Ser Lys Ser Leu Pro Tyr Glu Leu Lys Phe Lys Gly 
        35                  40                  45              
Ile Val Lys Thr Trp Asp Glu Ala Thr Gln Tyr Ala Lys Glu Ala Asn 
    50                  55                  60                  
Phe Asp Asp Asn Cys Cys Gly Val Ile Thr Trp Cys His Thr Phe Ser 
65                  70                  75                  80  
Pro Ser Lys Met Trp Ile Glu Ala Phe Arg Leu Leu Gln Lys Pro Leu 
                85                  90                  95      
Leu His Phe Ala Thr Gln Tyr Asn Arg Tyr Ile Pro Asp Lys Glu Ile 
            100                 105                 110         
Asp Met Asp Phe Met Asn Leu Asn Gln Ala Ala His Gly Asp Arg Glu 
        115                 120                 125             
His Gly Phe Ile Ile Ala Arg Met Arg Leu Gln Gln Lys Ile Val Thr 
    130                 135                 140                 
Gly Phe Trp Glu Asp Gln Pro Val Leu Asp Glu Ile Gly Thr Trp Met 
145                 150                 155                 160 
Arg Ala Ala Val Ala Tyr Asp Phe Ser Arg Asn Leu Arg Val Met Arg 
                165                 170                 175     
Phe Gly Asp Asn Met Arg Glu Val Ala Val Thr Glu Gly Asp Lys Val 
            180                 185                 190         
Glu Ala Gln Ile Lys Phe Gly Trp Gln Val Asn Thr Trp Pro Val Gly 
        195                 200                 205             
Lys Leu Val Glu Glu Ile Gly Lys Val Thr Glu Glu Glu Val Asp Glu 
    210                 215                 220                 
Leu Leu Lys Val Tyr Thr Asp Thr Tyr Glu Leu Ala Thr Asp Asp Ile 
225                 230                 235                 240 
Glu Thr Ile Arg Tyr Gln Ala Arg Glu Glu Ile Ala Met Lys Lys Met 
                245                 250                 255     
Met Thr Ala Glu Gly Ala Asn Ala Phe Val Asn Thr Phe Gln Asp Leu 
            260                 265                 270         
Ile Gly Met Lys Gln Leu Pro Gly Ile Ala Ser Gln His Leu Met Ala 
        275                 280                 285             
Gln Gly Tyr Gly Tyr Gly Ala Glu Gly Asp Trp Lys Leu Ser Ala Leu 
    290                 295                 300                 
Val Ser Ile Val Lys Lys Met Thr Glu Gly Met Thr Gly Gly Thr Ser 
305                 310                 315                 320 
Phe Met Glu Asp Tyr Thr Tyr His Leu Asp Pro Asn Ala Glu Tyr Ala 
                325                 330                 335     
Leu Gly Ala His Met Leu Glu Val Cys Pro Ser Ile Ala Ala Asp Lys 
            340                 345                 350         
Pro Arg Ile Glu Val His Pro Leu Gly Ile Gly Asp Arg Glu Asp Pro 
        355                 360                 365             
Ala Arg Leu Val Phe Glu Gly His Glu Gly Asp Ala Val Val Val Thr 
    370                 375                 380                 
Leu Ile Asp Met Gly Glu Arg Phe Arg Met Leu Val Gln Asp Ile His 
385                 390                 395                 400 
Cys Val Lys Pro Ile Tyr Glu Met Pro Asn Leu Pro Val Ala Arg Val 
                405                 410                 415     
Met Trp Glu Gly Lys Pro Ser Leu Asn Glu Gly Leu Lys Met Trp Leu 
            420                 425                 430         
Met Ala Gly Gly Ala His His Ser Val Leu Ser Tyr Asp Ala Thr Pro 
        435                 440                 445             
Glu Met Leu Lys Asp Leu Ala Arg Met Met Asp Ile Glu Phe Val His 
    450                 455                 460                 
Ile Thr Ala Asp Ser Lys Pro Glu Glu Phe Glu Lys Asp Leu Phe Phe 
465                 470                 475                 480 
Ala Asp Leu Ala Trp Lys Leu Lys 
                485             

<210>  20
<211>  470
<212>  PRT
<213>  unknown

<220>
<223>  sequence from cow rumen metagenome dataset

<400>  20
Met Lys Lys Ile Tyr Phe Ile Thr Gly Ser Gln Asp Leu Tyr Gly Glu 
1               5                   10                  15      
Asp Val Leu Lys Thr Val Ala Lys Asp Ser Gln Glu Met Val Asn Phe 
            20                  25                  30          
Leu Asp Glu Gln Val Gly Glu Arg Ala Glu Ile Glu Phe Leu Gly Val 
        35                  40                  45              
Val Arg Asp Ser Glu Ile Cys Leu Asp Phe Ile Leu Lys Ala Asn Phe 
    50                  55                  60                  
Asp Lys Glu Cys Ile Gly Ile Ile Thr Trp Met His Thr Phe Ser Pro 
65                  70                  75                  80  
Ala Lys Met Trp Ile Arg Gly Leu Lys Val Leu His Lys Pro Met Leu 
                85                  90                  95      
His Leu His Thr Gln Tyr Asn Glu Lys Leu Pro Tyr Asp Ser Ile Asp 
            100                 105                 110         
Met Asp Phe Met Asn Leu Asn Gln Ala Ala His Gly Asp Arg Glu Phe 
        115                 120                 125             
Gly Phe Ile Ala Ala Arg Met Asn Ile Lys Gln His Val Leu Ser Gly 
    130                 135                 140                 
Tyr Tyr Lys Asn Lys Asp Phe Ile Glu Gly Val Lys Gln Tyr Ile Asp 
145                 150                 155                 160 
Val Cys Leu Ser Ile Asp Ala Ala Lys Tyr Leu Arg Val Ala Met Phe 
                165                 170                 175     
Gly Ser Asn Met Arg Asp Val Ala Val Thr Asp Gly Asp Arg Val Gln 
            180                 185                 190         
Ser Glu Ile Asp Phe Gly Trp Asn Val Asn Tyr Tyr Gly Ile Gly Asp 
        195                 200                 205             
Leu Val Asp Ile Ile Asn Lys Val Lys Asp Glu Glu Ile Asp Ala Gln 
    210                 215                 220                 
Phe Glu Glu Tyr Lys Lys Arg Tyr Thr Ile Asn Thr Thr Asn Ile Glu 
225                 230                 235                 240 
Ala Ile Lys Glu Gln Ala Lys Tyr Glu Val Ala Leu Lys Lys Phe Ile 
                245                 250                 255     
Lys Lys Glu Asn Val Gln Ala Phe Thr Asp Asn Phe Gln Asp Leu His 
            260                 265                 270         
Gly Leu Lys Gln Leu Pro Gly Leu Ala Val Gln Asp Leu Met Gln Glu 
        275                 280                 285             
Gly Ile Ser Phe Gly Pro Glu Gly Asp Tyr Lys Thr Pro Ala Leu Leu 
    290                 295                 300                 
Ala Thr Leu Leu Pro Met Thr Lys Tyr Arg Lys Gly Ala Thr Gly Phe 
305                 310                 315                 320 
Ile Glu Asp Tyr Thr Tyr Asp Leu Ile Glu Gly Lys Glu Ile Glu Leu 
                325                 330                 335     
Gly Ser His Met Leu Glu Val Pro Pro Ser Phe Ala Thr Ser Lys Pro 
            340                 345                 350         
Glu Ile Gln Val Arg Pro Leu Ser Ile Gly Asp Lys Ala Ala Pro Ala 
        355                 360                 365             
Arg Leu Val Phe Asp Ser Ile Glu Gly Glu Gly Leu Gln Ile Thr Met 
    370                 375                 380                 
Val Asp Met Gly Thr His Phe Arg Ile Ile Ala Ala Lys Ile Gln Leu 
385                 390                 395                 400 
Val Lys Gln Pro Ala Pro Met Pro Lys Leu Pro Val Ala Arg Ile Met 
                405                 410                 415     
Trp Lys His Ile Pro Asn Phe Lys Ile Ser Thr Glu Ala Trp Met Leu 
            420                 425                 430         
Tyr Gly Gly Gly His His Ser Val Ile Thr Thr Ala Leu Thr Ile Glu 
        435                 440                 445             
Asp Ile Lys Leu Phe Ala Lys Leu Thr Gly Thr Glu Leu Cys Val Ile 
    450                 455                 460                 
Asp Glu Asn Thr Lys Ile 
465                 470 

<210>  21
<211>  1488
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the human microbiome dataset

<400>  21
atgaacttga agccacacac cttctggttc gtcactggtt cccaacactt gtacggtcca       60
gaaactttgg aacaagtcgc tgaacactcc agaatcgtcg ctactgaatt tgacaaggac      120
ccagttttca cctacccaat cgtcttcaag ccaatcgtta ccactccaga tgaaatctac      180
aagttgatct tggaagctaa caacgacgaa tcctgtgctg gtatcatgac ttggatgcac      240
accttctctc cagctaagat gtggatcgct ggtttgtccc aattgcaaaa gccattgttg      300
cacttccaca ctcaattcaa cagagatatc ccttgggaaa ccatcgacat ggatttcatg      360
aacttgaacc aatctgctca cggtgacaga gaatacggtc acatcggtgc tagattgggt      420
atcgctagaa aggttgtcgt tggtcactgg gaagacggtg aagtcagagg ttctatcgct      480
ggttggatga gaaccgctgc tgcttacgct gaatccagaa gattgaaggt cgctagattc      540
ggtgacaaca tgagacaagt cgctgttacc gaaggtgaca aggttgaagc tcaaatcaag      600
ttgggttggt ctgtcaacgg ttacggtatc ggtgacttgg ttcaatccat gaacgaagtc      660
ggtgacgaag aagttaaggc tttgttgaac gaatacgctg aatcttactc catcactaag      720
gaaggtttgt ctgacggtcc agtcagagat tccatcgctt accaagctag aatcgaaatc      780
gctttgagaa gattcttgga agaaggtggt ttcggtgctt tcaccactac cttcgaagac      840
ttgcacggta tgaagcaatt gccaggtttg gctgttcaaa gattgatgga atctggttac      900
ggtttcggtg gtgaaggtga ctggaagact gctgctttga ccagagtctt gaaggttttg      960
gctgacaaca agtctacttc cttcatggaa gattacacct accacttcga accaggtaac     1020
cacatgatct tgggttctca catgttggaa gtctgtccaa ctatcgcttt ggacaagcca     1080
accttggaag ttcacccatt gggtatcggt ggtaaaggtg acccagctag attggtcttc     1140
aacggtcaag atggtccagc tgttaacgct tctttgatcg acttgggtca cagattcaga     1200
ttgttggtca acgtcgttga cggtgtcaag gttgaacaac caatgccaaa gttgccagtc     1260
gctagagttt tgtggaagcc acaaccatct ttgagagaat ctgctgaagc ctggatcttg     1320
gctggtggtg ctcaccacac tgttttgtct tacgctatga ccgctgaaca cttgtccgac     1380
tgggctgaaa tgaccggtat cgaagctgtc gttatcgaca aggatactac catcccaaga     1440
ttcaagaacg aattgagatg gtctgaagct gcttacagat tgagatga                  1488

<210>  22
<211>  1488
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the human microbiome dataset

<400>  22
atgaagttga agccacactc tttctggttc gttactggtt cccaacactt gtacggtcca       60
gaaaccttgg aagaagtcgc tggtcactcc agaatcatcg ctgaacaatt ggacaaggac      120
ccagctatcg gtttcccagt tgtcttcaag ccaatcgtta ccactccaga cgaaatctac      180
aagttgatct tggctgctaa cggtgacgaa acttgtgctg gtatcatcac ttggatgcac      240
accttctctc cagctaagat gtggatcgct ggtttgtccc aattgcaaaa gccattgttg      300
cacttccaca ctcaattcaa cagagacatc ccttgggaaa ccatcgacat ggatttcatg      360
aacttgaacc aatctgctca cggtgacaga gaatacggtc acatcggtgc tagattgggt      420
atcaacagaa agatcgttgt cggtcactgg gaagacgaag aagtcagagc ttctttggct      480
ggttggatga gaactgctgt tgcttacgct gaatccagac aattgaaggt cgctagattc      540
ggtgacaaca tgagagaagt tgctgtcacc gaaggtgaca aggttgaagc tcaaatcaag      600
ttcggttggt ctgttaacgg ttacggtgtc ggtgacttgg ttcaagtctt gaacgaagtc      660
accgatgctg aagctgaagc cttgttgaag gaatacgctg aacaatacac tatcacccaa      720
gctggtttgt cttccggtcc aatcagagac tccatcgctt accaagccaa gttggaaatc      780
gctatgaaga gattcttgga acaaggtggt ttcggtgctt tcaccactac cttcgaagac      840
ttgcacggtt tgaagcaatt gccaggtttg gctgttcaaa gattgatgga agctggttac      900
ggtttcggtg gtgaaggtga ctggaagact gctgctttga ccagagtttt gaaggtcttg      960
gctaacaaca agtctacttc cttcatggaa gactacacct accacttcga accaggtaac     1020
cacatgatct tgggtgctca catgttggaa gtttgtccaa ctatcgctgc tactaagcca     1080
accatcgaag tccacccatt gggtatcggt ggtaaagctg acccagctag aatggttttc     1140
gatggtcaag ctggtccagc tgttaacgct tctttggttg acttgggtca cagattcaga     1200
ttgttggtca acgttgtcga tggtgttaag gtcgaaaagc caatgccaaa gttgccagtt     1260
gctagagtct tgtggaagcc acaaccatct ttgagagaat ctgctgaagc ctggatcttg     1320
gctggtggtg ctcaccacac tgttttgtct tacgctatca ccgctgaaaa cttgtccgac     1380
tgggctgaaa tggttggtat cgaagctgtc atcatcgaca aggatacctc tgtcccaaga     1440
ttcaagaacg aattgagatg gtccgacgct gcttacagat tgagatga                  1488

<210>  23
<211>  1488
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the human microbiome dataset

<400>  23
atgcaaagaa ctccatacga attctggttc gtcactggtt cccaacactt gtacggttcc       60
gaagctttgg ctgaagtctc ctcccactcc agacaaatca ctcaagcctt caacgaagct      120
gactctatct ccttcccaat cgttgtcaag ccagttgtca agaccccaga agaaatcttg      180
caattgtgta tggaagctaa ctctgacgaa aactgtgctg gtttgatcac ttggatgcac      240
accttctctc caggtaaaat gtggatcggt ggtttgtccc aattgcacaa gccattgttg      300
cacttccaca ctcaattcca cagagaaatc ccttgggaca gaatcgacat ggatttcatg      360
aacttgcacc aatctgctca cggtgacaga gaatttggtt tcatcgctac cagattgggt      420
atcttgagaa aggaagttgt cggtcactgg agagacgaag ctgttcaaaa gagattgtct      480
gattggatga gaactgctat cgcttgtttg gaaggtaaaa agttgaaggt tgctagattc      540
ggtgacaaca tgagaagagt tgctgtcacc gaaggtgaca aggtcgaagc tcaaatccaa      600
ttcggttggt ctatcaacgg ttacggtgtt ggtgacttgg tccaaagaat cactgacatc      660
tccgataccg ctgttcacca attgttcaga gaataccaag aaagatacga cttcccacca      720
gaagctagag aagctggtcc aatcagagat tctatcttgg aacaagctag aatcgaattg      780
ggtttgaagt tgttcttgag agaaggtggt tactccgctt tcaccactac cttcgaagac      840
ttgcacggtt tgaagcaatt gccaggtttg gctgtccaaa gattgatgtc tgaaggttac      900
ggtttcggtg ctgaaggtga ctggagaact gctggtttgt tgagaatgat gaagatcatg      960
gctgacaacg aaggtacttc cttcatggaa gattacacct accacttgga accaggtaac     1020
gaaatgatct tgggtgctca catgttggaa gtttgtccaa ccatcgctgc tcaaagacca     1080
ggtatcagag tccacccatt gtctatcggt ggtaaagctg acccagctag attggttttc     1140
gatggtagac caggtccagc tttgaacgtc tctttgatcg acttgggtaa cagattcaga     1200
ttgttgatca acaaggttga tgctgtccac ccaaagtccg ctatgccaca cttgccagtt     1260
gctagagtct tgtggaagcc aagaccatct ttgcacgact ctgctgaagc ctggatgtac     1320
gctggtggtg ctcaccacac tgttttctct taccacgtca ctaccgaaca attgttggac     1380
tgggctgaat gggttgatat ggaagccttg gttatcgacg aacaaacctc cttgtcttcc     1440
ttcagaagac aattgaagtg gaacgacgct tactacagaa tcagatga                  1488

<210>  24
<211>  1500
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the human microbiome dataset

<400>  24
atgttgaaga ctaagaacta ccaattctgg ttctgtactg gttcccaaga tttgtacggt       60
gacgaatgtt tggctcacgt tgctgaacac gctaagaaga tcgttgaagc cttgaacgct      120
tctggtaact tgccatacga agttgtctgg aagccaacct tgatcactaa cgaattgatc      180
agaagaacct tcaacgaagc taacactgac gaaaactgtg ctggtgtcat cacctggatg      240
cacactttct ctccagctaa gtcttggatc ttgggtttgc aagagttcag aaagccattg      300
ttgcacttgc acacccaatt caacagagaa atcccatacg acactatcga catggatttc      360
atgaacgaaa accaatctgc tcacggtgac agagaatttg gtcacatctt ctccagattg      420
cacatgaaca gaaaggttgt cgttggttac tgggctgacg aagatgttca aaagcaaatc      480
ggttcttgga tgagaaccgc tgttggtgtc gttgaatctt cccacatcag agtcatgaga      540
atcgctgaca acatgagaaa cgtcgctgtt actgaaggtg acaaggttga agctcaaatc      600
aagttcggtt gggaagttga cgcttaccca gtcaacgaag ctgtcgaagc tgttaacgct      660
gtctcccaag ctgacatcga taccttggtc gaagaatact acgacaagta cgaaatcttg      720
ttggaaggta gagatgaaaa ggagttcaga agacacgtcg ctgttcaagc tggtatcgaa      780
atcggtttgg aaagattctt ggaagaaaac aactaccaag ctatcgttac tcacttcggt      840
gacttgggtg gtttcaagca attgccaggt ttggctatgc aaagattgat ggaaaagggt      900
tacggtttcg gtgctgaagg tgactggaag accgctgcta tggtcagatt gatgaagatc      960
atgactggtg gtatgaagga cgctaagggt acttctttca tggaagatta cacttacaac     1020
ttggttccag gtaaagaagg tatcttggaa gctcacatgt tggaagtctg tccaaccatc     1080
gctgacggta aaatctctat caaggaacaa ccattgtcta tgggtgacag agaagatcca     1140
gctagattgg ttttcactgc taaggaaggt ccagctatcg ctgcttcttt gatcgacttg     1200
ggtgacagat tcagattgtt gatcaacgaa gttgaatgta agaagaccga aaagccaatg     1260
ccaaagttgc cagtcgctac cgctttctgg actccaaagc caaacttgaa gatcggtgct     1320
caatcctgga tcttggctgg tggtgctcac cacactgctt tctcttacga cttgtccgct     1380
gaacaaatgg gtgactgggc tgaagctatg ggtatcgaag ctgtctacat cgacgctgat     1440
accactatca gacaattgaa gaacgaattg agatggaacg aattggctta cagaagatga     1500

<210>  25
<211>  1497
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the human microbiome dataset

<400>  25
atgaagactg gtagagatta caagttctgg ttctgtactg gttcccaaga tttgtacggt       60
gaagaatgtt tgagaaaggt tgctgaacac tccgctaaga tcgtcgaagg tttgaacgct      120
tctggtagat tgccattcga agttgtcttg aagccaacct tgatcgatcc agctaccatc      180
agaagaactt tgaacgaagc taacgaagac ggtgaatgtg ctggtgttat cacctggatg      240
cacactttct ccccagctaa gatgtggatc ttgggtttga aggaatacag aaagccattg      300
tgtcacttgc acacccaatt caacgaagaa atcccatacg atactatcga catggatttc      360
atgaacgaaa accaatccgc tcacggtgac agagaatttg gtcacatggt ttccagaatg      420
ggtatggaaa gaaagatcat cgtcggtcac tgggctaacg ctgaagttca agaaaagatc      480
ggttcttgga tgagaaccgc tatcggtatc atggaatctt cccacatcag agtctgtaga      540
atcggtgaca acatgaacaa cgttgctgtc actgaaggtg acaaggtcga agctgaagtc      600
aagttcggtt gggaaatcga tcactactgt gttaacgacg ctgttgaata cgtcaacgct      660
gtttccgaag gtgacgtcaa cgctttggtt gaagaatact actctaagta ccaaatcttg      720
ttggaaggta gagacccaga agagttcaga gctcacgtcg ctgctcaagc taagatcgaa      780
atcggtttgg aaaagttctt ggaagacggt gactaccacg ctatcgttac ccacttcggt      840
atgttgggtg gtttgcaaca attgccaggt ttggctatcc aaagattgat ggaaaagggt      900
tacggtttcg gtggtgaagg tgactggaag actgctgcta tggtcagatt gatgaagatc      960
atggctgctg gtgttccagg tgctaagggt acttctttca tggaagacta cacttacaac     1020
ttggtcccag gtaaagaagg tatcttgcaa gctcacatgt tggaagtctg tccatctatc     1080
gctgaaggtc caatctccat caaggttcaa ccattgtcta tgggtaacag agaagaccca     1140
gctagattgg ttttcacctc caagactggt ccagctgtcg ctacctcttt ggttgatttg     1200
ggtaacagat tcagattgat catcaacgct gttgactgta agaagtgtga aaaggaaatg     1260
ccaaagttgc cagttgctac cgctttctgg actccacaac cagacttggc tactggtgct     1320
caagcctgga tcttggctgg tggtgctcac cacaccgctt tctcctacga cttgactgtc     1380
gatcaaatgg ttgactgggc tgctgctatg ggtatcgaat ctgttgtcat cgacaaggat     1440
accactatca gaaacttcaa gaacgaattg agatggaact ctatctacta cagatga        1497

<210>  26
<211>  1497
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the  cow rumen metagenome dataset

<400>  26
atgatccaaa ctaaggctta caagttctgg ttctgtactg gttcccaaga tttgtacggt       60
gacgaagttt tgagacacgt tgctgatcac tctaaggaaa tcgttgaaga attgaacaag      120
tccggtatct tgccatacga agttgtctgg aagccagtct tgatcaccaa ccaattgatc      180
agacaaactt tcaacgaagc taacgctgac gattcttgtg ctggtgttat cacctggatg      240
cacactttct ctccagctaa gtcttggatc ttgggtttgc aagagttcag aaagccattg      300
ttgcacttgc acacccaata caacgaagaa atcccatacg atactatcga catggatttc      360
atgaacgaaa accaagctgc tcacggtgac agagaatacg gtcacatcgt ttccagaatg      420
ggtatcgaaa gaaaggtcat cgctggttac tggaaggaca acgaagttag atccagaatc      480
gcttcctgga tgagaaccgc tgttggtgtc atggaatctt cccacatcag agttatgaga      540
gtcgctgata acatgagaaa cgttgctgtc actgaaggtg acaaggttga agctcaaatc      600
aagttcggtt gggaagttga cacctaccca gtcaacgaaa tcgctgattc tgttgctact      660
gtctctgctt ccgacgtcaa cgctttgttg gacgaatact acgataagta cgaaatcatc      720
ttggacggta gagacccaga tgagttcaag aagcacgttg ctgtccaagc tcaaatcgaa      780
ttgggtttcg aaagattctt ggaagaaaag aactaccaag ctatcgttac ccacttcggt      840
gacttgggtg ctttgggtca attgccaggt ttggctatcc aaagattgat ggaaaagggt      900
tacggtttcg gtgctgaagg tgactggaag gttgctgcta tggtcagatt gatgaagatc      960
atgacctctg gtatgaagga tgctaagggt acttccatgt tggaagacta cacttacaac     1020
ttggttagag gtaaagaagg tatcttggaa gctcacatgt tggaaatctg tccaactatc     1080
gctgacggtc caatctctat cagagtcaag ccattgtcta tgggtgacag agaagatcca     1140
gctagattgg ttttcacctc taaggaaggt aaaggtgtcg ctacttcctt gatcgacttg     1200
ggtaacagat tcagattgat catcaacgaa gttgaatgta agaagaccga aaagccaatg     1260
ccaaacttgc cagtcgctac cgcttactgg actccatacc cagacttgta cactggtgct     1320
gaagcctgga tcttggctgg tggtgctcac cacaccgctt tctcttacga cttgacttcc     1380
ggtcaaatgg ctgattgggc tgaaatgatg ggtatcgaag ctgttatcat cgataagaac     1440
accactatcc cagctttcaa gaaggaattg aagttgggtg acgtcttcta cagatga        1497

<210>  27
<211>  1434
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the  cow rumen metagenome dataset

<400>  27
atgaagttct ggttcgttac tggttcccaa ttcttgtacg gtgaagaaac cttgagacaa       60
gttgaagaag actctaagaa gatcgttgac ggtttgagat tgccattccc agttgaatac      120
aagttgaccg tcaagactga atctgaaatc gaaagaatcg ttaaggaagc taactacgac      180
gatgaatgtg ctggtatcat caccttctgt cacactttct ctccatccaa gatgtggatc      240
aacggtttgg ctttgttgca aaagccttgg ttgcacttcc acacccaatt caacgaaact      300
atcccaaacg aagctatcga catggattac atgaacttgc accaatccgc tcacggtgac      360
agagaacacg gtttcatcgg tgctagattg agagttccaa gagctgttgt cgctggttac      420
tggaaggacc cagctgtcca agctaagatc ggtgaatggc aaagagctgc tgttggtgtc      480
atgttctcca gatccttgaa gatcgtcaga ttcggtgaca acatgagaga agttgctgtc      540
accgaaggtg acaagatcga agctcaattg agattgggtt ggcaagttaa caccttcgct      600
gttggtgact tggtcgaata catggacgct gtcactgatg ctgaaatcga cgctttgatg      660
aaggaatacg ctgaattgta cgaattttct gaagctgaca ccgatactat cagataccaa      720
gctagagaag aaatcgctat cgaaaagatt ttggttagag aaggtgctaa ggctttctcc      780
aacaccttcg aagacttgca cggtatgaag caattgccag gtttggctac tcaacacttg      840
atgcacaagg gttacggttt cggtgctgaa ggtgactgga agaccgctgg tatgactgct      900
atcgtcaagg ctatgtaccc agacggtaac acctctttca tggaagatta cacttacgac      960
tacgaaagac aattgatctt gggttctcac atgttggaag tttgtccatc catcgctgct     1020
gatagaccaa gaatcgaagt ccacaagttg ggtatcggtg gtaaagacgc tccagctaga     1080
atcgttttcg aaggtagagc tggttctgct aaggtcttgt ccttgatcga tatcggtggt     1140
agattcagat tgatccaaca agacatcgaa tgtgaaaagc cattccaatc tatgccaaac     1200
ttgccagttg ctagaaccat gtggagacca gctccatcct tcttggaagg tttggaatgt     1260
tggatcatcg ctggtggtgc tcaccacact gttttgtctt acgacatcac cgatgaaact     1320
gtcagagatt tcgctagaat catgggtatc gaattggttg tcatcaacaa ggacaccact     1380
aaggaaaagt tggaaagaga tatcatgatc ggtgacgtca tctacggtag atga           1434

<210>  28
<211>  1434
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the  cow rumen metagenome dataset

<400>  28
atgaagttct ggttcatcac tggttcccaa ttcttgtacg gtgaagaaac tatcagacaa       60
gtcgaagaag attccaagaa gatcgtcgac ggtttgaagt tgccattccc agttgaatac      120
aagttgaccg tcaagaagga atctgaaatc gaaagaatcg ttaaggaagc taacttcgac      180
gatgaatgtg ctggtatcat caccttctgt cacactttct ctccatccaa gatgtggatc      240
aacggtttgg ctatcttgca aaagccttgg ttgcacttcc acacccaatt caacgaaact      300
atcccaaacg aagctatcga catggcttac atgaacttgc accaatctgc tcacggtgac      360
agagaacacg gtttcatcgg tgctagattg agaatgccaa gagctgttgt cgctggttac      420
tggaaggacc cagaagttca agctaagatc gctgaatggc aaagagctgc tgttggtgtc      480
atgttctcta agtccttgaa gatcgtcaga ttcggtgaca acatgagaga agttgctgtc      540
accgaaggtg acaagatcga agctcaattg aagttgggtt ggcaagtcaa caccttcgct      600
gttggtgact tggtcgaata catgaacgct gttactgacg ctgaaatcga tgtcttgatg      660
aaggaatacg ctgaattgta cgactacgat aaggctgacg aagaaactat cagataccaa      720
gctagagaag aaatcgctat cgaaaagatt ttggttagag aaggtgctaa ggctttctct      780
aacaccttcg aagacttgca cggtatgcaa caattgccag gtttggctac tcaacacttg      840
atgcacaagg gttacggttt cggtgctgaa ggtgactgga agaccgctgg tatgactgct      900
atcgtcaagg ctatgtaccc agacggtaac acctccttca tggaagacta cacttacgat      960
tacgaaagaa agttgatctt gggttctcac atgttggaag tttgtccatc catcgctgct     1020
gacagaccaa gaatcgaagt ccacccattg ggtatcggtg gtaaagaacc accagctaga     1080
atcgttttcg aaggtaaagc tggttctgct aaggtcttgt ccttgatcga catcggtggt     1140
agattgagat tgatccaaca agatatcgaa tgtgaaaagc cattccaatc tatgccaaac     1200
ttgccagttg ctagaactat gtggagacca gctccatcct tcttggaagg tttggaatgt     1260
tggatcatcg ctggtggtgc tcaccacacc gttttgtctt acgacatctc cgatgaaact     1320
gtcagagact tcgctagaat catgggtatc gaattggttg tcatcaacaa ggataccact     1380
aaggaaaagt tggaaagaga catcatgatc ggtgacatga tctacggtag atga           1434

<210>  29
<211>  1497
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the  cow rumen metagenome dataset

<400>  29
atgtccgaaa tgaagaagta ccaattctgg ttctgtactg gttcccaaga tttgtacggt       60
gacgaatgtt tggctcacgt tgctgctcac tctaaggaaa tggtcgaagg tttgaacaag      120
tccggtgtct tgccattcga aatcgtttgg aagccaacct tgatcactaa cgaattgatc      180
agaaagacct tcaacgaagc taacaacgac ccaaactgtg ctggtgttat cacctggatg      240
cacactttct ctccagctaa gtcttggatc ttgggtttgc aagagttcag aaagccattg      300
ttgcacttgc acacccaata caacgaagaa atcccatacg ctactatgga catggatttc      360
atgaacgaaa accaagctgc tcacggtgac agagaatacg ctcacatctt gtccagaatg      420
agaatcgaaa gaaaggttgt cgttggtttc tggaaggatt ctgaagtcca aaagaagatc      480
gcttcctgga tgagaaccgc tatcggtatc atggaatctt cccacatcag agtctgtaga      540
gttgctgaca acatgagaaa cgtcgctgtt actgaaggtg acaaggtcga agctcaattg      600
aagttcggtt gggaaatcga cgcttaccca gttaacgaaa tcgctgaagc tgtcgctgct      660
gtttctgctt ccgacaccaa cgctttggtc gatgaatact actctaagta cgacatctgt      720
ttggaaggta gagatccaga agagttcaag aagcacgtcg ctgttcaagc tcaaatcgaa      780
atcggtttcg aaagattctt gaaggaaaag aactaccaag ctatcgttac tcacttcggt      840
gacttgggtg ctttgaagca attgccaggt ttggctatcc aaagattgat ggaaaagggt      900
tacggtttcg gtgctgaagg tgactggaag gtcgctgcta tggttagatt gatgaagatc      960
atgtctgctg gtatgaagga cgctaagggt tcttccatgt tggaagatta cacctacaac     1020
ttggtcaagg gtaaagaagg tatcatccaa gctcacatgt tggaaatctg tccatctatc     1080
tccgacggtc caatccaaat caagtgtcaa ccattgtcta tgggtgacag agaagatcca     1140
gctagattgg ttttccaatc taagaccggt gctggtatcg ctacttcctt gatcgacttg     1200
ggtaacagat tcagattgat catccaagat gtcgaatgta agaaggttga aaagccattg     1260
ccaaagttgc caaccgctat caacttctgg actccacaac cagacttcta caccggtact     1320
gaagcctggt tgttggctgg tggtgctcac cacaccgctt tctcttacga catcactgct     1380
gaacaaatgg gtgactgggc tgctgctatg ggtatcgaag ctgtcttcat cgacaagaac     1440
actaacatca gagacttcaa gaaggatttg atgttgggtg aagttttcta cagatga        1497

<210>  30
<211>  1464
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the  cow rumen metagenome dataset

<400>  30
atgcaaagag aattctggtt catcgtcggt tcccaattct tgtacggtca agatgttttg       60
gacactgttg atgctagagc tagagaaatg gctgctgaat tgtctaaggt cttgccatac      120
ccattggtct acaaggttac cgctaagact aacaaggaaa tcgctgacac tgttaaggaa      180
gctaactaca gagatgaagt catgggtatc gttacctggt gtcacacttt ctctccatcc      240
aagatgtgga tcaacggttt ggtcaacttg caaaagccat actgtcactt ggctacccaa      300
tacaacagag aattgccaaa cgaagaaatc gacatcgatt tcatgaactt gaaccaagct      360
gctcacggtg acagagaaca cggtttcatc gctgctagat tgagaatgcc aagaaaggtc      420
atcgctggtt actggcaaga cgaaaaggtt cacaagagat tgtctgattg gatgaaggct      480
gctgttggtg ttgacgtttc caagcacatg aaggtcatga gattcggtga caacatgaga      540
gaagtcgctg ttaccgaagg tgacaaggtc gaaactcaaa tcaagttggg ttggcaagtt      600
aacacttggg ctgtcggtga cttggttaag gaaatgaaca acgttaccga agctgaaatc      660
gacgctttgt tcgctgaata cgaagctcaa tacgacatcg ctactgataa cttggctgct      720
atcagatacc aagctaagga agaaatcgct atgaagaaga tgttggatag agaaggttgt      780
aaggctttct ctaacacctt ccaagacttg tacggtatgg aacaattgcc aggtttggct      840
tcccaacact tgatggctca aggttacggt tacggtggtg aaggtgactg gaaggtctct      900
gctatgactg ctatcttgaa ggctatgggt gaaaacggta acggtgcttc cgctttcatg      960
gaagactaca cctaccactt ggtcgaaggt caagaatact ctttgggtgc tcacatgttg     1020
gaagtttgtc catccttggc tgctgacaag ccaagaatcg aaactcacca cttgggtatc     1080
ggtatgaacg aaaaggaccc agctagattg gtcttcgaag gtaaagctgg taaaggtatc     1140
gttacctctt tgatcgatat gggtggtaga atgagattga tcgtccaaga catcgaagct     1200
gttaagccaa tcttgccaat gccaaacttg ccagtcgcta gagttatgtg gagagctatg     1260
ccagacttga ccactggtgt tgaatgttgg atcaccgctg gtggtgctca ccacactgtc     1320
ttgtctttcg acgttacccc agctatgttg agagactggg ctagaatgat ggatatcgaa     1380
tttgtctaca tcactaagga taccactcca gaagaattgg aagaagaatt gttgatcaag     1440
gacttggttt ggaagttgaa gtga                                            1464

<210>  31
<211>  1500
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the human microbiome dataset

<400>  31
atgttgaaga ctaagaacta ccaattctgg ttctgtactg gttcccaaga tttgtacggt       60
gacgaatgtt tggctcacgt tgctgaacac tctaagatca tcgttgacgc tttgaacaag      120
tccggtaact tgccatacga agttgtctgg aagccaacca tgatcactaa cgaagttatc      180
agaaagacct tcaacgaagc taacactgac gaaaactgtg ctggtgtcat cacctggatg      240
cacactttct ctccagctaa gtcttggatc ttgggtttgc aagaatacag aaagccattg      300
ttgcacttgc acacccaatt caacagagaa atcccatacg acactatcga catggatttc      360
atgaacgaaa accaagctgc tcacggtgac agagaatacg gtcacatctt ctccagattg      420
aacatggaaa gaaaggttgt cgctggttac tgggaagacg aagatgttca aaagcaaatc      480
ggttcctgga tgagaaccgc tgtcggtgtt gtcgaatctt cccacgttag agtcatgaga      540
gttgctgaca acatgagaaa cgttgctgtc actgaaggtg acaaggtcga agctcaaatc      600
aagttcggtt gggaagttga cgcttaccca gtcaacgaag ttgtcgaagc tgttaacgct      660
gtctctcaag ctgacatcga taccttggtt gaagaatact acgacaagta cgatatcttg      720
ttggaaggta gagacgaaaa ggagttcaga gaacacgttg ctgtccaagc tggtatcgaa      780
ttgggtttcg aaagattctt ggacgaaaac aactaccaag ctgttgtcac tcacttcggt      840
gacttgggtg gtttgaagca attgccaggt ttggctatgc aaagattgat ggaaaagggt      900
tacggtttcg gtgctgaagg tgactggaag accgctgcta tggttagagt catgaagatc      960
atgactcaag gtatgaagga cgctaagggt acttctttca tggaagatta cacttacaac     1020
ttggtttccg gtaaagaagg tgtcttggaa gctcacatgt tggaagtctg tccaaccatc     1080
gctgacggta aaatctctat caaggaacaa ccattgtcta tgggtaacag agaagaccca     1140
gctagattgg ttttcacctc taagactggt ccagctatcg ctacctcctt gatcgacttg     1200
ggtgacagat tcagattgat catcaacgac gtcgattgta agaagactga aaagccaatg     1260
ccaaagttgc cagttgctac cgctttctgg actccacaac caaacttgaa ggtcggtact     1320
gaagcctgga tcttggctgg tggtgctcac cacaccgctt tctcttacga cttgactgct     1380
gaacaaatgg gtgactgggc tgcttgtatg ggtatcgaag ctgtttacat cgacaaggat     1440
accactatca gacaattcaa gaacgaattg ttgtggaact ctgtcgctta cagaaagtaa     1500

<210>  32
<211>  1497
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the human microbiome dataset

<400>  32
atgactggtg ttaagaacta caagttctgg ttctgtactg gttcccaaga tttgtacggt       60
gaagaatgtt tggctcacgt cgctgaacac tccagaatca tcgttgaatc tttgaacaga      120
tccggtatct tgccatacga agttgtctgg aagccaacct tgatcactaa cgaattgatc      180
agaagaacct tcaacgaagc taacgctgac gaagaatgtg ctggtgtcat cacctggatg      240
cacactttct ctccagctaa gtcttggatc ttgggtttgc aagagttcag aaagccattg      300
atgcacttcc acacccaatt caacagagaa atcccatacg acactatcga catggatttc      360
atgaacgaaa accaatccgc tcacggtgac agagaatacg gtcacatggt tactagaatg      420
ggtatcgaaa gaaaggttat cgtcggtcac tggtctgacg aaaaggttgt cggtagaatc      480
gctggttgga tgagaaccgc tgttggtatc atggaatctt cccacgtcag agttgtcaga      540
ttcgctgaca acatgagaaa cgttgctgtc actgaaggtg acaaggttga agctcaagtc      600
aagttcggtt gggaagttga cgcttaccca gtcaacgaat tgtgtcaata cgttaaggct      660
gtcccaaagg gtgacatcac cgctttggtc gatgaatact actccaagta cactatcttg      720
ttggaaggta gagacccaga agagttcaag agacacgttg ctgtccaagc tcaaatcgaa      780
gctggtttgg aaagattctt ggttgaaaag gactaccacg ctatcgtcac ccacttcggt      840
gacttgggtg aattgcaaca attgccaggt ttggctatcc aaagattgat ggaaaagggt      900
tacggtttcg gtggtgaagg tgactggaag actgctgcta tggttagatt gatgaagatc      960
atggctcaag gtgtcaagaa cgctaagggt acttctttca tggaagacta cacttacaac     1020
ttggttccag gtaaagaagg tatcttggaa gctcacatgt tggaagtttg tccatctatc     1080
gctgacggtg aaatctccat caaggtcaac ccattgtcta tgggtgacag agaagatcca     1140
gctagattgg ttttcacctc caagactggt cacggtatcg ctacctcttt ggttgacttg     1200
ggtactagat tcagattgat catcaacgat gtcgaatgta gaaagaccga aaaggctatg     1260
ccaaagttgc cagtcgctac cgctttctgg actccagaac catctttggc tactggtgct     1320
gaagcctgga tcttggctgg tggtgctcac cacaccgctt tctcctacga cttgactgct     1380
gaacaaatgg gtgactgggc tgaatctatg ggtatcgaag ttgtctacat cgacaaggat     1440
accactatca gaggtttgaa gaacgaaatg agatggaacg gtgctgtcta cagataa        1497

<210>  33
<211>  1497
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the human microbiome dataset

<400>  33
atgatcgctg ttaagaacta caagttctgg ttctgtactg gttcccaaga tttgtacggt       60
gacgaatgtt tggctcacgt tgctgaacac tctggtatca tcgttgactc tttgaacaag      120
tccggtatct tgccatacga agttgtcttg aagccaacct tgatcactaa cgaattgatc      180
agaagaacct tcaacgaagc taacgctgac gaagaatgtg ctggtgtcat cacctggatg      240
cacactttct ctccagctaa gtcttggatc ttgggtttgc aagaatacag aaagccattg      300
atgcacttcc acacccaatt caaccaagaa atcccatacg actctatcga catggatttc      360
atgaacgaaa accaatccgc tcacggtgac agagaatacg gtcacatggt tactagaatg      420
ggtatcgaaa gaaaggttat cgtcggtcac tggagagacg aaaaggttgt cggtagaatc      480
gctgcttgga tgagaaccgc tgttggtatc atggaatctt cccacgttag agtcgctaga      540
ttcgctgaca acatgagaaa cgttgctgtc actgaaggtg acaaggtcga agctcaaatg      600
aagttcggtt gggaagttga cgcttaccca gtcaacgaat tggctgaata cgttaaggct      660
gtcccaaagg gtgacatcac cgctttggtc gatgaatact actctaagta cactatcttg      720
ttggaaggta gagacccaga agagttcaag agacacgttg ctgtccaagc tcaaatcgaa      780
gctggtttgg aaaagttctt gttggaaaag gactaccacg ctatcgttac ccacttcggt      840
gacttgggtg aattgcaaca attgccaggt ttggctatcc aaagattgat ggaaaagggt      900
tacggtttcg gtgctgaagg tgactggaag accgctgcta tggtcagatt gatgaagatc      960
atgactcaag gtatgaagga cgctaagggt acttctttca tggaagatta cacttacaac     1020
ttggttccag gtaaagaagg tatcttggaa gctcacatgt tggaagtctg tccaactatc     1080
gctgacggtg aaatctctat caaggcttgt ccattgtcta tgggtgacag agaagatcca     1140
gctagattgg ttttcacctc taagactggt cacggtatcg ctgcttcctt ggttgacttg     1200
ggtactagat tcagattgat catcaacgat gtcgaatgta agaagactga aaagccaatg     1260
ccaaagttgc cagtcgctac cgctttctgg actccagaac caaacttggc taccggtgct     1320
gaatcttgga tcttggctgg tggtgctcac cacaccgctt tctcctacga cttgactgct     1380
gaacaaatgg gtgactgggc tgatgctatg ggtatcgaaa ctgtttacat cgacaaggat     1440
accactatca gaggtttgaa gaacgaattg agatggaacg ctgctgctta cagataa        1497

<210>  34
<211>  1500
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the human microbiome dataset

<400>  34
atgttgaaga agaaggaata caagttctgg ttctgtactg gttcccaaga tttgtacggt       60
gacgaatgtt tggctcacgt tgctgaacac gctaagatca tcgtcgaaaa gttgaacgaa      120
tccggtgttt tgccatacga agttgtctgg aagccaacct tgatcactaa cgaattgatc      180
agaaagacct tcaacgaagc taacatcgac gatgaatgtg ctggtgtcat cacctggatg      240
cacactttct ctccagctaa gtcttggatc ttgggtttgc aagagttcag aaagccattg      300
ttgcacttgc acacccaatt caacatggaa atcccatacg acactatcga catggatttc      360
atgaacgaaa accaatctgc tcacggtggt agagaatttg gtcacatctt cactagattg      420
ggtatcgaaa gaaaggttgt cgttggtcac tggtccgacg aaaaggttca agaaaagatc      480
gcttcttgga tgagaaccgc tgtcggtgtt atcgaatctt cccacgtcag agttatgaga      540
gtcgctgaca acatgagaaa cgtcgctgtt actgaaggtg acaaggttga agctcaaatc      600
aagttcggtt gggaagttga cgcttaccca gttaacgaaa tcgctgaatc tgttgacgct      660
gtttccgctg ctgatgtcaa caccttggtt gaagaatact acgacaagta cgaaatcttg      720
ttggaaggta gagatccaga agagttcaga aagcacgtcg ctgttcaagc tcaaatcgaa      780
ttgggtttcg aaagattctt ggaagaaaag aactaccaag ctatcgtcac tcacttcggt      840
gacttgggtg ttttgaagca attgccaggt ttggctatcc aaagattgat gcaaaagggt      900
tacggtttcg gtgctgaagg tgactggaag accgctgcta tggtcagaat catgaagatc      960
atgactgaag gtatgaagga cgctaagggt acttctatgt tggaagatta cacttacaac     1020
ttcgttccag gtaaagaagg tatcttgcaa gctcacatgt tggaaatctg tccatctatc     1080
gctgacggtc caatctccat caaggtcaac ccattgtcta tgggtgacag agaagatcca     1140
gctagattgg ttttcacctc caaggaaggt aaaggtatcg ctacttcttt gatcgacttg     1200
ggtgacagat tcagattgat catcaacacc gttgactgta agaagaacga aaagccaatg     1260
ccaaagttgc cagttgctac caacttctgg actccagaac cagacttggc tactggtgct     1320
gaagcctgga tcttgtgtgg tggtgctcac cacaccgctt tctcttacga catcactgct     1380
gaacaaatgg gtgactgggc tgctatgatg ggtatcgaag ctgtctacat cgacaaggat     1440
accactatca gaaacttgaa gaacgaattg agatggaacg aattggcttt cagaaagtaa     1500

<210>  35
<211>  1497
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the human microbiome dataset

<400>  35
atgaaggctg ctaaggatta caagttctgg ttctgtactg gttctcaaga tttgtacggt       60
gacgaatgtt tggctcacgt tgctgaacac tccagaatca tcgttgacgc tttgaacaag      120
tccggtgttt tgccatacga aatcgtctgg aagccaacct tgatcactaa cgaattgatc      180
agaaagacct tcaacgaagc taacgctgac gaaaactgtg ctggtgtcat cacctggatg      240
cacactttct ctccagctaa gtcttggatc ttgggtttgc aagagttcag aaagccattg      300
ttgcacttcc acacccaatt caacagagaa atcccatacg acactatcga catggatttc      360
atgaacgaaa accaagctgc tcacggtgac agagaatacg gtcacatcgt ttccagaatg      420
ggtatcgaaa gaaagatcat cgttggttac tgggaagaca gagatgtcca agaaaagatc      480
gcttcctgga tgttgaccgc tatcggtatc atggaatctt cccacatcag agtctgtaga      540
atcgctgaca acatgagaaa cgttgctgtc actgaaggtg acaaggttga agctcaaatc      600
aagttcggtt gggaaatcga cgcttaccca gtcaacgaaa tcgctgaata cgttgctgct      660
gtcccacaag gtgaaatcaa cgctttggtt gaagaatact actctaagta cgacatcatc      720
ttggaaggta gagatccaca agagttcaga gaacacgttg ctgtccaagc tggtatcgaa      780
atcggtttcg aaaagttctt ggaagaaaag aactaccaag ctatcgtcac tcacttcggt      840
gacttgggtt ctttgaagca attgccaggt ttggctatcc aaagattgat ggaaaagggt      900
tacggtttcg gtggtgaagg tgactggaag accgctgcta tggttagatt gatgaagatc      960
atgactgctg gtgtcaagaa cccaaagggt acttctttca tggaagacta cacttacaac     1020
ttggttccag gtaaagaagg tgtcttggaa gctcacatgt tggaagtttg tccatctgtc     1080
gctgatggtc caatcggtat caaggtttgt ccattgtcta tgggtgacag agaagatcca     1140
gctagattgg tctacacctc taagactggt ccagctatcg ctacctcctt gatcgacttg     1200
ggtaacagat tcagattgat catcaacgaa gttgaatgta agaaggtcga aaagccaatg     1260
ccaaagttgc cagttgctac cgctttctgg actccatacc cagacttgaa gactggtgct     1320
gaagcctgga tcttggctgg tggtgctcac cacaccgctt tctcttacga cttgactgct     1380
gaacaaatgg gtgactgggc tgctgctatg ggtatcgaag ctgtttacat cgacaaggat     1440
accactatca gaaacttcaa gagagacttg caattgggta acatcgtcta cagataa        1497

<210>  36
<211>  1497
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the human microbiome dataset

<400>  36
atggttactg gtagaaacta caagttctgg ttctgtactg gttcccaaga tttgtacggt       60
gacgaatgtt tgagaaaggt tgctgaacac tccagaatca tcgttgaaga attgaacaag      120
tccggtgttt tgccattcga attggtctgg aagccaacct tgatcactaa cgaattgatc      180
agaaagacct tcaacgaagc taacgctgac gatgaatgtg ctggtgtcat cacctggatg      240
cacactttct ctccagctaa gtcttggatc ttgggtttga aggaatacag aaagccattg      300
tgtcacttgc acacccaatt caaccaagaa atcccatacg acactatcga catggatttc      360
atgaacgaaa accaatctgc tcacggtggt agagaatacg gtcacatcgt tactagaatg      420
ggtatcgaaa gaaaggttat cgtcggtcac tgggctgaca agaaggttca agaaagattg      480
gcttcctgga tgagaaccgc tgtcggtatc atggaatctt cccacatcag agtttgtaga      540
gtcgctgaca acatgagaaa cgttgctgtc actgaaggtg acaaggttga agctcaaatc      600
aagttcggtt gggaagttga cgcttaccca gttaacgaag tctgtgacta cgttaaggat      660
gtctctaagg gtgacatcga tgttttggtc gaagaatact acaacaagta cgacatcttg      720
ttcgaaggta gagatccaga agagttcaag agacacgttg ctgtccaagc tgctatcgaa      780
atcggtttcg aaagattctt ggaagaaaag aactaccaag ctgttgtcac ccacttcggt      840
gacttgggtg gtttgcaaca attgccaggt ttggctatgc aaagattgat ggaaaagggt      900
tacggtttcg gtgctgaagg tgactggaag accgctgcta tggttagatt gatgaagatc      960
atgactgctg gtgtcaagga cgctaagggt acttctttca tggaagatta cacttacaac     1020
ttggttccag gtaaagaagg tatcttgcaa tcccacatgt tggaagtttg tccaaccatc     1080
gctgacggta aaatcggtat caaggtctgt ccattgtcta tgggtgacag agaagatcca     1140
gctagattgt tcacctctaa gactggtcca gctgttgcta cttccttggt tgacttgggt     1200
gacagattca gattgatcat caacgacgtt gattgtaaga aggtcgaaaa gccaatgcca     1260
aagttgccag tcggttctgc tttctggacc ccacaaccag acttggctac tggtgctgaa     1320
gcctggatct tggctggtgg tgctcaccac accgctttct cctacgactt gactgctgaa     1380
caaatgggtg actgggctgc tgctatgggt atcgaagctg tttacatcga caaggatacc     1440
actatcagaa acttcaagaa cgaattgaga tggaacgaag tcgctttcag aaagtaa        1497

<210>  37
<211>  1476
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the cow rumen metagenome dataset

<400>  37
atggaagaca tcatgaagag agaattttgg ttcatcgttg gttcccaatt cttgtacggt       60
caagacgttt tggacactgt tgacgctaga gctaaggaaa tggctgctga attgtctaag      120
gtcttgccat acccattggt ctacaaggtt accgctaaga ctaacaagga aatcactgac      180
gtcatcaagg aagctaacta cagagatgaa tgtgctggta tcgttacctg gtgtcacact      240
ttctctccat ccaagatgtg gatcaacggt ttggctaact tgcaaaagcc atactgtcac      300
ttggctaccc aatacaacaa ggaaatccca aacgacgaaa tcgacatgga tttcatgaac      360
ttgaaccaag ctgctcacgg tgacagagaa cacggtttca tcgctgctag attgagattg      420
ccaagaaagg ttatcgctgg tttctggcaa gacgaaaaga tccacaagag attgtctgat      480
tggatgagag ctgctgttgg tgtcgctgtt tccaagaaga tgaaggtcat gagattcggt      540
gacaacatga gagaagtcgc tgttactgaa ggtgacaagg tcgaagttca aactaagttg      600
ggttggcaag ttaacacctg ggctgtcggt gacttggtta aggaaatggg taaagtcacc      660
gaagctgaaa tcgatgcttt ggttgctgaa tacgaagcta actacgacat cgctaccgat      720
aacactgctg ctatcagata ccaagctaga gaagaaatcg ctatgaagaa gatgttggac      780
agagaaggtt gtagagcttt caccaacact ttccaagatt tgtacggtat ggaacaattg      840
ccaggtttgg cttctcaaca cttgatggct caaggttacg gttacggtgg tgaaggtgac      900
tggaaggtct ctgctatgac tgctatcttg aaggctatgg gtgaaaacgg taacggtgct      960
tccggtttca tggaagacta cacctaccac ttggtcgaag gtcaagaata ctctttgggt     1020
gctcacatgt tggaagtttg tccatccttg gctgctgaca agccaagaat cgaaactcac     1080
cacttgggta tcggtatgaa cgaaaaggac ccagctagat tggttttcga aggtaaagct     1140
ggtaaaggta tcgttgtctc tttgatcgac atgggtggta gattgagatt gatcgtccaa     1200
gatatcgaag ctgttaagcc aatcttgcca atgccaaact tgccagtcgc tagagttatg     1260
tggagagcta tgccagactt gaccactggt gtcgaatgtt ggatcaccgc tggtggtgct     1320
caccacactg tcttgtccta cgacgttacc gctgaacaaa tgagagactg ggctagaatg     1380
atggatatcg aatttgttca catcaccaag gacactaccc cagaaaagtt ggaagaagaa     1440
ttgttggtta aggatttggt ttggaagttg aagtaa                               1476

<210>  38
<211>  1467
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the cow rumen metagenome dataset

<400>  38
atgtctaagg aattttggtt cgtcgtcggt tcccaagatt tgtacggtga agaagttttg       60
aagatcgtcg ctgaaagagc tgctgaaatg gctgcttggt tgtctgaaaa gttgccatac      120
ccattgatct acaaggtcac tgctatgtct tccaaccaaa tcacctccgt tatgaaggaa      180
gctaacttcg acgataactg tttgggtgtt gtcacctggt gtcacacttt ctctccatcc      240
aagatgtggt tgactggttt ggacttgttg caaaagcctt ggtgtcactt cgctacccaa      300
tacaacttgg aaatcccaaa cgaagaaatc gacatggatt tcatgaactt gaaccaagct      360
gctcacggtg acagagaaca cggtttcatc ggtgctagat tgagaaaggc tagaaaggtt      420
gtcgctggtt actggaagga cgaaaaggtc atcgctagat tggctgaatt tcaaaaggtt      480
gctgtcggtg ttgacgcttc taagcacatg aaggttatga gattcggtga caacatgaga      540
gatgtcgctg ttactgaagg tgacaaggtc gaagttcaaa agaagttggg ttgggaagtc      600
aacacttggg ctgtcggtga cttggttaag gaaatgaacg ctgtcaccga tgaagaagtt      660
gaagccttgt tcaacgaata caaggcttct tacgacatca acactgataa catctacgct      720
atcaagtacc aagctagaga agaaatcgct atcaagaaga tgatggacag aaacggttgt      780
aaggctttct ccaacacctt ccaagatttg tacggtatgg aacaattgcc aggtttggct      840
tctcaacact tgatgtcctt gggttacggt tacggtggtg aaggtgactg gaaggtttct      900
gctatgactg ctatcttgaa ggctatgggt gaaaacggta acggtgcttc cgctttcatg      960
gaagactaca cctaccactt ggtcaagggt cacgaatact ctttgggtgc tcacatgttg     1020
gaagtttgtc catccttggc tgctgacaag ccaagaatcg aaacccacca cttgggtatc     1080
ggtatgaacg aaaaggaccc agctagattg gtcttcgaag gtaaagaagg tagaggtatc     1140
gttgcttctt tgatcgacat gggtggtaga ttgagattga tcgtccaaga tatcgaagct     1200
gttaagccaa tcatgccaat gccaaacttg ccagtcgcta gagttatgtg gagagctttg     1260
ccagacttga ctgatggtgt cgaatgttgg atcaccgctg gtggtgctca ccacactgtc     1320
ttgtcttacg acgttacccc agaaatgatg agagacttcg ctaagttcat ggatatcgaa     1380
tttgttcaca tcgacaagga taccaccgtt gaaaagttgg aagatgaatt gatggttaag     1440
gatttggttt ggaagatgaa gggttaa                                         1467

<210>  39
<211>  1467
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the cow rumen metagenome dataset

<400>  39
atgggtaaca agaagaactt ctggttcgtc gtcggttctc aattcttgta cggtaacgaa       60
gttttggaaa ctgtcgctgc tagagctcaa gaaatggctg aaaagatgtc taagtccttg      120
ccatacgaat tgaagttcaa gggtatcgtc aagacctggg acgaagctac tcaatacgct      180
aaggaagcta acttcgacga taactgttgt ggtgttatca cctggtgtca cactttctct      240
ccatccaaga tgtggatcga agccttcaga ttgttgcaaa agccattgtt gcacttcgct      300
acccaataca acagatacat cccagacaag gaaatcgaca tggatttcat gaacttgaac      360
caagctgctc acggtgacag agaacacggt ttcatcatcg ctagaatgag attgcaacaa      420
aagatcgtca ccggtttctg ggaagaccaa ccagttttgg atgaaatcgg tacttggatg      480
agagctgctg tcgcttacga cttctccaga aacttgagag ttatgagatt cggtgacaac      540
atgagagaag tcgctgttac tgaaggtgac aaggtcgaag ctcaaatcaa gttcggttgg      600
caagttaaca cctggccagt cggtaaattg gttgaagaaa tcggtaaagt cactgaagaa      660
gaagttgacg aattgttgaa ggtctacacc gatacttacg aattggctac cgacgatatc      720
gaaactatca gataccaagc tagagaagaa atcgctatga agaagatgat gaccgctgaa      780
ggtgctaacg ctttcgttaa cactttccaa gacttgatcg gtatgaagca attgccaggt      840
atcgcttctc aacacttgat ggctcaaggt tacggttacg gtgctgaagg tgactggaag      900
ttgtctgctt tggtctccat cgttaagaag atgaccgaag gtatgaccgg tggtacttct      960
ttcatggaag actacactta ccacttggac ccaaacgctg aatacgcttt gggtgctcac     1020
atgttggaag tctgtccatc catcgctgct gacaagccaa gaatcgaagt tcacccattg     1080
ggtatcggtg acagagaaga tccagctaga ttggtcttcg aaggtcacga aggtgacgct     1140
gttgtcgtta ccttgatcga tatgggtgaa agattcagaa tgttggtcca agacatccac     1200
tgtgttaagc caatctacga aatgccaaac ttgccagtcg ctagagttat gtgggaaggt     1260
aaaccatctt tgaacgaagg tttgaagatg tggttgatgg ctggtggtgc tcaccactct     1320
gtcttgtcct acgacgctac cccagaaatg ttgaaggact tggctagaat gatggatatc     1380
gaatttgttc acatcactgc tgactccaag ccagaagaat ttgaaaagga cttgttcttc     1440
gctgatttgg cttggaagtt gaagtaa                                         1467

<210>  40
<211>  1413
<212>  DNA
<213>  unknown

<220>
<223>  Sequence from the cow rumen metagenome dataset

<400>  40
atgaagaaga tttacttcat cactggttcc caagacttgt acggtgaaga cgttttgaag       60
actgtcgcta aggactccca agaaatggtt aacttcttgg acgaacaagt cggtgaaaga      120
gctgaaatcg aatttttggg tgttgtcaga gactctgaaa tctgtttgga tttcatcttg      180
aaggctaact tcgacaagga atgtatcggt atcatcacct ggatgcacac tttctcccca      240
gctaagatgt ggatcagagg tttgaaggtt ttgcacaagc caatgttgca cttgcacacc      300
caatacaacg aaaagttgcc atacgactct atcgacatgg atttcatgaa cttgaaccaa      360
gctgctcacg gtgacagaga atttggtttc atcgctgcta gaatgaacat caagcaacac      420
gttttgtccg gttactacaa gaacaaggac ttcatcgaag gtgttaagca atacatcgac      480
gtctgtttgt ctatcgatgc tgctaagtac ttgagagtcg ctatgttcgg ttccaacatg      540
agagacgttg ctgtcactga cggtgacaga gttcaatctg aaatcgactt cggttggaac      600
gtcaactact acggtatcgg tgacttggtt gatatcatca acaaggtcaa ggacgaagaa      660
atcgatgctc aattcgaaga atacaagaag agatacacca tcaacaccac taacatcgaa      720
gctatcaagg aacaagccaa gtacgaagtt gctttgaaga agttcatcaa gaaggaaaac      780
gtccaagcct tcaccgacaa cttccaagat ttgcacggtt tgaagcaatt gccaggtttg      840
gctgttcaag acttgatgca agaaggtatc tctttcggtc cagaaggtga ctacaagacc      900
ccagctttgt tggctacctt gttgccaatg actaagtaca gaaagggtgc taccggtttc      960
atcgaagact acacttacga tttgatcgaa ggtaaagaaa tcgaattggg ttctcacatg     1020
ttggaagttc caccatcttt cgctacttcc aagccagaaa tccaagtcag accattgtct     1080
atcggtgaca aggctgctcc agctagattg gttttcgatt ccatcgaagg tgaaggtttg     1140
caaatcacta tggttgacat gggtactcac ttcagaatca tcgctgctaa gatccaattg     1200
gttaagcaac cagctccaat gccaaagttg ccagtcgcta gaatcatgtg gaagcacatc     1260
ccaaacttca agatttctac cgaagcctgg atgttgtacg gtggtggtca ccactccgtc     1320
atcaccactg ctttgactat cgaagacatc aagttgttcg ctaagttgac tggtactgaa     1380
ttgtgtgtta tcgatgaaaa cactaagatt taa                                  1413

<210>  41
<211>  496
<212>  PRT
<213>  Bacillus subtilis

<400>  41
Met Leu Gln Thr Lys Asp Tyr Glu Phe Trp Phe Val Thr Gly Ser Gln 
1               5                   10                  15      
His Leu Tyr Gly Glu Glu Thr Leu Glu Leu Val Asp Gln His Ala Lys 
            20                  25                  30          
Ser Ile Cys Glu Gly Leu Ser Gly Val Ser Ser Arg Tyr Lys Ile Thr 
        35                  40                  45              
His Lys Pro Val Val Thr Ser Ser Glu Thr Ile Arg Gln Leu Leu Arg 
    50                  55                  60                  
Glu Ala Glu Tyr Ser Glu Thr Cys Ala Gly Ile Ile Thr Trp Met His 
65                  70                  75                  80  
Thr Phe Ser Pro Ala Lys Met Trp Ile Glu Gly Leu Ser Ser Tyr Gln 
                85                  90                  95      
Lys Pro Leu Met His Leu His Thr Gln Tyr Asn Arg Asp Ile Pro Trp 
            100                 105                 110         
Gly Thr Ile Asp Met Asp Phe Met Asn Ser Asn Gln Ser Ala His Gly 
        115                 120                 125             
Asp Arg Glu Tyr Gly Tyr Ile Asn Ser Arg Met Gly Leu Ser Arg Lys 
    130                 135                 140                 
Val Val Ala Gly Tyr Trp Asp Asp Glu Glu Val Lys Lys Glu Ile Ser 
145                 150                 155                 160 
Gln Trp Met Asp Thr Ala Ala Ala Leu Asn Glu Ser Arg His Ile Lys 
                165                 170                 175     
Val Ala Arg Phe Gly Asp Asn Met Arg His Val Ala Val Thr Asp Gly 
            180                 185                 190         
Asp Lys Val Gly Ala His Ile Gln Phe Gly Trp Gln Val Asp Gly Tyr 
        195                 200                 205             
Gly Ile Gly Asp Leu Val Glu Val Met Asn Arg Ile Thr Asp Asp Glu 
    210                 215                 220                 
Val Asp Thr Leu Tyr Ala Glu Tyr Asp Arg Leu Tyr Val Ile Ser Glu 
225                 230                 235                 240 
Glu Thr Lys Arg Asp Glu Ala Lys Val Ala Ser Ile Lys Glu Gln Ala 
                245                 250                 255     
Lys Ile Glu Leu Gly Leu Thr Thr Phe Leu Glu Gln Gly Gly Tyr Ser 
            260                 265                 270         
Ala Phe Thr Thr Ser Phe Glu Val Leu His Gly Met Lys Gln Leu Pro 
        275                 280                 285             
Gly Leu Ala Val Gln Arg Leu Met Glu Lys Gly Tyr Gly Phe Ala Gly 
    290                 295                 300                 
Glu Gly Asp Trp Lys Thr Ala Ala Leu Val Arg Met Met Lys Ile Met 
305                 310                 315                 320 
Ser Gln Gly Lys Arg Thr Ser Phe Met Glu Asp Tyr Thr Tyr His Phe 
                325                 330                 335     
Glu Pro Gly Asn Glu Met Ile Leu Gly Ser His Met Leu Glu Val Cys 
            340                 345                 350         
Pro Thr Val Ala Leu Asp Gln Pro Lys Ile Glu Val His Pro Leu Ser 
        355                 360                 365             
Ile Gly Gly Lys Glu Asp Pro Ala Arg Phe Val Phe Asn Gly Ile Ser 
    370                 375                 380                 
Gly Ser Ala Ile Gln Ala Ser Leu Val Asp Ile Gly Gly Arg Phe Arg 
385                 390                 395                 400 
Leu Val Leu Asn Glu Val Asn Gly Gln Glu Ile Glu Lys Asp Met Pro 
                405                 410                 415     
Asn Leu Pro Val Ala Arg Val Leu Trp Lys Pro Glu Pro Ser Leu Lys 
            420                 425                 430         
Thr Ala Ala Glu Ala Trp Ile Leu Ala Gly Gly Ala His His Thr Cys 
        435                 440                 445             
Leu Ser Tyr Glu Leu Thr Val Glu Gln Met Leu Asp Trp Ala Glu Met 
    450                 455                 460                 
Ala Gly Ile Glu Ser Val Leu Ile Ser Arg Asp Thr Thr Ile His Lys 
465                 470                 475                 480 
Leu Lys His Glu Leu Lys Trp Asn Glu Ala Leu Tyr Arg Leu Gln Lys 
                485                 490                 495     

<210>  42
<211>  1491
<212>  DNA
<213>  artificial sequence

<220>
<223>  Codon-optimized coding region of B. subtilis AI.

<400>  42
atgttgcaaa ctaaggatta cgaattctgg ttcgttactg gttctcaaca cttgtacggt       60
gaagaaactt tggaattggt cgatcaacac gctaagtcta tctgtgaagg tttgtccggt      120
gtctcttcca gatacaagat cacccacaag ccagttgtca cctcttccga aactatcaga      180
caattgttga gagaagctga atactctgaa acttgtgctg gtatcatcac ctggatgcac      240
actttctctc cagctaagat gtggatcgaa ggtttgtctt cctaccaaaa gccattgatg      300
cacttgcaca cccaatacaa cagagacatc ccttggggta ctatcgacat ggatttcatg      360
aactctaacc aatccgctca cggtgacaga gaatacggtt acatcaactc cagaatgggt      420
ttgtccagaa aggttgtcgc tggttactgg gacgatgaag aagtcaagaa ggaaatctct      480
caatggatgg acaccgctgc tgctttgaac gaatccagac acatcaaggt tgctagattc      540
ggtgacaaca tgagacacgt tgctgtcact gacggtgaca aggttggtgc tcacatccaa      600
ttcggttggc aagttgacgg ttacggtatc ggtgacttgg ttgaagtcat gaacagaatc      660
accgacgatg aagttgacac tttgtacgct gaatacgata gattgtacgt catctctgaa      720
gaaaccaaga gagacgaagc taaggttgct tccatcaagg aacaagctaa gatcgaattg      780
ggtttgacca ctttcttgga acaaggtggt tactctgctt tcaccacttc cttcgaagtc      840
ttgcacggta tgaagcaatt gccaggtttg gctgttcaaa gattgatgga aaagggttac      900
ggtttcgctg gtgaaggtga ctggaagacc gctgctttgg tcagaatgat gaagatcatg      960
tctcaaggta aaagaacctc cttcatggaa gactacactt accacttcga accaggtaac     1020
gaaatgatct tgggttctca catgttggaa gtttgtccaa ctgtcgcttt ggaccaacca     1080
aagatcgaag ttcacccatt gtctatcggt ggtaaagaag atccagctag attcgtcttc     1140
aacggtatct ctggttccgc tatccaagcc tctttggttg acatcggtgg tagattcaga     1200
ttggttttga acgaagtcaa cggtcaagaa atcgaaaagg acatgccaaa cttgccagtt     1260
gctagagtct tgtggaagcc agaaccatct ttgaagactg ctgctgaagc ctggatcttg     1320
gctggtggtg ctcaccacac ctgtttgtct tacgaattga ctgtcgaaca aatgttggac     1380
tgggctgaaa tggctggtat cgaatctgtt ttgatctcca gagataccac tatccacaag     1440
ttgaagcacg aattgaagtg gaacgaagcc ttgtacagat tgcaaaagta a              1491

<210>  43
<211>  16404
<212>  DNA
<213>  artificial sequence

<220>
<223>  constructed plasmid

<400>  43
aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat       60
gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc      120
tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga      180
agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg      240
gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta      300
gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta tgttgtgtgg      360
aattgtgagc ggataacaat ttcacacagg aaacagctat gaccatgatt aggcgcctac      420
ttctaggggg cctatcaagt aaattactcc tggtacactg aagtatataa gggatataga      480
agcaaatagt tgtcagtgca atccttcaag acgattggga aaatactgta atataaatcg      540
taaaggaaaa ttggaaattt tttaaagatg tcttcactgg ttactcttaa taacggtctg      600
aaaatgcccc tagtcggctt agggtgctgg aaaattgaca aaaaagtctg tgcgaatcaa      660
atttatgaag ctatcaaatt aggctaccgt ttattcgatg gtgcttgcga ctacggcaac      720
gaaaaggaag ttggtgaagg tatcaggaaa gccatctccg aaggtcttgt ttctagaaag      780
gatatatttg ttgtttcaaa gttatggaac aattttcacc atcctgatca tgtaaaatta      840
gctttaaaga agaccttaag cgatatggga cttgattatt tagacctgta ttatattcac      900
ttcccaatcg ccttcaaata tgttccattt gaagagaaat accctccagg attctatacg      960
ggcgcagaag gattctatac gggcgcagaa ctagtgatct cgaggttcca gagctcggat     1020
ccaccacagg tgttgtcctc tgaggacata aaatacacac cgagattcat caactcattg     1080
ctggagttag catatctaca attgggtgaa atggggagcg atttgcaggc atttgctcgg     1140
catgccggta gaggtgtggt caataagagc gacctcatgc tatacctgag aaagcaacct     1200
gacctacagg aaagagttac tcaagaataa gaattttcgt tttaaaacct aagagtcact     1260
ttaaaatttg tatacactta ttttttttat aacttattta ataataaaaa tcataaatca     1320
taagaaattc gcttactcat cccgggttag atgagagtct tttccagttc gcttaagggg     1380
acaatcttgg aattatagcg atcccaattt tcattatcca catcggatat gctttccatt     1440
acatgccatg gaaaattgtc attcagaaat ttatcaaaag gaactgcaat tttattagag     1500
tcatataaca atgaccacat ggccttataa caaccaccaa gggcacatga gtttggtgtt     1560
tctagcctaa aattaccctt tgtagcacca atgacttgag caaacttctt cacaatagca     1620
tcgtttttag aagccccacc tacaaaaaaa gtcctttctg gccttttatt taggtagtcc     1680
cgcagcggag attcatcgta atcaaacttc acgattgtat cttcgttcag tctctgttgt     1740
gagcttgcgt ttgaatccga aagcagggga gatattctta ccctgcaact taaagcctgt     1800
gattctacaa tatttttggc atcgtgcctc ttgtctttga acttggccac ctctctttca     1860
atcatacccg tttttggatt gaagataacc cttttgttta tggcttttac gctaggaacg     1920
atctccccca gaggaaaata tacacctaat tcattttcac tactttctga gtcatctagc     1980
acagcttgat taaaaagagt ccaatcgtta gtcttctcat aattattttc ccgttctttg     2040
tttaactcgt ctcttatcct ctcccttgcc aaagaaccat tacaataaca aatcataccc     2100
atataatggt ttggcagagt tggatgaatg aaaagatgat agttcggaga ggggtgatac     2160
ttatcggtga ccagaagaac tgtagtactt gttcctaggg aaacgagaac gtcattcttc     2220
cgcaggggta aagaacatat agtggctaaa ttatccccag tcatgggaga gaccttgcag     2280
tttgtattga aaccgtactt ctcaataaaa tatttacaga tggtacccgc tatcaaattt     2340
ttcatgggtg ctctcattaa tttttgtctg atagttttat ccttagaaga actatcaatt     2400
agatgtagta gctcatcact gaattttctt tcacgtatat cataaaggtt cataccacag     2460
gcatctgcct cctctaattc aacaagatgg cccactaaga tagaagtcaa aaaattagac     2520
actaaagaaa tggtctttgt tttttcgtaa gcttctggtt ctaattgtgc aattttcaga     2580
atttgaggac cagtaaatct aaaatgggct ctggaccctg ttaattgagc cattttttca     2640
ggcccaccta tgcactcttc aaactcttga cattgctttg cagtactgtg gtcttgccaa     2700
ttgggggcgg tttgccttgc aaatgctaca gagctcacgt agtgcaataa atctttttcc     2760
ggtttcttat tcaattgctc taacagagat tcggcttggg aggaccagta gacagacccg     2820
tgctgctggc aggaccctga gacggccata actttgttca atggaaattt agcctcgcga     2880
tatttcgaga gaaccagatc tagagcctct aaccacatgg ctacgggaca ttcgatagtg     2940
tcgccgtgta tatagacacc cttctttgtg tgataatgcg gaagatcctt ttcaaattcc     3000
actgtttctg aatggacaat ttttaggtcc tggttaatgg cgagacattt cagttgttgg     3060
gtcgaaagat caaacccaag atagtatgag tctaaagaca ttgtgttgga aacctctctt     3120
gtctgtctct gaattactga acacaacata ctagtcgtac ggttttattt tttacttata     3180
ttgctggtag ggtaaaaaaa tataactcct aggaataggt tgtctatatg tttttgtctt     3240
gcttctataa ttgtaacaaa caaggaaagg gaaaatactg ggtgtaaaag ccattgagtc     3300
aagttaggtc atccctttta tacaaaattt ttcaattttt tttccaagat tcttgtacga     3360
ttaattattt tttttttgcg tcctacagcg tgatgaaaat ttccgcctgc tgcaagatga     3420
gcgggaacgg gcgaaatgtg cacgcgcaca acttacgaaa cgcggatgag tcactgacag     3480
ccaccgcaga ggttctgact cctactgagc tctattggag gtggcagaac cggtaccgga     3540
ggagaccgct ataaccggtt tgaatttatt gtcacagtgt cacatcagcg gcaactcaga     3600
agtttgacag caagcaagtt catcattcga actagcctta ttgttttagt tcagtgacag     3660
cgaactgccg tactcgatgc tttatttctc acggtagagc ggaagaacag ataggggcag     3720
cgtgagaaga gttagaaagt aaatttttat cacgtctgaa gtattcttat tcataggaaa     3780
ttttgcaagg ttttttagct caataacggg ctaagttata taaggtgttc acgcgatttt     3840
cttgttatgt atacctcttc tggcgcgcct ctttttatta accttaattt ttattttaga     3900
ttcctgactt caactcaaga cgcacagata ttataacatc tgcataatag gcatttgcaa     3960
gaattactcg tgagtaagga aagagtgagg aactatcgca tacctgcatt taaagatgcc     4020
gatttgggcg cgaatccttt attttggctt caccctcata ctattatcag ggccagaaaa     4080
aggaagtgtt tccctccttc ttgaattgat gttaccctca taaagcacgt ggcctcttat     4140
cgagaaagaa attaccgtcg ctcgtgattt gtttgcaaaa agaacaaaac tgaaaaaacc     4200
cagacacgct cgacttcctg tcttcctatt gattgcagct tccaatttcg tcacacaaca     4260
aggtcctagc gacggctcac aggttttgta acaagcaatc gaaggttctg gaatggcggg     4320
aaagggttta gtaccacatg ctatgatgcc cactgtgatc tccagagcaa agttcgttcg     4380
atcgtactgt tactctctct ctttcaaaca gaattgtccg aatcgtgtga caacaacagc     4440
ctgttctcac acactctttt cttctaacca agggggtggt ttagtttagt agaacctcgt     4500
gaaacttaca tttacatata tataaacttg cataaattgg tcaatgcaag aaatacatat     4560
ttggtctttt ctaattcgta gtttttcaag ttcttagatg ctttcttttt ctctttttta     4620
cagatcatca aggaagtaat tatctacttt ttacaacaaa tataaaacac gtacgactag     4680
tatgactcaa ttcactgaca ttgataagtt ggccgtctcc accataagaa ttttggctgt     4740
ggacaccgta tccaaggcca actcaggtca cccaggtgct ccattgggta tggcaccagc     4800
tgcacacgtt ctatggagtc aaatgcgcat gaacccaacc aacccagact ggatcaacag     4860
agatagattt gtcttgtcta acggtcacgc ggtcgctttg ttgtattcta tgctacattt     4920
gactggttac gatctgtcta ttgaagactt gaaacagttc agacagttgg gttccagaac     4980
accaggtcat cctgaatttg agttgccagg tgttgaagtt actaccggtc cattaggtca     5040
aggtatctcc aacgctgttg gtatggccat ggctcaagct aacctggctg ccacttacaa     5100
caagccgggc tttaccttgt ctgacaacta cacctatgtt ttcttgggtg acggttgttt     5160
gcaagaaggt atttcttcag aagcttcctc cttggctggt catttgaaat tgggtaactt     5220
gattgccatc tacgatgaca acaagatcac tatcgatggt gctaccagta tctcattcga     5280
tgaagatgtt gctaagagat acgaagccta cggttgggaa gttttgtacg tagaaaatgg     5340
taacgaagat ctagccggta ttgccaaggc tattgctcaa gctaagttat ccaaggacaa     5400
accaactttg atcaaaatga ccacaaccat tggttacggt tccttgcatg ccggctctca     5460
ctctgtgcac ggtgccccat tgaaagcaga tgatgttaaa caactaaaga gcaaattcgg     5520
tttcaaccca gacaagtcct ttgttgttcc acaagaagtt tacgaccact accaaaagac     5580
aattttaaag ccaggtgtcg aagccaacaa caagtggaac aagttgttca gcgaatacca     5640
aaagaaattc ccagaattag gtgctgaatt ggctagaaga ttgagcggcc aactacccgc     5700
aaattgggaa tctaagttgc caacttacac cgccaaggac tctgccgtgg ccactagaaa     5760
attatcagaa actgttcttg aggatgttta caatcaattg ccagagttga ttggtggttc     5820
tgccgattta acaccttcta acttgaccag atggaaggaa gcccttgact tccaacctcc     5880
ttcttccggt tcaggtaact actctggtag atacattagg tacggtatta gagaacacgc     5940
tatgggtgcc ataatgaacg gtatttcagc tttcggtgcc aactacaaac catacggtgg     6000
tactttcttg aacttcgttt cttatgctgc tggtgccgtt agattgtccg ctttgtctgg     6060
ccacccagtt atttgggttg ctacacatga ctctatcggt gtcggtgaag atggtccaac     6120
acatcaacct attgaaactt tagcacactt cagatcccta ccaaacattc aagtttggag     6180
accagctgat ggtaacgaag tttctgccgc ctacaagaac tctttagaat ccaagcatac     6240
tccaagtatc attgctttgt ccagacaaaa cttgccacaa ttggaaggta gctctattga     6300
aagcgcttct aagggtggtt acgtactaca agatgttgct aacccagata ttattttagt     6360
ggctactggt tccgaagtgt ctttgagtgt tgaagctgct aagactttgg ccgcaaagaa     6420
catcaaggct cgtgttgttt ctctaccaga tttcttcact tttgacaaac aacccctaga     6480
atacagacta tcagtcttac cagacaacgt tccaatcatg tctgttgaag ttttggctac     6540
cacatgttgg ggcaaatacg ctcatcaatc cttcggtatt gacagatttg gtgcctccgg     6600
taaggcacca gaagtcttca agttcttcgg tttcacccca gaaggtgttg ctgaaagagc     6660
tcaaaagacc attgcattct ataagggtga caagctaatt tctcctttga aaaaagcttt     6720
ctaaattctg atcgtagatc atcagatttg atatgatatt atttgtgaaa aaatgaaata     6780
aaactttata caacttaaat acaacttttt ttataaacga ttaagcaaaa aaatagtttc     6840
aaacttttaa caatattcca aacactcagt ccttttcctt cttatattat aggtgtacgt     6900
attatagaaa aatttcaatg attacttttt ctttcttttt ccttgtacca gcacatggcc     6960
gagcttgaat gttaaaccct tcgagagaat cacaccattc aagtataaag ccaataaaga     7020
atataactcc taaaaggcta attgaaaccc tgtgattttt gcccgggttt aaggcgcgcc     7080
ctttatcatt atcaatactg ccatttcaaa gaatacgtaa ataattaata gtagtgattt     7140
tcctaacttt atttagtcaa aaaattagcc ttttaattct gctgtaaccc gtacatgccc     7200
aaaatagggg gcgggttaca cagaatatat aacatcgtag gtgtctgggt gaacagttta     7260
ttcctggcat ccactaaata taatggagcc cgctttttaa gctggcatcc agaaaaaaaa     7320
agaatcccag caccaaaata ttgttttctt caccaaccat cagttcatag gtccattctc     7380
ttagcgcaac tacagagaac aggggcacaa acaggcaaaa aacgggcaca acctcaatgg     7440
agtgatgcaa cctgcctgga gtaaatgatg acacaaggca attgacccac gcatgtatct     7500
atctcatttt cttacacctt ctattacctt ctgctctctc tgatttggaa aaagctgaaa     7560
aaaaaggttg aaaccagttc cctgaaatta ttcccctact tgactaataa gtatataaag     7620
acggtaggta ttgattgtaa ttctgtaaat ctatttctta aacttcttaa attctacttt     7680
tatagttagt ctttttttta gttttaaaac accaagaact tagtttcgaa taaacacaca     7740
taaacaaaca ccactagcat ggctgccggt gtcccaaaaa ttgatgcgtt agaatctttg     7800
ggcaatcctt tggaggatgc caagagagct gcagcataca gagcagttga tgaaaattta     7860
aaatttgatg atcacaaaat tattggaatt ggtagtggta gcacagtggt ttatgttgcc     7920
gaaagaattg gacaatattt gcatgaccct aaattttatg aagtagcgtc taaattcatt     7980
tgcattccaa caggattcca atcaagaaac ttgattttgg ataacaagtt gcaattaggc     8040
tccattgaac agtatcctcg cattgatata gcgtttgacg gtgctgatga agtggatgag     8100
aatttacaat taattaaagg tggtggtgct tgtctatttc aagaaaaatt ggttagtact     8160
agtgctaaaa ccttcattgt cgttgctgat tcaagaaaaa agtcaccaaa acatttaggt     8220
aagaactgga ggcaaggtgt tcccattgaa attgtacctt cctcatacgt gagggtcaag     8280
aatgatctat tagaacaatt gcatgctgaa aaagttgaca tcagacaagg aggttctgct     8340
aaagcaggtc ctgttgtaac tgacaataat aacttcatta tcgatgcgga tttcggtgaa     8400
atttccgatc caagaaaatt gcatagagaa atcaaactgt tagtgggcgt ggtggaaaca     8460
ggtttattca tcgacaacgc ttcaaaagcc tacttcggta attctgacgg tagtgttgaa     8520
gttaccgaaa agtgagcggc cgcgtgaatt tactttaaat cttgcattta aataaatttt     8580
ctttttatag ctttatgact tagtttcaat ttatatacta ttttaatgac attttcgatt     8640
cattgattga aagctttgtg ttttttcttg atgcgctatt gcattgttct tgtctttttc     8700
gccacatgta atatctgtag tagatacctg atacattgtg gatgctgagt gaaattttag     8760
ttaataatgg aggcgctctt aataattttg gggatattgg cttttttttt taaagtttac     8820
aaatgaattt tttccgccag gataacgatt ctgaagttac tcttagcgtt cctatcggta     8880
cagccatcaa atcatgccta taaatcatgc ctatatttgc gtgcagtcag tatcatctac     8940
atgaaaaaaa ctcccgcaat ttcttataga atacgttgaa aattaaatgt acgcgccaag     9000
ataagataac atatatctag atgcagtaat atacacagat tcccgcggac gtgggaagga     9060
aaaaattaga taacaaaatc tgagtgatat ggaaattccg ctgtatagct catatctttc     9120
cctccaccgc ggtggtcgac tttcacatac gttgcatacg tcgatataga taataatgat     9180
aatgacagca ggattatcgt aatacgtaat agctgaaaat ctcaaaaatg tgtgggtcat     9240
tacgtaaata atgataggaa tgggattctt ctatttttcc tttttccatt ctagcagccg     9300
tcgggaaaac gtggcatcct ctctttcggg ctcaattgga gtcacgctgc cgtgagcatc     9360
ctctctttcc atatctaaca actgagcacg taaccaatgg aaaagcatga gcttagcgtt     9420
gctccaaaaa agtattggat ggttaatacc atttgtctgt tctcttctga ctttgactcc     9480
tcaaaaaaaa aaatctacaa tcaacagatc gcttcaatta cgccctcaca aaaacttttt     9540
tccttcttct tcgcccacgt taaattttat ccctcatgtt gtctaacgga tttctgcact     9600
tgatttatta taaaaagaca aagacataat acttctctat caatttcagt tattgttctt     9660
ccttgcgtta ttcttctgtt cttctttttc ttttgtcata tataaccata accaagtaat     9720
acatattcaa acttaagact cgagatggtc aaaccaatta tagctcccag tatccttgct     9780
tctgacttcg ccaacttggg ttgcgaatgt cataaggtca tcaacgccgg cgcagattgg     9840
ttacatatcg atgtcatgga cggccatttt gttccaaaca ttactctggg ccaaccaatt     9900
gttacctccc tacgtcgttc tgtgccacgc cctggcgatg ctagcaacac agaaaagaag     9960
cccactgcgt tcttcgattg tcacatgatg gttgaaaatc ctgaaaaatg ggtcgacgat    10020
tttgctaaat gtggtgctga ccaatttacg ttccactacg aggccacaca agaccctttg    10080
catttagtta agttgattaa gtctaagggc atcaaagctg catgcgccat caaacctggt    10140
acttctgttg acgttttatt tgaactagct cctcatttgg atatggctct tgttatgact    10200
gtggaacctg ggtttggagg ccaaaaattc atggaagaca tgatgccaaa agtggaaact    10260
ttgagagcca agttccccca tttgaatatc caagtcgatg gtggtttggg caaggagacc    10320
atcccgaaag ccgccaaagc cggtgccaac gttattgtcg ctggtaccag tgttttcact    10380
gcagctgacc cgcacgatgt tatctccttc atgaaagaag aagtctcgaa ggaattgcgt    10440
tctagagatt tgctagatta gacgtctgtt taaagattac ggatatttaa cttacttaga    10500
ataatgccat ttttttgagt tataataatc ctacgttagt gtgagcggga tttaaactgt    10560
gaggacctta atacattcag acacttctgc ggtatcaccc tacttattcc cttcgagatt    10620
atatctagga acccatcagg ttggtggaag attacccgtt ctaagacttt tcagcttcct    10680
ctattgatgt tacacctgga cacccctttt ctggcatcca gtttttaatc ttcagtggca    10740
tgtgagattc tccgaaatta attaaagcaa tcacacaatt ctctcggata ccacctcggt    10800
tgaaactgac aggtggtttg ttacgcatgc taatgcaaag gagcctatat acctttggct    10860
cggctgctgt aacagggaat ataaagggca gcataattta ggagtttagt gaacttgcaa    10920
catttactat tttcccttct tacgtaaata tttttctttt taattctaaa tcaatctttt    10980
tcaatttttt gtttgtattc ttttcttgct taaatctata actacaaaaa acacatacat    11040
aaactaaaac gtacgactag tatgtctgaa ccagctcaaa agaaacaaaa ggttgctaac    11100
aactctctag aacaattgaa agcctccggc actgtcgttg ttgccgacac tggtgatttc    11160
ggctctattg ccaagtttca acctcaagac tccacaacta acccatcatt gatcttggct    11220
gctgccaagc aaccaactta cgccaagttg atcgatgttg ccgtggaata cggtaagaag    11280
catggtaaga ccaccgaaga acaagtcgaa aatgctgtgg acagattgtt agtcgaattc    11340
ggtaaggaga tcttaaagat tgttccaggc agagtctcca ccgaagttga tgctagattg    11400
tcttttgaca ctcaagctac cattgaaaag gctagacata tcattaaatt gtttgaacaa    11460
gaaggtgtct ccaaggaaag agtccttatt aaaattgctt ccacttggga aggtattcaa    11520
gctgccaaag aattggaaga aaaggacggt atccactgta atttgactct attattctcc    11580
ttcgttcaag cagttgcctg tgccgaggcc caagttactt tgatttcccc atttgttggt    11640
agaattctag actggtacaa atccagcact ggtaaagatt acaagggtga agccgaccca    11700
ggtgttattt ccgtcaagaa aatctacaac tactacaaga agtacggtta caagactatt    11760
gttatgggtg cttctttcag aagcactgac gaaatcaaaa acttggctgg tgttgactat    11820
ctaacaattt ctccagcttt attggacaag ttgatgaaca gtactgaacc tttcccaaga    11880
gttttggacc ctgtctccgc taagaaggaa gccggcgaca agatttctta catcagcgac    11940
gaatctaaat tcagattcga cttgaatgaa gacgctatgg ccactgaaaa attgtccgaa    12000
ggtatcagaa aattctctgc cgatattgtt actctattcg acttgattga aaagaaagtt    12060
accgcttaag gaagtatctc ggaaatatta atttaggcca tgtccttatg cacgtttctt    12120
ttgatactta cgggtacatg tacacaagta tatctatata tataaattaa tgaaaatccc    12180
ctatttatat atatgacttt aacgagacag aacagttttt tattttttat cctatttgat    12240
gaatgataca gtttcggatc cacgatcgca ttgcggatta cgtattctaa tgttcagtac    12300
cgttcgtata atgtatgcta tacgaagtta tgcagattgt actgagagtg caccatacca    12360
cagcttttca attcaattca tcattttttt tttattcttt tttttgattt cggtttcttt    12420
gaaatttttt tgattcggta atctccgaac agaaggaaga acgaaggaag gagcacagac    12480
ttagattggt atatatacgc atatgtagtg ttgaagaaac atgaaattgc ccagtattct    12540
taacccaact gcacagaaca aaaacctgca ggaaacgaag ataaatcatg tcgaaagcta    12600
catataagga acgtgctgct actcatccta gtcctgttgc tgccaagcta tttaatatca    12660
tgcacgaaaa gcaaacaaac ttgtgtgctt cattggatgt tcgtaccacc aaggaattac    12720
tggagttagt tgaagcatta ggtcccaaaa tttgtttact aaaaacacat gtggatatct    12780
tgactgattt ttccatggag ggcacagtta agccgctaaa ggcattatcc gccaagtaca    12840
attttttact cttcgaagac agaaaatttg ctgacattgg taatacagtc aaattgcagt    12900
actctgcggg tgtatacaga atagcagaat gggcagacat tacgaatgca cacggtgtgg    12960
tgggcccagg tattgttagc ggtttgaagc aggcggcaga agaagtaaca aaggaaccta    13020
gaggcctttt gatgttagca gaattgtcat gcaagggctc cctatctact ggagaatata    13080
ctaagggtac tgttgacatt gcgaagagcg acaaagattt tgttatcggc tttattgctc    13140
aaagagacat gggtggaaga gatgaaggtt acgattggtt gattatgaca cccggtgtgg    13200
gtttagatga caagggagac gcattgggtc aacagtatag aaccgtggat gatgtggtct    13260
ctacaggatc tgacattatt attgttggaa gaggactatt tgcaaaggga agggatgcta    13320
aggtagaggg tgaacgttac agaaaagcag gctgggaagc atatttgaga agatgcggcc    13380
agcaaaacta aaaaactgta ttataagtaa atgcatgtat actaaactca caaattagag    13440
cttcaattta attatatcag ttattaccct atgcggtgtg aaataccgca cagatgcgta    13500
aggagaaaat accgcatcag gaaattgtaa acgttaatat tttgttaaaa ttcgcgttaa    13560
atttttgtta aatcagctca ttttttaacc aataggccga aatcggcaaa atcccttata    13620
aatcaaaaga atagaccgag atagggttga gtgttgttcc agtttggaac aagagtccac    13680
tattaaagaa cgtggactcc aacgtcaaag ggcgaaaaac cgtctatcag ggcgatggcc    13740
cactacgtga accatcaccc taatcaagat aacttcgtat aatgtatgct atacgaacgg    13800
tacccgccaa ctctgttcga gaatgatgta atcaagaagg tctcacaaaa ccatccaggc    13860
agtaccactt cccaagtatt gcttagatgg gcaactcaga gaggcattgc cgtcattcca    13920
aaatcttcca agaaggaaag gttacttggc aacctagaaa tcgaaaaaaa gttcacttta    13980
acggagcaag aattgaagga tatttctgca ctaaatgcca acatcagatt taatgatcca    14040
tggacctggt tggatggtaa attccccact tttgcctgat ccagccagta aaatccatac    14100
tcaacgacga tatgaacaaa tttccctcat tccgatgctg tatatgtgta taaattttta    14160
catgctcttc tgtttagaca cagaacagct ttaaataaaa tgttggatat actttttctg    14220
cctgtggtgt catccacgct tttaattcat ctcttgtatg gttgacaatt tggctatttt    14280
ttaacagaac ccaacggtaa ttgaaattaa aagggaaacg agtgggggcg atgagtgagt    14340
gatacggcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat    14400
atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagc cccgacaccc    14460
gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca    14520
agctgtgacc gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg    14580
cgcgagacga aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat    14640
ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt    14700
atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct    14760
tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc    14820
cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa    14880
agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg    14940
taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt    15000
tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg    15060
catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac    15120
ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc    15180
ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa    15240
catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc    15300
aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc gcaaactatt    15360
aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga tggaggcgga    15420
taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta ttgctgataa    15480
atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc cagatggtaa    15540
gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg atgaacgaaa    15600
tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt cagaccaagt    15660
ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa ggatctaggt    15720
gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt cgttccactg    15780
agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt    15840
aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt tgccggatca    15900
agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga taccaaatac    15960
tgtccttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag caccgcctac    16020
atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata agtcgtgtct    16080
taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg gctgaacggg    16140
gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga gatacctaca    16200
gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca ggtatccggt    16260
aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa acgcctggta    16320
tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc    16380
gtcagggggg cggagcctat ggaa                                           16404

<210>  44
<211>  440
<212>  PRT
<213>  artificial sequence

<220>
<223>  constructed xylose isomerase

<400>  44
Met Ala Lys Glu Tyr Phe Pro Gln Ile Gln Lys Ile Gln Tyr Gln Gly 
1               5                   10                  15      
Pro Lys Ser Thr Asp Pro Leu Ser Phe Lys Tyr Tyr Asn Pro Glu Glu 
            20                  25                  30          
Val Ile Asn Gly Lys Thr Met Arg Glu His Leu Lys Phe Ala Leu Ser 
        35                  40                  45              
Trp Trp His Thr Met Gly Gly Asp Gly Thr Asp Met Phe Gly Cys Gly 
    50                  55                  60                  
Thr Thr Asp Lys Thr Trp Gly Gln Ser Asp Pro Ala Ala Arg Ala Lys 
65                  70                  75                  80  
Ala Lys Val Asp Ala Ala Phe Glu Ile Met Asp Lys Leu Ser Ile Asp 
                85                  90                  95      
Tyr Tyr Cys Phe His Asp Arg Asp Leu Ser Pro Glu Tyr Gly Ser Leu 
            100                 105                 110         
Lys Ala Thr Asn Asp Gln Leu Asp Ile Val Thr Asp Tyr Ile Lys Glu 
        115                 120                 125             
Lys Gln Gly Asp Lys Phe Lys Cys Leu Trp Gly Thr Ala Lys Cys Phe 
    130                 135                 140                 
Asp His Pro Arg Phe Met His Gly Ala Gly Thr Ser Pro Ser Ala Asp 
145                 150                 155                 160 
Val Phe Ala Phe Ser Ala Ala Gln Ile Lys Lys Ala Leu Glu Ser Thr 
                165                 170                 175     
Val Lys Leu Gly Ala Asn Gly Tyr Val Phe Trp Gly Gly Arg Glu Gly 
            180                 185                 190         
Tyr Glu Thr Leu Leu Asn Thr Asn Met Gly Leu Glu Leu Asp Asn Met 
        195                 200                 205             
Ala Arg Leu Met Lys Met Ala Val Glu Tyr Gly Arg Ser Ile Gly Phe 
    210                 215                 220                 
Lys Gly Asp Phe Tyr Ile Glu Pro Lys Pro Lys Glu Pro Thr Lys His 
225                 230                 235                 240 
Gln Tyr Asp Phe Asp Thr Ala Thr Val Leu Gly Phe Leu Arg Lys Tyr 
                245                 250                 255     
Gly Leu Asp Lys Asp Phe Lys Met Asn Ile Glu Ala Asn His Ala Thr 
            260                 265                 270         
Leu Ala Gln His Thr Phe Gln His Glu Leu Arg Val Ala Arg Asp Asn 
        275                 280                 285             
Gly Val Phe Gly Ser Ile Asp Ala Asn Gln Gly Asp Val Leu Leu Gly 
    290                 295                 300                 
Trp Asp Thr Asp Gln Phe Pro Thr Asn Ile Tyr Asp Thr Thr Met Cys 
305                 310                 315                 320 
Met Tyr Glu Val Ile Lys Ala Gly Gly Phe Thr Asn Gly Gly Leu Asn 
                325                 330                 335     
Phe Asp Ala Lys Ala Arg Arg Gly Ser Phe Thr Pro Glu Asp Ile Phe 
            340                 345                 350         
Tyr Ser Tyr Ile Ala Gly Met Asp Ala Phe Ala Leu Gly Phe Arg Ala 
        355                 360                 365             
Ala Leu Lys Leu Ile Glu Asp Gly Arg Ile Asp Lys Phe Val Ala Asp 
    370                 375                 380                 
Arg Tyr Ala Ser Trp Asn Thr Gly Ile Gly Ala Asp Ile Ile Ala Gly 
385                 390                 395                 400 
Lys Ala Asp Phe Ala Ser Leu Glu Lys Tyr Ala Leu Glu Lys Gly Glu 
                405                 410                 415     
Val Thr Ala Ser Leu Ser Ser Gly Arg Gln Glu Met Leu Glu Ser Ile 
            420                 425                 430         
Val Asn Asn Val Leu Phe Ser Leu 
        435                 440 

<210>  45
<211>  1323
<212>  DNA
<213>  artificial sequence

<220>
<223>  codon optimized coding region for constructed xylose isomerase 
       VDxylA

<400>  45
atggctaagg aatacttccc acaaatccaa aagatccaat accaaggtcc aaagtccact       60
gacccattgt ccttcaagta ctacaaccca gaagaagtca tcaacggtaa aaccatgaga      120
gaacacttga agttcgcttt gtcttggtgg cacaccatgg gtggtgacgg tactgatatg      180
ttcggttgtg gtactactga caagacttgg ggtcaatctg atccagctgc tagagctaag      240
gctaaggttg acgctgcttt cgaaatcatg gacaagttgt ccatcgatta ctactgtttc      300
cacgacagag atttgtctcc agaatacggt tccttgaagg ctaccaacga ccaattggat      360
atcgtcactg actacatcaa ggaaaagcaa ggtgacaagt tcaagtgttt gtggggtact      420
gctaagtgtt tcgaccaccc aagattcatg cacggtgctg gtacttctcc atccgctgac      480
gttttcgctt tctctgctgc tcaaatcaag aaggctttgg aatccaccgt taagttgggt      540
gctaacggtt acgtcttctg gggtggtaga gaaggttacg aaaccttgtt gaacactaac      600
atgggtttgg aattggacaa catggctaga ttgatgaaga tggctgttga atacggtaga      660
tctatcggtt tcaagggtga cttctacatc gaaccaaagc caaaggaacc aactaagcac      720
caatacgact tcgataccgc tactgtcttg ggtttcttga gaaagtacgg tttggacaag      780
gatttcaaga tgaacatcga agctaaccac gctaccttgg ctcaacacac tttccaacac      840
gaattgagag ttgctagaga caacggtgtc ttcggttcca tcgatgctaa ccaaggtgac      900
gttttgttgg gttgggacac cgatcaattc ccaactaaca tctacgacac cactatgtgt      960
atgtacgaag tcatcaaggc tggtggtttc accaacggtg gtttgaactt cgacgctaag     1020
gctagaagag gttctttcac tccagaagac atcttctact cctacatcgc tggtatggac     1080
gctttcgctt tgggtttcag agctgctttg aagttgatcg aagacggtag aatcgataag     1140
ttcgttgctg acagatacgc ttcttggaac accggtatcg gtgctgacat catcgctggt     1200
aaagctgatt tcgcttcttt ggaaaagtac gctttggaaa agggtgaagt cactgcttcc     1260
ttgtcctctg gtagacaaga aatgttggaa tccatcgtca acaacgtttt gttctctttg     1320
tga                                                                   1323

<210>  46
<211>  2966
<212>  DNA
<213>  artificial sequence

<220>
<223>  constructed chimeric expression cassette for VDxykA

<400>  46
tgacagcagg attatcgtaa tacgtaatag ttgaaaatct caaaaatgtg tgggtcatta       60
cgtaaataat gataggaatg ggattcttct atttttcctt tttccattct gtcgaccgca      120
cgccgaaatg catgcaagta acctattcaa agtaatatct catacatgtt tcatgagggt      180
aacaacatgc gactgggtga gcatatgttc cgctgatgtg atgtgcaaga taaacaagca      240
agacagaaac taacttcttc ttcatgtaat aaacacaccc cgcgtttatt tacctatctt      300
taaacttcaa caccttatat cataactaat atttcttgag ataagcacac tgcacccata      360
ccttccttaa aaacgtagct tccagttttt ggtggttctg gcttccttcc cgattccgcc      420
cgctaaacgc ataattttgt tgcctggtgg catttgcaaa atgcataacc tatgcattta      480
aaagattatg tatgctcttc tgacttttcg tgtgatgagg ctcgtggaaa aaatgaataa      540
tttatgaatt tgagaacaat tttgtgttgt tacggtattt tactatggaa taatcaatca      600
attgaggatt ttatgcaaat atcgtttgaa tatttttccg accctttgag tacttttctt      660
cataattgca taatattgtc cgctgcccgt ttttctgtta gacggtgtct tgatctactt      720
gctatcgttc aacaccacct tatcttctaa ctattttttt tttagctcat ttgaatcagc      780
ttatggtgat ggcacatttt tgcataaacc tagctgtcct cgttgaacat aggaaaaaaa      840
aatatataaa caaggctctt tcactctcct tggaatcaga tttgggtttg ttccctttat      900
tttcatattt cttgtcatat tcttttctca attattattt tctactcata acctcacgca      960
aaataacaca gtcaaatcaa tcaagtttaa acagtatggc taaggaatac ttcccacaaa     1020
tccaaaagat ccaataccaa ggtccaaagt ccactgaccc attgtccttc aagtactaca     1080
acccagaaga agtcatcaac ggtaaaacca tgagagaaca cttgaagttc gctttgtctt     1140
ggtggcacac catgggtggt gacggtactg atatgttcgg ttgtggtact actgacaaga     1200
cttggggtca atctgatcca gctgctagag ctaaggctaa ggttgacgct gctttcgaaa     1260
tcatggacaa gttgtccatc gattactact gtttccacga cagagatttg tctccagaat     1320
acggttcctt gaaggctacc aacgaccaat tggatatcgt cactgactac atcaaggaaa     1380
agcaaggtga caagttcaag tgtttgtggg gtactgctaa gtgtttcgac cacccaagat     1440
tcatgcacgg tgctggtact tctccatccg ctgacgtttt cgctttctct gctgctcaaa     1500
tcaagaaggc tttggaatcc accgttaagt tgggtgctaa cggttacgtc ttctggggtg     1560
gtagagaagg ttacgaaacc ttgttgaaca ctaacatggg tttggaattg gacaacatgg     1620
ctagattgat gaagatggct gttgaatacg gtagatctat cggtttcaag ggtgacttct     1680
acatcgaacc aaagccaaag gaaccaacta agcaccaata cgacttcgat accgctactg     1740
tcttgggttt cttgagaaag tacggtttgg acaaggattt caagatgaac atcgaagcta     1800
accacgctac cttggctcaa cacactttcc aacacgaatt gagagttgct agagacaacg     1860
gtgtcttcgg ttccatcgat gctaaccaag gtgacgtttt gttgggttgg gacaccgatc     1920
aattcccaac taacatctac gacaccacta tgtgtatgta cgaagtcatc aaggctggtg     1980
gtttcaccaa cggtggtttg aacttcgacg ctaaggctag aagaggttct ttcactccag     2040
aagacatctt ctactcctac atcgctggta tggacgcttt cgctttgggt ttcagagctg     2100
ctttgaagtt gatcgaagac ggtagaatcg ataagttcgt tgctgacaga tacgcttctt     2160
ggaacaccgg tatcggtgct gacatcatcg ctggtaaagc tgatttcgct tctttggaaa     2220
agtacgcttt ggaaaagggt gaagtcactg cttccttgtc ctctggtaga caagaaatgt     2280
tggaatccat cgtcaacaac gttttgttct ctttgtgagg ccctgcaggc cagaggaaaa     2340
taatatcaag tgctggaaac tttttctctt ggaatttttg caacatcaag tcatagtcaa     2400
ttgaattgac ccaatttcac atttaagatt tttttttttt catccgacat acatctgtac     2460
actaggaagc cctgtttttc tgaagcagct tcaaatatat atatttttta catatttatt     2520
atgattcaat gaacaatcta attaaatcga aaacaagaac cgaaacgcga ataaataatt     2580
tatttagatg gtgacaagtg tataagtcct catcgggaca gctacgattt ctctttcggt     2640
tttggctgag ctactggttg ctgtgacgca gcggcattag cgcggcgtta tgagctaccc     2700
tcgtggcctg aaagatggcg ggaataaagc ggaactaaaa attactgact gagccatatt     2760
gaggtcaatt tgtcaactcg tcaagtcacg tttggtggac ggcccctttc caacgaatcg     2820
tatatactaa catgcgcgcg cttcctatat acacatatac atatatatat atatatatat     2880
gtgtgcgtgt atgtgtacac ctgtatttaa tttccttact cgcgggtttt tcttttttct     2940
caattcttgg cttcctcttt ctcgag                                          2966

<210>  47
<211>  8601
<212>  DNA
<213>  artificial sequence

<220>
<223>  constructed plasmid

<400>  47
aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat       60
gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc      120
tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga      180
agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg      240
gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta      300
gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta tgttgtgtgg      360
aattgtgagc ggataacaat ttcacacagg aaacagctat gaccatgatt acgccaagct      420
tggcgccact tgtgcatgat taccgacaac caaaaccagt cacagattct attaatcctc      480
caaatgtaaa cataaccacc tccacaacca acaagaacct agatggcatt tatattttgc      540
cagctcctcg tatgaatccc ccggctcaaa cacaatacca aatgattcat gcgccagaca      600
gcatgcaaca tccaccaaca tttagtaaaa acaacacatc aagcaatcct aaatcccacc      660
aatactcaaa gtagaagatc agcatccttt caattgctga aaggttcacc taaagtaccg      720
ctcatattcc aaaaggattc ttcactacat agaaagggca gccaattgtg tgtttttcag      780
aaagggtttt aaaaaaacag gagggtgctt gttcttgttg ttccctacca tcgatggatt      840
tcgaaaaact atttatagga ccatctgatt ttcacctcca tcattgtatc atatactaac      900
aagcatatcc aaatttgtaa ttctatcatg aaatttccag agaaagaaac gcaagggaac      960
tgagaaatca aacactagtt gacagcagga ttatcgtaat acgtaatagt tgaaaatctc     1020
aaaaatgtgt gggtcattac gtaaataatg ataggaatgg gattcttcta tttttccttt     1080
ttccattctg tcgaccgcac gccgaaatgc atgcaagtaa cctattcaaa gtaatatctc     1140
atacatgttt catgagggta acaacatgcg actgggtgag catatgttcc gctgatgtga     1200
tgtgcaagat aaacaagcaa gacagaaact aacttcttct tcatgtaata aacacacccc     1260
gcgtttattt acctatcttt aaacttcaac accttatatc ataactaata tttcttgaga     1320
taagcacact gcacccatac cttccttaaa aacgtagctt ccagtttttg gtggttctgg     1380
cttccttccc gattccgccc gctaaacgca taattttgtt gcctggtggc atttgcaaaa     1440
tgcataacct atgcatttaa aagattatgt atgctcttct gacttttcgt gtgatgaggc     1500
tcgtggaaaa aatgaataat ttatgaattt gagaacaatt ttgtgttgtt acggtatttt     1560
actatggaat aatcaatcaa ttgaggattt tatgcaaata tcgtttgaat atttttccga     1620
ccctttgagt acttttcttc ataattgcat aatattgtcc gctgcccgtt tttctgttag     1680
acggtgtctt gatctacttg ctatcgttca acaccacctt atcttctaac tatttttttt     1740
ttagctcatt tgaatcagct tatggtgatg gcacattttt gcataaacct agctgtcctc     1800
gttgaacata ggaaaaaaaa atatataaac aaggctcttt cactctcctt ggaatcagat     1860
ttgggtttgt tccctttatt ttcatatttc ttgtcatatt cttttctcaa ttattatttt     1920
ctactcataa cctcacgcaa aataacacag tcaaatcaat caagtttaaa cagtatggct     1980
aaggaatact tcccacaaat ccaaaagatc caataccaag gtccaaagtc cactgaccca     2040
ttgtccttca agtactacaa cccagaagaa gtcatcaacg gtaaaaccat gagagaacac     2100
ttgaagttcg ctttgtcttg gtggcacacc atgggtggtg acggtactga tatgttcggt     2160
tgtggtacta ctgacaagac ttggggtcaa tctgatccag ctgctagagc taaggctaag     2220
gttgacgctg ctttcgaaat catggacaag ttgtccatcg attactactg tttccacgac     2280
agagatttgt ctccagaata cggttccttg aaggctacca acgaccaatt ggatatcgtc     2340
actgactaca tcaaggaaaa gcaaggtgac aagttcaagt gtttgtgggg tactgctaag     2400
tgtttcgacc acccaagatt catgcacggt gctggtactt ctccatccgc tgacgttttc     2460
gctttctctg ctgctcaaat caagaaggct ttggaatcca ccgttaagtt gggtgctaac     2520
ggttacgtct tctggggtgg tagagaaggt tacgaaacct tgttgaacac taacatgggt     2580
ttggaattgg acaacatggc tagattgatg aagatggctg ttgaatacgg tagatctatc     2640
ggtttcaagg gtgacttcta catcgaacca aagccaaagg aaccaactaa gcaccaatac     2700
gacttcgata ccgctactgt cttgggtttc ttgagaaagt acggtttgga caaggatttc     2760
aagatgaaca tcgaagctaa ccacgctacc ttggctcaac acactttcca acacgaattg     2820
agagttgcta gagacaacgg tgtcttcggt tccatcgatg ctaaccaagg tgacgttttg     2880
ttgggttggg acaccgatca attcccaact aacatctacg acaccactat gtgtatgtac     2940
gaagtcatca aggctggtgg tttcaccaac ggtggtttga acttcgacgc taaggctaga     3000
agaggttctt tcactccaga agacatcttc tactcctaca tcgctggtat ggacgctttc     3060
gctttgggtt tcagagctgc tttgaagttg atcgaagacg gtagaatcga taagttcgtt     3120
gctgacagat acgcttcttg gaacaccggt atcggtgctg acatcatcgc tggtaaagct     3180
gatttcgctt ctttggaaaa gtacgctttg gaaaagggtg aagtcactgc ttccttgtcc     3240
tctggtagac aagaaatgtt ggaatccatc gtcaacaacg ttttgttctc tttgtgaggc     3300
cctgcaggcc agaggaaaat aatatcaagt gctggaaact ttttctcttg gaatttttgc     3360
aacatcaagt catagtcaat tgaattgacc caatttcaca tttaagattt tttttttttc     3420
atccgacata catctgtaca ctaggaagcc ctgtttttct gaagcagctt caaatatata     3480
tattttttac atatttatta tgattcaatg aacaatctaa ttaaatcgaa aacaagaacc     3540
gaaacgcgaa taaataattt atttagatgg tgacaagtgt ataagtcctc atcgggacag     3600
ctacgatttc tctttcggtt ttggctgagc tactggttgc tgtgacgcag cggcattagc     3660
gcggcgttat gagctaccct cgtggcctga aagatggcgg gaataaagcg gaactaaaaa     3720
ttactgactg agccatattg aggtcaattt gtcaactcgt caagtcacgt ttggtggacg     3780
gcccctttcc aacgaatcgt atatactaac atgcgcgcgc ttcctatata cacatataca     3840
tatatatata tatatatatg tgtgcgtgta tgtgtacacc tgtatttaat ttccttactc     3900
gcgggttttt cttttttctc aattcttggc ttcctctttc tcgagcccgg gatttaaatg     3960
tggcttactc cattgttgat gcaaaagttg taaatttcac gaattattta atgcgttcct     4020
tgcaaccttc tattttgatg aacatcagga attgaaacaa aaaaaaggct tcaatctcaa     4080
cggaaaacgg gaagaaaact acactcgatt atactatata tgccaagaag attctccgac     4140
agattgtcta cttaatttca cataatatat cttgttttac tagcttatta tatagcgtcg     4200
catttaattc atggcgccat cacccgcagg gaatataacg acaaggccga taccacggga     4260
aaaatagggc gagcggaaat actaaaagaa aaataagctt ccgaaataaa acaccgacaa     4320
tgaagttctt ggcaaggttc ggttaggatc cgcattgcgg attacgtatt ctaatgttca     4380
gtaccgttcg tataatgtat gctatacgaa gttatgcaga ttgtactgag agtgcaccat     4440
accacagctt ttcaattcaa ttcatcattt tttttttatt cttttttttg atttcggttt     4500
ctttgaaatt tttttgattc ggtaatctcc gaacagaagg aagaacgaag gaaggagcac     4560
agacttagat tggtatatat acgcatatgt agtgttgaag aaacatgaaa ttgcccagta     4620
ttcttaaccc aactgcacag aacaaaaacc tgcaggaaac gaagataaat catgtcgaaa     4680
gctacatata aggaacgtgc tgctactcat cctagtcctg ttgctgccaa gctatttaat     4740
atcatgcacg aaaagcaaac aaacttgtgt gcttcattgg atgttcgtac caccaaggaa     4800
ttactggagt tagttgaagc attaggtccc aaaatttgtt tactaaaaac acatgtggat     4860
atcttgactg atttttccat ggagggcaca gttaagccgc taaaggcatt atccgccaag     4920
tacaattttt tactcttcga agacagaaaa tttgctgaca ttggtaatac agtcaaattg     4980
cagtactctg cgggtgtata cagaatagca gaatgggcag acattacgaa tgcacacggt     5040
gtggtgggcc caggtattgt tagcggtttg aagcaggcgg cagaagaagt aacaaaggaa     5100
cctagaggcc ttttgatgtt agcagaattg tcatgcaagg gctccctatc tactggagaa     5160
tatactaagg gtactgttga cattgcgaag agcgacaaag attttgttat cggctttatt     5220
gctcaaagag acatgggtgg aagagatgaa ggttacgatt ggttgattat gacacccggt     5280
gtgggtttag atgacaaggg agacgcattg ggtcaacagt atagaaccgt ggatgatgtg     5340
gtctctacag gatctgacat tattattgtt ggaagaggac tatttgcaaa gggaagggat     5400
gctaaggtag agggtgaacg ttacagaaaa gcaggctggg aagcatattt gagaagatgc     5460
ggccagcaaa actaaaaaac tgtattataa gtaaatgcat gtatactaaa ctcacaaatt     5520
agagcttcaa tttaattata tcagttatta ccctatgcgg tgtgaaatac cgcacagatg     5580
cgtaaggaga aaataccgca tcaggaaatt gtaaacgtta atattttgtt aaaattcgcg     5640
ttaaattttt gttaaatcag ctcatttttt aaccaatagg ccgaaatcgg caaaatccct     5700
tataaatcaa aagaatagac cgagataggg ttgagtgttg ttccagtttg gaacaagagt     5760
ccactattaa agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat     5820
ggcccactac gtgaaccatc accctaatca agataacttc gtataatgta tgctatacga     5880
acggtaccga gatacccact tcgaaagtta ctgatattat aactcttgtg tcctctcttc     5940
taatacctta ctttcacctt tctcacgtag ttaaagttgc aacaacacat tttgtcctca     6000
tccaatttct tctatagaat atccgtttgc ctccaggagt gaagaaatga tagcagtaac     6060
actgtgaaag cgagactaag agaaacgact taaagctcga agacttcttg aggatacgtt     6120
tatgtttctg tggcttcttc ttcgcggcgc ggttctcgcg tataggaatg ttctaagaca     6180
agaaggcatg aagttatgtt aacagattct atatctactc gctacgcata tataaacgga     6240
ttcatcattg aaacaatggt acttgtggta atgtgtacga cgatttcaac ccgaataaaa     6300
gcaaaagtgc aaaaaaaaaa caagaagcgc ttagcactgt tgaatcattt agaacactac     6360
taatgctggt aatacggcgc cgaattcact ggccgtcgtt ttacaacgtc gtgactggga     6420
aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg     6480
taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga     6540
atggcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac accgcatatg     6600
gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagcccc gacacccgcc     6660
aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc     6720
tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc     6780
gagacgaaag ggcctcgtga tacgcctatt tttataggtt aatgtcatga taataatggt     6840
ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt     6900
tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca     6960
ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt     7020
ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga     7080
tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa     7140
gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct     7200
gctatgtggc gcggtattat cccgtattga cgccgggcaa gagcaactcg gtcgccgcat     7260
acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga     7320
tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc     7380
caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat     7440
gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa     7500
cgacgagcgt gacaccacga tgcctgtagc aatggcaaca acgttgcgca aactattaac     7560
tggcgaacta cttactctag cttcccggca acaattaata gactggatgg aggcggataa     7620
agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc     7680
tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc     7740
ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag     7800
acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag accaagttta     7860
ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa     7920
gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc     7980
gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat     8040
ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga     8100
gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt     8160
ccttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata     8220
cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac     8280
cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg     8340
ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg     8400
tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag     8460
cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct     8520
ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc     8580
aggggggcgg agcctatgga a                                               8601

<210>  48
<211>  8645
<212>  DNA
<213>  artificial sequence

<220>
<223>  constructed plasmid

<400>  48
accttggctc aacacacttt ccaacacgaa ttgagagttg ctagagacaa cggtgtcttc       60
ggttccatcg atgctaacca aggtgacgtt ttgttgggtt gggacaccga tcaattccca      120
actaacatct acgacaccac tatgtgtatg tacgaagtca tcaaggctgg tggtttcacc      180
aacggtggtt tgaacttcga cgctaaggct agaagaggtt ctttcactcc agaagacatc      240
ttctactcct acatcgctgg tatggacgct ttcgctttgg gtttcagagc tgctttgaag      300
ttgatcgaag acggtagaat cgataagttc gttgctgaca gatacgcttc ttggaacacc      360
ggtatcggtg ctgacatcat cgctggtaaa gctgatttcg cttctttgga aaagtacgct      420
ttggaaaagg gtgaagtcac tgcttccttg tcctctggta gacaagaaat gttggaatcc      480
atcgtcaaca acgttttgtt ctctttgtga ggccctgcag gccagaggaa aataatatca      540
agtgctggaa actttttctc ttggaatttt tgcaacatca agtcatagtc aattgaattg      600
acccaatttc acatttaaga tttttttttt ttcatccgac atacatctgt acactaggaa      660
gccctgtttt tctgaagcag cttcaaatat atatattttt tacatattta ttatgattca      720
atgaacaatc taattaaatc gaaaacaaga accgaaacgc gaataaataa tttatttaga      780
tggtgacaag tgtataagtc ctcatcggga cagctacgat ttctctttcg gttttggctg      840
agctactggt tgctgtgacg cagcggcatt agcgcggcgt tatgagctac cctcgtggcc      900
tgaaagatgg cgggaataaa gcggaactaa aaattactga ctgagccata ttgaggtcaa      960
tttgtcaact cgtcaagtca cgtttggtgg acggcccctt tccaacgaat cgtatatact     1020
aacatgcgcg cgcttcctat atacacatat acatatatat atatatatat atgtgtgcgt     1080
gtatgtgtac acctgtattt aatttcctta ctcgcgggtt tttctttttt ctcaattctt     1140
ggcttcctct ttctcgagcc cgggatttaa atccttatac cctcatctta ccgcagtgcg     1200
gttttacgcc tcatgttatt tttgccagct ttaataacaa catcagtaat attacgttat     1260
tggatcttcc actccggttc gaggaaaaaa agagaggagg agaaacgcat aagctacaat     1320
aatgagtgag ttaacgcttt aatttgctca gtgatcattg ctagccggat atttgtgttt     1380
ttgtagaacc cagccatacc taatcatctc agtatattat ctgaattctt gactcaaaat     1440
atggattttc tcgaacgtct cacttccaat catcccatca aatgctgtgg atgaaaacat     1500
ttaaaacacg ggcaagaaaa aatgagtata gtatagttgt ggccagttct tgttgtacta     1560
atgcacttgc tttcgtatca aagttgaaga agcactgaaa gaacaacaaa aaatttatta     1620
aaagcaaaaa tcattctacg ttcaacgaaa atgtaaggca aatatatctt tatataggca     1680
agcataaccg acgcggatcc gcattgcgga ttacgtattc taatgttcag taccgttcgt     1740
ataatgtatg ctatacgaag ttatgcagat tgtactgaga gtgcaccata ccacagcttt     1800
tcaattcaat tcatcatttt ttttttattc ttttttttga tttcggtttc tttgaaattt     1860
ttttgattcg gtaatctccg aacagaagga agaacgaagg aaggagcaca gacttagatt     1920
ggtatatata cgcatatgta gtgttgaaga aacatgaaat tgcccagtat tcttaaccca     1980
actgcacaga acaaaaacct gcaggaaacg aagataaatc atgtcgaaag ctacatataa     2040
ggaacgtgct gctactcatc ctagtcctgt tgctgccaag ctatttaata tcatgcacga     2100
aaagcaaaca aacttgtgtg cttcattgga tgttcgtacc accaaggaat tactggagtt     2160
agttgaagca ttaggtccca aaatttgttt actaaaaaca catgtggata tcttgactga     2220
tttttccatg gagggcacag ttaagccgct aaaggcatta tccgccaagt acaatttttt     2280
actcttcgaa gacagaaaat ttgctgacat tggtaataca gtcaaattgc agtactctgc     2340
gggtgtatac agaatagcag aatgggcaga cattacgaat gcacacggtg tggtgggccc     2400
aggtattgtt agcggtttga agcaggcggc agaagaagta acaaaggaac ctagaggcct     2460
tttgatgtta gcagaattgt catgcaaggg ctccctatct actggagaat atactaaggg     2520
tactgttgac attgcgaaga gcgacaaaga ttttgttatc ggctttattg ctcaaagaga     2580
catgggtgga agagatgaag gttacgattg gttgattatg acacccggtg tgggtttaga     2640
tgacaaggga gacgcattgg gtcaacagta tagaaccgtg gatgatgtgg tctctacagg     2700
atctgacatt attattgttg gaagaggact atttgcaaag ggaagggatg ctaaggtaga     2760
gggtgaacgt tacagaaaag caggctggga agcatatttg agaagatgcg gccagcaaaa     2820
ctaaaaaact gtattataag taaatgcatg tatactaaac tcacaaatta gagcttcaat     2880
ttaattatat cagttattac cctatgcggt gtgaaatacc gcacagatgc gtaaggagaa     2940
aataccgcat caggaaattg taaacgttaa tattttgtta aaattcgcgt taaatttttg     3000
ttaaatcagc tcatttttta accaataggc cgaaatcggc aaaatccctt ataaatcaaa     3060
agaatagacc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa     3120
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg     3180
tgaaccatca ccctaatcaa gataacttcg tataatgtat gctatacgaa cggtacctga     3240
aaggtttacg atattagctg ctaagttctc ccttgcccta gtgaattacg tttttttcga     3300
ttaaaagggt tggtcccgaa attgacctct tatatgtacc tcaattgcag atagcatcaa     3360
atttagacta cgtcatcaac caaattacac cctatgaaaa acataagatt ataagcatgc     3420
gtttgtgtaa aggtcatacc attttatacg ttgacgcaat ggtgcaaata gaaaggttta     3480
cgaaagctct tattccgctt agcgcaattc tattgttaat ttcaagtaaa aagataaata     3540
tctcacgtcc ttatattatc tttgattaaa atttcacgat caatgtaacg aaaaaaggtc     3600
gcaaatattt ctttttcatc actctgatta aacaatggca tcatgaaacg ctcattggat     3660
cgttacacgt tctcattgta catatgtatg atttagaaca gcttgttcgc ctgggcgccg     3720
aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt     3780
aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc     3840
gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat gcggtatttt     3900
ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag tacaatctgc     3960
tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga cgcgccctga     4020
cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc     4080
atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata     4140
cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact     4200
tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg     4260
tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt     4320
atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct     4380
gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca     4440
cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc     4500
gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc     4560
cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg     4620
gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta     4680
tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc     4740
ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt     4800
gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg     4860
cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct     4920
tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc     4980
tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct     5040
cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac     5100
acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc     5160
tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat     5220
ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg     5280
accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc     5340
aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa     5400
ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag     5460
gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta     5520
ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta     5580
ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag     5640
ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg     5700
gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg     5760
cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag     5820
cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc     5880
cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa     5940
aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg     6000
ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct     6060
gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa     6120
gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg     6180
cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag     6240
ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga     6300
attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta cgccaagctt     6360
ggcgcccgag gtttgagtca atgtacgaga ttttgaaacc atgagggatg tgccctcaaa     6420
ctttatgatc tggtttcttt ttttatttat atcgttttta tgctgggtta cgttgttgtt     6480
ttatattcct tgtctagtat tacggtcagc agaggccaaa tcctatatgc attatatgtg     6540
gttagcattg actccttaat agaacggcca gttatcaatc aaattgccta taagcatgga     6600
tcactagcgg ctagtattgt attgcatttg gaatagccct cttcgttgag ccacaaagat     6660
atcttgcgga aatgtggcta aagcaacttt tgatgattgc tgggaaacgg tttggtaaac     6720
tgctgaagca attgttgtac acaatacttc tgcttttggc cccatctgga tttttcattt     6780
ccgcttttgc ctcaggtgca acaataacta tctttgagca tcgtaccacc aactagttga     6840
cagcaggatt atcgtaatac gtaatagttg aaaatctcaa aaatgtgtgg gtcattacgt     6900
aaataatgat aggaatggga ttcttctatt tttccttttt ccattctgtc gaccgcacgc     6960
cgaaatgcat gcaagtaacc tattcaaagt aatatctcat acatgtttca tgagggtaac     7020
aacatgcgac tgggtgagca tatgttccgc tgatgtgatg tgcaagataa acaagcaaga     7080
cagaaactaa cttcttcttc atgtaataaa cacaccccgc gtttatttac ctatctttaa     7140
acttcaacac cttatatcat aactaatatt tcttgagata agcacactgc acccatacct     7200
tccttaaaaa cgtagcttcc agtttttggt ggttctggct tccttcccga ttccgcccgc     7260
taaacgcata attttgttgc ctggtggcat ttgcaaaatg cataacctat gcatttaaaa     7320
gattatgtat gctcttctga cttttcgtgt gatgaggctc gtggaaaaaa tgaataattt     7380
atgaatttga gaacaatttt gtgttgttac ggtattttac tatggaataa tcaatcaatt     7440
gaggatttta tgcaaatatc gtttgaatat ttttccgacc ctttgagtac ttttcttcat     7500
aattgcataa tattgtccgc tgcccgtttt tctgttagac ggtgtcttga tctacttgct     7560
atcgttcaac accaccttat cttctaacta tttttttttt agctcatttg aatcagctta     7620
tggtgatggc acatttttgc ataaacctag ctgtcctcgt tgaacatagg aaaaaaaaat     7680
atataaacaa ggctctttca ctctccttgg aatcagattt gggtttgttc cctttatttt     7740
catatttctt gtcatattct tttctcaatt attattttct actcataacc tcacgcaaaa     7800
taacacagtc aaatcaatca agtttaaaca gtatggctaa ggaatacttc ccacaaatcc     7860
aaaagatcca ataccaaggt ccaaagtcca ctgacccatt gtccttcaag tactacaacc     7920
cagaagaagt catcaacggt aaaaccatga gagaacactt gaagttcgct ttgtcttggt     7980
ggcacaccat gggtggtgac ggtactgata tgttcggttg tggtactact gacaagactt     8040
ggggtcaatc tgatccagct gctagagcta aggctaaggt tgacgctgct ttcgaaatca     8100
tggacaagtt gtccatcgat tactactgtt tccacgacag agatttgtct ccagaatacg     8160
gttccttgaa ggctaccaac gaccaattgg atatcgtcac tgactacatc aaggaaaagc     8220
aaggtgacaa gttcaagtgt ttgtggggta ctgctaagtg tttcgaccac ccaagattca     8280
tgcacggtgc tggtacttct ccatccgctg acgttttcgc tttctctgct gctcaaatca     8340
agaaggcttt ggaatccacc gttaagttgg gtgctaacgg ttacgtcttc tggggtggta     8400
gagaaggtta cgaaaccttg ttgaacacta acatgggttt ggaattggac aacatggcta     8460
gattgatgaa gatggctgtt gaatacggta gatctatcgg tttcaagggt gacttctaca     8520
tcgaaccaaa gccaaaggaa ccaactaagc accaatacga cttcgatacc gctactgtct     8580
tgggtttctt gagaaagtac ggtttggaca aggatttcaa gatgaacatc gaagctaacc     8640
acgct                                                                 8645

<210>  49
<211>  8537
<212>  DNA
<213>  artificial sequence

<220>
<223>  constructed plasmid

<400>  49
aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat       60
gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc      120
tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga      180
agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg      240
gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta      300
gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta tgttgtgtgg      360
aattgtgagc ggataacaat ttcacacagg aaacagctat gaccatgatt acgccaagct      420
tggcgccttg agcattagtt gatcattacc gctgcagtca gttaatgata tccctaaata      480
ggcatattgt acaatttcaa atatattacc cgaattacaa gaacaatagg gtcttctaca      540
atattactat gaacaattca taggggaaca tagtccactt caatgtcggt gccaacgatt      600
ctatttgttg taagtatata tgctgtttcc cggcaatact ttatattggg ggaattttct      660
ggaggtttaa gggcatatta ctacgtcaag gctgggattg accttctggt atatattacg      720
ttgcgcagga cattccttat acttgagcct ttgtaaaaac ttcttcgttc tcctgaccaa      780
ccgttcaatt atgttgtgaa ttttttgatc cgagtttgaa acatccaaat tcacaaacaa      840
accgttgtca tcctgagttg gcgtactagt tgacagcagg attatcgtaa tacgtaatag      900
ttgaaaatct caaaaatgtg tgggtcatta cgtaaataat gataggaatg ggattcttct      960
atttttcctt tttccattct gtcgaccgca cgccgaaatg catgcaagta acctattcaa     1020
agtaatatct catacatgtt tcatgagggt aacaacatgc gactgggtga gcatatgttc     1080
cgctgatgtg atgtgcaaga taaacaagca agacagaaac taacttcttc ttcatgtaat     1140
aaacacaccc cgcgtttatt tacctatctt taaacttcaa caccttatat cataactaat     1200
atttcttgag ataagcacac tgcacccata ccttccttaa aaacgtagct tccagttttt     1260
ggtggttctg gcttccttcc cgattccgcc cgctaaacgc ataattttgt tgcctggtgg     1320
catttgcaaa atgcataacc tatgcattta aaagattatg tatgctcttc tgacttttcg     1380
tgtgatgagg ctcgtggaaa aaatgaataa tttatgaatt tgagaacaat tttgtgttgt     1440
tacggtattt tactatggaa taatcaatca attgaggatt ttatgcaaat atcgtttgaa     1500
tatttttccg accctttgag tacttttctt cataattgca taatattgtc cgctgcccgt     1560
ttttctgtta gacggtgtct tgatctactt gctatcgttc aacaccacct tatcttctaa     1620
ctattttttt tttagctcat ttgaatcagc ttatggtgat ggcacatttt tgcataaacc     1680
tagctgtcct cgttgaacat aggaaaaaaa aatatataaa caaggctctt tcactctcct     1740
tggaatcaga tttgggtttg ttccctttat tttcatattt cttgtcatat tcttttctca     1800
attattattt tctactcata acctcacgca aaataacaca gtcaaatcaa tcaagtttaa     1860
acagtatggc taaggaatac ttcccacaaa tccaaaagat ccaataccaa ggtccaaagt     1920
ccactgaccc attgtccttc aagtactaca acccagaaga agtcatcaac ggtaaaacca     1980
tgagagaaca cttgaagttc gctttgtctt ggtggcacac catgggtggt gacggtactg     2040
atatgttcgg ttgtggtact actgacaaga cttggggtca atctgatcca gctgctagag     2100
ctaaggctaa ggttgacgct gctttcgaaa tcatggacaa gttgtccatc gattactact     2160
gtttccacga cagagatttg tctccagaat acggttcctt gaaggctacc aacgaccaat     2220
tggatatcgt cactgactac atcaaggaaa agcaaggtga caagttcaag tgtttgtggg     2280
gtactgctaa gtgtttcgac cacccaagat tcatgcacgg tgctggtact tctccatccg     2340
ctgacgtttt cgctttctct gctgctcaaa tcaagaaggc tttggaatcc accgttaagt     2400
tgggtgctaa cggttacgtc ttctggggtg gtagagaagg ttacgaaacc ttgttgaaca     2460
ctaacatggg tttggaattg gacaacatgg ctagattgat gaagatggct gttgaatacg     2520
gtagatctat cggtttcaag ggtgacttct acatcgaacc aaagccaaag gaaccaacta     2580
agcaccaata cgacttcgat accgctactg tcttgggttt cttgagaaag tacggtttgg     2640
acaaggattt caagatgaac atcgaagcta accacgctac cttggctcaa cacactttcc     2700
aacacgaatt gagagttgct agagacaacg gtgtcttcgg ttccatcgat gctaaccaag     2760
gtgacgtttt gttgggttgg gacaccgatc aattcccaac taacatctac gacaccacta     2820
tgtgtatgta cgaagtcatc aaggctggtg gtttcaccaa cggtggtttg aacttcgacg     2880
ctaaggctag aagaggttct ttcactccag aagacatctt ctactcctac atcgctggta     2940
tggacgcttt cgctttgggt ttcagagctg ctttgaagtt gatcgaagac ggtagaatcg     3000
ataagttcgt tgctgacaga tacgcttctt ggaacaccgg tatcggtgct gacatcatcg     3060
ctggtaaagc tgatttcgct tctttggaaa agtacgcttt ggaaaagggt gaagtcactg     3120
cttccttgtc ctctggtaga caagaaatgt tggaatccat cgtcaacaac gttttgttct     3180
ctttgtgagg ccctgcaggc cagaggaaaa taatatcaag tgctggaaac tttttctctt     3240
ggaatttttg caacatcaag tcatagtcaa ttgaattgac ccaatttcac atttaagatt     3300
tttttttttt catccgacat acatctgtac actaggaagc cctgtttttc tgaagcagct     3360
tcaaatatat atatttttta catatttatt atgattcaat gaacaatcta attaaatcga     3420
aaacaagaac cgaaacgcga ataaataatt tatttagatg gtgacaagtg tataagtcct     3480
catcgggaca gctacgattt ctctttcggt tttggctgag ctactggttg ctgtgacgca     3540
gcggcattag cgcggcgtta tgagctaccc tcgtggcctg aaagatggcg ggaataaagc     3600
ggaactaaaa attactgact gagccatatt gaggtcaatt tgtcaactcg tcaagtcacg     3660
tttggtggac ggcccctttc caacgaatcg tatatactaa catgcgcgcg cttcctatat     3720
acacatatac atatatatat atatatatat gtgtgcgtgt atgtgtacac ctgtatttaa     3780
tttccttact cgcgggtttt tcttttttct caattcttgg cttcctcttt ctcgagcccg     3840
ggatttaaat gccgttgaca ttcatgtaag cagtattagc ttttaaactg ccattaacgc     3900
acctatacga ttgcggcggt gcgaaaacag taactagcac tcttacggct cattgtttcc     3960
tctatccaaa gttgcctata tcacaataaa attagaaatt aatcatttta ataagcttgc     4020
tgccattcag tgactcgcaa tctacgtttt aaacgtaatt taaaaatcca cttccaccta     4080
catattttgc aacagtcgta cgacagaaac tgaggctcta taaaatgaga tgtgctaatt     4140
gttctttctt ggctggtttc agctacatct ttctgtaatc aatctacaaa tttacacgcg     4200
agcttcattt tgacagtaaa caaatgtaaa agacacatgc aataaaagcg gtccagaaaa     4260
caaacgacaa agccaccaaa aatgtttgca aacgttggat ttagaacttt gagggggatc     4320
cgcattgcgg attacgtatt ctaatgttca gtaccgttcg tataatgtat gctatacgaa     4380
gttatgcaga ttgtactgag agtgcaccat accacagctt ttcaattcaa ttcatcattt     4440
tttttttatt cttttttttg atttcggttt ctttgaaatt tttttgattc ggtaatctcc     4500
gaacagaagg aagaacgaag gaaggagcac agacttagat tggtatatat acgcatatgt     4560
agtgttgaag aaacatgaaa ttgcccagta ttcttaaccc aactgcacag aacaaaaacc     4620
tgcaggaaac gaagataaat catgtcgaaa gctacatata aggaacgtgc tgctactcat     4680
cctagtcctg ttgctgccaa gctatttaat atcatgcacg aaaagcaaac aaacttgtgt     4740
gcttcattgg atgttcgtac caccaaggaa ttactggagt tagttgaagc attaggtccc     4800
aaaatttgtt tactaaaaac acatgtggat atcttgactg atttttccat ggagggcaca     4860
gttaagccgc taaaggcatt atccgccaag tacaattttt tactcttcga agacagaaaa     4920
tttgctgaca ttggtaatac agtcaaattg cagtactctg cgggtgtata cagaatagca     4980
gaatgggcag acattacgaa tgcacacggt gtggtgggcc caggtattgt tagcggtttg     5040
aagcaggcgg cagaagaagt aacaaaggaa cctagaggcc ttttgatgtt agcagaattg     5100
tcatgcaagg gctccctatc tactggagaa tatactaagg gtactgttga cattgcgaag     5160
agcgacaaag attttgttat cggctttatt gctcaaagag acatgggtgg aagagatgaa     5220
ggttacgatt ggttgattat gacacccggt gtgggtttag atgacaaggg agacgcattg     5280
ggtcaacagt atagaaccgt ggatgatgtg gtctctacag gatctgacat tattattgtt     5340
ggaagaggac tatttgcaaa gggaagggat gctaaggtag agggtgaacg ttacagaaaa     5400
gcaggctggg aagcatattt gagaagatgc ggccagcaaa actaaaaaac tgtattataa     5460
gtaaatgcat gtatactaaa ctcacaaatt agagcttcaa tttaattata tcagttatta     5520
ccctatgcgg tgtgaaatac cgcacagatg cgtaaggaga aaataccgca tcaggaaatt     5580
gtaaacgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag ctcatttttt     5640
aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac cgagataggg     5700
ttgagtgttg ttccagtttg gaacaagagt ccactattaa agaacgtgga ctccaacgtc     5760
aaagggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc accctaatca     5820
agataacttc gtataatgta tgctatacga acggtaccaa ttgtttctcc acatgtgtac     5880
cgtataaagt tcttgttatg aattagaagg attatcacaa aagccctcca tcaaatctga     5940
taatttcatt acccaaagac tgtaatacct tgggtgtaaa gatcgtattt tttgatgtct     6000
tcgtgatatt ttgcagatct ttccatttta taaatttctg caaatatcgt gcatcagctc     6060
atcttggaat tggcacaatc tgcgtaattt ttttcttggt tctcttctcc ttttgatttt     6120
aagagcgctt tctattcagt agcagatttt cagtaaataa agccatcttt ctcaagtaaa     6180
ctcatgatca tgcagttgtt caaaattaca ccaataacga tgttatcttt aaaactttcc     6240
ccagaaagaa gatggcactt ttgcactaaa tacttcaaac tatttaactt tgaagcagga     6300
catattgccg gggcgccgaa ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac     6360
cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat     6420
agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg     6480
cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg catatggtgc     6540
actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca cccgccaaca     6600
cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag acaagctgtg     6660
accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa acgcgcgaga     6720
cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct     6780
tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc     6840
taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa     6900
tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt     6960
gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct     7020
gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc     7080
cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta     7140
tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac     7200
tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc     7260
atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac     7320
ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg     7380
gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac     7440
gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc     7500
gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt     7560
gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga     7620
gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc     7680
cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag     7740
atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca     7800
tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc     7860
ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca     7920
gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc     7980
tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta     8040
ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt     8100
ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc     8160
gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg     8220
ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg     8280
tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag     8340
ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc     8400
agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat     8460
agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg     8520
gggcggagcc tatggaa                                                    8537

<210>  50
<211>  6728
<212>  DNA
<213>  artificial sequence

<220>
<223>  constructed vector

<400>  50
acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa       60
aagtgccacc tgggtccttt tcatcacgtg ctataaaaat aattataatt taaatttttt      120
aatataaata tataaattaa aaatagaaag taaaaaaaga aattaaagaa aaaatagttt      180
ttgttttccg aagatgtaaa agactctagg gggatcgcca acaaatacta ccttttatct      240
tgctcttcct gctctcaggt attaatgccg aattgtttca tcttgtctgt gtagaagacc      300
acacacgaaa atcctgtgat tttacatttt acttatcgtt aatcgaatgt atatctattt      360
aatctgcttt tcttgtctaa taaatatata tgtaaagtac gctttttgtt gaaatttttt      420
aaacctttgt ttattttttt ttcttcattc cgtaactctt ctaccttctt tatttacttt      480
ctaaaatcca aatacaaaac ataaaaataa ataaacacag agtaaattcc caaattattc      540
catcattaaa agatacgagg cgcgtgtaag ttacaggcaa gcgatccgtc ctaagaaacc      600
attattatca tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtctcgcg      660
cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct      720
tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gcgtgttggc      780
gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat      840
aaattcccgt tttaagagct tggtgagcgc taggagtcac tgccaggtat cgtttgaaca      900
cggcattagt cagggaagtc ataacacagt cctttcccgc aattttcttt ttctattact      960
cttggcctcc tctagtacac tctatatttt tttatgcctc ggtaatgatt ttcatttttt     1020
tttttcccct agcggatgac tctttttttt tcttagcgat tggcattatc acataatgaa     1080
ttatacatta tataaagtaa tgtgatttct tcgaagaata tactaaaaaa tgagcaggca     1140
agataaacga aggcaaagat gacagagcag aaagccctag taaagcgtat tacaaatgaa     1200
accaagattc agattgcgat ctctttaaag ggtggtcccc tagcgataga gcactcgatc     1260
ttcccagaaa aagaggcaga agcagtagca gaacaggcca cacaatcgca agtgattaac     1320
gtccacacag gtatagggtt tctggaccat atgatacatg ctctggccaa gcattccggc     1380
tggtcgctaa tcgttgagtg cattggtgac ttacacatag acgaccatca caccactgaa     1440
gactgcggga ttgctctcgg tcaagctttt aaagaggccc tactggcgcg tggagtaaaa     1500
aggtttggat caggatttgc gcctttggat gaggcacttt ccagagcggt ggtagatctt     1560
tcgaacaggc cgtacgcagt tgtcgaactt ggtttgcaaa gggagaaagt aggagatctc     1620
tcttgcgaga tgatcccgca ttttcttgaa agctttgcag aggctagcag aattaccctc     1680
cacgttgatt gtctgcgagg caagaatgat catcaccgta gtgagagtgc gttcaaggct     1740
cttgcggttg ccataagaga agccacctcg cccaatggta ccaacgatgt tccctccacc     1800
aaaggtgttc ttatgtagtg acaccgatta tttaaagctg cagcatacga tatatataca     1860
tgtgtatata tgtataccta tgaatgtcag taagtatgta tacgaacagt atgatactga     1920
agatgacaag gtaatgcatc attctatacg tgtcattctg aacgaggcgc gctttccttt     1980
tttctttttg ctttttcttt ttttttctct tgaactcgac ggatctatgc ggtgtgaaat     2040
accgcacaga tgcgtaagga gaaaataccg catcaggaaa ttgtaaacgt taatattttg     2100
ttaaaattcg cgttaaattt ttgttaaatc agctcatttt ttaaccaata ggccgaaatc     2160
ggcaaaatcc cttataaatc aaaagaatag accgagatag ggttgagtgt tgttccagtt     2220
tggaacaaga gtccactatt aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc     2280
tatcagggcg atggcccact acgtgaacca tcaccctaat caagtttttt ggggtcgagg     2340
tgccgtaaag cactaaatcg gaaccctaaa gggagccccc gatttagagc ttgacgggga     2400
aagccggcga acgtggcgag aaaggaaggg aagaaagcga aaggagcggg cgctagggcg     2460
ctggcaagtg tagcggtcac gctgcgcgta accaccacac ccgccgcgct taatgcgccg     2520
ctacagggcg cgtcgcgcca ttcgccattc aggctgcgca actgttggga agggcgatcg     2580
gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc aaggcgatta     2640
agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc cagtgagcgc     2700
gcgtaatacg actcactata gggcgaattg ggtaccgggc cccccctcga ggtcgacggt     2760
atcgataagc ttgattagaa gccgccgagc gggcgacagc cctccgacgg aagactctcc     2820
tccgtgcgtc ctcgtcttca ccggtcgcgt tcctgaaacg cagatgtgcc tcgcgccgca     2880
ctgctccgaa caataaagat tctacaatac tagcttttat ggttatgaag aggaaaaatt     2940
ggcagtaacc tggccccaca aaccttcaaa ttaacgaatc aaattaacaa ccataggatg     3000
ataatgcgat tagtttttta gccttatttc tggggtaatt aatcagcgaa gcgatgattt     3060
ttgatctatt aacagatata taaatggaaa agctgcataa ccactttaac taatactttc     3120
aacattttca gtttgtatta cttcttattc aaatgtcata aaagtatcaa caaaaaattg     3180
ttaatatacc tctatacttt aacgtcaagg agaaaaatgt ccaatttact gcccgtacac     3240
caaaatttgc ctgcattacc ggtcgatgca acgagtgatg aggttcgcaa gaacctgatg     3300
gacatgttca gggatcgcca ggcgttttct gagcatacct ggaaaatgct tctgtccgtt     3360
tgccggtcgt gggcggcatg gtgcaagttg aataaccgga aatggtttcc cgcagaacct     3420
gaagatgttc gcgattatct tctatatctt caggcgcgcg gtctggcagt aaaaactatc     3480
cagcaacatt tgggccagct aaacatgctt catcgtcggt ccgggctgcc acgaccaagt     3540
gacagcaatg ctgtttcact ggttatgcgg cggatccgaa aagaaaacgt tgatgccggt     3600
gaacgtgcaa aacaggctct agcgttcgaa cgcactgatt tcgaccaggt tcgttcactc     3660
atggaaaata gcgatcgctg ccaggatata cgtaatctgg catttctggg gattgcttat     3720
aacaccctgt tacgtatagc cgaaattgcc aggatcaggg ttaaagatat ctcacgtact     3780
gacggtggga gaatgttaat ccatattggc agaacgaaaa cgctggttag caccgcaggt     3840
gtagagaagg cacttagcct gggggtaact aaactggtcg agcgatggat ttccgtctct     3900
ggtgtagctg atgatccgaa taactacctg ttttgccggg tcagaaaaaa tggtgttgcc     3960
gcgccatctg ccaccagcca gctatcaact cgcgccctgg aagggatttt tgaagcaact     4020
catcgattga tttacggcgc taaggatgac tctggtcaga gatacctggc ctggtctgga     4080
cacagtgccc gtgtcggagc cgcgcgagat atggcccgcg ctggagtttc aataccggag     4140
atcatgcaag ctggtggctg gaccaatgta aatattgtca tgaactatat ccgtaacctg     4200
gatagtgaaa caggggcaat ggtgcgcctg ctggaagatg gcgattagga gtaagcgaat     4260
ttcttatgat ttatgatttt tattattaaa taagttataa aaaaaataag tgtatacaaa     4320
ttttaaagtg actcttaggt tttaaaacga aaattcttat tcttgagtaa ctctttcctg     4380
taggtcaggt tgctttctca ggtatagcat gaggtcgctc ttattgacca cacctctacc     4440
ggcatgccga gcaaatgcct gcaaatcgct ccccatttca cccaattgta gatatgctaa     4500
ctccagcaat gagttgatga atctcggtgt gtattttatg tcctcagagg acaacacctg     4560
tggtgttcta gagcggccgc caccgcggtg gagctccagc ttttgttccc tttagtgagg     4620
gttaattgcg cgcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc     4680
gctcacaatt ccacacaaca taggagccgg aagcataaag tgtaaagcct ggggtgccta     4740
atgagtgagg taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa     4800
cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat     4860
tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg     4920
agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc     4980
aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt     5040
gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag     5100
tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc     5160
cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc     5220
ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt     5280
cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt     5340
atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc     5400
agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa     5460
gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa     5520
gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg     5580
tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga     5640
agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg     5700
gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg     5760
aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt     5820
aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact     5880
ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat     5940
gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg     6000
aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg     6060
ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat     6120
tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc     6180
ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt     6240
cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc     6300
agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga     6360
gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc     6420
gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa     6480
acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta     6540
acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg     6600
agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg     6660
aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat     6720
gagcggat                                                              6728

<210>  51
<211>  2431
<212>  DNA
<213>  artificial sequence

<220>
<223>  chimeric expression cassette for BSaraA

<400>  51
cttttctggc aaccaaaccc atacatcggg attcctataa taccttcgtt ggtctcccta       60
acatgtaggt ggcggagggg agatatacaa tagaacagat accagacaag acataatggg      120
ctaaacaaga ctacaccaat tacactgcct cattgatggt ggtacataac gaactaatac      180
tgtagcccta gacttgatag ccatcatcat atcgaagttt cactaccctt tttccatttg      240
ccatctattg aagtaataat aggcgcatgc aacttctttt cttttttttt cttttctctc      300
tcccccgttg ttgtctcacc atatccgcaa tgacaaaaaa atgatggaag acactaaagg      360
aaaaaattaa cgacaaagac agcaccaaca gatgtcgttg ttccagagct gatgaggggt      420
atctcgaagc acacgaaact ttttccttcc ttcattcacg cacactactc tctaatgagc      480
aacggtatac ggccttcctt ccagttactt gaatttgaaa taaaaaaaag tttgctgtct      540
tgctatcaag tataaataga cctgcaatta ttaatctttt gtttcctcgt cattgttctc      600
gttccctttc ttccttgttt ctttttctgc acaatatttc aagctatacc aagcatacaa      660
tcaactatct catatacaat gttgcaaact aaggattacg aattctggtt cgttactggt      720
tctcaacact tgtacggtga agaaactttg gaattggtcg atcaacacgc taagtctatc      780
tgtgaaggtt tgtccggtgt ctcttccaga tacaagatca cccacaagcc agttgtcacc      840
tcttccgaaa ctatcagaca attgttgaga gaagctgaat actctgaaac ttgtgctggt      900
atcatcacct ggatgcacac tttctctcca gctaagatgt ggatcgaagg tttgtcttcc      960
taccaaaagc cattgatgca cttgcacacc caatacaaca gagacatccc ttggggtact     1020
atcgacatgg atttcatgaa ctctaaccaa tccgctcacg gtgacagaga atacggttac     1080
atcaactcca gaatgggttt gtccagaaag gttgtcgctg gttactggga cgatgaagaa     1140
gtcaagaagg aaatctctca atggatggac accgctgctg ctttgaacga atccagacac     1200
atcaaggttg ctagattcgg tgacaacatg agacacgttg ctgtcactga cggtgacaag     1260
gttggtgctc acatccaatt cggttggcaa gttgacggtt acggtatcgg tgacttggtt     1320
gaagtcatga acagaatcac cgacgatgaa gttgacactt tgtacgctga atacgataga     1380
ttgtacgtca tctctgaaga aaccaagaga gacgaagcta aggttgcttc catcaaggaa     1440
caagctaaga tcgaattggg tttgaccact ttcttggaac aaggtggtta ctctgctttc     1500
accacttcct tcgaagtctt gcacggtatg aagcaattgc caggtttggc tgttcaaaga     1560
ttgatggaaa agggttacgg tttcgctggt gaaggtgact ggaagaccgc tgctttggtc     1620
agaatgatga agatcatgtc tcaaggtaaa agaacctcct tcatggaaga ctacacttac     1680
cacttcgaac caggtaacga aatgatcttg ggttctcaca tgttggaagt ttgtccaact     1740
gtcgctttgg accaaccaaa gatcgaagtt cacccattgt ctatcggtgg taaagaagat     1800
ccagctagat tcgtcttcaa cggtatctct ggttccgcta tccaagcctc tttggttgac     1860
atcggtggta gattcagatt ggttttgaac gaagtcaacg gtcaagaaat cgaaaaggac     1920
atgccaaact tgccagttgc tagagtcttg tggaagccag aaccatcttt gaagactgct     1980
gctgaagcct ggatcttggc tggtggtgct caccacacct gtttgtctta cgaattgact     2040
gtcgaacaaa tgttggactg ggctgaaatg gctggtatcg aatctgtttt gatctccaga     2100
gataccacta tccacaagtt gaagcacgaa ttgaagtgga acacgaagcc ttgtacagat     2160
tgcaaaagta attaattaat catgtaatta gttatgtcac gcttacattc acgccctcct     2220
cccacatccg ctctaaccga aaaggaagga gttagacaac ctgaagtcta ggtccctatt     2280
tatttttttt aatagttatg ttagtattaa gaacgttatt tatatttcaa atttttcttt     2340
tttttctgta caaacgcgtg tacgcatgta acattatact gaaaaccttg cttgagaagg     2400
ttttgggacg ctcgaaggct ttaatttgcg g                                    2431

<210>  52
<211>  566
<212>  PRT
<213>  Escherichia coli

<400>  52
Met Ala Ile Ala Ile Gly Leu Asp Phe Gly Ser Asp Ser Val Arg Ala 
1               5                   10                  15      
Leu Ala Val Asp Cys Ala Ser Gly Glu Glu Ile Ala Thr Ser Val Glu 
            20                  25                  30          
Trp Tyr Pro Arg Trp Gln Lys Gly Gln Phe Cys Asp Ala Pro Asn Asn 
        35                  40                  45              
Gln Phe Arg His His Pro Arg Asp Tyr Ile Glu Ser Met Glu Ala Ala 
    50                  55                  60                  
Leu Lys Thr Val Leu Ala Glu Leu Ser Val Glu Gln Arg Ala Ala Val 
65                  70                  75                  80  
Val Gly Ile Gly Val Asp Ser Thr Gly Ser Thr Pro Ala Pro Ile Asp 
                85                  90                  95      
Ala Asp Gly Asn Val Leu Ala Leu Arg Pro Glu Phe Ala Glu Asn Pro 
            100                 105                 110         
Asn Ala Met Phe Val Leu Trp Lys Asp His Thr Ala Val Glu Arg Ser 
        115                 120                 125             
Glu Glu Ile Thr Arg Leu Cys His Ala Pro Gly Asn Val Asp Tyr Ser 
    130                 135                 140                 
Arg Tyr Ile Gly Gly Ile Tyr Ser Ser Glu Trp Phe Trp Ala Lys Ile 
145                 150                 155                 160 
Leu His Val Thr Arg Gln Asp Ser Ala Val Ala Gln Ser Ala Ala Ser 
                165                 170                 175     
Trp Ile Glu Leu Cys Asp Trp Val Pro Ala Leu Leu Ser Gly Thr Thr 
            180                 185                 190         
Arg Pro Gln Asp Ile Arg Arg Gly Arg Cys Ser Ala Gly His Lys Ser 
        195                 200                 205             
Leu Trp His Glu Ser Trp Gly Gly Leu Pro Pro Ala Ser Phe Phe Asp 
    210                 215                 220                 
Glu Leu Asp Pro Ile Leu Asn Arg His Leu Pro Ser Pro Leu Phe Thr 
225                 230                 235                 240 
Asp Thr Trp Thr Ala Asp Ile Pro Val Gly Thr Leu Cys Pro Glu Trp 
                245                 250                 255     
Ala Gln Arg Leu Gly Leu Pro Glu Ser Val Val Ile Ser Gly Gly Ala 
            260                 265                 270         
Phe Asp Cys His Met Gly Ala Val Gly Ala Gly Ala Gln Pro Asn Ala 
        275                 280                 285             
Leu Val Lys Val Ile Gly Thr Ser Thr Cys Asp Ile Leu Ile Ala Asp 
    290                 295                 300                 
Lys Gln Ser Val Gly Glu Arg Ala Val Lys Gly Ile Cys Gly Gln Val 
305                 310                 315                 320 
Asp Gly Ser Val Val Pro Gly Phe Ile Gly Leu Glu Ala Gly Gln Ser 
                325                 330                 335     
Ala Phe Gly Asp Ile Tyr Ala Trp Phe Gly Arg Val Leu Ser Trp Pro 
            340                 345                 350         
Leu Glu Gln Leu Ala Ala Gln His Pro Glu Leu Lys Ala Gln Ile Asn 
        355                 360                 365             
Ala Ser Gln Lys Gln Leu Leu Pro Ala Leu Thr Glu Ala Trp Ala Lys 
    370                 375                 380                 
Asn Pro Ser Leu Asp His Leu Pro Val Val Leu Asp Trp Phe Asn Gly 
385                 390                 395                 400 
Arg Arg Ser Pro Asn Ala Asn Gln Arg Leu Lys Gly Val Ile Thr Asp 
                405                 410                 415     
Leu Asn Leu Ala Thr Asp Ala Pro Leu Leu Phe Gly Gly Leu Ile Ala 
            420                 425                 430         
Ala Thr Ala Phe Gly Ala Arg Ala Ile Met Glu Cys Phe Thr Asp Gln 
        435                 440                 445             
Gly Ile Ala Val Asn Asn Val Met Ala Leu Gly Gly Ile Ala Arg Lys 
    450                 455                 460                 
Asn Gln Val Ile Met Gln Ala Cys Cys Asp Val Leu Asn Arg Pro Leu 
465                 470                 475                 480 
Gln Ile Val Ala Ser Asp Gln Cys Cys Ala Leu Gly Ala Ala Ile Phe 
                485                 490                 495     
Ala Ala Val Ala Ala Lys Val His Ala Asp Ile Pro Ser Ala Gln Gln 
            500                 505                 510         
Lys Met Ala Ser Ala Val Glu Lys Thr Leu Gln Pro Arg Ser Glu Gln 
        515                 520                 525             
Ala Gln Arg Phe Glu Gln Leu Tyr Arg Arg Tyr Gln Gln Trp Ala Met 
    530                 535                 540                 
Ser Ala Glu Gln His Tyr Leu Pro Thr Ser Ala Pro Ala Gln Ala Ala 
545                 550                 555                 560 
Gln Ala Val Ala Thr Leu 
                565     

<210>  53
<211>  1701
<212>  DNA
<213>  artificial sequence

<220>
<223>  codon optimized coding region for E. coli ribulokinase

<400>  53
atggctatcg ctatcggttt ggacttcggt tctgactccg ttagagcttt ggctgttgac       60
tgtgcttccg gtgaagaaat cgctacttct gtcgaatggt atccaagatg gcaaaagggt      120
caattctgtg acgctccaaa caaccaattc agacaccacc caagagatta catcgaatct      180
atggaagctg ctttgaagac tgttttggct gaattgtctg tcgaacaaag agctgctgtt      240
gtcggtatcg gtgttgactc tactggttcc accccagctc caatcgacgc tgatggtaac      300
gttttggctt tgagaccaga attcgctgaa aacccaaacg ctatgttcgt tttgtggaag      360
gaccacactg ctgtcgaaag atccgaagaa atcaccagat tgtgtcacgc tccaggtaac      420
gttgactact ccagatacat cggtggtatc tactcttccg aatggttctg ggctaagatt      480
ttgcacgtta ctagacaaga ctctgctgtc gctcaatctg ctgcttcctg gatcgaattg      540
tgtgactggg ttccagcttt gttgtctggt actactagac cacaagatat cagaagaggt      600
agatgttctg ctggtcacaa gtccttgtgg cacgaatctt ggggtggttt gccaccagct      660
tccttcttcg acgaattgga cccaatcttg aacagacact tgccatctcc attgttcacc      720
gacacttgga ccgctgatat cccagtcggt actttgtgtc cagaatgggc tcaaagattg      780
ggtttgccag aatctgttgt catctccggt ggtgctttcg actgtcacat gggtgctgtt      840
ggtgctggtg ctcaaccaaa cgctttggtt aaggtcatcg gtacttccac ctgtgacatc      900
ttgatcgctg ataagcaatc tgttggtgaa agagctgtca agggtatctg tggtcaagtt      960
gacggttccg ttgtcccagg tttcatcggt ttggaagctg gtcaatctgc tttcggtgac     1020
atctacgctt ggttcggtag agttttgtcc tggccattgg aacaattggc tgctcaacac     1080
ccagaattga aggctcaaat caacgcttct caaaagcaat tgttgccagc tttgactgaa     1140
gcctgggcta agaacccatc cttggaccac ttgccagttg tcttggattg gttcaacggt     1200
agaagatccc caaacgctaa ccaaagattg aagggtgtta tcactgactt gaacttggct     1260
accgatgctc cattgttgtt cggtggtttg atcgctgcta ctgctttcgg tgctagagct     1320
atcatggaat gtttcaccga ccaaggtatc gctgttaaca acgtcatggc tttgggtggt     1380
atcgctagaa agaaccaagt tatcatgcaa gcctgttgtg acgtcttgaa cagaccattg     1440
caaatcgttg cttctgatca atgttgtgct ttgggtgctg ctatcttcgc tgctgttgct     1500
gctaaggtcc acgctgacat cccatccgct caacaaaaga tggcttctgc tgtcgaaaag     1560
accttgcaac caagatccga acaagctcaa agattcgaac aattgtacag aagataccaa     1620
caatgggcta tgtccgctga acaacactac ttgccaactt ctgctccagc tcaagctgct     1680
caagctgttg ctaccttgta a                                               1701

<210>  54
<211>  2911
<212>  DNA
<213>  artificial sequence

<220>
<223>  constructed chimeric expression cassette for ECaraB

<400>  54
gagaagaggt atacataaca agaaaatcgc gtgaacacct tatataactt agcccgttat       60
tgagctaaaa aaccttgcaa aatttcctat gaataagaat acttcagacg tgataaaaat      120
ttactttcta actcttctca cgctgcccct atctgttctt ccgctctacc gtgagaaata      180
aagcatcgag tacggcagtt cgctgtcact gaactaaaac aataaggcta gttcgaatga      240
tgaacttgct tgctgtcaaa cttctgagtt gccgctgatg tgacactgtg acaataaatt      300
caaaccggtt atagcggtct cctccggtac cggttctgcc acctccaata gagctcagta      360
ggagtcagaa cctctgcggt ggctgtcagt gactcatccg cgtttcgtaa gttgtgcgcg      420
tgcacatttc gcccgttccc gctcatcttg cagcaggcga aattttcatc acgctgtagg      480
acgcaaaaaa aaaataatta atcgtacaag aatcttggaa aaaaaattga aaaattttgt      540
ataaaaggga tgacctaact tgactcaatg gcttttacac ccagtatttt ccctttcctt      600
gtttgttaca attatagaag caagacaaaa acatatagac aacctattcc taggagttat      660
atttttttac cctaccagca atataagtaa aaaataaaac atggctatcg ctatcggttt      720
ggacttcggt tctgactccg ttagagcttt ggctgttgac tgtgcttccg gtgaagaaat      780
cgctacttct gtcgaatggt atccaagatg gcaaaagggt caattctgtg acgctccaaa      840
caaccaattc agacaccacc caagagatta catcgaatct atggaagctg ctttgaagac      900
tgttttggct gaattgtctg tcgaacaaag agctgctgtt gtcggtatcg gtgttgactc      960
tactggttcc accccagctc caatcgacgc tgatggtaac gttttggctt tgagaccaga     1020
attcgctgaa aacccaaacg ctatgttcgt tttgtggaag gaccacactg ctgtcgaaag     1080
atccgaagaa atcaccagat tgtgtcacgc tccaggtaac gttgactact ccagatacat     1140
cggtggtatc tactcttccg aatggttctg ggctaagatt ttgcacgtta ctagacaaga     1200
ctctgctgtc gctcaatctg ctgcttcctg gatcgaattg tgtgactggg ttccagcttt     1260
gttgtctggt actactagac cacaagatat cagaagaggt agatgttctg ctggtcacaa     1320
gtccttgtgg cacgaatctt ggggtggttt gccaccagct tccttcttcg acgaattgga     1380
cccaatcttg aacagacact tgccatctcc attgttcacc gacacttgga ccgctgatat     1440
cccagtcggt actttgtgtc cagaatgggc tcaaagattg ggtttgccag aatctgttgt     1500
catctccggt ggtgctttcg actgtcacat gggtgctgtt ggtgctggtg ctcaaccaaa     1560
cgctttggtt aaggtcatcg gtacttccac ctgtgacatc ttgatcgctg ataagcaatc     1620
tgttggtgaa agagctgtca agggtatctg tggtcaagtt gacggttccg ttgtcccagg     1680
tttcatcggt ttggaagctg gtcaatctgc tttcggtgac atctacgctt ggttcggtag     1740
agttttgtcc tggccattgg aacaattggc tgctcaacac ccagaattga aggctcaaat     1800
caacgcttct caaaagcaat tgttgccagc tttgactgaa gcctgggcta agaacccatc     1860
cttggaccac ttgccagttg tcttggattg gttcaacggt agaagatccc caaacgctaa     1920
ccaaagattg aagggtgtta tcactgactt gaacttggct accgatgctc cattgttgtt     1980
cggtggtttg atcgctgcta ctgctttcgg tgctagagct atcatggaat gtttcaccga     2040
ccaaggtatc gctgttaaca acgtcatggc tttgggtggt atcgctagaa agaaccaagt     2100
tatcatgcaa gcctgttgtg acgtcttgaa cagaccattg caaatcgttg cttctgatca     2160
atgttgtgct ttgggtgctg ctatcttcgc tgctgttgct gctaaggtcc acgctgacat     2220
cccatccgct caacaaaaga tggcttctgc tgtcgaaaag accttgcaac caagatccga     2280
acaagctcaa agattcgaac aattgtacag aagataccaa caatgggcta tgtccgctga     2340
acaacactac ttgccaactt ctbamhgctc cagctcaagc tgctcaagct gttgctacct     2400
tgtaaggatc caggagcaat gcaaaatcta ggggtagaat tactttttga aaaggaaaaa     2460
tattcaggtt tgttgttttt atgtaagttg tatgatttga tatacatata tatatatata     2520
taatatatat tgtacatgtg tttttccggg gaagaatgga ttatccggag gtgtgaataa     2580
aatgatgacg attataggtt tgtgttgtaa tatttagata actcaattct cgccagtttg     2640
aactccaacc tagactggtt caaagctttt gctatcaaga tgagatatat ggaattttcg     2700
tctttatcgt ccacttgtat ctttatttcc tcgtcatctt catcaatatt gattccatta     2760
ataatcgatt tatcgctcag agtgttgacc aattcggtct tgttggggaa gaaatgttcc     2820
atttttcttc ccaagttttg aattctttca caaacccagg caattctttg taagcctaat     2880
gcagcagaag aaccctttaa aaaatggccc a                                    2911

<210>  55
<211>  231
<212>  PRT
<213>  Escherichia coli

<400>  55
Met Leu Glu Asp Leu Lys Arg Gln Val Leu Glu Ala Asn Leu Ala Leu 
1               5                   10                  15      
Pro Lys His Asn Leu Val Thr Leu Thr Trp Gly Asn Val Ser Ala Val 
            20                  25                  30          
Asp Arg Glu Arg Gly Val Phe Val Ile Lys Pro Ser Gly Val Asp Tyr 
        35                  40                  45              
Ser Ile Met Thr Ala Asp Asp Met Val Val Val Ser Ile Glu Thr Gly 
    50                  55                  60                  
Glu Val Val Glu Gly Ala Lys Lys Pro Ser Ser Asp Thr Pro Thr His 
65                  70                  75                  80  
Arg Leu Leu Tyr Gln Ala Phe Pro Ser Ile Gly Gly Ile Val His Thr 
                85                  90                  95      
His Ser Arg His Ala Thr Ile Trp Ala Gln Ala Gly Gln Ser Ile Pro 
            100                 105                 110         
Ala Thr Gly Thr Thr His Ala Asp Tyr Phe Tyr Gly Thr Ile Pro Cys 
        115                 120                 125             
Thr Arg Lys Met Thr Asp Ala Glu Ile Asn Gly Glu Tyr Glu Trp Glu 
    130                 135                 140                 
Thr Gly Asn Val Ile Val Glu Thr Phe Glu Lys Gln Gly Ile Asp Ala 
145                 150                 155                 160 
Ala Gln Met Pro Gly Val Leu Val His Ser His Gly Pro Phe Ala Trp 
                165                 170                 175     
Gly Lys Asn Ala Glu Asp Ala Val His Asn Ala Ile Val Leu Glu Glu 
            180                 185                 190         
Val Ala Tyr Met Gly Ile Phe Cys Arg Gln Leu Ala Pro Gln Leu Pro 
        195                 200                 205             
Asp Met Gln Gln Thr Leu Leu Asn Lys His Tyr Leu Arg Lys His Gly 
    210                 215                 220                 
Ala Lys Ala Tyr Tyr Gly Gln 
225                 230     

<210>  56
<211>  696
<212>  DNA
<213>  artificial sequence

<220>
<223>  E coli L-ribulose-5-phosphate 4-epimerase codon optimized coding 
       region

<400>  56
atgttggaag atttgaagag acaagttttg gaagctaact tggctttgcc aaagcacaac       60
ttggtcacct tgacctgggg taacgtctct gctgttgaca gagaaagagg tgtcttcgtt      120
atcaagccat ctggtgttga ttactccatc atgactgctg acgatatggt tgtcgtttcc      180
atcgaaaccg gtgaagtcgt tgaaggtgct aagaagccat cttccgacac cccaactcac      240
agattgttgt accaagcctt cccatctatc ggtggtatcg tccacactca ctccagacac      300
gctaccatct gggctcaagc tggtcaatct atcccagcta ctggtactac tcacgctgac      360
tacttctacg gtactatccc atgtactaga aagatgaccg atgctgaaat caacggtgaa      420
tacgaatggg aaactggtaa cgtcatcgtt gaaaccttcg aaaagcaagg tatcgacgct      480
gctcaaatgc caggtgtctt ggttcactct cacggtccat tcgcttgggg taaaaacgct      540
gaagatgctg ttcacaacgc tatcgtcttg gaagaagttg cttacatggg tatcttctgt      600
agacaattgg ctccacaatt gccagacatg caacaaacct tgttgaacaa gcactacttg      660
agaaagcacg gtgctaaggc ttactacggt caataa                                696

<210>  57
<211>  1691
<212>  DNA
<213>  artificial sequence

<220>
<223>  constructed chimeric expression cassette for ECaraD

<400>  57
agttcgagtt tatcattatc aatactgcca tttcaaagaa tacgtaaata attaatagta       60
gtgattttcc taactttatt tagtcaaaaa attagccttt taattctgct gtaacccgta      120
catgcccaaa atagggggcg ggttacacag aatatataac atcgtaggtg tctgggtgaa      180
cagtttattc ctggcatcca ctaaatataa tggagcccgc tttttaagct ggcatccaga      240
aaaaaaaaga atcccagcac caaaatattg ttttcttcac caaccatcag ttcataggtc      300
cattctctta gcgcaactac agagaacagg ggcacaaaca ggcaaaaaac gggcacaacc      360
tcaatggagt gatgcaacct gcctggagta aatgatgaca caaggcaatt gacccacgca      420
tgtatctatc tcattttctt acaccttcta ttaccttctg ctctctctga tttggaaaaa      480
gctgaaaaaa aaggttgaaa ccagttccct gaaattattc ccctacttga ctaataagta      540
tataaagacg gtaggtattg attgtaattc tgtaaatcta tttcttaaac ttcttaaatt      600
ctacttttat agttagtctt ttttttagtt ttaaaacacc aagaacttag tttcgaataa      660
acacacataa acaaacaaaa tgttggaaga tttgaagaga caagttttgg aagctaactt      720
ggctttgcca aagcacaact tggtcacctt gacctggggt aacgtctctg ctgttgacag      780
agaaagaggt gtcttcgtta tcaagccatc tggtgttgat tactccatca tgactgctga      840
cgatatggtt gtcgtttcca tcgaaaccgg tgaagtcgtt gaaggtgcta agaagccatc      900
ttccgacacc ccaactcaca gattgttgta ccaagccttc ccatctatcg gtggtatcgt      960
ccacactcac tccagacacg ctaccatctg ggctcaagct ggtcaatcta tcccagctac     1020
tggtactact cacgctgact acttctacgg tactatccca tgtactagaa agatgaccga     1080
tgctgaaatc aacggtgaat acgaatggga aactggtaac gtcatcgttg aaaccttcga     1140
aaagcaaggt atcgacgctg ctcaaatgcc aggtgtcttg gttcactctc acggtccatt     1200
cgcttggggt aaaaacgctg aagatgctgt tcacaacgct atcgtcttgg aagaagttgc     1260
ttacatgggt atcttctgta gacaattggc tccacaattg ccagacatgc aacaaacctt     1320
gttgaacaag cactacttga gaaagcacgg tgctaaggct tactacggtc aataagagta     1380
agcgaatttc ttatgattta tgatttttat tattaaataa gttataaaaa aaataagtgt     1440
atacaaattt taaagtgact cttaggtttt aaaacgaaaa ttcttattct tgagtaactc     1500
tttcctgtag gtcaggttgc tttctcaggt atagcatgag gtcgctctta ttgaccacac     1560
ctctaccggc atgccgagca aatgcctgca aatcgctccc catttcaccc aattgtagat     1620
atgctaactc cagcaatgag ttgatgaatc tcggtgtgta ttttatgtcc tcagaggaca     1680
acacctgtgg t                                                          1691

<210>  58
<211>  8202
<212>  DNA
<213>  artificial sequence

<220>
<223>  constructed plasmid

<220>
<221>  misc_feature
<222>  (421)..(421)
<223>  n is a, c, g, or t

<400>  58
tcccattacc gacatttggg cgctatacgt gcatatgttc atgtatgtat ctgtatttaa       60
aacacttttg tattattttt cctcatatat gtgtataggt ttatacggat gatttaatta      120
ttacttcacc accctttatt tcaggctgat atcttagcct tgttactaga ttaatcatgt      180
aattagttat gtcacgctta cattcacgcc ctccccccac atccgctcta accgaaaagg      240
aaggagttag acaacctgaa gtctaggtcc ctatttattt ttttatagtt atgttagtat      300
taagaacgtt atttatattt caaatttttc ttttttttct gtacagacgc gtgtacgcat      360
gtaacattat actgaaaacc ttgcttgaga aggttttggg acgctcgaag gctttaattt      420
ntgcgggcgg ccgctggaca atttattcat ggcatcgtca ttgatataag tggcttgagc      480
tgtggataag aaaagccata tatttatata aacatttaga tatgaatagg aagtagattg      540
ttcgacgcaa ctacccgttc aagaagtata atggggaatg gtctcatctt ccctcacagg      600
atatagttct ctgaagagat acatacgttt gtgtatacta tgcttcttta tcaactcaag      660
ttttgtagag gaagacgttg aagatggtga tgtgacatct ttactattct ccagcacgtt      720
ttcagtattt acttaatcgt atattaatga cgtcccttat ctattaactt tccggttttt      780
ctttttttcg gtgaatgttc tttccgtttt agtgaatttt tcaattgtaa ttgacgcaat      840
cggtttataa caagcagaca taaatatcaa gctcgagcca aatcacaaaa aaagccttat      900
agbamhcttg ccctgacaaa gaatatacaa ctcgggaagg atccgagaag aggtatacat      960
aacaagaaaa tcgcgtgaac accttatata acttagcccg ttattgagct aaaaaacctt     1020
gcaaaatttc ctatgaataa gaatacttca gacgtgataa aaatttactt tctaactctt     1080
ctcacgctgc ccctatctgt tcttccgctc taccgtgaga aataaagcat cgagtacggc     1140
agttcgctgt cactgaacta aaacaataag gctagttcga atgatgaact tgcttgctgt     1200
caaacttctg agttgccgct gatgtgacac tgtgacaata aattcaaacc ggttatagcg     1260
gtctcctccg gtaccggttc tgccacctcc aatagagctc agtaggagtc agaacctctg     1320
cggtggctgt cagtgactca tccgcgtttc gtaagttgtg cgcgtgcaca tttcgcccgt     1380
tcccgctcat cttgcagcag gcgaaatttt catcacgctg taggacgcaa aaaaaaaata     1440
attaatcgta caagaatctt ggaaaaaaaa ttgaaaaatt ttgtataaaa gggatgacct     1500
aacttgactc aatggctttt acacccagta ttttcccttt ccttgtttgt tacaattata     1560
gaagcaagac aaaaacatat agacaaccta ttcctaggag ttatattttt ttaccctacc     1620
agcaatataa gtaaaaaata aaacatggct atcgctatcg gtttggactt cggttctgac     1680
tccgttagag ctttggctgt tgactgtgct tccggtgaag aaatcgctac ttctgtcgaa     1740
tggtatccaa gatggcaaaa gggtcaattc tgtgacgctc caaacaacca attcagacac     1800
cacccaagag attacatcga atctatggaa gctgctttga agactgtttt ggctgaattg     1860
tctgtcgaac aaagagctgc tgttgtcggt atcggtgttg actctactgg ttccacccca     1920
gctccaatcg acgctgatgg taacgttttg gctttgagac cagaattcgc tgaaaaccca     1980
aacgctatgt tcgttttgtg gaaggaccac actgctgtcg aaagatccga agaaatcacc     2040
agattgtgtc acgctccagg taacgttgac tactccagat acatcggtgg tatctactct     2100
tccgaatggt tctgggctaa gattttgcac gttactagac aagactctgc tgtcgctcaa     2160
tctgctgctt cctggatcga attgtgtgac tgggttccag ctttgttgtc tggtactact     2220
agaccacaag atatcagaag aggtagatgt tctgctggtc acaagtcctt gtggcacgaa     2280
tcttggggtg gtttgccacc agcttccttc ttcgacgaat tggacccaat cttgaacaga     2340
cacttgccat ctccattgtt caccgacact tggaccgctg atatcccagt cggtactttg     2400
tgtccagaat gggctcaaag attgggtttg ccagaatctg ttgtcatctc cggtggtgct     2460
ttcgactgtc acatgggtgc tgttggtgct ggtgctcaac caaacgcttt ggttaaggtc     2520
atcggtactt ccacctgtga catcttgatc gctgataagc aatctgttgg tgaaagagct     2580
gtcaagggta tctgtggtca agttgacggt tccgttgtcc caggtttcat cggtttggaa     2640
gctggtcaat ctgctttcgg tgacatctac gcttggttcg gtagagtttt gtcctggcca     2700
ttggaacaat tggctgctca acacccagaa ttgaaggctc aaatcaacgc ttctcaaaag     2760
caattgttgc cagctttgac tgaagcctgg gctaagaacc catccttgga ccacttgcca     2820
gttgtcttgg attggttcaa cggtagaaga tccccaaacg ctaaccaaag attgaagggt     2880
gttatcactg acttgaactt ggctaccgat gctccattgt tgttcggtgg tttgatcgct     2940
gctactgctt tcggtgctag agctatcatg gaatgtttca ccgaccaagg tatcgctgtt     3000
aacaacgtca tggctttggg tggtatcgct agaaagaacc aagttatcat gcaagcctgt     3060
tgtgacgtct tgaacagacc attgcaaatc gttgcttctg atcaatgttg tgctttgggt     3120
gctgctatct tcgctgctgt tgctgctaag gtccacgctg acatcccatc cgctcaacaa     3180
aagatggctt ctgctgtcga aaagaccttg caaccaagat ccgaacaagc tcaaagattc     3240
gaacaattgt acagaagata ccaacaatgg gctatgtccg ctgaacaaca ctacttgcca     3300
acttctbamh gctccagctc aagctgctca agctgttgct accttgtaag gatccaggag     3360
caatgcaaaa tctaggggta gaattacttt ttgaaaagga aaaatattca ggtttgttgt     3420
ttttatgtaa gttgtatgat ttgatataca tatatatata tatataatat atattgtaca     3480
tgtgtttttc cggggaagaa tggattatcc ggaggtgtga ataaaatgat gacgattata     3540
ggtttgtgtt gtaatattta gataactcaa ttctcgccag tttgaactcc aacctagact     3600
ggttcaaagc ttttgctatc aagatgagat atatggaatt ttcgtcttta tcgtccactt     3660
gtatctttat ttcctcgtca tcttcatcaa tattgattcc attaataatc gatttatcgc     3720
tcagagtgtt gaccaattcg gtcttgttgg ggaagaaatg ttccattttt cttcccaagt     3780
tttgaattct ttcacaaacc caggcaattc tttgtaagcc taatgcagca gaagaaccct     3840
ttaaaaaatg gcccattaat gtggctgtgg tttcagggtc cataaagctt ttcaattcat     3900
cttttttttt tttgttcttt tttttgattc cggtttcttt gaaatttttt tgattcggta     3960
atctccgagc agaaggaaga acgaaggaag gagcacagac ttagattggt atatatacgc     4020
atatgtggtg ttgaagaaac atgaaattgc ccagtattct taacccaact gcacagaaca     4080
aaaacctgca ggaaacgaag ataaatcatg tcgaaagcta catataagga acgtgctgct     4140
actcatccta gtcctgttgc tgccaagcta tttaatatca tgcacgaaaa gcaaacaaac     4200
ttgtgtgctt cattggatgt tcgtaccacc aaggaattac tggagttagt tgaagcatta     4260
ggtcccaaaa tttgtttact aaaaacacat gtggatatct tgactgattt ttccatggag     4320
ggcacagtta agccgctaaa ggcattatcc gccaagtaca attttttact cttcgaagac     4380
agaaaatttg ctgacattgg taatacagtc aaattgcagt actctgcggg tgtatacaga     4440
atagcagaat gggcagacat tacgaatgca cacggtgtgg tgggcccagg tattgttagc     4500
ggtttgaagc aggcggcgga agaagtaaca aaggaaccta gaggcctttt gatgttagca     4560
gaattgtcat gcaagggctc cctagctact ggagaatata ctaagggtac tgttgacatt     4620
gcgaagagcg acaaagattt tgttatcggc tttattgctc aaagagacat gggtggaaga     4680
gatgaaggtt acgattggtt gattatgaca cccggtgtgg gtttagatga caagggagac     4740
gcattgggtc aacagtatag aaccgtggat gatgtggtct ctacaggatc tgacattatt     4800
attgttggaa gaggactatt tgcaaaggga agggatgcta aggtagaggg tgaacgttac     4860
agaaaagcag gctgggaagc atatttgaga agatgcggcc agcaaaacta aaaaactgta     4920
ttataagtaa atgcatgtat actaaactca caaattagag cttcaattta attatatcag     4980
ttattacccg ggaatctcgg tcgtaatgat ttctataatg acgaaaaaaa aaaaattgga     5040
aagaaaaagc ttcatggcct tctaaatcac catttttggt gaacggcctt gataaggatg     5100
ttagttgtgt tattgctggg ttagacacga aggtaaatta ccaccgtttg gctgttacac     5160
tgcagtattt gcagaaggat tctgttcact ttgttggtac aaatgttgat tctactttcc     5220
cgcaaaaggg ttatacattt cccggtgcag gctccatgat tgaatcattg gcattctcat     5280
ctaataggag gccatcgtac tgtggtaagc caaatcaaaa tatgctaaac agcattatat     5340
cggcattcaa cctggataga tcaaagtgct gtatggttgg tgacagatta aacaccgata     5400
tgaaattcgg tgttgaaggt gggttaggtg gcacactact cgttttgagt ggtattgaaa     5460
ccgaagagag agccttgaag atttcgcacg attatccaag acctaaattt tacattgata     5520
aacttggtga asccatctac accttaacca ataatgagtt atagggcgcg ccgacgtcag     5580
gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt     5640
caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa     5700
ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt     5760
gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt     5820
tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt     5880
ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg     5940
tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga     6000
atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa     6060
gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga     6120
caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa     6180
ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca     6240
ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta     6300
ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac     6360
ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc     6420
gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag     6480
ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga     6540
taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt     6600
agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata     6660
atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag     6720
aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa     6780
caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt     6840
ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgttctt ctagtgtagc     6900
cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa     6960
tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa     7020
gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc     7080
ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa     7140
gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa     7200
caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg     7260
ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc     7320
tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg     7380
ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg     7440
agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg     7500
aagcggaaga gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat     7560
gcagctggca cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg     7620
tgagttagct cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt     7680
tgtgtggaat tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg     7740
ccaagctttt tctttccaat tttttttttt tcgtcattat aaaaatcatt acgaccgaga     7800
ttccctaata agagaatagg aacttcggaa taggaacttc aaagcgtttc cgaaaacgag     7860
cgcttccgaa aatgcaacgc gagctgcgca catacagctc actgttcacg tcgcacctat     7920
atctgcgtgt tgcctgtata tatatataca tgagaagaac ggcatagtgc gtgtttatgc     7980
ttaaatgcgt acttatatgc gtctatttat gtaggatgaa aggtagtcta gtacctcctg     8040
tgatattatc ccattccatg cggggtatcg tatgcttcct tcagcactac cctttagctg     8100
ttctatatgc tgccactcct caattggatt agtctcatcc ttcaatgcta tcatttcctt     8160
tgatattgga tcatatgcat agtaccgaga aactagagga tc                        8202

<210>  59
<211>  10887
<212>  DNA
<213>  artificial sequence

<220>
<223>  constructed plasmid

<400>  59
ggcctcgagc ggaccggatc cttttctggc aaccaaaccc atacatcggg attcctataa       60
taccttcgtt ggtctcccta acatgtaggt ggcggagggg agatatacaa tagaacagat      120
accagacaag acataatggg ctaaacaaga ctacaccaat tacactgcct cattgatggt      180
ggtacataac gaactaatac tgtagcccta gacttgatag ccatcatcat atcgaagttt      240
cactaccctt tttccatttg ccatctattg aagtaataat aggcgcatgc aacttctttt      300
cttttttttt cttttctctc tcccccgttg ttgtctcacc atatccgcaa tgacaaaaaa      360
atgatggaag acactaaagg aaaaaattaa cgacaaagac agcaccaaca gatgtcgttg      420
ttccagagct gatgaggggt atctcgaagc acacgaaact ttttccttcc ttcattcacg      480
cacactactc tctaatgagc aacggtatac ggccttcctt ccagttactt gaatttgaaa      540
taaaaaaaag tttgctgtct tgctatcaag tataaataga cctgcaatta ttaatctttt      600
gtttcctcgt cattgttctc gttccctttc ttccttgttt ctttttctgc acaatatttc      660
aagctatacc aagcatacaa tcaactatct catatacaat gttgcaaact aaggattacg      720
aattctggtt cgttactggt tctcaacact tgtacggtga agaaactttg gaattggtcg      780
atcaacacgc taagtctatc tgtgaaggtt tgtccggtgt ctcttccaga tacaagatca      840
cccacaagcc agttgtcacc tcttccgaaa ctatcagaca attgttgaga gaagctgaat      900
actctgaaac ttgtgctggt atcatcacct ggatgcacac tttctctcca gctaagatgt      960
ggatcgaagg tttgtcttcc taccaaaagc cattgatgca cttgcacacc caatacaaca     1020
gagacatccc ttggggtact atcgacatgg atttcatgaa ctctaaccaa tccgctcacg     1080
gtgacagaga atacggttac atcaactcca gaatgggttt gtccagaaag gttgtcgctg     1140
gttactggga cgatgaagaa gtcaagaagg aaatctctca atggatggac accgctgctg     1200
ctttgaacga atccagacac atcaaggttg ctagattcgg tgacaacatg agacacgttg     1260
ctgtcactga cggtgacaag gttggtgctc acatccaatt cggttggcaa gttgacggtt     1320
acggtatcgg tgacttggtt gaagtcatga acagaatcac cgacgatgaa gttgacactt     1380
tgtacgctga atacgataga ttgtacgtca tctctgaaga aaccaagaga gacgaagcta     1440
aggttgcttc catcaaggaa caagctaaga tcgaattggg tttgaccact ttcttggaac     1500
aaggtggtta ctctgctttc accacttcct tcgaagtctt gcacggtatg aagcaattgc     1560
caggtttggc tgttcaaaga ttgatggaaa agggttacgg tttcgctggt gaaggtgact     1620
ggaagaccgc tgctttggtc agaatgatga agatcatgtc tcaaggtaaa agaacctcct     1680
tcatggaaga ctacacttac cacttcgaac caggtaacga aatgatcttg ggttctcaca     1740
tgttggaagt ttgtccaact gtcgctttgg accaaccaaa gatcgaagtt cacccattgt     1800
ctatcggtgg taaagaagat ccagctagat tcgtcttcaa cggtatctct ggttccgcta     1860
tccaagcctc tttggttgac atcggtggta gattcagatt ggttttgaac gaagtcaacg     1920
gtcaagaaat cgaaaaggac atgccaaact tgccagttgc tagagtcttg tggaagccag     1980
aaccatcttt gaagactgct gctgaagcct ggatcttggc tggtggtgct caccacacct     2040
gtttgtctta cgaattgact gtcgaacaaa tgttggactg ggctgaaatg gctggtatcg     2100
aatctgtttt gatctccaga gataccacta tccacaagtt gaagcacgaa ttgaagtgga     2160
acgaagcctt gtacagattg caaaagtaat taattaatca tgtaattagt tatgtcacgc     2220
ttacattcac gccctcctcc cacatccgct ctaaccgaaa aggaaggagt tagacaacct     2280
gaagtctagg tccctattta ttttttttaa tagttatgtt agtattaaga acgttattta     2340
tatttcaaat ttttcttttt tttctgtaca aacgcgtgta cgcatgtaac attatactga     2400
aaaccttgct tgagaaggtt ttgggacgct cgaaggcttt aatttgcggg cggccgctct     2460
agaactagta ccacaggtgt tgtcctctga ggacataaaa tacacaccga gattcatcaa     2520
ctcattgctg gagttagcat atctacaatt gggtgaaatg gggagcgatt tgcaggcatt     2580
tgctcggcat gccggtagag gtgtggtcaa taagagcgac ctcatgctat acctgagaaa     2640
gcaacctgac ctacaggaaa gagttactca agaataagaa ttttcgtttt aaaacctaag     2700
agtcacttta aaatttgtat acacttattt tttttataac ttatttaata ataaaaatca     2760
taaatcataa gaaattcgct tactcttatt gaccgtagta agccttagca ccgtgctttc     2820
tcaagtagtg cttgttcaac aaggtttgtt gcatgtctgg caattgtgga gccaattgtc     2880
tacagaagat acccatgtaa gcaacttctt ccaagacgat agcgttgtga acagcatctt     2940
cagcgttttt accccaagcg aatggaccgt gagagtgaac caagacacct ggcatttgag     3000
cagcgtcgat accttgcttt tcgaaggttt caacgatgac gttaccagtt tcccattcgt     3060
attcaccgtt gatttcagca tcggtcatct ttctagtaca tgggatagta ccgtagaagt     3120
agtcagcgtg agtagtacca gtagctggga tagattgacc agcttgagcc cagatggtag     3180
cgtgtctgga gtgagtgtgg acgataccac cgatagatgg gaaggcttgg tacaacaatc     3240
tgtgagttgg ggtgtcggaa gatggcttct tagcaccttc aacgacttca ccggtttcga     3300
tggaaacgac aaccatatcg tcagcagtca tgatggagta atcaacacca gatggcttga     3360
taacgaagac acctctttct ctgtcaacag cagagacgtt accccaggtc aaggtgacca     3420
agttgtgctt tggcaaagcc aagttagctt ccaaaacttg tctcttcaaa tcttccaaca     3480
ttttgtttgt ttatgtgtgt ttattcgaaa ctaagttctt ggtgttttaa aactaaaaaa     3540
aagactaact ataaaagtag aatttaagaa gtttaagaaa tagatttaca gaattacaat     3600
caatacctac cgtctttata tacttattag tcaagtaggg gaataatttc agggaactgg     3660
tttcaacctt ttttttcagc tttttccaaa tcagagagag cagaaggtaa tagaaggtgt     3720
aagaaaatga gatagataca tgcgtgggtc aattgccttg tgtcatcatt tactccaggc     3780
aggttgcatc actccattga ggttgtgccc gttttttgcc tgtttgtgcc cctgttctct     3840
gtagttgcgc taagagaatg gacctatgaa ctgatggttg gtgaagaaaa caatattttg     3900
gtgctgggat tctttttttt tctggatgcc agcttaaaaa gcgggctcca ttatatttag     3960
tggatgccag gaataaactg ttcacccaga cacctacgat gttatatatt ctgtgtaacc     4020
cgccccctat tttgggcatg tacgggttac agcagaatta aaaggctaat tttttgacta     4080
aataaagtta ggaaaatcac tactattaat tatttacgta ttctttgaaa tggcagtatt     4140
gataatgata aactcgaact agatctatcc gcggtgccgg cagatctatt taaatggcgc     4200
gccgacgtca ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt     4260
ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata     4320
atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt     4380
tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc     4440
tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat     4500
ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct     4560
atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca     4620
ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg     4680
catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa     4740
cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg     4800
ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga     4860
cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg     4920
cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt     4980
tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg     5040
agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc     5100
ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca     5160
gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc     5220
atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat     5280
cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc     5340
agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg     5400
ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct     5460
accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct     5520
tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct     5580
cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg     5640
gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc     5700
gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga     5760
gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg     5820
cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta     5880
tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg     5940
ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg     6000
ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat     6060
taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc     6120
agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc     6180
gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa     6240
cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc     6300
ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga     6360
ccatgattac gccaagcttt ttctttccaa tttttttttt ttcgtcatta taaaaatcat     6420
tacgaccgag attcccgggt aataactgat ataattaaat tgaagctcta atttgtgagt     6480
ttagtataca tgcatttact tataatacag ttttttagtt ttgctggccg catcttctca     6540
aatatgcttc ccagcctgct tttctgtaac gttcaccctc taccttagca tcccttccct     6600
ttgcaaatag tcctcttcca acaataataa tgtcagatcc tgtagagacc acatcatcca     6660
cggttctata ctgttgaccc aatgcgtctc ccttgtcatc taaacccaca ccgggtgtca     6720
taatcaacca atcgtaacct tcatctcttc cacccatgtc tctttgagca ataaagccga     6780
taacaaaatc tttgtcgctc ttcgcaatgt caacagtacc cttagtatat tctccagtag     6840
atagggagcc cttgcatgac aattctgcta acatcaaaag gcctctaggt tcctttgtta     6900
cttcttctgc cgcctgcttc aaaccgctaa caatacctgg gcccaccaca ccgtgtgcat     6960
tcgtaatgtc tgcccattct gctattctgt atacacccgc agagtactgc aatttgactg     7020
tattaccaat gtcagcaaat tttctgtctt cgaagagtaa aaaattgtac ttggcggata     7080
atgcctttag cggcttaact gtgccctcca tggaaaaatc agtcaagata tccacatgtg     7140
tttttagtaa acaaattttg ggacctaatg cttcaactaa ctccagtaat tccttggtgg     7200
tacgaacatc caatgaagca cacaagtttg tttgcttttc gtgcatgata ttaaatagct     7260
tggcagcaac aggactagga tgagtagcag cacgttcctt atatgtagct ttcgacatga     7320
tttatcttcg tttcctgcag gtttttgttc tgtgcagttg ggttaagaat actgggcaat     7380
ttcatgtttc ttcaacacta catatgcgta tatataccaa tctaagtctg tgctccttcc     7440
ttcgttcttc cttctgttcg gagattaccg aatcaaaaaa atttcaagga aaccgaaatc     7500
aaaaaaaaga ataaaaaaaa aatgatgaat tgaaaagctt gcatgcctgc aggtcgactc     7560
tagtatactc cgtctactgt acgatacact tccgctcagg tccttgtcct ttaacgaggc     7620
cttaccactc ttttgttact ctattgatcc agctcagcaa aggcagtgtg atctaagatt     7680
ctatcttcgc gatgtagtaa aactagctag accgagaaag agactagaaa tgcaaaaggc     7740
acttctacaa tggctgccat cattattatc cgatgtgacg ctgcattttt tttttttttt     7800
tttttttttt tttttttttt tttttttttt tttttttgta caaatatcat aaaaaaagag     7860
aatcttttta agcaaggatt ttcttaactt cttcggcgac agcatcaccg acttcggtgg     7920
tactgttgga accacctaaa tcaccagttc tgatacctgc atccaaaacc tttttaactg     7980
catcttcaat ggctttacct tcttcaggca agttcaatga caatttcaac atcattgcag     8040
cagacaagat agtggcgata gggttgacct tattctttgg caaatctgga gcggaaccat     8100
ggcatggttc gtacaaacca aatgcggtgt tcttgtctgg caaagaggcc aaggacgcag     8160
atggcaacaa acccaaggag cctgggataa cggaggcttc atcggagatg atatcaccaa     8220
acatgttgct ggtgattata ataccattta ggtgggttgg gttcttaact aggatcatgg     8280
cggcagaatc aatcaattga tgttgaactt tcaatgtagg gaattcgttc ttgatggttt     8340
cctccacagt ttttctccat aatcttgaag aggccaaaac attagcttta tccaaggacc     8400
aaataggcaa tggtggctca tgttgtaggg ccatgaaagc ggccattctt gtgattcttt     8460
gcacttctgg aacggtgtat tgttcactat cccaagcgac accatcacca tcgtcttcct     8520
ttctcttacc aaagtaaata cctcccacta attctctaac aacaacgaag tcagtacctt     8580
tagcaaattg tggcttgatt ggagataagt ctaaaagaga gtcggatgca aagttacatg     8640
gtcttaagtt ggcgtacaat tgaagttctt tacggatttt tagtaaacct tgttcaggtc     8700
taacactacc ggtaccccat ttaggaccac ccacagcacc taacaaaacg gcatcagcct     8760
tcttggaggc ttccagcgcc tcatctggaa gtggaacacc tgtagcatcg atagcagcac     8820
caccaattaa atgattttcg aaatcgaact tgacattgga acgaacatca gaaatagctt     8880
taagaacctt aatggcttcg gctgtgattt cttgaccaac gtggtcacct ggcaaaacga     8940
cgatcttctt aggggcagac attacaatgg tatatccttg aaatatatat aaaaaaaaaa     9000
aaaaaaaaaa aaaaaaaaaa tgcagcttct caatgatatt cgaatacgct ttgaggagat     9060
acagcctaat atccgacaaa ctgttttaca gatttacgat cgtacttgtt acccatcatt     9120
gaattttgaa catccgaacc tgggagtttt ccctgaaaca gatagtatat ttgaacctgt     9180
ataataatat atagtctagc gctttacgga agacaatgta tgtatttcgg ttcctggaga     9240
aactattgca tctattgcat aggtaatctt gcacgtcgca tccccggttc attttctgcg     9300
tttccatctt gcacttcaat agcatatctt tgttaacgaa gcatctgtgc ttcattttgt     9360
agaacaaaaa tgcaacgcga gagcgctaat ttttcaaaca aagaatctga gctgcatttt     9420
tacagaacag aaatgcaacg cgaaagcgct attttaccaa cgaagaatct gtgcttcatt     9480
tttgtaaaac aaaaatgcaa cgcgagagcg ctaatttttc aaacaaagaa tctgagctgc     9540
atttttacag aacagaaatg caacgcgaga gcgctatttt accaacaaag aatctatact     9600
tcttttttgt tctacaaaaa tgcatcccga gagcgctatt tttctaacaa agcatcttag     9660
attacttttt ttctcctttg tgcgctctat aatgcagtct cttgataact ttttgcactg     9720
taggtccgtt aaggttagaa gaaggctact ttggtgtcta ttttctcttc cataaaaaaa     9780
gcctgactcc acttcccgcg tttactgatt actagcgaag ctgcgggtgc attttttcaa     9840
gataaaggca tccccgatta tattctatac cgatgtggat tgcgcatact ttgtgaacag     9900
aaagtgatag cgttgatgat tcttcattgg tcagaaaatt atgaacggtt tcttctattt     9960
tgtctctata tactacgtat aggaaatgtt tacattttcg tattgttttc gattcactct    10020
atgaatagtt cttactacaa tttttttgtc taaagagtaa tactagagat aaacataaaa    10080
aatgtagagg tcgagtttag atgcaagttc aaggagcgaa aggtggatgg gtaggttata    10140
tagggatata gcacagagat atatagcaaa gagatacttt tgagcaatgt ttgtggaagc    10200
ggtattcgca atattttagt agctcgttac agtccggtgc gtttttggtt ttttgaaagt    10260
gcgtcttcag agcgcttttg gttttcaaaa gcgctctgaa gttcctatac tttctagaga    10320
ataggaactt cggaatagga acttcaaagc gtttccgaaa acgagcgctt ccgaaaatgc    10380
aacgcgagct gcgcacatac agctcactgt tcacgtcgca cctatatctg cgtgttgcct    10440
gtatatatat atacatgaga agaacggcat agtgcgtgtt tatgcttaaa tgcgtactta    10500
tatgcgtcta tttatgtagg atgaaaggta gtctagtacc tcctgtgata ttatcccatt    10560
ccatgcgggg tatcgtatgc ttccttcagc actacccttt agctgttcta tatgctgcca    10620
ctcctcaatt ggattagtct catccttcaa tgctatcatt tcctttgata ttggatcata    10680
tgcatagtac cgagaaacta gaggatctcc cattaccgac atttgggcgc tatacgtgca    10740
tatgttcatg tatgtatctg tatttaaaac acttttgtat tatttttcct catatatgtg    10800
tataggttta tacggatgat ttaattatta cttcaccacc ctttatttca ggctgatatc    10860
ttagccttgt tactagtcac cggtggc                                        10887

<210>  60
<211>  500
<212>  PRT
<213>  Escherichia coli

<400>  60
Met Thr Ile Phe Asp Asn Tyr Glu Val Trp Phe Val Ile Gly Ser Gln 
1               5                   10                  15      
His Leu Tyr Gly Pro Glu Thr Leu Arg Gln Val Thr Gln His Ala Glu 
            20                  25                  30          
His Val Val Asn Ala Leu Asn Thr Glu Ala Lys Leu Pro Cys Lys Leu 
        35                  40                  45              
Val Leu Lys Pro Leu Gly Thr Thr Pro Asp Glu Ile Thr Ala Ile Cys 
    50                  55                  60                  
Arg Asp Ala Asn Tyr Asp Asp Pro Cys Ala Gly Leu Val Val Trp Leu 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Met Trp Ile Asn Gly Leu Thr Met Leu 
                85                  90                  95      
Asn Lys Pro Leu Leu Gln Phe His Thr Gln Phe Asn Ala Ala Leu Pro 
            100                 105                 110         
Trp Asp Ser Ile Asp Met Asp Phe Met Asn Leu Asn Gln Thr Ala His 
        115                 120                 125             
Gly Gly Arg Glu Phe Gly Phe Ile Gly Ala Arg Met Arg Gln Gln His 
    130                 135                 140                 
Ala Val Val Thr Gly His Trp Gln Asp Lys Gln Ala His Glu Arg Ile 
145                 150                 155                 160 
Gly Ser Trp Met Arg Gln Ala Val Ser Lys Gln Asp Thr Arg His Leu 
                165                 170                 175     
Lys Val Cys Arg Phe Gly Asp Asn Met Arg Glu Val Ala Val Thr Asp 
            180                 185                 190         
Gly Asp Lys Val Ala Ala Gln Ile Lys Phe Gly Phe Ser Val Asn Thr 
        195                 200                 205             
Trp Ala Val Gly Asp Leu Val Gln Val Val Asn Ser Ile Ser Asp Gly 
    210                 215                 220                 
Asp Val Asn Ala Leu Val Asp Glu Tyr Glu Ser Cys Tyr Thr Met Thr 
225                 230                 235                 240 
Pro Ala Thr Gln Ile His Gly Glu Lys Arg Gln Asn Val Leu Glu Ala 
                245                 250                 255     
Ala Arg Ile Glu Leu Gly Met Lys Arg Phe Leu Glu Gln Gly Gly Phe 
            260                 265                 270         
His Ala Phe Thr Thr Thr Phe Glu Asp Leu His Gly Leu Lys Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Val Gln Arg Leu Met Gln Gln Gly Tyr Gly Phe Ala 
    290                 295                 300                 
Gly Glu Gly Asp Trp Lys Thr Ala Ala Leu Leu Arg Ile Met Lys Val 
305                 310                 315                 320 
Met Ser Thr Gly Leu Gln Gly Gly Thr Ser Phe Met Glu Asp Tyr Thr 
                325                 330                 335     
Tyr His Phe Glu Lys Gly Asn Asp Leu Val Leu Gly Ser His Met Leu 
            340                 345                 350         
Glu Val Cys Pro Ser Ile Ala Val Glu Glu Lys Pro Ile Leu Asp Val 
        355                 360                 365             
Gln His Leu Gly Ile Gly Gly Lys Asp Asp Pro Ala Arg Leu Ile Phe 
    370                 375                 380                 
Asn Thr Gln Thr Gly Pro Ala Ile Val Ala Ser Leu Ile Asp Leu Gly 
385                 390                 395                 400 
Asp Arg Tyr Arg Leu Leu Val Asn Cys Ile Asp Thr Val Lys Thr Pro 
                405                 410                 415     
His Ser Leu Pro Lys Leu Pro Val Ala Asn Ala Leu Trp Lys Ala Gln 
            420                 425                 430         
Pro Asp Leu Pro Thr Ala Ser Glu Ala Trp Ile Leu Ala Gly Gly Ala 
        435                 440                 445             
His His Thr Val Phe Ser His Ala Leu Asn Leu Asn Asp Met Arg Gln 
    450                 455                 460                 
Phe Ala Glu Met His Asp Ile Glu Ile Thr Val Ile Asp Asn Asp Thr 
465                 470                 475                 480 
Arg Leu Pro Ala Phe Lys Asp Ala Leu Arg Trp Asn Glu Val Tyr Tyr 
                485                 490                 495     
Gly Phe Arg Arg 
            500 

<210>  61
<211>  493
<212>  PRT
<213>  Bacillus licheniformis

<400>  61
Met Ile Gln Ala Lys Thr His Val Phe Trp Phe Val Thr Gly Ser Gln 
1               5                   10                  15      
His Leu Tyr Gly Glu Glu Ala Val Gln Glu Val Glu Glu His Ser Lys 
            20                  25                  30          
Met Ile Cys Asn Gly Leu Asn Asp Gly Asp Leu Arg Phe Gln Val Glu 
        35                  40                  45              
Tyr Lys Ala Val Ala Thr Ser Leu Asp Gly Val Arg Lys Leu Phe Glu 
    50                  55                  60                  
Glu Ala Asn Arg Asp Glu Glu Cys Ala Gly Ile Ile Thr Trp Met His 
65                  70                  75                  80  
Thr Phe Ser Pro Ala Lys Met Trp Ile Pro Gly Leu Ser Glu Leu Asn 
                85                  90                  95      
Lys Pro Leu Leu His Phe His Thr Gln Phe Asn Arg Asp Ile Pro Trp 
            100                 105                 110         
Asp Lys Ile Asp Met Asp Phe Met Asn Ile Asn Gln Ser Ala His Gly 
        115                 120                 125             
Asp Arg Glu Tyr Gly Phe Ile Gly Ala Arg Leu Gly Ile Pro Arg Lys 
    130                 135                 140                 
Val Ile Ala Gly Tyr Trp Glu Asp Arg Glu Val Lys Arg Ser Ile Asp 
145                 150                 155                 160 
Lys Trp Met Ser Ala Ala Val Ala Tyr Ile Glu Ser Arg His Ile Lys 
                165                 170                 175     
Val Ala Arg Phe Gly Asp Asn Met Arg Asn Val Ala Val Thr Glu Gly 
            180                 185                 190         
Asp Lys Ile Glu Ala Gln Ile Gln Leu Gly Trp Ser Val Asp Gly Tyr 
        195                 200                 205             
Gly Ile Gly Asp Leu Val Thr Glu Ile Asn Ala Val Ser Glu Gln Ser 
    210                 215                 220                 
Leu Ser Glu Leu Ile Ser Glu Tyr Glu Glu Leu Tyr Glu Trp Pro Glu 
225                 230                 235                 240 
Gly Glu Ala Ala Arg Glu Ser Val Lys Glu Gln Ala Arg Ile Glu Leu 
                245                 250                 255     
Gly Leu Lys Arg Phe Leu Ser Ser Gly Gly Tyr Thr Ala Phe Thr Thr 
            260                 265                 270         
Thr Phe Glu Asp Leu His Gly Met Lys Gln Leu Pro Gly Leu Ala Val 
        275                 280                 285             
Gln Arg Leu Met Ala Glu Gly Tyr Gly Phe Gly Gly Glu Gly Asp Trp 
    290                 295                 300                 
Lys Thr Ala Ala Leu Val Arg Met Met Lys Met Met Ala Gly Gly Lys 
305                 310                 315                 320 
Glu Thr Ser Phe Met Glu Asp Tyr Thr Tyr His Phe Glu Pro Gly Asn 
                325                 330                 335     
Glu Met Ile Leu Gly Ser His Met Leu Glu Val Cys Pro Ser Ile Ala 
            340                 345                 350         
Glu His Lys Pro Arg Ile Glu Val His Pro Leu Ser Met Gly Ala Lys 
        355                 360                 365             
Asp Asp Pro Ala Arg Leu Val Phe Asp Gly Ile Ala Gly Pro Ala Val 
    370                 375                 380                 
Asn Val Ser Leu Ile Asp Leu Gly Gly Arg Phe Arg Leu Val Ile Asn 
385                 390                 395                 400 
Lys Val Glu Ala Val Lys Val Pro His Asp Met Pro Asn Leu Pro Val 
                405                 410                 415     
Ala Arg Val Leu Trp Lys Pro Gln Pro Ser Leu Arg Thr Ser Ala Glu 
            420                 425                 430         
Ala Trp Ile Leu Ala Gly Gly Ala His His Thr Cys Leu Ser Tyr Gln 
        435                 440                 445             
Leu Thr Ala Glu Gln Met Leu Asp Trp Ala Glu Met Ser Gly Ile Glu 
    450                 455                 460                 
Ala Val Leu Ile Asn Arg Asp Thr Thr Ile Leu Asn Leu Arg Asn Glu 
465                 470                 475                 480 
Leu Lys Trp Ser Glu Ala Ala Tyr Arg Leu Arg Lys Phe 
                485                 490             

<210>  62
<211>  488
<212>  PRT
<213>  Clostridium acetobutylicum

<400>  62
Met Leu Glu Asn Lys Lys Met Glu Phe Trp Phe Val Val Gly Ser Gln 
1               5                   10                  15      
His Leu Tyr Gly Glu Glu Ala Leu Lys Glu Val Arg Lys Asn Ser Glu 
            20                  25                  30          
Thr Ile Val Asp Glu Leu Asn Lys Ser Ala Asn Leu Pro Tyr Lys Ile 
        35                  40                  45              
Ile Phe Lys Asp Leu Ala Thr Ser Ala Asp Lys Ile Lys Glu Ile Met 
    50                  55                  60                  
Lys Glu Val Asn Tyr Arg Asp Glu Val Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Met Trp Ile Ala Gly Thr Lys Ile Leu 
                85                  90                  95      
Gln Lys Pro Leu Leu His Phe Ala Thr Gln Tyr Asn Glu Asn Ile Pro 
            100                 105                 110         
Trp Lys Thr Ile Asp Met Asp Tyr Met Asn Leu His Gln Ser Ala His 
        115                 120                 125             
Gly Asp Arg Glu Tyr Gly Phe Ile Asn Ala Arg Leu Lys Lys His Asn 
    130                 135                 140                 
Lys Val Val Val Gly Tyr Trp Lys Asp Lys Glu Val Gln Lys Gln Val 
145                 150                 155                 160 
Ser Asp Trp Met Lys Val Ala Ala Gly Tyr Ile Ala Ser Glu Ser Ile 
                165                 170                 175     
Lys Val Ala Arg Phe Gly Asp Asn Met Arg Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Gln Ile Gln Phe Gly Trp Thr Val Asp Tyr 
        195                 200                 205             
Phe Gly Ile Gly Asp Leu Val Ala Glu Met Asp Lys Val Ser Gln Asp 
    210                 215                 220                 
Glu Ile Asn Lys Thr Tyr Glu Glu Phe Lys Asp Leu Tyr Ile Leu Asp 
225                 230                 235                 240 
Pro Gly Glu Asn Asp Pro Ala Phe Tyr Glu Lys Gln Val Lys Glu Gln 
                245                 250                 255     
Ile Lys Ile Glu Ile Gly Leu Arg Arg Phe Leu Glu Lys Gly Asn Tyr 
            260                 265                 270         
Asn Ala Phe Thr Thr Asn Phe Glu Asp Leu Tyr Gly Met Lys Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Val Gln Arg Leu Asn Ala Glu Gly Tyr Gly Phe Ala 
    290                 295                 300                 
Gly Glu Gly Asp Trp Lys Thr Ala Ala Leu Asp Arg Leu Leu Lys Val 
305                 310                 315                 320 
Met Thr Asn Asn Thr Ala Thr Gly Phe Met Glu Asp Tyr Thr Tyr Glu 
                325                 330                 335     
Leu Ser Arg Gly Asn Glu Lys Ala Leu Gly Ala His Met Leu Glu Val 
            340                 345                 350         
Asp Pro Thr Phe Ala Ser Asp Lys Pro Lys Val Ile Val Lys Pro Leu 
        355                 360                 365             
Gly Ile Gly Asp Lys Glu Asp Pro Ala Arg Leu Ile Phe Asn Gly Ser 
    370                 375                 380                 
Thr Gly Lys Gly Val Ala Val Ser Met Leu Asp Leu Gly Thr His Tyr 
385                 390                 395                 400 
Arg Leu Ile Ile Asn Gly Leu Thr Ala Val Lys Pro Asp Glu Asp Met 
                405                 410                 415     
Pro Asn Leu Pro Val Ala Lys Met Val Trp Lys Pro Glu Pro Asn Phe 
            420                 425                 430         
Ile Glu Gly Val Lys Ser Trp Ile Tyr Ala Gly Gly Gly His His Thr 
        435                 440                 445             
Val Val Ser Leu Glu Leu Thr Val Glu Gln Val Tyr Asp Trp Ser Arg 
    450                 455                 460                 
Met Val Gly Leu Glu Ala Val Ile Ile Asp Lys Asp Thr Lys Leu Arg 
465                 470                 475                 480 
Asp Ile Ile Glu Lys Thr Thr Lys 
                485             

<210>  63
<211>  474
<212>  PRT
<213>  Leuconostoc mesenteroides

<400>  63
Met Ala Asp Ile Lys Asp Tyr Lys Phe Trp Phe Val Thr Gly Ser Gln 
1               5                   10                  15      
Phe Leu Tyr Gly Pro Glu Val Leu Lys Gln Val Glu Glu Asp Ser Lys 
            20                  25                  30          
Lys Ile Ile Glu Lys Leu Asn Glu Ser Gly Asn Leu Pro Tyr Pro Ile 
        35                  40                  45              
Glu Phe Lys Thr Val Gly Val Thr Ala Glu Asn Ile Thr Glu Ala Met 
    50                  55                  60                  
Lys Glu Ala Asn Tyr Asp Asp Ser Val Ala Gly Val Ile Thr Trp Ala 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Asn Trp Ile Arg Gly Thr Gln Leu Leu 
                85                  90                  95      
Asn Lys Pro Leu Leu His Leu Ala Thr Gln Met Leu Asn Asn Ile Pro 
            100                 105                 110         
Tyr Asp Ser Ile Asp Phe Asp Tyr Met Asn Leu Asn Gln Ser Ala His 
        115                 120                 125             
Gly Asp Arg Glu Tyr Ala Phe Ile Asn Ala Arg Leu Arg Leu Asn Asn 
    130                 135                 140                 
Lys Ile Val Phe Gly His Trp Ala Asp Glu Ala Val Gln Val Gln Ile 
145                 150                 155                 160 
Gly Lys Trp Met Asp Val Ala Val Ala Tyr Glu Glu Ser Phe Lys Ile 
                165                 170                 175     
Lys Val Val Thr Phe Ala Asp Lys Met Arg Asn Val Ala Val Thr Asp 
            180                 185                 190         
Gly Asp Lys Ile Glu Ala Gln Ile Lys Phe Gly Trp Thr Val Asp Tyr 
        195                 200                 205             
Trp Gly Val Gly Asp Leu Val Thr Tyr Val Asn Ala Ile Asp Asp Ala 
    210                 215                 220                 
Asp Ile Asp Asn Leu Tyr Ile Glu Leu Gln Asp Lys Tyr Asp Phe Val 
225                 230                 235                 240 
Ala Gly Gln Asn Asp Ser Glu Lys Tyr Glu His Asn Val Lys Tyr Gln 
                245                 250                 255     
Leu Arg Glu Tyr Leu Gly Ile Lys Arg Phe Leu Thr Asp Lys Gly Tyr 
            260                 265                 270         
Ser Ala Phe Thr Thr Asn Phe Glu Asp Leu Val Gly Leu Glu Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Ala Gln Leu Leu Met Ala Asp Gly Phe Gly Phe Ala 
    290                 295                 300                 
Gly Glu Gly Asp Trp Lys Thr Ala Ala Leu Thr Arg Leu Leu Lys Ile 
305                 310                 315                 320 
Val Ser His Asn Gln Ala Thr Ala Phe Met Glu Asp Tyr Thr Leu Asp 
                325                 330                 335     
Leu Arg Gln Gly His Glu Ala Ile Leu Gly Ser His Met Leu Glu Val 
            340                 345                 350         
Asp Pro Thr Ile Ala Ser Asp Lys Pro Arg Val Glu Val His Pro Leu 
        355                 360                 365             
Gly Ile Gly Gly Lys Glu Asp Pro Ala Arg Leu Val Phe Ser Gly Arg 
    370                 375                 380                 
Thr Gly Asp Ala Val Asp Val Thr Ile Ser Asp Phe Gly Asp Glu Phe 
385                 390                 395                 400 
Lys Leu Ile Ser Tyr Asp Val Thr Gly Asn Lys Pro Glu Ala Glu Thr 
                405                 410                 415     
Pro Tyr Leu Pro Val Ala Lys Gln Leu Trp Thr Pro Lys Ala Gly Leu 
            420                 425                 430         
Lys Ala Gly Ala Glu Gly Trp Leu Thr Val Gly Gly Gly His His Thr 
        435                 440                 445             
Thr Leu Ser Phe Ser Val Asp Ser Glu Gln Leu Thr Asp Leu Ala Asn 
    450                 455                 460                 
Leu Phe Gly Val Thr Tyr Val Asp Ile Lys 
465                 470                 

<210>  64
<211>  474
<212>  PRT
<213>  Lactobacillus plantarum

<400>  64
Met Leu Ser Val Pro Asp Tyr Glu Phe Trp Phe Val Thr Gly Ser Gln 
1               5                   10                  15      
His Leu Tyr Gly Glu Glu Gln Leu Lys Ser Val Ala Lys Asp Ala Gln 
            20                  25                  30          
Asp Ile Ala Asp Lys Leu Asn Ala Ser Gly Lys Leu Pro Tyr Lys Val 
        35                  40                  45              
Val Phe Lys Asp Val Met Thr Thr Ala Glu Ser Ile Thr Asn Phe Met 
    50                  55                  60                  
Lys Glu Val Asn Tyr Asn Asp Lys Val Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Asn Trp Ile Arg Gly Thr Glu Leu Leu 
                85                  90                  95      
Gln Lys Pro Leu Leu His Leu Ala Thr Gln Tyr Leu Asn Asn Ile Pro 
            100                 105                 110         
Tyr Ala Asp Ile Asp Phe Asp Tyr Met Asn Leu Asn Gln Ser Ala His 
        115                 120                 125             
Gly Asp Arg Glu Tyr Ala Tyr Ile Asn Ala Arg Leu Gln Lys His Asn 
    130                 135                 140                 
Lys Ile Val Tyr Gly Tyr Trp Gly Asp Glu Asp Val Gln Glu Gln Ile 
145                 150                 155                 160 
Ala Arg Trp Glu Asp Val Ala Val Ala Tyr Asn Glu Ser Phe Lys Val 
                165                 170                 175     
Lys Val Ala Arg Phe Gly Asp Thr Met Arg Asn Val Ala Val Thr Glu 
            180                 185                 190         
Gly Asp Lys Val Glu Ala Gln Ile Lys Met Gly Trp Thr Val Asp Tyr 
        195                 200                 205             
Tyr Gly Ile Gly Asp Leu Val Glu Glu Ile Asn Lys Val Ser Asp Ala 
    210                 215                 220                 
Asp Val Asp Lys Glu Tyr Ala Asp Leu Glu Ser Arg Tyr Glu Met Val 
225                 230                 235                 240 
Gln Gly Asp Asn Asp Ala Asp Thr Tyr Lys His Ser Val Arg Val Gln 
                245                 250                 255     
Leu Ala Gln Tyr Leu Gly Ile Lys Arg Phe Leu Glu Arg Gly Gly Tyr 
            260                 265                 270         
Thr Ala Phe Thr Thr Asn Phe Glu Asp Leu Trp Gly Met Glu Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Ser Gln Leu Leu Ile Arg Asp Gly Tyr Gly Phe Gly 
    290                 295                 300                 
Ala Glu Gly Asp Trp Lys Thr Ala Ala Leu Gly Arg Val Met Lys Ile 
305                 310                 315                 320 
Met Ser His Asn Lys Gln Thr Ala Phe Met Glu Asp Tyr Thr Leu Asp 
                325                 330                 335     
Leu Arg His Gly His Glu Ala Ile Leu Gly Ser His Met Leu Glu Val 
            340                 345                 350         
Asp Pro Ser Ile Ala Ser Asp Lys Pro Arg Val Glu Val His Pro Leu 
        355                 360                 365             
Asp Ile Gly Gly Lys Asp Asp Pro Ala Arg Leu Val Phe Thr Gly Ser 
    370                 375                 380                 
Glu Gly Glu Ala Ile Asp Val Thr Val Ala Asp Phe Arg Asp Gly Phe 
385                 390                 395                 400 
Lys Met Ile Ser Tyr Ala Val Asp Ala Asn Lys Pro Glu Ala Glu Thr 
                405                 410                 415     
Pro Asn Leu Pro Val Ala Lys Gln Leu Trp Thr Pro Lys Met Gly Leu 
            420                 425                 430         
Lys Lys Gly Ala Leu Glu Trp Met Gln Ala Gly Gly Gly His His Thr 
        435                 440                 445             
Met Leu Ser Phe Ser Leu Thr Glu Glu Gln Met Glu Asp Tyr Ala Thr 
    450                 455                 460                 
Met Val Gly Met Thr Lys Ala Phe Leu Lys 
465                 470                 

<210>  65
<211>  474
<212>  PRT
<213>  Pediococcus pentosaceus

<400>  65
Met Lys Lys Val Gln Asp Tyr Glu Phe Trp Phe Val Thr Gly Ser Gln 
1               5                   10                  15      
Phe Leu Tyr Gly Glu Glu Thr Leu Arg Ser Val Glu Lys Asp Ala Lys 
            20                  25                  30          
Glu Ile Val Asp Lys Leu Asn Glu Ser Lys Lys Leu Pro Tyr Pro Val 
        35                  40                  45              
Lys Phe Lys Leu Val Ala Thr Thr Ala Glu Asn Ile Thr Glu Val Met 
    50                  55                  60                  
Lys Glu Val Asn Tyr Asn Asp Lys Val Ala Gly Val Ile Thr Trp Met 
65                  70                  75                  80  
His Thr Phe Ser Pro Ala Lys Asn Trp Ile Arg Gly Thr Glu Leu Leu 
                85                  90                  95      
Gln Lys Pro Leu Leu His Leu Ala Thr Gln Phe Leu Asn His Ile Pro 
            100                 105                 110         
Tyr Asp Thr Ile Asp Phe Asp Tyr Met Asn Leu Asn Gln Ser Ala His 
        115                 120                 125             
Gly Asp Arg Glu Tyr Ala Phe Ile Asn Ala Arg Leu Arg Lys Asn Asn 
    130                 135                 140                 
Lys Ile Ile Ser Gly Tyr Trp Gly Asp Glu Gly Ile Gln Lys Gln Ile 
145                 150                 155                 160 
Ala Lys Trp Met Asp Val Ala Val Ala Tyr Asn Glu Ser Tyr Gly Ile 
                165                 170                 175     
Lys Val Val Thr Phe Ala Asp Lys Met Arg Asn Val Ala Val Thr Asp 
            180                 185                 190         
Gly Asp Lys Ile Glu Ala Gln Ile Lys Phe Gly Trp Thr Val Asp Tyr 
        195                 200                 205             
Trp Gly Val Ala Asp Leu Val Glu Glu Val Asn Ala Val Ser Asp Glu 
    210                 215                 220                 
Asp Ile Asp Lys Lys Tyr Glu Glu Met Lys Asn Asp Tyr Asn Phe Val 
225                 230                 235                 240 
Glu Gly Gln Asn Ser Ser Glu Lys Phe Glu His Asn Thr Lys Tyr Gln 
                245                 250                 255     
Ile Arg Glu Tyr Phe Gly Leu Lys Lys Phe Met Asp Asp Arg Gly Tyr 
            260                 265                 270         
Thr Ala Phe Thr Thr Asn Phe Glu Asp Leu Ala Gly Leu Glu Gln Leu 
        275                 280                 285             
Pro Gly Leu Ala Ala Gln Met Leu Met Ala Glu Gly Tyr Gly Phe Ala 
    290                 295                 300                 
Gly Glu Gly Asp Trp Lys Thr Ala Ala Leu Asp Arg Leu Leu Lys Ile 
305                 310                 315                 320 
Met Ala His Asn Lys Gln Thr Val Phe Met Glu Asp Tyr Thr Leu Asp 
                325                 330                 335     
Leu Arg Glu Gly His Glu Ala Ile Leu Gly Ser His Met Leu Glu Val 
            340                 345                 350         
Asp Pro Ser Ile Ala Ser Asp Thr Pro Arg Val Glu Val His Pro Leu 
        355                 360                 365             
Asp Ile Gly Gly Lys Glu Asp Pro Ala Arg Phe Val Phe Thr Gly Met 
    370                 375                 380                 
Glu Gly Asp Ala Val Asp Val Thr Met Ala Asp Tyr Gly Asp Glu Phe 
385                 390                 395                 400 
Lys Leu Met Ser Tyr Asp Val Thr Gly Asn Lys Thr Glu Lys Glu Thr 
                405                 410                 415     
Pro Tyr Leu Pro Val Ala Lys Gln Leu Trp Thr Pro Lys Gln Gly Trp 
            420                 425                 430         
Lys Gln Gly Ala Glu Gly Trp Leu Thr Leu Gly Gly Gly His His Thr 
        435                 440                 445             
Val Leu Ser Phe Ala Ile Asp Ala Glu Gln Leu Gln Asp Leu Ser Asn 
    450                 455                 460                 
Met Phe Gly Leu Thr Tyr Val Asn Ile Lys 
465                 470                 

<210>  66
<211>  33
<212>  PRT
<213>  artificial sequence

<220>
<223>  237-269 amino acid motif with variation at multiple positions

<220>
<221>  VARIANT
<222>  (2)..(2)
<223>  R or K

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  R or K

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  I or M

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  E or K

<220>
<221>  VARIANT
<222>  (14)..(14)
<223>  I or M

<220>
<221>  VARIANT
<222>  (15)..(15)
<223>  L or M

<220>
<221>  VARIANT
<222>  (16)..(16)
<223>  V or T or D

<220>
<221>  VARIANT
<222>  (17)..(17)
<223>  R or A

<220>
<221>  VARIANT
<222>  (18)..(18)
<223>  E or N

<220>
<221>  VARIANT
<222>  (20)..(20)
<223>  A or C

<220>
<221>  VARIANT
<222>  (21)..(21)
<223>  K or N or R

<220>
<221>  VARIANT
<222>  (24)..(24)
<223>  S or V or T

<220>
<221>  VARIANT
<222>  (28)..(28)
<223>  E or Q

<220>
<221>  VARIANT
<222>  (31)..(31)
<223>  H or I or Y

<400>  66
Ile Xaa Tyr Gln Ala Xaa Glu Glu Ile Ala Xaa Xaa Lys Xaa Xaa Xaa 
1               5                   10                  15      
Xaa Xaa Gly Xaa Xaa Ala Phe Xaa Asn Thr Phe Xaa Asp Leu Xaa Gly 
            20                  25                  30          
Met 
    

<210>  67
<211>  33
<212>  PRT
<213>  artificial sequence

<220>
<223>  Amino acid sequence of the 237-269 Motif shown in FIG. 2A

<220>
<221>  MISC_FEATURE
<222>  (2)..(2)
<223>  Arg or Lys

<220>
<221>  MISC_FEATURE
<222>  (6)..(6)
<223>  Arg or Lys

<220>
<221>  MISC_FEATURE
<222>  (11)..(11)
<223>  Ile or Met

<220>
<221>  MISC_FEATURE
<222>  (12)..(12)
<223>  any amino acid

<220>
<221>  MISC_FEATURE
<222>  (14)..(14)
<223>  Ile or Met

<220>
<221>  MISC_FEATURE
<222>  (15)..(15)
<223>  Leu or Met

<220>
<221>  MISC_FEATURE
<222>  (16)..(16)
<223>  any amino acid

<220>
<221>  MISC_FEATURE
<222>  (17)..(17)
<223>  Arg or Ala

<220>
<221>  MISC_FEATURE
<222>  (18)..(18)
<223>  Glu or Asn

<220>
<221>  MISC_FEATURE
<222>  (20)..(20)
<223>  Ala or Cys

<220>
<221>  MISC_FEATURE
<222>  (21)..(21)
<223>  any amino acid

<220>
<221>  MISC_FEATURE
<222>  (24)..(24)
<223>  any amino acid

<220>
<221>  MISC_FEATURE
<222>  (28)..(28)
<223>  Gln or Glu

<220>
<221>  MISC_FEATURE
<222>  (31)..(31)
<223>  any amino acid

<220>
<221>  MISC_FEATURE
<222>  (32)..(32)
<223>  any amino acid

<400>  67
Ile Xaa Tyr Gln Ala Xaa Glu Glu Ile Ala Xaa Xaa Lys Xaa Xaa Xaa 
1               5                   10                  15      
Xaa Xaa Gly Xaa Xaa Ala Phe Xaa Asn Thr Phe Xaa Asp Leu Xaa Gly 
            20                  25                  30          
Met 
    

<210>  68
<211>  477
<212>  PRT
<213>  Ruminococcus flavefaciens

<400>  68
Met Lys Phe Trp Phe Val Thr Gly Ser Gln Phe Leu Tyr Gly Glu Glu 
1               5                   10                  15      
Thr Leu Arg Gln Val Glu Glu Asp Ser Lys Lys Ile Val Asp Gly Leu 
            20                  25                  30          
Asp Leu Pro Phe Pro Val Glu Tyr Lys Met Thr Val Lys Lys Glu Ser 
        35                  40                  45              
Glu Ile Glu Arg Ile Ile Lys Glu Ala Asn Tyr Asp Asp Glu Cys Ala 
    50                  55                  60                  
Gly Ile Ile Thr Phe Cys His Thr Phe Ser Pro Ser Lys Met Trp Ile 
65                  70                  75                  80  
Asn Gly Leu Ala Leu Leu Gln Lys Pro Trp Leu His Phe His Thr Gln 
                85                  90                  95      
Phe Asn Glu Thr Ile Pro Asn Glu Gly Ile Asp Met Asp Tyr Met Asn 
            100                 105                 110         
Leu His Gln Ser Ala His Gly Asp Arg Glu His Gly Phe Ile Gly Ala 
        115                 120                 125             
Arg Leu Arg Met Pro Arg Ala Val Val Ala Gly His Trp Lys Asp Lys 
    130                 135                 140                 
Lys Val Gln Glu Lys Ile Ala Glu Trp Gln Arg Ala Ala Val Gly Ala 
145                 150                 155                 160 
Leu Phe Ser Lys Ser Leu Lys Ile Val Arg Phe Gly Asp Asn Met Arg 
                165                 170                 175     
Glu Val Ala Val Thr Glu Gly Asp Lys Ile Glu Ala Gln Leu Lys Leu 
            180                 185                 190         
Gly Trp Gln Val Asn Thr Phe Ala Val Gly Asp Leu Val Glu Ile Met 
        195                 200                 205             
Asn Ala Val Lys Asp Ala Glu Ile Asp Glu Leu Met Lys Glu Tyr Ala 
    210                 215                 220                 
Glu Leu Tyr Asp Tyr Asp Lys Ala Asp Glu Glu Thr Ile Arg Tyr Gln 
225                 230                 235                 240 
Ala Arg Glu Glu Ile Ala Ile Glu Lys Ile Leu Val Arg Glu Gly Ala 
                245                 250                 255     
Lys Ala Phe Ser Asn Thr Phe Glu Asp Leu His Gly Met Arg Gln Leu 
            260                 265                 270         
Pro Gly Leu Ala Thr Gln His Leu Met His Lys Gly Tyr Gly Phe Gly 
        275                 280                 285             
Ala Glu Gly Asp Trp Lys Thr Ala Gly Met Thr Ala Ile Val Lys Ala 
    290                 295                 300                 
Met Tyr Pro Glu Gly Asn Thr Ser Phe Met Glu Asp Tyr Thr Tyr Asp 
305                 310                 315                 320 
Tyr Lys His Glu Leu Ile Leu Gly Ser His Met Leu Glu Val Cys Pro 
                325                 330                 335     
Ser Ile Ala Ala Asp Lys Pro Arg Ile Glu Val His Lys Leu Gly Ile 
            340                 345                 350         
Gly Gly Lys Glu Ala Pro Ala Arg Ile Val Phe Glu Gly Arg Ala Gly 
        355                 360                 365             
Ser Ala Lys Ala Leu Ser Leu Ile Asp Ile Gly Gly Arg Phe Arg Leu 
    370                 375                 380                 
Ile Ser Gln Asp Val Glu Cys Glu Lys Pro Phe Gln Ser Met Pro Asn 
385                 390                 395                 400 
Leu Pro Val Ala Arg Thr Met Trp Lys Pro Ala Pro Ser Phe Leu Glu 
                405                 410                 415     
Gly Leu Glu Cys Trp Ile Ile Ala Gly Gly Ala His His Thr Val Leu 
            420                 425                 430         
Ser Tyr Asp Ile Thr Asp Glu Thr Val Arg Asp Phe Ala Arg Ile Met 
        435                 440                 445             
Gly Ile Glu Leu Val Val Ile Asn Lys Asp Thr Thr Lys Glu Lys Leu 
    450                 455                 460                 
Glu Arg Asp Ile Met Ile Gly Asp Met Ile Tyr Gly Arg 
465                 470                 475         

<210>  69
<211>  477
<212>  PRT
<213>  Ruminococcus flavefaciens

<400>  69
Met Lys Phe Trp Phe Ile Thr Gly Ser Gln Phe Leu Tyr Gly Glu Glu 
1               5                   10                  15      
Thr Leu Arg Gln Val Asp Glu Asp Ser Lys Lys Ile Val Ala Gly Leu 
            20                  25                  30          
Lys Leu Pro Phe Pro Val Glu Tyr Lys Ser Thr Val Lys Thr Glu Ser 
        35                  40                  45              
Glu Ile Gln Arg Ile Ile Lys Glu Ala Asn Phe Asp Asp Glu Cys Ala 
    50                  55                  60                  
Gly Val Ile Thr Phe Cys His Thr Phe Ser Pro Ser Lys Met Trp Ile 
65                  70                  75                  80  
Asn Gly Leu Ala Leu Leu Gln Lys Pro Trp Leu His Phe His Thr Gln 
                85                  90                  95      
Phe Asn Glu Thr Ile Pro Asn Glu Ala Ile Asp Met Asp Tyr Met Asn 
            100                 105                 110         
Leu His Gln Ser Ala His Gly Asp Arg Glu His Gly Phe Ile Gly Ala 
        115                 120                 125             
Arg Leu Arg Met Pro Arg Ala Val Val Ala Gly His Trp Gln Asp Pro 
    130                 135                 140                 
Glu Val Gln Ala Lys Ile Ala Glu Trp Gln Arg Ala Ala Val Gly Val 
145                 150                 155                 160 
Met Phe Ser Lys Ser Leu Lys Ile Val Arg Phe Gly Asp Asn Met Arg 
                165                 170                 175     
Glu Val Ala Val Thr Glu Gly Asp Lys Ile Glu Ala Gln Leu Lys Leu 
            180                 185                 190         
Gly Trp Gln Val Asn Thr Phe Ala Val Gly Asp Leu Val Glu Ile Met 
        195                 200                 205             
Asn Ala Val Thr Asp Ala Glu Ile Asp Ala Leu Met Lys Glu Tyr Ala 
    210                 215                 220                 
Glu Leu Tyr Asp Tyr Lys Lys Glu Asp Glu Glu Thr Ile Arg Tyr Gln 
225                 230                 235                 240 
Ala Arg Glu Glu Ile Ala Ile Glu Lys Ile Leu Val Arg Glu Gly Ala 
                245                 250                 255     
Lys Ala Tyr Ser Asn Thr Phe Glu Asp Leu His Gly Met Lys Gln Leu 
            260                 265                 270         
Pro Gly Leu Ala Thr Gln His Leu Met His Lys Gly Tyr Gly Phe Gly 
        275                 280                 285             
Ala Glu Gly Asp Trp Lys Thr Ala Gly Met Thr Ala Ile Val Lys Ala 
    290                 295                 300                 
Met Tyr Pro Glu Gly Asn Thr Ser Phe Met Glu Asp Tyr Thr Tyr Asp 
305                 310                 315                 320 
Tyr Lys Gln Glu Leu Ile Leu Gly Ser His Met Leu Glu Val Cys Pro 
                325                 330                 335     
Ser Ile Ala Ala Asp Arg Pro Arg Ile Glu Val His Lys Leu Gly Ile 
            340                 345                 350         
Gly Gly Lys Glu Pro Pro Ala Arg Ile Val Phe Glu Gly Lys Ala Gly 
        355                 360                 365             
Ser Ala Lys Val Leu Ser Leu Ile Asp Ile Gly Gly Arg Leu Arg Leu 
    370                 375                 380                 
Ile Gln Gln Asp Ile Glu Cys Val Lys Pro Phe Gln Ser Met Pro Asn 
385                 390                 395                 400 
Leu Pro Val Ala Arg Thr Met Trp Arg Pro Ala Pro Ser Phe Leu Asp 
                405                 410                 415     
Gly Leu Glu Cys Trp Ile Ile Ala Gly Gly Ala His His Thr Val Leu 
            420                 425                 430         
Ser Tyr Asp Ile Ser Asp Glu Ala Val Arg Ser Phe Ala Arg Ile Met 
        435                 440                 445             
Gly Ile Glu Leu Val Val Ile Asn Lys Asp Thr Thr Val Asn Gly Leu 
    450                 455                 460                 
Glu Arg Asp Ile Met Ile Gly Asp Val Ile Tyr Gly Arg 
465                 470                 475         

<210>  70
<211>  477
<212>  PRT
<213>  Ruminococcus flavefaciens

<400>  70
Met Lys Phe Trp Phe Ile Thr Gly Ser Gln Phe Leu Tyr Gly Glu Glu 
1               5                   10                  15      
Thr Leu Arg Gln Val Asp Glu Asp Ser Lys Lys Ile Val Ala Gly Leu 
            20                  25                  30          
Lys Leu Pro Phe Pro Val Glu Tyr Lys Ser Thr Val Lys Thr Glu Arg 
        35                  40                  45              
Glu Ile Glu Arg Ile Ile Lys Glu Ala Asn Tyr Asp Asp Glu Cys Ala 
    50                  55                  60                  
Gly Ile Ile Thr Phe Cys His Thr Phe Ser Pro Ser Lys Met Trp Ile 
65                  70                  75                  80  
Asn Gly Leu Ala Leu Leu Gln Lys Pro Trp Leu His Phe His Thr Gln 
                85                  90                  95      
Phe Asn Glu Thr Ile Pro Asn Glu Ala Ile Asp Met Asp Tyr Met Asn 
            100                 105                 110         
Leu His Gln Ser Ala His Gly Asp Arg Glu His Gly Phe Ile Gly Ala 
        115                 120                 125             
Arg Leu Arg Met Pro Arg Ala Val Val Ala Gly His Trp Gln Asp Pro 
    130                 135                 140                 
Glu Val Gln Ala Lys Ile Ala Glu Trp Gln Arg Ala Ala Val Gly Val 
145                 150                 155                 160 
Met Phe Ser Lys Ser Leu Lys Ile Val Arg Phe Gly Asp Asn Met Arg 
                165                 170                 175     
Glu Val Ala Val Thr Glu Gly Asp Lys Val Glu Ala Gln Leu Lys Leu 
            180                 185                 190         
Gly Trp Gln Val Asn Thr Phe Ala Val Gly Asp Leu Val Glu Ile Met 
        195                 200                 205             
Asn Ala Val Thr Asn Thr Glu Ile Asp Ala Leu Met Lys Glu Tyr Ala 
    210                 215                 220                 
Glu Leu Tyr Asp Tyr Lys Lys Glu Asp Glu Glu Thr Ile Arg Tyr Gln 
225                 230                 235                 240 
Ala Arg Glu Glu Ile Ala Ile Glu Lys Ile Leu Val Arg Glu Gly Ala 
                245                 250                 255     
Lys Ala Phe Ser Asn Thr Phe Glu Asp Leu His Gly Met Lys Gln Leu 
            260                 265                 270         
Pro Gly Leu Ala Thr Gln His Leu Met His Lys Gly Tyr Gly Phe Gly 
        275                 280                 285             
Ala Glu Gly Asp Trp Lys Thr Ala Gly Met Thr Ala Ile Val Lys Ala 
    290                 295                 300                 
Met Tyr Pro Asp Gly Asn Thr Ser Phe Met Glu Asp Tyr Thr Tyr Asp 
305                 310                 315                 320 
Tyr Lys Gln Gln Leu Ile Leu Gly Ser His Met Leu Glu Val Cys Pro 
                325                 330                 335     
Ser Ile Ala Ala Asp Lys Pro Arg Ile Glu Val His Lys Leu Gly Ile 
            340                 345                 350         
Gly Gly Lys Glu Pro Pro Ala Arg Ile Val Phe Glu Gly Lys Ala Gly 
        355                 360                 365             
Ser Ala Lys Ala Leu Ser Leu Ile Asp Ile Gly Gly Arg Leu Arg Leu 
    370                 375                 380                 
Ile Ser Gln Asp Val Glu Cys Val Lys Pro Phe Gln Ser Met Pro Asn 
385                 390                 395                 400 
Leu Pro Val Ala Arg Thr Met Trp Arg Pro Ala Pro Ser Phe Leu Glu 
                405                 410                 415     
Gly Leu Glu Cys Trp Ile Val Ala Gly Gly Ala His His Thr Val Leu 
            420                 425                 430         
Ser Tyr Asp Ile Ser Asp Glu Ala Val Arg Ser Phe Ala Arg Ile Met 
        435                 440                 445             
Gly Ile Glu Leu Val Val Ile Asn Lys Asp Thr Thr Val Asn Gly Leu 
    450                 455                 460                 
Glu Arg Asp Ile Met Ile Gly Asp Val Ile Tyr Gly Arg 
465                 470                 475         

<210>  71
<211>  477
<212>  PRT
<213>  Ruminococcus flavefaciens

<400>  71
Met Lys Phe Trp Phe Val Thr Gly Ser Gln Phe Leu Tyr Gly Glu Glu 
1               5                   10                  15      
Thr Leu Arg Gln Val Glu Glu Asp Ser Lys Lys Ile Val Ala Gly Leu 
            20                  25                  30          
Lys Leu Pro Phe Pro Val Glu Tyr Lys Leu Thr Val Lys Lys Glu Ala 
        35                  40                  45              
Glu Ile Thr Lys Ile Ile Lys Glu Ala Asn Tyr Asp Asp Glu Cys Ala 
    50                  55                  60                  
Gly Ile Ile Thr Phe Cys His Thr Phe Ser Pro Ser Lys Met Trp Ile 
65                  70                  75                  80  
Asn Gly Leu Arg Ser Leu Gln Lys Pro Trp Leu His Phe His Thr Gln 
                85                  90                  95      
Phe Asn Asp Asn Ile Pro Asn Asp Ala Ile Asp Met Asp Tyr Met Asn 
            100                 105                 110         
Leu His Gln Ser Ala His Gly Asp Arg Glu His Gly Phe Ile Gly Ala 
        115                 120                 125             
Arg Leu Arg Met Pro Arg Ala Val Val Ala Gly His Trp Ala Asp Pro 
    130                 135                 140                 
Ala Val Gln Glu Lys Ile Ala Asp Trp Met Arg Ala Ala Val Gly Val 
145                 150                 155                 160 
Gln Phe Ser Lys Ser Leu Lys Ile Val Arg Phe Gly Asp Asn Met Arg 
                165                 170                 175     
Glu Val Ala Val Thr Glu Gly Asp Lys Ile Glu Ala Gln Ile Lys Leu 
            180                 185                 190         
Gly Trp Gln Val Asn Thr Phe Ala Val Gly Asp Leu Val Gln Ile Met 
        195                 200                 205             
Asn Ala Val Thr Asp Ala Glu Ile Asp Ala Leu Met Lys Glu Tyr Ala 
    210                 215                 220                 
Glu Leu Tyr Asp Phe Asp Lys Ala Asp Glu Glu Cys Ile Arg Tyr Gln 
225                 230                 235                 240 
Ala Arg Glu Glu Ile Ala Ile Glu Lys Ile Leu Val Arg Glu Gly Ala 
                245                 250                 255     
Met Ala Phe Ser Asn Thr Phe Glu Asp Leu His Gly Met Lys Gln Leu 
            260                 265                 270         
Pro Gly Leu Ala Thr Gln His Leu Met His Lys Gly Tyr Gly Phe Gly 
        275                 280                 285             
Ala Glu Gly Asp Trp Lys Thr Ala Gly Met Thr Ala Ile Ile Lys Ala 
    290                 295                 300                 
Met Tyr Pro Asp Gly Asn Thr Ser Phe Met Glu Asp Tyr Asn Tyr Asp 
305                 310                 315                 320 
Tyr Lys His Glu Leu Ile Leu Gly Ala His Met Leu Glu Val Cys Pro 
                325                 330                 335     
Ser Ile Ala Ala Gly Arg Pro Arg Ile Glu Val His Pro Leu Gly Ile 
            340                 345                 350         
Gly Gly Lys Asp Ala Pro Ala Arg Ile Val Phe Glu Gly Lys Ala Gly 
        355                 360                 365             
Ser Ala Lys Ala Ile Ser Leu Ile Asp Ile Gly Gly Arg Leu Arg Leu 
    370                 375                 380                 
Ile Ala Gln Asp Val Glu Cys Glu Lys Pro Phe Gln Thr Met Pro Asn 
385                 390                 395                 400 
Leu Pro Val Ala Arg Thr Met Trp Lys Pro Ala Pro Ser Phe Leu Glu 
                405                 410                 415     
Gly Leu Glu Cys Trp Ile Ile Ala Gly Gly Ala His His Thr Val Leu 
            420                 425                 430         
Ser Tyr Asp Ile Ser Asp Glu Thr Val Arg Asp Phe Ala Arg Ile Met 
        435                 440                 445             
Gly Ile Glu Leu Val Val Ile Asn Lys Asn Thr Asn Lys Tyr Gln Leu 
    450                 455                 460                 
Glu Arg Asp Met Met Ile Gly Asp Val Ile Tyr Gly Arg 
465                 470                 475         

