               SEQUENCE LISTING



<110> DSM IP ASSETS B.V.

<120> METHODS OF PRODUCING HMO BLEND PROFILES WITH LNFP-I AND 2’-FL AS THE PREDOMINANT COMPOUNDS

<130> 34030-WO-PCT2

<150> PA202170247
<151> 2021-05-17

<150> PA202170390
<151> 2021-07-21

<160> 52

<170> BiSSAP 1.3.6

<210> 1

<211> 332

<212> PRT

<213> Neisseria meningitidis





<220> 

<223> lgtA, β-1,3-N-acetyl-glucosaminyltransferase, GeneBank ID:

      WP_033911473.1



<400> 1

Met Gln Pro Leu Val Ser Val Leu Ile Cys Ala Tyr Asn Val Glu Lys 

1               5                   10                  15      

Tyr Phe Ala Gln Ser Leu Ala Ala Val Val Asn Gln Thr Trp Arg Asn 

            20                  25                  30          

Leu Glu Ile Leu Ile Val Asp Asp Gly Ser Thr Asp Gly Thr Leu Ala 

        35                  40                  45              

Ile Ala Lys Asp Phe Gln Lys Arg Asp Ser Arg Ile Lys Ile Leu Ala 

    50                  55                  60                  

Gln Ala Gln Asn Ser Gly Leu Ile Pro Ser Leu Asn Ile Gly Leu Asp 

65                  70                  75                  80  

Glu Leu Ala Lys Ser Gly Met Gly Glu Tyr Ile Ala Arg Thr Asp Ala 

                85                  90                  95      

Asp Asp Ile Ala Ala Pro Asp Trp Ile Glu Lys Ile Val Gly Glu Met 

            100                 105                 110         

Glu Lys Asp Arg Ser Ile Ile Ala Met Gly Ala Trp Leu Glu Val Leu 

        115                 120                 125             

Ser Glu Glu Lys Asp Gly Asn Arg Leu Ala Arg His His Arg His Gly 

    130                 135                 140                 

Lys Ile Trp Lys Lys Pro Thr Arg Pro Glu Asp Ile Ala Asp Phe Phe 

145                 150                 155                 160 

Pro Phe Gly Asn Pro Ile His Asn Asn Thr Met Ile Met Arg Arg Ser 

                165                 170                 175     

Val Ile Asp Gly Gly Leu Arg Tyr Asn Thr Glu Arg Asp Trp Ala Glu 

            180                 185                 190         

Asp Tyr Gln Phe Trp Tyr Asp Val Ser Lys Leu Gly Arg Leu Ala Tyr 

        195                 200                 205             

Tyr Pro Glu Ala Leu Val Lys Tyr Arg Leu His Ala Asn Gln Val Ser 

    210                 215                 220                 

Ser Lys Tyr Ser Ile Arg Gln His Glu Ile Ala Gln Gly Ile Gln Lys 

225                 230                 235                 240 

Thr Ala Arg Asn Asp Phe Leu Gln Ser Met Gly Phe Lys Thr Arg Phe 

                245                 250                 255     

Asp Ser Leu Glu Tyr Arg Gln Ile Lys Ala Val Ala Tyr Glu Leu Leu 

            260                 265                 270         

Glu Lys His Leu Pro Glu Glu Asp Phe Glu Arg Ala Arg Arg Phe Leu 

        275                 280                 285             

Tyr Gln Cys Phe Lys Arg Thr Asp Thr Leu Pro Ala Gly Ala Trp Leu 

    290                 295                 300                 

Asp Phe Ala Ala Asp Gly Arg Met Arg Arg Leu Phe Thr Leu Arg Gln 

305                 310                 315                 320 

Tyr Phe Gly Ile Leu His Arg Leu Leu Lys Asn Arg 

                325                 330         



<210> 2

<211> 334

<212> PRT

<213> Pasteurella multocida





<220> 

<223> pmnagT, β-1,3-N-acetylglucosaminyl-transferase, GeneBankID:

      WP_014390683.1



<400> 2

Met Glu Asn Lys Pro Leu Val Ser Val Leu Ile Cys Ala Tyr Asn Val 

1               5                   10                  15      

Glu Lys Tyr Ile Glu Glu Cys Ile Asn Ala Val Ile Asn Gln Thr Tyr 

            20                  25                  30          

Lys Asn Leu Glu Ile Ile Ile Val Asn Asp Gly Ser Ser Asp Asn Thr 

        35                  40                  45              

Tyr Phe Leu Leu Lys Lys Leu Ala Glu Lys Asp Asn Arg Ile Lys Ile 

    50                  55                  60                  

Leu Asn Phe Asn Asn His Ile Gly Ile Ile Ser Ala Leu Asn Glu Gly 

65                  70                  75                  80  

Leu Lys Glu Ile Ala Gly Glu Tyr Ile Ala Arg Thr Asp Ser Asp Asp 

                85                  90                  95      

Ile Thr Lys Pro Asp Trp Ile Glu Lys Ile Leu Thr Cys Met Gln Asn 

            100                 105                 110         

Asp Pro Lys Ile Ile Ala Met Gly Ser Tyr Leu Thr Val Leu Ser Glu 

        115                 120                 125             

Glu Asn Asn Gly Ser Val Leu Ala Asn His His Lys Asn Lys Val Glu 

    130                 135                 140                 

Trp Lys Asn Pro Leu Glu His Lys Asp Ile Val Glu Lys Met Leu Phe 

145                 150                 155                 160 

Gly Asn Pro Ile His Asn Asn Ser Met Val Met Arg Ser Glu Ile Tyr 

                165                 170                 175     

Thr Lys Tyr His Leu Ile Tyr Asp Pro Asp Tyr His Tyr Ala Glu Asp 

            180                 185                 190         

Tyr Lys Phe Trp Leu Glu Val Ser Arg Ile Gly Lys Leu Ala Asn Tyr 

        195                 200                 205             

Pro Glu Ser Leu Val Tyr Tyr Arg Leu His Arg Asn Gln Thr Ser Ser 

    210                 215                 220                 

Ile His Asn Ser Gln Gln Glu Ile Asn Gly Lys Lys Leu Arg Leu Gln 

225                 230                 235                 240 

Ala Leu Asn Tyr Tyr Leu Lys Asp Leu Gly Ile Asp Tyr Gln Leu Pro 

                245                 250                 255     

Glu Lys Phe Leu Phe Lys Asp Ile Ala Leu Leu Gln Glu Ile Phe Tyr 

            260                 265                 270         

Glu Arg Gly Met Phe Arg Glu Asn Ile Ile Arg Arg Ile Ile Tyr Glu 

        275                 280                 285             

Cys Tyr Leu Ser Leu Gly Glu Tyr Asn Tyr Lys Asp Ile Tyr Tyr Phe 

    290                 295                 300                 

Leu Ile Asn Lys Asn Asn Phe Leu Ser Ile Lys Asp Lys Phe Lys Ile 

305                 310                 315                 320 

Ile Lys Lys Tyr Leu Arg Pro Asp Lys Tyr Ser Ser Thr Tyr 

                325                 330                 



<210> 3

<211> 330

<212> PRT

<213> Haemophilus ducreyi





<220> 

<223> HD9466, glycosyltransferase family 2 protein, GeneBank ID:

      WP_010944479.1



<400> 3

Met Thr Thr Leu Val Ser Val Leu Ile Cys Ala Tyr Asn Val Glu Lys 

1               5                   10                  15      

Tyr Ile Asp Glu Cys Leu Asn Ala Val Ile Ala Gln Thr Tyr Lys Asn 

            20                  25                  30          

Leu Glu Ile Ile Val Val Asn Asp Gly Ser Thr Asp Gly Thr Leu Ala 

        35                  40                  45              

Lys Leu Arg Gln Phe Glu Ala Lys Asp Pro Arg Val Lys Ile Ile Asp 

    50                  55                  60                  

Asn Ile Val Asn Gln Gly Thr Ser Lys Ser Leu Asn Ile Gly Ile Gln 

65                  70                  75                  80  

Tyr Cys Gln Gly Glu Ile Ile Ala Arg Thr Asp Ser Asp Asp Ile Val 

                85                  90                  95      

Asp Ile His Trp Ile Glu Thr Leu Met Arg Glu Leu Asp Asn Ser Pro 

            100                 105                 110         

Glu Thr Ile Ala Ile Ser Ala Tyr Leu Glu Phe Leu Ala Glu Lys Gly 

        115                 120                 125             

Asn Gly Ser Lys Leu Ser Arg Ser Arg Lys His Gly Lys Asn Ala Glu 

    130                 135                 140                 

Asn Pro Ile Ser Ser Glu Ala Ile Ser Gln Arg Met Leu Phe Gly Asn 

145                 150                 155                 160 

Pro Val His Asn Asn Val Ala Leu Val Arg Arg Lys Val Phe Ser Glu 

                165                 170                 175     

Tyr Gly Leu Arg Phe Asp Pro Asp Tyr Ile His Ala Glu Asp Tyr Lys 

            180                 185                 190         

Phe Trp Phe Glu Val Ser Lys Leu Gly Lys Met Arg Thr Tyr Pro Lys 

        195                 200                 205             

Ala Leu Val Lys Tyr Arg Leu His Ala Thr Gln Val Ser Ser Ala Tyr 

    210                 215                 220                 

Asn Gln Lys Gln Arg Ser Ile Ala Lys Lys Ile Lys Arg Glu Ala Ile 

225                 230                 235                 240 

Ser His Tyr Leu Gln Gln Tyr Gly Ile Gln Leu Pro Glu Lys Leu Thr 

                245                 250                 255     

Ile His Asp Leu Phe Ser Ile Phe Ser Pro Gln Ile Glu Leu Ser Leu 

            260                 265                 270         

Thr Val Ala Asn Lys Gln Glu Leu Phe Trp Ser Leu Ala Thr Ser Leu 

        275                 280                 285             

Ser Glu Tyr His Phe Arg Asp Leu Leu Lys Ile Tyr Ser Leu Asp Ile 

    290                 295                 300                 

Phe His Gln Leu Ser Phe Lys Tyr Lys Lys Arg Ile Phe Arg Lys Phe 

305                 310                 315                 320 

Leu Leu Pro Asn Arg Tyr Pro Ser Val Ile 

                325                 330 



<210> 4

<211> 439

<212> PRT

<213> Artificial Sequence





<220> 

<223> galTK, β-1,3-galactosyltransferase, homologous to GeneBank ID:

      BD182026.1



<400> 4

Met Ile Ser Val Tyr Ile Ile Ser Leu Lys Glu Ser Gln Arg Arg Leu 

1               5                   10                  15      

Asp Thr Glu Lys Leu Val Leu Glu Ser Asn Glu Lys Phe Lys Gly Arg 

            20                  25                  30          

Cys Val Phe Gln Ile Phe Asp Ala Ile Ser Pro Lys His Glu Asp Phe 

        35                  40                  45              

Glu Lys Phe Val Gln Glu Leu Tyr Asp Ser Ser Ser Leu Leu Lys Ser 

    50                  55                  60                  

Asp Trp Phe His Ser Asp Tyr Cys Tyr Gln Glu Leu Leu Pro Gln Glu 

65                  70                  75                  80  

Phe Gly Cys Tyr Leu Ser His Tyr Leu Leu Trp Lys Glu Cys Val Lys 

                85                  90                  95      

Leu Asn Gln Pro Val Val Ile Leu Glu Asp Asp Val Ala Leu Glu Ser 

            100                 105                 110         

Asn Phe Met Gln Ala Leu Glu Asp Cys Leu Lys Ser Pro Phe Asp Phe 

        115                 120                 125             

Val Arg Leu Tyr Gly His Tyr Trp Gly Gly His Lys Thr Asn Leu Cys 

    130                 135                 140                 

Ala Leu Pro Val Tyr Thr Glu Thr Glu Glu Ala Glu Ala Ser Ile Glu 

145                 150                 155                 160 

Lys Thr Pro Ile Glu Asn Tyr Glu Val Thr Ser Pro Pro Pro Pro Asn 

                165                 170                 175     

Pro Thr Arg Asp Thr Gln Gln Asp Phe Ile Thr Glu Thr Gln Gln Asp 

            180                 185                 190         

Pro Lys Glu Leu Ser Glu Pro Cys Lys Ile Ala Pro Gln Lys Ile Ser 

        195                 200                 205             

Phe Asn Gln Val Val Phe Lys Lys Ile Lys Arg Lys Leu Asn Arg Phe 

    210                 215                 220                 

Ile Gly Ser Ile Leu Ala Arg Thr Glu Val Tyr Lys Asn Ile Val Ala 

225                 230                 235                 240 

Lys Tyr Asp Asp Leu Thr Thr Lys Tyr Asp Asp Leu Thr Thr Lys Tyr 

                245                 250                 255     

Asp Asp Leu Thr Thr Lys Tyr Asp Asp Leu Thr Thr Lys Tyr Asp Asp 

            260                 265                 270         

Leu Asn Lys Asn Ile Ala Glu Lys Tyr Asp Glu Leu Met Gly Lys Tyr 

        275                 280                 285             

Glu Ser Leu Leu Ala Lys Glu Val Asn Ile Lys Glu Thr Phe Trp Glu 

    290                 295                 300                 

Ser Arg Ala Asp Ser Glu Lys Glu Ala Leu Phe Leu Asp His Phe Tyr 

305                 310                 315                 320 

Leu Thr Ser Val Tyr Val Ala Thr Thr Ala Gly Tyr Tyr Leu Thr Pro 

                325                 330                 335     

Lys Gly Ala Lys Thr Phe Ile Glu Ala Thr Glu Arg Phe Lys Ile Ile 

            340                 345                 350         

Glu Pro Val Asp Met Phe Ile Asn Asn Pro Thr Tyr His Asp Ile Ala 

        355                 360                 365             

Asn Phe Thr Tyr Val Pro Cys Pro Val Ser Leu Asn Lys His Ala Phe 

    370                 375                 380                 

Asn Ser Thr Ile Gln Asn Ala Lys Lys Pro Asp Ile Ser Leu Lys Pro 

385                 390                 395                 400 

Pro Lys Lys Ser Tyr Phe Asp Asn Leu Phe Tyr His Lys Phe Asn Ala 

                405                 410                 415     

Arg Lys Cys Leu Lys Ala Phe Asn Lys Tyr Ser Lys Gln Tyr Ala Pro 

            420                 425                 430         

Leu Lys Thr Pro Lys Glu Val 

        435                 



<210> 5

<211> 262

<212> PRT

<213> Chromobacterium violaceum





<220> 

<223> cvb3galT, β-1,3-galactosyltransferase, GeneBank ID:

      WP_080969100.1



<400> 5

Met Asp Thr Ile Met Ile Lys Arg Pro Leu Val Ser Val Ile Leu Pro 

1               5                   10                  15      

Val Asn Lys Asn Asn Pro His Leu Glu Glu Ala Ile Gln Ser Ile Lys 

            20                  25                  30          

Asn Gln Thr Tyr Lys Glu Leu Glu Leu Ile Ile Ile Ala Asn Asn Cys 

        35                  40                  45              

Glu Asp Asn Phe Tyr Ser Leu Leu Leu Lys Tyr Gln Asp Gln Lys Thr 

    50                  55                  60                  

Lys Ile Ile Arg Thr Ser Ile Lys Tyr Leu Pro Phe Ser Leu Asn Leu 

65                  70                  75                  80  

Gly Val His Leu Ser Gln Gly Glu Tyr Ile Ala Arg Met Asp Ser Asp 

                85                  90                  95      

Asp Ile Ser Val Leu Asp Arg Ile Glu Lys Gln Val Lys Arg Phe Leu 

            100                 105                 110         

Asn Thr Pro Glu Leu Ser Ile Leu Gly Ser Asn Val Glu Tyr Ile Asn 

        115                 120                 125             

Glu Ala Ser Glu Ser Ile Gly Tyr Ser Asn Tyr Pro Leu Asp His Ser 

    130                 135                 140                 

Ser Ile Val Asn Ser Phe Pro Phe Arg Cys Asn Leu Ala His Pro Thr 

145                 150                 155                 160 

Ile Met Val Lys Lys Glu Val Ile Thr Thr Leu Gly Gly Tyr Met Tyr 

                165                 170                 175     

Gly Ser Leu Ser Glu Asp Tyr Asp Leu Trp Ile Arg Ala Ser Arg His 

            180                 185                 190         

Gly Asn Phe Lys Phe Ser Asn Ile Asp Glu Pro Leu Leu Lys Tyr Arg 

        195                 200                 205             

Ile His Lys Gly Gln Ala Thr Asn Lys Ser Asn Ala Tyr Asn Ile Phe 

    210                 215                 220                 

Ala Phe Asp Ser Ser Leu Lys Ile Arg Glu Phe Leu Leu Asn Gly Asn 

225                 230                 235                 240 

Val Gln Tyr Leu Leu Gly Ala Ala Arg Gly Phe Phe Ala Phe Leu Tyr 

                245                 250                 255     

Val Arg Phe Ile Lys Lys 

            260         



<210> 6

<211> 302

<212> PRT

<213> Artificial Sequence





<220> 

<223> FutC, α-1,2-fucosyltransferase, homolouge to GeneBank ID:

      WP_080473865.1



<400> 6

Met Ala Phe Lys Val Val Gln Ile Cys Gly Gly Leu Gly Asn Gln Met 

1               5                   10                  15      

Phe Gln Tyr Ala Phe Ala Lys Ser Leu Gln Lys His Ser Asn Thr Pro 

            20                  25                  30          

Val Leu Leu Asp Ile Thr Ser Phe Asp Trp Ser Asp Arg Lys Met Gln 

        35                  40                  45              

Leu Glu Leu Phe Pro Ile Asp Leu Pro Tyr Ala Ser Ala Lys Glu Ile 

    50                  55                  60                  

Ala Ile Ala Lys Met Gln His Leu Pro Lys Leu Val Arg Asp Ala Leu 

65                  70                  75                  80  

Lys Cys Met Gly Phe Asp Arg Val Ser Gln Glu Ile Val Phe Glu Tyr 

                85                  90                  95      

Glu Pro Lys Leu Leu Lys Pro Ser Arg Leu Thr Tyr Phe Phe Gly Tyr 

            100                 105                 110         

Phe Gln Asp Pro Arg Tyr Phe Asp Ala Ile Ser Pro Leu Ile Lys Gln 

        115                 120                 125             

Thr Phe Thr Leu Pro Pro Pro Pro Glu Asn Asn Lys Asn Asn Asn Lys 

    130                 135                 140                 

Lys Glu Glu Glu Tyr Gln Cys Lys Leu Ser Leu Ile Leu Ala Ala Lys 

145                 150                 155                 160 

Asn Ser Val Phe Val His Ile Arg Arg Gly Asp Tyr Val Gly Ile Gly 

                165                 170                 175     

Cys Gln Leu Gly Ile Asp Tyr Gln Lys Lys Ala Leu Glu Tyr Met Ala 

            180                 185                 190         

Lys Arg Val Pro Asn Met Glu Leu Phe Val Phe Cys Glu Asp Leu Glu 

        195                 200                 205             

Phe Thr Gln Asn Leu Asp Leu Gly Tyr Pro Phe Met Asp Met Thr Thr 

    210                 215                 220                 

Arg Asp Lys Glu Glu Glu Ala Tyr Trp Asp Met Leu Leu Met Gln Ser 

225                 230                 235                 240 

Cys Gln His Gly Ile Ile Ala Asn Ser Thr Tyr Ser Trp Trp Ala Ala 

                245                 250                 255     

Tyr Leu Ile Glu Asn Pro Glu Lys Ile Ile Ile Gly Pro Lys His Trp 

            260                 265                 270         

Leu Phe Gly His Glu Asn Ile Leu Cys Lys Glu Trp Val Lys Ile Glu 

        275                 280                 285             

Ser His Phe Glu Val Lys Ser Gln Lys Tyr Asn Ala Leu Gly 

    290                 295                 300         



<210> 7

<211> 292

<212> PRT

<213> Methylobacter tundripaludum





<220> 

<223> Mtun, α-1,2-fucosyltransferase, GeneBank ID: WP_031437198.1



<400> 7

Met Val Ile Thr His Leu Ile Gly Gly Leu Gly Asn Gln Met Phe Gln 

1               5                   10                  15      

Tyr Ala Ala Gly Arg Ala Val Ser Leu Glu Arg Gly Val Ser Leu Ser 

            20                  25                  30          

Leu Asp Ile Ser Gly Phe Ala Asn Tyr Gly Leu His Gln Gly Phe Glu 

        35                  40                  45              

Leu Gln Arg Ile Phe Asn Cys Thr Ala Glu Ile Ala Asn Glu Ala Asp 

    50                  55                  60                  

Val Arg Gly Ile Leu Gly Trp Gln Ser Ser Pro Arg Ile Arg Gln Leu 

65                  70                  75                  80  

Leu Ser Arg Gln Asn Met Ala Ile Phe Arg Arg Glu Gly Phe Val Val 

                85                  90                  95      

Glu Pro His Phe His Tyr Trp Gln Gly Ile Lys Ser Val Pro Arg Asp 

            100                 105                 110         

Cys Tyr Leu Thr Gly Tyr Trp Gln Ser Glu Gln Tyr Phe Leu Glu Ala 

        115                 120                 125             

Ala Ala Gln Ile Arg Ala Asp Phe Thr Phe Lys Leu Pro Leu Asp Asn 

    130                 135                 140                 

Gln Asn Ile Glu Leu Ala Lys Gln Ile Asn Ala Val Asn Ala Val Ser 

145                 150                 155                 160 

Leu His Val Arg Arg Gly Asp Tyr Ala Asn Thr Pro Glu Thr Thr Ala 

                165                 170                 175     

Thr His Gly Leu Cys Ser Leu Asp Tyr Tyr Arg Val Ala Ile Arg His 

            180                 185                 190         

Ile Ala Glu Gln Val Gln Gln Pro His Phe Phe Val Phe Ser Asp Asp 

        195                 200                 205             

Ile Ala Trp Val Lys Asn Asn Leu Ser Ile Asp Phe Pro Cys Gln Tyr 

    210                 215                 220                 

Val Asp His Asn Gln Gly Ala Glu Ser Tyr Asn Asp Met Arg Leu Met 

225                 230                 235                 240 

Ser Met Cys Arg His His Ile Ile Ala Asn Ser Ser Phe Ser Trp Trp 

                245                 250                 255     

Gly Ala Trp Leu Asn Pro Asn Val Asn Lys Ile Val Val Ala Pro Ser 

            260                 265                 270         

Arg Trp Phe Ala Lys Gln Thr Asp Val Arg Asp Leu Leu Pro Gln Gly 

        275                 280                 285             

Trp Ile Lys Gln 

    290         



<210> 8

<211> 292

<212> PRT

<213> Sulfuriflexus mobilis





<220> 

<223> Smob, α-1,2-fucosyltransferase, GeneBank ID: WP_126455392.1



<400> 8

Met Ile Ile Ser Gln Ile Ile Gly Gly Leu Gly Asn Gln Met Phe Gln 

1               5                   10                  15      

Tyr Ala Ala Gly Arg Ala Leu Ser Leu Val Arg Gly Gln Pro Leu Leu 

            20                  25                  30          

Leu Asp Val Thr Gly Phe Ala Gly Tyr Gly Leu His Gln Gly Phe Glu 

        35                  40                  45              

Leu Gln Arg Val Phe Asp Cys Pro Ile Gly Ile Ala Thr Glu Glu Asp 

    50                  55                  60                  

Val Arg Gly Ile Leu Gly Trp Gln Phe Ser Ala Gly Ile Arg Arg Ile 

65                  70                  75                  80  

Val Ala Arg Pro Gly Met Ala Ala Phe Arg Arg Lys Gly Phe Ile Val 

                85                  90                  95      

Glu Pro His Phe His Tyr Trp Pro Glu Ile Lys Asn Val Pro Arg Asp 

            100                 105                 110         

Cys Tyr Leu Leu Gly Tyr Trp Gln Ser Glu Arg Tyr Phe Arg Ala Ala 

        115                 120                 125             

Thr Ala Asp Ile Arg Ala Asp Phe Ser Phe Lys Ser Pro Leu Val Asn 

    130                 135                 140                 

Arg Asn Ala Glu Thr Ala Ala Gln Ile Asp Gln Val Asn Ala Ile Ser 

145                 150                 155                 160 

Leu His Met Arg Arg Gly Asp Tyr Val Asn Asn Pro Lys Thr Ser Ala 

                165                 170                 175     

Thr His Gly Leu Cys Ser Leu Asp Tyr Tyr Gln Ala Ala Ile Lys Phe 

            180                 185                 190         

Val Ser Glu Arg Val Glu Glu Pro Phe Phe Phe Ile Phe Ser Asp Asp 

        195                 200                 205             

Ile Ala Trp Val Lys Ala Asn Leu Lys Leu Asp Phe Pro Cys Gln Tyr 

    210                 215                 220                 

Val Asp His Asn His Gly Ala Glu Ser Phe Asn Asp Met His Leu Met 

225                 230                 235                 240 

Ser Leu Cys Gln His His Ile Ile Ala Asn Ser Ser Phe Ser Trp Trp 

                245                 250                 255     

Gly Ala Trp Leu Asn Ser Asp Pro Lys Lys Ile Val Leu Ala Pro Lys 

            260                 265                 270         

Lys Trp Phe Ala Asn Lys Asn Asn Ile Lys Asp Leu Phe Pro Pro Gly 

        275                 280                 285             

Trp Val Ser Leu 

    290         



<210> 9

<211> 203

<212> DNA

<213> Artificial Sequence





<220> 

<223> PmglB_70UTR, variant of E. coli promoter for mglBAC

      galactose/methyl-galactosidade transporter



<400> 9

tgcgtcgcca ttctgtcgca acacgccaga atgcggcggc gatcactaac tcaacaaatc     60



aggcgatgta accgctttca atctgtgagt gatttcacag tatcttaaca atgtgatagc    120



tatgattgca ccgtgcctac aagcatcgtg gaggtccgtg actttcacgc atacaacaaa    180



cattaaccaa ggaggaaaca gct                                            203





<210> 10

<211> 203

<212> DNA

<213> Artificial Sequence





<220> 

<223> PmglB_70UTR_SD4, variant of E. coli promoter for mglBAC;

      galactose/methyl-galactosidade transporter



<400> 10

tgcgtcgcca ttctgtcgca acacgccaga atgcggcggc gatcactaac tcaacaaatc     60



aggcgatgta accgctttca atctgtgagt gatttcacag tatcttaaca atgtgatagc    120



tatgattgca ccgtgcctac aagcatcgtg gaggtccgtg actttcacgc atacaacaaa    180



cattaaccaa ctaggaaaca gct                                            203





<210> 11

<211> 152

<212> DNA

<213> Klebsiella pneumoniae





<220> 

<223> Promoter for scrYA sucrose genes



<400> 11

ggttaacggc ccactttgct ggcgacatca caattcttaa accggtttag caatttttat     60



tttcaccgcg ttaccgacat gtttaccata tcaactaaac cggtttagca aacattagca    120



cactcactga tttacctttg gatgtcacca ac                                  152





<210> 12

<211> 291

<212> DNA

<213> Artificial Sequence





<220> 

<223> PgatY_70UTR, variant of E. coli promoter for gatYZABCD;

      tagatose-1,6-bisP aldolase



<400> 12

cggcaaccta tgcctgatgc gacgctgaag cgtcttatca tgcctacata gcactgccac     60



gtatgtttac accgcatccg gcataaaaac acgcgcactt tgctacggct tccctatcgg    120



gaggccgttt ttttgccttt cactcctcga ataattttca tattgtcgtt tttgtgatcg    180



ttatctcgat atttaaaaac aaataatttc attatatttt gtgcctacaa gcatcgtgga    240



ggtccgtgac tttcacgcat acaacaaaca ttaaccaagg aggaaacagc t             291





<210> 13

<211> 300

<212> DNA

<213> Escherichia coli





<220> 

<223> PglpF, E. Coli promoter sequence of glpFKX operon



<400> 13

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgataa gtttacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaagga ggaaacagct    300





<210> 14

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_SD1, variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 14

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgataa gtttacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaaatt cgaaacagct    300





<210> 15

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_SD10, variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 15

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgataa gtttacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaactg agaaacagct    300





<210> 16

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_SD2, variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 16

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgataa gtttacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaagcg caaaacagct    300





<210> 17

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_SD3 variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 17

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgataa gtttacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaagaa caaaacagct    300





<210> 18

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_SD4, variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 18

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgataa gtttacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaacta ggaaacagct    300





<210> 19

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_SD5, variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 19

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgataa gtttacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaaccg agaaacagct    300





<210> 20

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_SD6, variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 20

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgataa gtttacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaagag ctaaacagct    300





<210> 21

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_SD7, variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 21

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgataa gtttacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaagag caaaacagct    300





<210> 22

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_SD8, variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 22

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgataa gtttacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaagag aaaaacagct    300





<210> 23

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_SD9, variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 23

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgataa gtttacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaaagg aaaaacagct    300





<210> 24

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_B29, variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 24

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgattt aattacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaagga ggaaacagct    300





<210> 25

<211> 300

<212> DNA

<213> Artificial Sequence





<220> 

<223> PglpF_B29, variant of PglpF E. Coli promoter sequence of glpFKX

      operon



<400> 25

gcggcacgcc ttgcagatta cggtttgcca cacttttcat ccttctcctg gtgacataat     60



ccacatcaat cgaaaatgtt aataaatttg ttgcgcgaat gatctaacaa acatgcatca    120



tgtacaatca gatggaataa atggcgcgat aacgctcatt ttatgacgag gcacacacat    180



tttaagttcg atatttctcg tttttgctcg ttaacgatca gaatacagca tgcctacaag    240



catcgtggag gtccgtgact ttcacgcata caacaaacat taaccaagga ggaaacagct    300





<210> 26

<211> 107

<212> DNA

<213> Artificial Sequence





<220> 

<223> Plac_16UTR, variant of E. coli lac operon promoter



<400> 26

tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat     60



gttgtgtgga attgtgagcg gataacaatt tcaaggagga aacagct                  107





<210> 27

<211> 107

<212> DNA

<213> Escherichia coli





<220> 

<223> Plac, lac operon promoter



<400> 27

tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat     60



gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagct                  107





<210> 28

<211> 387

<212> PRT

<213> Rouxiella badensis





<220> 

<223> Bad MFS transporter, GeneBank ID: WP_017489914.1



<400> 28

Met Ser Ser Arg Arg Leu Ser Ile Ile Phe Ala Thr Phe Leu Leu Val 

1               5                   10                  15      

Ser Phe Leu Thr Gly Ile Ala Gly Ala Leu Gln Ala Pro Thr Leu Ser 

            20                  25                  30          

Leu Phe Leu Thr Asn Glu Val Lys Val Arg Pro Leu Trp Val Gly Leu 

        35                  40                  45              

Phe Tyr Thr Val Asn Ala Leu Gly Gly Ile Val Ile Ser Phe Leu Leu 

    50                  55                  60                  

Ala Asn Tyr Ser Asp Lys Lys Gly Asp Arg Arg Lys Leu Leu Phe Phe 

65                  70                  75                  80  

Cys Thr Leu Met Ala Ile Gly Asn Ser Leu Ile Phe Ala Tyr Ser Arg 

                85                  90                  95      

Asp Tyr Leu Val Leu Ile Ser Val Gly Val Leu Leu Ala Ala Ile Gly 

            100                 105                 110         

Asn Ala Ser Met Pro Gln Leu Phe Ala Leu Ala Arg Glu Tyr Ala Asp 

        115                 120                 125             

Arg Ser Ala His Glu Val Val Met Phe Ser Ser Met Met Arg Ala Thr 

    130                 135                 140                 

Leu Ser Leu Ala Trp Val Leu Gly Pro Pro Ile Ser Phe Thr Leu Ala 

145                 150                 155                 160 

Leu Asn Tyr Gly Phe Thr Leu Met Tyr Leu Cys Ala Ala Gly Val Phe 

                165                 170                 175     

Ile Phe Ser Ala Leu Met Val Trp Phe Phe Leu Pro Ser Val Gly Arg 

            180                 185                 190         

Ile Glu Gln Pro Val Asp Lys Val Val Val His Val Ser Ala Trp Lys 

        195                 200                 205             

Asn Arg Asp Val Arg Leu Leu Phe Phe Ala Ser Leu Leu Met Trp Thr 

    210                 215                 220                 

Cys Asn Ile Met Tyr Ile Ile Asp Met Pro Leu Tyr Ile Thr Ser Asp 

225                 230                 235                 240 

Leu Gly Leu Pro Glu Gly Leu Ala Gly Leu Leu Met Gly Ala Ala Ala 

                245                 250                 255     

Gly Leu Glu Ile Pro Val Met Leu Ile Ala Gly Tyr Leu Val Lys Arg 

            260                 265                 270         

Thr Gly Lys Arg Arg Leu Met Leu Cys Ala Ala Val Phe Gly Ile Leu 

        275                 280                 285             

Phe Tyr Leu Gly Leu Val Leu Phe Gln Phe Lys Ala Ala Leu Met Ile 

    290                 295                 300                 

Leu Gln Leu Phe Asn Ala Ile Phe Ile Gly Ile Ile Ala Gly Ile Gly 

305                 310                 315                 320 

Met Leu Tyr Phe Gln Asp Leu Met Pro Gly Arg Ala Gly Ser Ala Thr 

                325                 330                 335     

Thr Leu Phe Thr Asn Ser Ile Ser Thr Gly Ala Ile Leu Ala Gly Val 

            340                 345                 350         

Ile Gln Gly Thr Ile Val Gln Asn Phe Gly His Tyr Gln Val Tyr Trp 

        355                 360                 365             

Met Ala Leu Ala Leu Ala Val Gly Ala Leu Val Leu Met Thr Arg Val 

    370                 375                 380                 

Lys Asn Val 

385         



<210> 29

<211> 394

<212> PRT

<213> Rosenbergiella nectarea





<220> 

<223> Nec MFS transporter, GeneBank ID: WP_092672081.1



<400> 29

Met Gln Ser Phe Thr Pro Pro Ala Pro Lys Gly Gly Asn Pro Val Phe 

1               5                   10                  15      

Met Met Phe Met Leu Val Thr Phe Phe Val Ser Ile Ala Gly Ala Leu 

            20                  25                  30          

Gln Ala Pro Thr Leu Ser Leu Tyr Leu Ser Gln Glu Leu Ala Ala Lys 

        35                  40                  45              

Pro Phe Met Val Gly Leu Phe Phe Thr Ile Asn Ala Val Thr Gly Ile 

    50                  55                  60                  

Ile Ile Ser Phe Ile Leu Ala Lys Arg Ser Asp Arg Lys Gly Asp Arg 

65                  70                  75                  80  

Arg Arg Leu Leu Met Phe Cys Cys Ala Met Ala Ile Ala Asn Ala Leu 

                85                  90                  95      

Met Phe Ala Phe Val Arg Gln Tyr Val Val Leu Ile Thr Leu Gly Leu 

            100                 105                 110         

Ile Leu Ser Ala Leu Thr Ser Val Val Met Pro Gln Leu Phe Ala Leu 

        115                 120                 125             

Ala Arg Glu Tyr Ala Asp Arg Thr Gly Arg Glu Val Val Met Phe Ser 

    130                 135                 140                 

Ser Val Met Arg Thr Gln Met Ser Leu Ala Trp Val Ile Gly Pro Pro 

145                 150                 155                 160 

Ile Ser Phe Ala Leu Ala Leu Asn Tyr Gly Phe Ile Thr Leu Tyr Leu 

                165                 170                 175     

Val Ala Ala Ala Leu Phe Leu Leu Ser Leu Ile Leu Ile Lys Thr Thr 

            180                 185                 190         

Leu Pro Ser Val Pro Arg Leu Tyr Pro Ala Glu Asp Leu Ala Lys Ser 

        195                 200                 205             

Ala Ala Ser Gly Trp Lys Arg Thr Asp Val Arg Phe Leu Phe Ala Ala 

    210                 215                 220                 

Ser Val Leu Met Trp Val Cys Asn Leu Met Tyr Ile Ile Asp Met Pro 

225                 230                 235                 240 

Leu Tyr Ile Ser Lys Ser Leu Gly Met Pro Glu Ser Phe Ala Gly Val 

                245                 250                 255     

Leu Met Gly Thr Ala Ala Gly Leu Glu Ile Pro Val Met Leu Leu Ala 

            260                 265                 270         

Gly Tyr Leu Ala Lys Arg Val Gly Lys Arg Pro Leu Val Ile Val Ala 

        275                 280                 285             

Ala Val Cys Gly Leu Ala Phe Tyr Pro Ala Met Leu Val Phe His Gln 

    290                 295                 300                 

Gln Thr Gly Leu Leu Ile Ile Gln Leu Leu Asn Ala Val Phe Ile Gly 

305                 310                 315                 320 

Ile Val Ala Gly Leu Val Met Leu Trp Phe Gln Asp Leu Met Pro Gly 

                325                 330                 335     

Lys Ala Gly Ala Ala Thr Thr Leu Phe Thr Asn Ser Val Ser Thr Gly 

            340                 345                 350         

Met Ile Phe Ala Gly Leu Cys Gln Gly Leu Leu Ser Asp Leu Leu Gly 

        355                 360                 365             

His Gln Ala Ile Tyr Val Leu Ala Thr Val Leu Met Val Ile Ala Leu 

    370                 375                 380                 

Leu Leu Leu Leu Arg Val Lys Glu Gln Ala 

385                 390                 



<210> 30

<211> 393

<212> PRT

<213> Yersinia bercovieri





<220> 

<223> YberC MFS transporter, GeneBank ID: EEQ08298.1



<400> 30

Met Lys Ser Ala Leu Thr Phe Ser Arg Arg Ile Asn Pro Val Phe Leu 

1               5                   10                  15      

Ala Phe Phe Val Val Ala Phe Leu Ser Gly Ile Ala Gly Ala Leu Gln 

            20                  25                  30          

Ala Pro Thr Leu Ser Leu Phe Leu Ser Thr Glu Val Lys Val Arg Pro 

        35                  40                  45              

Leu Trp Val Gly Leu Phe Tyr Thr Val Asn Ala Ile Ala Gly Ile Thr 

    50                  55                  60                  

Val Ser Phe Ile Leu Ala Lys Arg Ser Asp Ser Arg Gly Asp Arg Arg 

65                  70                  75                  80  

Lys Leu Ile Met Val Cys Tyr Leu Met Ala Val Gly Asn Cys Leu Leu 

                85                  90                  95      

Phe Ala Phe Asn Arg Asp Tyr Leu Thr Leu Ile Thr Ala Gly Val Leu 

            100                 105                 110         

Leu Ala Ser Val Ala Asn Thr Ala Met Pro Gln Ile Phe Ala Leu Ala 

        115                 120                 125             

Arg Glu Tyr Ala Asp Ser Ser Ala Arg Glu Val Val Met Phe Ser Ser 

    130                 135                 140                 

Ile Met Arg Ala Gln Leu Ser Leu Ala Trp Val Ile Gly Pro Pro Leu 

145                 150                 155                 160 

Ser Phe Met Leu Ala Leu Asn Tyr Gly Phe Thr Leu Met Phe Ser Ile 

                165                 170                 175     

Ala Ala Gly Ile Phe Val Leu Ser Ala Leu Val Val Trp Phe Ile Leu 

            180                 185                 190         

Pro Ser Val Pro Arg Ala Glu Pro Val Val Asp Ala Pro Val Val Val 

        195                 200                 205             

Gln Gly Ser Leu Phe Ala Asp Lys Asn Val Leu Leu Leu Phe Ile Ala 

    210                 215                 220                 

Ser Met Leu Met Trp Thr Cys Asn Thr Met Tyr Ile Ile Asp Met Pro 

225                 230                 235                 240 

Leu Tyr Ile Thr Ala Ser Leu Gly Leu Pro Glu Arg Leu Ala Gly Leu 

                245                 250                 255     

Leu Met Gly Thr Ala Ala Gly Leu Glu Ile Pro Ile Met Leu Leu Ala 

            260                 265                 270         

Gly Tyr Ser Val Arg Tyr Phe Gly Lys Arg Lys Ile Met Leu Phe Ala 

        275                 280                 285             

Val Leu Ala Gly Val Leu Phe Tyr Thr Gly Leu Val Leu Phe Lys Phe 

    290                 295                 300                 

Lys Thr Ala Leu Met Leu Leu Gln Ile Phe Asn Ala Ile Phe Ile Gly 

305                 310                 315                 320 

Ile Val Ala Gly Ile Gly Met Leu Tyr Phe Gln Asp Leu Met Pro Gly 

                325                 330                 335     

Arg Ala Gly Ala Ala Thr Thr Leu Phe Thr Asn Ser Ile Ser Thr Gly 

            340                 345                 350         

Val Ile Leu Ala Gly Val Leu Gln Gly Gly Leu Thr Glu Thr Trp Gly 

        355                 360                 365             

His Asp Ser Val Tyr Val Met Ala Met Val Leu Ser Ile Leu Ala Leu 

    370                 375                 380                 

Ile Ile Cys Ala Arg Val Arg Glu Ala 

385                 390             



<210> 31

<211> 393

<212> PRT

<213> Yersinia frederiksenii





<220> 

<223> Fred MFS transporter, GeneBank ID: WP_087817556.1



<400> 31

Met Lys Ser Ala Leu Thr Phe Ser Arg Arg Ile Asn Pro Val Phe Leu 

1               5                   10                  15      

Ala Phe Phe Val Val Ala Phe Leu Ser Gly Ile Ala Gly Ala Leu Gln 

            20                  25                  30          

Ala Pro Thr Leu Ser Leu Phe Leu Ser Thr Glu Val Lys Val Arg Pro 

        35                  40                  45              

Leu Trp Val Gly Leu Phe Tyr Thr Val Asn Ala Ile Ala Gly Ile Thr 

    50                  55                  60                  

Val Ser Phe Val Leu Ala Lys Arg Ser Asp Leu Arg Gly Asp Arg Arg 

65                  70                  75                  80  

Lys Leu Ile Leu Val Cys Tyr Leu Met Ala Val Gly Asn Cys Leu Leu 

                85                  90                  95      

Phe Ala Phe Asn Arg Asp Tyr Leu Thr Leu Ile Thr Ala Gly Val Leu 

            100                 105                 110         

Leu Ala Ala Val Ala Asn Thr Ala Met Pro Gln Ile Phe Ala Leu Ala 

        115                 120                 125             

Arg Glu Tyr Ala Asp Asn Ser Ala Arg Glu Val Val Met Phe Ser Ser 

    130                 135                 140                 

Ile Met Arg Ala Gln Leu Ser Leu Ala Trp Val Ile Gly Pro Pro Leu 

145                 150                 155                 160 

Ser Phe Met Leu Ala Leu Asn Tyr Gly Phe Thr Leu Met Phe Cys Ile 

                165                 170                 175     

Ala Ala Gly Ile Phe Val Leu Ser Ala Leu Val Val Trp Phe Ile Leu 

            180                 185                 190         

Pro Ser Val Gln Arg Ala Glu Pro Val Met Asp Ala Pro Thr Val Ala 

        195                 200                 205             

Gln Gly Ser Leu Phe Ala Asp Lys Asp Val Leu Leu Leu Phe Ile Ala 

    210                 215                 220                 

Ser Met Leu Met Trp Thr Cys Asn Thr Met Tyr Ile Ile Asp Met Pro 

225                 230                 235                 240 

Leu Tyr Ile Thr Ala Ser Leu Gly Leu Pro Glu Arg Leu Ala Gly Leu 

                245                 250                 255     

Leu Met Gly Thr Ala Ala Gly Leu Glu Ile Pro Ile Met Leu Leu Ala 

            260                 265                 270         

Gly Tyr Ser Val Arg Arg Phe Gly Lys Arg Lys Ile Met Leu Phe Ala 

        275                 280                 285             

Val Leu Ala Gly Val Leu Phe Tyr Thr Gly Leu Val Leu Phe Lys Phe 

    290                 295                 300                 

Lys Ser Ala Leu Met Leu Leu Gln Ile Phe Asn Ala Ile Phe Ile Gly 

305                 310                 315                 320 

Ile Val Ala Gly Ile Gly Met Leu Tyr Phe Gln Asp Leu Met Pro Gly 

                325                 330                 335     

Arg Ala Gly Ala Ala Thr Thr Leu Phe Thr Asn Ser Ile Ser Thr Gly 

            340                 345                 350         

Val Ile Leu Ala Gly Val Leu Gln Gly Val Leu Thr Glu Thr Trp Gly 

        355                 360                 365             

His Asn Ser Val Tyr Val Met Ala Met Ile Leu Ala Ile Leu Ser Leu 

    370                 375                 380                 

Ile Ile Cys Ala Arg Val Arg Glu Ala 

385                 390             



<210> 32

<211> 392

<212> PRT

<213> Pantoea vagans





<220> 

<223> Vag MFS transporter, GeneBank ID: WP_048785139.1



<400> 32

Met Lys Ser Leu Leu Thr Arg Lys Arg Arg Ile Asn Pro Val Phe Leu 

1               5                   10                  15      

Ala Phe Met Ala Ala Ser Phe Met Ile Gly Val Ala Gly Ala Leu Gln 

            20                  25                  30          

Ala Pro Thr Leu Ser Leu Phe Leu Thr Arg Glu Val Gln Ala Arg Pro 

        35                  40                  45              

Leu Trp Val Gly Leu Phe Phe Thr Val Asn Ala Ile Ala Gly Ile Val 

    50                  55                  60                  

Val Ser Met Leu Val Ala Lys Arg Ser Asp Ser Arg Gly Asp Arg Arg 

65                  70                  75                  80  

Thr Leu Ile Leu Phe Cys Cys Ala Met Ala Phe Cys Asn Ala Leu Leu 

                85                  90                  95      

Phe Ala Phe Thr Arg His Tyr Leu Thr Leu Ile Thr Leu Gly Val Leu 

            100                 105                 110         

Leu Ser Ala Leu Ala Ser Val Ser Met Pro Gln Ile Phe Ala Leu Ala 

        115                 120                 125             

Arg Glu Tyr Ala Asp Gln Ser Ala Arg Glu Ala Val Met Phe Ser Ser 

    130                 135                 140                 

Val Met Arg Ala Gln Leu Ser Leu Ala Trp Val Ile Gly Pro Pro Leu 

145                 150                 155                 160 

Ser Phe Ala Leu Ala Leu Asn Phe Gly Phe Val Thr Leu Phe Leu Val 

                165                 170                 175     

Ala Ala Ala Leu Phe Leu Val Cys Ile Leu Leu Ile Lys Phe Thr Leu 

            180                 185                 190         

Pro Ser Val Pro Arg Ala Glu Pro Leu Met Arg Ser Gly Gly Met Pro 

        195                 200                 205             

Leu Ser Gly Trp Arg Asp Arg Asp Val Arg Leu Leu Phe Ile Ala Ser 

    210                 215                 220                 

Val Thr Met Trp Thr Cys Asn Thr Met Tyr Ile Ile Asp Met Pro Leu 

225                 230                 235                 240 

Tyr Ile Ser Val Thr Leu Gly Leu Pro Glu Lys Leu Ala Gly Leu Leu 

                245                 250                 255     

Met Gly Thr Ala Ala Gly Leu Glu Ile Pro Val Met Leu Leu Ala Gly 

            260                 265                 270         

His Tyr Ala Lys Arg Val Gly Lys Arg Asn Leu Met Leu Ile Ala Val 

        275                 280                 285             

Ala Ala Gly Val Leu Phe Tyr Ala Gly Leu Ala Met Phe Ala Ser Gln 

    290                 295                 300                 

Thr Ala Leu Met Ala Leu Gln Leu Phe Asn Ala Val Phe Ile Gly Ile 

305                 310                 315                 320 

Ile Ala Gly Ile Gly Met Leu Trp Phe Gln Asp Leu Met Pro Gly Arg 

                325                 330                 335     

Pro Gly Ala Ala Thr Thr Met Phe Thr Asn Ser Ile Ser Thr Gly Met 

            340                 345                 350         

Ile Leu Ala Gly Val Ile Gln Gly Thr Leu Ser Glu Arg Phe Gly His 

        355                 360                 365             

Ile Ala Val Tyr Trp Leu Ala Leu Gly Leu Ala Val Ala Ala Phe Ala 

    370                 375                 380                 

Met Ser Ala Arg Val Lys Asn Val 

385                 390         



<210> 33

<211> 398

<212> PRT

<213> Serratia marcescens





<220> 

<223> Marc MFS transporter, GeneBank ID: WP_060448169.1



<400> 33

Met Gln Arg Leu Ser Arg Leu Ser Leu Arg Ile Asn Pro Ile Phe Ala 

1               5                   10                  15      

Ala Phe Leu Leu Ile Ala Phe Leu Ser Gly Ile Ala Gly Ala Leu Leu 

            20                  25                  30          

Thr Pro Thr Leu Ser Leu Phe Leu Thr Thr Glu Val Lys Val Arg Pro 

        35                  40                  45              

Leu Trp Val Gly Leu Phe Tyr Thr Ala Asn Ala Val Ala Gly Ile Val 

    50                  55                  60                  

Val Ser Phe Leu Leu Ala Lys Arg Ser Asp Thr Arg Gly Asp Arg Arg 

65                  70                  75                  80  

Arg Leu Ile Leu Leu Cys Cys Leu Met Ala Val Gly Asn Cys Leu Leu 

                85                  90                  95      

Phe Ala Phe Asn Arg Asp Tyr Leu Thr Leu Ile Thr Ala Gly Val Leu 

            100                 105                 110         

Met Ser Ala Val Ala Asn Thr Ala Met Pro Gln Ile Phe Ala Leu Ala 

        115                 120                 125             

Arg Glu Tyr Ala Asp Ser Glu Ala Arg Glu Val Val Met Phe Ser Ser 

    130                 135                 140                 

Val Met Arg Ala Gln Leu Ser Leu Ala Trp Val Ile Gly Pro Pro Leu 

145                 150                 155                 160 

Ser Phe Ala Leu Ala Leu Asn Tyr Gly Phe Thr Val Met Phe Leu Ile 

                165                 170                 175     

Ala Ala Val Thr Phe Ala Val Cys Val Leu Leu Val Gly Phe Met Leu 

            180                 185                 190         

Pro Ser Val Pro Arg Ala Ala Glu Asn Glu Gly Leu Gln Gly Gly Val 

        195                 200                 205             

Ser Ala Pro Ile Ala Pro Ala Ser Ala Trp Arg Asn Arg Asp Val Arg 

    210                 215                 220                 

Leu Leu Phe Ile Ala Ser Met Leu Met Trp Thr Cys Asn Thr Leu Tyr 

225                 230                 235                 240 

Ile Ile Asp Met Pro Leu Tyr Ile Thr Ala Asp Leu Gly Leu Pro Glu 

                245                 250                 255     

Gly Leu Ala Gly Val Leu Met Gly Thr Ala Ala Gly Leu Glu Ile Pro 

            260                 265                 270         

Ala Met Leu Leu Ala Gly Tyr Tyr Val Lys Arg Phe Gly Lys Arg Asn 

        275                 280                 285             

Met Met Leu Leu Ala Val Val Ala Gly Val Leu Phe Tyr Leu Gly Leu 

    290                 295                 300                 

Thr Val Leu Glu Ser Lys Pro Ala Leu Ile Ala Leu Gln Leu Leu Asn 

305                 310                 315                 320 

Ala Val Phe Ile Gly Ile Val Ala Gly Ile Gly Met Leu Tyr Phe Gln 

                325                 330                 335     

Asp Leu Met Pro Gly Arg Pro Gly Ala Ala Thr Thr Leu Phe Thr Asn 

            340                 345                 350         

Ser Ile Ser Thr Gly Val Ile Leu Ala Gly Val Leu Gln Gly Ala Leu 

        355                 360                 365             

Val Glu Asn Leu Gly His Gly Ser Val Tyr Trp Met Ala Ala Leu Leu 

    370                 375                 380                 

Ala Leu Ala Ala Leu Gly Met Ser Ala Lys Val Arg Glu Val 

385                 390                 395             



<210> 34

<211> 505

<212> PRT

<213> Klebsiella pneumoniae





<220> 

<223> ScrY, sucrose porin, GeneBank ID CAA40657.1



<400> 34

Met Tyr Lys Lys Arg Lys Leu Ala Ile Leu Ile Ala Leu Leu Thr Gly 

1               5                   10                  15      

Thr Ala Ala Ala His Gly Gln Thr Asp Leu Asn Ser Ile Glu Ala Arg 

            20                  25                  30          

Leu Ala Ala Leu Glu Lys Arg Leu Gln Asp Ala Glu Thr Arg Ala Ser 

        35                  40                  45              

Thr Ala Glu Ser Arg Ala Ala Ser Ala Glu Gln Lys Val Gln Gln Leu 

    50                  55                  60                  

Thr Gln Gln Gln Gln Gln Thr Gln Ala Thr Thr Gln Gln Val Ala Arg 

65                  70                  75                  80  

Arg Thr Thr Gln Leu Glu Glu Lys Ala Glu Arg Pro Gly Gly Phe Glu 

                85                  90                  95      

Phe His Gly Tyr Ala Arg Ser Gly Val Ile Met Asn Asp Ser Ala Ala 

            100                 105                 110         

Ser Thr Lys Ser Gly Ala Tyr Met Thr Pro Ala Gly Glu Thr Gly Gly 

        115                 120                 125             

Ala Ile Gly Arg Leu Gly Asn Gln Ala Asp Thr Tyr Val Glu Met Asn 

    130                 135                 140                 

Leu Glu His Lys Gln Thr Leu Asp Asn Gly Ala Thr Thr Arg Phe Lys 

145                 150                 155                 160 

Val Met Val Ala Asp Gly Gln Thr Thr Tyr Asn Asp Trp Thr Ala Ser 

                165                 170                 175     

Ser Ser Asp Leu Asn Val Arg Gln Ala Phe Val Glu Leu Gly Asn Leu 

            180                 185                 190         

Pro Thr Phe Glu Gly Pro Phe Lys Gly Ser Thr Leu Trp Ala Gly Lys 

        195                 200                 205             

Arg Phe Asp Arg Asp Asn Phe Asp Ile His Trp Ile Asp Ser Asp Val 

    210                 215                 220                 

Val Phe Leu Ala Gly Thr Gly Gly Gly Ile Tyr Asp Val Lys Trp Asn 

225                 230                 235                 240 

Asp Ser Leu Arg Ser Asn Phe Ser Leu Tyr Gly Arg Asn Phe Gly Asp 

                245                 250                 255     

Ile Ala Asp Ser Ser Asn Ser Val Gln Asn Tyr Ile Val Ser Met Asn 

            260                 265                 270         

Asn Phe Ala Gly Pro Val Gln Met Met Val Ser Gly Met Arg Ala Lys 

        275                 280                 285             

Asp Asn Asp Asp Arg Gln Asp Ala Asn Gly Asn Leu Val Lys Gly Asp 

    290                 295                 300                 

Ala Ala Asn Thr Gly Val His Ala Leu Leu Gly Leu His Asn Glu Ser 

305                 310                 315                 320 

Phe Tyr Gly Leu Arg Asp Gly Thr Ser Lys Thr Ala Leu Leu Tyr Gly 

                325                 330                 335     

His Gly Leu Gly Ala Glu Val Lys Gly Ile Gly Ser Asp Gly Ala Leu 

            340                 345                 350         

Arg Pro Gly Ala Asn Thr Trp Arg Phe Ala Ser Tyr Gly Thr Thr Pro 

        355                 360                 365             

Leu Ser Asp Arg Trp Phe Ile Ala Pro Ala Val Leu Ala Gln Ser Ser 

    370                 375                 380                 

Lys Asp Arg Tyr Val Asp Gly Asp Ser Tyr Gln Trp Ala Thr Leu Asn 

385                 390                 395                 400 

Leu Arg Leu Ile Gln Glu Val Thr Gln Asn Phe Ala Leu Ala Trp Glu 

                405                 410                 415     

Gly Ser Tyr Gln Tyr Met Asp Leu Gln Pro Glu Gly Tyr Asn Asp Arg 

            420                 425                 430         

His Ala Val Asn Gly Ser Phe Tyr Lys Leu Thr Phe Ala Pro Thr Phe 

        435                 440                 445             

Lys Val Gly Ser Ile Gly Asp Phe Phe Ser Arg Pro Glu Ile Arg Phe 

    450                 455                 460                 

Tyr Thr Ser Trp Met Asp Trp Ser Lys Lys Leu Asp Asn Tyr Ala Asn 

465                 470                 475                 480 

Asp Asp Ala Leu Gly Ser Asn Gly Phe Lys Ser Gly Gly Glu Trp Ser 

                485                 490                 495     

Phe Gly Met Gln Met Glu Thr Trp Phe 

            500                 505 



<210> 35

<211> 456

<212> PRT

<213> Klebsiella pneumoniae





<220> 

<223> ScrA, sucrose-specific enzyme II, GeneBank ID: CAA40658.1



<400> 35

Met Asp Phe Glu Gln Ile Ser Arg Ser Leu Leu Pro Leu Leu Gly Gly 

1               5                   10                  15      

Lys Glu Asn Ile Ala Ser Ala Ala His Cys Ala Thr Arg Leu Arg Leu 

            20                  25                  30          

Val Leu Val Asp Asp Ala Leu Ala Asp Gln Gln Ala Ile Gly Lys Ile 

        35                  40                  45              

Asp Gly Val Lys Gly Cys Phe Arg Asn Ala Gly Gln Met Gln Ile Ile 

    50                  55                  60                  

Phe Gly Thr Gly Val Val Asn Lys Val Tyr Ala Ala Phe Ile Gln Ala 

65                  70                  75                  80  

Ala Gly Ile Ser Glu Ser Ser Lys Ser Glu Ala Ala Asp Leu Ala Ala 

                85                  90                  95      

Lys Lys Leu Asn Pro Phe Gln Arg Ile Ala Arg Leu Leu Ser Asn Ile 

            100                 105                 110         

Phe Val Pro Ile Ile Pro Ala Ile Val Ala Ser Gly Leu Leu Met Gly 

        115                 120                 125             

Leu Leu Gly Met Val Lys Thr Tyr Gly Trp Val Asp Pro Ser Asn Ala 

    130                 135                 140                 

Leu Tyr Ile Met Leu Asp Met Cys Ser Ser Ala Ala Phe Ile Ile Leu 

145                 150                 155                 160 

Pro Ile Leu Ile Gly Phe Thr Ala Ala Arg Glu Phe Gly Gly Asn Pro 

                165                 170                 175     

Tyr Leu Gly Ala Thr Leu Gly Gly Ile Leu Thr His Pro Ala Leu Thr 

            180                 185                 190         

Asn Ala Trp Gly Val Ala Ala Gly Phe His Thr Met Asn Phe Phe Gly 

        195                 200                 205             

Ile Glu Val Ala Met Ile Gly Tyr Gln Gly Thr Val Phe Pro Val Leu 

    210                 215                 220                 

Leu Ala Val Trp Phe Met Ser Met Val Glu Lys Arg Leu Arg Arg Val 

225                 230                 235                 240 

Ile Pro Asp Ala Leu Asp Leu Ile Leu Thr Pro Phe Leu Thr Val Ile 

                245                 250                 255     

Ile Ser Gly Phe Ile Ala Leu Leu Leu Ile Gly Pro Ala Gly Arg Ala 

            260                 265                 270         

Leu Gly Asp Gly Ile Ser Phe Ile Leu Ser Thr Leu Ile Ser His Ala 

        275                 280                 285             

Gly Trp Leu Ala Gly Leu Leu Phe Gly Gly Leu Tyr Ser Val Ile Val 

    290                 295                 300                 

Ile Thr Gly Ile His His Ser Phe His Ala Ile Glu Ala Gly Leu Leu 

305                 310                 315                 320 

Gly Asn Pro Ser Ile Gly Val Asn Phe Leu Leu Pro Ile Trp Ala Met 

                325                 330                 335     

Ala Asn Val Ala Gln Gly Gly Ala Cys Phe Ala Val Trp Phe Lys Thr 

            340                 345                 350         

Lys Asp Ala Lys Ile Lys Ala Ile Thr Leu Pro Ser Ala Phe Ser Ala 

        355                 360                 365             

Met Leu Gly Ile Thr Glu Ala Ala Ile Phe Gly Ile Asn Leu Arg Phe 

    370                 375                 380                 

Val Lys Pro Phe Ile Ala Ala Leu Val Gly Gly Ala Ala Gly Gly Ala 

385                 390                 395                 400 

Trp Val Val Ser Met His Val Tyr Met Thr Ala Val Gly Leu Thr Ala 

                405                 410                 415     

Ile Pro Gly Met Ala Ile Val Gln Ala Ser Ser Leu Leu Asn Tyr Ile 

            420                 425                 430         

Ile Gly Met Ala Ile Ala Phe Ala Val Ala Phe Ala Leu Ser Leu Thr 

        435                 440                 445             

Leu Lys Tyr Lys Thr Asp Ala Glu 

    450                 455     



<210> 36

<211> 466

<212> PRT

<213> Salmonella enterica subsp. enterica serovar Typhimurium





<220> 

<223> ScrB, beta-fructofuranosidase, GeneBank ID: CAA47974.1



<400> 36

Met Ser Leu Pro Ser Arg Leu Pro Ala Ile Leu Gln Ala Val Met Gln 

1               5                   10                  15      

Gly Gln Pro Arg Ala Leu Ala Asp Ser His Tyr Pro Arg Trp His His 

            20                  25                  30          

Ala Pro Val Thr Gly Leu Met Asn Asp Pro Asn Gly Phe Ile Glu Phe 

        35                  40                  45              

Ala Gly Arg Tyr His Leu Phe Tyr Gln Trp Asn Pro Leu Ala Cys Asp 

    50                  55                  60                  

His Thr Phe Lys Cys Trp Ala His Trp Ser Ser Ile Asp Leu Leu His 

65                  70                  75                  80  

Trp Gln His Glu Pro Ile Ala Leu Met Pro Asp Glu Glu Tyr Asp Arg 

                85                  90                  95      

Asn Gly Cys Tyr Ser Gly Ser Ala Val Asp Asn Asn Gly Thr Leu Thr 

            100                 105                 110         

Leu Cys Tyr Thr Gly Asn Val Lys Phe Ala Glu Gly Gly Arg Thr Ala 

        115                 120                 125             

Trp Gln Cys Leu Ala Thr Glu Asn Ala Asp Gly Thr Phe Arg Lys Ile 

    130                 135                 140                 

Gly Pro Val Leu Pro Leu Pro Glu Gly Tyr Thr Gly His Val Arg Asp 

145                 150                 155                 160 

Pro Lys Val Trp Arg His Glu Asp Leu Trp Tyr Met Val Leu Gly Ala 

                165                 170                 175     

Gln Asp Arg Gln Lys Arg Gly Lys Val Leu Leu Phe Ser Ser Ala Asp 

            180                 185                 190         

Leu His Gln Trp Thr Ser Met Gly Glu Ile Ala Gly His Gly Ile Asn 

        195                 200                 205             

Gly Leu Asp Asp Val Gly Tyr Met Trp Glu Cys Pro Asp Leu Phe Pro 

    210                 215                 220                 

Leu Gly Asp Gln His Ile Leu Ile Cys Cys Pro Gln Gly Ile Ala Arg 

225                 230                 235                 240 

Glu Glu Glu Cys Tyr Leu Asn Thr Tyr Pro Ala Val Trp Met Ala Gly 

                245                 250                 255     

Glu Phe Asp Tyr Ala Ala Gly Ala Phe Arg His Gly Glu Leu His Glu 

            260                 265                 270         

Leu Asp Ala Gly Phe Glu Phe Tyr Ala Pro Gln Thr Met Leu Thr Ser 

        275                 280                 285             

Asp Gly Arg Arg Leu Leu Val Gly Trp Met Gly Val Pro Glu Gly Glu 

    290                 295                 300                 

Glu Met Leu Gln Pro Thr Leu Asn Asn Gly Trp Ile His Gln Met Thr 

305                 310                 315                 320 

Cys Leu Arg Glu Leu Glu Phe Ile Asn Gly Gln Leu Tyr Gln Arg Pro 

                325                 330                 335     

Leu Arg Glu Leu Ser Ala Leu Arg Gly Glu Ala Asn Gly Trp Ser Gly 

            340                 345                 350         

Asn Ala Leu Pro Leu Ala Pro Met Glu Ile Asp Leu Gln Thr Arg Gly 

        355                 360                 365             

Gly Asp Met Leu Ser Leu Asp Phe Gly Gly Val Leu Thr Leu Glu Cys 

    370                 375                 380                 

Asp Ala Ser Gly Leu Arg Leu Ala Arg Arg Ser Leu Ala Ser Asp Glu 

385                 390                 395                 400 

Met His Tyr Arg Tyr Trp Arg Gly Asn Val Arg Ser Leu Arg Val Phe 

                405                 410                 415     

Ile Asp Gln Ser Ser Val Glu Ile Phe Ile Asn Gly Gly Glu Gly Val 

            420                 425                 430         

Met Ser Ser Arg Tyr Phe Pro Ala Cys Ser Gly Gln Leu Thr Phe Ser 

        435                 440                 445             

Gly Ile Thr Pro Asp Ala Phe Cys Tyr Trp Pro Leu Arg Thr Cys Met 

    450                 455                 460                 

Val Glu 

465     



<210> 37

<211> 334

<212> PRT

<213> Salmonella enterica subsp. enterica serovar Typhimurium





<220> 

<223> ScrR, sucrose repressor, GeneBank ID: CAA47975.1



<400> 37

Met Lys Thr Lys Arg Val Thr Ile Lys Asp Ile Ala Glu Gln Ala Gly 

1               5                   10                  15      

Val Ser Lys Ala Thr Ala Ser Leu Val Leu Asn Gly Arg Gly Lys Glu 

            20                  25                  30          

Leu Arg Val Ala Gln Glu Thr Arg Glu Arg Val Leu Ser Ile Ala Arg 

        35                  40                  45              

Lys His His Tyr Gln Pro Ser Ile His Ala Arg Ser Leu Arg Asn Asn 

    50                  55                  60                  

Arg Ser His Thr Ile Gly Leu Val Val Pro Glu Ile Thr Asn His Gly 

65                  70                  75                  80  

Phe Ala Val Phe Ala His Glu Leu Glu Met Leu Cys Arg Glu Ala Gly 

                85                  90                  95      

Val Gln Leu Leu Ile Ser Cys Thr Asp Glu Asn Pro Gly Gln Glu Ser 

            100                 105                 110         

Val Val Val Asn Asn Met Ile Ala Arg Gln Val Asp Gly Met Ile Val 

        115                 120                 125             

Ala Ser Cys Met His Asn Asp Ala Asp Tyr Leu Lys Leu Ser Gln Gln 

    130                 135                 140                 

Leu Pro Val Val Leu Phe Asp Arg Cys Pro Asn Glu Ser Ala Leu Pro 

145                 150                 155                 160 

Leu Val Met Thr Asp Ser Ile Thr Pro Thr Ala Glu Leu Ile Ser Arg 

                165                 170                 175     

Ile Ala Pro Gln His Ser Asp Glu Phe Trp Phe Leu Gly Gly Gln Ala 

            180                 185                 190         

Arg Leu Ser Pro Ser Arg Asp Arg Leu Thr Gly Phe Thr Gln Gly Leu 

        195                 200                 205             

Ala Gln Ala Gly Ile Ala Leu Arg Pro Glu Trp Val Ile Asn Gly Asn 

    210                 215                 220                 

Tyr His Pro Ser Ser Gly Tyr Glu Met Phe Ala Ala Leu Cys Ala Arg 

225                 230                 235                 240 

Leu Gly Arg Pro Pro Lys Ala Leu Phe Thr Ala Ala Cys Gly Leu Leu 

                245                 250                 255     

Glu Gly Val Leu Arg Tyr Met Ser Gln His His Leu Leu Asp Ser Asp 

            260                 265                 270         

Ile His Leu Thr Ser Phe Asp Asp His Tyr Leu Tyr Asp Ser Leu Ser 

        275                 280                 285             

Leu Arg Ile Asp Thr Val Gln Gln Asp Asn Arg Gln Leu Ala Trp His 

    290                 295                 300                 

Cys Tyr Asp Leu Ile Ser Gln Leu Ile Glu Gly Asp Thr Pro Glu Thr 

305                 310                 315                 320 

Leu Gln Arg Tyr Leu Pro Ala Thr Leu Gln Phe Arg His Gln 

                325                 330                 



<210> 38

<211> 483

<212> PRT

<213> Avibacterium gallinarum





<220> 

<223> SacC_AgaI, glycoside hydrolase family 32 protein, GeneBank ID:

      WP_103853210.1



<400> 38

Met Ile Ile Phe Asn Glu Gly Lys Tyr Lys Ser Leu Tyr Ala Ala Glu 

1               5                   10                  15      

Gln Gly Glu Leu Glu Lys Ile Ala Gln Thr Val Ala Gln Asp Gln Asp 

            20                  25                  30          

Phe Arg Pro Val Tyr His Leu Ala Pro Pro Thr Gly Leu Leu Asn Asp 

        35                  40                  45              

Pro Asn Gly Leu Ile Phe Asp Gly Glu Lys Tyr His Leu Phe Tyr Gln 

    50                  55                  60                  

Trp Tyr Pro Phe Asp Ala Leu His Gly Met Lys His Trp Gln His Phe 

65                  70                  75                  80  

Ile Thr Gln Asp Phe Lys Gln Phe Ser Gln Ala Asp Leu Leu Val Pro 

                85                  90                  95      

Cys Glu Leu Tyr Glu Ser His Gly Cys Tyr Ser Gly Gly Ala Val Lys 

            100                 105                 110         

Ile Gly Asp Gln Ile Ala Val Phe Tyr Thr Gly Asn Thr Arg Arg Pro 

        115                 120                 125             

Ser Asp Asn Gln Arg Val Pro Tyr Gln Asn Leu Ala Ile Phe Ser Lys 

    130                 135                 140                 

Asp Gly Lys Leu Leu Ser Lys Arg Pro Leu Ile Glu Gln Ala Pro Gln 

145                 150                 155                 160 

Gly Tyr Thr Glu His Val Arg Asp Pro Lys Pro Phe Leu Thr Lys Asp 

                165                 170                 175     

Gly Lys Ile Arg Phe Ile Cys Gly Ala Gln Arg Glu Asn Leu Thr Gly 

            180                 185                 190         

Thr Ala Leu Val Phe Glu Met Asp Asn Leu Ala Asp Thr Pro Arg Leu 

        195                 200                 205             

Leu Gly Glu Leu Ala Leu Pro Ala Phe Asp Asn Gln Gly Val Phe Met 

    210                 215                 220                 

Trp Glu Cys Pro Asp Leu Ser Gln Met Gly Asp Lys Ser Leu Phe Ile 

225                 230                 235                 240 

Trp Ser Pro Gln Gly Lys Ala Arg Glu Leu Glu Gln Tyr Gln Asn Asn 

                245                 250                 255     

Tyr His Ala Val Tyr Ala Leu Gly Glu Leu Ala Asp Arg Gln Phe His 

            260                 265                 270         

Ala Glu Gln Ile Ala Glu Leu Asp Gln Gly Phe Asp Phe Tyr Ala Pro 

        275                 280                 285             

Gln Thr Phe Ser Gly Thr Gln Thr Met Leu Leu Gly Trp Val Gly Leu 

    290                 295                 300                 

Pro Asp Leu Ser Tyr Pro Thr Asp Leu Tyr Lys Trp His Ser Met Leu 

305                 310                 315                 320 

Ser Met Pro Arg Gln Leu Arg Leu Gln Asp Gly Lys Ile Tyr Gln Gln 

                325                 330                 335     

Pro Ile Glu Asn Ile Tyr Lys Asn Leu Thr Ala Leu Gln Ser Ile Thr 

            340                 345                 350         

Val Glu Lys Glu Ala Glu Ile Ala Asp Leu Asp Arg Ala Tyr Leu Lys 

        355                 360                 365             

Phe Asp Ala Asn Ala Gln Pro Phe Ser Leu Lys Phe Phe Asn Asn Ala 

    370                 375                 380                 

Gln Asn Gln Arg Leu Ile Leu Ser Tyr Asp Gly Glu Met Leu Cys Leu 

385                 390                 395                 400 

Asp Arg Ser Gln Thr Glu Gln Thr Asp Ser Met Lys Ser Phe Gly Asp 

                405                 410                 415     

Lys Arg Tyr Cys Arg Ile Glu Asp Leu Arg Gln Val Glu Ile Phe Phe 

            420                 425                 430         

Asp Arg Ser Val Ala Glu Ile Phe Leu Asn Gln Gly Glu Lys Ala Met 

        435                 440                 445             

Thr Ser Arg Phe Phe Ile Cys Ala Arg Glu Asn Gln Leu Cys Thr Asp 

    450                 455                 460                 

Lys Pro Leu Thr Leu Gln Val Gly Tyr Pro Lys Lys Ile Glu Val Asp 

465                 470                 475                 480 

Tyr Thr Lys 

            



<210> 39

<211> 548

<212> PRT

<213> Arthrobacter globiformis





<220> 

<223> Bff, beta-fructofuranosidase protein, GeneBank ID: BAD18121.1



<400> 39

Met Glu Arg Thr Cys Ile Thr Val Arg Ala Ile Val Arg Phe His Ile 

1               5                   10                  15      

Glu Gln Arg Gln Thr Ile Val Asn Lys Gln Arg Thr Lys Arg Gly Ile 

            20                  25                  30          

Leu Thr Ala Ala Leu Ser Ile Gly Ala Leu Gly Ala Thr Leu Ile Ser 

        35                  40                  45              

Gly Pro Ala Val Ala Ala Thr Asp Ala Ala Pro Gly Phe Pro Gln Pro 

    50                  55                  60                  

Thr Glu His Thr Gln Lys Ala Tyr Ser Pro Thr Asp Asn Phe Thr Ser 

65                  70                  75                  80  

Arg Trp Thr Arg Ala Asp Ala Lys Gln Leu Lys Ala Met Ser Asp Pro 

                85                  90                  95      

Asp Ala Gly Ser Arg Glu Asn Ser Met Pro Thr Glu Tyr Thr Met Pro 

            100                 105                 110         

Thr Val Ser Gln Asp Phe Pro Asp Met Ser Asn Glu Lys Val Trp Val 

        115                 120                 125             

Trp Asp Thr Trp Pro Leu Ile Asp Glu Asn Ala Asn Gln Tyr Ser Val 

    130                 135                 140                 

Asn Gly Gln Glu Ile Ile Phe Ser Leu Val Ala Asp Arg Lys Leu Gly 

145                 150                 155                 160 

Phe Asp Glu Arg His Gln Tyr Ala Arg Ile Gly Tyr Phe Tyr Arg Pro 

                165                 170                 175     

Ala Gly Ile Pro Ala Asp Glu Arg Pro Glu Asp Gly Gly Trp Thr Tyr 

            180                 185                 190         

Gly Gly Gln Val Phe Asp Glu Gly Val Thr Gly Lys Ile Phe Glu Asp 

        195                 200                 205             

Gln Ser Phe Thr His Gln Thr Gln Trp Ser Gly Ser Ala Arg Val Ser 

    210                 215                 220                 

Lys Asn Gly Glu Ile Lys Leu Phe Phe Thr Asp Val Ala Phe Tyr Arg 

225                 230                 235                 240 

Asp Lys Asp Gly Gln Asp Val Lys Pro Tyr Asp Ser Arg Ile Ala Leu 

                245                 250                 255     

Ser Val Gly His Val His Ser Asn Lys Lys Gly Val Lys Leu Thr Gly 

            260                 265                 270         

Phe Asn Lys Val Lys Glu Leu Leu Gln Ala Asp Gly Lys Asn Tyr Gln 

        275                 280                 285             

Asn Ala Ala Gln Asn Ser Tyr Tyr Asn Phe Arg Asp Pro Phe Thr Phe 

    290                 295                 300                 

Val Asp Pro Ala His Pro Gly Glu Thr Tyr Met Val Phe Glu Gly Asn 

305                 310                 315                 320 

Ser Ala Met Asp Arg Asp Glu Ala Lys Cys Thr Ala Glu Asp Leu Gly 

                325                 330                 335     

Tyr Arg Glu Gly Glu Thr Asn Gly Glu Thr Val Glu Gln Val Asn Asn 

            340                 345                 350         

Ser Gly Ala Thr Tyr Gln Ile Gly Asn Val Gly Leu Ala Arg Ala Lys 

        355                 360                 365             

Asn Lys Ala Leu Thr Glu Trp Glu Phe Leu Pro Pro Ile Leu Ser Ala 

    370                 375                 380                 

Asn Cys Val Thr Asp Gln Thr Glu Arg Pro Gln Ile Tyr Met Gln Asp 

385                 390                 395                 400 

Gly Lys Tyr Tyr Leu Phe Thr Ile Ser His Arg Ser Thr Phe Ala Thr 

                405                 410                 415     

Gly Ile Asp Gly Pro Glu Gly Val Tyr Gly Phe Val Gly Asn Gly Ile 

            420                 425                 430         

Arg Ser Asp Tyr Gln Pro Leu Asn Arg Gly Ser Gly Leu Ala Leu Gly 

        435                 440                 445             

Ser Pro Thr Asn Leu Asn Phe Ala Ala Gly Thr Pro Phe Ala Pro Asp 

    450                 455                 460                 

Tyr Asn Gln His Pro Gly Gln Phe Gln Ala Tyr Ser His Tyr Val Met 

465                 470                 475                 480 

Pro Gly Gly Leu Val Gln Ser Phe Ile Asp Thr Ile Gly Thr Lys Asp 

                485                 490                 495     

Asn Phe Val Arg Gly Gly Thr Leu Gly Pro Thr Val Lys Leu Asn Ile 

            500                 505                 510         

Lys Gly Asp Ser Ala Thr Val Asp Tyr Asn Tyr Gly Asp Asn Gly Leu 

        515                 520                 525             

Gly Gly Trp Ala Asp Ile Pro Ala Asn Arg Glu Leu Lys Asn Ser Lys 

    530                 535                 540                 

Ala Val Ala Lys 

545             



<210> 40

<211> 999

<212> DNA

<213> Artificial Sequence





<220> 

<223> lgta coding nucleotide sequence



<400> 40

atgcaaccgc tggtctccgt gctgatctgt gcttacaatg tggaaaaata cttcgcccaa      60



tcgctggccg cagtcgtgaa tcaaacgtgg cgcaacctgg aaattctgat cgtggatgac     120



ggcagtaccg atggtacgct ggcgatcgcc aaagattttc agaaacgtga ctcccgcatt     180



aaaatcctgg cacaggctca aaacagtggc ctgattccgt ccctgaatat cggtctggat     240



gaactggcga aaagtggcat gggtgaatat atcgcacgca ccgatgctga tgacattgcg     300



gccccggact ggattgaaaa aatcgtcggc gaaatggaaa aagatcgtag cattatcgcg     360



atgggtgcct ggctggaagt gctgtctgaa gaaaaagatg gcaatcgtct ggcacgccat     420



caccgtcatg gtaaaatctg gaaaaaaccg acgcgtccgg aagatattgc cgactttttc     480



ccgtttggca acccgattca caacaatacc atgatcatgc gtcgctcagt tattgatggc     540



ggtctgcgct ataatacgga acgtgattgg gcggaagact atcagttctg gtacgatgtc     600



tcgaaactgg gtcgcctggc gtattacccg gaagccctgg tgaaatatcg tctgcatgcc     660



aaccaagtta gctctaaata ctctatccgc caacacgaaa ttgcacaggg catccaaaaa     720



accgctcgta atgattttct gcagtcaatg ggttttaaaa cgcgcttcga ctcgctggaa     780



tatcgtcaaa ttaaagcggt tgcctacgaa ctgctggaaa aacatctgcc ggaagaagat     840



tttgaacgcg cgcgtcgctt tctgtatcag tgcttcaaac gtaccgacac gctgccggca     900



ggtgcttggc tggatttcgc agctgacggt cgcatgcgtc gcctgtttac cctgcgtcaa     960



tacttcggca ttctgcaccg cctgctgaaa aaccgttaa                            999





<210> 41

<211> 1005

<212> DNA

<213> Artificial Sequence





<220> 

<223> pmnagT coding nucleotide sequence



<400> 41

atggaaaata aaccgctggt tagcgttctg atttgcgcct ataatgtgga aaaatacatc      60



gaagaatgca tcaacgccgt tattaaccag acctataaaa acctggaaat catcattgtg     120



aatgatggca gcagcgataa cacctatttt ctgctgaaaa aactggccga aaaagacaac     180



cgtatcaaga tcctgaactt caacaaccat attggcatta ttagcgcact gaatgaaggc     240



ctgaaagaaa ttgccggtga atatattgca cgtaccgatt cagatgatat caccaaaccg     300



gattggatcg aaaaaattct gacctgtatg cagaacgacc cgaaaattat cgcaatgggt     360



agctatctga ccgttctgag cgaagaaaat aatggtagcg tgctggccaa tcaccataaa     420



aacaaagtgg aatggaaaaa cccgctggaa cataaagata tcgtggaaaa aatgctgttt     480



ggcaacccga ttcataataa cagcatggtt atgcgcagcg agatctatac caaatatcac     540



ctgatttatg atccggatta tcattatgcc gaggactata aattctggct ggaagttagc     600



cgtattggta aactggcaaa ttatccggaa agcctggttt attatcgtct gcatcgtaat     660



cagaccagca gcattcataa ttcccagcaa gaaatcaacg gtaaaaaact gcgtctgcag     720



gcactgaact attatctgaa agatctgggc attgattatc agctgccgga aaaatttctg     780



ttcaaagata ttgcactgct gcaagagatc ttttatgaac gtggtatgtt ccgcgaaaac     840



attattcgtc gcattatcta tgagtgctat ctgagcctgg gcgagtataa ttacaaagat     900



atctactact tcctgatcaa caaaaacaac tttctgagca tcaaagacaa attcaaaatc     960



atcaaaaaat acctgcgtcc ggacaaatat agcagcacct attaa                    1005





<210> 42

<211> 990

<212> DNA

<213> Artificial Sequence





<220> 

<223> HD0466 coding nucleotide sequence



<400> 42

atgaccacac tggttagcgt tctgatttgt gcctataacg tggaaaaata catcgatgaa      60



tgtctgaatg cagttattgc ccagacctat aaaaacctgg aaattatcgt tgtgaatgat     120



ggtagcaccg atggcaccct ggcaaaactg cgtcagtttg aagcaaaaga tccgcgtgtt     180



aaaatcatcg ataacattgt taatcagggc accagcaaaa gcctgaatat tggtattcag     240



tattgtcagg gcgaaattat tgcacgtacc gattcagatg atatcgtgga tattcattgg     300



atcgaaaccc tgatgcgtga actggataat agtccggaaa ccattgcaat tagcgcctat     360



ctggaatttc tggccgaaaa aggtaatggt agcaaactga gccgtagccg taaacatggt     420



aaaaatgcag aaaatccgat tagcagcgaa gcaattagcc agcgtatgct gtttggtaat     480



ccggttcata acaatgtggc actggttcgt cgtaaagtgt ttagcgaata tggtctgcgt     540



tttgatccgg attatattca tgccgaggat tacaaatttt ggttcgaagt gagcaaactg     600



ggtaaaatgc gtacctatcc gaaagcgctg gttaaatatc gtctgcatgc aacccaggtt     660



agcagcgcat ataatcagaa acagcgtagc attgccaaaa aaatcaaacg tgaagccatc     720



agccattatc tgcagcagta tggcattcag ctgccggaaa aactgaccat tcatgacctg     780



tttagcattt ttagtccgca gattgaactg agcctgaccg ttgcaaataa acaagaactg     840



ttttggagcc tggcaaccag cctgagcgaa tatcattttc gtgatctgct gaaaatctac     900



agcctggata tttttcatca gctgagcttc aaatacaaaa agcgcatctt tcgcaaattt     960



ctgctgccga atcgttatcc gagcgttatt                                      990





<210> 43

<211> 1320

<212> DNA

<213> Artificial Sequence





<220> 

<223> GalTK coding nucleotide



<400> 43

atgatctctg tctacatcat cagtctgaaa gaatcgcagc gtcgtctgga tacggaaaaa      60



ctggttctgg aatcgaacga aaaatttaaa ggccgttgtg tgtttcagat tttcgatgcg     120



atctctccga aacatgaaga cttcgaaaaa ttcgttcaag aactgtacga tagctctagt     180



ctgctgaaat cggattggtt ccatagcgac tattgctacc aggaactgct gccgcaagaa     240



tttggttgtt atctgagcca ctacctgctg tggaaagaat gcgttaaact gaatcagccg     300



gtggttattc tggaagatga cgtcgcgctg gaatctaact ttatgcaggc cctggaagat     360



tgtctgaaaa gtccgtttga cttcgtccgt ctgtatggcc attactgggg cggtcacaaa     420



accaatctgt gcgcgctgcc ggtttatacc gaaacggaag aagcggaagc ctccattgaa     480



aaaaccccga tcgaaaatta tgaagtgacc agcccgccgc cgccgaaccc gacccgcgat     540



acgcagcaag acttcatcac cgaaacgcag caagatccga aagaactgtc ggaaccgtgc     600



aaaattgccc cgcagaaaat cagcttcaac caagtcgtgt tcaagaaaat taaacgtaaa     660



ctgaaccgct tcatcggtag catcctggcg cgtaccgaag tctataaaaa tatcgtggcc     720



aaatacgatg acctgaccac gaaatatgac gatctgacca cgaaatatga tgatctgacg     780



accaaatatg acgacctgac gacgaaatac gatgacctga acaaaaacat cgcagaaaaa     840



tacgatgaac tgatgggcaa atacgaatcg ctgctggcta aagaagtgaa catcaaagaa     900



accttctggg aatcccgtgc ggattcagaa aaagaagccc tgtttctgga ccatttctat     960



ctgaccagcg tttacgtcgc aaccacggct ggctattacc tgaccccgaa aggtgcaaaa    1020



accttcattg aagctacgga acgctttaaa attatcgaac cggttgatat gttcattaac    1080



aatccgacct atcatgatat tgccaacttt acgtacgtgc cgtgtccggt ttccctgaac    1140



aaacacgcat tcaactcaac catccagaac gctaaaaaac cggatattag cctgaaaccg    1200



ccgaaaaaat cttacttcga taacctgttt tatcacaaat ttaacgcacg caaatgcctg    1260



aaagcattca ataaatacag taaacagtac gccccgctga aaaccccgaa agaagtctaa    1320





<210> 44

<211> 789

<212> DNA

<213> Artificial Sequence





<220> 

<223> cvb3galT coding nucelotide sequence



<400> 44

atggacacca tcatgattaa acgtccgctg gttagcgtta ttctgccggt gaataaaaac      60



aatccgcatc tggaagaagc aatccagagc attaaaaacc agacctataa agagctggaa     120



ctgatcatta ttgccaacaa ctgcgaggat aacttttata gcctgctgct gaaatatcag     180



gaccagaaaa ccaaaattat ccgcaccagc atcaaatatc tgccgtttag cctgaatctg     240



ggtgttcatc tgagccaggg tgaatatatt gcacgtatgg attcagatga tatcagcgtt     300



ctggatcgca ttgaaaaaca ggttaaacgc tttctgaata caccggaact gagcattctg     360



ggtagcaatg ttgaatatat caatgaagcc agcgaaagca ttggctatag caactatccg     420



ctggatcata gcagcattgt taatagcttt ccgtttcgtt gtaatctggc acatccgacc     480



attatggtta aaaaagaagt gattaccacg cttggtggct atatgtatgg tagcctgagc     540



gaagattatg atctgtggat tcgtgcaagc cgtcatggca atttcaaatt tagcaatatt     600



gatgaaccgc tgctgaagta ccgtattcat aaaggtcagg caaccaataa aagcaacgcc     660



tataacatct ttgcctttga tagcagcctg aaaatccgtg aatttctgct gaatggtaat     720



gtgcagtatc tgctgggtgc agcacgtggt ttttttgcat ttctgtatgt gcgcttcatc     780



aaaaaatga                                                             789





<210> 45

<211> 909

<212> DNA

<213> Artificial Sequence





<220> 

<223> futC coding nuelcotide sequence



<400> 45

atggcgttca aagtggtcca aatctgcggt ggtctgggta atcaaatgtt ccaatatgcc      60



ttcgctaaat cgctgcaaaa acacagtaat accccggtcc tgctggatat tacgagtttt     120



gattggtccg accgtaaaat gcagctggaa ctgttcccga ttgatctgcc gtatgcgagc     180



gccaaagaaa tcgcaattgc taaaatgcag catctgccga aactggttcg tgatgcgctg     240



aaatgcatgg gctttgaccg cgtcagtcaa gaaatcgtgt tcgaatatga accgaaactg     300



ctgaaaccgt cccgtctgac ctatttcttt ggttactttc aggacccgcg ttacttcgac     360



gccatctctc cgctgattaa acaaaccttt acgctgccgc cgccgccgga aaacaacaaa     420



aacaacaaca aaaaagaaga agaatatcag tgcaaactga gcctgatcct ggcggccaaa     480



aactctgtgt ttgttcacat tcgtcgcggc gattacgtgg gcatcggttg tcagctgggt     540



attgactatc agaaaaaagc gctggaatac atggccaaac gtgttccgaa tatggaactg     600



tttgtcttct gcgaagatct ggaatttacc caaaacctgg acctgggcta tccgttcatg     660



gatatgacca cgcgcgacaa agaagaagaa gcgtattggg atatgctgct gatgcagagc     720



tgtcaacatg gtattatcgc taatagcacg tattcttggt gggcagctta cctgattgaa     780



aacccggaaa aaattatcat tggcccgaaa cattggctgt ttggtcacga aaatatcctg     840



tgtaaagaat gggtgaaaat cgaatcacac ttcgaagtta aatcgcagaa atataacgcg     900



ctgggctaa                                                             909





<210> 46

<211> 879

<212> DNA

<213> Artificial Sequence





<220> 

<223> mtun coding nucleotide sequence



<400> 46

atggtgatta cccatctgat tggtggtctg ggtaaccaga tgtttcagta tgcagcaggt      60



cgtgcagtta gcctggaacg tggtgttagc ctgagcctgg atattagcgg ttttgcaaat     120



tatggtctgc atcagggttt tgaactgcag cgtatcttta attgtaccgc agaaattgca     180



aatgaagccg atgttcgtgg tattttaggt tggcagagca gtccgcgtat tcgtcagctg     240



ctgagccgtc agaatatggc aatttttcgt cgtgaaggtt ttgttgtgga accgcatttt     300



cattattggc agggtattaa aagcgttccg cgtgattgtt atctgaccgg ctattggcag     360



agtgaacagt attttctgga agcagcagca cagattcgtg cagattttac ctttaaactg     420



ccgctggata accagaatat tgaactggcc aaacaaatca atgccgttaa tgcggttagc     480



ctgcatgttc gtcgtggtga ttatgcaaat acaccggaaa ccaccgcaac acatggtctg     540



tgtagtctgg attattatcg tgttgccatt cgtcatattg cagaacaggt tcagcagccg     600



catttttttg tttttagtga tgatattgcc tgggtgaaga acaacctgag tattgatttt     660



ccgtgccagt atgtggatca taatcagggt gcagaaagct ataatgatat gcgtctgatg     720



agcatgtgcc gtcatcatat tattgcaaac agcagcttta gttggtgggg tgcatggctg     780



aatccgaatg ttaacaaaat tgttgttgca ccgagccgtt ggtttgccaa acagaccgat     840



gtgcgtgatc tgctgccgca aggttggatt aaacagtaa                            879





<210> 47

<211> 879

<212> DNA

<213> Artificial Sequence





<220> 

<223> smob coding nucleotide sequence



<400> 47

atgatcatca gccagattat tggtggtctg ggtaatcaga tgtttcagta tgcagcaggt      60



cgtgcactga gcctggttcg tggtcagccg ctgctgctgg atgttaccgg ttttgcaggt     120



tatggtctgc atcagggttt tgaactgcag cgtgtttttg attgtccgat tggtattgca     180



accgaagaag atgttcgcgg tattttaggt tggcagttta gcgcaggtat tcgtcgtatt     240



gttgcacgtc ctggtatggc agcatttcgt cgtaaaggtt ttattgtgga accgcacttt     300



cattattggc ctgagattaa aaacgttccg cgtgattgtt atctgcttgg ttattggcag     360



agcgaacgtt attttcgtgc agcaaccgca gatattcgtg cagatttttc atttaaaagt     420



ccgctggtta atcgcaatgc cgaaaccgca gcacagattg atcaggttaa tgcaattagc     480



ctgcatatgc gtcgtggtga ttatgtgaat aatccgaaaa ccagcgcaac ccatggtctg     540



tgtagcctgg attattatca ggcagcaatc aaatttgtta gcgaacgtgt tgaagaaccg     600



tttttcttta tcttctccga tgatattgca tgggtgaaag caaatctgaa actggatttt     660



ccgtgccagt atgtggatca taatcatggt gcagaaagct tcaatgatat gcatctgatg     720



agcctgtgtc agcatcatat tattgcaaac agcagcttta gttggtgggg tgcatggctg     780



aatagcgatc cgaaaaaaat cgttctggca ccgaaaaaat ggttcgccaa caaaaacaac     840



atcaaagacc tgtttccgcc tggttgggtt agcctgtaa                            879





<210> 48

<211> 730

<212> DNA

<213> Artificial Sequence





<220> 

<223> DNA-binding transcriptional repressor GlpR



<400> 48

atgaaacaaa cacaacgtca caacggtatt atcgaactgg ttaaacagca gggttatgtc      60



agtaccgaag agctggtaga gcatttctcc gtcagcccgc agactattcg ccgcgacctc     120



aatgagctgg cggagcaaaa cctgatcctg gccatcatgg cggtgcggcg ctgccttcca     180



gttcggttaa cacgccgtgg cacgatcgca aggccaccca gaccgaagaa aaagagcgca     240



tcgcccgcaa agtggcggag caaatcccca atggctcgac gctgtttatc gatatcggca     300



ccacgccgga agcggtagcg cacgcactgc tcaatcacag caatttgcgc attgtcacca     360



acaatctcaa cgttgctaac acgttgatgg taaaagaaga ttttcgcatc attctcgccg     420



gtggcgaatt acgcagccgc gatggcggga tcattggcga agcgacgctc gattttatct     480



cccagttccg ccttgatttc ggcattctgg ggataagcgg catcgatagc gacggctcgc     540



tgctggagtt cgattaccac gaagttcgca ccaaacgcgc cattattgag aactcgcgcc     600



acgttatgct ggttgtcgat cactcgaaat ttggccgtaa cgcgatggtc aatatgggca     660



gcatcagcat ggtagatgcc gtctacaccg acgccccgcc gccagtaagc gtgatgcagg     720



tgctgacgga                                                            730





<210> 49

<211> 292

<212> PRT

<213> Sideroxydans lithotrophicus





<220> 

<223> fucT54 α-1,2-fucosyltransferas



<400> 49

Met Val Ile Ser Asn Ile Ile Gly Gly Leu Gly Asn Gln Met Phe Gln 

1               5                   10                  15      

Tyr Ala Ala Ala Arg Ala Leu Ser Leu Lys Leu Glu Val Pro Leu Lys 

            20                  25                  30          

Leu Asp Ile Ser Gly Phe Thr Asn Tyr Ala Leu His Gln Gly Phe Glu 

        35                  40                  45              

Leu Asp Arg Ile Phe Gly Cys Lys Ile Glu Ile Ala Ser Glu Ala Asp 

    50                  55                  60                  

Val His Glu Ile Leu Gly Trp Gln Ser Ala Ser Gly Ile Arg Arg Val 

65                  70                  75                  80  

Val Ser Arg Pro Gly Met Ser Ile Phe Arg Arg Lys Gly Phe Val Val 

                85                  90                  95      

Glu Pro His Phe Ser Tyr Trp Asn Gly Ile Arg Lys Ile Thr Gly Asp 

            100                 105                 110         

Cys Tyr Leu Ala Gly Tyr Trp Gln Ser Glu Lys Tyr Phe Leu Asp Ala 

        115                 120                 125             

Ala Val Glu Ile Arg Lys Asp Phe Ser Phe Lys Leu Pro Leu Asp Ser 

    130                 135                 140                 

His Asn Ala Glu Leu Ala Glu Lys Ile Asp Gln Glu Asn Ala Val Ser 

145                 150                 155                 160 

Leu His Ile Arg Arg Gly Asp Tyr Ala Asn Asn Pro Leu Thr Ala Ala 

                165                 170                 175     

Thr His Gly Leu Cys Ser Leu Asp Tyr Tyr Arg Lys Ser Ile Lys His 

            180                 185                 190         

Ile Ala Gly Gln Val Arg Asn Pro Tyr Phe Phe Val Phe Ser Asp Asp 

        195                 200                 205             

Ile Ala Trp Val Lys Asp Asn Leu Glu Ile Glu Phe Pro Ser Gln Tyr 

    210                 215                 220                 

Val Asp Tyr Asn His Gly Ser Met Ser Phe Asn Asp Met Arg Leu Met 

225                 230                 235                 240 

Ser Leu Cys Lys His His Ile Ile Ala Asn Ser Ser Phe Ser Trp Trp 

                245                 250                 255     

Gly Ala Trp Leu Asn Pro Asn Pro Glu Lys Val Val Ile Ala Pro Glu 

            260                 265                 270         

Arg Trp Phe Ala Asn Arg Thr Asp Val Gln Asp Leu Leu Pro Pro Gly 

        275                 280                 285             

Trp Val Lys Leu 

    290         



<210> 50

<211> 24

<212> DNA

<213> Artificial Sequence





<220> 

<223> Oligo O48, galK.for



<400> 50

cccagcgaga cctgaccgca gaac                                           24





<210> 51

<211> 24

<212> DNA

<213> Artificial Sequence





<220> 

<223> Oligo O49, galK.rev



<400> 51

ccccagtcca tcagcgtgac tacc                                           24



<210> 52
<211> 6706
<212> DNA
<213> Escherichia coli


<220> 
<223> CA gene cluster


<400> 52

atgtcaaaag tcgctctcat caccggtgta accggacaag acggttctta cctggcagag       60

tttctgctgg aaaaaggtta cgaggtgcat ggtattaagc gtcgcgcatc gtcattcaac      120

accgagcgcg tggatcacat ttatcaggat ccgcacacct gcaacccgaa attccatctg      180

cattatggcg acctgagtga tacctctaac ctgacgcgca ttttgcgtga agtacagccg      240

gatgaagtgt acaacctggg cgcaatgagc cacgttgcgg tctcttttga gtcaccagaa      300

tataccgctg acgtcgacgc gatgggtacg ctgcgcctgc tggaggcgat ccgcttcctc      360

ggtctggaaa agaaaactcg tttctatcag gcttccacct ctgaactgta tggtctggtg      420

caggaaattc cgcagaaaga gaccacgccg ttctacccgc gatctccgta tgcggtcgcc      480

aaactgtacg cctactggat caccgttaac taccgtgaat cctacggcat gtacgcctgt      540

aacggaattc tcttcaacca tgaatccccg cgccgcggcg aaaccttcgt tacccgcaaa      600

atcacccgcg caatcgccaa catcgcccag gggctggagt cgtgcctgta cctcggcaat      660

atggattccc tgcgtgactg gggccacgcc aaagactacg taaaaatgca gtggatgatg      720

ctgcagcagg aacagccgga agatttcgtt atcgcgaccg gcgttcagta ctccgtgcgt      780

cagttcgtgg aaatggcggc agcacagctg ggcatcaaac tgcgctttga aggcacgggc      840

gttgaagaga agggcattgt ggtttccgtc accgggcatg acgcgccggg cgttaaaccg      900

ggtgatgtga ttatcgctgt tgacccgcgt tacttccgtc cggctgaagt tgaaacgctg      960

ctcggcgacc cgaccaaagc gcacgaaaaa ctgggctgga aaccggaaat caccctcaga     1020

gagatggtgt ctgaaatggt ggctaatgac ctcgaagcgg cgaaaaaaca ctctctgctg     1080

aaatctcacg gctacgacgt ggcgatcgcg ctggagtcat aagcatgagt aaacaacgag     1140

tttttattgc tggtcatcgc gggatggtcg gttccgccat caggcggcag ctcgaacagc     1200

gcggtgatgt ggaactggta ttacgcaccc gcgacgagct gaacctgctg gacagccgcg     1260

ccgtgcatga tttctttgcc agcgaacgta ttgaccaggt ctatctggcg gcggcgaaag     1320

tgggcggcat tgttgccaac aacacctatc cggcggattt catctaccag aacatgatga     1380

ttgagagcaa catcattcac gccgcgcatc agaacgacgt gaacaaactg ctgtttctcg     1440

gatcgtcctg catctacccg aaactggcaa aacagccgat ggcagaaagc gagttgttgc     1500

agggcacgct ggagccgact aacgagcctt atgctattgc caaaatcgcc gggatcaaac     1560

tgtgcgaatc atacaaccgc cagtacggac gcgattaccg ctcagtcatg ccgaccaacc     1620

tgtacgggcc acacgacaac ttccacccga gtaattcgca tgtgatccca gcattgctgc     1680

gtcgcttcca cgaggcgacg gcacagaatg cgccggacgt ggtggtatgg ggcagcggta     1740

caccgatgcg cgaatttctg cacgtcgatg atatggcggc ggcgagcatt catgtcatgg     1800

agctggcgca tgaagtctgg ctggagaaca cccagccgat gttgtcgcac attaacgtcg     1860

gcacgggcgt tgactgcact atccgcgagc tggcgcaaac catcgccaaa gtggtgggtt     1920

acaaaggccg ggtggttttt gatgccagca aaccggatgg cacgccgcgc aaactgctgg     1980

atgtgacgcg cctgcatcag cttggctggt atcacgaaat ctcactggaa gcggggcttg     2040

ccagcactta ccagtggttc cttgagaatc aagaccgctt tcgggggtaa tgatgttttt     2100

acgtcaggaa gactttgcca cggtagtgcg ctccactccg cttgtctctc tcgactttat     2160

tgtcgagaac agtcgcggcg agtttctgct tggcaaaaga accaaccgcc cggcgcaggg     2220

ttactggttt gtgccgggag ggcgcgtgca gaaagacgaa acgctggaag ccgcatttga     2280

gcggctgacg atggcggaac tggggctgcg tttgccgata acagcaggcc agttttacgg     2340

tgtctggcag cacttttatg acgataactt ctctggcacg gatttcacca ctcactatgt     2400

ggtgctcggt tttcgcttca gagtatcgga agaagagctg ttactgccgg atgagcagca     2460

tgacgattac cgctggctga cgtcggacgc gctgctcgcc agtgataatg ttcatgctaa     2520

cagccgcgcc tattttctcg ctgagaagcg taccggagta cccggattat gaaaatactg     2580

gtctacggca ttaactactc gccggagtta accggcatcg gcaaatacac cggcgagatg     2640

gtggaatggc tggcggcaca aggtcatgag gtgcgggtca ttaccgcacc gccttactac     2700

ccgcaatggc aggtgggcga gaactattcc gcctggcgct acaaacgaga agagggggcc     2760

gccacggtgt ggcgctgccc gctgtatgtg ccaaaacagc cgagcaccct gaaacgcctg     2820

ttgcatctgg gcagttttgc cgtcagcagt ttctttccgc tgatggcgca acgtcgctgg     2880

aagccggatc gcattattgg cgtggtgcca acgctgtttt gcgcgccggg aatgcgcctg     2940

ctggcgaaac tctctggtgc gcgtaccgtg ctgcatattc aggattacga agtggacgcc     3000

atgctggggc tgggccttgc cggaaaaggc aaaggcggca aagtggcaca gctggcaacg     3060

gcgttcgaac gtagcggact gcataacgtc gataacgtct ccacgatttc gcgttcgatg     3120

atgaataaag ccatcgaaaa aggcgtggcg gcggaaaacg tcatcttctt ccccaactgg     3180

tcggaaattg cccgttttca gcatgttgca gatgccgatg ttgatgccct tcgtaaccag     3240

cttgacctgc cggataacaa aaaaatcatt ctttactccg gcaatattgg tgaaaagcag     3300

gggctggaaa acgttattga agctgccgat cgtctgcgcg atgaaccgct gatttttgcc     3360

attgtcgggc agggcggcgg caaagcgcgg ctggaaaaaa tggcgcagca gcgtggactg     3420

cgcaacatgc aatttttccc gctgcaatcg tatgacgctt tacccgcact gctgaagatg     3480

ggcgattgcc atctggtggt gcaaaaacgc ggcgcggcag atgccgtatt gccgtcgaaa     3540

ctgaccaata ttctggcagt aggcggtaac gcggtgatta ctgctgaagc ctacacagaa     3600

ctggggcagc tttgcgaaac ctttccgggc attgcggttt gcgttgaacc ggaatcggtc     3660

gaggcgctgg tggcggggat ccgtcaggcg ctcctgctgc ccaaacacaa cacggtggca     3720

cgtgaatatg ccgaacgcac gctcgataaa gagaacgtgt tacgtcaatt tataaatgat     3780

attcggggat aattatggcg cagtcgaaac tctatccagt tgtgatggca ggtggctccg     3840

gtagccgctt atggccgctt tcccgcgtac tttatcccaa gcagttttta tgcctgaaag     3900

gcgatctcac catgctgcaa accaccatct gccgcctgaa cggcgtggag tgcgaaagcc     3960

cggtggtgat ttgcaatgag cagcaccgct ttattgtcgc ggaacagctg cgtcaactga     4020

acaaacttac cgagaacatt attctcgaac cggcagggcg aaacacggca cctgccattg     4080

cgctggcggc gctggcggca aaacgtcata gcccggagag cgacccgtta atgctggtat     4140

tggcggcgga tcatgtgatt gccgatgaag acgcgttccg tgccgccgtg cgtaatgcca     4200

tgccatatgc cgaagcgggc aagctggtga ccttcggcat tgtgccggat ctaccagaaa     4260

ccggttatgg ctatattcgt cgcggtgaag tgtctgcggg tgagcaggat atggtggcct     4320

ttgaagtggc gcagtttgtc gaaaaaccga atctggaaac cgctcaggcc tatgtggcaa     4380

gcggcgaata ttactggaac agcggtatgt tcctgttccg cgccggacgc tatctcgaag     4440

aactgaaaaa atatcgcccg gatatcctcg atgcctgtga aaaagcgatg agcgccgtcg     4500

atccggatct caattttatt cgcgtggatg aagaagcgtt tctcgcctgc ccggaagagt     4560

cggtggatta cgcggtcatg gaacgtacgg cagatgctgt tgtggtgccg atggatgcgg     4620

gctggagcga tgttggctcc tggtcttcat tatgggagat cagcgcccac accgccgagg     4680

gcaacgtttg ccacggcgat gtgattaatc acaaaactga aaacagctat gtgtatgctg     4740

aatctggcct ggtcaccacc gtcggggtga aagatctggt agtggtgcag accaaagatg     4800

cggtgctgat tgccgaccgt aacgcggtac aggatgtgaa aaaagtggtc gagcagatca     4860

aagccgatgg tcgccatgag catcgggtgc atcgcgaagt gtatcgtccg tggggcaaat     4920

atgactctat cgacgcgggc gaccgctacc aggtgaaacg catcaccgtg aaaccgggcg     4980

agggcttgtc ggtacagatg caccatcacc gcgcggaaca ctgggtggtt gtcgcgggaa     5040

cggcaaaagt caccattgat ggtgatatca aactgcttgg tgaaaacgag tccatttata     5100

ttccgctggg ggcgacgcat tgcctggaaa acccggggaa aattccgctc gatttaattg     5160

aagtgcgctc cggctcttat ctcgaagagg atgatgtggt gcgtttcgcg gatcgctacg     5220

gacgggtgta aacgtcgcat caggcaatga atgcgaaacc gcggtgtaaa taacgacaaa     5280

aataaaattg gccgcttcgg tcagggccaa ctattgcctg aaaaagggta acgatatgaa     5340

aaaattaacc tgctttaaag cctatgatat tcgcgggaaa ttaggcgaag aactgaatga     5400

agatatcgcc tggcgcattg gtcgcgccta tggcgaattt ctcaaaccga aaaccattgt     5460

gttaggcggt gatgtccgcc tcaccagcga aaccttaaaa ctggcgctgg cgaaaggttt     5520

acaggatgcg ggcgttgacg tgctggatat tggtatgtcc ggcaccgaag agatctattt     5580

cgccacgttc catctcggcg tggatggcgg cattgaagtt accgccagcc ataatccgat     5640

ggattataac ggcatgaagc tggttcgcga gggggctcgc ccgatcagcg gagataccgg     5700

actgcgcgac gtccagcgtc tggctgaagc caacgacttt cctcccgtcg atgaaaccaa     5760

acgcggtcgc tatcagcaaa tcaacctgcg tgacgcttac gttgatcacc tgttcggtta     5820

tatcaatgtc aaaaacctca cgccgctcaa gctggtgatc aactccggga acggcgcagc     5880

gggtccggtg gtggacgcca ttgaagcccg ctttaaagcc ctcggcgcgc ccgtggaatt     5940

aatcaaagtg cacaacacgc cggacggcaa tttccccaac ggtattccta acccactact     6000

gccggaatgc cgcgacgaca cccgcaatgc ggtcatcaaa cacggcgcgg atatgggcat     6060

tgcttttgat ggcgattttg accgctgttt cctgtttgac gaaaaagggc agtttattga     6120

gggctactac attgtcggcc tgttggcaga agcattcctc gaaaaaaatc ccggcgcgaa     6180

gatcatccac gatccacgtc tctcctggaa caccgttgat gtggtgactg ccgcaggtgg     6240

cacgccggta atgtcgaaaa ccggacacgc ctttattaaa gaacgtatgc gcaaggaaga     6300

cgccatctat ggtggcgaaa tgagcgccca ccattacttc cgtgatttcg cttactgcga     6360

cagcggcatg atcccgtggc tgctggtcgc cgaactggtg tgcctgaaag ataaaacgct     6420

gggcgaactg gtacgcgacc ggatggcggc gtttccggca agcggtgaga tcaacagcaa     6480

actggcgcaa cccgttgagg cgattaaccg cgtggaacag cattttagcc gtgaggcgct     6540

ggcggtggat cgcaccgatg gcatcagcat gacctttgcc gactggcgct ttaacctgcg     6600

cacctccaat accgaaccgg tggtgcgcct gaatgtggaa tcgcgcggtg atgtgccgct     6660

gatggaagcg cgaacgcgaa ctctgctgac gttgctgaac gagtaa





