                         SEQUENCE LISTING

<110>  Second Genome, Inc.
       Han, Andrew Wonhee
       Goodyear, Andrew Whitman
       Gujral, Tarunmeet
       DeSantis, Todd Zachary
       Dabbagh, Karim
 
<120>  PROTEINS FOR THE TREATMENT OF EPITHELIAL BARRIER FUNCTION 
       DISORDERS

<130>  SEGE-008/01WO 321077-2036

<140>  PCT/US2018/033347
<141>  2018-05-18

<150>  US 62/508,501
<151>  2017-05-19

<160>  9     

<170>  PatentIn version 3.5

<210>  1
<211>  687
<212>  PRT
<213>  Eubacterium eligens

<400>  1

Met Ile Lys Leu Asn Lys Ile Lys Arg Asn Cys Val Ala Ala Val Ile 
1               5                   10                  15      


Leu Thr Met Cys Leu Met Thr Ala Gly Cys Ala Arg Asn Ser Thr Ser 
            20                  25                  30          


Thr Thr Thr Ala Ser Gly Gly Glu Thr Thr Ile Thr Ser Ala Ile Thr 
        35                  40                  45              


Lys Glu Asp Thr Asp Val Thr His Ala Asp Asp Ala Glu Asn Tyr Arg 
    50                  55                  60                  


Val Ser Ile Thr Gly Asp Phe Thr Val Thr Ser Asp Thr Ser Asp Gly 
65                  70                  75                  80  


Val Thr Gln Ser Gly Ser Val Tyr Thr Ile Thr Lys Ala Gly Glu Tyr 
                85                  90                  95      


Thr Val Thr Gly Leu Leu Ser Glu Gly Gln Leu Ile Val Asp Ala Gly 
            100                 105                 110         


Asn Glu Asn Glu Val Thr Ile Val Leu Asn Gly Thr Ser Ile Thr Cys 
        115                 120                 125             


Ser Ser Gly Ser Pro Ile Tyr Val Lys Asn Ala Ser Glu Val Lys Ile 
    130                 135                 140                 


Lys Ser Glu Glu Asn Ser Phe Asn Glu Val Ile Asp Asn Arg Asn Glu 
145                 150                 155                 160 


Ala Thr Glu Asp Ser Ser Asp Asp Ala Gly Asn Ala Ala Ile Tyr Ala 
                165                 170                 175     


Thr Cys Asp Leu Lys Leu Val Gly Lys Gly Ala Leu Val Val Thr Gly 
            180                 185                 190         


Asn Tyr Asn Asn Gly Ile Gln Ser Lys Asp Asp Leu Ser Ile Lys Asn 
        195                 200                 205             


Val Ile Val Lys Val Thr Ala Val Asn Asn Ala Val Lys Val Asn Asp 
    210                 215                 220                 


Ala Val Asp Ile Glu Ser Gly Asn Ile Ile Ala Ile Ser Ala Lys Gly 
225                 230                 235                 240 


Asp Gly Ile Lys Thr Ser Asn Ser Ser Ile Ser Asn Lys Gly Asn Gln 
                245                 250                 255     


Lys Gly Ile Val Thr Ile Thr Ser Gly Asn Ile Asp Ile Tyr Ala Ala 
            260                 265                 270         


Cys Asp Gly Ile Asp Ala Ser Tyr Gly Ala Asp Ile Ser Gly Asp Gly 
        275                 280                 285             


Asn Leu Asn Ile Tyr Thr Asp Thr Tyr Ser Glu Tyr Ser Glu Glu Val 
    290                 295                 300                 


Thr Ser Ser Gly Ser Ser Ser Gly Thr Ser Ser Gly Arg Asp Ser Ser 
305                 310                 315                 320 


Ala Asn Lys Ser Ala Ser Ala Asn Thr Val Ser Tyr Val Ala Thr Ser 
                325                 330                 335     


Asp Thr Ile Ala Asn Ala Pro Ser Gly Phe Gly Gly Gly Asn Met Gly 
            340                 345                 350         


Ser Gly Asn Ala Pro Asp Met Ser Asn Gly Asn Ala Pro Asn Met Asn 
        355                 360                 365             


Gly Ser Ser Asp Arg Asn Lys Thr Gly Gly Asn Arg Pro Gly Met Pro 
    370                 375                 380                 


Gly Asp Phe Asn Glu Ser Gly Asn Ser Ser Gly Gln Ser Tyr Ser Thr 
385                 390                 395                 400 


Lys Gly Ile Lys Ala Glu Ser Glu Ile Asn Ile Ser Gly Phe Thr Ile 
                405                 410                 415     


Asn Ile Cys Ser Thr Asp Asp Gly Ile His Ala Asn Ser Asp Ser Gly 
            420                 425                 430         


Val Leu Glu Thr Gly Glu Asp Gly Lys Gly Thr Ile Val Ile Asn Gly 
        435                 440                 445             


Gly Ser Ile Thr Ile Ser Ser Gly Asp Asp Gly Met His Ala Asp Lys 
    450                 455                 460                 


Gln Leu Asp Val Asn Asp Gly Tyr Ile Asn Val Val Thr Ser Tyr Glu 
465                 470                 475                 480 


Gly Leu Glu Ala Met Thr Ile Asn Leu Asn Gly Gly Lys Ile Tyr Val 
                485                 490                 495     


Tyr Ala Thr Asp Asp Gly Ile Asn Ala Cys Thr Gly Asp Gly Lys Thr 
            500                 505                 510         


Ser Pro Ile Val Asn Val Thr Gly Gly Tyr Ile Asp Val Thr Thr Thr 
        515                 520                 525             


Ser Gly Asp Thr Asp Gly Ile Asp Ser Asn Gly Asn Tyr Val Gln Thr 
    530                 535                 540                 


Gly Gly Phe Val Leu Val Lys Gly Gly Ser Ser Ser Gly Asn Val Ser 
545                 550                 555                 560 


Gly Ser Ile Asp Val Asp Gly Thr Val Thr Ile Thr Gly Gly Thr Cys 
                565                 570                 575     


Val Ala Leu Gly Gly Val Cys Glu Thr Pro Val Asn Ser Ala Asn Ala 
            580                 585                 590         


Tyr Val Leu Gly Ser Val Ser Phe Ser Ser Gly Ser Tyr Ser Leu Lys 
        595                 600                 605             


Asp Ser Ser Gly Asn Glu Val Ile Ser Phe Thr Val Asp Gly Ser Phe 
    610                 615                 620                 


Ser Asn Gly Trp Ile Cys Ser Asp Thr Leu Thr Thr Gly Ser Ser Tyr 
625                 630                 635                 640 


Thr Leu Tyr Arg Gly Ala Asp Ser Ile Ala Asp Trp Thr Gln Glu Ser 
                645                 650                 655     


Gly Thr Met Gly Ala Ser Gly Thr Gly Gly Phe Gly Gly Gly Asn Met 
            660                 665                 670         


Gly Gly Met Gly Gly Gln Asn Gly Gly Phe Gly Gly Gly Arg Arg 
        675                 680                 685           


<210>  2
<211>  2061
<212>  DNA
<213>  Eubacterium eligens

<400>  2
atgattaaat taaataaaat caaaaggaat tgcgtagccg cagttattct tactatgtgt       60

cttatgactg ctggctgtgc cagaaattca acttcaacta ctactgcatc aggcggtgag      120

actaccatca cttcagccat tactaaagaa gacacagatg taacacacgc agatgatgct      180

gagaattaca gagtctccat tacaggtgat ttcactgtga catctgatac atcagatgga      240

gttacacagt caggttctgt atacacaatc acaaaggctg gtgaatatac agtaacagga      300

cttttatcag aaggacagct tatcgttgac gcaggtaatg agaacgaggt tactatcgta      360

ttgaacggaa catctatcac atgctcaagt ggttcaccta tatacgttaa aaatgcttca      420

gaagtcaaga ttaaatcaga agagaactca tttaacgaag taattgacaa tcgtaacgaa      480

gctacagaag attcttctga tgacgctggc aacgcagcaa tctatgcaac atgcgattta      540

aaactcgtcg gcaaaggagc cttagttgta acaggtaatt acaataatgg tatccagagc      600

aaggatgacc tttctattaa aaatgtgatt gttaaagtta ctgctgtgaa caacgcagtc      660

aaagtcaacg atgccgttga tattgaatct ggaaatataa ttgcaatctc cgctaaaggc      720

gatggcatca agacttctaa cagcagtatt tctaacaagg gcaaccagaa gggaatcgtt      780

acaatcacta gtggtaacat tgatatttat gcagcctgtg acggtataga cgcatcttac      840

ggggcagata tatcaggtga cggtaactta aacatttata cagatactta ctctgaatat      900

agtgaagaag ttacttcgtc aggcagttct tcaggcacgt cttctggccg ggacagctct      960

gctaacaagt ctgcttctgc caatactgtt tcttatgtgg caacttctga cactattgcc     1020

aacgcaccta gcggctttgg tggtggcaac atgggtagcg gcaatgctcc agatatgagt     1080

aacggcaacg ctcccaatat gaacggcagc tctgatagaa ataagaccgg tggcaaccgt     1140

ccaggaatgc ctggtgactt taatgaatcc ggtaattctt ctggacagtc ctactcaact     1200

aagggtatta aagctgaaag cgaaataaat atttcaggct ttacaattaa catatgttca     1260

acagatgatg gtatccatgc caactctgac tcaggtgtac ttgaaaccgg tgaggacggc     1320

aaaggaacta ttgttatcaa cggcggttca attacaattt cttctggcga tgacggcatg     1380

cacgctgaca aacagcttga tgtcaatgac ggttacatta atgtagtaac ttcatatgaa     1440

ggacttgagg ctatgactat caacttaaat ggcggcaaga tatatgtata cgctactgat     1500

gatggcatta atgcctgcac aggtgatgga aagacttctc caattgtcaa tgtaactggt     1560

ggatatatag atgtcacaac tacgtctggt gatactgatg gtattgattc taatggaaat     1620

tacgtacaga ccggtggatt tgtattagtt aaaggtggca gttcatctgg aaatgtatca     1680

ggatcaattg atgtagatgg taccgtaacg ataaccggtg gaacatgcgt tgccctcggt     1740

ggtgtatgcg aaacacctgt aaactctgct aatgcttatg tattaggttc cgtatcattc     1800

agttctggaa gctattcact taaagattct tctggcaacg aagttataag cttcactgtt     1860

gacggttcat ttagcaacgg ctggatatgt tctgacactc ttacaaccgg ctcaagctac     1920

acactctacc ggggggcaga ctctattgca gactggactc aggaatctgg aacaatggga     1980

gcttctggca ctggcggctt tggcggcggt aacatgggcg gcatgggcgg tcagaatggt     2040

ggcttcggcg gtggcagacg a                                               2061


<210>  3
<211>  631
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SG-14

<400>  3

Gly Cys Ala Arg Asn Ser Thr Ser Thr Thr Thr Ala Ser Gly Gly Glu 
1               5                   10                  15      


Thr Thr Ile Thr Ser Ala Ile Thr Lys Glu Asp Thr Asp Val Thr His 
            20                  25                  30          


Ala Asp Asp Ala Glu Asn Tyr Arg Val Ser Ile Thr Gly Asp Phe Thr 
        35                  40                  45              


Val Thr Ser Asp Thr Ser Asp Gly Val Thr Gln Ser Gly Ser Val Tyr 
    50                  55                  60                  


Thr Ile Thr Lys Ala Gly Glu Tyr Thr Val Thr Gly Leu Leu Ser Glu 
65                  70                  75                  80  


Gly Gln Leu Ile Val Asp Ala Gly Asn Glu Asn Glu Val Thr Ile Val 
                85                  90                  95      


Leu Asn Gly Thr Ser Ile Thr Cys Ser Ser Gly Ser Pro Ile Tyr Val 
            100                 105                 110         


Lys Asn Ala Ser Glu Val Lys Ile Lys Ser Glu Glu Asn Ser Phe Asn 
        115                 120                 125             


Glu Val Ile Asp Asn Arg Asn Glu Ala Thr Glu Asp Ser Ser Asp Asp 
    130                 135                 140                 


Ala Gly Asn Ala Ala Ile Tyr Ala Thr Cys Asp Leu Lys Leu Val Gly 
145                 150                 155                 160 


Lys Gly Ala Leu Val Val Thr Gly Asn Tyr Asn Asn Gly Ile Gln Ser 
                165                 170                 175     


Lys Asp Asp Leu Ser Ile Lys Asn Val Ile Val Lys Val Thr Ala Val 
            180                 185                 190         


Asn Asn Ala Val Lys Val Asn Asp Ala Val Asp Ile Glu Ser Gly Asn 
        195                 200                 205             


Ile Ile Ala Ile Ser Ala Lys Gly Asp Gly Ile Lys Thr Ser Asn Ser 
    210                 215                 220                 


Ser Ile Ser Asn Lys Gly Asn Gln Lys Gly Ile Val Thr Ile Thr Ser 
225                 230                 235                 240 


Gly Asn Ile Asp Ile Tyr Ala Ala Cys Asp Gly Ile Asp Ala Ser Tyr 
                245                 250                 255     


Gly Ala Asp Ile Ser Gly Asp Gly Asn Leu Asn Ile Tyr Thr Asp Thr 
            260                 265                 270         


Tyr Ser Glu Tyr Ser Glu Glu Val Thr Ser Ser Gly Ser Ser Ser Gly 
        275                 280                 285             


Thr Ser Ser Gly Arg Asp Ser Ser Ala Asn Lys Ser Ala Ser Ala Asn 
    290                 295                 300                 


Thr Val Ser Tyr Val Ala Thr Ser Asp Thr Ile Ala Asn Ala Pro Ser 
305                 310                 315                 320 


Gly Phe Gly Gly Gly Asn Met Gly Ser Gly Asn Ala Pro Asp Met Ser 
                325                 330                 335     


Asn Gly Asn Ala Pro Asn Met Asn Gly Ser Ser Asp Arg Asn Lys Thr 
            340                 345                 350         


Gly Gly Asn Arg Pro Gly Met Pro Gly Asp Phe Asn Glu Ser Gly Asn 
        355                 360                 365             


Ser Ser Gly Gln Ser Tyr Ser Thr Lys Gly Ile Lys Ala Glu Ser Glu 
    370                 375                 380                 


Ile Asn Ile Ser Gly Phe Thr Ile Asn Ile Cys Ser Thr Asp Asp Gly 
385                 390                 395                 400 


Ile His Ala Asn Ser Asp Ser Gly Val Leu Glu Thr Gly Glu Asp Gly 
                405                 410                 415     


Lys Gly Thr Ile Val Ile Asn Gly Gly Ser Ile Thr Ile Ser Ser Gly 
            420                 425                 430         


Asp Asp Gly Met His Ala Asp Lys Gln Leu Asp Val Asn Asp Gly Tyr 
        435                 440                 445             


Ile Asn Val Val Thr Ser Tyr Glu Gly Leu Glu Ala Met Thr Ile Asn 
    450                 455                 460                 


Leu Asn Gly Gly Lys Ile Tyr Val Tyr Ala Thr Asp Asp Gly Ile Asn 
465                 470                 475                 480 


Ala Cys Thr Gly Asp Gly Lys Thr Ser Pro Ile Val Asn Val Thr Gly 
                485                 490                 495     


Gly Tyr Ile Asp Val Thr Thr Thr Ser Gly Asp Thr Asp Gly Ile Asp 
            500                 505                 510         


Ser Asn Gly Asn Tyr Val Gln Thr Gly Gly Phe Val Leu Val Lys Gly 
        515                 520                 525             


Gly Ser Ser Ser Gly Asn Val Ser Gly Ser Ile Asp Val Asp Gly Thr 
    530                 535                 540                 


Val Thr Ile Thr Gly Gly Thr Cys Val Ala Leu Gly Gly Val Cys Glu 
545                 550                 555                 560 


Thr Pro Val Asn Ser Ala Asn Ala Tyr Val Leu Gly Ser Val Ser Phe 
                565                 570                 575     


Ser Ser Gly Ser Tyr Ser Leu Lys Asp Ser Ser Gly Asn Glu Val Ile 
            580                 585                 590         


Ser Phe Thr Val Asp Gly Ser Phe Ser Asn Gly Trp Ile Cys Ser Asp 
        595                 600                 605             


Thr Leu Thr Thr Gly Ser Ser Tyr Thr Leu Tyr Arg Gly Ala Asp Ser 
    610                 615                 620                 


Ile Ala Asp Trp Thr Gln Glu 
625                 630     


<210>  4
<211>  1893
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SG-14

<400>  4
ggctgtgcca gaaattcaac ttcaactact actgcatcag gcggtgagac taccatcact       60

tcagccatta ctaaagaaga cacagatgta acacacgcag atgatgctga gaattacaga      120

gtctccatta caggtgattt cactgtgaca tctgatacat cagatggagt tacacagtca      180

ggttctgtat acacaatcac aaaggctggt gaatatacag taacaggact tttatcagaa      240

ggacagctta tcgttgacgc aggtaatgag aacgaggtta ctatcgtatt gaacggaaca      300

tctatcacat gctcaagtgg ttcacctata tacgttaaaa atgcttcaga agtcaagatt      360

aaatcagaag agaactcatt taacgaagta attgacaatc gtaacgaagc tacagaagat      420

tcttctgatg acgctggcaa cgcagcaatc tatgcaacat gcgatttaaa actcgtcggc      480

aaaggagcct tagttgtaac aggtaattac aataatggta tccagagcaa ggatgacctt      540

tctattaaaa atgtgattgt taaagttact gctgtgaaca acgcagtcaa agtcaacgat      600

gccgttgata ttgaatctgg aaatataatt gcaatctccg ctaaaggcga tggcatcaag      660

acttctaaca gcagtatttc taacaagggc aaccagaagg gaatcgttac aatcactagt      720

ggtaacattg atatttatgc agcctgtgac ggtatagacg catcttacgg ggcagatata      780

tcaggtgacg gtaacttaaa catttataca gatacttact ctgaatatag tgaagaagtt      840

acttcgtcag gcagttcttc aggcacgtct tctggccggg acagctctgc taacaagtct      900

gcttctgcca atactgtttc ttatgtggca acttctgaca ctattgccaa cgcacctagc      960

ggctttggtg gtggcaacat gggtagcggc aatgctccag atatgagtaa cggcaacgct     1020

cccaatatga acggcagctc tgatagaaat aagaccggtg gcaaccgtcc aggaatgcct     1080

ggtgacttta atgaatccgg taattcttct ggacagtcct actcaactaa gggtattaaa     1140

gctgaaagcg aaataaatat ttcaggcttt acaattaaca tatgttcaac agatgatggt     1200

atccatgcca actctgactc aggtgtactt gaaaccggtg aggacggcaa aggaactatt     1260

gttatcaacg gcggttcaat tacaatttct tctggcgatg acggcatgca cgctgacaaa     1320

cagcttgatg tcaatgacgg ttacattaat gtagtaactt catatgaagg acttgaggct     1380

atgactatca acttaaatgg cggcaagata tatgtatacg ctactgatga tggcattaat     1440

gcctgcacag gtgatggaaa gacttctcca attgtcaatg taactggtgg atatatagat     1500

gtcacaacta cgtctggtga tactgatggt attgattcta atggaaatta cgtacagacc     1560

ggtggatttg tattagttaa aggtggcagt tcatctggaa atgtatcagg atcaattgat     1620

gtagatggta ccgtaacgat aaccggtgga acatgcgttg ccctcggtgg tgtatgcgaa     1680

acacctgtaa actctgctaa tgcttatgta ttaggttccg tatcattcag ttctggaagc     1740

tattcactta aagattcttc tggcaacgaa gttataagct tcactgttga cggttcattt     1800

agcaacggct ggatatgttc tgacactctt acaaccggct caagctacac actctaccgg     1860

ggggcagact ctattgcaga ctggactcag gaa                                  1893


<210>  5
<211>  632
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SG-14 with start methionine

<400>  5

Met Gly Cys Ala Arg Asn Ser Thr Ser Thr Thr Thr Ala Ser Gly Gly 
1               5                   10                  15      


Glu Thr Thr Ile Thr Ser Ala Ile Thr Lys Glu Asp Thr Asp Val Thr 
            20                  25                  30          


His Ala Asp Asp Ala Glu Asn Tyr Arg Val Ser Ile Thr Gly Asp Phe 
        35                  40                  45              


Thr Val Thr Ser Asp Thr Ser Asp Gly Val Thr Gln Ser Gly Ser Val 
    50                  55                  60                  


Tyr Thr Ile Thr Lys Ala Gly Glu Tyr Thr Val Thr Gly Leu Leu Ser 
65                  70                  75                  80  


Glu Gly Gln Leu Ile Val Asp Ala Gly Asn Glu Asn Glu Val Thr Ile 
                85                  90                  95      


Val Leu Asn Gly Thr Ser Ile Thr Cys Ser Ser Gly Ser Pro Ile Tyr 
            100                 105                 110         


Val Lys Asn Ala Ser Glu Val Lys Ile Lys Ser Glu Glu Asn Ser Phe 
        115                 120                 125             


Asn Glu Val Ile Asp Asn Arg Asn Glu Ala Thr Glu Asp Ser Ser Asp 
    130                 135                 140                 


Asp Ala Gly Asn Ala Ala Ile Tyr Ala Thr Cys Asp Leu Lys Leu Val 
145                 150                 155                 160 


Gly Lys Gly Ala Leu Val Val Thr Gly Asn Tyr Asn Asn Gly Ile Gln 
                165                 170                 175     


Ser Lys Asp Asp Leu Ser Ile Lys Asn Val Ile Val Lys Val Thr Ala 
            180                 185                 190         


Val Asn Asn Ala Val Lys Val Asn Asp Ala Val Asp Ile Glu Ser Gly 
        195                 200                 205             


Asn Ile Ile Ala Ile Ser Ala Lys Gly Asp Gly Ile Lys Thr Ser Asn 
    210                 215                 220                 


Ser Ser Ile Ser Asn Lys Gly Asn Gln Lys Gly Ile Val Thr Ile Thr 
225                 230                 235                 240 


Ser Gly Asn Ile Asp Ile Tyr Ala Ala Cys Asp Gly Ile Asp Ala Ser 
                245                 250                 255     


Tyr Gly Ala Asp Ile Ser Gly Asp Gly Asn Leu Asn Ile Tyr Thr Asp 
            260                 265                 270         


Thr Tyr Ser Glu Tyr Ser Glu Glu Val Thr Ser Ser Gly Ser Ser Ser 
        275                 280                 285             


Gly Thr Ser Ser Gly Arg Asp Ser Ser Ala Asn Lys Ser Ala Ser Ala 
    290                 295                 300                 


Asn Thr Val Ser Tyr Val Ala Thr Ser Asp Thr Ile Ala Asn Ala Pro 
305                 310                 315                 320 


Ser Gly Phe Gly Gly Gly Asn Met Gly Ser Gly Asn Ala Pro Asp Met 
                325                 330                 335     


Ser Asn Gly Asn Ala Pro Asn Met Asn Gly Ser Ser Asp Arg Asn Lys 
            340                 345                 350         


Thr Gly Gly Asn Arg Pro Gly Met Pro Gly Asp Phe Asn Glu Ser Gly 
        355                 360                 365             


Asn Ser Ser Gly Gln Ser Tyr Ser Thr Lys Gly Ile Lys Ala Glu Ser 
    370                 375                 380                 


Glu Ile Asn Ile Ser Gly Phe Thr Ile Asn Ile Cys Ser Thr Asp Asp 
385                 390                 395                 400 


Gly Ile His Ala Asn Ser Asp Ser Gly Val Leu Glu Thr Gly Glu Asp 
                405                 410                 415     


Gly Lys Gly Thr Ile Val Ile Asn Gly Gly Ser Ile Thr Ile Ser Ser 
            420                 425                 430         


Gly Asp Asp Gly Met His Ala Asp Lys Gln Leu Asp Val Asn Asp Gly 
        435                 440                 445             


Tyr Ile Asn Val Val Thr Ser Tyr Glu Gly Leu Glu Ala Met Thr Ile 
    450                 455                 460                 


Asn Leu Asn Gly Gly Lys Ile Tyr Val Tyr Ala Thr Asp Asp Gly Ile 
465                 470                 475                 480 


Asn Ala Cys Thr Gly Asp Gly Lys Thr Ser Pro Ile Val Asn Val Thr 
                485                 490                 495     


Gly Gly Tyr Ile Asp Val Thr Thr Thr Ser Gly Asp Thr Asp Gly Ile 
            500                 505                 510         


Asp Ser Asn Gly Asn Tyr Val Gln Thr Gly Gly Phe Val Leu Val Lys 
        515                 520                 525             


Gly Gly Ser Ser Ser Gly Asn Val Ser Gly Ser Ile Asp Val Asp Gly 
    530                 535                 540                 


Thr Val Thr Ile Thr Gly Gly Thr Cys Val Ala Leu Gly Gly Val Cys 
545                 550                 555                 560 


Glu Thr Pro Val Asn Ser Ala Asn Ala Tyr Val Leu Gly Ser Val Ser 
                565                 570                 575     


Phe Ser Ser Gly Ser Tyr Ser Leu Lys Asp Ser Ser Gly Asn Glu Val 
            580                 585                 590         


Ile Ser Phe Thr Val Asp Gly Ser Phe Ser Asn Gly Trp Ile Cys Ser 
        595                 600                 605             


Asp Thr Leu Thr Thr Gly Ser Ser Tyr Thr Leu Tyr Arg Gly Ala Asp 
    610                 615                 620                 


Ser Ile Ala Asp Trp Thr Gln Glu 
625                 630         


<210>  6
<211>  1896
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SG-14 with start methionine

<400>  6
atgggctgtg ccagaaattc aacttcaact actactgcat caggcggtga gactaccatc       60

acttcagcca ttactaaaga agacacagat gtaacacacg cagatgatgc tgagaattac      120

agagtctcca ttacaggtga tttcactgtg acatctgata catcagatgg agttacacag      180

tcaggttctg tatacacaat cacaaaggct ggtgaatata cagtaacagg acttttatca      240

gaaggacagc ttatcgttga cgcaggtaat gagaacgagg ttactatcgt attgaacgga      300

acatctatca catgctcaag tggttcacct atatacgtta aaaatgcttc agaagtcaag      360

attaaatcag aagagaactc atttaacgaa gtaattgaca atcgtaacga agctacagaa      420

gattcttctg atgacgctgg caacgcagca atctatgcaa catgcgattt aaaactcgtc      480

ggcaaaggag ccttagttgt aacaggtaat tacaataatg gtatccagag caaggatgac      540

ctttctatta aaaatgtgat tgttaaagtt actgctgtga acaacgcagt caaagtcaac      600

gatgccgttg atattgaatc tggaaatata attgcaatct ccgctaaagg cgatggcatc      660

aagacttcta acagcagtat ttctaacaag ggcaaccaga agggaatcgt tacaatcact      720

agtggtaaca ttgatattta tgcagcctgt gacggtatag acgcatctta cggggcagat      780

atatcaggtg acggtaactt aaacatttat acagatactt actctgaata tagtgaagaa      840

gttacttcgt caggcagttc ttcaggcacg tcttctggcc gggacagctc tgctaacaag      900

tctgcttctg ccaatactgt ttcttatgtg gcaacttctg acactattgc caacgcacct      960

agcggctttg gtggtggcaa catgggtagc ggcaatgctc cagatatgag taacggcaac     1020

gctcccaata tgaacggcag ctctgataga aataagaccg gtggcaaccg tccaggaatg     1080

cctggtgact ttaatgaatc cggtaattct tctggacagt cctactcaac taagggtatt     1140

aaagctgaaa gcgaaataaa tatttcaggc tttacaatta acatatgttc aacagatgat     1200

ggtatccatg ccaactctga ctcaggtgta cttgaaaccg gtgaggacgg caaaggaact     1260

attgttatca acggcggttc aattacaatt tcttctggcg atgacggcat gcacgctgac     1320

aaacagcttg atgtcaatga cggttacatt aatgtagtaa cttcatatga aggacttgag     1380

gctatgacta tcaacttaaa tggcggcaag atatatgtat acgctactga tgatggcatt     1440

aatgcctgca caggtgatgg aaagacttct ccaattgtca atgtaactgg tggatatata     1500

gatgtcacaa ctacgtctgg tgatactgat ggtattgatt ctaatggaaa ttacgtacag     1560

accggtggat ttgtattagt taaaggtggc agttcatctg gaaatgtatc aggatcaatt     1620

gatgtagatg gtaccgtaac gataaccggt ggaacatgcg ttgccctcgg tggtgtatgc     1680

gaaacacctg taaactctgc taatgcttat gtattaggtt ccgtatcatt cagttctgga     1740

agctattcac ttaaagattc ttctggcaac gaagttataa gcttcactgt tgacggttca     1800

tttagcaacg gctggatatg ttctgacact cttacaaccg gctcaagcta cacactctac     1860

cggggggcag actctattgc agactggact caggaa                               1896


<210>  7
<211>  663
<212>  PRT
<213>  Eubacterium eligens

<400>  7

Gly Cys Ala Arg Asn Ser Thr Ser Thr Thr Thr Ala Ser Gly Gly Glu 
1               5                   10                  15      


Thr Thr Ile Thr Ser Ala Ile Thr Lys Glu Asp Thr Asp Val Thr His 
            20                  25                  30          


Ala Asp Asp Ala Glu Asn Tyr Arg Val Ser Ile Thr Gly Asp Phe Thr 
        35                  40                  45              


Val Thr Ser Asp Thr Ser Asp Gly Val Thr Gln Ser Gly Ser Val Tyr 
    50                  55                  60                  


Thr Ile Thr Lys Ala Gly Glu Tyr Thr Val Thr Gly Leu Leu Ser Glu 
65                  70                  75                  80  


Gly Gln Leu Ile Val Asp Ala Gly Asn Glu Asn Glu Val Thr Ile Val 
                85                  90                  95      


Leu Asn Gly Thr Ser Ile Thr Cys Ser Ser Gly Ser Pro Ile Tyr Val 
            100                 105                 110         


Lys Asn Ala Ser Glu Val Lys Ile Lys Ser Glu Glu Asn Ser Phe Asn 
        115                 120                 125             


Glu Val Ile Asp Asn Arg Asn Glu Ala Thr Glu Asp Ser Ser Asp Asp 
    130                 135                 140                 


Ala Gly Asn Ala Ala Ile Tyr Ala Thr Cys Asp Leu Lys Leu Val Gly 
145                 150                 155                 160 


Lys Gly Ala Leu Val Val Thr Gly Asn Tyr Asn Asn Gly Ile Gln Ser 
                165                 170                 175     


Lys Asp Asp Leu Ser Ile Lys Asn Val Ile Val Lys Val Thr Ala Val 
            180                 185                 190         


Asn Asn Ala Val Lys Val Asn Asp Ala Val Asp Ile Glu Ser Gly Asn 
        195                 200                 205             


Ile Ile Ala Ile Ser Ala Lys Gly Asp Gly Ile Lys Thr Ser Asn Ser 
    210                 215                 220                 


Ser Ile Ser Asn Lys Gly Asn Gln Lys Gly Ile Val Thr Ile Thr Ser 
225                 230                 235                 240 


Gly Asn Ile Asp Ile Tyr Ala Ala Cys Asp Gly Ile Asp Ala Ser Tyr 
                245                 250                 255     


Gly Ala Asp Ile Ser Gly Asp Gly Asn Leu Asn Ile Tyr Thr Asp Thr 
            260                 265                 270         


Tyr Ser Glu Tyr Ser Glu Glu Val Thr Ser Ser Gly Ser Ser Ser Gly 
        275                 280                 285             


Thr Ser Ser Gly Arg Asp Ser Ser Ala Asn Lys Ser Ala Ser Ala Asn 
    290                 295                 300                 


Thr Val Ser Tyr Val Ala Thr Ser Asp Thr Ile Ala Asn Ala Pro Ser 
305                 310                 315                 320 


Gly Phe Gly Gly Gly Asn Met Gly Ser Gly Asn Ala Pro Asp Met Ser 
                325                 330                 335     


Asn Gly Asn Ala Pro Asn Met Asn Gly Ser Ser Asp Arg Asn Lys Thr 
            340                 345                 350         


Gly Gly Asn Arg Pro Gly Met Pro Gly Asp Phe Asn Glu Ser Gly Asn 
        355                 360                 365             


Ser Ser Gly Gln Ser Tyr Ser Thr Lys Gly Ile Lys Ala Glu Ser Glu 
    370                 375                 380                 


Ile Asn Ile Ser Gly Phe Thr Ile Asn Ile Cys Ser Thr Asp Asp Gly 
385                 390                 395                 400 


Ile His Ala Asn Ser Asp Ser Gly Val Leu Glu Thr Gly Glu Asp Gly 
                405                 410                 415     


Lys Gly Thr Ile Val Ile Asn Gly Gly Ser Ile Thr Ile Ser Ser Gly 
            420                 425                 430         


Asp Asp Gly Met His Ala Asp Lys Gln Leu Asp Val Asn Asp Gly Tyr 
        435                 440                 445             


Ile Asn Val Val Thr Ser Tyr Glu Gly Leu Glu Ala Met Thr Ile Asn 
    450                 455                 460                 


Leu Asn Gly Gly Lys Ile Tyr Val Tyr Ala Thr Asp Asp Gly Ile Asn 
465                 470                 475                 480 


Ala Cys Thr Gly Asp Gly Lys Thr Ser Pro Ile Val Asn Val Thr Gly 
                485                 490                 495     


Gly Tyr Ile Asp Val Thr Thr Thr Ser Gly Asp Thr Asp Gly Ile Asp 
            500                 505                 510         


Ser Asn Gly Asn Tyr Val Gln Thr Gly Gly Phe Val Leu Val Lys Gly 
        515                 520                 525             


Gly Ser Ser Ser Gly Asn Val Ser Gly Ser Ile Asp Val Asp Gly Thr 
    530                 535                 540                 


Val Thr Ile Thr Gly Gly Thr Cys Val Ala Leu Gly Gly Val Cys Glu 
545                 550                 555                 560 


Thr Pro Val Asn Ser Ala Asn Ala Tyr Val Leu Gly Ser Val Ser Phe 
                565                 570                 575     


Ser Ser Gly Ser Tyr Ser Leu Lys Asp Ser Ser Gly Asn Glu Val Ile 
            580                 585                 590         


Ser Phe Thr Val Asp Gly Ser Phe Ser Asn Gly Trp Ile Cys Ser Asp 
        595                 600                 605             


Thr Leu Thr Thr Gly Ser Ser Tyr Thr Leu Tyr Arg Gly Ala Asp Ser 
    610                 615                 620                 


Ile Ala Asp Trp Thr Gln Glu Ser Gly Thr Met Gly Ala Ser Gly Thr 
625                 630                 635                 640 


Gly Gly Phe Gly Gly Gly Asn Met Gly Gly Met Gly Gly Gln Asn Gly 
                645                 650                 655     


Gly Phe Gly Gly Gly Arg Arg 
            660             


<210>  8
<211>  1989
<212>  DNA
<213>  Eubacterium eligens

<400>  8
ggctgtgcca gaaattcaac ttcaactact actgcatcag gcggtgagac taccatcact       60

tcagccatta ctaaagaaga cacagatgta acacacgcag atgatgctga gaattacaga      120

gtctccatta caggtgattt cactgtgaca tctgatacat cagatggagt tacacagtca      180

ggttctgtat acacaatcac aaaggctggt gaatatacag taacaggact tttatcagaa      240

ggacagctta tcgttgacgc aggtaatgag aacgaggtta ctatcgtatt gaacggaaca      300

tctatcacat gctcaagtgg ttcacctata tacgttaaaa atgcttcaga agtcaagatt      360

aaatcagaag agaactcatt taacgaagta attgacaatc gtaacgaagc tacagaagat      420

tcttctgatg acgctggcaa cgcagcaatc tatgcaacat gcgatttaaa actcgtcggc      480

aaaggagcct tagttgtaac aggtaattac aataatggta tccagagcaa ggatgacctt      540

tctattaaaa atgtgattgt taaagttact gctgtgaaca acgcagtcaa agtcaacgat      600

gccgttgata ttgaatctgg aaatataatt gcaatctccg ctaaaggcga tggcatcaag      660

acttctaaca gcagtatttc taacaagggc aaccagaagg gaatcgttac aatcactagt      720

ggtaacattg atatttatgc agcctgtgac ggtatagacg catcttacgg ggcagatata      780

tcaggtgacg gtaacttaaa catttataca gatacttact ctgaatatag tgaagaagtt      840

acttcgtcag gcagttcttc aggcacgtct tctggccggg acagctctgc taacaagtct      900

gcttctgcca atactgtttc ttatgtggca acttctgaca ctattgccaa cgcacctagc      960

ggctttggtg gtggcaacat gggtagcggc aatgctccag atatgagtaa cggcaacgct     1020

cccaatatga acggcagctc tgatagaaat aagaccggtg gcaaccgtcc aggaatgcct     1080

ggtgacttta atgaatccgg taattcttct ggacagtcct actcaactaa gggtattaaa     1140

gctgaaagcg aaataaatat ttcaggcttt acaattaaca tatgttcaac agatgatggt     1200

atccatgcca actctgactc aggtgtactt gaaaccggtg aggacggcaa aggaactatt     1260

gttatcaacg gcggttcaat tacaatttct tctggcgatg acggcatgca cgctgacaaa     1320

cagcttgatg tcaatgacgg ttacattaat gtagtaactt catatgaagg acttgaggct     1380

atgactatca acttaaatgg cggcaagata tatgtatacg ctactgatga tggcattaat     1440

gcctgcacag gtgatggaaa gacttctcca attgtcaatg taactggtgg atatatagat     1500

gtcacaacta cgtctggtga tactgatggt attgattcta atggaaatta cgtacagacc     1560

ggtggatttg tattagttaa aggtggcagt tcatctggaa atgtatcagg atcaattgat     1620

gtagatggta ccgtaacgat aaccggtgga acatgcgttg ccctcggtgg tgtatgcgaa     1680

acacctgtaa actctgctaa tgcttatgta ttaggttccg tatcattcag ttctggaagc     1740

tattcactta aagattcttc tggcaacgaa gttataagct tcactgttga cggttcattt     1800

agcaacggct ggatatgttc tgacactctt acaaccggct caagctacac actctaccgg     1860

ggggcagact ctattgcaga ctggactcag gaatctggaa caatgggagc ttctggcact     1920

ggcggctttg gcggcggtaa catgggcggc atgggcggtc agaatggtgg cttcggcggt     1980

ggcagacga                                                             1989


<210>  9
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  FLAG tag

<400>  9

Asp Tyr Lys Asp Asp Asp Asp Lys 
1               5               


