                               SEQUENCE LISTING

<110> THE BRIGHAM AND WOMEN'S HOSPITAL, INC.
 
<120> DEFINED THERAPEUTIC MICROBIOTA AND METHODS OF USE THEREOF

<130> 043214-090870WOPT

<140> PCT/US2018/065023
<141> 2018-12-11

<150> 62/665,754
<151> 2018-05-02

<150> 62/597,116
<151> 2017-12-11

<160> 32    

<170> PatentIn version 3.5

<210> 1
<211> 2710
<212> PRT
<213> Clostridium difficile

<400> 1
Met Ser Leu Ile Ser Lys Glu Glu Leu Ile Lys Leu Ala Tyr Ser Ile 
1               5                   10                  15      


Arg Pro Arg Glu Asn Glu Tyr Lys Thr Ile Leu Thr Asn Leu Asp Glu 
            20                  25                  30          


Tyr Asn Lys Leu Thr Thr Asn Asn Asn Glu Asn Lys Tyr Leu Gln Leu 
        35                  40                  45              


Lys Lys Leu Asn Glu Ser Ile Asp Val Phe Met Asn Lys Tyr Lys Thr 
    50                  55                  60                  


Ser Ser Arg Asn Arg Ala Leu Ser Asn Leu Lys Lys Asp Ile Leu Lys 
65                  70                  75                  80  


Glu Val Ile Leu Ile Lys Asn Ser Asn Thr Ser Pro Val Glu Lys Asn 
                85                  90                  95      


Leu His Phe Val Trp Ile Gly Gly Glu Val Ser Asp Ile Ala Leu Glu 
            100                 105                 110         


Tyr Ile Lys Gln Trp Ala Asp Ile Asn Ala Glu Tyr Asn Ile Lys Leu 
        115                 120                 125             


Trp Tyr Asp Ser Glu Ala Phe Leu Val Asn Thr Leu Lys Lys Ala Ile 
    130                 135                 140                 


Val Glu Ser Ser Thr Thr Glu Ala Leu Gln Leu Leu Glu Glu Glu Ile 
145                 150                 155                 160 


Gln Asn Pro Gln Phe Asp Asn Met Lys Phe Tyr Lys Lys Arg Met Glu 
                165                 170                 175     


Phe Ile Tyr Asp Arg Gln Lys Arg Phe Ile Asn Tyr Tyr Lys Ser Gln 
            180                 185                 190         


Ile Asn Lys Pro Thr Val Pro Thr Ile Asp Asp Ile Ile Lys Ser His 
        195                 200                 205             


Leu Val Ser Glu Tyr Asn Arg Asp Glu Thr Val Leu Glu Ser Tyr Arg 
    210                 215                 220                 


Thr Asn Ser Leu Arg Lys Ile Asn Ser Asn His Gly Ile Asp Ile Arg 
225                 230                 235                 240 


Ala Asn Ser Leu Phe Thr Glu Gln Glu Leu Leu Asn Ile Tyr Ser Gln 
                245                 250                 255     


Glu Leu Leu Asn Arg Gly Asn Leu Ala Ala Ala Ser Asp Ile Val Arg 
            260                 265                 270         


Leu Leu Ala Leu Lys Asn Phe Gly Gly Val Tyr Leu Asp Val Asp Met 
        275                 280                 285             


Leu Pro Gly Ile His Ser Asp Leu Phe Lys Thr Ile Ser Arg Pro Ser 
    290                 295                 300                 


Ser Ile Gly Leu Asp Arg Trp Glu Met Ile Lys Leu Glu Ala Ile Met 
305                 310                 315                 320 


Lys Tyr Lys Lys Tyr Ile Asn Asn Tyr Thr Ser Glu Asn Phe Asp Lys 
                325                 330                 335     


Leu Asp Gln Gln Leu Lys Asp Asn Phe Lys Leu Ile Ile Glu Ser Lys 
            340                 345                 350         


Ser Glu Lys Ser Glu Ile Phe Ser Lys Leu Glu Asn Leu Asn Val Ser 
        355                 360                 365             


Asp Leu Glu Ile Lys Ile Ala Phe Ala Leu Gly Ser Val Ile Asn Gln 
    370                 375                 380                 


Ala Leu Ile Ser Lys Gln Gly Ser Tyr Leu Thr Asn Leu Val Ile Glu 
385                 390                 395                 400 


Gln Val Lys Asn Arg Tyr Gln Phe Leu Asn Gln His Leu Asn Pro Ala 
                405                 410                 415     


Ile Glu Ser Asp Asn Asn Phe Thr Asp Thr Thr Lys Ile Phe His Asp 
            420                 425                 430         


Ser Leu Phe Asn Ser Ala Thr Ala Glu Asn Ser Met Phe Leu Thr Lys 
        435                 440                 445             


Ile Ala Pro Tyr Leu Gln Val Gly Phe Met Pro Glu Ala Arg Ser Thr 
    450                 455                 460                 


Ile Ser Leu Ser Gly Pro Gly Ala Tyr Ala Ser Ala Tyr Tyr Asp Phe 
465                 470                 475                 480 


Ile Asn Leu Gln Glu Asn Thr Ile Glu Lys Thr Leu Lys Ala Ser Asp 
                485                 490                 495     


Leu Ile Glu Phe Lys Phe Pro Glu Asn Asn Leu Ser Gln Leu Thr Glu 
            500                 505                 510         


Gln Glu Ile Asn Ser Leu Trp Ser Phe Asp Gln Ala Ser Ala Lys Tyr 
        515                 520                 525             


Gln Phe Glu Lys Tyr Val Arg Asp Tyr Thr Gly Gly Ser Leu Ser Glu 
    530                 535                 540                 


Asp Asn Gly Val Asp Phe Asn Lys Asn Thr Ala Leu Asp Lys Asn Tyr 
545                 550                 555                 560 


Leu Leu Asn Asn Lys Ile Pro Ser Asn Asn Val Glu Glu Ala Gly Ser 
                565                 570                 575     


Lys Asn Tyr Val His Tyr Ile Ile Gln Leu Gln Gly Asp Asp Ile Ser 
            580                 585                 590         


Tyr Glu Ala Thr Cys Asn Leu Phe Ser Lys Asn Pro Lys Asn Ser Ile 
        595                 600                 605             


Ile Ile Gln Arg Asn Met Asn Glu Ser Ala Lys Ser Tyr Phe Leu Ser 
    610                 615                 620                 


Asp Asp Gly Glu Ser Ile Leu Glu Leu Asn Lys Tyr Arg Ile Pro Glu 
625                 630                 635                 640 


Arg Leu Lys Asn Lys Glu Lys Val Lys Val Thr Phe Ile Gly His Gly 
                645                 650                 655     


Lys Asp Glu Phe Asn Thr Ser Glu Phe Ala Arg Leu Ser Val Asp Ser 
            660                 665                 670         


Leu Ser Asn Glu Ile Ser Ser Phe Leu Asp Thr Ile Lys Leu Asp Ile 
        675                 680                 685             


Ser Pro Lys Asn Val Glu Val Asn Leu Leu Gly Cys Asn Met Phe Ser 
    690                 695                 700                 


Tyr Asp Phe Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Ser 
705                 710                 715                 720 


Ile Met Asp Lys Ile Thr Ser Thr Leu Pro Asp Val Asn Lys Asn Ser 
                725                 730                 735     


Ile Thr Ile Gly Ala Asn Gln Tyr Glu Val Arg Ile Asn Ser Glu Gly 
            740                 745                 750         


Arg Lys Glu Leu Leu Ala His Ser Gly Lys Trp Ile Asn Lys Glu Glu 
        755                 760                 765             


Ala Ile Met Ser Asp Leu Ser Ser Lys Glu Tyr Ile Phe Phe Asp Ser 
    770                 775                 780                 


Ile Asp Asn Lys Leu Lys Ala Lys Ser Lys Asn Ile Pro Gly Leu Ala 
785                 790                 795                 800 


Ser Ile Ser Glu Asp Ile Lys Thr Leu Leu Leu Asp Ala Ser Val Ser 
                805                 810                 815     


Pro Asp Thr Lys Phe Ile Leu Asn Asn Leu Lys Leu Asn Ile Glu Ser 
            820                 825                 830         


Ser Ile Gly Asp Tyr Ile Tyr Tyr Glu Lys Leu Glu Pro Val Lys Asn 
        835                 840                 845             


Ile Ile His Asn Ser Ile Asp Asp Leu Ile Asp Glu Phe Asn Leu Leu 
    850                 855                 860                 


Glu Asn Val Ser Asp Glu Leu Tyr Glu Leu Lys Lys Leu Asn Asn Leu 
865                 870                 875                 880 


Asp Glu Lys Tyr Leu Ile Ser Phe Glu Asp Ile Ser Lys Asn Asn Ser 
                885                 890                 895     


Thr Tyr Ser Val Arg Phe Ile Asn Lys Ser Asn Gly Glu Ser Val Tyr 
            900                 905                 910         


Val Glu Thr Glu Lys Glu Ile Phe Ser Lys Tyr Ser Glu His Ile Thr 
        915                 920                 925             


Lys Glu Ile Ser Thr Ile Lys Asn Ser Ile Ile Thr Asp Val Asn Gly 
    930                 935                 940                 


Asn Leu Leu Asp Asn Ile Gln Leu Asp His Thr Ser Gln Val Asn Thr 
945                 950                 955                 960 


Leu Asn Ala Ala Phe Phe Ile Gln Ser Leu Ile Asp Tyr Ser Ser Asn 
                965                 970                 975     


Lys Asp Val Leu Asn Asp Leu Ser Thr Ser Val Lys Val Gln Leu Tyr 
            980                 985                 990         


Ala Gln Leu Phe Ser Thr Gly Leu  Asn Thr Ile Tyr Asp  Ser Ile Gln 
        995                 1000                 1005             


Leu Val  Asn Leu Ile Ser Asn  Ala Val Asn Asp Thr  Ile Asn Val 
    1010                 1015                 1020             


Leu Pro  Thr Ile Thr Glu Gly  Ile Pro Ile Val Ser  Thr Ile Leu 
    1025                 1030                 1035             


Asp Gly  Ile Asn Leu Gly Ala  Ala Ile Lys Glu Leu  Leu Asp Glu 
    1040                 1045                 1050             


His Asp  Pro Leu Leu Lys Lys  Glu Leu Glu Ala Lys  Val Gly Val 
    1055                 1060                 1065             


Leu Ala  Ile Asn Met Ser Leu  Ser Ile Ala Ala Thr  Val Ala Ser 
    1070                 1075                 1080             


Ile Val  Gly Ile Gly Ala Glu  Val Thr Ile Phe Leu  Leu Pro Ile 
    1085                 1090                 1095             


Ala Gly  Ile Ser Ala Gly Ile  Pro Ser Leu Val Asn  Asn Glu Leu 
    1100                 1105                 1110             


Ile Leu  His Asp Lys Ala Thr  Ser Val Val Asn Tyr  Phe Asn His 
    1115                 1120                 1125             


Leu Ser  Glu Ser Lys Lys Tyr  Gly Pro Leu Lys Thr  Glu Asp Asp 
    1130                 1135                 1140             


Lys Ile  Leu Val Pro Ile Asp  Asp Leu Val Ile Ser  Glu Ile Asp 
    1145                 1150                 1155             


Phe Asn  Asn Asn Ser Ile Lys  Leu Gly Thr Cys Asn  Ile Leu Ala 
    1160                 1165                 1170             


Met Glu  Gly Gly Ser Gly His  Thr Val Thr Gly Asn  Ile Asp His 
    1175                 1180                 1185             


Phe Phe  Ser Ser Pro Ser Ile  Ser Ser His Ile Pro  Ser Leu Ser 
    1190                 1195                 1200             


Ile Tyr  Ser Ala Ile Gly Ile  Glu Thr Glu Asn Leu  Asp Phe Ser 
    1205                 1210                 1215             


Lys Lys  Ile Met Met Leu Pro  Asn Ala Pro Ser Arg  Val Phe Trp 
    1220                 1225                 1230             


Trp Glu  Thr Gly Ala Val Pro  Gly Leu Arg Ser Leu  Glu Asn Asp 
    1235                 1240                 1245             


Gly Thr  Arg Leu Leu Asp Ser  Ile Arg Asp Leu Tyr  Pro Gly Lys 
    1250                 1255                 1260             


Phe Tyr  Trp Arg Phe Tyr Ala  Phe Phe Asp Tyr Ala  Ile Thr Thr 
    1265                 1270                 1275             


Leu Lys  Pro Val Tyr Glu Asp  Thr Asn Ile Lys Ile  Lys Leu Asp 
    1280                 1285                 1290             


Lys Asp  Thr Arg Asn Phe Ile  Met Pro Thr Ile Thr  Thr Asn Glu 
    1295                 1300                 1305             


Ile Arg  Asn Lys Leu Ser Tyr  Ser Phe Asp Gly Ala  Gly Gly Thr 
    1310                 1315                 1320             


Tyr Ser  Leu Leu Leu Ser Ser  Tyr Pro Ile Ser Thr  Asn Ile Asn 
    1325                 1330                 1335             


Leu Ser  Lys Asp Asp Leu Trp  Ile Phe Asn Ile Asp  Asn Glu Val 
    1340                 1345                 1350             


Arg Glu  Ile Ser Ile Glu Asn  Gly Thr Ile Lys Lys  Gly Lys Leu 
    1355                 1360                 1365             


Ile Lys  Asp Val Leu Ser Lys  Ile Asp Ile Asn Lys  Asn Lys Leu 
    1370                 1375                 1380             


Ile Ile  Gly Asn Gln Thr Ile  Asp Phe Ser Gly Asp  Ile Asp Asn 
    1385                 1390                 1395             


Lys Asp  Arg Tyr Ile Phe Leu  Thr Cys Glu Leu Asp  Asp Lys Ile 
    1400                 1405                 1410             


Ser Leu  Ile Ile Glu Ile Asn  Leu Val Ala Lys Ser  Tyr Ser Leu 
    1415                 1420                 1425             


Leu Leu  Ser Gly Asp Lys Asn  Tyr Leu Ile Ser Asn  Leu Ser Asn 
    1430                 1435                 1440             


Ile Ile  Glu Lys Ile Asn Thr  Leu Gly Leu Asp Ser  Lys Asn Ile 
    1445                 1450                 1455             


Ala Tyr  Asn Tyr Thr Asp Glu  Ser Asn Asn Lys Tyr  Phe Gly Ala 
    1460                 1465                 1470             


Ile Ser  Lys Thr Ser Gln Lys  Ser Ile Ile His Tyr  Lys Lys Asp 
    1475                 1480                 1485             


Ser Lys  Asn Ile Leu Glu Phe  Tyr Asn Asp Ser Thr  Leu Glu Phe 
    1490                 1495                 1500             


Asn Ser  Lys Asp Phe Ile Ala  Glu Asp Ile Asn Val  Phe Met Lys 
    1505                 1510                 1515             


Asp Asp  Ile Asn Thr Ile Thr  Gly Lys Tyr Tyr Val  Asp Asn Asn 
    1520                 1525                 1530             


Thr Asp  Lys Ser Ile Asp Phe  Ser Ile Ser Leu Val  Ser Lys Asn 
    1535                 1540                 1545             


Gln Val  Lys Val Asn Gly Leu  Tyr Leu Asn Glu Ser  Val Tyr Ser 
    1550                 1555                 1560             


Ser Tyr  Leu Asp Phe Val Lys  Asn Ser Asp Gly His  His Asn Thr 
    1565                 1570                 1575             


Ser Asn  Phe Met Asn Leu Phe  Leu Asp Asn Ile Ser  Phe Trp Lys 
    1580                 1585                 1590             


Leu Phe  Gly Phe Glu Asn Ile  Asn Phe Val Ile Asp  Lys Tyr Phe 
    1595                 1600                 1605             


Thr Leu  Val Gly Lys Thr Asn  Leu Gly Tyr Val Glu  Phe Ile Cys 
    1610                 1615                 1620             


Asp Asn  Asn Lys Asn Ile Asp  Ile Tyr Phe Gly Glu  Trp Lys Thr 
    1625                 1630                 1635             


Ser Ser  Ser Lys Ser Thr Ile  Phe Ser Gly Asn Gly  Arg Asn Val 
    1640                 1645                 1650             


Val Val  Glu Pro Ile Tyr Asn  Pro Asp Thr Gly Glu  Asp Ile Ser 
    1655                 1660                 1665             


Thr Ser  Leu Asp Phe Ser Tyr  Glu Pro Leu Tyr Gly  Ile Asp Arg 
    1670                 1675                 1680             


Tyr Ile  Asn Lys Val Leu Ile  Ala Pro Asp Leu Tyr  Thr Ser Leu 
    1685                 1690                 1695             


Ile Asn  Ile Asn Thr Asn Tyr  Tyr Ser Asn Glu Tyr  Tyr Pro Glu 
    1700                 1705                 1710             


Ile Ile  Val Leu Asn Pro Asn  Thr Phe His Lys Lys  Val Asn Ile 
    1715                 1720                 1725             


Asn Leu  Asp Ser Ser Ser Phe  Glu Tyr Lys Trp Ser  Thr Glu Gly 
    1730                 1735                 1740             


Ser Asp  Phe Ile Leu Val Arg  Tyr Leu Glu Glu Ser  Asn Lys Lys 
    1745                 1750                 1755             


Ile Leu  Gln Lys Ile Arg Ile  Lys Gly Ile Leu Ser  Asn Thr Gln 
    1760                 1765                 1770             


Ser Phe  Asn Lys Met Ser Ile  Asp Phe Lys Asp Ile  Lys Lys Leu 
    1775                 1780                 1785             


Ser Leu  Gly Tyr Ile Met Ser  Asn Phe Lys Ser Phe  Asn Ser Glu 
    1790                 1795                 1800             


Asn Glu  Leu Asp Arg Asp His  Leu Gly Phe Lys Ile  Ile Asp Asn 
    1805                 1810                 1815             


Lys Thr  Tyr Tyr Tyr Asp Glu  Asp Ser Lys Leu Val  Lys Gly Leu 
    1820                 1825                 1830             


Ile Asn  Ile Asn Asn Ser Leu  Phe Tyr Phe Asp Pro  Ile Glu Phe 
    1835                 1840                 1845             


Asn Leu  Val Thr Gly Trp Gln  Thr Ile Asn Gly Lys  Lys Tyr Tyr 
    1850                 1855                 1860             


Phe Asp  Ile Asn Thr Gly Ala  Ala Leu Ile Ser Tyr  Lys Ile Ile 
    1865                 1870                 1875             


Asn Gly  Lys His Phe Tyr Phe  Asn Asn Asp Gly Val  Met Gln Leu 
    1880                 1885                 1890             


Gly Val  Phe Lys Gly Pro Asp  Gly Phe Glu Tyr Phe  Ala Pro Ala 
    1895                 1900                 1905             


Asn Thr  Gln Asn Asn Asn Ile  Glu Gly Gln Ala Ile  Val Tyr Gln 
    1910                 1915                 1920             


Ser Lys  Phe Leu Thr Leu Asn  Gly Lys Lys Tyr Tyr  Phe Asp Asn 
    1925                 1930                 1935             


Asp Ser  Lys Ala Val Thr Gly  Trp Arg Ile Ile Asn  Asn Glu Lys 
    1940                 1945                 1950             


Tyr Tyr  Phe Asn Pro Asn Asn  Ala Ile Ala Ala Val  Gly Leu Gln 
    1955                 1960                 1965             


Val Ile  Asp Asn Asn Lys Tyr  Tyr Phe Asn Pro Asp  Thr Ala Ile 
    1970                 1975                 1980             


Ile Ser  Lys Gly Trp Gln Thr  Val Asn Gly Ser Arg  Tyr Tyr Phe 
    1985                 1990                 1995             


Asp Thr  Asp Thr Ala Ile Ala  Phe Asn Gly Tyr Lys  Thr Ile Asp 
    2000                 2005                 2010             


Gly Lys  His Phe Tyr Phe Asp  Ser Asp Cys Val Val  Lys Ile Gly 
    2015                 2020                 2025             


Val Phe  Ser Thr Ser Asn Gly  Phe Glu Tyr Phe Ala  Pro Ala Asn 
    2030                 2035                 2040             


Thr Tyr  Asn Asn Asn Ile Glu  Gly Gln Ala Ile Val  Tyr Gln Ser 
    2045                 2050                 2055             


Lys Phe  Leu Thr Leu Asn Gly  Lys Lys Tyr Tyr Phe  Asp Asn Asn 
    2060                 2065                 2070             


Ser Lys  Ala Val Thr Gly Trp  Gln Thr Ile Asp Ser  Lys Lys Tyr 
    2075                 2080                 2085             


Tyr Phe  Asn Thr Asn Thr Ala  Glu Ala Ala Thr Gly  Trp Gln Thr 
    2090                 2095                 2100             


Ile Asp  Gly Lys Lys Tyr Tyr  Phe Asn Thr Asn Thr  Ala Glu Ala 
    2105                 2110                 2115             


Ala Thr  Gly Trp Gln Thr Ile  Asp Gly Lys Lys Tyr  Tyr Phe Asn 
    2120                 2125                 2130             


Thr Asn  Thr Ala Ile Ala Ser  Thr Gly Tyr Thr Ile  Ile Asn Gly 
    2135                 2140                 2145             


Lys His  Phe Tyr Phe Asn Thr  Asp Gly Ile Met Gln  Ile Gly Val 
    2150                 2155                 2160             


Phe Lys  Gly Pro Asn Gly Phe  Glu Tyr Phe Ala Pro  Ala Asn Thr 
    2165                 2170                 2175             


Asp Ala  Asn Asn Ile Glu Gly  Gln Ala Ile Leu Tyr  Gln Asn Glu 
    2180                 2185                 2190             


Phe Leu  Thr Leu Asn Gly Lys  Lys Tyr Tyr Phe Gly  Ser Asp Ser 
    2195                 2200                 2205             


Lys Ala  Val Thr Gly Trp Arg  Ile Ile Asn Asn Lys  Lys Tyr Tyr 
    2210                 2215                 2220             


Phe Asn  Pro Asn Asn Ala Ile  Ala Ala Ile His Leu  Cys Thr Ile 
    2225                 2230                 2235             


Asn Asn  Asp Lys Tyr Tyr Phe  Ser Tyr Asp Gly Ile  Leu Gln Asn 
    2240                 2245                 2250             


Gly Tyr  Ile Thr Ile Glu Arg  Asn Asn Phe Tyr Phe  Asp Ala Asn 
    2255                 2260                 2265             


Asn Glu  Ser Lys Met Val Thr  Gly Val Phe Lys Gly  Pro Asn Gly 
    2270                 2275                 2280             


Phe Glu  Tyr Phe Ala Pro Ala  Asn Thr His Asn Asn  Asn Ile Glu 
    2285                 2290                 2295             


Gly Gln  Ala Ile Val Tyr Gln  Asn Lys Phe Leu Thr  Leu Asn Gly 
    2300                 2305                 2310             


Lys Lys  Tyr Tyr Phe Asp Asn  Asp Ser Lys Ala Val  Thr Gly Trp 
    2315                 2320                 2325             


Gln Thr  Ile Asp Gly Lys Lys  Tyr Tyr Phe Asn Leu  Asn Thr Ala 
    2330                 2335                 2340             


Glu Ala  Ala Thr Gly Trp Gln  Thr Ile Asp Gly Lys  Lys Tyr Tyr 
    2345                 2350                 2355             


Phe Asn  Leu Asn Thr Ala Glu  Ala Ala Thr Gly Trp  Gln Thr Ile 
    2360                 2365                 2370             


Asp Gly  Lys Lys Tyr Tyr Phe  Asn Thr Asn Thr Phe  Ile Ala Ser 
    2375                 2380                 2385             


Thr Gly  Tyr Thr Ser Ile Asn  Gly Lys His Phe Tyr  Phe Asn Thr 
    2390                 2395                 2400             


Asp Gly  Ile Met Gln Ile Gly  Val Phe Lys Gly Pro  Asn Gly Phe 
    2405                 2410                 2415             


Glu Tyr  Phe Ala Pro Ala Asn  Thr His Asn Asn Asn  Ile Glu Gly 
    2420                 2425                 2430             


Gln Ala  Ile Leu Tyr Gln Asn  Lys Phe Leu Thr Leu  Asn Gly Lys 
    2435                 2440                 2445             


Lys Tyr  Tyr Phe Gly Ser Asp  Ser Lys Ala Val Thr  Gly Leu Arg 
    2450                 2455                 2460             


Thr Ile  Asp Gly Lys Lys Tyr  Tyr Phe Asn Thr Asn  Thr Ala Val 
    2465                 2470                 2475             


Ala Val  Thr Gly Trp Gln Thr  Ile Asn Gly Lys Lys  Tyr Tyr Phe 
    2480                 2485                 2490             


Asn Thr  Asn Thr Ser Ile Ala  Ser Thr Gly Tyr Thr  Ile Ile Ser 
    2495                 2500                 2505             


Gly Lys  His Phe Tyr Phe Asn  Thr Asp Gly Ile Met  Gln Ile Gly 
    2510                 2515                 2520             


Val Phe  Lys Gly Pro Asp Gly  Phe Glu Tyr Phe Ala  Pro Ala Asn 
    2525                 2530                 2535             


Thr Asp  Ala Asn Asn Ile Glu  Gly Gln Ala Ile Arg  Tyr Gln Asn 
    2540                 2545                 2550             


Arg Phe  Leu Tyr Leu His Asp  Asn Ile Tyr Tyr Phe  Gly Asn Asn 
    2555                 2560                 2565             


Ser Lys  Ala Ala Thr Gly Trp  Val Thr Ile Asp Gly  Asn Arg Tyr 
    2570                 2575                 2580             


Tyr Phe  Glu Pro Asn Thr Ala  Met Gly Ala Asn Gly  Tyr Lys Thr 
    2585                 2590                 2595             


Ile Asp  Asn Lys Asn Phe Tyr  Phe Arg Asn Gly Leu  Pro Gln Ile 
    2600                 2605                 2610             


Gly Val  Phe Lys Gly Ser Asn  Gly Phe Glu Tyr Phe  Ala Pro Ala 
    2615                 2620                 2625             


Asn Thr  Asp Ala Asn Asn Ile  Glu Gly Gln Ala Ile  Arg Tyr Gln 
    2630                 2635                 2640             


Asn Arg  Phe Leu His Leu Leu  Gly Lys Ile Tyr Tyr  Phe Gly Asn 
    2645                 2650                 2655             


Asn Ser  Lys Ala Val Thr Gly  Trp Gln Thr Ile Asn  Gly Lys Val 
    2660                 2665                 2670             


Tyr Tyr  Phe Met Pro Asp Thr  Ala Met Ala Ala Ala  Gly Gly Leu 
    2675                 2680                 2685             


Phe Glu  Ile Asp Gly Val Ile  Tyr Phe Phe Gly Val  Asp Gly Val 
    2690                 2695                 2700             


Lys Ala  Pro Gly Ile Tyr Gly  
    2705                 2710 


<210> 2
<211> 8133
<212> DNA
<213> Clostridium difficile

<400> 2
atgtctttaa tatctaaaga agagttaata aaactcgcat atagcattag accaagagaa       60

aatgagtata aaactatact aactaattta gacgaatata ataagttaac tacaaacaat      120

aatgaaaata aatatttaca attaaaaaaa ctaaatgaat caattgatgt ttttatgaat      180

aaatataaaa cttcaagcag aaatagagca ctctctaatc taaaaaaaga tatattaaaa      240

gaagtaattc ttattaaaaa ttccaataca agccctgtag aaaaaaattt acattttgta      300

tggataggtg gagaagtcag tgatattgct cttgaataca taaaacaatg ggctgatatt      360

aatgcagaat ataatattaa actgtggtat gatagtgaag cattcttagt aaatacacta      420

aaaaaggcta tagttgaatc ttctaccact gaagcattac agctactaga ggaagagatt      480

caaaatcctc aatttgataa tatgaaattt tacaaaaaaa ggatggaatt tatatatgat      540

agacaaaaaa ggtttataaa ttattataaa tctcaaatca ataaacctac agtacctaca      600

atagatgata ttataaagtc tcatctagta tctgaatata atagagatga aactgtatta      660

gaatcatata gaacaaattc tttgagaaaa ataaatagta atcatgggat agatatcagg      720

gctaatagtt tgtttacaga acaagagtta ttaaatattt atagtcagga gttgttaaat      780

cgtggaaatt tagctgcagc atctgacata gtaagattat tagccctaaa aaattttggc      840

ggagtatatt tagatgttga tatgcttcca ggtattcact ctgatttatt taaaacaata      900

tctagaccta gctctattgg actagaccgt tgggaaatga taaaattaga ggctattatg      960

aagtataaaa aatatataaa taattataca tcagaaaact ttgataaact tgatcaacaa     1020

ttaaaagata attttaaact cattatagaa agtaaaagtg aaaaatctga gatattttct     1080

aaattagaaa atttaaatgt atctgatctt gaaattaaaa tagctttcgc tttaggcagt     1140

gttataaatc aagccttgat atcaaaacaa ggttcatatc ttactaacct agtaatagaa     1200

caagtaaaaa atagatatca atttttaaac caacacctta acccagccat agagtctgat     1260

aataacttca cagatactac taaaattttt catgattcat tatttaattc agctaccgca     1320

gaaaactcta tgtttttaac aaaaatagca ccatacttac aagtaggttt tatgccagaa     1380

gctcgctcca caataagttt aagtggtcca ggagcttatg cgtcagctta ctatgatttc     1440

ataaatttac aagaaaatac tatagaaaaa actttaaaag catcagattt aatagaattt     1500

aaattcccag aaaataatct atctcaattg acagaacaag aaataaatag tctatggagc     1560

tttgatcaag caagtgcaaa atatcaattt gagaaatatg taagagatta tactggtgga     1620

tctctttctg aagacaatgg ggtagacttt aataaaaata ctgccctcga caaaaactat     1680

ttattaaata ataaaattcc atcaaacaat gtagaagaag ctggaagtaa aaattatgtt     1740

cattatatca tacagttaca aggagatgat ataagttatg aagcaacatg caatttattt     1800

tctaaaaatc ctaaaaatag tattattata caacgaaata tgaatgaaag tgcaaaaagc     1860

tactttttaa gtgatgatgg agaatctatt ttagaattaa ataaatatag gatacctgaa     1920

agattaaaaa ataaggaaaa agtaaaagta acctttattg gacatggtaa agatgaattc     1980

aacacaagcg aatttgctag attaagtgta gattcacttt ccaatgagat aagttcattt     2040

ttagatacca taaaattaga tatatcacct aaaaatgtag aagtaaactt acttggatgt     2100

aatatgttta gttatgattt taatgttgaa gaaacttatc ctgggaagtt gctattaagt     2160

attatggaca aaattacttc cactttacct gatgtaaata aaaattctat tactatagga     2220

gcaaatcaat atgaagtaag aattaatagt gagggaagaa aagaacttct ggctcactca     2280

ggtaaatgga taaataaaga agaagctatt atgagcgatt tatctagtaa agaatacatt     2340

ttttttgatt ctatagataa taagctaaaa gcaaagtcca agaatattcc aggattagca     2400

tcaatatcag aagatataaa aacattatta cttgatgcaa gtgttagtcc tgatacaaaa     2460

tttattttaa ataatcttaa gcttaatatt gaatcttcta ttggtgatta catttattat     2520

gaaaaattag agcctgttaa aaatataatt cacaattcta tagatgattt aatagatgag     2580

ttcaatctac ttgaaaatgt atctgatgaa ttatatgaat taaaaaaatt aaataatcta     2640

gatgagaagt atttaatatc ttttgaagat atctcaaaaa ataattcaac ttactctgta     2700

agatttatta acaaaagtaa tggtgagtca gtttatgtag aaacagaaaa agaaattttt     2760

tcaaaatata gcgaacatat tacaaaagaa ataagtacta taaagaatag tataattaca     2820

gatgttaatg gtaatttatt ggataatata cagttagatc atacttctca agttaataca     2880

ttaaacgcag cattctttat tcaatcatta atagattata gtagcaataa agatgtactg     2940

aatgatttaa gtacctcagt taaggttcaa ctttatgctc aactatttag tacaggttta     3000

aatactatat atgactctat ccaattagta aatttaatat caaatgcagt aaatgatact     3060

ataaatgtac tacctacaat aacagagggg atacctattg tatctactat attagacgga     3120

ataaacttag gtgcagcaat taaggaatta ctagacgaac atgacccatt actaaaaaaa     3180

gaattagaag ctaaggtggg tgttttagca ataaatatgt cattatctat agctgcaact     3240

gtagcttcaa ttgttggaat aggtgctgaa gttactattt tcttattacc tatagctggt     3300

atatctgcag gaataccttc attagttaat aatgaattaa tattgcatga taaggcaact     3360

tcagtggtaa actattttaa tcatttgtct gaatctaaaa aatatggccc tcttaaaaca     3420

gaagatgata aaattttagt tcctattgat gatttagtaa tatcagaaat agattttaat     3480

aataattcga taaaactagg aacatgtaat atattagcaa tggagggggg atcaggacac     3540

acagtgactg gtaatataga tcactttttc tcatctccat ctataagttc tcatattcct     3600

tcattatcaa tttattctgc aataggtata gaaacagaaa atctagattt ttcaaaaaaa     3660

ataatgatgt tacctaatgc tccttcaaga gtgttttggt gggaaactgg agcagttcca     3720

ggtttaagat cattggaaaa tgacggaact agattacttg attcaataag agatttatac     3780

ccaggtaaat tttactggag attctatgct tttttcgatt atgcaataac tacattaaaa     3840

ccagtttatg aagacactaa tattaaaatt aaactagata aagatactag aaacttcata     3900

atgccaacta taactactaa cgaaattaga aacaaattat cttattcatt tgatggagca     3960

ggaggaactt actctttatt attatcttca tatccaatat caacgaatat aaatttatct     4020

aaagatgatt tatggatatt taatattgat aatgaagtaa gagaaatatc tatagaaaat     4080

ggtactatta aaaaaggaaa gttaataaaa gatgttttaa gtaaaattga tataaataaa     4140

aataaactta ttataggcaa tcaaacaata gatttttcag gcgatataga taataaagat     4200

agatatatat tcttgacttg tgagttagat gataaaatta gtttaataat agaaataaat     4260

cttgttgcaa aatcttatag tttgttattg tctggggata aaaattattt gatatccaat     4320

ttatctaata ttattgagaa aatcaatact ttaggcctag atagtaaaaa tatagcgtac     4380

aattacactg atgaatctaa taataaatat tttggagcta tatctaaaac aagtcaaaaa     4440

agcataatac attataaaaa agacagtaaa aatatattag aattttataa tgacagtaca     4500

ttagaattta acagtaaaga ttttattgct gaagatataa atgtatttat gaaagatgat     4560

attaatacta taacaggaaa atactatgtt gataataata ctgataaaag tatagatttc     4620

tctatttctt tagttagtaa aaatcaagta aaagtaaatg gattatattt aaatgaatcc     4680

gtatactcat cttaccttga ttttgtgaaa aattcagatg gacaccataa tacttctaat     4740

tttatgaatt tatttttgga caatataagt ttctggaaat tgtttgggtt tgaaaatata     4800

aattttgtaa tcgataaata ctttaccctt gttggtaaaa ctaatcttgg atatgtagaa     4860

tttatttgtg acaataataa aaatatagat atatattttg gtgaatggaa aacatcgtca     4920

tctaaaagca ctatatttag cggaaatggt agaaatgttg tagtagagcc tatatataat     4980

cctgatacgg gtgaagatat atctacttca ctagattttt cctatgaacc tctctatgga     5040

atagatagat atatcaataa agtattgata gcacctgatt tatatacaag tttaataaat     5100

attaatacca attattattc aaatgagtac taccctgaga ttatagttct taacccaaat     5160

acattccaca aaaaagtaaa tataaattta gatagttctt cttttgagta taaatggtct     5220

acagaaggaa gtgactttat tttagttaga tacttagaag aaagtaataa aaaaatatta     5280

caaaaaataa gaatcaaagg tatcttatct aatactcaat catttaataa aatgagtata     5340

gattttaaag atattaaaaa actatcatta ggatatataa tgagtaattt taaatcattt     5400

aattctgaaa atgaattaga tagagatcat ttaggattta aaataataga taataaaact     5460

tattactatg atgaagatag taaattagtt aaaggattaa tcaatataaa taattcatta     5520

ttctattttg atcctataga atttaactta gtaactggat ggcaaactat caatggtaaa     5580

aaatattatt ttgatataaa tactggagca gctttaatta gttataaaat tattaatggt     5640

aaacactttt attttaataa tgatggtgtg atgcagttgg gagtatttaa aggacctgat     5700

ggatttgaat attttgcacc tgccaatact caaaataata acatagaagg tcaggctata     5760

gtttatcaaa gtaaattctt aactttgaat ggcaaaaaat attattttga taatgactca     5820

aaagcagtca ctggatggag aattattaac aatgagaaat attactttaa tcctaataat     5880

gctattgctg cagtcggatt gcaagtaatt gacaataata agtattattt caatcctgac     5940

actgctatca tctcaaaagg ttggcagact gttaatggta gtagatacta ctttgatact     6000

gataccgcta ttgcctttaa tggttataaa actattgatg gtaaacactt ttattttgat     6060

agtgattgtg tagtgaaaat aggtgtgttt agtacctcta atggatttga atattttgca     6120

cctgctaata cttataataa taacatagaa ggtcaggcta tagtttatca aagtaaattc     6180

ttaactttga atggtaaaaa atattacttt gataataact caaaagcagt taccggatgg     6240

caaactattg atagtaaaaa atattacttt aatactaaca ctgctgaagc agctactgga     6300

tggcaaacta ttgatggtaa aaaatattac tttaatacta acactgctga agcagctact     6360

ggatggcaaa ctattgatgg taaaaaatat tactttaata ctaacactgc tatagcttca     6420

actggttata caattattaa tggtaaacat ttttatttta atactgatgg tattatgcag     6480

ataggagtgt ttaaaggacc taatggattt gaatattttg cacctgctaa tacggatgct     6540

aacaacatag aaggtcaagc tatactttac caaaatgaat tcttaacttt gaatggtaaa     6600

aaatattact ttggtagtga ctcaaaagca gttactggat ggagaattat taacaataag     6660

aaatattact ttaatcctaa taatgctatt gctgcaattc atctatgcac tataaataat     6720

gacaagtatt actttagtta tgatggaatt cttcaaaatg gatatattac tattgaaaga     6780

aataatttct attttgatgc taataatgaa tctaaaatgg taacaggagt atttaaagga     6840

cctaatggat ttgagtattt tgcacctgct aatactcaca ataataacat agaaggtcag     6900

gctatagttt accagaacaa attcttaact ttgaatggca aaaaatatta ttttgataat     6960

gactcaaaag cagttactgg atggcaaacc attgatggta aaaaatatta ctttaatctt     7020

aacactgctg aagcagctac tggatggcaa actattgatg gtaaaaaata ttactttaat     7080

cttaacactg ctgaagcagc tactggatgg caaactattg atggtaaaaa atattacttt     7140

aatactaaca ctttcatagc ctcaactggt tatacaagta ttaatggtaa acatttttat     7200

tttaatactg atggtattat gcagatagga gtgtttaaag gacctaatgg atttgaatac     7260

tttgcacctg ctaatactca taataataac atagaaggtc aagctatact ttaccaaaat     7320

aaattcttaa ctttgaatgg taaaaaatat tactttggta gtgactcaaa agcagttacc     7380

ggattgcgaa ctattgatgg taaaaaatat tactttaata ctaacactgc tgttgcagtt     7440

actggatggc aaactattaa tggtaaaaaa tactacttta atactaacac ttctatagct     7500

tcaactggtt atacaattat tagtggtaaa catttttatt ttaatactga tggtattatg     7560

cagataggag tgtttaaagg acctgatgga tttgaatact ttgcacctgc taatacagat     7620

gctaacaata tagaaggtca agctatacgt tatcaaaata gattcctata tttacatgac     7680

aatatatatt attttggtaa taattcaaaa gcagctactg gttgggtaac tattgatggt     7740

aatagatatt acttcgagcc taatacagct atgggtgcga atggttataa aactattgat     7800

aataaaaatt tttactttag aaatggttta cctcagatag gagtgtttaa agggtctaat     7860

ggatttgaat actttgcacc tgctaatacg gatgctaaca atatagaagg tcaagctata     7920

cgttatcaaa atagattcct acatttactt ggaaaaatat attactttgg taataattca     7980

aaagcagtta ctggatggca aactattaat ggtaaagtat attactttat gcctgatact     8040

gctatggctg cagctggtgg acttttcgag attgatggtg ttatatattt ctttggtgtt     8100

gatggagtaa aagcccctgg gatatatggc taa                                  8133


<210> 3
<211> 2366
<212> PRT
<213> Clostridium difficile

<400> 3
Met Ser Leu Val Asn Arg Lys Gln Leu Glu Lys Met Ala Asn Val Arg 
1               5                   10                  15      


Phe Arg Thr Gln Glu Asp Glu Tyr Val Ala Ile Leu Asp Ala Leu Glu 
            20                  25                  30          


Glu Tyr His Asn Met Ser Glu Asn Thr Val Val Glu Lys Tyr Leu Lys 
        35                  40                  45              


Leu Lys Asp Ile Asn Ser Leu Thr Asp Ile Tyr Ile Asp Thr Tyr Lys 
    50                  55                  60                  


Lys Ser Gly Arg Asn Lys Ala Leu Lys Lys Phe Lys Glu Tyr Leu Val 
65                  70                  75                  80  


Thr Glu Val Leu Glu Leu Lys Asn Asn Asn Leu Thr Pro Val Glu Lys 
                85                  90                  95      


Asn Leu His Phe Val Trp Ile Gly Gly Gln Ile Asn Asp Thr Ala Ile 
            100                 105                 110         


Asn Tyr Ile Asn Gln Trp Lys Asp Val Asn Ser Asp Tyr Asn Val Asn 
        115                 120                 125             


Val Phe Tyr Asp Ser Asn Ala Phe Leu Ile Asn Thr Leu Lys Lys Thr 
    130                 135                 140                 


Val Val Glu Ser Ala Ile Asn Asp Thr Leu Glu Ser Phe Arg Glu Asn 
145                 150                 155                 160 


Leu Asn Asp Pro Arg Phe Asp Tyr Asn Lys Phe Phe Arg Lys Arg Met 
                165                 170                 175     


Glu Ile Ile Tyr Asp Lys Gln Lys Asn Phe Ile Asn Tyr Tyr Lys Ala 
            180                 185                 190         


Gln Arg Glu Glu Asn Pro Glu Leu Ile Ile Asp Asp Ile Val Lys Thr 
        195                 200                 205             


Tyr Leu Ser Asn Glu Tyr Ser Lys Glu Ile Asp Glu Leu Asn Thr Tyr 
    210                 215                 220                 


Ile Glu Glu Ser Leu Asn Lys Ile Thr Gln Asn Ser Gly Asn Asp Val 
225                 230                 235                 240 


Arg Asn Phe Glu Glu Phe Lys Asn Gly Glu Ser Phe Asn Leu Tyr Glu 
                245                 250                 255     


Gln Glu Leu Val Glu Arg Trp Asn Leu Ala Ala Ala Ser Asp Ile Leu 
            260                 265                 270         


Arg Ile Ser Ala Leu Lys Glu Ile Gly Gly Met Tyr Leu Asp Val Asp 
        275                 280                 285             


Met Leu Pro Gly Ile Gln Pro Asp Leu Phe Glu Ser Ile Glu Lys Pro 
    290                 295                 300                 


Ser Ser Val Thr Val Asp Phe Trp Glu Met Thr Lys Leu Glu Ala Ile 
305                 310                 315                 320 


Met Lys Tyr Lys Glu Tyr Ile Pro Glu Tyr Thr Ser Glu His Phe Asp 
                325                 330                 335     


Met Leu Asp Glu Glu Val Gln Ser Ser Phe Glu Ser Val Leu Ala Ser 
            340                 345                 350         


Lys Ser Asp Lys Ser Glu Ile Phe Ser Ser Leu Gly Asp Met Glu Ala 
        355                 360                 365             


Ser Pro Leu Glu Val Lys Ile Ala Phe Asn Ser Lys Gly Ile Ile Asn 
    370                 375                 380                 


Gln Gly Leu Ile Ser Val Lys Asp Ser Tyr Cys Ser Asn Leu Ile Val 
385                 390                 395                 400 


Lys Gln Ile Glu Asn Arg Tyr Lys Ile Leu Asn Asn Ser Leu Asn Pro 
                405                 410                 415     


Ala Ile Ser Glu Asp Asn Asp Phe Asn Thr Thr Thr Asn Thr Phe Ile 
            420                 425                 430         


Asp Ser Ile Met Ala Glu Ala Asn Ala Asp Asn Gly Arg Phe Met Met 
        435                 440                 445             


Glu Leu Gly Lys Tyr Leu Arg Val Gly Phe Phe Pro Asp Val Lys Thr 
    450                 455                 460                 


Thr Ile Asn Leu Ser Gly Pro Glu Ala Tyr Ala Ala Ala Tyr Gln Asp 
465                 470                 475                 480 


Leu Leu Met Phe Lys Glu Gly Ser Met Asn Ile His Leu Ile Glu Ala 
                485                 490                 495     


Asp Leu Arg Asn Phe Glu Ile Ser Lys Thr Asn Ile Ser Gln Ser Thr 
            500                 505                 510         


Glu Gln Glu Met Ala Ser Leu Trp Ser Phe Asp Asp Ala Arg Ala Lys 
        515                 520                 525             


Ala Gln Phe Glu Glu Tyr Lys Arg Asn Tyr Phe Glu Gly Ser Leu Gly 
    530                 535                 540                 


Glu Asp Asp Asn Leu Asp Phe Ser Gln Asn Ile Val Val Asp Lys Glu 
545                 550                 555                 560 


Tyr Leu Leu Glu Lys Ile Ser Ser Leu Ala Arg Ser Ser Glu Arg Gly 
                565                 570                 575     


Tyr Ile His Tyr Ile Val Gln Leu Gln Gly Asp Lys Ile Ser Tyr Glu 
            580                 585                 590         


Ala Ala Cys Asn Leu Phe Ala Lys Thr Pro Tyr Asp Ser Val Leu Phe 
        595                 600                 605             


Gln Lys Asn Ile Glu Asp Ser Glu Ile Ala Tyr Tyr Tyr Asn Pro Gly 
    610                 615                 620                 


Asp Gly Glu Ile Gln Glu Ile Asp Lys Tyr Lys Ile Pro Ser Ile Ile 
625                 630                 635                 640 


Ser Asp Arg Pro Lys Ile Lys Leu Thr Phe Ile Gly His Gly Lys Asp 
                645                 650                 655     


Glu Phe Asn Thr Asp Ile Phe Ala Gly Phe Asp Val Asp Ser Leu Ser 
            660                 665                 670         


Thr Glu Ile Glu Ala Ala Ile Asp Leu Ala Lys Glu Asp Ile Ser Pro 
        675                 680                 685             


Lys Ser Ile Glu Ile Asn Leu Leu Gly Cys Asn Met Phe Ser Tyr Ser 
    690                 695                 700                 


Ile Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Lys Val Lys 
705                 710                 715                 720 


Asp Lys Ile Ser Glu Leu Met Pro Ser Ile Ser Gln Asp Ser Ile Ile 
                725                 730                 735     


Val Ser Ala Asn Gln Tyr Glu Val Arg Ile Asn Ser Glu Gly Arg Arg 
            740                 745                 750         


Glu Leu Leu Asp His Ser Gly Glu Trp Ile Asn Lys Glu Glu Ser Ile 
        755                 760                 765             


Ile Lys Asp Ile Ser Ser Lys Glu Tyr Ile Ser Phe Asn Pro Lys Glu 
    770                 775                 780                 


Asn Lys Ile Thr Val Lys Ser Lys Asn Leu Pro Glu Leu Ser Thr Leu 
785                 790                 795                 800 


Leu Gln Glu Ile Arg Asn Asn Ser Asn Ser Ser Asp Ile Glu Leu Glu 
                805                 810                 815     


Glu Lys Val Met Leu Thr Glu Cys Glu Ile Asn Val Ile Ser Asn Ile 
            820                 825                 830         


Asp Thr Gln Ile Val Glu Glu Arg Ile Glu Glu Ala Lys Asn Leu Thr 
        835                 840                 845             


Ser Asp Ser Ile Asn Tyr Ile Lys Asp Glu Phe Lys Leu Ile Glu Ser 
    850                 855                 860                 


Ile Ser Asp Ala Leu Cys Asp Leu Lys Gln Gln Asn Glu Leu Glu Asp 
865                 870                 875                 880 


Ser His Phe Ile Ser Phe Glu Asp Ile Ser Glu Thr Asp Glu Gly Phe 
                885                 890                 895     


Ser Ile Arg Phe Ile Asn Lys Glu Thr Gly Glu Ser Ile Phe Val Glu 
            900                 905                 910         


Thr Glu Lys Thr Ile Phe Ser Glu Tyr Ala Asn His Ile Thr Glu Glu 
        915                 920                 925             


Ile Ser Lys Ile Lys Gly Thr Ile Phe Asp Thr Val Asn Gly Lys Leu 
    930                 935                 940                 


Val Lys Lys Val Asn Leu Asp Thr Thr His Glu Val Asn Thr Leu Asn 
945                 950                 955                 960 


Ala Ala Phe Phe Ile Gln Ser Leu Ile Glu Tyr Asn Ser Ser Lys Glu 
                965                 970                 975     


Ser Leu Ser Asn Leu Ser Val Ala Met Lys Val Gln Val Tyr Ala Gln 
            980                 985                 990         


Leu Phe Ser Thr Gly Leu Asn Thr  Ile Thr Asp Ala Ala  Lys Val Val 
        995                 1000                 1005             


Glu Leu  Val Ser Thr Ala Leu  Asp Glu Thr Ile Asp  Leu Leu Pro 
    1010                 1015                 1020             


Thr Leu  Ser Glu Gly Leu Pro  Ile Ile Ala Thr Ile  Ile Asp Gly 
    1025                 1030                 1035             


Val Ser  Leu Gly Ala Ala Ile  Lys Glu Leu Ser Glu  Thr Ser Asp 
    1040                 1045                 1050             


Pro Leu  Leu Arg Gln Glu Ile  Glu Ala Lys Ile Gly  Ile Met Ala 
    1055                 1060                 1065             


Val Asn  Leu Thr Thr Ala Thr  Thr Ala Ile Ile Thr  Ser Ser Leu 
    1070                 1075                 1080             


Gly Ile  Ala Ser Gly Phe Ser  Ile Leu Leu Val Pro  Leu Ala Gly 
    1085                 1090                 1095             


Ile Ser  Ala Gly Ile Pro Ser  Leu Val Asn Asn Glu  Leu Val Leu 
    1100                 1105                 1110             


Arg Asp  Lys Ala Thr Lys Val  Val Asp Tyr Phe Lys  His Val Ser 
    1115                 1120                 1125             


Leu Val  Glu Thr Glu Gly Val  Phe Thr Leu Leu Asp  Asp Lys Ile 
    1130                 1135                 1140             


Met Met  Pro Gln Asp Asp Leu  Val Ile Ser Glu Ile  Asp Phe Asn 
    1145                 1150                 1155             


Asn Asn  Ser Ile Val Leu Gly  Lys Cys Glu Ile Trp  Arg Met Glu 
    1160                 1165                 1170             


Gly Gly  Ser Gly His Thr Val  Thr Asp Asp Ile Asp  His Phe Phe 
    1175                 1180                 1185             


Ser Ala  Pro Ser Ile Thr Tyr  Arg Glu Pro His Leu  Ser Ile Tyr 
    1190                 1195                 1200             


Asp Val  Leu Glu Val Gln Lys  Glu Glu Leu Asp Leu  Ser Lys Asp 
    1205                 1210                 1215             


Leu Met  Val Leu Pro Asn Ala  Pro Asn Arg Val Phe  Ala Trp Glu 
    1220                 1225                 1230             


Thr Gly  Trp Thr Pro Gly Leu  Arg Ser Leu Glu Asn  Asp Gly Thr 
    1235                 1240                 1245             


Lys Leu  Leu Asp Arg Ile Arg  Asp Asn Tyr Glu Gly  Glu Phe Tyr 
    1250                 1255                 1260             


Trp Arg  Tyr Phe Ala Phe Ile  Ala Asp Ala Leu Ile  Thr Thr Leu 
    1265                 1270                 1275             


Lys Pro  Arg Tyr Glu Asp Thr  Asn Ile Arg Ile Asn  Leu Asp Ser 
    1280                 1285                 1290             


Asn Thr  Arg Ser Phe Ile Val  Pro Ile Ile Thr Thr  Glu Tyr Ile 
    1295                 1300                 1305             


Arg Glu  Lys Leu Ser Tyr Ser  Phe Tyr Gly Ser Gly  Gly Thr Tyr 
    1310                 1315                 1320             


Ala Leu  Ser Leu Ser Gln Tyr  Asn Met Gly Ile Asn  Ile Glu Leu 
    1325                 1330                 1335             


Ser Glu  Ser Asp Val Trp Ile  Ile Asp Val Asp Asn  Val Val Arg 
    1340                 1345                 1350             


Asp Val  Thr Ile Glu Ser Asp  Lys Ile Lys Lys Gly  Asp Leu Ile 
    1355                 1360                 1365             


Glu Gly  Ile Leu Ser Thr Leu  Ser Ile Glu Glu Asn  Lys Ile Ile 
    1370                 1375                 1380             


Leu Asn  Ser His Glu Ile Asn  Phe Ser Gly Glu Val  Asn Gly Ser 
    1385                 1390                 1395             


Asn Gly  Phe Val Ser Leu Thr  Phe Ser Ile Leu Glu  Gly Ile Asn 
    1400                 1405                 1410             


Ala Ile  Ile Glu Val Asp Leu  Leu Ser Lys Ser Tyr  Lys Leu Leu 
    1415                 1420                 1425             


Ile Ser  Gly Glu Leu Lys Ile  Leu Met Leu Asn Ser  Asn His Ile 
    1430                 1435                 1440             


Gln Gln  Lys Ile Asp Tyr Ile  Gly Phe Asn Ser Glu  Leu Gln Lys 
    1445                 1450                 1455             


Asn Ile  Pro Tyr Ser Phe Val  Asp Ser Glu Gly Lys  Glu Asn Gly 
    1460                 1465                 1470             


Phe Ile  Asn Gly Ser Thr Lys  Glu Gly Leu Phe Val  Ser Glu Leu 
    1475                 1480                 1485             


Pro Asp  Val Val Leu Ile Ser  Lys Val Tyr Met Asp  Asp Ser Lys 
    1490                 1495                 1500             


Pro Ser  Phe Gly Tyr Tyr Ser  Asn Asn Leu Lys Asp  Val Lys Val 
    1505                 1510                 1515             


Ile Thr  Lys Asp Asn Val Asn  Ile Leu Thr Gly Tyr  Tyr Leu Lys 
    1520                 1525                 1530             


Asp Asp  Ile Lys Ile Ser Leu  Ser Leu Thr Leu Gln  Asp Glu Lys 
    1535                 1540                 1545             


Thr Ile  Lys Leu Asn Ser Val  His Leu Asp Glu Ser  Gly Val Ala 
    1550                 1555                 1560             


Glu Ile  Leu Lys Phe Met Asn  Arg Lys Gly Asn Thr  Asn Thr Ser 
    1565                 1570                 1575             


Asp Ser  Leu Met Ser Phe Leu  Glu Ser Met Asn Ile  Lys Ser Ile 
    1580                 1585                 1590             


Phe Val  Asn Phe Leu Gln Ser  Asn Ile Lys Phe Ile  Leu Asp Ala 
    1595                 1600                 1605             


Asn Phe  Ile Ile Ser Gly Thr  Thr Ser Ile Gly Gln  Phe Glu Phe 
    1610                 1615                 1620             


Ile Cys  Asp Glu Asn Asp Asn  Ile Gln Pro Tyr Phe  Ile Lys Phe 
    1625                 1630                 1635             


Asn Thr  Leu Glu Thr Asn Tyr  Thr Leu Tyr Val Gly  Asn Arg Gln 
    1640                 1645                 1650             


Asn Met  Ile Val Glu Pro Asn  Tyr Asp Leu Asp Asp  Ser Gly Asp 
    1655                 1660                 1665             


Ile Ser  Ser Thr Val Ile Asn  Phe Ser Gln Lys Tyr  Leu Tyr Gly 
    1670                 1675                 1680             


Ile Asp  Ser Cys Val Asn Lys  Val Val Ile Ser Pro  Asn Ile Tyr 
    1685                 1690                 1695             


Thr Asp  Glu Ile Asn Ile Thr  Pro Val Tyr Glu Thr  Asn Asn Thr 
    1700                 1705                 1710             


Tyr Pro  Glu Val Ile Val Leu  Asp Ala Asn Tyr Ile  Asn Glu Lys 
    1715                 1720                 1725             


Ile Asn  Val Asn Ile Asn Asp  Leu Ser Ile Arg Tyr  Val Trp Ser 
    1730                 1735                 1740             


Asn Asp  Gly Asn Asp Phe Ile  Leu Met Ser Thr Ser  Glu Glu Asn 
    1745                 1750                 1755             


Lys Val  Ser Gln Val Lys Ile  Arg Phe Val Asn Val  Phe Lys Asp 
    1760                 1765                 1770             


Lys Thr  Leu Ala Asn Lys Leu  Ser Phe Asn Phe Ser  Asp Lys Gln 
    1775                 1780                 1785             


Asp Val  Pro Val Ser Glu Ile  Ile Leu Ser Phe Thr  Pro Ser Tyr 
    1790                 1795                 1800             


Tyr Glu  Asp Gly Leu Ile Gly  Tyr Asp Leu Gly Leu  Val Ser Leu 
    1805                 1810                 1815             


Tyr Asn  Glu Lys Phe Tyr Ile  Asn Asn Phe Gly Met  Met Val Ser 
    1820                 1825                 1830             


Gly Leu  Ile Tyr Ile Asn Asp  Ser Leu Tyr Tyr Phe  Lys Pro Pro 
    1835                 1840                 1845             


Val Asn  Asn Leu Ile Thr Gly  Phe Val Thr Val Gly  Asp Asp Lys 
    1850                 1855                 1860             


Tyr Tyr  Phe Asn Pro Ile Asn  Gly Gly Ala Ala Ser  Ile Gly Glu 
    1865                 1870                 1875             


Thr Ile  Ile Asp Asp Lys Asn  Tyr Tyr Phe Asn Gln  Ser Gly Val 
    1880                 1885                 1890             


Leu Gln  Thr Gly Val Phe Ser  Thr Glu Asp Gly Phe  Lys Tyr Phe 
    1895                 1900                 1905             


Ala Pro  Ala Asn Thr Leu Asp  Glu Asn Leu Glu Gly  Glu Ala Ile 
    1910                 1915                 1920             


Asp Phe  Thr Gly Lys Leu Ile  Ile Asp Glu Asn Ile  Tyr Tyr Phe 
    1925                 1930                 1935             


Asp Asp  Asn Tyr Arg Gly Ala  Val Glu Trp Lys Glu  Leu Asp Gly 
    1940                 1945                 1950             


Glu Met  His Tyr Phe Ser Pro  Glu Thr Gly Lys Ala  Phe Lys Gly 
    1955                 1960                 1965             


Leu Asn  Gln Ile Gly Asp Tyr  Lys Tyr Tyr Phe Asn  Ser Asp Gly 
    1970                 1975                 1980             


Val Met  Gln Lys Gly Phe Val  Ser Ile Asn Asp Asn  Lys His Tyr 
    1985                 1990                 1995             


Phe Asp  Asp Ser Gly Val Met  Lys Val Gly Tyr Thr  Glu Ile Asp 
    2000                 2005                 2010             


Gly Lys  His Phe Tyr Phe Ala  Glu Asn Gly Glu Met  Gln Ile Gly 
    2015                 2020                 2025             


Val Phe  Asn Thr Glu Asp Gly  Phe Lys Tyr Phe Ala  His His Asn 
    2030                 2035                 2040             


Glu Asp  Leu Gly Asn Glu Glu  Gly Glu Glu Ile Ser  Tyr Ser Gly 
    2045                 2050                 2055             


Ile Leu  Asn Phe Asn Asn Lys  Ile Tyr Tyr Phe Asp  Asp Ser Phe 
    2060                 2065                 2070             


Thr Ala  Val Val Gly Trp Lys  Asp Leu Glu Asp Gly  Ser Lys Tyr 
    2075                 2080                 2085             


Tyr Phe  Asp Glu Asp Thr Ala  Glu Ala Tyr Ile Gly  Leu Ser Leu 
    2090                 2095                 2100             


Ile Asn  Asp Gly Gln Tyr Tyr  Phe Asn Asp Asp Gly  Ile Met Gln 
    2105                 2110                 2115             


Val Gly  Phe Val Thr Ile Asn  Asp Lys Val Phe Tyr  Phe Ser Asp 
    2120                 2125                 2130             


Ser Gly  Ile Ile Glu Ser Gly  Val Gln Asn Ile Asp  Asp Asn Tyr 
    2135                 2140                 2145             


Phe Tyr  Ile Asp Asp Asn Gly  Ile Val Gln Ile Gly  Val Phe Asp 
    2150                 2155                 2160             


Thr Ser  Asp Gly Tyr Lys Tyr  Phe Ala Pro Ala Asn  Thr Val Asn 
    2165                 2170                 2175             


Asp Asn  Ile Tyr Gly Gln Ala  Val Glu Tyr Ser Gly  Leu Val Arg 
    2180                 2185                 2190             


Val Gly  Glu Asp Val Tyr Tyr  Phe Gly Glu Thr Tyr  Thr Ile Glu 
    2195                 2200                 2205             


Thr Gly  Trp Ile Tyr Asp Met  Glu Asn Glu Ser Asp  Lys Tyr Tyr 
    2210                 2215                 2220             


Phe Asn  Pro Glu Thr Lys Lys  Ala Cys Lys Gly Ile  Asn Leu Ile 
    2225                 2230                 2235             


Asp Asp  Ile Lys Tyr Tyr Phe  Asp Glu Lys Gly Ile  Met Arg Thr 
    2240                 2245                 2250             


Gly Leu  Ile Ser Phe Glu Asn  Asn Asn Tyr Tyr Phe  Asn Glu Asn 
    2255                 2260                 2265             


Gly Glu  Met Gln Phe Gly Tyr  Ile Asn Ile Glu Asp  Lys Met Phe 
    2270                 2275                 2280             


Tyr Phe  Gly Glu Asp Gly Val  Met Gln Ile Gly Val  Phe Asn Thr 
    2285                 2290                 2295             


Pro Asp  Gly Phe Lys Tyr Phe  Ala His Gln Asn Thr  Leu Asp Glu 
    2300                 2305                 2310             


Asn Phe  Glu Gly Glu Ser Ile  Asn Tyr Thr Gly Trp  Leu Asp Leu 
    2315                 2320                 2325             


Asp Glu  Lys Arg Tyr Tyr Phe  Thr Asp Glu Tyr Ile  Ala Ala Thr 
    2330                 2335                 2340             


Gly Ser  Val Ile Ile Asp Gly  Glu Glu Tyr Tyr Phe  Asp Pro Asp 
    2345                 2350                 2355             


Thr Ala  Gln Leu Val Ile Ser  Glu 
    2360                 2365     


<210> 4
<211> 7101
<212> DNA
<213> Clostridium difficile

<400> 4
atgagtttag ttaatagaaa acagttagaa aaaatggcaa atgtaagatt tcgtactcaa       60

gaagatgaat atgttgcaat attggatgct ttagaagaat atcataatat gtcagagaat      120

actgtagtcg aaaaatattt aaaattaaaa gatataaata gtttaacaga tatttatata      180

gatacatata aaaaatctgg tagaaataaa gccttaaaaa aatttaagga atatctagtt      240

acagaagtat tagagctaaa gaataataat ttaactccag ttgagaaaaa tttacatttt      300

gtttggattg gaggtcaaat aaatgacact gctattaatt atataaatca atggaaagat      360

gtaaatagtg attataatgt taatgttttt tatgatagta atgcattttt gataaacaca      420

ttgaaaaaaa ctgtagtaga atcagcaata aatgatacac ttgaatcatt tagagaaaac      480

ttaaatgacc ctagatttga ctataataaa ttcttcagaa aacgtatgga aataatttat      540

gataaacaga aaaatttcat aaactactat aaagctcaaa gagaagaaaa tcctgaactt      600

ataattgatg atattgtaaa gacatatctt tcaaatgagt attcaaagga gatagatgaa      660

cttaatacct atattgaaga atccttaaat aaaattacac agaatagtgg aaatgatgtt      720

agaaactttg aagaatttaa aaatggagag tcattcaact tatatgaaca agagttggta      780

gaaaggtgga atttagctgc tgcttctgac atattaagaa tatctgcatt aaaagaaatt      840

ggtggtatgt atttagatgt tgatatgtta ccaggaatac aaccagactt atttgagtct      900

atagagaaac ctagttcagt aacagtggat ttttgggaaa tgacaaagtt agaagctata      960

atgaaataca aagaatatat accagaatat acctcagaac attttgacat gttagacgaa     1020

gaagttcaaa gtagttttga atctgttcta gcttctaagt cagataaatc agaaatattc     1080

tcatcacttg gtgatatgga ggcatcacca ctagaagtta aaattgcatt taatagtaag     1140

ggtattataa atcaagggct aatttctgtg aaagactcat attgtagcaa tttaatagta     1200

aaacaaatcg agaatagata taaaatattg aataatagtt taaatccagc tattagcgag     1260

gataatgatt ttaatactac aacgaatacc tttattgata gtataatggc tgaagctaat     1320

gcagataatg gtagatttat gatggaacta ggaaagtatt taagagttgg tttcttccca     1380

gatgttaaaa ctactattaa cttaagtggc cctgaagcat atgcggcagc ttatcaagat     1440

ttattaatgt ttaaagaagg cagtatgaat atccatttga tagaagctga tttaagaaac     1500

tttgaaatct ctaaaactaa tatttctcaa tcaactgaac aagaaatggc tagcttatgg     1560

tcatttgacg atgcaagagc taaagctcaa tttgaagaat ataaaaggaa ttattttgaa     1620

ggttctcttg gtgaagatga taatcttgat ttttctcaaa atatagtagt tgacaaggag     1680

tatcttttag aaaaaatatc ttcattagca agaagttcag agagaggata tatacactat     1740

attgttcagt tacaaggaga taaaattagt tatgaagcag catgtaactt atttgcaaag     1800

actccttatg atagtgtact gtttcagaaa aatatagaag attcagaaat tgcatattat     1860

tataatcctg gagatggtga aatacaagaa atagacaagt ataaaattcc aagtataatt     1920

tctgatagac ctaagattaa attaacattt attggtcatg gtaaagatga atttaatact     1980

gatatatttg caggttttga tgtagattca ttatccacag aaatagaagc agcaatagat     2040

ttagctaaag aggatatttc tcctaagtca atagaaataa atttattagg atgtaatatg     2100

tttagctact ctatcaacgt agaggagact tatcctggaa aattattact taaagttaaa     2160

gataaaatat cagaattaat gccatctata agtcaagact ctattatagt aagtgcaaat     2220

caatatgaag ttagaataaa tagtgaagga agaagagaat tattggatca ttctggtgaa     2280

tggataaata aagaagaaag tattataaag gatatttcat caaaagaata tatatcattt     2340

aatcctaaag aaaataaaat tacagtaaaa tctaaaaatt tacctgagct atctacatta     2400

ttacaagaaa ttagaaataa ttctaattca agtgatattg aactagaaga aaaagtaatg     2460

ttaacagaat gtgagataaa tgttatttca aatatagata cgcaaattgt tgaggaaagg     2520

attgaagaag ctaagaattt aacttctgac tctattaatt atataaaaga tgaatttaaa     2580

ctaatagaat ctatttctga tgcactatgt gacttaaaac aacagaatga attagaagat     2640

tctcatttta tatcttttga ggacatatca gagactgatg agggatttag tataagattt     2700

attaataaag aaactggaga atctatattt gtagaaactg aaaaaacaat attctctgaa     2760

tatgctaatc atataactga agagatttct aagataaaag gtactatatt tgatactgta     2820

aatggtaagt tagtaaaaaa agtaaattta gatactacac acgaagtaaa tactttaaat     2880

gctgcatttt ttatacaatc attaatagaa tataatagtt ctaaagaatc tcttagtaat     2940

ttaagtgtag caatgaaagt ccaagtttac gctcaattat ttagtactgg tttaaatact     3000

attacagatg cagccaaagt tgttgaatta gtatcaactg cattagatga aactatagac     3060

ttacttccta cattatctga aggattacct ataattgcaa ctattataga tggtgtaagt     3120

ttaggtgcag caatcaaaga gctaagtgaa acgagtgacc cattattaag acaagaaata     3180

gaagctaaga taggtataat ggcagtaaat ttaacaacag ctacaactgc aatcattact     3240

tcatctttgg ggatagctag tggatttagt atacttttag ttcctttagc aggaatttca     3300

gcaggtatac caagcttagt aaacaatgaa cttgtacttc gagataaggc aacaaaggtt     3360

gtagattatt ttaaacatgt ttcattagtt gaaactgaag gagtatttac tttattagat     3420

gataaaataa tgatgccaca agatgattta gtgatatcag aaatagattt taataataat     3480

tcaatagttt taggtaaatg tgaaatctgg agaatggaag gtggttcagg tcatactgta     3540

actgatgata tagatcactt cttttcagca ccatcaataa catatagaga gccacactta     3600

tctatatatg acgtattgga agtacaaaaa gaagaacttg atttgtcaaa agatttaatg     3660

gtattaccta atgctccaaa tagagtattt gcttgggaaa caggatggac accaggttta     3720

agaagcttag aaaatgatgg cacaaaactg ttagaccgta taagagataa ctatgaaggt     3780

gagttttatt ggagatattt tgcttttata gctgatgctt taataacaac attaaaacca     3840

agatatgaag atactaatat aagaataaat ttagatagta atactagaag ttttatagtt     3900

ccaataataa ctacagaata tataagagaa aaattatcat attctttcta tggttcagga     3960

ggaacttatg cattgtctct ttctcaatat aatatgggta taaatataga attaagtgaa     4020

agtgatgttt ggattataga tgttgataat gttgtgagag atgtaactat agaatctgat     4080

aaaattaaaa aaggtgattt aatagaaggt attttatcta cactaagtat tgaagagaat     4140

aaaattatct taaatagcca tgagattaat ttttctggtg aggtaaatgg aagtaatgga     4200

tttgtttctt taacattttc aattttagaa ggaataaatg caattataga agttgattta     4260

ttatctaaat catataaatt acttatttct ggcgaattaa aaatattgat gttaaattca     4320

aatcatattc aacagaaaat agattatata ggattcaata gcgaattaca gaaaaatata     4380

ccatatagct ttgtagatag tgaaggaaaa gagaatggtt ttattaatgg ttcaacaaaa     4440

gaaggtttat ttgtatctga attacctgat gtagttctta taagtaaggt ttatatggat     4500

gatagtaagc cttcatttgg atattatagt aataatttga aagatgtcaa agttataact     4560

aaagataatg ttaatatatt aacaggttat tatcttaagg atgatataaa aatctctctt     4620

tctttgactc tacaagatga aaaaactata aagttaaata gtgtgcattt agatgaaagt     4680

ggagtagctg agattttgaa gttcatgaat agaaaaggta atacaaatac ttcagattct     4740

ttaatgagct ttttagaaag tatgaatata aaaagtattt tcgttaattt cttacaatct     4800

aatattaagt ttatattaga tgctaatttt ataataagtg gtactacttc tattggccaa     4860

tttgagttta tttgtgatga aaatgataat atacaaccat atttcattaa gtttaataca     4920

ctagaaacta attatacttt atatgtagga aatagacaaa atatgatagt ggaaccaaat     4980

tatgatttag atgattctgg agatatatct tcaactgtta tcaatttctc tcaaaagtat     5040

ctttatggaa tagacagttg tgttaataaa gttgtaattt caccaaatat ttatacagat     5100

gaaataaata taacgcctgt atatgaaaca aataatactt atccagaagt tattgtatta     5160

gatgcaaatt atataaatga aaaaataaat gttaatatca atgatctatc tatacgatat     5220

gtatggagta atgatggtaa tgattttatt cttatgtcaa ctagtgaaga aaataaggtg     5280

tcacaagtta aaataagatt cgttaatgtt tttaaagata agactttggc aaataagcta     5340

tcttttaact ttagtgataa acaagatgta cctgtaagtg aaataatctt atcatttaca     5400

ccttcatatt atgaggatgg attgattggc tatgatttgg gtctagtttc tttatataat     5460

gagaaatttt atattaataa ctttggaatg atggtatctg gattaatata tattaatgat     5520

tcattatatt attttaaacc accagtaaat aatttgataa ctggatttgt gactgtaggc     5580

gatgataaat actactttaa tccaattaat ggtggagctg cttcaattgg agagacaata     5640

attgatgaca aaaattatta tttcaaccaa agtggagtgt tacaaacagg tgtatttagt     5700

acagaagatg gatttaaata ttttgcccca gctaatacac ttgatgaaaa cctagaagga     5760

gaagcaattg attttactgg aaaattaatt attgacgaaa atatttatta ttttgatgat     5820

aattatagag gagctgtaga atggaaagaa ttagatggtg aaatgcacta ttttagccca     5880

gaaacaggta aagcttttaa aggtctaaat caaataggtg attataaata ctatttcaat     5940

tctgatggag ttatgcaaaa aggatttgtt agtataaatg ataataaaca ctattttgat     6000

gattctggtg ttatgaaagt aggttacact gaaatagatg gcaagcattt ctactttgct     6060

gaaaacggag aaatgcaaat aggagtattt aatacagaag atggatttaa atattttgct     6120

catcataatg aagatttagg aaatgaagaa ggtgaagaaa tctcatattc tggtatatta     6180

aatttcaata ataaaattta ctattttgat gattcattta cagctgtagt tggatggaaa     6240

gatttagagg atggttcaaa gtattatttt gatgaagata cagcagaagc atatataggt     6300

ttgtcattaa taaatgatgg tcaatattat tttaatgatg atggaattat gcaagttgga     6360

tttgtcacta taaatgataa agtcttctac ttctctgact ctggaattat agaatctgga     6420

gtacaaaaca tagatgacaa ttatttctat atagatgata atggtatagt tcaaattggt     6480

gtatttgata cttcagatgg atataaatat tttgcacctg ctaatactgt aaatgataat     6540

atttacggac aagcagttga atatagtggt ttagttagag ttggtgaaga tgtatattat     6600

tttggagaaa catatacaat tgagactgga tggatatatg atatggaaaa tgaaagtgat     6660

aaatattatt tcaatccaga aactaaaaaa gcatgcaaag gtattaattt aattgatgat     6720

ataaaatatt attttgatga gaagggcata atgagaacgg gtcttatatc atttgaaaat     6780

aataattatt actttaatga gaatggtgaa atgcaatttg gttatataaa tatagaagat     6840

aagatgttct attttggtga agatggtgtc atgcagattg gagtatttaa tacaccagat     6900

ggatttaaat actttgcaca tcaaaatact ttggatgaga attttgaggg agaatcaata     6960

aactatactg gttggttaga tttagatgaa aagagatatt attttacaga tgaatatatt     7020

gcagcaactg gttcagttat tattgatggt gaggagtatt attttgatcc tgatacagct     7080

caattagtga ttagtgaata g                                               7101


<210> 5
<211> 184
<212> PRT
<213> Clostridium difficile

<400> 5
Met Gln Lys Ser Phe Tyr Glu Leu Ile Val Leu Ala Arg Asn Asn Ser 
1               5                   10                  15      


Val Asp Asp Leu Gln Glu Ile Leu Phe Met Phe Lys Pro Leu Val Lys 
            20                  25                  30          


Lys Leu Ser Arg Val Leu His Tyr Glu Glu Gly Glu Thr Asp Leu Ile 
        35                  40                  45              


Ile Phe Phe Ile Glu Leu Ile Lys Asn Ile Lys Leu Ser Ser Phe Ser 
    50                  55                  60                  


Glu Lys Ser Asp Ala Ile Ile Val Lys Tyr Ile His Lys Ser Leu Leu 
65                  70                  75                  80  


Asn Lys Thr Phe Glu Leu Ser Arg Arg Tyr Ser Lys Met Lys Phe Asn 
                85                  90                  95      


Phe Val Glu Phe Asp Glu Asn Ile Leu Asn Met Lys Asn Asn Tyr Gln 
            100                 105                 110         


Ser Lys Ser Val Phe Glu Glu Asp Ile Cys Phe Phe Glu Tyr Ile Leu 
        115                 120                 125             


Lys Glu Leu Ser Gly Ile Gln Arg Lys Val Ile Phe Tyr Lys Tyr Leu 
    130                 135                 140                 


Lys Gly Tyr Ser Asp Arg Glu Ile Ser Val Lys Leu Lys Ile Ser Arg 
145                 150                 155                 160 


Gln Ala Val Asn Lys Ala Lys Asn Arg Ala Phe Lys Lys Ile Lys Lys 
                165                 170                 175     


Asp Tyr Glu Asn Tyr Phe Asn Leu 
            180                 


<210> 6
<211> 555
<212> DNA
<213> Clostridium difficile

<400> 6
atgcaaaagt ctttttatga attaattgtt ttagcaagaa ataactcagt agatgatttg       60

caagaaattt tatttatgtt taagccatta gtaaaaaaac ttagtagagt tttacattat      120

gaagagggag aaacagattt aataatattt tttattgaat taataaaaaa tattaaatta      180

agtagctttt cagaaaaaag cgatgctatt atagtcaaat atattcataa atcattactg      240

aataagactt ttgagttgtc tagaagatat tctaaaatga agtttaattt tgtagaattt      300

gatgaaaata tcttaaatat gaaaaataat tatcaaagta agtctgtttt tgaggaagat      360

atttgttttt tcgaatatat tttgaaagaa ttatctggta ttcaaagaaa agttattttt      420

tataaatatt taaaaggata ttctgataga gaaatatcag tgaaattaaa aatatctaga      480

caagctgtta ataaggctaa aaatagagca tttaaaaaaa taaaaaaaga ctatgaaaat      540

tattttaact tgtaa                                                       555


<210> 7
<211> 261
<212> PRT
<213> Clostridium difficile

<400> 7
Met Ala Ser Glu Val Leu Gln Lys Thr Arg Lys Ile Asn Lys Thr Leu 
1               5                   10                  15      


Gln Thr Ser Gly Gly Ser Ser Val Ser Phe Asp Leu Leu Ala Gly Ala 
            20                  25                  30          


Leu Gly Asp Val Leu Ser Ser Asn Val Tyr Val Val Ser Ala Lys Gly 
        35                  40                  45              


Lys Val Leu Gly Leu His Leu Asn Asp Val Gln Asp Ser Ser Val Ile 
    50                  55                  60                  


Glu Asp Glu Tyr Thr Lys Gln Lys Lys Phe Ser Asp Glu Tyr Thr Gln 
65                  70                  75                  80  


Asn Val Leu Lys Ile Asp Glu Thr Leu Glu Asn Leu Asn Gly Glu Lys 
                85                  90                  95      


Ile Leu Glu Ile Phe Pro Glu Glu His Gly Arg Leu Gln Lys Tyr Thr 
            100                 105                 110         


Thr Val Val Pro Ile Leu Gly Ser Gly Gln Arg Leu Gly Thr Leu Val 
        115                 120                 125             


Leu Ser Arg Tyr Ser Asn Ser Phe Asn Asp Asp Asp Leu Val Ile Ala 
    130                 135                 140                 


Glu Tyr Ser Ala Thr Val Val Gly Leu Glu Ile Leu Arg Ala Ile Gly 
145                 150                 155                 160 


Glu Glu Leu Glu Glu Glu Met Arg Lys Lys Ala Val Val Gln Met Ala 
                165                 170                 175     


Ile Gly Thr Leu Ser Tyr Ser Glu Leu Glu Ala Val Glu His Ile Phe 
            180                 185                 190         


Ala Glu Leu Asp Gly Lys Glu Gly Leu Leu Val Ala Ser Lys Ile Ala 
        195                 200                 205             


Asp Arg Val Gly Ile Thr Arg Ser Val Ile Val Asn Ala Leu Arg Lys 
    210                 215                 220                 


Phe Glu Ser Ala Gly Val Ile Glu Ser Arg Ser Leu Gly Met Lys Gly 
225                 230                 235                 240 


Thr His Ile Arg Ile Leu Asn Asp Lys Leu Thr Asp Glu Leu Lys Lys 
                245                 250                 255     


Leu Lys Asn Asn Gln 
            260     


<210> 8
<211> 786
<212> DNA
<213> Clostridium difficile

<400> 8
atggcaagtg aagtgttaca aaaaacaagg aaaataaata aaacattaca aacaagtggt       60

ggaagcagtg tctcttttga tttactggcc ggagcattgg gcgacgtttt aagttctaat      120

gtttatgtag taagtgcaaa aggtaaagta ctaggtcttc atttaaatga tgttcaagac      180

agttcagtta tagaagatga gtatactaag caaaagaaat tttcagatga atatactcaa      240

aatgtgttaa aaattgatga aacattagaa aatttaaatg gtgagaagat attagaaatc      300

tttcctgaag aacatggaag attacaaaaa tatactacag tagttccaat attaggaagc      360

ggtcaaagat taggaacatt ggtactttca agatattcaa attcattcaa tgatgatgat      420

ttagtaatag ctgaatacag tgcaactgtt gttggtcttg aaatattaag agcaataggt      480

gaagaattag aagaagaaat gagaaagaaa gctgtagttc aaatggcaat aggcactctg      540

tcctactccg agcttgaagc agttgaacat atttttgctg aattggatgg aaaagaaggt      600

ctacttgtag caagtaagat agctgataga gttggtataa ctaggtctgt aatagtaaat      660

gcacttagaa aatttgagag tgcaggtgtg atagaatcaa gatcattagg tatgaaaggt      720

actcatataa gaatacttaa tgacaaactt acagatgaat taaaaaaatt aaaaaacaat      780

caataa                                                                 786


<210> 9
<211> 338
<212> PRT
<213> Clostridium difficile

<400> 9
Met Lys Gly Asn Ile Thr Ile Lys Asp Val Ala Lys Gln Ala Gly Val 
1               5                   10                  15      


Ser Ile Ser Thr Val Ser Arg Val Ile Asn Asp Ser Lys Pro Val Thr 
            20                  25                  30          


Asp Glu Val Lys Gln Lys Val Leu Glu Val Ile Lys Glu Thr Gly Tyr 
        35                  40                  45              


Ile Pro Asn Pro Leu Ala Arg Ser Leu Val Thr Lys Lys Ser Gln Leu 
    50                  55                  60                  


Ile Gly Val Ile Val Pro Glu Val Ser Asp Ser Phe Val Asn Glu Val 
65                  70                  75                  80  


Leu Asn Gly Ile Glu Glu Val Ala Lys Met Tyr Asp Tyr Asp Ile Leu 
                85                  90                  95      


Leu Ala Asn Thr Tyr Ser Asp Lys Glu Gln Glu Leu Lys Ser Ile Asn 
            100                 105                 110         


Leu Leu Arg Ala Lys Gln Val Glu Gly Ile Val Met Ile Ser Trp Ile 
        115                 120                 125             


Val Glu Gln Glu His Ile Asn Tyr Ile Gln Asn Cys Gly Ile Pro Ala 
    130                 135                 140                 


Thr Tyr Ile Ser Lys Thr Ala Arg Asn Tyr Asp Ile Tyr Thr Val Ser 
145                 150                 155                 160 


Thr Ser Asn Glu Glu Ala Thr Phe Asp Met Thr Glu His Leu Ile Lys 
                165                 170                 175     


Lys Gly His Glu Lys Ile Ala Phe Ile Met Thr Ser Lys Asp Asp Thr 
            180                 185                 190         


Val Leu Glu Met Glu Arg Leu Ala Gly Tyr Glu Lys Ala Leu Ser Asn 
        195                 200                 205             


Asn Asn Ile Glu Leu Asp Lys Ser Leu Ile Lys Tyr Gly Gly Thr Asp 
    210                 215                 220                 


Tyr Glu Ser Gly Tyr Asn Ser Met Lys Glu Leu Leu Asp Asp Gly Ile 
225                 230                 235                 240 


Ile Pro His Ala Ala Phe Val Thr Gly Asp Glu Ala Ala Ile Gly Ala 
                245                 250                 255     


Ile Asn Ala Ile Cys Asp Ala Gly Tyr Lys Val Pro Glu Asp Ile Ser 
            260                 265                 270         


Val Ala Gly Phe Asn Asp Val Lys Ile Ala Arg Met Tyr Arg Pro Lys 
        275                 280                 285             


Leu Thr Thr Val Tyr Gln Pro Leu Tyr Asp Met Gly Ala Val Ala Ile 
    290                 295                 300                 


Arg Met Val Ile Lys Leu Ile Asn Lys Glu Leu Ile Glu Asn Lys Lys 
305                 310                 315                 320 


Ile Glu Leu Pro Tyr Arg Ile Val Asp Arg Glu Ser Val Thr Glu Arg 
                325                 330                 335     


Lys Lys 
        


<210> 10
<211> 1017
<212> DNA
<213> Clostridium difficile

<400> 10
atgaaaggca atataacgat aaaagatgtt gctaaacaag caggagtgtc aatatctact       60

gtatctagag ttataaatga ttcaaaacct gtaactgatg aagtcaaaca aaaagtttta      120

gaggttataa aagagactgg atatatacca aatccacttg ctagaagctt agtaacaaag      180

aagagtcaat taataggggt aatagttcca gaagtttcag attcttttgt taatgaggtg      240

ttaaatggga tagaagaggt tgctaaaatg tatgactatg atattctttt agcgaataca      300

tactctgata aggaacaaga acttaagagt ataaatctat tgagagcaaa acaagtggaa      360

ggtatagtta tgatttcatg gatagttgaa caagaacata tcaactatat acaaaattgt      420

ggaataccag cgacatatat aagtaaaact gctagaaatt atgatatata tacagtaagt      480

actagcaacg aagaagctac ttttgatatg acagagcatc ttataaagaa aggtcatgaa      540

aagatagctt ttataatgac gagtaaagat gatactgttt tagaaatgga aagacttgct      600

ggttatgaga aagcactttc aaataacaat atagaattag acaagagttt gattaagtat      660

ggtggaactg attatgagag tggatacaat agtatgaaag aactattaga tgatggaata      720

atacctcatg cggcttttgt aacaggtgat gaggctgcca taggtgctat aaatgctata      780

tgtgatgctg gatataaggt tccagaagac atatctgttg caggatttaa tgatgttaag      840

atagctagaa tgtatagacc taaacttact acagtatatc aacctctata cgatatggga      900

gcagtagcaa taagaatggt tataaaatta ataaataagg aattaattga aaataagaaa      960

atagaattac cttatagaat tgttgataga gaaagtgtta cagaaagaaa aaaataa        1017


<210> 11
<211> 210
<212> PRT
<213> Clostridium difficile

<400> 11
Met Leu Gly Asn Lys Asn Ile Ser Met Ala Val Ile Arg Arg Leu Pro 
1               5                   10                  15      


Lys Tyr His Arg Tyr Leu Gly Asp Leu Leu Asp Arg Asp Ile Gln Arg 
            20                  25                  30          


Ile Ser Ser Lys Glu Leu Ser Asp Ile Ile Gly Phe Thr Ala Ser Gln 
        35                  40                  45              


Ile Arg Gln Asp Leu Asn Asn Phe Gly Gly Phe Gly Gln Gln Gly Tyr 
    50                  55                  60                  


Gly Tyr Asn Val Glu Ala Leu His Thr Glu Ile Gly Lys Ile Leu Gly 
65                  70                  75                  80  


Leu Asp Arg Pro Tyr Asn Ala Val Leu Val Gly Ala Gly Asn Leu Gly 
                85                  90                  95      


Gln Ala Ile Ala Asn Tyr Ala Gly Phe Arg Lys Ala Gly Phe Glu Ile 
            100                 105                 110         


Lys Ala Leu Phe Asp Ala Asn Pro Arg Met Ile Gly Leu Lys Ile Arg 
        115                 120                 125             


Glu Phe Glu Val Leu Asp Ser Asp Thr Leu Glu Asp Phe Ile Lys Asn 
    130                 135                 140                 


Asn Asn Ile Asp Ile Ala Val Leu Cys Ile Pro Lys Asn Gly Ala Gln 
145                 150                 155                 160 


Glu Val Ile Asn Arg Val Val Lys Ala Gly Ile Lys Gly Val Trp Asn 
                165                 170                 175     


Phe Ala Pro Leu Asp Leu Glu Val Pro Lys Gly Val Ile Val Glu Asn 
            180                 185                 190         


Val Asn Leu Thr Glu Ser Leu Phe Thr Leu Ser Tyr Leu Met Lys Glu 
        195                 200                 205             


Gly Lys 
    210 


<210> 12
<211> 633
<212> DNA
<213> Clostridium difficile

<400> 12
atgttgggaa ataaaaatat atcaatggca gttataagaa ggctcccaaa atatcataga       60

tatcttggag acttattaga tagggatata caaagaatat cttctaaaga attgagtgat      120

ataatagggt ttaccgcttc tcaaataaga caagatttaa acaactttgg tggatttgga      180

caacaaggat atggttataa tgtagaagct cttcatactg agataggtaa aattcttggg      240

ttggatcgac catacaacgc agttcttgta ggagcaggta acttaggaca agctatagcc      300

aattatgcag gatttagaaa agctggattc gagataaaag ctttatttga tgcaaatcct      360

agaatgatag gtttaaagat aagagagttt gaagtattag attcagatac tttagaagac      420

tttataaaaa acaataatat agatattgct gtattatgta tacctaaaaa tggagcacaa      480

gaagttatta atagagttgt aaaagctgga atcaaaggtg tatggaattt tgcaccttta      540

gatttagaag ttccgaaagg tgttatagtt gaaaatgtaa acttaacaga aagtttattt      600

accttatcgt atttaatgaa agaaggaaag tag                                   633


<210> 13
<211> 626
<212> PRT
<213> Clostridium difficile

<400> 13
Met Ser Ile Thr Leu Glu Thr Ala Gln Ala His Ala Asn Asp Pro Ala 
1               5                   10                  15      


Val Cys Cys Cys Arg Phe Glu Ala Gly Thr Ile Ile Ala Pro Glu Asn 
            20                  25                  30          


Leu Glu Asp Pro Ala Ile Phe Ala Asp Leu Glu Asp Ser Gly Leu Leu 
        35                  40                  45              


Thr Ile Pro Glu Asn Gly Leu Thr Ile Gly Gln Val Leu Gly Ala Lys 
    50                  55                  60                  


Leu Lys Glu Thr Leu Asp Ala Leu Ser Pro Met Thr Thr Asp Asn Val 
65                  70                  75                  80  


Glu Gly Tyr Lys Ala Gly Glu Ala Lys Glu Glu Val Val Glu Glu Thr 
                85                  90                  95      


Val Glu Glu Ala Ala Pro Val Ser Glu Ala Ala Val Val Pro Val Ser 
            100                 105                 110         


Thr Gly Val Ala Gly Glu Thr Val Lys Ile His Ile Gly Glu Gly Lys 
        115                 120                 125             


Asn Ile Ser Leu Glu Ile Pro Leu Ser Val Ala Gly Gln Ala Gly Val 
    130                 135                 140                 


Ala Ala Pro Val Ala Asn Val Ala Ala Pro Val Ala Ser Ala Ala Ala 
145                 150                 155                 160 


Glu Val Ala Pro Lys Val Glu Glu Lys Lys Leu Leu Arg Ser Leu Thr 
                165                 170                 175     


Lys Lys His Phe Lys Ile Asp Lys Val Glu Phe Ala Asp Glu Thr Lys 
            180                 185                 190         


Ile Glu Gly Thr Thr Leu Tyr Ile Arg Asn Ala Glu Glu Ile Cys Lys 
        195                 200                 205             


Glu Ala Asn Glu Thr Gln Glu Leu Val Val Asp Met Lys Leu Glu Ile 
    210                 215                 220                 


Ile Thr Pro Asp Lys Tyr Glu Thr Tyr Ser Glu Ala Val Leu Asp Ile 
225                 230                 235                 240 


Gln Pro Ile Ala Thr Lys Glu Glu Gly Glu Leu Gly Ser Gly Ile Thr 
                245                 250                 255     


Arg Val Ile Asp Gly Ala Val Met Val Leu Thr Gly Thr Asp Glu Asp 
            260                 265                 270         


Gly Val Gln Ile Gly Glu Phe Gly Ser Ser Glu Gly Glu Leu Asn Thr 
        275                 280                 285             


Thr Ile Met Trp Gly Arg Pro Gly Ala Ala Asp Lys Gly Glu Ile Phe 
    290                 295                 300                 


Ile Lys Gly Gln Val Thr Ile Lys Ala Gly Thr Asn Met Glu Arg Pro 
305                 310                 315                 320 


Gly Pro Leu Ala Ala His Arg Ala Phe Asp Tyr Val Thr Gln Glu Ile 
                325                 330                 335     


Arg Glu Ala Leu Lys Lys Val Asp Asn Ser Leu Val Val Asp Glu Glu 
            340                 345                 350         


Val Ile Glu Gln Tyr Arg Arg Glu Gly Lys Lys Lys Val Val Val Ile 
        355                 360                 365             


Lys Glu Ile Met Gly Gln Gly Ala Met His Asp Asn Leu Ile Leu Pro 
    370                 375                 380                 


Val Glu Pro Val Gly Thr Leu Gly Ala Gln Pro Asn Val Asp Leu Gly 
385                 390                 395                 400 


Asn Met Pro Val Val Leu Ser Pro Leu Glu Val Leu Asp Gly Gly Ile 
                405                 410                 415     


His Ala Leu Thr Cys Ile Gly Pro Ala Ser Lys Glu Met Ser Arg His 
            420                 425                 430         


Tyr Trp Arg Glu Pro Leu Val Ile Arg Ala Met Glu Asp Glu Glu Ile 
        435                 440                 445             


Asp Leu Val Gly Val Val Phe Val Gly Ser Pro Gln Val Asn Ala Glu 
    450                 455                 460                 


Lys Phe Tyr Val Ser Lys Arg Leu Gly Met Leu Val Glu Ala Met Glu 
465                 470                 475                 480 


Val Asp Gly Ala Val Val Thr Thr Glu Gly Phe Gly Asn Asn His Ile 
                485                 490                 495     


Asp Phe Ala Ser His Ile Glu Gln Ile Gly Met Arg Gly Ile Pro Val 
            500                 505                 510         


Val Gly Val Ser Phe Ser Ala Val Gln Gly Ala Leu Val Val Gly Asn 
        515                 520                 525             


Lys Tyr Met Thr His Met Val Asp Asn Asn Lys Ser Lys Gln Gly Ile 
    530                 535                 540                 


Glu Asn Glu Ile Leu Ser Asn Asn Thr Leu Ala Pro Glu Asp Ala Val 
545                 550                 555                 560 


Arg Ile Met Ala Met Leu Lys Asn Ala Ile Glu Gly Val Glu Val Lys 
                565                 570                 575     


Ala Pro Glu Arg Lys Trp Asn Pro Asn Val Lys Leu Asn Asn Ile Glu 
            580                 585                 590         


Ala Ile Glu Lys Val Thr Gly Glu Lys Ile Val Leu Glu Glu Asn Glu 
        595                 600                 605             


Gln Ser Leu Pro Met Ser Lys Lys Arg Arg Glu Ile Tyr Glu Lys Asp 
    610                 615                 620                 


Glu Asn 
625     


<210> 14
<211> 1881
<212> DNA
<213> Clostridium difficile

<400> 14
atgtcaataa ctttagaaac agctcaagcc catgcaaatg acccagcagt ttgttgttgt       60

agatttgaag cgggaacaat tatagcgcca gaaaacttag aagatccagc aatatttgca      120

gacttagagg attctggatt attaacaata ccagaaaatg gattaactat aggtcaagta      180

ctaggagcta agttaaaaga aactttagat gcactttctc caatgactac agataacgta      240

gaaggataca aagcaggaga ggctaaagaa gaagtagtag aagaaacagt agaagaagca      300

gctccagtat cagaagcagc agtagttcca gtaagcacag gagttgcagg tgaaacagtt      360

aaaatacaca taggtgaagg taagaacata agcttagaga tacctttatc agtagctggt      420

caagcaggag ttgctgctcc agtagcaaac gttgctgctc cagtggcaag tgcagcagca      480

gaagtagctc caaaagttga agaaaagaaa cttttaagaa gcttaactaa aaaacacttt      540

aaaatagata aagttgaatt tgctgatgaa actaaaatag aaggaactac tttatacatc      600

agaaacgcag aagaaatatg taaagaagct aatgaaactc aagagttagt tgtagatatg      660

aagttagaaa taataactcc tgataaatat gaaacttaca gtgaagctgt attagatata      720

caaccaatcg ctactaaaga agaaggcgaa ttaggttcag gtataactag agttatagat      780

ggagctgtaa tggtattaac tggtacagat gaagatggag ttcaaatagg tgaatttggt      840

tcttcagaag gtgagttaaa tactactata atgtggggta gaccaggtgc tgctgacaaa      900

ggtgaaatat tcatcaaagg tcaagtaaca ataaaagcag gaactaacat ggaaagacca      960

ggacctttag ctgctcaccg tgcatttgac tatgtaactc aagaaataag agaagcatta     1020

aagaaagttg acaactcttt agtagttgat gaagaagtaa tagagcaata cagaagagaa     1080

ggtaaaaaga aagttgttgt tataaaagaa ataatgggac aaggtgcaat gcatgataac     1140

ctaatattac cagttgagcc agttggtaca ttaggagctc aaccaaacgt tgacttagga     1200

aacatgccag ttgtattatc tccacttgaa gtattagatg gtggtatcca tgcattaact     1260

tgtataggac ctgcatcaaa agaaatgtca agacattact ggagagagcc attagtaata     1320

agagctatgg aagacgaaga aatagattta gtaggtgttg tatttgttgg ttctccacaa     1380

gtaaatgctg agaaattcta tgtatctaag agattaggta tgttagttga agctatggaa     1440

gttgatggag ctgtagtaac tactgaaggt ttcggaaaca accatataga tttcgcatct     1500

cacatagagc aaataggtat gagaggtata ccagtagttg gtgtaagttt ctcagctgtt     1560

caaggtgctc tagttgttgg taataaatac atgactcaca tggtagacaa caataagtct     1620

aagcaaggta tagagaatga aatattatct aacaacactt tagctccaga agatgctgtt     1680

agaataatgg ctatgcttaa aaatgctata gaaggtgtag aagttaaagc tcctgaaaga     1740

aaatggaatc caaatgttaa attaaataac atagaagcta tagaaaaagt tacaggagaa     1800

aaaatagtat tagaagagaa tgagcaatct ctaccaatga gtaagaagag aagagaaata     1860

tacgaaaaag acgaaaacta a                                               1881


<210> 15
<211> 474
<212> DNA
<213> Clostridium bifermentans

<400> 15
atgggtatag gaccatcaac taaagaaaca tcattacatc actttagaga tccgcttctt       60

gatatagtta gtaatgacaa agacatagat cttctgggga tagtagtagt aggaacacct      120

caggacaaca aagaaaaaga atttgttgga caaagaacag ctgcatggct agaagctatg      180

agagcagatg gtgttataat ttcatgtgat gggtggggaa actcacacgt agattatgct      240

aatactattg aagaaatagg aaaaagagag atcccggtag ttggacttac atttaatgga      300

acacaagcta agtttgtagt tacaaataaa tatatggaca caatagtaga ttttaataaa      360

tcagacaagg ggatagaaac agaagttgtc ggagagaaca ctgtaagcga gttagacgca      420

aaaaaatcat tagccttatt aaaattaaaa atgcaaagaa ataataaaaa ataa            474


<210> 16
<211> 1884
<212> DNA
<213> Clostridium bifermentans

<400> 16
atgtcaataa ctgtagaaac agctaaagct catgctaaag atccagcggt atgctgctgt       60

agatttgaag ctgggactgt actagaacca tcaaatttag aagatccagc aatattcgct      120

gacttagagg attcaggatt attaacaata gcagatgatt gtttaacaat agagcaagtt      180

ttaggagcta aactattaaa aactttagat gctttaactc caataactgc tgactgtgta      240

gaaggtgtag tagcagtagc tgaagaggct aaagaagaag ttaaggaaga agttaaagaa      300

gtagcaccag ttgcttcagt agctccagta tctcaaatag ctccagtaaa tggacaaact      360

ataaagatac atataggtga aggtagagat ataaacttag aaataccttt aaatgtagct      420

caaggaatgg gtgtagcacc agttgctcct gtagctgtag cagaaaatgc agaagctgta      480

gaagttaaag ctgagccagt tcaagaagct aaagcaatga gaagcttaac taaaaaacat      540

tttaaaatag aaaaagtagt tttcgctgaa gaaactaaaa tagatggaac tactttatac      600

ttaagaactc cagaagaatt aactaaagaa gctgtaaatt cagaagaatt agttgttgat      660

atgaagttag aaataataac tccagctgaa tacaacaaat acagtgaaac tataatggat      720

gttcaaccta tagctgctaa agaagaagga gaaataggag aaggtgtaac aagagttata      780

gacggagtta taatgatggt aactggtact gatgaaaacg gagttcaaat aggtgaattc      840

ggttcttcag aaggtgtatt agaaactaac ataatgtggg gaagaccagg tgctcctgat      900

aaaggtgata tattcatcaa aactcaagta acagttaaag ctggtactaa catggaaaga      960

ccaggaccat tagctgctca ctgtgcatct gattatataa ctcaagaaat aagagaagca     1020

ttaaagaacg ctgaagagtc tttagtagtt gatactgaag aattaactca atatagaaga     1080

cctggtaaga aaaaggttgt tgtagttaaa gagataatgg gacaaggggc aatgcatgat     1140

aacttaatat tacctgttga gccagttgga acattaggag ctaaaccaaa cgttgactta     1200

ggaaacgttc cagtagtatt atctccactt gaagtattag atggtggtat acatgcatta     1260

acttgtatag gacctgcatc taaagaaaac tctagacatt actggagaga gccattagta     1320

atagaagcta tgcatgatga agaaatagat ttagtaggtg ttatatttgt aggatctcca     1380

caagtaaatg ctgagaaatt ctatgtatct aagagattag gtatgatgat agaagctatg     1440

ggtgttgatg gtgctatagt aacaactgaa ggattcggaa acaaccatat agatttcgct     1500

tctcatatag agcaaatagg taagagagat gtagctgtag taggtgtaag tttctctgct     1560

gttcaaggtg ctctagttgt tggtaatgaa tacatgaaat acatgataga caacaacaag     1620

tctaaacaag gtatagaaaa tgaagtatta tcaaacaata cattatgccc agaagatgct     1680

gtaagatctt tagcaatgtt aaagacagta atgggtggag aagaagttaa agctgctgag     1740

agaaaatgga atgctaacgt taaattaaat aacgttgaat taatagaaaa agaaactggt     1800

aagaagttag aacttgttga aaacgagcaa actttaccaa tgagtgaaaa aagaaagaat     1860

atatacgaaa aagacgctaa atag                                            1884


<210> 17
<211> 759
<212> DNA
<213> Clostridium bifermentans

<400> 17
atggaagaga aaatacttag acgtttggta attaaaccat ttcatataaa taatgttgaa       60

ttcaatgaaa agttctcaat aaaaaaaggt acactatcca taaacaatga ctacataaat      120

gaaattaaaa attcacatga attaataacg gacataaaat tagatataat caaaccagga      180

gattataaca aggaaattaa tactatcatg gatataatcc ctatatctac taaagtttta      240

ggtagattag gtgaaggaat aacacacact ttaacaggtg tttatgttat gcttactggt      300

gttgatgaag atggaagaca aatgcatgaa tttggatctt cagaaggtat actttctgag      360

caaatggtgt ttggaagata tggtactcca tctactaatg attacataat tcattttgat      420

gttacagtta aaggtgggtt gccatatgag agaaaacttc cgatgatgac atttaaggca      480

tgtgatactt ttatacaagg tataagaaat gttttaaaac agcaagacgg aagagatgct      540

acagaaattc gtgaatattt tgacaaaatt agacctgacg ctaaaaaagt tgtaatagta      600

aaacaaatag caggtcaggg tgcaatgtat gacaatcaat tattttctca tgaaccaagt      660

ggtttagagg gaggtacatc cattattgat atgggaaatg taccgatgat aatatcacct      720

aatgaataca gagatggcgc cttgagagct atgacttaa                             759


<210> 18
<211> 726
<212> DNA
<213> Clostridium bifermentans

<400> 18
atgagcctta caacaataaa aggacttcaa tctgaaatat ttgtaccaat aacacctcct       60

cctgtttgga ctcctgtaac taaagaacta aaggatatga ctatagcttt agctacagcg      120

tcaggtgtac atttaaaagc tgataagaga ttcaacctag caggtgactt tacatttaga      180

gaaataccag acacagcaac tactgatgag atgatggtat ctcacggagg atatgataac      240

gctgatgtta ataaagatat aaactgtatg ttccctatag acagactaca tgaattagct      300

aaagaaggat ttataaaatc tgtagctcca gttcatatag gattcatggg tggtggtgga      360

gaccaaacta aattcactga agaaactggt cctgaaatcg ctaaaagatt aaaagatgag      420

ggagtagacg gtgtagttct aacagctggc tgaggtactt gccatagaac tgccgtgatc      480

gtgcagagag caatagaaga agctggtata ccaactataa taatagcagc tcttcctcca      540

gtagttagac aaaacggaac tccaagagca gttgctccac tagttccaat gggtgctaat      600

gctggtgaac caaacaataa agaaatgcaa atgcatatat taagagatac tttagagcaa      660

ttaatagcta taccatctgc tggtaagata attcaattac catacgagta tgtagctcaa      720

gtataa                                                                 726


<210> 19
<211> 617
<212> PRT
<213> Clostridium scindens

<400> 19
Met Ser Ile Thr Ala Glu Thr Ala Lys Glu His Ala His Asp Pro Ala 
1               5                   10                  15      


Val Leu Cys Cys Arg Ala Glu Ala Gly Ile Thr Ile Glu Ala Ala Asn 
            20                  25                  30          


Leu Glu Asp Pro Ala Ile Phe Asp Asp Leu Val Asp Ser Gly Leu Leu 
        35                  40                  45              


Asn Leu Asp Gly Ala Leu Thr Ile Glu Glu Val Leu Gly Ala Lys Leu 
    50                  55                  60                  


Thr Lys Thr Cys Asp Ser Leu Cys Pro Leu Thr Ala Asp Val Val Glu 
65                  70                  75                  80  


Gly Ala Lys Ala Pro Thr Ala Pro Ala Ala Glu Glu Ala Glu Glu Glu 
                85                  90                  95      


Ala Pro Ala Ala Pro Ala Pro Ala Ala Ala Pro Val Ala Gly Pro Ala 
            100                 105                 110         


Ala Gly Gly Thr Leu Lys Ile His Ile Gly Glu Gly Lys Asp Ile Asp 
        115                 120                 125             


Leu Glu Ile Pro Val Gly Ala Leu Gly Gly Gly Ala Ala Val Ala Pro 
    130                 135                 140                 


Leu Pro Ala Gly Ala Glu Ala Val Val Ala Gly Ala Ala Ala Pro Glu 
145                 150                 155                 160 


Ala Ala Gly Glu Glu Lys Val Val Arg Ser Leu Thr Arg Lys His Phe 
                165                 170                 175     


Thr Ile Thr Glu Val Lys Arg Gly Pro Glu Thr Lys Ile Glu Gly Thr 
            180                 185                 190         


Thr Leu Tyr Ile Arg Glu Gly Ile Glu Ser Glu Val Ile Asp Asn Gln 
        195                 200                 205             


Glu Leu Val Lys Asp Phe Lys Leu Glu Ile Ile Thr Pro Asp Leu Tyr 
    210                 215                 220                 


His Thr Tyr Ser Glu Thr Val Met Asp Val Gln Pro Ile Ala Thr Lys 
225                 230                 235                 240 


Glu Gly Asp Asp Glu Leu Gly Thr Gly Val Thr Arg Val Leu Asp Gly 
                245                 250                 255     


Val Val Met Met Leu Thr Gly Val Asp Glu Gly Gly Val Gln Ile Gly 
            260                 265                 270         


Glu Phe Gly Ser Ser Glu Gly Tyr Leu Asp Glu Asn Ile Met Trp Asn 
        275                 280                 285             


Arg Pro Ser Cys Pro Asp Lys Gly Glu Ile Phe Ile Lys Gly Asn Ile 
    290                 295                 300                 


Val Ile Gln Glu Lys Thr Asn Met Glu Arg Arg Gly Pro Met Ala Ala 
305                 310                 315                 320 


His Thr Ala Phe Asp Val Ile Thr Gln Glu Ile Arg Glu Val Met Lys 
                325                 330                 335     


Lys Leu Asp Asp Ser Leu Val Ala Asp Thr Glu Glu Leu Lys Gln Val 
            340                 345                 350         


Arg Arg Pro Gly Lys Lys Lys Val Val Ile Val Lys Glu Ile Met Gly 
        355                 360                 365             


Gln Gly Ala Met His Asp Asn Phe Ile Leu Pro Val Glu Pro Val Gly 
    370                 375                 380                 


Val Leu Gly Ala Arg Ala Asn Val Asp Leu Gly Asn Val Pro Val Cys 
385                 390                 395                 400 


Val Ser Pro Leu Glu Val Leu Asp Gly Cys Ile His Ala Leu Thr Cys 
                405                 410                 415     


Ile Gly Pro Ala Ser Lys Glu Met Ser Arg His Tyr Trp Arg Glu Pro 
            420                 425                 430         


Leu Val Leu Glu Ala Leu His Asp Pro Glu Val Asp Leu Cys Gly Val 
        435                 440                 445             


Val Phe Val Gly Ser Pro Gln Ile Asn Ala Glu Lys Phe Tyr Val Ser 
    450                 455                 460                 


Arg Arg Val Gly His Thr Val Glu Met Met Asp Ala Asp Gly Ala Phe 
465                 470                 475                 480 


Val Thr Thr Glu Gly Phe Gly Asn Asn His Ile Asp Phe Ala Ser His 
                485                 490                 495     


Ile Glu Gln Ile Gly Met Arg Gly Ile Pro Val Val Gly Met Ser Tyr 
            500                 505                 510         


Cys Ala Val Gln Gly Ala Leu Val Val Gly Asn Lys Tyr Met Thr Tyr 
        515                 520                 525             


Met Val Asp Asn Asn Lys Ser Glu Ala Gly Ile Glu Asn Glu Ile Leu 
    530                 535                 540                 


Gly Asn Asn Thr Leu Cys Pro Glu Asp Ala Val Arg Ala Leu Ala Met 
545                 550                 555                 560 


Leu Lys Thr Ala Met Ala Gly Glu Asp Val Lys Ala Ala Glu Lys Lys 
                565                 570                 575     


Trp Asn Pro Asn Val Lys Ser Thr Asn Val Glu Leu Ile Glu Ser Thr 
            580                 585                 590         


Tyr Gly Thr Lys Val Asp Leu Val Glu Asn Glu Gln Ala Leu Pro Met 
        595                 600                 605             


Ser Glu Lys Arg Arg Leu Lys Tyr Ser 
    610                 615         


<210> 20
<211> 768
<212> DNA
<213> Clostridium scindens

<400> 20
ttggctgaag aggtaaaaga cctgagacgt cttgtaatta aagcgttcca catgaatgat       60

gtagagtggg gtgaacataa tgatattact gttgacggta atatgacagt cagtaaagaa      120

atgattgatc agctggtggc tcaggaggaa cacattgaaa aaattgatat tcagattatt      180

aagccggggg atcatgaccg ttggacgaat acgattatgg atatcatacc gatctctaca      240

aaggtacttg gaaaattagg ggagggcatt acccatacca ttaccggcgt atatgtaatg      300

cttaccggcg ttgacgtaaa tggaaagcaa tgccatgaat tcggttcttc tgaggggaat      360

ctgaaagacc agctgtactt gaaccgtgca ggcacgccgg gggatgatga ttacataatt      420

tcctttgatg taacgcttgc agccggaatg gggcaggaga ggcctggacc gactgccgca      480

catagggcgt gcgataagtt tatccagaca taccgtgata agatgaagaa gttcaaaggc      540

gagaagtgta cggaacgcca tgagtaccat gatgtggtaa ggccgggaaa gaaacgcgtc      600

ctgatcgtaa agcaggtggc aggacaggga gcaatgtatg atacgcatct gttttccaaa      660

gagccgtctg gcgtagaggg cggacgttca attatcgata tgggcaatat gccgatcctt      720

gtaactccaa atgagtacag agacggtatt atccgctcca tgcagtag                   768


<210> 21
<211> 1854
<212> DNA
<213> Clostridium scindens

<400> 21
atgtcaatca cagctgaaac agcgaaagaa catgctcatg atcctgcggt attatgttgt       60

agagccgaag caggcattac aatcgaagct gctaatcttg aagatccggc gatctttgat      120

gacttggtag attcaggatt attgaacctg gatggtgcat tgaccatcga agaagttttg      180

ggagcaaaac ttacaaaaac atgtgattct ctttgcccgt taactgcaga tgtagttgaa      240

ggtgcaaaag cgccgactgc tccagcagca gaagaggcag aagaggaagc gccggcagca      300

ccggcaccgg ctgcagcacc tgtagcagga cctgcggcag gcggaacact taagatccac      360

attggagaag gcaaggacat tgatcttgag atcccagttg gagcgcttgg cggcggagca      420

gcagttgcac cattgccggc aggagcagag gcagttgttg caggagcagc agcaccagaa      480

gcagctggag aagaaaaggt tgtaagaagt ttaacaagaa aacacttcac gatcacagag      540

gttaagagag gaccagagac caagatcgaa ggaacaactc tttacatccg tgaaggcatt      600

gagtcagaag ttattgacaa ccaggagctt gtaaaagatt tcaaactgga aatcatcact      660

cctgatttat atcacacata ttccgagact gttatggacg ttcagccaat cgctacaaaa      720

gaaggcgatg atgaactcgg aacaggtgtt acaagagtac ttgacggcgt tgttatgatg      780

ctgacaggtg ttgacgaagg cggagttcag attggcgagt tcggttcttc agaaggatac      840

cttgatgaga acattatgtg gaatcgtccg agctgcccag ataaaggcga gatctttatc      900

aagggtaaca tcgtaatcca ggaaaagaca aacatggaac gtcgtggacc tatggctgct      960

catacagcat ttgatgtaat cacacaggaa atccgcgaag ttatgaagaa acttgatgac     1020

agccttgttg ctgatacgga agaactgaag caggttcgcc gtccgggcaa gaagaaagtc     1080

gttatcgtta aggaaatcat gggacaggga gctatgcatg acaactttat ccttcctgta     1140

gagcctgttg gcgttctagg cgcaagagct aacgtagact taggaaacgt accggtttgc     1200

gtatctccat tggaagttct tgatggatgt atccatgcat taacatgtat cggacctgca     1260

tctaaggaaa tgtccagaca ttactggaga gagccattgg ttctggaagc attgcatgac     1320

ccggaagttg acctttgcgg cgttgtattt gtaggatctc ctcagatcaa tgctgagaaa     1380

ttctatgtat cccgtcgtgt aggccatacc gtagaaatga tggatgctga tggagctttc     1440

gttacaacgg aaggttttgg aaacaaccac atcgatttcg caagccatat cgagcagatc     1500

ggtatgagag gaattccggt tgttggcatg tcttactgtg cagttcaggg cgctctggtt     1560

gttggtaaca agtatatgac atacatggtt gacaataaca agtctgaagc tggtatcgag     1620

aacgagattc ttggtaacaa tacgctttgc ccggaagatg ctgttcgtgc acttgctatg     1680

cttaagactg caatggcagg cgaagacgtt aaggctgctg agaagaagtg gaatccaaac     1740

gttaagtcta caaacgtaga gttaattgag agcacatacg gtacaaaggt tgatcttgtt     1800

gaaaatgagc aggctcttcc gatgagtgaa aaacgtagat taaaatacag ctaa           1854


<210> 22
<211> 756
<212> DNA
<213> Clostridium scindens

<400> 22
atgaatgtag gatcaaggct gacggttaag gcgtaccctg tcacagaagt gtgctatggg       60

gaggagaacc gagtgacggt ggatggccgg atgacggtct gtaagaacat agcagaaaag      120

attctggcgc aggagccatt gataaaggag attgatatcc gtattatcat gccggatgag      180

caccgacagc ataccaacac ggtgatggat gtgattcctc tggcaaccaa agtgctggga      240

cgggtggggg agggcattac ccataccctg acaggcgtat acgtgatcct taccggtgtg      300

gatgagagcg ggcgtcagat atgtaatttt ggcgccagcg acggaatact cgaggagaag      360

attgcctggg ggcgggcggg aacgccgctt aggagcgacg tgctgatctc ctttgacgtg      420

gttcttaagg aaggatcctg ggcggatcgt ccgggtccgg aagcagccca tcgcgcctgc      480

gatacatact gccagatatt ccgggagcag ataaagaagt ttaatggata caagtgcgcg      540

gaaaagcatg tctttcagga gacgtatgag ccggggaaaa aagatgtcta tattgtgaaa      600

gaagtatccg ggcaaggtgc cgtatacgat acccggatgt tcggacatga gccttgcgga      660

ttcgaaggcg ggaagtctgt tattgatatg ggctgcatgc ctgcgctggt gacgcccaat      720

gaatttaggg atggcattat gcgcgcgatg gattag                                756


<210> 23
<211> 405
<212> DNA
<213> Clostridium scindens

<400> 23
atgtctatta cagcagaaac tgcaaaagaa catgcaaatg acccggctgt attatgctgc       60

cgggcagaag agggcattac aatacaggct tccaacttgg aagatcctgc tatttttgac      120

gagttagtgg attcagggct gctatctttg gatggctgtc tgacaatcgg acaagtctta      180

ggggcaaccc tgacaaagac aagcgattct ttatgtccat tgactgcaga taacgtaggg      240

ggcttcaaag aggtagttga ggaagaagag cctgcatcag agccagtcga agaagcggta      300

gccgcagata ttaatattgg gggcgcggtc accacgatca aaaatggaaa agttgttatt      360

tcaatcaaag aaggaaaaga tatctattta gaacttcctg tttaa                      405


<210> 24
<211> 1329
<212> DNA
<213> Clostridium scindens

<400> 24
atgggaaatg tacagatttt attacgtcag catgttggtg caccctgtga ggcaatcgta       60

aaggctgggg ataaggtgga aaaaggtacc ttgattgcaa ctcctacagg acttggcgct      120

aacatctttt ccagcgtcta tggcgtggtg gaagaagtct tggaagaccg aatcgttatc      180

aagccggatg aagagcagaa agatgagttt gtacctatta aggaaggcag caagcttgag      240

atggttaagg aagccggaat cgtaggtatg ggcggcgcag gattcccaac tggcgtgaag      300

attggaacgg accttcacgg cggatatatc ctggtaaatg ctgcagaatg cgagcctgga      360

cttcgccaca atatccagca gattgaagaa aagacagata tcacaatccg cggattgaaa      420

tactgcatgg agatatccaa tgcggcaaaa ggaattattg ctattaagaa gaagaacgaa      480

aaagcgatcg aatttctcag agaggcaatc aaggatgaag acaatatcac gatccatctt      540

cttccggata tttacccaat gggagaggaa agagcggtag taagagaatg cctcggaaaa      600

ctgcttgatc ctacacaact tccgtcagca gcagatgcag tcgtaatcaa ctgcgagacc      660

ctgcttcgta tcgcagaggc gatcgaactt aagaaacctt gctttagcaa gaatatgacg      720

gttattggaa agattaacgg tggaaacgag ccgcatgtat tcatggatgt tccggttgga      780

acctgtgttg cagacatgat cgagaaggca ggcggaattg atggtacata tggcgagatt      840

atcatgggtg gagcatttac tggaaagtcc accacattag acgcgcctac tacgaagacg      900

acaggcggaa tcatcgttac ggtagagttc ccggatcttc acggagcgcc ggtaggattg      960

cttgtctgtg cgtgcggcgg aagcgaagac cgtatgcgcg aactttgcga aaagatgaat     1020

ggaaaggtcg tttctgtggc aagatgtaaa caggcggttg agccgaagcc gggcgcagcg     1080

cttaagtgcg agaatcctgg aaactgtcct ggacaggcac agaaatgtct gcagtttaag     1140

aaggacggcg cagagtacat catcatcggt aactgctcag actgttccaa cacagttatg     1200

ggatctgcac caaagttaaa actgaagaca ttccatcaga cagaccatgt gatgagaaca     1260

atcggtcatc cattatacag aagactgacc gtgtccaaag aagttgacca gctgcccaac     1320

ggcaaataa                                                             1329


<210> 25
<211> 474
<212> DNA
<213> Clostridium scindens

<400> 25
atgggtatag gaccatcaac aaaagaaaca tcattgcatc acttcaggga tccgctgctg       60

gatgtagtct cttcggatac agatctggat ctgatgggaa ttatcatcgt aggaacaccg      120

gacgataatg aggataagat gcttgtagga accaggacgg ctgtttgggc cgaggcaatg      180

cgtgcggacg gcgtaatcat ctcttcggac ggatggggaa acagcgacgt ggattacacg      240

aatacatgcg agcaggtggg gacgagaggc atcgcggtga cgggccttaa tttcagcggt      300

acggtagctc aatttgtagt tgtaaataat tacctggatg gaattgtgga tatcaataag      360

agcgcggacg ggacagagac caatgtggtt ggggaaaaca atatggtcga gctggattgc      420

aaaaaggcga ctgcgcttct gaaacttaag atgcgaaaga atgagaaaaa gtag            474


<210> 26
<211> 726
<212> DNA
<213> Clostridium scindens

<400> 26
atgagtttaa cggttgttaa aggtttacaa tctgaaatat tcgttcctat tactccacca       60

tcagtatgga ctcctgtaac aaaagagttg aaagacatgt ctatcgctct tgcaacagct      120

gccggtgttc ataagaagga tcaggaaaga ttcaatcttg ctggtgactt tacatggaga      180

aaaatagaga acacaacacc atctagcgaa ctgatggtat cccatggtgg atatgataac      240

agtgatgtta acaaagatat caactgtatg ttcccgattg acagaattca tgaattggct      300

gctgaaggat ttatcagggc ttgtgctccg gtacatgcag gattcatggg tggtggcgga      360

aaccaggaga agttcaaagg cgaaactggt ccggctatcg cgcagatgtt caaagaagag      420

gacgttgacg cagtaattct caccgctggc tgaggaacct gccaccgctc tgcagtattg      480

gtgcagagag cgattgaaga agctggaatt cctactatta ttattgcagc tcttccacca      540

gttgttcgcc agactggtac tcctcgtgca gttgctccat tggtacctat gggtgctaat      600

gcaggtggac cgcacaatgt tgaacagcag acacagatcg taaaggcaac tctggagcag      660

ttagttgaaa tccagacacc tggaaagatt gttccactgc cattcgagta tgtagctaag      720

atttaa                                                                 726


<210> 27
<211> 252
<212> PRT
<213> Clostridium bifermentans

<400> 27
Met Glu Glu Lys Ile Leu Arg Arg Leu Val Ile Lys Pro Phe His Ile 
1               5                   10                  15      


Asn Asn Val Glu Phe Asn Glu Lys Phe Ser Ile Lys Lys Gly Thr Leu 
            20                  25                  30          


Ser Ile Asn Asn Asp Tyr Ile Asn Glu Ile Lys Asn Ser His Glu Leu 
        35                  40                  45              


Ile Thr Asp Ile Lys Leu Asp Ile Ile Lys Pro Gly Asp Tyr Asn Lys 
    50                  55                  60                  


Glu Ile Asn Thr Ile Met Asp Ile Ile Pro Ile Ser Thr Lys Val Leu 
65                  70                  75                  80  


Gly Arg Leu Gly Glu Gly Ile Thr His Thr Leu Thr Gly Val Tyr Val 
                85                  90                  95      


Met Leu Thr Gly Val Asp Glu Asp Gly Arg Gln Met His Glu Phe Gly 
            100                 105                 110         


Ser Ser Glu Gly Ile Leu Ser Glu Gln Met Val Phe Gly Arg Tyr Gly 
        115                 120                 125             


Thr Pro Ser Thr Asn Asp Tyr Ile Ile His Phe Asp Val Thr Val Lys 
    130                 135                 140                 


Gly Gly Leu Pro Tyr Glu Arg Lys Leu Pro Met Met Thr Phe Lys Ala 
145                 150                 155                 160 


Cys Asp Thr Phe Ile Gln Gly Ile Arg Asn Val Leu Lys Gln Gln Asp 
                165                 170                 175     


Gly Arg Asp Ala Thr Glu Ile Arg Glu Tyr Phe Asp Lys Ile Arg Pro 
            180                 185                 190         


Asp Ala Lys Lys Val Val Ile Val Lys Gln Ile Ala Gly Gln Gly Ala 
        195                 200                 205             


Met Tyr Asp Asn Gln Leu Phe Ser His Glu Pro Ser Gly Leu Glu Gly 
    210                 215                 220                 


Gly Thr Ser Ile Ile Asp Met Gly Asn Val Pro Met Ile Ile Ser Pro 
225                 230                 235                 240 


Asn Glu Tyr Arg Asp Gly Ala Leu Arg Ala Met Thr 
                245                 250         


<210> 28
<211> 157
<212> PRT
<213> Clostridium difficile


<220>
<221> MOD_RES
<222> (45)..(45)
<223> Selenocysteine

<400> 28
Met Ser Leu Leu Ser Asn Lys Lys Val Leu Ile Ile Gly Asp Arg Asp 
1               5                   10                  15      


Gly Ile Pro Gly Pro Ala Ile Glu Glu Cys Val Lys Thr Val Glu Gly 
            20                  25                  30          


Ala Glu Val Val Phe Ser Ser Thr Glu Cys Phe Val Xaa Thr Ala Ala 
        35                  40                  45              


Gly Ala Met Asp Leu Glu Asn Gln Asn Arg Val Lys Asp Ala Ala Asp 
    50                  55                  60                  


Lys Phe Gly Ala Glu Asn Val Val Ile Leu Leu Gly Ala Ala Glu Ala 
65                  70                  75                  80  


Glu Ala Ala Gly Leu Ala Ala Glu Thr Val Thr Ala Gly Asp Pro Thr 
                85                  90                  95      


Phe Ala Gly Pro Leu Ala Gly Val Ala Leu Gly Leu Ser Val Tyr His 
            100                 105                 110         


Val Val Glu Glu Pro Ile Lys Ser Leu Phe Asp Glu Ser Val Tyr Glu 
        115                 120                 125             


Asp Gln Ile Ser Met Met Glu Met Val Leu Glu Val Glu Glu Ile Glu 
    130                 135                 140                 


Glu Glu Met Ser Gly Ile Arg Glu Glu Phe Cys Lys Phe 
145                 150                 155         


<210> 29
<211> 474
<212> DNA
<213> Clostridium difficile

<400> 29
atgagtttac ttagtaataa aaaggttctt ataataggtg accgtgatgg tataccagga       60

cctgcgatag aagaatgtgt aaaaacagta gaaggagcag aggttgtttt ctcatctaca      120

gaatgctttg tctgaacagc tgctggggct atggacttag aaaatcaaaa cagagttaaa      180

gatgctgctg ataaattcgg agctgaaaat gttgtgattt tactaggtgc tgctgaagcc      240

gaagctgcag gtcttgcagc cgaaacagta actgctggag atccaacttt cgctggacca      300

cttgctggag ttgccttagg attaagtgtt taccacgttg ttgaggaacc aataaaatca      360

ttatttgatg aaagtgtata tgaagaccaa ataagtatga tggaaatggt tttagaagtt      420

gaagaaatag aagaagaaat gtctggtata agagaagaat tttgtaaatt ttaa            474


<210> 30
<211> 1476
<212> DNA
<213> Clostridium bifermentans


<220>
<221> modified_base
<222> (1449)..(1449)
<223> a, c, t, g, unknown or other

<400> 30
catrgctcag gatgaacgct ggcggcgtgc ctaacacatg caagtcgagc gatctcttcg       60

gagagagcgg cggacgggtg agtaacgcgt gggtaacctg ccctgtacac acggataaca      120

taccgaaagg tatactaata cgggataaca tatgaaagtc gcatggcttt tgtatcaaag      180

ctccggcggt acaggatgga cccgcgtctg attagctagt tggtaaggta atggcttacc      240

aaggcaacga tcagtagccg acctgagagg gtgatcggcc acactggaac tgagacacgg      300

tccagactcc tacgggaggc agcagtgggg aatattgcac aatgggcgaa agcctgatgc      360

agcaacgccg cgtgagcgat gaaggccttc gggtcgtaaa gctctgtcct caaggaagat      420

aatgacggta cttgaggagg aagccccggc taactacgtg ccagcagccg cggtaatatg      480

tagggggcta gcgttatccg gaattactgg gcgtaaaggg tgcgtaggtg gttttttaag      540

tcagaagtga aaggctacgg ctcaaccgta gtaagctttt gaaactagag aacttgagtg      600

caggagagga gagtagaatt cctagtgtag cggtgaaatg cgtagatatt aggaggaata      660

ccagtagcga aggcggctct ctggactgta actgacactg aggcacgaaa gcgtggggag      720

caaacaggat tagataccct ggtagtccac gccgtaaacg atgagtacta ggtgtcgggg      780

gttacccccc tcggtgccgc actaacgcat taagtactcc gcctgggaag tacgctcgca      840

agagtgaaac tcaaaggaat ttdcggggac ccgcacaagt agcggagcat gtggtttaat      900

tcgaagcaac gcgaagaacc ttacctaagc ttgacatccc actgacctct ccctaatcgg      960

agatttccct tcggggacag tggtgacagg tggtgcatgg ttgtcgtcag ctcgtgtcgt     1020

gagatgttgg gttaagtccc gcaacgagcg caacccttgc ctttagttgc cagcattaag     1080

ttgggcactc tagagggact gccgaggata actcggagga aggtggggat gacgtcaaat     1140

catcatgccc cttatgctta gggctacaca cgtgctacaa tgggtggtac agagggttgc     1200

caagccgcga ggtggagcta atcccttaaa gccattctca gttcggattg taggctgaaa     1260

ctcgcctaca tgaagctgga gttactagta atcgcagatc agaatgctgc ggtgaatgcg     1320

ttcccgggtc ttgtacacac cgcccgtcac accatggaag ttgggggcgc ccgaagccgg     1380

ttagctaacc ttttaggaag cggccgtcga aggtgaacaa atgactgggg tgaagtcgta     1440

acaaggtanc cgtatcggaa ggtgcggcbg gatcaa                               1476


<210> 31
<211> 1529
<212> DNA
<213> Clostridium scindens

<400> 31
gagagtttga tcctggctca ggatgaacgc tggcggcgtg cctaacacat gcaagtcgaa       60

cgaagcgcct ggccccgact tcttcggaac gaggagcctt gcgactgagt ggcggacggg      120

tgagtaacgc gtgggcaacc tgccttgcac tgggggataa cagccagaaa tggctgctaa      180

taccgcataa gaccgaagcg ccgcatggcg cggcggccaa agccccggcg gtgcaagatg      240

ggcccgcgtc tgattaggta gttggcgggg taacggccca ccaagccgac gatcagtagc      300

cgacctgaga gggtgaccgg ccacattggg actgagacac ggcccagact cctacgggag      360

gcagcagtgg ggaatattgc acaatggggg aaaccctgat gcagcgacgc cgcgtgaagg      420

atgaagtatt tcggtatgta aacttctatc agcagggaag aagatgacgg tacctgacta      480

agaagccccg gctaactacg tgccagcagc cgcggtaata cgtagggggc aagcgttatc      540

cggatttact gggtgtaaag ggagcgtaga cggcgatgca agccagatgt gaaagcccgg      600

ggctcaaccc cgggactgca tttggaactg cgtggctgga gtgtcggaga ggcaggcgga      660

attcctagtg tagcggtgaa atgcgtagat attaggagga acaccagtgg cgaaggcggc      720

ctgctggacg atgactgacg ttgaggctcg aaagcgtggg gagcaaacag gattagatac      780

cctggtagtc cacgccgtaa acgatgacta ctaggtgtcg ggtggcaagg ccattcggtg      840

ccgcagcaaa cgcaataagt agtccacctg gggagtacgt tcgcaagaat gaaactcaaa      900

ggaattgacg gggacccgca caagcggtgg agcatgtggt ttaattcgaa gcaacgcgaa      960

gaaccttacc tgatcttgac atcccgatgc caaagcgcgt aacgcgctct ttcttcggaa     1020

catcggtgac aggtggtgca tggttgtcgt cagctcgtgt cgtgagatgt tgggttaagt     1080

cccgcaacga gcgcaacccc tatcttcagt agccagcatt ttggatgggc actctggaga     1140

gactgccagg gagaacctgg aggaaggtgg ggatgacgtc aaatcatcat gccccttatg     1200

accagggcta cacacgtgct acaatggcgt aaacaaaggg aggcgaaccc gcgagggtgg     1260

gcaaatccca aaaataacgt ctcagttcgg attgtagtct gcaactcgac tacatgaagt     1320

tggaatcgct agtaatcgcg aatcagaatg tcgcggtgaa tacgttcccg ggtcttgtac     1380

acaccgcccg tcacaccatg ggagtcagta acgcccgaag ccggtgaccc aacccgtaag     1440

ggagggagcc gtcgaaggtg ggaccgataa ctggggtgaa gtcgtaacaa ggtagccgta     1500

tcggaaggtg cggctggatc acctccttc                                       1529


<210> 32
<211> 1445
<212> DNA
<213> Clostridium hylemonae

<400> 32
aggatgaacg ctgccgccgt gcttaacaca tgcaagtcga acgaagcaat actgtgtgaa       60

gagattagct tgctaagatc agaactttgt attgactgag tggcggacgg gtgagtaacg      120

cgtgggcaac ctgccttaca cagggggata acagctagaa atggctgcta ataccgcata      180

agacctcagt accgcatggt agaggggtaa aaactccggt ggtgtaagat gggcccgcgt      240

ctgattaggt agttggtagg gtaacggcct accaagccga cgatcagtag ccgacctgag      300

agggtgaccg gccacattgg actgagacac ggcccaaact cctacgggag gcagcagtgg      360

ggaatattgc acaatggggg aaaccctgat gcagcgacgc cgcgtgaagg atgaagtatt      420

tcggtatgta aacttctatc agcagggaag aagatgacgg tacctgacta agaagccccg      480

gctaactacg tgccagcagc cgcggtaata cgtagggggc aagcgttatc cggatttact      540

gggtgtaaag ggagcgtaga cggcatggca agtctgaagt gaaagcccgg ggctcaaccc      600

cgggactgct ttggaaactg tcaggctaga gtgtcggaga ggcaagtgga attcctagtg      660

tagcggtgaa atgcgtagat attaggagga acaccagtgg cgaagcggct tgctggacga      720

tgactgacgt tgaggctcga aagcgtgggg agcaaacagg attagatacc ctggtagtcc      780

acgccgtaaa cgatgattac taggtgtcgg gaagcaaagc ttttcggtgc cgcagccaac      840

gcaataagta atccacctgg ggagtacgtt cgcaagaatg aaactcaaag gaattgacgg      900

ggacccgcac aagcggtgga gcatgtggtt taattcgaag caacgcgaag aaccttacct      960

gatcttgaca tcccggtgac aaagtatgta acgtactctt tcttcggaac accggtgaca     1020

ggtggtgcat ggttgtcgtc agctcgtgtc gtgagatgtt gggttaagtc ccgcaacggc     1080

gcaaccctta tctttagtag ccagcatttg aggtgggcac tctagagaga ctgccaggga     1140

taacctggag gaaggtgggg atgacgtcaa atcatcatgc cccttatgac cagggctaca     1200

cacgtgctac aatggcgtaa acaaagggaa gcgaccctgt gaaggcaagc aaatcccaaa     1260

aataacgtct cagttcggat tgtagtctgc aactcgacta catgaagctg gaatcgctag     1320

taatcgcgaa tcagaatgtc gcggtgaata cgttcccggg tcttgtacac accgcccgtc     1380

acaccatggg gtcagtaacg cccgaagccg gtgacctaac cgcaaaggag gagccgtcga     1440

aggtg                                                                 1445


