                         SEQUENCE LISTING

<110>  Emergent Product Development United Kingdom Limited
       Telfer, Jon
       Redfern, Mark
 
<120>  USE OF E. COLI  SURFACE ANTIGEN 3 AS SEQUENCES FOR THE EXPORT OF 
       HETEROLOGOUS ANTIGENS

<130>  EMER-005/01WO

<150>  US 61/107,113
<151>  2008-10-21

<160>  25    

<170>  PatentIn version 3.5

<210>  1
<211>  504
<212>  DNA
<213>  Escherichia coli

<400>  1
atgttaaaaa taaaatactt attaataggt ctttcactgt cagctatgag ttcatactca       60

ctagctgcag cggggcccac tctaaccaaa gaactggcat taaatgtgct ttctcctgca      120

gctctggatg caacttgggc tcctcaggat aatttaacat tatccaatac tggcgtttct      180

aatactttgg tgggtgtttt gactctttca aataccagta ttgatacagt tagcattgcg      240

agtacaagtg tttctgatac atctaagaat ggtacagtaa cttttgcaca tgagacaaat      300

aactctgcta gctttgccac caccatttca acagataatg ccaacattac gttggataaa      360

aatgctggaa atacgattgt taaaactaca aatgggagtc agttgccaac taatttacca      420

cttaagttta ttaccactga aggtaacgaa catttagttt caggtaatta ccgtgcaaat      480

ataacaatta cttcgacaat taaa                                             504


<210>  2
<211>  168
<212>  PRT
<213>  Escherichia coli

<400>  2

Met Leu Lys Ile Lys Tyr Leu Leu Ile Gly Leu Ser Leu Ser Ala Met 
1               5                   10                  15      


Ser Ser Tyr Ser Leu Ala Ala Ala Gly Pro Thr Leu Thr Lys Glu Leu 
            20                  25                  30          


Ala Leu Asn Val Leu Ser Pro Ala Ala Leu Asp Ala Thr Trp Ala Pro 
        35                  40                  45              


Gln Asp Asn Leu Thr Leu Ser Asn Thr Gly Val Ser Asn Thr Leu Val 
    50                  55                  60                  


Gly Val Leu Thr Leu Ser Asn Thr Ser Ile Asp Thr Val Ser Ile Ala 
65                  70                  75                  80  


Ser Thr Ser Val Ser Asp Thr Ser Lys Asn Gly Thr Val Thr Phe Ala 
                85                  90                  95      


His Glu Thr Asn Asn Ser Ala Ser Phe Ala Thr Thr Ile Ser Thr Asp 
            100                 105                 110         


Asn Ala Asn Ile Thr Leu Asp Lys Asn Ala Gly Asn Thr Ile Val Lys 
        115                 120                 125             


Thr Thr Asn Gly Ser Gln Leu Pro Thr Asn Leu Pro Leu Lys Phe Ile 
    130                 135                 140                 


Thr Thr Glu Gly Asn Glu His Leu Val Ser Gly Asn Tyr Arg Ala Asn 
145                 150                 155                 160 


Ile Thr Ile Thr Ser Thr Ile Lys 
                165             


<210>  3
<211>  372
<212>  DNA
<213>  Escherichia coli

<400>  3
atgaataaag taaaatttta tgttttattt acggcgttac tatcctctct atgtgcatac       60

ggagctcccc agtctattac agaactatgt tcggaatatc gcaacacaca aatatatacg      120

ataaatgaca agatactatc atatacggaa tcgatggcag gcaaaagaga aatggttatc      180

attacattta agagcggcgc aacatttcag gtcgaagtcc cgggcagtca acatatagac      240

tcccaaaaaa aagccattga aaggatgaag gacacattaa gaatcacata tctgaccgag      300

accaaaattg ataaattatg tgtatggaat aataaaaccc ccaattcaat tgcggcaatc      360

agtatggaaa ac                                                          372


<210>  4
<211>  124
<212>  PRT
<213>  Escherichia coli

<400>  4

Met Asn Lys Val Lys Phe Tyr Val Leu Phe Thr Ala Leu Leu Ser Ser 
1               5                   10                  15      


Leu Cys Ala Tyr Gly Ala Pro Gln Ser Ile Thr Glu Leu Cys Ser Glu 
            20                  25                  30          


Tyr Arg Asn Thr Gln Ile Tyr Thr Ile Asn Asp Lys Ile Leu Ser Tyr 
        35                  40                  45              


Thr Glu Ser Met Ala Gly Lys Arg Glu Met Val Ile Ile Thr Phe Lys 
    50                  55                  60                  


Ser Gly Ala Thr Phe Gln Val Glu Val Pro Gly Ser Gln His Ile Asp 
65                  70                  75                  80  


Ser Gln Lys Lys Ala Ile Glu Arg Met Lys Asp Thr Leu Arg Ile Thr 
                85                  90                  95      


Tyr Leu Thr Glu Thr Lys Ile Asp Lys Leu Cys Val Trp Asn Asn Lys 
            100                 105                 110         


Thr Pro Asn Ser Ile Ala Ala Ile Ser Met Glu Asn 
        115                 120                 


<210>  5
<211>  57
<212>  DNA
<213>  Escherichia coli

<400>  5
aatagtagca attactgctg tgaattgtgt tgtaatccgc tctgtaccgg gtgctat          57


<210>  6
<211>  19
<212>  PRT
<213>  Escherichia coli

<400>  6

Asn Ser Ser Asn Tyr Cys Cys Glu Leu Cys Cys Asn Pro Leu Cys Thr 
1               5                   10                  15      


Gly Cys Tyr 
            


<210>  7
<211>  66
<212>  DNA
<213>  Escherichia coli

<400>  7
atgttaaaaa taaaatactt attaataggt ctttcactgt cagctatgag ttcatactca       60

ctagct                                                                  66


<210>  8
<211>  22
<212>  PRT
<213>  Escherichia coli

<400>  8

Met Leu Lys Ile Lys Tyr Leu Leu Ile Gly Leu Ser Leu Ser Ala Met 
1               5                   10                  15      


Ser Ser Tyr Ser Leu Ala 
            20          


<210>  9
<211>  28
<212>  DNA
<213>  Escherichia coli

<400>  9
ttacctaggg gtacccagct tttgttcc                                          28


<210>  10
<211>  108
<212>  DNA
<213>  Escherichia coli

<400>  10
acttcctagg ctagtctaga ttaatagcac ccggtacaga gcccattaca acacaattca       60

cagcagtaat tgctactatt cccggggttt tccatactga ttgccgca                   108


<210>  11
<211>  453
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CS3 (export signal):LTb:ST fusion

<400>  11
atgttaaaaa taaaatactt attaataggt ctttcactgt cagctatgag ttcatactca       60

ctagctgcag cggggcccgc tccccagtct attacagaac tatgttcgga atatcgcaac      120

acacaaatat atacgataaa tgacaagata ctatcatata cggaatcgat ggcaggcaaa      180

agagaaatgg ttatcattac atttaagagc ggcgcaacat ttcaggtcga agtcccgggc      240

agtcaacata tagactccca aaaaaaagcc attgaaagga tgaaggacac attaagaatc      300

acatatctga ccgagaccaa aattgataaa ttatgtgtat ggaataataa aacccccaat      360

tcaattgcgg caatcagtat ggaaaacccc gggaatagta gcaattactg ctgtgaattg      420

tgttgtaatc cgctctgtac cgggtgctat taa                                   453


<210>  12
<211>  150
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  CS3 (export signal):LTb:ST fusion

<400>  12

Met Leu Lys Ile Lys Tyr Leu Leu Ile Gly Leu Ser Leu Ser Ala Met 
1               5                   10                  15      


Ser Ser Tyr Ser Leu Ala Ala Ala Gly Pro Ala Pro Gln Ser Ile Thr 
            20                  25                  30          


Glu Leu Cys Ser Glu Tyr Arg Asn Thr Gln Ile Tyr Thr Ile Asn Asp 
        35                  40                  45              


Lys Ile Leu Ser Tyr Thr Glu Ser Met Ala Gly Lys Arg Glu Met Val 
    50                  55                  60                  


Ile Ile Thr Phe Lys Ser Gly Ala Thr Phe Gln Val Glu Val Pro Gly 
65                  70                  75                  80  


Ser Gln His Ile Asp Ser Gln Lys Lys Ala Ile Glu Arg Met Lys Asp 
                85                  90                  95      


Thr Leu Arg Ile Thr Tyr Leu Thr Glu Thr Lys Ile Asp Lys Leu Cys 
            100                 105                 110         


Val Trp Asn Asn Lys Thr Pro Asn Ser Ile Ala Ala Ile Ser Met Glu 
        115                 120                 125             


Asn Pro Gly Asn Ser Ser Asn Tyr Cys Cys Glu Leu Cys Cys Asn Pro 
    130                 135                 140                 


Leu Cys Thr Gly Cys Tyr 
145                 150 


<210>  13
<211>  885
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CS3 (full length):LTb:ST fusion

<400>  13
atgttaaaaa taaaatactt attaataggt ctttcactgt cagctatgag ttcatactca       60

ctagctgcag cggggcccac tctaaccaaa gaactggcat taaatgtgct ttctcctgca      120

gctctggatg caacttgggc tcctcaggat aatttaacat tatccaatac tggcgtttct      180

aatactttgg tgggtgtttt gactctttca aataccagta ttgatacagt tagcattgcg      240

agtacaagtg tttctgatac atctaagaat ggtacagtaa cttttgcaca tgagacaaat      300

aactctgcta gctttgccac caccatttca acagataatg ccaacattac gttggataaa      360

aatgctggaa atacgattgt taaaactaca aatgggagtc agttgccaac taatttacca      420

cttaagttta ttaccactga aggtaacgaa catttagttt caggtaatta ccgtgcaaat      480

ataacaatta cttcgacaat taaacccggg gctccccagt ctattacaga actatgttcg      540

gaatatcgca acacacaaat atatacgata aatgacaaga tactatcata tacggaatcg      600

atggcaggca aaagagaaat ggttatcatt acatttaaga gcggcgcaac atttcaggtc      660

gaagtcccgg gcagtcaaca tatagactcc caaaaaaaag ccattgaaag gatgaaggac      720

acattaagaa tcacatatct gaccgagacc aaaattgata aattatgtgt atggaataat      780

aaaaccccca attcaattgc ggcaatcagt atggaaaacc ccgggaatag tagcaattac      840

tgctgtgaat tgtgttgtaa tccgctctgt accgggtgct attaa                      885


<210>  14
<211>  294
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  CS3 (full length):LTb:ST fusion

<400>  14

Met Leu Lys Ile Lys Tyr Leu Leu Ile Gly Leu Ser Leu Ser Ala Met 
1               5                   10                  15      


Ser Ser Tyr Ser Leu Ala Ala Ala Gly Pro Thr Leu Thr Lys Glu Leu 
            20                  25                  30          


Ala Leu Asn Val Leu Ser Pro Ala Ala Leu Asp Ala Thr Trp Ala Pro 
        35                  40                  45              


Gln Asp Asn Leu Thr Leu Ser Asn Thr Gly Val Ser Asn Thr Leu Val 
    50                  55                  60                  


Gly Val Leu Thr Leu Ser Asn Thr Ser Ile Asp Thr Val Ser Ile Ala 
65                  70                  75                  80  


Ser Thr Ser Val Ser Asp Thr Ser Lys Asn Gly Thr Val Thr Phe Ala 
                85                  90                  95      


His Glu Thr Asn Asn Ser Ala Ser Phe Ala Thr Thr Ile Ser Thr Asp 
            100                 105                 110         


Asn Ala Asn Ile Thr Leu Asp Lys Asn Ala Gly Asn Thr Ile Val Lys 
        115                 120                 125             


Thr Thr Asn Gly Ser Gln Leu Pro Thr Asn Leu Pro Leu Lys Phe Ile 
    130                 135                 140                 


Thr Thr Glu Gly Asn Glu His Leu Val Ser Gly Asn Tyr Arg Ala Asn 
145                 150                 155                 160 


Ile Thr Ile Thr Ser Thr Ile Lys Pro Gly Ala Pro Gln Ser Ile Thr 
                165                 170                 175     


Glu Leu Cys Ser Glu Tyr Arg Asn Thr Gln Ile Tyr Thr Ile Asn Asp 
            180                 185                 190         


Lys Ile Leu Ser Tyr Thr Glu Ser Met Ala Gly Lys Arg Glu Met Val 
        195                 200                 205             


Ile Ile Thr Phe Lys Ser Gly Ala Thr Phe Gln Val Glu Val Pro Gly 
    210                 215                 220                 


Ser Gln His Ile Asp Ser Gln Lys Lys Ala Ile Glu Arg Met Lys Asp 
225                 230                 235                 240 


Thr Leu Arg Ile Thr Tyr Leu Thr Glu Thr Lys Ile Asp Lys Leu Cys 
                245                 250                 255     


Val Trp Asn Asn Lys Thr Pro Asn Ser Ile Ala Ala Ile Ser Met Glu 
            260                 265                 270         


Asn Pro Gly Asn Ser Ser Asn Tyr Cys Cys Glu Leu Cys Cys Asn Pro 
        275                 280                 285             


Leu Cys Thr Gly Cys Tyr 
    290                 


<210>  15
<211>  474
<212>  DNA
<213>  Salmonella sp.

<400>  15
ctcgagattg ccatcgcgga tgtcgcctgt cttatctacc atcataaaca tcatttgcct       60

atggctcacg acagtatagg caatgccgtt ttttatattg ctaattgttt cgccaatcaa      120

cgcaaaagta tggcgattgc taaagccgtc tccctgggcg gtagattagc cttaaccgcg      180

acggtaatga ctcattcata ctggagtggt agtttgggac tacagcctca tttattagag      240

cgtcttaatg atattaccta tggactaatg agttttactc gcttcggtat ggatgggatg      300

gcaatgaccg gtatgcaggt cagcagccca ttatatcgtt tgctggctca ggtaacgcca      360

gaacaacgtg cgccggagta atcgttttca ggtatatacc ggatgttcat tgctttctaa      420

attttgctat gttgccagta tccttacgat gtatttattt taaggaaaag ccat            474


<210>  16
<211>  72
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CS3 export signal sequence plus linker 1

<400>  16
atgttaaaaa taaaatactt attaataggt ctttcactgt cagctatgag ttcatactca       60

ctagctgcag cg                                                           72


<210>  17
<211>  24
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  CS3 export signal sequence plus linker 1

<400>  17

Met Leu Lys Ile Lys Tyr Leu Leu Ile Gly Leu Ser Leu Ser Ala Met 
1               5                   10                  15      


Ser Ser Tyr Ser Leu Ala Ala Ala 
            20                  


<210>  18
<211>  78
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CS3 export signal sequence plus linker 2

<400>  18
atgttaaaaa taaaatactt attaataggt ctttcactgt cagctatgag ttcatactca       60

ctagctgcag cggggccc                                                     78


<210>  19
<211>  26
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  CS3 export signal sequence plus linker 2

<400>  19

Met Leu Lys Ile Lys Tyr Leu Leu Ile Gly Leu Ser Leu Ser Ala Met 
1               5                   10                  15      


Ser Ser Tyr Ser Leu Ala Ala Ala Gly Pro 
            20                  25      


<210>  20
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Amino acid linker

<400>  20

Ala Ala Pro Gly 
1               


<210>  21
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Amino acid linker

<400>  21

Ala Ala Gly Pro 
1               


<210>  22
<211>  2638
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ssaG promoter, full length CS3, C diff toxin B C terminal repeat 
       domain

<400>  22
ctcgagattg ccatcgcgga tgtcgcctgt cttatctacc atcataaaca tcatttgcct       60

atggctcacg acagtatagg caatgccgtt ttttatattg ctaattgttt cgccaatcaa      120

cgcaaaagta tggcgattgc taaagccgtc tccctgggcg gtagattagc cttaaccgcg      180

acggtaatga ctcattcata ctggagtggt agtttgggac tacagcctca tttattagag      240

cgtcttaatg atattaccta tggactaatg agttttactc gcttcggtat ggatgggatg      300

gcaatgaccg gtatgcaggt cagcagccca ttatatcgtt tgctggctca ggtaacgcca      360

gaacaacgtg cgccggagta atcgttttca ggtatatacc ggatgttcat tgctttctaa      420

attttgctat gttgccagta tccttacgat gtatttattt taaggaaaag ccatatgtta      480

aaaataaaat acttattaat aggtctttca ctgtcagcta tgagttcata ctcactagct      540

gcagcggggc ccactctaac caaagaactg gcattaaatg tgctttctcc tgcagctctg      600

gatgcaactt gggctcctca ggataattta acattatcca atactggcgt ttctaatact      660

ttggtgggtg ttttgactct ttcaaatacc agtattgata cagttagcat tgcgagtaca      720

agtgtttctg atacatctaa gaatggtaca gtaacttttg cacatgagac aaataactct      780

gctagctttg ccaccaccat ttcaacagat aatgccaaca ttacgttgga taaaaatgct      840

ggaaatacga ttgttaaaac tacaaatggg agtcagttgc caactaattt accacttaag      900

tttattacca ctgaaggtaa cgaacattta gtttcaggta attaccgtgc aaatataaca      960

attacttcga caattaaacc cgggaagttt tatatcaaca acttcggcat gatggtgtct     1020

ggcttgatct acatcaacga tagcctctat tatttcaagc cgcccgttaa taacttaatc     1080

acaggcttcg tgacagtagg tgatgacaaa tactatttta atccgatcaa tggaggcgca     1140

gcaagtattg gtgaaacgat aatcgacgac aagaactatt attttaacca atcaggagtg     1200

ctgcaaactg gtgtgttttc caccgaggac ggctttaagt acttcgcccc cgcgaacacc     1260

ctggacgaaa accttgaggg tgaagccatt gacttcactg gtaaacttat tatcgacgaa     1320

aacatctact attttgatga taactacaga ggcgcagtgg agtggaaaga gctggacggg     1380

gaaatgcatt acttttcccc agagacaggt aaagctttca aaggtctgaa tcagattggg     1440

gattacaaat attacttcaa ctctgacggt gtcatgcaga agggatttgt gtcaatcaac     1500

gataataagc actactttga tgactcagga gtaatgaagg tgggctacac ggagattgac     1560

ggaaaacatt tctatttcgc cgaaaatggt gaaatgcaga ttggcgtttt caataccgag     1620

gatggcttca agtattttgc tcatcacaat gaggatctgg gaaacgaaga aggcgaggaa     1680

atttcctact cgggcatact gaattttaac aataaaatat attatttcga cgacagtttt     1740

acggcggttg ttgggtggaa ggatttagaa gatggtagta aatactactt cgatgaggac     1800

acggccgaag cctatatcgg tttgtcgctg attaatgatg gacagtacta ttttaatgac     1860

gacggcatta tgcaagttgg gttcgtgacc attaacgaca aagtgtttta tttttcagac     1920

tcaggaatta tcgagagcgg ggttcaaaac attgatgata attattttta catagacgat     1980

aatgggatcg ttcagatcgg ggtgttcgac acatctgacg gttacaaata ttttgctccc     2040

gcaaatacgg tgaacgacaa catttacggg caggcagtgg aatattcggg tttggttaga     2100

gttggcgagg atgtctacta ttttggcgag acatacacga ttgaaacggg gtggatttac     2160

gatatggaga acgaaagcga taaatattac tttaacccag aaacaaagaa ggcctgcaaa     2220

ggtatcaatt taatcgatga tatcaaatac tatttcgacg aaaagggtat catgcgtact     2280

gggctgatca gctttgagaa caataattac tatttcaatg aaaatgggga aatgcaattt     2340

ggatatatta atatagaaga taagatgttt tatttcgggg aggatggtgt gatgcagatc     2400

ggcgttttca acaccccgga cgggtttaaa tatttcgcac atcagaatac actggatgag     2460

aacttcgagg gtgagtctat taactacacc gggtggctgg acttagacga gaaacgctac     2520

tatttcacag acgagtacat tgcagctact ggttcggtca tcattgatgg cgaggaatat     2580

tatttcgacc cggataccgc ccagttagtg atctccgagt aatctagact agcctagg       2638


<210>  23
<211>  2206
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ssaG promoter, CS3 signal peptide, C diff toxin B C terminal 
       repeat domain

<400>  23
ctcgagattg ccatcgcgga tgtcgcctgt cttatctacc atcataaaca tcatttgcct       60

atggctcacg acagtatagg caatgccgtt ttttatattg ctaattgttt cgccaatcaa      120

cgcaaaagta tggcgattgc taaagccgtc tccctgggcg gtagattagc cttaaccgcg      180

acggtaatga ctcattcata ctggagtggt agtttgggac tacagcctca tttattagag      240

cgtcttaatg atattaccta tggactaatg agttttactc gcttcggtat ggatgggatg      300

gcaatgaccg gtatgcaggt cagcagccca ttatatcgtt tgctggctca ggtaacgcca      360

gaacaacgtg cgccggagta atcgttttca ggtatatacc ggatgttcat tgctttctaa      420

attttgctat gttgccagta tccttacgat gtatttattt taaggaaaag ccatatgtta      480

aaaataaaat acttattaat aggtctttca ctgtcagcta tgagttcata ctcactagct      540

gcagcggggc ccaagtttta tatcaacaac ttcggcatga tggtgtctgg cttgatctac      600

atcaacgata gcctctatta tttcaagccg cccgttaata acttaatcac aggcttcgtg      660

acagtaggtg atgacaaata ctattttaat ccgatcaatg gaggcgcagc aagtattggt      720

gaaacgataa tcgacgacaa gaactattat tttaaccaat caggagtgct gcaaactggt      780

gtgttttcca ccgaggacgg ctttaagtac ttcgcccccg cgaacaccct ggacgaaaac      840

cttgagggtg aagccattga cttcactggt aaacttatta tcgacgaaaa catctactat      900

tttgatgata actacagagg cgcagtggag tggaaagagc tggacgggga aatgcattac      960

ttttccccag agacaggtaa agctttcaaa ggtctgaatc agattgggga ttacaaatat     1020

tacttcaact ctgacggtgt catgcagaag ggatttgtgt caatcaacga taataagcac     1080

tactttgatg actcaggagt aatgaaggtg ggctacacgg agattgacgg aaaacatttc     1140

tatttcgccg aaaatggtga aatgcagatt ggcgttttca ataccgagga tggcttcaag     1200

tattttgctc atcacaatga ggatctggga aacgaagaag gcgaggaaat ttcctactcg     1260

ggcatactga attttaacaa taaaatatat tatttcgacg acagttttac ggcggttgtt     1320

gggtggaagg atttagaaga tggtagtaaa tactacttcg atgaggacac ggccgaagcc     1380

tatatcggtt tgtcgctgat taatgatgga cagtactatt ttaatgacga cggcattatg     1440

caagttgggt tcgtgaccat taacgacaaa gtgttttatt tttcagactc aggaattatc     1500

gagagcgggg ttcaaaacat tgatgataat tatttttaca tagacgataa tgggatcgtt     1560

cagatcgggg tgttcgacac atctgacggt tacaaatatt ttgctcccgc aaatacggtg     1620

aacgacaaca tttacgggca ggcagtggaa tattcgggtt tggttagagt tggcgaggat     1680

gtctactatt ttggcgagac atacacgatt gaaacggggt ggatttacga tatggagaac     1740

gaaagcgata aatattactt taacccagaa acaaagaagg cctgcaaagg tatcaattta     1800

atcgatgata tcaaatacta tttcgacgaa aagggtatca tgcgtactgg gctgatcagc     1860

tttgagaaca ataattacta tttcaatgaa aatggggaaa tgcaatttgg atatattaat     1920

atagaagata agatgtttta tttcggggag gatggtgtga tgcagatcgg cgttttcaac     1980

accccggacg ggtttaaata tttcgcacat cagaatacac tggatgagaa cttcgagggt     2040

gagtctatta actacaccgg gtggctggac ttagacgaga aacgctacta tttcacagac     2100

gagtacattg cagctactgg ttcggtcatc attgatggcg aggaatatta tttcgacccg     2160

gataccgccc agttagtgat ctccgagtaa tctagactag cctagg                    2206


<210>  24
<211>  3673
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ssaG promoter, full length CS3, C diff toxin A C terminal repeat 
       domain

<400>  24
ctcgagattg ccatcgcgga tgtcgcctgt cttatctacc atcataaaca tcatttgcct       60

atggctcacg acagtatagg caatgccgtt ttttatattg ctaattgttt cgccaatcaa      120

cgcaaaagta tggcgattgc taaagccgtc tccctgggcg gtagattagc cttaaccgcg      180

acggtaatga ctcattcata ctggagtggt agtttgggac tacagcctca tttattagag      240

cgtcttaatg atattaccta tggactaatg agttttactc gcttcggtat ggatgggatg      300

gcaatgaccg gtatgcaggt cagcagccca ttatatcgtt tgctggctca ggtaacgcca      360

gaacaacgtg cgccggagta atcgttttca ggtatatacc ggatgttcat tgctttctaa      420

attttgctat gttgccagta tccttacgat gtatttattt taaggaaaag ccatatgtta      480

aaaataaaat acttattaat aggtctttca ctgtcagcta tgagttcata ctcactagct      540

gcagcggggc ccactctaac caaagaactg gcattaaatg tgctttctcc tgcagctctg      600

gatgcaactt gggctcctca ggataattta acattatcca atactggcgt ttctaatact      660

ttggtgggtg ttttgactct ttcaaatacc agtattgata cagttagcat tgcgagtaca      720

agtgtttctg atacatctaa gaatggtaca gtaacttttg cacatgagac aaataactct      780

gctagctttg ccaccaccat ttcaacagat aatgccaaca ttacgttgga taaaaatgct      840

ggaaatacga ttgttaaaac tacaaatggg agtcagttgc caactaattt accacttaag      900

tttattacca ctgaaggtaa cgaacattta gtttcaggta attaccgtgc aaatataaca      960

attacttcga caattaaacc cgggacatat tactacgacg aagattcgaa gttggtcaag     1020

ggcctgataa acataaacaa ctcgttattt tatttcgatc ctattgaatt taacctggtg     1080

acggggtggc agaccataaa cgggaagaag tactactttg acatcaatac cggcgcagca     1140

ttgatttcat ataagataat taacggcaag catttctact ttaacaacga tggagtcatg     1200

caactgggag tctttaaggg tcccgacggc ttcgaatact ttgccccagc gaacacccaa     1260

aacaacaata ttgaggggca ggcgattgtc tatcaatcaa agtttttgac gctgaacggt     1320

aagaaatact attttgataa cgattcgaaa gcagtcacgg ggtggcggat tattaacaac     1380

gaaaaatatt attttaatcc aaataatgct atcgcagcag tcgggcttca agtgatcgat     1440

aataataagt actacttcaa tccagatacg gctattattt caaaagggtg gcagactgtc     1500

aacggctcca ggtattattt cgacactgat actgctatcg ctttcaacgg gtataagaca     1560

atcgatggta agcatttcta ctttgatagc gactgcgtgg ttaaaattgg tgtattcagt     1620

acctctaatg gatttgagta cttcgctcct gcaaacactt acaataacaa tattgaaggt     1680

caggccatcg tataccaaag caagttcctc accttaaatg gcaaaaagta ctatttcgac     1740

aacaatagca aagcggtcac cggttggcag accattgata gtaaaaaata ttattttaat     1800

accaacactg cggaagctgc taccggatgg cagacaatcg acggcaagaa gtattatttc     1860

aacaccaata cagcagaagc ggccacaggg tggcaaacga tcgacgggaa gaagtactac     1920

tttaatacta acacggccat tgctagcacc ggttatacca ttattaatgg gaaacacttt     1980

tacttcaaca ctgacggcat tatgcagatc ggtgtattca aagggcctaa cggcttcgaa     2040

tatttcgcac cggccaatac agacgcgaac aatatagaag gacaggcgat tctgtatcag     2100

aatgaattcc tgaccctgaa tggtaagaaa tattacttcg gcagcgattc taaggccgtc     2160

accgggtggc ggataatcaa taataaaaag tactatttca acccgaataa cgcgattgca     2220

gctattcacc tgtgcacgat caacaatgat aagtattatt ttagctatga tgggatcctt     2280

caaaatggat atattacaat agaaagaaat aacttctatt tcgatgcgaa taatgagtct     2340

aaaatggtga ctggcgtttt caaaggccca aatgggttcg aatacttcgc tccggcgaac     2400

acacacaaca acaatattga agggcaggca atagtgtatc agaataaatt cttgacgctg     2460

aatggtaaaa agtactactt tgataatgat tcgaaagcgg taacaggctg gcagaccata     2520

gacggcaaga aatattactt taatctgaat actgccgaag ctgcgacggg ctggcaaacc     2580

atagacggaa agaaatatta ttttaatctg aacaccgcag aggccgccac cggatggcag     2640

accatcgacg ggaagaaata ctatttcaac actaatacct tcatagcgag tacggggtat     2700

acctcgatca atggcaagca tttctacttt aacaccgacg ggattatgca gatcggtgtt     2760

ttcaaggggc cgaacggctt cgaatacttc gctcccgcaa acacacacaa caacaacatc     2820

gagggacagg ctatactgta tcaaaataaa tttcttacgt taaatggcaa gaagtattat     2880

tttgggtcgg acagcaaagc agtgaccggt ttgcgtacca tagatggtaa gaaatattat     2940

tttaatacta acacggcagt agccgttacc ggatggcaga ctattaatgg gaagaaatac     3000

tattttaaca ctaacacgag cattgcctcg actggctaca cgatcattag cgggaaacac     3060

ttctacttca acacggatgg tattatgcag ataggtgtct ttaaaggtcc tgacggtttt     3120

gagtacttcg cacccgccaa caccgacgct aataacatag aggggcaagc tatcaggtat     3180

cagaatcgct tcctttacct gcatgataac atctattact tcgggaacaa cagtaaggct     3240

gctaccgggt gggtgacaat tgacggtaat cgctattatt tcgagcctaa cacagcaatg     3300

ggagccaatg gctataagac tatcgataac aaaaattttt actttcggaa cggtttgcct     3360

caaatcgggg tttttaaagg atctaacggc ttcgagtact ttgccccggc gaacacggat     3420

gccaacaata ttgagggcca ggcgataagg taccagaacc gctttctgca tctcttgggt     3480

aaaatctatt acttcggcaa caactcaaag gcggtaacag gatggcaaac tataaacggg     3540

aaggtttact attttatgcc tgatacggcc atggctgcgg cgggaggcct gttcgaaatt     3600

gacggtgtta tatacttttt cggtgtggac ggtgttaagg ccccaggcat ttactaatct     3660

agactagcct agg                                                        3673


<210>  25
<211>  3241
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ssaG promoter, CS3 signal peptide, C diff toxin A C terminal 
       repeat domain

<400>  25
ctcgagattg ccatcgcgga tgtcgcctgt cttatctacc atcataaaca tcatttgcct       60

atggctcacg acagtatagg caatgccgtt ttttatattg ctaattgttt cgccaatcaa      120

cgcaaaagta tggcgattgc taaagccgtc tccctgggcg gtagattagc cttaaccgcg      180

acggtaatga ctcattcata ctggagtggt agtttgggac tacagcctca tttattagag      240

cgtcttaatg atattaccta tggactaatg agttttactc gcttcggtat ggatgggatg      300

gcaatgaccg gtatgcaggt cagcagccca ttatatcgtt tgctggctca ggtaacgcca      360

gaacaacgtg cgccggagta atcgttttca ggtatatacc ggatgttcat tgctttctaa      420

attttgctat gttgccagta tccttacgat gtatttattt taaggaaaag ccatatgtta      480

aaaataaaat acttattaat aggtctttca ctgtcagcta tgagttcata ctcactagct      540

gcagcggggc ccacatatta ctacgacgaa gattcgaagt tggtcaaggg cctgataaac      600

ataaacaact cgttatttta tttcgatcct attgaattta acctggtgac ggggtggcag      660

accataaacg ggaagaagta ctactttgac atcaataccg gcgcagcatt gatttcatat      720

aagataatta acggcaagca tttctacttt aacaacgatg gagtcatgca actgggagtc      780

tttaagggtc ccgacggctt cgaatacttt gccccagcga acacccaaaa caacaatatt      840

gaggggcagg cgattgtcta tcaatcaaag tttttgacgc tgaacggtaa gaaatactat      900

tttgataacg attcgaaagc agtcacgggg tggcggatta ttaacaacga aaaatattat      960

tttaatccaa ataatgctat cgcagcagtc gggcttcaag tgatcgataa taataagtac     1020

tacttcaatc cagatacggc tattatttca aaagggtggc agactgtcaa cggctccagg     1080

tattatttcg acactgatac tgctatcgct ttcaacgggt ataagacaat cgatggtaag     1140

catttctact ttgatagcga ctgcgtggtt aaaattggtg tattcagtac ctctaatgga     1200

tttgagtact tcgctcctgc aaacacttac aataacaata ttgaaggtca ggccatcgta     1260

taccaaagca agttcctcac cttaaatggc aaaaagtact atttcgacaa caatagcaaa     1320

gcggtcaccg gttggcagac cattgatagt aaaaaatatt attttaatac caacactgcg     1380

gaagctgcta ccggatggca gacaatcgac ggcaagaagt attatttcaa caccaataca     1440

gcagaagcgg ccacagggtg gcaaacgatc gacgggaaga agtactactt taatactaac     1500

acggccattg ctagcaccgg ttataccatt attaatggga aacactttta cttcaacact     1560

gacggcatta tgcagatcgg tgtattcaaa gggcctaacg gcttcgaata tttcgcaccg     1620

gccaatacag acgcgaacaa tatagaagga caggcgattc tgtatcagaa tgaattcctg     1680

accctgaatg gtaagaaata ttacttcggc agcgattcta aggccgtcac cgggtggcgg     1740

ataatcaata ataaaaagta ctatttcaac ccgaataacg cgattgcagc tattcacctg     1800

tgcacgatca acaatgataa gtattatttt agctatgatg ggatccttca aaatggatat     1860

attacaatag aaagaaataa cttctatttc gatgcgaata atgagtctaa aatggtgact     1920

ggcgttttca aaggcccaaa tgggttcgaa tacttcgctc cggcgaacac acacaacaac     1980

aatattgaag ggcaggcaat agtgtatcag aataaattct tgacgctgaa tggtaaaaag     2040

tactactttg ataatgattc gaaagcggta acaggctggc agaccataga cggcaagaaa     2100

tattacttta atctgaatac tgccgaagct gcgacgggct ggcaaaccat agacggaaag     2160

aaatattatt ttaatctgaa caccgcagag gccgccaccg gatggcagac catcgacggg     2220

aagaaatact atttcaacac taataccttc atagcgagta cggggtatac ctcgatcaat     2280

ggcaagcatt tctactttaa caccgacggg attatgcaga tcggtgtttt caaggggccg     2340

aacggcttcg aatacttcgc tcccgcaaac acacacaaca acaacatcga gggacaggct     2400

atactgtatc aaaataaatt tcttacgtta aatggcaaga agtattattt tgggtcggac     2460

agcaaagcag tgaccggttt gcgtaccata gatggtaaga aatattattt taatactaac     2520

acggcagtag ccgttaccgg atggcagact attaatggga agaaatacta ttttaacact     2580

aacacgagca ttgcctcgac tggctacacg atcattagcg ggaaacactt ctacttcaac     2640

acggatggta ttatgcagat aggtgtcttt aaaggtcctg acggttttga gtacttcgca     2700

cccgccaaca ccgacgctaa taacatagag gggcaagcta tcaggtatca gaatcgcttc     2760

ctttacctgc atgataacat ctattacttc gggaacaaca gtaaggctgc taccgggtgg     2820

gtgacaattg acggtaatcg ctattatttc gagcctaaca cagcaatggg agccaatggc     2880

tataagacta tcgataacaa aaatttttac tttcggaacg gtttgcctca aatcggggtt     2940

tttaaaggat ctaacggctt cgagtacttt gccccggcga acacggatgc caacaatatt     3000

gagggccagg cgataaggta ccagaaccgc tttctgcatc tcttgggtaa aatctattac     3060

ttcggcaaca actcaaaggc ggtaacagga tggcaaacta taaacgggaa ggtttactat     3120

tttatgcctg atacggccat ggctgcggcg ggaggcctgt tcgaaattga cggtgttata     3180

tactttttcg gtgtggacgg tgttaaggcc ccaggcattt actaatctag actagcctag     3240

g                                                                     3241


