                         SEQUENCE LISTING

<110>  National Research Council of Canada
 
<120>  SYSTEMS AND METHODS FOR THE PRODUCTION OF DIPHTHERIA TOXIN 
       POLYPEPTIDES

<130>  2017-025-02

<160>  10    

<170>  PatentIn version 3.5

<210>  1
<211>  21
<212>  PRT
<213>  Escherichia coli

<400>  1

Met Lys Val Lys Val Leu Ser Leu Leu Val Pro Ala Leu Leu Val Ala 
1               5                   10                  15      


Gly Ala Ala Asn Ala 
            20      


<210>  2
<211>  1671
<212>  DNA
<213>  Corynebacterium diphtheriae

<400>  2
atgaaagtta aggtgttgtc tttgttggtt cctgcattgt tggtcgctgg tgctgcaaat       60

gcgggtgcag atgacgtagt agacagcagc aaatcgttcg ttatggagaa tttcagcagc      120

taccacggta cgaagccggg ttacgtcgat tctattcaga agggcatcca gaaaccgaag      180

tccggtaccc aaggcaacta tgatgacgat tggaaagagt tttacagcac cgataacaaa      240

tatgacgcag cgggttatag cgttgataat gaaaatccgt tgtccggcaa agcaggcggc      300

gtggtcaaag tgacctatcc gggcctgacc aaagttctgg ccctgaaagt tgataacgcg      360

gaaaccatta agaaagagct gggtttgagc ttgacggaac cgctgatgga acaggtcggc      420

acggaagagt tcattaagcg ttttggcgat ggtgcttctc gcgttgttct gagcttgccg      480

tttgcggaag gtagcagcag cgtcgaatac attaacaatt gggagcaggc gaaagccctg      540

tccgttgagc tggagatcaa cttcgaaacc cgcggtaagc gtggccaaga cgccatgtat      600

gagtatatgg cccaagcgtg tgctggcaac cgtgtgcgtc gcagcgtcgg tagcagcctg      660

tcgtgtatca atctggattg ggacgtgatt cgcgacaaaa ccaagaccaa aatcgaaagc      720

ctgaaagaac acggtccgat taagaataag atgagcgaat ctccgaataa gacggttagc      780

gaagagaagg cgaaacagta cctggaagag tttcaccaga ctgcgctgga acatccagag      840

ctgagcgagc tgaaaacggt gaccggtacc aacccagttt ttgctggtgc gaactatgcg      900

gcttgggcgg tcaacgtggc acaagtcatt gactccgaaa cggcagataa cctggagaaa      960

accactgcgg cgctgagcat cttgccgggt attggcagcg tgatgggcat cgcggacggt     1020

gccgtacacc ataacaccga agagatcgtg gcgcagagca tcgcgctgag ctctctgatg     1080

gtcgctcagg cgatcccgct ggtgggtgag ctggttgata ttggttttgc tgcttacaac     1140

tttgttgaga gcatcatcaa tctgttccag gtggtgcaca atagctacaa tcgtccggcg     1200

tacagccctg gtcacaaaac gcaaccgttc ctgcacgacg gttacgcggt ttcctggaat     1260

accgttgagg acagcattat tcgcaccggt ttccaaggcg agtctggcca tgacattaag     1320

atcacggcag aaaacacccc gctgccgatt gcgggtgtcc tgctgccgac gattccgggc     1380

aagctggatg tgaataagag caagacccac attagcgtta acggccgtaa gatccgtatg     1440

cgttgccgtg cgattgacgg tgacgtgacc ttttgccgcc ctaaaagccc ggtctacgtg     1500

ggtaatggtg tgcatgcgaa tctgcacgtt gccttccatc gtagcagctc cgagaaaatt     1560

cacagcaatg agatcagctc cgacagcatc ggtgtcctgg gctatcaaaa gaccgtcgat     1620

catacgaagg ttaactccaa gctgagcttg ttctttgaga tcaaaagcta a              1671


<210>  3
<211>  556
<212>  PRT
<213>  Corynebacterium diphtheriae

<400>  3

Met Lys Val Lys Val Leu Ser Leu Leu Val Pro Ala Leu Leu Val Ala 
1               5                   10                  15      


Gly Ala Ala Asn Ala Gly Ala Asp Asp Val Val Asp Ser Ser Lys Ser 
            20                  25                  30          


Phe Val Met Glu Asn Phe Ser Ser Tyr His Gly Thr Lys Pro Gly Tyr 
        35                  40                  45              


Val Asp Ser Ile Gln Lys Gly Ile Gln Lys Pro Lys Ser Gly Thr Gln 
    50                  55                  60                  


Gly Asn Tyr Asp Asp Asp Trp Lys Glu Phe Tyr Ser Thr Asp Asn Lys 
65                  70                  75                  80  


Tyr Asp Ala Ala Gly Tyr Ser Val Asp Asn Glu Asn Pro Leu Ser Gly 
                85                  90                  95      


Lys Ala Gly Gly Val Val Lys Val Thr Tyr Pro Gly Leu Thr Lys Val 
            100                 105                 110         


Leu Ala Leu Lys Val Asp Asn Ala Glu Thr Ile Lys Lys Glu Leu Gly 
        115                 120                 125             


Leu Ser Leu Thr Glu Pro Leu Met Glu Gln Val Gly Thr Glu Glu Phe 
    130                 135                 140                 


Ile Lys Arg Phe Gly Asp Gly Ala Ser Arg Val Val Leu Ser Leu Pro 
145                 150                 155                 160 


Phe Ala Glu Gly Ser Ser Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln 
                165                 170                 175     


Ala Lys Ala Leu Ser Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly 
            180                 185                 190         


Lys Arg Gly Gln Asp Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala 
        195                 200                 205             


Gly Asn Arg Val Arg Arg Ser Val Gly Ser Ser Leu Ser Cys Ile Asn 
    210                 215                 220                 


Leu Asp Trp Asp Val Ile Arg Asp Lys Thr Lys Thr Lys Ile Glu Ser 
225                 230                 235                 240 


Leu Lys Glu His Gly Pro Ile Lys Asn Lys Met Ser Glu Ser Pro Asn 
                245                 250                 255     


Lys Thr Val Ser Glu Glu Lys Ala Lys Gln Tyr Leu Glu Glu Phe His 
            260                 265                 270         


Gln Thr Ala Leu Glu His Pro Glu Leu Ser Glu Leu Lys Thr Val Thr 
        275                 280                 285             


Gly Thr Asn Pro Val Phe Ala Gly Ala Asn Tyr Ala Ala Trp Ala Val 
    290                 295                 300                 


Asn Val Ala Gln Val Ile Asp Ser Glu Thr Ala Asp Asn Leu Glu Lys 
305                 310                 315                 320 


Thr Thr Ala Ala Leu Ser Ile Leu Pro Gly Ile Gly Ser Val Met Gly 
                325                 330                 335     


Ile Ala Asp Gly Ala Val His His Asn Thr Glu Glu Ile Val Ala Gln 
            340                 345                 350         


Ser Ile Ala Leu Ser Ser Leu Met Val Ala Gln Ala Ile Pro Leu Val 
        355                 360                 365             


Gly Glu Leu Val Asp Ile Gly Phe Ala Ala Tyr Asn Phe Val Glu Ser 
    370                 375                 380                 


Ile Ile Asn Leu Phe Gln Val Val His Asn Ser Tyr Asn Arg Pro Ala 
385                 390                 395                 400 


Tyr Ser Pro Gly His Lys Thr Gln Pro Phe Leu His Asp Gly Tyr Ala 
                405                 410                 415     


Val Ser Trp Asn Thr Val Glu Asp Ser Ile Ile Arg Thr Gly Phe Gln 
            420                 425                 430         


Gly Glu Ser Gly His Asp Ile Lys Ile Thr Ala Glu Asn Thr Pro Leu 
        435                 440                 445             


Pro Ile Ala Gly Val Leu Leu Pro Thr Ile Pro Gly Lys Leu Asp Val 
    450                 455                 460                 


Asn Lys Ser Lys Thr His Ile Ser Val Asn Gly Arg Lys Ile Arg Met 
465                 470                 475                 480 


Arg Cys Arg Ala Ile Asp Gly Asp Val Thr Phe Cys Arg Pro Lys Ser 
                485                 490                 495     


Pro Val Tyr Val Gly Asn Gly Val His Ala Asn Leu His Val Ala Phe 
            500                 505                 510         


His Arg Ser Ser Ser Glu Lys Ile His Ser Asn Glu Ile Ser Ser Asp 
        515                 520                 525             


Ser Ile Gly Val Leu Gly Tyr Gln Lys Thr Val Asp His Thr Lys Val 
    530                 535                 540                 


Asn Ser Lys Leu Ser Leu Phe Phe Glu Ile Lys Ser 
545                 550                 555     


<210>  4
<211>  114
<212>  DNA
<213>  Escherichia coli

<400>  4
caccacaatt cagcaaattg tgaacatcat cacgttcatc tttccctggt tgccaatggc       60

ccattttcct gtcagtaacg agaaggtcgc gaattcaggc gctttttaga ctgg            114


<210>  5
<211>  129
<212>  DNA
<213>  Escherichia coli

<400>  5
caccacaatt cagcaaattg tgaacatcat cacgttcatc tttccctggt tgccaatggc       60

ccattttcct gtcagtaacg agaaggtcgc gaattcaggc gctttttaga ctggtcgtaa      120

tgaacaatt                                                              129


<210>  6
<211>  1683
<212>  DNA
<213>  Corynebacterium diphtheriae

<400>  6
atgagcagaa aactgtttgc gtcaatctta ataggggcgc tactggggat aggggcccca       60

ccttcagccc atgcaggcgc tgatgatgtt gttgattctt ctaaatcttt tgtgatggaa      120

aacttttctt cgtaccacgg gactaaacct ggttatgtag attccattca aaaaggtata      180

caaaagccaa aatctggtac acaaggaaat tatgacgatg attggaaagg gttttatagt      240

accgacaata aatacgacgc tgcgggatac tctgtagata atgaaaaccc gctctctgga      300

aaagctggag gcgtggtcaa agtgacgtat ccaggactga cgaaggttct cgcactaaaa      360

gtggataatg ccgaaactat taagaaagag ttaggtttaa gtctcactga accgttgatg      420

gagcaagtcg gaacggaaga gtttatcaaa aggttcggtg atggtgcttc gcgtgtagtg      480

ctcagccttc ccttcgctga ggggagttct agcgttgaat atattaataa ctgggaacag      540

gcgaaagcgt taagcgtaga acttgagatt aattttgaaa cccgtggaaa acgtggccaa      600

gatgcgatgt atgagtatat ggctcaagcc tgtgcaggaa atcgtgtcag gcgatcagta      660

ggtagctcat tgtcatgcat aaatcttgat tgggatgtca taagggataa aactaagaca      720

aagatagagt ctttgaaaga gcatggccct atcaaaaata aaatgagcga aagtcccaat      780

aaaacagtat ctgaggaaaa agctaaacaa tacctagaag aatttcatca aacggcatta      840

gagcatcctg aattgtcaga acttaaaacc gttactggga ccaatcctgt attcgctggg      900

gctaactatg cggcgtgggc agtaaacgtt gcgcaagtta tcgatagcga aacagctgat      960

aatttggaaa agacaactgc tgctctttcg atacttcctg gtatcggtag cgtaatgggc     1020

attgcagacg gtgccgttca ccacaataca gaagagatag tggcacaatc aatagcttta     1080

tcgtctttaa tggttgctca agctattcca ttggtaggag agctagttga tattggtttc     1140

gctgcatata attttgtaga gagtattatc aatttatttc aagtagttca taattcgtat     1200

aatcgtcccg cgtattctcc ggggcataaa acacaaccat ttcttcatga cgggtatgct     1260

gtcagttgga acactgttga agattcgata atccgaactg gttttcaagg ggagagtggg     1320

cacgacataa aaattactgc tgaaaatacc ccgcttccaa tcgcgggtgt cctactaccg     1380

actattcctg gaaagctgga cgttaataag tccaagactc atatttccgt aaatggtcgg     1440

aaaataagga tgcgttgcag agctatagac ggtgatgtaa ctttttgtcg ccctaaatct     1500

cctgtttatg ttggtaatgg tgtgcatgcg aatcttcacg tggcatttca cagaagcagc     1560

tcggagaaaa ttcattctaa tgaaatttcg tcggattcca taggcgttct tgggtaccag     1620

aaaacagtag atcacaccaa ggttaattct aagctatcgc tattttttga aatcaaaagc     1680

tga                                                                   1683


<210>  7
<211>  560
<212>  PRT
<213>  Corynebacterium diphtheriae

<400>  7

Met Ser Arg Lys Leu Phe Ala Ser Ile Leu Ile Gly Ala Leu Leu Gly 
1               5                   10                  15      


Ile Gly Ala Pro Pro Ser Ala His Ala Gly Ala Asp Asp Val Val Asp 
            20                  25                  30          


Ser Ser Lys Ser Phe Val Met Glu Asn Phe Ser Ser Tyr His Gly Thr 
        35                  40                  45              


Lys Pro Gly Tyr Val Asp Ser Ile Gln Lys Gly Ile Gln Lys Pro Lys 
    50                  55                  60                  


Ser Gly Thr Gln Gly Asn Tyr Asp Asp Asp Trp Lys Gly Phe Tyr Ser 
65                  70                  75                  80  


Thr Asp Asn Lys Tyr Asp Ala Ala Gly Tyr Ser Val Asp Asn Glu Asn 
                85                  90                  95      


Pro Leu Ser Gly Lys Ala Gly Gly Val Val Lys Val Thr Tyr Pro Gly 
            100                 105                 110         


Leu Thr Lys Val Leu Ala Leu Lys Val Asp Asn Ala Glu Thr Ile Lys 
        115                 120                 125             


Lys Glu Leu Gly Leu Ser Leu Thr Glu Pro Leu Met Glu Gln Val Gly 
    130                 135                 140                 


Thr Glu Glu Phe Ile Lys Arg Phe Gly Asp Gly Ala Ser Arg Val Val 
145                 150                 155                 160 


Leu Ser Leu Pro Phe Ala Glu Gly Ser Ser Ser Val Glu Tyr Ile Asn 
                165                 170                 175     


Asn Trp Glu Gln Ala Lys Ala Leu Ser Val Glu Leu Glu Ile Asn Phe 
            180                 185                 190         


Glu Thr Arg Gly Lys Arg Gly Gln Asp Ala Met Tyr Glu Tyr Met Ala 
        195                 200                 205             


Gln Ala Cys Ala Gly Asn Arg Val Arg Arg Ser Val Gly Ser Ser Leu 
    210                 215                 220                 


Ser Cys Ile Asn Leu Asp Trp Asp Val Ile Arg Asp Lys Thr Lys Thr 
225                 230                 235                 240 


Lys Ile Glu Ser Leu Lys Glu His Gly Pro Ile Lys Asn Lys Met Ser 
                245                 250                 255     


Glu Ser Pro Asn Lys Thr Val Ser Glu Glu Lys Ala Lys Gln Tyr Leu 
            260                 265                 270         


Glu Glu Phe His Gln Thr Ala Leu Glu His Pro Glu Leu Ser Glu Leu 
        275                 280                 285             


Lys Thr Val Thr Gly Thr Asn Pro Val Phe Ala Gly Ala Asn Tyr Ala 
    290                 295                 300                 


Ala Trp Ala Val Asn Val Ala Gln Val Ile Asp Ser Glu Thr Ala Asp 
305                 310                 315                 320 


Asn Leu Glu Lys Thr Thr Ala Ala Leu Ser Ile Leu Pro Gly Ile Gly 
                325                 330                 335     


Ser Val Met Gly Ile Ala Asp Gly Ala Val His His Asn Thr Glu Glu 
            340                 345                 350         


Ile Val Ala Gln Ser Ile Ala Leu Ser Ser Leu Met Val Ala Gln Ala 
        355                 360                 365             


Ile Pro Leu Val Gly Glu Leu Val Asp Ile Gly Phe Ala Ala Tyr Asn 
    370                 375                 380                 


Phe Val Glu Ser Ile Ile Asn Leu Phe Gln Val Val His Asn Ser Tyr 
385                 390                 395                 400 


Asn Arg Pro Ala Tyr Ser Pro Gly His Lys Thr Gln Pro Phe Leu His 
                405                 410                 415     


Asp Gly Tyr Ala Val Ser Trp Asn Thr Val Glu Asp Ser Ile Ile Arg 
            420                 425                 430         


Thr Gly Phe Gln Gly Glu Ser Gly His Asp Ile Lys Ile Thr Ala Glu 
        435                 440                 445             


Asn Thr Pro Leu Pro Ile Ala Gly Val Leu Leu Pro Thr Ile Pro Gly 
    450                 455                 460                 


Lys Leu Asp Val Asn Lys Ser Lys Thr His Ile Ser Val Asn Gly Arg 
465                 470                 475                 480 


Lys Ile Arg Met Arg Cys Arg Ala Ile Asp Gly Asp Val Thr Phe Cys 
                485                 490                 495     


Arg Pro Lys Ser Pro Val Tyr Val Gly Asn Gly Val His Ala Asn Leu 
            500                 505                 510         


His Val Ala Phe His Arg Ser Ser Ser Glu Lys Ile His Ser Asn Glu 
        515                 520                 525             


Ile Ser Ser Asp Ser Ile Gly Val Leu Gly Tyr Gln Lys Thr Val Asp 
    530                 535                 540                 


His Thr Lys Val Asn Ser Lys Leu Ser Leu Phe Phe Glu Ile Lys Ser 
545                 550                 555                 560 


<210>  8
<211>  1462
<212>  DNA
<213>  Escherichia coli

<400>  8
atgacctttc gcaattgtgt cgccgtcgat ctcggcgcat ccagtgggcg cgtgatgctg       60

gcgcgttacg agcgtgaatg ccgcagcctg acgctgcgcg aaatccatcg ttttaacaat      120

gggctgcata gtcagaacgg ctatgtcacc tgggatgtgg atagcctgga aagtgccatt      180

cgccttggat taaacaaggt gtgcgaggaa gggattcgta tcgcgatagc attgggattg      240

atacctgggg cgtggacttt gtgctgctcg accaacaggg tcagcgtgtg ggcctgcccg      300

ttgcttatcg cgatagccgc accaatggcc taatggcgca ggcacaacaa caactcggca      360

aacgcgatat ttatcaacgt agcggcatcc agtttctgcc cttcaatacg ctttatcagt      420

tgcgtgcgct gacggagcaa caacctgaac ttattccaca cattgctcac gctctgctga      480

tgccggatta cttcagttat cgcctgaccg gcaagatgaa ctgggaatat accaacgcca      540

cgaccacgca actggtcaat atcaatagcg acgactggga cgagtcgcta ctggcgtgga      600

gcggggccaa caaagcctgg tttggtcgcc cgacgcatcc gggtaatgtc ataggtcact      660

ggatttgccc gcagggtaat gagattccgg tggtcgccgt tgccagccat gataccgcca      720

gcgcggttat cgcctcgccg ttaaacggtt cacgcgccgc ttatctctct tctggcacct      780

ggtcattgat gggcttcgaa agccagacgc catttaccaa tgacacggcg ctggcagcca      840

acatcaccaa tgaaggcggg gcggaaggtc gctatcgggt gctgaaaaat attatgggct      900

tatggctgct tcagcgagtg ctacaggagc ggcaaatcaa cgatctcccg gcgcttatcg      960

ccgcgacaca ggcacttccg gcctgccgct tcatcatcaa tcccaatgac gatcgcttta     1020

ttaaccctga cgagatgtgc agcgaaattc aggctgcgtg tcgggaaatg gcgcaaccga     1080

tcccagaaag tgatgctgaa ctggcgcgct gtattttcga cagtctggcg ttgctgtatg     1140

ccgatgtgtt gcatgagctg gcgcagctac gcggtgaaga tttctcgcaa ctgcatattg     1200

tcggcggcgg ctgccagaac acgctgctca accagctatg tgccgatgcc tgcggtattc     1260

gggtgatcgc cgggcctgtt gaagcctcga cgctcggcaa tatcggcatc cagttaatga     1320

cgctggatga actcaacaat gtggatgatt tccgtcaggt cgtcagcacc accgcgaatc     1380

tgaccacctt tacccctaat cctgacagtg aaattgccca ctatgtggcg ctgattcact     1440

ctacacgaca gacaaaggag ct                                              1462


<210>  9
<211>  89
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic oligomer

<400>  9
gtgcgatctt cctgaggccg atactgtcgt cgtcccctca tgaacctgaa tgcacggtta       60

cgatgcgccc atctacacca acgtgacct                                         89


<210>  10
<211>  511
<212>  DNA
<213>  Escherichia coli

<400>  10
atgtacactt caggctatgc acatcgttct tcgtcgttct catccgcagc aagtaaaatt       60

gcgcgtgtct ctacggaaaa cactacagcc tgaacctgac agtgaagttg tctatcgcga      120

agatcagccc atgatgacgc aacttctact gttgccattg ttacagcaac tcggtcagca      180

atcgcgctgg caactctggt taacaccgca acaaaaactg agtcgggaat gggttcaggc      240

atctgggcta cccttaacga aagtaatgca gattagccag ctctcccctt gccacactgt      300

ggagtcaatg gttcgcgctt tacgcacggg caattacagt gtggtgatcg gttggttggc      360

agatgatttg actgaagaag agcatgctga acttgttgat gcggcaaatg aaggtaacgc      420

tatggggttt attatgcgtc cggtaagcgc atcctctcac gccacgagac aactttccgg      480

gctaaaaatt cactctaatt tgtatcatta a                                     511


