                               SEQUENCE LISTING

<110> FINA BIOSOLUTIONS, LLC
 
<120> EXPRESSION AND PURIFICATION OF CRM197 AND RELATED PROTEINS

<130> 8164.014.PCT

<140> PCT/US2015/14130
<141> 2015-02-02

<150> 61/934,377
<151> 2014-01-31

<160> 17    

<170> PatentIn version 3.5

<210> 1
<211> 535
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 1
Gly Ala Asp Asp Val Val Asp Ser Ser Lys Ser Phe Val Met Glu Asn 
1               5                   10                  15      


Phe Ser Ser Tyr His Gly Thr Lys Pro Gly Tyr Val Asp Ser Ile Gln 
            20                  25                  30          


Lys Gly Ile Gln Lys Pro Lys Ser Gly Thr Gln Gly Asn Tyr Asp Asp 
        35                  40                  45              


Asp Trp Lys Glu Phe Tyr Ser Thr Asp Asn Lys Tyr Asp Ala Ala Gly 
    50                  55                  60                  


Tyr Ser Val Asp Asn Glu Asn Pro Leu Ser Gly Lys Ala Gly Gly Val 
65                  70                  75                  80  


Val Lys Val Thr Tyr Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val 
                85                  90                  95      


Asp Asn Ala Glu Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu 
            100                 105                 110         


Pro Leu Met Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly 
        115                 120                 125             


Asp Gly Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser 
    130                 135                 140                 


Ser Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser 
145                 150                 155                 160 


Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln Asp 
                165                 170                 175     


Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg Val Arg 
            180                 185                 190         


Arg Ser Val Gly Ser Ser Leu Ser Cys Ile Asn Leu Asp Trp Asp Val 
        195                 200                 205             


Ile Arg Asp Lys Thr Lys Thr Lys Ile Glu Ser Leu Lys Glu His Gly 
    210                 215                 220                 


Pro Ile Lys Asn Lys Met Ser Glu Ser Pro Asn Lys Thr Val Ser Glu 
225                 230                 235                 240 


Glu Lys Ala Lys Gln Tyr Leu Glu Glu Phe His Gln Thr Ala Leu Glu 
                245                 250                 255     


His Pro Glu Leu Ser Glu Leu Lys Thr Val Thr Gly Thr Asn Pro Val 
            260                 265                 270         


Phe Ala Gly Ala Asn Tyr Ala Ala Trp Ala Val Asn Val Ala Gln Val 
        275                 280                 285             


Ile Asp Ser Glu Thr Ala Asp Asn Leu Glu Lys Thr Thr Ala Ala Leu 
    290                 295                 300                 


Ser Ile Leu Pro Gly Ile Gly Ser Val Met Gly Ile Ala Asp Gly Ala 
305                 310                 315                 320 


Val His His Asn Thr Glu Glu Ile Val Ala Gln Ser Ile Ala Leu Ser 
                325                 330                 335     


Ser Leu Met Val Ala Gln Ala Ile Pro Leu Val Gly Glu Leu Val Asp 
            340                 345                 350         


Ile Gly Phe Ala Ala Tyr Asn Phe Val Glu Ser Ile Ile Asn Leu Phe 
        355                 360                 365             


Gln Val Val His Asn Ser Tyr Asn Arg Pro Ala Tyr Ser Pro Gly His 
    370                 375                 380                 


Lys Thr Gln Pro Phe Leu His Asp Gly Tyr Ala Val Ser Trp Asn Thr 
385                 390                 395                 400 


Val Glu Asp Ser Ile Ile Arg Thr Gly Phe Gln Gly Glu Ser Gly His 
                405                 410                 415     


Asp Ile Lys Ile Thr Ala Glu Asn Thr Pro Leu Pro Ile Ala Gly Val 
            420                 425                 430         


Leu Leu Pro Thr Ile Pro Gly Lys Leu Asp Val Asn Lys Ser Lys Thr 
        435                 440                 445             


His Ile Ser Val Asn Gly Arg Lys Ile Arg Met Arg Cys Arg Ala Ile 
    450                 455                 460                 


Asp Gly Asp Val Thr Phe Cys Arg Pro Lys Ser Pro Val Tyr Val Gly 
465                 470                 475                 480 


Asn Gly Val His Ala Asn Leu His Val Ala Phe His Arg Ser Ser Ser 
                485                 490                 495     


Glu Lys Ile His Ser Asn Glu Ile Ser Ser Asp Ser Ile Gly Val Leu 
            500                 505                 510         


Gly Tyr Gln Lys Thr Val Asp His Thr Lys Val Asn Ser Lys Leu Ser 
        515                 520                 525             


Leu Phe Phe Glu Ile Lys Ser 
    530                 535 


<210> 2
<211> 155
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 2
Ser Pro Gly His Lys Thr Gln Pro Phe Leu His Asp Gly Tyr Ala Val 
1               5                   10                  15      


Ser Trp Asn Thr Val Glu Asp Ser Ile Ile Arg Thr Gly Phe Gln Gly 
            20                  25                  30          


Glu Ser Gly His Asp Ile Lys Ile Thr Ala Glu Asn Thr Pro Leu Pro 
        35                  40                  45              


Ile Ala Gly Val Leu Leu Pro Thr Ile Pro Gly Lys Leu Asp Val Asn 
    50                  55                  60                  


Lys Ser Lys Thr His Ile Ser Val Asn Gly Arg Lys Ile Arg Met Arg 
65                  70                  75                  80  


Cys Arg Ala Ile Asp Gly Asp Val Thr Phe Cys Arg Pro Lys Ser Pro 
                85                  90                  95      


Val Tyr Val Gly Asn Gly Val His Ala Asn Leu His Val Ala Phe His 
            100                 105                 110         


Arg Ser Ser Ser Glu Lys Ile His Ser Asn Glu Ile Ser Ser Asp Ser 
        115                 120                 125             


Ile Gly Val Leu Gly Tyr Gln Lys Thr Val Asp His Thr Lys Val Asn 
    130                 135                 140                 


Ser Lys Leu Ser Leu Phe Phe Glu Ile Lys Ser 
145                 150                 155 


<210> 3
<211> 7
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 3
gatatac                                                                  7


<210> 4
<211> 9
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 4
gatatacca                                                                9


<210> 5
<211> 12
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 5
gatataccat at                                                           12


<210> 6
<211> 6
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide


<220>
<221> MOD_RES
<222> (1)..(1)
<223> Any hydropathic residue

<220>
<221> MOD_RES
<222> (2)..(3)
<223> Lys or Arg

<220>
<221> MOD_RES
<222> (4)..(4)
<223> Any hydropathic residue 

<220>
<221> MOD_RES
<222> (5)..(5)
<223> Lys or Arg

<220>
<221> MOD_RES
<222> (6)..(6)
<223> Any hydropathic residue 

<400> 6
Xaa Xaa Xaa Xaa Xaa Xaa 
1               5       


<210> 7
<211> 9
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 7
Gly Arg Lys Ile Arg Met Arg Cys Arg 
1               5                   


<210> 8
<211> 1608
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 8
atgggtgctg atgatgttgt tgattcctct aagtctttcg tgatggaaaa tttctcgtcc       60

tatcacggta ccaagcctgg ctatgtggat agcattcaaa agggtattca aaaaccgaag      120

tctggtaccc agggcaacta cgatgacgat tggaaagagt tttacagcac cgacaacaaa      180

tatgacgcgg caggctacag cgttgataat gaaaatccgc tgagcggtaa ggctggcggc      240

gtcgttaagg ttacctatcc gggtctgacg aaagtgctgg ccctgaaagt tgacaatgct      300

gaaaccatca aaaaagaact gggtctgagc ttgaccgagc cgctgatgga acaggttggt      360

actgaagaat tcattaaacg ttttggtgac ggcgcgagcc gtgttgtgct gtccctgccg      420

tttgccgagg gttctagctc cgtggagtat atcaacaatt gggaacaggc gaaagcgttg      480

agcgtcgagc tggaaatcaa tttcgagact cgtggtaagc gtggccaaga tgcgatgtac      540

gagtacatgg cccaggcatg tgcgggtaac cgcgtccgtc gcagcgtcgg cagctccctg      600

agctgcatta acctggactg ggacgtgatc cgcgacaaga ctaagaccaa gattgagagc      660

ctgaaagagc acggtccgat taagaacaaa atgtccgagt ctccgaacaa aacggtgagc      720

gaagaaaaag ccaaacagta tctggaagaa ttccatcaga ccgccctgga gcacccagag      780

ctgagcgagc tgaaaaccgt caccggcacg aatccggttt ttgcgggtgc gaactacgcg      840

gcatgggcag tcaatgttgc gcaagtcatc gacagcgaaa cggctgataa cttggagaaa      900

accaccgcgg cactgagcat tctgccgggc atcggtagcg ttatgggcat tgcggacggt      960

gccgtgcatc acaataccga agaaattgtc gcgcagagca tcgcattgtc tagcctgatg     1020

gttgcacagg ccattccgct ggtaggcgaa ttggtggata tcggtttcgc ggcttacaat     1080

ttcgttgagt cgatcattaa cctgtttcaa gtcgttcaca atagctataa ccgtccggca     1140

tacagcccgg gtcataagac gcaaccgttt ctgcatgatg gctatgccgt gagctggaac     1200

acggtcgagg attcgattat ccgtaccggt tttcagggtg agagcggtca cgacatcaaa     1260

atcaccgcgg agaacacgcc gctgcctatt gcgggcgtcc tgctgccgac gatcccgggc     1320

aaactggacg ttaacaagag caagacccat atcagcgtca acggtcgtaa gattcgcatg     1380

cgttgtcgtg caatcgacgg tgacgtgacg ttctgccgcc caaaaagccc ggtgtacgtg     1440

ggtaacggcg tgcacgcgaa tctgcatgtc gcgttccacc gctcctcaag cgagaaaatc     1500

cacagcaatg aaattagcag cgacagcatt ggtgtgttgg gctaccaaaa gaccgtggat     1560

cacaccaagg ttaatagcaa gctgagcctg ttctttgaga tcaaaagc                  1608


<210> 9
<211> 1608
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 9
atgggtgccg atgacgtggt tgactcttcc aaaagcttcg tcatggaaaa cttcagctcc       60

tatcacggca ctaaaccggg ttatgtcgac agcatccaga aaggcatcca gaaaccgaaa      120

tctggcactc agggtaacta tgacgacgac tggaaagagt tctactctac cgacaacaaa      180

tacgacgcgg ctggttattc tgtggacaac gaaaacccgc tgtctggtaa agctggtggt      240

gttgttaaag tgacctaccc gggtctgacc aaagttctgg ctctgaaagt ggacaacgcc      300

gaaaccatca aaaaagaact gggtctgtct ctgaccgaac cgctgatgga acaggtaggt      360

accgaggaat tcatcaaacg ttttggtgat ggtgcgtccc gtgttgtact gtctctgcca      420

tttgccgaag gttctagctc tgtcgagtac atcaacaact gggagcaggc caaagctctg      480

tctgtggaac tggaaatcaa cttcgagacc cgtggtaaac gtggtcagga cgcaatgtat      540

gaatacatgg cacaggcttg cgcgggtaac cgtgtacgtc gttctgtagg ttcttccctg      600

tcttgcatca acctggactg ggatgtcatc cgtgacaaaa ccaaaaccaa aatcgagtcc      660

ctgaaagagc acggtccgat caaaaacaaa atgagcgaat ctccgaacaa aacggtctct      720

gaggaaaaag cgaaacagta cctggaagaa ttccatcaga ccgccctgga acacccggaa      780

ctgtctgaac tgaaaaccgt taccggtact aacccggttt tcgcaggtgc taactacgca      840

gcgtgggcgg ttaacgtagc ccaggtaatc gattccgaaa ccgcagacaa cctggaaaaa      900

acgactgcgg ctctgtctat tctgccgggt attggtagcg tgatgggtat tgcagatggt      960

gcagttcacc acaacacgga agaaatcgtt gcgcagtcta tcgctctgtc ttctctgatg     1020

gtagcacagg cgatcccgct ggttggtgaa ctggttgaca ttggcttcgc ggcctacaac     1080

ttcgttgaat ccatcatcaa cctgttccag gttgtgcaca actcttacaa ccgtccagct     1140

tactctccgg gtcacaaaac ccagccgttc ctgcacgacg gttatgcggt ttcttggaac     1200

accgttgaag acagcatcat ccgtactggt ttccagggtg aatctggcca cgacatcaaa     1260

atcactgctg aaaacacccc gctgccgatc gcaggtgttc tcctgccaac tattccgggt     1320

aaactggacg tgaacaaatc caaaacgcac atctccgtga acggtcgtaa aatccgcatg     1380

cgttgtcgtg cgattgatgg tgacgttact ttctgtcgtc cgaaatctcc ggtctacgta     1440

ggtaacggtg tacatgctaa cctccatgta gcgttccacc gttcttcttc cgagaaaatc     1500

cactccaacg agatctctag cgactctatc ggtgttctgg gttaccagaa aaccgttgac     1560

cacaccaaag tgaactccaa actcagcctg ttcttcgaaa tcaaatct                  1608


<210> 10
<211> 49
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 10
gagctctaag aaggagatat acatgggtgc cgatgacgtg gttgactct                   49


<210> 11
<211> 50
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 11
gagctcttaa gaaggagata tacatgggtg ccgatgacgt ggttgactct                  50


<210> 12
<211> 50
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 12
gagctctaag aaggagatat acaatgggtg ccgatgacgt ggttgactct                  50


<210> 13
<211> 51
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 13
gagctctaag aaggagatat acacatgggt gccgatgacg tggttgactc t                51


<210> 14
<211> 54
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 14
gagctctaag aaggagatat accatatatg ggtgccgatg acgtggttga ctct             54


<210> 15
<211> 112
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 15
tctagaaata attttgttta actttaagaa ggagatatac atatggctag catgactggt       60

ggacagcaaa tgggtcggga tccgaattcg agctctaaga aggagatata cc              112


<210> 16
<211> 73
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 16
tctagaaata attttgttta actttaagaa ggagatatac atatggctag catgactggt       60

aaggagatat acc                                                          73


<210> 17
<211> 93
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 17
tctagaaata attttgttta actttaagaa ggagatatac atatggctag catgactggt       60

gcgmayccat tcagtgaaga agragsttya ttt                                    93


