                         SEQUENCE LISTING

<110>  LORVEN BIOLOGICS PRIVATE LIMITED
 
<120>  NUCLEIC ACID ENCODING CRM197 AND PROCESS FOR IMPROVED EXPRESSION 
       THEREOF

<130>  PCT1817

<160>  2     

<170>  PatentIn version 3.5

<210>  1
<211>  1698
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Nucleotide sequence encoding CRM197 fused to a nucleotide 
       sequence encoding signal peptide of B. amyloliquefaciens.

<400>  1
atgattcaga aacgcaaacg caccgtgagc tttcgcctgg tgctgatgtg caccctgctg       60

tttgtgagcc tgccgattac caaaaccagc gcgggcgcgg atgatgtggt ggatagcagc      120

aaaagctttg tgatggaaaa ctttagcagc tatcatggca ccaaaccggg ctatgtggat      180

agcattcaga aaggcattca gaaaccgaaa agcggcaccc agggcaacta tgatgatgat      240

tggaaagaat tttatagcac cgataacaaa tatgatgcgg cgggctatag cgtggataac      300

gaaaacccgc tgagcggcaa agcgggcggc gtggtgaaag tgacctatcc gggcctgacc      360

aaagtgctgg cgctgaaagt ggataacgcg gaaaccatta aaaaagaact gggcctgagc      420

ctgaccgaac cgctgatgga acaggtgggc accgaagaat ttattaaacg ctttggcgat      480

ggcgcgagcc gcgtggtgct gagcctgccg tttgcggaag gcagcagcag cgtggaatat      540

attaacaact gggaacaggc gaaagcgctg agcgtggaac tggaaattaa ctttgaaacc      600

cgcggcaaac gcggccagga tgcgatgtat gaatatatgg cgcaggcgtg cgcgggcaac      660

cgcgtgcgcc gcagcgtggg cagcagcctg agctgcatta acctggattg ggatgtgatt      720

cgcgataaaa ccaaaaccaa aattgaaagc ctgaaagaac atggcccgat taaaaacaaa      780

atgagcgaaa gcccgaacaa aaccgtgagc gaagaaaaag cgaaacagta tctggaagaa      840

tttcatcaga ccgcgctgga acatccggaa ctgagcgaac tgaaaaccgt gaccggcacc      900

aacccggtgt ttgcgggcgc gaactatgcg gcgtgggcgg tgaacgtggc gcaggtgatt      960

gatagcgaaa ccgcggataa cctggaaaaa accaccgcgg cgctgagcat tctgccgggc     1020

attggcagcg tgatgggcat tgcggatggc gcggtgcatc ataacaccga agaaattgtg     1080

gcgcagagca ttgcgctgag cagcctgatg gtggcgcagg cgattccgct ggtgggcgaa     1140

ctggtggata ttggctttgc ggcgtataac tttgtggaaa gcattattaa cctgtttcag     1200

gtggtgcata acagctataa ccgcccggcg tatagcccgg gccataaaac ccagccgttt     1260

ctgcatgatg gctatgcggt gagctggaac accgtggaag atagcattat tcgcaccggc     1320

tttcagggcg aaagcggcca tgatattaaa attaccgcgg aaaacacccc gctgccgatt     1380

gcgggcgtgc tgctgccgac cattccgggc aaactggatg tgaacaaaag caaaacccat     1440

attagcgtga acggccgcaa aattcgcatg cgctgccgcg cgattgatgg cgatgtgacc     1500

ttttgccgcc cgaaaagccc ggtgtatgtg ggcaacggcg tgcatgcgaa cctgcatgtg     1560

gcgtttcatc gcagcagcag cgaaaaaatt catagcaacg aaattagcag cgatagcatt     1620

ggcgtgctgg gctatcagaa aaccgtggat cataccaaag tgaacagcaa actgagcctg     1680

ttttttgaaa ttaaaagc                                                   1698


<210>  2
<211>  566
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  CRM197 of Corynebacterium diphtheriae fused to signal peptide of 
       Bacillus amyloliquefaciens

<400>  2

Met Ile Gln Lys Arg Lys Arg Thr Val Ser Phe Arg Leu Val Leu Met 
1               5                   10                  15      


Cys Thr Leu Leu Phe Val Ser Leu Pro Ile Thr Lys Thr Ser Ala Gly 
            20                  25                  30          


Ala Asp Asp Val Val Asp Ser Ser Lys Ser Phe Val Met Glu Asn Phe 
        35                  40                  45              


Ser Ser Tyr His Gly Thr Lys Pro Gly Tyr Val Asp Ser Ile Gln Lys 
    50                  55                  60                  


Gly Ile Gln Lys Pro Lys Ser Gly Thr Gln Gly Asn Tyr Asp Asp Asp 
65                  70                  75                  80  


Trp Lys Glu Phe Tyr Ser Thr Asp Asn Lys Tyr Asp Ala Ala Gly Tyr 
                85                  90                  95      


Ser Val Asp Asn Glu Asn Pro Leu Ser Gly Lys Ala Gly Gly Val Val 
            100                 105                 110         


Lys Val Thr Tyr Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val Asp 
        115                 120                 125             


Asn Ala Glu Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu Pro 
    130                 135                 140                 


Leu Met Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly Asp 
145                 150                 155                 160 


Gly Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser Ser 
                165                 170                 175     


Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser Val 
            180                 185                 190         


Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln Asp Ala 
        195                 200                 205             


Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg Val Arg Arg 
    210                 215                 220                 


Ser Val Gly Ser Ser Leu Ser Cys Ile Asn Leu Asp Trp Asp Val Ile 
225                 230                 235                 240 


Arg Asp Lys Thr Lys Thr Lys Ile Glu Ser Leu Lys Glu His Gly Pro 
                245                 250                 255     


Ile Lys Asn Lys Met Ser Glu Ser Pro Asn Lys Thr Val Ser Glu Glu 
            260                 265                 270         


Lys Ala Lys Gln Tyr Leu Glu Glu Phe His Gln Thr Ala Leu Glu His 
        275                 280                 285             


Pro Glu Leu Ser Glu Leu Lys Thr Val Thr Gly Thr Asn Pro Val Phe 
    290                 295                 300                 


Ala Gly Ala Asn Tyr Ala Ala Trp Ala Val Asn Val Ala Gln Val Ile 
305                 310                 315                 320 


Asp Ser Glu Thr Ala Asp Asn Leu Glu Lys Thr Thr Ala Ala Leu Ser 
                325                 330                 335     


Ile Leu Pro Gly Ile Gly Ser Val Met Gly Ile Ala Asp Gly Ala Val 
            340                 345                 350         


His His Asn Thr Glu Glu Ile Val Ala Gln Ser Ile Ala Leu Ser Ser 
        355                 360                 365             


Leu Met Val Ala Gln Ala Ile Pro Leu Val Gly Glu Leu Val Asp Ile 
    370                 375                 380                 


Gly Phe Ala Ala Tyr Asn Phe Val Glu Ser Ile Ile Asn Leu Phe Gln 
385                 390                 395                 400 


Val Val His Asn Ser Tyr Asn Arg Pro Ala Tyr Ser Pro Gly His Lys 
                405                 410                 415     


Thr Gln Pro Phe Leu His Asp Gly Tyr Ala Val Ser Trp Asn Thr Val 
            420                 425                 430         


Glu Asp Ser Ile Ile Arg Thr Gly Phe Gln Gly Glu Ser Gly His Asp 
        435                 440                 445             


Ile Lys Ile Thr Ala Glu Asn Thr Pro Leu Pro Ile Ala Gly Val Leu 
    450                 455                 460                 


Leu Pro Thr Ile Pro Gly Lys Leu Asp Val Asn Lys Ser Lys Thr His 
465                 470                 475                 480 


Ile Ser Val Asn Gly Arg Lys Ile Arg Met Arg Cys Arg Ala Ile Asp 
                485                 490                 495     


Gly Asp Val Thr Phe Cys Arg Pro Lys Ser Pro Val Tyr Val Gly Asn 
            500                 505                 510         


Gly Val His Ala Asn Leu His Val Ala Phe His Arg Ser Ser Ser Glu 
        515                 520                 525             


Lys Ile His Ser Asn Glu Ile Ser Ser Asp Ser Ile Gly Val Leu Gly 
    530                 535                 540                 


Tyr Gln Lys Thr Val Asp His Thr Lys Val Asn Ser Lys Leu Ser Leu 
545                 550                 555                 560 


Phe Phe Glu Ile Lys Ser 
                565     


