                         SEQUENCE LISTING

<110>  TOTAL MARKETING SERVICES
       Qingdao Institute of Bioenergy and Bioprocess Technology 
       (QIBEBT)
 
<120>  PRODUCTION OF ALPHA-OLEFINS

<130>  TOTAL-207-PCT

<150>  EP15174554.4
<151>  2015-06-30

<160>  13

<170>  PatentIn version 3.5

<210>  1
<211>  1269
<212>  DNA
<213>  Jeotgalicoccus sp.

<400>  1
atggcaacac ttaagaggga taagggctta gataatactt tgaaagtatt aaagcaaggt       60

tatctttaca caacaaatca gagaaatcgt ctaaacacat cagttttcca aactaaagca      120

ctcggtggta aaccattcgt agttgtgact ggtaaggaag gcgctgaaat gttctacaac      180

aatgatgttg ttcaacgtga aggcatgtta ccaaaacgta tcgttaatac gctttttggt      240

aaaggtgcaa tccatacggt agatggtaaa aaacacgtag acagaaaagc attgttcatg      300

agcttgatga ctgaaggtaa cttgaattat gtacgagaat taacgcgtac attatggcat      360

gcgaacacac aacgtatgga aagtatggat gaggtaaata tttaccgtga atctatcgta      420

ctacttacaa aagtaggaac acgttgggca ggcgttcaag caccacctga agatatcgaa      480

agaatcgcaa cagacatgga catcatgatc gattcattta gagcacttgg tggtgccttt      540

aaaggttaca aggcatcaaa agaagcacgt cgtcgtgttg aagattggtt agaagaacaa      600

attattgaga ctcgtaaagg gaatattcat ccaccagaag gtacagcact ttacgaattt      660

gcacattggg aagactactt aggtaaccca atggactcaa gaacttgtgc gattgactta      720

atgaacacat tccgcccatt aatcgcaatc aacagattcg tttcattcgg tttacacgcg      780

atgaacgaaa acccaatcac acgtgaaaaa attaaatcag aacctgacta tgcatataaa      840

ttcgctcaag aagttcgtcg ttactatcca ttcgttccat tccttccagg taaagcgaaa      900

gtagacatcg acttccaagg cgttacaatt cctgcaggtg taggtcttgc attagatgtt      960

tatggtacaa cgcatgatga atcactttgg gacgatccaa atgaattccg cccagaaaga     1020

ttcgaaactt gggacggatc accatttgac cttattccac aaggtggtgg agattactgg     1080

acaaatcacc gttgtgcagg tgaatggatc acagtaatca tcatggaaga aacaatgaaa     1140

tactttgcag aaaaaataac ttatgatgtt ccagaacaag atttagaagt ggacttaaac     1200

agtatcccag gatacgttaa gagtggcttt gtaatcaaaa atgttcgcga agttgtagac     1260

agaacataa                                                             1269


<210>  2
<211>  422
<212>  PRT
<213>  Jeotgalicoccus sp.

<400>  2

Met Ala Thr Leu Lys Arg Asp Lys Gly Leu Asp Asn Thr Leu Lys Val 
1               5                   10                  15      


Leu Lys Gln Gly Tyr Leu Tyr Thr Thr Asn Gln Arg Asn Arg Leu Asn 
            20                  25                  30          


Thr Ser Val Phe Gln Thr Lys Ala Leu Gly Gly Lys Pro Phe Val Val 
        35                  40                  45              


Val Thr Gly Lys Glu Gly Ala Glu Met Phe Tyr Asn Asn Asp Val Val 
    50                  55                  60                  


Gln Arg Glu Gly Met Leu Pro Lys Arg Ile Val Asn Thr Leu Phe Gly 
65                  70                  75                  80  


Lys Gly Ala Ile His Thr Val Asp Gly Lys Lys His Val Asp Arg Lys 
                85                  90                  95      


Ala Leu Phe Met Ser Leu Met Thr Glu Gly Asn Leu Asn Tyr Val Arg 
            100                 105                 110         


Glu Leu Thr Arg Thr Leu Trp His Ala Asn Thr Gln Arg Met Glu Ser 
        115                 120                 125             


Met Asp Glu Val Asn Ile Tyr Arg Glu Ser Ile Val Leu Leu Thr Lys 
    130                 135                 140                 


Val Gly Thr Arg Trp Ala Gly Val Gln Ala Pro Pro Glu Asp Ile Glu 
145                 150                 155                 160 


Arg Ile Ala Thr Asp Met Asp Ile Met Ile Asp Ser Phe Arg Ala Leu 
                165                 170                 175     


Gly Gly Ala Phe Lys Gly Tyr Lys Ala Ser Lys Glu Ala Arg Arg Arg 
            180                 185                 190         


Val Glu Asp Trp Leu Glu Glu Gln Ile Ile Glu Thr Arg Lys Gly Asn 
        195                 200                 205             


Ile His Pro Pro Glu Gly Thr Ala Leu Tyr Glu Phe Ala His Trp Glu 
    210                 215                 220                 


Asp Tyr Leu Gly Asn Pro Met Asp Ser Arg Thr Cys Ala Ile Asp Leu 
225                 230                 235                 240 


Met Asn Thr Phe Arg Pro Leu Ile Ala Ile Asn Arg Phe Val Ser Phe 
                245                 250                 255     


Gly Leu His Ala Met Asn Glu Asn Pro Ile Thr Arg Glu Lys Ile Lys 
            260                 265                 270         


Ser Glu Pro Asp Tyr Ala Tyr Lys Phe Ala Gln Glu Val Arg Arg Tyr 
        275                 280                 285             


Tyr Pro Phe Val Pro Phe Leu Pro Gly Lys Ala Lys Val Asp Ile Asp 
    290                 295                 300                 


Phe Gln Gly Val Thr Ile Pro Ala Gly Val Gly Leu Ala Leu Asp Val 
305                 310                 315                 320 


Tyr Gly Thr Thr His Asp Glu Ser Leu Trp Asp Asp Pro Asn Glu Phe 
                325                 330                 335     


Arg Pro Glu Arg Phe Glu Thr Trp Asp Gly Ser Pro Phe Asp Leu Ile 
            340                 345                 350         


Pro Gln Gly Gly Gly Asp Tyr Trp Thr Asn His Arg Cys Ala Gly Glu 
        355                 360                 365             


Trp Ile Thr Val Ile Ile Met Glu Glu Thr Met Lys Tyr Phe Ala Glu 
    370                 375                 380                 


Lys Ile Thr Tyr Asp Val Pro Glu Gln Asp Leu Glu Val Asp Leu Asn 
385                 390                 395                 400 


Ser Ile Pro Gly Tyr Val Lys Ser Gly Phe Val Ile Lys Asn Val Arg 
                405                 410                 415     


Glu Val Val Asp Arg Thr 
            420         


<210>  3
<211>  1254
<212>  DNA
<213>  Bacillus subtilis

<400>  3
atgaatgagc agattccaca tgacaaaagt ctcgataaca gtctgacact gctgaaggaa       60

gggtatttat ttattaaaaa cagaacagag cgctacaatt cagatctgtt tcaggcccgt      120

ttgttgggaa aaaactttat ttgcatgact ggcgctgagg cggcgaaggt gttttatgat      180

acggatcgat tccagcggca gaacgctttg cctaagcggg tgcagaaatc gctgtttggt      240

gttaatgcga ttcagggaat ggatggcagc gcgcatatcc atcggaagat gctttttctg      300

tcattgatga caccgccgca tcaaaaacgt ttggctgagt tgatgacaga ggagtggaaa      360

gcagcagtca caagatggga gaaggcagat gaggttgtgt tatttgaaga agcaaaagaa      420

atcctgtgcc gggtagcgtg ctattgggca ggtgttccgt tgaaggaaac ggaagtcaaa      480

gagagagcgg atgacttcat tgacatggtc gacgcgttcg gtgctgtggg accgcggcat      540

tggaaaggaa gaagagcaag gccgcgtgcg gaagagtgga ttgaagtcat gattgaagat      600

gctcgtgccg gcttgctgaa aacgacttcc ggaacagcgc tgcatgaaat ggcttttcac      660

acacaagaag atggaagcca gctggattcc cgcatggcag ccattgagct gattaatgta      720

ctgcggccta ttgtcgccat ttcttacttt ctggtgtttt cagctttggc gcttcatgag      780

catccgaagt ataaggaatg gctgcggtct ggaaacagcc gggaaagaga aatgtttgtg      840

caggaggtcc gcagatatta tccgttcggc ccgtttttag gggcgcttgt caaaaaagat      900

tttgtatgga ataactgtga gtttaagaag ggcacatcgg tgctgcttga tttatatgga      960

acgaaccacg accctcgtct atgggatcat cccgatgaat tccggccgga acgatttgcg     1020

gagcgggaag aaaatctgtt tgatatgatt cctcaaggcg gggggcacgc cgagaaaggc     1080

caccgctgtc caggggaagg cattacaatt gaagtcatga aagcgagcct ggatttcctc     1140

gtccatcaga ttgaatacga tgttccggaa caatcactgc attacagtct cgccagaatg     1200

ccatcattgc ctgaaagcgg cttcgtaatg agcggaatca gacgaaaaag ttaa           1254


<210>  4
<211>  417
<212>  PRT
<213>  Bacillus subtilis

<400>  4

Met Asn Glu Gln Ile Pro His Asp Lys Ser Leu Asp Asn Ser Leu Thr 
1               5                   10                  15      


Leu Leu Lys Glu Gly Tyr Leu Phe Ile Lys Asn Arg Thr Glu Arg Tyr 
            20                  25                  30          


Asn Ser Asp Leu Phe Gln Ala Arg Leu Leu Gly Lys Asn Phe Ile Cys 
        35                  40                  45              


Met Thr Gly Ala Glu Ala Ala Lys Val Phe Tyr Asp Thr Asp Arg Phe 
    50                  55                  60                  


Gln Arg Gln Asn Ala Leu Pro Lys Arg Val Gln Lys Ser Leu Phe Gly 
65                  70                  75                  80  


Val Asn Ala Ile Gln Gly Met Asp Gly Ser Ala His Ile His Arg Lys 
                85                  90                  95      


Met Leu Phe Leu Ser Leu Met Thr Pro Pro His Gln Lys Arg Leu Ala 
            100                 105                 110         


Glu Leu Met Thr Glu Glu Trp Lys Ala Ala Val Thr Arg Trp Glu Lys 
        115                 120                 125             


Ala Asp Glu Val Val Leu Phe Glu Glu Ala Lys Glu Ile Leu Cys Arg 
    130                 135                 140                 


Val Ala Cys Tyr Trp Ala Gly Val Pro Leu Lys Glu Thr Glu Val Lys 
145                 150                 155                 160 


Glu Arg Ala Asp Asp Phe Ile Asp Met Val Asp Ala Phe Gly Ala Val 
                165                 170                 175     


Gly Pro Arg His Trp Lys Gly Arg Arg Ala Arg Pro Arg Ala Glu Glu 
            180                 185                 190         


Trp Ile Glu Val Met Ile Glu Asp Ala Arg Ala Gly Leu Leu Lys Thr 
        195                 200                 205             


Thr Ser Gly Thr Ala Leu His Glu Met Ala Phe His Thr Gln Glu Asp 
    210                 215                 220                 


Gly Ser Gln Leu Asp Ser Arg Met Ala Ala Ile Glu Leu Ile Asn Val 
225                 230                 235                 240 


Leu Arg Pro Ile Val Ala Ile Ser Tyr Phe Leu Val Phe Ser Ala Leu 
                245                 250                 255     


Ala Leu His Glu His Pro Lys Tyr Lys Glu Trp Leu Arg Ser Gly Asn 
            260                 265                 270         


Ser Arg Glu Arg Glu Met Phe Val Gln Glu Val Arg Arg Tyr Tyr Pro 
        275                 280                 285             


Phe Gly Pro Phe Leu Gly Ala Leu Val Lys Lys Asp Phe Val Trp Asn 
    290                 295                 300                 


Asn Cys Glu Phe Lys Lys Gly Thr Ser Val Leu Leu Asp Leu Tyr Gly 
305                 310                 315                 320 


Thr Asn His Asp Pro Arg Leu Trp Asp His Pro Asp Glu Phe Arg Pro 
                325                 330                 335     


Glu Arg Phe Ala Glu Arg Glu Glu Asn Leu Phe Asp Met Ile Pro Gln 
            340                 345                 350         


Gly Gly Gly His Ala Glu Lys Gly His Arg Cys Pro Gly Glu Gly Ile 
        355                 360                 365             


Thr Ile Glu Val Met Lys Ala Ser Leu Asp Phe Leu Val His Gln Ile 
    370                 375                 380                 


Glu Tyr Asp Val Pro Glu Gln Ser Leu His Tyr Ser Leu Ala Arg Met 
385                 390                 395                 400 


Pro Ser Leu Pro Glu Ser Gly Phe Val Met Ser Gly Ile Arg Arg Lys 
                405                 410                 415     


Ser 
    


<210>  5
<211>  1311
<212>  DNA
<213>  Alicyclobacillus acidocaldarius

<400>  5
atgaatcagt gcattccgcg cgatcgaacg tttgacagca gcctcgcctt gataaaggaa       60

gggtatttgt tcatcaaaaa tcgagttgat caataccaat ccgacatctt cgaagcgcgt      120

ctcctcctgg aaaatgtggt atgcatgcac ggagcagagg cggcaaaact cttctacaat      180

acggaactgt ttcaacgcca aggtgctctt ccgaagcggg ttcaaaagac gcttttcgga      240

gaaaacgcca tccaaaccct tgatggtaca gcgcatcttc accgtaagca gctgtttctg      300

tcgttgttga cgccggatca agaaaaatcc cttgcgacgc tcgcgacaac gcagtggagg      360

gagtgcgcga aggtatggga gaacgcggat agggttgtgc tatttgaaga ggccaagcgg      420

atgttatgtc ggatcgcatg tcagtggacc ggggttccgc tggatgaatc ggaggtgtca      480

aagcgggccg acgattttgg ggcgatggtg gacgcgtttg gagcggttgg tccgcgacat      540

tggaaaggcc ggagagctcg ggccagagca gaagcatggc tccggcagat gattgacgag      600

atacgaatcg gattgcgtag tgtagatgaa catacgccgc tccatgtggt ggccttttgg      660

cgtgacgtga atggaaacct cttggatgct cagatggttg caatcgagtt aatcaatctg      720

ctacgaccca tcgtagctat ttctactttc atcacgtttt cagccctggc cctgcacgaa      780

cacccgacat ggcgagaccg attgaaggcg cgcaatgaag cggatatcga gatgtttgtg      840

caagaggttc gtcgctacta tccgttcgcg ccatttctcg gtgccagagt gaaaaaggat      900

tttgtgtgga ggggatacga atttaaaaga gggacccttg tgttgctgga tgtgtatgga      960

acccatcatg atgcccgcct ctgggattcc ccaaatgagt ttcgacccga acgattcatg     1020

agaaaaacag ttgggccgtt tgatttgatt cctcaaggtg gaggggactc tcacaccggt     1080

catcgttgcc ctggtgaagg cgccaccatc gagattatga aggcgagcgt ggattttctg     1140

gttaaccaaa ttgacttcga agtgcccgct caggacctca gttacagatt ggatgttatg     1200

ccgacgttgc caaagagcgg atttgtgctg acccatgttc atcggaagtt catagcttct     1260

ccgaccattg ctacacctaa tggttctgaa gctcttcctt cagaagtcta a              1311


<210>  6
<211>  436
<212>  PRT
<213>  Alicyclobacillus acidocaldarius

<400>  6

Met Asn Gln Cys Ile Pro Arg Asp Arg Thr Phe Asp Ser Ser Leu Ala 
1               5                   10                  15      


Leu Ile Lys Glu Gly Tyr Leu Phe Ile Lys Asn Arg Val Asp Gln Tyr 
            20                  25                  30          


Gln Ser Asp Ile Phe Glu Ala Arg Leu Leu Leu Glu Asn Val Val Cys 
        35                  40                  45              


Met His Gly Ala Glu Ala Ala Lys Leu Phe Tyr Asn Thr Glu Leu Phe 
    50                  55                  60                  


Gln Arg Gln Gly Ala Leu Pro Lys Arg Val Gln Lys Thr Leu Phe Gly 
65                  70                  75                  80  


Glu Asn Ala Ile Gln Thr Leu Asp Gly Thr Ala His Leu His Arg Lys 
                85                  90                  95      


Gln Leu Phe Leu Ser Leu Leu Thr Pro Asp Gln Glu Lys Ser Leu Ala 
            100                 105                 110         


Thr Leu Ala Thr Thr Gln Trp Arg Glu Cys Ala Lys Val Trp Glu Asn 
        115                 120                 125             


Ala Asp Arg Val Val Leu Phe Glu Glu Ala Lys Arg Met Leu Cys Arg 
    130                 135                 140                 


Ile Ala Cys Gln Trp Thr Gly Val Pro Leu Asp Glu Ser Glu Val Ser 
145                 150                 155                 160 


Lys Arg Ala Asp Asp Phe Gly Ala Met Val Asp Ala Phe Gly Ala Val 
                165                 170                 175     


Gly Pro Arg His Trp Lys Gly Arg Arg Ala Arg Ala Arg Ala Glu Ala 
            180                 185                 190         


Trp Leu Arg Gln Met Ile Asp Glu Ile Arg Ile Gly Leu Arg Ser Val 
        195                 200                 205             


Asp Glu His Thr Pro Leu His Val Val Ala Phe Trp Arg Asp Val Asn 
    210                 215                 220                 


Gly Asn Leu Leu Asp Ala Gln Met Val Ala Ile Glu Leu Ile Asn Leu 
225                 230                 235                 240 


Leu Arg Pro Ile Val Ala Ile Ser Thr Phe Ile Thr Phe Ser Ala Leu 
                245                 250                 255     


Ala Leu His Glu His Pro Thr Trp Arg Asp Arg Leu Lys Ala Arg Asn 
            260                 265                 270         


Glu Ala Asp Ile Glu Met Phe Val Gln Glu Val Arg Arg Tyr Tyr Pro 
        275                 280                 285             


Phe Ala Pro Phe Leu Gly Ala Arg Val Lys Lys Asp Phe Val Trp Arg 
    290                 295                 300                 


Gly Tyr Glu Phe Lys Arg Gly Thr Leu Val Leu Leu Asp Val Tyr Gly 
305                 310                 315                 320 


Thr His His Asp Ala Arg Leu Trp Asp Ser Pro Asn Glu Phe Arg Pro 
                325                 330                 335     


Glu Arg Phe Met Arg Lys Thr Val Gly Pro Phe Asp Leu Ile Pro Gln 
            340                 345                 350         


Gly Gly Gly Asp Ser His Thr Gly His Arg Cys Pro Gly Glu Gly Ala 
        355                 360                 365             


Thr Ile Glu Ile Met Lys Ala Ser Val Asp Phe Leu Val Asn Gln Ile 
    370                 375                 380                 


Asp Phe Glu Val Pro Ala Gln Asp Leu Ser Tyr Arg Leu Asp Val Met 
385                 390                 395                 400 


Pro Thr Leu Pro Lys Ser Gly Phe Val Leu Thr His Val His Arg Lys 
                405                 410                 415     


Phe Ile Ala Ser Pro Thr Ile Ala Thr Pro Asn Gly Ser Glu Ala Leu 
            420                 425                 430         


Pro Ser Glu Val 
        435     


<210>  7
<211>  1362
<212>  DNA
<213>  Staphylococcus massiliensis

<400>  7
atgtttgtag attcgatact tgtgttaaga ttaaatttat taaaaacggg tatacaatta       60

gaaatgaaaa atgggggaat caaagtggca aagaaactac ctaaggttaa aggcctagat      120

aacacagtag acattattaa aggcgggtat acatacgtac ctggcaaatt agaagaattt      180

gattctaaag catttgaagt acgcgcatta ggcggtaaga aaattgctgt tatgagcggt      240

aaagaagcgg cagaaatttt ctatgataat gaaaaaatgg aaagacaagg tactttacca      300

aaacgtatcg taaacacttt atttggtaaa ggtgcaattc atacaactgc tggtaagaag      360

cacgttgacc gtaaagcttt atttatgtca cttatgacag atgaaaatct taactactta      420

cgtgaattaa cacgtaatta ttggttcatg aatactgaac gtatgcaaag catggataaa      480

gttaacgtat ataacgaatc aatttatatg ttaactaaaa tcggcttccg ttgggctggt      540

atcatccaaa cgcctgaaga agcagaacaa aatgcgaaag acatggatac tatgattaac      600

tcattcgtat ctttaggttc agcttacaaa ggttataaga aagctaaaaa agcacgtaaa      660

cgtgttgaag atttcttaga aaaacaaatt atcgatgtgc gtaaaggtaa attacaccct      720

gaagaaggta ctgcgttata cgaattcgcg cattgggaag atttaaacga taacccaatg      780

gattctcact tatgtgcagt agacttaatg aacgttgtgc gcccattagc tgcaatcaac      840

cgtttcatca gctatggtgt taaagtatta atcgaattcg atcaagaaaa agaaaaatta      900

cgtcttgaaa ataatgaaga ctatgcgtat aaattcgctc aagaagtacg tcgtatcttc      960

ccattcgtac catacttacc aggtagagca gctgttgatt tagaatatga cggctacaaa     1020

atccctgcag gtatgatgac agcattagat gtttatggta cgacacatga tgaagattta     1080

tgggaaaacc cagaccaatt caatcctaac cgttttgata actgggacgg tagcccattc     1140

gacttaattc cacaaggtgg cggtgacttc tatacgaacc acagatgtgc tggtgagtgg     1200

atcacagtta tcattatgga agaaacaatg aaatatttcg cgaataagat tgaatttgat     1260

gtaccgtctc aagatttatc agttaagctt gataaattac caggtaacgt aacaagcggt     1320

acaatcatta gtaatgtacg tccacgtgtt gcgcgtaaat aa                        1362


<210>  8
<211>  453
<212>  PRT
<213>  Staphylococcus massiliensis

<400>  8

Met Phe Val Asp Ser Ile Leu Val Leu Arg Leu Asn Leu Leu Lys Thr 
1               5                   10                  15      


Gly Ile Gln Leu Glu Met Lys Asn Gly Gly Ile Lys Val Ala Lys Lys 
            20                  25                  30          


Leu Pro Lys Val Lys Gly Leu Asp Asn Thr Val Asp Ile Ile Lys Gly 
        35                  40                  45              


Gly Tyr Thr Tyr Val Pro Gly Lys Leu Glu Glu Phe Asp Ser Lys Ala 
    50                  55                  60                  


Phe Glu Val Arg Ala Leu Gly Gly Lys Lys Ile Ala Val Met Ser Gly 
65                  70                  75                  80  


Lys Glu Ala Ala Glu Ile Phe Tyr Asp Asn Glu Lys Met Glu Arg Gln 
                85                  90                  95      


Gly Thr Leu Pro Lys Arg Ile Val Asn Thr Leu Phe Gly Lys Gly Ala 
            100                 105                 110         


Ile His Thr Thr Ala Gly Lys Lys His Val Asp Arg Lys Ala Leu Phe 
        115                 120                 125             


Met Ser Leu Met Thr Asp Glu Asn Leu Asn Tyr Leu Arg Glu Leu Thr 
    130                 135                 140                 


Arg Asn Tyr Trp Phe Met Asn Thr Glu Arg Met Gln Ser Met Asp Lys 
145                 150                 155                 160 


Val Asn Val Tyr Asn Glu Ser Ile Tyr Met Leu Thr Lys Ile Gly Phe 
                165                 170                 175     


Arg Trp Ala Gly Ile Ile Gln Thr Pro Glu Glu Ala Glu Gln Asn Ala 
            180                 185                 190         


Lys Asp Met Asp Thr Met Ile Asn Ser Phe Val Ser Leu Gly Ser Ala 
        195                 200                 205             


Tyr Lys Gly Tyr Lys Lys Ala Lys Lys Ala Arg Lys Arg Val Glu Asp 
    210                 215                 220                 


Phe Leu Glu Lys Gln Ile Ile Asp Val Arg Lys Gly Lys Leu His Pro 
225                 230                 235                 240 


Glu Glu Gly Thr Ala Leu Tyr Glu Phe Ala His Trp Glu Asp Leu Asn 
                245                 250                 255     


Asp Asn Pro Met Asp Ser His Leu Cys Ala Val Asp Leu Met Asn Val 
            260                 265                 270         


Val Arg Pro Leu Ala Ala Ile Asn Arg Phe Ile Ser Tyr Gly Val Lys 
        275                 280                 285             


Val Leu Ile Glu Phe Asp Gln Glu Lys Glu Lys Leu Arg Leu Glu Asn 
    290                 295                 300                 


Asn Glu Asp Tyr Ala Tyr Lys Phe Ala Gln Glu Val Arg Arg Ile Phe 
305                 310                 315                 320 


Pro Phe Val Pro Tyr Leu Pro Gly Arg Ala Ala Val Asp Leu Glu Tyr 
                325                 330                 335     


Asp Gly Tyr Lys Ile Pro Ala Gly Met Met Thr Ala Leu Asp Val Tyr 
            340                 345                 350         


Gly Thr Thr His Asp Glu Asp Leu Trp Glu Asn Pro Asp Gln Phe Asn 
        355                 360                 365             


Pro Asn Arg Phe Asp Asn Trp Asp Gly Ser Pro Phe Asp Leu Ile Pro 
    370                 375                 380                 


Gln Gly Gly Gly Asp Phe Tyr Thr Asn His Arg Cys Ala Gly Glu Trp 
385                 390                 395                 400 


Ile Thr Val Ile Ile Met Glu Glu Thr Met Lys Tyr Phe Ala Asn Lys 
                405                 410                 415     


Ile Glu Phe Asp Val Pro Ser Gln Asp Leu Ser Val Lys Leu Asp Lys 
            420                 425                 430         


Leu Pro Gly Asn Val Thr Ser Gly Thr Ile Ile Ser Asn Val Arg Pro 
        435                 440                 445             


Arg Val Ala Arg Lys 
    450             


<210>  9
<211>  1359
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon-optimized nucleotide sequence encoding Sm46

<400>  9
ttcgtggata gcattctggt tctgcgcctg aacctgctga agacaggcat ccagctggag       60

atgaagaacg gtggcatcaa agtggcaaaa aagctgccta aagtgaaagg tctggacaac      120

accgtggaca tcatcaaggg tggctatacc tacgtgcctg gcaaactgga ggagttcgac      180

agcaaagcat tcgaagtgcg cgccctgggt ggcaagaaga tcgcagtgat gagcggcaag      240

gaagccgccg agatttttta tgataacgaa aaaatggagc gtcagggtac cctgccgaag      300

cgcatcgtga acacactgtt cggtaaaggc gccattcata ccaccgccgg caagaaacat      360

gtggatcgca aggcactgtt catgagtctg atgaccgatg aaaatttaaa ttatctgcgc      420

gaactgacac gcaactattg gtttatgaat acagaacgca tgcagagcat ggataaagtg      480

aatgtgtaca atgaaagcat ttatatgctg accaaaattg gcttccgctg ggccggtatc      540

attcagaccc ctgaagaggc cgagcagaat gccaaagaca tggacaccat gatcaacagc      600

tttgtgagcc tgggcagcgc ctacaagggt tacaaaaaag ccaagaaagc ccgcaagcgc      660

gtggaagatt ttctggagaa acaaattatc gacgttcgta aaggcaaact gcatccggag      720

gaaggtaccg ccctgtacga attcgcccat tgggaagacc tgaacgataa cccgatggac      780

agccatctgt gcgccgttga tctgatgaac gttgttcgcc cgctggcagc aattaaccgc      840

ttcattagct acggcgttaa agtgctgatc gaattcgacc aggaaaaaga aaagctgcgc      900

ctggagaaca acgaggacta cgcctacaag ttcgcacagg aagtgcgccg tatctttccg      960

ttcgtgcctt acttaccggg tcgcgccgcc gtggatctgg agtatgatgg ctataagatc     1020

ccggccggta tgatgaccgc cctggatgtt tacggtacca cacacgatga ggatctgtgg     1080

gagaatccgg atcagttcaa cccgaatcgt tttgataact gggacggcag tccgtttgat     1140

ctgattccgc agggcggtgg cgatttctac accaatcatc gttgcgccgg cgagtggatc     1200

accgtgatta ttatggaaga aacaatgaaa tactttgcca acaaaattga attcgatgtg     1260

ccgagtcagg acctgagcgt taaactggac aaactgcctg gcaacgtgac cagcggtacc     1320

atcattagca acgtgcgtcc gcgtgttgcc cgcaaataa                            1359


<210>  10
<211>  1275
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon-optimized nucleotide sequence encoding Sm46-del29

<400>  10
gcaaaaaagc tgcctaaagt gaaaggtctg gacaacaccg tggacatcat caagggtggc       60

tatacctacg tgcctggcaa actggaggag ttcgacagca aagcattcga agtgcgcgcc      120

ctgggtggca agaagatcgc agtgatgagc ggcaaggaag ccgccgagat tttttatgat      180

aacgaaaaaa tggagcgtca gggtaccctg ccgaagcgca tcgtgaacac actgttcggt      240

aaaggcgcca ttcataccac cgccggcaag aaacatgtgg atcgcaaggc actgttcatg      300

agtctgatga ccgatgaaaa tttaaattat ctgcgcgaac tgacacgcaa ctattggttt      360

atgaatacag aacgcatgca gagcatggat aaagtgaatg tgtacaatga aagcatttat      420

atgctgacca aaattggctt ccgctgggcc ggtatcattc agacccctga agaggccgag      480

cagaatgcca aagacatgga caccatgatc aacagctttg tgagcctggg cagcgcctac      540

aagggttaca aaaaagccaa gaaagcccgc aagcgcgtgg aagattttct ggagaaacaa      600

attatcgacg ttcgtaaagg caaactgcat ccggaggaag gtaccgccct gtacgaattc      660

gcccattggg aagacctgaa cgataacccg atggacagcc atctgtgcgc cgttgatctg      720

atgaacgttg ttcgcccgct ggcagcaatt aaccgcttca ttagctacgg cgttaaagtg      780

ctgatcgaat tcgaccagga aaaagaaaag ctgcgcctgg agaacaacga ggactacgcc      840

tacaagttcg cacaggaagt gcgccgtatc tttccgttcg tgccttactt accgggtcgc      900

gccgccgtgg atctggagta tgatggctat aagatcccgg ccggtatgat gaccgccctg      960

gatgtttacg gtaccacaca cgatgaggat ctgtgggaga atccggatca gttcaacccg     1020

aatcgttttg ataactggga cggcagtccg tttgatctga ttccgcaggg cggtggcgat     1080

ttctacacca atcatcgttg cgccggcgag tggatcaccg tgattattat ggaagaaaca     1140

atgaaatact ttgccaacaa aattgaattc gatgtgccga gtcaggacct gagcgttaaa     1200

ctggacaaac tgcctggcaa cgtgaccagc ggtaccatca ttagcaacgt gcgtccgcgt     1260

gttgcccgca aataa                                                      1275


<210>  11
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  11
gtccatatgg caaaaaagct gcctaaagtg                                        30


<210>  12
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  12
gtactcgagt tatttgcggg caacacgcgg                                        30


<210>  13
<211>  424
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sm46-del29

<400>  13

Ala Lys Lys Leu Pro Lys Val Lys Gly Leu Asp Asn Thr Val Asp Ile 
1               5                   10                  15      


Ile Lys Gly Gly Tyr Thr Tyr Val Pro Gly Lys Leu Glu Glu Phe Asp 
            20                  25                  30          


Ser Lys Ala Phe Glu Val Arg Ala Leu Gly Gly Lys Lys Ile Ala Val 
        35                  40                  45              


Met Ser Gly Lys Glu Ala Ala Glu Ile Phe Tyr Asp Asn Glu Lys Met 
    50                  55                  60                  


Glu Arg Gln Gly Thr Leu Pro Lys Arg Ile Val Asn Thr Leu Phe Gly 
65                  70                  75                  80  


Lys Gly Ala Ile His Thr Thr Ala Gly Lys Lys His Val Asp Arg Lys 
                85                  90                  95      


Ala Leu Phe Met Ser Leu Met Thr Asp Glu Asn Leu Asn Tyr Leu Arg 
            100                 105                 110         


Glu Leu Thr Arg Asn Tyr Trp Phe Met Asn Thr Glu Arg Met Gln Ser 
        115                 120                 125             


Met Asp Lys Val Asn Val Tyr Asn Glu Ser Ile Tyr Met Leu Thr Lys 
    130                 135                 140                 


Ile Gly Phe Arg Trp Ala Gly Ile Ile Gln Thr Pro Glu Glu Ala Glu 
145                 150                 155                 160 


Gln Asn Ala Lys Asp Met Asp Thr Met Ile Asn Ser Phe Val Ser Leu 
                165                 170                 175     


Gly Ser Ala Tyr Lys Gly Tyr Lys Lys Ala Lys Lys Ala Arg Lys Arg 
            180                 185                 190         


Val Glu Asp Phe Leu Glu Lys Gln Ile Ile Asp Val Arg Lys Gly Lys 
        195                 200                 205             


Leu His Pro Glu Glu Gly Thr Ala Leu Tyr Glu Phe Ala His Trp Glu 
    210                 215                 220                 


Asp Leu Asn Asp Asn Pro Met Asp Ser His Leu Cys Ala Val Asp Leu 
225                 230                 235                 240 


Met Asn Val Val Arg Pro Leu Ala Ala Ile Asn Arg Phe Ile Ser Tyr 
                245                 250                 255     


Gly Val Lys Val Leu Ile Glu Phe Asp Gln Glu Lys Glu Lys Leu Arg 
            260                 265                 270         


Leu Glu Asn Asn Glu Asp Tyr Ala Tyr Lys Phe Ala Gln Glu Val Arg 
        275                 280                 285             


Arg Ile Phe Pro Phe Val Pro Tyr Leu Pro Gly Arg Ala Ala Val Asp 
    290                 295                 300                 


Leu Glu Tyr Asp Gly Tyr Lys Ile Pro Ala Gly Met Met Thr Ala Leu 
305                 310                 315                 320 


Asp Val Tyr Gly Thr Thr His Asp Glu Asp Leu Trp Glu Asn Pro Asp 
                325                 330                 335     


Gln Phe Asn Pro Asn Arg Phe Asp Asn Trp Asp Gly Ser Pro Phe Asp 
            340                 345                 350         


Leu Ile Pro Gln Gly Gly Gly Asp Phe Tyr Thr Asn His Arg Cys Ala 
        355                 360                 365             


Gly Glu Trp Ile Thr Val Ile Ile Met Glu Glu Thr Met Lys Tyr Phe 
    370                 375                 380                 


Ala Asn Lys Ile Glu Phe Asp Val Pro Ser Gln Asp Leu Ser Val Lys 
385                 390                 395                 400 


Leu Asp Lys Leu Pro Gly Asn Val Thr Ser Gly Thr Ile Ile Ser Asn 
                405                 410                 415     


Val Arg Pro Arg Val Ala Arg Lys 
            420                 


