                         SEQUENCE LISTING

<110>  Pebble Labs USA, Inc.
 
<120>  SYSTEM AND METHODS FOR ENGINEERING BACTERIA FIT FOR EUKARYOTIC 
       MRNA PRODUCTION, EXPORT, AND TRANSLATION IN A EUKARYOTIC HOST

<130>  90115.00231

<150>  US 62/693,963
<151>  2018-07-04

<160>  44    

<170>  PatentIn version 3.5

<210>  1
<211>  512
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  1-IRES:crTMV

<400>  1
gaattcgtcg attcggttgc agcatttaaa gcggttgaca actttaaaag aaggaaaaag       60

aaggttgaag aaaagggtgt agtaagtaag tataagtaca gaccggagaa gtacgccggt      120

cctgattcgt ttaatttgaa agaagaaaat gcctgctgcg aagagggtga aattggatcc      180

ggcggccaaa cgagtcaaac ttgatccagc tgctaagcga gtgaagctag acggtggtgg      240

ggggtctggg ggaggtggta gcggaggtgg aggtagcaga gaccacatgg tattgcacga      300

atatgtaaat gcggctggca ttaccggagg ggggggtagc gggggcggcg gatccggtgg      360

tggaagcagc agagaccata tggttttgca cgagtatgtc aatgccgcgg gcataacgta      420

ataaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaacaccac      480

caccaccacc actgcatggt taattcctcc tg                                    512


<210>  2
<211>  858
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  1-IRES:NtHSF

<400>  2
ggcacgaggc tcccattaat atttcttctt ctgtgtaatt ccattattct gtagtagatt       60

cacgtccgag tttaaagaag agagaaaact gaaaaggcag aaaattccag agctttagat      120

ttagccaaag atagttatgg tcgtgttgtt cttggtgaag attggcaaag taggagccaa      180

tggaagaaac taagatcata atcaatcgcc ccaaaaacaa ccttgttcat tctatggttt      240

ttctcttcgg tttctatgtt tgggattggg aattcctcac tgtccttttg cttttcagtt      300

attgctcctt ctaattttcc ctagctagga tcttctcaat taatttcctt tttcattttc      360

aactaactca taattagccc aaatcttcaa aagagttttg tgtaagttga tagacgttta      420

gagaaacaga gaaatacagg ggaaaaacaa gggatgcctg ctgcgaagag ggtgaaattg      480

gatccggcgg ccaaacgagt caaacttgat ccagctgcta agcgagtgaa gctagacggt      540

ggtggggggt ctgggggagg tggtagcgga ggtggaggta gcagagacca catggtattg      600

cacgaatatg taaatgcggc tggcattacc ggaggggggg gtagcggggg cggcggatcc      660

ggtggtggaa gcagcagaga ccatatggtt ttgcacgagt atgtcaatgc cgcgggcata      720

acgtaataaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaac      780

accaccacca ccaccactgc atggttaatt cctcctgcag ataaaaaaaa tccttagctt      840

tcgctaagga tgatttct                                                    858


<210>  3
<211>  597
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  1-IRES:TuMV

<400>  3
gggaaagctt gcatgcctgc aggtcgactc tagaaaaata taaaaactca acacaacata       60

cacaaaacga ttaaagcaaa cacaatcttt caaagcattc aaagcattca agcaatcaaa      120

gattttcaaa tcttttgtcg ttatcaaagc aatcaccaac aggatccagg atccccgggt      180

ggtcagtccc ttatgcctgc tgcgaagagg gtgaaattgg atccggcggc caaacgagtc      240

aaacttgatc cagctgctaa gcgagtgaag ctagacggtg gtggggggtc tgggggaggt      300

ggtagcggag gtggaggtag cagagaccac atggtattgc acgaatatgt aaatgcggct      360

ggcattaccg gagggggggg tagcgggggc ggcggatccg gtggtggaag cagcagagac      420

catatggttt tgcacgagta tgtcaatgcc gcgggcataa cgtaataaaa aaaaaaaaaa      480

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaca ccaccaccac caccactgca      540

tggttaattc ctcctgcaga taaaaaaaat ccttagcttt cgctaaggat gatttct         597


<210>  4
<211>  554
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  1-IRES:crTEV

<400>  4
aaataacaaa tctcaacaca acatatacaa aacaaacgaa tctcaagcaa tcaagcattc       60

tacttctatt gcagcaattt aaatcatttc ttttaaagca aaagcaattt tctgaaaatt      120

ttcaccattt acgaacgata gcaatgcctg ctgcgaagag ggtgaaattg gatccggcgg      180

ccaaacgagt caaacttgat ccagctgcta agcgagtgaa gctagacggt ggtggggggt      240

ctgggggagg tggtagcgga ggtggaggta gcagagacca catggtattg cacgaatatg      300

taaatgcggc tggcattacc ggaggggggg gtagcggggg cggcggatcc ggtggtggaa      360

gcagcagaga ccatatggtt ttgcacgagt atgtcaatgc cgcgggcata acgtaataaa      420

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaac accaccacca      480

ccaccactgc atggttaatt cctcctgcag ataaaaaaaa tccttagctt tcgctaagga      540

tgatttctga tatc                                                        554


<210>  5
<211>  524
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  1-CITE:SNTV

<400>  5
aaaaaaaaaa aagtaaagac aggaaacttt actgactaac atgcctgctg cgaagagggt       60

gaaattggat ccggcggcca aacgagtcaa acttgatcca gctgctaagc gagtgaagct      120

agacggtggt ggggggtctg ggggaggtgg tagcggaggt ggaggtagca gagaccacat      180

ggtattgcac gaatatgtaa atgcggctgg cattaccgga ggggggggta gcgggggcgg      240

cggatccggt ggtggaagca gcagagacca tatggttttg cacgagtatg tcaatgccgc      300

gggcataacg taataaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa      360

aaaaaatccc agaggttcac aatgttagtg atggggcgct gaaagatgcg tagctaccct      420

tctggagcca cttcctggtg gtaagcagaa atccaagggt acggtggtac ggtggaaagc      480

agtccccacc accaccacca ccactgcatg gttaattcct cctg                       524


<210>  6
<211>  550
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  2-IRES:crTMV (hairpin)

<400>  6
caggaggaat taaccatgca gtggtggtgg tggtggtgga attcgtcgat tcggttgcag       60

catttaaagc ggttgacaac tttaaaagaa ggaaaaagaa ggttgaagaa aagggtgtag      120

taagtaagta taagtacaga ccggagaagt acgccggtcc tgattcgttt aatttgaaag      180

aagaaaatgc ctgctgcgaa gagggtgaaa ttggatccgg cggccaaacg agtcaaactt      240

gatccagctg ctaagcgagt gaagctagac ggtggtgggg ggtctggggg aggtggtagc      300

ggaggtggag gtagcagaga ccacatggta ttgcacgaat atgtaaatgc ggctggcatt      360

accggagggg ggggtagcgg gggcggcgga tccggtggtg gaagcagcag agaccatatg      420

gttttgcacg agtatgtcaa tgccgcgggc ataacgtaat aaaaaaaaaa aaaaaaaaaa      480

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aacaccacca ccaccaccac tgcatggtta      540

attcctcctg                                                             550


<210>  7
<211>  896
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  2-IRES:NtHSF (hairpin)

<400>  7
caggaggaat taaccatgca gtggtggtgg tggtggtggg cacgaggctc ccattaatat       60

ttcttcttct gtgtaattcc attattctgt agtagattca cgtccgagtt taaagaagag      120

agaaaactga aaaggcagaa aattccagag ctttagattt agccaaagat agttatggtc      180

gtgttgttct tggtgaagat tggcaaagta ggagccaatg gaagaaacta agatcataat      240

caatcgcccc aaaaacaacc ttgttcattc tatggttttt ctcttcggtt tctatgtttg      300

ggattgggaa ttcctcactg tccttttgct tttcagttat tgctccttct aattttccct      360

agctaggatc ttctcaatta atttcctttt tcattttcaa ctaactcata attagcccaa      420

atcttcaaaa gagttttgtg taagttgata gacgtttaga gaaacagaga aatacagggg      480

aaaaacaagg gatgcctgct gcgaagaggg tgaaattgga tccggcggcc aaacgagtca      540

aacttgatcc agctgctaag cgagtgaagc tagacggtgg tggggggtct gggggaggtg      600

gtagcggagg tggaggtagc agagaccaca tggtattgca cgaatatgta aatgcggctg      660

gcattaccgg aggggggggt agcgggggcg gcggatccgg tggtggaagc agcagagacc      720

atatggtttt gcacgagtat gtcaatgccg cgggcataac gtaataaaaa aaaaaaaaaa      780

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaacac caccaccacc accactgcat      840

ggttaattcc tcctgcagat aaaaaaaatc cttagctttc gctaaggatg atttct          896


<210>  8
<211>  592
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  2-IRES:crTEV (hairpin)

<400>  8
caggaggaat taaccatgca gtggtggtgg tggtggtgaa ataacaaatc tcaacacaac       60

atatacaaaa caaacgaatc tcaagcaatc aagcattcta cttctattgc agcaatttaa      120

atcatttctt ttaaagcaaa agcaattttc tgaaaatttt caccatttac gaacgatagc      180

aatgcctgct gcgaagaggg tgaaattgga tccggcggcc aaacgagtca aacttgatcc      240

agctgctaag cgagtgaagc tagacggtgg tggggggtct gggggaggtg gtagcggagg      300

tggaggtagc agagaccaca tggtattgca cgaatatgta aatgcggctg gcattaccgg      360

aggggggggt agcgggggcg gcggatccgg tggtggaagc agcagagacc atatggtttt      420

gcacgagtat gtcaatgccg cgggcataac gtaataaaaa aaaaaaaaaa aaaaaaaaaa      480

aaaaaaaaaa aaaaaaaaaa aaaaaaacac caccaccacc accactgcat ggttaattcc      540

tcctgcagat aaaaaaaatc cttagctttc gctaaggatg atttctgata tc              592


<210>  9
<211>  635
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  2-IRES:TuMV (hairpin)

<400>  9
caggaggaat taaccatgca gtggtggtgg tggtggtggg gaaagcttgc atgcctgcag       60

gtcgactcta gaaaaatata aaaactcaac acaacataca caaaacgatt aaagcaaaca      120

caatctttca aagcattcaa agcattcaag caatcaaaga ttttcaaatc ttttgtcgtt      180

atcaaagcaa tcaccaacag gatccaggat ccccgggtgg tcagtccctt atgcctgctg      240

cgaagagggt gaaattggat ccggcggcca aacgagtcaa acttgatcca gctgctaagc      300

gagtgaagct agacggtggt ggggggtctg ggggaggtgg tagcggaggt ggaggtagca      360

gagaccacat ggtattgcac gaatatgtaa atgcggctgg cattaccgga ggggggggta      420

gcgggggcgg cggatccggt ggtggaagca gcagagacca tatggttttg cacgagtatg      480

tcaatgccgc gggcataacg taataaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa      540

aaaaaaaaaa aaaaaacacc accaccacca ccactgcatg gttaattcct cctgcagata      600

aaaaaaatcc ttagctttcg ctaaggatga tttct                                 635


<210>  10
<211>  562
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  2-CITE:SNTV (hairpin)

<400>  10
caggaggaat taaccatgca gtggtggtgg tggtggtgaa aaaaaaaaaa gtaaagacag       60

gaaactttac tgactaacat gcctgctgcg aagagggtga aattggatcc ggcggccaaa      120

cgagtcaaac ttgatccagc tgctaagcga gtgaagctag acggtggtgg ggggtctggg      180

ggaggtggta gcggaggtgg aggtagcaga gaccacatgg tattgcacga atatgtaaat      240

gcggctggca ttaccggagg ggggggtagc gggggcggcg gatccggtgg tggaagcagc      300

agagaccata tggttttgca cgagtatgtc aatgccgcgg gcataacgta ataaaaaaaa      360

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaatcccag aggttcacaa      420

tgttagtgat ggggcgctga aagatgcgta gctacccttc tggagccact tcctggtggt      480

aagcagaaat ccaagggtac ggtggtacgg tggaaagcag tccccaccac caccaccacc      540

actgcatggt taattcctcc tg                                               562


<210>  11
<211>  21
<212>  PRT
<213>  E. Coli

<400>  11

Met Lys Lys Thr Ala Ile Ala Ile Ala Val Ala Leu Ala Gly Phe Ala 
1               5                   10                  15      


Thr Val Ala Gln Ala 
            20      


<210>  12
<211>  22
<212>  PRT
<213>  Erwinia carotovora

<400>  12

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
1               5                   10                  15      


Ala Gln Pro Ala Met Ala 
            20          


<210>  13
<211>  23
<212>  PRT
<213>  bacteria

<400>  13

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe Ser 
1               5                   10                  15      


Ile Ala Thr Asn Ala Tyr Ala 
            20              


<210>  14
<211>  28
<212>  PRT
<213>  Bacillus sp.

<400>  14

Met Phe Lys Phe Lys Lys Lys Phe Leu Val Gly Leu Thr Ala Ala Phe 
1               5                   10                  15      


Met Ser Ile Ser Met Phe Ser Ala Thr Ala Ser Ala 
            20                  25              


<210>  15
<211>  21
<212>  PRT
<213>  E. Coli

<400>  15

Met Lys Gln Ser Thr Ile Ala Leu Ala Leu Leu Pro Leu Leu Phe Thr 
1               5                   10                  15      


Pro Val Thr Lys Ala 
            20      


<210>  16
<211>  22
<212>  PRT
<213>  E. Coli

<400>  16

Met Met Lys Arg Asn Ile Leu Ala Val Ile Val Pro Ala Leu Leu Val 
1               5                   10                  15      


Ala Gly Thr Ala Asn Ala 
            20          


<210>  17
<211>  21
<212>  PRT
<213>  E. Coli

<400>  17

Met Lys Lys Ser Thr Leu Ala Leu Val Val Met Gly Ile Val Ala Ser 
1               5                   10                  15      


Ala Ser Val Gln Ala 
            20      


<210>  18
<211>  26
<212>  PRT
<213>  E. Coli

<400>  18

Met Lys Ile Lys Thr Gly Ala Arg Ile Leu Ala Leu Ser Ala Leu Thr 
1               5                   10                  15      


Thr Met Met Phe Ser Ala Ser Ala Leu Ala 
            20                  25      


<210>  19
<211>  21
<212>  PRT
<213>  E. Coli

<400>  19

Met Lys Val Lys Val Leu Ser Leu Leu Val Pro Ala Leu Leu Val Ala 
1               5                   10                  15      


Gly Ala Ala Asn Ala 
            20      


<210>  20
<211>  20
<212>  PRT
<213>  E. Coli

<400>  20

Met Lys Ala Thr Lys Leu Val Leu Gly Ala Val Ile Leu Gly Ser Thr 
1               5                   10                  15      


Leu Leu Ala Gly 
            20  


<210>  21
<211>  25
<212>  PRT
<213>  E. Coli

<400>  21

Met Met Ile Thr Leu Arg Lys Leu Pro Leu Ala Val Ala Val Ala Ala 
1               5                   10                  15      


Gly Val Met Ser Ala Gln Ala Met Ala 
            20                  25  


<210>  22
<211>  20
<212>  PRT
<213>  E. Coli

<400>  22

Met Arg Ala Lys Leu Leu Gly Ile Val Leu Thr Thr Pro Ile Ala Ile 
1               5                   10                  15      


Ser Ser Phe Ala 
            20  


<210>  23
<211>  25
<212>  PRT
<213>  E. Coli

<400>  23

Ser Lys Gln His Tyr Gly Ile Arg Lys Tyr Lys Val Gly Val Cys Ser 
1               5                   10                  15      


Ala Leu Ile Ala Leu Ser Ile Leu Gly 
            20                  25  


<210>  24
<211>  1065
<212>  DNA
<213>  Arabidopsis thaliana

<400>  24
gaccacgtgt acaagggtca gctgcaagcg tatgcgctgc agcacaacct ggagctgccg       60

gtttacgcga acgagcgtga aggtccgccg cacgcgccgc gtttccgttg caacgtgacc      120

ttctgcggcc agacctttca aagcagcgag ttctttccga ccctgaaaag cgcggaacac      180

gcggcggcga agatcgcggt ggcgagcctg accccgcaaa gcccggaagg tatcgatgtt      240

gcgtacaaaa acctgctgca ggagattgcg caaaaggaaa gcagcctgct gccgttctat      300

gcgaccgcga ccagcggtcc gagccacgcg ccgaccttta ccagcaccgt ggagttcgcg      360

ggtaaagttt ttagcggcga ggaagcgaag accaagaaac tggcggaaat gagcgcggcg      420

aaagttgcgt tcatgagcat taagaacggt aacagcaacc agaccggtag cccgaccctg      480

ccgagcgagc gtcaagaaga cgtgaacagc aacgttaaaa gcagcccgca ggagatccac      540

agccaaccga gcagcaaggt ggttatgacc ccggacaccc cgagcaaagg tattaaggtt      600

aacgaggatg aatttccgga cctgcacgat gcgccggcga gcaacgcgaa agaaatcaac      660

gtggcgctga acgagccgga aaacccgacc aacgacggta ccctgagcgc gctgaccacc      720

gatggcatga agatgaacat cgcgagcagc agcctgccga ttccgcacaa cccgaccaac      780

gttattaccc tgaacgcgcc ggcggcgaac ggtatcaagc gtaacattgc ggcgtgcagc      840

agctggatgc cgcagaaccc gaccaacgac ggcagcgaga ccagcagctg cgtggttgat      900

gagagcgaaa agaaaaagct gatcatgggt accggtcacc tgagcattcc gaccggtcag      960

cacgtggttt gccgtccgtg gaacccggag atcaccctgc cgcaagatgc ggaaatgctg     1020

ttccgtgacg ataaatttat tgcgtatcgt ctggtgaagc cgtaa                     1065


<210>  25
<211>  355
<212>  PRT
<213>  Arabidopsis thaliana

<400>  25

Met Asp His Val Tyr Lys Gly Gln Leu Gln Ala Tyr Ala Leu Gln His 
1               5                   10                  15      


Asn Leu Glu Leu Pro Val Tyr Ala Asn Glu Arg Glu Gly Pro Pro His 
            20                  25                  30          


Ala Pro Arg Phe Arg Cys Asn Val Thr Phe Cys Gly Gln Thr Phe Gln 
        35                  40                  45              


Ser Ser Glu Phe Phe Pro Thr Leu Lys Ser Ala Glu His Ala Ala Ala 
    50                  55                  60                  


Lys Ile Ala Val Ala Ser Leu Thr Pro Gln Ser Pro Glu Gly Ile Asp 
65                  70                  75                  80  


Val Ala Tyr Lys Asn Leu Leu Gln Glu Ile Ala Gln Lys Glu Ser Ser 
                85                  90                  95      


Leu Leu Pro Phe Tyr Ala Thr Ala Thr Ser Gly Pro Ser His Ala Pro 
            100                 105                 110         


Thr Phe Thr Ser Thr Val Glu Phe Ala Gly Lys Val Phe Ser Gly Glu 
        115                 120                 125             


Glu Ala Lys Thr Lys Lys Leu Ala Glu Met Ser Ala Ala Lys Val Ala 
    130                 135                 140                 


Phe Met Ser Ile Lys Asn Gly Asn Ser Asn Gln Thr Gly Ser Pro Thr 
145                 150                 155                 160 


Leu Pro Ser Glu Arg Gln Glu Asp Val Asn Ser Asn Val Lys Ser Ser 
                165                 170                 175     


Pro Gln Glu Ile His Ser Gln Pro Ser Ser Lys Val Val Met Thr Pro 
            180                 185                 190         


Asp Thr Pro Ser Lys Gly Ile Lys Val Asn Glu Asp Glu Phe Pro Asp 
        195                 200                 205             


Leu His Asp Ala Pro Ala Ser Asn Ala Lys Glu Ile Asn Val Ala Leu 
    210                 215                 220                 


Asn Glu Pro Glu Asn Pro Thr Asn Asp Gly Thr Leu Ser Ala Leu Thr 
225                 230                 235                 240 


Thr Asp Gly Met Lys Met Asn Ile Ala Ser Ser Ser Leu Pro Ile Pro 
                245                 250                 255     


His Asn Pro Thr Asn Val Ile Thr Leu Asn Ala Pro Ala Ala Asn Gly 
            260                 265                 270         


Ile Lys Arg Asn Ile Ala Ala Cys Ser Ser Trp Met Pro Gln Asn Pro 
        275                 280                 285             


Thr Asn Asp Gly Ser Glu Thr Ser Ser Cys Val Val Asp Glu Ser Glu 
    290                 295                 300                 


Lys Lys Lys Leu Ile Met Gly Thr Gly His Leu Ser Ile Pro Thr Gly 
305                 310                 315                 320 


Gln His Val Val Cys Arg Pro Trp Asn Pro Glu Ile Thr Leu Pro Gln 
                325                 330                 335     


Asp Ala Glu Met Leu Phe Arg Asp Asp Lys Phe Ile Ala Tyr Arg Leu 
            340                 345                 350         


Val Lys Pro 
        355 


<210>  26
<211>  739
<212>  DNA
<213>  Arabidopsis thaliana

<400>  26
aagcaagaaa cactgcagcg agctgctgcc gaacaagatg ttccgtaacc aggacagcaa       60

gtacctgatc ccggttcaaa aagaagcgcc gccggtgacc accctgccga tgaaggcgag      120

caccgttaaa agcccgcaca actgcgaggc gatcctgcgt gacgcggatc cgccgattag      180

cctgagcagc gttaacctga gcgaacagct gcgtagcggc gtgttcctga agccgaagaa      240

acaaatcaaa tactgggttg atgagcgtaa cagcaactgc ttcatgctgt ttgcgaagaa      300

cctgagcatt acctggagcg acgatgtgaa ctattggacc tggtttaccg agaaagaaag      360

cccgaacgag aacgttgaag cggtgggtct gaagaacgtg tgctggctgg acatcaccgg      420

caaattcgat acccgtaacc tgaccccggg tattgtttac gaggtggttt ttaaggtgaa      480

actggaagac ccggcgtatg gctgggatac cccggttaac ctgaaactgg tgctgccgaa      540

cggcaaggag aaaccgcagg aaaagaaagt tagcctgcgt gaactgccgc gttacaagtg      600

ggtggacgtt cgtgtgggcg agttcgtgcc ggagaagagc gcggcgggcg agatcacctt      660

tagcatgtat gaacacgcgg cgggtgtttg gaagaaaggc ctgagcctga agggtgtggc      720

gattcgtccg aaacaataa                                                   739


<210>  27
<211>  246
<212>  PRT
<213>  Arabidopsis thaliana

<400>  27

Met Ser Lys Lys His Cys Ser Glu Leu Leu Pro Asn Lys Met Phe Arg 
1               5                   10                  15      


Asn Gln Asp Ser Lys Tyr Leu Ile Pro Val Gln Lys Glu Ala Pro Pro 
            20                  25                  30          


Val Thr Thr Leu Pro Met Lys Ala Ser Thr Val Lys Ser Pro His Asn 
        35                  40                  45              


Cys Glu Ala Ile Leu Arg Asp Ala Asp Pro Pro Ile Ser Leu Ser Ser 
    50                  55                  60                  


Val Asn Leu Ser Glu Gln Leu Arg Ser Gly Val Phe Leu Lys Pro Lys 
65                  70                  75                  80  


Lys Gln Ile Lys Tyr Trp Val Asp Glu Arg Asn Ser Asn Cys Phe Met 
                85                  90                  95      


Leu Phe Ala Lys Asn Leu Ser Ile Thr Trp Ser Asp Asp Val Asn Tyr 
            100                 105                 110         


Trp Thr Trp Phe Thr Glu Lys Glu Ser Pro Asn Glu Asn Val Glu Ala 
        115                 120                 125             


Val Gly Leu Lys Asn Val Cys Trp Leu Asp Ile Thr Gly Lys Phe Asp 
    130                 135                 140                 


Thr Arg Asn Leu Thr Pro Gly Ile Val Tyr Glu Val Val Phe Lys Val 
145                 150                 155                 160 


Lys Leu Glu Asp Pro Ala Tyr Gly Trp Asp Thr Pro Val Asn Leu Lys 
                165                 170                 175     


Leu Val Leu Pro Asn Gly Lys Glu Lys Pro Gln Glu Lys Lys Val Ser 
            180                 185                 190         


Leu Arg Glu Leu Pro Arg Tyr Lys Trp Val Asp Val Arg Val Gly Glu 
        195                 200                 205             


Phe Val Pro Glu Lys Ser Ala Ala Gly Glu Ile Thr Phe Ser Met Tyr 
    210                 215                 220                 


Glu His Ala Ala Gly Val Trp Lys Lys Gly Leu Ser Leu Lys Gly Val 
225                 230                 235                 240 


Ala Ile Arg Pro Lys Gln 
                245     


<210>  28
<211>  14
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Kozak (Shine Delgarno) Sequence

<400>  28
aggaggtacc cacc                                                         14


<210>  29
<211>  2781
<212>  DNA
<213>  Streptococcus thermophilus

<400>  29
atgaaacata ttaatgatta tttttgggct aagaaaacag aggaaaatag tagacttctt       60

tggttaccat taactcaaca cttagaagac acgaaaaata ttgcaggcct cttatgggaa      120

cattggttaa gtgaaggaca aaaggtatta attgaaaatt ctattaatgt taaatcaaat      180

attgaaaacc aagggaaaag attggcacaa ttcctaggag ctgttcatga tatcggtaaa      240

gcaacaccag cttttcagac gcaaaaaggt tatgcaaatt cagtagattt ggatattcaa      300

ttgttagaaa aattggaacg cgcaggtttt tctggcatta gttctctcca actagcctcc      360

cccaaaaaga gtcatcatag cattgcaggt caatatttgt tatcccatta tggcgtggac      420

gaagatattg caacaattat tggtggacac catggacgac cagttgatga tttagacggt      480

ttaaattctc aaaaaagcta tccctccaat tattaccagg atgaaaagaa agatagtctc      540

gtttatcaga aatggaagtc aaatcaagaa gcttttttaa actgggcttt aacagaaaca      600

gggtttaatt ctgtgtctca gcttccaaaa atcaaacagc ctgctcaagt tattctatca      660

ggtttactca taatgtctga ctggattgct agtaatgagc atttttttcc tttgttaagt      720

ttggatgaaa ctgatgtgaa aaacaagagt caacgtattg aaactgggtt taaaaagtgg      780

aaaaaatcta acttgtggca acctgaaact ttcgttgacc ttgttactct ttatcaggaa      840

agatttggat ttagtccacg aaattttcag ctgatactct cacaaacaat cgaaaagacg      900

actaatcctg ggatagtgat actggaagcg ccaatgggaa tcgggaaaac agaggcggct      960

ctagcggtat cagagcagtt atctagtaaa aaaggatgta gtggattgtt ttttggattg     1020

cccacacaag caacctccaa tggaattttt aagaggattg aacagtggac agagaatata     1080

aagggtaaca attctgatca tttttccatt cagctggttc atggaaaagc agccttaaat     1140

acggatttta ttgagttact taaaggaaat acaattaata tggacgactc ggaaaacggc     1200

agtatttttg tcaatgagtg gttttctggg agaaaaactt cagcattaga tgattttgta     1260

gttgggacgg tcgaccaatt tttaatggtg gctttaaaac aaaaacattt ggccttacgt     1320

catttaggat ttagtaaaaa agttatcgtt attgatgaag tccacgctta tgatgcttat     1380

atgagccaat atttgttgga agctatcaga tggatgggag cttatggtgt tcctgtaatt     1440

attttatcag caactttacc tgcccaacaa agagaaaaac tcataaaaag ctatatggct     1500

ggaatgggag tgaaatggcg agatattgaa aatatagatc agataaaaat agacgcatac     1560

cctttaatca cttataatga cgggcctgac attcatcaag ttaaaatgtt cgaaaagcaa     1620

gaacaaaaaa atatctacat tcatcgttta ccagaagaac agttatttga tattgtaaaa     1680

gaaggtcttg acaatggtgg agtagttggg ataattgtca atacggtgag aaaatctcaa     1740

gaattggcaa gaaatttttc agatattttt ggagatgata tggtagattt gcttcattct     1800

aatttcatag caactgaaag aatccgaaaa gaaaaggatt tattgcaaga aattgggaaa     1860

aaagcaatac gtccaccaaa gaaaatcatt attggtacac aggtgcttga acagtcgtta     1920

gatattgatt ttgatgtact gataagcgac ttagcgccta tggatttact cattcaacgt     1980

atcggacgac tacatcgtca caaaatcaaa aggccccaaa agcacgaagt agcaagattt     2040

tatgttttag gaacatttga agagtttgat tttgatgaag gaacgcgttt ggtttatggg     2100

gactacctat tagctagaac tcagtacttt ttaccagata aaatacgact tcctgatgat     2160

atttcaccgc tagtccaaaa ggtttataat tcagacctaa caattacgtt tccaaagcca     2220

gaacttcata aaaaatattt ggatgctaaa atagaacatg atgataagat taaaaataaa     2280

gaaacaaagg caaagtcata ccgtattgct aatcctgtct taaaaaaatc gagagttcga     2340

actaacagtt tgattggttg gttaaagaac ctccatccaa atgatagtga agaaaaagca     2400

tatgctcaag ttcgagatat tgaagataca gttgaagtga ttgcattaaa aaaaatatct     2460

gatgggtatg gtttgttcat agaaaataaa gatatatctc agaacattac tgatcctata     2520

attgcaaaaa aggtagcaca aaatacttta cgacttccga tgagtttatc caaagcctat     2580

aatattgatc aaacgattaa tgagcttgaa agatataaca atagccactt aagtcaatgg     2640

caaaactcat catggttaaa gggatctctt gggattattt ttgataaaaa caatgagttt     2700

atactgaatg gatttaaact attatatgat gaaaaatatg gtgttaccat agaaaggttg     2760

gataagaatg agtcggttta a                                               2781


<210>  30
<211>  926
<212>  PRT
<213>  Streptococcus thermophilus

<400>  30

Met Lys His Ile Asn Asp Tyr Phe Trp Ala Lys Lys Thr Glu Glu Asn 
1               5                   10                  15      


Ser Arg Leu Leu Trp Leu Pro Leu Thr Gln His Leu Glu Asp Thr Lys 
            20                  25                  30          


Asn Ile Ala Gly Leu Leu Trp Glu His Trp Leu Ser Glu Gly Gln Lys 
        35                  40                  45              


Val Leu Ile Glu Asn Ser Ile Asn Val Lys Ser Asn Ile Glu Asn Gln 
    50                  55                  60                  


Gly Lys Arg Leu Ala Gln Phe Leu Gly Ala Val His Asp Ile Gly Lys 
65                  70                  75                  80  


Ala Thr Pro Ala Phe Gln Thr Gln Lys Gly Tyr Ala Asn Ser Val Asp 
                85                  90                  95      


Leu Asp Ile Gln Leu Leu Glu Lys Leu Glu Arg Ala Gly Phe Ser Gly 
            100                 105                 110         


Ile Ser Ser Leu Gln Leu Ala Ser Pro Lys Lys Ser His His Ser Ile 
        115                 120                 125             


Ala Gly Gln Tyr Leu Leu Ser His Tyr Gly Val Asp Glu Asp Ile Ala 
    130                 135                 140                 


Thr Ile Ile Gly Gly His His Gly Arg Pro Val Asp Asp Leu Asp Gly 
145                 150                 155                 160 


Leu Asn Ser Gln Lys Ser Tyr Pro Ser Asn Tyr Tyr Gln Asp Glu Lys 
                165                 170                 175     


Lys Asp Ser Leu Val Tyr Gln Lys Trp Lys Ser Asn Gln Glu Ala Phe 
            180                 185                 190         


Leu Asn Trp Ala Leu Thr Glu Thr Gly Phe Asn Ser Val Ser Gln Leu 
        195                 200                 205             


Pro Lys Ile Lys Gln Pro Ala Gln Val Ile Leu Ser Gly Leu Leu Ile 
    210                 215                 220                 


Met Ser Asp Trp Ile Ala Ser Asn Glu His Phe Phe Pro Leu Leu Ser 
225                 230                 235                 240 


Leu Asp Glu Thr Asp Val Lys Asn Lys Ser Gln Arg Ile Glu Thr Gly 
                245                 250                 255     


Phe Lys Lys Trp Lys Lys Ser Asn Leu Trp Gln Pro Glu Thr Phe Val 
            260                 265                 270         


Asp Leu Val Thr Leu Tyr Gln Glu Arg Phe Gly Phe Ser Pro Arg Asn 
        275                 280                 285             


Phe Gln Leu Ile Leu Ser Gln Thr Ile Glu Lys Thr Thr Asn Pro Gly 
    290                 295                 300                 


Ile Val Ile Leu Glu Ala Pro Met Gly Ile Gly Lys Thr Glu Ala Ala 
305                 310                 315                 320 


Leu Ala Val Ser Glu Gln Leu Ser Ser Lys Lys Gly Cys Ser Gly Leu 
                325                 330                 335     


Phe Phe Gly Leu Pro Thr Gln Ala Thr Ser Asn Gly Ile Phe Lys Arg 
            340                 345                 350         


Ile Glu Gln Trp Thr Glu Asn Ile Lys Gly Asn Asn Ser Asp His Phe 
        355                 360                 365             


Ser Ile Gln Leu Val His Gly Lys Ala Ala Leu Asn Thr Asp Phe Ile 
    370                 375                 380                 


Glu Leu Leu Lys Gly Asn Thr Ile Asn Met Asp Asp Ser Glu Asn Gly 
385                 390                 395                 400 


Ser Ile Phe Val Asn Glu Trp Phe Ser Gly Arg Lys Thr Ser Ala Leu 
                405                 410                 415     


Asp Asp Phe Val Val Gly Thr Val Asp Gln Phe Leu Met Val Ala Leu 
            420                 425                 430         


Lys Gln Lys His Leu Ala Leu Arg His Leu Gly Phe Ser Lys Lys Val 
        435                 440                 445             


Ile Val Ile Asp Glu Val His Ala Tyr Asp Ala Tyr Met Ser Gln Tyr 
    450                 455                 460                 


Leu Leu Glu Ala Ile Arg Trp Met Gly Ala Tyr Gly Val Pro Val Ile 
465                 470                 475                 480 


Ile Leu Ser Ala Thr Leu Pro Ala Gln Gln Arg Glu Lys Leu Ile Lys 
                485                 490                 495     


Ser Tyr Met Ala Gly Met Gly Val Lys Trp Arg Asp Ile Glu Asn Ile 
            500                 505                 510         


Asp Gln Ile Lys Ile Asp Ala Tyr Pro Leu Ile Thr Tyr Asn Asp Gly 
        515                 520                 525             


Pro Asp Ile His Gln Val Lys Met Phe Glu Lys Gln Glu Gln Lys Asn 
    530                 535                 540                 


Ile Tyr Ile His Arg Leu Pro Glu Glu Gln Leu Phe Asp Ile Val Lys 
545                 550                 555                 560 


Glu Gly Leu Asp Asn Gly Gly Val Val Gly Ile Ile Val Asn Thr Val 
                565                 570                 575     


Arg Lys Ser Gln Glu Leu Ala Arg Asn Phe Ser Asp Ile Phe Gly Asp 
            580                 585                 590         


Asp Met Val Asp Leu Leu His Ser Asn Phe Ile Ala Thr Glu Arg Ile 
        595                 600                 605             


Arg Lys Glu Lys Asp Leu Leu Gln Glu Ile Gly Lys Lys Ala Ile Arg 
    610                 615                 620                 


Pro Pro Lys Lys Ile Ile Ile Gly Thr Gln Val Leu Glu Gln Ser Leu 
625                 630                 635                 640 


Asp Ile Asp Phe Asp Val Leu Ile Ser Asp Leu Ala Pro Met Asp Leu 
                645                 650                 655     


Leu Ile Gln Arg Ile Gly Arg Leu His Arg His Lys Ile Lys Arg Pro 
            660                 665                 670         


Gln Lys His Glu Val Ala Arg Phe Tyr Val Leu Gly Thr Phe Glu Glu 
        675                 680                 685             


Phe Asp Phe Asp Glu Gly Thr Arg Leu Val Tyr Gly Asp Tyr Leu Leu 
    690                 695                 700                 


Ala Arg Thr Gln Tyr Phe Leu Pro Asp Lys Ile Arg Leu Pro Asp Asp 
705                 710                 715                 720 


Ile Ser Pro Leu Val Gln Lys Val Tyr Asn Ser Asp Leu Thr Ile Thr 
                725                 730                 735     


Phe Pro Lys Pro Glu Leu His Lys Lys Tyr Leu Asp Ala Lys Ile Glu 
            740                 745                 750         


His Asp Asp Lys Ile Lys Asn Lys Glu Thr Lys Ala Lys Ser Tyr Arg 
        755                 760                 765             


Ile Ala Asn Pro Val Leu Lys Lys Ser Arg Val Arg Thr Asn Ser Leu 
    770                 775                 780                 


Ile Gly Trp Leu Lys Asn Leu His Pro Asn Asp Ser Glu Glu Lys Ala 
785                 790                 795                 800 


Tyr Ala Gln Val Arg Asp Ile Glu Asp Thr Val Glu Val Ile Ala Leu 
                805                 810                 815     


Lys Lys Ile Ser Asp Gly Tyr Gly Leu Phe Ile Glu Asn Lys Asp Ile 
            820                 825                 830         


Ser Gln Asn Ile Thr Asp Pro Ile Ile Ala Lys Lys Val Ala Gln Asn 
        835                 840                 845             


Thr Leu Arg Leu Pro Met Ser Leu Ser Lys Ala Tyr Asn Ile Asp Gln 
    850                 855                 860                 


Thr Ile Asn Glu Leu Glu Arg Tyr Asn Asn Ser His Leu Ser Gln Trp 
865                 870                 875                 880 


Gln Asn Ser Ser Trp Leu Lys Gly Ser Leu Gly Ile Ile Phe Asp Lys 
                885                 890                 895     


Asn Asn Glu Phe Ile Leu Asn Gly Phe Lys Leu Leu Tyr Asp Glu Lys 
            900                 905                 910         


Tyr Gly Val Thr Ile Glu Arg Leu Asp Lys Asn Glu Ser Val 
        915                 920                 925     


<210>  31
<211>  4107
<212>  DNA
<213>  Streptococcus

<400>  31
atggataaga aatactcaat aggcttagat atcggcacaa atagcgtcgg atgggcggtg       60

atcactgatg aatataaggt tccgtctaaa aagttcaagg ttctgggaaa tacagaccgc      120

cacagtatca aaaaaaatct tataggggct cttttatttg acagtggaga gacagcggaa      180

gcgactcgtc tcaaacggac agctcgtaga aggtatacac gtcggaagaa tcgtatttgt      240

tatctacagg agattttttc aaatgagatg gcgaaagtag atgatagttt ctttcatcga      300

cttgaagagt cttttttggt ggaagaagac aagaagcatg aacgtcatcc tatttttgga      360

aatatagtag atgaagttgc ttatcatgag aaatatccaa ctatctatca tctgcgaaaa      420

aaattggtag attctactga taaagcggat ttgcgcttaa tctatttggc cttagcgcat      480

atgattaagt ttcgtggtca ttttttgatt gagggagatt taaatcctga taatagtgat      540

gtggacaaac tatttatcca gttggtacaa acctacaatc aattatttga agaaaaccct      600

attaacgcaa gtggagtaga tgctaaagcg attctttctg cacgattgag taaatcaaga      660

cgattagaaa atctcattgc tcagctcccc ggtgagaaga aaaatggctt atttgggaat      720

ctcattgctt tgtcattggg tttgacccct aattttaaat caaattttga tttggcagaa      780

gatgctaaat tacagctttc aaaagatact tacgatgatg atttagataa tttattggcg      840

caaattggag atcaatatgc tgatttgttt ttggcagcta agaatttatc agatgctatt      900

ttactttcag atatcctaag agtaaatact gaaataacta aggctcccct atcagcttca      960

atgattaaac gctacgatga acatcatcaa gacttgactc ttttaaaagc tttagttcga     1020

caacaacttc cagaaaagta taaagaaatc ttttttgatc aatcaaaaaa cggatatgca     1080

ggttatattg atgggggagc tagccaagaa gaattttata aatttatcaa accaatttta     1140

gaaaaaatgg atggtactga ggaattattg gtgaaactaa atcgtgaaga tttgctgcgc     1200

aagcaacgga cctttgacaa cggctctatt ccccatcaaa ttcacttggg tgagctgcat     1260

gctattttga gaagacaaga agacttttat ccatttttaa aagacaatcg tgagaagatt     1320

gaaaaaatct tgacttttcg aattccttat tatgttggtc cattggcgcg tggcaatagt     1380

cgttttgcat ggatgactcg gaagtctgaa gaaacaatta ccccatggaa ttttgaagaa     1440

gttgtcgata aaggtgcttc agctcaatca tttattgaac gcatgacaaa ctttgataaa     1500

aatcttccaa atgaaaaagt actaccaaaa catagtttgc tttatgagta ttttacggtt     1560

tataacgaat tgacaaaggt caaatatgtt actgaaggaa tgcgaaaacc agcatttctt     1620

tcaggtgaac agaagaaagc cattgttgat ttactcttca aaacaaatcg aaaagtaacc     1680

gttaagcaat taaaagaaga ttatttcaaa aaaatagaat gttttgatag tgttgaaatt     1740

tcaggagttg aagatagatt taatgcttca ttaggtacct accatgattt gctaaaaatt     1800

attaaagata aagatttttt ggataatgaa gaaaatgaag atatcttaga ggatattgtt     1860

ttaacattga ccttatttga agatagggag atgattgagg aaagacttaa aacatatgct     1920

cacctctttg atgataaggt gatgaaacag cttaaacgtc gccgttatac tggttgggga     1980

cgtttgtctc gaaaattgat taatggtatt agggataagc aatctggcaa aacaatatta     2040

gattttttga aatcagatgg ttttgccaat cgcaatttta tgcagctgat ccatgatgat     2100

agtttgacat ttaaagaaga cattcaaaaa gcacaagtgt ctggacaagg cgatagttta     2160

catgaacata ttgcaaattt agctggtagc cctgctatta aaaaaggtat tttacagact     2220

gtaaaagttg ttgatgaatt ggtcaaagta atggggcggc ataagccaga aaatatcgtt     2280

attgaaatgg cacgtgaaaa tcagacaact caaaagggcc agaaaaattc gcgagagcgt     2340

atgaaacgaa tcgaagaagg tatcaaagaa ttaggaagtc agattcttaa agagcatcct     2400

gttgaaaata ctcaattgca aaatgaaaag ctctatctct attatctcca aaatggaaga     2460

gacatgtatg tggaccaaga attagatatt aatcgtttaa gtgattatga tgtcgatcac     2520

attgttccac aaagtttcct taaagacgat tcaatagaca ataaggtctt aacgcgttct     2580

gataaaaatc gtggtaaatc ggataacgtt ccaagtgaag aagtagtcaa aaagatgaaa     2640

aactattgga gacaacttct aaacgccaag ttaatcactc aacgtaagtt tgataattta     2700

acgaaagctg aacgtggagg tttgagtgaa cttgataaag ctggttttat caaacgccaa     2760

ttggttgaaa ctcgccaaat cactaagcat gtggcacaaa ttttggatag tcgcatgaat     2820

actaaatacg atgaaaatga taaacttatt cgagaggtta aagtgattac cttaaaatct     2880

aaattagttt ctgacttccg aaaagatttc caattctata aagtacgtga gattaacaat     2940

taccatcatg cccatgatgc gtatctaaat gccgtcgttg gaactgcttt gattaagaaa     3000

tatccaaaac ttgaatcgga gtttgtctat ggtgattata aagtttatga tgttcgtaaa     3060

atgattgcta agtctgagca agaaataggc aaagcaaccg caaaatattt cttttactct     3120

aatatcatga acttcttcaa aacagaaatt acacttgcaa atggagagat tcgcaaacgc     3180

cctctaatcg aaactaatgg ggaaactgga gaaattgtct gggataaagg gcgagatttt     3240

gccacagtgc gcaaagtatt gtccatgccc caagtcaata ttgtcaagaa aacagaagta     3300

cagacaggcg gattctccaa ggagtcaatt ttaccaaaaa gaaattcgga caagcttatt     3360

gctcgtaaaa aagactggga tccaaaaaaa tatggtggtt ttgatagtcc aacggtagct     3420

tattcagtcc tagtggttgc taaggtggaa aaagggaaat cgaagaagtt aaaatccgtt     3480

aaagagttac tagggatcac aattatggaa agaagttcct ttgaaaaaaa tccgattgac     3540

tttttagaag ctaaaggata taaggaagtt aaaaaagact taatcattaa actacctaaa     3600

tatagtcttt ttgagttaga aaacggtcgt aaacggatgc tggctagtgc cggagaatta     3660

caaaaaggaa atgagctggc tctgccaagc aaatatgtga attttttata tttagctagt     3720

cattatgaaa agttgaaggg tagtccagaa gataacgaac aaaaacaatt gtttgtggag     3780

cagcataagc attatttaga tgagattatt gagcaaatca gtgaattttc taagcgtgtt     3840

attttagcag atgccaattt agataaagtt cttagtgcat ataacaaaca tagagacaaa     3900

ccaatacgtg aacaagcaga aaatattatt catttattta cgttgacgaa tcttggagct     3960

cccgctgctt ttaaatattt tgatacaaca attgatcgta aacgatatac gtctacaaaa     4020

gaagttttag atgccactct tatccatcaa tccatcactg gtctttatga aacacgcatt     4080

gatttgagtc agctaggagg tgactga                                         4107


<210>  32
<211>  1368
<212>  PRT
<213>  Streptococcus

<400>  32

Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val 
1               5                   10                  15      


Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 
            20                  25                  30          


Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 
        35                  40                  45              


Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 
    50                  55                  60                  


Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 
65                  70                  75                  80  


Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 
                85                  90                  95      


Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys 
            100                 105                 110         


His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr 
        115                 120                 125             


His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp 
    130                 135                 140                 


Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 
145                 150                 155                 160 


Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 
                165                 170                 175     


Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 
            180                 185                 190         


Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 
        195                 200                 205             


Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 
    210                 215                 220                 


Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 
225                 230                 235                 240 


Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 
                245                 250                 255     


Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 
            260                 265                 270         


Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 
        275                 280                 285             


Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 
    290                 295                 300                 


Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 
305                 310                 315                 320 


Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 
                325                 330                 335     


Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 
            340                 345                 350         


Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 
        355                 360                 365             


Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 
    370                 375                 380                 


Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg 
385                 390                 395                 400 


Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 
                405                 410                 415     


Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe 
            420                 425                 430         


Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 
        435                 440                 445             


Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 
    450                 455                 460                 


Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 
465                 470                 475                 480 


Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 
                485                 490                 495     


Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 
            500                 505                 510         


Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 
        515                 520                 525             


Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 
    530                 535                 540                 


Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 
545                 550                 555                 560 


Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 
                565                 570                 575     


Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 
            580                 585                 590         


Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 
        595                 600                 605             


Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 
    610                 615                 620                 


Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 
625                 630                 635                 640 


His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 
                645                 650                 655     


Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 
            660                 665                 670         


Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 
        675                 680                 685             


Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 
    690                 695                 700                 


Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 
705                 710                 715                 720 


His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly 
                725                 730                 735     


Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 
            740                 745                 750         


Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 
        755                 760                 765             


Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile 
    770                 775                 780                 


Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro 
785                 790                 795                 800 


Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu 
                805                 810                 815     


Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 
            820                 825                 830         


Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys 
        835                 840                 845             


Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg 
    850                 855                 860                 


Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys 
865                 870                 875                 880 


Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys 
                885                 890                 895     


Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 
            900                 905                 910         


Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 
        915                 920                 925             


Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 
    930                 935                 940                 


Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 
945                 950                 955                 960 


Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 
                965                 970                 975     


Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 
            980                 985                 990         


Val Gly Thr Ala Leu Ile Lys Lys  Tyr Pro Lys Leu Glu  Ser Glu Phe 
        995                 1000                 1005             


Val Tyr  Gly Asp Tyr Lys Val  Tyr Asp Val Arg Lys  Met Ile Ala 
    1010                 1015                 1020             


Lys Ser  Glu Gln Glu Ile Gly  Lys Ala Thr Ala Lys  Tyr Phe Phe 
    1025                 1030                 1035             


Tyr Ser  Asn Ile Met Asn Phe  Phe Lys Thr Glu Ile  Thr Leu Ala 
    1040                 1045                 1050             


Asn Gly  Glu Ile Arg Lys Arg  Pro Leu Ile Glu Thr  Asn Gly Glu 
    1055                 1060                 1065             


Thr Gly  Glu Ile Val Trp Asp  Lys Gly Arg Asp Phe  Ala Thr Val 
    1070                 1075                 1080             


Arg Lys  Val Leu Ser Met Pro  Gln Val Asn Ile Val  Lys Lys Thr 
    1085                 1090                 1095             


Glu Val  Gln Thr Gly Gly Phe  Ser Lys Glu Ser Ile  Leu Pro Lys 
    1100                 1105                 1110             


Arg Asn  Ser Asp Lys Leu Ile  Ala Arg Lys Lys Asp  Trp Asp Pro 
    1115                 1120                 1125             


Lys Lys  Tyr Gly Gly Phe Asp  Ser Pro Thr Val Ala  Tyr Ser Val 
    1130                 1135                 1140             


Leu Val  Val Ala Lys Val Glu  Lys Gly Lys Ser Lys  Lys Leu Lys 
    1145                 1150                 1155             


Ser Val  Lys Glu Leu Leu Gly  Ile Thr Ile Met Glu  Arg Ser Ser 
    1160                 1165                 1170             


Phe Glu  Lys Asn Pro Ile Asp  Phe Leu Glu Ala Lys  Gly Tyr Lys 
    1175                 1180                 1185             


Glu Val  Lys Lys Asp Leu Ile  Ile Lys Leu Pro Lys  Tyr Ser Leu 
    1190                 1195                 1200             


Phe Glu  Leu Glu Asn Gly Arg  Lys Arg Met Leu Ala  Ser Ala Gly 
    1205                 1210                 1215             


Glu Leu  Gln Lys Gly Asn Glu  Leu Ala Leu Pro Ser  Lys Tyr Val 
    1220                 1225                 1230             


Asn Phe  Leu Tyr Leu Ala Ser  His Tyr Glu Lys Leu  Lys Gly Ser 
    1235                 1240                 1245             


Pro Glu  Asp Asn Glu Gln Lys  Gln Leu Phe Val Glu  Gln His Lys 
    1250                 1255                 1260             


His Tyr  Leu Asp Glu Ile Ile  Glu Gln Ile Ser Glu  Phe Ser Lys 
    1265                 1270                 1275             


Arg Val  Ile Leu Ala Asp Ala  Asn Leu Asp Lys Val  Leu Ser Ala 
    1280                 1285                 1290             


Tyr Asn  Lys His Arg Asp Lys  Pro Ile Arg Glu Gln  Ala Glu Asn 
    1295                 1300                 1305             


Ile Ile  His Leu Phe Thr Leu  Thr Asn Leu Gly Ala  Pro Ala Ala 
    1310                 1315                 1320             


Phe Lys  Tyr Phe Asp Thr Thr  Ile Asp Arg Lys Arg  Tyr Thr Ser 
    1325                 1330                 1335             


Thr Lys  Glu Val Leu Asp Ala  Thr Leu Ile His Gln  Ser Ile Thr 
    1340                 1345                 1350             


Gly Leu  Tyr Glu Thr Arg Ile  Asp Leu Ser Gln Leu  Gly Gly Asp 
    1355                 1360                 1365             


<210>  33
<211>  1053
<212>  PRT
<213>  Staphylococcus

<400>  33

Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val 
1               5                   10                  15      


Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly 
            20                  25                  30          


Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg 
        35                  40                  45              


Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile 
    50                  55                  60                  


Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His 
65                  70                  75                  80  


Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu 
                85                  90                  95      


Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu 
            100                 105                 110         


Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr 
        115                 120                 125             


Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala 
    130                 135                 140                 


Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys 
145                 150                 155                 160 


Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr 
                165                 170                 175     


Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln 
            180                 185                 190         


Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg 
        195                 200                 205             


Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys 
    210                 215                 220                 


Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe 
225                 230                 235                 240 


Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr 
                245                 250                 255     


Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn 
            260                 265                 270         


Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe 
        275                 280                 285             


Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu 
    290                 295                 300                 


Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys 
305                 310                 315                 320 


Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr 
                325                 330                 335     


Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala 
            340                 345                 350         


Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu 
        355                 360                 365             


Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser 
    370                 375                 380                 


Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile 
385                 390                 395                 400 


Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala 
                405                 410                 415     


Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln 
            420                 425                 430         


Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro 
        435                 440                 445             


Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile 
    450                 455                 460                 


Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg 
465                 470                 475                 480 


Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys 
                485                 490                 495     


Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr 
            500                 505                 510         


Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp 
        515                 520                 525             


Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu 
    530                 535                 540                 


Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro 
545                 550                 555                 560 


Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys 
                565                 570                 575     


Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu 
            580                 585                 590         


Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile 
        595                 600                 605             


Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu 
    610                 615                 620                 


Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp 
625                 630                 635                 640 


Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu 
                645                 650                 655     


Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys 
            660                 665                 670         


Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp 
        675                 680                 685             


Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp 
    690                 695                 700                 


Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys 
705                 710                 715                 720 


Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys 
                725                 730                 735     


Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu 
            740                 745                 750         


Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp 
        755                 760                 765             


Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Glu Leu Ile 
    770                 775                 780                 


Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu 
785                 790                 795                 800 


Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu 
                805                 810                 815     


Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His 
            820                 825                 830         


Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly 
        835                 840                 845             


Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr 
    850                 855                 860                 


Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile 
865                 870                 875                 880 


Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp 
                885                 890                 895     


Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr 
            900                 905                 910         


Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val 
        915                 920                 925             


Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser 
    930                 935                 940                 


Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala 
945                 950                 955                 960 


Glu Phe Ile Ala Ser Phe Tyr Asn Asn Asp Leu Ile Lys Ile Asn Gly 
                965                 970                 975     


Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Asp Leu Leu Asn Arg Ile 
            980                 985                 990         


Glu Val Asn Met Ile Asp Ile Thr  Tyr Arg Glu Tyr Leu  Glu Asn Met 
        995                 1000                 1005             


Asn Asp  Lys Arg Pro Pro Arg  Ile Ile Lys Thr Ile  Ala Ser Lys 
    1010                 1015                 1020             


Thr Gln  Ser Ile Lys Lys Tyr  Ser Thr Asp Ile Leu  Gly Asn Leu 
    1025                 1030                 1035             


Tyr Glu  Val Lys Ser Lys Lys  His Pro Gln Ile Ile  Lys Lys Gly 
    1040                 1045                 1050             


<210>  34
<211>  148
<212>  DNA
<213>  Tobacco mosaic virus

<400>  34
gaattcgtcg attcggttgc agcatttaaa gcggttgaca actttaaaag aaggaaaaag       60

aaggttgaag aaaagggtgt agtaagtaag tataagtaca gaccggagaa gtacgccggt      120

cctgattcgt ttaatttgaa agaagaaa                                         148


<210>  35
<211>  453
<212>  DNA
<213>  Nicotiana tabacum IRES

<400>  35
ggcacgaggc tcccattaat atttcttctt ctgtgtaatt ccattattct gtagtagatt       60

cacgtccgag tttaaagaag agagaaaact gaaaaggcag aaaattccag agctttagat      120

ttagccaaag atagttatgg tcgtgttgtt cttggtgaag attggcaaag taggagccaa      180

tggaagaaac taagatcata atcaatcgcc ccaaaaacaa ccttgttcat tctatggttt      240

ttctcttcgg tttctatgtt tgggattggg aattcctcac tgtccttttg cttttcagtt      300

attgctcctt ctaattttcc ctagctagga tcttctcaat taatttcctt tttcattttc      360

aactaactca taattagccc aaatcttcaa aagagttttg tgtaagttga tagacgttta      420

gagaaacaga gaaatacagg ggaaaaacaa ggg                                   453


<210>  36
<211>  192
<212>  DNA
<213>  turnip mosaic potyvirus IRES

<400>  36
gggaaagctt gcatgcctgc aggtcgactc tagaaaaata taaaaactca acacaacata       60

cacaaaacga ttaaagcaaa cacaatcttt caaagcattc aaagcattca agcaatcaaa      120

gattttcaaa tcttttgtcg ttatcaaagc aatcaccaac aggatccagg atccccgggt      180

ggtcagtccc tt                                                          192


<210>  37
<211>  143
<212>  DNA
<213>  tobacco etch virus IRES

<400>  37
aaataacaaa tctcaacaca acatatacaa aacaaacgaa tctcaagcaa tcaagcattc       60

tacttctatt gcagcaattt aaatcatttc ttttaaagca aaagcaattt tctgaaaatt      120

ttcaccattt acgaacgata gca                                              143


<210>  38
<211>  160
<212>  DNA
<213>  satellite tobacco necrosis virus CITE

<400>  38
aaaaaaaaaa aagtaaagac aggaaacttt actgactaac tcccagaggt tcacaatgtt       60

agtgatgggg cgctgaaaga tgcgtagcta cccttctgga gccacttcct ggtggtaagc      120

agaaatccaa gggtacggtg gtacggtgga aagcagtccc                            160


<210>  39
<211>  25
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  NtHSF1_IRES_F primer

<400>  39
ggcacgaggc tcccattaat atttc                                             25


<210>  40
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  TuMV_IRES_F primer

<400>  40
gggaaagctt gcatgcctg                                                    19


<210>  41
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  TEV_IRES_F	primer

<400>  41
gaaataacaa atctcaacac aacatatac                                         29


<210>  42
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  rpoA-for primer

<400>  42
gcaccaaaga aggcgttcag                                                   20


<210>  43
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  rpoA-rev primer

<400>  43
ggtcaggtgg cagatcacat                                                   20


<210>  44
<211>  63
<212>  DNA
<213>  E. Coli

<400>  44
atgaaaaaga cggcgattgc tatcgctgtg gcgcttgctg gattcgccac tgtagcacaa       60

gca                                                                     63


