SEQUENCE LISTING

<110>  Bayer CropScience AG
 
<120>  Plants tolerant to HPPD inhibitor herbicides

<130>  BCS 11-1055

<160>  37    

<170>  PatentIn version 3.5

<210>  1
<211>  1206
<212>  DNA
<213>  Rhodococcus opacus B4

<400>  1
atgacgatcg agcagaccct caccgacaag gaacgcctgg caggtctcga cctcggccag       60

ctcgagcaac tggtcgggct cgtcgaatac gacggcaccc gcgacccgtt cccggtcagc      120

ggctgggacg ccgtcgtctg ggtggtcggc aacgccaccc agaccgccca ctacttccag      180

tccgcgttcg ggatgaccct cgtcgcctac tccgggccca ccaccggcaa ccgcgaccac      240

cacagcttcg tcctcgaatc cggcgccgtc cgcttcgtca tccagggcgc cgtcgacccc      300

cagagcccgc tgatcgagca ccaccgcgcc cacggcgacg gcgtcgtcga catcgcgttg      360

tcggtgcccg acgtcgacaa gtgcatcgcc cacgcccgcg cccagggcgc cgtcgtcctc      420

gacgaacccc acgacatgac cgacgagcac ggcaccgtcc ggctcgccgc gatcgccacc      480

tacggcgaca cccggcacac cctcgtcgac cgcacccact acaccggccc ctacctgccc      540

ggctacatcg cacgcacctc cacgcacacc aagcgcgacg gcgcccccaa acgcctgttc      600

caggccctcg accacgtcgt cggcaacgtc gaactcggcc ggatggacca ctgggtcgac      660

ttctacaacc gggtcatggg ctttacgaac atggccgagt tcgtcggcga ggacatcgcc      720

accgactact ccgcactgat gtccaaggtc gtctccaacg gcaaccaccg ggtgaagttc      780

ccgctcaatg aacccgcgat cgcgaagaag cgttcgcaga tcgacgagta cctggacttc      840

taccaaggtc ccggcgcgca gcacctggcg ctggccacca acgacatcct caccgccgtg      900

gaccggctca ccgccgaggg cgtcgaattc ctggccaccc ccgactccta ctaccaggac      960

ccggaactgc gggcgcggat cggtaacgtc cgcgccccga tcgaggagtt gcagaaacgc     1020

ggcatcctcg tcgaccgcga cgaggacggc tacctgctgc agatcttcac caaacccctc     1080

gtcgaccggc ccaccgtgtt cttcgaactc atcgaacgcc acggctccct cggcttcggc     1140

atcggcaact tcaaggccct cttcgaagcc atcgaacgcg aacaggccgc ccgcggaaac     1200

ttctga                                                                1206


<210>  2
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for E. coli , containing at the 5' end a nucleic acid 
       encoding an aspartate


<220>
<221>  misc_feature
<222>  (4)..(6)
<223>  sequence coding for Asp

<400>  2
atgacgatcg agcagaccct caccgacaag gaacgcctgg caggtctcga cctcggccag       60

ctcgagcaac tggtcgggct cgtcgaatac gacggcaccc gcgacccgtt cccggtcagc      120

ggctgggacg ccgtcgtctg ggtggtcggc aacgccaccc agaccgccca ctacttccag      180

tccgcgttcg ggatgaccct cgtcgcctac tccgggccca ccaccggcaa ccgcgaccac      240

cacagcttcg tcctcgaatc cggcgccgtc cgcttcgtca tccagggcgc cgtcgacccc      300

cagagcccgc tgatcgagca ccaccgcgcc cacggcgacg gcgtcgtcga catcgcgttg      360

tcggtgcccg acgtcgacaa gtgcatcgcc cacgcccgcg cccagggcgc cgtcgtcctc      420

gacgaacccc acgacatgac cgacgagcac ggcaccgtcc ggctcgccgc gatcgccacc      480

tacggcgaca cccggcacac cctcgtcgac cgcacccact acaccggccc ctacctgccc      540

ggctacatcg cacgcacctc cacgcacacc aagcgcgacg gcgcccccaa acgcctgttc      600

caggccctcg accacgtcgt cggcaacgtc gaactcggcc ggatggacca ctgggtcgac      660

ttctacaacc gggtcatggg ctttacgaac atggccgagt tcgtcggcga ggacatcgcc      720

accgactact ccgcactgat gtccaaggtc gtctccaacg gcaaccaccg ggtgaagttc      780

ccgctcaatg aacccgcgat cgcgaagaag cgttcgcaga tcgacgagta cctggacttc      840

taccaaggtc ccggcgcgca gcacctggcg ctggccacca acgacatcct caccgccgtg      900

gaccggctca ccgccgaggg cgtcgaattc ctggccaccc ccgactccta ctaccaggac      960

ccggaactgc gggcgcggat cggtaacgtc cgcgccccga tcgaggagtt gcagaaacgc     1020

ggcatcctcg tcgaccgcga cgaggacggc tacctgctgc agatcttcac caaacccctc     1080

gtcgaccggc ccaccgtgtt cttcgaactc atcgaacgcc acggctccct cggcttcggc     1140

atcggcaact tcaaggccct cttcgaagcc atcgaacgcg aacaggccgc ccgcggaaac     1200

ttctga                                                                1206


<210>  3
<211>  1341
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for E. coli, containing at the 5' end a nucleic acid 
       encoding several tags


<220>
<221>  misc_feature
<222>  (4)..(21)
<223>  Sequence encoding His tag containing 6 His

<220>
<221>  misc_feature
<222>  (22)..(31)
<223>  Sequence coding for two Ser and a Gly

<220>
<221>  misc_feature
<222>  (32)..(48)
<223>  Nucleic acid stretch encoding a protein binding site thrombin

<220>
<221>  misc_feature
<222>  (49)..(54)
<223>  Sequence coding for Gly and Met

<220>
<221>  misc_feature
<222>  (55)..(99)
<223>  Nucleic acid stretch encoding a S-tag

<220>
<221>  misc_feature
<222>  (100)..(114)
<223>  Nucleic acid coding for 2 Pro, a Asp, a Leu, a Gly and a Thr

<220>
<221>  misc_feature
<222>  (115)..(129)
<223>  Nucleic acid stretch encoding the recognition site of an 
       enterokinase

<220>
<221>  misc_feature
<222>  (130)..(132)
<223>  Nucleic acids coding for Ala

<220>
<221>  misc_feature
<222>  (133)..(135)
<223>  Start codon

<220>
<221>  misc_feature
<222>  (136)..(138)
<223>  nucleic acid sequence encoding an aspartate

<400>  3
atgcaccatc atcatcatca ttcttctggt ctggtgccac gcggttctgg tatgaaagaa       60

accgctgctg ctaaattcga acgccagcac atggacagcc cagatctggg taccgacgac      120

gacgacaagg ccatggatac tattgaacag accctgaccg ataaagaacg tctggcaggt      180

ctggatctgg gtcaactgga acaactggtt ggtctggttg aatacgacgg cacccgtgat      240

ccgtttccgg ttagcggttg ggacgcagtt gtttgggttg ttggtaacgc tactcagacc      300

gctcactatt ttcagtcagc ctttggtatg accctggttg cctatagcgg tccgactaca      360

ggtaatcgtg atcaccacag ctttgttctg gaatcaggtg cagttcgttt tgttattcag      420

ggtgcagttg atccgcagtc accgctgatt gaacaccacc gtgctcacgg tgacggtgtt      480

gttgatattg cactgagcgt gccggacgtt gataagtgta ttgctcacgc acgtgctcag      540

ggtgccgttg ttctggacga accgcacgat atgaccgacg aacacggcac cgttcgtctg      600

gcagctattg ctacctacgg tgatacccgt cacaccctgg ttgatcgtac tcactataca      660

ggtccgtatc tgcctggtta tattgcacgt actagcactc acactaaacg tgacggtgca      720

ccgaaacgtc tgtttcaggc actggatcac gttgtgggta acgttgaact gggtcgtatg      780

gatcactggg tggattttta taatcgcgtg atgggcttta ctaatatggc cgaatttgtg      840

ggtgaagata ttgctaccga ttattcagca ctgatgagta aagtggttag taacggtaat      900

caccgtgtta agtttccgct gaacgaaccg gctattgcta aaaaacgtag tcagattgac      960

gaatatctgg atttttatca gggtccgggt gctcagcacc tggcactggc tactaacgat     1020

attctgaccg cagttgatcg tctgaccgca gaaggtgttg aatttctggc tacaccggat     1080

agctattatc aggacccgga actgcgtgca cgtattggta acgttcgtgc accgattgaa     1140

gaattacaga aacgtggtat tctggtggat cgtgacgaag acggttatct gctgcaaatt     1200

tttactaaac cgctggttga tcgtccgacc gttttttttg aactgattga acgtcacggt     1260

agcctgggtt ttggtattgg taactttaaa gccttatttg aagctattga acgtgaacag     1320

gcagcacgcg gtaactttta a                                               1341


<210>  4
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 4

<400>  4

His His His His His His 
1               5       


<210>  5
<211>  18
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid encoding 6 consecutive histidines

<400>  5
caccatcatc atcatcat                                                     18


<210>  6
<211>  18
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid stretch encoding a protein binding site thrombin

<400>  6
ctggtgccac gcggttct                                                     18


<210>  7
<211>  6
<212>  PRT
<213>  Artificial protein

<220>
<223>  Protein encoded by SEQ ID No. 6

<400>  7

Leu Val Pro Arg Gly Ser 
1               5       


<210>  8
<211>  45
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid stretch encoding a S-tag

<400>  8
aaagaaaccg ctgctgctaa attcgaacgc cagcacatgg acagc                       45


<210>  9
<211>  15
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 8

<400>  9

Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser 
1               5                   10                  15  


<210>  10
<211>  15
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid stretch encoding the recognition site of an 
       enterokinase

<400>  10
gacgacgacg acaag                                                        15


<210>  11
<211>  5
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 10

<400>  11

Asp Asp Asp Asp Lys 
1               5   


<210>  12
<211>  1584
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for E. coli containing at the 5' end a nucleic acid 
       sequence encoding an optimized transit peptide (according to SEQ 
       ID No. 13) and an aspartate.


<220>
<221>  transit_peptide
<222>  (1)..(375)
<223>  Nucleic acid coding for an optimized transit peptide to 
       chloroplasts

<220>
<221>  misc_feature
<222>  (376)..(378)
<223>  Sequence encoding a Met

<220>
<221>  misc_feature
<222>  (379)..(381)
<223>  Sequence coding for Asp

<400>  12
atggcttcga tctcctcctc agtcgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatgtatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

agaatccggt gcgccatgga tactattgaa cagaccctga ccgataaaga acgtctggca      420

ggtctggatc tgggtcaact ggaacaactg gttggtctgg ttgaatacga cggcacccgt      480

gatccgtttc cggttagcgg ttgggacgca gttgtttggg ttgttggtaa cgctactcag      540

accgctcact attttcagtc agcctttggt atgaccctgg ttgcctatag cggtccgact      600

acaggtaatc gtgatcacca cagctttgtt ctggaatcag gtgcagttcg ttttgttatt      660

cagggtgcag ttgatccgca gtcaccgctg attgaacacc accgtgctca cggtgacggt      720

gttgttgata ttgcactgag cgtgccggac gttgataagt gtattgctca cgcacgtgct      780

cagggtgccg ttgttctgga cgaaccgcac gatatgaccg acgaacacgg caccgttcgt      840

ctggcagcta ttgctaccta cggtgatacc cgtcacaccc tggttgatcg tactcactat      900

acaggtccgt atctgcctgg ttatattgca cgtactagca ctcacactaa acgtgacggt      960

gcaccgaaac gtctgtttca ggcactggat cacgttgtgg gtaacgttga actgggtcgt     1020

atggatcact gggtggattt ttataatcgc gtgatgggct ttactaatat ggccgaattt     1080

gtgggtgaag atattgctac cgattattca gcactgatga gtaaagtggt tagtaacggt     1140

aatcaccgtg ttaagtttcc gctgaacgaa ccggctattg ctaaaaaacg tagtcagatt     1200

gacgaatatc tggattttta tcagggtccg ggtgctcagc acctggcact ggctactaac     1260

gatattctga ccgcagttga tcgtctgacc gcagaaggtg ttgaatttct ggctacaccg     1320

gatagctatt atcaggaccc ggaactgcgt gcacgtattg gtaacgttcg tgcaccgatt     1380

gaagaattac agaaacgtgg tattctggtg gatcgtgacg aagacggtta tctgctgcaa     1440

atttttacta aaccgctggt tgatcgtccg accgtttttt ttgaactgat tgaacgtcac     1500

ggtagcctgg gttttggtat tggtaacttt aaagccttat ttgaagctat tgaacgtgaa     1560

caggcagcac gcggtaactt ttaa                                            1584


<210>  13
<211>  372
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding an optimized transit peptide

<400>  13
atggcttcga tctcctcctc agtcgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatgtatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

agaatccggt gc                                                          372


<210>  14
<211>  124
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 13

<400>  14

Met Ala Ser Ile Ser Ser Ser Val Ala Thr Val Ser Arg Thr Ala Pro 
1               5                   10                  15      


Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu Lys Ser Asn Ala 
            20                  25                  30          


Ala Phe Pro Thr Thr Lys Lys Ala Asn Asp Phe Ser Thr Leu Pro Ser 
        35                  40                  45              


Asn Gly Gly Arg Val Gln Cys Met Gln Val Trp Pro Ala Tyr Gly Asn 
    50                  55                  60                  


Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Pro Leu Ser Met Ala Pro 
65                  70                  75                  80  


Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro Phe Gln Gly 
                85                  90                  95      


Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser Ser Arg Ser 
            100                 105                 110         


Leu Gly Asn Val Ser Asn Gly Gly Arg Ile Arg Cys 
        115                 120                 


<210>  15
<211>  1581
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for dicotyledonous plants containing at the 5' end a 
       nucleic acid sequence encoding an optimized transit peptide 
       (according to SEQ ID No. 13) and an aspartate.


<220>
<221>  transit_peptide
<222>  (1)..(372)
<223>  Nucleic acid coding for an optimized transit peptide to 
       chloroplasts

<220>
<221>  misc_feature
<222>  (373)..(375)
<223>  Nucleic acid coding for Met

<220>
<221>  misc_feature
<222>  (376)..(378)
<223>  Nucleic acid coding for Asp

<400>  15
atggcttcga tctcctcctc agttgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatatatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

aggatccggt gcatggacac tattgagcag accctcaccg acaaagaaag gcttgctgga      420

cttgatctcg gtcagcttga gcagcttgtt ggacttgttg agtacgacgg cactagggac      480

cctttcccag ttagtggttg ggacgctgtt gtttgggttg tgggtaacgc tactcaaacc      540

gctcactact ttcagtcagc cttcggaatg accctcgtgg cttattcagg acctactact      600

ggtaataggg atcaccactc cttcgtgctt gagtcaggtg ctgttagatt cgtgattcag      660

ggcgctgttg atcctcagtc accacttatt gagcaccaca gggctcacgg tgacggtgtt      720

gttgatattg ctcttagcgt gcccgacgtg gacaagtgta ttgctcacgc tagggctcag      780

ggtgctgttg ttcttgacga acctcacgat atgactgacg agcacggaac tgttaggctc      840

gctgctattg ctacttacgg tgacactagg cacaccctcg ttgataggac tcactacact      900

ggaccttacc tcccaggcta tattgctagg acctctactc acactaagag ggacggtgct      960

cctaagaggc tttttcaggc tcttgatcac gttgtgggta acgtggaact cggtagaatg     1020

gatcactggg tggacttcta taatagggtg atgggcttca ctaatatggc cgagttcgtg     1080

ggcgaggata ttgctactga ttactcagcc ctgatgtcta aagtggttag taacggtaat     1140

cacagggtta agttcccact taacgagccc gctatcgcta agaaacgtag tcagattgac     1200

gagtacctcg acttctatca gggaccaggt gctcaacacc ttgctctcgc tactaacgat     1260

attctcaccg ctgtggatag gcttaccgct gaaggtgttg agttccttgc tacccccgat     1320

agctactatc aggacccaga acttagggct aggatcggta acgttagggc tcctattgag     1380

gaacttcaga agaggggaat cctggttgat agggacgagg acggttacct ccttcagatc     1440

ttcactaagc cactcgtgga taggcctact gtgttcttcg agcttattga gcgtcacgga     1500

tcactcggat tcgggatcgg taactttaag gccctcttcg aggctatcga gagagagcaa     1560

gctgctaggg gcaacttcta g                                               1581


<210>  16
<211>  1578
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       containing at the 5' end a nucleic acid sequence encoding an 
       optimized transit peptide (according to SEQ ID No. 13).


<220>
<221>  transit_peptide
<222>  (1)..(372)
<223>  Sequence coding for an optimized transit peptide to chloroplasts

<400>  16
atggcttcga tctcctcctc agttgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatatatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

aggatccggt gcatgacgat cgagcagacc ctcaccgaca aggaacgcct ggcaggtctc      420

gacctcggcc agctcgagca actggtcggg ctcgtcgaat acgacggcac ccgcgacccg      480

ttcccggtca gcggctggga cgccgtcgtc tgggtggtcg gcaacgccac ccagaccgcc      540

cactacttcc agtccgcgtt cgggatgacc ctcgtcgcct actccgggcc caccaccggc      600

aaccgcgacc accacagctt cgtcctcgaa tccggcgccg tccgcttcgt catccagggc      660

gccgtcgacc cccagagccc gctgatcgag caccaccgcg cccacggcga cggcgtcgtc      720

gacatcgcgt tgtcggtgcc cgacgtcgac aagtgcatcg cccacgcccg cgcccagggc      780

gccgtcgtcc tcgacgaacc ccacgacatg accgacgagc acggcaccgt ccggctcgcc      840

gcgatcgcca cctacggcga cacccggcac accctcgtcg accgcaccca ctacaccggc      900

ccctacctgc ccggctacat cgcacgcacc tccacgcaca ccaagcgcga cggcgccccc      960

aaacgcctgt tccaggccct cgaccacgtc gtcggcaacg tcgaactcgg ccggatggac     1020

cactgggtcg acttctacaa ccgggtcatg ggctttacga acatggccga gttcgtcggc     1080

gaggacatcg ccaccgacta ctccgcactg atgtccaagg tcgtctccaa cggcaaccac     1140

cgggtgaagt tcccgctcaa tgaacccgcg atcgcgaaga agcgttcgca gatcgacgag     1200

tacctggact tctaccaagg tcccggcgcg cagcacctgg cgctggccac caacgacatc     1260

ctcaccgccg tggaccggct caccgccgag ggcgtcgaat tcctggccac ccccgactcc     1320

tactaccagg acccggaact gcgggcgcgg atcggtaacg tccgcgcccc gatcgaggag     1380

ttgcagaaac gcggcatcct cgtcgaccgc gacgaggacg gctacctgct gcagatcttc     1440

accaaacccc tcgtcgaccg gcccaccgtg ttcttcgaac tcatcgaacg ccacggctcc     1500

ctcggcttcg gcatcggcaa cttcaaggcc ctcttcgaag ccatcgaacg cgaacaggcc     1560

gcccgcggaa acttctga                                                   1578


<210>  17
<211>  401
<212>  PRT
<213>  Rhodococcus opacus B4

<400>  17

Met Thr Ile Glu Gln Thr Leu Thr Asp Lys Glu Arg Leu Ala Gly Leu 
1               5                   10                  15      


Asp Leu Gly Gln Leu Glu Gln Leu Val Gly Leu Val Glu Tyr Asp Gly 
            20                  25                  30          


Thr Arg Asp Pro Phe Pro Val Ser Gly Trp Asp Ala Val Val Trp Val 
        35                  40                  45              


Val Gly Asn Ala Thr Gln Thr Ala His Tyr Phe Gln Ser Ala Phe Gly 
    50                  55                  60                  


Met Thr Leu Val Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp His 
65                  70                  75                  80  


His Ser Phe Val Leu Glu Ser Gly Ala Val Arg Phe Val Ile Gln Gly 
                85                  90                  95      


Ala Val Asp Pro Gln Ser Pro Leu Ile Glu His His Arg Ala His Gly 
            100                 105                 110         


Asp Gly Val Val Asp Ile Ala Leu Ser Val Pro Asp Val Asp Lys Cys 
        115                 120                 125             


Ile Ala His Ala Arg Ala Gln Gly Ala Val Val Leu Asp Glu Pro His 
    130                 135                 140                 


Asp Met Thr Asp Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala Thr 
145                 150                 155                 160 


Tyr Gly Asp Thr Arg His Thr Leu Val Asp Arg Thr His Tyr Thr Gly 
                165                 170                 175     


Pro Tyr Leu Pro Gly Tyr Ile Ala Arg Thr Ser Thr His Thr Lys Arg 
            180                 185                 190         


Asp Gly Ala Pro Lys Arg Leu Phe Gln Ala Leu Asp His Val Val Gly 
        195                 200                 205             


Asn Val Glu Leu Gly Arg Met Asp His Trp Val Asp Phe Tyr Asn Arg 
    210                 215                 220                 


Val Met Gly Phe Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile Ala 
225                 230                 235                 240 


Thr Asp Tyr Ser Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn His 
                245                 250                 255     


Arg Val Lys Phe Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg Ser 
            260                 265                 270         


Gln Ile Asp Glu Tyr Leu Asp Phe Tyr Gln Gly Pro Gly Ala Gln His 
        275                 280                 285             


Leu Ala Leu Ala Thr Asn Asp Ile Leu Thr Ala Val Asp Arg Leu Thr 
    290                 295                 300                 


Ala Glu Gly Val Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Gln Asp 
305                 310                 315                 320 


Pro Glu Leu Arg Ala Arg Ile Gly Asn Val Arg Ala Pro Ile Glu Glu 
                325                 330                 335     


Leu Gln Lys Arg Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr Leu 
            340                 345                 350         


Leu Gln Ile Phe Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe Phe 
        355                 360                 365             


Glu Leu Ile Glu Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn Phe 
    370                 375                 380                 


Lys Ala Leu Phe Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly Asn 
385                 390                 395                 400 


Phe 
    


<210>  18
<211>  402
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 2


<220>
<221>  MISC_FEATURE
<222>  (2)..(2)
<223>  Asp

<400>  18

Met Asp Thr Ile Glu Gln Thr Leu Thr Asp Lys Glu Arg Leu Ala Gly 
1               5                   10                  15      


Leu Asp Leu Gly Gln Leu Glu Gln Leu Val Gly Leu Val Glu Tyr Asp 
            20                  25                  30          


Gly Thr Arg Asp Pro Phe Pro Val Ser Gly Trp Asp Ala Val Val Trp 
        35                  40                  45              


Val Val Gly Asn Ala Thr Gln Thr Ala His Tyr Phe Gln Ser Ala Phe 
    50                  55                  60                  


Gly Met Thr Leu Val Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp 
65                  70                  75                  80  


His His Ser Phe Val Leu Glu Ser Gly Ala Val Arg Phe Val Ile Gln 
                85                  90                  95      


Gly Ala Val Asp Pro Gln Ser Pro Leu Ile Glu His His Arg Ala His 
            100                 105                 110         


Gly Asp Gly Val Val Asp Ile Ala Leu Ser Val Pro Asp Val Asp Lys 
        115                 120                 125             


Cys Ile Ala His Ala Arg Ala Gln Gly Ala Val Val Leu Asp Glu Pro 
    130                 135                 140                 


His Asp Met Thr Asp Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala 
145                 150                 155                 160 


Thr Tyr Gly Asp Thr Arg His Thr Leu Val Asp Arg Thr His Tyr Thr 
                165                 170                 175     


Gly Pro Tyr Leu Pro Gly Tyr Ile Ala Arg Thr Ser Thr His Thr Lys 
            180                 185                 190         


Arg Asp Gly Ala Pro Lys Arg Leu Phe Gln Ala Leu Asp His Val Val 
        195                 200                 205             


Gly Asn Val Glu Leu Gly Arg Met Asp His Trp Val Asp Phe Tyr Asn 
    210                 215                 220                 


Arg Val Met Gly Phe Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile 
225                 230                 235                 240 


Ala Thr Asp Tyr Ser Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn 
                245                 250                 255     


His Arg Val Lys Phe Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg 
            260                 265                 270         


Ser Gln Ile Asp Glu Tyr Leu Asp Phe Tyr Gln Gly Pro Gly Ala Gln 
        275                 280                 285             


His Leu Ala Leu Ala Thr Asn Asp Ile Leu Thr Ala Val Asp Arg Leu 
    290                 295                 300                 


Thr Ala Glu Gly Val Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Gln 
305                 310                 315                 320 


Asp Pro Glu Leu Arg Ala Arg Ile Gly Asn Val Arg Ala Pro Ile Glu 
                325                 330                 335     


Glu Leu Gln Lys Arg Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr 
            340                 345                 350         


Leu Leu Gln Ile Phe Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe 
        355                 360                 365             


Phe Glu Leu Ile Glu Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn 
    370                 375                 380                 


Phe Lys Ala Leu Phe Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly 
385                 390                 395                 400 


Asn Phe 
        


<210>  19
<211>  446
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 3.


<220>
<221>  MISC_FEATURE
<222>  (2)..(7)
<223>  HIS Tag made of 6 HIS

<220>
<221>  MISC_FEATURE
<222>  (8)..(8)
<223>  Ser

<220>
<221>  MISC_FEATURE
<222>  (9)..(9)
<223>  Ser

<220>
<221>  MISC_FEATURE
<222>  (10)..(10)
<223>  Gly

<220>
<221>  MISC_FEATURE
<222>  (11)..(16)
<223>  protein binding site thrombin

<220>
<221>  MISC_FEATURE
<222>  (17)..(17)
<223>  Gly

<220>
<221>  MISC_FEATURE
<222>  (18)..(18)
<223>  Met

<220>
<221>  MISC_FEATURE
<222>  (19)..(33)
<223>  S-Tag

<220>
<221>  MISC_FEATURE
<222>  (34)..(34)
<223>  Pro

<220>
<221>  MISC_FEATURE
<222>  (35)..(35)
<223>  Pro

<220>
<221>  MISC_FEATURE
<222>  (36)..(36)
<223>  Asp

<220>
<221>  MISC_FEATURE
<222>  (37)..(37)
<223>  Asp

<220>
<221>  MISC_FEATURE
<222>  (38)..(38)
<223>  Gly

<220>
<221>  MISC_FEATURE
<222>  (39)..(39)
<223>  Thr

<220>
<221>  MISC_FEATURE
<222>  (40)..(43)
<223>  recognition site of an enterokinase

<220>
<221>  MISC_FEATURE
<222>  (44)..(44)
<223>  Ala

<220>
<221>  MISC_FEATURE
<222>  (45)..(45)
<223>  Met

<220>
<221>  MISC_FEATURE
<222>  (46)..(46)
<223>  Asp

<400>  19

Met His His His His His His Ser Ser Gly Leu Val Pro Arg Gly Ser 
1               5                   10                  15      


Gly Met Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gln His Met Asp 
            20                  25                  30          


Ser Pro Asp Leu Gly Thr Asp Asp Asp Asp Lys Ala Met Asp Thr Ile 
        35                  40                  45              


Glu Gln Thr Leu Thr Asp Lys Glu Arg Leu Ala Gly Leu Asp Leu Gly 
    50                  55                  60                  


Gln Leu Glu Gln Leu Val Gly Leu Val Glu Tyr Asp Gly Thr Arg Asp 
65                  70                  75                  80  


Pro Phe Pro Val Ser Gly Trp Asp Ala Val Val Trp Val Val Gly Asn 
                85                  90                  95      


Ala Thr Gln Thr Ala His Tyr Phe Gln Ser Ala Phe Gly Met Thr Leu 
            100                 105                 110         


Val Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp His His Ser Phe 
        115                 120                 125             


Val Leu Glu Ser Gly Ala Val Arg Phe Val Ile Gln Gly Ala Val Asp 
    130                 135                 140                 


Pro Gln Ser Pro Leu Ile Glu His His Arg Ala His Gly Asp Gly Val 
145                 150                 155                 160 


Val Asp Ile Ala Leu Ser Val Pro Asp Val Asp Lys Cys Ile Ala His 
                165                 170                 175     


Ala Arg Ala Gln Gly Ala Val Val Leu Asp Glu Pro His Asp Met Thr 
            180                 185                 190         


Asp Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala Thr Tyr Gly Asp 
        195                 200                 205             


Thr Arg His Thr Leu Val Asp Arg Thr His Tyr Thr Gly Pro Tyr Leu 
    210                 215                 220                 


Pro Gly Tyr Ile Ala Arg Thr Ser Thr His Thr Lys Arg Asp Gly Ala 
225                 230                 235                 240 


Pro Lys Arg Leu Phe Gln Ala Leu Asp His Val Val Gly Asn Val Glu 
                245                 250                 255     


Leu Gly Arg Met Asp His Trp Val Asp Phe Tyr Asn Arg Val Met Gly 
            260                 265                 270         


Phe Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile Ala Thr Asp Tyr 
        275                 280                 285             


Ser Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn His Arg Val Lys 
    290                 295                 300                 


Phe Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg Ser Gln Ile Asp 
305                 310                 315                 320 


Glu Tyr Leu Asp Phe Tyr Gln Gly Pro Gly Ala Gln His Leu Ala Leu 
                325                 330                 335     


Ala Thr Asn Asp Ile Leu Thr Ala Val Asp Arg Leu Thr Ala Glu Gly 
            340                 345                 350         


Val Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Gln Asp Pro Glu Leu 
        355                 360                 365             


Arg Ala Arg Ile Gly Asn Val Arg Ala Pro Ile Glu Glu Leu Gln Lys 
    370                 375                 380                 


Arg Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr Leu Leu Gln Ile 
385                 390                 395                 400 


Phe Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe Phe Glu Leu Ile 
                405                 410                 415     


Glu Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn Phe Lys Ala Leu 
            420                 425                 430         


Phe Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly Asn Phe 
        435                 440                 445     


<210>  20
<211>  527
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 12


<220>
<221>  TRANSIT
<222>  (1)..(125)
<223>  Optimized transit peptide to cholorplasts

<220>
<221>  MISC_FEATURE
<222>  (126)..(126)
<223>  Met

<220>
<221>  MISC_FEATURE
<222>  (127)..(127)
<223>  Asp

<400>  20

Met Ala Ser Ile Ser Ser Ser Val Ala Thr Val Ser Arg Thr Ala Pro 
1               5                   10                  15      


Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu Lys Ser Asn Ala 
            20                  25                  30          


Ala Phe Pro Thr Thr Lys Lys Ala Asn Asp Phe Ser Thr Leu Pro Ser 
        35                  40                  45              


Asn Gly Gly Arg Val Gln Cys Met Gln Val Trp Pro Ala Tyr Gly Asn 
    50                  55                  60                  


Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Pro Leu Ser Met Ala Pro 
65                  70                  75                  80  


Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro Phe Gln Gly 
                85                  90                  95      


Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser Ser Arg Ser 
            100                 105                 110         


Leu Gly Asn Val Ser Asn Gly Gly Arg Ile Arg Cys Ala Met Asp Thr 
        115                 120                 125             


Ile Glu Gln Thr Leu Thr Asp Lys Glu Arg Leu Ala Gly Leu Asp Leu 
    130                 135                 140                 


Gly Gln Leu Glu Gln Leu Val Gly Leu Val Glu Tyr Asp Gly Thr Arg 
145                 150                 155                 160 


Asp Pro Phe Pro Val Ser Gly Trp Asp Ala Val Val Trp Val Val Gly 
                165                 170                 175     


Asn Ala Thr Gln Thr Ala His Tyr Phe Gln Ser Ala Phe Gly Met Thr 
            180                 185                 190         


Leu Val Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp His His Ser 
        195                 200                 205             


Phe Val Leu Glu Ser Gly Ala Val Arg Phe Val Ile Gln Gly Ala Val 
    210                 215                 220                 


Asp Pro Gln Ser Pro Leu Ile Glu His His Arg Ala His Gly Asp Gly 
225                 230                 235                 240 


Val Val Asp Ile Ala Leu Ser Val Pro Asp Val Asp Lys Cys Ile Ala 
                245                 250                 255     


His Ala Arg Ala Gln Gly Ala Val Val Leu Asp Glu Pro His Asp Met 
            260                 265                 270         


Thr Asp Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala Thr Tyr Gly 
        275                 280                 285             


Asp Thr Arg His Thr Leu Val Asp Arg Thr His Tyr Thr Gly Pro Tyr 
    290                 295                 300                 


Leu Pro Gly Tyr Ile Ala Arg Thr Ser Thr His Thr Lys Arg Asp Gly 
305                 310                 315                 320 


Ala Pro Lys Arg Leu Phe Gln Ala Leu Asp His Val Val Gly Asn Val 
                325                 330                 335     


Glu Leu Gly Arg Met Asp His Trp Val Asp Phe Tyr Asn Arg Val Met 
            340                 345                 350         


Gly Phe Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile Ala Thr Asp 
        355                 360                 365             


Tyr Ser Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn His Arg Val 
    370                 375                 380                 


Lys Phe Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg Ser Gln Ile 
385                 390                 395                 400 


Asp Glu Tyr Leu Asp Phe Tyr Gln Gly Pro Gly Ala Gln His Leu Ala 
                405                 410                 415     


Leu Ala Thr Asn Asp Ile Leu Thr Ala Val Asp Arg Leu Thr Ala Glu 
            420                 425                 430         


Gly Val Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Gln Asp Pro Glu 
        435                 440                 445             


Leu Arg Ala Arg Ile Gly Asn Val Arg Ala Pro Ile Glu Glu Leu Gln 
    450                 455                 460                 


Lys Arg Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr Leu Leu Gln 
465                 470                 475                 480 


Ile Phe Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe Phe Glu Leu 
                485                 490                 495     


Ile Glu Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn Phe Lys Ala 
            500                 505                 510         


Leu Phe Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly Asn Phe 
        515                 520                 525         


<210>  21
<211>  526
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 15


<220>
<221>  TRANSIT
<222>  (1)..(124)
<223>  Optimized transit peptide to cholorplasts

<220>
<221>  MISC_FEATURE
<222>  (125)..(125)
<223>  Met

<220>
<221>  MISC_FEATURE
<222>  (126)..(126)
<223>  Asp

<400>  21

Met Ala Ser Ile Ser Ser Ser Val Ala Thr Val Ser Arg Thr Ala Pro 
1               5                   10                  15      


Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu Lys Ser Asn Ala 
            20                  25                  30          


Ala Phe Pro Thr Thr Lys Lys Ala Asn Asp Phe Ser Thr Leu Pro Ser 
        35                  40                  45              


Asn Gly Gly Arg Val Gln Tyr Met Gln Val Trp Pro Ala Tyr Gly Asn 
    50                  55                  60                  


Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Pro Leu Ser Met Ala Pro 
65                  70                  75                  80  


Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro Phe Gln Gly 
                85                  90                  95      


Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser Ser Arg Ser 
            100                 105                 110         


Leu Gly Asn Val Ser Asn Gly Gly Arg Ile Arg Cys Met Asp Thr Ile 
        115                 120                 125             


Glu Gln Thr Leu Thr Asp Lys Glu Arg Leu Ala Gly Leu Asp Leu Gly 
    130                 135                 140                 


Gln Leu Glu Gln Leu Val Gly Leu Val Glu Tyr Asp Gly Thr Arg Asp 
145                 150                 155                 160 


Pro Phe Pro Val Ser Gly Trp Asp Ala Val Val Trp Val Val Gly Asn 
                165                 170                 175     


Ala Thr Gln Thr Ala His Tyr Phe Gln Ser Ala Phe Gly Met Thr Leu 
            180                 185                 190         


Val Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp His His Ser Phe 
        195                 200                 205             


Val Leu Glu Ser Gly Ala Val Arg Phe Val Ile Gln Gly Ala Val Asp 
    210                 215                 220                 


Pro Gln Ser Pro Leu Ile Glu His His Arg Ala His Gly Asp Gly Val 
225                 230                 235                 240 


Val Asp Ile Ala Leu Ser Val Pro Asp Val Asp Lys Cys Ile Ala His 
                245                 250                 255     


Ala Arg Ala Gln Gly Ala Val Val Leu Asp Glu Pro His Asp Met Thr 
            260                 265                 270         


Asp Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala Thr Tyr Gly Asp 
        275                 280                 285             


Thr Arg His Thr Leu Val Asp Arg Thr His Tyr Thr Gly Pro Tyr Leu 
    290                 295                 300                 


Pro Gly Tyr Ile Ala Arg Thr Ser Thr His Thr Lys Arg Asp Gly Ala 
305                 310                 315                 320 


Pro Lys Arg Leu Phe Gln Ala Leu Asp His Val Val Gly Asn Val Glu 
                325                 330                 335     


Leu Gly Arg Met Asp His Trp Val Asp Phe Tyr Asn Arg Val Met Gly 
            340                 345                 350         


Phe Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile Ala Thr Asp Tyr 
        355                 360                 365             


Ser Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn His Arg Val Lys 
    370                 375                 380                 


Phe Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg Ser Gln Ile Asp 
385                 390                 395                 400 


Glu Tyr Leu Asp Phe Tyr Gln Gly Pro Gly Ala Gln His Leu Ala Leu 
                405                 410                 415     


Ala Thr Asn Asp Ile Leu Thr Ala Val Asp Arg Leu Thr Ala Glu Gly 
            420                 425                 430         


Val Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Gln Asp Pro Glu Leu 
        435                 440                 445             


Arg Ala Arg Ile Gly Asn Val Arg Ala Pro Ile Glu Glu Leu Gln Lys 
    450                 455                 460                 


Arg Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr Leu Leu Gln Ile 
465                 470                 475                 480 


Phe Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe Phe Glu Leu Ile 
                485                 490                 495     


Glu Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn Phe Lys Ala Leu 
            500                 505                 510         


Phe Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly Asn Phe 
        515                 520                 525     


<210>  22
<211>  525
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 16


<220>
<221>  TRANSIT
<222>  (1)..(124)
<223>  Optimized transit peptide to chloroplasts

<400>  22

Met Ala Ser Ile Ser Ser Ser Val Ala Thr Val Ser Arg Thr Ala Pro 
1               5                   10                  15      


Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu Lys Ser Asn Ala 
            20                  25                  30          


Ala Phe Pro Thr Thr Lys Lys Ala Asn Asp Phe Ser Thr Leu Pro Ser 
        35                  40                  45              


Asn Gly Gly Arg Val Gln Tyr Met Gln Val Trp Pro Ala Tyr Gly Asn 
    50                  55                  60                  


Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Pro Leu Ser Met Ala Pro 
65                  70                  75                  80  


Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro Phe Gln Gly 
                85                  90                  95      


Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser Ser Arg Ser 
            100                 105                 110         


Leu Gly Asn Val Ser Asn Gly Gly Arg Ile Arg Cys Met Thr Ile Glu 
        115                 120                 125             


Gln Thr Leu Thr Asp Lys Glu Arg Leu Ala Gly Leu Asp Leu Gly Gln 
    130                 135                 140                 


Leu Glu Gln Leu Val Gly Leu Val Glu Tyr Asp Gly Thr Arg Asp Pro 
145                 150                 155                 160 


Phe Pro Val Ser Gly Trp Asp Ala Val Val Trp Val Val Gly Asn Ala 
                165                 170                 175     


Thr Gln Thr Ala His Tyr Phe Gln Ser Ala Phe Gly Met Thr Leu Val 
            180                 185                 190         


Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp His His Ser Phe Val 
        195                 200                 205             


Leu Glu Ser Gly Ala Val Arg Phe Val Ile Gln Gly Ala Val Asp Pro 
    210                 215                 220                 


Gln Ser Pro Leu Ile Glu His His Arg Ala His Gly Asp Gly Val Val 
225                 230                 235                 240 


Asp Ile Ala Leu Ser Val Pro Asp Val Asp Lys Cys Ile Ala His Ala 
                245                 250                 255     


Arg Ala Gln Gly Ala Val Val Leu Asp Glu Pro His Asp Met Thr Asp 
            260                 265                 270         


Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala Thr Tyr Gly Asp Thr 
        275                 280                 285             


Arg His Thr Leu Val Asp Arg Thr His Tyr Thr Gly Pro Tyr Leu Pro 
    290                 295                 300                 


Gly Tyr Ile Ala Arg Thr Ser Thr His Thr Lys Arg Asp Gly Ala Pro 
305                 310                 315                 320 


Lys Arg Leu Phe Gln Ala Leu Asp His Val Val Gly Asn Val Glu Leu 
                325                 330                 335     


Gly Arg Met Asp His Trp Val Asp Phe Tyr Asn Arg Val Met Gly Phe 
            340                 345                 350         


Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile Ala Thr Asp Tyr Ser 
        355                 360                 365             


Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn His Arg Val Lys Phe 
    370                 375                 380                 


Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg Ser Gln Ile Asp Glu 
385                 390                 395                 400 


Tyr Leu Asp Phe Tyr Gln Gly Pro Gly Ala Gln His Leu Ala Leu Ala 
                405                 410                 415     


Thr Asn Asp Ile Leu Thr Ala Val Asp Arg Leu Thr Ala Glu Gly Val 
            420                 425                 430         


Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Gln Asp Pro Glu Leu Arg 
        435                 440                 445             


Ala Arg Ile Gly Asn Val Arg Ala Pro Ile Glu Glu Leu Gln Lys Arg 
    450                 455                 460                 


Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr Leu Leu Gln Ile Phe 
465                 470                 475                 480 


Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe Phe Glu Leu Ile Glu 
                485                 490                 495     


Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn Phe Lys Ala Leu Phe 
            500                 505                 510         


Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly Asn Phe 
        515                 520                 525 


<210>  23
<211>  1422
<212>  DNA
<213>  Arabidopsis thaliana

<400>  23
atgtgtctat cgttagcttc tacagctcaa cgaaacacac agttccgtag cagagtttta       60

gttttagcag agttggtgaa atcaatgggc caccaaaacg ccgccgtttc agagaatcaa      120

aaccatgatg acggcgctgc gtcgtcgccg ggattcaagc tcgtcggatt ttccaagttc      180

gtaagaaaga atccaaagtc tgataaattc aaggttaagc gcttccatca catcgagttc      240

tggtgcggcg acgcaaccaa cgtcgctcgt cgcttctcct ggggtctggg gatgagattc      300

tccgccaaat ccgatctttc caccggaaac atggttcacg cctcttacct actcacctcc      360

ggtgacctcc gattcctttt cactgctcct tactctccgt ctctctccgc cggagagatt      420

aaaccgacaa ccacagcttc tatcccaagt ttcgatcacg gctcttgtcg ttccttcttc      480

tcttcacatg gtctcggtgt tagagccgtt gcgattgaag tagaagacgc agagtcagct      540

ttctccatca gtgtagctaa tggcgctatt ccttcgtcgc ctcctatcgt cctcaatgaa      600

gcagttacga tcgctgaggt taaactatac ggcgatgttg ttctccgata tgttagttac      660

aaagcagaag ataccgaaaa atccgaattc ttgccagggt tcgagcgtgt agaggatgcg      720

tcgtcgttcc cattggatta tggtatccgg cggcttgacc acgccgtggg aaacgttcct      780

gagcttggtc cggctttaac ttatgtagcg gggttcactg gttttcacca attcgcagag      840

ttcacagcag acgacgttgg aaccgccgag agcggtttaa attcagcggt cctggctagc      900

aatgatgaaa tggttcttct accgattaac gagccagtgc acggaacaaa gaggaagagt      960

cagattcaga cgtatttgga acataacgaa ggcgcagggc tacaacatct ggctctgatg     1020

agtgaagaca tattcaggac cctgagagag atgaggaaga ggagcagtat tggaggattc     1080

gacttcatgc cttctcctcc gcctacttac taccagaatc tcaagaaacg ggtcggcgac     1140

gtgctcagcg atgatcagat caaggagtgt gaggaattag ggattcttgt agacagagat     1200

gatcaaggga cgttgcttca aatcttcaca aaaccactag gtgacaggcc gacgatattt     1260

atagagataa tccagagagt aggatgcatg atgaaagatg aggaagggaa ggcttaccag     1320

agtggaggat gtggtggttt tggcaaaggc aatttctctg agctcttcaa gtccattgaa     1380

gaatacgaaa agactcttga agccaaacag ttagtgggat ga                        1422


<210>  24
<211>  473
<212>  PRT
<213>  Arabidopsis thaliana

<400>  24

Met Cys Leu Ser Leu Ala Ser Thr Ala Gln Arg Asn Thr Gln Phe Arg 
1               5                   10                  15      


Ser Arg Val Leu Val Leu Ala Glu Leu Val Lys Ser Met Gly His Gln 
            20                  25                  30          


Asn Ala Ala Val Ser Glu Asn Gln Asn His Asp Asp Gly Ala Ala Ser 
        35                  40                  45              


Ser Pro Gly Phe Lys Leu Val Gly Phe Ser Lys Phe Val Arg Lys Asn 
    50                  55                  60                  


Pro Lys Ser Asp Lys Phe Lys Val Lys Arg Phe His His Ile Glu Phe 
65                  70                  75                  80  


Trp Cys Gly Asp Ala Thr Asn Val Ala Arg Arg Phe Ser Trp Gly Leu 
                85                  90                  95      


Gly Met Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr Gly Asn Met Val 
            100                 105                 110         


His Ala Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg Phe Leu Phe Thr 
        115                 120                 125             


Ala Pro Tyr Ser Pro Ser Leu Ser Ala Gly Glu Ile Lys Pro Thr Thr 
    130                 135                 140                 


Thr Ala Ser Ile Pro Ser Phe Asp His Gly Ser Cys Arg Ser Phe Phe 
145                 150                 155                 160 


Ser Ser His Gly Leu Gly Val Arg Ala Val Ala Ile Glu Val Glu Asp 
                165                 170                 175     


Ala Glu Ser Ala Phe Ser Ile Ser Val Ala Asn Gly Ala Ile Pro Ser 
            180                 185                 190         


Ser Pro Pro Ile Val Leu Asn Glu Ala Val Thr Ile Ala Glu Val Lys 
        195                 200                 205             


Leu Tyr Gly Asp Val Val Leu Arg Tyr Val Ser Tyr Lys Ala Glu Asp 
    210                 215                 220                 


Thr Glu Lys Ser Glu Phe Leu Pro Gly Phe Glu Arg Val Glu Asp Ala 
225                 230                 235                 240 


Ser Ser Phe Pro Leu Asp Tyr Gly Ile Arg Arg Leu Asp His Ala Val 
                245                 250                 255     


Gly Asn Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr Val Ala Gly Phe 
            260                 265                 270         


Thr Gly Phe His Gln Phe Ala Glu Phe Thr Ala Asp Asp Val Gly Thr 
        275                 280                 285             


Ala Glu Ser Gly Leu Asn Ser Ala Val Leu Ala Ser Asn Asp Glu Met 
    290                 295                 300                 


Val Leu Leu Pro Ile Asn Glu Pro Val His Gly Thr Lys Arg Lys Ser 
305                 310                 315                 320 


Gln Ile Gln Thr Tyr Leu Glu His Asn Glu Gly Ala Gly Leu Gln His 
                325                 330                 335     


Leu Ala Leu Met Ser Glu Asp Ile Phe Arg Thr Leu Arg Glu Met Arg 
            340                 345                 350         


Lys Arg Ser Ser Ile Gly Gly Phe Asp Phe Met Pro Ser Pro Pro Pro 
        355                 360                 365             


Thr Tyr Tyr Gln Asn Leu Lys Lys Arg Val Gly Asp Val Leu Ser Asp 
    370                 375                 380                 


Asp Gln Ile Lys Glu Cys Glu Glu Leu Gly Ile Leu Val Asp Arg Asp 
385                 390                 395                 400 


Asp Gln Gly Thr Leu Leu Gln Ile Phe Thr Lys Pro Leu Gly Asp Arg 
                405                 410                 415     


Pro Thr Ile Phe Ile Glu Ile Ile Gln Arg Val Gly Cys Met Met Lys 
            420                 425                 430         


Asp Glu Glu Gly Lys Ala Tyr Gln Ser Gly Gly Cys Gly Gly Phe Gly 
        435                 440                 445             


Lys Gly Asn Phe Ser Glu Leu Phe Lys Ser Ile Glu Glu Tyr Glu Lys 
    450                 455                 460                 


Thr Leu Glu Ala Lys Gln Leu Val Gly 
465                 470             


<210>  25
<211>  1353
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Arabidopsis thaliana HPPD, 
       containing at the 5' end a nucleic acid encoding an alanine and 6
       histidine amino acids


<220>
<221>  misc_feature
<222>  (4)..(6)
<223>  Nucleic sequence coding for Ala

<220>
<221>  misc_feature
<222>  (7)..(24)
<223>  Nucleic sequence coding for 6 His

<400>  25
atggctcatc accatcacca tcaccaaaac gccgccgttt cagagaatca aaaccatgat       60

gacggcgctg cgtcgtcgcc gggattcaag ctcgtcggat tttccaagtt cgtaagaaag      120

aatccaaagt ctgataaatt caaggttaag cgcttccatc acatcgagtt ctggtgcggc      180

gacgcaacca acgtcgctcg tcgcttctcc tggggtctgg ggatgagatt ctccgccaaa      240

tccgatcttt ccaccggaaa catggttcac gcctcttacc tactcacctc cggtgacctc      300

cgattccttt tcactgctcc ttactctccg tctctctccg ccggagagat taaaccgaca      360

accacagctt ctatcccaag tttcgatcac ggctcttgtc gttccttctt ctcgtcacat      420

ggtctcggtg ttagagccgt tgcgattgaa gtagaagacg cagagtcagc tttctccatc      480

agtgtagcta atggcgctat tccttcgtcg cctcctatcg tcctcaatga agcagttacg      540

atcgctgagg ttaaactata cggcgatgtt gttctccgat atgttagtta caaagcagaa      600

gataccgaaa aatccgaatt cttgccaggg ttcgagcgtg tagaggatgc gtcgtcgttc      660

ccattggatt atggtatccg gcggcttgac cacgccgtgg gaaacgttcc tgagcttggt      720

ccggctttaa cttatgtagc ggggttcact ggttttcacc aattcgcaga gttcacagca      780

gacgacgttg gaaccgccga gagcggttta aattcagcgg tcctggctag caatgatgaa      840

atggttcttc taccgattaa cgagccagtg cacggaacaa agaggaagag tcagattcag      900

acgtatttgg aacataacga aggcgcaggg ctacaacatc tggctctgat gagtgaagac      960

atattcagga ccctgagaga gatgaggaag aggagcagta ttggaggatt cgacttcatg     1020

ccttctcctc cgcctactta ctaccagaat ctcaagaaac gggtcggcga cgtgctcagc     1080

gatgatcaga tcaaggagtg tgaggaatta gggattcttg tagacagaga tgatcaaggg     1140

acgttgcttc aaatcttcac aaaaccacta ggtgacaggc cgacgatatt tatagagata     1200

atccagagag taggatgcat gatgaaagat gaggaaggga aggcttacca gagtggagga     1260

tgtggtggtt ttggcaaagg caatttctct gagctcttca agtccattga agaatacgaa     1320

aagactcttg aagccaaaca gttagtggga tga                                  1353


<210>  26
<211>  450
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 25


<220>
<221>  MISC_FEATURE
<222>  (2)..(2)
<223>  Ala

<220>
<221>  MISC_FEATURE
<222>  (3)..(8)
<223>  6 His

<400>  26

Met Ala His His His His His His Gln Asn Ala Ala Val Ser Glu Asn 
1               5                   10                  15      


Gln Asn His Asp Asp Gly Ala Ala Ser Ser Pro Gly Phe Lys Leu Val 
            20                  25                  30          


Gly Phe Ser Lys Phe Val Arg Lys Asn Pro Lys Ser Asp Lys Phe Lys 
        35                  40                  45              


Val Lys Arg Phe His His Ile Glu Phe Trp Cys Gly Asp Ala Thr Asn 
    50                  55                  60                  


Val Ala Arg Arg Phe Ser Trp Gly Leu Gly Met Arg Phe Ser Ala Lys 
65                  70                  75                  80  


Ser Asp Leu Ser Thr Gly Asn Met Val His Ala Ser Tyr Leu Leu Thr 
                85                  90                  95      


Ser Gly Asp Leu Arg Phe Leu Phe Thr Ala Pro Tyr Ser Pro Ser Leu 
            100                 105                 110         


Ser Ala Gly Glu Ile Lys Pro Thr Thr Thr Ala Ser Ile Pro Ser Phe 
        115                 120                 125             


Asp His Gly Ser Cys Arg Ser Phe Phe Ser Ser His Gly Leu Gly Val 
    130                 135                 140                 


Arg Ala Val Ala Ile Glu Val Glu Asp Ala Glu Ser Ala Phe Ser Ile 
145                 150                 155                 160 


Ser Val Ala Asn Gly Ala Ile Pro Ser Ser Pro Pro Ile Val Leu Asn 
                165                 170                 175     


Glu Ala Val Thr Ile Ala Glu Val Lys Leu Tyr Gly Asp Val Val Leu 
            180                 185                 190         


Arg Tyr Val Ser Tyr Lys Ala Glu Asp Thr Glu Lys Ser Glu Phe Leu 
        195                 200                 205             


Pro Gly Phe Glu Arg Val Glu Asp Ala Ser Ser Phe Pro Leu Asp Tyr 
    210                 215                 220                 


Gly Ile Arg Arg Leu Asp His Ala Val Gly Asn Val Pro Glu Leu Gly 
225                 230                 235                 240 


Pro Ala Leu Thr Tyr Val Ala Gly Phe Thr Gly Phe His Gln Phe Ala 
                245                 250                 255     


Glu Phe Thr Ala Asp Asp Val Gly Thr Ala Glu Ser Gly Leu Asn Ser 
            260                 265                 270         


Ala Val Leu Ala Ser Asn Asp Glu Met Val Leu Leu Pro Ile Asn Glu 
        275                 280                 285             


Pro Val His Gly Thr Lys Arg Lys Ser Gln Ile Gln Thr Tyr Leu Glu 
    290                 295                 300                 


His Asn Glu Gly Ala Gly Leu Gln His Leu Ala Leu Met Ser Glu Asp 
305                 310                 315                 320 


Ile Phe Arg Thr Leu Arg Glu Met Arg Lys Arg Ser Ser Ile Gly Gly 
                325                 330                 335     


Phe Asp Phe Met Pro Ser Pro Pro Pro Thr Tyr Tyr Gln Asn Leu Lys 
            340                 345                 350         


Lys Arg Val Gly Asp Val Leu Ser Asp Asp Gln Ile Lys Glu Cys Glu 
        355                 360                 365             


Glu Leu Gly Ile Leu Val Asp Arg Asp Asp Gln Gly Thr Leu Leu Gln 
    370                 375                 380                 


Ile Phe Thr Lys Pro Leu Gly Asp Arg Pro Thr Ile Phe Ile Glu Ile 
385                 390                 395                 400 


Ile Gln Arg Val Gly Cys Met Met Lys Asp Glu Glu Gly Lys Ala Tyr 
                405                 410                 415     


Gln Ser Gly Gly Cys Gly Gly Phe Gly Lys Gly Asn Phe Ser Glu Leu 
            420                 425                 430         


Phe Lys Ser Ile Glu Glu Tyr Glu Lys Thr Leu Glu Ala Lys Gln Leu 
        435                 440                 445             


Val Gly 
    450 


<210>  27
<211>  1712
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleid acid sequence encoding Arabidopsis thaliana HPPD 
       containing at the 5' end a nucleic acid encoding an OTP 
       (Optimized Tansit Peptide; SEQ ID No. 13


<220>
<221>  transit_peptide
<222>  (1)..(375)
<223>  Nucleic acid coding for optimized transit peptide to chloroplasts

<400>  27
atggcttcga tctcctcctc agtcgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatgtatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

agaatccggt gcgccatgca aaacgccgcc gtttcagaga atcaaaacca tgatgacggc      420

gctgcgtcgt cgccgggatt caagctcgtc ggattttcca agttcgtaag aaagaatcca      480

aagtctgata aattcaaggt taagcgcttc catcacatcg agttctggtg cggcgacgca      540

accaacgtcg ctcgtcgctt ctcctggggt ctggggatga gattctccgc caaatccgat      600

ctttccaccg gaaacatggt tcacgcctct tacctactca cctccggtga cctccgattc      660

cttttcactg ctccttactc tccgtctctc tccgccggag agattaaacc gacaaccaca      720

gcttctatcc caagtttcga tcacggctct tgtcgttcct tcttctcgtc acatggtctc      780

ggtgttagag ccgttgcgat tgaagtagaa gacgcagagt cagctttctc catcagtgta      840

gctaatggcg ctattccttc gtcgcctcct atcgtcctca atgaagcagt tacgatcgct      900

gaggttaaac tatacggcga tgttgttctc cgatatgtta gttacaaagc agaagatacc      960

gaaaaatccg aattcttgcc agggttcgag cgtgtagagg atgcgtcgtc gttcccattg     1020

gattatggta tccggcggct tgaccacgcc gtgggaaacg ttcctgagct tggtccggct     1080

ttaacttatg tagcggggtt cactggtttt caccaattcg cagagttcac agcagacgac     1140

gttggaaccg ccgagagcgg tttaaattca gcggtcctgg ctagcaatga tgaaatggtt     1200

cttctaccga ttaacgagcc agtgcacgga acaaagagga agagtcagat tcagacgtat     1260

ttggaacata acgaaggcgc agggctacaa catctggctc tgatgagtga agacatattc     1320

aggaccctga gagagatgag gaagaggagc agtattggag gattcgactt catgccttct     1380

cctccgccta cttactacca gaatctcaag aaacgggtcg gcgacgtgct cagcgatgat     1440

cagatcaagg agtgtgagga attagggatt cttgtagaca gagatgatca agggacgttg     1500

cttcaaatct tcacaaaacc actaggtgac aggccgacga tatttataga gataatccag     1560

agagtaggat gcatgatgaa agatgaggaa gggaaggctt accagagtgg aggatgtggt     1620

ggttttggca aaggcaattt ctctgagctc ttcaagtcca ttgaagaata cgaaaagact     1680

cttgaagcca aacagttagt gggatgatct ag                                   1712


<210>  28
<211>  569
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein of SEQ ID No. 24 plus the OTP sequence (SEQ ID No. 14) 
       located at the N-terminal extremity of the protein


<220>
<221>  TRANSIT
<222>  (1)..(125)
<223>  Optimized transit peptide to chloroplasts

<400>  28

Met Ala Ser Ile Ser Ser Ser Val Ala Thr Val Ser Arg Thr Ala Pro 
1               5                   10                  15      


Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu Lys Ser Asn Ala 
            20                  25                  30          


Ala Phe Pro Thr Thr Lys Lys Ala Asn Asp Phe Ser Thr Leu Pro Ser 
        35                  40                  45              


Asn Gly Gly Arg Val Gln Cys Met Gln Val Trp Pro Ala Tyr Gly Asn 
    50                  55                  60                  


Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Pro Leu Ser Met Ala Pro 
65                  70                  75                  80  


Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro Phe Gln Gly 
                85                  90                  95      


Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser Ser Arg Ser 
            100                 105                 110         


Leu Gly Asn Val Ser Asn Gly Gly Arg Ile Arg Cys Ala Met Gln Asn 
        115                 120                 125             


Ala Ala Val Ser Glu Asn Gln Asn His Asp Asp Gly Ala Ala Ser Ser 
    130                 135                 140                 


Pro Gly Phe Lys Leu Val Gly Phe Ser Lys Phe Val Arg Lys Asn Pro 
145                 150                 155                 160 


Lys Ser Asp Lys Phe Lys Val Lys Arg Phe His His Ile Glu Phe Trp 
                165                 170                 175     


Cys Gly Asp Ala Thr Asn Val Ala Arg Arg Phe Ser Trp Gly Leu Gly 
            180                 185                 190         


Met Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr Gly Asn Met Val His 
        195                 200                 205             


Ala Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg Phe Leu Phe Thr Ala 
    210                 215                 220                 


Pro Tyr Ser Pro Ser Leu Ser Ala Gly Glu Ile Lys Pro Thr Thr Thr 
225                 230                 235                 240 


Ala Ser Ile Pro Ser Phe Asp His Gly Ser Cys Arg Ser Phe Phe Ser 
                245                 250                 255     


Ser His Gly Leu Gly Val Arg Ala Val Ala Ile Glu Val Glu Asp Ala 
            260                 265                 270         


Glu Ser Ala Phe Ser Ile Ser Val Ala Asn Gly Ala Ile Pro Ser Ser 
        275                 280                 285             


Pro Pro Ile Val Leu Asn Glu Ala Val Thr Ile Ala Glu Val Lys Leu 
    290                 295                 300                 


Tyr Gly Asp Val Val Leu Arg Tyr Val Ser Tyr Lys Ala Glu Asp Thr 
305                 310                 315                 320 


Glu Lys Ser Glu Phe Leu Pro Gly Phe Glu Arg Val Glu Asp Ala Ser 
                325                 330                 335     


Ser Phe Pro Leu Asp Tyr Gly Ile Arg Arg Leu Asp His Ala Val Gly 
            340                 345                 350         


Asn Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr Val Ala Gly Phe Thr 
        355                 360                 365             


Gly Phe His Gln Phe Ala Glu Phe Thr Ala Asp Asp Val Gly Thr Ala 
    370                 375                 380                 


Glu Ser Gly Leu Asn Ser Ala Val Leu Ala Ser Asn Asp Glu Met Val 
385                 390                 395                 400 


Leu Leu Pro Ile Asn Glu Pro Val His Gly Thr Lys Arg Lys Ser Gln 
                405                 410                 415     


Ile Gln Thr Tyr Leu Glu His Asn Glu Gly Ala Gly Leu Gln His Leu 
            420                 425                 430         


Ala Leu Met Ser Glu Asp Ile Phe Arg Thr Leu Arg Glu Met Arg Lys 
        435                 440                 445             


Arg Ser Ser Ile Gly Gly Phe Asp Phe Met Pro Ser Pro Pro Pro Thr 
    450                 455                 460                 


Tyr Tyr Gln Asn Leu Lys Lys Arg Val Gly Asp Val Leu Ser Asp Asp 
465                 470                 475                 480 


Gln Ile Lys Glu Cys Glu Glu Leu Gly Ile Leu Val Asp Arg Asp Asp 
                485                 490                 495     


Gln Gly Thr Leu Leu Gln Ile Phe Thr Lys Pro Leu Gly Asp Arg Pro 
            500                 505                 510         


Thr Ile Phe Ile Glu Ile Ile Gln Arg Val Gly Cys Met Met Lys Asp 
        515                 520                 525             


Glu Glu Gly Lys Ala Tyr Gln Ser Gly Gly Cys Gly Gly Phe Gly Lys 
    530                 535                 540                 


Gly Asn Phe Ser Glu Leu Phe Lys Ser Ile Glu Glu Tyr Glu Lys Thr 
545                 550                 555                 560 


Leu Glu Ala Lys Gln Leu Val Gly Ser 
                565                 


<210>  29
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for Zea mays plants

<400>  29
atgaccatcg aacagactct gactgataag gagcggctcg ccgggctgga cctcggacaa       60

cttgaacagc ttgtggggct ggtcgagtac gacggcacga gggacccgtt cccggtgagc      120

ggctgggacg ccgtcgtgtg ggtcgtaggt aacgccacgc agactgccca ctacttccaa      180

tccgcgtttg gcatgaccct cgtcgcgtat tcgggaccga ccactggcaa tagggatcat      240

cactcgttcg tgctcgagtc aggcgcggta cgctttgtga tccagggtgc cgtggatcca      300

cagagtcctc tcatcgagca ccaccgcgcc cacggggatg gagtggtcga tatcgccctg      360

tccgttccgg acgtagacaa gtgcattgcc catgcacggg cgcaaggagc cgtggtgctg      420

gacgagcctc atgatatgac agatgagcac ggaactgtgc gactcgccgc tattgcaaca      480

tatggggaca cacgccacac tctggtagat cgcacgcact acacaggccc ctacctcccg      540

ggttacattg caaggacgtc aacacacact aagagggatg gggcgcccaa gaggctcttc      600

caggccttgg accacgtagt gggcaacgtg gagctcggta ggatggatca ttgggtggac      660

ttttacaata gggtcatggg cttcacaaac atggctgagt tcgttggtga ggacatcgcg      720

accgactact cggctctaat gagcaaggtg gtgtccaacg gtaaccatag ggtgaagttc      780

cctctgaacg aacccgccat cgctaagaag cggtcgcaga ttgatgaata cctcgacttt      840

tatcaaggcc caggggcgca gcacctggcg ctggctacta acgacatact aactgctgtg      900

gatcgcttga cagctgaggg tgttgagttc cttgctacgc cagattccta ctatcaagat      960

cctgagctcc gtgcccgcat cggcaacgtc agggccccga tcgaagagct tcaaaagcgc     1020

ggaatcttgg tcgatcgcga cgaagacggc tatttactcc aaatcttcac aaaaccgctt     1080

gtggaccggc caaccgtgtt ttttgaactg atcgagcgac acggctctct cggcttcggg     1140

atcggcaatt tcaaagccct ctttgaagct atcgaacggg agcaggctgc acgtggcaac     1200

ttctag                                                                1206


<210>  30
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for Brassica napus plants

<400>  30
atgaccattg agcagacttt aacagataag gagagactcg ccgggctgga cctcggtcag       60

ttagagcagc tagttgggct cgttgagtac gatggaacta gggatccatt tccggtgagt      120

ggatgggacg ctgttgtttg ggttgtcgga aacgcaactc aaacagccca ttacttccaa      180

tcagctttcg gaatgacact ggtcgcgtac tccggaccca cgactggaaa cagagatcat      240

cactcattcg tcctcgaatc gggtgcagtg cgttttgtga tccagggcgc cgtagaccct      300

caaagcccgc tgatagagca ccatcgtgct catggagacg gcgtcgtgga tattgcatta      360

tctgtaccag acgttgataa atgcatagct catgctagag cacaaggagc agtcgtttta      420

gatgagcctc atgatatgac ggacgagcac ggaacggtta gacttgcggc catagcgacg      480

tacggcgata cacgtcacac attagtggat cgtactcact atactgggcc gtatttacca      540

ggctacatcg caaggacatc cacacacacc aaaagagatg gagcccccaa aaggcttttc      600

caagctttag atcatgttgt cggaaatgtg gaactcggac gtatggacca ttgggtggac      660

ttctataacc gagtgatggg ttttactaat atggcagagt ttgttggaga ggatatcgcc      720

actgactact ctgcactgat gtcaaaagtc gtgtcgaatg gcaaccatag agtaaagttc      780

ccactgaatg aacctgccat cgctaagaag agatcgcaga ttgatgagta cctcgacttc      840

tatcagggtc cgggtgcgca gcatttagct ctggccacta acgatattct tacggcagtt      900

gacaggctaa ctgctgaagg agttgaattc ctggctacgc cggattctta ctaccaggac      960

ccggagctca gggccagaat tggaaatgtt agagccccga tcgaggaact tcaaaagcgt     1020

gggattttag ttgatcgtga tgaagatggt tatttgctcc aaatcttcac gaaaccactt     1080

gtagatcggc ccacagtctt ctttgagctg atcgaaaggc atgggtcact agggttcgga     1140

ataggaaact tcaaagctct attcgaggcc atcgaaagag agcaggctgc tagaggcaac     1200

ttttag                                                                1206


<210>  31
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for Beta vulgaris plants

<400>  31
atgactatag agcaaacact cacagataag gaaagacttg ctgggcttga cctgggccag       60

cttgagcaac tggtcgggct agttgagtac gatggtacaa gggatccatt cccagtgagt      120

ggttgggacg cggtggtttg ggttgtcgga aatgctactc agactgccca ttactttcaa      180

tcagcattcg gaatgaccct tgttgcatac tcggggccaa ccaccggtaa tagggatcac      240

cacagcttcg tattggagtc tggggctgtg aggtttgtta tacaaggggc agtggatcct      300

cagagcccac tcatcgagca tcaccgagca catggagatg gtgttgtgga cattgctctt      360

agcgtgccag atgtagacaa gtgcatagct catgctaggg ctcaaggcgc tgttgtgctc      420

gacgagcctc atgatatgac cgatgaacat ggtacggtaa gacttgcagc tattgccact      480

tacggtgata ctaggcacac cctagtcgat aggacccatt acacggggcc gtatttgcca      540

gggtacattg cacgaacttc gactcacacc aaacgggacg gtgctcctaa gaggttattt      600

caggccttgg atcatgtggt tgggaatgtt gagcttggac ggatggatca ttgggttgac      660

ttttacaata gggtgatggg attcactaat atggctgaat tcgttgggga ggatattgcc      720

acagattata gtgcattaat gtctaaagtt gtgtccaatg gtaatcacag agtgaagttc      780

ccacttaacg agcccgctat cgccaaaaaa cgatctcaaa tagatgaata cctggatttc      840

tatcagggcc caggtgctca acatttggca ttggctacca atgacattct cacagccgtt      900

gatcgtttga ctgcagaagg agtggaattc cttgctaccc ctgattccta ctaccaagac      960

ccagaactta gggcacggat tggtaacgtt agagctccaa ttgaagagct tcaaaaaaga     1020

gggatccttg ttgaccggga cgaggacgga tatttgctcc aaattttcac aaagccgctg     1080

gtggatagac caactgtttt cttcgaattg attgaacgtc atggctcatt aggcttcggc     1140

atcggtaact ttaaagctct ttttgaggca atcgaaagag agcaagcggc acgcggcaat     1200

ttctag                                                                1206


<210>  32
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for Gossypium hirsutum plants

<400>  32
atgacgattg agcagactct aacagataaa gaacgtctcg ctggccttga tcttgggcaa       60

ctcgaacagt tggtgggact cgttgaatac gatggaaccc gagatccatt ccctgttagc      120

ggttgggatg ctgtagtctg ggttgtcgga aacgcaaccc aaaccgccca ttacttccaa      180

agcgcattcg ggatgactct cgtggcctac tccggtccta ctactggaaa cagagaccac      240

cattcctttg ttctcgagtc tggggctgtt aggttcgtga ttcagggagc tgtggatcct      300

caatccccgc ttatcgaaca ccacagagct cacggggacg gagttgttga tatcgctctt      360

tcggtgccag atgttgataa gtgcatcgca cacgcacggg ctcaaggtgc tgttgtgttg      420

gatgaaccac atgatatgac cgatgaacat gggactgtca ggttagcggc cattgctaca      480

tatggtgaca cgcgtcacac tcttgtggac aggactcatt acactggtcc ctatctccca      540

ggatacatcg ctcgaaccag tactcatacc aagagagatg gagcacctaa aagattattc      600

caagccctgg atcatgtggt cgggaacgta gaattgggac gcatggatca ttgggttgat      660

ttctataacc gtgttatggg ttttactaat atggccgaat tcgttggtga ggatattgct      720

actgactact ccgctcttat gtccaaggtc gtgtcgaacg gaaaccatag ggtaaaattc      780

ccccttaacg agccagcaat cgctaaaaag cgtagtcaaa tcgatgaata cctcgatttc      840

tatcaaggac ctggcgcgca acatttggca ctcgccacca acgacatttt gactgctgtg      900

gatcgattaa ctgcggaggg cgtggaattt ttggcgactc ctgattccta ttatcaagat      960

ccagagcttc gtgctagaat tggcaatgtt agagccccaa ttgaagaatt gcagaagagg     1020

ggaatcttgg ttgaccgaga cgaggatggg tacttgcttc agatctttac aaagcctctc     1080

gttgatcgac ctaccgtgtt ctttgaatta attgaacgtc atggaagttt aggctttgga     1140

attggcaatt ttaaagcttt attcgaagct atagaacgag agcaggctgc tagaggaaac     1200

ttctag                                                                1206


<210>  33
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for Glycine max plants

<400>  33
atgacaattg agcaaaccct gacagataag gaaaggcttg ccggcctgga ccttggccaa       60

ctagagcagt tagtcgggct agtagaatat gatggaacta gagacccttt tcctgtttct      120

ggctgggatg ccgtggtctg ggtggtaggg aacgcaactc aaacggccca ttacttccag      180

agtgcattcg ggatgaccct ggttgcttat agcgggccga ctactgggaa tcgagaccat      240

catagttttg ttcttgagag tggcgcggtg aggttcgtga ttcaaggtgc tgtcgacccc      300

caatctcctc ttattgagca tcatagggcg catggtgacg gcgttgtcga tattgctctt      360

tctgtacctg acgtcgacaa atgcattgca catgcccgag cacagggggc agtcgtcttg      420

gatgagcctc atgatatgac cgacgaacat ggaaccgtgc gtttggctgc cattgctacc      480

tatggtgaca ctaggcatac actggtggac agaacgcact acactggccc ttatctgcca      540

ggctatatag ctaggacaag cacccacacc aaacgcgatg gagcacctaa gcgccttttt      600

caggcgttgg accatgtagt cggtaacgtg gaactcggta ggatggacca ctgggtcgac      660

ttttacaacc gtgtgatggg ttttaccaac atggcagaat tcgtgggcga ggatattgca      720

accgattatt ccgccttgat gtccaaggtt gtctcgaacg gcaaccacag agtcaagttt      780

cctcttaatg aaccggcgat cgcaaagaaa aggtcccaga ttgatgaata ccttgatttc      840

tatcagggcc ctggagcaca acacctggct ctcgctacca atgatatttt aacggctgtg      900

gataggctta ccgcagaggg agtggaattc cttgccactc ccgactcata ctatcaggat      960

cccgaactaa gagcgaggat cggtaatgtt cgtgcaccca ttgaggagct tcaaaagagg     1020

ggtatccttg tcgaccgaga tgaggatgga tacctattgc aaatctttac caagcctcta     1080

gttgaccggc ctaccgtttt cttcgaactt atagagagac acggatctct tggtttcggt     1140

ataggaaatt ttaaggcact cttcgaagct atcgaaagag agcaagcagc caggggaaat     1200

ttctag                                                                1206


<210>  34
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for Glycine max plants

<400>  34
atgacgatcg agcagacctt aaccgataaa gagcggctag ccgggcttga cctcggccag       60

ttagagcaac tggtgggcct ggtggaatac gacggtacac gcgatccgtt ccctgtgtct      120

ggttgggatg cggtagtttg ggttgttggc aatgccaccc aaactgcaca ctatttccaa      180

tctgcgtttg gcatgaccct cgtggcctat agcggcccga ctaccggcaa tagggaccac      240

cattccttcg tcttggagag tggcgcggtt cggttcgtca tccagggcgc ggtggaccct      300

cagtcaccgc tgatcgagca tcacagagcc catggggatg gtgtggtcga tatcgcactc      360

tccgttccag atgtggacaa gtgtatcgct cacgctaggg cgcagggcgc cgttgtccta      420

gacgagcctc acgatatgac agacgagcac ggcacagttc ggttggccgc tatcgctacc      480

tacggcgata ctaggcatac cttggttgac cgcacgcact ataccgggcc atatctgcca      540

ggctatattg cccgaacgag cacccacacg aagagggatg gcgctccgaa gcgcctcttt      600

caagcgctcg accatgtggt gggcaacgtg gagctcggca ggatggacca ctgggtggac      660

ttttacaaca gggtcatggg ctttaccaac atggccgagt tcgttggtga ggacatcgcg      720

accgactata gcgcccttat gtccaaggtt gtgagcaatg ggaaccatcg ggtgaaattt      780

cccctgaatg agcccgcgat tgctaaaaag aggagccaaa tcgacgaata tctggacttc      840

taccaggggc cgggggctca acatttggct ctcgccacaa atgacatttt gacagctgtc      900

gatcgcctaa ctgctgaggg cgtcgagttc cttgcgacac cggactcata ctaccaggac      960

cccgaactcc gcgcccggat tggtaacgtt agagccccca tcgaagagct ccagaaacgg     1020

ggcatccttg tcgatcgtga cgaagatggc tacctcctgc agatattcac gaagcccctc     1080

gtggatcggc ccactgtgtt cttcgagcta atcgagagac acggctcgtt gggcttcggc     1140

atcggcaatt tcaaggcgct tttcgaggct atcgagagag agcaggccgc gaggggtaat     1200

ttctag                                                                1206


<210>  35
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for Oryza sativa plants

<400>  35
atgacaatcg agcaaacact gacagacaag gaaaggctcg ccgggttaga tctcggccag       60

ctggaacagc ttgtcgggct cgttgaatac gacgggaccc gggacccttt cccggtctcc      120

gggtgggacg cagtggtctg ggttgtcggc aacgccaccc agacggcgca ttatttccag      180

agcgcgttcg ggatgacgct agtggcttat agcggcccga caacggggaa ccgggatcat      240

cattcgtttg tactcgagtc tggtgctgtt cggttcgtta tccagggagc tgttgacccg      300

cagtcgcccc tcatagaaca ccaccgcgcg catggggacg gagtggtaga catcgcgttg      360

agcgttccag acgtggataa gtgcattgcc cacgccagag cgcaaggtgc agtcgtcctg      420

gatgaacctc atgacatgac agacgaacac ggaactgtca gattggccgc tattgccacg      480

tacggcgaca cccggcacac actggtggat aggactcact acactggccc atacctaccg      540

ggctacattg ctcggacctc cactcacact aagagggacg gggcccccaa gcgcttgttt      600

caagccctcg atcatgtagt cggtaacgtc gaactgggcc gcatggatca ttgggtagac      660

ttttataacc gcgtgatggg atttaccaat atggcggagt tcgtcggcga ggacatcgca      720

accgactaca gcgctctgat gtccaaggtt gtgagcaacg gaaatcaccg ggttaagttc      780

ccgttgaacg agccagcgat tgccaagaaa cgctcacaaa tcgacgagta cttagatttt      840

taccaaggtc ctggtgcaca acacctcgcg ctcgcgacta acgacatcct gacagcagtc      900

gatcggttga cagccgaagg agtcgagttt ctggccactc cggattcgta ctaccaagat      960

ccggaactta gagctagaat cggcaacgtt cgcgccccga tcgaggagct ccagaagagg     1020

ggaatactgg tggacaggga cgaggatgga taccttttgc aaattttcac gaagccgttg     1080

gtggatcggc caacagtgtt tttcgagctc atcgagcgtc atggaagtct gggcttcggc     1140

atcggcaact tcaaggctct gttcgaagct atcgagcggg aacaggctgc gagaggtaat     1200

ttctag                                                                1206


<210>  36
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for Triticum aestivum plants

<400>  36
atgactatag agcagaccct aaccgataag gaacgactcg cgggattaga ccttggccaa       60

ctggagcaac tcgtgggact tgttgaatat gatgggacca gagacccgtt cccggtgtct      120

ggctgggacg ccgttgtgtg ggtcgtgggc aacgcaacac agactgcgca ctacttccag      180

tccgcattcg gcatgacgct ggtggcttac agtggtccga ctaccggcaa tagggaccac      240

cactcttttg tactagagag cggcgcagtg cgttttgtta ttcagggcgc agttgacccg      300

cagtcccccc taatcgagca tcacagagcg catggcgatg gtgtcgttga catcgccctt      360

agcgttccag atgtcgacaa gtgcatcgcc catgccaggg cgcagggagc ggtcgtttta      420

gatgagcccc acgacatgac ggatgagcac gggaccgtca ggctcgctgc gatcgccacg      480

tacggggaca ccagacatac cctggtggac aggacccact acacgggacc atacctccct      540

ggctacatcg ccaggacctc aacacacacc aagagagacg gggcaccgaa gcggttattc      600

caggcgctgg accacgtggt ggggaacgtt gagctgggac gtatggacca ttgggtcgat      660

ttctacaacc gggtgatggg tttcacgaac atggccgagt tcgtcggcga ggacatcgca      720

acggactact ccgcgctcat gagcaaagtg gtttccaatg ggaaccacag agtgaagttc      780

cctctgaacg aaccggccat tgccaaaaag aggtcccaga tcgatgagta cctggacttc      840

tatcaagggc cgggcgctca acacctcgca cttgccacga atgacatcct aacggcggtg      900

gaccggctta ctgctgaggg agtcgaattc ctagccaccc cagactcgta ctaccaagac      960

ccagaactga gggcccgcat cgggaacgtc agagcaccta ttgaagagct acagaagcgc     1020

ggcattctcg ttgatcggga tgaggatggc tacctcctgc agattttcac taagccttta     1080

gtcgatcgac caaccgtgtt tttcgagctg atcgagagac acggatccct gggattcgga     1140

attgggaact tcaaggctct gttcgaggcc atcgagaggg agcaagccgc gcgtggcaac     1200

ttctag                                                                1206


<210>  37
<211>  1578
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding rhodococcus opacus B4 HPPD 
       optimized for dicotyledonous plants containing at the 5' end a 
       nucleic acid sequence encoding an optimized transit peptide 
       (according to SEQ ID No. 13).


<220>
<221>  transit_peptide
<222>  (1)..(372)
<223>  Nucleotide sequence encoding a transit peptide to chlorplasts

<400>  37
atggcttcga tctcctcctc agttgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatatatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

aggatccggt gcatgactat tgagcagacc ctcaccgaca aagaaaggct tgctggactt      420

gatctcggtc agcttgagca gcttgttgga cttgttgagt acgacggcac tagggaccct      480

ttcccagtta gtggttggga cgctgttgtt tgggttgtgg gtaacgctac tcaaaccgct      540

cactactttc agtcagcctt cggaatgacc ctcgtggctt attcaggacc tactactggt      600

aatagggatc accactcctt cgtgcttgag tcaggtgctg ttagattcgt gattcagggc      660

gctgttgatc ctcagtcacc acttattgag caccacaggg ctcacggtga cggtgttgtt      720

gatattgctc ttagcgtgcc cgacgtggac aagtgtattg ctcacgctag ggctcagggt      780

gctgttgttc ttgacgaacc tcacgatatg actgacgagc acggaactgt taggctcgct      840

gctattgcta cttacggtga cactaggcac accctcgttg ataggactca ctacactgga      900

ccttacctcc caggctatat tgctaggacc tctactcaca ctaagaggga cggtgctcct      960

aagaggcttt ttcaggctct tgatcacgtt gtgggtaacg tggaactcgg tagaatggat     1020

cactgggtgg acttctataa tagggtgatg ggcttcacta atatggccga gttcgtgggc     1080

gaggatattg ctactgatta ctcagccctg atgtctaaag tggttagtaa cggtaatcac     1140

agggttaagt tcccacttaa cgagcccgct atcgctaaga aacgtagtca gattgacgag     1200

tacctcgact tctatcaggg accaggtgct caacaccttg ctctcgctac taacgatatt     1260

ctcaccgctg tggataggct taccgctgaa ggtgttgagt tccttgctac ccccgatagc     1320

tactatcagg acccagaact tagggctagg atcggtaacg ttagggctcc tattgaggaa     1380

cttcagaaga ggggaatcct ggttgatagg gacgaggacg gttacctcct tcagatcttc     1440

actaagccac tcgtggatag gcctactgtg ttcttcgagc ttattgagcg tcacggatca     1500

ctcggattcg ggatcggtaa ctttaaggcc ctcttcgagg ctatcgagag agagcaagct     1560

gctaggggca acttctag                                                   1578





17

