SEQUENCE LISTING

<110>  Bayer CropScience AG
 
<120>  Plants tolerant to HPPD inhibitor herbicides

<130>  BCS 11-1056

<160>  37    

<170>  PatentIn version 3.5

<210>  1
<211>  1206
<212>  DNA
<213>  Amycolicicoccus subflavus

<400>  1
atgacactcg aacagactct caccgacgcg gagaagcttg cggacctcga cctcgctcag       60

ctcaagacgc tcgtcgggct tgtggagtac gacggcagca gcgacccgtt tcccgtcagc      120

ggctgggacg cgatcgtgtg ggccgtcggc aacgccaccc agacgtcgca tttcttccag      180

tccgccttcg gcatggaact gatcgcttac tcaggtccga cgaccggtaa ccgtgaccat      240

cacgcgttcg tgctgcgcag tggagcggtc cgattcgtca tcactggagc tgtggatccg      300

gcaagtccga tcgccgacca ccaccgccgc cacggtgatg gtgtcgtcga catcgcgctc      360

accgtcccgg atgttgaccg ttgtatcgcg catgcacgcg cccagggcgc gacggttctc      420

gccgaaccgc acgatgtgtc agatgagcac ggcacggtcc gtctcgccgc gatcgccacc      480

tacggcgaca cccgccacac cctcgtcgac cgttcccgct acaacgggcc ctatctgccc      540

ggttacgcgg agcgcacctc gactttcacc aagcgcgaag gtgctcctaa acgaatcttc      600

caggccctcg accacgtggt cggcaacgtt gagctcggca gaatggacga gtgggtctcc      660

ttctacaacc gggtgatggg tttcacgaat atggctgagt tcgtcggaga agacatcgcc      720

accgactatt ccgcgctgat gagcaaggtg gtgtcgaacg gcaaccaccg ggtcaagttc      780

ccgctcaatg agcccgcgat cgcgaagaag cgctcccaga tcgatgaata cctcgactat      840

tacggtgggc ccggtgctca gcacctcgct ctcgccacca acgacatcct gaccgctgtg      900

gacacgctgc gcgcggaggg catcgagttt ctcgccacgc cggacagcta ctacgaggac      960

ccggaactgc gcgcacgaat cggcgaggta cgggtcccga tcgaggagct gcagaagcgt     1020

ggcatcctcg tcgaccgtga cgaggacggt tatctgctgc agatcttcac caaaccgctc     1080

gtcgaccggc cgaccgtgtt cttcgaactc attgaacgcc atggttcact cggctttggc     1140

atcggcaact tcaaggccct tttcgaggcg atcgagcggg aacaagccgc acgcggcaac     1200

ttctaa                                                                1206


<210>  2
<211>  1209
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for E. coli , containing at the 5' end a nucleic acid 
       encoding an aspartate.


<220>
<221>  misc_feature
<222>  (4)..(6)
<223>  Sequence coding for Asp

<400>  2
atggataccc tggaacagac cctgaccgac gcagaaaaac tggccgatct ggatctggct       60

caactgaaaa ccctggttgg tctggttgaa tacgacggta gctcagatcc gtttccggtt      120

agcggttggg acgctattgt ttgggcagtt ggtaacgcta ctcagactag ccactttttt      180

cagtcagcct ttggtatgga actgattgcc tatagcggtc cgactacagg taatcgtgat      240

caccacgcct ttgttctgcg tagcggtgca gttcgttttg ttattacagg tgcagttgat      300

ccggctagtc cgattgcaga tcaccaccgt cgtcacggtg acggtgttgt tgatattgca      360

ctgaccgtgc cggacgttga tcgttgtatt gctcacgcac gtgctcaggg tgctaccgtt      420

ctggcagaac cgcacgacgt tagtgacgaa cacggcaccg ttcgtctggc agctattgct      480

acctacggtg atacccgtca caccctggtt gatcgtagcc gttataacgg tccgtatctg      540

cctggttacg cagaacgtac tagcaccttt actaaacgtg aaggcgcacc gaaacgtatt      600

tttcaggcac tggatcacgt tgtgggtaac gttgaactgg gtcgtatgga cgagtgggtt      660

agcttttata accgcgtgat gggctttact aatatggccg aatttgtggg tgaagatatt      720

gctaccgatt attcagcact gatgagtaaa gtggttagta acggtaatca ccgtgttaag      780

tttccgctga acgaaccggc tattgctaaa aaacgtagtc agattgacga atatctggat      840

tattacggtg gtccgggtgc tcagcacctg gcactggcta ctaacgatat tctgaccgca      900

gttgataccc tgcgtgcaga aggtattgaa tttctggcta caccggatag ctattacgaa      960

gatcctgaac tgcgtgcccg tattggtgaa gttcgtgttc cgattgaaga actgcaaaaa     1020

cgtggtattc tggtggatcg tgacgaagac ggttatctgc tgcaaatctt tactaaaccg     1080

ctggttgatc gtccgaccgt tttttttgaa ctgattgaac gtcacggtag cctgggtttt     1140

ggtattggta actttaaagc cttatttgaa gctattgaac gtgaacaggc agcacgtggt     1200

aacttttaa                                                             1209


<210>  3
<211>  1341
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for E. coli, containing at the 5' end a nucleic acid 
       encoding several Tags


<220>
<221>  misc_feature
<222>  (4)..(21)
<223>  Sequence encoding His tag containing 6 His

<220>
<221>  misc_feature
<222>  (22)..(31)
<223>  Sequence coding for two Ser and a Gly

<220>
<221>  misc_feature
<222>  (32)..(48)
<223>  Nucleic acid stretch encoding a protein binding site thrombin

<220>
<221>  misc_feature
<222>  (49)..(54)
<223>  Sequence coding for Gly and Met

<220>
<221>  misc_feature
<222>  (55)..(99)
<223>  Nucleic acid stretch encoding a S-tag

<220>
<221>  misc_feature
<222>  (100)..(114)
<223>  Nucleic acid coding for 2 Pro, a Asp, a Leu, a Gly and a Thr

<220>
<221>  misc_feature
<222>  (115)..(129)
<223>  Nucleic acid stretch encoding the recognition site of an 
       enterokinase

<220>
<221>  misc_feature
<222>  (130)..(132)
<223>  Nucleic acid scoding for Ala

<220>
<221>  misc_feature
<222>  (133)..(135)
<223>  start codon

<220>
<221>  misc_feature
<222>  (136)..(138)
<223>  Nucleic acid sequence encoding an aspartate

<400>  3
atgcaccatc atcatcatca ttcttctggt ctggtgccac gcggttctgg tatgaaagaa       60

accgctgctg ctaaattcga acgccagcac atggacagcc cagatctggg taccgacgac      120

gacgacaagg ccatggatac cctggaacag accctgaccg acgcagaaaa actggccgat      180

ctggatctgg ctcaactgaa aaccctggtt ggtctggttg aatacgacgg tagctcagat      240

ccgtttccgg ttagcggttg ggacgctatt gtttgggcag ttggtaacgc tactcagact      300

agccactttt ttcagtcagc ctttggtatg gaactgattg cctatagcgg tccgactaca      360

ggtaatcgtg atcaccacgc ctttgttctg cgtagcggtg cagttcgttt tgttattaca      420

ggtgcagttg atccggctag tccgattgca gatcaccacc gtcgtcacgg tgacggtgtt      480

gttgatattg cactgaccgt gccggacgtt gatcgttgta ttgctcacgc acgtgctcag      540

ggtgctaccg ttctggcaga accgcacgac gttagtgacg aacacggcac cgttcgtctg      600

gcagctattg ctacctacgg tgatacccgt cacaccctgg ttgatcgtag ccgttataac      660

ggtccgtatc tgcctggtta cgcagaacgt actagcacct ttactaaacg tgaaggcgca      720

ccgaaacgta tttttcaggc actggatcac gttgtgggta acgttgaact gggtcgtatg      780

gacgagtggg ttagctttta taaccgcgtg atgggcttta ctaatatggc cgaatttgtg      840

ggtgaagata ttgctaccga ttattcagca ctgatgagta aagtggttag taacggtaat      900

caccgtgtta agtttccgct gaacgaaccg gctattgcta aaaaacgtag tcagattgac      960

gaatatctgg attattacgg tggtccgggt gctcagcacc tggcactggc tactaacgat     1020

attctgaccg cagttgatac cctgcgtgca gaaggtattg aatttctggc tacaccggat     1080

agctattacg aagatcctga actgcgtgcc cgtattggtg aagttcgtgt tccgattgaa     1140

gaactgcaaa aacgtggtat tctggtggat cgtgacgaag acggttatct gctgcaaatc     1200

tttactaaac cgctggttga tcgtccgacc gttttttttg aactgattga acgtcacggt     1260

agcctgggtt ttggtattgg taactttaaa gccttatttg aagctattga acgtgaacag     1320

gcagcacgtg gtaactttta a                                               1341


<210>  4
<211>  18
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid encoding 6 consecutive histidines

<400>  4
caccatcatc atcatcat                                                     18


<210>  5
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 4

<400>  5

His His His His His His 
1               5       


<210>  6
<211>  18
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid stretch encoding a protein binding site thrombin

<400>  6
ctggtgccac gcggttct                                                     18


<210>  7
<211>  6
<212>  PRT
<213>  Artificial protein

<220>
<223>  Protein encoded by SEQ ID No. 6

<400>  7

Leu Val Pro Arg Gly Ser 
1               5       


<210>  8
<211>  45
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid stretch encoding a S-tag

<400>  8
aaagaaaccg ctgctgctaa attcgaacgc cagcacatgg acagc                       45


<210>  9
<211>  15
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 8

<400>  9

Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser 
1               5                   10                  15  


<210>  10
<211>  15
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid stretch encoding the recognition site of an 
       enterokinase

<400>  10
gacgacgacg acaag                                                        15


<210>  11
<211>  5
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 10

<400>  11

Asp Asp Asp Asp Lys 
1               5   


<210>  12
<211>  1584
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for E. coli containing at the 5' end a nucleic acid 
       sequence encoding an optimized transit peptide (according to SEQ 
       ID No. 13) and an aspartate


<220>
<221>  transit_peptide
<222>  (1)..(375)
<223>  Optimized transit peptide to chloroplasts

<220>
<221>  misc_feature
<222>  (376)..(378)
<223>  Sequence encoding a Met

<220>
<221>  misc_feature
<222>  (379)..(381)
<223>  Sequence encoding an Asp

<400>  12
atggcttcga tctcctcctc agtcgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatgtatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

agaatccggt gcgccatgga taccctggaa cagaccctga ccgacgcaga aaaactggcc      420

gatctggatc tggctcaact gaaaaccctg gttggtctgg ttgaatacga cggtagctca      480

gatccgtttc cggttagcgg ttgggacgct attgtttggg cagttggtaa cgctactcag      540

actagccact tttttcagtc agcctttggt atggaactga ttgcctatag cggtccgact      600

acaggtaatc gtgatcacca cgcctttgtt ctgcgtagcg gtgcagttcg ttttgttatt      660

acaggtgcag ttgatccggc tagtccgatt gcagatcacc accgtcgtca cggtgacggt      720

gttgttgata ttgcactgac cgtgccggac gttgatcgtt gtattgctca cgcacgtgct      780

cagggtgcta ccgttctggc agaaccgcac gacgttagtg acgaacacgg caccgttcgt      840

ctggcagcta ttgctaccta cggtgatacc cgtcacaccc tggttgatcg tagccgttat      900

aacggtccgt atctgcctgg ttacgcagaa cgtactagca cctttactaa acgtgaaggc      960

gcaccgaaac gtatttttca ggcactggat cacgttgtgg gtaacgttga actgggtcgt     1020

atggacgagt gggttagctt ttataaccgc gtgatgggct ttactaatat ggccgaattt     1080

gtgggtgaag atattgctac cgattattca gcactgatga gtaaagtggt tagtaacggt     1140

aatcaccgtg ttaagtttcc gctgaacgaa ccggctattg ctaaaaaacg tagtcagatt     1200

gacgaatatc tggattatta cggtggtccg ggtgctcagc acctggcact ggctactaac     1260

gatattctga ccgcagttga taccctgcgt gcagaaggta ttgaatttct ggctacaccg     1320

gatagctatt acgaagatcc tgaactgcgt gcccgtattg gtgaagttcg tgttccgatt     1380

gaagaactgc aaaaacgtgg tattctggtg gatcgtgacg aagacggtta tctgctgcaa     1440

atctttacta aaccgctggt tgatcgtccg accgtttttt ttgaactgat tgaacgtcac     1500

ggtagcctgg gttttggtat tggtaacttt aaagccttat ttgaagctat tgaacgtgaa     1560

caggcagcac gtggtaactt ttaa                                            1584


<210>  13
<211>  372
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding an optimized transit peptide

<400>  13
atggcttcga tctcctcctc agtcgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatgtatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

agaatccggt gc                                                          372


<210>  14
<211>  124
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 13

<400>  14

Met Ala Ser Ile Ser Ser Ser Val Ala Thr Val Ser Arg Thr Ala Pro 
1               5                   10                  15      


Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu Lys Ser Asn Ala 
            20                  25                  30          


Ala Phe Pro Thr Thr Lys Lys Ala Asn Asp Phe Ser Thr Leu Pro Ser 
        35                  40                  45              


Asn Gly Gly Arg Val Gln Cys Met Gln Val Trp Pro Ala Tyr Gly Asn 
    50                  55                  60                  


Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Pro Leu Ser Met Ala Pro 
65                  70                  75                  80  


Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro Phe Gln Gly 
                85                  90                  95      


Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser Ser Arg Ser 
            100                 105                 110         


Leu Gly Asn Val Ser Asn Gly Gly Arg Ile Arg Cys 
        115                 120                 


<210>  15
<211>  1581
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for dicotyledonous plants containing at the 5' end a 
       nucleic acid sequence encoding an optimized transit peptide 
       (according to SEQ ID No. 13) and an aspartate


<220>
<221>  transit_peptide
<222>  (1)..(372)
<223>  Optimized transit peptide to chloroplasts

<220>
<221>  misc_feature
<222>  (373)..(375)
<223>  Nucleic acid encoding Met

<220>
<221>  misc_feature
<222>  (376)..(378)
<223>  Nucleic acid encoding Asp

<400>  15
atggcttcga tctcctcctc agttgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatatatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

aggatccggt gcatggatac ccttgagcaa acccttaccg acgctgagaa gctcgctgat      420

cttgatcttg ctcagcttaa gacccttgtg ggccttgttg aatacgacgg ctctagtgat      480

cccttcccag ttagtggttg ggacgctatt gtgtgggctg ttggtaacgc tactcagaca      540

agtcacttct ttcagtcagc cttcgggatg gaactgatcg cttattcagg acctactacc      600

ggtaataggg atcaccacgc tttcgtgctt aggtcaggtg ctgttagatt cgtgattacc      660

ggcgctgttg atcccgctag tcctattgct gatcatcata ggcgtcacgg tgacggcgtt      720

gtggatattg ctcttactgt tccagacgtg gaccgttgta ttgctcacgc tagggctcaa      780

ggtgctactg ttcttgctga acctcacgac gttagtgacg agcacggaac tgttaggctt      840

gctgctattg ctacctacgg tgacactagg cacacccttg ttgataggtc taggtataac      900

ggcccttacc tcccaggtta cgctgagaga actagcacct tcactaagag ggaaggtgcc      960

cctaagagga tctttcaggc tcttgatcac gttgtgggta acgtggaact tggtagaatg     1020

gacgagtggg ttagcttcta taatagggtg atgggcttca ctaatatggc cgagttcgtg     1080

ggcgaggata ttgctactga ttactcagcc ctgatgtcta aagtggttag taacggtaat     1140

cacagggtta agttcccact taacgagccc gctatcgcta agaaacgtag tcagattgac     1200

gagtacctcg actactacgg tggaccaggt gctcaacacc ttgctcttgc tactaacgat     1260

attctcaccg ccgtggatac tcttagggct gagggaattg agttcctcgc taccccagat     1320

agttactacg aggacccaga acttagggct aggattggag aggttagagt gcctatcgag     1380

gaacttcaga agaggggaat cctcgttgat agggacgagg acggttacct ccttcagatt     1440

ttcactaagc ccctcgtgga taggcctacc gttttcttcg aacttatcga gcgtcacgga     1500

tcactcggat tcgggatcgg taactttaag gctctcttcg aggctatcga gagagagcaa     1560

gctgctaggg gcaacttcta g                                               1581


<210>  16
<211>  1578
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       containing at the 5' end a nucleic acid sequence encoding an 
       optimized transit peptide (according to SEQ ID No. 13)


<220>
<221>  transit_peptide
<222>  (1)..(372)
<223>  Optimized transit peptde to chloroplasts

<400>  16
atggcttcga tctcctcctc agttgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatatatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

aggatccggt gcatgacact cgaacagact ctcaccgacg cggagaagct tgcggacctc      420

gacctcgctc agctcaagac gctcgtcggg cttgtggagt acgacggcag cagcgacccg      480

tttcccgtca gcggctggga cgcgatcgtg tgggccgtcg gcaacgccac ccagacgtcg      540

catttcttcc agtccgcctt cggcatggaa ctgatcgctt actcaggtcc gacgaccggt      600

aaccgtgacc atcacgcgtt cgtgctgcgc agtggagcgg tccgattcgt catcactgga      660

gctgtggatc cggcaagtcc gatcgccgac caccaccgcc gccacggtga tggtgtcgtc      720

gacatcgcgc tcaccgtccc ggatgttgac cgttgtatcg cgcatgcacg cgcccagggc      780

gcgacggttc tcgccgaacc gcacgatgtg tcagatgagc acggcacggt ccgtctcgcc      840

gcgatcgcca cctacggcga cacccgccac accctcgtcg accgttcccg ctacaacggg      900

ccctatctgc ccggttacgc ggagcgcacc tcgactttca ccaagcgcga aggtgctcct      960

aaacgaatct tccaggccct cgaccacgtg gtcggcaacg ttgagctcgg cagaatggac     1020

gagtgggtct ccttctacaa ccgggtgatg ggtttcacga atatggctga gttcgtcgga     1080

gaagacatcg ccaccgacta ttccgcgctg atgagcaagg tggtgtcgaa cggcaaccac     1140

cgggtcaagt tcccgctcaa tgagcccgcg atcgcgaaga agcgctccca gatcgatgaa     1200

tacctcgact attacggtgg gcccggtgct cagcacctcg ctctcgccac caacgacatc     1260

ctgaccgctg tggacacgct gcgcgcggag ggcatcgagt ttctcgccac gccggacagc     1320

tactacgagg acccggaact gcgcgcacga atcggcgagg tacgggtccc gatcgaggag     1380

ctgcagaagc gtggcatcct cgtcgaccgt gacgaggacg gttatctgct gcagatcttc     1440

accaaaccgc tcgtcgaccg gccgaccgtg ttcttcgaac tcattgaacg ccatggttca     1500

ctcggctttg gcatcggcaa cttcaaggcc cttttcgagg cgatcgagcg ggaacaagcc     1560

gcacgcggca acttctaa                                                   1578


<210>  17
<211>  401
<212>  PRT
<213>  Amycolicicoccus subflavus

<400>  17

Met Thr Leu Glu Gln Thr Leu Thr Asp Ala Glu Lys Leu Ala Asp Leu 
1               5                   10                  15      


Asp Leu Ala Gln Leu Lys Thr Leu Val Gly Leu Val Glu Tyr Asp Gly 
            20                  25                  30          


Ser Ser Asp Pro Phe Pro Val Ser Gly Trp Asp Ala Ile Val Trp Ala 
        35                  40                  45              


Val Gly Asn Ala Thr Gln Thr Ser His Phe Phe Gln Ser Ala Phe Gly 
    50                  55                  60                  


Met Glu Leu Ile Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp His 
65                  70                  75                  80  


His Ala Phe Val Leu Arg Ser Gly Ala Val Arg Phe Val Ile Thr Gly 
                85                  90                  95      


Ala Val Asp Pro Ala Ser Pro Ile Ala Asp His His Arg Arg His Gly 
            100                 105                 110         


Asp Gly Val Val Asp Ile Ala Leu Thr Val Pro Asp Val Asp Arg Cys 
        115                 120                 125             


Ile Ala His Ala Arg Ala Gln Gly Ala Thr Val Leu Ala Glu Pro His 
    130                 135                 140                 


Asp Val Ser Asp Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala Thr 
145                 150                 155                 160 


Tyr Gly Asp Thr Arg His Thr Leu Val Asp Arg Ser Arg Tyr Asn Gly 
                165                 170                 175     


Pro Tyr Leu Pro Gly Tyr Ala Glu Arg Thr Ser Thr Phe Thr Lys Arg 
            180                 185                 190         


Glu Gly Ala Pro Lys Arg Ile Phe Gln Ala Leu Asp His Val Val Gly 
        195                 200                 205             


Asn Val Glu Leu Gly Arg Met Asp Glu Trp Val Ser Phe Tyr Asn Arg 
    210                 215                 220                 


Val Met Gly Phe Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile Ala 
225                 230                 235                 240 


Thr Asp Tyr Ser Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn His 
                245                 250                 255     


Arg Val Lys Phe Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg Ser 
            260                 265                 270         


Gln Ile Asp Glu Tyr Leu Asp Tyr Tyr Gly Gly Pro Gly Ala Gln His 
        275                 280                 285             


Leu Ala Leu Ala Thr Asn Asp Ile Leu Thr Ala Val Asp Thr Leu Arg 
    290                 295                 300                 


Ala Glu Gly Ile Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Glu Asp 
305                 310                 315                 320 


Pro Glu Leu Arg Ala Arg Ile Gly Glu Val Arg Val Pro Ile Glu Glu 
                325                 330                 335     


Leu Gln Lys Arg Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr Leu 
            340                 345                 350         


Leu Gln Ile Phe Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe Phe 
        355                 360                 365             


Glu Leu Ile Glu Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn Phe 
    370                 375                 380                 


Lys Ala Leu Phe Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly Asn 
385                 390                 395                 400 


Phe 
    


<210>  18
<211>  402
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 2

<400>  18

Met Asp Thr Leu Glu Gln Thr Leu Thr Asp Ala Glu Lys Leu Ala Asp 
1               5                   10                  15      


Leu Asp Leu Ala Gln Leu Lys Thr Leu Val Gly Leu Val Glu Tyr Asp 
            20                  25                  30          


Gly Ser Ser Asp Pro Phe Pro Val Ser Gly Trp Asp Ala Ile Val Trp 
        35                  40                  45              


Ala Val Gly Asn Ala Thr Gln Thr Ser His Phe Phe Gln Ser Ala Phe 
    50                  55                  60                  


Gly Met Glu Leu Ile Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp 
65                  70                  75                  80  


His His Ala Phe Val Leu Arg Ser Gly Ala Val Arg Phe Val Ile Thr 
                85                  90                  95      


Gly Ala Val Asp Pro Ala Ser Pro Ile Ala Asp His His Arg Arg His 
            100                 105                 110         


Gly Asp Gly Val Val Asp Ile Ala Leu Thr Val Pro Asp Val Asp Arg 
        115                 120                 125             


Cys Ile Ala His Ala Arg Ala Gln Gly Ala Thr Val Leu Ala Glu Pro 
    130                 135                 140                 


His Asp Val Ser Asp Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala 
145                 150                 155                 160 


Thr Tyr Gly Asp Thr Arg His Thr Leu Val Asp Arg Ser Arg Tyr Asn 
                165                 170                 175     


Gly Pro Tyr Leu Pro Gly Tyr Ala Glu Arg Thr Ser Thr Phe Thr Lys 
            180                 185                 190         


Arg Glu Gly Ala Pro Lys Arg Ile Phe Gln Ala Leu Asp His Val Val 
        195                 200                 205             


Gly Asn Val Glu Leu Gly Arg Met Asp Glu Trp Val Ser Phe Tyr Asn 
    210                 215                 220                 


Arg Val Met Gly Phe Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile 
225                 230                 235                 240 


Ala Thr Asp Tyr Ser Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn 
                245                 250                 255     


His Arg Val Lys Phe Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg 
            260                 265                 270         


Ser Gln Ile Asp Glu Tyr Leu Asp Tyr Tyr Gly Gly Pro Gly Ala Gln 
        275                 280                 285             


His Leu Ala Leu Ala Thr Asn Asp Ile Leu Thr Ala Val Asp Thr Leu 
    290                 295                 300                 


Arg Ala Glu Gly Ile Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Glu 
305                 310                 315                 320 


Asp Pro Glu Leu Arg Ala Arg Ile Gly Glu Val Arg Val Pro Ile Glu 
                325                 330                 335     


Glu Leu Gln Lys Arg Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr 
            340                 345                 350         


Leu Leu Gln Ile Phe Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe 
        355                 360                 365             


Phe Glu Leu Ile Glu Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn 
    370                 375                 380                 


Phe Lys Ala Leu Phe Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly 
385                 390                 395                 400 


Asn Phe 
        


<210>  19
<211>  446
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 3


<220>
<221>  MISC_FEATURE
<222>  (2)..(7)
<223>  HIS Tag made of 6 HIS

<220>
<221>  MISC_FEATURE
<222>  (8)..(8)
<223>  Ser

<220>
<221>  MISC_FEATURE
<222>  (9)..(9)
<223>  Ser

<220>
<221>  MISC_FEATURE
<222>  (10)..(10)
<223>  Gly

<220>
<221>  MISC_FEATURE
<222>  (11)..(16)
<223>  protein binding site thrombin

<220>
<221>  MISC_FEATURE
<222>  (17)..(17)
<223>  Gly

<220>
<221>  MISC_FEATURE
<222>  (18)..(18)
<223>  Met

<220>
<221>  MISC_FEATURE
<222>  (19)..(33)
<223>  S-Tag

<220>
<221>  MISC_FEATURE
<222>  (34)..(34)
<223>  Pro

<220>
<221>  MISC_FEATURE
<222>  (35)..(35)
<223>  Pro

<220>
<221>  MISC_FEATURE
<222>  (36)..(36)
<223>  Asp

<220>
<221>  MISC_FEATURE
<222>  (37)..(37)
<223>  Leu

<220>
<221>  MISC_FEATURE
<222>  (38)..(38)
<223>  Gly

<220>
<221>  MISC_FEATURE
<222>  (39)..(39)
<223>  Thr

<220>
<221>  MISC_FEATURE
<222>  (40)..(43)
<223>  recognition site of an enterokinase

<220>
<221>  MISC_FEATURE
<222>  (44)..(44)
<223>  Ala

<220>
<221>  MISC_FEATURE
<222>  (45)..(45)
<223>  Met

<220>
<221>  MISC_FEATURE
<222>  (46)..(46)
<223>  Asp

<400>  19

Met His His His His His His Ser Ser Gly Leu Val Pro Arg Gly Ser 
1               5                   10                  15      


Gly Met Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gln His Met Asp 
            20                  25                  30          


Ser Pro Asp Leu Gly Thr Asp Asp Asp Asp Lys Ala Met Asp Thr Leu 
        35                  40                  45              


Glu Gln Thr Leu Thr Asp Ala Glu Lys Leu Ala Asp Leu Asp Leu Ala 
    50                  55                  60                  


Gln Leu Lys Thr Leu Val Gly Leu Val Glu Tyr Asp Gly Ser Ser Asp 
65                  70                  75                  80  


Pro Phe Pro Val Ser Gly Trp Asp Ala Ile Val Trp Ala Val Gly Asn 
                85                  90                  95      


Ala Thr Gln Thr Ser His Phe Phe Gln Ser Ala Phe Gly Met Glu Leu 
            100                 105                 110         


Ile Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp His His Ala Phe 
        115                 120                 125             


Val Leu Arg Ser Gly Ala Val Arg Phe Val Ile Thr Gly Ala Val Asp 
    130                 135                 140                 


Pro Ala Ser Pro Ile Ala Asp His His Arg Arg His Gly Asp Gly Val 
145                 150                 155                 160 


Val Asp Ile Ala Leu Thr Val Pro Asp Val Asp Arg Cys Ile Ala His 
                165                 170                 175     


Ala Arg Ala Gln Gly Ala Thr Val Leu Ala Glu Pro His Asp Val Ser 
            180                 185                 190         


Asp Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala Thr Tyr Gly Asp 
        195                 200                 205             


Thr Arg His Thr Leu Val Asp Arg Ser Arg Tyr Asn Gly Pro Tyr Leu 
    210                 215                 220                 


Pro Gly Tyr Ala Glu Arg Thr Ser Thr Phe Thr Lys Arg Glu Gly Ala 
225                 230                 235                 240 


Pro Lys Arg Ile Phe Gln Ala Leu Asp His Val Val Gly Asn Val Glu 
                245                 250                 255     


Leu Gly Arg Met Asp Glu Trp Val Ser Phe Tyr Asn Arg Val Met Gly 
            260                 265                 270         


Phe Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile Ala Thr Asp Tyr 
        275                 280                 285             


Ser Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn His Arg Val Lys 
    290                 295                 300                 


Phe Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg Ser Gln Ile Asp 
305                 310                 315                 320 


Glu Tyr Leu Asp Tyr Tyr Gly Gly Pro Gly Ala Gln His Leu Ala Leu 
                325                 330                 335     


Ala Thr Asn Asp Ile Leu Thr Ala Val Asp Thr Leu Arg Ala Glu Gly 
            340                 345                 350         


Ile Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Glu Asp Pro Glu Leu 
        355                 360                 365             


Arg Ala Arg Ile Gly Glu Val Arg Val Pro Ile Glu Glu Leu Gln Lys 
    370                 375                 380                 


Arg Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr Leu Leu Gln Ile 
385                 390                 395                 400 


Phe Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe Phe Glu Leu Ile 
                405                 410                 415     


Glu Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn Phe Lys Ala Leu 
            420                 425                 430         


Phe Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly Asn Phe 
        435                 440                 445     


<210>  20
<211>  527
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 12


<220>
<221>  TRANSIT
<222>  (1)..(125)
<223>  Optimized transit peptide for chloroplasts

<220>
<221>  MISC_FEATURE
<222>  (126)..(126)

<220>
<221>  MISC_FEATURE
<222>  (127)..(127)

<400>  20

Met Ala Ser Ile Ser Ser Ser Val Ala Thr Val Ser Arg Thr Ala Pro 
1               5                   10                  15      


Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu Lys Ser Asn Ala 
            20                  25                  30          


Ala Phe Pro Thr Thr Lys Lys Ala Asn Asp Phe Ser Thr Leu Pro Ser 
        35                  40                  45              


Asn Gly Gly Arg Val Gln Cys Met Gln Val Trp Pro Ala Tyr Gly Asn 
    50                  55                  60                  


Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Pro Leu Ser Met Ala Pro 
65                  70                  75                  80  


Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro Phe Gln Gly 
                85                  90                  95      


Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser Ser Arg Ser 
            100                 105                 110         


Leu Gly Asn Val Ser Asn Gly Gly Arg Ile Arg Cys Ala Met Asp Thr 
        115                 120                 125             


Leu Glu Gln Thr Leu Thr Asp Ala Glu Lys Leu Ala Asp Leu Asp Leu 
    130                 135                 140                 


Ala Gln Leu Lys Thr Leu Val Gly Leu Val Glu Tyr Asp Gly Ser Ser 
145                 150                 155                 160 


Asp Pro Phe Pro Val Ser Gly Trp Asp Ala Ile Val Trp Ala Val Gly 
                165                 170                 175     


Asn Ala Thr Gln Thr Ser His Phe Phe Gln Ser Ala Phe Gly Met Glu 
            180                 185                 190         


Leu Ile Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp His His Ala 
        195                 200                 205             


Phe Val Leu Arg Ser Gly Ala Val Arg Phe Val Ile Thr Gly Ala Val 
    210                 215                 220                 


Asp Pro Ala Ser Pro Ile Ala Asp His His Arg Arg His Gly Asp Gly 
225                 230                 235                 240 


Val Val Asp Ile Ala Leu Thr Val Pro Asp Val Asp Arg Cys Ile Ala 
                245                 250                 255     


His Ala Arg Ala Gln Gly Ala Thr Val Leu Ala Glu Pro His Asp Val 
            260                 265                 270         


Ser Asp Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala Thr Tyr Gly 
        275                 280                 285             


Asp Thr Arg His Thr Leu Val Asp Arg Ser Arg Tyr Asn Gly Pro Tyr 
    290                 295                 300                 


Leu Pro Gly Tyr Ala Glu Arg Thr Ser Thr Phe Thr Lys Arg Glu Gly 
305                 310                 315                 320 


Ala Pro Lys Arg Ile Phe Gln Ala Leu Asp His Val Val Gly Asn Val 
                325                 330                 335     


Glu Leu Gly Arg Met Asp Glu Trp Val Ser Phe Tyr Asn Arg Val Met 
            340                 345                 350         


Gly Phe Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile Ala Thr Asp 
        355                 360                 365             


Tyr Ser Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn His Arg Val 
    370                 375                 380                 


Lys Phe Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg Ser Gln Ile 
385                 390                 395                 400 


Asp Glu Tyr Leu Asp Tyr Tyr Gly Gly Pro Gly Ala Gln His Leu Ala 
                405                 410                 415     


Leu Ala Thr Asn Asp Ile Leu Thr Ala Val Asp Thr Leu Arg Ala Glu 
            420                 425                 430         


Gly Ile Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Glu Asp Pro Glu 
        435                 440                 445             


Leu Arg Ala Arg Ile Gly Glu Val Arg Val Pro Ile Glu Glu Leu Gln 
    450                 455                 460                 


Lys Arg Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr Leu Leu Gln 
465                 470                 475                 480 


Ile Phe Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe Phe Glu Leu 
                485                 490                 495     


Ile Glu Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn Phe Lys Ala 
            500                 505                 510         


Leu Phe Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly Asn Phe 
        515                 520                 525         


<210>  21
<211>  526
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 15


<220>
<221>  TRANSIT
<222>  (1)..(124)
<223>  Optimized transit peptide to chloroplasts

<220>
<221>  MISC_FEATURE
<222>  (125)..(125)
<223>  Met

<220>
<221>  MISC_FEATURE
<222>  (126)..(126)
<223>  Asp

<400>  21

Met Ala Ser Ile Ser Ser Ser Val Ala Thr Val Ser Arg Thr Ala Pro 
1               5                   10                  15      


Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu Lys Ser Asn Ala 
            20                  25                  30          


Ala Phe Pro Thr Thr Lys Lys Ala Asn Asp Phe Ser Thr Leu Pro Ser 
        35                  40                  45              


Asn Gly Gly Arg Val Gln Tyr Met Gln Val Trp Pro Ala Tyr Gly Asn 
    50                  55                  60                  


Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Pro Leu Ser Met Ala Pro 
65                  70                  75                  80  


Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro Phe Gln Gly 
                85                  90                  95      


Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser Ser Arg Ser 
            100                 105                 110         


Leu Gly Asn Val Ser Asn Gly Gly Arg Ile Arg Cys Met Asp Thr Leu 
        115                 120                 125             


Glu Gln Thr Leu Thr Asp Ala Glu Lys Leu Ala Asp Leu Asp Leu Ala 
    130                 135                 140                 


Gln Leu Lys Thr Leu Val Gly Leu Val Glu Tyr Asp Gly Ser Ser Asp 
145                 150                 155                 160 


Pro Phe Pro Val Ser Gly Trp Asp Ala Ile Val Trp Ala Val Gly Asn 
                165                 170                 175     


Ala Thr Gln Thr Ser His Phe Phe Gln Ser Ala Phe Gly Met Glu Leu 
            180                 185                 190         


Ile Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp His His Ala Phe 
        195                 200                 205             


Val Leu Arg Ser Gly Ala Val Arg Phe Val Ile Thr Gly Ala Val Asp 
    210                 215                 220                 


Pro Ala Ser Pro Ile Ala Asp His His Arg Arg His Gly Asp Gly Val 
225                 230                 235                 240 


Val Asp Ile Ala Leu Thr Val Pro Asp Val Asp Arg Cys Ile Ala His 
                245                 250                 255     


Ala Arg Ala Gln Gly Ala Thr Val Leu Ala Glu Pro His Asp Val Ser 
            260                 265                 270         


Asp Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala Thr Tyr Gly Asp 
        275                 280                 285             


Thr Arg His Thr Leu Val Asp Arg Ser Arg Tyr Asn Gly Pro Tyr Leu 
    290                 295                 300                 


Pro Gly Tyr Ala Glu Arg Thr Ser Thr Phe Thr Lys Arg Glu Gly Ala 
305                 310                 315                 320 


Pro Lys Arg Ile Phe Gln Ala Leu Asp His Val Val Gly Asn Val Glu 
                325                 330                 335     


Leu Gly Arg Met Asp Glu Trp Val Ser Phe Tyr Asn Arg Val Met Gly 
            340                 345                 350         


Phe Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile Ala Thr Asp Tyr 
        355                 360                 365             


Ser Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn His Arg Val Lys 
    370                 375                 380                 


Phe Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg Ser Gln Ile Asp 
385                 390                 395                 400 


Glu Tyr Leu Asp Tyr Tyr Gly Gly Pro Gly Ala Gln His Leu Ala Leu 
                405                 410                 415     


Ala Thr Asn Asp Ile Leu Thr Ala Val Asp Thr Leu Arg Ala Glu Gly 
            420                 425                 430         


Ile Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Glu Asp Pro Glu Leu 
        435                 440                 445             


Arg Ala Arg Ile Gly Glu Val Arg Val Pro Ile Glu Glu Leu Gln Lys 
    450                 455                 460                 


Arg Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr Leu Leu Gln Ile 
465                 470                 475                 480 


Phe Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe Phe Glu Leu Ile 
                485                 490                 495     


Glu Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn Phe Lys Ala Leu 
            500                 505                 510         


Phe Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly Asn Phe 
        515                 520                 525     


<210>  22
<211>  525
<212>  PRT
<213>  Artificial sequenc

<220>
<223>  Protein sequence encoded by SEQ ID No. 16


<220>
<221>  TRANSIT
<222>  (1)..(124)
<223>  Optimized transit peptide to chloroplasts

<400>  22

Met Ala Ser Ile Ser Ser Ser Val Ala Thr Val Ser Arg Thr Ala Pro 
1               5                   10                  15      


Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu Lys Ser Asn Ala 
            20                  25                  30          


Ala Phe Pro Thr Thr Lys Lys Ala Asn Asp Phe Ser Thr Leu Pro Ser 
        35                  40                  45              


Asn Gly Gly Arg Val Gln Tyr Met Gln Val Trp Pro Ala Tyr Gly Asn 
    50                  55                  60                  


Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Pro Leu Ser Met Ala Pro 
65                  70                  75                  80  


Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro Phe Gln Gly 
                85                  90                  95      


Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser Ser Arg Ser 
            100                 105                 110         


Leu Gly Asn Val Ser Asn Gly Gly Arg Ile Arg Cys Met Thr Leu Glu 
        115                 120                 125             


Gln Thr Leu Thr Asp Ala Glu Lys Leu Ala Asp Leu Asp Leu Ala Gln 
    130                 135                 140                 


Leu Lys Thr Leu Val Gly Leu Val Glu Tyr Asp Gly Ser Ser Asp Pro 
145                 150                 155                 160 


Phe Pro Val Ser Gly Trp Asp Ala Ile Val Trp Ala Val Gly Asn Ala 
                165                 170                 175     


Thr Gln Thr Ser His Phe Phe Gln Ser Ala Phe Gly Met Glu Leu Ile 
            180                 185                 190         


Ala Tyr Ser Gly Pro Thr Thr Gly Asn Arg Asp His His Ala Phe Val 
        195                 200                 205             


Leu Arg Ser Gly Ala Val Arg Phe Val Ile Thr Gly Ala Val Asp Pro 
    210                 215                 220                 


Ala Ser Pro Ile Ala Asp His His Arg Arg His Gly Asp Gly Val Val 
225                 230                 235                 240 


Asp Ile Ala Leu Thr Val Pro Asp Val Asp Arg Cys Ile Ala His Ala 
                245                 250                 255     


Arg Ala Gln Gly Ala Thr Val Leu Ala Glu Pro His Asp Val Ser Asp 
            260                 265                 270         


Glu His Gly Thr Val Arg Leu Ala Ala Ile Ala Thr Tyr Gly Asp Thr 
        275                 280                 285             


Arg His Thr Leu Val Asp Arg Ser Arg Tyr Asn Gly Pro Tyr Leu Pro 
    290                 295                 300                 


Gly Tyr Ala Glu Arg Thr Ser Thr Phe Thr Lys Arg Glu Gly Ala Pro 
305                 310                 315                 320 


Lys Arg Ile Phe Gln Ala Leu Asp His Val Val Gly Asn Val Glu Leu 
                325                 330                 335     


Gly Arg Met Asp Glu Trp Val Ser Phe Tyr Asn Arg Val Met Gly Phe 
            340                 345                 350         


Thr Asn Met Ala Glu Phe Val Gly Glu Asp Ile Ala Thr Asp Tyr Ser 
        355                 360                 365             


Ala Leu Met Ser Lys Val Val Ser Asn Gly Asn His Arg Val Lys Phe 
    370                 375                 380                 


Pro Leu Asn Glu Pro Ala Ile Ala Lys Lys Arg Ser Gln Ile Asp Glu 
385                 390                 395                 400 


Tyr Leu Asp Tyr Tyr Gly Gly Pro Gly Ala Gln His Leu Ala Leu Ala 
                405                 410                 415     


Thr Asn Asp Ile Leu Thr Ala Val Asp Thr Leu Arg Ala Glu Gly Ile 
            420                 425                 430         


Glu Phe Leu Ala Thr Pro Asp Ser Tyr Tyr Glu Asp Pro Glu Leu Arg 
        435                 440                 445             


Ala Arg Ile Gly Glu Val Arg Val Pro Ile Glu Glu Leu Gln Lys Arg 
    450                 455                 460                 


Gly Ile Leu Val Asp Arg Asp Glu Asp Gly Tyr Leu Leu Gln Ile Phe 
465                 470                 475                 480 


Thr Lys Pro Leu Val Asp Arg Pro Thr Val Phe Phe Glu Leu Ile Glu 
                485                 490                 495     


Arg His Gly Ser Leu Gly Phe Gly Ile Gly Asn Phe Lys Ala Leu Phe 
            500                 505                 510         


Glu Ala Ile Glu Arg Glu Gln Ala Ala Arg Gly Asn Phe 
        515                 520                 525 


<210>  23
<211>  1422
<212>  DNA
<213>  Arabidopsis thaliana

<400>  23
atgtgtctat cgttagcttc tacagctcaa cgaaacacac agttccgtag cagagtttta       60

gttttagcag agttggtgaa atcaatgggc caccaaaacg ccgccgtttc agagaatcaa      120

aaccatgatg acggcgctgc gtcgtcgccg ggattcaagc tcgtcggatt ttccaagttc      180

gtaagaaaga atccaaagtc tgataaattc aaggttaagc gcttccatca catcgagttc      240

tggtgcggcg acgcaaccaa cgtcgctcgt cgcttctcct ggggtctggg gatgagattc      300

tccgccaaat ccgatctttc caccggaaac atggttcacg cctcttacct actcacctcc      360

ggtgacctcc gattcctttt cactgctcct tactctccgt ctctctccgc cggagagatt      420

aaaccgacaa ccacagcttc tatcccaagt ttcgatcacg gctcttgtcg ttccttcttc      480

tcttcacatg gtctcggtgt tagagccgtt gcgattgaag tagaagacgc agagtcagct      540

ttctccatca gtgtagctaa tggcgctatt ccttcgtcgc ctcctatcgt cctcaatgaa      600

gcagttacga tcgctgaggt taaactatac ggcgatgttg ttctccgata tgttagttac      660

aaagcagaag ataccgaaaa atccgaattc ttgccagggt tcgagcgtgt agaggatgcg      720

tcgtcgttcc cattggatta tggtatccgg cggcttgacc acgccgtggg aaacgttcct      780

gagcttggtc cggctttaac ttatgtagcg gggttcactg gttttcacca attcgcagag      840

ttcacagcag acgacgttgg aaccgccgag agcggtttaa attcagcggt cctggctagc      900

aatgatgaaa tggttcttct accgattaac gagccagtgc acggaacaaa gaggaagagt      960

cagattcaga cgtatttgga acataacgaa ggcgcagggc tacaacatct ggctctgatg     1020

agtgaagaca tattcaggac cctgagagag atgaggaaga ggagcagtat tggaggattc     1080

gacttcatgc cttctcctcc gcctacttac taccagaatc tcaagaaacg ggtcggcgac     1140

gtgctcagcg atgatcagat caaggagtgt gaggaattag ggattcttgt agacagagat     1200

gatcaaggga cgttgcttca aatcttcaca aaaccactag gtgacaggcc gacgatattt     1260

atagagataa tccagagagt aggatgcatg atgaaagatg aggaagggaa ggcttaccag     1320

agtggaggat gtggtggttt tggcaaaggc aatttctctg agctcttcaa gtccattgaa     1380

gaatacgaaa agactcttga agccaaacag ttagtgggat ga                        1422


<210>  24
<211>  473
<212>  PRT
<213>  Arabidopsis thaliana

<400>  24

Met Cys Leu Ser Leu Ala Ser Thr Ala Gln Arg Asn Thr Gln Phe Arg 
1               5                   10                  15      


Ser Arg Val Leu Val Leu Ala Glu Leu Val Lys Ser Met Gly His Gln 
            20                  25                  30          


Asn Ala Ala Val Ser Glu Asn Gln Asn His Asp Asp Gly Ala Ala Ser 
        35                  40                  45              


Ser Pro Gly Phe Lys Leu Val Gly Phe Ser Lys Phe Val Arg Lys Asn 
    50                  55                  60                  


Pro Lys Ser Asp Lys Phe Lys Val Lys Arg Phe His His Ile Glu Phe 
65                  70                  75                  80  


Trp Cys Gly Asp Ala Thr Asn Val Ala Arg Arg Phe Ser Trp Gly Leu 
                85                  90                  95      


Gly Met Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr Gly Asn Met Val 
            100                 105                 110         


His Ala Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg Phe Leu Phe Thr 
        115                 120                 125             


Ala Pro Tyr Ser Pro Ser Leu Ser Ala Gly Glu Ile Lys Pro Thr Thr 
    130                 135                 140                 


Thr Ala Ser Ile Pro Ser Phe Asp His Gly Ser Cys Arg Ser Phe Phe 
145                 150                 155                 160 


Ser Ser His Gly Leu Gly Val Arg Ala Val Ala Ile Glu Val Glu Asp 
                165                 170                 175     


Ala Glu Ser Ala Phe Ser Ile Ser Val Ala Asn Gly Ala Ile Pro Ser 
            180                 185                 190         


Ser Pro Pro Ile Val Leu Asn Glu Ala Val Thr Ile Ala Glu Val Lys 
        195                 200                 205             


Leu Tyr Gly Asp Val Val Leu Arg Tyr Val Ser Tyr Lys Ala Glu Asp 
    210                 215                 220                 


Thr Glu Lys Ser Glu Phe Leu Pro Gly Phe Glu Arg Val Glu Asp Ala 
225                 230                 235                 240 


Ser Ser Phe Pro Leu Asp Tyr Gly Ile Arg Arg Leu Asp His Ala Val 
                245                 250                 255     


Gly Asn Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr Val Ala Gly Phe 
            260                 265                 270         


Thr Gly Phe His Gln Phe Ala Glu Phe Thr Ala Asp Asp Val Gly Thr 
        275                 280                 285             


Ala Glu Ser Gly Leu Asn Ser Ala Val Leu Ala Ser Asn Asp Glu Met 
    290                 295                 300                 


Val Leu Leu Pro Ile Asn Glu Pro Val His Gly Thr Lys Arg Lys Ser 
305                 310                 315                 320 


Gln Ile Gln Thr Tyr Leu Glu His Asn Glu Gly Ala Gly Leu Gln His 
                325                 330                 335     


Leu Ala Leu Met Ser Glu Asp Ile Phe Arg Thr Leu Arg Glu Met Arg 
            340                 345                 350         


Lys Arg Ser Ser Ile Gly Gly Phe Asp Phe Met Pro Ser Pro Pro Pro 
        355                 360                 365             


Thr Tyr Tyr Gln Asn Leu Lys Lys Arg Val Gly Asp Val Leu Ser Asp 
    370                 375                 380                 


Asp Gln Ile Lys Glu Cys Glu Glu Leu Gly Ile Leu Val Asp Arg Asp 
385                 390                 395                 400 


Asp Gln Gly Thr Leu Leu Gln Ile Phe Thr Lys Pro Leu Gly Asp Arg 
                405                 410                 415     


Pro Thr Ile Phe Ile Glu Ile Ile Gln Arg Val Gly Cys Met Met Lys 
            420                 425                 430         


Asp Glu Glu Gly Lys Ala Tyr Gln Ser Gly Gly Cys Gly Gly Phe Gly 
        435                 440                 445             


Lys Gly Asn Phe Ser Glu Leu Phe Lys Ser Ile Glu Glu Tyr Glu Lys 
    450                 455                 460                 


Thr Leu Glu Ala Lys Gln Leu Val Gly 
465                 470             


<210>  25
<211>  1353
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Arabidopsis thaliana HPPD, 
       containing at the 5' end a nucleic acid encoding an alanine and 6
       histidine amino acids


<220>
<221>  misc_feature
<222>  (4)..(6)
<223>  Nucleic sequence coding for Ala

<220>
<221>  misc_feature
<222>  (7)..(24)
<223>  Nucleic sequence coding for 6 His

<400>  25
atggctcatc accatcacca tcaccaaaac gccgccgttt cagagaatca aaaccatgat       60

gacggcgctg cgtcgtcgcc gggattcaag ctcgtcggat tttccaagtt cgtaagaaag      120

aatccaaagt ctgataaatt caaggttaag cgcttccatc acatcgagtt ctggtgcggc      180

gacgcaacca acgtcgctcg tcgcttctcc tggggtctgg ggatgagatt ctccgccaaa      240

tccgatcttt ccaccggaaa catggttcac gcctcttacc tactcacctc cggtgacctc      300

cgattccttt tcactgctcc ttactctccg tctctctccg ccggagagat taaaccgaca      360

accacagctt ctatcccaag tttcgatcac ggctcttgtc gttccttctt ctcgtcacat      420

ggtctcggtg ttagagccgt tgcgattgaa gtagaagacg cagagtcagc tttctccatc      480

agtgtagcta atggcgctat tccttcgtcg cctcctatcg tcctcaatga agcagttacg      540

atcgctgagg ttaaactata cggcgatgtt gttctccgat atgttagtta caaagcagaa      600

gataccgaaa aatccgaatt cttgccaggg ttcgagcgtg tagaggatgc gtcgtcgttc      660

ccattggatt atggtatccg gcggcttgac cacgccgtgg gaaacgttcc tgagcttggt      720

ccggctttaa cttatgtagc ggggttcact ggttttcacc aattcgcaga gttcacagca      780

gacgacgttg gaaccgccga gagcggttta aattcagcgg tcctggctag caatgatgaa      840

atggttcttc taccgattaa cgagccagtg cacggaacaa agaggaagag tcagattcag      900

acgtatttgg aacataacga aggcgcaggg ctacaacatc tggctctgat gagtgaagac      960

atattcagga ccctgagaga gatgaggaag aggagcagta ttggaggatt cgacttcatg     1020

ccttctcctc cgcctactta ctaccagaat ctcaagaaac gggtcggcga cgtgctcagc     1080

gatgatcaga tcaaggagtg tgaggaatta gggattcttg tagacagaga tgatcaaggg     1140

acgttgcttc aaatcttcac aaaaccacta ggtgacaggc cgacgatatt tatagagata     1200

atccagagag taggatgcat gatgaaagat gaggaaggga aggcttacca gagtggagga     1260

tgtggtggtt ttggcaaagg caatttctct gagctcttca agtccattga agaatacgaa     1320

aagactcttg aagccaaaca gttagtggga tga                                  1353


<210>  26
<211>  450
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein encoded by SEQ ID No. 25


<220>
<221>  MISC_FEATURE
<222>  (2)..(2)
<223>  Ala

<220>
<221>  MISC_FEATURE
<222>  (3)..(8)
<223>  6 His

<400>  26

Met Ala His His His His His His Gln Asn Ala Ala Val Ser Glu Asn 
1               5                   10                  15      


Gln Asn His Asp Asp Gly Ala Ala Ser Ser Pro Gly Phe Lys Leu Val 
            20                  25                  30          


Gly Phe Ser Lys Phe Val Arg Lys Asn Pro Lys Ser Asp Lys Phe Lys 
        35                  40                  45              


Val Lys Arg Phe His His Ile Glu Phe Trp Cys Gly Asp Ala Thr Asn 
    50                  55                  60                  


Val Ala Arg Arg Phe Ser Trp Gly Leu Gly Met Arg Phe Ser Ala Lys 
65                  70                  75                  80  


Ser Asp Leu Ser Thr Gly Asn Met Val His Ala Ser Tyr Leu Leu Thr 
                85                  90                  95      


Ser Gly Asp Leu Arg Phe Leu Phe Thr Ala Pro Tyr Ser Pro Ser Leu 
            100                 105                 110         


Ser Ala Gly Glu Ile Lys Pro Thr Thr Thr Ala Ser Ile Pro Ser Phe 
        115                 120                 125             


Asp His Gly Ser Cys Arg Ser Phe Phe Ser Ser His Gly Leu Gly Val 
    130                 135                 140                 


Arg Ala Val Ala Ile Glu Val Glu Asp Ala Glu Ser Ala Phe Ser Ile 
145                 150                 155                 160 


Ser Val Ala Asn Gly Ala Ile Pro Ser Ser Pro Pro Ile Val Leu Asn 
                165                 170                 175     


Glu Ala Val Thr Ile Ala Glu Val Lys Leu Tyr Gly Asp Val Val Leu 
            180                 185                 190         


Arg Tyr Val Ser Tyr Lys Ala Glu Asp Thr Glu Lys Ser Glu Phe Leu 
        195                 200                 205             


Pro Gly Phe Glu Arg Val Glu Asp Ala Ser Ser Phe Pro Leu Asp Tyr 
    210                 215                 220                 


Gly Ile Arg Arg Leu Asp His Ala Val Gly Asn Val Pro Glu Leu Gly 
225                 230                 235                 240 


Pro Ala Leu Thr Tyr Val Ala Gly Phe Thr Gly Phe His Gln Phe Ala 
                245                 250                 255     


Glu Phe Thr Ala Asp Asp Val Gly Thr Ala Glu Ser Gly Leu Asn Ser 
            260                 265                 270         


Ala Val Leu Ala Ser Asn Asp Glu Met Val Leu Leu Pro Ile Asn Glu 
        275                 280                 285             


Pro Val His Gly Thr Lys Arg Lys Ser Gln Ile Gln Thr Tyr Leu Glu 
    290                 295                 300                 


His Asn Glu Gly Ala Gly Leu Gln His Leu Ala Leu Met Ser Glu Asp 
305                 310                 315                 320 


Ile Phe Arg Thr Leu Arg Glu Met Arg Lys Arg Ser Ser Ile Gly Gly 
                325                 330                 335     


Phe Asp Phe Met Pro Ser Pro Pro Pro Thr Tyr Tyr Gln Asn Leu Lys 
            340                 345                 350         


Lys Arg Val Gly Asp Val Leu Ser Asp Asp Gln Ile Lys Glu Cys Glu 
        355                 360                 365             


Glu Leu Gly Ile Leu Val Asp Arg Asp Asp Gln Gly Thr Leu Leu Gln 
    370                 375                 380                 


Ile Phe Thr Lys Pro Leu Gly Asp Arg Pro Thr Ile Phe Ile Glu Ile 
385                 390                 395                 400 


Ile Gln Arg Val Gly Cys Met Met Lys Asp Glu Glu Gly Lys Ala Tyr 
                405                 410                 415     


Gln Ser Gly Gly Cys Gly Gly Phe Gly Lys Gly Asn Phe Ser Glu Leu 
            420                 425                 430         


Phe Lys Ser Ile Glu Glu Tyr Glu Lys Thr Leu Glu Ala Lys Gln Leu 
        435                 440                 445             


Val Gly 
    450 


<210>  27
<211>  1712
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleid acid sequence encoding Arabidopsis thaliana HPPD 
       containing at the 5' end a nucleic acid encoding an OTP 
       (Optimized Tansit Peptide; SEQ ID No. 13


<220>
<221>  transit_peptide
<222>  (1)..(375)
<223>  Nucleic acid coding for optimized transit peptide to chloroplasts

<400>  27
atggcttcga tctcctcctc agtcgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatgtatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

agaatccggt gcgccatgca aaacgccgcc gtttcagaga atcaaaacca tgatgacggc      420

gctgcgtcgt cgccgggatt caagctcgtc ggattttcca agttcgtaag aaagaatcca      480

aagtctgata aattcaaggt taagcgcttc catcacatcg agttctggtg cggcgacgca      540

accaacgtcg ctcgtcgctt ctcctggggt ctggggatga gattctccgc caaatccgat      600

ctttccaccg gaaacatggt tcacgcctct tacctactca cctccggtga cctccgattc      660

cttttcactg ctccttactc tccgtctctc tccgccggag agattaaacc gacaaccaca      720

gcttctatcc caagtttcga tcacggctct tgtcgttcct tcttctcgtc acatggtctc      780

ggtgttagag ccgttgcgat tgaagtagaa gacgcagagt cagctttctc catcagtgta      840

gctaatggcg ctattccttc gtcgcctcct atcgtcctca atgaagcagt tacgatcgct      900

gaggttaaac tatacggcga tgttgttctc cgatatgtta gttacaaagc agaagatacc      960

gaaaaatccg aattcttgcc agggttcgag cgtgtagagg atgcgtcgtc gttcccattg     1020

gattatggta tccggcggct tgaccacgcc gtgggaaacg ttcctgagct tggtccggct     1080

ttaacttatg tagcggggtt cactggtttt caccaattcg cagagttcac agcagacgac     1140

gttggaaccg ccgagagcgg tttaaattca gcggtcctgg ctagcaatga tgaaatggtt     1200

cttctaccga ttaacgagcc agtgcacgga acaaagagga agagtcagat tcagacgtat     1260

ttggaacata acgaaggcgc agggctacaa catctggctc tgatgagtga agacatattc     1320

aggaccctga gagagatgag gaagaggagc agtattggag gattcgactt catgccttct     1380

cctccgccta cttactacca gaatctcaag aaacgggtcg gcgacgtgct cagcgatgat     1440

cagatcaagg agtgtgagga attagggatt cttgtagaca gagatgatca agggacgttg     1500

cttcaaatct tcacaaaacc actaggtgac aggccgacga tatttataga gataatccag     1560

agagtaggat gcatgatgaa agatgaggaa gggaaggctt accagagtgg aggatgtggt     1620

ggttttggca aaggcaattt ctctgagctc ttcaagtcca ttgaagaata cgaaaagact     1680

cttgaagcca aacagttagt gggatgatct ag                                   1712


<210>  28
<211>  569
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Protein of SEQ ID No. 24 plus the OTP sequence (SEQ ID No. 14) 
       located at the N-terminal extremity of the protein


<220>
<221>  TRANSIT
<222>  (1)..(125)
<223>  Optimized transit peptide to chloroplasts

<400>  28

Met Ala Ser Ile Ser Ser Ser Val Ala Thr Val Ser Arg Thr Ala Pro 
1               5                   10                  15      


Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu Lys Ser Asn Ala 
            20                  25                  30          


Ala Phe Pro Thr Thr Lys Lys Ala Asn Asp Phe Ser Thr Leu Pro Ser 
        35                  40                  45              


Asn Gly Gly Arg Val Gln Cys Met Gln Val Trp Pro Ala Tyr Gly Asn 
    50                  55                  60                  


Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Pro Leu Ser Met Ala Pro 
65                  70                  75                  80  


Thr Val Met Met Ala Ser Ser Ala Thr Ala Val Ala Pro Phe Gln Gly 
                85                  90                  95      


Leu Lys Ser Thr Ala Ser Leu Pro Val Ala Arg Arg Ser Ser Arg Ser 
            100                 105                 110         


Leu Gly Asn Val Ser Asn Gly Gly Arg Ile Arg Cys Ala Met Gln Asn 
        115                 120                 125             


Ala Ala Val Ser Glu Asn Gln Asn His Asp Asp Gly Ala Ala Ser Ser 
    130                 135                 140                 


Pro Gly Phe Lys Leu Val Gly Phe Ser Lys Phe Val Arg Lys Asn Pro 
145                 150                 155                 160 


Lys Ser Asp Lys Phe Lys Val Lys Arg Phe His His Ile Glu Phe Trp 
                165                 170                 175     


Cys Gly Asp Ala Thr Asn Val Ala Arg Arg Phe Ser Trp Gly Leu Gly 
            180                 185                 190         


Met Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr Gly Asn Met Val His 
        195                 200                 205             


Ala Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg Phe Leu Phe Thr Ala 
    210                 215                 220                 


Pro Tyr Ser Pro Ser Leu Ser Ala Gly Glu Ile Lys Pro Thr Thr Thr 
225                 230                 235                 240 


Ala Ser Ile Pro Ser Phe Asp His Gly Ser Cys Arg Ser Phe Phe Ser 
                245                 250                 255     


Ser His Gly Leu Gly Val Arg Ala Val Ala Ile Glu Val Glu Asp Ala 
            260                 265                 270         


Glu Ser Ala Phe Ser Ile Ser Val Ala Asn Gly Ala Ile Pro Ser Ser 
        275                 280                 285             


Pro Pro Ile Val Leu Asn Glu Ala Val Thr Ile Ala Glu Val Lys Leu 
    290                 295                 300                 


Tyr Gly Asp Val Val Leu Arg Tyr Val Ser Tyr Lys Ala Glu Asp Thr 
305                 310                 315                 320 


Glu Lys Ser Glu Phe Leu Pro Gly Phe Glu Arg Val Glu Asp Ala Ser 
                325                 330                 335     


Ser Phe Pro Leu Asp Tyr Gly Ile Arg Arg Leu Asp His Ala Val Gly 
            340                 345                 350         


Asn Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr Val Ala Gly Phe Thr 
        355                 360                 365             


Gly Phe His Gln Phe Ala Glu Phe Thr Ala Asp Asp Val Gly Thr Ala 
    370                 375                 380                 


Glu Ser Gly Leu Asn Ser Ala Val Leu Ala Ser Asn Asp Glu Met Val 
385                 390                 395                 400 


Leu Leu Pro Ile Asn Glu Pro Val His Gly Thr Lys Arg Lys Ser Gln 
                405                 410                 415     


Ile Gln Thr Tyr Leu Glu His Asn Glu Gly Ala Gly Leu Gln His Leu 
            420                 425                 430         


Ala Leu Met Ser Glu Asp Ile Phe Arg Thr Leu Arg Glu Met Arg Lys 
        435                 440                 445             


Arg Ser Ser Ile Gly Gly Phe Asp Phe Met Pro Ser Pro Pro Pro Thr 
    450                 455                 460                 


Tyr Tyr Gln Asn Leu Lys Lys Arg Val Gly Asp Val Leu Ser Asp Asp 
465                 470                 475                 480 


Gln Ile Lys Glu Cys Glu Glu Leu Gly Ile Leu Val Asp Arg Asp Asp 
                485                 490                 495     


Gln Gly Thr Leu Leu Gln Ile Phe Thr Lys Pro Leu Gly Asp Arg Pro 
            500                 505                 510         


Thr Ile Phe Ile Glu Ile Ile Gln Arg Val Gly Cys Met Met Lys Asp 
        515                 520                 525             


Glu Glu Gly Lys Ala Tyr Gln Ser Gly Gly Cys Gly Gly Phe Gly Lys 
    530                 535                 540                 


Gly Asn Phe Ser Glu Leu Phe Lys Ser Ile Glu Glu Tyr Glu Lys Thr 
545                 550                 555                 560 


Leu Glu Ala Lys Gln Leu Val Gly Ser 
                565                 


<210>  29
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for Zea mays plants

<400>  29
atgacgctcg aacaaaccct gacggacgca gagaagttgg cagatctgga tctggcgcag       60

ctcaagaccc tcgtgggcct ggtggagtac gacggctcgt ctgatccctt cccggtgagc      120

ggctgggacg cgatcgtgtg ggcagtgggc aatgctacgc aaacctctca tttctttcag      180

agcgccttcg ggatggagtt gatcgcctat tctggcccga ctaccggcaa cagggatcat      240

cacgcgttcg tattaaggag cggcgcggtt agattcgtca ttaccggcgc ggtcgatccg      300

gcatcaccca tcgcggatca ccatagaaga catggcgacg gggtcgtgga catcgctctt      360

accgtgccag acgttgatag gtgcattgcc catgcgcgcg cacagggtgc gaccgtattg      420

gcggagccgc acgacgtaag tgacgaacat ggaacagtgc ggttagctgc gatcgccact      480

tacggggata ccaggcatac actagtcgac cgtagccgct ataatggccc ctaccttcct      540

ggctacgcag agcgtacgtc aacgttcacc aagcgcgaag gagctcccaa gaggatattc      600

caggccttgg atcacgtggt cggaaatgtt gaactcggga ggatggatga atgggtaagc      660

ttctataatc gagtgatggg ctttacgaat atggccgaat tcgtggggga ggatatcgct      720

acggattact ccgccctcat gtcaaaggtg gtgtctaacg gaaaccaccg ggtaaaattc      780

cctttgaacg aaccagcgat cgcgaaaaag cgctctcaga tcgatgagta cctagattac      840

tatggtggcc caggcgcgca gcaccttgct cttgctacta atgatatctt gaccgcggtc      900

gatactttgc gcgcagaggg tattgagttc ctggcaacgc ctgatagcta ttacgaggac      960

cctgagctcc gggcaagaat tggcgaagtg cgtgtcccta tcgaggagct ccagaagaga     1020

ggcattctgg tggatagaga cgaggacggt tatttgcttc agatattcac caagcctctc     1080

gtggataggc ctacggtgtt ctttgagtta atcgagcggc acgggtcgct cggcttcggc     1140

attggcaatt tcaaagcatt gttcgaagcc attgagaggg agcaggcagc gcgtggtaac     1200

ttctag                                                                1206


<210>  30
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for Brassica napus plants

<400>  30
atgaccctgg agcaaacgtt aacagacgcg gagaaactgg ctgaccttga tttggctcag       60

ctcaagacac ttgtcggttt ggtcgaatac gatggatctt cagatccgtt ccccgttagc      120

ggttgggacg ctattgtgtg ggctgttggc aacgctactc agacttctca ctttttccag      180

agtgccttcg gtatggaact tatcgcttat agtggaccga caaccggaaa cagagatcac      240

catgcgttcg tactaagatc gggcgccgtt aggtttgtca tcactggtgc tgttgatcca      300

gcttcaccta ttgccgatca tcatcgtcga cacggcgatg gagtcgtgga tatagcttta      360

accgttcctg acgtcgatag gtgtatagct catgctcgtg cccagggagc tactgttctc      420

gcagaacctc acgacgtgtc agacgagcac ggaaccgtta gactagctgc tatcgcaact      480

tacggcgaca caagacacac gctagttgat agatcaaggt acaacggtcc gtatcttcct      540

ggttacgcgg agcgcactag cacttttaca aagcgcgagg gagcacctaa acggattttc      600

caagccctcg atcatgtggt tgggaatgtc gagctcggga gaatggacga gtgggtgtcg      660

ttctacaata gagtgatggg attcactaat atggcggagt tcgtcggaga agatatcgct      720

acagattata gtgcactaat gtccaaggta gttagtaatg gaaaccaccg cgtgaaattt      780

cctttaaacg agcccgctat tgccaaaaaa agatcacaga tcgatgagta cctagattat      840

tacggaggac caggtgctca gcacctcgcg ttagccacca atgatatcct taccgctgtt      900

gatactctca gggcagaggg aattgaattc ttagctacac cagatagtta ttatgaggat      960

cccgagctac gtgctcggat aggtgaggtc cgagttccga tcgaggagtt gcaaaagaga     1020

ggaatcttgg tcgaccgtga cgaggacgga tacctccttc aaatatttac gaaacctctt     1080

gtggaccgcc ctaccgtctt tttcgaactc atcgagcgtc atggatcact cggttttggg     1140

ataggtaact ttaaggcgct gtttgaagca atcgagcgag aacaagctgc aagagggaac     1200

ttttag                                                                1206


<210>  31
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for Beta vulgaris plants

<400>  31
atgacccttg aacaaacgtt aacagacgca gagaagcttg ctgatcttga cttggctcag       60

cttaagacat tggttggact cgttgaatat gatggatcta gcgatccctt tccagtctct      120

ggttgggatg cgatcgtctg ggctgtgggc aacgcaaccc aaacgtccca tttttttcaa      180

tcagcgtttg gcatggaact gatcgcatat tcgggcccaa caacaggcaa tagagaccac      240

cacgccttcg tcctgcgttc cggtgctgta aggtttgtaa ttactggtgc cgttgaccct      300

gcttctccta ttgctgatca ccaccgcagg cacggggacg gcgtggttga tatcgcttta      360

acggtgcctg acgtggatcg ctgcattgct catgccaggg cacaaggcgc tacagttttg      420

gcagagcctc atgatgttag tgatgaacac ggtactgttc ggttggcagc tattgcgaca      480

tatggagaca ctagacatac cctcgttgac agatctaggt ataacggtcc gtatcttcca      540

ggttatgctg aaaggacttc cactttcacc aagagggaag gagctcctaa aaggatcttc      600

caggcactgg atcacgttgt gggaaacgtg gaattgggtc ggatggatga atgggtgtct      660

ttttataaca gagtgatggg attcaccaat atggctgaat ttgtgggcga agacattgct      720

acagactatt cagctctgat gagtaaagtc gtgagcaatg gaaatcaccg ggtgaagttt      780

cctcttaacg aacctgctat cgctaaaaag cgttctcaaa ttgacgagta cttggactac      840

tatggtggac ctggtgcaca gcatttggct cttgccacaa atgatattct cacagcagtt      900

gatacactaa gagctgaagg gatagaattt ttggctaccc ctgatagtta ctatgaggat      960

ccagagctga gagcacgaat cggcgaagtt cgtgtcccta tagaagagtt acagaaaagg     1020

ggaatcctag tcgatagaga cgaggatggt tatttgttgc agatattcac aaaacctcta     1080

gtggatcgtc cgaccgtgtt tttcgaactt attgagagac atggatccct tggcttcggg     1140

ataggtaatt tcaaggcact tttcgaagca atagaaagag aacaagcagc tagaggaaat     1200

ttttag                                                                1206


<210>  32
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for Gossypium hirsutum plants

<400>  32
atgactcttg agcagacact aacggatgct gagaagttag ctgatttaga tctagctcaa       60

cttaaaacac tcgtgggact tgtagagtat gacggcagta gtgatccgtt tcccgtttct      120

ggatgggacg caattgtatg ggctgtgggt aatgctactc aaacttcaca tttctttcag      180

tcagcctttg ggatggaact gattgcgtac tctgggccaa ctactgggaa tagggaccat      240

catgcttttg ttctgaggag cggcgctgtc aggtttgtta ttaccggagc tgttgatccc      300

gcctccccga ttgctgatca tcacaggagg catggagacg gagtggtgga cattgctttg      360

acagttcccg acgtcgaccg ttgtatcgcc cacgcgagag cacaaggtgc gactgtcttg      420

gctgagcctc acgatgttag tgacgagcat ggtacagtta ggcttgcagc tatcgcaacc      480

tatggagata ctcgacacac tttggttgat aggagccgtt acaacggccc gtatcttcct      540

ggttatgcag aacgcacgtc tacttttact aagagagaag gagcacctaa aagaatattt      600

caagctctgg atcatgtggt gggaaacgtg gagctaggca ggatggatga gtgggttagt      660

ttctacaacc gagtcatggg attcacaaat atggctgaat tcgtcggaga ggacatcgct      720

acagactact ccgccctgat gtctaaagtt gtttccaatg gtaaccatag agttaagttc      780

ccacttaacg agcccgctat tgctaagaaa agatcccaaa tcgatgaata tctcgactat      840

tatggtgggc ccggtgccca acacctcgct ttagctacta acgatattct cactgctgtc      900

gatacactga gggctgaagg catagaattc ctagcaacgc ccgatagcta ctatgaagac      960

cctgagttaa gagcaaggat tggggaggta cgtgttccaa tagaggagct gcaaaagagg     1020

ggtattcttg tagatcgtga tgaagatggt tatttactac aaattttcac taagcccctc     1080

gtagaccgtc ctacagtgtt cttcgaactt atcgagagac atggctcctt ggggtttgga     1140

attggaaact ttaaggcttt gttcgaggca attgagagag aacaagcagc aaggggcaac     1200

ttttag                                                                1206


<210>  33
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for Glycine max plants

<400>  33
atgacactgg agcaaacttt gaccgacgct gaaaaattgg cagaccttga cttggcccag       60

ttgaagacgt tagtgggttt agttgagtac gatggatcgt ctgatccttt cccggtttct      120

ggatgggatg caattgtttg ggcagttgga aacgccactc agacatccca tttcttccaa      180

tctgcttttg gaatggaatt aattgcatat tcgggcccaa ccactggcaa ccgtgatcat      240

cacgcattcg tgctaagatc cggagcagtc cgattcgtca tcacgggcgc tgttgatccg      300

gcttccccca ttgctgatca ccatagacga catggtgatg gcgttgttga tattgcactc      360

acggttcctg atgttgacag atgcattgca cacgctcgtg ctcaaggcgc aaccgttttg      420

gcggaacccc atgatgtttc agatgaacac ggcaccgtga ggcttgcggc tattgctaca      480

tacggtgata cccgccacac ccttgttgat aggtcgagat ataatgggcc ttatcttccg      540

gggtatgcag agagaacttc aactttcact aaaagagaag gagcacccaa gcggatattc      600

caagctcttg accatgttgt gggaaatgtg gagttgggaa ggatggatga gtgggtttcg      660

ttctataaca gggttatggg ttttacgaat atggctgagt ttgtgggtga ggatatagca      720

acagattact cggctctcat gagtaaggtc gttagcaacg gtaaccatcg cgtgaagttc      780

cccttgaatg agcctgcaat cgccaaaaag aggtctcaga tcgacgagta cttggactac      840

tatggggggc caggagctca acatcttgca ttggccacaa acgacatcct taccgccgtt      900

gacacattgc gtgctgaagg gattgaattc ttggcaaccc cagattcata ctacgaagac      960

cctgagctcc gtgccagaat cggggaagtt cgtgtaccta tagaagaact gcagaagcga     1020

ggcattcttg ttgacagaga cgaggatggt taccttcttc agattttcac caaaccattg     1080

gtagatcggc ctactgtctt ttttgagctg attgaacgac atgggtcctt aggtttcgga     1140

attggcaatt ttaaagctct tttcgaggct attgaaaggg agcaggctgc gagaggtaac     1200

ttttag                                                                1206


<210>  34
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for Hordeum vulgare plants

<400>  34
atgacgctcg agcaaacact taccgacgct gaaaagcttg ccgatctgga tctggctcag       60

ttgaagaccc tggtaggtct cgtcgagtac gacgggtcaa gcgacccatt cccagtcagc      120

gggtgggatg ccatcgtgtg ggccgtgggg aacgcgaccc agacctcaca cttcttccaa      180

tctgcgttcg gcatggagct tattgcctac tcagggccta caaccggaaa ccgcgaccat      240

catgccttcg tcctccgctc cggggccgtc agattcgtca tcaccggcgc cgtggacccc      300

gcttcaccta tcgccgacca tcatcggcgt cacggcgatg gggtggttga tatcgcctta      360

accgtccccg atgtcgaccg ctgcatagca catgccaggg cccaaggcgc caccgtccta      420

gcggagcccc acgatgtaag cgatgagcac ggcacggtcc gcctcgccgc gatcgcaaca      480

tacggggata cgcgacacac tctcgtggat aggtcacggt acaacgggcc atacttgccg      540

ggttacgccg agaggacatc gacgtttaca aagagggagg gagcccctaa gaggatattc      600

caggccctag accacgtggt gggaaatgtg gagctcggta ggatggacga atgggtgtca      660

ttctataacc gcgttatggg gttcacgaac atggctgagt ttgtcggtga agacatcgca      720

actgactaca gcgcgttgat gtccaaggta gtgagcaacg gcaaccaccg ggtgaagttt      780

cccttgaacg aaccggcaat tgccaaaaag cggagccaaa tcgacgagta cctagactac      840

tacgggggtc cgggcgctca acacctcgcg ttagctacga acgatattct gaccgccgtc      900

gacaccctgc gcgcagaggg aattgagttc cttgccacac ccgactccta ctacgaagac      960

ccggagctcc gcgcccgtat aggggaggtc agagttccca tcgaagaact gcagaagcgg     1020

ggcatcctcg tggacagaga cgaagacggt tacctgctcc aaatcttcac caagcccctg     1080

gtggataggc ccaccgtctt ctttgaacta attgagaggc acggttcctt gggattcggg     1140

atcggcaact tcaaggcgct ctttgaagcc atcgagcggg agcaggccgc gagaggcaac     1200

ttttag                                                                1206


<210>  35
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for Oryza sativa plants

<400>  35
atgacactcg agcagacact aacggatgct gagaagttgg ctgaccttga cctcgcgcag       60

ctcaaaacct tggttggact ggtggagtac gacgggtcga gcgatccgtt cccagtgtca      120

ggttgggacg ccatagtctg ggctgtaggg aatgctactc aaacatcaca ttttttccag      180

tccgcgttcg ggatggaact tatcgcttac tccggcccta ccaccggaaa tcgcgaccac      240

cacgctttcg tccttcgatc cggagccgtt cgtttcgtga ttaccggtgc cgtggatccc      300

gcaagcccta tcgccgatca tcatcggcgt cacggagacg gagtggtgga tatcgctctc      360

actgttccgg atgtcgatag gtgcatagct catgccaggg cacaaggagc gacagtttta      420

gcagagccac acgatgttag cgatgaacat gggactgtca ggttggccgc gattgccacg      480

tatggcgata ccaggcacac actggtggac cgctcaagat acaacggccc atatctgcct      540

ggctatgcgg agcggacaag cacattcact aagagagagg gcgcccccaa gcgaatattc      600

caggccctgg accatgtggt tggcaacgtc gagttgggcc gaatggatga atgggtgagc      660

ttctataacc gggtcatggg attcaccaac atggcggagt tcgtaggtga ggacatcgct      720

acagattact cggctttgat gagcaaggtg gtgagtaatg gtaaccacag ggtgaaattt      780

ccactgaacg agccggcgat cgccaagaaa agatcacaaa ttgatgagta tcttgattac      840

tatggcggac caggggccca gcatttggcg ttagcgacaa atgatattct cacggcggtg      900

gacaccctca gggcggaagg gatagagttc cttgcgactc cagattccta ttacgaagac      960

ccggagctca gggcaagaat tggggaggtt agggtcccca ttgaggaatt acagaagcgc     1020

gggatcctgg ttgaccggga cgaagatggc tacttgctcc agattttcac taaacccctc     1080

gtagacagac cgaccgtttt cttcgagctc atagagcgcc acggatcact gggctttggc     1140

ataggtaact tcaaggctct cttcgaggca atcgaaaggg agcaagcggc cagggggaac     1200

ttctag                                                                1206


<210>  36
<211>  1206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for Triticum aestivum plants

<400>  36
atgactctag aacaaacgct gacggacgcg gagaagctcg ccgatcttga tcttgcgcaa       60

ttaaagaccc tcgttggcct tgtcgaatat gacggcagct ccgatccatt cccggtctca      120

ggctgggacg cgattgtatg ggccgttggt aacgccaccc agacgagtca cttttttcag      180

tcggcgtttg gtatggagtt aatagcgtac tcagggccca ccaccggcaa ccgcgatcac      240

catgctttcg tgcttagatc tggcgcggtg aggttcgtca tcactggggc ggtcgacccg      300

gcatccccga tagcagacca tcaccgccgt cacggggacg gtgtggtgga catagcattg      360

actgtaccag acgtggatcg gtgtatagcc catgcgcgag cgcaaggcgc aacggtactg      420

gcagagcctc atgacgtgag cgacgaacac ggcactgttc gcctggcggc aatagcgaca      480

tatggggaca cgcgacacac cctggtcgac agatcacgct acaacgggcc ctacctgcct      540

ggatacgctg agaggacatc gaccttcacc aaaagggagg gcgcgccgaa gcgcatcttt      600

caagccctcg accacgtggt tgggaacgtc gagcttggca ggatggacga gtgggtgagt      660

ttctataacc gcgtcatggg cttcacgaac atggccgagt tcgtcggtga agatatcgca      720

acagattaca gcgccctgat gtcgaaggtg gtcagcaacg gaaatcatcg tgtgaagttc      780

ccgttgaacg aaccagcgat tgccaagaag aggagccaaa tcgacgagta cctggattat      840

tacggcgggc ctggagcaca gcatctggct ttggctacta atgatatcct gactgcggtg      900

gacacgctca gagccgaggg cattgaattc ttggctaccc cagatagtta ctacgaagac      960

cccgagctgc gcgcgcgaat cggcgaagtc cgcgtgccca tcgaggagct gcagaagcgt     1020

ggaatcttgg tggatcggga tgaggacgga tatctactgc aaattttcac aaagccgctc     1080

gtggatcgtc ccaccgtgtt tttcgagctt atcgagcggc atgggtcact aggcttcgga     1140

atcggcaact ttaaggctct gtttgaggcg atcgagaggg agcaagccgc gcgggggaat     1200

ttttag                                                                1206


<210>  37
<211>  1578
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding Amycolicicoccus subflavus HPPD 
       optimized for dicotyledonous plants containing at the 5' end a 
       nucleic acid sequence encoding an optimized transit peptide 
       (according to SEQ ID No. 13).

<400>  37
atggcttcga tctcctcctc agttgcgacc gttagccgga ccgcccctgc tcaggccaac       60

atggtggctc cgttcaccgg ccttaagtcc aacgccgcct tccccaccac caagaaggct      120

aacgacttct ccacccttcc cagcaacggt ggaagagttc aatatatgca ggtgtggccg      180

gcctacggca acaagaagtt cgagacgctg tcgtacctgc cgccgctgtc tatggcgccc      240

accgtgatga tggcctcgtc ggccaccgcc gtcgctccgt tccaggggct caagtccacc      300

gccagcctcc ccgtcgcccg ccgctcctcc agaagcctcg gcaacgtcag caacggcgga      360

aggatccggt gcatgaccct tgagcaaacc cttaccgacg ctgagaagct cgctgatctt      420

gatcttgctc agcttaagac ccttgtgggc cttgttgaat acgacggctc tagtgatccc      480

ttcccagtta gtggttggga cgctattgtg tgggctgttg gtaacgctac tcagacaagt      540

cacttctttc agtcagcctt cgggatggaa ctgatcgctt attcaggacc tactaccggt      600

aatagggatc accacgcttt cgtgcttagg tcaggtgctg ttagattcgt gattaccggc      660

gctgttgatc ccgctagtcc tattgctgat catcataggc gtcacggtga cggcgttgtg      720

gatattgctc ttactgttcc agacgtggac cgttgtattg ctcacgctag ggctcaaggt      780

gctactgttc ttgctgaacc tcacgacgtt agtgacgagc acggaactgt taggcttgct      840

gctattgcta cctacggtga cactaggcac acccttgttg ataggtctag gtataacggc      900

ccttacctcc caggttacgc tgagagaact agcaccttca ctaagaggga aggtgcccct      960

aagaggatct ttcaggctct tgatcacgtt gtgggtaacg tggaacttgg tagaatggac     1020

gagtgggtta gcttctataa tagggtgatg ggcttcacta atatggccga gttcgtgggc     1080

gaggatattg ctactgatta ctcagccctg atgtctaaag tggttagtaa cggtaatcac     1140

agggttaagt tcccacttaa cgagcccgct atcgctaaga aacgtagtca gattgacgag     1200

tacctcgact actacggtgg accaggtgct caacaccttg ctcttgctac taacgatatt     1260

ctcaccgccg tggatactct tagggctgag ggaattgagt tcctcgctac cccagatagt     1320

tactacgagg acccagaact tagggctagg attggagagg ttagagtgcc tatcgaggaa     1380

cttcagaaga ggggaatcct cgttgatagg gacgaggacg gttacctcct tcagattttc     1440

actaagcccc tcgtggatag gcctaccgtt ttcttcgaac ttatcgagcg tcacggatca     1500

ctcggattcg ggatcggtaa ctttaaggct ctcttcgagg ctatcgagag agagcaagct     1560

gctaggggca acttctag                                                   1578






2

