                         SEQUENCE LISTING

<110>  DSM IP Assets B.V.
 
<120>  KAURENOIC ACID HYDROXYLASES

<130>  32301-WO-PCT

<150>  EP 16202945.8
<151>  2016-12-08

<160>  25    

<170>  PatentIn version 3.5

<210>  1
<211>  525
<212>  PRT
<213>  Arabidopsis thaliana

<400>  1

Met Glu Ser Leu Val Val His Thr Val Asn Ala Ile Trp Cys Ile Val 
1               5                   10                  15      


Ile Val Gly Ile Phe Ser Val Gly Tyr His Val Tyr Gly Arg Ala Val 
            20                  25                  30          


Val Glu Gln Trp Arg Met Arg Arg Ser Leu Lys Leu Gln Gly Val Lys 
        35                  40                  45              


Gly Pro Pro Pro Ser Ile Phe Asn Gly Asn Val Ser Glu Met Gln Arg 
    50                  55                  60                  


Ile Gln Ser Glu Ala Lys His Cys Ser Gly Asp Asn Ile Ile Ser His 
65                  70                  75                  80  


Asp Tyr Ser Ser Ser Leu Phe Pro His Phe Asp His Trp Arg Lys Gln 
                85                  90                  95      


Tyr Gly Arg Ile Tyr Thr Tyr Ser Thr Gly Leu Lys Gln His Leu Tyr 
            100                 105                 110         


Ile Asn His Pro Glu Met Val Lys Glu Leu Ser Gln Thr Asn Thr Leu 
        115                 120                 125             


Asn Leu Gly Arg Ile Thr His Ile Thr Lys Arg Leu Asn Pro Ile Leu 
    130                 135                 140                 


Gly Asn Gly Ile Ile Thr Ser Asn Gly Pro His Trp Ala His Gln Arg 
145                 150                 155                 160 


Arg Ile Ile Ala Tyr Glu Phe Thr His Asp Lys Ile Lys Gly Met Val 
                165                 170                 175     


Gly Leu Met Val Glu Ser Ala Met Pro Met Leu Asn Lys Trp Glu Glu 
            180                 185                 190         


Met Val Lys Arg Gly Gly Glu Met Gly Cys Asp Ile Arg Val Asp Glu 
        195                 200                 205             


Asp Leu Lys Asp Val Ser Ala Asp Val Ile Ala Lys Ala Cys Phe Gly 
    210                 215                 220                 


Ser Ser Phe Ser Lys Gly Lys Ala Ile Phe Ser Met Ile Arg Asp Leu 
225                 230                 235                 240 


Leu Thr Ala Ile Thr Lys Arg Ser Val Leu Phe Arg Phe Asn Gly Phe 
                245                 250                 255     


Thr Asp Met Val Phe Gly Ser Lys Lys His Gly Asp Val Asp Ile Asp 
            260                 265                 270         


Ala Leu Glu Met Glu Leu Glu Ser Ser Ile Trp Glu Thr Val Lys Glu 
        275                 280                 285             


Arg Glu Ile Glu Cys Lys Asp Thr His Lys Lys Asp Leu Met Gln Leu 
    290                 295                 300                 


Ile Leu Glu Gly Ala Met Arg Ser Cys Asp Gly Asn Leu Trp Asp Lys 
305                 310                 315                 320 


Ser Ala Tyr Arg Arg Phe Val Val Asp Asn Cys Lys Ser Ile Tyr Phe 
                325                 330                 335     


Ala Gly His Asp Ser Thr Ala Val Ser Val Ser Trp Cys Leu Met Leu 
            340                 345                 350         


Leu Ala Leu Asn Pro Ser Trp Gln Val Lys Ile Arg Asp Glu Ile Leu 
        355                 360                 365             


Ser Ser Cys Lys Asn Gly Ile Pro Asp Ala Glu Ser Ile Pro Asn Leu 
    370                 375                 380                 


Lys Thr Val Thr Met Val Ile Gln Glu Thr Met Arg Leu Tyr Pro Pro 
385                 390                 395                 400 


Ala Pro Ile Val Gly Arg Glu Ala Ser Lys Asp Ile Arg Leu Gly Asp 
                405                 410                 415     


Leu Val Val Pro Lys Gly Val Cys Ile Trp Thr Leu Ile Pro Ala Leu 
            420                 425                 430         


His Arg Asp Pro Glu Ile Trp Gly Pro Asp Ala Asn Asp Phe Lys Pro 
        435                 440                 445             


Glu Arg Phe Ser Glu Gly Ile Ser Lys Ala Cys Lys Tyr Pro Gln Ser 
    450                 455                 460                 


Tyr Ile Pro Phe Gly Leu Gly Pro Arg Thr Cys Val Gly Lys Asn Phe 
465                 470                 475                 480 


Gly Met Met Glu Val Lys Val Leu Val Ser Leu Ile Val Ser Lys Phe 
                485                 490                 495     


Ser Phe Thr Leu Ser Pro Thr Tyr Gln His Ser Pro Ser His Lys Leu 
            500                 505                 510         


Leu Val Glu Pro Gln His Gly Val Val Ile Arg Val Val 
        515                 520                 525 


<210>  2
<211>  1578
<212>  DNA
<213>  Artificial sequence

<220>
<223>  kaurenoic acid 13-hydroxylase from Arabidopsis thaliana, 
       codon-pair optimized for expression in Yarrowia lipolytica.

<400>  2
atggagtctc tggttgtcca caccgtcaac gccatctggt gcattgtcat tgtcggtatc       60

ttctccgtcg gctaccacgt ctacggccga gctgttgtcg agcagtggcg aatgcgacga      120

tctctcaagc tccagggtgt caagggtcct cctccctcca tcttcaacgg taacgtttcc      180

gagatgcagc gaatccagtc cgaggccaag cactgctccg gtgacaacat catctcccac      240

gactactctt cttctctgtt cccccacttt gaccactggc gaaagcagta cggccgaatc      300

tacacctact ccactggcct caagcagcac ctctacatca accaccccga gatggtcaag      360

gagctctccc agaccaacac cctcaacctc ggccgaatca cccacatcac caagcgactc      420

aaccccattc tcggtaacgg tatcatcacc tccaacggcc cccactgggc ccaccagcga      480

cgaatcattg cctacgagtt cacccacgac aagatcaagg gtatggtcgg tctgatggtc      540

gagtccgcca tgcccatgct caacaagtgg gaggagatgg tcaagcgagg tggtgagatg      600

ggctgtgaca tccgagtcga cgaggacctc aaggatgtct ccgctgacgt cattgccaag      660

gcctgtttcg gctcttcctt ctccaagggc aaggccatct tctccatgat ccgagatctg      720

ctcaccgcca tcaccaagcg atccgtcctc ttccgattca acggtttcac cgacatggtt      780

ttcggctcca agaagcacgg tgacgttgac attgacgctc tcgagatgga gctcgagtcc      840

tccatctggg agactgtcaa ggagcgagag attgagtgca aggacaccca caagaaggac      900

ctcatgcagc tcattctcga gggtgccatg cgatcttgtg acggtaacct gtgggacaag      960

tctgcttacc gacgattcgt tgtcgacaac tgcaagtcca tctactttgc cggccacgac     1020

tccaccgccg tttccgtttc ttggtgcctc atgctgctcg ctctcaaccc ctcttggcag     1080

gtcaagatcc gagatgagat tctgtcctcc tgcaagaacg gtatccccga cgccgagtcc     1140

atccccaacc tcaagaccgt caccatggtc atccaggaga ctatgcgact ctaccctccc     1200

gctcccattg tcggccgaga ggcctccaag gacattcgac tcggtgatct ggttgtcccc     1260

aagggtgtct gtatctggac cctcatcccc gctctgcacc gagatcccga gatctggggt     1320

cccgacgcca acgacttcaa gcccgagcga ttctccgagg gtatctccaa ggcctgcaag     1380

tacccccagt cctacatccc ctttggcctc ggcccccgaa cctgtgtcgg caagaacttt     1440

ggtatgatgg aggtcaaggt cctcgtttct ctgattgtct ccaagttctc cttcactctg     1500

tctcccacct accagcactc tccctcccac aagctgctcg tcgagcccca gcacggtgtt     1560

gtcatccgag ttgtataa                                                   1578


<210>  3
<211>  525
<212>  PRT
<213>  Artificial sequence

<220>
<223>  kaurenoic acid 13-hydroxylase polypeptide

<400>  3

Met Glu Ser Leu Val Val His Thr Val Asn Ala Ile Trp Cys Ile Val 
1               5                   10                  15      


Ile Val Gly Ile Phe Ser Val Gly Tyr His Val Tyr Gly Arg Ala Val 
            20                  25                  30          


Val Glu Gln Trp Arg Met Arg Arg Ser Leu Lys Leu Gln Gly Val Lys 
        35                  40                  45              


Gly Pro Pro Pro Ser Ile Phe Asn Gly Asn Val Ser Glu Met Gln Arg 
    50                  55                  60                  


Ile Gln Ser Glu Ala Lys His Asn Ser Gly Asp Asn Ile Ile Ser His 
65                  70                  75                  80  


Asp Tyr Ser Ser Thr Leu Phe Pro His Phe Asp His Trp Arg Lys Gln 
                85                  90                  95      


Tyr Gly Arg Ile Tyr Thr Tyr Ser Thr Gly Leu Arg Gln His Leu Tyr 
            100                 105                 110         


Ile Asn His Pro Glu Met Val Lys Glu Leu Ser Gln Thr Asn Ser Leu 
        115                 120                 125             


Asp Leu Gly Arg Ile Thr His Ile Thr Lys Arg Leu Ala Pro Ile Leu 
    130                 135                 140                 


Gly Asn Gly Ile Ile Thr Ser Asn Gly Pro His Trp Ala His Gln Arg 
145                 150                 155                 160 


Arg Ile Ile Ala Tyr Glu Phe Thr His Asp Lys Val Lys Gly Met Val 
                165                 170                 175     


Gly Leu Met Val Glu Ser Ala Met Pro Met Leu Asn Lys Trp Glu Glu 
            180                 185                 190         


Met Val Glu Ala Glu Gly Gly Met Gly Cys Asp Ile Arg Val Asp Glu 
        195                 200                 205             


Asp Leu Lys Asp Val Ser Ala Asp Val Ile Ala Lys Ala Cys Phe Gly 
    210                 215                 220                 


Ser Asn Phe Ser Lys Gly Lys Ala Ile Phe Ser Lys Ile Arg Asp Leu 
225                 230                 235                 240 


Leu Thr Ala Ile Thr Lys Arg Ser Val Leu Phe Arg Phe Asn Gly Phe 
                245                 250                 255     


Thr Asp Met Val Phe Gly Ser Lys Lys His Gly Asp Val Asp Ile Asp 
            260                 265                 270         


Ala Leu Glu Met Glu Leu Glu Ser Ser Ile Trp Glu Thr Val Lys Glu 
        275                 280                 285             


Arg Glu Arg Glu Cys Lys Asp Thr His Lys Lys Asp Leu Leu Gln Leu 
    290                 295                 300                 


Ile Leu Glu Gly Ala Met Arg Ser Cys Asp Gly Asn Leu Trp Asp Lys 
305                 310                 315                 320 


Ser Ala Tyr Arg Arg Phe Val Val Asp Asn Cys Lys Ser Ile Tyr Phe 
                325                 330                 335     


Ala Gly His Asp Ser Thr Ala Val Ser Val Ser Trp Cys Leu Met Leu 
            340                 345                 350         


Leu Ala Leu Asn Pro Ser Trp Gln Glu Lys Ile Arg Asp Glu Ile Leu 
        355                 360                 365             


Ser Ser Cys Lys Asn Gly Ile Pro Asp Ala Glu Ser Ile Pro Asn Leu 
    370                 375                 380                 


Lys Thr Val Thr Met Val Ile Gln Glu Thr Met Arg Leu Tyr Pro Pro 
385                 390                 395                 400 


Ala Pro Ile Val Gly Arg Glu Ala Ser Lys Asp Ile Arg Leu Gly Asp 
                405                 410                 415     


Leu Val Val Pro Lys Gly Val Cys Ile Trp Thr Leu Ile Pro Ala Leu 
            420                 425                 430         


His Arg Asp Pro Glu Ile Trp Gly Pro Asp Ala Asn Asp Phe Lys Pro 
        435                 440                 445             


Glu Arg Phe Ser Glu Gly Ile Ser Lys Ala Cys Lys Tyr Pro Gln Ala 
    450                 455                 460                 


Tyr Ile Pro Phe Gly Leu Gly Pro Arg Thr Cys Val Gly Lys Asn Phe 
465                 470                 475                 480 


Gly Met Met Glu Val Lys Val Leu Val Ser Leu Ile Val Ser Lys Phe 
                485                 490                 495     


Ser Phe Thr Leu Ser Pro Thr Tyr Gln His Ser Pro Ser His Lys Leu 
            500                 505                 510         


Leu Val Glu Pro Gln His Gly Val Val Ile Arg Val Val 
        515                 520                 525 


<210>  4
<211>  1578
<212>  DNA
<213>  Artificial sequence

<220>
<223>  kaurenoic acid 13-hydroxylase encoding sequence optimized for 
       expression in Yarrowia lipolytica

<400>  4
atggagtctc tggttgtcca caccgtcaac gccatctggt gcattgtcat tgtcggtatc       60

ttctccgtcg gctaccacgt ctacggccga gccgttgtcg agcagtggcg aatgcgacga      120

tctctcaagc tccagggtgt caagggtcct cctccctcca tcttcaacgg taacgtttcc      180

gagatgcagc gaatccagtc cgaggccaag cacaactccg gtgacaacat catctcccac      240

gactactcct ccactctctt cccccacttt gaccactggc gaaagcagta cggccgaatc      300

tacacctact ccaccggtct gcgacagcac ctctacatca accaccccga gatggtcaag      360

gaactgtccc agaccaactc tctcgatctc ggtcgaatca cccacatcac caagcgactc      420

gcccccattc tcggcaacgg tatcatcacc tccaacggcc cccactgggc ccaccagcga      480

cgaatcattg cttacgagtt cacccacgac aaggtcaagg gtatggtcgg cctcatggtc      540

gagtccgcca tgcccatgct caacaagtgg gaggagatgg tcgaggctga gggtggtatg      600

ggctgtgaca tccgagtcga cgaggacctc aaggacgttt ctgccgatgt cattgccaag      660

gcctgctttg gctccaactt ctccaagggc aaggccattt tctccaagat ccgagatctg      720

ctcaccgcca ttaccaagcg atccgtcctc ttccgattca acggtttcac cgacatggtt      780

ttcggctcca agaagcacgg tgacgttgac attgatgctc tcgagatgga gctggagtcc      840

tccatctggg agactgtcaa ggagcgagag cgagagtgca aggacaccca caagaaggac      900

ctcctccagc tcattctcga gggtgccatg cgatcttgtg acggtaacct gtgggacaag      960

tccgcctacc gacgatttgt tgttgacaac tgcaagtcca tctactttgc cggccacgac     1020

tccaccgccg tttctgtctc ttggtgcctc atgctgctgg ctctcaaccc ctcttggcag     1080

gagaagatcc gtgacgagat tctctcttct tgtaagaacg gtatccccga tgctgagtcc     1140

atccccaacc tcaagaccgt caccatggtc atccaggaga ctatgcgact ctaccctccc     1200

gctcccattg tcggccgaga ggcctccaag gacatccgac tcggtgatct cgttgtcccc     1260

aagggtgtct gcatctggac cctcatcccc gctctgcacc gggaccccga aatctggggc     1320

cccgacgcca acgacttcaa gcccgagcga ttctccgagg gtatctccaa ggcttgcaag     1380

tacccccagg cctacatccc cttcggtctg ggcccccgaa cctgtgtcgg caagaacttc     1440

ggtatgatgg aggtcaaggt ccttgtctct ctcattgtct ccaagttctc cttcactctg     1500

tctcccacct accagcactc tccctcccac aagctcctcg ttgagcccca gcacggtgtt     1560

gtcatccgag tggtgtaa                                                   1578


<210>  5
<211>  525
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  kaurenoic acid 13-hydroxylase polypeptide

<400>  5

Met Glu Ser Leu Val Val His Thr Val Asn Ala Ile Trp Cys Ile Val 
1               5                   10                  15      


Ile Val Gly Ile Phe Ser Val Gly Tyr His Val Tyr Gly Arg Ala Val 
            20                  25                  30          


Val Glu Gln Trp Arg Met Arg Arg Ser Leu Lys Leu Gln Gly Val Lys 
        35                  40                  45              


Gly Pro Pro Pro Ser Ile Phe Asn Gly Asn Val Ser Glu Met Gln Arg 
    50                  55                  60                  


Ile Gln Ser Glu Ala Lys His Asn Ser Gly Asp Asn Ile Ile Ser His 
65                  70                  75                  80  


Asp Tyr Ser Ser Thr Leu Phe Pro His Phe Asp His Trp Arg Lys Gln 
                85                  90                  95      


Tyr Gly Arg Ile Tyr Thr Tyr Ser Thr Gly Leu Arg Gln His Leu Tyr 
            100                 105                 110         


Ile Asn His Pro Glu Met Val Lys Glu Leu Ser Gln Thr Asn Ser Leu 
        115                 120                 125             


Asp Leu Gly Arg Ile Thr His Met Thr Lys Arg Leu Ala Pro Ile Leu 
    130                 135                 140                 


Gly Asn Gly Ile Ile Thr Ser Asn Gly Pro His Trp Ala His Gln Arg 
145                 150                 155                 160 


Arg Ile Ile Ala Tyr Glu Phe Thr His Asp Lys Val Lys Gly Met Val 
                165                 170                 175     


Gly Leu Met Val Glu Ser Ala Met Pro Met Leu Asn Lys Trp Glu Glu 
            180                 185                 190         


Met Val Glu Ala Glu Gly Gly Met Gly Cys Asp Ile Arg Val Asp Glu 
        195                 200                 205             


Asp Leu Lys Asp Val Ser Ala Asp Val Ile Ala Lys Ala Cys Phe Gly 
    210                 215                 220                 


Ser Asn Phe Ser Lys Gly Lys Ala Ile Phe Ser Lys Ile Arg Asp Leu 
225                 230                 235                 240 


Leu Thr Ala Ile Thr Lys Arg Ser Val Leu Phe Arg Phe Asn Gly Phe 
                245                 250                 255     


Thr Asp Met Val Phe Gly Ser Lys Lys His Gly Asp Val Asp Ile Asp 
            260                 265                 270         


Ala Leu Glu Met Glu Leu Glu Ser Ser Ile Trp Glu Thr Val Lys Glu 
        275                 280                 285             


Arg Glu Arg Glu Cys Lys Asp Thr His Lys Lys Asp Leu Leu Gln Leu 
    290                 295                 300                 


Ile Leu Glu Gly Ala Met Arg Ser Cys Asp Gly Asn Leu Trp Asp Lys 
305                 310                 315                 320 


Ser Ala Tyr Arg Arg Phe Val Val Asp Asn Cys Lys Ser Ile Tyr Phe 
                325                 330                 335     


Ala Gly His Asp Ser Thr Ala Val Ser Val Ser Trp Cys Leu Met Leu 
            340                 345                 350         


Leu Ala Leu Asn Pro Ser Trp Gln Glu Lys Ile Arg Asp Glu Ile Leu 
        355                 360                 365             


Ser Ser Cys Lys Asn Gly Ile Pro Asp Ala Glu Ser Ile Pro Asn Leu 
    370                 375                 380                 


Lys Thr Val Thr Met Val Ile Gln Glu Thr Met Arg Leu Tyr Pro Pro 
385                 390                 395                 400 


Ala Pro Ile Val Gly Arg Glu Ala Ser Lys Asp Ile Arg Leu Gly Asp 
                405                 410                 415     


Leu Val Val Pro Lys Gly Val Cys Ile Trp Thr Leu Ile Pro Ala Leu 
            420                 425                 430         


His Arg Asp Pro Glu Ile Trp Gly Pro Asp Ala Asn Asp Phe Lys Pro 
        435                 440                 445             


Glu Arg Phe Ser Glu Gly Ile Ser Lys Ala Cys Lys Tyr Pro Gln Ala 
    450                 455                 460                 


Tyr Ile Pro Phe Gly Leu Gly Pro Arg Thr Cys Val Gly Lys Asn Phe 
465                 470                 475                 480 


Gly Met Met Glu Val Lys Val Leu Val Ser Leu Ile Val Ser Lys Phe 
                485                 490                 495     


Ser Phe Thr Leu Ser Pro Thr Tyr Gln His Ser Pro Ser His Lys Leu 
            500                 505                 510         


Leu Val Glu Pro Gln His Gly Val Val Ile Arg Val Val 
        515                 520                 525 


<210>  6
<211>  1578
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  kaurenoic acid 13-hydroxylase encoding sequence optimized for 
       expression in Yarrowia lipolytica

<400>  6
atggagtctc tggttgtcca caccgtcaac gccatctggt gcattgtcat tgtcggtatc       60

ttctccgtcg gctaccacgt ctacggccga gccgttgtcg agcagtggcg aatgcgacga      120

tctctcaagc tccagggtgt caagggtcct cctccctcca tcttcaacgg taacgtttcc      180

gagatgcagc gaatccagtc cgaggccaag cacaactccg gtgacaacat catctcccac      240

gactactcct ccactctctt cccccacttt gaccactggc gaaagcagta cggccgaatc      300

tacacctact ccaccggtct gcgacagcac ctctacatca accaccccga gatggtcaag      360

gaactgtccc agaccaactc tctcgatctc ggtcgaatca cccacatgac caagcgactc      420

gcccccattc tcggcaacgg tatcatcacc tccaacggcc cccactgggc ccaccagcga      480

cgaatcattg cttacgagtt cacccacgac aaggtcaagg gtatggtcgg cctcatggtc      540

gagtccgcca tgcccatgct caacaagtgg gaggagatgg tcgaggctga gggtggtatg      600

ggctgtgaca tccgagtcga cgaggacctc aaggacgttt ctgccgatgt cattgccaag      660

gcctgctttg gctccaactt ctccaagggc aaggccattt tctccaagat ccgagatctg      720

ctcaccgcca ttaccaagcg atccgtcctc ttccgattca acggtttcac cgacatggtt      780

ttcggctcca agaagcacgg tgacgttgac attgatgctc tcgagatgga gctggagtcc      840

tccatctggg agactgtcaa ggagcgagag cgagagtgca aggacaccca caagaaggac      900

ctcctccagc tcattctcga gggtgccatg cgatcttgtg acggtaacct gtgggacaag      960

tccgcctacc gacgatttgt tgttgacaac tgcaagtcca tctactttgc cggccacgac     1020

tccaccgccg tttctgtctc ttggtgcctc atgctgctgg ctctcaaccc ctcttggcag     1080

gagaagatcc gtgacgagat tctctcttct tgtaagaacg gtatccccga tgctgagtcc     1140

atccccaacc tcaagaccgt caccatggtc atccaggaga ctatgcgact ctaccctccc     1200

gctcccattg tcggccgaga ggcctccaag gacatccgac tcggtgatct cgttgtcccc     1260

aagggtgtct gcatctggac cctcatcccc gctctgcacc gggaccccga aatctggggc     1320

cccgacgcca acgacttcaa gcccgagcga ttctccgagg gtatctccaa ggcttgcaag     1380

tacccccagg cctacatccc cttcggtctg ggcccccgaa cctgtgtcgg caagaacttc     1440

ggtatgatgg aggtcaaggt ccttgtctct ctcattgtct ccaagttctc cttcactctg     1500

tctcccacct accagcactc tccctcccac aagctcctcg ttgagcccca gcacggtgtt     1560

gtcatccgag tggtgtaa                                                   1578


<210>  7
<211>  525
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  kaurenoic acid 13-hydroxylase polypeptide

<400>  7

Met Glu Ser Leu Val Val His Thr Val Asn Ala Ile Trp Cys Ile Val 
1               5                   10                  15      


Ile Val Gly Ile Phe Ser Val Gly Tyr His Val Tyr Gly Arg Ala Val 
            20                  25                  30          


Val Glu Gln Trp Arg Met Arg Arg Ser Leu Lys Leu Gln Gly Val Lys 
        35                  40                  45              


Gly Pro Pro Pro Ser Ile Phe Asn Gly Asn Val Ser Glu Met Gln Arg 
    50                  55                  60                  


Ile Gln Ser Glu Ala Lys His Asn Ser Gly Asp Asn Ile Ile Ser His 
65                  70                  75                  80  


Asp Tyr Ser Ser Thr Leu Phe Pro His Phe Asp His Trp Arg Lys Gln 
                85                  90                  95      


Tyr Gly Arg Ile Tyr Thr Tyr Ser Thr Gly Leu Arg Gln His Leu Tyr 
            100                 105                 110         


Ile Asn His Pro Glu Met Val Lys Glu Leu Ser Gln Thr Asn Ser Leu 
        115                 120                 125             


Asp Leu Gly Arg Ile Thr His Ile Thr Lys Arg Leu Ala Pro Ile Leu 
    130                 135                 140                 


Gly Asn Gly Ile Ile Thr Ser Asn Gly Pro His Trp Ala His Gln Arg 
145                 150                 155                 160 


Arg Ile Ile Ala Tyr Glu Phe Thr His Asp Lys Val Lys Gly Met Val 
                165                 170                 175     


Gly Leu Met Val Glu Ser Ala Met Pro Met Leu Asn Lys Trp Glu Glu 
            180                 185                 190         


Met Val Glu Ala Glu Gly Gly Met Gly Cys Asp Ile Arg Val Asp Glu 
        195                 200                 205             


Asp Leu Lys Asp Val Ser Ala Asp Val Ile Ala Lys Ala Cys Phe Gly 
    210                 215                 220                 


Ser Asn Phe Ser Lys Gly Lys Ala Ile Phe Ser Lys Ile Arg Asp Leu 
225                 230                 235                 240 


Leu Thr Ala Ile Thr Lys Arg Ser Val Leu Phe Arg Phe Asn Gly Phe 
                245                 250                 255     


Thr Asp Met Val Phe Gly Ser Lys Lys His Gly Asp Val Asp Ile Asp 
            260                 265                 270         


Ala Leu Glu Met Glu Leu Glu Ser Ser Ile Trp Glu Thr Val Lys Glu 
        275                 280                 285             


Arg Glu Arg Glu Cys Lys Asp Thr His Lys Lys Asp Leu Leu Gln Leu 
    290                 295                 300                 


Ile Leu Glu Gly Ala Met Arg Ser Cys Asp Gly Asn Leu Trp Asp Lys 
305                 310                 315                 320 


Ser Ala Tyr Arg Arg Phe Val Val Asp Asn Cys Lys Ser Ile Tyr Ser 
                325                 330                 335     


Ala Gly His Asp Ser Thr Ala Val Ser Val Ser Trp Cys Leu Met Leu 
            340                 345                 350         


Leu Ala Leu Asn Pro Ser Trp Gln Glu Lys Ile Arg Asp Glu Ile Leu 
        355                 360                 365             


Ser Ser Cys Lys Asn Gly Ile Pro Asp Ala Glu Ser Ile Pro Asn Leu 
    370                 375                 380                 


Lys Thr Val Thr Met Val Ile Gln Glu Thr Met Arg Leu Tyr Pro Pro 
385                 390                 395                 400 


Ala Pro Ile Val Gly Arg Glu Ala Ser Lys Asp Ile Arg Leu Gly Asp 
                405                 410                 415     


Leu Val Val Pro Lys Gly Val Cys Ile Trp Thr Leu Ile Pro Ala Leu 
            420                 425                 430         


His Arg Asp Pro Glu Ile Trp Gly Pro Asp Ala Asn Asp Phe Lys Pro 
        435                 440                 445             


Glu Arg Phe Ser Glu Gly Ile Ser Lys Ala Cys Lys Tyr Pro Gln Ala 
    450                 455                 460                 


Tyr Ile Pro Phe Gly Leu Gly Pro Arg Thr Cys Val Gly Lys Asn Phe 
465                 470                 475                 480 


Gly Met Met Glu Val Lys Val Leu Val Ser Leu Ile Val Ser Lys Phe 
                485                 490                 495     


Ser Phe Thr Leu Ser Pro Thr Tyr Gln His Ser Pro Ser His Lys Leu 
            500                 505                 510         


Leu Val Glu Pro Gln His Gly Val Val Ile Arg Val Val 
        515                 520                 525 


<210>  8
<211>  1578
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  kaurenoic acid 13-hydroxylase encoding sequence optimized for 
       expression in Yarrowia lipolytica

<400>  8
atggagtctc tggttgtcca caccgtcaac gccatctggt gcattgtcat tgtcggtatc       60

ttctccgtcg gctaccacgt ctacggccga gccgttgtcg agcagtggcg aatgcgacga      120

tctctcaagc tccagggtgt caagggtcct cctccctcca tcttcaacgg taacgtttcc      180

gagatgcagc gaatccagtc cgaggccaag cacaactccg gtgacaacat catctcccac      240

gactactcct ccactctctt cccccacttt gaccactggc gaaagcagta cggccgaatc      300

tacacctact ccaccggtct gcgacagcac ctctacatca accaccccga gatggtcaag      360

gaactgtccc agaccaactc tctcgatctc ggtcgaatca cccacatcac caagcgactc      420

gcccccattc tcggcaacgg tatcatcacc tccaacggcc cccactgggc ccaccagcga      480

cgaatcattg cttacgagtt cacccacgac aaggtcaagg gtatggtcgg cctcatggtc      540

gagtccgcca tgcccatgct caacaagtgg gaggagatgg tcgaggctga gggtggtatg      600

ggctgtgaca tccgagtcga cgaggacctc aaggacgttt ctgccgatgt cattgccaag      660

gcctgctttg gctccaactt ctccaagggc aaggccattt tctccaagat ccgagatctg      720

ctcaccgcca ttaccaagcg atccgtcctc ttccgattca acggtttcac cgacatggtt      780

ttcggctcca agaagcacgg tgacgttgac attgatgctc tcgagatgga gctggagtcc      840

tccatctggg agactgtcaa ggagcgagag cgagagtgca aggacaccca caagaaggac      900

ctcctccagc tcattctcga gggtgccatg cgatcttgtg acggtaacct gtgggacaag      960

tccgcctacc gacgatttgt tgttgacaac tgcaagtcca tctactccgc cggccacgac     1020

tccaccgccg tttctgtctc ttggtgcctc atgctgctgg ctctcaaccc ctcttggcag     1080

gagaagatcc gtgacgagat tctctcttct tgtaagaacg gtatccccga tgctgagtcc     1140

atccccaacc tcaagaccgt caccatggtc atccaggaga ctatgcgact ctaccctccc     1200

gctcccattg tcggccgaga ggcctccaag gacatccgac tcggtgatct cgttgtcccc     1260

aagggtgtct gcatctggac cctcatcccc gctctgcacc gggaccccga aatctggggc     1320

cccgacgcca acgacttcaa gcccgagcga ttctccgagg gtatctccaa ggcttgcaag     1380

tacccccagg cctacatccc cttcggtctg ggcccccgaa cctgtgtcgg caagaacttc     1440

ggtatgatgg aggtcaaggt ccttgtctct ctcattgtct ccaagttctc cttcactctg     1500

tctcccacct accagcactc tccctcccac aagctcctcg ttgagcccca gcacggtgtt     1560

gtcatccgag tggtgtaa                                                   1578


<210>  9
<211>  525
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  kaurenoic acid 13-hydroxylase polypeptide

<400>  9

Met Glu Ser Leu Val Val His Thr Val Asn Ala Ile Trp Cys Ile Val 
1               5                   10                  15      


Ile Val Gly Ile Phe Ser Val Gly Tyr His Val Tyr Gly Arg Ala Val 
            20                  25                  30          


Val Glu Gln Trp Arg Met Arg Arg Ser Leu Lys Leu Gln Gly Val Lys 
        35                  40                  45              


Gly Pro Pro Pro Ser Ile Phe Asn Gly Asn Val Ser Glu Met Gln Arg 
    50                  55                  60                  


Ile Gln Ser Glu Ala Lys His Asn Ser Gly Asp Asn Ile Ile Ser His 
65                  70                  75                  80  


Asp Tyr Ser Ser Thr Leu Phe Pro His Phe Asp His Trp Arg Lys Gln 
                85                  90                  95      


Tyr Gly Arg Ile Tyr Thr Tyr Ser Thr Gly Leu Arg Gln His Leu Tyr 
            100                 105                 110         


Ile Asn His Pro Glu Met Val Lys Glu Leu Ser Gln Thr Asn Ser Leu 
        115                 120                 125             


Asp Leu Gly Arg Ile Thr His Val Thr Lys Arg Leu Ala Pro Ile Leu 
    130                 135                 140                 


Gly Asn Gly Ile Ile Thr Ser Asn Gly Pro His Trp Ala His Gln Arg 
145                 150                 155                 160 


Arg Ile Ile Ala Tyr Glu Phe Thr His Asp Lys Val Lys Gly Met Val 
                165                 170                 175     


Gly Leu Met Val Glu Ser Ala Met Pro Met Leu Asn Lys Trp Glu Glu 
            180                 185                 190         


Met Val Glu Ala Glu Gly Gly Met Gly Cys Asp Ile Arg Val Asp Glu 
        195                 200                 205             


Asp Leu Lys Asp Val Ser Ala Asp Val Ile Ala Lys Ala Cys Phe Gly 
    210                 215                 220                 


Ser Asn Phe Ser Lys Gly Lys Ala Ile Phe Ser Lys Ile Arg Asp Leu 
225                 230                 235                 240 


Leu Thr Ala Ile Thr Lys Arg Ser Val Leu Phe Arg Phe Asn Gly Phe 
                245                 250                 255     


Thr Asp Met Val Phe Gly Ser Lys Lys His Gly Asp Val Asp Ile Asp 
            260                 265                 270         


Ala Leu Glu Met Glu Leu Glu Ser Ser Ile Trp Glu Thr Val Lys Glu 
        275                 280                 285             


Arg Glu Arg Glu Cys Lys Asp Thr His Lys Lys Asp Leu Leu Gln Leu 
    290                 295                 300                 


Ile Leu Glu Gly Ala Met Arg Ser Cys Asp Gly Asn Leu Trp Asp Lys 
305                 310                 315                 320 


Ser Ala Tyr Arg Arg Phe Val Val Asp Asn Cys Lys Ser Ile Tyr Phe 
                325                 330                 335     


Ala Gly His Asp Ser Thr Ala Val Ser Val Ser Trp Cys Leu Met Leu 
            340                 345                 350         


Leu Ala Leu Asn Pro Ser Trp Gln Glu Lys Ile Arg Asp Glu Ile Leu 
        355                 360                 365             


Ser Ser Cys Lys Asn Gly Ile Pro Asp Ala Glu Ser Ile Pro Asn Leu 
    370                 375                 380                 


Lys Thr Val Thr Met Val Ile Gln Glu Thr Met Arg Leu Tyr Pro Pro 
385                 390                 395                 400 


Ala Pro Ile Val Gly Arg Glu Ala Ser Lys Asp Ile Arg Leu Gly Asp 
                405                 410                 415     


Leu Val Val Pro Lys Gly Val Cys Ile Trp Thr Leu Ile Pro Ala Leu 
            420                 425                 430         


His Arg Asp Pro Glu Ile Trp Gly Pro Asp Ala Asn Asp Phe Lys Pro 
        435                 440                 445             


Glu Arg Phe Ser Glu Gly Ile Ser Lys Ala Cys Lys Tyr Pro Gln Ala 
    450                 455                 460                 


Tyr Ile Pro Phe Gly Leu Gly Pro Arg Thr Cys Val Gly Lys Asn Phe 
465                 470                 475                 480 


Gly Met Met Glu Val Lys Val Leu Val Ser Leu Ile Val Ser Lys Phe 
                485                 490                 495     


Ser Phe Thr Leu Ser Pro Thr Tyr Gln His Ser Pro Ser His Lys Leu 
            500                 505                 510         


Leu Val Glu Pro Gln His Gly Val Val Ile Arg Val Val 
        515                 520                 525 


<210>  10
<211>  1578
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  kaurenoic acid 13-hydroxylase encoding sequence optimized for 
       expression in Yarrowia lipolytica

<400>  10
atggagtctc tggttgtcca caccgtcaac gccatctggt gcattgtcat tgtcggtatc       60

ttctccgtcg gctaccacgt ctacggccga gccgttgtcg agcagtggcg aatgcgacga      120

tctctcaagc tccagggtgt caagggtcct cctccctcca tcttcaacgg taacgtttcc      180

gagatgcagc gaatccagtc cgaggccaag cacaactccg gtgacaacat catctcccac      240

gactactcct ccactctctt cccccacttt gaccactggc gaaagcagta cggccgaatc      300

tacacctact ccaccggtct gcgacagcac ctctacatca accaccccga gatggtcaag      360

gaactgtccc agaccaactc tctcgatctc ggtcgaatca cccacgtcac caagcgactc      420

gcccccattc tcggcaacgg tatcatcacc tccaacggcc cccactgggc ccaccagcga      480

cgaatcattg cttacgagtt cacccacgac aaggtcaagg gtatggtcgg cctcatggtc      540

gagtccgcca tgcccatgct caacaagtgg gaggagatgg tcgaggctga gggtggtatg      600

ggctgtgaca tccgagtcga cgaggacctc aaggacgttt ctgccgatgt cattgccaag      660

gcctgctttg gctccaactt ctccaagggc aaggccattt tctccaagat ccgagatctg      720

ctcaccgcca ttaccaagcg atccgtcctc ttccgattca acggtttcac cgacatggtt      780

ttcggctcca agaagcacgg tgacgttgac attgatgctc tcgagatgga gctggagtcc      840

tccatctggg agactgtcaa ggagcgagag cgagagtgca aggacaccca caagaaggac      900

ctcctccagc tcattctcga gggtgccatg cgatcttgtg acggtaacct gtgggacaag      960

tccgcctacc gacgatttgt tgttgacaac tgcaagtcca tctactttgc cggccacgac     1020

tccaccgccg tttctgtctc ttggtgcctc atgctgctgg ctctcaaccc ctcttggcag     1080

gagaagatcc gtgacgagat tctctcttct tgtaagaacg gtatccccga tgctgagtcc     1140

atccccaacc tcaagaccgt caccatggtc atccaggaga ctatgcgact ctaccctccc     1200

gctcccattg tcggccgaga ggcctccaag gacatccgac tcggtgatct cgttgtcccc     1260

aagggtgtct gcatctggac cctcatcccc gctctgcacc gggaccccga aatctggggc     1320

cccgacgcca acgacttcaa gcccgagcga ttctccgagg gtatctccaa ggcttgcaag     1380

tacccccagg cctacatccc cttcggtctg ggcccccgaa cctgtgtcgg caagaacttc     1440

ggtatgatgg aggtcaaggt ccttgtctct ctcattgtct ccaagttctc cttcactctg     1500

tctcccacct accagcactc tccctcccac aagctcctcg ttgagcccca gcacggtgtt     1560

gtcatccgag tggtgtaa                                                   1578


<210>  11
<211>  525
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  kaurenoic acid 13-hydroxylase polypeptide

<400>  11

Met Glu Ser Leu Val Val His Thr Val Asn Ala Ile Trp Cys Ile Val 
1               5                   10                  15      


Ile Val Gly Ile Phe Ser Val Gly Tyr His Val Tyr Gly Arg Ala Val 
            20                  25                  30          


Val Glu Gln Trp Arg Met Arg Arg Ser Leu Lys Leu Gln Gly Val Lys 
        35                  40                  45              


Gly Pro Pro Pro Ser Ile Phe Asn Gly Asn Val Ser Glu Met Gln Arg 
    50                  55                  60                  


Ile Gln Ser Glu Ala Lys His Asn Ser Gly Asp Asn Ile Ile Ser His 
65                  70                  75                  80  


Asp Tyr Ser Ser Thr Leu Phe Pro His Phe Asp His Trp Arg Lys Gln 
                85                  90                  95      


Tyr Gly Arg Ile Tyr Thr Tyr Ser Thr Gly Leu Arg Gln His Leu Tyr 
            100                 105                 110         


Ile Asn His Pro Glu Met Val Lys Glu Leu Ser Gln Thr Asn Ser Leu 
        115                 120                 125             


Asp Leu Gly Arg Ile Thr His Ile Thr Lys Arg Leu Ala Pro Ile Leu 
    130                 135                 140                 


Gly Asn Gly Ile Ile Thr Ser Asn Gly Pro His Trp Ala His Gln Arg 
145                 150                 155                 160 


Arg Ile Ile Ala Tyr Glu Phe Thr His Asp Lys Val Lys Gly Met Val 
                165                 170                 175     


Gly Leu Met Val Glu Ser Ala Met Pro Met Leu Asn Lys Trp Glu Glu 
            180                 185                 190         


Met Val Glu Ala Glu Gly Gly Met Gly Cys Asp Ile Arg Val Asp Glu 
        195                 200                 205             


Asp Leu Lys Asp Val Ser Ala Asp Val Ile Ala Lys Ala Cys Phe Gly 
    210                 215                 220                 


Ser Asn Phe Ser Lys Gly Lys Ala Ile Phe Ser Lys Ile Arg Asp Leu 
225                 230                 235                 240 


Leu Thr Ala Ile Thr Lys Arg Asn Val Leu Phe Arg Phe Asn Gly Phe 
                245                 250                 255     


Thr Asp Met Val Phe Gly Ser Lys Lys His Gly Asp Val Asp Ile Asp 
            260                 265                 270         


Ala Leu Glu Met Glu Leu Glu Ser Ser Ile Trp Glu Thr Val Lys Glu 
        275                 280                 285             


Arg Glu Arg Glu Cys Lys Asp Thr His Lys Lys Asp Leu Leu Gln Leu 
    290                 295                 300                 


Ile Leu Glu Gly Ala Met Arg Ser Cys Asp Gly Asn Leu Trp Asp Lys 
305                 310                 315                 320 


Ser Ala Tyr Arg Arg Phe Val Val Asp Asn Cys Lys Ser Ile Tyr Phe 
                325                 330                 335     


Ala Gly His Asp Ser Thr Ala Val Ser Val Ser Trp Cys Leu Met Leu 
            340                 345                 350         


Leu Ala Leu Asn Pro Ser Trp Gln Glu Lys Ile Arg Asp Glu Ile Leu 
        355                 360                 365             


Ser Ser Cys Lys Asn Gly Ile Pro Asp Ala Glu Ser Ile Pro Asn Leu 
    370                 375                 380                 


Lys Thr Val Thr Met Val Ile Gln Glu Thr Met Arg Leu Tyr Pro Pro 
385                 390                 395                 400 


Ala Pro Ile Val Gly Arg Glu Ala Ser Lys Asp Ile Arg Leu Gly Asp 
                405                 410                 415     


Leu Val Val Pro Lys Gly Val Cys Ile Trp Thr Leu Ile Pro Ala Leu 
            420                 425                 430         


His Arg Asp Pro Glu Ile Trp Gly Pro Asp Ala Asn Asp Phe Lys Pro 
        435                 440                 445             


Glu Arg Phe Ser Glu Gly Ile Ser Lys Ala Cys Lys Tyr Pro Gln Ala 
    450                 455                 460                 


Tyr Ile Pro Phe Gly Leu Gly Pro Arg Thr Cys Val Gly Lys Asn Phe 
465                 470                 475                 480 


Gly Met Met Glu Val Lys Val Leu Val Ser Leu Ile Val Ser Lys Phe 
                485                 490                 495     


Ser Phe Thr Leu Ser Pro Thr Tyr Gln His Ser Pro Ser His Lys Leu 
            500                 505                 510         


Leu Val Glu Pro Gln His Gly Val Val Ile Arg Val Val 
        515                 520                 525 


<210>  12
<211>  1578
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  kaurenoic acid 13-hydroxylase encoding sequence optimized for 
       expression in Yarrowia lipolytica

<400>  12
atggagtctc tggttgtcca caccgtcaac gccatctggt gcattgtcat tgtcggtatc       60

ttctccgtcg gctaccacgt ctacggccga gccgttgtcg agcagtggcg aatgcgacga      120

tctctcaagc tccagggtgt caagggtcct cctccctcca tcttcaacgg taacgtttcc      180

gagatgcagc gaatccagtc cgaggccaag cacaactccg gtgacaacat catctcccac      240

gactactcct ccactctctt cccccacttt gaccactggc gaaagcagta cggccgaatc      300

tacacctact ccaccggtct gcgacagcac ctctacatca accaccccga gatggtcaag      360

gaactgtccc agaccaactc tctcgatctc ggtcgaatca cccacatcac caagcgactc      420

gcccccattc tcggcaacgg tatcatcacc tccaacggcc cccactgggc ccaccagcga      480

cgaatcattg cttacgagtt cacccacgac aaggtcaagg gtatggtcgg cctcatggtc      540

gagtccgcca tgcccatgct caacaagtgg gaggagatgg tcgaggctga gggtggtatg      600

ggctgtgaca tccgagtcga cgaggacctc aaggacgttt ctgccgatgt cattgccaag      660

gcctgctttg gctccaactt ctccaagggc aaggccattt tctccaagat ccgagatctg      720

ctcaccgcca ttaccaagcg aaacgtcctc ttccgattca acggtttcac cgacatggtt      780

ttcggctcca agaagcacgg tgacgttgac attgatgctc tcgagatgga gctggagtcc      840

tccatctggg agactgtcaa ggagcgagag cgagagtgca aggacaccca caagaaggac      900

ctcctccagc tcattctcga gggtgccatg cgatcttgtg acggtaacct gtgggacaag      960

tccgcctacc gacgatttgt tgttgacaac tgcaagtcca tctactttgc cggccacgac     1020

tccaccgccg tttctgtctc ttggtgcctc atgctgctgg ctctcaaccc ctcttggcag     1080

gagaagatcc gtgacgagat tctctcttct tgtaagaacg gtatccccga tgctgagtcc     1140

atccccaacc tcaagaccgt caccatggtc atccaggaga ctatgcgact ctaccctccc     1200

gctcccattg tcggccgaga ggcctccaag gacatccgac tcggtgatct cgttgtcccc     1260

aagggtgtct gcatctggac cctcatcccc gctctgcacc gggaccccga aatctggggc     1320

cccgacgcca acgacttcaa gcccgagcga ttctccgagg gtatctccaa ggcttgcaag     1380

tacccccagg cctacatccc cttcggtctg ggcccccgaa cctgtgtcgg caagaacttc     1440

ggtatgatgg aggtcaaggt ccttgtctct ctcattgtct ccaagttctc cttcactctg     1500

tctcccacct accagcactc tccctcccac aagctcctcg ttgagcccca gcacggtgtt     1560

gtcatccgag tggtgtaa                                                   1578


<210>  13
<211>  525
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  kaurenoic acid 13-hydroxylase polypeptide

<400>  13

Met Glu Ser Leu Val Val His Thr Val Asn Ala Ile Trp Cys Ile Val 
1               5                   10                  15      


Ile Val Gly Ile Phe Ser Val Gly Tyr His Val Tyr Gly Arg Ala Val 
            20                  25                  30          


Val Glu Gln Trp Arg Met Arg Arg Ser Leu Lys Leu Gln Gly Val Lys 
        35                  40                  45              


Gly Pro Pro Pro Ser Ile Phe Asn Gly Asn Val Ser Glu Met Gln Arg 
    50                  55                  60                  


Ile Gln Ser Glu Ala Lys His Asn Ser Gly Asp Asn Ile Ile Ser His 
65                  70                  75                  80  


Asp Tyr Ser Ser Thr Leu Phe Pro His Phe Asp His Trp Arg Lys Gln 
                85                  90                  95      


Tyr Gly Arg Ile Tyr Thr Tyr Ser Thr Gly Leu Arg Gln His Leu Tyr 
            100                 105                 110         


Ile Asn His Pro Glu Met Val Lys Glu Leu Ser Gln Thr Asn Ser Leu 
        115                 120                 125             


Asp Leu Gly Arg Ile Thr His Ile Thr Lys Arg Leu Ala Pro Ile Leu 
    130                 135                 140                 


Gly Asn Gly Ile Ile Thr Ser Asn Gly Pro His Trp Ala His Gln Arg 
145                 150                 155                 160 


Arg Ile Ile Ala Tyr Glu Phe Thr His Asp Lys Val Lys Gly Met Val 
                165                 170                 175     


Gly Leu Met Val Glu Ser Ala Met Pro Met Leu Asn Lys Trp Glu Glu 
            180                 185                 190         


Met Val Glu Ala Glu Gly Gly Met Gly Cys Asp Ile Arg Val Asp Glu 
        195                 200                 205             


Asp Leu Lys Asp Val Ser Ala Asp Val Ile Ala Lys Ala Cys Phe Gly 
    210                 215                 220                 


Ser Asn Phe Ser Lys Gly Lys Ala Ile Phe Ser Lys Ile Arg Asp Leu 
225                 230                 235                 240 


Leu Thr Ala Ile Thr Lys Arg Ser Val Leu Phe Arg Phe Asn Gly Phe 
                245                 250                 255     


Thr Asp Met Val Phe Gly Ser Lys Lys His Gly Asp Val Asp Ile Asp 
            260                 265                 270         


Ala Leu Glu Met Glu Leu Glu Ser Ser Ile Trp Glu Thr Val Lys Glu 
        275                 280                 285             


Arg Glu Arg Glu Cys Lys Asp Thr His Lys Lys Asp Leu Leu Gln Leu 
    290                 295                 300                 


Ile Leu Glu Gly Ala Met Arg Ser Cys Asp Gly Asn Leu Trp Asp Lys 
305                 310                 315                 320 


Ser Ala Tyr Arg Arg Phe Val Val Asp Asn Cys Lys Ser Ile Tyr Phe 
                325                 330                 335     


Ala Gly His Asp Ser Thr Ala Val Ser Val Ser Trp Cys Leu Met Leu 
            340                 345                 350         


Leu Ala Leu Asn Pro Ser Trp Gln Glu Lys Ile Arg Asp Glu Ile Leu 
        355                 360                 365             


Ser Ser Cys Lys Asn Gly Ile Pro Asp Ala Glu Ser Ile Pro Asn Leu 
    370                 375                 380                 


Lys Thr Val Thr Met Val Ile Gln Glu Thr Met Arg Leu Tyr Pro Pro 
385                 390                 395                 400 


Ala Pro Gly Val Gly Arg Glu Ala Ser Lys Asp Ile Arg Leu Gly Asp 
                405                 410                 415     


Leu Val Val Pro Lys Gly Val Cys Ile Trp Thr Leu Ile Pro Ala Leu 
            420                 425                 430         


His Arg Asp Pro Glu Ile Trp Gly Pro Asp Ala Asn Asp Phe Lys Pro 
        435                 440                 445             


Glu Arg Phe Ser Glu Gly Ile Ser Lys Ala Cys Lys Tyr Pro Gln Ala 
    450                 455                 460                 


Tyr Ile Pro Phe Gly Leu Gly Pro Arg Thr Cys Val Gly Lys Asn Phe 
465                 470                 475                 480 


Gly Met Met Glu Val Lys Val Leu Val Ser Leu Ile Val Ser Lys Phe 
                485                 490                 495     


Ser Phe Thr Leu Ser Pro Thr Tyr Gln His Ser Pro Ser His Lys Leu 
            500                 505                 510         


Leu Val Glu Pro Gln His Gly Val Val Ile Arg Val Val 
        515                 520                 525 


<210>  14
<211>  1578
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  kaurenoic acid 13-hydroxylase encoding sequence optimized for 
       expression in Yarrowia lipolytica

<400>  14
atggagtctc tggttgtcca caccgtcaac gccatctggt gcattgtcat tgtcggtatc       60

ttctccgtcg gctaccacgt ctacggccga gccgttgtcg agcagtggcg aatgcgacga      120

tctctcaagc tccagggtgt caagggtcct cctccctcca tcttcaacgg taacgtttcc      180

gagatgcagc gaatccagtc cgaggccaag cacaactccg gtgacaacat catctcccac      240

gactactcct ccactctctt cccccacttt gaccactggc gaaagcagta cggccgaatc      300

tacacctact ccaccggtct gcgacagcac ctctacatca accaccccga gatggtcaag      360

gaactgtccc agaccaactc tctcgatctc ggtcgaatca cccacatcac caagcgactc      420

gcccccattc tcggcaacgg tatcatcacc tccaacggcc cccactgggc ccaccagcga      480

cgaatcattg cttacgagtt cacccacgac aaggtcaagg gtatggtcgg cctcatggtc      540

gagtccgcca tgcccatgct caacaagtgg gaggagatgg tcgaggctga gggtggtatg      600

ggctgtgaca tccgagtcga cgaggacctc aaggacgttt ctgccgatgt cattgccaag      660

gcctgctttg gctccaactt ctccaagggc aaggccattt tctccaagat ccgagatctg      720

ctcaccgcca ttaccaagcg atccgtcctc ttccgattca acggtttcac cgacatggtt      780

ttcggctcca agaagcacgg tgacgttgac attgatgctc tcgagatgga gctggagtcc      840

tccatctggg agactgtcaa ggagcgagag cgagagtgca aggacaccca caagaaggac      900

ctcctccagc tcattctcga gggtgccatg cgatcttgtg acggtaacct gtgggacaag      960

tccgcctacc gacgatttgt tgttgacaac tgcaagtcca tctactttgc cggccacgac     1020

tccaccgccg tttctgtctc ttggtgcctc atgctgctgg ctctcaaccc ctcttggcag     1080

gagaagatcc gtgacgagat tctctcttct tgtaagaacg gtatccccga tgctgagtcc     1140

atccccaacc tcaagaccgt caccatggtc atccaggaga ctatgcgact ctaccctccc     1200

gctcccggtg tcggccgaga ggcctccaag gacatccgac tcggtgatct cgttgtcccc     1260

aagggtgtct gcatctggac cctcatcccc gctctgcacc gggaccccga aatctggggc     1320

cccgacgcca acgacttcaa gcccgagcga ttctccgagg gtatctccaa ggcttgcaag     1380

tacccccagg cctacatccc cttcggtctg ggcccccgaa cctgtgtcgg caagaacttc     1440

ggtatgatgg aggtcaaggt ccttgtctct ctcattgtct ccaagttctc cttcactctg     1500

tctcccacct accagcactc tccctcccac aagctcctcg ttgagcccca gcacggtgtt     1560

gtcatccgag tggtgtaa                                                   1578


<210>  15
<211>  1503
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  hydroxymethylglutaryl-CoA reductase from Yarrowia lipolytica, CpO
       for expression in Yarrowia lipolytica

<400>  15
atgacccagt ctgtgaaggt ggttgagaag cacgttccta tcgtcattga gaagcccagc       60

gagaaggagg aggacacctc ttctgaagac tccattgagc tgactgtcgg aaagcagccc      120

aagcccgtga ccgagacccg ttctctggac gacttggagg ctatcatgaa ggcaggtaag      180

accaagctcc tggaggacca cgaggttgtc aagctctctc tcgaaggcaa gctccctttg      240

tatgctcttg agaagcagct tggtgacaac acccgagctg ttggcatccg acgatctatc      300

atctcccagc agtctaatac caagactctt gagacctcaa agctccctta cctgcactac      360

gactacgacc gtgtttttgg agcctgttgc gagaacgtta ttggttacat gcctctcccc      420

gttggtgttg ctggccccat gaacattgat ggcaagaact accacattcc tatggccacc      480

actgagggtt gtcttgttgc ctcaaccatg cgaggttgca aggccatcaa cgccggtggc      540

ggtgttacca ctgtgcttac tcaggacggt atgacacgag gtccttgtgt ttccttcccc      600

tctctcaagc gggctggagc cgctaagatc tggcttgatt ccgaggaggg tctcaagtcc      660

atgcgaaagg ccttcaactc cacctctcga tttgctcgtc tccagtctct tcactctacc      720

cttgctggta acctgctgtt tattcgattc cgaaccacca ctggtgatgc catgggcatg      780

aacatgatct ccaagggcgt cgaacactct ctggccgtca tggtcaagga gtacggcttc      840

cctgatatgg acattgtgtc tgtctcgggt aactactgca ctgacaagaa gcccgcagcg      900

atcaactgga tcgaaggccg aggcaagagt gttgttgccg aagccaccat ccctgctcac      960

attgtcaagt ctgttctcaa aagtgaggtt gacgctcttg ttgagctcaa catcagcaag     1020

aatctgatcg gtagtgccat ggctggctct gtgggaggtt tcaatgcaca cgccgcaaac     1080

ctggtgaccg ccatctacct tgccactggc caggatcctg ctcagaatgt cgagtcttcc     1140

aactgcatca cgctgatgag caacgtcgac ggtaacctgc tcatctccgt ttccatgcct     1200

tctatcgagg tcggtaccat tggtggaggt actattttgg agccccaggg tgctatgctg     1260

gagatgcttg gcgtgcgagg tcctcacatc gagacccccg gtgccaacgc ccaacagctt     1320

gctcgcatca ttgcttctgg agttcttgca gcggagcttt cgctgtgttc tgctcttgct     1380

gccggccatc ttgtgcaaag tcatatgacc cacaaccgtt cccaggctcc tactccggcc     1440

aagcagtctc aggccgatct gcagcgtctc caaaacggtt cgaatatctg cattcggtca     1500

tag                                                                   1503


<210>  16
<211>  984
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Geranylgeranyl diphosphate synthase from Yarrowia lipolytica CpO 
       for expression in Yarrowia lipolytica

<400>  16
atggattata acagcgcgga tttcaaggag atctggggca aggccgccga caccgcgctg       60

ctgggaccgt acaactacct cgccaacaac cggggccaca acatcagaga acacttgatc      120

gcagcgttcg gagcggttat caaggtggac aagagcgatc tcgaaaccat ttcgcacatc      180

accaagattt tgcataactc gtcgctgctt gttgatgacg tggaagacaa ctcgatgctc      240

cgacgaggcc tgccggcagc ccattgtctg tttggagtcc cccaaaccat caactccgcc      300

aactacatgt actttgtggc tctgcaggag gtgctcaagc tcaagtctta tgatgccgtc      360

tccattttca ccgaggaaat gatcaacttg catagaggtc agggtatgga tctctactgg      420

agagaaacac tcacttgccc ctcggaagac gagtatctgg agatggtggt gcacaagacc      480

ggaggactgt ttcggctggc tctgagactt atgctgtcgg tggcatcgaa acaggaggac      540

catgaaaaga tcaactttga tctcacacac cttaccgaca cactgggagt catttaccag      600

attctggatg attacctcaa cctgcagtcc acggaattga ccgagaacaa gggattctgc      660

gaagatatca gcgaaggaaa gttttcgttt ccgctgattc acagcatccg gaccaacccg      720

gataaccacg agattctcaa cattctcaaa cagcgaacaa gcgacgcttc actcaaaaag      780

tacgccgtgg actacatgag aacagaaacc aagagtttcg actactgcct caagagaatc      840

caggccatgt cactcaaggc aagttcgtac attgatgatc tcgcagcagc cggccacgat      900

gtctccaagt tgcgagccat tttgcattat tttgtgtcca cctctgactg tgaggagaga      960

aagtactttg aggatgcgca gtga                                             984


<210>  17
<211>  927
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  geranylgeranyl diphosphate synthase from Mucor circinelloides, 
       codon optimized for expression in Yarrowia lipolytica.

<400>  17
atgctagcca caaaaatgct caactctcac aaccgaaccg aggagcgatc caccgaggat       60

attattctcg agccttacac ctacctcatt tctcagcccg gaaaggacat tcgagctaag      120

ctcatttctg cctttgacct ctggctgcac gttcctaagg atgttctttg cgtcatcaac      180

aagattatcg gtatgctgca caacgcctct cttatgattg acgatgttca ggacgactct      240

gatctccgac gaggagtccc cgttgctcac cacatttacg gtgtccctca gactattaac      300

accgctaact acgtgatttt cctcgccctt caggaggtta tgaagctgaa catcccttct      360

atgatgcagg tgtgtaccga ggagcttatt aacctccacc gaggtcaggg aattgagctg      420

tactggcgag attccctcac ttgtcccact gaggaggagt acattgatat ggttaacaac      480

aagacctctg gcctccttcg acttgccgtc cgactgatgc aggctgcttc tgagtccgac      540

atcgactaca cccctctcgt caacattatc ggaattcact tccaggttcg agatgactac      600

atgaacctcc agtccacctc ttacactaac aacaagggct tttgcgagga cctgaccgag      660

ggaaagttct ccttccctat tattcacgct attcgaaagg acccctctaa ccgacagctc      720

ctgaacatta tctctcagaa gcccacctcc attgaggtta agaagtacgc tcttgaggtg      780

atccgaaagg ctggatcttt tgagtacgtt cgagagttcc ttcgacagaa ggaggctgag      840

tccctgaagg agatcaagcg acttggcggc aaccctctcc tcgagaagta cattgagact      900

attcgagtcg aggctactaa cgactaa                                          927


<210>  18
<211>  2232
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Copalyl pyrophosphate synthase from Stevia rebaudiana CpO for 
       expression in Yarrowia lipolytica

<400>  18
atgtgcaagg ctgtttccaa ggagtactcc gatctgctcc agaaggacga ggcctctttc       60

accaagtggg acgacgacaa ggtcaaggac cacctcgaca ccaacaagaa cctctacccc      120

aacgacgaga tcaaggagtt tgtcgagtcc gtcaaggcca tgttcggctc catgaacgac      180

ggcgagatta atgtctctgc ttacgacacc gcctgggttg ctctggtcca ggatgtcgac      240

ggttccggct ctcctcagtt cccttcctct ctcgagtgga tcgccaacaa ccagctgtcc      300

gacggttctt ggggtgacca cctgctcttc tctgctcacg accgaatcat caacaccctg      360

gcctgtgtca ttgctctgac ctcttggaac gtccacccct ccaagtgcga gaagggtctg      420

aacttcctcc gagagaacat ctgcaagctc gaggacgaga acgccgagca catgcccatt      480

ggcttcgagg tcaccttccc ctctctgatt gacattgcca agaagctcaa cattgaggtc      540

cccgaggaca cccccgctct caaggagatc tacgctcgac gagacatcaa gctcaccaag      600

atccccatgg aggttctcca caaggtcccc accactctcc tccactctct cgagggtatg      660

cccgatctcg agtgggagaa gctgctcaag ctgcagtgca aggacggctc tttcctcttc      720

tccccctctt ccactgcctt cgccctcatg cagaccaagg acgagaagtg tctccagtac      780

ctcaccaaca ttgtcaccaa gttcaacggt ggtgtcccca acgtctaccc cgttgacctc      840

tttgagcaca tctgggttgt tgaccgactc cagcgactcg gtatcgcccg atacttcaag      900

tccgagatca aggactgtgt cgagtacatc aacaagtact ggaccaagaa cggtatctgc      960

tgggcccgaa acacccacgt ccaggacatt gacgacaccg ccatgggctt ccgagttctg     1020

cgagcccacg gctacgatgt cacccccgat gtctttcgac agtttgagaa ggacggcaag     1080

tttgtctgtt tcgccggtca gtccacccag gccgtcaccg gtatgttcaa cgtctaccga     1140

gcttctcaga tgctcttccc cggtgagcga atcctcgagg acgccaagaa gttctcctac     1200

aactacctca aggagaagca gtccaccaac gagctgctcg acaagtggat cattgccaag     1260

gatctgcccg gtgaggttgg ctacgccctc gacatcccct ggtacgcctc tctgccccga     1320

ctggagactc gatactacct cgagcagtac ggtggtgagg acgatgtctg gatcggtaag     1380

accctgtacc gaatgggcta cgtttccaac aacacctacc tcgagatggc caagctcgac     1440

tacaacaact acgttgccgt cctccagctc gagtggtaca ccatccagca gtggtacgtc     1500

gacattggta tcgagaagtt cgagtccgac aacatcaagt ccgtccttgt ctcctactac     1560

ctcgctgctg cctccatctt cgagcccgag cgatccaagg agcgaattgc ctgggccaag     1620

accaccatcc tcgtcgacaa gatcacctcc atcttcgact cctcccagtc ctccaaggaa     1680

gatatcaccg ccttcattga caagttccga aacaagtcct cctccaagaa gcactccatc     1740

aacggcgagc cctggcacga ggtcatggtt gctctcaaga aaactctcca cggctttgcc     1800

ctcgacgctc tgatgaccca ctctcaggac atccaccccc agctccacca ggcctgggag     1860

atgtggctca ccaagctcca ggacggtgtt gatgtcactg ctgagctcat ggtccagatg     1920

atcaacatga ccgccggccg atgggtttcc aaggagctcc tcacccaccc ccagtaccag     1980

cgactctcca ctgtcaccaa ctctgtctgc cacgacatca ccaagctcca caacttcaag     2040

gagaactcca ccaccgtcga ctccaaggtc caggagctgg tccagctcgt tttctccgac     2100

acccccgatg atctcgacca ggacatgaag cagaccttcc tgactgtcat gaaaactttc     2160

tactacaagg cctggtgcga ccccaacacc atcaacgacc acatctccaa ggtctttgag     2220

attgtgattt aa                                                         2232


<210>  19
<211>  2274
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Kaurene synthase from Stevia rebaudiana CpO for expression in 
       Yarrowia lipolytica

<400>  19
atgacctccc acggcggcca gaccaacccc accaacctca tcattgacac caccaaggag       60

cgaatccaga agcagttcaa gaacgtcgag atctccgttt cctcctacga caccgcctgg      120

gtcgccatgg tcccctctcc caactccccc aagtctccct gcttccccga gtgtctcaac      180

tggctcatca acaaccagct caacgacggc tcttggggtc tggtcaacca cacccacaac      240

cacaaccacc ccctcctcaa ggactctctc tcttccactc tcgcctgcat tgttgctctc      300

aagcgatgga acgttggcga ggaccagatc aacaagggtc tgtctttcat tgagtccaac      360

ctcgcctccg ccaccgagaa gtcccagccc tcccccattg gctttgatat catcttcccc      420

ggtctgctcg agtacgccaa gaacctcgat atcaacctgc tctccaagca gaccgacttc      480

tctctcatgc tgcacaagcg agagctcgag cagaagcgat gccactccaa cgagatggac      540

ggctacctgg cctacatttc cgagggtctg ggtaacctct acgactggaa catggtcaag      600

aagtaccaga tgaagaacgg ttccgttttc aactccccct ctgccaccgc tgctgccttc      660

atcaaccacc agaaccccgg ctgtctcaac tacctcaact ctctgctcga caagtttggt      720

aacgccgtcc ccactgtcta cccccacgat ctcttcatcc gactctccat ggtcgacacc      780

attgagcgac tcggtatttc ccaccacttc cgagtcgaga tcaagaacgt tctcgatgag      840

acttaccgat gctgggttga gcgagatgag cagatcttca tggacgttgt cacctgtgct      900

ctggccttcc gactcctccg aatcaacggt tacgaggttt cccccgaccc cctcgccgag      960

atcaccaacg agctggctct caaggacgag tacgccgccc tcgagactta ccacgcttct     1020

cacattctgt accaagagga tctgtcctcc ggcaagcaga ttctcaagtc cgccgacttc     1080

ctcaaggaga tcatctccac tgactccaac cgactctcca agctcatcca caaggaagtc     1140

gagaacgctc tcaagttccc catcaacacc ggtctggagc gaatcaacac ccgacgaaac     1200

atccagctct acaacgtcga caacacccga attctcaaga ccacctacca ctcttccaac     1260

atctccaaca ccgactacct gcgactcgcc gtcgaggact tctacacctg ccagtccatc     1320

taccgagagg agctcaaggg tctggagcga tgggttgtcg agaacaagct cgaccagctc     1380

aagtttgccc gacaaaagac tgcctactgc tacttctccg ttgctgccac cctctcttct     1440

cccgagctct ccgacgcccg aatctcttgg gccaagaacg gtatcctgac cactgttgtc     1500

gacgacttct ttgacattgg tggcaccatt gacgagctga ccaacctcat ccagtgcgtc     1560

gagaagtgga acgtcgacgt tgacaaggac tgttgttccg agcacgtccg aatcctcttc     1620

ctggctctca aggacgccat ctgctggatc ggtgacgagg ccttcaagtg gcaggctcga     1680

gatgtcactt cccacgtcat ccagacctgg ctcgagctca tgaactccat gctgcgagag     1740

gccatctgga cccgagatgc ctacgtcccc accctcaacg agtacatgga gaacgcctac     1800

gtcagctttg ctctcggtcc cattgtcaag cccgccatct actttgtcgg tcccaagctg     1860

tccgaggaga ttgtcgagtc ctccgagtac cacaacctct tcaagctcat gtccacccag     1920

ggccgactcc tcaacgatat ccactccttc aagcgagagt tcaaggaagg taagctcaac     1980

gccgttgctc tgcacctgtc caacggtgag tccggcaagg tcgaggaaga ggtcgtcgag     2040

gagatgatga tgatgatcaa gaacaagcga aaggagctca tgaagctcat cttcgaggag     2100

aacggctcca ttgtcccccg agcctgcaag gacgccttct ggaacatgtg ccacgtcctc     2160

aacttcttct acgccaacga cgacggtttc accggcaaca ccattctcga caccgtcaag     2220

gacatcatct acaaccctct ggttctggtc aacgagaacg aggagcagag gtaa           2274


<210>  20
<211>  1578
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Kaurene oxidase from Gibberella fujikuroi CpO for expression in 
       Yarrowia lipolytica

<400>  20
atgtccaagt ccaactccat gaactccacc tcccacgaga ctctcttcca gcagctcgtt       60

ctcggcctcg accgaatgcc cctcatggac gtccactggc tcatctacgt tgcctttggt      120

gcctggctct gctcctacgt catccacgtt ctgtcctctt cctccactgt caaggtcccc      180

gtcgtcggtt accgatccgt tttcgagccc acctggctcc tccgactgcg attcgtctgg      240

gagggtggtt ccatcattgg ccagggctac aacaagttca aggactccat cttccaggtc      300

cgaaagctcg gtaccgacat tgtcatcatc cctcccaact acattgacga ggtccgaaag      360

ctctcccagg acaagacccg atccgtcgag cccttcatca acgactttgc cggccagtac      420

acccgaggta tggtctttct gcagtccgat ctccagaacc gagtcatcca gcagcgactc      480

acccccaagc ttgtctctct caccaaggtc atgaaggaag agctcgacta cgctctgacc      540

aaggagatgc ccgacatgaa gaacgacgag tgggttgagg tcgacatctc ttccatcatg      600

gtccgactca tctctcgaat ctccgcccga gttttcctcg gccccgagca ctgccgaaac      660

caggagtggc tcaccaccac cgccgagtac tccgagtctc tcttcatcac cggcttcatc      720

ctccgagttg tcccccacat tctccgaccc ttcattgctc ctctgctgcc ctcttaccga      780

accctgctgc gaaacgtttc ttccggccga cgagtcattg gtgatatcat ccgatcccag      840

cagggtgacg gtaacgagga catcctctct tggatgcgag atgctgccac tggtgaggag      900

aagcagatcg acaacattgc ccagcgaatg ctcattctgt ctctcgcctc catccacacc      960

accgccatga ccatgaccca cgccatgtac gatctgtgtg cctgccccga gtacattgag     1020

cccctccgag atgaggtcaa gtccgtcgtt ggtgcttctg gctgggacaa gaccgctctc     1080

aaccgattcc acaagctcga ctctttcctc aaggagtccc agcgattcaa ccccgttttc     1140

ctgctcacct tcaaccgaat ctaccaccag tccatgaccc tctccgatgg taccaacatc     1200

ccctccggta cccgaattgc tgtcccctct cacgccatgc tccaggactc cgcccacgtc     1260

cccggtccca ctcctcccac tgagttcgac ggtttccgat actccaagat ccgatccgac     1320

tccaactacg cccagaagta cctcttctcc atgaccgact cttccaacat ggcctttggc     1380

tacggtaagt acgcctgccc cggccgattc tacgcctcca acgagatgaa gctgactctg     1440

gccattctgc tcctccagtt tgagttcaag ctccccgacg gtaagggccg accccgaaac     1500

atcaccatcg actccgacat gatccccgac ccccgagctc gactctgtgt ccgaaagcga     1560

tctctgcgtg acgagtaa                                                   1578


<210>  21
<211>  2136
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Cytochrome P450 reductase from Arabidopsis thaliana CpO for 
       expression in Yarrowia lipolytica

<400>  21
atgtcctcct cttcttcttc ttccacctcc atgattgatc tcatggctgc catcatcaag       60

ggtgagcccg tcattgtctc cgaccccgcc aacgcctccg cctacgagtc cgttgctgcc      120

gagctgtcct ccatgctcat cgagaaccga cagtttgcca tgatcgtcac cacctccatt      180

gctgttctca ttggctgcat tgtcatgctc gtctggcgac gatctggctc cggtaactcc      240

aagcgagtcg agcccctcaa gcccctggtc atcaagcccc gagaagagga gatcgacgac      300

ggccgaaaga aggtcaccat cttctttggc acccagaccg gtactgctga gggcttcgcc      360

aaggctctcg gtgaggaagc caaggctcga tacgaaaaga cccgattcaa gattgtcgac      420

ctcgatgatt acgctgccga tgacgacgag tacgaggaga agctcaagaa agaggacgtt      480

gccttcttct tcctcgccac ctacggtgac ggtgagccca ccgacaacgc tgcccgattc      540

tacaagtggt tcaccgaggg taacgaccga ggcgagtggc tcaagaacct caagtacggt      600

gttttcggtc tgggcaaccg acagtacgag cacttcaaca aggttgccaa ggttgtcgac      660

gacatcctcg tcgagcaggg tgcccagcga ctcgtccagg tcggcctcgg tgatgatgac      720

cagtgcatcg aggacgactt cactgcctgg cgagaggctc tgtggcccga gctcgacacc      780

attctgcgag aggaaggtga caccgccgtt gccaccccct acaccgccgc cgtcctcgag      840

taccgagtct ccatccacga ctccgaggat gccaagttca acgacatcaa catggccaac      900

ggtaacggct acaccgtctt tgacgcccag cacccctaca aggccaacgt cgccgtcaag      960

cgagagctcc acacccccga gtccgaccga tcttgtatcc acctcgagtt tgacattgct     1020

ggttccggtc tgacctacga gactggtgac cacgttggtg tcctctgtga caacctgtcc     1080

gagactgtcg acgaggctct gcgactcctc gacatgtccc ccgacactta cttctctctg     1140

cacgccgaga aagaggacgg tactcccatc tcttcttctc tgccccctcc cttccctccc     1200

tgcaacctgc gaaccgctct gacccgatac gcctgcctcc tctcttctcc caagaagtct     1260

gctctcgttg ctctggccgc ccacgcctcc gaccccaccg aggctgagcg actcaagcac     1320

ctcgcctctc ccgctggcaa ggacgagtac tccaagtggg ttgtcgagtc ccagcgatct     1380

ctgctcgagg tcatggccga gttcccctcc gccaagcccc ctctcggtgt tttcttcgcc     1440

ggtgttgctc cccgactcca gccccgattc tactccatct cctcttcccc caagatcgcc     1500

gagactcgaa tccacgttac ctgtgctctg gtctacgaga agatgcccac cggccgaatc     1560

cacaagggtg tctgctccac ctggatgaag aacgccgttc cctacgagaa gtccgagaac     1620

tgttcctctg ctcccatctt tgtccgacag tccaacttca agctcccctc cgactccaag     1680

gtccccatca tcatgattgg ccccggtacc ggcctcgccc ccttccgagg cttcctgcag     1740

gagcgactcg ccctcgtcga gtccggtgtc gagctcggcc cctccgtcct cttctttggc     1800

tgccgaaacc gacgaatgga cttcatctac gaagaggagc tccagcgatt cgtcgagtcc     1860

ggtgctctcg ccgagctctc cgttgccttc tcccgagagg gtcccaccaa ggagtacgtc     1920

cagcacaaga tgatggacaa ggcctccgac atctggaaca tgatctccca gggcgcctac     1980

ctctacgtct gcggtgacgc caagggtatg gcccgagatg tccaccgatc tctgcacacc     2040

attgcccagg agcagggctc catggactcc accaaggccg agggtttcgt caagaacctc     2100

cagacctccg gccgatacct ccgagatgtc tggtaa                               2136


<210>  22
<211>  1446
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  UDP-glucosyltransferase from Stevia rebaudiana CpO for expression
       in Yarrowia lipolytica

<400>  22
atggacgcca tggccaccac cgagaagaag ccccacgtca tcttcatccc cttccccgcc       60

cagtcccaca tcaaggccat gctcaagctc gcccagctcc tccaccacaa gggcctccag      120

atcacctttg tcaacaccga cttcatccac aaccagttcc tcgagtcctc cggcccccac      180

tgtctggacg gtgctcccgg tttccgattt gagactatcc ccgatggtgt ctcccactcc      240

cccgaggcct ccatccccat ccgagagtct ctgctccgat ccattgagac taacttcctc      300

gaccgattca ttgatctcgt caccaagctc cccgatcctc ccacctgtat catctccgac      360

ggtttcctgt ccgttttcac cattgatgct gccaagaagc tcggtatccc cgtcatgatg      420

tactggactc tggctgcctg tggtttcatg ggtttctacc acatccactc tctgatcgag      480

aagggctttg ctcctctcaa ggacgcctcc tacctcacca acggttacct cgacaccgtc      540

attgactggg tccccggtat ggagggtatc cgactcaagg acttccccct cgactggtcc      600

accgacctca acgacaaggt tctcatgttc accaccgagg ctccccagcg atcccacaag      660

gtttcccacc acatcttcca caccttcgac gagctcgagc cctccatcat caagactctg      720

tctctgcgat acaaccacat ctacaccatt ggccccctcc agctcctcct cgaccagatc      780

cccgaggaga agaagcagac cggtatcacc tctctgcacg gctactctct cgtcaaggaa      840

gagcccgagt gcttccagtg gctccagtcc aaggagccca actccgttgt ctacgtcaac      900

tttggctcca ccaccgtcat gtctctcgag gacatgaccg agtttggctg gggtctggcc      960

aactccaacc actacttcct gtggatcatc cgatccaacc tcgtcattgg cgagaacgcc     1020

gttctgcctc ccgagctcga ggagcacatc aagaagcgag gcttcattgc ctcttggtgc     1080

tcccaggaga aggttctcaa gcacccctcc gtcggtggtt tcctgaccca ctgcggctgg     1140

ggctccacca ttgagtctct gtccgctggt gtccccatga tctgctggcc ctactcctgg     1200

gaccagctca ccaactgccg atacatctgc aaggagtggg aggttggtct ggagatgggt     1260

accaaggtca agcgagatga ggtcaagcga ctcgtccagg agctcatggg cgagggtggt     1320

cacaagatgc gaaacaaggc caaggactgg aaggagaagg cccgaattgc cattgccccc     1380

aacggctctt cttctctcaa cattgacaag atggtcaagg agatcactgt tctcgctcga     1440

aactaa                                                                1446


<210>  23
<211>  1422
<212>  DNA
<213>  Artificial sequence

<220>
<223>  variant of UDP-glucosyltransferase from Stevia rebaudiana Cpo for
       expression in Yarrowia lipolytica

<400>  23
atggccacct ccgactccat tgttgacgac cgaaagaagc tccacattgt catgttcccc       60

tggctcgcct ttggccacat catcccctat ctcgagcttt ccaagctcat tgcccagaag      120

ggccacaagg tttccttcct ctccaccacc aagaacattg accgactctc ctcccacatc      180

tctcccctca tcaactttgt caagctcacc ctcccccgag tccaggagct gcccgaggac      240

gccgaggcca ccactgatgt ccaccccgag gatatcccct acctcaagaa ggcctccgac      300

ggcctccagc ccgaggtcac tgagttcctc gagcagcact ctcccgactg gatcatctac      360

gactacaccc actactggct ccccgagatt gccaagtctc tcggtgtctc tcgagcccac      420

ttctccgtca ccaccccctg ggccattgct tacatgggtc ccactgccga tgccatgatc      480

aacggttccg actaccgaac cgagcttgag gacttcaccg tccctcccaa gtggttcccc      540

ttccccacca ccgtctgctg gcgaaagcac gatctggccc gactcgtccc ctacaaggct      600

cccggtatct ccgacggtta ccgaatgggc ctcgtcatca agggctgcga ctgtctgctc      660

tccaagacct accacgagtt cggtactcag tggctccgac ttctcgagga gctgcaccga      720

gtccccgtca tccccgttgg tctgctccct ccctccatcc ccggctctga caaggacgac      780

tcttgggttt ccatcaagga gtggctcgac ggccaggaga agggctccgt tgtctacgtt      840

gctctcggtt ccgaggttct cgtcacccag gaagaggttg tcgagcttgc tcacggtctg      900

gagctgtccg gtctgccctt cttctgggcc taccgaaagc ccaagggtcc cgccaagtcc      960

gactccgtcg agcttcccga tggtttcgtc gagcgagtcc gagatcgagg tctggtctgg     1020

acctcttggg ctccccagct ccgaatcctc tcccacgagt ccgttgctgg tttcctcacc     1080

cactgcggtt ccggctccat tgtcgagggc ctcatgttcg gccaccctct catcatgctc     1140

cccatcttcg gtgaccagcc cctcaacgcc cgactccttg aggacaagca ggtcggtatc     1200

gagatccccc gaaacgagga agatggttct ttcacccgag actctgttgc cgagtctctg     1260

cgactcgtca tggtcgagga agagggtaag atctaccgag agaaggccaa ggagatgtcc     1320

aagctctttg gcgacaagga cctccaggac cagtacgtcg acgactttgt cgagtacctc     1380

cagaagcacc gacgagctgt tgccattgac cacgaaagct aa                        1422


<210>  24
<211>  1383
<212>  DNA
<213>  Artificial sequence

<220>
<223>  UDP-glucosyltransferase from Stevia rebaudiana Cpo for expression
       in Yarrowia lipolytica

<400>  24
atggccgagc agcagaagat caagaagtct ccccacgttc tgctcatccc cttccctctg       60

cagggccaca tcaacccctt catccagttc ggcaagcgac tcatctccaa gggtgtcaag      120

accactctgg tcaccaccat ccacaccctc aactccactc tcaaccactc caacaccacc      180

accacctcca tcgagatcca ggccatctcc gacggctgtg acgagggtgg tttcatgtct      240

gctggtgagt cttacctcga gactttcaag caggtcggtt ccaagtctct ggctgacctc      300

atcaagaagc tccagtccga gggtaccacc attgacgcca tcatctacga ctccatgacc      360

gagtgggttc tcgatgtcgc catcgagttt ggtattgacg gtggctcctt cttcacccag      420

gcctgtgtcg tcaactctct ctactaccac gtccacaagg gtctgatctc tctgcccctc      480

ggcgagactg tctccgtccc cggtttcccc gttctgcagc gatgggagac tcctctcatt      540

ctccagaacc acgagcagat ccagtccccc tggtcccaga tgctcttcgg ccagttcgcc      600

aacattgacc aggcccgatg ggttttcacc aactccttct acaagctcga ggaagaggtc      660

attgagtgga cccgaaagat ctggaacctc aaggtcattg gccccaccct cccctccatg      720

tacctcgaca agcgactcga tgacgacaag gacaacggtt tcaacctcta caaggccaac      780

caccacgagt gcatgaactg gctcgacgac aagcccaagg agtccgttgt ctacgttgcc      840

tttggctctc tggtcaagca cggccccgag caggttgagg agatcacccg agctctgatt      900

gactccgatg tcaacttcct gtgggtcatc aagcacaagg aagagggtaa gctccccgag      960

aacctgtccg aggtcatcaa gaccggcaag ggcctcattg ttgcctggtg caagcagctc     1020

gacgttctcg cccacgagtc cgtcggctgc tttgtcaccc actgcggttt caactccacc     1080

ctcgaggcta tctctctcgg tgtccccgtt gttgccatgc cccagttctc cgaccagacc     1140

accaacgcca agctcctcga tgagattctc ggtgtcggtg tccgagtcaa ggctgacgag     1200

aacggtattg tccgacgagg taacctggct tcttgtatca agatgatcat ggaggaagag     1260

cgaggtgtca tcatccgaaa gaacgccgtc aagtggaagg atctggccaa ggttgctgtc     1320

cacgagggtg gctcttccga caacgacatt gtcgagtttg tctccgagct catcaaggcc     1380

taa                                                                   1383


<210>  25
<211>  1377
<212>  DNA
<213>  Artificial sequence

<220>
<223>  UDP-glucosyltransferase from Stevia rebaudiana Cpo for expression
       in Yarrowia lipolytica

<400>  25
atggagaaca agaccgagac taccgtccga cgacgacgac gaatcattct cttccccgtc       60

cccttccagg gccacatcaa ccccattctg cagctcgcca acgttctgta ctccaagggc      120

ttctccatca ccatcttcca caccaacttc aacaagccca agacctccaa ctacccccac      180

ttcactttcc gattcatcct cgacaacgac ccccaggacg agcgaatctc caacctgccc      240

acccacggtc ctctggctgg tatgcgaatc cccatcatca acgagcacgg tgctgacgag      300

ctccgacgag agctcgagct gctcatgctc gcctccgaag aggacgagga agtctcctgt      360

ctgatcaccg atgctctgtg gtactttgcc cagtccgtcg ccgactctct caacctgcga      420

cgactcgttc tcatgacctc ctctctgttc aacttccacg cccacgtttc tctgccccag      480

tttgacgagc tcggttacct cgaccccgat gacaagaccc gactcgagga gcaggcttcc      540

ggtttcccca tgctcaaggt caaggacatc aagtccgcct actccaactg gcagattctc      600

aaggagattc tcggcaagat gatcaagcag accaaggcct cctccggtgt catctggaac      660

tccttcaagg agctcgagga gtccgagctc gagactgtca tccgagagat ccccgctccc      720

tctttcctca tccccctgcc caagcacctc accgcttcct cctcttctct gctcgaccac      780

gaccgaaccg tctttcagtg gctcgaccag cagccccctt cctccgtcct ctacgtttcc      840

ttcggctcca cctccgaggt cgacgagaag gacttcctcg agattgctcg aggcctcgtt      900

gactccaagc agtccttcct gtgggttgtc cgacccggct ttgtcaaggg ctccacctgg      960

gttgagcccc tgcccgatgg tttcctcggt gagcgaggcc gaattgtcaa gtgggtcccc     1020

cagcaggaag ttctggccca cggtgccatt ggtgccttct ggacccactc cggctggaac     1080

tccactctcg agtccgtctg cgagggtgtc cccatgatct tctccgactt tggcctcgac     1140

cagcccctca acgcccgata catgtccgat gttctcaagg tcggtgtcta cctcgagaac     1200

ggctgggagc gaggtgagat tgccaacgcc atccgacgag tcatggtcga cgaggaaggt     1260

gagtacatcc gacagaacgc ccgagtcctc aagcagaagg ccgatgtctc tctcatgaag     1320

ggtggttctt cttacgagtc tctcgagtct ctcgtttcct acatctcttc tttgtaa        1377


