                         SEQUENCE LISTING

<110>  E.I. duPont de Nemours & Company, Inc.
       Zhu, Quinn
 
<120>  MUTANT HDASH MOTIF DELTA-5 DESATURASES AND THEIR USE IN MAKING 
       POLYUNSATURATED FATTY ACIDS

<130>  CL5267

<150>  US 61/428277
<151>  2010-12-30

<160>  311   

<170>  PatentIn version 3.5

<210>  1
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  misc_feature
<222>  (2)..(4)
<223>  Xaa can be any naturally occurring amino acid

<400>  1

His Xaa Xaa Xaa His 
1               5   


<210>  2
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  misc_feature
<222>  (2)..(5)
<223>  Xaa can be any naturally occurring amino acid

<400>  2

His Xaa Xaa Xaa Xaa His 
1               5       


<210>  3
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  misc_feature
<222>  (2)..(3)
<223>  Xaa can be any naturally occurring amino acid

<400>  3

His Xaa Xaa His His 
1               5   


<210>  4
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  misc_feature
<222>  (2)..(4)
<223>  Xaa can be any naturally occurring amino acid

<400>  4

His Xaa Xaa Xaa His His 
1               5       


<210>  5
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  MISC_FEATURE
<222>  (1)..(1)
<223>  Xaa = Gln [Q] or His [H]

<220>
<221>  misc_feature
<222>  (2)..(3)
<223>  Xaa can be any naturally occurring amino acid

<400>  5

Xaa Xaa Xaa His His 
1               5   


<210>  6
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  MISC_FEATURE
<222>  (1)..(1)
<223>  Xaa = Gln [Q] or His [H]

<220>
<221>  misc_feature
<222>  (2)..(4)
<223>  Xaa can be any naturally occurring amino acid

<400>  6

Xaa Xaa Xaa Xaa His His 
1               5       


<210>  7
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HPGG motif

<400>  7

His Pro Gly Gly 
1               


<210>  8
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDASH motif

<400>  8

His Asp Ala Ser His 
1               5   


<210>  9
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HGGG motif

<400>  9

His Gly Gly Gly 
1               


<210>  10
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HHGG motif

<400>  10

His His Gly Gly 
1               


<210>  11
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HPGS motif

<400>  11

His Pro Gly Ser 
1               


<210>  12
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HCGG motif

<400>  12

His Cys Gly Gly 
1               


<210>  13
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HWGG motif

<400>  13

His Trp Gly Gly 
1               


<210>  14
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HAGG motif

<400>  14

His Ala Gly Gly 
1               


<210>  15
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDGSH motif

<400>  15

His Asp Gly Ser His 
1               5   


<210>  16
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDSSH motif

<400>  16

His Asp Ser Ser His 
1               5   


<210>  17
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAAH motif

<400>  17

His Asp Ala Ala His 
1               5   


<210>  18
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAGH motif

<400>  18

His Asp Ala Gly His 
1               5   


<210>  19
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HEASH motif

<400>  19

His Glu Ala Ser His 
1               5   


<210>  20
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)
<223>  delta-5 desaturase

<300>
<302>  DELTA-5 DESATURASE AND ITS USE IN MAKING POLYUNSATURATED FATTY 
       ACIDS
<310>  US 7,678,560
<311>  2007-05-15
<312>  2010-03-16
<313>  (1)..(1350)

<400>  20
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat cca gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aat tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg tcc cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc caa gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gag aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc act gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag agc tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cgc cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc tga                                                               1350
Ser                                                                       
                                                                          


<210>  21
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  21

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  22
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)
<223>  synthetic delta-5 desaturase (codon-optimized for Yarrowia 
       lipolytica)

<300>
<302>  DELTA-5 DESATURASE AND ITS USE IN MAKING POLYUNSATURATED FATTY 
       ACIDS
<310>  US 7,678,560
<311>  2007-05-15
<312>  2010-03-16
<313>  (1)..(1350)

<400>  22
atg gct ctc tcc ctt act acc gag cag ctg ctc gag cga ccc gac ctg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcc atc gac ggc att ctc tac gat ctg gaa ggt ctt gcc aag gtc         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat ccc gga ggc gac ttg atc ctc gct tct ggt gcc tcc gat gct tct        144
His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctg ttc tac tcc atg cac cct tac gtc aag ccc gag aac tcg aag        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ctg ctt caa cag ttc gtg cga ggc aag cac gac cga acc tcc aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acc tac gac tct ccc ttt gca cag gac gtc aag cga act        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cga gag gtc atg aaa ggt cgg aac tgg tat gcc aca cct gga ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cga acc gtt ggc atc att gct gtc acc gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct act acc gga atg gtg ctg tgg ggt ctc ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc ctg tcc att cag cac gat gcc tct cat ggt        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly           
145                 150                 155                 160           

gcc atc agc aaa aag ccc tgg gtc aac gct ctc ttt gcc tac ggc atc        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc att gga tcg tcc aga tgg atc tgg ctg cag tct cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cga cat cac acc tac acc aat cag cat ggt ctc gac ctg gat gcc gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcc gca gaa cca ttc ctt gtg ttc cac aac tac cct gct gcc aac act        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gct cga aag tgg ttt cac cga ttc cag gcc tgg tac atg tac ctc gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctt gga gcc tac ggc gtt tcg ctg gtg tac aac cct ctc tac atc ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cga atg cag cac aac gac acc att ccc gag tct gtc aca gcc atg cga        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gag aac ggc ttt ctg cga cgg tac cga acc ctt gca ttc gtt atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttt cga acc gcc ttc ttg ccc tgg tat ctc act gga        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tcc ctg ctc atc acc att cct ctg gtg ccc act gct acc ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ctc acc ttc ttt ttc atc ttg tct cac aac ttc gat ggc tcg gag       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cga atc ccc gac aag aac tgc aag gtc aag agc tcc gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val           
            340                 345                 350                   

gaa gcc gat cag atc gac tgg tac aga gct cag gtg gag acc tct tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

acc tac ggt gga ccc att gcc atg ttc ttt act ggc ggt ctc aac ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cat cac ctc ttt cct cga atg tcg tct tgg cac tat ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtg cag caa gct gtc cga gag tgt tgc gaa cga cac gga gtt cgg       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tac gtc ttc tac cct acc att gtg ggc aac atc att tcc acc ctc aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cac aaa gtc ggt gtg gtt cac tgt gtc aag gac gct cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  23
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  23

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  24
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)
<223>  delta-5 desaturase ("EgD5R")

<400>  24
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat cca gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aat tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg tcc cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc caa gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag aga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cgc cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  25
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  25

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  26
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)
<223>  delta-5 desaturase ("EgD5R*")

<400>  26
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat cca gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aac tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg tcc cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc cag gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag cga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cga cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  27
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  27

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  28
<211>  1362
<212>  DNA
<213>  Euglena anabaena UTEX 373


<220>
<221>  CDS
<222>  (1)..(1362)
<223>  delta-5 desaturase

<300>
<302>  DELTA-5 DESATURASES AND THEIR USE IN MAKING POLYUNSATURATED FATTY
        ACIDS
<310>  US 7,943,365
<311>  2008-04-29
<312>  2011-05-17
<313>  (1)..(1362)

<400>  28
atg gcc acc atc tct ttg act act gag caa ctt tta gaa cac cca gaa         48
Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu           
1               5                   10                  15                

ctg gtt gca att gat ggg gtg ttg tac gat ctc ttc gga ctg gcg aaa         96
Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys           
            20                  25                  30                    

gtg cat cca ggt ggc aac ctc att gaa gcc gcc ggt gcc tcc gac gga        144
Val His Pro Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly           
        35                  40                  45                        

acc gcc ctg ttc tac tcc atg cac cct gga gtg aag cca gag aat tcg        192
Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser           
    50                  55                  60                            

aag ctg ctg cag caa ttt gcc cga ggc aaa cac gaa cga agc tcg aag        240
Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys           
65                  70                  75                  80            

gac cca gtg tac acc ttt gac agt ccc ttc gcc cag gat gtc aag cag        288
Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln           
                85                  90                  95                

agc gtt cgg gag gtc atg aag ggg cgc aac tgg tac gcc acg ccc ggc        336
Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly           
            100                 105                 110                   

ttt tgg ctg cgg acc gcg ctg atc atc gcg tgc act gcc ata ggc gaa        384
Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu           
        115                 120                 125                       

tgg tat tgg atc act acc ggg gca gtg atg tgg ggc atc ttc acc ggg        432
Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly           
    130                 135                 140                           

tac ttc cac agc cag att ggg ttg gcg att caa cac gat gcc tct cac        480
Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Ser His           
145                 150                 155                 160           

gga gcc atc agc aaa aag ccc tgg gtg aac gcc ttt ttc gcc tac ggc        528
Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly           
                165                 170                 175               

atc gac gcc att gga tcc tcc cgc tgg atc tgg ctg cag tcc cac att        576
Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile           
            180                 185                 190                   

atg cgc cac cac acc tac acc aac cag cat ggc ctg gac ctg gac gct        624
Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala           
        195                 200                 205                       

gcc tcg gcg gag ccg ttc att ttg ttc cac tcc tac ccg gca aca aat        672
Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn           
    210                 215                 220                           

gcg tca cga aag tgg tac cat cgg ttc cag gcg tgg tac atg tac atc        720
Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile           
225                 230                 235                 240           

gtt ttg ggg atg tat ggt gtg tcg atg gtg tac aat ccg atg tac ttg        768
Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu           
                245                 250                 255               

ttc acg atg cag cac aac gac aca atc cca gag gcc acc tct ctt aga        816
Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg           
            260                 265                 270                   

cca ggc agc ttt ttc aac cgg cag cgc gcc ttc gcc gtt tcc ctc cgc        864
Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg           
        275                 280                 285                       

cta ctg ttc atc ttc cgc aac gcc ttc ctc ccc tgg tac atc gcg ggc        912
Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly           
    290                 295                 300                           

gcc tct ccg ctg ctc acc atc ctg ctg gtg cca acg gtc aca ggc atc        960
Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile           
305                 310                 315                 320           

ttc ttg aca ttt gtt ttt gtg ctg tcc cat aac ttt gaa ggc gct gag       1008
Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu           
                325                 330                 335               

cgg acc ccc gaa aag aac tgc aag gcc aaa agg gcc aag gag ggg aag       1056
Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys           
            340                 345                 350                   

gag gtc cgc gat gta gag gag gac cgg gtg gac tgg tac cgg gcg cag       1104
Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln           
        355                 360                 365                       

gcc gag acc gcg gcg acc tac ggg ggc agc gtc ggg atg atg ctg acc       1152
Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr           
    370                 375                 380                           

ggc ggt ttg aac ctg cag atc gag cac cac ttg ttc ccc cgc atg tcc       1200
Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser           
385                 390                 395                 400           

tct tgg cac tac ccc ttc atc caa gat acg gtg cgg gaa tgt tgc aag       1248
Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys           
                405                 410                 415               

cgc cat ggc gtg cgc tac aca tac tac ccg acc atc ctg gag aat ata       1296
Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile           
            420                 425                 430                   

atg tcc acg ctc cgc tac atg cag aag gtg ggc gtg gcc cac aca att       1344
Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile           
        435                 440                 445                       

cag gat gcc cag gaa ttc                                               1362
Gln Asp Ala Gln Glu Phe                                                   
    450                                                                   


<210>  29
<211>  454
<212>  PRT
<213>  Euglena anabaena UTEX 373

<400>  29

Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu 
1               5                   10                  15      


Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys 
            20                  25                  30          


Val His Pro Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly 
        35                  40                  45              


Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser 
    50                  55                  60                  


Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys 
65                  70                  75                  80  


Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln 
                85                  90                  95      


Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly 
            100                 105                 110         


Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu 
        115                 120                 125             


Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly 
    130                 135                 140                 


Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Ser His 
145                 150                 155                 160 


Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly 
                165                 170                 175     


Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile 
            180                 185                 190         


Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala 
        195                 200                 205             


Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn 
    210                 215                 220                 


Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile 
225                 230                 235                 240 


Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu 
                245                 250                 255     


Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg 
            260                 265                 270         


Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg 
        275                 280                 285             


Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly 
    290                 295                 300                 


Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile 
305                 310                 315                 320 


Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu 
                325                 330                 335     


Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys 
            340                 345                 350         


Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln 
        355                 360                 365             


Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr 
    370                 375                 380                 


Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser 
385                 390                 395                 400 


Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys 
                405                 410                 415     


Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile 
            420                 425                 430         


Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile 
        435                 440                 445             


Gln Asp Ala Gln Glu Phe 
    450                 


<210>  30
<211>  1362
<212>  DNA
<213>  Euglena anabaena UTEX 373


<220>
<221>  CDS
<222>  (1)..(1362)
<223>  synthetic delta-5 desaturase (codon-optimized for Yarrowia 
       lipolytica)

<300>
<302>  DELTA-5 DESATURASES AND THEIR USE IN MAKING POLYUNSATURATED FATTY
        ACIDS
<310>  US 7,943,365
<311>  2008-04-29
<312>  2011-05-17
<313>  (1)..(1362)

<400>  30
atg gcc acc atc tcc ctg act acc gag cag ctc ctg gaa cac ccc gag         48
Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu           
1               5                   10                  15                

ctc gtt gcc atc gac gga gtc ctg tac gat ctc ttc ggt ctg gcc aag         96
Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys           
            20                  25                  30                    

gtg cat cca gga ggc aac ctc atc gaa gct gcc ggt gca tcc gac gga        144
Val His Pro Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly           
        35                  40                  45                        

acc gct ctg ttc tac tcc atg cat cct gga gtc aag cca gag aac tcg        192
Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser           
    50                  55                  60                            

aag ctt ctg cag caa ttt gcc cga ggc aag cac gaa cga agc tcc aag        240
Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys           
65                  70                  75                  80            

gat ccc gtg tac acc ttc gac tct ccc ttt gct cag gac gtc aag cag        288
Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln           
                85                  90                  95                

tcc gtt cga gag gtc atg aag ggt cga aac tgg tac gcc act cct ggc        336
Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly           
            100                 105                 110                   

ttc tgg ctg aga acc gca ctc atc atc gct tgt act gcc att ggc gag        384
Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu           
        115                 120                 125                       

tgg tac tgg atc aca acc gga gca gtg atg tgg ggt atc ttt act gga        432
Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly           
    130                 135                 140                           

tac ttc cac tcg cag att ggc ttg gcc att caa cac gat gct tct cac        480
Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Ser His           
145                 150                 155                 160           

gga gcc atc agc aaa aag ccc tgg gtc aac gcc ttt ttc gct tat ggc        528
Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly           
                165                 170                 175               

atc gac gcc att ggt tcc tct cgt tgg atc tgg ctg cag tcc cac att        576
Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile           
            180                 185                 190                   

atg cga cat cac act tac acc aac cag cat ggc ctc gac ctg gat gct        624
Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala           
        195                 200                 205                       

gcc tcg gca gag ccg ttc atc ttg ttc cac tcc tat cct gct acc aac        672
Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn           
    210                 215                 220                           

gcc tct cga aag tgg tac cac cga ttt cag gcg tgg tac atg tac atc        720
Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile           
225                 230                 235                 240           

gtt ctg gga atg tat ggt gtc tcg atg gtg tac aat ccc atg tac ctc        768
Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu           
                245                 250                 255               

ttc aca atg cag cac aac gac acc att ccc gag gcc act tct ctc aga        816
Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg           
            260                 265                 270                   

cca ggc agc ttt ttc aat cgg cag cga gct ttc gcc gtt tcc ctt cga        864
Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg           
        275                 280                 285                       

ctg ctc ttc atc ttc cga aac gcc ttt ctt ccc tgg tac att gct ggt        912
Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly           
    290                 295                 300                           

gcc tct cct ctg ctc acc att ctt ctg gtg ccc acg gtc aca ggc atc        960
Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile           
305                 310                 315                 320           

ttc ctc acc ttt gtg ttc gtt ctg tcc cat aac ttc gag gga gcc gaa       1008
Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu           
                325                 330                 335               

cgg acc cca gag aag aac tgc aag gcc aaa cga gct aag gaa ggc aag       1056
Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys           
            340                 345                 350                   

gag gtc aga gac gtg gaa gag gat cga gtc gac tgg tac cga gca cag       1104
Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln           
        355                 360                 365                       

gcc gag act gct gcc acc tac ggt ggc agc gtg gga atg atg ctt aca       1152
Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr           
    370                 375                 380                           

ggc ggt ctc aac ctg cag atc gag cat cac ttg ttt ccc cga atg tcc       1200
Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser           
385                 390                 395                 400           

tct tgg cac tat ccc ttc att caa gac acc gtt cgg gag tgt tgc aag       1248
Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys           
                405                 410                 415               

cga cat ggc gtc cgt tac aca tac tat cct acc att ctc gag aac atc       1296
Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile           
            420                 425                 430                   

atg tcc act ctt cga tac atg cag aag gtg ggt gtt gct cac acc att       1344
Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile           
        435                 440                 445                       

cag gat gcc cag gag ttc                                               1362
Gln Asp Ala Gln Glu Phe                                                   
    450                                                                   


<210>  31
<211>  454
<212>  PRT
<213>  Euglena anabaena UTEX 373

<400>  31

Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu 
1               5                   10                  15      


Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys 
            20                  25                  30          


Val His Pro Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly 
        35                  40                  45              


Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser 
    50                  55                  60                  


Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys 
65                  70                  75                  80  


Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln 
                85                  90                  95      


Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly 
            100                 105                 110         


Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu 
        115                 120                 125             


Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly 
    130                 135                 140                 


Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Ser His 
145                 150                 155                 160 


Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly 
                165                 170                 175     


Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile 
            180                 185                 190         


Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala 
        195                 200                 205             


Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn 
    210                 215                 220                 


Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile 
225                 230                 235                 240 


Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu 
                245                 250                 255     


Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg 
            260                 265                 270         


Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg 
        275                 280                 285             


Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly 
    290                 295                 300                 


Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile 
305                 310                 315                 320 


Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu 
                325                 330                 335     


Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys 
            340                 345                 350         


Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln 
        355                 360                 365             


Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr 
    370                 375                 380                 


Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser 
385                 390                 395                 400 


Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys 
                405                 410                 415     


Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile 
            420                 425                 430         


Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile 
        435                 440                 445             


Gln Asp Ala Gln Glu Phe 
    450                 


<210>  32
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HXGG motif


<220>
<221>  misc_feature
<222>  (2)..(2)
<223>  Xaa can be any naturally occurring amino acid

<400>  32

His Xaa Gly Gly 
1               


<210>  33
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HPGX motif


<220>
<221>  misc_feature
<222>  (4)..(4)
<223>  Xaa can be any naturally occurring amino acid

<400>  33

His Pro Gly Xaa 
1               


<210>  34
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HXGX motif


<220>
<221>  misc_feature
<222>  (2)..(2)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (4)..(4)
<223>  Xaa can be any naturally occurring amino acid

<400>  34

His Xaa Gly Xaa 
1               


<210>  35
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HXASH motif


<220>
<221>  misc_feature
<222>  (2)..(2)
<223>  Xaa can be any naturally occurring amino acid

<400>  35

His Xaa Ala Ser His 
1               5   


<210>  36
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDXSH motif


<220>
<221>  misc_feature
<222>  (3)..(3)
<223>  Xaa can be any naturally occurring amino acid

<400>  36

His Asp Xaa Ser His 
1               5   


<210>  37
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAXH motif


<220>
<221>  misc_feature
<222>  (4)..(4)
<223>  Xaa can be any naturally occurring amino acid

<400>  37

His Asp Ala Xaa His 
1               5   


<210>  38
<211>  8438
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pDMW367

<400>  38
catggctctc agtcttacca cagaacagct gttagaacgc cctgatttgg ttgcgattga       60

tggcatcctc tacgaccttg aagggcttgc caaagttcat ccaggaggag atttgattct      120

cgcttctggt gcctctgatg cctcccctct cttttattca atgcatccat acgtcaaacc      180

ggagaattcc aaattgcttc aacagttcgt ccgagggaag catgaccgca cctcgaagga      240

cattgtctac acgtatgatt ctcccttcgc acaagacgtt aagcggacaa tgcgcgaggt      300

gatgaaaggg aggaactggt acgcaacccc tggcttctgg ctgcgcaccg ttgggatcat      360

cgccgtgacg gccttttgcg agtggcactg ggctaccacg gggatggtgc tgtggggcct      420

gttgactgga ttcatgcaca tgcagatcgg cttatccatc cagcatgatg cgtcccacgg      480

ggccatcagc aagaagcctt gggtcaacgc cctcttcgcc tacggcattg acgtcatcgg      540

atcgtcccgg tggatttggc tgcagtcgca catcatgcgg caccacacct acaccaacca      600

gcacggcctc gacctggatg cggagtcggc agagccgttc ctggtgttcc acaactaccc      660

cgccgcaaac accgcccgaa agtggttcca ccgcttccaa gcttggtaca tgtaccttgt      720

gctgggggca tacggggtat cgctggtgta caacccgctc tacattttcc ggatgcagca      780

caatgacacc atcccagagt ctgtcacggc catgcgggaa aatggctttc tgcggcgcta      840

ccgcacactt gcattcgtga tgcgagcttt cttcatcttc cggaccgcat tcttgccctg      900

gtacctcact gggacctcat tgctgatcac cattcctctg gtgcccaccg caactggtgc      960

cttcttgacg ttcttcttca ttttgtccca caattttgat ggctccgaac ggatccccga     1020

caagaactgc aaggttaaga gatctgagaa ggacgttgag gctgaccaaa ttgactggta     1080

tcgggcgcag gtggagacgt cctccacata cggtggcccc atcgccatgt tcttcactgg     1140

cggtctcaat ttccagatcg agcaccacct ctttccccgg atgtcgtctt ggcactaccc     1200

cttcgtccag caggcggtcc gggagtgttg cgaacgccat ggagtgcgat atgttttcta     1260

ccctaccatc gtcggcaaca tcatctccac cctgaagtac atgcataagg tgggtgtcgt     1320

ccactgcgtg aaggacgcac aggattccta agcggccgca agtgtggatg gggaagtgag     1380

tgcccggttc tgtgtgcaca attggcaatc caagatggat ggattcaaca cagggatata     1440

gcgagctacg tggtggtgcg aggatatagc aacggatatt tatgtttgac acttgagaat     1500

gtacgataca agcactgtcc aagtacaata ctaaacatac tgtacatact catactcgta     1560

cccgggcaac ggtttcactt gagtgcagtg gctagtgctc ttactcgtac agtgtgcaat     1620

actgcgtatc atagtctttg atgtatatcg tattcattca tgttagttgc gtacgagccg     1680

gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt     1740

tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg     1800

gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg     1860

actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa     1920

tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc     1980

aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc     2040

ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat     2100

aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc     2160

cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct     2220

cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg     2280

aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc     2340

cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga     2400

ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa     2460

ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta     2520

gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc     2580

agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg     2640

acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga     2700

tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg     2760

agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct     2820

gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg     2880

agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc     2940

cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa     3000

ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc     3060

cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt     3120

cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc     3180

ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt     3240

tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc     3300

catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt     3360

gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata     3420

gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga     3480

tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag     3540

catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa     3600

aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt     3660

attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga     3720

aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgcgccct     3780

gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg     3840

ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg     3900

gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac     3960

ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct     4020

gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt     4080

tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt     4140

tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt     4200

ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca actgttggga     4260

agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc     4320

aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc     4380

cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc ccctcgaggt     4440

cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct tcgcctcaag     4500

gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat taattttcgg     4560

gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat atacatcatg     4620

atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc gcctccaact     4680

gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag actccatcta     4740

ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt acttagtatt     4800

attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa tttataatgg     4860

cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat gggaaatctt     4920

aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca gcaacgaaaa     4980

aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag aacagctatt     5040

cacacgttac tattgagatt attattggac gagaatcaca cactcaactg tctttctctc     5100

ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct agtcatttca     5160

tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca aattcaacaa     5220

ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc tctggtgtgc     5280

ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt tcttgttata     5340

taatcctttt gtttattaca tgggctggat acataaaggt attttgattt aattttttgc     5400

ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta ccatactttt     5460

gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga cgttccgcag     5520

aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg ctccctgaga     5580

tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta ctactgttga     5640

tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat gattcattac     5700

cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca attaatcata     5760

gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca tgctacttgg     5820

gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg acagtaatta     5880

attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt agttcaacgt     5940

attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc cattggacag     6000

atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag gtcgtctgac     6060

catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca cagttaaatt     6120

acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca gccagccttc     6180

tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc tcggccgaca     6240

attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg ctgtccgaga     6300

gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc ctcagagtcg     6360

cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga tcgggcaagc     6420

tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga cagctcggcc     6480

agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa ctccttgtac     6540

tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt ttcctcggca     6600

ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt ggtgatatcg     6660

gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc aatatctgcg     6720

aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt gagggggagc     6780

acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat catgcacaca     6840

taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac atccagagaa     6900

gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc aaaggcggac     6960

ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag gagactgaaa     7020

taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa gtatatgtta     7080

tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg ctatcggtcc     7140

aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa aatgtgatca     7200

tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg cgccgaaaac     7260

gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat ccaagcacac     7320

tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata ctcgtcgact     7380

caggcgacga cggaattcct gcagcccatc tgcagaattc aggagagacc gggttggcgg     7440

cgtatttgtg tcccaaaaaa cagccccaat tgccccggag aagacggcca ggccgcctag     7500

atgacaaatt caacaactca cagctgactt tctgccattg ccactagggg ggggcctttt     7560

tatatggcca agccaagctc tccacgtcgg ttgggctgca cccaacaata aatgggtagg     7620

gttgcaccaa caaagggatg ggatgggggg tagaagatac gaggataacg gggctcaatg     7680

gcacaaataa gaacgaatac tgccattaag actcgtgatc cagcgactga caccattgca     7740

tcatctaagg gcctcaaaac tacctcggaa ctgctgcgct gatctggaca ccacagaggt     7800

tccgagcact ttaggttgca ccaaatgtcc caccaggtgc aggcagaaaa cgctggaaca     7860

gcgtgtacag tttgtcttaa caaaaagtga gggcgctgag gtcgagcagg gtggtgtgac     7920

ttgttatagc ctttagagct gcgaaagcgc gtatggattt ggctcatcag gccagattga     7980

gggtctgtgg acacatgtca tgttagtgta cttcaatcgc cccctggata tagccccgac     8040

aataggccgt ggcctcattt ttttgccttc cgcacatttc cattgctcgg tacccacacc     8100

ttgcttctcc tgcacttgcc aaccttaata ctggtttaca ttgaccaaca tcttacaagc     8160

ggggggcttg tctagggtat atataaacag tggctctccc aatcggttgc cagtctcttt     8220

tttcctttct ttccccacag attcgaaatc taaactacac atcacacaat gcctgttact     8280

gacgtcctta agcgaaagtc cggtgtcatc gtcggcgacg atgtccgagc cgtgagtatc     8340

cacgacaaga tcagtgtcga gacgacgcgt tttgtgtaat gacacaatcc gaaagtcgct     8400

agcaacacac actctctaca caaactaacc cagctctc                             8438


<210>  39
<211>  8438
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pDMW367-M4

<400>  39
catggctctc agtcttacca cagaacagct gttagaacgc cctgatttgg ttgcgattga       60

tggcatcctc tacgaccttg aagggcttgc caaagttcat ccaggaggag atttgattct      120

cgcttctggt gcctctgatg cctcccctct cttttattca atgcatccat acgtcaaacc      180

ggagaactcc aaattgcttc aacagttcgt ccgagggaag catgaccgca cctcgaagga      240

cattgtctac acgtatgatt ctcccttcgc acaagacgtt aagcggacaa tgcgcgaggt      300

gatgaaaggg aggaactggt acgcaacccc tggcttctgg ctgcgcaccg ttgggatcat      360

cgccgtgacg gccttttgcg agtggcactg ggctaccacg gggatggtgc tgtggggcct      420

gttgactgga ttcatgcaca tgcagatcgg cttatccatc cagcatgatg cgtcccacgg      480

ggccatcagc aagaagcctt gggtcaacgc cctcttcgcc tacggcattg acgtcatcgg      540

atcgtcccgg tggatttggc tgcagtcgca catcatgcgg caccacacct acaccaacca      600

gcacggcctc gacctggatg cggagtcggc agagccgttc ctggtgttcc acaactaccc      660

cgccgcaaac accgcccgaa agtggttcca ccgcttccag gcttggtaca tgtaccttgt      720

gctgggggca tacggggtat cgctggtgta caacccgctc tacattttcc ggatgcagca      780

caatgacacc atcccagagt ctgtcacggc catgcgggaa aatggctttc tgcggcgcta      840

ccgcacactt gcattcgtga tgcgagcttt cttcatcttc cggaccgcat tcttgccctg      900

gtacctcact gggacctcat tgctgatcac cattcctctg gtgcccaccg caactggtgc      960

cttcttgacg ttcttcttca ttttgtccca caattttgat ggctccgaac ggatccccga     1020

caagaactgc aaggttaagc gatctgagaa ggacgttgag gctgaccaaa ttgactggta     1080

tcgggcgcag gtggagacgt cctccacata cggtggcccc atcgccatgt tcttcactgg     1140

cggtctcaat ttccagatcg agcaccacct ctttccccgg atgtcgtctt ggcactaccc     1200

cttcgtccag caggcggtcc gggagtgttg cgaacgacat ggagtgcgat atgttttcta     1260

ccctaccatc gtcggcaaca tcatctccac cctgaagtac atgcataagg tgggtgtcgt     1320

ccactgcgtg aaggacgcac aggattccta agcggccgca agtgtggatg gggaagtgag     1380

tgcccggttc tgtgtgcaca attggcaatc caagatggat ggattcaaca cagggatata     1440

gcgagctacg tggtggtgcg aggatatagc aacggatatt tatgtttgac acttgagaat     1500

gtacgataca agcactgtcc aagtacaata ctaaacatac tgtacatact catactcgta     1560

cccgggcaac ggtttcactt gagtgcagtg gctagtgctc ttactcgtac agtgtgcaat     1620

actgcgtatc atagtctttg atgtatatcg tattcattca tgttagttgc gtacgagccg     1680

gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt     1740

tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg     1800

gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg     1860

actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa     1920

tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc     1980

aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc     2040

ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat     2100

aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc     2160

cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct     2220

cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg     2280

aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc     2340

cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga     2400

ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa     2460

ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta     2520

gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc     2580

agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg     2640

acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga     2700

tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg     2760

agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct     2820

gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg     2880

agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc     2940

cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa     3000

ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc     3060

cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt     3120

cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc     3180

ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt     3240

tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc     3300

catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt     3360

gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata     3420

gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga     3480

tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag     3540

catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa     3600

aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt     3660

attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga     3720

aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgcgccct     3780

gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg     3840

ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg     3900

gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac     3960

ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct     4020

gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt     4080

tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt     4140

tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt     4200

ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca actgttggga     4260

agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc     4320

aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc     4380

cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc ccctcgaggt     4440

cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct tcgcctcaag     4500

gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat taattttcgg     4560

gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat atacatcatg     4620

atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc gcctccaact     4680

gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag actccatcta     4740

ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt acttagtatt     4800

attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa tttataatgg     4860

cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat gggaaatctt     4920

aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca gcaacgaaaa     4980

aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag aacagctatt     5040

cacacgttac tattgagatt attattggac gagaatcaca cactcaactg tctttctctc     5100

ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct agtcatttca     5160

tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca aattcaacaa     5220

ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc tctggtgtgc     5280

ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt tcttgttata     5340

taatcctttt gtttattaca tgggctggat acataaaggt attttgattt aattttttgc     5400

ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta ccatactttt     5460

gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga cgttccgcag     5520

aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg ctccctgaga     5580

tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta ctactgttga     5640

tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat gattcattac     5700

cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca attaatcata     5760

gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca tgctacttgg     5820

gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg acagtaatta     5880

attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt agttcaacgt     5940

attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc cattggacag     6000

atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag gtcgtctgac     6060

catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca cagttaaatt     6120

acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca gccagccttc     6180

tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc tcggccgaca     6240

attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg ctgtccgaga     6300

gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc ctcagagtcg     6360

cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga tcgggcaagc     6420

tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga cagctcggcc     6480

agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa ctccttgtac     6540

tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt ttcctcggca     6600

ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt ggtgatatcg     6660

gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc aatatctgcg     6720

aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt gagggggagc     6780

acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat catgcacaca     6840

taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac atccagagaa     6900

gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc aaaggcggac     6960

ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag gagactgaaa     7020

taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa gtatatgtta     7080

tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg ctatcggtcc     7140

aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa aatgtgatca     7200

tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg cgccgaaaac     7260

gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat ccaagcacac     7320

tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata ctcgtcgact     7380

caggcgacga cggaattcct gcagcccatc tgcagaattc aggagagacc gggttggcgg     7440

cgtatttgtg tcccaaaaaa cagccccaat tgccccggag aagacggcca ggccgcctag     7500

atgacaaatt caacaactca cagctgactt tctgccattg ccactagggg ggggcctttt     7560

tatatggcca agccaagctc tccacgtcgg ttgggctgca cccaacaata aatgggtagg     7620

gttgcaccaa caaagggatg ggatgggggg tagaagatac gaggataacg gggctcaatg     7680

gcacaaataa gaacgaatac tgccattaag actcgtgatc cagcgactga caccattgca     7740

tcatctaagg gcctcaaaac tacctcggaa ctgctgcgct gatctggaca ccacagaggt     7800

tccgagcact ttaggttgca ccaaatgtcc caccaggtgc aggcagaaaa cgctggaaca     7860

gcgtgtacag tttgtcttaa caaaaagtga gggcgctgag gtcgagcagg gtggtgtgac     7920

ttgttatagc ctttagagct gcgaaagcgc gtatggattt ggctcatcag gccagattga     7980

gggtctgtgg acacatgtca tgttagtgta cttcaatcgc cccctggata tagccccgac     8040

aataggccgt ggcctcattt ttttgccttc cgcacatttc cattgctcgg tacccacacc     8100

ttgcttctcc tgcacttgcc aaccttaata ctggtttaca ttgaccaaca tcttacaagc     8160

ggggggcttg tctagggtat atataaacag tggctctccc aatcggttgc cagtctcttt     8220

tttcctttct ttccccacag attcgaaatc taaactacac atcacacaat gcctgttact     8280

gacgtcctta agcgaaagtc cggtgtcatc gtcggcgacg atgtccgagc cgtgagtatc     8340

cacgacaaga tcagtgtcga gacgacgcgt tttgtgtaat gacacaatcc gaaagtcgct     8400

agcaacacac actctctaca caaactaacc cagctctc                             8438


<210>  40
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer YL 813

<400>  40
cgtcaaaccg gagaactcca aattgcttca a                                      31


<210>  41
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer YL 814

<400>  41
ttgaagcaat ttggagttct ccggtttgac g                                      31


<210>  42
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer YL 815

<400>  42
aactgcaagg ttaagcgatc tgagaaggac g                                      31


<210>  43
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer YL 816

<400>  43
cgtccttctc agatcgctta accttgcagt t                                      31


<210>  44
<211>  8438
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pDMW367-M2

<400>  44
catggctctc agtcttacca cagaacagct gttagaacgc cctgatttgg ttgcgattga       60

tggcatcctc tacgaccttg aagggcttgc caaagttcat ccaggaggag atttgattct      120

cgcttctggt gcctctgatg cctcccctct cttttattca atgcatccat acgtcaaacc      180

ggagaactcc aaattgcttc aacagttcgt ccgagggaag catgaccgca cctcgaagga      240

cattgtctac acgtatgatt ctcccttcgc acaagacgtt aagcggacaa tgcgcgaggt      300

gatgaaaggg aggaactggt acgcaacccc tggcttctgg ctgcgcaccg ttgggatcat      360

cgccgtgacg gccttttgcg agtggcactg ggctaccacg gggatggtgc tgtggggcct      420

gttgactgga ttcatgcaca tgcagatcgg cttatccatc cagcatgatg cgtcccacgg      480

ggccatcagc aagaagcctt gggtcaacgc cctcttcgcc tacggcattg acgtcatcgg      540

atcgtcccgg tggatttggc tgcagtcgca catcatgcgg caccacacct acaccaacca      600

gcacggcctc gacctggatg cggagtcggc agagccgttc ctggtgttcc acaactaccc      660

cgccgcaaac accgcccgaa agtggttcca ccgcttccaa gcttggtaca tgtaccttgt      720

gctgggggca tacggggtat cgctggtgta caacccgctc tacattttcc ggatgcagca      780

caatgacacc atcccagagt ctgtcacggc catgcgggaa aatggctttc tgcggcgcta      840

ccgcacactt gcattcgtga tgcgagcttt cttcatcttc cggaccgcat tcttgccctg      900

gtacctcact gggacctcat tgctgatcac cattcctctg gtgcccaccg caactggtgc      960

cttcttgacg ttcttcttca ttttgtccca caattttgat ggctccgaac ggatccccga     1020

caagaactgc aaggttaagc gatctgagaa ggacgttgag gctgaccaaa ttgactggta     1080

tcgggcgcag gtggagacgt cctccacata cggtggcccc atcgccatgt tcttcactgg     1140

cggtctcaat ttccagatcg agcaccacct ctttccccgg atgtcgtctt ggcactaccc     1200

cttcgtccag caggcggtcc gggagtgttg cgaacgccat ggagtgcgat atgttttcta     1260

ccctaccatc gtcggcaaca tcatctccac cctgaagtac atgcataagg tgggtgtcgt     1320

ccactgcgtg aaggacgcac aggattccta agcggccgca agtgtggatg gggaagtgag     1380

tgcccggttc tgtgtgcaca attggcaatc caagatggat ggattcaaca cagggatata     1440

gcgagctacg tggtggtgcg aggatatagc aacggatatt tatgtttgac acttgagaat     1500

gtacgataca agcactgtcc aagtacaata ctaaacatac tgtacatact catactcgta     1560

cccgggcaac ggtttcactt gagtgcagtg gctagtgctc ttactcgtac agtgtgcaat     1620

actgcgtatc atagtctttg atgtatatcg tattcattca tgttagttgc gtacgagccg     1680

gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt     1740

tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg     1800

gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg     1860

actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa     1920

tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc     1980

aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc     2040

ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat     2100

aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc     2160

cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct     2220

cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg     2280

aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc     2340

cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga     2400

ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa     2460

ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta     2520

gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc     2580

agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg     2640

acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga     2700

tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg     2760

agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct     2820

gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg     2880

agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc     2940

cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa     3000

ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc     3060

cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt     3120

cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc     3180

ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt     3240

tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc     3300

catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt     3360

gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata     3420

gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga     3480

tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag     3540

catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa     3600

aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt     3660

attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga     3720

aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgcgccct     3780

gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg     3840

ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg     3900

gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac     3960

ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct     4020

gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt     4080

tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt     4140

tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt     4200

ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca actgttggga     4260

agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc     4320

aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc     4380

cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc ccctcgaggt     4440

cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct tcgcctcaag     4500

gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat taattttcgg     4560

gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat atacatcatg     4620

atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc gcctccaact     4680

gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag actccatcta     4740

ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt acttagtatt     4800

attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa tttataatgg     4860

cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat gggaaatctt     4920

aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca gcaacgaaaa     4980

aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag aacagctatt     5040

cacacgttac tattgagatt attattggac gagaatcaca cactcaactg tctttctctc     5100

ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct agtcatttca     5160

tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca aattcaacaa     5220

ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc tctggtgtgc     5280

ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt tcttgttata     5340

taatcctttt gtttattaca tgggctggat acataaaggt attttgattt aattttttgc     5400

ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta ccatactttt     5460

gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga cgttccgcag     5520

aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg ctccctgaga     5580

tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta ctactgttga     5640

tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat gattcattac     5700

cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca attaatcata     5760

gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca tgctacttgg     5820

gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg acagtaatta     5880

attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt agttcaacgt     5940

attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc cattggacag     6000

atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag gtcgtctgac     6060

catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca cagttaaatt     6120

acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca gccagccttc     6180

tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc tcggccgaca     6240

attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg ctgtccgaga     6300

gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc ctcagagtcg     6360

cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga tcgggcaagc     6420

tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga cagctcggcc     6480

agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa ctccttgtac     6540

tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt ttcctcggca     6600

ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt ggtgatatcg     6660

gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc aatatctgcg     6720

aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt gagggggagc     6780

acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat catgcacaca     6840

taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac atccagagaa     6900

gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc aaaggcggac     6960

ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag gagactgaaa     7020

taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa gtatatgtta     7080

tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg ctatcggtcc     7140

aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa aatgtgatca     7200

tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg cgccgaaaac     7260

gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat ccaagcacac     7320

tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata ctcgtcgact     7380

caggcgacga cggaattcct gcagcccatc tgcagaattc aggagagacc gggttggcgg     7440

cgtatttgtg tcccaaaaaa cagccccaat tgccccggag aagacggcca ggccgcctag     7500

atgacaaatt caacaactca cagctgactt tctgccattg ccactagggg ggggcctttt     7560

tatatggcca agccaagctc tccacgtcgg ttgggctgca cccaacaata aatgggtagg     7620

gttgcaccaa caaagggatg ggatgggggg tagaagatac gaggataacg gggctcaatg     7680

gcacaaataa gaacgaatac tgccattaag actcgtgatc cagcgactga caccattgca     7740

tcatctaagg gcctcaaaac tacctcggaa ctgctgcgct gatctggaca ccacagaggt     7800

tccgagcact ttaggttgca ccaaatgtcc caccaggtgc aggcagaaaa cgctggaaca     7860

gcgtgtacag tttgtcttaa caaaaagtga gggcgctgag gtcgagcagg gtggtgtgac     7920

ttgttatagc ctttagagct gcgaaagcgc gtatggattt ggctcatcag gccagattga     7980

gggtctgtgg acacatgtca tgttagtgta cttcaatcgc cccctggata tagccccgac     8040

aataggccgt ggcctcattt ttttgccttc cgcacatttc cattgctcgg tacccacacc     8100

ttgcttctcc tgcacttgcc aaccttaata ctggtttaca ttgaccaaca tcttacaagc     8160

ggggggcttg tctagggtat atataaacag tggctctccc aatcggttgc cagtctcttt     8220

tttcctttct ttccccacag attcgaaatc taaactacac atcacacaat gcctgttact     8280

gacgtcctta agcgaaagtc cggtgtcatc gtcggcgacg atgtccgagc cgtgagtatc     8340

cacgacaaga tcagtgtcga gacgacgcgt tttgtgtaat gacacaatcc gaaagtcgct     8400

agcaacacac actctctaca caaactaacc cagctctc                             8438


<210>  45
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer YL 829

<400>  45
gttccaccgc ttccaggctt ggtacatgta c                                      31


<210>  46
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer YL 830

<400>  46
gtacatgtac caagcctgga agcggtggaa c                                      31


<210>  47
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer YL 831

<400>  47
ggagtgttgc gaacgacatg gagtgcgata tg                                     32


<210>  48
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer YL 832

<400>  48
catatcgcac tccatgtcgt tcgcaacact cc                                     32


<210>  49
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  49
ttatccatcc agcatgattg ctcccacggg gccatcagc                              39


<210>  50
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  50
gctgatggcc ccgtgggagc aatcatgctg gatggataa                              39


<210>  51
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  51
tatccatcca gcatgatgac tcccacgggg ccatcag                                37


<210>  52
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  52
ctgatggccc cgtgggagtc atcatgctgg atggata                                37


<210>  53
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  53
tatccatcca gcatgatgag tcccacgggg ccatcag                                37


<210>  54
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  54
ctgatggccc cgtgggactc atcatgctgg atggata                                37


<210>  55
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  55
ttatccatcc agcatgattt ctcccacggg gccatcagc                              39


<210>  56
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  56
gctgatggcc ccgtgggaga aatcatgctg gatggataa                              39


<210>  57
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  57
ttatccatcc agcatgatgg ctcccacggg gccatcagc                              39


<210>  58
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  58
gctgatggcc ccgtgggagc catcatgctg gatggataa                              39


<210>  59
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  59
tatccatcca gcatgatcac tcccacgggg ccatcag                                37


<210>  60
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  60
ctgatggccc cgtgggagtg atcatgctgg atggata                                37


<210>  61
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  61
tatccatcca gcatgatatc tcccacgggg ccatcag                                37


<210>  62
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  62
ctgatggccc cgtgggagat atcatgctgg atggata                                37


<210>  63
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  63
tatccatcca gcatgataag tcccacgggg ccatcag                                37


<210>  64
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  64
ctgatggccc cgtgggactt atcatgctgg atggata                                37


<210>  65
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  65
tatccatcca gcatgatctg tcccacgggg ccatcag                                37


<210>  66
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  66
ctgatggccc cgtgggacag atcatgctgg atggata                                37


<210>  67
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  67
tatccatcca gcatgatatg tcccacgggg ccatcag                                37


<210>  68
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  68
ctgatggccc cgtgggacat atcatgctgg atggata                                37


<210>  69
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  69
tatccatcca gcatgataac tcccacgggg ccatcag                                37


<210>  70
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  70
ctgatggccc cgtgggagtt atcatgctgg atggata                                37


<210>  71
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  71
tatccatcca gcatgatccc tcccacgggg ccatcag                                37


<210>  72
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  72
ctgatggccc cgtgggaggg atcatgctgg atggata                                37


<210>  73
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  73
tatccatcca gcatgatcaa tcccacgggg ccatcag                                37


<210>  74
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  74
ctgatggccc cgtgggattg atcatgctgg atggata                                37


<210>  75
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  75
tatccatcca gcatgatcga tcccacgggg ccatcag                                37


<210>  76
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  76
ctgatggccc cgtgggatcg atcatgctgg atggata                                37


<210>  77
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  77
ttatccatcc agcatgattc ctcccacggg gccatcagc                              39


<210>  78
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  78
gctgatggcc ccgtgggagg aatcatgctg gatggataa                              39


<210>  79
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  79
tatccatcca gcatgatacc tcccacgggg ccatcag                                37


<210>  80
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  80
ctgatggccc cgtgggaggt atcatgctgg atggata                                37


<210>  81
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  81
tatccatcca gcatgatgtc tcccacgggg ccatcag                                37


<210>  82
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  82
ctgatggccc cgtgggagac atcatgctgg atggata                                37


<210>  83
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  83
tatccatcca gcatgattgg tcccacgggg ccatcag                                37


<210>  84
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  84
ctgatggccc cgtgggacca atcatgctgg atggata                                37


<210>  85
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  85
ttatccatcc agcatgatta ctcccacggg gccatcagc                              39


<210>  86
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  86
gctgatggcc ccgtgggagt aatcatgctg gatggataa                              39


<210>  87
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  87

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Gly Ser His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  88
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  88

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ser Ser His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  89
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  89
tccatccagc atgatgcggc tcacggggcc atcagcaag                              39


<210>  90
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  90
cttgctgatg gccccgtgag ccgcatcatg ctggatgga                              39


<210>  91
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  91
tccatccagc atgatgcgtg ccacggggcc atcagcaag                              39


<210>  92
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  92
cttgctgatg gccccgtggc acgcatcatg ctggatgga                              39


<210>  93
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  93
tccatccagc atgatgcgga ccacggggcc atcagcaag                              39


<210>  94
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  94
cttgctgatg gccccgtggt ccgcatcatg ctggatgga                              39


<210>  95
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  95
tccatccagc atgatgcgga gcacggggcc atcagcaag                              39


<210>  96
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  96
cttgctgatg gccccgtgct ccgcatcatg ctggatgga                              39


<210>  97
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  97
tccatccagc atgatgcgtt ccacggggcc atcagcaag                              39


<210>  98
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  98
cttgctgatg gccccgtgga acgcatcatg ctggatgga                              39


<210>  99
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  99
tccatccagc atgatgcggg tcacggggcc atcagcaag                              39


<210>  100
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  100
cttgctgatg gccccgtgac ccgcatcatg ctggatgga                              39


<210>  101
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  101
tccatccagc atgatgcgca ccacggggcc atcagcaag                              39


<210>  102
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  102
cttgctgatg gccccgtggt gcgcatcatg ctggatgga                              39


<210>  103
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  103
tccatccagc atgatgcgat ccacggggcc atcagcaag                              39


<210>  104
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  104
cttgctgatg gccccgtgga tcgcatcatg ctggatgga                              39


<210>  105
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  105
tccatccagc atgatgcgaa gcacggggcc atcagcaag                              39


<210>  106
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  106
cttgctgatg gccccgtgct tcgcatcatg ctggatgga                              39


<210>  107
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  107
tccatccagc atgatgcgct gcacggggcc atcagcaag                              39


<210>  108
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  108
cttgctgatg gccccgtgca gcgcatcatg ctggatgga                              39


<210>  109
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  109
tccatccagc atgatgcgat gcacggggcc atcagcaag                              39


<210>  110
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  110
cttgctgatg gccccgtgca tcgcatcatg ctggatgga                              39


<210>  111
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  111
tccatccagc atgatgcgaa ccacggggcc atcagcaag                              39


<210>  112
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  112
cttgctgatg gccccgtggt tcgcatcatg ctggatgga                              39


<210>  113
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  113
tccatccagc atgatgcgcc tcacggggcc atcagcaag                              39


<210>  114
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  114
cttgctgatg gccccgtgag gcgcatcatg ctggatgga                              39


<210>  115
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  115
tccatccagc atgatgcgca gcacggggcc atcagcaag                              39


<210>  116
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  116
cttgctgatg gccccgtgct gcgcatcatg ctggatgga                              39


<210>  117
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  117
tccatccagc atgatgcgcg acacggggcc atcagcaag                              39


<210>  118
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  118
cttgctgatg gccccgtgtc gcgcatcatg ctggatgga                              39


<210>  119
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  119
tccatccagc atgatgcgac ccacggggcc atcagcaag                              39


<210>  120
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  120
cttgctgatg gccccgtggg tcgcatcatg ctggatgga                              39


<210>  121
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  121
tccatccagc atgatgcggt ccacggggcc atcagcaag                              39


<210>  122
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  122
cttgctgatg gccccgtgga ccgcatcatg ctggatgga                              39


<210>  123
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  123
tccatccagc atgatgcgtg gcacggggcc atcagcaag                              39


<210>  124
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  124
cttgctgatg gccccgtgcc acgcatcatg ctggatgga                              39


<210>  125
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  125
tccatccagc atgatgcgta ccacggggcc atcagcaag                              39


<210>  126
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  126
cttgctgatg gccccgtggt acgcatcatg ctggatgga                              39


<210>  127
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  127

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ala His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  128
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  128

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  129
<211>  8438
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pDMW367M4-157g

<400>  129
catggctctc agtcttacca cagaacagct gttagaacgc cctgatttgg ttgcgattga       60

tggcatcctc tacgaccttg aagggcttgc caaagttcat ccaggaggag atttgattct      120

cgcttctggt gcctctgatg cctcccctct cttttattca atgcatccat acgtcaaacc      180

ggagaactcc aaattgcttc aacagttcgt ccgagggaag catgaccgca cctcgaagga      240

cattgtctac acgtatgatt ctcccttcgc acaagacgtt aagcggacaa tgcgcgaggt      300

gatgaaaggg aggaactggt acgcaacccc tggcttctgg ctgcgcaccg ttgggatcat      360

cgccgtgacg gccttttgcg agtggcactg ggctaccacg gggatggtgc tgtggggcct      420

gttgactgga ttcatgcaca tgcagatcgg cttatccatc cagcatgatg gctcccacgg      480

ggccatcagc aagaagcctt gggtcaacgc cctcttcgcc tacggcattg acgtcatcgg      540

atcgtcccgg tggatttggc tgcagtcgca catcatgcgg caccacacct acaccaacca      600

gcacggcctc gacctggatg cggagtcggc agagccgttc ctggtgttcc acaactaccc      660

cgccgcaaac accgcccgaa agtggttcca ccgcttccag gcttggtaca tgtaccttgt      720

gctgggggca tacggggtat cgctggtgta caacccgctc tacattttcc ggatgcagca      780

caatgacacc atcccagagt ctgtcacggc catgcgggaa aatggctttc tgcggcgcta      840

ccgcacactt gcattcgtga tgcgagcttt cttcatcttc cggaccgcat tcttgccctg      900

gtacctcact gggacctcat tgctgatcac cattcctctg gtgcccaccg caactggtgc      960

cttcttgacg ttcttcttca ttttgtccca caattttgat ggctccgaac ggatccccga     1020

caagaactgc aaggttaagc gatctgagaa ggacgttgag gctgaccaaa ttgactggta     1080

tcgggcgcag gtggagacgt cctccacata cggtggcccc atcgccatgt tcttcactgg     1140

cggtctcaat ttccagatcg agcaccacct ctttccccgg atgtcgtctt ggcactaccc     1200

cttcgtccag caggcggtcc gggagtgttg cgaacgacat ggagtgcgat atgttttcta     1260

ccctaccatc gtcggcaaca tcatctccac cctgaagtac atgcataagg tgggtgtcgt     1320

ccactgcgtg aaggacgcac aggattccta agcggccgca agtgtggatg gggaagtgag     1380

tgcccggttc tgtgtgcaca attggcaatc caagatggat ggattcaaca cagggatata     1440

gcgagctacg tggtggtgcg aggatatagc aacggatatt tatgtttgac acttgagaat     1500

gtacgataca agcactgtcc aagtacaata ctaaacatac tgtacatact catactcgta     1560

cccgggcaac ggtttcactt gagtgcagtg gctagtgctc ttactcgtac agtgtgcaat     1620

actgcgtatc atagtctttg atgtatatcg tattcattca tgttagttgc gtacgagccg     1680

gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt     1740

tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg     1800

gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg     1860

actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa     1920

tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc     1980

aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc     2040

ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat     2100

aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc     2160

cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct     2220

cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg     2280

aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc     2340

cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga     2400

ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa     2460

ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta     2520

gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc     2580

agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg     2640

acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga     2700

tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg     2760

agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct     2820

gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg     2880

agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc     2940

cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa     3000

ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc     3060

cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt     3120

cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc     3180

ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt     3240

tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc     3300

catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt     3360

gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata     3420

gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga     3480

tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag     3540

catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa     3600

aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt     3660

attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga     3720

aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgcgccct     3780

gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg     3840

ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg     3900

gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac     3960

ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct     4020

gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt     4080

tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt     4140

tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt     4200

ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca actgttggga     4260

agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc     4320

aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc     4380

cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc ccctcgaggt     4440

cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct tcgcctcaag     4500

gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat taattttcgg     4560

gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat atacatcatg     4620

atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc gcctccaact     4680

gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag actccatcta     4740

ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt acttagtatt     4800

attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa tttataatgg     4860

cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat gggaaatctt     4920

aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca gcaacgaaaa     4980

aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag aacagctatt     5040

cacacgttac tattgagatt attattggac gagaatcaca cactcaactg tctttctctc     5100

ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct agtcatttca     5160

tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca aattcaacaa     5220

ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc tctggtgtgc     5280

ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt tcttgttata     5340

taatcctttt gtttattaca tgggctggat acataaaggt attttgattt aattttttgc     5400

ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta ccatactttt     5460

gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga cgttccgcag     5520

aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg ctccctgaga     5580

tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta ctactgttga     5640

tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat gattcattac     5700

cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca attaatcata     5760

gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca tgctacttgg     5820

gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg acagtaatta     5880

attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt agttcaacgt     5940

attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc cattggacag     6000

atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag gtcgtctgac     6060

catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca cagttaaatt     6120

acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca gccagccttc     6180

tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc tcggccgaca     6240

attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg ctgtccgaga     6300

gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc ctcagagtcg     6360

cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga tcgggcaagc     6420

tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga cagctcggcc     6480

agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa ctccttgtac     6540

tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt ttcctcggca     6600

ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt ggtgatatcg     6660

gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc aatatctgcg     6720

aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt gagggggagc     6780

acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat catgcacaca     6840

taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac atccagagaa     6900

gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc aaaggcggac     6960

ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag gagactgaaa     7020

taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa gtatatgtta     7080

tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg ctatcggtcc     7140

aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa aatgtgatca     7200

tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg cgccgaaaac     7260

gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat ccaagcacac     7320

tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata ctcgtcgact     7380

caggcgacga cggaattcct gcagcccatc tgcagaattc aggagagacc gggttggcgg     7440

cgtatttgtg tcccaaaaaa cagccccaat tgccccggag aagacggcca ggccgcctag     7500

atgacaaatt caacaactca cagctgactt tctgccattg ccactagggg ggggcctttt     7560

tatatggcca agccaagctc tccacgtcgg ttgggctgca cccaacaata aatgggtagg     7620

gttgcaccaa caaagggatg ggatgggggg tagaagatac gaggataacg gggctcaatg     7680

gcacaaataa gaacgaatac tgccattaag actcgtgatc cagcgactga caccattgca     7740

tcatctaagg gcctcaaaac tacctcggaa ctgctgcgct gatctggaca ccacagaggt     7800

tccgagcact ttaggttgca ccaaatgtcc caccaggtgc aggcagaaaa cgctggaaca     7860

gcgtgtacag tttgtcttaa caaaaagtga gggcgctgag gtcgagcagg gtggtgtgac     7920

ttgttatagc ctttagagct gcgaaagcgc gtatggattt ggctcatcag gccagattga     7980

gggtctgtgg acacatgtca tgttagtgta cttcaatcgc cccctggata tagccccgac     8040

aataggccgt ggcctcattt ttttgccttc cgcacatttc cattgctcgg tacccacacc     8100

ttgcttctcc tgcacttgcc aaccttaata ctggtttaca ttgaccaaca tcttacaagc     8160

ggggggcttg tctagggtat atataaacag tggctctccc aatcggttgc cagtctcttt     8220

tttcctttct ttccccacag attcgaaatc taaactacac atcacacaat gcctgttact     8280

gacgtcctta agcgaaagtc cggtgtcatc gtcggcgacg atgtccgagc cgtgagtatc     8340

cacgacaaga tcagtgtcga gacgacgcgt tttgtgtaat gacacaatcc gaaagtcgct     8400

agcaacacac actctctaca caaactaacc cagctctc                             8438


<210>  130
<211>  8438
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pDMW367M4-158a

<400>  130
catggctctc agtcttacca cagaacagct gttagaacgc cctgatttgg ttgcgattga       60

tggcatcctc tacgaccttg aagggcttgc caaagttcat ccaggaggag atttgattct      120

cgcttctggt gcctctgatg cctcccctct cttttattca atgcatccat acgtcaaacc      180

ggagaactcc aaattgcttc aacagttcgt ccgagggaag catgaccgca cctcgaagga      240

cattgtctac acgtatgatt ctcccttcgc acaagacgtt aagcggacaa tgcgcgaggt      300

gatgaaaggg aggaactggt acgcaacccc tggcttctgg ctgcgcaccg ttgggatcat      360

cgccgtgacg gccttttgcg agtggcactg ggctaccacg gggatggtgc tgtggggcct      420

gttgactgga ttcatgcaca tgcagatcgg cttatccatc cagcatgatg cggctcacgg      480

ggccatcagc aagaagcctt gggtcaacgc cctcttcgcc tacggcattg acgtcatcgg      540

atcgtcccgg tggatttggc tgcagtcgca catcatgcgg caccacacct acaccaacca      600

gcacggcctc gacctggatg cggagtcggc agagccgttc ctggtgttcc acaactaccc      660

cgccgcaaac accgcccgaa agtggttcca ccgcttccag gcttggtaca tgtaccttgt      720

gctgggggca tacggggtat cgctggtgta caacccgctc tacattttcc ggatgcagca      780

caatgacacc atcccagagt ctgtcacggc catgcgggaa aatggctttc tgcggcgcta      840

ccgcacactt gcattcgtga tgcgagcttt cttcatcttc cggaccgcat tcttgccctg      900

gtacctcact gggacctcat tgctgatcac cattcctctg gtgcccaccg caactggtgc      960

cttcttgacg ttcttcttca ttttgtccca caattttgat ggctccgaac ggatccccga     1020

caagaactgc aaggttaagc gatctgagaa ggacgttgag gctgaccaaa ttgactggta     1080

tcgggcgcag gtggagacgt cctccacata cggtggcccc atcgccatgt tcttcactgg     1140

cggtctcaat ttccagatcg agcaccacct ctttccccgg atgtcgtctt ggcactaccc     1200

cttcgtccag caggcggtcc gggagtgttg cgaacgacat ggagtgcgat atgttttcta     1260

ccctaccatc gtcggcaaca tcatctccac cctgaagtac atgcataagg tgggtgtcgt     1320

ccactgcgtg aaggacgcac aggattccta agcggccgca agtgtggatg gggaagtgag     1380

tgcccggttc tgtgtgcaca attggcaatc caagatggat ggattcaaca cagggatata     1440

gcgagctacg tggtggtgcg aggatatagc aacggatatt tatgtttgac acttgagaat     1500

gtacgataca agcactgtcc aagtacaata ctaaacatac tgtacatact catactcgta     1560

cccgggcaac ggtttcactt gagtgcagtg gctagtgctc ttactcgtac agtgtgcaat     1620

actgcgtatc atagtctttg atgtatatcg tattcattca tgttagttgc gtacgagccg     1680

gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt     1740

tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg     1800

gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg     1860

actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa     1920

tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc     1980

aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc     2040

ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat     2100

aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc     2160

cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct     2220

cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg     2280

aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc     2340

cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga     2400

ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa     2460

ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta     2520

gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc     2580

agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg     2640

acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga     2700

tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg     2760

agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct     2820

gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg     2880

agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc     2940

cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa     3000

ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc     3060

cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt     3120

cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc     3180

ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt     3240

tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc     3300

catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt     3360

gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata     3420

gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga     3480

tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag     3540

catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa     3600

aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt     3660

attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga     3720

aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgcgccct     3780

gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg     3840

ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg     3900

gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac     3960

ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct     4020

gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt     4080

tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt     4140

tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt     4200

ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca actgttggga     4260

agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc     4320

aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc     4380

cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc ccctcgaggt     4440

cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct tcgcctcaag     4500

gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat taattttcgg     4560

gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat atacatcatg     4620

atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc gcctccaact     4680

gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag actccatcta     4740

ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt acttagtatt     4800

attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa tttataatgg     4860

cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat gggaaatctt     4920

aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca gcaacgaaaa     4980

aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag aacagctatt     5040

cacacgttac tattgagatt attattggac gagaatcaca cactcaactg tctttctctc     5100

ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct agtcatttca     5160

tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca aattcaacaa     5220

ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc tctggtgtgc     5280

ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt tcttgttata     5340

taatcctttt gtttattaca tgggctggat acataaaggt attttgattt aattttttgc     5400

ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta ccatactttt     5460

gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga cgttccgcag     5520

aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg ctccctgaga     5580

tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta ctactgttga     5640

tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat gattcattac     5700

cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca attaatcata     5760

gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca tgctacttgg     5820

gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg acagtaatta     5880

attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt agttcaacgt     5940

attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc cattggacag     6000

atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag gtcgtctgac     6060

catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca cagttaaatt     6120

acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca gccagccttc     6180

tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc tcggccgaca     6240

attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg ctgtccgaga     6300

gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc ctcagagtcg     6360

cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga tcgggcaagc     6420

tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga cagctcggcc     6480

agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa ctccttgtac     6540

tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt ttcctcggca     6600

ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt ggtgatatcg     6660

gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc aatatctgcg     6720

aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt gagggggagc     6780

acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat catgcacaca     6840

taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac atccagagaa     6900

gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc aaaggcggac     6960

ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag gagactgaaa     7020

taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa gtatatgtta     7080

tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg ctatcggtcc     7140

aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa aatgtgatca     7200

tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg cgccgaaaac     7260

gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat ccaagcacac     7320

tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata ctcgtcgact     7380

caggcgacga cggaattcct gcagcccatc tgcagaattc aggagagacc gggttggcgg     7440

cgtatttgtg tcccaaaaaa cagccccaat tgccccggag aagacggcca ggccgcctag     7500

atgacaaatt caacaactca cagctgactt tctgccattg ccactagggg ggggcctttt     7560

tatatggcca agccaagctc tccacgtcgg ttgggctgca cccaacaata aatgggtagg     7620

gttgcaccaa caaagggatg ggatgggggg tagaagatac gaggataacg gggctcaatg     7680

gcacaaataa gaacgaatac tgccattaag actcgtgatc cagcgactga caccattgca     7740

tcatctaagg gcctcaaaac tacctcggaa ctgctgcgct gatctggaca ccacagaggt     7800

tccgagcact ttaggttgca ccaaatgtcc caccaggtgc aggcagaaaa cgctggaaca     7860

gcgtgtacag tttgtcttaa caaaaagtga gggcgctgag gtcgagcagg gtggtgtgac     7920

ttgttatagc ctttagagct gcgaaagcgc gtatggattt ggctcatcag gccagattga     7980

gggtctgtgg acacatgtca tgttagtgta cttcaatcgc cccctggata tagccccgac     8040

aataggccgt ggcctcattt ttttgccttc cgcacatttc cattgctcgg tacccacacc     8100

ttgcttctcc tgcacttgcc aaccttaata ctggtttaca ttgaccaaca tcttacaagc     8160

ggggggcttg tctagggtat atataaacag tggctctccc aatcggttgc cagtctcttt     8220

tttcctttct ttccccacag attcgaaatc taaactacac atcacacaat gcctgttact     8280

gacgtcctta agcgaaagtc cggtgtcatc gtcggcgacg atgtccgagc cgtgagtatc     8340

cacgacaaga tcagtgtcga gacgacgcgt tttgtgtaat gacacaatcc gaaagtcgct     8400

agcaacacac actctctaca caaactaacc cagctctc                             8438


<210>  131
<211>  8438
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pDMW367M4-158g

<400>  131
catggctctc agtcttacca cagaacagct gttagaacgc cctgatttgg ttgcgattga       60

tggcatcctc tacgaccttg aagggcttgc caaagttcat ccaggaggag atttgattct      120

cgcttctggt gcctctgatg cctcccctct cttttattca atgcatccat acgtcaaacc      180

ggagaactcc aaattgcttc aacagttcgt ccgagggaag catgaccgca cctcgaagga      240

cattgtctac acgtatgatt ctcccttcgc acaagacgtt aagcggacaa tgcgcgaggt      300

gatgaaaggg aggaactggt acgcaacccc tggcttctgg ctgcgcaccg ttgggatcat      360

cgccgtgacg gccttttgcg agtggcactg ggctaccacg gggatggtgc tgtggggcct      420

gttgactgga ttcatgcaca tgcagatcgg cttatccatc cagcatgatg cgggtcacgg      480

ggccatcagc aagaagcctt gggtcaacgc cctcttcgcc tacggcattg acgtcatcgg      540

atcgtcccgg tggatttggc tgcagtcgca catcatgcgg caccacacct acaccaacca      600

gcacggcctc gacctggatg cggagtcggc agagccgttc ctggtgttcc acaactaccc      660

cgccgcaaac accgcccgaa agtggttcca ccgcttccag gcttggtaca tgtaccttgt      720

gctgggggca tacggggtat cgctggtgta caacccgctc tacattttcc ggatgcagca      780

caatgacacc atcccagagt ctgtcacggc catgcgggaa aatggctttc tgcggcgcta      840

ccgcacactt gcattcgtga tgcgagcttt cttcatcttc cggaccgcat tcttgccctg      900

gtacctcact gggacctcat tgctgatcac cattcctctg gtgcccaccg caactggtgc      960

cttcttgacg ttcttcttca ttttgtccca caattttgat ggctccgaac ggatccccga     1020

caagaactgc aaggttaagc gatctgagaa ggacgttgag gctgaccaaa ttgactggta     1080

tcgggcgcag gtggagacgt cctccacata cggtggcccc atcgccatgt tcttcactgg     1140

cggtctcaat ttccagatcg agcaccacct ctttccccgg atgtcgtctt ggcactaccc     1200

cttcgtccag caggcggtcc gggagtgttg cgaacgacat ggagtgcgat atgttttcta     1260

ccctaccatc gtcggcaaca tcatctccac cctgaagtac atgcataagg tgggtgtcgt     1320

ccactgcgtg aaggacgcac aggattccta agcggccgca agtgtggatg gggaagtgag     1380

tgcccggttc tgtgtgcaca attggcaatc caagatggat ggattcaaca cagggatata     1440

gcgagctacg tggtggtgcg aggatatagc aacggatatt tatgtttgac acttgagaat     1500

gtacgataca agcactgtcc aagtacaata ctaaacatac tgtacatact catactcgta     1560

cccgggcaac ggtttcactt gagtgcagtg gctagtgctc ttactcgtac agtgtgcaat     1620

actgcgtatc atagtctttg atgtatatcg tattcattca tgttagttgc gtacgagccg     1680

gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt     1740

tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg     1800

gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg     1860

actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa     1920

tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc     1980

aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc     2040

ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat     2100

aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc     2160

cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct     2220

cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg     2280

aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc     2340

cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga     2400

ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa     2460

ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta     2520

gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc     2580

agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg     2640

acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga     2700

tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg     2760

agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct     2820

gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg     2880

agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc     2940

cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa     3000

ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc     3060

cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt     3120

cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc     3180

ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt     3240

tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc     3300

catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt     3360

gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata     3420

gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga     3480

tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag     3540

catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa     3600

aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt     3660

attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga     3720

aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgcgccct     3780

gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg     3840

ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg     3900

gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac     3960

ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct     4020

gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt     4080

tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt     4140

tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt     4200

ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca actgttggga     4260

agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc     4320

aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc     4380

cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc ccctcgaggt     4440

cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct tcgcctcaag     4500

gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat taattttcgg     4560

gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat atacatcatg     4620

atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc gcctccaact     4680

gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag actccatcta     4740

ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt acttagtatt     4800

attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa tttataatgg     4860

cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat gggaaatctt     4920

aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca gcaacgaaaa     4980

aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag aacagctatt     5040

cacacgttac tattgagatt attattggac gagaatcaca cactcaactg tctttctctc     5100

ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct agtcatttca     5160

tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca aattcaacaa     5220

ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc tctggtgtgc     5280

ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt tcttgttata     5340

taatcctttt gtttattaca tgggctggat acataaaggt attttgattt aattttttgc     5400

ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta ccatactttt     5460

gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga cgttccgcag     5520

aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg ctccctgaga     5580

tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta ctactgttga     5640

tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat gattcattac     5700

cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca attaatcata     5760

gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca tgctacttgg     5820

gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg acagtaatta     5880

attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt agttcaacgt     5940

attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc cattggacag     6000

atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag gtcgtctgac     6060

catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca cagttaaatt     6120

acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca gccagccttc     6180

tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc tcggccgaca     6240

attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg ctgtccgaga     6300

gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc ctcagagtcg     6360

cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga tcgggcaagc     6420

tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga cagctcggcc     6480

agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa ctccttgtac     6540

tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt ttcctcggca     6600

ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt ggtgatatcg     6660

gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc aatatctgcg     6720

aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt gagggggagc     6780

acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat catgcacaca     6840

taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac atccagagaa     6900

gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc aaaggcggac     6960

ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag gagactgaaa     7020

taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa gtatatgtta     7080

tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg ctatcggtcc     7140

aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa aatgtgatca     7200

tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg cgccgaaaac     7260

gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat ccaagcacac     7320

tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata ctcgtcgact     7380

caggcgacga cggaattcct gcagcccatc tgcagaattc aggagagacc gggttggcgg     7440

cgtatttgtg tcccaaaaaa cagccccaat tgccccggag aagacggcca ggccgcctag     7500

atgacaaatt caacaactca cagctgactt tctgccattg ccactagggg ggggcctttt     7560

tatatggcca agccaagctc tccacgtcgg ttgggctgca cccaacaata aatgggtagg     7620

gttgcaccaa caaagggatg ggatgggggg tagaagatac gaggataacg gggctcaatg     7680

gcacaaataa gaacgaatac tgccattaag actcgtgatc cagcgactga caccattgca     7740

tcatctaagg gcctcaaaac tacctcggaa ctgctgcgct gatctggaca ccacagaggt     7800

tccgagcact ttaggttgca ccaaatgtcc caccaggtgc aggcagaaaa cgctggaaca     7860

gcgtgtacag tttgtcttaa caaaaagtga gggcgctgag gtcgagcagg gtggtgtgac     7920

ttgttatagc ctttagagct gcgaaagcgc gtatggattt ggctcatcag gccagattga     7980

gggtctgtgg acacatgtca tgttagtgta cttcaatcgc cccctggata tagccccgac     8040

aataggccgt ggcctcattt ttttgccttc cgcacatttc cattgctcgg tacccacacc     8100

ttgcttctcc tgcacttgcc aaccttaata ctggtttaca ttgaccaaca tcttacaagc     8160

ggggggcttg tctagggtat atataaacag tggctctccc aatcggttgc cagtctcttt     8220

tttcctttct ttccccacag attcgaaatc taaactacac atcacacaat gcctgttact     8280

gacgtcctta agcgaaagtc cggtgtcatc gtcggcgacg atgtccgagc cgtgagtatc     8340

cacgacaaga tcagtgtcga gacgacgcgt tttgtgtaat gacacaatcc gaaagtcgct     8400

agcaacacac actctctaca caaactaacc cagctctc                             8438


<210>  132
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  132
ggcttgccaa agttcatggc ggaggagatt tgattctc                               38


<210>  133
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  133
gagaatcaaa tctcctccgc catgaacttt ggcaagcc                               38


<210>  134
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  134
ggcttgccaa agttcatcac ggaggagatt tgattctc                               38


<210>  135
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  135
gagaatcaaa tctcctccgt gatgaacttt ggcaagcc                               38


<210>  136
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  136
ccaaagttca tccaggatcc gatttgattc tcgcttct                               38


<210>  137
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  137
agaagcgaga atcaaatcgg atcctggatg aactttgg                               38


<210>  138
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)

<400>  138
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat ggt gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aac tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat ggc tcc cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Gly Ser His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc cag gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag cga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cga cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  139
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  139

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Gly Ser His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  140
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)

<400>  140
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat ggt gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aac tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg gct cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ala His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc cag gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag cga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cga cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  141
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  141

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ala His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  142
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)

<400>  142
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat ggt gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aac tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg ggt cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc cag gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag cga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cga cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  143
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  143

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  144
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)

<400>  144
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat cac gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His His Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aac tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg gct cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ala His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc cag gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag cga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cga cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  145
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  145

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His His Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ala His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  146
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)

<400>  146
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat cac gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His His Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aac tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg ggt cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc cag gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag cga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cga cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  147
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  147

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His His Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  148
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)

<400>  148
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat cca gga tcc gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aac tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg gct cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ala His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc cag gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag cga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cga cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  149
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  149

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ala His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  150
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)

<400>  150
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat cca gga tcc gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aac tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg ggt cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc cag gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag cga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cga cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  151
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  151

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  152
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)
<223>  EgD5M (i.e., EgD5R*-34g158g) mutant delta-5 desaturase

<400>  152
atg gcc ctg tct ctc act acc gaa cag ctc ctg gag cga cct gat ctc         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtc gct atc gat ggt atc ctg tac gac ctc gag ggc ctg gcc aag gtg         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat ggt ggt gga gac ctc att ctg gcc tct gga gcc tcc gac gcc tct        144
His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

ccc ctc ttc tac tct atg cat ccc tac gtc aag ccc gag aac tcc aag        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ctc ctg cag caa ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg ggt cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc cag gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag cga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cga cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  153
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  153

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  154
<211>  4070
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pEgD5M

<400>  154
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc      240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat      300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt      360

tttcccagtc acgacgttgt aaaacgacgg ccagtgccat ggccctgtct ctcactaccg      420

aacagctcct ggagcgacct gatctcgtcg ctatcgatgg tatcctgtac gacctcgagg      480

gcctggccaa ggtgcatggt ggtggagacc tcattctggc ctctggagcc tccgacgcct      540

ctcccctctt ctactctatg catccctacg tcaagcccga gaactccaag ctcctgcagc      600

aattcgtccg agggaagcat gaccgcacct cgaaggacat tgtctacacg tatgattctc      660

ccttcgcaca agacgttaag cggacaatgc gcgaggtgat gaaagggagg aactggtacg      720

caacccctgg cttctggctg cgcaccgttg ggatcatcgc cgtgacggcc ttttgcgagt      780

ggcactgggc taccacgggg atggtgctgt ggggcctgtt gactggattc atgcacatgc      840

agatcggctt atccatccag catgatgcgg gtcacggggc catcagcaag aagccttggg      900

tcaacgccct cttcgcctac ggcattgacg tcatcggatc gtcccggtgg atttggctgc      960

agtcgcacat catgcggcac cacacctaca ccaaccagca cggcctcgac ctggatgcgg     1020

agtcggcaga gccgttcctg gtgttccaca actaccccgc cgcaaacacc gcccgaaagt     1080

ggttccaccg cttccaggct tggtacatgt accttgtgct gggggcatac ggggtatcgc     1140

tggtgtacaa cccgctctac attttccgga tgcagcacaa tgacaccatc ccagagtctg     1200

tcacggccat gcgggaaaat ggctttctgc ggcgctaccg cacacttgca ttcgtgatgc     1260

gagctttctt catcttccgg accgcattct tgccctggta cctcactggg acctcattgc     1320

tgatcaccat tcctctggtg cccaccgcaa ctggtgcctt cttgacgttc ttcttcattt     1380

tgtcccacaa ttttgatggc tccgaacgga tccccgacaa gaactgcaag gttaagcgat     1440

ctgagaagga cgttgaggct gaccaaattg actggtatcg ggcgcaggtg gagacgtcct     1500

ccacatacgg tggccccatc gccatgttct tcactggcgg tctcaatttc cagatcgagc     1560

accacctctt tccccggatg tcgtcttggc actacccctt cgtccagcag gcggtccggg     1620

agtgttgcga acgacatgga gtgcgatatg ttttctaccc taccatcgtc ggcaacatca     1680

tctccaccct gaagtacatg cataaggtgg gtgtcgtcca ctgcgtgaag gacgcacagg     1740

attcctaagc ggccgcaatt cgagctcggt acctcgcgaa tgcatctaga tatcggatcc     1800

cgggcccgtc gactgcagag gcctgcatgc aagcttggcg taatcatggt catagctgtt     1860

tcctgtgtga aattgttatc cgctcacaat tccacacaac atacgagccg gaagcataaa     1920

gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt tgcgctcact     1980

gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc     2040

ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg     2100

ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc     2160

cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag     2220

gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca     2280

tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca     2340

ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg     2400

atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag     2460

gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt     2520

tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca     2580

cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg     2640

cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt     2700

tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc     2760

cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg     2820

cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg     2880

gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta     2940

gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg     3000

gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg     3060

ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc     3120

atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc     3180

agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc     3240

ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag     3300

tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat     3360

ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg     3420

caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt     3480

gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag     3540

atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg     3600

accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt     3660

aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct     3720

gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac     3780

tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat     3840

aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat     3900

ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca     3960

aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat     4020

tatcatgaca ttaacctata aaaataggcg tatcacgagg ccctttcgtc                4070


<210>  155
<211>  8438
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pDMW367-5M

<400>  155
ggccgcaagt gtggatgggg aagtgagtgc ccggttctgt gtgcacaatt ggcaatccaa       60

gatggatgga ttcaacacag ggatatagcg agctacgtgg tggtgcgagg atatagcaac      120

ggatatttat gtttgacact tgagaatgta cgatacaagc actgtccaag tacaatacta      180

aacatactgt acatactcat actcgtaccc gggcaacggt ttcacttgag tgcagtggct      240

agtgctctta ctcgtacagt gtgcaatact gcgtatcata gtctttgatg tatatcgtat      300

tcattcatgt tagttgcgta cgagccggaa gcataaagtg taaagcctgg ggtgcctaat      360

gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc      420

tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg      480

ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag      540

cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag      600

gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc      660

tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc      720

agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc      780

tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt      840

cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg      900

ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat      960

ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag     1020

ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt     1080

ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc     1140

cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta     1200

gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag     1260

atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga     1320

ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa     1380

gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa     1440

tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc     1500

ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga     1560

taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa     1620

gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt     1680

gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg     1740

ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc     1800

aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg     1860

gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag     1920

cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt     1980

actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt     2040

caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac     2100

gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac     2160

ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag     2220

caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa     2280

tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga     2340

gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc     2400

cccgaaaagt gccacctgac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg     2460

ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct     2520

tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc     2580

ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg     2640

atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt     2700

ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg     2760

tctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc     2820

tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgcttaca atttccattc     2880

gccattcagg ctgcgcaact gttgggaagg gcgatcggtg cgggcctctt cgctattacg     2940

ccagctggcg aaagggggat gtgctgcaag gcgattaagt tgggtaacgc cagggttttc     3000

ccagtcacga cgttgtaaaa cgacggccag tgaattgtaa tacgactcac tatagggcga     3060

attgggtacc gggccccccc tcgaggtcga tggtgtcgat aagcttgata tcgaattcat     3120

gtcacacaaa ccgatcttcg cctcaaggaa acctaattct acatccgaga gactgccgag     3180

atccagtcta cactgattaa ttttcgggcc aataatttaa aaaaatcgtg ttatataata     3240

ttatatgtat tatatatata catcatgatg atactgacag tcatgtccca ttgctaaata     3300

gacagactcc atctgccgcc tccaactgat gttctcaata tttaaggggt catctcgcat     3360

tgtttaataa taaacagact ccatctaccg cctccaaatg atgttctcaa aatatattgt     3420

atgaacttat ttttattact tagtattatt agacaactta cttgctttat gaaaaacact     3480

tcctatttag gaaacaattt ataatggcag ttcgttcatt taacaattta tgtagaataa     3540

atgttataaa tgcgtatggg aaatcttaaa tatggatagc ataaatgata tctgcattgc     3600

ctaattcgaa atcaacagca acgaaaaaaa tcccttgtac aacataaata gtcatcgaga     3660

aatatcaact atcaaagaac agctattcac acgttactat tgagattatt attggacgag     3720

aatcacacac tcaactgtct ttctctcttc tagaaataca ggtacaagta tgtactattc     3780

tcattgttca tacttctagt catttcatcc cacatattcc ttggatttct ctccaatgaa     3840

tgacattcta tcttgcaaat tcaacaatta taataagata taccaaagta gcggtatagt     3900

ggcaatcaaa aagcttctct ggtgtgcttc tcgtatttat ttttattcta atgatccatt     3960

aaaggtatat atttatttct tgttatataa tccttttgtt tattacatgg gctggataca     4020

taaaggtatt ttgatttaat tttttgctta aattcaatcc cccctcgttc agtgtcaact     4080

gtaatggtag gaaattacca tacttttgaa gaagcaaaaa aaatgaaaga aaaaaaaaat     4140

cgtatttcca ggttagacgt tccgcagaat ctagaatgcg gtatgcggta cattgttctt     4200

cgaacgtaaa agttgcgctc cctgagatat tgtacatttt tgcttttaca agtacaagta     4260

catcgtacaa ctatgtacta ctgttgatgc atccacaaca gtttgttttg tttttttttg     4320

tttttttttt ttctaatgat tcattaccgc tatgtatacc tacttgtact tgtagtaagc     4380

cgggttattg gcgttcaatt aatcatagac ttatgaatct gcacggtgtg cgctgcgagt     4440

tacttttagc ttatgcatgc tacttgggtg taatattggg atctgttcgg aaatcaacgg     4500

atgctcaatc gatttcgaca gtaattaatt aagtcataca caagtcagct ttcttcgagc     4560

ctcatataag tataagtagt tcaacgtatt agcactgtac ccagcatctc cgtatcgaga     4620

aacacaacaa catgccccat tggacagatc atgcggatac acaggttgtg cagtatcata     4680

catactcgat cagacaggtc gtctgaccat catacaagct gaacaagcgc tccatacttg     4740

cacgctctct atatacacag ttaaattaca tatccatagt ctaacctcta acagttaatc     4800

ttctggtaag cctcccagcc agccttctgg tatcgcttgg cctcctcaat aggatctcgg     4860

ttctggccgt acagacctcg gccgacaatt atgatatccg ttccggtaga catgacatcc     4920

tcaacagttc ggtactgctg tccgagagcg tctcccttgt cgtcaagacc caccccgggg     4980

gtcagaataa gccagtcctc agagtcgccc ttaggtcggt tctgggcaat gaagccaacc     5040

acaaactcgg ggtcggatcg ggcaagctca atggtctgct tggagtactc gccagtggcc     5100

agagagccct tgcaagacag ctcggccagc atgagcagac ctctggccag cttctcgttg     5160

ggagagggga ctaggaactc cttgtactgg gagttctcgt agtcagagac gtcctccttc     5220

ttctgttcag agacagtttc ctcggcacca gctcgcaggc cagcaatgat tccggttccg     5280

ggtacaccgt gggcgttggt gatatcggac cactcggcga ttcggtgaca ccggtactgg     5340

tgcttgacag tgttgccaat atctgcgaac tttctgtcct cgaacaggaa gaaaccgtgc     5400

ttaagagcaa gttccttgag ggggagcaca gtgccggcgt aggtgaagtc gtcaatgatg     5460

tcgatatggg ttttgatcat gcacacataa ggtccgacct tatcggcaag ctcaatgagc     5520

tccttggtgg tggtaacatc cagagaagca cacaggttgg ttttcttggc tgccacgagc     5580

ttgagcactc gagcggcaaa ggcggacttg tggacgttag ctcgagcttc gtaggagggc     5640

attttggtgg tgaagaggag actgaaataa atttagtctg cagaactttt tatcggaacc     5700

ttatctgggg cagtgaagta tatgttatgg taatagttac gagttagttg aacttataga     5760

tagactggac tatacggcta tcggtccaaa ttagaaagaa cgtcaatggc tctctgggcg     5820

tcgcctttgc cgacaaaaat gtgatcatga tgaaagccag caatgacgtt gcagctgata     5880

ttgttgtcgg ccaaccgcgc cgaaaacgca gctgtcagac ccacagcctc caacgaagaa     5940

tgtatcgtca aagtgatcca agcacactca tagttggagt cgtactccaa aggcggcaat     6000

gacgagtcag acagatactc gtcgactcag gcgacgacgg aattcctgca gcccatctgc     6060

agaattcagg agagaccggg ttggcggcgt atttgtgtcc caaaaaacag ccccaattgc     6120

cccggagaag acggccaggc cgcctagatg acaaattcaa caactcacag ctgactttct     6180

gccattgcca ctaggggggg gcctttttat atggccaagc caagctctcc acgtcggttg     6240

ggctgcaccc aacaataaat gggtagggtt gcaccaacaa agggatggga tggggggtag     6300

aagatacgag gataacgggg ctcaatggca caaataagaa cgaatactgc cattaagact     6360

cgtgatccag cgactgacac cattgcatca tctaagggcc tcaaaactac ctcggaactg     6420

ctgcgctgat ctggacacca cagaggttcc gagcacttta ggttgcacca aatgtcccac     6480

caggtgcagg cagaaaacgc tggaacagcg tgtacagttt gtcttaacaa aaagtgaggg     6540

cgctgaggtc gagcagggtg gtgtgacttg ttatagcctt tagagctgcg aaagcgcgta     6600

tggatttggc tcatcaggcc agattgaggg tctgtggaca catgtcatgt tagtgtactt     6660

caatcgcccc ctggatatag ccccgacaat aggccgtggc ctcatttttt tgccttccgc     6720

acatttccat tgctcggtac ccacaccttg cttctcctgc acttgccaac cttaatactg     6780

gtttacattg accaacatct tacaagcggg gggcttgtct agggtatata taaacagtgg     6840

ctctcccaat cggttgccag tctctttttt cctttctttc cccacagatt cgaaatctaa     6900

actacacatc acacaatgcc tgttactgac gtccttaagc gaaagtccgg tgtcatcgtc     6960

ggcgacgatg tccgagccgt gagtatccac gacaagatca gtgtcgagac gacgcgtttt     7020

gtgtaatgac acaatccgaa agtcgctagc aacacacact ctctacacaa actaacccag     7080

ctctccatgg ccctgtctct cactaccgaa cagctcctgg agcgacctga tctcgtcgct     7140

atcgatggta tcctgtacga cctcgagggc ctggccaagg tgcatggtgg tggagacctc     7200

attctggcct ctggagcctc cgacgcctct cccctcttct actctatgca tccctacgtc     7260

aagcccgaga actccaagct cctgcagcaa ttcgtccgag ggaagcatga ccgcacctcg     7320

aaggacattg tctacacgta tgattctccc ttcgcacaag acgttaagcg gacaatgcgc     7380

gaggtgatga aagggaggaa ctggtacgca acccctggct tctggctgcg caccgttggg     7440

atcatcgccg tgacggcctt ttgcgagtgg cactgggcta ccacggggat ggtgctgtgg     7500

ggcctgttga ctggattcat gcacatgcag atcggcttat ccatccagca tgatgcgggt     7560

cacggggcca tcagcaagaa gccttgggtc aacgccctct tcgcctacgg cattgacgtc     7620

atcggatcgt cccggtggat ttggctgcag tcgcacatca tgcggcacca cacctacacc     7680

aaccagcacg gcctcgacct ggatgcggag tcggcagagc cgttcctggt gttccacaac     7740

taccccgccg caaacaccgc ccgaaagtgg ttccaccgct tccaggcttg gtacatgtac     7800

cttgtgctgg gggcatacgg ggtatcgctg gtgtacaacc cgctctacat tttccggatg     7860

cagcacaatg acaccatccc agagtctgtc acggccatgc gggaaaatgg ctttctgcgg     7920

cgctaccgca cacttgcatt cgtgatgcga gctttcttca tcttccggac cgcattcttg     7980

ccctggtacc tcactgggac ctcattgctg atcaccattc ctctggtgcc caccgcaact     8040

ggtgccttct tgacgttctt cttcattttg tcccacaatt ttgatggctc cgaacggatc     8100

cccgacaaga actgcaaggt taagcgatct gagaaggacg ttgaggctga ccaaattgac     8160

tggtatcggg cgcaggtgga gacgtcctcc acatacggtg gccccatcgc catgttcttc     8220

actggcggtc tcaatttcca gatcgagcac cacctctttc cccggatgtc gtcttggcac     8280

taccccttcg tccagcaggc ggtccgggag tgttgcgaac gacatggagt gcgatatgtt     8340

ttctacccta ccatcgtcgg caacatcatc tccaccctga agtacatgca taaggtgggt     8400

gtcgtccact gcgtgaagga cgcacaggat tcctaagc                             8438


<210>  156
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)
<223>  EgD5M1 (i.e., EgD5R*-34g158g347s) mutant delta-5 desaturase

<400>  156
atg gcc ctg tct ctc act acc gaa cag ctc ctg gag cga cct gat ctc         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtc gct atc gat ggt atc ctg tac gac ctc gag ggc ctg gcc aag gtg         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat ggt ggt gga gac ctc att ctg gcc tct gga gcc tcc gac gcc tct        144
His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

ccc ctc ttc tac tct atg cat ccc tac gtc aag ccc gag aac tcc aag        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ctc ctg cag caa ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg ggt cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc cag gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag agc tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cga cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  157
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  157

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  158
<211>  4070
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pEgD5M1

<400>  158
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc      240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat      300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt      360

tttcccagtc acgacgttgt aaaacgacgg ccagtgccat ggccctgtct ctcactaccg      420

aacagctcct ggagcgacct gatctcgtcg ctatcgatgg tatcctgtac gacctcgagg      480

gcctggccaa ggtgcatggt ggtggagacc tcattctggc ctctggagcc tccgacgcct      540

ctcccctctt ctactctatg catccctacg tcaagcccga gaactccaag ctcctgcagc      600

aattcgtccg agggaagcat gaccgcacct cgaaggacat tgtctacacg tatgattctc      660

ccttcgcaca agacgttaag cggacaatgc gcgaggtgat gaaagggagg aactggtacg      720

caacccctgg cttctggctg cgcaccgttg ggatcatcgc cgtgacggcc ttttgcgagt      780

ggcactgggc taccacgggg atggtgctgt ggggcctgtt gactggattc atgcacatgc      840

agatcggctt atccatccag catgatgcgg gtcacggggc catcagcaag aagccttggg      900

tcaacgccct cttcgcctac ggcattgacg tcatcggatc gtcccggtgg atttggctgc      960

agtcgcacat catgcggcac cacacctaca ccaaccagca cggcctcgac ctggatgcgg     1020

agtcggcaga gccgttcctg gtgttccaca actaccccgc cgcaaacacc gcccgaaagt     1080

ggttccaccg cttccaggct tggtacatgt accttgtgct gggggcatac ggggtatcgc     1140

tggtgtacaa cccgctctac attttccgga tgcagcacaa tgacaccatc ccagagtctg     1200

tcacggccat gcgggaaaat ggctttctgc ggcgctaccg cacacttgca ttcgtgatgc     1260

gagctttctt catcttccgg accgcattct tgccctggta cctcactggg acctcattgc     1320

tgatcaccat tcctctggtg cccaccgcaa ctggtgcctt cttgacgttc ttcttcattt     1380

tgtcccacaa ttttgatggc tccgaacgga tccccgacaa gaactgcaag gttaagagct     1440

ctgagaagga cgttgaggct gaccaaattg actggtatcg ggcgcaggtg gagacgtcct     1500

ccacatacgg tggccccatc gccatgttct tcactggcgg tctcaatttc cagatcgagc     1560

accacctctt tccccggatg tcgtcttggc actacccctt cgtccagcag gcggtccggg     1620

agtgttgcga acgacatgga gtgcgatatg ttttctaccc taccatcgtc ggcaacatca     1680

tctccaccct gaagtacatg cataaggtgg gtgtcgtcca ctgcgtgaag gacgcacagg     1740

attcctaagc ggccgcaatt cgagctcggt acctcgcgaa tgcatctaga tatcggatcc     1800

cgggcccgtc gactgcagag gcctgcatgc aagcttggcg taatcatggt catagctgtt     1860

tcctgtgtga aattgttatc cgctcacaat tccacacaac atacgagccg gaagcataaa     1920

gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt tgcgctcact     1980

gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc     2040

ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg     2100

ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc     2160

cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag     2220

gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca     2280

tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca     2340

ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg     2400

atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag     2460

gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt     2520

tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca     2580

cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg     2640

cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt     2700

tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc     2760

cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg     2820

cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg     2880

gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta     2940

gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg     3000

gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg     3060

ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc     3120

atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc     3180

agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc     3240

ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag     3300

tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat     3360

ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg     3420

caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt     3480

gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag     3540

atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg     3600

accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt     3660

aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct     3720

gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac     3780

tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat     3840

aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat     3900

ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca     3960

aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat     4020

tatcatgaca ttaacctata aaaataggcg tatcacgagg ccctttcgtc                4070


<210>  159
<211>  8438
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pDMW367-5M1

<400>  159
ggccgcaagt gtggatgggg aagtgagtgc ccggttctgt gtgcacaatt ggcaatccaa       60

gatggatgga ttcaacacag ggatatagcg agctacgtgg tggtgcgagg atatagcaac      120

ggatatttat gtttgacact tgagaatgta cgatacaagc actgtccaag tacaatacta      180

aacatactgt acatactcat actcgtaccc gggcaacggt ttcacttgag tgcagtggct      240

agtgctctta ctcgtacagt gtgcaatact gcgtatcata gtctttgatg tatatcgtat      300

tcattcatgt tagttgcgta cgagccggaa gcataaagtg taaagcctgg ggtgcctaat      360

gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc      420

tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg      480

ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag      540

cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag      600

gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc      660

tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc      720

agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc      780

tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt      840

cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg      900

ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat      960

ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag     1020

ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt     1080

ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc     1140

cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta     1200

gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag     1260

atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga     1320

ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa     1380

gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa     1440

tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc     1500

ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga     1560

taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa     1620

gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt     1680

gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg     1740

ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc     1800

aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg     1860

gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag     1920

cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt     1980

actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt     2040

caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac     2100

gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac     2160

ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag     2220

caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa     2280

tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga     2340

gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc     2400

cccgaaaagt gccacctgac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg     2460

ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct     2520

tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc     2580

ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg     2640

atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt     2700

ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg     2760

tctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc     2820

tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgcttaca atttccattc     2880

gccattcagg ctgcgcaact gttgggaagg gcgatcggtg cgggcctctt cgctattacg     2940

ccagctggcg aaagggggat gtgctgcaag gcgattaagt tgggtaacgc cagggttttc     3000

ccagtcacga cgttgtaaaa cgacggccag tgaattgtaa tacgactcac tatagggcga     3060

attgggtacc gggccccccc tcgaggtcga tggtgtcgat aagcttgata tcgaattcat     3120

gtcacacaaa ccgatcttcg cctcaaggaa acctaattct acatccgaga gactgccgag     3180

atccagtcta cactgattaa ttttcgggcc aataatttaa aaaaatcgtg ttatataata     3240

ttatatgtat tatatatata catcatgatg atactgacag tcatgtccca ttgctaaata     3300

gacagactcc atctgccgcc tccaactgat gttctcaata tttaaggggt catctcgcat     3360

tgtttaataa taaacagact ccatctaccg cctccaaatg atgttctcaa aatatattgt     3420

atgaacttat ttttattact tagtattatt agacaactta cttgctttat gaaaaacact     3480

tcctatttag gaaacaattt ataatggcag ttcgttcatt taacaattta tgtagaataa     3540

atgttataaa tgcgtatggg aaatcttaaa tatggatagc ataaatgata tctgcattgc     3600

ctaattcgaa atcaacagca acgaaaaaaa tcccttgtac aacataaata gtcatcgaga     3660

aatatcaact atcaaagaac agctattcac acgttactat tgagattatt attggacgag     3720

aatcacacac tcaactgtct ttctctcttc tagaaataca ggtacaagta tgtactattc     3780

tcattgttca tacttctagt catttcatcc cacatattcc ttggatttct ctccaatgaa     3840

tgacattcta tcttgcaaat tcaacaatta taataagata taccaaagta gcggtatagt     3900

ggcaatcaaa aagcttctct ggtgtgcttc tcgtatttat ttttattcta atgatccatt     3960

aaaggtatat atttatttct tgttatataa tccttttgtt tattacatgg gctggataca     4020

taaaggtatt ttgatttaat tttttgctta aattcaatcc cccctcgttc agtgtcaact     4080

gtaatggtag gaaattacca tacttttgaa gaagcaaaaa aaatgaaaga aaaaaaaaat     4140

cgtatttcca ggttagacgt tccgcagaat ctagaatgcg gtatgcggta cattgttctt     4200

cgaacgtaaa agttgcgctc cctgagatat tgtacatttt tgcttttaca agtacaagta     4260

catcgtacaa ctatgtacta ctgttgatgc atccacaaca gtttgttttg tttttttttg     4320

tttttttttt ttctaatgat tcattaccgc tatgtatacc tacttgtact tgtagtaagc     4380

cgggttattg gcgttcaatt aatcatagac ttatgaatct gcacggtgtg cgctgcgagt     4440

tacttttagc ttatgcatgc tacttgggtg taatattggg atctgttcgg aaatcaacgg     4500

atgctcaatc gatttcgaca gtaattaatt aagtcataca caagtcagct ttcttcgagc     4560

ctcatataag tataagtagt tcaacgtatt agcactgtac ccagcatctc cgtatcgaga     4620

aacacaacaa catgccccat tggacagatc atgcggatac acaggttgtg cagtatcata     4680

catactcgat cagacaggtc gtctgaccat catacaagct gaacaagcgc tccatacttg     4740

cacgctctct atatacacag ttaaattaca tatccatagt ctaacctcta acagttaatc     4800

ttctggtaag cctcccagcc agccttctgg tatcgcttgg cctcctcaat aggatctcgg     4860

ttctggccgt acagacctcg gccgacaatt atgatatccg ttccggtaga catgacatcc     4920

tcaacagttc ggtactgctg tccgagagcg tctcccttgt cgtcaagacc caccccgggg     4980

gtcagaataa gccagtcctc agagtcgccc ttaggtcggt tctgggcaat gaagccaacc     5040

acaaactcgg ggtcggatcg ggcaagctca atggtctgct tggagtactc gccagtggcc     5100

agagagccct tgcaagacag ctcggccagc atgagcagac ctctggccag cttctcgttg     5160

ggagagggga ctaggaactc cttgtactgg gagttctcgt agtcagagac gtcctccttc     5220

ttctgttcag agacagtttc ctcggcacca gctcgcaggc cagcaatgat tccggttccg     5280

ggtacaccgt gggcgttggt gatatcggac cactcggcga ttcggtgaca ccggtactgg     5340

tgcttgacag tgttgccaat atctgcgaac tttctgtcct cgaacaggaa gaaaccgtgc     5400

ttaagagcaa gttccttgag ggggagcaca gtgccggcgt aggtgaagtc gtcaatgatg     5460

tcgatatggg ttttgatcat gcacacataa ggtccgacct tatcggcaag ctcaatgagc     5520

tccttggtgg tggtaacatc cagagaagca cacaggttgg ttttcttggc tgccacgagc     5580

ttgagcactc gagcggcaaa ggcggacttg tggacgttag ctcgagcttc gtaggagggc     5640

attttggtgg tgaagaggag actgaaataa atttagtctg cagaactttt tatcggaacc     5700

ttatctgggg cagtgaagta tatgttatgg taatagttac gagttagttg aacttataga     5760

tagactggac tatacggcta tcggtccaaa ttagaaagaa cgtcaatggc tctctgggcg     5820

tcgcctttgc cgacaaaaat gtgatcatga tgaaagccag caatgacgtt gcagctgata     5880

ttgttgtcgg ccaaccgcgc cgaaaacgca gctgtcagac ccacagcctc caacgaagaa     5940

tgtatcgtca aagtgatcca agcacactca tagttggagt cgtactccaa aggcggcaat     6000

gacgagtcag acagatactc gtcgactcag gcgacgacgg aattcctgca gcccatctgc     6060

agaattcagg agagaccggg ttggcggcgt atttgtgtcc caaaaaacag ccccaattgc     6120

cccggagaag acggccaggc cgcctagatg acaaattcaa caactcacag ctgactttct     6180

gccattgcca ctaggggggg gcctttttat atggccaagc caagctctcc acgtcggttg     6240

ggctgcaccc aacaataaat gggtagggtt gcaccaacaa agggatggga tggggggtag     6300

aagatacgag gataacgggg ctcaatggca caaataagaa cgaatactgc cattaagact     6360

cgtgatccag cgactgacac cattgcatca tctaagggcc tcaaaactac ctcggaactg     6420

ctgcgctgat ctggacacca cagaggttcc gagcacttta ggttgcacca aatgtcccac     6480

caggtgcagg cagaaaacgc tggaacagcg tgtacagttt gtcttaacaa aaagtgaggg     6540

cgctgaggtc gagcagggtg gtgtgacttg ttatagcctt tagagctgcg aaagcgcgta     6600

tggatttggc tcatcaggcc agattgaggg tctgtggaca catgtcatgt tagtgtactt     6660

caatcgcccc ctggatatag ccccgacaat aggccgtggc ctcatttttt tgccttccgc     6720

acatttccat tgctcggtac ccacaccttg cttctcctgc acttgccaac cttaatactg     6780

gtttacattg accaacatct tacaagcggg gggcttgtct agggtatata taaacagtgg     6840

ctctcccaat cggttgccag tctctttttt cctttctttc cccacagatt cgaaatctaa     6900

actacacatc acacaatgcc tgttactgac gtccttaagc gaaagtccgg tgtcatcgtc     6960

ggcgacgatg tccgagccgt gagtatccac gacaagatca gtgtcgagac gacgcgtttt     7020

gtgtaatgac acaatccgaa agtcgctagc aacacacact ctctacacaa actaacccag     7080

ctctccatgg ccctgtctct cactaccgaa cagctcctgg agcgacctga tctcgtcgct     7140

atcgatggta tcctgtacga cctcgagggc ctggccaagg tgcatggtgg tggagacctc     7200

attctggcct ctggagcctc cgacgcctct cccctcttct actctatgca tccctacgtc     7260

aagcccgaga actccaagct cctgcagcaa ttcgtccgag ggaagcatga ccgcacctcg     7320

aaggacattg tctacacgta tgattctccc ttcgcacaag acgttaagcg gacaatgcgc     7380

gaggtgatga aagggaggaa ctggtacgca acccctggct tctggctgcg caccgttggg     7440

atcatcgccg tgacggcctt ttgcgagtgg cactgggcta ccacggggat ggtgctgtgg     7500

ggcctgttga ctggattcat gcacatgcag atcggcttat ccatccagca tgatgcgggt     7560

cacggggcca tcagcaagaa gccttgggtc aacgccctct tcgcctacgg cattgacgtc     7620

atcggatcgt cccggtggat ttggctgcag tcgcacatca tgcggcacca cacctacacc     7680

aaccagcacg gcctcgacct ggatgcggag tcggcagagc cgttcctggt gttccacaac     7740

taccccgccg caaacaccgc ccgaaagtgg ttccaccgct tccaggcttg gtacatgtac     7800

cttgtgctgg gggcatacgg ggtatcgctg gtgtacaacc cgctctacat tttccggatg     7860

cagcacaatg acaccatccc agagtctgtc acggccatgc gggaaaatgg ctttctgcgg     7920

cgctaccgca cacttgcatt cgtgatgcga gctttcttca tcttccggac cgcattcttg     7980

ccctggtacc tcactgggac ctcattgctg atcaccattc ctctggtgcc caccgcaact     8040

ggtgccttct tgacgttctt cttcattttg tcccacaatt ttgatggctc cgaacggatc     8100

cccgacaaga actgcaaggt taagagctct gagaaggacg ttgaggctga ccaaattgac     8160

tggtatcggg cgcaggtgga gacgtcctcc acatacggtg gccccatcgc catgttcttc     8220

actggcggtc tcaatttcca gatcgagcac cacctctttc cccggatgtc gtcttggcac     8280

taccccttcg tccagcaggc ggtccgggag tgttgcgaac gacatggagt gcgatatgtt     8340

ttctacccta ccatcgtcgg caacatcatc tccaccctga agtacatgca taaggtgggt     8400

gtcgtccact gcgtgaagga cgcacaggat tcctaagc                             8438


<210>  160
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  160

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  161
<211>  8438
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pDMW369S

<400>  161
ggccgcaagt gtggatgggg aagtgagtgc ccggttctgt gtgcacaatt ggcaatccaa       60

gatggatgga ttcaacacag ggatatagcg agctacgtgg tggtgcgagg atatagcaac      120

ggatatttat gtttgacact tgagaatgta cgatacaagc actgtccaag tacaatacta      180

aacatactgt acatactcat actcgtaccc gggcaacggt ttcacttgag tgcagtggct      240

agtgctctta ctcgtacagt gtgcaatact gcgtatcata gtctttgatg tatatcgtat      300

tcattcatgt tagttgcgta cgagccggaa gcataaagtg taaagcctgg ggtgcctaat      360

gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc      420

tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg      480

ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag      540

cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag      600

gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc      660

tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc      720

agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc      780

tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt      840

cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg      900

ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat      960

ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag     1020

ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt     1080

ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc     1140

cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta     1200

gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag     1260

atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga     1320

ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa     1380

gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa     1440

tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc     1500

ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga     1560

taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa     1620

gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt     1680

gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg     1740

ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc     1800

aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg     1860

gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag     1920

cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt     1980

actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt     2040

caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac     2100

gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac     2160

ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag     2220

caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa     2280

tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga     2340

gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc     2400

cccgaaaagt gccacctgac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg     2460

ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct     2520

tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc     2580

ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg     2640

atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt     2700

ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg     2760

tctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc     2820

tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgcttaca atttccattc     2880

gccattcagg ctgcgcaact gttgggaagg gcgatcggtg cgggcctctt cgctattacg     2940

ccagctggcg aaagggggat gtgctgcaag gcgattaagt tgggtaacgc cagggttttc     3000

ccagtcacga cgttgtaaaa cgacggccag tgaattgtaa tacgactcac tatagggcga     3060

attgggtacc gggccccccc tcgaggtcga tggtgtcgat aagcttgata tcgaattcat     3120

gtcacacaaa ccgatcttcg cctcaaggaa acctaattct acatccgaga gactgccgag     3180

atccagtcta cactgattaa ttttcgggcc aataatttaa aaaaatcgtg ttatataata     3240

ttatatgtat tatatatata catcatgatg atactgacag tcatgtccca ttgctaaata     3300

gacagactcc atctgccgcc tccaactgat gttctcaata tttaaggggt catctcgcat     3360

tgtttaataa taaacagact ccatctaccg cctccaaatg atgttctcaa aatatattgt     3420

atgaacttat ttttattact tagtattatt agacaactta cttgctttat gaaaaacact     3480

tcctatttag gaaacaattt ataatggcag ttcgttcatt taacaattta tgtagaataa     3540

atgttataaa tgcgtatggg aaatcttaaa tatggatagc ataaatgata tctgcattgc     3600

ctaattcgaa atcaacagca acgaaaaaaa tcccttgtac aacataaata gtcatcgaga     3660

aatatcaact atcaaagaac agctattcac acgttactat tgagattatt attggacgag     3720

aatcacacac tcaactgtct ttctctcttc tagaaataca ggtacaagta tgtactattc     3780

tcattgttca tacttctagt catttcatcc cacatattcc ttggatttct ctccaatgaa     3840

tgacattcta tcttgcaaat tcaacaatta taataagata taccaaagta gcggtatagt     3900

ggcaatcaaa aagcttctct ggtgtgcttc tcgtatttat ttttattcta atgatccatt     3960

aaaggtatat atttatttct tgttatataa tccttttgtt tattacatgg gctggataca     4020

taaaggtatt ttgatttaat tttttgctta aattcaatcc cccctcgttc agtgtcaact     4080

gtaatggtag gaaattacca tacttttgaa gaagcaaaaa aaatgaaaga aaaaaaaaat     4140

cgtatttcca ggttagacgt tccgcagaat ctagaatgcg gtatgcggta cattgttctt     4200

cgaacgtaaa agttgcgctc cctgagatat tgtacatttt tgcttttaca agtacaagta     4260

catcgtacaa ctatgtacta ctgttgatgc atccacaaca gtttgttttg tttttttttg     4320

tttttttttt ttctaatgat tcattaccgc tatgtatacc tacttgtact tgtagtaagc     4380

cgggttattg gcgttcaatt aatcatagac ttatgaatct gcacggtgtg cgctgcgagt     4440

tacttttagc ttatgcatgc tacttgggtg taatattggg atctgttcgg aaatcaacgg     4500

atgctcaatc gatttcgaca gtaattaatt aagtcataca caagtcagct ttcttcgagc     4560

ctcatataag tataagtagt tcaacgtatt agcactgtac ccagcatctc cgtatcgaga     4620

aacacaacaa catgccccat tggacagatc atgcggatac acaggttgtg cagtatcata     4680

catactcgat cagacaggtc gtctgaccat catacaagct gaacaagcgc tccatacttg     4740

cacgctctct atatacacag ttaaattaca tatccatagt ctaacctcta acagttaatc     4800

ttctggtaag cctcccagcc agccttctgg tatcgcttgg cctcctcaat aggatctcgg     4860

ttctggccgt acagacctcg gccgacaatt atgatatccg ttccggtaga catgacatcc     4920

tcaacagttc ggtactgctg tccgagagcg tctcccttgt cgtcaagacc caccccgggg     4980

gtcagaataa gccagtcctc agagtcgccc ttaggtcggt tctgggcaat gaagccaacc     5040

acaaactcgg ggtcggatcg ggcaagctca atggtctgct tggagtactc gccagtggcc     5100

agagagccct tgcaagacag ctcggccagc atgagcagac ctctggccag cttctcgttg     5160

ggagagggga ctaggaactc cttgtactgg gagttctcgt agtcagagac gtcctccttc     5220

ttctgttcag agacagtttc ctcggcacca gctcgcaggc cagcaatgat tccggttccg     5280

ggtacaccgt gggcgttggt gatatcggac cactcggcga ttcggtgaca ccggtactgg     5340

tgcttgacag tgttgccaat atctgcgaac tttctgtcct cgaacaggaa gaaaccgtgc     5400

ttaagagcaa gttccttgag ggggagcaca gtgccggcgt aggtgaagtc gtcaatgatg     5460

tcgatatggg ttttgatcat gcacacataa ggtccgacct tatcggcaag ctcaatgagc     5520

tccttggtgg tggtaacatc cagagaagca cacaggttgg ttttcttggc tgccacgagc     5580

ttgagcactc gagcggcaaa ggcggacttg tggacgttag ctcgagcttc gtaggagggc     5640

attttggtgg tgaagaggag actgaaataa atttagtctg cagaactttt tatcggaacc     5700

ttatctgggg cagtgaagta tatgttatgg taatagttac gagttagttg aacttataga     5760

tagactggac tatacggcta tcggtccaaa ttagaaagaa cgtcaatggc tctctgggcg     5820

tcgcctttgc cgacaaaaat gtgatcatga tgaaagccag caatgacgtt gcagctgata     5880

ttgttgtcgg ccaaccgcgc cgaaaacgca gctgtcagac ccacagcctc caacgaagaa     5940

tgtatcgtca aagtgatcca agcacactca tagttggagt cgtactccaa aggcggcaat     6000

gacgagtcag acagatactc gtcgactcag gcgacgacgg aattcctgca gcccatctgc     6060

agaattcagg agagaccggg ttggcggcgt atttgtgtcc caaaaaacag ccccaattgc     6120

cccggagaag acggccaggc cgcctagatg acaaattcaa caactcacag ctgactttct     6180

gccattgcca ctaggggggg gcctttttat atggccaagc caagctctcc acgtcggttg     6240

ggctgcaccc aacaataaat gggtagggtt gcaccaacaa agggatggga tggggggtag     6300

aagatacgag gataacgggg ctcaatggca caaataagaa cgaatactgc cattaagact     6360

cgtgatccag cgactgacac cattgcatca tctaagggcc tcaaaactac ctcggaactg     6420

ctgcgctgat ctggacacca cagaggttcc gagcacttta ggttgcacca aatgtcccac     6480

caggtgcagg cagaaaacgc tggaacagcg tgtacagttt gtcttaacaa aaagtgaggg     6540

cgctgaggtc gagcagggtg gtgtgacttg ttatagcctt tagagctgcg aaagcgcgta     6600

tggatttggc tcatcaggcc agattgaggg tctgtggaca catgtcatgt tagtgtactt     6660

caatcgcccc ctggatatag ccccgacaat aggccgtggc ctcatttttt tgccttccgc     6720

acatttccat tgctcggtac ccacaccttg cttctcctgc acttgccaac cttaatactg     6780

gtttacattg accaacatct tacaagcggg gggcttgtct agggtatata taaacagtgg     6840

ctctcccaat cggttgccag tctctttttt cctttctttc cccacagatt cgaaatctaa     6900

actacacatc acacaatgcc tgttactgac gtccttaagc gaaagtccgg tgtcatcgtc     6960

ggcgacgatg tccgagccgt gagtatccac gacaagatca gtgtcgagac gacgcgtttt     7020

gtgtaatgac acaatccgaa agtcgctagc aacacacact ctctacacaa actaacccag     7080

ctctccatgg ctctctccct tactaccgag cagctgctcg agcgacccga cctggttgcc     7140

atcgacggca ttctctacga tctggaaggt cttgccaagg tccatcccgg atccgacttg     7200

atcctcgctt ctggtgcctc cgatgcttct cctctgttct actccatgca cccttacgtc     7260

aagcccgaga actcgaagct gcttcaacag ttcgtgcgag gcaagcacga ccgaacctcc     7320

aaggacattg tctacaccta cgactctccc tttgcacagg acgtcaagcg aactatgcga     7380

gaggtcatga aaggtcggaa ctggtatgcc acacctggat tctggctgcg aaccgttggc     7440

atcattgctg tcaccgcctt ttgcgagtgg cactgggcta ctaccggaat ggtgctgtgg     7500

ggtctcttga ctggattcat gcacatgcag atcggcctgt ccattcagca cgatgcctct     7560

catggtgcca tcagcaaaaa gccctgggtc aacgctctct ttgcctacgg catcgacgtc     7620

attggatcgt ccagatggat ctggctgcag tctcacatca tgcgacatca cacctacacc     7680

aatcagcatg gtctcgacct ggatgccgag tccgcagaac cattccttgt gttccacaac     7740

taccctgctg ccaacactgc tcgaaagtgg tttcaccgat tccaggcctg gtacatgtac     7800

ctcgtgcttg gagcctacgg cgtttcgctg gtgtacaacc ctctctacat cttccgaatg     7860

cagcacaacg acaccattcc cgagtctgtc acagccatgc gagagaacgg ctttctgcga     7920

cggtaccgaa cccttgcatt cgttatgcga gctttcttca tctttcgaac cgccttcttg     7980

ccctggtatc tcactggaac ctccctgctc atcaccattc ctctggtgcc cactgctacc     8040

ggtgccttcc tcaccttctt tttcatcttg tctcacaact tcgatggctc ggagcgaatc     8100

cccgacaaga actgcaaggt caagagctcc gagaaggacg ttgaagccga tcagatcgac     8160

tggtacagag ctcaggtgga gacctcttcc acctacggtg gacccattgc catgttcttt     8220

actggcggtc tcaacttcca gatcgagcat cacctctttc ctcgaatgtc gtcttggcac     8280

tatcccttcg tgcagcaagc tgtccgagag tgttgcgaac gacacggagt tcggtacgtc     8340

ttctacccta ccattgtggg caacatcatt tccaccctca agtacatgca caaagtcggt     8400

gtggttcact gtgtcaagga cgctcaggat tcctaagc                             8438


<210>  162
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  162
cctgtccatt cagcacgatt tctctcatgg tgccatcag                              39


<210>  163
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  163
ctgatggcac catgagagaa atcgtgctga atggacagg                              39


<210>  164
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  164
cctgtccatt cagcacgata tgtctcatgg tgccatcag                              39


<210>  165
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  165
ctgatggcac catgagacat atcgtgctga atggacagg                              39


<210>  166
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  166
gtccattcag cacgatggtt ctcatggtgc catcag                                 36


<210>  167
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  167
ctgatggcac catgagaacc atcgtgctga atggac                                 36


<210>  168
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  168
gtccattcag cacgattcct ctcatggtgc c                                      31


<210>  169
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  169
ggcaccatga gaggaatcgt gctgaatgga c                                      31


<210>  170
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  170
cattcagcac gatgccgctc atggtgccat cag                                    33


<210>  171
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  171
ctgatggcac catgagcggc atcgtgctga atg                                    33


<210>  172
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  172
ctgtccattc agcacgatgc caaccatggt gccatcagca aaaag                       45


<210>  173
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  173
ctttttgctg atggcaccat ggttggcatc gtgctgaatg gacag                       45


<210>  174
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  174
cattcagcac gatgccaccc atggtgccat cagca                                  35


<210>  175
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  175
tgctgatggc accatgggtg gcatcgtgct gaatg                                  35


<210>  176
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  176
cattcagcac gatgccggtc atggtgccat cagc                                   34


<210>  177
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  177
gctgatggca ccatgaccgg catcgtgctg aatg                                   34


<210>  178
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  178
ctgtccattc agcacgaggc ctctcatggt g                                      31


<210>  179
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  179
caccatgaga ggcctcgtgc tgaatggaca g                                      31


<210>  180
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)

<400>  180
atg gct ctc tcc ctt act acc gag cag ctg ctc gag cga ccc gac ctg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcc atc gac ggc att ctc tac gat ctg gaa ggt ctt gcc aag gtc         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat ccc gga tcc gac ttg atc ctc gct tct ggt gcc tcc gat gct tct        144
His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctg ttc tac tcc atg cac cct tac gtc aag ccc gag aac tcg aag        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ctg ctt caa cag ttc gtg cga ggc aag cac gac cga acc tcc aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acc tac gac tct ccc ttt gca cag gac gtc aag cga act        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cga gag gtc atg aaa ggt cgg aac tgg tat gcc aca cct gga ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cga acc gtt ggc atc att gct gtc acc gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct act acc gga atg gtg ctg tgg ggt ctc ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc ctg tcc att cag cac gag gcc tct cat ggt        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Glu Ala Ser His Gly           
145                 150                 155                 160           

gcc atc agc aaa aag ccc tgg gtc aac gct ctc ttt gcc tac ggc atc        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc att gga tcg tcc aga tgg atc tgg ctg cag tct cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cga cat cac acc tac acc aat cag cat ggt ctc gac ctg gat gcc gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcc gca gaa cca ttc ctt gtg ttc cac aac tac cct gct gcc aac act        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gct cga aag tgg ttt cac cga ttc cag gcc tgg tac atg tac ctc gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctt gga gcc tac ggc gtt tcg ctg gtg tac aac cct ctc tac atc ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cga atg cag cac aac gac acc att ccc gag tct gtc aca gcc atg cga        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gag aac ggc ttt ctg cga cgg tac cga acc ctt gca ttc gtt atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttt cga acc gcc ttc ttg ccc tgg tat ctc act gga        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tcc ctg ctc atc acc att cct ctg gtg ccc act gct acc ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ctc acc ttc ttt ttc atc ttg tct cac aac ttc gat ggc tcg gag       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cga atc ccc gac aag aac tgc aag gtc aag agc tcc gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val           
            340                 345                 350                   

gaa gcc gat cag atc gac tgg tac aga gct cag gtg gag acc tct tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

acc tac ggt gga ccc att gcc atg ttc ttt act ggc ggt ctc aac ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cat cac ctc ttt cct cga atg tcg tct tgg cac tat ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtg cag caa gct gtc cga gag tgt tgc gaa cga cac gga gtt cgg       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tac gtc ttc tac cct acc att gtg ggc aac atc att tcc acc ctc aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cac aaa gtc ggt gtg gtt cac tgt gtc aag gac gct cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  181
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  181

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Glu Ala Ser His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  182
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)
<223>  EgD5S-36s157g mutant delta-5 desaturase

<400>  182
atg gct ctc tcc ctt act acc gag cag ctg ctc gag cga ccc gac ctg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcc atc gac ggc att ctc tac gat ctg gaa ggt ctt gcc aag gtc         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat ccc gga tcc gac ttg atc ctc gct tct ggt gcc tcc gat gct tct        144
His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctg ttc tac tcc atg cac cct tac gtc aag ccc gag aac tcg aag        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ctg ctt caa cag ttc gtg cga ggc aag cac gac cga acc tcc aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acc tac gac tct ccc ttt gca cag gac gtc aag cga act        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cga gag gtc atg aaa ggt cgg aac tgg tat gcc aca cct gga ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cga acc gtt ggc atc att gct gtc acc gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct act acc gga atg gtg ctg tgg ggt ctc ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc ctg tcc att cag cac gat ggt tct cat ggt        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Gly Ser His Gly           
145                 150                 155                 160           

gcc atc agc aaa aag ccc tgg gtc aac gct ctc ttt gcc tac ggc atc        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc att gga tcg tcc aga tgg atc tgg ctg cag tct cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cga cat cac acc tac acc aat cag cat ggt ctc gac ctg gat gcc gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcc gca gaa cca ttc ctt gtg ttc cac aac tac cct gct gcc aac act        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gct cga aag tgg ttt cac cga ttc cag gcc tgg tac atg tac ctc gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctt gga gcc tac ggc gtt tcg ctg gtg tac aac cct ctc tac atc ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cga atg cag cac aac gac acc att ccc gag tct gtc aca gcc atg cga        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gag aac ggc ttt ctg cga cgg tac cga acc ctt gca ttc gtt atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttt cga acc gcc ttc ttg ccc tgg tat ctc act gga        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tcc ctg ctc atc acc att cct ctg gtg ccc act gct acc ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ctc acc ttc ttt ttc atc ttg tct cac aac ttc gat ggc tcg gag       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cga atc ccc gac aag aac tgc aag gtc aag agc tcc gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val           
            340                 345                 350                   

gaa gcc gat cag atc gac tgg tac aga gct cag gtg gag acc tct tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

acc tac ggt gga ccc att gcc atg ttc ttt act ggc ggt ctc aac ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cat cac ctc ttt cct cga atg tcg tct tgg cac tat ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtg cag caa gct gtc cga gag tgt tgc gaa cga cac gga gtt cgg       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tac gtc ttc tac cct acc att gtg ggc aac atc att tcc acc ctc aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cac aaa gtc ggt gtg gtt cac tgt gtc aag gac gct cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  183
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  183

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Gly Ser His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  184
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)

<400>  184
atg gct ctc tcc ctt act acc gag cag ctg ctc gag cga ccc gac ctg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcc atc gac ggc att ctc tac gat ctg gaa ggt ctt gcc aag gtc         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat ccc gga tcc gac ttg atc ctc gct tct ggt gcc tcc gat gct tct        144
His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctg ttc tac tcc atg cac cct tac gtc aag ccc gag aac tcg aag        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ctg ctt caa cag ttc gtg cga ggc aag cac gac cga acc tcc aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acc tac gac tct ccc ttt gca cag gac gtc aag cga act        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cga gag gtc atg aaa ggt cgg aac tgg tat gcc aca cct gga ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cga acc gtt ggc atc att gct gtc acc gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct act acc gga atg gtg ctg tgg ggt ctc ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc ctg tcc att cag cac gat gcc gct cat ggt        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ala His Gly           
145                 150                 155                 160           

gcc atc agc aaa aag ccc tgg gtc aac gct ctc ttt gcc tac ggc atc        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc att gga tcg tcc aga tgg atc tgg ctg cag tct cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cga cat cac acc tac acc aat cag cat ggt ctc gac ctg gat gcc gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcc gca gaa cca ttc ctt gtg ttc cac aac tac cct gct gcc aac act        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gct cga aag tgg ttt cac cga ttc cag gcc tgg tac atg tac ctc gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctt gga gcc tac ggc gtt tcg ctg gtg tac aac cct ctc tac atc ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cga atg cag cac aac gac acc att ccc gag tct gtc aca gcc atg cga        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gag aac ggc ttt ctg cga cgg tac cga acc ctt gca ttc gtt atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttt cga acc gcc ttc ttg ccc tgg tat ctc act gga        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tcc ctg ctc atc acc att cct ctg gtg ccc act gct acc ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ctc acc ttc ttt ttc atc ttg tct cac aac ttc gat ggc tcg gag       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cga atc ccc gac aag aac tgc aag gtc aag agc tcc gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val           
            340                 345                 350                   

gaa gcc gat cag atc gac tgg tac aga gct cag gtg gag acc tct tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

acc tac ggt gga ccc att gcc atg ttc ttt act ggc ggt ctc aac ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cat cac ctc ttt cct cga atg tcg tct tgg cac tat ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtg cag caa gct gtc cga gag tgt tgc gaa cga cac gga gtt cgg       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tac gtc ttc tac cct acc att gtg ggc aac atc att tcc acc ctc aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cac aaa gtc ggt gtg gtt cac tgt gtc aag gac gct cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  185
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  185

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ala His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  186
<211>  1350
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1350)

<400>  186
atg gct ctc tcc ctt act acc gag cag ctg ctc gag cga ccc gac ctg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcc atc gac ggc att ctc tac gat ctg gaa ggt ctt gcc aag gtc         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat ccc gga tcc gac ttg atc ctc gct tct ggt gcc tcc gat gct tct        144
His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctg ttc tac tcc atg cac cct tac gtc aag ccc gag aac tcg aag        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ctg ctt caa cag ttc gtg cga ggc aag cac gac cga acc tcc aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acc tac gac tct ccc ttt gca cag gac gtc aag cga act        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cga gag gtc atg aaa ggt cgg aac tgg tat gcc aca cct gga ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cga acc gtt ggc atc att gct gtc acc gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct act acc gga atg gtg ctg tgg ggt ctc ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc ctg tcc att cag cac gat gcc ggt cat ggt        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly           
145                 150                 155                 160           

gcc atc agc aaa aag ccc tgg gtc aac gct ctc ttt gcc tac ggc atc        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc att gga tcg tcc aga tgg atc tgg ctg cag tct cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cga cat cac acc tac acc aat cag cat ggt ctc gac ctg gat gcc gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcc gca gaa cca ttc ctt gtg ttc cac aac tac cct gct gcc aac act        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gct cga aag tgg ttt cac cga ttc cag gcc tgg tac atg tac ctc gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctt gga gcc tac ggc gtt tcg ctg gtg tac aac cct ctc tac atc ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cga atg cag cac aac gac acc att ccc gag tct gtc aca gcc atg cga        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gag aac ggc ttt ctg cga cgg tac cga acc ctt gca ttc gtt atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttt cga acc gcc ttc ttg ccc tgg tat ctc act gga        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tcc ctg ctc atc acc att cct ctg gtg ccc act gct acc ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ctc acc ttc ttt ttc atc ttg tct cac aac ttc gat ggc tcg gag       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cga atc ccc gac aag aac tgc aag gtc aag agc tcc gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val           
            340                 345                 350                   

gaa gcc gat cag atc gac tgg tac aga gct cag gtg gag acc tct tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

acc tac ggt gga ccc att gcc atg ttc ttt act ggc ggt ctc aac ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cat cac ctc ttt cct cga atg tcg tct tgg cac tat ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtg cag caa gct gtc cga gag tgt tgc gaa cga cac gga gtt cgg       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tac gtc ttc tac cct acc att gtg ggc aac atc att tcc acc ctc aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cac aaa gtc ggt gtg gtt cac tgt gtc aag gac gct cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc taa                                                               1350
Ser                                                                       
                                                                          


<210>  187
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  187

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Pro Gly Ser Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  188
<211>  454
<212>  PRT
<213>  Euglena anabaena

<400>  188

Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu 
1               5                   10                  15      


Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys 
            20                  25                  30          


Val His Ala Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly 
        35                  40                  45              


Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser 
    50                  55                  60                  


Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys 
65                  70                  75                  80  


Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln 
                85                  90                  95      


Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly 
            100                 105                 110         


Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu 
        115                 120                 125             


Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly 
    130                 135                 140                 


Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Ser His 
145                 150                 155                 160 


Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly 
                165                 170                 175     


Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile 
            180                 185                 190         


Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala 
        195                 200                 205             


Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn 
    210                 215                 220                 


Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile 
225                 230                 235                 240 


Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu 
                245                 250                 255     


Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg 
            260                 265                 270         


Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg 
        275                 280                 285             


Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly 
    290                 295                 300                 


Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile 
305                 310                 315                 320 


Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu 
                325                 330                 335     


Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys 
            340                 345                 350         


Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln 
        355                 360                 365             


Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr 
    370                 375                 380                 


Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser 
385                 390                 395                 400 


Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys 
                405                 410                 415     


Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile 
            420                 425                 430         


Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile 
        435                 440                 445             


Gln Asp Ala Gln Glu Phe 
    450                 


<210>  189
<211>  8357
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pZuFmEaD5S-A(S)

<400>  189
catggccacc atctccctga ctaccgagca gctcctggaa caccccgagc tcgttgccat       60

cgacggagtc ctgtacgatc tcttcggtct ggccaaggtc catgccggag gcaacctcat      120

cgaagctgcc ggtgcatccg acggaaccgc tctgttctac tccatgcatc ctggagtcaa      180

gccagagaac tcgaagcttc tgcagcaatt tgcccgaggc aagcacgaac gaagctccaa      240

ggatcccgtg tacaccttcg actctccctt tgctcaggac gtcaagcagt ccgttcgaga      300

ggtcatgaag ggtcgaaact ggtacgccac tcctggcttc tggctgagaa ccgcactcat      360

catcgcttgt actgccattg gcgagtggta ctggatcaca accggagcag tgatgtgggg      420

tatctttact ggatacttcc actcgcagat tggcttggcc attcaacacg atgcttctca      480

cggagccatc agcaaaaagc cctgggtcaa cgcctttttc gcttatggca tcgacgccat      540

tggttcctct cgttggatct ggctgcagtc ccacattatg cgacatcaca cttacaccaa      600

ccagcatggc ctcgacctgg atgctgcctc ggcagagccg ttcatcttgt tccactccta      660

tcctgctacc aacgcctctc gaaagtggta ccaccgattt caggcgtggt acatgtacat      720

cgttctggga atgtatggtg tctcgatggt gtacaatccc atgtacctct tcacaatgca      780

gcacaacgac accattcccg aggccacttc tctcagacca ggcagctttt tcaatcggca      840

gcgagctttc gccgtttccc ttcgactgct cttcatcttc cgaaacgcct ttcttccctg      900

gtacattgct ggtgcctctc ctctgctcac cattcttctg gtgcccacgg tcacaggcat      960

cttcctcacc tttgtgttcg ttctgtccca taacttcgag ggagccgaac ggaccccaga     1020

gaagaactgc aaggccaaac gagctaagga aggcaaggag gtcagagacg tggaagagga     1080

tcgagtcgac tggtaccgag cacaggccga gactgctgcc acctacggtg gcagcgtggg     1140

aatgatgctt acaggcggtc tcaacctgca gatcgagcat cacttgtttc cccgaatgtc     1200

ctcttggcac tatcccttca ttcaagacac cgttcgggag tgttgcaagc gacatggcgt     1260

ccgttacaca tactatccta ccattctcga gaacatcatg tccactcttc gatacatgca     1320

gaaggtgggt gttgctcaca ccattcagga tgcccaggag ttctaagcgg ccgcaagtgt     1380

ggatggggaa gtgagtgccc ggttctgtgt gcacaattgg caatccaaga tggatggatt     1440

caacacaggg atatagcgag ctacgtggtg gtgcgaggat atagcaacgg atatttatgt     1500

ttgacacttg agaatgtacg atacaagcac tgtccaagta caatactaaa catactgtac     1560

atactcatac tcgtacccgg gcaacggttt cacttgagtg cagtggctag tgctcttact     1620

cgtacagtgt gcaatactgc gtatcatagt ctttgatgta tatcgtattc attcatgtta     1680

gttgcgtacg agccggaagc ataaagtgta aagcctgggg tgcctaatga gtgagctaac     1740

tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg tcgtgccagc     1800

tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg     1860

cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc     1920

actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt     1980

gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc     2040

ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa     2100

acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc     2160

ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg     2220

cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc     2280

tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc     2340

gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca     2400

ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact     2460

acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg     2520

gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt     2580

ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct     2640

tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga     2700

gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa     2760

tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac     2820

ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga     2880

taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc     2940

cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca     3000

gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta     3060

gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg     3120

tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc     3180

gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg     3240

ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt     3300

ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt     3360

cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata     3420

ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc     3480

gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac     3540

ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa     3600

ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct     3660

tcctttttca atattattga agcatttatc agggttattg tctcatgagc ggatacatat     3720

ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc     3780

cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg     3840

tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc     3900

tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc     3960

gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta     4020

gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta     4080

atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg     4140

atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa     4200

aatttaacgc gaattttaac aaaatattaa cgcttacaat ttccattcgc cattcaggct     4260

gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa     4320

agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg     4380

ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat tgggtaccgg     4440

gccccccctc gaggtcgatg gtgtcgataa gcttgatatc gaattcatgt cacacaaacc     4500

gatcttcgcc tcaaggaaac ctaattctac atccgagaga ctgccgagat ccagtctaca     4560

ctgattaatt ttcgggccaa taatttaaaa aaatcgtgtt atataatatt atatgtatta     4620

tatatataca tcatgatgat actgacagtc atgtcccatt gctaaataga cagactccat     4680

ctgccgcctc caactgatgt tctcaatatt taaggggtca tctcgcattg tttaataata     4740

aacagactcc atctaccgcc tccaaatgat gttctcaaaa tatattgtat gaacttattt     4800

ttattactta gtattattag acaacttact tgctttatga aaaacacttc ctatttagga     4860

aacaatttat aatggcagtt cgttcattta acaatttatg tagaataaat gttataaatg     4920

cgtatgggaa atcttaaata tggatagcat aaatgatatc tgcattgcct aattcgaaat     4980

caacagcaac gaaaaaaatc ccttgtacaa cataaatagt catcgagaaa tatcaactat     5040

caaagaacag ctattcacac gttactattg agattattat tggacgagaa tcacacactc     5100

aactgtcttt ctctcttcta gaaatacagg tacaagtatg tactattctc attgttcata     5160

cttctagtca tttcatccca catattcctt ggatttctct ccaatgaatg acattctatc     5220

ttgcaaattc aacaattata ataagatata ccaaagtagc ggtatagtgg caatcaaaaa     5280

gcttctctgg tgtgcttctc gtatttattt ttattctaat gatccattaa aggtatatat     5340

ttatttcttg ttatataatc cttttgttta ttacatgggc tggatacata aaggtatttt     5400

gatttaattt tttgcttaaa ttcaatcccc cctcgttcag tgtcaactgt aatggtagga     5460

aattaccata cttttgaaga agcaaaaaaa atgaaagaaa aaaaaaatcg tatttccagg     5520

ttagacgttc cgcagaatct agaatgcggt atgcggtaca ttgttcttcg aacgtaaaag     5580

ttgcgctccc tgagatattg tacatttttg cttttacaag tacaagtaca tcgtacaact     5640

atgtactact gttgatgcat ccacaacagt ttgttttgtt tttttttgtt tttttttttt     5700

ctaatgattc attaccgcta tgtataccta cttgtacttg tagtaagccg ggttattggc     5760

gttcaattaa tcatagactt atgaatctgc acggtgtgcg ctgcgagtta cttttagctt     5820

atgcatgcta cttgggtgta atattgggat ctgttcggaa atcaacggat gctcaatcga     5880

tttcgacagt aattaattaa gtcatacaca agtcagcttt cttcgagcct catataagta     5940

taagtagttc aacgtattag cactgtaccc agcatctccg tatcgagaaa cacaacaaca     6000

tgccccattg gacagatcat gcggatacac aggttgtgca gtatcataca tactcgatca     6060

gacaggtcgt ctgaccatca tacaagctga acaagcgctc catacttgca cgctctctat     6120

atacacagtt aaattacata tccatagtct aacctctaac agttaatctt ctggtaagcc     6180

tcccagccag ccttctggta tcgcttggcc tcctcaatag gatctcggtt ctggccgtac     6240

agacctcggc cgacaattat gatatccgtt ccggtagaca tgacatcctc aacagttcgg     6300

tactgctgtc cgagagcgtc tcccttgtcg tcaagaccca ccccgggggt cagaataagc     6360

cagtcctcag agtcgccctt aggtcggttc tgggcaatga agccaaccac aaactcgggg     6420

tcggatcggg caagctcaat ggtctgcttg gagtactcgc cagtggccag agagcccttg     6480

caagacagct cggccagcat gagcagacct ctggccagct tctcgttggg agaggggact     6540

aggaactcct tgtactggga gttctcgtag tcagagacgt cctccttctt ctgttcagag     6600

acagtttcct cggcaccagc tcgcaggcca gcaatgattc cggttccggg tacaccgtgg     6660

gcgttggtga tatcggacca ctcggcgatt cggtgacacc ggtactggtg cttgacagtg     6720

ttgccaatat ctgcgaactt tctgtcctcg aacaggaaga aaccgtgctt aagagcaagt     6780

tccttgaggg ggagcacagt gccggcgtag gtgaagtcgt caatgatgtc gatatgggtt     6840

ttgatcatgc acacataagg tccgacctta tcggcaagct caatgagctc cttggtggtg     6900

gtaacatcca gagaagcaca caggttggtt ttcttggctg ccacgagctt gagcactcga     6960

gcggcaaagg cggacttgtg gacgttagct cgagcttcgt aggagggcat tttggtggtg     7020

aagaggagac tgaaataaat ttagtctgca gaacttttta tcggaacctt atctggggca     7080

gtgaagtata tgttatggta atagttacga gttagttgaa cttatagata gactggacta     7140

tacggctatc ggtccaaatt agaaagaacg tcaatggctc tctgggcgtc gcctttgccg     7200

acaaaaatgt gatcatgatg aaagccagca atgacgttgc agctgatatt gttgtcggcc     7260

aaccgcgccg aaaacgcagc tgtcagaccc acagcctcca acgaagaatg tatcgtcaaa     7320

gtgatccaag cacactcata gttggagtcg tactccaaag gcggcaatga cgagtcagac     7380

agatactcgt cgacgtttaa acagtgtacg cagatctact atagaggaac atttaaattg     7440

ccccggagaa gacggccagg ccgcctagat gacaaattca acaactcaca gctgactttc     7500

tgccattgcc actagggggg ggccttttta tatggccaag ccaagctctc cacgtcggtt     7560

gggctgcacc caacaataaa tgggtagggt tgcaccaaca aagggatggg atggggggta     7620

gaagatacga ggataacggg gctcaatggc acaaataaga acgaatactg ccattaagac     7680

tcgtgatcca gcgactgaca ccattgcatc atctaagggc ctcaaaacta cctcggaact     7740

gctgcgctga tctggacacc acagaggttc cgagcacttt aggttgcacc aaatgtccca     7800

ccaggtgcag gcagaaaacg ctggaacagc gtgtacagtt tgtcttaaca aaaagtgagg     7860

gcgctgaggt cgagcagggt ggtgtgactt gttatagcct ttagagctgc gaaagcgcgt     7920

atggatttgg ctcatcaggc cagattgagg gtctgtggac acatgtcatg ttagtgtact     7980

tcaatcgccc cctggatata gccccgacaa taggccgtgg cctcattttt ttgccttccg     8040

cacatttcca ttgctcgata cccacacctt gcttctcctg cacttgccaa ccttaatact     8100

ggtttacatt gaccaacatc ttacaagcgg ggggcttgtc tagggtatat ataaacagtg     8160

gctctcccaa tcggttgcca gtctcttttt tcctttcttt ccccacagat tcgaaatcta     8220

aactacacat cacagaattc cgagccgtga gtatccacga caagatcagt gtcgagacga     8280

cgcgttttgt gtaatgacac aatccgaaag tcgctagcaa cacacactct ctacacaaac     8340

taacccagct ctggtac                                                    8357


<210>  190
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  190
ttggccattc aacacgaggc ttctcacgga gc                                     32


<210>  191
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  191
gctccgtgag aagcctcgtg ttgaatggcc aa                                     32


<210>  192
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  192
gattggcttg gccattcaac acgatttctc tcacggagcc atca                        44


<210>  193
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  193
tgatggctcc gtgagagaaa tcgtgttgaa tggccaagcc aatc                        44


<210>  194
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  194
gccattcaac acgatggttc tcacggagcc atc                                    33


<210>  195
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  195
gatggctccg tgagaaccat cgtgttgaat ggc                                    33


<210>  196
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  196
gattggcttg gccattcaac acgatatgtc tcacggagcc atca                        44


<210>  197
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  197
tgatggctcc gtgagacata tcgtgttgaa tggccaagcc aatc                        44


<210>  198
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  198
ggcttggcca ttcaacacga ttcctctcac ggagcca                                37


<210>  199
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  199
tggctccgtg agaggaatcg tgttgaatgg ccaagcc                                37


<210>  200
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  200
gattggcttg gccattcaac acgattactc tcacggagcc atca                        44


<210>  201
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  201
tgatggctcc gtgagagtaa tcgtgttgaa tggccaagcc aatc                        44


<210>  202
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  202
ccattcaaca cgatgctgcc cacggagcca tcagcaa                                37


<210>  203
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  203
ttgctgatgg ctccgtgggc agcatcgtgt tgaatgg                                37


<210>  204
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  204
ggccattcaa cacgatgctt gtcacggagc ca                                     32


<210>  205
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  205
tggctccgtg acaagcatcg tgttgaatgg cc                                     32


<210>  206
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  206
cttggccatt caacacgatg ctggtcacgg agccat                                 36


<210>  207
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  207
atggctccgt gaccagcatc gtgttgaatg gccaag                                 36


<210>  208
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  208
ttggccattc aacacgatgc taaccacgga gccatcagca aaaag                       45


<210>  209
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  209
ctttttgctg atggctccgt ggttagcatc gtgttgaatg gccaa                       45


<210>  210
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  210
ccattcaaca cgatgctacc cacggagcca tcagcaa                                37


<210>  211
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer

<400>  211
ttgctgatgg ctccgtgggt agcatcgtgt tgaatgg                                37


<210>  212
<211>  1365
<212>  DNA
<213>  Euglena anabaena


<220>
<221>  CDS
<222>  (1)..(1365)
<223>  EaD5S-35a158g mutant delta-5 desaturase

<400>  212
atg gcc acc atc tcc ctg act acc gag cag ctc ctg gaa cac ccc gag         48
Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu           
1               5                   10                  15                

ctc gtt gcc atc gac gga gtc ctg tac gat ctc ttc ggt ctg gcc aag         96
Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys           
            20                  25                  30                    

gtc cat gcc gga ggc aac ctc atc gaa gct gcc ggt gca tcc gac gga        144
Val His Ala Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly           
        35                  40                  45                        

acc gct ctg ttc tac tcc atg cat cct gga gtc aag cca gag aac tcg        192
Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser           
    50                  55                  60                            

aag ctt ctg cag caa ttt gcc cga ggc aag cac gaa cga agc tcc aag        240
Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys           
65                  70                  75                  80            

gat ccc gtg tac acc ttc gac tct ccc ttt gct cag gac gtc aag cag        288
Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln           
                85                  90                  95                

tcc gtt cga gag gtc atg aag ggt cga aac tgg tac gcc act cct ggc        336
Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly           
            100                 105                 110                   

ttc tgg ctg aga acc gca ctc atc atc gct tgt act gcc att ggc gag        384
Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu           
        115                 120                 125                       

tgg tac tgg atc aca acc gga gca gtg atg tgg ggt atc ttt act gga        432
Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly           
    130                 135                 140                           

tac ttc cac tcg cag att ggc ttg gcc att caa cac gat ggt tct cac        480
Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Gly Ser His           
145                 150                 155                 160           

gga gcc atc agc aaa aag ccc tgg gtc aac gcc ttt ttc gct tat ggc        528
Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly           
                165                 170                 175               

atc gac gcc att ggt tcc tct cgt tgg atc tgg ctg cag tcc cac att        576
Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile           
            180                 185                 190                   

atg cga cat cac act tac acc aac cag cat ggc ctc gac ctg gat gct        624
Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala           
        195                 200                 205                       

gcc tcg gca gag ccg ttc atc ttg ttc cac tcc tat cct gct acc aac        672
Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn           
    210                 215                 220                           

gcc tct cga aag tgg tac cac cga ttt cag gcg tgg tac atg tac atc        720
Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile           
225                 230                 235                 240           

gtt ctg gga atg tat ggt gtc tcg atg gtg tac aat ccc atg tac ctc        768
Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu           
                245                 250                 255               

ttc aca atg cag cac aac gac acc att ccc gag gcc act tct ctc aga        816
Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg           
            260                 265                 270                   

cca ggc agc ttt ttc aat cgg cag cga gct ttc gcc gtt tcc ctt cga        864
Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg           
        275                 280                 285                       

ctg ctc ttc atc ttc cga aac gcc ttt ctt ccc tgg tac att gct ggt        912
Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly           
    290                 295                 300                           

gcc tct cct ctg ctc acc att ctt ctg gtg ccc acg gtc aca ggc atc        960
Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile           
305                 310                 315                 320           

ttc ctc acc ttt gtg ttc gtt ctg tcc cat aac ttc gag gga gcc gaa       1008
Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu           
                325                 330                 335               

cgg acc cca gag aag aac tgc aag gcc aaa cga gct aag gaa ggc aag       1056
Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys           
            340                 345                 350                   

gag gtc aga gac gtg gaa gag gat cga gtc gac tgg tac cga gca cag       1104
Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln           
        355                 360                 365                       

gcc gag act gct gcc acc tac ggt ggc agc gtg gga atg atg ctt aca       1152
Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr           
    370                 375                 380                           

ggc ggt ctc aac ctg cag atc gag cat cac ttg ttt ccc cga atg tcc       1200
Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser           
385                 390                 395                 400           

tct tgg cac tat ccc ttc att caa gac acc gtt cgg gag tgt tgc aag       1248
Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys           
                405                 410                 415               

cga cat ggc gtc cgt tac aca tac tat cct acc att ctc gag aac atc       1296
Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile           
            420                 425                 430                   

atg tcc act ctt cga tac atg cag aag gtg ggt gtt gct cac acc att       1344
Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile           
        435                 440                 445                       

cag gat gcc cag gag ttc taa                                           1365
Gln Asp Ala Gln Glu Phe                                                   
    450                                                                   


<210>  213
<211>  454
<212>  PRT
<213>  Euglena anabaena

<400>  213

Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu 
1               5                   10                  15      


Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys 
            20                  25                  30          


Val His Ala Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly 
        35                  40                  45              


Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser 
    50                  55                  60                  


Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys 
65                  70                  75                  80  


Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln 
                85                  90                  95      


Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly 
            100                 105                 110         


Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu 
        115                 120                 125             


Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly 
    130                 135                 140                 


Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Gly Ser His 
145                 150                 155                 160 


Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly 
                165                 170                 175     


Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile 
            180                 185                 190         


Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala 
        195                 200                 205             


Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn 
    210                 215                 220                 


Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile 
225                 230                 235                 240 


Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu 
                245                 250                 255     


Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg 
            260                 265                 270         


Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg 
        275                 280                 285             


Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly 
    290                 295                 300                 


Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile 
305                 310                 315                 320 


Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu 
                325                 330                 335     


Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys 
            340                 345                 350         


Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln 
        355                 360                 365             


Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr 
    370                 375                 380                 


Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser 
385                 390                 395                 400 


Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys 
                405                 410                 415     


Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile 
            420                 425                 430         


Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile 
        435                 440                 445             


Gln Asp Ala Gln Glu Phe 
    450                 


<210>  214
<211>  1365
<212>  DNA
<213>  Euglena anabaena


<220>
<221>  CDS
<222>  (1)..(1365)
<223>  Mutant delta-5 desaturase (Ala at position 35, Ser at position 158)

<400>  214
atg gcc acc atc tcc ctg act acc gag cag ctc ctg gaa cac ccc gag         48
Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu           
1               5                   10                  15                

ctc gtt gcc atc gac gga gtc ctg tac gat ctc ttc ggt ctg gcc aag         96
Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys           
            20                  25                  30                    

gtc cat gcc gga ggc aac ctc atc gaa gct gcc ggt gca tcc gac gga        144
Val His Ala Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly           
        35                  40                  45                        

acc gct ctg ttc tac tcc atg cat cct gga gtc aag cca gag aac tcg        192
Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser           
    50                  55                  60                            

aag ctt ctg cag caa ttt gcc cga ggc aag cac gaa cga agc tcc aag        240
Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys           
65                  70                  75                  80            

gat ccc gtg tac acc ttc gac tct ccc ttt gct cag gac gtc aag cag        288
Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln           
                85                  90                  95                

tcc gtt cga gag gtc atg aag ggt cga aac tgg tac gcc act cct ggc        336
Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly           
            100                 105                 110                   

ttc tgg ctg aga acc gca ctc atc atc gct tgt act gcc att ggc gag        384
Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu           
        115                 120                 125                       

tgg tac tgg atc aca acc gga gca gtg atg tgg ggt atc ttt act gga        432
Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly           
    130                 135                 140                           

tac ttc cac tcg cag att ggc ttg gcc att caa cac gat tcc tct cac        480
Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ser Ser His           
145                 150                 155                 160           

gga gcc atc agc aaa aag ccc tgg gtc aac gcc ttt ttc gct tat ggc        528
Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly           
                165                 170                 175               

atc gac gcc att ggt tcc tct cgt tgg atc tgg ctg cag tcc cac att        576
Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile           
            180                 185                 190                   

atg cga cat cac act tac acc aac cag cat ggc ctc gac ctg gat gct        624
Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala           
        195                 200                 205                       

gcc tcg gca gag ccg ttc atc ttg ttc cac tcc tat cct gct acc aac        672
Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn           
    210                 215                 220                           

gcc tct cga aag tgg tac cac cga ttt cag gcg tgg tac atg tac atc        720
Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile           
225                 230                 235                 240           

gtt ctg gga atg tat ggt gtc tcg atg gtg tac aat ccc atg tac ctc        768
Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu           
                245                 250                 255               

ttc aca atg cag cac aac gac acc att ccc gag gcc act tct ctc aga        816
Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg           
            260                 265                 270                   

cca ggc agc ttt ttc aat cgg cag cga gct ttc gcc gtt tcc ctt cga        864
Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg           
        275                 280                 285                       

ctg ctc ttc atc ttc cga aac gcc ttt ctt ccc tgg tac att gct ggt        912
Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly           
    290                 295                 300                           

gcc tct cct ctg ctc acc att ctt ctg gtg ccc acg gtc aca ggc atc        960
Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile           
305                 310                 315                 320           

ttc ctc acc ttt gtg ttc gtt ctg tcc cat aac ttc gag gga gcc gaa       1008
Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu           
                325                 330                 335               

cgg acc cca gag aag aac tgc aag gcc aaa cga gct aag gaa ggc aag       1056
Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys           
            340                 345                 350                   

gag gtc aga gac gtg gaa gag gat cga gtc gac tgg tac cga gca cag       1104
Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln           
        355                 360                 365                       

gcc gag act gct gcc acc tac ggt ggc agc gtg gga atg atg ctt aca       1152
Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr           
    370                 375                 380                           

ggc ggt ctc aac ctg cag atc gag cat cac ttg ttt ccc cga atg tcc       1200
Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser           
385                 390                 395                 400           

tct tgg cac tat ccc ttc att caa gac acc gtt cgg gag tgt tgc aag       1248
Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys           
                405                 410                 415               

cga cat ggc gtc cgt tac aca tac tat cct acc att ctc gag aac atc       1296
Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile           
            420                 425                 430                   

atg tcc act ctt cga tac atg cag aag gtg ggt gtt gct cac acc att       1344
Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile           
        435                 440                 445                       

cag gat gcc cag gag ttc taa                                           1365
Gln Asp Ala Gln Glu Phe                                                   
    450                                                                   


<210>  215
<211>  454
<212>  PRT
<213>  Euglena anabaena

<400>  215

Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu 
1               5                   10                  15      


Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys 
            20                  25                  30          


Val His Ala Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly 
        35                  40                  45              


Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser 
    50                  55                  60                  


Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys 
65                  70                  75                  80  


Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln 
                85                  90                  95      


Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly 
            100                 105                 110         


Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu 
        115                 120                 125             


Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly 
    130                 135                 140                 


Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ser Ser His 
145                 150                 155                 160 


Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly 
                165                 170                 175     


Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile 
            180                 185                 190         


Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala 
        195                 200                 205             


Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn 
    210                 215                 220                 


Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile 
225                 230                 235                 240 


Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu 
                245                 250                 255     


Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg 
            260                 265                 270         


Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg 
        275                 280                 285             


Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly 
    290                 295                 300                 


Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile 
305                 310                 315                 320 


Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu 
                325                 330                 335     


Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys 
            340                 345                 350         


Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln 
        355                 360                 365             


Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr 
    370                 375                 380                 


Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser 
385                 390                 395                 400 


Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys 
                405                 410                 415     


Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile 
            420                 425                 430         


Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile 
        435                 440                 445             


Gln Asp Ala Gln Glu Phe 
    450                 


<210>  216
<211>  1365
<212>  DNA
<213>  Euglena anabaena


<220>
<221>  CDS
<222>  (1)..(1365)

<400>  216
atg gcc acc atc tcc ctg act acc gag cag ctc ctg gaa cac ccc gag         48
Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu           
1               5                   10                  15                

ctc gtt gcc atc gac gga gtc ctg tac gat ctc ttc ggt ctg gcc aag         96
Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys           
            20                  25                  30                    

gtc cat gcc gga ggc aac ctc atc gaa gct gcc ggt gca tcc gac gga        144
Val His Ala Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly           
        35                  40                  45                        

acc gct ctg ttc tac tcc atg cat cct gga gtc aag cca gag aac tcg        192
Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser           
    50                  55                  60                            

aag ctt ctg cag caa ttt gcc cga ggc aag cac gaa cga agc tcc aag        240
Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys           
65                  70                  75                  80            

gat ccc gtg tac acc ttc gac tct ccc ttt gct cag gac gtc aag cag        288
Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln           
                85                  90                  95                

tcc gtt cga gag gtc atg aag ggt cga aac tgg tac gcc act cct ggc        336
Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly           
            100                 105                 110                   

ttc tgg ctg aga acc gca ctc atc atc gct tgt act gcc att ggc gag        384
Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu           
        115                 120                 125                       

tgg tac tgg atc aca acc gga gca gtg atg tgg ggt atc ttt act gga        432
Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly           
    130                 135                 140                           

tac ttc cac tcg cag att ggc ttg gcc att caa cac gat gct ggt cac        480
Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Gly His           
145                 150                 155                 160           

gga gcc atc agc aaa aag ccc tgg gtc aac gcc ttt ttc gct tat ggc        528
Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly           
                165                 170                 175               

atc gac gcc att ggt tcc tct cgt tgg atc tgg ctg cag tcc cac att        576
Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile           
            180                 185                 190                   

atg cga cat cac act tac acc aac cag cat ggc ctc gac ctg gat gct        624
Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala           
        195                 200                 205                       

gcc tcg gca gag ccg ttc atc ttg ttc cac tcc tat cct gct acc aac        672
Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn           
    210                 215                 220                           

gcc tct cga aag tgg tac cac cga ttt cag gcg tgg tac atg tac atc        720
Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile           
225                 230                 235                 240           

gtt ctg gga atg tat ggt gtc tcg atg gtg tac aat ccc atg tac ctc        768
Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu           
                245                 250                 255               

ttc aca atg cag cac aac gac acc att ccc gag gcc act tct ctc aga        816
Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg           
            260                 265                 270                   

cca ggc agc ttt ttc aat cgg cag cga gct ttc gcc gtt tcc ctt cga        864
Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg           
        275                 280                 285                       

ctg ctc ttc atc ttc cga aac gcc ttt ctt ccc tgg tac att gct ggt        912
Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly           
    290                 295                 300                           

gcc tct cct ctg ctc acc att ctt ctg gtg ccc acg gtc aca ggc atc        960
Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile           
305                 310                 315                 320           

ttc ctc acc ttt gtg ttc gtt ctg tcc cat aac ttc gag gga gcc gaa       1008
Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu           
                325                 330                 335               

cgg acc cca gag aag aac tgc aag gcc aaa cga gct aag gaa ggc aag       1056
Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys           
            340                 345                 350                   

gag gtc aga gac gtg gaa gag gat cga gtc gac tgg tac cga gca cag       1104
Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln           
        355                 360                 365                       

gcc gag act gct gcc acc tac ggt ggc agc gtg gga atg atg ctt aca       1152
Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr           
    370                 375                 380                           

ggc ggt ctc aac ctg cag atc gag cat cac ttg ttt ccc cga atg tcc       1200
Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser           
385                 390                 395                 400           

tct tgg cac tat ccc ttc att caa gac acc gtt cgg gag tgt tgc aag       1248
Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys           
                405                 410                 415               

cga cat ggc gtc cgt tac aca tac tat cct acc att ctc gag aac atc       1296
Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile           
            420                 425                 430                   

atg tcc act ctt cga tac atg cag aag gtg ggt gtt gct cac acc att       1344
Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile           
        435                 440                 445                       

cag gat gcc cag gag ttc taa                                           1365
Gln Asp Ala Gln Glu Phe                                                   
    450                                                                   


<210>  217
<211>  454
<212>  PRT
<213>  Euglena anabaena

<400>  217

Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu 
1               5                   10                  15      


Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys 
            20                  25                  30          


Val His Ala Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly 
        35                  40                  45              


Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser 
    50                  55                  60                  


Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys 
65                  70                  75                  80  


Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln 
                85                  90                  95      


Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly 
            100                 105                 110         


Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu 
        115                 120                 125             


Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly 
    130                 135                 140                 


Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Gly His 
145                 150                 155                 160 


Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly 
                165                 170                 175     


Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile 
            180                 185                 190         


Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala 
        195                 200                 205             


Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn 
    210                 215                 220                 


Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile 
225                 230                 235                 240 


Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu 
                245                 250                 255     


Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg 
            260                 265                 270         


Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg 
        275                 280                 285             


Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly 
    290                 295                 300                 


Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile 
305                 310                 315                 320 


Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu 
                325                 330                 335     


Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys 
            340                 345                 350         


Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln 
        355                 360                 365             


Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr 
    370                 375                 380                 


Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser 
385                 390                 395                 400 


Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys 
                405                 410                 415     


Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile 
            420                 425                 430         


Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile 
        435                 440                 445             


Gln Asp Ala Gln Glu Phe 
    450                 


<210>  218
<211>  4313
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pZKUM

<400>  218
taatcgagct tggcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc       60

acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga      120

gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg      180

tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg      240

cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg      300

gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga      360

aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg      420

gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag      480

aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc      540

gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg      600

ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt      660

cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc      720

ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc      780

actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg      840

tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca      900

gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc      960

ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat     1020

cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt     1080

ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt     1140

tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc     1200

agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc     1260

gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata     1320

ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg     1380

gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc     1440

cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct     1500

acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa     1560

cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt     1620

cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca     1680

ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac     1740

tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca     1800

atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt     1860

tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc     1920

actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca     1980

aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata     2040

ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc     2100

ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc     2160

cgaaaagtgc cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt     2220

acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc     2280

ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct     2340

ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat     2400

ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc     2460

acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc     2520

tattcttttg atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg     2580

atttaacaaa aatttaacgc gaattttaac aaaatattaa cgcttacaat ttccattcgc     2640

cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc     2700

agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc     2760

agtcacgacg ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat     2820

tgggtaccgg gccccccctc gaggtcgacg agtatctgtc tgactcgtca ttgccgcctt     2880

tggagtacga ctccaactat gagtgtgctt ggatcacttt gacgatacat tcttcgttgg     2940

aggctgtggg tctgacagct gcgttttcgg cgcggttggc cgacaacaat atcagctgca     3000

acgtcattgc tggctttcat catgatcaca tttttgtcgg caaaggcgac gcccagagag     3060

ccattgacgt tctttctaat ttggaccgat agccgtatag tccagtctat ctataagttc     3120

aactaactcg taactattac cataacatat acttcactgc cccagataag gttccgataa     3180

aaagttctgc agactaaatt tatttcagtc tcctcttcac caccaaaatg ccctcctacg     3240

aagctcgagt gctcaagctc gtggcagcca agaaaaccaa cctgtgtgct tctctggatg     3300

ttaccaccac caaggagctc attgagcttg ccgataaggt cggaccttat gtgtgcatga     3360

tcaaaaccca tatcgacatc attgacgact tcacctacgc cggcactgtg ctccccctca     3420

aggaacttgc tcttaagcac ggtttcttcc tgttcgagga cagaaagttc gcagatattg     3480

gcaacactgt caagcaccag taccggtgtc accgaatcgc cgagtggtcc gatatcacca     3540

acgcccacgg tgtacccgga accggaatcg attgctggcc tgcgagctgg tgcgtacgag     3600

gaaactgtct ctgaacagaa gaaggaggac gtctctgact acgagaactc ccagtacaag     3660

gagttcctag tcccctctcc caacgagaag ctggccagag gtctgctcat gctggccgag     3720

ctgtcttgca agggctctct ggccactggc gagtactcca agcagaccat tgagcttgcc     3780

cgatccgacc ccgagtttgt ggttggcttc attgcccaga accgacctaa gggcgactct     3840

gaggactggc ttattctgac ccccggggtg ggtcttgacg acaagggaga cgctctcgga     3900

cagcagtacc gaactgttga ggatgtcatg tctaccggaa cggatatcat aattgtcggc     3960

cgaggtctgt acggccagaa ccgagatcct attgaggagg ccaagcgata ccagaaggct     4020

ggctgggagg cttaccagaa gattaactgt tagaggttag actatggata tgtaatttaa     4080

ctgtgtatat agagagcgtg caagtatgga gcgcttgttc agcttgtatg atggtcagac     4140

gacctgtctg atcgagtatg tatgatactg cacaacctgt gtatccgcat gatctgtcca     4200

atggggcatg ttgttgtgtt tctcgatacg gagatgctgg gtacagtgct aatacgttga     4260

actacttata cttatatgag gctcgaagaa agctgacttg tgtatgactt aat            4313


<210>  219
<211>  13565
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pZKL3-9DPN9N

<400>  219
gtacggattg tgtatgtccc tgtacctgca tcttgatgga gagagctccg gaaagcggat       60

caggagctgt ccaattttaa ttttataaca tggaaacgag tccttggagc tagaagacca      120

ttttttcaac tgccctatcg actatattta tctactccaa aaccgactgc ttcccaagaa      180

tcttcagcca aggcttccaa agtaacccct cgcttcccga cacttaattg aaaccttaga      240

tgcagtcact gcgagtgaag tggactctaa catctccaac atagcgacga tattgcgagg      300

gtttgaatat aactaagatg catgatccat tacatttgta gaaatatcat aaacaacgaa      360

gcacatagac agaatgctgt tggttgttac atctgaagcc gaggtaccga tgtcattttc      420

agctgtcact gcagagacag gggtatgtca catttgaaga tcatacaacc gacgtttatg      480

aaaaccagag atatagagaa tgtattgacg gttgtggcta tgtcataagt gcagtgaagt      540

gcagtgatta taggtatagt acacttactg tagctacaag tacatactgc tacagtaata      600

ctcatgtatg caaaccgtat tctgtgtcta cagaaggcga tacggaagag tcaatctctt      660

atgtagagcc atttctataa tcgaaggggc cttgtaattt ccaaacgagt aattgagtaa      720

ttgaagagca tcgtagacat tacttatcat gtattgtgag agggaggaga tgcagctgta      780

gctactgcac atactgtact cgcccatgca gggataatgc atagcgagac ttggcagtag      840

gtgacagttg ctagctgcta cttgtagtcg ggtgggtgat agcatggcgc gccagctgca      900

ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt attgggcgct cttccgcttc      960

ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat cagctcactc     1020

aaaggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc     1080

aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag     1140

gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc     1200

gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt     1260

tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct     1320

ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg     1380

ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct     1440

tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat     1500

tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg     1560

ctacactaga agaacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa     1620

aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt     1680

ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc     1740

tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt     1800

atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta     1860

aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat     1920

ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac     1980

tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg     2040

ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag     2100

tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt     2160

aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt     2220

gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt     2280

tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt     2340

cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct     2400

tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt     2460

ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac     2520

cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa     2580

actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa     2640

ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca     2700

aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct     2760

ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga     2820

atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc     2880

tgatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggaaattgt     2940

aagcgttaat attttgttaa aattcgcgtt aaatttttgt taaatcagct cattttttaa     3000

ccaataggcc gaaatcggca aaatccctta taaatcaaaa gaatagaccg agatagggtt     3060

gagtgttgtt ccagtttgga acaagagtcc actattaaag aacgtggact ccaacgtcaa     3120

agggcgaaaa accgtctatc agggcgatgg cccactacgt gaaccatcac cctaatcaag     3180

ttttttgggg tcgaggtgcc gtaaagcact aaatcggaac cctaaaggga gcccccgatt     3240

tagagcttga cggggaaagc cggcgaacgt ggcgagaaag gaagggaaga aagcgaaagg     3300

agcgggcgct agggcgctgg caagtgtagc ggtcacgctg cgcgtaacca ccacacccgc     3360

cgcgcttaat gcgccgctac agggcgcgtc cattcgccat tcaggctgcg caactgttgg     3420

gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct     3480

gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg taaaacgacg     3540

gccagtgaat tgtaatacga ctcactatag ggcgaattgg gcccgacgtc gcatgcagga     3600

atagacatct tcaataggag cattaatacc tgtgggatca ctgatgtaaa cttctcccag     3660

agtatgtgaa taaccagcgg gccatccaac aaagaagtcg ttccagtgag tgactcggta     3720

catccgtctt tcggggttga tggtaagtcc gtcgtctcct tgcttaaaga acagagcgtc     3780

cacgtagtct gcaaaagcct tgtttccaag tcgaggctgc ccatagttga ttagcgttgg     3840

atcatatcca agattcttca ggttgatgcc catgaataga gcagtgacag ctcctagaga     3900

gtggccagtt acgatcaatt tgtagtcagt gttgtttcca aggaagtcga ccagacgatc     3960

ctgtacgttc accatagtct ctctgtatgc cttctgaaag ccatcatgaa cttggcagcc     4020

aggacaattg atactggcag aagggtttgt ggagtttatg tcagtagtgt taagaggagg     4080

gatactggtc atgtagggtt gttggatcgt ttggatgtca gtaatagcgt ctgcaatgga     4140

gaaagtgcct cggaaaacaa tatacttttc ctttttggtg tgatcgtggg ccaaaaatcc     4200

agtaactgaa gtcgagaaga aatttcctcc aaactggtag tcaagagtca catcgggaaa     4260

atgagcgcaa gagtttccac aggtaaaatc gctctgcagg gcaaatgggc caggggctct     4320

gacacaatag gccacgttag atagccatcc gtacttgaga acaaagtcgt atgtctcctg     4380

ggtgatagga gccgttaatt aagttgcgac acatgtcttg atagtatctt gaattctctc     4440

tcttgagctt ttccataaca agttcttctg cctccaggaa gtccatgggt ggtttgatca     4500

tggttttggt gtagtggtag tgcagtggtg gtattgtgac tggggatgta gttgagaata     4560

agtcatacac aagtcagctt tcttcgagcc tcatataagt ataagtagtt caacgtatta     4620

gcactgtacc cagcatctcc gtatcgagaa acacaacaac atgccccatt ggacagatca     4680

tgcggataca caggttgtgc agtatcatac atactcgatc agacaggtcg tctgaccatc     4740

atacaagctg aacaagcgct ccatacttgc acgctctcta tatacacagt taaattacat     4800

atccatagtc taacctctaa cagttaatct tctggtaagc ctcccagcca gccttctggt     4860

atcgcttggc ctcctcaata ggatctcggt tctggccgta cagacctcgg ccgacaatta     4920

tgatatccgt tccggtagac atgacatcct caacagttcg gtactgctgt ccgagagcgt     4980

ctcccttgtc gtcaagaccc accccggggg tcagaataag ccagtcctca gagtcgccct     5040

taggtcggtt ctgggcaatg aagccaacca caaactcggg gtcggatcgg gcaagctcaa     5100

tggtctgctt ggagtactcg ccagtggcca gagagccctt gcaagacagc tcggccagca     5160

tgagcagacc tctggccagc ttctcgttgg gagaggggac taggaactcc ttgtactggg     5220

agttctcgta gtcagagacg tcctccttct tctgttcaga gacagtttcc tcggcaccag     5280

ctcgcaggcc agcaatgatt ccggttccgg gtacaccgtg ggcgttggtg atatcggacc     5340

actcggcgat tcggtgacac cggtactggt gcttgacagt gttgccaata tctgcgaact     5400

ttctgtcctc gaacaggaag aaaccgtgct taagagcaag ttccttgagg gggagcacag     5460

tgccggcgta ggtgaagtcg tcaatgatgt cgatatgggt tttgatcatg cacacataag     5520

gtccgacctt atcggcaagc tcaatgagct ccttggtggt ggtaacatcc agagaagcac     5580

acaggttggt tttcttggct gccacgagct tgagcactcg agcggcaaag gcggacttgt     5640

ggacgttagc tcgagcttcg taggagggca ttttggtggt gaagaggaga ctgaaataaa     5700

tttagtctgc agaacttttt atcggaacct tatctggggc agtgaagtat atgttatggt     5760

aatagttacg agttagttga acttatagat agactggact atacggctat cggtccaaat     5820

tagaaagaac gtcaatggct ctctgggcgt cgcctttgcc gacaaaaatg tgatcatgat     5880

gaaagccagc aatgacgttg cagctgatat tgttgtcggc caaccgcgcc gaaaacgcag     5940

ctgtcagacc cacagcctcc aacgaagaat gtatcgtcaa agtgatccaa gcacactcat     6000

agttggagtc gtactccaaa ggcggcaatg acgagtcaga cagatactcg tcgacctttt     6060

ccttgggaac caccaccgtc agcccttctg actcacgtat tgtagccacc gacacaggca     6120

acagtccgtg gatagcagaa tatgtcttgt cggtccattt ctcaccaact ttaggcgtca     6180

agtgaatgtt gcagaagaag tatgtgcctt cattgagaat cggtgttgct gatttcaata     6240

aagtcttgag atcagtttgg ccagtcatgt tgtggggggt aattggattg agttatcgcc     6300

tacagtctgt acaggtatac tcgctgccca ctttatactt tttgattccg ctgcacttga     6360

agcaatgtcg tttaccaaaa gtgagaatgc tccacagaac acaccccagg gtatggttga     6420

gcaaaaaata aacactccga tacggggaat cgaaccccgg tctccacggt tctcaagaag     6480

tattcttgat gagagcgtat cgatggttaa tgctgctgtg tgctgtgtgt gtgtgttgtt     6540

tggcgctcat tgttgcgtta tgcagcgtac accacaatat tggaagctta ttagcctttc     6600

tattttttcg tttgcaaggc ttaacaacat tgctgtggag agggatgggg atatggaggc     6660

cgctggaggg agtcggagag gcgttttgga gcggcttggc ctggcgccca gctcgcgaaa     6720

cgcacctagg accctttggc acgccgaaat gtgccacttt tcagtctagt aacgccttac     6780

ctacgtcatt ccatgcgtgc atgtttgcgc cttttttccc ttgcccttga tcgccacaca     6840

gtacagtgca ctgtacagtg gaggttttgg gggggtctta gatgggagct aaaagcggcc     6900

tagcggtaca ctagtgggat tgtatggagt ggcatggagc ctaggtggag cctgacagga     6960

cgcacgaccg gctagcccgt gacagacgat gggtggctcc tgttgtccac cgcgtacaaa     7020

tgtttgggcc aaagtcttgt cagccttgct tgcgaaccta attcccaatt ttgtcacttc     7080

gcacccccat tgatcgagcc ctaacccctg cccatcaggc aatccaatta agctcgcatt     7140

gtctgccttg tttagtttgg ctcctgcccg tttcggcgtc cacttgcaca aacacaaaca     7200

agcattatat ataaggctcg tctctccctc ccaaccacac tcactttttt gcccgtcttc     7260

ccttgctaac acaaaagtca agaacacaaa caaccacccc aaccccctta cacacaagac     7320

atatctacag caatggccat ggccaaaagc aaacgacggt cggaggctgt ggaagagcac     7380

gtgaccggct cggacgaggg cttgaccgat acttcgggtc acgtgagccc tgccgccaag     7440

aagcagaaga actcggagat tcatttcacc acccaggctg cccagcagtt ggatcgggag     7500

cgcaaggagg agtatctgga ctcgctgatc gacaacaagg actatctcaa gtaccgtcct     7560

cgaggctgga agctcaacaa cccgcctacc gaccgacctg tgcgaatcta cgccgatgga     7620

gtgtttgatt tgttccatct gggacacatg cgtcagctgg agcagtccaa gaaggccttc     7680

cccaacgcag tgttgattgt gggcattccc agcgacaagg agacccacaa gcggaaggga     7740

ttgaccgtgc tgagtgacgt ccagcggtac gagacggtgc gacactgcaa gtgggtggac     7800

gaggtggtgg aggatgctcc ctggtgtgtc accatggact ttctggaaaa acacaaaatc     7860

gactacgtgg cccatgacga tctgccctac gcttccggca acgacgatga tatctacaag     7920

cccatcaagg agaagggcat gtttctggcc acccagcgaa ccgagggcat ttccacctcg     7980

gacatcatca ccaagattat ccgagactac gacaagtatt taatgcgaaa ctttgcccgg     8040

ggtgctaacc gaaaggatct caacgtctcg tggctcaaga agaacgagct ggacttcaag     8100

cgtcatgtgg ccgagttccg aaactcgttc aagcgaaaga aggtcggtaa ggatctctac     8160

ggcgagattc gcggtctgct gcagaatgtg ctcatttgga acggcgacaa ctccggcact     8220

tccactcccc agcgaaagac gctgcagacc aacgccaaga agatgtacat gaacgtgctc     8280

aagactctgc aggctcctga cgctgttgac gtggactcct cggagaacgt gtctgagaac     8340

gtcactgatg aggaggagga agacgacgac gaggttgatg aggacgaaga agccgacgac     8400

gacgacgaag acgacgaaga cgaggaagac gacgagtagg cggccgcatt gatgattgga     8460

aacacacaca tgggttatat ctaggtgaga gttagttgga cagttatata ttaaatcagc     8520

tatgccaacg gtaacttcat tcatgtcaac gaggaaccag tgactgcaag taatatagaa     8580

tttgaccacc ttgccattct cttgcactcc tttactatat ctcatttatt tcttatatac     8640

aaatcacttc ttcttcccag catcgagctc ggaaacctca tgagcaataa catcgtggat     8700

ctcgtcaata gagggctttt tggactcctt gctgttggcc accttgtcct tgctgtttaa     8760

acacgcagta ggatgtcctg cacgggtctt tttgtggggt gtggagaaag gggtgcttgg     8820

agatggaagc cggtagaacc gggctgcttg tgcttggaga tggaagccgg tagaaccggg     8880

ctgcttgggg ggatttgggg ccgctgggct ccaaagaggg gtaggcattt cgttggggtt     8940

acgtaattgc ggcatttggg tcctgcgcgc atgtcccatt ggtcagaatt agtccggata     9000

ggagacttat cagccaatca cagcgccgga tccacctgta ggttgggttg ggtgggagca     9060

cccctccaca gagtagagtc aaacagcagc agcaacatga tagttggggg tgtgcgtgtt     9120

aaaggaaaaa aaagaagctt gggttatatt cccgctctat ttagaggttg cgggatagac     9180

gccgacggag ggcaatggcg ctatggaacc ttgcggatat ccatacgccg cggcggactg     9240

cgtccgaacc agctccagca gcgttttttc cgggccattg agccgactgc gaccccgcca     9300

acgtgtcttg gcccacgcac tcatgtcatg ttggtgttgg gaggccactt tttaagtagc     9360

acaaggcacc tagctcgcag caaggtgtcc gaaccaaaga agcggctgca gtggtgcaaa     9420

cggggcggaa acggcgggaa aaagccacgg gggcacgaat tgaggcacgc cctcgaattt     9480

gagacgagtc acggccccat tcgcccgcgc aatggctcgc caacgcccgg tcttttgcac     9540

cacatcaggt taccccaagc caaacctttg tgttaaaaag cttaacatat tataccgaac     9600

gtaggtttgg gcgggcttgc tccgtctgtc caaggcaaca tttatataag ggtctgcatc     9660

gccggctcaa ttgaatcttt tttcttcttc tcttctctat attcattctt gaattaaaca     9720

cacatcaaca tggccatcaa agtcggtatt aacggattcg ggcgaatcgg acgaattgtg     9780

agtaccatag aaggtgatgg aaacatgacc caacagaaac agatgacaag tgtcatcgac     9840

ccaccagagc ccaattgagc tcatactaac agtcgacaac ctgtcgaacc aattgatgac     9900

tccccgacaa tgtactaaca caggtcctgc ccatggtgaa aaacgtggac caagtggatc     9960

tctcgcaggt cgacaccatt gcctccggcc gagatgtcaa ctacaaggtc aagtacacct    10020

ccggcgttaa gatgagccag ggcgcctacg acgacaaggg ccgccacatt tccgagcagc    10080

ccttcacctg ggccaactgg caccagcaca tcaactggct caacttcatt ctggtgattg    10140

cgctgcctct gtcgtccttt gctgccgctc ccttcgtctc cttcaactgg aagaccgccg    10200

cgtttgctgt cggctattac atgtgcaccg gtctcggtat caccgccggc taccaccgaa    10260

tgtgggccca tcgagcctac aaggccgctc tgcccgttcg aatcatcctt gctctgtttg    10320

gaggaggagc tgtcgagggc tccatccgat ggtgggcctc gtctcaccga gtccaccacc    10380

gatggaccga ctccaacaag gacccttacg acgcccgaaa gggattctgg ttctcccact    10440

ttggctggat gctgcttgtg cccaacccca agaacaaggg ccgaactgac atttctgacc    10500

tcaacaacga ctgggttgtc cgactccagc acaagtacta cgtttacgtt ctcgtcttca    10560

tggccattgt tctgcccacc ctcgtctgtg gctttggctg gggcgactgg aagggaggtc    10620

ttgtctacgc cggtatcatg cgatacacct ttgtgcagca ggtgactttc tgtgtcaact    10680

cccttgccca ctggattgga gagcagccct tcgacgaccg acgaactccc cgagaccacg    10740

ctcttaccgc cctggtcacc tttggagagg gctaccacaa cttccaccac gagttcccct    10800

cggactaccg aaacgccctc atctggtacc agtacgaccc caccaagtgg ctcatctgga    10860

ccctcaagca ggttggtctc gcctgggacc tccagacctt ctcccagaac gccatcgagc    10920

agggtctcgt gcagcagcga cagaagaagc tggacaagtg gcgaaacaac ctcaactggg    10980

gtatccccat tgagcagctg cctgtcattg agtttgagga gttccaagag caggccaaga    11040

cccgagatct ggttctcatt tctggcattg tccacgacgt gtctgccttt gtcgagcacc    11100

accctggtgg aaaggccctc attatgagcg ccgtcggcaa ggacggtacc gctgtcttca    11160

acggaggtgt ctaccgacac tccaacgctg gccacaacct gcttgccacc atgcgagttt    11220

cggtcattcg aggcggcatg gaggttgagg tgtggaagac tgcccagaac gaaaagaagg    11280

accagaacat tgtctccgat gagagtggaa accgaatcca ccgagctggt ctccaggcca    11340

cccgggtcga gaaccccggt atgtctggca tggctgctta ggcggccgca tgagaagata    11400

aatatataaa tacattgaga tattaaatgc gctagattag agagcctcat actgctcgga    11460

gagaagccaa gacgagtact caaaggggat tacaccatcc atatccacag acacaagctg    11520

gggaaaggtt ctatatacac tttccggaat accgtagttt ccgatgttat caatgggggc    11580

agccaggatt tcaggcactt cggtgtctcg gggtgaaatg gcgttcttgg cctccatcaa    11640

gtcgtaccat gtcttcattt gcctgtcaaa gtaaaacaga agcagatgaa gaatgaactt    11700

gaagtgaagg aatttaaata gttggagcaa gggagaaatg tagagtgtga aagactcact    11760

atggtccggg cttatctcga ccaatagcca aagtctggag tttctgagag aaaaaggcaa    11820

gatacgtatg taacaaagcg acgcatggta caataatacc ggaggcatgt atcatagaga    11880

gttagtggtt cgatgatggc actggtgcct ggtatgactt tatacggctg actacatatt    11940

tgtcctcaga catacaatta cagtcaagca cttacccttg gacatctgta ggtacccccc    12000

ggccaagacg atctcagcgt gtcgtatgtc ggattggcgt agctccctcg ctcgtcaatt    12060

ggctcccatc tactttcttc tgcttggcta cacccagcat gtctgctatg gctcgttttc    12120

gtgccttatc tatcctccca gtattaccaa ctctaaatga catgatgtga ttgggtctac    12180

actttcatat cagagataag gagtagcaca gttgcataaa aagcccaact ctaatcagct    12240

tcttcctttc ttgtaattag tacaaaggtg attagcgaaa tctggaagct tagttggccc    12300

taaaaaaatc aaaaaaagca aaaaacgaaa aacgaaaaac cacagttttg agaacaggga    12360

ggtaacgaag gatcgtatat atatatatat atatatatac ccacggatcc cgagaccggc    12420

ctttgattct tccctacaac caaccattct caccacccta attcacaacc atggaggtcg    12480

tgaacgaaat cgtctccatt ggccaggagg ttcttcccaa ggtcgactat gctcagctct    12540

ggtctgatgc ctcgcactgc gaggtgctgt acctctccat cgccttcgtc atcctgaagt    12600

tcacccttgg tcctctcgga cccaagggtc agtctcgaat gaagtttgtg ttcaccaact    12660

acaacctgct catgtccatc tactcgctgg gctccttcct ctctatggcc tacgccatgt    12720

acaccattgg tgtcatgtcc gacaactgcg agaaggcttt cgacaacaat gtcttccgaa    12780

tcaccactca gctgttctac ctcagcaagt tcctcgagta cattgactcc ttctatctgc    12840

ccctcatggg caagcctctg acctggttgc agttctttca ccatctcgga gctcctatgg    12900

acatgtggct gttctacaac taccgaaacg aagccgtttg gatctttgtg ctgctcaacg    12960

gcttcattca ctggatcatg tacggctact attggacccg actgatcaag ctcaagttcc    13020

ctatgcccaa gtccctgatt acttctatgc agatcattca gttcaacgtt ggcttctaca    13080

tcgtctggaa gtaccggaac attccctgct accgacaaga tggaatgaga atgtttggct    13140

ggtttttcaa ctacttctac gttggtactg tcctgtgtct gttcctcaac ttctacgtgc    13200

agacctacat cgtccgaaag cacaagggag ccaaaaagat tcagtgagcg gccgcaagtg    13260

tggatgggga agtgagtgcc cggttctgtg tgcacaattg gcaatccaag atggatggat    13320

tcaacacagg gatatagcga gctacgtggt ggtgcgagga tatagcaacg gatatttatg    13380

tttgacactt gagaatgtac gatacaagca ctgtccaagt acaatactaa acatactgta    13440

catactcata ctcgtacccg gcaacggttt cacttgagtg cagtggctag tgctcttact    13500

cgtacagtgt gcaatactgc gtatcatagt ctttgatgta tatcgtattc attcatgtta    13560

gttgc                                                                13565


<210>  220
<211>  777
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(777)
<223>  mutant delta-9 elongase "EgD9eS-L35G"

<400>  220
atg gag gtc gtg aac gaa atc gtc tcc att ggc cag gag gtt ctt ccc         48
Met Glu Val Val Asn Glu Ile Val Ser Ile Gly Gln Glu Val Leu Pro           
1               5                   10                  15                

aag gtc gac tat gct cag ctc tgg tct gat gcc tcg cac tgc gag gtg         96
Lys Val Asp Tyr Ala Gln Leu Trp Ser Asp Ala Ser His Cys Glu Val           
            20                  25                  30                    

ctg tac ggg tcc atc gcc ttc gtc atc ctg aag ttc acc ctt ggt cct        144
Leu Tyr Gly Ser Ile Ala Phe Val Ile Leu Lys Phe Thr Leu Gly Pro           
        35                  40                  45                        

ctc gga ccc aag ggt cag tct cga atg aag ttt gtg ttc acc aac tac        192
Leu Gly Pro Lys Gly Gln Ser Arg Met Lys Phe Val Phe Thr Asn Tyr           
    50                  55                  60                            

aac ctg ctc atg tcc atc tac tcg ctg ggc tcc ttc ctc tct atg gcc        240
Asn Leu Leu Met Ser Ile Tyr Ser Leu Gly Ser Phe Leu Ser Met Ala           
65                  70                  75                  80            

tac gcc atg tac acc att ggt gtc atg tcc gac aac tgc gag aag gct        288
Tyr Ala Met Tyr Thr Ile Gly Val Met Ser Asp Asn Cys Glu Lys Ala           
                85                  90                  95                

ttc gac aac aat gtc ttc cga atc acc act cag ctg ttc tac ctc agc        336
Phe Asp Asn Asn Val Phe Arg Ile Thr Thr Gln Leu Phe Tyr Leu Ser           
            100                 105                 110                   

aag ttc ctc gag tac att gac tcc ttc tat ctg ccc ctc atg ggc aag        384
Lys Phe Leu Glu Tyr Ile Asp Ser Phe Tyr Leu Pro Leu Met Gly Lys           
        115                 120                 125                       

cct ctg acc tgg ttg cag ttc ttt cac cat ctc gga gct cct atg gac        432
Pro Leu Thr Trp Leu Gln Phe Phe His His Leu Gly Ala Pro Met Asp           
    130                 135                 140                           

atg tgg ctg ttc tac aac tac cga aac gaa gcc gtt tgg atc ttt gtg        480
Met Trp Leu Phe Tyr Asn Tyr Arg Asn Glu Ala Val Trp Ile Phe Val           
145                 150                 155                 160           

ctg ctc aac ggc ttc att cac tgg atc atg tac ggc tac tat tgg acc        528
Leu Leu Asn Gly Phe Ile His Trp Ile Met Tyr Gly Tyr Tyr Trp Thr           
                165                 170                 175               

cga ctg atc aag ctc aag ttc cct atg ccc aag tcc ctg att act tct        576
Arg Leu Ile Lys Leu Lys Phe Pro Met Pro Lys Ser Leu Ile Thr Ser           
            180                 185                 190                   

atg cag atc att cag ttc aac gtt ggc ttc tac atc gtc tgg aag tac        624
Met Gln Ile Ile Gln Phe Asn Val Gly Phe Tyr Ile Val Trp Lys Tyr           
        195                 200                 205                       

cgg aac att ccc tgc tac cga caa gat gga atg aga atg ttt ggc tgg        672
Arg Asn Ile Pro Cys Tyr Arg Gln Asp Gly Met Arg Met Phe Gly Trp           
    210                 215                 220                           

ttt ttc aac tac ttc tac gtt ggt act gtc ctg tgt ctg ttc ctc aac        720
Phe Phe Asn Tyr Phe Tyr Val Gly Thr Val Leu Cys Leu Phe Leu Asn           
225                 230                 235                 240           

ttc tac gtg cag acc tac atc gtc cga aag cac aag gga gcc aaa aag        768
Phe Tyr Val Gln Thr Tyr Ile Val Arg Lys His Lys Gly Ala Lys Lys           
                245                 250                 255               

att cag tga                                                            777
Ile Gln                                                                   
                                                                          


<210>  221
<211>  258
<212>  PRT
<213>  Euglena gracilis

<400>  221

Met Glu Val Val Asn Glu Ile Val Ser Ile Gly Gln Glu Val Leu Pro 
1               5                   10                  15      


Lys Val Asp Tyr Ala Gln Leu Trp Ser Asp Ala Ser His Cys Glu Val 
            20                  25                  30          


Leu Tyr Gly Ser Ile Ala Phe Val Ile Leu Lys Phe Thr Leu Gly Pro 
        35                  40                  45              


Leu Gly Pro Lys Gly Gln Ser Arg Met Lys Phe Val Phe Thr Asn Tyr 
    50                  55                  60                  


Asn Leu Leu Met Ser Ile Tyr Ser Leu Gly Ser Phe Leu Ser Met Ala 
65                  70                  75                  80  


Tyr Ala Met Tyr Thr Ile Gly Val Met Ser Asp Asn Cys Glu Lys Ala 
                85                  90                  95      


Phe Asp Asn Asn Val Phe Arg Ile Thr Thr Gln Leu Phe Tyr Leu Ser 
            100                 105                 110         


Lys Phe Leu Glu Tyr Ile Asp Ser Phe Tyr Leu Pro Leu Met Gly Lys 
        115                 120                 125             


Pro Leu Thr Trp Leu Gln Phe Phe His His Leu Gly Ala Pro Met Asp 
    130                 135                 140                 


Met Trp Leu Phe Tyr Asn Tyr Arg Asn Glu Ala Val Trp Ile Phe Val 
145                 150                 155                 160 


Leu Leu Asn Gly Phe Ile His Trp Ile Met Tyr Gly Tyr Tyr Trp Thr 
                165                 170                 175     


Arg Leu Ile Lys Leu Lys Phe Pro Met Pro Lys Ser Leu Ile Thr Ser 
            180                 185                 190         


Met Gln Ile Ile Gln Phe Asn Val Gly Phe Tyr Ile Val Trp Lys Tyr 
        195                 200                 205             


Arg Asn Ile Pro Cys Tyr Arg Gln Asp Gly Met Arg Met Phe Gly Trp 
    210                 215                 220                 


Phe Phe Asn Tyr Phe Tyr Val Gly Thr Val Leu Cys Leu Phe Leu Asn 
225                 230                 235                 240 


Phe Tyr Val Gln Thr Tyr Ile Val Arg Lys His Lys Gly Ala Lys Lys 
                245                 250                 255     


Ile Gln 
        


<210>  222
<211>  1449
<212>  DNA
<213>  Yarrowia lipolytica


<220>
<221>  CDS
<222>  (1)..(1449)
<223>  delta-9 desaturase; GenBank Accession No. XM_501496

<400>  222
atg gtg aaa aac gtg gac caa gtg gat ctc tcg cag gtc gac acc att         48
Met Val Lys Asn Val Asp Gln Val Asp Leu Ser Gln Val Asp Thr Ile           
1               5                   10                  15                

gcc tcc ggc cga gat gtc aac tac aag gtc aag tac acc tcc ggc gtt         96
Ala Ser Gly Arg Asp Val Asn Tyr Lys Val Lys Tyr Thr Ser Gly Val           
            20                  25                  30                    

aag atg agc cag ggc gcc tac gac gac aag ggc cgc cac att tcc gag        144
Lys Met Ser Gln Gly Ala Tyr Asp Asp Lys Gly Arg His Ile Ser Glu           
        35                  40                  45                        

cag ccc ttc acc tgg gcc aac tgg cac cag cac atc aac tgg ctc aac        192
Gln Pro Phe Thr Trp Ala Asn Trp His Gln His Ile Asn Trp Leu Asn           
    50                  55                  60                            

ttc att ctg gtg att gcg ctg cct ctg tcg tcc ttt gct gcc gct ccc        240
Phe Ile Leu Val Ile Ala Leu Pro Leu Ser Ser Phe Ala Ala Ala Pro           
65                  70                  75                  80            

ttc gtc tcc ttc aac tgg aag acc gcc gcg ttt gct gtc ggc tat tac        288
Phe Val Ser Phe Asn Trp Lys Thr Ala Ala Phe Ala Val Gly Tyr Tyr           
                85                  90                  95                

atg tgc acc ggt ctc ggt atc acc gcc ggc tac cac cga atg tgg gcc        336
Met Cys Thr Gly Leu Gly Ile Thr Ala Gly Tyr His Arg Met Trp Ala           
            100                 105                 110                   

cat cga gcc tac aag gcc gct ctg ccc gtt cga atc atc ctt gct ctg        384
His Arg Ala Tyr Lys Ala Ala Leu Pro Val Arg Ile Ile Leu Ala Leu           
        115                 120                 125                       

ttt gga gga gga gct gtc gag ggc tcc atc cga tgg tgg gcc tcg tct        432
Phe Gly Gly Gly Ala Val Glu Gly Ser Ile Arg Trp Trp Ala Ser Ser           
    130                 135                 140                           

cac cga gtc cac cac cga tgg acc gac tcc aac aag gac cct tac gac        480
His Arg Val His His Arg Trp Thr Asp Ser Asn Lys Asp Pro Tyr Asp           
145                 150                 155                 160           

gcc cga aag gga ttc tgg ttc tcc cac ttt ggc tgg atg ctg ctt gtg        528
Ala Arg Lys Gly Phe Trp Phe Ser His Phe Gly Trp Met Leu Leu Val           
                165                 170                 175               

ccc aac ccc aag aac aag ggc cga act gac att tct gac ctc aac aac        576
Pro Asn Pro Lys Asn Lys Gly Arg Thr Asp Ile Ser Asp Leu Asn Asn           
            180                 185                 190                   

gac tgg gtt gtc cga ctc cag cac aag tac tac gtt tac gtt ctc gtc        624
Asp Trp Val Val Arg Leu Gln His Lys Tyr Tyr Val Tyr Val Leu Val           
        195                 200                 205                       

ttc atg gcc att gtt ctg ccc acc ctc gtc tgt ggc ttt ggc tgg ggc        672
Phe Met Ala Ile Val Leu Pro Thr Leu Val Cys Gly Phe Gly Trp Gly           
    210                 215                 220                           

gac tgg aag gga ggt ctt gtc tac gcc ggt atc atg cga tac acc ttt        720
Asp Trp Lys Gly Gly Leu Val Tyr Ala Gly Ile Met Arg Tyr Thr Phe           
225                 230                 235                 240           

gtg cag cag gtg act ttc tgt gtc aac tcc ctt gcc cac tgg att gga        768
Val Gln Gln Val Thr Phe Cys Val Asn Ser Leu Ala His Trp Ile Gly           
                245                 250                 255               

gag cag ccc ttc gac gac cga cga act ccc cga gac cac gct ctt acc        816
Glu Gln Pro Phe Asp Asp Arg Arg Thr Pro Arg Asp His Ala Leu Thr           
            260                 265                 270                   

gcc ctg gtc acc ttt gga gag ggc tac cac aac ttc cac cac gag ttc        864
Ala Leu Val Thr Phe Gly Glu Gly Tyr His Asn Phe His His Glu Phe           
        275                 280                 285                       

ccc tcg gac tac cga aac gcc ctc atc tgg tac cag tac gac ccc acc        912
Pro Ser Asp Tyr Arg Asn Ala Leu Ile Trp Tyr Gln Tyr Asp Pro Thr           
    290                 295                 300                           

aag tgg ctc atc tgg acc ctc aag cag gtt ggt ctc gcc tgg gac ctc        960
Lys Trp Leu Ile Trp Thr Leu Lys Gln Val Gly Leu Ala Trp Asp Leu           
305                 310                 315                 320           

cag acc ttc tcc cag aac gcc atc gag cag ggt ctc gtg cag cag cga       1008
Gln Thr Phe Ser Gln Asn Ala Ile Glu Gln Gly Leu Val Gln Gln Arg           
                325                 330                 335               

cag aag aag ctg gac aag tgg cga aac aac ctc aac tgg ggt atc ccc       1056
Gln Lys Lys Leu Asp Lys Trp Arg Asn Asn Leu Asn Trp Gly Ile Pro           
            340                 345                 350                   

att gag cag ctg cct gtc att gag ttt gag gag ttc caa gag cag gcc       1104
Ile Glu Gln Leu Pro Val Ile Glu Phe Glu Glu Phe Gln Glu Gln Ala           
        355                 360                 365                       

aag acc cga gat ctg gtt ctc att tct ggc att gtc cac gac gtg tct       1152
Lys Thr Arg Asp Leu Val Leu Ile Ser Gly Ile Val His Asp Val Ser           
    370                 375                 380                           

gcc ttt gtc gag cac cac cct ggt gga aag gcc ctc att atg agc gcc       1200
Ala Phe Val Glu His His Pro Gly Gly Lys Ala Leu Ile Met Ser Ala           
385                 390                 395                 400           

gtc ggc aag gac ggt acc gct gtc ttc aac gga ggt gtc tac cga cac       1248
Val Gly Lys Asp Gly Thr Ala Val Phe Asn Gly Gly Val Tyr Arg His           
                405                 410                 415               

tcc aac gct ggc cac aac ctg ctt gcc acc atg cga gtt tcg gtc att       1296
Ser Asn Ala Gly His Asn Leu Leu Ala Thr Met Arg Val Ser Val Ile           
            420                 425                 430                   

cga ggc ggc atg gag gtt gag gtg tgg aag act gcc cag aac gaa aag       1344
Arg Gly Gly Met Glu Val Glu Val Trp Lys Thr Ala Gln Asn Glu Lys           
        435                 440                 445                       

aag gac cag aac att gtc tcc gat gag agt gga aac cga atc cac cga       1392
Lys Asp Gln Asn Ile Val Ser Asp Glu Ser Gly Asn Arg Ile His Arg           
    450                 455                 460                           

gct ggt ctc cag gcc acc cgg gtc gag aac ccc ggt atg tct ggc atg       1440
Ala Gly Leu Gln Ala Thr Arg Val Glu Asn Pro Gly Met Ser Gly Met           
465                 470                 475                 480           

gct gct tag                                                           1449
Ala Ala                                                                   
                                                                          


<210>  223
<211>  482
<212>  PRT
<213>  Yarrowia lipolytica

<400>  223

Met Val Lys Asn Val Asp Gln Val Asp Leu Ser Gln Val Asp Thr Ile 
1               5                   10                  15      


Ala Ser Gly Arg Asp Val Asn Tyr Lys Val Lys Tyr Thr Ser Gly Val 
            20                  25                  30          


Lys Met Ser Gln Gly Ala Tyr Asp Asp Lys Gly Arg His Ile Ser Glu 
        35                  40                  45              


Gln Pro Phe Thr Trp Ala Asn Trp His Gln His Ile Asn Trp Leu Asn 
    50                  55                  60                  


Phe Ile Leu Val Ile Ala Leu Pro Leu Ser Ser Phe Ala Ala Ala Pro 
65                  70                  75                  80  


Phe Val Ser Phe Asn Trp Lys Thr Ala Ala Phe Ala Val Gly Tyr Tyr 
                85                  90                  95      


Met Cys Thr Gly Leu Gly Ile Thr Ala Gly Tyr His Arg Met Trp Ala 
            100                 105                 110         


His Arg Ala Tyr Lys Ala Ala Leu Pro Val Arg Ile Ile Leu Ala Leu 
        115                 120                 125             


Phe Gly Gly Gly Ala Val Glu Gly Ser Ile Arg Trp Trp Ala Ser Ser 
    130                 135                 140                 


His Arg Val His His Arg Trp Thr Asp Ser Asn Lys Asp Pro Tyr Asp 
145                 150                 155                 160 


Ala Arg Lys Gly Phe Trp Phe Ser His Phe Gly Trp Met Leu Leu Val 
                165                 170                 175     


Pro Asn Pro Lys Asn Lys Gly Arg Thr Asp Ile Ser Asp Leu Asn Asn 
            180                 185                 190         


Asp Trp Val Val Arg Leu Gln His Lys Tyr Tyr Val Tyr Val Leu Val 
        195                 200                 205             


Phe Met Ala Ile Val Leu Pro Thr Leu Val Cys Gly Phe Gly Trp Gly 
    210                 215                 220                 


Asp Trp Lys Gly Gly Leu Val Tyr Ala Gly Ile Met Arg Tyr Thr Phe 
225                 230                 235                 240 


Val Gln Gln Val Thr Phe Cys Val Asn Ser Leu Ala His Trp Ile Gly 
                245                 250                 255     


Glu Gln Pro Phe Asp Asp Arg Arg Thr Pro Arg Asp His Ala Leu Thr 
            260                 265                 270         


Ala Leu Val Thr Phe Gly Glu Gly Tyr His Asn Phe His His Glu Phe 
        275                 280                 285             


Pro Ser Asp Tyr Arg Asn Ala Leu Ile Trp Tyr Gln Tyr Asp Pro Thr 
    290                 295                 300                 


Lys Trp Leu Ile Trp Thr Leu Lys Gln Val Gly Leu Ala Trp Asp Leu 
305                 310                 315                 320 


Gln Thr Phe Ser Gln Asn Ala Ile Glu Gln Gly Leu Val Gln Gln Arg 
                325                 330                 335     


Gln Lys Lys Leu Asp Lys Trp Arg Asn Asn Leu Asn Trp Gly Ile Pro 
            340                 345                 350         


Ile Glu Gln Leu Pro Val Ile Glu Phe Glu Glu Phe Gln Glu Gln Ala 
        355                 360                 365             


Lys Thr Arg Asp Leu Val Leu Ile Ser Gly Ile Val His Asp Val Ser 
    370                 375                 380                 


Ala Phe Val Glu His His Pro Gly Gly Lys Ala Leu Ile Met Ser Ala 
385                 390                 395                 400 


Val Gly Lys Asp Gly Thr Ala Val Phe Asn Gly Gly Val Tyr Arg His 
                405                 410                 415     


Ser Asn Ala Gly His Asn Leu Leu Ala Thr Met Arg Val Ser Val Ile 
            420                 425                 430         


Arg Gly Gly Met Glu Val Glu Val Trp Lys Thr Ala Gln Asn Glu Lys 
        435                 440                 445             


Lys Asp Gln Asn Ile Val Ser Asp Glu Ser Gly Asn Arg Ile His Arg 
    450                 455                 460                 


Ala Gly Leu Gln Ala Thr Arg Val Glu Asn Pro Gly Met Ser Gly Met 
465                 470                 475                 480 


Ala Ala 
        


<210>  224
<211>  1101
<212>  DNA
<213>  Yarrowia lipolytica


<220>
<221>  CDS
<222>  (1)..(1101)
<223>  cholinephosphate cytidylyltransferase; GenBank Accession No. 
       XM_502978

<400>  224
atg gcc aaa agc aaa cga cgg tcg gag gct gtg gaa gag cac gtg acc         48
Met Ala Lys Ser Lys Arg Arg Ser Glu Ala Val Glu Glu His Val Thr           
1               5                   10                  15                

ggc tcg gac gag ggc ttg acc gat act tcg ggt cac gtg agc cct gcc         96
Gly Ser Asp Glu Gly Leu Thr Asp Thr Ser Gly His Val Ser Pro Ala           
            20                  25                  30                    

gcc aag aag cag aag aac tcg gag att cat ttc acc acc cag gct gcc        144
Ala Lys Lys Gln Lys Asn Ser Glu Ile His Phe Thr Thr Gln Ala Ala           
        35                  40                  45                        

cag cag ttg gat cgg gag cgc aag gag gag tat ctg gac tcg ctg atc        192
Gln Gln Leu Asp Arg Glu Arg Lys Glu Glu Tyr Leu Asp Ser Leu Ile           
    50                  55                  60                            

gac aac aag gac tat ctc aag tac cgt cct cga ggc tgg aag ctc aac        240
Asp Asn Lys Asp Tyr Leu Lys Tyr Arg Pro Arg Gly Trp Lys Leu Asn           
65                  70                  75                  80            

aac ccg cct acc gac cga cct gtg cga atc tac gcc gat gga gtg ttt        288
Asn Pro Pro Thr Asp Arg Pro Val Arg Ile Tyr Ala Asp Gly Val Phe           
                85                  90                  95                

gat ttg ttc cat ctg gga cac atg cgt cag ctg gag cag tcc aag aag        336
Asp Leu Phe His Leu Gly His Met Arg Gln Leu Glu Gln Ser Lys Lys           
            100                 105                 110                   

gcc ttc ccc aac gca gtg ttg att gtg ggc att ccc agc gac aag gag        384
Ala Phe Pro Asn Ala Val Leu Ile Val Gly Ile Pro Ser Asp Lys Glu           
        115                 120                 125                       

acc cac aag cgg aag gga ttg acc gtg ctg agt gac gtc cag cgg tac        432
Thr His Lys Arg Lys Gly Leu Thr Val Leu Ser Asp Val Gln Arg Tyr           
    130                 135                 140                           

gag acg gtg cga cac tgc aag tgg gtg gac gag gtg gtg gag gat gct        480
Glu Thr Val Arg His Cys Lys Trp Val Asp Glu Val Val Glu Asp Ala           
145                 150                 155                 160           

ccc tgg tgt gtc acc atg gac ttt ctg gaa aaa cac aaa atc gac tac        528
Pro Trp Cys Val Thr Met Asp Phe Leu Glu Lys His Lys Ile Asp Tyr           
                165                 170                 175               

gtg gcc cat gac gat ctg ccc tac gct tcc ggc aac gac gat gat atc        576
Val Ala His Asp Asp Leu Pro Tyr Ala Ser Gly Asn Asp Asp Asp Ile           
            180                 185                 190                   

tac aag ccc atc aag gag aag ggc atg ttt ctg gcc acc cag cga acc        624
Tyr Lys Pro Ile Lys Glu Lys Gly Met Phe Leu Ala Thr Gln Arg Thr           
        195                 200                 205                       

gag ggc att tcc acc tcg gac atc atc acc aag att atc cga gac tac        672
Glu Gly Ile Ser Thr Ser Asp Ile Ile Thr Lys Ile Ile Arg Asp Tyr           
    210                 215                 220                           

gac aag tat tta atg cga aac ttt gcc cgg ggt gct aac cga aag gat        720
Asp Lys Tyr Leu Met Arg Asn Phe Ala Arg Gly Ala Asn Arg Lys Asp           
225                 230                 235                 240           

ctc aac gtc tcg tgg ctc aag aag aac gag ctg gac ttc aag cgt cat        768
Leu Asn Val Ser Trp Leu Lys Lys Asn Glu Leu Asp Phe Lys Arg His           
                245                 250                 255               

gtg gcc gag ttc cga aac tcg ttc aag cga aag aag gtc ggt aag gat        816
Val Ala Glu Phe Arg Asn Ser Phe Lys Arg Lys Lys Val Gly Lys Asp           
            260                 265                 270                   

ctc tac ggc gag att cgc ggt ctg ctg cag aat gtg ctc att tgg aac        864
Leu Tyr Gly Glu Ile Arg Gly Leu Leu Gln Asn Val Leu Ile Trp Asn           
        275                 280                 285                       

ggc gac aac tcc ggc act tcc act ccc cag cga aag acg ctg cag acc        912
Gly Asp Asn Ser Gly Thr Ser Thr Pro Gln Arg Lys Thr Leu Gln Thr           
    290                 295                 300                           

aac gcc aag aag atg tac atg aac gtg ctc aag act ctg cag gct cct        960
Asn Ala Lys Lys Met Tyr Met Asn Val Leu Lys Thr Leu Gln Ala Pro           
305                 310                 315                 320           

gac gct gtt gac gtg gac tcc tcg gag aac gtg tct gag aac gtc act       1008
Asp Ala Val Asp Val Asp Ser Ser Glu Asn Val Ser Glu Asn Val Thr           
                325                 330                 335               

gat gag gag gag gaa gac gac gac gag gtt gat gag gac gaa gaa gcc       1056
Asp Glu Glu Glu Glu Asp Asp Asp Glu Val Asp Glu Asp Glu Glu Ala           
            340                 345                 350                   

gac gac gac gac gaa gac gac gaa gac gag gaa gac gac gag tag           1101
Asp Asp Asp Asp Glu Asp Asp Glu Asp Glu Glu Asp Asp Glu                   
        355                 360                 365                       


<210>  225
<211>  366
<212>  PRT
<213>  Yarrowia lipolytica

<400>  225

Met Ala Lys Ser Lys Arg Arg Ser Glu Ala Val Glu Glu His Val Thr 
1               5                   10                  15      


Gly Ser Asp Glu Gly Leu Thr Asp Thr Ser Gly His Val Ser Pro Ala 
            20                  25                  30          


Ala Lys Lys Gln Lys Asn Ser Glu Ile His Phe Thr Thr Gln Ala Ala 
        35                  40                  45              


Gln Gln Leu Asp Arg Glu Arg Lys Glu Glu Tyr Leu Asp Ser Leu Ile 
    50                  55                  60                  


Asp Asn Lys Asp Tyr Leu Lys Tyr Arg Pro Arg Gly Trp Lys Leu Asn 
65                  70                  75                  80  


Asn Pro Pro Thr Asp Arg Pro Val Arg Ile Tyr Ala Asp Gly Val Phe 
                85                  90                  95      


Asp Leu Phe His Leu Gly His Met Arg Gln Leu Glu Gln Ser Lys Lys 
            100                 105                 110         


Ala Phe Pro Asn Ala Val Leu Ile Val Gly Ile Pro Ser Asp Lys Glu 
        115                 120                 125             


Thr His Lys Arg Lys Gly Leu Thr Val Leu Ser Asp Val Gln Arg Tyr 
    130                 135                 140                 


Glu Thr Val Arg His Cys Lys Trp Val Asp Glu Val Val Glu Asp Ala 
145                 150                 155                 160 


Pro Trp Cys Val Thr Met Asp Phe Leu Glu Lys His Lys Ile Asp Tyr 
                165                 170                 175     


Val Ala His Asp Asp Leu Pro Tyr Ala Ser Gly Asn Asp Asp Asp Ile 
            180                 185                 190         


Tyr Lys Pro Ile Lys Glu Lys Gly Met Phe Leu Ala Thr Gln Arg Thr 
        195                 200                 205             


Glu Gly Ile Ser Thr Ser Asp Ile Ile Thr Lys Ile Ile Arg Asp Tyr 
    210                 215                 220                 


Asp Lys Tyr Leu Met Arg Asn Phe Ala Arg Gly Ala Asn Arg Lys Asp 
225                 230                 235                 240 


Leu Asn Val Ser Trp Leu Lys Lys Asn Glu Leu Asp Phe Lys Arg His 
                245                 250                 255     


Val Ala Glu Phe Arg Asn Ser Phe Lys Arg Lys Lys Val Gly Lys Asp 
            260                 265                 270         


Leu Tyr Gly Glu Ile Arg Gly Leu Leu Gln Asn Val Leu Ile Trp Asn 
        275                 280                 285             


Gly Asp Asn Ser Gly Thr Ser Thr Pro Gln Arg Lys Thr Leu Gln Thr 
    290                 295                 300                 


Asn Ala Lys Lys Met Tyr Met Asn Val Leu Lys Thr Leu Gln Ala Pro 
305                 310                 315                 320 


Asp Ala Val Asp Val Asp Ser Ser Glu Asn Val Ser Glu Asn Val Thr 
                325                 330                 335     


Asp Glu Glu Glu Glu Asp Asp Asp Glu Val Asp Glu Asp Glu Glu Ala 
            340                 345                 350         


Asp Asp Asp Asp Glu Asp Asp Glu Asp Glu Glu Asp Asp Glu 
        355                 360                 365     


<210>  226
<211>  13975
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pZKSL-5S5A5

<400>  226
cgatggttaa tgctgctgtg tgctgtgtgt gtgtgttgtt tggcgctcat tgttgcgtta       60

tgcagcgtac accacaatat tggaagctta ttagcctttc tattttttcg tttgcaaggc      120

ttaacaacat tgctgtggag agggatgggg atatggaggc cgctggaggg agtcggagag      180

gcgttttgga gcggcttggc ctggcgccca gctcgcgaaa cgcacctagg accctttggc      240

acgccgaaat gtgccacttt tcagtctagt aacgccttac ctacgtcatt ccatgcgtgc      300

atgtttgcgc cttttttccc ttgcccttga tcgccacaca gtacagtgca ctgtacagtg      360

gaggttttgg gggggtctta gatgggagct aaaagcggcc tagcggtaca ctagtgggat      420

tgtatggagt ggcatggagc ctaggtggag cctgacagga cgcacgaccg gctagcccgt      480

gacagacgat gggtggctcc tgttgtccac cgcgtacaaa tgtttgggcc aaagtcttgt      540

cagccttgct tgcgaaccta attcccaatt ttgtcacttc gcacccccat tgatcgagcc      600

ctaacccctg cccatcaggc aatccaatta agctcgcatt gtctgccttg tttagtttgg      660

ctcctgcccg tttcggcgtc cacttgcaca aacacaaaca agcattatat ataaggctcg      720

tctctccctc ccaaccacac tcactttttt gcccgtcttc ccttgctaac acaaaagtca      780

agaacacaaa caaccacccc aaccccctta cacacaagac atatctacag caatggccat      840

ggctctcagt cttaccacag aacagctgtt agaacgccct gatttggttg cgattgatgg      900

catcctctac gaccttgaag ggcttgccaa agttcatcca ggatccgatt tgattctcgc      960

ttctggtgcc tctgatgcct cccctctctt ttattcaatg catccatacg tcaaaccgga     1020

gaactccaaa ttgcttcaac agttcgtccg agggaagcat gaccgcacct cgaaggacat     1080

tgtctacacg tatgattctc ccttcgcaca agacgttaag cggacaatgc gcgaggtgat     1140

gaaagggagg aactggtacg caacccctgg cttctggctg cgcaccgttg ggatcatcgc     1200

cgtgacggcc ttttgcgagt ggcactgggc taccacgggg atggtgctgt ggggcctgtt     1260

gactggattc atgcacatgc agatcggctt atccatccag catgatgcgt cccacggggc     1320

catcagcaag aagccttggg tcaacgccct cttcgcctac ggcattgacg tcatcggatc     1380

gtcccggtgg atttggctgc agtcgcacat catgcggcac cacacctaca ccaaccagca     1440

cggcctcgac ctggatgcgg agtcggcaga gccgttcctg gtgttccaca actaccccgc     1500

cgcaaacacc gcccgaaagt ggttccaccg cttccaggct tggtacatgt accttgtgct     1560

gggggcatac ggggtatcgc tggtgtacaa cccgctctac attttccgga tgcagcacaa     1620

tgacaccatc ccagagtctg tcacggccat gcgggaaaat ggctttctgc ggcgctaccg     1680

cacacttgca ttcgtgatgc gagctttctt catcttccgg accgcattct tgccctggta     1740

cctcactggg acctcattgc tgatcaccat tcctctggtg cccaccgcaa ctggtgcctt     1800

cttgacgttc ttcttcattt tgtcccacaa ttttgatggc tccgaacgga tccccgacaa     1860

gaactgcaag gttaagcgat ctgagaagga cgttgaggct gaccaaattg actggtatcg     1920

ggcgcaggtg gagacgtcct ccacatacgg tggccccatc gccatgttct tcactggcgg     1980

tctcaatttc cagatcgagc accacctctt tccccggatg tcgtcttggc actacccctt     2040

cgtccagcag gcggtccggg agtgttgcga acgacatgga gtgcgatatg ttttctaccc     2100

taccatcgtc ggcaacatca tctccaccct gaagtacatg cataaggtgg gtgtcgtcca     2160

ctgcgtgaag gacgcacagg attcctaagc ggccgcattg atgattggaa acacacacat     2220

gggttatatc taggtgagag ttagttggac agttatatat taaatcagct atgccaacgg     2280

taacttcatt catgtcaacg aggaaccagt gactgcaagt aatatagaat ttgaccacct     2340

tgccattctc ttgcactcct ttactatatc tcatttattt cttatataca aatcacttct     2400

tcttcccagc atcgagctcg gaaacctcat gagcaataac atcgtggatc tcgtcaatag     2460

agggcttttt ggactccttg ctgttggcca ccttgtcctt gctgtctggc tcattctgtt     2520

tcaacgcctt ttaattaact ctcctgcagc ttcctcagcg agacactgct cctgtctgga     2580

cccgagctaa ggctctgttc gacaaacacg ttctgcgaat tggcgagtaa tttcattaat     2640

gcaaatagac gtgtatttga aaaggaggag atgatggtac cagtaattta cggtgtttga     2700

atagacaaat tatatatata aaattaacct gatgaatgag tgtatgaatg gattgcttaa     2760

tataccgagg gagagccggc attatcaata tttattgtcc taagaagcta aaatatgctg     2820

ttccgttgat agcgatgtct gtttgagagt caatggcaga acgcggagga gtattgttag     2880

ttggtgatcg gtttactcga catcgagtag ggggcgagac agaagatatc ccgaatccat     2940

tgcattgttt attaggatgt tcacaacaca actccctcta gactctgggg atgtgcgtta     3000

gaagaatgac ctggagcaag agtgttcaga ttgatccgtt caaattttca agattactgt     3060

tggggttgtt tttgaatcca cattagctgg acgactattg catctgagcg ctcggaacgt     3120

ctccctttcc cttcagcttt atacgagtcg attccataag cgcgacctct cgaattgcct     3180

tcggttgtga agccataggc aaaggtgtgg ctatggaatg catgcgacgt cgggcccaat     3240

tcgccctata gtgagtcgta ttacaattca ctggccgtcg ttttacaacg tcgtgactgg     3300

gaaaaccctg gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg     3360

cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc     3420

gaatggacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg     3480

tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc     3540

tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc     3600

gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta     3660

gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta     3720

atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg     3780

atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa     3840

aatttaacgc gaattttaac aaaatattaa cgcttacaat ttcctgatgc ggtattttct     3900

ccttacgcat ctgtgcggta tttcacaccg catcaggtgg cacttttcgg ggaaatgtgc     3960

gcggaacccc tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac     4020

aataaccctg ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt     4080

tccgtgtcgc ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag     4140

aaacgctggt gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg     4200

aactggatct caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa     4260

tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc     4320

aagagcaact cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag     4380

tcacagaaaa gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa     4440

ccatgagtga taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc     4500

taaccgcttt tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg     4560

agctgaatga agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa     4620

caacgttgcg caaactatta actggcgaac tacttactct agcttcccgg caacaattaa     4680

tagactggat ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg     4740

gctggtttat tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag     4800

cactggggcc agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg     4860

caactatgga tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt     4920

ggtaactgtc agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt     4980

aatttaaaag gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac     5040

gtgagttttc gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag     5100

atcctttttt tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg     5160

tggtttgttt gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca     5220

gagcgcagat accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga     5280

actctgtagc accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca     5340

gtggcgataa gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc     5400

agcggtcggg ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca     5460

ccgaactgag atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa     5520

aggcggacag gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc     5580

cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc     5640

gtcgattttt gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg     5700

cctttttacg gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat     5760

cccctgattc tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca     5820

gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca     5880

aaccgcctct ccccgcgcgt tggccgattc attaatgcag ctggcgcgcc gaggtttcca     5940

acgagataac atcgtggcag ctgccaccat gaccgatatg cctgaaataa gtgaattgac     6000

catcagaagt tctgcattca aatagaacta tatcatattc ggctcagttt tttcaataat     6060

agtccaatcc ctaagtctcc tatccaaaat ggttcctgac cagccacatc catgatcatg     6120

actgcgtgac aggaacagtc attccgtgga tgaacgactt tacgctcagg tactgtaaat     6180

atctgtaaag ggcagacaac caaccaattg agtaacctgt gagacttgaa acgtaagatg     6240

acttcacaca caaagtcact tgactcaacc gtggctctca attgcacaaa atcactctgc     6300

actaatctat tgcaggagtc aggctatgaa caactagacg acagctactt atgtgttata     6360

tagaggaata ttaaaaaatc taagaataat cataaagtta caaaataatt atcagatttc     6420

gagccacagg tcacccctaa catgtgttat tgcacaccca caatcctcag cttgatgtca     6480

tttaattctt ccagccacca tctctctctc caaccctaat ggcaaacttt attttggtgg     6540

agcgatgact cttactcaac tgcagcatac ttaagcacaa ttgttcccca gcctgatacg     6600

acacaccatc cattgtcaag cttcaccaca tacaacaaca cagcgtacgc aactaacatg     6660

aatgaatacg atatacatca aagactatga tacgcagtat tgcacactgt acgagtaaga     6720

gcactagcca ctgcactcaa gtgaaaccgt tgcccgggta cgagtatgag tatgtacagt     6780

atgtttagta ttgtacttgg acagtgcttg tatcgtacat tctcaagtgt caaacataaa     6840

tatccgttgc tatatcctcg caccaccacg tagctcgcta tatccctgtg ttgaatccat     6900

ccatcttgga ttgccaattg tgcacacaga accgggcact cacttcccca tccacacttg     6960

cggccgctta ggaatcctga gcgtccttga cacagtgaac cacaccgact ttgtgcatgt     7020

acttgagggt ggaaatgatg ttgcccacaa tggtagggta gaagacgtac cgaactccgt     7080

gtcgttcgca acactctcgg acagcttgct gcacgaaggg atagtgccaa gacgacattc     7140

gaggaaagag gtgatgctcg atctggaagt tgagaccgcc agtaaagaac atggcaatgg     7200

gtccaccgta ggtggaagag gtctccacct gagctctgta ccagtcgatc tgatcggctt     7260

caacgtcctt ctcggagctc ttgaccttgc agttcttgtc ggggattcgc tccgagccat     7320

cgaagttgtg agacaagatg aaaaagaagg tgaggaaggc accggtagca gtgggcacca     7380

gaggaatggt gatgagcagg gaggttccag tgagatacca gggcaagaag gcggttcgaa     7440

agatgaagaa agctcgcata acgaatgcaa gggttcggta ccgtcgcaga aagccgttct     7500

ctcgcatggc tgtgacagac tcgggaatgg tgtcgttgtg ctgcattcgg aagatgtaga     7560

gagggttgta caccagcgaa acgccgtagg ctccaagcac gaggtacatg taccaggcct     7620

ggaatcggtg aaaccacttt cgagcagtgt tggcagcagg gtagttgtgg aacacaagga     7680

atggttctgc ggactcggca tccaggtcga gaccatgctg attggtgtag gtgtgatgtc     7740

gcatgatgtg agactgcagc cagatccatc tggacgatcc aatgacgtcg atgccgtagg     7800

caaagagagc gttgacccag ggctttttgc tgatggcacc atgagaggca tcgtgctgaa     7860

tggacaggcc gatctgcatg tgcatgaatc cagtcaagag accccacagc accattccgg     7920

tagtagccca gtgccactcg caaaaggcgg tgacagcaat gatgccaacg gttcgcagcc     7980

agaatccagg tgtggcatac cagttccgac ctttcatgac ctctcgcata gttcgcttga     8040

cgtcctgtgc aaagggagag tcgtaggtgt agacaatgtc cttggaggtt cggtcgtgct     8100

tgcctcgcac gaactgttga agcagcttcg agttctcggg cttgacgtaa gggtgcatgg     8160

agtagaacag aggagaagca tcggaggcac cagaagcgag gatcaagtcg gatccgggat     8220

ggaccttggc aagaccttcc agatcgtaga gaatgccgtc gatggcaacc aggtcgggtc     8280

gctcgagcag ctgctcggta gtaagggaga gagccatgga gagctgggtt agtttgtgta     8340

gagagtgtgt gttgctagcg actttcggat tgtgtcatta cacaaaacgc gtcgtctcga     8400

cactgatctt gtcgtggata ctcacggctc ggacatcgtc gccgacgatg acaccggact     8460

ttcgcttaag gacgtcagta acaggcattg tgtgatgtgt agtttagatt tcgaatctgt     8520

ggggaaagaa aggaaaaaag agactggcaa ccgattggga gagccactgt ttatatatac     8580

cctagacaag ccccccgctt gtaagatgtt ggtcaatgta aaccagtatt aaggttggca     8640

agtgcaggag aagcaaggtg tgggtaccga gcaatggaaa tgtgcggaag gcaaaaaaat     8700

gaggccacgg cctattgtcg gggctatatc cagggggcga ttgaagtaca ctaacatgac     8760

atgtgtccac agaccctcaa tctggcctga tgagccaaat ccatacgcgc tttcgcagct     8820

ctaaaggcta taacaagtca caccaccctg ctcgacctca gcgccctcac tttttgttaa     8880

gacaaactgt acacgctgtt ccagcgtttt ctgcctgcac ctggtgggac atttggtgca     8940

acctaaagtg ctcggaacct ctgtggtgtc cagatcagcg cagcagttcc gaggtagttt     9000

tgaggccctt agatgatgca atggtgtcag tcgctggatc acgagtctta atggcagtat     9060

tcgttcttat ttgtgccatt gagccccgtt atcctcgtat cttctacccc ccatcccatc     9120

cctttgttgg tgcaacccta cccatttatt gttgggtgca gcccaaccga cgtggagagc     9180

ttggcttggc catataaaaa ggcccccccc tagtggcaat ggcagaaagt cagctgtgag     9240

ttgttgaatt tgtcatctag gcggcctggc cgtcttctcc ggggcaattg gggctgtttt     9300

ttgggacaca aatacgccgc caacccggtc tctcctgaat tctgcagatg ggctgcagga     9360

attccgtcgt cgcctgagtc gactccaact tttcacactg agcgtaaaat gtggagaaga     9420

aatcggcact aaaaagtcag gtagactgga aaatgcgcca tgaaatgaat atctcttgct     9480

acagtaatgc ccagcatcga ggggtattgt gtcaccaaca ctatagtggc agctgaagcg     9540

ctcgtgattg tagtatgagt ctttattggt gatgggaaga gttcactcaa tattctcgtt     9600

actgccaaaa caccacggta atcggccaga caccatggat gtagatcacc aagcctgtga     9660

atgttattcg agctaaaatg cacatggttg gtgaaaggag tagttgctgt cgaattccgt     9720

cgtcgcctga gtcatcattt atttaccagt tggccacaaa cccttgacga tctcgtatgt     9780

cccctccgac atactcccgg ccggctgggg tacgttcgat agcgctatcg gcatcgacaa     9840

ggtttgggtc cctagccgat accgcactac ctgagtcaca atcttcggag gtttagtctt     9900

ccacatagca cgggcaaaag tgcgtatata tacaagagcg tttgccagcc acagattttc     9960

actccacaca ccacatcaca catacaacca cacacatcca caatggaacc cgaaactaag    10020

aagaccaaga ctgactccaa gaagattgtt cttctcggcg gcgacttctg tggccccgag    10080

gtgattgccg aggccgtcaa ggtgctcaag tctgttgctg aggcctccgg caccgagttt    10140

gtgtttgagg accgactcat tggaggagct gccattgaga aggagggcga gcccatcacc    10200

gacgctactc tcgacatctg ccgaaaggct gactctatta tgctcggtgc tgtcggaggc    10260

gctgccaaca ccgtatggac cactcccgac ggacgaaccg acgtgcgacc cgagcagggt    10320

ctcctcaagc tgcgaaagga cctgaacctg tacgccaacc tgcgaccctg ccagctgctg    10380

tcgcccaagc tcgccgatct ctcccccatc cgaaacgttg agggcaccga cttcatcatt    10440

gtccgagagc tcgtcggagg tatctacttt ggagagcgaa aggaggatga cggatctggc    10500

gtcgcttccg acaccgagac ctactccgtt cctgaggttg agcgaattgc ccgaatggcc    10560

gccttcctgg cccttcagca caacccccct cttcccgtgt ggtctcttga caaggccaac    10620

gtgctggcct cctctcgact ttggcgaaag actgtcactc gagtcctcaa ggacgaattc    10680

ccccagctcg agctcaacca ccagctgatc gactcggccg ccatgatcct catcaagcag    10740

ccctccaaga tgaatggtat catcatcacc accaacatgt ttggcgatat catctccgac    10800

gaggcctccg tcatccccgg ttctctgggt ctgctgccct ccgcctctct ggcttctctg    10860

cccgacacca acgaggcgtt cggtctgtac gagccctgtc acggatctgc ccccgatctc    10920

ggcaagcaga aggtcaaccc cattgccacc attctgtctg ccgccatgat gctcaagttc    10980

tctcttaaca tgaagcccgc cggtgacgct gttgaggctg ccgtcaagga gtccgtcgag    11040

gctggtatca ctaccgccga tatcggaggc tcttcctcca cctccgaggt cggagacttg    11100

ttgccaacaa ggtcaaggag ctgctcaaga aggagtaagt cgtttctacg acgcattgat    11160

ggaaggagca aactgacgcg cctgcgggtt ggtctaccgg cagggtccgc tagtgtataa    11220

gactctataa aaagggccct gccctgctaa tgaaatgatg atttataatt taccggtgta    11280

gcaaccttga ctagaagaag cagattgggt gtgtttgtag tggaggacag tggtacgttt    11340

tggaaacagt cttcttgaaa gtgtcttgtc tacagtatat tcactcataa cctcaatagc    11400

caagggtgta gtcggtttat taaaggaagg gagttgtggc tgatgtggat agatatcttt    11460

aagctggcga ctgcacccaa cgagtgtggt ggtagcttgt ttaaacagag tgtgaaagac    11520

tcactatggt ccgggcttat ctcgaccaat agccaaagtc tggagtttct gagagaaaaa    11580

ggcaagatac gtatgtaaca aagcgacgca tggtacaata ataccggagg catgtatcat    11640

agagagttag tggttcgatg atggcactgg tgcctggtat gactttatac ggctgactac    11700

atatttgtcc tcagacatac aattacagtc aagcacttac ccttggacat ctgtaggtac    11760

cccccggcca agacgatctc agcgtgtcgt atgtcggatt ggcgtagctc cctcgctcgt    11820

caattggctc ccatctactt tcttctgctt ggctacaccc agcatgtctg ctatggctcg    11880

ttttcgtgcc ttatctatcc tcccagtatt accaactcta aatgacatga tgtgattggg    11940

tctacacttt catatcagag ataaggagta gcacagttgc ataaaaagcc caactctaat    12000

cagcttcttc ctttcttgta attagtacaa aggtgattag cgaaatctgg aagcttagtt    12060

ggccctaaaa aaatcaaaaa aagcaaaaaa cgaaaaacga aaaaccacag ttttgagaac    12120

agggaggtaa cgaaggatcg tatatatata tatatatata tatacccacg gatcccgaga    12180

ccggcctttg attcttccct acaaccaacc attctcacca ccctaattca caaccatggc    12240

caccatctcc ctgactaccg agcagctcct ggaacacccc gagctcgttg ccatcgacgg    12300

agtcctgtac gatctcttcg gtctggccaa ggtccatgcc ggaggcaacc tcatcgaagc    12360

tgccggtgca tccgacggaa ccgctctgtt ctactccatg catcctggag tcaagccaga    12420

gaactcgaag cttctgcagc aatttgcccg aggcaagcac gaacgaagct ccaaggatcc    12480

cgtgtacacc ttcgactctc cctttgctca ggacgtcaag cagtccgttc gagaggtcat    12540

gaagggtcga aactggtacg ccactcctgg cttctggctg agaaccgcac tcatcatcgc    12600

ttgtactgcc attggcgagt ggtactggat cacaaccgga gcagtgatgt ggggtatctt    12660

tactggatac ttccactcgc agattggctt ggccattcaa cacgatgctt ctcacggagc    12720

catcagcaaa aagccctggg tcaacgcctt tttcgcttat ggcatcgacg ccattggttc    12780

ctctcgttgg atctggctgc agtcccacat tatgcgacat cacacttaca ccaaccagca    12840

tggcctcgac ctggatgctg cctcggcaga gccgttcatc ttgttccact cctatcctgc    12900

taccaacgcc tctcgaaagt ggtaccaccg atttcaggcg tggtacatgt acatcgttct    12960

gggaatgtat ggtgtctcga tggtgtacaa tcccatgtac ctcttcacaa tgcagcacaa    13020

cgacaccatt cccgaggcca cttctctcag accaggcagc tttttcaatc ggcagcgagc    13080

tttcgccgtt tcccttcgac tgctcttcat cttccgaaac gcctttcttc cctggtacat    13140

tgctggtgcc tctcctctgc tcaccattct tctggtgccc acggtcacag gcatcttcct    13200

cacctttgtg ttcgttctgt cccataactt cgagggagcc gaacggaccc cagagaagaa    13260

ctgcaaggcc aaacgagcta aggaaggcaa ggaggtcaga gacgtggaag aggatcgagt    13320

cgactggtac cgagcacagg ccgagactgc tgccacctac ggtggcagcg tgggaatgat    13380

gcttacaggc ggtctcaacc tgcagatcga gcatcacttg tttccccgaa tgtcctcttg    13440

gcactatccc ttcattcaag acaccgttcg ggagtgttgc aagcgacatg gcgtccgtta    13500

cacatactat cctaccattc tcgagaacat catgtccact cttcgataca tgcagaaggt    13560

gggtgttgct cacaccattc aggatgccca ggagttctaa gcggccgcat gtacatacaa    13620

gattatttat agaaatgaat cgcgatcgaa caaagagtac gagtgtacga gtaggggatg    13680

atgataaaag tggaagaagt tccgcatctt tggatttatc aacgtgtagg acgatacttc    13740

ctgtaaaaat gcaatgtctt taccataggt tctgctgtag atgttattaa ctaccattaa    13800

catgtctact tgtacagttg cagaccagtt ggagtataga atggtacact taccaaaaag    13860

tgttgatggt tgtaactacg atatataaaa ctgttgacgg gatccccgct gatatgccta    13920

aggaacaatc aaagaggaag atattaattc agaatgctag tatacagtta gggat         13975


<210>  227
<211>  14619
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pZP2-85m98F

<400>  227
cgatggaagc cggtagaacc gggctgcttg tgcttggaga tggaagccgg tagaaccggg       60

ctgcttgggg ggatttgggg ccgctgggct ccaaagaggg gtaggcattt cgttggggtt      120

acgtaattgc ggcatttggg tcctgcgcgc atgtcccatt ggtcagaatt agtccggata      180

ggagacttat cagccaatca cagcgccgga tccacctgta ggttgggttg ggtgggagca      240

cccctccaca gagtagagtc aaacagcagc agcaacatga tagttggggg tgtgcgtgtt      300

aaaggaaaaa aaagaagctt gggttatatt cccgctctat ttagaggttg cgggatagac      360

gccgacggag ggcaatggcg ctatggaacc ttgcggatat ccatacgccg cggcggactg      420

cgtccgaacc agctccagca gcgttttttc cgggccattg agccgactgc gaccccgcca      480

acgtgtcttg gcccacgcac tcatgtcatg ttggtgttgg gaggccactt tttaagtagc      540

acaaggcacc tagctcgcag caaggtgtcc gaaccaaaga agcggctgca gtggtgcaaa      600

cggggcggaa acggcgggaa aaagccacgg gggcacgaat tgaggcacgc cctcgaattt      660

gagacgagtc acggccccat tcgcccgcgc aatggctcgc caacgcccgg tcttttgcac      720

cacatcaggt taccccaagc caaacctttg tgttaaaaag cttaacatat tataccgaac      780

gtaggtttgg gcgggcttgc tccgtctgtc caaggcaaca tttatataag ggtctgcatc      840

gccggctcaa ttgaatcttt tttcttcttc tcttctctat attcattctt gaattaaaca      900

cacatcaacc atggtcaagc gacccgctct gcctctcacc gtggacggtg tcacctacga      960

cgtttctgcc tggctcaacc accatcccgg aggtgccgac attatcgaga actaccgagg     1020

tcgggatgct accgacgtct tcatggttat gcactccgag aacgccgtgt ccaaactcag     1080

acgaatgccc atcatggaac cttcctctcc cctgactcca acacctccca agccaaactc     1140

cgacgaacct caggaggatt tccgaaagct gcgagacgag ctcattgctg caggcatgtt     1200

cgatgcctct cccatgtggt acgcttacaa gaccctgtcg actctcggac tgggtgtcct     1260

tgccgtgctg ttgatgaccc agtggcactg gtacctggtt ggtgctatcg tcctcggcat     1320

tcactttcaa cagatgggat ggctctcgca cgacatttgc catcaccagc tgttcaagga     1380

ccgatccatc aacaatgcca ttggcctgct cttcggaaac gtgcttcagg gcttttctgt     1440

cacttggtgg aaggaccgac acaacgctca tcactccgcc accaacgtgc agggtcacga     1500

tcccgacatc gacaacctgc ctctcctggc gtggtccaag gaggacgtcg agcgagctgg     1560

cccgttttct cgacggatga tcaagtacca acagtattac ttctttttca tctgtgccct     1620

tctgcgattc atctggtgct ttcagtccat tcatactgcc acgggtctca aggatcgaag     1680

caatcagtac tatcgaagac agtacgagaa ggagtccgtc ggtctggcac tccactgggg     1740

tctcaaggcc ttgttctact atttctacat gccctcgttt ctcaccggac tcatggtgtt     1800

ctttgtctcc gagctgcttg gtggcttcgg aattgccatc gttgtcttca tgaaccacta     1860

ccctctggag aagattcagg actccgtgtg ggatggtcat ggcttctgtg ctggacagat     1920

tcacgagacc atgaacgttc agcgaggcct cgtcacagac tggtttttcg gtggcctcaa     1980

ctaccagatc gaacatcacc tgtggcctac tcttcccaga cacaacctca ccgctgcctc     2040

catcaaagtg gagcagctgt gcaagaagca caacctgccc taccgatctc ctcccatgct     2100

cgaaggtgtc ggcattctta tctcctacct gggcaccttc gctcgaatgg ttgccaaggc     2160

agacaaggcc taagcggccg cattgatgat tggaaacaca cacatgggtt atatctaggt     2220

gagagttagt tggacagtta tatattaaat cagctatgcc aacggtaact tcattcatgt     2280

caacgaggaa ccagtgactg caagtaatat agaatttgac caccttgcca ttctcttgca     2340

ctcctttact atatctcatt tatttcttat atacaaatca cttcttcttc ccagcatcga     2400

gctcggaaac ctcatgagca ataacatcgt ggatctcgtc aatagagggc tttttggact     2460

ccttgctgtt ggccaccttg tccttgctgt ttaaacatcg tggttaatgc tgctgtgtgc     2520

tgtgtgtgtg tgttgtttgg cgctcattgt tgcgttatgc agcgtacacc acaatattgg     2580

aagcttatta gcctttctat tttttcgttt gcaaggctta acaacattgc tgtggagagg     2640

gatggggata tggaggccgc tggagggagt cggagaggcg ttttggagcg gcttggcctg     2700

gcgcccagct cgcgaaacgc acctaggacc ctttggcacg ccgaaatgtg ccacttttca     2760

gtctagtaac gccttaccta cgtcattcca tgcgtgcatg tttgcgcctt ttttcccttg     2820

cccttgatcg ccacacagta cagtgcactg tacagtggag gttttggggg ggtcttagat     2880

gggagctaaa agcggcctag cggtacacta gtgggattgt atggagtggc atggagccta     2940

ggtggagcct gacaggacgc acgaccggct agcccgtgac agacgatggg tggctcctgt     3000

tgtccaccgc gtacaaatgt ttgggccaaa gtcttgtcag ccttgcttgc gaacctaatt     3060

cccaattttg tcacttcgca cccccattga tcgagcccta acccctgccc atcaggcaat     3120

ccaattaagc tcgcattgtc tgccttgttt agtttggctc ctgcccgttt cggcgtccac     3180

ttgcacaaac acaaacaagc attatatata aggctcgtct ctccctccca accacactca     3240

cttttttgcc cgtcttccct tgctaacaca aaagtcaaga acacaaacaa ccaccccaac     3300

ccccttacac acaagacata tctacagcaa tggccatggc tctctccctt actaccgagc     3360

agctgctcga gcgacccgac ctggttgcca tcgacggcat tctctacgat ctggaaggtc     3420

ttgccaaggt ccatcccgga tccgacttga tcctcgcttc tggtgcctcc gatgcttctc     3480

ctctgttcta ctccatgcac ccttacgtca agcccgagaa ctcgaagctg cttcaacagt     3540

tcgtgcgagg caagcacgac cgaacctcca aggacattgt ctacacctac gactctccct     3600

ttgcacagga cgtcaagcga actatgcgag aggtcatgaa aggtcggaac tggtatgcca     3660

cacctggatt ctggctgcga accgttggca tcattgctgt caccgccttt tgcgagtggc     3720

actgggctac taccggaatg gtgctgtggg gtctcttgac tggattcatg cacatgcaga     3780

tcggcctgtc cattcagcac gatgcctctc atggtgccat cagcaaaaag ccctgggtca     3840

acgctctctt tgcctacggc atcgacgtca ttggatcgtc cagatggatc tggctgcagt     3900

ctcacatcat gcgacatcac acctacacca atcagcatgg tctcgacctg gatgccgagt     3960

ccgcagaacc attccttgtg ttccacaact accctgctgc caacactgct cgaaagtggt     4020

ttcaccgatt ccaggcctgg tacatgtacc tcgtgcttgg agcctacggc gtttcgctgg     4080

tgtacaaccc tctctacatc ttccgaatgc agcacaacga caccattccc gagtctgtca     4140

cagccatgcg agagaacggc tttctgcgac ggtaccgaac ccttgcattc gttatgcgag     4200

ctttcttcat ctttcgaacc gccttcttgc cctggtatct cactggaacc tccctgctca     4260

tcaccattcc tctggtgccc actgctaccg gtgccttcct caccttcttt ttcatcttgt     4320

ctcacaactt cgatggctcg gagcgaatcc ccgacaagaa ctgcaaggtc aagagctccg     4380

agaaggacgt tgaagccgat cagatcgact ggtacagagc tcaggtggag acctcttcca     4440

cctacggtgg acccattgcc atgttcttta ctggcggtct caacttccag atcgagcatc     4500

acctctttcc tcgaatgtcg tcttggcact atcccttcgt gcagcaagct gtccgagagt     4560

gttgcgaacg acacggagtt cggtacgtct tctaccctac cattgtgggc aacatcattt     4620

ccaccctcaa gtacatgcac aaagtcggtg tggttcactg tgtcaaggac gctcaggatt     4680

cctaagcggc cgcatgagaa gataaatata taaatacatt gagatattaa atgcgctaga     4740

ttagagagcc tcatactgct cggagagaag ccaagacgag tactcaaagg ggattacacc     4800

atccatatcc acagacacaa gctggggaaa ggttctatat acactttccg gaataccgta     4860

gtttccgatg ttatcaatgg gggcagccag gatttcaggc acttcggtgt ctcggggtga     4920

aatggcgttc ttggcctcca tcaagtcgta ccatgtcttc atttgcctgt caaagtaaaa     4980

cagaagcaga tgaagaatga acttgaagtg aaggaattta aatgtaacga aactgaaatt     5040

tgaccagata ttgtgtccgc ggtggagctc cagcttttgt tccctttagt gagggttaat     5100

ttcgagcttg gcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt atccgctcac     5160

aagcttccac acaacgtacg ggcgtcgttg cttgtgtgat ttttgaggac ccatcccttt     5220

ggtatataag tatactctgg ggttaaggtt gcccgtgtag tctaggttat agttttcatg     5280

tgaaataccg agagccgagg gagaataaac gggggtattt ggacttgttt ttttcgcgga     5340

aaagcgtcga atcaaccctg cgggccttgc accatgtcca cgacgtgttt ctcgccccaa     5400

ttcgcccctt gcacgtcaaa attaggcctc catctagacc cctccataac atgtgactgt     5460

ggggaaaagt ataagggaaa ccatgcaacc atagacgacg tgaaagacgg ggaggaacca     5520

atggaggcca aagaaatggg gtagcaacag tccaggagac agacaaggag acaaggagag     5580

ggcgcccgaa agatcggaaa aacaaacatg tccaattggg gcagtgacgg aaacgacacg     5640

gacacttcag tacaatggac cgaccatctc caagccaggg ttattccggt atcaccttgg     5700

ccgtaacctc ccgctggtac ctgatattgt acacgttcac attcaatata ctttcagcta     5760

caataagaga ggctgtttgt cgggcatgtg tgtccgtcgt atggggtgat gtccgagggc     5820

gaaattcgct acaagcttaa ctctggcgct tgtccagtat gaatagacaa gtcaagacca     5880

gtggtgccat gattgacagg gaggtacaag acttcgatac tcgagcatta ctcggacttg     5940

tggcgattga acagacgggc gatcgcttct cccccgtatt gccggcgcgc cagctgcatt     6000

aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct     6060

cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa     6120

aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa     6180

aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc     6240

tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga     6300

caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc     6360

cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt     6420

ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct     6480

gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg     6540

agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta     6600

gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct     6660

acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa     6720

gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt     6780

gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta     6840

cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat     6900

caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa     6960

gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct     7020

cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta     7080

cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct     7140

caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg     7200

gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa     7260

gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt     7320

cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta     7380

catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca     7440

gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta     7500

ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct     7560

gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg     7620

cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac     7680

tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact     7740

gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa     7800

atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt     7860

ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat     7920

gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg     7980

atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gaaattgtaa     8040

gcgttaatat tttgttaaaa ttcgcgttaa atttttgtta aatcagctca ttttttaacc     8100

aataggccga aatcggcaaa atcccttata aatcaaaaga atagaccgag atagggttga     8160

gtgttgttcc agtttggaac aagagtccac tattaaagaa cgtggactcc aacgtcaaag     8220

ggcgaaaaac cgtctatcag ggcgatggcc cactacgtga accatcaccc taatcaagtt     8280

ttttggggtc gaggtgccgt aaagcactaa atcggaaccc taaagggagc ccccgattta     8340

gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga agggaagaaa gcgaaaggag     8400

cgggcgctag ggcgctggca agtgtagcgg tcacgctgcg cgtaaccacc acacccgccg     8460

cgcttaatgc gccgctacag ggcgcgtcca ttcgccattc aggctgcgca actgttggga     8520

agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc     8580

aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc     8640

cagtgaattg taatacgact cactataggg cgaattgggc ccgacgtcgc atgcgctgat     8700

gacactttgg tctgaaagag atgcattttg aatcccaaac ttgcagtgcc caagtgacat     8760

acatctccgc gttttggaaa atgttcagaa acagttgatt gtgttggaat ggggaatggg     8820

gaatggaaaa atgactcaag tatcaattcc aaaaacttct ctggctggca gtacctactg     8880

tccatactac tgcattttct ccagtcaggc cactctatac tcgacgacac agtagtaaaa     8940

cccagataat ttcgacataa acaagaaaac agacccaata atatttatat atagtcagcc     9000

gtttgtccag ttcagactgt aatagccgaa aaaaaatcca aagtttctat tctaggaaaa     9060

tatattccaa tatttttaat tcttaatctc atttatttta ttctagcgaa atacatttca     9120

gctacttgag acatgtgata cccacaaatc ggattcggac tcggttgttc agaagagcat     9180

atggcattcg tgctcgcttg ttcacgtatt cttcctgttc catctcttgg ccgacaatca     9240

cacaaaaatg gggttttttt tttaattcta atgattcatt acagcaaaat tgagatatag     9300

cagaccacgt attccataat caccaaggaa gttcttgggc gtcttaatta actcacctgc     9360

aggattgaga ctatgaatgg attcccgtgc ccgtattact ctactaattt gatcttggaa     9420

cgcgaaaata cgtttctagg actccaaaga atctcaactc ttgtccttac taaatatact     9480

acccatagtt gatggtttac ttgaacagag aggacatgtt cacttgaccc aaagtttctc     9540

gcatctcttg gatatttgaa caacggcgtc cactgaccgt cagttatcca gtcacaaaac     9600

ccccacattc atacattccc atgtacgttt acaaagttct caattccatc gtgcaaatca     9660

aaatcacatc tattcattca tcatatataa acccatcatg tctactaaca ctcacaactc     9720

catagaaaac atcgactcag aacacacgct ccatgcggcc gcttactgag ccttggcacc     9780

gggctgcttc tcggccattc gagcgaactg ggacaggtat cggagcagga tgacgagacc     9840

ttcatggggc agagggtttc ggtaggggag gttgtgcttc tggcacagct gttccacctg     9900

gtaggaaacg gcagtgaggt tgtgtcgagg cagggtgggc cagagatggt gctcgatctg     9960

gtagttcagg cctccaaaga accagtcagt aatgatgcct cgtcgaatgt tcatggtctc    10020

atggatctga cccacagaga agccatgtcc gtcccagacg gaatcaccga tcttctccag    10080

agggtagtgg ttcatgaaga ccacgatggc aattccgaag ccaccgacga gctcggaaac    10140

aaagaacacc agcatcgagg tcaggatgga gggcataaag aagaggtgga acagggtctt    10200

gagagtccag tgcagagcga gtccaatggc ctctttcttg tactgagatc ggtagaactg    10260

gttgtctcgg tccttgaggg atcgaacggt cagcacagac tggaaacacc agatgaatcg    10320

caggagaata cagatgacca ggaaatagta ctgttggaac tgaatgagct ttcgggagat    10380

gggagaagct cgagtgacat cgtcctcgga ccaggcgagc agaggcaggt tatcaatgtc    10440

gggatcgtga ccctgaacgt tggtagcaga atgatgggcg ttgtgtctgt ccttccacca    10500

ggtcacggag aagccctgga gtccgttgcc aaagaccaga cccaggacgt tattccagtt    10560

tcggttcttg aaggtctggt ggtggcagat gtcatgagac agccatccca tttgctggta    10620

gtgcataccg agcacgagag caccaatgaa gtacaggtgg tactggacca gcatgaagaa    10680

ggcaagcacg ccaagaccca gggtggtcaa gatcttgtac gagtaccaga ggggagaggc    10740

gtcaaacatg ccagtggcga tcagctcttc tcggagcttt cggaaatcct cctgagcttc    10800

gttgacggca gcctggggag gcagctcgga agcctggttg atcttgggca ttcgcttgag    10860

cttgtcgaag gcttcctgag agtgcataac catgaaggcg tcagtagcat ctcgtccctg    10920

gtagttctca atgatttcag ctccaccagg gtggaagttc acccaagcgg agacgtcgta    10980

cacctttccg tcgatgacga ggggcagagc ctgtcgagaa gccttcacgg atcccatgac    11040

ggccagagag tcgtagtagg tagcgggagg aagtccggca ggtcgagcgg gaccggcgcc    11100

ctgaatcttt ttggctccct tgtgctttcg gacgatgtag gtctgcacgt agaagttgag    11160

gaacagacac aggacagtac caacgtagaa gtagttgaaa aaccagccaa acattctcat    11220

tccatcttgt cggtagcagg gaatgttccg gtacttccag acgatgtaga agccaacgtt    11280

gaactgaatg atctgcatag aagtaatcag ggacttgggc atagggaact tgagcttgat    11340

cagtcgggtc caatagtagc cgtacatgat ccagtgaatg aagccgttga gcagcacaaa    11400

gatccaaacg gcttcgtttc ggtagttgta gaacagccac atgtccatag gagctccgag    11460

atggtgaaag aactgcaacc aggtcagagg cttgcccatg aggggcagat agaaggagtc    11520

aatgtactcg aggaacttgc tgaggtagaa cagctgagtg gtgattcgga agacattgtt    11580

gtcgaaagcc ttctcgcagt tgtcggacat gacaccaatg gtgtacatgg cgtaggccat    11640

agagaggaag gagcccagcg agtagatgga catgagcagg ttgtagttgg tgaacacaaa    11700

cttcattcga gactgaccct tgggtccgag aggaccaagg gtgaacttca ggatgacgaa    11760

ggcgatggag aggtacagca cctcgcagtg cgaggcatca gaccagagct gagcatagtc    11820

gaccttggga agaacctcct ggccaatgga gacgatttcg ttcacgacct ccatggttgt    11880

gaattagggt ggtgagaatg gttggttgta gggaagaatc aaaggccggt ctcgggatcc    11940

gtgggtatat atatatatat atatatatac gatccttcgt tacctccctg ttctcaaaac    12000

tgtggttttt cgtttttcgt tttttgcttt ttttgatttt tttagggcca actaagcttc    12060

cagatttcgc taatcacctt tgtactaatt acaagaaagg aagaagctga ttagagttgg    12120

gctttttatg caactgtgct actccttatc tctgatatga aagtgtagac ccaatcacat    12180

catgtcattt agagttggta atactgggag gatagataag gcacgaaaac gagccatagc    12240

agacatgctg ggtgtagcca agcagaagaa agtagatggg agccaattga cgagcgaggg    12300

agctacgcca atccgacata cgacacgctg agatcgtctt ggccgggggg tacctacaga    12360

tgtccaaggg taagtgcttg actgtaattg tatgtctgag gacaaatatg tagtcagccg    12420

tataaagtca taccaggcac cagtgccatc atcgaaccac taactctcta tgatacatgc    12480

ctccggtatt attgtaccat gcgtcgcttt gttacatacg tatcttgcct ttttctctca    12540

gaaactccag aattctctct cttgagcttt tccataacaa gttcttctgc ctccaggaag    12600

tccatgggtg gtttgatcat ggttttggtg tagtggtagt gcagtggtgg tattgtgact    12660

ggggatgtag ttgagaataa gtcatacaca agtcagcttt cttcgagcct catataagta    12720

taagtagttc aacgtattag cactgtaccc agcatctccg tatcgagaaa cacaacaaca    12780

tgccccattg gacagatcat gcggatacac aggttgtgca gtatcataca tactcgatca    12840

gacaggtcgt ctgaccatca tacaagctga acaagcgctc catacttgca cgctctctat    12900

atacacagtt aaattacata tccatagtct aacctctaac agttaatctt ctggtaagcc    12960

tcccagccag ccttctggta tcgcttggcc tcctcaatag gatctcggtt ctggccgtac    13020

agacctcggc cgacaattat gatatccgtt ccggtagaca tgacatcctc aacagttcgg    13080

tactgctgtc cgagagcgtc tcccttgtcg tcaagaccca ccccgggggt cagaataagc    13140

cagtcctcag agtcgccctt aggtcggttc tgggcaatga agccaaccac aaactcgggg    13200

tcggatcggg caagctcaat ggtctgcttg gagtactcgc cagtggccag agagcccttg    13260

caagacagct cggccagcat gagcagacct ctggccagct tctcgttggg agaggggact    13320

aggaactcct tgtactggga gttctcgtag tcagagacgt cctccttctt ctgttcagag    13380

acagtttcct cggcaccagc tcgcaggcca gcaatgattc cggttccggg tacaccgtgg    13440

gcgttggtga tatcggacca ctcggcgatt cggtgacacc ggtactggtg cttgacagtg    13500

ttgccaatat ctgcgaactt tctgtcctcg aacaggaaga aaccgtgctt aagagcaagt    13560

tccttgaggg ggagcacagt gccggcgtag gtgaagtcgt caatgatgtc gatatgggtt    13620

ttgatcatgc acacataagg tccgacctta tcggcaagct caatgagctc cttggtggtg    13680

gtaacatcca gagaagcaca caggttggtt ttcttggctg ccacgagctt gagcactcga    13740

gcggcaaagg cggacttgtg gacgttagct cgagcttcgt aggagggcat tttggtggtg    13800

aagaggagac tgaaataaat ttagtctgca gaacttttta tcggaacctt atctggggca    13860

gtgaagtata tgttatggta atagttacga gttagttgaa cttatagata gactggacta    13920

tacggctatc ggtccaaatt agaaagaacg tcaatggctc tctgggcgtc gcctttgccg    13980

acaaaaatgt gatcatgatg aaagccagca atgacgttgc agctgatatt gttgtcggcc    14040

aaccgcgccg aaaacgcagc tgtcagaccc acagcctcca acgaagaatg tatcgtcaaa    14100

gtgatccaag cacactcata gttggagtcg tactccaaag gcggcaatga cgagtcagac    14160

agatactcgt cgaccttttc cttgggaacc accaccgtca gcccttctga ctcacgtatt    14220

gtagccaccg acacaggcaa cagtccgtgg atagcagaat atgtcttgtc ggtccatttc    14280

tcaccaactt taggcgtcaa gtgaatgttg cagaagaagt atgtgccttc attgagaatc    14340

ggtgttgctg atttcaataa agtcttgaga tcagtttggc cagtcatgtt gtggggggta    14400

attggattga gttatcgcct acagtctgta caggtatact cgctgcccac tttatacttt    14460

ttgattccgc tgcacttgaa gcaatgtcgt ttaccaaaag tgagaatgct ccacagaaca    14520

caccccaggg tatggttgag caaaaaataa acactccgat acggggaatc gaaccccggt    14580

ctccacggtt ctcaagaagt attcttgatg agagcgtat                           14619


<210>  228
<211>  993
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  993 bp stuffer fragment

<400>  228
ctctatccaa tcacacagtc aactccgact accacgctcc ggcgggcagt ggagccttct       60

actctcacca gaatggctct gagcgcgaca tgccaccgcc cgccacgtcc taccgaggat      120

accagcctcc gggatattga aaaaaaagtg taaatattta ttatcgtcaa tcgatcacga      180

attatctgaa cggaatgaat gcagccgtgt gtgagctgtg tagatcatgt aacgtaaacg      240

gatacaatgg ctccatcctt tcttgactcg tttccacttc tgcaattgag tcgtaggagg      300

tgatctcact catgatcatg acttgtgcac taacgttggc tctacagaat agtcagaata      360

gtccagctcg tgccagtcgc gtgatttgaa acgcgaccaa caatgagaat gtttcataag      420

ctgcgctaag ttggtgatgt ctcacattca cattcgtttc atgacgcgat caaacttgtc      480

tacagttacc tattgcgaaa caaaactgcg ggacagtttc ttctatcatc tataaatcat      540

taatttatac aatccgtatc cgacggattc gtttttccca ttctttttgt ctgcgtctcc      600

gtctaagcga gagaagtgaa gaacttgtgg gcgagctcga gcggccgcat tctagatgca      660

gacgcacctt gaccagatca aacggatgac caaccagaac ggcgcagata cctccaaagc      720

caccggcggc gaaggactcg gcctgcgaca ggaacgactt gagagcggaa gggggagcga      780

tggccttaac gccagagtca atttcgggag cgtcactcat tgtgtggggt gtgtggaagg      840

gatagttgag tgcagttagg agatggcaac gtccgagtta taagaagtgt ttagagttgt      900

tcacacggga aggttacctc ggcatgtgaa tatggaggat tcggtatagg ggggtgtagc      960

ggcgggttta gggggcgatt gtgtatatat atg                                   993


<210>  229
<211>  7338
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pYPS234

<400>  229
aaatgagtat ctgtctgact cgtcattgcc gcctttggag tacgactcca actatgagtg       60

tgcttggatc actttgacga tacattcttc gttggaggct gtgggtctga cagctgcgtt      120

ttcggcgcgg ttggccgaca acaatatcag ctgcaacgtc attgctggct ttcatcatga      180

tcacattttt gtcggcaaag gcgacgccca gagagccatt gacgttcttt ctaatttgga      240

ccgatagccg tatagtccag tctatctata agttcaacta actcgtaact attaccataa      300

catatacttc actgccccag ataaggttcc gataaaaagt tctgcagact aaatttattt      360

cagtctcctc ttcaccacca aaatgccctc ctacgaagct cgagctaacg tccacaagtc      420

cgcctttgcc gctcgagtgc tcaagctcgt ggcagccaag aaaaccaacc tgtgtgcttc      480

tctggatgtt accaccacca aggagctcat tgagcttgcc gataaggtcg gaccttatgt      540

gtgcatgatc aaaacccata tcgacatcat tgacgacttc acctacgccg gcactgtgct      600

ccccctcaag gaacttgctc ttaagcacgg tttcttcctg ttcgaggaca gaaagttcgc      660

agatattggc aacactgtca agcaccagta ccggtgtcac cgaatcgccg agtggtccga      720

tatcaccaac gcccacggtg tacccggaac cggaatcatt gctggcctgc gagctggtgc      780

cgaggaaact gtctctgaac agaagaagga ggacgtctct gactacgaga actcccagta      840

caaggagttc ctagtcccct ctcccaacga gaagctggcc agaggtctgc tcatgctggc      900

cgagctgtct tgcaagggct ctctggccac tggcgagtac tccaagcaga ccattgagct      960

tgcccgatcc gaccccgagt ttgtggttgg cttcattgcc cagaaccgac ctaagggcga     1020

ctctgaggac tggcttattc tgacccccgg ggtgggtctt gacgacaagg gagacgctct     1080

cggacagcag taccgaactg ttgaggatgt catgtctacc ggaacggata tcataattgt     1140

cggccgaggt ctgtacggcc agaaccgaga tcctattgag gaggccaagc gataccagaa     1200

ggctggctgg gaggcttacc agaagattaa ctgttagagg ttagactatg gatatgtaat     1260

ttaactgtgt atatagagag cgtgcaagta tggagcgctt gttcagcttg tatgatggtc     1320

agacgacctg tctgatcgag tatgtatgat actgcacaac ctgtgtatcc gcatgatctg     1380

tccaatgggg catgttgttg tgtttctcga tacggagatg ctgggtacag tgctaatacg     1440

ttgaactact tatacttata tgaggctcga agaaagctga cttgtgtatg acttaattaa     1500

cgcggcgcgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat     1560

tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg     1620

agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc     1680

aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt     1740

gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag     1800

tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc     1860

cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc     1920

ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt     1980

cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt     2040

atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc     2100

agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa     2160

gtggtggcct aactacggct acactagaag aacagtattt ggtatctgcg ctctgctgaa     2220

gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg     2280

tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga     2340

agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg     2400

gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg     2460

aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt     2520

aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact     2580

ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat     2640

gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg     2700

aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg     2760

ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat     2820

tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc     2880

ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt     2940

cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc     3000

agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga     3060

gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc     3120

gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa     3180

acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta     3240

acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg     3300

agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg     3360

aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat     3420

gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt     3480

tccccgaaaa gtgccacctg atgcggtgtg aaataccgca cagatgcgta aggagaaaat     3540

accgcatcag gaaattgtaa gcgttaatat tttgttaaaa ttcgcgttaa atttttgtta     3600

aatcagctca ttttttaacc aataggccga aatcggcaaa atcccttata aatcaaaaga     3660

atagaccgag atagggttga gtgttgttcc agtttggaac aagagtccac tattaaagaa     3720

cgtggactcc aacgtcaaag ggcgaaaaac cgtctatcag ggcgatggcc cactacgtga     3780

accatcaccc taatcaagtt ttttggggtc gaggtgccgt aaagcactaa atcggaaccc     3840

taaagggagc ccccgattta gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga     3900

agggaagaaa gcgaaaggag cgggcgctag ggcgctggca agtgtagcgg tcacgctgcg     3960

cgtaaccacc acacccgccg cgcttaatgc gccgctacag ggcgcgtcca ttcgccattc     4020

aggctgcgca actgttggga agggcgatcg gtgcgggcct cttcgctatt acgccagctg     4080

gcgaaagggg gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca     4140

cgacgttgta aaacgacggc cagtgaattg taatacgact cactataggg cgaattgggc     4200

ccgacgtcgc atgcatcaat cgatgactgt cgactcacgt acgggtggtg agaatggttg     4260

gttgtaggga agaatcaaag gccggtctcg ggatccgtgg gtatatatat atatatatat     4320

atatacgatc cttcgttacc tccctgttct caaaactgtg gtttttcgtt tttcgttttt     4380

tgcttttttt gattttttta gggccaacta agcttccaga tttcgctaat cacctttgta     4440

ctaattacaa gaaaggaaga agctgattag agttgggctt tttatgcaac tgtgctactc     4500

cttatctctg atatgaaagt gtagacccaa tcacatcatg tcatttagag ttggtaatac     4560

tgggaggata gataaggcac gaaaacgagc catagcagac atgctgggtg tagccaagca     4620

gaagaaagta gatgggagcc aattgacgag cgagggagct acgccaatcc gacatacgac     4680

acgctgagat cgtcttggcc ggggggtacc tacagatgtc caagggtaag tgcttgactg     4740

taattgtatg tctgaggaca aatatgtagt cagccgtata aagtcatacc aggcaccagt     4800

gccatcatcg aaccactaac tctctatgat acatgcctcc ggtattattg taccatgcgt     4860

cgctttgtta catacgtatc ttgccttttt ctctcagaaa ctccagactt tggctattgg     4920

tcgagataag cccggaccat agtgagtctt tcacactctg tttaaacaag ctaccaccac     4980

actcgttggg tgcagtcgcc agcttaaaga tatctatcca catcagccac aactcccttc     5040

ctttaataaa ccgactacac ccttggctat tgaggttatg agtgaatata ctgtagacaa     5100

gacactttca agaagactgt ttccaaaacg taccactgtc ctccactaca aacacaccca     5160

atctgcttct tctagtcaag gttgctacac cggtaaatta taaatcatca tttcattagc     5220

agggcagggc cctttttata gagtcttata cactagcgga ccctgccggt agaccaaccc     5280

gcaggcgcgt cagtttgctc cttccatcaa tgcgtcgtag actagtctct atccaatcac     5340

acagtcaact ccgactacca cgctccggcg ggcagtggag ccttctactc tcaccagaat     5400

ggctctgagc gcgacatgcc accgcccgcc acgtcctacc gaggatacca gcctccggga     5460

tattgaaaaa aaagtgtaaa tatttattat cgtcaatcga tcacgaatta tctgaacgga     5520

atgaatgcag ccgtgtgtga gctgtgtaga tcatgtaacg taaacggata caatggctcc     5580

atcctttctt gactcgtttc cacttctgca attgagtcgt aggaggtgat ctcactcatg     5640

atcatgactt gtgcactaac gttggctcta cagaatagtc agaatagtcc agctcgtgcc     5700

agtcgcgtga tttgaaacgc gaccaacaat gagaatgttt cataagctgc gctaagttgg     5760

tgatgtctca cattcacatt cgtttcatga cgcgatcaaa cttgtctaca gttacctatt     5820

gcgaaacaaa actgcgggac agtttcttct atcatctata aatcattaat ttatacaatc     5880

cgtatccgac ggattcgttt ttcccattct ttttgtctgc gtctccgtct aagcgagaga     5940

agtgaagaac ttgtgggcga gctcgagcgg ccgcattcta gatgcagacg caccttgacc     6000

agatcaaacg gatgaccaac cagaacggcg cagatacctc caaagccacc ggcggcgaag     6060

gactcggcct gcgacaggaa cgacttgaga gcggaagggg gagcgatggc cttaacgcca     6120

gagtcaattt cgggagcgtc actcattgtg tggggtgtgt ggaagggata gttgagtgca     6180

gttaggagat ggcaacgtcc gagttataag aagtgtttag agttgttcac acgggaaggt     6240

tacctcggca tgtgaatatg gaggattcgg tatagggggg tgtagcggcg ggtttagggg     6300

gcgattgtgt atatatatgg gatcccggtt ctgtgtgcac aattggcaat ccaagatgga     6360

tggattcaac acagggatat agcgagctac gtggtggtgc gaggatatag caacggatat     6420

ttatgtttga cacttgagaa tgtacgatac aagcactgtc caagtacaat actaaacata     6480

ctgtacatac tcatactcgt acccgggcaa cggtttcact tgagtgcagt ggctagtgct     6540

cttactcgta cagtgtgcaa tactgcgtat catagtcttt gatgtatatc gtattcattc     6600

atgttagttg cgtacgctgt gttgttgtat gtggtgaagc ttgacaatgg atggtgtgtc     6660

gtatcaggct ggggaacaat tgtgcttaag tatgctgcag ttgagtaaga gtcatcgctc     6720

caccaaaata aagtttgcca ttagggttgg agagagagat ggtggctgga agaattaaat     6780

gacatcaagc tgaggattgt gggtgtgcaa taacacatgt taggggtgac ctgtggctcg     6840

aaatctgata attattttgt aactttatga ttattcttag attttttaat attcctctat     6900

ataacacata agtagctgtc gtctagttgt tcatagcctg actcctgcaa tagattagtg     6960

cagagtgatt ttgtgcaatt gagagccacg gttgagtcaa gtgactttgt gtgtgaagtc     7020

atcttacgtt tcaagtctca caggttactc aattggttgg ttgtctgccc tttacagata     7080

tttacagtac ctgagcgtaa agtcgttcat ccacggaatg actgttcctg tcacgcagtc     7140

atgatcatgg atgtggctgg tcaggaacca ttttggatag gagacttagg gattggacta     7200

ttattgaaaa aactgagccg aatatgatat agttctattt gaatgcagaa cttctgatgg     7260

tcaattcact tatttcaggc atatcggtca tggtggcagc tgccacgatg ttatctcgtt     7320

ggaaaagatc ttcgattt                                                   7338


<210>  230
<211>  1019
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  1019 bp stuffer fragment

<400>  230
actcactctg tcagctgccc cacattgccc aatgcacaat gcacaatgat gtgtgcaaac       60

aacgcaatca aaagtctatg catgctgacc aaactctgat caccaagttg cgaacatgaa      120

aaagaagacc tgtgtatata taagtaaggg ggagagccct aactagatct ttcgaaaacc      180

ccccgacctt caccttccac aaccatgatc atcttatacg ttttggccgt tgcggtctcc      240

ttcctcatct tcaagagagt cacctacacg atgcgaagcc gagagctcgc caagaagtgg      300

cactgtgagg agcctcacaa cctgaatgat ctagatagcg gccgccatct taacggtgtg      360

ttcgttaaga tgacccggga cgaggctgcc tttgccgaga ccgagaagct cattaacgca      420

taaagggatg tatacttgta ctgtagatgg attaataaat tatttatgat gaacaaacat      480

tgactgagaa aggcaccata ttgtagcttc aagtactgta ctggtagaag taagcctgtc      540

gttacccact tatggagatc caatattgtt tgatgcggcc atcggactgc ttggagcgta      600

cagagaattg gtaaaatgtg gtggctgggt tggtttcgat tgaaattatc aagatctctg      660

aggtattctt ggtgttgaag ggatgttctt ggactactgc ttgtaacata gatgggatgt      720

agggatggca tggggtataa tcgagacaac cattgtggag gtaggcaaga gctcttacaa      780

gtaccatacg agtacaacta ctgaggcata atgtctggaa ttcccggtta agtccattac      840

agttgatatc attcaaatcg gacagatatt ctccgaggtg atacatctct gtccccgaga      900

gaagctgatt tggagcacat ccaggttacg gggggagaga aatttatgta atgtgaggat      960

tttttggagc tgaataataa agtcctgttg atctagcttc aattgagtca acatccgcc      1019


<210>  231
<211>  7364
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pYPS233

<400>  231
aaatgagtat ctgtctgact cgtcattgcc gcctttggag tacgactcca actatgagtg       60

tgcttggatc actttgacga tacattcttc gttggaggct gtgggtctga cagctgcgtt      120

ttcggcgcgg ttggccgaca acaatatcag ctgcaacgtc attgctggct ttcatcatga      180

tcacattttt gtcggcaaag gcgacgccca gagagccatt gacgttcttt ctaatttgga      240

ccgatagccg tatagtccag tctatctata agttcaacta actcgtaact attaccataa      300

catatacttc actgccccag ataaggttcc gataaaaagt tctgcagact aaatttattt      360

cagtctcctc ttcaccacca aaatgccctc ctacgaagct cgagctaacg tccacaagtc      420

cgcctttgcc gctcgagtgc tcaagctcgt ggcagccaag aaaaccaacc tgtgtgcttc      480

tctggatgtt accaccacca aggagctcat tgagcttgcc gataaggtcg gaccttatgt      540

gtgcatgatc aaaacccata tcgacatcat tgacgacttc acctacgccg gcactgtgct      600

ccccctcaag gaacttgctc ttaagcacgg tttcttcctg ttcgaggaca gaaagttcgc      660

agatattggc aacactgtca agcaccagta ccggtgtcac cgaatcgccg agtggtccga      720

tatcaccaac gcccacggtg tacccggaac cggaatcatt gctggcctgc gagctggtgc      780

cgaggaaact gtctctgaac agaagaagga ggacgtctct gactacgaga actcccagta      840

caaggagttc ctagtcccct ctcccaacga gaagctggcc agaggtctgc tcatgctggc      900

cgagctgtct tgcaagggct ctctggccac tggcgagtac tccaagcaga ccattgagct      960

tgcccgatcc gaccccgagt ttgtggttgg cttcattgcc cagaaccgac ctaagggcga     1020

ctctgaggac tggcttattc tgacccccgg ggtgggtctt gacgacaagg gagacgctct     1080

cggacagcag taccgaactg ttgaggatgt catgtctacc ggaacggata tcataattgt     1140

cggccgaggt ctgtacggcc agaaccgaga tcctattgag gaggccaagc gataccagaa     1200

ggctggctgg gaggcttacc agaagattaa ctgttagagg ttagactatg gatatgtaat     1260

ttaactgtgt atatagagag cgtgcaagta tggagcgctt gttcagcttg tatgatggtc     1320

agacgacctg tctgatcgag tatgtatgat actgcacaac ctgtgtatcc gcatgatctg     1380

tccaatgggg catgttgttg tgtttctcga tacggagatg ctgggtacag tgctaatacg     1440

ttgaactact tatacttata tgaggctcga agaaagctga cttgtgtatg acttaattaa     1500

cgcggcgcgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat     1560

tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg     1620

agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc     1680

aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt     1740

gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag     1800

tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc     1860

cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc     1920

ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt     1980

cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt     2040

atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc     2100

agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa     2160

gtggtggcct aactacggct acactagaag aacagtattt ggtatctgcg ctctgctgaa     2220

gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg     2280

tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga     2340

agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg     2400

gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg     2460

aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt     2520

aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact     2580

ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat     2640

gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg     2700

aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg     2760

ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat     2820

tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc     2880

ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt     2940

cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc     3000

agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga     3060

gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc     3120

gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa     3180

acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta     3240

acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg     3300

agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg     3360

aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat     3420

gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt     3480

tccccgaaaa gtgccacctg atgcggtgtg aaataccgca cagatgcgta aggagaaaat     3540

accgcatcag gaaattgtaa gcgttaatat tttgttaaaa ttcgcgttaa atttttgtta     3600

aatcagctca ttttttaacc aataggccga aatcggcaaa atcccttata aatcaaaaga     3660

atagaccgag atagggttga gtgttgttcc agtttggaac aagagtccac tattaaagaa     3720

cgtggactcc aacgtcaaag ggcgaaaaac cgtctatcag ggcgatggcc cactacgtga     3780

accatcaccc taatcaagtt ttttggggtc gaggtgccgt aaagcactaa atcggaaccc     3840

taaagggagc ccccgattta gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga     3900

agggaagaaa gcgaaaggag cgggcgctag ggcgctggca agtgtagcgg tcacgctgcg     3960

cgtaaccacc acacccgccg cgcttaatgc gccgctacag ggcgcgtcca ttcgccattc     4020

aggctgcgca actgttggga agggcgatcg gtgcgggcct cttcgctatt acgccagctg     4080

gcgaaagggg gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca     4140

cgacgttgta aaacgacggc cagtgaattg taatacgact cactataggg cgaattgggc     4200

ccgacgtcgc atgcattcca tagccacacc tttgcctatg gcttcacaac cgaaggcaat     4260

tcgagaggtc gcgcttatgg aatcgactcg tataaagctg aagggaaagg gagacgttcc     4320

gagcgctcag atgcaatagt cgtccagcta atgtggattc aaaaacaacc ccaacagtaa     4380

tcttgaaaat ttgaacggat caatctgaac actcttgctc caggtcattc ttctaacgca     4440

catccccaga gtctagaggg agttgtgttg tgaacatcct aataaacaat gcaatggatt     4500

cgggatatct tctgtctcgc cccctactcg atgtcgagta aaccgatcac caactaacaa     4560

tactcctccg cgttctgcca ttgactctca aacagacatc gctatcaacg gaacagcata     4620

ttttagcttc ttaggacaat aaatattgat aatgccggct ctccctcggt atattaagca     4680

atccattcat acactcattc atcaggttaa ttttatatat ataatttgtc tattcaaaca     4740

ccgtaaatta ctggtaccat catctcctcc ttttcaaata cacgtctatt tgcattaatg     4800

aaattactcg ccaattcgca gaacgtgttt gtcgaacaga gccttagctc gggtccagac     4860

aggagcagtg tctcgctgag gaagctgcag gagagttaat taaaaggcgt tgaaacagaa     4920

tgagccagac agcaaggaca aggtggccaa cagcaaggag tccaaaaagc cctctattga     4980

cgagatccac gatgttattg ctcatgaggt ttccgagctc gatgctggga agaagaagtg     5040

atttgtatat aagaaataaa tgagatatag taaaggagtg caagagaatg gcaaggtggt     5100

caaattctat attacttgca gtcactggtt cctcgttgac atgaatgaag ttaccgttgg     5160

catagctgat ttaatatata actgtccaac taactctcac ctagatataa cccatgtgtg     5220

tgtttccacg cgtactcact ctgtcagctg ccccacattg cccaatgcac aatgcacaat     5280

gatgtgtgca aacaacgcaa tcaaaagtct atgcatgctg accaaactct gatcaccaag     5340

ttgcgaacat gaaaaagaag acctgtgtat atataagtaa gggggagagc cctaactaga     5400

tctttcgaaa accccccgac cttcaccttc cacaaccatg atcatcttat acgttttggc     5460

cgttgcggtc tccttcctca tcttcaagag agtcacctac acgatgcgaa gccgagagct     5520

cgccaagaag tggcactgtg aggagcctca caacctgaat gatctagata gcggccgcca     5580

tcttaacggt gtgttcgtta agatgacccg ggacgaggct gcctttgccg agaccgagaa     5640

gctcattaac gcataaaggg atgtatactt gtactgtaga tggattaata aattatttat     5700

gatgaacaaa cattgactga gaaaggcacc atattgtagc ttcaagtact gtactggtag     5760

aagtaagcct gtcgttaccc acttatggag atccaatatt gtttgatgcg gccatcggac     5820

tgcttggagc gtacagagaa ttggtaaaat gtggtggctg ggttggtttc gattgaaatt     5880

atcaagatct ctgaggtatt cttggtgttg aagggatgtt cttggactac tgcttgtaac     5940

atagatggga tgtagggatg gcatggggta taatcgagac aaccattgtg gaggtaggca     6000

agagctctta caagtaccat acgagtacaa ctactgaggc ataatgtctg gaattcccgg     6060

ttaagtccat tacagttgat atcattcaaa tcggacagat attctccgag gtgatacatc     6120

tctgtccccg agagaagctg atttggagca catccaggtt acggggggag agaaatttat     6180

gtaatgtgag gattttttgg agctgaataa taaagtcctg ttgatctagc ttcaattgag     6240

tcaacatccg ccgtcgactg acgtacgggt ggtgagaatg gttggttgta gggaagaatc     6300

aaaggccggt ctcgggatcc gtgggtatat atatatatat atatatatac gatccttcgt     6360

tacctccctg ttctcaaaac tgtggttttt cgtttttcgt tttttgcttt ttttgatttt     6420

tttagggcca actaagcttc cagatttcgc taatcacctt tgtactaatt acaagaaagg     6480

aagaagctga ttagagttgg gctttttatg caactgtgct actccttatc tctgatatga     6540

aagtgtagac ccaatcacat catgtcattt agagttggta atactgggag gatagataag     6600

gcacgaaaac gagccatagc agacatgctg ggtgtagcca agcagaagaa agtagatggg     6660

agccaattga cgagcgaggg agctacgcca atccgacata cgacacgctg agatcgtctt     6720

ggccgggggg tacctacaga tgtccaaggg taagtgcttg actgtaattg tatgtctgag     6780

gacaaatatg tagtcagccg tataaagtca taccaggcac cagtgccatc atcgaaccac     6840

taactctcta tgatacatgc ctccggtatt attgtaccat gcgtcgcttt gttacatacg     6900

tatcttgcct ttttctctca gaaactccag actttggcta ttggtcgaga taagcccgga     6960

ccatagtgag tctttcacac tctgtttaaa caagctacca ccacactcgt tgggtgcagt     7020

cgccagctta aagatatcta tccacatcag ccacaactcc cttcctttaa taaaccgact     7080

acacccttgg ctattgaggt tatgagtgaa tatactgtag acaagacact ttcaagaaga     7140

ctgtttccaa aacgtaccac tgtcctccac tacaaacaca cccaatctgc ttcttctagt     7200

caaggttgct acaccggtaa attataaatc atcatttcat tagcagggca gggccctttt     7260

tatagagtct tatacactag cggaccctgc cggtagacca acccgcaggc gcgtcagttt     7320

gctccttcca tcaatgcgtc gtagactagt tcagatctcg attt                      7364


<210>  232
<211>  9211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pYSP241

<400>  232
cgatgactgt cgactccaac ttttcacact gagcgtaaaa tgtggagaag aaatcggcac       60

taaaaagtca ggtagactgg aaaatgcgcc atgaaatgaa tatctcttgc tacagtaatg      120

cccagcatcg aggggtattg tgtcaccaac actatagtgg cagctgaagc gctcgtgatt      180

gtagtatgag tctttattgg tgatgggaag agttcactca atattctcgt tactgccaaa      240

acaccacggt aatcggccag acaccatgga tgtagatcac caagcctgtg aatgttattc      300

gagctaaaat gcacatggtt ggtgaaagga gtagttgctg tcgaattccg tcgtcgcctg      360

agtcatcatt tatttaccag ttggccacaa acccttgacg atctcgtatg tcccctccga      420

catactcccg gccggctggg gtacgttcga tagcgctatc ggcatcgaca aggtttgggt      480

ccctagccga taccgcacta cctgagtcac aatcttcgga ggtttagtct tccacatagc      540

acgggcaaaa gtgcgtatat atacaagagc gtttgccagc cacagatttt cactccacac      600

accacatcac acatacaacc acacacatcc acaatggaac ccgaaactaa gaagaccaag      660

actgactcca agaagattgt tcttctcggc ggcgacttct gtggccccga ggtgattgcc      720

gaggccgtca aggtgctcaa gtctgttgct gaggcctccg gcaccgagtt tgtgtttgag      780

gaccgactca ttggaggagc tgccattgag aaggagggcg agcccatcac cgacgctact      840

ctcgacatct gccgaaaggc tgactctatt atgctcggtg ctgtcggagg cgctgccaac      900

accgtatgga ccactcccga cggacgaacc gacgtgcgac ccgagcaggg tctcctcaag      960

ctgcgaaagg acctgaacct gtacgccaac ctgcgaccct gccagctgct gtcgcccaag     1020

ctcgccgatc tctcccccat ccgaaacgtt gagggcaccg acttcatcat tgtccgagag     1080

ctcgtcggag gtatctactt tggagagcga aaggaggatg acggatctgg cgtcgcttcc     1140

gacaccgaga cctactccgt tcctgaggtt gagcgaattg cccgaatggc cgccttcctg     1200

gcccttcagc acaacccccc tcttcccgtg tggtctcttg acaaggccaa cgtgctggcc     1260

tcctctcgac tttggcgaaa gactgtcact cgagtcctca aggacgaatt cccccagctc     1320

gagctcaacc accagctgat cgactcggcc gccatgatcc tcatcaagca gccctccaag     1380

atgaatggta tcatcatcac caccaacatg tttggcgata tcatctccga cgaggcctcc     1440

gtcatccccg gttctctggg tctgctgccc tccgcctctc tggcttctct gcccgacacc     1500

aacgaggcgt tcggtctgta cgagccctgt cacggatctg cccccgatct cggcaagcag     1560

aaggtcaacc ccattgccac cattctgtct gccgccatga tgctcaagtt ctctcttaac     1620

atgaagcccg ccggtgacgc tgttgaggct gccgtcaagg agtccgtcga ggctggtatc     1680

actaccgccg atatcggagg ctcttcctcc acctccgagg tcggagactt gttgccaaca     1740

aggtcaagga gctgctcaag aaggagtaag tcgtttctac gacgcattga tggaaggagc     1800

aaactgacgc gcctgcgggt tggtctaccg gcagggtccg ctagtgtata agactctata     1860

aaaagggccc tgccctgcta atgaaatgat gatttataat ttaccggtgt agcaaccttg     1920

actagaagaa gcagattggg tgtgtttgta gtggaggaca gtggtacgtt ttggaaacag     1980

tcttcttgaa agtgtcttgt ctacagtata ttcactcata acctcaatag ccaagggtgt     2040

agtcggttta ttaaaggaag ggagttgtgg ctgatgtgga tagatatctt taagctggcg     2100

actgcaccca acgagtgtgg tggtagcttg tttaaacggt aggttagtgc ttggtatatg     2160

agttgtaggc atgacaattt ggaaaggggt ggactttggg aatattgtgg gatttcaata     2220

ccttagtttg tacgcaacta acatgaatga atacgatata catcaaagac tatgatacgc     2280

agtattgcac actgtacgag taagagcact agccactgca ctcaagtgaa accgttgccc     2340

gggtacgagt atgagtatgt acagtatgtt tagtattgta cttggacagt gcttgtatcg     2400

tacattctca agtgtcaaac ataaatatcc gttgctatat cctcgcacca ccacgtagct     2460

cgctatatcc ctgtgttgaa tccatccatc ttggattgcc aattgtgcac acagaaccgg     2520

gcactcactt ccccatccac acttgcggcc gcttaggcct tgtctgcctt ggcaaccatt     2580

cgagcgaagg tgcccaggta ggagataaga atgccgacac cttcgagcat gggaggagat     2640

cggtagggca ggttgtgctt cttgcacagc tgctccactt tgatggaggc agcggtgagg     2700

ttgtgtctgg gaagagtagg ccacaggtga tgttcgatct ggtagttgag gccaccgaaa     2760

aaccagtctg tgacgaggcc tcgctgaacg ttcatggtct cgtgaatctg tccagcacag     2820

aagccatgac catcccacac ggagtcctga atcttctcca gagggtagtg gttcatgaag     2880

acaacgatgg caattccgaa gccaccaagc agctcggaga caaagaacac catgagtccg     2940

gtgagaaacg agggcatgta gaaatagtag aacaaggcct tgagacccca gtggagtgcc     3000

agaccgacgg actccttctc gtactgtctt cgatagtact gattgcttcg atccttgaga     3060

cccgtggcag tatgaatgga ctgaaagcac cagatgaatc gcagaagggc acagatgaaa     3120

aagaagtaat actgttggta cttgatcatc cgtcgagaaa acgggccagc tcgctcgacg     3180

tcctccttgg accacgccag gagaggcagg ttgtcgatgt cgggatcgtg accctgcacg     3240

ttggtggcgg agtgatgagc gttgtgtcgg tccttccacc aagtgacaga aaagccctga     3300

agcacgtttc cgaagagcag gccaatggca ttgttgatgg atcggtcctt gaacagctgg     3360

tgatggcaaa tgtcgtgcga gagccatccc atctgttgaa agtgaatgcc gaggacgata     3420

gcaccaacca ggtaccagtg ccactgggtc atcaacagca cggcaaggac acccagtccg     3480

agagtcgaca gggtcttgta agcgtaccac atgggagagg catcgaacat gcctgcagca     3540

atgagctcgt ctcgcagctt tcggaaatcc tcctgaggtt cgtcggagtt tggcttggga     3600

ggtgttggag tcaggggaga ggaaggttcc atgatgggca ttcgtctgag tttggacacg     3660

gcgttctcgg agtgcataac catgaagacg tcggtagcat cccgacctcg gtagttctcg     3720

ataatgtcgg cacctccggg atggtggttg agccaggcag aaacgtcgta ggtgacaccg     3780

tccacggtga gaggcagagc gggtcgcttg accatggttg atgtgtgttt aattcaagaa     3840

tgaatataga gaagagaaga agaaaaaaga ttcaattgag ccaaaccgat ttaaatgagt     3900

atctgtctga ctcgtcattg ccgcctttgg agtacgactc caactatgag tgtgcttgga     3960

tcactttgac gatacattct tcgttggagg ctgtgggtct gacagctgcg ttttcggcgc     4020

ggttggccga caacaatatc agctgcaacg tcattgctgg ctttcatcat gatcacattt     4080

ttgtcggcaa aggcgacgcc cagagagcca ttgacgttct ttctaatttg gaccgatagc     4140

cgtatagtcc agtctatcta taagttcaac taactcgtaa ctattaccat aacatatact     4200

tcactgcccc agataaggtt ccgataaaaa gttctgcaga ctaaatttat ttcagtctcc     4260

tcttcaccac caaaatgccc tcctacgaag ctcgagctaa cgtccacaag tccgcctttg     4320

ccgctcgagt gctcaagctc gtggcagcca agaaaaccaa cctgtgtgct tctctggatg     4380

ttaccaccac caaggagctc attgagcttg ccgataaggt cggaccttat gtgtgcatga     4440

tcaaaaccca tatcgacatc attgacgact tcacctacgc cggcactgtg ctccccctca     4500

aggaacttgc tcttaagcac ggtttcttcc tgttcgagga cagaaagttc gcagatattg     4560

gcaacactgt caagcaccag taccggtgtc accgaatcgc cgagtggtcc gatatcacca     4620

acgcccacgg tgtacccgga accggaatca ttgctggcct gcgagctggt gccgaggaaa     4680

ctgtctctga acagaagaag gaggacgtct ctgactacga gaactcccag tacaaggagt     4740

tcctagtccc ctctcccaac gagaagctgg ccagaggtct gctcatgctg gccgagctgt     4800

cttgcaaggg ctctctggcc actggcgagt actccaagca gaccattgag cttgcccgat     4860

ccgaccccga gtttgtggtt ggcttcattg cccagaaccg acctaagggc gactctgagg     4920

actggcttat tctgaccccc ggggtgggtc ttgacgacaa gggagacgct ctcggacagc     4980

agtaccgaac tgttgaggat gtcatgtcta ccggaacgga tatcataatt gtcggccgag     5040

gtctgtacgg ccagaaccga gatcctattg aggaggccaa gcgataccag aaggctggct     5100

gggaggctta ccagaagatt aactgttaga ggttagacta tggatatgta atttaactgt     5160

gtatatagag agcgtgcaag tatggagcgc ttgttcagct tgtatgatgg tcagacgacc     5220

tgtctgatcg agtatgtatg atactgcaca acctgtgtat ccgcatgatc tgtccaatgg     5280

ggcatgttgt tgtgtttctc gatacggaga tgctgggtac agtgctaata cgttgaacta     5340

cttatactta tatgaggctc gaagaaagct gacttgtgta tgacttaatt aacgcggcgc     5400

gccagctgca ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt attgggcgct     5460

cttccgcttc ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat     5520

cagctcactc aaaggcggta atacggttat ccacagaatc aggggataac gcaggaaaga     5580

acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt     5640

ttttccatag gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt     5700

ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc     5760

gctctcctgt tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa     5820

gcgtggcgct ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct     5880

ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta     5940

actatcgtct tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg     6000

gtaacaggat tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc     6060

ctaactacgg ctacactaga agaacagtat ttggtatctg cgctctgctg aagccagtta     6120

ccttcggaaa aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg     6180

gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt     6240

tgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg     6300

tcatgagatt atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta     6360

aatcaatcta aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg     6420

aggcacctat ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg     6480

tgtagataac tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc     6540

gagacccacg ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg     6600

agcgcagaag tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg     6660

aagctagagt aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctacag     6720

gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat     6780

caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc     6840

cgatcgttgt cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc     6900

ataattctct tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa     6960

ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac     7020

gggataatac cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt     7080

cggggcgaaa actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc     7140

gtgcacccaa ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa     7200

caggaaggca aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca     7260

tactcttcct ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat     7320

acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa     7380

aagtgccacc tgatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc     7440

aggaaattgt aagcgttaat attttgttaa aattcgcgtt aaatttttgt taaatcagct     7500

cattttttaa ccaataggcc gaaatcggca aaatccctta taaatcaaaa gaatagaccg     7560

agatagggtt gagtgttgtt ccagtttgga acaagagtcc actattaaag aacgtggact     7620

ccaacgtcaa agggcgaaaa accgtctatc agggcgatgg cccactacgt gaaccatcac     7680

cctaatcaag ttttttgggg tcgaggtgcc gtaaagcact aaatcggaac cctaaaggga     7740

gcccccgatt tagagcttga cggggaaagc cggcgaacgt ggcgagaaag gaagggaaga     7800

aagcgaaagg agcgggcgct agggcgctgg caagtgtagc ggtcacgctg cgcgtaacca     7860

ccacacccgc cgcgcttaat gcgccgctac agggcgcgtc cattcgccat tcaggctgcg     7920

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg     7980

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg     8040

taaaacgacg gccagtgaat tgtaatacga ctcactatag ggcgaattgg gcccgacgtc     8100

gcatgccact tagtctacag attacgtgtg gtgtactggt acctgtggtg tagcgacagt     8160

attgtacata ctcgtactcc cactctacac acccgccgcc accacttacc ctcgtgagaa     8220

ccccctcaga aactcacaac cgtcacctta gcagaagggt ttagccgctt ggagccgaca     8280

gggatgtcag gcaatacagc tgtcaccgag gggacaccgg ggtgtcaaca acagtgtcat     8340

ttcggtgctt gtggagtgag tgacatcacc gagaaagttg tagtatgtag gggtttttgg     8400

aatggttgtt tcatgatcca ttggtactgc tcggcaaaaa tagcctgatg gaatcacgtt     8460

tcgttctggg gttgacgggc atctcagcta tctccatcca cgagtgtctc agagataaga     8520

cctgcatatc tgttatctgt tttcaaagtg taagtcgtat agagtgccgg gagggggtca     8580

aggatgtgca tggacgtgac tacagtgaga gtttttggag gaagtaaaga ccgagtatcg     8640

accatccaga aaccccatat cttctggaat tccacaaatg gccaagactg gcttctttta     8700

tccacttttt gatactaaaa atgtctaatt tgacctctct tccactaaca aaaacagtct     8760

cgccgttctc caacccgcac caccaactac aagtacagta gctataattt tttggcacca     8820

ctcgcttttc tctttttagg cgaaacttgt ctcaaactta tcatgccata ggcattaatt     8880

tccgggtctt gcagagctac tgcactgggg ggggggggtg tacaacgcac gaccgagtgc     8940

tatgtaatgt acacaatggc cacaaacaca actgttactg atgtaccgat gactcagcac     9000

ctacagtagt ccacgcgttt gacttccggg gctcttcggg cccaattttt ttcttcgtct     9060

gcttcccaaa tgccagattt atgcaaggtg catgtgacgt cccggtccga ggattgctat     9120

aacgggcgaa ataaggacac aaacaactca aaaataatac caaaatatag cttgcaacaa     9180

gggcttcaga ttgatcccat tattggtcca t                                    9211


<210>  233
<211>  13926
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pZR5AU-555

<400>  233
cgatccctaa ctgtatacta gcattctgaa ttaatatctt cctctttgat tgttccttag       60

gcatatcagc ggggatcccg tcaacagttt tatatatcgt agttacaacc atcaacactt      120

tttggtaagt gtaccattct atactccaac tggtctgcaa ctgtacaagt agacatgtta      180

atggtagtta ataacatcta cagcagaacc tatggtaaag acattgcatt tttacaggaa      240

gtatcgtcct acacgttgat aaatccaaag atgcggaact tcttccactt ttatcatcat      300

cccctactcg tacactcgta ctctttgttc gatcgcgatt catttctata aataatcttg      360

tatgtacatg cggccgctta gaactcctgg gcatcctgaa tggtgtgagc aacacccacc      420

ttctgcatgt atcgaagagt ggacatgatg ttctcgagaa tggtaggata gtatgtgtaa      480

cggacgccat gtcgcttgca acactcccga acggtgtctt gaatgaaggg atagtgccaa      540

gaggacattc ggggaaacaa gtgatgctcg atctgcaggt tgagaccgcc tgtaagcatc      600

attcccacgc tgccaccgta ggtggcagca gtctcggcct gtgctcggta ccagtcgact      660

cgatcctctt ccacgtctct gacctccttg ccttccttag ctcgtttggc cttgcagttc      720

ttctctgggg tccgttcggc tccctcgaag ttatgggaca gaacgaacac aaaggtgagg      780

aagatgcctg tgaccgtggg caccagaaga atggtgagca gaggagaggc accagcaatg      840

taccagggaa gaaaggcgtt tcggaagatg aagagcagtc gaagggaaac ggcgaaagct      900

cgctgccgat tgaaaaagct gcctggtctg agagaagtgg cctcgggaat ggtgtcgttg      960

tgctgcattg tgaagaggta catgggattg tacaccatcg agacaccata cattcccaga     1020

acgatgtaca tgtaccacgc ctgaaatcgg tggtaccact ttcgagaggc gttggtagca     1080

ggataggagt ggaacaagat gaacggctct gccgaggcag catccaggtc gaggccatgc     1140

tggttggtgt aagtgtgatg tcgcataatg tgggactgca gccagatcca acgagaggaa     1200

ccaatggcgt cgatgccata agcgaaaaag gcgttgaccc agggcttttt gctgatggct     1260

ccgtgagaac catcgtgttg aatggccaag ccaatctgcg agtggaagta tccagtaaag     1320

ataccccaca tcactgctcc ggttgtgatc cagtaccact cgccaatggc agtacaagcg     1380

atgatgagtg cggttctcag ccagaagcca ggagtggcgt accagtttcg acccttcatg     1440

acctctcgaa cggactgctt gacgtcctga gcaaagggag agtcgaaggt gtacacggga     1500

tccttggagc ttcgttcgtg cttgcctcgg gcaaattgct gcagaagctt cgagttctct     1560

ggcttgactc caggatgcat ggagtagaac agagcggttc cgtcggatgc accggcagct     1620

tcgatgaggt tgcctccggc atggaccttg gccagaccga agagatcgta caggactccg     1680

tcgatggcaa cgagctcggg gtgttccagg agctgctcgg tagtcaggga gatggtggcc     1740

atggttgtga attagggtgg tgagaatggt tggttgtagg gaagaatcaa aggccggtct     1800

cgggatccgt gggtatatat atatatatat atatatacga tccttcgtta cctccctgtt     1860

ctcaaaactg tggtttttcg tttttcgttt tttgcttttt ttgatttttt tagggccaac     1920

taagcttcca gatttcgcta atcacctttg tactaattac aagaaaggaa gaagctgatt     1980

agagttgggc tttttatgca actgtgctac tccttatctc tgatatgaaa gtgtagaccc     2040

aatcacatca tgtcatttag agttggtaat actgggagga tagataaggc acgaaaacga     2100

gccatagcag acatgctggg tgtagccaag cagaagaaag tagatgggag ccaattgacg     2160

agcgagggag ctacgccaat ccgacatacg acacgctgag atcgtcttgg ccggggggta     2220

cctacagatg tccaagggta agtgcttgac tgtaattgta tgtctgagga caaatatgta     2280

gtcagccgta taaagtcata ccaggcacca gtgccatcat cgaaccacta actctctatg     2340

atacatgcct ccggtattat tgtaccatgc gtcgctttgt tacatacgta tcttgccttt     2400

ttctctcaga aactccagac tttggctatt ggtcgagata agcccggacc atagtgagtc     2460

tttcacactc tgtttaaaca agctaccacc acactcgttg ggtgcagtcg acgagtatct     2520

gtctgactcg tcattgccgc ctttggagta cgactccaac tatgagtgtg cttggatcac     2580

tttgacgata cattcttcgt tggaggctgt gggtctgaca gctgcgtttt cggcgcggtt     2640

ggccgacaac aatatcagct gcaacgtcat tgctggcttt catcatgatc acatttttgt     2700

cggcaaaggc gacgcccaga gagccattga cgttctttct aatttggacc gatagccgta     2760

tagtccagtc tatctataag ttcaactaac tcgtaactat taccataaca tatacttcac     2820

tgccccagat aaggttccga taaaaagttc tgcagactaa atttatttca gtctcctctt     2880

caccaccaaa atgccctcct acgaagctcg agctaacgtc cacaagtccg cctttgccgc     2940

tcgagtgctc aagctcgtgg cagccaagaa aaccaacctg tgtgcttctc tggatgttac     3000

caccaccaag gagctcattg agcttgccga taaggtcgga ccttatgtgt gcatgatcaa     3060

aacccatatc gacatcattg acgacttcac ctacgccggc actgtgctcc ccctcaagga     3120

acttgctctt aagcacggtt tcttcctgtt cgaggacaga aagttcgcag atattggcaa     3180

cactgtcaag caccagtacc ggtgtcaccg aatcgccgag tggtccgata tcaccaacgc     3240

ccacggtgta cccggaaccg gaatcattgc tggcctgcga gctggtgccg aggaaactgt     3300

ctctgaacag aagaaggagg acgtctctga ctacgagaac tcccagtaca aggagttcct     3360

agtcccctct cccaacgaga agctggccag aggtctgctc atgctggccg agctgtcttg     3420

caagggctct ctggccactg gcgagtactc caagcagacc attgagcttg cccgatccga     3480

ccccgagttt gtggttggct tcattgccca gaaccgacct aagggcgact ctgaggactg     3540

gcttattctg acccccgggg tgggtcttga cgacaaggga gacgctctcg gacagcagta     3600

ccgaactgtt gaggatgtca tgtctaccgg aacggatatc ataattgtcg gccgaggtct     3660

gtacggccag aaccgagatc ctattgagga ggccaagcga taccagaagg ctggctggga     3720

ggcttaccag aagattaact gttagaggtt agactatgga tatgtaattt aactgtgtat     3780

atagagagcg tgcaagtatg gagcgcttgt tcagcttgta tgatggtcag acgacctgtc     3840

tgatcgagta tgtatgatac tgcacaacct gtgtatccgc atgatctgtc caatggggca     3900

tgttgttgtg tttctcgata cggagatgct gggtacagtg ctaatacgtt gaactactta     3960

tacttatatg aggctcgaag aaagctgact tgtgtatgac ttattctcaa ctacatcccc     4020

agtcacaata ccaccactgc actaccacta caccaaaacc atgatcaaac caccgatgga     4080

cttcctggag gcagaagaac ttgttatgga aaagctcaag agagagaatt caggagagac     4140

cgggttggcg gcgtatttgt gtcccaaaaa acagccccaa ttgccccgga gaagacggcc     4200

aggccgccta gatgacaaat tcaacaactc acagctgact ttctgccatt gccactaggg     4260

gggggccttt ttatatggcc aagccaagct ctccacgtcg gttgggctgc acccaacaat     4320

aaatgggtag ggttgcacca acaaagggat gggatggggg gtagaagata cgaggataac     4380

ggggctcaat ggcacaaata agaacgaata ctgccattaa gactcgtgat ccagcgactg     4440

acaccattgc atcatctaag ggcctcaaaa ctacctcgga actgctgcgc tgatctggac     4500

accacagagg ttccgagcac tttaggttgc accaaatgtc ccaccaggtg caggcagaaa     4560

acgctggaac agcgtgtaca gtttgtctta acaaaaagtg agggcgctga ggtcgagcag     4620

ggtggtgtga cttgttatag cctttagagc tgcgaaagcg cgtatggatt tggctcatca     4680

ggccagattg agggtctgtg gacacatgtc atgttagtgt acttcaatcg ccccctggat     4740

atagccccga caataggccg tggcctcatt tttttgcctt ccgcacattt ccattgctcg     4800

gtacccacac cttgcttctc ctgcacttgc caaccttaat actggtttac attgaccaac     4860

atcttacaag cggggggctt gtctagggta tatataaaca gtggctctcc caatcggttg     4920

ccagtctctt ttttcctttc tttccccaca gattcgaaat ctaaactaca catcacacaa     4980

tgcctgttac tgacgtcctt aagcgaaagt ccggtgtcat cgtcggcgac gatgtccgag     5040

ccgtgagtat ccacgacaag atcagtgtcg agacgacgcg ttttgtgtaa tgacacaatc     5100

cgaaagtcgc tagcaacaca cactctctac acaaactaac ccagctctcc atggctctct     5160

cccttactac cgagcagctg ctcgagcgac ccgacctggt tgccatcgac ggcattctct     5220

acgatctgga aggtcttgcc aaggtccatc ccggatccga cttgatcctc gcttctggtg     5280

cctccgatgc ttctcctctg ttctactcca tgcaccctta cgtcaagccc gagaactcga     5340

agctgcttca acagttcgtg cgaggcaagc acgaccgaac ctccaaggac attgtctaca     5400

cctacgactc tccctttgca caggacgtca agcgaactat gcgagaggtc atgaaaggtc     5460

ggaactggta tgccacacct ggattctggc tgcgaaccgt tggcatcatt gctgtcaccg     5520

ccttttgcga gtggcactgg gctactaccg gaatggtgct gtggggtctc ttgactggat     5580

tcatgcacat gcagatcggc ctgtccattc agcacgatgg ttctcatggt gccatcagca     5640

aaaagccctg ggtcaacgct ctctttgcct acggcatcga cgtcattgga tcgtccagat     5700

ggatctggct gcagtctcac atcatgcgac atcacaccta caccaatcag catggtctcg     5760

acctggatgc cgagtccgca gaaccattcc ttgtgttcca caactaccct gctgccaaca     5820

ctgctcgaaa gtggtttcac cgattccagg cctggtacat gtacctcgtg cttggagcct     5880

acggcgtttc gctggtgtac aaccctctct acatcttccg aatgcagcac aacgacacca     5940

ttcccgagtc tgtcacagcc atgcgagaga acggctttct gcgacggtac cgaacccttg     6000

cattcgttat gcgagctttc ttcatctttc gaaccgcctt cttgccctgg tatctcactg     6060

gaacctccct gctcatcacc attcctctgg tgcccactgc taccggtgcc ttcctcacct     6120

tctttttcat cttgtctcac aacttcgatg gctcggagcg aatccccgac aagaactgca     6180

aggtcaagag ctccgagaag gacgttgaag ccgatcagat cgactggtac agagctcagg     6240

tggagacctc ttccacctac ggtggaccca ttgccatgtt ctttactggc ggtctcaact     6300

tccagatcga gcatcacctc tttcctcgaa tgtcgtcttg gcactatccc ttcgtgcagc     6360

aagctgtccg agagtgttgc gaacgacacg gagttcggta cgtcttctac cctaccattg     6420

tgggcaacat catttccacc ctcaagtaca tgcacaaagt cggtgtggtt cactgtgtca     6480

aggacgctca ggattcctaa gcggccgcaa gtgtggatgg ggaagtgagt gcccggttct     6540

gtgtgcacaa ttggcaatcc aagatggatg gattcaacac agggatatag cgagctacgt     6600

ggtggtgcga ggatatagca acggatattt atgtttgaca cttgagaatg tacgatacaa     6660

gcactgtcca agtacaatac taaacatact gtacatactc atactcgtac ccgggcaacg     6720

gtttcacttg agtgcagtgg ctagtgctct tactcgtaca gtgtgcaata ctgcgtatca     6780

tagtctttga tgtatatcgt attcattcat gttagttgcg tacgacagac ggacaatgac     6840

gagccaggtt cgcccaactg acggaatctc ctatagtctt ctcatcaagc gcataaatat     6900

agtatataaa gaattacatg gtgggccaca gcttggcgat agtatgacgg tctgggctcc     6960

agatcacacc catgagggat gccgtgcgag tagatgtaga ggtggtgctt ccaggttgtc     7020

cccaacaaca atcgctactg gaaaataccg tgaaagtacg tacacccaac tctgtgactc     7080

gcgttgactt tcatctttca atttggtcgt gtcttacggt gcatgtgctg tattggacat     7140

gaagaagggg ttggcaagct gaaccatgcc tgaggtgctg gagtctgata ctgtctagca     7200

gagggcgatc aaaagatcac catccacacg tttgtttcca tattcaatct ccaatttcca     7260

acatctcctt ggctaatagc cactttgaga gatatttggt atcgcgtatt cacttgtctc     7320

atatcccgtg tttctgtttc accaaggctt ggacgatatt atcgcgagaa aaaccgtatt     7380

gaggtattta ttagagtcga gagaggagga acaggtattc ggaagatgga aaagctgtac     7440

agagcgagaa acgatcccat gtcgacgctc ctgatggctt atactgagac atggaatcag     7500

gatcttgccc ttcttcacag ctctacttgc agctacagaa ttgctttttc tcgtatccga     7560

tcataattcc gctgctatga taggcagcca ccgatacaat ataggagaga gtataggtat     7620

tgactctata gacactgtat gatgctgaca aagctgactg agggattcga ctcagcttcg     7680

ccgttgcttg atctccaatt aattatcgta ggcgcgccag ctgcattaat gaatcggcca     7740

acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc     7800

gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg     7860

gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa     7920

ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga     7980

cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag     8040

ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct     8100

taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg     8160

ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc     8220

ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt     8280

aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta     8340

tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaagaac     8400

agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc     8460

ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat     8520

tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc     8580

tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt     8640

cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta     8700

aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct     8760

atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg     8820

cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga     8880

tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt     8940

atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt     9000

taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt     9060

tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat     9120

gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc     9180

cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc     9240

cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat     9300

gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc cacatagcag     9360

aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt     9420

accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc     9480

ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa     9540

gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg     9600

aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa     9660

taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgatg cggtgtgaaa     9720

taccgcacag atgcgtaagg agaaaatacc gcatcaggaa attgtaagcg ttaatatttt     9780

gttaaaattc gcgttaaatt tttgttaaat cagctcattt tttaaccaat aggccgaaat     9840

cggcaaaatc ccttataaat caaaagaata gaccgagata gggttgagtg ttgttccagt     9900

ttggaacaag agtccactat taaagaacgt ggactccaac gtcaaagggc gaaaaaccgt     9960

ctatcagggc gatggcccac tacgtgaacc atcaccctaa tcaagttttt tggggtcgag    10020

gtgccgtaaa gcactaaatc ggaaccctaa agggagcccc cgatttagag cttgacgggg    10080

aaagccggcg aacgtggcga gaaaggaagg gaagaaagcg aaaggagcgg gcgctagggc    10140

gctggcaagt gtagcggtca cgctgcgcgt aaccaccaca cccgccgcgc ttaatgcgcc    10200

gctacagggc gcgtccattc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg    10260

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt    10320

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattgtaa    10380

tacgactcac tatagggcga attgggcccg acgtcgcatg cattaccaac gttggcgcgc    10440

cacaacgtca gaaactctct tccatccgat aagccgtgtc taactatacg accctcaaac    10500

aaggtcagcc agaccaactt ttagcactcg caaaacacac acccaaaccc aggccgcgac    10560

gcagccggag cccgtctact tatactgttt cgccgcagac gccaagtgac acaaaaagcg    10620

cacaaaccct cccccccagt ggcggcgcac atccacaaca aacgacccat aaacggcact    10680

cccacacgga aacaaaacgt ttgtcgggtt actcctcacg ccaagacaca ctcacacaca    10740

caacctccca cctttcatca ccactcaact acgggtacag tagtggtaga aaacacagag    10800

tccagagtcc acacagacac cgcacagagt ctgtcacaca agcaccatgc caacaagcag    10860

accgttcatt tcgctgctgt ttggaagctt ctccacgtcg tatcaacgca acaccaccac    10920

acagacctcc gcatcgacag ccaccgtcac caccggcccc acgtcgcaga tgtacaagat    10980

gctcgacaac cactcgatac ctgagcagcc ccagcagccg accctgactc agacacagcc    11040

agcggtgcag acggcgcaga cggcgcagac ggcgcagcag gcaagtggga ctgcggacct    11100

cagcgaatcg ccacacgaca agctgtggat tggaggccga tcaaacgacg gacgggaaaa    11160

gttctaccgg ctggaaccgg ccaagcgaag ggcatccttc gatcgcattt ctctggatca    11220

ggtgtctatt tgatgaggcg ccaaacagtg gccagaccaa gtctggtatc tcaacccctc    11280

aaaattgata accaagtttc tggtggagtt caaaccaagg ggagctacgg agccgttcac    11340

aaggtgtcca cagaggttca aagcacattg caccaagacc gccatcatcc ttaattaaaa    11400

ggcgttgaaa cagaatgagc cagacagcaa ggacaaggtg gccaacagca aggagtccaa    11460

aaagccctct attgacgaga tccacgatgt tattgctcat gaggtttccg agctcgatgc    11520

tgggaagaag aagtgatttg tatataagaa ataaatgaga tatagtaaag gagtgcaaga    11580

gaatggcaag gtggtcaaat tctatattac ttgcagtcac tggttcctcg ttgacatgaa    11640

tgaagttacc gttggcatag ctgatttaat atataactgt ccaactaact ctcacctaga    11700

tataacccat gtgtgtgttt ccaatcatca atgcggccgc ttaggaatcc tgtgcgtcct    11760

tcacgcagtg gacgacaccc accttatgca tgtacttcag ggtggagatg atgttgccga    11820

cgatggtagg gtagaaaaca tatcgcactc catgtcgttc gcaacactcc cggaccgcct    11880

gctggacgaa ggggtagtgc caagacgaca tccggggaaa gaggtggtgc tcgatctgga    11940

aattgagacc gccagtgaag aacatggcga tggggccacc gtatgtggag gacgtctcca    12000

cctgcgcccg ataccagtca atttggtcag cctcaacgtc cttctcagat cgcttaacct    12060

tgcagttctt gtcggggatc cgttcggagc catcaaaatt gtgggacaaa atgaagaaga    12120

acgtcaagaa ggcaccagtt gcggtgggca ccagaggaat ggtgatcagc aatgaggtcc    12180

cagtgaggta ccagggcaag aatgcggtcc ggaagatgaa gaaagctcgc atcacgaatg    12240

caagtgtgcg gtagcgccgc agaaagccat tttcccgcat ggccgtgaca gactctggga    12300

tggtgtcatt gtgctgcatc cggaaaatgt agagcgggtt gtacaccagc gataccccgt    12360

atgcccccag cacaaggtac atgtaccaag cctggaagcg gtggaaccac tttcgggcgg    12420

tgtttgcggc ggggtagttg tggaacacca ggaacggctc tgccgactcc gcatccaggt    12480

cgaggccgtg ctggttggtg taggtgtggt gccgcatgat gtgcgactgc agccaaatcc    12540

accgggacga tccgatgacg tcaatgccgt aggcgaagag ggcgttgacc caaggcttct    12600

tgctgatggc cccgtgaccc gcatcatgct ggatggataa gccgatctgc atgtgcatga    12660

atccagtcaa caggccccac agcaccatcc ccgtggtagc ccagtgccac tcgcaaaagg    12720

ccgtcacggc gatgatccca acggtgcgca gccagaagcc aggggttgcg taccagttcc    12780

tccctttcat cacctcgcgc attgtccgct taacgtcttg tgcgaaggga gaatcatacg    12840

tgtagacaat gtccttcgag gtgcggtcat gcttccctcg gacgaattgc tgcaggagct    12900

tggagttctc gggcttgacg tagggatgca tagagtagaa gaggggagag gcgtcggagg    12960

ctccagaggc cagaatgagg tctccaccac catgcacctt ggccaggccc tcgaggtcgt    13020

acaggatacc atcgatagcg acgagatcag gtcgctccag gagctgttcg gtagtgagag    13080

acagggccat ggccattgct gtagatatgt cttgtgtgta agggggttgg ggtggttgtt    13140

tgtgttcttg acttttgtgt tagcaaggga agacgggcaa aaaagtgagt gtggttggga    13200

gggagagacg agccttatat ataatgcttg tttgtgtttg tgcaagtgga cgccgaaacg    13260

ggcaggagcc aaactaaaca aggcagacaa tgcgagctta attggattgc ctgatgggca    13320

ggggttaggg ctcgatcaat gggggtgcga agtgacaaaa ttgggaatta ggttcgcaag    13380

caaggctgac aagactttgg cccaaacatt tgtacgcggt ggacaacagg agccacccat    13440

cgtctgtcac gggctagccg gtcgtgcgtc ctgtcaggct ccacctaggc tccatgccac    13500

tccatacaat cccactagtg taccgctagg ccgcttttag ctcccatcta agaccccccc    13560

aaaacctcca ctgtacagtg cactgtactg tgtggcgatc aagggcaagg gaaaaaaggc    13620

gcaaacatgc acgcatggaa tgacgtaggt aaggcgttac tagactgaaa agtggcacat    13680

ttcggcgtgc caaagggtcc taggtgcgtt tcgcgagctg ggcgccaggc caagccgctc    13740

caaaacgcct ctccgactcc ctccagcggc ctccatatcc ccatccctct ccacagcaat    13800

gttgttaagc cttgcaaacg aaaaaataga aaggctaata agcttccaat attgtggtgt    13860

acgctgcata acgcaacaat gagcgccaaa caacacacac acacagcaca cagcagcatt    13920

aaccat                                                               13926


<210>  234
<211>  13926
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pZR5AU-555M

<400>  234
cgatccctaa ctgtatacta gcattctgaa ttaatatctt cctctttgat tgttccttag       60

gcatatcagc ggggatcccg tcaacagttt tatatatcgt agttacaacc atcaacactt      120

tttggtaagt gtaccattct atactccaac tggtctgcaa ctgtacaagt agacatgtta      180

atggtagtta ataacatcta cagcagaacc tatggtaaag acattgcatt tttacaggaa      240

gtatcgtcct acacgttgat aaatccaaag atgcggaact tcttccactt ttatcatcat      300

cccctactcg tacactcgta ctctttgttc gatcgcgatt catttctata aataatcttg      360

tatgtacatg cggccgctta gaactcctgg gcatcctgaa tggtgtgagc aacacccacc      420

ttctgcatgt atcgaagagt ggacatgatg ttctcgagaa tggtaggata gtatgtgtaa      480

cggacgccat gtcgcttgca acactcccga acggtgtctt gaatgaaggg atagtgccaa      540

gaggacattc ggggaaacaa gtgatgctcg atctgcaggt tgagaccgcc tgtaagcatc      600

attcccacgc tgccaccgta ggtggcagca gtctcggcct gtgctcggta ccagtcgact      660

cgatcctctt ccacgtctct gacctccttg ccttccttag ctcgtttggc cttgcagttc      720

ttctctgggg tccgttcggc tccctcgaag ttatgggaca gaacgaacac aaaggtgagg      780

aagatgcctg tgaccgtggg caccagaaga atggtgagca gaggagaggc accagcaatg      840

taccagggaa gaaaggcgtt tcggaagatg aagagcagtc gaagggaaac ggcgaaagct      900

cgctgccgat tgaaaaagct gcctggtctg agagaagtgg cctcgggaat ggtgtcgttg      960

tgctgcattg tgaagaggta catgggattg tacaccatcg agacaccata cattcccaga     1020

acgatgtaca tgtaccacgc ctgaaatcgg tggtaccact ttcgagaggc gttggtagca     1080

ggataggagt ggaacaagat gaacggctct gccgaggcag catccaggtc gaggccatgc     1140

tggttggtgt aagtgtgatg tcgcataatg tgggactgca gccagatcca acgagaggaa     1200

ccaatggcgt cgatgccata agcgaaaaag gcgttgaccc agggcttttt gctgatggct     1260

ccgtgagaac catcgtgttg aatggccaag ccaatctgcg agtggaagta tccagtaaag     1320

ataccccaca tcactgctcc ggttgtgatc cagtaccact cgccaatggc agtacaagcg     1380

atgatgagtg cggttctcag ccagaagcca ggagtggcgt accagtttcg acccttcatg     1440

acctctcgaa cggactgctt gacgtcctga gcaaagggag agtcgaaggt gtacacggga     1500

tccttggagc ttcgttcgtg cttgcctcgg gcaaattgct gcagaagctt cgagttctct     1560

ggcttgactc caggatgcat ggagtagaac agagcggttc cgtcggatgc accggcagct     1620

tcgatgaggt tgcctccggc atggaccttg gccagaccga agagatcgta caggactccg     1680

tcgatggcaa cgagctcggg gtgttccagg agctgctcgg tagtcaggga gatggtggcc     1740

atggttgtga attagggtgg tgagaatggt tggttgtagg gaagaatcaa aggccggtct     1800

cgggatccgt gggtatatat atatatatat atatatacga tccttcgtta cctccctgtt     1860

ctcaaaactg tggtttttcg tttttcgttt tttgcttttt ttgatttttt tagggccaac     1920

taagcttcca gatttcgcta atcacctttg tactaattac aagaaaggaa gaagctgatt     1980

agagttgggc tttttatgca actgtgctac tccttatctc tgatatgaaa gtgtagaccc     2040

aatcacatca tgtcatttag agttggtaat actgggagga tagataaggc acgaaaacga     2100

gccatagcag acatgctggg tgtagccaag cagaagaaag tagatgggag ccaattgacg     2160

agcgagggag ctacgccaat ccgacatacg acacgctgag atcgtcttgg ccggggggta     2220

cctacagatg tccaagggta agtgcttgac tgtaattgta tgtctgagga caaatatgta     2280

gtcagccgta taaagtcata ccaggcacca gtgccatcat cgaaccacta actctctatg     2340

atacatgcct ccggtattat tgtaccatgc gtcgctttgt tacatacgta tcttgccttt     2400

ttctctcaga aactccagac tttggctatt ggtcgagata agcccggacc atagtgagtc     2460

tttcacactc tgtttaaaca agctaccacc acactcgttg ggtgcagtcg acgagtatct     2520

gtctgactcg tcattgccgc ctttggagta cgactccaac tatgagtgtg cttggatcac     2580

tttgacgata cattcttcgt tggaggctgt gggtctgaca gctgcgtttt cggcgcggtt     2640

ggccgacaac aatatcagct gcaacgtcat tgctggcttt catcatgatc acatttttgt     2700

cggcaaaggc gacgcccaga gagccattga cgttctttct aatttggacc gatagccgta     2760

tagtccagtc tatctataag ttcaactaac tcgtaactat taccataaca tatacttcac     2820

tgccccagat aaggttccga taaaaagttc tgcagactaa atttatttca gtctcctctt     2880

caccaccaaa atgccctcct acgaagctcg agctaacgtc cacaagtccg cctttgccgc     2940

tcgagtgctc aagctcgtgg cagccaagaa aaccaacctg tgtgcttctc tggatgttac     3000

caccaccaag gagctcattg agcttgccga taaggtcgga ccttatgtgt gcatgatcaa     3060

aacccatatc gacatcattg acgacttcac ctacgccggc actgtgctcc ccctcaagga     3120

acttgctctt aagcacggtt tcttcctgtt cgaggacaga aagttcgcag atattggcaa     3180

cactgtcaag caccagtacc ggtgtcaccg aatcgccgag tggtccgata tcaccaacgc     3240

ccacggtgta cccggaaccg gaatcattgc tggcctgcga gctggtgccg aggaaactgt     3300

ctctgaacag aagaaggagg acgtctctga ctacgagaac tcccagtaca aggagttcct     3360

agtcccctct cccaacgaga agctggccag aggtctgctc atgctggccg agctgtcttg     3420

caagggctct ctggccactg gcgagtactc caagcagacc attgagcttg cccgatccga     3480

ccccgagttt gtggttggct tcattgccca gaaccgacct aagggcgact ctgaggactg     3540

gcttattctg acccccgggg tgggtcttga cgacaaggga gacgctctcg gacagcagta     3600

ccgaactgtt gaggatgtca tgtctaccgg aacggatatc ataattgtcg gccgaggtct     3660

gtacggccag aaccgagatc ctattgagga ggccaagcga taccagaagg ctggctggga     3720

ggcttaccag aagattaact gttagaggtt agactatgga tatgtaattt aactgtgtat     3780

atagagagcg tgcaagtatg gagcgcttgt tcagcttgta tgatggtcag acgacctgtc     3840

tgatcgagta tgtatgatac tgcacaacct gtgtatccgc atgatctgtc caatggggca     3900

tgttgttgtg tttctcgata cggagatgct gggtacagtg ctaatacgtt gaactactta     3960

tacttatatg aggctcgaag aaagctgact tgtgtatgac ttattctcaa ctacatcccc     4020

agtcacaata ccaccactgc actaccacta caccaaaacc atgatcaaac caccgatgga     4080

cttcctggag gcagaagaac ttgttatgga aaagctcaag agagagaatt caggagagac     4140

cgggttggcg gcgtatttgt gtcccaaaaa acagccccaa ttgccccgga gaagacggcc     4200

aggccgccta gatgacaaat tcaacaactc acagctgact ttctgccatt gccactaggg     4260

gggggccttt ttatatggcc aagccaagct ctccacgtcg gttgggctgc acccaacaat     4320

aaatgggtag ggttgcacca acaaagggat gggatggggg gtagaagata cgaggataac     4380

ggggctcaat ggcacaaata agaacgaata ctgccattaa gactcgtgat ccagcgactg     4440

acaccattgc atcatctaag ggcctcaaaa ctacctcgga actgctgcgc tgatctggac     4500

accacagagg ttccgagcac tttaggttgc accaaatgtc ccaccaggtg caggcagaaa     4560

acgctggaac agcgtgtaca gtttgtctta acaaaaagtg agggcgctga ggtcgagcag     4620

ggtggtgtga cttgttatag cctttagagc tgcgaaagcg cgtatggatt tggctcatca     4680

ggccagattg agggtctgtg gacacatgtc atgttagtgt acttcaatcg ccccctggat     4740

atagccccga caataggccg tggcctcatt tttttgcctt ccgcacattt ccattgctcg     4800

gtacccacac cttgcttctc ctgcacttgc caaccttaat actggtttac attgaccaac     4860

atcttacaag cggggggctt gtctagggta tatataaaca gtggctctcc caatcggttg     4920

ccagtctctt ttttcctttc tttccccaca gattcgaaat ctaaactaca catcacacaa     4980

tgcctgttac tgacgtcctt aagcgaaagt ccggtgtcat cgtcggcgac gatgtccgag     5040

ccgtgagtat ccacgacaag atcagtgtcg agacgacgcg ttttgtgtaa tgacacaatc     5100

cgaaagtcgc tagcaacaca cactctctac acaaactaac ccagctctcc atggctctct     5160

cccttactac cgagcagctg ctcgagcgac ccgacctggt tgccatcgac ggcattctct     5220

acgatctgga aggtcttgcc aaggtccatc ccggatccga cttgatcctc gcttctggtg     5280

cctccgatgc ttctcctctg ttctactcca tgcaccctta cgtcaagccc gagaactcga     5340

agctgcttca acagttcgtg cgaggcaagc acgaccgaac ctccaaggac attgtctaca     5400

cctacgactc tccctttgca caggacgtca agcgaactat gcgagaggtc atgaaaggtc     5460

ggaactggta tgccacacct ggattctggc tgcgaaccgt tggcatcatt gctgtcaccg     5520

ccttttgcga gtggcactgg gctactaccg gaatggtgct gtggggtctc ttgactggat     5580

tcatgcacat gcagatcggc ctgtccattc agcacgatgg ttctcatggt gccatcagca     5640

aaaagccctg ggtcaacgct ctctttgcct acggcatcga cgtcattgga tcgtccagat     5700

ggatctggct gcagtctcac atcatgcgac atcacaccta caccaatcag catggtctcg     5760

acctggatgc cgagtccgca gaaccattcc ttgtgttcca caactaccct gctgccaaca     5820

ctgctcgaaa gtggtttcac cgattccagg cctggtacat gtacctcgtg cttggagcct     5880

acggcgtttc gctggtgtac aaccctctct acatcttccg aatgcagcac aacgacacca     5940

ttcccgagtc tgtcacagcc atgcgagaga acggctttct gcgacggtac cgaacccttg     6000

cattcgttat gcgagctttc ttcatctttc gaaccgcctt cttgccctgg tatctcactg     6060

gaacctccct gctcatcacc attcctctgg tgcccactgc taccggtgcc ttcctcacct     6120

tctttttcat cttgtctcac aacttcgatg gctcggagcg aatccccgac aagaactgca     6180

aggtcaagag ctccgagaag gacgttgaag ccgatcagat cgactggtac agagctcagg     6240

tggagacctc ttccacctac ggtggaccca ttgccatgtt ctttactggc ggtctcaact     6300

tccagatcga gcatcacctc tttcctcgaa tgtcgtcttg gcactatccc ttcgtgcagc     6360

aagctgtccg agagtgttgc gaacgacacg gagttcggta cgtcttctac cctaccattg     6420

tgggcaacat catttccacc ctcaagtaca tgcacaaagt cggtgtggtt cactgtgtca     6480

aggacgctca ggattcctaa gcggccgcaa gtgtggatgg ggaagtgagt gcccggttct     6540

gtgtgcacaa ttggcaatcc aagatggatg gattcaacac agggatatag cgagctacgt     6600

ggtggtgcga ggatatagca acggatattt atgtttgaca cttgagaatg tacgatacaa     6660

gcactgtcca agtacaatac taaacatact gtacatactc atactcgtac ccgggcaacg     6720

gtttcacttg agtgcagtgg ctagtgctct tactcgtaca gtgtgcaata ctgcgtatca     6780

tagtctttga tgtatatcgt attcattcat gttagttgcg tacgacagac ggacaatgac     6840

gagccaggtt cgcccaactg acggaatctc ctatagtctt ctcatcaagc gcataaatat     6900

agtatataaa gaattacatg gtgggccaca gcttggcgat agtatgacgg tctgggctcc     6960

agatcacacc catgagggat gccgtgcgag tagatgtaga ggtggtgctt ccaggttgtc     7020

cccaacaaca atcgctactg gaaaataccg tgaaagtacg tacacccaac tctgtgactc     7080

gcgttgactt tcatctttca atttggtcgt gtcttacggt gcatgtgctg tattggacat     7140

gaagaagggg ttggcaagct gaaccatgcc tgaggtgctg gagtctgata ctgtctagca     7200

gagggcgatc aaaagatcac catccacacg tttgtttcca tattcaatct ccaatttcca     7260

acatctcctt ggctaatagc cactttgaga gatatttggt atcgcgtatt cacttgtctc     7320

atatcccgtg tttctgtttc accaaggctt ggacgatatt atcgcgagaa aaaccgtatt     7380

gaggtattta ttagagtcga gagaggagga acaggtattc ggaagatgga aaagctgtac     7440

agagcgagaa acgatcccat gtcgacgctc ctgatggctt atactgagac atggaatcag     7500

gatcttgccc ttcttcacag ctctacttgc agctacagaa ttgctttttc tcgtatccga     7560

tcataattcc gctgctatga taggcagcca ccgatacaat ataggagaga gtataggtat     7620

tgactctata gacactgtat gatgctgaca aagctgactg agggattcga ctcagcttcg     7680

ccgttgcttg atctccaatt aattatcgta ggcgcgccag ctgcattaat gaatcggcca     7740

acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc     7800

gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg     7860

gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa     7920

ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga     7980

cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag     8040

ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct     8100

taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg     8160

ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc     8220

ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt     8280

aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta     8340

tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaagaac     8400

agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc     8460

ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat     8520

tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc     8580

tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt     8640

cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta     8700

aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct     8760

atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg     8820

cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga     8880

tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt     8940

atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt     9000

taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt     9060

tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat     9120

gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc     9180

cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc     9240

cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat     9300

gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc cacatagcag     9360

aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt     9420

accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc     9480

ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa     9540

gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg     9600

aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa     9660

taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgatg cggtgtgaaa     9720

taccgcacag atgcgtaagg agaaaatacc gcatcaggaa attgtaagcg ttaatatttt     9780

gttaaaattc gcgttaaatt tttgttaaat cagctcattt tttaaccaat aggccgaaat     9840

cggcaaaatc ccttataaat caaaagaata gaccgagata gggttgagtg ttgttccagt     9900

ttggaacaag agtccactat taaagaacgt ggactccaac gtcaaagggc gaaaaaccgt     9960

ctatcagggc gatggcccac tacgtgaacc atcaccctaa tcaagttttt tggggtcgag    10020

gtgccgtaaa gcactaaatc ggaaccctaa agggagcccc cgatttagag cttgacgggg    10080

aaagccggcg aacgtggcga gaaaggaagg gaagaaagcg aaaggagcgg gcgctagggc    10140

gctggcaagt gtagcggtca cgctgcgcgt aaccaccaca cccgccgcgc ttaatgcgcc    10200

gctacagggc gcgtccattc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg    10260

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt    10320

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattgtaa    10380

tacgactcac tatagggcga attgggcccg acgtcgcatg cattaccaac gttggcgcgc    10440

cacaacgtca gaaactctct tccatccgat aagccgtgtc taactatacg accctcaaac    10500

aaggtcagcc agaccaactt ttagcactcg caaaacacac acccaaaccc aggccgcgac    10560

gcagccggag cccgtctact tatactgttt cgccgcagac gccaagtgac acaaaaagcg    10620

cacaaaccct cccccccagt ggcggcgcac atccacaaca aacgacccat aaacggcact    10680

cccacacgga aacaaaacgt ttgtcgggtt actcctcacg ccaagacaca ctcacacaca    10740

caacctccca cctttcatca ccactcaact acgggtacag tagtggtaga aaacacagag    10800

tccagagtcc acacagacac cgcacagagt ctgtcacaca agcaccatgc caacaagcag    10860

accgttcatt tcgctgctgt ttggaagctt ctccacgtcg tatcaacgca acaccaccac    10920

acagacctcc gcatcgacag ccaccgtcac caccggcccc acgtcgcaga tgtacaagat    10980

gctcgacaac cactcgatac ctgagcagcc ccagcagccg accctgactc agacacagcc    11040

agcggtgcag acggcgcaga cggcgcagac ggcgcagcag gcaagtggga ctgcggacct    11100

cagcgaatcg ccacacgaca agctgtggat tggaggccga tcaaacgacg gacgggaaaa    11160

gttctaccgg ctggaaccgg ccaagcgaag ggcatccttc gatcgcattt ctctggatca    11220

ggtgtctatt tgatgaggcg ccaaacagtg gccagaccaa gtctggtatc tcaacccctc    11280

aaaattgata accaagtttc tggtggagtt caaaccaagg ggagctacgg agccgttcac    11340

aaggtgtcca cagaggttca aagcacattg caccaagacc gccatcatcc ttaattaaaa    11400

ggcgttgaaa cagaatgagc cagacagcaa ggacaaggtg gccaacagca aggagtccaa    11460

aaagccctct attgacgaga tccacgatgt tattgctcat gaggtttccg agctcgatgc    11520

tgggaagaag aagtgatttg tatataagaa ataaatgaga tatagtaaag gagtgcaaga    11580

gaatggcaag gtggtcaaat tctatattac ttgcagtcac tggttcctcg ttgacatgaa    11640

tgaagttacc gttggcatag ctgatttaat atataactgt ccaactaact ctcacctaga    11700

tataacccat gtgtgtgttt ccaatcatca atgcggccgc ttaggaatcc tgtgcgtcct    11760

tcacgcagtg gacgacaccc accttatgca tgtacttcag ggtggagatg atgttgccga    11820

cgatggtagg gtagaaaaca tatcgcactc catgtcgttc gcaacactcc cggaccgcct    11880

gctggacgaa ggggtagtgc caagacgaca tccggggaaa gaggtggtgc tcgatctgga    11940

aattgagacc gccagtgaag aacatggcga tggggccacc gtatgtggag gacgtctcca    12000

cctgcgcccg ataccagtca atttggtcag cctcaacgtc cttctcagag ctcttaacct    12060

tgcagttctt gtcggggatc cgttcggagc catcaaaatt gtgggacaaa atgaagaaga    12120

acgtcaagaa ggcaccagtt gcggtgggca ccagaggaat ggtgatcagc aatgaggtcc    12180

cagtgaggta ccagggcaag aatgcggtcc ggaagatgaa gaaagctcgc atcacgaatg    12240

caagtgtgcg gtagcgccgc agaaagccat tttcccgcat ggccgtgaca gactctggga    12300

tggtgtcatt gtgctgcatc cggaaaatgt agagcgggtt gtacaccagc gataccccgt    12360

atgcccccag cacaaggtac atgtaccaag cctggaagcg gtggaaccac tttcgggcgg    12420

tgtttgcggc ggggtagttg tggaacacca ggaacggctc tgccgactcc gcatccaggt    12480

cgaggccgtg ctggttggtg taggtgtggt gccgcatgat gtgcgactgc agccaaatcc    12540

accgggacga tccgatgacg tcaatgccgt aggcgaagag ggcgttgacc caaggcttct    12600

tgctgatggc cccgtgaccc gcatcatgct ggatggataa gccgatctgc atgtgcatga    12660

atccagtcaa caggccccac agcaccatcc ccgtggtagc ccagtgccac tcgcaaaagg    12720

ccgtcacggc gatgatccca acggtgcgca gccagaagcc aggggttgcg taccagttcc    12780

tccctttcat cacctcgcgc attgtccgct taacgtcttg tgcgaaggga gaatcatacg    12840

tgtagacaat gtccttcgag gtgcggtcat gcttccctcg gacgaattgc tgcaggagct    12900

tggagttctc gggcttgacg tagggatgca tagagtagaa gaggggagag gcgtcggagg    12960

ctccagaggc cagaatgagg tctccaccac catgcacctt ggccaggccc tcgaggtcgt    13020

acaggatacc atcgatagcg acgagatcag gtcgctccag gagctgttcg gtagtgagag    13080

acagggccat ggccattgct gtagatatgt cttgtgtgta agggggttgg ggtggttgtt    13140

tgtgttcttg acttttgtgt tagcaaggga agacgggcaa aaaagtgagt gtggttggga    13200

gggagagacg agccttatat ataatgcttg tttgtgtttg tgcaagtgga cgccgaaacg    13260

ggcaggagcc aaactaaaca aggcagacaa tgcgagctta attggattgc ctgatgggca    13320

ggggttaggg ctcgatcaat gggggtgcga agtgacaaaa ttgggaatta ggttcgcaag    13380

caaggctgac aagactttgg cccaaacatt tgtacgcggt ggacaacagg agccacccat    13440

cgtctgtcac gggctagccg gtcgtgcgtc ctgtcaggct ccacctaggc tccatgccac    13500

tccatacaat cccactagtg taccgctagg ccgcttttag ctcccatcta agaccccccc    13560

aaaacctcca ctgtacagtg cactgtactg tgtggcgatc aagggcaagg gaaaaaaggc    13620

gcaaacatgc acgcatggaa tgacgtaggt aaggcgttac tagactgaaa agtggcacat    13680

ttcggcgtgc caaagggtcc taggtgcgtt tcgcgagctg ggcgccaggc caagccgctc    13740

caaaacgcct ctccgactcc ctccagcggc ctccatatcc ccatccctct ccacagcaat    13800

gttgttaagc cttgcaaacg aaaaaataga aaggctaata agcttccaat attgtggtgt    13860

acgctgcata acgcaacaat gagcgccaaa caacacacac acacagcaca cagcagcatt    13920

aaccat                                                               13926


<210>  235
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDGNH motif

<400>  235

His Asp Gly Asn His 
1               5   


<210>  236
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDANH motif

<400>  236

His Asp Ala Asn His 
1               5   


<210>  237
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HEFAH motif

<400>  237

His Glu Phe Ala His 
1               5   


<210>  238
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HEFTH motif

<400>  238

His Glu Phe Thr His 
1               5   


<210>  239
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HEMGH motif

<400>  239

His Glu Met Gly His 
1               5   


<210>  240
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HEAGH motif

<400>  240

His Glu Ala Gly His 
1               5   


<210>  241
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDFGH motif

<400>  241

His Asp Phe Gly His 
1               5   


<210>  242
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDYGH motif

<400>  242

His Asp Tyr Gly His 
1               5   


<210>  243
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDSCH motif

<400>  243

His Asp Ser Cys His 
1               5   


<210>  244
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDACH motif

<400>  244

His Asp Ala Cys His 
1               5   


<210>  245
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EgD5-5

<400>  245
ctcaaacatt ctctccattg gtcc                                              24


<210>  246
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EgD5 M1-3

<400>  246
gaatcaaatc tcctcctcca tgaactttgg caagc                                  35


<210>  247
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EgD5 M1-5

<400>  247
gcttgccaaa gttcatggag gaggagattt gattc                                  35


<210>  248
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EgD5-3

<400>  248
tattctgttt ccgccaaagg tacatg                                            26


<210>  249
<211>  1347
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1347)

<400>  249
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat gga gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aat tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg tcc cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc caa gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag aga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cgc cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc                                                                   1347
Ser                                                                       
                                                                          


<210>  250
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  250

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ser His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  251
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EgD5 M2-3

<400>  251
cttgctgatg gccccgtggc ccgcatcatg ctggatgg                               38


<210>  252
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EgD5 M2-5

<400>  252
ccatccagca tgatgcgggc cacggggcca tcagcaag                               38


<210>  253
<211>  5050
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pLF336

<400>  253
cctgaattcc agcacactgg cggccgttac tagtggatcc gagctcggta ccaagcttga       60

tgcatagctt gagtattcta acgcgtcacc taaatagctt ggcgtaatca tggtcatagc      120

tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca      180

taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct      240

cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac      300

gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc      360

tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt      420

tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg      480

ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg      540

agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat      600

accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta      660

ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct      720

gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc      780

ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa      840

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg      900

taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag      960

tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt     1020

gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta     1080

cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc     1140

agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca     1200

cctagatcct tttaaattaa aaatgaagtt ttagcacgtg tcagtcctgc tcctcggcca     1260

cgaagtgcac gcagttgccg gccgggtcgc gcagggcgaa ctcccgcccc cacggctgct     1320

cgccgatctc ggtcatggcc ggcccggagg cgtcccggaa gttcgtggac acgacctccg     1380

accactcggc gtacagctcg tccaggccgc gcacccacac ccaggccagg gtgttgtccg     1440

gcaccacctg gtcctggacc gcgctgatga acagggtcac gtcgtcccgg accacaccgg     1500

cgaagtcgtc ctccacgaag tcccgggaga acccgagccg gtcggtccag aactcgaccg     1560

ctccggcgac gtcgcgcgcg gtgagcaccg gaacggcact ggtcaacttg gccatggtgg     1620

ccctcctcac gtgctattat tgaagcattt atcagggtta ttgtctcatg agcggataca     1680

tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag     1740

tgccacctga tgcggtgtga aataccgcac agatgcgtaa ggagaaaata ccgcatcagg     1800

aaattgtaag cgttaataat tcagaagaac tcgtcaagaa ggcgatagaa ggcgatgcgc     1860

tgcgaatcgg gagcggcgat accgtaaagc acgaggaagc ggtcagccca ttcgccgcca     1920

agctcttcag caatatcacg ggtagccaac gctatgtcct gatagcggtc cgccacaccc     1980

agccggccac agtcgatgaa tccagaaaag cggccatttt ccaccatgat attcggcaag     2040

caggcatcgc catgggtcac gacgagatcc tcgccgtcgg gcatgctcgc cttgagcctg     2100

gcgaacagtt cggctggcgc gagcccctga tgctcttcgt ccagatcatc ctgatcgaca     2160

agaccggctt ccatccgagt acgtgctcgc tcgatgcgat gtttcgcttg gtggtcgaat     2220

gggcaggtag ccggatcaag cgtatgcagc cgccgcattg catcagccat gatggatact     2280

ttctcggcag gagcaaggtg agatgacagg agatcctgcc ccggcacttc gcccaatagc     2340

agccagtccc ttcccgcttc agtgacaacg tcgagcacag ctgcgcaagg aacgcccgtc     2400

gtggccagcc acgatagccg cgctgcctcg tcttgcagtt cattcagggc accggacagg     2460

tcggtcttga caaaaagaac cgggcgcccc tgcgctgaca gccggaacac ggcggcatca     2520

gagcagccga ttgtctgttg tgcccagtca tagccgaata gcctctccac ccaagcggcc     2580

ggagaacctg cgtgcaatcc atcttgttca atcatgcgaa acgatcctca tcctgtctct     2640

tgatcagagc ttgatcccct gcgccatcag atccttggcg gcaagaaagc catccagttt     2700

actttgcagg gcttcccaac cttaccagag ggcgccccag ctggcaattc cggttcgctt     2760

gctgtccata aaaccgccca gtctagctat cgccatgtaa gcccactgca agctacctgc     2820

tttctctttg cgcttgcgtt ttcccttgtc cagatagccc agtagctgac attcatccgg     2880

ggtcagcacc gtttctgcgg actggctttc tacgtgaaaa ggatctaggt gaagatcctt     2940

tttgataatc tcatgcctga catttatatt ccccagaaca tcaggttaat ggcgtttttg     3000

atgtcatttt cgcggtggct gagatcagcc acttcttccc cgataacgga gaccggcaca     3060

ctggccatat cggtggtcat catgcgccag ctttcatccc cgatatgcac caccgggtaa     3120

agttcacggg agactttatc tgacagcaga cgtgcactgg ccagggggat caccatccgt     3180

cgccccggcg tgtcaataat atcactctgt acatccacaa acagacgata acggctctct     3240

cttttatagg tgtaaacctt aaactgccgt acgtataggc tgcgcaactg ttgggaaggg     3300

cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg     3360

cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt     3420

gaattgtaat acgactcact atagggcgaa ttgggccctc tagatgcatg ctcgagcggc     3480

cgccagtgtg atggatatct gcagaattca ggctcaaaca ttctctccat tggtccttaa     3540

acactcatca gtcatcaccg cggccgcaaa ccatggctct cagtcttacc acagaacagc     3600

tgttagaacg ccctgatttg gttgcgattg atggcatcct ctacgacctt gaagggcttg     3660

ccaaagttca tggaggagga gatttgattc tcgcttctgg tgcctctgat gcctcccctc     3720

tcttttattc aatgcatcca tacgtcaaac cggagaattc caaattgctt caacagttcg     3780

tccgagggaa gcatgaccgc acctcgaagg acattgtcta cacgtatgat tctcccttcg     3840

cacaagacgt taagcggaca atgcgcgagg tgatgaaagg gaggaactgg tacgcaaccc     3900

ctggcttctg gctgcgcacc gttgggatca tcgccgtgac ggccttttgc gagtggcact     3960

gggctaccac ggggatggtg ctgtggggcc tgttgactgg attcatgcac atgcagatcg     4020

gcttatccat ccagcatgat gcgggccacg gggccatcag caagaagcct tgggtcaacg     4080

ccctcttcgc ctacggcatt gacgtcatcg gatcgtcccg gtggatttgg ctgcagtcgc     4140

acatcatgcg gcaccacacc tacaccaacc agcacggcct cgacctggat gcggagtcgg     4200

cagagccgtt cctggtgttc cacaactacc ccgccgcaaa caccgcccga aagtggttcc     4260

accgcttcca agcttggtac atgtaccttg tgctgggggc atacggggta tcgctggtgt     4320

acaacccgct ctacattttc cggatgcagc acaatgacac catcccagag tctgtcacgg     4380

ccatgcggga aaatggcttt ctgcggcgct accgcacact tgcattcgtg atgcgagctt     4440

tcttcatctt ccggaccgca ttcttgccct ggtacctcac tgggacctca ttgctgatca     4500

ccattcctct ggtgcccacc gcaactggtg ccttcttgac gttcttcttc attttgtccc     4560

acaattttga tggctccgaa cggatccccg acaagaactg caaggttaag agatctgaga     4620

aggacgttga ggctgaccaa attgactggt atcgggcgca ggtggagacg tcctccacat     4680

acggtggccc catcgccatg ttcttcactg gcggtctcaa tttccagatc gagcaccacc     4740

tctttccccg gatgtcgtct tggcactacc ccttcgtcca gcaggcggtc cgggagtgtt     4800

gcgaacgcca tggagtgcga tatgttttct accctaccat cgtcggcaac atcatctcca     4860

ccctgaagta catgcataag gtgggtgtcg tccactgcgt gaaggacgca caggattcct     4920

aagcggccgc atttcgcacc aaatcaatga aagtaataat gaaaagtctg aataagaata     4980

cttaggctta gatgcctttg ttacttgtgt aaaataactt gagtcatgta cctttggcgg     5040

aaacagaata                                                            5050


<210>  254
<211>  1347
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1347)

<400>  254
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat gga gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aat tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg ggc cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc caa gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag aga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cgc cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc                                                                   1347
Ser                                                                       
                                                                          


<210>  255
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  255

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Gly His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  256
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EgD5 M3-3

<400>  256
cttgctgatg gccccgtggg ccgcatcatg ctggatgg                               38


<210>  257
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EgD5 M3-5

<400>  257
ccatccagca tgatgcggcc cacggggcca tcagcaag                               38


<210>  258
<211>  5050
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pLF337

<400>  258
cctgaattcc agcacactgg cggccgttac tagtggatcc gagctcggta ccaagcttga       60

tgcatagctt gagtattcta acgcgtcacc taaatagctt ggcgtaatca tggtcatagc      120

tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca      180

taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct      240

cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac      300

gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc      360

tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt      420

tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg      480

ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg      540

agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat      600

accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta      660

ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct      720

gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc      780

ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa      840

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg      900

taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag      960

tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt     1020

gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta     1080

cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc     1140

agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca     1200

cctagatcct tttaaattaa aaatgaagtt ttagcacgtg tcagtcctgc tcctcggcca     1260

cgaagtgcac gcagttgccg gccgggtcgc gcagggcgaa ctcccgcccc cacggctgct     1320

cgccgatctc ggtcatggcc ggcccggagg cgtcccggaa gttcgtggac acgacctccg     1380

accactcggc gtacagctcg tccaggccgc gcacccacac ccaggccagg gtgttgtccg     1440

gcaccacctg gtcctggacc gcgctgatga acagggtcac gtcgtcccgg accacaccgg     1500

cgaagtcgtc ctccacgaag tcccgggaga acccgagccg gtcggtccag aactcgaccg     1560

ctccggcgac gtcgcgcgcg gtgagcaccg gaacggcact ggtcaacttg gccatggtgg     1620

ccctcctcac gtgctattat tgaagcattt atcagggtta ttgtctcatg agcggataca     1680

tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag     1740

tgccacctga tgcggtgtga aataccgcac agatgcgtaa ggagaaaata ccgcatcagg     1800

aaattgtaag cgttaataat tcagaagaac tcgtcaagaa ggcgatagaa ggcgatgcgc     1860

tgcgaatcgg gagcggcgat accgtaaagc acgaggaagc ggtcagccca ttcgccgcca     1920

agctcttcag caatatcacg ggtagccaac gctatgtcct gatagcggtc cgccacaccc     1980

agccggccac agtcgatgaa tccagaaaag cggccatttt ccaccatgat attcggcaag     2040

caggcatcgc catgggtcac gacgagatcc tcgccgtcgg gcatgctcgc cttgagcctg     2100

gcgaacagtt cggctggcgc gagcccctga tgctcttcgt ccagatcatc ctgatcgaca     2160

agaccggctt ccatccgagt acgtgctcgc tcgatgcgat gtttcgcttg gtggtcgaat     2220

gggcaggtag ccggatcaag cgtatgcagc cgccgcattg catcagccat gatggatact     2280

ttctcggcag gagcaaggtg agatgacagg agatcctgcc ccggcacttc gcccaatagc     2340

agccagtccc ttcccgcttc agtgacaacg tcgagcacag ctgcgcaagg aacgcccgtc     2400

gtggccagcc acgatagccg cgctgcctcg tcttgcagtt cattcagggc accggacagg     2460

tcggtcttga caaaaagaac cgggcgcccc tgcgctgaca gccggaacac ggcggcatca     2520

gagcagccga ttgtctgttg tgcccagtca tagccgaata gcctctccac ccaagcggcc     2580

ggagaacctg cgtgcaatcc atcttgttca atcatgcgaa acgatcctca tcctgtctct     2640

tgatcagagc ttgatcccct gcgccatcag atccttggcg gcaagaaagc catccagttt     2700

actttgcagg gcttcccaac cttaccagag ggcgccccag ctggcaattc cggttcgctt     2760

gctgtccata aaaccgccca gtctagctat cgccatgtaa gcccactgca agctacctgc     2820

tttctctttg cgcttgcgtt ttcccttgtc cagatagccc agtagctgac attcatccgg     2880

ggtcagcacc gtttctgcgg actggctttc tacgtgaaaa ggatctaggt gaagatcctt     2940

tttgataatc tcatgcctga catttatatt ccccagaaca tcaggttaat ggcgtttttg     3000

atgtcatttt cgcggtggct gagatcagcc acttcttccc cgataacgga gaccggcaca     3060

ctggccatat cggtggtcat catgcgccag ctttcatccc cgatatgcac caccgggtaa     3120

agttcacggg agactttatc tgacagcaga cgtgcactgg ccagggggat caccatccgt     3180

cgccccggcg tgtcaataat atcactctgt acatccacaa acagacgata acggctctct     3240

cttttatagg tgtaaacctt aaactgccgt acgtataggc tgcgcaactg ttgggaaggg     3300

cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg     3360

cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt     3420

gaattgtaat acgactcact atagggcgaa ttgggccctc tagatgcatg ctcgagcggc     3480

cgccagtgtg atggatatct gcagaattca ggctcaaaca ttctctccat tggtccttaa     3540

acactcatca gtcatcaccg cggccgcaaa ccatggctct cagtcttacc acagaacagc     3600

tgttagaacg ccctgatttg gttgcgattg atggcatcct ctacgacctt gaagggcttg     3660

ccaaagttca tggaggagga gatttgattc tcgcttctgg tgcctctgat gcctcccctc     3720

tcttttattc aatgcatcca tacgtcaaac cggagaattc caaattgctt caacagttcg     3780

tccgagggaa gcatgaccgc acctcgaagg acattgtcta cacgtatgat tctcccttcg     3840

cacaagacgt taagcggaca atgcgcgagg tgatgaaagg gaggaactgg tacgcaaccc     3900

ctggcttctg gctgcgcacc gttgggatca tcgccgtgac ggccttttgc gagtggcact     3960

gggctaccac ggggatggtg ctgtggggcc tgttgactgg attcatgcac atgcagatcg     4020

gcttatccat ccagcatgat gcggcccacg gggccatcag caagaagcct tgggtcaacg     4080

ccctcttcgc ctacggcatt gacgtcatcg gatcgtcccg gtggatttgg ctgcagtcgc     4140

acatcatgcg gcaccacacc tacaccaacc agcacggcct cgacctggat gcggagtcgg     4200

cagagccgtt cctggtgttc cacaactacc ccgccgcaaa caccgcccga aagtggttcc     4260

accgcttcca agcttggtac atgtaccttg tgctgggggc atacggggta tcgctggtgt     4320

acaacccgct ctacattttc cggatgcagc acaatgacac catcccagag tctgtcacgg     4380

ccatgcggga aaatggcttt ctgcggcgct accgcacact tgcattcgtg atgcgagctt     4440

tcttcatctt ccggaccgca ttcttgccct ggtacctcac tgggacctca ttgctgatca     4500

ccattcctct ggtgcccacc gcaactggtg ccttcttgac gttcttcttc attttgtccc     4560

acaattttga tggctccgaa cggatccccg acaagaactg caaggttaag agatctgaga     4620

aggacgttga ggctgaccaa attgactggt atcgggcgca ggtggagacg tcctccacat     4680

acggtggccc catcgccatg ttcttcactg gcggtctcaa tttccagatc gagcaccacc     4740

tctttccccg gatgtcgtct tggcactacc ccttcgtcca gcaggcggtc cgggagtgtt     4800

gcgaacgcca tggagtgcga tatgttttct accctaccat cgtcggcaac atcatctcca     4860

ccctgaagta catgcataag gtgggtgtcg tccactgcgt gaaggacgca caggattcct     4920

aagcggccgc atttcgcacc aaatcaatga aagtaataat gaaaagtctg aataagaata     4980

cttaggctta gatgcctttg ttacttgtgt aaaataactt gagtcatgta cctttggcgg     5040

aaacagaata                                                            5050


<210>  259
<211>  1347
<212>  DNA
<213>  Euglena gracilis


<220>
<221>  CDS
<222>  (1)..(1347)

<400>  259
atg gct ctc agt ctt acc aca gaa cag ctg tta gaa cgc cct gat ttg         48
Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu           
1               5                   10                  15                

gtt gcg att gat ggc atc ctc tac gac ctt gaa ggg ctt gcc aaa gtt         96
Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val           
            20                  25                  30                    

cat gga gga gga gat ttg att ctc gct tct ggt gcc tct gat gcc tcc        144
His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser           
        35                  40                  45                        

cct ctc ttt tat tca atg cat cca tac gtc aaa ccg gag aat tcc aaa        192
Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys           
    50                  55                  60                            

ttg ctt caa cag ttc gtc cga ggg aag cat gac cgc acc tcg aag gac        240
Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp           
65                  70                  75                  80            

att gtc tac acg tat gat tct ccc ttc gca caa gac gtt aag cgg aca        288
Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr           
                85                  90                  95                

atg cgc gag gtg atg aaa ggg agg aac tgg tac gca acc cct ggc ttc        336
Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe           
            100                 105                 110                   

tgg ctg cgc acc gtt ggg atc atc gcc gtg acg gcc ttt tgc gag tgg        384
Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp           
        115                 120                 125                       

cac tgg gct acc acg ggg atg gtg ctg tgg ggc ctg ttg act gga ttc        432
His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe           
    130                 135                 140                           

atg cac atg cag atc ggc tta tcc atc cag cat gat gcg gcc cac ggg        480
Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ala His Gly           
145                 150                 155                 160           

gcc atc agc aag aag cct tgg gtc aac gcc ctc ttc gcc tac ggc att        528
Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile           
                165                 170                 175               

gac gtc atc gga tcg tcc cgg tgg att tgg ctg cag tcg cac atc atg        576
Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met           
            180                 185                 190                   

cgg cac cac acc tac acc aac cag cac ggc ctc gac ctg gat gcg gag        624
Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu           
        195                 200                 205                       

tcg gca gag ccg ttc ctg gtg ttc cac aac tac ccc gcc gca aac acc        672
Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr           
    210                 215                 220                           

gcc cga aag tgg ttc cac cgc ttc caa gct tgg tac atg tac ctt gtg        720
Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val           
225                 230                 235                 240           

ctg ggg gca tac ggg gta tcg ctg gtg tac aac ccg ctc tac att ttc        768
Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe           
                245                 250                 255               

cgg atg cag cac aat gac acc atc cca gag tct gtc acg gcc atg cgg        816
Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg           
            260                 265                 270                   

gaa aat ggc ttt ctg cgg cgc tac cgc aca ctt gca ttc gtg atg cga        864
Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg           
        275                 280                 285                       

gct ttc ttc atc ttc cgg acc gca ttc ttg ccc tgg tac ctc act ggg        912
Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly           
    290                 295                 300                           

acc tca ttg ctg atc acc att cct ctg gtg ccc acc gca act ggt gcc        960
Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala           
305                 310                 315                 320           

ttc ttg acg ttc ttc ttc att ttg tcc cac aat ttt gat ggc tcc gaa       1008
Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu           
                325                 330                 335               

cgg atc ccc gac aag aac tgc aag gtt aag aga tct gag aag gac gtt       1056
Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val           
            340                 345                 350                   

gag gct gac caa att gac tgg tat cgg gcg cag gtg gag acg tcc tcc       1104
Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser           
        355                 360                 365                       

aca tac ggt ggc ccc atc gcc atg ttc ttc act ggc ggt ctc aat ttc       1152
Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe           
    370                 375                 380                           

cag atc gag cac cac ctc ttt ccc cgg atg tcg tct tgg cac tac ccc       1200
Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro           
385                 390                 395                 400           

ttc gtc cag cag gcg gtc cgg gag tgt tgc gaa cgc cat gga gtg cga       1248
Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg           
                405                 410                 415               

tat gtt ttc tac cct acc atc gtc ggc aac atc atc tcc acc ctg aag       1296
Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys           
            420                 425                 430                   

tac atg cat aag gtg ggt gtc gtc cac tgc gtg aag gac gca cag gat       1344
Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp           
        435                 440                 445                       

tcc                                                                   1347
Ser                                                                       
                                                                          


<210>  260
<211>  449
<212>  PRT
<213>  Euglena gracilis

<400>  260

Met Ala Leu Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu 
1               5                   10                  15      


Val Ala Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 
            20                  25                  30          


His Gly Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 
        35                  40                  45              


Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys 
    50                  55                  60                  


Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys Asp 
65                  70                  75                  80  


Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val Lys Arg Thr 
                85                  90                  95      


Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly Phe 
            100                 105                 110         


Trp Leu Arg Thr Val Gly Ile Ile Ala Val Thr Ala Phe Cys Glu Trp 
        115                 120                 125             


His Trp Ala Thr Thr Gly Met Val Leu Trp Gly Leu Leu Thr Gly Phe 
    130                 135                 140                 


Met His Met Gln Ile Gly Leu Ser Ile Gln His Asp Ala Ala His Gly 
145                 150                 155                 160 


Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 
                165                 170                 175     


Asp Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 
            180                 185                 190         


Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu 
        195                 200                 205             


Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala Asn Thr 
    210                 215                 220                 


Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr Met Tyr Leu Val 
225                 230                 235                 240 


Leu Gly Ala Tyr Gly Val Ser Leu Val Tyr Asn Pro Leu Tyr Ile Phe 
                245                 250                 255     


Arg Met Gln His Asn Asp Thr Ile Pro Glu Ser Val Thr Ala Met Arg 
            260                 265                 270         


Glu Asn Gly Phe Leu Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 
        275                 280                 285             


Ala Phe Phe Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 
    290                 295                 300                 


Thr Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala 
305                 310                 315                 320 


Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser Glu 
                325                 330                 335     


Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Arg Ser Glu Lys Asp Val 
            340                 345                 350         


Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln Val Glu Thr Ser Ser 
        355                 360                 365             


Thr Tyr Gly Gly Pro Ile Ala Met Phe Phe Thr Gly Gly Leu Asn Phe 
    370                 375                 380                 


Gln Ile Glu His His Leu Phe Pro Arg Met Ser Ser Trp His Tyr Pro 
385                 390                 395                 400 


Phe Val Gln Gln Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 
                405                 410                 415     


Tyr Val Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 
            420                 425                 430         


Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp 
        435                 440                 445             


Ser 
    


<210>  261
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EaD5-5

<400>  261
tgtaatacga ctcactatag g                                                 21


<210>  262
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EaD5 M1-3

<400>  262
caatgaggtt gccacctcca tgcactttcg ccagtcc                                37


<210>  263
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EaD5 M1-5

<400>  263
ggactggcga aagtgcatgg aggtggcaac ctcattg                                37


<210>  264
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EaD5-3

<400>  264
gccagtgtgc tggaattcag g                                                 21


<210>  265
<211>  1362
<212>  DNA
<213>  Euglena anabaena


<220>
<221>  CDS
<222>  (1)..(1362)

<400>  265
atg gcc acc atc tct ttg act act gag caa ctt tta gaa cac cca gaa         48
Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu           
1               5                   10                  15                

ctg gtt gca att gat ggg gtg ttg tac gat ctc ttc gga ctg gcg aaa         96
Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys           
            20                  25                  30                    

gtg cat gga ggt ggc aac ctc att gaa gcc gcc ggt gcc tcc gac gga        144
Val His Gly Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly           
        35                  40                  45                        

acc gcc ctg ttc tac tcc atg cac cct gga gtg aag cca gag aat tcg        192
Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser           
    50                  55                  60                            

aag ctg ctg cag caa ttt gcc cga ggc aaa cac gaa cga agc tcg aag        240
Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys           
65                  70                  75                  80            

gac cca gtg tac acc ttt gac agt ccc ttc gcc cag gat gtc aag cag        288
Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln           
                85                  90                  95                

agc gtt cgg gag gtc atg aag ggg cgc aac tgg tac gcc acg ccc ggc        336
Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly           
            100                 105                 110                   

ttt tgg ctg cgg acc gcg ctg atc atc gcg tgc act gcc ata ggc gaa        384
Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu           
        115                 120                 125                       

tgg tat tgg atc act acc ggg gca gtg atg tgg ggc atc ttc acc ggg        432
Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly           
    130                 135                 140                           

tac ttc cac agc cag att ggg ttg gcg att caa cac gat gcc tct cac        480
Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Ser His           
145                 150                 155                 160           

gga gcc atc agc aaa aag ccc tgg gtg aac gcc ttt ttc gcc tac ggc        528
Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly           
                165                 170                 175               

atc gac gcc att gga tcc tcc cgc tgg atc tgg ctg cag tcc cac att        576
Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile           
            180                 185                 190                   

atg cgc cac cac acc tac acc aac cag cat ggc ctg gac ctg gac gct        624
Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala           
        195                 200                 205                       

gcc tcg gcg gag ccg ttc att ttg ttc cac tcc tac ccg gca aca aat        672
Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn           
    210                 215                 220                           

gcg tca cga aag tgg tac cat cgg ttc cag gcg tgg tac atg tac atc        720
Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile           
225                 230                 235                 240           

gtt ttg ggg atg tat ggt gtg tcg atg gtg tac aat ccg atg tac ttg        768
Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu           
                245                 250                 255               

ttc acg atg cag cac aac gac aca atc cca gag gcc acc tct ctt aga        816
Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg           
            260                 265                 270                   

cca ggc agc ttt ttc aac cgg cag cgc gcc ttc gcc gtt tcc ctc cgc        864
Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg           
        275                 280                 285                       

cta ctg ttc atc ttc cgc aac gcc ttc ctc ccc tgg tac atc gcg ggc        912
Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly           
    290                 295                 300                           

gcc tct ccg ctg ctc acc atc ctg ctg gtg cca acg gtc aca ggc atc        960
Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile           
305                 310                 315                 320           

ttc ttg aca ttt gtt ttt gtg ctg tcc cat aac ttt gaa ggc gct gag       1008
Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu           
                325                 330                 335               

cgg acc ccc gaa aag aac tgc aag gcc aaa agg gcc aag gag ggg aag       1056
Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys           
            340                 345                 350                   

gag gtc cgc gat gta gag gag gac cgg gtg gac tgg tac cgg gcg cag       1104
Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln           
        355                 360                 365                       

gcc gag acc gcg gcg acc tac ggg ggc agc gtc ggg atg atg ctg acc       1152
Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr           
    370                 375                 380                           

ggc ggt ttg aac ctg cag atc gag cac cac ttg ttc ccc cgc atg tcc       1200
Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser           
385                 390                 395                 400           

tct tgg cac tac ccc ttc atc caa gat acg gtg cgg gaa tgt tgc aag       1248
Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys           
                405                 410                 415               

cgc cat ggc gtg cgc tac aca tac tac ccg acc atc ctg gag aat ata       1296
Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile           
            420                 425                 430                   

atg tcc acg ctc cgc tac atg cag aag gtg ggc gtg gcc cac aca att       1344
Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile           
        435                 440                 445                       

cag gat gcc cag gaa ttc                                               1362
Gln Asp Ala Gln Glu Phe                                                   
    450                                                                   


<210>  266
<211>  454
<212>  PRT
<213>  Euglena anabaena

<400>  266

Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu 
1               5                   10                  15      


Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys 
            20                  25                  30          


Val His Gly Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly 
        35                  40                  45              


Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser 
    50                  55                  60                  


Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys 
65                  70                  75                  80  


Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln 
                85                  90                  95      


Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly 
            100                 105                 110         


Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu 
        115                 120                 125             


Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly 
    130                 135                 140                 


Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Ser His 
145                 150                 155                 160 


Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly 
                165                 170                 175     


Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile 
            180                 185                 190         


Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala 
        195                 200                 205             


Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn 
    210                 215                 220                 


Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile 
225                 230                 235                 240 


Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu 
                245                 250                 255     


Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg 
            260                 265                 270         


Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg 
        275                 280                 285             


Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly 
    290                 295                 300                 


Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile 
305                 310                 315                 320 


Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu 
                325                 330                 335     


Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys 
            340                 345                 350         


Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln 
        355                 360                 365             


Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr 
    370                 375                 380                 


Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser 
385                 390                 395                 400 


Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys 
                405                 410                 415     


Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile 
            420                 425                 430         


Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile 
        435                 440                 445             


Gln Asp Ala Gln Glu Phe 
    450                 


<210>  267
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EaD5 M2-3

<400>  267
gctgatggct ccgtgaccgg catcgtgttg aatcg                                  35


<210>  268
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EaD5 M2-5

<400>  268
cgattcaaca cgatgccggt cacggagcca tcagc                                  35


<210>  269
<211>  5007
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pLF338

<400>  269
cctgaattcc agcacactgg cggccgttac tagtggatcc gagctcggta ccaagcttga       60

tgcatagctt gagtattcta acgcgtcacc taaatagctt ggcgtaatca tggtcatagc      120

tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca      180

taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct      240

cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac      300

gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc      360

tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt      420

tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg      480

ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg      540

agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat      600

accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta      660

ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct      720

gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc      780

ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa      840

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg      900

taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag      960

tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt     1020

gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta     1080

cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc     1140

agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca     1200

cctagatcct tttaaattaa aaatgaagtt ttagcacgtg tcagtcctgc tcctcggcca     1260

cgaagtgcac gcagttgccg gccgggtcgc gcagggcgaa ctcccgcccc cacggctgct     1320

cgccgatctc ggtcatggcc ggcccggagg cgtcccggaa gttcgtggac acgacctccg     1380

accactcggc gtacagctcg tccaggccgc gcacccacac ccaggccagg gtgttgtccg     1440

gcaccacctg gtcctggacc gcgctgatga acagggtcac gtcgtcccgg accacaccgg     1500

cgaagtcgtc ctccacgaag tcccgggaga acccgagccg gtcggtccag aactcgaccg     1560

ctccggcgac gtcgcgcgcg gtgagcaccg gaacggcact ggtcaacttg gccatggtgg     1620

ccctcctcac gtgctattat tgaagcattt atcagggtta ttgtctcatg agcggataca     1680

tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag     1740

tgccacctga tgcggtgtga aataccgcac agatgcgtaa ggagaaaata ccgcatcagg     1800

aaattgtaag cgttaataat tcagaagaac tcgtcaagaa ggcgatagaa ggcgatgcgc     1860

tgcgaatcgg gagcggcgat accgtaaagc acgaggaagc ggtcagccca ttcgccgcca     1920

agctcttcag caatatcacg ggtagccaac gctatgtcct gatagcggtc cgccacaccc     1980

agccggccac agtcgatgaa tccagaaaag cggccatttt ccaccatgat attcggcaag     2040

caggcatcgc catgggtcac gacgagatcc tcgccgtcgg gcatgctcgc cttgagcctg     2100

gcgaacagtt cggctggcgc gagcccctga tgctcttcgt ccagatcatc ctgatcgaca     2160

agaccggctt ccatccgagt acgtgctcgc tcgatgcgat gtttcgcttg gtggtcgaat     2220

gggcaggtag ccggatcaag cgtatgcagc cgccgcattg catcagccat gatggatact     2280

ttctcggcag gagcaaggtg agatgacagg agatcctgcc ccggcacttc gcccaatagc     2340

agccagtccc ttcccgcttc agtgacaacg tcgagcacag ctgcgcaagg aacgcccgtc     2400

gtggccagcc acgatagccg cgctgcctcg tcttgcagtt cattcagggc accggacagg     2460

tcggtcttga caaaaagaac cgggcgcccc tgcgctgaca gccggaacac ggcggcatca     2520

gagcagccga ttgtctgttg tgcccagtca tagccgaata gcctctccac ccaagcggcc     2580

ggagaacctg cgtgcaatcc atcttgttca atcatgcgaa acgatcctca tcctgtctct     2640

tgatcagagc ttgatcccct gcgccatcag atccttggcg gcaagaaagc catccagttt     2700

actttgcagg gcttcccaac cttaccagag ggcgccccag ctggcaattc cggttcgctt     2760

gctgtccata aaaccgccca gtctagctat cgccatgtaa gcccactgca agctacctgc     2820

tttctctttg cgcttgcgtt ttcccttgtc cagatagccc agtagctgac attcatccgg     2880

ggtcagcacc gtttctgcgg actggctttc tacgtgaaaa ggatctaggt gaagatcctt     2940

tttgataatc tcatgcctga catttatatt ccccagaaca tcaggttaat ggcgtttttg     3000

atgtcatttt cgcggtggct gagatcagcc acttcttccc cgataacgga gaccggcaca     3060

ctggccatat cggtggtcat catgcgccag ctttcatccc cgatatgcac caccgggtaa     3120

agttcacggg agactttatc tgacagcaga cgtgcactgg ccagggggat caccatccgt     3180

cgccccggcg tgtcaataat atcactctgt acatccacaa acagacgata acggctctct     3240

cttttatagg tgtaaacctt aaactgccgt acgtataggc tgcgcaactg ttgggaaggg     3300

cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg     3360

cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt     3420

gaattgtaat acgactcact atagggcgaa ttgggccctc tagatgcatg ctcgagcggc     3480

cgccagtgtg atggatatct gcagaattca ggtgtaatac gactcactat agggcgaatt     3540

gggccctcta gatgcatgct cgagcggccg ccagtgtgat ggatatctgc agaattcagg     3600

agcggccgca ccatggccac catctctttg actactgagc aacttttaga acacccagaa     3660

ctggttgcaa ttgatggggt gttgtacgat ctcttcggac tggcgaaagt gcatggaggt     3720

ggcaacctca ttgaagccgc cggtgcctcc gacggaaccg ccctgttcta ctccatgcac     3780

cctggagtga agccagagaa ttcgaagctg ctgcagcaat ttgcccgagg caaacacgaa     3840

cgaagctcga aggacccagt gtacaccttt gacagtccct tcgcccagga tgtcaagcag     3900

agcgttcggg aggtcatgaa ggggcgcaac tggtacgcca cgcccggctt ttggctgcgg     3960

accgcgctga tcatcgcgtg cactgccata ggcgaatggt attggatcac taccggggca     4020

gtgatgtggg gcatcttcac cgggtacttc cacagccaga ttgggttggc gattcaacac     4080

gatgccggtc acggagccat cagcaaaaag ccctgggtga acgccttttt cgcctacggc     4140

atcgacgcca ttggatcctc ccgctggatc tggctgcagt cccacattat gcgccaccac     4200

acctacacca accagcatgg cctggacctg gacgctgcct cggcggagcc gttcattttg     4260

ttccactcct acccggcaac aaatgcgtca cgaaagtggt accatcggtt ccaggcgtgg     4320

tacatgtaca tcgttttggg gatgtatggt gtgtcgatgg tgtacaatcc gatgtacttg     4380

ttcacgatgc agcacaacga cacaatccca gaggccacct ctcttagacc aggcagcttt     4440

ttcaaccggc agcgcgcctt cgccgtttcc ctccgcctac tgttcatctt ccgcaacgcc     4500

ttcctcccct ggtacatcgc gggcgcctct ccgctgctca ccatcctgct ggtgccaacg     4560

gtcacaggca tcttcttgac atttgttttt gtgctgtccc ataactttga aggcgctgag     4620

cggacccccg aaaagaactg caaggccaaa agggccaagg aggggaagga ggtccgcgat     4680

gtagaggagg accgggtgga ctggtaccgg gcgcaggccg agaccgcggc gacctacggg     4740

ggcagcgtcg ggatgatgct gaccggcggt ttgaacctgc agatcgagca ccacttgttc     4800

ccccgcatgt cctcttggca ctaccccttc atccaagata cggtgcggga atgttgcaag     4860

cgccatggcg tgcgctacac atactacccg accatcctgg agaatataat gtccacgctc     4920

cgctacatgc agaaggtggg cgtggcccac acaattcagg atgcccagga attctgagcg     4980

gccgcacctg aattccagca cactggc                                         5007


<210>  270
<211>  1362
<212>  DNA
<213>  Euglena anabaena


<220>
<221>  CDS
<222>  (1)..(1362)

<400>  270
atg gcc acc atc tct ttg act act gag caa ctt tta gaa cac cca gaa         48
Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu           
1               5                   10                  15                

ctg gtt gca att gat ggg gtg ttg tac gat ctc ttc gga ctg gcg aaa         96
Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys           
            20                  25                  30                    

gtg cat gga ggt ggc aac ctc att gaa gcc gcc ggt gcc tcc gac gga        144
Val His Gly Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly           
        35                  40                  45                        

acc gcc ctg ttc tac tcc atg cac cct gga gtg aag cca gag aat tcg        192
Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser           
    50                  55                  60                            

aag ctg ctg cag caa ttt gcc cga ggc aaa cac gaa cga agc tcg aag        240
Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys           
65                  70                  75                  80            

gac cca gtg tac acc ttt gac agt ccc ttc gcc cag gat gtc aag cag        288
Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln           
                85                  90                  95                

agc gtt cgg gag gtc atg aag ggg cgc aac tgg tac gcc acg ccc ggc        336
Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly           
            100                 105                 110                   

ttt tgg ctg cgg acc gcg ctg atc atc gcg tgc act gcc ata ggc gaa        384
Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu           
        115                 120                 125                       

tgg tat tgg atc act acc ggg gca gtg atg tgg ggc atc ttc acc ggg        432
Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly           
    130                 135                 140                           

tac ttc cac agc cag att ggg ttg gcg att caa cac gat gcc ggt cac        480
Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Gly His           
145                 150                 155                 160           

gga gcc atc agc aaa aag ccc tgg gtg aac gcc ttt ttc gcc tac ggc        528
Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly           
                165                 170                 175               

atc gac gcc att gga tcc tcc cgc tgg atc tgg ctg cag tcc cac att        576
Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile           
            180                 185                 190                   

atg cgc cac cac acc tac acc aac cag cat ggc ctg gac ctg gac gct        624
Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala           
        195                 200                 205                       

gcc tcg gcg gag ccg ttc att ttg ttc cac tcc tac ccg gca aca aat        672
Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn           
    210                 215                 220                           

gcg tca cga aag tgg tac cat cgg ttc cag gcg tgg tac atg tac atc        720
Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile           
225                 230                 235                 240           

gtt ttg ggg atg tat ggt gtg tcg atg gtg tac aat ccg atg tac ttg        768
Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu           
                245                 250                 255               

ttc acg atg cag cac aac gac aca atc cca gag gcc acc tct ctt aga        816
Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg           
            260                 265                 270                   

cca ggc agc ttt ttc aac cgg cag cgc gcc ttc gcc gtt tcc ctc cgc        864
Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg           
        275                 280                 285                       

cta ctg ttc atc ttc cgc aac gcc ttc ctc ccc tgg tac atc gcg ggc        912
Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly           
    290                 295                 300                           

gcc tct ccg ctg ctc acc atc ctg ctg gtg cca acg gtc aca ggc atc        960
Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile           
305                 310                 315                 320           

ttc ttg aca ttt gtt ttt gtg ctg tcc cat aac ttt gaa ggc gct gag       1008
Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu           
                325                 330                 335               

cgg acc ccc gaa aag aac tgc aag gcc aaa agg gcc aag gag ggg aag       1056
Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys           
            340                 345                 350                   

gag gtc cgc gat gta gag gag gac cgg gtg gac tgg tac cgg gcg cag       1104
Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln           
        355                 360                 365                       

gcc gag acc gcg gcg acc tac ggg ggc agc gtc ggg atg atg ctg acc       1152
Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr           
    370                 375                 380                           

ggc ggt ttg aac ctg cag atc gag cac cac ttg ttc ccc cgc atg tcc       1200
Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser           
385                 390                 395                 400           

tct tgg cac tac ccc ttc atc caa gat acg gtg cgg gaa tgt tgc aag       1248
Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys           
                405                 410                 415               

cgc cat ggc gtg cgc tac aca tac tac ccg acc atc ctg gag aat ata       1296
Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile           
            420                 425                 430                   

atg tcc acg ctc cgc tac atg cag aag gtg ggc gtg gcc cac aca att       1344
Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile           
        435                 440                 445                       

cag gat gcc cag gaa ttc                                               1362
Gln Asp Ala Gln Glu Phe                                                   
    450                                                                   


<210>  271
<211>  454
<212>  PRT
<213>  Euglena anabaena

<400>  271

Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu 
1               5                   10                  15      


Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys 
            20                  25                  30          


Val His Gly Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly 
        35                  40                  45              


Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser 
    50                  55                  60                  


Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys 
65                  70                  75                  80  


Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln 
                85                  90                  95      


Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly 
            100                 105                 110         


Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu 
        115                 120                 125             


Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly 
    130                 135                 140                 


Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Gly His 
145                 150                 155                 160 


Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly 
                165                 170                 175     


Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile 
            180                 185                 190         


Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala 
        195                 200                 205             


Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn 
    210                 215                 220                 


Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile 
225                 230                 235                 240 


Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu 
                245                 250                 255     


Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg 
            260                 265                 270         


Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg 
        275                 280                 285             


Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly 
    290                 295                 300                 


Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile 
305                 310                 315                 320 


Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu 
                325                 330                 335     


Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys 
            340                 345                 350         


Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln 
        355                 360                 365             


Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr 
    370                 375                 380                 


Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser 
385                 390                 395                 400 


Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys 
                405                 410                 415     


Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile 
            420                 425                 430         


Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile 
        435                 440                 445             


Gln Asp Ala Gln Glu Phe 
    450                 


<210>  272
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EaD5 M3-3

<400>  272
gctgatggct ccgtgagcgg catcgtgttg aatcg                                  35


<210>  273
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer EaD5 M3-5

<400>  273
cgattcaaca cgatgccgct cacggagcca tcagc                                  35


<210>  274
<211>  5007
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pLF339

<400>  274
cctgaattcc agcacactgg cggccgttac tagtggatcc gagctcggta ccaagcttga       60

tgcatagctt gagtattcta acgcgtcacc taaatagctt ggcgtaatca tggtcatagc      120

tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca      180

taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct      240

cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac      300

gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc      360

tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt      420

tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg      480

ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg      540

agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat      600

accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta      660

ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct      720

gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc      780

ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa      840

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg      900

taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag      960

tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt     1020

gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta     1080

cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc     1140

agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca     1200

cctagatcct tttaaattaa aaatgaagtt ttagcacgtg tcagtcctgc tcctcggcca     1260

cgaagtgcac gcagttgccg gccgggtcgc gcagggcgaa ctcccgcccc cacggctgct     1320

cgccgatctc ggtcatggcc ggcccggagg cgtcccggaa gttcgtggac acgacctccg     1380

accactcggc gtacagctcg tccaggccgc gcacccacac ccaggccagg gtgttgtccg     1440

gcaccacctg gtcctggacc gcgctgatga acagggtcac gtcgtcccgg accacaccgg     1500

cgaagtcgtc ctccacgaag tcccgggaga acccgagccg gtcggtccag aactcgaccg     1560

ctccggcgac gtcgcgcgcg gtgagcaccg gaacggcact ggtcaacttg gccatggtgg     1620

ccctcctcac gtgctattat tgaagcattt atcagggtta ttgtctcatg agcggataca     1680

tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag     1740

tgccacctga tgcggtgtga aataccgcac agatgcgtaa ggagaaaata ccgcatcagg     1800

aaattgtaag cgttaataat tcagaagaac tcgtcaagaa ggcgatagaa ggcgatgcgc     1860

tgcgaatcgg gagcggcgat accgtaaagc acgaggaagc ggtcagccca ttcgccgcca     1920

agctcttcag caatatcacg ggtagccaac gctatgtcct gatagcggtc cgccacaccc     1980

agccggccac agtcgatgaa tccagaaaag cggccatttt ccaccatgat attcggcaag     2040

caggcatcgc catgggtcac gacgagatcc tcgccgtcgg gcatgctcgc cttgagcctg     2100

gcgaacagtt cggctggcgc gagcccctga tgctcttcgt ccagatcatc ctgatcgaca     2160

agaccggctt ccatccgagt acgtgctcgc tcgatgcgat gtttcgcttg gtggtcgaat     2220

gggcaggtag ccggatcaag cgtatgcagc cgccgcattg catcagccat gatggatact     2280

ttctcggcag gagcaaggtg agatgacagg agatcctgcc ccggcacttc gcccaatagc     2340

agccagtccc ttcccgcttc agtgacaacg tcgagcacag ctgcgcaagg aacgcccgtc     2400

gtggccagcc acgatagccg cgctgcctcg tcttgcagtt cattcagggc accggacagg     2460

tcggtcttga caaaaagaac cgggcgcccc tgcgctgaca gccggaacac ggcggcatca     2520

gagcagccga ttgtctgttg tgcccagtca tagccgaata gcctctccac ccaagcggcc     2580

ggagaacctg cgtgcaatcc atcttgttca atcatgcgaa acgatcctca tcctgtctct     2640

tgatcagagc ttgatcccct gcgccatcag atccttggcg gcaagaaagc catccagttt     2700

actttgcagg gcttcccaac cttaccagag ggcgccccag ctggcaattc cggttcgctt     2760

gctgtccata aaaccgccca gtctagctat cgccatgtaa gcccactgca agctacctgc     2820

tttctctttg cgcttgcgtt ttcccttgtc cagatagccc agtagctgac attcatccgg     2880

ggtcagcacc gtttctgcgg actggctttc tacgtgaaaa ggatctaggt gaagatcctt     2940

tttgataatc tcatgcctga catttatatt ccccagaaca tcaggttaat ggcgtttttg     3000

atgtcatttt cgcggtggct gagatcagcc acttcttccc cgataacgga gaccggcaca     3060

ctggccatat cggtggtcat catgcgccag ctttcatccc cgatatgcac caccgggtaa     3120

agttcacggg agactttatc tgacagcaga cgtgcactgg ccagggggat caccatccgt     3180

cgccccggcg tgtcaataat atcactctgt acatccacaa acagacgata acggctctct     3240

cttttatagg tgtaaacctt aaactgccgt acgtataggc tgcgcaactg ttgggaaggg     3300

cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg     3360

cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt     3420

gaattgtaat acgactcact atagggcgaa ttgggccctc tagatgcatg ctcgagcggc     3480

cgccagtgtg atggatatct gcagaattca ggtgtaatac gactcactat agggcgaatt     3540

gggccctcta gatgcatgct cgagcggccg ccagtgtgat ggatatctgc agaattcagg     3600

agcggccgca ccatggccac catctctttg actactgagc aacttttaga acacccagaa     3660

ctggttgcaa ttgatggggt gttgtacgat ctcttcggac tggcgaaagt gcatggaggt     3720

ggcaacctca ttgaagccgc cggtgcctcc gacggaaccg ccctgttcta ctccatgcac     3780

cctggagtga agccagagaa ttcgaagctg ctgcagcaat ttgcccgagg caaacacgaa     3840

cgaagctcga aggacccagt gtacaccttt gacagtccct tcgcccagga tgtcaagcag     3900

agcgttcggg aggtcatgaa ggggcgcaac tggtacgcca cgcccggctt ttggctgcgg     3960

accgcgctga tcatcgcgtg cactgccata ggcgaatggt attggatcac taccggggca     4020

gtgatgtggg gcatcttcac cgggtacttc cacagccaga ttgggttggc gattcaacac     4080

gatgccgctc acggagccat cagcaaaaag ccctgggtga acgccttttt cgcctacggc     4140

atcgacgcca ttggatcctc ccgctggatc tggctgcagt cccacattat gcgccaccac     4200

acctacacca accagcatgg cctggacctg gacgctgcct cggcggagcc gttcattttg     4260

ttccactcct acccggcaac aaatgcgtca cgaaagtggt accatcggtt ccaggcgtgg     4320

tacatgtaca tcgttttggg gatgtatggt gtgtcgatgg tgtacaatcc gatgtacttg     4380

ttcacgatgc agcacaacga cacaatccca gaggccacct ctcttagacc aggcagcttt     4440

ttcaaccggc agcgcgcctt cgccgtttcc ctccgcctac tgttcatctt ccgcaacgcc     4500

ttcctcccct ggtacatcgc gggcgcctct ccgctgctca ccatcctgct ggtgccaacg     4560

gtcacaggca tcttcttgac atttgttttt gtgctgtccc ataactttga aggcgctgag     4620

cggacccccg aaaagaactg caaggccaaa agggccaagg aggggaagga ggtccgcgat     4680

gtagaggagg accgggtgga ctggtaccgg gcgcaggccg agaccgcggc gacctacggg     4740

ggcagcgtcg ggatgatgct gaccggcggt ttgaacctgc agatcgagca ccacttgttc     4800

ccccgcatgt cctcttggca ctaccccttc atccaagata cggtgcggga atgttgcaag     4860

cgccatggcg tgcgctacac atactacccg accatcctgg agaatataat gtccacgctc     4920

cgctacatgc agaaggtggg cgtggcccac acaattcagg atgcccagga attctgagcg     4980

gccgcacctg aattccagca cactggc                                         5007


<210>  275
<211>  1362
<212>  DNA
<213>  Euglena anabaena


<220>
<221>  CDS
<222>  (1)..(1362)

<400>  275
atg gcc acc atc tct ttg act act gag caa ctt tta gaa cac cca gaa         48
Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu           
1               5                   10                  15                

ctg gtt gca att gat ggg gtg ttg tac gat ctc ttc gga ctg gcg aaa         96
Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys           
            20                  25                  30                    

gtg cat gga ggt ggc aac ctc att gaa gcc gcc ggt gcc tcc gac gga        144
Val His Gly Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly           
        35                  40                  45                        

acc gcc ctg ttc tac tcc atg cac cct gga gtg aag cca gag aat tcg        192
Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser           
    50                  55                  60                            

aag ctg ctg cag caa ttt gcc cga ggc aaa cac gaa cga agc tcg aag        240
Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys           
65                  70                  75                  80            

gac cca gtg tac acc ttt gac agt ccc ttc gcc cag gat gtc aag cag        288
Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln           
                85                  90                  95                

agc gtt cgg gag gtc atg aag ggg cgc aac tgg tac gcc acg ccc ggc        336
Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly           
            100                 105                 110                   

ttt tgg ctg cgg acc gcg ctg atc atc gcg tgc act gcc ata ggc gaa        384
Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu           
        115                 120                 125                       

tgg tat tgg atc act acc ggg gca gtg atg tgg ggc atc ttc acc ggg        432
Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly           
    130                 135                 140                           

tac ttc cac agc cag att ggg ttg gcg att caa cac gat gcc gct cac        480
Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Ala His           
145                 150                 155                 160           

gga gcc atc agc aaa aag ccc tgg gtg aac gcc ttt ttc gcc tac ggc        528
Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly           
                165                 170                 175               

atc gac gcc att gga tcc tcc cgc tgg atc tgg ctg cag tcc cac att        576
Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile           
            180                 185                 190                   

atg cgc cac cac acc tac acc aac cag cat ggc ctg gac ctg gac gct        624
Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala           
        195                 200                 205                       

gcc tcg gcg gag ccg ttc att ttg ttc cac tcc tac ccg gca aca aat        672
Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn           
    210                 215                 220                           

gcg tca cga aag tgg tac cat cgg ttc cag gcg tgg tac atg tac atc        720
Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile           
225                 230                 235                 240           

gtt ttg ggg atg tat ggt gtg tcg atg gtg tac aat ccg atg tac ttg        768
Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu           
                245                 250                 255               

ttc acg atg cag cac aac gac aca atc cca gag gcc acc tct ctt aga        816
Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg           
            260                 265                 270                   

cca ggc agc ttt ttc aac cgg cag cgc gcc ttc gcc gtt tcc ctc cgc        864
Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg           
        275                 280                 285                       

cta ctg ttc atc ttc cgc aac gcc ttc ctc ccc tgg tac atc gcg ggc        912
Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly           
    290                 295                 300                           

gcc tct ccg ctg ctc acc atc ctg ctg gtg cca acg gtc aca ggc atc        960
Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile           
305                 310                 315                 320           

ttc ttg aca ttt gtt ttt gtg ctg tcc cat aac ttt gaa ggc gct gag       1008
Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu           
                325                 330                 335               

cgg acc ccc gaa aag aac tgc aag gcc aaa agg gcc aag gag ggg aag       1056
Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys           
            340                 345                 350                   

gag gtc cgc gat gta gag gag gac cgg gtg gac tgg tac cgg gcg cag       1104
Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln           
        355                 360                 365                       

gcc gag acc gcg gcg acc tac ggg ggc agc gtc ggg atg atg ctg acc       1152
Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr           
    370                 375                 380                           

ggc ggt ttg aac ctg cag atc gag cac cac ttg ttc ccc cgc atg tcc       1200
Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser           
385                 390                 395                 400           

tct tgg cac tac ccc ttc atc caa gat acg gtg cgg gaa tgt tgc aag       1248
Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys           
                405                 410                 415               

cgc cat ggc gtg cgc tac aca tac tac ccg acc atc ctg gag aat ata       1296
Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile           
            420                 425                 430                   

atg tcc acg ctc cgc tac atg cag aag gtg ggc gtg gcc cac aca att       1344
Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile           
        435                 440                 445                       

cag gat gcc cag gaa ttc                                               1362
Gln Asp Ala Gln Glu Phe                                                   
    450                                                                   


<210>  276
<211>  454
<212>  PRT
<213>  Euglena anabaena

<400>  276

Met Ala Thr Ile Ser Leu Thr Thr Glu Gln Leu Leu Glu His Pro Glu 
1               5                   10                  15      


Leu Val Ala Ile Asp Gly Val Leu Tyr Asp Leu Phe Gly Leu Ala Lys 
            20                  25                  30          


Val His Gly Gly Gly Asn Leu Ile Glu Ala Ala Gly Ala Ser Asp Gly 
        35                  40                  45              


Thr Ala Leu Phe Tyr Ser Met His Pro Gly Val Lys Pro Glu Asn Ser 
    50                  55                  60                  


Lys Leu Leu Gln Gln Phe Ala Arg Gly Lys His Glu Arg Ser Ser Lys 
65                  70                  75                  80  


Asp Pro Val Tyr Thr Phe Asp Ser Pro Phe Ala Gln Asp Val Lys Gln 
                85                  90                  95      


Ser Val Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala Thr Pro Gly 
            100                 105                 110         


Phe Trp Leu Arg Thr Ala Leu Ile Ile Ala Cys Thr Ala Ile Gly Glu 
        115                 120                 125             


Trp Tyr Trp Ile Thr Thr Gly Ala Val Met Trp Gly Ile Phe Thr Gly 
    130                 135                 140                 


Tyr Phe His Ser Gln Ile Gly Leu Ala Ile Gln His Asp Ala Ala His 
145                 150                 155                 160 


Gly Ala Ile Ser Lys Lys Pro Trp Val Asn Ala Phe Phe Ala Tyr Gly 
                165                 170                 175     


Ile Asp Ala Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile 
            180                 185                 190         


Met Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala 
        195                 200                 205             


Ala Ser Ala Glu Pro Phe Ile Leu Phe His Ser Tyr Pro Ala Thr Asn 
    210                 215                 220                 


Ala Ser Arg Lys Trp Tyr His Arg Phe Gln Ala Trp Tyr Met Tyr Ile 
225                 230                 235                 240 


Val Leu Gly Met Tyr Gly Val Ser Met Val Tyr Asn Pro Met Tyr Leu 
                245                 250                 255     


Phe Thr Met Gln His Asn Asp Thr Ile Pro Glu Ala Thr Ser Leu Arg 
            260                 265                 270         


Pro Gly Ser Phe Phe Asn Arg Gln Arg Ala Phe Ala Val Ser Leu Arg 
        275                 280                 285             


Leu Leu Phe Ile Phe Arg Asn Ala Phe Leu Pro Trp Tyr Ile Ala Gly 
    290                 295                 300                 


Ala Ser Pro Leu Leu Thr Ile Leu Leu Val Pro Thr Val Thr Gly Ile 
305                 310                 315                 320 


Phe Leu Thr Phe Val Phe Val Leu Ser His Asn Phe Glu Gly Ala Glu 
                325                 330                 335     


Arg Thr Pro Glu Lys Asn Cys Lys Ala Lys Arg Ala Lys Glu Gly Lys 
            340                 345                 350         


Glu Val Arg Asp Val Glu Glu Asp Arg Val Asp Trp Tyr Arg Ala Gln 
        355                 360                 365             


Ala Glu Thr Ala Ala Thr Tyr Gly Gly Ser Val Gly Met Met Leu Thr 
    370                 375                 380                 


Gly Gly Leu Asn Leu Gln Ile Glu His His Leu Phe Pro Arg Met Ser 
385                 390                 395                 400 


Ser Trp His Tyr Pro Phe Ile Gln Asp Thr Val Arg Glu Cys Cys Lys 
                405                 410                 415     


Arg His Gly Val Arg Tyr Thr Tyr Tyr Pro Thr Ile Leu Glu Asn Ile 
            420                 425                 430         


Met Ser Thr Leu Arg Tyr Met Gln Lys Val Gly Val Ala His Thr Ile 
        435                 440                 445             


Gln Asp Ala Gln Glu Phe 
    450                 


<210>  277
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDcSH motif

<400>  277

His Asp Cys Ser His 
1               5   


<210>  278
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDdSH motif

<400>  278

His Asp Asp Ser His 
1               5   


<210>  279
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDeSH motif

<400>  279

His Asp Glu Ser His 
1               5   


<210>  280
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDfSH motif

<400>  280

His Asp Phe Ser His 
1               5   


<210>  281
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDhSH motif

<400>  281

His Asp His Ser His 
1               5   


<210>  282
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDiSH motif

<400>  282

His Asp Ile Ser His 
1               5   


<210>  283
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDkSH motif

<400>  283

His Asp Lys Ser His 
1               5   


<210>  284
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDlSH motif

<400>  284

His Asp Leu Ser His 
1               5   


<210>  285
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDmSH motif

<400>  285

His Asp Met Ser His 
1               5   


<210>  286
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDnSH motif

<400>  286

His Asp Asn Ser His 
1               5   


<210>  287
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDpSH motif

<400>  287

His Asp Pro Ser His 
1               5   


<210>  288
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDqSH motif

<400>  288

His Asp Gln Ser His 
1               5   


<210>  289
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDrSH motif

<400>  289

His Asp Arg Ser His 
1               5   


<210>  290
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDtSH motif

<400>  290

His Asp Thr Ser His 
1               5   


<210>  291
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDvSH motif

<400>  291

His Asp Val Ser His 
1               5   


<210>  292
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDwSH motif

<400>  292

His Asp Trp Ser His 
1               5   


<210>  293
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDySH motif

<400>  293

His Asp Tyr Ser His 
1               5   


<210>  294
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAcH motif

<400>  294

His Asp Ala Cys His 
1               5   


<210>  295
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAdH motif

<400>  295

Val His Asp Ala Asp His 
1               5       


<210>  296
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAeH motif

<400>  296

His Asp Ala Glu His 
1               5   


<210>  297
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAfH motif

<400>  297

His Asp Ala Phe His 
1               5   


<210>  298
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAhH motif

<400>  298

His Asp Ala His His 
1               5   


<210>  299
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAiH motif

<400>  299

His Asp Ala Ile His 
1               5   


<210>  300
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAkH motif

<400>  300

His Asp Ala Lys His 
1               5   


<210>  301
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAlH motif

<400>  301

His Asp Ala Leu His 
1               5   


<210>  302
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAmH motif

<400>  302

His Asp Ala Met His 
1               5   


<210>  303
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAnH motif

<400>  303

His Asp Ala Asn His 
1               5   


<210>  304
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDApH motif

<400>  304

His Asp Ala Pro His 
1               5   


<210>  305
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAqH motif

<400>  305

His Asp Ala Gln His 
1               5   


<210>  306
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDArH motif

<400>  306

His Asp Ala Arg His 
1               5   


<210>  307
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAtH motif

<400>  307

His Asp Ala Thr His 
1               5   


<210>  308
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAvH motif

<400>  308

His Asp Ala Val His 
1               5   


<210>  309
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAwH motif

<400>  309

His Asp Ala Trp His 
1               5   


<210>  310
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDAyH motif

<400>  310

His Asp Ala Tyr His 
1               5   


<210>  311
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HDxxH motif


<220>
<221>  misc_feature
<222>  (3)..(4)
<223>  Xaa can be any naturally occurring amino acid

<400>  311

His Asp Xaa Xaa His 
1               5   


