                         SEQUENCE LISTING

<110>  SYNTHETIC GENOMICS, INC.
       AJJAWI, Imad
       SORIAGA, Leah
       AQUI, Moena
       MOELLERING, Eric R.
 
<120>  ALGAL MUTANTS WITH INCREASED LIPID PRODUCTIVITY

<130>  SGI1980-1WO

<150>  US 62/249,834
<151>  2015-11-02

<160>  78    

<170>  PatentIn version 3.5

<210>  1
<211>  3921
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo-1091 gene transcript a

<220>
<221>  misc_feature
<223>  encodes the polypeptide of SEQ ID NO:2

<400>  1
atggattcga acgcgcaaac caccagtggc accgtcgttg aaagcacggc tagcaatgga       60

gaggcttctg cgcccgcgcc catgctttcg tcctcccttc cttctccaag ctttgagtcc      120

ggcccagacc ccccccccca gttagcaagg cgggtccccg ggaacgtgcc gcttgacccc      180

tcggccgccg acgtggacga caaggaccgc gcctccagcg cctacggaga cgaacctccc      240

ctccccctcc ccctcctcac gtccacctcg atgacagcct cagaagcgag cagcggtcaa      300

ggaggggaag ctggggccgc cccaggggtg ccctcccttg cttcctcccc tgccttcgcc      360

cccgcagcta ccggcctgtc cccgtctcac tccgccggtt ccggcatgtc agtgctgatc      420

caagtgcctc aaaacgggcc cagcgaggct ctgtcgcctt tgcccttgcc gaccactgcc      480

ttggatactc ccttggacac ccggtcgtcc accccccgcc ccgcgcccgc cccagccccg      540

ccttctcctt accagactgt tggaggcctc cacggcgggg agcactcgtt ccttcctccc      600

gtcagtacgg aagggctggc ccctccggcg atgggcacgg gggaaggagg gcttgagggc      660

ggggatggag ggtcggtagg tttttatccc ccccttgccc agtcgcagac gcaactcgcg      720

ccgttgccgg gcccaccgcc tccgcaggcg caagattcgc tgcagtacaa gcctgcttcg      780

gtaccggagc cgactaggat gatggaaggg tccagtgatc ctccttttca ttcgtcggag      840

acgcccaggg cgatggggat cggccggggg ggagggaatt cgcagatggt tgcacctgcc      900

cccgcgccat cgttgcaaca gtcggcgccg ttgcaacaac gtcagcaatt gcaacctcaa      960

cagcaccaac agttccattc gcgctcccac ccacaagtag cgccactcca ggtgcagcaa     1020

cggcagcaac cgcgggcact ggtgccaggg ccccagcagc agcagcagca tcagcagcag     1080

caagctctct atgcatcttc gcaacagcag cagcaacagc agcagcagca acagcagcaa     1140

catcagcagc agcagcagca gcagcagcag caacagcagc agagacatca cccgcaccca     1200

cagcaactgc agcaacaaca gcgacacaac cagcagcagc cactccagca tccacaagca     1260

cagcatcgag tcccacccca gggcatgcct cagcaccagc acgtccgggc gccacagcaa     1320

cagcggcagc agcaactcct ccctcttcca accgcgggca atgccgtccc aggcggccag     1380

gcaaccggca ccccgcacgc gtcgcaactg cctcacgccc agctctccca acaacaacaa     1440

cccgcgcatt ccttgcccca acggcagggc ctgggcgcgc agcccctcaa cccacaggac     1500

actgccttgc ggcccggaat ggtcaagaac atcatggtct tgctccaaca acgcaaaccc     1560

gccgccgatc cttccaaacc cttggtggaa actcggttga aggagatggc gatccggctg     1620

gaggactacc tgtggaaacg ctcgtccacg ttggcggagt actcggatct gagcaccctc     1680

aaacaccgcc tgcagtgttt ggcagtctac atgggcaagc accagcagcg gggtcaaact     1740

gtaccggcgg gcgcaagggg cagaggggga gggatgccga atcaagcgcc ccagccacag     1800

gggggggggc tctctgggaa cacgaaccaa ctgcaacgtt tggtgcctac cgccaatgcc     1860

agcaatattc acctgcccaa ccctcatccc ggaggtcttt cgggtggaat gggggcggga     1920

ggcgcgcgtg tgggagggcg gggcagtggg atcggcggag gggggttgat catgcaacct     1980

gggagtgcca tccacggaca tccccccggg ccccagttgc ggggcagctc tctcccccac     2040

caagggcaag tgcaaccgac ctcgcagcag ggaagtcagc aaagaagggt gggaacgggt     2100

ctggcgcctg cgcctggcac acaacccgcg tttttaccac aggaacaaac gcaaatgcaa     2160

ggtcggcggg tagggggggg agggatgctg cccgtaaatg ggggtaacag ccaccctcct     2220

cccgcgccag gtcctccaca aggccatctg cagccgccgc agcagtcatc aggacagggg     2280

caagccgctc ccttgaacgt gatggggggg gcacagcaag tggggggggg cggtaatgcg     2340

aaccgagggc tccctatgcc tttatcttca ggccccgggg gtaccgcctc cgccagtcag     2400

aagaaacgcg tccagcacac gcccgaacaa cgtcagcaaa tcttgcacca gcagcagcag     2460

cggctgcttt acttgcgcca tgcgtccaag tgcattcatg tggacggccg ctgtccccag     2520

gggtacccga actgcatcgg gatgaaggag ctttggaagc acatcgcctc ctgtagggaa     2580

caacggtgca agttccccca ctgcgtgtcc tcgagatacg tcctgtccca ctaccacaaa     2640

tgtaaggaca cgcagtgccc ggtgtgcgga cccgtacgaa acacgatccg atcttctcgc     2700

tcctcggcgc atcccatgcc gcaacttggt cagggtgtgg cagacgccga cggaggaggc     2760

gagggaggcg gatctggagt ccagcagcag cagcagcagc agcaacaaca acaacaacaa     2820

caacaacaac aacagcaaca acaacagcaa ttggtagcac agagtaatca acgcacgcag     2880

cagcaacaaa tgttgatcgc ccagcagccc ccccccgcag ggatgggggg agggagggtc     2940

ggaggcatga ctggggccct ggcgaatgga ggcaggggtg ggagggtcgg agggagggcg     3000

cggggcaggg ggggtcaagt cgtgcttcct cagcaggttg cggccggggg gcggggaata     3060

ggcggtcaga atgtaggtgg aagtggaatg aaccagcaac gattgcagca acagcaacaa     3120

cagcagcagc aacaacaaca gcagcagcag cagcagcagc agcagcagca gcaacgccca     3180

caaaatatgg cttccgtgcc ggttcctggg gtaggacgtg ggggaggagg ggtgcgagct     3240

ggcggggaag ccctcgcctt gggcactgcg ggtggagcgg gcagcaaacc tggggcccgg     3300

agcggttcgg ggaaaatgcc agtcgtagcc aagactccga atggcctcat gatccagacg     3360

gaaacgcatg gatgggtgcc ggtagagccc acgaaaaacg gcggctaccg tcccctggtg     3420

cctctgcccg gctccggtca aagcttctca caggctgccg gcggggctgg cgcgggcgga     3480

cgtcctggcg gcgttgggag aggggtgccc ggcgtacctg ccccaccttc cgcggcagcg     3540

ttgcagcggt tcgaagactc cgtgtccttg gtgaactcct tcacggacgc acaaattaag     3600

gcgcacatgg cctctctgcg ttcaggggga gggttttgga ctcccgccaa gttgaaactt     3660

aaggttctcc ccctcgtggt aaaacagctg aaatcggagt atggatggat ttttgaagaa     3720

cccgtggacc ccgtgaagct cgggctcccg gattacttcg atgtgatcaa gcaccctatg     3780

gacttgggca ctgtacgtcg gcttgtgggg aggggagggc gaagagaggc gggagggaaa     3840

gacaatccca atggacaact gtcagtcgac gacaagggag aattggagga ggaggtcgac     3900

ggacttcagg aacttctttg a                                               3921


<210>  2
<211>  1306
<212>  PRT
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo-1091 polypeptide, isoform a

<400>  2

Met Asp Ser Asn Ala Gln Thr Thr Ser Gly Thr Val Val Glu Ser Thr 
1               5                   10                  15      


Ala Ser Asn Gly Glu Ala Ser Ala Pro Ala Pro Met Leu Ser Ser Ser 
            20                  25                  30          


Leu Pro Ser Pro Ser Phe Glu Ser Gly Pro Asp Pro Pro Pro Gln Leu 
        35                  40                  45              


Ala Arg Arg Val Pro Gly Asn Val Pro Leu Asp Pro Ser Ala Ala Asp 
    50                  55                  60                  


Val Asp Asp Lys Asp Arg Ala Ser Ser Ala Tyr Gly Asp Glu Pro Pro 
65                  70                  75                  80  


Leu Pro Leu Pro Leu Leu Thr Ser Thr Ser Met Thr Ala Ser Glu Ala 
                85                  90                  95      


Ser Ser Gly Gln Gly Gly Glu Ala Gly Ala Ala Pro Gly Val Pro Ser 
            100                 105                 110         


Leu Ala Ser Ser Pro Ala Phe Ala Pro Ala Ala Thr Gly Leu Ser Pro 
        115                 120                 125             


Ser His Ser Ala Gly Ser Gly Met Ser Val Leu Ile Gln Val Pro Gln 
    130                 135                 140                 


Asn Gly Pro Ser Glu Ala Leu Ser Pro Leu Pro Leu Pro Thr Thr Ala 
145                 150                 155                 160 


Leu Asp Thr Pro Leu Asp Thr Arg Ser Ser Thr Pro Arg Pro Ala Pro 
                165                 170                 175     


Ala Pro Ala Pro Pro Ser Pro Tyr Gln Thr Val Gly Gly Leu His Gly 
            180                 185                 190         


Gly Glu His Ser Phe Leu Pro Pro Val Ser Thr Glu Gly Leu Ala Pro 
        195                 200                 205             


Pro Ala Met Gly Thr Gly Glu Gly Gly Leu Glu Gly Gly Asp Gly Gly 
    210                 215                 220                 


Ser Val Gly Phe Tyr Pro Pro Leu Ala Gln Ser Gln Thr Gln Leu Ala 
225                 230                 235                 240 


Pro Leu Pro Gly Pro Pro Pro Pro Gln Ala Gln Asp Ser Leu Gln Tyr 
                245                 250                 255     


Lys Pro Ala Ser Val Pro Glu Pro Thr Arg Met Met Glu Gly Ser Ser 
            260                 265                 270         


Asp Pro Pro Phe His Ser Ser Glu Thr Pro Arg Ala Met Gly Ile Gly 
        275                 280                 285             


Arg Gly Gly Gly Asn Ser Gln Met Val Ala Pro Ala Pro Ala Pro Ser 
    290                 295                 300                 


Leu Gln Gln Ser Ala Pro Leu Gln Gln Arg Gln Gln Leu Gln Pro Gln 
305                 310                 315                 320 


Gln His Gln Gln Phe His Ser Arg Ser His Pro Gln Val Ala Pro Leu 
                325                 330                 335     


Gln Val Gln Gln Arg Gln Gln Pro Arg Ala Leu Val Pro Gly Pro Gln 
            340                 345                 350         


Gln Gln Gln Gln His Gln Gln Gln Gln Ala Leu Tyr Ala Ser Ser Gln 
        355                 360                 365             


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln His Gln Gln Gln 
    370                 375                 380                 


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Arg His His Pro His Pro 
385                 390                 395                 400 


Gln Gln Leu Gln Gln Gln Gln Arg His Asn Gln Gln Gln Pro Leu Gln 
                405                 410                 415     


His Pro Gln Ala Gln His Arg Val Pro Pro Gln Gly Met Pro Gln His 
            420                 425                 430         


Gln His Val Arg Ala Pro Gln Gln Gln Arg Gln Gln Gln Leu Leu Pro 
        435                 440                 445             


Leu Pro Thr Ala Gly Asn Ala Val Pro Gly Gly Gln Ala Thr Gly Thr 
    450                 455                 460                 


Pro His Ala Ser Gln Leu Pro His Ala Gln Leu Ser Gln Gln Gln Gln 
465                 470                 475                 480 


Pro Ala His Ser Leu Pro Gln Arg Gln Gly Leu Gly Ala Gln Pro Leu 
                485                 490                 495     


Asn Pro Gln Asp Thr Ala Leu Arg Pro Gly Met Val Lys Asn Ile Met 
            500                 505                 510         


Val Leu Leu Gln Gln Arg Lys Pro Ala Ala Asp Pro Ser Lys Pro Leu 
        515                 520                 525             


Val Glu Thr Arg Leu Lys Glu Met Ala Ile Arg Leu Glu Asp Tyr Leu 
    530                 535                 540                 


Trp Lys Arg Ser Ser Thr Leu Ala Glu Tyr Ser Asp Leu Ser Thr Leu 
545                 550                 555                 560 


Lys His Arg Leu Gln Cys Leu Ala Val Tyr Met Gly Lys His Gln Gln 
                565                 570                 575     


Arg Gly Gln Thr Val Pro Ala Gly Ala Arg Gly Arg Gly Gly Gly Met 
            580                 585                 590         


Pro Asn Gln Ala Pro Gln Pro Gln Gly Gly Gly Leu Ser Gly Asn Thr 
        595                 600                 605             


Asn Gln Leu Gln Arg Leu Val Pro Thr Ala Asn Ala Ser Asn Ile His 
    610                 615                 620                 


Leu Pro Asn Pro His Pro Gly Gly Leu Ser Gly Gly Met Gly Ala Gly 
625                 630                 635                 640 


Gly Ala Arg Val Gly Gly Arg Gly Ser Gly Ile Gly Gly Gly Gly Leu 
                645                 650                 655     


Ile Met Gln Pro Gly Ser Ala Ile His Gly His Pro Pro Gly Pro Gln 
            660                 665                 670         


Leu Arg Gly Ser Ser Leu Pro His Gln Gly Gln Val Gln Pro Thr Ser 
        675                 680                 685             


Gln Gln Gly Ser Gln Gln Arg Arg Val Gly Thr Gly Leu Ala Pro Ala 
    690                 695                 700                 


Pro Gly Thr Gln Pro Ala Phe Leu Pro Gln Glu Gln Thr Gln Met Gln 
705                 710                 715                 720 


Gly Arg Arg Val Gly Gly Gly Gly Met Leu Pro Val Asn Gly Gly Asn 
                725                 730                 735     


Ser His Pro Pro Pro Ala Pro Gly Pro Pro Gln Gly His Leu Gln Pro 
            740                 745                 750         


Pro Gln Gln Ser Ser Gly Gln Gly Gln Ala Ala Pro Leu Asn Val Met 
        755                 760                 765             


Gly Gly Ala Gln Gln Val Gly Gly Gly Gly Asn Ala Asn Arg Gly Leu 
    770                 775                 780                 


Pro Met Pro Leu Ser Ser Gly Pro Gly Gly Thr Ala Ser Ala Ser Gln 
785                 790                 795                 800 


Lys Lys Arg Val Gln His Thr Pro Glu Gln Arg Gln Gln Ile Leu His 
                805                 810                 815     


Gln Gln Gln Gln Arg Leu Leu Tyr Leu Arg His Ala Ser Lys Cys Ile 
            820                 825                 830         


His Val Asp Gly Arg Cys Pro Gln Gly Tyr Pro Asn Cys Ile Gly Met 
        835                 840                 845             


Lys Glu Leu Trp Lys His Ile Ala Ser Cys Arg Glu Gln Arg Cys Lys 
    850                 855                 860                 


Phe Pro His Cys Val Ser Ser Arg Tyr Val Leu Ser His Tyr His Lys 
865                 870                 875                 880 


Cys Lys Asp Thr Gln Cys Pro Val Cys Gly Pro Val Arg Asn Thr Ile 
                885                 890                 895     


Arg Ser Ser Arg Ser Ser Ala His Pro Met Pro Gln Leu Gly Gln Gly 
            900                 905                 910         


Val Ala Asp Ala Asp Gly Gly Gly Glu Gly Gly Gly Ser Gly Val Gln 
        915                 920                 925             


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
    930                 935                 940                 


Gln Gln Gln Gln Gln Gln Leu Val Ala Gln Ser Asn Gln Arg Thr Gln 
945                 950                 955                 960 


Gln Gln Gln Met Leu Ile Ala Gln Gln Pro Pro Pro Ala Gly Met Gly 
                965                 970                 975     


Gly Gly Arg Val Gly Gly Met Thr Gly Ala Leu Ala Asn Gly Gly Arg 
            980                 985                 990         


Gly Gly Arg Val Gly Gly Arg Ala  Arg Gly Arg Gly Gly  Gln Val Val 
        995                 1000                 1005             


Leu Pro  Gln Gln Val Ala Ala  Gly Gly Arg Gly Ile  Gly Gly Gln 
    1010                 1015                 1020             


Asn Val  Gly Gly Ser Gly Met  Asn Gln Gln Arg Leu  Gln Gln Gln 
    1025                 1030                 1035             


Gln Gln  Gln Gln Gln Gln Gln  Gln Gln Gln Gln Gln  Gln Gln Gln 
    1040                 1045                 1050             


Gln Gln  Gln Gln Gln Arg Pro  Gln Asn Met Ala Ser  Val Pro Val 
    1055                 1060                 1065             


Pro Gly  Val Gly Arg Gly Gly  Gly Gly Val Arg Ala  Gly Gly Glu 
    1070                 1075                 1080             


Ala Leu  Ala Leu Gly Thr Ala  Gly Gly Ala Gly Ser  Lys Pro Gly 
    1085                 1090                 1095             


Ala Arg  Ser Gly Ser Gly Lys  Met Pro Val Val Ala  Lys Thr Pro 
    1100                 1105                 1110             


Asn Gly  Leu Met Ile Gln Thr  Glu Thr His Gly Trp  Val Pro Val 
    1115                 1120                 1125             


Glu Pro  Thr Lys Asn Gly Gly  Tyr Arg Pro Leu Val  Pro Leu Pro 
    1130                 1135                 1140             


Gly Ser  Gly Gln Ser Phe Ser  Gln Ala Ala Gly Gly  Ala Gly Ala 
    1145                 1150                 1155             


Gly Gly  Arg Pro Gly Gly Val  Gly Arg Gly Val Pro  Gly Val Pro 
    1160                 1165                 1170             


Ala Pro  Pro Ser Ala Ala Ala  Leu Gln Arg Phe Glu  Asp Ser Val 
    1175                 1180                 1185             


Ser Leu  Val Asn Ser Phe Thr  Asp Ala Gln Ile Lys  Ala His Met 
    1190                 1195                 1200             


Ala Ser  Leu Arg Ser Gly Gly  Gly Phe Trp Thr Pro  Ala Lys Leu 
    1205                 1210                 1215             


Lys Leu  Lys Val Leu Pro Leu  Val Val Lys Gln Leu  Lys Ser Glu 
    1220                 1225                 1230             


Tyr Gly  Trp Ile Phe Glu Glu  Pro Val Asp Pro Val  Lys Leu Gly 
    1235                 1240                 1245             


Leu Pro  Asp Tyr Phe Asp Val  Ile Lys His Pro Met  Asp Leu Gly 
    1250                 1255                 1260             


Thr Val  Arg Arg Leu Val Gly  Arg Gly Gly Arg Arg  Glu Ala Gly 
    1265                 1270                 1275             


Gly Lys  Asp Asn Pro Asn Gly  Gln Leu Ser Val Asp  Asp Lys Gly 
    1280                 1285                 1290             


Glu Leu  Glu Glu Glu Val Asp  Gly Leu Gln Glu Leu  Leu 
    1295                 1300                 1305     


<210>  3
<211>  3894
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo-1091 gene transcript b

<220>
<221>  misc_feature
<223>  encodes the polypeptide of SEQ ID NO:4

<400>  3
atggattcga acgcgcaaac caccagtggc accgtcgttg aaagcacggc tagcaatgga       60

gaggcttctg cgcccgcgcc catgctttcg tcctcccttc cttctccaag ctttgagtcc      120

ggcccagacc ccccccccca gttagcaagg cgggtccccg ggaacgtgcc gcttgacccc      180

tcggccgccg acgtggacga caaggaccgc gcctccagcg cctacggaga cgaacctccc      240

ctccccctcc ccctcctcac gtccacctcg atgacagcct cagaagcgag cagcggtcaa      300

ggaggggaag ctggggccgc cccaggggtg ccctcccttg cttcctcccc tgccttcgcc      360

cccgcagcta ccggcctgtc cccgtctcac tccgccggtt ccggcatgtc agtgctgatc      420

caagtgcctc aaaacgggcc cagcgaggct ctgtcgcctt tgcccttgcc gaccactgcc      480

ttggatactc ccttggacac ccggtcgtcc accccccgcc ccgcgcccgc cccagccccg      540

ccttctcctt accagactgt tggaggcctc cacggcgggg agcactcgtt ccttcctccc      600

gtcagtacgg aagggctggc ccctccggcg atgggcacgg gggaaggagg gcttgagggc      660

ggggatggag ggtcgggcga tggggatcgg ccggggggag ggaattcgca gatggttgca      720

cctgcccccg cgccatcgtt gcaacagtcg gcgccgttgc aacaacgtca gcaattgcaa      780

cctcaacagc accaacagtt ccattcgcgc tcccacccac aagtagcgcc actccaggtg      840

cagcaacggc agcaaccgcg ggcactggtg ccagggcccc agcagcagca gcagcatcag      900

cagcagcaag ctctctatgc atcttcgcaa cagcagcagc aacagcagca gcagcaacag      960

cagcaacatc agcagcagca gcagcagcag cagcagcaac agcagcagag acatcacccg     1020

cacccacagc aactgcagca acaacagcga cacaaccagc agcagccact ccagcatcca     1080

caagcacagc atcgagtccc accccagggc atgcctcagc accagcacgt ccgggcgcca     1140

cagcaacagc ggcagcagca actcctccct cttccaaccg cgggcaatgc cgtcccaggc     1200

ggccaggcaa ccggcacccc gcacgcgtcg caactgcctc acgcccagct ctcccaacaa     1260

caacaacccg cgcattcctt gccccaacgg cagggcctgg gcgcgcagcc cctcaaccca     1320

caggacactg ccttgcggcc cggaatggtc aagaacatca tggtcttgct ccaacaacgc     1380

aaacccgccg ccgatccttc caaacccttg gtggaaactc ggttgaagga gatggcgatc     1440

cggctggagg actacctgtg gaaacgctcg tccacgttgg cggagtactc ggatctgagc     1500

accctcaaac accgcctgca gtgtttggca gtctacatgg gcaagcacca gcagcggggt     1560

caaactgtac cggcgggcgc aaggggcaga gggggaggga tgccgaatca agcgccccag     1620

ccacaggggg gggggctctc tgggaacacg aaccaactgc aacgtttggt gcctaccgcc     1680

aatgccagca atattcacct gcccaaccct catcccggag gtctttcggg tggaatgggg     1740

gcgggaggcg cgcgtgtggg agggcggggc agtgggatcg gcggaggggg gttgatcatg     1800

caacctggga gtgccatcca cggacatccc cccgggcccc agttgcgggg cagctctctc     1860

ccccaccaag ggcaagtgca accgacctcg cagcagggaa gtcagcaaag aagggtggga     1920

acgggtctgg cgcctgcgcc tggcacacaa cccgcgtttt taccacagga acaaacgcaa     1980

atgcaaggtc ggcgggtagg ggggggaggg atgctgcccg taaatggggg taacagccac     2040

cctcctcccg cgccaggtcc tccacaaggc catctgcagc cgccgcagca gtcatcagga     2100

caggggcaag ccgctccctt gaacgtgatg gggggggcac agcaagtggg ggggggcggt     2160

aatgcgaacc gagggctccc tatgccttta tcttcaggcc ccgggggtac cgcctccgcc     2220

agtcagaaga aacgcgtcca gcacacgccc gaacaacgtc agcaaatctt gcaccagcag     2280

cagcagcggc tgctttactt gcgccatgcg tccaagtgca ttcatgtgga cggccgctgt     2340

ccccaggggt acccgaactg catcgggatg aaggagcttt ggaagcacat cgcctcctgt     2400

agggaacaac ggtgcaagtt cccccactgc gtgtcctcga gatacgtcct gtcccactac     2460

cacaaatgta aggacacgca gtgcccggtg tgcggacccg tacgaaacac gatccgatct     2520

tctcgctcct cggcgcatcc catgccgcaa cttggtcagg gtgtggcaga cgccgacgga     2580

ggaggcgagg gaggcggatc tggagtccag cagcagcagc agcagcagca acaacaacaa     2640

caacaacaac aacaacaaca gcaacaacaa cagcaattgg tagcacagag taatcaacgc     2700

acgcagcagc aacaaatgtt gatcgcccag cagccccccc ccgcagggat ggggggaggg     2760

agggtcggag gcatgactgg ggccctggcg aatggaggca ggggtgggag ggtcggaggg     2820

agggcgcggg gcaggggggg tcaagtcgtg cttcctcagc aggttgcggc cggggggcgg     2880

ggaataggcg gtcagaatgt aggtggaagt ggaatgaacc agcaacgatt gcagcaacag     2940

caacaacagc agcagcaaca acagcagcag cagcagcagc agcagcagca gcagcagcaa     3000

cgcccacaaa atatggcttc cgtgccggtt cctggggtag gacgtggggg aggaggggtg     3060

cgagctggcg gggaagccct cgccttgggc actgcgggtg gagcgggcag caaacctggg     3120

gcccggagcg gttcggggaa aatgccagtc gtagccaaga ctccgaatgg cctcatgatc     3180

cagacggaaa cgcatggatg ggtgccggta gagcccacga aaaacggcgg ctaccgtccc     3240

ctggtgcctc tgcccggctc cggtcaaagc ttctcacagg ctgccggcgg ggctggcgcg     3300

ggcggacgtc ctggcggcgt tgggagaggg gtgcccggcg tacctgcccc accttccgcg     3360

gcagcgttgc agcggttcga agactccgtg tccttggtga actccttcac ggacgcacaa     3420

attaaggcgc acatggcctc tctgcgttca gggggagggt tttggactcc cgccaagttg     3480

aaacttaagg ttctccccct cgtggtaaaa cagctgaaat cggagtatgg atggattttt     3540

gaagaacccg tggaccccgt gaagctcggg ctcccggatt acttcgatgt gatcaagcac     3600

cctatggact tgggcactgt gaagcgtcgt ttggaaaacg gctcctacac agagctggaa     3660

aaggtggcgg cggacgtgaa gctcaccttc gacaatgcca tcctttacaa ccccccgggg     3720

caagaaatcc acaaggtaac ggacgaaaaa cgggcgggaa aagggggcag gtcaaggctg     3780

gatgaagagg cagacgagga ggttgaaaga gagaggctcg tgctaggggc ggaccggagc     3840

aatggatggt tctacgacga aaaaatggat ggttccacga cgaaaatgaa gtga           3894


<210>  4
<211>  1297
<212>  PRT
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo-1091 polypeptide, isoform b

<400>  4

Met Asp Ser Asn Ala Gln Thr Thr Ser Gly Thr Val Val Glu Ser Thr 
1               5                   10                  15      


Ala Ser Asn Gly Glu Ala Ser Ala Pro Ala Pro Met Leu Ser Ser Ser 
            20                  25                  30          


Leu Pro Ser Pro Ser Phe Glu Ser Gly Pro Asp Pro Pro Pro Gln Leu 
        35                  40                  45              


Ala Arg Arg Val Pro Gly Asn Val Pro Leu Asp Pro Ser Ala Ala Asp 
    50                  55                  60                  


Val Asp Asp Lys Asp Arg Ala Ser Ser Ala Tyr Gly Asp Glu Pro Pro 
65                  70                  75                  80  


Leu Pro Leu Pro Leu Leu Thr Ser Thr Ser Met Thr Ala Ser Glu Ala 
                85                  90                  95      


Ser Ser Gly Gln Gly Gly Glu Ala Gly Ala Ala Pro Gly Val Pro Ser 
            100                 105                 110         


Leu Ala Ser Ser Pro Ala Phe Ala Pro Ala Ala Thr Gly Leu Ser Pro 
        115                 120                 125             


Ser His Ser Ala Gly Ser Gly Met Ser Val Leu Ile Gln Val Pro Gln 
    130                 135                 140                 


Asn Gly Pro Ser Glu Ala Leu Ser Pro Leu Pro Leu Pro Thr Thr Ala 
145                 150                 155                 160 


Leu Asp Thr Pro Leu Asp Thr Arg Ser Ser Thr Pro Arg Pro Ala Pro 
                165                 170                 175     


Ala Pro Ala Pro Pro Ser Pro Tyr Gln Thr Val Gly Gly Leu His Gly 
            180                 185                 190         


Gly Glu His Ser Phe Leu Pro Pro Val Ser Thr Glu Gly Leu Ala Pro 
        195                 200                 205             


Pro Ala Met Gly Thr Gly Glu Gly Gly Leu Glu Gly Gly Asp Gly Gly 
    210                 215                 220                 


Ser Gly Asp Gly Asp Arg Pro Gly Gly Gly Asn Ser Gln Met Val Ala 
225                 230                 235                 240 


Pro Ala Pro Ala Pro Ser Leu Gln Gln Ser Ala Pro Leu Gln Gln Arg 
                245                 250                 255     


Gln Gln Leu Gln Pro Gln Gln His Gln Gln Phe His Ser Arg Ser His 
            260                 265                 270         


Pro Gln Val Ala Pro Leu Gln Val Gln Gln Arg Gln Gln Pro Arg Ala 
        275                 280                 285             


Leu Val Pro Gly Pro Gln Gln Gln Gln Gln His Gln Gln Gln Gln Ala 
    290                 295                 300                 


Leu Tyr Ala Ser Ser Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
305                 310                 315                 320 


Gln Gln His Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
                325                 330                 335     


Arg His His Pro His Pro Gln Gln Leu Gln Gln Gln Gln Arg His Asn 
            340                 345                 350         


Gln Gln Gln Pro Leu Gln His Pro Gln Ala Gln His Arg Val Pro Pro 
        355                 360                 365             


Gln Gly Met Pro Gln His Gln His Val Arg Ala Pro Gln Gln Gln Arg 
    370                 375                 380                 


Gln Gln Gln Leu Leu Pro Leu Pro Thr Ala Gly Asn Ala Val Pro Gly 
385                 390                 395                 400 


Gly Gln Ala Thr Gly Thr Pro His Ala Ser Gln Leu Pro His Ala Gln 
                405                 410                 415     


Leu Ser Gln Gln Gln Gln Pro Ala His Ser Leu Pro Gln Arg Gln Gly 
            420                 425                 430         


Leu Gly Ala Gln Pro Leu Asn Pro Gln Asp Thr Ala Leu Arg Pro Gly 
        435                 440                 445             


Met Val Lys Asn Ile Met Val Leu Leu Gln Gln Arg Lys Pro Ala Ala 
    450                 455                 460                 


Asp Pro Ser Lys Pro Leu Val Glu Thr Arg Leu Lys Glu Met Ala Ile 
465                 470                 475                 480 


Arg Leu Glu Asp Tyr Leu Trp Lys Arg Ser Ser Thr Leu Ala Glu Tyr 
                485                 490                 495     


Ser Asp Leu Ser Thr Leu Lys His Arg Leu Gln Cys Leu Ala Val Tyr 
            500                 505                 510         


Met Gly Lys His Gln Gln Arg Gly Gln Thr Val Pro Ala Gly Ala Arg 
        515                 520                 525             


Gly Arg Gly Gly Gly Met Pro Asn Gln Ala Pro Gln Pro Gln Gly Gly 
    530                 535                 540                 


Gly Leu Ser Gly Asn Thr Asn Gln Leu Gln Arg Leu Val Pro Thr Ala 
545                 550                 555                 560 


Asn Ala Ser Asn Ile His Leu Pro Asn Pro His Pro Gly Gly Leu Ser 
                565                 570                 575     


Gly Gly Met Gly Ala Gly Gly Ala Arg Val Gly Gly Arg Gly Ser Gly 
            580                 585                 590         


Ile Gly Gly Gly Gly Leu Ile Met Gln Pro Gly Ser Ala Ile His Gly 
        595                 600                 605             


His Pro Pro Gly Pro Gln Leu Arg Gly Ser Ser Leu Pro His Gln Gly 
    610                 615                 620                 


Gln Val Gln Pro Thr Ser Gln Gln Gly Ser Gln Gln Arg Arg Val Gly 
625                 630                 635                 640 


Thr Gly Leu Ala Pro Ala Pro Gly Thr Gln Pro Ala Phe Leu Pro Gln 
                645                 650                 655     


Glu Gln Thr Gln Met Gln Gly Arg Arg Val Gly Gly Gly Gly Met Leu 
            660                 665                 670         


Pro Val Asn Gly Gly Asn Ser His Pro Pro Pro Ala Pro Gly Pro Pro 
        675                 680                 685             


Gln Gly His Leu Gln Pro Pro Gln Gln Ser Ser Gly Gln Gly Gln Ala 
    690                 695                 700                 


Ala Pro Leu Asn Val Met Gly Gly Ala Gln Gln Val Gly Gly Gly Gly 
705                 710                 715                 720 


Asn Ala Asn Arg Gly Leu Pro Met Pro Leu Ser Ser Gly Pro Gly Gly 
                725                 730                 735     


Thr Ala Ser Ala Ser Gln Lys Lys Arg Val Gln His Thr Pro Glu Gln 
            740                 745                 750         


Arg Gln Gln Ile Leu His Gln Gln Gln Gln Arg Leu Leu Tyr Leu Arg 
        755                 760                 765             


His Ala Ser Lys Cys Ile His Val Asp Gly Arg Cys Pro Gln Gly Tyr 
    770                 775                 780                 


Pro Asn Cys Ile Gly Met Lys Glu Leu Trp Lys His Ile Ala Ser Cys 
785                 790                 795                 800 


Arg Glu Gln Arg Cys Lys Phe Pro His Cys Val Ser Ser Arg Tyr Val 
                805                 810                 815     


Leu Ser His Tyr His Lys Cys Lys Asp Thr Gln Cys Pro Val Cys Gly 
            820                 825                 830         


Pro Val Arg Asn Thr Ile Arg Ser Ser Arg Ser Ser Ala His Pro Met 
        835                 840                 845             


Pro Gln Leu Gly Gln Gly Val Ala Asp Ala Asp Gly Gly Gly Glu Gly 
    850                 855                 860                 


Gly Gly Ser Gly Val Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
865                 870                 875                 880 


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Leu Val Ala Gln 
                885                 890                 895     


Ser Asn Gln Arg Thr Gln Gln Gln Gln Met Leu Ile Ala Gln Gln Pro 
            900                 905                 910         


Pro Pro Ala Gly Met Gly Gly Gly Arg Val Gly Gly Met Thr Gly Ala 
        915                 920                 925             


Leu Ala Asn Gly Gly Arg Gly Gly Arg Val Gly Gly Arg Ala Arg Gly 
    930                 935                 940                 


Arg Gly Gly Gln Val Val Leu Pro Gln Gln Val Ala Ala Gly Gly Arg 
945                 950                 955                 960 


Gly Ile Gly Gly Gln Asn Val Gly Gly Ser Gly Met Asn Gln Gln Arg 
                965                 970                 975     


Leu Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
            980                 985                 990         


Gln Gln Gln Gln Gln Gln Gln Gln  Arg Pro Gln Asn Met  Ala Ser Val 
        995                 1000                 1005             


Pro Val  Pro Gly Val Gly Arg  Gly Gly Gly Gly Val  Arg Ala Gly 
    1010                 1015                 1020             


Gly Glu  Ala Leu Ala Leu Gly  Thr Ala Gly Gly Ala  Gly Ser Lys 
    1025                 1030                 1035             


Pro Gly  Ala Arg Ser Gly Ser  Gly Lys Met Pro Val  Val Ala Lys 
    1040                 1045                 1050             


Thr Pro  Asn Gly Leu Met Ile  Gln Thr Glu Thr His  Gly Trp Val 
    1055                 1060                 1065             


Pro Val  Glu Pro Thr Lys Asn  Gly Gly Tyr Arg Pro  Leu Val Pro 
    1070                 1075                 1080             


Leu Pro  Gly Ser Gly Gln Ser  Phe Ser Gln Ala Ala  Gly Gly Ala 
    1085                 1090                 1095             


Gly Ala  Gly Gly Arg Pro Gly  Gly Val Gly Arg Gly  Val Pro Gly 
    1100                 1105                 1110             


Val Pro  Ala Pro Pro Ser Ala  Ala Ala Leu Gln Arg  Phe Glu Asp 
    1115                 1120                 1125             


Ser Val  Ser Leu Val Asn Ser  Phe Thr Asp Ala Gln  Ile Lys Ala 
    1130                 1135                 1140             


His Met  Ala Ser Leu Arg Ser  Gly Gly Gly Phe Trp  Thr Pro Ala 
    1145                 1150                 1155             


Lys Leu  Lys Leu Lys Val Leu  Pro Leu Val Val Lys  Gln Leu Lys 
    1160                 1165                 1170             


Ser Glu  Tyr Gly Trp Ile Phe  Glu Glu Pro Val Asp  Pro Val Lys 
    1175                 1180                 1185             


Leu Gly  Leu Pro Asp Tyr Phe  Asp Val Ile Lys His  Pro Met Asp 
    1190                 1195                 1200             


Leu Gly  Thr Val Lys Arg Arg  Leu Glu Asn Gly Ser  Tyr Thr Glu 
    1205                 1210                 1215             


Leu Glu  Lys Val Ala Ala Asp  Val Lys Leu Thr Phe  Asp Asn Ala 
    1220                 1225                 1230             


Ile Leu  Tyr Asn Pro Pro Gly  Gln Glu Ile His Lys  Val Thr Asp 
    1235                 1240                 1245             


Glu Lys  Arg Ala Gly Lys Gly  Gly Arg Ser Arg Leu  Asp Glu Glu 
    1250                 1255                 1260             


Ala Asp  Glu Glu Val Glu Arg  Glu Arg Leu Val Leu  Gly Ala Asp 
    1265                 1270                 1275             


Arg Ser  Asn Gly Trp Phe Tyr  Asp Glu Lys Met Asp  Gly Ser Thr 
    1280                 1285                 1290             


Thr Lys  Met Lys 
    1295         


<210>  5
<211>  4722
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo-1091 gene transcript c (HAT-B2)

<220>
<221>  misc_feature
<223>  encodes the polypeptide of SEQ ID NO:6

<400>  5
atggattcga acgcgcaaac caccagtggc accgtcgttg aaagcacggc tagcaatgga       60

gaggcttctg cgcccgcgcc catgctttcg tcctcccttc cttctccaag ctttgagtcc      120

ggcccagacc ccccccccca gttagcaagg cgggtccccg ggaacgtgcc gcttgacccc      180

tcggccgccg acgtggacga caaggaccgc gcctccagcg cctacggaga cgaacctccc      240

ctccccctcc ccctcctcac gtccacctcg atgacagcct cagaagcgag cagcggtcaa      300

ggaggggaag ctggggccgc cccaggggtg ccctcccttg cttcctcccc tgccttcgcc      360

cccgcagcta ccggcctgtc cccgtctcac tccgccggtt ccggcatgtc agtgctgatc      420

caagtgcctc aaaacgggcc cagcgaggct ctgtcgcctt tgcccttgcc gaccactgcc      480

ttggatactc ccttggacac ccggtcgtcc accccccgcc ccgcgcccgc cccagccccg      540

ccttctcctt accagactgt tggaggcctc cacggcgggg agcactcgtt ccttcctccc      600

gtcagtacgg aagggctggc ccctccggcg atgggcacgg gggaaggagg gcttgagggc      660

ggggatggag ggtcggtagg tttttatccc ccccttgccc agtcgcagac gcaactcgcg      720

ccgttgccgg gcccaccgcc tccgcaggcg caagattcgc tgcagtacaa gcctgcttcg      780

gtaccggagc cgactaggat gatggaaggg tccagtgatc ctccttttca ttcgtcggag      840

acgcccaggg cgatggggat cggccggggg ggagggaatt cgcagatggt tgcacctgcc      900

cccgcgccat cgttgcaaca gtcggcgccg ttgcaacaac gtcagcaatt gcaacctcaa      960

cagcaccaac agttccattc gcgctcccac ccacaagtag cgccactcca ggtgcagcaa     1020

cggcagcaac cgcgggcact ggtgccaggg ccccagcagc agcagcagca tcagcagcag     1080

caagctctct atgcatcttc gcaacagcag cagcaacagc agcagcagca acagcagcaa     1140

catcagcagc agcagcagca gcagcagcag caacagcagc agagacatca cccgcaccca     1200

cagcaactgc agcaacaaca gcgacacaac cagcagcagc cactccagca tccacaagca     1260

cagcatcgag tcccacccca gggcatgcct cagcaccagc acgtccgggc gccacagcaa     1320

cagcggcagc agcaactcct ccctcttcca accgcgggca atgccgtccc aggcggccag     1380

gcaaccggca ccccgcacgc gtcgcaactg cctcacgccc agctctccca acaacaacaa     1440

cccgcgcatt ccttgcccca acggcagggc ctgggcgcgc agcccctcaa cccacaggac     1500

actgccttgc ggcccggaat ggtcaagaac atcatggtct tgctccaaca acgcaaaccc     1560

gccgccgatc cttccaaacc cttggtggaa actcggttga aggagatggc gatccggctg     1620

gaggactacc tgtggaaacg ctcgtccacg ttggcggagt actcggatct gagcaccctc     1680

aaacaccgcc tgcagtgttt ggcagtctac atgggcaagc accagcagcg gggtcaaact     1740

gtaccggcgg gcgcaagggg cagaggggga gggatgccga atcaagcgcc ccagccacag     1800

gggggggggc tctctgggaa cacgaaccaa ctgcaacgtt tggtgcctac cgccaatgcc     1860

agcaatattc acctgcccaa ccctcatccc ggaggtcttt cgggtggaat gggggcggga     1920

ggcgcgcgtg tgggagggcg gggcagtggg atcggcggag gggggttgat catgcaacct     1980

gggagtgcca tccacggaca tccccccggg ccccagttgc ggggcagctc tctcccccac     2040

caagggcaag tgcaaccgac ctcgcagcag ggaagtcagc aaagaagggt gggaacgggt     2100

ctggcgcctg cgcctggcac acaacccgcg tttttaccac aggaacaaac gcaaatgcaa     2160

ggtcggcggg tagggggggg agggatgctg cccgtaaatg ggggtaacag ccaccctcct     2220

cccgcgccag gtcctccaca aggccatctg cagccgccgc agcagtcatc aggacagggg     2280

caagccgctc ccttgaacgt gatggggggg gcacagcaag tggggggggg cggtaatgcg     2340

aaccgagggc tccctatgcc tttatcttca ggccccgggg gtaccgcctc cgccagtcag     2400

aagaaacgcg tccagcacac gcccgaacaa cgtcagcaaa tcttgcacca gcagcagcag     2460

cggctgcttt acttgcgcca tgcgtccaag tgcattcatg tggacggccg ctgtccccag     2520

gggtacccga actgcatcgg gatgaaggag ctttggaagc acatcgcctc ctgtagggaa     2580

caacggtgca agttccccca ctgcgtgtcc tcgagatacg tcctgtccca ctaccacaaa     2640

tgtaaggaca cgcagtgccc ggtgtgcgga cccgtacgaa acacgatccg atcttctcgc     2700

tcctcggcgc atcccatgcc gcaacttggt cagggtgtgg cagacgccga cggaggaggc     2760

gagggaggcg gatctggagt ccagcagcag cagcagcagc agcaacaaca acaacaacaa     2820

caacaacaac aacagcaaca acaacagcaa ttggtagcac agagtaatca acgcacgcag     2880

cagcaacaaa tgttgatcgc ccagcagccc ccccccgcag ggatgggggg agggagggtc     2940

ggaggcatga ctggggccct ggcgaatgga ggcaggggtg ggagggtcgg agggagggcg     3000

cggggcaggg ggggtcaagt cgtgcttcct cagcaggttg cggccggggg gcggggaata     3060

ggcggtcaga atgtaggtgg aagtggaatg aaccagcaac gattgcagca acagcaacaa     3120

cagcagcagc aacaacagca gcagcagcag cagcagcagc agcagcagca gcaacgccca     3180

caaaatatgg cttccgtgcc ggttcctggg gtaggacgtg ggggaggagg ggtgcgagct     3240

ggcggggaag ccctcgcctt gggcactgcg ggtggagcgg gcagcaaacc tggggcccgg     3300

agcggttcgg ggaaaatgcc agtcgtagcc aagactccga atggcctcat gatccagacg     3360

gaaacgcatg gatgggtgcc ggtagagccc acgaaaaacg gcggctaccg tcccctggtg     3420

cctctgcccg gctccggtca aagcttctca caggctgccg gcggggctgg cgcgggcgga     3480

cgtcctggcg gcgttgggag aggggtgccc ggcgtacctg ccccaccttc cgcggcagcg     3540

ttgcagcggt tcgaagactc cgtgtccttg gtgaactcct tcacggacgc acaaattaag     3600

gcgcacatgg cctctctgcg ttcaggggga gggttttgga ctcccgccaa gttgaaactt     3660

aaggtgcgtt caaggatatc taccgagcca acggtctctt tgttagtctc tccctttgtt     3720

ccccgctttc attacgctcc tgcatacctg gatgccgcgc tttcttctcc tctcacatgc     3780

cctgtccccc ccctttcccc taggttctcc ccctcggggt aaaacagctg aaatcggagt     3840

agggatggat ttttgaagaa cccggggacc ccgtgaagct cgggctcccg gattacttcg     3900

atgtgatcaa gcaccctatg gacttggcca ctgtacgtcg gcttgtgtcg aggcgctttc     3960

cctcagaatc gtctcccccc cccccccccc ccaatgacca gtgctgctgg tcgcatcatg     4020

tcttctactt tccctccatc tttttttttc tttttcgtct atgcctcttc ttcttcccca     4080

cctctttttt taaaacggac attgcccgtt gttggtcaag ttggccttgc ctccccagcc     4140

cgtgctgacc atggctttcc gtcgtccctc cgttcttcct cgatcaggtg aagcgtcgtt     4200

tggaaaacgg ctcctacaca gagctggaaa ggtggcggcg gacgtgaagc tcaccttcga     4260

caatgccatc ctttacaacc ccccggggca agaaatccac aaggtaacgg acgaaaaacg     4320

ggcgggaaaa gggggcaggt caaggctgga tgaagaggca gacgaggagg ttgaaagaga     4380

gaggctcgtg ctaggggcgg accggagcaa tggatggttc tacgacgaaa aaatggatgg     4440

ttccacgacg aaaatgaagt gacgggcagg ggggaaaggg gggacacgga aacgacattg     4500

cgggatacag aagtctgttg ggtgggccat ccctccctct caccctccct ccctcgttgc     4560

tggcccctac agatggccaa ggacatgcgg gacagtttct tcaaggactt caggcagctg     4620

gaggaggagg ttaagaggga acagcagctg actgtcaaca ggtaacccta acagaaaggg     4680

agggcggaga gaagcgggag ggaaggggga gggggagggg ga                        4722


<210>  6
<211>  1273
<212>  PRT
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo-1091 polypeptide, isoform c (HAT-B2)

<400>  6

Met Asp Ser Asn Ala Gln Thr Thr Ser Gly Thr Val Val Glu Ser Thr 
1               5                   10                  15      


Ala Ser Asn Gly Glu Ala Ser Ala Pro Ala Pro Met Leu Ser Ser Ser 
            20                  25                  30          


Leu Pro Ser Pro Ser Phe Glu Ser Gly Pro Asp Pro Pro Pro Gln Leu 
        35                  40                  45              


Ala Arg Arg Val Pro Gly Asn Val Pro Leu Asp Pro Ser Ala Ala Asp 
    50                  55                  60                  


Val Asp Asp Lys Asp Arg Ala Ser Ser Ala Tyr Gly Asp Glu Pro Pro 
65                  70                  75                  80  


Leu Pro Leu Pro Leu Leu Thr Ser Thr Ser Met Thr Ala Ser Glu Ala 
                85                  90                  95      


Ser Ser Gly Gln Gly Gly Glu Ala Gly Ala Ala Pro Gly Val Pro Ser 
            100                 105                 110         


Leu Ala Ser Ser Pro Ala Phe Ala Pro Ala Ala Thr Gly Leu Ser Pro 
        115                 120                 125             


Ser His Ser Ala Gly Ser Gly Met Ser Val Leu Ile Gln Val Pro Gln 
    130                 135                 140                 


Asn Gly Pro Ser Glu Ala Leu Ser Pro Leu Pro Leu Pro Thr Thr Ala 
145                 150                 155                 160 


Leu Asp Thr Pro Leu Asp Thr Arg Ser Ser Thr Pro Arg Pro Ala Pro 
                165                 170                 175     


Ala Pro Ala Pro Pro Ser Pro Tyr Gln Thr Val Gly Gly Leu His Gly 
            180                 185                 190         


Gly Glu His Ser Phe Leu Pro Pro Val Ser Thr Glu Gly Leu Ala Pro 
        195                 200                 205             


Pro Ala Met Gly Thr Gly Glu Gly Gly Leu Glu Gly Gly Asp Gly Gly 
    210                 215                 220                 


Ser Val Gly Phe Tyr Pro Pro Leu Ala Gln Ser Gln Thr Gln Leu Ala 
225                 230                 235                 240 


Pro Leu Pro Gly Pro Pro Pro Pro Gln Ala Gln Asp Ser Leu Gln Tyr 
                245                 250                 255     


Lys Pro Ala Ser Val Pro Glu Pro Thr Arg Met Met Glu Gly Ser Ser 
            260                 265                 270         


Asp Pro Pro Phe His Ser Ser Glu Thr Pro Arg Ala Met Gly Ile Gly 
        275                 280                 285             


Arg Gly Gly Gly Asn Ser Gln Met Val Ala Pro Ala Pro Ala Pro Ser 
    290                 295                 300                 


Leu Gln Gln Ser Ala Pro Leu Gln Gln Arg Gln Gln Leu Gln Pro Gln 
305                 310                 315                 320 


Gln His Gln Gln Phe His Ser Arg Ser His Pro Gln Val Ala Pro Leu 
                325                 330                 335     


Gln Val Gln Gln Arg Gln Gln Pro Arg Ala Leu Val Pro Gly Pro Gln 
            340                 345                 350         


Gln Gln Gln Gln His Gln Gln Gln Gln Ala Leu Tyr Ala Ser Ser Gln 
        355                 360                 365             


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln His Gln Gln Gln 
    370                 375                 380                 


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Arg His His Pro His Pro 
385                 390                 395                 400 


Gln Gln Leu Gln Gln Gln Gln Arg His Asn Gln Gln Gln Pro Leu Gln 
                405                 410                 415     


His Pro Gln Ala Gln His Arg Val Pro Pro Gln Gly Met Pro Gln His 
            420                 425                 430         


Gln His Val Arg Ala Pro Gln Gln Gln Arg Gln Gln Gln Leu Leu Pro 
        435                 440                 445             


Leu Pro Thr Ala Gly Asn Ala Val Pro Gly Gly Gln Ala Thr Gly Thr 
    450                 455                 460                 


Pro His Ala Ser Gln Leu Pro His Ala Gln Leu Ser Gln Gln Gln Gln 
465                 470                 475                 480 


Pro Ala His Ser Leu Pro Gln Arg Gln Gly Leu Gly Ala Gln Pro Leu 
                485                 490                 495     


Asn Pro Gln Asp Thr Ala Leu Arg Pro Gly Met Val Lys Asn Ile Met 
            500                 505                 510         


Val Leu Leu Gln Gln Arg Lys Pro Ala Ala Asp Pro Ser Lys Pro Leu 
        515                 520                 525             


Val Glu Thr Arg Leu Lys Glu Met Ala Ile Arg Leu Glu Asp Tyr Leu 
    530                 535                 540                 


Trp Lys Arg Ser Ser Thr Leu Ala Glu Tyr Ser Asp Leu Ser Thr Leu 
545                 550                 555                 560 


Lys His Arg Leu Gln Cys Leu Ala Val Tyr Met Gly Lys His Gln Gln 
                565                 570                 575     


Arg Gly Gln Thr Val Pro Ala Gly Ala Arg Gly Arg Gly Gly Gly Met 
            580                 585                 590         


Pro Asn Gln Ala Pro Gln Pro Gln Gly Gly Gly Leu Ser Gly Asn Thr 
        595                 600                 605             


Asn Gln Leu Gln Arg Leu Val Pro Thr Ala Asn Ala Ser Asn Ile His 
    610                 615                 620                 


Leu Pro Asn Pro His Pro Gly Gly Leu Ser Gly Gly Met Gly Ala Gly 
625                 630                 635                 640 


Gly Ala Arg Val Gly Gly Arg Gly Ser Gly Ile Gly Gly Gly Gly Leu 
                645                 650                 655     


Ile Met Gln Pro Gly Ser Ala Ile His Gly His Pro Pro Gly Pro Gln 
            660                 665                 670         


Leu Arg Gly Ser Ser Leu Pro His Gln Gly Gln Val Gln Pro Thr Ser 
        675                 680                 685             


Gln Gln Gly Ser Gln Gln Arg Arg Val Gly Thr Gly Leu Ala Pro Ala 
    690                 695                 700                 


Pro Gly Thr Gln Pro Ala Phe Leu Pro Gln Glu Gln Thr Gln Met Gln 
705                 710                 715                 720 


Gly Arg Arg Val Gly Gly Gly Gly Met Leu Pro Val Asn Gly Gly Asn 
                725                 730                 735     


Ser His Pro Pro Pro Ala Pro Gly Pro Pro Gln Gly His Leu Gln Pro 
            740                 745                 750         


Pro Gln Gln Ser Ser Gly Gln Gly Gln Ala Ala Pro Leu Asn Val Met 
        755                 760                 765             


Gly Gly Ala Gln Gln Val Gly Gly Gly Gly Asn Ala Asn Arg Gly Leu 
    770                 775                 780                 


Pro Met Pro Leu Ser Ser Gly Pro Gly Gly Thr Ala Ser Ala Ser Gln 
785                 790                 795                 800 


Lys Lys Arg Val Gln His Thr Pro Glu Gln Arg Gln Gln Ile Leu His 
                805                 810                 815     


Gln Gln Gln Gln Arg Leu Leu Tyr Leu Arg His Ala Ser Lys Cys Ile 
            820                 825                 830         


His Val Asp Gly Arg Cys Pro Gln Gly Tyr Pro Asn Cys Ile Gly Met 
        835                 840                 845             


Lys Glu Leu Trp Lys His Ile Ala Ser Cys Arg Glu Gln Arg Cys Lys 
    850                 855                 860                 


Phe Pro His Cys Val Ser Ser Arg Tyr Val Leu Ser His Tyr His Lys 
865                 870                 875                 880 


Cys Lys Asp Thr Gln Cys Pro Val Cys Gly Pro Val Arg Asn Thr Ile 
                885                 890                 895     


Arg Ser Ser Arg Ser Ser Ala His Pro Met Pro Gln Leu Gly Gln Gly 
            900                 905                 910         


Val Ala Asp Ala Asp Gly Gly Gly Glu Gly Gly Gly Ser Gly Val Gln 
        915                 920                 925             


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
    930                 935                 940                 


Gln Gln Gln Gln Gln Gln Leu Val Ala Gln Ser Asn Gln Arg Thr Gln 
945                 950                 955                 960 


Gln Gln Gln Met Leu Ile Ala Gln Gln Pro Pro Pro Ala Gly Met Gly 
                965                 970                 975     


Gly Gly Arg Val Gly Gly Met Thr Gly Ala Leu Ala Asn Gly Gly Arg 
            980                 985                 990         


Gly Gly Arg Val Gly Gly Arg Ala  Arg Gly Arg Gly Gly  Gln Val Val 
        995                 1000                 1005             


Leu Pro  Gln Gln Val Ala Ala  Gly Gly Arg Gly Ile  Gly Gly Gln 
    1010                 1015                 1020             


Asn Val  Gly Gly Ser Gly Met  Asn Gln Gln Arg Leu  Gln Gln Gln 
    1025                 1030                 1035             


Gln Gln  Gln Gln Gln Gln Gln  Gln Gln Gln Gln Gln  Gln Gln Gln 
    1040                 1045                 1050             


Gln Gln  Gln Gln Gln Arg Pro  Gln Asn Met Ala Ser  Val Pro Val 
    1055                 1060                 1065             


Pro Gly  Val Gly Arg Gly Gly  Gly Gly Val Arg Ala  Gly Gly Glu 
    1070                 1075                 1080             


Ala Leu  Ala Leu Gly Thr Ala  Gly Gly Ala Gly Ser  Lys Pro Gly 
    1085                 1090                 1095             


Ala Arg  Ser Gly Ser Gly Lys  Met Pro Val Val Ala  Lys Thr Pro 
    1100                 1105                 1110             


Asn Gly  Leu Met Ile Gln Thr  Glu Thr His Gly Trp  Val Pro Val 
    1115                 1120                 1125             


Glu Pro  Thr Lys Asn Gly Gly  Tyr Arg Pro Leu Val  Pro Leu Pro 
    1130                 1135                 1140             


Gly Ser  Gly Gln Ser Phe Ser  Gln Ala Ala Gly Gly  Ala Gly Ala 
    1145                 1150                 1155             


Gly Gly  Arg Pro Gly Gly Val  Gly Arg Gly Val Pro  Gly Val Pro 
    1160                 1165                 1170             


Ala Pro  Pro Ser Ala Ala Ala  Leu Gln Arg Phe Glu  Asp Ser Val 
    1175                 1180                 1185             


Ser Leu  Val Asn Ser Phe Thr  Asp Ala Gln Ile Lys  Ala His Met 
    1190                 1195                 1200             


Ala Ser  Leu Arg Ser Gly Gly  Gly Phe Trp Thr Pro  Ala Lys Leu 
    1205                 1210                 1215             


Lys Leu  Lys Val Arg Ser Arg  Ile Ser Thr Glu Pro  Thr Val Ser 
    1220                 1225                 1230             


Leu Leu  Val Ser Pro Phe Val  Pro Arg Phe His Tyr  Ala Pro Ala 
    1235                 1240                 1245             


Tyr Leu  Asp Ala Ala Leu Ser  Ser Pro Leu Thr Cys  Pro Val Pro 
    1250                 1255                 1260             


Pro Leu  Ser Pro Arg Phe Ser  Pro Ser Gly 
    1265                 1270             


<210>  7
<211>  4429
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo-1091 gene transcript d (HAT-B10)

<220>
<221>  misc_feature
<223>  encodes the polypeptide of SEQ ID NO:8

<400>  7
atggattcga acgcgcaaac caccagtggc accgtcgttg aaagcacggc tagcaatgga       60

gaggcttctg cgcccgcgcc catgctttcg tcctcccttc cttctccaag ctttgagtcc      120

ggcccagacc ccccccccca gttagcaagg cgggtccccg ggaacgtgcc gcttgacccc      180

tcggccgccg acgtggacga caaggaccgc gcctccagcg cctacggaga cgaacctccc      240

ctccccctcc ccctcctcac gtccacctcg atgacagcct cagaagcgag cagcggtcaa      300

ggaggggaag ctggggccgc cccaggggtg ccctcccttg cttcctcccc tgccttcgcc      360

cccgcagcta ccggcctgtc cccgtctcac tccgccggtt ccggcatgtc agtgctgatc      420

caagtgcctc aaaacgggcc cagcgaggct ctgtcgcctt tgcccttgcc gaccactgcc      480

ttggatactc ccttggacac ccggtcgtcc accccccgcc ccgcgcccgc cccagccccg      540

ccttctcctt accagactgt tggaggcctc cacggcgggg agcactcgtt ccttcctccc      600

gtcagtacgg aagggctggc ccctccggcg atgggcacgg gggaaggagg gcttgagggc      660

ggggatggag ggtcggtagg tttttatccc ccccttgccc agtcgcagac gcaactcgcg      720

ccgttgccgg gcccaccgcc tccgcaggcg caagattcgc tgcagtacaa gcctgcttcg      780

gtaccggagc cgactaggat gatggaaggg tccagtgatc ctccttttca ttcgtcggag      840

acgcccaggg cgatggggat cggccggggg ggagggaatt cgcagatggt tgcacctgcc      900

cccgcgccat cgttgcaaca gtcggcgccg ttgcaacaac gtcagcaatt gcaacctcaa      960

cagcaccaac agttccattc gcgctcccac ccacaagtag cgccactcca ggtgcagcaa     1020

cggcagcaac cgcgggcact ggtgccaggg ccccagcagc agcagcagca tcagcagcag     1080

caagctctct atgcatcttc gcaacagcag cagcaacagc agcagcagca acagcagcaa     1140

catcagcagc agcagcagca gcagcagcag caacagcagc agagacatca cccgcaccca     1200

cagcaactgc agcaacaaca gcgacacaac cagcagcagc cactccagca tccacaagca     1260

cagcatcgag tcccacccca gggcatgcct cagcaccagc acgtccgggc gccacagcaa     1320

cagcggcagc agcaactcct ccctcttcca accgcgggca atgccgtccc aggcggccag     1380

gcaaccggca ccccgcacgc gtcgcaactg cctcacgccc agctctccca acaacaacaa     1440

cccgcgcatt ccttgcccca acggcagggc ctgggcgcgc agcccctcaa cccacaggac     1500

actgccttgc ggcccggaat ggtcaagaac atcatggtct tgctccaaca acgcaaaccc     1560

gccgccgatc cttccaaacc cttggtggaa actcggttga aggagatggc gatccggctg     1620

gaggactacc tgtggaaacg ctcgtccacg ttggcggagt actcggatct gagcaccctc     1680

aaacaccgcc tgcagtgttt ggcagtctac atgggcaagc accagcagcg gggtcaaact     1740

gtaccggcgg gcgcaagggg cagaggggga gggatgccga atcaagcgcc ccagccacag     1800

gggggggggc tctctgggaa cacgaaccaa ctgcaacgtt tggtgcctac cgccaatgcc     1860

agcaatattc acctgcccaa ccctcatccc ggaggtcttt cgggtggaat gggggcggga     1920

ggcgcgcgtg tgggagggcg gggcagtggg atcggcggag gggggttgat catgcaacct     1980

gggagtgcca tccacggaca tccccccggg ccccagttgc ggggcagctc tctcccccac     2040

caagggcaag tgcaaccgac ctcgcagcag ggaagtcagc aaagaagggt gggaacgggt     2100

ctggcgcctg cgcctggcac acaacccgcg tttttaccac aggaacaaac gcaaatgcaa     2160

ggtcggcggg tagggggggg agggatgctg cccgtaaatg ggggtaacag ccaccctcct     2220

cccgcgccag gtcctccaca aggccatctg cagccgccgc agcagtcatc aggacagggg     2280

caagccgctc ccttgaacgt gatggggggg gcacagcaag tggggggggg cggtaatgcg     2340

aaccgagggc tccctatgcc tttatcttca ggccccgggg gtaccgcctc cgccagtcag     2400

aagaaacgcg tccagcacac gcccgaacaa cgtcagcaaa tcttgcacca gcagcagcag     2460

cggctgcttt acttgcgcca tgcgtccaag tgcattcatg tggacggccg ctgtccccag     2520

gggtacccga actgcatcgg gatgaaggag ctttggaagc acatcgcctc ctgtagggaa     2580

caacggtgca agttccccca ctgcgtgtcc tcgagatacg tcctgtccca ctaccacaaa     2640

tgtaaggaca cgcagtgccc ggtgtgcgga cccgtacgaa acacgatccg atcttctcgc     2700

tcctcggcgc atcccatgcc gcaacttggt cagggtgtgg cagacgccga cggaggaggc     2760

gagggaggcg gatctggagt ccagcagcag cagcagcagc aacaacaaca acaacaacaa     2820

caacaacaac agcaacaaca acagcaattg gtagcacaga gtaatcaacg cacgcagcag     2880

caacaaatgt tgatcgccca gcagcccccc ccccgcaggg atggggggag ggagggtcgg     2940

aggcatgact ggggccctgg cgaatggagg caggggtggg agggtcggag ggagggcgcg     3000

gggcaggggg ggtcaagtcg tgcttcctca gcaggttgcg gccggggggc ggggaatagg     3060

cggtcagaat gtaggtggaa gtggaatgaa ccagcaacga ttgcagcaac agcaacaaca     3120

gcagcagcaa caacagcagc agcagcagca gcagcagcag cagcagcaac gcccacaaaa     3180

tatggcttcc gtgccggttc ctggggtagg acgtggggga ggaggggtgc gagctggcgg     3240

ggaagccctc gccttgggca ctgcgggtgg agcgggcagc aaacctgggg cccggagcgg     3300

ttcggggaaa atgccagtcg tagccaagac tccgaatggc ctcatgatcc agacggaaac     3360

gcatggatgg gtgccggtag agcccacgaa aaacggcggc taccgtcccc tggtgcctct     3420

gcccggctcc ggtcaaagct tctcacaggc tgccggcggg gctggcgcgg gcggacgtcc     3480

tggcggcgtt gggagagggg tgcccggcgt acctgcccca ccttccgcgg cagcgttgca     3540

gcggttcgaa gactccgtgt ccttggtgaa ctccttcacg gacgcacaaa ttaaggcgca     3600

catggtctct ctgcgttcag ggggagggtt ttgaactccc gccaagttga aacttaaggt     3660

gcgttcaagg atatctaccg agccaacggt ctctttgttt gtctctccct ttgttccccg     3720

ctttcattac gctcctgcat acctggatgc cgcgctttct tctcctctca catgccctgt     3780

cccccccctt tcccctaggt tctccccctc gtggtaaaac agctgaaatc ggagtatgga     3840

tggatttttg aagaacccgt ggaccccgtg aagctcgggc tcccggatta cttcgatgtg     3900

atcaagcacc ctatggactt gggcactgta cgtcggcttg tgtcgaggcg ctttccctca     3960

gaatcgtctc cttccccccc cccccccaat gaccagtgct gctggtcgca tcatgtcttc     4020

tactttccct ccatcttttt tttctttttc gtctatgcct cttcttcttc cccacctctt     4080

tttttaaaac ggacattgcc cgttgttggt caagttggcc ttgcctcccc agcccgtgct     4140

gaccatggct ttccgtcgtc cctccgttct tcctcgatca ggtgaagcgt cgtttggaaa     4200

acggctccta cacagagctg gaaaaggtgg cggcggacgt gaagctcacc ttcgacaatg     4260

ccatccttta caaccccccg gggcaagaaa tccacaaggt aacggacgaa aaacgggcgg     4320

gaaaaggggg caggtcaagg ctggatgaag aggcagacga ggaggttgaa agagagaggc     4380

tcgtgctagg ggcggaccgg agcaatggat ggttctacga cgaaaaaat                 4429


<210>  8
<211>  1197
<212>  PRT
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo-1091 polypeptide, isoform d (HAT-B10)

<400>  8

Met Asp Ser Asn Ala Gln Thr Thr Ser Gly Thr Val Val Glu Ser Thr 
1               5                   10                  15      


Ala Ser Asn Gly Glu Ala Ser Ala Pro Ala Pro Met Leu Ser Ser Ser 
            20                  25                  30          


Leu Pro Ser Pro Ser Phe Glu Ser Gly Pro Asp Pro Pro Pro Gln Leu 
        35                  40                  45              


Ala Arg Arg Val Pro Gly Asn Val Pro Leu Asp Pro Ser Ala Ala Asp 
    50                  55                  60                  


Val Asp Asp Lys Asp Arg Ala Ser Ser Ala Tyr Gly Asp Glu Pro Pro 
65                  70                  75                  80  


Leu Pro Leu Pro Leu Leu Thr Ser Thr Ser Met Thr Ala Ser Glu Ala 
                85                  90                  95      


Ser Ser Gly Gln Gly Gly Glu Ala Gly Ala Ala Pro Gly Val Pro Ser 
            100                 105                 110         


Leu Ala Ser Ser Pro Ala Phe Ala Pro Ala Ala Thr Gly Leu Ser Pro 
        115                 120                 125             


Ser His Ser Ala Gly Ser Gly Met Ser Val Leu Ile Gln Val Pro Gln 
    130                 135                 140                 


Asn Gly Pro Ser Glu Ala Leu Ser Pro Leu Pro Leu Pro Thr Thr Ala 
145                 150                 155                 160 


Leu Asp Thr Pro Leu Asp Thr Arg Ser Ser Thr Pro Arg Pro Ala Pro 
                165                 170                 175     


Ala Pro Ala Pro Pro Ser Pro Tyr Gln Thr Val Gly Gly Leu His Gly 
            180                 185                 190         


Gly Glu His Ser Phe Leu Pro Pro Val Ser Thr Glu Gly Leu Ala Pro 
        195                 200                 205             


Pro Ala Met Gly Thr Gly Glu Gly Gly Leu Glu Gly Gly Asp Gly Gly 
    210                 215                 220                 


Ser Val Gly Phe Tyr Pro Pro Leu Ala Gln Ser Gln Thr Gln Leu Ala 
225                 230                 235                 240 


Pro Leu Pro Gly Pro Pro Pro Pro Gln Ala Gln Asp Ser Leu Gln Tyr 
                245                 250                 255     


Lys Pro Ala Ser Val Pro Glu Pro Thr Arg Met Met Glu Gly Ser Ser 
            260                 265                 270         


Asp Pro Pro Phe His Ser Ser Glu Thr Pro Arg Ala Met Gly Ile Gly 
        275                 280                 285             


Arg Gly Gly Gly Asn Ser Gln Met Val Ala Pro Ala Pro Ala Pro Ser 
    290                 295                 300                 


Leu Gln Gln Ser Ala Pro Leu Gln Gln Arg Gln Gln Leu Gln Pro Gln 
305                 310                 315                 320 


Gln His Gln Gln Phe His Ser Arg Ser His Pro Gln Val Ala Pro Leu 
                325                 330                 335     


Gln Val Gln Gln Arg Gln Gln Pro Arg Ala Leu Val Pro Gly Pro Gln 
            340                 345                 350         


Gln Gln Gln Gln His Gln Gln Gln Gln Ala Leu Tyr Ala Ser Ser Gln 
        355                 360                 365             


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln His Gln Gln Gln 
    370                 375                 380                 


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Arg His His Pro His Pro 
385                 390                 395                 400 


Gln Gln Leu Gln Gln Gln Gln Arg His Asn Gln Gln Gln Pro Leu Gln 
                405                 410                 415     


His Pro Gln Ala Gln His Arg Val Pro Pro Gln Gly Met Pro Gln His 
            420                 425                 430         


Gln His Val Arg Ala Pro Gln Gln Gln Arg Gln Gln Gln Leu Leu Pro 
        435                 440                 445             


Leu Pro Thr Ala Gly Asn Ala Val Pro Gly Gly Gln Ala Thr Gly Thr 
    450                 455                 460                 


Pro His Ala Ser Gln Leu Pro His Ala Gln Leu Ser Gln Gln Gln Gln 
465                 470                 475                 480 


Pro Ala His Ser Leu Pro Gln Arg Gln Gly Leu Gly Ala Gln Pro Leu 
                485                 490                 495     


Asn Pro Gln Asp Thr Ala Leu Arg Pro Gly Met Val Lys Asn Ile Met 
            500                 505                 510         


Val Leu Leu Gln Gln Arg Lys Pro Ala Ala Asp Pro Ser Lys Pro Leu 
        515                 520                 525             


Val Glu Thr Arg Leu Lys Glu Met Ala Ile Arg Leu Glu Asp Tyr Leu 
    530                 535                 540                 


Trp Lys Arg Ser Ser Thr Leu Ala Glu Tyr Ser Asp Leu Ser Thr Leu 
545                 550                 555                 560 


Lys His Arg Leu Gln Cys Leu Ala Val Tyr Met Gly Lys His Gln Gln 
                565                 570                 575     


Arg Gly Gln Thr Val Pro Ala Gly Ala Arg Gly Arg Gly Gly Gly Met 
            580                 585                 590         


Pro Asn Gln Ala Pro Gln Pro Gln Gly Gly Gly Leu Ser Gly Asn Thr 
        595                 600                 605             


Asn Gln Leu Gln Arg Leu Val Pro Thr Ala Asn Ala Ser Asn Ile His 
    610                 615                 620                 


Leu Pro Asn Pro His Pro Gly Gly Leu Ser Gly Gly Met Gly Ala Gly 
625                 630                 635                 640 


Gly Ala Arg Val Gly Gly Arg Gly Ser Gly Ile Gly Gly Gly Gly Leu 
                645                 650                 655     


Ile Met Gln Pro Gly Ser Ala Ile His Gly His Pro Pro Gly Pro Gln 
            660                 665                 670         


Leu Arg Gly Ser Ser Leu Pro His Gln Gly Gln Val Gln Pro Thr Ser 
        675                 680                 685             


Gln Gln Gly Ser Gln Gln Arg Arg Val Gly Thr Gly Leu Ala Pro Ala 
    690                 695                 700                 


Pro Gly Thr Gln Pro Ala Phe Leu Pro Gln Glu Gln Thr Gln Met Gln 
705                 710                 715                 720 


Gly Arg Arg Val Gly Gly Gly Gly Met Leu Pro Val Asn Gly Gly Asn 
                725                 730                 735     


Ser His Pro Pro Pro Ala Pro Gly Pro Pro Gln Gly His Leu Gln Pro 
            740                 745                 750         


Pro Gln Gln Ser Ser Gly Gln Gly Gln Ala Ala Pro Leu Asn Val Met 
        755                 760                 765             


Gly Gly Ala Gln Gln Val Gly Gly Gly Gly Asn Ala Asn Arg Gly Leu 
    770                 775                 780                 


Pro Met Pro Leu Ser Ser Gly Pro Gly Gly Thr Ala Ser Ala Ser Gln 
785                 790                 795                 800 


Lys Lys Arg Val Gln His Thr Pro Glu Gln Arg Gln Gln Ile Leu His 
                805                 810                 815     


Gln Gln Gln Gln Arg Leu Leu Tyr Leu Arg His Ala Ser Lys Cys Ile 
            820                 825                 830         


His Val Asp Gly Arg Cys Pro Gln Gly Tyr Pro Asn Cys Ile Gly Met 
        835                 840                 845             


Lys Glu Leu Trp Lys His Ile Ala Ser Cys Arg Glu Gln Arg Cys Lys 
    850                 855                 860                 


Phe Pro His Cys Val Ser Ser Arg Tyr Val Leu Ser His Tyr His Lys 
865                 870                 875                 880 


Cys Lys Asp Thr Gln Cys Pro Val Cys Gly Pro Val Arg Asn Thr Ile 
                885                 890                 895     


Arg Ser Ser Arg Ser Ser Ala His Pro Met Pro Gln Leu Gly Gln Gly 
            900                 905                 910         


Val Ala Asp Ala Asp Gly Gly Gly Glu Gly Gly Gly Ser Gly Val Gln 
        915                 920                 925             


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
    930                 935                 940                 


Gln Gln Gln Gln Gln Leu Val Ala Gln Ser Asn Gln Arg Thr Gln Gln 
945                 950                 955                 960 


Gln Gln Met Leu Ile Ala Gln Gln Pro Pro Pro Arg Arg Asp Gly Gly 
                965                 970                 975     


Arg Glu Gly Arg Arg His Asp Trp Gly Pro Gly Glu Trp Arg Gln Gly 
            980                 985                 990         


Trp Glu Gly Arg Arg Glu Gly Ala  Gly Gln Gly Gly Ser  Ser Arg Ala 
        995                 1000                 1005             


Ser Ser  Ala Gly Cys Gly Arg  Gly Ala Gly Asn Arg  Arg Ser Glu 
    1010                 1015                 1020             


Cys Arg  Trp Lys Trp Asn Glu  Pro Ala Thr Ile Ala  Ala Thr Ala 
    1025                 1030                 1035             


Thr Thr  Ala Ala Ala Thr Thr  Ala Ala Ala Ala Ala  Ala Ala Ala 
    1040                 1045                 1050             


Ala Ala  Ala Thr Pro Thr Lys  Tyr Gly Phe Arg Ala  Gly Ser Trp 
    1055                 1060                 1065             


Gly Arg  Thr Trp Gly Arg Arg  Gly Ala Ser Trp Arg  Gly Ser Pro 
    1070                 1075                 1080             


Arg Leu  Gly His Cys Gly Trp  Ser Gly Gln Gln Thr  Trp Gly Pro 
    1085                 1090                 1095             


Glu Arg  Phe Gly Glu Asn Ala  Ser Arg Ser Gln Asp  Ser Glu Trp 
    1100                 1105                 1110             


Pro His  Asp Pro Asp Gly Asn  Ala Trp Met Gly Ala  Gly Arg Ala 
    1115                 1120                 1125             


His Glu  Lys Arg Arg Leu Pro  Ser Pro Gly Ala Ser  Ala Arg Leu 
    1130                 1135                 1140             


Arg Ser  Lys Leu Leu Thr Gly  Cys Arg Arg Gly Trp  Arg Gly Arg 
    1145                 1150                 1155             


Thr Ser  Trp Arg Arg Trp Glu  Arg Gly Ala Arg Arg  Thr Cys Pro 
    1160                 1165                 1170             


Thr Phe  Arg Gly Ser Val Ala  Ala Val Arg Arg Leu  Arg Val Leu 
    1175                 1180                 1185             


Gly Glu  Leu Leu His Gly Arg  Thr Asn 
    1190                 1195         


<210>  9
<211>  66
<212>  PRT
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  TAZ zinc finger domain (PF02135), amino acids 769-833 of SEQ ID 
       NO:4

<400>  9

His Ala Ser Lys Cys Ile His Val Asp Gly Arg Cys Pro Gln Gly Tyr 
1               5                   10                  15      


Pro Asn Cys Ile Gly Met Lys Glu Leu Trp Lys His Ile Ala Ser Cys 
            20                  25                  30          


Arg Glu Gln Arg Cys Lys Phe Pro His Cys Val Ser Ser Arg Tyr Val 
        35                  40                  45              


Leu Ser His Tyr His Lys Cys Lys Asp Thr Gln Cys Pro Val Cys Gly 
    50                  55                  60                  


Pro Val 
65      


<210>  10
<211>  43
<212>  PRT
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo domain (PF00439), amino acids 1165-1245 of SEQ ID NO:4

<400>  10

Leu Pro Leu Val Val Lys Gln Leu Lys Ser Glu Tyr Gly Trp Ile Phe 
1               5                   10                  15      


Glu Glu Pro Val Asp Pro Val Lys Leu Gly Leu Pro Asp Tyr Phe Asp 
            20                  25                  30          


Val Ile Lys His Pro Met Asp Leu Gly Thr Val 
        35                  40              


<210>  11
<211>  639
<212>  DNA
<213>  Nannochloropsis oceanica


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:12

<400>  11
atgaataata acgtcagtac caataacagc aacataggca atattaacac caatcgaaat       60

cagccaccca tgccgctccc tactggccct ggcggcggtc ctcccacttc gcagcagcaa      120

cggatgcagc acacccccga gcagcgccag cagatcctgc accagcagca gcagcggttg      180

ttatacctaa gacatgcatc caagtgtatc cacgtggatg gtaggtgtcc ccaggggtac      240

cccaactgca aaggaatgaa ggagctctgg aagcacattg catcgtgccg agagcaacgg      300

tgtcaatttc cccactgcgt ctcgtcaaga tacgtcctct cccactatca caagtgcaag      360

gacacgaact gcccggtgtg tggacccgtg cgcaacacca tccggtcctc ccgcaacgca      420

tcccaaccca tgcctcagct gaatcaagga ggagtgggag gagtgatgcc ggtaccagga      480

cagccgcagc cgcagccgca gcagcaatca caacaacaac aacaacaaca acaacaacaa      540

caacagatgt acattccgca acagcaacag cagcaattac agcaagggat gggtggagga      600

agaggggggc gtggatgccg atccaagcgc agccgttga                             639


<210>  12
<211>  212
<212>  PRT
<213>  Nannochloropsis oceanica


<220>
<221>  misc_feature
<223>  Partial sequence, Bromo-1091 ortholog

<400>  12

Met Asn Asn Asn Val Ser Thr Asn Asn Ser Asn Ile Gly Asn Ile Asn 
1               5                   10                  15      


Thr Asn Arg Asn Gln Pro Pro Met Pro Leu Pro Thr Gly Pro Gly Gly 
            20                  25                  30          


Gly Pro Pro Thr Ser Gln Gln Gln Arg Met Gln His Thr Pro Glu Gln 
        35                  40                  45              


Arg Gln Gln Ile Leu His Gln Gln Gln Gln Arg Leu Leu Tyr Leu Arg 
    50                  55                  60                  


His Ala Ser Lys Cys Ile His Val Asp Gly Arg Cys Pro Gln Gly Tyr 
65                  70                  75                  80  


Pro Asn Cys Lys Gly Met Lys Glu Leu Trp Lys His Ile Ala Ser Cys 
                85                  90                  95      


Arg Glu Gln Arg Cys Gln Phe Pro His Cys Val Ser Ser Arg Tyr Val 
            100                 105                 110         


Leu Ser His Tyr His Lys Cys Lys Asp Thr Asn Cys Pro Val Cys Gly 
        115                 120                 125             


Pro Val Arg Asn Thr Ile Arg Ser Ser Arg Asn Ala Ser Gln Pro Met 
    130                 135                 140                 


Pro Gln Leu Asn Gln Gly Gly Val Gly Gly Val Met Pro Val Pro Gly 
145                 150                 155                 160 


Gln Pro Gln Pro Gln Pro Gln Gln Gln Ser Gln Gln Gln Gln Gln Gln 
                165                 170                 175     


Gln Gln Gln Gln Gln Gln Met Tyr Ile Pro Gln Gln Gln Gln Gln Gln 
            180                 185                 190         


Leu Gln Gln Gly Met Gly Gly Gly Arg Gly Gly Arg Gly Cys Arg Ser 
        195                 200                 205             


Lys Arg Ser Arg 
    210         


<210>  13
<211>  4305
<212>  DNA
<213>  Cyclotella sp.


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:14

<400>  13
atggtccgaa acgcgagtag taggttgcct ctcatggcga aaaagctcga agaacacctt       60

tacaggtcag cacaaaccaa ggaagaatat atggatcttt cttcgctcaa acgacgtctt      120

catctcattg caaaggcggt aggagttcct aagtttggct atcagagtga actccgagca      180

cccgtttctg ctgtaaacaa tgcagtgaga acaaacatga atgaaaatgt catgcaaccg      240

actgatgctg tcaatctcaa taatggtatg caacgaaata atgtcatgca gccttttaac      300

gacatgcaaa ccaacaacgc cgttcagctt agcaatggta tgcaagctaa caacaccatg      360

cagtctcaaa ataatgctat gcagcccttc ggtacgggta gtcaagaagc cagtgggccg      420

aatccccaac agactaagtc tcaattgtca cagctgcaac agataaggga aaatgtgcag      480

aatagcaccc caagcagcgc cttcgtctcg tcagagcaga tgcttagcga ccaatccttc      540

gttgcagccg caccagatag ctcccatagc aacgacacgg atacaggctc aatcgatcca      600

cgagccgaga agaagaacct tgttctccag cagcaacagc gccgtctatt gttactgcgt      660

cattctaaat tatgccgcat tggacctaac tgtgacacga aattctgccc tcaaatggtg      720

attctttgga agcatttgaa gtattgtcgg ggcaaaacat gccctgttcc ccattgcgtc      780

tcctcacgtt gtgtcttgag ccaccatcgc aattgtaaac gtagaggtct ttctgccaca      840

tgcgagattt gttatccggt cgcgaagtat attcgtcgac tgactggaga cacagacggt      900

gatcattgga acgacgattg ggataatttc tctgttttcg aggcggacgg ggacggggat      960

ggggacattg atttgtccac gagtactgca gacaacataa tgtccaactc cagcgggctc     1020

gtccatggtg aaatggcagc atcaggcttg atgcctcaac cgcaaataca gcataacgaa     1080

tctgcctccg tgaatgttat tcgagcttta caaaatgaaa tcgagcagaa gcagacagtc     1140

cttgcgcaaa ttcgtcgcca gaagggaacg ctttttagtc agaataaatt gcttctagaa     1200

cagttatctg catctaatga ggctgacaat tcgacacagc tccagaatca gataaatctt     1260

ctgcaacacc tcaatgcaca atttgaacga caagaattgt tacttgacag agaaatcagc     1320

cgtcaatccc gggagctcca acaaatgcga caatcgcagt caggggagaa ccagagcttg     1380

agtgccccat cacttcctca tagctcgggt gctccaccac taccgcctca gactaccgag     1440

aaagaaagtg cagagtcacc gaagctaaag ccgaaagctg ccaaaagccc cacggctact     1500

gaatcttcaa atgattactc tccccctttc atgtctgtgt cgtcgtcgtg tcatacggcg     1560

cttcctgctt gttctacgtc ctgtgcgtct aaaaagagat ctactgcgca gaatgatagc     1620

tgccaggaag atgacgacga ctctcgcatt cggaagttgt ttaagcagga tggcgcagtg     1680

ctcacttcaa cactatatga agacggaaat gatgcttctt cgactgccac gcaaaacgaa     1740

agtataggtg taagccagga gccatccgca gacgtgaacc ctcgtagtgt gaacgacatc     1800

ctttcatcga tgccagttgc cgcaatcgag gaacaccttg attccctgct aaattgctgt     1860

cagttgacac ctcggtgcat tgcgagaaag tgcctcccaa taattaagaa attaatcaga     1920

catgagcacg gatgggtgtt taaagatccg gttgatcccg ttgagctcgg tttagatgat     1980

tattttgaga ttgttgaaca ccccatggat cttggattag tcgaaaaaaa actcgaaaat     2040

tgtgtttata aggacatcga atcattcgag cgtgacgcaa gacttgtctt tgagaatgct     2100

attttattca atggcgatga aaatgaaatc ggaatatggg caagacaact attggatatt     2160

ttcaacgacg aagtgaaatc tctcatgaaa ggactgggaa tgagcctgaa aacggaagct     2220

gtagaggcct gcggaaacga ctctacatgt tccctgtgtg ggaaatttag gcttttattt     2280

gagccaccgg cactttactg cagtggagtt tgcggaatgc aaaaaatacg acgtaatggc     2340

ttttactaca ctgaccagta caagcaaaac cattggtgtg accgatgctt taccgggtta     2400

aaagaagatc aaacaatcca gcttgacgat ggtaaagaga ccaaaaagtc tcttctcgtt     2460

agaatgaaaa atgatttgac accagaagaa cagtgggtcc aatgtgatgt ctgtcacgag     2520

tggtgccatc aaatttgcgc cctttttaac gctagtcgaa acaacccggc aaagacattt     2580

tcttgcccca agtgtgttgt tgcgaagccg aaaaaggagc agaacgaaaa gtttgagtac     2640

tcatcgttca aagatgccag tgctctaccc gagtgcaagt taagtcgtgc aatagaatca     2700

ggcttgtttg atacgctttc ggaggaatac gaaaagatcg ccaatcagag aacatgtgat     2760

gtatcccaag tcgaaaaagc tgatggcctc tgtgttcgtg tagtattgtc acaggagaag     2820

aaacacaagg ttcgagaggg tatgcagtca aggtattcta acaagggatt tccttcagag     2880

ttcccagtaa cgtcaaaatg catcttgctg ttccaaaaaa ttcacggggc tgatgttctc     2940

ctttttggga tgtatgtcta cgaatatggc gacaaatgcc cagcaccaaa cagaagacga     3000

gtgtatattt cttacttgga ttcggttcac taccttcagc cttcattata taggactctt     3060

acctatcaga caatcatagt ggaatacctt cgatttgtca gatctcgagg gtttcatact     3120

gcccatattt ggagctgccc accttctaaa ggcgacgagt atatttttta ctgtcatcca     3180

ccacaacagt tgataccaaa ggatgatatg ttgtgcgcct ggtacgttga gactttgaag     3240

aaggcccaag aaaaaggcgt cgtcctagaa acaaggacac tctatgatga atacttcaaa     3300

gaccatggcg tcaaccctga gaccggtgaa ccgtttgatc caaccagcat accatatttc     3360

gacggcgact acattcctgg agagattgaa aagatcatca caattctcaa caaggacgaa     3420

acactgcgtg aaggagcgaa gtgccatgat tcccactcga agtcgaatgc tcccaatggc     3480

caaaagatag aaggtaaaag acgaggcact aggagcaacc caggcgactt agtaaatcag     3540

gatcgtgaca aagtgatgaa tcgtctagat ttggccttgt ccaggatgaa gcaaaatttc     3600

ttcgtcgctc aattgcttag tgataacttc atcaaagcag tcgagattgg tgttgatgtt     3660

tcccaatggg tcgaagaaat aaagtccgat tcgatgatta aaccatcaaa acagattggc     3720

aaaagcccag atcttctaga tttgcgaacg gttgatgcta ccgcaaagac cccgccgatt     3780

ccaccaacgt caagcgttca agtaataggc aacactgttg atgaagaccc ctcgatagaa     3840

caagaatgtc tcgatacacg catcgcgttt ttgaactttt gccagaagaa ttactatcag     3900

tttgatgaat tacgccgtgc caagtacagt acaatgatgc ttctaagtga gttgcacgat     3960

cctcgtgcag agagggagca gcaattcaag gtgcatctac aagtaatcgc acatgcagcg     4020

tcttgtcaag gctgcgcatc caaaaattgc acacgaatga agtccctttt tgatcacgtc     4080

agaaagtgtg acgtaacata ccgacatgga tgcaaagttt gtgtgcgtct ttttatgcta     4140

ttgaccaaac acgcacgcga ttgtgtcaca gagggaacat gctgcattcc tttttgtgaa     4200

cgcatcaggg aaaggcacag aagactgttg agacagcagc agcttttgga cgacaggcga     4260

cgtgatgagc aaaataacag gcatcagcaa gaggaagcag tctaa                     4305


<210>  14
<211>  1434
<212>  PRT
<213>  Cyclotella sp.


<220>
<221>  misc_feature
<223>  translation product 5091230

<400>  14

Met Val Arg Asn Ala Ser Ser Arg Leu Pro Leu Met Ala Lys Lys Leu 
1               5                   10                  15      


Glu Glu His Leu Tyr Arg Ser Ala Gln Thr Lys Glu Glu Tyr Met Asp 
            20                  25                  30          


Leu Ser Ser Leu Lys Arg Arg Leu His Leu Ile Ala Lys Ala Val Gly 
        35                  40                  45              


Val Pro Lys Phe Gly Tyr Gln Ser Glu Leu Arg Ala Pro Val Ser Ala 
    50                  55                  60                  


Val Asn Asn Ala Val Arg Thr Asn Met Asn Glu Asn Val Met Gln Pro 
65                  70                  75                  80  


Thr Asp Ala Val Asn Leu Asn Asn Gly Met Gln Arg Asn Asn Val Met 
                85                  90                  95      


Gln Pro Phe Asn Asp Met Gln Thr Asn Asn Ala Val Gln Leu Ser Asn 
            100                 105                 110         


Gly Met Gln Ala Asn Asn Thr Met Gln Ser Gln Asn Asn Ala Met Gln 
        115                 120                 125             


Pro Phe Gly Thr Gly Ser Gln Glu Ala Ser Gly Pro Asn Pro Gln Gln 
    130                 135                 140                 


Thr Lys Ser Gln Leu Ser Gln Leu Gln Gln Ile Arg Glu Asn Val Gln 
145                 150                 155                 160 


Asn Ser Thr Pro Ser Ser Ala Phe Val Ser Ser Glu Gln Met Leu Ser 
                165                 170                 175     


Asp Gln Ser Phe Val Ala Ala Ala Pro Asp Ser Ser His Ser Asn Asp 
            180                 185                 190         


Thr Asp Thr Gly Ser Ile Asp Pro Arg Ala Glu Lys Lys Asn Leu Val 
        195                 200                 205             


Leu Gln Gln Gln Gln Arg Arg Leu Leu Leu Leu Arg His Ser Lys Leu 
    210                 215                 220                 


Cys Arg Ile Gly Pro Asn Cys Asp Thr Lys Phe Cys Pro Gln Met Val 
225                 230                 235                 240 


Ile Leu Trp Lys His Leu Lys Tyr Cys Arg Gly Lys Thr Cys Pro Val 
                245                 250                 255     


Pro His Cys Val Ser Ser Arg Cys Val Leu Ser His His Arg Asn Cys 
            260                 265                 270         


Lys Arg Arg Gly Leu Ser Ala Thr Cys Glu Ile Cys Tyr Pro Val Ala 
        275                 280                 285             


Lys Tyr Ile Arg Arg Leu Thr Gly Asp Thr Asp Gly Asp His Trp Asn 
    290                 295                 300                 


Asp Asp Trp Asp Asn Phe Ser Val Phe Glu Ala Asp Gly Asp Gly Asp 
305                 310                 315                 320 


Gly Asp Ile Asp Leu Ser Thr Ser Thr Ala Asp Asn Ile Met Ser Asn 
                325                 330                 335     


Ser Ser Gly Leu Val His Gly Glu Met Ala Ala Ser Gly Leu Met Pro 
            340                 345                 350         


Gln Pro Gln Ile Gln His Asn Glu Ser Ala Ser Val Asn Val Ile Arg 
        355                 360                 365             


Ala Leu Gln Asn Glu Ile Glu Gln Lys Gln Thr Val Leu Ala Gln Ile 
    370                 375                 380                 


Arg Arg Gln Lys Gly Thr Leu Phe Ser Gln Asn Lys Leu Leu Leu Glu 
385                 390                 395                 400 


Gln Leu Ser Ala Ser Asn Glu Ala Asp Asn Ser Thr Gln Leu Gln Asn 
                405                 410                 415     


Gln Ile Asn Leu Leu Gln His Leu Asn Ala Gln Phe Glu Arg Gln Glu 
            420                 425                 430         


Leu Leu Leu Asp Arg Glu Ile Ser Arg Gln Ser Arg Glu Leu Gln Gln 
        435                 440                 445             


Met Arg Gln Ser Gln Ser Gly Glu Asn Gln Ser Leu Ser Ala Pro Ser 
    450                 455                 460                 


Leu Pro His Ser Ser Gly Ala Pro Pro Leu Pro Pro Gln Thr Thr Glu 
465                 470                 475                 480 


Lys Glu Ser Ala Glu Ser Pro Lys Leu Lys Pro Lys Ala Ala Lys Ser 
                485                 490                 495     


Pro Thr Ala Thr Glu Ser Ser Asn Asp Tyr Ser Pro Pro Phe Met Ser 
            500                 505                 510         


Val Ser Ser Ser Cys His Thr Ala Leu Pro Ala Cys Ser Thr Ser Cys 
        515                 520                 525             


Ala Ser Lys Lys Arg Ser Thr Ala Gln Asn Asp Ser Cys Gln Glu Asp 
    530                 535                 540                 


Asp Asp Asp Ser Arg Ile Arg Lys Leu Phe Lys Gln Asp Gly Ala Val 
545                 550                 555                 560 


Leu Thr Ser Thr Leu Tyr Glu Asp Gly Asn Asp Ala Ser Ser Thr Ala 
                565                 570                 575     


Thr Gln Asn Glu Ser Ile Gly Val Ser Gln Glu Pro Ser Ala Asp Val 
            580                 585                 590         


Asn Pro Arg Ser Val Asn Asp Ile Leu Ser Ser Met Pro Val Ala Ala 
        595                 600                 605             


Ile Glu Glu His Leu Asp Ser Leu Leu Asn Cys Cys Gln Leu Thr Pro 
    610                 615                 620                 


Arg Cys Ile Ala Arg Lys Cys Leu Pro Ile Ile Lys Lys Leu Ile Arg 
625                 630                 635                 640 


His Glu His Gly Trp Val Phe Lys Asp Pro Val Asp Pro Val Glu Leu 
                645                 650                 655     


Gly Leu Asp Asp Tyr Phe Glu Ile Val Glu His Pro Met Asp Leu Gly 
            660                 665                 670         


Leu Val Glu Lys Lys Leu Glu Asn Cys Val Tyr Lys Asp Ile Glu Ser 
        675                 680                 685             


Phe Glu Arg Asp Ala Arg Leu Val Phe Glu Asn Ala Ile Leu Phe Asn 
    690                 695                 700                 


Gly Asp Glu Asn Glu Ile Gly Ile Trp Ala Arg Gln Leu Leu Asp Ile 
705                 710                 715                 720 


Phe Asn Asp Glu Val Lys Ser Leu Met Lys Gly Leu Gly Met Ser Leu 
                725                 730                 735     


Lys Thr Glu Ala Val Glu Ala Cys Gly Asn Asp Ser Thr Cys Ser Leu 
            740                 745                 750         


Cys Gly Lys Phe Arg Leu Leu Phe Glu Pro Pro Ala Leu Tyr Cys Ser 
        755                 760                 765             


Gly Val Cys Gly Met Gln Lys Ile Arg Arg Asn Gly Phe Tyr Tyr Thr 
    770                 775                 780                 


Asp Gln Tyr Lys Gln Asn His Trp Cys Asp Arg Cys Phe Thr Gly Leu 
785                 790                 795                 800 


Lys Glu Asp Gln Thr Ile Gln Leu Asp Asp Gly Lys Glu Thr Lys Lys 
                805                 810                 815     


Ser Leu Leu Val Arg Met Lys Asn Asp Leu Thr Pro Glu Glu Gln Trp 
            820                 825                 830         


Val Gln Cys Asp Val Cys His Glu Trp Cys His Gln Ile Cys Ala Leu 
        835                 840                 845             


Phe Asn Ala Ser Arg Asn Asn Pro Ala Lys Thr Phe Ser Cys Pro Lys 
    850                 855                 860                 


Cys Val Val Ala Lys Pro Lys Lys Glu Gln Asn Glu Lys Phe Glu Tyr 
865                 870                 875                 880 


Ser Ser Phe Lys Asp Ala Ser Ala Leu Pro Glu Cys Lys Leu Ser Arg 
                885                 890                 895     


Ala Ile Glu Ser Gly Leu Phe Asp Thr Leu Ser Glu Glu Tyr Glu Lys 
            900                 905                 910         


Ile Ala Asn Gln Arg Thr Cys Asp Val Ser Gln Val Glu Lys Ala Asp 
        915                 920                 925             


Gly Leu Cys Val Arg Val Val Leu Ser Gln Glu Lys Lys His Lys Val 
    930                 935                 940                 


Arg Glu Gly Met Gln Ser Arg Tyr Ser Asn Lys Gly Phe Pro Ser Glu 
945                 950                 955                 960 


Phe Pro Val Thr Ser Lys Cys Ile Leu Leu Phe Gln Lys Ile His Gly 
                965                 970                 975     


Ala Asp Val Leu Leu Phe Gly Met Tyr Val Tyr Glu Tyr Gly Asp Lys 
            980                 985                 990         


Cys Pro Ala Pro Asn Arg Arg Arg  Val Tyr Ile Ser Tyr  Leu Asp Ser 
        995                 1000                 1005             


Val His  Tyr Leu Gln Pro Ser  Leu Tyr Arg Thr Leu  Thr Tyr Gln 
    1010                 1015                 1020             


Thr Ile  Ile Val Glu Tyr Leu  Arg Phe Val Arg Ser  Arg Gly Phe 
    1025                 1030                 1035             


His Thr  Ala His Ile Trp Ser  Cys Pro Pro Ser Lys  Gly Asp Glu 
    1040                 1045                 1050             


Tyr Ile  Phe Tyr Cys His Pro  Pro Gln Gln Leu Ile  Pro Lys Asp 
    1055                 1060                 1065             


Asp Met  Leu Cys Ala Trp Tyr  Val Glu Thr Leu Lys  Lys Ala Gln 
    1070                 1075                 1080             


Glu Lys  Gly Val Val Leu Glu  Thr Arg Thr Leu Tyr  Asp Glu Tyr 
    1085                 1090                 1095             


Phe Lys  Asp His Gly Val Asn  Pro Glu Thr Gly Glu  Pro Phe Asp 
    1100                 1105                 1110             


Pro Thr  Ser Ile Pro Tyr Phe  Asp Gly Asp Tyr Ile  Pro Gly Glu 
    1115                 1120                 1125             


Ile Glu  Lys Ile Ile Thr Ile  Leu Asn Lys Asp Glu  Thr Leu Arg 
    1130                 1135                 1140             


Glu Gly  Ala Lys Cys His Asp  Ser His Ser Lys Ser  Asn Ala Pro 
    1145                 1150                 1155             


Asn Gly  Gln Lys Ile Glu Gly  Lys Arg Arg Gly Thr  Arg Ser Asn 
    1160                 1165                 1170             


Pro Gly  Asp Leu Val Asn Gln  Asp Arg Asp Lys Val  Met Asn Arg 
    1175                 1180                 1185             


Leu Asp  Leu Ala Leu Ser Arg  Met Lys Gln Asn Phe  Phe Val Ala 
    1190                 1195                 1200             


Gln Leu  Leu Ser Asp Asn Phe  Ile Lys Ala Val Glu  Ile Gly Val 
    1205                 1210                 1215             


Asp Val  Ser Gln Trp Val Glu  Glu Ile Lys Ser Asp  Ser Met Ile 
    1220                 1225                 1230             


Lys Pro  Ser Lys Gln Ile Gly  Lys Ser Pro Asp Leu  Leu Asp Leu 
    1235                 1240                 1245             


Arg Thr  Val Asp Ala Thr Ala  Lys Thr Pro Pro Ile  Pro Pro Thr 
    1250                 1255                 1260             


Ser Ser  Val Gln Val Ile Gly  Asn Thr Val Asp Glu  Asp Pro Ser 
    1265                 1270                 1275             


Ile Glu  Gln Glu Cys Leu Asp  Thr Arg Ile Ala Phe  Leu Asn Phe 
    1280                 1285                 1290             


Cys Gln  Lys Asn Tyr Tyr Gln  Phe Asp Glu Leu Arg  Arg Ala Lys 
    1295                 1300                 1305             


Tyr Ser  Thr Met Met Leu Leu  Ser Glu Leu His Asp  Pro Arg Ala 
    1310                 1315                 1320             


Glu Arg  Glu Gln Gln Phe Lys  Val His Leu Gln Val  Ile Ala His 
    1325                 1330                 1335             


Ala Ala  Ser Cys Gln Gly Cys  Ala Ser Lys Asn Cys  Thr Arg Met 
    1340                 1345                 1350             


Lys Ser  Leu Phe Asp His Val  Arg Lys Cys Asp Val  Thr Tyr Arg 
    1355                 1360                 1365             


His Gly  Cys Lys Val Cys Val  Arg Leu Phe Met Leu  Leu Thr Lys 
    1370                 1375                 1380             


His Ala  Arg Asp Cys Val Thr  Glu Gly Thr Cys Cys  Ile Pro Phe 
    1385                 1390                 1395             


Cys Glu  Arg Ile Arg Glu Arg  His Arg Arg Leu Leu  Arg Gln Gln 
    1400                 1405                 1410             


Gln Leu  Leu Asp Asp Arg Arg  Arg Asp Glu Gln Asn  Asn Arg His 
    1415                 1420                 1425             


Gln Gln  Glu Glu Ala Val 
    1430                 


<210>  15
<211>  4647
<212>  DNA
<213>  Cyclotella sp.


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:16

<400>  15
atgcttaata tctcgaagag ccagagccag tcatcagttt ttatgtttac cgtcttcatg       60

tcgctttcgt ggggacagcg agcaatcctt tgctgtggcc aaaaaactac cagttttctt      120

cacactccgg gctttggacc aggcctttct catcggttgg caacgaaagt tcaccgccgt      180

caaagcactc cattttacac aaaacaatca ttgctgtttt catcaactgc gcccaacaag      240

aaagaggaag tcgatcataa ttatgacttt gaccgcatcc tcccgtttga caaacattct      300

cacaattcaa tcaaaatagc agtcccacag aacgagcaag cagatccgag tgaagatctt      360

tttgacagcg aaacattcct ctcgaaacta gaagccaccg tagccaccgc caaacaactc      420

cacaaaactg ccatttggat cactgtgccc atcacaagag ctggtctcat ggaacatgca      480

cacaaatgtg ggttcacgtt tcaccacgcc gaaggaaaca cggccactct gagcaagtgg      540

ctatccgaag atgaagaaag ccgaatcccc acgtttgcta ctcaccaggt aggcgttggc      600

gccgtagtta tcaatcgcga aacggaggaa atactttgcg tgagagagaa acgaaacaac      660

taccgtccat ggaaaatgcc tggcggtctt gctgaactgg gcgaagactt ggatatcgca      720

gtgataagag aagtttacga agaaactgga attcaatgta ggtttctcag tgttcttggt      780

gtaagacata ctcatggatt acaattcggt cgaagtgact tatactttgt ctgtcgtttg      840

gagcctgtga ccgatgagag cgggaaagtt gcgcagccag tgccacaaga aggagaaatc      900

gaagcggctg catggattcc gctggatgag tacagagata tggtaaacaa ccctgatagt      960

aatattggac atccaatgat gcgtcacatt atgaggattg ttgatcaggg cgactgggac     1020

aagtttgaca ttcagagaac ggttcgtcaa aagactgcga atggtcagta cgcctcccaa     1080

cctcccccga ttcctcaaca gcaacaacaa ccggccaatc tacaacagcc ccagcaacct     1140

ccacctccgg caacgcaaca aattgtccct gcctcgggtc ctaaaattaa ggcagggtac     1200

gtgtacagtg gaggaaatcc tgtaccagca gcgtccaaac cgggtggtgt agccctctcg     1260

aatgggaagg ttcttgcagc cccatcgagt tctggtccca aacctcaaga agatcatacc     1320

ctcataaatt gttttaccct ggaacaaatc gaaacccaca tcaagtctct taacaagggt     1380

ctgcaactcc ccctagcaaa gttgaaaaca aaatgcggtg aattgctcaa gggcttacaa     1440

tctcatcagc atggatgggt atttaacagc cctgtggacc ctgtagagct tggactacct     1500

gattatttcg aagtcataaa gaaccccatg gatttgggca cagtgaagaa acgcctcgac     1560

aacggattgt atcggagcat cagagaggtt gaggctgata ttaatctgac atttgataac     1620

gcaatgctct ataatcctga aggatcagta gtctggagca tggcgaagga gctcaaggat     1680

aaattcgaga cagattttgc tgcacttatg aaagtccttc acgaagagga ggaagagaag     1740

cgcaagaacg gtgatgcttg ttcactttgc ggatgtgaaa agctactttt tgaaccccct     1800

gtcttctatt gcaatgggtt gtcatgtcgt tccaagcgaa tcagaaggaa cagttactat     1860

tttgttgggg gaaataatca ataccattgg tgccaacctt gttatgagga actgaaggaa     1920

agccaagcaa ttgaactgcc agatatgact ctgaagaaaa gtcaactgga caagaagaag     1980

aataatgagg ttccagagga aagttgggtc caatgcgatc gatgtgagag atggattcat     2040

caaatttgtg ctctcttcaa tactaggcaa aataagaatc aacagtctga gtttgtctgc     2100

ccaagttgta caatcaatga taggaagaag aagggttcgt tgggaccaac atccactact     2160

cccatggcag aggatttgcc caggacaaaa ctttctgagc atctagaaaa gcatgtgaga     2220

gagaaattca agtctgaaat ggaacgtttg gcaaaggaaa gggcagaagc agagggcatc     2280

tccatggaag aagccatgcg aataacttcc gacggaggcg gtgagattta cattcgacag     2340

gtgacttcaa tgagccgaac attggaagtc cgtgaacgaa tgcttaaacg ttactcattc     2400

aaaaactatc ccaatgagtt taagtaccgc tgcaaatgcg tcattgtctt ccaaaacctg     2460

gatggtgtgg atgtcattct ttttggcctg tacgtctacg agcatgacga aaccaatgcc     2520

ccccccaatc agcgtgctgt ctatatttcc tatctggaca gtgtctacta catgagacct     2580

cgcaagatgc gaactttcgt gtatcatgag ttattgattt cgtacatgga ctatgtccgt     2640

tgcaaaggct actccactgc gcacatttgg gcatgccctc cgctcaaagg cgatgactat     2700

attttgtttg caaagccgga ggaccagaaa actcctaaag acgatcgtct tcgccaatgg     2760

taccttgaca tgctgaagga ttgccaacga aggggcatcg tgggaaaggt caccaatgca     2820

tatgatttgt atttctcaga cccgaagaac gatgcatcag tactgcctta catggaggga     2880

gactatttcc cagctgagct tgaaaacatc atcaaagatc tagaggaagg caaaaatctt     2940

agcaagaaac cagacaaatc ggcttcaaag aagggaaaga aagaaaagaa atccaagaca     3000

aagaaggcgg gcagtcgagg tggaactcgt tcagcaggct tggacgagga cgctcttgca     3060

gcaagtggta ttcttcaaga gggtgtggat atcaagagcc tccaagctgg tggaagagac     3120

gctgtaatga agaagcttgg agacactatt taccccatga aggagagttt ccttgttgca     3180

tttcttgatt gggatggagc gaaagaagag aaccgagttg tgccgaaaga cattatggaa     3240

tacagagagc agcatgggat tgttgtcagg aaggcttctg gcgtgcagga aaagaaagac     3300

ggcgatagca ctaaacccgc agcagagtgt tcaaatcttc cagccataaa ggaagagagt     3360

ccgaaagagg tggcagaatc ctccattgaa aaagccgctg aatcaactgc gtctccatcc     3420

agctctgctc caaccaaaga agattctgca tcaaaggatg ggtcttctgc tgtaaaagag     3480

gaatcagatc ttgctcctgc caacaaccca tctgagtcca ctccattgca tgatctggca     3540

tctggcagcg aggaaaagaa agaggaagtt aacagcgaaa atcccgatgg ggctaccaaa     3600

gaatccgagt ctgccccgac agaaggaagc agtgcaagcc cccaaggcgt agctgaaaag     3660

cctaatagag gcgaggctga aacggccaag ggtggtaacg gtgatgtcga aatggaagac     3720

tccaagagtt cggagtcaaa agaagataat gggaaggaag ctgagacgaa aggtgttgaa     3780

gctactactg gagaggcacc aaaacaagca atagtagcca gggagggtaa attcgctgca     3840

atggagaaaa ttaaaaagga aatgaaagtg gaacccgaac cggagccatc atcctcaacc     3900

tctgatcaaa ttgcttccaa gagtgtcacg aaagatagca agggacgact agttaaggtt     3960

atcgatgacg atgacgagga aatggattgc gagttcctca ataaccgcca gcttttcctg     4020

aacctttgcc aaggcaatca ctaccagttt gaccagctcc gtagagccaa acacacatcc     4080

atgatggttt tgtggcatct gcataacaga gatgcaccaa agtttgttca gcaatgtgct     4140

gtgtgctcac gtgaaatcct gcaaggaatg cgttaccatt gccccacttg tgctgacttt     4200

gatcagtgct acgaatgcat gtccaacccg aatgttcctc ggcatcagca tccactcaaa     4260

ccaataccag taggtagcca gcagagttcg ttgacacccg agcaaagaaa agaaaggcag     4320

cgtagcattc aactccacat gaccttgttg ttgcatgctg ccacgtgcaa atcgtctaaa     4380

tgtgcctctg caaactgtgc gaaaatgaaa ggtctattga agcatggttc ccaatgccaa     4440

attaaagctg ccggaggatg tcatgtctgt aaacgcattt gggccctcct ccaaattcat     4500

gcaaggcagt gcaaacaaga caactgtcca gtgcctaatt gtttagctat ccgagagcga     4560

ttccgacagt tgaatttgca gcagcaggca atggatgaca ggcgtcgcca gatgatgaac     4620

cagacttatc atcagcaggc gcgctga                                         4647


<210>  16
<211>  1548
<212>  PRT
<213>  Cyclotella sp.


<220>
<221>  misc_feature
<223>  translation product 5092334

<400>  16

Met Leu Asn Ile Ser Lys Ser Gln Ser Gln Ser Ser Val Phe Met Phe 
1               5                   10                  15      


Thr Val Phe Met Ser Leu Ser Trp Gly Gln Arg Ala Ile Leu Cys Cys 
            20                  25                  30          


Gly Gln Lys Thr Thr Ser Phe Leu His Thr Pro Gly Phe Gly Pro Gly 
        35                  40                  45              


Leu Ser His Arg Leu Ala Thr Lys Val His Arg Arg Gln Ser Thr Pro 
    50                  55                  60                  


Phe Tyr Thr Lys Gln Ser Leu Leu Phe Ser Ser Thr Ala Pro Asn Lys 
65                  70                  75                  80  


Lys Glu Glu Val Asp His Asn Tyr Asp Phe Asp Arg Ile Leu Pro Phe 
                85                  90                  95      


Asp Lys His Ser His Asn Ser Ile Lys Ile Ala Val Pro Gln Asn Glu 
            100                 105                 110         


Gln Ala Asp Pro Ser Glu Asp Leu Phe Asp Ser Glu Thr Phe Leu Ser 
        115                 120                 125             


Lys Leu Glu Ala Thr Val Ala Thr Ala Lys Gln Leu His Lys Thr Ala 
    130                 135                 140                 


Ile Trp Ile Thr Val Pro Ile Thr Arg Ala Gly Leu Met Glu His Ala 
145                 150                 155                 160 


His Lys Cys Gly Phe Thr Phe His His Ala Glu Gly Asn Thr Ala Thr 
                165                 170                 175     


Leu Ser Lys Trp Leu Ser Glu Asp Glu Glu Ser Arg Ile Pro Thr Phe 
            180                 185                 190         


Ala Thr His Gln Val Gly Val Gly Ala Val Val Ile Asn Arg Glu Thr 
        195                 200                 205             


Glu Glu Ile Leu Cys Val Arg Glu Lys Arg Asn Asn Tyr Arg Pro Trp 
    210                 215                 220                 


Lys Met Pro Gly Gly Leu Ala Glu Leu Gly Glu Asp Leu Asp Ile Ala 
225                 230                 235                 240 


Val Ile Arg Glu Val Tyr Glu Glu Thr Gly Ile Gln Cys Arg Phe Leu 
                245                 250                 255     


Ser Val Leu Gly Val Arg His Thr His Gly Leu Gln Phe Gly Arg Ser 
            260                 265                 270         


Asp Leu Tyr Phe Val Cys Arg Leu Glu Pro Val Thr Asp Glu Ser Gly 
        275                 280                 285             


Lys Val Ala Gln Pro Val Pro Gln Glu Gly Glu Ile Glu Ala Ala Ala 
    290                 295                 300                 


Trp Ile Pro Leu Asp Glu Tyr Arg Asp Met Val Asn Asn Pro Asp Ser 
305                 310                 315                 320 


Asn Ile Gly His Pro Met Met Arg His Ile Met Arg Ile Val Asp Gln 
                325                 330                 335     


Gly Asp Trp Asp Lys Phe Asp Ile Gln Arg Thr Val Arg Gln Lys Thr 
            340                 345                 350         


Ala Asn Gly Gln Tyr Ala Ser Gln Pro Pro Pro Ile Pro Gln Gln Gln 
        355                 360                 365             


Gln Gln Pro Ala Asn Leu Gln Gln Pro Gln Gln Pro Pro Pro Pro Ala 
    370                 375                 380                 


Thr Gln Gln Ile Val Pro Ala Ser Gly Pro Lys Ile Lys Ala Gly Tyr 
385                 390                 395                 400 


Val Tyr Ser Gly Gly Asn Pro Val Pro Ala Ala Ser Lys Pro Gly Gly 
                405                 410                 415     


Val Ala Leu Ser Asn Gly Lys Val Leu Ala Ala Pro Ser Ser Ser Gly 
            420                 425                 430         


Pro Lys Pro Gln Glu Asp His Thr Leu Ile Asn Cys Phe Thr Leu Glu 
        435                 440                 445             


Gln Ile Glu Thr His Ile Lys Ser Leu Asn Lys Gly Leu Gln Leu Pro 
    450                 455                 460                 


Leu Ala Lys Leu Lys Thr Lys Cys Gly Glu Leu Leu Lys Gly Leu Gln 
465                 470                 475                 480 


Ser His Gln His Gly Trp Val Phe Asn Ser Pro Val Asp Pro Val Glu 
                485                 490                 495     


Leu Gly Leu Pro Asp Tyr Phe Glu Val Ile Lys Asn Pro Met Asp Leu 
            500                 505                 510         


Gly Thr Val Lys Lys Arg Leu Asp Asn Gly Leu Tyr Arg Ser Ile Arg 
        515                 520                 525             


Glu Val Glu Ala Asp Ile Asn Leu Thr Phe Asp Asn Ala Met Leu Tyr 
    530                 535                 540                 


Asn Pro Glu Gly Ser Val Val Trp Ser Met Ala Lys Glu Leu Lys Asp 
545                 550                 555                 560 


Lys Phe Glu Thr Asp Phe Ala Ala Leu Met Lys Val Leu His Glu Glu 
                565                 570                 575     


Glu Glu Glu Lys Arg Lys Asn Gly Asp Ala Cys Ser Leu Cys Gly Cys 
            580                 585                 590         


Glu Lys Leu Leu Phe Glu Pro Pro Val Phe Tyr Cys Asn Gly Leu Ser 
        595                 600                 605             


Cys Arg Ser Lys Arg Ile Arg Arg Asn Ser Tyr Tyr Phe Val Gly Gly 
    610                 615                 620                 


Asn Asn Gln Tyr His Trp Cys Gln Pro Cys Tyr Glu Glu Leu Lys Glu 
625                 630                 635                 640 


Ser Gln Ala Ile Glu Leu Pro Asp Met Thr Leu Lys Lys Ser Gln Leu 
                645                 650                 655     


Asp Lys Lys Lys Asn Asn Glu Val Pro Glu Glu Ser Trp Val Gln Cys 
            660                 665                 670         


Asp Arg Cys Glu Arg Trp Ile His Gln Ile Cys Ala Leu Phe Asn Thr 
        675                 680                 685             


Arg Gln Asn Lys Asn Gln Gln Ser Glu Phe Val Cys Pro Ser Cys Thr 
    690                 695                 700                 


Ile Asn Asp Arg Lys Lys Lys Gly Ser Leu Gly Pro Thr Ser Thr Thr 
705                 710                 715                 720 


Pro Met Ala Glu Asp Leu Pro Arg Thr Lys Leu Ser Glu His Leu Glu 
                725                 730                 735     


Lys His Val Arg Glu Lys Phe Lys Ser Glu Met Glu Arg Leu Ala Lys 
            740                 745                 750         


Glu Arg Ala Glu Ala Glu Gly Ile Ser Met Glu Glu Ala Met Arg Ile 
        755                 760                 765             


Thr Ser Asp Gly Gly Gly Glu Ile Tyr Ile Arg Gln Val Thr Ser Met 
    770                 775                 780                 


Ser Arg Thr Leu Glu Val Arg Glu Arg Met Leu Lys Arg Tyr Ser Phe 
785                 790                 795                 800 


Lys Asn Tyr Pro Asn Glu Phe Lys Tyr Arg Cys Lys Cys Val Ile Val 
                805                 810                 815     


Phe Gln Asn Leu Asp Gly Val Asp Val Ile Leu Phe Gly Leu Tyr Val 
            820                 825                 830         


Tyr Glu His Asp Glu Thr Asn Ala Pro Pro Asn Gln Arg Ala Val Tyr 
        835                 840                 845             


Ile Ser Tyr Leu Asp Ser Val Tyr Tyr Met Arg Pro Arg Lys Met Arg 
    850                 855                 860                 


Thr Phe Val Tyr His Glu Leu Leu Ile Ser Tyr Met Asp Tyr Val Arg 
865                 870                 875                 880 


Cys Lys Gly Tyr Ser Thr Ala His Ile Trp Ala Cys Pro Pro Leu Lys 
                885                 890                 895     


Gly Asp Asp Tyr Ile Leu Phe Ala Lys Pro Glu Asp Gln Lys Thr Pro 
            900                 905                 910         


Lys Asp Asp Arg Leu Arg Gln Trp Tyr Leu Asp Met Leu Lys Asp Cys 
        915                 920                 925             


Gln Arg Arg Gly Ile Val Gly Lys Val Thr Asn Ala Tyr Asp Leu Tyr 
    930                 935                 940                 


Phe Ser Asp Pro Lys Asn Asp Ala Ser Val Leu Pro Tyr Met Glu Gly 
945                 950                 955                 960 


Asp Tyr Phe Pro Ala Glu Leu Glu Asn Ile Ile Lys Asp Leu Glu Glu 
                965                 970                 975     


Gly Lys Asn Leu Ser Lys Lys Pro Asp Lys Ser Ala Ser Lys Lys Gly 
            980                 985                 990         


Lys Lys Glu Lys Lys Ser Lys Thr  Lys Lys Ala Gly Ser  Arg Gly Gly 
        995                 1000                 1005             


Thr Arg  Ser Ala Gly Leu Asp  Glu Asp Ala Leu Ala  Ala Ser Gly 
    1010                 1015                 1020             


Ile Leu  Gln Glu Gly Val Asp  Ile Lys Ser Leu Gln  Ala Gly Gly 
    1025                 1030                 1035             


Arg Asp  Ala Val Met Lys Lys  Leu Gly Asp Thr Ile  Tyr Pro Met 
    1040                 1045                 1050             


Lys Glu  Ser Phe Leu Val Ala  Phe Leu Asp Trp Asp  Gly Ala Lys 
    1055                 1060                 1065             


Glu Glu  Asn Arg Val Val Pro  Lys Asp Ile Met Glu  Tyr Arg Glu 
    1070                 1075                 1080             


Gln His  Gly Ile Val Val Arg  Lys Ala Ser Gly Val  Gln Glu Lys 
    1085                 1090                 1095             


Lys Asp  Gly Asp Ser Thr Lys  Pro Ala Ala Glu Cys  Ser Asn Leu 
    1100                 1105                 1110             


Pro Ala  Ile Lys Glu Glu Ser  Pro Lys Glu Val Ala  Glu Ser Ser 
    1115                 1120                 1125             


Ile Glu  Lys Ala Ala Glu Ser  Thr Ala Ser Pro Ser  Ser Ser Ala 
    1130                 1135                 1140             


Pro Thr  Lys Glu Asp Ser Ala  Ser Lys Asp Gly Ser  Ser Ala Val 
    1145                 1150                 1155             


Lys Glu  Glu Ser Asp Leu Ala  Pro Ala Asn Asn Pro  Ser Glu Ser 
    1160                 1165                 1170             


Thr Pro  Leu His Asp Leu Ala  Ser Gly Ser Glu Glu  Lys Lys Glu 
    1175                 1180                 1185             


Glu Val  Asn Ser Glu Asn Pro  Asp Gly Ala Thr Lys  Glu Ser Glu 
    1190                 1195                 1200             


Ser Ala  Pro Thr Glu Gly Ser  Ser Ala Ser Pro Gln  Gly Val Ala 
    1205                 1210                 1215             


Glu Lys  Pro Asn Arg Gly Glu  Ala Glu Thr Ala Lys  Gly Gly Asn 
    1220                 1225                 1230             


Gly Asp  Val Glu Met Glu Asp  Ser Lys Ser Ser Glu  Ser Lys Glu 
    1235                 1240                 1245             


Asp Asn  Gly Lys Glu Ala Glu  Thr Lys Gly Val Glu  Ala Thr Thr 
    1250                 1255                 1260             


Gly Glu  Ala Pro Lys Gln Ala  Ile Val Ala Arg Glu  Gly Lys Phe 
    1265                 1270                 1275             


Ala Ala  Met Glu Lys Ile Lys  Lys Glu Met Lys Val  Glu Pro Glu 
    1280                 1285                 1290             


Pro Glu  Pro Ser Ser Ser Thr  Ser Asp Gln Ile Ala  Ser Lys Ser 
    1295                 1300                 1305             


Val Thr  Lys Asp Ser Lys Gly  Arg Leu Val Lys Val  Ile Asp Asp 
    1310                 1315                 1320             


Asp Asp  Glu Glu Met Asp Cys  Glu Phe Leu Asn Asn  Arg Gln Leu 
    1325                 1330                 1335             


Phe Leu  Asn Leu Cys Gln Gly  Asn His Tyr Gln Phe  Asp Gln Leu 
    1340                 1345                 1350             


Arg Arg  Ala Lys His Thr Ser  Met Met Val Leu Trp  His Leu His 
    1355                 1360                 1365             


Asn Arg  Asp Ala Pro Lys Phe  Val Gln Gln Cys Ala  Val Cys Ser 
    1370                 1375                 1380             


Arg Glu  Ile Leu Gln Gly Met  Arg Tyr His Cys Pro  Thr Cys Ala 
    1385                 1390                 1395             


Asp Phe  Asp Gln Cys Tyr Glu  Cys Met Ser Asn Pro  Asn Val Pro 
    1400                 1405                 1410             


Arg His  Gln His Pro Leu Lys  Pro Ile Pro Val Gly  Ser Gln Gln 
    1415                 1420                 1425             


Ser Ser  Leu Thr Pro Glu Gln  Arg Lys Glu Arg Gln  Arg Ser Ile 
    1430                 1435                 1440             


Gln Leu  His Met Thr Leu Leu  Leu His Ala Ala Thr  Cys Lys Ser 
    1445                 1450                 1455             


Ser Lys  Cys Ala Ser Ala Asn  Cys Ala Lys Met Lys  Gly Leu Leu 
    1460                 1465                 1470             


Lys His  Gly Ser Gln Cys Gln  Ile Lys Ala Ala Gly  Gly Cys His 
    1475                 1480                 1485             


Val Cys  Lys Arg Ile Trp Ala  Leu Leu Gln Ile His  Ala Arg Gln 
    1490                 1495                 1500             


Cys Lys  Gln Asp Asn Cys Pro  Val Pro Asn Cys Leu  Ala Ile Arg 
    1505                 1510                 1515             


Glu Arg  Phe Arg Gln Leu Asn  Leu Gln Gln Gln Ala  Met Asp Asp 
    1520                 1525                 1530             


Arg Arg  Arg Gln Met Met Asn  Gln Thr Tyr His Gln  Gln Ala Arg 
    1535                 1540                 1545             


<210>  17
<211>  5346
<212>  DNA
<213>  Cyclotella sp.


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:18

<400>  17
atgagtccct acgagggaac cacgacgacc caacagccgt cctcgtcgtc tggttcttcc       60

cgccctcctc cccagggcaa catgccccaa atgccaccta acatggccgg cctctccggt      120

gtggggggcc ctcaacacca tcccggaatg gggaacctct acggattcca acagcatcgg      180

ggaagcgcga gccagcagcc tcccatgaac atgatgatgg gcggggacaa cgtaggcaac      240

ggcagcatgg gggcaggcgg gtttatgcag cccccctcca acgtcagagg tggggggatg      300

caccccaacc atcccatgaa tatggggggg cagtttcaca acggccccgg gcacatgtcc      360

gggcagaatc ctatgtataa tcatggaaat tttaatggac agcagcagaa tcaaatgcag      420

catccctaca ataatcctca tggggggcaa cagtctcaac agcaacagcc gcagcatggg      480

ggatataatc accaccaagc tcagatgcag cataatatgc aacaacatct gcagcaacag      540

caacatggcg gaaatgtagg caacaatggg atggggcata gttcgcaata tcaaaatcac      600

cccatgcaac ggcaacattc aagtcatcag cataattatc attcgcaaca gcagctgcag      660

cagcagcagc agatgcaaca cgcccagcaa caaatgcaac accccaccca acaacaacaa      720

ggctaccctt ccaacaatta ccaccgtcaa ccttcgtcta ctcaccactc acattccaac      780

tctccctcca ccccgaatcc ccatgacgcc aatgatggag ggctactgtc ctacaaacaa      840

ccgcctaatt tttcggaagt catgggtttg gactttgagt tgggagagta tggtcaacag      900

tttttgccaa cgggattgaa tggggactgg cagagtgatc gggatatgac acataggaga      960

gagatgattc agcatattgt gaaacttctt aagcaaaagg acaagaatgc atctcctgag     1020

tggctaacca agttacccca aatggtaaaa caactcgaag tatcgctata ccgttctgcg     1080

ccgtcgttcg aatcctattc ggacatatcg acccttaaac atcgcttaca acaacttgct     1140

atggaaatcg cgagaaaaac ccagcaggcc aaagagagta gtggaaggtc atccaaatca     1200

cgctctgatc gcagagatcg tcttccctcc tcgtccgccg ccccccaagc tccatctatc     1260

tccacccaaa accatgtcat gggtgggatg cgcttccaca gcaccaacga acgaagaggg     1320

ggcgacgacg gagaagatca catcagttcc cagcacggca atccaaacga tcctgaatgg     1380

aagattcgaa tccgtcacaa gcagcagcgc cttctcctcc ttcaccactc ctccaaatgc     1440

ccgtacgatg atggcaaatg caaggtcacc ccgtattgct ccgagatgaa gaagttgtgg     1500

aagcacatgg ctcgttgcat cgacaatgaa tgtcgagtgc cgcactgctt ttcaagtcga     1560

tctatattaa gccattaccg aaaatgcaag gatccgcgtt gtccagcgtg cggcccagtc     1620

agggaaacgg ttcgcaagac actgaagagc agctcgcaaa gaaggcctaa tgggatgccg     1680

ggcgagggac ctcattctgg cggagggcat gattcgatga atatgggtat aggtttgggt     1740

ggattgggta atgtcgaatt ggggcagggc gatcagccaa tgaataatag tatgatccca     1800

atgtctcgtg ggaacagtgg tagtgatagg aatcaagtga tgttgcagga aatgcagcag     1860

cagggaatgc aggaggcgaa ttcgcccatg ccttggtcgg gagaagttgc aagcatgccg     1920

tacaatgctc cactatcggg aagtaatggg ggtggaagac tatcatctca aggcggtaat     1980

gatgctttcc ccgaatctct ggtaaatagc aacggagcac aaaatggggc ttcgggagcg     2040

caaagggaag gtcgcgatgc tgaatcgagc aagtcgaaac acaagcagca gcgtcttctc     2100

cttctcaggc atgcttccaa gtgtactgct ccctcgggaa gttgcactgt taccccacat     2160

tgcgccgaga tgaaagttct gtggaggcat attgcgaact gcaaagagcc tcagtgcaaa     2220

ataaaacact gtatgagcag ccgatacgtt ctcagtcact accgtagatg cagggatcct     2280

tattgtgaca tttgtgctcc agtgagggaa acgattaaga atggcaccgc tacctacatc     2340

catgacccaa catttaatcc gacgggggag aacgcgacgc ccccccctgc taacactgaa     2400

ggccctcaaa cgaagaagca gaagactaca cacgactcga gcacaatgga cagcatgccg     2460

cctcctcaag accgaccggc tattcctgca gcgggaattt catcgtcttc cgatccgcat     2520

tcggcatcta cctctccgca tgttcccact tcgggggagg aaaacagagc taagtcgaag     2580

ccagacccaa cgaaatcagc aggaaaggga gcgtcttcct ccagctcatc ggaagaccat     2640

tctctcttgg agtgcttcac aacacagcaa gttaagactc acatcgagtc gttaaaaaag     2700

actgtccaag ttccaccggc taaattgaag ctcaagtgct tggaggtgtt gcgtgggctt     2760

caaacgcacg agcatgggtg ggtttttgcc acccccgttg atcctgtcga acttggacta     2820

gcagattact ttgacatcat caaaaagcca atggatcttg gaactatcca gaagaagctc     2880

gaagcaggat cctaccattc gtttggagaa ttcaagtccg atgtacgtct tacgtttgag     2940

aacgccatga aatacaacga agaaaggaca gttgtgcatg aaatggctaa agagctcaag     3000

aagaagttcg acgttgacta caaaaagctc atcaagcaat tggaaaaaga gcatcaagag     3060

gactcaaaaa aggcacaggc ttgtgggctt tgcggttgcg agaagctcaa ttttgaaccc     3120

ccggtcttct tctgcaatgg cctcaactgt cctagtaaac gaatccgtcg caacacacac     3180

ttttacatca cctccgacaa gcaatatgct tggtgcaacc aatgctttaa tgagttgggc     3240

aatgaaattg accttggaac gtctaaactg aaaaaggtag agctgactaa acgcaagaac     3300

gacgaaaccc acgaagaaag ctgggttcaa tgtgacgact gcgagcgttg gattcaccaa     3360

atatgtggtc tttacaacac tcgccaggat aaagagaaca aaagcgccta ttcatgtcct     3420

ctctgtcttt acgaaaagag aaagaaggac ggagatccaa aagagttgcc gaaggctccg     3480

tctgccaatg atattcctag gacaaagctg tcggattggt tggaaaagga tgtcctgagg     3540

agagtgaatg atcgcctcaa cgagattgcg aaagagaaat ctgaaactga gaacacatca     3600

cttgagaaag cctacaaaga ggtcgcttct ggcgggccgt tgattattcg acaagttacg     3660

tctaccgaca ggaagctgga agtgcgagaa cgaatgaaag cccgatatgc ccacaaaaac     3720

tatccagagg aattccctta ccgttgtaaa tgcattgttg tcttccagaa tatagacggt     3780

gtcgacgttc tgctctttgc tctgtatgta tatgagcatg gcgatgataa tcctttccct     3840

aacaaaaaga ctgtctacgt gtcttatctt gacagtgttc acttcatgaa gccaaggaaa     3900

gttaggacct tcatttatca tgaaattttg atatcctatc tggactatgt gaggagaaag     3960

ggttatcacc aagccttcat ctgggcatgt cccccgctaa aaggcgatga ctacatcttc     4020

tacgcaaagc cagaagatca aaaaactccc aaagatgtta ggcttcgcca gtggtacctt     4080

gacatgcttg cagaatgcca gagacgggat atcgtgggca aagtctccaa tatgtacgac     4140

caatattttg ccaacaagaa gttggacgct gcatccgtgc cttactttga gggcgattat     4200

ttccctggag aagctgagaa cattatcaaa cttcttgaag aaggggatgg caaacgaaag     4260

agtgcatcag ggaagaaaaa gaaggactca tctaagagcc aacgctcaag cagtagtgga     4320

tgcgaagaag gggatagtga tgacaaggca tataaggagg gcggccgaga tcctgtcatg     4380

cagaagtttt gcgacgccat ccagggaatg aaagaaagtt tcatcgttgc attcctcaat     4440

tgtgagggtg ccgacccaga aaatctagtc gttcccaaag agatcatgga atatcgcgaa     4500

gcgaagttga agtctattga aggtgacaac caacgtgatg ataattgtca gaccagaaag     4560

cgtgatgcgg acgggaacga acttcagaaa tcggaagacc aggtggacag gaaaggacgt     4620

cccattaaag tgttggacga tgatgcagaa gaaattgact gtgaattttt taacactcgc     4680

caatgcttcc tggatctctg tcgagggaac cactaccaat tcgatgagct aaggcgagca     4740

aagcacacat ccatgatggt tctttggcat cttcaaaatc gcgaagcacc caaatttgtg     4800

caacaatgct tttcgtgcaa tcgtgagatt gtctcaggta ttcgacacca ttgcaatgtt     4860

tgctctgact tcgatctatg tgacgagtgt ttccgaagtc cagatgccaa caggggaagt     4920

tgtaaccaca aacttgaagt gatcaaagtt gatacatcgc agagtggatc cagtggactt     4980

acggaggagc aacgtagaga acgtcagaga aacattcaac tccatataac acttattgaa     5040

catgcatctc gttgtgtttc gtcaacgtgc aagtcctcga attgtcagaa aatgaagtct     5100

tacctcaagc acggagagac ttgcaaaatt aaagcatctg gtggatgcaa gatttgcaaa     5160

cgaatttgga ctttgctgcg aattcatgct caacaatgca agaacaataa ttgtcccata     5220

cctcagtgta ttgcaattaa gaagcgtctg cgtcagctac agcaaaagca acaagctatg     5280

gatgaccgca gacggcagga aatgaatagg cattatagga tgggtatgat gggtgataac     5340

aattga                                                                5346


<210>  18
<211>  1781
<212>  PRT
<213>  Cyclotella sp.


<220>
<221>  misc_feature
<223>  translation product 5092336

<400>  18

Met Ser Pro Tyr Glu Gly Thr Thr Thr Thr Gln Gln Pro Ser Ser Ser 
1               5                   10                  15      


Ser Gly Ser Ser Arg Pro Pro Pro Gln Gly Asn Met Pro Gln Met Pro 
            20                  25                  30          


Pro Asn Met Ala Gly Leu Ser Gly Val Gly Gly Pro Gln His His Pro 
        35                  40                  45              


Gly Met Gly Asn Leu Tyr Gly Phe Gln Gln His Arg Gly Ser Ala Ser 
    50                  55                  60                  


Gln Gln Pro Pro Met Asn Met Met Met Gly Gly Asp Asn Val Gly Asn 
65                  70                  75                  80  


Gly Ser Met Gly Ala Gly Gly Phe Met Gln Pro Pro Ser Asn Val Arg 
                85                  90                  95      


Gly Gly Gly Met His Pro Asn His Pro Met Asn Met Gly Gly Gln Phe 
            100                 105                 110         


His Asn Gly Pro Gly His Met Ser Gly Gln Asn Pro Met Tyr Asn His 
        115                 120                 125             


Gly Asn Phe Asn Gly Gln Gln Gln Asn Gln Met Gln His Pro Tyr Asn 
    130                 135                 140                 


Asn Pro His Gly Gly Gln Gln Ser Gln Gln Gln Gln Pro Gln His Gly 
145                 150                 155                 160 


Gly Tyr Asn His His Gln Ala Gln Met Gln His Asn Met Gln Gln His 
                165                 170                 175     


Leu Gln Gln Gln Gln His Gly Gly Asn Val Gly Asn Asn Gly Met Gly 
            180                 185                 190         


His Ser Ser Gln Tyr Gln Asn His Pro Met Gln Arg Gln His Ser Ser 
        195                 200                 205             


His Gln His Asn Tyr His Ser Gln Gln Gln Leu Gln Gln Gln Gln Gln 
    210                 215                 220                 


Met Gln His Ala Gln Gln Gln Met Gln His Pro Thr Gln Gln Gln Gln 
225                 230                 235                 240 


Gly Tyr Pro Ser Asn Asn Tyr His Arg Gln Pro Ser Ser Thr His His 
                245                 250                 255     


Ser His Ser Asn Ser Pro Ser Thr Pro Asn Pro His Asp Ala Asn Asp 
            260                 265                 270         


Gly Gly Leu Leu Ser Tyr Lys Gln Pro Pro Asn Phe Ser Glu Val Met 
        275                 280                 285             


Gly Leu Asp Phe Glu Leu Gly Glu Tyr Gly Gln Gln Phe Leu Pro Thr 
    290                 295                 300                 


Gly Leu Asn Gly Asp Trp Gln Ser Asp Arg Asp Met Thr His Arg Arg 
305                 310                 315                 320 


Glu Met Ile Gln His Ile Val Lys Leu Leu Lys Gln Lys Asp Lys Asn 
                325                 330                 335     


Ala Ser Pro Glu Trp Leu Thr Lys Leu Pro Gln Met Val Lys Gln Leu 
            340                 345                 350         


Glu Val Ser Leu Tyr Arg Ser Ala Pro Ser Phe Glu Ser Tyr Ser Asp 
        355                 360                 365             


Ile Ser Thr Leu Lys His Arg Leu Gln Gln Leu Ala Met Glu Ile Ala 
    370                 375                 380                 


Arg Lys Thr Gln Gln Ala Lys Glu Ser Ser Gly Arg Ser Ser Lys Ser 
385                 390                 395                 400 


Arg Ser Asp Arg Arg Asp Arg Leu Pro Ser Ser Ser Ala Ala Pro Gln 
                405                 410                 415     


Ala Pro Ser Ile Ser Thr Gln Asn His Val Met Gly Gly Met Arg Phe 
            420                 425                 430         


His Ser Thr Asn Glu Arg Arg Gly Gly Asp Asp Gly Glu Asp His Ile 
        435                 440                 445             


Ser Ser Gln His Gly Asn Pro Asn Asp Pro Glu Trp Lys Ile Arg Ile 
    450                 455                 460                 


Arg His Lys Gln Gln Arg Leu Leu Leu Leu His His Ser Ser Lys Cys 
465                 470                 475                 480 


Pro Tyr Asp Asp Gly Lys Cys Lys Val Thr Pro Tyr Cys Ser Glu Met 
                485                 490                 495     


Lys Lys Leu Trp Lys His Met Ala Arg Cys Ile Asp Asn Glu Cys Arg 
            500                 505                 510         


Val Pro His Cys Phe Ser Ser Arg Ser Ile Leu Ser His Tyr Arg Lys 
        515                 520                 525             


Cys Lys Asp Pro Arg Cys Pro Ala Cys Gly Pro Val Arg Glu Thr Val 
    530                 535                 540                 


Arg Lys Thr Leu Lys Ser Ser Ser Gln Arg Arg Pro Asn Gly Met Pro 
545                 550                 555                 560 


Gly Glu Gly Pro His Ser Gly Gly Gly His Asp Ser Met Asn Met Gly 
                565                 570                 575     


Ile Gly Leu Gly Gly Leu Gly Asn Val Glu Leu Gly Gln Gly Asp Gln 
            580                 585                 590         


Pro Met Asn Asn Ser Met Ile Pro Met Ser Arg Gly Asn Ser Gly Ser 
        595                 600                 605             


Asp Arg Asn Gln Val Met Leu Gln Glu Met Gln Gln Gln Gly Met Gln 
    610                 615                 620                 


Glu Ala Asn Ser Pro Met Pro Trp Ser Gly Glu Val Ala Ser Met Pro 
625                 630                 635                 640 


Tyr Asn Ala Pro Leu Ser Gly Ser Asn Gly Gly Gly Arg Leu Ser Ser 
                645                 650                 655     


Gln Gly Gly Asn Asp Ala Phe Pro Glu Ser Leu Val Asn Ser Asn Gly 
            660                 665                 670         


Ala Gln Asn Gly Ala Ser Gly Ala Gln Arg Glu Gly Arg Asp Ala Glu 
        675                 680                 685             


Ser Ser Lys Ser Lys His Lys Gln Gln Arg Leu Leu Leu Leu Arg His 
    690                 695                 700                 


Ala Ser Lys Cys Thr Ala Pro Ser Gly Ser Cys Thr Val Thr Pro His 
705                 710                 715                 720 


Cys Ala Glu Met Lys Val Leu Trp Arg His Ile Ala Asn Cys Lys Glu 
                725                 730                 735     


Pro Gln Cys Lys Ile Lys His Cys Met Ser Ser Arg Tyr Val Leu Ser 
            740                 745                 750         


His Tyr Arg Arg Cys Arg Asp Pro Tyr Cys Asp Ile Cys Ala Pro Val 
        755                 760                 765             


Arg Glu Thr Ile Lys Asn Gly Thr Ala Thr Tyr Ile His Asp Pro Thr 
    770                 775                 780                 


Phe Asn Pro Thr Gly Glu Asn Ala Thr Pro Pro Pro Ala Asn Thr Glu 
785                 790                 795                 800 


Gly Pro Gln Thr Lys Lys Gln Lys Thr Thr His Asp Ser Ser Thr Met 
                805                 810                 815     


Asp Ser Met Pro Pro Pro Gln Asp Arg Pro Ala Ile Pro Ala Ala Gly 
            820                 825                 830         


Ile Ser Ser Ser Ser Asp Pro His Ser Ala Ser Thr Ser Pro His Val 
        835                 840                 845             


Pro Thr Ser Gly Glu Glu Asn Arg Ala Lys Ser Lys Pro Asp Pro Thr 
    850                 855                 860                 


Lys Ser Ala Gly Lys Gly Ala Ser Ser Ser Ser Ser Ser Glu Asp His 
865                 870                 875                 880 


Ser Leu Leu Glu Cys Phe Thr Thr Gln Gln Val Lys Thr His Ile Glu 
                885                 890                 895     


Ser Leu Lys Lys Thr Val Gln Val Pro Pro Ala Lys Leu Lys Leu Lys 
            900                 905                 910         


Cys Leu Glu Val Leu Arg Gly Leu Gln Thr His Glu His Gly Trp Val 
        915                 920                 925             


Phe Ala Thr Pro Val Asp Pro Val Glu Leu Gly Leu Ala Asp Tyr Phe 
    930                 935                 940                 


Asp Ile Ile Lys Lys Pro Met Asp Leu Gly Thr Ile Gln Lys Lys Leu 
945                 950                 955                 960 


Glu Ala Gly Ser Tyr His Ser Phe Gly Glu Phe Lys Ser Asp Val Arg 
                965                 970                 975     


Leu Thr Phe Glu Asn Ala Met Lys Tyr Asn Glu Glu Arg Thr Val Val 
            980                 985                 990         


His Glu Met Ala Lys Glu Leu Lys  Lys Lys Phe Asp Val  Asp Tyr Lys 
        995                 1000                 1005             


Lys Leu  Ile Lys Gln Leu Glu  Lys Glu His Gln Glu  Asp Ser Lys 
    1010                 1015                 1020             


Lys Ala  Gln Ala Cys Gly Leu  Cys Gly Cys Glu Lys  Leu Asn Phe 
    1025                 1030                 1035             


Glu Pro  Pro Val Phe Phe Cys  Asn Gly Leu Asn Cys  Pro Ser Lys 
    1040                 1045                 1050             


Arg Ile  Arg Arg Asn Thr His  Phe Tyr Ile Thr Ser  Asp Lys Gln 
    1055                 1060                 1065             


Tyr Ala  Trp Cys Asn Gln Cys  Phe Asn Glu Leu Gly  Asn Glu Ile 
    1070                 1075                 1080             


Asp Leu  Gly Thr Ser Lys Leu  Lys Lys Val Glu Leu  Thr Lys Arg 
    1085                 1090                 1095             


Lys Asn  Asp Glu Thr His Glu  Glu Ser Trp Val Gln  Cys Asp Asp 
    1100                 1105                 1110             


Cys Glu  Arg Trp Ile His Gln  Ile Cys Gly Leu Tyr  Asn Thr Arg 
    1115                 1120                 1125             


Gln Asp  Lys Glu Asn Lys Ser  Ala Tyr Ser Cys Pro  Leu Cys Leu 
    1130                 1135                 1140             


Tyr Glu  Lys Arg Lys Lys Asp  Gly Asp Pro Lys Glu  Leu Pro Lys 
    1145                 1150                 1155             


Ala Pro  Ser Ala Asn Asp Ile  Pro Arg Thr Lys Leu  Ser Asp Trp 
    1160                 1165                 1170             


Leu Glu  Lys Asp Val Leu Arg  Arg Val Asn Asp Arg  Leu Asn Glu 
    1175                 1180                 1185             


Ile Ala  Lys Glu Lys Ser Glu  Thr Glu Asn Thr Ser  Leu Glu Lys 
    1190                 1195                 1200             


Ala Tyr  Lys Glu Val Ala Ser  Gly Gly Pro Leu Ile  Ile Arg Gln 
    1205                 1210                 1215             


Val Thr  Ser Thr Asp Arg Lys  Leu Glu Val Arg Glu  Arg Met Lys 
    1220                 1225                 1230             


Ala Arg  Tyr Ala His Lys Asn  Tyr Pro Glu Glu Phe  Pro Tyr Arg 
    1235                 1240                 1245             


Cys Lys  Cys Ile Val Val Phe  Gln Asn Ile Asp Gly  Val Asp Val 
    1250                 1255                 1260             


Leu Leu  Phe Ala Leu Tyr Val  Tyr Glu His Gly Asp  Asp Asn Pro 
    1265                 1270                 1275             


Phe Pro  Asn Lys Lys Thr Val  Tyr Val Ser Tyr Leu  Asp Ser Val 
    1280                 1285                 1290             


His Phe  Met Lys Pro Arg Lys  Val Arg Thr Phe Ile  Tyr His Glu 
    1295                 1300                 1305             


Ile Leu  Ile Ser Tyr Leu Asp  Tyr Val Arg Arg Lys  Gly Tyr His 
    1310                 1315                 1320             


Gln Ala  Phe Ile Trp Ala Cys  Pro Pro Leu Lys Gly  Asp Asp Tyr 
    1325                 1330                 1335             


Ile Phe  Tyr Ala Lys Pro Glu  Asp Gln Lys Thr Pro  Lys Asp Val 
    1340                 1345                 1350             


Arg Leu  Arg Gln Trp Tyr Leu  Asp Met Leu Ala Glu  Cys Gln Arg 
    1355                 1360                 1365             


Arg Asp  Ile Val Gly Lys Val  Ser Asn Met Tyr Asp  Gln Tyr Phe 
    1370                 1375                 1380             


Ala Asn  Lys Lys Leu Asp Ala  Ala Ser Val Pro Tyr  Phe Glu Gly 
    1385                 1390                 1395             


Asp Tyr  Phe Pro Gly Glu Ala  Glu Asn Ile Ile Lys  Leu Leu Glu 
    1400                 1405                 1410             


Glu Gly  Asp Gly Lys Arg Lys  Ser Ala Ser Gly Lys  Lys Lys Lys 
    1415                 1420                 1425             


Asp Ser  Ser Lys Ser Gln Arg  Ser Ser Ser Ser Gly  Cys Glu Glu 
    1430                 1435                 1440             


Gly Asp  Ser Asp Asp Lys Ala  Tyr Lys Glu Gly Gly  Arg Asp Pro 
    1445                 1450                 1455             


Val Met  Gln Lys Phe Cys Asp  Ala Ile Gln Gly Met  Lys Glu Ser 
    1460                 1465                 1470             


Phe Ile  Val Ala Phe Leu Asn  Cys Glu Gly Ala Asp  Pro Glu Asn 
    1475                 1480                 1485             


Leu Val  Val Pro Lys Glu Ile  Met Glu Tyr Arg Glu  Ala Lys Leu 
    1490                 1495                 1500             


Lys Ser  Ile Glu Gly Asp Asn  Gln Arg Asp Asp Asn  Cys Gln Thr 
    1505                 1510                 1515             


Arg Lys  Arg Asp Ala Asp Gly  Asn Glu Leu Gln Lys  Ser Glu Asp 
    1520                 1525                 1530             


Gln Val  Asp Arg Lys Gly Arg  Pro Ile Lys Val Leu  Asp Asp Asp 
    1535                 1540                 1545             


Ala Glu  Glu Ile Asp Cys Glu  Phe Phe Asn Thr Arg  Gln Cys Phe 
    1550                 1555                 1560             


Leu Asp  Leu Cys Arg Gly Asn  His Tyr Gln Phe Asp  Glu Leu Arg 
    1565                 1570                 1575             


Arg Ala  Lys His Thr Ser Met  Met Val Leu Trp His  Leu Gln Asn 
    1580                 1585                 1590             


Arg Glu  Ala Pro Lys Phe Val  Gln Gln Cys Phe Ser  Cys Asn Arg 
    1595                 1600                 1605             


Glu Ile  Val Ser Gly Ile Arg  His His Cys Asn Val  Cys Ser Asp 
    1610                 1615                 1620             


Phe Asp  Leu Cys Asp Glu Cys  Phe Arg Ser Pro Asp  Ala Asn Arg 
    1625                 1630                 1635             


Gly Ser  Cys Asn His Lys Leu  Glu Val Ile Lys Val  Asp Thr Ser 
    1640                 1645                 1650             


Gln Ser  Gly Ser Ser Gly Leu  Thr Glu Glu Gln Arg  Arg Glu Arg 
    1655                 1660                 1665             


Gln Arg  Asn Ile Gln Leu His  Ile Thr Leu Ile Glu  His Ala Ser 
    1670                 1675                 1680             


Arg Cys  Val Ser Ser Thr Cys  Lys Ser Ser Asn Cys  Gln Lys Met 
    1685                 1690                 1695             


Lys Ser  Tyr Leu Lys His Gly  Glu Thr Cys Lys Ile  Lys Ala Ser 
    1700                 1705                 1710             


Gly Gly  Cys Lys Ile Cys Lys  Arg Ile Trp Thr Leu  Leu Arg Ile 
    1715                 1720                 1725             


His Ala  Gln Gln Cys Lys Asn  Asn Asn Cys Pro Ile  Pro Gln Cys 
    1730                 1735                 1740             


Ile Ala  Ile Lys Lys Arg Leu  Arg Gln Leu Gln Gln  Lys Gln Gln 
    1745                 1750                 1755             


Ala Met  Asp Asp Arg Arg Arg  Gln Glu Met Asn Arg  His Tyr Arg 
    1760                 1765                 1770             


Met Gly  Met Met Gly Asp Asn  Asn 
    1775                 1780     


<210>  19
<211>  4290
<212>  DNA
<213>  Fragilariopsis cylindrus


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:20

<400>  19
atgggtagtc attccatgag caattcgatg gggaactcga tgaatggaat ggggaataca       60

atgaacaaca acaattctat gaacggtacg aacaccatga actcgtcgat gaataactct      120

atgagtaata atactatgaa tgctcctatg ggaggcaact cgatgaataa catgggtgga      180

aactcgacga acggaccaac taacaatggt gcgagttctt ctcgaggcaa taatgtgatg      240

aatccaagcg gtcgcaatag cgttagcaac agcgctagtg gtagtgttaa tggcagcgct      300

agtggtaatg gtagtggtag tggtagtggt acttctggat tgaatggaaa ctggcaaaca      360

gatagagata caccccatag acgagaaatg attcagcaca tcgtaaaaat gctaaaaaaa      420

gataagactg gttccccgga atggcttagc aagctgccac aaatggctaa gcagctagag      480

gtatctcttt atcgaaacgc acgatctttt gacgcatatg tcgacatgaa tacactgaaa      540

cagcgcttac agcagattgc agtacaggta tctcagaaag cacgaggtca agaccatgga      600

cggcgtgatc ggcacagaga ttcacaacaa aattctaatg gaatacgtca agacgggagt      660

tcttcatcat acacaggcaa caacccgtcg aatcggaccg ataggaacag cacaataaat      720

aataacaacc cctctagtgg catgtcgaat gtatctacat tgccaatatc ttcaggagga      780

tatcaacaac gatcgatgag caacaccgcc tcatcaaatg ctggcacgca acaacagcag      840

cagcagcaat caagtatgcc acccccttca acaaatggtg gttcggctaa tggtctgacg      900

ggatctgatt ttacgtcacc tgcactatcc cctacaggtg gaagtcaaaa tcccaacaac      960

acctcattac catcgtcatc atctagaagg aacgactccg aatggcaaaa ggttcgtcac     1020

aaacagcaac gacttttgtt gctgaggcac gcctctcgtt gtcagcataa gggaacgaaa     1080

tgccctgtta cccctcattg tgcaagtatg aaaaaacttt gggaacacat tgctcactgt     1140

aaggatcaac attgtagtgt tgcgcattgt atgagcagtc gatacgtcct tagtcactat     1200

agaagatgca aggacccacg ctgtccagca tgtgggcctg tccgcgaaac tattcgaaaa     1260

agtcacgagc gagagcaaca gcaaggcaat cgccagccaa cgtcatctag ttcgactccc     1320

tttgataccg aagtacccgg accaagtagc tctcctgatg ctttgccagc cacgaaacgt     1380

cccagaatag atccaaatgc tagtaatatg cccccaccaa atcctacaga cggacaacct     1440

aatcaaccgc tttctgcccc ctctgatgtt atagcaccac cgacaaattc caacgaaaag     1500

gtttcgaaac caccttcccc tacaccttct tcttcagcga ataaaggctc cgaagatcga     1560

tcgttgctgg atagtttcac tcttgatcag attgcattgc atcttgcatc tttgaatcga     1620

gcggccgacc taccccccgc gaaactaaaa caaaaatgcc tagaagtcct gaaggggttg     1680

caagctcacc aacatggatg ggtatttaac gtaccagttg atccagtaga actaggttta     1740

cctgactatt ttgaacttat caaaaagccg atggatcttg gaagtgtcca aaaaaaactt     1800

gaaaaaggcg aatatcacgc catcaaggat ttccaatcag acgtgaattt aagctttgag     1860

aacgccatga catacaacga acaaggttca gtggtttacg acatggccaa ggaactgaag     1920

actaagtttg agggcgattt caagaaatta gaacaacagc tggaatctga agatcgcgag     1980

agacgagaaa atgatagagc ttgtgtcctt tgcggatgcg agaaacgtct attcgaaccc     2040

ccagtattct tctgcaacgg tataaattgc gcgagtaaac gaattcgacg taatagtcac     2100

ttttacatcg gcggaaacaa ccaatatttt tggtgtaacc agtgctacgg tgagcttgag     2160

gagaaatcac caatcgagtt gatcgacctg actgttaaaa agactgattt gaagaagaaa     2220

aagaatgacg aaattgtcga agagagttgg gtgcagtgcg atatttgcga aagatggatt     2280

catcaaattt gcgggctttt caacacaaga cagaacaaag agcatcacag cgaatattgc     2340

tgccctttat gtctgttaga aaaacgtaaa aagaaccctg taacaccgcc accgcgaccg     2400

gcgggggcaa cagaattacc aaggacgaaa ttatctgaat ttatagagaa tcacgttagg     2460

aaaaaaatag aaaagagacg acgtaatgtg gcggaagaaa agtgtcgtat tgagaatatt     2520

tcaatggatg atgcgctgaa agacgctcgg gaaggaggta atgtgatcat acgtcaggtc     2580

acgtcaatgg ataggaaatt ggaagtgcgg gaaggaataa agaatagata cgctcataag     2640

aattatcctg atgagttttc gtttcgatgt aaatgtcttc tagtattcca ggaaattgat     2700

ggtgttgatg tggttctgtt tgctctttac ttatacgagc acggcgagga tagcccacgt     2760

ccaaataccc gttctatata tatatcttat ctcgacagtg tgcattatat gaggcctcga     2820

aaacttcgaa cctttgttta ccatgaaatt ctgatttctt actgtgatta tgctcgacag     2880

agggggtttg caacagttca tatctgggca tgcccaccgt taaagggtga tgactacatt     2940

ttctatgcaa agcctgaaga tcagaagact ccgagggatt caaggttgag acaatggtac     3000

attgatatgc ttgttgaatg ccagaatcgt ggcatagtcg gaagactgac caatatgtat     3060

gatttgtact ttgcgaacgc atctatcgat gctacggctg tgccttacca cgaaggcgat     3120

tacttccctg gagaagccga gaatatcatc aagatgcttg atgacgaggg aggaaaaaag     3180

aatggaagca gtgggaagaa gaagaaacag aagaaccaaa gcaagtcaaa gaaccgaggc     3240

gggactcgat cgacaggagt tgacgaggaa gcattacttg caagtggcat gatggatgga     3300

gtgaagaatt atgaagagtt ggatcgagat caagtaatgg tcaagttagg cgaagctatt     3360

caaccgatga aggagagttt tcttgttgct tttctgactt ggtctggtat caaggaagaa     3420

gacttagagg tgcctgaagc tatagcgaag tatcgagaag agcaccccga aaatgttgta     3480

cctttaccat cgggtaataa gcgtaatgcc gacggtcaaa cgaaagatga agttgtaccg     3540

ctggacgcag atggacaccc actaaaagtc ctagacgatg acgcggaaga tcttgactgc     3600

gagttcttaa ataatcgcca agctttcctt aacttatgtc gtggaaacca ctatcaattt     3660

gacgagttac gtcgtgctaa acacacatca atgatggtat tgtggcatct tcagaaccgt     3720

gacgctccta aatatgtaca gcagtgtgtt tcttgtagcc gcgaaattct tagtggaaag     3780

cgctatcatt gtaattcctg tccagactat gatctttgcg agacatgtta caaagacccg     3840

aagaccaatc gtggcacgtg tacacacaag cttcaagaaa ttaaggttga atccgaaggg     3900

caatctgatt caagtggatt gacagagact cagagaaagc agcgtcaacg caacttgatg     3960

ctgcacatcc aattaatcga acatgcttca agatgcacgt cgtcaacttg tcaatcaaag     4020

aactgtgcga agatgaaaga gtatctacag cacgcacgca cttgcaagac aaaagttgta     4080

ggtggatgca gaatatgcaa acgaatttgg acccttcttc ggatccatgc gcagaaatgt     4140

aaggaaccgg tttgtcctat cccgcaatgt atgattatta gagaaaagat gcgtgaacta     4200

cagaagcaac agcaagctat ggatgatcga cgtcgtcaag aaatgaatcg acattacggc     4260

cgaatgagca tgaccagcgg atcaggttaa                                      4290


<210>  20
<211>  1429
<212>  PRT
<213>  Fragilariopsis cylindrus

<400>  20

Met Gly Ser His Ser Met Ser Asn Ser Met Gly Asn Ser Met Asn Gly 
1               5                   10                  15      


Met Gly Asn Thr Met Asn Asn Asn Asn Ser Met Asn Gly Thr Asn Thr 
            20                  25                  30          


Met Asn Ser Ser Met Asn Asn Ser Met Ser Asn Asn Thr Met Asn Ala 
        35                  40                  45              


Pro Met Gly Gly Asn Ser Met Asn Asn Met Gly Gly Asn Ser Thr Asn 
    50                  55                  60                  


Gly Pro Thr Asn Asn Gly Ala Ser Ser Ser Arg Gly Asn Asn Val Met 
65                  70                  75                  80  


Asn Pro Ser Gly Arg Asn Ser Val Ser Asn Ser Ala Ser Gly Ser Val 
                85                  90                  95      


Asn Gly Ser Ala Ser Gly Asn Gly Ser Gly Ser Gly Ser Gly Thr Ser 
            100                 105                 110         


Gly Leu Asn Gly Asn Trp Gln Thr Asp Arg Asp Thr Pro His Arg Arg 
        115                 120                 125             


Glu Met Ile Gln His Ile Val Lys Met Leu Lys Lys Asp Lys Thr Gly 
    130                 135                 140                 


Ser Pro Glu Trp Leu Ser Lys Leu Pro Gln Met Ala Lys Gln Leu Glu 
145                 150                 155                 160 


Val Ser Leu Tyr Arg Asn Ala Arg Ser Phe Asp Ala Tyr Val Asp Met 
                165                 170                 175     


Asn Thr Leu Lys Gln Arg Leu Gln Gln Ile Ala Val Gln Val Ser Gln 
            180                 185                 190         


Lys Ala Arg Gly Gln Asp His Gly Arg Arg Asp Arg His Arg Asp Ser 
        195                 200                 205             


Gln Gln Asn Ser Asn Gly Ile Arg Gln Asp Gly Ser Ser Ser Ser Tyr 
    210                 215                 220                 


Thr Gly Asn Asn Pro Ser Asn Arg Thr Asp Arg Asn Ser Thr Ile Asn 
225                 230                 235                 240 


Asn Asn Asn Pro Ser Ser Gly Met Ser Asn Val Ser Thr Leu Pro Ile 
                245                 250                 255     


Ser Ser Gly Gly Tyr Gln Gln Arg Ser Met Ser Asn Thr Ala Ser Ser 
            260                 265                 270         


Asn Ala Gly Thr Gln Gln Gln Gln Gln Gln Gln Ser Ser Met Pro Pro 
        275                 280                 285             


Pro Ser Thr Asn Gly Gly Ser Ala Asn Gly Leu Thr Gly Ser Asp Phe 
    290                 295                 300                 


Thr Ser Pro Ala Leu Ser Pro Thr Gly Gly Ser Gln Asn Pro Asn Asn 
305                 310                 315                 320 


Thr Ser Leu Pro Ser Ser Ser Ser Arg Arg Asn Asp Ser Glu Trp Gln 
                325                 330                 335     


Lys Val Arg His Lys Gln Gln Arg Leu Leu Leu Leu Arg His Ala Ser 
            340                 345                 350         


Arg Cys Gln His Lys Gly Thr Lys Cys Pro Val Thr Pro His Cys Ala 
        355                 360                 365             


Ser Met Lys Lys Leu Trp Glu His Ile Ala His Cys Lys Asp Gln His 
    370                 375                 380                 


Cys Ser Val Ala His Cys Met Ser Ser Arg Tyr Val Leu Ser His Tyr 
385                 390                 395                 400 


Arg Arg Cys Lys Asp Pro Arg Cys Pro Ala Cys Gly Pro Val Arg Glu 
                405                 410                 415     


Thr Ile Arg Lys Ser His Glu Arg Glu Gln Gln Gln Gly Asn Arg Gln 
            420                 425                 430         


Pro Thr Ser Ser Ser Ser Thr Pro Phe Asp Thr Glu Val Pro Gly Pro 
        435                 440                 445             


Ser Ser Ser Pro Asp Ala Leu Pro Ala Thr Lys Arg Pro Arg Ile Asp 
    450                 455                 460                 


Pro Asn Ala Ser Asn Met Pro Pro Pro Asn Pro Thr Asp Gly Gln Pro 
465                 470                 475                 480 


Asn Gln Pro Leu Ser Ala Pro Ser Asp Val Ile Ala Pro Pro Thr Asn 
                485                 490                 495     


Ser Asn Glu Lys Val Ser Lys Pro Pro Ser Pro Thr Pro Ser Ser Ser 
            500                 505                 510         


Ala Asn Lys Gly Ser Glu Asp Arg Ser Leu Leu Asp Ser Phe Thr Leu 
        515                 520                 525             


Asp Gln Ile Ala Leu His Leu Ala Ser Leu Asn Arg Ala Ala Asp Leu 
    530                 535                 540                 


Pro Pro Ala Lys Leu Lys Gln Lys Cys Leu Glu Val Leu Lys Gly Leu 
545                 550                 555                 560 


Gln Ala His Gln His Gly Trp Val Phe Asn Val Pro Val Asp Pro Val 
                565                 570                 575     


Glu Leu Gly Leu Pro Asp Tyr Phe Glu Leu Ile Lys Lys Pro Met Asp 
            580                 585                 590         


Leu Gly Ser Val Gln Lys Lys Leu Glu Lys Gly Glu Tyr His Ala Ile 
        595                 600                 605             


Lys Asp Phe Gln Ser Asp Val Asn Leu Ser Phe Glu Asn Ala Met Thr 
    610                 615                 620                 


Tyr Asn Glu Gln Gly Ser Val Val Tyr Asp Met Ala Lys Glu Leu Lys 
625                 630                 635                 640 


Thr Lys Phe Glu Gly Asp Phe Lys Lys Leu Glu Gln Gln Leu Glu Ser 
                645                 650                 655     


Glu Asp Arg Glu Arg Arg Glu Asn Asp Arg Ala Cys Val Leu Cys Gly 
            660                 665                 670         


Cys Glu Lys Arg Leu Phe Glu Pro Pro Val Phe Phe Cys Asn Gly Ile 
        675                 680                 685             


Asn Cys Ala Ser Lys Arg Ile Arg Arg Asn Ser His Phe Tyr Ile Gly 
    690                 695                 700                 


Gly Asn Asn Gln Tyr Phe Trp Cys Asn Gln Cys Tyr Gly Glu Leu Glu 
705                 710                 715                 720 


Glu Lys Ser Pro Ile Glu Leu Ile Asp Leu Thr Val Lys Lys Thr Asp 
                725                 730                 735     


Leu Lys Lys Lys Lys Asn Asp Glu Ile Val Glu Glu Ser Trp Val Gln 
            740                 745                 750         


Cys Asp Ile Cys Glu Arg Trp Ile His Gln Ile Cys Gly Leu Phe Asn 
        755                 760                 765             


Thr Arg Gln Asn Lys Glu His His Ser Glu Tyr Cys Cys Pro Leu Cys 
    770                 775                 780                 


Leu Leu Glu Lys Arg Lys Lys Asn Pro Val Thr Pro Pro Pro Arg Pro 
785                 790                 795                 800 


Ala Gly Ala Thr Glu Leu Pro Arg Thr Lys Leu Ser Glu Phe Ile Glu 
                805                 810                 815     


Asn His Val Arg Lys Lys Ile Glu Lys Arg Arg Arg Asn Val Ala Glu 
            820                 825                 830         


Glu Lys Cys Arg Ile Glu Asn Ile Ser Met Asp Asp Ala Leu Lys Asp 
        835                 840                 845             


Ala Arg Glu Gly Gly Asn Val Ile Ile Arg Gln Val Thr Ser Met Asp 
    850                 855                 860                 


Arg Lys Leu Glu Val Arg Glu Gly Ile Lys Asn Arg Tyr Ala His Lys 
865                 870                 875                 880 


Asn Tyr Pro Asp Glu Phe Ser Phe Arg Cys Lys Cys Leu Leu Val Phe 
                885                 890                 895     


Gln Glu Ile Asp Gly Val Asp Val Val Leu Phe Ala Leu Tyr Leu Tyr 
            900                 905                 910         


Glu His Gly Glu Asp Ser Pro Arg Pro Asn Thr Arg Ser Ile Tyr Ile 
        915                 920                 925             


Ser Tyr Leu Asp Ser Val His Tyr Met Arg Pro Arg Lys Leu Arg Thr 
    930                 935                 940                 


Phe Val Tyr His Glu Ile Leu Ile Ser Tyr Cys Asp Tyr Ala Arg Gln 
945                 950                 955                 960 


Arg Gly Phe Ala Thr Val His Ile Trp Ala Cys Pro Pro Leu Lys Gly 
                965                 970                 975     


Asp Asp Tyr Ile Phe Tyr Ala Lys Pro Glu Asp Gln Lys Thr Pro Arg 
            980                 985                 990         


Asp Ser Arg Leu Arg Gln Trp Tyr  Ile Asp Met Leu Val  Glu Cys Gln 
        995                 1000                 1005             


Asn Arg  Gly Ile Val Gly Arg  Leu Thr Asn Met Tyr  Asp Leu Tyr 
    1010                 1015                 1020             


Phe Ala  Asn Ala Ser Ile Asp  Ala Thr Ala Val Pro  Tyr His Glu 
    1025                 1030                 1035             


Gly Asp  Tyr Phe Pro Gly Glu  Ala Glu Asn Ile Ile  Lys Met Leu 
    1040                 1045                 1050             


Asp Asp  Glu Gly Gly Lys Lys  Asn Gly Ser Ser Gly  Lys Lys Lys 
    1055                 1060                 1065             


Lys Gln  Lys Asn Gln Ser Lys  Ser Lys Asn Arg Gly  Gly Thr Arg 
    1070                 1075                 1080             


Ser Thr  Gly Val Asp Glu Glu  Ala Leu Leu Ala Ser  Gly Met Met 
    1085                 1090                 1095             


Asp Gly  Val Lys Asn Tyr Glu  Glu Leu Asp Arg Asp  Gln Val Met 
    1100                 1105                 1110             


Val Lys  Leu Gly Glu Ala Ile  Gln Pro Met Lys Glu  Ser Phe Leu 
    1115                 1120                 1125             


Val Ala  Phe Leu Thr Trp Ser  Gly Ile Lys Glu Glu  Asp Leu Glu 
    1130                 1135                 1140             


Val Pro  Glu Ala Ile Ala Lys  Tyr Arg Glu Glu His  Pro Glu Asn 
    1145                 1150                 1155             


Val Val  Pro Leu Pro Ser Gly  Asn Lys Arg Asn Ala  Asp Gly Gln 
    1160                 1165                 1170             


Thr Lys  Asp Glu Val Val Pro  Leu Asp Ala Asp Gly  His Pro Leu 
    1175                 1180                 1185             


Lys Val  Leu Asp Asp Asp Ala  Glu Asp Leu Asp Cys  Glu Phe Leu 
    1190                 1195                 1200             


Asn Asn  Arg Gln Ala Phe Leu  Asn Leu Cys Arg Gly  Asn His Tyr 
    1205                 1210                 1215             


Gln Phe  Asp Glu Leu Arg Arg  Ala Lys His Thr Ser  Met Met Val 
    1220                 1225                 1230             


Leu Trp  His Leu Gln Asn Arg  Asp Ala Pro Lys Tyr  Val Gln Gln 
    1235                 1240                 1245             


Cys Val  Ser Cys Ser Arg Glu  Ile Leu Ser Gly Lys  Arg Tyr His 
    1250                 1255                 1260             


Cys Asn  Ser Cys Pro Asp Tyr  Asp Leu Cys Glu Thr  Cys Tyr Lys 
    1265                 1270                 1275             


Asp Pro  Lys Thr Asn Arg Gly  Thr Cys Thr His Lys  Leu Gln Glu 
    1280                 1285                 1290             


Ile Lys  Val Glu Ser Glu Gly  Gln Ser Asp Ser Ser  Gly Leu Thr 
    1295                 1300                 1305             


Glu Thr  Gln Arg Lys Gln Arg  Gln Arg Asn Leu Met  Leu His Ile 
    1310                 1315                 1320             


Gln Leu  Ile Glu His Ala Ser  Arg Cys Thr Ser Ser  Thr Cys Gln 
    1325                 1330                 1335             


Ser Lys  Asn Cys Ala Lys Met  Lys Glu Tyr Leu Gln  His Ala Arg 
    1340                 1345                 1350             


Thr Cys  Lys Thr Lys Val Val  Gly Gly Cys Arg Ile  Cys Lys Arg 
    1355                 1360                 1365             


Ile Trp  Thr Leu Leu Arg Ile  His Ala Gln Lys Cys  Lys Glu Pro 
    1370                 1375                 1380             


Val Cys  Pro Ile Pro Gln Cys  Met Ile Ile Arg Glu  Lys Met Arg 
    1385                 1390                 1395             


Glu Leu  Gln Lys Gln Gln Gln  Ala Met Asp Asp Arg  Arg Arg Gln 
    1400                 1405                 1410             


Glu Met  Asn Arg His Tyr Gly  Arg Met Ser Met Thr  Ser Gly Ser 
    1415                 1420                 1425             


Gly 
    


<210>  21
<211>  3156
<212>  DNA
<213>  Fragilariopsis cylindrus


<220>
<221>  misc_feature
<223>  Encodes polypeptide of SEQ ID NO:22

<400>  21
atgaaacgat tgtggaagca cattgccgaa tgcaaagatc aaaagtgttt agttcctcat       60

tgtgttagtt cacggtatgt tcttagtcac tatcatcgat gtaaggatgt tcgttgtccg      120

gtgtgcggtc ccgtaagaga ggctatacat cgaagtcacg agaagcagaa gcaaatgcaa      180

gcattgaaac aacgacatca acaggccgtc cagcaaaatc aaaatgaaga aaaaatacca      240

gcaggagcag ccttagcacc tcctccagta caacatcaac aggggtttgg atatccttca      300

aaaccacaac cgatgtctca acagcccgga gttccttcta cggcatcagt accaccgaag      360

atcaccgtcc ctcctatcgc tggtgtcaag tttgctaacg ggcaagttat tactccgaag      420

tttactggtc cgaaaccaca ggaagatcat actcttatca attgtttttc ggtcgaacag      480

atcgaaacgc acattaagtc tttgaataag ggtttgcaac ttccacccct aaaactgaag      540

gtaaaatgtc tcgaggtact caaagtcctt caaggtcacc agcatggttg ggtgtttaat      600

agtcccgtgg atcctattga actcggtcta cctgattact tcgaagttat taagattcca      660

atggatcttg gaacgattcg aaagaaatta gagaatggat gctatcattc tttggattcc      720

tttcataccc atgttcatac aacatttgat aatgcaatgc tgtataatcc cgaagggtca      780

gttgtttaca atatggcgaa tgaaatgaag accaagttta aacaagattt tgaaatcctc      840

atgaagcaac tgaatgccga tgaggatgta aagcgtcgaa atggcgaggc atgttcgttg      900

tgtggatgtg aaaaactctt gtttgagcca ccggtatttt attgcaacgg tctcagttgt      960

ccctcgaaac gtattcgacg aaatagctat tattatgtgg ggggaaacaa tcaatatcat     1020

tggtgccatc aatgttttca agaacttaag gacaatcaac tactcgaact tgcagatgtt     1080

tcgattcgga aggagcaact cacgaagaaa aagaacgacg aaacacacga agaaagttgg     1140

gtccaatgcg atcgttgcga gcgatgggtc catcagattt gtgctctttt caacactcgt     1200

cagaacaaag accagcggtc agaatttgct tgtccccggt gtacaattga ggaacgcaag     1260

aagacaggaa ggctggaagc aacttcctcc actccaatgg ctgaagatct tcaacgtaca     1320

aaactctctg aatacgttga aacccatgtt cgcgtcaaga tggctgaaca tctgaaggaa     1380

cttgcagaag agaaagtact aaaggaaggt atggacctcg aggaagctaa agcttctgtt     1440

acaatgggtg gtactatcac aatccgtcaa gttacttcta tggatcgtaa actcgaagtg     1500

agagaacgta tgaagaagcg ctacgccttt aagaattatc ctgacgaatt tacctatcgg     1560

tgcaagtgtt ttgtagtttt ccaaaatctt gacggtgtag atgtgattct atttggactt     1620

tacgtctacg aacacgacga gaagaatcct gcgccgaacc agcgagctgt atatgtatcg     1680

tatctcgata gtgttcatta catgcgacca agatctatga gaacgttcat ttatcacgag     1740

attctgatat cataccttga ttatgttcga cgacgtgggt tttctacagc tcatatctgg     1800

gcctgtccgc cactgaaagg tgatgattat attctatatg ctaaaccaga agatcaaaaa     1860

acaccgaagg atgatcgact tcgtcaatgg tacattgata tgctaattga ctgtcaaaaa     1920

cgcggcattg ttggtagact tactaacatg tacgacctat acttctcgag caaagaaaat     1980

aacgcaacga tcgttccata tatggaaggt gactatttcc cagccgaggt tgaaaatatc     2040

atcaaggaca tcgaagaagg aaaagttggc aagaagactg gaggaaagga aggaaagaaa     2100

aagaaaggag ataagaaaca gaagaagaag ggcggacgag gtggaacgcg atcaagtgga     2160

atcgacgaag atgccctcaa agctagtggt attcaattcc caggtaaaga ccaaaagagt     2220

ctagaagagg gaggtcgaga ctatgtaatg gtaaagttgg gggaaactat tcaacctatg     2280

aaggagagtt ttatcgttgc ccatttagcc tggaagggtg ctaaaaagga aaatatggtt     2340

gtgcccagag ctattcaaga atacagggaa aaacataata tcaagattga agatgagaag     2400

gagaaagaaa cggaagccga acctgcacca gttatttacg tcttggacag caagggaaga     2460

cgagtgaagg ttatcgacga tgatgcagaa gagatggact gtgaatttct caacaatcgt     2520

caagcatttt tgaatttatg ccaaggaaat cactatcagt acgatcattt aagaagggcg     2580

aagcattctt caatgatggt tctttggcat ctacacaatc gggatgcacc gaagtttgtc     2640

caacaatgta caacttgttc cagagagatt ttgcagggct atcgcttcca ttgtccaatc     2700

tgtgctgact ttgatcaatg tcaagattgt gtacagaatc ctaatactcc tcggcatcct     2760

catcagttga aacctattgc agtagcaggt caacaaactg agttgacaga agctcaacgc     2820

aaggaacgcc aacgaagtat acagttacat atgactcttt tgcagcacgc cgcgacctgt     2880

aactcaacaa agtgtccatc cgccaattgt accaaaatga agggcctttt gaagcacggt     2940

tcgcagtgta ctgttaaggc cacgggtggc tgtaatgtat gcaaaaggat atgggctctt     3000

ctccagatcc acgctcgtca gtgtaaggca cagcaatgtc ctgtccctaa ttgtatggcg     3060

atccgagaac gggtacgcca gttgaagaaa cagcaacagg caatggacga ccgtcgtcgt     3120

caagaaatga acagagttta tagaggagcg cgatag                               3156


<210>  22
<211>  1051
<212>  PRT
<213>  Fragilariopsis cylindrus


<220>
<221>  misc_feature
<223>  translation product 388115

<400>  22

Met Lys Arg Leu Trp Lys His Ile Ala Glu Cys Lys Asp Gln Lys Cys 
1               5                   10                  15      


Leu Val Pro His Cys Val Ser Ser Arg Tyr Val Leu Ser His Tyr His 
            20                  25                  30          


Arg Cys Lys Asp Val Arg Cys Pro Val Cys Gly Pro Val Arg Glu Ala 
        35                  40                  45              


Ile His Arg Ser His Glu Lys Gln Lys Gln Met Gln Ala Leu Lys Gln 
    50                  55                  60                  


Arg His Gln Gln Ala Val Gln Gln Asn Gln Asn Glu Glu Lys Ile Pro 
65                  70                  75                  80  


Ala Gly Ala Ala Leu Ala Pro Pro Pro Val Gln His Gln Gln Gly Phe 
                85                  90                  95      


Gly Tyr Pro Ser Lys Pro Gln Pro Met Ser Gln Gln Pro Gly Val Pro 
            100                 105                 110         


Ser Thr Ala Ser Val Pro Pro Lys Ile Thr Val Pro Pro Ile Ala Gly 
        115                 120                 125             


Val Lys Phe Ala Asn Gly Gln Val Ile Thr Pro Lys Phe Thr Gly Pro 
    130                 135                 140                 


Lys Pro Gln Glu Asp His Thr Leu Ile Asn Cys Phe Ser Val Glu Gln 
145                 150                 155                 160 


Ile Glu Thr His Ile Lys Ser Leu Asn Lys Gly Leu Gln Leu Pro Pro 
                165                 170                 175     


Leu Lys Leu Lys Val Lys Cys Leu Glu Val Leu Lys Val Leu Gln Gly 
            180                 185                 190         


His Gln His Gly Trp Val Phe Asn Ser Pro Val Asp Pro Ile Glu Leu 
        195                 200                 205             


Gly Leu Pro Asp Tyr Phe Glu Val Ile Lys Ile Pro Met Asp Leu Gly 
    210                 215                 220                 


Thr Ile Arg Lys Lys Leu Glu Asn Gly Cys Tyr His Ser Leu Asp Ser 
225                 230                 235                 240 


Phe His Thr His Val His Thr Thr Phe Asp Asn Ala Met Leu Tyr Asn 
                245                 250                 255     


Pro Glu Gly Ser Val Val Tyr Asn Met Ala Asn Glu Met Lys Thr Lys 
            260                 265                 270         


Phe Lys Gln Asp Phe Glu Ile Leu Met Lys Gln Leu Asn Ala Asp Glu 
        275                 280                 285             


Asp Val Lys Arg Arg Asn Gly Glu Ala Cys Ser Leu Cys Gly Cys Glu 
    290                 295                 300                 


Lys Leu Leu Phe Glu Pro Pro Val Phe Tyr Cys Asn Gly Leu Ser Cys 
305                 310                 315                 320 


Pro Ser Lys Arg Ile Arg Arg Asn Ser Tyr Tyr Tyr Val Gly Gly Asn 
                325                 330                 335     


Asn Gln Tyr His Trp Cys His Gln Cys Phe Gln Glu Leu Lys Asp Asn 
            340                 345                 350         


Gln Leu Leu Glu Leu Ala Asp Val Ser Ile Arg Lys Glu Gln Leu Thr 
        355                 360                 365             


Lys Lys Lys Asn Asp Glu Thr His Glu Glu Ser Trp Val Gln Cys Asp 
    370                 375                 380                 


Arg Cys Glu Arg Trp Val His Gln Ile Cys Ala Leu Phe Asn Thr Arg 
385                 390                 395                 400 


Gln Asn Lys Asp Gln Arg Ser Glu Phe Ala Cys Pro Arg Cys Thr Ile 
                405                 410                 415     


Glu Glu Arg Lys Lys Thr Gly Arg Leu Glu Ala Thr Ser Ser Thr Pro 
            420                 425                 430         


Met Ala Glu Asp Leu Gln Arg Thr Lys Leu Ser Glu Tyr Val Glu Thr 
        435                 440                 445             


His Val Arg Val Lys Met Ala Glu His Leu Lys Glu Leu Ala Glu Glu 
    450                 455                 460                 


Lys Val Leu Lys Glu Gly Met Asp Leu Glu Glu Ala Lys Ala Ser Val 
465                 470                 475                 480 


Thr Met Gly Gly Thr Ile Thr Ile Arg Gln Val Thr Ser Met Asp Arg 
                485                 490                 495     


Lys Leu Glu Val Arg Glu Arg Met Lys Lys Arg Tyr Ala Phe Lys Asn 
            500                 505                 510         


Tyr Pro Asp Glu Phe Thr Tyr Arg Cys Lys Cys Phe Val Val Phe Gln 
        515                 520                 525             


Asn Leu Asp Gly Val Asp Val Ile Leu Phe Gly Leu Tyr Val Tyr Glu 
    530                 535                 540                 


His Asp Glu Lys Asn Pro Ala Pro Asn Gln Arg Ala Val Tyr Val Ser 
545                 550                 555                 560 


Tyr Leu Asp Ser Val His Tyr Met Arg Pro Arg Ser Met Arg Thr Phe 
                565                 570                 575     


Ile Tyr His Glu Ile Leu Ile Ser Tyr Leu Asp Tyr Val Arg Arg Arg 
            580                 585                 590         


Gly Phe Ser Thr Ala His Ile Trp Ala Cys Pro Pro Leu Lys Gly Asp 
        595                 600                 605             


Asp Tyr Ile Leu Tyr Ala Lys Pro Glu Asp Gln Lys Thr Pro Lys Asp 
    610                 615                 620                 


Asp Arg Leu Arg Gln Trp Tyr Ile Asp Met Leu Ile Asp Cys Gln Lys 
625                 630                 635                 640 


Arg Gly Ile Val Gly Arg Leu Thr Asn Met Tyr Asp Leu Tyr Phe Ser 
                645                 650                 655     


Ser Lys Glu Asn Asn Ala Thr Ile Val Pro Tyr Met Glu Gly Asp Tyr 
            660                 665                 670         


Phe Pro Ala Glu Val Glu Asn Ile Ile Lys Asp Ile Glu Glu Gly Lys 
        675                 680                 685             


Val Gly Lys Lys Thr Gly Gly Lys Glu Gly Lys Lys Lys Lys Gly Asp 
    690                 695                 700                 


Lys Lys Gln Lys Lys Lys Gly Gly Arg Gly Gly Thr Arg Ser Ser Gly 
705                 710                 715                 720 


Ile Asp Glu Asp Ala Leu Lys Ala Ser Gly Ile Gln Phe Pro Gly Lys 
                725                 730                 735     


Asp Gln Lys Ser Leu Glu Glu Gly Gly Arg Asp Tyr Val Met Val Lys 
            740                 745                 750         


Leu Gly Glu Thr Ile Gln Pro Met Lys Glu Ser Phe Ile Val Ala His 
        755                 760                 765             


Leu Ala Trp Lys Gly Ala Lys Lys Glu Asn Met Val Val Pro Arg Ala 
    770                 775                 780                 


Ile Gln Glu Tyr Arg Glu Lys His Asn Ile Lys Ile Glu Asp Glu Lys 
785                 790                 795                 800 


Glu Lys Glu Thr Glu Ala Glu Pro Ala Pro Val Ile Tyr Val Leu Asp 
                805                 810                 815     


Ser Lys Gly Arg Arg Val Lys Val Ile Asp Asp Asp Ala Glu Glu Met 
            820                 825                 830         


Asp Cys Glu Phe Leu Asn Asn Arg Gln Ala Phe Leu Asn Leu Cys Gln 
        835                 840                 845             


Gly Asn His Tyr Gln Tyr Asp His Leu Arg Arg Ala Lys His Ser Ser 
    850                 855                 860                 


Met Met Val Leu Trp His Leu His Asn Arg Asp Ala Pro Lys Phe Val 
865                 870                 875                 880 


Gln Gln Cys Thr Thr Cys Ser Arg Glu Ile Leu Gln Gly Tyr Arg Phe 
                885                 890                 895     


His Cys Pro Ile Cys Ala Asp Phe Asp Gln Cys Gln Asp Cys Val Gln 
            900                 905                 910         


Asn Pro Asn Thr Pro Arg His Pro His Gln Leu Lys Pro Ile Ala Val 
        915                 920                 925             


Ala Gly Gln Gln Thr Glu Leu Thr Glu Ala Gln Arg Lys Glu Arg Gln 
    930                 935                 940                 


Arg Ser Ile Gln Leu His Met Thr Leu Leu Gln His Ala Ala Thr Cys 
945                 950                 955                 960 


Asn Ser Thr Lys Cys Pro Ser Ala Asn Cys Thr Lys Met Lys Gly Leu 
                965                 970                 975     


Leu Lys His Gly Ser Gln Cys Thr Val Lys Ala Thr Gly Gly Cys Asn 
            980                 985                 990         


Val Cys Lys Arg Ile Trp Ala Leu  Leu Gln Ile His Ala  Arg Gln Cys 
        995                 1000                 1005             


Lys Ala  Gln Gln Cys Pro Val  Pro Asn Cys Met Ala  Ile Arg Glu 
    1010                 1015                 1020             


Arg Val  Arg Gln Leu Lys Lys  Gln Gln Gln Ala Met  Asp Asp Arg 
    1025                 1030                 1035             


Arg Arg  Gln Glu Met Asn Arg  Val Tyr Arg Gly Ala  Arg 
    1040                 1045                 1050     


<210>  23
<211>  2598
<212>  DNA
<213>  Thalassiosira pseudonana


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:24

<400>  23
atgaatgatg actgtgtgat atcagctgac agcgaagcca gtagtgttgc ccagaaagtc       60

acctcgattc ttccgtcaat gtcaattgca gctattcaac agcacgttga atctttggca      120

tccaatggtc agctgacacc tcggcttatc acccgaaaat gcctccctct cgttagaaag      180

ctatataacc acgaacacgg atgggtcttt aaggatccag ttgatcctgt ggagttgggc      240

attccagact actttgatat tgtgcagcat ccaatggatc ttgccttggt agagacgaag      300

cttgagaatg gagtgtacaa agatctagat tcttttgagc gtgatacaaa gctagtgttt      360

gagaacgcaa tccttttcaa tggtgagaag aatgatgttg gtggaatggc aaagcaactg      420

ttgtttatgt ttgacgagga tctcaaagct gtaatgaaag gtatggggtt ggttcacaaa      480

agtgagaagg aagaacccaa gaagaaggat gacacgtcat gcacactctg tgggaatcac      540

cgccgtctct ttgagccaac cactctctac tgcagtggtc agtgcggaat gcagaaaatc      600

cgtcgcaacg catcgtatta cactgacaga tatcgacaaa accaatggtg tgagaagtgc      660

tttgatgttt tgatggagga gaagccagtt ctgcttgatg atggaaagga gacgaagaag      720

tcgctactgg tgaaaatgaa gaatgactcg acaccagagg agaagtgggt tcaatgcgac      780

aattgtcata attgggctca tcagatttgt gctctcttca atgaggtgca aagtagcaat      840

gcgtttacgt gtcccaagtg tttcttgaaa cagcaagata gagcgactag tccagagctt      900

acttcgttca aagatgcagc cgctttgccc cagtgtaaac tgagtactgt gatcgaagaa      960

ggtctggcga cgacactttc tgtcgaatac gaaaagattg caaaggaaag aggatgcacc     1020

gtagcccagg ttgaaaaggc agagggcctc tgcgttagag ttgtgtcaag tcttgagaaa     1080

aagcacaagg ttcgggatga gatgctgggt cgatattcaa agaagggata tccatcagag     1140

tttccagtga cctcaaagtg catcctattg ttccagaaga tccacggagt tgatgtgctt     1200

ctgtttggaa tgtatgtcta cgagtacggt gacaagtgtg cagctccgaa ccggcgacga     1260

gtctacattt catacttgga ttcagttcag tatcttgagc catcatcata caggacaacc     1320

acctaccagt ccatcattgt tgaatacctt cgatacgcaa ggatgcgtgg ctttcacact     1380

gctcacattt ggagttgccc tccgtcaaag ggcgatgagt acattttcta ctgccaccct     1440

tcctctcagc tcgttcccaa agacgacatg ctttgtgctt ggtacattga aactctcaaa     1500

aaagctcaag accagggcat cgtcttggaa acaaggacca tctacgacga gtattttaag     1560

aacaatggta tcaactcaga gaatggagag ccctttgatc caatgagcct cccttacttt     1620

gaaggcgact acatccccgg agagatagag aaaatcatta gagactttaa caaggatgag     1680

aatttgcgcg aagagaccaa gttaaaggaa ctcaagtctg cccctgctcc aacggctcat     1740

aagaaggaag gcaatcgtaa aggcactagg tccaacccgg gtgaattggt aaatcaagac     1800

cgcgacaaag tgatgattcg tcttgacttg gctttggcga aaatgaagca aaactttatc     1860

gtagcccagc ttcttagcga tgacttcatc aaggcggtgg agaagggtca cgatgtttcc     1920

tcatggatag aagatattga gccgcacgaa gtaaagcagc cgaagcaggt tggcaagaat     1980

ccgtgcgtcc ttgatgcacc gactgatatg tctgacaaaa tgagcgctga cggaaaagat     2040

ggagatgcca caaagacacc tgcatccttg gtaattggta atactattga cgaagacccc     2100

ttgatggagc aggaattcat cgacactcgt cttcagttct tgaactactg tcaaaagaac     2160

aacctccagt ttgatgagtt gcgtcgtgcc aaacacacaa caatgatgct tctttgcaat     2220

ctgcacaatc ctcgggctga acgagagcag caagttaagg tgcacttgca gatcattgca     2280

catgcttcgt gttgcaatgg tcctccggct tgcatgtcta ccaactgtcg aaggatgaag     2340

caactattca gccacgtcag gggatgcgaa attacctaca aaagaggctg caagatgtgc     2400

gttcgtctat tcatgcttct taccaaacat gcccgcgatt gtgactctgc gggatcatgt     2460

gctattccgt tttgtgatcg tattagggag aggaatagga gaatgttgcg tcaacagcaa     2520

cttatggatg ataggcgaag gaatgctcag aatgataggc acagggaaga ggaagatgac     2580

gctcaagctc gtgtttga                                                   2598


<210>  24
<211>  865
<212>  PRT
<213>  Thalassiosira pseudonana


<220>
<221>  misc_feature
<223>  translation product 324007

<400>  24

Met Asn Asp Asp Cys Val Ile Ser Ala Asp Ser Glu Ala Ser Ser Val 
1               5                   10                  15      


Ala Gln Lys Val Thr Ser Ile Leu Pro Ser Met Ser Ile Ala Ala Ile 
            20                  25                  30          


Gln Gln His Val Glu Ser Leu Ala Ser Asn Gly Gln Leu Thr Pro Arg 
        35                  40                  45              


Leu Ile Thr Arg Lys Cys Leu Pro Leu Val Arg Lys Leu Tyr Asn His 
    50                  55                  60                  


Glu His Gly Trp Val Phe Lys Asp Pro Val Asp Pro Val Glu Leu Gly 
65                  70                  75                  80  


Ile Pro Asp Tyr Phe Asp Ile Val Gln His Pro Met Asp Leu Ala Leu 
                85                  90                  95      


Val Glu Thr Lys Leu Glu Asn Gly Val Tyr Lys Asp Leu Asp Ser Phe 
            100                 105                 110         


Glu Arg Asp Thr Lys Leu Val Phe Glu Asn Ala Ile Leu Phe Asn Gly 
        115                 120                 125             


Glu Lys Asn Asp Val Gly Gly Met Ala Lys Gln Leu Leu Phe Met Phe 
    130                 135                 140                 


Asp Glu Asp Leu Lys Ala Val Met Lys Gly Met Gly Leu Val His Lys 
145                 150                 155                 160 


Ser Glu Lys Glu Glu Pro Lys Lys Lys Asp Asp Thr Ser Cys Thr Leu 
                165                 170                 175     


Cys Gly Asn His Arg Arg Leu Phe Glu Pro Thr Thr Leu Tyr Cys Ser 
            180                 185                 190         


Gly Gln Cys Gly Met Gln Lys Ile Arg Arg Asn Ala Ser Tyr Tyr Thr 
        195                 200                 205             


Asp Arg Tyr Arg Gln Asn Gln Trp Cys Glu Lys Cys Phe Asp Val Leu 
    210                 215                 220                 


Met Glu Glu Lys Pro Val Leu Leu Asp Asp Gly Lys Glu Thr Lys Lys 
225                 230                 235                 240 


Ser Leu Leu Val Lys Met Lys Asn Asp Ser Thr Pro Glu Glu Lys Trp 
                245                 250                 255     


Val Gln Cys Asp Asn Cys His Asn Trp Ala His Gln Ile Cys Ala Leu 
            260                 265                 270         


Phe Asn Glu Val Gln Ser Ser Asn Ala Phe Thr Cys Pro Lys Cys Phe 
        275                 280                 285             


Leu Lys Gln Gln Asp Arg Ala Thr Ser Pro Glu Leu Thr Ser Phe Lys 
    290                 295                 300                 


Asp Ala Ala Ala Leu Pro Gln Cys Lys Leu Ser Thr Val Ile Glu Glu 
305                 310                 315                 320 


Gly Leu Ala Thr Thr Leu Ser Val Glu Tyr Glu Lys Ile Ala Lys Glu 
                325                 330                 335     


Arg Gly Cys Thr Val Ala Gln Val Glu Lys Ala Glu Gly Leu Cys Val 
            340                 345                 350         


Arg Val Val Ser Ser Leu Glu Lys Lys His Lys Val Arg Asp Glu Met 
        355                 360                 365             


Leu Gly Arg Tyr Ser Lys Lys Gly Tyr Pro Ser Glu Phe Pro Val Thr 
    370                 375                 380                 


Ser Lys Cys Ile Leu Leu Phe Gln Lys Ile His Gly Val Asp Val Leu 
385                 390                 395                 400 


Leu Phe Gly Met Tyr Val Tyr Glu Tyr Gly Asp Lys Cys Ala Ala Pro 
                405                 410                 415     


Asn Arg Arg Arg Val Tyr Ile Ser Tyr Leu Asp Ser Val Gln Tyr Leu 
            420                 425                 430         


Glu Pro Ser Ser Tyr Arg Thr Thr Thr Tyr Gln Ser Ile Ile Val Glu 
        435                 440                 445             


Tyr Leu Arg Tyr Ala Arg Met Arg Gly Phe His Thr Ala His Ile Trp 
    450                 455                 460                 


Ser Cys Pro Pro Ser Lys Gly Asp Glu Tyr Ile Phe Tyr Cys His Pro 
465                 470                 475                 480 


Ser Ser Gln Leu Val Pro Lys Asp Asp Met Leu Cys Ala Trp Tyr Ile 
                485                 490                 495     


Glu Thr Leu Lys Lys Ala Gln Asp Gln Gly Ile Val Leu Glu Thr Arg 
            500                 505                 510         


Thr Ile Tyr Asp Glu Tyr Phe Lys Asn Asn Gly Ile Asn Ser Glu Asn 
        515                 520                 525             


Gly Glu Pro Phe Asp Pro Met Ser Leu Pro Tyr Phe Glu Gly Asp Tyr 
    530                 535                 540                 


Ile Pro Gly Glu Ile Glu Lys Ile Ile Arg Asp Phe Asn Lys Asp Glu 
545                 550                 555                 560 


Asn Leu Arg Glu Glu Thr Lys Leu Lys Glu Leu Lys Ser Ala Pro Ala 
                565                 570                 575     


Pro Thr Ala His Lys Lys Glu Gly Asn Arg Lys Gly Thr Arg Ser Asn 
            580                 585                 590         


Pro Gly Glu Leu Val Asn Gln Asp Arg Asp Lys Val Met Ile Arg Leu 
        595                 600                 605             


Asp Leu Ala Leu Ala Lys Met Lys Gln Asn Phe Ile Val Ala Gln Leu 
    610                 615                 620                 


Leu Ser Asp Asp Phe Ile Lys Ala Val Glu Lys Gly His Asp Val Ser 
625                 630                 635                 640 


Ser Trp Ile Glu Asp Ile Glu Pro His Glu Val Lys Gln Pro Lys Gln 
                645                 650                 655     


Val Gly Lys Asn Pro Cys Val Leu Asp Ala Pro Thr Asp Met Ser Asp 
            660                 665                 670         


Lys Met Ser Ala Asp Gly Lys Asp Gly Asp Ala Thr Lys Thr Pro Ala 
        675                 680                 685             


Ser Leu Val Ile Gly Asn Thr Ile Asp Glu Asp Pro Leu Met Glu Gln 
    690                 695                 700                 


Glu Phe Ile Asp Thr Arg Leu Gln Phe Leu Asn Tyr Cys Gln Lys Asn 
705                 710                 715                 720 


Asn Leu Gln Phe Asp Glu Leu Arg Arg Ala Lys His Thr Thr Met Met 
                725                 730                 735     


Leu Leu Cys Asn Leu His Asn Pro Arg Ala Glu Arg Glu Gln Gln Val 
            740                 745                 750         


Lys Val His Leu Gln Ile Ile Ala His Ala Ser Cys Cys Asn Gly Pro 
        755                 760                 765             


Pro Ala Cys Met Ser Thr Asn Cys Arg Arg Met Lys Gln Leu Phe Ser 
    770                 775                 780                 


His Val Arg Gly Cys Glu Ile Thr Tyr Lys Arg Gly Cys Lys Met Cys 
785                 790                 795                 800 


Val Arg Leu Phe Met Leu Leu Thr Lys His Ala Arg Asp Cys Asp Ser 
                805                 810                 815     


Ala Gly Ser Cys Ala Ile Pro Phe Cys Asp Arg Ile Arg Glu Arg Asn 
            820                 825                 830         


Arg Arg Met Leu Arg Gln Gln Gln Leu Met Asp Asp Arg Arg Arg Asn 
        835                 840                 845             


Ala Gln Asn Asp Arg His Arg Glu Glu Glu Asp Asp Ala Gln Ala Arg 
    850                 855                 860                 


Val 
865 


<210>  25
<211>  5154
<212>  DNA
<213>  Thalassiosira pseudonana


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:26

<400>  25
atgggtccct atgaaggcgc atcttcagcc caacaaaaca atggatcacg accccctcac       60

aatcagatgc aagggcatcc ctcccaacag caaccgccgc agcatggagg aggagggggt      120

caaatgcctc ccaacgggca tcatcaaatg ggcaatccgt atggaggata tcatccactt      180

aatatgcagc agggaggtca accgcagcag cagcagaatg gaagatggga ggcgggggag      240

gcgtaaccat ggggggattt aacatgcaag gcagtaacgg aggtatgtcc atgggtggga      300

ataatggtca acagcacagg gggatgcatc cgatgagtat gcaaagcggg ggaggtggga      360

gtcaatacgg aggaccggga ggaggagggg gtaacaacaa catgtacaat catcccaatt      420

ttaactctgg gaatcagcct gggggaggag gacatcatca tgggggatac aatccccatc      480

agcaacagat acagcagcaa cagcttggtg ggtacaatcc ccaaatgatg gcgcagatgc      540

agcagtcaca caattcgcag tataatccga tgcaacagat gggtcaacgt tctatcaaca      600

acaacagcag catcagggtg gatatcatcc acagccacag atcccccttc ctcaggccca      660

agcctacggc caacaacagc agcagttgca tccatccaac tcctacgcca gacaagcctc      720

ctctgcatcc atacattcat ctattgccgt ggccaacgac agcagtccct caccctcgga      780

caacatgcaa cttctttcat acaaggaggc tccaaacttt tcagaagtaa cgggagtcga      840

tataggggat gaggattatg ggcagcaatt tttgcccacg gggttgaatg gggattggca      900

gagtgatcgg gatatgcatc acaggaggga gatgattcag cacattgtta agctgctgaa      960

gcagaaagac aagagcgctt cccccgagtg gctcaccaaa ctccctcaga tggtgaaaca     1020

actagaagtg tcactctacc gttcagctcc ctcctttgaa gcgtattccg acaccaacac     1080

cctcaagcat cgtctacaac aactggccat ggaaattgcg aggaagactc aacaggccaa     1140

ggcgagtgga aggtcgtcca gcaggggtga tcgtattcca ggaatgggca acatgcacta     1200

caaccctgtc aacatgggag caaacgagga aatagtcagc agtcaatacg gcaatccaaa     1260

tgatcccgaa tggaaggttc gtatccgtca caagcagcaa cgtctactgc tcttgcacca     1320

ctcctccaaa tgcccctacg acaatgacaa gtgcaaggtc actccctatt gcggcgagat     1380

gaagaaattg tggaaacaca tggcgcgttg cacagacaat gagtgtcgag tgcctcactg     1440

cttctccagt cgatctattt tgagtcattg tcgtaaatgc aaagatcctg gatgtcctgc     1500

ctgcggtcct gtcagagaga cggtgcgaaa gacgcaaaag agtaacgctg gtaagggtgt     1560

gaatgaaggt caaggtgatt ttggtgggat gggtccaaat agtggaattg ggttgggagg     1620

aatgggtaat gaaatgggcg gcggtggtgg cagtgggatg ggaggcaacg acatgatggg     1680

tggaatgcca atgatggggg gaaacatgaa tcaaatgcca aagaggcagc cttctcagcc     1740

aatgccttgg aaaggggata tcaatagcat gccaaacttc ccaccgccga atataagaca     1800

gcagcaggat gactacatgg ctttcccgga gggtttccct gagggacagc aggtgctcaa     1860

tggaccgagc tctgggcaga gtgggaatcc agaatcgagc aaggctaggc ataagcagca     1920

acgtcttctc cttcttcgtc atgcatcaaa gtgtaatgca gagcctggtc gctgtaccac     1980

cactccacac tgtgctgaaa tgaaagtctt gtggaaacac atcgccaatt gcaaggatca     2040

gtattgcaag gtgaaacact gcatgagcag tcggtatgtt ctcagtcact atcgtcggtg     2100

caatgatccg ggatgtgaga tttgcggtcc ggtgagggag atttttaaga gtggcacgaa     2160

ccatttcatt catgatccgt cctttatgcc aggatcatca gcggctgatc tcatcactcc     2220

tcctctccca gagggaccac aaacgaagag gtcgaggact aacgatcctt caatgaatgg     2280

aatgcatcat accgcgcctg ctcggccagc cttccctcta agtgctccta catctggctc     2340

ggagaatcat gccaagttga agtcttcagc gaagccttcg tcatccaatg ccacggaaga     2400

gcactcttta ttggaatgtt ttacgacgga gcaggtcaag actcatatcc aatcactgaa     2460

gaagacgata gaagtgtcac ccgccaagtt gaagctcaag tgcgtggaaa tattgagaga     2520

actccaaatt cacgagcatg gttgggtgtt tgcaacgcct gttgatcccg tcgagctggg     2580

tcttgatgac tactttgacg ttatcaaaaa gccgatggat cttggaacta tcagtaggag     2640

gcttgacaac ggatcgtacc atgcctttga tgacttcaag tctgatgttc ggcttacttt     2700

tgagaacgct atgaaataca atgatgagaa ttcggtagtt cacgaaatgg caaaggagtt     2760

gaagaagaag tttgatactg actacaaaaa gctaatgaag cagctggaga aggagcaccg     2820

agagaactcc atgaggcagc aggcgtgcgg cctttgtggt tgcgaaaagc tcaactttga     2880

gcctcccgtg ttcttttgca acggtatgaa ctgtcccagt aagcgcatcc gtcgtaacac     2940

ccacttctac atcacggccg acaagcagta tgcttggtgc agccaatgtt acaatgagct     3000

tgggggagag attgacctcg gtacgtcagt cttgaagaag gtggaccttg cgaagaagaa     3060

gaacgacgag actcacgagg agagttgggt tcagtgtgac gattgtgagc gatggatcca     3120

tcagatttgt ggactctaca acacacgtca ggacaaggag aacaagagtg cctattcttg     3180

tccactatgc ctgctggata agaggaagaa agaaggagag cctaaaaagc tcccacctcc     3240

tcccgcagcg agcgacattc ccaggacaaa tctgtcagat tggcttgaaa gggatgttca     3300

caagaaggta aatcagcgtc tcaaagagct tgcgcaggag aaggccgata ctgagcacat     3360

tgcctttgaa aaggcgtatg ctgatctttc tgctgggggg cctttgacca ttcggcaggt     3420

gacgtctact gaccgaaagt tggaagttcg cgatcaaatg aggcagcgat atgctcataa     3480

gaactatcct gaggagtttc cctaccgttg taaatgcatt gttgtcttcc agaacattga     3540

tggtgttgac gtggttcttt ttgcgttgta tgtttacgag catggagatg acaatccctt     3600

ccccaaaaaa agacggtgta tgtgtcctac cttgacagtg tccacttcat gaagccaagg     3660

caaatgagga cgttccttta ccacgaaatc ttaatctcct accttgacta cgctcgtcag     3720

aaaggcttct tgcaggcctt catttgggcg tgcccaccgt tgaaggggga cgattacatc     3780

ttctacgcaa aaccagagga tcaaaagact cccaaagacg taagacttcg tcaatggtat     3840

cttgatatgc tggtggagtg ccagaaacgc aacatcgttg gtatggtctc caatatgtat     3900

gatcaatact ttgccaacaa gtctctggat gcagcgagtg tcccctactt tgacggagat     3960

tacttccctg gagaggctga aaatatcatc aaagacttgg aagaaagcaa cagtaagcgc     4020

aagggtggtg ctggcaagaa aaataaggat ccttcaaaga gcaaagctgc tccatctggt     4080

gatgcagagt ttgtgggtga aaagtgctac aaggagggtg gtcgtgatcc cgtgatgcag     4140

aagttctgcg acgccattca ggggatgaag gagagtttca tcgtcgcata cttgaacgca     4200

aaggacgcca agcctgagca tcttgtcgta ccgaagaaga ttatggagtt tagggaagca     4260

aacaaacttc tcatgatcga cgatgatcct aagaagaaga aagaagatgg aacggaggag     4320

aagaaggatg acgaaaagcc tcagagcaag aagcgcgatg ctgacggtta cgaagtcgcc     4380

gcctcggaaa agccaccggc taataagcaa ctcaatagca agggaaagcc tgtccgagta     4440

ttgaacgacg acgatgaaga aattgactgt gaattcttta acacgcgaca atgctttttg     4500

gatctctgcc gtggtaacca ctatcagttt gatgagttgc gacgggcaaa gcatacgtca     4560

atgatggttt tgtggcacct tcaaaatcgt gaagcgccaa aattcgttca gcagtgcatg     4620

gcatgcaacc gcgagatcgc gtctggcatt cgtcatcatt gcaacgtatg ctcagacttt     4680

gacctctgtg acgattgctt ccgagatcca gacaccaaca gaggcacgtg caatcataag     4740

cttgaggcaa ttaaagtgga tactgcccag agtgaaaaca gtggactcac cgaggagcaa     4800

cgaaaggagc gtcagcgaaa catccagctt catatcactc tcattgagca tgcatctcgt     4860

tgtaactcgt cttcctgcaa gtcttccaat tgtatgaaaa tgaaatccta cctcaagcac     4920

ggctcaacgt gcacggtcaa agcatcagga ggatgcaaga tttgcaagag aatctggacg     4980

ttgttgagga ttcacgcaca gcaatgcaag agctctagct gtgccatccc gcaatgtatc     5040

gcaattagaa agcgtatccg tcagcttcag ctcaagcagc aggctatgga cgaccgtaga     5100

aggcaagaaa tgaaccgaca ctaccgcatg ggaatgatgt cctctgataa ctga           5154


<210>  26
<211>  1718
<212>  PRT
<213>  Thalassiosira pseudonana


<220>
<221>  misc_feature
<223>  Translation product 324378

<400>  26

Met Gly Pro Tyr Glu Gly Ala Ser Ser Ala Gln Gln Asn Asn Gly Ser 
1               5                   10                  15      


Arg Pro Pro His Asn Gln Met Gln Gly His Pro Ser Gln Gln Gln Pro 
            20                  25                  30          


Pro Gln His Gly Gly Gly Gly Gly Gln Met Pro Pro Asn Gly His His 
        35                  40                  45              


Gln Met Gly Asn Pro Tyr Gly Gly Tyr His Pro Leu Asn Met Gln Gln 
    50                  55                  60                  


Gly Gly Gln Pro Gln Gln Gln Gln Asn Gly Met Met Gly Gly Gly Gly 
65                  70                  75                  80  


Gly Val Thr Met Gly Gly Phe Asn Met Gln Gly Ser Asn Gly Gly Met 
                85                  90                  95      


Ser Met Gly Gly Asn Asn Gly Gln Gln His Arg Gly Met His Pro Met 
            100                 105                 110         


Ser Met Gln Ser Gly Gly Gly Gly Ser Gln Tyr Gly Gly Pro Gly Gly 
        115                 120                 125             


Gly Gly Gly Asn Asn Asn Met Tyr Asn His Pro Asn Phe Asn Ser Gly 
    130                 135                 140                 


Asn Gln Pro Gly Gly Gly Gly His His His Gly Gly Tyr Asn Pro His 
145                 150                 155                 160 


Gln Gln Gln Ile Gln Gln Gln Gln Leu Gly Gly Tyr Asn Pro Gln Met 
                165                 170                 175     


Met Ala Gln Met Gln Gln Ser His Asn Ser Gln Tyr Asn Pro Met Gln 
            180                 185                 190         


Gln Met Gly Gln Arg Ser His Gln Gln Gln Gln Gln His Gln Gly Gly 
        195                 200                 205             


Tyr His Pro Gln Pro Gln Ile Pro Leu Pro Gln Ala Gln Ala Tyr Gly 
    210                 215                 220                 


Gln Gln Gln Gln Gln Leu His Pro Ser Asn Ser Tyr Ala Arg Gln Ala 
225                 230                 235                 240 


Ser Ser Ala Ser Ile His Ser Ser Ile Ala Val Ala Asn Asp Ser Ser 
                245                 250                 255     


Pro Ser Pro Ser Asp Asn Met Gln Leu Leu Ser Tyr Lys Glu Ala Pro 
            260                 265                 270         


Asn Phe Ser Glu Val Thr Gly Val Asp Ile Gly Asp Glu Asp Tyr Gly 
        275                 280                 285             


Gln Gln Phe Leu Pro Thr Gly Leu Asn Gly Asp Trp Gln Ser Asp Arg 
    290                 295                 300                 


Asp Met His His Arg Arg Glu Met Ile Gln His Ile Val Lys Leu Leu 
305                 310                 315                 320 


Lys Gln Lys Asp Lys Ser Ala Ser Pro Glu Trp Leu Thr Lys Leu Pro 
                325                 330                 335     


Gln Met Val Lys Gln Leu Glu Val Ser Leu Tyr Arg Ser Ala Pro Ser 
            340                 345                 350         


Phe Glu Ala Tyr Ser Asp Thr Asn Thr Leu Lys His Arg Leu Gln Gln 
        355                 360                 365             


Leu Ala Met Glu Ile Ala Arg Lys Thr Gln Gln Ala Lys Ala Ser Gly 
    370                 375                 380                 


Arg Ser Ser Ser Arg Gly Asp Arg Ile Pro Gly Met Gly Asn Met His 
385                 390                 395                 400 


Tyr Asn Pro Val Asn Met Gly Ala Asn Glu Glu Ile Val Ser Ser Gln 
                405                 410                 415     


Tyr Gly Asn Pro Asn Asp Pro Glu Trp Lys Val Arg Ile Arg His Lys 
            420                 425                 430         


Gln Gln Arg Leu Leu Leu Leu His His Ser Ser Lys Cys Pro Tyr Asp 
        435                 440                 445             


Asn Asp Lys Cys Lys Val Thr Pro Tyr Cys Gly Glu Met Lys Lys Leu 
    450                 455                 460                 


Trp Lys His Met Ala Arg Cys Thr Asp Asn Glu Cys Arg Val Pro His 
465                 470                 475                 480 


Cys Phe Ser Ser Arg Ser Ile Leu Ser His Cys Arg Lys Cys Lys Asp 
                485                 490                 495     


Pro Gly Cys Pro Ala Cys Gly Pro Val Arg Glu Thr Val Arg Lys Thr 
            500                 505                 510         


Gln Lys Ser Asn Ala Gly Lys Gly Val Asn Glu Gly Gln Gly Asp Phe 
        515                 520                 525             


Gly Gly Met Gly Pro Asn Ser Gly Ile Gly Leu Gly Gly Met Gly Asn 
    530                 535                 540                 


Glu Met Gly Gly Gly Gly Gly Ser Gly Met Gly Gly Asn Asp Met Met 
545                 550                 555                 560 


Gly Gly Met Pro Met Met Gly Gly Asn Met Asn Gln Met Pro Lys Arg 
                565                 570                 575     


Gln Pro Ser Gln Pro Met Pro Trp Lys Gly Asp Ile Asn Ser Met Pro 
            580                 585                 590         


Asn Phe Pro Pro Pro Asn Ile Arg Gln Gln Gln Asp Asp Tyr Met Ala 
        595                 600                 605             


Phe Pro Glu Gly Phe Pro Glu Gly Gln Gln Val Leu Asn Gly Pro Ser 
    610                 615                 620                 


Ser Gly Gln Ser Gly Asn Pro Glu Ser Ser Lys Ala Arg His Lys Gln 
625                 630                 635                 640 


Gln Arg Leu Leu Leu Leu Arg His Ala Ser Lys Cys Asn Ala Glu Pro 
                645                 650                 655     


Gly Arg Cys Thr Thr Thr Pro His Cys Ala Glu Met Lys Val Leu Trp 
            660                 665                 670         


Lys His Ile Ala Asn Cys Lys Asp Gln Tyr Cys Lys Val Lys His Cys 
        675                 680                 685             


Met Ser Ser Arg Tyr Val Leu Ser His Tyr Arg Arg Cys Asn Asp Pro 
    690                 695                 700                 


Gly Cys Glu Ile Cys Gly Pro Val Arg Glu Ile Phe Lys Ser Gly Thr 
705                 710                 715                 720 


Asn His Phe Ile His Asp Pro Ser Phe Met Pro Gly Ser Ser Ala Ala 
                725                 730                 735     


Asp Leu Ile Thr Pro Pro Leu Pro Glu Gly Pro Gln Thr Lys Arg Ser 
            740                 745                 750         


Arg Thr Asn Asp Pro Ser Met Asn Gly Met His His Thr Ala Pro Ala 
        755                 760                 765             


Arg Pro Ala Phe Pro Leu Ser Ala Pro Thr Ser Gly Ser Glu Asn His 
    770                 775                 780                 


Ala Lys Leu Lys Ser Ser Ala Lys Pro Ser Ser Ser Asn Ala Thr Glu 
785                 790                 795                 800 


Glu His Ser Leu Leu Glu Cys Phe Thr Thr Glu Gln Val Lys Thr His 
                805                 810                 815     


Ile Gln Ser Leu Lys Lys Thr Ile Glu Val Ser Pro Ala Lys Leu Lys 
            820                 825                 830         


Leu Lys Cys Val Glu Ile Leu Arg Glu Leu Gln Ile His Glu His Gly 
        835                 840                 845             


Trp Val Phe Ala Thr Pro Val Asp Pro Val Glu Leu Gly Leu Asp Asp 
    850                 855                 860                 


Tyr Phe Asp Val Ile Lys Lys Pro Met Asp Leu Gly Thr Ile Ser Arg 
865                 870                 875                 880 


Arg Leu Asp Asn Gly Ser Tyr His Ala Phe Asp Asp Phe Lys Ser Asp 
                885                 890                 895     


Val Arg Leu Thr Phe Glu Asn Ala Met Lys Tyr Asn Asp Glu Asn Ser 
            900                 905                 910         


Val Val His Glu Met Ala Lys Glu Leu Lys Lys Lys Phe Asp Thr Asp 
        915                 920                 925             


Tyr Lys Lys Leu Met Lys Gln Leu Glu Lys Glu His Arg Glu Asn Ser 
    930                 935                 940                 


Met Arg Gln Gln Ala Cys Gly Leu Cys Gly Cys Glu Lys Leu Asn Phe 
945                 950                 955                 960 


Glu Pro Pro Val Phe Phe Cys Asn Gly Met Asn Cys Pro Ser Lys Arg 
                965                 970                 975     


Ile Arg Arg Asn Thr His Phe Tyr Ile Thr Ala Asp Lys Gln Tyr Ala 
            980                 985                 990         


Trp Cys Ser Gln Cys Tyr Asn Glu  Leu Gly Gly Glu Ile  Asp Leu Gly 
        995                 1000                 1005             


Thr Ser  Val Leu Lys Lys Val  Asp Leu Ala Lys Lys  Lys Asn Asp 
    1010                 1015                 1020             


Glu Thr  His Glu Glu Ser Trp  Val Gln Cys Asp Asp  Cys Glu Arg 
    1025                 1030                 1035             


Trp Ile  His Gln Ile Cys Gly  Leu Tyr Asn Thr Arg  Gln Asp Lys 
    1040                 1045                 1050             


Glu Asn  Lys Ser Ala Tyr Ser  Cys Pro Leu Cys Leu  Leu Asp Lys 
    1055                 1060                 1065             


Arg Lys  Lys Glu Gly Glu Pro  Lys Lys Leu Pro Pro  Pro Pro Ala 
    1070                 1075                 1080             


Ala Ser  Asp Ile Pro Arg Thr  Asn Leu Ser Asp Trp  Leu Glu Arg 
    1085                 1090                 1095             


Asp Val  His Lys Lys Val Asn  Gln Arg Leu Lys Glu  Leu Ala Gln 
    1100                 1105                 1110             


Glu Lys  Ala Asp Thr Glu His  Ile Ala Phe Glu Lys  Ala Tyr Ala 
    1115                 1120                 1125             


Asp Leu  Ser Ala Gly Gly Pro  Leu Thr Ile Arg Gln  Val Thr Ser 
    1130                 1135                 1140             


Thr Asp  Arg Lys Leu Glu Val  Arg Asp Gln Met Arg  Gln Arg Tyr 
    1145                 1150                 1155             


Ala His  Lys Asn Tyr Pro Glu  Glu Phe Pro Tyr Arg  Cys Lys Cys 
    1160                 1165                 1170             


Ile Val  Val Phe Gln Asn Ile  Asp Gly Val Asp Val  Val Leu Phe 
    1175                 1180                 1185             


Ala Leu  Tyr Val Tyr Glu His  Gly Asp Asp Asn Pro  Phe Pro Asn 
    1190                 1195                 1200             


Lys Lys  Thr Val Tyr Val Ser  Tyr Leu Asp Ser Val  His Phe Met 
    1205                 1210                 1215             


Lys Pro  Arg Gln Met Arg Thr  Phe Leu Tyr His Glu  Ile Leu Ile 
    1220                 1225                 1230             


Ser Tyr  Leu Asp Tyr Ala Arg  Gln Lys Gly Phe Leu  Gln Ala Phe 
    1235                 1240                 1245             


Ile Trp  Ala Cys Pro Pro Leu  Lys Gly Asp Asp Tyr  Ile Phe Tyr 
    1250                 1255                 1260             


Ala Lys  Pro Glu Asp Gln Lys  Thr Pro Lys Asp Val  Arg Leu Arg 
    1265                 1270                 1275             


Gln Trp  Tyr Leu Asp Met Leu  Val Glu Cys Gln Lys  Arg Asn Ile 
    1280                 1285                 1290             


Val Gly  Met Val Ser Asn Met  Tyr Asp Gln Tyr Phe  Ala Asn Lys 
    1295                 1300                 1305             


Ser Leu  Asp Ala Ala Ser Val  Pro Tyr Phe Asp Gly  Asp Tyr Phe 
    1310                 1315                 1320             


Pro Gly  Glu Ala Glu Asn Ile  Ile Lys Asp Leu Glu  Glu Ser Asn 
    1325                 1330                 1335             


Ser Lys  Arg Lys Gly Gly Ala  Gly Lys Lys Asn Lys  Asp Pro Ser 
    1340                 1345                 1350             


Lys Ser  Lys Ala Ala Pro Ser  Gly Asp Ala Glu Phe  Val Gly Glu 
    1355                 1360                 1365             


Lys Cys  Tyr Lys Glu Gly Gly  Arg Asp Pro Val Met  Gln Lys Phe 
    1370                 1375                 1380             


Cys Asp  Ala Ile Gln Gly Met  Lys Glu Ser Phe Ile  Val Ala Tyr 
    1385                 1390                 1395             


Leu Asn  Ala Lys Asp Ala Lys  Pro Glu His Leu Val  Val Pro Lys 
    1400                 1405                 1410             


Lys Ile  Met Glu Phe Arg Glu  Ala Asn Lys Leu Leu  Met Ile Asp 
    1415                 1420                 1425             


Asp Asp  Pro Lys Lys Lys Lys  Glu Asp Gly Thr Glu  Glu Lys Lys 
    1430                 1435                 1440             


Asp Asp  Glu Lys Pro Gln Ser  Lys Lys Arg Asp Ala  Asp Gly Tyr 
    1445                 1450                 1455             


Glu Val  Ala Ala Ser Glu Lys  Pro Pro Ala Asn Lys  Gln Leu Asn 
    1460                 1465                 1470             


Ser Lys  Gly Lys Pro Val Arg  Val Leu Asn Asp Asp  Asp Glu Glu 
    1475                 1480                 1485             


Ile Asp  Cys Glu Phe Phe Asn  Thr Arg Gln Cys Phe  Leu Asp Leu 
    1490                 1495                 1500             


Cys Arg  Gly Asn His Tyr Gln  Phe Asp Glu Leu Arg  Arg Ala Lys 
    1505                 1510                 1515             


His Thr  Ser Met Met Val Leu  Trp His Leu Gln Asn  Arg Glu Ala 
    1520                 1525                 1530             


Pro Lys  Phe Val Gln Gln Cys  Met Ala Cys Asn Arg  Glu Ile Ala 
    1535                 1540                 1545             


Ser Gly  Ile Arg His His Cys  Asn Val Cys Ser Asp  Phe Asp Leu 
    1550                 1555                 1560             


Cys Asp  Asp Cys Phe Arg Asp  Pro Asp Thr Asn Arg  Gly Thr Cys 
    1565                 1570                 1575             


Asn His  Lys Leu Glu Ala Ile  Lys Val Asp Thr Ala  Gln Ser Glu 
    1580                 1585                 1590             


Asn Ser  Gly Leu Thr Glu Glu  Gln Arg Lys Glu Arg  Gln Arg Asn 
    1595                 1600                 1605             


Ile Gln  Leu His Ile Thr Leu  Ile Glu His Ala Ser  Arg Cys Asn 
    1610                 1615                 1620             


Ser Ser  Ser Cys Lys Ser Ser  Asn Cys Met Lys Met  Lys Ser Tyr 
    1625                 1630                 1635             


Leu Lys  His Gly Ser Thr Cys  Thr Val Lys Ala Ser  Gly Gly Cys 
    1640                 1645                 1650             


Lys Ile  Cys Lys Arg Ile Trp  Thr Leu Leu Arg Ile  His Ala Gln 
    1655                 1660                 1665             


Gln Cys  Lys Ser Ser Ser Cys  Ala Ile Pro Gln Cys  Ile Ala Ile 
    1670                 1675                 1680             


Arg Lys  Arg Ile Arg Gln Leu  Gln Leu Lys Gln Gln  Ala Met Asp 
    1685                 1690                 1695             


Asp Arg  Arg Arg Gln Glu Met  Asn Arg His Tyr Arg  Met Gly Met 
    1700                 1705                 1710             


Met Ser  Ser Asp Asn 
    1715             


<210>  27
<211>  7281
<212>  DNA
<213>  Phaeodactylum tricornutum


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:28

<400>  27
atgcaatcca atagtggtgg gatgcctgga ggtagtatga acgcaacgtc gatgcaagac       60

atgcaacgtt tgcagctcca aatggcgcag tatcaacagc agcagcaaca acaacaacga      120

caggcgcccg tcggaaacca gctactcctt aacaatcaca acagtgtgtc aaacctaaat      180

atgcagcagc agtttcccag caacacaaac aacgcgccta ccgcatcatt tgtgaacctg      240

tcgacacaat ctggcgccgc aggtcgtatg agtaatccgg cgcttgccat gatgcaacaa      300

cagcagcagg gagttgtgac aggtagcaat ggcgcttcat tgatgaattc cgggggtccc      360

aacgcggctt ccatgtttag ttggaatgga atgcagcagc cacagcaggg tcagaacgcg      420

tcgtcgatgg acgccagcac cggaagtagt gctcgtctca tggctatggc taacatgaat      480

cgtatgagta tagggggagg ggccggtact atttcagggc aggggaatag tatgaatccg      540

tctacgagca caatgccgaa tatgcagact ttacttcagc agcagcaggt gaacgcctct      600

catacaccaa atcagatggg ctttcagcag cagcatcact tgtcggggtc tcagatggga      660

tcgtccacaa atacgaatca caccaacggt ggaggagcac agcagcttat gctacagcag      720

cagatcgcga gtttacagaa gcagatgcaa tttcaacatc aaggtggcat tggaaccgta      780

tcagctatgc agaatccttc tatatccaat gcaactgttg gcagtgcggg tccacgggcg      840

gcaaactcgc tgcaatctca ccagcagcaa cttctgcagc aaatacagca acaacagcat      900

gttgggcccg ggcctccttc tatgcctgcg caacaccagc aaccctatca acaacaccaa      960

atgtctgccg gaatgcagtc tctgcatcag caagactcga ctccacaaaa tatgatgaat     1020

atgcttcaac agcaacctca atctcacgcc agaaataatg ccatggctaa cgtgatgagt     1080

gatcaatcaa gtcagactct tagtcgaagt gggtccttaa atgagcaaca gttgcgaatt     1140

catcagcaga atctgctgcg tgcttcatcc gggcagcaaa ccacggtgtc ggatacgcaa     1200

gaagcggcga agtccgtaag atcgcagcag cagtcaccaa gtcaaccgtc caaacaacat     1260

ggaatgcatc cacaaaatgc aacgtcgtat caaccatcca acaatattgt acagggtcca     1320

tttggcggaa tgcatggctt gcccaatcag catagcatgc aaagtgtgtc caaccaacag     1380

ctaaacaatc atgggaaagc aatgccgatg cattccgatg gtagcactac tttaggcatg     1440

tcctcacatg ggaacaatag catgtacagt gggcaaatga gtggtagtaa cgcttctcag     1500

cagcagggta gcgacgatcc aatttccatt tcacagcaca gtaatttaag tgctggtcag     1560

ctgtcacgca ctcatcaagc gagcagcaat gactctggac aaaagacttt tctggatggt     1620

agctttgctg ggggctggca atctaacgat gatctgccag atcgacgtcg cgttatattt     1680

agcattttag aggtgattcg gcagattcgg cccgacgata cgagcaaaat gtcaaacaaa     1740

ctacctcata tggcaaagag cctggaagag catttgtatc gatcggcaca cagcaaagac     1800

gaatacatgg atttttcaac tctgaaggag cgtttgcaag caattgcgca tggacttgac     1860

ctgcacagag gttcctcttc gccaatggtt tccaagaatc atgatacgac gcacttgccc     1920

cagcaaagta gtaatccaag ctattcaaat attgagtctc agcagaattc tttgcaaatc     1980

ggctttccgc caagcttgac tgcatctggt ccgacaagtc agcagcatca aaatgcgggt     2040

tggacgggtc catatgatgt aagttccaag gatgtgatga aaattcaagg ccaaaacaac     2100

gccgacaatt tagttgtgca gagaaacgca gctagtcagc agagctttgg acgtattgct     2160

ggttcgaata gtcaacatgg aggcattatg tcgggatcaa ataccgctgg tccaaaccac     2220

aacagcggaa tttggccaac gaatatggga tcgtcggaaa gtttgggtca accgagcata     2280

gggaatgtgg caatgaacgg cggctcgcag catcaatcct caatgaatca agggatgaac     2340

gatatggcgt cgatgagtca gacttcgcaa cagaacgatt ttgctgggtc ttccctgttt     2400

attgatcctt tgcaaggctt caattggcag agcggtttcc tttcggactc aaatatgcct     2460

ccccctgtcg ggaatggtat agttaactcg gattatccaa atacacccaa gtaccaggat     2520

ccgggcgtag cgcagaagca gaaggtcata ttgcagcagc aacagcgatt gctgctactt     2580

cggcatgcca gtaaatgcaa ggcgggatca aactgtacga cgaagttctg ttctcagatg     2640

gtgaccttgt ggaagcatat gaagacttgc cgtgataaga attgtaagac ttctcattgc     2700

ttgagcagtc gttgtgtttt gaatcactac cgtatttgca aaaatcaagg caagacgtcg     2760

acttgtgaag tatgcggtcc tgtgatggcg aaaatccgtc aacaggagcg cgacgatggt     2820

actggtgatc ccttggccac cgattcctct gccatgaact atcttcagcc aagcttgaat     2880

gctcttccaa atgtgattcc gacaaaacaa atcggtggtt tgtcacaggt tcgacggagc     2940

gataatattt tggaaaattc ttgtcaaagt gaacaggtcc agctgcagca attgcaggcg     3000

cagcaaatga aacttcaaac acagttggat tcattgaagc agcttcagaa acagcaagag     3060

caattgctcg agcagcagtc gagaatacag gagcaggcgc ataaggtcaa ggacccaagc     3120

tcccagcaag cacaacaatt gcaacaacag cagcttcttc tgcatcagct acagaaacga     3180

tgcgaacaac agcagcttca gctacaacaa gagattcagt cccaatcgag aacagctggt     3240

ttggcccaag ctcaggctca gcaattccaa gcggcagcac agtttcgtac aagtgtacaa     3300

gaggcccaga tgttgcagtc ttcatcacca attattcctg gatcctacgg ggaaccaaca     3360

gagtctaaga aaaagcggca tacggtaaca aaatccaaac gaatttcgtc gaaagggaag     3420

cgtggtggga aagggaaagg acttcgggct gcggttgagg ttctatcatc ccatgatcca     3480

gccgaagata actttgatcc atatgcctcg ccaaaaaaga ggggtctgtc ttcttcttcg     3540

aagccagcgc aaaagaaaag gaaggcaact tccgataaag aggctgaccc aggcgaaagg     3600

gcgacaggaa ctgatattgt ggaagactcg acgctggcgt atgaaggcaa tacgtctttg     3660

cttccgttca tgagtctagt cagcgtcaga aaacatgtgg attctctgaa taaaaaaaca     3720

agtctttggt ctcgcatggt gacttacaag tgtcttccag tcattcaaga gctcattgac     3780

gaccagtttg ggtgggtttt ccacgacgcc gtcgatccaa ttgcacttgg cttgcccgac     3840

tactttgatg ttgtgaaaca tcccatgcat ctcgagcttg tgaagaaaaa actggaaaat     3900

gcgatctact gtgacacaga cagttttgcg catgacgttg agctagtttt tgagaatgct     3960

attttgtaca atggggaaac cagtgaagtt ggagagctag cgaatagttt cttggtcaag     4020

tttgctcaga tatacgagaa gctcattgca ggaatcgagt cgccgcagca actcgtgaaa     4080

aagaatgggg aggcttgtgc tctctgtggt ctccaaaaga gacagcttga gccattatcg     4140

ctttattgtc atgggaactg tggtatgcag cctatcgaaa ggcattcatc ttactttacc     4200

gatcactcaa aatcaaatct ttggtgttta ttgtgttacg atcagttgca cgaagaaaaa     4260

atcatattgc tggacgacgg aagtgatatt agaaaaaagg atttacaaga gttcaagaat     4320

gacacttgtc ctgaggaagc atggatcact tgtgacgagt gtaattctca agttcacgaa     4380

gtttgcgctc ttttcagcag gagaaacgag gcaaaagctt cgtacacctg cccaaactgc     4440

tatacctcga aatctttagc gtcgcaaagc acgaagtctg tggccaagtt tgtaaagggg     4500

gctgattatt taccacactg taaaatgagt attgatatcg aaaagggact tcatagaacg     4560

ctccaagatc tctatgatgc caaagcgaaa gatgaaaaat tgggggccgg ccaaactgag     4620

caagcggagg gtctcactgt tagagtgcta tcaaatgtag aaaagaaaca atctgtagga     4680

gcgaggatgc aacgctgttt ttccgaaaag gggtaccctt tagagtttcc tgtacgctcg     4740

aaatgcattg ccctctttca aaaaatccac ggtgttgaca cccttctttt ttcagtctat     4800

gtgtatgaat acgggcaaga atgtccagct ccgaacaaaa gaagggtgta catttcttgc     4860

ttagattctg ttcaatattt tgagcccagc tgctaccgta aagcggctta ccaggcaatc     4920

attgtcgaat atctgcgtta cgtaaaggag cgaggcttcc atacggctca tatatggagc     4980

tgtcctctga cgcccgaaga cggatacatt ttctattgtc acccatcgca ccaacttata     5040

ccgcgagaag atatgcttca gtcatggtat catcagctac tagaaaaggc gaagtcaagt     5100

ggtgttgcta ttagcaccac cacgctctat cacgagtatt ttgaaggtgg ggctgattct     5160

acgaaaattg agcaacaaag gttgccgacc tgtctcccat attttgaagg tgactacata     5220

cctggtgaaa tcgagaatat cctggaaaca attgatgaaa aagaaaatca gagtagtgtc     5280

cagaaactga tcatgtccct gcttgggcag aggatcatga agatgaaaga caatttcctc     5340

gttgttcatt tacacaatga tggtgttgct gcggctagcg agcaaagcga agacgtttca     5400

aaagggtgtg acggctgcga cgagaaaata gtgctcagca agagatcaag tacaactgaa     5460

ccgggtttga tgcggatcga tgtaagggac gatgatgtag caatgacgga agctgacgct     5520

tttcctgccc gggaggatcc tactgtattg aaaacagctg ctccaccgaa gaaggtaaat     5580

actccggaga aagctacacg ttcaatggga gaggcaacat ccaaatctga aaaaactgaa     5640

gacaagagtg ttccaacacc tggtatgttg ctatttgaaa agcctgggag cgacacaagt     5700

cttgttgatt cagctaaaga cgcagcaaat gagggtgtgg ccccaatatc agtttcaatg     5760

ggagaaccaa cagccgaatc tgaaaagagg aaagatagat atgtttcgac agctattgtt     5820

tgtgagaagc ctaggagtaa cttcagtctg attgaatcaa cgaaagatac agcagaaacc     5880

gctgcggccc ccgattcaat ttcgatagta gattcaaaag ttgattccaa agacacggct     5940

tattcaacaa ctggcgcttt gctttgtggt aagcctggga gcgacataag cccgattgat     6000

tcagccgata acgtcaaaaa tgaaattgag cttcctggtg taagagtagc tggagtgaaa     6060

gaagaaagtg gaagcgaggg attgcgggag aaagtcagcc ttgcgcatac tgtttgcgtt     6120

gtagaattaa aagctaacga tgaacctccg ctagaagaat cgggcggtaa cggaggcctg     6180

acaaacgaaa gcgatggggt cgctgcttca ctcatagaga aacaagctac catccagata     6240

gctggaggga atctttccga aacccaaacg gagccaatcg attcggagga tggatgtatc     6300

gacgattctg tcaacactgc agtccaatct ggcgagttgg atgaaaagga gggaagtgca     6360

acagaacaaa atcgggatga agtgattgcc accatcgaca agaaagcgag caaaaggctt     6420

atggacagcg cgatctcaac ccacactgaa cccaccgaat cttcgagtga aatttcgaca     6480

aaaagtgctc tggcgagcag aagccctctc gtcaatagaa agaggccgct gaattcggtt     6540

gaatccaaca catgggatga agatgctccc attgaaaatg ctttgtttga aaccccacag     6600

catttcttaa atttttgtaa aacaaagcac tttcagtttg atgagcttcg acgagccaaa     6660

cactccactt tgtcgatact ctttcagctg cacaatccta tggcttcaca cgttcttcag     6720

cagtgcggat cgtgctaccg agatataacc tgcgatgcca ggtaccattg caatgtttgc     6780

tccaacttcg acttgtgcca agaatgctac agctcagtaa tgaagaagga gtttgttctg     6840

aatgactccc gcttcgctca tgacacgagc cacacgtttt ctcccattga tacggaaatg     6900

cttgaagaaa cgaaaacacg cgaagaacgt cagaaatcct taacggcgca tgttgaactc     6960

ctggagcacg ctgtaccttg ccaaggccca ccagcatgct ctctggagaa ctgccagcgc     7020

atgaaaaaac tcgtcgagca cgtgggaact tgtatgatcc aaccaaagaa ggactgcaag     7080

atttgcagtc gactcctgtc gctatgtaca atacattcgc gtttgtgcgc tattcgcgga     7140

ccttgtccga ttcccttttg tgaccgaatc cgagagcgca acaaacgact acgccagcag     7200

caagatcttg tggacgaccg gcgccgacaa gctcaaaatg aattgtacca atcctctgaa     7260

gagccatcta taacaacttg a                                               7281


<210>  28
<211>  2426
<212>  PRT
<213>  Phaeodactylum tricornutum


<220>
<221>  misc_feature
<223>  Translation product 332250

<400>  28

Met Gln Ser Asn Ser Gly Gly Met Pro Gly Gly Ser Met Asn Ala Thr 
1               5                   10                  15      


Ser Met Gln Asp Met Gln Arg Leu Gln Leu Gln Met Ala Gln Tyr Gln 
            20                  25                  30          


Gln Gln Gln Gln Gln Gln Gln Arg Gln Ala Pro Val Gly Asn Gln Leu 
        35                  40                  45              


Leu Leu Asn Asn His Asn Ser Val Ser Asn Leu Asn Met Gln Gln Gln 
    50                  55                  60                  


Phe Pro Ser Asn Thr Asn Asn Ala Pro Thr Ala Ser Phe Val Asn Leu 
65                  70                  75                  80  


Ser Thr Gln Ser Gly Ala Ala Gly Arg Met Ser Asn Pro Ala Leu Ala 
                85                  90                  95      


Met Met Gln Gln Gln Gln Gln Gly Val Val Thr Gly Ser Asn Gly Ala 
            100                 105                 110         


Ser Leu Met Asn Ser Gly Gly Pro Asn Ala Ala Ser Met Phe Ser Trp 
        115                 120                 125             


Asn Gly Met Gln Gln Pro Gln Gln Gly Gln Asn Ala Ser Ser Met Asp 
    130                 135                 140                 


Ala Ser Thr Gly Ser Ser Ala Arg Leu Met Ala Met Ala Asn Met Asn 
145                 150                 155                 160 


Arg Met Ser Ile Gly Gly Gly Ala Gly Thr Ile Ser Gly Gln Gly Asn 
                165                 170                 175     


Ser Met Asn Pro Ser Thr Ser Thr Met Pro Asn Met Gln Thr Leu Leu 
            180                 185                 190         


Gln Gln Gln Gln Val Asn Ala Ser His Thr Pro Asn Gln Met Gly Phe 
        195                 200                 205             


Gln Gln Gln His His Leu Ser Gly Ser Gln Met Gly Ser Ser Thr Asn 
    210                 215                 220                 


Thr Asn His Thr Asn Gly Gly Gly Ala Gln Gln Leu Met Leu Gln Gln 
225                 230                 235                 240 


Gln Ile Ala Ser Leu Gln Lys Gln Met Gln Phe Gln His Gln Gly Gly 
                245                 250                 255     


Ile Gly Thr Val Ser Ala Met Gln Asn Pro Ser Ile Ser Asn Ala Thr 
            260                 265                 270         


Val Gly Ser Ala Gly Pro Arg Ala Ala Asn Ser Leu Gln Ser His Gln 
        275                 280                 285             


Gln Gln Leu Leu Gln Gln Ile Gln Gln Gln Gln His Val Gly Pro Gly 
    290                 295                 300                 


Pro Pro Ser Met Pro Ala Gln His Gln Gln Pro Tyr Gln Gln His Gln 
305                 310                 315                 320 


Met Ser Ala Gly Met Gln Ser Leu His Gln Gln Asp Ser Thr Pro Gln 
                325                 330                 335     


Asn Met Met Asn Met Leu Gln Gln Gln Pro Gln Ser His Ala Arg Asn 
            340                 345                 350         


Asn Ala Met Ala Asn Val Met Ser Asp Gln Ser Ser Gln Thr Leu Ser 
        355                 360                 365             


Arg Ser Gly Ser Leu Asn Glu Gln Gln Leu Arg Ile His Gln Gln Asn 
    370                 375                 380                 


Leu Leu Arg Ala Ser Ser Gly Gln Gln Thr Thr Val Ser Asp Thr Gln 
385                 390                 395                 400 


Glu Ala Ala Lys Ser Val Arg Ser Gln Gln Gln Ser Pro Ser Gln Pro 
                405                 410                 415     


Ser Lys Gln His Gly Met His Pro Gln Asn Ala Thr Ser Tyr Gln Pro 
            420                 425                 430         


Ser Asn Asn Ile Val Gln Gly Pro Phe Gly Gly Met His Gly Leu Pro 
        435                 440                 445             


Asn Gln His Ser Met Gln Ser Val Ser Asn Gln Gln Leu Asn Asn His 
    450                 455                 460                 


Gly Lys Ala Met Pro Met His Ser Asp Gly Ser Thr Thr Leu Gly Met 
465                 470                 475                 480 


Ser Ser His Gly Asn Asn Ser Met Tyr Ser Gly Gln Met Ser Gly Ser 
                485                 490                 495     


Asn Ala Ser Gln Gln Gln Gly Ser Asp Asp Pro Ile Ser Ile Ser Gln 
            500                 505                 510         


His Ser Asn Leu Ser Ala Gly Gln Leu Ser Arg Thr His Gln Ala Ser 
        515                 520                 525             


Ser Asn Asp Ser Gly Gln Lys Thr Phe Leu Asp Gly Ser Phe Ala Gly 
    530                 535                 540                 


Gly Trp Gln Ser Asn Asp Asp Leu Pro Asp Arg Arg Arg Val Ile Phe 
545                 550                 555                 560 


Ser Ile Leu Glu Val Ile Arg Gln Ile Arg Pro Asp Asp Thr Ser Lys 
                565                 570                 575     


Met Ser Asn Lys Leu Pro His Met Ala Lys Ser Leu Glu Glu His Leu 
            580                 585                 590         


Tyr Arg Ser Ala His Ser Lys Asp Glu Tyr Met Asp Phe Ser Thr Leu 
        595                 600                 605             


Lys Glu Arg Leu Gln Ala Ile Ala His Gly Leu Asp Leu His Arg Gly 
    610                 615                 620                 


Ser Ser Ser Pro Met Val Ser Lys Asn His Asp Thr Thr His Leu Pro 
625                 630                 635                 640 


Gln Gln Ser Ser Asn Pro Ser Tyr Ser Asn Ile Glu Ser Gln Gln Asn 
                645                 650                 655     


Ser Leu Gln Ile Gly Phe Pro Pro Ser Leu Thr Ala Ser Gly Pro Thr 
            660                 665                 670         


Ser Gln Gln His Gln Asn Ala Gly Trp Thr Gly Pro Tyr Asp Val Ser 
        675                 680                 685             


Ser Lys Asp Val Met Lys Ile Gln Gly Gln Asn Asn Ala Asp Asn Leu 
    690                 695                 700                 


Val Val Gln Arg Asn Ala Ala Ser Gln Gln Ser Phe Gly Arg Ile Ala 
705                 710                 715                 720 


Gly Ser Asn Ser Gln His Gly Gly Ile Met Ser Gly Ser Asn Thr Ala 
                725                 730                 735     


Gly Pro Asn His Asn Ser Gly Ile Trp Pro Thr Asn Met Gly Ser Ser 
            740                 745                 750         


Glu Ser Leu Gly Gln Pro Ser Ile Gly Asn Val Ala Met Asn Gly Gly 
        755                 760                 765             


Ser Gln His Gln Ser Ser Met Asn Gln Gly Met Asn Asp Met Ala Ser 
    770                 775                 780                 


Met Ser Gln Thr Ser Gln Gln Asn Asp Phe Ala Gly Ser Ser Leu Phe 
785                 790                 795                 800 


Ile Asp Pro Leu Gln Gly Phe Asn Trp Gln Ser Gly Phe Leu Ser Asp 
                805                 810                 815     


Ser Asn Met Pro Pro Pro Val Gly Asn Gly Ile Val Asn Ser Asp Tyr 
            820                 825                 830         


Pro Asn Thr Pro Lys Tyr Gln Asp Pro Gly Val Ala Gln Lys Gln Lys 
        835                 840                 845             


Val Ile Leu Gln Gln Gln Gln Arg Leu Leu Leu Leu Arg His Ala Ser 
    850                 855                 860                 


Lys Cys Lys Ala Gly Ser Asn Cys Thr Thr Lys Phe Cys Ser Gln Met 
865                 870                 875                 880 


Val Thr Leu Trp Lys His Met Lys Thr Cys Arg Asp Lys Asn Cys Lys 
                885                 890                 895     


Thr Ser His Cys Leu Ser Ser Arg Cys Val Leu Asn His Tyr Arg Ile 
            900                 905                 910         


Cys Lys Asn Gln Gly Lys Thr Ser Thr Cys Glu Val Cys Gly Pro Val 
        915                 920                 925             


Met Ala Lys Ile Arg Gln Gln Glu Arg Asp Asp Gly Thr Gly Asp Pro 
    930                 935                 940                 


Leu Ala Thr Asp Ser Ser Ala Met Asn Tyr Leu Gln Pro Ser Leu Asn 
945                 950                 955                 960 


Ala Leu Pro Asn Val Ile Pro Thr Lys Gln Ile Gly Gly Leu Ser Gln 
                965                 970                 975     


Val Arg Arg Ser Asp Asn Ile Leu Glu Asn Ser Cys Gln Ser Glu Gln 
            980                 985                 990         


Val Gln Leu Gln Gln Leu Gln Ala  Gln Gln Met Lys Leu  Gln Thr Gln 
        995                 1000                 1005             


Leu Asp  Ser Leu Lys Gln Leu  Gln Lys Gln Gln Glu  Gln Leu Leu 
    1010                 1015                 1020             


Glu Gln  Gln Ser Arg Ile Gln  Glu Gln Ala His Lys  Val Lys Asp 
    1025                 1030                 1035             


Pro Ser  Ser Gln Gln Ala Gln  Gln Leu Gln Gln Gln  Gln Leu Leu 
    1040                 1045                 1050             


Leu His  Gln Leu Gln Lys Arg  Cys Glu Gln Gln Gln  Leu Gln Leu 
    1055                 1060                 1065             


Gln Gln  Glu Ile Gln Ser Gln  Ser Arg Thr Ala Gly  Leu Ala Gln 
    1070                 1075                 1080             


Ala Gln  Ala Gln Gln Phe Gln  Ala Ala Ala Gln Phe  Arg Thr Ser 
    1085                 1090                 1095             


Val Gln  Glu Ala Gln Met Leu  Gln Ser Ser Ser Pro  Ile Ile Pro 
    1100                 1105                 1110             


Gly Ser  Tyr Gly Glu Pro Thr  Glu Ser Lys Lys Lys  Arg His Thr 
    1115                 1120                 1125             


Val Thr  Lys Ser Lys Arg Ile  Ser Ser Lys Gly Lys  Arg Gly Gly 
    1130                 1135                 1140             


Lys Gly  Lys Gly Leu Arg Ala  Ala Val Glu Val Leu  Ser Ser His 
    1145                 1150                 1155             


Asp Pro  Ala Glu Asp Asn Phe  Asp Pro Tyr Ala Ser  Pro Lys Lys 
    1160                 1165                 1170             


Arg Gly  Leu Ser Ser Ser Ser  Lys Pro Ala Gln Lys  Lys Arg Lys 
    1175                 1180                 1185             


Ala Thr  Ser Asp Lys Glu Ala  Asp Pro Gly Glu Arg  Ala Thr Gly 
    1190                 1195                 1200             


Thr Asp  Ile Val Glu Asp Ser  Thr Leu Ala Tyr Glu  Gly Asn Thr 
    1205                 1210                 1215             


Ser Leu  Leu Pro Phe Met Ser  Leu Val Ser Val Arg  Lys His Val 
    1220                 1225                 1230             


Asp Ser  Leu Asn Lys Lys Thr  Ser Leu Trp Ser Arg  Met Val Thr 
    1235                 1240                 1245             


Tyr Lys  Cys Leu Pro Val Ile  Gln Glu Leu Ile Asp  Asp Gln Phe 
    1250                 1255                 1260             


Gly Trp  Val Phe His Asp Ala  Val Asp Pro Ile Ala  Leu Gly Leu 
    1265                 1270                 1275             


Pro Asp  Tyr Phe Asp Val Val  Lys His Pro Met His  Leu Glu Leu 
    1280                 1285                 1290             


Val Lys  Lys Lys Leu Glu Asn  Ala Ile Tyr Cys Asp  Thr Asp Ser 
    1295                 1300                 1305             


Phe Ala  His Asp Val Glu Leu  Val Phe Glu Asn Ala  Ile Leu Tyr 
    1310                 1315                 1320             


Asn Gly  Glu Thr Ser Glu Val  Gly Glu Leu Ala Asn  Ser Phe Leu 
    1325                 1330                 1335             


Val Lys  Phe Ala Gln Ile Tyr  Glu Lys Leu Ile Ala  Gly Ile Glu 
    1340                 1345                 1350             


Ser Pro  Gln Gln Leu Val Lys  Lys Asn Gly Glu Ala  Cys Ala Leu 
    1355                 1360                 1365             


Cys Gly  Leu Gln Lys Arg Gln  Leu Glu Pro Leu Ser  Leu Tyr Cys 
    1370                 1375                 1380             


His Gly  Asn Cys Gly Met Gln  Pro Ile Glu Arg His  Ser Ser Tyr 
    1385                 1390                 1395             


Phe Thr  Asp His Ser Lys Ser  Asn Leu Trp Cys Leu  Leu Cys Tyr 
    1400                 1405                 1410             


Asp Gln  Leu His Glu Glu Lys  Ile Ile Leu Leu Asp  Asp Gly Ser 
    1415                 1420                 1425             


Asp Ile  Arg Lys Lys Asp Leu  Gln Glu Phe Lys Asn  Asp Thr Cys 
    1430                 1435                 1440             


Pro Glu  Glu Ala Trp Ile Thr  Cys Asp Glu Cys Asn  Ser Gln Val 
    1445                 1450                 1455             


His Glu  Val Cys Ala Leu Phe  Ser Arg Arg Asn Glu  Ala Lys Ala 
    1460                 1465                 1470             


Ser Tyr  Thr Cys Pro Asn Cys  Tyr Thr Ser Lys Ser  Leu Ala Ser 
    1475                 1480                 1485             


Gln Ser  Thr Lys Ser Val Ala  Lys Phe Val Lys Gly  Ala Asp Tyr 
    1490                 1495                 1500             


Leu Pro  His Cys Lys Met Ser  Ile Asp Ile Glu Lys  Gly Leu His 
    1505                 1510                 1515             


Arg Thr  Leu Gln Asp Leu Tyr  Asp Ala Lys Ala Lys  Asp Glu Lys 
    1520                 1525                 1530             


Leu Gly  Ala Gly Gln Thr Glu  Gln Ala Glu Gly Leu  Thr Val Arg 
    1535                 1540                 1545             


Val Leu  Ser Asn Val Glu Lys  Lys Gln Ser Val Gly  Ala Arg Met 
    1550                 1555                 1560             


Gln Arg  Cys Phe Ser Glu Lys  Gly Tyr Pro Leu Glu  Phe Pro Val 
    1565                 1570                 1575             


Arg Ser  Lys Cys Ile Ala Leu  Phe Gln Lys Ile His  Gly Val Asp 
    1580                 1585                 1590             


Thr Leu  Leu Phe Ser Val Tyr  Val Tyr Glu Tyr Gly  Gln Glu Cys 
    1595                 1600                 1605             


Pro Ala  Pro Asn Lys Arg Arg  Val Tyr Ile Ser Cys  Leu Asp Ser 
    1610                 1615                 1620             


Val Gln  Tyr Phe Glu Pro Ser  Cys Tyr Arg Lys Ala  Ala Tyr Gln 
    1625                 1630                 1635             


Ala Ile  Ile Val Glu Tyr Leu  Arg Tyr Val Lys Glu  Arg Gly Phe 
    1640                 1645                 1650             


His Thr  Ala His Ile Trp Ser  Cys Pro Leu Thr Pro  Glu Asp Gly 
    1655                 1660                 1665             


Tyr Ile  Phe Tyr Cys His Pro  Ser His Gln Leu Ile  Pro Arg Glu 
    1670                 1675                 1680             


Asp Met  Leu Gln Ser Trp Tyr  His Gln Leu Leu Glu  Lys Ala Lys 
    1685                 1690                 1695             


Ser Ser  Gly Val Ala Ile Ser  Thr Thr Thr Leu Tyr  His Glu Tyr 
    1700                 1705                 1710             


Phe Glu  Gly Gly Ala Asp Ser  Thr Lys Ile Glu Gln  Gln Arg Leu 
    1715                 1720                 1725             


Pro Thr  Cys Leu Pro Tyr Phe  Glu Gly Asp Tyr Ile  Pro Gly Glu 
    1730                 1735                 1740             


Ile Glu  Asn Ile Leu Glu Thr  Ile Asp Glu Lys Glu  Asn Gln Ser 
    1745                 1750                 1755             


Ser Val  Gln Lys Leu Ile Met  Ser Leu Leu Gly Gln  Arg Ile Met 
    1760                 1765                 1770             


Lys Met  Lys Asp Asn Phe Leu  Val Val His Leu His  Asn Asp Gly 
    1775                 1780                 1785             


Val Ala  Ala Ala Ser Glu Gln  Ser Glu Asp Val Ser  Lys Gly Cys 
    1790                 1795                 1800             


Asp Gly  Cys Asp Glu Lys Ile  Val Leu Ser Lys Arg  Ser Ser Thr 
    1805                 1810                 1815             


Thr Glu  Pro Gly Leu Met Arg  Ile Asp Val Arg Asp  Asp Asp Val 
    1820                 1825                 1830             


Ala Met  Thr Glu Ala Asp Ala  Phe Pro Ala Arg Glu  Asp Pro Thr 
    1835                 1840                 1845             


Val Leu  Lys Thr Ala Ala Pro  Pro Lys Lys Val Asn  Thr Pro Glu 
    1850                 1855                 1860             


Lys Ala  Thr Arg Ser Met Gly  Glu Ala Thr Ser Lys  Ser Glu Lys 
    1865                 1870                 1875             


Thr Glu  Asp Lys Ser Val Pro  Thr Pro Gly Met Leu  Leu Phe Glu 
    1880                 1885                 1890             


Lys Pro  Gly Ser Asp Thr Ser  Leu Val Asp Ser Ala  Lys Asp Ala 
    1895                 1900                 1905             


Ala Asn  Glu Gly Val Ala Pro  Ile Ser Val Ser Met  Gly Glu Pro 
    1910                 1915                 1920             


Thr Ala  Glu Ser Glu Lys Arg  Lys Asp Arg Tyr Val  Ser Thr Ala 
    1925                 1930                 1935             


Ile Val  Cys Glu Lys Pro Arg  Ser Asn Phe Ser Leu  Ile Glu Ser 
    1940                 1945                 1950             


Thr Lys  Asp Thr Ala Glu Thr  Ala Ala Ala Pro Asp  Ser Ile Ser 
    1955                 1960                 1965             


Ile Val  Asp Ser Lys Val Asp  Ser Lys Asp Thr Ala  Tyr Ser Thr 
    1970                 1975                 1980             


Thr Gly  Ala Leu Leu Cys Gly  Lys Pro Gly Ser Asp  Ile Ser Pro 
    1985                 1990                 1995             


Ile Asp  Ser Ala Asp Asn Val  Lys Asn Glu Ile Glu  Leu Pro Gly 
    2000                 2005                 2010             


Val Arg  Val Ala Gly Val Lys  Glu Glu Ser Gly Ser  Glu Gly Leu 
    2015                 2020                 2025             


Arg Glu  Lys Val Ser Leu Ala  His Thr Val Cys Val  Val Glu Leu 
    2030                 2035                 2040             


Lys Ala  Asn Asp Glu Pro Pro  Leu Glu Glu Ser Gly  Gly Asn Gly 
    2045                 2050                 2055             


Gly Leu  Thr Asn Glu Ser Asp  Gly Val Ala Ala Ser  Leu Ile Glu 
    2060                 2065                 2070             


Lys Gln  Ala Thr Ile Gln Ile  Ala Gly Gly Asn Leu  Ser Glu Thr 
    2075                 2080                 2085             


Gln Thr  Glu Pro Ile Asp Ser  Glu Asp Gly Cys Ile  Asp Asp Ser 
    2090                 2095                 2100             


Val Asn  Thr Ala Val Gln Ser  Gly Glu Leu Asp Glu  Lys Glu Gly 
    2105                 2110                 2115             


Ser Ala  Thr Glu Gln Asn Arg  Asp Glu Val Ile Ala  Thr Ile Asp 
    2120                 2125                 2130             


Lys Lys  Ala Ser Lys Arg Leu  Met Asp Ser Ala Ile  Ser Thr His 
    2135                 2140                 2145             


Thr Glu  Pro Thr Glu Ser Ser  Ser Glu Ile Ser Thr  Lys Ser Ala 
    2150                 2155                 2160             


Leu Ala  Ser Arg Ser Pro Leu  Val Asn Arg Lys Arg  Pro Leu Asn 
    2165                 2170                 2175             


Ser Val  Glu Ser Asn Thr Trp  Asp Glu Asp Ala Pro  Ile Glu Asn 
    2180                 2185                 2190             


Ala Leu  Phe Glu Thr Pro Gln  His Phe Leu Asn Phe  Cys Lys Thr 
    2195                 2200                 2205             


Lys His  Phe Gln Phe Asp Glu  Leu Arg Arg Ala Lys  His Ser Thr 
    2210                 2215                 2220             


Leu Ser  Ile Leu Phe Gln Leu  His Asn Pro Met Ala  Ser His Val 
    2225                 2230                 2235             


Leu Gln  Gln Cys Gly Ser Cys  Tyr Arg Asp Ile Thr  Cys Asp Ala 
    2240                 2245                 2250             


Arg Tyr  His Cys Asn Val Cys  Ser Asn Phe Asp Leu  Cys Gln Glu 
    2255                 2260                 2265             


Cys Tyr  Ser Ser Val Met Lys  Lys Glu Phe Val Leu  Asn Asp Ser 
    2270                 2275                 2280             


Arg Phe  Ala His Asp Thr Ser  His Thr Phe Ser Pro  Ile Asp Thr 
    2285                 2290                 2295             


Glu Met  Leu Glu Glu Thr Lys  Thr Arg Glu Glu Arg  Gln Lys Ser 
    2300                 2305                 2310             


Leu Thr  Ala His Val Glu Leu  Leu Glu His Ala Val  Pro Cys Gln 
    2315                 2320                 2325             


Gly Pro  Pro Ala Cys Ser Leu  Glu Asn Cys Gln Arg  Met Lys Lys 
    2330                 2335                 2340             


Leu Val  Glu His Val Gly Thr  Cys Met Ile Gln Pro  Lys Lys Asp 
    2345                 2350                 2355             


Cys Lys  Ile Cys Ser Arg Leu  Leu Ser Leu Cys Thr  Ile His Ser 
    2360                 2365                 2370             


Arg Leu  Cys Ala Ile Arg Gly  Pro Cys Pro Ile Pro  Phe Cys Asp 
    2375                 2380                 2385             


Arg Ile  Arg Glu Arg Asn Lys  Arg Leu Arg Gln Gln  Gln Asp Leu 
    2390                 2395                 2400             


Val Asp  Asp Arg Arg Arg Gln  Ala Gln Asn Glu Leu  Tyr Gln Ser 
    2405                 2410                 2415             


Ser Glu  Glu Pro Ser Ile Thr  Thr 
    2420                 2425     


<210>  29
<211>  4812
<212>  DNA
<213>  Phaeodactylum tricornutum


<220>
<221>  misc_feature
<223>  Encodes polypeptide of SEQ ID NO:30

<400>  29
atgcaagcct cgcaagccca actgcagcca caacaggcgc cgccggttgc cgccccgctt       60

ccgtcagcgg cggcgcaaca aggatcggcg gcggcgccgg cagtgtcgca ctactcatca      120

cagcacttgt cacaggtgcc ggctgcgcaa cccggtctgg cgccgcaatc gcaaggagtt      180

gttcatcaac agcgcccagt gtcgcaaggt ataaactata cgacgcaaac gtcacaatcg      240

cagacgccgg ctcagcagca ggctcccccg caacagcatt tccctcacca aggactcaac      300

ggcggatggc aaagcgataa ggactatcaa gagcgccgga aaatgatcgc caagattgtg      360

catctactgc aacagcgcaa gccgaatgct ccgcaggaat ggttgaagaa acttccgcaa      420

atggcgaaaa gattggaaga gtcgctgtat cgcacagcaa cgtcctttga agagtataac      480

gatgccaaca cgttgaagca tcgtttgcag cagttggcgg tgaacattgg ccaaaagacc      540

aagaagctgc aacaacagca ggcgttgttg gctcaacaga gactacagca gcagcaacaa      600

caacaacagg ttcggcagca gtctgataca actcagtaca ctccccaggt accgccatct      660

acttcacagc aagcattaat acagccgcag acgcagcaag gtctaatacc tcaagtgaag      720

agcgccgcgc cacctgcgtc acaagtacaa ggtcaacgga tggtaaacat gtcggaaatc      780

aacccgataa tgggacaacc gacccaacaa cagcaaagcg ccccagctcc acaaccgccg      840

cctctccagc aagtgcagta cacacaaaag ccgcccgtgg cccccatttc agcccccact      900

cctcaggctc ccgcaccggc cgcaaacggc cccaatggcc aggcttctgg tcgacaagtt      960

tcggatcgtc aacaggtgtt acgccatcaa cagcaacggc tgctgctact gcgacacgct     1020

gccaaatgtc agcatgaaga cggaaaatgc ccagtaacgc cgcattgcgc tgggatgaag     1080

agattgtgga agcatattgc cgaatgcaag gatcaaaaat gtcttgtacc gcattgcgtc     1140

agttcccggt acgtgttaag tcattatcac cgatgcaagg acgttcgttg tccagtgtgt     1200

ggcccagtaa gggaagccat tcatcgaagc cacgaaaagc agaaacagat gcaggcccta     1260

aaacagcgac atcagcaggc cgtacagcaa cagggccaac ctcagaatgc gacttcagcg     1320

cccgccgcta tcggtgcttt gccagtcccc gctcctcctg gacatagttt ggaacctgtt     1380

accaagaaac agcgcaccgc gcccattaca gctttgagag ctccgatcat gccagttcag     1440

cgacttcagc agccaccagg tactcgtccg gccgtttcgc atccaaccac agtaagacct     1500

ggatataccg gaagtcaacc tcccatcact tctggtcctg gtggtccacc ggttgcgcaa     1560

gtacctggcc tagcgtttgc gaacggacaa gtagtaatgc cgaaacattc aggaccaaag     1620

ccacaagaag atcacacttt gatcaactgc ttctctgtcc agcaaattga gacgcacata     1680

tcttccttga gcaatgggtt ggtcctgcct ccgcagaaat tgaaaacgaa aggattggac     1740

gctcttaaaa cgctgcagtc gcaccaacat gcgtgggtat tcaacactcc agtggatccc     1800

gtggaactcg gcttgccgga ctactttgag gtcatcaaaa aaccaatgga tctagggaca     1860

ataaggaaga agctcgaaaa tggcgtttat cagaggctgg acgacttcaa agagcatgta     1920

ctgcttacat ttgataacgc catgatgtac aacccggagg gttcggttgt gtataacatg     1980

gctaatgaaa tgaaggtaaa gtttcagagc gacttcgtaa agctcatgga acaactgaac     2040

gccgaagaag atgtcaagcg aaagaacggg gaggcctgtt gtttatgcgg atgtgaaaag     2100

ctgctatttg agcctcctgt attttattgc aacggaataa attgcccttc gaagcgaatt     2160

cggcgaaaca gttattacta cattggaggg aacaaccaat atcactggtg tcaccagtgc     2220

tatcaagaac tccgcgacaa ttcaaccatt gatttaggcg acctttccgt taaaaaagaa     2280

agtctcgtga agaagaagaa tgacgaggtg cacgaagaga gctgggtaca atgcgatcgt     2340

tgtgaaagat gggttcatca gatttgtgct ttatttaaca ctcggcaaaa taaggatcag     2400

cgatccgaat acgcttgtcc gaagtgtaca attgacgaac gaaaggcaaa aggcgagctt     2460

gaggcaaaat cgtcaactcc gatggcagag gacctccctc gtaccaagct gtccgagtac     2520

ttggagaatc atgtgcgtga gaaggtcgat gagttcgttg aacagaggtc gcaggatatg     2580

gttgttgctc aaggttgctc tattgaagaa gccagaagca aacttaagat gggaggtgca     2640

atcactatcc gacaggtaac ttccatggac agacgacttg aggtccgaga tagaatgaag     2700

caacgctatg cattcaaaaa ctacccggaa gaattcaatt ttcggtgtaa atgcatcgtt     2760

gtcttccaga atttggacgg cgttgatgtt gttttgtttg gcctttacgt atacgagcat     2820

gatgagaaaa atcctgcccc caacaagcgg gccgtctatg tgtcctatct cgatagtgtt     2880

cattacatga gaccacgtga tatgcgtact ttcatttacc acgaaatttt aatatcttat     2940

cttgattacg tccggaggcg tggattttcg actgctcaca tttgggcttg tccgccgctt     3000

cgcggagacg actacatcct ttacgcaaaa ccagaggacc agaagacccc gaaagacgat     3060

cgattgcgtc agtggtacat agacatgctg attgaggccc aaaggcgagg gattgttggg     3120

aaacttacca acatgtacga cctctatttt tccaacgaga aaaacgatgc aacggttgtc     3180

ccctacatgg atggtgacta ctttcctgct gaggttgaga atatcatcaa ggatattgag     3240

gaaggcaaga cgggaaagaa aggcagttcg caaggcaaaa agaaaaaaga aaaagccaaa     3300

cagaagaaga agtcaggtcg tggcggaact cggtctacgg gattggatga agacgctctt     3360

aaagcgagcg gatttctgcc acccggtact gattcaaaaa gtctagaaga aggcgctcga     3420

gactacgtca tggtgaaact tggtgagacc atccagccca tgaaggaaag tttcattgtg     3480

gctttcttag gctgggaagg ggcgaaagag ggagacatgg ttgttcccaa tgagatccaa     3540

gagcaccgtg acctgcatga gatcacttgg aaacttaaaa gcagtagcac caaagctgat     3600

acagtggaga ctatcgagaa cgaaagcgat aggcaacagg acgccgagat caaagattct     3660

agggataaaa aaggggacag ttcgataaag ttaaacggta ctacttcaaa gaagccggat     3720

gacacgtcct caagctcagg aaacatcgaa gacactgcca gcacacatag ggctcatgtt     3780

gacacaccga tggaagggat tgtaaaaaat gaatttaccg aaaccaatgg aattttgcaa     3840

tcctcacctc aagagaataa agactctgaa tccatcaacg ctcctgcgct tcgtgttgga     3900

actgaggcta ttgatcgccc ggatgctccg cagtccgcga taactgcagc accaaacact     3960

atttctatcc gagagggaaa attcgctgct atggcggccc ggaaacgtga tagagaaggg     4020

gagccgaaag agcccgagga ggtggaaagt acaagtgaga agacgaagga agaaaagctg     4080

acttccataa cagtgactga tagcaagggc cgtactgtga aagttttgga tgacgacgag     4140

gaggaacttg actgcgagtt tctaaacaat cgacaggcgt tcttaaatct atgtcaagga     4200

aatcactacc agtttgatca cctgcgccgc gcaaagcact cctccatgat ggttttgtgg     4260

caccttcaca acagggatgc accaaaattt gtgcagcaat gtgcgacttg ctccagagaa     4320

cttcttaccg gatatcgctt taattgtcct acatgtgggg atttcgatca gtgccaagac     4380

tgcatttcca acccgaaggt tcctcggcac ccgcatcagc tcaagcctat tccggtggcc     4440

aatgcgcaac aaaacgaatt gacggaagcg caacgcaagg aacgacagcg cagtatccag     4500

cttcatatga ctcttttgct gcatgctgct acgtgtagct cgccgaagtg tccgtcagcc     4560

aattgtacaa agatgaaggg tcttttaaag cacggcgcgc aatgccaagt gaaggccact     4620

ggcggttgca acgtatgcaa gagaatatgg gctttactgc aaattcatgc tcgtcagtgc     4680

aaagcgaagt cttgccctgt tccgaattgt atggcaatcc gtgaaagagt tcgccaattg     4740

aaaaagcaac aacaggcgat ggatgaccgt cgtcgccaag aaatgaatcg agcttacagg     4800

gggaagcgct aa                                                         4812


<210>  30
<211>  1603
<212>  PRT
<213>  Phaeodactylum tricornutum


<220>
<221>  misc_feature
<223>  Translation product 332333

<400>  30

Met Gln Ala Ser Gln Ala Gln Leu Gln Pro Gln Gln Ala Pro Pro Val 
1               5                   10                  15      


Ala Ala Pro Leu Pro Ser Ala Ala Ala Gln Gln Gly Ser Ala Ala Ala 
            20                  25                  30          


Pro Ala Val Ser His Tyr Ser Ser Gln His Leu Ser Gln Val Pro Ala 
        35                  40                  45              


Ala Gln Pro Gly Leu Ala Pro Gln Ser Gln Gly Val Val His Gln Gln 
    50                  55                  60                  


Arg Pro Val Ser Gln Gly Ile Asn Tyr Thr Thr Gln Thr Ser Gln Ser 
65                  70                  75                  80  


Gln Thr Pro Ala Gln Gln Gln Ala Pro Pro Gln Gln His Phe Pro His 
                85                  90                  95      


Gln Gly Leu Asn Gly Gly Trp Gln Ser Asp Lys Asp Tyr Gln Glu Arg 
            100                 105                 110         


Arg Lys Met Ile Ala Lys Ile Val His Leu Leu Gln Gln Arg Lys Pro 
        115                 120                 125             


Asn Ala Pro Gln Glu Trp Leu Lys Lys Leu Pro Gln Met Ala Lys Arg 
    130                 135                 140                 


Leu Glu Glu Ser Leu Tyr Arg Thr Ala Thr Ser Phe Glu Glu Tyr Asn 
145                 150                 155                 160 


Asp Ala Asn Thr Leu Lys His Arg Leu Gln Gln Leu Ala Val Asn Ile 
                165                 170                 175     


Gly Gln Lys Thr Lys Lys Leu Gln Gln Gln Gln Ala Leu Leu Ala Gln 
            180                 185                 190         


Gln Arg Leu Gln Gln Gln Gln Gln Gln Gln Gln Val Arg Gln Gln Ser 
        195                 200                 205             


Asp Thr Thr Gln Tyr Thr Pro Gln Val Pro Pro Ser Thr Ser Gln Gln 
    210                 215                 220                 


Ala Leu Ile Gln Pro Gln Thr Gln Gln Gly Leu Ile Pro Gln Val Lys 
225                 230                 235                 240 


Ser Ala Ala Pro Pro Ala Ser Gln Val Gln Gly Gln Arg Met Val Asn 
                245                 250                 255     


Met Ser Glu Ile Asn Pro Ile Met Gly Gln Pro Thr Gln Gln Gln Gln 
            260                 265                 270         


Ser Ala Pro Ala Pro Gln Pro Pro Pro Leu Gln Gln Val Gln Tyr Thr 
        275                 280                 285             


Gln Lys Pro Pro Val Ala Pro Ile Ser Ala Pro Thr Pro Gln Ala Pro 
    290                 295                 300                 


Ala Pro Ala Ala Asn Gly Pro Asn Gly Gln Ala Ser Gly Arg Gln Val 
305                 310                 315                 320 


Ser Asp Arg Gln Gln Val Leu Arg His Gln Gln Gln Arg Leu Leu Leu 
                325                 330                 335     


Leu Arg His Ala Ala Lys Cys Gln His Glu Asp Gly Lys Cys Pro Val 
            340                 345                 350         


Thr Pro His Cys Ala Gly Met Lys Arg Leu Trp Lys His Ile Ala Glu 
        355                 360                 365             


Cys Lys Asp Gln Lys Cys Leu Val Pro His Cys Val Ser Ser Arg Tyr 
    370                 375                 380                 


Val Leu Ser His Tyr His Arg Cys Lys Asp Val Arg Cys Pro Val Cys 
385                 390                 395                 400 


Gly Pro Val Arg Glu Ala Ile His Arg Ser His Glu Lys Gln Lys Gln 
                405                 410                 415     


Met Gln Ala Leu Lys Gln Arg His Gln Gln Ala Val Gln Gln Gln Gly 
            420                 425                 430         


Gln Pro Gln Asn Ala Thr Ser Ala Pro Ala Ala Ile Gly Ala Leu Pro 
        435                 440                 445             


Val Pro Ala Pro Pro Gly His Ser Leu Glu Pro Val Thr Lys Lys Gln 
    450                 455                 460                 


Arg Thr Ala Pro Ile Thr Ala Leu Arg Ala Pro Ile Met Pro Val Gln 
465                 470                 475                 480 


Arg Leu Gln Gln Pro Pro Gly Thr Arg Pro Ala Val Ser His Pro Thr 
                485                 490                 495     


Thr Val Arg Pro Gly Tyr Thr Gly Ser Gln Pro Pro Ile Thr Ser Gly 
            500                 505                 510         


Pro Gly Gly Pro Pro Val Ala Gln Val Pro Gly Leu Ala Phe Ala Asn 
        515                 520                 525             


Gly Gln Val Val Met Pro Lys His Ser Gly Pro Lys Pro Gln Glu Asp 
    530                 535                 540                 


His Thr Leu Ile Asn Cys Phe Ser Val Gln Gln Ile Glu Thr His Ile 
545                 550                 555                 560 


Ser Ser Leu Ser Asn Gly Leu Val Leu Pro Pro Gln Lys Leu Lys Thr 
                565                 570                 575     


Lys Gly Leu Asp Ala Leu Lys Thr Leu Gln Ser His Gln His Ala Trp 
            580                 585                 590         


Val Phe Asn Thr Pro Val Asp Pro Val Glu Leu Gly Leu Pro Asp Tyr 
        595                 600                 605             


Phe Glu Val Ile Lys Lys Pro Met Asp Leu Gly Thr Ile Arg Lys Lys 
    610                 615                 620                 


Leu Glu Asn Gly Val Tyr Gln Arg Leu Asp Asp Phe Lys Glu His Val 
625                 630                 635                 640 


Leu Leu Thr Phe Asp Asn Ala Met Met Tyr Asn Pro Glu Gly Ser Val 
                645                 650                 655     


Val Tyr Asn Met Ala Asn Glu Met Lys Val Lys Phe Gln Ser Asp Phe 
            660                 665                 670         


Val Lys Leu Met Glu Gln Leu Asn Ala Glu Glu Asp Val Lys Arg Lys 
        675                 680                 685             


Asn Gly Glu Ala Cys Cys Leu Cys Gly Cys Glu Lys Leu Leu Phe Glu 
    690                 695                 700                 


Pro Pro Val Phe Tyr Cys Asn Gly Ile Asn Cys Pro Ser Lys Arg Ile 
705                 710                 715                 720 


Arg Arg Asn Ser Tyr Tyr Tyr Ile Gly Gly Asn Asn Gln Tyr His Trp 
                725                 730                 735     


Cys His Gln Cys Tyr Gln Glu Leu Arg Asp Asn Ser Thr Ile Asp Leu 
            740                 745                 750         


Gly Asp Leu Ser Val Lys Lys Glu Ser Leu Val Lys Lys Lys Asn Asp 
        755                 760                 765             


Glu Val His Glu Glu Ser Trp Val Gln Cys Asp Arg Cys Glu Arg Trp 
    770                 775                 780                 


Val His Gln Ile Cys Ala Leu Phe Asn Thr Arg Gln Asn Lys Asp Gln 
785                 790                 795                 800 


Arg Ser Glu Tyr Ala Cys Pro Lys Cys Thr Ile Asp Glu Arg Lys Ala 
                805                 810                 815     


Lys Gly Glu Leu Glu Ala Lys Ser Ser Thr Pro Met Ala Glu Asp Leu 
            820                 825                 830         


Pro Arg Thr Lys Leu Ser Glu Tyr Leu Glu Asn His Val Arg Glu Lys 
        835                 840                 845             


Val Asp Glu Phe Val Glu Gln Arg Ser Gln Asp Met Val Val Ala Gln 
    850                 855                 860                 


Gly Cys Ser Ile Glu Glu Ala Arg Ser Lys Leu Lys Met Gly Gly Ala 
865                 870                 875                 880 


Ile Thr Ile Arg Gln Val Thr Ser Met Asp Arg Arg Leu Glu Val Arg 
                885                 890                 895     


Asp Arg Met Lys Gln Arg Tyr Ala Phe Lys Asn Tyr Pro Glu Glu Phe 
            900                 905                 910         


Asn Phe Arg Cys Lys Cys Ile Val Val Phe Gln Asn Leu Asp Gly Val 
        915                 920                 925             


Asp Val Val Leu Phe Gly Leu Tyr Val Tyr Glu His Asp Glu Lys Asn 
    930                 935                 940                 


Pro Ala Pro Asn Lys Arg Ala Val Tyr Val Ser Tyr Leu Asp Ser Val 
945                 950                 955                 960 


His Tyr Met Arg Pro Arg Asp Met Arg Thr Phe Ile Tyr His Glu Ile 
                965                 970                 975     


Leu Ile Ser Tyr Leu Asp Tyr Val Arg Arg Arg Gly Phe Ser Thr Ala 
            980                 985                 990         


His Ile Trp Ala Cys Pro Pro Leu  Arg Gly Asp Asp Tyr  Ile Leu Tyr 
        995                 1000                 1005             


Ala Lys  Pro Glu Asp Gln Lys  Thr Pro Lys Asp Asp  Arg Leu Arg 
    1010                 1015                 1020             


Gln Trp  Tyr Ile Asp Met Leu  Ile Glu Ala Gln Arg  Arg Gly Ile 
    1025                 1030                 1035             


Val Gly  Lys Leu Thr Asn Met  Tyr Asp Leu Tyr Phe  Ser Asn Glu 
    1040                 1045                 1050             


Lys Asn  Asp Ala Thr Val Val  Pro Tyr Met Asp Gly  Asp Tyr Phe 
    1055                 1060                 1065             


Pro Ala  Glu Val Glu Asn Ile  Ile Lys Asp Ile Glu  Glu Gly Lys 
    1070                 1075                 1080             


Thr Gly  Lys Lys Gly Ser Ser  Gln Gly Lys Lys Lys  Lys Glu Lys 
    1085                 1090                 1095             


Ala Lys  Gln Lys Lys Lys Ser  Gly Arg Gly Gly Thr  Arg Ser Thr 
    1100                 1105                 1110             


Gly Leu  Asp Glu Asp Ala Leu  Lys Ala Ser Gly Phe  Leu Pro Pro 
    1115                 1120                 1125             


Gly Thr  Asp Ser Lys Ser Leu  Glu Glu Gly Ala Arg  Asp Tyr Val 
    1130                 1135                 1140             


Met Val  Lys Leu Gly Glu Thr  Ile Gln Pro Met Lys  Glu Ser Phe 
    1145                 1150                 1155             


Ile Val  Ala Phe Leu Gly Trp  Glu Gly Ala Lys Glu  Gly Asp Met 
    1160                 1165                 1170             


Val Val  Pro Asn Glu Ile Gln  Glu His Arg Asp Leu  His Glu Ile 
    1175                 1180                 1185             


Thr Trp  Lys Leu Lys Ser Ser  Ser Thr Lys Ala Asp  Thr Val Glu 
    1190                 1195                 1200             


Thr Ile  Glu Asn Glu Ser Asp  Arg Gln Gln Asp Ala  Glu Ile Lys 
    1205                 1210                 1215             


Asp Ser  Arg Asp Lys Lys Gly  Asp Ser Ser Ile Lys  Leu Asn Gly 
    1220                 1225                 1230             


Thr Thr  Ser Lys Lys Pro Asp  Asp Thr Ser Ser Ser  Ser Gly Asn 
    1235                 1240                 1245             


Ile Glu  Asp Thr Ala Ser Thr  His Arg Ala His Val  Asp Thr Pro 
    1250                 1255                 1260             


Met Glu  Gly Ile Val Lys Asn  Glu Phe Thr Glu Thr  Asn Gly Ile 
    1265                 1270                 1275             


Leu Gln  Ser Ser Pro Gln Glu  Asn Lys Asp Ser Glu  Ser Ile Asn 
    1280                 1285                 1290             


Ala Pro  Ala Leu Arg Val Gly  Thr Glu Ala Ile Asp  Arg Pro Asp 
    1295                 1300                 1305             


Ala Pro  Gln Ser Ala Ile Thr  Ala Ala Pro Asn Thr  Ile Ser Ile 
    1310                 1315                 1320             


Arg Glu  Gly Lys Phe Ala Ala  Met Ala Ala Arg Lys  Arg Asp Arg 
    1325                 1330                 1335             


Glu Gly  Glu Pro Lys Glu Pro  Glu Glu Val Glu Ser  Thr Ser Glu 
    1340                 1345                 1350             


Lys Thr  Lys Glu Glu Lys Leu  Thr Ser Ile Thr Val  Thr Asp Ser 
    1355                 1360                 1365             


Lys Gly  Arg Thr Val Lys Val  Leu Asp Asp Asp Glu  Glu Glu Leu 
    1370                 1375                 1380             


Asp Cys  Glu Phe Leu Asn Asn  Arg Gln Ala Phe Leu  Asn Leu Cys 
    1385                 1390                 1395             


Gln Gly  Asn His Tyr Gln Phe  Asp His Leu Arg Arg  Ala Lys His 
    1400                 1405                 1410             


Ser Ser  Met Met Val Leu Trp  His Leu His Asn Arg  Asp Ala Pro 
    1415                 1420                 1425             


Lys Phe  Val Gln Gln Cys Ala  Thr Cys Ser Arg Glu  Leu Leu Thr 
    1430                 1435                 1440             


Gly Tyr  Arg Phe Asn Cys Pro  Thr Cys Gly Asp Phe  Asp Gln Cys 
    1445                 1450                 1455             


Gln Asp  Cys Ile Ser Asn Pro  Lys Val Pro Arg His  Pro His Gln 
    1460                 1465                 1470             


Leu Lys  Pro Ile Pro Val Ala  Asn Ala Gln Gln Asn  Glu Leu Thr 
    1475                 1480                 1485             


Glu Ala  Gln Arg Lys Glu Arg  Gln Arg Ser Ile Gln  Leu His Met 
    1490                 1495                 1500             


Thr Leu  Leu Leu His Ala Ala  Thr Cys Ser Ser Pro  Lys Cys Pro 
    1505                 1510                 1515             


Ser Ala  Asn Cys Thr Lys Met  Lys Gly Leu Leu Lys  His Gly Ala 
    1520                 1525                 1530             


Gln Cys  Gln Val Lys Ala Thr  Gly Gly Cys Asn Val  Cys Lys Arg 
    1535                 1540                 1545             


Ile Trp  Ala Leu Leu Gln Ile  His Ala Arg Gln Cys  Lys Ala Lys 
    1550                 1555                 1560             


Ser Cys  Pro Val Pro Asn Cys  Met Ala Ile Arg Glu  Arg Val Arg 
    1565                 1570                 1575             


Gln Leu  Lys Lys Gln Gln Gln  Ala Met Asp Asp Arg  Arg Arg Gln 
    1580                 1585                 1590             


Glu Met  Asn Arg Ala Tyr Arg  Gly Lys Arg 
    1595                 1600             


<210>  31
<211>  3171
<212>  DNA
<213>  Phaeodactylum tricornutum


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:32

<400>  31
atgaagcgac tttggagaca tattgcaaat tgtaaggatc aggactgctc tgttcaacat       60

tgtcttagta gtcgcggcgt tctcagccat tatcgacggt gtaaagatgc gctctgtcct      120

gcatgtgggc ctgtccgaga aactatacgg aaaagtcatg agatggaaag tcaaagcaat      180

ccacaagggg taccgtccga caatcggttt atgggtcgag atgattcgtt cggtcggtca      240

agttccgtta catcgccaac cgaacaggaa ccgaagcgta tgagaacaga acatcgccct      300

agcgcggcgt ctataaaatc agcgcgctct acgcctgtga gcgcgcctcc tttgaagcaa      360

gaaccccctc gaagcatagg caaaggtgag aaagtagctc catctgctga aaaagattcg      420

aaaggaagtg tcgaccgatc actcctcgag agtttctcgg tgaaggagct cgaaactcat      480

ttgcgatcgc tggaacgaga gacccaactt cctccggcga agctcaagtc taaatgtctg      540

gatgtattaa agggtttaat ggctcaccaa cacggttggg ttttcaatgg tccagtcgat      600

ccagttgagc tcggtcttgt tgattatttt gaaattatca agaagcccat ggacctcggc      660

accattcaaa agcgtttgga aagtagtgca taccactcca tcgatgactt taaaacggat      720

atcttcttaa cttttgagaa tgcaatggtg tataatgagg atggttccgt tgtctacgac      780

atggcgaagc agctgaaggt taaagccgaa tctgacatga agagacttgt ggcacaactg      840

gaaacagaag accttgaaag acgccagaat gaacgcgcgt gcaccttgtg tggttcagag      900

aaactgttgt ttgaacctcc tgtttatttt tgtaacggaa ttaattgtca atcgcagcgg      960

atccgacgaa acagtcactt ctatatcgga ggaaacaacc aatacttttg gtgtagccct     1020

tgctttaatg aacttgatga taaaattccg attgagcttg ccgacttgac agtcatgaaa     1080

aacaatctga agaagaaaaa gaatgacgag attcacgagg agagctgggt acagtgtgac     1140

acttgcgaac ggtgggttca ccagatatgt ggacttttta acacccgtca gaataaagag     1200

caccacagcg agtactgttg tcctaaatgt ttgcttgaaa aacgcaaaac tgtttcaata     1260

actccagcgc cgaagccatt gctggctgcg gacttgccgc ggactacttt atcggagtgg     1320

ctagaacgca gtgtcactaa gaaagtggaa aaaaggaaga gagaactggc cgaagagcgt     1380

tcgcagaatg aggggatatc tcttgaagaa gctttgcgac aggtagaaag tggcggccca     1440

ataataattc gtcaagttac cgcgatggat agaaagcttg aggttcgcga gctgatgaaa     1500

aagcgatatg cacacaagaa ttatcctgac gaatttccct ttcggtgcaa atcgattgtc     1560

gtttttcagc atcttgacgg agttgatgtc attctgtttg cgttgtatct ctacgaacac     1620

ggtgaagaca atcctccgcc caaccaacga accgtgtaca tctcatatct ggacagtgtt     1680

cactttatga ggcctcgcaa actccggacc tttgtgtacc atgagattct gattgcctat     1740

ttggactacg ctaggcgacg gggatttgca actgctcata tttgggcatg cccacctttg     1800

aagggtgacg attacatttt ctacgctaaa ccagaagacc agaagactcc gagagattca     1860

cgactgcgcc tttggtacat tgacatgctc gtagaatgtc aaaaaaggag tatcgtcggc     1920

aaagtaacga atatgtacga tatttatttc gcagacccga atttggacgc cactgctgtt     1980

ccctatttgg agggcgacta ttttcctggt gaagcggaga atattataaa aatgctcgaa     2040

gaaggtggag gcaagaaact tgggtcagtg gggaaaaaga agaaaagcaa atcgtcgaaa     2100

gcgcagaaga ataagggagg aaatacgggt actagatcca ctggagtcga cgaagaagcg     2160

cttattgcga gtggtattct ggatggaacc aagagtttaa aggaccttga tcgtgatcag     2220

gtcatggtga agctgggtga aacgattcag cctatgaagg aaagttttat agtagcgttc     2280

ttaaattgga aagatgctcg cgaagaagat atgatagtcc cagaagaaat cgaaatggct     2340

aggattgaat acgcagcgaa aggtgatcca gagcttgttg gaagcaaacg tgatgctgct     2400

ggaaacatga gagacgctac gtcgaagacg ggcgcgaatg gagagcctgt aaaggttatt     2460

gatgacgacg ctgaagatct agattgcgag tttttgaaca atcgccaagc attcttgaat     2520

ctttgtcgag gaaaccatta tcaatttgac gagctccggc gagcaaagca tacttcattg     2580

atgctccttt ggcatctaca taacagagat gcaccaaaat ttgtgcagca gtgcgtttct     2640

tgcagtcgcg aaatcctcag tggcaaacgt tttcactgcg acacgtgccc tgactatgat     2700

ctctgtcaag attgctacaa agaccctaag gcaaacagag gtaactgtac gcacgctctt     2760

aaaccactcg ccgttgaagc tgattccgga caggatcgca gtgggctatc agagcaagaa     2820

cgcatgcaac gccagcgaaa cctgttgtta cacattcaac ttatcgaaca cgcttcaagg     2880

tgttcctctc agacatgttc ttcattaaat tgcgcaaaaa tgaaaaaata tctgcagcat     2940

gctcgtgtct gcaaggttaa agtattagga gggtgcaaga tttgcaaaaa gatctggacc     3000

ttactccgaa ttcatgcgca gaaatgtaag gatacaaatt gccccattcc acaatgcaat     3060

gcgattcgtg agaagatgag gcaactgcaa aagcagcagc aggctatgga cgaccggcgc     3120

cgtctggaaa tgaatcgtca catgcgtttc tccaccgcag gaggctcttg a              3171


<210>  32
<211>  1056
<212>  PRT
<213>  Phaeodactylum tricornutum


<220>
<221>  misc_feature
<223>  translation product 332603

<400>  32

Met Lys Arg Leu Trp Arg His Ile Ala Asn Cys Lys Asp Gln Asp Cys 
1               5                   10                  15      


Ser Val Gln His Cys Leu Ser Ser Arg Gly Val Leu Ser His Tyr Arg 
            20                  25                  30          


Arg Cys Lys Asp Ala Leu Cys Pro Ala Cys Gly Pro Val Arg Glu Thr 
        35                  40                  45              


Ile Arg Lys Ser His Glu Met Glu Ser Gln Ser Asn Pro Gln Gly Val 
    50                  55                  60                  


Pro Ser Asp Asn Arg Phe Met Gly Arg Asp Asp Ser Phe Gly Arg Ser 
65                  70                  75                  80  


Ser Ser Val Thr Ser Pro Thr Glu Gln Glu Pro Lys Arg Met Arg Thr 
                85                  90                  95      


Glu His Arg Pro Ser Ala Ala Ser Ile Lys Ser Ala Arg Ser Thr Pro 
            100                 105                 110         


Val Ser Ala Pro Pro Leu Lys Gln Glu Pro Pro Arg Ser Ile Gly Lys 
        115                 120                 125             


Gly Glu Lys Val Ala Pro Ser Ala Glu Lys Asp Ser Lys Gly Ser Val 
    130                 135                 140                 


Asp Arg Ser Leu Leu Glu Ser Phe Ser Val Lys Glu Leu Glu Thr His 
145                 150                 155                 160 


Leu Arg Ser Leu Glu Arg Glu Thr Gln Leu Pro Pro Ala Lys Leu Lys 
                165                 170                 175     


Ser Lys Cys Leu Asp Val Leu Lys Gly Leu Met Ala His Gln His Gly 
            180                 185                 190         


Trp Val Phe Asn Gly Pro Val Asp Pro Val Glu Leu Gly Leu Val Asp 
        195                 200                 205             


Tyr Phe Glu Ile Ile Lys Lys Pro Met Asp Leu Gly Thr Ile Gln Lys 
    210                 215                 220                 


Arg Leu Glu Ser Ser Ala Tyr His Ser Ile Asp Asp Phe Lys Thr Asp 
225                 230                 235                 240 


Ile Phe Leu Thr Phe Glu Asn Ala Met Val Tyr Asn Glu Asp Gly Ser 
                245                 250                 255     


Val Val Tyr Asp Met Ala Lys Gln Leu Lys Val Lys Ala Glu Ser Asp 
            260                 265                 270         


Met Lys Arg Leu Val Ala Gln Leu Glu Thr Glu Asp Leu Glu Arg Arg 
        275                 280                 285             


Gln Asn Glu Arg Ala Cys Thr Leu Cys Gly Ser Glu Lys Leu Leu Phe 
    290                 295                 300                 


Glu Pro Pro Val Tyr Phe Cys Asn Gly Ile Asn Cys Gln Ser Gln Arg 
305                 310                 315                 320 


Ile Arg Arg Asn Ser His Phe Tyr Ile Gly Gly Asn Asn Gln Tyr Phe 
                325                 330                 335     


Trp Cys Ser Pro Cys Phe Asn Glu Leu Asp Asp Lys Ile Pro Ile Glu 
            340                 345                 350         


Leu Ala Asp Leu Thr Val Met Lys Asn Asn Leu Lys Lys Lys Lys Asn 
        355                 360                 365             


Asp Glu Ile His Glu Glu Ser Trp Val Gln Cys Asp Thr Cys Glu Arg 
    370                 375                 380                 


Trp Val His Gln Ile Cys Gly Leu Phe Asn Thr Arg Gln Asn Lys Glu 
385                 390                 395                 400 


His His Ser Glu Tyr Cys Cys Pro Lys Cys Leu Leu Glu Lys Arg Lys 
                405                 410                 415     


Thr Val Ser Ile Thr Pro Ala Pro Lys Pro Leu Leu Ala Ala Asp Leu 
            420                 425                 430         


Pro Arg Thr Thr Leu Ser Glu Trp Leu Glu Arg Ser Val Thr Lys Lys 
        435                 440                 445             


Val Glu Lys Arg Lys Arg Glu Leu Ala Glu Glu Arg Ser Gln Asn Glu 
    450                 455                 460                 


Gly Ile Ser Leu Glu Glu Ala Leu Arg Gln Val Glu Ser Gly Gly Pro 
465                 470                 475                 480 


Ile Ile Ile Arg Gln Val Thr Ala Met Asp Arg Lys Leu Glu Val Arg 
                485                 490                 495     


Glu Leu Met Lys Lys Arg Tyr Ala His Lys Asn Tyr Pro Asp Glu Phe 
            500                 505                 510         


Pro Phe Arg Cys Lys Ser Ile Val Val Phe Gln His Leu Asp Gly Val 
        515                 520                 525             


Asp Val Ile Leu Phe Ala Leu Tyr Leu Tyr Glu His Gly Glu Asp Asn 
    530                 535                 540                 


Pro Pro Pro Asn Gln Arg Thr Val Tyr Ile Ser Tyr Leu Asp Ser Val 
545                 550                 555                 560 


His Phe Met Arg Pro Arg Lys Leu Arg Thr Phe Val Tyr His Glu Ile 
                565                 570                 575     


Leu Ile Ala Tyr Leu Asp Tyr Ala Arg Arg Arg Gly Phe Ala Thr Ala 
            580                 585                 590         


His Ile Trp Ala Cys Pro Pro Leu Lys Gly Asp Asp Tyr Ile Phe Tyr 
        595                 600                 605             


Ala Lys Pro Glu Asp Gln Lys Thr Pro Arg Asp Ser Arg Leu Arg Leu 
    610                 615                 620                 


Trp Tyr Ile Asp Met Leu Val Glu Cys Gln Lys Arg Ser Ile Val Gly 
625                 630                 635                 640 


Lys Val Thr Asn Met Tyr Asp Ile Tyr Phe Ala Asp Pro Asn Leu Asp 
                645                 650                 655     


Ala Thr Ala Val Pro Tyr Leu Glu Gly Asp Tyr Phe Pro Gly Glu Ala 
            660                 665                 670         


Glu Asn Ile Ile Lys Met Leu Glu Glu Gly Gly Gly Lys Lys Leu Gly 
        675                 680                 685             


Ser Val Gly Lys Lys Lys Lys Ser Lys Ser Ser Lys Ala Gln Lys Asn 
    690                 695                 700                 


Lys Gly Gly Asn Thr Gly Thr Arg Ser Thr Gly Val Asp Glu Glu Ala 
705                 710                 715                 720 


Leu Ile Ala Ser Gly Ile Leu Asp Gly Thr Lys Ser Leu Lys Asp Leu 
                725                 730                 735     


Asp Arg Asp Gln Val Met Val Lys Leu Gly Glu Thr Ile Gln Pro Met 
            740                 745                 750         


Lys Glu Ser Phe Ile Val Ala Phe Leu Asn Trp Lys Asp Ala Arg Glu 
        755                 760                 765             


Glu Asp Met Ile Val Pro Glu Glu Ile Glu Met Ala Arg Ile Glu Tyr 
    770                 775                 780                 


Ala Ala Lys Gly Asp Pro Glu Leu Val Gly Ser Lys Arg Asp Ala Ala 
785                 790                 795                 800 


Gly Asn Met Arg Asp Ala Thr Ser Lys Thr Gly Ala Asn Gly Glu Pro 
                805                 810                 815     


Val Lys Val Ile Asp Asp Asp Ala Glu Asp Leu Asp Cys Glu Phe Leu 
            820                 825                 830         


Asn Asn Arg Gln Ala Phe Leu Asn Leu Cys Arg Gly Asn His Tyr Gln 
        835                 840                 845             


Phe Asp Glu Leu Arg Arg Ala Lys His Thr Ser Leu Met Leu Leu Trp 
    850                 855                 860                 


His Leu His Asn Arg Asp Ala Pro Lys Phe Val Gln Gln Cys Val Ser 
865                 870                 875                 880 


Cys Ser Arg Glu Ile Leu Ser Gly Lys Arg Phe His Cys Asp Thr Cys 
                885                 890                 895     


Pro Asp Tyr Asp Leu Cys Gln Asp Cys Tyr Lys Asp Pro Lys Ala Asn 
            900                 905                 910         


Arg Gly Asn Cys Thr His Ala Leu Lys Pro Leu Ala Val Glu Ala Asp 
        915                 920                 925             


Ser Gly Gln Asp Arg Ser Gly Leu Ser Glu Gln Glu Arg Met Gln Arg 
    930                 935                 940                 


Gln Arg Asn Leu Leu Leu His Ile Gln Leu Ile Glu His Ala Ser Arg 
945                 950                 955                 960 


Cys Ser Ser Gln Thr Cys Ser Ser Leu Asn Cys Ala Lys Met Lys Lys 
                965                 970                 975     


Tyr Leu Gln His Ala Arg Val Cys Lys Val Lys Val Leu Gly Gly Cys 
            980                 985                 990         


Lys Ile Cys Lys Lys Ile Trp Thr  Leu Leu Arg Ile His  Ala Gln Lys 
        995                 1000                 1005             


Cys Lys  Asp Thr Asn Cys Pro  Ile Pro Gln Cys Asn  Ala Ile Arg 
    1010                 1015                 1020             


Glu Lys  Met Arg Gln Leu Gln  Lys Gln Gln Gln Ala  Met Asp Asp 
    1025                 1030                 1035             


Arg Arg  Arg Leu Glu Met Asn  Arg His Met Arg Phe  Ser Thr Ala 
    1040                 1045                 1050             


Gly Gly  Ser 
    1055     


<210>  33
<211>  4614
<212>  DNA
<213>  Navicula WT0229


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:34

<400>  33
atgagtactc aacaacagca gccaccccca cagcctcctc ctccgccaca agcacagggc       60

atgggcggtg ggagttggca aagtgatcgg gatatacctc accgaaggga aatgatacaa      120

cacattatta agttgctcaa gaaggataga agtgggtcac ctgaatcact gaacaggctt      180

ccacaaatgg cgaaacattt ggaagtatcg ctctatcgga acgctccgtc atttgaggct      240

ttcgtcgata tgtcaactct caagcagcgt ttgcaccgaa ttgctgccga ggtatcgcgg      300

cggtctcgct ctcaaaatga ttccagacgt gacgattcga tgcgacctca aagtgttgac      360

cattcgttac cgtcgtttag tcaaaacggt atgcgaggag ggtcgtcatc gccatatatg      420

ggtggaatga gtgctggcag tggacacatg aacagtggaa gcatgaacaa tggaagcatg      480

aacagtggaa gaatggtcaa catggaagat atcaacccga tgtctaatgg agtcagctcc      540

caggtctacc accagcagcc tcaaagaaac gatcgaatga gccagcatca gccgccgcca      600

cagcaaccgc agcagcgtcc aaacctgcag cctccggcga tgcaacctca aggtcaaaac      660

cgcaacgatc cggagtggaa acttaggata cgtcacaagc agcagcgttt attactgctt      720

catcattcag cgaagtgtag ccacaaaggc caatgtccag taacacctca ttgcgctgac      780

atgaaacggc tctggagaca catggagggc tgcaaagaca accaatgtcg tgttccgcac      840

tgtttttcct ccagagcaat tttaagtcac tacaggaaat gcaaagatcc tgcttgtcca      900

gcgtgtggac cggtgcgtga aacagttcgt aagggccagc gacctggctc tagcgcgaat      960

gcaatgaacc ttataagaac atcatcgcct tctgttccta atcagcaacc gcagcaacaa     1020

atgatgcaag gtaccgacat ggtgcaaatg ggcaattcgt cttttggtgg cggttcggtt     1080

cggtcaggca gtgggcattc tgtaatgccc cctccaagcg taccagtagg caataacgac     1140

atgcagtttt cttcgcagtt tcgatcaaac aatccggttc cctcaggcga ccaagtattc     1200

ttcggtagcg accagcagtc ctctgacgcc catggtactt cactttccgc caatacccaa     1260

tcttcgctga aagatcatgc ttcgacaacg atcccaggag gtagcagacc gccaggtagt     1320

agtgaatcgg agtggcaaaa aattcgacac aagcaacagc gactccttct gttacggcat     1380

gcgtcaagat gccagcacga gatgggtaca tgcccagtaa cacctcactg cgctagcatg     1440

aaaaaattat gggaccatat tgctcactgc aaagaccagc agtgcaaagt tcagcactgt     1500

cttacaagtc gttacgtact cagtcattat cgtcgttgca agaacgcgcg atgcccctct     1560

tgtgggcctg tacgtgattc aattcgtagg tcggcgctaa aagaaaagca gcaacaaggg     1620

gctgtgatga gttcgatttc gttggatgac gatgttttca agactccagt ttcctcaccg     1680

cctcaacttg agccctctct gaccgaatcg tcgttacaac cagaacagaa gcgaagaagg     1740

aaaggagacg atgcatctga agccacgagt tccacgatgc ctatcagcaa tgaaacttta     1800

aaagtgccat ctgcacctgg ttcgtctctg gctgcgacgg tggattctaa attgcagtca     1860

gctcctccta cgaaagggga tatgaaaccg aaggacacca aaagtgctga tagatccttg     1920

ttgaatagct ttactttgac ggaacttgag acacacttgc agtctcttga ccggaaaacg     1980

cagctaccag ctgctaagct caagtctaaa tgctcagaag tgctgaaggg tttacaaaca     2040

caccagcacg gatgggtgtt taattgtcct gttgacccag ttgaacttgg ccttcccgac     2100

tattttgaga tcatcaaaaa accgatggac cttggaacta tccagaaaaa ggtggaaagt     2160

gggggcatcc attcaatcga ggaattcata gctctcgttc atctcacgtt tgataacgcg     2220

atggcgtaca acgagtctga atcggtagtg tacggaatgg cgaaagaatt gaagacaaaa     2280

ttcgagggtg atgtcaagaa gctaatgaaa acgctggaag aggaagacat ggagcggcga     2340

caaaatgatc gcgcatgcca tttgtgtgga tgcgaaaagc tgttgttcga gccacctgtt     2400

tacttttgca acggaatgaa ttgcccgagt cagcgaattc gcaggaacaa taatttttac     2460

atcggaggca ataatcagta tttctggtgc agttcgtgct ttaatgaact tgacgacaag     2520

atccccatcg agttaattga catgacaata atgaaaagtg atcttgtcaa aaagagaaac     2580

gacgaagttc acgaggaaag ctgggtgcaa tgcgacacat gtgagaactg ggtgcatcag     2640

atttgtggct tatttaacac tcgccaaaac aaagagcacc atagcgagta taattgtccc     2700

agatgcattc gggataagcg gataacatgt ggtgatatac catttactag accaccaggc     2760

gcatccgatt tgccccgaac aacactatcg gagtgtcttg aacagcatat cgcgaatcgg     2820

atcgagaaga aaaagaggca gctcgcagaa gacaagcaaa gaaacgaggg aatttcattt     2880

gacgatgcgt tgaaatatgt cgagtccgga ggtccgatta tcatccgcca ggttacagca     2940

atggatcgaa agcttgaagt cagggatttg atgcgagagc gttatgcgca taagaattac     3000

ccagaagaat tcccttttcg gtgcaaatgc atcgttgtct tccagaagct tgacggagtc     3060

gataccatct tgttcgcgct ctatgtgtat gaacacggag agaacaatcc tccacccaac     3120

cagcgatgtg tttacatttc atacttggac agtgtgcatt tcatgcgacc gcgaaatttg     3180

aggacttttg tctatcacga gattctcata gcgtatctcg actacgcgcg ccagagaggt     3240

ttcgccactg ctcatatttg ggcatgtcca ccccttagag gcgacgatta cattttcttc     3300

gccaagccag aggaccagaa aacgccacgt gacaacaggc ttcgccaatg gtaccaagag     3360

atgttgatcg aagcccaaaa acgagggatt gttggaaagc ttacgaatat gtacgatctg     3420

tattttgcaa acgaatcact tgatgcgaca gctgttccct atatggaggg tgactatttt     3480

cccggcgaag ctgaaaatat cattaagctt cttcaagaag gtaaaggaaa gaaagccgga     3540

aacggaggga aaaagaaaaa gagcaaggcc agcaaagggt ctactggtac gcggtcgaca     3600

ggtgttgacg aggaagcact tctcgccagc ggattcatgg acgacgcaaa gtcactgaaa     3660

gacttggacc gcgatcaggt gatggtgaaa cttggcgaaa caatccagcc catgaaagaa     3720

agtttcattg tagcttttct gaattggtcc ggcgcgaaag aagaggataa ggtcgtgccc     3780

gaggcgatga tcaaggcccg tgctgaatac gtggatgaga atctagaaag cgacgctgcc     3840

ggtagcaagc gcgatgctga agggcatacc gcaaatagct cgacccattc tgataaggtt     3900

attaatgacg acgaagagga tcttgattgt gagttcttga ataaccgcca agcttttctc     3960

aacctttgtc gaggtaatca ttatcagttc gacgagctca gacgctctaa gcacacgtcc     4020

atgatggtcc tttggcactt gcacaacaga gacgcgccca agtttgttca acaatgtgtg     4080

gcttgcagcc gagagattct cagtggtaag cgataccact gtagcacgtg ccctgactat     4140

gatctctgtc aagactgcta caaagacccg aaggttaata gaggaaactg cacccatact     4200

ttgactccaa tcgctgtcga tcctgatgcg aaccaggaac gcaatggcat ggacgacgcc     4260

gaacgacagg ctcgccagcg caatcttatg atgcacattc agctgatcga acacgcctcc     4320

ggatgtgtgt cgaagacatg cacttcgtcg aactgcgcca agatgaagaa ttatcttcac     4380

catgctagta tctgccgcgt gaaggttcaa ggcggatgta aaatctgtaa gaagatctgg     4440

actctcctga gaatccacgc ccagaaatgc agacaggcgc gatgtccgat cccgcaatgt     4500

aatgctattc gtgagaagat gcgacaacta cagaagcagc aacaggccat ggacgacaga     4560

cgtcgtctag agatgaaccg ccacatgcgt ttcggtggcg cagccccgtc ctaa           4614


<210>  34
<211>  1537
<212>  PRT
<213>  Navicula WT0229


<220>
<221>  misc_feature
<223>  translation product 4241628

<400>  34

Met Ser Thr Gln Gln Gln Gln Pro Pro Pro Gln Pro Pro Pro Pro Pro 
1               5                   10                  15      


Gln Ala Gln Gly Met Gly Gly Gly Ser Trp Gln Ser Asp Arg Asp Ile 
            20                  25                  30          


Pro His Arg Arg Glu Met Ile Gln His Ile Ile Lys Leu Leu Lys Lys 
        35                  40                  45              


Asp Arg Ser Gly Ser Pro Glu Ser Leu Asn Arg Leu Pro Gln Met Ala 
    50                  55                  60                  


Lys His Leu Glu Val Ser Leu Tyr Arg Asn Ala Pro Ser Phe Glu Ala 
65                  70                  75                  80  


Phe Val Asp Met Ser Thr Leu Lys Gln Arg Leu His Arg Ile Ala Ala 
                85                  90                  95      


Glu Val Ser Arg Arg Ser Arg Ser Gln Asn Asp Ser Arg Arg Asp Asp 
            100                 105                 110         


Ser Met Arg Pro Gln Ser Val Asp His Ser Leu Pro Ser Phe Ser Gln 
        115                 120                 125             


Asn Gly Met Arg Gly Gly Ser Ser Ser Pro Tyr Met Gly Gly Met Ser 
    130                 135                 140                 


Ala Gly Ser Gly His Met Asn Ser Gly Ser Met Asn Asn Gly Ser Met 
145                 150                 155                 160 


Asn Ser Gly Arg Met Val Asn Met Glu Asp Ile Asn Pro Met Ser Asn 
                165                 170                 175     


Gly Val Ser Ser Gln Val Tyr His Gln Gln Pro Gln Arg Asn Asp Arg 
            180                 185                 190         


Met Ser Gln His Gln Pro Pro Pro Gln Gln Pro Gln Gln Arg Pro Asn 
        195                 200                 205             


Leu Gln Pro Pro Ala Met Gln Pro Gln Gly Gln Asn Arg Asn Asp Pro 
    210                 215                 220                 


Glu Trp Lys Leu Arg Ile Arg His Lys Gln Gln Arg Leu Leu Leu Leu 
225                 230                 235                 240 


His His Ser Ala Lys Cys Ser His Lys Gly Gln Cys Pro Val Thr Pro 
                245                 250                 255     


His Cys Ala Asp Met Lys Arg Leu Trp Arg His Met Glu Gly Cys Lys 
            260                 265                 270         


Asp Asn Gln Cys Arg Val Pro His Cys Phe Ser Ser Arg Ala Ile Leu 
        275                 280                 285             


Ser His Tyr Arg Lys Cys Lys Asp Pro Ala Cys Pro Ala Cys Gly Pro 
    290                 295                 300                 


Val Arg Glu Thr Val Arg Lys Gly Gln Arg Pro Gly Ser Ser Ala Asn 
305                 310                 315                 320 


Ala Met Asn Leu Ile Arg Thr Ser Ser Pro Ser Val Pro Asn Gln Gln 
                325                 330                 335     


Pro Gln Gln Gln Met Met Gln Gly Thr Asp Met Val Gln Met Gly Asn 
            340                 345                 350         


Ser Ser Phe Gly Gly Gly Ser Val Arg Ser Gly Ser Gly His Ser Val 
        355                 360                 365             


Met Pro Pro Pro Ser Val Pro Val Gly Asn Asn Asp Met Gln Phe Ser 
    370                 375                 380                 


Ser Gln Phe Arg Ser Asn Asn Pro Val Pro Ser Gly Asp Gln Val Phe 
385                 390                 395                 400 


Phe Gly Ser Asp Gln Gln Ser Ser Asp Ala His Gly Thr Ser Leu Ser 
                405                 410                 415     


Ala Asn Thr Gln Ser Ser Leu Lys Asp His Ala Ser Thr Thr Ile Pro 
            420                 425                 430         


Gly Gly Ser Arg Pro Pro Gly Ser Ser Glu Ser Glu Trp Gln Lys Ile 
        435                 440                 445             


Arg His Lys Gln Gln Arg Leu Leu Leu Leu Arg His Ala Ser Arg Cys 
    450                 455                 460                 


Gln His Glu Met Gly Thr Cys Pro Val Thr Pro His Cys Ala Ser Met 
465                 470                 475                 480 


Lys Lys Leu Trp Asp His Ile Ala His Cys Lys Asp Gln Gln Cys Lys 
                485                 490                 495     


Val Gln His Cys Leu Thr Ser Arg Tyr Val Leu Ser His Tyr Arg Arg 
            500                 505                 510         


Cys Lys Asn Ala Arg Cys Pro Ser Cys Gly Pro Val Arg Asp Ser Ile 
        515                 520                 525             


Arg Arg Ser Ala Leu Lys Glu Lys Gln Gln Gln Gly Ala Val Met Ser 
    530                 535                 540                 


Ser Ile Ser Leu Asp Asp Asp Val Phe Lys Thr Pro Val Ser Ser Pro 
545                 550                 555                 560 


Pro Gln Leu Glu Pro Ser Leu Thr Glu Ser Ser Leu Gln Pro Glu Gln 
                565                 570                 575     


Lys Arg Arg Arg Lys Gly Asp Asp Ala Ser Glu Ala Thr Ser Ser Thr 
            580                 585                 590         


Met Pro Ile Ser Asn Glu Thr Leu Lys Val Pro Ser Ala Pro Gly Ser 
        595                 600                 605             


Ser Leu Ala Ala Thr Val Asp Ser Lys Leu Gln Ser Ala Pro Pro Thr 
    610                 615                 620                 


Lys Gly Asp Met Lys Pro Lys Asp Thr Lys Ser Ala Asp Arg Ser Leu 
625                 630                 635                 640 


Leu Asn Ser Phe Thr Leu Thr Glu Leu Glu Thr His Leu Gln Ser Leu 
                645                 650                 655     


Asp Arg Lys Thr Gln Leu Pro Ala Ala Lys Leu Lys Ser Lys Cys Ser 
            660                 665                 670         


Glu Val Leu Lys Gly Leu Gln Thr His Gln His Gly Trp Val Phe Asn 
        675                 680                 685             


Cys Pro Val Asp Pro Val Glu Leu Gly Leu Pro Asp Tyr Phe Glu Ile 
    690                 695                 700                 


Ile Lys Lys Pro Met Asp Leu Gly Thr Ile Gln Lys Lys Val Glu Ser 
705                 710                 715                 720 


Gly Gly Ile His Ser Ile Glu Glu Phe Ile Ala Leu Val His Leu Thr 
                725                 730                 735     


Phe Asp Asn Ala Met Ala Tyr Asn Glu Ser Glu Ser Val Val Tyr Gly 
            740                 745                 750         


Met Ala Lys Glu Leu Lys Thr Lys Phe Glu Gly Asp Val Lys Lys Leu 
        755                 760                 765             


Met Lys Thr Leu Glu Glu Glu Asp Met Glu Arg Arg Gln Asn Asp Arg 
    770                 775                 780                 


Ala Cys His Leu Cys Gly Cys Glu Lys Leu Leu Phe Glu Pro Pro Val 
785                 790                 795                 800 


Tyr Phe Cys Asn Gly Met Asn Cys Pro Ser Gln Arg Ile Arg Arg Asn 
                805                 810                 815     


Asn Asn Phe Tyr Ile Gly Gly Asn Asn Gln Tyr Phe Trp Cys Ser Ser 
            820                 825                 830         


Cys Phe Asn Glu Leu Asp Asp Lys Ile Pro Ile Glu Leu Ile Asp Met 
        835                 840                 845             


Thr Ile Met Lys Ser Asp Leu Val Lys Lys Arg Asn Asp Glu Val His 
    850                 855                 860                 


Glu Glu Ser Trp Val Gln Cys Asp Thr Cys Glu Asn Trp Val His Gln 
865                 870                 875                 880 


Ile Cys Gly Leu Phe Asn Thr Arg Gln Asn Lys Glu His His Ser Glu 
                885                 890                 895     


Tyr Asn Cys Pro Arg Cys Ile Arg Asp Lys Arg Ile Thr Cys Gly Asp 
            900                 905                 910         


Ile Pro Phe Thr Arg Pro Pro Gly Ala Ser Asp Leu Pro Arg Thr Thr 
        915                 920                 925             


Leu Ser Glu Cys Leu Glu Gln His Ile Ala Asn Arg Ile Glu Lys Lys 
    930                 935                 940                 


Lys Arg Gln Leu Ala Glu Asp Lys Gln Arg Asn Glu Gly Ile Ser Phe 
945                 950                 955                 960 


Asp Asp Ala Leu Lys Tyr Val Glu Ser Gly Gly Pro Ile Ile Ile Arg 
                965                 970                 975     


Gln Val Thr Ala Met Asp Arg Lys Leu Glu Val Arg Asp Leu Met Arg 
            980                 985                 990         


Glu Arg Tyr Ala His Lys Asn Tyr  Pro Glu Glu Phe Pro  Phe Arg Cys 
        995                 1000                 1005             


Lys Cys  Ile Val Val Phe Gln  Lys Leu Asp Gly Val  Asp Thr Ile 
    1010                 1015                 1020             


Leu Phe  Ala Leu Tyr Val Tyr  Glu His Gly Glu Asn  Asn Pro Pro 
    1025                 1030                 1035             


Pro Asn  Gln Arg Cys Val Tyr  Ile Ser Tyr Leu Asp  Ser Val His 
    1040                 1045                 1050             


Phe Met  Arg Pro Arg Asn Leu  Arg Thr Phe Val Tyr  His Glu Ile 
    1055                 1060                 1065             


Leu Ile  Ala Tyr Leu Asp Tyr  Ala Arg Gln Arg Gly  Phe Ala Thr 
    1070                 1075                 1080             


Ala His  Ile Trp Ala Cys Pro  Pro Leu Arg Gly Asp  Asp Tyr Ile 
    1085                 1090                 1095             


Phe Phe  Ala Lys Pro Glu Asp  Gln Lys Thr Pro Arg  Asp Asn Arg 
    1100                 1105                 1110             


Leu Arg  Gln Trp Tyr Gln Glu  Met Leu Ile Glu Ala  Gln Lys Arg 
    1115                 1120                 1125             


Gly Ile  Val Gly Lys Leu Thr  Asn Met Tyr Asp Leu  Tyr Phe Ala 
    1130                 1135                 1140             


Asn Glu  Ser Leu Asp Ala Thr  Ala Val Pro Tyr Met  Glu Gly Asp 
    1145                 1150                 1155             


Tyr Phe  Pro Gly Glu Ala Glu  Asn Ile Ile Lys Leu  Leu Gln Glu 
    1160                 1165                 1170             


Gly Lys  Gly Lys Lys Ala Gly  Asn Gly Gly Lys Lys  Lys Lys Ser 
    1175                 1180                 1185             


Lys Ala  Ser Lys Gly Ser Thr  Gly Thr Arg Ser Thr  Gly Val Asp 
    1190                 1195                 1200             


Glu Glu  Ala Leu Leu Ala Ser  Gly Phe Met Asp Asp  Ala Lys Ser 
    1205                 1210                 1215             


Leu Lys  Asp Leu Asp Arg Asp  Gln Val Met Val Lys  Leu Gly Glu 
    1220                 1225                 1230             


Thr Ile  Gln Pro Met Lys Glu  Ser Phe Ile Val Ala  Phe Leu Asn 
    1235                 1240                 1245             


Trp Ser  Gly Ala Lys Glu Glu  Asp Lys Val Val Pro  Glu Ala Met 
    1250                 1255                 1260             


Ile Lys  Ala Arg Ala Glu Tyr  Val Asp Glu Asn Leu  Glu Ser Asp 
    1265                 1270                 1275             


Ala Ala  Gly Ser Lys Arg Asp  Ala Glu Gly His Thr  Ala Asn Ser 
    1280                 1285                 1290             


Ser Thr  His Ser Asp Lys Val  Ile Asn Asp Asp Glu  Glu Asp Leu 
    1295                 1300                 1305             


Asp Cys  Glu Phe Leu Asn Asn  Arg Gln Ala Phe Leu  Asn Leu Cys 
    1310                 1315                 1320             


Arg Gly  Asn His Tyr Gln Phe  Asp Glu Leu Arg Arg  Ser Lys His 
    1325                 1330                 1335             


Thr Ser  Met Met Val Leu Trp  His Leu His Asn Arg  Asp Ala Pro 
    1340                 1345                 1350             


Lys Phe  Val Gln Gln Cys Val  Ala Cys Ser Arg Glu  Ile Leu Ser 
    1355                 1360                 1365             


Gly Lys  Arg Tyr His Cys Ser  Thr Cys Pro Asp Tyr  Asp Leu Cys 
    1370                 1375                 1380             


Gln Asp  Cys Tyr Lys Asp Pro  Lys Val Asn Arg Gly  Asn Cys Thr 
    1385                 1390                 1395             


His Thr  Leu Thr Pro Ile Ala  Val Asp Pro Asp Ala  Asn Gln Glu 
    1400                 1405                 1410             


Arg Asn  Gly Met Asp Asp Ala  Glu Arg Gln Ala Arg  Gln Arg Asn 
    1415                 1420                 1425             


Leu Met  Met His Ile Gln Leu  Ile Glu His Ala Ser  Gly Cys Val 
    1430                 1435                 1440             


Ser Lys  Thr Cys Thr Ser Ser  Asn Cys Ala Lys Met  Lys Asn Tyr 
    1445                 1450                 1455             


Leu His  His Ala Ser Ile Cys  Arg Val Lys Val Gln  Gly Gly Cys 
    1460                 1465                 1470             


Lys Ile  Cys Lys Lys Ile Trp  Thr Leu Leu Arg Ile  His Ala Gln 
    1475                 1480                 1485             


Lys Cys  Arg Gln Ala Arg Cys  Pro Ile Pro Gln Cys  Asn Ala Ile 
    1490                 1495                 1500             


Arg Glu  Lys Met Arg Gln Leu  Gln Lys Gln Gln Gln  Ala Met Asp 
    1505                 1510                 1515             


Asp Arg  Arg Arg Leu Glu Met  Asn Arg His Met Arg  Phe Gly Gly 
    1520                 1525                 1530             


Ala Ala  Pro Ser 
    1535         


<210>  35
<211>  6698
<212>  DNA
<213>  Navicula WT0229


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:36

<400>  35
tgaacgattc tgctgtacga tctcatccga gttcagtaaa cccaagacaa gacaatatca       60

ataatagcag tcaaagtgca aatggaagct cggatcagaa agcattcctt gacggaagct      120

ttgctggcgg ctggcagtct aacgcggacc ttcccgatcg acgtgaggtg atttttcgaa      180

ttctggaagt aatcaggcac atgagaccag atacggatcg tgtttcatca aagcttccgc      240

acatggcaaa aagtttagaa gagcacctct accggtcagc acagactaag gaagagtaca      300

tggattttgg aacactaagg cgccgcctcc aggcaatcgc acacggactc gaactccacc      360

ggccgtcttc ttctaccagt cagcaatcgg gagaccaatc caatcagacc cagcccgtgg      420

catctgggcg aaatcaaggc aggtcgtcat ttcaaactgc cagtagtacc gacagtggga      480

tgtattctag cgcaagtaat gccaatccgg ataatttgaa ctcgtcgatg acttccggaa      540

tgggtatggg ttccatcaac cagtcacaga taccctctaa tatgcaaaaa atgggcggac      600

aaatgaacca gtccgctagt tttgggagta atatgggtgt gaacacgtct gcgccttcca      660

gtatgggaaa cctttcgcag cgacagacat cttcgcaatt ccagaagaat agtaactggt      720

caaacagcgg aagtgcagac tacggcggca gtatggctgg tgattcgaac atgaccacta      780

tgccactaaa cgggggaatt ctaccgatgg gtggaatgag cctgccacaa cagcaacaga      840

tgcaaccttc aatgattcag caacagcagc ctatgtcgaa cattatgccg tctcagatgg      900

ggagtgctgc tggacaggct attaattcgt cacaggctgt tccacagaac tgggcttctc      960

agtcagcacc cttttgggat tcagccggtt caacgagtgg aatggactcg tcgatgcaaa     1020

agaaaaaagt cattcttcag cagcagcaaa gacttctcct gctccgccat gccagtaaat     1080

gtacggctgg tgcggcatgt caaactagat tttgctccca gatggtgacg ctgtggcggc     1140

atatgaaggc ttgcagggat aaaaattgca aaacgcctca ttgcgtcagt agtcgctgcg     1200

ttcttaatca ttaccgtatt tgtaagagca acggaaacac agcgagctgc gaagtgtgtg     1260

gccctgtgat gatgaaaatt aaccaaaagg atactgaagc catggctggt gatcctctca     1320

cgagagatca agatctgtcg atgaagcaga cgcatcatcc atatcaacag caatctatgc     1380

tgcagccggg aggggggcag atgtcctcgg gattgatgaa ccaaaatatc atgcaaccac     1440

ttcctatgca acctatgcaa cagcagtcga ccatgcaacc agggtcttcg cagcacatgg     1500

tgaactcggt gtcggagggt atccaattgc aacaagctaa acaacagcag cagctgaaac     1560

tgaagcagca acttgagagc cttaagcaac tccaaaagaa gcaagaagaa ctcgaaaagc     1620

aacagaagcg acttgagatg cacgctcagc agattcagga tcctagctcg ccccaagcgc     1680

agcagctaca gcagcagcaa atgctcttac ggcacctcca gaaaaagtgt caacaacaac     1740

aactcatgct acagcaagaa gtaaaacttc ttatgacggg tggcggcaat ccgcaacaag     1800

accaaagtca gatatctcta caagctcaag ttcaacaaca gcaatttcag cagcagctta     1860

ttgcgcacca gcagggactg cagggcacca tgggattgtc agcgggcgcc gtacatggag     1920

ttcagtcttc cagtgcaatt gcggagggtc atatccaagg tccctcgccg cgtaagtccc     1980

ccgttccagc aaagccacga tatacggggg gcaaaggacg acggggagga aaagggaagt     2040

cgctaggtat aaattctgcc gtttcaaaga aaaggctaag cgaaaccgag gacgattccc     2100

cccagtaccg aaaacgcgcg acaatcacaa aaccagaaac tgagctttcc gagactattg     2160

ccgtggaaag aaattcttca ctgggctctg ggcttgatga aacatcgctg attcccttaa     2220

tgacgagaga tgagataatg aagcatctcg aatccttgaa caagcggttt tgtttgtcat     2280

ctcgaactgt gactcacaag tgcatgccta ttatacaagg tcttattgac gaccaatttg     2340

gatgggtttt tcatgatcct gtcgatcctg taacgttggg tcttcctgat tactttgatg     2400

tagtgaaaac accaatgtgc ctcgaactcg ttaagaagaa gctggagaat gcagtttata     2460

acgacacgga atcatttgcc cgagatctca gtctggtttt tgaaaatgcc atcctgtata     2520

acggtgaaag cagtgaagta ggggagttag ccaaatccat gctggataaa tttcacacgg     2580

tttatcgcgc tttggttcaa gaacttgaat cttcctattt aagtttggag aagaaaggtg     2640

aactgtgttc gctttgtgga aatcagaata gaaaattcga gcctacaatt ctttactgtc     2700

aaggcgattg tgaaatgcag caaatcaagc ggcatgcgac ttacttcacg gatcgggcaa     2760

agcagaataa ctggtgcgag ggatgcttta agctccttca agatgaccag cctatcatgc     2820

ttgacgacgg caccgaagtg agaaaaagcg acctacagga atgtcaaaat gatgcgcttc     2880

cggaagaggg atgggtcaac tgcgatcact gcaactcatg ggtgcaccaa gtctgctcat     2940

tatttaacgg gcgagttaat aagtccggcg cgcggtacac atgccctaat tgttatctaa     3000

gcaaaggtag tatcgggaga gctttctcga agcaaataaa ggttgcggct gatctacctc     3060

attgcaagat gagtgaagcg atcgaacgtg gtcttttggc tacgcttgag aaagcctaca     3120

gagaccggtc aaatgaaatc ggagtggcca tcgacgatgt tgaaaaggca gaatccttga     3180

caattcgcgt cgtatcgaat attgaaaaga agcacatagt tggagaagag atgcttaaac     3240

ggtacaagga cgagggatgt gtaaaaggtt atcctgtccg tacaaagtgt atcgctctat     3300

ttcaaaagat acacggggct gatacccttc ttttcgccat gtacgtctac gaatatggtc     3360

acgaatgccc tgctcccaac cgcagacgag tgtacatatc ctatcttgat tctgtccaat     3420

attttgaacc caaatgctat cgaacacttg tttatcactc agttctggta gagtatctcc     3480

gctacgttaa agctcgcggt ttccacactg ctcatttttg gagctgtcca ccaacccccg     3540

gcgacgatta tatttttcat gtacacccat cgcaccagct ggtaccacgt gaagatatgc     3600

tcagagcttg gtatcatgat atgctagatc gcgccaaagc agagggtata gttattcgga     3660

caactaattt atacgacgag tactttgtga aaggcggcat ggactccgtg ccatgggcta     3720

cagggcgacc gacatgttta ccctattttg aaggggacta tattcctggg gagatcgaaa     3780

ctatcataag atcggagcaa gaaaaattga cggatggctc ggagatggga gaagaagaca     3840

gagtgatggc gcgtctcggt ctaaatctcc gcaaaatgaa agacaatttc atcgtcgtgc     3900

acctcagaag taggcgtttt gctgcagcag tagaaagcgg tgatgatgta tctgatttca     3960

aagatgacag tgatgaagaa cttgtacgta acaagcgcgc gaagattagc ggcaaagaca     4020

caggttcatt atgtatgcaa gctgaacttc tcgaccaagc tgggtctgtt accttggaaa     4080

gagacccaac ggcacatact acaacggagg aacatgctag cggggcgtca tcggaaaatg     4140

agcaccctga gcgcagtcct gttggtgagg ttaaaaaggc agagccagtc tcagctttcg     4200

tggcaacgga aaccagccag tcaccatcca catcaacacg agatgaaagc gctaacaatg     4260

gaaggcatgt acaagatcaa gcattaccga taataggtga tgtccctacc gacgagatgg     4320

agtcgagtaa tggaagtcct agtccgcttg tcgaaaccat cgagacggtt gagtcgcatg     4380

acctgcccgc gtttgcttct cattatgacg aagagaaaag gaaccacgaa ctcagtgctg     4440

aaagggagcc aaccaaaacg acaacagctg agactagctc cgtcacttca gccctggtca     4500

agaaggacga tgataccgaa gaacgcgtga atacaccaaa cgtcgaagcc caggaccatg     4560

ttgaaaaaga accaccgtca cgcaacatta agttggatcc agacctacaa catggcggcc     4620

acgcagtcgc acaagacatt tcatctgaaa tagtcgagac tcaaaccaat caggaacagt     4680

caaatgattg cgccccaact gattctgttc tcttggataa taacaggccc gaggagattg     4740

aaaaaggagc ttcagacatt gatcatcgtt gtgcagacga ggccatcgaa tttaaacaag     4800

ttattgatga tgatgacaag gatgcttcca ggaaagtaaa cgagtgcaat cgtggtcgcg     4860

agataataga agagaaagtc ggtctaggtg acagaaacaa gaatactgac gaaatgccat     4920

taccgtatgc agccgacacc aataaagtga ctctcaacga cgaaacggcg gccactaaca     4980

gagaatcggt taatgatatc gctatgactg ccgattccgg aggaatgaac gaagacgaag     5040

cggttgccgt taatcatgaa attaccggag ctgaagttgt gatcgcagac ggactcgaag     5100

agaacaaaga tgaatctatg aaaggggatt ctgtgatcat gaacacagta aacgaggtta     5160

aagacagctc cttggtttct tccagagaag gcataaaaag tagtcttgag agtatgatcg     5220

ctaatccagc agaggcaaaa gacgcctcag aggttcctgg attggaaacc attgataatg     5280

gtgttgctgt gaacgcgaat ccctcaagag acaacacagt ccattcgcag acctctgatg     5340

aagcattgcc ggaaattaca ggtggagata gcaaaggtga aacctctgat cataatgcca     5400

gcaaaagtga tactgtgact gctgtttcag cgggggaaat tgcaagcacg accgatcgag     5460

tatgccaagt agactctgga aaacaggttt caacgccaga gaatgcacca aaaaatttgg     5520

gaccaggaca cgtgcttcta atgcctgaag cagctgcaac cacttcaagc gaccaagaat     5580

gtctcttccc tcaaagagga atttcagaca aattgagtca tgtctctgat gttgatgcta     5640

caatagccga tcaacagccc ccgaatgcac cagaagagtc tattgcaatt gcacgtccaa     5700

tcataatcaa taccgcctct gacgtagagg gtaaaaacat tagttcacag acagaagcag     5760

cagtcgaaaa gactgcttct gaccaagatg tcttgactcc accgcgtgat gccacggttt     5820

ttgttcaatt tcctgatggc caatcctcgg accaagcgac tgctgatcca agtttattta     5880

ctggcaacag ctcgcaagga ctgaagcgtg atattgatga agtcaagccg ctactttctc     5940

gtcatttcga cgaaatgaat cgacctctaa aatacgtaac ggatacagct gatcccgacg     6000

aaccgataga agttgagctt ttcgaatcgc ggcaaagatt tctcaattat tgccaaacta     6060

gccactgtca gttcgacgaa ttgcgacggg cgaagcactc gactatgatg gtcttatttc     6120

agcttcacaa ccctgcggcc ccgctgtttc tccagcaatg cggtgcttgt tacagagaca     6180

taacacacgg tgtccgatac agttgtaaca attgctctaa atttgatcta tgcgaggatt     6240

gctacaagcc tgttacttca ggtttgtggg ccaaaagaga ctctcgtttt gagcatgatc     6300

catcccacac atttacacct atcgacatgg aagtgtccac tgacagcgca atgagccaag     6360

aagatcggca gaaggcccta aaagcacatt gcgccttatt ggagcacgca ggtgactgtc     6420

aaggtccccc gacttgttct cttcaaaact gtcaaaaaat gaagaagctt tttaatcacg     6480

tgcgaagttg cgaaatcaag ccaaagagcg attgtagaat atgcactcgt ctcatttcgc     6540

tgtgtgcaat tcatgctcga acatgcaaaa tcgctgactc gtgcccagtt ccattctgtg     6600

atcgcatccg cgatagaaac gaaagacttc agcgacaaca acaactcatg gatgatcgcc     6660

gtcgtcaagc ccaaaacgat ttatatcaca cgtcttaa                             6698


<210>  36
<211>  2232
<212>  PRT
<213>  Navicula WT0229


<220>
<221>  misc_feature
<223>  translation product 4244056

<400>  36

Met Asn Asp Ser Ala Val Arg Ser His Pro Ser Ser Val Asn Pro Arg 
1               5                   10                  15      


Gln Asp Asn Ile Asn Asn Ser Ser Gln Ser Ala Asn Gly Ser Ser Asp 
            20                  25                  30          


Gln Lys Ala Phe Leu Asp Gly Ser Phe Ala Gly Gly Trp Gln Ser Asn 
        35                  40                  45              


Ala Asp Leu Pro Asp Arg Arg Glu Val Ile Phe Arg Ile Leu Glu Val 
    50                  55                  60                  


Ile Arg His Met Arg Pro Asp Thr Asp Arg Val Ser Ser Lys Leu Pro 
65                  70                  75                  80  


His Met Ala Lys Ser Leu Glu Glu His Leu Tyr Arg Ser Ala Gln Thr 
                85                  90                  95      


Lys Glu Glu Tyr Met Asp Phe Gly Thr Leu Arg Arg Arg Leu Gln Ala 
            100                 105                 110         


Ile Ala His Gly Leu Glu Leu His Arg Pro Ser Ser Ser Thr Ser Gln 
        115                 120                 125             


Gln Ser Gly Asp Gln Ser Asn Gln Thr Gln Pro Val Ala Ser Gly Arg 
    130                 135                 140                 


Asn Gln Gly Arg Ser Ser Phe Gln Thr Ala Ser Ser Thr Asp Ser Gly 
145                 150                 155                 160 


Met Tyr Ser Ser Ala Ser Asn Ala Asn Pro Asp Asn Leu Asn Ser Ser 
                165                 170                 175     


Met Thr Ser Gly Met Gly Met Gly Ser Ile Asn Gln Ser Gln Ile Pro 
            180                 185                 190         


Ser Asn Met Gln Lys Met Gly Gly Gln Met Asn Gln Ser Ala Ser Phe 
        195                 200                 205             


Gly Ser Asn Met Gly Val Asn Thr Ser Ala Pro Ser Ser Met Gly Asn 
    210                 215                 220                 


Leu Ser Gln Arg Gln Thr Ser Ser Gln Phe Gln Lys Asn Ser Asn Trp 
225                 230                 235                 240 


Ser Asn Ser Gly Ser Ala Asp Tyr Gly Gly Ser Met Ala Gly Asp Ser 
                245                 250                 255     


Asn Met Thr Thr Met Pro Leu Asn Gly Gly Ile Leu Pro Met Gly Gly 
            260                 265                 270         


Met Ser Leu Pro Gln Gln Gln Gln Met Gln Pro Ser Met Ile Gln Gln 
        275                 280                 285             


Gln Gln Pro Met Ser Asn Ile Met Pro Ser Gln Met Gly Ser Ala Ala 
    290                 295                 300                 


Gly Gln Ala Ile Asn Ser Ser Gln Ala Val Pro Gln Asn Trp Ala Ser 
305                 310                 315                 320 


Gln Ser Ala Pro Phe Trp Asp Ser Ala Gly Ser Thr Ser Gly Met Asp 
                325                 330                 335     


Ser Ser Met Gln Lys Lys Lys Val Ile Leu Gln Gln Gln Gln Arg Leu 
            340                 345                 350         


Leu Leu Leu Arg His Ala Ser Lys Cys Thr Ala Gly Ala Ala Cys Gln 
        355                 360                 365             


Thr Arg Phe Cys Ser Gln Met Val Thr Leu Trp Arg His Met Lys Ala 
    370                 375                 380                 


Cys Arg Asp Lys Asn Cys Lys Thr Pro His Cys Val Ser Ser Arg Cys 
385                 390                 395                 400 


Val Leu Asn His Tyr Arg Ile Cys Lys Ser Asn Gly Asn Thr Ala Ser 
                405                 410                 415     


Cys Glu Val Cys Gly Pro Val Met Met Lys Ile Asn Gln Lys Asp Thr 
            420                 425                 430         


Glu Ala Met Ala Gly Asp Pro Leu Thr Arg Asp Gln Asp Leu Ser Met 
        435                 440                 445             


Lys Gln Thr His His Pro Tyr Gln Gln Gln Ser Met Leu Gln Pro Gly 
    450                 455                 460                 


Gly Gly Gln Met Ser Ser Gly Leu Met Asn Gln Asn Ile Met Gln Pro 
465                 470                 475                 480 


Leu Pro Met Gln Pro Met Gln Gln Gln Ser Thr Met Gln Pro Gly Ser 
                485                 490                 495     


Ser Gln His Met Val Asn Ser Val Ser Glu Gly Ile Gln Leu Gln Gln 
            500                 505                 510         


Ala Lys Gln Gln Gln Gln Leu Lys Leu Lys Gln Gln Leu Glu Ser Leu 
        515                 520                 525             


Lys Gln Leu Gln Lys Lys Gln Glu Glu Leu Glu Lys Gln Gln Lys Arg 
    530                 535                 540                 


Leu Glu Met His Ala Gln Gln Ile Gln Asp Pro Ser Ser Pro Gln Ala 
545                 550                 555                 560 


Gln Gln Leu Gln Gln Gln Gln Met Leu Leu Arg His Leu Gln Lys Lys 
                565                 570                 575     


Cys Gln Gln Gln Gln Leu Met Leu Gln Gln Glu Val Lys Leu Leu Met 
            580                 585                 590         


Thr Gly Gly Gly Asn Pro Gln Gln Asp Gln Ser Gln Ile Ser Leu Gln 
        595                 600                 605             


Ala Gln Val Gln Gln Gln Gln Phe Gln Gln Gln Leu Ile Ala His Gln 
    610                 615                 620                 


Gln Gly Leu Gln Gly Thr Met Gly Leu Ser Ala Gly Ala Val His Gly 
625                 630                 635                 640 


Val Gln Ser Ser Ser Ala Ile Ala Glu Gly His Ile Gln Gly Pro Ser 
                645                 650                 655     


Pro Arg Lys Ser Pro Val Pro Ala Lys Pro Arg Tyr Thr Gly Gly Lys 
            660                 665                 670         


Gly Arg Arg Gly Gly Lys Gly Lys Ser Leu Gly Ile Asn Ser Ala Val 
        675                 680                 685             


Ser Lys Lys Arg Leu Ser Glu Thr Glu Asp Asp Ser Pro Gln Tyr Arg 
    690                 695                 700                 


Lys Arg Ala Thr Ile Thr Lys Pro Glu Thr Glu Leu Ser Glu Thr Ile 
705                 710                 715                 720 


Ala Val Glu Arg Asn Ser Ser Leu Gly Ser Gly Leu Asp Glu Thr Ser 
                725                 730                 735     


Leu Ile Pro Leu Met Thr Arg Asp Glu Ile Met Lys His Leu Glu Ser 
            740                 745                 750         


Leu Asn Lys Arg Phe Cys Leu Ser Ser Arg Thr Val Thr His Lys Cys 
        755                 760                 765             


Met Pro Ile Ile Gln Gly Leu Ile Asp Asp Gln Phe Gly Trp Val Phe 
    770                 775                 780                 


His Asp Pro Val Asp Pro Val Thr Leu Gly Leu Pro Asp Tyr Phe Asp 
785                 790                 795                 800 


Val Val Lys Thr Pro Met Cys Leu Glu Leu Val Lys Lys Lys Leu Glu 
                805                 810                 815     


Asn Ala Val Tyr Asn Asp Thr Glu Ser Phe Ala Arg Asp Leu Ser Leu 
            820                 825                 830         


Val Phe Glu Asn Ala Ile Leu Tyr Asn Gly Glu Ser Ser Glu Val Gly 
        835                 840                 845             


Glu Leu Ala Lys Ser Met Leu Asp Lys Phe His Thr Val Tyr Arg Ala 
    850                 855                 860                 


Leu Val Gln Glu Leu Glu Ser Ser Tyr Leu Ser Leu Glu Lys Lys Gly 
865                 870                 875                 880 


Glu Leu Cys Ser Leu Cys Gly Asn Gln Asn Arg Lys Phe Glu Pro Thr 
                885                 890                 895     


Ile Leu Tyr Cys Gln Gly Asp Cys Glu Met Gln Gln Ile Lys Arg His 
            900                 905                 910         


Ala Thr Tyr Phe Thr Asp Arg Ala Lys Gln Asn Asn Trp Cys Glu Gly 
        915                 920                 925             


Cys Phe Lys Leu Leu Gln Asp Asp Gln Pro Ile Met Leu Asp Asp Gly 
    930                 935                 940                 


Thr Glu Val Arg Lys Ser Asp Leu Gln Glu Cys Gln Asn Asp Ala Leu 
945                 950                 955                 960 


Pro Glu Glu Gly Trp Val Asn Cys Asp His Cys Asn Ser Trp Val His 
                965                 970                 975     


Gln Val Cys Ser Leu Phe Asn Gly Arg Val Asn Lys Ser Gly Ala Arg 
            980                 985                 990         


Tyr Thr Cys Pro Asn Cys Tyr Leu  Ser Lys Gly Ser Ile  Gly Arg Ala 
        995                 1000                 1005             


Phe Ser  Lys Gln Ile Lys Val  Ala Ala Asp Leu Pro  His Cys Lys 
    1010                 1015                 1020             


Met Ser  Glu Ala Ile Glu Arg  Gly Leu Leu Ala Thr  Leu Glu Lys 
    1025                 1030                 1035             


Ala Tyr  Arg Asp Arg Ser Asn  Glu Ile Gly Val Ala  Ile Asp Asp 
    1040                 1045                 1050             


Val Glu  Lys Ala Glu Ser Leu  Thr Ile Arg Val Val  Ser Asn Ile 
    1055                 1060                 1065             


Glu Lys  Lys His Ile Val Gly  Glu Glu Met Leu Lys  Arg Tyr Lys 
    1070                 1075                 1080             


Asp Glu  Gly Cys Val Lys Gly  Tyr Pro Val Arg Thr  Lys Cys Ile 
    1085                 1090                 1095             


Ala Leu  Phe Gln Lys Ile His  Gly Ala Asp Thr Leu  Leu Phe Ala 
    1100                 1105                 1110             


Met Tyr  Val Tyr Glu Tyr Gly  His Glu Cys Pro Ala  Pro Asn Arg 
    1115                 1120                 1125             


Arg Arg  Val Tyr Ile Ser Tyr  Leu Asp Ser Val Gln  Tyr Phe Glu 
    1130                 1135                 1140             


Pro Lys  Cys Tyr Arg Thr Leu  Val Tyr His Ser Val  Leu Val Glu 
    1145                 1150                 1155             


Tyr Leu  Arg Tyr Val Lys Ala  Arg Gly Phe His Thr  Ala His Phe 
    1160                 1165                 1170             


Trp Ser  Cys Pro Pro Thr Pro  Gly Asp Asp Tyr Ile  Phe His Val 
    1175                 1180                 1185             


His Pro  Ser His Gln Leu Val  Pro Arg Glu Asp Met  Leu Arg Ala 
    1190                 1195                 1200             


Trp Tyr  His Asp Met Leu Asp  Arg Ala Lys Ala Glu  Gly Ile Val 
    1205                 1210                 1215             


Ile Arg  Thr Thr Asn Leu Tyr  Asp Glu Tyr Phe Val  Lys Gly Gly 
    1220                 1225                 1230             


Met Asp  Ser Val Pro Trp Ala  Thr Gly Arg Pro Thr  Cys Leu Pro 
    1235                 1240                 1245             


Tyr Phe  Glu Gly Asp Tyr Ile  Pro Gly Glu Ile Glu  Thr Ile Ile 
    1250                 1255                 1260             


Arg Ser  Glu Gln Glu Lys Leu  Thr Asp Gly Ser Glu  Met Gly Glu 
    1265                 1270                 1275             


Glu Asp  Arg Val Met Ala Arg  Leu Gly Leu Asn Leu  Arg Lys Met 
    1280                 1285                 1290             


Lys Asp  Asn Phe Ile Val Val  His Leu Arg Ser Arg  Arg Phe Ala 
    1295                 1300                 1305             


Ala Ala  Val Glu Ser Gly Asp  Asp Val Ser Asp Phe  Lys Asp Asp 
    1310                 1315                 1320             


Ser Asp  Glu Glu Leu Val Arg  Asn Lys Arg Ala Lys  Ile Ser Gly 
    1325                 1330                 1335             


Lys Asp  Thr Gly Ser Leu Cys  Met Gln Ala Glu Leu  Leu Asp Gln 
    1340                 1345                 1350             


Ala Gly  Ser Val Thr Leu Glu  Arg Asp Pro Thr Ala  His Thr Thr 
    1355                 1360                 1365             


Thr Glu  Glu His Ala Ser Gly  Ala Ser Ser Glu Asn  Glu His Pro 
    1370                 1375                 1380             


Glu Arg  Ser Pro Val Gly Glu  Val Lys Lys Ala Glu  Pro Val Ser 
    1385                 1390                 1395             


Ala Phe  Val Ala Thr Glu Thr  Ser Gln Ser Pro Ser  Thr Ser Thr 
    1400                 1405                 1410             


Arg Asp  Glu Ser Ala Asn Asn  Gly Arg His Val Gln  Asp Gln Ala 
    1415                 1420                 1425             


Leu Pro  Ile Ile Gly Asp Val  Pro Thr Asp Glu Met  Glu Ser Ser 
    1430                 1435                 1440             


Asn Gly  Ser Pro Ser Pro Leu  Val Glu Thr Ile Glu  Thr Val Glu 
    1445                 1450                 1455             


Ser His  Asp Leu Pro Ala Phe  Ala Ser His Tyr Asp  Glu Glu Lys 
    1460                 1465                 1470             


Arg Asn  His Glu Leu Ser Ala  Glu Arg Glu Pro Thr  Lys Thr Thr 
    1475                 1480                 1485             


Thr Ala  Glu Thr Ser Ser Val  Thr Ser Ala Leu Val  Lys Lys Asp 
    1490                 1495                 1500             


Asp Asp  Thr Glu Glu Arg Val  Asn Thr Pro Asn Val  Glu Ala Gln 
    1505                 1510                 1515             


Asp His  Val Glu Lys Glu Pro  Pro Ser Arg Asn Ile  Lys Leu Asp 
    1520                 1525                 1530             


Pro Asp  Leu Gln His Gly Gly  His Ala Val Ala Gln  Asp Ile Ser 
    1535                 1540                 1545             


Ser Glu  Ile Val Glu Thr Gln  Thr Asn Gln Glu Gln  Ser Asn Asp 
    1550                 1555                 1560             


Cys Ala  Pro Thr Asp Ser Val  Leu Leu Asp Asn Asn  Arg Pro Glu 
    1565                 1570                 1575             


Glu Ile  Glu Lys Gly Ala Ser  Asp Ile Asp His Arg  Cys Ala Asp 
    1580                 1585                 1590             


Glu Ala  Ile Glu Phe Lys Gln  Val Ile Asp Asp Asp  Asp Lys Asp 
    1595                 1600                 1605             


Ala Ser  Arg Lys Val Asn Glu  Cys Asn Arg Gly Arg  Glu Ile Ile 
    1610                 1615                 1620             


Glu Glu  Lys Val Gly Leu Gly  Asp Arg Asn Lys Asn  Thr Asp Glu 
    1625                 1630                 1635             


Met Pro  Leu Pro Tyr Ala Ala  Asp Thr Asn Lys Val  Thr Leu Asn 
    1640                 1645                 1650             


Asp Glu  Thr Ala Ala Thr Asn  Arg Glu Ser Val Asn  Asp Ile Ala 
    1655                 1660                 1665             


Met Thr  Ala Asp Ser Gly Gly  Met Asn Glu Asp Glu  Ala Val Ala 
    1670                 1675                 1680             


Val Asn  His Glu Ile Thr Gly  Ala Glu Val Val Ile  Ala Asp Gly 
    1685                 1690                 1695             


Leu Glu  Glu Asn Lys Asp Glu  Ser Met Lys Gly Asp  Ser Val Ile 
    1700                 1705                 1710             


Met Asn  Thr Val Asn Glu Val  Lys Asp Ser Ser Leu  Val Ser Ser 
    1715                 1720                 1725             


Arg Glu  Gly Ile Lys Ser Ser  Leu Glu Ser Met Ile  Ala Asn Pro 
    1730                 1735                 1740             


Ala Glu  Ala Lys Asp Ala Ser  Glu Val Pro Gly Leu  Glu Thr Ile 
    1745                 1750                 1755             


Asp Asn  Gly Val Ala Val Asn  Ala Asn Pro Ser Arg  Asp Asn Thr 
    1760                 1765                 1770             


Val His  Ser Gln Thr Ser Asp  Glu Ala Leu Pro Glu  Ile Thr Gly 
    1775                 1780                 1785             


Gly Asp  Ser Lys Gly Glu Thr  Ser Asp His Asn Ala  Ser Lys Ser 
    1790                 1795                 1800             


Asp Thr  Val Thr Ala Val Ser  Ala Gly Glu Ile Ala  Ser Thr Thr 
    1805                 1810                 1815             


Asp Arg  Val Cys Gln Val Asp  Ser Gly Lys Gln Val  Ser Thr Pro 
    1820                 1825                 1830             


Glu Asn  Ala Pro Lys Asn Leu  Gly Pro Gly His Val  Leu Leu Met 
    1835                 1840                 1845             


Pro Glu  Ala Ala Ala Thr Thr  Ser Ser Asp Gln Glu  Cys Leu Phe 
    1850                 1855                 1860             


Pro Gln  Arg Gly Ile Ser Asp  Lys Leu Ser His Val  Ser Asp Val 
    1865                 1870                 1875             


Asp Ala  Thr Ile Ala Asp Gln  Gln Pro Pro Asn Ala  Pro Glu Glu 
    1880                 1885                 1890             


Ser Ile  Ala Ile Ala Arg Pro  Ile Ile Ile Asn Thr  Ala Ser Asp 
    1895                 1900                 1905             


Val Glu  Gly Lys Asn Ile Ser  Ser Gln Thr Glu Ala  Ala Val Glu 
    1910                 1915                 1920             


Lys Thr  Ala Ser Asp Gln Asp  Val Leu Thr Pro Pro  Arg Asp Ala 
    1925                 1930                 1935             


Thr Val  Phe Val Gln Phe Pro  Asp Gly Gln Ser Ser  Asp Gln Ala 
    1940                 1945                 1950             


Thr Ala  Asp Pro Ser Leu Phe  Thr Gly Asn Ser Ser  Gln Gly Leu 
    1955                 1960                 1965             


Lys Arg  Asp Ile Asp Glu Val  Lys Pro Leu Leu Ser  Arg His Phe 
    1970                 1975                 1980             


Asp Glu  Met Asn Arg Pro Leu  Lys Tyr Val Thr Asp  Thr Ala Asp 
    1985                 1990                 1995             


Pro Asp  Glu Pro Ile Glu Val  Glu Leu Phe Glu Ser  Arg Gln Arg 
    2000                 2005                 2010             


Phe Leu  Asn Tyr Cys Gln Thr  Ser His Cys Gln Phe  Asp Glu Leu 
    2015                 2020                 2025             


Arg Arg  Ala Lys His Ser Thr  Met Met Val Leu Phe  Gln Leu His 
    2030                 2035                 2040             


Asn Pro  Ala Ala Pro Leu Phe  Leu Gln Gln Cys Gly  Ala Cys Tyr 
    2045                 2050                 2055             


Arg Asp  Ile Thr His Gly Val  Arg Tyr Ser Cys Asn  Asn Cys Ser 
    2060                 2065                 2070             


Lys Phe  Asp Leu Cys Glu Asp  Cys Tyr Lys Pro Val  Thr Ser Gly 
    2075                 2080                 2085             


Leu Trp  Ala Lys Arg Asp Ser  Arg Phe Glu His Asp  Pro Ser His 
    2090                 2095                 2100             


Thr Phe  Thr Pro Ile Asp Met  Glu Val Ser Thr Asp  Ser Ala Met 
    2105                 2110                 2115             


Ser Gln  Glu Asp Arg Gln Lys  Ala Leu Lys Ala His  Cys Ala Leu 
    2120                 2125                 2130             


Leu Glu  His Ala Gly Asp Cys  Gln Gly Pro Pro Thr  Cys Ser Leu 
    2135                 2140                 2145             


Gln Asn  Cys Gln Lys Met Lys  Lys Leu Phe Asn His  Val Arg Ser 
    2150                 2155                 2160             


Cys Glu  Ile Lys Pro Lys Ser  Asp Cys Arg Ile Cys  Thr Arg Leu 
    2165                 2170                 2175             


Ile Ser  Leu Cys Ala Ile His  Ala Arg Thr Cys Lys  Ile Ala Asp 
    2180                 2185                 2190             


Ser Cys  Pro Val Pro Phe Cys  Asp Arg Ile Arg Asp  Arg Asn Glu 
    2195                 2200                 2205             


Arg Leu  Gln Arg Gln Gln Gln  Leu Met Asp Asp Arg  Arg Arg Gln 
    2210                 2215                 2220             


Ala Gln  Asn Asp Leu Tyr His  Thr Ser 
    2225                 2230         


<210>  37
<211>  4665
<212>  DNA
<213>  Navicula WT0229


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:38

<400>  37
atgcaaaccc atccgcagca gcccggctcg ggcggtgcca gctttcccca gcctcctact       60

cagcaacaac agcagcagca aatatttcca caccaaggac tcaacggcgg gtggcagagt      120

gacaaagatt atgaggatcg tcggaaaatg attgcgaaaa tcgtgcatct cttgcaacaa      180

cggaagccaa acgcgccgca agaatggcta aagaagttgc ctcaaatggc gaaacgatta      240

gaagaatcat tgtacaggtc ggccaaatct ttcaatgagt ataatgatgc aaatacattg      300

aagcacagac tgcagcaact cgccgtaaat attggaatga aaacaaagaa actccagcaa      360

caacaggcga tgatgcaaca gcaaaagatg cagcagcagc aacaacaaca atcgcaacaa      420

ccagggataa atcagttttc acggtcgact atgccggcac aggcgcaaca agaacaaccc      480

cttgtcaagc ctcaagcaca gcaaccaatt ccattatctg ctccacaagg caccaacggt      540

cagcagcaac agcaacgaat agtcaacatg gcggagataa atcctatgat gagttcccaa      600

acaactactc cctcgcagcc tcagcccccg gtacctgcac ctgcacctcc tctacaacaa      660

atccagtatg gtcagccggg gtctgctcca gtcccttccg cgcctcctgc ggcactgtca      720

ggtgcgcccg gtccaaattt atcagcagct gctagcaacg gaggacgaca aatagctaat      780

agacagcagg ttcttcgaca tcaacaacaa cgcctacttc ttttgcgcca cgcagccaag      840

tgccaatatg acgacggtcg atgcccagtg accccgcact gcgcaggtat gaagcgatta      900

tggaaacata ttgcggaatg caagaaccag aaatgtcttg ttccccattg tgtgagctct      960

cgttacgttt tgagtcacta ccatcgttgc aaagacgtgc ggtgccccgt atgtggacct     1020

gtgcgtgaag ccattcatcg cagtcacgaa aagcagaagc acatgcaagc gctcaaacag     1080

cggcaccagc aagctgtgca acaaaatcaa acacaagaag gagctcagca acagcctgct     1140

gcactggctc ccactggagc tgcttctgta catccgaccc agcccctgtc tgctgaacca     1200

ccaaacaaga agcaacggac tgctggggta ttgaccgctc catcattcca agtccaacaa     1260

cgacctttac agcacccggg agcgagaccg gtagcgcctg ggcaaaccca gtctggctac     1320

agtttatcgc agcaacagag tgcacaacag gcgggcccac agttatccca ccatcaagca     1380

ggccagcagc aaggatctcg cccagtcgtc gcatctacgc caggcttggc tttttctaat     1440

ggacaggtga taactccaaa atattcgggt ccaaagcctc aggaggatca tactttgatc     1500

aactgtttct ccgttgaaca aattgaatct catatcgagt cgctgaacaa tggtctgcag     1560

ttgcctcctg cgaagctcaa agcgaaatgc ctcgacgtat tgagactatt acagtcgcat     1620

cagcatgcat gggtgtttaa tactccggtg gatcctgtgg agcttggctt acctgattat     1680

tttgaggtta tcaagacacc aatggactta gggaccatca ggaagaaact tgagaacggt     1740

gtttaccaga agattgaaga attcgagggg cacgttttat tgacattcga gaatgcgatg     1800

ctgtataatc ctgaagggtc agtggtgtac aatatggcaa aagagatgaa agagaaattt     1860

gtgcgcgact atgccaaatt gatcgaaatt ctcaatgagg aagaagacgt taaaaggaag     1920

aacggagaag catgcctact atgtggatgc gaaaagctac ttttcgagcc tcctgtcttc     1980

tattgtaacg gcatgaattg tccgtcgaag cgtatacgac gaaacagcca ttactatgtg     2040

ggtggcaaca accagtatca ctggtgccat caatgttacc aggatcttcg ggataattca     2100

acaatcgatc taggggatat ccaagtaaag aaagagagct tgactaagaa aaagaatgat     2160

gaagtgcacg aggaaagttg ggtgcaatgt gatcgatgcg aacgatgggt gcaccaaatt     2220

tgtgctttgt ttaacacaag gcagaacaaa gaccagcgct ctgaatatgc ttgtcctcgt     2280

tgcacgattg aggaacgcat gaaaagaggc aacttagagg caatctcgtc ttcgccaatg     2340

gcggaagacc ttcctcgaac aaagatgtct gagtatcttg aatctcacgt tcgtcagaaa     2400

gtcgatgagt tcgtggagaa aaaatcgaag gcggtttcga tcgcagaaaa tattccgttc     2460

gaggaggcca agaagaagat tcaaatggga ggcgagataa cgattcgaca ggtaacctct     2520

atggacagga agttggaggt tagggaacgg atgaagagaa gatatgcctt caaaaactat     2580

cccgaagagt ttactttccg gtgcaagtgc tttgttgttt ttcagaatct cgacggggtg     2640

gacgttgttt tgttcggact ttacgtgtac gaacacgacg agaaaaaccc tttaccgaac     2700

agccgcactg tctacgtgtc gtacctagac agtgttcact acatgagacc gcgccaaatg     2760

cgaactttta tatatcatga gatacttatc tcctaccttg actacgtgcg gcgtcgagga     2820

ttttctacag ctcacatctg ggcctgtcct cctctgaaag gcgatgatta catcctttac     2880

gcgaaaccag aagatcagaa aactcctaga gacgatcgcc tcaaacagtg gtatatcgac     2940

atgctggtcg agtcacagag gcgaggaatt gttgggaaac tgacgaacat gtatgatctc     3000

tacttctcga acccgaagaa tgatgcgacc gtcgttccgt acatggatgg agattatttt     3060

cctgctgaag ctgagaacat catcaaagac attgaagaag ggaaaacggg gaagaagtcc     3120

agttctcagg gaaagaagaa agagaaggcg aagcagaaaa agaagtctgg gtctagccgt     3180

ggtggcacac ggtccacggg tttagatgaa gatgcattga aagctagcgg tattctacca     3240

ccaggtgctg atcagaaaag tctcgaggaa ggcggccgtg attttgtcat ggctaagttg     3300

ggagagacta tccagccaat gaaagaaagt ttcatagtgg cgtatcttgc ctggagcgga     3360

gcgaaggatg aggatatgca agttccgaaa gaaattgagg agtatcgtaa cgagcatggc     3420

atcacgtgga agatcaatga agaagcgtcg tctgagaaag gtgataaaga aaacccgaaa     3480

ccgacggagt cgattgagat ggagacgacg ccgactgaag tttcaactag cgtgaatgcg     3540

acagctgggg tcgccgaaaa caaagatccg gaaaaacaaa caggaaacga tggagacgag     3600

aagaatgcca caatgagcat ggacacaggc gcatccagcc tcgagccgaa aagtgatgac     3660

gcgtgtgacg attcgtcaaa ggccaagaca agcgctgaca atatggaatc ggacccggaa     3720

ataaaagtcg aatcacaaac caggtcacag cttgatacgc aggtggaaca aagttccgat     3780

tcagcgaatg catctcaacc taacgctgta aatatgagta ttcgagaagg aaagttcgct     3840

gccatggctg ctaggaaaag agatatcgat ggtgtcccaa aagaaagttc agagggtgaa     3900

gaatctacaa aagccaaaaa cgagccttca aagactgtta ccgtcaaaga tagtaaggga     3960

agaacggtga aagttttgga cgacgatgaa gaggagcttg actgcgaatt cttaaacaat     4020

cggcaagcat ttttgaatct ttgccaggga aatcattacc agttcgatac gattcgccgc     4080

gcaaaacact cttcaatgat ggtactttgg cacctccata atcgtgacgc tcctaagttc     4140

gttcaacagt gcgctacgtg ctccagggaa atattgactg gtatgaggtt ccactgtcca     4200

acttgtgcgg actttgatca gtgtcaagat tgcgtctcca actcgaaaat accgagacat     4260

ccacatccat tgaaacctat agcagttggc aacggccaac aatctgactt gacagacgag     4320

cagcgcaagg agcgccagcg aagtattcag ttgcatatga cacttctgca gcatgctgcc     4380

acatgttcaa acgcgaaatg tccttccgcc aattgcacca aaatgaaggg tctattgaaa     4440

cacgggtcgc aatgccagat caaggcaaca ggcggatgca acgtatgcaa acgtatttgg     4500

gccctactgc aaatacatgc acgacagtgc aagacatcaa gttgtgcagt tcctaactgt     4560

atggcaattc gtgaacgatt tcgccaactc aaaaagcaac agatggcaat ggacgaccgc     4620

aggcgacagg aaatgaatag ggcttgtcgc gggaaacgtg gatga                     4665


<210>  38
<211>  1554
<212>  PRT
<213>  Navicula WT0229


<220>
<221>  misc_feature
<223>  translation product 4244509

<400>  38

Met Gln Thr His Pro Gln Gln Pro Gly Ser Gly Gly Ala Ser Phe Pro 
1               5                   10                  15      


Gln Pro Pro Thr Gln Gln Gln Gln Gln Gln Gln Ile Phe Pro His Gln 
            20                  25                  30          


Gly Leu Asn Gly Gly Trp Gln Ser Asp Lys Asp Tyr Glu Asp Arg Arg 
        35                  40                  45              


Lys Met Ile Ala Lys Ile Val His Leu Leu Gln Gln Arg Lys Pro Asn 
    50                  55                  60                  


Ala Pro Gln Glu Trp Leu Lys Lys Leu Pro Gln Met Ala Lys Arg Leu 
65                  70                  75                  80  


Glu Glu Ser Leu Tyr Arg Ser Ala Lys Ser Phe Asn Glu Tyr Asn Asp 
                85                  90                  95      


Ala Asn Thr Leu Lys His Arg Leu Gln Gln Leu Ala Val Asn Ile Gly 
            100                 105                 110         


Met Lys Thr Lys Lys Leu Gln Gln Gln Gln Ala Met Met Gln Gln Gln 
        115                 120                 125             


Lys Met Gln Gln Gln Gln Gln Gln Gln Ser Gln Gln Pro Gly Ile Asn 
    130                 135                 140                 


Gln Phe Ser Arg Ser Thr Met Pro Ala Gln Ala Gln Gln Glu Gln Pro 
145                 150                 155                 160 


Leu Val Lys Pro Gln Ala Gln Gln Pro Ile Pro Leu Ser Ala Pro Gln 
                165                 170                 175     


Gly Thr Asn Gly Gln Gln Gln Gln Gln Arg Ile Val Asn Met Ala Glu 
            180                 185                 190         


Ile Asn Pro Met Met Ser Ser Gln Thr Thr Thr Pro Ser Gln Pro Gln 
        195                 200                 205             


Pro Pro Val Pro Ala Pro Ala Pro Pro Leu Gln Gln Ile Gln Tyr Gly 
    210                 215                 220                 


Gln Pro Gly Ser Ala Pro Val Pro Ser Ala Pro Pro Ala Ala Leu Ser 
225                 230                 235                 240 


Gly Ala Pro Gly Pro Asn Leu Ser Ala Ala Ala Ser Asn Gly Gly Arg 
                245                 250                 255     


Gln Ile Ala Asn Arg Gln Gln Val Leu Arg His Gln Gln Gln Arg Leu 
            260                 265                 270         


Leu Leu Leu Arg His Ala Ala Lys Cys Gln Tyr Asp Asp Gly Arg Cys 
        275                 280                 285             


Pro Val Thr Pro His Cys Ala Gly Met Lys Arg Leu Trp Lys His Ile 
    290                 295                 300                 


Ala Glu Cys Lys Asn Gln Lys Cys Leu Val Pro His Cys Val Ser Ser 
305                 310                 315                 320 


Arg Tyr Val Leu Ser His Tyr His Arg Cys Lys Asp Val Arg Cys Pro 
                325                 330                 335     


Val Cys Gly Pro Val Arg Glu Ala Ile His Arg Ser His Glu Lys Gln 
            340                 345                 350         


Lys His Met Gln Ala Leu Lys Gln Arg His Gln Gln Ala Val Gln Gln 
        355                 360                 365             


Asn Gln Thr Gln Glu Gly Ala Gln Gln Gln Pro Ala Ala Leu Ala Pro 
    370                 375                 380                 


Thr Gly Ala Ala Ser Val His Pro Thr Gln Pro Leu Ser Ala Glu Pro 
385                 390                 395                 400 


Pro Asn Lys Lys Gln Arg Thr Ala Gly Val Leu Thr Ala Pro Ser Phe 
                405                 410                 415     


Gln Val Gln Gln Arg Pro Leu Gln His Pro Gly Ala Arg Pro Val Ala 
            420                 425                 430         


Pro Gly Gln Thr Gln Ser Gly Tyr Ser Leu Ser Gln Gln Gln Ser Ala 
        435                 440                 445             


Gln Gln Ala Gly Pro Gln Leu Ser His His Gln Ala Gly Gln Gln Gln 
    450                 455                 460                 


Gly Ser Arg Pro Val Val Ala Ser Thr Pro Gly Leu Ala Phe Ser Asn 
465                 470                 475                 480 


Gly Gln Val Ile Thr Pro Lys Tyr Ser Gly Pro Lys Pro Gln Glu Asp 
                485                 490                 495     


His Thr Leu Ile Asn Cys Phe Ser Val Glu Gln Ile Glu Ser His Ile 
            500                 505                 510         


Glu Ser Leu Asn Asn Gly Leu Gln Leu Pro Pro Ala Lys Leu Lys Ala 
        515                 520                 525             


Lys Cys Leu Asp Val Leu Arg Leu Leu Gln Ser His Gln His Ala Trp 
    530                 535                 540                 


Val Phe Asn Thr Pro Val Asp Pro Val Glu Leu Gly Leu Pro Asp Tyr 
545                 550                 555                 560 


Phe Glu Val Ile Lys Thr Pro Met Asp Leu Gly Thr Ile Arg Lys Lys 
                565                 570                 575     


Leu Glu Asn Gly Val Tyr Gln Lys Ile Glu Glu Phe Glu Gly His Val 
            580                 585                 590         


Leu Leu Thr Phe Glu Asn Ala Met Leu Tyr Asn Pro Glu Gly Ser Val 
        595                 600                 605             


Val Tyr Asn Met Ala Lys Glu Met Lys Glu Lys Phe Val Arg Asp Tyr 
    610                 615                 620                 


Ala Lys Leu Ile Glu Ile Leu Asn Glu Glu Glu Asp Val Lys Arg Lys 
625                 630                 635                 640 


Asn Gly Glu Ala Cys Leu Leu Cys Gly Cys Glu Lys Leu Leu Phe Glu 
                645                 650                 655     


Pro Pro Val Phe Tyr Cys Asn Gly Met Asn Cys Pro Ser Lys Arg Ile 
            660                 665                 670         


Arg Arg Asn Ser His Tyr Tyr Val Gly Gly Asn Asn Gln Tyr His Trp 
        675                 680                 685             


Cys His Gln Cys Tyr Gln Asp Leu Arg Asp Asn Ser Thr Ile Asp Leu 
    690                 695                 700                 


Gly Asp Ile Gln Val Lys Lys Glu Ser Leu Thr Lys Lys Lys Asn Asp 
705                 710                 715                 720 


Glu Val His Glu Glu Ser Trp Val Gln Cys Asp Arg Cys Glu Arg Trp 
                725                 730                 735     


Val His Gln Ile Cys Ala Leu Phe Asn Thr Arg Gln Asn Lys Asp Gln 
            740                 745                 750         


Arg Ser Glu Tyr Ala Cys Pro Arg Cys Thr Ile Glu Glu Arg Met Lys 
        755                 760                 765             


Arg Gly Asn Leu Glu Ala Ile Ser Ser Ser Pro Met Ala Glu Asp Leu 
    770                 775                 780                 


Pro Arg Thr Lys Met Ser Glu Tyr Leu Glu Ser His Val Arg Gln Lys 
785                 790                 795                 800 


Val Asp Glu Phe Val Glu Lys Lys Ser Lys Ala Val Ser Ile Ala Glu 
                805                 810                 815     


Asn Ile Pro Phe Glu Glu Ala Lys Lys Lys Ile Gln Met Gly Gly Glu 
            820                 825                 830         


Ile Thr Ile Arg Gln Val Thr Ser Met Asp Arg Lys Leu Glu Val Arg 
        835                 840                 845             


Glu Arg Met Lys Arg Arg Tyr Ala Phe Lys Asn Tyr Pro Glu Glu Phe 
    850                 855                 860                 


Thr Phe Arg Cys Lys Cys Phe Val Val Phe Gln Asn Leu Asp Gly Val 
865                 870                 875                 880 


Asp Val Val Leu Phe Gly Leu Tyr Val Tyr Glu His Asp Glu Lys Asn 
                885                 890                 895     


Pro Leu Pro Asn Ser Arg Thr Val Tyr Val Ser Tyr Leu Asp Ser Val 
            900                 905                 910         


His Tyr Met Arg Pro Arg Gln Met Arg Thr Phe Ile Tyr His Glu Ile 
        915                 920                 925             


Leu Ile Ser Tyr Leu Asp Tyr Val Arg Arg Arg Gly Phe Ser Thr Ala 
    930                 935                 940                 


His Ile Trp Ala Cys Pro Pro Leu Lys Gly Asp Asp Tyr Ile Leu Tyr 
945                 950                 955                 960 


Ala Lys Pro Glu Asp Gln Lys Thr Pro Arg Asp Asp Arg Leu Lys Gln 
                965                 970                 975     


Trp Tyr Ile Asp Met Leu Val Glu Ser Gln Arg Arg Gly Ile Val Gly 
            980                 985                 990         


Lys Leu Thr Asn Met Tyr Asp Leu  Tyr Phe Ser Asn Pro  Lys Asn Asp 
        995                 1000                 1005             


Ala Thr  Val Val Pro Tyr Met  Asp Gly Asp Tyr Phe  Pro Ala Glu 
    1010                 1015                 1020             


Ala Glu  Asn Ile Ile Lys Asp  Ile Glu Glu Gly Lys  Thr Gly Lys 
    1025                 1030                 1035             


Lys Ser  Ser Ser Gln Gly Lys  Lys Lys Glu Lys Ala  Lys Gln Lys 
    1040                 1045                 1050             


Lys Lys  Ser Gly Ser Ser Arg  Gly Gly Thr Arg Ser  Thr Gly Leu 
    1055                 1060                 1065             


Asp Glu  Asp Ala Leu Lys Ala  Ser Gly Ile Leu Pro  Pro Gly Ala 
    1070                 1075                 1080             


Asp Gln  Lys Ser Leu Glu Glu  Gly Gly Arg Asp Phe  Val Met Ala 
    1085                 1090                 1095             


Lys Leu  Gly Glu Thr Ile Gln  Pro Met Lys Glu Ser  Phe Ile Val 
    1100                 1105                 1110             


Ala Tyr  Leu Ala Trp Ser Gly  Ala Lys Asp Glu Asp  Met Gln Val 
    1115                 1120                 1125             


Pro Lys  Glu Ile Glu Glu Tyr  Arg Asn Glu His Gly  Ile Thr Trp 
    1130                 1135                 1140             


Lys Ile  Asn Glu Glu Ala Ser  Ser Glu Lys Gly Asp  Lys Glu Asn 
    1145                 1150                 1155             


Pro Lys  Pro Thr Glu Ser Ile  Glu Met Glu Thr Thr  Pro Thr Glu 
    1160                 1165                 1170             


Val Ser  Thr Ser Val Asn Ala  Thr Ala Gly Val Ala  Glu Asn Lys 
    1175                 1180                 1185             


Asp Pro  Glu Lys Gln Thr Gly  Asn Asp Gly Asp Glu  Lys Asn Ala 
    1190                 1195                 1200             


Thr Met  Ser Met Asp Thr Gly  Ala Ser Ser Leu Glu  Pro Lys Ser 
    1205                 1210                 1215             


Asp Asp  Ala Cys Asp Asp Ser  Ser Lys Ala Lys Thr  Ser Ala Asp 
    1220                 1225                 1230             


Asn Met  Glu Ser Asp Pro Glu  Ile Lys Val Glu Ser  Gln Thr Arg 
    1235                 1240                 1245             


Ser Gln  Leu Asp Thr Gln Val  Glu Gln Ser Ser Asp  Ser Ala Asn 
    1250                 1255                 1260             


Ala Ser  Gln Pro Asn Ala Val  Asn Met Ser Ile Arg  Glu Gly Lys 
    1265                 1270                 1275             


Phe Ala  Ala Met Ala Ala Arg  Lys Arg Asp Ile Asp  Gly Val Pro 
    1280                 1285                 1290             


Lys Glu  Ser Ser Glu Gly Glu  Glu Ser Thr Lys Ala  Lys Asn Glu 
    1295                 1300                 1305             


Pro Ser  Lys Thr Val Thr Val  Lys Asp Ser Lys Gly  Arg Thr Val 
    1310                 1315                 1320             


Lys Val  Leu Asp Asp Asp Glu  Glu Glu Leu Asp Cys  Glu Phe Leu 
    1325                 1330                 1335             


Asn Asn  Arg Gln Ala Phe Leu  Asn Leu Cys Gln Gly  Asn His Tyr 
    1340                 1345                 1350             


Gln Phe  Asp Thr Ile Arg Arg  Ala Lys His Ser Ser  Met Met Val 
    1355                 1360                 1365             


Leu Trp  His Leu His Asn Arg  Asp Ala Pro Lys Phe  Val Gln Gln 
    1370                 1375                 1380             


Cys Ala  Thr Cys Ser Arg Glu  Ile Leu Thr Gly Met  Arg Phe His 
    1385                 1390                 1395             


Cys Pro  Thr Cys Ala Asp Phe  Asp Gln Cys Gln Asp  Cys Val Ser 
    1400                 1405                 1410             


Asn Ser  Lys Ile Pro Arg His  Pro His Pro Leu Lys  Pro Ile Ala 
    1415                 1420                 1425             


Val Gly  Asn Gly Gln Gln Ser  Asp Leu Thr Asp Glu  Gln Arg Lys 
    1430                 1435                 1440             


Glu Arg  Gln Arg Ser Ile Gln  Leu His Met Thr Leu  Leu Gln His 
    1445                 1450                 1455             


Ala Ala  Thr Cys Ser Asn Ala  Lys Cys Pro Ser Ala  Asn Cys Thr 
    1460                 1465                 1470             


Lys Met  Lys Gly Leu Leu Lys  His Gly Ser Gln Cys  Gln Ile Lys 
    1475                 1480                 1485             


Ala Thr  Gly Gly Cys Asn Val  Cys Lys Arg Ile Trp  Ala Leu Leu 
    1490                 1495                 1500             


Gln Ile  His Ala Arg Gln Cys  Lys Thr Ser Ser Cys  Ala Val Pro 
    1505                 1510                 1515             


Asn Cys  Met Ala Ile Arg Glu  Arg Phe Arg Gln Leu  Lys Lys Gln 
    1520                 1525                 1530             


Gln Met  Ala Met Asp Asp Arg  Arg Arg Gln Glu Met  Asn Arg Ala 
    1535                 1540                 1545             


Cys Arg  Gly Lys Arg Gly 
    1550                 


<210>  39
<211>  4080
<212>  DNA
<213>  Ectocarpus silicosus


<220>
<221>  misc_feature
<223>  encodes polypeptide of SEQ ID NO:40

<400>  39
atggggggtg ggctcgtcgc aggggcgggg cagagcccgg cgttgatgcg caacgggagc       60

atgtcctcca gcgccggggg ggggatgggt atgggaagcg ttggcatcgg cggcagcatg      120

acggctaccg ctagcggtag cggcggcggt gctgccgctg ccggcggtgg gagcggtggc      180

ggtggcggtg gcggtggcgg tggggggcgc gatggtggga gctccgggcg tgggggacag      240

cagcgaagga ggaacgctga gtttactccg gaagatcgta aggccgccct ccggcagcag      300

cagcagcggc tgttgctgct gcgacacgcg agcaagtgcc ctgcggaggg ggagcaatgc      360

aaggttacgc cgcactgcca ggcgatgaag cgtttgtgga agcacatcgc cgagtgcaag      420

aaccagcagt gcccggacct caagtgtgcc gtgtgcgctc ccgtgcgcga ggtcattgcc      480

aagtcacacc agcgccagat ggtgactcag gaggcgagga accgagtagc cgggtcgggg      540

cagcccggtg cggggggggt ctccggacag cagcttgttc ccggcagtag tggtcacatg      600

gtggggccca atggggtggc gggtgggtcg ggcggaggca acttctcgaa cccacgcgat      660

ttgacgctgg cgcagaggca gcagcagcag cagcagctgg agacgcagcg gggtttgatc      720

acctcccagc aggcggccca ggcgcgccag cagcagcaaa atcaactcat gtccggacag      780

caggcgcccg ggttgcatca gtcgggatcg attgaccagt ttaataatgc gcacggagga      840

ggcgacaccg gtaggggtgg ggcacgcagc agctctaagt cgtccgcgtc taacgggaag      900

cgatcgtcga gtctgatcgg cgcgccgggc accatcggca ccgcgtcggc aagcggcggc      960

ggtggtggcg cggggagcag cagcaacggc atgatggtgg acccgaatgc cgtcgttccg     1020

gcgaataact cgagggcggc gcggtcggtc gcgcatcagc agggcgcgta ccccgcgggg     1080

tctgcggggc agccggttcg gactgttcag caggctcgag ccacgcccgg caagatgctc     1140

tcccccgagg actgcacttc tctcatcgaa gcgtttacgg aggatcagat cctgaaccac     1200

gtcaagtccc tcgacacggg catgcatgtt agccaggagc gtatccaggc ggcagcgggg     1260

gctgtcctga cgaagctgag ggactctcag ttcggttggg ttttcaacga cccggtggac     1320

ccggtccacc tcaacctgcc ggactacttc gagatcatca cgcacccgat ggacctcggg     1380

actgtggcgc gcaaactggc gaaggagggc gcgggcgggt acctggagca cgaggagttc     1440

gccgcagacg tgcagctggt gttcgacaac gccatgaagt acaacgggcc ggagagcgag     1500

gtgtaccctg tggcggagcg catgaagaag gaattcaaca aggattgggc gctggcgttg     1560

aagcgtatgg aggcggaaga gaacggccgc aaggagaggg gcgagacctg caacctgtgc     1620

ggctactccg ccaagacgtt cgagcccatg acgtactact gcaacggggc tcagtgcaac     1680

gggaagcgca tcgggagggg gcggtacttc taccacgcca cgggctccaa ccagtggcac     1740

tggtgttcca gttgctacaa cgaccttaag gacggggaga tcatcgcgct agccgagacg     1800

gcggtgcgaa aggcggacct gaagaggaag aagaacgacg agcaggcgga ggttggggat     1860

gtggacaacg caagcaagct ctcttggagt ttcacggggg tttgcacctg cgagcgttca     1920

cggcggggta cgagggccgg caacatcgct ccgacggctc acaagttggg cggcaaggac     1980

ctcccgcacg ggcccctgag cgcgtacgtg gaggcgcagg tgaagaagcg gctcgatgcg     2040

gcctacgagg cagaggcgaa ggagagaggg gtccccgtgg accaggtgac gaaggcgaat     2100

accctgtaca tccgcgaggt gtcggtgatg gacacggtcc acctcgtcaa gccgggtttc     2160

caccggcgtt acggccctgc gggggagtac ccggcagact tcccggtccg aagcaagtgc     2220

atcgtgctgt tccaggagct ggacggcgtg gacgtgctcc tgttcgggat gtacgtgtac     2280

gagtacggcc acacgtgccc agcgccgaac cagcgaaggg tgtacatcag ctacctggac     2340

tcggttcact acttccgccc gaggaactat cgaaccatgg tgtatcacga gatcctgatc     2400

gcctacctgg aggaggtgaa gacaagaggc ttccacacgg cgcacatctg ggcgtgcccc     2460

cctgccaagg gggacgacta catcctctac tgccaccccc cggagcagca gactcccaag     2520

gatgaccgcc tgcagcaatg gtacgtcacg atgcttgagg aggcgaaaaa gaggggcatc     2580

gtggagggat tgaccaacct cttcgacgag tactggtcaa acccagaaac cgcggacgca     2640

cgccagctgc catacctcga gggggactat tggatagggg aggcggagaa catcatcaag     2700

gacctcccgg agggcacccc cctgatttgc aagccgaagg tggaggcaaa ggccgatggc     2760

tccgccgcag cagcgccgcc ggactcggcc ggcggcaccg cggccgcaga cggggcgggg     2820

gcggcagcag ggagtggcgc agctccagca acggcagggg ctgccggtga cggcgaggct     2880

ttggtgaagg tcgaggacag cgctgctaag gcggagggtg gtggtggtgg ggatggcgga     2940

gggcgagggg gggagggtaa cggggcagag gcgaagaagg tggagggggg ggaagggaag     3000

gaggaggagg agaaggagga gaagtcgccc gggaagaagg gcaagcgaaa ggcgggagac     3060

ggtgtgaaga agaaggcgaa gaaggcgaga acatccaagt ccggtggcgg aagcaagaag     3120

cgaggggtta agccggagga agctcccatc gtcggtgacc ccctgatgca caagctggcg     3180

gcgatagtgt cgccgatgaa gtcctcgttc atcgtggccc acctcagacc gagagagttc     3240

gttacccaga tgcaggagcg gcgtgcaaag gagaaggcga tcgaagcagc caagaagacg     3300

gtgtcgacgg cggtgagcga gaagaggaaa ccggaccccg agatggcgaa gctggcggag     3360

caggccatcg ccaaagacga gacggaggag gggcagtcta gccaggagtg cgaggtgctg     3420

gacacgagac agaccttcct caacctgtgc caggggaacc actaccagtt cgacatgctc     3480

agacgaggga agcactcttc gatgatggtg ctgtaccacc tgtgcaaccc ggacgtgccc     3540

aagttcctgt cgacgtgctc gaactgctac aaggagatcc actcagggga ccggtatcac     3600

tgtgaggtct gcacggactt cgacctctgc aaggagtgct acaaggcggt gccgcacccc     3660

caccccctca agcccatccc ggtgcgcccg gcggcgcagc agcagaagca cctcagcccc     3720

gcgcagcgag aggagcggca gaggcacatc aagctgcaca tgcagctgct ccagcacgct     3780

tcgacgtgcg aggatcgaaa ctgccagtcc aagaactgct cacggatgaa gaacctcttg     3840

acgcacgggg cgagctgcac catccgggcc cagggcggct gcggcgtgtg caagcgcatt     3900

tgggctcttc tgcagattca cgcgaggcag tgcaagaagg atcgatgctc cgtgccgaag     3960

tgtcggcagc tgcggcagca catgcgcttc ctgcgagagc agcagcaggc catggacgac     4020

cggcgaaggc aggcgatgaa cgagtggtct cggaacagac aggagggaag cggcagctag     4080


<210>  40
<211>  1359
<212>  PRT
<213>  Ectocarpus silicosus


<220>
<221>  misc_feature
<223>  translation product 656007

<400>  40

Met Gly Gly Gly Leu Val Ala Gly Ala Gly Gln Ser Pro Ala Leu Met 
1               5                   10                  15      


Arg Asn Gly Ser Met Ser Ser Ser Ala Gly Gly Gly Met Gly Met Gly 
            20                  25                  30          


Ser Val Gly Ile Gly Gly Ser Met Thr Ala Thr Ala Ser Gly Ser Gly 
        35                  40                  45              


Gly Gly Ala Ala Ala Ala Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly 
    50                  55                  60                  


Gly Gly Gly Gly Gly Arg Asp Gly Gly Ser Ser Gly Arg Gly Gly Gln 
65                  70                  75                  80  


Gln Arg Arg Arg Asn Ala Glu Phe Thr Pro Glu Asp Arg Lys Ala Ala 
                85                  90                  95      


Leu Arg Gln Gln Gln Gln Arg Leu Leu Leu Leu Arg His Ala Ser Lys 
            100                 105                 110         


Cys Pro Ala Glu Gly Glu Gln Cys Lys Val Thr Pro His Cys Gln Ala 
        115                 120                 125             


Met Lys Arg Leu Trp Lys His Ile Ala Glu Cys Lys Asn Gln Gln Cys 
    130                 135                 140                 


Pro Asp Leu Lys Cys Ala Val Cys Ala Pro Val Arg Glu Val Ile Ala 
145                 150                 155                 160 


Lys Ser His Gln Arg Gln Met Val Thr Gln Glu Ala Arg Asn Arg Val 
                165                 170                 175     


Ala Gly Ser Gly Gln Pro Gly Ala Gly Gly Val Ser Gly Gln Gln Leu 
            180                 185                 190         


Val Pro Gly Ser Ser Gly His Met Val Gly Pro Asn Gly Val Ala Gly 
        195                 200                 205             


Gly Ser Gly Gly Gly Asn Phe Ser Asn Pro Arg Asp Leu Thr Leu Ala 
    210                 215                 220                 


Gln Arg Gln Gln Gln Gln Gln Gln Leu Glu Thr Gln Arg Gly Leu Ile 
225                 230                 235                 240 


Thr Ser Gln Gln Ala Ala Gln Ala Arg Gln Gln Gln Gln Asn Gln Leu 
                245                 250                 255     


Met Ser Gly Gln Gln Ala Pro Gly Leu His Gln Ser Gly Ser Ile Asp 
            260                 265                 270         


Gln Phe Asn Asn Ala His Gly Gly Gly Asp Thr Gly Arg Gly Gly Ala 
        275                 280                 285             


Arg Ser Ser Ser Lys Ser Ser Ala Ser Asn Gly Lys Arg Ser Ser Ser 
    290                 295                 300                 


Leu Ile Gly Ala Pro Gly Thr Ile Gly Thr Ala Ser Ala Ser Gly Gly 
305                 310                 315                 320 


Gly Gly Gly Ala Gly Ser Ser Ser Asn Gly Met Met Val Asp Pro Asn 
                325                 330                 335     


Ala Val Val Pro Ala Asn Asn Ser Arg Ala Ala Arg Ser Val Ala His 
            340                 345                 350         


Gln Gln Gly Ala Tyr Pro Ala Gly Ser Ala Gly Gln Pro Val Arg Thr 
        355                 360                 365             


Val Gln Gln Ala Arg Ala Thr Pro Gly Lys Met Leu Ser Pro Glu Asp 
    370                 375                 380                 


Cys Thr Ser Leu Ile Glu Ala Phe Thr Glu Asp Gln Ile Leu Asn His 
385                 390                 395                 400 


Val Lys Ser Leu Asp Thr Gly Met His Val Ser Gln Glu Arg Ile Gln 
                405                 410                 415     


Ala Ala Ala Gly Ala Val Leu Thr Lys Leu Arg Asp Ser Gln Phe Gly 
            420                 425                 430         


Trp Val Phe Asn Asp Pro Val Asp Pro Val His Leu Asn Leu Pro Asp 
        435                 440                 445             


Tyr Phe Glu Ile Ile Thr His Pro Met Asp Leu Gly Thr Val Ala Arg 
    450                 455                 460                 


Lys Leu Ala Lys Glu Gly Ala Gly Gly Tyr Leu Glu His Glu Glu Phe 
465                 470                 475                 480 


Ala Ala Asp Val Gln Leu Val Phe Asp Asn Ala Met Lys Tyr Asn Gly 
                485                 490                 495     


Pro Glu Ser Glu Val Tyr Pro Val Ala Glu Arg Met Lys Lys Glu Phe 
            500                 505                 510         


Asn Lys Asp Trp Ala Leu Ala Leu Lys Arg Met Glu Ala Glu Glu Asn 
        515                 520                 525             


Gly Arg Lys Glu Arg Gly Glu Thr Cys Asn Leu Cys Gly Tyr Ser Ala 
    530                 535                 540                 


Lys Thr Phe Glu Pro Met Thr Tyr Tyr Cys Asn Gly Ala Gln Cys Asn 
545                 550                 555                 560 


Gly Lys Arg Ile Gly Arg Gly Arg Tyr Phe Tyr His Ala Thr Gly Ser 
                565                 570                 575     


Asn Gln Trp His Trp Cys Ser Ser Cys Tyr Asn Asp Leu Lys Asp Gly 
            580                 585                 590         


Glu Ile Ile Ala Leu Ala Glu Thr Ala Val Arg Lys Ala Asp Leu Lys 
        595                 600                 605             


Arg Lys Lys Asn Asp Glu Gln Ala Glu Val Gly Asp Val Asp Asn Ala 
    610                 615                 620                 


Ser Lys Leu Ser Trp Ser Phe Thr Gly Val Cys Thr Cys Glu Arg Ser 
625                 630                 635                 640 


Arg Arg Gly Thr Arg Ala Gly Asn Ile Ala Pro Thr Ala His Lys Leu 
                645                 650                 655     


Gly Gly Lys Asp Leu Pro His Gly Pro Leu Ser Ala Tyr Val Glu Ala 
            660                 665                 670         


Gln Val Lys Lys Arg Leu Asp Ala Ala Tyr Glu Ala Glu Ala Lys Glu 
        675                 680                 685             


Arg Gly Val Pro Val Asp Gln Val Thr Lys Ala Asn Thr Leu Tyr Ile 
    690                 695                 700                 


Arg Glu Val Ser Val Met Asp Thr Val His Leu Val Lys Pro Gly Phe 
705                 710                 715                 720 


His Arg Arg Tyr Gly Pro Ala Gly Glu Tyr Pro Ala Asp Phe Pro Val 
                725                 730                 735     


Arg Ser Lys Cys Ile Val Leu Phe Gln Glu Leu Asp Gly Val Asp Val 
            740                 745                 750         


Leu Leu Phe Gly Met Tyr Val Tyr Glu Tyr Gly His Thr Cys Pro Ala 
        755                 760                 765             


Pro Asn Gln Arg Arg Val Tyr Ile Ser Tyr Leu Asp Ser Val His Tyr 
    770                 775                 780                 


Phe Arg Pro Arg Asn Tyr Arg Thr Met Val Tyr His Glu Ile Leu Ile 
785                 790                 795                 800 


Ala Tyr Leu Glu Glu Val Lys Thr Arg Gly Phe His Thr Ala His Ile 
                805                 810                 815     


Trp Ala Cys Pro Pro Ala Lys Gly Asp Asp Tyr Ile Leu Tyr Cys His 
            820                 825                 830         


Pro Pro Glu Gln Gln Thr Pro Lys Asp Asp Arg Leu Gln Gln Trp Tyr 
        835                 840                 845             


Val Thr Met Leu Glu Glu Ala Lys Lys Arg Gly Ile Val Glu Gly Leu 
    850                 855                 860                 


Thr Asn Leu Phe Asp Glu Tyr Trp Ser Asn Pro Glu Thr Ala Asp Ala 
865                 870                 875                 880 


Arg Gln Leu Pro Tyr Leu Glu Gly Asp Tyr Trp Ile Gly Glu Ala Glu 
                885                 890                 895     


Asn Ile Ile Lys Asp Leu Pro Glu Gly Thr Pro Leu Ile Cys Lys Pro 
            900                 905                 910         


Lys Val Glu Ala Lys Ala Asp Gly Ser Ala Ala Ala Ala Pro Pro Asp 
        915                 920                 925             


Ser Ala Gly Gly Thr Ala Ala Ala Asp Gly Ala Gly Ala Ala Ala Gly 
    930                 935                 940                 


Ser Gly Ala Ala Pro Ala Thr Ala Gly Ala Ala Gly Asp Gly Glu Ala 
945                 950                 955                 960 


Leu Val Lys Val Glu Asp Ser Ala Ala Lys Ala Glu Gly Gly Gly Gly 
                965                 970                 975     


Gly Asp Gly Gly Gly Arg Gly Gly Glu Gly Asn Gly Ala Glu Ala Lys 
            980                 985                 990         


Lys Val Glu Gly Gly Glu Gly Lys  Glu Glu Glu Glu Lys  Glu Glu Lys 
        995                 1000                 1005             


Ser Pro  Gly Lys Lys Gly Lys  Arg Lys Ala Gly Asp  Gly Val Lys 
    1010                 1015                 1020             


Lys Lys  Ala Lys Lys Ala Arg  Thr Ser Lys Ser Gly  Gly Gly Ser 
    1025                 1030                 1035             


Lys Lys  Arg Gly Val Lys Pro  Glu Glu Ala Pro Ile  Val Gly Asp 
    1040                 1045                 1050             


Pro Leu  Met His Lys Leu Ala  Ala Ile Val Ser Pro  Met Lys Ser 
    1055                 1060                 1065             


Ser Phe  Ile Val Ala His Leu  Arg Pro Arg Glu Phe  Val Thr Gln 
    1070                 1075                 1080             


Met Gln  Glu Arg Arg Ala Lys  Glu Lys Ala Ile Glu  Ala Ala Lys 
    1085                 1090                 1095             


Lys Thr  Val Ser Thr Ala Val  Ser Glu Lys Arg Lys  Pro Asp Pro 
    1100                 1105                 1110             


Glu Met  Ala Lys Leu Ala Glu  Gln Ala Ile Ala Lys  Asp Glu Thr 
    1115                 1120                 1125             


Glu Glu  Gly Gln Ser Ser Gln  Glu Cys Glu Val Leu  Asp Thr Arg 
    1130                 1135                 1140             


Gln Thr  Phe Leu Asn Leu Cys  Gln Gly Asn His Tyr  Gln Phe Asp 
    1145                 1150                 1155             


Met Leu  Arg Arg Gly Lys His  Ser Ser Met Met Val  Leu Tyr His 
    1160                 1165                 1170             


Leu Cys  Asn Pro Asp Val Pro  Lys Phe Leu Ser Thr  Cys Ser Asn 
    1175                 1180                 1185             


Cys Tyr  Lys Glu Ile His Ser  Gly Asp Arg Tyr His  Cys Glu Val 
    1190                 1195                 1200             


Cys Thr  Asp Phe Asp Leu Cys  Lys Glu Cys Tyr Lys  Ala Val Pro 
    1205                 1210                 1215             


His Pro  His Pro Leu Lys Pro  Ile Pro Val Arg Pro  Ala Ala Gln 
    1220                 1225                 1230             


Gln Gln  Lys His Leu Ser Pro  Ala Gln Arg Glu Glu  Arg Gln Arg 
    1235                 1240                 1245             


His Ile  Lys Leu His Met Gln  Leu Leu Gln His Ala  Ser Thr Cys 
    1250                 1255                 1260             


Glu Asp  Arg Asn Cys Gln Ser  Lys Asn Cys Ser Arg  Met Lys Asn 
    1265                 1270                 1275             


Leu Leu  Thr His Gly Ala Ser  Cys Thr Ile Arg Ala  Gln Gly Gly 
    1280                 1285                 1290             


Cys Gly  Val Cys Lys Arg Ile  Trp Ala Leu Leu Gln  Ile His Ala 
    1295                 1300                 1305             


Arg Gln  Cys Lys Lys Asp Arg  Cys Ser Val Pro Lys  Cys Arg Gln 
    1310                 1315                 1320             


Leu Arg  Gln His Met Arg Phe  Leu Arg Glu Gln Gln  Gln Ala Met 
    1325                 1330                 1335             


Asp Asp  Arg Arg Arg Gln Ala  Met Asn Glu Trp Ser  Arg Asn Arg 
    1340                 1345                 1350             


Gln Glu  Gly Ser Gly Ser 
    1355                 


<210>  41
<211>  2622
<212>  DNA
<213>  Aureococcus anophagefferens


<220>
<221>  misc_feature
<223>  Encodes polypeptide of SEQ ID NO:42

<400>  41
atgcaacctg tcgatccggt cgaactcaac ttgccggact acttcgatat aatcaagaat       60

ccaatggatc tagggtcaat taaaaaacgc atggaaaata actgctacaa gtccatatct      120

gaatttgggt ctgacgtacg gctcacgttc gacaatgcaa tctcgtataa cggagatggc      180

tcggatgttt gcaaagttgc acgtgaaatg aaagctgttt ttgagaagtt gtatcatgcc      240

atgatcacaa gtattgaagc cgaggaagag catcgcaagt caaatggcga tgtgtgcgtg      300

ctctgtggtt gcgaaaagtt gctttttgaa cccacggtct actactgcaa tggctcctgc      360

aatggacaac gaatccggag gaattcgtat tattacactg gagggcgaaa tcagtatcac      420

tggtgtcaac aatgttttaa tgaattacgc gaaaaggaac cactcgagtt tgcggattgc      480

accctgtgga agaaagaatt gcagaagaag aaaaatgatg agatgcacga agagccttgg      540

gttgaatgct cgcaatgcaa ccgatgggtg caccaaatct gcgccttatt caatggccgc      600

atgaacaaag gaaccactat ctatcactgc ccattttgtt ttatggcaag acgcggcgcc      660

aaagagccac atgcgaagcc acttggcgcc aaagagatcc gccacaccaa aatgtcacgt      720

ttcctcgaag atcgagtgat caagtcgcta gatgatgcat atgcacttag gtcttcaaat      780

ggggtccccc atttgacagc atctgctgtg tatgtgcgtc agttatctaa cattgagaaa      840

gcgcatcagg tgaaacctag aatacttcag cgttatgcag atcagaaata tccacgcgag      900

tttccagttc gatcaaaatg catcctcctc tttcaagaaa tggatggtgt tgatgtcatt      960

ctatttggaa tgtacttgta cgaatatggc cataactgcc cccaaccaaa tcaacgacgc     1020

gtctatgtca gctatcttga ttcagtctac tactttcgac cgcggcaata cagaacgctt     1080

gtctatcatg agatgctcat tgcttatcta gcccacacga aggagcgcgg tttccacacc     1140

gcgcatattt gggcatgccc cccctgcaag ggcgatgatt acattttctt ttgtcaccca     1200

gaggaccaga aaactcccaa ggatgatcgt cttcgctcat ggtacataac cctactagaa     1260

aaagcgaagg aagagggtat tgtcactcac atcacgaatc tctgggacga acatttccag     1320

gcagactacg atgtgaatca tattccttat tttgagggtg attattggcc cggtgaggct     1380

gaaaacgtgg tcaaggctct tgaagacgag gccaatgagc gaaacgaatc taaatctcgt     1440

aaagcaggga gtgctaccaa atcaaaagca aaaatgaaag ggcgaacgca gcgcggcctt     1500

cgatcagatg gctccataga ggaggaaaat gggcaggatg cacttgtcgc acgaatgggc     1560

aaaattctag aaccaatgaa agacgccttc atcgtagcat acttgcagcc acgtgacttt     1620

gctcatgtca tggaaggacg atatgaaaga gagcagaaac ggttgttggg cgatgatgtt     1680

ccaaaatcca acgcgggaag tccagccggc caagtcctta atagcgaggt gtctgcggag     1740

tgctcatcac caccatcaga tacagtggga gctccagtga tatcaacgaa cattgccgag     1800

cccacggttc gattggatgt cactcatccc gtgtcaaata atgaagataa tcaggccccg     1860

actgaaccaa ccgctgacgt caacgcaaaa cctagccacg gtaaagcctt cgacgaaaca     1920

gaagataccg acgaaatcat tgaatctgag ttttatgata cgcgacagca atttctgaat     1980

ctgtgtcaag gaaatcatta tcaattcgat gatctacggc gggccaaaca tacctcaatg     2040

atgtcgctat atcacatgca caatccagat gtaccaaaat ttcttgtaac gtgctcaaat     2100

tgcaatgttg acataaattc tggctactgc tatacctcag aaaaagatac tgagtttcat     2160

ctttgtcagg actgctatca aaagatgcac aaggttttcg ctgacaaatt tccttttcga     2220

aggtctgttg ttggaagtga ttcccaggcc cagctcaccg aagagcaacg tcgtgaccgc     2280

catcgctcca tacaattgca tatgcagcta cttcagcacg cttctggctg ccgaaaccaa     2340

caatgccctt cagcgaactg caacaaaatg aagaatctgt tgaagcacgg agcgacttgc     2400

gtgacacgtg tacagggcgg ctgcgctatt tgccgccgta tttgggcact gttgcagatt     2460

catgcgcgtc aatgtcgccg tgatgcgtgt atggtaccta agtgcaggca gctcaaggaa     2520

cagttgaggg ctcttgccca acaacaagcc caaatggatg aacgtcgccg agcagcaatg     2580

aacgctgctt atcgcaggga gggctccaaa gcggccgtat aa                        2622


<210>  42
<211>  873
<212>  PRT
<213>  Aureococcus anophagefferens


<220>
<221>  misc_feature
<223>  Translation product 378924

<400>  42

Met Gln Pro Val Asp Pro Val Glu Leu Asn Leu Pro Asp Tyr Phe Asp 
1               5                   10                  15      


Ile Ile Lys Asn Pro Met Asp Leu Gly Ser Ile Lys Lys Arg Met Glu 
            20                  25                  30          


Asn Asn Cys Tyr Lys Ser Ile Ser Glu Phe Gly Ser Asp Val Arg Leu 
        35                  40                  45              


Thr Phe Asp Asn Ala Ile Ser Tyr Asn Gly Asp Gly Ser Asp Val Cys 
    50                  55                  60                  


Lys Val Ala Arg Glu Met Lys Ala Val Phe Glu Lys Leu Tyr His Ala 
65                  70                  75                  80  


Met Ile Thr Ser Ile Glu Ala Glu Glu Glu His Arg Lys Ser Asn Gly 
                85                  90                  95      


Asp Val Cys Val Leu Cys Gly Cys Glu Lys Leu Leu Phe Glu Pro Thr 
            100                 105                 110         


Val Tyr Tyr Cys Asn Gly Ser Cys Asn Gly Gln Arg Ile Arg Arg Asn 
        115                 120                 125             


Ser Tyr Tyr Tyr Thr Gly Gly Arg Asn Gln Tyr His Trp Cys Gln Gln 
    130                 135                 140                 


Cys Phe Asn Glu Leu Arg Glu Lys Glu Pro Leu Glu Phe Ala Asp Cys 
145                 150                 155                 160 


Thr Leu Trp Lys Lys Glu Leu Gln Lys Lys Lys Asn Asp Glu Met His 
                165                 170                 175     


Glu Glu Pro Trp Val Glu Cys Ser Gln Cys Asn Arg Trp Val His Gln 
            180                 185                 190         


Ile Cys Ala Leu Phe Asn Gly Arg Met Asn Lys Gly Thr Thr Ile Tyr 
        195                 200                 205             


His Cys Pro Phe Cys Phe Met Ala Arg Arg Gly Ala Lys Glu Pro His 
    210                 215                 220                 


Ala Lys Pro Leu Gly Ala Lys Glu Ile Arg His Thr Lys Met Ser Arg 
225                 230                 235                 240 


Phe Leu Glu Asp Arg Val Ile Lys Ser Leu Asp Asp Ala Tyr Ala Leu 
                245                 250                 255     


Arg Ser Ser Asn Gly Val Pro His Leu Thr Ala Ser Ala Val Tyr Val 
            260                 265                 270         


Arg Gln Leu Ser Asn Ile Glu Lys Ala His Gln Val Lys Pro Arg Ile 
        275                 280                 285             


Leu Gln Arg Tyr Ala Asp Gln Lys Tyr Pro Arg Glu Phe Pro Val Arg 
    290                 295                 300                 


Ser Lys Cys Ile Leu Leu Phe Gln Glu Met Asp Gly Val Asp Val Ile 
305                 310                 315                 320 


Leu Phe Gly Met Tyr Leu Tyr Glu Tyr Gly His Asn Cys Pro Gln Pro 
                325                 330                 335     


Asn Gln Arg Arg Val Tyr Val Ser Tyr Leu Asp Ser Val Tyr Tyr Phe 
            340                 345                 350         


Arg Pro Arg Gln Tyr Arg Thr Leu Val Tyr His Glu Met Leu Ile Ala 
        355                 360                 365             


Tyr Leu Ala His Thr Lys Glu Arg Gly Phe His Thr Ala His Ile Trp 
    370                 375                 380                 


Ala Cys Pro Pro Cys Lys Gly Asp Asp Tyr Ile Phe Phe Cys His Pro 
385                 390                 395                 400 


Glu Asp Gln Lys Thr Pro Lys Asp Asp Arg Leu Arg Ser Trp Tyr Ile 
                405                 410                 415     


Thr Leu Leu Glu Lys Ala Lys Glu Glu Gly Ile Val Thr His Ile Thr 
            420                 425                 430         


Asn Leu Trp Asp Glu His Phe Gln Ala Asp Tyr Asp Val Asn His Ile 
        435                 440                 445             


Pro Tyr Phe Glu Gly Asp Tyr Trp Pro Gly Glu Ala Glu Asn Val Val 
    450                 455                 460                 


Lys Ala Leu Glu Asp Glu Ala Asn Glu Arg Asn Glu Ser Lys Ser Arg 
465                 470                 475                 480 


Lys Ala Gly Ser Ala Thr Lys Ser Lys Ala Lys Met Lys Gly Arg Thr 
                485                 490                 495     


Gln Arg Gly Leu Arg Ser Asp Gly Ser Ile Glu Glu Glu Asn Gly Gln 
            500                 505                 510         


Asp Ala Leu Val Ala Arg Met Gly Lys Ile Leu Glu Pro Met Lys Asp 
        515                 520                 525             


Ala Phe Ile Val Ala Tyr Leu Gln Pro Arg Asp Phe Ala His Val Met 
    530                 535                 540                 


Glu Gly Arg Tyr Glu Arg Glu Gln Lys Arg Leu Leu Gly Asp Asp Val 
545                 550                 555                 560 


Pro Lys Ser Asn Ala Gly Ser Pro Ala Gly Gln Val Leu Asn Ser Glu 
                565                 570                 575     


Val Ser Ala Glu Cys Ser Ser Pro Pro Ser Asp Thr Val Gly Ala Pro 
            580                 585                 590         


Val Ile Ser Thr Asn Ile Ala Glu Pro Thr Val Arg Leu Asp Val Thr 
        595                 600                 605             


His Pro Val Ser Asn Asn Glu Asp Asn Gln Ala Pro Thr Glu Pro Thr 
    610                 615                 620                 


Ala Asp Val Asn Ala Lys Pro Ser His Gly Lys Ala Phe Asp Glu Thr 
625                 630                 635                 640 


Glu Asp Thr Asp Glu Ile Ile Glu Ser Glu Phe Tyr Asp Thr Arg Gln 
                645                 650                 655     


Gln Phe Leu Asn Leu Cys Gln Gly Asn His Tyr Gln Phe Asp Asp Leu 
            660                 665                 670         


Arg Arg Ala Lys His Thr Ser Met Met Ser Leu Tyr His Met His Asn 
        675                 680                 685             


Pro Asp Val Pro Lys Phe Leu Val Thr Cys Ser Asn Cys Asn Val Asp 
    690                 695                 700                 


Ile Asn Ser Gly Tyr Cys Tyr Thr Ser Glu Lys Asp Thr Glu Phe His 
705                 710                 715                 720 


Leu Cys Gln Asp Cys Tyr Gln Lys Met His Lys Val Phe Ala Asp Lys 
                725                 730                 735     


Phe Pro Phe Arg Arg Ser Val Val Gly Ser Asp Ser Gln Ala Gln Leu 
            740                 745                 750         


Thr Glu Glu Gln Arg Arg Asp Arg His Arg Ser Ile Gln Leu His Met 
        755                 760                 765             


Gln Leu Leu Gln His Ala Ser Gly Cys Arg Asn Gln Gln Cys Pro Ser 
    770                 775                 780                 


Ala Asn Cys Asn Lys Met Lys Asn Leu Leu Lys His Gly Ala Thr Cys 
785                 790                 795                 800 


Val Thr Arg Val Gln Gly Gly Cys Ala Ile Cys Arg Arg Ile Trp Ala 
                805                 810                 815     


Leu Leu Gln Ile His Ala Arg Gln Cys Arg Arg Asp Ala Cys Met Val 
            820                 825                 830         


Pro Lys Cys Arg Gln Leu Lys Glu Gln Leu Arg Ala Leu Ala Gln Gln 
        835                 840                 845             


Gln Ala Gln Met Asp Glu Arg Arg Arg Ala Ala Met Asn Ala Ala Tyr 
    850                 855                 860                 


Arg Arg Glu Gly Ser Lys Ala Ala Val 
865                 870             


<210>  43
<211>  5262
<212>  DNA
<213>  Schizochytrium limacinum


<220>
<221>  misc_feature
<223>  Encodes polypeptide of SEQ ID NO:44

<400>  43
atgaaacgtt tgtggaagca catttccaag tgtaaagatc ctcgttgtcc tgaacctcat       60

tgtgtatcat ctcgctatgt attgtcacat taccatcgtt gcgagaaaga agaatgccct      120

gtttgtaaac ctgtccgcct catctcagca tcacaacgta gtcaggctct tgctgcccag      180

cgtaagcagc aacaacaatt acagggcctt ggtcagccaa gttccggtgg agctgtacaa      240

ccaggagtgg cagtggcgcc tagttcttct gcatcttcag cagctgcatc acaaactgct      300

gtaacatccc cagctgctgc tgctgctgct ccggtagtca caggcgcaac ctccacaccg      360

aagcttgcag ctgcaagtac tgctgtccct cctgttgttt tgcgacgtcc cgatggtagt      420

gttgtcaatg acccggcggt tcttgcaagg tataataacc tctctaggca acaacaacag      480

gccctggcat tgcgtcaaca acaacagagc aaccagttag cattgcaacg tcaacgcgtt      540

ttgcagcagc ggcaacagct gctggcaaaa gaacaacgtg cgcagtttcc aaatgctatc      600

gatccaacta aagctcgcca ctgtaagtct ggcactgggc cttctttgcc gttcagtatg      660

tctcgtagaa gcgttgaaaa gcacattgaa tctctccgtg ttgaacgatt gcagccaaac      720

cttcgcccac tcttgcgaaa acttattgaa cacaaatcca acaaaggcat ttacaatgcc      780

cctgtggatt ggaaggccat gaatattccg gactatccac gaattatcaa aactccaatg      840

gatcttggca caattcgcaa gcgtcttgat gcttcctatt acacaaccct ggaccaattt      900

aaaaacgaca ttgtgcttac tttcaagaat gcaatgacct ttaaccctcc tgaaaatgag      960

taccatacac gtgctgcaga cttgctcaag gtagcacata aagagtttcc gctgatccta     1020

aacaagattg agcgaaatgg caggcccaaa aaccaagact gtcagctttg tcgtcaatct     1080

gtttgtgaac aatgtccttt atgtgagcgc ggctgcattc ctttccagcc caaactccta     1140

ttttgctcgg gaacctgtgg caaacgtatc cagcccaata gcgtatacta tacgggtgtt     1200

gggtacaact attgctggtg cagtgactgt tatgacaaag ctcgtaccgg tgttctttca     1260

gttaatggcc agtcctataa caaagacagt ttgacaaaga aaaagaacaa cgaactttac     1320

ggtgagcctt gggtatcatg tgataaatgt gaccgctggg tgcatcagat ttgtgcgctt     1380

ttcaactcac gtaagaacag cctcacatcc tcacaggatt acatttgccc tctctgtctc     1440

atcgaggaat caaaacttgg tgaagcagaa cgagcaaacg agtccaagtc caagtctggt     1500

ggaggcaaga aggcagcagc tgcagcaact gaggaaacta aacccaagcc tgaacaggct     1560

actgtagcga aagatgccaa ggatgaaaag ccaactacag gagacgatgc tacacgtgcg     1620

aagggaaagt acaagcattt tgtccttcct gctggcaagc gggtaccgag tgcaagagag     1680

cttgatcaat ctcgccttgg cgtatttctg gaaaaatggc ttcgtaactg tattactgaa     1740

ttccgcacta gggagatgga gcgcaaacct gacatttctg actatgagct tggacctgat     1800

gcggacaaac tgcatgttcg catcctgtcc aattttgatg aaaagtgcca ggtgaagcct     1860

tttgtgaaac gacttatccc ggagtatcct gatgcttttc cattccgttc acgttgtatt     1920

ttcttgttcc aagaacttaa tggtgttgac gttctcttct ttgcaatgtt tgtccaagaa     1980

tacggatctg agtgccctga accgaaccgt cgcaagattt acatcgctta ccttgattct     2040

gtgtattatt tccaaccccg cagaatgcgt acaaccgtgt atcatgagct tcttctcggg     2100

tatcttgaat acatgcgtcg catgggcatg acatcaacgt acatttgggc atgtcctcct     2160

cctaacaaac gtgatgatta cattatccat tgccatcctg aagatcagcg tgtgcaaaca     2220

cctgaacgtt tacgaaaatg gtatcatgat atgattaagg ttggtgctga tcgaaggatt     2280

tttatgggtt cgtgtgctat gtttgaggag cacttcgaag gttctgctcg tacaaaccag     2340

aagaagagca aatctaaatc taaaaagcgc tcgagcaagt ctaagtccaa gtctaagtca     2400

aagagcagca aaaagggaaa gaagagtagt ggtcgaaagg gtacctcaac acgccgcgtg     2460

agcggagcag ctgccgctgc tgctgctgct gctgctgctg ctgctgccga agcaaaagca     2520

gctgccgagg ccgcagaggc agagactgaa gcgaagaacg gggacggcga aaacgagaat     2580

aacgaggatg acaataacga tgaggcactt ggacttcttg ctgatgttcc cgatatggac     2640

ggtgatgatg atgaggaaga cgatgatgcg aacaccgaag caaaggcgac acctgaagat     2700

ggcgttccgc tgagtgaagc agacaaagct aaagccattg ctgaggctgc aggtgttact     2760

tttcttacac cagccaatgg ggttcgtgct ggtgatgatc gcatgcttgc agaagacttg     2820

gaacgccgaa agcttacgaa acttgctgct tccggtaatt tgccttactt tgagggcgat     2880

tactggcctc aagaagccga agaacttgcg aaggagcgtg ctagacaaaa gaagaaggat     2940

gaagatggaa agggtggtcg tagtaagcga aagcgtcgtc gcgctgggga agaagctgaa     3000

gaaaagaccg aagaagctga acctgaggaa aaacctgttg cagaacttgc tgttgagctt     3060

tctgaagagg aggctgtaga ggctcgtgtc caactaatgc agcgacttgc ccaacaactt     3120

gaggtcatga aggatgattt cctggttgtc aaaatggcgc acgagtgctc ccgatgcgga     3180

aagtatctgc ttcggggacg atgggaatgc cgccatcctt catgccttga ggagtttggc     3240

tttaacaaat cttgtccatt cgctctgtgt aatagttgct acattctcga atcgaaacgc     3300

ccaaaggaac aacaacacgg tggaggttgt attgctgatg gtgaagctgg taggccaaaa     3360

tctgtttcag aagaagcgga caaggacgct ggtaaagacc aggaggtgat cgacgtaaac     3420

gttgagttca aacgcaagga gcgcgagcgc gaagagaagg atgctagaga aaaggcaaaa     3480

cgcgatgctg agattaaaaa gaagaaggag gccagagaaa aagcaaacgg caaccctgat     3540

cagaagaagg tcaaagtaga ggcttctgca gcagatgcta aagaggctac agctggaaac     3600

tcttccgcac catcagcatt aggggatact tctaagagct tgggtagcga ggtggcaaac     3660

ttggagatga agaaagagcc tggcgataat acaaaatctg ttagtgcttc cgggtctgta     3720

aaacgcagcg ctccggagag cacttcggta gtcaaaccaa tttcttcatc ttccgtctct     3780

gaaacgaagg tagatccacc agcaaagccg aatggattgt cccgagcaaa agaggtgaat     3840

aacgaagcta agaaggcaga atcgtcgccg gcagctgctg tggaagttga tgagcagagc     3900

cagcaggtag tgaagaagca aaaggtctca gaaacctcgt ccaaggtaat tgaaacctcg     3960

gctaatagtt cttcaaccac tcaggctggg gacaaaacca agtcacaaga cgagaaagga     4020

acaacaaaca aggccgaaga tggtaagaag cactccgatg cggaagacga ggataagaaa     4080

aaggaacttc tcgctgaatg ggagaagcaa agtattgagt gtactcgtgc tcctgttcac     4140

agcccgtctg aagatcatat tcttcactac gtggatgaaa atcttcctgt gcagactcct     4200

gactacgaca agattattaa gaaccatttg cttgaatctc gccacgcttt tctatctctg     4260

tgtacgggga accgttacca gtttgaccag caacgtcgag ccaagcattc aacaatgatg     4320

gttctctatc atcttcacaa cccagatgca ccagctcatc tttacacgtg ctttgagtgt     4380

cacaatgata ttcttacagg aaagcgctac cactgtgatg tctgtaacgg aggtgattac     4440

gatatctgta ttcattgtaa gagacagacg cgccacgatc atactctcac accctttgtt     4500

gttacccgtg gtgttcaggc agaaacgtca gaatcgcaac ggatgcaacg tgtccaggag     4560

atgcagagag ctcgccaaca ctccctcacg ctattcttgg atgcattggt gcactcttca     4620

cagtgtgacg accctcagtg cacaaaggct ccttgcaaga agatgaagga cttgctcaaa     4680

catcgtatga cttgcgaagt tcgagttcgt ggtggctgcg aaatatgtcg ccgtgtactt     4740

tgccttgtgc aaatgcacgc tcgtaactgt accactgtga actgtcgtgt gccacactgc     4800

gaggacctca aggtccacat caacaaacac aaacagcaaa tgcagctcgc tcgccaaccg     4860

gcgggtgatg ctgctccagg tgcatctgct tcgactgcag ctccggctgc acgttcacag     4920

cagcagccgc agcagcagcc gcagcaactt actcaacagc agttgcaaca tcaacatcaa     4980

ctgcttcaac agcgacaacg ccagctgcag gctcaggctc aagcccttgc ccaagctcag     5040

gcccgtggtg cgcgtggcaa ccgcgcaccc cgtacagtag gggctgctgc ccaagccatc     5100

actcaagccg gacagcaaat ccaagctact gtagtagaag gaagtggagg aacaaaaatc     5160

aagattcgtc caactaattt gaagccttcg aacacaacgg cacctcctgc ttcaggatct     5220

aactcccgtg ccccgcgtgg ccaacggaac gcgcgaagat aa                        5262


<210>  44
<211>  1753
<212>  PRT
<213>  Schizochytrium limacinum


<220>
<221>  misc_feature
<223>  Translation product 6503

<400>  44

Met Lys Arg Leu Trp Lys His Ile Ser Lys Cys Lys Asp Pro Arg Cys 
1               5                   10                  15      


Pro Glu Pro His Cys Val Ser Ser Arg Tyr Val Leu Ser His Tyr His 
            20                  25                  30          


Arg Cys Glu Lys Glu Glu Cys Pro Val Cys Lys Pro Val Arg Leu Ile 
        35                  40                  45              


Ser Ala Ser Gln Arg Ser Gln Ala Leu Ala Ala Gln Arg Lys Gln Gln 
    50                  55                  60                  


Gln Gln Leu Gln Gly Leu Gly Gln Pro Ser Ser Gly Gly Ala Val Gln 
65                  70                  75                  80  


Pro Gly Val Ala Val Ala Pro Ser Ser Ser Ala Ser Ser Ala Ala Ala 
                85                  90                  95      


Ser Gln Thr Ala Val Thr Ser Pro Ala Ala Ala Ala Ala Ala Pro Val 
            100                 105                 110         


Val Thr Gly Ala Thr Ser Thr Pro Lys Leu Ala Ala Ala Ser Thr Ala 
        115                 120                 125             


Val Pro Pro Val Val Leu Arg Arg Pro Asp Gly Ser Val Val Asn Asp 
    130                 135                 140                 


Pro Ala Val Leu Ala Arg Tyr Asn Asn Leu Ser Arg Gln Gln Gln Gln 
145                 150                 155                 160 


Ala Leu Ala Leu Arg Gln Gln Gln Gln Ser Asn Gln Leu Ala Leu Gln 
                165                 170                 175     


Arg Gln Arg Val Leu Gln Gln Arg Gln Gln Leu Leu Ala Lys Glu Gln 
            180                 185                 190         


Arg Ala Gln Phe Pro Asn Ala Ile Asp Pro Thr Lys Ala Arg His Cys 
        195                 200                 205             


Lys Ser Gly Thr Gly Pro Ser Leu Pro Phe Ser Met Ser Arg Arg Ser 
    210                 215                 220                 


Val Glu Lys His Ile Glu Ser Leu Arg Val Glu Arg Leu Gln Pro Asn 
225                 230                 235                 240 


Leu Arg Pro Leu Leu Arg Lys Leu Ile Glu His Lys Ser Asn Lys Gly 
                245                 250                 255     


Ile Tyr Asn Ala Pro Val Asp Trp Lys Ala Met Asn Ile Pro Asp Tyr 
            260                 265                 270         


Pro Arg Ile Ile Lys Thr Pro Met Asp Leu Gly Thr Ile Arg Lys Arg 
        275                 280                 285             


Leu Asp Ala Ser Tyr Tyr Thr Thr Leu Asp Gln Phe Lys Asn Asp Ile 
    290                 295                 300                 


Val Leu Thr Phe Lys Asn Ala Met Thr Phe Asn Pro Pro Glu Asn Glu 
305                 310                 315                 320 


Tyr His Thr Arg Ala Ala Asp Leu Leu Lys Val Ala His Lys Glu Phe 
                325                 330                 335     


Pro Leu Ile Leu Asn Lys Ile Glu Arg Asn Gly Arg Pro Lys Asn Gln 
            340                 345                 350         


Asp Cys Gln Leu Cys Arg Gln Ser Val Cys Glu Gln Cys Pro Leu Cys 
        355                 360                 365             


Glu Arg Gly Cys Ile Pro Phe Gln Pro Lys Leu Leu Phe Cys Ser Gly 
    370                 375                 380                 


Thr Cys Gly Lys Arg Ile Gln Pro Asn Ser Val Tyr Tyr Thr Gly Val 
385                 390                 395                 400 


Gly Tyr Asn Tyr Cys Trp Cys Ser Asp Cys Tyr Asp Lys Ala Arg Thr 
                405                 410                 415     


Gly Val Leu Ser Val Asn Gly Gln Ser Tyr Asn Lys Asp Ser Leu Thr 
            420                 425                 430         


Lys Lys Lys Asn Asn Glu Leu Tyr Gly Glu Pro Trp Val Ser Cys Asp 
        435                 440                 445             


Lys Cys Asp Arg Trp Val His Gln Ile Cys Ala Leu Phe Asn Ser Arg 
    450                 455                 460                 


Lys Asn Ser Leu Thr Ser Ser Gln Asp Tyr Ile Cys Pro Leu Cys Leu 
465                 470                 475                 480 


Ile Glu Glu Ser Lys Leu Gly Glu Ala Glu Arg Ala Asn Glu Ser Lys 
                485                 490                 495     


Ser Lys Ser Gly Gly Gly Lys Lys Ala Ala Ala Ala Ala Thr Glu Glu 
            500                 505                 510         


Thr Lys Pro Lys Pro Glu Gln Ala Thr Val Ala Lys Asp Ala Lys Asp 
        515                 520                 525             


Glu Lys Pro Thr Thr Gly Asp Asp Ala Thr Arg Ala Lys Gly Lys Tyr 
    530                 535                 540                 


Lys His Phe Val Leu Pro Ala Gly Lys Arg Val Pro Ser Ala Arg Glu 
545                 550                 555                 560 


Leu Asp Gln Ser Arg Leu Gly Val Phe Leu Glu Lys Trp Leu Arg Asn 
                565                 570                 575     


Cys Ile Thr Glu Phe Arg Thr Arg Glu Met Glu Arg Lys Pro Asp Ile 
            580                 585                 590         


Ser Asp Tyr Glu Leu Gly Pro Asp Ala Asp Lys Leu His Val Arg Ile 
        595                 600                 605             


Leu Ser Asn Phe Asp Glu Lys Cys Gln Val Lys Pro Phe Val Lys Arg 
    610                 615                 620                 


Leu Ile Pro Glu Tyr Pro Asp Ala Phe Pro Phe Arg Ser Arg Cys Ile 
625                 630                 635                 640 


Phe Leu Phe Gln Glu Leu Asn Gly Val Asp Val Leu Phe Phe Ala Met 
                645                 650                 655     


Phe Val Gln Glu Tyr Gly Ser Glu Cys Pro Glu Pro Asn Arg Arg Lys 
            660                 665                 670         


Ile Tyr Ile Ala Tyr Leu Asp Ser Val Tyr Tyr Phe Gln Pro Arg Arg 
        675                 680                 685             


Met Arg Thr Thr Val Tyr His Glu Leu Leu Leu Gly Tyr Leu Glu Tyr 
    690                 695                 700                 


Met Arg Arg Met Gly Met Thr Ser Thr Tyr Ile Trp Ala Cys Pro Pro 
705                 710                 715                 720 


Pro Asn Lys Arg Asp Asp Tyr Ile Ile His Cys His Pro Glu Asp Gln 
                725                 730                 735     


Arg Val Gln Thr Pro Glu Arg Leu Arg Lys Trp Tyr His Asp Met Ile 
            740                 745                 750         


Lys Val Gly Ala Asp Arg Arg Ile Phe Met Gly Ser Cys Ala Met Phe 
        755                 760                 765             


Glu Glu His Phe Glu Gly Ser Ala Arg Thr Asn Gln Lys Lys Ser Lys 
    770                 775                 780                 


Ser Lys Ser Lys Lys Arg Ser Ser Lys Ser Lys Ser Lys Ser Lys Ser 
785                 790                 795                 800 


Lys Ser Ser Lys Lys Gly Lys Lys Ser Ser Gly Arg Lys Gly Thr Ser 
                805                 810                 815     


Thr Arg Arg Val Ser Gly Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 
            820                 825                 830         


Ala Ala Ala Ala Glu Ala Lys Ala Ala Ala Glu Ala Ala Glu Ala Glu 
        835                 840                 845             


Thr Glu Ala Lys Asn Gly Asp Gly Glu Asn Glu Asn Asn Glu Asp Asp 
    850                 855                 860                 


Asn Asn Asp Glu Ala Leu Gly Leu Leu Ala Asp Val Pro Asp Met Asp 
865                 870                 875                 880 


Gly Asp Asp Asp Glu Glu Asp Asp Asp Ala Asn Thr Glu Ala Lys Ala 
                885                 890                 895     


Thr Pro Glu Asp Gly Val Pro Leu Ser Glu Ala Asp Lys Ala Lys Ala 
            900                 905                 910         


Ile Ala Glu Ala Ala Gly Val Thr Phe Leu Thr Pro Ala Asn Gly Val 
        915                 920                 925             


Arg Ala Gly Asp Asp Arg Met Leu Ala Glu Asp Leu Glu Arg Arg Lys 
    930                 935                 940                 


Leu Thr Lys Leu Ala Ala Ser Gly Asn Leu Pro Tyr Phe Glu Gly Asp 
945                 950                 955                 960 


Tyr Trp Pro Gln Glu Ala Glu Glu Leu Ala Lys Glu Arg Ala Arg Gln 
                965                 970                 975     


Lys Lys Lys Asp Glu Asp Gly Lys Gly Gly Arg Ser Lys Arg Lys Arg 
            980                 985                 990         


Arg Arg Ala Gly Glu Glu Ala Glu  Glu Lys Thr Glu Glu  Ala Glu Pro 
        995                 1000                 1005             


Glu Glu  Lys Pro Val Ala Glu  Leu Ala Val Glu Leu  Ser Glu Glu 
    1010                 1015                 1020             


Glu Ala  Val Glu Ala Arg Val  Gln Leu Met Gln Arg  Leu Ala Gln 
    1025                 1030                 1035             


Gln Leu  Glu Val Met Lys Asp  Asp Phe Leu Val Val  Lys Met Ala 
    1040                 1045                 1050             


His Glu  Cys Ser Arg Cys Gly  Lys Tyr Leu Leu Arg  Gly Arg Trp 
    1055                 1060                 1065             


Glu Cys  Arg His Pro Ser Cys  Leu Glu Glu Phe Gly  Phe Asn Lys 
    1070                 1075                 1080             


Ser Cys  Pro Phe Ala Leu Cys  Asn Ser Cys Tyr Ile  Leu Glu Ser 
    1085                 1090                 1095             


Lys Arg  Pro Lys Glu Gln Gln  His Gly Gly Gly Cys  Ile Ala Asp 
    1100                 1105                 1110             


Gly Glu  Ala Gly Arg Pro Lys  Ser Val Ser Glu Glu  Ala Asp Lys 
    1115                 1120                 1125             


Asp Ala  Gly Lys Asp Gln Glu  Val Ile Asp Val Asn  Val Glu Phe 
    1130                 1135                 1140             


Lys Arg  Lys Glu Arg Glu Arg  Glu Glu Lys Asp Ala  Arg Glu Lys 
    1145                 1150                 1155             


Ala Lys  Arg Asp Ala Glu Ile  Lys Lys Lys Lys Glu  Ala Arg Glu 
    1160                 1165                 1170             


Lys Ala  Asn Gly Asn Pro Asp  Gln Lys Lys Val Lys  Val Glu Ala 
    1175                 1180                 1185             


Ser Ala  Ala Asp Ala Lys Glu  Ala Thr Ala Gly Asn  Ser Ser Ala 
    1190                 1195                 1200             


Pro Ser  Ala Leu Gly Asp Thr  Ser Lys Ser Leu Gly  Ser Glu Val 
    1205                 1210                 1215             


Ala Asn  Leu Glu Met Lys Lys  Glu Pro Gly Asp Asn  Thr Lys Ser 
    1220                 1225                 1230             


Val Ser  Ala Ser Gly Ser Val  Lys Arg Ser Ala Pro  Glu Ser Thr 
    1235                 1240                 1245             


Ser Val  Val Lys Pro Ile Ser  Ser Ser Ser Val Ser  Glu Thr Lys 
    1250                 1255                 1260             


Val Asp  Pro Pro Ala Lys Pro  Asn Gly Leu Ser Arg  Ala Lys Glu 
    1265                 1270                 1275             


Val Asn  Asn Glu Ala Lys Lys  Ala Glu Ser Ser Pro  Ala Ala Ala 
    1280                 1285                 1290             


Val Glu  Val Asp Glu Gln Ser  Gln Gln Val Val Lys  Lys Gln Lys 
    1295                 1300                 1305             


Val Ser  Glu Thr Ser Ser Lys  Val Ile Glu Thr Ser  Ala Asn Ser 
    1310                 1315                 1320             


Ser Ser  Thr Thr Gln Ala Gly  Asp Lys Thr Lys Ser  Gln Asp Glu 
    1325                 1330                 1335             


Lys Gly  Thr Thr Asn Lys Ala  Glu Asp Gly Lys Lys  His Ser Asp 
    1340                 1345                 1350             


Ala Glu  Asp Glu Asp Lys Lys  Lys Glu Leu Leu Ala  Glu Trp Glu 
    1355                 1360                 1365             


Lys Gln  Ser Ile Glu Cys Thr  Arg Ala Pro Val His  Ser Pro Ser 
    1370                 1375                 1380             


Glu Asp  His Ile Leu His Tyr  Val Asp Glu Asn Leu  Pro Val Gln 
    1385                 1390                 1395             


Thr Pro  Asp Tyr Asp Lys Ile  Ile Lys Asn His Leu  Leu Glu Ser 
    1400                 1405                 1410             


Arg His  Ala Phe Leu Ser Leu  Cys Thr Gly Asn Arg  Tyr Gln Phe 
    1415                 1420                 1425             


Asp Gln  Gln Arg Arg Ala Lys  His Ser Thr Met Met  Val Leu Tyr 
    1430                 1435                 1440             


His Leu  His Asn Pro Asp Ala  Pro Ala His Leu Tyr  Thr Cys Phe 
    1445                 1450                 1455             


Glu Cys  His Asn Asp Ile Leu  Thr Gly Lys Arg Tyr  His Cys Asp 
    1460                 1465                 1470             


Val Cys  Asn Gly Gly Asp Tyr  Asp Ile Cys Ile His  Cys Lys Arg 
    1475                 1480                 1485             


Gln Thr  Arg His Asp His Thr  Leu Thr Pro Phe Val  Val Thr Arg 
    1490                 1495                 1500             


Gly Val  Gln Ala Glu Thr Ser  Glu Ser Gln Arg Met  Gln Arg Val 
    1505                 1510                 1515             


Gln Glu  Met Gln Arg Ala Arg  Gln His Ser Leu Thr  Leu Phe Leu 
    1520                 1525                 1530             


Asp Ala  Leu Val His Ser Ser  Gln Cys Asp Asp Pro  Gln Cys Thr 
    1535                 1540                 1545             


Lys Ala  Pro Cys Lys Lys Met  Lys Asp Leu Leu Lys  His Arg Met 
    1550                 1555                 1560             


Thr Cys  Glu Val Arg Val Arg  Gly Gly Cys Glu Ile  Cys Arg Arg 
    1565                 1570                 1575             


Val Leu  Cys Leu Val Gln Met  His Ala Arg Asn Cys  Thr Thr Val 
    1580                 1585                 1590             


Asn Cys  Arg Val Pro His Cys  Glu Asp Leu Lys Val  His Ile Asn 
    1595                 1600                 1605             


Lys His  Lys Gln Gln Met Gln  Leu Ala Arg Gln Pro  Ala Gly Asp 
    1610                 1615                 1620             


Ala Ala  Pro Gly Ala Ser Ala  Ser Thr Ala Ala Pro  Ala Ala Arg 
    1625                 1630                 1635             


Ser Gln  Gln Gln Pro Gln Gln  Gln Pro Gln Gln Leu  Thr Gln Gln 
    1640                 1645                 1650             


Gln Leu  Gln His Gln His Gln  Leu Leu Gln Gln Arg  Gln Arg Gln 
    1655                 1660                 1665             


Leu Gln  Ala Gln Ala Gln Ala  Leu Ala Gln Ala Gln  Ala Arg Gly 
    1670                 1675                 1680             


Ala Arg  Gly Asn Arg Ala Pro  Arg Thr Val Gly Ala  Ala Ala Gln 
    1685                 1690                 1695             


Ala Ile  Thr Gln Ala Gly Gln  Gln Ile Gln Ala Thr  Val Val Glu 
    1700                 1705                 1710             


Gly Ser  Gly Gly Thr Lys Ile  Lys Ile Arg Pro Thr  Asn Leu Lys 
    1715                 1720                 1725             


Pro Ser  Asn Thr Thr Ala Pro  Pro Ala Ser Gly Ser  Asn Ser Arg 
    1730                 1735                 1740             


Ala Pro  Arg Gly Gln Arg Asn  Ala Arg Arg 
    1745                 1750             


<210>  45
<211>  3057
<212>  DNA
<213>  Schizochytrium limacinum


<220>
<221>  misc_feature
<223>  Encodes polypeptide of SEQ ID NO:26

<400>  45
atgaaagctt tatggaaaca tattgcaaag tgtaaggata agcagtgtca gttcccccat       60

tgtgtctctt cgcgctacgt tttgtcacac taccatcgat gcaagaaccc caagtgtgag      120

gtttgccgtc ccgtgaagga cgctattcag aaacaacaag agagcagcgg aatgcccaat      180

cgcgggaatc ctcgcccccc tcatccacct acaggtgtgt tacagtccgg cggcagtatg      240

ccaccacact ctgcacatcc gggacatcga cctggtgtga gtggctcctc aatgtacaaa      300

tcgtcatcgc caccccctcc tccttctgga tcgagtgcgg atgcctccca aagggccctc      360

attgagaaac ttgagcgaga gcgcaaggct gcggaggatg ccgctcgtag gcaaactctg      420

aaggtacaac agcttgaaaa acaaatgcag gatttgcaaa gacaagctgc gcagataaaa      480

ccggctgagt tgcgcaccaa actgactccg ctactgcgga aacagatgga cttgcagttt      540

gcctacatct tcctcaaacc agtggatccc atcgcaatgg aaattcctga ctactttgat      600

gtagtcaaga accctatgga tttgactaca atcaagcgtc gcctcgactc cagctggtac      660

aagaccatga agtcctttgc cagcgacgtt cttttggtat atgataatgc aatcctttat      720

aaccctgtaa caccagatgg atacggcgtg aatgagacgg cgcgagaata tgcccaaatt      780

ttcattgacg actacaacaa gttactgctc aaattaaagg atgaggagtc gaagaagcga      840

actaatgccg aagcttgtag gctctgcggt gggcgacagt tcctttttga gcccccagtc      900

tactattgcc attcatgcaa ccaaaagatc cgtcgtgggg ctcactatta tccatctcct      960

gatgggaaga tgtattggtg tgttacatgt tatggcagtc tccgcactcc aattgagttg     1020

gaggatggta ctactgtgga aaagtcttct ttggagaaaa agaagaactc cgatgagtct     1080

gaagaatcat gggttcagtg taaccagtgc aaccggtggt atcaccagat ttgtgccatg     1140

ttcaatgggc gcaatgaaga agcaaaacag agtcaatact tctgcccaat gtgtattctt     1200

cggcaccttg acaaggctcg tctggaccgt atccctgacc acattgcaac agcaaaaggc     1260

aaaggtttcc gcgcaaagga cttgccacgt actaagttta gcgacttcat cgaggagcgt     1320

ttagtggggc gaattctgga cgagcgcaaa cgcgaagcaa agaagcagaa tcttctgctc     1380

ggggacatcc ctgtccctgg cgaattaact attcgtgtag tattgaacaa ggaaactgaa     1440

gtgcttcccc gccagaacct cgaacgctta tacaaagatc ctccttacaa ctacccacgc     1500

tcctttccgc accgcgtaaa gtgtgtcctt ctctttcaga atattgatgg tgttgatgtg     1560

ctcatctttg cactctacac gcagacatat gggtcggatt gccctgagcc taatgcccgt     1620

acattgtaca ttgcgtatct tgactctgtg ttttaccttg aacctcggtt cttgcgtaca     1680

ccgatttacc atgagcttct tctcgctact ttcgaatatg aaaagcgccg tggtatcacg     1740

aagtccttta tttgggcttg tccacccatg gctggtgatg actacatcct gtactgtcac     1800

cctcgtgaac agaggactca aaaggttgat atgcttcgat cttggtactg gattctcctc     1860

gagcaagcac gtaaagaaca cattgtctgc tctgttgaca atctcttcga tgcttacttt     1920

cgccgtgttt gcagtccttg tggtgtccct aattttgaag gtgactactg gccaggtgta     1980

acagaacagt atatcacaga tctcgaaaag gagaagggtc gcactgctgc tgccaagaag     2040

tcaaaagcga agtccaagag taagatgcgt actcgtccta atgatcgtaa gggttctcaa     2100

attaaggagg aagcaattga ggaagaggaa gaggaagaag acgaccctct atggcctcct     2160

ccccagcctg caaagtgggt tgagatccca cagcaggatg ctcttacagc aaagattgga     2220

gaatatctga agagtaccaa agaagacttc tttgttgttt actttcacca tatttgtgca     2280

aattgtgcgg ttcgcattga ccagccagat cagctattct ggttgccacg tcgatacaag     2340

gaaggtatgg gaaagaacaa gactgcggca aatggtatgg cgggtgctac atccaattca     2400

gcagcccaag gtaaacctcc tgctgaaagt actgcttcgg atccgctgat ggataatcag     2460

ttctttgaca ctcgtcagca gttcctttct ctttgccaag gtaaccatta ccagtttgat     2520

cagctgcgtc gggctaagca cagcagtatg atggtgttat accatctgca caaccctgac     2580

gagcctggtt ttgttactac ttgtaacact tgctcgcaag aaattaagga tgattcctgg     2640

tataagtgta ctgtctgcga ggactttgac tcatgcaata attgccataa aactagaccg     2700

cacccgcacc cgatgaaaat taccgagcag aagcgctcta cagcagaccg caagaagaac     2760

agcagccgtg ctcagaacgt caaattacat atggagcttc tggcccatgc agcgggttgt     2820

actaatgatc cttgcgagca gtacagcaac tgcgcgaaga tgaaggcatt gttgaaccat     2880

ggcaagacat gtaaggttcg cttgcaaggc aagtgtcttg tatgtcgtcg aatctgggtt     2940

cttttacaga ttcatgctcg gaaatgtcgt atcccgatgg gtcgttgccc tgtgcctcgc     3000

tgtgcagata ttcgcactca gatccgtcgc gcgcaggctg ccatgtcaga tcgccgt        3057


<210>  46
<211>  1019
<212>  PRT
<213>  Schizochytrium limacinum


<220>
<221>  misc_feature
<223>  Translation product 12739

<400>  46

Met Lys Ala Leu Trp Lys His Ile Ala Lys Cys Lys Asp Lys Gln Cys 
1               5                   10                  15      


Gln Phe Pro His Cys Val Ser Ser Arg Tyr Val Leu Ser His Tyr His 
            20                  25                  30          


Arg Cys Lys Asn Pro Lys Cys Glu Val Cys Arg Pro Val Lys Asp Ala 
        35                  40                  45              


Ile Gln Lys Gln Gln Glu Ser Ser Gly Met Pro Asn Arg Gly Asn Pro 
    50                  55                  60                  


Arg Pro Pro His Pro Pro Thr Gly Val Leu Gln Ser Gly Gly Ser Met 
65                  70                  75                  80  


Pro Pro His Ser Ala His Pro Gly His Arg Pro Gly Val Ser Gly Ser 
                85                  90                  95      


Ser Met Tyr Lys Ser Ser Ser Pro Pro Pro Pro Pro Ser Gly Ser Ser 
            100                 105                 110         


Ala Asp Ala Ser Gln Arg Ala Leu Ile Glu Lys Leu Glu Arg Glu Arg 
        115                 120                 125             


Lys Ala Ala Glu Asp Ala Ala Arg Arg Gln Thr Leu Lys Val Gln Gln 
    130                 135                 140                 


Leu Glu Lys Gln Met Gln Asp Leu Gln Arg Gln Ala Ala Gln Ile Lys 
145                 150                 155                 160 


Pro Ala Glu Leu Arg Thr Lys Leu Thr Pro Leu Leu Arg Lys Gln Met 
                165                 170                 175     


Asp Leu Gln Phe Ala Tyr Ile Phe Leu Lys Pro Val Asp Pro Ile Ala 
            180                 185                 190         


Met Glu Ile Pro Asp Tyr Phe Asp Val Val Lys Asn Pro Met Asp Leu 
        195                 200                 205             


Thr Thr Ile Lys Arg Arg Leu Asp Ser Ser Trp Tyr Lys Thr Met Lys 
    210                 215                 220                 


Ser Phe Ala Ser Asp Val Leu Leu Val Tyr Asp Asn Ala Ile Leu Tyr 
225                 230                 235                 240 


Asn Pro Val Thr Pro Asp Gly Tyr Gly Val Asn Glu Thr Ala Arg Glu 
                245                 250                 255     


Tyr Ala Gln Ile Phe Ile Asp Asp Tyr Asn Lys Leu Leu Leu Lys Leu 
            260                 265                 270         


Lys Asp Glu Glu Ser Lys Lys Arg Thr Asn Ala Glu Ala Cys Arg Leu 
        275                 280                 285             


Cys Gly Gly Arg Gln Phe Leu Phe Glu Pro Pro Val Tyr Tyr Cys His 
    290                 295                 300                 


Ser Cys Asn Gln Lys Ile Arg Arg Gly Ala His Tyr Tyr Pro Ser Pro 
305                 310                 315                 320 


Asp Gly Lys Met Tyr Trp Cys Val Thr Cys Tyr Gly Ser Leu Arg Thr 
                325                 330                 335     


Pro Ile Glu Leu Glu Asp Gly Thr Thr Val Glu Lys Ser Ser Leu Glu 
            340                 345                 350         


Lys Lys Lys Asn Ser Asp Glu Ser Glu Glu Ser Trp Val Gln Cys Asn 
        355                 360                 365             


Gln Cys Asn Arg Trp Tyr His Gln Ile Cys Ala Met Phe Asn Gly Arg 
    370                 375                 380                 


Asn Glu Glu Ala Lys Gln Ser Gln Tyr Phe Cys Pro Met Cys Ile Leu 
385                 390                 395                 400 


Arg His Leu Asp Lys Ala Arg Leu Asp Arg Ile Pro Asp His Ile Ala 
                405                 410                 415     


Thr Ala Lys Gly Lys Gly Phe Arg Ala Lys Asp Leu Pro Arg Thr Lys 
            420                 425                 430         


Phe Ser Asp Phe Ile Glu Glu Arg Leu Val Gly Arg Ile Leu Asp Glu 
        435                 440                 445             


Arg Lys Arg Glu Ala Lys Lys Gln Asn Leu Leu Leu Gly Asp Ile Pro 
    450                 455                 460                 


Val Pro Gly Glu Leu Thr Ile Arg Val Val Leu Asn Lys Glu Thr Glu 
465                 470                 475                 480 


Val Leu Pro Arg Gln Asn Leu Glu Arg Leu Tyr Lys Asp Pro Pro Tyr 
                485                 490                 495     


Asn Tyr Pro Arg Ser Phe Pro His Arg Val Lys Cys Val Leu Leu Phe 
            500                 505                 510         


Gln Asn Ile Asp Gly Val Asp Val Leu Ile Phe Ala Leu Tyr Thr Gln 
        515                 520                 525             


Thr Tyr Gly Ser Asp Cys Pro Glu Pro Asn Ala Arg Thr Leu Tyr Ile 
    530                 535                 540                 


Ala Tyr Leu Asp Ser Val Phe Tyr Leu Glu Pro Arg Phe Leu Arg Thr 
545                 550                 555                 560 


Pro Ile Tyr His Glu Leu Leu Leu Ala Thr Phe Glu Tyr Glu Lys Arg 
                565                 570                 575     


Arg Gly Ile Thr Lys Ser Phe Ile Trp Ala Cys Pro Pro Met Ala Gly 
            580                 585                 590         


Asp Asp Tyr Ile Leu Tyr Cys His Pro Arg Glu Gln Arg Thr Gln Lys 
        595                 600                 605             


Val Asp Met Leu Arg Ser Trp Tyr Trp Ile Leu Leu Glu Gln Ala Arg 
    610                 615                 620                 


Lys Glu His Ile Val Cys Ser Val Asp Asn Leu Phe Asp Ala Tyr Phe 
625                 630                 635                 640 


Arg Arg Val Cys Ser Pro Cys Gly Val Pro Asn Phe Glu Gly Asp Tyr 
                645                 650                 655     


Trp Pro Gly Val Thr Glu Gln Tyr Ile Thr Asp Leu Glu Lys Glu Lys 
            660                 665                 670         


Gly Arg Thr Ala Ala Ala Lys Lys Ser Lys Ala Lys Ser Lys Ser Lys 
        675                 680                 685             


Met Arg Thr Arg Pro Asn Asp Arg Lys Gly Ser Gln Ile Lys Glu Glu 
    690                 695                 700                 


Ala Ile Glu Glu Glu Glu Glu Glu Glu Asp Asp Pro Leu Trp Pro Pro 
705                 710                 715                 720 


Pro Gln Pro Ala Lys Trp Val Glu Ile Pro Gln Gln Asp Ala Leu Thr 
                725                 730                 735     


Ala Lys Ile Gly Glu Tyr Leu Lys Ser Thr Lys Glu Asp Phe Phe Val 
            740                 745                 750         


Val Tyr Phe His His Ile Cys Ala Asn Cys Ala Val Arg Ile Asp Gln 
        755                 760                 765             


Pro Asp Gln Leu Phe Trp Leu Pro Arg Arg Tyr Lys Glu Gly Met Gly 
    770                 775                 780                 


Lys Asn Lys Thr Ala Ala Asn Gly Met Ala Gly Ala Thr Ser Asn Ser 
785                 790                 795                 800 


Ala Ala Gln Gly Lys Pro Pro Ala Glu Ser Thr Ala Ser Asp Pro Leu 
                805                 810                 815     


Met Asp Asn Gln Phe Phe Asp Thr Arg Gln Gln Phe Leu Ser Leu Cys 
            820                 825                 830         


Gln Gly Asn His Tyr Gln Phe Asp Gln Leu Arg Arg Ala Lys His Ser 
        835                 840                 845             


Ser Met Met Val Leu Tyr His Leu His Asn Pro Asp Glu Pro Gly Phe 
    850                 855                 860                 


Val Thr Thr Cys Asn Thr Cys Ser Gln Glu Ile Lys Asp Asp Ser Trp 
865                 870                 875                 880 


Tyr Lys Cys Thr Val Cys Glu Asp Phe Asp Ser Cys Asn Asn Cys His 
                885                 890                 895     


Lys Thr Arg Pro His Pro His Pro Met Lys Ile Thr Glu Gln Lys Arg 
            900                 905                 910         


Ser Thr Ala Asp Arg Lys Lys Asn Ser Ser Arg Ala Gln Asn Val Lys 
        915                 920                 925             


Leu His Met Glu Leu Leu Ala His Ala Ala Gly Cys Thr Asn Asp Pro 
    930                 935                 940                 


Cys Glu Gln Tyr Ser Asn Cys Ala Lys Met Lys Ala Leu Leu Asn His 
945                 950                 955                 960 


Gly Lys Thr Cys Lys Val Arg Leu Gln Gly Lys Cys Leu Val Cys Arg 
                965                 970                 975     


Arg Ile Trp Val Leu Leu Gln Ile His Ala Arg Lys Cys Arg Ile Pro 
            980                 985                 990         


Met Gly Arg Cys Pro Val Pro Arg  Cys Ala Asp Ile Arg  Thr Gln Ile 
        995                 1000                 1005             


Arg Arg  Ala Gln Ala Ala Met  Ser Asp Arg Arg 
    1010                 1015                 


<210>  47
<211>  11263
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  vector pSGE-6206

<400>  47
gcggccgccg tatggtcgac ggttgctcgg atgggggggg cggggagcga tggagggagg       60

aagatcaggt aaggtctcga cagactagag aagcacgagt gcaggtataa gaaacagcaa      120

aaaaaagtaa tgggcccagg cctggagagg gtatttgtct tgtttttctt tggccaggaa      180

cttgttctcc tttcttcgtt tctaggaccc cgatccccgc tcgcatttct ctcttcctca      240

gccgaagcgc agcggtaaag catccatttt atcccaccga aagggcgctc ccagccttcg      300

tcgagcggaa ccggggttac agtgcctcaa ccctcccaga cgtagccaga gggaagcaac      360

tccctgatgc caaccgctgt gggctgccca tcggaatctt tgacaattgc cttgatcccc      420

gggtgcaagt caagcagcac ctgccgacat cgcccgcacg gagacagaat gccgcggttt      480

tcgttcccga tggccactat gcacgtcaga tttccggcag cagccgcagc ggccgttccg      540

aggaccacga gctccgcgca tggccctccg gtgaaatgat atacattcac gccggtaaag      600

atccgaccgt cggacgagag ggctgcactg gccaccgagt agtcctcgct aataggtatg      660

ctgttgatgg tcgcagttgc acgttcgatc agcgtggatt cctcttggga taaaggcttg      720

gccatcgagc tcggtacccg gggatccatg attgttgtat tatgtaccta tgtttgtgat      780

gagacaataa atatgagaag agaacgttgc ggccactttt ttctccttcc ttcgcgtgct      840

catgttggtg gtttgggagg cagaagatgc atggagcgcc acacattcgg taggacgaaa      900

cagcctcccc cacaaaggga ccatgggtag ctaggatgac gcacaagcga gttcccgctc      960

tcgaagggaa acccaggcat ttccttcctc ttttcaagcc acttgttcac gtgtcaacac     1020

aattttggac taaaatgccc ctcggaactc ggcaggcctc cctctgctcc gttgtcctgg     1080

tcgccgagaa cgcgagaccg tgccgcatgc catcgatctg ctcgtctgta ctactaatcg     1140

tgtgcgtgtt cgtgcttgtt tcgcacgaaa ttgtcctcgt tcggccctca caacggtgga     1200

aatcggtgct agaataaagt gaggtggctt atttcaatgg cggccgtcat catgcgggat     1260

caactgaagt acggcgggtt ctcgagattt catcgtgctc gtccagagca ggtgttttgc     1320

ctgcagctct tcatgtttag gggtcatgat ttcatctgat atgccgtaag aaaaccaata     1380

ttcacttctc aattttccat ggaaaggtga aggcctaggt tgtgtgcgag gcaacgactg     1440

gggagggatc gcaacattct tgctaacctc ccctctatct tggccgctgt gaatcggcat     1500

atttaccggg ctgaattgag aaagtgtttt gagggaatta aaaggtggct gtcttgcaag     1560

cttggcttca gtgcctgctt aattcgaacc gatccagctt gtgatgaggc cttcctaagc     1620

ctggtagtca gaagcgacat ggcgctataa atttcgtctc agttggagag tagaaaagca     1680

tgattcgaac acggttttca actgccaaag atatctccat tgtttccttc aatctgtaca     1740

cctgcacggt gcaccagttg gtacggcata ttatggttta ataagcatac atcatatgaa     1800

tacaattcag cttaaattta tcatacaaag atgtaagtgc agcgtgggtc tgtaacgatc     1860

gggcgtaatt taagataatg cgagggaccg ggggaggttt tggaacggaa tgaggaatgg     1920

gtcatggccc ataataataa tatgggtttg gtcgcctcgc acagcaaccg tacgtgcgaa     1980

aaaggaacag atccatttaa taagttgaac gttattcttt cctatgcaat gcgtgtatcg     2040

gaggcgagag caagtcatag gtggctgcgc acaataattg agtctcagct gagcgccgtc     2100

cgcgggtggt gtgagtggtc atcctcctcc cggcctatcg ctcacatcgc ctctcaatgg     2160

tggtggtggg gcctgatatg acctcaatgc cgacccatat taaaacccag taaagcattc     2220

accaacgaac gaggggctct tttgtgtgtg ttttgagtat gattttacac ctctttgtgc     2280

atctctctgg tcttccttgg ttcccgtagt ttgggcatca tcactcacgc ttccctcgac     2340

cttcgttctt cctttacaac cccgacacag gtcagagttg gagtaatcaa aaaaggggtg     2400

cacgaatgag atacattaga ttttgacaga tatcctttta ctggagaggg ttcaagggat     2460

caaatgaaca gcgggcgttg gcaatctagg gagggatcgg aggttggcag cgagcgaaag     2520

cgtgtccatc cttttggctg tcacacctca cgaaccaact gttagcaggc cagcacagat     2580

gacatacgag aatctttatt atatcgtaga ccttatgtgg atgacctttg gtgctgtgtg     2640

tctggcaatg aacctgaagg cttgataggg aggtggctcc cgtaaaccct ttgtcctttc     2700

cacgctgagt ctcccccgca ctgtccttta tacaaattgt tacagtcatc tgcaggcggt     2760

ttttctttgg caggcaaaga tgcccaagaa aaagcggaag gtcggcgact acaaggatga     2820

cgatgacaag ttggagcctg gagagaagcc ctacaaatgc cctgagtgcg gaaagagctt     2880

cagccaatct ggagccttga cccggcatca acgaacgcat acacgagaca agaagtactc     2940

catcgggctg gacatcggga cgaactccgt gggatgggcc gtgatcacag acgaatacaa     3000

ggtgccttcc aagaagttca aggtgctggg gaacacggac agacactcca tcaagaagaa     3060

cctcatcggg gccttgctct tcgactccgg agaaaccgcc gaagcaacgc gattgaaaag     3120

aaccgccaga agacgataca cacgacggaa gaaccgcatc tgctacctcc aggagatctt     3180

cagcaacgag atggccaagg tggacgactc gttctttcat cgcctggagg agagcttcct     3240

ggtggaggaa gacaagaaac atgagcgcca cccgatcttc gggaacatcg tggacgaagt     3300

ggcctaccac gagaaatacc ccacgatcta ccacttgcgc aagaaactcg tggactccac     3360

ggacaaagcg gacttgcggt tgatctactt ggccttggcc cacatgatca aatttcgggg     3420

ccacttcctg atcgagggcg acttgaatcc cgacaattcc gacgtggaca agctcttcat     3480

ccagctggtg cagacctaca accagctctt cgaggagaac cccatcaatg cctccggagt     3540

ggacgccaaa gccatcttgt ccgcccgatt gtccaaatcc agacgcttgg agaacttgat     3600

cgcacaactt cctggcgaga agaagaacgg cctcttcggc aacttgatcg cgctgtcgct     3660

gggattgacg cctaacttca agtccaactt cgacttggcc gaggacgcca agttgcaact     3720

gtccaaggac acctacgacg acgacctcga caacctgctg gcccaaattg gcgaccaata     3780

cgcggacttg tttttggcgg ccaagaactt gagcgacgcc atcttgttga gcgacatctt     3840

gcgcgtgaat acggagatca ccaaagcccc tttgtccgcc tctatgatca agcggtacga     3900

cgagcaccac caagacttga ccctgttgaa agccctcgtg cggcaacaat tgcccgagaa     3960

gtacaaggag atcttcttcg accagtccaa gaacgggtac gccggctaca tcgacggagg     4020

agcctcccaa gaagagttct acaagttcat caagcccatc ctggagaaga tggacggcac     4080

cgaggagttg ctcgtgaagc tgaaccgcga agacttgttg cgaaaacagc ggacgttcga     4140

caatggcagc atcccccacc aaatccattt gggagagttg cacgccatct tgcgacggca     4200

agaggacttc tacccgttcc tgaaggacaa ccgcgagaaa atcgagaaga tcctgacgtt     4260

cagaatcccc tactacgtgg gacccttggc ccgaggcaat tcccggtttg catggatgac     4320

gcgcaaaagc gaagagacga tcaccccctg gaacttcgaa gaagtggtcg acaaaggagc     4380

atccgcacag agcttcatcg agcgaatgac gaacttcgac aagaacctgc ccaacgagaa     4440

ggtgttgccc aagcattcgc tgctgtacga gtacttcacg gtgtacaacg agctgaccaa     4500

ggtgaagtac gtgaccgagg gcatgcgcaa acccgcgttc ctgtcgggag agcaaaagaa     4560

ggccattgtg gacctgctgt tcaagaccaa ccggaaggtg accgtgaaac agctgaaaga     4620

ggactacttc aagaagatcg agtgcttcga ctccgtggag atctccggcg tggaggaccg     4680

attcaatgcc tccttgggaa cctaccatga cctcctgaag atcatcaagg acaaggactt     4740

cctggacaac gaggagaacg aggacatcct ggaggacatc gtgctgaccc tgaccctgtt     4800

cgaggaccga gagatgatcg aggaacggtt gaaaacgtac gcccacttgt tcgacgacaa     4860

ggtgatgaag cagctgaaac gccgccgcta caccggatgg ggacgattga gccgcaaact     4920

gattaatgga attcgcgaca agcaatccgg aaagaccatc ctggacttcc tgaagtccga     4980

cgggttcgcc aaccgcaact tcatgcagct catccacgac gactccttga ccttcaagga     5040

ggacatccag aaggcccaag tgtccggaca aggagactcc ttgcacgagc acatcgccaa     5100

tttggccgga tcccccgcaa tcaaaaaagg catcttgcaa accgtgaaag tggtcgacga     5160

actggtgaag gtgatgggac ggcacaagcc cgagaacatc gtgatcgaaa tggcccgcga     5220

gaaccaaacc acccaaaaag gacagaagaa ctcccgagag cgcatgaagc ggatcgaaga     5280

gggcatcaag gagttgggct cccagatcct gaaggagcat cccgtggaga atacccaatt     5340

gcaaaacgag aagctctacc tctactacct ccagaacggg cgggacatgt acgtcgacca     5400

agagctggac atcaaccgcc tctccgacta cgatgtggat catattgtgc cccagagctt     5460

cctcaaggac gacagcatcg acaacaaggt cctgacgcgc agcgacaaga accggggcaa     5520

gtctgacaat gtgccttccg aagaagtcgt gaagaagatg aagaactact ggcggcagct     5580

gctcaacgcc aagctcatca cccaacggaa gttcgacaac ctgaccaagg ccgagagagg     5640

aggattgtcc gagttggaca aagccggctt cattaaacgc caactcgtgg agacccgcca     5700

gatcacgaag cacgtggccc aaatcttgga ctcccggatg aacacgaaat acgacgagaa     5760

tgacaagctg atccgcgagg tgaaggtgat cacgctgaag tccaagctgg tgagcgactt     5820

ccggaaggac ttccagttct acaaggtgcg ggagatcaac aactaccatc acgcccatga     5880

cgcctacctg aacgccgtgg tcggaaccgc cctgatcaag aaatacccca agctggagtc     5940

cgaattcgtg tacggagatt acaaggtcta cgacgtgcgg aagatgatcg cgaagtccga     6000

gcaggagatc ggcaaagcca ccgccaagta cttcttttac tccaacatca tgaacttctt     6060

caagaccgag atcacgctcg ccaacggcga gatccgcaag cgccccctga tcgagaccaa     6120

cggcgagacg ggagagattg tgtgggacaa aggaagagat tttgccacag tgcgcaaggt     6180

gctgtccatg cctcaggtga acatcgtgaa gaagaccgag gtgcaaacag gagggttttc     6240

caaagagtcc attttgccta agaggaattc cgacaagctc atcgcccgca agaaggactg     6300

ggaccccaag aagtacgggg gcttcgactc ccccacggtg gcctactccg tgttggtggt     6360

ggccaaagtg gagaaaggga agagcaagaa gctgaaatcc gtgaaggagt tgctcggaat     6420

cacgatcatg gaacgatcgt cgttcgagaa aaaccccatc gacttcctcg aagccaaagg     6480

gtacaaagag gtgaagaagg acctgatcat caagctgccc aagtactccc tgttcgagct     6540

ggagaacggc cgcaagcgga tgctggcctc cgccggggaa ctgcagaaag ggaacgaatt     6600

ggccttgccc tccaaatacg tgaacttcct ctacttggcc tcccattacg aaaagctcaa     6660

aggatcccct gaggacaatg agcagaagca actcttcgtg gaacaacaca agcactacct     6720

ggacgagatc atcgagcaga tcagcgagtt ctccaagcgc gtgatcctcg ccgacgccaa     6780

cctggacaag gtgctctccg cctacaacaa gcaccgcgac aagcctatcc gcgagcaagc     6840

cgagaatatc attcacctgt ttaccctgac gaatttggga gcccctgccg cctttaaata     6900

ctttgacacc accatcgacc gcaaaagata cacctccacc aaggaagtct tggacgccac     6960

cctcatccac cagtccatca cgggcctcta cgagacgcgc atcgacctct cccaattggg     7020

cggcgactaa agtgatgcgg cctttaggaa acaccacaaa agtaattgac aatctcagga     7080

acgatctgcg tgtttacagc ttcccaaata acaattatac cacgtaccaa aaggggttta     7140

atgtatctca caaattcttc taataggtac agcttctcaa attgggtgta tgatgtgaca     7200

cttcgtctca cacacgtcac gataattcag cgtatggctt cccttcatca cattcacgca     7260

aacttctaca caaccctggg catatttctt gtgttggcaa cactcccgaa atcgattctg     7320

cacacaatgg ttcattcaat gattcaagta cgttttagac ggactaggca gtttaattaa     7380

aaacatctat cctccagatc accagggcca gtgaggccgg cataaaggac ggcaaggaaa     7440

gaaaagaaag aaagaaaagg acacttatag catagtttga agttataagt agtcgcaatc     7500

tgtgtgcagc cgacagatgc tttttttttc cgtttggcag gaggtgtagg gatgtcgaag     7560

accagtccag ctagtatcta tcctacaagt caatcatgct gcgacaaaaa tttctcgcac     7620

gaggcctctc gataaacaaa actttaaaag cacacttcat tgtcatgcag agtaataact     7680

cttccgcgtc gatcaattta tcaatctcta tcatttccgc ccctttcctt gcatagagca     7740

agaaaagcga cccggatgag gataacatgt cctgcgccag tagtgtggca ttgcctgtct     7800

ctcatttaca cgtactgaaa gcataatgca cgcgcatacc aatatttttc gtgtacggag     7860

atgaagagac gcgacacgta agatcacgag aaggcgagca cggttgccaa tggcagacgc     7920

gctagtctcc attatcgcgt tgttcggtag cttgctgcat gtcttcagtg gcactatatc     7980

cactctgcct cgtcttctac acgagggcca catcggtgca agttcgaaaa atcatatctc     8040

aatcttcaga tcctttccag aaacggtgct caggcgggaa agtgaaggtt ttctactcta     8100

gtggctaccc caattctctc cgactgtcgc agacggtcct tcgttgcgca cgcaccgcgc     8160

actacctctg aaattcgaca accgaagttc aattttacat ctaacttctt tcccattctc     8220

tcaccaaaag cctagcttac atgttggaga gcgacgagag cggcctgccc gccatggaga     8280

tcgagtgccg catcaccggc accctgaacg gcgtggagtt cgagctggtg ggcggcggag     8340

agggcacccc cgagcagggc cgcatgacca acaagatgaa gagcaccaaa ggcgccctga     8400

ccttcagccc ctacctgctg agccacgtga tgggctacgg cttctaccac ttcggcacct     8460

accccagcgg ctacgagaac cccttcctgc acgccatcaa caacggcggc tacaccaaca     8520

cccgcatcga gaagtacgag gacggcggcg tgctgcacgt gagcttcagc taccgctacg     8580

aggccggccg cgtgatcggc gacttcaagg tgatgggcac cggcttcccc gaggacagcg     8640

tgatcttcac cgacaagatc atccgcagca acgccaccgt ggagcacctg caccccatgg     8700

gcgataacga tctggatggc agcttcaccc gcaccttcag cctgcgcgac ggcggctact     8760

acagctccgt ggtggacagc cacatgcact tcaagagcgc catccacccc agcatcctgc     8820

agaacggggg ccccatgttc gccttccgcc gcgtggagga ggatcacagc aacaccgagc     8880

tgggcatcgt ggagtaccag cacgccttca agaccccgga tgcagatgcc ggtgaagaat     8940

aagggtggga aggagtcggg gagggtcctg gcagagcggc gtcctcatga tgtgttggag     9000

acctggagag tcgagagctt cctcgtcacc tgattgtcat gtgtgtatag gttaaggggg     9060

cccactcaaa gccataaaga cgaacacaaa cactaatctc aacaaagtct actagcatgc     9120

cgtctgtcca tctttatttc ctggcgcgcc tatgcttgta aaccgttttg tgaaaaaatt     9180

tttaaaataa aaaaggggac ctctagggtc cccaattaat tagtaatata atctattaaa     9240

ggtcattcaa aaggtcatcc agacgaaagg gcctcgtgat acgcctattt ttataggtta     9300

atgtcatgat aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg     9360

gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat     9420

aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc     9480

gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa     9540

cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac     9600

tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga     9660

tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag     9720

agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca     9780

cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca     9840

tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa     9900

ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc     9960

tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa    10020

cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag    10080

actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct    10140

ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac    10200

tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa    10260

ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt    10320

aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat    10380

ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg    10440

agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc    10500

ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg    10560

tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag    10620

cgcagatacc aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact    10680

ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg    10740

gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc    10800

ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg    10860

aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg    10920

cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag    10980

ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc    11040

gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct    11100

ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc    11160

ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc    11220

gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga aga                      11263


<210>  48
<211>  4101
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  Cas9 gene from Streptococcus pyogenes codon optimized for 
       Nannochloropsis

<400>  48
gacaagaagt actccatcgg gctggacatc gggacgaact ccgtgggatg ggccgtgatc       60

acagacgaat acaaggtgcc ttccaagaag ttcaaggtgc tggggaacac ggacagacac      120

tccatcaaga agaacctcat cggggccttg ctcttcgact ccggagaaac cgccgaagca      180

acgcgattga aaagaaccgc cagaagacga tacacacgac ggaagaaccg catctgctac      240

ctccaggaga tcttcagcaa cgagatggcc aaggtggacg actcgttctt tcatcgcctg      300

gaggagagct tcctggtgga ggaagacaag aaacatgagc gccacccgat cttcgggaac      360

atcgtggacg aagtggccta ccacgagaaa taccccacga tctaccactt gcgcaagaaa      420

ctcgtggact ccacggacaa agcggacttg cggttgatct acttggcctt ggcccacatg      480

atcaaatttc ggggccactt cctgatcgag ggcgacttga atcccgacaa ttccgacgtg      540

gacaagctct tcatccagct ggtgcagacc tacaaccagc tcttcgagga gaaccccatc      600

aatgcctccg gagtggacgc caaagccatc ttgtccgccc gattgtccaa atccagacgc      660

ttggagaact tgatcgcaca acttcctggc gagaagaaga acggcctctt cggcaacttg      720

atcgcgctgt cgctgggatt gacgcctaac ttcaagtcca acttcgactt ggccgaggac      780

gccaagttgc aactgtccaa ggacacctac gacgacgacc tcgacaacct gctggcccaa      840

attggcgacc aatacgcgga cttgtttttg gcggccaaga acttgagcga cgccatcttg      900

ttgagcgaca tcttgcgcgt gaatacggag atcaccaaag cccctttgtc cgcctctatg      960

atcaagcggt acgacgagca ccaccaagac ttgaccctgt tgaaagccct cgtgcggcaa     1020

caattgcccg agaagtacaa ggagatcttc ttcgaccagt ccaagaacgg gtacgccggc     1080

tacatcgacg gaggagcctc ccaagaagag ttctacaagt tcatcaagcc catcctggag     1140

aagatggacg gcaccgagga gttgctcgtg aagctgaacc gcgaagactt gttgcgaaaa     1200

cagcggacgt tcgacaatgg cagcatcccc caccaaatcc atttgggaga gttgcacgcc     1260

atcttgcgac ggcaagagga cttctacccg ttcctgaagg acaaccgcga gaaaatcgag     1320

aagatcctga cgttcagaat cccctactac gtgggaccct tggcccgagg caattcccgg     1380

tttgcatgga tgacgcgcaa aagcgaagag acgatcaccc cctggaactt cgaagaagtg     1440

gtcgacaaag gagcatccgc acagagcttc atcgagcgaa tgacgaactt cgacaagaac     1500

ctgcccaacg agaaggtgtt gcccaagcat tcgctgctgt acgagtactt cacggtgtac     1560

aacgagctga ccaaggtgaa gtacgtgacc gagggcatgc gcaaacccgc gttcctgtcg     1620

ggagagcaaa agaaggccat tgtggacctg ctgttcaaga ccaaccggaa ggtgaccgtg     1680

aaacagctga aagaggacta cttcaagaag atcgagtgct tcgactccgt ggagatctcc     1740

ggcgtggagg accgattcaa tgcctccttg ggaacctacc atgacctcct gaagatcatc     1800

aaggacaagg acttcctgga caacgaggag aacgaggaca tcctggagga catcgtgctg     1860

accctgaccc tgttcgagga ccgagagatg atcgaggaac ggttgaaaac gtacgcccac     1920

ttgttcgacg acaaggtgat gaagcagctg aaacgccgcc gctacaccgg atggggacga     1980

ttgagccgca aactgattaa tggaattcgc gacaagcaat ccggaaagac catcctggac     2040

ttcctgaagt ccgacgggtt cgccaaccgc aacttcatgc agctcatcca cgacgactcc     2100

ttgaccttca aggaggacat ccagaaggcc caagtgtccg gacaaggaga ctccttgcac     2160

gagcacatcg ccaatttggc cggatccccc gcaatcaaaa aaggcatctt gcaaaccgtg     2220

aaagtggtcg acgaactggt gaaggtgatg ggacggcaca agcccgagaa catcgtgatc     2280

gaaatggccc gcgagaacca aaccacccaa aaaggacaga agaactcccg agagcgcatg     2340

aagcggatcg aagagggcat caaggagttg ggctcccaga tcctgaagga gcatcccgtg     2400

gagaataccc aattgcaaaa cgagaagctc tacctctact acctccagaa cgggcgggac     2460

atgtacgtcg accaagagct ggacatcaac cgcctctccg actacgatgt ggatcatatt     2520

gtgccccaga gcttcctcaa ggacgacagc atcgacaaca aggtcctgac gcgcagcgac     2580

aagaaccggg gcaagtctga caatgtgcct tccgaagaag tcgtgaagaa gatgaagaac     2640

tactggcggc agctgctcaa cgccaagctc atcacccaac ggaagttcga caacctgacc     2700

aaggccgaga gaggaggatt gtccgagttg gacaaagccg gcttcattaa acgccaactc     2760

gtggagaccc gccagatcac gaagcacgtg gcccaaatct tggactcccg gatgaacacg     2820

aaatacgacg agaatgacaa gctgatccgc gaggtgaagg tgatcacgct gaagtccaag     2880

ctggtgagcg acttccggaa ggacttccag ttctacaagg tgcgggagat caacaactac     2940

catcacgccc atgacgccta cctgaacgcc gtggtcggaa ccgccctgat caagaaatac     3000

cccaagctgg agtccgaatt cgtgtacgga gattacaagg tctacgacgt gcggaagatg     3060

atcgcgaagt ccgagcagga gatcggcaaa gccaccgcca agtacttctt ttactccaac     3120

atcatgaact tcttcaagac cgagatcacg ctcgccaacg gcgagatccg caagcgcccc     3180

ctgatcgaga ccaacggcga gacgggagag attgtgtggg acaaaggaag agattttgcc     3240

acagtgcgca aggtgctgtc catgcctcag gtgaacatcg tgaagaagac cgaggtgcaa     3300

acaggagggt tttccaaaga gtccattttg cctaagagga attccgacaa gctcatcgcc     3360

cgcaagaagg actgggaccc caagaagtac gggggcttcg actcccccac ggtggcctac     3420

tccgtgttgg tggtggccaa agtggagaaa gggaagagca agaagctgaa atccgtgaag     3480

gagttgctcg gaatcacgat catggaacga tcgtcgttcg agaaaaaccc catcgacttc     3540

ctcgaagcca aagggtacaa agaggtgaag aaggacctga tcatcaagct gcccaagtac     3600

tccctgttcg agctggagaa cggccgcaag cggatgctgg cctccgccgg ggaactgcag     3660

aaagggaacg aattggcctt gccctccaaa tacgtgaact tcctctactt ggcctcccat     3720

tacgaaaagc tcaaaggatc ccctgaggac aatgagcaga agcaactctt cgtggaacaa     3780

cacaagcact acctggacga gatcatcgag cagatcagcg agttctccaa gcgcgtgatc     3840

ctcgccgacg ccaacctgga caaggtgctc tccgcctaca acaagcaccg cgacaagcct     3900

atccgcgagc aagccgagaa tatcattcac ctgtttaccc tgacgaattt gggagcccct     3960

gccgccttta aatactttga caccaccatc gaccgcaaaa gatacacctc caccaaggaa     4020

gtcttggacg ccaccctcat ccaccagtcc atcacgggcc tctacgagac gcgcatcgac     4080

ctctcccaat tgggcggcga c                                               4101


<210>  49
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  Encodes FLAG tag

<400>  49
gactacaagg atgacgatga caag                                              24


<210>  50
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  Encodes Nuclear Localization Sequence

<400>  50
cccaagaaaa agcggaaggt cggc                                              24


<210>  51
<211>  147
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  Encodes peptide linker

<400>  51
atgcccaaga aaaagcggaa ggtcggcgac tacaaggatg acgatgacaa gttggagcct       60

ggagagaagc cctacaaatg ccctgagtgc ggaaagagct tcagccaatc tggagccttg      120

acccggcatc aacgaacgca tacacga                                          147


<210>  52
<211>  1000
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  RPL24 promoter

<400>  52
aataagcata catcatatga atacaattca gcttaaattt atcatacaaa gatgtaagtg       60

cagcgtgggt ctgtaacgat cgggcgtaat ttaagataat gcgagggacc gggggaggtt      120

ttggaacgga atgaggaatg ggtcatggcc cataataata atatgggttt ggtcgcctcg      180

cacagcaacc gtacgtgcga aaaaggaaca gatccattta ataagttgaa cgttattctt      240

tcctatgcaa tgcgtgtatc ggaggcgaga gcaagtcata ggtggctgcg cacaataatt      300

gagtctcagc tgagcgccgt ccgcgggtgg tgtgagtggt catcctcctc ccggcctatc      360

gctcacatcg cctctcaatg gtggtggtgg ggcctgatat gacctcaatg ccgacccata      420

ttaaaaccca gtaaagcatt caccaacgaa cgaggggctc ttttgtgtgt gttttgagta      480

tgattttaca cctctttgtg catctctctg gtcttccttg gttcccgtag tttgggcatc      540

atcactcacg cttccctcga ccttcgttct tcctttacaa ccccgacaca ggtcagagtt      600

ggagtaatca aaaaaggggt gcacgaatga gatacattag attttgacag atatcctttt      660

actggagagg gttcaaggga tcaaatgaac agcgggcgtt ggcaatctag ggagggatcg      720

gaggttggca gcgagcgaaa gcgtgtccat ccttttggct gtcacacctc acgaaccaac      780

tgttagcagg ccagcacaga tgacatacga gaatctttat tatatcgtag accttatgtg      840

gatgaccttt ggtgctgtgt gtctggcaat gaacctgaag gcttgatagg gaggtggctc      900

ccgtaaaccc tttgtccttt ccacgctgag tctcccccgc actgtccttt atacaaattg      960

ttacagtcat ctgcaggcgg tttttctttg gcaggcaaag                           1000


<210>  53
<211>  317
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  bidirectional terminator 2

<400>  53
agtgatgcgg cctttaggaa acaccacaaa agtaattgac aatctcagga acgatctgcg       60

tgtttacagc ttcccaaata acaattatac cacgtaccaa aaggggttta atgtatctca      120

caaattcttc taataggtac agcttctcaa attgggtgta tgatgtgaca cttcgtctca      180

cacacgtcac gataattcag cgtatggctt cccttcatca cattcacgca aacttctaca      240

caaccctggg catatttctt gtgttggcaa cactcccgaa atcgattctg cacacaatgg      300

ttcattcaat gattcaa                                                     317


<210>  54
<211>  399
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  blast gene from Aspergillus terreus codon optimized for 
       Nannochloropsis gaditana

<400>  54
atggccaagc ctttatccca agaggaatcc acgctgatcg aacgtgcaac tgcgaccatc       60

aacagcatac ctattagcga ggactactcg gtggccagtg cagccctctc gtccgacggt      120

cggatcttta ccggcgtgaa tgtatatcat ttcaccggag ggccatgcgc ggagctcgtg      180

gtcctcggaa cggccgctgc ggctgctgcc ggaaatctga cgtgcatagt ggccatcggg      240

aacgaaaacc gcggcattct gtctccgtgc gggcgatgtc ggcaggtgct gcttgacttg      300

cacccgggga tcaaggcaat tgtcaaagat tccgatgggc agcccacagc ggttggcatc      360

agggagttgc ttccctctgg ctacgtctgg gagggttga                             399


<210>  55
<211>  999
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  TCTP promoter

<400>  55
cgtgcaggtg tacagattga aggaaacaat ggagatatct ttggcagttg aaaaccgtgt       60

tcgaatcatg cttttctact ctccaactga gacgaaattt atagcgccat gtcgcttctg      120

actaccaggc ttaggaaggc ctcatcacaa gctggatcgg ttcgaattaa gcaggcactg      180

aagccaagct tgcaagacag ccacctttta attccctcaa aacactttct caattcagcc      240

cggtaaatat gccgattcac agcggccaag atagagggga ggttagcaag aatgttgcga      300

tccctcccca gtcgttgcct cgcacacaac ctaggccttc acctttccat ggaaaattga      360

gaagtgaata ttggttttct tacggcatat cagatgaaat catgacccct aaacatgaag      420

agctgcaggc aaaacacctg ctctggacga gcacgatgaa atctcgagaa cccgccgtac      480

ttcagttgat cccgcatgat gacggccgcc attgaaataa gccacctcac tttattctag      540

caccgatttc caccgttgtg agggccgaac gaggacaatt tcgtgcgaaa caagcacgaa      600

cacgcacacg attagtagta cagacgagca gatcgatggc atgcggcacg gtctcgcgtt      660

ctcggcgacc aggacaacgg agcagaggga ggcctgccga gttccgaggg gcattttagt      720

ccaaaattgt gttgacacgt gaacaagtgg cttgaaaaga ggaaggaaat gcctgggttt      780

cccttcgaga gcgggaactc gcttgtgcgt catcctagct acccatggtc cctttgtggg      840

ggaggctgtt tcgtcctacc gaatgtgtgg cgctccatgc atcttctgcc tcccaaacca      900

ccaacatgag cacgcgaagg aaggagaaaa aagtggccgc aacgttctct tctcatattt      960

attgtctcat cacaaacata ggtacataat acaacaatc                             999


<210>  56
<211>  318
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  EIF3 terminator

<400>  56
ggcactgtaa ccccggttcc gctcgacgaa ggctgggagc gccctttcgg tgggataaaa       60

tggatgcttt accgctgcgc ttcggctgag gaagagagaa atgcgagcgg ggatcggggt      120

cctagaaacg aagaaaggag aacaagttcc tggccaaaga aaaacaagac aaataccctc      180

tccaggcctg ggcccattac ttttttttgc tgtttcttat acctgcactc gtgcttctct      240

agtctgtcga gaccttacct gatcttcctc cctccatcgc tccccgcccc ccccatccga      300

gcaaccgtcg accatacg                                                    318


<210>  57
<211>  702
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  TurboGFP gene codon optimized for Nannochloropsis gaditana

<400>  57
atgttggaga gcgacgagag cggcctgccc gccatggaga tcgagtgccg catcaccggc       60

accctgaacg gcgtggagtt cgagctggtg ggcggcggag agggcacccc cgagcagggc      120

cgcatgacca acaagatgaa gagcaccaaa ggcgccctga ccttcagccc ctacctgctg      180

agccacgtga tgggctacgg cttctaccac ttcggcacct accccagcgg ctacgagaac      240

cccttcctgc acgccatcaa caacggcggc tacaccaaca cccgcatcga gaagtacgag      300

gacggcggcg tgctgcacgt gagcttcagc taccgctacg aggccggccg cgtgatcggc      360

gacttcaagg tgatgggcac cggcttcccc gaggacagcg tgatcttcac cgacaagatc      420

atccgcagca acgccaccgt ggagcacctg caccccatgg gcgataacga tctggatggc      480

agcttcaccc gcaccttcag cctgcgcgac ggcggctact acagctccgt ggtggacagc      540

cacatgcact tcaagagcgc catccacccc agcatcctgc agaacggggg ccccatgttc      600

gccttccgcc gcgtggagga ggatcacagc aacaccgagc tgggcatcgt ggagtaccag      660

cacgccttca agaccccgga tgcagatgcc ggtgaagaat aa                         702


<210>  58
<211>  822
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  4A-III promoter

<400>  58
ggcataaagg acggcaagga aagaaaagaa agaaagaaaa ggacacttat agcatagttt       60

gaagttataa gtagtcgcaa tctgtgtgca gccgacagat gctttttttt tccgtttggc      120

aggaggtgta gggatgtcga agaccagtcc agctagtatc tatcctacaa gtcaatcatg      180

ctgcgacaaa aatttctcgc acgaggcctc tcgataaaca aaactttaaa agcacacttc      240

attgtcatgc agagtaataa ctcttccgcg tcgatcaatt tatcaatctc tatcatttcc      300

gcccctttcc ttgcatagag caagaaaagc gacccggatg aggataacat gtcctgcgcc      360

agtagtgtgg cattgcctgt ctctcattta cacgtactga aagcataatg cacgcgcata      420

ccaatatttt tcgtgtacgg agatgaagag acgcgacacg taagatcacg agaaggcgag      480

cacggttgcc aatggcagac gcgctagtct ccattatcgc gttgttcggt agcttgctgc      540

atgtcttcag tggcactata tccactctgc ctcgtcttct acacgagggc cacatcggtg      600

caagttcgaa aaatcatatc tcaatcttca gatcctttcc agaaacggtg ctcaggcggg      660

aaagtgaagg ttttctactc tagtggctac cccaattctc tccgactgtc gcagacggtc      720

cttcgttgcg cacgcaccgc gcactacctc tgaaattcga caaccgaagt tcaattttac      780

atctaacttc tttcccattc tctcaccaaa agcctagctt ac                         822


<210>  59
<211>  200
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  bidirectional terminator 5

<400>  59
gggtgggaag gagtcgggga gggtcctggc agagcggcgt cctcatgatg tgttggagac       60

ctggagagtc gagagcttcc tcgtcacctg attgtcatgt gtgtataggt taagggggcc      120

cactcaaagc cataaagacg aacacaaaca ctaatctcaa caaagtctac tagcatgccg      180

tctgtccatc tttatttcct                                                  200


<210>  60
<211>  101
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  Guide RNA for Bromo-1091 gene knockout

<400>  60
uguggcagac gccgacgggu uuuagagcua gaaauagcaa guuaaaauaa ggcuaguccg       60

uuaucaacuu gaaaaagugg caccgagucg gugcuuuuuu u                          101


<210>  61
<211>  18
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo-1091 gene target sequence used in chimeric guide RNA for 
       knockout (SEQ ID NO:60)

<400>  61
tgtggcagac gccgacgg                                                     18


<210>  62
<211>  1029
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  Hygromycin resistance gene

<400>  62
atggggaaga aaccggaact gaccgctacg tccgtggaga aattccttat tgagaagttc       60

gactctgtct ccgacttgat gcaactgagc gagggagagg agagtagggc gttctcgttt      120

gacgtagggg gtcggggata cgtgttgagg gttaatagtt gtgcggacgg gttctacaag      180

gatcggtatg tctaccgtca tttcgcctcc gccgctctcc ccataccaga ggtactggac      240

attggggagt ttagcgaatc tctcacgtac tgcatctcgc gccgagccca gggagtgacg      300

ttgcaagatc tgcccgaaac tgaattgcct gccgttttgc aacccgtggc cgaggccatg      360

gacgcgatcg ctgccgcaga tctgtctcag acgtccggct ttggaccttt tgggccccag      420

ggcatcgggc agtacacgac ctggcgagac ttcatctgcg ccattgccga tcctcacgtc      480

tatcattggc agacagtcat ggatgacacc gtgtctgcat ccgtggccca agcactggac      540

gaactcatgt tgtgggccga ggattgccct gaggtcaggc acctggtgca cgcggatttc      600

ggcagcaata acgtacttac agacaatggt cggattactg ctgtcatcga ctggtccgaa      660

gcgatgtttg gtgatagcca atacgaagtg gcgaacatat tcttctggcg tccctggttg      720

gcgtgcatgg agcagcagac acgctacttt gaacggaggc acccggagct ggccggctcc      780

ccacgactcc gcgcctatat gttgcgtatc ggactcgatc agctttacca gtctctcgtc      840

gacggcaact tcgacgacgc cgcgtgggcg cagggccgct gcgacgcgat agtccgcagc      900

ggggctggga cggtgggtcg gacccaaatc gcacgccggt cggctgcggt gtggacagac      960

ggctgtgttg aggtgcttgc ggactcgggc aaccgtaggc cgagcacccg accgcgtgca     1020

aaggagtga                                                             1029


<210>  63
<211>  1000
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  EIF3 promoter

<400>  63
tcataatcaa agatgagcca gccacgaagc taccggagaa ttctgtaaga aaaatgttta       60

aagttgaaaa tgctaacagt gaagtgatat ccttttttaa tggagtgttg aggtgaagtc      120

tagcatcgta ggggaaaaca ggattctgtg tcttccattc tactccttga taaagcgaag      180

aaatccgaca aaaccaaaga gattgttcaa gtttaagatt tgtaagcgta caactatgaa      240

cttcttctct ttgtaggcct gagtggtcgt atgcatacga ttcatgaagt gaatcagtat      300

cgctggattt tgcttaggag taaagcacaa ctaagaaaat atgctgcctg gcaggcatcc      360

tgagacatga ggcaagcgac gtagcaattg aatcctaatt taagccaggg catctgtatg      420

actctgttag ttaattgatg aaccaatgag ctttaaaaaa aaatcgttgc gcgtaatgta      480

gttttaattc tccgccttga ggtgcggggc catttcggac aaggttcttt ggacggagat      540

ggcagcatgt gtcccttctc caaattggtc cgtgtggtag ttgagatgct gccttaaaat      600

tctgctcggt catcctgcct tcgcattcac tcctttcgag ctgtcgggtt cctcacgagg      660

cctccgggag cggattgcgc agaaaggcga cccggagaca cagagaccat acaccgacta      720

aattgcactg gacgatacgg catggcgacg acgatggcca agcattgcta cgtgattatt      780

cgccttgtca ttcagggaga aatgatgaca tgtgtgggac ggtctttaca tgggaagagg      840

gcatgaaaat aacatggcct ggcgggatgg agcgtcacac ctgtgtatgc gttcgatcca      900

caagcaactc accatttgcg tcggggcctg tctccaatct gctttaggct acttttctct      960

aatttagcct attctataca gacagagaca cacagggatc                           1000


<210>  64
<211>  27
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  5'ID sequence

<400>  64
tccacagccc gaacccatga gagagaa                                           27


<210>  65
<211>  27
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  3'ID sequence

<400>  65
gcccgaatcg agttgatggc ccgcaaa                                           27


<210>  66
<211>  2400
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  Donor Fragment with HygR cassette

<400>  66
tccacagccc gaacccatga gagagaatca taatcaaaga tgagccagcc acgaagctac       60

cggagaattc tgtaagaaaa atgtttaaag ttgaaaatgc taacagtgaa gtgatatcct      120

tttttaatgg agtgttgagg tgaagtctag catcgtaggg gaaaacagga ttctgtgtct      180

tccattctac tccttgataa agcgaagaaa tccgacaaaa ccaaagagat tgttcaagtt      240

taagatttgt aagcgtacaa ctatgaactt cttctctttg taggcctgag tggtcgtatg      300

catacgattc atgaagtgaa tcagtatcgc tggattttgc ttaggagtaa agcacaacta      360

agaaaatatg ctgcctggca ggcatcctga gacatgaggc aagcgacgta gcaattgaat      420

cctaatttaa gccagggcat ctgtatgact ctgttagtta attgatgaac caatgagctt      480

taaaaaaaaa tcgttgcgcg taatgtagtt ttaattctcc gccttgaggt gcggggccat      540

ttcggacaag gttctttgga cggagatggc agcatgtgtc ccttctccaa attggtccgt      600

gtggtagttg agatgctgcc ttaaaattct gctcggtcat cctgccttcg cattcactcc      660

tttcgagctg tcgggttcct cacgaggcct ccgggagcgg attgcgcaga aaggcgaccc      720

ggagacacag agaccataca ccgactaaat tgcactggac gatacggcat ggcgacgacg      780

atggccaagc attgctacgt gattattcgc cttgtcattc agggagaaat gatgacatgt      840

gtgggacggt ctttacatgg gaagagggca tgaaaataac atggcctggc gggatggagc      900

gtcacacctg tgtatgcgtt cgatccacaa gcaactcacc atttgcgtcg gggcctgtct      960

ccaatctgct ttaggctact tttctctaat ttagcctatt ctatacagac agagacacac     1020

agggatcatg gggaagaaac cggaactgac cgctacgtcc gtggagaaat tccttattga     1080

gaagttcgac tctgtctccg acttgatgca actgagcgag ggagaggaga gtagggcgtt     1140

ctcgtttgac gtagggggtc ggggatacgt gttgagggtt aatagttgtg cggacgggtt     1200

ctacaaggat cggtatgtct accgtcattt cgcctccgcc gctctcccca taccagaggt     1260

actggacatt ggggagttta gcgaatctct cacgtactgc atctcgcgcc gagcccaggg     1320

agtgacgttg caagatctgc ccgaaactga attgcctgcc gttttgcaac ccgtggccga     1380

ggccatggac gcgatcgctg ccgcagatct gtctcagacg tccggctttg gaccttttgg     1440

gccccagggc atcgggcagt acacgacctg gcgagacttc atctgcgcca ttgccgatcc     1500

tcacgtctat cattggcaga cagtcatgga tgacaccgtg tctgcatccg tggcccaagc     1560

actggacgaa ctcatgttgt gggccgagga ttgccctgag gtcaggcacc tggtgcacgc     1620

ggatttcggc agcaataacg tacttacaga caatggtcgg attactgctg tcatcgactg     1680

gtccgaagcg atgtttggtg atagccaata cgaagtggcg aacatattct tctggcgtcc     1740

ctggttggcg tgcatggagc agcagacacg ctactttgaa cggaggcacc cggagctggc     1800

cggctcccca cgactccgcg cctatatgtt gcgtatcgga ctcgatcagc tttaccagtc     1860

tctcgtcgac ggcaacttcg acgacgccgc gtgggcgcag ggccgctgcg acgcgatagt     1920

ccgcagcggg gctgggacgg tgggtcggac ccaaatcgca cgccggtcgg ctgcggtgtg     1980

gacagacggc tgtgttgagg tgcttgcgga ctcgggcaac cgtaggccga gcacccgacc     2040

gcgtgcaaag gagtgattga atcattgaat gaaccattgt gtgcagaatc gatttcggga     2100

gtgttgccaa cacaagaaat atgcccaggg ttgtgtagaa gtttgcgtga atgtgatgaa     2160

gggaagccat acgctgaatt atcgtgacgt gtgtgagacg aagtgtcaca tcatacaccc     2220

aatttgagaa gctgtaccta ttagaagaat ttgtgagata cattaaaccc cttttggtac     2280

gtggtataat tgttatttgg gaagctgtaa acacgcagat cgttcctgag attgtcaatt     2340

acttttgtgg tgtttcctaa aggccgcatc actgcccgaa tcgagttgat ggcccgcaaa     2400


<210>  67
<211>  101
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  BASH-1 Bromo-1091 gene Guide RNA

<400>  67
uguggcagac gccgacgggu uuuagagcua gaaauagcaa guuaaaauaa ggcuaguccg       60

uuaucaacuu gaaaaagugg caccgagucg gugcuuuuuu u                          101


<210>  68
<211>  17
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo-1091 gene target sequence used in guide RNA for BASH-1 
       knockdown (SEQ ID NO:67)

<400>  68
actgaaaggg cagagtg                                                      17


<210>  69
<211>  101
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  BASH-4 Bromo-1091 Guide RNA

<400>  69
uguggcagac gccgacgggu uuuagagcua gaaauagcaa guuaaaauaa ggcuaguccg       60

uuaucaacuu gaaaaagugg caccgagucg gugcuuuuuu u                          101


<210>  70
<211>  18
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Bromo-1091 gene target sequence used in guide RNA for BASH-4 
       knockdown (SEQ IDNO:69)

<400>  70
tgtggacgct agtacagg                                                     18


<210>  71
<211>  101
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  BASH-5 Guide RNA

<400>  71
uguggcagac gccgacgggu uuuagagcua gaaauagcaa guuaaaauaa ggcuaguccg       60

uuaucaacuu gaaaaagugg caccgagucg gugcuuuuuu u                          101


<210>  72
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  Bromo-1091 gene target sequence used in guide RNA for BASH-5 
       knockdown (SEQ ID NO:71)

<400>  72
aaaagcgccg tctcggaa                                                     18


<210>  73
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  Forward primer, Bromo-1091 gene 5' end

<400>  73
attgctagcc gtgctttcaa c                                                 21


<210>  74
<211>  20
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Reverse primer, Bromo-1091 gene 5' end

<400>  74
gtcggtttgg agaccctaga                                                   20


<210>  75
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  Forward primer for RT PCR, Bromo-1091 gene

<400>  75
gaataggcgg tcagaatgta gg                                                22


<210>  76
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<223>  Reverse primer for RT PCR, Bromo-1091 gene

<400>  76
atattttgtg ggcgttgctg                                                   20


<210>  77
<211>  19
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Forward primer, housekeeping gene 1T5001704

<400>  77
gaggaagcgg aagaggatg                                                    19


<210>  78
<211>  20
<212>  DNA
<213>  Nannochloropsis gaditana


<220>
<221>  misc_feature
<223>  Reverse primer, housekeeping gene 1T5001704

<400>  78
tcaagtacca gttccacacg                                                   20


