                         SEQUENCE LISTING

<110>  SYNTHETIC GENOMICS, INC.
       MOELLERING, Eric R.
       IMAM, Saheed
       PEACH, Luke
       KALB, Ryan
       POTTS, Sarah
 
<120>  RECOMBINANT ALGAE HAVING HIGH LIPID PRODUCTIVITY

<130>  SGI2240-1WO

<150>  US 62/949,378
<151>  2019-12-17

<160>  50    

<170>  PatentIn version 3.5

<210>  1
<211>  351
<212>  PRT
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  RNA binding domain (RBD) protein

<400>  1

Met Ser Ser Glu Glu Ile Ser Lys Asp Met Glu Glu Ala Ser Ser Ser 
1               5                   10                  15      


Gly Asp Gly Gly Gly Lys Leu Phe Leu Gly Gly Leu Ser Trp Asp Thr 
            20                  25                  30          


Thr Glu Glu Lys Leu Arg Glu His Phe Gly Val Tyr Gly Asp Ile His 
        35                  40                  45              


Glu Ala Val Val Met Lys Asp Arg Thr Thr Gly Arg Pro Arg Gly Phe 
    50                  55                  60                  


Gly Phe Val Thr Phe Lys Asp Ala Glu Val Ala Asp Arg Val Val Gln 
65                  70                  75                  80  


Asp Ile His Val Ile Asp Gly Arg Gln Ile Asp Ala Lys Lys Ser Val 
                85                  90                  95      


Pro Gln Glu Gln Lys Pro Lys Ala Arg Lys Ile Phe Val Gly Gly Leu 
            100                 105                 110         


Ala Pro Glu Thr Thr Glu Ala Asp Phe Lys Glu Tyr Phe Glu Arg Tyr 
        115                 120                 125             


Gly Ser Ile Ser Asp Val Gln Ile Met Gln Asp His Met Thr Gly Arg 
    130                 135                 140                 


Ser Arg Gly Phe Gly Phe Ile Thr Phe Glu Glu Asp Ala Ala Val Glu 
145                 150                 155                 160 


Lys Val Phe Ala Gln Gly Ala Met Gln Glu Leu Gly Gly Lys Arg Ile 
                165                 170                 175     


Glu Ile Lys His Ala Thr Pro Lys Gly Ser Ser Ser Pro Thr Thr Pro 
            180                 185                 190         


Gly Gly Arg Ser Ser Ser Gly Gly Arg Gly Gln Gly Tyr Gly Arg Ala 
        195                 200                 205             


Met Pro Met Pro Phe Gly Gln Leu Ala Gly Ser Pro Tyr Gly Tyr Gly 
    210                 215                 220                 


Leu Phe His Phe Pro Pro Gly Val Met Pro His Ala Thr Pro Tyr Ser 
225                 230                 235                 240 


Met Gly Tyr Ala Asn Pro Tyr Leu Met Met Gln Gln Ile Ser Gly Tyr 
                245                 250                 255     


Pro Gly Ala Thr Pro Tyr Pro Phe Ala Gly Leu Tyr Gly Gly Gln Gly 
            260                 265                 270         


Arg Gly Ala Ser Gln Gln Leu Gln Gln Ala Gln His Thr Ser Gln Gln 
        275                 280                 285             


Leu Ser Ser Ser Gly Ala Gly Pro Val Thr Arg Leu Gln Gly Gln Gln 
    290                 295                 300                 


Gln Gln Met Pro Gly Gln Gly Ser Arg Gln Gln His Pro Gln Ala Pro 
305                 310                 315                 320 


Tyr Pro Arg Pro Leu Ala Gly Ser Gly Arg Gly Lys Gly Lys Val Asp 
                325                 330                 335     


Ser Ala Ser Glu Leu Ser Asn His His His Ser Ala Ala His Ser 
            340                 345                 350     


<210>  2
<211>  853
<212>  PRT
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  Trehalose-6-phosphate synthase/phosphatase

<400>  2

Met Gly Thr Phe Ser Arg Lys Ser Phe Ser Asn Leu Ala Ala Leu Ala 
1               5                   10                  15      


Asp Gly Asp Phe Gly Gln Gly Ser Gln Asn Asp Leu Arg Gly Gly Gly 
            20                  25                  30          


Pro Leu Ser Leu Ser Ser Ala Ala Asn Arg Thr Ser Arg Asn Ser Ile 
        35                  40                  45              


Asp Ser Asp Gly Arg Arg Leu Gln Arg Leu Ile Phe Val Ser Asn His 
    50                  55                  60                  


Leu Pro Leu Arg Val Ser Lys Gly Ala Thr Asp Trp Asn Phe Glu Trp 
65                  70                  75                  80  


Asp Asp Asp Ala Leu Ile Ala Gln Ala Lys Glu Gly Leu Pro Glu Asp 
                85                  90                  95      


Met Glu Ala Leu Tyr Val Gly Cys Leu Pro Val Glu Val Asp Pro Gln 
            100                 105                 110         


Asp Gln Asp Glu Val Ser Leu Gln Leu Gln Lys Gln His Asn Cys Phe 
        115                 120                 125             


Pro Val Phe Leu Gly Thr Glu Leu Lys Thr Asn Tyr Tyr Arg Lys Phe 
    130                 135                 140                 


Cys Lys Gln Gln Leu Trp Pro Ile Leu His Tyr Leu Ile Pro Leu Asn 
145                 150                 155                 160 


Pro Thr Ser Leu Gly Arg Phe Asp Pro Gly Leu Trp Ala Ser Tyr Val 
                165                 170                 175     


Arg Ala Asn Lys Val Phe Ala Asp Lys Met Val Glu Val Leu Gly Ser 
            180                 185                 190         


Leu Glu Asp Asp Phe Val Trp Val His Asp Tyr His Leu Leu Val Leu 
        195                 200                 205             


Pro Ser Leu Leu Arg Lys Arg Phe His Arg Ile Lys Cys Gly Ile Phe 
    210                 215                 220                 


Leu His Ser Pro Phe Pro Ser Ser Glu Val Phe Arg Thr Phe Pro Arg 
225                 230                 235                 240 


Glu Glu Ile Ile Arg Ser Met Leu Asn Ala Asp Leu Ile Gly Phe His 
                245                 250                 255     


Thr Phe Asp Tyr Ala Arg His Phe Leu Ser Cys Cys Ala Arg Met Leu 
            260                 265                 270         


Gly Leu Glu His Lys Thr Ser Arg Gly Ala Ile Ile Ile Glu Tyr Tyr 
        275                 280                 285             


Gly Arg Asp Val Gly Ile Lys Ile Met Pro Thr Gly Val Lys Pro Ser 
    290                 295                 300                 


Arg Phe Leu Ser Ala Phe Ser Trp Lys Asp Thr Glu Trp Arg Arg Gly 
305                 310                 315                 320 


Glu Leu Ala Ala Gln Phe Lys Gly Lys Thr Val Leu Leu Gly Met Asp 
                325                 330                 335     


Asp Leu Asp Val Phe Lys Gly Ile Glu Leu Arg Leu Ala Ala Phe Lys 
            340                 345                 350         


Asp Val Leu Glu Tyr His Pro Glu Trp Lys Gly Arg Leu Val Leu Leu 
        355                 360                 365             


Gln Val Thr Thr Thr Arg Ala Pro Gly Arg Asp Val Asp Asp Leu Phe 
    370                 375                 380                 


Asp Phe Ile Thr Lys Gln Val Glu Glu Ile Asn Glu Arg Phe Gly Ala 
385                 390                 395                 400 


Pro Gly Tyr Gln Pro Val Val Trp Phe Asn Arg Pro Val Pro Met Tyr 
                405                 410                 415     


Glu Arg Ile Ala Met Leu Ser Ile Ala Asp Val Ala Val Val Thr Ala 
            420                 425                 430         


Thr Arg Asp Gly Met Asn Leu Met Pro Tyr Glu Tyr Val Val Cys Arg 
        435                 440                 445             


Gln Gly Pro Pro Gly Leu Ala Glu Thr Glu Gly Pro Arg His Ser Gln 
    450                 455                 460                 


Leu Val Val Ser Glu Phe Val Gly Cys Ser Pro Ser Leu Ser Gly Ala 
465                 470                 475                 480 


Ile Arg Val Asn Pro Trp Ser Ile Glu Ala Val Arg Asp Ala Leu Tyr 
                485                 490                 495     


Gly Ala Ile Arg Met Pro Ile Glu Glu Arg His Ile Arg His Glu Lys 
            500                 505                 510         


His Trp Lys Tyr Val Ser Ser His Thr Val Gln Phe Trp Ala Lys Ser 
        515                 520                 525             


Tyr Val Thr Asp Leu Gln Arg Phe Thr Ala Asn His Ser Lys Leu Gln 
    530                 535                 540                 


Cys Phe Asp Leu Gly Phe Ala Leu Asp Thr Phe Arg Met Val Ala Leu 
545                 550                 555                 560 


Thr Ser Asn Phe Arg Lys Leu Gln Thr Asp Thr Val Val Lys Ala Tyr 
                565                 570                 575     


Gln Arg Ala Lys Lys Arg Val Leu Leu Leu Asp His Asp Gly Thr Leu 
            580                 585                 590         


Met Ala Pro Ser Ser Ile Ser Ser Arg Pro Thr Asp His Val Leu Ala 
        595                 600                 605             


Thr Leu Arg Gln Leu Thr Ser Asp Pro Arg Asn Thr Val Tyr Ile Ile 
    610                 615                 620                 


Ser Gly Arg Ala Arg Thr Glu Leu Gln Glu Trp Phe Lys Ser Val Pro 
625                 630                 635                 640 


Asn Leu Gly Leu Ala Ala Glu His Gly Phe Tyr Leu Trp Thr Pro Gly 
                645                 650                 655     


Ser Ala Asp Trp Ala Val Gln Asp Pro Asp Met Gly Phe Gly Trp Lys 
            660                 665                 670         


Glu Ile Val Glu Pro Ile Leu Gln Val Tyr Thr Glu Ser Thr Asp Gly 
        675                 680                 685             


Ser His Ile Glu Ala Lys Glu Ser Ala Leu Val Trp His Tyr Arg Asp 
    690                 695                 700                 


Ala Asp Pro Asp Phe Gly Ser Trp Gln Ala Lys Glu Leu Leu Asp His 
705                 710                 715                 720 


Leu Glu Gly Ile Ile Ser Asn Glu Pro Val Glu Ile Val Ala Gly Gln 
                725                 730                 735     


Asn Ile Val Glu Val Lys Pro Gln Gly Val Ser Lys Gly Lys Val Val 
            740                 745                 750         


Glu Arg Ile Leu His Asp Cys Leu Thr Ala Ser Gln Ala Pro Glu Phe 
        755                 760                 765             


Val Leu Cys Val Gly Asp Asp Arg Ser Asp Glu Asp Met Phe Thr Ala 
    770                 775                 780                 


Met Glu Asn Met Gln Phe Ser Pro His Met Pro Val Glu Val Phe Ala 
785                 790                 795                 800 


Cys Thr Val Gly Gln Lys Pro Ser Lys Ala Pro Phe Tyr Val Asn Asp 
                805                 810                 815     


Pro Ala Glu Val Gly Gly Cys Gly Ser Arg Met Cys Gly Gly Lys Gly 
            820                 825                 830         


Gly Glu Gly Ala Ser Ala Pro Glu Thr His Gly Ile Gly Glu Gly Gly 
        835                 840                 845             


Gly Gly Met His Leu 
    850             


<210>  3
<211>  69
<212>  PRT
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  RNA Recognition Motif (RRM) of RNA binding domain protein of SEQ 
       ID NO: 1

<400>  3

Leu Phe Leu Gly Gly Leu Ser Trp Asp Thr Thr Glu Glu Lys Leu Arg 
1               5                   10                  15      


Glu His Phe Gly Val Tyr Gly Asp Ile His Glu Ala Val Val Met Lys 
            20                  25                  30          


Asp Arg Thr Thr Gly Arg Pro Arg Gly Phe Gly Phe Val Thr Phe Lys 
        35                  40                  45              


Asp Ala Glu Val Ala Asp Arg Val Val Gln Asp Ile His Val Ile Asp 
    50                  55                  60                  


Gly Arg Gln Ile Asp 
65                  


<210>  4
<211>  234
<212>  PRT
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  trehalose-6-phosphate phosphatase/synthase active site domain

<400>  4

Leu Leu Asp His Asp Gly Thr Leu Met Ala Pro Ser Ser Ile Ser Ser 
1               5                   10                  15      


Arg Pro Thr Asp His Val Leu Ala Thr Leu Arg Gln Leu Thr Ser Asp 
            20                  25                  30          


Pro Arg Asn Thr Val Tyr Ile Ile Ser Gly Arg Ala Arg Thr Glu Leu 
        35                  40                  45              


Gln Glu Trp Phe Lys Ser Val Pro Asn Leu Gly Leu Ala Ala Glu His 
    50                  55                  60                  


Gly Phe Tyr Leu Trp Thr Pro Gly Ser Ala Asp Trp Ala Val Gln Asp 
65                  70                  75                  80  


Pro Asp Met Gly Phe Gly Trp Lys Glu Ile Val Glu Pro Ile Leu Gln 
                85                  90                  95      


Val Tyr Thr Glu Ser Thr Asp Gly Ser His Ile Glu Ala Lys Glu Ser 
            100                 105                 110         


Ala Leu Val Trp His Tyr Arg Asp Ala Asp Pro Asp Phe Gly Ser Trp 
        115                 120                 125             


Gln Ala Lys Glu Leu Leu Asp His Leu Glu Gly Ile Ile Ser Asn Glu 
    130                 135                 140                 


Pro Val Glu Ile Val Ala Gly Gln Asn Ile Val Glu Val Lys Pro Gln 
145                 150                 155                 160 


Gly Val Ser Lys Gly Lys Val Val Glu Arg Ile Leu His Asp Cys Leu 
                165                 170                 175     


Thr Ala Ser Gln Ala Pro Glu Phe Val Leu Cys Val Gly Asp Asp Arg 
            180                 185                 190         


Ser Asp Glu Asp Met Phe Thr Ala Met Glu Asn Met Gln Phe Ser Pro 
        195                 200                 205             


His Met Pro Val Glu Val Phe Ala Cys Thr Val Gly Gln Lys Pro Ser 
    210                 215                 220                 


Lys Ala Pro Phe Tyr Val Asn Asp Pro Ala 
225                 230                 


<210>  5
<211>  81
<212>  PRT
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  trehalose-6-phosphate phosphatase/synthase conserved domain

<400>  5

Phe Gly Trp Lys Glu Ile Val Glu Pro Ile Leu Gln Val Tyr Thr Glu 
1               5                   10                  15      


Ser Thr Asp Gly Ser His Ile Glu Ala Lys Glu Ser Ala Leu Val Trp 
            20                  25                  30          


His Tyr Arg Asp Ala Asp Pro Asp Phe Gly Ser Trp Gln Ala Lys Glu 
        35                  40                  45              


Leu Leu Asp His Leu Glu Gly Ile Ile Ser Asn Glu Pro Val Glu Ile 
    50                  55                  60                  


Val Ala Gly Gln Asn Ile Val Glu Val Lys Pro Gln Gly Val Ser Lys 
65                  70                  75                  80  


Gly 
    


<210>  6
<211>  4531
<212>  DNA
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  SGI1 gene

<400>  6
atgtctggtt cagctggatc gggccaggct actctcagac atgacggtgg ctctgctggc       60

ggcagtgggc ctgtctcaga cggtttttca ccggccggcc tgaaggtaaa gtagaaagac      120

actcatacac atcttggttc ggcgttgaaa gtaggtcatt aacatactct ataaccaata      180

tttgtaggtt ctggtcgtgg acgacgacct catgtgcctt aaggtggtgt cagccatgtt      240

gaagaggtgc agctatcaag gtgaggtctt tactggtgtc tgttattgct gtaacatcat      300

ttcgctgttg cacaatttaa acatttgtaa tttactgttg ttattgcagt ggccacttgt      360

agcagtggca gcgaggcact gacacttcta cgtgaacgca acgaggacgg atcctccgac      420

cagttcgacc tcgtactgtc agatgtttac atgccgggta tgtcgtattc ctttgtaaac      480

tttacaatat gcgtctagtt tgacgcgtac actttgtaca ctttgcaaaa acgcaccctg      540

cgaggtctgc catttggtca ctacaacttg gccaccttgg ttgcaagttt gcaagttcgc      600

tctacgtcaa cgctgcaaaa tgaaccaatt gttttgcact gaccctgcca accttcattt      660

gtggctgcag acatggacgg tttcaagctg cttgaacaca tcggtctaga gttggagctt      720

cccgttatca gtaagttgat cgagccgagt ccagagcgaa gcctgcttct atactattag      780

cagctgtctt ttgatatttg acagcttgac ttgatatggt cacagagcat acttgcaacc      840

aggttacctg ttgaactagc aactgtgccc aagcatctct tcaagcacct ccgtcagtcc      900

atagggtact gttgatttgt actctgcaat actgcactgt aatgcgctgt gaatcactgc      960

ccttcacctc tagatggtgc ttccctggag ccctccccca cctccgcctc aagcccctca     1020

catgcctctc ccccccctgc agtgatgtca tccaacgggg acacgaatgt cgtgctgcgg     1080

ggggtcaccc acggggctgt ggactttctg atcaagcccg ttcgaattga ggagctgcgg     1140

aacgtgtggc agcacgtggt gcgtcgtcgt tccatggcgc tggccaggac gccagacgag     1200

gggggacact cggacgagga ctctcaggtg cccttggcag cttctgggcg gcttgctgtg     1260

tcggatgcca cttggactgg ggatgcacga ggggtggggg gacaatggga gatgggccat     1320

agtaggccag agttgatggc agtggtggtg ggggggagta ggcgggagag aagcagccat     1380

cctggtgttg gttttgatga ttgagtgcat ggggatgatg cacaggtgag ctgactggat     1440

gccttgtctt gctgtgctgc gctgcagcgg cacagtgtga aacgcaagga gtcggagcag     1500

agcccgctgc agctcagcac agagcagggc gggaacaaga agccaagagt ggtgtggtcg     1560

gtggagatgc accaacaggt gtgcttgcgg gcgggtgtat acgggggagg ggggccagct     1620

gctggctgac ctggcgtgcg cggtgcattg cacttggcga tgaggggcgt gcttcagtat     1680

gtagctggga cgcaattggt tgtgctgtgt gaccagtgca caaaatacat ccctgaattc     1740

cagtgggttg aacagagttg tcctggaggt gggaagcaaa cgcgcacgtg gtagagggga     1800

gcagggtgca gaacagccgc agcaggggtg ttgcgcagtg tgcaggtatc ctgcctccat     1860

gccccgggcc atgggcatac tacgctggta ccgtcaggat gggcgttgag cctggcttgg     1920

ggggcagggg gcgagcgaat gcggaatggg agcggcaggt gctgggaggg tggctgactg     1980

gcttgcagga gcgcaagtcc tgtcgggggc gtcgtcctgt tccctcctgc ccgcttcacc     2040

cacgttcact ctcatgcctc cacactcctg ctgctgacac acctgtcgcc acctccgctg     2100

cagtttgtga acgcggtcaa ctccctgggc attgacaagg cggtgcccaa gcggattctg     2160

gacctgatga acgtggaggg gctgacgcgc gagaacgtgg ccagccatct gcaggtgcct     2220

gccatgaccc ctcccaccag ggacctggtg ttttgacacc ctggaactcc tctttgacgg     2280

agcctccagt tcaattccag caatcgaatt gaatcaaaaa gcatgtgcac ccacgtgctg     2340

tttgaatgtc ccatgtggta ggaaacacaa ctgccccctt gccatttgct ggagggtgcc     2400

cgctgcgcca tgcccgagtg cgctgtgctc agcgttgtgc tgcgcccccc gctgactgaa     2460

gctgacagcg tgcggctgag gagggtactg ggggaggggg ggtgggaggc ggccgctggc     2520

ggcggaaggg agggtgtgca cgcatggaca cagggccttt ccgccctgca cggcctctac     2580

tgcaccctgc cacgtgatgt atcgacatgg tgggccatgc tgtgctgtgc cgctgcagaa     2640

gtaccgcctg tacctgaagc gggtggaggg agtgcaatcg ggtgcggcag cctccaagca     2700

gcaccagcac ccgcagtatc accagcagca gcagcagcag caagcgcaac ctcgtgcagc     2760

tgtctcccct gcagcagctt cctttggtgc cctttccttg ggagccccgc agcaggcgca     2820

gcagggcatg ccgcagctgg ggatgcctgt gcaggtgaag actgcccccc cccccctccc     2880

cctttccatc ttccctccat cagcctgctg ttccttaccc ttgtcaaccc gtctctcctt     2940

tttcgcaagc agcgcaccac cccccatgca cgccttgcct ggcactgttg tcagctgccc     3000

ccctagaaat acacaaggtg tgggtgcaac tggtgggacc ccctcccccc cccccctggg     3060

gctgcagggt ctccctccaa acttggcagc catgggatcc cagccgccgc acatcccctt     3120

ccagcaggcc ctggccatgc aggcggcggc tgcggcggct gcagccagcg gcgcgctccc     3180

cgggagtctg cccccctaca tgccaccccc ggggatgatg ccccccggca tgccgggggg     3240

ggtccccggt atgggagggg tggtggggca tcctcaggta cgggcagcac atgagtgggc     3300

aggggtattg gagaggggaa gggcagggag gttgcatgtg aggggctgca tggcaaagag     3360

gctgcagcgc aggtgttgct tgcagcactt cccctcggtg gcgcttgcat caaattttga     3420

atcctccccc gatgggcacg cccgtgtgtg ggggggggtg ggatggggga tgggggtggt     3480

tttgtggcat gtcgggcgct ttcatctacc cgggcccctg cccctgcctg tacgcgtgcg     3540

catgtgtgca gatgcccgcc ccagggatgg actttgcggg tttcaacggg tatggcaacg     3600

ctgcgggggg gctgatgttt ggcgggcagc agcaggcgca gcacgcgcag cagcacgcgt     3660

cagcgcaagc gggctcgctg gcgcagcagc aggcgcagca agtatccatg ggcttgggcc     3720

ttatgccccc cccgttgggg ttcccgccca cctcgctcgc cgcgccagcc ccgcgctccg     3780

cagcaactga gcccgccgca gccccactcc ccctgacgtc ctcgccgcca gctgcttcag     3840

caggcggcag cggcggccca gcagcagctg ctccgcagca cagcagcggc gccgcagcag     3900

cccaagcccc ccatcaccac ccacagtgct cggagcaggg agcggggggg ctcccgcccc     3960

cgctgcccgc gtccagcgcc ccgcagtcct atcccctccc tcccccctcc tcgcaggccg     4020

ctttgcatga cccggacgaa cactaccccc caggctcggc agaggtgagc acgtcccccc     4080

gccccctccc cccccccccc cccccttccc ttcaccctgg cttggcgtgc aatgaaaccc     4140

taaataaccc taaaacctca ttatcagttg caaattggac ccgtgaagcg ggcgggggca     4200

actgcgctct gctggtgtca gcgctgtctc tgccggttcc tgcccagcgt gcgcctgcat     4260

gcaagggggg atgggggggg ggaggcattt aacaataggc cagtcatctc caatccaccg     4320

tcaatttcag ccccctcccc ccccctccct catccccttg cagatgcacc accagcacct     4380

cccagggctg tgtggcttta acccggacga cctgctgggg gggcagctgg gggacatggg     4440

gttcctgggg gagctggggg gggcggtggg aggaaagcac gaacaggacg acttcctgga     4500

cctgctgctg aagggggagg aggagctgtg a                                    4531


<210>  7
<211>  1860
<212>  DNA
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  SGI1 gene coding sequence

<400>  7
atgtctggtt cagctggatc gggccaggct actctcagac atgacggtgg ctctgctggc       60

ggcagtgggc ctgtctcaga cggtttttca ccggccggcc tgaaggttct ggtcgtggac      120

gacgacctca tgtgccttaa ggtggtgtca gccatgttga agaggtgcag ctatcaagtg      180

gccacttgta gcagtggcag cgaggcactg acacttctac gtgaacgcaa cgaggacgga      240

tcctccgacc agttcgacct cgtactgtca gatgtttaca tgccggacat ggacggtttc      300

aagctgcttg aacacatcgg tctagagttg gagcttcccg ttatcatgat gtcatccaac      360

ggggacacga atgtcgtgct gcggggggtc acccacgggg ctgtggactt tctgatcaag      420

cccgttcgaa ttgaggagct gcggaacgtg tggcagcacg tggtgcgtcg tcgttccatg      480

gcgctggcca ggacgccaga cgagggggga cactcggacg aggactctca gcggcacagt      540

gtgaaacgca aggagtcgga gcagagcccg ctgcagctca gcacagagca gggcgggaac      600

aagaagccaa gagtggtgtg gtcggtggag atgcaccaac agtttgtgaa cgcggtcaac      660

tccctgggca ttgacaaggc ggtgcccaag cggattctgg acctgatgaa cgtggagggg      720

ctgacgcgcg agaacgtggc cagccatctg cagaagtacc gcctgtacct gaagcgggtg      780

gagggagtgc aatcgggtgc ggcagcctcc aagcagcacc agcacccgca gtatcaccag      840

cagcagcagc agcagcaagc gcaacctcgt gcagctgtct cccctgcagc agcttccttt      900

ggtgcccttt ccttgggagc cccgcagcag gcgcagcagg gcatgccgca gctggggatg      960

cctgtgcagg gtctccctcc aaacttggca gccatgggat cccagccgcc gcacatcccc     1020

ttccagcagg ccctggccat gcaggcggcg gctgcggcgg ctgcagccag cggcgcgctc     1080

cccgggagtc tgccccccta catgccaccc ccggggatga tgccccccgg catgccgggg     1140

ggggtccccg gtatgggagg ggtggtgggg catcctcaga tgcccgcccc agggatggac     1200

tttgcgggtt tcaacgggta tggcaacgct gcgggggggc tgatgtttgg cgggcagcag     1260

caggcgcagc acgcgcagca gcacgcgtca gcgcaagcgg gctcgctggc gcagcagcag     1320

gcgcagcaag tatccatggg cttgggcctt atgccccccc cgttggggtt cccgcccacc     1380

tcgctcgccg cgccagcccc gcgctccgca gcaactgagc ccgccgcagc cccactcccc     1440

ctgacgtcct cgccgccagc tgcttcagca ggcggcagcg gcggcccagc agcagctgct     1500

ccgcagcaca gcagcggcgc cgcagcagcc caagcccccc atcaccaccc acagtgctcg     1560

gagcagggag cgggggggct cccgcccccg ctgcccgcgt ccagcgcccc gcagtcctat     1620

cccctccctc ccccctcctc gcaggccgct ttgcatgacc cggacgaaca ctacccccca     1680

ggctcggcag agatgcacca ccagcacctc ccagggctgt gtggctttaa cccggacgac     1740

ctgctggggg ggcagctggg ggacatgggg ttcctggggg agctgggggg ggcggtggga     1800

ggaaagcacg aacaggacga cttcctggac ctgctgctga agggggagga ggagctgtga     1860


<210>  8
<211>  619
<212>  PRT
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  SGI1 polypeptide

<400>  8

Met Ser Gly Ser Ala Gly Ser Gly Gln Ala Thr Leu Arg His Asp Gly 
1               5                   10                  15      


Gly Ser Ala Gly Gly Ser Gly Pro Val Ser Asp Gly Phe Ser Pro Ala 
            20                  25                  30          


Gly Leu Lys Val Leu Val Val Asp Asp Asp Leu Met Cys Leu Lys Val 
        35                  40                  45              


Val Ser Ala Met Leu Lys Arg Cys Ser Tyr Gln Val Ala Thr Cys Ser 
    50                  55                  60                  


Ser Gly Ser Glu Ala Leu Thr Leu Leu Arg Glu Arg Asn Glu Asp Gly 
65                  70                  75                  80  


Ser Ser Asp Gln Phe Asp Leu Val Leu Ser Asp Val Tyr Met Pro Asp 
                85                  90                  95      


Met Asp Gly Phe Lys Leu Leu Glu His Ile Gly Leu Glu Leu Glu Leu 
            100                 105                 110         


Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr Asn Val Val Leu Arg 
        115                 120                 125             


Gly Val Thr His Gly Ala Val Asp Phe Leu Ile Lys Pro Val Arg Ile 
    130                 135                 140                 


Glu Glu Leu Arg Asn Val Trp Gln His Val Val Arg Arg Arg Ser Met 
145                 150                 155                 160 


Ala Leu Ala Arg Thr Pro Asp Glu Gly Gly His Ser Asp Glu Asp Ser 
                165                 170                 175     


Gln Arg His Ser Val Lys Arg Lys Glu Ser Glu Gln Ser Pro Leu Gln 
            180                 185                 190         


Leu Ser Thr Glu Gln Gly Gly Asn Lys Lys Pro Arg Val Val Trp Ser 
        195                 200                 205             


Val Glu Met His Gln Gln Phe Val Asn Ala Val Asn Ser Leu Gly Ile 
    210                 215                 220                 


Asp Lys Ala Val Pro Lys Arg Ile Leu Asp Leu Met Asn Val Glu Gly 
225                 230                 235                 240 


Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr 
                245                 250                 255     


Leu Lys Arg Val Glu Gly Val Gln Ser Gly Ala Ala Ala Ser Lys Gln 
            260                 265                 270         


His Gln His Pro Gln Tyr His Gln Gln Gln Gln Gln Gln Gln Ala Gln 
        275                 280                 285             


Pro Arg Ala Ala Val Ser Pro Ala Ala Ala Ser Phe Gly Ala Leu Ser 
    290                 295                 300                 


Leu Gly Ala Pro Gln Gln Ala Gln Gln Gly Met Pro Gln Leu Gly Met 
305                 310                 315                 320 


Pro Val Gln Gly Leu Pro Pro Asn Leu Ala Ala Met Gly Ser Gln Pro 
                325                 330                 335     


Pro His Ile Pro Phe Gln Gln Ala Leu Ala Met Gln Ala Ala Ala Ala 
            340                 345                 350         


Ala Ala Ala Ala Ser Gly Ala Leu Pro Gly Ser Leu Pro Pro Tyr Met 
        355                 360                 365             


Pro Pro Pro Gly Met Met Pro Pro Gly Met Pro Gly Gly Val Pro Gly 
    370                 375                 380                 


Met Gly Gly Val Val Gly His Pro Gln Met Pro Ala Pro Gly Met Asp 
385                 390                 395                 400 


Phe Ala Gly Phe Asn Gly Tyr Gly Asn Ala Ala Gly Gly Leu Met Phe 
                405                 410                 415     


Gly Gly Gln Gln Gln Ala Gln His Ala Gln Gln His Ala Ser Ala Gln 
            420                 425                 430         


Ala Gly Ser Leu Ala Gln Gln Gln Ala Gln Gln Val Ser Met Gly Leu 
        435                 440                 445             


Gly Leu Met Pro Pro Pro Leu Gly Phe Pro Pro Thr Ser Leu Ala Ala 
    450                 455                 460                 


Pro Ala Pro Arg Ser Ala Ala Thr Glu Pro Ala Ala Ala Pro Leu Pro 
465                 470                 475                 480 


Leu Thr Ser Ser Pro Pro Ala Ala Ser Ala Gly Gly Ser Gly Gly Pro 
                485                 490                 495     


Ala Ala Ala Ala Pro Gln His Ser Ser Gly Ala Ala Ala Ala Gln Ala 
            500                 505                 510         


Pro His His His Pro Gln Cys Ser Glu Gln Gly Ala Gly Gly Leu Pro 
        515                 520                 525             


Pro Pro Leu Pro Ala Ser Ser Ala Pro Gln Ser Tyr Pro Leu Pro Pro 
    530                 535                 540                 


Pro Ser Ser Gln Ala Ala Leu His Asp Pro Asp Glu His Tyr Pro Pro 
545                 550                 555                 560 


Gly Ser Ala Glu Met His His Gln His Leu Pro Gly Leu Cys Gly Phe 
                565                 570                 575     


Asn Pro Asp Asp Leu Leu Gly Gly Gln Leu Gly Asp Met Gly Phe Leu 
            580                 585                 590         


Gly Glu Leu Gly Gly Ala Val Gly Gly Lys His Glu Gln Asp Asp Phe 
        595                 600                 605             


Leu Asp Leu Leu Leu Lys Gly Glu Glu Glu Leu 
    610                 615                 


<210>  9
<211>  302
<212>  PRT
<213>  Coccomyxa subellipsoidea


<220>
<221>  misc_feature
<223>  SGI1 polypeptide, NCBI Accession XP_005652114

<400>  9

Met Gly Leu Lys Ala Arg Ala Ala Ser Val Ser Val His Ser Ser Ala 
1               5                   10                  15      


Asn Asn Thr Ala Ser Pro Leu Ser Ser Gly Arg Arg Gly Phe Pro His 
            20                  25                  30          


Ser Gly Glu Met Ser Gly Glu Asp Leu Ala Arg Ser Asp Ser Trp Glu 
        35                  40                  45              


Met Phe Pro Ala Gly Leu Lys Val Leu Val Val Asp Asp Asp Pro Leu 
    50                  55                  60                  


Cys Leu Lys Val Val Glu His Met Leu Arg Arg Cys Asn Tyr Gln Val 
65                  70                  75                  80  


Thr Thr Cys Pro Asn Gly Lys Ala Ala Leu Glu Lys Leu Arg Asp Arg 
                85                  90                  95      


Ser Val His Phe Asp Leu Val Leu Ser Asp Val Tyr Met Pro Asp Met 
            100                 105                 110         


Asp Gly Phe Lys Leu Leu Glu His Ile Gly Leu Glu Leu Asp Leu Pro 
        115                 120                 125             


Val Ile Met Met Ser Ser Asn Gly Glu Thr Asn Val Val Leu Arg Gly 
    130                 135                 140                 


Val Thr His Gly Ala Val Asp Phe Leu Ile Lys Pro Val Arg Val Glu 
145                 150                 155                 160 


Glu Leu Arg Asn Val Trp Gln His Val Val Arg Arg Lys Arg Asp Gln 
                165                 170                 175     


Ala Val Ser Gln Ala Arg Asp Ser Arg Asp Ile Ser Asp Glu Glu Gly 
            180                 185                 190         


Thr Asp Asp Gly Lys Pro Arg Asp Lys Lys Arg Lys Glu Val Ile Leu 
        195                 200                 205             


Val Leu Trp Trp Asp Met Gln Arg Arg Asp Ser Asp Asp Gly Val Ser 
    210                 215                 220                 


Ala Lys Lys Ala Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe 
225                 230                 235                 240 


Val Gln Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg 
                245                 250                 255     


Ile Leu Asp Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala 
            260                 265                 270         


Ser His Leu Gln Val Pro His Leu Ser Ile Phe Ser Pro Leu Phe Ala 
        275                 280                 285             


Glu Leu Met Ser Thr Leu Pro Arg Arg Cys Phe Tyr Asp Phe 
    290                 295                 300         


<210>  10
<211>  270
<212>  PRT
<213>  Ostreococcus lucimarinus


<220>
<221>  misc_feature
<223>  SGI1 polypeptide, NCBI Accession XP_001415508

<400>  10

Phe Pro Ala Gly Leu Gly Val Leu Val Val Asp Asp Asp Leu Leu Cys 
1               5                   10                  15      


Leu Lys Val Val Glu Lys Met Leu Lys Ala Cys Lys Tyr Lys Val Thr 
            20                  25                  30          


Ala Cys Ser Thr Ala Lys Thr Ala Leu Glu Ile Leu Arg Thr Arg Lys 
        35                  40                  45              


Glu Glu Phe Asp Ile Val Leu Ser Asp Val His Met Pro Asp Met Asp 
    50                  55                  60                  


Gly Phe Lys Leu Leu Glu Ile Ile Gln Phe Glu Leu Ala Leu Pro Val 
65                  70                  75                  80  


Leu Met Met Ser Ala Asn Ser Asp Ser Ser Val Val Leu Arg Gly Ile 
                85                  90                  95      


Ile His Gly Ala Val Asp Tyr Leu Leu Lys Pro Val Arg Ile Glu Glu 
            100                 105                 110         


Leu Arg Asn Ile Trp Gln His Val Val Arg Arg Asp Tyr Ser Ser Ala 
        115                 120                 125             


Lys Ser Ser Gly Ser Glu Asp Val Glu Ala Ser Ser Pro Ser Lys Arg 
    130                 135                 140                 


Ala Lys Thr Ser Gly Ser Asn Ser Lys Ser Glu Glu Val Asp Arg Thr 
145                 150                 155                 160 


Ala Ser Glu Met Ser Ser Gly Lys Ala Arg Lys Lys Pro Thr Gly Lys 
                165                 170                 175     


Lys Gly Gly Lys Ser Val Lys Glu Ala Glu Lys Lys Asp Val Val Asp 
            180                 185                 190         


Asn Ser Asn Ser Lys Lys Pro Arg Val Val Trp Ser Ala Glu Leu His 
        195                 200                 205             


Ala Gln Phe Val Thr Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val 
    210                 215                 220                 


Pro Lys Arg Ile Leu Asp Leu Met Gly Val Gln Gly Leu Thr Arg Glu 
225                 230                 235                 240 


Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu 
                245                 250                 255     


Gln Gly Asn Asp Ala Arg Gly Gly Gly Asn Ala Ser Ser Thr 
            260                 265                 270 


<210>  11
<211>  941
<212>  PRT
<213>  Chlamydomonas reinhardtii


<220>
<221>  misc_feature
<223>  SGI1 polypeptide

<400>  11

Met Asp Ser Gln Gly Val Lys Leu Glu Glu His Pro Gly His Thr Gly 
1               5                   10                  15      


Gly His Trp Gln Gly Phe Pro Ala Gly Leu Arg Leu Leu Val Val Asp 
            20                  25                  30          


Asp Asp Pro Leu Cys Leu Lys Val Val Glu Gln Met Leu Arg Lys Cys 
        35                  40                  45              


Ser Tyr Glu Val Thr Val Cys Ser Asn Ala Thr Thr Ala Leu Asn Ile 
    50                  55                  60                  


Leu Arg Asp Lys Asn Thr Glu Tyr Asp Leu Val Leu Ser Asp Val Tyr 
65                  70                  75                  80  


Met Pro Asp Met Asp Gly Phe Arg Leu Leu Glu Leu Val Gly Leu Glu 
                85                  90                  95      


Met Asp Leu Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr Ser Asn 
            100                 105                 110         


Val Leu Arg Gly Val Thr His Gly Ala Cys Asp Tyr Leu Ile Lys Pro 
        115                 120                 125             


Val Arg Leu Glu Glu Leu Arg Asn Leu Trp Gln His Val Val Arg Arg 
    130                 135                 140                 


Arg Arg Gln His Ala Gln Glu Ile Asp Ser Asp Glu Gln Ser Gln Glu 
145                 150                 155                 160 


Arg Asp Glu Asp Gln Thr Arg Asn Lys Arg Lys Ala Asp Ala Ala Gly 
                165                 170                 175     


Val Thr Gly Asp Gln Cys Arg Leu Asn Gly Ser Gly Ser Gly Gly Ala 
            180                 185                 190         


Ala Gly Pro Gly Ser Gly Gly Gly Ala Gly Gly Met Thr Asp Glu Met 
        195                 200                 205             


Leu Met Met Ser Gly Gly Glu Asn Gly Ser Asn Lys Lys Ala Arg Val 
    210                 215                 220                 


Val Trp Ser Val Glu Met His Gln Gln Phe Val Asn Ala Val Asn Gln 
225                 230                 235                 240 


Leu Gly Ile Asp Lys Ala Val Pro Lys Lys Ile Leu Glu Ile Met Gly 
                245                 250                 255     


Val Asp Gly Ser Ala Gly Arg Leu Ala Asp Thr Ser Gly Arg Asp Val 
            260                 265                 270         


Cys Gly Thr Val Tyr Arg Leu Tyr Leu Lys Arg Val Ser Gly Val Thr 
        275                 280                 285             


Pro Ser Gly His His His Asn Ala Ala His Lys Ser Asn Lys Pro Ser 
    290                 295                 300                 


Pro His Thr Thr Pro Pro Pro Pro Ala Leu Pro Gly Gln Ala Gly Thr 
305                 310                 315                 320 


His Pro Ala Asn Gln Ala Thr Ala Ile Pro Pro Pro Pro Gln Pro Gly 
                325                 330                 335     


Ser Gly Thr Ala Ala Gly Ala Gly Ala Ala Ala Ala Gly Thr Gly Gly 
            340                 345                 350         


Gly Ala Ala Ala Ala Asn Gly His Ala Ala Thr Thr Gly Ala Gly Thr 
        355                 360                 365             


Pro Gly Ala Ala Pro Gly Ala Gly Gly Gly Val Gly Gly Thr Gly Ala 
    370                 375                 380                 


Gly Gly Leu Gly Ser Gly Pro Asp Gly Ala Ala Ala Ala Ala Gly Pro 
385                 390                 395                 400 


Gly Pro Gly Ala Ala Val Pro Gly Gly Leu Gly Gly Leu Pro Leu Pro 
                405                 410                 415     


Pro Gly Ala Gly Pro Gly Pro Gly Pro Gly Gly Phe Gly Gly Pro Ser 
            420                 425                 430         


Pro Pro Pro Pro Pro His Pro Ala Ala Leu Leu Ala Asn Pro Met Ala 
        435                 440                 445             


Ala Ala Val Ala Gly Leu Asn Gln Ser Leu Leu Asn Ala Met Gly Ser 
    450                 455                 460                 


Leu Gly Val Gly Val Gly Gly Met Ser Pro Leu Gly Pro Val Gly Pro 
465                 470                 475                 480 


Leu Gly Pro Leu Gly Gly Leu Pro Gly Leu Pro Gly Met Gln Pro Pro 
                485                 490                 495     


Pro Leu Gly Met Gly Gly Leu Gln Pro Gly Met Gly Pro Leu Gly Pro 
            500                 505                 510         


Leu Gly Leu Pro Gly Met Gly Gly Leu Pro Gly Leu Pro Gly Met Asn 
        515                 520                 525             


Pro Met Ala Asn Leu Met Gln Gly Met Ala Ala Gly Met Ala Ala Ala 
    530                 535                 540                 


Asn Gln Met Asn Gly Met Gly Gly His Met Gly Gly His Met Gly Gly 
545                 550                 555                 560 


Met Asn Gly Pro Met Gly Ala Leu Ala Gly Met Asn Gly Leu Asn Gly 
                565                 570                 575     


Ala Met Met Gly Gly Leu Pro Gly Met Gly Gly Pro Gln Asn Met Phe 
            580                 585                 590         


Gln Ala Ala Ala Ala Ala Ala Ala Gln Gln Gln Gln Gln Gln Gln Glu 
        595                 600                 605             


Gln Gln His Ala Met Met Gln Gln Ala Ala Ala Gly Leu Leu Ala Ser 
    610                 615                 620                 


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Ala 
625                 630                 635                 640 


Leu Gln Gln Gln Gln Gln Gln Gly Met Ala Val Ser Pro Pro Gly Pro 
                645                 650                 655     


His Asn Ala Thr Pro Asn Gly Gln Leu His Thr His Pro Gln Ala His 
            660                 665                 670         


His Pro His Gln His Gly Leu His Ala His Ala His Pro His Gln His 
        675                 680                 685             


Leu Asn Thr Ala Pro Ala Gly Ala Leu Gly Leu Ser Pro Pro Gln Pro 
    690                 695                 700                 


Pro Ala Gly Leu Leu Ser Ala Ser Gly Leu Ser Ser Gly Pro Asp Gly 
705                 710                 715                 720 


Ser Gly Leu Gly Ser Gly Val Gly Gly Leu Leu Asp Gly Leu Gln Gln 
                725                 730                 735     


His Pro His His Pro Gln Leu Gln Leu Ala Gly Ser Leu Gly Thr Gly 
            740                 745                 750         


Gly Thr Gly Arg Ser Ser Gly Ala Ala Gly Arg Gly Ser Leu Asp Leu 
        755                 760                 765             


Pro Ala Asp Leu Met Gly Met Ala Leu Leu Asp Phe Pro Pro Val Pro 
    770                 775                 780                 


Val Pro Gly Gly Ala Asp Val Gly Met Ala Gly Ala Gly Gly Gly Ala 
785                 790                 795                 800 


Ala Gly Ala His His His Gly His Gln Gly His Gln Gly Ile Gly Gly 
                805                 810                 815     


Gly Ala Gly Val Gly Ile Ala Gly Gly Val Gly Cys Gly Val Pro Ala 
            820                 825                 830         


Ala Ala His Gly Leu Glu Pro Ala Ile Leu Met Asp Asp Pro Ala Asp 
        835                 840                 845             


Leu Gly Ala Val Phe Ser Asp Val Met Tyr Gly Thr Pro Gly Gly Gly 
    850                 855                 860                 


Gly Val Pro Gly Gly Val Pro Gly Gly Gly Val Gly Leu Gly Leu Gly 
865                 870                 875                 880 


Ala Gly Gln Val Pro Ser Gly Pro Ala Gly Ala Gly Gly Leu His Ser 
                885                 890                 895     


His His His Gln His His His His Gln His His Leu Gly His Val Val 
            900                 905                 910         


Pro Val Gly Gly Val Asp Pro Leu Ala Gly Asp Ala Ala Lys Met Ala 
        915                 920                 925             


Met Asn Asp Asp Asp Phe Phe Asn Phe Leu Leu Lys Asn 
    930                 935                 940     


<210>  12
<211>  523
<212>  PRT
<213>  Chromochloris zofingiensis


<220>
<221>  misc_feature
<223>  SGI1 polypeptide

<400>  12

Met Asp Gly Phe Lys Leu Leu Glu Thr Val Gly Leu Glu Leu Asp Leu 
1               5                   10                  15      


Pro Val Ile Met Met Ser Ser Asn Gly Glu His Thr Thr Val Met Arg 
            20                  25                  30          


Gly Val Thr His Gly Ala Cys Asp Phe Leu Ile Lys Pro Val Arg Ile 
        35                  40                  45              


Glu Glu Leu Arg Asn Ile Trp Gln His Val Ile Arg Arg Thr Arg His 
    50                  55                  60                  


Pro Val Phe Arg Asp Leu Glu Pro Asp Asp His Glu Gly Gly Asp Tyr 
65                  70                  75                  80  


Glu Ala Ser Lys Lys Arg Lys Asp Leu Tyr Arg Gly Glu Asn Ser Ser 
                85                  90                  95      


Gly Ser Gly Gly Ala Gly Gly Leu Glu Arg Asp Asp Asp Gly Ser Ala 
            100                 105                 110         


Ser Lys Lys Pro Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe 
        115                 120                 125             


Val Gln Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Lys 
    130                 135                 140                 


Ile Leu Glu Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala 
145                 150                 155                 160 


Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Val Gln Gly Val 
                165                 170                 175     


Gln Ala Pro Phe Gly Leu Pro Asn Ile Gln Leu Pro Arg Gln Thr Ser 
            180                 185                 190         


Ser Lys Gly Ala Gly Ser Ser Ser Gln Gln Gln His His Gln Gln Gln 
        195                 200                 205             


Gln His Gln Gln Gln His Gln His Gln His Gln Thr Ala Leu Gly Thr 
    210                 215                 220                 


Gly Gln Gln Gln Ser His Gln Leu Gln Pro Cys Pro Val Ser Thr Ala 
225                 230                 235                 240 


Thr Pro Val Met Pro Ser Pro Asp Ala Met Val Ala Ala Ser Met Met 
                245                 250                 255     


Ser Ser Gln Ala Met Ala Ala Met Ala Pro Gly Val Met Asn Pro Met 
            260                 265                 270         


Thr Ala Met Asn Ser Met Met Ala Gly Leu Asn Pro Asn Met Met Gly 
        275                 280                 285             


Met Ala Ala Gly Leu Gly Leu Ala Gly Leu Gly Ile Gly Gly Met Ala 
    290                 295                 300                 


Gly His Pro Val Pro Asn Pro Met Leu Ala Gly Met Gly Pro Met Gly 
305                 310                 315                 320 


Leu Gly Leu Pro Pro Pro Pro Gly Met Pro Pro Pro Pro Pro Gly Met 
                325                 330                 335     


Pro Pro Gly Met Pro Pro Gly Met Pro Pro Gly Met Pro Ala Met Met 
            340                 345                 350         


Gln Gly Leu Ser Met Ala Gly Met Ser His Leu Ala Ala Ala Gly Met 
        355                 360                 365             


Arg Pro Pro Pro Gly Ala Leu Gly Gly His Leu Gly Gly Pro Gly Leu 
    370                 375                 380                 


Ser Pro Phe Gly Pro Pro Pro Pro Pro Gly Ala Asp Pro Ala Asn Met 
385                 390                 395                 400 


Met Ala Asn Met Ser Ser Met Met Ala Asn Met Gln Ala Ala Leu Ala 
                405                 410                 415     


Phe Gln Ala Asp Ala Ala Ala Ala Ala Gln His Gln Ala Ala Ser Thr 
            420                 425                 430         


Gly Ser Val Ala Pro Gly Arg Gln Gln Gln Val His Gln His Gln Gln 
        435                 440                 445             


Ala Val Gly Met Ala Val Asp Asp Ala Ala Ala Phe Pro Ser Pro Gly 
    450                 455                 460                 


Cys Arg Pro Asn Gly Ser Ala Asp Ala Gly Ala Gln Ser Ala Ala Glu 
465                 470                 475                 480 


Pro Asn Asp Phe Ser Arg Val Phe Asp Asp Pro Phe Ala Gln Pro Ala 
                485                 490                 495     


Ala Ser Pro Ser Gly Ala Ala Ala Ala Gly Ser Asn Glu Ala Pro Gly 
            500                 505                 510         


Met Asp Asp Phe Leu Asp Phe Phe Leu Lys Ser 
        515                 520             


<210>  13
<211>  832
<212>  PRT
<213>  Volvox carteri


<220>
<221>  misc_feature
<223>  SGI1 polypeptide

<400>  13

Met Asp Gly Arg Ala Glu Gly Thr Val Ala Ile Lys Gln Glu Asp His 
1               5                   10                  15      


Ala Ser Gly His Trp His Asn Phe Pro Ala Gly Leu Arg Leu Leu Val 
            20                  25                  30          


Val Asp Asp Asp Pro Leu Cys Leu Lys Val Val Glu Gln Met Leu Arg 
        35                  40                  45              


Lys Cys Ser Tyr Asp Val Thr Thr Cys Thr Asn Ala Thr Met Ala Leu 
    50                  55                  60                  


Asn Leu Leu Arg Asp Lys Ser Thr Glu Tyr Asp Leu Val Leu Ser Asp 
65                  70                  75                  80  


Val Tyr Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Val Val Gly 
                85                  90                  95      


Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr 
            100                 105                 110         


Ser Asn Val Leu Arg Gly Val Thr His Gly Ala Cys Asp Tyr Leu Ile 
        115                 120                 125             


Lys Pro Val Arg Leu Glu Glu Leu Arg Asn Leu Trp Gln His Val Val 
    130                 135                 140                 


Arg Arg Arg Arg Gln Leu Asn Leu Asp Met Asp Ser Asp Glu His Ser 
145                 150                 155                 160 


Gln Glu Arg Asp Asp Asp Gln Gly Arg Lys Arg Lys Ala Asp Thr Ala 
                165                 170                 175     


Gly Cys Ile Gly Asp Gln Leu Arg Met Met Gly Ala Gly Cys Ser Gly 
            180                 185                 190         


Gly Ala Asn Gly Leu Gly Ser Thr Gly Asn Leu Gly Ala Val Ala Thr 
        195                 200                 205             


Gly Ser Ala Gly Leu Gly Leu Gly Leu Gly Thr Ala Ala Asp Glu Leu 
    210                 215                 220                 


Gly Leu Gly Leu Asp Asn Gly Ser Ser Lys Lys Ala Arg Val Val Trp 
225                 230                 235                 240 


Ser Val Glu Met His Gln Gln Phe Val Asn Ala Val Asn Gln Leu Gly 
                245                 250                 255     


Ile Asp Lys Ala Val Pro Lys Lys Ile Leu Glu Ile Met Asn Val Asp 
            260                 265                 270         


Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu 
        275                 280                 285             


Tyr Leu Lys Arg Val Ser Gly Ala Gln Gln Pro Gly Gln Asn Arg Val 
    290                 295                 300                 


Ser Arg Pro Ser Pro Pro Gln Pro Gln Ser Pro Gln Val Pro Ser Gln 
305                 310                 315                 320 


Gln Gln Gln Ser Leu Pro Gly Gly Gly Gly Ala Ala Ala Ala Gly Ala 
                325                 330                 335     


Gly Gln Leu Gln Gly Gly Gly Gly Ala Ala Ala Ala Ala Ala Ser Leu 
            340                 345                 350         


Ala Ser Ile Leu Ala Gly Gly Gly Pro Ala Gly Gly Gly Ala Gly Ala 
        355                 360                 365             


Gly Pro Pro Pro Gly Gly Gly Gln Leu Gly Ala Asp Gly Gly Gly Pro 
    370                 375                 380                 


Gly Pro Gly Leu Ser Ser Ala Val Ala Asn Ala Met Ser Ala Ala Ala 
385                 390                 395                 400 


Ala Ala Gly Gly Phe Pro Thr Pro Pro Pro Pro Pro Pro Pro His Pro 
                405                 410                 415     


Ala Ala Leu Leu Ala Ala Asn Pro Met Met Ala Ala Ala Ala Gly Leu 
            420                 425                 430         


Asn Pro Leu Leu Gly Ala Met Gly Gly Leu Gly Val Gly Pro Leu Gly 
        435                 440                 445             


Pro Leu Asn Pro Leu Asn Gly Met Pro Met Pro Gly Met Gln Pro Pro 
    450                 455                 460                 


Leu Gly Leu Leu Pro Gly Leu Pro Gly Pro Gly Gly Gln Leu Gly Leu 
465                 470                 475                 480 


Gly Pro Leu Gly Pro Ile Gly Leu Pro Gly Pro Gly Pro Leu Pro Ser 
                485                 490                 495     


Leu Pro Ala Gly Leu Pro Leu Asn Pro Met Ala Asn Gly Leu Gln Gln 
            500                 505                 510         


Met Ala Ala Ala Asn Leu Met Gln Gly Met Ala Gly Met Gly Gln Leu 
        515                 520                 525             


Pro Ala Leu Ser Met Asn Gly Met Asn Gly Ile Met Gly Pro Leu Pro 
    530                 535                 540                 


Gly Val Gly Leu Pro Gly Pro Gln Gln His Leu Phe Pro Gln Gln Gln 
545                 550                 555                 560 


Gln Pro His Leu Gln Gln Gln Gln Gln Gln Gln Gln Gln Lys Asp Leu 
                565                 570                 575     


Gln Met Ala Gln Lys Gln His Gln Ala Ala Ala Ala Ala Ala Ala Val 
            580                 585                 590         


Ala Ala Ala Val Ala Ala Ala Gln His Gln Gln Gln Gln Pro Gln Ala 
        595                 600                 605             


Gln Gln Gln Pro Gln Pro Gln Gln Gln Gln Gln Gln Pro Gly Lys Leu 
    610                 615                 620                 


Pro Gln Ala Thr Val Gly Thr Pro Ala Leu Ala Ser Pro Ala Gly Ala 
625                 630                 635                 640 


Leu Pro Arg Gln Pro Ser Gly Gln His Pro His Thr Leu Ser Ser Ser 
                645                 650                 655     


Ser Leu His Thr Gln Gln Pro His Gln Gln Gln Leu Leu His Ser Gln 
            660                 665                 670         


Pro Ser Ser Thr His Leu Ala Thr Asn Asn Thr Leu Ala Met Ala Pro 
        675                 680                 685             


Ala Leu Asn Gly Thr Leu Asp Val Gly Gly Lys Gly His Leu His Ala 
    690                 695                 700                 


Ala Gly Gly Gln Gly Ala Gly Ala Gly Ala Gly Ala Val Leu Asp Ile 
705                 710                 715                 720 


Pro Pro Asp Leu Ile Gly Gly Leu Ile Glu Asp Gly Phe Gly Ala Pro 
                725                 730                 735     


Pro Gly Pro Thr Ile Gln Leu Ala His Gly Thr Ala Ala Val Leu Asp 
            740                 745                 750         


Pro Thr Met Leu Leu Asp Glu Gly Asp Asn Ser Asp Phe Ala Ala Val 
        755                 760                 765             


Phe Gln Glu Met Ser Ser Tyr Gly Gly Gly Gly Val Ile Gly Gly Gly 
    770                 775                 780                 


Gly Ser Gly Ala Gly Ala Met Gly Val Leu Gly His Gly Leu Leu Ala 
785                 790                 795                 800 


Ala Gly Gly Pro Val Met Val Asp Val Ala Ala Gly Leu Ala Gly Val 
                805                 810                 815     


Thr Glu Thr Ala Thr Arg Val Asp Asp Asp Phe Leu Asn Phe Leu Leu 
            820                 825                 830         


<210>  14
<211>  446
<212>  PRT
<213>  Tetraselmis sp.


<220>
<221>  misc_feature
<223>  SGI1 polypeptide 5172

<400>  14

Met Ser Cys Thr Val Ala Ser Phe Pro Pro Ala Ala Gly Gly Gln Gly 
1               5                   10                  15      


Ser Pro Ala Thr Pro Val Pro Tyr Gln Asp Leu Leu Val Lys Arg Gln 
            20                  25                  30          


Asp Gln Trp Ser Asn Phe Pro Ala Gly Leu Arg Val Leu Val Ala Asp 
        35                  40                  45              


Asn Asp Pro Ala Ser Leu Gln Gln Val Glu Lys Met Leu Lys Lys Cys 
    50                  55                  60                  


Ser Tyr Gln Val Thr Leu Cys Ser Ser Gly Lys Asn Ser Leu Glu Ile 
65                  70                  75                  80  


Leu Arg Lys Arg Arg Glu Glu Phe Asp Leu Val Leu Ala Asp Ala Asn 
                85                  90                  95      


Leu Pro Asp Ile Asp Gly Phe Lys Leu Leu His Val Cys His Thr Glu 
            100                 105                 110         


Leu Ser Leu Pro Val Val Leu Met Ser Gly Thr Ser Asp Thr Gln Leu 
        115                 120                 125             


Val Met Arg Gly Val Met Asp Gly Ala Arg Asp Phe Leu Ile Lys Pro 
    130                 135                 140                 


Leu Arg Val Glu Glu Leu Lys Val Leu Trp Gln His Leu Val Arg Phe 
145                 150                 155                 160 


Thr Ser Glu Ile Thr Lys Thr Asp Ala Gln Leu Asn Val Val Lys Val 
                165                 170                 175     


Glu Leu Asp Gly Gly Arg Pro Ala Gly Glu Val Ser Thr Ser Gln Asn 
            180                 185                 190         


Gly Ser Gln Cys Thr Glu Arg Glu Gly Glu Gly Asn Ser Ser Lys Lys 
        195                 200                 205             


Gln Arg Met Asn Trp Ser Asp Glu Met His Gln Gln Phe Val Asn Ala 
    210                 215                 220                 


Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Asp 
225                 230                 235                 240 


Leu Met Ser Val Glu Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu 
                245                 250                 255     


Gln Lys Tyr Arg Ile Tyr Leu Lys Arg Met Ala Asn His Gln Glu Asn 
            260                 265                 270         


Gly Lys Gln Ala Val Met Ser Thr Asp Thr Ile Ala Arg Ala Glu Ala 
        275                 280                 285             


Ala Tyr Gln Gly Gly Met Pro Gln Gly Gln Gln Met Met Gln Gln Glu 
    290                 295                 300                 


His Ser Gly Gln Ala Val Gln Tyr Ser Gln Pro His Ala Pro Gly Gly 
305                 310                 315                 320 


Leu His Gln Gln Ala Met Pro Ala Gln Met His Met Gly Met Met Pro 
                325                 330                 335     


Ala Gly Pro Gln Pro Gly Ser Met Gln Met Ala Pro His His Val Met 
            340                 345                 350         


Gln Met Pro Asn Gly Gln Val Met Val Met Gln Gln Met Gly Pro Arg 
        355                 360                 365             


Pro Gly Met Pro Pro Gly Met Pro Gln Gln Met Met Ala Ser Ser Gln 
    370                 375                 380                 


Gln Met Gly Met Leu Gln Pro Gly Met Pro Ala Gly Gln Met Leu His 
385                 390                 395                 400 


Phe Gln His Pro Gln Gln Val His Gln His Pro Pro Ser Ser Gly Pro 
                405                 410                 415     


Met His Ala Val Gln His Met Glu Tyr Ala Tyr Ser Gln Pro Met Gln 
            420                 425                 430         


Met Ala Gly Trp Pro Val Gln Gly Gln Pro Gly Asn Gln Ala 
        435                 440                 445     


<210>  15
<211>  490
<212>  PRT
<213>  Tetraselmis sp.


<220>
<221>  misc_feature
<223>  SGI1 polypeptide 5185

<400>  15

Met Thr Pro Thr Pro Pro Met Ser Cys Thr Val Ala Ser Phe Pro Pro 
1               5                   10                  15      


Ala Ala Gly Gly Gln Gly Ser Pro Ala Thr Pro Val Pro Tyr Gln Asp 
            20                  25                  30          


Leu Leu Val Lys Arg Gln Asp Gln Trp Ser Asn Phe Pro Ala Gly Leu 
        35                  40                  45              


Arg Val Leu Val Ala Asp Asn Asp Pro Ala Ser Leu Gln Gln Val Glu 
    50                  55                  60                  


Lys Met Leu Lys Lys Cys Ser Tyr Gln Val Thr Leu Cys Ser Ser Gly 
65                  70                  75                  80  


Lys Asn Ser Leu Glu Ile Leu Arg Lys Arg Arg Glu Glu Phe Asp Leu 
                85                  90                  95      


Val Leu Ala Asp Ala Asn Leu Pro Asp Ile Asp Gly Phe Lys Leu Leu 
            100                 105                 110         


His Val Cys His Thr Glu Leu Ser Leu Pro Val Val Leu Met Ser Gly 
        115                 120                 125             


Thr Ser Asp Thr Gln Leu Val Met Arg Gly Val Met Asp Gly Ala Arg 
    130                 135                 140                 


Asp Phe Leu Ile Lys Pro Leu Arg Val Glu Glu Leu Lys Val Leu Trp 
145                 150                 155                 160 


Gln His Leu Val Arg Phe Thr Ser Glu Ile Thr Lys Thr Asp Ala Gln 
                165                 170                 175     


Leu Asn Val Val Lys Val Glu Leu Asp Gly Gly Arg Pro Ala Gly Glu 
            180                 185                 190         


Val Ser Thr Ser Gln Asn Gly Ser Gln Cys Thr Glu Arg Glu Gly Glu 
        195                 200                 205             


Gly Asn Ser Ser Lys Lys Gln Arg Met Asn Trp Ser Asp Glu Met His 
    210                 215                 220                 


Gln Gln Phe Val Asn Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val 
225                 230                 235                 240 


Pro Lys Arg Ile Leu Asp Leu Met Ser Val Glu Gly Leu Thr Arg Glu 
                245                 250                 255     


Asn Val Ala Ser His Leu Gln Lys Tyr Arg Ile Tyr Leu Lys Arg Met 
            260                 265                 270         


Ala Asn His Gln Glu Asn Gly Lys Gln Ala Val Met Ser Thr Asp Thr 
        275                 280                 285             


Ile Ala Arg Ala Glu Ala Ala Tyr Gln Gly Gly Met Pro Gln Gly Gln 
    290                 295                 300                 


Gln Met Met Gln Gln Glu His Ser Gly Gln Ala Val Gln Tyr Ser Gln 
305                 310                 315                 320 


Pro His Ala Pro Gly Gly Leu His Gln Gln Ala Met Pro Ala Gln Met 
                325                 330                 335     


His Met Gly Met Met Pro Ala Gly Pro Gln Pro Gly Ser Met Gln Met 
            340                 345                 350         


Ala Pro His His Val Met Gln Met Pro Asn Gly Gln Val Met Val Met 
        355                 360                 365             


Gln Gln Met Gly Pro Arg Pro Gly Met Pro Pro Gly Met Pro Gln Gln 
    370                 375                 380                 


Met Met Ala Ser Ser Gln Gln Met Gly Met Leu Gln Pro Gly Met Pro 
385                 390                 395                 400 


Ala Gly Gln Met Leu His Phe Gln His Pro Gln Gln Val His Gln His 
                405                 410                 415     


Pro Pro Ser Ser Gly Pro Met His Ala Gly Gly Glu Met Ile Asp Pro 
            420                 425                 430         


Gly Ser Met Gln Arg Leu His Gln Gln Pro His Tyr Ile Gly Pro Asn 
        435                 440                 445             


Gly Gln His Met Pro Ala Pro Ala Met Gly Met Pro Ser Gly Thr Val 
    450                 455                 460                 


Gln His Met Glu Tyr Ala Tyr Ser Gln Pro Met Gln Met Ala Gly Trp 
465                 470                 475                 480 


Pro Val Gln Gly Gln Pro Gly Asn Gln Ala 
                485                 490 


<210>  16
<211>  574
<212>  PRT
<213>  Tetraselmis sp.


<220>
<221>  misc_feature
<223>  SGI1 polypeptide 5230

<400>  16

Met Thr Met Pro Leu Gly Gly Gly Leu Cys Met Lys Asp Arg Ile His 
1               5                   10                  15      


Gly Asp Glu Arg Tyr Arg Ser Lys Ala Lys Arg Gln Val Asn Thr Ile 
            20                  25                  30          


Phe Ala Phe Thr Gln Arg Asn Thr Trp Arg Gly Arg Phe Arg Leu Cys 
        35                  40                  45              


Ser Tyr Arg Thr Thr Glu Leu Leu Gly Gly Ser Lys Thr Thr Glu Pro 
    50                  55                  60                  


Gly Arg Gly Thr Phe Val Leu Gln Ile Phe Met Cys Val Lys Asn Ala 
65                  70                  75                  80  


Ser Ile Asp Asp Gly Ser Arg His Ile Ser Thr Ser Arg Gly Leu Glu 
                85                  90                  95      


Ser Val Leu Lys Arg Arg Gly Gly Gln Gly Ala Pro Ala Ala Pro Val 
            100                 105                 110         


Pro Tyr His Asp Leu Leu Val Lys Arg Gln Asp Gln Trp Ser Asn Phe 
        115                 120                 125             


Pro Ala Gly Leu Arg Val Leu Val Ala Asp Asn Asp Pro Ala Ser Leu 
    130                 135                 140                 


Gln Gln Val Glu Lys Met Leu Lys Lys Cys Ser Tyr Gln Val Thr Leu 
145                 150                 155                 160 


Cys Ser Ser Gly Lys Asn Ser Leu Glu Ile Leu Arg Lys Arg Arg Glu 
                165                 170                 175     


Glu Phe Asp Leu Val Leu Ala Asp Ala Asn Leu Pro Asp Ile Asp Gly 
            180                 185                 190         


Phe Lys Leu Leu His Val Cys His Thr Glu Leu Ser Leu Pro Val Val 
        195                 200                 205             


Leu Met Ser Gly Thr Ser Asp Thr Gln Leu Val Met Arg Gly Val Met 
    210                 215                 220                 


Asp Gly Ala Arg Asp Phe Leu Ile Lys Pro Leu Arg Val Glu Glu Leu 
225                 230                 235                 240 


Lys Val Leu Trp Gln His Leu Val Arg Phe Thr Ser Glu Ile Thr Lys 
                245                 250                 255     


Thr Asp Ala Gln Leu Asn Val Val Lys Val Glu Leu Asp Ser Gly Arg 
            260                 265                 270         


Pro Ala Gly Glu Val Ser Thr Ser Gln Asn Gly Ser Gln Cys Ala Glu 
        275                 280                 285             


Arg Glu Gly Glu Gly Asn Ser Ser Lys Lys Gln Arg Met Asn Trp Ser 
    290                 295                 300                 


Asp Glu Met His Gln Gln Phe Val Asn Ala Val Asn Gln Leu Gly Ile 
305                 310                 315                 320 


Asp Lys Ala Val Pro Lys Arg Ile Leu Asp Leu Met Ser Val Glu Gly 
                325                 330                 335     


Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Ile Tyr 
            340                 345                 350         


Leu Lys Arg Met Ala Asn His Gln Glu Asn Gly Lys Gln Ala Val Met 
        355                 360                 365             


Ser Thr Asp Thr Ile Ala Arg Ala Glu Ala Ala Tyr Gln Gly Gly Met 
    370                 375                 380                 


Pro Gln Gly Gln Gln Met Met Gln Gln Glu His Ser Gly Gln Ala Val 
385                 390                 395                 400 


Gln Tyr Ser Gln Pro His Ala Pro Ser Gly Leu His Gln Gln Ala Met 
                405                 410                 415     


Pro Ala Gln Met His Met Gly Met Met Pro Ala Gly Pro Gln Pro Gly 
            420                 425                 430         


Ser Met Gln Met Ala Pro His His Val Met Gln Met Pro Asn Gly Gln 
        435                 440                 445             


Val Met Val Met Gln Gln Met Gly Pro Arg Pro Gly Met Pro Pro Gly 
    450                 455                 460                 


Met Pro Gln Gln Met Met Ala Ser Ser Gln Gln Met Gly Met Leu Gln 
465                 470                 475                 480 


Pro Gly Met Pro Ala Gly Gln Met Leu His Phe Gln His Pro Gln Gln 
                485                 490                 495     


Val His Gln His Pro Pro Ser Ser Gly Pro Met His Ala Gly Gly Glu 
            500                 505                 510         


Met Ile Asp Pro Gly Ser Met Gln Arg Leu His Gln Gln Pro His Tyr 
        515                 520                 525             


Ile Val Pro Asn Ala Gln His Met Pro Ala Pro Ala Met Gly Met Pro 
    530                 535                 540                 


Pro Gly Ala Val Gln His Met Glu Tyr Ala Tyr Ser Gln Pro Met Gln 
545                 550                 555                 560 


Met Ala Gly Trp Pro Val Gln Gly Gln Pro Gly Ser Gln Ala 
                565                 570                 


<210>  17
<211>  674
<212>  PRT
<213>  Oocystis sp.


<220>
<221>  misc_feature
<223>  SGI1 polypeptide 5549

<400>  17

Met Leu Ala Phe Thr His Gln Arg Met Thr Thr Ala Pro Ala Leu Ala 
1               5                   10                  15      


Val Ala Thr Ser His Phe Phe Ala His Val Arg Val Thr Thr Gly Ser 
            20                  25                  30          


Ser Ala Ile Ala Thr Val Phe Ala Ala Arg Ser Arg Gly Ser Gly Leu 
        35                  40                  45              


Leu Ala Gly Phe Asn Thr Met Glu Asn Val Lys Val Glu Val Pro Glu 
    50                  55                  60                  


Val Val Pro Glu Asn Val Asn Phe Pro Ala Gly Leu Lys Val Leu Val 
65                  70                  75                  80  


Val Asp Asp Asp Pro Leu Cys Leu Lys Val Ile Asp Gln Met Leu Arg 
                85                  90                  95      


Arg Cys Asn Tyr Ala Ala Thr Thr Cys Gln Ser Ser Leu Glu Ala Leu 
            100                 105                 110         


Glu Leu Leu Arg Ser Ser Lys Glu Asn His Phe Asp Leu Val Leu Ser 
        115                 120                 125             


Asp Val Tyr Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Ile Ile 
    130                 135                 140                 


Gly Leu Glu Met Gly Leu Pro Val Ile Met Met Ser Ser Asn Gly Glu 
145                 150                 155                 160 


Thr Gly Val Val Phe Arg Gly Val Thr His Gly Ala Val Asp Phe Leu 
                165                 170                 175     


Ile Lys Pro Val Arg Ile Glu Glu Leu Arg Asn Leu Trp Gln His Val 
            180                 185                 190         


Val Arg Lys Thr Met Val Val Pro Ser Asn Asp Lys Ala Thr Ser Glu 
        195                 200                 205             


Glu Asp Gly Glu Glu Ser Lys His Arg Val Asp Arg Lys Arg Lys Glu 
    210                 215                 220                 


Ser Phe His Ser Arg Ala Arg Glu Gln Val Glu Ile Ala Cys Ser Val 
225                 230                 235                 240 


Val Pro Ala Leu Leu Trp Pro Thr Val Pro Pro Ser Ser Val His Pro 
                245                 250                 255     


Thr Ser Ser Ser Phe Leu Arg Ser His Val Leu Leu Leu Gln Arg Ser 
            260                 265                 270         


Ser Gly Gly Lys Asp Val Leu Asp Glu Gly Gly Ser Asn Ala Lys Lys 
        275                 280                 285             


Pro Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe Val Asn Ala 
    290                 295                 300                 


Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Asp 
305                 310                 315                 320 


Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu 
                325                 330                 335     


Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Val Ala Gly Ile Asn Thr Ala 
            340                 345                 350         


Thr Gly Ser Arg Asn Gly Lys Gly Arg Ser Asp Val Ser Gly Leu Ser 
        355                 360                 365             


Gly Met Pro Asn Gly Ser Leu Pro Met Pro Gly Met Met Pro Pro His 
    370                 375                 380                 


Met Ala Ala Gly Met Leu Leu Ala Gly Met Ala Ala Asp Val Gly Pro 
385                 390                 395                 400 


Arg Pro His Pro Phe Pro Ile Met Pro Met Pro Ala Met Ala Leu Gln 
                405                 410                 415     


Gly Met His Gly Gly Met Ala Gln Met Met Gln Leu Pro Pro Gly Met 
            420                 425                 430         


Pro Pro Pro Met Met Met Pro Met Ala Pro Leu Leu Pro Ser Gln Leu 
        435                 440                 445             


Ala Ala Leu Gly Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Val Ala 
    450                 455                 460                 


Arg Ser Glu Ser Met Pro Ser Glu Asn Gly Val Ala Gly Pro Ser Gly 
465                 470                 475                 480 


Ser Phe Thr Ala Met Leu Asn Gly Pro Ala Pro Met Glu Ser Ser Pro 
                485                 490                 495     


Phe Ala Ala Leu Gln Val Phe Gly Pro Pro Gln Gly Met Glu Gln Leu 
            500                 505                 510         


Thr Gln Gln Gln Gln Gln Gln Gln Gln Ala Gly Ala Ala Ala Phe Val 
        515                 520                 525             


Ala Ala Phe Ala Ala Ala Asn Gly Gly Asp Met Gln Gly Gly Gly Gly 
    530                 535                 540                 


Gly Pro Gly Pro Met Leu Gly Gly Ala Gly Gly Ala Gly Pro Leu Leu 
545                 550                 555                 560 


Gly Gly Val Gly Gly Gly Asp Pro Leu His Gly Gly Gly Gly Ser Ser 
                565                 570                 575     


Ala Leu Gly Gly Arg Pro Met Met Ser Ala Glu Gln Pro Met Gly Gly 
            580                 585                 590         


Ser Gly Gly Leu Ala Ser Asn Ser Leu Thr Val Gln Gln Asn Asp Leu 
        595                 600                 605             


Ala Gln Met Cys Ser Gln Leu Asp Val Asn Gly Leu Gln Ala Val Ala 
    610                 615                 620                 


Ala Ala Ala Ala Ala Gly Ala Met Gly Ala Pro Gly Gly Ala Gly Gly 
625                 630                 635                 640 


Ala Met Pro Pro Ser Ser Val Gly Gly Val Gly Pro Asp Met Lys Leu 
                645                 650                 655     


Thr Glu Gln Asp Asp Phe Phe Ser Phe Leu Leu Lys Asp Ser Asn Leu 
            660                 665                 670         


Ile Asp 
        


<210>  18
<211>  488
<212>  PRT
<213>  Micromonas sp.


<220>
<221>  misc_feature
<223>  RCC299, SGI1 polypeptide

<400>  18

Met Ser Thr Pro Ala Val Ser Lys Gly Phe Pro Ile Gly Leu Arg Val 
1               5                   10                  15      


Leu Val Val Asp Asp Asp Pro Leu Cys Leu Lys Ile Val Glu Lys Met 
            20                  25                  30          


Leu Lys Arg Cys Gln Tyr Glu Val Thr Thr Phe Ser Arg Gly Ala Glu 
        35                  40                  45              


Ala Leu Lys Thr Leu Arg Glu Arg Lys Asp Asp Phe Asp Ile Val Leu 
    50                  55                  60                  


Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His 
65                  70                  75                  80  


Ile Ala Leu Glu Leu Asp Ile Pro Val Met Met Met Ser Ala Asn Cys 
                85                  90                  95      


Ala Thr Asp Val Val Leu Arg Gly Ile Ile His Gly Ala Val Asp Tyr 
            100                 105                 110         


Leu Leu Lys Pro Val Arg Ile Glu Glu Leu Arg Asn Ile Trp Gln His 
        115                 120                 125             


Val Val Arg Arg Lys Arg Glu Ser Ser Gln Gly Asn Leu Arg Ser Gly 
    130                 135                 140                 


Glu Gly Gly Ser Asn Gly Arg Thr Val Ser Gly Gly Ser Thr Gly Glu 
145                 150                 155                 160 


Gly Gly Gly Lys Asp Ser Lys Gly Ser Ser Glu Gln His Gly Asp Ala 
                165                 170                 175     


Lys Asp Lys Thr Gly Ser Ala Gly Gly Ser Gly Gly Ser Ser Lys Arg 
            180                 185                 190         


Lys Lys Gly Ser Gly Lys Lys Gly Asp Glu Gly Thr Asp Glu Val Lys 
        195                 200                 205             


Asp Gly Ser Gly Gly Asp Glu Asn Glu Asp Ser Ser Ala Leu Lys Lys 
    210                 215                 220                 


Pro Arg Val Val Trp Ser Ala Glu Leu His Gln Gln Phe Val Thr Ala 
225                 230                 235                 240 


Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Asp 
                245                 250                 255     


Leu Met Gly Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu 
            260                 265                 270         


Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Gln Gly Val Asn Ser Gly 
        275                 280                 285             


Gly Ala Pro Gly Gly Gly Pro Gly Phe Met Ser Pro Ile Ala Leu Asp 
    290                 295                 300                 


Gly Ser Met Val Gln Gly Gly Pro Gly Gly Arg Val Gly Ser Pro Ala 
305                 310                 315                 320 


Ile Gly Gly Pro Asn Gly Pro Ile Met Val Gly His Gly His Ile Asp 
                325                 330                 335     


Pro Ala Met Leu Ala Gly Gly Ala Pro Gln Thr Ile Gln Met Gly Met 
            340                 345                 350         


Val Tyr Gly Gly Pro Gly Met Gly Pro Pro Gln Met Met Ala Pro Asn 
        355                 360                 365             


Gly Lys Gly Gly Gly Gly Met Pro Gly Gly Tyr Val Met Gln Pro Gly 
    370                 375                 380                 


Gln Met Met Ala Pro Asn Gly Gln Met Met Pro Val Gly Gln Met Gly 
385                 390                 395                 400 


Pro Gly Gly Met Met Val Gln Gly Pro Gly Gly Gly Met Met Gln Met 
                405                 410                 415     


His Asp Gly Gly Met Met Asn Gly Asn Gly Ser Tyr Gly Ser Leu Gln 
            420                 425                 430         


Asn Met Lys Gln Gly Asn Gly Val Val Met Met Pro Asn Gly Gly Met 
        435                 440                 445             


Gly Gly Val Asp Gly Ala Ile Pro Asn Met Ala Thr Gly Leu Ile Asn 
    450                 455                 460                 


Gly Gln Gly Leu Pro Asp Asp Asp Val Leu Asp Met Phe Leu Lys Asp 
465                 470                 475                 480 


Gly Leu Pro Glu Gly Glu Gly Phe 
                485             


<210>  19
<211>  544
<212>  PRT
<213>  Micromonas pusilla


<220>
<221>  misc_feature
<223>  SGI1 polypeptide

<400>  19

Met Thr Ala Glu Lys Lys Glu Leu Lys Val Phe Pro Ala Gly Leu Arg 
1               5                   10                  15      


Val Leu Val Val Asp Asp Asp Pro Leu Cys Leu Arg Ile Val Glu Lys 
            20                  25                  30          


Met Leu Lys Arg Cys Gln Tyr Glu Val Thr Thr Phe Ser Arg Gly Ala 
        35                  40                  45              


Glu Ala Leu Glu Thr Leu Arg Ala Arg Arg Asp Asp Phe Asp Ile Val 
    50                  55                  60                  


Leu Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu 
65                  70                  75                  80  


His Ile Ala Leu Glu Leu Asp Val Pro Val Met Met Met Ser Ala Asn 
                85                  90                  95      


Cys Ala Thr Asp Val Val Leu Arg Gly Ile Ile His Gly Ala Val Asp 
            100                 105                 110         


Tyr Leu Leu Lys Pro Val Arg Leu Glu Glu Leu Arg Asn Ile Trp Gln 
        115                 120                 125             


His Val Val Arg Arg Gln Arg Glu Pro Ser Lys Asp Gly Ala Ala Gly 
    130                 135                 140                 


Lys Gly Gly Gly Ala Ser Gly Ala Pro Glu Val Ser Gly Asp Thr His 
145                 150                 155                 160 


Ala Asn Thr Asp Asp Lys Gln Asp Gly Asn Ala Thr Asp Ser Lys Gly 
                165                 170                 175     


Ser Gly Ser Gln Lys Arg Lys Ser Gly Lys Ser Gly Asp Asp Gly Gly 
            180                 185                 190         


Lys Asp Gly Gly Gly Ser Gly Gly Lys Asp Gly Asp Ala Ser Asn Lys 
        195                 200                 205             


Gly Asn Asn Asn Lys Arg Lys Lys Gly Lys Ser Asn Asp Ala Thr Glu 
    210                 215                 220                 


Thr Ala Gly Gly Ala Gly Val Glu Asp Asn Asp Asp Thr Ser Gly Leu 
225                 230                 235                 240 


Lys Lys Pro Arg Val Val Trp Ser Pro Glu Leu His Gln Gln Phe Val 
                245                 250                 255     


Thr Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile 
            260                 265                 270         


Leu Asp Leu Met Gly Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser 
        275                 280                 285             


His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Gln Gly Val Asn 
    290                 295                 300                 


Asn Asn Gly Thr Val Pro Ser Gly Ala Ala Gly Phe Met Thr Gly Leu 
305                 310                 315                 320 


Ala Ile Asp Gly Val Gly Gly Val Met Gly Pro Pro Thr Thr Gly Ser 
                325                 330                 335     


Pro Ala Met Asn Gly Pro Gly Gly Pro Gly Gly Gly Leu Val Met Gly 
            340                 345                 350         


Pro Gly His Met Gly Gly Pro His Met Asp Gly Ser Gly Met Met His 
        355                 360                 365             


Met Gly Pro Gly Gly Pro Met Ala Gly Met Thr Val Val Tyr Gly Gly 
    370                 375                 380                 


Gly Met Pro Gly Gly Met Pro Gly Gly Ala Asp Ser Lys Asn Gly Ala 
385                 390                 395                 400 


Ser Gly Gln Pro Pro Pro Gly Gly Tyr Val Val Met Gly Gly Pro His 
                405                 410                 415     


Gly Gly Gly Pro Gly Gly Ala Pro Met Met Met Gln His Gly Gly Met 
            420                 425                 430         


Val Pro Gly Pro Gly Pro Gly Leu Val Pro Gly Pro Gly Gly Ser Leu 
        435                 440                 445             


Met Met Pro Ala Gly Met Met Pro Asp Gly Gly Gly Gly Met Val Gly 
    450                 455                 460                 


Val His Val Gly Pro Gly Val Val Met Gly Gln His Gln Leu Gly Gly 
465                 470                 475                 480 


Lys His Ser Ser Gly Gly Ala Gly Met Ala Gly Gly Ser Ala Ala Gly 
                485                 490                 495     


Lys Gly Ala Gln Arg Gly Gly Val Gly Gly Ala Phe Asp Val Pro Pro 
            500                 505                 510         


Thr Asn Gly Ser Leu Asp Ala Asp Glu Ile Gly Asp Asp Val Leu Thr 
        515                 520                 525             


Met Phe Leu Lys Asp Gly Leu Pro Glu Met Asn Asp Gly Asp Ala Leu 
    530                 535                 540                 


<210>  20
<211>  776
<212>  PRT
<213>  Sphagnum fallax


<220>
<221>  misc_feature
<223>  SGI1 polypeptide

<400>  20

Met Ser Gly Gly Asp Leu Ser Arg Val Arg Glu Gly Thr Ala Asp Leu 
1               5                   10                  15      


Asp Pro Val Met Ala Ser His Gln His Pro Pro Pro Arg Gln Gln Ser 
            20                  25                  30          


His Gln Gln Pro Lys Asn His Gln Gln Glu Ala His Gln Gln His Cys 
        35                  40                  45              


Ser Ser Ala Glu Thr Thr Ser Pro Asn Asn Thr Ala Arg Gly Ala Gly 
    50                  55                  60                  


Ala Thr Tyr Gly Lys Met Glu Pro Ala Asp Asp Phe Pro Ala Gly Leu 
65                  70                  75                  80  


Arg Ile Leu Val Val Asp Asp Asp Pro Thr Cys Leu Ala Ile Leu Lys 
                85                  90                  95      


Lys Met Leu Gln Gln Cys Ser Tyr Gln Val Thr Thr Cys Gly Arg Ala 
            100                 105                 110         


Thr Arg Ala Leu Glu Leu Leu Arg Glu Asp Lys Asp Lys Phe Asp Leu 
        115                 120                 125             


Val Ile Ser Asp Val Tyr Met Pro Asp Met Asp Gly Phe Lys Leu Leu 
    130                 135                 140                 


Glu Leu Val Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Gly 
145                 150                 155                 160 


Asn Gly Glu Thr Ser Val Val Met Lys Gly Ile Thr His Gly Ala Cys 
                165                 170                 175     


Asp Tyr Leu Leu Lys Pro Val Arg Ile Glu Glu Leu Ser Asn Ile Trp 
            180                 185                 190         


Gln His Val Val Arg Lys Leu Arg Ser Glu Pro Lys Glu His Ser Ala 
        195                 200                 205             


Ser Leu Glu Asp Gly Asp Arg Gln Arg Arg Gly Gly Ala Glu Asp Ala 
    210                 215                 220                 


Asp Asn Thr Ser Ser Ala Ala Asp Thr Ala Asp Gly Ile Trp Arg Asn 
225                 230                 235                 240 


Lys Lys Lys Lys Glu Ala Lys Glu Asp Glu Glu Asp Phe Glu Gln Asp 
                245                 250                 255     


Asn Asp Asp Pro Ser Thr Leu Lys Lys Pro Arg Val Val Trp Ser Val 
            260                 265                 270         


Glu Leu His Gln Gln Phe Val Ser Ala Val Asn Gln Leu Gly Ile Asp 
        275                 280                 285             


Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Ser Val Gln Gly Leu 
    290                 295                 300                 


Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu 
305                 310                 315                 320 


Lys Arg Leu Ser Gly Val Thr Ser Gln Ser Asn Ser Leu Asn Val Ser 
                325                 330                 335     


Phe Gly Gly Pro Asp Ala Gly Tyr Gly Gly Leu Phe Gly Leu Asp Glu 
            340                 345                 350         


Met Ser Asp Tyr Arg Asn Leu Val Thr Asn Gly His Leu Pro Ala Gln 
        355                 360                 365             


Thr Ile Ala Ala Leu His His Ala Asn Met Ala Gly Arg Leu Gly Ala 
    370                 375                 380                 


Ser Ser Gly Met Val Gly Pro Ser Ser Pro Leu Asp Pro Ser Val Leu 
385                 390                 395                 400 


Ala Gln Ile Ala Ala Leu Gln Ser Gly Ser Leu Pro Arg Pro Gly Met 
                405                 410                 415     


Asp Gly Ser Leu Gln Gly Asn Gln Ala Gly Leu Leu Gln Ser Leu Ser 
            420                 425                 430         


Gly Ala Leu Asp Tyr Asn Ser Leu His Gln Ser His Leu Leu Pro Ala 
        435                 440                 445             


Ile Gly Gln Leu Gly Gln Leu Asp Glu Leu Pro Ser Leu Lys Ser Met 
    450                 455                 460                 


Gln His Gln Leu Gly Met Gly Ser Leu Gly Gly Ser Thr Arg Asn Leu 
465                 470                 475                 480 


Ala Gly Ser Pro Asn Glu Glu Leu Thr Met Gln Leu Leu Gln Gln Arg 
                485                 490                 495     


Ala Gln Gln Gln Ser Gly Gly Ser Pro Ile Asn Leu Pro Gln Ala Thr 
            500                 505                 510         


Gly Ile Leu Arg Pro Leu Ser Ser Asn Ile Asn Gln Gly Gly Ser Val 
        515                 520                 525             


Pro Asn Leu Val Gly Val Ile Pro Gly Thr Ala Ile Gly Leu Ser Asn 
    530                 535                 540                 


Met Cys Ser Gly Gly Arg Glu Phe Gly Ser Ser Ser Gly Leu Leu Ser 
545                 550                 555                 560 


Ala Ser Gly Ser Leu Met Gln Ser Ser Thr Val Glu Ala Gln Asn Leu 
                565                 570                 575     


Asn Phe Gly Gly Ser Ser Gly Ser Ser Gly Cys Ser Phe Gln Ala Ser 
            580                 585                 590         


Val Leu Ser Ser Lys Thr Gly Gly Leu Glu Asp Leu Asn Pro Ala Lys 
        595                 600                 605             


Arg Val Arg Thr Thr Tyr Ser Ala Leu Ser His Ser Ser Pro Asp Leu 
    610                 615                 620                 


Gly Gln Ser Ser Arg Pro Ala Trp Leu Gly Ser Gln Glu Gly Leu Val 
625                 630                 635                 640 


His Gly Asp Pro Val Tyr Ser Pro His Gln Leu Ser Leu Pro Arg Gln 
                645                 650                 655     


Asp Ile Val Gly Gly Ile Gly Ser Ser Gly Arg Pro Ala Tyr Met Gly 
            660                 665                 670         


Ser Gln Ser Met Gly Ser Leu Gly Met Asn Phe Pro Leu Ser Leu Ala 
        675                 680                 685             


Val Asp Ala Gly Ala Val Arg Pro Ser Leu Thr Arg Gly Gln Ser Leu 
    690                 695                 700                 


Thr Glu Gln Val Ala Ala Asn Arg Glu Leu Lys Phe Pro Lys Glu Glu 
705                 710                 715                 720 


Arg Gly Arg Asp Asn Leu Met Cys Ala Arg Leu Gly Gly Gly Met Ile 
                725                 730                 735     


Thr Asn Glu Ser Ser Ser Glu Glu Leu Leu Asn Tyr Leu Lys Gln Ser 
            740                 745                 750         


His Glu Gly Leu Gly Phe Met Glu Gly Asp Leu Val Ser Asp Gly Tyr 
        755                 760                 765             


Pro Val Asp Asn Leu Tyr Val Lys 
    770                 775     


<210>  21
<211>  715
<212>  PRT
<213>  Physcomitrella patens


<220>
<221>  misc_feature
<223>  SGI1 polypeptide

<400>  21

Met Gly Gly Gly Tyr Leu Ser Ser Thr Val Asn Met Gly Glu Ser Arg 
1               5                   10                  15      


Asp Gly Gly Ser Pro Ala Met Ala Thr Leu Gln Gln Gln Gln Lys His 
            20                  25                  30          


Gln Pro Leu Asn Pro Asn His Gln Asn Pro Arg Asn Arg Ser Asn Ser 
        35                  40                  45              


Ser Pro Thr Asn Cys Tyr Ser Asn Thr Ala Trp Gly Ala Lys Pro Ala 
    50                  55                  60                  


Lys Leu Asp Thr Pro Asp Glu Phe Pro Val Gly Met Arg Val Leu Val 
65                  70                  75                  80  


Val Asp Asp Asn Pro Thr Cys Leu Met Ile Leu Glu Gln Met Leu Val 
                85                  90                  95      


Arg Cys Ala Tyr Arg Val Thr Thr Cys Gly Lys Ala Thr Glu Ala Leu 
            100                 105                 110         


Ser Met Leu Arg Glu Asp Ile Gly Lys Phe Asp Val Val Ile Ser Asp 
        115                 120                 125             


Val Asp Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Leu Val Gly 
    130                 135                 140                 


Leu Glu Met Asp Leu Pro Val Ile Met Val Ser Gly Asn Gly Glu Thr 
145                 150                 155                 160 


Ser Ala Val Met Lys Gly Ile Thr His Gly Ala Cys Asp Tyr Leu Leu 
                165                 170                 175     


Lys Pro Val Arg Ile Glu Glu Leu Arg Asn Ile Trp Gln His Val Val 
            180                 185                 190         


Arg Lys Lys Arg Arg Glu Val Lys Ala Val Ala Thr Lys Ser Val Glu 
        195                 200                 205             


Glu Ala Gly Gly Cys Glu Arg Pro Lys Arg Gly Gly Gly Ala Asp Asp 
    210                 215                 220                 


Ala Asp Tyr Thr Ser Ser Ala Thr Asp Thr Thr Asp Ser Asn Trp Lys 
225                 230                 235                 240 


Leu Thr Lys Arg Arg Lys Gly Glu Phe Lys Asp Glu Asn Glu Glu Asp 
                245                 250                 255     


Asn Glu Gln Glu Asn Asp Asp Pro Ser Thr Leu Lys Arg Pro Arg Val 
            260                 265                 270         


Val Trp Ser Val Glu Leu His Gln Gln Phe Val Ser Ala Val Asn Gln 
        275                 280                 285             


Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Gly 
    290                 295                 300                 


Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr 
305                 310                 315                 320 


Arg Leu Tyr Leu Lys Arg Leu Ser Gly Val Thr Ser Gln Gln Gly Asn 
                325                 330                 335     


Met Ser Ala His Phe Gly Gly Ser Asp Pro Phe Cys Met Met Pro Pro 
            340                 345                 350         


Asp Met Ser Leu Ala Asn Gly Gln Leu Thr Pro Gln Ala Leu Ala Lys 
        355                 360                 365             


Phe His Met Leu Gly Arg Met Asn Ala Thr Asn Gly Ile Gly Phe Ser 
    370                 375                 380                 


Gly Gly Gly Leu Asp Pro Gly Met Asn Gln Met Phe Leu Gln Asp Leu 
385                 390                 395                 400 


Pro Arg Pro Pro Gln Leu Asn Ser Met Leu Arg Asn Asn Thr Gly Leu 
                405                 410                 415     


Leu Ala Ser Val Pro Asn Gly Leu Gln His Leu Glu Gln Leu Ser Glu 
            420                 425                 430         


Pro His His Val His Val Val Asn Glu Leu Glu His Tyr Pro Ser Asn 
        435                 440                 445             


Thr Lys Val Tyr Pro Gln Leu Asn Gly Asn Leu Asp Val Ser Val Gly 
    450                 455                 460                 


Pro Leu Gly Ala Ala Asn Gly Asn Leu Ala Ser Asn Pro Asn Ser Asp 
465                 470                 475                 480 


Thr Leu Leu Met His Ile Leu His Ser Arg Ala Ser Gln Gln Gly Val 
                485                 490                 495     


Gly Ser Pro Ser Thr Leu Pro Gln Pro Arg Cys Gly Leu Asn Pro Thr 
            500                 505                 510         


His Leu Leu Ser Asn Asp Ile Asn Phe Ala Pro Val Gly Ser Leu Pro 
        515                 520                 525             


Asn Leu Ala Gly Ser Leu Gly Pro Ala Val Gly Leu Ser Ala Ile Pro 
    530                 535                 540                 


Gly Ser Ala Gly Gly Arg Asp Leu Ser Pro Ser Val Gly Gly Ser Gly 
545                 550                 555                 560 


Ala Ser Leu Ser Ser Pro Leu Gly Ser Leu Val Arg Arg Pro Leu Met 
                565                 570                 575     


Ala Glu Glu Gln Ser Asn Pro Val Asn Ser Thr Asn Gly Thr Tyr Ser 
            580                 585                 590         


Met Ala His Ser Gly Gln Ser Pro Lys Pro Ser Gly Asp Thr Leu Pro 
        595                 600                 605             


Thr Pro Leu Asn Glu Gly Leu Glu Gln Gln Gln Pro Leu Trp Ala Leu 
    610                 615                 620                 


Tyr Gln Asn Pro Met Asn Gln Leu Ser His Gly Pro Ser Gln Gly Phe 
625                 630                 635                 640 


Pro His Asp Ser Leu Gln Trp Ser Val Leu Thr Glu Asn Leu Ser Phe 
                645                 650                 655     


Gly Asp Met Gly Gln Ser Leu Ser Ala Gly Leu Ile Ser Gln Phe Ser 
            660                 665                 670         


Ser Gln Gly Gln Asp Asn Gly Ile Gly Phe Ala Pro Pro Ser Gln Arg 
        675                 680                 685             


Gly Ser Tyr Thr Arg Gln Ser Val Ser Phe Pro Ala Ser Ser Ala Leu 
    690                 695                 700                 


Asp Gly Arg Met Val Arg Ser Ser Tyr Glu Pro 
705                 710                 715 


<210>  22
<211>  60
<212>  PRT
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  Myb domain of SGI1 polypeptide of SEQ ID NO: 8

<400>  22

Lys Lys Pro Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe Val 
1               5                   10                  15      


Asn Ala Val Asn Ser Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile 
            20                  25                  30          


Leu Asp Leu Met Asn Val Glu Gly Leu Thr Arg Glu Asn Val Ala Ser 
        35                  40                  45              


His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Val 
    50                  55                  60  


<210>  23
<211>  51
<212>  PRT
<213>  Coccomyxa subellipsoidea


<220>
<221>  misc_feature
<223>  Myb domain of SGI1 polypeptide of SEQ ID NO: 9

<400>  23

Lys Lys Ala Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe Val 
1               5                   10                  15      


Gln Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile 
            20                  25                  30          


Leu Asp Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala Ser 
        35                  40                  45              


His Leu Gln 
    50      


<210>  24
<211>  61
<212>  PRT
<213>  Ostreococcus lucimarinus


<220>
<221>  misc_feature
<223>  Myb domain of SGI1 polypeptide of SEQ ID NO: 10

<400>  24

Lys Lys Pro Arg Val Val Trp Ser Ala Glu Leu His Ala Gln Phe Val 
1               5                   10                  15      


Thr Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile 
            20                  25                  30          


Leu Asp Leu Met Gly Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser 
        35                  40                  45              


His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Gln 
    50                  55                  60      


<210>  25
<211>  65
<212>  PRT
<213>  Chlamydomonas reinhardtii


<220>
<221>  misc_feature
<223>  Myb domain of SGI1 polypeptide of SEQ ID NO: 11

<400>  25

Lys Lys Ala Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe Val 
1               5                   10                  15      


Asn Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Lys Ile 
            20                  25                  30          


Leu Glu Ile Met Gly Val Asp Gly Ser Ala Gly Arg Leu Ala Asp Thr 
        35                  40                  45              


Ser Gly Arg Asp Val Cys Gly Thr Val Tyr Arg Leu Tyr Leu Lys Arg 
    50                  55                  60                  


Val 
65  


<210>  26
<211>  61
<212>  PRT
<213>  Chromochloris zofingiensis


<220>
<221>  misc_feature
<223>  Myb domain of SGI1 polypeptide of SEQ ID NO: 9

<400>  26

Lys Lys Pro Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe Val 
1               5                   10                  15      


Gln Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Lys Ile 
            20                  25                  30          


Leu Glu Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala Ser 
        35                  40                  45              


His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Val Gln 
    50                  55                  60      


<210>  27
<211>  60
<212>  PRT
<213>  Volvox carteri


<220>
<221>  misc_feature
<223>  Myb domain of SGI1 polypeptide of SEQ ID NO: 10

<400>  27

Lys Lys Ala Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe Val 
1               5                   10                  15      


Asn Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Lys Ile 
            20                  25                  30          


Leu Glu Ile Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala Ser 
        35                  40                  45              


His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Val 
    50                  55                  60  


<210>  28
<211>  60
<212>  PRT
<213>  Tetraselmis sp.


<220>
<221>  misc_feature
<223>  Myb domain of SGI1 polypeptide of SEQ ID NO: 11

<400>  28

Lys Lys Gln Arg Met Asn Trp Ser Asp Glu Met His Gln Gln Phe Val 
1               5                   10                  15      


Asn Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile 
            20                  25                  30          


Leu Asp Leu Met Ser Val Glu Gly Leu Thr Arg Glu Asn Val Ala Ser 
        35                  40                  45              


His Leu Gln Lys Tyr Arg Ile Tyr Leu Lys Arg Met 
    50                  55                  60  


<210>  29
<211>  60
<212>  PRT
<213>  Oocystis sp.


<220>
<221>  misc_feature
<223>  Myb domain of SGI1 polypeptide of SEQ ID NO: 12

<400>  29

Lys Lys Pro Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe Val 
1               5                   10                  15      


Asn Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile 
            20                  25                  30          


Leu Asp Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala Ser 
        35                  40                  45              


His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Val 
    50                  55                  60  


<210>  30
<211>  61
<212>  PRT
<213>  Micromonas sp.


<220>
<221>  misc_feature
<223>  Myb domain of SGI1 polypeptide of SEQ ID NO: 18

<400>  30

Lys Lys Pro Arg Val Val Trp Ser Ala Glu Leu His Gln Gln Phe Val 
1               5                   10                  15      


Thr Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile 
            20                  25                  30          


Leu Asp Leu Met Gly Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser 
        35                  40                  45              


His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Gln 
    50                  55                  60      


<210>  31
<211>  126
<212>  PRT
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 6 or 7

<400>  31

Pro Ala Gly Leu Lys Val Leu Val Val Asp Asp Asp Leu Met Cys Leu 
1               5                   10                  15      


Lys Val Val Ser Ala Met Leu Lys Arg Cys Ser Tyr Gln Val Ala Thr 
            20                  25                  30          


Cys Ser Ser Gly Ser Glu Ala Leu Thr Leu Leu Arg Glu Arg Asn Glu 
        35                  40                  45              


Asp Gly Ser Ser Asp Gln Phe Asp Leu Val Leu Ser Asp Val Tyr Met 
    50                  55                  60                  


Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His Ile Gly Leu Glu Leu 
65                  70                  75                  80  


Glu Leu Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr Asn Val Val 
                85                  90                  95      


Leu Arg Gly Val Thr His Gly Ala Val Asp Phe Leu Ile Lys Pro Val 
            100                 105                 110         


Arg Ile Glu Glu Leu Arg Asn Val Trp Gln His Val Val Arg 
        115                 120                 125     


<210>  32
<211>  123
<212>  PRT
<213>  Coccomyxa subellipsoidea


<220>
<221>  misc_feature
<223>  sub-sequence of Response Regulator receiver domain of SGI1 
       polypeptide of SEQ ID NO: 9

<400>  32

Pro Ala Gly Leu Lys Val Leu Val Val Asp Asp Asp Pro Leu Cys Leu 
1               5                   10                  15      


Lys Val Val Glu His Met Leu Arg Arg Cys Asn Tyr Gln Val Thr Thr 
            20                  25                  30          


Cys Pro Asn Gly Lys Ala Ala Leu Glu Lys Leu Arg Asp Arg Ser Val 
        35                  40                  45              


His Phe Asp Leu Val Leu Ser Asp Val Tyr Met Pro Asp Met Asp Gly 
    50                  55                  60                  


Phe Lys Leu Leu Glu His Ile Gly Leu Glu Leu Asp Leu Pro Val Ile 
65                  70                  75                  80  


Met Met Ser Ser Asn Gly Glu Thr Asn Val Val Leu Arg Gly Val Thr 
                85                  90                  95      


His Gly Ala Val Asp Phe Leu Ile Lys Pro Val Arg Val Glu Glu Leu 
            100                 105                 110         


Arg Asn Val Trp Gln His Val Val Arg Arg Lys 
        115                 120             


<210>  33
<211>  122
<212>  PRT
<213>  Ostreococcus lucimarinus


<220>
<221>  misc_feature
<223>  sub-sequence of Response Regulator receiver domain of SGI1 
       polypeptide of SEQ ID NO: 10

<400>  33

Pro Ala Gly Leu Gly Val Leu Val Val Asp Asp Asp Leu Leu Cys Leu 
1               5                   10                  15      


Lys Val Val Glu Lys Met Leu Lys Ala Cys Lys Tyr Lys Val Thr Ala 
            20                  25                  30          


Cys Ser Thr Ala Lys Thr Ala Leu Glu Ile Leu Arg Thr Arg Lys Glu 
        35                  40                  45              


Glu Phe Asp Ile Val Leu Ser Asp Val His Met Pro Asp Met Asp Gly 
    50                  55                  60                  


Phe Lys Leu Leu Glu Ile Ile Gln Phe Glu Leu Ala Leu Pro Val Leu 
65                  70                  75                  80  


Met Met Ser Ala Asn Ser Asp Ser Ser Val Val Leu Arg Gly Ile Ile 
                85                  90                  95      


His Gly Ala Val Asp Tyr Leu Leu Lys Pro Val Arg Ile Glu Glu Leu 
            100                 105                 110         


Arg Asn Ile Trp Gln His Val Val Arg Arg 
        115                 120         


<210>  34
<211>  123
<212>  PRT
<213>  Chlamydomonas reinhardtii


<220>
<221>  misc_feature
<223>  sub-sequence of Response Regulator receiver domain of SGI1 
       polypeptide of SEQ ID NO: 11

<400>  34

Pro Ala Gly Leu Arg Leu Leu Val Val Asp Asp Asp Pro Leu Cys Leu 
1               5                   10                  15      


Lys Val Val Glu Gln Met Leu Arg Lys Cys Ser Tyr Glu Val Thr Val 
            20                  25                  30          


Cys Ser Asn Ala Thr Thr Ala Leu Asn Ile Leu Arg Asp Lys Asn Thr 
        35                  40                  45              


Glu Tyr Asp Leu Val Leu Ser Asp Val Tyr Met Pro Asp Met Asp Gly 
    50                  55                  60                  


Phe Arg Leu Leu Glu Leu Val Gly Leu Glu Met Asp Leu Pro Val Ile 
65                  70                  75                  80  


Met Met Ser Ser Asn Gly Asp Thr Ser Asn Val Leu Arg Gly Val Thr 
                85                  90                  95      


His Gly Ala Cys Asp Tyr Leu Ile Lys Pro Val Arg Leu Glu Glu Leu 
            100                 105                 110         


Arg Asn Leu Trp Gln His Val Val Arg Arg Arg 
        115                 120             


<210>  35
<211>  61
<212>  PRT
<213>  Chromochloris zofingiensis


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 12

<400>  35

Met Asp Gly Phe Lys Leu Leu Glu Thr Val Gly Leu Glu Leu Asp Leu 
1               5                   10                  15      


Pro Val Ile Met Met Ser Ser Asn Gly Glu His Thr Thr Val Met Arg 
            20                  25                  30          


Gly Val Thr His Gly Ala Cys Asp Phe Leu Ile Lys Pro Val Arg Ile 
        35                  40                  45              


Glu Glu Leu Arg Asn Ile Trp Gln His Val Ile Arg Arg 
    50                  55                  60      


<210>  36
<211>  123
<212>  PRT
<213>  Volvox carteri


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 13

<400>  36

Pro Ala Gly Leu Arg Leu Leu Val Val Asp Asp Asp Pro Leu Cys Leu 
1               5                   10                  15      


Lys Val Val Glu Gln Met Leu Arg Lys Cys Ser Tyr Asp Val Thr Thr 
            20                  25                  30          


Cys Thr Asn Ala Thr Met Ala Leu Asn Leu Leu Arg Asp Lys Ser Thr 
        35                  40                  45              


Glu Tyr Asp Leu Val Leu Ser Asp Val Tyr Met Pro Asp Met Asp Gly 
    50                  55                  60                  


Phe Lys Leu Leu Glu Val Val Gly Leu Glu Met Asp Leu Pro Val Ile 
65                  70                  75                  80  


Met Met Ser Ser Asn Gly Asp Thr Ser Asn Val Leu Arg Gly Val Thr 
                85                  90                  95      


His Gly Ala Cys Asp Tyr Leu Ile Lys Pro Val Arg Leu Glu Glu Leu 
            100                 105                 110         


Arg Asn Leu Trp Gln His Val Val Arg Arg Arg 
        115                 120             


<210>  37
<211>  121
<212>  PRT
<213>  Tetraselmis sp.


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NOs: 14, 15, and 16

<400>  37

Pro Ala Gly Leu Arg Val Leu Val Ala Asp Asn Asp Pro Ala Ser Leu 
1               5                   10                  15      


Gln Gln Val Glu Lys Met Leu Lys Lys Cys Ser Tyr Gln Val Thr Leu 
            20                  25                  30          


Cys Ser Ser Gly Lys Asn Ser Leu Glu Ile Leu Arg Lys Arg Arg Glu 
        35                  40                  45              


Glu Phe Asp Leu Val Leu Ala Asp Ala Asn Leu Pro Asp Ile Asp Gly 
    50                  55                  60                  


Phe Lys Leu Leu His Val Cys His Thr Glu Leu Ser Leu Pro Val Val 
65                  70                  75                  80  


Leu Met Ser Gly Thr Ser Asp Thr Gln Leu Val Met Arg Gly Val Met 
                85                  90                  95      


Asp Gly Ala Arg Asp Phe Leu Ile Lys Pro Leu Arg Val Glu Glu Leu 
            100                 105                 110         


Lys Val Leu Trp Gln His Leu Val Arg 
        115                 120     


<210>  38
<211>  123
<212>  PRT
<213>  Oocystis sp.


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 17

<400>  38

Pro Ala Gly Leu Lys Val Leu Val Val Asp Asp Asp Pro Leu Cys Leu 
1               5                   10                  15      


Lys Val Ile Asp Gln Met Leu Arg Arg Cys Asn Tyr Ala Ala Thr Thr 
            20                  25                  30          


Cys Gln Ser Ser Leu Glu Ala Leu Glu Leu Leu Arg Ser Ser Lys Glu 
        35                  40                  45              


Asn His Phe Asp Leu Val Leu Ser Asp Val Tyr Met Pro Asp Met Asp 
    50                  55                  60                  


Gly Phe Lys Leu Leu Glu Ile Ile Gly Leu Glu Met Gly Leu Pro Val 
65                  70                  75                  80  


Ile Met Met Ser Ser Asn Gly Glu Thr Gly Val Val Phe Arg Gly Val 
                85                  90                  95      


Thr His Gly Ala Val Asp Phe Leu Ile Lys Pro Val Arg Ile Glu Glu 
            100                 105                 110         


Leu Arg Asn Leu Trp Gln His Val Val Arg Lys 
        115                 120             


<210>  39
<211>  123
<212>  PRT
<213>  Micromonas sp.


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 18

<400>  39

Pro Ile Gly Leu Arg Val Leu Val Val Asp Asp Asp Pro Leu Cys Leu 
1               5                   10                  15      


Lys Ile Val Glu Lys Met Leu Lys Arg Cys Gln Tyr Glu Val Thr Thr 
            20                  25                  30          


Phe Ser Arg Gly Ala Glu Ala Leu Lys Thr Leu Arg Glu Arg Lys Asp 
        35                  40                  45              


Asp Phe Asp Ile Val Leu Ser Asp Val His Met Pro Asp Met Asp Gly 
    50                  55                  60                  


Phe Lys Leu Leu Glu His Ile Ala Leu Glu Leu Asp Ile Pro Val Met 
65                  70                  75                  80  


Met Met Ser Ala Asn Cys Ala Thr Asp Val Val Leu Arg Gly Ile Ile 
                85                  90                  95      


His Gly Ala Val Asp Tyr Leu Leu Lys Pro Val Arg Ile Glu Glu Leu 
            100                 105                 110         


Arg Asn Ile Trp Gln His Val Val Arg Arg Lys 
        115                 120             


<210>  40
<211>  45
<212>  PRT
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 8

<400>  40

Leu Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr Asn Val Val Leu 
1               5                   10                  15      


Arg Gly Val Thr His Gly Ala Val Asp Phe Leu Ile Lys Pro Val Arg 
            20                  25                  30          


Ile Glu Glu Leu Arg Asn Val Trp Gln His Val Val Arg 
        35                  40                  45  


<210>  41
<211>  47
<212>  PRT
<213>  Coccomyxa subellipsoidea


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 9

<400>  41

Leu Pro Val Ile Met Met Ser Ser Asn Gly Glu Thr Asn Val Val Leu 
1               5                   10                  15      


Arg Gly Val Thr His Gly Ala Val Asp Phe Leu Ile Lys Pro Val Arg 
            20                  25                  30          


Val Glu Glu Leu Arg Asn Val Trp Gln His Val Val Arg Arg Lys 
        35                  40                  45          


<210>  42
<211>  46
<212>  PRT
<213>  Ostreococcus lucimarinus


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 10

<400>  42

Leu Pro Val Leu Met Met Ser Ala Asn Ser Asp Ser Ser Val Val Leu 
1               5                   10                  15      


Arg Gly Ile Ile His Gly Ala Val Asp Tyr Leu Leu Lys Pro Val Arg 
            20                  25                  30          


Ile Glu Glu Leu Arg Asn Ile Trp Gln His Val Val Arg Arg 
        35                  40                  45      


<210>  43
<211>  47
<212>  PRT
<213>  Chlamydomonas reinhardtii


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 11

<400>  43

Leu Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr Ser Asn Val Leu 
1               5                   10                  15      


Arg Gly Val Thr His Gly Ala Cys Asp Tyr Leu Ile Lys Pro Val Arg 
            20                  25                  30          


Leu Glu Glu Leu Arg Asn Leu Trp Gln His Val Val Arg Arg Arg 
        35                  40                  45          


<210>  44
<211>  46
<212>  PRT
<213>  Chromochloris zofingiensis


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 12

<400>  44

Leu Pro Val Ile Met Met Ser Ser Asn Gly Glu His Thr Thr Val Met 
1               5                   10                  15      


Arg Gly Val Thr His Gly Ala Cys Asp Phe Leu Ile Lys Pro Val Arg 
            20                  25                  30          


Ile Glu Glu Leu Arg Asn Ile Trp Gln His Val Ile Arg Arg 
        35                  40                  45      


<210>  45
<211>  47
<212>  PRT
<213>  Volvox carteri


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 13

<400>  45

Leu Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr Ser Asn Val Leu 
1               5                   10                  15      


Arg Gly Val Thr His Gly Ala Cys Asp Tyr Leu Ile Lys Pro Val Arg 
            20                  25                  30          


Leu Glu Glu Leu Arg Asn Leu Trp Gln His Val Val Arg Arg Arg 
        35                  40                  45          


<210>  46
<211>  45
<212>  PRT
<213>  Tetraselmis sp.


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 14

<400>  46

Leu Pro Val Val Leu Met Ser Gly Thr Ser Asp Thr Gln Leu Val Met 
1               5                   10                  15      


Arg Gly Val Met Asp Gly Ala Arg Asp Phe Leu Ile Lys Pro Leu Arg 
            20                  25                  30          


Val Glu Glu Leu Lys Val Leu Trp Gln His Leu Val Arg 
        35                  40                  45  


<210>  47
<211>  46
<212>  PRT
<213>  Oocystis sp.


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 17

<400>  47

Leu Pro Val Ile Met Met Ser Ser Asn Gly Glu Thr Gly Val Val Phe 
1               5                   10                  15      


Arg Gly Val Thr His Gly Ala Val Asp Phe Leu Ile Lys Pro Val Arg 
            20                  25                  30          


Ile Glu Glu Leu Arg Asn Leu Trp Gln His Val Val Arg Lys 
        35                  40                  45      


<210>  48
<211>  47
<212>  PRT
<213>  Micromonas sp.


<220>
<221>  misc_feature
<223>  Response Regulator receiver domain of SGI1 polypeptide of SEQ ID 
       NO: 18

<400>  48

Ile Pro Val Met Met Met Ser Ala Asn Cys Ala Thr Asp Val Val Leu 
1               5                   10                  15      


Arg Gly Ile Ile His Gly Ala Val Asp Tyr Leu Leu Lys Pro Val Arg 
            20                  25                  30          


Ile Glu Glu Leu Arg Asn Ile Trp Gln His Val Val Arg Arg Lys 
        35                  40                  45          


<210>  49
<211>  9898
<212>  DNA
<213>  Parachlorella sp

<400>  49
atggggactt tttcccgcaa atccttctca aacttagcag cactcgctga cggtgacttt       60

gggcagggat ctcaaaacga tttgcgcggt ggtggccccc tgtctttgag ctcagctgcc      120

aaccgcacct ctcggaactc aatcgatagt gatggcaggc ggctccagcg gtgggtggac      180

caattttaat cgtctgaaga gtgtataata tgcatcgctt aatctatatg gcgtcttctt      240

gcgaaccagg gctacctgta agcactgcct agtatcgtcc ttaccaattt gttcgcaggc      300

ttatctttgt gagcaaccac ctgccgctcc gcgtttcgaa gggagccacc gactggaatt      360

tcgagtggga cgacgatgcg ttgatcgcgc aggccaagga gggcctgccg gaggacatgg      420

aggcactata tgttggttgc ctccctgtgg aggtggaccc tcaggaccaa gacgtgagca      480

actggggtgc accacagctt tgcaactata tatccctttg ccactcgctg cttgctgggt      540

gcctgagcaa cggcggaaag ctgacagtgc ttgggtttag agctgcacca gaggcgctgc      600

ctgaagtgac ggttctgttc actgcctccc aagctacgtg tgtgcaactt caatctgtac      660

catgccactg caggaggtca gcctgcagct acagaagcag cacaactgct tccctgtttt      720

tcttgggacg gagctcaaga ccaactatta cagaagtgag gcccatcact gcccctgcag      780

cactcttgcc gtctgccccc ccacttcatt ttgcaccata ctccctttgg gactcaccaa      840

ctgggtgggg cactgttgca ttgcttcctg gcgcatcgtt aggttgggat tgagctgacg      900

caaaccaagg ttggcaccgc ccctgtgcca aataatcacc gctgggcagc catccttttg      960

ctggccccgg caccagcacc ggcccctgct ccgcctgtgc tctgggcacg gggcacgaaa     1020

aaaaactgct gaatggatgg agtgatgtgc agggcttgct cgaatgctgg ccaccgcctc     1080

catttgcacc cacagtgtca ccagtcacac atgctgcacc caagcataag ggaagggact     1140

gtgagtggaa gaaagggagg caatgcaatg gaacagaaag gctggagtgc ggggcatact     1200

gtccgcaatc gcaccgcagc caggccttgc gtctgcaaac agagatgcgg tgcacagatg     1260

cagcacacgc ctgccacctg tgtgcgtgag catgcatgtg ctgtgtgtgt gccattgggc     1320

tggtgcagag ttctgcaagc agcagctgtg gcccatcctg cactacctca tccccctcaa     1380

ccccactagc ctaggcaggt ttgaccccgg actctgggcg tcatacgtga gggcgaacaa     1440

ggtaggcgag gtagagctga ggggtggagg agagggggtg gagagggtgg atcatggtga     1500

cgtgtgaagg gagggggagg gtgggtaggt gttgggtggg gtcaggaggg gatgaccatg     1560

tgattgagag cttggtgcac atcctctccc cccttgcaat ggcaggtgaa gtggagaaga     1620

gaaggcaggg gtgtaagatt gggcatagtg tggagtgcag gaaggccttg cgaaaaggaa     1680

ggggggatgc atgcttgctg tgccgtgaca ggcgtgctcc ggctggtgga tctggaaaca     1740

ttgtgggttg gaggggaatg ccaggaacat tcctgttccc tcttccccgc acccggcctt     1800

atggcaccta agcaccggtc cccgcccact accctaccca ctcacctcca gcttccccct     1860

ggcatatgct ctcaaggtgg ccatagtggc ctcctcgctg ccagccatca cgtatgggtt     1920

gggagggagg aaaaaggggg gggggaggtg gcggtgccac atctaccagg tgcgcgaccc     1980

tagaaaagtc cacccgcccc tcttccttca ggtgtttgcg gacaagatgg tggaggtgct     2040

ggggtcgctg gaggacgact ttgtgtgggt gcacgactac cacctgctgg tgctgccctc     2100

gctgctgcgg aaaagattcc acagaatcaa gtgagatccc ggggagcaag caagacccag     2160

cccatgcgtg gttgcatggg ggagtacggg tgcccatgca ggctagtagg ggtggggtgt     2220

gggaaggggt catttccggg gggagagcgg gggtgtgggt gaggggtctc ttcttcaacc     2280

caccacgttg acgtggcttg ctttggagtc ctgttttatc ctgcattcag ggtttcgtgc     2340

tacgtttgct cgcctcgcac ctttccgtgg aacggttgca tgcccttctc cttcagcggg     2400

gtgctggcac atgcagcccg ttagtgcaat gccattagga ggggggcagg gaggggggga     2460

gtggcatgga aggacgggaa aatgaggcaa aagaggtggg caggtgcaag ggtgtgtagc     2520

tgccaggata ggagggctgg acaccaggta ctaaccccgt gcctggtccg cgctgcaggt     2580

gcggaatctt cctccactcc cccttcccct cctccgaggt gttccgaacc ttcccccggc     2640

gggaagagat catacggtcc atgctgaatg cggatcttat aggtggggca tcacccacca     2700

ccaccaacct tccacttacc ccttcacacc ccatccaccc cttcaccctc gcacagcaat     2760

gttcctcacc taacttgtcc ctatctcata caccagacgt ggacacccgc accccctccc     2820

cccccatatc ccctcccccc cccccatact ccctcccccc cttcgtcctc gcttttaact     2880

cacagctgtg tgcatctatt catatctgaa ccttcctttc ggcaaggtgt tgtcatgggg     2940

ttaagccatg ggcacccatc gctgcagctg cccaccagct tctcgctgtc cccctgtcac     3000

cccgtcaaac acgcccctcc ccaccccccc cccgcaggct tccacacctt tgactacgcg     3060

cggcacttcc tgtcatgctg cgcccgcatg ctgggactgg agcacaagac cagccggggg     3120

gccatcatca tcgagtacta cggcagggac gtgggcatca aaatcatgcc cacaggtgag     3180

gggaagggga ggggctgtat tgtgtgctgc gttgtgcgtg cttgggtgtg tccgtgcatg     3240

cgtgtgcgtg ggtgtcattc aatgagtgtg gggggcaggg ggctcctctg tgtcacgtga     3300

ggggtcacag gccaaggtga gtcaagccaa ccagggttag tcatgcgtgc cagctggctg     3360

cagctgtgtg cgggtgcgga agggccaatc tggccatctg ctgcctgctg gggagggggg     3420

gttgcattgt ccccttcatg ctgcgcatgc tgtgctactt tctggaagcc ctgagagctg     3480

ctgttgactt accagctccc cccccctcct cccccctacc ccaggcgtga agccctcccg     3540

cttcctcagt gccttcagct ggaaggacac agagtggagg aggggggagc tggcggcgca     3600

ggtgagcggc gctggtttgc cgctgataag gggggggttg aggggggcga caataaaata     3660

tgatgttttg ctacaaaatc accctcgacc tcagctcgta gggtgggcaa catgtggggc     3720

gaacggggag gggggggggg ggggggcgag atcccacaga atccgtagga ttcacgaata     3780

tgtacttttg accttttcaa ccataccctt ttgccgccac ctcccatttc gaagagggcg     3840

ggggggtccc aatcattcag cgtccgggtg gctgccttga ggatatttga tggggcgcca     3900

ccctccccag cttcccacca gcttcttgct gtctgggcca tccccagtcg ccatctttct     3960

ccatcattca cccatgcccc acgttccgtt cccatcaccc acccatgctc tcctataacc     4020

taaaaatcgg ccaaagacta ggggagcgtc taggtgaaaa tttttgatta agtacatgaa     4080

aatttcttac gtgcacccca cccacccacc cacccctccc cctgtgcagt tcaagggcaa     4140

gacggtgctg ctgggcatgg acgacttgga cgtgttcaag gggatcgagc tgcggctggc     4200

cgccttcaag gacgtgctgg agtaccaccc ggagtggaag gggaggctgg tgctgctgca     4260

ggtcaccacc acaaggtgcg cggcaacggg ctgcgttgtg ggttttttcc ccttgttttc     4320

cgttcgcttt tccagttcat aggaaggaaa cagcagatag tacaagttat gccgtgtctg     4380

ttgtctctgt tttttttttt agaaggaaag agagctgatt cattttgagt tcaagttgaa     4440

aaacaacgaa aacatgaaag tgaagagtga ggttcagtca ttggtaggct tcaggcgtgt     4500

gtggtggcgg ggggctgcgg gtcggcacca ccgggttggc ggcagtgaag tgcgttttta     4560

actgtcgggg tgagggcggg catggaaggt cgtgcattgg aagggggcat ggggcggggt     4620

gggatgggta ctgctgctgc aggtgcctcc tgaacgggtg ggtggctgtc tgctgaggct     4680

gtggagggtg cagggcgtag ggcttattct agggtgctga agcggggtga tttcgatgcg     4740

gggctggggt tgctttggtg gagtgcgggt ttggtcctcc attcgccccc ccacccaccc     4800

accctggtgc cgcagggccc ccggccgcga cgtggacgac ctatttgact tcatcacgaa     4860

gcaggtggag gagatcaacg agcggtttgg ggcgccgggg taccagccgg tggtgtggtt     4920

taaccgcccc gtacccatgt acgagcgcat cgccatgctg tccatcgcag gtgaggcgct     4980

tctgcgcttc tgcagtgcaa tgctgctgcg ctgctccact gctgcacttg tgcgctgctg     5040

tgatgcggtg ctgctgtagt cgctgtactg caccccggac ggacaacccc cgccacgctg     5100

tccccgaggc ggtcccgcca ctgcctggcc cttgctgaca tggcggctaa ccctgccctg     5160

ccctgcagac gtggcggtgg tgacggccac gcgtgacggg atgaacctga tgccgtacga     5220

gtacgtggtg tgccgacagg tgcgtcccct cccccccccc ccccttgtcc ctgaacctcc     5280

tatgccccac gacccagacc ctgctctatg tccgtgtgcc ctgggaccgc gcttctcaat     5340

tcccacacct gcgctctgct acaccaacag gctccccctg ctccgctgcc tgcgccaccc     5400

cccccctaca gccccccacc actctctcct cccaccctct cctcggcagt tccggatgca     5460

gcacccccag gtgcagggcc cgctctctcc ccactcactg gcagggccag ggcggcggag     5520

aaggaaggct cgtgcccaac tacacggaaa cctgcacctt gacctcgcgt gctggcagct     5580

tcaggcctat ggggtccagg gtgtgttcgc ctccctcccc tccttacctc agcctgcagg     5640

ttgggtaccc ttttcgggct tccctcagaa caccccaccc ccccccctct ctcctgcagg     5700

gtcccccagg tctggcagag acggagggcc cccgccacag ccagctggtg gtgtcggagt     5760

ttgtggggtg ctcgccctcg ctgtcggggg ccatccgggt caacccctgg agcatcgaag     5820

cggtacgtac cgcggtgcag agcgcgcagc gccctgcatg gattgctccc cccccccaac     5880

cccctccctt ttacccctgc cgtcaccccc cccccctccc tcctcccatc ccatccaacc     5940

ctgccgccac cctgggggcg ggacgccgtg gcacctctgc ccatcacccc tctttgacac     6000

gtgtccgaac ctttgcggta tgcccagctg cccgagcccc ccctgcggca ggagggcgcc     6060

gttccaatcg gagcccccat actgctcgtt acactgtatc attcaagttg atgtattttt     6120

gcatacctag agaatgcatt gatgcttgca atttgtcgac tcgtacattt cacatcgtag     6180

ttgttcatgc gatgctaagt ttggtgcaag gtttgtgaag ctagttgaac tcccttccca     6240

caagatttca actgaagttt ttatactttg aagtcgccac acctccatct cacaagggtg     6300

taggagggcg aggttccgtt ctttcacaac aaccgcaacg atttctcgtc gtccagcaac     6360

agcagaagct gttgtgcgca cctgtgcttc aggcgtgggg agtgcatgtg tgcgcaggtc     6420

acacagtcct cccaccctcc ccgccagccc gcccaccaac acactcacgt cgtggtgatg     6480

catccctccc cctttcccct cgcctccttc cccttgcccc ccttcctgcg ctcaccccct     6540

cacctgcttt ccccctccct cctgcaggtc cgggacgcat tgtacggcgc catccgcatg     6600

cccatagagg aacgtcacat ccggcacgag aagcactgga agtacgtctc ctcccacacc     6660

gttcagttct gggccaaggt gagcttgatg cagccggcgc ggtgcgggac caatgtcgca     6720

caccacccag cagcgcacgc agcagggcgg agcagtgctc ttaattgagg tgttaggatt     6780

agggtttagg gacccaagac cccaagaccc ctcccttcca cacacacaca tgcacacgca     6840

cgcttacccc ttacccacac gctctctctc acacacactg ctgcctccct acaagaagcc     6900

cggcaccgcc cacctgtcac ataccggctt gaaatgccaa ttgcaatgcc cccccccccc     6960

ccccccccgc cacccccttc ttctcccccg cgcagtccta cgtcaccgac ctgcagcgct     7020

tcacggccaa ccacagcaag ctgcagtgct tcgaccttgg gtttgcgctg gacaccttcc     7080

gcatggtggc ccttacctcc aacttcagga agctgcagac cgacactgtg gtcaaggcgt     7140

accagaggtg tgttctttgg gggggggggg ggagagggag taggtagatg cgaaattcgg     7200

tcgttcatgg aggtgtacac aagcttgatt gtcattcatt tcaaaattag acagcaagag     7260

ctcctgtcgc tgatttggtc ccagtggaaa aggactcacg gttgttgtta gatcgatgat     7320

cggagcgagg gaggggaagc ctgctggggg aggggggggg atttggcatg ggtggtggtc     7380

tgctgtgtgt gcaggcggtg tggtgtgagg ggttggggca tgcagtgcgg tgcagaggtg     7440

cgggtgtgat gagaaaacgg ggggtgctga ccccccccct ccccccacct tcccacccgc     7500

tccccccccc cccccccgtt ctacctaggg ccaagaagcg tgtgctgctg ctggaccacg     7560

acggtaccct gatggccccc tcctccatct cctcccgccc caccgaccac gtgctggcca     7620

cgctgcgcca gctcacctcc gacccgcgca acaccgtgta catcatctcc ggccgcgccc     7680

gcaccgagct gcaggagtgg ttcaagtcgg tggtgagtgg cgccccccaa cccctccctt     7740

catcgtcctt actcacccct tgctcatcac ccccgagctg caggagtggt tcaagtcggt     7800

tattagtagc ctaccccccc cccccccccc cccaaatctt cttcatcgcc cttcctcacc     7860

ccttacacat caccccacgg ttgcacacgt gcctggcctg gcaccagggc tgcgccccct     7920

actccccccc aaggtgtgaa ccctttggcc aagccggcta aagctgaggt agtgtgcccc     7980

ccgccctccc cccccccact ctcaagcgcc ctgctgtctg ccgctgtgcc ccccctcagc     8040

ggcggtgccc ccccctctgt ctcacttttg cacacgcagc gcacgcgctc tctctccgcc     8100

cccccccctc cccccccccc cccgcccccc ttgtcaatcc actagttccc cccctgtgtg     8160

tgtcaccccc catgcagccc aacctggggc tggctgcgga gcacggcttc tacctgtgga     8220

cccccggctc cgccgactgg gcggtgcagg acccggacat ggggtttggg tggaaagaga     8280

ttgtggagcc catcctccag gtggggggtg cggggcagcg ggtcttgcat actcatgcac     8340

tactgactga tgatgaccac gcaaccatgg tagggtttgg atgggaggac accgtggagc     8400

ccatcctcca ggtggggggc tcaggcctct cctacctggg catcgcagca aaccgccgtc     8460

cggttctgca tgggcgctgc ctagtgcaca gaaacccagg gggagtcggg gttgctccaa     8520

ccccaaccct acgcacagcc cacccctggc ttcaccctgg cttcacccct cctacccgcc     8580

acacgcctcg ccccctcctt ttccggcccc ccctcccccc cgcagcaacg cgtgcgcccc     8640

ctccccacct cccgtttctc tctcggaagg ccacccccct cagaggttga tgtgcgaatg     8700

acgaacttgc cgtcaccccc tttccagcct tgttttccac cgcttcctgg ccctttccac     8760

gcccctttca ccaccctgac gatcctcttc accccccccc ccccccttct cgagcaggtg     8820

tacaccgagt ccacggacgg gtcccacatc gaagccaagg aaagcgcgct ggtgtggcac     8880

taccgcgacg ccgaccccga ctttgggtcg tggcaggcca aggagctgct ggaccacctg     8940

gaaggcatca tctccaacga gccagtggag gtcaggcgca cacaccccct gccacccccc     9000

accattcccc tcttcccctc cccctccccc tccccctccc ctcccttctc cccccccccc     9060

gccttccctt cagtcccttc caggagcacc ctgcccgcca ccccccccgc cacaccgctc     9120

ctttgtgacc ccggcgttgc gtgctgccgc gccttgcgcc cccaggccag acgccgctgc     9180

gacaaccccc ctcccccccc aaacccttct atcccccccc ccccccccgc ccgcccgtcc     9240

gcccccgttc caagcaagca tctacttgca caggcatgca tggcgccatg cgtccacccc     9300

ctgcagggtt gacagtgtgt ccctctcccc cttcccccct cccccgcccc tgctgcagat     9360

tgtggcgggc cagaacatcg tggaggtgaa gccgcaggga gtgagcaagg gcaaggtggt     9420

ggagcgcatc ctgcacgact gcctcaccgc cagccaggcg ccggagtttg tgctgtgcgt     9480

gggggacgac agatcaggtg cgggtgggaa gcggggtgtt gggagggggg aagggctggg     9540

ggcacgggcg ggcgggtggc tggattgcag ggcgagcaac aacctccatt ggtgcaaggt     9600

ctgtggtgtg acgtgctgtg ctgtgctgtg ccgtccctgg gtgagccgtg ctgtgctgtg     9660

ctgtgcagac gaggacatgt tcactgccat ggagaacatg cagttctccc cccacatgcc     9720

ggtggaggtg tttgcatgca cggtggggca gaagccgagc aaggcgccct tctacgtcaa     9780

cgacccggca gaggtgggtg gatgtggtag caggatgtgc ggggggaagg ggggggaagg     9840

ggcgtccgcc cctgaaaccc atgggattgg ggagggggga ggggggatgc acctctga       9898


<210>  50
<211>  900
<212>  DNA
<213>  Parachlorella sp


<220>
<221>  misc_feature
<223>  RNA binding domain, cds, encodes SEQ ID NO: 1

<400>  50
atgtcctcgg aggaaataag caaagatatg gaggaagcaa gcagcagcgg ggatggaggg       60

ggaaaattat ttctcggagg tttaagttgg gacacaacgg aagagaaatt aagggaacat      120

tttggcgtat atggcgatat tcacgaagct gtggtcatga aggataggac gactgggcga      180

ccgcgaggat tcggatttgt tactttcaaa gatgcggagg ttgcagacag ggttgttcaa      240

gatatccacg tgattgatgg cagacagata gacgcgaaga aatctgtacc gcaagagcaa      300

aagccgaagg ctcgaaaaat atttgttggc gggctcgcac ctgaaactac agaggcggat      360

ttcaaagagt attttgaacg atatggctcg ataagcgacg ttcaaataat gcaagatcat      420

atgacaggcc ggtcgagggg cttcgggttc attacttttg aggaggacgc agcagtagag      480

aaggtgtttg cccagggcgc catgcaagag cttggtggca agcgcattga aatcaagcat      540

gccactccca aggggtctag ctcaccaacc actcctgggg ggaggagttc tagtggaggc      600

agagggcagg gctatggcag agccatgcca atgcctttcg gtcaacttgc cggatccccc      660

tatgggtatg gcttgtttca cttccctcca ggcgtgatgc cccatgccac cccctacagc      720

atgggctacg ccaaccccta cctgatgatg cagcaaatca gcggctaccc cggcgccacg      780

ccgtatccat ttgccggcct gtatggcggg caggggcgtg gagcctcgca gcagctgcag      840

caggctcagc acacgtcaca gcagctgtct tcctcgggag cggggcccgt gactcgcctg      900


