                         Sequence Listing

<110>  Martek Biosciences Corporation
 
<120>  Production of Hemagglutinin-Neuraminidase Protein in Microalgae

<130>  2715.045PC01/JUK/SAS/BNC

<140>  To Be Assigned
<141>  Herewith

<150>  61/290,469
<151>  2009-12-28

<160>  11     

<170>  PatentIn version 3.3

<210>  1
<211>  1728
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Codon optimized HN 

<400>  1
ggatccatgg accgtgtcgt ctcccgcgtg gtcctcgaga acgaggagcg tgaggccaag     60

aacacctggc gccttgtctt tcgtgtcgcc gtcctctccc ttattgtcat gaccctcgcc    120

atctccgtcg ccgccctcgt ctacagcatg gaggctagca cccccaacga tctcgccgga    180

atctcgactg ttatctcccg cgccgaggac cgcgtcacct ccctcctcaa ctccaaccag    240

gatgtcgttg atcgcgtcta caagcaggtc gccctcgagt cccctctcgc cctccttaac    300

accgagagca tcattatgaa cgccattacc tccctcagct accagattaa cggcgccgcc    360

aactcgtccg gctgcggcgc ccccgtccat gaccctgatt acatcggcgg cgtcggcaag    420

gagctcatcg tcgacgacac tagcgatgcc acgtccttct accctagcgc ctaccaggag    480

cacctcaact tcatccctgc ccccactacc ggctccggct gcacccgcat tcccagcttc    540

gacatgtccg ccactcacta ctgctacacc cataacgtca tcctttcggg ttgccgcgac    600

cactcccaca gccaccagta cctcgccctc ggagttcttc gtacgtccgc caccggccgc    660

gtcttttttt ccaccctccg cagcatcaac ctcgacgata cccagaaccg caagagctgc    720

tcggtctccg ccaccccgct cggctgcgac atgctctgct ccaaggtcac cgagacggag    780

gaggaggatt acaagtccgt tacccccact tcgatggtcc acggccgcct tggcttcgac    840

ggccagtacc acgagaagga cctcgacgtc accgttctct ttaaggactg ggttgccaac    900

taccccggcg tcggcggcgg ctccctcatc gatgaccgcg tctggtttcc tgtctacggt    960

ggtctcaagc ctaacagccc ctccgatacc gcccaggagg gtaagtacgt gatctacaag   1020

cgctacaaca acacctgccc tgacgagcag gattaccagg tccgcatggc caagtcctcg   1080

tacaagcccg gtcgtttcgg cggcaagcgc gtccagcagg ccattctctc gatcaaggtc   1140

tcgaccagcc tcggagagga ccccgtgctc accgttcccc ctaacaccgt cacccttatg   1200

ggcgccgagg gccgcatcct caccgtcggt acctcccact tcctctacca gcgcggctcg   1260

agctactttt cccctgccct tctttacccc atgactgttc gcaacaagac tgctaccctc   1320

cacagcccct acacctttaa cgccttcacg cgccccggaa gcgtcccctg ccaggcgagc   1380

gcccgctgcc ctaactcctg cattaccggc gtctacaccg acccttaccc tgtcgtcttt   1440

caccgcaacc atacccttcg cggcgtcttc ggtactatgc ttgataacga gcaggcccgc   1500

ctcaaccccg tctccgccat tttcgactac acttcccgct cccgtatcac ccgcgtctcc   1560

tccacctcca ccaaggccgc ctacaccacc tccacctgct ttaaggttgt caagactaac   1620

aaggtctact gcctctccat cgccgagatt agcaacaccc tcttcggaga gttccgcatt   1680

gtccccctgc tcgtcgagat cctcaaggac gatcgcgttt aacatatg                1728


<210>  2
<211>  571
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Codon optimized HN

<400>  2

Met Asp Arg Val Val Ser Arg Val Val Leu Glu Asn Glu Glu Arg Glu 
1               5                   10                  15      


Ala Lys Asn Thr Trp Arg Leu Val Phe Arg Val Ala Val Leu Ser Leu 
            20                  25                  30          


Ile Val Met Thr Leu Ala Ile Ser Val Ala Ala Leu Val Tyr Ser Met 
        35                  40                  45              


Glu Ala Ser Thr Pro Asn Asp Leu Ala Gly Ile Ser Thr Val Ile Ser 
    50                  55                  60                  


Arg Ala Glu Asp Arg Val Thr Ser Leu Leu Asn Ser Asn Gln Asp Val 
65                  70                  75                  80  


Val Asp Arg Val Tyr Lys Gln Val Ala Leu Glu Ser Pro Leu Ala Leu 
                85                  90                  95      


Leu Asn Thr Glu Ser Ile Ile Met Asn Ala Ile Thr Ser Leu Ser Tyr 
            100                 105                 110         


Gln Ile Asn Gly Ala Ala Asn Ser Ser Gly Cys Gly Ala Pro Val His 
        115                 120                 125             


Asp Pro Asp Tyr Ile Gly Gly Val Gly Lys Glu Leu Ile Val Asp Asp 
    130                 135                 140                 


Thr Ser Asp Ala Thr Ser Phe Tyr Pro Ser Ala Tyr Gln Glu His Leu 
145                 150                 155                 160 


Asn Phe Ile Pro Ala Pro Thr Thr Gly Ser Gly Cys Thr Arg Ile Pro 
                165                 170                 175     


Ser Phe Asp Met Ser Ala Thr His Tyr Cys Tyr Thr His Asn Val Ile 
            180                 185                 190         


Leu Ser Gly Cys Arg Asp His Ser His Ser His Gln Tyr Leu Ala Leu 
        195                 200                 205             


Gly Val Leu Arg Thr Ser Ala Thr Gly Arg Val Phe Phe Ser Thr Leu 
    210                 215                 220                 


Arg Ser Ile Asn Leu Asp Asp Thr Gln Asn Arg Lys Ser Cys Ser Val 
225                 230                 235                 240 


Ser Ala Thr Pro Leu Gly Cys Asp Met Leu Cys Ser Lys Val Thr Glu 
                245                 250                 255     


Thr Glu Glu Glu Asp Tyr Lys Ser Val Thr Pro Thr Ser Met Val His 
            260                 265                 270         


Gly Arg Leu Gly Phe Asp Gly Gln Tyr His Glu Lys Asp Leu Asp Val 
        275                 280                 285             


Thr Val Leu Phe Lys Asp Trp Val Ala Asn Tyr Pro Gly Val Gly Gly 
    290                 295                 300                 


Gly Ser Leu Ile Asp Asp Arg Val Trp Phe Pro Val Tyr Gly Gly Leu 
305                 310                 315                 320 


Lys Pro Asn Ser Pro Ser Asp Thr Ala Gln Glu Gly Lys Tyr Val Ile 
                325                 330                 335     


Tyr Lys Arg Tyr Asn Asn Thr Cys Pro Asp Glu Gln Asp Tyr Gln Val 
            340                 345                 350         


Arg Met Ala Lys Ser Ser Tyr Lys Pro Gly Arg Phe Gly Gly Lys Arg 
        355                 360                 365             


Val Gln Gln Ala Ile Leu Ser Ile Lys Val Ser Thr Ser Leu Gly Glu 
    370                 375                 380                 


Asp Pro Val Leu Thr Val Pro Pro Asn Thr Val Thr Leu Met Gly Ala 
385                 390                 395                 400 


Glu Gly Arg Ile Leu Thr Val Gly Thr Ser His Phe Leu Tyr Gln Arg 
                405                 410                 415     


Gly Ser Ser Tyr Phe Ser Pro Ala Leu Leu Tyr Pro Met Thr Val Arg 
            420                 425                 430         


Asn Lys Thr Ala Thr Leu His Ser Pro Tyr Thr Phe Asn Ala Phe Thr 
        435                 440                 445             


Arg Pro Gly Ser Val Pro Cys Gln Ala Ser Ala Arg Cys Pro Asn Ser 
    450                 455                 460                 


Cys Ile Thr Gly Val Tyr Thr Asp Pro Tyr Pro Val Val Phe His Arg 
465                 470                 475                 480 


Asn His Thr Leu Arg Gly Val Phe Gly Thr Met Leu Asp Asn Glu Gln 
                485                 490                 495     


Ala Arg Leu Asn Pro Val Ser Ala Ile Phe Asp Tyr Thr Ser Arg Ser 
            500                 505                 510         


Arg Ile Thr Arg Val Ser Ser Thr Ser Thr Lys Ala Ala Tyr Thr Thr 
        515                 520                 525             


Ser Thr Cys Phe Lys Val Val Lys Thr Asn Lys Val Tyr Cys Leu Ser 
    530                 535                 540                 


Ile Ala Glu Ile Ser Asn Thr Leu Phe Gly Glu Phe Arg Ile Val Pro 
545                 550                 555                 560 


Leu Leu Val Glu Ile Leu Lys Asp Asp Arg Val 
                565                 570     


<210>  3
<211>  51
<212>  PRT
<213>  Schizochytrium sp. ATCC 20888

<220>
<223>  GlcNac-transferase-I-like protein 

<400>  3

Met Arg Gly Pro Gly Met Val Gly Leu Ser Arg Val Asp Arg Glu His 
1               5                   10                  15      


Leu Arg Arg Arg Gln Gln Gln Ala Ala Ser Glu Trp Arg Arg Trp Gly 
            20                  25                  30          


Phe Phe Val Ala Thr Ala Val Val Leu Leu Val Phe Leu Thr Val Tyr 
        35                  40                  45              


Pro Asn Val 
    50      


<210>  4
<211>  153
<212>  DNA
<213>  Schizochytrium sp. ATCC 20888

<220>
<223>  signal anchor sequence

<400>  4

atgcgcggcc cgggcatggt cggcctcagc cgcgtggacc gcgagcacct gcggcggcgg     60

cagcagcagg cggcgagcga atggcggcgc tgggggttct tcgtcgcgac ggccgtcgtc    120

ctgctcgtct ttctcaccgt atacccgaac gta                                 153


<210>  5
<211>  66
<212>  PRT
<213>  Schizochytrium sp. ATCC 20888

<220>
<223>  beta-1,2- xylosyltransferase-like protein

<400>  5

Met Arg Thr Arg Gly Ala Ala Tyr Val Arg Pro Gly Gln His Glu Ala 
1               5                   10                  15      


Lys Ala Leu Ser Ser Arg Ser Ser Asp Glu Gly Tyr Thr Thr Val Asn 
            20                  25                  30          


Val Val Arg Thr Lys Arg Lys Arg Thr Thr Val Ala Ala Leu Val Ala 
        35                  40                  45              


Ala Ala Leu Leu Val Thr Gly Phe Ile Val Val Val Val Phe Val Val 
    50                  55                  60                  


Val Val 
65      


<210>  6
<211>  198
<212>  DNA
<213>  Schizochytrium sp. ATCC 20888

<220>
<223>  signal anchor sequence

<400>  6

atgcgcacgc ggggcgcggc gtacgtgcgg ccgggacagc acgaggcgaa ggcgctctcg     60

tcaaggagca gcgacgaggg atatacgacg gtcaacgttg tcaggaccaa gcgaaagagg    120

accactgtag ccgcgcttgt agccgcggcg ctgctggtga cgggctttat cgtcgtcgtc    180

gtcttcgtcg tcgttgtt                                                  198


<210>  7
<211>  64
<212>  PRT
<213>  Schizochytrium sp. ATCC 20888

<220>
<223>  beta-1,4-xylosidase-like protein

<400>  7

Met Glu Ala Leu Arg Glu Pro Leu Ala Ala Pro Pro Thr Ser Ala Arg 
1               5                   10                  15      


Ser Ser Val Pro Ala Pro Leu Ala Lys Glu Glu Gly Glu Glu Glu Asp 
            20                  25                  30          


Gly Glu Lys Gly Thr Phe Gly Ala Gly Val Leu Gly Val Val Ala Val 
        35                  40                  45              


Leu Val Ile Val Val Phe Ala Ile Val Ala Gly Gly Gly Gly Asp Ile 
    50                  55                  60                  


<210>  8
<211>  192
<212>  DNA
<213>  Schizochytrium sp. ATCC 20888

<220>
<223>  signal anchor sequence

<400>  8

atggaggccc tgcgcgagcc cttggctgcg ccgccaacgt cggcgcgatc gtcggtgcca     60

gcgccgctcg cgaaggagga gggggaggag gaggacgggg aaaaagggac gtttggggcg    120

ggggtcctcg gtgtcgtggc ggtgctcgtc atcgtggtgt ttgcgatcgt ggcgggaggc    180

ggaggcgata tt                                                        192


<210>  9
<211>  73
<212>  PRT
<213>  Schizochytrium sp. ATCC 20888

<220>
<223>  galactosyltransferase-like protein

<400>  9

Met Leu Ser Val Ala Gln Val Ala Gly Ser Ala His Ser Arg Pro Arg 
1               5                   10                  15      


Arg Gly Gly Glu Arg Met Gln Asp Val Leu Ala Leu Glu Glu Ser Ser 
            20                  25                  30          


Arg Asp Arg Lys Arg Ala Thr Ala Arg Pro Gly Leu Tyr Arg Ala Leu 
        35                  40                  45              


Ala Ile Leu Gly Leu Pro Leu Ile Val Phe Ile Val Trp Gln Met Thr 
    50                  55                  60                  


Ser Ser Leu Thr Thr Ala Pro Ser Ala 
65                  70              


<210>  10
<211>  219
<212>  DNA
<213>  Schizochytrium sp. ATCC 20888


<220>
<223>  signal anchor sequence

<400>  10

atgttgagcg tagcacaagt cgcggggtcg gcccactcgc ggccgagacg aggtggtgag     60

cggatgcaag acgtgctggc cctggaggaa agcagcagag atcgaaaacg agcaacagca    120

aggcccgggc tatatcgcgc acttgcgatt ctggggctgc cgctcatcgt attcatcgta    180

tggcaaatga ctagctccct cacgactgcc ccgagcgcc                           219


<210>  11
<211>  572
<212>  PRT
<213>  Human parainfluenza 3 Virus

<400>  11

Met Glu Tyr Trp Lys His Thr Asn His Gly Lys Asp Ala Gly Asn Glu 
1               5                   10                  15      


Leu Glu Thr Ser Met Ala Thr His Gly Asn Lys Ile Thr Asn Lys Ile 
            20                  25                  30          


Thr Tyr Ile Leu Trp Thr Ile Ile Leu Val Leu Leu Ser Ile Val Phe 
        35                  40                  45              


Ile Ile Val Leu Ile Asn Ser Ile Lys Ser Glu Lys Ala His Glu Ser 
    50                  55                  60                  


Leu Leu Gln Asp Val Asn Asn Glu Phe Met Glu Val Thr Glu Lys Ile 
65                  70                  75                  80  


Gln Met Ala Ser Asp Asn Ile Asn Asp Leu Ile Gln Ser Gly Val Asn 
                85                  90                  95      


Thr Arg Leu Leu Thr Ile Gln Ser His Val Gln Asn Tyr Ile Pro Ile 
            100                 105                 110         


Ser Leu Thr Gln Gln Met Ser Asp Leu Arg Lys Phe Ile Ser Glu Ile 
        115                 120                 125             


Thr Ile Arg Asn Asp Asn Gln Glu Val Pro Pro Gln Arg Ile Thr His 
    130                 135                 140                 


Asp Val Gly Ile Lys Pro Leu Asn Pro Asp Asp Phe Trp Arg Cys Thr 
145                 150                 155                 160 


Ser Gly Leu Pro Ser Leu Met Lys Thr Pro Lys Ile Arg Leu Met Pro 
                165                 170                 175     


Gly Pro Gly Leu Leu Ala Met Pro Thr Thr Val Asp Gly Cys Val Arg 
            180                 185                 190         


Thr Pro Ser Leu Val Ile Asn Asp Leu Ile Tyr Ala Tyr Thr Ser Asn 
        195                 200                 205             


Leu Ile Thr Arg Gly Cys Gln Asp Ile Gly Lys Ser Tyr Gln Val Leu 
    210                 215                 220                 


Gln Ile Gly Ile Ile Thr Val Asn Ser Asp Leu Val Pro Asp Leu Asn 
225                 230                 235                 240 


Pro Arg Ile Ser His Thr Phe Asn Ile Asn Asp Asn Arg Lys Ser Cys 
                245                 250                 255     


Ser Leu Ala Leu Leu Asn Thr Asp Val Tyr Gln Leu Cys Ser Thr Pro 
            260                 265                 270         


Lys Val Asp Glu Arg Ser Asp Tyr Ala Ser Ser Gly Ile Glu Asp Ile 
        275                 280                 285             


Val Leu Asp Ile Val Asn His Asp Gly Ser Ile Ser Thr Thr Arg Phe 
    290                 295                 300                 


Lys Asn Asn Asn Ile Ser Phe Asp Gln Pro Tyr Ala Ala Leu Tyr Pro 
305                 310                 315                 320 


Ser Val Gly Pro Gly Ile Tyr Tyr Lys Gly Lys Ile Ile Phe Leu Gly 
                325                 330                 335     


Tyr Gly Gly Leu Glu His Pro Ile Asn Glu Asn Ala Ile Cys Asn Thr 
            340                 345                 350         


Thr Gly Cys Pro Gly Lys Thr Gln Arg Asp Cys Asn Gln Ala Ser His 
        355                 360                 365             


Ser Pro Trp Phe Ser Asp Arg Arg Met Val Asn Ser Ile Ile Val Val 
    370                 375                 380                 


Asp Lys Gly Leu Asn Ser Ile Pro Lys Leu Lys Val Trp Thr Ile Ser 
385                 390                 395                 400 


Met Arg Gln Asn Tyr Trp Gly Ser Glu Gly Arg Leu Leu Leu Leu Gly 
                405                 410                 415     


Asn Lys Ile Tyr Ile Tyr Thr Arg Ser Thr Ser Trp His Ser Lys Leu 
            420                 425                 430         


Gln Leu Gly Ile Ile Asp Ile Thr Asp Tyr Ser Asp Ile Arg Ile Lys 
        435                 440                 445             


Trp Thr Trp His Asn Val Leu Ser Arg Pro Gly Asn Asn Glu Cys Pro 
    450                 455                 460                 


Trp Gly His Ser Cys Pro Asp Gly Cys Ile Thr Gly Val Tyr Thr Asp 
465                 470                 475                 480 


Ala Tyr Pro Leu Asn Pro Thr Gly Ser Ile Val Ser Ser Val Ile Leu 
                485                 490                 495     


Asp Ser Gln Lys Ser Arg Val Asn Pro Val Ile Thr Tyr Ser Thr Ser 
            500                 505                 510         


Thr Glu Arg Val Asn Glu Leu Ala Ile Arg Asn Lys Thr Leu Ser Ala 
        515                 520                 525             


Gly Tyr Thr Thr Thr Ser Cys Ile Thr His Tyr Asn Lys Gly Tyr Cys 
    530                 535                 540                 


Phe His Ile Val Glu Ile Asn His Lys Ser Leu Asp Thr Phe Gln Pro 
545                 550                 555                 560 


Met Leu Phe Lys Thr Glu Ile Pro Lys Ser Cys Ser 
                565                 570         



