﻿               SEQUENCE LISTING

<110> VIRALGA SAS
      LLUESMA Monica
      MARTINEZ Manuel

<120> CONTROL OF GREEN ALGAE BLOOMS

<130> IBIO-1580/PCT 

<160> 10

<170> BiSSAP 1.3.6

<210> 1
<211> 8518
<212> RNA
<213> Unknown


<220> 
<223> Virus-like particle’s RNA genome

<400> 1
uuuuuuuuuu uuuuuuuuuu uuuuuggaau uacgagcuac gagagcaucg caugaaaaug       60

uucgaacuag agagggaaau uuccacccag cgucaccgcc ggguuuaaaa ccucuaaagu      120

ucuaauaacu uuaugcgaug acacagagga gaaccucucg ucaaucuauu acacuguaaa      180

auagaccaca uggcucgacc auguguggga aagcccacac augaucgugu gaacgcuuuc      240

cuuagagagu gacaauauuu acgucacuug guaugcgagc aggagccccu augaaaaaga      300

auagggaaaa aucuucugcu gcugcagugu aaaauuguaa ccaaugauau ugauucgcag      360

uuucgcgaua aucaaaggau acaguuuggc uaccuugauu caauccugua uaaggaggaa      420

ccucgguuug uccugguuca uuacauucag cauaucuuaa ugcuguuuga aaagggaguu      480

cauacuccaa accagguuga gaagccacgc caguuaaagc uguuccagcu gcacugucuu      540

ggcaauuguc cgcacgccau uuuucgcgga caacucuuga caaacuguau gaaccguuau      600

cguuaauaau auuaacuugc gauucguuug acaaguacuc gggauuucua ccuaccgaua      660

aagguuguaa uugguuauuu gugccacaau auagcacuuu ccaacgacau ccuccucgaa      720

agccaacaua ugccugcaug caauaacgca ucauggucau agguaacaaa uuauaauuua      780

uugggccugc gauagggguc auugcagaag caauacugga uccauaagau ggaccaggac      840

ccugagggua gauaaaaguu cgguacauaa cuguguacca auuguuuaca uuaagagauu      900

cuaaaggcuc uaaaguccua aaguaauuau aucucuuaag uaaagaucug aaggaaacaa      960

cagccucacc guaauauacg ugagauuguu ccuuacagaa uuccguguau ucaccauuaa     1020

gaacauaagu aguaucuugu ucuggcaaau uuucguccau ggucauaccu ucuguagcca     1080

uuugugcuaa gguaggugga ccaacaucag ucguagaguu ugcauaagca acaucuugac     1140

cuaauguucc acguggauuu cucacuucgu aagaaucacc accggaaaug aagacauuca     1200

cggaaacgcu gcuauccgua auaggagcag caagugcauu aaguacauau acauuaagac     1260

gaccauugca auuagcaaca ucaucagcau cuguaccuau agcaccagua gcacccuccc     1320

aagcagaaag uuuuugagcu cuaaagauau caaucuuucu auaagcuucu uucugggucc     1380

aguugauuuc aaaaguaaca ucgcguucuu cugaaauauc aacaauauga gaguaacggu     1440

cauuaguauc ugucacagua cccgccguug auaacguggg uucauaaaca aacaauaauc     1500

guccucuaug auauugugaa gccacaaucu gaaaucuaua ucuuagacuu ccagaccagu     1560

uauugaaagg auaagaaaca aaacucaaag gaguuugaca ucuuaccacu guaucauuag     1620

aaccaguuug auagauuggu gcuaucauug gauguacuug aauagaauac aagagcccag     1680

uguucugggc uacaauugua cuccaauuga aguaaucaau gaaagcuuca cgcuugacca     1740

aauauccaaa agacauuuga ucuucuucac cuaacccaac gguggugggg ucaaugguga     1800

gcucuugcuu gggaucaaga guuaauuuga agauaggauc ugcgccugau guauuggcua     1860

aauugccaau aggcugcgau uuauaaaacg caguaucugu uagaacugca ggucgugaaa     1920

auccaaaaag acgcgcaaug cugcuaacag caccacucgc uaucuggguu gauuuugcaa     1980

auuuaccaau auaugguauu ucuguaaaau agccagcaaa gucggccaau aaagcugcag     2040

gagcggacac uacaccauca gguuuguguu caucuucucc agaaguauug guaaacguug     2100

guuuagcuuu cuucuuuggu uuucucuucu uggucuucuu acuguuuuga aucuuaucaa     2160

cagccaucug agcuguugca ggagcagcag cuguaagacc agcgaauucu acaucuucca     2220

uccacgcaaa aauugaaauu ucaacuucau caguggcacc guuggaaugu uguaguuggg     2280

uuaauuccca aauuuccaau acacccauac gaucaauagu uucuugauca guuaaaucaa     2340

uccaauuggu ggccgaaaag aaaggccacg agauuuguug ugguugauuu guugauggau     2400

caauaaauac augugggcgu ugcgaguaua agcaagccau guuauucauu gucuuauuac     2460

uaccagcugc auuuucaucc guauaggaua aacuaacacc uggaguuaca ggaccaucag     2520

uaaguguauu guugucgaau uuugaagguc uaagaccaac aaacauucga ccauaaugaa     2580

aaggugaacc auuaaccaaa aucuugaguu uaagaguucc augaaguaau uugaaaguuu     2640

cuauuuuguu agcaacuuua gaaucacuaa ggaauaaauc ccaaggauua auagcauuca     2700

cauauugugg agacucauuu acuucccaaa ucuuagaaaa aauucuaaua ggacgggaca     2760

uaaacguagu caaaucaaca gcuugugaau uagccuuuuc guaagucgga ucugacaaag     2820

guuuaccgac caaaaguuga uauuguucaa cuuggucaga aaaauuaaca uuugucucga     2880

cuugaagacc agaaucuuca uguucuacuu uugcuucaac ugcacuuugu acaguaauau     2940

cugaaucauu aguaguagau ucugguuguu ugguuucuuc auguagagua ggaagagaau     3000

aaccuaaggc ucucucauau ucuaucauau caugaaguuu gcaaguuuga cacuugcacc     3060

ugacaugguc caccuuaggu guacaacaug ugcuaggcau auggguuggc auuuuaaaau     3120

ugaaguucau aaugcuuuaa auuuaaggga cguacccuaa agaggcuauc ugcgcgugcc     3180

ugugcgugcu auaagguugc aaaccauaaa uuaucgagcu cccaauuaag ggcgacguaa     3240

cuauuuuuug ugaaggaaau aauuguaaaa ccuauauaaa auuacaacuc uccaaaccuu     3300

ucaauauaaa guucaucaaa aguuuuaaga gcacaauugg guaaauaaca ucucaaaucg     3360

aauucaucua auaauucguu gagaaaugua uuuucccgua caaauuucuu cuuaccauau     3420

uggaaauauu ccguauuggc ggaagaaaua auuugagcac acuguucuuu aaaaguaaua     3480

gucuuagauc uagugcaaac acuaagacuu uuaagaauag auccuucauc uaaaggagcu     3540

ugaauuacac cuugaacaua aacagaugga guaaauuuuc ucuuaagaaa aucaacaucu     3600

uuaacguuaa uaaaagguac agauucagac ucuuuaucag ccauuguaua aguaacuccu     3660

cuuuuagaaa gagcaguacu aaugguggua ugauuuaacc aaggaauacu agaagaagca     3720

caauuaucau cgccauaagu uaaaauggca agugaaucuu gaaaucuagc auaaucuauu     3780

ucuugaacac cucuuucuuc uucaauaucc auggcagcua gcaucauaua cauaauguug     3840

cacaugccau uaauuggagu aguaagagga ugaccacuag gguuacuacc aaacauaccu     3900

accacgguuc caaaaacauu ggauagugga uagcaaauau caguagcuag accgcgcaua     3960

auaguuaagu cuucaucaga ccaaccauug ucacggcaaa uucuaaugag aacauuaaaa     4020

cuagcaguca uaagcucagg aggcauauuc uuaucgaaug cuuuauaauc gcccgcaauc     4080

auauuauucu uuccauguug gguaauauau ucauaaauug caguccaauc aucuccauga     4140

gcauuggcac cgauggccau accgaauuca guucucuuuu uaccagaaac uaauggaaua     4200

auccaaaggu aguacaugcg gaccauaauc auaaaaugaa gugguccacu guuaaaaauu     4260

cuacauuucu guuucaagag uuuugacucg gaaacgggcu caucuuuaaa auugaagucc     4320

caaacaauau cgccgcguuc accucucaag uauuuuucuu ugaggcgauu gaauucuagc     4380

auaauauuau cauuaaguuc auauuuuacg guguguccug guucaggauc accuaaaugu     4440

agauauuugg auuuagcucc uuuaugagca aaaccaccug auguagaugc aggaauucug     4500

ucaauauaug acgcuccguc auauccauug acaccuacgu cgauaucgua gggcucacag     4560

ggaauaaaga auuuauguuc uucgaucuuc cuugugaacc aagcauauaa cgcgucuucu     4620

gcgcguugaa uauucuuggc uggaaaacua gguuuuacaa acauaggggc aacauuuugu     4680

uuacacgcca aagaggauuu aauaccuuuu ggugaaaaau guguaaaauc uaaucuacca     4740

uaaugcugua aaauacguuc ugccauaaua guaugacaaa ccauaguuuu cauuuuacgu     4800

cuaugaguau uaauagaacc aaucaucauc auugaaccuc caucaguuag ucugauagga     4860

cauuucuugu cuuggguuug aguaauugau agaucuucua cugucucaua guguucguuu     4920

aaaucaacgc cauuauauga uaauggagua aauuuggaau cauccaaauc uggguugauu     4980

aaacaggcug caaagacuuu auaucuacca ccaaagguag cuuucuuugc accaacaugg     5040

aaaccagcga uaaaacaccc auuagggguc uuaacgaugu auggugcucc acauucgccu     5100

auaauaguug gcacuuuggu guaugccaua uaaccauuau aagcauauga aaaauauuua     5160

ucauuauaag agaugggacu aguuugcaua gcuucaaaau cuuuauaagc cauaauaccg     5220

ucucuuugac gaauaaccau uguaccagau guuuucccuu uaaucauucc agguaauagg     5280

aacuuacgca caucgcgaaa aguuccuaaa gcugaauguc uaaagaauau aacaucacuu     5340

gguccauagc guucaucaaa uucaaaacau gauucaucaa cuaagaccuu aaaucugcua     5400

gguccuccuu guuugguaau aucgugucua augauaucca uucuaagagg aaauacaucu     5460

uuaagagaau caaagaaaug uuuaggcaaa gccauaacau uuccguacag uccgagacca     5520

cauauccuau gauauugucg cucggcgaga uuuucugcua ucagaugaca gcaguuuagc     5580

ucaacacuau ugacuaacug gucaaaaguu acuguacuag gagguccacu caaucgugua     5640

auauuuucau auugagcuga ccaauaguug gagucagauu gauuuucauc aaccucuggu     5700

uuugauugug guugaauagc ugaguacaaa uguacuacag uauacaagca cccuaaaagu     5760

gcaacaguac caccaaccga auacuuguua uaauauuuag auaagucaug ucuaauagac     5820

ugaacuaaaa uaccaugccc auaggcacgu cuacaagcgu ucguagccca aucuugccaa     5880

gugucacaaa gacaccuagu agcaagaaac auaggcaaau agguaaacca ccaaggcacu     5940

ccaaaccucg aagguaauuu ucuuauacaa auaacgaccu uuuuuguaaa auaauuaggu     6000

ucauuauuuc uauaagcuuu ugcgcauacu uuaugaagua gccaauaaau ggcuaacuga     6060

aaaauuguac acaauacaau gagugagcug auauuuucaa ccacagucau aguacaauua     6120

uaagcuuuca uaauguuguc aacacucauu ugaggacgag ugcaaucaca aagcgcugau     6180

ggaacgugac auguuuuaca aaauucacua uccauaaaau gcucaacacu cuucuuggca     6240

uguucgccac gaaagugaug aggcuuuugu acauguucua guaaaaaucu agauaauuga     6300

gcgaaaguca uaguaggaca gcguguuuug cuaucauacc auuguuuauc uucgucauuc     6360

caguaaacug ggguagauuu auuugcuagu acuuuguacu uacggacaug aaauacauga     6420

aguucaugau uuuuauaucc gucuucggug agaucuccuu ggagcuguug uucaccaggu     6480

uucuuauacu cuucccgaac cauggucuca auaaagagau cacgucuaua ugcaccacca     6540

ccaggucgaa agaccuugga aauaccagca ucauaagagu ugguagucuu cauaacauac     6600

uuacauguaa aaggaaucau accuuuaucu ucuaaguuug cuugauuggu aacauaugca     6660

acaguauuug caaauuggau gcacuuugaa uuagcgccuc cuuucuuuug caaauuaaua     6720

ucaucauuga auugaucaau aucaucaaua auacaaacuu caugagaagc uuuaaauucu     6780

gacauauauu caucaucuuc auuaaagaca uauuuuaguu ugggaucaaa uuuaguacuu     6840

gauacaccaa gauaucuauc auuuucguau aagcaaugua auagcuuauc agugagugua     6900

gauuuaccca caccuggagc acuguacaaa acaauagaaa uuggagcuuu acgcucgauu     6960

uuaauacaaa guuuaucauc aacucuguau uuguaacgag auaaagcaga uuguugcuuu     7020

uugaggauag cugccuuaua cuuauccgcg gaauagaaac uaagcaauuu cuuuccaguu     7080

uccagagucu caucgacuuu acgucgauaa ucauccaaag acauuucaug auucgcuagc     7140

aaagcgaauu uaucacuaua auaagugaga aagucaaaau ccuucucaua uucgaugauu     7200

ucuuuaucau caauauauau agcguuauaa uuaccaguuu caagauauaa aauaaccuug     7260

ucgguaauaa aaguugcacc aucaauaaua ugaagugcaa uauuuacagc augaguauug     7320

uuauacuccu uuuguaaauu cuuagcgugu aacucggaaa auccgagcca cguggaauca     7380

acuccacuuu uaacaaaaag ugggguacaa acaagaaaag aacaaaaagu agcaaauuuc     7440

uuagcgaaau cacuauuuuu aauuuguuca guuuuagaca aacacugucu aauaucuugu     7500

aaaaucuucu ugaaggacac cuuaucuauc gcaccuugcg cauagguaag auuuccaccu     7560

ucuuucuuag ggcgucuaua caaagacaca ccaagacaca agaaaauaaa ugauaaaaca     7620

uauuuuucau auuuucuugu aaauugagaa uacgucaagc caaacauguc ucgcaaauac     7680

gaauguauug cgguaaugaa ugacauagga gaaguacacu cggauaaccg aauaaagaaa     7740

uucguaugga uauacaaaua uucuucaaug uuaguugauu ucuuuccaaa aacuuuggaa     7800

gaaauaguuu cguuuuuaga gauaacggaa ucuagcauuu uaaugucuug cugagacaaa     7860

aauuuaaaau aauccauaau ugaaguauua uaucgggcau uacccguaaa aacagccuuc     7920

agcggcuuaa acugacuuaa guugucaccu auaaauguuc ucacuuaaga gaaacuaagg     7980

uacuuuuaug gcaacuagua cgaaacccuu uguuuuacua aacaaagaaa aguucauccu     8040

acauauuuau aauuaaauua auaauuaauu gucuauaggg ucgacaaaau aaaaaagggg     8100

accuguuucu ccucuuuauu uauacauaaa aaauuuaauc guuucaaaaa ugcucgucgu     8160

gagagcaguu agcguuaagg ggggaacaga gccuaauccg cgagaauaag uucuggagcc     8220

aaucuaccug uaacuuacgu guuuacaaca ccacagucga uuuucucuga acaccucuaa     8280

uacuaaaaag uucacgaauc uacauuaauu cugucgucua aauuucccug caaaauaaag     8340

cgggacgcca aguugggcgc agauaaaauu cuauacagcu cauaaagagc uguacagugu     8400

ccgacuccca gaacggccag uucgugguua uggauuccca cagacuguug agcggggacu     8460

caccgugaug ggccucccag guuccuggcu gaucuuucgg ugcccgcugu ccccaauc       8518


<210> 2
<211> 4596
<212> RNA
<213> Unknown


<220> 
<223> RNA sequence of ORF1 encoding pro-polypeptide 1

<400> 2
auggauuauu uuaaauuuuu gucucagcaa gacauuaaaa ugcuagauuc cguuaucucu       60

aaaaacgaaa cuauuucuuc caaaguuuuu ggaaagaaau caacuaacau ugaagaauau      120

uuguauaucc auacgaauuu cuuuauucgg uuauccgagu guacuucucc uaugucauuc      180

auuaccgcaa uacauucgua uuugcgagac auguuuggcu ugacguauuc ucaauuuaca      240

agaaaauaug aaaaauaugu uuuaucauuu auuuucuugu gucuuggugu gucuuuguau      300

agacgcccua agaaagaagg uggaaaucuu accuaugcgc aaggugcgau agauaaggug      360

uccuucaaga agauuuuaca agauauuaga caguguuugu cuaaaacuga acaaauuaaa      420

aauagugauu ucgcuaagaa auuugcuacu uuuuguucuu uucuuguuug uaccccacuu      480

uuuguuaaaa guggaguuga uuccacgugg cucggauuuu ccgaguuaca cgcuaagaau      540

uuacaaaagg aguauaacaa uacucaugcu guaaauauug cacuucauau uauugauggu      600

gcaacuuuua uuaccgacaa gguuauuuua uaucuugaaa cugguaauua uaacgcuaua      660

uauauugaug auaaagaaau caucgaauau gagaaggauu uugacuuucu cacuuauuau      720

agugauaaau ucgcuuugcu agcgaaucau gaaaugucuu uggaugauua ucgacguaaa      780

gucgaugaga cucuggaaac uggaaagaaa uugcuuaguu ucuauuccgc ggauaaguau      840

aaggcagcua uccucaaaaa gcaacaaucu gcuuuaucuc guuacaaaua cagaguugau      900

gauaaacuuu guauuaaaau cgagcguaaa gcuccaauuu cuauuguuuu guacagugcu      960

ccaggugugg guaaaucuac acucacugau aagcuauuac auugcuuaua cgaaaaugau     1020

agauaucuug guguaucaag uacuaaauuu gaucccaaac uaaaauaugu cuuuaaugaa     1080

gaugaugaau auaugucaga auuuaaagcu ucucaugaag uuuguauuau ugaugauauu     1140

gaucaauuca augaugauau uaauuugcaa aagaaaggag gcgcuaauuc aaagugcauc     1200

caauuugcaa auacuguugc auauguuacc aaucaagcaa acuuagaaga uaaagguaug     1260

auuccuuuua cauguaagua uguuaugaag acuaccaacu cuuaugaugc ugguauuucc     1320

aaggucuuuc gaccuggugg uggugcauau agacgugauc ucuuuauuga gaccaugguu     1380

cgggaagagu auaagaaacc uggugaacaa cagcuccaag gagaucucac cgaagacgga     1440

uauaaaaauc augaacuuca uguauuucau guccguaagu acaaaguacu agcaaauaaa     1500

ucuaccccag uuuacuggaa ugacgaagau aaacaauggu augauagcaa aacacgcugu     1560

ccuacuauga cuuucgcuca auuaucuaga uuuuuacuag aacauguaca aaagccucau     1620

cacuuucgug gcgaacaugc caagaagagu guugagcauu uuauggauag ugaauuuugu     1680

aaaacauguc acguuccauc agcgcuuugu gauugcacuc guccucaaau gaguguugac     1740

aacauuauga aagcuuauaa uuguacuaug acugugguug aaaauaucag cucacucauu     1800

guauugugua caauuuuuca guuagccauu uauuggcuac uucauaaagu augcgcaaaa     1860

gcuuauagaa auaaugaacc uaauuauuuu acaaaaaagg ucguuauuug uauaagaaaa     1920

uuaccuucga gguuuggagu gccuuggugg uuuaccuauu ugccuauguu ucuugcuacu     1980

aggugucuuu gugacacuug gcaagauugg gcuacgaacg cuuguagacg ugccuauggg     2040

caugguauuu uaguucaguc uauuagacau gacuuaucua aauauuauaa caaguauucg     2100

guugguggua cuguugcacu uuuagggugc uuguauacug uaguacauuu guacucagcu     2160

auucaaccac aaucaaaacc agagguugau gaaaaucaau cugacuccaa cuauugguca     2220

gcucaauaug aaaauauuac acgauugagu ggaccuccua guacaguaac uuuugaccag     2280

uuagucaaua guguugagcu aaacugcugu caucugauag cagaaaaucu cgccgagcga     2340

caauaucaua ggauaugugg ucucggacug uacggaaaug uuauggcuuu gccuaaacau     2400

uucuuugauu cucuuaaaga uguauuuccu cuuagaaugg auaucauuag acacgauauu     2460

accaaacaag gaggaccuag cagauuuaag gucuuaguug augaaucaug uuuugaauuu     2520

gaugaacgcu auggaccaag ugauguuaua uucuuuagac auucagcuuu aggaacuuuu     2580

cgcgaugugc guaaguuccu auuaccugga augauuaaag ggaaaacauc ugguacaaug     2640

guuauucguc aaagagacgg uauuauggcu uauaaagauu uugaagcuau gcaaacuagu     2700

cccaucucuu auaaugauaa auauuuuuca uaugcuuaua augguuauau ggcauacacc     2760

aaagugccaa cuauuauagg cgaaugugga gcaccauaca ucguuaagac cccuaauggg     2820

uguuuuaucg cugguuucca uguuggugca aagaaagcua ccuuuggugg uagauauaaa     2880

gucuuugcag ccuguuuaau caacccagau uuggaugauu ccaaauuuac uccauuauca     2940

uauaauggcg uugauuuaaa cgaacacuau gagacaguag aagaucuauc aauuacucaa     3000

acccaagaca agaaaugucc uaucagacua acugauggag guucaaugau gaugauuggu     3060

ucuauuaaua cucauagacg uaaaaugaaa acuaugguuu gucauacuau uauggcagaa     3120

cguauuuuac agcauuaugg uagauuagau uuuacacauu uuucaccaaa agguauuaaa     3180

uccucuuugg cguguaaaca aaauguugcc ccuauguuug uaaaaccuag uuuuccagcc     3240

aagaauauuc aacgcgcaga agacgcguua uaugcuuggu ucacaaggaa gaucgaagaa     3300

cauaaauucu uuauucccug ugagcccuac gauaucgacg uaggugucaa uggauaugac     3360

ggagcgucau auauugacag aauuccugca ucuacaucag gugguuuugc ucauaaagga     3420

gcuaaaucca aauaucuaca uuuaggugau ccugaaccag gacacaccgu aaaauaugaa     3480

cuuaaugaua auauuaugcu agaauucaau cgccucaaag aaaaauacuu gagaggugaa     3540

cgcggcgaua uuguuuggga cuucaauuuu aaagaugagc ccguuuccga gucaaaacuc     3600

uugaaacaga aauguagaau uuuuaacagu ggaccacuuc auuuuaugau uaugguccgc     3660

auguacuacc uuuggauuau uccauuaguu ucugguaaaa agagaacuga auucgguaug     3720

gccaucggug ccaaugcuca uggagaugau uggacugcaa uuuaugaaua uauuacccaa     3780

cauggaaaga auaauaugau ugcgggcgau uauaaagcau ucgauaagaa uaugccuccu     3840

gagcuuauga cugcuaguuu uaauguucuc auuagaauuu gccgugacaa ugguuggucu     3900

gaugaagacu uaacuauuau gcgcggucua gcuacugaua uuugcuaucc acuauccaau     3960

guuuuuggaa ccgugguagg uauguuuggu aguaacccua guggucaucc ucuuacuacu     4020

ccaauuaaug gcaugugcaa cauuauguau augaugcuag cugccaugga uauugaagaa     4080

gaaagaggug uucaagaaau agauuaugcu agauuucaag auucacuugc cauuuuaacu     4140

uauggcgaug auaauugugc uucuucuagu auuccuuggu uaaaucauac caccauuagu     4200

acugcucuuu cuaaaagagg aguuacuuau acaauggcug auaaagaguc ugaaucugua     4260

ccuuuuauua acguuaaaga uguugauuuu cuuaagagaa aauuuacucc aucuguuuau     4320

guucaaggug uaauucaagc uccuuuagau gaaggaucua uucuuaaaag ucuuaguguu     4380

ugcacuagau cuaagacuau uacuuuuaaa gaacagugug cucaaauuau uucuuccgcc     4440

aauacggaau auuuccaaua ugguaagaag aaauuuguac gggaaaauac auuucucaac     4500

gaauuauuag augaauucga uuugagaugu uauuuaccca auugugcucu uaaaacuuuu     4560

gaugaacuuu auauugaaag guuuggagag uuguaa                               4596


<210> 3
<211> 2787
<212> RNA
<213> Unknown


<220> 
<223> RNA sequence of ORF2 encoding pro-polypeptide 2

<400> 3
augauagaau augagagagc cuuagguuau ucucuuccua cucuacauga agaaaccaaa      60

caaccagaau cuacuacuaa ugauucagau auuacuguac aaagugcagu ugaagcaaaa     120

guagaacaug aagauucugg ucuucaaguc gagacaaaug uuaauuuuuc ugaccaaguu     180

gaacaauauc aacuuuuggu cgguaaaccu uugucagauc cgacuuacga aaaggcuaau     240

ucacaagcug uugauuugac uacguuuaug ucccguccua uuagaauuuu uucuaagauu     300

ugggaaguaa augagucucc acaauaugug aaugcuauua auccuuggga uuuauuccuu     360

agugauucua aaguugcuaa caaaauagaa acuuucaaau uacuucaugg aacucuuaaa     420

cucaagauuu ugguuaaugg uucaccuuuu cauuaugguc gaauguuugu uggucuuaga     480

ccuucaaaau ucgacaacaa uacacuuacu gaugguccug uaacuccagg uguuaguuua     540

uccuauacgg augaaaaugc agcugguagu aauaagacaa ugaauaacau ggcuugcuua     600

uacucgcaac gcccacaugu auuuauugau ccaucaacaa aucaaccaca acaaaucucg     660

uggccuuucu uuucggccac caauuggauu gauuuaacug aucaagaaac uauugaucgu     720

auggguguau uggaaauuug ggaauuaacc caacuacaac auuccaacgg ugccacugau     780

gaaguugaaa uuucaauuuu ugcguggaug gaagauguag aauucgcugg ucuuacagcu     840

gcugcuccug caacagcuca gauggcuguu gauaagauuc aaaacaguaa gaagaccaag     900

aagagaaaac caaagaagaa agcuaaacca acguuuacca auacuucugg agaagaugaa     960

cacaaaccug augguguagu guccgcuccu gcagcuuuau uggccgacuu ugcuggcuau    1020

uuuacagaaa uaccauauau ugguaaauuu gcaaaaucaa cccagauagc gaguggugcu    1080

guuagcagca uugcgcgucu uuuuggauuu ucacgaccug caguucuaac agauacugcg    1140

uuuuauaaau cgcagccuau uggcaauuua gccaauacau caggcgcaga uccuaucuuc    1200

aaauuaacuc uugaucccaa gcaagagcuc accauugacc ccaccaccgu uggguuaggu    1260

gaagaagauc aaaugucuuu uggauauuug gucaagcgug aagcuuucau ugauuacuuc    1320

aauuggagua caauuguagc ccagaacacu gggcucuugu auucuauuca aguacaucca    1380

augauagcac caaucuauca aacugguucu aaugauacag ugguaagaug ucaaacuccu    1440

uugaguuuug uuucuuaucc uuucaauaac uggucuggaa gucuaagaua uagauuucag    1500

auuguggcuu cacaauauca uagaggacga uuauuguuug uuuaugaacc cacguuauca    1560

acggcgggua cugugacaga uacuaaugac cguuacucuc auauuguuga uauuucagaa    1620

gaacgcgaug uuacuuuuga aaucaacugg acccagaaag aagcuuauag aaagauugau    1680

aucuuuagag cucaaaaacu uucugcuugg gagggugcua cuggugcuau agguacagau    1740

gcugaugaug uugcuaauug caauggucgu cuuaauguau auguacuuaa ugcacuugcu    1800

gcuccuauua cggauagcag cguuuccgug aaugucuuca uuuccggugg ugauucuuac    1860

gaagugagaa auccacgugg aacauuaggu caagauguug cuuaugcaaa cucuacgacu    1920

gauguugguc caccuaccuu agcacaaaug gcuacagaag guaugaccau ggacgaaaau    1980

uugccagaac aagauacuac uuauguucuu aauggugaau acacggaauu cuguaaggaa    2040

caaucucacg uauauuacgg ugaggcuguu guuuccuuca gaucuuuacu uaagagauau    2100

aauuacuuua ggacuuuaga gccuuuagaa ucucuuaaug uaaacaauug guacacaguu    2160

auguaccgaa cuuuuaucua cccucagggu ccugguccau cuuauggauc caguauugcu    2220

ucugcaauga ccccuaucgc aggcccaaua aauuauaauu uguuaccuau gaccaugaug    2280

cguuauugca ugcaggcaua uguuggcuuu cgaggaggau gucguuggaa agugcuauau    2340

uguggcacaa auaaccaauu acaaccuuua ucgguaggua gaaaucccga guacuuguca    2400

aacgaaucgc aaguuaauau uauuaacgau aacgguucau acaguuuguc aagaguuguc    2460

cgcgaaaaau ggcgugcgga caauugccaa gacagugcag cuggaacagc uuuaacuggc    2520

guggcuucuc aaccugguuu ggaguaugaa cucccuuuuc aaacagcauu aagauaugcu    2580

gaauguaaug aaccaggaca aaccgagguu ccuccuuaua caggauugaa ucaagguagc    2640

caaacuguau ccuuugauua ucgcgaaacu gcgaaucaau aucauugguu acaauuuuac    2700

acugcagcag cagaagauuu uucccuauuc uuuuucauag gggcuccugc ucgcauacca    2760

agugacguaa auauugucac ucucuaa                                        2787


<210> 4
<211> 1560
<212> PRT
<213> Unknown


<220> 
<223> Amino acid sequence of pro-polypeptide P1

<400> 4
Val Arg Thr Phe Ile Gly Asp Asn Leu Ser Gln Phe Lys Pro Leu Lys 
1               5                   10                  15      
Ala Val Phe Thr Gly Asn Ala Arg Tyr Asn Thr Ser Ile Met Asp Tyr 
            20                  25                  30          
Phe Lys Phe Leu Ser Gln Gln Asp Ile Lys Met Leu Asp Ser Val Ile 
        35                  40                  45              
Ser Lys Asn Glu Thr Ile Ser Ser Lys Val Phe Gly Lys Lys Ser Thr 
    50                  55                  60                  
Asn Ile Glu Glu Tyr Leu Tyr Ile His Thr Asn Phe Phe Ile Arg Leu 
65                  70                  75                  80  
Ser Glu Cys Thr Ser Pro Met Ser Phe Ile Thr Ala Ile His Ser Tyr 
                85                  90                  95      
Leu Arg Asp Met Phe Gly Leu Thr Tyr Ser Gln Phe Thr Arg Lys Tyr 
            100                 105                 110         
Glu Lys Tyr Val Leu Ser Phe Ile Phe Leu Cys Leu Gly Val Ser Leu 
        115                 120                 125             
Tyr Arg Arg Pro Lys Lys Glu Gly Gly Asn Leu Thr Tyr Ala Gln Gly 
    130                 135                 140                 
Ala Ile Asp Lys Val Ser Phe Lys Lys Ile Leu Gln Asp Ile Arg Gln 
145                 150                 155                 160 
Cys Leu Ser Lys Thr Glu Gln Ile Lys Asn Ser Asp Phe Ala Lys Lys 
                165                 170                 175     
Phe Ala Thr Phe Cys Ser Phe Leu Val Cys Thr Pro Leu Phe Val Lys 
            180                 185                 190         
Ser Gly Val Asp Ser Thr Trp Leu Gly Phe Ser Glu Leu His Ala Lys 
        195                 200                 205             
Asn Leu Gln Lys Glu Tyr Asn Asn Thr His Ala Val Asn Ile Ala Leu 
    210                 215                 220                 
His Ile Ile Asp Gly Ala Thr Phe Ile Thr Asp Lys Val Ile Leu Tyr 
225                 230                 235                 240 
Leu Glu Thr Gly Asn Tyr Asn Ala Ile Tyr Ile Asp Asp Lys Glu Ile 
                245                 250                 255     
Ile Glu Tyr Glu Lys Asp Phe Asp Phe Leu Thr Tyr Tyr Ser Asp Lys 
            260                 265                 270         
Phe Ala Leu Leu Ala Asn His Glu Met Ser Leu Asp Asp Tyr Arg Arg 
        275                 280                 285             
Lys Val Asp Glu Thr Leu Glu Thr Gly Lys Lys Leu Leu Ser Phe Tyr 
    290                 295                 300                 
Ser Ala Asp Lys Tyr Lys Ala Ala Ile Leu Lys Lys Gln Gln Ser Ala 
305                 310                 315                 320 
Leu Ser Arg Tyr Lys Tyr Arg Val Asp Asp Lys Leu Cys Ile Lys Ile 
                325                 330                 335     
Glu Arg Lys Ala Pro Ile Ser Ile Val Leu Tyr Ser Ala Pro Gly Val 
            340                 345                 350         
Gly Lys Ser Thr Leu Thr Asp Lys Leu Leu His Cys Leu Tyr Glu Asn 
        355                 360                 365             
Asp Arg Tyr Leu Gly Val Ser Ser Thr Lys Phe Asp Pro Lys Leu Lys 
    370                 375                 380                 
Tyr Val Phe Asn Glu Asp Asp Glu Tyr Met Ser Glu Phe Lys Ala Ser 
385                 390                 395                 400 
His Glu Val Cys Ile Ile Asp Asp Ile Asp Gln Phe Asn Asp Asp Ile 
                405                 410                 415     
Asn Leu Gln Lys Lys Gly Gly Ala Asn Ser Lys Cys Ile Gln Phe Ala 
            420                 425                 430         
Asn Thr Val Ala Tyr Val Thr Asn Gln Ala Asn Leu Glu Asp Lys Gly 
        435                 440                 445             
Met Ile Pro Phe Thr Cys Lys Tyr Val Met Lys Thr Thr Asn Ser Tyr 
    450                 455                 460                 
Asp Ala Gly Ile Ser Lys Val Phe Arg Pro Gly Gly Gly Ala Tyr Arg 
465                 470                 475                 480 
Arg Asp Leu Phe Ile Glu Thr Met Val Arg Glu Glu Tyr Lys Lys Pro 
                485                 490                 495     
Gly Glu Gln Gln Leu Gln Gly Asp Leu Thr Glu Asp Gly Tyr Lys Asn 
            500                 505                 510         
His Glu Leu His Val Phe His Val Arg Lys Tyr Lys Val Leu Ala Asn 
        515                 520                 525             
Lys Ser Thr Pro Val Tyr Trp Asn Asp Glu Asp Lys Gln Trp Tyr Asp 
    530                 535                 540                 
Ser Lys Thr Arg Cys Pro Thr Met Thr Phe Ala Gln Leu Ser Arg Phe 
545                 550                 555                 560 
Leu Leu Glu His Val Gln Lys Pro His His Phe Arg Gly Glu His Ala 
                565                 570                 575     
Lys Lys Ser Val Glu His Phe Met Asp Ser Glu Phe Cys Lys Thr Cys 
            580                 585                 590         
His Val Pro Ser Ala Leu Cys Asp Cys Thr Arg Pro Gln Met Ser Val 
        595                 600                 605             
Asp Asn Ile Met Lys Ala Tyr Asn Cys Thr Met Thr Val Val Glu Asn 
    610                 615                 620                 
Ile Ser Ser Leu Ile Val Leu Cys Thr Ile Phe Gln Leu Ala Ile Tyr 
625                 630                 635                 640 
Trp Leu Leu His Lys Val Cys Ala Lys Ala Tyr Arg Asn Asn Glu Pro 
                645                 650                 655     
Asn Tyr Phe Thr Lys Lys Val Val Ile Cys Ile Arg Lys Leu Pro Ser 
            660                 665                 670         
Arg Phe Gly Val Pro Trp Trp Phe Thr Tyr Leu Pro Met Phe Leu Ala 
        675                 680                 685             
Thr Arg Cys Leu Cys Asp Thr Trp Gln Asp Trp Ala Thr Asn Ala Cys 
    690                 695                 700                 
Arg Arg Ala Tyr Gly His Gly Ile Leu Val Gln Ser Ile Arg His Asp 
705                 710                 715                 720 
Leu Ser Lys Tyr Tyr Asn Lys Tyr Ser Val Gly Gly Thr Val Ala Leu 
                725                 730                 735     
Leu Gly Cys Leu Tyr Thr Val Val His Leu Tyr Ser Ala Ile Gln Pro 
            740                 745                 750         
Gln Ser Lys Pro Glu Val Asp Glu Asn Gln Ser Asp Ser Asn Tyr Trp 
        755                 760                 765             
Ser Ala Gln Tyr Glu Asn Ile Thr Arg Leu Ser Gly Pro Pro Ser Thr 
    770                 775                 780                 
Val Thr Phe Asp Gln Leu Val Asn Ser Val Glu Leu Asn Cys Cys His 
785                 790                 795                 800 
Leu Ile Ala Glu Asn Leu Ala Glu Arg Gln Tyr His Arg Ile Cys Gly 
                805                 810                 815     
Leu Gly Leu Tyr Gly Asn Val Met Ala Leu Pro Lys His Phe Phe Asp 
            820                 825                 830         
Ser Leu Lys Asp Val Phe Pro Leu Arg Met Asp Ile Ile Arg His Asp 
        835                 840                 845             
Ile Thr Lys Gln Gly Gly Pro Ser Arg Phe Lys Val Leu Val Asp Glu 
    850                 855                 860                 
Ser Cys Phe Glu Phe Asp Glu Arg Tyr Gly Pro Ser Asp Val Ile Phe 
865                 870                 875                 880 
Phe Arg His Ser Ala Leu Gly Thr Phe Arg Asp Val Arg Lys Phe Leu 
                885                 890                 895     
Leu Pro Gly Met Ile Lys Gly Lys Thr Ser Gly Thr Met Val Ile Arg 
            900                 905                 910         
Gln Arg Asp Gly Ile Met Ala Tyr Lys Asp Phe Glu Ala Met Gln Thr 
        915                 920                 925             
Ser Pro Ile Ser Tyr Asn Asp Lys Tyr Phe Ser Tyr Ala Tyr Asn Gly 
    930                 935                 940                 
Tyr Met Ala Tyr Thr Lys Val Pro Thr Ile Ile Gly Glu Cys Gly Ala 
945                 950                 955                 960 
Pro Tyr Ile Val Lys Thr Pro Asn Gly Cys Phe Ile Ala Gly Phe His 
                965                 970                 975     
Val Gly Ala Lys Lys Ala Thr Phe Gly Gly Arg Tyr Lys Val Phe Ala 
            980                 985                 990         
Ala Cys Leu Ile Asn Pro Asp Leu Asp Asp Ser Lys Phe Thr Pro Leu 
        995                 1000                1005            
Ser Tyr Asn Gly Val Asp Leu Asn Glu His Tyr Glu Thr Val Glu Asp 
    1010                1015                1020                
Leu Ser Ile Thr Gln Thr Gln Asp Lys Lys Cys Pro Ile Arg Leu Thr 
1025                1030                1035                1040
Asp Gly Gly Ser Met Met Met Ile Gly Ser Ile Asn Thr His Arg Arg 
                1045                1050                1055    
Lys Met Lys Thr Met Val Cys His Thr Ile Met Ala Glu Arg Ile Leu 
            1060                1065                1070        
Gln His Tyr Gly Arg Leu Asp Phe Thr His Phe Ser Pro Lys Gly Ile 
        1075                1080                1085            
Lys Ser Ser Leu Ala Cys Lys Gln Asn Val Ala Pro Met Phe Val Lys 
    1090                1095                1100                
Pro Ser Phe Pro Ala Lys Asn Ile Gln Arg Ala Glu Asp Ala Leu Tyr 
1105                1110                1115                1120
Ala Trp Phe Thr Arg Lys Ile Glu Glu His Lys Phe Phe Ile Pro Cys 
                1125                1130                1135    
Glu Pro Tyr Asp Ile Asp Val Gly Val Asn Gly Tyr Asp Gly Ala Ser 
            1140                1145                1150        
Tyr Ile Asp Arg Ile Pro Ala Ser Thr Ser Gly Gly Phe Ala His Lys 
        1155                1160                1165            
Gly Ala Lys Ser Lys Tyr Leu His Leu Gly Asp Pro Glu Pro Gly His 
    1170                1175                1180                
Thr Val Lys Tyr Glu Leu Asn Asp Asn Ile Met Leu Glu Phe Asn Arg 
1185                1190                1195                1200
Leu Lys Glu Lys Tyr Leu Arg Gly Glu Arg Gly Asp Ile Val Trp Asp 
                1205                1210                1215    
Phe Asn Phe Lys Asp Glu Pro Val Ser Glu Ser Lys Leu Leu Lys Gln 
            1220                1225                1230        
Lys Cys Arg Ile Phe Asn Ser Gly Pro Leu His Phe Met Ile Met Val 
        1235                1240                1245            
Arg Met Tyr Tyr Leu Trp Ile Ile Pro Leu Val Ser Gly Lys Lys Arg 
    1250                1255                1260                
Thr Glu Phe Gly Met Ala Ile Gly Ala Asn Ala His Gly Asp Asp Trp 
1265                1270                1275                1280
Thr Ala Ile Tyr Glu Tyr Ile Thr Gln His Gly Lys Asn Asn Met Ile 
                1285                1290                1295    
Ala Gly Asp Tyr Lys Ala Phe Asp Lys Asn Met Pro Pro Glu Leu Met 
            1300                1305                1310        
Thr Ala Ser Phe Asn Val Leu Ile Arg Ile Cys Arg Asp Asn Gly Trp 
        1315                1320                1325            
Ser Asp Glu Asp Leu Thr Ile Met Arg Gly Leu Ala Thr Asp Ile Cys 
    1330                1335                1340                
Tyr Pro Leu Ser Asn Val Phe Gly Thr Val Val Gly Met Phe Gly Ser 
1345                1350                1355                1360
Asn Pro Ser Gly His Pro Leu Thr Thr Pro Ile Asn Gly Met Cys Asn 
                1365                1370                1375    
Ile Met Tyr Met Met Leu Ala Ala Met Asp Ile Glu Glu Glu Arg Gly 
            1380                1385                1390        
Val Gln Glu Ile Asp Tyr Ala Arg Phe Gln Asp Ser Leu Ala Ile Leu 
        1395                1400                1405            
Thr Tyr Gly Asp Asp Asn Cys Ala Ser Ser Ser Ile Pro Trp Leu Asn 
    1410                1415                1420                
His Thr Thr Ile Ser Thr Ala Leu Ser Lys Arg Gly Val Thr Tyr Thr 
1425                1430                1435                1440
Met Ala Asp Lys Glu Ser Glu Ser Val Pro Phe Ile Asn Val Lys Asp 
                1445                1450                1455    
Val Asp Phe Leu Lys Arg Lys Phe Thr Pro Ser Val Tyr Val Gln Gly 
            1460                1465                1470        
Val Ile Gln Ala Pro Leu Asp Glu Gly Ser Ile Leu Lys Ser Leu Ser 
        1475                1480                1485            
Val Cys Thr Arg Ser Lys Thr Ile Thr Phe Lys Glu Gln Cys Ala Gln 
    1490                1495                1500                
Ile Ile Ser Ser Ala Asn Thr Glu Tyr Phe Gln Tyr Gly Lys Lys Lys 
1505                1510                1515                1520
Phe Val Arg Glu Asn Thr Phe Leu Asn Glu Leu Leu Asp Glu Phe Asp 
                1525                1530                1535    
Leu Arg Cys Tyr Leu Pro Asn Cys Ala Leu Lys Thr Phe Asp Glu Leu 
            1540                1545                1550        
Tyr Ile Glu Arg Phe Gly Glu Leu 
        1555                1560

<210> 5
<211> 928
<212> PRT
<213> Unknown


<220> 
<223> Amino acid sequence of pro-polypeptide P2

<400> 5
Met Ile Glu Tyr Glu Arg Ala Leu Gly Tyr Ser Leu Pro Thr Leu His 
1               5                   10                  15      
Glu Glu Thr Lys Gln Pro Glu Ser Thr Thr Asn Asp Ser Asp Ile Thr 
            20                  25                  30          
Val Gln Ser Ala Val Glu Ala Lys Val Glu His Glu Asp Ser Gly Leu 
        35                  40                  45              
Gln Val Glu Thr Asn Val Asn Phe Ser Asp Gln Val Glu Gln Tyr Gln 
    50                  55                  60                  
Leu Leu Val Gly Lys Pro Leu Ser Asp Pro Thr Tyr Glu Lys Ala Asn 
65                  70                  75                  80  
Ser Gln Ala Val Asp Leu Thr Thr Phe Met Ser Arg Pro Ile Arg Ile 
                85                  90                  95      
Phe Ser Lys Ile Trp Glu Val Asn Glu Ser Pro Gln Tyr Val Asn Ala 
            100                 105                 110         
Ile Asn Pro Trp Asp Leu Phe Leu Ser Asp Ser Lys Val Ala Asn Lys 
        115                 120                 125             
Ile Glu Thr Phe Lys Leu Leu His Gly Thr Leu Lys Leu Lys Ile Leu 
    130                 135                 140                 
Val Asn Gly Ser Pro Phe His Tyr Gly Arg Met Phe Val Gly Leu Arg 
145                 150                 155                 160 
Pro Ser Lys Phe Asp Asn Asn Thr Leu Thr Asp Gly Pro Val Thr Pro 
                165                 170                 175     
Gly Val Ser Leu Ser Tyr Thr Asp Glu Asn Ala Ala Gly Ser Asn Lys 
            180                 185                 190         
Thr Met Asn Asn Met Ala Cys Leu Tyr Ser Gln Arg Pro His Val Phe 
        195                 200                 205             
Ile Asp Pro Ser Thr Asn Gln Pro Gln Gln Ile Ser Trp Pro Phe Phe 
    210                 215                 220                 
Ser Ala Thr Asn Trp Ile Asp Leu Thr Asp Gln Glu Thr Ile Asp Arg 
225                 230                 235                 240 
Met Gly Val Leu Glu Ile Trp Glu Leu Thr Gln Leu Gln His Ser Asn 
                245                 250                 255     
Gly Ala Thr Asp Glu Val Glu Ile Ser Ile Phe Ala Trp Met Glu Asp 
            260                 265                 270         
Val Glu Phe Ala Gly Leu Thr Ala Ala Ala Pro Ala Thr Ala Gln Met 
        275                 280                 285             
Ala Val Asp Lys Ile Gln Asn Ser Lys Lys Thr Lys Lys Arg Lys Pro 
    290                 295                 300                 
Lys Lys Lys Ala Lys Pro Thr Phe Thr Asn Thr Ser Gly Glu Asp Glu 
305                 310                 315                 320 
His Lys Pro Asp Gly Val Val Ser Ala Pro Ala Ala Leu Leu Ala Asp 
                325                 330                 335     
Phe Ala Gly Tyr Phe Thr Glu Ile Pro Tyr Ile Gly Lys Phe Ala Lys 
            340                 345                 350         
Ser Thr Gln Ile Ala Ser Gly Ala Val Ser Ser Ile Ala Arg Leu Phe 
        355                 360                 365             
Gly Phe Ser Arg Pro Ala Val Leu Thr Asp Thr Ala Phe Tyr Lys Ser 
    370                 375                 380                 
Gln Pro Ile Gly Asn Leu Ala Asn Thr Ser Gly Ala Asp Pro Ile Phe 
385                 390                 395                 400 
Lys Leu Thr Leu Asp Pro Lys Gln Glu Leu Thr Ile Asp Pro Thr Thr 
                405                 410                 415     
Val Gly Leu Gly Glu Glu Asp Gln Met Ser Phe Gly Tyr Leu Val Lys 
            420                 425                 430         
Arg Glu Ala Phe Ile Asp Tyr Phe Asn Trp Ser Thr Ile Val Ala Gln 
        435                 440                 445             
Asn Thr Gly Leu Leu Tyr Ser Ile Gln Val His Pro Met Ile Ala Pro 
    450                 455                 460                 
Ile Tyr Gln Thr Gly Ser Asn Asp Thr Val Val Arg Cys Gln Thr Pro 
465                 470                 475                 480 
Leu Ser Phe Val Ser Tyr Pro Phe Asn Asn Trp Ser Gly Ser Leu Arg 
                485                 490                 495     
Tyr Arg Phe Gln Ile Val Ala Ser Gln Tyr His Arg Gly Arg Leu Leu 
            500                 505                 510         
Phe Val Tyr Glu Pro Thr Leu Ser Thr Ala Gly Thr Val Thr Asp Thr 
        515                 520                 525             
Asn Asp Arg Tyr Ser His Ile Val Asp Ile Ser Glu Glu Arg Asp Val 
    530                 535                 540                 
Thr Phe Glu Ile Asn Trp Thr Gln Lys Glu Ala Tyr Arg Lys Ile Asp 
545                 550                 555                 560 
Ile Phe Arg Ala Gln Lys Leu Ser Ala Trp Glu Gly Ala Thr Gly Ala 
                565                 570                 575     
Ile Gly Thr Asp Ala Asp Asp Val Ala Asn Cys Asn Gly Arg Leu Asn 
            580                 585                 590         
Val Tyr Val Leu Asn Ala Leu Ala Ala Pro Ile Thr Asp Ser Ser Val 
        595                 600                 605             
Ser Val Asn Val Phe Ile Ser Gly Gly Asp Ser Tyr Glu Val Arg Asn 
    610                 615                 620                 
Pro Arg Gly Thr Leu Gly Gln Asp Val Ala Tyr Ala Asn Ser Thr Thr 
625                 630                 635                 640 
Asp Val Gly Pro Pro Thr Leu Ala Gln Met Ala Thr Glu Gly Met Thr 
                645                 650                 655     
Met Asp Glu Asn Leu Pro Glu Gln Asp Thr Thr Tyr Val Leu Asn Gly 
            660                 665                 670         
Glu Tyr Thr Glu Phe Cys Lys Glu Gln Ser His Val Tyr Tyr Gly Glu 
        675                 680                 685             
Ala Val Val Ser Phe Arg Ser Leu Leu Lys Arg Tyr Asn Tyr Phe Arg 
    690                 695                 700                 
Thr Leu Glu Pro Leu Glu Ser Leu Asn Val Asn Asn Trp Tyr Thr Val 
705                 710                 715                 720 
Met Tyr Arg Thr Phe Ile Tyr Pro Gln Gly Pro Gly Pro Ser Tyr Gly 
                725                 730                 735     
Ser Ser Ile Ala Ser Ala Met Thr Pro Ile Ala Gly Pro Ile Asn Tyr 
            740                 745                 750         
Asn Leu Leu Pro Met Thr Met Met Arg Tyr Cys Met Gln Ala Tyr Val 
        755                 760                 765             
Gly Phe Arg Gly Gly Cys Arg Trp Lys Val Leu Tyr Cys Gly Thr Asn 
    770                 775                 780                 
Asn Gln Leu Gln Pro Leu Ser Val Gly Arg Asn Pro Glu Tyr Leu Ser 
785                 790                 795                 800 
Asn Glu Ser Gln Val Asn Ile Ile Asn Asp Asn Gly Ser Tyr Ser Leu 
                805                 810                 815     
Ser Arg Val Val Arg Glu Lys Trp Arg Ala Asp Asn Cys Gln Asp Ser 
            820                 825                 830         
Ala Ala Gly Thr Ala Leu Thr Gly Val Ala Ser Gln Pro Gly Leu Glu 
        835                 840                 845             
Tyr Glu Leu Pro Phe Gln Thr Ala Leu Arg Tyr Ala Glu Cys Asn Glu 
    850                 855                 860                 
Pro Gly Gln Thr Glu Val Pro Pro Tyr Thr Gly Leu Asn Gln Gly Ser 
865                 870                 875                 880 
Gln Thr Val Ser Phe Asp Tyr Arg Glu Thr Ala Asn Gln Tyr His Trp 
                885                 890                 895     
Leu Gln Phe Tyr Thr Ala Ala Ala Glu Asp Phe Ser Leu Phe Phe Phe 
            900                 905                 910         
Ile Gly Ala Pro Ala Arg Ile Pro Ser Asp Val Asn Ile Val Thr Leu 
        915                 920                 925             



<210> 6
<211> 316
<212> PRT
<213> Unknown


<220> 
<223> Amino acid sequence of capsid-like polypeptide CP1

<400> 6
Met Ile Glu Tyr Glu Arg Ala Leu Gly Tyr Ser Leu Pro Thr Leu His 
1               5                   10                  15      
Glu Glu Thr Lys Gln Pro Glu Ser Thr Thr Asn Asp Ser Asp Ile Thr 
            20                  25                  30          
Val Gln Ser Ala Val Glu Ala Lys Val Glu His Glu Asp Ser Gly Leu 
        35                  40                  45              
Gln Val Glu Thr Asn Val Asn Phe Ser Asp Gln Val Glu Gln Tyr Gln 
    50                  55                  60                  
Leu Leu Val Gly Lys Pro Leu Ser Asp Pro Thr Tyr Glu Lys Ala Asn 
65                  70                  75                  80  
Ser Gln Ala Val Asp Leu Thr Thr Phe Met Ser Arg Pro Ile Arg Ile 
                85                  90                  95      
Phe Ser Lys Ile Trp Glu Val Asn Glu Ser Pro Gln Tyr Val Asn Ala 
            100                 105                 110         
Ile Asn Pro Trp Asp Leu Phe Leu Ser Asp Ser Lys Val Ala Asn Lys 
        115                 120                 125             
Ile Glu Thr Phe Lys Leu Leu His Gly Thr Leu Lys Leu Lys Ile Leu 
    130                 135                 140                 
Val Asn Gly Ser Pro Phe His Tyr Gly Arg Met Phe Val Gly Leu Arg 
145                 150                 155                 160 
Pro Ser Lys Phe Asp Asn Asn Thr Leu Thr Asp Gly Pro Val Thr Pro 
                165                 170                 175     
Gly Val Ser Leu Ser Tyr Thr Asp Glu Asn Ala Ala Gly Ser Asn Lys 
            180                 185                 190         
Thr Met Asn Asn Met Ala Cys Leu Tyr Ser Gln Arg Pro His Val Phe 
        195                 200                 205             
Ile Asp Pro Ser Thr Asn Gln Pro Gln Gln Ile Ser Trp Pro Phe Phe 
    210                 215                 220                 
Ser Ala Thr Asn Trp Ile Asp Leu Thr Asp Gln Glu Thr Ile Asp Arg 
225                 230                 235                 240 
Met Gly Val Leu Glu Ile Trp Glu Leu Thr Gln Leu Gln His Ser Asn 
                245                 250                 255     
Gly Ala Thr Asp Glu Val Glu Ile Ser Ile Phe Ala Trp Met Glu Asp 
            260                 265                 270         
Val Glu Phe Ala Gly Leu Thr Ala Ala Ala Pro Ala Thr Ala Gln Met 
        275                 280                 285             
Ala Val Asp Lys Ile Gln Asn Ser Lys Lys Thr Lys Lys Arg Lys Pro 
    290                 295                 300                 
Lys Lys Lys Ala Lys Pro Thr Phe Thr Asn Thr Ser 
305                 310                 315     

<210> 7
<211> 283
<212> PRT
<213> Unknown


<220> 
<223> Amino acid sequence of capsid-like polypeptide CP2

<400> 7
Ser Arg Pro Ala Val Leu Thr Asp Thr Ala Phe Tyr Lys Ser Gln Pro 
1               5                   10                  15      
Ile Gly Asn Leu Ala Asn Thr Ser Gly Ala Asp Pro Ile Phe Lys Leu 
            20                  25                  30          
Thr Leu Asp Pro Lys Gln Glu Leu Thr Ile Asp Pro Thr Thr Val Gly 
        35                  40                  45              
Leu Gly Glu Glu Asp Gln Met Ser Phe Gly Tyr Leu Val Lys Arg Glu 
    50                  55                  60                  
Ala Phe Ile Asp Tyr Phe Asn Trp Ser Thr Ile Val Ala Gln Asn Thr 
65                  70                  75                  80  
Gly Leu Leu Tyr Ser Ile Gln Val His Pro Met Ile Ala Pro Ile Tyr 
                85                  90                  95      
Gln Thr Gly Ser Asn Asp Thr Val Val Arg Cys Gln Thr Pro Leu Ser 
            100                 105                 110         
Phe Val Ser Tyr Pro Phe Asn Asn Trp Ser Gly Ser Leu Arg Tyr Arg 
        115                 120                 125             
Phe Gln Ile Val Ala Ser Gln Tyr His Arg Gly Arg Leu Leu Phe Val 
    130                 135                 140                 
Tyr Glu Pro Thr Leu Ser Thr Ala Gly Thr Val Thr Asp Thr Asn Asp 
145                 150                 155                 160 
Arg Tyr Ser His Ile Val Asp Ile Ser Glu Glu Arg Asp Val Thr Phe 
                165                 170                 175     
Glu Ile Asn Trp Thr Gln Lys Glu Ala Tyr Arg Lys Ile Asp Ile Phe 
            180                 185                 190         
Arg Ala Gln Lys Leu Ser Ala Trp Glu Gly Ala Thr Gly Ala Ile Gly 
        195                 200                 205             
Thr Asp Ala Asp Asp Val Ala Asn Cys Asn Gly Arg Leu Asn Val Tyr 
    210                 215                 220                 
Val Leu Asn Ala Leu Ala Ala Pro Ile Thr Asp Ser Ser Val Ser Val 
225                 230                 235                 240 
Asn Val Phe Ile Ser Gly Gly Asp Ser Tyr Glu Val Arg Asn Pro Arg 
                245                 250                 255     
Gly Thr Leu Gly Gln Asp Val Ala Tyr Ala Asn Ser Thr Thr Asp Val 
            260                 265                 270         
Gly Pro Pro Thr Leu Ala Gln Met Ala Thr Glu 
        275                 280             

<210> 8
<211> 275
<212> PRT
<213> Unknown


<220> 
<223> Amino acid sequence of capsid-like polypeptide CP3

<400> 8
Gly Met Thr Met Asp Glu Asn Leu Pro Glu Gln Asp Thr Thr Tyr Val 
1               5                   10                  15      
Leu Asn Gly Glu Tyr Thr Glu Phe Cys Lys Glu Gln Ser His Val Tyr 
            20                  25                  30          
Tyr Gly Glu Ala Val Val Ser Phe Arg Ser Leu Leu Lys Arg Tyr Asn 
        35                  40                  45              
Tyr Phe Arg Thr Leu Glu Pro Leu Glu Ser Leu Asn Val Asn Asn Trp 
    50                  55                  60                  
Tyr Thr Val Met Tyr Arg Thr Phe Ile Tyr Pro Gln Gly Pro Gly Pro 
65                  70                  75                  80  
Ser Tyr Gly Ser Ser Ile Ala Ser Ala Met Thr Pro Ile Ala Gly Pro 
                85                  90                  95      
Ile Asn Tyr Asn Leu Leu Pro Met Thr Met Met Arg Tyr Cys Met Gln 
            100                 105                 110         
Ala Tyr Val Gly Phe Arg Gly Gly Cys Arg Trp Lys Val Leu Tyr Cys 
        115                 120                 125             
Gly Thr Asn Asn Gln Leu Gln Pro Leu Ser Val Gly Arg Asn Pro Glu 
    130                 135                 140                 
Tyr Leu Ser Asn Glu Ser Gln Val Asn Ile Ile Asn Asp Asn Gly Ser 
145                 150                 155                 160 
Tyr Ser Leu Ser Arg Val Val Arg Glu Lys Trp Arg Ala Asp Asn Cys 
                165                 170                 175     
Gln Asp Ser Ala Ala Gly Thr Ala Leu Thr Gly Val Ala Ser Gln Pro 
            180                 185                 190         
Gly Leu Glu Tyr Glu Leu Pro Phe Gln Thr Ala Leu Arg Tyr Ala Glu 
        195                 200                 205             
Cys Asn Glu Pro Gly Gln Thr Glu Val Pro Pro Tyr Thr Gly Leu Asn 
    210                 215                 220                 
Gln Gly Ser Gln Thr Val Ser Phe Asp Tyr Arg Glu Thr Ala Asn Gln 
225                 230                 235                 240 
Tyr His Trp Leu Gln Phe Tyr Thr Ala Ala Ala Glu Asp Phe Ser Leu 
                245                 250                 255     
Phe Phe Phe Ile Gly Ala Pro Ala Arg Ile Pro Ser Asp Val Asn Ile 
            260                 265                 270         
Val Thr Leu 
        275 

<210> 9
<211> 54
<212> PRT
<213> Unknown


<220> 
<223> Amino acid sequence of capsid-like polypeptide CP4

<400> 9
Gly Glu Asp Glu His Lys Pro Asp Gly Val Val Ser Ala Pro Ala Ala 
1               5                   10                  15      
Leu Leu Ala Asp Phe Ala Gly Tyr Phe Thr Glu Ile Pro Tyr Ile Gly 
            20                  25                  30          
Lys Phe Ala Lys Ser Thr Gln Ile Ala Ser Gly Ala Val Ser Ser Ile 
        35                  40                  45              
Ala Arg Leu Phe Gly Phe 
    50                  

<210> 10
<211> 1764
<212> DNA
<213> Ulva lactuca


<220> 
<223> DNA nucleic acid encoding 18sRNA of Ulva lactuca

<400> 10
atcctgccag tagtcatatg cttgtctcaa agattaagcc atgcatgtct aagtataaac      60

agtttatact gtgaaactgc gaatggctca ttaaatcagt tagagtttat ttgatggtac     120

cacactactc ggataaccgt agtaaagcta cagctaatac gtgcgtaact cccgacttac     180

gaagggacgt atttattaga ttcaaggccg accgtgcttg cacgtctttg gtgaatcatg     240

gtaacttcac gaatcgcagg gtctatcccg gcgatgtttc attcaacttt ctgccctatc     300

aactttcgac ggtagtatag aggactaccg tggtggtaac gggtgacgga ggattagggt     360

tcgattccgg agagggagcc tgagaaacgg ctaccacatc caaggaaggc agcaggcgcg     420

caaattaccc aatcctgaca cagggaggta gtgacaataa atatcaattc tgggccacat     480

ggtccggtaa ttggaatgag tacaatgtaa acgccttaac gaggatccat tggagggcaa     540

gtctggtgcc agcagccgcg gtaattccag ctccaatagc gtatatttaa gttgttgcag     600

ttaaaaagct cgtagttgga tttcgggtgg gacgcagcgg tctcgcattg cgtttgtact     660

gctgcagccc tccttcttgc cgggggcggc ccttcggact tcactgtctg ggggtcagaa     720

tcggcgatgt tactttgagt aaattagagt gttcaaagca agcttacgct ctgaatataa     780

tagcatggga taacacgaca ggactctggc ctatcgtgtt ggtctatagg accagagtaa     840

tgattaagag ggacagtcgg gggcattcgt attccgttgt cagaggtgaa attcttggat     900

ttacggaaga cgaacatctg cgaaagcatt tgccaaggat gttttcattg atcaagaacg     960

aaagttgggg gctcgaagac gattagatac cgtcgtagtc tcaaccataa acgatgccga    1020

ctagggattg gcgggtgttt ttttgatgac cccgccagca cctcatgaga aatcaaagtt    1080

tttgggttcc ggggggagta tggtcgcaag gctgaaactt aaaggaattg acggaagggc    1140

accaccaggc gtggagcctg cggcttaatt tgactcaaca cgggaaaact taccaggtcc    1200

agacatagga aggattgaca gattgagagc tctttcttga ttctatgggt ggtggtgcat    1260

ggccgttctt agttggtggg ttgccttgtc aggttgattc cggtaacgaa cgagacctca    1320

gcctgctaaa tagtgacgtc tgctgcggca gtcgcgcgct tcttagaggg actgttggcg    1380

tctagccaat ggaagtatga ggcaataaca ggtctgtgat gcccttagat gttctgggcc    1440

gcacgcgcgc tacactgata cgttcaacga gctccttgac cgagaggccc gggtaatctt    1500

tgaaaccgta tcgtgatggg gatagaacat tgcaattatt gttcttcaac gaggaatgcc    1560

tagtaagcgc gagtcatcat ctcgcgttga ttacgtccct gccctttgta cacaccgccc    1620

gtcgctccta ccgattgaac gtgctggtga agcgttagga ctggaacttt gggccggtct    1680

cctgctcatt gtttcgggaa tttcgttgaa ccctcccgtt tagaggaagg agaagtcgta    1740

acaaggtctc cgtaggtgaa cctg                                           1764


