               SEQUENCE LISTING

<110> Sanofi Pasteur SA

<120> Multimerization of recombinant protein by fusion to a sequence from lamprey

<160> 29

<170> BiSSAP 1.2

<210> 1
<211> 30
<212> PRT
<213> Petromyzontidae


<400> 1
Asp Cys Gly Lys Pro Ala Cys Thr Thr Leu Leu Asn Cys Ala Asn Phe 
1               5                   10                  15      
Leu Ser Cys Leu Cys Ser Thr Cys Ala Leu Cys Arg Lys Arg 
            20                  25                  30  

<210> 2
<211> 43
<212> PRT
<213> Petromyzontidae


<400> 2
Asn Cys Thr Ser Ile Gln Glu Arg Lys Asn Asp Gly Gly Asp Cys Gly 
1               5                   10                  15      
Lys Pro Ala Cys Thr Thr Leu Leu Asn Cys Ala Asn Phe Leu Ser Cys 
            20                  25                  30          
Leu Cys Ser Thr Cys Ala Leu Cys Arg Lys Arg 
        35                  40              

<210> 3
<211> 90
<212> DNA
<213> Petromyzontidae

<220> 
<221> source
<222> 1..90
<223> /mol_type="unassigned DNA"
      /organism="Petromyzontidae"

<400> 3
gattgcggca aaccggcgtg caccaccctg ctgaactgcg cgaactttct gagctgcctg     60

tgcagcacct gcgcgctgtg ccgcaaacgc                                      90


<210> 4
<211> 129
<212> DNA
<213> Petromyzontidae

<220> 
<221> source
<222> 1..129
<223> /mol_type="unassigned DNA"
      /organism="Petromyzontidae"

<400> 4
aactgcacca gcattcagga acgcaaaaac gatggcggcg attgcggcaa accggcgtgc     60

accaccctgc tgaactgcgc gaactttctg agctgcctgt gcagcacctg cgcgctgtgc    120

cgcaaacgc                                                            129


<210> 5
<211> 29
<212> PRT
<213> T4-like viruses


<400> 5
Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr Val 
1               5                   10                  15      
Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu 
            20                  25                  

<210> 6
<211> 87
<212> DNA
<213> T4-like viruses

<220> 
<221> source
<222> 1..87
<223> /mol_type="unassigned DNA"
      /organism="T4-like viruses"

<400> 6
ggcagcggct atattccgga agcgccgcgc gatggccagg cgtatgtgcg caaagatggc     60

gaatgggtgc tgctgagcac ctttctg                                         87


<210> 7
<211> 1680
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..1680
<223> /mol_type="unassigned DNA"
      /note="Influenza virus HA fused to lamprey multimerizing
      shortened sequence"
      /organism="Artificial Sequence"

<400> 7
atgaaggcga tcctggtggt gctgctgtac acgttcgcga cggccaacgc ggatacgctg      60

tgcatcggct accacgcgaa caacagcacg gacaccgtgg acacggtgct cgagaagaac     120

gtgacggtga cgcacagcgt gaacctgctt gaggacaagc acaacggcaa gctgtgcaag     180

ctgcgtggcg tggctccgct gcacctgggc aagtgcaaca ttgctggctg gatcctgggc     240

aacccagagt gcgagagcct gagcacggcg tcgtcttgga gctacatcgt ggagacgccg     300

agcagcgaca acggcacgtg ctatccgggt gacttcatcg actacgaaga gctgcgcgag     360

cagctgtcgt cggtgagcag ctttgaacgc ttcgagattt tccccaagac gagcagctgg     420

ccgaaccacg actcgaacaa gggcgtgacg gctgcgtgtc cacacgctgg tgccaagagc     480

ttctacaaga acctgatctg gctggtgaag aagggcaaca gctacccgaa gctgagcaag     540

agctacatca acgacaaggg caaagaagtg ctcgtcctgt ggggcatcca ccacccgagc     600

acgagcgctg accagcagag cctgtaccag aacgccgaca cctacgtgtt cgtgggcagc     660

agccgctaca gcaagaagtt caagcccgag atcgcgattc gtccaaaggt gcgcgaccaa     720

gagggtcgca tgaactacta ctggacgctc gtggagccag gcgacaagat cacgttcgag     780

gcgacgggca acctggtcgt gccacgctac gccttcgcca tggaacgcaa cgctggcagc     840

ggcatcatca tcagcgacac gccagtgcac gactgcaaca cgacgtgcca gacgccgaag     900

ggtgcgatca acacgagcct gccgttccag aacatccacc cgatcacgat cggcaagtgc     960

ccgaagtacg tgaagagcac gaagctgcgc ctggcgacgg gtctgcgcaa catcccgagc    1020

atccagtctc gtggtctgtt tggcgctatc gctggcttca tcgagggtgg ctggacgggc    1080

atggtggacg gctggtacgg ctaccaccac cagaacgagc agggcagcgg ctacgctgcg    1140

gacctgaagt cgacgcagaa cgcgatcgac gagatcacga acaaggtgaa cagcgtgatc    1200

gagaagatga acacgcagtt cacggctgtg ggcaaagagt tcaaccacct tgagaagcgc    1260

atcgagaacc tgaacaagaa ggtggacgac ggcttcctgg acatctggac gtacaacgcg    1320

gagctgctgg tgctgcttga gaacgagcgc acgctggact accacgattc gaacgtgaag    1380

aacctctacg agaaggtgcg cagccagctg aagaacaacg cgaaagagat cggcaacggc    1440

tgcttcgagt tctaccacaa gtgcgacaac acgtgcatgg aaagcgtgaa gaacggcacg    1500

tacgactacc cgaagtactc ggaagaggcc aagctgaacc gcgaagagat cgacggcgtg    1560

aagcttgaga gcacgcgcat ctaccaggat tgcggcaaac cggcgtgcac caccctgctg    1620

aactgcgcga actttctgag ctgcctgtgc agcacctgcg cgctgtgccg caaacgctag    1680


<210> 8
<211> 1719
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..1719
<223> /mol_type="unassigned DNA"
      /note="Influenza virus HA ectodomain fused to lamprey multimering
      sequence"
      /organism="Artificial Sequence"

<400> 8
atgaaggcga tcctggtggt gctgctgtac acgttcgcga cggccaacgc ggatacgctg      60

tgcatcggct accacgcgaa caacagcacg gacaccgtgg acacggtgct cgagaagaac     120

gtgacggtga cgcacagcgt gaacctgctt gaggacaagc acaacggcaa gctgtgcaag     180

ctgcgtggcg tggctccgct gcacctgggc aagtgcaaca ttgctggctg gatcctgggc     240

aacccagagt gcgagagcct gagcacggcg tcgtcttgga gctacatcgt ggagacgccg     300

agcagcgaca acggcacgtg ctatccgggt gacttcatcg actacgaaga gctgcgcgag     360

cagctgtcgt cggtgagcag ctttgaacgc ttcgagattt tccccaagac gagcagctgg     420

ccgaaccacg actcgaacaa gggcgtgacg gctgcgtgtc cacacgctgg tgccaagagc     480

ttctacaaga acctgatctg gctggtgaag aagggcaaca gctacccgaa gctgagcaag     540

agctacatca acgacaaggg caaagaagtg ctcgtcctgt ggggcatcca ccacccgagc     600

acgagcgctg accagcagag cctgtaccag aacgccgaca cctacgtgtt cgtgggcagc     660

agccgctaca gcaagaagtt caagcccgag atcgcgattc gtccaaaggt gcgcgaccaa     720

gagggtcgca tgaactacta ctggacgctc gtggagccag gcgacaagat cacgttcgag     780

gcgacgggca acctggtcgt gccacgctac gccttcgcca tggaacgcaa cgctggcagc     840

ggcatcatca tcagcgacac gccagtgcac gactgcaaca cgacgtgcca gacgccgaag     900

ggtgcgatca acacgagcct gccgttccag aacatccacc cgatcacgat cggcaagtgc     960

ccgaagtacg tgaagagcac gaagctgcgc ctggcgacgg gtctgcgcaa catcccgagc    1020

atccagtctc gtggtctgtt tggcgctatc gctggcttca tcgagggtgg ctggacgggc    1080

atggtggacg gctggtacgg ctaccaccac cagaacgagc agggcagcgg ctacgctgcg    1140

gacctgaagt cgacgcagaa cgcgatcgac gagatcacga acaaggtgaa cagcgtgatc    1200

gagaagatga acacgcagtt cacggctgtg ggcaaagagt tcaaccacct tgagaagcgc    1260

atcgagaacc tgaacaagaa ggtggacgac ggcttcctgg acatctggac gtacaacgcg    1320

gagctgctgg tgctgcttga gaacgagcgc acgctggact accacgattc gaacgtgaag    1380

aacctctacg agaaggtgcg cagccagctg aagaacaacg cgaaagagat cggcaacggc    1440

tgcttcgagt tctaccacaa gtgcgacaac acgtgcatgg aaagcgtgaa gaacggcacg    1500

tacgactacc cgaagtactc ggaagaggcc aagctgaacc gcgaagagat cgacggcgtg    1560

aagcttgaga gcacgcgcat ctaccagaac tgcaccagca ttcaggaacg caaaaacgat    1620

ggcggcgatt gcggcaaacc ggcgtgcacc accctgctga actgcgcgaa ctttctgagc    1680

tgcctgtgca gcacctgcgc gctgtgccgc aaacgctag                           1719


<210> 9
<211> 1677
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..1677
<223> /mol_type="unassigned DNA"
      /note="Influenza virus HA ectodomain fused T4 foldon sequence"
      /organism="Artificial Sequence"

<400> 9
atgaaggcga tcctggtggt gctgctgtac acgttcgcga cggccaacgc ggatacgctg      60

tgcatcggct accacgcgaa caacagcacg gacaccgtgg acacggtgct cgagaagaac     120

gtgacggtga cgcacagcgt gaacctgctt gaggacaagc acaacggcaa gctgtgcaag     180

ctgcgtggcg tggctccgct gcacctgggc aagtgcaaca ttgctggctg gatcctgggc     240

aacccagagt gcgagagcct gagcacggcg tcgtcttgga gctacatcgt ggagacgccg     300

agcagcgaca acggcacgtg ctatccgggt gacttcatcg actacgaaga gctgcgcgag     360

cagctgtcgt cggtgagcag ctttgaacgc ttcgagattt tccccaagac gagcagctgg     420

ccgaaccacg actcgaacaa gggcgtgacg gctgcgtgtc cacacgctgg tgccaagagc     480

ttctacaaga acctgatctg gctggtgaag aagggcaaca gctacccgaa gctgagcaag     540

agctacatca acgacaaggg caaagaagtg ctcgtcctgt ggggcatcca ccacccgagc     600

acgagcgctg accagcagag cctgtaccag aacgccgaca cctacgtgtt cgtgggcagc     660

agccgctaca gcaagaagtt caagcccgag atcgcgattc gtccaaaggt gcgcgaccaa     720

gagggtcgca tgaactacta ctggacgctc gtggagccag gcgacaagat cacgttcgag     780

gcgacgggca acctggtcgt gccacgctac gccttcgcca tggaacgcaa cgctggcagc     840

ggcatcatca tcagcgacac gccagtgcac gactgcaaca cgacgtgcca gacgccgaag     900

ggtgcgatca acacgagcct gccgttccag aacatccacc cgatcacgat cggcaagtgc     960

ccgaagtacg tgaagagcac gaagctgcgc ctggcgacgg gtctgcgcaa catcccgagc    1020

atccagtctc gtggtctgtt tggcgctatc gctggcttca tcgagggtgg ctggacgggc    1080

atggtggacg gctggtacgg ctaccaccac cagaacgagc agggcagcgg ctacgctgcg    1140

gacctgaagt cgacgcagaa cgcgatcgac gagatcacga acaaggtgaa cagcgtgatc    1200

gagaagatga acacgcagtt cacggctgtg ggcaaagagt tcaaccacct tgagaagcgc    1260

atcgagaacc tgaacaagaa ggtggacgac ggcttcctgg acatctggac gtacaacgcg    1320

gagctgctgg tgctgcttga gaacgagcgc acgctggact accacgattc gaacgtgaag    1380

aacctctacg agaaggtgcg cagccagctg aagaacaacg cgaaagagat cggcaacggc    1440

tgcttcgagt tctaccacaa gtgcgacaac acgtgcatgg aaagcgtgaa gaacggcacg    1500

tacgactacc cgaagtactc ggaagaggcc aagctgaacc gcgaagagat cgacggcgtg    1560

aagcttgaga gcacgcgcat ctaccagggc agcggctata ttccggaagc gccgcgcgat    1620

ggccaggcgt atgtgcgcaa agatggcgaa tgggtgctgc tgagcacctt tctgtag       1677


<210> 10
<211> 1587
<212> DNA
<213> Influenza A virus

<220> 
<221> source
<222> 1..1587
<223> /mol_type="unassigned DNA"
      /organism="Influenza A virus"

<400> 10
atgaaggcga tcctggtggt gctgctgtac acgttcgcga cggccaacgc ggatacgctg      60

tgcatcggct accacgcgaa caacagcacg gacaccgtgg acacggtgct cgagaagaac     120

gtgacggtga cgcacagcgt gaacctgctt gaggacaagc acaacggcaa gctgtgcaag     180

ctgcgtggcg tggctccgct gcacctgggc aagtgcaaca ttgctggctg gatcctgggc     240

aacccagagt gcgagagcct gagcacggcg tcgtcttgga gctacatcgt ggagacgccg     300

agcagcgaca acggcacgtg ctatccgggt gacttcatcg actacgaaga gctgcgcgag     360

cagctgtcgt cggtgagcag ctttgaacgc ttcgagattt tccccaagac gagcagctgg     420

ccgaaccacg actcgaacaa gggcgtgacg gctgcgtgtc cacacgctgg tgccaagagc     480

ttctacaaga acctgatctg gctggtgaag aagggcaaca gctacccgaa gctgagcaag     540

agctacatca acgacaaggg caaagaagtg ctcgtcctgt ggggcatcca ccacccgagc     600

acgagcgctg accagcagag cctgtaccag aacgccgaca cctacgtgtt cgtgggcagc     660

agccgctaca gcaagaagtt caagcccgag atcgcgattc gtccaaaggt gcgcgaccaa     720

gagggtcgca tgaactacta ctggacgctc gtggagccag gcgacaagat cacgttcgag     780

gcgacgggca acctggtcgt gccacgctac gccttcgcca tggaacgcaa cgctggcagc     840

ggcatcatca tcagcgacac gccagtgcac gactgcaaca cgacgtgcca gacgccgaag     900

ggtgcgatca acacgagcct gccgttccag aacatccacc cgatcacgat cggcaagtgc     960

ccgaagtacg tgaagagcac gaagctgcgc ctggcgacgg gtctgcgcaa catcccgagc    1020

atccagtctc gtggtctgtt tggcgctatc gctggcttca tcgagggtgg ctggacgggc    1080

atggtggacg gctggtacgg ctaccaccac cagaacgagc agggcagcgg ctacgctgcg    1140

gacctgaagt cgacgcagaa cgcgatcgac gagatcacga acaaggtgaa cagcgtgatc    1200

gagaagatga acacgcagtt cacggctgtg ggcaaagagt tcaaccacct tgagaagcgc    1260

atcgagaacc tgaacaagaa ggtggacgac ggcttcctgg acatctggac gtacaacgcg    1320

gagctgctgg tgctgcttga gaacgagcgc acgctggact accacgattc gaacgtgaag    1380

aacctctacg agaaggtgcg cagccagctg aagaacaacg cgaaagagat cggcaacggc    1440

tgcttcgagt tctaccacaa gtgcgacaac acgtgcatgg aaagcgtgaa gaacggcacg    1500

tacgactacc cgaagtactc ggaagaggcc aagctgaacc gcgaagagat cgacggcgtg    1560

aagcttgaga gcacgcgcat ctaccag                                        1587


<210> 11
<211> 529
<212> PRT
<213> Influenza A virus


<400> 11
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn 
1               5                   10                  15      
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr 
            20                  25                  30          
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn 
        35                  40                  45              
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val 
    50                  55                  60                  
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly 
65                  70                  75                  80  
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile 
                85                  90                  95      
Val Glu Thr Pro Ser Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe 
            100                 105                 110         
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe 
        115                 120                 125             
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp 
    130                 135                 140                 
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser 
145                 150                 155                 160 
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro 
                165                 170                 175     
Lys Leu Ser Lys Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val 
            180                 185                 190         
Leu Trp Gly Ile His His Pro Ser Thr Ser Ala Asp Gln Gln Ser Leu 
        195                 200                 205             
Tyr Gln Asn Ala Asp Thr Tyr Val Phe Val Gly Ser Ser Arg Tyr Ser 
    210                 215                 220                 
Lys Lys Phe Lys Pro Glu Ile Ala Ile Arg Pro Lys Val Arg Asp Gln 
225                 230                 235                 240 
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys 
                245                 250                 255     
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe 
            260                 265                 270         
Ala Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro 
        275                 280                 285             
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn 
    290                 295                 300                 
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys 
305                 310                 315                 320 
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg 
                325                 330                 335     
Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly 
            340                 345                 350         
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr 
        355                 360                 365             
His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser 
    370                 375                 380                 
Thr Gln Asn Ala Ile Asp Glu Ile Thr Asn Lys Val Asn Ser Val Ile 
385                 390                 395                 400 
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His 
                405                 410                 415     
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe 
            420                 425                 430         
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn 
        435                 440                 445             
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu 
    450                 455                 460                 
Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly 
465                 470                 475                 480 
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser Val 
                485                 490                 495     
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu 
            500                 505                 510         
Asn Arg Glu Glu Ile Asp Gly Val Lys Leu Glu Ser Thr Arg Ile Tyr 
        515                 520                 525             
Gln 
    

<210> 12
<211> 1590
<212> DNA
<213> Influenza A virus

<220> 
<221> source
<222> 1..1590
<223> /mol_type="unassigned DNA"
      /note="Influenza virus HA ectodomain optimized for codon usage in
      CHO"
      /organism="Influenza A virus"

<400> 12
atgaaggcca tcctggtggt gctgctgtac accttcgcca ccgccaacgc cgacaccctg      60

tgcatcggct accacgccaa caactccacc gacaccgtgg ataccgtgct ggaaaagaac     120

gtgaccgtga cccactccgt gaacctgctg gaagataagc acaacggcaa gctgtgcaag     180

ctgcggggcg tggcccctct gcacctgggc aagtgtaata tcgccggctg gatcctgggc     240

aaccccgagt gcgagtccct gtccaccgcc tccagctggt cctacatcgt ggaaaccccc     300

tccagcgaca acggcacctg ttaccccggc gacttcatcg actacgagga actgcgcgag     360

cagctgtcct ccgtgtccag cttcgagaga ttcgagatct tccccaagac ctcctcctgg     420

cccaaccacg actccaacaa gggcgtgacc gccgcctgtc ctcacgctgg cgccaagtcc     480

ttctacaaga acctgatctg gctggtgaaa aagggcaact cctaccccaa gctgtccaag     540

tcctacatca acgacaaggg caaagaggtg ctggtgctgt ggggcatcca ccacccttcc     600

acctccgccg accagcagtc cctgtaccag aacgccgata cctacgtgtt cgtgggctcc     660

tcccggtact ccaagaagtt caagcccgag atcgccatcc ggcccaaagt gcgggaccag     720

gaaggccgga tgaactacta ctggaccctg gtggaacccg gcgacaagat caccttcgag     780

gccaccggca atctggtggt gcccagatac gccttcgcca tggaacggaa cgccggctcc     840

ggcatcatca tctccgacac ccccgtgcac gactgcaaca ccacctgtca gacccccaag     900

ggcgccatca acacctccct gcccttccag aacatccacc ccatcaccat cggcaagtgc     960

cccaaatacg tgaagtccac caagctgcgg ctggctaccg gcctgcggaa catcccctcc    1020

atccagtctc ggggcctgtt cggcgctatc gctggcttca tcgagggcgg ctggaccggc    1080

atggtggacg gttggtacgg ctaccaccac cagaacgagc agggctccgg ctacgccgcc    1140

gacctgaagt ctacccagaa cgccatcgac gagatcacca acaaagtgaa ctccgtgatc    1200

gagaagatga acacccagtt caccgccgtg ggcaaagagt tcaaccacct ggaaaagcgg    1260

atcgagaacc tgaacaagaa ggtggacgac ggcttcctgg acatctggac ctacaacgcc    1320

gagctgctgg tgctgctgga aaacgagcgg accctggact accacgacag caacgtgaag    1380

aacctgtacg agaaagtgcg gtcccagctg aagaacaacg ccaaagagat cggcaacggc    1440

tgcttcgagt tctaccacaa gtgcgacaac acctgtatgg aatccgtgaa gaacggcacc    1500

tacgactacc ccaagtactc cgaggaagcc aagctgaacc gggaagagat cgacggcgtg    1560

aagctggaat ccacccggat ctatcagtga                                     1590


<210> 13
<211> 1680
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..1680
<223> /mol_type="unassigned DNA"
      /note="Influenza virus HA ectodomain fused to lamprey
      multimerizing shortened sequence, optimized for codon usage in
      CHO"
      /organism="Artificial Sequence"

<400> 13
atgaaggcca tcctggtggt gctgctgtac accttcgcca ccgccaacgc cgacaccctg      60

tgcatcggct accacgccaa caactccacc gacaccgtgg ataccgtgct ggaaaagaac     120

gtgaccgtga cccactccgt gaacctgctg gaagataagc acaacggcaa gctgtgcaag     180

ctgcggggcg tggcccctct gcacctgggc aagtgtaata tcgccggctg gatcctgggc     240

aaccccgagt gcgagtccct gtccaccgcc tccagctggt cctacatcgt ggaaaccccc     300

tccagcgaca acggcacctg ttaccccggc gacttcatcg actacgagga actgcgcgag     360

cagctgtcct ccgtgtccag cttcgagaga ttcgagatct tccccaagac ctcctcctgg     420

cccaaccacg actccaacaa gggcgtgacc gccgcctgtc ctcacgctgg cgccaagtcc     480

ttctacaaga acctgatctg gctggtgaaa aagggcaact cctaccccaa gctgtccaag     540

tcctacatca acgacaaggg caaagaggtg ctggtgctgt ggggcatcca ccacccttcc     600

acctccgccg accagcagtc cctgtaccag aacgccgata cctacgtgtt cgtgggctcc     660

tcccggtact ccaagaagtt caagcccgag atcgccatcc ggcccaaagt gcgggaccag     720

gaaggccgga tgaactacta ctggaccctg gtggaacccg gcgacaagat caccttcgag     780

gccaccggca atctggtggt gcccagatac gccttcgcca tggaacggaa cgccggctcc     840

ggcatcatca tctccgacac ccccgtgcac gactgcaaca ccacctgtca gacccccaag     900

ggcgccatca acacctccct gcccttccag aacatccacc ccatcaccat cggcaagtgc     960

cccaaatacg tgaagtccac caagctgcgg ctggctaccg gcctgcggaa catcccctcc    1020

atccagtctc ggggcctgtt cggcgctatc gctggcttca tcgagggcgg ctggaccggc    1080

atggtggacg gttggtacgg ctaccaccac cagaacgagc agggctccgg ctacgccgcc    1140

gacctgaagt ctacccagaa cgccatcgac gagatcacca acaaagtgaa ctccgtgatc    1200

gagaagatga acacccagtt caccgccgtg ggcaaagagt tcaaccacct ggaaaagcgg    1260

atcgagaacc tgaacaagaa ggtggacgac ggcttcctgg acatctggac ctacaacgcc    1320

gagctgctgg tgctgctgga aaacgagcgg accctggact accacgacag caacgtgaag    1380

aacctgtacg agaaagtgcg gtcccagctg aagaacaacg ccaaagagat cggcaacggc    1440

tgcttcgagt tctaccacaa gtgcgacaac acctgtatgg aatccgtgaa gaacggcacc    1500

tacgactacc ccaagtactc cgaggaagcc aagctgaacc gggaagagat cgacggcgtg    1560

aagctggaat ccacccggat ctaccaggac tgcggcaagc ccgcctgcac caccctgctg    1620

aactgcgcca acttcctgtc ctgcctgtgc tctacctgcg ccctgtgccg gaagagatga    1680


<210> 14
<211> 1719
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..1719
<223> /mol_type="unassigned DNA"
      /note="Influenza virus HA ectodomain fused to lamprey
      multimerizing long sequence, optimized for codon usage in CHO"
      /organism="Artificial Sequence"

<400> 14
atgaaggcca tcctggtggt gctgctgtac accttcgcca ccgccaacgc cgacaccctg      60

tgcatcggct accacgccaa caactccacc gacaccgtgg ataccgtgct ggaaaagaac     120

gtgaccgtga cccactccgt gaacctgctg gaagataagc acaacggcaa gctgtgcaag     180

ctgcggggcg tggcccctct gcacctgggc aagtgtaata tcgccggctg gatcctgggc     240

aaccccgagt gcgagtccct gtccaccgcc tccagctggt cctacatcgt ggaaaccccc     300

tccagcgaca acggcacctg ttaccccggc gacttcatcg actacgagga actgcgcgag     360

cagctgtcct ccgtgtccag cttcgagaga ttcgagatct tccccaagac ctcctcctgg     420

cccaaccacg actccaacaa gggcgtgacc gccgcctgtc ctcacgctgg cgccaagtcc     480

ttctacaaga acctgatctg gctggtgaaa aagggcaact cctaccccaa gctgtccaag     540

tcctacatca acgacaaggg caaagaggtg ctggtgctgt ggggcatcca ccacccttcc     600

acctccgccg accagcagtc cctgtaccag aacgccgata cctacgtgtt cgtgggctcc     660

tcccggtact ccaagaagtt caagcccgag atcgccatcc ggcccaaagt gcgggaccag     720

gaaggccgga tgaactacta ctggaccctg gtggaacccg gcgacaagat caccttcgag     780

gccaccggca atctggtggt gcccagatac gccttcgcca tggaacggaa cgccggctcc     840

ggcatcatca tctccgacac ccccgtgcac gactgcaaca ccacctgtca gacccccaag     900

ggcgccatca acacctccct gcccttccag aacatccacc ccatcaccat cggcaagtgc     960

cccaaatacg tgaagtccac caagctgcgg ctggctaccg gcctgcggaa catcccctcc    1020

atccagtctc ggggcctgtt cggcgctatc gctggcttca tcgagggcgg ctggaccggc    1080

atggtggacg gttggtacgg ctaccaccac cagaacgagc agggctccgg ctacgccgcc    1140

gacctgaagt ctacccagaa cgccatcgac gagatcacca acaaagtgaa ctccgtgatc    1200

gagaagatga acacccagtt caccgccgtg ggcaaagagt tcaaccacct ggaaaagcgg    1260

atcgagaacc tgaacaagaa ggtggacgac ggcttcctgg acatctggac ctacaacgcc    1320

gagctgctgg tgctgctgga aaacgagcgg accctggact accacgacag caacgtgaag    1380

aacctgtacg agaaagtgcg gtcccagctg aagaacaacg ccaaagagat cggcaacggc    1440

tgcttcgagt tctaccacaa gtgcgacaac acctgtatgg aatccgtgaa gaacggcacc    1500

tacgactacc ccaagtactc cgaggaagcc aagctgaacc gggaagagat cgacggcgtg    1560

aagctggaat ccacccggat ctaccagaac tgcaccagca tccaggaacg gaagaacgac    1620

ggcggcgact gcggcaagcc tgcctgcacc accctgctga actgcgccaa cttcctgtcc    1680

tgcctgtgct ctacctgcgc cctgtgccgg aagagatga                           1719


<210> 15
<211> 1677
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..1677
<223> /mol_type="unassigned DNA"
      /note="Influenza virus HA ectodomain fused to T4 foldon
      multimerizing sequence, optimized for codon usage in CHO"
      /organism="Artificial Sequence"

<400> 15
atgaaggcca tcctggtggt gctgctgtac accttcgcca ccgccaacgc cgacaccctg      60

tgcatcggct accacgccaa caactccacc gacaccgtgg ataccgtgct ggaaaagaac     120

gtgaccgtga cccactccgt gaacctgctg gaagataagc acaacggcaa gctgtgcaag     180

ctgcggggcg tggcccctct gcacctgggc aagtgtaata tcgccggctg gatcctgggc     240

aaccccgagt gcgagtccct gtccaccgcc tccagctggt cctacatcgt ggaaaccccc     300

tccagcgaca acggcacctg ttaccccggc gacttcatcg actacgagga actgcgcgag     360

cagctgtcct ccgtgtccag cttcgagaga ttcgagatct tccccaagac ctcctcctgg     420

cccaaccacg actccaacaa gggcgtgacc gccgcctgtc ctcacgctgg cgccaagtcc     480

ttctacaaga acctgatctg gctggtgaaa aagggcaact cctaccccaa gctgtccaag     540

tcctacatca acgacaaggg caaagaggtg ctggtgctgt ggggcatcca ccacccttcc     600

acctccgccg accagcagtc cctgtaccag aacgccgata cctacgtgtt cgtgggctcc     660

tcccggtact ccaagaagtt caagcccgag atcgccatcc ggcccaaagt gcgggaccag     720

gaaggccgga tgaactacta ctggaccctg gtggaacccg gcgacaagat caccttcgag     780

gccaccggca atctggtggt gcccagatac gccttcgcca tggaacggaa cgccggctcc     840

ggcatcatca tctccgacac ccccgtgcac gactgcaaca ccacctgtca gacccccaag     900

ggcgccatca acacctccct gcccttccag aacatccacc ccatcaccat cggcaagtgc     960

cccaaatacg tgaagtccac caagctgcgg ctggctaccg gcctgcggaa catcccctcc    1020

atccagtctc ggggcctgtt cggcgctatc gctggcttca tcgagggcgg ctggaccggc    1080

atggtggacg gttggtacgg ctaccaccac cagaacgagc agggctccgg ctacgccgcc    1140

gacctgaagt ctacccagaa cgccatcgac gagatcacca acaaagtgaa ctccgtgatc    1200

gagaagatga acacccagtt caccgccgtg ggcaaagagt tcaaccacct ggaaaagcgg    1260

atcgagaacc tgaacaagaa ggtggacgac ggcttcctgg acatctggac ctacaacgcc    1320

gagctgctgg tgctgctgga aaacgagcgg accctggact accacgacag caacgtgaag    1380

aacctgtacg agaaagtgcg gtcccagctg aagaacaacg ccaaagagat cggcaacggc    1440

tgcttcgagt tctaccacaa gtgcgacaac acctgtatgg aatccgtgaa gaacggcacc    1500

tacgactacc ccaagtactc cgaggaagcc aagctgaacc gggaagagat cgacggcgtg    1560

aagctggaat ccacccggat ctaccagggc agcggctaca tccctgaggc ccccagagat    1620

ggccaggcct acgtgcggaa ggacggcgag tgggtgctgc tgagcacatt tctgtga       1677


<210> 16
<211> 996
<212> DNA
<213> Shigella flexneri 2a str. 301

<220> 
<221> source
<222> 1..996
<223> /mol_type="unassigned DNA"
      /note="IpaD sequence optimized for codon usage in E. coli"
      /organism="Shigella flexneri 2a str. 301"

<400> 16
atgaatatta ccaccctgac caatagcatt agcaccagca gctttagccc gaataatacc      60

aatggtagca gcaccgaaac cgttaatagc gatattaaaa ccaccacctc tagccatccg     120

gttagcagcc tgaccatgct gaatgatacc ctgcataata ttcgtaccac caatcaggca     180

ctgaaaaaag aactgagcca gaaaaccctg accaaaacca gcctggaaga aattgcactg     240

catagcagcc agattagcat ggatgttaat aaaagcgcac agctgctgga tattctgtct     300

cgccatgaat atccgattaa taaagatgca cgcgaactgc tgcatagcgc accgaaagaa     360

gcagaactgg acggcgatca gatgattagc catcgtgaac tgtgggcaaa aattgcgaat     420

agcattaatg atattaatga acagtatctg aaagtgtatg aacatgccgt tagcagctat     480

acccagatgt atcaggattt ttctgccgtt ttaagctctc tggctggctg gatttctccg     540

ggtggtaatg atggtaatag cgtgaaactg caggttaata gcctgaaaaa agccctggaa     600

gaactgaaag aaaaatataa agataaaccg ctgtatccgg ctaataatac cgttagccaa     660

gaacaggcaa ataaatggct gaccgaactg ggtggcacca ttggtaaagt gtctcagaaa     720

aatggtggtt atgtggtgag cattaatatg accccgattg ataatatgct gaaaagcctg     780

gataatctgg gtggtaatgg tgaagttgtt ctggataatg ccaaatatca ggcatggaat     840

gccggtttta gcgccgaaga tgaaaccatg aaaaataatc tgcagaccct ggttcagaaa     900

tatagcaatg ccaatagcat ttttgataat ctggtgaaag ttctgtctag caccattagc     960

agctgtaccg ataccgataa actgtttctg catttt                               996


<210> 17
<211> 1125
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..1125
<223> /mol_type="unassigned DNA"
      /note="Shigella IpaD fused to lamprey multimerizing long
      sequence, optimized for codon usage in E. coli"
      /organism="Artificial Sequence"

<400> 17
atgaatatta ccaccctgac caatagcatt agcaccagca gctttagccc gaataatacc      60

aatggtagca gcaccgaaac cgttaatagc gatattaaaa ccaccacctc tagccatccg     120

gttagcagcc tgaccatgct gaatgatacc ctgcataata ttcgtaccac caatcaggca     180

ctgaaaaaag aactgagcca gaaaaccctg accaaaacca gcctggaaga aattgcactg     240

catagcagcc agattagcat ggatgttaat aaaagcgcac agctgctgga tattctgtct     300

cgccatgaat atccgattaa taaagatgca cgcgaactgc tgcatagcgc accgaaagaa     360

gcagaactgg acggcgatca gatgattagc catcgtgaac tgtgggcaaa aattgcgaat     420

agcattaatg atattaatga acagtatctg aaagtgtatg aacatgccgt tagcagctat     480

acccagatgt atcaggattt ttctgccgtt ttaagctctc tggctggctg gatttctccg     540

ggtggtaatg atggtaatag cgtgaaactg caggttaata gcctgaaaaa agccctggaa     600

gaactgaaag aaaaatataa agataaaccg ctgtatccgg ctaataatac cgttagccaa     660

gaacaggcaa ataaatggct gaccgaactg ggtggcacca ttggtaaagt gtctcagaaa     720

aatggtggtt atgtggtgag cattaatatg accccgattg ataatatgct gaaaagcctg     780

gataatctgg gtggtaatgg tgaagttgtt ctggataatg ccaaatatca ggcatggaat     840

gccggtttta gcgccgaaga tgaaaccatg aaaaataatc tgcagaccct ggttcagaaa     900

tatagcaatg ccaatagcat ttttgataat ctggtgaaag ttctgtctag caccattagc     960

agctgtaccg ataccgataa actgtttctg cattttaatt gtaccagcat tcaagagcgc    1020

aaaaatgatg gtggtgattg tggtaaaccg gcatgtacca ccctgctgaa ttgtgcaaat    1080

tttctgagct gtctgtgtag cacctgtgca ctgtgtcgta aacgt                    1125


<210> 18
<211> 375
<212> PRT
<213> Artificial Sequence

<220> 
<223> Shigella IpaD fused to lamprey multimerizing long sequence

<400> 18
Met Asn Ile Thr Thr Leu Thr Asn Ser Ile Ser Thr Ser Ser Phe Ser 
1               5                   10                  15      
Pro Asn Asn Thr Asn Gly Ser Ser Thr Glu Thr Val Asn Ser Asp Ile 
            20                  25                  30          
Lys Thr Thr Thr Ser Ser His Pro Val Ser Ser Leu Thr Met Leu Asn 
        35                  40                  45              
Asp Thr Leu His Asn Ile Arg Thr Thr Asn Gln Ala Leu Lys Lys Glu 
    50                  55                  60                  
Leu Ser Gln Lys Thr Leu Thr Lys Thr Ser Leu Glu Glu Ile Ala Leu 
65                  70                  75                  80  
His Ser Ser Gln Ile Ser Met Asp Val Asn Lys Ser Ala Gln Leu Leu 
                85                  90                  95      
Asp Ile Leu Ser Arg His Glu Tyr Pro Ile Asn Lys Asp Ala Arg Glu 
            100                 105                 110         
Leu Leu His Ser Ala Pro Lys Glu Ala Glu Leu Asp Gly Asp Gln Met 
        115                 120                 125             
Ile Ser His Arg Glu Leu Trp Ala Lys Ile Ala Asn Ser Ile Asn Asp 
    130                 135                 140                 
Ile Asn Glu Gln Tyr Leu Lys Val Tyr Glu His Ala Val Ser Ser Tyr 
145                 150                 155                 160 
Thr Gln Met Tyr Gln Asp Phe Ser Ala Val Leu Ser Ser Leu Ala Gly 
                165                 170                 175     
Trp Ile Ser Pro Gly Gly Asn Asp Gly Asn Ser Val Lys Leu Gln Val 
            180                 185                 190         
Asn Ser Leu Lys Lys Ala Leu Glu Glu Leu Lys Glu Lys Tyr Lys Asp 
        195                 200                 205             
Lys Pro Leu Tyr Pro Ala Asn Asn Thr Val Ser Gln Glu Gln Ala Asn 
    210                 215                 220                 
Lys Trp Leu Thr Glu Leu Gly Gly Thr Ile Gly Lys Val Ser Gln Lys 
225                 230                 235                 240 
Asn Gly Gly Tyr Val Val Ser Ile Asn Met Thr Pro Ile Asp Asn Met 
                245                 250                 255     
Leu Lys Ser Leu Asp Asn Leu Gly Gly Asn Gly Glu Val Val Leu Asp 
            260                 265                 270         
Asn Ala Lys Tyr Gln Ala Trp Asn Ala Gly Phe Ser Ala Glu Asp Glu 
        275                 280                 285             
Thr Met Lys Asn Asn Leu Gln Thr Leu Val Gln Lys Tyr Ser Asn Ala 
    290                 295                 300                 
Asn Ser Ile Phe Asp Asn Leu Val Lys Val Leu Ser Ser Thr Ile Ser 
305                 310                 315                 320 
Ser Cys Thr Asp Thr Asp Lys Leu Phe Leu His Phe Asn Cys Thr Ser 
                325                 330                 335     
Ile Gln Glu Arg Lys Asn Asp Gly Gly Asp Cys Gly Lys Pro Ala Cys 
            340                 345                 350         
Thr Thr Leu Leu Asn Cys Ala Asn Phe Leu Ser Cys Leu Cys Ser Thr 
        355                 360                 365             
Cys Ala Leu Cys Arg Lys Arg 
    370                 375 

<210> 19
<211> 1032
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..1032
<223> /mol_type="unassigned DNA"
      /note="Shigella IpaD fused to a His-tag, optimized for codon
      usage in E. coli"
      /organism="Artificial Sequence"

<400> 19
atgaatatta ccaccctgac caatagcatt agcaccagca gctttagccc gaataatacc      60

aatggtagca gcaccgaaac cgttaatagc gatattaaaa ccaccacctc tagccatccg     120

gttagcagcc tgaccatgct gaatgatacc ctgcataata ttcgtaccac caatcaggca     180

ctgaaaaaag aactgagcca gaaaaccctg accaaaacca gcctggaaga aattgcactg     240

catagcagcc agattagcat ggatgttaat aaaagcgcac agctgctgga tattctgtct     300

cgccatgaat atccgattaa taaagatgca cgcgaactgc tgcatagcgc accgaaagaa     360

gcagaactgg acggcgatca gatgattagc catcgtgaac tgtgggcaaa aattgcgaat     420

agcattaatg atattaatga acagtatctg aaagtgtatg aacatgccgt tagcagctat     480

acccagatgt atcaggattt ttctgccgtt ttaagctctc tggctggctg gatttctccg     540

ggtggtaatg atggtaatag cgtgaaactg caggttaata gcctgaaaaa agccctggaa     600

gaactgaaag aaaaatataa agataaaccg ctgtatccgg ctaataatac cgttagccaa     660

gaacaggcaa ataaatggct gaccgaactg ggtggcacca ttggtaaagt gtctcagaaa     720

aatggtggtt atgtggtgag cattaatatg accccgattg ataatatgct gaaaagcctg     780

gataatctgg gtggtaatgg tgaagttgtt ctggataatg ccaaatatca ggcatggaat     840

gccggtttta gcgccgaaga tgaaaccatg aaaaataatc tgcagaccct ggttcagaaa     900

tatagcaatg ccaatagcat ttttgataat ctggtgaaag ttctgtctag caccattagc     960

agctgtaccg ataccgataa actgtttctg cattttggtg gtagcctcga gcaccaccac    1020

caccaccact ga                                                        1032


<210> 20
<211> 1158
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..1158
<223> /mol_type="unassigned DNA"
      /note="Shigella IpaD fused to lamprey multimerizing long sequence
      and to a His-tag, optimized for codon usage in E. coli"
      /organism="Artificial Sequence"

<400> 20
atgaatatta ccaccctgac caatagcatt agcaccagca gctttagccc gaataatacc      60

aatggtagca gcaccgaaac cgttaatagc gatattaaaa ccaccacctc tagccatccg     120

gttagcagcc tgaccatgct gaatgatacc ctgcataata ttcgtaccac caatcaggca     180

ctgaaaaaag aactgagcca gaaaaccctg accaaaacca gcctggaaga aattgcactg     240

catagcagcc agattagcat ggatgttaat aaaagcgcac agctgctgga tattctgtct     300

cgccatgaat atccgattaa taaagatgca cgcgaactgc tgcatagcgc accgaaagaa     360

gcagaactgg acggcgatca gatgattagc catcgtgaac tgtgggcaaa aattgcgaat     420

agcattaatg atattaatga acagtatctg aaagtgtatg aacatgccgt tagcagctat     480

acccagatgt atcaggattt ttctgccgtt ttaagctctc tggctggctg gatttctccg     540

ggtggtaatg atggtaatag cgtgaaactg caggttaata gcctgaaaaa agccctggaa     600

gaactgaaag aaaaatataa agataaaccg ctgtatccgg ctaataatac cgttagccaa     660

gaacaggcaa ataaatggct gaccgaactg ggtggcacca ttggtaaagt gtctcagaaa     720

aatggtggtt atgtggtgag cattaatatg accccgattg ataatatgct gaaaagcctg     780

gataatctgg gtggtaatgg tgaagttgtt ctggataatg ccaaatatca ggcatggaat     840

gccggtttta gcgccgaaga tgaaaccatg aaaaataatc tgcagaccct ggttcagaaa     900

tatagcaatg ccaatagcat ttttgataat ctggtgaaag ttctgtctag caccattagc     960

agctgtaccg ataccgataa actgtttctg cattttaatt gtaccagcat tcaagagcgc    1020

aaaaatgatg gtggtgattg tggtaaaccg gcatgtacca ccctgctgaa ttgtgcaaat    1080

tttctgagct gtctgtgtag cacctgtgca ctgtgtcgta aacgtggtgg tagcctcgag    1140

caccaccacc accaccac                                                  1158


<210> 21
<211> 249
<212> DNA
<213> Shigella flexneri 2a str. 301

<220> 
<221> source
<222> 1..249
<223> /mol_type="unassigned DNA"
      /note="MxiH sequence, optimized for codon usage in E. coli"
      /organism="Shigella flexneri 2a str. 301"

<400> 21
atgagtgtta ccgttccgaa tgatgattgg accctgagca gcctgagcga aacctttgat     60

gatggcaccc agacactgca gggtgaactg accctggcac tggataaact ggcaaaaaat    120

ccgagcaatc cgcagctgct ggcagaatat cagagcaaac tgagcgaata taccctgtat    180

cgtaatgcac agagcaatac cgtgaaagtg attaaagatg ttgatgcagc catcatccag    240

aactttcgt                                                            249


<210> 22
<211> 378
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..378
<223> /mol_type="unassigned DNA"
      /note="Shigella MxiH fused to lamprey multimerizing long
      sequence, optimized for codon usage in E. coli"
      /organism="Artificial Sequence"

<400> 22
atgagcgtta ccgttccgaa tgatgattgg accctgagca gcctgagcga aacctttgat      60

gatggcaccc agacactgca gggtgaactg accctggcac tggataaact ggcaaaaaat     120

ccgagcaatc cgcagctgct ggcagaatat cagagcaaac tgagcgaata taccctgtat     180

cgtaatgcac agagcaatac cgtgaaagtg attaaagatg ttgatgcagc catcatccag     240

aattttcgta attgtaccag catccaagag cgcaaaaatg atggtggtga ttgtggtaaa     300

ccggcatgta ccaccctgct gaattgtgca aattttctga gctgtctgtg tagcacctgt     360

gcactgtgtc gtaaacgt                                                   378


<210> 23
<211> 126
<212> PRT
<213> Artificial Sequence

<220> 
<223> Shigella MxiH fused to lamprey multimerizing long sequence

<400> 23
Met Ser Val Thr Val Pro Asn Asp Asp Trp Thr Leu Ser Ser Leu Ser 
1               5                   10                  15      
Glu Thr Phe Asp Asp Gly Thr Gln Thr Leu Gln Gly Glu Leu Thr Leu 
            20                  25                  30          
Ala Leu Asp Lys Leu Ala Lys Asn Pro Ser Asn Pro Gln Leu Leu Ala 
        35                  40                  45              
Glu Tyr Gln Ser Lys Leu Ser Glu Tyr Thr Leu Tyr Arg Asn Ala Gln 
    50                  55                  60                  
Ser Asn Thr Val Lys Val Ile Lys Asp Val Asp Ala Ala Ile Ile Gln 
65                  70                  75                  80  
Asn Phe Arg Asn Cys Thr Ser Ile Gln Glu Arg Lys Asn Asp Gly Gly 
                85                  90                  95      
Asp Cys Gly Lys Pro Ala Cys Thr Thr Leu Leu Asn Cys Ala Asn Phe 
            100                 105                 110         
Leu Ser Cys Leu Cys Ser Thr Cys Ala Leu Cys Arg Lys Arg 
        115                 120                 125     

<210> 24
<211> 282
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..282
<223> /mol_type="unassigned DNA"
      /note="Shigella MxiH fused to a His-tag, optimized for codon
      usage in E. coli"
      /organism="Artificial Sequence"

<400> 24
atgagtgtta ccgttccgaa tgatgattgg accctgagca gcctgagcga aacctttgat     60

gatggcaccc agacactgca gggtgaactg accctggcac tggataaact ggcaaaaaat    120

ccgagcaatc cgcagctgct ggcagaatat cagagcaaac tgagcgaata taccctgtat    180

cgtaatgcac agagcaatac cgtgaaagtg attaaagatg ttgatgcagc catcatccag    240

aattttcgtg gtggtagcct cgagcaccac caccaccacc ac                       282


<210> 25
<211> 411
<212> DNA
<213> Artificial Sequence

<220> 
<221> source
<222> 1..411
<223> /mol_type="unassigned DNA"
      /note="Shigella MxiH fused to lamprey multimerizing long sequence
      and to a His-tag, optimized for codon usage in E. coli"
      /organism="Artificial Sequence"

<400> 25
atgagtgtta ccgttccgaa tgatgattgg accctgagca gcctgagcga aacctttgat      60

gatggcaccc agacactgca gggtgaactg accctggcac tggataaact ggcaaaaaat     120

ccgagcaatc cgcagctgct ggcagaatat cagagcaaac tgagcgaata taccctgtat     180

cgtaatgcac agagcaatac cgtgaaagtg attaaagatg ttgatgcagc catcatccag     240

aattttcgta attgtaccag catccaagag cgcaaaaatg atggtggtga ttgtggtaaa     300

ccggcatgta ccaccctgct gaattgtgca aattttctga gctgtctgtg tagcacctgt     360

gcactgtgtc gtaaacgtgg tggtagcctc gagcaccacc accaccacca c              411


<210> 26
<211> 559
<212> PRT
<213> Artificial Sequence

<220> 
<223> Influenza virus HA ectodomain fused to lamprey multimerizing
      shortened sequence

<400> 26
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn 
1               5                   10                  15      
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr 
            20                  25                  30          
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn 
        35                  40                  45              
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val 
    50                  55                  60                  
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly 
65                  70                  75                  80  
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile 
                85                  90                  95      
Val Glu Thr Pro Ser Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe 
            100                 105                 110         
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe 
        115                 120                 125             
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp 
    130                 135                 140                 
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser 
145                 150                 155                 160 
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro 
                165                 170                 175     
Lys Leu Ser Lys Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val 
            180                 185                 190         
Leu Trp Gly Ile His His Pro Ser Thr Ser Ala Asp Gln Gln Ser Leu 
        195                 200                 205             
Tyr Gln Asn Ala Asp Thr Tyr Val Phe Val Gly Ser Ser Arg Tyr Ser 
    210                 215                 220                 
Lys Lys Phe Lys Pro Glu Ile Ala Ile Arg Pro Lys Val Arg Asp Gln 
225                 230                 235                 240 
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys 
                245                 250                 255     
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe 
            260                 265                 270         
Ala Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro 
        275                 280                 285             
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn 
    290                 295                 300                 
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys 
305                 310                 315                 320 
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg 
                325                 330                 335     
Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly 
            340                 345                 350         
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr 
        355                 360                 365             
His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser 
    370                 375                 380                 
Thr Gln Asn Ala Ile Asp Glu Ile Thr Asn Lys Val Asn Ser Val Ile 
385                 390                 395                 400 
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His 
                405                 410                 415     
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe 
            420                 425                 430         
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn 
        435                 440                 445             
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu 
    450                 455                 460                 
Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly 
465                 470                 475                 480 
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser Val 
                485                 490                 495     
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu 
            500                 505                 510         
Asn Arg Glu Glu Ile Asp Gly Val Lys Leu Glu Ser Thr Arg Ile Tyr 
        515                 520                 525             
Gln Asp Cys Gly Lys Pro Ala Cys Thr Thr Leu Leu Asn Cys Ala Asn 
    530                 535                 540                 
Phe Leu Ser Cys Leu Cys Ser Thr Cys Ala Leu Cys Arg Lys Arg 
545                 550                 555                 

<210> 27
<211> 572
<212> PRT
<213> Artificial Sequence

<220> 
<223> Influenza virus HA ectodomain fused to lamprey multimerizing long
      sequence

<400> 27
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn 
1               5                   10                  15      
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr 
            20                  25                  30          
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn 
        35                  40                  45              
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val 
    50                  55                  60                  
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly 
65                  70                  75                  80  
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile 
                85                  90                  95      
Val Glu Thr Pro Ser Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe 
            100                 105                 110         
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe 
        115                 120                 125             
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp 
    130                 135                 140                 
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser 
145                 150                 155                 160 
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro 
                165                 170                 175     
Lys Leu Ser Lys Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val 
            180                 185                 190         
Leu Trp Gly Ile His His Pro Ser Thr Ser Ala Asp Gln Gln Ser Leu 
        195                 200                 205             
Tyr Gln Asn Ala Asp Thr Tyr Val Phe Val Gly Ser Ser Arg Tyr Ser 
    210                 215                 220                 
Lys Lys Phe Lys Pro Glu Ile Ala Ile Arg Pro Lys Val Arg Asp Gln 
225                 230                 235                 240 
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys 
                245                 250                 255     
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe 
            260                 265                 270         
Ala Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro 
        275                 280                 285             
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn 
    290                 295                 300                 
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys 
305                 310                 315                 320 
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg 
                325                 330                 335     
Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly 
            340                 345                 350         
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr 
        355                 360                 365             
His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser 
    370                 375                 380                 
Thr Gln Asn Ala Ile Asp Glu Ile Thr Asn Lys Val Asn Ser Val Ile 
385                 390                 395                 400 
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His 
                405                 410                 415     
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe 
            420                 425                 430         
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn 
        435                 440                 445             
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu 
    450                 455                 460                 
Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly 
465                 470                 475                 480 
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser Val 
                485                 490                 495     
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu 
            500                 505                 510         
Asn Arg Glu Glu Ile Asp Gly Val Lys Leu Glu Ser Thr Arg Ile Tyr 
        515                 520                 525             
Gln Asn Cys Thr Ser Ile Gln Glu Arg Lys Asn Asp Gly Gly Asp Cys 
    530                 535                 540                 
Gly Lys Pro Ala Cys Thr Thr Leu Leu Asn Cys Ala Asn Phe Leu Ser 
545                 550                 555                 560 
Cys Leu Cys Ser Thr Cys Ala Leu Cys Arg Lys Arg 
                565                 570         

<210> 28
<211> 558
<212> PRT
<213> Artificial Sequence

<220> 
<223> Influenza virus HA ectodomain fused to T4 foldon multimerizing
      sequence

<400> 28
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn 
1               5                   10                  15      
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr 
            20                  25                  30          
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn 
        35                  40                  45              
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val 
    50                  55                  60                  
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly 
65                  70                  75                  80  
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile 
                85                  90                  95      
Val Glu Thr Pro Ser Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe 
            100                 105                 110         
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe 
        115                 120                 125             
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp 
    130                 135                 140                 
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser 
145                 150                 155                 160 
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro 
                165                 170                 175     
Lys Leu Ser Lys Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val 
            180                 185                 190         
Leu Trp Gly Ile His His Pro Ser Thr Ser Ala Asp Gln Gln Ser Leu 
        195                 200                 205             
Tyr Gln Asn Ala Asp Thr Tyr Val Phe Val Gly Ser Ser Arg Tyr Ser 
    210                 215                 220                 
Lys Lys Phe Lys Pro Glu Ile Ala Ile Arg Pro Lys Val Arg Asp Gln 
225                 230                 235                 240 
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys 
                245                 250                 255     
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe 
            260                 265                 270         
Ala Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro 
        275                 280                 285             
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn 
    290                 295                 300                 
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys 
305                 310                 315                 320 
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg 
                325                 330                 335     
Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly 
            340                 345                 350         
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr 
        355                 360                 365             
His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser 
    370                 375                 380                 
Thr Gln Asn Ala Ile Asp Glu Ile Thr Asn Lys Val Asn Ser Val Ile 
385                 390                 395                 400 
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His 
                405                 410                 415     
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe 
            420                 425                 430         
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn 
        435                 440                 445             
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu 
    450                 455                 460                 
Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly 
465                 470                 475                 480 
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser Val 
                485                 490                 495     
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu 
            500                 505                 510         
Asn Arg Glu Glu Ile Asp Gly Val Lys Leu Glu Ser Thr Arg Ile Tyr 
        515                 520                 525             
Gln Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr 
    530                 535                 540                 
Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu 
545                 550                 555             

<210> 29
<211> 24
<212> PRT
<213> Artificial Sequence

<220> 
<223> Consensus sequence for a LRR module from Lamprey VLR-B antibody

<220> 
<223> X" is any amino acid

<400> 29
Leu Xaa Xaa Leu Xaa Xaa Leu Xaa Leu Xaa Xaa Asn Xaa Leu Xaa Xaa 
1               5                   10                  15      
Xaa Pro Xaa Gly Xaa Phe Asp Xaa 
            20                  

