                         SEQUENCE LISTING

<110>  University of Virginia Patent Foundation
       Virginia Tech Intellectual Properties, Inc.
       Zeichner, Steven L.
       Meng, Xiang-Jin
       Tang, Debin
 
<120>  COMPOSITIONS AND METHODS FOR INDUCING IMMUNE RESPONSES AGAINST 
       CLASS I FUSION PROTEIN VIRUSES

<130>  3062/129 PCT

<150>  US 63/022,746
<151>  2020-05-11

<150>  US 63/127,712
<151>  2020-12-18

<160>  49    

<170>  PatentIn version 3.5

<210>  1
<211>  849
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificially synthesized sequence

<400>  1
ggtctcacac accacaattc agcaaattgt gaacatcatc acgttcatct ttccctggtt       60

gccaatggcc cattttcctg tcagtaacga gaaggtcgcg aattcaggcg ctttttagac      120

tggtcgtaat gaaattcttt ttaagaagga gatatacata tgattaaatt aaaatttggt      180

gtttttttta cagttttact atcttcagca tatgcacatg gaacacctca aaatattact      240

cacggtgtgt taaccctgtc ttctcgtggt ctggactatc catacgatgt accggattac      300

gcgcgttatg gaagacgagt gttgaacaag ggatggtatt taaccgtgtt aacaatggag      360

atcttcgaaa atgtggctaa tggcgatatt tctgccactt ccaccgatgc gattaacgga      420

agtcagttgt atgctgtggc aaaaggggta acaaaccttg ctggacaagt gaataatctt      480

gagggcaaag tgaataaagt gggcaaacgt gcagatgcag gtacagcaag tgcattagcg      540

gcttcacagt taccacaagc cactatgcca ggtaaatcaa tggttgctat tgcgggaagt      600

agttatcaag gtcaaaatgg tttagctatc ggggtatcaa gaatttccga taatggcaaa      660

gtgattattc gcttgtcagg cacaaccaat agtcaaggta aaacaggcgt tgcagcaggt      720

gttggttacc agtggtaagt tagagcggcc gccaccgctg agcaataact agcataaccc      780

cttggggcct ctaaacgggt cttgaggggt tttttgctga aaggaggaac tatatccggg      840

taacccggg                                                              849


<210>  2
<211>  3475
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificially synthesized sequence

<400>  2
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc      240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat      300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt      360

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt gacgcgtatt gggatggtct      420

cacacaccac aattcagcaa attgtgaaca tcatcacgtt catctttccc tggttgccaa      480

tggcccattt tcctgtcagt aacgagaagg tcgcgaattc aggcgctttt tagactggtc      540

gtaatgaaat tctttttaag aaggagatat acatatgatt aaattaaaat ttggtgtttt      600

ttttacagtt ttactatctt cagcatatgc acatggaaca cctcaaaata ttactcacgg      660

tgtgttaacc ctgtcttctc gtggtctgga ctatccatac gatgtaccgg attacgcgcg      720

ttatggaaga cgagtgttga acaagggatg gtatttaacc gtgttaacaa tggagatctt      780

cgaaaatgtg gctaatggcg atatttctgc cacttccacc gatgcgatta acggaagtca      840

gttgtatgct gtggcaaaag gggtaacaaa ccttgctgga caagtgaata atcttgaggg      900

caaagtgaat aaagtgggca aacgtgcaga tgcaggtaca gcaagtgcat tagcggcttc      960

acagttacca caagccacta tgccaggtaa atcaatggtt gctattgcgg gaagtagtta     1020

tcaaggtcaa aatggtttag ctatcggggt atcaagaatt tccgataatg gcaaagtgat     1080

tattcgcttg tcaggcacaa ccaatagtca aggtaaaaca ggcgttgcag caggtgttgg     1140

ttaccagtgg taagttagag cggccgccac cgctgagcaa taactagcat aaccccttgg     1200

ggcctctaaa cgggtcttga ggggtttttt gctgaaagga ggaactatat ccgggtaacc     1260

cgggatccca atggcgcgcc gagcttggct cgagcatggt catagctgtt tcctgtgtga     1320

aattgttatc cgctcacaat tccacacaac atacgagccg gaagcataaa gtgtaaagcc     1380

tggggtgcct aatgagtgag ctaactcaca ttaattgcgt tgcgctcact gcccgctttc     1440

cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc ggggagaggc     1500

ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt     1560

cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca     1620

ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa     1680

aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat     1740

cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc     1800

cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc     1860

gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt     1920

tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac     1980

cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg     2040

ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca     2100

gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc     2160

gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa     2220

accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa     2280

ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac     2340

tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta     2400

aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt     2460

tagaaaaact catcgagcat caaatgaaac tgcaatttat tcatatcagg attatcaata     2520

ccatattttt gaaaaagccg tttctgtaat gaaggagaaa actcaccgag gcagttccat     2580

aggatggcaa gatcctggta tcggtctgcg attccgactc gtccaacatc aatacaacct     2640

attaatttcc cctcgtcaaa aataaggtta tcaagtgaga aatcaccatg agtgacgact     2700

gaatccggtg agaatggcaa aagtttatgc atttctttcc agacttgttc aacaggccag     2760

ccattacgct cgtcatcaaa atcactcgca tcaaccaaac cgttattcat tcgtgattgc     2820

gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac aattacaaac aggaatcgaa     2880

tgcaaccggc gcaggaacac tgccagcgca tcaacaatat tttcacctga atcaggatat     2940

tcttctaata cctggaatgc tgttttccca gggatcgcag tggtgagtaa ccatgcatca     3000

tcaggagtac ggataaaatg cttgatggtc ggaagaggca taaattccgt cagccagttt     3060

agtctgacca tctcatctgt aacatcattg gcaacgctac ctttgccatg tttcagaaac     3120

aactctggcg catcgggctt cccatacaat cgatagattg tcgcacctga ttgcccgaca     3180

ttatcgcgag cccatttata cccatataaa tcagcatcca tgttggaatt taatcgcggc     3240

ctagagcaag acgtttcccg ttgaatatgg ctcatactct tcctttttca atattattga     3300

agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat     3360

aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc     3420

attattatca tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtc          3475


<210>  3
<211>  675
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificially synthesized sequence

<400>  3
ggtctcacac accacaattc agcaaattgt gaacatcatc acgttcatct ttccctggtt       60

gccaatggcc cattttcctg tcagtaacga gaaggtcgcg aattcaggcg ctttttagac      120

tggtcgtaat gaaattcttt ttaagaagga gatatacata tgattaaatt aaaatttggt      180

gtttttttta cagttttact atcttcagca tatgcacatg gaacacctca aaatattact      240

cacggtgtgt taaccctgtc ttctcgtggt ctggactatc catacgatgt accggattac      300

gcgcgttatg gaagacgagt gttgaacaag ggatggtatt taaccgtgtt aacaatggag      360

atcttcgaaa gccagctgcc gcaggcgacc atgccgggca aaagcatggt ggcgattgcg      420

ggcagcagct atcagggcca gaacggcctg gcgattggcg tgagccgcat tagcgataac      480

ggcaaagtga ttattcgcct gagcggcacc accaacagcc agggcaaaac cggcgtggcg      540

gcgggcgtgg gctatcagtg gtaagttaga gcggccgcca ccgctgagca ataactagca      600

taaccccttg gggcctctaa acgggtcttg aggggttttt tgctgaaagg aggaactata      660

tccgggtaac ccggg                                                       675


<210>  4
<211>  782
<212>  DNA
<213>  SARS-CoV-2

<400>  4
aaccacgtct tcaaaagtgg aagccgaagt tcaaatcgac cgtctgatca ccggtcgtct       60

gcagagtctg cagacctatg tgacccagca gctcatccgc gccgcggaaa ttcgtgccag      120

cgcgaatctg gccgcgacga aaatgagcga gtgcgtgctg ggccagagca aacgcgtgga      180

tttctgcggc aagggctacc atctgatgag cttcccacag agcgcgccgc acggtgtggt      240

ttttctgcat gttacctacg tgccagccca agaaaaaaac ttcacgaccg cgccagcgat      300

ctgtcacgac ggcaaggccc acttcccacg cgaaggtgtg ttcgtgagca atggtaccca      360

ctggtttgtg acgcagcgca acttctacga accgcagatc atcaccacgg acaacacgtt      420

cgtgagcggt aactgcgacg tggtgattgg catcgtgaac aacacggtgt acgatccgct      480

ccagccagaa ctggacagct tcaaggagga gctggacaag tacttcaaaa accacaccag      540

cccggacgtt gatctgggcg acatcagcgg catcaacgcg agcgtggtga acatccagaa      600

agagatcgat cgtctgaacg aagtggcgaa aaacctcaac gagagtctga tcgatctgca      660

agaactgggc aaatacgagc agtacatcaa gtggccgtgg tacatttggc tgggtttcat      720

tgccggtctg atcgccatcg tgatggtgac catcatgctg tgctgcatga gaagacttgt      780

gt                                                                     782


<210>  5
<211>  746
<212>  DNA
<213>  SARS-CoV-2

<400>  5
aaccacgtct tcaaaagtgg aagccgaagt tcaaatcgac cgtctgatca ccggtcgtct       60

gcagagtctg cagacctatg tgacccagca gctcatccgc gccgcggaaa ttcgtgccag      120

cgcgaatctg gccgcgacga aaatgagcga gtgcgtgctg ggccagagca aacgcgtgga      180

tttctgcggc aagggctacc atctgatgag cttcccacag agcgcgccgc acggtgtggt      240

ttttctgcat gttacctacg tgccagccca agaaaaaaac ttcacgaccg cgccagcgat      300

ctgtcacgac ggcaaggccc acttcccacg cgaaggtgtg ttcgtgagca atggtaccca      360

ctggtttgtg acgcagcgca acttctacga accgcagatc atcaccacgg acaacacgtt      420

cgtgagcggt aactgcgacg tggtgattgg catcgtgaac aacacggtgt acgatccgct      480

ccagccagaa ctggacagct tcaaggagga gctggacaag tacttcaaaa accacaccag      540

cccggacgtt gatctgggcg acatcagcgg catcaacgcg agcgtggtga acatccagaa      600

agagatcgat cgtctgaacg aagtggcgaa aaacctcaac gagagtctga tcgatctgca      660

agaactgggc aaatacgagc agtacatcaa gtggccgtgg tacatttggc tgggtttcat      720

tgccggtctg atcagaagac ttgtgt                                           746


<210>  6
<211>  710
<212>  DNA
<213>  SARS-CoV-2

<400>  6
aaccacgtct tcaaaagtgg aagccgaagt tcaaatcgac cgtctgatca ccggtcgtct       60

gcagagtctg cagacctatg tgacccagca gctcatccgc gccgcggaaa ttcgtgccag      120

cgcgaatctg gccgcgacga aaatgagcga gtgcgtgctg ggccagagca aacgcgtgga      180

tttctgcggc aagggctacc atctgatgag cttcccacag agcgcgccgc acggtgtggt      240

ttttctgcat gttacctacg tgccagccca agaaaaaaac ttcacgaccg cgccagcgat      300

ctgtcacgac ggcaaggccc acttcccacg cgaaggtgtg ttcgtgagca atggtaccca      360

ctggtttgtg acgcagcgca acttctacga accgcagatc atcaccacgg acaacacgtt      420

cgtgagcggt aactgcgacg tggtgattgg catcgtgaac aacacggtgt acgatccgct      480

ccagccagaa ctggacagct tcaaggagga gctggacaag tacttcaaaa accacaccag      540

cccggacgtt gatctgggcg acatcagcgg catcaacgcg agcgtggtga acatccagaa      600

agagatcgat cgtctgaacg aagtggcgaa aaacctcaac gagagtctga tcgatctgca      660

agaactgggc aaatacgagc agtacatcaa gtggccgaga agacttgtgt                 710


<210>  7
<211>  512
<212>  DNA
<213>  SARS-CoV-2

<400>  7
aaccacgtct tcaacgaccg cgccagcgat ctgtcacgac ggcaaggccc acttcccacg       60

cgaaggtgtg ttcgtgagca atggtaccca ctggtttgtg acgcagcgca acttctacga      120

accgcagatc atcaccacgg acaacacgtt cgtgagcggt aactgcgacg tggtgattgg      180

catcgtgaac aacacggtgt acgatccgct ccagccagaa ctggacagct tcaaggagga      240

gctggacaag tacttcaaaa accacaccag cccggacgtt gatctgggcg acatcagcgg      300

catcaacgcg agcgtggtga acatccagaa agagatcgat cgtctgaacg aagtggcgaa      360

aaacctcaac gagagtctga tcgatctgca agaactgggc aaatacgagc agtacatcaa      420

gtggccgtgg tacatttggc tgggtttcat tgccggtctg atcgccatcg tgatggtgac      480

catcatgctg tgctgcatga gaagacttgt gt                                    512


<210>  8
<211>  476
<212>  DNA
<213>  SARS-CoV-2

<400>  8
aaccacgtct tcaacgaccg cgccagcgat ctgtcacgac ggcaaggccc acttcccacg       60

cgaaggtgtg ttcgtgagca atggtaccca ctggtttgtg acgcagcgca acttctacga      120

accgcagatc atcaccacgg acaacacgtt cgtgagcggt aactgcgacg tggtgattgg      180

catcgtgaac aacacggtgt acgatccgct ccagccagaa ctggacagct tcaaggagga      240

gctggacaag tacttcaaaa accacaccag cccggacgtt gatctgggcg acatcagcgg      300

catcaacgcg agcgtggtga acatccagaa agagatcgat cgtctgaacg aagtggcgaa      360

aaacctcaac gagagtctga tcgatctgca agaactgggc aaatacgagc agtacatcaa      420

gtggccgtgg tacatttggc tgggtttcat tgccggtctg atcagaagac ttgtgt          476


<210>  9
<211>  440
<212>  DNA
<213>  SARS-CoV-2

<400>  9
aaccacgtct tcaacgaccg cgccagcgat ctgtcacgac ggcaaggccc acttcccacg       60

cgaaggtgtg ttcgtgagca atggtaccca ctggtttgtg acgcagcgca acttctacga      120

accgcagatc atcaccacgg acaacacgtt cgtgagcggt aactgcgacg tggtgattgg      180

catcgtgaac aacacggtgt acgatccgct ccagccagaa ctggacagct tcaaggagga      240

gctggacaag tacttcaaaa accacaccag cccggacgtt gatctgggcg acatcagcgg      300

catcaacgcg agcgtggtga acatccagaa agagatcgat cgtctgaacg aagtggcgaa      360

aaacctcaac gagagtctga tcgatctgca agaactgggc aaatacgagc agtacatcaa      420

gtggccgaga agacttgtgt                                                  440


<210>  10
<211>  251
<212>  DNA
<213>  SARS-CoV-2

<400>  10
aaccacgtct tcagacgttg atctgggcga catcagcggc atcaacgcga gcgtggtgaa       60

catccagaaa gagatcgatc gtctgaacga agtggcgaaa aacctcaacg agagtctgat      120

cgatctgcaa gaactgggca aatacgagca gtacatcaag tggccgtggt acatttggct      180

gggtttcatt gccggtctga tcgccatcgt gatggtgacc atcatgctgt gctgcatgag      240

aagacttgtg t                                                           251


<210>  11
<211>  215
<212>  DNA
<213>  SARS-CoV-2

<400>  11
aaccacgtct tcagacgttg atctgggcga catcagcggc atcaacgcga gcgtggtgaa       60

catccagaaa gagatcgatc gtctgaacga agtggcgaaa aacctcaacg agagtctgat      120

cgatctgcaa gaactgggca aatacgagca gtacatcaag tggccgtggt acatttggct      180

gggtttcatt gccggtctga tcagaagact tgtgt                                 215


<210>  12
<211>  179
<212>  DNA
<213>  SARS-CoV-2

<400>  12
aaccacgtct tcagacgttg atctgggcga catcagcggc atcaacgcga gcgtggtgaa       60

catccagaaa gagatcgatc gtctgaacga agtggcgaaa aacctcaacg agagtctgat      120

cgatctgcaa gaactgggca aatacgagca gtacatcaag tggccgagaa gacttgtgt       179


<210>  13
<211>  176
<212>  DNA
<213>  SARS-CoV-2

<400>  13
aaccacgtct tcagaagtgg cgaaaaacct caacgagagt ctgatcgatc tgcaagaact       60

gggcaaatac gagcagtaca tcaagtggcc gtggtacatt tggctgggtt tcattgccgg      120

tctgatcgcc atcgtgatgg tgaccatcat gctgtgctgc atgagaagac ttgtgt          176


<210>  14
<211>  140
<212>  DNA
<213>  SARS-CoV-2

<400>  14
aaccacgtct tcagaagtgg cgaaaaacct caacgagagt ctgatcgatc tgcaagaact       60

gggcaaatac gagcagtaca tcaagtggcc gtggtacatt tggctgggtt tcattgccgg      120

tctgatcaga agacttgtgt                                                  140


<210>  15
<211>  104
<212>  DNA
<213>  SARS-CoV-2

<400>  15
aaccacgtct tcagaagtgg cgaaaaacct caacgagagt ctgatcgatc tgcaagaact       60

gggcaaatac gagcagtaca tcaagtggcc gagaagactt gtgt                       104


<210>  16
<211>  252
<212>  PRT
<213>  SARS-CoV-2

<400>  16

Lys Val Glu Ala Glu Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu 
1               5                   10                  15      


Gln Ser Leu Gln Thr Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala Glu 
            20                  25                  30          


Ile Arg Ala Ser Ala Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val 
        35                  40                  45              


Leu Gly Gln Ser Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu 
    50                  55                  60                  


Met Ser Phe Pro Gln Ser Ala Pro His Gly Val Val Phe Leu His Val 
65                  70                  75                  80  


Thr Tyr Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile 
                85                  90                  95      


Cys His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser 
            100                 105                 110         


Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln 
        115                 120                 125             


Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val Val 
    130                 135                 140                 


Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu 
145                 150                 155                 160 


Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn His Thr Ser 
                165                 170                 175     


Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala Ser Val Val 
            180                 185                 190         


Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala Lys Asn Leu 
        195                 200                 205             


Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr Glu Gln Tyr 
    210                 215                 220                 


Ile Lys Trp Pro Trp Tyr Ile Trp Leu Gly Phe Ile Ala Gly Leu Ile 
225                 230                 235                 240 


Ala Ile Val Met Val Thr Ile Met Leu Cys Cys Met 
                245                 250         


<210>  17
<211>  9
<212>  PRT
<213>  SARS-CoV-2

<400>  17

Leu Ile Thr Gly Arg Leu Gln Ser Leu 
1               5                   


<210>  18
<211>  18
<212>  PRT
<213>  SARS-CoV-2

<400>  18

Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala 
1               5                   10                  15      


Thr Lys 
        


<210>  19
<211>  9
<212>  PRT
<213>  SARS-CoV-2

<400>  19

Val Val Phe Leu His Val Thr Tyr Val 
1               5                   


<210>  20
<211>  15
<212>  PRT
<213>  SARS-CoV-2

<400>  20

His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile 
1               5                   10                  15  


<210>  21
<211>  12
<212>  PRT
<213>  SARS-CoV-2

<400>  21

Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu 
1               5                   10          


<210>  22
<211>  9
<212>  PRT
<213>  SARS-CoV-2

<400>  22

Arg Leu Asn Glu Val Ala Lys Asn Leu 
1               5                   


<210>  23
<211>  9
<212>  PRT
<213>  SARS-CoV-2

<400>  23

Asn Leu Asn Glu Ser Leu Ile Asp Leu 
1               5                   


<210>  24
<211>  9
<212>  PRT
<213>  SARS-CoV-2

<400>  24

Phe Ile Ala Gly Leu Ile Ala Ile Val 
1               5                   


<210>  25
<211>  22
<212>  PRT
<213>  SARS-CoV-2

<400>  25

Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala 
1               5                   10                  15      


Tyr Arg Phe Asn Gly Ile 
            20          


<210>  26
<211>  14
<212>  PRT
<213>  SARS-CoV-2

<400>  26

Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile 
1               5                   10                  


<210>  27
<211>  9
<212>  PRT
<213>  SARS-CoV-2

<400>  27

Ile Pro Phe Ala Met Gln Met Ala Tyr 
1               5                   


<210>  28
<211>  1273
<212>  PRT
<213>  SARS-CoV-2

<400>  28

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
            340                 345                 350         


Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
        355                 360                 365             


Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
    370                 375                 380                 


Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
385                 390                 395                 400 


Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                405                 410                 415     


Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
            420                 425                 430         


Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
        435                 440                 445             


Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
    450                 455                 460                 


Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
465                 470                 475                 480 


Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                485                 490                 495     


Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
            500                 505                 510         


Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
        515                 520                 525             


Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
    530                 535                 540                 


Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
545                 550                 555                 560 


Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                565                 570                 575     


Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
            580                 585                 590         


Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
        595                 600                 605             


Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
    610                 615                 620                 


His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
625                 630                 635                 640 


Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                645                 650                 655     


Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
            660                 665                 670         


Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala 
        675                 680                 685             


Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser 
    690                 695                 700                 


Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile 
705                 710                 715                 720 


Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val 
                725                 730                 735     


Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu 
            740                 745                 750         


Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr 
        755                 760                 765             


Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln 
    770                 775                 780                 


Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe 
785                 790                 795                 800 


Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser 
                805                 810                 815     


Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly 
            820                 825                 830         


Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp 
        835                 840                 845             


Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu 
    850                 855                 860                 


Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly 
865                 870                 875                 880 


Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile 
                885                 890                 895     


Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr 
            900                 905                 910         


Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn 
        915                 920                 925             


Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala 
    930                 935                 940                 


Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn 
945                 950                 955                 960 


Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val 
                965                 970                 975     


Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln 
            980                 985                 990         


Ile Asp Arg Leu Ile Thr Gly Arg  Leu Gln Ser Leu Gln  Thr Tyr Val 
        995                 1000                 1005             


Thr Gln  Gln Leu Ile Arg Ala  Ala Glu Ile Arg Ala  Ser Ala Asn 
    1010                 1015                 1020             


Leu Ala  Ala Thr Lys Met Ser  Glu Cys Val Leu Gly  Gln Ser Lys 
    1025                 1030                 1035             


Arg Val  Asp Phe Cys Gly Lys  Gly Tyr His Leu Met  Ser Phe Pro 
    1040                 1045                 1050             


Gln Ser  Ala Pro His Gly Val  Val Phe Leu His Val  Thr Tyr Val 
    1055                 1060                 1065             


Pro Ala  Gln Glu Lys Asn Phe  Thr Thr Ala Pro Ala  Ile Cys His 
    1070                 1075                 1080             


Asp Gly  Lys Ala His Phe Pro  Arg Glu Gly Val Phe  Val Ser Asn 
    1085                 1090                 1095             


Gly Thr  His Trp Phe Val Thr  Gln Arg Asn Phe Tyr  Glu Pro Gln 
    1100                 1105                 1110             


Ile Ile  Thr Thr Asp Asn Thr  Phe Val Ser Gly Asn  Cys Asp Val 
    1115                 1120                 1125             


Val Ile  Gly Ile Val Asn Asn  Thr Val Tyr Asp Pro  Leu Gln Pro 
    1130                 1135                 1140             


Glu Leu  Asp Ser Phe Lys Glu  Glu Leu Asp Lys Tyr  Phe Lys Asn 
    1145                 1150                 1155             


His Thr  Ser Pro Asp Val Asp  Leu Gly Asp Ile Ser  Gly Ile Asn 
    1160                 1165                 1170             


Ala Ser  Val Val Asn Ile Gln  Lys Glu Ile Asp Arg  Leu Asn Glu 
    1175                 1180                 1185             


Val Ala  Lys Asn Leu Asn Glu  Ser Leu Ile Asp Leu  Gln Glu Leu 
    1190                 1195                 1200             


Gly Lys  Tyr Glu Gln Tyr Ile  Lys Trp Pro Trp Tyr  Ile Trp Leu 
    1205                 1210                 1215             


Gly Phe  Ile Ala Gly Leu Ile  Ala Ile Val Met Val  Thr Ile Met 
    1220                 1225                 1230             


Leu Cys  Cys Met Thr Ser Cys  Cys Ser Cys Leu Lys  Gly Cys Cys 
    1235                 1240                 1245             


Ser Cys  Gly Ser Cys Cys Lys  Phe Asp Glu Asp Asp  Ser Glu Pro 
    1250                 1255                 1260             


Val Leu  Lys Gly Val Lys Leu  His Tyr Thr 
    1265                 1270             


<210>  29
<211>  18
<212>  PRT
<213>  SARS-CoV-2

<400>  29

Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala 
1               5                   10                  15      


Gly Phe 
        


<210>  30
<211>  15
<212>  PRT
<213>  SARS-CoV-2

<400>  30

Pro Leu Lys Pro Thr Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe 
1               5                   10                  15  


<210>  31
<211>  25
<212>  PRT
<213>  SARS-CoV-2

<400>  31

Pro Ser Lys Pro Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn 
1               5                   10                  15      


Lys Val Thr Leu Ala Asp Ala Gly Phe 
            20                  25  


<210>  32
<211>  110
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificially synthesized sequence

<400>  32
aaccacgtct tcaccgagca aaccgagcaa acgcagcttc atcgaggatc tgctgttcaa       60

caaggtgacg ctggccgatg ccggttttgg tggcggcaga agacttgtgt                 110


<210>  33
<211>  3432
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificially synthesized sequence

<400>  33
ggccgccacc gctgagcaat aactagcata accccttggg gcctctaaac gggtcttgag       60

gggttttttg ctgaaaggag gaactatatc cgggtaacga attcaagctt gatatcattc      120

aggacgagcc tcagactcca gcgtaactgg actgcaatca actcactggc tcaccttcac      180

gggtgggcct ttcttcggta gaagtcatct catgaccaaa atcccttaac gtgagttacg      240

cgcgcgtcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat      300

cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg      360

gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga      420

gcgcagatac caaatactgt tcttctagtg tagccgtagt tagcccacca cttcaagaac      480

tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt      540

ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag      600

cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc      660

gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag      720

gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca      780

gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt      840

cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc      900

tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc      960

cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgcgcagaa     1020

aggcccaccc gaaggtgagc caggtgatta catttgggcc ctcattagaa aaactcatcg     1080

agcatcaaat gaaattgcaa tttattcata tcaggattat caataccata tttttgaaaa     1140

agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc     1200

tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg     1260

tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat     1320

ggcaaaagtt tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca     1380

tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgaggcga     1440

aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcgagtgcaa ccggcgcagg     1500

aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg     1560

aacgctgttt ttccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata     1620

aaatgcttga tggtcggaag tggcataaat tccgtcagcc agtttagtct gaccatctca     1680

tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg     1740

ggcttcccat acaagcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat     1800

ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga cgtttcccgt     1860

tgaatatggc tcatagctcc tgaaaatctc gataactcaa aaaatacgcc cggtagtgat     1920

cttatttcat tatggtgaaa gttggaacct cttacgtgcc gatcaaaaag acggtcaaaa     1980

gcctccggtc ggaggctttt gactttctgc tatggaggtc aggtatgatt taaatggtca     2040

gtattgagcg atatctagag aattcgtcca ccacaattca gcaaattgtg aacatcatca     2100

cgttcatctt tccctggttg ccaatggccc attttcctgt cagtaacgag aaggtcgcga     2160

attcaggcgc tttttagact ggtcgtaatg aaattctttt taagaggaga tatacatagg     2220

tctcacatga ttaaattaaa atttggtgtt ttttttacag ttttactatc ttcagcatat     2280

gcacatggaa cacctcaaaa tattactcac ggtgtgttaa ccacgtcttc accgagcaaa     2340

ccgagcaaac gcagcttcat cgaggatctg ctgttcaaca aggtgacgct ggccgatgcc     2400

ggttttggtg gcggcagaag acttgtgtta acaatggaga tcttcgaaca atacagaccg     2460

gagaacggaa gttatgctac caatatggca ctggctaact cactgttcct catggatttg     2520

aatgagcgta agcaattcag ggccatgagt gataatacac agcctgagtc tgcatccgtg     2580

tggatgaaga tcactggagg aataagctct ggtaagctta atgacgggca aaataaaaca     2640

acaaccaatc agtttatcaa tcagctcggg ggggatattt ataaattcca tgctgaacaa     2700

ctgggtgatt ttaccttagg gattatggga ggatacgcga atgcaaaagg taaaacgata     2760

aattacacga gcaacaaagc tgccagaaac acactggatg gttattctgt cggggtatac     2820

ggtacgtggt atcagaatgg ggaaaatgca acagggctct ttgctgaaac ttggatgcaa     2880

tataactggt ttaatgcatc agtgaaaggt gacggactgg aagaagaaaa atataatctg     2940

aatggtttaa ccgcttctgc aggtggggga tataacctga atgtgcacac atggacatca     3000

cctgaaggaa taacaggtga attctggtta cagcctcatt tgcaggctgt ctggatgggg     3060

gttacaccgg atacacatca ggaggataac ggaacggtgg tgcagggagc agggaaaaat     3120

aatattcaga caaaagcagg tattcgtgca tcctggaagg tgaaaagcac cctggataag     3180

gataccgggc ggaggttccg tccgtatata gaggcaaact ggatccataa cactcatgaa     3240

tttggtgtta aaatgagtga tgacagccag ttgttgtcag gtagccgaaa tcagggagag     3300

ataaagacag gtattgaagg ggtgattact caaaacttgt cagtgaatgg cggagtcgca     3360

tatcaggcag gaggtcacgg gagcaatgcc atctccggag cactggggat aaaatacagc     3420

ttcggttaga gc                                                         3432


<210>  34
<211>  446
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificially synthesized sequence

<400>  34
aaccacgtct tcaccgagta aaccgagtaa acgcagcttt attgaggatc tgctcttcaa       60

caaggtgacg ctggccgacg ccggttttgg cggcggcccg agcaagccga gcaaacgtag      120

ctttatcgag gatctgctct ttaataaagt gacgctggcg gatgcgggtt tcggcggtgg      180

cccgagcaaa ccaagcaaac gtagtttcat cgaggatctg ctgttcaata aagttacgct      240

ggccgatgcg ggttttggtg gtggtccaag caaaccaagt aagcgtagtt ttattgagga      300

tctgctgttt aacaaggtta ccctcgccga tgccggtttt ggtggcggtc cgagtaagcc      360

gagtaagcgt agcttcattg aagatctgct gttcaacaaa gtgacgctgg ccgatgccgg      420

ctttggcggt ggtagaagac ttgtgt                                           446


<210>  35
<211>  3768
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificially synthesized sequence

<400>  35
ggccgccacc gctgagcaat aactagcata accccttggg gcctctaaac gggtcttgag       60

gggttttttg ctgaaaggag gaactatatc cgggtaacga attcaagctt gatatcattc      120

aggacgagcc tcagactcca gcgtaactgg actgcaatca actcactggc tcaccttcac      180

gggtgggcct ttcttcggta gaagtcatct catgaccaaa atcccttaac gtgagttacg      240

cgcgcgtcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat      300

cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg      360

gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga      420

gcgcagatac caaatactgt tcttctagtg tagccgtagt tagcccacca cttcaagaac      480

tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt      540

ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag      600

cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc      660

gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag      720

gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca      780

gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt      840

cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc      900

tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc      960

cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgcgcagaa     1020

aggcccaccc gaaggtgagc caggtgatta catttgggcc ctcattagaa aaactcatcg     1080

agcatcaaat gaaattgcaa tttattcata tcaggattat caataccata tttttgaaaa     1140

agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc     1200

tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg     1260

tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat     1320

ggcaaaagtt tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca     1380

tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgaggcga     1440

aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcgagtgcaa ccggcgcagg     1500

aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg     1560

aacgctgttt ttccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata     1620

aaatgcttga tggtcggaag tggcataaat tccgtcagcc agtttagtct gaccatctca     1680

tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg     1740

ggcttcccat acaagcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat     1800

ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga cgtttcccgt     1860

tgaatatggc tcatagctcc tgaaaatctc gataactcaa aaaatacgcc cggtagtgat     1920

cttatttcat tatggtgaaa gttggaacct cttacgtgcc gatcaaaaag acggtcaaaa     1980

gcctccggtc ggaggctttt gactttctgc tatggaggtc aggtatgatt taaatggtca     2040

gtattgagcg atatctagag aattcgtcca ccacaattca gcaaattgtg aacatcatca     2100

cgttcatctt tccctggttg ccaatggccc attttcctgt cagtaacgag aaggtcgcga     2160

attcaggcgc tttttagact ggtcgtaatg aaattctttt taagaggaga tatacatagg     2220

tctcacatga ttaaattaaa atttggtgtt ttttttacag ttttactatc ttcagcatat     2280

gcacatggaa cacctcaaaa tattactcac ggtgtgttaa ccacgtcttc accgagtaaa     2340

ccgagtaaac gcagctttat tgaggatctg ctcttcaaca aggtgacgct ggccgacgcc     2400

ggttttggcg gcggcccgag caagccgagc aaacgtagct ttatcgagga tctgctcttt     2460

aataaagtga cgctggcgga tgcgggtttc ggcggtggcc cgagcaaacc aagcaaacgt     2520

agtttcatcg aggatctgct gttcaataaa gttacgctgg ccgatgcggg ttttggtggt     2580

ggtccaagca aaccaagtaa gcgtagtttt attgaggatc tgctgtttaa caaggttacc     2640

ctcgccgatg ccggttttgg tggcggtccg agtaagccga gtaagcgtag cttcattgaa     2700

gatctgctgt tcaacaaagt gacgctggcc gatgccggct ttggcggtgg tagaagactt     2760

gtgttaacaa tggagatctt cgaacaatac agaccggaga acggaagtta tgctaccaat     2820

atggcactgg ctaactcact gttcctcatg gatttgaatg agcgtaagca attcagggcc     2880

atgagtgata atacacagcc tgagtctgca tccgtgtgga tgaagatcac tggaggaata     2940

agctctggta agcttaatga cgggcaaaat aaaacaacaa ccaatcagtt tatcaatcag     3000

ctcggggggg atatttataa attccatgct gaacaactgg gtgattttac cttagggatt     3060

atgggaggat acgcgaatgc aaaaggtaaa acgataaatt acacgagcaa caaagctgcc     3120

agaaacacac tggatggtta ttctgtcggg gtatacggta cgtggtatca gaatggggaa     3180

aatgcaacag ggctctttgc tgaaacttgg atgcaatata actggtttaa tgcatcagtg     3240

aaaggtgacg gactggaaga agaaaaatat aatctgaatg gtttaaccgc ttctgcaggt     3300

gggggatata acctgaatgt gcacacatgg acatcacctg aaggaataac aggtgaattc     3360

tggttacagc ctcatttgca ggctgtctgg atgggggtta caccggatac acatcaggag     3420

gataacggaa cggtggtgca gggagcaggg aaaaataata ttcagacaaa agcaggtatt     3480

cgtgcatcct ggaaggtgaa aagcaccctg gataaggata ccgggcggag gttccgtccg     3540

tatatagagg caaactggat ccataacact catgaatttg gtgttaaaat gagtgatgac     3600

agccagttgt tgtcaggtag ccgaaatcag ggagagataa agacaggtat tgaaggggtg     3660

attactcaaa acttgtcagt gaatggcgga gtcgcatatc aggcaggagg tcacgggagc     3720

aatgccatct ccggagcact ggggataaaa tacagcttcg gttagagc                  3768


<210>  36
<211>  3393
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificially synthesized sequence

<400>  36
ggccgccacc gctgagcaat aactagcata accccttggg gcctctaaac gggtcttgag       60

gggttttttg ctgaaaggag gaactatatc cgggtaacga attcaagctt gatatcattc      120

aggacgagcc tcagactcca gcgtaactgg actgcaatca actcactggc tcaccttcac      180

gggtgggcct ttcttcggta gaagtcatct catgaccaaa atcccttaac gtgagttacg      240

cgcgcgtcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat      300

cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg      360

gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga      420

gcgcagatac caaatactgt tcttctagtg tagccgtagt tagcccacca cttcaagaac      480

tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt      540

ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag      600

cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc      660

gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag      720

gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca      780

gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt      840

cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc      900

tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc      960

cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgcgcagaa     1020

aggcccaccc gaaggtgagc caggtgatta catttgggcc ctcattagaa aaactcatcg     1080

agcatcaaat gaaattgcaa tttattcata tcaggattat caataccata tttttgaaaa     1140

agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc     1200

tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg     1260

tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat     1320

ggcaaaagtt tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca     1380

tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgaggcga     1440

aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcgagtgcaa ccggcgcagg     1500

aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg     1560

aacgctgttt ttccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata     1620

aaatgcttga tggtcggaag tggcataaat tccgtcagcc agtttagtct gaccatctca     1680

tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg     1740

ggcttcccat acaagcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat     1800

ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga cgtttcccgt     1860

tgaatatggc tcatagctcc tgaaaatctc gataactcaa aaaatacgcc cggtagtgat     1920

cttatttcat tatggtgaaa gttggaacct cttacgtgcc gatcaaaaag acggtcaaaa     1980

gcctccggtc ggaggctttt gactttctgc tatggaggtc aggtatgatt taaatggtca     2040

gtattgagcg atatctagag aattcgtcca ccacaattca gcaaattgtg aacatcatca     2100

cgttcatctt tccctggttg ccaatggccc attttcctgt cagtaacgag aaggtcgcga     2160

attcaggcgc tttttagact ggtcgtaatg aaattctttt taagaggaga tatacatagg     2220

tctcacatga ttaaattaaa atttggtgtt ttttttacag ttttactatc ttcagcatat     2280

gcacatggaa cacctcaaaa tattactcac ggtgtgttaa ccctgtcttc tcgtggtctg     2340

gactatccat acgatgtacc ggattacgcg cgttatggaa gacgagtgtt aacaatggag     2400

atcttcgaac aatacagacc ggagaacgga agttatgcta ccaatatggc actggctaac     2460

tcactgttcc tcatggattt gaatgagcgt aagcaattca gggccatgag tgataataca     2520

cagcctgagt ctgcatccgt gtggatgaag atcactggag gaataagctc tggtaagctt     2580

aatgacgggc aaaataaaac aacaaccaat cagtttatca atcagctcgg gggggatatt     2640

tataaattcc atgctgaaca actgggtgat tttaccttag ggattatggg aggatacgcg     2700

aatgcaaaag gtaaaacgat aaattacacg agcaacaaag ctgccagaaa cacactggat     2760

ggttattctg tcggggtata cggtacgtgg tatcagaatg gggaaaatgc aacagggctc     2820

tttgctgaaa cttggatgca atataactgg tttaatgcat cagtgaaagg tgacggactg     2880

gaagaagaaa aatataatct gaatggttta accgcttctg caggtggggg atataacctg     2940

aatgtgcaca catggacatc acctgaagga ataacaggtg aattctggtt acagcctcat     3000

ttgcaggctg tctggatggg ggttacaccg gatacacatc aggaggataa cggaacggtg     3060

gtgcagggag cagggaaaaa taatattcag acaaaagcag gtattcgtgc atcctggaag     3120

gtgaaaagca ccctggataa ggataccggg cggaggttcc gtccgtatat agaggcaaac     3180

tggatccata acactcatga atttggtgtt aaaatgagtg atgacagcca gttgttgtca     3240

ggtagccgaa atcagggaga gataaagaca ggtattgaag gggtgattac tcaaaacttg     3300

tcagtgaatg gcggagtcgc atatcaggca ggaggtcacg ggagcaatgc catctccgga     3360

gcactgggga taaaatacag cttcggttag agc                                  3393


<210>  37
<211>  6
<212>  PRT
<213>  SARS-CoV-2

<400>  37

Ile Glu Asp Leu Leu Phe 
1               5       


<210>  38
<211>  8
<212>  PRT
<213>  Human immunodeficiency virus

<400>  38

Ala Val Gly Ile Gly Ala Val Phe 
1               5               


<210>  39
<211>  29903
<212>  DNA
<213>  SARS-CoV-2


<220>
<221>  CDS
<222>  (21563)..(25384)

<400>  39
attaaaggtt tataccttcc caggtaacaa accaaccaac tttcgatctc ttgtagatct       60

gttctctaaa cgaactttaa aatctgtgtg gctgtcactc ggctgcatgc ttagtgcact      120

cacgcagtat aattaataac taattactgt cgttgacagg acacgagtaa ctcgtctatc      180

ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga tcatcagcac atctaggttt      240

cgtccgggtg tgaccgaaag gtaagatgga gagccttgtc cctggtttca acgagaaaac      300

acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac gtgctcgtac gtggctttgg      360

agactccgtg gaggaggtct tatcagaggc acgtcaacat cttaaagatg gcacttgtgg      420

cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa cagccctatg tgttcatcaa      480

acgttcggat gctcgaactg cacctcatgg tcatgttatg gttgagctgg tagcagaact      540

cgaaggcatt cagtacggtc gtagtggtga gacacttggt gtccttgtcc ctcatgtggg      600

cgaaatacca gtggcttacc gcaaggttct tcttcgtaag aacggtaata aaggagctgg      660

tggccatagt tacggcgccg atctaaagtc atttgactta ggcgacgagc ttggcactga      720

tccttatgaa gattttcaag aaaactggaa cactaaacat agcagtggtg ttacccgtga      780

actcatgcgt gagcttaacg gaggggcata cactcgctat gtcgataaca acttctgtgg      840

ccctgatggc taccctcttg agtgcattaa agaccttcta gcacgtgctg gtaaagcttc      900

atgcactttg tccgaacaac tggactttat tgacactaag aggggtgtat actgctgccg      960

tgaacatgag catgaaattg cttggtacac ggaacgttct gaaaagagct atgaattgca     1020

gacacctttt gaaattaaat tggcaaagaa atttgacacc ttcaatgggg aatgtccaaa     1080

ttttgtattt cccttaaatt ccataatcaa gactattcaa ccaagggttg aaaagaaaaa     1140

gcttgatggc tttatgggta gaattcgatc tgtctatcca gttgcgtcac caaatgaatg     1200

caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat tgtggtgaaa cttcatggca     1260

gacgggcgat tttgttaaag ccacttgcga attttgtggc actgagaatt tgactaaaga     1320

aggtgccact acttgtggtt acttacccca aaatgctgtt gttaaaattt attgtccagc     1380

atgtcacaat tcagaagtag gacctgagca tagtcttgcc gaataccata atgaatctgg     1440

cttgaaaacc attcttcgta agggtggtcg cactattgcc tttggaggct gtgtgttctc     1500

ttatgttggt tgccataaca agtgtgccta ttgggttcca cgtgctagcg ctaacatagg     1560

ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt cttaatgaca accttcttga     1620

aatactccaa aaagagaaag tcaacatcaa tattgttggt gactttaaac ttaatgaaga     1680

gatcgccatt attttggcat ctttttctgc ttccacaagt gcttttgtgg aaactgtgaa     1740

aggtttggat tataaagcat tcaaacaaat tgttgaatcc tgtggtaatt ttaaagttac     1800

aaaaggaaaa gctaaaaaag gtgcctggaa tattggtgaa cagaaatcaa tactgagtcc     1860

tctttatgca tttgcatcag aggctgctcg tgttgtacga tcaattttct cccgcactct     1920

tgaaactgct caaaattctg tgcgtgtttt acagaaggcc gctataacaa tactagatgg     1980

aatttcacag tattcactga gactcattga tgctatgatg ttcacatctg atttggctac     2040

taacaatcta gttgtaatgg cctacattac aggtggtgtt gttcagttga cttcgcagtg     2100

gctaactaac atctttggca ctgtttatga aaaactcaaa cccgtccttg attggcttga     2160

agagaagttt aaggaaggtg tagagtttct tagagacggt tgggaaattg ttaaatttat     2220

ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc acctgtgcaa aggaaattaa     2280

ggagagtgtt cagacattct ttaagcttgt aaataaattt ttggctttgt gtgctgactc     2340

tatcattatt ggtggagcta aacttaaagc cttgaattta ggtgaaacat ttgtcacgca     2400

ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa gaaactggcc tactcatgcc     2460

tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa acacttccca cagaagtgtt     2520

aacagaggaa gttgtcttga aaactggtga tttacaacca ttagaacaac ctactagtga     2580

agctgttgaa gctccattgg ttggtacacc agtttgtatt aacgggctta tgttgctcga     2640

aatcaaagac acagaaaagt actgtgccct tgcacctaat atgatggtaa caaacaatac     2700

cttcacactc aaaggcggtg caccaacaaa ggttactttt ggtgatgaca ctgtgataga     2760

agtgcaaggt tacaagagtg tgaatatcac ttttgaactt gatgaaagga ttgataaagt     2820

acttaatgag aagtgctctg cctatacagt tgaactcggt acagaagtaa atgagttcgc     2880

ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca gtatctgaat tacttacacc     2940

actgggcatt gatttagatg agtggagtat ggctacatac tacttatttg atgagtctgg     3000

tgagtttaaa ttggcttcac atatgtattg ttctttctac cctccagatg aggatgaaga     3060

agaaggtgat tgtgaagaag aagagtttga gccatcaact caatatgagt atggtactga     3120

agatgattac caaggtaaac ctttggaatt tggtgccact tctgctgctc ttcaacctga     3180

agaagagcaa gaagaagatt ggttagatga tgatagtcaa caaactgttg gtcaacaaga     3240

cggcagtgag gacaatcaga caactactat tcaaacaatt gttgaggttc aacctcaatt     3300

agagatggaa cttacaccag ttgttcagac tattgaagtg aatagtttta gtggttattt     3360

aaaacttact gacaatgtat acattaaaaa tgcagacatt gtggaagaag ctaaaaaggt     3420

aaaaccaaca gtggttgtta atgcagccaa tgtttacctt aaacatggag gaggtgttgc     3480

aggagcctta aataaggcta ctaacaatgc catgcaagtt gaatctgatg attacatagc     3540

tactaatgga ccacttaaag tgggtggtag ttgtgtttta agcggacaca atcttgctaa     3600

acactgtctt catgttgtcg gcccaaatgt taacaaaggt gaagacattc aacttcttaa     3660

gagtgcttat gaaaatttta atcagcacga agttctactt gcaccattat tatcagctgg     3720

tatttttggt gctgacccta tacattcttt aagagtttgt gtagatactg ttcgcacaaa     3780

tgtctactta gctgtctttg ataaaaatct ctatgacaaa cttgtttcaa gctttttgga     3840

aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag attcctaaag aggaagttaa     3900

gccatttata actgaaagta aaccttcagt tgaacagaga aaacaagatg ataagaaaat     3960

caaagcttgt gttgaagaag ttacaacaac tctggaagaa actaagttcc tcacagaaaa     4020

cttgttactt tatattgaca ttaatggcaa tcttcatcca gattctgcca ctcttgttag     4080

tgacattgac atcactttct taaagaaaga tgctccatat atagtgggtg atgttgttca     4140

agagggtgtt ttaactgctg tggttatacc tactaaaaag gctggtggca ctactgaaat     4200

gctagcgaaa gctttgagaa aagtgccaac agacaattat ataaccactt acccgggtca     4260

gggtttaaat ggttacactg tagaggaggc aaagacagtg cttaaaaagt gtaaaagtgc     4320

cttttacatt ctaccatcta ttatctctaa tgagaagcaa gaaattcttg gaactgtttc     4380

ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca cgcaaattaa tgcctgtctg     4440

tgtggaaact aaagccatag tttcaactat acagcgtaaa tataagggta ttaaaataca     4500

agagggtgtg gttgattatg gtgctagatt ttacttttac accagtaaaa caactgtagc     4560

gtcacttatc aacacactta acgatctaaa tgaaactctt gttacaatgc cacttggcta     4620

tgtaacacat ggcttaaatt tggaagaagc tgctcggtat atgagatctc tcaaagtgcc     4680

agctacagtt tctgtttctt cacctgatgc tgttacagcg tataatggtt atcttacttc     4740

ttcttctaaa acacctgaag aacattttat tgaaaccatc tcacttgctg gttcctataa     4800

agattggtcc tattctggac aatctacaca actaggtata gaatttctta agagaggtga     4860

taaaagtgta tattacacta gtaatcctac cacattccac ctagatggtg aagttatcac     4920

ctttgacaat cttaagacac ttctttcttt gagagaagtg aggactatta aggtgtttac     4980

aacagtagac aacattaacc tccacacgca agttgtggac atgtcaatga catatggaca     5040

acagtttggt ccaacttatt tggatggagc tgatgttact aaaataaaac ctcataattc     5100

acatgaaggt aaaacatttt atgttttacc taatgatgac actctacgtg ttgaggcttt     5160

tgagtactac cacacaactg atcctagttt tctgggtagg tacatgtcag cattaaatca     5220

cactaaaaag tggaaatacc cacaagttaa tggtttaact tctattaaat gggcagataa     5280

caactgttat cttgccactg cattgttaac actccaacaa atagagttga agtttaatcc     5340

acctgctcta caagatgctt attacagagc aagggctggt gaagctgcta acttttgtgc     5400

acttatctta gcctactgta ataagacagt aggtgagtta ggtgatgtta gagaaacaat     5460

gagttacttg tttcaacatg ccaatttaga ttcttgcaaa agagtcttga acgtggtgtg     5520

taaaacttgt ggacaacagc agacaaccct taagggtgta gaagctgtta tgtacatggg     5580

cacactttct tatgaacaat ttaagaaagg tgttcagata ccttgtacgt gtggtaaaca     5640

agctacaaaa tatctagtac aacaggagtc accttttgtt atgatgtcag caccacctgc     5700

tcagtatgaa cttaagcatg gtacatttac ttgtgctagt gagtacactg gtaattacca     5760

gtgtggtcac tataaacata taacttctaa agaaactttg tattgcatag acggtgcttt     5820

acttacaaag tcctcagaat acaaaggtcc tattacggat gttttctaca aagaaaacag     5880

ttacacaaca accataaaac cagttactta taaattggat ggtgttgttt gtacagaaat     5940

tgaccctaag ttggacaatt attataagaa agacaattct tatttcacag agcaaccaat     6000

tgatcttgta ccaaaccaac catatccaaa cgcaagcttc gataatttta agtttgtatg     6060

tgataatatc aaatttgctg atgatttaaa ccagttaact ggttataaga aacctgcttc     6120

aagagagctt aaagttacat ttttccctga cttaaatggt gatgtggtgg ctattgatta     6180

taaacactac acaccctctt ttaagaaagg agctaaattg ttacataaac ctattgtttg     6240

gcatgttaac aatgcaacta ataaagccac gtataaacca aatacctggt gtatacgttg     6300

tctttggagc acaaaaccag ttgaaacatc aaattcgttt gatgtactga agtcagagga     6360

cgcgcaggga atggataatc ttgcctgcga agatctaaaa ccagtctctg aagaagtagt     6420

ggaaaatcct accatacaga aagacgttct tgagtgtaat gtgaaaacta ccgaagttgt     6480

aggagacatt atacttaaac cagcaaataa tagtttaaaa attacagaag aggttggcca     6540

cacagatcta atggctgctt atgtagacaa ttctagtctt actattaaga aacctaatga     6600

attatctaga gtattaggtt tgaaaaccct tgctactcat ggtttagctg ctgttaatag     6660

tgtcccttgg gatactatag ctaattatgc taagcctttt cttaacaaag ttgttagtac     6720

aactactaac atagttacac ggtgtttaaa ccgtgtttgt actaattata tgccttattt     6780

ctttacttta ttgctacaat tgtgtacttt tactagaagt acaaattcta gaattaaagc     6840

atctatgccg actactatag caaagaatac tgttaagagt gtcggtaaat tttgtctaga     6900

ggcttcattt aattatttga agtcacctaa tttttctaaa ctgataaata ttataatttg     6960

gtttttacta ttaagtgttt gcctaggttc tttaatctac tcaaccgctg ctttaggtgt     7020

tttaatgtct aatttaggca tgccttctta ctgtactggt tacagagaag gctatttgaa     7080

ctctactaat gtcactattg caacctactg tactggttct ataccttgta gtgtttgtct     7140

tagtggttta gattctttag acacctatcc ttctttagaa actatacaaa ttaccatttc     7200

atcttttaaa tgggatttaa ctgcttttgg cttagttgca gagtggtttt tggcatatat     7260

tcttttcact aggtttttct atgtacttgg attggctgca atcatgcaat tgtttttcag     7320

ctattttgca gtacatttta ttagtaattc ttggcttatg tggttaataa ttaatcttgt     7380

acaaatggcc ccgatttcag ctatggttag aatgtacatc ttctttgcat cattttatta     7440

tgtatggaaa agttatgtgc atgttgtaga cggttgtaat tcatcaactt gtatgatgtg     7500

ttacaaacgt aatagagcaa caagagtcga atgtacaact attgttaatg gtgttagaag     7560

gtccttttat gtctatgcta atggaggtaa aggcttttgc aaactacaca attggaattg     7620

tgttaattgt gatacattct gtgctggtag tacatttatt agtgatgaag ttgcgagaga     7680

cttgtcacta cagtttaaaa gaccaataaa tcctactgac cagtcttctt acatcgttga     7740

tagtgttaca gtgaagaatg gttccatcca tctttacttt gataaagctg gtcaaaagac     7800

ttatgaaaga cattctctct ctcattttgt taacttagac aacctgagag ctaataacac     7860

taaaggttca ttgcctatta atgttatagt ttttgatggt aaatcaaaat gtgaagaatc     7920

atctgcaaaa tcagcgtctg tttactacag tcagcttatg tgtcaaccta tactgttact     7980

agatcaggca ttagtgtctg atgttggtga tagtgcggaa gttgcagtta aaatgtttga     8040

tgcttacgtt aatacgtttt catcaacttt taacgtacca atggaaaaac tcaaaacact     8100

agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc ttagacaatg tcttatctac     8160

ttttatttca gcagctcggc aagggtttgt tgattcagat gtagaaacta aagatgttgt     8220

tgaatgtctt aaattgtcac atcaatctga catagaagtt actggcgata gttgtaataa     8280

ctatatgctc acctataaca aagttgaaaa catgacaccc cgtgaccttg gtgcttgtat     8340

tgactgtagt gcgcgtcata ttaatgcgca ggtagcaaaa agtcacaaca ttgctttgat     8400

atggaacgtt aaagatttca tgtcattgtc tgaacaacta cgaaaacaaa tacgtagtgc     8460

tgctaaaaag aataacttac cttttaagtt gacatgtgca actactagac aagttgttaa     8520

tgttgtaaca acaaagatag cacttaaggg tggtaaaatt gttaataatt ggttgaagca     8580

gttaattaaa gttacacttg tgttcctttt tgttgctgct attttctatt taataacacc     8640

tgttcatgtc atgtctaaac atactgactt ttcaagtgaa atcataggat acaaggctat     8700

tgatggtggt gtcactcgtg acatagcatc tacagatact tgttttgcta acaaacatgc     8760

tgattttgac acatggttta gccagcgtgg tggtagttat actaatgaca aagcttgccc     8820

attgattgct gcagtcataa caagagaagt gggttttgtc gtgcctggtt tgcctggcac     8880

gatattacgc acaactaatg gtgacttttt gcatttctta cctagagttt ttagtgcagt     8940

tggtaacatc tgttacacac catcaaaact tatagagtac actgactttg caacatcagc     9000

ttgtgttttg gctgctgaat gtacaatttt taaagatgct tctggtaagc cagtaccata     9060

ttgttatgat accaatgtac tagaaggttc tgttgcttat gaaagtttac gccctgacac     9120

acgttatgtg ctcatggatg gctctattat tcaatttcct aacacctacc ttgaaggttc     9180

tgttagagtg gtaacaactt ttgattctga gtactgtagg cacggcactt gtgaaagatc     9240

agaagctggt gtttgtgtat ctactagtgg tagatgggta cttaacaatg attattacag     9300

atctttacca ggagttttct gtggtgtaga tgctgtaaat ttacttacta atatgtttac     9360

accactaatt caacctattg gtgctttgga catatcagca tctatagtag ctggtggtat     9420

tgtagctatc gtagtaacat gccttgccta ctattttatg aggtttagaa gagcttttgg     9480

tgaatacagt catgtagttg cctttaatac tttactattc cttatgtcat tcactgtact     9540

ctgtttaaca ccagtttact cattcttacc tggtgtttat tctgttattt acttgtactt     9600

gacattttat cttactaatg atgtttcttt tttagcacat attcagtgga tggttatgtt     9660

cacaccttta gtacctttct ggataacaat tgcttatatc atttgtattt ccacaaagca     9720

tttctattgg ttctttagta attacctaaa gagacgtgta gtctttaatg gtgtttcctt     9780

tagtactttt gaagaagctg cgctgtgcac ctttttgtta aataaagaaa tgtatctaaa     9840

gttgcgtagt gatgtgctat tacctcttac gcaatataat agatacttag ctctttataa     9900

taagtacaag tattttagtg gagcaatgga tacaactagc tacagagaag ctgcttgttg     9960

tcatctcgca aaggctctca atgacttcag taactcaggt tctgatgttc tttaccaacc    10020

accacaaacc tctatcacct cagctgtttt gcagagtggt tttagaaaaa tggcattccc    10080

atctggtaaa gttgagggtt gtatggtaca agtaacttgt ggtacaacta cacttaacgg    10140

tctttggctt gatgacgtag tttactgtcc aagacatgtg atctgcacct ctgaagacat    10200

gcttaaccct aattatgaag atttactcat tcgtaagtct aatcataatt tcttggtaca    10260

ggctggtaat gttcaactca gggttattgg acattctatg caaaattgtg tacttaagct    10320

taaggttgat acagccaatc ctaagacacc taagtataag tttgttcgca ttcaaccagg    10380

acagactttt tcagtgttag cttgttacaa tggttcacca tctggtgttt accaatgtgc    10440

tatgaggccc aatttcacta ttaagggttc attccttaat ggttcatgtg gtagtgttgg    10500

ttttaacata gattatgact gtgtctcttt ttgttacatg caccatatgg aattaccaac    10560

tggagttcat gctggcacag acttagaagg taacttttat ggaccttttg ttgacaggca    10620

aacagcacaa gcagctggta cggacacaac tattacagtt aatgttttag cttggttgta    10680

cgctgctgtt ataaatggag acaggtggtt tctcaatcga tttaccacaa ctcttaatga    10740

ctttaacctt gtggctatga agtacaatta tgaacctcta acacaagacc atgttgacat    10800

actaggacct ctttctgctc aaactggaat tgccgtttta gatatgtgtg cttcattaaa    10860

agaattactg caaaatggta tgaatggacg taccatattg ggtagtgctt tattagaaga    10920

tgaatttaca ccttttgatg ttgttagaca atgctcaggt gttactttcc aaagtgcagt    10980

gaaaagaaca atcaagggta cacaccactg gttgttactc acaattttga cttcactttt    11040

agttttagtc cagagtactc aatggtcttt gttctttttt ttgtatgaaa atgccttttt    11100

accttttgct atgggtatta ttgctatgtc tgcttttgca atgatgtttg tcaaacataa    11160

gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc actgtagctt attttaatat    11220

ggtctatatg cctgctagtt gggtgatgcg tattatgaca tggttggata tggttgatac    11280

tagtttgtct ggttttaagc taaaagactg tgttatgtat gcatcagctg tagtgttact    11340

aatccttatg acagcaagaa ctgtgtatga tgatggtgct aggagagtgt ggacacttat    11400

gaatgtcttg acactcgttt ataaagttta ttatggtaat gctttagatc aagccatttc    11460

catgtgggct cttataatct ctgttacttc taactactca ggtgtagtta caactgtcat    11520

gtttttggcc agaggtattg tttttatgtg tgttgagtat tgccctattt tcttcataac    11580

tggtaataca cttcagtgta taatgctagt ttattgtttc ttaggctatt tttgtacttg    11640

ttactttggc ctcttttgtt tactcaaccg ctactttaga ctgactcttg gtgtttatga    11700

ttacttagtt tctacacagg agtttagata tatgaattca cagggactac tcccacccaa    11760

gaatagcata gatgccttca aactcaacat taaattgttg ggtgttggtg gcaaaccttg    11820

tatcaaagta gccactgtac agtctaaaat gtcagatgta aagtgcacat cagtagtctt    11880

actctcagtt ttgcaacaac tcagagtaga atcatcatct aaattgtggg ctcaatgtgt    11940

ccagttacac aatgacattc tcttagctaa agatactact gaagcctttg aaaaaatggt    12000

ttcactactt tctgttttgc tttccatgca gggtgctgta gacataaaca agctttgtga    12060

agaaatgctg gacaacaggg caaccttaca agctatagcc tcagagttta gttcccttcc    12120

atcatatgca gcttttgcta ctgctcaaga agcttatgag caggctgttg ctaatggtga    12180

ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat gtggctaaat ctgaatttga    12240

ccgtgatgca gccatgcaac gtaagttgga aaagatggct gatcaagcta tgacccaaat    12300

gtataaacag gctagatctg aggacaagag ggcaaaagtt actagtgcta tgcagacaat    12360

gcttttcact atgcttagaa agttggataa tgatgcactc aacaacatta tcaacaatgc    12420

aagagatggt tgtgttccct tgaacataat acctcttaca acagcagcca aactaatggt    12480

tgtcatacca gactataaca catataaaaa tacgtgtgat ggtacaacat ttacttatgc    12540

atcagcattg tgggaaatcc aacaggttgt agatgcagat agtaaaattg ttcaacttag    12600

tgaaattagt atggacaatt cacctaattt agcatggcct cttattgtaa cagctttaag    12660

ggccaattct gctgtcaaat tacagaataa tgagcttagt cctgttgcac tacgacagat    12720

gtcttgtgct gccggtacta cacaaactgc ttgcactgat gacaatgcgt tagcttacta    12780

caacacaaca aagggaggta ggtttgtact tgcactgtta tccgatttac aggatttgaa    12840

atgggctaga ttccctaaga gtgatggaac tggtactatc tatacagaac tggaaccacc    12900

ttgtaggttt gttacagaca cacctaaagg tcctaaagtg aagtatttat actttattaa    12960

aggattaaac aacctaaata gaggtatggt acttggtagt ttagctgcca cagtacgtct    13020

acaagctggt aatgcaacag aagtgcctgc caattcaact gtattatctt tctgtgcttt    13080

tgctgtagat gctgctaaag cttacaaaga ttatctagct agtgggggac aaccaatcac    13140

taattgtgtt aagatgttgt gtacacacac tggtactggt caggcaataa cagttacacc    13200

ggaagccaat atggatcaag aatcctttgg tggtgcatcg tgttgtctgt actgccgttg    13260

ccacatagat catccaaatc ctaaaggatt ttgtgactta aaaggtaagt atgtacaaat    13320

acctacaact tgtgctaatg accctgtggg ttttacactt aaaaacacag tctgtaccgt    13380

ctgcggtatg tggaaaggtt atggctgtag ttgtgatcaa ctccgcgaac ccatgcttca    13440

gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg taagtgcagc ccgtcttaca    13500

ccgtgcggca caggcactag tactgatgtc gtatacaggg cttttgacat ctacaatgat    13560

aaagtagctg gttttgctaa attcctaaaa actaattgtt gtcgcttcca agaaaaggac    13620

gaagatgaca atttaattga ttcttacttt gtagttaaga gacacacttt ctctaactac    13680

caacatgaag aaacaattta taatttactt aaggattgtc cagctgttgc taaacatgac    13740

ttctttaagt ttagaataga cggtgacatg gtaccacata tatcacgtca acgtcttact    13800

aaatacacaa tggcagacct cgtctatgct ttaaggcatt ttgatgaagg taattgtgac    13860

acattaaaag aaatacttgt cacatacaat tgttgtgatg atgattattt caataaaaag    13920

gactggtatg attttgtaga aaacccagat atattacgcg tatacgccaa cttaggtgaa    13980

cgtgtacgcc aagctttgtt aaaaacagta caattctgtg atgccatgcg aaatgctggt    14040

attgttggtg tactgacatt agataatcaa gatctcaatg gtaactggta tgatttcggt    14100

gatttcatac aaaccacgcc aggtagtgga gttcctgttg tagattctta ttattcattg    14160

ttaatgccta tattaacctt gaccagggct ttaactgcag agtcacatgt tgacactgac    14220

ttaacaaagc cttacattaa gtgggatttg ttaaaatatg acttcacgga agagaggtta    14280

aaactctttg accgttattt taaatattgg gatcagacat accacccaaa ttgtgttaac    14340

tgtttggatg acagatgcat tctgcattgt gcaaacttta atgttttatt ctctacagtg    14400

ttcccaccta caagttttgg accactagtg agaaaaatat ttgttgatgg tgttccattt    14460

gtagtttcaa ctggatacca cttcagagag ctaggtgttg tacataatca ggatgtaaac    14520

ttacatagct ctagacttag ttttaaggaa ttacttgtgt atgctgctga ccctgctatg    14580

cacgctgctt ctggtaatct attactagat aaacgcacta cgtgcttttc agtagctgca    14640

cttactaaca atgttgcttt tcaaactgtc aaacccggta attttaacaa agacttctat    14700

gactttgctg tgtctaaggg tttctttaag gaaggaagtt ctgttgaatt aaaacacttc    14760

ttctttgctc aggatggtaa tgctgctatc agcgattatg actactatcg ttataatcta    14820

ccaacaatgt gtgatatcag acaactacta tttgtagttg aagttgttga taagtacttt    14880

gattgttacg atggtggctg tattaatgct aaccaagtca tcgtcaacaa cctagacaaa    14940

tcagctggtt ttccatttaa taaatggggt aaggctagac tttattatga ttcaatgagt    15000

tatgaggatc aagatgcact tttcgcatat acaaaacgta atgtcatccc tactataact    15060

caaatgaatc ttaagtatgc cattagtgca aagaatagag ctcgcaccgt agctggtgtc    15120

tctatctgta gtactatgac caatagacag tttcatcaaa aattattgaa atcaatagcc    15180

gccactagag gagctactgt agtaattgga acaagcaaat tctatggtgg ttggcacaac    15240

atgttaaaaa ctgtttatag tgatgtagaa aaccctcacc ttatgggttg ggattatcct    15300

aaatgtgata gagccatgcc taacatgctt agaattatgg cctcacttgt tcttgctcgc    15360

aaacatacaa cgtgttgtag cttgtcacac cgtttctata gattagctaa tgagtgtgct    15420

caagtattga gtgaaatggt catgtgtggc ggttcactat atgttaaacc aggtggaacc    15480

tcatcaggag atgccacaac tgcttatgct aatagtgttt ttaacatttg tcaagctgtc    15540

acggccaatg ttaatgcact tttatctact gatggtaaca aaattgccga taagtatgtc    15600

cgcaatttac aacacagact ttatgagtgt ctctatagaa atagagatgt tgacacagac    15660

tttgtgaatg agttttacgc atatttgcgt aaacatttct caatgatgat actctctgac    15720

gatgctgttg tgtgtttcaa tagcacttat gcatctcaag gtctagtggc tagcataaag    15780

aactttaagt cagttcttta ttatcaaaac aatgttttta tgtctgaagc aaaatgttgg    15840

actgagactg accttactaa aggacctcat gaattttgct ctcaacatac aatgctagtt    15900

aaacagggtg atgattatgt gtaccttcct tacccagatc catcaagaat cctaggggcc    15960

ggctgttttg tagatgatat cgtaaaaaca gatggtacac ttatgattga acggttcgtg    16020

tctttagcta tagatgctta cccacttact aaacatccta atcaggagta tgctgatgtc    16080

tttcatttgt acttacaata cataagaaag ctacatgatg agttaacagg acacatgtta    16140

gacatgtatt ctgttatgct tactaatgat aacacttcaa ggtattggga acctgagttt    16200

tatgaggcta tgtacacacc gcatacagtc ttacaggctg ttggggcttg tgttctttgc    16260

aattcacaga cttcattaag atgtggtgct tgcatacgta gaccattctt atgttgtaaa    16320

tgctgttacg accatgtcat atcaacatca cataaattag tcttgtctgt taatccgtat    16380

gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc aactttactt aggaggtatg    16440

agctattatt gtaaatcaca taaaccaccc attagttttc cattgtgtgc taatggacaa    16500

gtttttggtt tatataaaaa tacatgtgtt ggtagcgata atgttactga ctttaatgca    16560

attgcaacat gtgactggac aaatgctggt gattacattt tagctaacac ctgtactgaa    16620

agactcaagc tttttgcagc agaaacgctc aaagctactg aggagacatt taaactgtct    16680

tatggtattg ctactgtacg tgaagtgctg tctgacagag aattacatct ttcatgggaa    16740

gttggtaaac ctagaccacc acttaaccga aattatgtct ttactggtta tcgtgtaact    16800

aaaaacagta aagtacaaat aggagagtac acctttgaaa aaggtgacta tggtgatgct    16860

gttgtttacc gaggtacaac aacttacaaa ttaaatgttg gtgattattt tgtgctgaca    16920

tcacatacag taatgccatt aagtgcacct acactagtgc cacaagagca ctatgttaga    16980

attactggct tatacccaac actcaatatc tcagatgagt tttctagcaa tgttgcaaat    17040

tatcaaaagg ttggtatgca aaagtattct acactccagg gaccacctgg tactggtaag    17100

agtcattttg ctattggcct agctctctac tacccttctg ctcgcatagt gtatacagct    17160

tgctctcatg ccgctgttga tgcactatgt gagaaggcat taaaatattt gcctatagat    17220

aaatgtagta gaattatacc tgcacgtgct cgtgtagagt gttttgataa attcaaagtg    17280

aattcaacat tagaacagta tgtcttttgt actgtaaatg cattgcctga gacgacagca    17340

gatatagttg tctttgatga aatttcaatg gccacaaatt atgatttgag tgttgtcaat    17400

gccagattac gtgctaagca ctatgtgtac attggcgacc ctgctcaatt acctgcacca    17460

cgcacattgc taactaaggg cacactagaa ccagaatatt tcaattcagt gtgtagactt    17520

atgaaaacta taggtccaga catgttcctc ggaacttgtc ggcgttgtcc tgctgaaatt    17580

gttgacactg tgagtgcttt ggtttatgat aataagctta aagcacataa agacaaatca    17640

gctcaatgct ttaaaatgtt ttataagggt gttatcacgc atgatgtttc atctgcaatt    17700

aacaggccac aaataggcgt ggtaagagaa ttccttacac gtaaccctgc ttggagaaaa    17760

gctgtcttta tttcacctta taattcacag aatgctgtag cctcaaagat tttgggacta    17820

ccaactcaaa ctgttgattc atcacagggc tcagaatatg actatgtcat attcactcaa    17880

accactgaaa cagctcactc ttgtaatgta aacagattta atgttgctat taccagagca    17940

aaagtaggca tactttgcat aatgtctgat agagaccttt atgacaagtt gcaatttaca    18000

agtcttgaaa ttccacgtag gaatgtggca actttacaag ctgaaaatgt aacaggactc    18060

tttaaagatt gtagtaaggt aatcactggg ttacatccta cacaggcacc tacacacctc    18120

agtgttgaca ctaaattcaa aactgaaggt ttatgtgttg acatacctgg catacctaag    18180

gacatgacct atagaagact catctctatg atgggtttta aaatgaatta tcaagttaat    18240

ggttacccta acatgtttat cacccgcgaa gaagctataa gacatgtacg tgcatggatt    18300

ggcttcgatg tcgaggggtg tcatgctact agagaagctg ttggtaccaa tttaccttta    18360

cagctaggtt tttctacagg tgttaaccta gttgctgtac ctacaggtta tgttgataca    18420

cctaataata cagatttttc cagagttagt gctaaaccac cgcctggaga tcaatttaaa    18480

cacctcatac cacttatgta caaaggactt ccttggaatg tagtgcgtat aaagattgta    18540

caaatgttaa gtgacacact taaaaatctc tctgacagag tcgtatttgt cttatgggca    18600

catggctttg agttgacatc tatgaagtat tttgtgaaaa taggacctga gcgcacctgt    18660

tgtctatgtg atagacgtgc cacatgcttt tccactgctt cagacactta tgcctgttgg    18720

catcattcta ttggatttga ttacgtctat aatccgttta tgattgatgt tcaacaatgg    18780

ggttttacag gtaacctaca aagcaaccat gatctgtatt gtcaagtcca tggtaatgca    18840

catgtagcta gttgtgatgc aatcatgact aggtgtctag ctgtccacga gtgctttgtt    18900

aagcgtgttg actggactat tgaatatcct ataattggtg atgaactgaa gattaatgcg    18960

gcttgtagaa aggttcaaca catggttgtt aaagctgcat tattagcaga caaattccca    19020

gttcttcacg acattggtaa ccctaaagct attaagtgtg tacctcaagc tgatgtagaa    19080

tggaagttct atgatgcaca gccttgtagt gacaaagctt ataaaataga agaattattc    19140

tattcttatg ccacacattc tgacaaattc acagatggtg tatgcctatt ttggaattgc    19200

aatgtcgata gatatcctgc taattccatt gtttgtagat ttgacactag agtgctatct    19260

aaccttaact tgcctggttg tgatggtggc agtttgtatg taaataaaca tgcattccac    19320

acaccagctt ttgataaaag tgcttttgtt aatttaaaac aattaccatt tttctattac    19380

tctgacagtc catgtgagtc tcatggaaaa caagtagtgt cagatataga ttatgtacca    19440

ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg gtgctgtctg tagacatcat    19500

gctaatgagt acagattgta tctcgatgct tataacatga tgatctcagc tggctttagc    19560

ttgtgggttt acaaacaatt tgatacttat aacctctgga acacttttac aagacttcag    19620

agtttagaaa atgtggcttt taatgttgta aataagggac actttgatgg acaacagggt    19680

gaagtaccag tttctatcat taataacact gtttacacaa aagttgatgg tgttgatgta    19740

gaattgtttg aaaataaaac aacattacct gttaatgtag catttgagct ttgggctaag    19800

cgcaacatta aaccagtacc agaggtgaaa atactcaata atttgggtgt ggacattgct    19860

gctaatactg tgatctggga ctacaaaaga gatgctccag cacatatatc tactattggt    19920

gtttgttcta tgactgacat agccaagaaa ccaactgaaa cgatttgtgc accactcact    19980

gtcttttttg atggtagagt tgatggtcaa gtagacttat ttagaaatgc ccgtaatggt    20040

gttcttatta cagaaggtag tgttaaaggt ttacaaccat ctgtaggtcc caaacaagct    20100

agtcttaatg gagtcacatt aattggagaa gccgtaaaaa cacagttcaa ttattataag    20160

aaagttgatg gtgttgtcca acaattacct gaaacttact ttactcagag tagaaattta    20220

caagaattta aacccaggag tcaaatggaa attgatttct tagaattagc tatggatgaa    20280

ttcattgaac ggtataaatt agaaggctat gccttcgaac atatcgttta tggagatttt    20340

agtcatagtc agttaggtgg tttacatcta ctgattggac tagctaaacg ttttaaggaa    20400

tcaccttttg aattagaaga ttttattcct atggacagta cagttaaaaa ctatttcata    20460

acagatgcgc aaacaggttc atctaagtgt gtgtgttctg ttattgattt attacttgat    20520

gattttgttg aaataataaa atcccaagat ttatctgtag tttctaaggt tgtcaaagtg    20580

actattgact atacagaaat ttcatttatg ctttggtgta aagatggcca tgtagaaaca    20640

ttttacccaa aattacaatc tagtcaagcg tggcaaccgg gtgttgctat gcctaatctt    20700

tacaaaatgc aaagaatgct attagaaaag tgtgaccttc aaaattatgg tgatagtgca    20760

acattaccta aaggcataat gatgaatgtc gcaaaatata ctcaactgtg tcaatattta    20820

aacacattaa cattagctgt accctataat atgagagtta tacattttgg tgctggttct    20880

gataaaggag ttgcaccagg tacagctgtt ttaagacagt ggttgcctac gggtacgctg    20940

cttgtcgatt cagatcttaa tgactttgtc tctgatgcag attcaacttt gattggtgat    21000

tgtgcaactg tacatacagc taataaatgg gatctcatta ttagtgatat gtacgaccct    21060

aagactaaaa atgttacaaa agaaaatgac tctaaagagg gttttttcac ttacatttgt    21120

gggtttatac aacaaaagct agctcttgga ggttccgtgg ctataaagat aacagaacat    21180

tcttggaatg ctgatcttta taagctcatg ggacacttcg catggtggac agcctttgtt    21240

actaatgtga atgcgtcatc atctgaagca tttttaattg gatgtaatta tcttggcaaa    21300

ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt acatattttg gaggaataca    21360

aatccaattc agttgtcttc ctattcttta tttgacatga gtaaatttcc ccttaaatta    21420

aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca atgatatgat tttatctctt    21480

cttagtaaag gtagacttat aattagagaa aacaacagag ttgttatttc tagtgatgtt    21540

cttgttaaca actaaacgaa ca atg ttt gtt ttt ctt gtt tta ttg cca cta     21592
                         Met Phe Val Phe Leu Val Leu Leu Pro Leu          
                         1               5                   10           

gtc tct agt cag tgt gtt aat ctt aca acc aga act caa tta ccc cct      21640
Val Ser Ser Gln Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro           
                15                  20                  25                

gca tac act aat tct ttc aca cgt ggt gtt tat tac cct gac aaa gtt      21688
Ala Tyr Thr Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val           
            30                  35                  40                    

ttc aga tcc tca gtt tta cat tca act cag gac ttg ttc tta cct ttc      21736
Phe Arg Ser Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe           
        45                  50                  55                        

ttt tcc aat gtt act tgg ttc cat gct ata cat gtc tct ggg acc aat      21784
Phe Ser Asn Val Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn           
    60                  65                  70                            

ggt act aag agg ttt gat aac cct gtc cta cca ttt aat gat ggt gtt      21832
Gly Thr Lys Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val           
75                  80                  85                  90            

tat ttt gct tcc act gag aag tct aac ata ata aga ggc tgg att ttt      21880
Tyr Phe Ala Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe           
                95                  100                 105               

ggt act act tta gat tcg aag acc cag tcc cta ctt att gtt aat aac      21928
Gly Thr Thr Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn           
            110                 115                 120                   

gct act aat gtt gtt att aaa gtc tgt gaa ttt caa ttt tgt aat gat      21976
Ala Thr Asn Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp           
        125                 130                 135                       

cca ttt ttg ggt gtt tat tac cac aaa aac aac aaa agt tgg atg gaa      22024
Pro Phe Leu Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu           
    140                 145                 150                           

agt gag ttc aga gtt tat tct agt gcg aat aat tgc act ttt gaa tat      22072
Ser Glu Phe Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr           
155                 160                 165                 170           

gtc tct cag cct ttt ctt atg gac ctt gaa gga aaa cag ggt aat ttc      22120
Val Ser Gln Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe           
                175                 180                 185               

aaa aat ctt agg gaa ttt gtg ttt aag aat att gat ggt tat ttt aaa      22168
Lys Asn Leu Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys           
            190                 195                 200                   

ata tat tct aag cac acg cct att aat tta gtg cgt gat ctc cct cag      22216
Ile Tyr Ser Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln           
        205                 210                 215                       

ggt ttt tcg gct tta gaa cca ttg gta gat ttg cca ata ggt att aac      22264
Gly Phe Ser Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn           
    220                 225                 230                           

atc act agg ttt caa act tta ctt gct tta cat aga agt tat ttg act      22312
Ile Thr Arg Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr           
235                 240                 245                 250           

cct ggt gat tct tct tca ggt tgg aca gct ggt gct gca gct tat tat      22360
Pro Gly Asp Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr           
                255                 260                 265               

gtg ggt tat ctt caa cct agg act ttt cta tta aaa tat aat gaa aat      22408
Val Gly Tyr Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn           
            270                 275                 280                   

gga acc att aca gat gct gta gac tgt gca ctt gac cct ctc tca gaa      22456
Gly Thr Ile Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu           
        285                 290                 295                       

aca aag tgt acg ttg aaa tcc ttc act gta gaa aaa gga atc tat caa      22504
Thr Lys Cys Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln           
    300                 305                 310                           

act tct aac ttt aga gtc caa cca aca gaa tct att gtt aga ttt cct      22552
Thr Ser Asn Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro           
315                 320                 325                 330           

aat att aca aac ttg tgc cct ttt ggt gaa gtt ttt aac gcc acc aga      22600
Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg           
                335                 340                 345               

ttt gca tct gtt tat gct tgg aac agg aag aga atc agc aac tgt gtt      22648
Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val           
            350                 355                 360                   

gct gat tat tct gtc cta tat aat tcc gca tca ttt tcc act ttt aag      22696
Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys           
        365                 370                 375                       

tgt tat gga gtg tct cct act aaa tta aat gat ctc tgc ttt act aat      22744
Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn           
    380                 385                 390                           

gtc tat gca gat tca ttt gta att aga ggt gat gaa gtc aga caa atc      22792
Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile           
395                 400                 405                 410           

gct cca ggg caa act gga aag att gct gat tat aat tat aaa tta cca      22840
Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro           
                415                 420                 425               

gat gat ttt aca ggc tgc gtt ata gct tgg aat tct aac aat ctt gat      22888
Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp           
            430                 435                 440                   

tct aag gtt ggt ggt aat tat aat tac ctg tat aga ttg ttt agg aag      22936
Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys           
        445                 450                 455                       

tct aat ctc aaa cct ttt gag aga gat att tca act gaa atc tat cag      22984
Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln           
    460                 465                 470                           

gcc ggt agc aca cct tgt aat ggt gtt gaa ggt ttt aat tgt tac ttt      23032
Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe           
475                 480                 485                 490           

cct tta caa tca tat ggt ttc caa ccc act aat ggt gtt ggt tac caa      23080
Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln           
                495                 500                 505               

cca tac aga gta gta gta ctt tct ttt gaa ctt cta cat gca cca gca      23128
Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala           
            510                 515                 520                   

act gtt tgt gga cct aaa aag tct act aat ttg gtt aaa aac aaa tgt      23176
Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys           
        525                 530                 535                       

gtc aat ttc aac ttc aat ggt tta aca ggc aca ggt gtt ctt act gag      23224
Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu           
    540                 545                 550                           

tct aac aaa aag ttt ctg cct ttc caa caa ttt ggc aga gac att gct      23272
Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala           
555                 560                 565                 570           

gac act act gat gct gtc cgt gat cca cag aca ctt gag att ctt gac      23320
Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp           
                575                 580                 585               

att aca cca tgt tct ttt ggt ggt gtc agt gtt ata aca cca gga aca      23368
Ile Thr Pro Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr           
            590                 595                 600                   

aat act tct aac cag gtt gct gtt ctt tat cag gat gtt aac tgc aca      23416
Asn Thr Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr           
        605                 610                 615                       

gaa gtc cct gtt gct att cat gca gat caa ctt act cct act tgg cgt      23464
Glu Val Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg           
    620                 625                 630                           

gtt tat tct aca ggt tct aat gtt ttt caa aca cgt gca ggc tgt tta      23512
Val Tyr Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu           
635                 640                 645                 650           

ata ggg gct gaa cat gtc aac aac tca tat gag tgt gac ata ccc att      23560
Ile Gly Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile           
                655                 660                 665               

ggt gca ggt ata tgc gct agt tat cag act cag act aat tct cct cgg      23608
Gly Ala Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg           
            670                 675                 680                   

cgg gca cgt agt gta gct agt caa tcc atc att gcc tac act atg tca      23656
Arg Ala Arg Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser           
        685                 690                 695                       

ctt ggt gca gaa aat tca gtt gct tac tct aat aac tct att gcc ata      23704
Leu Gly Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile           
    700                 705                 710                           

ccc aca aat ttt act att agt gtt acc aca gaa att cta cca gtg tct      23752
Pro Thr Asn Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser           
715                 720                 725                 730           

atg acc aag aca tca gta gat tgt aca atg tac att tgt ggt gat tca      23800
Met Thr Lys Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser           
                735                 740                 745               

act gaa tgc agc aat ctt ttg ttg caa tat ggc agt ttt tgt aca caa      23848
Thr Glu Cys Ser Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln           
            750                 755                 760                   

tta aac cgt gct tta act gga ata gct gtt gaa caa gac aaa aac acc      23896
Leu Asn Arg Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr           
        765                 770                 775                       

caa gaa gtt ttt gca caa gtc aaa caa att tac aaa aca cca cca att      23944
Gln Glu Val Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile           
    780                 785                 790                           

aaa gat ttt ggt ggt ttt aat ttt tca caa ata tta cca gat cca tca      23992
Lys Asp Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser           
795                 800                 805                 810           

aaa cca agc aag agg tca ttt att gaa gat cta ctt ttc aac aaa gtg      24040
Lys Pro Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val           
                815                 820                 825               

aca ctt gca gat gct ggc ttc atc aaa caa tat ggt gat tgc ctt ggt      24088
Thr Leu Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly           
            830                 835                 840                   

gat att gct gct aga gac ctc att tgt gca caa aag ttt aac ggc ctt      24136
Asp Ile Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu           
        845                 850                 855                       

act gtt ttg cca cct ttg ctc aca gat gaa atg att gct caa tac act      24184
Thr Val Leu Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr           
    860                 865                 870                           

tct gca ctg tta gcg ggt aca atc act tct ggt tgg acc ttt ggt gca      24232
Ser Ala Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala           
875                 880                 885                 890           

ggt gct gca tta caa ata cca ttt gct atg caa atg gct tat agg ttt      24280
Gly Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe           
                895                 900                 905               

aat ggt att gga gtt aca cag aat gtt ctc tat gag aac caa aaa ttg      24328
Asn Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu           
            910                 915                 920                   

att gcc aac caa ttt aat agt gct att ggc aaa att caa gac tca ctt      24376
Ile Ala Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu           
        925                 930                 935                       

tct tcc aca gca agt gca ctt gga aaa ctt caa gat gtg gtc aac caa      24424
Ser Ser Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln           
    940                 945                 950                           

aat gca caa gct tta aac acg ctt gtt aaa caa ctt agc tcc aat ttt      24472
Asn Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe           
955                 960                 965                 970           

ggt gca att tca agt gtt tta aat gat atc ctt tca cgt ctt gac aaa      24520
Gly Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys           
                975                 980                 985               

gtt gag gct gaa gtg caa att gat agg ttg atc aca ggc aga  ctt caa     24568
Val Glu Ala Glu Val Gln Ile Asp Arg Leu Ile Thr Gly Arg  Leu Gln          
            990                 995                 1000                  

agt ttg cag  aca tat gtg act caa  caa tta att aga gct  gca gaa       24613
Ser Leu Gln  Thr Tyr Val Thr Gln  Gln Leu Ile Arg Ala  Ala Glu            
        1005                 1010                 1015                    

atc aga gct  tct gct aat ctt gct  gct act aaa atg tca  gag tgt       24658
Ile Arg Ala  Ser Ala Asn Leu Ala  Ala Thr Lys Met Ser  Glu Cys            
        1020                 1025                 1030                    

gta ctt gga  caa tca aaa aga gtt  gat ttt tgt gga aag  ggc tat       24703
Val Leu Gly  Gln Ser Lys Arg Val  Asp Phe Cys Gly Lys  Gly Tyr            
        1035                 1040                 1045                    

cat ctt atg  tcc ttc cct cag tca  gca cct cat ggt gta  gtc ttc       24748
His Leu Met  Ser Phe Pro Gln Ser  Ala Pro His Gly Val  Val Phe            
        1050                 1055                 1060                    

ttg cat gtg  act tat gtc cct gca  caa gaa aag aac ttc  aca act       24793
Leu His Val  Thr Tyr Val Pro Ala  Gln Glu Lys Asn Phe  Thr Thr            
        1065                 1070                 1075                    

gct cct gcc  att tgt cat gat gga  aaa gca cac ttt cct  cgt gaa       24838
Ala Pro Ala  Ile Cys His Asp Gly  Lys Ala His Phe Pro  Arg Glu            
        1080                 1085                 1090                    

ggt gtc ttt  gtt tca aat ggc aca  cac tgg ttt gta aca  caa agg       24883
Gly Val Phe  Val Ser Asn Gly Thr  His Trp Phe Val Thr  Gln Arg            
        1095                 1100                 1105                    

aat ttt tat  gaa cca caa atc att  act aca gac aac aca  ttt gtg       24928
Asn Phe Tyr  Glu Pro Gln Ile Ile  Thr Thr Asp Asn Thr  Phe Val            
        1110                 1115                 1120                    

tct ggt aac  tgt gat gtt gta ata  gga att gtc aac aac  aca gtt       24973
Ser Gly Asn  Cys Asp Val Val Ile  Gly Ile Val Asn Asn  Thr Val            
        1125                 1130                 1135                    

tat gat cct  ttg caa cct gaa tta  gac tca ttc aag gag  gag tta       25018
Tyr Asp Pro  Leu Gln Pro Glu Leu  Asp Ser Phe Lys Glu  Glu Leu            
        1140                 1145                 1150                    

gat aaa tat  ttt aag aat cat aca  tca cca gat gtt gat  tta ggt       25063
Asp Lys Tyr  Phe Lys Asn His Thr  Ser Pro Asp Val Asp  Leu Gly            
        1155                 1160                 1165                    

gac atc tct  ggc att aat gct tca  gtt gta aac att caa  aaa gaa       25108
Asp Ile Ser  Gly Ile Asn Ala Ser  Val Val Asn Ile Gln  Lys Glu            
        1170                 1175                 1180                    

att gac cgc  ctc aat gag gtt gcc  aag aat tta aat gaa  tct ctc       25153
Ile Asp Arg  Leu Asn Glu Val Ala  Lys Asn Leu Asn Glu  Ser Leu            
        1185                 1190                 1195                    

atc gat ctc  caa gaa ctt gga aag  tat gag cag tat ata  aaa tgg       25198
Ile Asp Leu  Gln Glu Leu Gly Lys  Tyr Glu Gln Tyr Ile  Lys Trp            
        1200                 1205                 1210                    

cca tgg tac  att tgg cta ggt ttt  ata gct ggc ttg att  gcc ata       25243
Pro Trp Tyr  Ile Trp Leu Gly Phe  Ile Ala Gly Leu Ile  Ala Ile            
        1215                 1220                 1225                    

gta atg gtg  aca att atg ctt tgc  tgt atg acc agt tgc  tgt agt       25288
Val Met Val  Thr Ile Met Leu Cys  Cys Met Thr Ser Cys  Cys Ser            
        1230                 1235                 1240                    

tgt ctc aag  ggc tgt tgt tct tgt  gga tcc tgc tgc aaa  ttt gat       25333
Cys Leu Lys  Gly Cys Cys Ser Cys  Gly Ser Cys Cys Lys  Phe Asp            
        1245                 1250                 1255                    

gaa gac gac  tct gag cca gtg ctc  aaa gga gtc aaa tta  cat tac       25378
Glu Asp Asp  Ser Glu Pro Val Leu  Lys Gly Val Lys Leu  His Tyr            
        1260                 1265                 1270                    

aca taa acgaacttat ggatttgttt atgagaatct tcacaattgg aactgtaact       25434
Thr                                                                       
                                                                          

ttgaagcaag gtgaaatcaa ggatgctact ccttcagatt ttgttcgcgc tactgcaacg    25494

ataccgatac aagcctcact ccctttcgga tggcttattg ttggcgttgc acttcttgct    25554

gtttttcaga gcgcttccaa aatcataacc ctcaaaaaga gatggcaact agcactctcc    25614

aagggtgttc actttgtttg caacttgctg ttgttgtttg taacagttta ctcacacctt    25674

ttgctcgttg ctgctggcct tgaagcccct tttctctatc tttatgcttt agtctacttc    25734

ttgcagagta taaactttgt aagaataata atgaggcttt ggctttgctg gaaatgccgt    25794

tccaaaaacc cattacttta tgatgccaac tattttcttt gctggcatac taattgttac    25854

gactattgta taccttacaa tagtgtaact tcttcaattg tcattacttc aggtgatggc    25914

acaacaagtc ctatttctga acatgactac cagattggtg gttatactga aaaatgggaa    25974

tctggagtaa aagactgtgt tgtattacac agttacttca cttcagacta ttaccagctg    26034

tactcaactc aattgagtac agacactggt gttgaacatg ttaccttctt catctacaat    26094

aaaattgttg atgagcctga agaacatgtc caaattcaca caatcgacgg ttcatccgga    26154

gttgttaatc cagtaatgga accaatttat gatgaaccga cgacgactac tagcgtgcct    26214

ttgtaagcac aagctgatga gtacgaactt atgtactcat tcgtttcgga agagacaggt    26274

acgttaatag ttaatagcgt acttcttttt cttgctttcg tggtattctt gctagttaca    26334

ctagccatcc ttactgcgct tcgattgtgt gcgtactgct gcaatattgt taacgtgagt    26394

cttgtaaaac cttcttttta cgtttactct cgtgttaaaa atctgaattc ttctagagtt    26454

cctgatcttc tggtctaaac gaactaaata ttatattagt ttttctgttt ggaactttaa    26514

ttttagccat ggcagattcc aacggtacta ttaccgttga agagcttaaa aagctccttg    26574

aacaatggaa cctagtaata ggtttcctat tccttacatg gatttgtctt ctacaatttg    26634

cctatgccaa caggaatagg tttttgtata taattaagtt aattttcctc tggctgttat    26694

ggccagtaac tttagcttgt tttgtgcttg ctgctgttta cagaataaat tggatcaccg    26754

gtggaattgc tatcgcaatg gcttgtcttg taggcttgat gtggctcagc tacttcattg    26814

cttctttcag actgtttgcg cgtacgcgtt ccatgtggtc attcaatcca gaaactaaca    26874

ttcttctcaa cgtgccactc catggcacta ttctgaccag accgcttcta gaaagtgaac    26934

tcgtaatcgg agctgtgatc cttcgtggac atcttcgtat tgctggacac catctaggac    26994

gctgtgacat caaggacctg cctaaagaaa tcactgttgc tacatcacga acgctttctt    27054

attacaaatt gggagcttcg cagcgtgtag caggtgactc aggttttgct gcatacagtc    27114

gctacaggat tggcaactat aaattaaaca cagaccattc cagtagcagt gacaatattg    27174

ctttgcttgt acagtaagtg acaacagatg tttcatctcg ttgactttca ggttactata    27234

gcagagatat tactaattat tatgaggact tttaaagttt ccatttggaa tcttgattac    27294

atcataaacc tcataattaa aaatttatct aagtcactaa ctgagaataa atattctcaa    27354

ttagatgaag agcaaccaat ggagattgat taaacgaaca tgaaaattat tcttttcttg    27414

gcactgataa cactcgctac ttgtgagctt tatcactacc aagagtgtgt tagaggtaca    27474

acagtacttt taaaagaacc ttgctcttct ggaacatacg agggcaattc accatttcat    27534

cctctagctg ataacaaatt tgcactgact tgctttagca ctcaatttgc ttttgcttgt    27594

cctgacggcg taaaacacgt ctatcagtta cgtgccagat cagtttcacc taaactgttc    27654

atcagacaag aggaagttca agaactttac tctccaattt ttcttattgt tgcggcaata    27714

gtgtttataa cactttgctt cacactcaaa agaaagacag aatgattgaa ctttcattaa    27774

ttgacttcta tttgtgcttt ttagcctttc tgctattcct tgttttaatt atgcttatta    27834

tcttttggtt ctcacttgaa ctgcaagatc ataatgaaac ttgtcacgcc taaacgaaca    27894

tgaaatttct tgttttctta ggaatcatca caactgtagc tgcatttcac caagaatgta    27954

gtttacagtc atgtactcaa catcaaccat atgtagttga tgacccgtgt cctattcact    28014

tctattctaa atggtatatt agagtaggag ctagaaaatc agcaccttta attgaattgt    28074

gcgtggatga ggctggttct aaatcaccca ttcagtacat cgatatcggt aattatacag    28134

tttcctgttt accttttaca attaattgcc aggaacctaa attgggtagt cttgtagtgc    28194

gttgttcgtt ctatgaagac tttttagagt atcatgacgt tcgtgttgtt ttagatttca    28254

tctaaacgaa caaactaaaa tgtctgataa tggaccccaa aatcagcgaa atgcaccccg    28314

cattacgttt ggtggaccct cagattcaac tggcagtaac cagaatggag aacgcagtgg    28374

ggcgcgatca aaacaacgtc ggccccaagg tttacccaat aatactgcgt cttggttcac    28434

cgctctcact caacatggca aggaagacct taaattccct cgaggacaag gcgttccaat    28494

taacaccaat agcagtccag atgaccaaat tggctactac cgaagagcta ccagacgaat    28554

tcgtggtggt gacggtaaaa tgaaagatct cagtccaaga tggtatttct actacctagg    28614

aactgggcca gaagctggac ttccctatgg tgctaacaaa gacggcatca tatgggttgc    28674

aactgaggga gccttgaata caccaaaaga tcacattggc acccgcaatc ctgctaacaa    28734

tgctgcaatc gtgctacaac ttcctcaagg aacaacattg ccaaaaggct tctacgcaga    28794

agggagcaga ggcggcagtc aagcctcttc tcgttcctca tcacgtagtc gcaacagttc    28854

aagaaattca actccaggca gcagtagggg aacttctcct gctagaatgg ctggcaatgg    28914

cggtgatgct gctcttgctt tgctgctgct tgacagattg aaccagcttg agagcaaaat    28974

gtctggtaaa ggccaacaac aacaaggcca aactgtcact aagaaatctg ctgctgaggc    29034

ttctaagaag cctcggcaaa aacgtactgc cactaaagca tacaatgtaa cacaagcttt    29094

cggcagacgt ggtccagaac aaacccaagg aaattttggg gaccaggaac taatcagaca    29154

aggaactgat tacaaacatt ggccgcaaat tgcacaattt gcccccagcg cttcagcgtt    29214

cttcggaatg tcgcgcattg gcatggaagt cacaccttcg ggaacgtggt tgacctacac    29274

aggtgccatc aaattggatg acaaagatcc aaatttcaaa gatcaagtca ttttgctgaa    29334

taagcatatt gacgcataca aaacattccc accaacagag cctaaaaagg acaaaaagaa    29394

gaaggctgat gaaactcaag ccttaccgca gagacagaag aaacagcaaa ctgtgactct    29454

tcttcctgct gcagatttgg atgatttctc caaacaattg caacaatcca tgagcagtgc    29514

tgactcaact caggcctaaa ctcatgcaga ccacacaagg cagatgggct atataaacgt    29574

tttcgctttt ccgtttacga tatatagtct actcttgtgc agaatgaatt ctcgtaacta    29634

catagcacaa gtagatgtag ttaactttaa tctcacatag caatctttaa tcagtgtgta    29694

acattaggga ggacttgaaa gagccaccac attttcaccg aggccacgcg gagtacgatc    29754

gagtgtacag tgaacaatgc tagggagagc tgcctatatg gaagagccct aatgtgtaaa    29814

attaatttta gtagtgctat ccccatgtga ttttaatagc ttcttaggag aatgacaaaa    29874

aaaaaaaaaa aaaaaaaaaa aaaaaaaaa                                      29903


<210>  40
<211>  1273
<212>  PRT
<213>  SARS-CoV-2

<400>  40

Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
1               5                   10                  15      


Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
            20                  25                  30          


Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
        35                  40                  45              


His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
    50                  55                  60                  


Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
65                  70                  75                  80  


Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                85                  90                  95      


Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
            100                 105                 110         


Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
        115                 120                 125             


Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
    130                 135                 140                 


Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
145                 150                 155                 160 


Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                165                 170                 175     


Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
            180                 185                 190         


Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
        195                 200                 205             


Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
    210                 215                 220                 


Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
225                 230                 235                 240 


Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                245                 250                 255     


Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
            260                 265                 270         


Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
        275                 280                 285             


Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
    290                 295                 300                 


Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
305                 310                 315                 320 


Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                325                 330                 335     


Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
            340                 345                 350         


Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
        355                 360                 365             


Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
    370                 375                 380                 


Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
385                 390                 395                 400 


Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                405                 410                 415     


Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
            420                 425                 430         


Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
        435                 440                 445             


Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
    450                 455                 460                 


Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
465                 470                 475                 480 


Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                485                 490                 495     


Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
            500                 505                 510         


Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
        515                 520                 525             


Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
    530                 535                 540                 


Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
545                 550                 555                 560 


Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                565                 570                 575     


Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
            580                 585                 590         


Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
        595                 600                 605             


Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
    610                 615                 620                 


His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
625                 630                 635                 640 


Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                645                 650                 655     


Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
            660                 665                 670         


Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala 
        675                 680                 685             


Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser 
    690                 695                 700                 


Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile 
705                 710                 715                 720 


Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val 
                725                 730                 735     


Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu 
            740                 745                 750         


Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr 
        755                 760                 765             


Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln 
    770                 775                 780                 


Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe 
785                 790                 795                 800 


Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser 
                805                 810                 815     


Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly 
            820                 825                 830         


Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp 
        835                 840                 845             


Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu 
    850                 855                 860                 


Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly 
865                 870                 875                 880 


Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile 
                885                 890                 895     


Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr 
            900                 905                 910         


Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn 
        915                 920                 925             


Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala 
    930                 935                 940                 


Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn 
945                 950                 955                 960 


Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val 
                965                 970                 975     


Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln 
            980                 985                 990         


Ile Asp Arg Leu Ile Thr Gly Arg  Leu Gln Ser Leu Gln  Thr Tyr Val 
        995                 1000                 1005             


Thr Gln  Gln Leu Ile Arg Ala  Ala Glu Ile Arg Ala  Ser Ala Asn 
    1010                 1015                 1020             


Leu Ala  Ala Thr Lys Met Ser  Glu Cys Val Leu Gly  Gln Ser Lys 
    1025                 1030                 1035             


Arg Val  Asp Phe Cys Gly Lys  Gly Tyr His Leu Met  Ser Phe Pro 
    1040                 1045                 1050             


Gln Ser  Ala Pro His Gly Val  Val Phe Leu His Val  Thr Tyr Val 
    1055                 1060                 1065             


Pro Ala  Gln Glu Lys Asn Phe  Thr Thr Ala Pro Ala  Ile Cys His 
    1070                 1075                 1080             


Asp Gly  Lys Ala His Phe Pro  Arg Glu Gly Val Phe  Val Ser Asn 
    1085                 1090                 1095             


Gly Thr  His Trp Phe Val Thr  Gln Arg Asn Phe Tyr  Glu Pro Gln 
    1100                 1105                 1110             


Ile Ile  Thr Thr Asp Asn Thr  Phe Val Ser Gly Asn  Cys Asp Val 
    1115                 1120                 1125             


Val Ile  Gly Ile Val Asn Asn  Thr Val Tyr Asp Pro  Leu Gln Pro 
    1130                 1135                 1140             


Glu Leu  Asp Ser Phe Lys Glu  Glu Leu Asp Lys Tyr  Phe Lys Asn 
    1145                 1150                 1155             


His Thr  Ser Pro Asp Val Asp  Leu Gly Asp Ile Ser  Gly Ile Asn 
    1160                 1165                 1170             


Ala Ser  Val Val Asn Ile Gln  Lys Glu Ile Asp Arg  Leu Asn Glu 
    1175                 1180                 1185             


Val Ala  Lys Asn Leu Asn Glu  Ser Leu Ile Asp Leu  Gln Glu Leu 
    1190                 1195                 1200             


Gly Lys  Tyr Glu Gln Tyr Ile  Lys Trp Pro Trp Tyr  Ile Trp Leu 
    1205                 1210                 1215             


Gly Phe  Ile Ala Gly Leu Ile  Ala Ile Val Met Val  Thr Ile Met 
    1220                 1225                 1230             


Leu Cys  Cys Met Thr Ser Cys  Cys Ser Cys Leu Lys  Gly Cys Cys 
    1235                 1240                 1245             


Ser Cys  Gly Ser Cys Cys Lys  Phe Asp Glu Asp Asp  Ser Glu Pro 
    1250                 1255                 1260             


Val Leu  Lys Gly Val Lys Leu  His Tyr Thr 
    1265                 1270             


<210>  41
<211>  24
<212>  PRT
<213>  porcine epidemic diarrhea virus

<400>  41

Gly Arg Val Val Gln Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn 
1               5                   10                  15      


Lys Val Val Thr Asn Gly Leu Gly 
            20                  


<210>  42
<211>  24
<212>  PRT
<213>  SARS-CoV-2

<400>  42

Pro Ser Lys Pro Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn 
1               5                   10                  15      


Val Lys Thr Leu Ala Asp Ala Gly 
            20                  


<210>  43
<211>  8
<212>  PRT
<213>  SARS-CoV-2

<400>  43

Ser Phe Ile Glu Asp Leu Leu Phe 
1               5               


<210>  44
<211>  9
<212>  PRT
<213>  Influenza virus

<400>  44

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1               5                   


<210>  45
<211>  8
<212>  PRT
<213>  Homo sapiens

<400>  45

Tyr Ser Asn Ile Gly Val Cys Lys 
1               5               


<210>  46
<211>  110
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificially synthesized oligonucleotide sequence

<400>  46
aaccacgtct tcaggccgcg ttgttcagaa acgcagcttc atcgaggatc tgctgttcaa       60

caaggtggtg accaatggtc tgggcaccgg tggcggcaga agacttgtgt                 110


<210>  47
<211>  8
<212>  PRT
<213>  Human immunodeficiency virus type 1

<400>  47

Ala Leu Gly Ile Gly Ala Ala Phe 
1               5               


<210>  48
<211>  8
<212>  PRT
<213>  Human immunodeficiency virus type 1

<400>  48

Ala Val Gly Phe Gly Ala Ala Phe 
1               5               


<210>  49
<211>  8
<212>  PRT
<213>  Human immunodeficiency virus type 1

<400>  49

Ala Ala Gly Phe Gly Ala Met Phe 
1               5               


