                         SEQUENCE LISTING

<110>  Affinia Therapeutics, Inc.
 
<120>  RECOMBINANT AAV FOR TREATMENT OF NEURAL DISEASE

<130>  AFF-002WO

<140>  PCT/US2021/063889
<141>  2021-12-16

<160>  22    

<170>  PatentIn version 3.5

<210>  1
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Anc80L65 vp1 capsid protein

<400>  1

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 
145                 150                 155                 160 


Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Asn Thr Met Ala Ala Gly Gly Gly 
        195                 200                 205             


Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Ser Gln Ser Gly Gly Ser Thr Asn Asp Asn Thr 
            260                 265                 270         


Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
        275                 280                 285             


His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 
    290                 295                 300                 


Trp Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn Ile Gln 
305                 310                 315                 320 


Val Lys Glu Val Thr Thr Asn Asp Gly Thr Thr Thr Ile Ala Asn Asn 
                325                 330                 335     


Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu Pro 
            340                 345                 350         


Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 
        355                 360                 365             


Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
    370                 375                 380                 


Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385                 390                 395                 400 


Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Thr Phe 
                405                 410                 415     


Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 
            420                 425                 430         


Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Arg 
        435                 440                 445             


Thr Gln Thr Thr Ser Gly Thr Ala Gly Asn Arg Thr Leu Gln Phe Ser 
    450                 455                 460                 


Gln Ala Gly Pro Ser Ser Met Ala Asn Gln Ala Lys Asn Trp Leu Pro 
465                 470                 475                 480 


Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Thr Asn Gln Asn 
                485                 490                 495     


Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn 
            500                 505                 510         


Gly Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Thr His Lys 
        515                 520                 525             


Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Ala Gly Asn Ser Asn Val Asp Leu Asp Asn Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Glu 
                565                 570                 575     


Tyr Gly Thr Val Ala Thr Asn Leu Gln Ser Ala Asn Thr Ala Pro Ala 
            580                 585                 590         


Thr Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asn Pro Pro Thr Thr Phe Ser Pro Ala Lys Phe Ala Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Asn Lys Ser Thr Asn Val Asp Phe Ala Val Asp Thr Asn Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  2
<211>  1530
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ARSA codon optimized sequence (S)

<400>  2
atgagcatgg gcgctcctag aagcctgctg ctggccctgg ctgccggcct ggccgtggct       60

agacctccaa acatcgtgct gatcttcgcc gacgacctgg gctatggtga cctgggctgc      120

tacggccacc cctcttctac aacacccaat ctggaccagc tggccgctgg cggcctgaga      180

ttcacagact tctacgtgcc agtgtccctg tgcacccctt ctagagccgc tctcctgacc      240

ggcagactgc ctgtgcggat gggcatgtac cccggagtgc tggtgcccag cagtagagga      300

ggactgcctc tggaagaggt gaccgtggcc gaggtgctgg ccgccagagg ctacctgaca      360

ggaatggccg gaaaatggca cctgggagtg ggcccagaag gcgccttcct gccaccacac      420

cagggctttc accggttcct ggggatccct tacagccacg accaaggccc ttgtcagaac      480

ctgacatgct tcccccccgc cacaccttgc gacggcggct gtgaccaggg ccttgtgcct      540

atccccctgc tggccaacct gagcgtggaa gcccagcctc catggctgcc tggcctcgag      600

gccagataca tggccttcgc tcatgatctg atggccgatg cccagagaca ggacagacct      660

tttttcctgt attacgccag ccaccacacc cactaccctc agttcagcgg acagagcttc      720

gccgagcgga gcggcagagg ccccttcggc gacagcctga tggaactgga cgccgctgtt      780

ggaaccctga tgaccgccat tggcgatctg ggcctgctcg aggaaaccct ggtgatcttc      840

accgccgata acggccctga gacaatgcgg atgtctagag gcggctgcag cggcctgctg      900

cggtgcggca agggcaccac ctacgagggc ggcgtgcggg aacccgccct ggctttttgg      960

cctggccaca tcgcccctgg cgttacccac gagctggctt ctagcctgga cctgctgccc     1020

accctggccg cactggccgg agctccactg cctaatgtga ccctggatgg cttcgacctg     1080

tcccctctgc tgctcggcac cggcaagagc cctagacaga gcctgttctt ctacccctcc     1140

taccctgatg aggtgcgggg cgtctttgcc gtcaggaccg gcaaatacaa ggcccatttc     1200

tttacacagg gcagcgccca ctctgatacc acagccgacc ctgcctgcca cgccagctcc     1260

agcctgaccg cccacgagcc tcctctgcta tacgacctga gcaaggaccc tggcgagaac     1320

tacaacctgc tgggtggcgt ggccggcgct acacctgagg tgctgcaggc cctgaagcag     1380

ctgcagctgc ttaaggccca actggacgcc gctgtgacct tcggccctag ccaggtggcc     1440

agaggagaag atcccgccct gcaaatctgc tgccaccctg gatgtacccc tcggcccgct     1500

tgttgtcact gccccgaccc tcacgcctga                                      1530


<210>  3
<211>  1530
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ARSA codon optimized sequence (A)

<400>  3
atgtctatgg gagcccctag atctctgctg ctggctctgg ctgctggact ggcagttgcc       60

agacctccta acatcgtgct gatcttcgcc gacgatctcg gctatggcga tctgggctgt      120

tacggacacc ccagcagcac cacacctaac ctggatcaac ttgccgctgg cggcctgaga      180

ttcaccgatt tctacgtgcc cgtgtctctg tgcacccctt ctagagctgc tctgctgaca      240

ggcagactcc ctgtgcggat gggaatgtat cctggcgtgc tggtgcctag ctctagaggc      300

ggactgcctc tggaagaagt gacagttgcc gaagtgctgg ccgccagagg atatctgact      360

ggcatggccg gaaagtggca cctcggagtt ggacctgaag gcgcttttct gcctcctcac      420

cagggcttcc accggtttct gggcatccct tactctcacg atcagggccc ctgccagaac      480

ctgacctgtt ttcctcctgc cacaccttgc gacggcggct gtgatcaagg actggtgcca      540

attcctctgc tggccaacct gagcgtggaa gctcaacctc cttggctgcc aggactggaa      600

gcccggtata tggccttcgc tcacgacctg atggccgacg ctcagagaca ggacagacca      660

ttcttcctgt actacgccag ccaccacaca cactaccctc agtttagcgg ccagagcttc      720

gccgagagat ctggcagagg acctttcggc gacagcctga tggaactgga tgccgctgtg      780

ggcacactga tgacagccat cggagatctg ggactgctgg aagagacact ggtcatcttc      840

accgccgaca acggccccga gacaatgaga atgagcagag gcggctgtag cggcctgctg      900

agatgtggca agggcaccac atatgaaggc ggcgtcagag aacctgctct ggccttttgg      960

cctggccata ttgctccagg cgtgacacac gagctggcct cttctctgga tctgctgcct     1020

acactggcag ctcttgctgg tgctcccctg cctaatgtga ccctggatgg cttcgatctg     1080

agcccactgc tgctcggcac aggcaagtct ccaagacaga gcctgttctt ctaccctagc     1140

taccccgatg aagtgcgggg agtgtttgcc gtgcggaccg gaaagtataa ggcccacttc     1200

ttcacccaag gcagcgccca ctctgacacc acagctgatc ctgcttgtca cgccagctct     1260

agcctgacag cccatgaacc tccactgctg tacgacctga gcaaggaccc cggcgagaac     1320

tacaatctgc ttggcggagt tgccggcgct acacctgaag ttctgcaggc cctgaaacag     1380

ctccagctgc tgaaagccca gctggacgct gccgtgacat ttggacctag tcaggtggcc     1440

agaggcgagg atcctgctct gcagatctgt tgtcaccctg gctgcacacc cagacctgcc     1500

tgctgtcatt gtcctgatcc tcacgcctga                                      1530


<210>  4
<211>  1530
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ARSA coding sequence (native)

<400>  4
atgtccatgg gggcaccgcg gtccctcctc ctggccctgg ctgctggcct ggccgttgcc       60

cgtccgccca acatcgtgct gatctttgcc gacgacctcg gctatgggga cctgggctgc      120

tatgggcacc ccagctctac cactcccaac ctggaccagc tggcggcggg agggctgcgg      180

ttcacagact tctacgtgcc tgtgtctctg tgcacaccct ctagggccgc cctcctgacc      240

ggccggctcc cggttcggat gggcatgtac cctggcgtcc tggtgcccag ctcccggggg      300

ggcctgcccc tggaggaggt gaccgtggcc gaagtcctgg ctgcccgagg ctacctcaca      360

ggaatggccg gcaagtggca ccttggggtg gggcctgagg gggccttcct gcccccccat      420

cagggcttcc atcgatttct aggcatcccg tactcccacg accagggccc ctgccagaac      480

ctgacctgct tcccgccggc cactccttgc gacggtggct gtgaccaggg cctggtcccc      540

atcccactgt tggccaacct gtccgtggag gcgcagcccc cctggctgcc cggactagag      600

gcccgctaca tggctttcgc ccatgacctc atggccgacg cccagcgcca ggatcgcccc      660

ttcttcctgt actatgcctc tcaccacacc cactaccctc agttcagtgg gcagagcttt      720

gcagagcgtt caggccgcgg gccatttggg gactccctga tggagctgga tgcagctgtg      780

gggaccctga tgacagccat aggggacctg gggctgcttg aagagacgct ggtcatcttc      840

actgcagaca atggacctga gaccatgcgt atgtcccgag gcggctgctc cggtctcttg      900

cggtgtggaa agggaacgac ctacgagggc ggtgtccgag agcctgcctt ggccttctgg      960

ccaggtcata tcgctcccgg cgtgacccac gagctggcca gctccctgga cctgctgcct     1020

accctggcag ccctggctgg ggccccactg cccaatgtca ccttggatgg ctttgacctc     1080

agccccctgc tgctgggcac aggcaagagc cctcggcagt ctctcttctt ctacccgtcc     1140

tacccagacg aggtccgtgg ggtttttgct gtgcggactg gaaagtacaa ggctcacttc     1200

ttcacccagg gctctgccca cagtgatacc actgcagacc ctgcctgcca cgcctccagc     1260

tctctgactg ctcatgagcc cccgctgctc tatgacctgt ccaaggaccc tggtgagaac     1320

tacaacctgc tggggggtgt ggccggggcc accccagagg tgctgcaagc cctgaaacag     1380

cttcagctgc tcaaggccca gttagacgca gctgtgacct tcggccccag ccaggtggcc     1440

cggggcgagg accccgccct gcagatctgc tgtcatcctg gctgcacccc ccgcccagct     1500

tgctgccatt gcccagatcc ccatgcctga                                      1530


<210>  5
<211>  507
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ARSA (native) (amino acid sequence)

<400>  5

Met Gly Ala Pro Arg Ser Leu Leu Leu Ala Leu Ala Ala Gly Leu Ala 
1               5                   10                  15      


Val Ala Arg Pro Pro Asn Ile Val Leu Ile Phe Ala Asp Asp Leu Gly 
            20                  25                  30          


Tyr Gly Asp Leu Gly Cys Tyr Gly His Pro Ser Ser Thr Thr Pro Asn 
        35                  40                  45              


Leu Asp Gln Leu Ala Ala Gly Gly Leu Arg Phe Thr Asp Phe Tyr Val 
    50                  55                  60                  


Pro Val Ser Leu Cys Thr Pro Ser Arg Ala Ala Leu Leu Thr Gly Arg 
65                  70                  75                  80  


Leu Pro Val Arg Met Gly Met Tyr Pro Gly Val Leu Val Pro Ser Ser 
                85                  90                  95      


Arg Gly Gly Leu Pro Leu Glu Glu Val Thr Val Ala Glu Val Leu Ala 
            100                 105                 110         


Ala Arg Gly Tyr Leu Thr Gly Met Ala Gly Lys Trp His Leu Gly Val 
        115                 120                 125             


Gly Pro Glu Gly Ala Phe Leu Pro Pro His Gln Gly Phe His Arg Phe 
    130                 135                 140                 


Leu Gly Ile Pro Tyr Ser His Asp Gln Gly Pro Cys Gln Asn Leu Thr 
145                 150                 155                 160 


Cys Phe Pro Pro Ala Thr Pro Cys Asp Gly Gly Cys Asp Gln Gly Leu 
                165                 170                 175     


Val Pro Ile Pro Leu Leu Ala Asn Leu Ser Val Glu Ala Gln Pro Pro 
            180                 185                 190         


Trp Leu Pro Gly Leu Glu Ala Arg Tyr Met Ala Phe Ala His Asp Leu 
        195                 200                 205             


Met Ala Asp Ala Gln Arg Gln Asp Arg Pro Phe Phe Leu Tyr Tyr Ala 
    210                 215                 220                 


Ser His His Thr His Tyr Pro Gln Phe Ser Gly Gln Ser Phe Ala Glu 
225                 230                 235                 240 


Arg Ser Gly Arg Gly Pro Phe Gly Asp Ser Leu Met Glu Leu Asp Ala 
                245                 250                 255     


Ala Val Gly Thr Leu Met Thr Ala Ile Gly Asp Leu Gly Leu Leu Glu 
            260                 265                 270         


Glu Thr Leu Val Ile Phe Thr Ala Asp Asn Gly Pro Glu Thr Met Arg 
        275                 280                 285             


Met Ser Arg Gly Gly Cys Ser Gly Leu Leu Arg Cys Gly Lys Gly Thr 
    290                 295                 300                 


Thr Tyr Glu Gly Gly Val Arg Glu Pro Ala Leu Ala Phe Trp Pro Gly 
305                 310                 315                 320 


His Ile Ala Pro Gly Val Thr His Glu Leu Ala Ser Ser Leu Asp Leu 
                325                 330                 335     


Leu Pro Thr Leu Ala Ala Leu Ala Gly Ala Pro Leu Pro Asn Val Thr 
            340                 345                 350         


Leu Asp Gly Phe Asp Leu Ser Pro Leu Leu Leu Gly Thr Gly Lys Ser 
        355                 360                 365             


Pro Arg Gln Ser Leu Phe Phe Tyr Pro Ser Tyr Pro Asp Glu Val Arg 
    370                 375                 380                 


Gly Val Phe Ala Val Arg Thr Gly Lys Tyr Lys Ala His Phe Phe Thr 
385                 390                 395                 400 


Gln Gly Ser Ala His Ser Asp Thr Thr Ala Asp Pro Ala Cys His Ala 
                405                 410                 415     


Ser Ser Ser Leu Thr Ala His Glu Pro Pro Leu Leu Tyr Asp Leu Ser 
            420                 425                 430         


Lys Asp Pro Gly Glu Asn Tyr Asn Leu Leu Gly Gly Val Ala Gly Ala 
        435                 440                 445             


Thr Pro Glu Val Leu Gln Ala Leu Lys Gln Leu Gln Leu Leu Lys Ala 
    450                 455                 460                 


Gln Leu Asp Ala Ala Val Thr Phe Gly Pro Ser Gln Val Ala Arg Gly 
465                 470                 475                 480 


Glu Asp Pro Ala Leu Gln Ile Cys Cys His Pro Gly Cys Thr Pro Arg 
                485                 490                 495     


Pro Ala Cys Cys His Cys Pro Asp Pro His Ala 
            500                 505         


<210>  6
<211>  507
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Hyper-ARSA (amino acid sequence)

<400>  6

Met Gly Ala Pro Arg Ser Leu Leu Leu Ala Leu Ala Ala Gly Leu Ala 
1               5                   10                  15      


Val Ala Arg Pro Pro Asn Ile Val Leu Ile Phe Ala Asp Asp Leu Gly 
            20                  25                  30          


Tyr Gly Asp Leu Gly Cys Tyr Gly His Pro Ser Ser Thr Thr Pro Asn 
        35                  40                  45              


Leu Asp Gln Leu Ala Ala Gly Gly Leu Arg Phe Thr Asp Phe Tyr Val 
    50                  55                  60                  


Pro Val Ser Leu Cys Thr Pro Ser Arg Ala Ala Leu Leu Thr Gly Arg 
65                  70                  75                  80  


Leu Pro Val Arg Met Gly Met Tyr Pro Gly Val Leu Val Pro Ser Ser 
                85                  90                  95      


Arg Gly Gly Leu Pro Leu Glu Glu Val Thr Val Ala Glu Val Leu Ala 
            100                 105                 110         


Ala Arg Gly Tyr Leu Thr Gly Met Ala Gly Lys Trp His Leu Gly Val 
        115                 120                 125             


Gly Pro Glu Gly Ala Phe Leu Pro Pro His Gln Gly Phe His Arg Phe 
    130                 135                 140                 


Leu Gly Ile Pro Tyr Ser His Asp Gln Gly Pro Cys Gln Asn Leu Thr 
145                 150                 155                 160 


Cys Phe Pro Pro Ala Thr Pro Cys Asp Gly Gly Cys Asp Gln Gly Leu 
                165                 170                 175     


Val Pro Ile Pro Leu Leu Ala Asn Leu Ser Val Glu Ala Gln Pro Pro 
            180                 185                 190         


Trp Leu Pro Gly Leu Glu Ala Arg Tyr Val Ala Phe Ala His Asp Leu 
        195                 200                 205             


Met Ala Asp Ala Gln Arg Gln Asp Arg Pro Phe Phe Leu Tyr Tyr Ala 
    210                 215                 220                 


Ser His His Thr His Tyr Pro Gln Phe Ser Gly Gln Ser Phe Ala Glu 
225                 230                 235                 240 


Arg Ser Gly Arg Gly Pro Phe Gly Asp Ser Leu Met Glu Leu Asp Ala 
                245                 250                 255     


Ala Val Gly Thr Leu Met Thr Ala Ile Gly Asp Leu Gly Leu Leu Glu 
            260                 265                 270         


Glu Thr Leu Val Ile Phe Thr Ala Asp Asn Gly Pro Glu Leu Met Arg 
        275                 280                 285             


Met Ser Asn Gly Gly Cys Ser Gly Leu Leu Arg Cys Gly Lys Gly Thr 
    290                 295                 300                 


Thr Tyr Glu Gly Gly Val Arg Glu Pro Ala Leu Ala Phe Trp Pro Gly 
305                 310                 315                 320 


His Ile Ala Pro Gly Val Thr His Glu Leu Ala Ser Ser Leu Asp Leu 
                325                 330                 335     


Leu Pro Thr Leu Ala Ala Leu Ala Gly Ala Pro Leu Pro Asn Val Thr 
            340                 345                 350         


Leu Asp Gly Phe Asp Leu Ser Pro Leu Leu Leu Gly Thr Gly Lys Ser 
        355                 360                 365             


Pro Arg Gln Ser Leu Phe Phe Tyr Pro Ser Tyr Pro Asp Glu Val Arg 
    370                 375                 380                 


Gly Val Phe Ala Val Arg Thr Gly Lys Tyr Lys Ala His Phe Phe Thr 
385                 390                 395                 400 


Gln Gly Ser Ala His Ser Asp Thr Thr Ala Asp Pro Ala Cys His Ala 
                405                 410                 415     


Ser Ser Ser Leu Thr Ala His Glu Pro Pro Leu Leu Tyr Asp Leu Ser 
            420                 425                 430         


Lys Asp Pro Gly Glu Asn Tyr Asn Leu Leu Gly Gly Val Ala Gly Ala 
        435                 440                 445             


Thr Pro Glu Val Leu Gln Ala Leu Lys Gln Leu Gln Leu Leu Lys Ala 
    450                 455                 460                 


Gln Leu Asp Ala Ala Val Thr Phe Gly Pro Ser Gln Val Ala Arg Gly 
465                 470                 475                 480 


Glu Asp Pro Ala Leu Gln Ile Cys Cys His Pro Gly Cys Thr Pro Arg 
                485                 490                 495     


Pro Ala Cys Cys His Cys Pro Asp Pro His Ala 
            500                 505         


<210>  7
<211>  1530
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Hyper-ARSA codon optimized sequence (S)

<400>  7
atgagcatgg gcgctcctag aagcctgctg ctggccctgg ctgccggcct ggccgtggct       60

agacctccaa acatcgtgct gatcttcgcc gacgacctgg gctatggtga cctgggctgc      120

tacggccacc cctcttctac aacacccaat ctggaccagc tggccgctgg cggcctgaga      180

ttcacagact tctacgtgcc agtgtccctg tgcacccctt ctagagccgc tctcctgacc      240

ggcagactgc ctgtgcggat gggcatgtac cccggagtgc tggtgcccag cagtagagga      300

ggactgcctc tggaagaggt gaccgtggcc gaggtgctgg ccgccagagg ctacctgaca      360

ggaatggccg gaaaatggca cctgggagtg ggcccagaag gcgccttcct gccaccacac      420

cagggctttc accggttcct ggggatccct tacagccacg accaaggccc ttgtcagaac      480

ctgacatgct tcccccccgc cacaccttgc gacggcggct gtgaccaggg ccttgtgcct      540

atccccctgc tggccaacct gagcgtggaa gcccagcctc catggctgcc tggcctcgag      600

gccagatacg tggccttcgc tcatgatctg atggccgatg cccagagaca ggacagacct      660

tttttcctgt attacgccag ccaccacacc cactaccctc agttcagcgg acagagcttc      720

gccgagcgga gcggcagagg ccccttcggc gacagcctga tggaactgga cgccgctgtt      780

ggaaccctga tgaccgccat tggcgatctg ggcctgctcg aggaaaccct ggtgatcttc      840

accgccgata acggccctga gctgatgcgg atgtctaacg gcggctgcag cggcctgctg      900

cggtgcggca agggcaccac ctacgagggc ggcgtgcggg aacccgccct ggctttttgg      960

cctggccaca tcgcccctgg cgttacccac gagctggctt ctagcctgga cctgctgccc     1020

accctggccg cactggccgg agctccactg cctaatgtga ccctggatgg cttcgacctg     1080

tcccctctgc tgctcggcac cggcaagagc cctagacaga gcctgttctt ctacccctcc     1140

taccctgatg aggtgcgggg cgtctttgcc gtcaggaccg gcaaatacaa ggcccatttc     1200

tttacacagg gcagcgccca ctctgatacc acagccgacc ctgcctgcca cgccagctcc     1260

agcctgaccg cccacgagcc tcctctgcta tacgacctga gcaaggaccc tggcgagaac     1320

tacaacctgc tgggtggcgt ggccggcgct acacctgagg tgctgcaggc cctgaagcag     1380

ctgcagctgc ttaaggccca actggacgcc gctgtgacct tcggccctag ccaggtggcc     1440

agaggagaag atcccgccct gcaaatctgc tgccaccctg gatgtacccc tcggcccgct     1500

tgttgtcact gccccgaccc tcacgcctga                                      1530


<210>  8
<211>  1530
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Hyper-ARSA codon optimized sequence (A)

<400>  8
atgtctatgg gagcccctag atctctgctg ctggctctgg ctgctggact ggcagttgcc       60

agacctccta acatcgtgct gatcttcgcc gacgatctcg gctatggcga tctgggctgt      120

tacggacacc ccagcagcac cacacctaac ctggatcaac ttgccgctgg cggcctgaga      180

ttcaccgatt tctacgtgcc cgtgtctctg tgcacccctt ctagagctgc tctgctgaca      240

ggcagactcc ctgtgcggat gggaatgtat cctggcgtgc tggtgcctag ctctagaggc      300

ggactgcctc tggaagaagt gacagttgcc gaagtgctgg ccgccagagg atatctgact      360

ggcatggccg gaaagtggca cctcggagtt ggacctgaag gcgcttttct gcctcctcac      420

cagggcttcc accggtttct gggcatccct tactctcacg atcagggccc ctgccagaac      480

ctgacctgtt ttcctcctgc cacaccttgc gacggcggct gtgatcaagg actggtgcca      540

attcctctgc tggccaacct gagcgtggaa gctcaacctc cttggctgcc aggactggaa      600

gcccggtatg tggccttcgc tcacgacctg atggccgacg ctcagagaca ggacagacca      660

ttcttcctgt actacgccag ccaccacaca cactaccctc agtttagcgg ccagagcttc      720

gccgagagat ctggcagagg acctttcggc gacagcctga tggaactgga tgccgctgtg      780

ggcacactga tgacagccat cggagatctg ggactgctgg aagagacact ggtcatcttc      840

accgccgaca acggccccga gctgatgaga atgagcaacg gcggctgtag cggcctgctg      900

agatgtggca agggcaccac atatgaaggc ggcgtcagag aacctgctct ggccttttgg      960

cctggccata ttgctccagg cgtgacacac gagctggcct cttctctgga tctgctgcct     1020

acactggcag ctcttgctgg tgctcccctg cctaatgtga ccctggatgg cttcgatctg     1080

agcccactgc tgctcggcac aggcaagtct ccaagacaga gcctgttctt ctaccctagc     1140

taccccgatg aagtgcgggg agtgtttgcc gtgcggaccg gaaagtataa ggcccacttc     1200

ttcacccaag gcagcgccca ctctgacacc acagctgatc ctgcttgtca cgccagctct     1260

agcctgacag cccatgaacc tccactgctg tacgacctga gcaaggaccc cggcgagaac     1320

tacaatctgc ttggcggagt tgccggcgct acacctgaag ttctgcaggc cctgaaacag     1380

ctccagctgc tgaaagccca gctggacgct gccgtgacat ttggacctag tcaggtggcc     1440

agaggcgagg atcctgctct gcagatctgt tgtcaccctg gctgcacacc cagacctgcc     1500

tgctgtcatt gtcctgatcc tcacgcctga                                      1530


<210>  9
<211>  334
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  UbC promoter minimal (nucleotide sequence)

<400>  9
ggcctccgcg ccgggttttg gcgcctcccg cgggcgcccc cctcctcacg gcgagcgctg       60

ccacgtcaga cgaagggcgc agcgagcgtc ctgatccttc cgcccggacg ctcaggacag      120

cggcccgctg ctcataagac tcggccttag aaccccagta tcagcagaag gacattttag      180

gacgggactt gggtgactct agggcactgg ttttctttcc agagagcgga acaggcgagg      240

aaaagtagtc ccttctcggc gattctgcgg agggatctcc gtggggcggt gaacgccgat      300

gattatataa ggacgcgccg ggtgtggcac agct                                  334


<210>  10
<211>  1212
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  UbC promoter full (nucleotide sequence)

<400>  10
ggcctccgcg ccgggttttg gcgcctcccg cgggcgcccc cctcctcacg gcgagcgctg       60

ccacgtcaga cgaagggcgc agcgagcgtc ctgatccttc cgcccggacg ctcaggacag      120

cggcccgctg ctcataagac tcggccttag aaccccagta tcagcagaag gacattttag      180

gacgggactt gggtgactct agggcactgg ttttctttcc agagagcgga acaggcgagg      240

aaaagtagtc ccttctcggc gattctgcgg agggatctcc gtggggcggt gaacgccgat      300

gattatataa ggacgcgccg ggtgtggcac agctagttcc gtcgcagccg ggatttgggt      360

cgcagttctt gtttgtggat cgctgtgatc gtcacttggt gagtagcggg ctgctgggct      420

ggccggggct ttcgtggccg ccgggccgct cggtgggacg gaggcgtgtg gagagaccgc      480

caagggctgt agtctgggtc cgcgagcaag gttgccctga actgggggtt ggggggagcg      540

cagcaaaatg gcggctgttc ccgagtcttg aatggaagac gcttgtgagg cgggctgtga      600

ggtcgttgaa acaaggtggg gggcatggtg ggcggcaaga acccaaggtc ttgaggcctt      660

cgctaatgcg ggaaagctct tattcgggtg agatgggctg gggcaccatc tggggaccct      720

gacgtgaagt ttgtcactga ctggagaact cggtttgtcg tctgttgcgg gggcggcagt      780

tatggcggtg ccgttgggca gtgcacccgt acctttggga gcgcgcgccc tcgtcgtgtc      840

gtgacgtcac ccgttctgtt ggcttataat gcagggtggg gccacctgcc ggtaggtgtg      900

cggtaggctt ttctccgtcg caggacgcag ggttcgggcc aagggtaggc tctcctgaat      960

cgacaggcgc cggacctctg gtgaggggag ggataagtga ggcgtcagtt tctctggtcg     1020

gttttatgta cctatcttct taagtagctg aagctccggt tttgaactat gcgctcgggg     1080

ttggcgagtg tgttttgtga agttttttag gcaccttttg aaatgtaatc atttgggtca     1140

atatgtaatt ttcagtgtta gactagtaaa ttgtccgcta aattctggcc gtttttggct     1200

tttttgttag ac                                                         1212


<210>  11
<211>  1212
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  UbC promoter full - variant sequence (nucleotide sequence)

<400>  11
ggcctccgcg ccgggttttg gcgcctcccg cgggcgcccc cctcctcacg gcgagcgctg       60

ccacgtcaga cgaagggcgc agcgagcgtc ctgatccttc cgcccggacg ctcaggacag      120

cggcccgctg ctcataagac tcggccttag aaccccagta tcagcagaag gacattttag      180

gacgggactt gggtgactct agggcactgg ttttctttcc agagagcgga acaggcgagg      240

aaaagtagtc ccttctcggc gattctgcgg agggatctcc gtggggcggt gaacgccgat      300

gattatataa ggacgcgccg ggtgtggcac agctagttcc gtcgcagccg ggatttgggt      360

cgcagttctt gtttgtggat cgctgtgatc gtcacttggt gagtagcggg ctgctgggct      420

ggccggggct ttcgtggccg ccgggccgct cggtgggacg gaggcgtgtg gagagcccgc      480

caagggctgt agtctgggtc cgcgagcaag gttgccctga actgggggtt ggggggagcg      540

cagcaaaatg gcggctgttc ccgagtcttg aatggaagac gcttgtgagg cgggctgtga      600

ggtcgttgaa acaaggtggg gggcatggtg ggcggcaaga acccaaggtc ttgaggcctt      660

cgctaatgcg ggaaagctct tattcgggtg agatgggctg gggcaccatc tggggaccct      720

gacgtgaagt ttgtcactga ctggagaact cggtttgtcg tctgttgcgg gggcggcagt      780

tatggcggtg ccgttgggca gtgcacccgt acctttggga gcgcgcgccc tcgtcgtgtc      840

gtgacgtcac ccgttctgtt ggcttataat gcagggtggg gccacctgcc ggtaggtgtg      900

cggtaggctt ttctccgtcg caggacgcag ggttcgggcc aagggtaggc tctcctgaat      960

cgacaggcgc cggacctctg gtgaggggag ggataagtga ggcgtcagtt tctctggtcg     1020

gttttatgta cctatcttct taagtagctg aagctccggt tttgaactat gcgctcgggg     1080

ttggcgagtg tgttttgtga agttttttag gcaccttttg aaatgtaatc atttgggtca     1140

atatgtaatt ttcagtgtta gactagtaaa ttgtccgcta aattctggcc gtttttggct     1200

tttttgttag ac                                                         1212


<210>  12
<211>  584
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CAG promoter (nucleotide sequence)

<400>  12
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat       60

tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc      120

aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc      180

caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt      240

acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta      300

ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac      360

ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg      420

ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga      480

gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc      540

ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcg                       584


<210>  13
<211>  204
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CMV promoter (nucleotide sequence)

<400>  13
gtgatgcggt tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt       60

ccaagtctcc accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac      120

tttccaaaat gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg      180

tgggaggtct atataagcag agct                                             204


<210>  14
<211>  602
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CMV enhancer-promoter (nucleotide sequence)

<400>  14
gcattgatta ttgactagtt attaatagta atcaattacg gggtcattag ttcatagccc       60

atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct gaccgcccaa      120

cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc caatagggac      180

tttccattga cgtcaatggg tggagtattt acggtaaact gcccacttgg cagtacatca      240

agtgtatcat atgccaagtc cgccccctat tgacgtcaat gacggtaaat ggcccgcctg      300

gcattatgcc cagtacatga ccttacggga ctttcctact tggcagtaca tctacgtatt      360

agtcatcgct attaccatgg tgatgcggtt ttggcagtac accaatgggc gtggatagcg      420

gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga gtttgttttg      480

gcaccaaaat caacgggact ttccaaaatg tcgtaataac cccgccccgt tgacgcaaat      540

gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctcgtttag tgaaccgtca      600

ga                                                                     602


<210>  15
<211>  589
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  WPRE (nucleotide sequence)

<400>  15
aatcaacctc tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct       60

ccttttacgc tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt      120

atggctttca ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg      180

tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact      240

ggttggggca ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct      300

attgccacgg cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg      360

ttgggcactg acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc      420

gcctgtgttg ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc      480

aatccagcgg accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt      540

cgccttcgcc ctcagacgag tcggatctcc ctttgggccg cctccccgc                  589


<210>  16
<211>  124
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SV40 LPA (nucleotide sequence)

<400>  16
atttgtgaaa tttgtgatgc tattgcttta tttgtaacca ttataagctg caataaacaa       60

gttaacaaca acaattgcat tcattttatg tttcaggttc agggggaggt gtgggaggtt      120

tttt                                                                   124


<210>  17
<211>  91
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5' ITR (nucleotide sequence)

<400>  17
gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg       60

tcgcccggcc tcagtgagcg agcgagcgcg c                                      91


<210>  18
<211>  145
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3' ITR (nucleotide sequence)

<400>  18
aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg       60

ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc      120

gagcgcgcag agagggagtg gccaa                                            145


<210>  19
<211>  3766
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  UbC-COGS (ATP0123)

<400>  19
gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg       60

tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag      120

gggttccttt gtcgactcgg cctccgcgcc gggttttggc gcctcccgcg ggcgcccccc      180

tcctcacggc gagcgctgcc acgtcagacg aagggcgcag cgagcgtcct gatccttccg      240

cccggacgct caggacagcg gcccgctgct cataagactc ggccttagaa ccccagtatc      300

agcagaagga cattttagga cgggacttgg gtgactctag ggcactggtt ttctttccag      360

agagcggaac aggcgaggaa aagtagtccc ttctcggcga ttctgcggag ggatctccgt      420

ggggcggtga acgccgatga ttatataagg acgcgccggg tgtggcacag ctagttccgt      480

cgcagccggg atttgggtcg cagttcttgt ttgtggatcg ctgtgatcgt cacttggtga      540

gtagcgggct gctgggctgg ccggggcttt cgtggccgcc gggccgctcg gtgggacgga      600

ggcgtgtgga gagaccgcca agggctgtag tctgggtccg cgagcaaggt tgccctgaac      660

tgggggttgg ggggagcgca gcaaaatggc ggctgttccc gagtcttgaa tggaagacgc      720

ttgtgaggcg ggctgtgagg tcgttgaaac aaggtggggg gcatggtggg cggcaagaac      780

ccaaggtctt gaggccttcg ctaatgcggg aaagctctta ttcgggtgag atgggctggg      840

gcaccatctg gggaccctga cgtgaagttt gtcactgact ggagaactcg gtttgtcgtc      900

tgttgcgggg gcggcagtta tggcggtgcc gttgggcagt gcacccgtac ctttgggagc      960

gcgcgccctc gtcgtgtcgt gacgtcaccc gttctgttgg cttataatgc agggtggggc     1020

cacctgccgg taggtgtgcg gtaggctttt ctccgtcgca ggacgcaggg ttcgggccaa     1080

gggtaggctc tcctgaatcg acaggcgccg gacctctggt gaggggaggg ataagtgagg     1140

cgtcagtttc tctggtcggt tttatgtacc tatcttctta agtagctgaa gctccggttt     1200

tgaactatgc gctcggggtt ggcgagtgtg ttttgtgaag ttttttaggc accttttgaa     1260

atgtaatcat ttgggtcaat atgtaatttt cagtgttaga ctagtaaatt gtccgctaaa     1320

ttctggccgt ttttggcttt tttgttagac agatctatga gcatgggcgc tcctagaagc     1380

ctgctgctgg ccctggctgc cggcctggcc gtggctagac ctccaaacat cgtgctgatc     1440

ttcgccgacg acctgggcta tggtgacctg ggctgctacg gccacccctc ttctacaaca     1500

cccaatctgg accagctggc cgctggcggc ctgagattca cagacttcta cgtgccagtg     1560

tccctgtgca ccccttctag agccgctctc ctgaccggca gactgcctgt gcggatgggc     1620

atgtaccccg gagtgctggt gcccagcagt agaggaggac tgcctctgga agaggtgacc     1680

gtggccgagg tgctggccgc cagaggctac ctgacaggaa tggccggaaa atggcacctg     1740

ggagtgggcc cagaaggcgc cttcctgcca ccacaccagg gctttcaccg gttcctgggg     1800

atcccttaca gccacgacca aggcccttgt cagaacctga catgcttccc ccccgccaca     1860

ccttgcgacg gcggctgtga ccagggcctt gtgcctatcc ccctgctggc caacctgagc     1920

gtggaagccc agcctccatg gctgcctggc ctcgaggcca gatacatggc cttcgctcat     1980

gatctgatgg ccgatgccca gagacaggac agaccttttt tcctgtatta cgccagccac     2040

cacacccact accctcagtt cagcggacag agcttcgccg agcggagcgg cagaggcccc     2100

ttcggcgaca gcctgatgga actggacgcc gctgttggaa ccctgatgac cgccattggc     2160

gatctgggcc tgctcgagga aaccctggtg atcttcaccg ccgataacgg ccctgagaca     2220

atgcggatgt ctagaggcgg ctgcagcggc ctgctgcggt gcggcaaggg caccacctac     2280

gagggcggcg tgcgggaacc cgccctggct ttttggcctg gccacatcgc ccctggcgtt     2340

acccacgagc tggcttctag cctggacctg ctgcccaccc tggccgcact ggccggagct     2400

ccactgccta atgtgaccct ggatggcttc gacctgtccc ctctgctgct cggcaccggc     2460

aagagcccta gacagagcct gttcttctac ccctcctacc ctgatgaggt gcggggcgtc     2520

tttgccgtca ggaccggcaa atacaaggcc catttcttta cacagggcag cgcccactct     2580

gataccacag ccgaccctgc ctgccacgcc agctccagcc tgaccgccca cgagcctcct     2640

ctgctatacg acctgagcaa ggaccctggc gagaactaca acctgctggg tggcgtggcc     2700

ggcgctacac ctgaggtgct gcaggccctg aagcagctgc agctgcttaa ggcccaactg     2760

gacgccgctg tgaccttcgg ccctagccag gtggccagag gagaagatcc cgccctgcaa     2820

atctgctgcc accctggatg tacccctcgg cccgcttgtt gtcactgccc cgaccctcac     2880

gcctgaggta ccaatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt     2940

aactatgttg ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct     3000

attgcttccc gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt     3060

tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac     3120

gcaaccccca ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct     3180

ttccccctcc ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca     3240

ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt     3300

ccttggctgc tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc     3360

ccttcggccc tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct     3420

cttccgcgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg     3480

cgagctcatt tgtgaaattt gtgatgctat tgctttattt gtaaccatta taagctgcaa     3540

taaacaagtt aacaacaaca attgcattca ttttatgttt caggttcagg gggaggtgtg     3600

ggaggttttt tgagtcctag gaggaacccc tagtgatgga gttggccact ccctctctgc     3660

gcgctcgctc gctcactgag gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc     3720

gggcggcctc agtgagcgag cgagcgcgca gagagggagt ggccaa                    3766


<210>  20
<211>  3766
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  UbC-COGS-Hyper (ATP0137)

<400>  20
gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg       60

tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag      120

gggttccttt gtcgactcgg cctccgcgcc gggttttggc gcctcccgcg ggcgcccccc      180

tcctcacggc gagcgctgcc acgtcagacg aagggcgcag cgagcgtcct gatccttccg      240

cccggacgct caggacagcg gcccgctgct cataagactc ggccttagaa ccccagtatc      300

agcagaagga cattttagga cgggacttgg gtgactctag ggcactggtt ttctttccag      360

agagcggaac aggcgaggaa aagtagtccc ttctcggcga ttctgcggag ggatctccgt      420

ggggcggtga acgccgatga ttatataagg acgcgccggg tgtggcacag ctagttccgt      480

cgcagccggg atttgggtcg cagttcttgt ttgtggatcg ctgtgatcgt cacttggtga      540

gtagcgggct gctgggctgg ccggggcttt cgtggccgcc gggccgctcg gtgggacgga      600

ggcgtgtgga gagaccgcca agggctgtag tctgggtccg cgagcaaggt tgccctgaac      660

tgggggttgg ggggagcgca gcaaaatggc ggctgttccc gagtcttgaa tggaagacgc      720

ttgtgaggcg ggctgtgagg tcgttgaaac aaggtggggg gcatggtggg cggcaagaac      780

ccaaggtctt gaggccttcg ctaatgcggg aaagctctta ttcgggtgag atgggctggg      840

gcaccatctg gggaccctga cgtgaagttt gtcactgact ggagaactcg gtttgtcgtc      900

tgttgcgggg gcggcagtta tggcggtgcc gttgggcagt gcacccgtac ctttgggagc      960

gcgcgccctc gtcgtgtcgt gacgtcaccc gttctgttgg cttataatgc agggtggggc     1020

cacctgccgg taggtgtgcg gtaggctttt ctccgtcgca ggacgcaggg ttcgggccaa     1080

gggtaggctc tcctgaatcg acaggcgccg gacctctggt gaggggaggg ataagtgagg     1140

cgtcagtttc tctggtcggt tttatgtacc tatcttctta agtagctgaa gctccggttt     1200

tgaactatgc gctcggggtt ggcgagtgtg ttttgtgaag ttttttaggc accttttgaa     1260

atgtaatcat ttgggtcaat atgtaatttt cagtgttaga ctagtaaatt gtccgctaaa     1320

ttctggccgt ttttggcttt tttgttagac agatctatga gcatgggcgc tcctagaagc     1380

ctgctgctgg ccctggctgc cggcctggcc gtggctagac ctccaaacat cgtgctgatc     1440

ttcgccgacg acctgggcta tggtgacctg ggctgctacg gccacccctc ttctacaaca     1500

cccaatctgg accagctggc cgctggcggc ctgagattca cagacttcta cgtgccagtg     1560

tccctgtgca ccccttctag agccgctctc ctgaccggca gactgcctgt gcggatgggc     1620

atgtaccccg gagtgctggt gcccagcagt agaggaggac tgcctctgga agaggtgacc     1680

gtggccgagg tgctggccgc cagaggctac ctgacaggaa tggccggaaa atggcacctg     1740

ggagtgggcc cagaaggcgc cttcctgcca ccacaccagg gctttcaccg gttcctgggg     1800

atcccttaca gccacgacca aggcccttgt cagaacctga catgcttccc ccccgccaca     1860

ccttgcgacg gcggctgtga ccagggcctt gtgcctatcc ccctgctggc caacctgagc     1920

gtggaagccc agcctccatg gctgcctggc ctcgaggcca gatacgtggc cttcgctcat     1980

gatctgatgg ccgatgccca gagacaggac agaccttttt tcctgtatta cgccagccac     2040

cacacccact accctcagtt cagcggacag agcttcgccg agcggagcgg cagaggcccc     2100

ttcggcgaca gcctgatgga actggacgcc gctgttggaa ccctgatgac cgccattggc     2160

gatctgggcc tgctcgagga aaccctggtg atcttcaccg ccgataacgg ccctgagctg     2220

atgcggatgt ctaacggcgg ctgcagcggc ctgctgcggt gcggcaaggg caccacctac     2280

gagggcggcg tgcgggaacc cgccctggct ttttggcctg gccacatcgc ccctggcgtt     2340

acccacgagc tggcttctag cctggacctg ctgcccaccc tggccgcact ggccggagct     2400

ccactgccta atgtgaccct ggatggcttc gacctgtccc ctctgctgct cggcaccggc     2460

aagagcccta gacagagcct gttcttctac ccctcctacc ctgatgaggt gcggggcgtc     2520

tttgccgtca ggaccggcaa atacaaggcc catttcttta cacagggcag cgcccactct     2580

gataccacag ccgaccctgc ctgccacgcc agctccagcc tgaccgccca cgagcctcct     2640

ctgctatacg acctgagcaa ggaccctggc gagaactaca acctgctggg tggcgtggcc     2700

ggcgctacac ctgaggtgct gcaggccctg aagcagctgc agctgcttaa ggcccaactg     2760

gacgccgctg tgaccttcgg ccctagccag gtggccagag gagaagatcc cgccctgcaa     2820

atctgctgcc accctggatg tacccctcgg cccgcttgtt gtcactgccc cgaccctcac     2880

gcctgaggta ccaatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt     2940

aactatgttg ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct     3000

attgcttccc gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt     3060

tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac     3120

gcaaccccca ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct     3180

ttccccctcc ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca     3240

ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt     3300

ccttggctgc tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc     3360

ccttcggccc tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct     3420

cttccgcgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg     3480

cgagctcatt tgtgaaattt gtgatgctat tgctttattt gtaaccatta taagctgcaa     3540

taaacaagtt aacaacaaca attgcattca ttttatgttt caggttcagg gggaggtgtg     3600

ggaggttttt tgagtcctag gaggaacccc tagtgatgga gttggccact ccctctctgc     3660

gcgctcgctc gctcactgag gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc     3720

gggcggcctc agtgagcgag cgagcgcgca gagagggagt ggccaa                    3766


<210>  21
<211>  3617
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CMV-COGS (ATP0139)

<400>  21
gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg       60

tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag      120

gggttccttt gtcgacctct aggaagacct tcaatattgg ccattagcca tattattcat      180

tggttatata gcataaatca atattggcta ttggccattg catacgttgt atctatatca      240

taatatgtac atttatattg gctcatgtcc aatatgaccg ccatgttgca ttgattattg      300

actagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc      360

cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca      420

ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt      480

caatgggtgg agtatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg      540

ccaagtccgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag      600

tacatgacct tacgggactt tcctacttgg cagtacatct acgtattagt catcgctatt      660

accatggtga tgcggttttg gcagtacacc aatgggcgtg gatagcggtt tgactcacgg      720

ggatttccaa gtctccaccc cattgacgtc aatgggagtt tgttttggca ccaaaatcaa      780

cgggactttc caaaatgtcg taataacccc gccccgttga cgcaaatggg cggtaggcgt      840

gtacggtggg aggtctatat aagcagagct cgtttagtga accgtcagat cactagacac      900

tttgtggcgg tagtttatca cagttaaatt gctaacgcag tcagtgcttc tgacacaaca      960

gtctcgaact taagctgcag aagttggtcg tgaggcactg ggcaggtaag tatcaaggtt     1020

acaagacagg tttaaggcga ccaatagaaa ctgggcttgt cgagacagag aagactcttg     1080

cgtttctgat aggcacctat tggtcttact gacatccact ttgcctttct ctccacaggt     1140

gtccactccc agttcaatta cagctcttaa ggctagagta cttaatacga ctcactatag     1200

gagatctatg agcatgggcg ctcctagaag cctgctgctg gccctggctg ccggcctggc     1260

cgtggctaga cctccaaaca tcgtgctgat cttcgccgac gacctgggct atggtgacct     1320

gggctgctac ggccacccct cttctacaac acccaatctg gaccagctgg ccgctggcgg     1380

cctgagattc acagacttct acgtgccagt gtccctgtgc accccttcta gagccgctct     1440

cctgaccggc agactgcctg tgcggatggg catgtacccc ggagtgctgg tgcccagcag     1500

tagaggagga ctgcctctgg aagaggtgac cgtggccgag gtgctggccg ccagaggcta     1560

cctgacagga atggccggaa aatggcacct gggagtgggc ccagaaggcg ccttcctgcc     1620

accacaccag ggctttcacc ggttcctggg gatcccttac agccacgacc aaggcccttg     1680

tcagaacctg acatgcttcc cccccgccac accttgcgac ggcggctgtg accagggcct     1740

tgtgcctatc cccctgctgg ccaacctgag cgtggaagcc cagcctccat ggctgcctgg     1800

cctcgaggcc agatacatgg ccttcgctca tgatctgatg gccgatgccc agagacagga     1860

cagacctttt ttcctgtatt acgccagcca ccacacccac taccctcagt tcagcggaca     1920

gagcttcgcc gagcggagcg gcagaggccc cttcggcgac agcctgatgg aactggacgc     1980

cgctgttgga accctgatga ccgccattgg cgatctgggc ctgctcgagg aaaccctggt     2040

gatcttcacc gccgataacg gccctgagac aatgcggatg tctagaggcg gctgcagcgg     2100

cctgctgcgg tgcggcaagg gcaccaccta cgagggcggc gtgcgggaac ccgccctggc     2160

tttttggcct ggccacatcg cccctggcgt tacccacgag ctggcttcta gcctggacct     2220

gctgcccacc ctggccgcac tggccggagc tccactgcct aatgtgaccc tggatggctt     2280

cgacctgtcc cctctgctgc tcggcaccgg caagagccct agacagagcc tgttcttcta     2340

cccctcctac cctgatgagg tgcggggcgt ctttgccgtc aggaccggca aatacaaggc     2400

ccatttcttt acacagggca gcgcccactc tgataccaca gccgaccctg cctgccacgc     2460

cagctccagc ctgaccgccc acgagcctcc tctgctatac gacctgagca aggaccctgg     2520

cgagaactac aacctgctgg gtggcgtggc cggcgctaca cctgaggtgc tgcaggccct     2580

gaagcagctg cagctgctta aggcccaact ggacgccgct gtgaccttcg gccctagcca     2640

ggtggccaga ggagaagatc ccgccctgca aatctgctgc caccctggat gtacccctcg     2700

gcccgcttgt tgtcactgcc ccgaccctca cgcctgaggt accaatcaac ctctggatta     2760

caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta cgctatgtgg     2820

atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt tcattttctc     2880

ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca     2940

acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg gcattgccac     3000

cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca cggcggaact     3060

catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca ctgacaattc     3120

cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg ttgccacctg     3180

gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag cggaccttcc     3240

ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc gccctcagac     3300

gagtcggatc tccctttggg ccgcctcccc gcgagctcat ttgtgaaatt tgtgatgcta     3360

ttgctttatt tgtaaccatt ataagctgca ataaacaagt taacaacaac aattgcattc     3420

attttatgtt tcaggttcag ggggaggtgt gggaggtttt ttgagtccta ggaggaaccc     3480

ctagtgatgg agttggccac tccctctctg cgcgctcgct cgctcactga ggccgggcga     3540

ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct cagtgagcga gcgagcgcgc     3600

agagagggag tggccaa                                                    3617


<210>  22
<211>  3617
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CMV-COGS-Hyper (ATP0138)

<400>  22
gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg       60

tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag      120

gggttccttt gtcgacctct aggaagacct tcaatattgg ccattagcca tattattcat      180

tggttatata gcataaatca atattggcta ttggccattg catacgttgt atctatatca      240

taatatgtac atttatattg gctcatgtcc aatatgaccg ccatgttgca ttgattattg      300

actagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc      360

cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca      420

ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt      480

caatgggtgg agtatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg      540

ccaagtccgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag      600

tacatgacct tacgggactt tcctacttgg cagtacatct acgtattagt catcgctatt      660

accatggtga tgcggttttg gcagtacacc aatgggcgtg gatagcggtt tgactcacgg      720

ggatttccaa gtctccaccc cattgacgtc aatgggagtt tgttttggca ccaaaatcaa      780

cgggactttc caaaatgtcg taataacccc gccccgttga cgcaaatggg cggtaggcgt      840

gtacggtggg aggtctatat aagcagagct cgtttagtga accgtcagat cactagacac      900

tttgtggcgg tagtttatca cagttaaatt gctaacgcag tcagtgcttc tgacacaaca      960

gtctcgaact taagctgcag aagttggtcg tgaggcactg ggcaggtaag tatcaaggtt     1020

acaagacagg tttaaggcga ccaatagaaa ctgggcttgt cgagacagag aagactcttg     1080

cgtttctgat aggcacctat tggtcttact gacatccact ttgcctttct ctccacaggt     1140

gtccactccc agttcaatta cagctcttaa ggctagagta cttaatacga ctcactatag     1200

gagatctatg agcatgggcg ctcctagaag cctgctgctg gccctggctg ccggcctggc     1260

cgtggctaga cctccaaaca tcgtgctgat cttcgccgac gacctgggct atggtgacct     1320

gggctgctac ggccacccct cttctacaac acccaatctg gaccagctgg ccgctggcgg     1380

cctgagattc acagacttct acgtgccagt gtccctgtgc accccttcta gagccgctct     1440

cctgaccggc agactgcctg tgcggatggg catgtacccc ggagtgctgg tgcccagcag     1500

tagaggagga ctgcctctgg aagaggtgac cgtggccgag gtgctggccg ccagaggcta     1560

cctgacagga atggccggaa aatggcacct gggagtgggc ccagaaggcg ccttcctgcc     1620

accacaccag ggctttcacc ggttcctggg gatcccttac agccacgacc aaggcccttg     1680

tcagaacctg acatgcttcc cccccgccac accttgcgac ggcggctgtg accagggcct     1740

tgtgcctatc cccctgctgg ccaacctgag cgtggaagcc cagcctccat ggctgcctgg     1800

cctcgaggcc agatacgtgg ccttcgctca tgatctgatg gccgatgccc agagacagga     1860

cagacctttt ttcctgtatt acgccagcca ccacacccac taccctcagt tcagcggaca     1920

gagcttcgcc gagcggagcg gcagaggccc cttcggcgac agcctgatgg aactggacgc     1980

cgctgttgga accctgatga ccgccattgg cgatctgggc ctgctcgagg aaaccctggt     2040

gatcttcacc gccgataacg gccctgagct gatgcggatg tctaacggcg gctgcagcgg     2100

cctgctgcgg tgcggcaagg gcaccaccta cgagggcggc gtgcgggaac ccgccctggc     2160

tttttggcct ggccacatcg cccctggcgt tacccacgag ctggcttcta gcctggacct     2220

gctgcccacc ctggccgcac tggccggagc tccactgcct aatgtgaccc tggatggctt     2280

cgacctgtcc cctctgctgc tcggcaccgg caagagccct agacagagcc tgttcttcta     2340

cccctcctac cctgatgagg tgcggggcgt ctttgccgtc aggaccggca aatacaaggc     2400

ccatttcttt acacagggca gcgcccactc tgataccaca gccgaccctg cctgccacgc     2460

cagctccagc ctgaccgccc acgagcctcc tctgctatac gacctgagca aggaccctgg     2520

cgagaactac aacctgctgg gtggcgtggc cggcgctaca cctgaggtgc tgcaggccct     2580

gaagcagctg cagctgctta aggcccaact ggacgccgct gtgaccttcg gccctagcca     2640

ggtggccaga ggagaagatc ccgccctgca aatctgctgc caccctggat gtacccctcg     2700

gcccgcttgt tgtcactgcc ccgaccctca cgcctgaggt accaatcaac ctctggatta     2760

caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta cgctatgtgg     2820

atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt tcattttctc     2880

ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca     2940

acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg gcattgccac     3000

cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca cggcggaact     3060

catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca ctgacaattc     3120

cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg ttgccacctg     3180

gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag cggaccttcc     3240

ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc gccctcagac     3300

gagtcggatc tccctttggg ccgcctcccc gcgagctcat ttgtgaaatt tgtgatgcta     3360

ttgctttatt tgtaaccatt ataagctgca ataaacaagt taacaacaac aattgcattc     3420

attttatgtt tcaggttcag ggggaggtgt gggaggtttt ttgagtccta ggaggaaccc     3480

ctagtgatgg agttggccac tccctctctg cgcgctcgct cgctcactga ggccgggcga     3540

ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct cagtgagcga gcgagcgcgc     3600

agagagggag tggccaa                                                    3617


