                         SEQUENCE LISTING

<110>  The Regents of the University of California
       Yang, Xiangdong W.
       Lee, Chung-Ying
       Wang, Nan
 
<120>  A Cell-Based Seeding Assay for Huntingtin Aggregation

<130>  206030-0124-00-WO.607878

<150>  US 62/571,443
<151>  2017-10-12

<160>  6     

<170>  PatentIn version 3.5

<210>  1
<211>  1065
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Chemically Synthesized, HTTex1-46Q-EGFP nucleic acid sequence

<400>  1
atggcgaccc tggaaaagct gatgaaggcc ttcgagtccc tcaaaagctt ccaacagcag       60

caacagcaac aacagcagca acagcaacaa cagcagcaac agcaacaaca gcagcaacag      120

cagcaacagc aacaacagca gcaacagcaa caacagcagc aacagcaaca acagcagcaa      180

cagcaacaac cgccaccacc tccccctcca cccccacctc ctcaacttcc tcaacctcct      240

ccacaggcac agcctctgct gcctcagcca caacctcctc cacctccacc tccacctcct      300

ccaggcccag ctgtggctga ggagcctctg caccgacctg gatccctggt gagcaagggc      360

gaggagctgt tcaccggggt ggtgcccatc ctggtcgagc tggacggcga cgtaaacggc      420

cacaagttca gcgtgtccgg cgagggcgag ggcgatgcca cctacggcaa gctgaccctg      480

aagttcatct gcaccaccgg caagctgccc gtgccctggc ccaccctcgt gaccaccctg      540

acctacggcg tgcagtgctt cagccgctac cccgaccaca tgaagcagca cgacttcttc      600

aagtccgcca tgcccgaagg ctacgtccag gagcgcacca tcttcttcaa ggacgacggc      660

aactacaaga cccgcgccga ggtgaagttc gagggcgaca ccctggtgaa ccgcatcgag      720

ctgaagggca tcgacttcaa ggaggacggc aacatcctgg ggcacaagct ggagtacaac      780

tacaacagcc acaacgtcta tatcatggcc gacaagcaga agaacggcat caaggtgaac      840

ttcaagatcc gccacaacat cgaggacggc agcgtgcagc tcgccgacca ctaccagcag      900

aacaccccca tcggcgacgg ccccgtgctg ctgcccgaca accactacct gagcacccag      960

tccgccctga gcaaagaccc caacgagaag cgcgatcaca tggtcctgct ggagttcgtg     1020

accgccgccg ggatcactct cggcatggac gagctgtaca agtaa                     1065


<210>  2
<211>  354
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Chemically Synthesized, HTTex1-46Q-EGFP amino acid sequence

<400>  2

Met Ala Thr Leu Glu Lys Leu Met Lys Ala Phe Glu Ser Leu Lys Ser 
1               5                   10                  15      


Phe Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
            20                  25                  30          


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
        35                  40                  45              


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Pro 
    50                  55                  60                  


Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Gln Leu Pro Gln Pro Pro 
65                  70                  75                  80  


Pro Gln Ala Gln Pro Leu Leu Pro Gln Pro Gln Pro Pro Pro Pro Pro 
                85                  90                  95      


Pro Pro Pro Pro Pro Gly Pro Ala Val Ala Glu Glu Pro Leu His Arg 
            100                 105                 110         


Pro Gly Ser Leu Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 
        115                 120                 125             


Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 
    130                 135                 140                 


Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
145                 150                 155                 160 


Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 
                165                 170                 175     


Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp 
            180                 185                 190         


His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 
        195                 200                 205             


Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 
    210                 215                 220                 


Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu 
225                 230                 235                 240 


Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys 
                245                 250                 255     


Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys 
            260                 265                 270         


Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu 
        275                 280                 285             


Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile 
    290                 295                 300                 


Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln 
305                 310                 315                 320 


Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
                325                 330                 335     


Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu 
            340                 345                 350         


Tyr Lys 
        


<210>  3
<211>  1017
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Chemically Synthesized, HTTex1-46Q-EGFP nucleic acid sequence 
       lacking the sequence encoding residues 2-17 of SEQ ID NO:2 (N17)

<400>  3
atgcaacagc agcaacagca acaacagcag caacagcaac aacagcagca acagcaacaa       60

cagcagcaac agcagcaaca gcaacaacag cagcaacagc aacaacagca gcaacagcaa      120

caacagcagc aacagcaaca accgccacca cctccccctc cacccccacc tcctcaactt      180

cctcaacctc ctccacaggc acagcctctg ctgcctcagc cacaacctcc tccacctcca      240

cctccacctc ctccaggccc agctgtggct gaggagcctc tgcaccgacc tggatccctg      300

gtgagcaagg gcgaggagct gttcaccggg gtggtgccca tcctggtcga gctggacggc      360

gacgtaaacg gccacaagtt cagcgtgtcc ggcgagggcg agggcgatgc cacctacggc      420

aagctgaccc tgaagttcat ctgcaccacc ggcaagctgc ccgtgccctg gcccaccctc      480

gtgaccaccc tgacctacgg cgtgcagtgc ttcagccgct accccgacca catgaagcag      540

cacgacttct tcaagtccgc catgcccgaa ggctacgtcc aggagcgcac catcttcttc      600

aaggacgacg gcaactacaa gacccgcgcc gaggtgaagt tcgagggcga caccctggtg      660

aaccgcatcg agctgaaggg catcgacttc aaggaggacg gcaacatcct ggggcacaag      720

ctggagtaca actacaacag ccacaacgtc tatatcatgg ccgacaagca gaagaacggc      780

atcaaggtga acttcaagat ccgccacaac atcgaggacg gcagcgtgca gctcgccgac      840

cactaccagc agaacacccc catcggcgac ggccccgtgc tgctgcccga caaccactac      900

ctgagcaccc agtccgccct gagcaaagac cccaacgaga agcgcgatca catggtcctg      960

ctggagttcg tgaccgccgc cgggatcact ctcggcatgg acgagctgta caagtaa        1017


<210>  4
<211>  338
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Chemically Synthesized, HTTex1-46Q-EGFP amino acid sequence 
       lacking residues 2-17 of SEQ ID NO:2 (N17)

<400>  4

Met Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
1               5                   10                  15      


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
            20                  25                  30          


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Pro 
        35                  40                  45              


Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Gln Leu Pro Gln Pro Pro 
    50                  55                  60                  


Pro Gln Ala Gln Pro Leu Leu Pro Gln Pro Gln Pro Pro Pro Pro Pro 
65                  70                  75                  80  


Pro Pro Pro Pro Pro Gly Pro Ala Val Ala Glu Glu Pro Leu His Arg 
                85                  90                  95      


Pro Gly Ser Leu Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 
            100                 105                 110         


Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 
        115                 120                 125             


Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
    130                 135                 140                 


Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 
145                 150                 155                 160 


Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp 
                165                 170                 175     


His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 
            180                 185                 190         


Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 
        195                 200                 205             


Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu 
    210                 215                 220                 


Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys 
225                 230                 235                 240 


Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys 
                245                 250                 255     


Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu 
            260                 265                 270         


Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile 
        275                 280                 285             


Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln 
    290                 295                 300                 


Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
305                 310                 315                 320 


Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu 
                325                 330                 335     


Tyr Lys 
        


<210>  5
<211>  297
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Chemically Synthesized, nucleic acid sequence for a fragment of 
       exon 1 of HTT encoding a 46 polyQ region and a proline rich 
       region

<400>  5
caacagcagc aacagcaaca acagcagcaa cagcaacaac agcagcaaca gcaacaacag       60

cagcaacagc agcaacagca acaacagcag caacagcaac aacagcagca acagcaacaa      120

cagcagcaac agcaacaacc gccaccacct ccccctccac ccccacctcc tcaacttcct      180

caacctcctc cacaggcaca gcctctgctg cctcagccac aacctcctcc acctccacct      240

ccacctcctc caggcccagc tgtggctgag gagcctctgc accgacctgg atccctg         297


<210>  6
<211>  99
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Chemically Synthesized, amino acid sequence for a fragment of 
       exon 1 of HTT comprising a 46 polyQ region and a proline rich 
       region

<400>  6

Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
1               5                   10                  15      


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
            20                  25                  30          


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Pro Pro 
        35                  40                  45              


Pro Pro Pro Pro Pro Pro Pro Pro Pro Gln Leu Pro Gln Pro Pro Pro 
    50                  55                  60                  


Gln Ala Gln Pro Leu Leu Pro Gln Pro Gln Pro Pro Pro Pro Pro Pro 
65                  70                  75                  80  


Pro Pro Pro Pro Gly Pro Ala Val Ala Glu Glu Pro Leu His Arg Pro 
                85                  90                  95      


Gly Ser Leu 
            


