                         SEQUENCE LISTING

<110>  The Regents of the University of California
       Devaraj, Neal K.
       Alexander, Seth C.
 
<120>  ENZYMATIC SITE SPECIFIC LABELING OF RNA WITH UNNATURAL 
       NUCLEOBASES

<130>  48537-547001WO

<150>  US 62/127,596
<151>  2015-03-03

<150>  US 62/214,074
<151>  2015-09-03

<160>  8     

<170>  PatentIn version 3.5

<210>  1
<211>  1311
<212>  DNA
<213>  Escherichia coli

<400>  1
ctgcagggcc agtgaattcg agctcggtac ctcgcgaatg catctagata tcggatccta       60

atacgactca ctatagggaa taattttgtt taactttaag aaggagatat aatgaaattt      120

gaactggaca ccaccgacgg tcgcgcacgc cgtggccgcc tggtctttga tcgtggcgta      180

gtggaaacgc cttgttttat gcctgttggc acctacggca ccgtaaaagg gatgacgccg      240

gaagaagttg aagccactgg cgcgcaaatt atcctcggca acaccttcca cctgtggctg      300

cgcccgggcc aggaaatcat gaaactgcac ggcgatctgc acgattttat gcagtggaag      360

gggccgatcc tcaccgactc cggcggcttc caggtcttca gccttggcga tattcgtaaa      420

atcaccgaac agggcgtgca cttccgtaac ccgatcaacg gcgatccgat tttcctcgat      480

cctgaaaaat caatggagat tcagtacgat cttggttcgg atatcgtcat gatctttgat      540

gagtgtacgc cgtatcctgc tgactgggat tacgcaaaac gctccatgga gatgtctctg      600

cgttgggcga agcgtagccg tgagcgtttt gacagtctcg gaaacaaaaa tgcgctgttt      660

ggtatcatcc agggcagcgt ttacgaagat ttacgtgata tttctgttaa aggtctggta      720

gatatcggtt ttgatggcta cgctgtcggc ggtctggctg tgggtgagcc gaaagcagat      780

atgcaccgca ttctggagca tgtatgcccg caaattccgg cagacaaacc gcgttacctg      840

atgggcgttg gtaaaccaga agacctggtt gaaggcgtac gtcgtggtat cgatatgttt      900

gactgcgtaa tgccaacccg caacgcccga aatggtcatt tgttcgtgac cgatggcgtg      960

gtgaaaatcc gcaatgcgaa gtataagagc gatactggcc cactcgatcc tgagtgtgat     1020

tgctacacct gtcgcaatta ttcacgcgct tacttgcatc atcttgaccg ttgcaacgaa     1080

atattaggcg cgcgactcaa caccattcat aaccttcgtt actaccagcg tttgatggcg     1140

ggtttacgca aggctattga agagggtaaa ttagagagct tcgtaactga tttttaccag     1200

cgtcaggggc gagaagtacc acctttgaac gttgatcacc atcaccacca tcactaaaaa     1260

ggcgggcctc gagcaaagcc cgccgaaagg cgggcttttc tgtgtaagct t              1311


<210>  2
<211>  897
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  2
gatatctaat acgactcact atagggaata attttgttta actttaagaa ggagatataa       60

tggtgagcaa gggcgaggag gataacatgg ccatcatcaa ggagttcatg cgcttcaagg      120

tgcacatgga gggctccgtg aacggccacg agttcgagat cgagggcgag ggcgagggcc      180

gcccctacga gggcacccag accgccaagc tgaaggtgac caagggtggc cccctgccct      240

tcgcctggga catcctgtcc cctcagttca tgtacggctc caaggcctac gtgaagcacc      300

ccgccgacat ccccgactac ttgaagctgt ccttccccga gggcttcaag tgggagcgcg      360

tgatgaactt cgaggacggc ggcgtggtga ccgtgaccca ggactcctcc ctgcaggacg      420

gcgagttcat ctacaaggtg aagctgcgcg gcaccaactt cccctccgac ggccccgtaa      480

tgcagaagaa gaccatgggc tgggaggcct cctccgagcg gatgtacccc gaggacggcg      540

ccctgaaggg cgagatcaag cagaggctga agctgaagga cggcggccac tacgacgctg      600

aggtcaagac cacctacaag gccaagaagc ccgtgcagct gcccggcgcc tacaacgtca      660

acatcaagtt ggacatcacc tcccacaacg aggactacac catcgtggaa cagtacgaac      720

gcgccgaggg ccgccactcc accggcggca tggacgagct gtacaagtaa ccccatgtat      780

ctaaatcagc acccatcatt ttcatatccc cgcagactgt aaatctgccc ccatgtatct      840

aaatcagcac ccatcatttt catatccccc gaaaggcggg cttttctgtg tctcgag         897


<210>  3
<211>  25
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  3
cgcagactct aaatctgccc ccatg                                             25


<210>  4
<211>  1097
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  4
gatatctaat acgactcact atagggaata attttgttta actttaagaa ggagatataa       60

tggtgagcaa gggcgaggag gataacatgg ccatcatcaa ggagttcatg cgcttcaagg      120

tgcacatgga gggctccgtg aacggccacg agttcgagat cgagggcgag ggcgagggcc      180

gcccctacga gggcacccag accgccaagc tgaaggtgac caagggtggc cccctgccct      240

tcgcctggga catcctgtcc cctcagttca tgtacggctc caaggcctac gtgaagcacc      300

ccgccgacat ccccgactac ttgaagctgt ccttccccga gggcttcaag tgggagcgcg      360

tgatgaactt cgaggacggc ggcgtggtga ccgtgaccca ggactcctcc ctgcaggacg      420

gcgagttcat ctacaaggtg aagctgcgcg gcaccaactt cccctccgac ggccccgtaa      480

tgcagaagaa gaccatgggc tgggaggcct cctccgagcg gatgtacccc gaggacggcg      540

ccctgaaggg cgagatcaag cagaggctga agctgaagga cggcggccac tacgacgctg      600

aggtcaagac cacctacaag gccaagaagc ccgtgcagct gcccggcgcc tacaacgtca      660

acatcaagtt ggacatcacc tcccacaacg aggactacac catcgtggaa cagtacgaac      720

gcgccgaggg ccgccactcc accggcggca tggacgagct gtacaagtaa ccccatgtat      780

ctaaatcagc acccatcatt ttcatatccc cgcagactct aaatctgccc ccatgcgcag      840

actctaaatc tgcccccatg cgcagactct aaatctgccc ccatgcgcag actctaaatc      900

tgcccccatg cgcagactct aaatctgccc ccatgcgcag actctaaatc tgcccccatg      960

cgcagactct aaatctgccc ccatgcgcag actctaaatc tgcccccatg cgcagactct     1020

aaatctgccc ccatgtatct aaatcagcac ccatcatttt catatccccc gaaaggcggg     1080

cttttctgtg tctcgag                                                    1097


<210>  5
<211>  17
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide


<220>
<221>  misc_feature
<222>  (7)..(7)
<223>  Residue is a, u, g, c, or p

<220>
<221>  misc_feature
<222>  (9)..(9)
<223>  Residue is a, u, g, c, or p

<400>  5
gcagacngna aaucugc                                                      17


<210>  6
<211>  25
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  6
gggagcagac uguaaaucug cuccc                                             25


<210>  7
<211>  17
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide


<220>
<221>  misc_feature
<222>  (7)..(7)
<223>  Residue is a, u, g, c, or p

<220>
<221>  misc_feature
<222>  (8)..(8)
<223>  Residue is a, u, g, c, q, or PreQ1, wherein PreQ1 is optionally 
       modified

<220>
<221>  misc_feature
<222>  (9)..(9)
<223>  Residue is a, u, g, c, or p

<400>  7
gcagacnnna aaucugc                                                      17


<210>  8
<211>  381
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic polypeptide

<400>  8

Met Lys Phe Glu Leu Asp Thr Thr Asp Gly Arg Ala Arg Arg Gly Arg 
1               5                   10                  15      


Leu Val Phe Asp Arg Gly Val Val Glu Thr Pro Cys Phe Met Pro Val 
            20                  25                  30          


Gly Thr Tyr Gly Thr Val Lys Gly Met Thr Pro Glu Glu Val Glu Ala 
        35                  40                  45              


Thr Gly Ala Gln Ile Ile Leu Gly Asn Thr Phe His Leu Trp Leu Arg 
    50                  55                  60                  


Pro Gly Gln Glu Ile Met Lys Leu His Gly Asp Leu His Asp Phe Met 
65                  70                  75                  80  


Gln Trp Lys Gly Pro Ile Leu Thr Asp Ser Gly Gly Phe Gln Val Phe 
                85                  90                  95      


Ser Leu Gly Asp Ile Arg Lys Ile Thr Glu Gln Gly Val His Phe Arg 
            100                 105                 110         


Asn Pro Ile Asn Gly Asp Pro Ile Phe Leu Asp Pro Glu Lys Ser Met 
        115                 120                 125             


Glu Ile Gln Tyr Asp Leu Gly Ser Asp Ile Val Met Ile Phe Asp Glu 
    130                 135                 140                 


Cys Thr Pro Tyr Pro Ala Asp Trp Asp Tyr Ala Lys Arg Ser Met Glu 
145                 150                 155                 160 


Met Ser Leu Arg Trp Ala Lys Arg Ser Arg Glu Arg Phe Asp Ser Leu 
                165                 170                 175     


Gly Asn Lys Asn Ala Leu Phe Gly Ile Ile Gln Gly Ser Val Tyr Glu 
            180                 185                 190         


Asp Leu Arg Asp Ile Ser Val Lys Gly Leu Val Asp Ile Gly Phe Asp 
        195                 200                 205             


Gly Tyr Ala Val Gly Gly Leu Ala Val Gly Glu Pro Lys Ala Asp Met 
    210                 215                 220                 


His Arg Ile Leu Glu His Val Cys Pro Gln Ile Pro Ala Asp Lys Pro 
225                 230                 235                 240 


Arg Tyr Leu Met Gly Val Gly Lys Pro Glu Asp Leu Val Glu Gly Val 
                245                 250                 255     


Arg Arg Gly Ile Asp Met Phe Asp Cys Val Met Pro Thr Arg Asn Ala 
            260                 265                 270         


Arg Asn Gly His Leu Phe Val Thr Asp Gly Val Val Lys Ile Arg Asn 
        275                 280                 285             


Ala Lys Tyr Lys Ser Asp Thr Gly Pro Leu Asp Pro Glu Cys Asp Cys 
    290                 295                 300                 


Tyr Thr Cys Arg Asn Tyr Ser Arg Ala Tyr Leu His His Leu Asp Arg 
305                 310                 315                 320 


Cys Asn Glu Ile Leu Gly Ala Arg Leu Asn Thr Ile His Asn Leu Arg 
                325                 330                 335     


Tyr Tyr Gln Arg Leu Met Ala Gly Leu Arg Lys Ala Ile Glu Glu Gly 
            340                 345                 350         


Lys Leu Glu Ser Phe Val Thr Asp Phe Tyr Gln Arg Gln Gly Arg Glu 
        355                 360                 365             


Val Pro Pro Leu Asn Val Asp His His His His His His 
    370                 375                 380     


