                         SEQUENCE LISTING

<110>  University of Rochester
       Anderson, Douglas Matthew
 
<120>  Ribozyme-mediated RNA Assembly and Expression

<130>  204606-0127-00WO

<150>  US 62/971,356
<151>  2020-02-07

<160>  130   

<170>  PatentIn version 3.5

<210>  1
<211>  302
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Nt-GFP

<400>  1
auggugagca agggcgagga gcuguucacc gggguggugc ccauccuggu cgagcuggac       60

ggcgacguaa acggccacaa guucagcgug uccggcgagg gcgagggcga ugccaccuac      120

ggcaagcuga cccugaaguu caucugcacc accggcaagc ugcccgugcc cuggcccacc      180

cucgugacca cccugaccua cggcgugcag ugcuucagcc gcuaccccga ccacaugaag      240

cagcacgacu ucuucaaguc cgccaugccc gaaggcuacg uccaggagcg caccaucuuc      300

uu                                                                     302


<210>  2
<211>  421
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Ct-GFP

<400>  2
caaggacgac ggcaacuaca agacccgcgc cgaggugaag uucgagggcg acacccuggu       60

gaaccgcauc gagcugaagg gcaucgacuu caaggaggac ggcaacaucc uggggcacaa      120

gcuggaguac aacuacaaca gccacaacgu cuauaucaug gccgacaagc agaagaacgg      180

caucaaggug aacuucaaga uccgccacaa caucgaggac ggcagcgugc agcucgccga      240

ccacuaccag cagaacaccc ccaucggcga cggccccgug cugcugcccg acaaccacua      300

ccugagcacc caguccgccc ugagcaaaga ccccaacgag aagcgcgauc acaugguccu      360

gcuggaguuc gugaccgccg ccgggaucac ucucggcaug gacgagcugu acaaguagua      420

a                                                                      421


<210>  3
<211>  831
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Nt-Luciferase

<400>  3
auggaagacg ccaaaaacau aaagaaaggc ccggcgccau ucuauccgcu ggaagaugga       60

accgcuggag agcaacugca uaaggcuaug aagagauacg cccugguucc uggaacaauu      120

gcuuuuacag augcacauau cgagguggac aucacuuacg cugaguacuu cgaaaugucc      180

guucgguugg cagaagcuau gaaacgauau gggcugaaua caaaucacag aaucgucgua      240

ugcagugaaa acucucuuca auucuuuaug ccgguguugg gcgcguuauu uaucggaguu      300

gcaguugcgc ccgcgaacga cauuuauaau gaacgugaau ugcucaacag uaugggcauu      360

ucgcagccua ccgugguguu cguuuccaaa aagggguugc aaaaaauuuu gaacgugcaa      420

aaaaagcucc caaucaucca aaaaauuauu aucauggauu cuaaaacgga uuaccaggga      480

uuucagucga uguacacguu cgucacaucu caucuaccuc ccgguuuuaa ugaauacgau      540

uuugugccag aguccuucga uagggacaag acaauugcac ugaucaugaa cuccucugga      600

ucuacugguc ugccuaaagg ugucgcucug ccucauagaa cugccugcgu gagauucucg      660

caugccagag auccuauuuu uggcaaucaa aucauuccgg auacugcgau uuuaaguguu      720

guuccauucc aucacgguuu uggaauguuu acuacacucg gauauuugau auguggauuu      780

cgagucgucu uaauguauag auuugaagaa gagcuguuuc ugaggagccu u               831


<210>  4
<211>  825
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Ct-Luciferase

<400>  4
caggauuaca agauucaaag ugcgcugcug gugccaaccc uauucuccuu cuucgccaaa       60

agcacucuga uugacaaaua cgauuuaucu aauuuacacg aaauugcuuc ugguggcgcu      120

ccccucucua aggaagucgg ggaagcgguu gccaagaggu uccaucugcc agguaucagg      180

caaggauaug ggcucacuga gacuacauca gcuauucuga uuacacccga gggggaugau      240

aaaccgggcg cggucgguaa aguuguucca uuuuuugaag cgaagguugu ggaucuggau      300

accgggaaaa cgcugggcgu uaaucaaaga ggcgaacugu gugugagagg uccuaugauu      360

auguccgguu auguaaacaa uccggaagcg accaacgccu ugauugacaa ggauggaugg      420

cuacauucug gagacauagc uuacugggac gaagacgaac acuucuucau cguugaccgc      480

cugaagucuc ugauuaagua caaaggcuau cagguggcuc ccgcugaauu ggaauccauc      540

uugcuccaac accccaacau cuucgacgca ggugucgcag gucuucccga cgaugacgcc      600

ggugaacuuc ccgccgccgu uguuguuuug gagcacggaa agacgaugac ggaaaaagag      660

aucguggauu acgucgccag ucaaguaaca accgcgaaaa aguugcgcgg aggaguugug      720

uuuguggacg aaguaccgaa aggucuuacc ggaaaacucg acgcaagaaa aaucagagag      780

auccucauaa aggccaagaa gggcggaaag aucgccgugu aguaa                      825


<210>  5
<211>  543
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  N1L

<400>  5
atgggtcagg ccaatacgcc ctggagcagt aaggcaaacg cggatgcctt tataaattca       60

ttcatcagtg cagcatccaa tactggttcc ttctctcaag accaaatgga ggacatgtca      120

ctcatcggca atactctgat ggctgccatg gacaatatgg gaggccgcat aacaccatct      180

aagttgcagg cgttggatat ggccttcgca tcatcagtgg ccgagatcgc ggctagtgag      240

ggcggcgact tgggagtcac taccaacgcg atcgcggatg ccctcacttc tgctttttat      300

caaacgaccg gggttgtcaa ttcacgattc atatctgaga tcaggagcct cataggaatg      360

ttcgcgcagg cttccgcaaa tgacgtttat gcatctgctg gctctggcag cgggggtggt      420

gggtatggag ccagctcagc atctgcggct tctgcaagtg ctgctgcccc gagtggcgta      480

gcttatcagg ctcctgctca ggctcaaatc agttttacgt tgcgagggca acaacctgtt      540

tcc                                                                    543


<210>  6
<211>  132
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AQ

<400>  6
ggtccttatg gacccggtgc tagcgctgcg gcagcagccg ctggcggtta tggcccaggt       60

tcagggcaac aggggcctgg gcaacaagga cctggccaac aaggtcctgg tcagcagggt      120

ccagggcagc ag                                                          132


<210>  7
<211>  450
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  NR3

<400>  7
ggcgctgctt ccgctgcagt atcagtaggt ggctatggac ctcaatctag tagcgcccct       60

gttgcctctg ccgccgcatc tcgactttca agtcccgccg ctagttccag ggtcagttcc      120

gcggtatcta gcttggtaag tagcggaccc actaatcaag cggcactttc aaacacaata      180

tcctcagtag tcagtcaagt aagcgcatca aaccctggct tgtcagggtg tgacgttctg      240

gttcaggcac ttctggaagt tgtctcagcg ttggtaagca tcctgggtag ctcctccata      300

ggtcaaatta attatggcgc gagcgcccaa tacacacaaa tggtgggtca gagtgtggcg      360

caggcactcg caggcgacta caaggatcat gacggagact ataaggatca tgatatagat      420

tacaaggacg atgatgacaa ggcctagtaa                                       450


<210>  8
<211>  453
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Nt-4xMTS

<400>  8
augagugugu ugacgccguu gcuucugcga gggcuuaccg ggucugcuag aagacuuccg       60

guccccaggg ccaagauaca uagccucgga gacccgaugu cugugcucac uccucugcuu      120

uugcgaggac ugacuggguc cgccagacga cucccggugc cgagagcuaa aauccauagc      180

cugggaaaau uggcaacuau gucaguccug acgccgcuuc uucuccgggg ucuuacaggg      240

ucugcaagaa ggcugccugu accucgggcg aaaauucaua gcuugggcga cccgaugagu      300

guauugacgc cccuguugcu gagaggauug acugggucag cgcgccggcu cccugucccc      360

cgagcuaaga uucacucccu ugguaagcug agaauccucc aaucaacggu uccgagagca      420

agagauccgc cggucgccac gaggccucuc gag                                   453


<210>  9
<211>  68
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  HDV68

<400>  9
ggccggcaug gucccagccu ccucgcuggc gccggcuggg caacaugcuu cggcauggcg       60

aaugggac                                                                68


<210>  10
<211>  67
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  HDV67

<400>  10
gggucggcau ggcaucucca ccuccucgcg guccgaccug ggcuacuucg guaggcuaag       60

ggagaag                                                                 67


<210>  11
<211>  56
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  HDV56

<400>  11
gagggauagu acagagccuc cccguggcuc ccuuggauaa ccaacugaua cuguac           56


<210>  12
<211>  87
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Genomic HDV

<400>  12
ggccggcaug gucccagccu ccucgcuggc gccggcuggg caacauuccg aggggaccgu       60

ccccucggua auggcgaaug ggaccca                                           87


<210>  13
<211>  91
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Antigenomic HDV

<400>  13
gggucggcau ggcaucucca ccuccucgcg guccgaccug ggcauccgaa ggaggacgca       60

cguccacucg gauggcuaag ggagagccac u                                      91


<210>  14
<211>  144
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  VS Ribozyme

<400>  14
gcgguaguaa gcagggaacu caccuccaau uucaguacug aaauugucgu agcaguugac       60

uacuguuaug ugauugguag aggcuaagug acgguauugg cguaagucag uauugcagca      120

cagcacaagc ccgcuugcga gaau                                             144


<210>  15
<211>  21
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  VS-S

<400>  15
gaagggcguc gucgccccga g                                                 21


<210>  16
<211>  144
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  VS-Rz

<400>  16
gcgguaguaa gcagggaacu caccuccaau uucaguacug aaauugucgu agcaguugac       60

uacuguuaug ugauugguag aggcuaagug acgguauugg cguaagucag uauugcagca      120

cagcacaagc ccgcuugcga gaau                                             144


<210>  17
<211>  291
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Nt-DTA

<400>  17
auggaccccg acgacguggu ggacagcagc aagagcuucg ugauggagaa cuucagcagc       60

uaccacggca ccaagcccgg cuacguggac agcauccaga agggcaucca gaagcccaag      120

agcggcaccc agggcaacua cgacgacgac uggaagggcu ucuacagcac cgacaacaag      180

uacgacgcug ccggcuacag cguggacaac gagaaccccc ugagcggcaa ggccggcggc      240

guggugaagg ugaccuaccc cggccugacc aaggugcugg cccugaaggu g               291


<210>  18
<211>  297
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Ct-DTA

<400>  18
gacaaugccg agaccaucaa gaaggagcug ggccugagcc ugaccgagcc ccugauggag       60

caggugggca ccgaggaguu caucaagaga uucggcgacg gcgccagcag aguggugcug      120

agccugcccu ucgccgaggg cagcagcagc guggaguaca ucaacaacug ggagcaggcc      180

aaggcccuga gcguggagcu ggagaucaac uucgagacca gaggcaagag aggccaggac      240

gccauguacg aguacauggc ccaggcuugc gccggcaaca gagugagaag auaguaa         297


<210>  19
<211>  717
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  GFPcdn (no start ATG codon)

<400>  19
guuagcaagg gcgaggagcu cuucaccggg gucgucccca uccucgucga gcucgacggc       60

gacguaaacg gccacaaguu cagcgucucc ggcgagggcg agggcgaugc caccuacggc      120

aagcucaccc ugaaguucau cugcaccacc ggcaagcugc ccgugcccug gcccacccuc      180

gugaccaccc ugaccuacgg cgugcagugc uucagccgcu accccgacca caugaagcag      240

cacgacuucu ucaaguccgc caugcccgaa ggcuacgucc aggagcgcac caucuucuuc      300

aaggacgacg gcaacuacaa gacccgcgcc gaggugaagu ucgagggcga cacccuggug      360

aaccgcaucg agcugaaggg caucgacuuc aaggaggacg gcaacauccu ggggcacaag      420

cuggaguaca acuacaacag ccacaacguc uauaucaugg ccgacaagca gaagaacggc      480

aucaagguga acuucaagau ccgccacaac aucgaggacg gcagcgugca gcucgccgac      540

cacuaccagc agaacacccc caucggcgac ggccccgugc ugcugcccga caaccacuac      600

cugagcaccc aguccgcccu gagcaaagac cccaacgaga agcgcgauca caugguccug      660

cuggaguucg ugaccgccgc cgggaucacu cucggcaugg acgagcugua caaguag         717


<210>  20
<211>  131
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  F2-Myr

<400>  20
auggguuguu guuucagcaa gacagcggcg aaaggugaag cagcagcaga aagaccaggc       60

gaggcugcgg uagcaucaag ucccuccaag gcuaaugggc aggaaaacgg acacgucaaa      120

guuggaagcg u                                                           131


<210>  21
<211>  685
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  F2-RFP

<400>  21
agccaucauc aaggaguuca ugcgcuucaa ggugcacaug gagggcuccg ugaacggcca       60

cgaguucgag aucgagggcg agggcgaggg ccgccccuac gagggcaccc agaccgccaa      120

gcugaaggug accaagggug gcccccugcc cuucgccugg gacauccugu ccccucaguu      180

cauguacggc uccaaggccu acgugaagca ccccgccgac auccccgacu acuugaagcu      240

guccuucccc gagggcuuca agugggagcg cgugaugaac uucgaggacg gcggcguggu      300

gaccgugacc caggacuccu cccugcagga cggcgaguuc aucuacaagg ugaagcugcg      360

cggcaccaac uuccccuccg acggccccgu aaugcagaag aagaccaugg gcugggaggc      420

cuccuccgag cggauguacc ccgaggacgg cgcccugaag ggcgagauca agcagaggcu      480

gaagcugaag gacggcggcc acuacgacgc ugaggucaag accaccuaca aggccaagaa      540

gcccgugcag cugcccggcg ccuacaacgu caacaucaag uuggacauca ccucccacaa      600

cgaggacuac accaucgugg aacaguacga acgcgccgag ggccgccacu ccaccggcgg      660

cauggacgag cuguacaagu aguaa                                            685


<210>  22
<211>  2337
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Nt-uDys

<400>  22
augcuuuggu gggaagaagu agaggacugu uaugaaagag aagauguuca aaagaaaaca       60

uucacaaaau ggguaaaugc acaauuuucu aaguuuggga agcagcauau ugagaaccuc      120

uucagugacc uacaggaugg gaggcgccuc cuagaccucc ucgaaggccu gacagggcaa      180

aaacugccaa aagaaaaagg auccacaaga guucaugccc ugaacaaugu caacaaggca      240

cugcggguuu ugcagaacaa uaauguugau uuagugaaua uuggaaguac ugacaucgua      300

gauggaaauc auaaacugac ucuugguuug auuuggaaua uaauccucca cuggcagguc      360

aaaaauguaa ugaaaaauau cauggcugga uugcaacaaa ccaacaguga aaagauucuc      420

cugagcuggg uccgacaauc aacucguaau uauccacagg uuaauguaau caacuucacc      480

accagcuggu cugauggccu ggcuuugaau gcucucaucc auagucauag gccagaccua      540

uuugacugga auaguguggu uugccagcag ucagccacac aacgacugga acaugcauuc      600

aacaucgcca gauaucaauu aggcauagag aaacuacucg auccugaaga uguugauacc      660

accuauccag auaagaaguc caucuuaaug uacaucacau cacucuucca aguuuugccu      720

caacaaguga gcauugaagc cauccaggaa guggaaaugu ugccaaggcc accuaaagug      780

acuaaagaag aacauuuuca guuacaucau caaaugcacu auucucaaca gaucacgguc      840

agucuagcac agggauauga gagaacuucu uccccuaagc cucgauucaa gagcuaugcc      900

uacacacagg cugcuuaugu caccaccucu gacccuacac ggagcccauu uccuucacag      960

cauuuggaag cuccugaaga caagucauuu ggcaguucau ugauggagag ugaaguaaac     1020

cuggaccguu aucaaacagc uuuagaagaa guauuaucgu ggcuucuuuc ugcugaggac     1080

acauugcaag cacaaggaga gauuucuaau gauguggaag uggugaaaga ccaguuucau     1140

acucaugagg gguacaugau ggauuugaca gcccaucagg gccggguugg uaauauucua     1200

caauugggaa guaagcugau uggaacagga aaauuaucag aagaugaaga aacugaagua     1260

caagagcaga ugaaucuccu aaauucaaga ugggaaugcc ucaggguagc uagcauggaa     1320

aaacaaagca auuuacauag aguuuuaaug gaucuccaga aucagaaacu gaaagaguug     1380

aaugacuggc uaacaaaaac agaagaaaga acaaggaaaa uggaggaaga gccucuugga     1440

ccugaucuug aagaccuaaa acgccaagua caacaacaua aggugcuuca agaagaucua     1500

gaacaagaac aagucagggu caauucucuc acucacaugg uggugguagu ugaugaaucu     1560

aguggagauc acgcaacugc ugcuuuggaa gaacaacuua agguauuggg agaucgaugg     1620

gcaaacaucu guagauggac agaagaccgc uggguucuuu uacaagacau ccuucucaaa     1680

uggcaacguc uuacugaaga acagugccuu uuuagugcau ggcuuucaga aaaagaagau     1740

gcagugaaca agauucacac aacuggcuuu aaagaucaaa augaaauguu aucaagucuu     1800

caaaaacugg ccguuuuaaa agcggaucua gaaaagaaaa agcaauccau gggcaaacug     1860

uauucacuca aacaagaucu ucuuucaaca cugaagaaua agucagugac ccagaagacg     1920

gaagcauggc uggauaacuu ugcccggugu ugggauaauu uaguccaaaa acuugaaaag     1980

aguacagcac agauuucaca ggcugucacc accacucagc caucacuaac acagacaacu     2040

guaauggaaa caguaacuac ggugaccaca agggaacaga uccugguaaa gcaugcucaa     2100

gaggaacuuc caccaccacc uccccaaaag aagaggcaga uuacugugga ucuugaaaga     2160

cuccaggaac uucaagaggc cacggaugag cuggaccuca agcugcgcca agcugaggug     2220

aucaagggau ccuggcagcc cgugggcgau cuccucauug acucucucca agaucaccuc     2280

gagaaaguca aggcacuucg aggagaaauu gcgccucuga aagagaacgu gagccac        2337


<210>  23
<211>  1974
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Ct-uDys-GFP

<400>  23
gucaaugacc uugcucgcca gcuuaccacu uugggcauuc agcucucacc guauaaccuc       60

agcacucugg aagaccugaa caccagaugg aagcuucugc agguggccgu cgaggaccga      120

gucaggcagc ugcaugaagc ccacagggac uuugguccag caucucagca cuuucuuucc      180

acgucugucc agggucccug ggagagagcc aucucgccaa acaaagugcc cuacuauauc      240

aaccacgaga cucaaacaac uugcugggac caucccaaaa ugacagagcu cuaccagucu      300

uuagcugacc ugaauaaugu cagauucuca gcuuauagga cugccaugaa acuccgaaga      360

cugcagaagg cccuuugcuu ggaucucuug agccugucag cugcauguga ugccuuggac      420

cagcacaacc ucaagcaaaa ugaccagccc auggauaucc ugcagauuau uaauuguuug      480

accacuauuu augaccgccu ggagcaagag cacaacaauu uggucaacgu cccucucugc      540

guggauaugu gucugaacug gcugcugaau guuuaugaua cgggacgaac agggaggauc      600

cguguccugu cuuuuaaaac uggcaucauu ucccugugua aagcacauuu ggaagacaag      660

uacagauacc uuuucaagca aguggcaagu ucaacaggau uuugugacca gcgcaggcug      720

ggccuccuuc ugcaugauuc uauccaaauu ccaagacagu ugggugaagu ugcauccuuu      780

gggggcagua acauugagcc aaguguccgg agcugcuucc aauuugcuaa uaauaagcca      840

gagaucgaag cggcccucuu ccuagacugg augagacugg aaccccaguc cauggugugg      900

cugcccgucc ugcacagagu ggcugcugca gaaacugcca agcaucaggc caaauguaac      960

aucugcaaag aguguccaau cauuggauuc agguacagga gucuaaagca cuuuaauuau     1020

gacaucugcc aaagcugcuu uuuuucuggu cgaguugcaa aaggccauaa aaugcacuau     1080

cccauggugg aauauugcac uccgacuaca ucaggagaag auguucgaga cuuugccaag     1140

guacuaaaaa acaaauuucg aaccaaaagg uauuuugcga agcauccccg aaugggcuac     1200

cugccagugc agacugucuu agagggggac aacauggaaa cugacacaau ucuagaggug     1260

agcaagggcg aggagcuguu caccggggug gugcccaucc uggucgagcu ggacggcgac     1320

guaaacggcc acaaguucag cguguccggc gagggcgagg gcgaugccac cuacggcaag     1380

cugacccuga aguucaucug caccaccggc aagcugcccg ugcccuggcc cacccucgug     1440

accacccuga ccuacggcgu gcagugcuuc agccgcuacc ccgaccacau gaagcagcac     1500

gacuucuuca aguccgccau gcccgaaggc uacguccagg agcgcaccau cuucuucaag     1560

gacgacggca acuacaagac ccgcgccgag gugaaguucg agggcgacac ccuggugaac     1620

cgcaucgagc ugaagggcau cgacuucaag gaggacggca acauccuggg gcacaagcug     1680

gaguacaacu acaacagcca caacgucuau aucauggccg acaagcagaa gaacggcauc     1740

aaggugaacu ucaagauccg ccacaacauc gaggacggca gcgugcagcu cgccgaccac     1800

uaccagcaga acacccccau cggcgacggc cccgugcugc ugcccgacaa ccacuaccug     1860

agcacccagu ccgcccugag caaagacccc aacgagaagc gcgaucacau gguccugcug     1920

gaguucguga ccgccgccgg gaucacucuc ggcauggacg agcuguacaa guaa           1974


<210>  24
<211>  68
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  HDV68 catalytic mutant

<400>  24
ggccggcaug gucccagccu ccucgcuggc gccggcuggg caacaugcuu cggcauggug       60

aaugggac                                                                68


<210>  25
<211>  56
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Hammerhead with stem 3 overhangs specific to Nt-Luc

<400>  25
gagccuuacc ggauguguuu uccggucuga ugaguccggu agcggacgaa aggcuc           56


<210>  26
<211>  54
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister with 5 nt P1 stem for Ct-Luc

<400>  26
agccuuaaca cugccaaugc cggucccaag cccggauaaa aguggaggga ggcu             54


<210>  27
<211>  54
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister with 5 nt P1 stem for Ct-Luc and T6A mutation

<400>  27
agccuaaaca cugccaaugc cggucccaag cccggauaaa aguggaggga ggcu             54


<210>  28
<211>  54
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister mutant with 5 nt P1 stem for Ct-Luc

<400>  28
agccuuaacu cuuccaaugc cggucccaag cccggauaaa aguggaggga ggcu             54


<210>  29
<211>  54
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister with 5 nt P1 stem for Ct-Luc

<400>  29
agccuuaaca cugccaaugc cggucccaag cccggauaaa aguggaggga ggcu             54


<210>  30
<211>  51
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister with 2 nt P1 stem for Ct-Luc

<400>  30
agccuuaaca cugccaaugc cggucccaag cccggauaaa aguggaggga g                51


<210>  31
<211>  49
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister with 1 nt P1 stem for Ct-Luc

<400>  31
agccuuaaca cugccaaugc cggucccaag cccggauaaa aguggaggg                   49


<210>  32
<211>  49
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister with no P1 stem for Ct-Luc

<400>  32
agccuuaaca cugccaaugc cggucccaag cccggauaaa aguggaggg                   49


<210>  33
<211>  53
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  16HH stem 1 overhang specific to Ct-Luc

<400>  33
gaaucuugua auccugcuga ugaguccgug aggacgaaac gaguaagcuc guc              53


<210>  34
<211>  51
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  14HH stem 1 overhang specific to Ct-Luc

<400>  34
aucuuguaau ccugcugaug aguccgugag gacgaaacga guaagcucgu c                51


<210>  35
<211>  49
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  12HH stem 1 overhang specific to Ct-Luc

<400>  35
cuuguaaucc ugcugaugag uccgugagga cgaaacgagu aagcucguc                   49


<210>  36
<211>  45
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  8HH stem 1 overhang specific to Ct-Luc

<400>  36
uaauccugcu gaugaguccg ugaggacgaa acgaguaagc ucguc                       45


<210>  37
<211>  43
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  6HH stem 1 overhang specific to Ct-Luc

<400>  37
auccugcuga ugaguccgug aggacgaaac gaguaagcuc guc                         43


<210>  38
<211>  43
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  6HH Mutant stem 1 overhang specific to Ct-Lu

<400>  38
auccugcuga ugaguccgug aggacgagac gaguaagcuc guc                         43


<210>  39
<211>  41
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  4HH stem 1 overhang specific to Ct-Luc

<400>  39
ccugcugaug aguccgugag gacgaaacga guaagcucgu c                           41


<210>  40
<211>  55
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  RzB stem1 overhang specific to Ct-Luc

<400>  40
uuguaauaau ccugcugaug agucgcuggg augcgacgaa acgccuucgg gcguc            55


<210>  41
<211>  52
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Splice Donor sequence for Nt vector

<400>  41
guaaguauca agguuacaag acagguuuaa ggagaccaau agaaacuggg cu               52


<210>  42
<211>  81
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Splice Acceptor sequence for Ct vector

<400>  42
ugucgagaca gagaagacuc uugcguuucu gauaggcacc uauuggucuu acugacaucc       60

acuuugccuu ucucuccaca g                                                 81


<210>  43
<211>  560
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  GCN4 5' UTR uORFs

<400>  43
aaacaaaaac ucacaacaca gguuacucuc cccccuaaau ucaaauuuuu uuugcccauc       60

aguuucacua gcgaauuaua caacucacca gccacacagc ucacucaucu acuucgcaau      120

caaaacaaaa uauuuuauuu uaguucaguu uauuaaguua uuaucaguau cguauuaaaa      180

aauuaaagau cauugaaaaa uggcuugcua aaccgauuau auuuuguuuu uaaaguagau      240

uauuauuaga aaauuauuaa gagaauuaug uguuaaauuu auugaaagag aaaauuuauu      300

uucccuuauu aauuaaaguc cuuuacuuuu uuugaaaacu gucaguuuuu ugaagaguua      360

uuuguuuugu uaccaauugc uaucauguac ccguagaauu uuauucaaga uguuuccgua      420

acgguuaccu uucugucaaa uuauccaggu uuacucgcca auaaaaauuu cccuauacua      480

ucauuaauua aaucauuauu auuacuaaag uuuuguuuac caauuugucu gcucaagaaa      540

auaaauuaaa uacaaauaaa                                                  560


<210>  44
<211>  148
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  SRY 5' UTR uORFs

<400>  44
guugaggggg uguugagggc ggagaaaugc aaguuucauu acaaaaguua acguaacaaa       60

gaaucuggua gaaaugaguu uuggauagua aaauaaguuu cgaacucugg caccuuucaa      120

uuuugucgca cucuccuugu uuuugaca                                         148


<210>  45
<211>  343
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Hoxa9 TIE

<400>  45
gaaaaaacag aagagggaag gauaccagag cgguucauac agggcccaga aacuaggcga       60

ggugaccccu cagcaagaca aacaccucuu gauguugacu ggcgauuuuc cccaucucca      120

gucuggggag cgggacuagg cauacagaug auggagcuua gaacccgcug gcuagggaau      180

aaaauucgcu gggcaguuug ugcucaaaga agugggccag ggcgcuugug acacaaucag      240

ggcguuugug acacaaaccc uugaggguug gcaguucucu ccuuggcggu ugcucugguu      300

gcucuguggg gccuucccug uggagcaagg gugaucuggc cga                        343


<210>  46
<211>  170
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Hoxa3 TIE

<400>  46
aggacaauuc gucucuuggg cugccgaagc gacagcuguc agagaggcag aagcuucugg       60

gagccgcggu cugaaggcua cgugugcugc cuggucauuc aaagugucaa uuuuaggucc      120

agaagugucc aaaccacaag uucucaaaac ucugaaaaau ggcucccucc                 170


<210>  47
<211>  38
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  NRAS 5'UTR G-quadruplex

<400>  47
cgucccgugu gggaggggcg ggucugggug cggccugc                               38


<210>  48
<211>  126
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Human IFNG 5' UTR pseudoknot

<400>  48
cacauuguuc ugaucaucug aagaucagcu auuagaagag aaagaucagu uaaguccuuu       60

ggaccugauc agcuugauac aagaacuacu gauuucaacu ucuuuggcuu aauucucucg      120

gaaacg                                                                 126


<210>  49
<211>  132
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Rat ODC 5'UTR

<400>  49
ugucaguccc ugcagccgcc gccgccggcc gccuucaguc agcagcucgg cgccaccucc       60

ggucggcgac ugcggcgggc ucgacgaggc ggcugacggg gcggcggcgg gaagacggcc      120

gggugcgccu ug                                                          132


<210>  50
<211>  49
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  SIRLOIN RNA Nuclear Localization Signal

<400>  50
cgccucccgg guucaagcga uucuccugcc ucagccuccc gaguagcug                   49


<210>  51
<211>  42
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  BORG lncRNA NLS

<400>  51
accucagaau cuacaaguca gccccaauua aauguuguuu ua                          42


<210>  52
<211>  108
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  FKBP DD

<400>  52

Met Gly Val Gln Val Glu Thr Ile Ser Pro Gly Asp Gly Arg Thr Phe 
1               5                   10                  15      


Pro Lys Arg Gly Gln Thr Cys Val Val His Tyr Thr Gly Met Leu Glu 
            20                  25                  30          


Asp Gly Lys Lys Val Asp Ser Ser Arg Asp Arg Asn Lys Pro Phe Lys 
        35                  40                  45              


Phe Met Leu Gly Lys Gln Glu Val Ile Arg Gly Trp Glu Glu Gly Val 
    50                  55                  60                  


Ala Gln Met Ser Val Gly Gln Arg Ala Lys Leu Thr Ile Ser Pro Asp 
65                  70                  75                  80  


Tyr Ala Tyr Gly Ala Thr Gly His Pro Gly Ile Ile Pro Pro His Ala 
                85                  90                  95      


Thr Leu Val Phe Asp Val Glu Leu Leu Lys Pro Glu 
            100                 105             


<210>  53
<211>  40
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  PEST (enhanced ODC PEST)

<400>  53

Ser His Gly Phe Pro Pro Glu Val Glu Glu Gln Ala Ala Gly Thr Leu 
1               5                   10                  15      


Pro Met Ser Cys Ala Gln Glu Ser Gly Met Asp Arg His Pro Ala Ala 
            20                  25                  30          


Cys Ala Ser Ala Arg Ile Asn Val 
        35                  40  


<210>  54
<211>  40
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ODC PEST (yeast)

<400>  54

Ser His Gly Phe Pro Pro Glu Val Glu Glu Gln Asp Asp Gly Thr Leu 
1               5                   10                  15      


Pro Met Ser Cys Ala Gln Glu Ser Gly Met Asp Arg His Pro Ala Ala 
            20                  25                  30          


Cys Ala Ser Ala Arg Ile Asn Val 
        35                  40  


<210>  55
<211>  40
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ODC PEST (human)

<400>  55

Asn Pro Asp Phe Pro Pro Glu Val Glu Glu Gln Asp Ala Ser Thr Leu 
1               5                   10                  15      


Pro Val Ser Cys Ala Trp Glu Ser Gly Met Lys Arg His Arg Ala Ala 
            20                  25                  30          


Cys Ala Ser Ala Ser Ile Asn Val 
        35                  40  


<210>  56
<211>  57
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  CL1

<400>  56

Ala Cys Lys Asn Trp Phe Ser Ser Leu Ser His Phe Val Ile His Leu 
1               5                   10                  15      


Asn Ser His Gly Phe Pro Pro Glu Val Glu Glu Gln Ala Ala Gly Thr 
            20                  25                  30          


Leu Pro Met Ser Cys Ala Gln Glu Ser Gly Met Asp Arg His Pro Ala 
        35                  40                  45              


Ala Cys Ala Ser Ala Arg Ile Asn Val 
    50                  55          


<210>  57
<211>  57
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  CL1-PEST

<400>  57

Ala Cys Lys Asn Trp Phe Ser Ser Leu Ser His Phe Val Ile His Leu 
1               5                   10                  15      


Asn Ser His Gly Phe Pro Pro Glu Val Glu Glu Gln Ala Ala Gly Thr 
            20                  25                  30          


Leu Pro Met Ser Cys Ala Gln Glu Ser Gly Met Asp Arg His Pro Ala 
        35                  40                  45              


Ala Cys Ala Ser Ala Arg Ile Asn Val 
    50                  55          


<210>  58
<211>  68
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  E1A PEST

<400>  58

Ser Arg Glu Cys Asn Ser Ser Thr Asp Ser Cys Asp Ser Gly Pro Ser 
1               5                   10                  15      


Asn Thr Pro Pro Glu Ile His Pro Val Val Pro Leu Cys Pro Ile Lys 
            20                  25                  30          


Pro Val Ala Val Arg Val Gly Gly Arg Arg Gln Ala Val Glu Cys Ile 
        35                  40                  45              


Glu Asp Leu Leu Asn Glu Pro Gly Gln Pro Leu Asp Leu Ser Cys Lys 
    50                  55                  60                  


Arg Pro Arg Pro 
65              


<210>  59
<211>  31
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  C-myc PEST

<400>  59

Leu His Glu Glu Thr Pro Pro Thr Thr Ser Ser Asp Ser Glu Glu Glu 
1               5                   10                  15      


Gln Glu Asp Glu Glu Glu Ile Asp Val Val Ser Val Glu Lys Arg 
            20                  25                  30      


<210>  60
<211>  25
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  c-Fos PEST

<400>  60

Ala Ala His Arg Lys Gly Ser Ser Ser Asn Glu Pro Ser Ser Asp Ser 
1               5                   10                  15      


Leu Ser Ser Pro Thr Leu Leu Ala Leu 
            20                  25  


<210>  61
<211>  26
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  v-Myb PEST

<400>  61

Pro Ser Pro Pro Val Asp His Gly Cys Leu Pro Glu Glu Ser Ala Ser 
1               5                   10                  15      


Pro Ala Arg Cys Met Ile Val His Gln Ser 
            20                  25      


<210>  62
<211>  59
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NPDC1 PEST

<400>  62

Pro Pro Lys Glu Leu Asp Thr Ala Ser Ser Asp Glu Glu Asn Glu Asp 
1               5                   10                  15      


Gly Asp Phe Thr Val Tyr Glu Cys Pro Gly Leu Ala Pro Thr Gly Glu 
            20                  25                  30          


Met Glu Val Arg Asn Pro Leu Phe Asp His Ala Ala Leu Ser Ala Pro 
        35                  40                  45              


Leu Pro Ala Pro Ser Ser Pro Pro Ala Leu Pro 
    50                  55                  


<210>  63
<211>  37
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IkBa PEST

<400>  63

Pro Glu Ser Glu Asp Glu Glu Ser Tyr Asp Thr Glu Ser Glu Phe Thr 
1               5                   10                  15      


Glu Phe Thr Glu Asp Glu Leu Pro Tyr Asp Asp Cys Val Phe Gly Gly 
            20                  25                  30          


Gln Arg Leu Thr Leu 
        35          


<210>  64
<211>  41
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  m.m. AZIN2 PEST

<400>  64

Gly Gln Leu Leu Pro Ala Glu Glu Asp Gln Asp Ala Glu Gly Val Cys 
1               5                   10                  15      


Lys Pro Leu Ser Cys Gly Trp Glu Ile Thr Asp Thr Leu Cys Val Gly 
            20                  25                  30          


Pro Val Phe Thr Pro Ala Ser Ile Met 
        35                  40      


<210>  65
<211>  43
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  x.l. AZIN2 PEST

<400>  65

Val Gln Leu Leu Gln Arg Gly Leu Gln Gln Thr Glu Glu Lys Glu Asn 
1               5                   10                  15      


Val Cys Thr Pro Met Ser Cys Gly Trp Glu Ile Ser Asp Ser Leu Cys 
            20                  25                  30          


Phe Thr Arg Thr Phe Ala Ala Thr Ser Ile Ile 
        35                  40              


<210>  66
<211>  12
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NS1

<400>  66

Thr Ser Leu Tyr Lys Lys Val Gly Met Gly Arg Lys 
1               5                   10          


<210>  67
<211>  12
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NS6

<400>  67

Ser Leu Tyr Lys Lys Val Gly Thr Met Ala Ala Gly 
1               5                   10          


<210>  68
<211>  12
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NS7

<400>  68

Tyr Lys Lys Val Gly Thr Met Arg Gly Arg Gly Leu 
1               5                   10          


<210>  69
<211>  12
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NS12

<400>  69

Glu Arg Ala Pro Thr Gly Arg Trp Gly Arg Arg Gly 
1               5                   10          


<210>  70
<211>  12
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NS15

<400>  70

Glu Gly Pro Leu Trp His Pro Arg Ile Cys Gly Ser 
1               5                   10          


<210>  71
<211>  12
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SELK

<400>  71

Leu Arg Gly Pro Ser Pro Pro Pro Met Ala Gly Gly 
1               5                   10          


<210>  72
<211>  12
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SELS

<400>  72

Trp Arg Pro Gly Arg Arg Gly Pro Ser Ser Gly Gly 
1               5                   10          


<210>  73
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  EMID1

<400>  73

Arg Asp Glu Arg Gly 
1               5   


<210>  74
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IRX6

<400>  74

Gly Ala Glu Ala Gly 
1               5   


<210>  75
<211>  80
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  UbVR

<400>  75

Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu Val 
1               5                   10                  15      


Glu Pro Ser Asp Thr Ile Glu Asn Val Lys Ala Lys Ile Gln Asp Lys 
            20                  25                  30          


Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln 
        35                  40                  45              


Leu Glu Asp Gly Arg Thr Leu Ser Asp Tyr Asn Ile Gln Lys Glu Ser 
    50                  55                  60                  


Thr Leu His Leu Val Leu Arg Leu Arg Gly Val Arg Ala Ser Ala Ser 
65                  70                  75                  80  


<210>  76
<211>  162
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  2xUbVR

<400>  76

Thr Ser Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu 
1               5                   10                  15      


Glu Val Glu Pro Ser Asp Thr Ile Glu Asn Val Lys Ala Lys Ile Gln 
            20                  25                  30          


Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly 
        35                  40                  45              


Lys Gln Leu Glu Asp Gly Arg Thr Leu Ser Asp Tyr Asn Ile Gln Lys 
    50                  55                  60                  


Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Val Arg Ala Ser 
65                  70                  75                  80  


Ala Ser Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu 
                85                  90                  95      


Glu Val Glu Pro Ser Asp Thr Ile Glu Asn Val Lys Ala Lys Ile Gln 
            100                 105                 110         


Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly 
        115                 120                 125             


Lys Gln Leu Glu Asp Gly Arg Thr Leu Ser Asp Tyr Asn Ile Gln Lys 
    130                 135                 140                 


Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Val Arg Ala Ser 
145                 150                 155                 160 


Ala Ser 
        


<210>  77
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  12x poly K encoding tail sequence

<400>  77
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaataa                              39


<210>  78
<211>  12
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Translation Product 12x poly K

<400>  78

Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys 
1               5                   10          


<210>  79
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  16x poly K encoding tail sequence

<400>  79
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaata a                51


<210>  80
<211>  16
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Translation Product 16x poly K

<400>  80

Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys 
1               5                   10                  15      


<210>  81
<211>  505
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Human RtcB protein sequence

<400>  81

Met Ser Arg Ser Tyr Asn Asp Glu Leu Gln Phe Leu Glu Lys Ile Asn 
1               5                   10                  15      


Lys Asn Cys Trp Arg Ile Lys Lys Gly Phe Val Pro Asn Met Gln Val 
            20                  25                  30          


Glu Gly Val Phe Tyr Val Asn Asp Ala Leu Glu Lys Leu Met Phe Glu 
        35                  40                  45              


Glu Leu Arg Asn Ala Cys Arg Gly Gly Gly Val Gly Gly Phe Leu Pro 
    50                  55                  60                  


Ala Met Lys Gln Ile Gly Asn Val Ala Ala Leu Pro Gly Ile Val His 
65                  70                  75                  80  


Arg Ser Ile Gly Leu Pro Asp Val His Ser Gly Tyr Gly Phe Ala Ile 
                85                  90                  95      


Gly Asn Met Ala Ala Phe Asp Met Asn Asp Pro Glu Ala Val Val Ser 
            100                 105                 110         


Pro Gly Gly Val Gly Phe Asp Ile Asn Cys Gly Val Arg Leu Leu Arg 
        115                 120                 125             


Thr Asn Leu Asp Glu Ser Asp Val Gln Pro Val Lys Glu Gln Leu Ala 
    130                 135                 140                 


Gln Ala Met Phe Asp His Ile Pro Val Gly Val Gly Ser Lys Gly Val 
145                 150                 155                 160 


Ile Pro Met Asn Ala Lys Asp Leu Glu Glu Ala Leu Glu Met Gly Val 
                165                 170                 175     


Asp Trp Ser Leu Arg Glu Gly Tyr Ala Trp Ala Glu Asp Lys Glu His 
            180                 185                 190         


Cys Glu Glu Tyr Gly Arg Met Leu Gln Ala Asp Pro Asn Lys Val Ser 
        195                 200                 205             


Ala Arg Ala Lys Lys Arg Gly Leu Pro Gln Leu Gly Thr Leu Gly Ala 
    210                 215                 220                 


Gly Asn His Tyr Ala Glu Ile Gln Val Val Asp Glu Ile Phe Asn Glu 
225                 230                 235                 240 


Tyr Ala Ala Lys Lys Met Gly Ile Asp His Lys Gly Gln Val Cys Val 
                245                 250                 255     


Met Ile His Ser Gly Ser Arg Gly Leu Gly His Gln Val Ala Thr Asp 
            260                 265                 270         


Ala Leu Val Ala Met Glu Lys Ala Met Lys Arg Asp Lys Ile Ile Val 
        275                 280                 285             


Asn Asp Arg Gln Leu Ala Cys Ala Arg Ile Ala Ser Pro Glu Gly Gln 
    290                 295                 300                 


Asp Tyr Leu Lys Gly Met Ala Ala Ala Gly Asn Tyr Ala Trp Val Asn 
305                 310                 315                 320 


Arg Ser Ser Met Thr Phe Leu Thr Arg Gln Ala Phe Ala Lys Val Phe 
                325                 330                 335     


Asn Thr Thr Pro Asp Asp Leu Asp Leu His Val Ile Tyr Asp Val Ser 
            340                 345                 350         


His Asn Ile Ala Lys Val Glu Gln His Val Val Asp Gly Lys Glu Arg 
        355                 360                 365             


Thr Leu Leu Val His Arg Lys Gly Ser Thr Arg Ala Phe Pro Pro His 
    370                 375                 380                 


His Pro Leu Ile Ala Val Asp Tyr Gln Leu Thr Gly Gln Pro Val Leu 
385                 390                 395                 400 


Ile Gly Gly Thr Met Gly Thr Cys Ser Tyr Val Leu Thr Gly Thr Glu 
                405                 410                 415     


Gln Gly Met Thr Glu Thr Phe Gly Thr Thr Cys His Gly Ala Gly Arg 
            420                 425                 430         


Ala Leu Ser Arg Ala Lys Ser Arg Arg Asn Leu Asp Phe Gln Asp Val 
        435                 440                 445             


Leu Asp Lys Leu Ala Asp Met Gly Ile Ala Ile Arg Val Ala Ser Pro 
    450                 455                 460                 


Lys Leu Val Met Glu Glu Ala Pro Glu Ser Tyr Lys Asn Val Thr Asp 
465                 470                 475                 480 


Val Val Asn Thr Cys His Asp Ala Gly Ile Ser Lys Lys Ala Ile Lys 
                485                 490                 495     


Leu Arg Pro Ile Ala Val Ile Lys Gly 
            500                 505 


<210>  82
<211>  1518
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Human RtcB human codon optimized nucleic acid sequence

<400>  82
atgtcccggt catataatga cgagctgcaa ttccttgaga agataaataa gaattgctgg       60

cgcatcaaga aaggcttcgt tcctaatatg caagttgaag gtgtatttta tgtaaatgac      120

gctttggaaa agttgatgtt cgaggaactg aggaacgcat gtcgcggtgg aggtgtcggg      180

ggttttcttc ccgctatgaa gcagattggc aatgtggcgg ctctgcccgg aattgtgcac      240

cgctctatag gattgcctga cgtacacagc ggctacggat tcgccattgg gaatatggcg      300

gcgttcgata tgaacgaccc tgaggcggtt gttagccctg gaggtgtcgg cttcgatata      360

aattgcggag tcagattgct tcggacaaat ttggatgaat ctgacgtaca accagtgaaa      420

gagcaacttg cacaagcgat gttcgatcat attcccgtgg gtgtggggtc aaagggagta      480

atcccaatga acgcgaaaga cctggaagaa gcattggaga tgggtgtaga ctggtcactg      540

cgagaaggtt atgcctgggc tgaagacaaa gagcactgcg aggagtacgg tcgcatgttg      600

caagcagacc caaataaagt atccgcgagg gccaagaaaa gaggtttgcc gcagctgggg      660

acattggggg ccggtaacca ctatgcagaa atacaagtag tggatgagat tttcaatgag      720

tacgctgcga agaaaatggg gatcgaccat aaaggtcaag tgtgcgtaat gatacattct      780

gggagtcgcg gactcgggca ccaagttgca acggacgccc ttgtcgccat ggaaaaagcg      840

atgaagcggg ataaaatcat cgtaaatgat aggcaattgg cttgcgctcg cattgcgagt      900

ccggaagggc aagactactt gaaagggatg gctgctgccg ggaattatgc atgggtcaac      960

cggagcagta tgacattctt gacgcggcag gcttttgcaa aagtgtttaa tacgactccg     1020

gacgacctcg atctccatgt tatatatgat gtatcacaca atatcgcaaa ggttgagcaa     1080

cacgttgtgg atggtaagga aaggactctg ctggtacacc ggaaaggcag tacacgggca     1140

ttcccgcctc atcacccatt gatcgcagtc gattatcaat tgacaggtca gccagttctg     1200

atcggaggaa caatgggcac atgtagctac gtattgaccg ggactgaaca ggggatgacc     1260

gaaacttttg gcacaacatg ccatggcgcg gggagggcac tctcccgagc taaaagtagg     1320

aggaatcttg acttccagga tgtactggat aagctggccg atatggggat agccatccgg     1380

gtagcgtcac ccaaattggt aatggaggaa gctcctgaaa gctataaaaa tgtcactgac     1440

gttgtcaaca catgccatga cgcgggtata tccaagaaag ctattaagct gcgcccaata     1500

gctgtaatta aaggatag                                                   1518


<210>  83
<211>  408
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  E. Coli RtcB protein sequence

<400>  83

Met Asn Tyr Glu Leu Leu Thr Thr Glu Asn Ala Pro Val Lys Met Trp 
1               5                   10                  15      


Thr Lys Gly Val Pro Val Glu Ala Asp Ala Arg Gln Gln Leu Ile Asn 
            20                  25                  30          


Thr Ala Lys Met Pro Phe Ile Phe Lys His Ile Ala Val Met Pro Asp 
        35                  40                  45              


Val His Leu Gly Lys Gly Ser Thr Ile Gly Ser Val Ile Pro Thr Lys 
    50                  55                  60                  


Gly Ala Ile Ile Pro Ala Ala Val Gly Val Asp Ile Gly Cys Gly Met 
65                  70                  75                  80  


Asn Ala Leu Arg Thr Ala Leu Thr Ala Glu Asp Leu Pro Glu Asn Leu 
                85                  90                  95      


Ala Glu Leu Arg Gln Ala Ile Glu Thr Ala Val Pro His Gly Arg Thr 
            100                 105                 110         


Thr Gly Arg Cys Lys Arg Asp Lys Gly Ala Trp Glu Asn Pro Pro Val 
        115                 120                 125             


Asn Val Asp Ala Lys Trp Ala Glu Leu Glu Ala Gly Tyr Gln Trp Leu 
    130                 135                 140                 


Thr Gln Lys Tyr Pro Arg Phe Leu Asn Thr Asn Asn Tyr Lys His Leu 
145                 150                 155                 160 


Gly Thr Leu Gly Thr Gly Asn His Phe Ile Glu Ile Cys Leu Asp Glu 
                165                 170                 175     


Ser Asp Gln Val Trp Ile Met Leu His Ser Gly Ser Arg Gly Ile Gly 
            180                 185                 190         


Asn Ala Ile Gly Thr Tyr Phe Ile Asp Leu Ala Gln Lys Glu Met Gln 
        195                 200                 205             


Glu Thr Leu Glu Thr Leu Pro Ser Arg Asp Leu Ala Tyr Phe Met Glu 
    210                 215                 220                 


Gly Thr Glu Tyr Phe Asp Asp Tyr Leu Lys Ala Val Ala Trp Ala Gln 
225                 230                 235                 240 


Leu Phe Ala Ser Leu Asn Arg Asp Ala Met Met Glu Asn Val Val Thr 
                245                 250                 255     


Ala Leu Gln Ser Ile Thr Gln Lys Thr Val Arg Gln Pro Gln Thr Leu 
            260                 265                 270         


Ala Met Glu Glu Ile Asn Cys His His Asn Tyr Val Gln Lys Glu Gln 
        275                 280                 285             


His Phe Gly Glu Glu Ile Tyr Val Thr Arg Lys Gly Ala Val Ser Ala 
    290                 295                 300                 


Arg Ala Gly Gln Tyr Gly Ile Ile Pro Gly Ser Met Gly Ala Lys Ser 
305                 310                 315                 320 


Phe Ile Val Arg Gly Leu Gly Asn Glu Glu Ser Phe Cys Ser Cys Ser 
                325                 330                 335     


His Gly Ala Gly Arg Val Met Ser Arg Thr Lys Ala Lys Lys Leu Phe 
            340                 345                 350         


Ser Val Glu Asp Gln Ile Arg Ala Thr Ala His Val Glu Cys Arg Lys 
        355                 360                 365             


Asp Ala Glu Val Ile Asp Glu Ile Pro Met Ala Tyr Lys Asp Ile Asp 
    370                 375                 380                 


Ala Val Met Ala Ala Gln Ser Asp Leu Val Glu Val Ile Tyr Thr Leu 
385                 390                 395                 400 


Arg Gln Val Val Cys Val Lys Gly 
                405             


<210>  84
<211>  1227
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  E. Coli RtcB human codon optimized nucleic acid sequence

<400>  84
atgaattacg agcttcttac cactgagaat gcacctgtga aaatgtggac taagggagtg       60

cccgtggaag cggacgcaag gcagcagctc ataaatacag ctaagatgcc tttcatcttc      120

aaacacatcg cggttatgcc cgacgtgcac ctcggaaaag gctctactat tggaagtgtg      180

attccgacaa agggtgcgat catacctgct gccgtcgggg tggacatagg ctgtggaatg      240

aatgccctgc gaacggctct taccgcagaa gatcttcctg agaatctggc cgagctgcga      300

caggccattg aaacagcggt tccgcatggt cggactaccg gacggtgcaa aagggacaaa      360

ggtgcgtggg aaaaccctcc cgttaacgtg gatgcgaaat gggctgagtt ggaagcaggc      420

tatcaatggc ttacccagaa atatccacgg ttcttgaaca ctaataacta caaacacctg      480

gggaccttgg ggacggggaa tcatttcatc gaaatctgtc ttgatgagtc tgaccaagtg      540

tggattatgc ttcatagcgg tagccgcggc attggtaacg caattgggac atattttatt      600

gacctcgcgc agaaagagat gcaggaaacg cttgagacgc tgccgtcccg agatcttgcg      660

tattttatgg aagggacgga atactttgac gattatctga aggcggtagc atgggctcaa      720

ctgtttgcta gtctcaaccg agacgcgatg atggaaaatg tggtaacagc acttcaatca      780

atcacccaaa agacagtgcg acagccccaa actctcgcta tggaagaaat caattgccac      840

cacaattacg ttcagaaaga gcaacatttc ggagaagaaa tttacgtgac aagaaaagga      900

gctgttagcg cgagggccgg acagtacggc atcattcctg ggtcaatggg tgcgaaatct      960

tttatagtac gcgggcttgg taatgaagaa tccttctgca gctgttctca tggagccgga     1020

agggtaatgt ccaggactaa ggccaagaaa ctcttctctg tggaagatca aattagagct     1080

acagcacatg ttgaatgtag aaaggatgcc gaagtcatag acgagatccc tatggcttac     1140

aaagatatag atgctgtaat ggctgcacag tcagacctcg tagaggttat ctacacactc     1200

cggcaagtcg tatgcgtaaa aggatag                                         1227


<210>  85
<211>  470
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Deinococcus radiodurans RtcB protein sequence

<400>  85

Met Asn Gly Lys His Ile Thr Lys Leu Gly Phe Glu Gly Lys Ala Val 
1               5                   10                  15      


Gly Leu Ala Leu Ser Ala Ala Gly Leu Arg Glu Asp Ala Gly Val Ser 
            20                  25                  30          


Arg Gly Asp Ile Leu Asp Glu Leu Arg Ser Val Gln Asn Tyr Pro Glu 
        35                  40                  45              


Gln Tyr Gln Gly Gly Gly Val Tyr Ala Asp Leu Ala Thr His Leu Ile 
    50                  55                  60                  


Glu Gln Gln Ala Ala Gln Gln Thr Arg Gln Ser Ala Lys Leu Arg Ala 
65                  70                  75                  80  


Ala Pro Leu Pro Tyr Arg Thr Trp Gly Glu Asp Leu Ile Glu Pro Gly 
                85                  90                  95      


Ala His Arg Gln Met Asp Val Ala Met Gln Leu Pro Ile Ser Arg Ala 
            100                 105                 110         


Gly Ala Leu Met Pro Asp Ala His Val Gly Tyr Gly Leu Pro Ile Gly 
        115                 120                 125             


Gly Val Leu Ala Thr Glu Asn Ala Val Ile Pro Tyr Gly Val Gly Val 
    130                 135                 140                 


Asp Ile Gly Cys Ser Met Met Leu Ser Val Phe Pro Val Ala Ala Thr 
145                 150                 155                 160 


Gly Leu Ser Val Asp Glu Ala Arg Ser Leu Leu Leu Lys His Thr Arg 
                165                 170                 175     


Phe Gly Ala Gly Val Gly Phe Glu Lys Arg Asp Arg Leu Asp His Pro 
            180                 185                 190         


Val Leu Ala Glu Ala Thr Trp Asp Glu Gln Pro Leu Leu Arg His Leu 
        195                 200                 205             


Phe Asp Lys Ala Ala Gly Gln Ile Gly Ser Ser Gly Ser Gly Asn His 
    210                 215                 220                 


Phe Val Glu Phe Gly Thr Phe Thr Leu Ala Gln Ala Asp Pro Gln Leu 
225                 230                 235                 240 


Glu Gly Leu Asp Pro Gly Glu Tyr Leu Ala Val Leu Ser His Ser Gly 
                245                 250                 255     


Ser Arg Gly Phe Gly Ala Gln Val Ala Gly His Phe Thr Asn Leu Ala 
            260                 265                 270         


Gln Arg Leu Trp Pro Ala Leu Asp Lys Glu Ala Gln Lys Leu Ala Trp 
        275                 280                 285             


Leu Pro Leu Asp Ser Glu Ala Gly Gln Ala Tyr Trp Gln Ala Met Asn 
    290                 295                 300                 


Leu Ala Gly Arg Tyr Ala Leu Ala Asn His Glu Gln Ile His Ala Arg 
305                 310                 315                 320 


Leu Ala Arg Ala Leu Gly Glu Lys Pro Leu Leu Arg Ala Gln Asn Ser 
                325                 330                 335     


His Asn Leu Ala Trp Lys Gln Gln Val Asn Gly Gln Glu Leu Ile Val 
            340                 345                 350         


His Arg Lys Gly Ala Thr Pro Ala Glu Ala Gly Gln Leu Gly Leu Ile 
        355                 360                 365             


Pro Gly Ser Met Ala Asp Pro Gly Tyr Leu Val Arg Gly Arg Gly Asn 
    370                 375                 380                 


Pro Glu Ala Leu Ala Ser Ala Ser His Gly Ala Gly Arg Gln Leu Gly 
385                 390                 395                 400 


Arg Lys Ala Ala Glu Arg Ser Leu Ala Lys Lys Asp Val Gln Ala Tyr 
                405                 410                 415     


Leu Lys Asp Arg Gly Val Thr Leu Ile Gly Gly Gly Ile Asp Glu Ala 
            420                 425                 430         


Pro Gln Ala Tyr Lys Arg Ile Glu Asp Val Ile Ala Arg Gln Arg Asp 
        435                 440                 445             


Leu Val Asp Val Leu Gly Glu Phe Arg Pro Arg Val Val Arg Met Asp 
    450                 455                 460                 


Thr Gly Ser Glu Asp Val 
465                 470 


<210>  86
<211>  1413
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Deinococcus radiodurans RtcB human codon optimized nucleic acid 
       sequence

<400>  86
atgaacggaa agcacatcac gaagttgggt ttcgaaggga aggctgttgg cctggcattg       60

tctgcggctg gtctcaggga agacgcaggc gtttcccgag gagatattct cgatgaactt      120

aggtctgtcc agaattatcc ggagcaatat caagggggag gggtctatgc cgacttggcg      180

acacacctta ttgagcaaca agctgctcag cagactaggc aatccgccaa gctgcgagca      240

gcaccacttc cgtaccgaac gtggggtgaa gacctgatcg agccaggcgc acacagacag      300

atggatgtag caatgcagct cccgatctcc cgggcgggag cgctgatgcc agatgcccac      360

gtaggatacg gacttcccat tggaggcgtg ctcgctaccg aaaacgccgt aatcccctat      420

ggagtgggcg ttgacatcgg ttgctcaatg atgttgagtg ttttcccggt ggctgcaaca      480

ggtctgtcag tggatgaggc gcggtcactg cttctcaaac acacgcgctt cggtgcgggg      540

gtcggattcg agaaacgcga caggctcgac catcctgtct tggcggaggc tacgtgggac      600

gagcagcctt tgctgagaca cttgtttgat aaagctgctg gccagattgg gtcttccgga      660

tcagggaacc acttcgtcga atttggaact ttcaccctcg cacaggccga tccgcagttg      720

gaaggtttgg accctgggga atacttggct gttctttcac actcagggag tagaggattt      780

ggagcccagg tggctgggca ttttaccaac ttggcgcagc gcttgtggcc cgcacttgat      840

aaggaagctc aaaaactcgc atggctgcca ctggattctg aggctgggca agcctactgg      900

caagccatga acttggcggg acgatatgcg ttggctaacc atgagcaaat tcacgcccga      960

ctggcccgcg cacttggtga gaagcctctt ctgcgcgccc agaactccca caatctggcc     1020

tggaaacagc aggtgaatgg gcaggaattg atagtccacc gcaaaggggc tactcctgcg     1080

gaagccgggc aacttggtct catccctggc tccatggccg acccgggata tttggtcagg     1140

ggaaggggaa atccggaagc attggcctct gcgtcacacg gagcaggtag acagctcggc     1200

cggaaggcag cggaaaggtc cctggcgaag aaagatgtgc aggcttacct taaagataga     1260

ggagtaaccc ttatcggggg cgggattgac gaggctcccc aggcgtataa aaggatcgaa     1320

gacgtcatag cacgccagcg ggaccttgtg gatgtgttgg gagaatttag gccacgagta     1380

gtgcggatgg atacagggtc tgaagatgtt tag                                  1413


<210>  87
<211>  481
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Pyrococcus horikoshii RtcB protein sequence

<400>  87

Met Val Val Pro Leu Lys Arg Ile Asp Lys Ile Arg Trp Glu Ile Pro 
1               5                   10                  15      


Lys Phe Asp Lys Arg Met Arg Val Pro Gly Arg Val Tyr Ala Asp Glu 
            20                  25                  30          


Val Leu Leu Glu Lys Met Lys Asn Asp Arg Thr Leu Glu Gln Ala Thr 
        35                  40                  45              


Asn Val Ala Met Leu Pro Gly Ile Tyr Lys Tyr Ser Ile Val Met Pro 
    50                  55                  60                  


Asp Gly His Gln Gly Tyr Gly Phe Pro Ile Gly Gly Val Ala Ala Phe 
65                  70                  75                  80  


Asp Val Lys Glu Gly Val Ile Ser Pro Gly Gly Ile Gly Tyr Asp Ile 
                85                  90                  95      


Asn Cys Gly Val Arg Leu Ile Arg Thr Asn Leu Thr Glu Lys Glu Val 
            100                 105                 110         


Arg Pro Arg Ile Lys Gln Leu Val Asp Thr Leu Phe Lys Asn Val Pro 
        115                 120                 125             


Ser Gly Val Gly Ser Gln Gly Arg Ile Lys Leu His Trp Thr Gln Ile 
    130                 135                 140                 


Asp Asp Val Leu Val Asp Gly Ala Lys Trp Ala Val Asp Asn Gly Tyr 
145                 150                 155                 160 


Gly Trp Glu Arg Asp Leu Glu Arg Leu Glu Glu Gly Gly Arg Met Glu 
                165                 170                 175     


Gly Ala Asp Pro Glu Ala Val Ser Gln Arg Ala Lys Gln Arg Gly Ala 
            180                 185                 190         


Pro Gln Leu Gly Ser Leu Gly Ser Gly Asn His Phe Leu Glu Val Gln 
        195                 200                 205             


Val Val Asp Lys Ile Phe Asp Pro Glu Val Ala Lys Ala Tyr Gly Leu 
    210                 215                 220                 


Phe Glu Gly Gln Val Val Val Met Val His Thr Gly Ser Arg Gly Leu 
225                 230                 235                 240 


Gly His Gln Val Ala Ser Asp Tyr Leu Arg Ile Met Glu Arg Ala Ile 
                245                 250                 255     


Arg Lys Tyr Arg Ile Pro Trp Pro Asp Arg Glu Leu Val Ser Val Pro 
            260                 265                 270         


Phe Gln Ser Glu Glu Gly Gln Arg Tyr Phe Ser Ala Met Lys Ala Ala 
        275                 280                 285             


Ala Asn Phe Ala Trp Ala Asn Arg Gln Met Ile Thr His Trp Val Arg 
    290                 295                 300                 


Glu Ser Phe Gln Glu Val Phe Lys Gln Asp Pro Glu Gly Asp Leu Gly 
305                 310                 315                 320 


Met Asp Ile Val Tyr Asp Val Ala His Asn Ile Gly Lys Val Glu Glu 
                325                 330                 335     


His Glu Val Asp Gly Lys Arg Val Lys Val Ile Val His Arg Lys Gly 
            340                 345                 350         


Ala Thr Arg Ala Phe Pro Pro Gly His Glu Ala Val Pro Arg Leu Tyr 
        355                 360                 365             


Arg Asp Val Gly Gln Pro Val Leu Ile Pro Gly Ser Met Gly Thr Ala 
    370                 375                 380                 


Ser Tyr Ile Leu Ala Gly Thr Glu Gly Ala Met Lys Glu Thr Phe Gly 
385                 390                 395                 400 


Ser Thr Cys His Gly Ala Gly Arg Val Leu Ser Arg Lys Ala Ala Thr 
                405                 410                 415     


Arg Gln Tyr Arg Gly Asp Arg Ile Arg Gln Glu Leu Leu Asn Arg Gly 
            420                 425                 430         


Ile Tyr Val Arg Ala Ala Ser Met Arg Val Val Ala Glu Glu Ala Pro 
        435                 440                 445             


Gly Ala Tyr Lys Asn Val Asp Asn Val Val Lys Val Val Ser Glu Ala 
    450                 455                 460                 


Gly Ile Ala Lys Leu Val Ala Arg Met Arg Pro Ile Gly Val Ala Lys 
465                 470                 475                 480 


Gly 
    


<210>  88
<211>  1446
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Pyrococcus horikoshii RtcB human codon optimized nucleic acid 
       sequence

<400>  88
atggtggttc ccctgaagag aatagataaa attcgctggg agatccctaa gttcgacaaa       60

aggatgagag taccaggacg ggtgtatgca gatgaggtct tgctcgaaaa aatgaaaaat      120

gaccgcacgc ttgaacaggc aacgaacgtc gcaatgctgc caggcattta taaatacagt      180

attgtgatgc ccgatggcca ccaggggtac ggatttccaa ttggaggggt agccgctttc      240

gatgttaaag agggcgtaat cagtcctggt gggatcgggt acgacatcaa ttgtggagtc      300

cgactgatca gaaccaatct cactgagaaa gaagtaaggc ccagaatcaa gcaactggtt      360

gatactctgt ttaaaaacgt cccttctgga gtgggcagtc aagggcggat taaactgcat      420

tggactcaaa tagacgatgt actcgtagac ggggcaaaat gggctgtgga caacggatat      480

ggatgggagc gcgacctcga acggttggaa gaaggtggtc ggatggaggg ggccgatcca      540

gaggcggtct cccaacgggc aaagcagagg ggagcacccc agctcgggtc cctggggtct      600

ggcaaccatt tcctcgaagt acaggtcgta gataagatct ttgatcctga agtagcgaaa      660

gcgtatggcc tcttcgaggg gcaagtggtt gtgatggttc acactggtag cagaggtctt      720

gggcaccaag ttgcatccga ctacttgcga atcatggagc gcgcaattag gaagtataga      780

atcccctggc cggatagaga gcttgtctca gtcccttttc aaagcgagga aggacaaaga      840

tacttcagcg ccatgaaagc cgcggcaaac tttgcatggg caaatcggca gatgataact      900

cattgggtac gagaatcatt ccaagaggtc ttcaaacaag atccggaagg cgacctcggc      960

atggacattg tgtacgatgt cgcccacaat ataggcaaag tggaggagca cgaggtcgat     1020

ggcaaacggg tgaaagttat agtccatcga aagggagcaa ctcgcgcttt tccaccaggt     1080

cacgaggctg tacctaggct gtatcgggat gtcggtcaac ctgtactcat acccggatct     1140

atgggcacag cttcctatat tctggctggc actgaaggag caatgaaaga gacgtttgga     1200

tctacctgtc acggagctgg tagggtactc tcccggaagg ccgcgacacg acaatatcgc     1260

ggggacagga tcagacaaga acttttgaat agaggcatct acgtgcgcgc cgctagtatg     1320

cgcgtcgtgg ccgaagaggc acctggggct tacaagaacg tggataacgt agttaaagta     1380

gtaagtgaag ccggcatcgc caagctggtg gcccggatgc gcccgattgg cgtggcaaag     1440

ggttag                                                                1446


<210>  89
<211>  481
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Pyrococcus sp. ST04 RtcB protein sequence

<400>  89

Met Thr Val Pro Leu Lys Arg Ile Asp Arg Ile Arg Trp Glu Ile Pro 
1               5                   10                  15      


Lys Phe Asp Lys Arg Met Arg Val Pro Gly Arg Val Tyr Ala Asp Glu 
            20                  25                  30          


Val Leu Ile Glu Lys Met Arg Ser Asp Arg Thr Leu Glu Gln Ala Ala 
        35                  40                  45              


Asn Val Ala Met Leu Pro Gly Ile Tyr Lys Tyr Ser Ile Val Met Pro 
    50                  55                  60                  


Asp Gly His Gln Gly Tyr Gly Phe Pro Ile Gly Gly Val Ala Ala Phe 
65                  70                  75                  80  


Asp Val Lys Glu Gly Val Ile Ser Pro Gly Gly Ile Gly Tyr Asp Ile 
                85                  90                  95      


Asn Cys Gly Val Arg Leu Ile Arg Thr Asn Leu Thr Glu Lys Glu Val 
            100                 105                 110         


Arg Pro Lys Ile Lys Gln Leu Val Asp Thr Leu Phe Lys Asn Val Pro 
        115                 120                 125             


Ser Gly Val Gly Ser Gln Gly Arg Ile Arg Leu His Trp Thr Gln Ile 
    130                 135                 140                 


Asp Asp Val Leu Val Asp Gly Ala Lys Trp Ala Val Asp Asn Gly Tyr 
145                 150                 155                 160 


Gly Trp Glu Arg Asp Leu Glu Arg Leu Glu Glu Gly Gly Arg Met Glu 
                165                 170                 175     


Gly Ala Asp Pro Asp Ala Val Ser Gln Arg Ala Lys Gln Arg Gly Ala 
            180                 185                 190         


Pro Gln Leu Gly Ser Leu Gly Ser Gly Asn His Phe Leu Glu Val Gln 
        195                 200                 205             


Val Val Asp Lys Ile Tyr Asp Glu Glu Val Ala Lys Ala Tyr Gly Leu 
    210                 215                 220                 


Phe Glu Gly Gln Val Val Val Met Val His Thr Gly Ser Arg Gly Leu 
225                 230                 235                 240 


Gly His Gln Val Ala Ser Asp Tyr Leu Arg Ile Met Glu Arg Ala Ile 
                245                 250                 255     


Arg Lys Tyr Arg Ile Pro Trp Pro Asp Arg Glu Leu Val Ser Val Pro 
            260                 265                 270         


Phe Gln Ser Glu Glu Gly Gln Arg Tyr Phe Ser Ala Met Lys Ala Ala 
        275                 280                 285             


Ala Asn Phe Ala Trp Ala Asn Arg Gln Met Ile Thr His Trp Val Arg 
    290                 295                 300                 


Glu Ser Phe Gln Glu Val Phe Arg Gln Asp Pro Glu Gly Asp Leu Gly 
305                 310                 315                 320 


Met Asp Ile Val Tyr Asp Val Ala His Asn Ile Gly Lys Val Glu Glu 
                325                 330                 335     


His Glu Val Asp Gly Lys Lys Val Thr Val Ile Val His Arg Lys Gly 
            340                 345                 350         


Ala Thr Arg Ala Phe Pro Pro Gly His Glu Ala Ile Pro Arg Ile Tyr 
        355                 360                 365             


Arg Asp Val Gly Gln Pro Val Leu Ile Pro Gly Ser Met Gly Thr Ala 
    370                 375                 380                 


Ser Tyr Val Leu Ala Gly Thr Glu Gly Ala Met Lys Glu Thr Phe Gly 
385                 390                 395                 400 


Ser Thr Cys His Gly Ala Gly Arg Val Leu Ser Arg Lys Ala Ala Thr 
                405                 410                 415     


Arg Gln Tyr Arg Gly Asp Arg Ile Arg Asn Glu Leu Leu Gln Arg Gly 
            420                 425                 430         


Ile Tyr Val Arg Ala Ala Ser Met Arg Val Val Ala Glu Glu Ala Pro 
        435                 440                 445             


Gly Ala Tyr Lys Asn Val Asp Asn Val Val Lys Val Val Ser Glu Ala 
    450                 455                 460                 


Gly Ile Ala Lys Leu Val Ala Arg Met Arg Pro Ile Gly Val Ala Lys 
465                 470                 475                 480 


Gly 
    


<210>  90
<211>  1446
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Pyrococcus sp. ST04 RtcB human codon optimized nucleic acid 
       sequence

<400>  90
atgaccgttc ccctgaagag aatagatagg attcgctggg agatccctaa gttcgacaaa       60

aggatgagag taccaggacg ggtgtatgca gatgaggtct tgatcgagaa aatgagaagc      120

gaccgcacgc ttgaacaggc agccaacgtc gcaatgctgc caggcattta taaatacagt      180

attgtgatgc ccgatggcca ccaggggtac ggatttccaa ttggaggggt agccgctttc      240

gatgttaaag agggcgtaat cagtcctggt gggatcgggt acgacatcaa ttgtggagtc      300

cgactgatca gaaccaatct cactgagaaa gaagtaaggc ccaaaatcaa gcaactggtt      360

gatactctgt ttaaaaacgt cccttctgga gtgggcagtc aagggcggat tagactgcat      420

tggactcaaa tagacgatgt actcgtagac ggggcaaaat gggctgtgga caacggatat      480

ggatgggagc gcgacctcga acggttggaa gaaggtggtc ggatggaggg ggccgatcca      540

gacgcggtct cccaacgggc aaagcagagg ggagcacccc agctcgggtc cctggggtct      600

ggcaaccatt tcctcgaagt acaggtcgta gataagatct acgatgagga agtagcgaaa      660

gcgtatggcc tcttcgaggg gcaagtggtt gtgatggttc acactggtag cagaggtctt      720

gggcaccaag ttgcatccga ctacttgcga atcatggagc gcgcaattag gaagtataga      780

atcccctggc cggatagaga gcttgtctca gtcccttttc aaagcgagga aggacaaaga      840

tacttcagcg ccatgaaagc cgcggcaaac tttgcatggg caaatcggca gatgataact      900

cattgggtac gagaatcatt ccaagaggtc ttcagacaag atccggaagg cgacctcggc      960

atggacattg tgtacgatgt cgcccacaat ataggcaaag tggaggagca cgaggtcgat     1020

ggcaagaaag tgaccgttat agtccatcga aagggagcaa ctcgcgcttt tccaccaggt     1080

cacgaggcta tccctaggat ctatcgggat gtcggtcaac ctgtactcat acccggatct     1140

atgggcacag cttcctatgt gctggctggc actgaaggag caatgaaaga gacgtttgga     1200

tctacctgtc acggagctgg tagggtactc tcccggaagg ccgcgacacg acaatatcgc     1260

ggggacagga tcagaaatga acttttgcaa agaggcatct acgtgcgcgc cgctagtatg     1320

cgcgtcgtgg ccgaagaggc acctggggct tacaagaacg tggataacgt agttaaagta     1380

gtaagtgaag ccggcatcgc caagctggtg gcccggatgc gcccgattgg cgtggcaaag     1440

ggttag                                                                1446


<210>  91
<211>  480
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Thermococcus sp. EP1 RtcB protein sequence

<400>  91

Met Glu Ile Pro Leu Lys Arg Leu Asp Lys Ile Arg Trp Glu Ile Pro 
1               5                   10                  15      


Lys Phe Asn Arg Arg Met Arg Val Pro Gly Arg Val Tyr Ala Asp Asp 
            20                  25                  30          


Thr Leu Leu Gln Lys Met Arg Gln Asp Lys Thr Leu Glu Gln Ala Thr 
        35                  40                  45              


Asn Val Ala Met Leu Pro Gly Ile Tyr Lys Tyr Ser Ile Val Met Pro 
    50                  55                  60                  


Asp Gly His Gln Gly Tyr Gly Phe Pro Ile Gly Gly Val Ala Ala Phe 
65                  70                  75                  80  


Asp Val Lys Glu Gly Val Ile Ser Pro Gly Gly Val Gly Tyr Asp Ile 
                85                  90                  95      


Asn Cys Gly Val Arg Leu Ile Arg Thr Asn Leu Val Glu Lys Glu Val 
            100                 105                 110         


Arg Pro Lys Ile Lys Gln Leu Ile Asp Thr Leu Phe Lys Asn Val Pro 
        115                 120                 125             


Ser Gly Leu Gly Ser Lys Gly Arg Ile Arg Leu His Trp Thr Gln Leu 
    130                 135                 140                 


Asp Asp Val Leu Ala Asp Gly Ala Lys Trp Ala Val Asp Asn Gly Tyr 
145                 150                 155                 160 


Gly Trp Lys Asp Asp Leu Glu His Leu Glu Glu Gly Gly Arg Met Glu 
                165                 170                 175     


Gly Ala Asn Pro Asn Ala Val Ser Gln Lys Ala Lys Gln Arg Gly Ala 
            180                 185                 190         


Pro Gln Leu Gly Ser Leu Gly Ser Gly Asn His Phe Leu Glu Ile Gln 
        195                 200                 205             


Val Val Asp Lys Val Phe Asn Glu Glu Ile Ala Lys Ala Tyr Gly Leu 
    210                 215                 220                 


Phe Glu Gly Gln Ile Val Val Met Val His Thr Gly Ser Arg Gly Leu 
225                 230                 235                 240 


Gly His Gln Val Ala Ser Asp Tyr Leu Arg Ile Met Glu Lys Ala Asn 
                245                 250                 255     


Arg Lys Tyr Asn Val Pro Trp Pro Asp Arg Glu Leu Val Ser Val Pro 
            260                 265                 270         


Phe Gln Thr Glu Glu Gly Gln Arg Tyr Phe Ser Ala Met Lys Ala Ala 
        275                 280                 285             


Ala Asn Phe Ala Trp Ala Asn Arg Gln Met Ile Thr His Trp Val Arg 
    290                 295                 300                 


Glu Ser Phe Glu Glu Val Phe Lys Gln Lys Ala Glu Asp Leu Gly Met 
305                 310                 315                 320 


His Ile Val Tyr Asp Val Ala His Asn Ile Ala Lys Val Glu Glu His 
                325                 330                 335     


Glu Val Asn Gly Arg Lys Ile Lys Val Val Val His Arg Lys Gly Ala 
            340                 345                 350         


Thr Arg Ala Phe Pro Ala Gly His Glu Ala Ile Pro Lys Ala Tyr Arg 
        355                 360                 365             


Asp Val Gly Gln Pro Val Leu Ile Pro Gly Ser Met Gly Thr Ala Ser 
    370                 375                 380                 


Tyr Val Leu Ala Gly Ala Glu Gly Ser Met Arg Glu Thr Phe Gly Ser 
385                 390                 395                 400 


Thr Cys His Gly Ala Gly Arg Val Leu Ser Arg His Ala Ala Thr Arg 
                405                 410                 415     


Gln Phe Arg Gly Asp Arg Leu Arg Asn Glu Leu Met Gln Arg Gly Ile 
            420                 425                 430         


Tyr Ile Arg Ala Ala Ser Met Arg Val Val Ala Glu Glu Ala Pro Gly 
        435                 440                 445             


Ala Tyr Lys Asn Val Asp Asn Val Val Arg Val Val His Glu Ala Gly 
    450                 455                 460                 


Ile Ala Asn Leu Val Ala Arg Met Arg Pro Ile Gly Val Ala Lys Gly 
465                 470                 475                 480 


<210>  92
<211>  1446
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Thermococcus sp. EP1 RtcB human codon optimized nucleic acid 
       sequence

<400>  92
atggagatac cactcaaacg acttgacaag atccgatggg agattcccaa atttaacaga       60

cgaatgagag ttccgggaag agtttacgca gatgatacat tgctccaaaa gatgcgacaa      120

gataagacgc tcgaacaagc caccaacgtg gccatgctcc caggcattta taagtatagt      180

atagtcatgc ctgacggaca ccagggttat ggattcccga ttggcggtgt agcagccttc      240

gacgtaaaag agggagtaat tagtcctggc ggtgttggtt atgatattaa ctgtggcgtg      300

aggcttatca ggacgaatct tgtagagaag gaagtgcgac caaaaatcaa acaacttata      360

gatactttgt tcaaaaatgt cccgtctggg ctcggatcaa agggtcggat aaggctccac      420

tggactcaac tggatgatgt tctggctgat ggggcaaaat gggctgttga caatgggtac      480

gggtggaagg atgatctcga acatttggag gagggcggac ggatggaggg cgcaaacccc      540

aatgccgttt cacagaaagc gaagcaaagg ggagcgccac agcttgggtc ccttggctca      600

ggcaatcatt tcctcgaaat tcaggtcgtc gataaggttt ttaacgaaga gatagcaaag      660

gcttacggac tctttgaagg tcagatagtg gtaatggtcc atacgggctc tcggggactg      720

ggacatcaag tcgcaagtga ctacctgagg atcatggaga aagccaatcg caagtacaat      780

gtgccctggc ctgaccggga gcttgttagc gtgcccttcc agacggaaga gggtcaacga      840

tactttagcg ctatgaaggc ggcagctaat ttcgcttggg caaacagaca gatgataaca      900

cattgggtta gagagtcctt cgaggaggtc tttaaacaaa aagctgagga ccttggaatg      960

catattgtct atgatgttgc ccataacata gcaaaagtag aggaacatga ggtgaacggg     1020

cggaaaatta aggtcgtagt acacagaaaa ggcgctacca gagcattccc cgcaggacac     1080

gaggccatac ccaaagcata tagagatgtc ggccagccag tgctcatacc gggatctatg     1140

ggtacggcgt cctatgtctt ggcgggtgct gaaggatcaa tgagggagac gttcggctca     1200

acctgtcatg gggcaggtcg ggtcttgtct cggcatgctg caactcggca gttccgcggg     1260

gatcgactca ggaatgaact catgcagaga ggcatttaca tacgcgctgc ctccatgcgc     1320

gttgtcgccg aggaagctcc cggcgcctat aagaacgtag acaatgtcgt cagggtggtg     1380

catgaagcgg gaattgcgaa cttggtagcc aggatgcgcc caataggggt tgccaaggga     1440

tagtaa                                                                1446


<210>  93
<211>  167
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Human Archease protein sequence

<400>  93

Met Ala Gln Glu Glu Glu Asp Val Arg Asp Tyr Asn Leu Thr Glu Glu 
1               5                   10                  15      


Gln Lys Ala Ile Lys Ala Lys Tyr Pro Pro Val Asn Arg Lys Tyr Glu 
            20                  25                  30          


Tyr Leu Asp His Thr Ala Asp Val Gln Leu His Ala Trp Gly Asp Thr 
        35                  40                  45              


Leu Glu Glu Ala Phe Glu Gln Cys Ala Met Ala Met Phe Gly Tyr Met 
    50                  55                  60                  


Thr Asp Thr Gly Thr Val Glu Pro Leu Gln Thr Val Glu Val Glu Thr 
65                  70                  75                  80  


Gln Gly Asp Asp Leu Gln Ser Leu Leu Phe His Phe Leu Asp Glu Trp 
                85                  90                  95      


Leu Tyr Lys Phe Ser Ala Asp Glu Phe Phe Ile Pro Arg Glu Val Lys 
            100                 105                 110         


Val Leu Ser Ile Asp Gln Arg Asn Phe Lys Leu Arg Ser Ile Gly Trp 
        115                 120                 125             


Gly Glu Glu Phe Ser Leu Ser Lys His Pro Gln Gly Thr Glu Val Lys 
    130                 135                 140                 


Ala Ile Thr Tyr Ser Ala Met Gln Val Tyr Asn Glu Glu Asn Pro Glu 
145                 150                 155                 160 


Val Phe Val Ile Ile Asp Ile 
                165         


<210>  94
<211>  461
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Human Archease human codon optimized nucleic acid sequence

<400>  94
aggaacaaaa ggccatcaaa gcgaaatatc cgcctgtaaa ccgaaagtat gagtacctgg       60

atcacactgc ggacgtccag ttgcatgcct ggggcgacac tctggaggag gcattcgaac      120

aatgtgcaat ggcaatgttt ggctacatga ctgatacagg cacagtggag ccccttcaaa      180

cggtagaggt agaaactcag ggagatgatc ttcagagctt gctcttccat tttctcgacg      240

aatggttgta taagttcagt gccgacgagt tcttcattcc acgcgaagtg aaagtgctga      300

gtattgatca gagaaacttt aaacttaggt ctattgggtg gggtgaagag ttctctttgt      360

ctaaacaccc tcaaggaact gaggtaaagg cgataactta ctcagccatg caggtatata      420

acgaggagaa tcctgaggtt ttcgtaatca ttgatatata g                          461


<210>  95
<211>  142
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Pyrococcus horikoshii Archease protein sequence

<400>  95

Met Lys Lys Trp Glu His Tyr Glu His Thr Ala Asp Ile Gly Ile Arg 
1               5                   10                  15      


Gly Tyr Gly Asp Ser Leu Glu Glu Ala Phe Glu Ala Val Ala Ile Ala 
            20                  25                  30          


Leu Phe Asp Val Met Val Asn Val Asn Lys Val Glu Lys Lys Glu Val 
        35                  40                  45              


Arg Glu Ile Glu Val Glu Ala Glu Asp Leu Glu Ala Leu Leu Tyr Ser 
    50                  55                  60                  


Phe Leu Glu Glu Leu Leu Val Ile His Asp Ile Glu Gly Leu Val Phe 
65                  70                  75                  80  


Arg Asp Phe Glu Val Lys Ile Glu Arg Val Asn Gly Lys Tyr Arg Leu 
                85                  90                  95      


Arg Ala Lys Ala Tyr Gly Glu Lys Leu Asp Leu Lys Lys His Glu Pro 
            100                 105                 110         


Lys Glu Glu Val Lys Ala Ile Thr Tyr His Asp Met Lys Ile Glu Arg 
        115                 120                 125             


Leu Pro Asn Gly Lys Trp Met Ala Gln Leu Val Pro Asp Ile 
    130                 135                 140         


<210>  96
<211>  429
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Pyrococcus horikoshii Archease human codon optimized nucleic acid
       sequence

<400>  96
atgaagaaat gggagcacta tgagcatact gccgacattg gtattcgggg atatggggat       60

agccttgagg aggcattcga agcagtagcc atcgcgctct ttgatgtaat ggtgaacgtg      120

aataaagtcg agaagaagga agtccgagaa attgaagtgg aggcagaaga tttggaggcc      180

ctcctttatt cattcctgga agaactgttg gttattcatg atatagaggg actggttttc      240

agggactttg aagttaagat agagagagta aatggcaaat accgacttcg agcgaaagcc      300

tacggtgaga agctcgacct caagaagcac gaaccgaaag aggaagtaaa ggcgataacc      360

taccatgata tgaaaattga acggttgccc aatggaaagt ggatggctca actcgttcca      420

gatatttag                                                              429


<210>  97
<211>  301
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  T4 Polynucleotide Kinase (T4 PNK) protein sequence

<400>  97

Met Lys Lys Ile Ile Leu Thr Ile Gly Cys Pro Gly Ser Gly Lys Ser 
1               5                   10                  15      


Thr Trp Ala Arg Glu Phe Ile Ala Lys Asn Pro Gly Phe Tyr Asn Ile 
            20                  25                  30          


Asn Arg Asp Asp Tyr Arg Gln Ser Ile Met Ala His Glu Glu Arg Asp 
        35                  40                  45              


Glu Tyr Lys Tyr Thr Lys Lys Lys Glu Gly Ile Val Thr Gly Met Gln 
    50                  55                  60                  


Phe Asp Thr Ala Lys Ser Ile Leu Tyr Gly Gly Asp Ser Val Lys Gly 
65                  70                  75                  80  


Val Ile Ile Ser Asp Thr Asn Leu Asn Pro Glu Arg Arg Leu Ala Trp 
                85                  90                  95      


Glu Thr Phe Ala Lys Glu Tyr Gly Trp Lys Val Glu His Lys Val Phe 
            100                 105                 110         


Asp Val Pro Trp Thr Glu Leu Val Lys Arg Asn Ser Lys Arg Gly Thr 
        115                 120                 125             


Lys Ala Val Pro Ile Asp Val Leu Arg Ser Met Tyr Lys Ser Met Arg 
    130                 135                 140                 


Glu Tyr Leu Gly Leu Pro Val Tyr Asn Gly Thr Pro Gly Lys Pro Lys 
145                 150                 155                 160 


Ala Val Ile Phe Asp Val Asp Gly Thr Leu Ala Lys Met Asn Gly Arg 
                165                 170                 175     


Gly Pro Tyr Asp Leu Glu Lys Cys Asp Thr Asp Val Ile Asn Pro Met 
            180                 185                 190         


Val Val Glu Leu Ser Lys Met Tyr Ala Leu Met Gly Tyr Gln Ile Val 
        195                 200                 205             


Val Val Ser Gly Arg Glu Ser Gly Thr Lys Glu Asp Pro Thr Lys Tyr 
    210                 215                 220                 


Tyr Arg Met Thr Arg Lys Trp Val Glu Asp Ile Ala Gly Val Pro Leu 
225                 230                 235                 240 


Val Met Gln Cys Gln Arg Glu Gln Gly Asp Thr Arg Lys Asp Asp Val 
                245                 250                 255     


Val Lys Glu Glu Ile Phe Trp Lys His Ile Ala Pro His Phe Asp Val 
            260                 265                 270         


Lys Leu Ala Ile Asp Asp Arg Thr Gln Val Val Glu Met Trp Arg Arg 
        275                 280                 285             


Ile Gly Val Glu Cys Trp Gln Val Ala Ser Gly Asp Phe 
    290                 295                 300     


<210>  98
<211>  906
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  T4 PNK human codon optimized nucleic acid sequence

<400>  98
atgaagaaaa ttatacttac aatcggatgc cctggtagtg gtaagagcac ttgggcgagg       60

gaatttattg cgaagaaccc tggattttat aatatcaatc gagacgacta ccggcagtct      120

attatggccc acgaggaacg agacgaatac aagtatacca agaagaaaga agggattgtc      180

acgggtatgc aatttgacac cgccaaatca atactgtacg gaggtgattc agtcaaaggc      240

gttatcatat cagacactaa cctcaatcct gaacgccgat tggcatggga aacatttgcg      300

aaggaatacg gttggaaggt tgaacacaag gtgttcgatg tcccgtggac cgaactggta      360

aaacgcaatt ctaaacgagg cactaaagct gtgcccattg acgtacttcg aagtatgtac      420

aagtccatga gagagtacct ggggcttccc gtctataacg gtacgccggg caaaccgaag      480

gcggtgatct ttgacgtaga tgggactctg gcgaagatga atggtcgcgg accatacgat      540

ttggaaaaat gtgacacaga tgtaatcaac ccaatggtag tagagcttag caagatgtac      600

gcattgatgg gctaccaaat tgtcgtggtg tccgggcggg agtcaggcac aaaagaagat      660

ccgacgaagt attatcgcat gacacggaaa tgggtcgaag atatagccgg ggtgcctctc      720

gttatgcaat gtcaacgaga acagggcgac acacggaagg atgacgtagt gaaggaggaa      780

attttctgga agcatatagc gccacacttt gacgttaagc tcgccatcga cgaccgaact      840

caggtggtcg agatgtggcg acgaattggc gtagagtgtt ggcaagttgc atctggagat      900

ttttag                                                                 906


<210>  99
<211>  176
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  E. Coli thpR protein sequence

<400>  99

Met Ser Glu Pro Gln Arg Leu Phe Phe Ala Ile Asp Leu Pro Ala Glu 
1               5                   10                  15      


Ile Arg Glu Gln Ile Ile His Trp Arg Ala Thr His Phe Pro Pro Glu 
            20                  25                  30          


Ala Gly Arg Pro Val Ala Ala Asp Asn Leu His Leu Thr Leu Ala Phe 
        35                  40                  45              


Leu Gly Glu Val Ser Ala Glu Lys Glu Lys Ala Leu Ser Leu Leu Ala 
    50                  55                  60                  


Gly Arg Ile Arg Gln Pro Gly Phe Thr Leu Thr Leu Asp Asp Ala Gly 
65                  70                  75                  80  


Gln Trp Leu Arg Ser Arg Val Val Trp Leu Gly Met Arg Gln Pro Pro 
                85                  90                  95      


Arg Gly Leu Ile Gln Leu Ala Asn Met Leu Arg Ser Gln Ala Ala Arg 
            100                 105                 110         


Ser Gly Cys Phe Gln Ser Asn Arg Pro Phe His Pro His Ile Thr Leu 
        115                 120                 125             


Leu Arg Asp Ala Ser Glu Ala Val Thr Ile Pro Pro Pro Gly Phe Asn 
    130                 135                 140                 


Trp Ser Tyr Ala Val Thr Glu Phe Thr Leu Tyr Ala Ser Ser Phe Ala 
145                 150                 155                 160 


Arg Gly Arg Thr Arg Tyr Thr Pro Leu Lys Arg Trp Ala Leu Thr Gln 
                165                 170                 175     


<210>  100
<211>  531
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  E. Coli thpR human codon optimized nucleic acid sequence

<400>  100
atgagtgagc ctcaacgatt gttctttgcc atagatttgc ctgctgaaat tagagagcaa       60

attatccatt ggagagccac ccatttcccc ccagaagctg gacgaccagt cgcagcggac      120

aacctccacc ttacactggc gttcttgggt gaagtgagcg ccgagaaaga gaaagctctc      180

tcacttctgg ctgggaggat tcggcagccg ggctttaccc ttactctgga tgatgccggc      240

cagtggctga ggtccagggt tgtctggctc ggaatgaggc aaccacctag ggggctcatc      300

cagctcgcca atatgctgag atcccaggcc gcaaggtctg gctgcttcca atcaaacagg      360

ccattccacc cgcatattac cttgctcaga gatgcctccg aggcagtaac tattccacct      420

cccggcttta actggagtta cgccgtcaca gaatttactc tgtacgcctc cagcttcgcc      480

cgagggagaa ccaggtacac gcctttgaag cggtgggcct tgacccagta g               531


<210>  101
<211>  521
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Human PNKP protein sequence

<400>  101

Met Gly Glu Val Glu Ala Pro Gly Arg Leu Trp Leu Glu Ser Pro Pro 
1               5                   10                  15      


Gly Gly Ala Pro Pro Ile Phe Leu Pro Ser Asp Gly Gln Ala Leu Val 
            20                  25                  30          


Leu Gly Arg Gly Pro Leu Thr Gln Val Thr Asp Arg Lys Cys Ser Arg 
        35                  40                  45              


Thr Gln Val Glu Leu Val Ala Asp Pro Glu Thr Arg Thr Val Ala Val 
    50                  55                  60                  


Lys Gln Leu Gly Val Asn Pro Ser Thr Thr Gly Thr Gln Glu Leu Lys 
65                  70                  75                  80  


Pro Gly Leu Glu Gly Ser Leu Gly Val Gly Asp Thr Leu Tyr Leu Val 
                85                  90                  95      


Asn Gly Leu His Pro Leu Thr Leu Arg Trp Glu Glu Thr Arg Thr Pro 
            100                 105                 110         


Glu Ser Gln Pro Asp Thr Pro Pro Gly Thr Pro Leu Val Ser Gln Asp 
        115                 120                 125             


Glu Lys Arg Asp Ala Glu Leu Pro Lys Lys Arg Met Arg Lys Ser Asn 
    130                 135                 140                 


Pro Gly Trp Glu Asn Leu Glu Lys Leu Leu Val Phe Thr Ala Ala Gly 
145                 150                 155                 160 


Val Lys Pro Gln Gly Lys Val Ala Gly Phe Asp Leu Asp Gly Thr Leu 
                165                 170                 175     


Ile Thr Thr Arg Ser Gly Lys Val Phe Pro Thr Gly Pro Ser Asp Trp 
            180                 185                 190         


Arg Ile Leu Tyr Pro Glu Ile Pro Arg Lys Leu Arg Glu Leu Glu Ala 
        195                 200                 205             


Glu Gly Tyr Lys Leu Val Ile Phe Thr Asn Gln Met Ser Ile Gly Arg 
    210                 215                 220                 


Gly Lys Leu Pro Ala Glu Glu Phe Lys Ala Lys Val Glu Ala Val Val 
225                 230                 235                 240 


Glu Lys Leu Gly Val Pro Phe Gln Val Leu Val Ala Thr His Ala Gly 
                245                 250                 255     


Leu Tyr Arg Lys Pro Val Thr Gly Met Trp Asp His Leu Gln Glu Gln 
            260                 265                 270         


Ala Asn Asp Gly Thr Pro Ile Ser Ile Gly Asp Ser Ile Phe Val Gly 
        275                 280                 285             


Asp Ala Ala Gly Arg Pro Ala Asn Trp Ala Pro Gly Arg Lys Lys Lys 
    290                 295                 300                 


Asp Phe Ser Cys Ala Asp Arg Leu Phe Ala Leu Asn Leu Gly Leu Pro 
305                 310                 315                 320 


Phe Ala Thr Pro Glu Glu Phe Phe Leu Lys Trp Pro Ala Ala Gly Phe 
                325                 330                 335     


Glu Leu Pro Ala Phe Asp Pro Arg Thr Val Ser Arg Ser Gly Pro Leu 
            340                 345                 350         


Cys Leu Pro Glu Ser Arg Ala Leu Leu Ser Ala Ser Pro Glu Val Val 
        355                 360                 365             


Val Ala Val Gly Phe Pro Gly Ala Gly Lys Ser Thr Phe Leu Lys Lys 
    370                 375                 380                 


His Leu Val Ser Ala Gly Tyr Val His Val Asn Arg Asp Thr Leu Gly 
385                 390                 395                 400 


Ser Trp Gln Arg Cys Val Thr Thr Cys Glu Thr Ala Leu Lys Gln Gly 
                405                 410                 415     


Lys Arg Val Ala Ile Asp Asn Thr Asn Pro Asp Ala Ala Ser Arg Ala 
            420                 425                 430         


Arg Tyr Val Gln Cys Ala Arg Ala Ala Gly Val Pro Cys Arg Cys Phe 
        435                 440                 445             


Leu Phe Thr Ala Thr Leu Glu Gln Ala Arg His Asn Asn Arg Phe Arg 
    450                 455                 460                 


Glu Met Thr Asp Ser Ser His Ile Pro Val Ser Asp Met Val Met Tyr 
465                 470                 475                 480 


Gly Tyr Arg Lys Gln Phe Glu Ala Pro Thr Leu Ala Glu Gly Phe Ser 
                485                 490                 495     


Ala Ile Leu Glu Ile Pro Phe Arg Leu Trp Val Glu Pro Arg Leu Gly 
            500                 505                 510         


Arg Leu Tyr Cys Gln Phe Ser Glu Gly 
        515                 520     


<210>  102
<211>  1566
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Human PNKP human codon optimized nucleic acid sequence

<400>  102
atgggcgagg tggaggcccc gggccgcttg tggctcgaga gcccccctgg gggagcgccc       60

cccatcttcc tgccctcgga cgggcaagcc ctggtcctgg gcaggggacc cctgacccag      120

gttacggacc ggaagtgctc cagaactcaa gtggagctgg tcgcagatcc tgagacccgg      180

acagtggcag tgaaacagct gggagttaac ccctcaacta ccgggaccca ggagttgaag      240

ccggggttgg agggctctct gggggtgggg gacacactgt atttggtcaa tggcctccac      300

ccactgaccc tgcgctggga agagacccgc acaccagaat cccagccaga tactccgcct      360

ggcacccctc tggtgtccca agatgagaag agagatgctg agctgccgaa gaagcgtatg      420

cggaagtcaa accccggctg ggagaacttg gagaagttgc tagtgttcac cgcagctggg      480

gtgaaacccc agggcaaggt ggctggcttt gatctggacg ggacgctcat caccacacgc      540

tctgggaagg tctttcccac tggccccagt gactggagga tcttgtaccc agagattccc      600

cgtaagctcc gagagctgga agccgagggc tacaagctgg tgatcttcac caaccagatg      660

agcatcgggc gcgggaagct gccagccgag gagttcaagg ccaaggtgga ggctgtggtg      720

gagaagctgg gggtcccctt ccaggtgctg gtggccacgc acgcaggctt gtaccggaag      780

ccggtgacgg gcatgtggga ccatctgcag gagcaggcca acgacggcac gcccatatcc      840

atcggggaca gcatctttgt gggagacgca gccggacgcc cggccaactg ggccccgggg      900

cggaagaaga aagacttctc ctgcgccgat cgcctgtttg ccctcaacct tggcctgccc      960

ttcgccacgc ctgaggagtt ctttctcaag tggccagcag ccggcttcga gctcccagcc     1020

tttgatccga ggactgtctc ccgctcaggg cctctctgcc tccccgagtc cagggccctc     1080

ctgagcgcca gcccggaggt ggttgtcgca gtgggattcc ctggggccgg gaagtccacc     1140

tttctcaaga agcacctcgt gtcggccgga tatgtccacg tgaacaggga cacgctaggc     1200

tcctggcagc gctgtgtgac cacgtgtgag acagccctga agcaagggaa acgggtcgcc     1260

atcgacaaca caaacccaga cgccgcgagc cgcgccaggt acgtccagtg tgcccgagcc     1320

gcgggcgtcc cctgccgctg cttcctcttc accgccactc tggagcaggc gcgccacaac     1380

aaccggtttc gagagatgac ggactcctct catatccccg tgtcagacat ggtcatgtat     1440

ggctacagga agcagttcga ggccccaacg ctggctgaag gcttctctgc catcctggag     1500

atcccgttcc ggctatgggt ggagccgagg ctggggcggc tgtactgcca gttctccgag     1560

ggctag                                                                1566


<210>  103
<211>  857
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  NtGFP-HDV-HH-CtGFP syntethic intron

<400>  103
auggugagca agggcgagga gcuguucacc gggguggugc ccauccuggu cgagcuggac       60

ggcgacguaa acggccacaa guucagcgug uccggcgagg gcgagggcga ugccaccuac      120

ggcaagcuga cccugaaguu caucugcacc accggcaagc ugcccgugcc cuggcccacc      180

cucgugacca cccugaccua cggcgugcag ugcuucagcc gcuaccccga ccacaugaag      240

cagcacgacu ucuucaaguc cgccaugccc gaaggcuacg uccaggagcg caccaucuuc      300

uuggccggca uggucccagc cuccucgcug gcgccggcug ggcaacaugc uucggcaugg      360

cgaaugggac cccgggacau aacuaguuaa accaaauccu ugcugaugag uccgugagga      420

cgaaacgagu aagcucgucc aaggacgacg gcaacuacaa gacccgcgcc gaggugaagu      480

ucgagggcga cacccuggug aaccgcaucg agcugaaggg caucgacuuc aaggaggacg      540

gcaacauccu ggggcacaag cuggaguaca acuacaacag ccacaacguc uauaucaugg      600

ccgacaagca gaagaacggc aucaagguga acuucaagau ccgccacaac aucgaggacg      660

gcagcgugca gcucgccgac cacuaccagc agaacacccc caucggcgac ggccccgugc      720

ugcugcccga caaccacuac cugagcaccc aguccgcccu gagcaaagac cccaacgaga      780

agcgcgauca caugguccug cuggaguucg ugaccgccgc cgggaucacu cucggcaugg      840

acgagcugua caaguag                                                     857


<210>  104
<211>  248
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  sGCN4 5' UTR uORFs

<400>  104
uuaaagauca uugaaaaaug gcuugcuaaa ccgauuauau uuuguuuuua aaguagauua       60

uuauuagaaa auuauuaaga gaauuaugug uuaaauuuau ugaaagagaa aauuuauuuu      120

cccuuauuaa uuaaaguccu uuacuuuuuu ugaaaacugu caguuuuuug aagaguuauu      180

uguuuuguua ccaauugcua ucauguaccc guagaauuuu auucaagaug uuuccguaac      240

gguuaccu                                                               248


<210>  105
<211>  56
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Hammerhead (HH) for 3'


<220>
<221>  misc_feature
<222>  (1)..(4)
<223>  n is a, c, g, or u

<220>
<221>  misc_feature
<222>  (53)..(56)
<223>  n is a, c, g, or u

<400>  105
nnnndwhacc ggauguguuu uccggucuga ugaguccggu agcggacgaa whnnnn           56


<210>  106
<211>  54
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister WT with 5 nt P1 stem


<220>
<221>  misc_feature
<222>  (1)..(5)
<223>  n is a, c, g, or u

<220>
<221>  misc_feature
<222>  (50)..(54)
<223>  n is a, c, g, or u

<400>  106
nnnnnuaaca cugccaaugc cggucccaag cccggauaaa aguggagggn nnnn             54


<210>  107
<211>  54
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister Mutant with 5 nt P1 stem


<220>
<221>  misc_feature
<222>  (1)..(5)
<223>  n is a, c, g, or u

<220>
<221>  misc_feature
<222>  (50)..(54)
<223>  n is a, c, g, or u

<400>  107
nnnnnuaacu cuuccaaugc cggucccaag cccggauaaa aguggagggn nnnn             54


<210>  108
<211>  54
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister with 5 nt P1 stem with U1A mutation


<220>
<221>  misc_feature
<222>  (1)..(5)
<223>  n is a, c, g, or u

<220>
<221>  misc_feature
<222>  (50)..(54)
<223>  n is a, c, g, or u

<400>  108
nnnnnaaaca cugccaaugc cggucccaag cccggauaaa aguggagggn nnnn             54


<210>  109
<211>  54
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister with 5 nt P1 stem with U1C mutation


<220>
<221>  misc_feature
<222>  (1)..(5)
<223>  n is a, c, g, or u

<220>
<221>  misc_feature
<222>  (50)..(54)
<223>  n is a, c, g, or u

<400>  109
nnnnncaaca cugccaaugc cggucccaag cccggauaaa aguggagggn nnnn             54


<210>  110
<211>  54
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Twister with 5 nt P1 stem with U1G mutation


<220>
<221>  misc_feature
<222>  (1)..(5)
<223>  n is a, c, g, or u

<220>
<221>  misc_feature
<222>  (50)..(54)
<223>  n is a, c, g, or u

<400>  110
nnnnngaaca cugccaaugc cggucccaag cccggauaaa aguggagggn nnnn             54


<210>  111
<211>  41
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Hammerhead 4 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(4)
<223>  n is a, c, g, or u

<400>  111
nnnncugaug aguccgugag gacgaaacga guaagcucgu c                           41


<210>  112
<211>  43
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Hammerhead 6 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(6)
<223>  n is a, c, g, or u

<400>  112
nnnnnncuga ugaguccgug aggacgaaac gaguaagcuc guc                         43


<210>  113
<211>  45
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Hammerhead 8 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(8)
<223>  n is a, c, g, or u

<400>  113
nnnnnnnncu gaugaguccg ugaggacgaa acgaguaagc ucguc                       45


<210>  114
<211>  47
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Hammerhead 10 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(10)
<223>  n is a, c, g, or u

<400>  114
nnnnnnnnnn cugaugaguc cgugaggacg aaacgaguaa gcucguc                     47


<210>  115
<211>  49
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Hammerhead 12 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(12)
<223>  n is a, c, g, or u

<400>  115
nnnnnnnnnn nncugaugag uccgugagga cgaaacgagu aagcucguc                   49


<210>  116
<211>  51
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Hammerhead 14 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(14)
<223>  n is a, c, g, or u

<400>  116
nnnnnnnnnn nnnncugaug aguccgugag gacgaaacga guaagcucgu c                51


<210>  117
<211>  53
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Hammerhead 16 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(16)
<223>  n is a, c, g, or u

<400>  117
nnnnnnnnnn nnnnnncuga ugaguccgug aggacgaaac gaguaagcuc guc              53


<210>  118
<211>  45
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  TX2 Hammerhead 4 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(4)
<223>  n is a, c, g, or u

<400>  118
nnnncugaug aguccgguag cggacgaaac gcgcuucggu gcguc                       45


<210>  119
<211>  47
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  TX2 Hammerhead 6 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(6)
<223>  n is a, c, g, or u

<400>  119
nnnnnncuga ugaguccggu agcggacgaa acgcgcuucg gugcguc                     47


<210>  120
<211>  49
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  TX2 Hammerhead 8 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(8)
<223>  n is a, c, g, or u

<400>  120
nnnnnnnncu gaugaguccg guagcggacg aaacgcgcuu cggugcguc                   49


<210>  121
<211>  51
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  TX2 Hammerhead 10 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(10)
<223>  n is a, c, g, or u

<400>  121
nnnnnnnnnn cugaugaguc cgguagcgga cgaaacgcgc uucggugcgu c                51


<210>  122
<211>  53
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  TX2 Hammerhead 12 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(12)
<223>  n is a, c, g, or u

<400>  122
nnnnnnnnnn nncugaugag uccgguagcg gacgaaacgc gcuucggugc guc              53


<210>  123
<211>  55
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  TX2 Hammerhead 14 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(14)
<223>  n is a, c, g, or u

<400>  123
nnnnnnnnnn nnnncugaug aguccgguag cggacgaaac gcgcuucggu gcguc            55


<210>  124
<211>  57
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  TX2 Hammerhead 16 nt overhang for 5'


<220>
<221>  misc_feature
<222>  (1)..(16)
<223>  n is a, c, g, or u

<400>  124
nnnnnnnnnn nnnnnncuga ugaguccggu agcggacgaa acgcgcuucg gugcguc          57


<210>  125
<211>  55
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  RzB Hammerhead for 5'


<220>
<221>  misc_feature
<222>  (1)..(6)
<223>  n is a, c, g, or u

<220>
<221>  misc_feature
<222>  (10)..(14)
<223>  n is a, c, g, or u

<400>  125
nnnnnnuaan nnnncugaug agucgcuggg augcgacgaa acgccuucgg gcguc            55


<210>  126
<211>  832
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  NtGFP-HDV-CARGO-HH-CtGFP


<220>
<221>  misc_feature
<222>  (371)..(371)
<223>  n is a, c, g, or u

<400>  126
auggugagca agggcgagga gcuguucacc gggguggugc ccauccuggu cgagcuggac       60

ggcgacguaa acggccacaa guucagcgug uccggcgagg gcgagggcga ugccaccuac      120

ggcaagcuga cccugaaguu caucugcacc accggcaagc ugcccgugcc cuggcccacc      180

cucgugacca cccugaccua cggcgugcag ugcuucagcc gcuaccccga ccacaugaag      240

cagcacgacu ucuucaaguc cgccaugccc gaaggcuacg uccaggagcg caccaucuuc      300

uuggccggca uggucccagc cuccucgcug gcgccggcug ggcaacaugc uucggcaugg      360

cgaaugggac nuccuugcug augaguccgu gaggacgaaa cgaguaagcu cguccaagga      420

cgacggcaac uacaagaccc gcgccgaggu gaaguucgag ggcgacaccc uggugaaccg      480

caucgagcug aagggcaucg acuucaagga ggacggcaac auccuggggc acaagcugga      540

guacaacuac aacagccaca acgucuauau cauggccgac aagcagaaga acggcaucaa      600

ggugaacuuc aagauccgcc acaacaucga ggacggcagc gugcagcucg ccgaccacua      660

ccagcagaac acccccaucg gcgacggccc cgugcugcug cccgacaacc acuaccugag      720

cacccagucc gcccugagca aagaccccaa cgagaagcgc gaucacaugg uccugcugga      780

guucgugacc gccgccggga ucacucucgg cauggacgag cuguacaagu ag              832


<210>  127
<211>  370
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  NtGFP-HDV

<400>  127
auggugagca agggcgagga gcuguucacc gggguggugc ccauccuggu cgagcuggac       60

ggcgacguaa acggccacaa guucagcgug uccggcgagg gcgagggcga ugccaccuac      120

ggcaagcuga cccugaaguu caucugcacc accggcaagc ugcccgugcc cuggcccacc      180

cucgugacca cccugaccua cggcgugcag ugcuucagcc gcuaccccga ccacaugaag      240

cagcacgacu ucuucaaguc cgccaugccc gaaggcuacg uccaggagcg caccaucuuc      300

uuggccggca uggucccagc cuccucgcug gcgccggcug ggcaacaugc uucggcaugg      360

cgaaugggac                                                             370


<210>  128
<211>  461
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  HH-CtGFP

<400>  128
uccuugcuga ugaguccgug aggacgaaac gaguaagcuc guccaaggac gacggcaacu       60

acaagacccg cgccgaggug aaguucgagg gcgacacccu ggugaaccgc aucgagcuga      120

agggcaucga cuucaaggag gacggcaaca uccuggggca caagcuggag uacaacuaca      180

acagccacaa cgucuauauc auggccgaca agcagaagaa cggcaucaag gugaacuuca      240

agauccgcca caacaucgag gacggcagcg ugcagcucgc cgaccacuac cagcagaaca      300

cccccaucgg cgacggcccc gugcugcugc ccgacaacca cuaccugagc acccaguccg      360

cccugagcaa agaccccaac gagaagcgcg aucacauggu ccugcuggag uucgugaccg      420

ccgccgggau cacucucggc auggacgagc uguacaagua g                          461


<210>  129
<211>  3724
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Nt-miniDys (deltaH2-R15)

<400>  129
augcuuuggu gggaagaagu agaggacugu uaugaaagag aagauguuca aaagaaaaca       60

uucacaaaau ggguaaaugc acaauuuucu aaguuuggga agcagcauau ugagaaccuc      120

uucagugacc uacaggaugg gaggcgccuc cuagaccucc ucgaaggccu gacagggcaa      180

aaacugccaa aagaaaaagg auccacaaga guucaugccc ugaacaaugu caacaaggca      240

cugcggguuu ugcagaacaa uaauguugau uuagugaaua uuggaaguac ugacaucgua      300

gauggaaauc auaaacugac ucuugguuug auuuggaaua uaauccucca cuggcagguc      360

aaaaauguaa ugaaaaauau cauggcugga uugcaacaaa ccaacaguga aaagauucuc      420

cugagcuggg uccgacaauc aacucguaau uauccacagg uuaauguaau caacuucacc      480

accagcuggu cugauggccu ggcuuugaau gcucucaucc auagucauag gccagaccua      540

uuugacugga auaguguggu uugccagcag ucagccacac aacgacugga acaugcauuc      600

aacaucgcca gauaucaauu aggcauagag aaacuacucg auccugaaga uguugauacc      660

accuauccag auaagaaguc caucuuaaug uacaucacau cacucuucca aguuuugccu      720

caacaaguga gcauugaagc cauccaggaa guggaaaugu ugccaaggcc accuaaagug      780

acuaaagaag aacauuuuca guuacaucau caaaugcacu auucucaaca gaucacgguc      840

agucuagcac agggauauga gagaacuucu uccccuaagc cucgauucaa gagcuaugcc      900

uacacacagg cugcuuaugu caccaccucu gacccuacac ggagcccauu uccuucacag      960

cauuuggaag cuccugaaga caagucauuu ggcaguucau ugauggagag ugaaguaaac     1020

cuggaccguu aucaaacagc uuuagaagaa guauuaucgu ggcuucuuuc ugcugaggac     1080

acauugcaag cacaaggaga gauuucuaau gauguggaag uggugaaaga ccaguuucau     1140

acucaugagg gguacaugau ggauuugaca gcccaucagg gccggguugg uaauauucua     1200

caauugggaa guaagcugau uggaacagga aaauuaucag aagaugaaga aacugaagua     1260

caagagcaga ugaaucuccu aaauucaaga ugggaaugcc ucaggguagc uagcauggaa     1320

aaacaaagca auuuacauag aguuuuaaug gaucuccaga aucagaaacu gaaagaguug     1380

aaugacuggc uaacaaaaac agaagaaaga acaaggaaaa uggaggaaga gccucuugga     1440

ccugaucuug aagaccuaaa acgccaagua caacaacaua aggugcuuca agaagaucua     1500

gaacaagaac aagucagggu caauucucuc acucacaugg uggugguagu ugaugaaucu     1560

aguggagauc acgcaacugc ugcuuuggaa gaacaacuua agguauuggg agaucgaugg     1620

gcaaacaucu guagauggac agaagaccgc uggguucuuu uacaagacau ccuucucaaa     1680

uggcaacguc uuacugaaga acagugccuu uuuagugcau ggcuuucaga aaaagaagau     1740

gcagugaaca agauucacac aacuggcuuu aaagaucaaa augaaauguu aucaagucuu     1800

caaaaacugg ccguuuuaaa agcggaucua gaaaagaaaa agcaauccau gggcaaacug     1860

uauucacuca aacaagaucu ucuuucaaca cugaagaaua agucagugac ccagaagacg     1920

gaagcauggc uggauaacuu ugcccggugu ugggauaauu uaguccaaaa acuugaaaag     1980

aguacagcac agauuucaca ggaaauuucu uaugugccuu cuacuuauuu gacugaaauc     2040

acucaugucu cacaagcccu auuagaagug gaacaacuuc ucaaugcucc ugaccucugu     2100

gcuaaggacu uugaagaccu cuuuaagcaa gaggagucuc ugaagaauau aaaagauagu     2160

cuacaacaaa gcucaggucg gauugacauu auucauagca agaagacagc agcauugcaa     2220

agugcaacgc cuguggaaag ggugaagcua caggaagcuc ucucccagcu ugauuuccaa     2280

ugggaaaaag uuaacaaaau guacaaggac cgacaagggc gauuugacag auccguugag     2340

aaauggcggc guuuucauua ugauauaaag auauuuaauc aguggcuaac agaagcugaa     2400

caguuucuca gaaagacaca aauuccugag aauugggaac augcuaaaua caaaugguau     2460

cuuaaggaac uccaggaugg cauugggcag cggcaaacug uugucagaac auugaaugca     2520

acuggggaag aaauaauuca gcaauccuca aaaacagaug ccaguauucu acaggaaaaa     2580

uugggaagcc ugaaucugcg guggcaggag gucugcaaac agcugucaga cagaaaaaag     2640

aggcuagaag aacaaaagaa uaucuuguca gaauuucaaa gagauuuaaa ugaauuuguu     2700

uuaugguugg aggaagcaga uaacauugcu aguaucccac uugaaccugg aaaagagcag     2760

caacuaaaag aaaagcuuga gcaagucaag uuacuggugg aagaguugcc ccugcgccag     2820

ggaauccuca aacaauuaaa ugaaacugga ggacccgugc uuguaagugc ucccauaagc     2880

ccagaagagc aagauaaacu ugaaaauaag cucaagcaga caaaucucca guggauaaag     2940

guuuccagag cuuuaccuga gaaacaagga gaaauugaag cucaaauaaa agaccuuggg     3000

cagcuugaaa aaaagcuuga agaccuugaa gagcaguuaa aucaucugcu gcugugguua     3060

ucuccuauua ggaaucaguu ggaaauuuau aaccaaccaa accaagaagg accauuugac     3120

guuaaggaaa cugaaauagc aguucaagcu aaacaaccgg auguggaaga gauuuugucu     3180

aaagggcagc auuuguacaa ggaaaaacca gccacucagc cagugaagag gaaguuagaa     3240

gaccuguccu cugaguggaa ggcgguaaac cguuuacuuc aagagcugag ggcaaagcag     3300

ccugaccuag cuccuggacu gaccacuauu ggagccucuc cuacucagac uguuacucug     3360

gugacacaac cugugguuac uaaggaaacu gccaucucca aacuagaaau gccaucuucc     3420

uugauguugg agguaccugc ucuggcagau uucaaccggg cuuggacaga acuuaccgac     3480

uggcuuucuc ugcuugauca aguuauaaaa ucacaacgcg ugaugguggg cgaccuugag     3540

gauaucaacg agaugaucau caagcagaag gcaacaaugc aggauuugga acagaggcgu     3600

ccccaguugg aagaacucau uaccgcugcc caaaauuuga aaaacaagac cagcaaucaa     3660

gaggcuagaa caaucauuac ggaucgaauu gaaagaauuc agaaucagug ggaugaagua     3720

caag                                                                  3724


<210>  130
<211>  3362
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Ct-miniDys (deltaH2-R15)

<400>  130
aacaccuuca gaaccggagg caacaguuga augaaauguu aaaggauuca acacaauggc       60

uggaagcuaa ggaagaagcu gagcaggucu uaggacaggc cagagccaag cuggagucau      120

ggaaggaggg ucccuauaca guagaugcaa uccaaaagaa aaucacagaa accaagcagu      180

uggccaaaga ccuccgccag uggcagacaa auguagaugu ggcaaaugac uuggcccuga      240

aacuucuccg ggauuauucu gcagaugaua ccagaaaagu ccacaugaua acagagaaua      300

ucaaugccuc uuggagaagc auucauaaaa gggugaguga gcgagaggcu gcuuuggaag      360

aaacucauag auuacugcaa caguuccccc uggaccugga aaaguuucuu gccuggcuua      420

cagaagcuga aacaacugcc aauguccuac aggaugcuac ccguaaggaa aggcuccuag      480

aagacuccaa gggaguaaaa gagcugauga aacaauggca agaccuccaa ggugaaauug      540

aagcucacac agauguuuau cacaaccugg augaaaacag ccaaaaaauc cugagauccc      600

uggaagguuc cgaugaugca guccuguuac aaagacguuu ggauaacaug aacuucaagu      660

ggagugaacu ucggaaaaag ucucucaaca uuagguccca uuuggaagcc aguucugacc      720

aguggaagcg ucugcaccuu ucucugcagg aacuucuggu guggcuacag cugaaagaug      780

augaauuaag ccggcaggca ccuauuggag gcgacuuucc agcaguucag aagcagaacg      840

augugcauag ggccuucaag agggaauuga aaacuaaaga accuguaauc augaguacuc      900

uugagacugu acgaauauuu cugacagagc agccuuugga aggacuagag aaacucuacc      960

aggagcccag agagcugccu ccugaggaga gagcccagaa ugucacucgg cuucuacgaa     1020

agcaggcuga ggaggucaau acugaguggg aaaaauugaa ccugcacucc gcugacuggc     1080

agagaaaaau agaugagacc cuugaaagac uccgggaacu ucaagaggcc acggaugagc     1140

uggaccucaa gcugcgccaa gcugagguga ucaagggauc cuggcagccc gugggcgauc     1200

uccucauuga cucucuccaa gaucaccugg agaaagucaa ggcacuucga ggagaaauug     1260

cgccucugaa agagaacgug agccacguca augaccuugc ucgccagcuu accacuuugg     1320

gcauucagcu cucaccguau aaccucagca cucuggaaga ccugaacacc agauggaagc     1380

uucugcaggu ggccgucgag gaccgaguca ggcagcugca ugaagcccac agggacuuug     1440

guccagcauc ucagcacuuu cuuuccacgu cuguccaggg ucccugggag agagccaucu     1500

cgccaaacaa agugcccuac uauaucaacc acgagacuca aacaacuugc ugggaccauc     1560

ccaaaaugac agagcucuac cagucuuuag cugaccugaa uaaugucaga uucucagcuu     1620

auaggacugc caugaaacuc cgaagacugc agaaggcccu uugcuuggau cucuugagcc     1680

ugucagcugc augugaugcc uuggaccagc acaaccucaa gcaaaaugac cagcccaugg     1740

auauccugca gauuauuaau uguuugacca cuauuuauga ccgccuggag caagagcaca     1800

acaauuuggu caacgucccu cucugcgugg auaugugucu gaacuggcug cugaauguuu     1860

augauacggg acgaacaggg aggauccgug uccugucuuu uaaaacuggc aucauuuccc     1920

uguguaaagc acauuuggaa gacaaguaca gauaccuuuu caagcaagug gcaaguucaa     1980

caggauuuug ugaccagcgc aggcugggcc uccuucugca ugauucuauc caaauuccaa     2040

gacaguuggg ugaaguugca uccuuugggg gcaguaacau ugagccaagu guccggagcu     2100

gcuuccaauu ugcuaauaau aagccagaga ucgaagcggc ccucuuccua gacuggauga     2160

gacuggaacc ccaguccaug guguggcugc ccguccugca cagaguggcu gcugcagaaa     2220

cugccaagca ucaggccaaa uguaacaucu gcaaagagug uccaaucauu ggauucaggu     2280

acaggagucu aaagcacuuu aauuaugaca ucugccaaag cugcuuuuuu ucuggucgag     2340

uugcaaaagg ccauaaaaug cacuauccca ugguggaaua uugcacuccg acuacaucag     2400

gagaagaugu ucgagacuuu gccaagguac uaaaaaacaa auuucgaacc aaaagguauu     2460

uugcgaagca uccccgaaug ggcuaccugc cagugcagac ugucuuagag ggggacaaca     2520

uggaaacucc cguuacucug aucaacuucu ggccaguaga uucugcgccu gccucguccc     2580

cucagcuuuc acacgaugau acucauucac gcauugaaca uuaugcuagc aggcuagcag     2640

aaauggaaaa cagcaaugga ucuuaucuaa augauagcau cucuccuaau gagagcauag     2700

augaugaaca uuuguuaauc cagcauuacu gccaaaguuu gaaccaggac uccccccuga     2760

gccagccucg uaguccugcc cagaucuuga uuuccuuaga gagugaggaa agaggggagc     2820

uagagagaau ccuagcagau cuugaggaag aaaacaggaa ucugcaagca gaauaugacc     2880

gucuaaagca gcagcacgaa cauaaaggcc uguccccacu gccguccccu ccugaaauga     2940

ugcccaccuc uccccagagu ccccgggaug cugagcucau ugcugaggcc aagcuacugc     3000

gucaacacaa aggccgccug gaagccagga ugcaaauccu ggaagaccac aauaaacagc     3060

uggagucaca guuacacagg cuaaggcagc ugcuggagca accccaggca gaggccaaag     3120

ugaauggcac aacggugucc ucuccuucua ccucucuaca gagguccgac agcagucagc     3180

cuaugcugcu ccgagugguu ggcagucaaa cuucggacuc caugggugag gaagaucuuc     3240

ucaguccucc ccaggacaca agcacagggu uagaggaggu gauggagcaa cucaacaacu     3300

ccuucccuag uucaagagga agaaauaccc cuggaaagcc aaugagagag gacacaaugu     3360

aa                                                                    3362


