                         SEQUENCE LISTING

<110>  Ramot at Tel-Aviv University Ltd.
       PRAG, Gali
 
<120>  CHLORAMPHENICOL RESISTANT SPLIT PROTEIN

<130>  74377

<150>  US 62/542,333
<151>  2017-08-08

<160>  11    

<170>  PatentIn version 3.5

<210>  1
<211>  219
<212>  PRT
<213>  Escherichia coli

<400>  1

Met Glu Lys Lys Ile Thr Gly Tyr Thr Thr Val Asp Ile Ser Gln Trp 
1               5                   10                  15      


His Arg Lys Glu His Phe Glu Ala Phe Gln Ser Val Ala Gln Cys Thr 
            20                  25                  30          


Tyr Asn Gln Thr Val Gln Leu Asp Ile Thr Ala Phe Leu Lys Thr Val 
        35                  40                  45              


Lys Lys Asn Lys His Lys Phe Tyr Pro Ala Phe Ile His Ile Leu Ala 
    50                  55                  60                  


Arg Leu Met Asn Ala His Pro Glu Phe Arg Met Ala Met Lys Asp Gly 
65                  70                  75                  80  


Glu Leu Val Ile Trp Asp Ser Val His Pro Cys Tyr Thr Val Phe His 
                85                  90                  95      


Glu Gln Thr Glu Thr Phe Ser Ser Leu Trp Ser Glu Tyr His Asp Asp 
            100                 105                 110         


Phe Arg Gln Phe Leu His Ile Tyr Ser Gln Asp Val Ala Cys Tyr Gly 
        115                 120                 125             


Glu Asn Leu Ala Tyr Phe Pro Lys Gly Phe Ile Glu Asn Met Phe Phe 
    130                 135                 140                 


Val Ser Ala Asn Pro Trp Val Ser Phe Thr Ser Phe Asp Leu Asn Val 
145                 150                 155                 160 


Ala Asn Met Asp Asn Phe Phe Ala Pro Val Phe Thr Met Gly Lys Tyr 
                165                 170                 175     


Tyr Thr Gln Gly Asp Lys Val Leu Met Pro Leu Ala Ile Gln Val His 
            180                 185                 190         


His Ala Val Cys Asp Gly Phe His Val Gly Arg Met Leu Asn Glu Leu 
        195                 200                 205             


Gln Gln Tyr Cys Asp Glu Trp Gln Gly Gly Ala 
    210                 215                 


<210>  2
<211>  30
<212>  PRT
<213>  Artificial sequence

<220>
<223>  CAT N terminal protein sequence

<400>  2

Met Glu Lys Lys Ile Thr Gly Tyr Thr Thr Val Asp Ile Ser Gln Trp 
1               5                   10                  15      


His Arg Lys Glu His Phe Glu Ala Phe Gln Ser Val Ala Gln 
            20                  25                  30  


<210>  3
<211>  190
<212>  PRT
<213>  Artificial sequence

<220>
<223>  CAT C  terminal protein sequence

<400>  3

Met Cys Thr Tyr Asn Gln Thr Val Gln Leu Asp Ile Thr Ala Phe Leu 
1               5                   10                  15      


Lys Thr Val Lys Lys Asn Lys His Lys Phe Tyr Pro Ala Phe Ile His 
            20                  25                  30          


Ile Leu Ala Arg Leu Met Asn Ala His Pro Glu Phe Arg Met Ala Met 
        35                  40                  45              


Lys Asp Gly Glu Leu Val Ile Trp Asp Ser Val His Pro Cys Tyr Thr 
    50                  55                  60                  


Val Phe His Glu Gln Thr Glu Thr Phe Ser Ser Leu Trp Ser Glu Tyr 
65                  70                  75                  80  


His Asp Asp Phe Arg Gln Phe Leu His Ile Tyr Ser Gln Asp Val Ala 
                85                  90                  95      


Cys Tyr Gly Glu Asn Leu Ala Tyr Phe Pro Lys Gly Phe Ile Glu Asn 
            100                 105                 110         


Met Phe Phe Val Ser Ala Asn Pro Trp Val Ser Phe Thr Ser Phe Asp 
        115                 120                 125             


Leu Asn Val Ala Asn Met Asp Asn Phe Phe Ala Pro Val Phe Thr Met 
    130                 135                 140                 


Gly Lys Tyr Tyr Thr Gln Gly Asp Lys Val Leu Met Pro Leu Ala Ile 
145                 150                 155                 160 


Gln Val His His Ala Val Cys Asp Gly Phe His Val Gly Arg Met Leu 
                165                 170                 175     


Asn Glu Leu Gln Gln Tyr Cys Asp Glu Trp Gln Gly Gly Ala 
            180                 185                 190 


<210>  4
<211>  93
<212>  DNA
<213>  Artificial sequence

<220>
<223>  CAT N terminal DNA sequence

<400>  4
atggagaaaa aaatcactgg atataccacc gttgatatat cccaatggca tcgtaaagaa       60

cattttgagg catttcagtc agttgctcaa taa                                    93


<210>  5
<211>  573
<212>  DNA
<213>  Artificial sequence

<220>
<223>  CAT C terminal DNA sequence

<400>  5
atgtgtacct ataaccagac cgttcagctg gatattacgg cctttttaaa gaccgtaaag       60

aaaaataagc acaagtttta tccggccttt attcacattc ttgcccgcct gatgaatgct      120

catccggaat tccgtatggc aatgaaagac ggtgagctgg tgatatggga tagtgttcac      180

ccttgttaca ccgttttcca tgagcaaact gaaacgtttt catcgctctg gagtgaatac      240

cacgacgatt tccggcagtt tctacacata tattcgcaag atgtggcgtg ttacggtgaa      300

aacctggcct atttccctaa agggtttatt gagaatatgt ttttcgtctc agccaatccc      360

tgggtgagtt tcaccagttt tgatttaaac gtggccaata tggacaactt cttcgccccc      420

gttttcacca tgggcaaata ttatacgcaa ggcgacaagg tgctgatgcc gctggcgatt      480

caggttcatc atgccgtctg tgatggcttc catgtcggca gaatgcttaa tgaattacaa      540

cagtactgcg atgagtggca gggcggggcg taa                                   573


<210>  6
<211>  28
<212>  PRT
<213>  Artificial sequence

<220>
<223>  CAT N terminal protein sequence

<400>  6

Met Glu Lys Lys Ile Thr Gly Tyr Thr Thr Val Asp Ile Ser Gln Trp 
1               5                   10                  15      


His Arg Lys Glu His Phe Glu Ala Phe Gln Ser Val 
            20                  25              


<210>  7
<211>  192
<212>  PRT
<213>  Artificial sequence

<220>
<223>  CAT C  terminal protein sequence

<400>  7

Met Ala Gln Cys Thr Tyr Asn Gln Thr Val Gln Leu Asp Ile Thr Ala 
1               5                   10                  15      


Phe Leu Lys Thr Val Lys Lys Asn Lys His Lys Phe Tyr Pro Ala Phe 
            20                  25                  30          


Ile His Ile Leu Ala Arg Leu Met Asn Ala His Pro Glu Phe Arg Met 
        35                  40                  45              


Ala Met Lys Asp Gly Glu Leu Val Ile Trp Asp Ser Val His Pro Cys 
    50                  55                  60                  


Tyr Thr Val Phe His Glu Gln Thr Glu Thr Phe Ser Ser Leu Trp Ser 
65                  70                  75                  80  


Glu Tyr His Asp Asp Phe Arg Gln Phe Leu His Ile Tyr Ser Gln Asp 
                85                  90                  95      


Val Ala Cys Tyr Gly Glu Asn Leu Ala Tyr Phe Pro Lys Gly Phe Ile 
            100                 105                 110         


Glu Asn Met Phe Phe Val Ser Ala Asn Pro Trp Val Ser Phe Thr Ser 
        115                 120                 125             


Phe Asp Leu Asn Val Ala Asn Met Asp Asn Phe Phe Ala Pro Val Phe 
    130                 135                 140                 


Thr Met Gly Lys Tyr Tyr Thr Gln Gly Asp Lys Val Leu Met Pro Leu 
145                 150                 155                 160 


Ala Ile Gln Val His His Ala Val Cys Asp Gly Phe His Val Gly Arg 
                165                 170                 175     


Met Leu Asn Glu Leu Gln Gln Tyr Cys Asp Glu Trp Gln Gly Gly Ala 
            180                 185                 190         


<210>  8
<211>  660
<212>  DNA
<213>  Escherichia coli

<400>  8
atggagaaaa aaatcactgg atataccacc gttgatatat cccaatggca tcgtaaagaa       60

cattttgagg catttcagtc agttgctcaa tgtacctata accagaccgt tcagctggat      120

attacggcct ttttaaagac cgtaaagaaa aataagcaca agttttatcc ggcctttatt      180

cacattcttg cccgcctgat gaatgctcat ccggaattcc gtatggcaat gaaagacggt      240

gagctggtga tatgggatag tgttcaccct tgttacaccg ttttccatga gcaaactgaa      300

acgttttcat cgctctggag tgaataccac gacgatttcc ggcagtttct acacatatat      360

tcgcaagatg tggcgtgtta cggtgaaaac ctggcctatt tccctaaagg gtttattgag      420

aatatgtttt tcgtctcagc caatccctgg gtgagtttca ccagttttga tttaaacgtg      480

gccaatatgg acaacttctt cgcccccgtt ttcaccatgg gcaaatatta tacgcaaggc      540

gacaaggtgc tgatgccgct ggcgattcag gttcatcatg ccgtctgtga tggcttccat      600

gtcggcagaa tgcttaatga attacaacag tactgcgatg agtggcaggg cggggcgtaa      660


<210>  9
<211>  76
<212>  PRT
<213>  Artificial sequence

<220>
<223>  mammalian ubiquitin protein sequence

<400>  9

Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu 
1               5                   10                  15      


Val Glu Pro Ser Asp Thr Ile Glu Asn Val Lys Ala Lys Ile Gln Asp 
            20                  25                  30          


Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys 
        35                  40                  45              


Gln Leu Glu Asp Gly Arg Thr Leu Ser Asp Tyr Asn Ile Gln Lys Glu 
    50                  55                  60                  


Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly 
65                  70                  75      


<210>  10
<211>  75
<212>  PRT
<213>  Artificial sequence

<220>
<223>  yeast  ubiquitin protein sequence

<400>  10

Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu 
1               5                   10                  15      


Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ser Lys Ile Gln Asp 
            20                  25                  30          


Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys 
        35                  40                  45              


Gln Leu Glu Asp Gly Arg Leu Ser Asp Tyr Asn Ile Gln Lys Glu Ser 
    50                  55                  60                  


Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly 
65                  70                  75  


<210>  11
<211>  18
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Amino acid sequence of the active site of CAT

<400>  11

Ala Phe Gln Ser Val Ala Gln Cys Thr Tyr Asn Gln Thr Val Gln Leu 
1               5                   10                  15      


Asp Ile 
        


