                         SEQUENCE LISTING

<110>  The Hospital For Sick Children
 
<120>  RAS-TARGETED THERAPEUTIC

<130>  PAT 83002AW-90

<140>  To be assigned
<141>  2018-11-29

<150>  US 15/827,595
<151>  2017-11-30

<160>  30    

<170>  PatentIn version 3.5

<210>  1
<211>  199
<212>  PRT
<213>  Corynebacterium diphtheriae


<220>
<221>  MISC_FEATURE
<223>  dtA Domain

<400>  1

Gly Ala Asp Asp Val Val Asp Ser Ser Lys Ser Phe Val Met Glu Asn 
1               5                   10                  15      


Phe Ser Ser Tyr His Gly Thr Lys Pro Gly Tyr Val Asp Ser Ile Gln 
            20                  25                  30          


Lys Gly Ile Gln Lys Pro Lys Ser Gly Thr Gln Gly Asn Tyr Asp Asp 
        35                  40                  45              


Asp Trp Lys Gly Phe Tyr Ser Thr Asp Asn Lys Tyr Asp Ala Ala Gly 
    50                  55                  60                  


Tyr Ser Val Asp Asn Glu Asn Pro Leu Ser Gly Lys Ala Gly Gly Val 
65                  70                  75                  80  


Val Lys Val Thr Tyr Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val 
                85                  90                  95      


Asp Asn Ala Glu Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu 
            100                 105                 110         


Pro Leu Met Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly 
        115                 120                 125             


Asp Gly Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser 
    130                 135                 140                 


Ser Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser 
145                 150                 155                 160 


Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln Asp 
                165                 170                 175     


Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg Val Arg 
            180                 185                 190         


Arg Ser Val Gly Ser Ser Leu 
        195                 


<210>  2
<211>  199
<212>  PRT
<213>  Corynebacterium diphtheriae


<220>
<221>  MISC_FEATURE
<223>  dtA Domain (K51E, E148K)

<400>  2

Gly Ala Asp Asp Val Val Asp Ser Ser Lys Ser Phe Val Met Glu Asn 
1               5                   10                  15      


Phe Ser Ser Tyr His Gly Thr Lys Pro Gly Tyr Val Asp Ser Ile Gln 
            20                  25                  30          


Lys Gly Ile Gln Lys Pro Lys Ser Gly Thr Gln Gly Asn Tyr Asp Asp 
        35                  40                  45              


Asp Trp Glu Gly Phe Tyr Ser Thr Asp Asn Lys Tyr Asp Ala Ala Gly 
    50                  55                  60                  


Tyr Ser Val Asp Asn Glu Asn Pro Leu Ser Gly Lys Ala Gly Gly Val 
65                  70                  75                  80  


Val Lys Val Thr Tyr Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val 
                85                  90                  95      


Asp Asn Ala Glu Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu 
            100                 105                 110         


Pro Leu Met Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly 
        115                 120                 125             


Asp Gly Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser 
    130                 135                 140                 


Ser Ser Val Lys Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser 
145                 150                 155                 160 


Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln Asp 
                165                 170                 175     


Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg Val Arg 
            180                 185                 190         


Arg Ser Val Gly Ser Ser Leu 
        195                 


<210>  3
<211>  339
<212>  PRT
<213>  Corynebacterium diphtheriae


<220>
<221>  MISC_FEATURE
<223>  dtB Domain

<400>  3

Ser Cys Ile Asn Leu Asp Trp Asp Val Ile Arg Asp Lys Thr Lys Thr 
1               5                   10                  15      


Lys Ile Glu Ser Leu Lys Glu His Gly Pro Ile Lys Asn Lys Met Ser 
            20                  25                  30          


Glu Ser Pro Asn Lys Thr Val Ser Glu Glu Lys Ala Lys Gln Tyr Leu 
        35                  40                  45              


Glu Glu Phe His Gln Thr Ala Leu Glu His Pro Glu Leu Ser Glu Leu 
    50                  55                  60                  


Lys Thr Val Thr Gly Thr Asn Pro Val Phe Ala Gly Ala Asn Tyr Ala 
65                  70                  75                  80  


Ala Trp Ala Val Asn Val Ala Gln Val Ile Asp Ser Glu Thr Ala Asp 
                85                  90                  95      


Asn Leu Glu Lys Thr Thr Ala Ala Leu Ser Ile Leu Pro Gly Ile Gly 
            100                 105                 110         


Ser Val Met Gly Ile Ala Asp Gly Ala Val His His Asn Thr Glu Glu 
        115                 120                 125             


Ile Val Ala Gln Ser Ile Ala Leu Ser Ser Leu Met Val Ala Gln Ala 
    130                 135                 140                 


Ile Pro Leu Val Gly Glu Leu Val Asp Ile Gly Phe Ala Ala Tyr Asn 
145                 150                 155                 160 


Phe Val Glu Ser Ile Ile Asn Leu Phe Gln Val Val His Asn Ser Tyr 
                165                 170                 175     


Asn Arg Pro Ala Tyr Ser Pro Gly His Lys Thr Gln Pro Phe Leu His 
            180                 185                 190         


Asp Gly Tyr Ala Val Ser Trp Asn Thr Val Glu Asp Ser Ile Ile Arg 
        195                 200                 205             


Thr Gly Phe Gln Gly Glu Ser Gly His Asp Ile Lys Ile Thr Ala Glu 
    210                 215                 220                 


Asn Thr Pro Leu Pro Ile Ala Gly Val Leu Leu Pro Thr Ile Pro Gly 
225                 230                 235                 240 


Lys Leu Asp Val Asn Lys Ser Lys Thr His Ile Ser Val Asn Gly Arg 
                245                 250                 255     


Lys Ile Arg Met Arg Cys Arg Ala Ile Asp Gly Asp Val Thr Phe Cys 
            260                 265                 270         


Arg Pro Lys Ser Pro Val Tyr Val Gly Asn Gly Val His Ala Asn Leu 
        275                 280                 285             


His Val Ala Phe His Arg Ser Ser Ser Glu Lys Ile His Ser Asn Glu 
    290                 295                 300                 


Ile Ser Ser Asp Ser Ile Gly Val Leu Gly Tyr Gln Lys Thr Val Asp 
305                 310                 315                 320 


His Thr Lys Val Asn Ser Lys Leu Ser Leu Phe Phe Glu Ile Lys Ser 
                325                 330                 335     


Arg Gln Ala 
            


<210>  4
<211>  179
<212>  PRT
<213>  Corynebacterium diphtheriae


<220>
<221>  MISC_FEATURE
<223>  dtT (dtB Translocation Domain)

<400>  4

Ser Cys Ile Asn Leu Asp Trp Asp Val Ile Arg Asp Lys Thr Lys Thr 
1               5                   10                  15      


Lys Ile Glu Ser Leu Lys Glu His Gly Pro Ile Lys Asn Lys Met Ser 
            20                  25                  30          


Glu Ser Pro Asn Lys Thr Val Ser Glu Glu Lys Ala Lys Gln Tyr Leu 
        35                  40                  45              


Glu Glu Phe His Gln Thr Ala Leu Glu His Pro Glu Leu Ser Glu Leu 
    50                  55                  60                  


Lys Thr Val Thr Gly Thr Asn Pro Val Phe Ala Gly Ala Asn Tyr Ala 
65                  70                  75                  80  


Ala Trp Ala Val Asn Val Ala Gln Val Ile Asp Ser Glu Thr Ala Asp 
                85                  90                  95      


Asn Leu Glu Lys Thr Thr Ala Ala Leu Ser Ile Leu Pro Gly Ile Gly 
            100                 105                 110         


Ser Val Met Gly Ile Ala Asp Gly Ala Val His His Asn Thr Glu Glu 
        115                 120                 125             


Ile Val Ala Gln Ser Ile Ala Leu Ser Ser Leu Met Val Ala Gln Ala 
    130                 135                 140                 


Ile Pro Leu Val Gly Glu Leu Val Asp Ile Gly Phe Ala Ala Tyr Asn 
145                 150                 155                 160 


Phe Val Glu Ser Ile Ile Asn Leu Phe Gln Val Val His Asn Ser Tyr 
                165                 170                 175     


Asn Arg Pro 
            


<210>  5
<211>  179
<212>  PRT
<213>  Corynebacterium diphtheriae


<220>
<221>  MISC_FEATURE
<223>  Translocation-deficient dtT (L350K)

<400>  5

Ser Cys Ile Asn Leu Asp Trp Asp Val Ile Arg Asp Lys Thr Lys Thr 
1               5                   10                  15      


Lys Ile Glu Ser Leu Lys Glu His Gly Pro Ile Lys Asn Lys Met Ser 
            20                  25                  30          


Glu Ser Pro Asn Lys Thr Val Ser Glu Glu Lys Ala Lys Gln Tyr Leu 
        35                  40                  45              


Glu Glu Phe His Gln Thr Ala Leu Glu His Pro Glu Leu Ser Glu Leu 
    50                  55                  60                  


Lys Thr Val Thr Gly Thr Asn Pro Val Phe Ala Gly Ala Asn Tyr Ala 
65                  70                  75                  80  


Ala Trp Ala Val Asn Val Ala Gln Val Ile Asp Ser Glu Thr Ala Asp 
                85                  90                  95      


Asn Leu Glu Lys Thr Thr Ala Ala Leu Ser Ile Leu Pro Gly Ile Gly 
            100                 105                 110         


Ser Val Met Gly Ile Ala Asp Gly Ala Val His His Asn Thr Glu Glu 
        115                 120                 125             


Ile Val Ala Gln Ser Ile Ala Leu Ser Ser Leu Met Val Ala Gln Ala 
    130                 135                 140                 


Ile Pro Leu Val Gly Glu Lys Val Asp Ile Gly Phe Ala Ala Tyr Asn 
145                 150                 155                 160 


Phe Val Glu Ser Ile Ile Asn Leu Phe Gln Val Val His Asn Ser Tyr 
                165                 170                 175     


Asn Arg Pro 
            


<210>  6
<211>  160
<212>  PRT
<213>  Corynebacterium diphtheriae


<220>
<221>  MISC_FEATURE
<223>  dtR (dtB Receptor-binding Domain)

<400>  6

Ala Tyr Ser Pro Gly His Lys Thr Gln Pro Phe Leu His Asp Gly Tyr 
1               5                   10                  15      


Ala Val Ser Trp Asn Thr Val Glu Asp Ser Ile Ile Arg Thr Gly Phe 
            20                  25                  30          


Gln Gly Glu Ser Gly His Asp Ile Lys Ile Thr Ala Glu Asn Thr Pro 
        35                  40                  45              


Leu Pro Ile Ala Gly Val Leu Leu Pro Thr Ile Pro Gly Lys Leu Asp 
    50                  55                  60                  


Val Asn Lys Ser Lys Thr His Ile Ser Val Asn Gly Arg Lys Ile Arg 
65                  70                  75                  80  


Met Arg Cys Arg Ala Ile Asp Gly Asp Val Thr Phe Cys Arg Pro Lys 
                85                  90                  95      


Ser Pro Val Tyr Val Gly Asn Gly Val His Ala Asn Leu His Val Ala 
            100                 105                 110         


Phe His Arg Ser Ser Ser Glu Lys Ile His Ser Asn Glu Ile Ser Ser 
        115                 120                 125             


Asp Ser Ile Gly Val Leu Gly Tyr Gln Lys Thr Val Asp His Thr Lys 
    130                 135                 140                 


Val Asn Ser Lys Leu Ser Leu Phe Phe Glu Ile Lys Ser Arg Gln Ala 
145                 150                 155                 160 


<210>  7
<211>  119
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Polyhistidine-SUMO

<400>  7

Met Gly Ser Ser His His His His His His Gly Ser Gly Leu Val Pro 
1               5                   10                  15      


Arg Gly Ser Ala Ser Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys 
            20                  25                  30          


Pro Glu Val Lys Pro Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys 
        35                  40                  45              


Val Ser Asp Gly Ser Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr 
    50                  55                  60                  


Pro Leu Arg Arg Leu Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu 
65                  70                  75                  80  


Met Asp Ser Leu Arg Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp 
                85                  90                  95      


Gln Thr Pro Glu Asp Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala 
            100                 105                 110         


His Arg Glu Gln Ile Gly Gly 
        115                 


<210>  8
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  MYC tag

<400>  8

Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu 
1               5                   10  


<210>  9
<211>  9
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SV40 nuclear localization sequence (NLS)

<400>  9

Ser Pro Pro Lys Lys Lys Arg Lys Val 
1               5                   


<210>  10
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  (G4S) linker

<400>  10

Gly Gly Gly Gly Ser 
1               5   


<210>  11
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  (G4S)2 linker

<400>  11

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
1               5                   10  


<210>  12
<211>  15
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  (G4S)3 linker

<400>  12

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
1               5                   10                  15  


<210>  13
<211>  238
<212>  PRT
<213>  Aequorea victoria


<220>
<221>  MISC_FEATURE
<223>  Enhanced Green Fluorescent Protein (eGFP)

<400>  13

Gly Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1               5                   10                  15      


Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
            20                  25                  30          


Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
        35                  40                  45              


Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
    50                  55                  60                  


Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65                  70                  75                  80  


His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
                85                  90                  95      


Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
            100                 105                 110         


Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
        115                 120                 125             


Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
    130                 135                 140                 


Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
145                 150                 155                 160 


Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
                165                 170                 175     


Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
            180                 185                 190         


Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
        195                 200                 205             


Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
    210                 215                 220                 


Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225                 230                 235             


<210>  14
<211>  235
<212>  PRT
<213>  Discosoma sp. (also: Actinodiscus or mushroom coral)


<220>
<221>  MISC_FEATURE
<223>  Monomeric Cherry (mCherry)

<400>  14

Gly Ser Lys Gly Glu Glu Asp Asn Met Ala Ile Ile Lys Glu Phe Met 
1               5                   10                  15      


Arg Phe Lys Val His Met Glu Gly Ser Val Asn Gly His Glu Phe Glu 
            20                  25                  30          


Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala 
        35                  40                  45              


Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile 
    50                  55                  60                  


Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala Tyr Val Lys His Pro 
65                  70                  75                  80  


Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe Lys 
                85                  90                  95      


Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr 
            100                 105                 110         


Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys Leu 
        115                 120                 125             


Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr 
    130                 135                 140                 


Met Gly Trp Glu Ala Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly Ala 
145                 150                 155                 160 


Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly His 
                165                 170                 175     


Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln 
            180                 185                 190         


Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp Ile Thr Ser His 
        195                 200                 205             


Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly Arg 
    210                 215                 220                 


His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys 
225                 230                 235 


<210>  15
<211>  493
<212>  PRT
<213>  Bacillus megaterium


<220>
<221>  MISC_FEATURE
<223>  alpha-amylase (B. megaterium)

<400>  15

Gly His Lys Gly Lys Ser Pro Thr Ala Asp Lys Asn Gly Val Phe Tyr 
1               5                   10                  15      


Glu Val Tyr Val Asn Ser Phe Tyr Asp Ala Asn Lys Asp Gly His Gly 
            20                  25                  30          


Asp Leu Lys Gly Leu Thr Gln Lys Leu Asp Tyr Leu Asn Asp Gly Asn 
        35                  40                  45              


Ser His Thr Lys Asn Asp Leu Gln Val Asn Gly Ile Trp Met Met Pro 
    50                  55                  60                  


Val Asn Pro Ser Pro Ser Tyr His Lys Tyr Asp Val Thr Asp Tyr Tyr 
65                  70                  75                  80  


Asn Ile Asp Pro Gln Tyr Gly Asn Leu Gln Asp Phe Arg Lys Leu Met 
                85                  90                  95      


Lys Glu Ala Asp Lys Arg Asp Val Lys Val Ile Met Asp Leu Val Val 
            100                 105                 110         


Asn His Thr Ser Ser Glu His Pro Trp Phe Gln Ala Ala Leu Lys Asp 
        115                 120                 125             


Lys Asn Ser Lys Tyr Arg Asp Tyr Tyr Ile Trp Ala Asp Lys Asn Thr 
    130                 135                 140                 


Asp Leu Asn Glu Lys Gly Ser Trp Gly Gln Gln Val Trp His Lys Ala 
145                 150                 155                 160 


Pro Asn Gly Glu Tyr Phe Tyr Gly Thr Phe Trp Glu Gly Met Pro Asp 
                165                 170                 175     


Leu Asn Tyr Asp Asn Pro Glu Val Arg Lys Glu Met Ile Asn Val Gly 
            180                 185                 190         


Lys Phe Trp Leu Asn Gln Gly Val Asp Gly Phe Arg Leu Asp Ala Ala 
        195                 200                 205             


Leu His Ile Phe Lys Gly Gln Thr Pro Glu Gly Ala Lys Lys Asn Ile 
    210                 215                 220                 


Leu Trp Trp Asn Glu Phe Arg Asp Ala Met Lys Lys Glu Asn Pro Asn 
225                 230                 235                 240 


Val Tyr Leu Thr Gly Glu Val Trp Asp Gln Pro Glu Val Val Ala Pro 
                245                 250                 255     


Tyr Tyr Gln Ser Leu Asp Ser Leu Phe Asn Phe Asp Leu Ala Gly Lys 
            260                 265                 270         


Ile Val Ser Ser Val Lys Ala Gly Asn Asp Gln Gly Ile Ala Thr Ala 
        275                 280                 285             


Ala Ala Ala Thr Asp Glu Leu Phe Lys Ser Tyr Asn Pro Asn Lys Ile 
    290                 295                 300                 


Asp Gly Ile Phe Leu Thr Asn His Asp Gln Asn Arg Val Met Ser Glu 
305                 310                 315                 320 


Leu Ser Gly Asp Val Asn Lys Ala Lys Ser Ala Ala Ser Ile Leu Leu 
                325                 330                 335     


Thr Leu Pro Gly Asn Pro Tyr Ile Tyr Tyr Gly Glu Glu Ile Gly Met 
            340                 345                 350         


Thr Gly Glu Lys Pro Asp Glu Leu Ile Arg Glu Pro Phe Arg Trp Tyr 
        355                 360                 365             


Glu Gly Asn Gly Leu Gly Gln Thr Ser Trp Glu Thr Pro Ile Tyr Asn 
    370                 375                 380                 


Lys Gly Gly Asn Gly Val Ser Ile Glu Ala Gln Thr Lys Gln Lys Asp 
385                 390                 395                 400 


Ser Leu Leu Asn His Tyr Arg Glu Met Ile Arg Val Arg Gln Gln His 
                405                 410                 415     


Glu Glu Leu Val Lys Gly Thr Leu Gln Ser Ile Ser Leu Asp Gln Lys 
            420                 425                 430         


Glu Val Val Ala Tyr Ser Arg Thr Tyr Lys Gly Lys Ser Ile Ser Val 
        435                 440                 445             


Tyr His Asn Ile Ser Asn Gln Pro Ile Lys Val Ser Val Ala Ala Lys 
    450                 455                 460                 


Gly Lys Leu Ile Phe Ser Ser Glu Lys Gly Val Lys Lys Val Lys Asn 
465                 470                 475                 480 


Gln Leu Val Ile Pro Ala Asn Thr Thr Ile Leu Ile Lys 
                485                 490             


<210>  16
<211>  497
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  MeCP2 (e1 isoform)

<400>  16

Ala Ala Ala Ala Ala Ala Ala Pro Ser Gly Gly Gly Gly Gly Gly Glu 
1               5                   10                  15      


Glu Glu Arg Leu Glu Glu Lys Ser Glu Asp Gln Asp Leu Gln Gly Leu 
            20                  25                  30          


Lys Asp Lys Pro Leu Lys Phe Lys Lys Val Lys Lys Asp Lys Lys Glu 
        35                  40                  45              


Glu Lys Glu Gly Lys His Glu Pro Val Gln Pro Ser Ala His His Ser 
    50                  55                  60                  


Ala Glu Pro Ala Glu Ala Gly Lys Ala Glu Thr Ser Glu Gly Ser Gly 
65                  70                  75                  80  


Ser Ala Pro Ala Val Pro Glu Ala Ser Ala Ser Pro Lys Gln Arg Arg 
                85                  90                  95      


Ser Ile Ile Arg Asp Arg Gly Pro Met Tyr Asp Asp Pro Thr Leu Pro 
            100                 105                 110         


Glu Gly Trp Thr Arg Lys Leu Lys Gln Arg Lys Ser Gly Arg Ser Ala 
        115                 120                 125             


Gly Lys Tyr Asp Val Tyr Leu Ile Asn Pro Gln Gly Lys Ala Phe Arg 
    130                 135                 140                 


Ser Lys Val Glu Leu Ile Ala Tyr Phe Glu Lys Val Gly Asp Thr Ser 
145                 150                 155                 160 


Leu Asp Pro Asn Asp Phe Asp Phe Thr Val Thr Gly Arg Gly Ser Pro 
                165                 170                 175     


Ser Arg Arg Glu Gln Lys Pro Pro Lys Lys Pro Lys Ser Pro Lys Ala 
            180                 185                 190         


Pro Gly Thr Gly Arg Gly Arg Gly Arg Pro Lys Gly Ser Gly Thr Thr 
        195                 200                 205             


Arg Pro Lys Ala Ala Thr Ser Glu Gly Val Gln Val Lys Arg Val Leu 
    210                 215                 220                 


Glu Lys Ser Pro Gly Lys Leu Leu Val Lys Met Pro Phe Gln Thr Ser 
225                 230                 235                 240 


Pro Gly Gly Lys Ala Glu Gly Gly Gly Ala Thr Thr Ser Thr Gln Val 
                245                 250                 255     


Met Val Ile Lys Arg Pro Gly Arg Lys Arg Lys Ala Glu Ala Asp Pro 
            260                 265                 270         


Gln Ala Ile Pro Lys Lys Arg Gly Arg Lys Pro Gly Ser Val Val Ala 
        275                 280                 285             


Ala Ala Ala Ala Glu Ala Lys Lys Lys Ala Val Lys Glu Ser Ser Ile 
    290                 295                 300                 


Arg Ser Val Gln Glu Thr Val Leu Pro Ile Lys Lys Arg Lys Thr Arg 
305                 310                 315                 320 


Glu Thr Val Ser Ile Glu Val Lys Glu Val Val Lys Pro Leu Leu Val 
                325                 330                 335     


Ser Thr Leu Gly Glu Lys Ser Gly Lys Gly Leu Lys Thr Cys Lys Ser 
            340                 345                 350         


Pro Gly Arg Lys Ser Lys Glu Ser Ser Pro Lys Gly Arg Ser Ser Ser 
        355                 360                 365             


Ala Ser Ser Pro Pro Lys Lys Glu His His His His His His His Ser 
    370                 375                 380                 


Glu Ser Pro Lys Ala Pro Val Pro Leu Leu Pro Pro Leu Pro Pro Pro 
385                 390                 395                 400 


Pro Pro Glu Pro Glu Ser Ser Glu Asp Pro Thr Ser Pro Pro Glu Pro 
                405                 410                 415     


Gln Asp Leu Ser Ser Ser Val Cys Lys Glu Glu Lys Met Pro Arg Gly 
            420                 425                 430         


Gly Ser Leu Glu Ser Asp Gly Cys Pro Lys Glu Pro Ala Lys Thr Gln 
        435                 440                 445             


Pro Ala Val Ala Thr Ala Ala Thr Ala Ala Glu Lys Tyr Lys His Arg 
    450                 455                 460                 


Gly Glu Gly Glu Arg Lys Asp Ile Val Ser Ser Ser Met Pro Arg Pro 
465                 470                 475                 480 


Asn Arg Glu Glu Pro Val Asp Ser Arg Thr Pro Val Thr Glu Arg Val 
                485                 490                 495     


Ser 
    


<210>  17
<211>  485
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  MeCP2 (e2 isoform)

<400>  17

Val Ala Gly Met Leu Gly Leu Arg Glu Glu Lys Ser Glu Asp Gln Asp 
1               5                   10                  15      


Leu Gln Gly Leu Lys Asp Lys Pro Leu Lys Phe Lys Lys Val Lys Lys 
            20                  25                  30          


Asp Lys Lys Glu Glu Lys Glu Gly Lys His Glu Pro Val Gln Pro Ser 
        35                  40                  45              


Ala His His Ser Ala Glu Pro Ala Glu Ala Gly Lys Ala Glu Thr Ser 
    50                  55                  60                  


Glu Gly Ser Gly Ser Ala Pro Ala Val Pro Glu Ala Ser Ala Ser Pro 
65                  70                  75                  80  


Lys Gln Arg Arg Ser Ile Ile Arg Asp Arg Gly Pro Met Tyr Asp Asp 
                85                  90                  95      


Pro Thr Leu Pro Glu Gly Trp Thr Arg Lys Leu Lys Gln Arg Lys Ser 
            100                 105                 110         


Gly Arg Ser Ala Gly Lys Tyr Asp Val Tyr Leu Ile Asn Pro Gln Gly 
        115                 120                 125             


Lys Ala Phe Arg Ser Lys Val Glu Leu Ile Ala Tyr Phe Glu Lys Val 
    130                 135                 140                 


Gly Asp Thr Ser Leu Asp Pro Asn Asp Phe Asp Phe Thr Val Thr Gly 
145                 150                 155                 160 


Arg Gly Ser Pro Ser Arg Arg Glu Gln Lys Pro Pro Lys Lys Pro Lys 
                165                 170                 175     


Ser Pro Lys Ala Pro Gly Thr Gly Arg Gly Arg Gly Arg Pro Lys Gly 
            180                 185                 190         


Ser Gly Thr Thr Arg Pro Lys Ala Ala Thr Ser Glu Gly Val Gln Val 
        195                 200                 205             


Lys Arg Val Leu Glu Lys Ser Pro Gly Lys Leu Leu Val Lys Met Pro 
    210                 215                 220                 


Phe Gln Thr Ser Pro Gly Gly Lys Ala Glu Gly Gly Gly Ala Thr Thr 
225                 230                 235                 240 


Ser Thr Gln Val Met Val Ile Lys Arg Pro Gly Arg Lys Arg Lys Ala 
                245                 250                 255     


Glu Ala Asp Pro Gln Ala Ile Pro Lys Lys Arg Gly Arg Lys Pro Gly 
            260                 265                 270         


Ser Val Val Ala Ala Ala Ala Ala Glu Ala Lys Lys Lys Ala Val Lys 
        275                 280                 285             


Glu Ser Ser Ile Arg Ser Val Gln Glu Thr Val Leu Pro Ile Lys Lys 
    290                 295                 300                 


Arg Lys Thr Arg Glu Thr Val Ser Ile Glu Val Lys Glu Val Val Lys 
305                 310                 315                 320 


Pro Leu Leu Val Ser Thr Leu Gly Glu Lys Ser Gly Lys Gly Leu Lys 
                325                 330                 335     


Thr Cys Lys Ser Pro Gly Arg Lys Ser Lys Glu Ser Ser Pro Lys Gly 
            340                 345                 350         


Arg Ser Ser Ser Ala Ser Ser Pro Pro Lys Lys Glu His His His His 
        355                 360                 365             


His His His Ser Glu Ser Pro Lys Ala Pro Val Pro Leu Leu Pro Pro 
    370                 375                 380                 


Leu Pro Pro Pro Pro Pro Glu Pro Glu Ser Ser Glu Asp Pro Thr Ser 
385                 390                 395                 400 


Pro Pro Glu Pro Gln Asp Leu Ser Ser Ser Val Cys Lys Glu Glu Lys 
                405                 410                 415     


Met Pro Arg Gly Gly Ser Leu Glu Ser Asp Gly Cys Pro Lys Glu Pro 
            420                 425                 430         


Ala Lys Thr Gln Pro Ala Val Ala Thr Ala Ala Thr Ala Ala Glu Lys 
        435                 440                 445             


Tyr Lys His Arg Gly Glu Gly Glu Arg Lys Asp Ile Val Ser Ser Ser 
    450                 455                 460                 


Met Pro Arg Pro Asn Arg Glu Glu Pro Val Asp Ser Arg Thr Pro Val 
465                 470                 475                 480 


Thr Glu Arg Val Ser 
                485 


<210>  18
<211>  585
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  FMRP

<400>  18

Glu Glu Leu Val Val Glu Val Arg Gly Ser Asn Gly Ala Phe Tyr Lys 
1               5                   10                  15      


Ala Phe Val Lys Asp Val His Glu Asp Ser Ile Thr Val Ala Phe Glu 
            20                  25                  30          


Asn Asn Trp Gln Pro Asp Arg Gln Ile Pro Phe His Asp Val Arg Phe 
        35                  40                  45              


Pro Pro Pro Val Gly Tyr Asn Lys Asp Ile Asn Glu Ser Asp Glu Val 
    50                  55                  60                  


Glu Val Tyr Ser Arg Ala Asn Glu Lys Glu Pro Cys Cys Trp Trp Leu 
65                  70                  75                  80  


Ala Lys Val Arg Met Ile Lys Gly Glu Phe Tyr Val Ile Glu Tyr Ala 
                85                  90                  95      


Ala Cys Asp Ala Thr Tyr Asn Glu Ile Val Thr Ile Glu Arg Leu Arg 
            100                 105                 110         


Ser Val Asn Pro Asn Lys Pro Ala Thr Lys Asp Thr Phe His Lys Ile 
        115                 120                 125             


Lys Leu Asp Val Pro Glu Asp Leu Arg Gln Met Cys Ala Lys Glu Ala 
    130                 135                 140                 


Ala His Lys Asp Phe Lys Lys Ala Val Gly Ala Phe Ser Val Thr Tyr 
145                 150                 155                 160 


Asp Pro Glu Asn Tyr Gln Leu Val Ile Leu Ser Ile Asn Glu Val Thr 
                165                 170                 175     


Ser Lys Arg Ala His Met Leu Ile Asp Met His Phe Arg Ser Leu Arg 
            180                 185                 190         


Thr Lys Leu Ser Leu Ile Met Arg Asn Glu Glu Ala Ser Lys Gln Leu 
        195                 200                 205             


Glu Ser Ser Arg Gln Leu Ala Ser Arg Phe His Glu Gln Phe Ile Val 
    210                 215                 220                 


Arg Glu Asp Leu Met Gly Leu Ala Ile Gly Thr His Gly Ala Asn Ile 
225                 230                 235                 240 


Gln Gln Ala Arg Lys Val Pro Gly Val Thr Ala Ile Asp Leu Asp Glu 
                245                 250                 255     


Asp Thr Cys Thr Phe His Ile Tyr Gly Glu Asp Gln Asp Ala Val Lys 
            260                 265                 270         


Lys Ala Arg Ser Phe Leu Glu Phe Ala Glu Asp Val Ile Gln Val Pro 
        275                 280                 285             


Arg Asn Leu Val Gly Lys Val Ile Gly Lys Asn Gly Lys Leu Ile Gln 
    290                 295                 300                 


Glu Ile Val Asp Lys Ser Gly Val Val Arg Val Arg Ile Glu Ala Glu 
305                 310                 315                 320 


Asn Glu Lys Asn Val Pro Gln Glu Glu Glu Ile Met Pro Pro Asn Ser 
                325                 330                 335     


Leu Pro Ser Asn Asn Ser Arg Val Gly Pro Asn Ala Pro Glu Glu Lys 
            340                 345                 350         


Lys His Leu Asp Ile Lys Glu Asn Ser Thr His Phe Ser Gln Pro Asn 
        355                 360                 365             


Ser Thr Lys Val Gln Arg Gly Met Val Pro Phe Val Phe Val Gly Thr 
    370                 375                 380                 


Lys Asp Ser Ile Ala Asn Ala Thr Val Leu Leu Asp Tyr His Leu Asn 
385                 390                 395                 400 


Tyr Leu Lys Glu Val Asp Gln Leu Arg Leu Glu Arg Leu Gln Ile Asp 
                405                 410                 415     


Glu Gln Leu Arg Gln Ile Gly Ala Ser Ser Arg Pro Pro Pro Asn Arg 
            420                 425                 430         


Thr Asp Lys Glu Lys Ser Tyr Val Thr Asp Asp Gly Gln Gly Met Gly 
        435                 440                 445             


Arg Gly Ser Arg Pro Tyr Arg Asn Arg Gly His Gly Arg Arg Gly Pro 
    450                 455                 460                 


Gly Tyr Thr Ser Ala Pro Thr Glu Glu Glu Arg Glu Ser Phe Leu Arg 
465                 470                 475                 480 


Arg Gly Asp Gly Arg Arg Arg Gly Gly Gly Gly Arg Gly Gln Gly Gly 
                485                 490                 495     


Arg Gly Arg Gly Gly Gly Phe Lys Gly Asn Asp Asp His Ser Arg Thr 
            500                 505                 510         


Asp Asn Arg Pro Arg Asn Pro Arg Glu Ala Lys Gly Arg Thr Thr Asp 
        515                 520                 525             


Gly Ser Leu Gln Ile Arg Val Asp Cys Asn Asn Glu Arg Ser Val His 
    530                 535                 540                 


Thr Lys Thr Leu Gln Asn Thr Ser Ser Glu Gly Ser Arg Leu Arg Thr 
545                 550                 555                 560 


Gly Lys Asp Arg Asn Gln Lys Lys Glu Lys Pro Asp Ser Val Asp Gly 
                565                 570                 575     


Gln Gln Pro Leu Val Asn Gly Val Pro 
            580                 585 


<210>  19
<211>  294
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  SMN protein

<400>  19

Met Ala Met Ser Ser Gly Gly Ser Gly Gly Gly Val Pro Glu Gln Glu 
1               5                   10                  15      


Asp Ser Val Leu Phe Arg Arg Gly Thr Gly Gln Ser Asp Asp Ser Asp 
            20                  25                  30          


Ile Trp Asp Asp Thr Ala Leu Ile Lys Ala Tyr Asp Lys Ala Val Ala 
        35                  40                  45              


Ser Phe Lys His Ala Leu Lys Asn Gly Asp Ile Cys Glu Thr Ser Gly 
    50                  55                  60                  


Lys Pro Lys Thr Thr Pro Lys Arg Lys Pro Ala Lys Lys Asn Lys Ser 
65                  70                  75                  80  


Gln Lys Lys Asn Thr Ala Ala Ser Leu Gln Gln Trp Lys Val Gly Asp 
                85                  90                  95      


Lys Cys Ser Ala Ile Trp Ser Glu Asp Gly Cys Ile Tyr Pro Ala Thr 
            100                 105                 110         


Ile Ala Ser Ile Asp Phe Lys Arg Glu Thr Cys Val Val Val Tyr Thr 
        115                 120                 125             


Gly Tyr Gly Asn Arg Glu Glu Gln Asn Leu Ser Asp Leu Leu Ser Pro 
    130                 135                 140                 


Ile Cys Glu Val Ala Asn Asn Ile Glu Gln Asn Ala Gln Glu Asn Glu 
145                 150                 155                 160 


Asn Glu Ser Gln Val Ser Thr Asp Glu Ser Glu Asn Ser Arg Ser Pro 
                165                 170                 175     


Gly Asn Lys Ser Asp Asn Ile Lys Pro Lys Ser Ala Pro Trp Asn Ser 
            180                 185                 190         


Phe Leu Pro Pro Pro Pro Pro Met Pro Gly Pro Arg Leu Gly Pro Gly 
        195                 200                 205             


Lys Pro Gly Leu Lys Phe Asn Gly Pro Pro Pro Pro Pro Pro Pro Pro 
    210                 215                 220                 


Pro Pro His Leu Leu Ser Cys Trp Leu Pro Pro Phe Pro Ser Gly Pro 
225                 230                 235                 240 


Pro Ile Ile Pro Pro Pro Pro Pro Ile Cys Pro Asp Ser Leu Asp Asp 
                245                 250                 255     


Ala Asp Ala Leu Gly Ser Met Leu Ile Ser Trp Tyr Met Ser Gly Tyr 
            260                 265                 270         


His Thr Gly Tyr Tyr Met Gly Phe Arg Gln Asn Gln Lys Glu Gly Arg 
        275                 280                 285             


Cys Ser His Ser Leu Asn 
    290                 


<210>  20
<211>  261
<212>  PRT
<213>  Clostridium difficile


<220>
<221>  MISC_FEATURE
<223>  CPD (C. difficile)

<400>  20

Glu Gly Ser Leu Gly Glu Asp Asp Asn Leu Asp Phe Ser Gln Asn Ile 
1               5                   10                  15      


Val Val Asp Lys Glu Tyr Leu Leu Glu Lys Ile Ser Ser Leu Ala Arg 
            20                  25                  30          


Ser Ser Glu Arg Gly Tyr Ile His Tyr Ile Val Gln Leu Gln Gly Asp 
        35                  40                  45              


Lys Ile Ser Tyr Glu Ala Ala Cys Asn Leu Phe Ala Lys Thr Pro Tyr 
    50                  55                  60                  


Asp Ser Val Leu Phe Gln Lys Asn Ile Glu Asp Ser Glu Ile Ala Tyr 
65                  70                  75                  80  


Tyr Tyr Asn Pro Gly Asp Gly Glu Ile Gln Glu Ile Asp Lys Tyr Lys 
                85                  90                  95      


Ile Pro Ser Ile Ile Ser Asp Arg Pro Lys Ile Lys Leu Thr Phe Ile 
            100                 105                 110         


Gly His Gly Lys Asp Glu Phe Asn Thr Asp Ile Phe Ala Gly Phe Asp 
        115                 120                 125             


Val Asp Ser Leu Ser Thr Glu Ile Glu Ala Ala Ile Asp Leu Ala Lys 
    130                 135                 140                 


Glu Asp Ile Ser Pro Lys Ser Ile Glu Ile Asn Leu Leu Gly Cys Asn 
145                 150                 155                 160 


Met Phe Ser Tyr Ser Ile Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu 
                165                 170                 175     


Leu Leu Lys Val Lys Asp Lys Ile Ser Glu Leu Met Pro Ser Ile Ser 
            180                 185                 190         


Gln Asp Ser Ile Ile Val Ser Ala Asn Gln Tyr Glu Val Arg Ile Asn 
        195                 200                 205             


Ser Glu Gly Arg Arg Glu Leu Leu Asp His Ser Gly Glu Trp Ile Asn 
    210                 215                 220                 


Lys Glu Glu Ser Ile Ile Lys Asp Ile Ser Ser Lys Glu Tyr Ile Ser 
225                 230                 235                 240 


Phe Asn Pro Lys Glu Asn Lys Ile Thr Val Lys Ser Lys Asn Leu Pro 
                245                 250                 255     


Glu Leu Ser Thr Leu 
            260     


<210>  21
<211>  212
<212>  PRT
<213>  Vibrio cholerae


<220>
<221>  MISC_FEATURE
<223>  CPD (V. cholerae)

<400>  21

Lys Glu Ala Leu Ala Asp Gly Lys Ile Leu His Asn Gln Asn Val Asn 
1               5                   10                  15      


Ser Trp Gly Pro Ile Thr Val Thr Pro Thr Thr Asp Gly Gly Glu Thr 
            20                  25                  30          


Arg Phe Asp Gly Gln Ile Ile Val Gln Met Glu Asn Asp Pro Val Val 
        35                  40                  45              


Ala Lys Ala Ala Ala Asn Leu Ala Gly Lys His Ala Glu Ser Ser Val 
    50                  55                  60                  


Val Val Gln Leu Asp Ser Asp Gly Asn Tyr Arg Val Val Tyr Gly Asp 
65                  70                  75                  80  


Pro Ser Lys Leu Asp Gly Lys Leu Arg Trp Gln Leu Val Gly His Gly 
                85                  90                  95      


Arg Asp His Ser Glu Thr Asn Asn Thr Arg Leu Ser Gly Tyr Ser Ala 
            100                 105                 110         


Asp Glu Leu Ala Val Lys Leu Ala Lys Phe Gln Gln Ser Phe Asn Gln 
        115                 120                 125             


Ala Glu Asn Ile Asn Asn Lys Pro Asp His Ile Ser Ile Val Gly Cys 
    130                 135                 140                 


Ser Leu Val Ser Asp Asp Lys Gln Lys Gly Phe Gly His Gln Phe Ile 
145                 150                 155                 160 


Asn Ala Met Asp Ala Asn Gly Leu Arg Val Asp Val Ser Val Arg Ser 
                165                 170                 175     


Ser Glu Leu Ala Val Asp Glu Ala Gly Arg Lys His Thr Lys Asp Ala 
            180                 185                 190         


Asn Gly Asp Trp Val Gln Lys Ala Glu Asn Asn Lys Val Ser Leu Ser 
        195                 200                 205             


Trp Asp Ala Gln 
    210         


<210>  22
<211>  1371
<212>  PRT
<213>  Streptococcus pyogenes


<220>
<221>  MISC_FEATURE
<223>  Cas9 (S. pyogenes)

<400>  22

Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val 
1               5                   10                  15      


Gly Trp Ala Val Ile Thr Asp Asp Tyr Lys Val Pro Ser Lys Lys Phe 
            20                  25                  30          


Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 
        35                  40                  45              


Gly Ala Leu Leu Phe Gly Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 
    50                  55                  60                  


Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 
65                  70                  75                  80  


Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 
                85                  90                  95      


Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys 
            100                 105                 110         


His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr 
        115                 120                 125             


His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Ala Asp 
    130                 135                 140                 


Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 
145                 150                 155                 160 


Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 
                165                 170                 175     


Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Ile Tyr 
            180                 185                 190         


Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Arg Val Asp Ala 
        195                 200                 205             


Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 
    210                 215                 220                 


Leu Ile Ala Gln Leu Pro Gly Glu Lys Arg Asn Gly Leu Phe Gly Asn 
225                 230                 235                 240 


Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 
                245                 250                 255     


Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 
            260                 265                 270         


Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 
        275                 280                 285             


Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 
    290                 295                 300                 


Ile Leu Arg Val Asn Ser Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 
305                 310                 315                 320 


Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 
                325                 330                 335     


Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 
            340                 345                 350         


Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 
        355                 360                 365             


Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 
    370                 375                 380                 


Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg 
385                 390                 395                 400 


Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 
                405                 410                 415     


Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe 
            420                 425                 430         


Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 
        435                 440                 445             


Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 
    450                 455                 460                 


Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 
465                 470                 475                 480 


Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 
                485                 490                 495     


Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 
            500                 505                 510         


Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 
        515                 520                 525             


Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 
    530                 535                 540                 


Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 
545                 550                 555                 560 


Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 
                565                 570                 575     


Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 
            580                 585                 590         


Ala Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 
        595                 600                 605             


Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 
    610                 615                 620                 


Leu Phe Glu Asp Arg Gly Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 
625                 630                 635                 640 


His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 
                645                 650                 655     


Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 
            660                 665                 670         


Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 
        675                 680                 685             


Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 
    690                 695                 700                 


Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly His Ser Leu 
705                 710                 715                 720 


His Glu Gln Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly 
                725                 730                 735     


Ile Leu Gln Thr Val Lys Ile Val Asp Glu Leu Val Lys Val Met Gly 
            740                 745                 750         


His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln Thr 
        755                 760                 765             


Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile Glu 
    770                 775                 780                 


Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro Val 
785                 790                 795                 800 


Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu Gln 
                805                 810                 815     


Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg Leu 
            820                 825                 830         


Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Ile Lys Asp 
        835                 840                 845             


Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg Gly 
    850                 855                 860                 


Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys Asn 
865                 870                 875                 880 


Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys Phe 
                885                 890                 895     


Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp Lys 
            900                 905                 910         


Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr Lys 
        915                 920                 925             


His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp Glu 
    930                 935                 940                 


Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser Lys 
945                 950                 955                 960 


Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg Glu 
                965                 970                 975     


Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val Val 
            980                 985                 990         


Gly Thr Ala Leu Ile Lys Lys Tyr  Pro Lys Leu Glu Ser  Glu Phe Val 
        995                 1000                 1005             


Tyr Gly  Asp Tyr Lys Val Tyr  Asp Val Arg Lys Met  Ile Ala Lys 
    1010                 1015                 1020             


Ser Glu  Gln Glu Ile Gly Lys  Ala Thr Ala Lys Tyr  Phe Phe Tyr 
    1025                 1030                 1035             


Ser Asn  Ile Met Asn Phe Phe  Lys Thr Glu Ile Thr  Leu Ala Asn 
    1040                 1045                 1050             


Gly Glu  Ile Arg Lys Arg Pro  Leu Ile Glu Thr Asn  Gly Glu Thr 
    1055                 1060                 1065             


Gly Glu  Ile Val Trp Asp Lys  Gly Arg Asp Phe Ala  Thr Val Arg 
    1070                 1075                 1080             


Lys Val  Leu Ser Met Pro Gln  Val Asn Ile Val Lys  Lys Thr Glu 
    1085                 1090                 1095             


Val Gln  Thr Gly Gly Phe Ser  Lys Glu Ser Ile Leu  Pro Lys Arg 
    1100                 1105                 1110             


Asn Ser  Asp Lys Leu Ile Ala  Arg Lys Lys Asp Trp  Asp Pro Lys 
    1115                 1120                 1125             


Lys Tyr  Gly Gly Phe Asp Ser  Pro Thr Val Ala Tyr  Ser Val Leu 
    1130                 1135                 1140             


Val Val  Ala Lys Val Glu Lys  Gly Lys Ser Lys Lys  Leu Lys Ser 
    1145                 1150                 1155             


Val Lys  Glu Leu Leu Gly Ile  Thr Ile Met Glu Arg  Ser Ser Phe 
    1160                 1165                 1170             


Glu Lys  Asn Pro Ile Asp Phe  Leu Glu Ala Lys Gly  Tyr Lys Glu 
    1175                 1180                 1185             


Val Lys  Lys Asp Leu Ile Ile  Lys Leu Pro Lys Tyr  Ser Leu Phe 
    1190                 1195                 1200             


Glu Leu  Glu Asn Gly Arg Lys  Arg Met Leu Ala Ser  Ala Gly Glu 
    1205                 1210                 1215             


Leu Gln  Lys Gly Asn Glu Leu  Ala Leu Pro Ser Lys  Tyr Val Asn 
    1220                 1225                 1230             


Phe Leu  Tyr Leu Ala Ser His  Tyr Glu Lys Leu Lys  Gly Ser Pro 
    1235                 1240                 1245             


Glu Asp  Asn Glu Gln Lys Gln  Leu Phe Val Glu Gln  His Lys His 
    1250                 1255                 1260             


Tyr Leu  Asp Glu Ile Ile Glu  Gln Ile Ser Glu Phe  Ser Lys Arg 
    1265                 1270                 1275             


Val Ile  Leu Ala Asp Ala Asn  Leu Asp Lys Val Leu  Ser Ala Tyr 
    1280                 1285                 1290             


Asn Lys  His Arg Asp Lys Pro  Ile Arg Glu Gln Ala  Glu Asn Ile 
    1295                 1300                 1305             


Ile His  Leu Phe Thr Leu Thr  Asn Leu Gly Ala Pro  Ala Ala Phe 
    1310                 1315                 1320             


Lys Tyr  Phe Asp Thr Thr Ile  Asp Arg Lys Arg Tyr  Thr Ser Thr 
    1325                 1330                 1335             


Lys Glu  Val Leu Asp Ala Thr  Leu Ile His Gln Ser  Ile Thr Gly 
    1340                 1345                 1350             


Leu Tyr  Glu Thr Arg Ile Asp  Leu Ser Gln Leu Gly  Gly Asp Ser 
    1355                 1360                 1365             


Pro Val  Arg 
    1370     


<210>  23
<211>  1403
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Cas9 (S. pyogenes) with N-terminal His, SV40 and C-terminal SV40 
       sequences

<400>  23

His His His His His His Gly Ser Gly Ala Thr Met Ala Ser Pro Pro 
1               5                   10                  15      


Lys Lys Lys Arg Lys Val Gly Ser Met Asp Lys Lys Tyr Ser Ile Gly 
            20                  25                  30          


Leu Asp Ile Gly Thr Asn Ser Val Gly Trp Ala Val Ile Thr Asp Asp 
        35                  40                  45              


Tyr Lys Val Pro Ser Lys Lys Phe Lys Val Leu Gly Asn Thr Asp Arg 
    50                  55                  60                  


His Ser Ile Lys Lys Asn Leu Ile Gly Ala Leu Leu Phe Gly Ser Gly 
65                  70                  75                  80  


Glu Thr Ala Glu Ala Thr Arg Leu Lys Arg Thr Ala Arg Arg Arg Tyr 
                85                  90                  95      


Thr Arg Arg Lys Asn Arg Ile Cys Tyr Leu Gln Glu Ile Phe Ser Asn 
            100                 105                 110         


Glu Met Ala Lys Val Asp Asp Ser Phe Phe His Arg Leu Glu Glu Ser 
        115                 120                 125             


Phe Leu Val Glu Glu Asp Lys Lys His Glu Arg His Pro Ile Phe Gly 
    130                 135                 140                 


Asn Ile Val Asp Glu Val Ala Tyr His Glu Lys Tyr Pro Thr Ile Tyr 
145                 150                 155                 160 


His Leu Arg Lys Lys Leu Ala Asp Ser Thr Asp Lys Ala Asp Leu Arg 
                165                 170                 175     


Leu Ile Tyr Leu Ala Leu Ala His Met Ile Lys Phe Arg Gly His Phe 
            180                 185                 190         


Leu Ile Glu Gly Asp Leu Asn Pro Asp Asn Ser Asp Val Asp Lys Leu 
        195                 200                 205             


Phe Ile Gln Leu Val Gln Ile Tyr Asn Gln Leu Phe Glu Glu Asn Pro 
    210                 215                 220                 


Ile Asn Ala Ser Arg Val Asp Ala Lys Ala Ile Leu Ser Ala Arg Leu 
225                 230                 235                 240 


Ser Lys Ser Arg Arg Leu Glu Asn Leu Ile Ala Gln Leu Pro Gly Glu 
                245                 250                 255     


Lys Arg Asn Gly Leu Phe Gly Asn Leu Ile Ala Leu Ser Leu Gly Leu 
            260                 265                 270         


Thr Pro Asn Phe Lys Ser Asn Phe Asp Leu Ala Glu Asp Ala Lys Leu 
        275                 280                 285             


Gln Leu Ser Lys Asp Thr Tyr Asp Asp Asp Leu Asp Asn Leu Leu Ala 
    290                 295                 300                 


Gln Ile Gly Asp Gln Tyr Ala Asp Leu Phe Leu Ala Ala Lys Asn Leu 
305                 310                 315                 320 


Ser Asp Ala Ile Leu Leu Ser Asp Ile Leu Arg Val Asn Ser Glu Ile 
                325                 330                 335     


Thr Lys Ala Pro Leu Ser Ala Ser Met Ile Lys Arg Tyr Asp Glu His 
            340                 345                 350         


His Gln Asp Leu Thr Leu Leu Lys Ala Leu Val Arg Gln Gln Leu Pro 
        355                 360                 365             


Glu Lys Tyr Lys Glu Ile Phe Phe Asp Gln Ser Lys Asn Gly Tyr Ala 
    370                 375                 380                 


Gly Tyr Ile Asp Gly Gly Ala Ser Gln Glu Glu Phe Tyr Lys Phe Ile 
385                 390                 395                 400 


Lys Pro Ile Leu Glu Lys Met Asp Gly Thr Glu Glu Leu Leu Val Lys 
                405                 410                 415     


Leu Asn Arg Glu Asp Leu Leu Arg Lys Gln Arg Thr Phe Asp Asn Gly 
            420                 425                 430         


Ser Ile Pro His Gln Ile His Leu Gly Glu Leu His Ala Ile Leu Arg 
        435                 440                 445             


Arg Gln Glu Asp Phe Tyr Pro Phe Leu Lys Asp Asn Arg Glu Lys Ile 
    450                 455                 460                 


Glu Lys Ile Leu Thr Phe Arg Ile Pro Tyr Tyr Val Gly Pro Leu Ala 
465                 470                 475                 480 


Arg Gly Asn Ser Arg Phe Ala Trp Met Thr Arg Lys Ser Glu Glu Thr 
                485                 490                 495     


Ile Thr Pro Trp Asn Phe Glu Glu Val Val Asp Lys Gly Ala Ser Ala 
            500                 505                 510         


Gln Ser Phe Ile Glu Arg Met Thr Asn Phe Asp Lys Asn Leu Pro Asn 
        515                 520                 525             


Glu Lys Val Leu Pro Lys His Ser Leu Leu Tyr Glu Tyr Phe Thr Val 
    530                 535                 540                 


Tyr Asn Glu Leu Thr Lys Val Lys Tyr Val Thr Glu Gly Met Arg Lys 
545                 550                 555                 560 


Pro Ala Phe Leu Ser Gly Glu Gln Lys Lys Ala Ile Val Asp Leu Leu 
                565                 570                 575     


Phe Lys Thr Asn Arg Lys Val Thr Val Lys Gln Leu Lys Glu Asp Tyr 
            580                 585                 590         


Phe Lys Lys Ile Glu Cys Phe Asp Ser Val Glu Ile Ser Gly Val Glu 
        595                 600                 605             


Asp Arg Phe Asn Ala Ser Leu Gly Ala Tyr His Asp Leu Leu Lys Ile 
    610                 615                 620                 


Ile Lys Asp Lys Asp Phe Leu Asp Asn Glu Glu Asn Glu Asp Ile Leu 
625                 630                 635                 640 


Glu Asp Ile Val Leu Thr Leu Thr Leu Phe Glu Asp Arg Gly Met Ile 
                645                 650                 655     


Glu Glu Arg Leu Lys Thr Tyr Ala His Leu Phe Asp Asp Lys Val Met 
            660                 665                 670         


Lys Gln Leu Lys Arg Arg Arg Tyr Thr Gly Trp Gly Arg Leu Ser Arg 
        675                 680                 685             


Lys Leu Ile Asn Gly Ile Arg Asp Lys Gln Ser Gly Lys Thr Ile Leu 
    690                 695                 700                 


Asp Phe Leu Lys Ser Asp Gly Phe Ala Asn Arg Asn Phe Met Gln Leu 
705                 710                 715                 720 


Ile His Asp Asp Ser Leu Thr Phe Lys Glu Asp Ile Gln Lys Ala Gln 
                725                 730                 735     


Val Ser Gly Gln Gly His Ser Leu His Glu Gln Ile Ala Asn Leu Ala 
            740                 745                 750         


Gly Ser Pro Ala Ile Lys Lys Gly Ile Leu Gln Thr Val Lys Ile Val 
        755                 760                 765             


Asp Glu Leu Val Lys Val Met Gly His Lys Pro Glu Asn Ile Val Ile 
    770                 775                 780                 


Glu Met Ala Arg Glu Asn Gln Thr Thr Gln Lys Gly Gln Lys Asn Ser 
785                 790                 795                 800 


Arg Glu Arg Met Lys Arg Ile Glu Glu Gly Ile Lys Glu Leu Gly Ser 
                805                 810                 815     


Gln Ile Leu Lys Glu His Pro Val Glu Asn Thr Gln Leu Gln Asn Glu 
            820                 825                 830         


Lys Leu Tyr Leu Tyr Tyr Leu Gln Asn Gly Arg Asp Met Tyr Val Asp 
        835                 840                 845             


Gln Glu Leu Asp Ile Asn Arg Leu Ser Asp Tyr Asp Val Asp His Ile 
    850                 855                 860                 


Val Pro Gln Ser Phe Ile Lys Asp Asp Ser Ile Asp Asn Lys Val Leu 
865                 870                 875                 880 


Thr Arg Ser Asp Lys Asn Arg Gly Lys Ser Asp Asn Val Pro Ser Glu 
                885                 890                 895     


Glu Val Val Lys Lys Met Lys Asn Tyr Trp Arg Gln Leu Leu Asn Ala 
            900                 905                 910         


Lys Leu Ile Thr Gln Arg Lys Phe Asp Asn Leu Thr Lys Ala Glu Arg 
        915                 920                 925             


Gly Gly Leu Ser Glu Leu Asp Lys Ala Gly Phe Ile Lys Arg Gln Leu 
    930                 935                 940                 


Val Glu Thr Arg Gln Ile Thr Lys His Val Ala Gln Ile Leu Asp Ser 
945                 950                 955                 960 


Arg Met Asn Thr Lys Tyr Asp Glu Asn Asp Lys Leu Ile Arg Glu Val 
                965                 970                 975     


Lys Val Ile Thr Leu Lys Ser Lys Leu Val Ser Asp Phe Arg Lys Asp 
            980                 985                 990         


Phe Gln Phe Tyr Lys Val Arg Glu  Ile Asn Asn Tyr His  His Ala His 
        995                 1000                 1005             


Asp Ala  Tyr Leu Asn Ala Val  Val Gly Thr Ala Leu  Ile Lys Lys 
    1010                 1015                 1020             


Tyr Pro  Lys Leu Glu Ser Glu  Phe Val Tyr Gly Asp  Tyr Lys Val 
    1025                 1030                 1035             


Tyr Asp  Val Arg Lys Met Ile  Ala Lys Ser Glu Gln  Glu Ile Gly 
    1040                 1045                 1050             


Lys Ala  Thr Ala Lys Tyr Phe  Phe Tyr Ser Asn Ile  Met Asn Phe 
    1055                 1060                 1065             


Phe Lys  Thr Glu Ile Thr Leu  Ala Asn Gly Glu Ile  Arg Lys Arg 
    1070                 1075                 1080             


Pro Leu  Ile Glu Thr Asn Gly  Glu Thr Gly Glu Ile  Val Trp Asp 
    1085                 1090                 1095             


Lys Gly  Arg Asp Phe Ala Thr  Val Arg Lys Val Leu  Ser Met Pro 
    1100                 1105                 1110             


Gln Val  Asn Ile Val Lys Lys  Thr Glu Val Gln Thr  Gly Gly Phe 
    1115                 1120                 1125             


Ser Lys  Glu Ser Ile Leu Pro  Lys Arg Asn Ser Asp  Lys Leu Ile 
    1130                 1135                 1140             


Ala Arg  Lys Lys Asp Trp Asp  Pro Lys Lys Tyr Gly  Gly Phe Asp 
    1145                 1150                 1155             


Ser Pro  Thr Val Ala Tyr Ser  Val Leu Val Val Ala  Lys Val Glu 
    1160                 1165                 1170             


Lys Gly  Lys Ser Lys Lys Leu  Lys Ser Val Lys Glu  Leu Leu Gly 
    1175                 1180                 1185             


Ile Thr  Ile Met Glu Arg Ser  Ser Phe Glu Lys Asn  Pro Ile Asp 
    1190                 1195                 1200             


Phe Leu  Glu Ala Lys Gly Tyr  Lys Glu Val Lys Lys  Asp Leu Ile 
    1205                 1210                 1215             


Ile Lys  Leu Pro Lys Tyr Ser  Leu Phe Glu Leu Glu  Asn Gly Arg 
    1220                 1225                 1230             


Lys Arg  Met Leu Ala Ser Ala  Gly Glu Leu Gln Lys  Gly Asn Glu 
    1235                 1240                 1245             


Leu Ala  Leu Pro Ser Lys Tyr  Val Asn Phe Leu Tyr  Leu Ala Ser 
    1250                 1255                 1260             


His Tyr  Glu Lys Leu Lys Gly  Ser Pro Glu Asp Asn  Glu Gln Lys 
    1265                 1270                 1275             


Gln Leu  Phe Val Glu Gln His  Lys His Tyr Leu Asp  Glu Ile Ile 
    1280                 1285                 1290             


Glu Gln  Ile Ser Glu Phe Ser  Lys Arg Val Ile Leu  Ala Asp Ala 
    1295                 1300                 1305             


Asn Leu  Asp Lys Val Leu Ser  Ala Tyr Asn Lys His  Arg Asp Lys 
    1310                 1315                 1320             


Pro Ile  Arg Glu Gln Ala Glu  Asn Ile Ile His Leu  Phe Thr Leu 
    1325                 1330                 1335             


Thr Asn  Leu Gly Ala Pro Ala  Ala Phe Lys Tyr Phe  Asp Thr Thr 
    1340                 1345                 1350             


Ile Asp  Arg Lys Arg Tyr Thr  Ser Thr Lys Glu Val  Leu Asp Ala 
    1355                 1360                 1365             


Thr Leu  Ile His Gln Ser Ile  Thr Gly Leu Tyr Glu  Thr Arg Ile 
    1370                 1375                 1380             


Asp Leu  Ser Gln Leu Gly Gly  Asp Ser Pro Val Arg  Ser Pro Lys 
    1385                 1390                 1395             


Lys Lys  Arg Lys Val 
    1400             


<210>  24
<211>  289
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  PNP

<400>  24

Met Glu Asn Gly Tyr Thr Tyr Glu Asp Tyr Lys Asn Thr Ala Glu Trp 
1               5                   10                  15      


Leu Leu Ser His Thr Lys His Arg Pro Gln Val Ala Ile Ile Cys Gly 
            20                  25                  30          


Ser Gly Leu Gly Gly Leu Thr Asp Lys Leu Thr Gln Ala Gln Ile Phe 
        35                  40                  45              


Asp Tyr Ser Glu Ile Pro Asn Phe Pro Arg Ser Thr Val Pro Gly His 
    50                  55                  60                  


Ala Gly Arg Leu Val Phe Gly Phe Leu Asn Gly Arg Ala Cys Val Met 
65                  70                  75                  80  


Met Gln Gly Arg Phe His Met Tyr Glu Gly Tyr Pro Leu Trp Lys Val 
                85                  90                  95      


Thr Phe Pro Val Arg Val Phe His Leu Leu Gly Val Asp Thr Leu Val 
            100                 105                 110         


Val Thr Asn Ala Ala Gly Gly Leu Asn Pro Lys Phe Glu Val Gly Asp 
        115                 120                 125             


Ile Met Leu Ile Arg Asp His Ile Asn Leu Pro Gly Phe Ser Gly Gln 
    130                 135                 140                 


Asn Pro Leu Arg Gly Pro Asn Asp Glu Arg Phe Gly Asp Arg Phe Pro 
145                 150                 155                 160 


Ala Met Ser Asp Ala Tyr Asp Arg Thr Met Arg Gln Arg Ala Leu Ser 
                165                 170                 175     


Thr Trp Lys Gln Met Gly Glu Gln Arg Glu Leu Gln Glu Gly Thr Tyr 
            180                 185                 190         


Val Met Val Ala Gly Pro Ser Phe Glu Thr Val Ala Glu Cys Arg Val 
        195                 200                 205             


Leu Gln Lys Leu Gly Ala Asp Ala Val Gly Met Ser Thr Val Pro Glu 
    210                 215                 220                 


Val Ile Val Ala Arg His Cys Gly Leu Arg Val Phe Gly Phe Ser Leu 
225                 230                 235                 240 


Ile Thr Asn Lys Val Ile Met Asp Tyr Glu Ser Leu Glu Lys Ala Asn 
                245                 250                 255     


His Glu Glu Val Leu Ala Ala Gly Lys Gln Ala Ala Gln Lys Leu Glu 
            260                 265                 270         


Gln Phe Val Ser Ile Leu Met Ala Ser Ile Pro Leu Pro Asp Lys Ala 
        275                 280                 285             


Ser 
    


<210>  25
<211>  98
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  SUMO

<400>  25

Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 
1               5                   10                  15      


Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 
            20                  25                  30          


Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 
        35                  40                  45              


Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 
    50                  55                  60                  


Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 
65                  70                  75                  80  


Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 
                85                  90                  95      


Gly Gly 
        


<210>  26
<211>  14
<212>  PRT
<213>  Corynebacterium diphtheriae


<220>
<221>  MISC_FEATURE
<223>  delta-dta

<400>  26

Cys Ala Gly Asn Arg Val Arg Arg Ser Val Gly Ser Ser Leu 
1               5                   10                  


<210>  27
<211>  15
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Strep Tag (TM) II

<400>  27

Leu Val Pro Arg Gly Ser Ala Trp Ser His Pro Gln Phe Glu Lys 
1               5                   10                  15  


<210>  28
<211>  543
<212>  PRT
<213>  Clostridium difficile


<220>
<221>  MISC_FEATURE
<223>  contains glucosyl transferase domain

<400>  28

Met Ser Leu Val Asn Arg Lys Gln Leu Glu Lys Met Ala Asn Val Arg 
1               5                   10                  15      


Phe Arg Thr Gln Glu Asp Glu Tyr Val Ala Ile Leu Asp Ala Leu Glu 
            20                  25                  30          


Glu Tyr His Asn Met Ser Glu Asn Thr Val Val Glu Lys Tyr Leu Lys 
        35                  40                  45              


Leu Lys Asp Ile Asn Ser Leu Thr Asp Ile Tyr Ile Asp Thr Tyr Lys 
    50                  55                  60                  


Lys Ser Gly Arg Asn Lys Ala Leu Lys Lys Phe Lys Glu Tyr Leu Val 
65                  70                  75                  80  


Thr Glu Val Leu Glu Leu Lys Asn Asn Asn Leu Thr Pro Val Glu Lys 
                85                  90                  95      


Asn Leu His Phe Val Trp Ile Gly Gly Gln Ile Asn Asp Thr Ala Ile 
            100                 105                 110         


Asn Tyr Ile Asn Gln Trp Lys Asp Val Asn Ser Asp Tyr Asn Val Asn 
        115                 120                 125             


Val Phe Tyr Asp Ser Asn Ala Phe Leu Ile Asn Thr Leu Lys Lys Thr 
    130                 135                 140                 


Val Val Glu Ser Ala Ile Asn Asp Thr Leu Glu Ser Phe Arg Glu Asn 
145                 150                 155                 160 


Leu Asn Asp Pro Arg Phe Asp Tyr Asn Lys Phe Phe Arg Lys Arg Met 
                165                 170                 175     


Glu Ile Ile Tyr Asp Lys Gln Lys Asn Phe Ile Asn Tyr Tyr Lys Ala 
            180                 185                 190         


Gln Arg Glu Glu Asn Pro Glu Leu Ile Ile Asp Asp Ile Val Lys Thr 
        195                 200                 205             


Tyr Leu Ser Asn Glu Tyr Ser Lys Glu Ile Asp Glu Leu Asn Thr Tyr 
    210                 215                 220                 


Ile Glu Glu Ser Leu Asn Lys Ile Thr Gln Asn Ser Gly Asn Asp Val 
225                 230                 235                 240 


Arg Asn Phe Glu Glu Phe Lys Asn Gly Glu Ser Phe Asn Leu Tyr Glu 
                245                 250                 255     


Gln Glu Leu Val Glu Arg Trp Asn Leu Ala Ala Ala Ser Asp Ile Leu 
            260                 265                 270         


Arg Ile Ser Ala Leu Lys Glu Ile Gly Gly Met Tyr Leu Asp Val Asp 
        275                 280                 285             


Met Leu Pro Gly Ile Gln Pro Asp Leu Phe Glu Ser Ile Glu Lys Pro 
    290                 295                 300                 


Ser Ser Val Thr Val Asp Phe Trp Glu Met Thr Lys Leu Glu Ala Ile 
305                 310                 315                 320 


Met Lys Tyr Lys Glu Tyr Ile Pro Glu Tyr Thr Ser Glu His Phe Asp 
                325                 330                 335     


Met Leu Asp Glu Glu Val Gln Ser Ser Phe Glu Ser Val Leu Ala Ser 
            340                 345                 350         


Lys Ser Asp Lys Ser Glu Ile Phe Ser Ser Leu Gly Asp Met Glu Ala 
        355                 360                 365             


Ser Pro Leu Glu Val Lys Ile Ala Phe Asn Ser Lys Gly Ile Ile Asn 
    370                 375                 380                 


Gln Gly Leu Ile Ser Val Lys Asp Ser Tyr Cys Ser Asn Leu Ile Val 
385                 390                 395                 400 


Lys Gln Ile Glu Asn Arg Tyr Lys Ile Leu Asn Asn Ser Leu Asn Pro 
                405                 410                 415     


Ala Ile Ser Glu Asp Asn Asp Phe Asn Thr Thr Thr Asn Thr Phe Ile 
            420                 425                 430         


Asp Ser Ile Met Ala Glu Ala Asn Ala Asp Asn Gly Arg Phe Met Met 
        435                 440                 445             


Glu Leu Gly Lys Tyr Leu Arg Val Gly Phe Phe Pro Asp Val Lys Thr 
    450                 455                 460                 


Thr Ile Asn Leu Ser Gly Pro Glu Ala Tyr Ala Ala Ala Tyr Gln Asp 
465                 470                 475                 480 


Leu Leu Met Phe Lys Glu Gly Ser Met Asn Ile His Leu Ile Glu Ala 
                485                 490                 495     


Asp Leu Arg Asn Phe Glu Ile Ser Lys Thr Asn Ile Ser Gln Ser Thr 
            500                 505                 510         


Glu Gln Glu Met Ala Ser Leu Trp Ser Phe Asp Asp Ala Arg Ala Lys 
        515                 520                 525             


Ala Gln Phe Glu Glu Tyr Lys Arg Asn Tyr Phe Glu Gly Ser Leu 
    530                 535                 540             


<210>  29
<211>  1530
<212>  DNA
<213>  Homo sapiens


<220>
<221>  misc_feature
<223>  RRSP

<400>  29
ggtgataaaa ccaaggtcgt ggtcgattta gcgcaaatct ttacggtgca agagctgaaa       60

gaaagagcaa aagtttttgc taaaccgatt ggcgcatcct accaaggtat tctcgatcaa      120

ctcgaccttg tgcatcaggc taaaggccgc gatcaaatcg cagcgagctt tgagcttaat      180

aagaagatta atgactacat cgctgaacat ccaacttcgg ggcgtaatca agcgctaacg      240

cagttgaaag agcaggtcac cagtgcgttg tttatcggta agatgcaagt tgcccaagcg      300

ggtattgatg caatcgcaca aacaagaccg gagcttgccg ctcgtatctt tatggtcgcg      360

attgaagaag ccaacggtaa acacgtaggt ttgacggaca tgatggttcg ttgggccaat      420

gaagacccat acttggcacc gaagcatggt tacaaaggcg aaacgccaag tgaccttggt      480

tttgatgcga agtaccacgt agatctaggt gagcattacg ctgatttcaa acagtggtta      540

gaaacgtccc agtcgaacgg gttgttgagt aaagcgacgt tggatgaatc cactaaaacg      600

gttcatcttg gctatagcta tcaagaactt caggatttga cgggtgctga atcggtgcaa      660

atggcgttct acttcctgaa agaagcggcg aagaaagcgg atccgatttc tggtgattca      720

gctgaaatga tactgctgaa gaaatttgca gatcaaagct acttatctca acttgattcc      780

gaccgaatgg atcaaattga aggtatctac cgcagtagcc atgagacgga tattgacgct      840

tgggatcgtc gttactctgg tacaggctat gatgagctga cgaataagct tgctagtgca      900

acgggcgttg acgagcagct tgcggttctt ctggatgatc gtaaaggcct cttgattggt      960

gaagtgcatg gcagcgacgt caacggccta cgctttgtta atgaacagat ggatgcactg     1020

aaaaaacagg gagtcacagt cattggcctt gagcatttac gctcagacct tgcgcaaccg     1080

ctgattgatc gctacctagc tacgggtgtg atgtcgagtg aactaagcgc aatgctgaaa     1140

acaaagcatc tcgatgtcac tctttttgaa aacgcacgtg ctaacggtat gcgcatcgtc     1200

gcgctggatg caaacagctc tgcgcgtcca aatgttcagg gaacagaaca tggtctgatg     1260

taccgtgctg gtgctgcgaa caacattgcg gtggaagtat tacaaaatct gcctgatggc     1320

gaaaagttcg ttgctatcta cggtaaagcg catttgcagt ctcacaaagg gattgaaggg     1380

ttcgttcctg gtatcacgca ccgtctcgat cttcctgcgc ttaaagtcag tgactcgaac     1440

cagttcacag ttgaacaaga cgatgtaagt ctacgtgttg tctacgatga tgttgctaac     1500

aaaccgaaga tcacgttcaa gggcagtttg                                      1530


<210>  30
<211>  510
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  RRSP

<400>  30

Gly Asp Lys Thr Lys Val Val Val Asp Leu Ala Gln Ile Phe Thr Val 
1               5                   10                  15      


Gln Glu Leu Lys Glu Arg Ala Lys Val Phe Ala Lys Pro Ile Gly Ala 
            20                  25                  30          


Ser Tyr Gln Gly Ile Leu Asp Gln Leu Asp Leu Val His Gln Ala Lys 
        35                  40                  45              


Gly Arg Asp Gln Ile Ala Ala Ser Phe Glu Leu Asn Lys Lys Ile Asn 
    50                  55                  60                  


Asp Tyr Ile Ala Glu His Pro Thr Ser Gly Arg Asn Gln Ala Leu Thr 
65                  70                  75                  80  


Gln Leu Lys Glu Gln Val Thr Ser Ala Leu Phe Ile Gly Lys Met Gln 
                85                  90                  95      


Val Ala Gln Ala Gly Ile Asp Ala Ile Ala Gln Thr Arg Pro Glu Leu 
            100                 105                 110         


Ala Ala Arg Ile Phe Met Val Ala Ile Glu Glu Ala Asn Gly Lys His 
        115                 120                 125             


Val Gly Leu Thr Asp Met Met Val Arg Trp Ala Asn Glu Asp Pro Tyr 
    130                 135                 140                 


Leu Ala Pro Lys His Gly Tyr Lys Gly Glu Thr Pro Ser Asp Leu Gly 
145                 150                 155                 160 


Phe Asp Ala Lys Tyr His Val Asp Leu Gly Glu His Tyr Ala Asp Phe 
                165                 170                 175     


Lys Gln Trp Leu Glu Thr Ser Gln Ser Asn Gly Leu Leu Ser Lys Ala 
            180                 185                 190         


Thr Leu Asp Glu Ser Thr Lys Thr Val His Leu Gly Tyr Ser Tyr Gln 
        195                 200                 205             


Glu Leu Gln Asp Leu Thr Gly Ala Glu Ser Val Gln Met Ala Phe Tyr 
    210                 215                 220                 


Phe Leu Lys Glu Ala Ala Lys Lys Ala Asp Pro Ile Ser Gly Asp Ser 
225                 230                 235                 240 


Ala Glu Met Ile Leu Leu Lys Lys Phe Ala Asp Gln Ser Tyr Leu Ser 
                245                 250                 255     


Gln Leu Asp Ser Asp Arg Met Asp Gln Ile Glu Gly Ile Tyr Arg Ser 
            260                 265                 270         


Ser His Glu Thr Asp Ile Asp Ala Trp Asp Arg Arg Tyr Ser Gly Thr 
        275                 280                 285             


Gly Tyr Asp Glu Leu Thr Asn Lys Leu Ala Ser Ala Thr Gly Val Asp 
    290                 295                 300                 


Glu Gln Leu Ala Val Leu Leu Asp Asp Arg Lys Gly Leu Leu Ile Gly 
305                 310                 315                 320 


Glu Val His Gly Ser Asp Val Asn Gly Leu Arg Phe Val Asn Glu Gln 
                325                 330                 335     


Met Asp Ala Leu Lys Lys Gln Gly Val Thr Val Ile Gly Leu Glu His 
            340                 345                 350         


Leu Arg Ser Asp Leu Ala Gln Pro Leu Ile Asp Arg Tyr Leu Ala Thr 
        355                 360                 365             


Gly Val Met Ser Ser Glu Leu Ser Ala Met Leu Lys Thr Lys His Leu 
    370                 375                 380                 


Asp Val Thr Leu Phe Glu Asn Ala Arg Ala Asn Gly Met Arg Ile Val 
385                 390                 395                 400 


Ala Leu Asp Ala Asn Ser Ser Ala Arg Pro Asn Val Gln Gly Thr Glu 
                405                 410                 415     


His Gly Leu Met Tyr Arg Ala Gly Ala Ala Asn Asn Ile Ala Val Glu 
            420                 425                 430         


Val Leu Gln Asn Leu Pro Asp Gly Glu Lys Phe Val Ala Ile Tyr Gly 
        435                 440                 445             


Lys Ala His Leu Gln Ser His Lys Gly Ile Glu Gly Phe Val Pro Gly 
    450                 455                 460                 


Ile Thr His Arg Leu Asp Leu Pro Ala Leu Lys Val Ser Asp Ser Asn 
465                 470                 475                 480 


Gln Phe Thr Val Glu Gln Asp Asp Val Ser Leu Arg Val Val Tyr Asp 
                485                 490                 495     


Asp Val Ala Asn Lys Pro Lys Ile Thr Phe Lys Gly Ser Leu 
            500                 505                 510 


