                         SEQUENCE LISTING

<110>  YEDA RESEARCH AND DEVELOPMENT CO. LTD.
       SHAUL, Yosef
       REUVEN, Nina
 
<120>  SYSTEMS AND METHODS FOR IDENTIFYING CELLS THAT HAVE UNDERGONE 
       GENOME EDITING

<130>  85175

<150>  IL 271656
<151>  2019-12-22

<160>  50    

<170>  PatentIn version 3.5

<210>  1
<211>  25
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  1
caccgtagaa tcccaggatt caggc                                             25


<210>  2
<211>  25
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  2
aaacgcctga atcctgggat tctac                                             25


<210>  3
<211>  44
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  3
ctcgaggtcg accactattc tgccatcctg caggtcctac atcg                        44


<210>  4
<211>  45
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  4
ggtggcaagc ttggcgggtg gtaaagtggc aacggcgaat ttggg                       45


<210>  5
<211>  37
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  5
cccgccaagc ttgccaccat ggtgagcaag ggcgagg                                37


<210>  6
<211>  46
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  6
gattcaggat ccagctcgag atctgagtcc ggacttgtac agctcg                      46


<210>  7
<211>  46
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  7
cgagctggat cctgaatcct gggattctag tatgcaataa gagatg                      46


<210>  8
<211>  46
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  8
ggccgctcta gagcagtgag ccaagaccag gctactgcac tccagc                      46


<210>  9
<211>  24
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  9
caccggaccc ttaatgatgc aggt                                              24


<210>  10
<211>  24
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  10
aaacacctgc atcattaagg gtcc                                              24


<210>  11
<211>  103
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  11
tctgagcaga gactcacccg tttataatag ttctttatct tggttgccat gtcaacctgc       60

atcattaagg gtccattttc ctcactatat tctgcaagaa taa                        103


<210>  12
<211>  26
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  12
gcagaaccca tacatggata tggagg                                            26


<210>  13
<211>  26
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  13
tatggtatat gttcacagat taccag                                            26


<210>  14
<211>  25
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  14
caccgcttaa tgatgcaggt tgaca                                             25


<210>  15
<211>  25
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  15
aaactgtcaa cctgcatcat taagc                                             25


<210>  16
<211>  100
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  16
ctgagcagag actcacccgt ttataatagt tctttatctt ggttgccatg ccaacctgca       60

tcattaaggg tccattttcc tcactatatt ctgcaagaat                            100


<210>  17
<211>  25
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  17
caccgctgca cgccgtgggt caggg                                             25


<210>  18
<211>  25
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  18
aaacccctga cccacggcgt gcagc                                             25


<210>  19
<211>  99
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Single strand DNA oligonucleotide

<400>  19
accggcaagc tgcccgtgcc ctggcccacc ctcgtgacca ccctgacata cggcgtgcag       60

tgcttcagcc gctaccccga ccacatgaag cagcacgac                              99


<210>  20
<211>  9
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Meganuclease family structural motif sequence

<400>  20

Leu Ala Gly Leu Ile Asp Ala Asp Gly 
1               5                   


<210>  21
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Meganuclease family structural motif sequence

<400>  21

Gly Ile Tyr Tyr Ile Gly 
1               5       


<210>  22
<211>  8
<212>  DNA
<213>  Artificial sequence

<220>
<223>  PAM sequence of Neisseria meningitidis-derived Cas9


<220>
<221>  misc_feature
<222>  (1)..(4)
<223>  n is a, c, g, or t

<400>  22
nnnngatt                                                                 8


<210>  23
<211>  6
<212>  DNA
<213>  Artificial sequence

<220>
<223>  PAM sequence of Streptococcus thermophilus-derived Cas9


<220>
<221>  misc_feature
<222>  (1)..(2)
<223>  n is a, c, g, or t

<400>  23
nnagaa                                                                   6


<210>  24
<211>  6
<212>  DNA
<213>  Artificial sequence

<220>
<223>  PAM sequence of Treponema denticola-derived Cas9


<220>
<221>  misc_feature
<222>  (1)..(1)
<223>  n is a, c, g, or t

<400>  24
naaaac                                                                   6


<210>  25
<211>  626
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Amino acid sequence of UL12

<400>  25

Met Glu Ser Thr Gly Gly Pro Ala Cys Pro Pro Gly Arg Thr Val Thr 
1               5                   10                  15      


Lys Arg Ser Trp Ala Leu Ala Glu Asp Thr Pro Arg Gly Pro Asp Ser 
            20                  25                  30          


Pro Pro Lys Arg Pro Arg Pro Asn Ser Leu Pro Leu Thr Thr Thr Phe 
        35                  40                  45              


Arg Pro Leu Pro Pro Pro Pro Gln Thr Thr Ser Ala Val Asp Pro Ser 
    50                  55                  60                  


Ser His Ser Pro Val Asn Pro Pro Arg Asp Gln His Ala Thr Asp Thr 
65                  70                  75                  80  


Ala Asp Glu Lys Pro Arg Ala Ala Ser Pro Ala Leu Ser Asp Ala Ser 
                85                  90                  95      


Gly Pro Pro Thr Pro Asp Ile Pro Leu Ser Pro Gly Gly Thr His Ala 
            100                 105                 110         


Arg Asp Pro Asp Ala Asp Pro Asp Ser Pro Asp Leu Asp Ser Met Trp 
        115                 120                 125             


Ser Ala Ser Val Ile Pro Asn Ala Leu Pro Ser His Ile Leu Ala Glu 
    130                 135                 140                 


Thr Phe Glu Arg His Leu Arg Gly Leu Leu Arg Gly Val Arg Ala Pro 
145                 150                 155                 160 


Leu Ala Ile Gly Pro Leu Trp Ala Arg Leu Asp Tyr Leu Cys Ser Leu 
                165                 170                 175     


Ala Val Val Leu Glu Glu Ala Gly Met Val Asp Arg Gly Leu Gly Arg 
            180                 185                 190         


His Leu Trp Arg Leu Thr Arg Arg Gly Pro Pro Ala Ala Ala Asp Ala 
        195                 200                 205             


Val Ala Pro Arg Pro Leu Met Gly Phe Tyr Glu Ala Ala Thr Gln Asn 
    210                 215                 220                 


Gln Ala Asp Cys Gln Leu Trp Ala Leu Leu Arg Arg Gly Leu Thr Thr 
225                 230                 235                 240 


Ala Ser Thr Leu Arg Trp Gly Pro Gln Gly Pro Cys Phe Ser Pro Gln 
                245                 250                 255     


Trp Leu Lys His Asn Ala Ser Leu Arg Pro Asp Val Gln Ser Ser Ala 
            260                 265                 270         


Val Met Phe Gly Arg Val Asn Glu Pro Thr Ala Arg Ser Leu Leu Phe 
        275                 280                 285             


Arg Tyr Cys Val Gly Arg Ala Asp Asp Gly Gly Glu Ala Gly Ala Asp 
    290                 295                 300                 


Thr Arg Arg Phe Ile Phe His Glu Pro Gly Asp Leu Ala Glu Glu Asn 
305                 310                 315                 320 


Val His Thr Cys Gly Val Leu Met Asp Gly His Thr Gly Met Val Gly 
                325                 330                 335     


Ala Ser Leu Asp Ile Leu Val Cys Pro Arg Asp Ile His Gly Tyr Leu 
            340                 345                 350         


Ala Pro Val Pro Lys Thr Pro Leu Ala Phe Tyr Glu Val Lys Cys Arg 
        355                 360                 365             


Ala Lys Tyr Ala Phe Asp Pro Met Asp Pro Ser Asp Pro Thr Ala Ser 
    370                 375                 380                 


Ala Tyr Glu Asp Leu Met Ala His Arg Ser Pro Glu Ala Phe Arg Ala 
385                 390                 395                 400 


Phe Ile Arg Ser Ile Pro Lys Pro Ser Val Arg Tyr Phe Ala Pro Gly 
                405                 410                 415     


Arg Val Pro Gly Pro Glu Glu Ala Leu Val Thr Gln Asp Gln Ala Trp 
            420                 425                 430         


Ser Glu Ala His Ala Ser Gly Glu Lys Arg Arg Cys Ser Ala Ala Asp 
        435                 440                 445             


Arg Ala Leu Val Glu Leu Asn Ser Gly Val Val Ser Glu Val Leu Leu 
    450                 455                 460                 


Phe Gly Ala Pro Asp Leu Gly Arg His Thr Ile Ser Pro Val Ser Trp 
465                 470                 475                 480 


Ser Ser Gly Asp Leu Val Arg Arg Glu Pro Val Phe Ala Asn Pro Arg 
                485                 490                 495     


His Pro Asn Phe Lys Gln Ile Leu Val Gln Gly Tyr Val Leu Asp Ser 
            500                 505                 510         


His Phe Pro Asp Cys Pro Pro His Pro His Leu Val Thr Phe Ile Gly 
        515                 520                 525             


Arg His Arg Thr Ser Ala Glu Glu Gly Val Thr Phe Arg Leu Glu Asp 
    530                 535                 540                 


Gly Ala Gly Ala Leu Gly Ala Ala Gly Pro Ser Lys Ala Ser Ile Leu 
545                 550                 555                 560 


Pro Asn Gln Ala Val Pro Ile Ala Leu Ile Ile Thr Pro Val Arg Ile 
                565                 570                 575     


Asp Pro Glu Ile Tyr Lys Ala Ile Gln Arg Ser Ser Arg Leu Ala Phe 
            580                 585                 590         


Asp Asp Thr Leu Ala Glu Leu Trp Ala Ser Arg Ser Pro Gly Pro Gly 
        595                 600                 605             


Pro Ala Ala Ala Glu Thr Thr Ser Ser Ser Pro Thr Thr Gly Arg Ser 
    610                 615                 620                 


Ser Arg 
625     


<210>  26
<211>  126
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Amino acid sequence of the first 126 amino acids of UL12

<400>  26

Met Glu Ser Thr Gly Gly Pro Ala Cys Pro Pro Gly Arg Thr Val Thr 
1               5                   10                  15      


Lys Arg Ser Trp Ala Leu Ala Glu Asp Thr Pro Arg Gly Pro Asp Ser 
            20                  25                  30          


Pro Pro Lys Arg Pro Arg Pro Asn Ser Leu Pro Leu Thr Thr Thr Phe 
        35                  40                  45              


Arg Pro Leu Pro Pro Pro Pro Gln Thr Thr Ser Ala Val Asp Pro Ser 
    50                  55                  60                  


Ser His Ser Pro Val Asn Pro Pro Arg Asp Gln His Ala Thr Asp Thr 
65                  70                  75                  80  


Ala Asp Glu Lys Pro Arg Ala Ala Ser Pro Ala Leu Ser Asp Ala Ser 
                85                  90                  95      


Gly Pro Pro Thr Pro Asp Ile Pro Leu Ser Pro Gly Gly Thr His Ala 
            100                 105                 110         


Arg Asp Pro Asp Ala Asp Pro Asp Ser Pro Asp Leu Asp Ser 
        115                 120                 125     


<210>  27
<211>  1553
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Amino acid sequence of the complete Cas9-UL12 fusion protein U3

<400>  27

Met Glu Ser Thr Gly Gly Pro Ala Cys Pro Pro Gly Arg Thr Val Thr 
1               5                   10                  15      


Lys Arg Ser Trp Ala Leu Ala Glu Asp Thr Pro Arg Gly Pro Asp Ser 
            20                  25                  30          


Pro Pro Lys Arg Pro Arg Pro Asn Ser Leu Pro Leu Thr Thr Thr Phe 
        35                  40                  45              


Arg Pro Leu Pro Pro Pro Pro Gln Thr Thr Ser Ala Val Asp Pro Ser 
    50                  55                  60                  


Ser His Ser Pro Val Asn Pro Pro Arg Asp Gln His Ala Thr Asp Thr 
65                  70                  75                  80  


Ala Asp Glu Lys Pro Arg Ala Ala Ser Pro Ala Leu Ser Asp Ala Ser 
                85                  90                  95      


Gly Pro Pro Thr Pro Asp Ile Pro Leu Ser Pro Gly Gly Thr His Ala 
            100                 105                 110         


Arg Asp Pro Asp Ala Asp Pro Asp Ser Pro Asp Leu Asp Ser Gly Ser 
        115                 120                 125             


Val Met Asp Tyr Lys Asp His Asp Gly Asp Tyr Lys Asp His Asp Ile 
    130                 135                 140                 


Asp Tyr Lys Asp Asp Asp Asp Lys Met Ala Pro Lys Lys Lys Arg Lys 
145                 150                 155                 160 


Val Gly Ile His Gly Val Pro Ala Ala Asp Lys Lys Tyr Ser Ile Gly 
                165                 170                 175     


Leu Asp Ile Gly Thr Asn Ser Val Gly Trp Ala Val Ile Thr Asp Glu 
            180                 185                 190         


Tyr Lys Val Pro Ser Lys Lys Phe Lys Val Leu Gly Asn Thr Asp Arg 
        195                 200                 205             


His Ser Ile Lys Lys Asn Leu Ile Gly Ala Leu Leu Phe Asp Ser Gly 
    210                 215                 220                 


Glu Thr Ala Glu Ala Thr Arg Leu Lys Arg Thr Ala Arg Arg Arg Tyr 
225                 230                 235                 240 


Thr Arg Arg Lys Asn Arg Ile Cys Tyr Leu Gln Glu Ile Phe Ser Asn 
                245                 250                 255     


Glu Met Ala Lys Val Asp Asp Ser Phe Phe His Arg Leu Glu Glu Ser 
            260                 265                 270         


Phe Leu Val Glu Glu Asp Lys Lys His Glu Arg His Pro Ile Phe Gly 
        275                 280                 285             


Asn Ile Val Asp Glu Val Ala Tyr His Glu Lys Tyr Pro Thr Ile Tyr 
    290                 295                 300                 


His Leu Arg Lys Lys Leu Val Asp Ser Thr Asp Lys Ala Asp Leu Arg 
305                 310                 315                 320 


Leu Ile Tyr Leu Ala Leu Ala His Met Ile Lys Phe Arg Gly His Phe 
                325                 330                 335     


Leu Ile Glu Gly Asp Leu Asn Pro Asp Asn Ser Asp Val Asp Lys Leu 
            340                 345                 350         


Phe Ile Gln Leu Val Gln Thr Tyr Asn Gln Leu Phe Glu Glu Asn Pro 
        355                 360                 365             


Ile Asn Ala Ser Gly Val Asp Ala Lys Ala Ile Leu Ser Ala Arg Leu 
    370                 375                 380                 


Ser Lys Ser Arg Arg Leu Glu Asn Leu Ile Ala Gln Leu Pro Gly Glu 
385                 390                 395                 400 


Lys Lys Asn Gly Leu Phe Gly Asn Leu Ile Ala Leu Ser Leu Gly Leu 
                405                 410                 415     


Thr Pro Asn Phe Lys Ser Asn Phe Asp Leu Ala Glu Asp Ala Lys Leu 
            420                 425                 430         


Gln Leu Ser Lys Asp Thr Tyr Asp Asp Asp Leu Asp Asn Leu Leu Ala 
        435                 440                 445             


Gln Ile Gly Asp Gln Tyr Ala Asp Leu Phe Leu Ala Ala Lys Asn Leu 
    450                 455                 460                 


Ser Asp Ala Ile Leu Leu Ser Asp Ile Leu Arg Val Asn Thr Glu Ile 
465                 470                 475                 480 


Thr Lys Ala Pro Leu Ser Ala Ser Met Ile Lys Arg Tyr Asp Glu His 
                485                 490                 495     


His Gln Asp Leu Thr Leu Leu Lys Ala Leu Val Arg Gln Gln Leu Pro 
            500                 505                 510         


Glu Lys Tyr Lys Glu Ile Phe Phe Asp Gln Ser Lys Asn Gly Tyr Ala 
        515                 520                 525             


Gly Tyr Ile Asp Gly Gly Ala Ser Gln Glu Glu Phe Tyr Lys Phe Ile 
    530                 535                 540                 


Lys Pro Ile Leu Glu Lys Met Asp Gly Thr Glu Glu Leu Leu Val Lys 
545                 550                 555                 560 


Leu Asn Arg Glu Asp Leu Leu Arg Lys Gln Arg Thr Phe Asp Asn Gly 
                565                 570                 575     


Ser Ile Pro His Gln Ile His Leu Gly Glu Leu His Ala Ile Leu Arg 
            580                 585                 590         


Arg Gln Glu Asp Phe Tyr Pro Phe Leu Lys Asp Asn Arg Glu Lys Ile 
        595                 600                 605             


Glu Lys Ile Leu Thr Phe Arg Ile Pro Tyr Tyr Val Gly Pro Leu Ala 
    610                 615                 620                 


Arg Gly Asn Ser Arg Phe Ala Trp Met Thr Arg Lys Ser Glu Glu Thr 
625                 630                 635                 640 


Ile Thr Pro Trp Asn Phe Glu Glu Val Val Asp Lys Gly Ala Ser Ala 
                645                 650                 655     


Gln Ser Phe Ile Glu Arg Met Thr Asn Phe Asp Lys Asn Leu Pro Asn 
            660                 665                 670         


Glu Lys Val Leu Pro Lys His Ser Leu Leu Tyr Glu Tyr Phe Thr Val 
        675                 680                 685             


Tyr Asn Glu Leu Thr Lys Val Lys Tyr Val Thr Glu Gly Met Arg Lys 
    690                 695                 700                 


Pro Ala Phe Leu Ser Gly Glu Gln Lys Lys Ala Ile Val Asp Leu Leu 
705                 710                 715                 720 


Phe Lys Thr Asn Arg Lys Val Thr Val Lys Gln Leu Lys Glu Asp Tyr 
                725                 730                 735     


Phe Lys Lys Ile Glu Cys Phe Asp Ser Val Glu Ile Ser Gly Val Glu 
            740                 745                 750         


Asp Arg Phe Asn Ala Ser Leu Gly Thr Tyr His Asp Leu Leu Lys Ile 
        755                 760                 765             


Ile Lys Asp Lys Asp Phe Leu Asp Asn Glu Glu Asn Glu Asp Ile Leu 
    770                 775                 780                 


Glu Asp Ile Val Leu Thr Leu Thr Leu Phe Glu Asp Arg Glu Met Ile 
785                 790                 795                 800 


Glu Glu Arg Leu Lys Thr Tyr Ala His Leu Phe Asp Asp Lys Val Met 
                805                 810                 815     


Lys Gln Leu Lys Arg Arg Arg Tyr Thr Gly Trp Gly Arg Leu Ser Arg 
            820                 825                 830         


Lys Leu Ile Asn Gly Ile Arg Asp Lys Gln Ser Gly Lys Thr Ile Leu 
        835                 840                 845             


Asp Phe Leu Lys Ser Asp Gly Phe Ala Asn Arg Asn Phe Met Gln Leu 
    850                 855                 860                 


Ile His Asp Asp Ser Leu Thr Phe Lys Glu Asp Ile Gln Lys Ala Gln 
865                 870                 875                 880 


Val Ser Gly Gln Gly Asp Ser Leu His Glu His Ile Ala Asn Leu Ala 
                885                 890                 895     


Gly Ser Pro Ala Ile Lys Lys Gly Ile Leu Gln Thr Val Lys Val Val 
            900                 905                 910         


Asp Glu Leu Val Lys Val Met Gly Arg His Lys Pro Glu Asn Ile Val 
        915                 920                 925             


Ile Glu Met Ala Arg Glu Asn Gln Thr Thr Gln Lys Gly Gln Lys Asn 
    930                 935                 940                 


Ser Arg Glu Arg Met Lys Arg Ile Glu Glu Gly Ile Lys Glu Leu Gly 
945                 950                 955                 960 


Ser Gln Ile Leu Lys Glu His Pro Val Glu Asn Thr Gln Leu Gln Asn 
                965                 970                 975     


Glu Lys Leu Tyr Leu Tyr Tyr Leu Gln Asn Gly Arg Asp Met Tyr Val 
            980                 985                 990         


Asp Gln Glu Leu Asp Ile Asn Arg  Leu Ser Asp Tyr Asp  Val Asp His 
        995                 1000                 1005             


Ile Val  Pro Gln Ser Phe Leu  Lys Asp Asp Ser Ile  Asp Asn Lys 
    1010                 1015                 1020             


Val Leu  Thr Arg Ser Asp Lys  Asn Arg Gly Lys Ser  Asp Asn Val 
    1025                 1030                 1035             


Pro Ser  Glu Glu Val Val Lys  Lys Met Lys Asn Tyr  Trp Arg Gln 
    1040                 1045                 1050             


Leu Leu  Asn Ala Lys Leu Ile  Thr Gln Arg Lys Phe  Asp Asn Leu 
    1055                 1060                 1065             


Thr Lys  Ala Glu Arg Gly Gly  Leu Ser Glu Leu Asp  Lys Ala Gly 
    1070                 1075                 1080             


Phe Ile  Lys Arg Gln Leu Val  Glu Thr Arg Gln Ile  Thr Lys His 
    1085                 1090                 1095             


Val Ala  Gln Ile Leu Asp Ser  Arg Met Asn Thr Lys  Tyr Asp Glu 
    1100                 1105                 1110             


Asn Asp  Lys Leu Ile Arg Glu  Val Lys Val Ile Thr  Leu Lys Ser 
    1115                 1120                 1125             


Lys Leu  Val Ser Asp Phe Arg  Lys Asp Phe Gln Phe  Tyr Lys Val 
    1130                 1135                 1140             


Arg Glu  Ile Asn Asn Tyr His  His Ala His Asp Ala  Tyr Leu Asn 
    1145                 1150                 1155             


Ala Val  Val Gly Thr Ala Leu  Ile Lys Lys Tyr Pro  Lys Leu Glu 
    1160                 1165                 1170             


Ser Glu  Phe Val Tyr Gly Asp  Tyr Lys Val Tyr Asp  Val Arg Lys 
    1175                 1180                 1185             


Met Ile  Ala Lys Ser Glu Gln  Glu Ile Gly Lys Ala  Thr Ala Lys 
    1190                 1195                 1200             


Tyr Phe  Phe Tyr Ser Asn Ile  Met Asn Phe Phe Lys  Thr Glu Ile 
    1205                 1210                 1215             


Thr Leu  Ala Asn Gly Glu Ile  Arg Lys Arg Pro Leu  Ile Glu Thr 
    1220                 1225                 1230             


Asn Gly  Glu Thr Gly Glu Ile  Val Trp Asp Lys Gly  Arg Asp Phe 
    1235                 1240                 1245             


Ala Thr  Val Arg Lys Val Leu  Ser Met Pro Gln Val  Asn Ile Val 
    1250                 1255                 1260             


Lys Lys  Thr Glu Val Gln Thr  Gly Gly Phe Ser Lys  Glu Ser Ile 
    1265                 1270                 1275             


Leu Pro  Lys Arg Asn Ser Asp  Lys Leu Ile Ala Arg  Lys Lys Asp 
    1280                 1285                 1290             


Trp Asp  Pro Lys Lys Tyr Gly  Gly Phe Asp Ser Pro  Thr Val Ala 
    1295                 1300                 1305             


Tyr Ser  Val Leu Val Val Ala  Lys Val Glu Lys Gly  Lys Ser Lys 
    1310                 1315                 1320             


Lys Leu  Lys Ser Val Lys Glu  Leu Leu Gly Ile Thr  Ile Met Glu 
    1325                 1330                 1335             


Arg Ser  Ser Phe Glu Lys Asn  Pro Ile Asp Phe Leu  Glu Ala Lys 
    1340                 1345                 1350             


Gly Tyr  Lys Glu Val Lys Lys  Asp Leu Ile Ile Lys  Leu Pro Lys 
    1355                 1360                 1365             


Tyr Ser  Leu Phe Glu Leu Glu  Asn Gly Arg Lys Arg  Met Leu Ala 
    1370                 1375                 1380             


Ser Ala  Gly Glu Leu Gln Lys  Gly Asn Glu Leu Ala  Leu Pro Ser 
    1385                 1390                 1395             


Lys Tyr  Val Asn Phe Leu Tyr  Leu Ala Ser His Tyr  Glu Lys Leu 
    1400                 1405                 1410             


Lys Gly  Ser Pro Glu Asp Asn  Glu Gln Lys Gln Leu  Phe Val Glu 
    1415                 1420                 1425             


Gln His  Lys His Tyr Leu Asp  Glu Ile Ile Glu Gln  Ile Ser Glu 
    1430                 1435                 1440             


Phe Ser  Lys Arg Val Ile Leu  Ala Asp Ala Asn Leu  Asp Lys Val 
    1445                 1450                 1455             


Leu Ser  Ala Tyr Asn Lys His  Arg Asp Lys Pro Ile  Arg Glu Gln 
    1460                 1465                 1470             


Ala Glu  Asn Ile Ile His Leu  Phe Thr Leu Thr Asn  Leu Gly Ala 
    1475                 1480                 1485             


Pro Ala  Ala Phe Lys Tyr Phe  Asp Thr Thr Ile Asp  Arg Lys Arg 
    1490                 1495                 1500             


Tyr Thr  Ser Thr Lys Glu Val  Leu Asp Ala Thr Leu  Ile His Gln 
    1505                 1510                 1515             


Ser Ile  Thr Gly Leu Tyr Glu  Thr Arg Ile Asp Leu  Ser Gln Leu 
    1520                 1525                 1530             


Gly Gly  Asp Lys Arg Pro Ala  Ala Thr Lys Lys Ala  Gly Gln Ala 
    1535                 1540                 1545             


Lys Lys  Lys Lys Leu 
    1550             


<210>  28
<211>  1548
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Amino acid sequence of the complete Cas9-UL12 fusion protein U4

<400>  28

Met Asp Tyr Lys Asp His Asp Gly Asp Tyr Lys Asp His Asp Ile Asp 
1               5                   10                  15      


Tyr Lys Asp Asp Asp Asp Lys Met Ala Pro Lys Lys Lys Arg Lys Val 
            20                  25                  30          


Gly Ile His Gly Val Pro Ala Ala Asp Lys Lys Tyr Ser Ile Gly Leu 
        35                  40                  45              


Asp Ile Gly Thr Asn Ser Val Gly Trp Ala Val Ile Thr Asp Glu Tyr 
    50                  55                  60                  


Lys Val Pro Ser Lys Lys Phe Lys Val Leu Gly Asn Thr Asp Arg His 
65                  70                  75                  80  


Ser Ile Lys Lys Asn Leu Ile Gly Ala Leu Leu Phe Asp Ser Gly Glu 
                85                  90                  95      


Thr Ala Glu Ala Thr Arg Leu Lys Arg Thr Ala Arg Arg Arg Tyr Thr 
            100                 105                 110         


Arg Arg Lys Asn Arg Ile Cys Tyr Leu Gln Glu Ile Phe Ser Asn Glu 
        115                 120                 125             


Met Ala Lys Val Asp Asp Ser Phe Phe His Arg Leu Glu Glu Ser Phe 
    130                 135                 140                 


Leu Val Glu Glu Asp Lys Lys His Glu Arg His Pro Ile Phe Gly Asn 
145                 150                 155                 160 


Ile Val Asp Glu Val Ala Tyr His Glu Lys Tyr Pro Thr Ile Tyr His 
                165                 170                 175     


Leu Arg Lys Lys Leu Val Asp Ser Thr Asp Lys Ala Asp Leu Arg Leu 
            180                 185                 190         


Ile Tyr Leu Ala Leu Ala His Met Ile Lys Phe Arg Gly His Phe Leu 
        195                 200                 205             


Ile Glu Gly Asp Leu Asn Pro Asp Asn Ser Asp Val Asp Lys Leu Phe 
    210                 215                 220                 


Ile Gln Leu Val Gln Thr Tyr Asn Gln Leu Phe Glu Glu Asn Pro Ile 
225                 230                 235                 240 


Asn Ala Ser Gly Val Asp Ala Lys Ala Ile Leu Ser Ala Arg Leu Ser 
                245                 250                 255     


Lys Ser Arg Arg Leu Glu Asn Leu Ile Ala Gln Leu Pro Gly Glu Lys 
            260                 265                 270         


Lys Asn Gly Leu Phe Gly Asn Leu Ile Ala Leu Ser Leu Gly Leu Thr 
        275                 280                 285             


Pro Asn Phe Lys Ser Asn Phe Asp Leu Ala Glu Asp Ala Lys Leu Gln 
    290                 295                 300                 


Leu Ser Lys Asp Thr Tyr Asp Asp Asp Leu Asp Asn Leu Leu Ala Gln 
305                 310                 315                 320 


Ile Gly Asp Gln Tyr Ala Asp Leu Phe Leu Ala Ala Lys Asn Leu Ser 
                325                 330                 335     


Asp Ala Ile Leu Leu Ser Asp Ile Leu Arg Val Asn Thr Glu Ile Thr 
            340                 345                 350         


Lys Ala Pro Leu Ser Ala Ser Met Ile Lys Arg Tyr Asp Glu His His 
        355                 360                 365             


Gln Asp Leu Thr Leu Leu Lys Ala Leu Val Arg Gln Gln Leu Pro Glu 
    370                 375                 380                 


Lys Tyr Lys Glu Ile Phe Phe Asp Gln Ser Lys Asn Gly Tyr Ala Gly 
385                 390                 395                 400 


Tyr Ile Asp Gly Gly Ala Ser Gln Glu Glu Phe Tyr Lys Phe Ile Lys 
                405                 410                 415     


Pro Ile Leu Glu Lys Met Asp Gly Thr Glu Glu Leu Leu Val Lys Leu 
            420                 425                 430         


Asn Arg Glu Asp Leu Leu Arg Lys Gln Arg Thr Phe Asp Asn Gly Ser 
        435                 440                 445             


Ile Pro His Gln Ile His Leu Gly Glu Leu His Ala Ile Leu Arg Arg 
    450                 455                 460                 


Gln Glu Asp Phe Tyr Pro Phe Leu Lys Asp Asn Arg Glu Lys Ile Glu 
465                 470                 475                 480 


Lys Ile Leu Thr Phe Arg Ile Pro Tyr Tyr Val Gly Pro Leu Ala Arg 
                485                 490                 495     


Gly Asn Ser Arg Phe Ala Trp Met Thr Arg Lys Ser Glu Glu Thr Ile 
            500                 505                 510         


Thr Pro Trp Asn Phe Glu Glu Val Val Asp Lys Gly Ala Ser Ala Gln 
        515                 520                 525             


Ser Phe Ile Glu Arg Met Thr Asn Phe Asp Lys Asn Leu Pro Asn Glu 
    530                 535                 540                 


Lys Val Leu Pro Lys His Ser Leu Leu Tyr Glu Tyr Phe Thr Val Tyr 
545                 550                 555                 560 


Asn Glu Leu Thr Lys Val Lys Tyr Val Thr Glu Gly Met Arg Lys Pro 
                565                 570                 575     


Ala Phe Leu Ser Gly Glu Gln Lys Lys Ala Ile Val Asp Leu Leu Phe 
            580                 585                 590         


Lys Thr Asn Arg Lys Val Thr Val Lys Gln Leu Lys Glu Asp Tyr Phe 
        595                 600                 605             


Lys Lys Ile Glu Cys Phe Asp Ser Val Glu Ile Ser Gly Val Glu Asp 
    610                 615                 620                 


Arg Phe Asn Ala Ser Leu Gly Thr Tyr His Asp Leu Leu Lys Ile Ile 
625                 630                 635                 640 


Lys Asp Lys Asp Phe Leu Asp Asn Glu Glu Asn Glu Asp Ile Leu Glu 
                645                 650                 655     


Asp Ile Val Leu Thr Leu Thr Leu Phe Glu Asp Arg Glu Met Ile Glu 
            660                 665                 670         


Glu Arg Leu Lys Thr Tyr Ala His Leu Phe Asp Asp Lys Val Met Lys 
        675                 680                 685             


Gln Leu Lys Arg Arg Arg Tyr Thr Gly Trp Gly Arg Leu Ser Arg Lys 
    690                 695                 700                 


Leu Ile Asn Gly Ile Arg Asp Lys Gln Ser Gly Lys Thr Ile Leu Asp 
705                 710                 715                 720 


Phe Leu Lys Ser Asp Gly Phe Ala Asn Arg Asn Phe Met Gln Leu Ile 
                725                 730                 735     


His Asp Asp Ser Leu Thr Phe Lys Glu Asp Ile Gln Lys Ala Gln Val 
            740                 745                 750         


Ser Gly Gln Gly Asp Ser Leu His Glu His Ile Ala Asn Leu Ala Gly 
        755                 760                 765             


Ser Pro Ala Ile Lys Lys Gly Ile Leu Gln Thr Val Lys Val Val Asp 
    770                 775                 780                 


Glu Leu Val Lys Val Met Gly Arg His Lys Pro Glu Asn Ile Val Ile 
785                 790                 795                 800 


Glu Met Ala Arg Glu Asn Gln Thr Thr Gln Lys Gly Gln Lys Asn Ser 
                805                 810                 815     


Arg Glu Arg Met Lys Arg Ile Glu Glu Gly Ile Lys Glu Leu Gly Ser 
            820                 825                 830         


Gln Ile Leu Lys Glu His Pro Val Glu Asn Thr Gln Leu Gln Asn Glu 
        835                 840                 845             


Lys Leu Tyr Leu Tyr Tyr Leu Gln Asn Gly Arg Asp Met Tyr Val Asp 
    850                 855                 860                 


Gln Glu Leu Asp Ile Asn Arg Leu Ser Asp Tyr Asp Val Asp His Ile 
865                 870                 875                 880 


Val Pro Gln Ser Phe Leu Lys Asp Asp Ser Ile Asp Asn Lys Val Leu 
                885                 890                 895     


Thr Arg Ser Asp Lys Asn Arg Gly Lys Ser Asp Asn Val Pro Ser Glu 
            900                 905                 910         


Glu Val Val Lys Lys Met Lys Asn Tyr Trp Arg Gln Leu Leu Asn Ala 
        915                 920                 925             


Lys Leu Ile Thr Gln Arg Lys Phe Asp Asn Leu Thr Lys Ala Glu Arg 
    930                 935                 940                 


Gly Gly Leu Ser Glu Leu Asp Lys Ala Gly Phe Ile Lys Arg Gln Leu 
945                 950                 955                 960 


Val Glu Thr Arg Gln Ile Thr Lys His Val Ala Gln Ile Leu Asp Ser 
                965                 970                 975     


Arg Met Asn Thr Lys Tyr Asp Glu Asn Asp Lys Leu Ile Arg Glu Val 
            980                 985                 990         


Lys Val Ile Thr Leu Lys Ser Lys  Leu Val Ser Asp Phe  Arg Lys Asp 
        995                 1000                 1005             


Phe Gln  Phe Tyr Lys Val Arg  Glu Ile Asn Asn Tyr  His His Ala 
    1010                 1015                 1020             


His Asp  Ala Tyr Leu Asn Ala  Val Val Gly Thr Ala  Leu Ile Lys 
    1025                 1030                 1035             


Lys Tyr  Pro Lys Leu Glu Ser  Glu Phe Val Tyr Gly  Asp Tyr Lys 
    1040                 1045                 1050             


Val Tyr  Asp Val Arg Lys Met  Ile Ala Lys Ser Glu  Gln Glu Ile 
    1055                 1060                 1065             


Gly Lys  Ala Thr Ala Lys Tyr  Phe Phe Tyr Ser Asn  Ile Met Asn 
    1070                 1075                 1080             


Phe Phe  Lys Thr Glu Ile Thr  Leu Ala Asn Gly Glu  Ile Arg Lys 
    1085                 1090                 1095             


Arg Pro  Leu Ile Glu Thr Asn  Gly Glu Thr Gly Glu  Ile Val Trp 
    1100                 1105                 1110             


Asp Lys  Gly Arg Asp Phe Ala  Thr Val Arg Lys Val  Leu Ser Met 
    1115                 1120                 1125             


Pro Gln  Val Asn Ile Val Lys  Lys Thr Glu Val Gln  Thr Gly Gly 
    1130                 1135                 1140             


Phe Ser  Lys Glu Ser Ile Leu  Pro Lys Arg Asn Ser  Asp Lys Leu 
    1145                 1150                 1155             


Ile Ala  Arg Lys Lys Asp Trp  Asp Pro Lys Lys Tyr  Gly Gly Phe 
    1160                 1165                 1170             


Asp Ser  Pro Thr Val Ala Tyr  Ser Val Leu Val Val  Ala Lys Val 
    1175                 1180                 1185             


Glu Lys  Gly Lys Ser Lys Lys  Leu Lys Ser Val Lys  Glu Leu Leu 
    1190                 1195                 1200             


Gly Ile  Thr Ile Met Glu Arg  Ser Ser Phe Glu Lys  Asn Pro Ile 
    1205                 1210                 1215             


Asp Phe  Leu Glu Ala Lys Gly  Tyr Lys Glu Val Lys  Lys Asp Leu 
    1220                 1225                 1230             


Ile Ile  Lys Leu Pro Lys Tyr  Ser Leu Phe Glu Leu  Glu Asn Gly 
    1235                 1240                 1245             


Arg Lys  Arg Met Leu Ala Ser  Ala Gly Glu Leu Gln  Lys Gly Asn 
    1250                 1255                 1260             


Glu Leu  Ala Leu Pro Ser Lys  Tyr Val Asn Phe Leu  Tyr Leu Ala 
    1265                 1270                 1275             


Ser His  Tyr Glu Lys Leu Lys  Gly Ser Pro Glu Asp  Asn Glu Gln 
    1280                 1285                 1290             


Lys Gln  Leu Phe Val Glu Gln  His Lys His Tyr Leu  Asp Glu Ile 
    1295                 1300                 1305             


Ile Glu  Gln Ile Ser Glu Phe  Ser Lys Arg Val Ile  Leu Ala Asp 
    1310                 1315                 1320             


Ala Asn  Leu Asp Lys Val Leu  Ser Ala Tyr Asn Lys  His Arg Asp 
    1325                 1330                 1335             


Lys Pro  Ile Arg Glu Gln Ala  Glu Asn Ile Ile His  Leu Phe Thr 
    1340                 1345                 1350             


Leu Thr  Asn Leu Gly Ala Pro  Ala Ala Phe Lys Tyr  Phe Asp Thr 
    1355                 1360                 1365             


Thr Ile  Asp Arg Lys Arg Tyr  Thr Ser Thr Lys Glu  Val Leu Asp 
    1370                 1375                 1380             


Ala Thr  Leu Ile His Gln Ser  Ile Thr Gly Leu Tyr  Glu Thr Arg 
    1385                 1390                 1395             


Ile Asp  Leu Ser Gln Leu Gly  Gly Asp Lys Arg Pro  Ala Ala Thr 
    1400                 1405                 1410             


Lys Lys  Ala Gly Gln Ala Lys  Lys Lys Met Glu Ser  Thr Gly Gly 
    1415                 1420                 1425             


Pro Ala  Cys Pro Pro Gly Arg  Thr Val Thr Lys Arg  Ser Trp Ala 
    1430                 1435                 1440             


Leu Ala  Glu Asp Thr Pro Arg  Gly Pro Asp Ser Pro  Pro Lys Arg 
    1445                 1450                 1455             


Pro Arg  Pro Asn Ser Leu Pro  Leu Thr Thr Thr Phe  Arg Pro Leu 
    1460                 1465                 1470             


Pro Pro  Pro Pro Gln Thr Thr  Ser Ala Val Asp Pro  Ser Ser His 
    1475                 1480                 1485             


Ser Pro  Val Asn Pro Pro Arg  Asp Gln His Ala Thr  Asp Thr Ala 
    1490                 1495                 1500             


Asp Glu  Lys Pro Arg Ala Ala  Ser Pro Ala Leu Ser  Asp Ala Ser 
    1505                 1510                 1515             


Gly Pro  Pro Thr Pro Asp Ile  Pro Leu Ser Pro Gly  Gly Thr His 
    1520                 1525                 1530             


Ala Arg  Asp Pro Asp Ala Asp  Pro Asp Ser Pro Asp  Leu Asp Ser 
    1535                 1540                 1545             


<210>  29
<211>  1872
<212>  PRT
<213>  Artificial sequence

<220>
<223>  An exemplary sequence of TAF1


<220>
<221>  misc_feature
<222>  (695)..(695)
<223>  Xaa can be any naturally occurring amino acid

<400>  29

Met Gly Pro Gly Cys Asp Leu Leu Leu Arg Thr Ala Ala Thr Ile Thr 
1               5                   10                  15      


Ala Ala Ala Ile Met Ser Asp Thr Asp Ser Asp Glu Asp Ser Ala Gly 
            20                  25                  30          


Gly Gly Pro Phe Ser Leu Ala Gly Phe Leu Phe Gly Asn Ile Asn Gly 
        35                  40                  45              


Ala Gly Gln Leu Glu Gly Glu Ser Val Leu Asp Asp Glu Cys Lys Lys 
    50                  55                  60                  


His Leu Ala Gly Leu Gly Ala Leu Gly Leu Gly Ser Leu Ile Thr Glu 
65                  70                  75                  80  


Leu Thr Ala Asn Glu Glu Leu Thr Gly Thr Asp Gly Ala Leu Val Asn 
                85                  90                  95      


Asp Glu Gly Trp Val Arg Ser Thr Glu Asp Ala Val Asp Tyr Ser Asp 
            100                 105                 110         


Ile Asn Glu Val Ala Glu Asp Glu Ser Arg Arg Tyr Gln Gln Thr Met 
        115                 120                 125             


Gly Ser Leu Gln Pro Leu Cys His Ser Asp Tyr Asp Glu Asp Asp Tyr 
    130                 135                 140                 


Asp Ala Asp Cys Glu Asp Ile Asp Cys Lys Leu Met Pro Pro Pro Pro 
145                 150                 155                 160 


Pro Pro Pro Gly Pro Met Lys Lys Asp Lys Asp Gln Asp Ser Ile Thr 
                165                 170                 175     


Gly Glu Lys Val Asp Phe Ser Ser Ser Ser Asp Ser Glu Ser Glu Met 
            180                 185                 190         


Gly Pro Gln Glu Ala Thr Gln Ala Glu Ser Glu Asp Gly Lys Leu Thr 
        195                 200                 205             


Leu Pro Leu Ala Gly Ile Met Gln His Asp Ala Thr Lys Leu Leu Pro 
    210                 215                 220                 


Ser Val Thr Glu Leu Phe Pro Glu Phe Arg Pro Gly Lys Val Leu Arg 
225                 230                 235                 240 


Phe Leu Arg Leu Phe Gly Pro Gly Lys Asn Val Pro Ser Val Trp Arg 
                245                 250                 255     


Ser Ala Arg Arg Lys Arg Lys Lys Lys His Arg Glu Leu Ile Gln Glu 
            260                 265                 270         


Glu Gln Ile Gln Glu Val Glu Cys Ser Val Glu Ser Glu Val Ser Gln 
        275                 280                 285             


Lys Ser Leu Trp Asn Tyr Asp Tyr Ala Pro Pro Pro Pro Pro Glu Gln 
    290                 295                 300                 


Cys Leu Ser Asp Asp Glu Ile Thr Met Met Ala Pro Val Glu Ser Lys 
305                 310                 315                 320 


Phe Ser Gln Ser Thr Gly Asp Ile Asp Lys Val Thr Asp Thr Lys Pro 
                325                 330                 335     


Arg Val Ala Glu Trp Arg Tyr Gly Pro Ala Arg Leu Trp Tyr Asp Met 
            340                 345                 350         


Leu Gly Val Pro Glu Asp Gly Ser Gly Phe Asp Tyr Gly Phe Lys Leu 
        355                 360                 365             


Arg Lys Thr Glu His Glu Pro Val Ile Lys Ser Arg Met Ile Glu Glu 
    370                 375                 380                 


Phe Arg Lys Leu Glu Glu Asn Asn Gly Thr Asp Leu Leu Ala Asp Glu 
385                 390                 395                 400 


Asn Phe Leu Met Val Thr Gln Leu His Trp Glu Asp Asp Ile Ile Trp 
                405                 410                 415     


Asp Gly Glu Asp Val Lys His Lys Gly Thr Lys Pro Gln Arg Ala Ser 
            420                 425                 430         


Leu Ala Gly Trp Leu Pro Ser Ser Met Thr Arg Asn Ala Met Ala Tyr 
        435                 440                 445             


Asn Val Gln Gln Gly Phe Ala Ala Thr Leu Asp Asp Asp Lys Pro Trp 
    450                 455                 460                 


Tyr Ser Ile Phe Pro Ile Asp Asn Glu Asp Leu Val Tyr Gly Arg Trp 
465                 470                 475                 480 


Glu Asp Asn Ile Ile Trp Asp Ala Gln Ala Met Pro Arg Leu Leu Glu 
                485                 490                 495     


Pro Pro Val Leu Thr Leu Asp Pro Asn Asp Glu Asn Leu Ile Leu Glu 
            500                 505                 510         


Ile Pro Asp Glu Lys Glu Glu Ala Thr Ser Asn Ser Pro Ser Lys Glu 
        515                 520                 525             


Ser Lys Lys Glu Ser Ser Leu Lys Lys Ser Arg Ile Leu Leu Gly Lys 
    530                 535                 540                 


Thr Gly Val Ile Lys Glu Glu Pro Gln Gln Asn Met Ser Gln Pro Glu 
545                 550                 555                 560 


Val Lys Asp Pro Trp Asn Leu Ser Asn Asp Glu Tyr Tyr Tyr Pro Lys 
                565                 570                 575     


Gln Gln Gly Leu Arg Gly Thr Phe Gly Gly Asn Ile Ile Gln His Ser 
            580                 585                 590         


Ile Pro Ala Val Glu Leu Arg Gln Pro Phe Phe Pro Thr His Met Gly 
        595                 600                 605             


Pro Ile Lys Leu Arg Gln Phe His Arg Pro Pro Leu Lys Lys Tyr Ser 
    610                 615                 620                 


Phe Gly Ala Leu Ser Gln Pro Gly Pro His Ser Val Gln Pro Leu Leu 
625                 630                 635                 640 


Lys His Ile Lys Lys Lys Ala Lys Met Arg Glu Gln Glu Arg Gln Ala 
                645                 650                 655     


Ser Gly Gly Gly Glu Met Phe Phe Met Arg Thr Pro Gln Asp Leu Thr 
            660                 665                 670         


Gly Lys Asp Gly Asp Leu Ile Leu Ala Glu Tyr Ser Glu Glu Asn Gly 
        675                 680                 685             


Pro Leu Met Met Gln Val Xaa Met Ala Thr Lys Ile Lys Asn Tyr Tyr 
    690                 695                 700                 


Lys Arg Lys Pro Gly Lys Asp Pro Gly Ala Pro Asp Cys Lys Tyr Gly 
705                 710                 715                 720 


Glu Thr Val Tyr Cys His Thr Ser Pro Phe Leu Gly Ser Leu His Pro 
                725                 730                 735     


Gly Gln Leu Leu Gln Ala Phe Glu Asn Asn Leu Phe Arg Ala Pro Ile 
            740                 745                 750         


Tyr Leu His Lys Met Pro Glu Thr Asp Phe Leu Ile Ile Arg Thr Arg 
        755                 760                 765             


Gln Gly Tyr Tyr Ile Arg Glu Leu Val Asp Ile Phe Val Val Gly Gln 
    770                 775                 780                 


Gln Cys Pro Leu Phe Glu Val Pro Gly Pro Asn Ser Lys Arg Ala Asn 
785                 790                 795                 800 


Thr His Ile Arg Asp Phe Leu Gln Val Phe Ile Tyr Arg Leu Phe Trp 
                805                 810                 815     


Lys Ser Lys Asp Arg Pro Arg Arg Ile Arg Met Glu Asp Ile Lys Lys 
            820                 825                 830         


Ala Phe Pro Ser His Ser Glu Ser Ser Ile Arg Lys Arg Leu Lys Leu 
        835                 840                 845             


Cys Ala Asp Phe Lys Arg Thr Gly Met Asp Ser Asn Trp Trp Val Leu 
    850                 855                 860                 


Lys Ser Asp Phe Arg Leu Pro Thr Glu Glu Glu Ile Arg Ala Met Val 
865                 870                 875                 880 


Ser Pro Glu Gln Cys Cys Ala Tyr Tyr Ser Met Ile Ala Ala Glu Gln 
                885                 890                 895     


Arg Leu Lys Asp Ala Gly Tyr Gly Glu Lys Ser Phe Phe Ala Pro Glu 
            900                 905                 910         


Glu Glu Asn Glu Glu Asp Phe Gln Met Lys Ile Asp Asp Glu Val Arg 
        915                 920                 925             


Thr Ala Pro Trp Asn Thr Thr Arg Ala Phe Ile Ala Ala Met Lys Gly 
    930                 935                 940                 


Lys Cys Leu Leu Glu Val Thr Gly Val Ala Asp Pro Thr Gly Cys Gly 
945                 950                 955                 960 


Glu Gly Phe Ser Tyr Val Lys Ile Pro Asn Lys Pro Thr Gln Gln Lys 
                965                 970                 975     


Asp Asp Lys Glu Pro Gln Pro Val Lys Lys Thr Val Thr Gly Thr Asp 
            980                 985                 990         


Ala Asp Leu Arg Arg Leu Ser Leu  Lys Asn Ala Lys Gln  Leu Leu Arg 
        995                 1000                 1005             


Lys Phe  Gly Val Pro Glu Glu  Glu Ile Lys Lys Leu  Ser Arg Trp 
    1010                 1015                 1020             


Glu Val  Ile Asp Val Val Arg  Thr Met Ser Thr Glu  Gln Ala Arg 
    1025                 1030                 1035             


Ser Gly  Glu Gly Pro Met Ser  Lys Phe Ala Arg Gly  Ser Arg Phe 
    1040                 1045                 1050             


Ser Val  Ala Glu His Gln Glu  Arg Tyr Lys Glu Glu  Cys Gln Arg 
    1055                 1060                 1065             


Ile Phe  Asp Leu Gln Asn Lys  Val Leu Ser Ser Thr  Glu Val Leu 
    1070                 1075                 1080             


Ser Thr  Asp Thr Asp Ser Ser  Ser Ala Glu Asp Ser  Asp Phe Glu 
    1085                 1090                 1095             


Glu Met  Gly Lys Asn Ile Glu  Asn Met Leu Gln Asn  Lys Lys Thr 
    1100                 1105                 1110             


Ser Ser  Gln Leu Ser Arg Glu  Arg Glu Glu Gln Glu  Arg Lys Glu 
    1115                 1120                 1125             


Leu Gln  Arg Met Leu Leu Ala  Ala Gly Ser Ala Ala  Ser Gly Asn 
    1130                 1135                 1140             


Asn His  Arg Asp Asp Asp Thr  Ala Ser Val Thr Ser  Leu Asn Ser 
    1145                 1150                 1155             


Ser Ala  Thr Gly Arg Cys Leu  Lys Ile Tyr Arg Thr  Phe Arg Asp 
    1160                 1165                 1170             


Glu Glu  Gly Lys Glu Tyr Val  Arg Cys Glu Thr Val  Arg Lys Pro 
    1175                 1180                 1185             


Ala Val  Ile Asp Ala Tyr Val  Arg Ile Arg Thr Thr  Lys Asp Glu 
    1190                 1195                 1200             


Glu Phe  Ile Arg Lys Phe Ala  Leu Phe Asp Glu Gln  His Arg Glu 
    1205                 1210                 1215             


Glu Met  Arg Lys Glu Arg Arg  Arg Ile Gln Glu Gln  Leu Arg Arg 
    1220                 1225                 1230             


Leu Lys  Arg Asn Gln Glu Lys  Glu Lys Leu Lys Gly  Pro Pro Glu 
    1235                 1240                 1245             


Lys Lys  Pro Lys Lys Met Lys  Glu Arg Pro Asp Leu  Lys Leu Lys 
    1250                 1255                 1260             


Cys Gly  Ala Cys Gly Ala Ile  Gly His Met Arg Thr  Asn Lys Phe 
    1265                 1270                 1275             


Cys Pro  Leu Tyr Tyr Gln Thr  Asn Ala Pro Pro Ser  Asn Pro Val 
    1280                 1285                 1290             


Ala Met  Thr Glu Glu Gln Glu  Glu Glu Leu Glu Lys  Thr Val Ile 
    1295                 1300                 1305             


His Asn  Asp Asn Glu Glu Leu  Ile Lys Val Glu Gly  Thr Lys Ile 
    1310                 1315                 1320             


Val Leu  Gly Lys Gln Leu Ile  Glu Ser Ala Asp Glu  Val Arg Arg 
    1325                 1330                 1335             


Lys Ser  Leu Val Leu Lys Phe  Pro Lys Gln Gln Leu  Pro Pro Lys 
    1340                 1345                 1350             


Lys Lys  Arg Arg Val Gly Thr  Thr Val His Cys Asp  Tyr Leu Asn 
    1355                 1360                 1365             


Arg Pro  His Lys Ser Ile His  Arg Arg Arg Thr Asp  Pro Met Val 
    1370                 1375                 1380             


Thr Leu  Ser Ser Ile Leu Glu  Ser Ile Ile Asn Asp  Met Arg Asp 
    1385                 1390                 1395             


Leu Pro  Asn Thr Tyr Pro Phe  His Thr Pro Val Asn  Ala Lys Val 
    1400                 1405                 1410             


Val Lys  Asp Tyr Tyr Lys Ile  Ile Thr Arg Pro Met  Asp Leu Gln 
    1415                 1420                 1425             


Thr Leu  Arg Glu Asn Val Arg  Lys Arg Leu Tyr Pro  Ser Arg Glu 
    1430                 1435                 1440             


Glu Phe  Arg Glu His Leu Glu  Leu Ile Val Lys Asn  Ser Ala Thr 
    1445                 1450                 1455             


Tyr Asn  Gly Pro Lys His Ser  Leu Thr Gln Ile Ser  Gln Ser Met 
    1460                 1465                 1470             


Leu Asp  Leu Cys Asp Glu Lys  Leu Lys Glu Lys Glu  Asp Lys Leu 
    1475                 1480                 1485             


Ala Arg  Leu Glu Lys Ala Ile  Asn Pro Leu Leu Asp  Asp Asp Asp 
    1490                 1495                 1500             


Gln Val  Ala Phe Ser Phe Ile  Leu Asp Asn Ile Val  Thr Gln Lys 
    1505                 1510                 1515             


Met Met  Ala Val Pro Asp Ser  Trp Pro Phe His His  Pro Val Asn 
    1520                 1525                 1530             


Lys Lys  Phe Val Pro Asp Tyr  Tyr Lys Val Ile Val  Asn Pro Met 
    1535                 1540                 1545             


Asp Leu  Glu Thr Ile Arg Lys  Asn Ile Ser Lys His  Lys Tyr Gln 
    1550                 1555                 1560             


Ser Arg  Glu Ser Phe Leu Asp  Asp Val Asn Leu Ile  Leu Ala Asn 
    1565                 1570                 1575             


Ser Val  Lys Tyr Asn Gly Pro  Glu Ser Gln Tyr Thr  Lys Thr Ala 
    1580                 1585                 1590             


Gln Glu  Ile Val Asn Val Cys  Tyr Gln Thr Leu Thr  Glu Tyr Asp 
    1595                 1600                 1605             


Glu His  Leu Thr Gln Leu Glu  Lys Asp Ile Cys Thr  Ala Lys Glu 
    1610                 1615                 1620             


Ala Ala  Leu Glu Glu Ala Glu  Leu Glu Ser Leu Asp  Pro Met Thr 
    1625                 1630                 1635             


Pro Gly  Pro Tyr Thr Pro Gln  Pro Pro Asp Leu Tyr  Asp Thr Asn 
    1640                 1645                 1650             


Thr Ser  Leu Ser Met Ser Arg  Asp Ala Ser Val Phe  Gln Asp Glu 
    1655                 1660                 1665             


Ser Asn  Met Ser Val Leu Asp  Ile Pro Ser Ala Thr  Pro Glu Lys 
    1670                 1675                 1680             


Gln Val  Thr Gln Glu Gly Glu  Asp Gly Asp Gly Asp  Leu Ala Asp 
    1685                 1690                 1695             


Glu Glu  Glu Gly Thr Val Gln  Gln Pro Gln Ala Ser  Val Leu Tyr 
    1700                 1705                 1710             


Glu Asp  Leu Leu Met Ser Glu  Gly Glu Asp Asp Glu  Glu Asp Ala 
    1715                 1720                 1725             


Gly Ser  Asp Glu Glu Gly Asp  Asn Pro Phe Ser Ala  Ile Gln Leu 
    1730                 1735                 1740             


Ser Glu  Ser Gly Ser Asp Ser  Asp Val Gly Ser Gly  Gly Ile Arg 
    1745                 1750                 1755             


Pro Lys  Gln Pro Arg Met Leu  Gln Glu Asn Thr Arg  Met Asp Met 
    1760                 1765                 1770             


Glu Asn  Glu Glu Ser Met Met  Ser Tyr Glu Gly Asp  Gly Gly Glu 
    1775                 1780                 1785             


Ala Ser  His Gly Leu Glu Asp  Ser Asn Ile Ser Tyr  Gly Ser Tyr 
    1790                 1795                 1800             


Glu Glu  Pro Asp Pro Lys Ser  Asn Thr Gln Asp Thr  Ser Phe Ser 
    1805                 1810                 1815             


Ser Ile  Gly Gly Tyr Glu Val  Ser Glu Glu Glu Glu  Asp Glu Glu 
    1820                 1825                 1830             


Glu Glu  Glu Gln Arg Ser Gly  Pro Ser Val Leu Ser  Gln Val His 
    1835                 1840                 1845             


Leu Ser  Glu Asp Glu Glu Asp  Ser Glu Asp Phe His  Ser Ile Ala 
    1850                 1855                 1860             


Gly Asp  Ser Asp Leu Asp Ser  Asp Glu 
    1865                 1870         


<210>  30
<211>  7599
<212>  DNA
<213>  Artificial sequence

<220>
<223>  An exemplary sequence of TAF1

<400>  30
atcactgctg ccgccatcat gtcagacacg gacagcgacg aagattccgc tggaggcggc       60

ccattttctt tagcgggttt ccttttcggc aacatcaatg gagccgggca gctggagggg      120

gaaagcgtct tggatgatga atgtaagaag cacttggcag gcttgggggc tttggggctg      180

ggcagcctga tcactgaact cacggcaaat gaagaattga ccgggactga cggtgccttg      240

gtaaatgatg aagggtgggt taggagtaca gaagatgctg tggactattc agacatcaat      300

gaggtggcag aagatgaaag ccgaagatac cagcagacga tggggagctt gcagcccctt      360

tgccactcag attatgatga agatgactat gatgctgatt gtgaagacat tgattgcaag      420

ttgatgcctc ctccacctcc acccccggga ccaatgaaga aggataagga ccaggattct      480

attactggtg tgtctgaaaa tggagaaggc atcatcttgc cctccatcat tgccccttcc      540

tctttggcct cagagaaagt ggacttcagt agttcctctg actcagaatc tgagatggga      600

cctcaggaag caacacaggc agaatctgaa gatggaaagc tgacccttcc attggctggg      660

attatgcagc atgatgccac caagctgttg ccaagtgtca cagaactttt tccagaattt      720

cgacctggaa aggtgttacg ttttctacgt ctttttggac cagggaagaa tgtcccatct      780

gtttggcgga gtgctcggag aaagaggaag aagaagcacc gtgagctgat acaggaagag      840

cagatccagg aggtggagtg ctcagtagaa tcagaagtca gccagaagtc tttgtggaac      900

tacgactacg ctccaccacc acctccagag cagtgtctct ctgatgatga aatcacgatg      960

atggctcctg tggagtccaa attttcccaa tcaactggag atatagataa agtgacagat     1020

accaaaccaa gagtggctga gtggcgttat gggcctgccc gactgtggta tgatatgctg     1080

ggtgtccctg aagatggcag tgggtttgac tatggcttca aactgagaaa gacagaacat     1140

gaacctgtga taaaatctag aatgatagag gaatttagga aacttgagga aaacaatggc     1200

actgatcttc tggctgatga aaacttcctg atggtgacac agctgcattg ggaggatgat     1260

atcatctggg atggggagga tgtcaaacac aaagggacaa aacctcagcg tgcaagcctg     1320

gcaggctggc ttccttctag catgactagg aatgcgatgg cttacaatgt tcagcaaggt     1380

tttgcagcca ctcttgatga tgacaaacct tggtactcca tttttcccat tgacaatgag     1440

gatctggtat atggacgctg ggaggacaat atcatttggg atgctcaggc catgccccgg     1500

ctgttggaac ctcctgtttt gacacttgat cccaatgatg agaacctcat tttggaaatt     1560

cctgatgaga aggaagaggc cacctctaac tccccctcca aggagagtaa gaaggaatca     1620

tctctgaaga agagtcgaat tctcttaggg aaaacaggag tcatcaagga ggaaccacag     1680

cagaacatgt ctcagccaga agtgaaagat ccatggaatc tctccaatga tgagtattat     1740

tatcccaagc aacagggtct tcgaggcacc tttggaggga atattatcca gcattcaatt     1800

cctgctgtgg aattacggca gcccttcttt cccacccaca tggggcccat caaactccgg     1860

cagttccatc gcccacctct gaaaaagtac tcatttggtg cactttctca gccaggtccc     1920

cactcagtcc aacctttgct aaagcacatc aaaaaaaagg ccaagatgag agaacaagag     1980

aggcaagctt caggtggtgg agagatgttt tttatgcgca cacctcagga cctcacaggc     2040

aaagatggtg atcttattct tgcagaatat agtgaggaaa atggaccctt aatgatgcag     2100

gttggcatgg caaccaagat aaagaactat tataaacgga aacctggaaa agatcctgga     2160

gcaccagatt gtaaatatgg ggaaactgtt tactgccata catctccttt cctgggttct     2220

ctccatcctg gccaattgct gcaagcattt gagaacaacc tttttcgtgc tccaatttat     2280

cttcataaga tgccagaaac tgatttcttg atcattcgga caagacaggg ttactatatt     2340

cgggaattag tggatatttt tgtggttggc cagcagtgtc ccttgtttga agttcctggg     2400

cctaactcca aaagggccaa tacgcatatt cgagactttc tacaggtttt tatttaccgc     2460

cttttctgga aaagtaaaga tcggccacgg aggatacgaa tggaagatat aaaaaaagcc     2520

tttccttccc attcagaaag cagcatccgg aagaggctaa agctctgcgc tgacttcaaa     2580

cgcacaggga tggactcaaa ctggtgggtg cttaagtctg attttcgttt accaacggaa     2640

gaagagatca gagctatggt gtcaccagag cagtgctgtg cttattatag catgatagct     2700

gcagagcaac gactgaagga tgctggctat ggtgagaaat ccttttttgc tccagaagaa     2760

gaaaatgagg aagatttcca gatgaagatt gatgatgaag ttcgcactgc cccttggaac     2820

accacaaggg ccttcattgc tgccatgaag ggcaagtgtc tgctagaggt gactggggtg     2880

gcagatccca cggggtgtgg tgaaggattc tcctatgtga agattccaaa caaaccaaca     2940

cagcagaagg atgataaaga accgcagcca gtgaagaaga cagtgacagg aacagatgca     3000

gaccttcgtc gcctttccct gaaaaatgcc aagcaacttc tacgtaaatt tggtgtgcct     3060

gaggaagaga ttaaaaagtt gtcccgctgg gaagtgattg atgtggtgcg cacaatgtca     3120

acagaacagg ctcgttctgg agaggggccc atgagtaaat ttgcccgtgg atcaaggttt     3180

tctgtggctg agcatcaaga gcgttacaaa gaggaatgtc agcgcatctt tgacctacag     3240

aacaaggttc tgtcatcaac tgaagtctta tcaactgaca cagacagcag ctcagctgaa     3300

gatagtgact ttgaagaaat gggaaagaac attgagaaca tgttgcagaa caagaaaacc     3360

agctctcagc tttcacgtga acgggaggaa caggagcgga aggaactaca gcgaatgcta     3420

ctggcagcag gctcagcagc atccggaaac aatcacagag atgatgacac agcttccgtg     3480

actagcctta actcttctgc cactggacgc tgtctcaaga tttatcgcac gtttcgagat     3540

gaagagggga aagagtatgt tcgctgtgag acagtccgaa aaccagctgt cattgatgcc     3600

tatgtgcgca tacggactac aaaagatgag gaattcattc gaaaatttgc cctttttgat     3660

gaacaacatc gggaagagat gcgaaaagaa cggcggagga ttcaagagca actgaggcgg     3720

cttaagagga accaggaaaa ggagaagctt aagggtcctc ctgagaagaa gcccaagaaa     3780

atgaaggagc gtcctgacct aaaactgaaa tgtggggcat gtggtgccat tggacacatg     3840

aggactaaca aattctgccc cctctattat caaacaaatg cgccaccttc caaccctgtt     3900

gccatgacag aagaacagga ggaggagttg gaaaagacag tcattcataa tgataatgaa     3960

gaacttatca aggttgaagg gaccaaaatt gtcttgggga aacagctaat tgagagtgcg     4020

gatgaggttc gcagaaaatc tctggttctc aagtttccta aacagcagct tcctccaaag     4080

aagaaacggc gagttggaac cactgttcac tgtgactatt tgaatagacc tcataagtcc     4140

atccaccggc gccgcacaga ccctatggtg acgctgtcgt ccatcttgga gtctatcatc     4200

aatgacatga gagatcttcc aaatacatac cctttccaca ctccagtcaa tgcaaaggtt     4260

gtaaaggact actacaaaat catcactcgg ccaatggacc tacaaacact ccgcgaaaac     4320

gtgcgtaaac gcctctaccc atctcgggaa gagttcagag agcatctgga gctaattgtg     4380

aaaaatagtg caacctacaa tgggccaaaa cactcattga ctcagatctc tcaatccatg     4440

ctggatctct gtgatgaaaa actcaaagag aaagaagaca aattagctcg cttagagaaa     4500

gctatcaacc ccttgctgga tgatgatgac caagtggcgt tttctttcat tctggacaac     4560

attgtcaccc agaaaatgat ggcagttcca gattcttggc catttcatca cccagttaat     4620

aagaaatttg ttccagatta ttacaaagtg attgtcaatc caatggattt agagaccata     4680

cgtaagaaca tctccaagca caagtatcag agtcgggaga gctttctgga tgatgtaaac     4740

cttattctgg ccaacagtgt taagtataat ggacctgaga gtcagtatac taagactgcc     4800

caggagattg tgaacgtctg ttaccagaca ttgactgagt atgatgaaca tttgactcaa     4860

cttgagaagg atatttgtac tgctaaagaa gcagctttgg aggaagcaga attagaaagc     4920

ctggacccaa tgaccccagg gccctacacg cctcagcctc ctgatttgta tgataccaac     4980

acatccctca gtatgtctcg agatgcctct gtatttcaag atgagagcaa tatgtctgtc     5040

ttggatattc ccagtgccac tccagaaaag caggtaacac aggaaggtga agatggagat     5100

ggtgatcttg cagatgaaga ggaaggaact gtacaacagc ctcaagccag tgtcctgtat     5160

gaggatttgc ttatgtctga aggagaagat gatgaggaag atgctgggag tgatgaagaa     5220

ggagacaatc ctttctctgc tatccagctg agtgaaagtg gaagtgactc tgatgtggga     5280

tctggtggaa taagacccaa acaaccccgc atgcttcagg agaacacaag gatggacatg     5340

gaaaatgaag aaagcatgat gtcctatgag ggagacggtg gggaggcttc ccatggtttg     5400

gaggatagca acatcagtta tgggagctat gaggagcctg atcccaagtc gaacacccaa     5460

gacacaagct tcagcagcat cggtgggtat gaggtatcag aggaggaaga agatgaggag     5520

gaggaagagc agcgctctgg gccgagcgta ctaagccagg tccacctgtc agaggacgag     5580

gaggacagtg aggatttcca ctccattgct ggggacagtg acttggactc tgatgaatga     5640

ggcttccttt gggcctcctt ggtcagcctt ccctgttctc cagcctaggt ggttcacctt     5700

tccccaattt gttcatattt gtacagtatc tgatcctgaa atcatgaaat taactaacac     5760

cttagccttt ttaaaagtag taagtaaatg ataataaatc acctctccta atcttcctgg     5820

ggcaatgtca ccctttgatt taaaacaaag caaccccctt tcccctacca ctacggaaaa     5880

gagcaagctc atttttccgt gtcctccttt atttaactcc atttattgct tttggtataa     5940

tttttccctg gggaaggagg ggaaattatg aaagaactag taactttatg tcctcttgat     6000

gtattaggaa atttccggcc aggcgtggtg gctcacacct gtaatctcag cactctggga     6060

ggccgaggcg ggcagatcac ctgaggtcag aagttcgaga ccagcttggc caacatggcg     6120

aaaccgcatc tctactaaaa atacaaaaat tagccaggtg tggtggcgta tgcctgttaa     6180

tcctagctac tcgggaggct gaggcaggag aattacttga acccgggagg cagaggttgc     6240

agtgagtgga ggtcacgcca ctgcactcca gcctgggcga aagagtgaga ttcagtctca     6300

aaaaaaaaaa aatttccaag catggtatca tctcactttt ctaatttaca ggctggagca     6360

gatgagagcc ctcctgctgg gacagagaat tgggttctag tggactctgt gctacactta     6420

aacctgtgag acaaaccgcc cattatttta ttatttaatt atgcaatgcc tagttcctaa     6480

atggattgga ggcaaattac cgtaaatttt gaaacagcct atatgtcaga aatgataatg     6540

ttgccaccta aatgttttct gtccccccca ccctccccag gggaaatggt aggaaaatgg     6600

taagtttctt agggcaaaga ctgtgtcttc tgtttctttt catgcttagg atatggttct     6660

gtgcatagta ggtactcagt aaatgttcct agaatcataa aatcctcaac agatatgtta     6720

ctgagcatct gcttttcatg ataagcactc tatcagatcc ttgggatgca aaggtaaata     6780

agacaaatcc cttttaccca aagagctcac catcaagttg ggggagggaa agtggaattc     6840

aaaacatgtt aataaatcat catagtactg tgagataagt gcaattaaga agctagttat     6900

aaagtatagg ggaaatagag gagtaatcat gtctgaaaag tcaggaaagt cttcctagag     6960

gtaattttta agctgattgt tttagaatta gtagaagctt gccagatgga aaagtccagg     7020

caaagtgtaa catgaatggg aaaggccaca gtctagaaat ggcagagtgt gttcctagtt     7080

tgtttgtttg tttgtttgta cctgccttgt tccaggaagg atttaatgtg gtttatattc     7140

cagtccttta atgctggaag ggctgagatg agactgaaag atgggcagga agtatatcat     7200

cacaagcttt gtgtttgatg ttaatgtgta tgatttttat attatgggaa ataagctctt     7260

agaggagtga tataatcagg tttgtgtttt agaaatctgt gtaatgaatg aatgaagaaa     7320

gaaattgaag aatcatgtaa catatgtgat cgcatttttg taaaagaacc atgtgtgttt     7380

atatgtgttt atatatatac ttgtgtatgc aaaggtaaaa gtctgaaagg atatatgcta     7440

actgttcaca atgataaccc cccaggaatg ggattggagg ggagggggct tctgtgtttg     7500

ttatgtatgc tgggtgggat attgtgcttt tatttctata ttgtttgaat ttttttacag     7560

tatgtattat ttttgtaata aaaattttaa aaaattcca                            7599


<210>  31
<211>  106
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Insertion of a P54Q mutation in the RPL36A amino acid sequence


<220>
<221>  misc_feature
<222>  (54)..(54)
<223>  Xaa can be any naturally occurring amino acid

<400>  31

Met Val Asn Val Pro Lys Thr Arg Arg Thr Phe Cys Lys Lys Cys Gly 
1               5                   10                  15      


Lys His Gln Pro His Lys Val Thr Gln Tyr Lys Lys Gly Lys Asp Ser 
            20                  25                  30          


Leu Tyr Ala Gln Gly Lys Arg Arg Tyr Asp Arg Lys Gln Ser Gly Tyr 
        35                  40                  45              


Gly Gly Gln Thr Lys Xaa Ile Phe Arg Lys Lys Ala Lys Thr Thr Lys 
    50                  55                  60                  


Lys Ile Val Leu Arg Leu Glu Cys Val Glu Pro Asn Cys Arg Ser Lys 
65                  70                  75                  80  


Arg Met Leu Ala Ile Lys Arg Cys Lys His Phe Glu Leu Gly Gly Asp 
                85                  90                  95      


Lys Lys Arg Lys Gly Gln Val Ile Gln Phe 
            100                 105     


<210>  32
<211>  761
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Nucleic acid sequence encoding a positive selectable marker 
       protein which allows the growth of cells in the presence of 
       cycloheximide

<400>  32
ctttctttcc gcgccgatag cgctcacgca agcatggtta acgtccctaa aacccgccgg       60

actttctgta agaagtgtgg caagcaccaa ccccataaag tgacacagta caagaagggc      120

aaggattctc tgtacgccca gggaaagcgg cgttatgaca ggaagcagag tggctatggt      180

gggcaaacta agccgatttt ccggaaaaag gctaaaacta caaagaagat tgtgctaagg      240

cttgagtgcg ttgagcccaa ctgcagatct aagagaatgc tggctattaa aagatgcaag      300

cattttgaac tgggaggaga taagaagaga aagggccaag tgatccagtt ctaagtgtca      360

tcttttatta tgaagacaat aaaatcttga gtttatgttc acttcatttg tttgctgttc      420

atcttttggg agggaataag ctagagccat caatacaatt ccgcttgtgg ggaaatttat      480

gcctcttact ggtactactt gttttgcatt gaagctgact ggttgagttc acatcatatg      540

ttgcaatttt ctaatttggc acttcaatca ctaggggcct tatgaggcag tttgtcatta      600

tgcaatggtt attggttatc atgtgagtag acacatttca ggctaatagg gagaagtcag      660

taacacattc atagtgaata tgagatgtct ttgctaagag ttaagtgtca gatctttgtt      720

ataacagtta atttaataaa gaattttggc attgttcttc a                          761


<210>  33
<211>  187
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Encoded positive selectable marker protein which allows the 
       growth of cells in the presence of methotrexate


<220>
<221>  misc_feature
<222>  (23)..(23)
<223>  Xaa can be any naturally occurring amino acid

<400>  33

Met Val Gly Ser Leu Asn Cys Ile Val Ala Val Ser Gln Asn Met Gly 
1               5                   10                  15      


Ile Gly Lys Asn Gly Asp Xaa Pro Trp Pro Pro Leu Arg Asn Glu Phe 
            20                  25                  30          


Arg Tyr Phe Gln Arg Met Thr Thr Thr Ser Ser Val Glu Gly Lys Gln 
        35                  40                  45              


Asn Leu Val Ile Met Gly Lys Lys Thr Trp Phe Ser Ile Pro Glu Lys 
    50                  55                  60                  


Asn Arg Pro Leu Lys Gly Arg Ile Asn Leu Val Leu Ser Arg Glu Leu 
65                  70                  75                  80  


Lys Glu Pro Pro Gln Gly Ala His Phe Leu Ser Arg Ser Leu Asp Asp 
                85                  90                  95      


Ala Leu Lys Leu Thr Glu Gln Pro Glu Leu Ala Asn Lys Val Asp Met 
            100                 105                 110         


Val Trp Ile Val Gly Gly Ser Ser Val Tyr Lys Glu Ala Met Asn His 
        115                 120                 125             


Pro Gly His Leu Lys Leu Phe Val Thr Arg Ile Met Gln Asp Phe Glu 
    130                 135                 140                 


Ser Asp Thr Phe Phe Pro Glu Ile Asp Leu Glu Lys Tyr Lys Leu Leu 
145                 150                 155                 160 


Pro Glu Tyr Pro Gly Val Leu Ser Asp Val Gln Glu Glu Lys Gly Ile 
                165                 170                 175     


Lys Tyr Lys Phe Glu Val Tyr Glu Lys Asn Asp 
            180                 185         


<210>  34
<211>  389
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Encoded positive selectable marker protein which allows the 
       growth of cells in the presence of hydroxyurea


<220>
<221>  misc_feature
<222>  (128)..(128)
<223>  Xaa can be any naturally occurring amino acid

<400>  34

Met Leu Ser Leu Arg Val Pro Leu Ala Pro Ile Thr Asp Pro Gln Gln 
1               5                   10                  15      


Leu Gln Leu Ser Pro Leu Lys Gly Leu Ser Leu Val Asp Lys Glu Asn 
            20                  25                  30          


Thr Pro Pro Ala Leu Ser Gly Thr Arg Val Leu Ala Ser Lys Thr Ala 
        35                  40                  45              


Arg Arg Ile Phe Gln Glu Pro Thr Glu Pro Lys Thr Lys Ala Ala Ala 
    50                  55                  60                  


Pro Gly Val Glu Asp Glu Pro Leu Leu Arg Glu Asn Pro Arg Arg Phe 
65                  70                  75                  80  


Val Ile Phe Pro Ile Glu Tyr His Asp Ile Trp Gln Met Tyr Lys Lys 
                85                  90                  95      


Ala Glu Ala Ser Phe Trp Thr Ala Glu Glu Val Asp Leu Ser Lys Asp 
            100                 105                 110         


Ile Gln His Trp Glu Ser Leu Lys Pro Glu Glu Arg Tyr Phe Ile Xaa 
        115                 120                 125             


His Val Leu Ala Phe Phe Ala Ala Ser Asp Gly Ile Val Asn Glu Asn 
    130                 135                 140                 


Leu Val Glu Arg Phe Ser Gln Glu Val Gln Ile Thr Glu Ala Arg Cys 
145                 150                 155                 160 


Phe Tyr Gly Phe Gln Ile Ala Met Glu Asn Ile His Ser Glu Met Tyr 
                165                 170                 175     


Ser Leu Leu Ile Asp Thr Tyr Ile Lys Asp Pro Lys Glu Arg Glu Phe 
            180                 185                 190         


Leu Phe Asn Ala Ile Glu Thr Met Pro Cys Val Lys Lys Lys Ala Asp 
        195                 200                 205             


Trp Ala Leu Arg Trp Ile Gly Asp Lys Glu Ala Thr Tyr Gly Glu Arg 
    210                 215                 220                 


Val Val Ala Phe Ala Ala Val Glu Gly Ile Phe Phe Ser Gly Ser Phe 
225                 230                 235                 240 


Ala Ser Ile Phe Trp Leu Lys Lys Arg Gly Leu Met Pro Gly Leu Thr 
                245                 250                 255     


Phe Ser Asn Glu Leu Ile Ser Arg Asp Glu Gly Leu His Cys Asp Phe 
            260                 265                 270         


Ala Cys Leu Met Phe Lys His Leu Val His Lys Pro Ser Glu Glu Arg 
        275                 280                 285             


Val Arg Glu Ile Ile Ile Asn Ala Val Arg Ile Glu Gln Glu Phe Leu 
    290                 295                 300                 


Thr Glu Ala Leu Pro Val Lys Leu Ile Gly Met Asn Cys Thr Leu Met 
305                 310                 315                 320 


Lys Gln Tyr Ile Glu Phe Val Ala Asp Arg Leu Met Leu Glu Leu Gly 
                325                 330                 335     


Phe Ser Lys Val Phe Arg Val Glu Asn Pro Phe Asp Phe Met Glu Asn 
            340                 345                 350         


Ile Ser Leu Glu Gly Lys Thr Asn Phe Phe Glu Lys Arg Val Gly Glu 
        355                 360                 365             


Tyr Gln Arg Met Gly Val Met Ser Ser Pro Thr Glu Asn Ser Phe Thr 
    370                 375                 380                 


Leu Asp Ala Asp Phe 
385                 


<210>  35
<211>  1970
<212>  PRT
<213>  Artificial sequence

<220>
<223>  POLR2A (largest subunit of RNA polymerase II)

<400>  35

Met His Gly Gly Gly Pro Pro Ser Gly Asp Ser Ala Cys Pro Leu Arg 
1               5                   10                  15      


Thr Ile Lys Arg Val Gln Phe Gly Val Leu Ser Pro Asp Glu Leu Lys 
            20                  25                  30          


Arg Met Ser Val Thr Glu Gly Gly Ile Lys Tyr Pro Glu Thr Thr Glu 
        35                  40                  45              


Gly Gly Arg Pro Lys Leu Gly Gly Leu Met Asp Pro Arg Gln Gly Val 
    50                  55                  60                  


Ile Glu Arg Thr Gly Arg Cys Gln Thr Cys Ala Gly Asn Met Thr Glu 
65                  70                  75                  80  


Cys Pro Gly His Phe Gly His Ile Glu Leu Ala Lys Pro Val Phe His 
                85                  90                  95      


Val Gly Phe Leu Val Lys Thr Met Lys Val Leu Arg Cys Val Cys Phe 
            100                 105                 110         


Phe Cys Ser Lys Leu Leu Val Asp Ser Asn Asn Pro Lys Ile Lys Asp 
        115                 120                 125             


Ile Leu Ala Lys Ser Lys Gly Gln Pro Lys Lys Arg Leu Thr His Val 
    130                 135                 140                 


Tyr Asp Leu Cys Lys Gly Lys Asn Ile Cys Glu Gly Gly Glu Glu Met 
145                 150                 155                 160 


Asp Asn Lys Phe Gly Val Glu Gln Pro Glu Gly Asp Glu Asp Leu Thr 
                165                 170                 175     


Lys Glu Lys Gly His Gly Gly Cys Gly Arg Tyr Gln Pro Arg Ile Arg 
            180                 185                 190         


Arg Ser Gly Leu Glu Leu Tyr Ala Glu Trp Lys His Val Asn Glu Asp 
        195                 200                 205             


Ser Gln Glu Lys Lys Ile Leu Leu Ser Pro Glu Arg Val His Glu Ile 
    210                 215                 220                 


Phe Lys Arg Ile Ser Asp Glu Glu Cys Phe Val Leu Gly Met Glu Pro 
225                 230                 235                 240 


Arg Tyr Ala Arg Pro Glu Trp Met Ile Val Thr Val Leu Pro Val Pro 
                245                 250                 255     


Pro Leu Ser Val Arg Pro Ala Val Val Met Gln Gly Ser Ala Arg Asn 
            260                 265                 270         


Gln Asp Asp Leu Thr His Lys Leu Ala Asp Ile Val Lys Ile Asn Asn 
        275                 280                 285             


Gln Leu Arg Arg Asn Glu Gln Asn Gly Ala Ala Ala His Val Ile Ala 
    290                 295                 300                 


Glu Asp Val Lys Leu Leu Gln Phe His Val Ala Thr Met Val Asp Asn 
305                 310                 315                 320 


Glu Leu Pro Gly Leu Pro Arg Ala Met Gln Lys Ser Gly Arg Pro Leu 
                325                 330                 335     


Lys Ser Leu Lys Gln Arg Leu Lys Gly Lys Glu Gly Arg Val Arg Gly 
            340                 345                 350         


Asn Leu Met Gly Lys Arg Val Asp Phe Ser Ala Arg Thr Val Ile Thr 
        355                 360                 365             


Pro Asp Pro Asn Leu Ser Ile Asp Gln Val Gly Val Pro Arg Ser Ile 
    370                 375                 380                 


Ala Ala Asn Met Thr Phe Ala Glu Ile Val Thr Pro Phe Asn Ile Asp 
385                 390                 395                 400 


Arg Leu Gln Glu Leu Val Arg Arg Gly Asn Ser Gln Tyr Pro Gly Ala 
                405                 410                 415     


Lys Tyr Ile Ile Arg Asp Asn Gly Asp Arg Ile Asp Leu Arg Phe His 
            420                 425                 430         


Pro Lys Pro Ser Asp Leu His Leu Gln Thr Gly Tyr Lys Val Glu Arg 
        435                 440                 445             


His Met Cys Asp Gly Asp Ile Val Ile Phe Asn Arg Gln Pro Thr Leu 
    450                 455                 460                 


His Lys Met Ser Met Met Gly His Arg Val Arg Ile Leu Pro Trp Ser 
465                 470                 475                 480 


Thr Phe Arg Leu Asn Leu Ser Val Thr Thr Pro Tyr Asn Ala Asp Phe 
                485                 490                 495     


Asp Gly Asp Glu Met Asn Leu His Leu Pro Gln Ser Leu Glu Thr Arg 
            500                 505                 510         


Ala Glu Ile Gln Glu Leu Ala Met Val Pro Arg Met Ile Val Thr Pro 
        515                 520                 525             


Gln Ser Asn Arg Pro Val Met Gly Ile Val Gln Asp Thr Leu Thr Ala 
    530                 535                 540                 


Val Arg Lys Phe Thr Lys Arg Asp Val Phe Leu Glu Arg Gly Glu Val 
545                 550                 555                 560 


Met Asn Leu Leu Met Phe Leu Ser Thr Trp Asp Gly Lys Val Pro Gln 
                565                 570                 575     


Pro Ala Ile Leu Lys Pro Arg Pro Leu Trp Thr Gly Lys Gln Ile Phe 
            580                 585                 590         


Ser Leu Ile Ile Pro Gly His Ile Asn Cys Ile Arg Thr His Ser Thr 
        595                 600                 605             


His Pro Asp Asp Glu Asp Ser Gly Pro Tyr Lys His Ile Ser Pro Gly 
    610                 615                 620                 


Asp Thr Lys Val Val Val Glu Asn Gly Glu Leu Ile Met Gly Ile Leu 
625                 630                 635                 640 


Cys Lys Lys Ser Leu Gly Thr Ser Ala Gly Ser Leu Val His Ile Ser 
                645                 650                 655     


Tyr Leu Glu Met Gly His Asp Ile Thr Arg Leu Phe Tyr Ser Asn Ile 
            660                 665                 670         


Gln Thr Val Ile Asn Asn Trp Leu Leu Ile Glu Gly His Thr Ile Gly 
        675                 680                 685             


Ile Gly Asp Ser Ile Ala Asp Ser Lys Thr Tyr Gln Asp Ile Gln Asn 
    690                 695                 700                 


Thr Ile Lys Lys Ala Lys Gln Asp Val Ile Glu Val Ile Glu Lys Ala 
705                 710                 715                 720 


His Asn Asn Glu Leu Glu Pro Thr Pro Gly Asn Thr Leu Arg Gln Thr 
                725                 730                 735     


Phe Glu Asn Gln Val Asn Arg Ile Leu Asn Asp Ala Arg Asp Lys Thr 
            740                 745                 750         


Gly Ser Ser Ala Gln Lys Ser Leu Ser Glu Tyr Asn Asn Phe Lys Ser 
        755                 760                 765             


Met Val Val Ser Gly Ala Lys Gly Ser Lys Ile Asn Ile Ser Gln Val 
    770                 775                 780                 


Ile Ala Val Val Gly Gln Gln Asn Val Glu Gly Lys Arg Ile Pro Phe 
785                 790                 795                 800 


Gly Phe Lys His Arg Thr Leu Pro His Phe Ile Lys Asp Asp Tyr Gly 
                805                 810                 815     


Pro Glu Ser Arg Gly Phe Val Glu Asn Ser Tyr Leu Ala Gly Leu Thr 
            820                 825                 830         


Pro Thr Glu Phe Phe Phe His Ala Met Gly Gly Arg Glu Gly Leu Ile 
        835                 840                 845             


Asp Thr Ala Val Lys Thr Ala Glu Thr Gly Tyr Ile Gln Arg Arg Leu 
    850                 855                 860                 


Ile Lys Ser Met Glu Ser Val Met Val Lys Tyr Asp Ala Thr Val Arg 
865                 870                 875                 880 


Asn Ser Ile Asn Gln Val Val Gln Leu Arg Tyr Gly Glu Asp Gly Leu 
                885                 890                 895     


Ala Gly Glu Ser Val Glu Phe Gln Asn Leu Ala Thr Leu Lys Pro Ser 
            900                 905                 910         


Asn Lys Ala Phe Glu Lys Lys Phe Arg Phe Asp Tyr Thr Asn Glu Arg 
        915                 920                 925             


Ala Leu Arg Arg Thr Leu Gln Glu Asp Leu Val Lys Asp Val Leu Ser 
    930                 935                 940                 


Asn Ala His Ile Gln Asn Glu Leu Glu Arg Glu Phe Glu Arg Met Arg 
945                 950                 955                 960 


Glu Asp Arg Glu Val Leu Arg Val Ile Phe Pro Thr Gly Asp Ser Lys 
                965                 970                 975     


Val Val Leu Pro Cys Asn Leu Leu Arg Met Ile Trp Asn Ala Gln Lys 
            980                 985                 990         


Ile Phe His Ile Asn Pro Arg Leu  Pro Ser Asp Leu His  Pro Ile Lys 
        995                 1000                 1005             


Val Val  Glu Gly Val Lys Glu  Leu Ser Lys Lys Leu  Val Ile Val 
    1010                 1015                 1020             


Asn Gly  Asp Asp Pro Leu Ser  Arg Gln Ala Gln Glu  Asn Ala Thr 
    1025                 1030                 1035             


Leu Leu  Phe Asn Ile His Leu  Arg Ser Thr Leu Cys  Ser Arg Arg 
    1040                 1045                 1050             


Met Ala  Glu Glu Phe Arg Leu  Ser Gly Glu Ala Phe  Asp Trp Leu 
    1055                 1060                 1065             


Leu Gly  Glu Ile Glu Ser Lys  Phe Asn Gln Ala Ile  Ala His Pro 
    1070                 1075                 1080             


Gly Glu  Met Val Gly Ala Leu  Ala Ala Gln Ser Leu  Gly Glu Pro 
    1085                 1090                 1095             


Ala Thr  Gln Met Thr Leu Asn  Thr Phe His Tyr Ala  Gly Val Ser 
    1100                 1105                 1110             


Ala Lys  Asn Val Thr Leu Gly  Val Pro Arg Leu Lys  Glu Leu Ile 
    1115                 1120                 1125             


Asn Ile  Ser Lys Lys Pro Lys  Thr Pro Ser Leu Thr  Val Phe Leu 
    1130                 1135                 1140             


Leu Gly  Gln Ser Ala Arg Asp  Ala Glu Arg Ala Lys  Asp Ile Leu 
    1145                 1150                 1155             


Cys Arg  Leu Glu His Thr Thr  Leu Arg Lys Val Thr  Ala Asn Thr 
    1160                 1165                 1170             


Ala Ile  Tyr Tyr Asp Pro Asn  Pro Gln Ser Thr Val  Val Ala Glu 
    1175                 1180                 1185             


Asp Gln  Glu Trp Val Asn Val  Tyr Tyr Glu Met Pro  Asp Phe Asp 
    1190                 1195                 1200             


Val Ala  Arg Ile Ser Pro Trp  Leu Leu Arg Val Glu  Leu Asp Arg 
    1205                 1210                 1215             


Lys His  Met Thr Asp Arg Lys  Leu Thr Met Glu Gln  Ile Ala Glu 
    1220                 1225                 1230             


Lys Ile  Asn Ala Gly Phe Gly  Asp Asp Leu Asn Cys  Ile Phe Asn 
    1235                 1240                 1245             


Asp Asp  Asn Ala Glu Lys Leu  Val Leu Arg Ile Arg  Ile Met Asn 
    1250                 1255                 1260             


Ser Asp  Glu Asn Lys Met Gln  Glu Glu Glu Glu Val  Val Asp Lys 
    1265                 1270                 1275             


Met Asp  Asp Asp Val Phe Leu  Arg Cys Ile Glu Ser  Asn Met Leu 
    1280                 1285                 1290             


Thr Asp  Met Thr Leu Gln Gly  Ile Glu Gln Ile Ser  Lys Val Tyr 
    1295                 1300                 1305             


Met His  Leu Pro Gln Thr Asp  Asn Lys Lys Lys Ile  Ile Ile Thr 
    1310                 1315                 1320             


Glu Asp  Gly Glu Phe Lys Ala  Leu Gln Glu Trp Ile  Leu Glu Thr 
    1325                 1330                 1335             


Asp Gly  Val Ser Leu Met Arg  Val Leu Ser Glu Lys  Asp Val Asp 
    1340                 1345                 1350             


Pro Val  Arg Thr Thr Ser Asn  Asp Ile Val Glu Ile  Phe Thr Val 
    1355                 1360                 1365             


Leu Gly  Ile Glu Ala Val Arg  Lys Ala Leu Glu Arg  Glu Leu Tyr 
    1370                 1375                 1380             


His Val  Ile Ser Phe Asp Gly  Ser Tyr Val Asn Tyr  Arg His Leu 
    1385                 1390                 1395             


Ala Leu  Leu Cys Asp Thr Met  Thr Cys Arg Gly His  Leu Met Ala 
    1400                 1405                 1410             


Ile Thr  Arg His Gly Val Asn  Arg Gln Asp Thr Gly  Pro Leu Met 
    1415                 1420                 1425             


Lys Cys  Ser Phe Glu Glu Thr  Val Asp Val Leu Met  Glu Ala Ala 
    1430                 1435                 1440             


Ala His  Gly Glu Ser Asp Pro  Met Lys Gly Val Ser  Glu Asn Ile 
    1445                 1450                 1455             


Met Leu  Gly Gln Leu Ala Pro  Ala Gly Thr Gly Cys  Phe Asp Leu 
    1460                 1465                 1470             


Leu Leu  Asp Ala Glu Lys Cys  Lys Tyr Gly Met Glu  Ile Pro Thr 
    1475                 1480                 1485             


Asn Ile  Pro Gly Leu Gly Ala  Ala Gly Pro Thr Gly  Met Phe Phe 
    1490                 1495                 1500             


Gly Ser  Ala Pro Ser Pro Met  Gly Gly Ile Ser Pro  Ala Met Thr 
    1505                 1510                 1515             


Pro Trp  Asn Gln Gly Ala Thr  Pro Ala Tyr Gly Ala  Trp Ser Pro 
    1520                 1525                 1530             


Ser Val  Gly Ser Gly Met Thr  Pro Gly Ala Ala Gly  Phe Ser Pro 
    1535                 1540                 1545             


Ser Ala  Ala Ser Asp Ala Ser  Gly Phe Ser Pro Gly  Tyr Ser Pro 
    1550                 1555                 1560             


Ala Trp  Ser Pro Thr Pro Gly  Ser Pro Gly Ser Pro  Gly Pro Ser 
    1565                 1570                 1575             


Ser Pro  Tyr Ile Pro Ser Pro  Gly Gly Ala Met Ser  Pro Ser Tyr 
    1580                 1585                 1590             


Ser Pro  Thr Ser Pro Ala Tyr  Glu Pro Arg Ser Pro  Gly Gly Tyr 
    1595                 1600                 1605             


Thr Pro  Gln Ser Pro Ser Tyr  Ser Pro Thr Ser Pro  Ser Tyr Ser 
    1610                 1615                 1620             


Pro Thr  Ser Pro Ser Tyr Ser  Pro Thr Ser Pro Asn  Tyr Ser Pro 
    1625                 1630                 1635             


Thr Ser  Pro Ser Tyr Ser Pro  Thr Ser Pro Ser Tyr  Ser Pro Thr 
    1640                 1645                 1650             


Ser Pro  Ser Tyr Ser Pro Thr  Ser Pro Ser Tyr Ser  Pro Thr Ser 
    1655                 1660                 1665             


Pro Ser  Tyr Ser Pro Thr Ser  Pro Ser Tyr Ser Pro  Thr Ser Pro 
    1670                 1675                 1680             


Ser Tyr  Ser Pro Thr Ser Pro  Ser Tyr Ser Pro Thr  Ser Pro Ser 
    1685                 1690                 1695             


Tyr Ser  Pro Thr Ser Pro Ser  Tyr Ser Pro Thr Ser  Pro Ser Tyr 
    1700                 1705                 1710             


Ser Pro  Thr Ser Pro Ser Tyr  Ser Pro Thr Ser Pro  Ser Tyr Ser 
    1715                 1720                 1725             


Pro Thr  Ser Pro Ser Tyr Ser  Pro Thr Ser Pro Ser  Tyr Ser Pro 
    1730                 1735                 1740             


Thr Ser  Pro Asn Tyr Ser Pro  Thr Ser Pro Asn Tyr  Thr Pro Thr 
    1745                 1750                 1755             


Ser Pro  Ser Tyr Ser Pro Thr  Ser Pro Ser Tyr Ser  Pro Thr Ser 
    1760                 1765                 1770             


Pro Asn  Tyr Thr Pro Thr Ser  Pro Asn Tyr Ser Pro  Thr Ser Pro 
    1775                 1780                 1785             


Ser Tyr  Ser Pro Thr Ser Pro  Ser Tyr Ser Pro Thr  Ser Pro Ser 
    1790                 1795                 1800             


Tyr Ser  Pro Ser Ser Pro Arg  Tyr Thr Pro Gln Ser  Pro Thr Tyr 
    1805                 1810                 1815             


Thr Pro  Ser Ser Pro Ser Tyr  Ser Pro Ser Ser Pro  Ser Tyr Ser 
    1820                 1825                 1830             


Pro Ala  Ser Pro Lys Tyr Thr  Pro Thr Ser Pro Ser  Tyr Ser Pro 
    1835                 1840                 1845             


Ser Ser  Pro Glu Tyr Thr Pro  Thr Ser Pro Lys Tyr  Ser Pro Thr 
    1850                 1855                 1860             


Ser Pro  Lys Tyr Ser Pro Thr  Ser Pro Lys Tyr Ser  Pro Thr Ser 
    1865                 1870                 1875             


Pro Thr  Tyr Ser Pro Thr Thr  Pro Lys Tyr Ser Pro  Thr Ser Pro 
    1880                 1885                 1890             


Thr Tyr  Ser Pro Thr Ser Pro  Val Tyr Thr Pro Thr  Ser Pro Lys 
    1895                 1900                 1905             


Tyr Ser  Pro Thr Ser Pro Thr  Tyr Ser Pro Thr Ser  Pro Lys Tyr 
    1910                 1915                 1920             


Ser Pro  Thr Ser Pro Thr Tyr  Ser Pro Thr Ser Pro  Lys Gly Ser 
    1925                 1930                 1935             


Thr Tyr  Ser Pro Thr Ser Pro  Gly Tyr Ser Pro Thr  Ser Pro Thr 
    1940                 1945                 1950             


Tyr Ser  Leu Thr Ser Pro Ala  Ile Ser Pro Asp Asp  Ser Asp Glu 
    1955                 1960                 1965             


Glu Asn  
    1970 


<210>  36
<211>  30
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Guide sequence

<400>  36
ggacccttaa tgatgcaggt tggcatggca                                        30


<210>  37
<211>  30
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Guide sequence

<400>  37
ggacccttaa tgatgcaggt tgacatggca                                        30


<210>  38
<211>  51
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Partial nucleic acid sequence of PSMB6

<400>  38
tcgccgttgc cactttacca cccgcctgaa tcctgggatt ctagtatgca a                51


<210>  39
<211>  33
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Partial nucleic acid sequence of human RPL36A

<400>  39
actaagccga ttttccggaa aaaggtgagt ggt                                    33


<210>  40
<211>  31
<212>  DNA
<213>  Artificial sequence

<220>
<223>  An exemplary DHFR mutation in human

<400>  40
ggagacctac cctggcctcc gctcaggtat c                                      31


<210>  41
<211>  31
<212>  DNA
<213>  Artificial sequence

<220>
<223>  An exemplary DHFR mutation in human

<400>  41
ggagacctac cctggcctcc gctcaggtat t                                      31


<210>  42
<211>  33
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Targeted locus of human R2

<400>  42
tttatatccc atgttctggc tttctttgca gca                                    33


<210>  43
<211>  24
<212>  DNA
<213>  Artificial sequence

<220>
<223>  An example of an edited nucleic acid sequence resistant to siRNA

<400>  43
cttcttctac ttctactact actt                                              24


<210>  44
<211>  24
<212>  DNA
<213>  Artificial sequence

<220>
<223>  An example of a WT nucleic acid sequence susceptible to siRNA

<400>  44
cttctactgc tcctcctact actt                                              24


<210>  45
<211>  119
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Wild-type PSMA1 3' UTR

<400>  45
tcttcccttt cccaggatct cacttgctta tctgaagaag attgtccagg ctcatattgg       60

gaatgcttat gaggaaattc atgccgagac ctgctattca atgcatgtat cgttgcctc       119


<210>  46
<211>  119
<212>  DNA
<213>  Artificial sequence

<220>
<223>  donor ssODN

<400>  46
tcttcccttt cccaggatct cacttgctta tctgaagaag attgtccagg ctgatagtct       60

gaattcatat gaggaaattc atgccgagac ctgctattca atgcatgtat cgttgcctc       119


<210>  47
<211>  25
<212>  DNA
<213>  Artificial sequence

<220>
<223>  humTAF1_tsmut_g2_forward

<400>  47
caccgcttaa tgatgcaggt tgaca                                             25


<210>  48
<211>  23
<212>  DNA
<213>  Artificial sequence

<220>
<223>  humTAF1_tsmut_g2_reverse

<400>  48
cttaatgatg caggttgaca tgg                                               23


<210>  49
<211>  100
<212>  DNA
<213>  Artificial sequence

<220>
<223>  ssODN_humTAFwt  (-strand)

<400>  49
ctgagcagag actcacccgt ttataatagt tctttatctt ggttgccatg ccaacctgca       60

tcattaaggg tccattttcc tcactatatt ctgcaagaat                            100


<210>  50
<211>  19
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Partial sequence of PSMD1

<400>  50
taagcattcc caatatgag                                                    19


