                         序列表

<110>  北京中因科技有限公司

<120>  PRPF31变体及其用途

<130>  0138-PA-022

<150>  CN2021106652668
<151>  2021-06-16

<160>  36

<170>  PatentIn version 3.5

<210>  1
<211>  7
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  sv40

<400>  1

Pro Lys Lys Lys Arg Lys Val 
1               5           


<210>  2
<211>  20
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  hnRNPA1

<400>  2

Ser Ser Asn Phe Gly Pro Met Lys Gly Gly Asn Arg Phe Phe Arg Ser 
1               5                   10                  15      


Ser Gly Pro Tyr 
            20  


<210>  3
<211>  12
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  HIV Tat

<400>  3

Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Ala Pro 
1               5                   10          


<210>  4
<211>  27
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  hnRNPD

<400>  4

Tyr Gly Asp Tyr Ser Asn Gln Gln Ser Gly Tyr Gly Lys Val Ser Arg 
1               5                   10                  15      


Arg Gly Gly His Gln Asn Ser Tyr Lys Pro Tyr 
            20                  25          


<210>  5
<211>  26
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  hnRNPM

<400>  5

Gly Glu Gly Glu Arg Pro Ala Gln Asn Glu Lys Arg Lys Glu Asn Ile 
1               5                   10                  15      


Lys Arg Gly Gly Asn Arg Phe Glu Pro Tyr 
            20                  25      


<210>  6
<211>  16
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  nuleoplasmin

<400>  6

Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys 
1               5                   10                  15      


<210>  7
<211>  19
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  SRY

<400>  7

His Arg Glu Lys Tyr Pro Asn Tyr Lys Tyr Arg Pro Arg Arg Lys Ala 
1               5                   10                  15      


Lys Met Leu 
            


<210>  8
<211>  21
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  sv40

<400>  8
cctaagaaga aaagaaaggt g                                                 21


<210>  9
<211>  60
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  hnRNPA1

<400>  9
tctagcaact tcggccctat gaagggcgga aaccggttct ttagaagctc cggcccctac       60


<210>  10
<211>  36
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  HIV Tat

<400>  10
ggcagaaaga agcggagaca gagaagacgg gcccct                                 36


<210>  11
<211>  81
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  hnRNPD

<400>  11
tatggcgact acagcaacca gcagtccggc tacggcaagg tgtctagacg gggcggacac       60

cagaacagct acaagcctta c                                                 81


<210>  12
<211>  78
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  hnRNPM

<400>  12
ggcgaaggcg agagacctgc ccagaacgag aaaagaaagg aaaacatcaa gcggggcgga       60

aatagattcg agccctac                                                     78


<210>  13
<211>  48
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  nuleoplasmin

<400>  13
aaaagacctg ctgccaccaa gaaggccggc caggccaaga agaaaaag                    48


<210>  14
<211>  57
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  SRY

<400>  14
cacagagaga aataccccaa ctacaagtac cggcctagaa gaaaggccaa gatgctg          57


<210>  15
<211>  14
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31蛋白的核定位信号序列

<400>  15

Arg Lys Lys Arg Gly Gly Arg Arg Tyr Arg Lys Met Lys Glu 
1               5                   10                  


<210>  16
<211>  42
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31蛋白的核定位信号序列

<400>  16
cggaagaagc gaggcggccg caggtaccgc aagatgaagg ag                          42


<210>  17
<211>  500
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  野生型PRPF31

<220>
<221>  misc_feature
<222>  (500)..(500)
<223>  Xaa = 任意种类氨基酸

<400>  17

Met Ser Leu Ala Asp Glu Leu Leu Ala Asp Leu Glu Glu Ala Ala Glu 
1               5                   10                  15      


Glu Glu Glu Gly Gly Ser Tyr Gly Glu Glu Glu Glu Glu Pro Ala Ile 
            20                  25                  30          


Glu Asp Val Gln Glu Glu Thr Gln Leu Asp Leu Ser Gly Asp Ser Val 
        35                  40                  45              


Lys Thr Ile Ala Lys Leu Trp Asp Ser Lys Met Phe Ala Glu Ile Met 
    50                  55                  60                  


Met Lys Ile Glu Glu Tyr Ile Ser Lys Gln Ala Lys Ala Ser Glu Val 
65                  70                  75                  80  


Met Gly Pro Val Glu Ala Ala Pro Glu Tyr Arg Val Ile Val Asp Ala 
                85                  90                  95      


Asn Asn Leu Thr Val Glu Ile Glu Asn Glu Leu Asn Ile Ile His Lys 
            100                 105                 110         


Phe Ile Arg Asp Lys Tyr Ser Lys Arg Phe Pro Glu Leu Glu Ser Leu 
        115                 120                 125             


Val Pro Asn Ala Leu Asp Tyr Ile Arg Thr Val Lys Glu Leu Gly Asn 
    130                 135                 140                 


Ser Leu Asp Lys Cys Lys Asn Asn Glu Asn Leu Gln Gln Ile Leu Thr 
145                 150                 155                 160 


Asn Ala Thr Ile Met Val Val Ser Val Thr Ala Ser Thr Thr Gln Gly 
                165                 170                 175     


Gln Gln Leu Ser Glu Glu Glu Leu Glu Arg Leu Glu Glu Ala Cys Asp 
            180                 185                 190         


Met Ala Leu Glu Leu Asn Ala Ser Lys His Arg Ile Tyr Glu Tyr Val 
        195                 200                 205             


Glu Ser Arg Met Ser Phe Ile Ala Pro Asn Leu Ser Ile Ile Ile Gly 
    210                 215                 220                 


Ala Ser Thr Ala Ala Lys Ile Met Gly Val Ala Gly Gly Leu Thr Asn 
225                 230                 235                 240 


Leu Ser Lys Met Pro Ala Cys Asn Ile Met Leu Leu Gly Ala Gln Arg 
                245                 250                 255     


Lys Thr Leu Ser Gly Phe Ser Ser Thr Ser Val Leu Pro His Thr Gly 
            260                 265                 270         


Tyr Ile Tyr His Ser Asp Ile Val Gln Ser Leu Pro Pro Asp Leu Arg 
        275                 280                 285             


Arg Lys Ala Ala Arg Leu Val Ala Ala Lys Cys Thr Leu Ala Ala Arg 
    290                 295                 300                 


Val Asp Ser Phe His Glu Ser Thr Glu Gly Lys Val Gly Tyr Glu Leu 
305                 310                 315                 320 


Lys Asp Glu Ile Glu Arg Lys Phe Asp Lys Trp Gln Glu Pro Pro Pro 
                325                 330                 335     


Val Lys Gln Val Lys Pro Leu Pro Ala Pro Leu Asp Gly Gln Arg Lys 
            340                 345                 350         


Lys Arg Gly Gly Arg Arg Tyr Arg Lys Met Lys Glu Arg Leu Gly Leu 
        355                 360                 365             


Thr Glu Ile Arg Lys Gln Ala Asn Arg Met Ser Phe Gly Glu Ile Glu 
    370                 375                 380                 


Glu Asp Ala Tyr Gln Glu Asp Leu Gly Phe Ser Leu Gly His Leu Gly 
385                 390                 395                 400 


Lys Ser Gly Ser Gly Arg Val Arg Gln Thr Gln Val Asn Glu Ala Thr 
                405                 410                 415     


Lys Ala Arg Ile Ser Lys Thr Leu Gln Arg Thr Leu Gln Lys Gln Ser 
            420                 425                 430         


Val Val Tyr Gly Gly Lys Ser Thr Ile Arg Asp Arg Ser Ser Gly Thr 
        435                 440                 445             


Ala Ser Ser Val Ala Phe Thr Pro Leu Gln Gly Leu Glu Ile Val Asn 
    450                 455                 460                 


Pro Gln Ala Ala Glu Lys Lys Val Ala Glu Ala Asn Gln Lys Tyr Phe 
465                 470                 475                 480 


Ser Ser Met Ala Glu Phe Leu Lys Val Lys Gly Glu Lys Ser Gly Leu 
                485                 490                 495     


Met Ser Thr Xaa 
            500 


<210>  18
<211>  493
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-sv40

<220>
<221>  misc_feature
<222>  (493)..(493)
<223>  Xaa = 任意种类氨基酸

<400>  18

Met Ser Leu Ala Asp Glu Leu Leu Ala Asp Leu Glu Glu Ala Ala Glu 
1               5                   10                  15      


Glu Glu Glu Gly Gly Ser Tyr Gly Glu Glu Glu Glu Glu Pro Ala Ile 
            20                  25                  30          


Glu Asp Val Gln Glu Glu Thr Gln Leu Asp Leu Ser Gly Asp Ser Val 
        35                  40                  45              


Lys Thr Ile Ala Lys Leu Trp Asp Ser Lys Met Phe Ala Glu Ile Met 
    50                  55                  60                  


Met Lys Ile Glu Glu Tyr Ile Ser Lys Gln Ala Lys Ala Ser Glu Val 
65                  70                  75                  80  


Met Gly Pro Val Glu Ala Ala Pro Glu Tyr Arg Val Ile Val Asp Ala 
                85                  90                  95      


Asn Asn Leu Thr Val Glu Ile Glu Asn Glu Leu Asn Ile Ile His Lys 
            100                 105                 110         


Phe Ile Arg Asp Lys Tyr Ser Lys Arg Phe Pro Glu Leu Glu Ser Leu 
        115                 120                 125             


Val Pro Asn Ala Leu Asp Tyr Ile Arg Thr Val Lys Glu Leu Gly Asn 
    130                 135                 140                 


Ser Leu Asp Lys Cys Lys Asn Asn Glu Asn Leu Gln Gln Ile Leu Thr 
145                 150                 155                 160 


Asn Ala Thr Ile Met Val Val Ser Val Thr Ala Ser Thr Thr Gln Gly 
                165                 170                 175     


Gln Gln Leu Ser Glu Glu Glu Leu Glu Arg Leu Glu Glu Ala Cys Asp 
            180                 185                 190         


Met Ala Leu Glu Leu Asn Ala Ser Lys His Arg Ile Tyr Glu Tyr Val 
        195                 200                 205             


Glu Ser Arg Met Ser Phe Ile Ala Pro Asn Leu Ser Ile Ile Ile Gly 
    210                 215                 220                 


Ala Ser Thr Ala Ala Lys Ile Met Gly Val Ala Gly Gly Leu Thr Asn 
225                 230                 235                 240 


Leu Ser Lys Met Pro Ala Cys Asn Ile Met Leu Leu Gly Ala Gln Arg 
                245                 250                 255     


Lys Thr Leu Ser Gly Phe Ser Ser Thr Ser Val Leu Pro His Thr Gly 
            260                 265                 270         


Tyr Ile Tyr His Ser Asp Ile Val Gln Ser Leu Pro Pro Asp Leu Arg 
        275                 280                 285             


Arg Lys Ala Ala Arg Leu Val Ala Ala Lys Cys Thr Leu Ala Ala Arg 
    290                 295                 300                 


Val Asp Ser Phe His Glu Ser Thr Glu Gly Lys Val Gly Tyr Glu Leu 
305                 310                 315                 320 


Lys Asp Glu Ile Glu Arg Lys Phe Asp Lys Trp Gln Glu Pro Pro Pro 
                325                 330                 335     


Val Lys Gln Val Lys Pro Leu Pro Ala Pro Leu Asp Gly Gln Pro Lys 
            340                 345                 350         


Lys Lys Arg Lys Val Arg Leu Gly Leu Thr Glu Ile Arg Lys Gln Ala 
        355                 360                 365             


Asn Arg Met Ser Phe Gly Glu Ile Glu Glu Asp Ala Tyr Gln Glu Asp 
    370                 375                 380                 


Leu Gly Phe Ser Leu Gly His Leu Gly Lys Ser Gly Ser Gly Arg Val 
385                 390                 395                 400 


Arg Gln Thr Gln Val Asn Glu Ala Thr Lys Ala Arg Ile Ser Lys Thr 
                405                 410                 415     


Leu Gln Arg Thr Leu Gln Lys Gln Ser Val Val Tyr Gly Gly Lys Ser 
            420                 425                 430         


Thr Ile Arg Asp Arg Ser Ser Gly Thr Ala Ser Ser Val Ala Phe Thr 
        435                 440                 445             


Pro Leu Gln Gly Leu Glu Ile Val Asn Pro Gln Ala Ala Glu Lys Lys 
    450                 455                 460                 


Val Ala Glu Ala Asn Gln Lys Tyr Phe Ser Ser Met Ala Glu Phe Leu 
465                 470                 475                 480 


Lys Val Lys Gly Glu Lys Ser Gly Leu Met Ser Thr Xaa 
                485                 490             


<210>  19
<211>  506
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-A1

<220>
<221>  misc_feature
<222>  (506)..(506)
<223>  Xaa = 任意种类氨基酸

<400>  19

Met Ser Leu Ala Asp Glu Leu Leu Ala Asp Leu Glu Glu Ala Ala Glu 
1               5                   10                  15      


Glu Glu Glu Gly Gly Ser Tyr Gly Glu Glu Glu Glu Glu Pro Ala Ile 
            20                  25                  30          


Glu Asp Val Gln Glu Glu Thr Gln Leu Asp Leu Ser Gly Asp Ser Val 
        35                  40                  45              


Lys Thr Ile Ala Lys Leu Trp Asp Ser Lys Met Phe Ala Glu Ile Met 
    50                  55                  60                  


Met Lys Ile Glu Glu Tyr Ile Ser Lys Gln Ala Lys Ala Ser Glu Val 
65                  70                  75                  80  


Met Gly Pro Val Glu Ala Ala Pro Glu Tyr Arg Val Ile Val Asp Ala 
                85                  90                  95      


Asn Asn Leu Thr Val Glu Ile Glu Asn Glu Leu Asn Ile Ile His Lys 
            100                 105                 110         


Phe Ile Arg Asp Lys Tyr Ser Lys Arg Phe Pro Glu Leu Glu Ser Leu 
        115                 120                 125             


Val Pro Asn Ala Leu Asp Tyr Ile Arg Thr Val Lys Glu Leu Gly Asn 
    130                 135                 140                 


Ser Leu Asp Lys Cys Lys Asn Asn Glu Asn Leu Gln Gln Ile Leu Thr 
145                 150                 155                 160 


Asn Ala Thr Ile Met Val Val Ser Val Thr Ala Ser Thr Thr Gln Gly 
                165                 170                 175     


Gln Gln Leu Ser Glu Glu Glu Leu Glu Arg Leu Glu Glu Ala Cys Asp 
            180                 185                 190         


Met Ala Leu Glu Leu Asn Ala Ser Lys His Arg Ile Tyr Glu Tyr Val 
        195                 200                 205             


Glu Ser Arg Met Ser Phe Ile Ala Pro Asn Leu Ser Ile Ile Ile Gly 
    210                 215                 220                 


Ala Ser Thr Ala Ala Lys Ile Met Gly Val Ala Gly Gly Leu Thr Asn 
225                 230                 235                 240 


Leu Ser Lys Met Pro Ala Cys Asn Ile Met Leu Leu Gly Ala Gln Arg 
                245                 250                 255     


Lys Thr Leu Ser Gly Phe Ser Ser Thr Ser Val Leu Pro His Thr Gly 
            260                 265                 270         


Tyr Ile Tyr His Ser Asp Ile Val Gln Ser Leu Pro Pro Asp Leu Arg 
        275                 280                 285             


Arg Lys Ala Ala Arg Leu Val Ala Ala Lys Cys Thr Leu Ala Ala Arg 
    290                 295                 300                 


Val Asp Ser Phe His Glu Ser Thr Glu Gly Lys Val Gly Tyr Glu Leu 
305                 310                 315                 320 


Lys Asp Glu Ile Glu Arg Lys Phe Asp Lys Trp Gln Glu Pro Pro Pro 
                325                 330                 335     


Val Lys Gln Val Lys Pro Leu Pro Ala Pro Leu Asp Gly Gln Ser Ser 
            340                 345                 350         


Asn Phe Gly Pro Met Lys Gly Gly Asn Arg Phe Phe Arg Ser Ser Gly 
        355                 360                 365             


Pro Tyr Arg Leu Gly Leu Thr Glu Ile Arg Lys Gln Ala Asn Arg Met 
    370                 375                 380                 


Ser Phe Gly Glu Ile Glu Glu Asp Ala Tyr Gln Glu Asp Leu Gly Phe 
385                 390                 395                 400 


Ser Leu Gly His Leu Gly Lys Ser Gly Ser Gly Arg Val Arg Gln Thr 
                405                 410                 415     


Gln Val Asn Glu Ala Thr Lys Ala Arg Ile Ser Lys Thr Leu Gln Arg 
            420                 425                 430         


Thr Leu Gln Lys Gln Ser Val Val Tyr Gly Gly Lys Ser Thr Ile Arg 
        435                 440                 445             


Asp Arg Ser Ser Gly Thr Ala Ser Ser Val Ala Phe Thr Pro Leu Gln 
    450                 455                 460                 


Gly Leu Glu Ile Val Asn Pro Gln Ala Ala Glu Lys Lys Val Ala Glu 
465                 470                 475                 480 


Ala Asn Gln Lys Tyr Phe Ser Ser Met Ala Glu Phe Leu Lys Val Lys 
                485                 490                 495     


Gly Glu Lys Ser Gly Leu Met Ser Thr Xaa 
            500                 505     


<210>  20
<211>  498
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-TAT

<220>
<221>  misc_feature
<222>  (498)..(498)
<223>  Xaa = 任意种类氨基酸

<400>  20

Met Ser Leu Ala Asp Glu Leu Leu Ala Asp Leu Glu Glu Ala Ala Glu 
1               5                   10                  15      


Glu Glu Glu Gly Gly Ser Tyr Gly Glu Glu Glu Glu Glu Pro Ala Ile 
            20                  25                  30          


Glu Asp Val Gln Glu Glu Thr Gln Leu Asp Leu Ser Gly Asp Ser Val 
        35                  40                  45              


Lys Thr Ile Ala Lys Leu Trp Asp Ser Lys Met Phe Ala Glu Ile Met 
    50                  55                  60                  


Met Lys Ile Glu Glu Tyr Ile Ser Lys Gln Ala Lys Ala Ser Glu Val 
65                  70                  75                  80  


Met Gly Pro Val Glu Ala Ala Pro Glu Tyr Arg Val Ile Val Asp Ala 
                85                  90                  95      


Asn Asn Leu Thr Val Glu Ile Glu Asn Glu Leu Asn Ile Ile His Lys 
            100                 105                 110         


Phe Ile Arg Asp Lys Tyr Ser Lys Arg Phe Pro Glu Leu Glu Ser Leu 
        115                 120                 125             


Val Pro Asn Ala Leu Asp Tyr Ile Arg Thr Val Lys Glu Leu Gly Asn 
    130                 135                 140                 


Ser Leu Asp Lys Cys Lys Asn Asn Glu Asn Leu Gln Gln Ile Leu Thr 
145                 150                 155                 160 


Asn Ala Thr Ile Met Val Val Ser Val Thr Ala Ser Thr Thr Gln Gly 
                165                 170                 175     


Gln Gln Leu Ser Glu Glu Glu Leu Glu Arg Leu Glu Glu Ala Cys Asp 
            180                 185                 190         


Met Ala Leu Glu Leu Asn Ala Ser Lys His Arg Ile Tyr Glu Tyr Val 
        195                 200                 205             


Glu Ser Arg Met Ser Phe Ile Ala Pro Asn Leu Ser Ile Ile Ile Gly 
    210                 215                 220                 


Ala Ser Thr Ala Ala Lys Ile Met Gly Val Ala Gly Gly Leu Thr Asn 
225                 230                 235                 240 


Leu Ser Lys Met Pro Ala Cys Asn Ile Met Leu Leu Gly Ala Gln Arg 
                245                 250                 255     


Lys Thr Leu Ser Gly Phe Ser Ser Thr Ser Val Leu Pro His Thr Gly 
            260                 265                 270         


Tyr Ile Tyr His Ser Asp Ile Val Gln Ser Leu Pro Pro Asp Leu Arg 
        275                 280                 285             


Arg Lys Ala Ala Arg Leu Val Ala Ala Lys Cys Thr Leu Ala Ala Arg 
    290                 295                 300                 


Val Asp Ser Phe His Glu Ser Thr Glu Gly Lys Val Gly Tyr Glu Leu 
305                 310                 315                 320 


Lys Asp Glu Ile Glu Arg Lys Phe Asp Lys Trp Gln Glu Pro Pro Pro 
                325                 330                 335     


Val Lys Gln Val Lys Pro Leu Pro Ala Pro Leu Asp Gly Gln Gly Arg 
            340                 345                 350         


Lys Lys Arg Arg Gln Arg Arg Arg Ala Pro Arg Leu Gly Leu Thr Glu 
        355                 360                 365             


Ile Arg Lys Gln Ala Asn Arg Met Ser Phe Gly Glu Ile Glu Glu Asp 
    370                 375                 380                 


Ala Tyr Gln Glu Asp Leu Gly Phe Ser Leu Gly His Leu Gly Lys Ser 
385                 390                 395                 400 


Gly Ser Gly Arg Val Arg Gln Thr Gln Val Asn Glu Ala Thr Lys Ala 
                405                 410                 415     


Arg Ile Ser Lys Thr Leu Gln Arg Thr Leu Gln Lys Gln Ser Val Val 
            420                 425                 430         


Tyr Gly Gly Lys Ser Thr Ile Arg Asp Arg Ser Ser Gly Thr Ala Ser 
        435                 440                 445             


Ser Val Ala Phe Thr Pro Leu Gln Gly Leu Glu Ile Val Asn Pro Gln 
    450                 455                 460                 


Ala Ala Glu Lys Lys Val Ala Glu Ala Asn Gln Lys Tyr Phe Ser Ser 
465                 470                 475                 480 


Met Ala Glu Phe Leu Lys Val Lys Gly Glu Lys Ser Gly Leu Met Ser 
                485                 490                 495     


Thr Xaa 
        


<210>  21
<211>  513
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-D

<220>
<221>  misc_feature
<222>  (513)..(513)
<223>  Xaa = 任意种类氨基酸

<400>  21

Met Ser Leu Ala Asp Glu Leu Leu Ala Asp Leu Glu Glu Ala Ala Glu 
1               5                   10                  15      


Glu Glu Glu Gly Gly Ser Tyr Gly Glu Glu Glu Glu Glu Pro Ala Ile 
            20                  25                  30          


Glu Asp Val Gln Glu Glu Thr Gln Leu Asp Leu Ser Gly Asp Ser Val 
        35                  40                  45              


Lys Thr Ile Ala Lys Leu Trp Asp Ser Lys Met Phe Ala Glu Ile Met 
    50                  55                  60                  


Met Lys Ile Glu Glu Tyr Ile Ser Lys Gln Ala Lys Ala Ser Glu Val 
65                  70                  75                  80  


Met Gly Pro Val Glu Ala Ala Pro Glu Tyr Arg Val Ile Val Asp Ala 
                85                  90                  95      


Asn Asn Leu Thr Val Glu Ile Glu Asn Glu Leu Asn Ile Ile His Lys 
            100                 105                 110         


Phe Ile Arg Asp Lys Tyr Ser Lys Arg Phe Pro Glu Leu Glu Ser Leu 
        115                 120                 125             


Val Pro Asn Ala Leu Asp Tyr Ile Arg Thr Val Lys Glu Leu Gly Asn 
    130                 135                 140                 


Ser Leu Asp Lys Cys Lys Asn Asn Glu Asn Leu Gln Gln Ile Leu Thr 
145                 150                 155                 160 


Asn Ala Thr Ile Met Val Val Ser Val Thr Ala Ser Thr Thr Gln Gly 
                165                 170                 175     


Gln Gln Leu Ser Glu Glu Glu Leu Glu Arg Leu Glu Glu Ala Cys Asp 
            180                 185                 190         


Met Ala Leu Glu Leu Asn Ala Ser Lys His Arg Ile Tyr Glu Tyr Val 
        195                 200                 205             


Glu Ser Arg Met Ser Phe Ile Ala Pro Asn Leu Ser Ile Ile Ile Gly 
    210                 215                 220                 


Ala Ser Thr Ala Ala Lys Ile Met Gly Val Ala Gly Gly Leu Thr Asn 
225                 230                 235                 240 


Leu Ser Lys Met Pro Ala Cys Asn Ile Met Leu Leu Gly Ala Gln Arg 
                245                 250                 255     


Lys Thr Leu Ser Gly Phe Ser Ser Thr Ser Val Leu Pro His Thr Gly 
            260                 265                 270         


Tyr Ile Tyr His Ser Asp Ile Val Gln Ser Leu Pro Pro Asp Leu Arg 
        275                 280                 285             


Arg Lys Ala Ala Arg Leu Val Ala Ala Lys Cys Thr Leu Ala Ala Arg 
    290                 295                 300                 


Val Asp Ser Phe His Glu Ser Thr Glu Gly Lys Val Gly Tyr Glu Leu 
305                 310                 315                 320 


Lys Asp Glu Ile Glu Arg Lys Phe Asp Lys Trp Gln Glu Pro Pro Pro 
                325                 330                 335     


Val Lys Gln Val Lys Pro Leu Pro Ala Pro Leu Asp Gly Gln Tyr Gly 
            340                 345                 350         


Asp Tyr Ser Asn Gln Gln Ser Gly Tyr Gly Lys Val Ser Arg Arg Gly 
        355                 360                 365             


Gly His Gln Asn Ser Tyr Lys Pro Tyr Arg Leu Gly Leu Thr Glu Ile 
    370                 375                 380                 


Arg Lys Gln Ala Asn Arg Met Ser Phe Gly Glu Ile Glu Glu Asp Ala 
385                 390                 395                 400 


Tyr Gln Glu Asp Leu Gly Phe Ser Leu Gly His Leu Gly Lys Ser Gly 
                405                 410                 415     


Ser Gly Arg Val Arg Gln Thr Gln Val Asn Glu Ala Thr Lys Ala Arg 
            420                 425                 430         


Ile Ser Lys Thr Leu Gln Arg Thr Leu Gln Lys Gln Ser Val Val Tyr 
        435                 440                 445             


Gly Gly Lys Ser Thr Ile Arg Asp Arg Ser Ser Gly Thr Ala Ser Ser 
    450                 455                 460                 


Val Ala Phe Thr Pro Leu Gln Gly Leu Glu Ile Val Asn Pro Gln Ala 
465                 470                 475                 480 


Ala Glu Lys Lys Val Ala Glu Ala Asn Gln Lys Tyr Phe Ser Ser Met 
                485                 490                 495     


Ala Glu Phe Leu Lys Val Lys Gly Glu Lys Ser Gly Leu Met Ser Thr 
            500                 505                 510         


Xaa 
    


<210>  22
<211>  512
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-M

<220>
<221>  misc_feature
<222>  (512)..(512)
<223>  Xaa = 任意种类氨基酸

<400>  22

Met Ser Leu Ala Asp Glu Leu Leu Ala Asp Leu Glu Glu Ala Ala Glu 
1               5                   10                  15      


Glu Glu Glu Gly Gly Ser Tyr Gly Glu Glu Glu Glu Glu Pro Ala Ile 
            20                  25                  30          


Glu Asp Val Gln Glu Glu Thr Gln Leu Asp Leu Ser Gly Asp Ser Val 
        35                  40                  45              


Lys Thr Ile Ala Lys Leu Trp Asp Ser Lys Met Phe Ala Glu Ile Met 
    50                  55                  60                  


Met Lys Ile Glu Glu Tyr Ile Ser Lys Gln Ala Lys Ala Ser Glu Val 
65                  70                  75                  80  


Met Gly Pro Val Glu Ala Ala Pro Glu Tyr Arg Val Ile Val Asp Ala 
                85                  90                  95      


Asn Asn Leu Thr Val Glu Ile Glu Asn Glu Leu Asn Ile Ile His Lys 
            100                 105                 110         


Phe Ile Arg Asp Lys Tyr Ser Lys Arg Phe Pro Glu Leu Glu Ser Leu 
        115                 120                 125             


Val Pro Asn Ala Leu Asp Tyr Ile Arg Thr Val Lys Glu Leu Gly Asn 
    130                 135                 140                 


Ser Leu Asp Lys Cys Lys Asn Asn Glu Asn Leu Gln Gln Ile Leu Thr 
145                 150                 155                 160 


Asn Ala Thr Ile Met Val Val Ser Val Thr Ala Ser Thr Thr Gln Gly 
                165                 170                 175     


Gln Gln Leu Ser Glu Glu Glu Leu Glu Arg Leu Glu Glu Ala Cys Asp 
            180                 185                 190         


Met Ala Leu Glu Leu Asn Ala Ser Lys His Arg Ile Tyr Glu Tyr Val 
        195                 200                 205             


Glu Ser Arg Met Ser Phe Ile Ala Pro Asn Leu Ser Ile Ile Ile Gly 
    210                 215                 220                 


Ala Ser Thr Ala Ala Lys Ile Met Gly Val Ala Gly Gly Leu Thr Asn 
225                 230                 235                 240 


Leu Ser Lys Met Pro Ala Cys Asn Ile Met Leu Leu Gly Ala Gln Arg 
                245                 250                 255     


Lys Thr Leu Ser Gly Phe Ser Ser Thr Ser Val Leu Pro His Thr Gly 
            260                 265                 270         


Tyr Ile Tyr His Ser Asp Ile Val Gln Ser Leu Pro Pro Asp Leu Arg 
        275                 280                 285             


Arg Lys Ala Ala Arg Leu Val Ala Ala Lys Cys Thr Leu Ala Ala Arg 
    290                 295                 300                 


Val Asp Ser Phe His Glu Ser Thr Glu Gly Lys Val Gly Tyr Glu Leu 
305                 310                 315                 320 


Lys Asp Glu Ile Glu Arg Lys Phe Asp Lys Trp Gln Glu Pro Pro Pro 
                325                 330                 335     


Val Lys Gln Val Lys Pro Leu Pro Ala Pro Leu Asp Gly Gln Gly Glu 
            340                 345                 350         


Gly Glu Arg Pro Ala Gln Asn Glu Lys Arg Lys Glu Asn Ile Lys Arg 
        355                 360                 365             


Gly Gly Asn Arg Phe Glu Pro Tyr Arg Leu Gly Leu Thr Glu Ile Arg 
    370                 375                 380                 


Lys Gln Ala Asn Arg Met Ser Phe Gly Glu Ile Glu Glu Asp Ala Tyr 
385                 390                 395                 400 


Gln Glu Asp Leu Gly Phe Ser Leu Gly His Leu Gly Lys Ser Gly Ser 
                405                 410                 415     


Gly Arg Val Arg Gln Thr Gln Val Asn Glu Ala Thr Lys Ala Arg Ile 
            420                 425                 430         


Ser Lys Thr Leu Gln Arg Thr Leu Gln Lys Gln Ser Val Val Tyr Gly 
        435                 440                 445             


Gly Lys Ser Thr Ile Arg Asp Arg Ser Ser Gly Thr Ala Ser Ser Val 
    450                 455                 460                 


Ala Phe Thr Pro Leu Gln Gly Leu Glu Ile Val Asn Pro Gln Ala Ala 
465                 470                 475                 480 


Glu Lys Lys Val Ala Glu Ala Asn Gln Lys Tyr Phe Ser Ser Met Ala 
                485                 490                 495     


Glu Phe Leu Lys Val Lys Gly Glu Lys Ser Gly Leu Met Ser Thr Xaa 
            500                 505                 510         


<210>  23
<211>  502
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-NP

<220>
<221>  misc_feature
<222>  (502)..(502)
<223>  Xaa = 任意种类氨基酸

<400>  23

Met Ser Leu Ala Asp Glu Leu Leu Ala Asp Leu Glu Glu Ala Ala Glu 
1               5                   10                  15      


Glu Glu Glu Gly Gly Ser Tyr Gly Glu Glu Glu Glu Glu Pro Ala Ile 
            20                  25                  30          


Glu Asp Val Gln Glu Glu Thr Gln Leu Asp Leu Ser Gly Asp Ser Val 
        35                  40                  45              


Lys Thr Ile Ala Lys Leu Trp Asp Ser Lys Met Phe Ala Glu Ile Met 
    50                  55                  60                  


Met Lys Ile Glu Glu Tyr Ile Ser Lys Gln Ala Lys Ala Ser Glu Val 
65                  70                  75                  80  


Met Gly Pro Val Glu Ala Ala Pro Glu Tyr Arg Val Ile Val Asp Ala 
                85                  90                  95      


Asn Asn Leu Thr Val Glu Ile Glu Asn Glu Leu Asn Ile Ile His Lys 
            100                 105                 110         


Phe Ile Arg Asp Lys Tyr Ser Lys Arg Phe Pro Glu Leu Glu Ser Leu 
        115                 120                 125             


Val Pro Asn Ala Leu Asp Tyr Ile Arg Thr Val Lys Glu Leu Gly Asn 
    130                 135                 140                 


Ser Leu Asp Lys Cys Lys Asn Asn Glu Asn Leu Gln Gln Ile Leu Thr 
145                 150                 155                 160 


Asn Ala Thr Ile Met Val Val Ser Val Thr Ala Ser Thr Thr Gln Gly 
                165                 170                 175     


Gln Gln Leu Ser Glu Glu Glu Leu Glu Arg Leu Glu Glu Ala Cys Asp 
            180                 185                 190         


Met Ala Leu Glu Leu Asn Ala Ser Lys His Arg Ile Tyr Glu Tyr Val 
        195                 200                 205             


Glu Ser Arg Met Ser Phe Ile Ala Pro Asn Leu Ser Ile Ile Ile Gly 
    210                 215                 220                 


Ala Ser Thr Ala Ala Lys Ile Met Gly Val Ala Gly Gly Leu Thr Asn 
225                 230                 235                 240 


Leu Ser Lys Met Pro Ala Cys Asn Ile Met Leu Leu Gly Ala Gln Arg 
                245                 250                 255     


Lys Thr Leu Ser Gly Phe Ser Ser Thr Ser Val Leu Pro His Thr Gly 
            260                 265                 270         


Tyr Ile Tyr His Ser Asp Ile Val Gln Ser Leu Pro Pro Asp Leu Arg 
        275                 280                 285             


Arg Lys Ala Ala Arg Leu Val Ala Ala Lys Cys Thr Leu Ala Ala Arg 
    290                 295                 300                 


Val Asp Ser Phe His Glu Ser Thr Glu Gly Lys Val Gly Tyr Glu Leu 
305                 310                 315                 320 


Lys Asp Glu Ile Glu Arg Lys Phe Asp Lys Trp Gln Glu Pro Pro Pro 
                325                 330                 335     


Val Lys Gln Val Lys Pro Leu Pro Ala Pro Leu Asp Gly Gln Lys Arg 
            340                 345                 350         


Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Arg Leu 
        355                 360                 365             


Gly Leu Thr Glu Ile Arg Lys Gln Ala Asn Arg Met Ser Phe Gly Glu 
    370                 375                 380                 


Ile Glu Glu Asp Ala Tyr Gln Glu Asp Leu Gly Phe Ser Leu Gly His 
385                 390                 395                 400 


Leu Gly Lys Ser Gly Ser Gly Arg Val Arg Gln Thr Gln Val Asn Glu 
                405                 410                 415     


Ala Thr Lys Ala Arg Ile Ser Lys Thr Leu Gln Arg Thr Leu Gln Lys 
            420                 425                 430         


Gln Ser Val Val Tyr Gly Gly Lys Ser Thr Ile Arg Asp Arg Ser Ser 
        435                 440                 445             


Gly Thr Ala Ser Ser Val Ala Phe Thr Pro Leu Gln Gly Leu Glu Ile 
    450                 455                 460                 


Val Asn Pro Gln Ala Ala Glu Lys Lys Val Ala Glu Ala Asn Gln Lys 
465                 470                 475                 480 


Tyr Phe Ser Ser Met Ala Glu Phe Leu Lys Val Lys Gly Glu Lys Ser 
                485                 490                 495     


Gly Leu Met Ser Thr Xaa 
            500         


<210>  24
<211>  505
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-SRY

<220>
<221>  misc_feature
<222>  (505)..(505)
<223>  Xaa = 任意种类氨基酸

<400>  24

Met Ser Leu Ala Asp Glu Leu Leu Ala Asp Leu Glu Glu Ala Ala Glu 
1               5                   10                  15      


Glu Glu Glu Gly Gly Ser Tyr Gly Glu Glu Glu Glu Glu Pro Ala Ile 
            20                  25                  30          


Glu Asp Val Gln Glu Glu Thr Gln Leu Asp Leu Ser Gly Asp Ser Val 
        35                  40                  45              


Lys Thr Ile Ala Lys Leu Trp Asp Ser Lys Met Phe Ala Glu Ile Met 
    50                  55                  60                  


Met Lys Ile Glu Glu Tyr Ile Ser Lys Gln Ala Lys Ala Ser Glu Val 
65                  70                  75                  80  


Met Gly Pro Val Glu Ala Ala Pro Glu Tyr Arg Val Ile Val Asp Ala 
                85                  90                  95      


Asn Asn Leu Thr Val Glu Ile Glu Asn Glu Leu Asn Ile Ile His Lys 
            100                 105                 110         


Phe Ile Arg Asp Lys Tyr Ser Lys Arg Phe Pro Glu Leu Glu Ser Leu 
        115                 120                 125             


Val Pro Asn Ala Leu Asp Tyr Ile Arg Thr Val Lys Glu Leu Gly Asn 
    130                 135                 140                 


Ser Leu Asp Lys Cys Lys Asn Asn Glu Asn Leu Gln Gln Ile Leu Thr 
145                 150                 155                 160 


Asn Ala Thr Ile Met Val Val Ser Val Thr Ala Ser Thr Thr Gln Gly 
                165                 170                 175     


Gln Gln Leu Ser Glu Glu Glu Leu Glu Arg Leu Glu Glu Ala Cys Asp 
            180                 185                 190         


Met Ala Leu Glu Leu Asn Ala Ser Lys His Arg Ile Tyr Glu Tyr Val 
        195                 200                 205             


Glu Ser Arg Met Ser Phe Ile Ala Pro Asn Leu Ser Ile Ile Ile Gly 
    210                 215                 220                 


Ala Ser Thr Ala Ala Lys Ile Met Gly Val Ala Gly Gly Leu Thr Asn 
225                 230                 235                 240 


Leu Ser Lys Met Pro Ala Cys Asn Ile Met Leu Leu Gly Ala Gln Arg 
                245                 250                 255     


Lys Thr Leu Ser Gly Phe Ser Ser Thr Ser Val Leu Pro His Thr Gly 
            260                 265                 270         


Tyr Ile Tyr His Ser Asp Ile Val Gln Ser Leu Pro Pro Asp Leu Arg 
        275                 280                 285             


Arg Lys Ala Ala Arg Leu Val Ala Ala Lys Cys Thr Leu Ala Ala Arg 
    290                 295                 300                 


Val Asp Ser Phe His Glu Ser Thr Glu Gly Lys Val Gly Tyr Glu Leu 
305                 310                 315                 320 


Lys Asp Glu Ile Glu Arg Lys Phe Asp Lys Trp Gln Glu Pro Pro Pro 
                325                 330                 335     


Val Lys Gln Val Lys Pro Leu Pro Ala Pro Leu Asp Gly Gln His Arg 
            340                 345                 350         


Glu Lys Tyr Pro Asn Tyr Lys Tyr Arg Pro Arg Arg Lys Ala Lys Met 
        355                 360                 365             


Leu Arg Leu Gly Leu Thr Glu Ile Arg Lys Gln Ala Asn Arg Met Ser 
    370                 375                 380                 


Phe Gly Glu Ile Glu Glu Asp Ala Tyr Gln Glu Asp Leu Gly Phe Ser 
385                 390                 395                 400 


Leu Gly His Leu Gly Lys Ser Gly Ser Gly Arg Val Arg Gln Thr Gln 
                405                 410                 415     


Val Asn Glu Ala Thr Lys Ala Arg Ile Ser Lys Thr Leu Gln Arg Thr 
            420                 425                 430         


Leu Gln Lys Gln Ser Val Val Tyr Gly Gly Lys Ser Thr Ile Arg Asp 
        435                 440                 445             


Arg Ser Ser Gly Thr Ala Ser Ser Val Ala Phe Thr Pro Leu Gln Gly 
    450                 455                 460                 


Leu Glu Ile Val Asn Pro Gln Ala Ala Glu Lys Lys Val Ala Glu Ala 
465                 470                 475                 480 


Asn Gln Lys Tyr Phe Ser Ser Met Ala Glu Phe Leu Lys Val Lys Gly 
                485                 490                 495     


Glu Lys Ser Gly Leu Met Ser Thr Xaa 
            500                 505 


<210>  25
<211>  1500
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  野生型PRPF31

<400>  25
atgtctctgg cagatgagct cttagctgat ctcgaagagg cagcagaaga ggaggaagga       60

ggaagctatg gggaggaaga agaggagcca gcgatcgagg atgtgcagga ggagacacag      120

ctggatcttt ccggggattc agtcaagacc atcgccaagc tatgggatag taagatgttt      180

gctgagatta tgatgaagat tgaggagtat atcagcaagc aagccaaagc ttcagaagtg      240

atgggaccag tggaggccgc gcctgaatac cgcgtcatcg tggatgccaa caacctgacc      300

gtggagatcg aaaacgagct gaacatcatc cataagttca tccgggataa gtactcaaag      360

agattccctg aactggagtc cttggtcccc aatgcactgg attacatccg cacggtcaag      420

gagctgggca acagcctgga caagtgcaag aacaatgaga acctgcagca gatcctcacc      480

aatgccacca tcatggtcgt cagcgtcacc gcctccacca cccaggggca gcagctgtcg      540

gaggaggagc tggagcggct ggaggaggcc tgcgacatgg cgctggagct gaacgcctcc      600

aagcaccgca tctacgagta tgtggagtcc cggatgtcct tcatcgcacc caacctgtcc      660

atcattatcg gggcatccac ggccgccaag atcatgggtg tggccggcgg cctgaccaac      720

ctctccaaga tgcccgcctg caacatcatg ctgctcgggg cccagcgcaa gacgctgtcg      780

ggcttctcgt ctacctcagt gctgccccac accggctaca tctaccacag tgacatcgtg      840

cagtccctgc caccggatct gcggcggaaa gcggcccggc tggtggccgc caagtgcaca      900

ctggcagccc gtgtggacag tttccacgag agcacagaag ggaaggtggg ctacgaactg      960

aaggatgaga tcgagcgcaa attcgacaag tggcaggagc cgccgcctgt gaagcaggtg     1020

aagccgctgc ctgcgcccct ggatggacag cggaagaagc gaggcggccg caggtaccgc     1080

aagatgaagg agcggctggg gctgacggag atccggaagc aggccaaccg tatgagcttc     1140

ggagagatcg aggaggacgc ctaccaggag gacctgggat tcagcctggg ccacctgggc     1200

aagtcgggca gtgggcgtgt gcggcagaca caggtaaacg aggccaccaa ggccaggatc     1260

tccaagacgc tgcagcggac cctgcagaag cagagcgtcg tatatggcgg gaagtccacc     1320

atccgcgacc gctcctcggg cacggcctcc agcgtggcct tcaccccact ccagggcctg     1380

gagattgtga acccacaggc ggcagagaag aaggtggctg aggccaacca gaagtatttc     1440

tccagcatgg ctgagttcct caaggtcaag ggcgagaaga gtggccttat gtccacctga     1500


<210>  26
<211>  1479
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-sv40

<400>  26
atgtctctgg cagatgagct cttagctgat ctcgaagagg cagcagaaga ggaggaagga       60

ggaagctatg gggaggaaga agaggagcca gcgatcgagg atgtgcagga ggagacacag      120

ctggatcttt ccggggattc agtcaagacc atcgccaagc tatgggatag taagatgttt      180

gctgagatta tgatgaagat tgaggagtat atcagcaagc aagccaaagc ttcagaagtg      240

atgggaccag tggaggccgc gcctgaatac cgcgtcatcg tggatgccaa caacctgacc      300

gtggagatcg aaaacgagct gaacatcatc cataagttca tccgggataa gtactcaaag      360

agattccctg aactggagtc cttggtcccc aatgcactgg attacatccg cacggtcaag      420

gagctgggca acagcctgga caagtgcaag aacaatgaga acctgcagca gatcctcacc      480

aatgccacca tcatggtcgt cagcgtcacc gcctccacca cccaggggca gcagctgtcg      540

gaggaggagc tggagcggct ggaggaggcc tgcgacatgg cgctggagct gaacgcctcc      600

aagcaccgca tctacgagta tgtggagtcc cggatgtcct tcatcgcacc caacctgtcc      660

atcattatcg gggcatccac ggccgccaag atcatgggtg tggccggcgg cctgaccaac      720

ctctccaaga tgcccgcctg caacatcatg ctgctcgggg cccagcgcaa gacgctgtcg      780

ggcttctcgt ctacctcagt gctgccccac accggctaca tctaccacag tgacatcgtg      840

cagtccctgc caccggatct gcggcggaaa gcggcccggc tggtggccgc caagtgcaca      900

ctggcagccc gtgtggacag tttccacgag agcacagaag ggaaggtggg ctacgaactg      960

aaggatgaga tcgagcgcaa attcgacaag tggcaggagc cgccgcctgt gaagcaggtg     1020

aagccgctgc ctgcgcccct ggatggacag cctaagaaga aaagaaaggt gcggctgggg     1080

ctgacggaga tccggaagca ggccaaccgt atgagcttcg gagagatcga ggaggacgcc     1140

taccaggagg acctgggatt cagcctgggc cacctgggca agtcgggcag tgggcgtgtg     1200

cggcagacac aggtaaacga ggccaccaag gccaggatct ccaagacgct gcagcggacc     1260

ctgcagaagc agagcgtcgt atatggcggg aagtccacca tccgcgaccg ctcctcgggc     1320

acggcctcca gcgtggcctt caccccactc cagggcctgg agattgtgaa cccacaggcg     1380

gcagagaaga aggtggctga ggccaaccag aagtatttct ccagcatggc tgagttcctc     1440

aaggtcaagg gcgagaagag tggccttatg tccacctga                            1479


<210>  27
<211>  1518
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-A1

<400>  27
atgtctctgg cagatgagct cttagctgat ctcgaagagg cagcagaaga ggaggaagga       60

ggaagctatg gggaggaaga agaggagcca gcgatcgagg atgtgcagga ggagacacag      120

ctggatcttt ccggggattc agtcaagacc atcgccaagc tatgggatag taagatgttt      180

gctgagatta tgatgaagat tgaggagtat atcagcaagc aagccaaagc ttcagaagtg      240

atgggaccag tggaggccgc gcctgaatac cgcgtcatcg tggatgccaa caacctgacc      300

gtggagatcg aaaacgagct gaacatcatc cataagttca tccgggataa gtactcaaag      360

agattccctg aactggagtc cttggtcccc aatgcactgg attacatccg cacggtcaag      420

gagctgggca acagcctgga caagtgcaag aacaatgaga acctgcagca gatcctcacc      480

aatgccacca tcatggtcgt cagcgtcacc gcctccacca cccaggggca gcagctgtcg      540

gaggaggagc tggagcggct ggaggaggcc tgcgacatgg cgctggagct gaacgcctcc      600

aagcaccgca tctacgagta tgtggagtcc cggatgtcct tcatcgcacc caacctgtcc      660

atcattatcg gggcatccac ggccgccaag atcatgggtg tggccggcgg cctgaccaac      720

ctctccaaga tgcccgcctg caacatcatg ctgctcgggg cccagcgcaa gacgctgtcg      780

ggcttctcgt ctacctcagt gctgccccac accggctaca tctaccacag tgacatcgtg      840

cagtccctgc caccggatct gcggcggaaa gcggcccggc tggtggccgc caagtgcaca      900

ctggcagccc gtgtggacag tttccacgag agcacagaag ggaaggtggg ctacgaactg      960

aaggatgaga tcgagcgcaa attcgacaag tggcaggagc cgccgcctgt gaagcaggtg     1020

aagccgctgc ctgcgcccct ggatggacag tctagcaact tcggccctat gaagggcgga     1080

aaccggttct ttagaagctc cggcccctac cggctggggc tgacggagat ccggaagcag     1140

gccaaccgta tgagcttcgg agagatcgag gaggacgcct accaggagga cctgggattc     1200

agcctgggcc acctgggcaa gtcgggcagt gggcgtgtgc ggcagacaca ggtaaacgag     1260

gccaccaagg ccaggatctc caagacgctg cagcggaccc tgcagaagca gagcgtcgta     1320

tatggcggga agtccaccat ccgcgaccgc tcctcgggca cggcctccag cgtggccttc     1380

accccactcc agggcctgga gattgtgaac ccacaggcgg cagagaagaa ggtggctgag     1440

gccaaccaga agtatttctc cagcatggct gagttcctca aggtcaaggg cgagaagagt     1500

ggccttatgt ccacctga                                                   1518


<210>  28
<211>  1494
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-TAT

<400>  28
atgtctctgg cagatgagct cttagctgat ctcgaagagg cagcagaaga ggaggaagga       60

ggaagctatg gggaggaaga agaggagcca gcgatcgagg atgtgcagga ggagacacag      120

ctggatcttt ccggggattc agtcaagacc atcgccaagc tatgggatag taagatgttt      180

gctgagatta tgatgaagat tgaggagtat atcagcaagc aagccaaagc ttcagaagtg      240

atgggaccag tggaggccgc gcctgaatac cgcgtcatcg tggatgccaa caacctgacc      300

gtggagatcg aaaacgagct gaacatcatc cataagttca tccgggataa gtactcaaag      360

agattccctg aactggagtc cttggtcccc aatgcactgg attacatccg cacggtcaag      420

gagctgggca acagcctgga caagtgcaag aacaatgaga acctgcagca gatcctcacc      480

aatgccacca tcatggtcgt cagcgtcacc gcctccacca cccaggggca gcagctgtcg      540

gaggaggagc tggagcggct ggaggaggcc tgcgacatgg cgctggagct gaacgcctcc      600

aagcaccgca tctacgagta tgtggagtcc cggatgtcct tcatcgcacc caacctgtcc      660

atcattatcg gggcatccac ggccgccaag atcatgggtg tggccggcgg cctgaccaac      720

ctctccaaga tgcccgcctg caacatcatg ctgctcgggg cccagcgcaa gacgctgtcg      780

ggcttctcgt ctacctcagt gctgccccac accggctaca tctaccacag tgacatcgtg      840

cagtccctgc caccggatct gcggcggaaa gcggcccggc tggtggccgc caagtgcaca      900

ctggcagccc gtgtggacag tttccacgag agcacagaag ggaaggtggg ctacgaactg      960

aaggatgaga tcgagcgcaa attcgacaag tggcaggagc cgccgcctgt gaagcaggtg     1020

aagccgctgc ctgcgcccct ggatggacag ggcagaaaga agcggagaca gagaagacgg     1080

gcccctcggc tggggctgac ggagatccgg aagcaggcca accgtatgag cttcggagag     1140

atcgaggagg acgcctacca ggaggacctg ggattcagcc tgggccacct gggcaagtcg     1200

ggcagtgggc gtgtgcggca gacacaggta aacgaggcca ccaaggccag gatctccaag     1260

acgctgcagc ggaccctgca gaagcagagc gtcgtatatg gcgggaagtc caccatccgc     1320

gaccgctcct cgggcacggc ctccagcgtg gccttcaccc cactccaggg cctggagatt     1380

gtgaacccac aggcggcaga gaagaaggtg gctgaggcca accagaagta tttctccagc     1440

atggctgagt tcctcaaggt caagggcgag aagagtggcc ttatgtccac ctga           1494


<210>  29
<211>  1539
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-D

<400>  29
atgtctctgg cagatgagct cttagctgat ctcgaagagg cagcagaaga ggaggaagga       60

ggaagctatg gggaggaaga agaggagcca gcgatcgagg atgtgcagga ggagacacag      120

ctggatcttt ccggggattc agtcaagacc atcgccaagc tatgggatag taagatgttt      180

gctgagatta tgatgaagat tgaggagtat atcagcaagc aagccaaagc ttcagaagtg      240

atgggaccag tggaggccgc gcctgaatac cgcgtcatcg tggatgccaa caacctgacc      300

gtggagatcg aaaacgagct gaacatcatc cataagttca tccgggataa gtactcaaag      360

agattccctg aactggagtc cttggtcccc aatgcactgg attacatccg cacggtcaag      420

gagctgggca acagcctgga caagtgcaag aacaatgaga acctgcagca gatcctcacc      480

aatgccacca tcatggtcgt cagcgtcacc gcctccacca cccaggggca gcagctgtcg      540

gaggaggagc tggagcggct ggaggaggcc tgcgacatgg cgctggagct gaacgcctcc      600

aagcaccgca tctacgagta tgtggagtcc cggatgtcct tcatcgcacc caacctgtcc      660

atcattatcg gggcatccac ggccgccaag atcatgggtg tggccggcgg cctgaccaac      720

ctctccaaga tgcccgcctg caacatcatg ctgctcgggg cccagcgcaa gacgctgtcg      780

ggcttctcgt ctacctcagt gctgccccac accggctaca tctaccacag tgacatcgtg      840

cagtccctgc caccggatct gcggcggaaa gcggcccggc tggtggccgc caagtgcaca      900

ctggcagccc gtgtggacag tttccacgag agcacagaag ggaaggtggg ctacgaactg      960

aaggatgaga tcgagcgcaa attcgacaag tggcaggagc cgccgcctgt gaagcaggtg     1020

aagccgctgc ctgcgcccct ggatggacag tatggcgact acagcaacca gcagtccggc     1080

tacggcaagg tgtctagacg gggcggacac cagaacagct acaagcctta ccggctgggg     1140

ctgacggaga tccggaagca ggccaaccgt atgagcttcg gagagatcga ggaggacgcc     1200

taccaggagg acctgggatt cagcctgggc cacctgggca agtcgggcag tgggcgtgtg     1260

cggcagacac aggtaaacga ggccaccaag gccaggatct ccaagacgct gcagcggacc     1320

ctgcagaagc agagcgtcgt atatggcggg aagtccacca tccgcgaccg ctcctcgggc     1380

acggcctcca gcgtggcctt caccccactc cagggcctgg agattgtgaa cccacaggcg     1440

gcagagaaga aggtggctga ggccaaccag aagtatttct ccagcatggc tgagttcctc     1500

aaggtcaagg gcgagaagag tggccttatg tccacctga                            1539


<210>  30
<211>  1536
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-M

<400>  30
atgtctctgg cagatgagct cttagctgat ctcgaagagg cagcagaaga ggaggaagga       60

ggaagctatg gggaggaaga agaggagcca gcgatcgagg atgtgcagga ggagacacag      120

ctggatcttt ccggggattc agtcaagacc atcgccaagc tatgggatag taagatgttt      180

gctgagatta tgatgaagat tgaggagtat atcagcaagc aagccaaagc ttcagaagtg      240

atgggaccag tggaggccgc gcctgaatac cgcgtcatcg tggatgccaa caacctgacc      300

gtggagatcg aaaacgagct gaacatcatc cataagttca tccgggataa gtactcaaag      360

agattccctg aactggagtc cttggtcccc aatgcactgg attacatccg cacggtcaag      420

gagctgggca acagcctgga caagtgcaag aacaatgaga acctgcagca gatcctcacc      480

aatgccacca tcatggtcgt cagcgtcacc gcctccacca cccaggggca gcagctgtcg      540

gaggaggagc tggagcggct ggaggaggcc tgcgacatgg cgctggagct gaacgcctcc      600

aagcaccgca tctacgagta tgtggagtcc cggatgtcct tcatcgcacc caacctgtcc      660

atcattatcg gggcatccac ggccgccaag atcatgggtg tggccggcgg cctgaccaac      720

ctctccaaga tgcccgcctg caacatcatg ctgctcgggg cccagcgcaa gacgctgtcg      780

ggcttctcgt ctacctcagt gctgccccac accggctaca tctaccacag tgacatcgtg      840

cagtccctgc caccggatct gcggcggaaa gcggcccggc tggtggccgc caagtgcaca      900

ctggcagccc gtgtggacag tttccacgag agcacagaag ggaaggtggg ctacgaactg      960

aaggatgaga tcgagcgcaa attcgacaag tggcaggagc cgccgcctgt gaagcaggtg     1020

aagccgctgc ctgcgcccct ggatggacag ggcgaaggcg agagacctgc ccagaacgag     1080

aaaagaaagg aaaacatcaa gcggggcgga aatagattcg agccctaccg gctggggctg     1140

acggagatcc ggaagcaggc caaccgtatg agcttcggag agatcgagga ggacgcctac     1200

caggaggacc tgggattcag cctgggccac ctgggcaagt cgggcagtgg gcgtgtgcgg     1260

cagacacagg taaacgaggc caccaaggcc aggatctcca agacgctgca gcggaccctg     1320

cagaagcaga gcgtcgtata tggcgggaag tccaccatcc gcgaccgctc ctcgggcacg     1380

gcctccagcg tggccttcac cccactccag ggcctggaga ttgtgaaccc acaggcggca     1440

gagaagaagg tggctgaggc caaccagaag tatttctcca gcatggctga gttcctcaag     1500

gtcaagggcg agaagagtgg ccttatgtcc acctga                               1536


<210>  31
<211>  1506
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-NP

<400>  31
atgtctctgg cagatgagct cttagctgat ctcgaagagg cagcagaaga ggaggaagga       60

ggaagctatg gggaggaaga agaggagcca gcgatcgagg atgtgcagga ggagacacag      120

ctggatcttt ccggggattc agtcaagacc atcgccaagc tatgggatag taagatgttt      180

gctgagatta tgatgaagat tgaggagtat atcagcaagc aagccaaagc ttcagaagtg      240

atgggaccag tggaggccgc gcctgaatac cgcgtcatcg tggatgccaa caacctgacc      300

gtggagatcg aaaacgagct gaacatcatc cataagttca tccgggataa gtactcaaag      360

agattccctg aactggagtc cttggtcccc aatgcactgg attacatccg cacggtcaag      420

gagctgggca acagcctgga caagtgcaag aacaatgaga acctgcagca gatcctcacc      480

aatgccacca tcatggtcgt cagcgtcacc gcctccacca cccaggggca gcagctgtcg      540

gaggaggagc tggagcggct ggaggaggcc tgcgacatgg cgctggagct gaacgcctcc      600

aagcaccgca tctacgagta tgtggagtcc cggatgtcct tcatcgcacc caacctgtcc      660

atcattatcg gggcatccac ggccgccaag atcatgggtg tggccggcgg cctgaccaac      720

ctctccaaga tgcccgcctg caacatcatg ctgctcgggg cccagcgcaa gacgctgtcg      780

ggcttctcgt ctacctcagt gctgccccac accggctaca tctaccacag tgacatcgtg      840

cagtccctgc caccggatct gcggcggaaa gcggcccggc tggtggccgc caagtgcaca      900

ctggcagccc gtgtggacag tttccacgag agcacagaag ggaaggtggg ctacgaactg      960

aaggatgaga tcgagcgcaa attcgacaag tggcaggagc cgccgcctgt gaagcaggtg     1020

aagccgctgc ctgcgcccct ggatggacag aaaagacctg ctgccaccaa gaaggccggc     1080

caggccaaga agaaaaagcg gctggggctg acggagatcc ggaagcaggc caaccgtatg     1140

agcttcggag agatcgagga ggacgcctac caggaggacc tgggattcag cctgggccac     1200

ctgggcaagt cgggcagtgg gcgtgtgcgg cagacacagg taaacgaggc caccaaggcc     1260

aggatctcca agacgctgca gcggaccctg cagaagcaga gcgtcgtata tggcgggaag     1320

tccaccatcc gcgaccgctc ctcgggcacg gcctccagcg tggccttcac cccactccag     1380

ggcctggaga ttgtgaaccc acaggcggca gagaagaagg tggctgaggc caaccagaag     1440

tatttctcca gcatggctga gttcctcaag gtcaagggcg agaagagtgg ccttatgtcc     1500

acctga                                                                1506


<210>  32
<211>  1515
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31-SRY

<400>  32
atgtctctgg cagatgagct cttagctgat ctcgaagagg cagcagaaga ggaggaagga       60

ggaagctatg gggaggaaga agaggagcca gcgatcgagg atgtgcagga ggagacacag      120

ctggatcttt ccggggattc agtcaagacc atcgccaagc tatgggatag taagatgttt      180

gctgagatta tgatgaagat tgaggagtat atcagcaagc aagccaaagc ttcagaagtg      240

atgggaccag tggaggccgc gcctgaatac cgcgtcatcg tggatgccaa caacctgacc      300

gtggagatcg aaaacgagct gaacatcatc cataagttca tccgggataa gtactcaaag      360

agattccctg aactggagtc cttggtcccc aatgcactgg attacatccg cacggtcaag      420

gagctgggca acagcctgga caagtgcaag aacaatgaga acctgcagca gatcctcacc      480

aatgccacca tcatggtcgt cagcgtcacc gcctccacca cccaggggca gcagctgtcg      540

gaggaggagc tggagcggct ggaggaggcc tgcgacatgg cgctggagct gaacgcctcc      600

aagcaccgca tctacgagta tgtggagtcc cggatgtcct tcatcgcacc caacctgtcc      660

atcattatcg gggcatccac ggccgccaag atcatgggtg tggccggcgg cctgaccaac      720

ctctccaaga tgcccgcctg caacatcatg ctgctcgggg cccagcgcaa gacgctgtcg      780

ggcttctcgt ctacctcagt gctgccccac accggctaca tctaccacag tgacatcgtg      840

cagtccctgc caccggatct gcggcggaaa gcggcccggc tggtggccgc caagtgcaca      900

ctggcagccc gtgtggacag tttccacgag agcacagaag ggaaggtggg ctacgaactg      960

aaggatgaga tcgagcgcaa attcgacaag tggcaggagc cgccgcctgt gaagcaggtg     1020

aagccgctgc ctgcgcccct ggatggacag cacagagaga aataccccaa ctacaagtac     1080

cggcctagaa gaaaggccaa gatgctgcgg ctggggctga cggagatccg gaagcaggcc     1140

aaccgtatga gcttcggaga gatcgaggag gacgcctacc aggaggacct gggattcagc     1200

ctgggccacc tgggcaagtc gggcagtggg cgtgtgcggc agacacaggt aaacgaggcc     1260

accaaggcca ggatctccaa gacgctgcag cggaccctgc agaagcagag cgtcgtatat     1320

ggcgggaagt ccaccatccg cgaccgctcc tcgggcacgg cctccagcgt ggccttcacc     1380

ccactccagg gcctggagat tgtgaaccca caggcggcag agaagaaggt ggctgaggcc     1440

aaccagaagt atttctccag catggctgag ttcctcaagg tcaagggcga gaagagtggc     1500

cttatgtcca cctga                                                      1515


<210>  33
<211>  653
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  RHO minigene报告基因

<400>  33
gctgttccca agtccctcac aggcagggtc tccctacctg cctgtcctca ggtacatccc       60

cgagggcctg cagtgctcgt gtggaatcga ctactacacg ctcaagccgg aggtcaacaa      120

cgagtctttt gtcatctaca tgttcgtggt ccacttcacc atccccatga ttatcatctt      180

tttctgctat gggcagctcg tcttcaccgt caaggaggta cgggccgggg ggtgggcggc      240

ctcacggctc tgagggtcca gcccccagca tgcatctgcg gctcctgctc cctggaggag      300

ccatggtctg gacccgggtc ccgtgtcctg caggccgctg cccagcagca ggagtcagcc      360

accacacaga aggcagagaa ggaggtcacc cgcatggtca tcatcatggt catcgctttc      420

ctgatctgct gggtgcccta cgccagcgtg gcattctaca tcttcaccca ccagggctcc      480

aacttcggtc ccatcttcat gaccatccca gcgttctttg ccaagagcgc cgccatctac      540

aaccctgtca tctatatcat gatgaacaag caggtgccta ctgcgggtgg gagggcccca      600

gtgccccagg ccacaggcgc tgcctgccaa ggacaagcta cttcccaggg cag             653


<210>  34
<211>  711
<212>  DNA
<213>  人工序列（Artificial Sequence）

<220>
<223>  编码Mcherry的核酸分子

<400>  34
atggtgagca agggcgagga ggataacatg gccatcatca aggagttcat gcgcttcaag       60

gtgcacatgg agggctccgt gaacggccac gagttcgaga tcgagggcga gggcgagggc      120

cgcccctacg agggcaccca gaccgccaag ctgaaggtga ccaagggtgg ccccctgccc      180

ttcgcctggg acatcctgtc ccctcagttc atgtacggct ccaaggccta cgtgaagcac      240

cccgccgaca tccccgacta cttgaagctg tccttccccg agggcttcaa gtgggagcgc      300

gtgatgaact tcgaggacgg cggcgtggtg accgtgaccc aggactcctc cctgcaggac      360

ggcgagttca tctacaaggt gaagctgcgc ggcaccaact tcccctccga cggccccgta      420

atgcagaaga agaccatggg ctgggaggcc tcctccgagc ggatgtaccc cgaggacggc      480

gccctgaagg gcgagatcaa gcagaggctg aagctgaagg acggcggcca ctacgacgct      540

gaggtcaaga ccacctacaa ggccaagaag cccgtgcagc tgcccggcgc ctacaacgtc      600

aacatcaagt tggacatcac ctcccacaac gaggactaca ccatcgtgga acagtacgaa      660

cgcgccgagg gccgccactc caccggcggc atggacgagc tgtacaagta g               711


<210>  35
<211>  52
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31的NOSIC结构域

<400>  35

Ile Val Asp Ala Asn Asn Leu Thr Val Glu Ile Glu Asn Glu Leu Asn 
1               5                   10                  15      


Ile Ile His Lys Phe Ile Arg Asp Lys Tyr Ser Lys Arg Phe Pro Glu 
            20                  25                  30          


Leu Glu Ser Leu Val Pro Asn Ala Leu Asp Tyr Ile Arg Thr Val Lys 
        35                  40                  45              


Glu Leu Gly Asn 
    50          


<210>  36
<211>  145
<212>  PRT
<213>  人工序列（Artificial Sequence）

<220>
<223>  PRPF31的NOP结构域

<400>  36

Ala Cys Asp Met Ala Leu Glu Leu Asn Ala Ser Lys His Arg Ile Tyr 
1               5                   10                  15      


Glu Tyr Val Glu Ser Arg Met Ser Phe Ile Ala Pro Asn Leu Ser Ile 
            20                  25                  30          


Ile Ile Gly Ala Ser Thr Ala Ala Lys Ile Met Gly Val Ala Gly Gly 
        35                  40                  45              


Leu Thr Asn Leu Ser Lys Met Pro Ala Cys Asn Ile Met Leu Leu Gly 
    50                  55                  60                  


Ala Gln Arg Lys Thr Leu Ser Gly Phe Ser Ser Thr Ser Val Leu Pro 
65                  70                  75                  80  


His Thr Gly Tyr Ile Tyr His Ser Asp Ile Val Gln Ser Leu Pro Pro 
                85                  90                  95      


Asp Leu Arg Arg Lys Ala Ala Arg Leu Val Ala Ala Lys Cys Thr Leu 
            100                 105                 110         


Ala Ala Arg Val Asp Ser Phe His Glu Ser Thr Glu Gly Lys Val Gly 
        115                 120                 125             


Tyr Glu Leu Lys Asp Glu Ile Glu Arg Lys Phe Asp Lys Trp Gln Glu 
    130                 135                 140                 


Pro 
145 
