                         SEQUENCE LISTING

<110>  Regeneron Pharmaceuticals, Inc.
 
<120>  METHODS AND COMPOSITIONS FOR ASSESSING CRISPR/CAS-INDUCED 
       RECOMBINATION WITH AN EXOGENOUS DONOR NUCLEIC ACID IN VIVO

<130>  57766-516566

<150>  US 62/539,285
<151>  2017-07-31

<160>  24    

<170>  PatentIn version 3.5

<210>  1
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  1
aacggttatg cgggtgcgct                                                   20


<210>  2
<211>  128
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  2
tgtgccgaaa tggtccatca aaaaatggct ttcgctacct ggagagacgc gcccgctgat       60

tctttgcgaa tacgcccacg cgatgggtaa cagtcttggc ggtttcgcta aatactggca      120

ggcgtttc                                                               128


<210>  3
<211>  128
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  3
gaaacgcctg ccagtattta gcgaaaccgc caagactgtt acccatcgcg tgggcgtatt       60

cgcaaagaat cagcgggcgc gtctctccag gtagcgaaag ccattttttg atggaccatt      120

tcggcaca                                                               128


<210>  4
<211>  18
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  4

Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro 
1               5                   10                  15      


Gly Pro 
        


<210>  5
<211>  19
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  5

Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn 
1               5                   10                  15      


Pro Gly Pro 
            


<210>  6
<211>  20
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  6

Gln Cys Thr Asn Tyr Ala Leu Leu Lys Leu Ala Gly Asp Val Glu Ser 
1               5                   10                  15      


Asn Pro Gly Pro 
            20  


<210>  7
<211>  22
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  7

Val Lys Gln Thr Leu Asn Phe Asp Leu Leu Lys Leu Ala Gly Asp Val 
1               5                   10                  15      


Glu Ser Asn Pro Gly Pro 
            20          


<210>  8
<211>  82
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  8
guuggaacca uucaaaacag cauagcaagu uaaaauaagg cuaguccguu aucaacuuga       60

aaaaguggca ccgagucggu gc                                                82


<210>  9
<211>  76
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  9
guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc cguuaucaac uugaaaaagu       60

ggcaccgagu cggugc                                                       76


<210>  10
<211>  86
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  10
guuuaagagc uaugcuggaa acagcauagc aaguuuaaau aaggcuaguc cguuaucaac       60

uugaaaaagu ggcaccgagu cggugc                                            86


<210>  11
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<222>  (2)..(21)
<223>  n = A, T, C, or G

<400>  11
gnnnnnnnnn nnnnnnnnnn ngg                                               23


<210>  12
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<222>  (1)..(21)
<223>  n = A, T, C, or G

<400>  12
nnnnnnnnnn nnnnnnnnnn ngg                                               23


<210>  13
<211>  25
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<222>  (3)..(23)
<223>  n = A, T, C, or G

<400>  13
ggnnnnnnnn nnnnnnnnnn nnngg                                             25


<210>  14
<211>  20
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  14
uugccaauac gcccacgcga                                                   20


<210>  15
<211>  1023
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  15

Met Gly Thr Asp Leu Asn Asp Pro Val Val Leu Gln Arg Arg Asp Trp 
1               5                   10                  15      


Glu Asn Pro Gly Val Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro 
            20                  25                  30          


Phe Ala Ser Trp Arg Asn Ser Glu Glu Ala Arg Thr Asp Arg Pro Ser 
        35                  40                  45              


Gln Gln Leu Arg Ser Leu Asn Gly Glu Trp Arg Phe Ala Trp Phe Pro 
    50                  55                  60                  


Ala Pro Glu Ala Val Pro Glu Ser Trp Leu Glu Cys Asp Leu Pro Glu 
65                  70                  75                  80  


Ala Asp Thr Val Val Val Pro Ser Asn Trp Gln Met His Gly Tyr Asp 
                85                  90                  95      


Ala Pro Ile Tyr Thr Asn Val Thr Tyr Pro Ile Thr Val Asn Pro Pro 
            100                 105                 110         


Phe Val Pro Thr Glu Asn Pro Thr Gly Cys Tyr Ser Leu Thr Phe Asn 
        115                 120                 125             


Val Asp Glu Ser Trp Leu Gln Glu Gly Gln Thr Arg Ile Ile Phe Asp 
    130                 135                 140                 


Gly Val Asn Ser Ala Phe His Leu Trp Cys Asn Gly Arg Trp Val Gly 
145                 150                 155                 160 


Tyr Gly Gln Asp Ser Arg Leu Pro Ser Glu Phe Asp Leu Ser Ala Phe 
                165                 170                 175     


Leu Arg Ala Gly Glu Asn Arg Leu Ala Val Met Val Leu Arg Trp Ser 
            180                 185                 190         


Asp Gly Ser Tyr Leu Glu Asp Gln Asp Met Trp Arg Met Ser Gly Ile 
        195                 200                 205             


Phe Arg Asp Val Ser Leu Leu His Lys Pro Thr Thr Gln Ile Ser Asp 
    210                 215                 220                 


Phe His Val Ala Thr Arg Phe Asn Asp Asp Phe Ser Arg Ala Val Leu 
225                 230                 235                 240 


Glu Ala Glu Val Gln Met Cys Gly Glu Leu Arg Asp Tyr Leu Arg Val 
                245                 250                 255     


Thr Val Ser Leu Trp Gln Gly Glu Thr Gln Val Ala Ser Gly Thr Ala 
            260                 265                 270         


Pro Phe Gly Gly Glu Ile Ile Asp Glu Arg Gly Gly Tyr Ala Asp Arg 
        275                 280                 285             


Val Thr Leu Arg Leu Asn Val Glu Asn Pro Lys Leu Trp Ser Ala Glu 
    290                 295                 300                 


Ile Pro Asn Leu Tyr Arg Ala Val Val Glu Leu His Thr Ala Asp Gly 
305                 310                 315                 320 


Thr Leu Ile Glu Ala Glu Ala Cys Asp Val Gly Phe Arg Glu Val Arg 
                325                 330                 335     


Ile Glu Asn Gly Leu Leu Leu Leu Asn Gly Lys Pro Leu Leu Ile Arg 
            340                 345                 350         


Gly Val Asn Arg His Glu His His Pro Leu His Gly Gln Val Met Asp 
        355                 360                 365             


Glu Gln Thr Met Val Gln Asp Ile Leu Leu Met Lys Gln Asn Asn Phe 
    370                 375                 380                 


Asn Ala Val Arg Cys Ser His Tyr Pro Asn His Pro Leu Trp Tyr Thr 
385                 390                 395                 400 


Leu Cys Asp Arg Tyr Gly Leu Tyr Val Val Asp Glu Ala Asn Ile Glu 
                405                 410                 415     


Thr His Gly Met Val Pro Met Asn Arg Leu Thr Asp Asp Pro Arg Trp 
            420                 425                 430         


Leu Pro Ala Met Ser Glu Arg Val Thr Arg Met Val Gln Arg Asp Arg 
        435                 440                 445             


Asn His Pro Ser Val Ile Ile Trp Ser Leu Gly Asn Glu Ser Gly His 
    450                 455                 460                 


Gly Ala Asn His Asp Ala Leu Tyr Arg Trp Ile Lys Ser Val Asp Pro 
465                 470                 475                 480 


Ser Arg Pro Val Gln Tyr Glu Gly Gly Gly Ala Asp Thr Thr Ala Thr 
                485                 490                 495     


Asp Ile Ile Cys Pro Met Tyr Ala Arg Val Asp Glu Asp Gln Pro Phe 
            500                 505                 510         


Pro Ala Val Pro Lys Trp Ser Ile Lys Lys Trp Leu Ser Leu Pro Gly 
        515                 520                 525             


Glu Thr Arg Pro Leu Ile Leu Cys Gln Tyr Ala His Ala Met Gly Asn 
    530                 535                 540                 


Ser Leu Gly Gly Phe Ala Lys Tyr Trp Gln Ala Phe Arg Gln Tyr Pro 
545                 550                 555                 560 


Arg Leu Gln Gly Gly Phe Val Trp Asp Trp Val Asp Gln Ser Leu Ile 
                565                 570                 575     


Lys Tyr Asp Glu Asn Gly Asn Pro Trp Ser Ala Tyr Gly Gly Asp Phe 
            580                 585                 590         


Gly Asp Thr Pro Asn Asp Arg Gln Phe Cys Met Asn Gly Leu Val Phe 
        595                 600                 605             


Ala Asp Arg Thr Pro His Pro Ala Leu Thr Glu Ala Lys His Gln Gln 
    610                 615                 620                 


Gln Phe Phe Gln Phe Arg Leu Ser Gly Gln Thr Ile Glu Val Thr Ser 
625                 630                 635                 640 


Glu Tyr Leu Phe Arg His Ser Asp Asn Glu Leu Leu His Trp Met Val 
                645                 650                 655     


Ala Leu Asp Gly Lys Pro Leu Ala Ser Gly Glu Val Pro Leu Asp Val 
            660                 665                 670         


Ala Pro Gln Gly Lys Gln Leu Ile Glu Leu Pro Glu Leu Pro Gln Pro 
        675                 680                 685             


Glu Ser Ala Gly Gln Leu Trp Leu Thr Val Arg Val Val Gln Pro Asn 
    690                 695                 700                 


Ala Thr Ala Trp Ser Glu Ala Gly His Ile Ser Ala Trp Gln Gln Trp 
705                 710                 715                 720 


Arg Leu Ala Glu Asn Leu Ser Val Thr Leu Pro Ala Ala Ser His Ala 
                725                 730                 735     


Ile Pro His Leu Thr Thr Ser Glu Met Asp Phe Cys Ile Glu Leu Gly 
            740                 745                 750         


Asn Lys Arg Trp Gln Phe Asn Arg Gln Ser Gly Phe Leu Ser Gln Met 
        755                 760                 765             


Trp Ile Gly Asp Lys Lys Gln Leu Leu Thr Pro Leu Arg Asp Gln Phe 
    770                 775                 780                 


Thr Arg Ala Pro Leu Asp Asn Asp Ile Gly Val Ser Glu Ala Thr Arg 
785                 790                 795                 800 


Ile Asp Pro Asn Ala Trp Val Glu Arg Trp Lys Ala Ala Gly His Tyr 
                805                 810                 815     


Gln Ala Glu Ala Ala Leu Leu Gln Cys Thr Ala Asp Thr Leu Ala Asp 
            820                 825                 830         


Ala Val Leu Ile Thr Thr Ala His Ala Trp Gln His Gln Gly Lys Thr 
        835                 840                 845             


Leu Phe Ile Ser Arg Lys Thr Tyr Arg Ile Asp Gly Ser Gly Gln Met 
    850                 855                 860                 


Ala Ile Thr Val Asp Val Glu Val Ala Ser Asp Thr Pro His Pro Ala 
865                 870                 875                 880 


Arg Ile Gly Leu Asn Cys Gln Leu Ala Gln Val Ala Glu Arg Val Asn 
                885                 890                 895     


Trp Leu Gly Leu Gly Pro Gln Glu Asn Tyr Pro Asp Arg Leu Thr Ala 
            900                 905                 910         


Ala Cys Phe Asp Arg Trp Asp Leu Pro Leu Ser Asp Met Tyr Thr Pro 
        915                 920                 925             


Tyr Val Phe Pro Ser Glu Asn Gly Leu Arg Cys Gly Thr Arg Glu Leu 
    930                 935                 940                 


Asn Tyr Gly Pro His Gln Trp Arg Gly Asp Phe Gln Phe Asn Ile Ser 
945                 950                 955                 960 


Arg Tyr Ser Gln Gln Gln Leu Met Glu Thr Ser His Arg His Leu Leu 
                965                 970                 975     


His Ala Glu Glu Gly Thr Trp Leu Asn Ile Asp Gly Phe His Met Gly 
            980                 985                 990         


Ile Gly Gly Asp Asp Ser Trp Ser  Pro Ser Val Ser Ala  Glu Phe Gln 
        995                 1000                 1005             


Leu Ser  Ala Gly Arg Tyr His  Tyr Gln Leu Val Trp  Cys Gln Lys 
    1010                 1015                 1020             


<210>  16
<211>  1023
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  16

Met Gly Thr Asp Leu Asn Asp Pro Val Val Leu Gln Arg Arg Asp Trp 
1               5                   10                  15      


Glu Asn Pro Gly Val Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro 
            20                  25                  30          


Phe Ala Ser Trp Arg Asn Ser Glu Glu Ala Arg Thr Asp Arg Pro Ser 
        35                  40                  45              


Gln Gln Leu Arg Ser Leu Asn Gly Glu Trp Arg Phe Ala Trp Phe Pro 
    50                  55                  60                  


Ala Pro Glu Ala Val Pro Glu Ser Trp Leu Glu Cys Asp Leu Pro Glu 
65                  70                  75                  80  


Ala Asp Thr Val Val Val Pro Ser Asn Trp Gln Met His Gly Tyr Asp 
                85                  90                  95      


Ala Pro Ile Tyr Thr Asn Val Thr Tyr Pro Ile Thr Val Asn Pro Pro 
            100                 105                 110         


Phe Val Pro Thr Glu Asn Pro Thr Gly Cys Tyr Ser Leu Thr Phe Asn 
        115                 120                 125             


Val Asp Glu Ser Trp Leu Gln Glu Gly Gln Thr Arg Ile Ile Phe Asp 
    130                 135                 140                 


Gly Val Asn Ser Ala Phe His Leu Trp Cys Asn Gly Arg Trp Val Gly 
145                 150                 155                 160 


Tyr Gly Gln Asp Ser Arg Leu Pro Ser Glu Phe Asp Leu Ser Ala Phe 
                165                 170                 175     


Leu Arg Ala Gly Glu Asn Arg Leu Ala Val Met Val Leu Arg Trp Ser 
            180                 185                 190         


Asp Gly Ser Tyr Leu Glu Asp Gln Asp Met Trp Arg Met Ser Gly Ile 
        195                 200                 205             


Phe Arg Asp Val Ser Leu Leu His Lys Pro Thr Thr Gln Ile Ser Asp 
    210                 215                 220                 


Phe His Val Ala Thr Arg Phe Asn Asp Asp Phe Ser Arg Ala Val Leu 
225                 230                 235                 240 


Glu Ala Glu Val Gln Met Cys Gly Glu Leu Arg Asp Tyr Leu Arg Val 
                245                 250                 255     


Thr Val Ser Leu Trp Gln Gly Glu Thr Gln Val Ala Ser Gly Thr Ala 
            260                 265                 270         


Pro Phe Gly Gly Glu Ile Ile Asp Glu Arg Gly Gly Tyr Ala Asp Arg 
        275                 280                 285             


Val Thr Leu Arg Leu Asn Val Glu Asn Pro Lys Leu Trp Ser Ala Glu 
    290                 295                 300                 


Ile Pro Asn Leu Tyr Arg Ala Val Val Glu Leu His Thr Ala Asp Gly 
305                 310                 315                 320 


Thr Leu Ile Glu Ala Glu Ala Cys Asp Val Gly Phe Arg Glu Val Arg 
                325                 330                 335     


Ile Glu Asn Gly Leu Leu Leu Leu Asn Gly Lys Pro Leu Leu Ile Arg 
            340                 345                 350         


Gly Val Asn Arg His Glu His His Pro Leu His Gly Gln Val Met Asp 
        355                 360                 365             


Glu Gln Thr Met Val Gln Asp Ile Leu Leu Met Lys Gln Asn Asn Phe 
    370                 375                 380                 


Asn Ala Val Arg Cys Ser His Tyr Pro Asn His Pro Leu Trp Tyr Thr 
385                 390                 395                 400 


Leu Cys Asp Arg Tyr Gly Leu Tyr Val Val Asp Glu Ala Asn Ile Glu 
                405                 410                 415     


Thr His Gly Met Val Pro Met Asn Arg Leu Thr Asp Asp Pro Arg Trp 
            420                 425                 430         


Leu Pro Ala Met Ser Glu Arg Val Thr Arg Met Val Glu Arg Asp Arg 
        435                 440                 445             


Asn His Pro Ser Val Ile Ile Trp Ser Leu Gly Asn Glu Ser Gly His 
    450                 455                 460                 


Gly Ala Asn His Asp Ala Leu Tyr Arg Trp Ile Lys Ser Val Asp Pro 
465                 470                 475                 480 


Ser Arg Pro Val Gln Tyr Glu Gly Gly Gly Ala Asp Thr Thr Ala Thr 
                485                 490                 495     


Asp Ile Ile Cys Pro Met Tyr Ala Arg Val Asp Glu Asp Gln Pro Phe 
            500                 505                 510         


Pro Ala Val Pro Lys Trp Ser Ile Lys Lys Trp Leu Ser Leu Pro Gly 
        515                 520                 525             


Glu Thr Arg Pro Leu Ile Leu Cys Gln Tyr Ala His Ala Met Gly Asn 
    530                 535                 540                 


Ser Leu Gly Gly Phe Ala Lys Tyr Trp Gln Ala Phe Arg Gln Tyr Pro 
545                 550                 555                 560 


Arg Leu Gln Gly Gly Phe Val Trp Asp Trp Val Asp Gln Ser Leu Ile 
                565                 570                 575     


Lys Tyr Asp Glu Asn Gly Asn Pro Trp Ser Ala Tyr Gly Gly Asp Phe 
            580                 585                 590         


Gly Asp Thr Pro Asn Asp Arg Gln Phe Cys Met Asn Gly Leu Val Phe 
        595                 600                 605             


Ala Asp Arg Thr Pro His Pro Ala Leu Thr Glu Ala Lys His Gln Gln 
    610                 615                 620                 


Gln Phe Phe Gln Phe Arg Leu Ser Gly Gln Thr Ile Glu Val Thr Ser 
625                 630                 635                 640 


Glu Tyr Leu Phe Arg His Ser Asp Asn Glu Leu Leu His Trp Met Val 
                645                 650                 655     


Ala Leu Asp Gly Lys Pro Leu Ala Ser Gly Glu Val Pro Leu Asp Val 
            660                 665                 670         


Ala Pro Gln Gly Lys Gln Leu Ile Glu Leu Pro Glu Leu Pro Gln Pro 
        675                 680                 685             


Glu Ser Ala Gly Gln Leu Trp Leu Thr Val Arg Val Val Gln Pro Asn 
    690                 695                 700                 


Ala Thr Ala Trp Ser Glu Ala Gly His Ile Ser Ala Trp Gln Gln Trp 
705                 710                 715                 720 


Arg Leu Ala Glu Asn Leu Ser Val Thr Leu Pro Ala Ala Ser His Ala 
                725                 730                 735     


Ile Pro His Leu Thr Thr Ser Glu Met Asp Phe Cys Ile Glu Leu Gly 
            740                 745                 750         


Asn Lys Arg Trp Gln Phe Asn Arg Gln Ser Gly Phe Leu Ser Gln Met 
        755                 760                 765             


Trp Ile Gly Asp Lys Lys Gln Leu Leu Thr Pro Leu Arg Asp Gln Phe 
    770                 775                 780                 


Thr Arg Ala Pro Leu Asp Asn Asp Ile Gly Val Ser Glu Ala Thr Arg 
785                 790                 795                 800 


Ile Asp Pro Asn Ala Trp Val Glu Arg Trp Lys Ala Ala Gly His Tyr 
                805                 810                 815     


Gln Ala Glu Ala Ala Leu Leu Gln Cys Thr Ala Asp Thr Leu Ala Asp 
            820                 825                 830         


Ala Val Leu Ile Thr Thr Ala His Ala Trp Gln His Gln Gly Lys Thr 
        835                 840                 845             


Leu Phe Ile Ser Arg Lys Thr Tyr Arg Ile Asp Gly Ser Gly Gln Met 
    850                 855                 860                 


Ala Ile Thr Val Asp Val Glu Val Ala Ser Asp Thr Pro His Pro Ala 
865                 870                 875                 880 


Arg Ile Gly Leu Asn Cys Gln Leu Ala Gln Val Ala Glu Arg Val Asn 
                885                 890                 895     


Trp Leu Gly Leu Gly Pro Gln Glu Asn Tyr Pro Asp Arg Leu Thr Ala 
            900                 905                 910         


Ala Cys Phe Asp Arg Trp Asp Leu Pro Leu Ser Asp Met Tyr Thr Pro 
        915                 920                 925             


Tyr Val Phe Pro Ser Glu Asn Gly Leu Arg Cys Gly Thr Arg Glu Leu 
    930                 935                 940                 


Asn Tyr Gly Pro His Gln Trp Arg Gly Asp Phe Gln Phe Asn Ile Ser 
945                 950                 955                 960 


Arg Tyr Ser Gln Gln Gln Leu Met Glu Thr Ser His Arg His Leu Leu 
                965                 970                 975     


His Ala Glu Glu Gly Thr Trp Leu Asn Ile Asp Gly Phe His Met Gly 
            980                 985                 990         


Ile Gly Gly Asp Asp Ser Trp Ser  Pro Ser Val Ser Ala  Glu Phe Gln 
        995                 1000                 1005             


Leu Ser  Ala Gly Arg Tyr His  Tyr Gln Leu Val Trp  Cys Gln Lys 
    1010                 1015                 1020             


<210>  17
<211>  6409
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic


<220>
<221>  misc_feature
<222>  (178)..(3255)
<223>  LacZ

<220>
<221>  misc_feature
<222>  (1779)..(1801)
<223>  Guide RNA Target Site v2

<220>
<221>  misc_feature
<222>  (1782)..(1801)
<223>  Guide RNA Target Sequence v1

<220>
<221>  misc_feature
<222>  (3286)..(3534)
<223>  Poly(A)

<220>
<221>  misc_feature
<222>  (3611)..(3644)
<223>  LoxP

<220>
<221>  misc_feature
<222>  (3651)..(4863)
<223>  Ubiquitin Promoter

<220>
<221>  misc_feature
<222>  (4864)..(4930)
<223>  EM7 Promoter

<220>
<221>  misc_feature
<222>  (4931)..(5734)
<223>  Neomycin Phosphotransferase

<220>
<221>  misc_feature
<222>  (5735)..(6219)
<223>  SV40 Poly(A)

<220>
<221>  misc_feature
<222>  (6225)..(6258)
<223>  LoxP

<400>  17
agtgttgcaa tacctttctg ggagttctct gctgcctcct ggcttctgag gaccgccctg       60

ggcctgggag aatcccttcc ccctcttccc tcgtgatctg caactccagt ctttctagtt      120

taaactgcta gttccctttt ttttcacagg ttggcgcgcc gaattaattc tgcagacatg      180

ggtaccgatt taaatgatcc agtggtcctg cagaggagag attgggagaa tcccggtgtg      240

acacagctga acagactagc cgcccaccct ccctttgctt cttggagaaa cagtgaggaa      300

gctaggacag acagaccaag ccagcaactc agatctttga acggggagtg gagatttgcc      360

tggtttccgg caccagaagc ggtgccggaa agctggctgg agtgcgatct tcctgaggcc      420

gatactgtcg tcgtcccctc aaactggcag atgcacggtt acgatgcgcc catctacacc      480

aacgtgacct atcccattac ggtcaatccg ccgtttgttc ccacggagaa tccgacgggt      540

tgttactcgc tcacatttaa tgttgatgaa agctggctac aggaaggcca gacgcgaatt      600

atttttgatg gcgttaactc ggcgtttcat ctgtggtgca acgggcgctg ggtcggttac      660

ggccaggaca gtcgtttgcc gtctgaattt gacctgagcg catttttacg cgccggagaa      720

aaccgcctcg cggtgatggt gctgcgctgg agtgacggca gttatctgga agatcaggat      780

atgtggcgga tgagcggcat tttccgtgac gtctcgttgc tgcataaacc gactacacaa      840

atcagcgatt tccatgttgc cactcgcttt aatgatgatt tcagccgcgc tgtactggag      900

gctgaagttc agatgtgcgg cgagttgcgt gactacctac gggtaacagt ttctttatgg      960

cagggtgaaa cgcaggtcgc cagcggcacc gcgcctttcg gcggtgaaat tatcgatgag     1020

cgtggtggtt atgccgatcg cgtcacacta cgtctgaacg tcgaaaaccc gaaactgtgg     1080

agcgccgaaa tcccgaatct ctatcgtgcg gtggttgaac tgcacaccgc cgacggcacg     1140

ctgattgaag cagaagcctg cgatgtcggt ttccgcgagg tgcggattga aaatggtctg     1200

ctgctgctga acggcaagcc gttgctgatt cgaggcgtta accgtcacga gcatcatcct     1260

ctgcatggtc aggtcatgga tgagcagacg atggtgcagg atatcctgct gatgaagcag     1320

aacaacttta acgccgtgcg ctgttcgcat tatccgaacc atccgctgtg gtacacgctg     1380

tgcgaccgct acggcctgta tgtggtggat gaagccaata ttgaaaccca cggcatggtg     1440

ccaatgaatc gtctgaccga tgatccgcgc tggctaccgg cgatgagcga acgcgtaacg     1500

cgaatggtgc agcgcgatcg taatcacccg agtgtgatca tctggtcgct ggggaatgaa     1560

tcaggccacg gcgctaatca cgacgcgctg tatcgctgga tcaaatctgt cgatccttcc     1620

cgcccggtgc agtatgaagg cggcggagcc gacaccacgg ccaccgatat tatttgcccg     1680

atgtacgcgc gcgtggatga agaccagccc ttcccggctg tgccgaaatg gtccatcaaa     1740

aaatggcttt cgctacctgg agagacgcgc ccgctgatcc tttgccaata cgcccacgcg     1800

atgggtaaca gtcttggcgg tttcgctaaa tactggcagg cgtttcgtca gtatccccgt     1860

ttacagggcg gcttcgtctg ggactgggtg gatcagtcgc tgattaaata tgatgaaaac     1920

ggcaacccgt ggtcggctta cggcggtgat tttggcgata cgccgaacga tcgccagttc     1980

tgtatgaacg gtctggtctt tgccgaccgc acgccgcatc cagcgctgac ggaagcaaaa     2040

caccagcagc agtttttcca gttccgttta tccgggcaaa ccatcgaagt gaccagcgaa     2100

tacctgttcc gtcatagcga taacgagctc ctgcactgga tggtggcgct ggatggtaag     2160

ccgctggcaa gcggtgaagt gcctctggat gtcgctccac aaggtaaaca gttgattgaa     2220

ctgcctgaac taccgcagcc ggagagcgcc gggcaactct ggctcacagt acgcgtagtg     2280

caaccgaacg cgaccgcatg gtcagaagcc gggcacatca gcgcctggca gcagtggcgt     2340

ctggcggaaa acctcagtgt gacgctcccc gccgcgtccc acgccatccc gcatctgacc     2400

accagcgaaa tggatttttg catcgagctg ggtaataagc gttggcaatt taaccgccag     2460

tcaggctttc tttcacagat gtggattggc gataaaaaac aactgctgac gccgctgcgc     2520

gatcagttca cccgtgcacc gctggataac gacattggcg taagtgaagc gacccgcatt     2580

gaccctaacg cctgggtcga acgctggaag gcggcgggcc attaccaggc cgaagcagcg     2640

ttgttgcagt gcacggcaga tacacttgct gatgcggtgc tgattacgac cgctcacgcg     2700

tggcagcatc aggggaaaac cttatttatc agccggaaaa cctaccggat tgatggtagt     2760

ggtcaaatgg cgattaccgt tgatgttgaa gtggcgagcg atacaccgca tccggcgcgg     2820

attggcctga actgccagct ggcgcaggta gcagagcggg taaactggct cggattaggg     2880

ccgcaagaaa actatcccga ccgccttact gccgcctgtt ttgaccgctg ggatctgcca     2940

ttgtcagaca tgtatacccc gtacgtcttc ccgagcgaaa acggtctgcg ctgcgggacg     3000

cgcgaattga attatggccc acaccagtgg cgcggcgact tccagttcaa catcagccgc     3060

tacagtcaac agcaactgat ggaaaccagc catcgccatc tgctgcacgc ggaagaaggc     3120

acatggctga atatcgacgg tttccatatg gggattggtg gcgacgactc ctggagcccg     3180

tcagtatcgg cggaattcca gctgagcgcc ggtcgctacc attaccagtt ggtctggtgt     3240

caaaaataat aataaccggg caggggggat ctaagctcta gataagtaat gatcataatc     3300

agccatatca catctgtaga ggttttactt gctttaaaaa acctcccaca cctccccctg     3360

aacctgaaac ataaaatgaa tgcaattgtt gttgttaact tgtttattgc agcttataat     3420

ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgcat     3480

tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctggat cccccggcta     3540

gagtttaaac actagaacta gtggatcccc gggctcgata actataacgg tcctaaggta     3600

gcgactcgag ataacttcgt ataatgtatg ctatacgaag ttatatgcat ggcctccgcg     3660

ccgggttttg gcgcctcccg cgggcgcccc cctcctcacg gcgagcgctg ccacgtcaga     3720

cgaagggcgc agcgagcgtc ctgatccttc cgcccggacg ctcaggacag cggcccgctg     3780

ctcataagac tcggccttag aaccccagta tcagcagaag gacattttag gacgggactt     3840

gggtgactct agggcactgg ttttctttcc agagagcgga acaggcgagg aaaagtagtc     3900

ccttctcggc gattctgcgg agggatctcc gtggggcggt gaacgccgat gattatataa     3960

ggacgcgccg ggtgtggcac agctagttcc gtcgcagccg ggatttgggt cgcggttctt     4020

gtttgtggat cgctgtgatc gtcacttggt gagtagcggg ctgctgggct ggccggggct     4080

ttcgtggccg ccgggccgct cggtgggacg gaagcgtgtg gagagaccgc caagggctgt     4140

agtctgggtc cgcgagcaag gttgccctga actgggggtt ggggggagcg cagcaaaatg     4200

gcggctgttc ccgagtcttg aatggaagac gcttgtgagg cgggctgtga ggtcgttgaa     4260

acaaggtggg gggcatggtg ggcggcaaga acccaaggtc ttgaggcctt cgctaatgcg     4320

ggaaagctct tattcgggtg agatgggctg gggcaccatc tggggaccct gacgtgaagt     4380

ttgtcactga ctggagaact cggtttgtcg tctgttgcgg gggcggcagt tatggcggtg     4440

ccgttgggca gtgcacccgt acctttggga gcgcgcgccc tcgtcgtgtc gtgacgtcac     4500

ccgttctgtt ggcttataat gcagggtggg gccacctgcc ggtaggtgtg cggtaggctt     4560

ttctccgtcg caggacgcag ggttcgggcc tagggtaggc tctcctgaat cgacaggcgc     4620

cggacctctg gtgaggggag ggataagtga ggcgtcagtt tctttggtcg gttttatgta     4680

cctatcttct taagtagctg aagctccggt tttgaactat gcgctcgggg ttggcgagtg     4740

tgttttgtga agttttttag gcaccttttg aaatgtaatc atttgggtca atatgtaatt     4800

ttcagtgtta gactagtaaa ttgtccgcta aattctggcc gtttttggct tttttgttag     4860

acgtgttgac aattaatcat cggcatagta tatcggcata gtataatacg acaaggtgag     4920

gaactaaacc atgggatcgg ccattgaaca agatggattg cacgcaggtt ctccggccgc     4980

ttgggtggag aggctattcg gctatgactg ggcacaacag acaatcggct gctctgatgc     5040

cgccgtgttc cggctgtcag cgcaggggcg cccggttctt tttgtcaaga ccgacctgtc     5100

cggtgccctg aatgaactgc aggacgaggc agcgcggcta tcgtggctgg ccacgacggg     5160

cgttccttgc gcagctgtgc tcgacgttgt cactgaagcg ggaagggact ggctgctatt     5220

gggcgaagtg ccggggcagg atctcctgtc atctcacctt gctcctgccg agaaagtatc     5280

catcatggct gatgcaatgc ggcggctgca tacgcttgat ccggctacct gcccattcga     5340

ccaccaagcg aaacatcgca tcgagcgagc acgtactcgg atggaagccg gtcttgtcga     5400

tcaggatgat ctggacgaag agcatcaggg gctcgcgcca gccgaactgt tcgccaggct     5460

caaggcgcgc atgcccgacg gcgatgatct cgtcgtgacc catggcgatg cctgcttgcc     5520

gaatatcatg gtggaaaatg gccgcttttc tggattcatc gactgtggcc ggctgggtgt     5580

ggcggaccgc tatcaggaca tagcgttggc tacccgtgat attgctgaag agcttggcgg     5640

cgaatgggct gaccgcttcc tcgtgcttta cggtatcgcc gctcccgatt cgcagcgcat     5700

cgccttctat cgccttcttg acgagttctt ctgaggggat ccgctgtaag tctgcagaaa     5760

ttgatgatct attaaacaat aaagatgtcc actaaaatgg aagtttttcc tgtcatactt     5820

tgttaagaag ggtgagaaca gagtacctac attttgaatg gaaggattgg agctacgggg     5880

gtgggggtgg ggtgggatta gataaatgcc tgctctttac tgaaggctct ttactattgc     5940

tttatgataa tgtttcatag ttggatatca taatttaaac aagcaaaacc aaattaaggg     6000

ccagctcatt cctcccactc atgatctata gatctataga tctctcgtgg gatcattgtt     6060

tttctcttga ttcccacttt gtggttctaa gtactgtggt ttccaaatgt gtcagtttca     6120

tagcctgaag aacgagatca gcagcctctg ttccacatac acttcattct cagtattgtt     6180

ttgccaagtt ctaattccat cagacctcga cctgcagccc ctagataact tcgtataatg     6240

tatgctatac gaagttatgc tagctaaaat tggagggaca agacttccca cagattttcg     6300

gttttgtcgg gaagtttttt aataggggca aataaggaaa atgggaggat aggtagtcat     6360

ctggggtttt atgcagcaaa actacaggtt attattgctt gtgatccgc                 6409


<210>  18
<211>  16
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  18
guuuuagagc uaugcu                                                       16


<210>  19
<211>  67
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  19
agcauagcaa guuaaaauaa ggcuaguccg uuaucaacuu gaaaaagugg caccgagucg       60

gugcuuu                                                                 67


<210>  20
<211>  77
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  20
guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc cguuaucaac uugaaaaagu       60

ggcaccgagu cggugcu                                                      77


<210>  21
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  21
ttgccaatac gcccacgcga                                                   20


<210>  22
<211>  1391
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  22

Met Asp Lys Pro Lys Lys Lys Arg Lys Val Lys Tyr Ser Ile Gly Leu 
1               5                   10                  15      


Asp Ile Gly Thr Asn Ser Val Gly Trp Ala Val Ile Thr Asp Glu Tyr 
            20                  25                  30          


Lys Val Pro Ser Lys Lys Phe Lys Val Leu Gly Asn Thr Asp Arg His 
        35                  40                  45              


Ser Ile Lys Lys Asn Leu Ile Gly Ala Leu Leu Phe Asp Ser Gly Glu 
    50                  55                  60                  


Thr Ala Glu Ala Thr Arg Leu Lys Arg Thr Ala Arg Arg Arg Tyr Thr 
65                  70                  75                  80  


Arg Arg Lys Asn Arg Ile Cys Tyr Leu Gln Glu Ile Phe Ser Asn Glu 
                85                  90                  95      


Met Ala Lys Val Asp Asp Ser Phe Phe His Arg Leu Glu Glu Ser Phe 
            100                 105                 110         


Leu Val Glu Glu Asp Lys Lys His Glu Arg His Pro Ile Phe Gly Asn 
        115                 120                 125             


Ile Val Asp Glu Val Ala Tyr His Glu Lys Tyr Pro Thr Ile Tyr His 
    130                 135                 140                 


Leu Arg Lys Lys Leu Val Asp Ser Thr Asp Lys Ala Asp Leu Arg Leu 
145                 150                 155                 160 


Ile Tyr Leu Ala Leu Ala His Met Ile Lys Phe Arg Gly His Phe Leu 
                165                 170                 175     


Ile Glu Gly Asp Leu Asn Pro Asp Asn Ser Asp Val Asp Lys Leu Phe 
            180                 185                 190         


Ile Gln Leu Val Gln Thr Tyr Asn Gln Leu Phe Glu Glu Asn Pro Ile 
        195                 200                 205             


Asn Ala Ser Gly Val Asp Ala Lys Ala Ile Leu Ser Ala Arg Leu Ser 
    210                 215                 220                 


Lys Ser Arg Arg Leu Glu Asn Leu Ile Ala Gln Leu Pro Gly Glu Lys 
225                 230                 235                 240 


Lys Asn Gly Leu Phe Gly Asn Leu Ile Ala Leu Ser Leu Gly Leu Thr 
                245                 250                 255     


Pro Asn Phe Lys Ser Asn Phe Asp Leu Ala Glu Asp Ala Lys Leu Gln 
            260                 265                 270         


Leu Ser Lys Asp Thr Tyr Asp Asp Asp Leu Asp Asn Leu Leu Ala Gln 
        275                 280                 285             


Ile Gly Asp Gln Tyr Ala Asp Leu Phe Leu Ala Ala Lys Asn Leu Ser 
    290                 295                 300                 


Asp Ala Ile Leu Leu Ser Asp Ile Leu Arg Val Asn Thr Glu Ile Thr 
305                 310                 315                 320 


Lys Ala Pro Leu Ser Ala Ser Met Ile Lys Arg Tyr Asp Glu His His 
                325                 330                 335     


Gln Asp Leu Thr Leu Leu Lys Ala Leu Val Arg Gln Gln Leu Pro Glu 
            340                 345                 350         


Lys Tyr Lys Glu Ile Phe Phe Asp Gln Ser Lys Asn Gly Tyr Ala Gly 
        355                 360                 365             


Tyr Ile Asp Gly Gly Ala Ser Gln Glu Glu Phe Tyr Lys Phe Ile Lys 
    370                 375                 380                 


Pro Ile Leu Glu Lys Met Asp Gly Thr Glu Glu Leu Leu Val Lys Leu 
385                 390                 395                 400 


Asn Arg Glu Asp Leu Leu Arg Lys Gln Arg Thr Phe Asp Asn Gly Ser 
                405                 410                 415     


Ile Pro His Gln Ile His Leu Gly Glu Leu His Ala Ile Leu Arg Arg 
            420                 425                 430         


Gln Glu Asp Phe Tyr Pro Phe Leu Lys Asp Asn Arg Glu Lys Ile Glu 
        435                 440                 445             


Lys Ile Leu Thr Phe Arg Ile Pro Tyr Tyr Val Gly Pro Leu Ala Arg 
    450                 455                 460                 


Gly Asn Ser Arg Phe Ala Trp Met Thr Arg Lys Ser Glu Glu Thr Ile 
465                 470                 475                 480 


Thr Pro Trp Asn Phe Glu Glu Val Val Asp Lys Gly Ala Ser Ala Gln 
                485                 490                 495     


Ser Phe Ile Glu Arg Met Thr Asn Phe Asp Lys Asn Leu Pro Asn Glu 
            500                 505                 510         


Lys Val Leu Pro Lys His Ser Leu Leu Tyr Glu Tyr Phe Thr Val Tyr 
        515                 520                 525             


Asn Glu Leu Thr Lys Val Lys Tyr Val Thr Glu Gly Met Arg Lys Pro 
    530                 535                 540                 


Ala Phe Leu Ser Gly Glu Gln Lys Lys Ala Ile Val Asp Leu Leu Phe 
545                 550                 555                 560 


Lys Thr Asn Arg Lys Val Thr Val Lys Gln Leu Lys Glu Asp Tyr Phe 
                565                 570                 575     


Lys Lys Ile Glu Cys Phe Asp Ser Val Glu Ile Ser Gly Val Glu Asp 
            580                 585                 590         


Arg Phe Asn Ala Ser Leu Gly Thr Tyr His Asp Leu Leu Lys Ile Ile 
        595                 600                 605             


Lys Asp Lys Asp Phe Leu Asp Asn Glu Glu Asn Glu Asp Ile Leu Glu 
    610                 615                 620                 


Asp Ile Val Leu Thr Leu Thr Leu Phe Glu Asp Arg Glu Met Ile Glu 
625                 630                 635                 640 


Glu Arg Leu Lys Thr Tyr Ala His Leu Phe Asp Asp Lys Val Met Lys 
                645                 650                 655     


Gln Leu Lys Arg Arg Arg Tyr Thr Gly Trp Gly Arg Leu Ser Arg Lys 
            660                 665                 670         


Leu Ile Asn Gly Ile Arg Asp Lys Gln Ser Gly Lys Thr Ile Leu Asp 
        675                 680                 685             


Phe Leu Lys Ser Asp Gly Phe Ala Asn Arg Asn Phe Met Gln Leu Ile 
    690                 695                 700                 


His Asp Asp Ser Leu Thr Phe Lys Glu Asp Ile Gln Lys Ala Gln Val 
705                 710                 715                 720 


Ser Gly Gln Gly Asp Ser Leu His Glu His Ile Ala Asn Leu Ala Gly 
                725                 730                 735     


Ser Pro Ala Ile Lys Lys Gly Ile Leu Gln Thr Val Lys Val Val Asp 
            740                 745                 750         


Glu Leu Val Lys Val Met Gly Arg His Lys Pro Glu Asn Ile Val Ile 
        755                 760                 765             


Glu Met Ala Arg Glu Asn Gln Thr Thr Gln Lys Gly Gln Lys Asn Ser 
    770                 775                 780                 


Arg Glu Arg Met Lys Arg Ile Glu Glu Gly Ile Lys Glu Leu Gly Ser 
785                 790                 795                 800 


Gln Ile Leu Lys Glu His Pro Val Glu Asn Thr Gln Leu Gln Asn Glu 
                805                 810                 815     


Lys Leu Tyr Leu Tyr Tyr Leu Gln Asn Gly Arg Asp Met Tyr Val Asp 
            820                 825                 830         


Gln Glu Leu Asp Ile Asn Arg Leu Ser Asp Tyr Asp Val Asp His Ile 
        835                 840                 845             


Val Pro Gln Ser Phe Leu Lys Asp Asp Ser Ile Asp Asn Lys Val Leu 
    850                 855                 860                 


Thr Arg Ser Asp Lys Asn Arg Gly Lys Ser Asp Asn Val Pro Ser Glu 
865                 870                 875                 880 


Glu Val Val Lys Lys Met Lys Asn Tyr Trp Arg Gln Leu Leu Asn Ala 
                885                 890                 895     


Lys Leu Ile Thr Gln Arg Lys Phe Asp Asn Leu Thr Lys Ala Glu Arg 
            900                 905                 910         


Gly Gly Leu Ser Glu Leu Asp Lys Ala Gly Phe Ile Lys Arg Gln Leu 
        915                 920                 925             


Val Glu Thr Arg Gln Ile Thr Lys His Val Ala Gln Ile Leu Asp Ser 
    930                 935                 940                 


Arg Met Asn Thr Lys Tyr Asp Glu Asn Asp Lys Leu Ile Arg Glu Val 
945                 950                 955                 960 


Lys Val Ile Thr Leu Lys Ser Lys Leu Val Ser Asp Phe Arg Lys Asp 
                965                 970                 975     


Phe Gln Phe Tyr Lys Val Arg Glu Ile Asn Asn Tyr His His Ala His 
            980                 985                 990         


Asp Ala Tyr Leu Asn Ala Val Val  Gly Thr Ala Leu Ile  Lys Lys Tyr 
        995                 1000                 1005             


Pro Lys  Leu Glu Ser Glu Phe  Val Tyr Gly Asp Tyr  Lys Val Tyr 
    1010                 1015                 1020             


Asp Val  Arg Lys Met Ile Ala  Lys Ser Glu Gln Glu  Ile Gly Lys 
    1025                 1030                 1035             


Ala Thr  Ala Lys Tyr Phe Phe  Tyr Ser Asn Ile Met  Asn Phe Phe 
    1040                 1045                 1050             


Lys Thr  Glu Ile Thr Leu Ala  Asn Gly Glu Ile Arg  Lys Arg Pro 
    1055                 1060                 1065             


Leu Ile  Glu Thr Asn Gly Glu  Thr Gly Glu Ile Val  Trp Asp Lys 
    1070                 1075                 1080             


Gly Arg  Asp Phe Ala Thr Val  Arg Lys Val Leu Ser  Met Pro Gln 
    1085                 1090                 1095             


Val Asn  Ile Val Lys Lys Thr  Glu Val Gln Thr Gly  Gly Phe Ser 
    1100                 1105                 1110             


Lys Glu  Ser Ile Leu Pro Lys  Arg Asn Ser Asp Lys  Leu Ile Ala 
    1115                 1120                 1125             


Arg Lys  Lys Asp Trp Asp Pro  Lys Lys Tyr Gly Gly  Phe Asp Ser 
    1130                 1135                 1140             


Pro Thr  Val Ala Tyr Ser Val  Leu Val Val Ala Lys  Val Glu Lys 
    1145                 1150                 1155             


Gly Lys  Ser Lys Lys Leu Lys  Ser Val Lys Glu Leu  Leu Gly Ile 
    1160                 1165                 1170             


Thr Ile  Met Glu Arg Ser Ser  Phe Glu Lys Asn Pro  Ile Asp Phe 
    1175                 1180                 1185             


Leu Glu  Ala Lys Gly Tyr Lys  Glu Val Lys Lys Asp  Leu Ile Ile 
    1190                 1195                 1200             


Lys Leu  Pro Lys Tyr Ser Leu  Phe Glu Leu Glu Asn  Gly Arg Lys 
    1205                 1210                 1215             


Arg Met  Leu Ala Ser Ala Gly  Glu Leu Gln Lys Gly  Asn Glu Leu 
    1220                 1225                 1230             


Ala Leu  Pro Ser Lys Tyr Val  Asn Phe Leu Tyr Leu  Ala Ser His 
    1235                 1240                 1245             


Tyr Glu  Lys Leu Lys Gly Ser  Pro Glu Asp Asn Glu  Gln Lys Gln 
    1250                 1255                 1260             


Leu Phe  Val Glu Gln His Lys  His Tyr Leu Asp Glu  Ile Ile Glu 
    1265                 1270                 1275             


Gln Ile  Ser Glu Phe Ser Lys  Arg Val Ile Leu Ala  Asp Ala Asn 
    1280                 1285                 1290             


Leu Asp  Lys Val Leu Ser Ala  Tyr Asn Lys His Arg  Asp Lys Pro 
    1295                 1300                 1305             


Ile Arg  Glu Gln Ala Glu Asn  Ile Ile His Leu Phe  Thr Leu Thr 
    1310                 1315                 1320             


Asn Leu  Gly Ala Pro Ala Ala  Phe Lys Tyr Phe Asp  Thr Thr Ile 
    1325                 1330                 1335             


Asp Arg  Lys Arg Tyr Thr Ser  Thr Lys Glu Val Leu  Asp Ala Thr 
    1340                 1345                 1350             


Leu Ile  His Gln Ser Ile Thr  Gly Leu Tyr Glu Thr  Arg Ile Asp 
    1355                 1360                 1365             


Leu Ser  Gln Leu Gly Gly Asp  Lys Arg Pro Ala Ala  Thr Lys Lys 
    1370                 1375                 1380             


Ala Gly  Gln Ala Lys Lys Lys  Lys 
    1385                 1390     


<210>  23
<211>  4173
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  23
atggacaagc ccaagaaaaa gcggaaagtg aagtacagca tcggcctgga catcggcacc       60

aactctgtgg gctgggccgt gatcaccgac gagtacaagg tgcccagcaa gaaattcaag      120

gtgctgggca acaccgacag gcacagcatc aagaagaacc tgatcggcgc cctgctgttc      180

gacagcggcg aaacagccga ggccaccaga ctgaagagaa ccgccagaag aagatacacc      240

aggcggaaga acaggatctg ctatctgcaa gagatcttca gcaacgagat ggccaaggtg      300

gacgacagct tcttccacag actggaagag tccttcctgg tggaagagga caagaagcac      360

gagagacacc ccatcttcgg caacatcgtg gacgaggtgg cctaccacga gaagtacccc      420

accatctacc acctgagaaa gaaactggtg gacagcaccg acaaggccga cctgagactg      480

atctacctgg ccctggccca catgatcaag ttcagaggcc acttcctgat cgagggcgac      540

ctgaaccccg acaacagcga cgtggacaag ctgttcatcc agctggtgca gacctacaac      600

cagctgttcg aggaaaaccc catcaacgcc agcggcgtgg acgccaaggc tatcctgtct      660

gccagactga gcaagagcag aaggctggaa aatctgatcg cccagctgcc cggcgagaag      720

aagaacggcc tgttcggcaa cctgattgcc ctgagcctgg gcctgacccc caacttcaag      780

agcaacttcg acctggccga ggatgccaaa ctgcagctga gcaaggacac ctacgacgac      840

gacctggaca acctgctggc ccagatcggc gaccagtacg ccgacctgtt cctggccgcc      900

aagaacctgt ctgacgccat cctgctgagc gacatcctga gagtgaacac cgagatcacc      960

aaggcccccc tgagcgcctc tatgatcaag agatacgacg agcaccacca ggacctgacc     1020

ctgctgaaag ctctcgtgcg gcagcagctg cctgagaagt acaaagaaat cttcttcgac     1080

cagagcaaga acggctacgc cggctacatc gatggcggcg ctagccagga agagttctac     1140

aagttcatca agcccatcct ggaaaagatg gacggcaccg aggaactgct cgtgaagctg     1200

aacagagagg acctgctgag aaagcagaga accttcgaca acggcagcat cccccaccag     1260

atccacctgg gagagctgca cgctatcctg agaaggcagg aagattttta cccattcctg     1320

aaggacaacc gggaaaagat cgagaagatc ctgaccttca ggatccccta ctacgtgggc     1380

cccctggcca gaggcaacag cagattcgcc tggatgacca gaaagagcga ggaaaccatc     1440

accccctgga acttcgagga agtggtggac aagggcgcca gcgcccagag cttcatcgag     1500

agaatgacaa acttcgataa gaacctgccc aacgagaagg tgctgcccaa gcacagcctg     1560

ctgtacgagt acttcaccgt gtacaacgag ctgaccaaag tgaaatacgt gaccgaggga     1620

atgagaaagc ccgccttcct gagcggcgag cagaaaaagg ccatcgtgga cctgctgttc     1680

aagaccaaca gaaaagtgac cgtgaagcag ctgaaagagg actacttcaa gaaaatcgag     1740

tgcttcgact ccgtggaaat ctccggcgtg gaagatagat tcaacgcctc cctgggcaca     1800

taccacgatc tgctgaaaat tatcaaggac aaggacttcc tggataacga agagaacgag     1860

gacattctgg aagatatcgt gctgaccctg acactgtttg aggaccgcga gatgatcgag     1920

gaaaggctga aaacctacgc tcacctgttc gacgacaaag tgatgaagca gctgaagaga     1980

aggcggtaca ccggctgggg caggctgagc agaaagctga tcaacggcat cagagacaag     2040

cagagcggca agacaatcct ggatttcctg aagtccgacg gcttcgccaa ccggaacttc     2100

atgcagctga tccacgacga cagcctgaca ttcaaagagg acatccagaa agcccaggtg     2160

tccggccagg gcgactctct gcacgagcat atcgctaacc tggccggcag ccccgctatc     2220

aagaagggca tcctgcagac agtgaaggtg gtggacgagc tcgtgaaagt gatgggcaga     2280

cacaagcccg agaacatcgt gatcgagatg gctagagaga accagaccac ccagaaggga     2340

cagaagaact cccgcgagag gatgaagaga atcgaagagg gcatcaaaga gctgggcagc     2400

cagatcctga aagaacaccc cgtggaaaac acccagctgc agaacgagaa gctgtacctg     2460

tactacctgc agaatggccg ggatatgtac gtggaccagg aactggacat caacagactg     2520

tccgactacg atgtggacca tatcgtgcct cagagctttc tgaaggacga ctccatcgat     2580

aacaaagtgc tgactcggag cgacaagaac agaggcaaga gcgacaacgt gccctccgaa     2640

gaggtcgtga agaagatgaa gaactactgg cgacagctgc tgaacgccaa gctgattacc     2700

cagaggaagt tcgataacct gaccaaggcc gagagaggcg gcctgagcga gctggataag     2760

gccggcttca tcaagaggca gctggtggaa accagacaga tcacaaagca cgtggcacag     2820

atcctggact cccggatgaa cactaagtac gacgaaaacg ataagctgat ccgggaagtg     2880

aaagtgatca ccctgaagtc caagctggtg tccgatttcc ggaaggattt ccagttttac     2940

aaagtgcgcg agatcaacaa ctaccaccac gcccacgacg cctacctgaa cgccgtcgtg     3000

ggaaccgccc tgatcaaaaa gtaccctaag ctggaaagcg agttcgtgta cggcgactac     3060

aaggtgtacg acgtgcggaa gatgatcgcc aagagcgagc aggaaatcgg caaggctacc     3120

gccaagtact tcttctacag caacatcatg aactttttca agaccgaaat caccctggcc     3180

aacggcgaga tcagaaagcg ccctctgatc gagacaaacg gcgaaaccgg ggagatcgtg     3240

tgggataagg gcagagactt cgccacagtg cgaaaggtgc tgagcatgcc ccaagtgaat     3300

atcgtgaaaa agaccgaggt gcagacaggc ggcttcagca aagagtctat cctgcccaag     3360

aggaacagcg acaagctgat cgccagaaag aaggactggg accccaagaa gtacggcggc     3420

ttcgacagcc ctaccgtggc ctactctgtg ctggtggtgg ctaaggtgga aaagggcaag     3480

tccaagaaac tgaagagtgt gaaagagctg ctggggatca ccatcatgga aagaagcagc     3540

tttgagaaga accctatcga ctttctggaa gccaagggct acaaagaagt gaaaaaggac     3600

ctgatcatca agctgcctaa gtactccctg ttcgagctgg aaaacggcag aaagagaatg     3660

ctggcctctg ccggcgaact gcagaaggga aacgagctgg ccctgcctag caaatatgtg     3720

aacttcctgt acctggcctc ccactatgag aagctgaagg gcagccctga ggacaacgaa     3780

cagaaacagc tgtttgtgga acagcataag cactacctgg acgagatcat cgagcagatc     3840

agcgagttct ccaagagagt gatcctggcc gacgccaatc tggacaaggt gctgtctgcc     3900

tacaacaagc acagggacaa gcctatcaga gagcaggccg agaatatcat ccacctgttc     3960

accctgacaa acctgggcgc tcctgccgcc ttcaagtact ttgacaccac catcgaccgg     4020

aagaggtaca ccagcaccaa agaggtgctg gacgccaccc tgatccacca gagcatcacc     4080

ggcctgtacg agacaagaat cgacctgtct cagctgggag gcgacaagag acctgccgcc     4140

actaagaagg ccggacaggc caaaaagaag aag                                  4173


<210>  24
<211>  3069
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic

<400>  24
atgggtaccg atttaaatga tccagtggtc ctgcagagga gagattggga gaatcccggt       60

gtgacacagc tgaacagact agccgcccac cctccctttg cttcttggag aaacagtgag      120

gaagctagga cagacagacc aagccagcaa ctcagatctt tgaacgggga gtggagattt      180

gcctggtttc cggcaccaga agcggtgccg gaaagctggc tggagtgcga tcttcctgag      240

gccgatactg tcgtcgtccc ctcaaactgg cagatgcacg gttacgatgc gcccatctac      300

accaacgtga cctatcccat tacggtcaat ccgccgtttg ttcccacgga gaatccgacg      360

ggttgttact cgctcacatt taatgttgat gaaagctggc tacaggaagg ccagacgcga      420

attatttttg atggcgttaa ctcggcgttt catctgtggt gcaacgggcg ctgggtcggt      480

tacggccagg acagtcgttt gccgtctgaa tttgacctga gcgcattttt acgcgccgga      540

gaaaaccgcc tcgcggtgat ggtgctgcgc tggagtgacg gcagttatct ggaagatcag      600

gatatgtggc ggatgagcgg cattttccgt gacgtctcgt tgctgcataa accgactaca      660

caaatcagcg atttccatgt tgccactcgc tttaatgatg atttcagccg cgctgtactg      720

gaggctgaag ttcagatgtg cggcgagttg cgtgactacc tacgggtaac agtttcttta      780

tggcagggtg aaacgcaggt cgccagcggc accgcgcctt tcggcggtga aattatcgat      840

gagcgtggtg gttatgccga tcgcgtcaca ctacgtctga acgtcgaaaa cccgaaactg      900

tggagcgccg aaatcccgaa tctctatcgt gcggtggttg aactgcacac cgccgacggc      960

acgctgattg aagcagaagc ctgcgatgtc ggtttccgcg aggtgcggat tgaaaatggt     1020

ctgctgctgc tgaacggcaa gccgttgctg attcgaggcg ttaaccgtca cgagcatcat     1080

cctctgcatg gtcaggtcat ggatgagcag acgatggtgc aggatatcct gctgatgaag     1140

cagaacaact ttaacgccgt gcgctgttcg cattatccga accatccgct gtggtacacg     1200

ctgtgcgacc gctacggcct gtatgtggtg gatgaagcca atattgaaac ccacggcatg     1260

gtgccaatga atcgtctgac cgatgatccg cgctggctac cggcgatgag cgaacgcgta     1320

acgcgaatgg tgcagcgcga tcgtaatcac ccgagtgtga tcatctggtc gctggggaat     1380

gaatcaggcc acggcgctaa tcacgacgcg ctgtatcgct ggatcaaatc tgtcgatcct     1440

tcccgcccgg tgcagtatga aggcggcgga gccgacacca cggccaccga tattatttgc     1500

ccgatgtacg cgcgcgtgga tgaagaccag cccttcccgg ctgtgccgaa atggtccatc     1560

aaaaaatggc tttcgctacc tggagagacg cgcccgctga tcctttgcca atacgcccac     1620

gcgatgggta acagtcttgg cggtttcgct aaatactggc aggcgtttcg tcagtatccc     1680

cgtttacagg gcggcttcgt ctgggactgg gtggatcagt cgctgattaa atatgatgaa     1740

aacggcaacc cgtggtcggc ttacggcggt gattttggcg atacgccgaa cgatcgccag     1800

ttctgtatga acggtctggt ctttgccgac cgcacgccgc atccagcgct gacggaagca     1860

aaacaccagc agcagttttt ccagttccgt ttatccgggc aaaccatcga agtgaccagc     1920

gaatacctgt tccgtcatag cgataacgag ctcctgcact ggatggtggc gctggatggt     1980

aagccgctgg caagcggtga agtgcctctg gatgtcgctc cacaaggtaa acagttgatt     2040

gaactgcctg aactaccgca gccggagagc gccgggcaac tctggctcac agtacgcgta     2100

gtgcaaccga acgcgaccgc atggtcagaa gccgggcaca tcagcgcctg gcagcagtgg     2160

cgtctggcgg aaaacctcag tgtgacgctc cccgccgcgt cccacgccat cccgcatctg     2220

accaccagcg aaatggattt ttgcatcgag ctgggtaata agcgttggca atttaaccgc     2280

cagtcaggct ttctttcaca gatgtggatt ggcgataaaa aacaactgct gacgccgctg     2340

cgcgatcagt tcacccgtgc accgctggat aacgacattg gcgtaagtga agcgacccgc     2400

attgacccta acgcctgggt cgaacgctgg aaggcggcgg gccattacca ggccgaagca     2460

gcgttgttgc agtgcacggc agatacactt gctgatgcgg tgctgattac gaccgctcac     2520

gcgtggcagc atcaggggaa aaccttattt atcagccgga aaacctaccg gattgatggt     2580

agtggtcaaa tggcgattac cgttgatgtt gaagtggcga gcgatacacc gcatccggcg     2640

cggattggcc tgaactgcca gctggcgcag gtagcagagc gggtaaactg gctcggatta     2700

gggccgcaag aaaactatcc cgaccgcctt actgccgcct gttttgaccg ctgggatctg     2760

ccattgtcag acatgtatac cccgtacgtc ttcccgagcg aaaacggtct gcgctgcggg     2820

acgcgcgaat tgaattatgg cccacaccag tggcgcggcg acttccagtt caacatcagc     2880

cgctacagtc aacagcaact gatggaaacc agccatcgcc atctgctgca cgcggaagaa     2940

ggcacatggc tgaatatcga cggtttccat atggggattg gtggcgacga ctcctggagc     3000

ccgtcagtat cggcggaatt ccagctgagc gccggtcgct accattacca gttggtctgg     3060

tgtcaaaaa                                                             3069


