                         SEQUENCE LISTING

<110>  Keygene N.V.
 
<120>  Targeted sequence addition

<130>  p6092252pct

<150>  EP 20200254.9
<151>  2020-10-06

<150>  US 63/118,781
<151>  2020-11-27

<160>  39    

<170>  PatentIn version 3.5

<210>  1
<211>  1368
<212>  PRT
<213>  artificial sequence

<220>
<223>  Cas9

<400>  1

Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val 
1               5                   10                  15      


Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 
            20                  25                  30          


Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 
        35                  40                  45              


Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 
    50                  55                  60                  


Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 
65                  70                  75                  80  


Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 
                85                  90                  95      


Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys 
            100                 105                 110         


His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr 
        115                 120                 125             


His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp 
    130                 135                 140                 


Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 
145                 150                 155                 160 


Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 
                165                 170                 175     


Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 
            180                 185                 190         


Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 
        195                 200                 205             


Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 
    210                 215                 220                 


Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 
225                 230                 235                 240 


Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 
                245                 250                 255     


Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 
            260                 265                 270         


Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 
        275                 280                 285             


Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 
    290                 295                 300                 


Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 
305                 310                 315                 320 


Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 
                325                 330                 335     


Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 
            340                 345                 350         


Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 
        355                 360                 365             


Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 
    370                 375                 380                 


Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg 
385                 390                 395                 400 


Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 
                405                 410                 415     


Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe 
            420                 425                 430         


Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 
        435                 440                 445             


Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 
    450                 455                 460                 


Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 
465                 470                 475                 480 


Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 
                485                 490                 495     


Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 
            500                 505                 510         


Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 
        515                 520                 525             


Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 
    530                 535                 540                 


Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 
545                 550                 555                 560 


Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 
                565                 570                 575     


Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 
            580                 585                 590         


Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 
        595                 600                 605             


Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 
    610                 615                 620                 


Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 
625                 630                 635                 640 


His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 
                645                 650                 655     


Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 
            660                 665                 670         


Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 
        675                 680                 685             


Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 
    690                 695                 700                 


Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 
705                 710                 715                 720 


His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly 
                725                 730                 735     


Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 
            740                 745                 750         


Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 
        755                 760                 765             


Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile 
    770                 775                 780                 


Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro 
785                 790                 795                 800 


Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu 
                805                 810                 815     


Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 
            820                 825                 830         


Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys 
        835                 840                 845             


Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg 
    850                 855                 860                 


Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys 
865                 870                 875                 880 


Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys 
                885                 890                 895     


Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 
            900                 905                 910         


Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 
        915                 920                 925             


Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 
    930                 935                 940                 


Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 
945                 950                 955                 960 


Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 
                965                 970                 975     


Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 
            980                 985                 990         


Val Gly Thr Ala Leu Ile Lys Lys  Tyr Pro Lys Leu Glu  Ser Glu Phe 
        995                 1000                 1005             


Val Tyr  Gly Asp Tyr Lys Val  Tyr Asp Val Arg Lys  Met Ile Ala 
    1010                 1015                 1020             


Lys Ser  Glu Gln Glu Ile Gly  Lys Ala Thr Ala Lys  Tyr Phe Phe 
    1025                 1030                 1035             


Tyr Ser  Asn Ile Met Asn Phe  Phe Lys Thr Glu Ile  Thr Leu Ala 
    1040                 1045                 1050             


Asn Gly  Glu Ile Arg Lys Arg  Pro Leu Ile Glu Thr  Asn Gly Glu 
    1055                 1060                 1065             


Thr Gly  Glu Ile Val Trp Asp  Lys Gly Arg Asp Phe  Ala Thr Val 
    1070                 1075                 1080             


Arg Lys  Val Leu Ser Met Pro  Gln Val Asn Ile Val  Lys Lys Thr 
    1085                 1090                 1095             


Glu Val  Gln Thr Gly Gly Phe  Ser Lys Glu Ser Ile  Leu Pro Lys 
    1100                 1105                 1110             


Arg Asn  Ser Asp Lys Leu Ile  Ala Arg Lys Lys Asp  Trp Asp Pro 
    1115                 1120                 1125             


Lys Lys  Tyr Gly Gly Phe Asp  Ser Pro Thr Val Ala  Tyr Ser Val 
    1130                 1135                 1140             


Leu Val  Val Ala Lys Val Glu  Lys Gly Lys Ser Lys  Lys Leu Lys 
    1145                 1150                 1155             


Ser Val  Lys Glu Leu Leu Gly  Ile Thr Ile Met Glu  Arg Ser Ser 
    1160                 1165                 1170             


Phe Glu  Lys Asn Pro Ile Asp  Phe Leu Glu Ala Lys  Gly Tyr Lys 
    1175                 1180                 1185             


Glu Val  Lys Lys Asp Leu Ile  Ile Lys Leu Pro Lys  Tyr Ser Leu 
    1190                 1195                 1200             


Phe Glu  Leu Glu Asn Gly Arg  Lys Arg Met Leu Ala  Ser Ala Gly 
    1205                 1210                 1215             


Glu Leu  Gln Lys Gly Asn Glu  Leu Ala Leu Pro Ser  Lys Tyr Val 
    1220                 1225                 1230             


Asn Phe  Leu Tyr Leu Ala Ser  His Tyr Glu Lys Leu  Lys Gly Ser 
    1235                 1240                 1245             


Pro Glu  Asp Asn Glu Gln Lys  Gln Leu Phe Val Glu  Gln His Lys 
    1250                 1255                 1260             


His Tyr  Leu Asp Glu Ile Ile  Glu Gln Ile Ser Glu  Phe Ser Lys 
    1265                 1270                 1275             


Arg Val  Ile Leu Ala Asp Ala  Asn Leu Asp Lys Val  Leu Ser Ala 
    1280                 1285                 1290             


Tyr Asn  Lys His Arg Asp Lys  Pro Ile Arg Glu Gln  Ala Glu Asn 
    1295                 1300                 1305             


Ile Ile  His Leu Phe Thr Leu  Thr Asn Leu Gly Ala  Pro Ala Ala 
    1310                 1315                 1320             


Phe Lys  Tyr Phe Asp Thr Thr  Ile Asp Arg Lys Arg  Tyr Thr Ser 
    1325                 1330                 1335             


Thr Lys  Glu Val Leu Asp Ala  Thr Leu Ile His Gln  Ser Ile Thr 
    1340                 1345                 1350             


Gly Leu  Tyr Glu Thr Arg Ile  Asp Leu Ser Gln Leu  Gly Gly Asp 
    1355                 1360                 1365             


<210>  2
<211>  4104
<212>  DNA
<213>  artificial sequence

<220>
<223>  sequence encoding Cas9

<400>  2
atggataaaa aatatagcat tggtctggat attggtacca atagcgttgg ttgggcagtt       60

attaccgatg aatataaagt tccgagcaaa aaatttaaag ttctgggtaa taccgatcgt      120

catagcatta aaaaaaatct gattggtgca ctgctgtttg atagcggtga aaccgcagaa      180

gcaacccgtc tgaaacgtac cgcacgtcgt cgttataccc gtcgtaaaaa tcgtatttgt      240

tatctgcagg aaatttttag caatgaaatg gcaaaagttg atgatagctt ttttcatcgt      300

ctggaagaaa gctttctggt tgaagaagat aaaaaacatg aacgtcatcc gatttttggt      360

aatattgttg atgaagttgc atatcatgaa aaatatccga ccatttatca tctgcgtaaa      420

aaactggttg atagcaccga taaagcagat ctgcgtctga tttatctggc actggcacat      480

atgattaaat ttcgtggtca ttttctgatt gaaggtgatc tgaatccgga taatagcgat      540

gttgataaac tgtttattca gctggttcag acctataatc agctgtttga agaaaatccg      600

attaatgcaa gcggtgttga tgcaaaagca attctgagcg cacgtctgag caaaagccgt      660

cgtctggaaa atctgattgc acagctgccg ggtgaaaaaa aaaatggtct gtttggtaat      720

ctgattgcac tgagcctggg tctgaccccg aattttaaaa gcaattttga tctggcagaa      780

gatgcaaaac tgcagctgag caaagatacc tatgatgatg atctggataa tctgctggca      840

cagattggtg atcagtatgc agatctgttt ctggcagcaa aaaatctgag cgatgcaatt      900

ctgctgagcg atattctgcg tgttaatacc gaaattacca aagcaccgct gagcgcaagc      960

atgattaaac gttatgatga acatcatcag gatctgaccc tgctgaaagc actggttcgt     1020

cagcagctgc cggaaaaata taaagaaatt ttttttgatc agagcaaaaa tggttatgca     1080

ggttatattg atggtggtgc aagccaggaa gaattttata aatttattaa accgattctg     1140

gaaaaaatgg atggtaccga agaactgctg gttaaactga atcgtgaaga tctgctgcgt     1200

aaacagcgta cctttgataa tggtagcatt ccgcatcaga ttcatctggg tgaactgcat     1260

gcaattctgc gtcgtcagga agatttttat ccgtttctga aagataatcg tgaaaaaatt     1320

gaaaaaattc tgacctttcg tattccgtat tatgttggtc cgctggcacg tggtaatagc     1380

cgttttgcat ggatgacccg taaaagcgaa gaaaccatta ccccgtggaa ttttgaagaa     1440

gttgttgata aaggtgcaag cgcacagagc tttattgaac gtatgaccaa ttttgataaa     1500

aatctgccga atgaaaaagt tctgccgaaa catagcctgc tgtatgaata ttttaccgtt     1560

tataatgaac tgaccaaagt taaatatgtt accgaaggta tgcgtaaacc ggcatttctg     1620

agcggtgaac agaaaaaagc aattgttgat ctgctgttta aaaccaatcg taaagttacc     1680

gttaaacagc tgaaagaaga ttattttaaa aaaattgaat gttttgatag cgttgaaatt     1740

agcggtgttg aagatcgttt taatgcaagc ctgggtacct atcatgatct gctgaaaatt     1800

attaaagata aagattttct ggataatgaa gaaaatgaag atattctgga agatattgtt     1860

ctgaccctga ccctgtttga agatcgtgaa atgattgaag aacgtctgaa aacctatgca     1920

catctgtttg atgataaagt tatgaaacag ctgaaacgtc gtcgttatac cggttggggt     1980

cgtctgagcc gtaaactgat taatggtatt cgtgataaac agagcggtaa aaccattctg     2040

gattttctga aaagcgatgg ttttgcaaat cgtaatttta tgcagctgat tcatgatgat     2100

agcctgacct ttaaagaaga tattcagaaa gcacaggtta gcggtcaggg tgatagcctg     2160

catgaacata ttgcaaatct ggcaggtagc ccggcaatta aaaaaggtat tctgcagacc     2220

gttaaagttg ttgatgaact ggttaaagtt atgggtcgtc ataaaccgga aaatattgtt     2280

attgaaatgg cacgtgaaaa tcagaccacc cagaaaggtc agaaaaatag ccgtgaacgt     2340

atgaaacgta ttgaagaagg tattaaagaa ctgggtagcc agattctgaa agaacatccg     2400

gttgaaaata cccagctgca gaatgaaaaa ctgtatctgt attatctgca gaatggtcgt     2460

gatatgtatg ttgatcagga actggatatt aatcgtctga gcgattatga tgttgatcat     2520

attgttccgc agagctttct gaaagatgat agcattgata ataaagttct gacccgtagc     2580

gataaaaatc gtggtaaaag cgataatgtt ccgagcgaag aagttgttaa aaaaatgaaa     2640

aattattggc gtcagctgct gaatgcaaaa ctgattaccc agcgtaaatt tgataatctg     2700

accaaagcag aacgtggtgg tctgagcgaa ctggataaag caggttttat taaacgtcag     2760

ctggttgaaa cccgtcagat taccaaacat gttgcacaga ttctggatag ccgtatgaat     2820

accaaatatg atgaaaatga taaactgatt cgtgaagtta aagttattac cctgaaaagc     2880

aaactggtta gcgattttcg taaagatttt cagttttata aagttcgtga aattaataat     2940

tatcatcatg cacatgatgc atatctgaat gcagttgttg gtaccgcact gattaaaaaa     3000

tatccgaaac tggaaagcga atttgtttat ggtgattata aagtttatga tgttcgtaaa     3060

atgattgcaa aaagcgaaca ggaaattggt aaagcaaccg caaaatattt tttttatagc     3120

aatattatga atttttttaa aaccgaaatt accctggcaa atggtgaaat tcgtaaacgt     3180

ccgctgattg aaaccaatgg tgaaaccggt gaaattgttt gggataaagg tcgtgatttt     3240

gcaaccgttc gtaaagttct gagcatgccg caggttaata ttgttaaaaa aaccgaagtt     3300

cagaccggtg gttttagcaa agaaagcatt ctgccgaaac gtaatagcga taaactgatt     3360

gcacgtaaaa aagattggga tccgaaaaaa tatggtggtt ttgatagccc gaccgttgca     3420

tatagcgttc tggttgttgc aaaagttgaa aaaggtaaaa gcaaaaaact gaaaagcgtt     3480

aaagaactgc tgggtattac cattatggaa cgtagcagct ttgaaaaaaa tccgattgat     3540

tttctggaag caaaaggtta taaagaagtt aaaaaagatc tgattattaa actgccgaaa     3600

tatagcctgt ttgaactgga aaatggtcgt aaacgtatgc tggcaagcgc aggtgaactg     3660

cagaaaggta atgaactggc actgccgagc aaatatgtta attttctgta tctggcaagc     3720

cattatgaaa aactgaaagg tagcccggaa gataatgaac agaaacagct gtttgttgaa     3780

cagcataaac attatctgga tgaaattatt gaacagatta gcgaatttag caaacgtgtt     3840

attctggcag atgcaaatct ggataaagtt ctgagcgcat ataataaaca tcgtgataaa     3900

ccgattcgtg aacaggcaga aaatattatt catctgttta ccctgaccaa tctgggtgca     3960

ccggcagcat ttaaatattt tgataccacc attgatcgta aacgttatac cagcaccaaa     4020

gaagttctgg atgcaaccct gattcatcag agcattaccg gtctgtatga aacccgtatt     4080

gatctgagcc agctgggtgg tgat                                            4104


<210>  3
<211>  1082
<212>  PRT
<213>  Geobacillus thermodenitrificans T12

<400>  3

Met Lys Tyr Lys Ile Gly Leu Asp Ile Gly Ile Thr Ser Ile Gly Trp 
1               5                   10                  15      


Ala Val Ile Asn Leu Asp Ile Pro Arg Ile Glu Asp Leu Gly Val Arg 
            20                  25                  30          


Ile Phe Asp Arg Ala Glu Asn Pro Lys Thr Gly Glu Ser Leu Ala Leu 
        35                  40                  45              


Pro Arg Arg Leu Ala Arg Ser Ala Arg Arg Arg Leu Arg Arg Arg Lys 
    50                  55                  60                  


His Arg Leu Glu Arg Ile Arg Arg Leu Phe Val Arg Glu Gly Ile Leu 
65                  70                  75                  80  


Thr Lys Glu Glu Leu Asn Lys Leu Phe Glu Lys Lys His Glu Ile Asp 
                85                  90                  95      


Val Trp Gln Leu Arg Val Glu Ala Leu Asp Arg Lys Leu Asn Asn Asp 
            100                 105                 110         


Glu Leu Ala Arg Ile Leu Leu His Leu Ala Lys Arg Arg Gly Phe Arg 
        115                 120                 125             


Ser Asn Arg Lys Ser Glu Arg Thr Asn Lys Glu Asn Ser Thr Met Leu 
    130                 135                 140                 


Lys His Ile Glu Glu Asn Gln Ser Ile Leu Ser Ser Tyr Arg Thr Val 
145                 150                 155                 160 


Ala Glu Met Val Val Lys Asp Pro Lys Phe Ser Leu His Lys Arg Asn 
                165                 170                 175     


Lys Glu Asp Asn Tyr Thr Asn Thr Val Ala Arg Asp Asp Leu Glu Arg 
            180                 185                 190         


Glu Ile Lys Leu Ile Phe Ala Lys Gln Arg Glu Tyr Gly Asn Ile Val 
        195                 200                 205             


Cys Thr Glu Ala Phe Glu His Glu Tyr Ile Ser Ile Trp Ala Ser Gln 
    210                 215                 220                 


Arg Pro Phe Ala Ser Lys Asp Asp Ile Glu Lys Lys Val Gly Phe Cys 
225                 230                 235                 240 


Thr Phe Glu Pro Lys Glu Lys Arg Ala Pro Lys Ala Thr Tyr Thr Phe 
                245                 250                 255     


Gln Ser Phe Thr Val Trp Glu His Ile Asn Lys Leu Arg Leu Val Ser 
            260                 265                 270         


Pro Gly Gly Ile Arg Ala Leu Thr Asp Asp Glu Arg Arg Leu Ile Tyr 
        275                 280                 285             


Lys Gln Ala Phe His Lys Asn Lys Ile Thr Phe His Asp Val Arg Thr 
    290                 295                 300                 


Leu Leu Asn Leu Pro Asp Asp Thr Arg Phe Lys Gly Leu Leu Tyr Asp 
305                 310                 315                 320 


Arg Asn Thr Thr Leu Lys Glu Asn Glu Lys Val Arg Phe Leu Glu Leu 
                325                 330                 335     


Gly Ala Tyr His Lys Ile Arg Lys Ala Ile Asp Ser Val Tyr Gly Lys 
            340                 345                 350         


Gly Ala Ala Lys Ser Phe Arg Pro Ile Asp Phe Asp Thr Phe Gly Tyr 
        355                 360                 365             


Ala Leu Thr Met Phe Lys Asp Asp Thr Asp Ile Arg Ser Tyr Leu Arg 
    370                 375                 380                 


Asn Glu Tyr Glu Gln Asn Gly Lys Arg Met Glu Asn Leu Ala Asp Lys 
385                 390                 395                 400 


Val Tyr Asp Glu Glu Leu Ile Glu Glu Leu Leu Asn Leu Ser Phe Ser 
                405                 410                 415     


Lys Phe Gly His Leu Ser Leu Lys Ala Leu Arg Asn Ile Leu Pro Tyr 
            420                 425                 430         


Met Glu Gln Gly Glu Val Tyr Ser Thr Ala Cys Glu Arg Ala Gly Tyr 
        435                 440                 445             


Thr Phe Thr Gly Pro Lys Lys Lys Gln Lys Thr Val Leu Leu Pro Asn 
    450                 455                 460                 


Ile Pro Pro Ile Ala Asn Pro Val Val Met Arg Ala Leu Thr Gln Ala 
465                 470                 475                 480 


Arg Lys Val Val Asn Ala Ile Ile Lys Lys Tyr Gly Ser Pro Val Ser 
                485                 490                 495     


Ile His Ile Glu Leu Ala Arg Glu Leu Ser Gln Ser Phe Asp Glu Arg 
            500                 505                 510         


Arg Lys Met Gln Lys Glu Gln Glu Gly Asn Arg Lys Lys Asn Glu Thr 
        515                 520                 525             


Ala Ile Arg Gln Leu Val Glu Tyr Gly Leu Thr Leu Asn Pro Thr Gly 
    530                 535                 540                 


Leu Asp Ile Val Lys Phe Lys Leu Trp Ser Glu Gln Asn Gly Lys Cys 
545                 550                 555                 560 


Ala Tyr Ser Leu Gln Pro Ile Glu Ile Glu Arg Leu Leu Glu Pro Gly 
                565                 570                 575     


Tyr Thr Glu Val Asp His Val Ile Pro Tyr Ser Arg Ser Leu Asp Asp 
            580                 585                 590         


Ser Tyr Thr Asn Lys Val Leu Val Leu Thr Lys Glu Asn Arg Glu Lys 
        595                 600                 605             


Gly Asn Arg Thr Pro Ala Glu Tyr Leu Gly Leu Gly Ser Glu Arg Trp 
    610                 615                 620                 


Gln Gln Phe Glu Thr Phe Val Leu Thr Asn Lys Gln Phe Ser Lys Lys 
625                 630                 635                 640 


Lys Arg Asp Arg Leu Leu Arg Leu His Tyr Asp Glu Asn Glu Glu Asn 
                645                 650                 655     


Glu Phe Lys Asn Arg Asn Leu Asn Asp Thr Arg Tyr Ile Ser Arg Phe 
            660                 665                 670         


Leu Ala Asn Phe Ile Arg Glu His Leu Lys Phe Ala Asp Ser Asp Asp 
        675                 680                 685             


Lys Gln Lys Val Tyr Thr Val Asn Gly Arg Ile Thr Ala His Leu Arg 
    690                 695                 700                 


Ser Arg Trp Asn Phe Asn Lys Asn Arg Glu Glu Ser Asn Leu His His 
705                 710                 715                 720 


Ala Val Asp Ala Ala Ile Val Ala Cys Thr Thr Pro Ser Asp Ile Ala 
                725                 730                 735     


Arg Val Thr Ala Phe Tyr Gln Arg Arg Glu Gln Asn Lys Glu Leu Ser 
            740                 745                 750         


Lys Lys Thr Asp Pro Gln Phe Pro Gln Pro Trp Pro His Phe Ala Asp 
        755                 760                 765             


Glu Leu Gln Ala Arg Leu Ser Lys Asn Pro Lys Glu Ser Ile Lys Ala 
    770                 775                 780                 


Leu Asn Leu Gly Asn Tyr Asp Asn Glu Lys Leu Glu Ser Leu Gln Pro 
785                 790                 795                 800 


Val Phe Val Ser Arg Met Pro Lys Arg Ser Ile Thr Gly Ala Ala His 
                805                 810                 815     


Gln Glu Thr Leu Arg Arg Tyr Ile Gly Ile Asp Glu Arg Ser Gly Lys 
            820                 825                 830         


Ile Gln Thr Val Val Lys Lys Lys Leu Ser Glu Ile Gln Leu Asp Lys 
        835                 840                 845             


Thr Gly His Phe Pro Met Tyr Gly Lys Glu Ser Asp Pro Arg Thr Tyr 
    850                 855                 860                 


Glu Ala Ile Arg Gln Arg Leu Leu Glu His Asn Asn Asp Pro Lys Lys 
865                 870                 875                 880 


Ala Phe Gln Glu Pro Leu Tyr Lys Pro Lys Lys Asn Gly Glu Leu Gly 
                885                 890                 895     


Pro Ile Ile Arg Thr Ile Lys Ile Ile Asp Thr Thr Asn Gln Val Ile 
            900                 905                 910         


Pro Leu Asn Asp Gly Lys Thr Val Ala Tyr Asn Ser Asn Ile Val Arg 
        915                 920                 925             


Val Asp Val Phe Glu Lys Asp Gly Lys Tyr Tyr Cys Val Pro Ile Tyr 
    930                 935                 940                 


Thr Ile Asp Met Met Lys Gly Ile Leu Pro Asn Lys Ala Ile Glu Pro 
945                 950                 955                 960 


Asn Lys Pro Tyr Ser Glu Trp Lys Glu Met Thr Glu Asp Tyr Thr Phe 
                965                 970                 975     


Arg Phe Ser Leu Tyr Pro Asn Asp Leu Ile Arg Ile Glu Phe Pro Arg 
            980                 985                 990         


Glu Lys Thr Ile Lys Thr Ala Val  Gly Glu Glu Ile Lys  Ile Lys Asp 
        995                 1000                 1005             


Leu Phe  Ala Tyr Tyr Gln Thr  Ile Asp Ser Ser Asn  Gly Gly Leu 
    1010                 1015                 1020             


Ser Leu  Val Ser His Asp Asn  Asn Phe Ser Leu Arg  Ser Ile Gly 
    1025                 1030                 1035             


Ser Arg  Thr Leu Lys Arg Phe  Glu Lys Tyr Gln Val  Asp Val Leu 
    1040                 1045                 1050             


Gly Asn  Ile Tyr Lys Val Arg  Gly Glu Lys Arg Val  Gly Val Ala 
    1055                 1060                 1065             


Ser Ser  Ser His Ser Lys Ala  Gly Glu Thr Ile Arg  Pro Leu 
    1070                 1075                 1080         


<210>  4
<211>  1300
<212>  PRT
<213>  artificial sequence

<220>
<223>  FnCpfI

<400>  4

Met Ser Ile Tyr Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr 
1               5                   10                  15      


Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys 
            20                  25                  30          


Ala Arg Gly Leu Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys 
        35                  40                  45              


Lys Ala Lys Gln Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu 
    50                  55                  60                  


Ile Leu Ser Ser Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser 
65                  70                  75                  80  


Asp Val Tyr Phe Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys 
                85                  90                  95      


Asp Phe Lys Ser Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr 
            100                 105                 110         


Ile Lys Asp Ser Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile 
        115                 120                 125             


Asp Ala Lys Lys Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln 
    130                 135                 140                 


Ser Lys Asp Asn Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr 
145                 150                 155                 160 


Asp Ile Asp Glu Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr 
                165                 170                 175     


Thr Tyr Phe Lys Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser 
            180                 185                 190         


Asn Asp Ile Pro Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu 
        195                 200                 205             


Pro Lys Phe Leu Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys 
    210                 215                 220                 


Ala Pro Glu Ala Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu 
225                 230                 235                 240 


Glu Leu Thr Phe Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg 
                245                 250                 255     


Val Phe Ser Leu Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr 
            260                 265                 270         


Leu Asn Gln Ser Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys 
        275                 280                 285             


Phe Val Asn Gly Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile 
    290                 295                 300                 


Asn Leu Tyr Ser Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys 
305                 310                 315                 320 


Met Ser Val Leu Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser 
                325                 330                 335     


Phe Val Ile Asp Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met 
            340                 345                 350         


Gln Ser Phe Tyr Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys 
        355                 360                 365             


Ser Ile Lys Glu Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln 
    370                 375                 380                 


Lys Leu Asp Leu Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr 
385                 390                 395                 400 


Asp Leu Ser Gln Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala 
                405                 410                 415     


Val Leu Glu Tyr Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn 
            420                 425                 430         


Pro Ser Lys Lys Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala 
        435                 440                 445             


Lys Tyr Leu Ser Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn 
    450                 455                 460                 


Lys His Arg Asp Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala 
465                 470                 475                 480 


Asn Phe Ala Ala Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys 
                485                 490                 495     


Asp Asn Leu Ala Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys 
            500                 505                 510         


Asp Leu Leu Gln Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp 
        515                 520                 525             


Leu Leu Asp Gln Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His 
    530                 535                 540                 


Ile Ser Gln Ser Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His 
545                 550                 555                 560 


Phe Tyr Leu Val Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val 
                565                 570                 575     


Pro Leu Tyr Asn Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser 
            580                 585                 590         


Asp Glu Lys Phe Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly 
        595                 600                 605             


Trp Asp Lys Asn Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys 
    610                 615                 620                 


Asp Asp Lys Tyr Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile 
625                 630                 635                 640 


Phe Asp Asp Lys Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys 
                645                 650                 655     


Ile Val Tyr Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val 
            660                 665                 670         


Phe Phe Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile 
        675                 680                 685             


Leu Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln 
    690                 695                 700                 


Lys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe 
705                 710                 715                 720 


Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp 
                725                 730                 735     


Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu 
            740                 745                 750         


Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn 
        755                 760                 765             


Ile Ser Glu Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr 
    770                 775                 780                 


Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg 
785                 790                 795                 800 


Pro Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn 
                805                 810                 815     


Leu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe Tyr 
            820                 825                 830         


Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys Glu Ala 
        835                 840                 845             


Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser Val Phe Glu 
    850                 855                 860                 


Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe 
865                 870                 875                 880 


His Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe 
                885                 890                 895     


Asn Asp Glu Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His 
            900                 905                 910         


Ile Leu Ser Ile Asp Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu 
        915                 920                 925             


Val Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile 
    930                 935                 940                 


Gly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala Ile 
945                 950                 955                 960 


Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn 
                965                 970                 975     


Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val His Glu Ile 
            980                 985                 990         


Ala Lys Leu Val Ile Glu Tyr Asn  Ala Ile Val Val Phe  Glu Asp Leu 
        995                 1000                 1005             


Asn Phe  Gly Phe Lys Arg Gly  Arg Phe Lys Val Glu  Lys Gln Val 
    1010                 1015                 1020             


Tyr Gln  Lys Leu Glu Lys Met  Leu Ile Glu Lys Leu  Asn Tyr Leu 
    1025                 1030                 1035             


Val Phe  Lys Asp Asn Glu Phe  Asp Lys Thr Gly Gly  Val Leu Arg 
    1040                 1045                 1050             


Ala Tyr  Gln Leu Thr Ala Pro  Phe Glu Thr Phe Lys  Lys Met Gly 
    1055                 1060                 1065             


Lys Gln  Thr Gly Ile Ile Tyr  Tyr Val Pro Ala Gly  Phe Thr Ser 
    1070                 1075                 1080             


Lys Ile  Cys Pro Val Thr Gly  Phe Val Asn Gln Leu  Tyr Pro Lys 
    1085                 1090                 1095             


Tyr Glu  Ser Val Ser Lys Ser  Gln Glu Phe Phe Ser  Lys Phe Asp 
    1100                 1105                 1110             


Lys Ile  Cys Tyr Asn Leu Asp  Lys Gly Tyr Phe Glu  Phe Ser Phe 
    1115                 1120                 1125             


Asp Tyr  Lys Asn Phe Gly Asp  Lys Ala Ala Lys Gly  Lys Trp Thr 
    1130                 1135                 1140             


Ile Ala  Ser Phe Gly Ser Arg  Leu Ile Asn Phe Arg  Asn Ser Asp 
    1145                 1150                 1155             


Lys Asn  His Asn Trp Asp Thr  Arg Glu Val Tyr Pro  Thr Lys Glu 
    1160                 1165                 1170             


Leu Glu  Lys Leu Leu Lys Asp  Tyr Ser Ile Glu Tyr  Gly His Gly 
    1175                 1180                 1185             


Glu Cys  Ile Lys Ala Ala Ile  Cys Gly Glu Ser Asp  Lys Lys Phe 
    1190                 1195                 1200             


Phe Ala  Lys Leu Thr Ser Val  Leu Asn Thr Ile Leu  Gln Met Arg 
    1205                 1210                 1215             


Asn Ser  Lys Thr Gly Thr Glu  Leu Asp Tyr Leu Ile  Ser Pro Val 
    1220                 1225                 1230             


Ala Asp  Val Asn Gly Asn Phe  Phe Asp Ser Arg Gln  Ala Pro Lys 
    1235                 1240                 1245             


Asn Met  Pro Gln Asp Ala Asp  Ala Asn Gly Ala Tyr  His Ile Gly 
    1250                 1255                 1260             


Leu Lys  Gly Leu Met Leu Leu  Gly Arg Ile Lys Asn  Asn Gln Glu 
    1265                 1270                 1275             


Gly Lys  Lys Leu Asn Leu Val  Ile Lys Asn Glu Glu  Tyr Phe Glu 
    1280                 1285                 1290             


Phe Val  Gln Asn Arg Asn Asn  
    1295                 1300 


<210>  5
<211>  3900
<212>  DNA
<213>  artificial sequence

<220>
<223>  sequence encoding FnCpfI

<400>  5
atgagcattt atcaggaatt tgttaataaa tatagcctga gcaaaaccct gcgttttgaa       60

ctgattccgc agggtaaaac cctggaaaat attaaagcac gtggtctgat tctggatgat      120

gaaaaacgtg caaaagatta taaaaaagca aaacagatta ttgataaata tcatcagttt      180

tttattgaag aaattctgag cagcgtttgt attagcgaag atctgctgca gaattatagc      240

gatgtttatt ttaaactgaa aaaaagcgat gatgataatc tgcagaaaga ttttaaaagc      300

gcaaaagata ccattaaaaa acagattagc gaatatatta aagatagcga aaaatttaaa      360

aatctgttta atcagaatct gattgatgca aaaaaaggtc aggaaagcga tctgattctg      420

tggctgaaac agagcaaaga taatggtatt gaactgttta aagcaaatag cgatattacc      480

gatattgatg aagcactgga aattattaaa agctttaaag gttggaccac ctattttaaa      540

ggttttcatg aaaatcgtaa aaatgtttat agcagcaatg atattccgac cagcattatt      600

tatcgtattg ttgatgataa tctgccgaaa tttctggaaa ataaagcaaa atatgaaagc      660

ctgaaagata aagcaccgga agcaattaat tatgaacaga ttaaaaaaga tctggcagaa      720

gaactgacct ttgatattga ttataaaacc agcgaagtta atcagcgtgt ttttagcctg      780

gatgaagttt ttgaaattgc aaattttaat aattatctga atcagagcgg tattaccaaa      840

tttaatacca ttattggtgg taaatttgtt aatggtgaaa ataccaaacg taaaggtatt      900

aatgaatata ttaatctgta tagccagcag attaatgata aaaccctgaa aaaatataaa      960

atgagcgttc tgtttaaaca gattctgagc gataccgaaa gcaaaagctt tgttattgat     1020

aaactggaag atgatagcga tgttgttacc accatgcaga gcttttatga acagattgca     1080

gcatttaaaa ccgttgaaga aaaaagcatt aaagaaaccc tgagcctgct gtttgatgat     1140

ctgaaagcac agaaactgga tctgagcaaa atttatttta aaaatgataa aagcctgacc     1200

gatctgagcc agcaggtttt tgatgattat agcgttattg gtaccgcagt tctggaatat     1260

attacccagc agattgcacc gaaaaatctg gataatccga gcaaaaaaga acaggaactg     1320

attgcaaaaa aaaccgaaaa agcaaaatat ctgagcctgg aaaccattaa actggcactg     1380

gaagaattta ataaacatcg tgatattgat aaacagtgtc gttttgaaga aattctggca     1440

aattttgcag caattccgat gatttttgat gaaattgcac agaataaaga taatctggca     1500

cagattagca ttaaatatca gaatcagggt aaaaaagatc tgctgcaggc aagcgcagaa     1560

gatgatgtta aagcaattaa agatctgctg gatcagacca ataatctgct gcataaactg     1620

aaaatttttc atattagcca gagcgaagat aaagcaaata ttctggataa agatgaacat     1680

ttttatctgg tttttgaaga atgttatttt gaactggcaa atattgttcc gctgtataat     1740

aaaattcgta attatattac ccagaaaccg tatagcgatg aaaaatttaa actgaatttt     1800

gaaaatagca ccctggcaaa tggttgggat aaaaataaag aaccggataa taccgcaatt     1860

ctgtttatta aagatgataa atattatctg ggtgttatga ataaaaaaaa taataaaatt     1920

tttgatgata aagcaattaa agaaaataaa ggtgaaggtt ataaaaaaat tgtttataaa     1980

ctgctgccgg gtgcaaataa aatgctgccg aaagtttttt ttagcgcaaa aagcattaaa     2040

ttttataatc cgagcgaaga tattctgcgt attcgtaatc atagcaccca taccaaaaat     2100

ggtagcccgc agaaaggtta tgaaaaattt gaatttaata ttgaagattg tcgtaaattt     2160

attgattttt ataaacagag cattagcaaa catccggaat ggaaagattt tggttttcgt     2220

tttagcgata cccagcgtta taatagcatt gatgaatttt atcgtgaagt tgaaaatcag     2280

ggttataaac tgacctttga aaatattagc gaaagctata ttgatagcgt tgttaatcag     2340

ggtaaactgt atctgtttca gatttataat aaagatttta gcgcatatag caaaggtcgt     2400

ccgaatctgc ataccctgta ttggaaagca ctgtttgatg aacgtaatct gcaggatgtt     2460

gtttataaac tgaatggtga agcagaactg ttttatcgta aacagagcat tccgaaaaaa     2520

attacccatc cggcaaaaga agcaattgca aataaaaata aagataatcc gaaaaaagaa     2580

agcgtttttg aatatgatct gattaaagat aaacgtttta ccgaagataa attttttttt     2640

cattgtccga ttaccattaa ttttaaaagc agcggtgcaa ataaatttaa tgatgaaatt     2700

aatctgctgc tgaaagaaaa agcaaatgat gttcatattc tgagcattga tcgtggtgaa     2760

cgtcatctgg catattatac cctggttgat ggtaaaggta atattattaa acaggatacc     2820

tttaatatta ttggtaatga tcgtatgaaa accaattatc atgataaact ggcagcaatt     2880

gaaaaagatc gtgatagcgc acgtaaagat tggaaaaaaa ttaataatat taaagaaatg     2940

aaagaaggtt atctgagcca ggttgttcat gaaattgcaa aactggttat tgaatataat     3000

gcaattgttg tttttgaaga tctgaatttt ggttttaaac gtggtcgttt taaagttgaa     3060

aaacaggttt atcagaaact ggaaaaaatg ctgattgaaa aactgaatta tctggttttt     3120

aaagataatg aatttgataa aaccggtggt gttctgcgtg catatcagct gaccgcaccg     3180

tttgaaacct ttaaaaaaat gggtaaacag accggtatta tttattatgt tccggcaggt     3240

tttaccagca aaatttgtcc ggttaccggt tttgttaatc agctgtatcc gaaatatgaa     3300

agcgttagca aaagccagga attttttagc aaatttgata aaatttgtta taatctggat     3360

aaaggttatt ttgaatttag ctttgattat aaaaattttg gtgataaagc agcaaaaggt     3420

aaatggacca ttgcaagctt tggtagccgt ctgattaatt ttcgtaatag cgataaaaat     3480

cataattggg atacccgtga agtttatccg accaaagaac tggaaaaact gctgaaagat     3540

tatagcattg aatatggtca tggtgaatgt attaaagcag caatttgtgg tgaaagcgat     3600

aaaaaatttt ttgcaaaact gaccagcgtt ctgaatacca ttctgcagat gcgtaatagc     3660

aaaaccggta ccgaactgga ttatctgatt agcccggttg cagatgttaa tggtaatttt     3720

tttgatagcc gtcaggcacc gaaaaatatg ccgcaggatg cagatgcaaa tggtgcatat     3780

catattggtc tgaaaggtct gatgctgctg ggtcgtatta aaaataatca ggaaggtaaa     3840

aaactgaatc tggttattaa aaatgaagaa tattttgaat ttgttcagaa tcgtaataat     3900


<210>  6
<211>  1263
<212>  PRT
<213>  Eubacterium rectale

<400>  6

Met Asn Asn Gly Thr Asn Asn Phe Gln Asn Phe Ile Gly Ile Ser Ser 
1               5                   10                  15      


Leu Gln Lys Thr Leu Arg Asn Ala Leu Ile Pro Thr Glu Thr Thr Gln 
            20                  25                  30          


Gln Phe Ile Val Lys Asn Gly Ile Ile Lys Glu Asp Glu Leu Arg Gly 
        35                  40                  45              


Glu Asn Arg Gln Ile Leu Lys Asp Ile Met Asp Asp Tyr Tyr Arg Gly 
    50                  55                  60                  


Phe Ile Ser Glu Thr Leu Ser Ser Ile Asp Asp Ile Asp Trp Thr Ser 
65                  70                  75                  80  


Leu Phe Glu Lys Met Glu Ile Gln Leu Lys Asn Gly Asp Asn Lys Asp 
                85                  90                  95      


Thr Leu Ile Lys Glu Gln Thr Glu Tyr Arg Lys Ala Ile His Lys Lys 
            100                 105                 110         


Phe Ala Asn Asp Asp Arg Phe Lys Asn Met Phe Ser Ala Lys Leu Ile 
        115                 120                 125             


Ser Asp Ile Leu Pro Glu Phe Val Ile His Asn Asn Asn Tyr Ser Ala 
    130                 135                 140                 


Ser Glu Lys Glu Glu Lys Thr Gln Val Ile Lys Leu Phe Ser Arg Phe 
145                 150                 155                 160 


Ala Thr Ser Phe Lys Asp Tyr Phe Lys Asn Arg Ala Asn Cys Phe Ser 
                165                 170                 175     


Ala Asp Asp Ile Ser Ser Ser Ser Cys His Arg Ile Val Asn Asp Asn 
            180                 185                 190         


Ala Glu Ile Phe Phe Ser Asn Ala Leu Val Tyr Arg Arg Ile Val Lys 
        195                 200                 205             


Ser Leu Ser Asn Asp Asp Ile Asn Lys Ile Ser Gly Asp Met Lys Asp 
    210                 215                 220                 


Ser Leu Lys Glu Met Ser Leu Glu Glu Ile Tyr Ser Tyr Glu Lys Tyr 
225                 230                 235                 240 


Gly Glu Phe Ile Thr Gln Glu Gly Ile Ser Phe Tyr Asn Asp Ile Cys 
                245                 250                 255     


Gly Lys Val Asn Ser Phe Met Asn Leu Tyr Cys Gln Lys Asn Lys Glu 
            260                 265                 270         


Asn Lys Asn Leu Tyr Lys Leu Gln Lys Leu His Lys Gln Ile Leu Cys 
        275                 280                 285             


Ile Ala Asp Thr Ser Tyr Glu Val Pro Tyr Lys Phe Glu Ser Asp Glu 
    290                 295                 300                 


Glu Val Tyr Gln Ser Val Asn Gly Phe Leu Asp Asn Ile Ser Ser Lys 
305                 310                 315                 320 


His Ile Val Glu Arg Leu Arg Lys Ile Gly Asp Asn Tyr Asn Gly Tyr 
                325                 330                 335     


Asn Leu Asp Lys Ile Tyr Ile Val Ser Lys Phe Tyr Glu Ser Val Ser 
            340                 345                 350         


Gln Lys Thr Tyr Arg Asp Trp Glu Thr Ile Asn Thr Ala Leu Glu Ile 
        355                 360                 365             


His Tyr Asn Asn Ile Leu Pro Gly Asn Gly Lys Ser Lys Ala Asp Lys 
    370                 375                 380                 


Val Lys Lys Ala Val Lys Asn Asp Leu Gln Lys Ser Ile Thr Glu Ile 
385                 390                 395                 400 


Asn Glu Leu Val Ser Asn Tyr Lys Leu Cys Ser Asp Asp Asn Ile Lys 
                405                 410                 415     


Ala Glu Thr Tyr Ile His Glu Ile Ser His Ile Leu Asn Asn Phe Glu 
            420                 425                 430         


Ala Gln Glu Leu Lys Tyr Asn Pro Glu Ile His Leu Val Glu Ser Glu 
        435                 440                 445             


Leu Lys Ala Ser Glu Leu Lys Asn Val Leu Asp Val Ile Met Asn Ala 
    450                 455                 460                 


Phe His Trp Cys Ser Val Phe Met Thr Glu Glu Leu Val Asp Lys Asp 
465                 470                 475                 480 


Asn Asn Phe Tyr Ala Glu Leu Glu Glu Ile Tyr Asp Glu Ile Tyr Pro 
                485                 490                 495     


Val Ile Ser Leu Tyr Asn Leu Val Arg Asn Tyr Val Thr Gln Lys Pro 
            500                 505                 510         


Tyr Ser Thr Lys Lys Ile Lys Leu Asn Phe Gly Ile Pro Thr Leu Ala 
        515                 520                 525             


Asp Gly Trp Ser Lys Ser Lys Glu Tyr Ser Asn Asn Ala Ile Ile Leu 
    530                 535                 540                 


Met Arg Asp Asn Leu Tyr Tyr Leu Gly Ile Phe Asn Ala Lys Asn Lys 
545                 550                 555                 560 


Pro Asp Lys Lys Ile Ile Glu Gly Asn Thr Ser Glu Asn Lys Gly Asp 
                565                 570                 575     


Tyr Lys Lys Met Ile Tyr Asn Leu Leu Pro Gly Pro Asn Lys Met Ile 
            580                 585                 590         


Pro Lys Val Phe Leu Ser Ser Lys Thr Gly Val Glu Thr Tyr Lys Pro 
        595                 600                 605             


Ser Ala Tyr Ile Leu Glu Gly Tyr Lys Gln Asn Lys His Ile Lys Ser 
    610                 615                 620                 


Ser Lys Asp Phe Asp Ile Thr Phe Cys His Asp Leu Ile Asp Tyr Phe 
625                 630                 635                 640 


Lys Asn Cys Ile Ala Ile His Pro Glu Trp Lys Asn Phe Gly Phe Asp 
                645                 650                 655     


Phe Ser Asp Thr Ser Thr Tyr Glu Asp Ile Ser Gly Phe Tyr Arg Glu 
            660                 665                 670         


Val Glu Leu Gln Gly Tyr Lys Ile Asp Trp Thr Tyr Ile Ser Glu Lys 
        675                 680                 685             


Asp Ile Asp Leu Leu Gln Glu Lys Gly Gln Leu Tyr Leu Phe Gln Ile 
    690                 695                 700                 


Tyr Asn Lys Asp Phe Ser Lys Lys Ser Thr Gly Asn Asp Asn Leu His 
705                 710                 715                 720 


Thr Met Tyr Leu Lys Asn Leu Phe Ser Glu Glu Asn Leu Lys Asp Ile 
                725                 730                 735     


Val Leu Lys Leu Asn Gly Glu Ala Glu Ile Phe Phe Arg Lys Ser Ser 
            740                 745                 750         


Ile Lys Asn Pro Ile Ile His Lys Lys Gly Ser Ile Leu Val Asn Arg 
        755                 760                 765             


Thr Tyr Glu Ala Glu Glu Lys Asp Gln Phe Gly Asn Ile Gln Ile Val 
    770                 775                 780                 


Arg Lys Asn Ile Pro Glu Asn Ile Tyr Gln Glu Leu Tyr Lys Tyr Phe 
785                 790                 795                 800 


Asn Asp Lys Ser Asp Lys Glu Leu Ser Asp Glu Ala Ala Lys Leu Lys 
                805                 810                 815     


Asn Val Val Gly His His Glu Ala Ala Thr Asn Ile Val Lys Asp Tyr 
            820                 825                 830         


Arg Tyr Thr Tyr Asp Lys Tyr Phe Leu His Met Pro Ile Thr Ile Asn 
        835                 840                 845             


Phe Lys Ala Asn Lys Thr Gly Phe Ile Asn Asp Arg Ile Leu Gln Tyr 
    850                 855                 860                 


Ile Ala Lys Glu Lys Asp Leu His Val Ile Gly Ile Asp Arg Gly Glu 
865                 870                 875                 880 


Arg Asn Leu Ile Tyr Val Ser Val Ile Asp Thr Cys Gly Asn Ile Val 
                885                 890                 895     


Glu Gln Lys Ser Phe Asn Ile Val Asn Gly Tyr Asp Tyr Gln Ile Lys 
            900                 905                 910         


Leu Lys Gln Gln Glu Gly Ala Arg Gln Ile Ala Arg Lys Glu Trp Lys 
        915                 920                 925             


Glu Ile Gly Lys Ile Lys Glu Ile Lys Glu Gly Tyr Leu Ser Leu Val 
    930                 935                 940                 


Ile His Glu Ile Ser Lys Met Val Ile Lys Tyr Asn Ala Ile Ile Ala 
945                 950                 955                 960 


Met Glu Asp Leu Ser Tyr Gly Phe Lys Lys Gly Arg Phe Lys Val Glu 
                965                 970                 975     


Arg Gln Val Tyr Gln Lys Phe Glu Thr Met Leu Ile Asn Lys Leu Asn 
            980                 985                 990         


Tyr Leu Val Phe Lys Asp Ile Ser  Ile Thr Glu Asn Gly  Gly Leu Leu 
        995                 1000                 1005             


Lys Gly  Tyr Gln Leu Thr Tyr  Ile Pro Asp Lys Leu  Lys Asn Val 
    1010                 1015                 1020             


Gly His  Gln Cys Gly Cys Ile  Phe Tyr Val Pro Ala  Ala Tyr Thr 
    1025                 1030                 1035             


Ser Lys  Ile Asp Pro Thr Thr  Gly Phe Val Asn Ile  Phe Lys Phe 
    1040                 1045                 1050             


Lys Asp  Leu Thr Val Asp Ala  Lys Arg Glu Phe Ile  Lys Lys Phe 
    1055                 1060                 1065             


Asp Ser  Ile Arg Tyr Asp Ser  Glu Lys Asn Leu Phe  Cys Phe Thr 
    1070                 1075                 1080             


Phe Asp  Tyr Asn Asn Phe Ile  Thr Gln Asn Thr Val  Met Ser Lys 
    1085                 1090                 1095             


Ser Ser  Trp Ser Val Tyr Thr  Tyr Gly Val Arg Ile  Lys Arg Arg 
    1100                 1105                 1110             


Phe Val  Asn Gly Arg Phe Ser  Asn Glu Ser Asp Thr  Ile Asp Ile 
    1115                 1120                 1125             


Thr Lys  Asp Met Glu Lys Thr  Leu Glu Met Thr Asp  Ile Asn Trp 
    1130                 1135                 1140             


Arg Asp  Gly His Asp Leu Arg  Gln Asp Ile Ile Asp  Tyr Glu Ile 
    1145                 1150                 1155             


Val Gln  His Ile Phe Glu Ile  Phe Arg Leu Thr Val  Gln Met Arg 
    1160                 1165                 1170             


Asn Ser  Leu Ser Glu Leu Glu  Asp Arg Asp Tyr Asp  Arg Leu Ile 
    1175                 1180                 1185             


Ser Pro  Val Leu Asn Glu Asn  Asn Ile Phe Tyr Asp  Ser Ala Lys 
    1190                 1195                 1200             


Ala Gly  Asp Ala Leu Pro Lys  Asp Ala Asp Ala Asn  Gly Ala Tyr 
    1205                 1210                 1215             


Cys Ile  Ala Leu Lys Gly Leu  Tyr Glu Ile Lys Gln  Ile Thr Glu 
    1220                 1225                 1230             


Asn Trp  Lys Glu Asp Gly Lys  Phe Ser Arg Asp Lys  Leu Lys Ile 
    1235                 1240                 1245             


Ser Asn  Lys Asp Trp Phe Asp  Phe Ile Gln Asn Lys  Arg Tyr Leu 
    1250                 1255                 1260             


<210>  7
<211>  1274
<212>  PRT
<213>  artificial sequence

<220>
<223>  MAD7-NLS

<400>  7

Met Asn Asn Gly Thr Asn Asn Phe Gln Asn Phe Ile Gly Ile Ser Ser 
1               5                   10                  15      


Leu Gln Lys Thr Leu Arg Asn Ala Leu Ile Pro Thr Glu Thr Thr Gln 
            20                  25                  30          


Gln Phe Ile Val Lys Asn Gly Ile Ile Lys Glu Asp Glu Leu Arg Gly 
        35                  40                  45              


Glu Asn Arg Gln Ile Leu Lys Asp Ile Met Asp Asp Tyr Tyr Arg Gly 
    50                  55                  60                  


Phe Ile Ser Glu Thr Leu Ser Ser Ile Asp Asp Ile Asp Trp Thr Ser 
65                  70                  75                  80  


Leu Phe Glu Lys Met Glu Ile Gln Leu Lys Asn Gly Asp Asn Lys Asp 
                85                  90                  95      


Thr Leu Ile Lys Glu Gln Thr Glu Tyr Arg Lys Ala Ile His Lys Lys 
            100                 105                 110         


Phe Ala Asn Asp Asp Arg Phe Lys Asn Met Phe Ser Ala Lys Leu Ile 
        115                 120                 125             


Ser Asp Ile Leu Pro Glu Phe Val Ile His Asn Asn Asn Tyr Ser Ala 
    130                 135                 140                 


Ser Glu Lys Glu Glu Lys Thr Gln Val Ile Lys Leu Phe Ser Arg Phe 
145                 150                 155                 160 


Ala Thr Ser Phe Lys Asp Tyr Phe Lys Asn Arg Ala Asn Cys Phe Ser 
                165                 170                 175     


Ala Asp Asp Ile Ser Ser Ser Ser Cys His Arg Ile Val Asn Asp Asn 
            180                 185                 190         


Ala Glu Ile Phe Phe Ser Asn Ala Leu Val Tyr Arg Arg Ile Val Lys 
        195                 200                 205             


Ser Leu Ser Asn Asp Asp Ile Asn Lys Ile Ser Gly Asp Met Lys Asp 
    210                 215                 220                 


Ser Leu Lys Glu Met Ser Leu Glu Glu Ile Tyr Ser Tyr Glu Lys Tyr 
225                 230                 235                 240 


Gly Glu Phe Ile Thr Gln Glu Gly Ile Ser Phe Tyr Asn Asp Ile Cys 
                245                 250                 255     


Gly Lys Val Asn Ser Phe Met Asn Leu Tyr Cys Gln Lys Asn Lys Glu 
            260                 265                 270         


Asn Lys Asn Leu Tyr Lys Leu Gln Lys Leu His Lys Gln Ile Leu Cys 
        275                 280                 285             


Ile Ala Asp Thr Ser Tyr Glu Val Pro Tyr Lys Phe Glu Ser Asp Glu 
    290                 295                 300                 


Glu Val Tyr Gln Ser Val Asn Gly Phe Leu Asp Asn Ile Ser Ser Lys 
305                 310                 315                 320 


His Ile Val Glu Arg Leu Arg Lys Ile Gly Asp Asn Tyr Asn Gly Tyr 
                325                 330                 335     


Asn Leu Asp Lys Ile Tyr Ile Val Ser Lys Phe Tyr Glu Ser Val Ser 
            340                 345                 350         


Gln Lys Thr Tyr Arg Asp Trp Glu Thr Ile Asn Thr Ala Leu Glu Ile 
        355                 360                 365             


His Tyr Asn Asn Ile Leu Pro Gly Asn Gly Lys Ser Lys Ala Asp Lys 
    370                 375                 380                 


Val Lys Lys Ala Val Lys Asn Asp Leu Gln Lys Ser Ile Thr Glu Ile 
385                 390                 395                 400 


Asn Glu Leu Val Ser Asn Tyr Lys Leu Cys Ser Asp Asp Asn Ile Lys 
                405                 410                 415     


Ala Glu Thr Tyr Ile His Glu Ile Ser His Ile Leu Asn Asn Phe Glu 
            420                 425                 430         


Ala Gln Glu Leu Lys Tyr Asn Pro Glu Ile His Leu Val Glu Ser Glu 
        435                 440                 445             


Leu Lys Ala Ser Glu Leu Lys Asn Val Leu Asp Val Ile Met Asn Ala 
    450                 455                 460                 


Phe His Trp Cys Ser Val Phe Met Thr Glu Glu Leu Val Asp Lys Asp 
465                 470                 475                 480 


Asn Asn Phe Tyr Ala Glu Leu Glu Glu Ile Tyr Asp Glu Ile Tyr Pro 
                485                 490                 495     


Val Ile Ser Leu Tyr Asn Leu Val Arg Asn Tyr Val Thr Gln Lys Pro 
            500                 505                 510         


Tyr Ser Thr Lys Lys Ile Lys Leu Asn Phe Gly Ile Pro Thr Leu Ala 
        515                 520                 525             


Asp Gly Trp Ser Lys Ser Lys Glu Tyr Ser Asn Asn Ala Ile Ile Leu 
    530                 535                 540                 


Met Arg Asp Asn Leu Tyr Tyr Leu Gly Ile Phe Asn Ala Lys Asn Lys 
545                 550                 555                 560 


Pro Asp Lys Lys Ile Ile Glu Gly Asn Thr Ser Glu Asn Lys Gly Asp 
                565                 570                 575     


Tyr Lys Lys Met Ile Tyr Asn Leu Leu Pro Gly Pro Asn Lys Met Ile 
            580                 585                 590         


Pro Lys Val Phe Leu Ser Ser Lys Thr Gly Val Glu Thr Tyr Lys Pro 
        595                 600                 605             


Ser Ala Tyr Ile Leu Glu Gly Tyr Lys Gln Asn Lys His Ile Lys Ser 
    610                 615                 620                 


Ser Lys Asp Phe Asp Ile Thr Phe Cys His Asp Leu Ile Asp Tyr Phe 
625                 630                 635                 640 


Lys Asn Cys Ile Ala Ile His Pro Glu Trp Lys Asn Phe Gly Phe Asp 
                645                 650                 655     


Phe Ser Asp Thr Ser Thr Tyr Glu Asp Ile Ser Gly Phe Tyr Arg Glu 
            660                 665                 670         


Val Glu Leu Gln Gly Tyr Lys Ile Asp Trp Thr Tyr Ile Ser Glu Lys 
        675                 680                 685             


Asp Ile Asp Leu Leu Gln Glu Lys Gly Gln Leu Tyr Leu Phe Gln Ile 
    690                 695                 700                 


Tyr Asn Lys Asp Phe Ser Lys Lys Ser Thr Gly Asn Asp Asn Leu His 
705                 710                 715                 720 


Thr Met Tyr Leu Lys Asn Leu Phe Ser Glu Glu Asn Leu Lys Asp Ile 
                725                 730                 735     


Val Leu Lys Leu Asn Gly Glu Ala Glu Ile Phe Phe Arg Lys Ser Ser 
            740                 745                 750         


Ile Lys Asn Pro Ile Ile His Lys Lys Gly Ser Ile Leu Val Asn Arg 
        755                 760                 765             


Thr Tyr Glu Ala Glu Glu Lys Asp Gln Phe Gly Asn Ile Gln Ile Val 
    770                 775                 780                 


Arg Lys Asn Ile Pro Glu Asn Ile Tyr Gln Glu Leu Tyr Lys Tyr Phe 
785                 790                 795                 800 


Asn Asp Lys Ser Asp Lys Glu Leu Ser Asp Glu Ala Ala Lys Leu Lys 
                805                 810                 815     


Asn Val Val Gly His His Glu Ala Ala Thr Asn Ile Val Lys Asp Tyr 
            820                 825                 830         


Arg Tyr Thr Tyr Asp Lys Tyr Phe Leu His Met Pro Ile Thr Ile Asn 
        835                 840                 845             


Phe Lys Ala Asn Lys Thr Gly Phe Ile Asn Asp Arg Ile Leu Gln Tyr 
    850                 855                 860                 


Ile Ala Lys Glu Lys Asp Leu His Val Ile Gly Ile Asp Arg Gly Glu 
865                 870                 875                 880 


Arg Asn Leu Ile Tyr Val Ser Val Ile Asp Thr Cys Gly Asn Ile Val 
                885                 890                 895     


Glu Gln Lys Ser Phe Asn Ile Val Asn Gly Tyr Asp Tyr Gln Ile Lys 
            900                 905                 910         


Leu Lys Gln Gln Glu Gly Ala Arg Gln Ile Ala Arg Lys Glu Trp Lys 
        915                 920                 925             


Glu Ile Gly Lys Ile Lys Glu Ile Lys Glu Gly Tyr Leu Ser Leu Val 
    930                 935                 940                 


Ile His Glu Ile Ser Lys Met Val Ile Lys Tyr Asn Ala Ile Ile Ala 
945                 950                 955                 960 


Met Glu Asp Leu Ser Tyr Gly Phe Lys Lys Gly Arg Phe Lys Val Glu 
                965                 970                 975     


Arg Gln Val Tyr Gln Lys Phe Glu Thr Met Leu Ile Asn Lys Leu Asn 
            980                 985                 990         


Tyr Leu Val Phe Lys Asp Ile Ser  Ile Thr Glu Asn Gly  Gly Leu Leu 
        995                 1000                 1005             


Lys Gly  Tyr Gln Leu Thr Tyr  Ile Pro Asp Lys Leu  Lys Asn Val 
    1010                 1015                 1020             


Gly His  Gln Cys Gly Cys Ile  Phe Tyr Val Pro Ala  Ala Tyr Thr 
    1025                 1030                 1035             


Ser Lys  Ile Asp Pro Thr Thr  Gly Phe Val Asn Ile  Phe Lys Phe 
    1040                 1045                 1050             


Lys Asp  Leu Thr Val Asp Ala  Lys Arg Glu Phe Ile  Lys Lys Phe 
    1055                 1060                 1065             


Asp Ser  Ile Arg Tyr Asp Ser  Glu Lys Asn Leu Phe  Cys Phe Thr 
    1070                 1075                 1080             


Phe Asp  Tyr Asn Asn Phe Ile  Thr Gln Asn Thr Val  Met Ser Lys 
    1085                 1090                 1095             


Ser Ser  Trp Ser Val Tyr Thr  Tyr Gly Val Arg Ile  Lys Arg Arg 
    1100                 1105                 1110             


Phe Val  Asn Gly Arg Phe Ser  Asn Glu Ser Asp Thr  Ile Asp Ile 
    1115                 1120                 1125             


Thr Lys  Asp Met Glu Lys Thr  Leu Glu Met Thr Asp  Ile Asn Trp 
    1130                 1135                 1140             


Arg Asp  Gly His Asp Leu Arg  Gln Asp Ile Ile Asp  Tyr Glu Ile 
    1145                 1150                 1155             


Val Gln  His Ile Phe Glu Ile  Phe Arg Leu Thr Val  Gln Met Arg 
    1160                 1165                 1170             


Asn Ser  Leu Ser Glu Leu Glu  Asp Arg Asp Tyr Asp  Arg Leu Ile 
    1175                 1180                 1185             


Ser Pro  Val Leu Asn Glu Asn  Asn Ile Phe Tyr Asp  Ser Ala Lys 
    1190                 1195                 1200             


Ala Gly  Asp Ala Leu Pro Lys  Asp Ala Asp Ala Asn  Gly Ala Tyr 
    1205                 1210                 1215             


Cys Ile  Ala Leu Lys Gly Leu  Tyr Glu Ile Lys Gln  Ile Thr Glu 
    1220                 1225                 1230             


Asn Trp  Lys Glu Asp Gly Lys  Phe Ser Arg Asp Lys  Leu Lys Ile 
    1235                 1240                 1245             


Ser Asn  Lys Asp Trp Phe Asp  Phe Ile Gln Asn Lys  Arg Tyr Leu 
    1250                 1255                 1260             


Ser Gly  Gly Ser Pro Lys Lys  Lys Arg Lys Val 
    1265                 1270                 


<210>  8
<211>  1050
<212>  DNA
<213>  artificial sequence

<220>
<223>  Lambda genome

<400>  8
aatcacgctg atttacagcg gcagccataa ggtggatggc aacccctaca gccatcttcc       60

ggatgacgtc cgggagacac tgcagtcccg gatggacgca acccgccaga tgtttgcgca      120

gaaggtgtcg gcatataccg gcctgtccgt gcaggttgtg ctggataccg aggctgcagt      180

gtacagcggt caggaggcca ttgatgccgg actggctgat gaacttgtta acagcaccga      240

tgcgatcacc gtcatgcgtg atgcactgga tgcacgtaaa tcccgtctct caggagggcg      300

aatgaccaaa gagactcaat caacaactgt ttcagccact gcttcgcagg ctgacgttac      360

tgacgtggtg ccagcgacgg agggcgagaa cgccagcgcg gcgcagccgg acgtgaacgc      420

gcagatcacc gcagcggttg cggcagaaaa cagccgcatt atggggatcc tcaactgtga      480

ggaggctcac ggacgcgaag aacaggcacg cgtgctggca gaaacccccg gtatgaccgt      540

gaaaacggcc cgccgcattc tggccgcagc accacagagt gcacaggcgc gcagtgacac      600

tgcgctggat cgtctgatgc agggggcacc ggcaccgctg gctgcaggta acccggcatc      660

tgatgccgtt aacgatttgc tgaacacacc agtgtaaggg atgtttatga cgagcaaaga      720

aacctttacc cattaccagc cgcagggcaa cagtgacccg gctcataccg caaccgcgcc      780

cggcggattg agtgcgaaag cgcctgcaat gaccccgctg atgctggaca cctccagccg      840

taagctggtt gcgtgggatg gcaccaccga cggtgctgcc gttggcattc ttgcggttgc      900

tgctgaccag accagcacca cgctgacgtt ctacaagtcc ggcacgttcc gttatgagga      960

tgtgctctgg ccggaggctg ccagcgacga gacgaaaaaa cggaccgcgt ttgccggaac     1020

ggcaatcagc atcgtttaac tttacccttc                                      1050


<210>  9
<211>  55
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. ONT barcode primer B2

<400>  9
aaggttacag acgactacaa acggaatcga acagcacctg acgatgagtc ctgag            55


<210>  10
<211>  34
<212>  RNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Adapter RevsgRNA

<400>  10
gacgaugagu ccugaguccg gaugacgucc ggga                                   34


<210>  11
<211>  17
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. RevsgRNA

<400>  11
ctgcaggccc tctgtga                                                      17


<210>  12
<211>  17
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. sgRNA 3

<400>  12
gctcataccg caaccgc                                                      17


<210>  13
<211>  35
<212>  RNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Adapter sgRNA 3

<400>  13
uggcguuggc gcgggccgcc augcgucaga ugcuc                                  35


<210>  14
<211>  56
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. ONT Barcode primer B1

<400>  14
ccatgcgtca gatgctctcc acgacattct ttcaacagcc acagaaacac ttggaa           56


<210>  15
<211>  16
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Primer

<400>  15
gacgatgagt cctgag                                                       16


<210>  16
<211>  18
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Primer

<400>  16
gtcgctggca ccacgtca                                                     18


<210>  17
<211>  19
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Primer

<400>  17
cactgcgctg gatcgtctg                                                    19


<210>  18
<211>  17
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Primer

<400>  18
ctcgtagact gcgtacc                                                      17


<210>  19
<211>  21
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Primer

<400>  19
acgactacaa acggaatcga a                                                 21


<210>  20
<211>  18
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Primer

<400>  20
gtcgctggca ccacgtca                                                     18


<210>  21
<211>  19
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Primer

<400>  21
cactgcgctg gatcgtctg                                                    19


<210>  22
<211>  22
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Primer

<400>  22
cacaaagaca ccgacaactt tc                                                22


<210>  23
<211>  21
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Primer

<400>  23
acgactacaa acggaatcga a                                                 21


<210>  24
<211>  22
<212>  DNA
<213>  artificial sequence

<220>
<223>  Fig. 2. Primer

<400>  24
cacaaagaca ccgacaactt tc                                                22


<210>  25
<211>  21
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  25
tcacgctgat ttacagcggc a                                                 21


<210>  26
<211>  20
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  26
ggcttgccgt tagtcgtagc                                                   20


<210>  27
<211>  103
<212>  RNA
<213>  artificial sequence

<220>
<223>  RevsgRNA

<400>  27
agugucuccc ggacgucauc guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc       60

cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu uuu                        103


<210>  28
<211>  103
<212>  RNA
<213>  artificial sequence

<220>
<223>  sgRNA3

<400>  28
gcucauaccg caaccgcgcc guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc       60

cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu uuu                        103


<210>  29
<211>  34
<212>  RNA
<213>  artificial sequence

<220>
<223>  RevsgRNA-RNA-Ad

<400>  29
gacgaugagu ccugaguccg gaugacgucc ggga                                   34


<210>  30
<211>  35
<212>  RNA
<213>  artificial sequence

<220>
<223>  sgRNA3-RNA-Ad

<400>  30
cucguagacu gcguaccgcc gggcgcgguu gcggu                                  35


<210>  31
<211>  55
<212>  DNA
<213>  artificial sequence

<220>
<223>  RevsgRNA-BC2

<400>  31
aaggttacag acgactacaa acggaatcga acagcacctg acgatgagtc ctgag            55


<210>  32
<211>  56
<212>  DNA
<213>  artificial sequence

<220>
<223>  sgRNA3-BC1

<400>  32
aaggttcaca aagacaccga caactttctt acagcacctc tcgtagactg cgtacc           56


<210>  33
<211>  16
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  33
gacgatgagt cctgag                                                       16


<210>  34
<211>  17
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  34
ctcgtagact gcgtacc                                                      17


<210>  35
<211>  21
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  35
acgactacaa acggaatcga a                                                 21


<210>  36
<211>  22
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer

<400>  36
cacaaagaca ccgacaactt tc                                                22


<210>  37
<211>  22
<212>  DNA
<213>  artificial sequence

<220>
<223>  protelomerase recognition sequence


<220>
<221>  misc_feature
<223>  n is a, c, g or t

<220>
<221>  misc_feature
<222>  (1)..(1)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (5)..(6)
<223>  n is a, c, g or t

<220>
<221>  misc_feature
<222>  (9)..(10)
<223>  n is a, c, g or t

<220>
<221>  misc_feature
<222>  (13)..(14)
<223>  n is a, c, g or t

<220>
<221>  misc_feature
<222>  (17)..(18)
<223>  n is a, c, g or t

<220>
<221>  misc_feature
<222>  (22)..(22)
<223>  n is a, c, g or t

<400>  37
ncatnntann cgnntannat gn                                                22


<210>  38
<211>  56
<212>  DNA
<213>  artificial sequence

<220>
<223>  protelomerase recognition sequence

<400>  38
tatcagcaca caattgccca ttatacgcgc gtataatgga ctattgtgtg ctgata           56


<210>  39
<211>  631
<212>  PRT
<213>  artificial sequence

<220>
<223>  protelomerase

<400>  39

Met Ser Lys Val Lys Ile Gly Glu Leu Ile Asn Thr Leu Val Asn Glu 
1               5                   10                  15      


Val Glu Ala Ile Asp Ala Ser Asp Arg Pro Gln Gly Asp Lys Thr Lys 
            20                  25                  30          


Arg Ile Lys Ala Ala Ala Ala Arg Tyr Lys Asn Ala Leu Phe Asn Asp 
        35                  40                  45              


Lys Arg Lys Phe Arg Gly Lys Gly Leu Gln Lys Arg Ile Thr Ala Asn 
    50                  55                  60                  


Thr Phe Asn Ala Tyr Met Ser Arg Ala Arg Lys Arg Phe Asp Asp Lys 
65                  70                  75                  80  


Leu His His Ser Phe Asp Lys Asn Ile Asn Lys Leu Ser Glu Lys Tyr 
                85                  90                  95      


Pro Leu Tyr Ser Glu Glu Leu Ser Ser Trp Leu Ser Met Pro Thr Ala 
            100                 105                 110         


Asn Ile Arg Gln His Met Ser Ser Leu Gln Ser Lys Leu Lys Glu Ile 
        115                 120                 125             


Met Pro Leu Ala Glu Glu Leu Ser Asn Val Arg Ile Gly Ser Lys Gly 
    130                 135                 140                 


Ser Asp Ala Lys Ile Ala Arg Leu Ile Lys Lys Tyr Pro Asp Trp Ser 
145                 150                 155                 160 


Phe Ala Leu Ser Asp Leu Asn Ser Asp Asp Trp Lys Glu Arg Arg Asp 
                165                 170                 175     


Tyr Leu Tyr Lys Leu Phe Gln Gln Gly Ser Ala Leu Leu Glu Glu Leu 
            180                 185                 190         


His Gln Leu Lys Val Asn His Glu Val Leu Tyr His Leu Gln Leu Ser 
        195                 200                 205             


Pro Ala Glu Arg Thr Ser Ile Gln Gln Arg Trp Ala Asp Val Leu Arg 
    210                 215                 220                 


Glu Lys Lys Arg Asn Val Val Val Ile Asp Tyr Pro Thr Tyr Met Gln 
225                 230                 235                 240 


Ser Ile Tyr Asp Ile Leu Asn Asn Pro Ala Thr Leu Phe Ser Leu Asn 
                245                 250                 255     


Thr Arg Ser Gly Met Ala Pro Leu Ala Phe Ala Leu Ala Ala Val Ser 
            260                 265                 270         


Gly Arg Arg Met Ile Glu Ile Met Phe Gln Gly Glu Phe Ala Val Ser 
        275                 280                 285             


Gly Lys Tyr Thr Val Asn Phe Ser Gly Gln Ala Lys Lys Arg Ser Glu 
    290                 295                 300                 


Asp Lys Ser Val Thr Arg Thr Ile Tyr Thr Leu Cys Glu Ala Lys Leu 
305                 310                 315                 320 


Phe Val Glu Leu Leu Thr Glu Leu Arg Ser Cys Ser Ala Ala Ser Asp 
                325                 330                 335     


Phe Asp Glu Val Val Lys Gly Tyr Gly Lys Asp Asp Thr Arg Ser Glu 
            340                 345                 350         


Asn Gly Arg Ile Asn Ala Ile Leu Ala Lys Ala Phe Asn Pro Trp Val 
        355                 360                 365             


Lys Ser Phe Phe Gly Asp Asp Arg Arg Val Tyr Lys Asp Ser Arg Ala 
    370                 375                 380                 


Ile Tyr Ala Arg Ile Ala Tyr Glu Met Phe Phe Arg Val Asp Pro Arg 
385                 390                 395                 400 


Trp Lys Asn Val Asp Glu Asp Val Phe Phe Met Glu Ile Leu Gly His 
                405                 410                 415     


Asp Asp Glu Asn Thr Gln Leu His Tyr Lys Gln Phe Lys Leu Ala Asn 
            420                 425                 430         


Phe Ser Arg Thr Trp Arg Pro Glu Val Gly Asp Glu Asn Thr Arg Leu 
        435                 440                 445             


Val Ala Leu Gln Lys Leu Asp Asp Glu Met Pro Gly Phe Ala Arg Gly 
    450                 455                 460                 


Asp Ala Gly Val Arg Leu His Glu Thr Val Lys Gln Leu Val Glu Gln 
465                 470                 475                 480 


Asp Pro Ser Ala Lys Ile Thr Asn Ser Thr Leu Arg Ala Phe Lys Phe 
                485                 490                 495     


Ser Pro Thr Met Ile Ser Arg Tyr Leu Glu Phe Ala Ala Asp Ala Leu 
            500                 505                 510         


Gly Gln Phe Val Gly Glu Asn Gly Gln Trp Gln Leu Lys Ile Glu Thr 
        515                 520                 525             


Pro Ala Ile Val Leu Pro Asp Glu Glu Ser Val Glu Thr Ile Asp Glu 
    530                 535                 540                 


Pro Asp Asp Glu Ser Gln Asp Asp Glu Leu Asp Glu Asp Glu Ile Glu 
545                 550                 555                 560 


Leu Asp Glu Gly Gly Gly Asp Glu Pro Thr Glu Glu Glu Gly Pro Glu 
                565                 570                 575     


Glu His Gln Pro Thr Ala Leu Lys Pro Val Phe Lys Pro Ala Lys Asn 
            580                 585                 590         


Asn Gly Asp Gly Thr Tyr Lys Ile Glu Phe Glu Tyr Asp Gly Lys His 
        595                 600                 605             


Tyr Ala Trp Ser Gly Pro Ala Asp Ser Pro Met Ala Ala Met Arg Ser 
    610                 615                 620                 


Ala Trp Glu Thr Tyr Tyr Ser 
625                 630     


