                         SEQUENCE LISTING

<110>  The Regents of the University of California
       McManus, Michael T.
 
<120>  RECONSTRUCTION OF ANCESTRAL CELLS BY ENZYMATIC RECORDING

<130>  081906-0957718

<140>  PCT/US2015/049375
<141>  2015-09-10

<150>  US 62/048,695
<151>  2014-09-10

<160>  39    

<170>  PatentIn version 3.5

<210>  1
<211>  1417
<212>  PRT
<213>  Unknown

<220>
<223>  Cas9 protein

<400>  1

Met Asp Tyr Lys Asp Asp Asp Asp Lys Asp Tyr Lys Asp Asp Asp Asp 
1               5                   10                  15      


Lys Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro 
            20                  25                  30          


Ala Ala Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser 
        35                  40                  45              


Val Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys 
    50                  55                  60                  


Phe Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu 
65                  70                  75                  80  


Ile Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg 
                85                  90                  95      


Leu Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile 
            100                 105                 110         


Cys Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp 
        115                 120                 125             


Ser Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys 
    130                 135                 140                 


Lys His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala 
145                 150                 155                 160 


Tyr His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val 
                165                 170                 175     


Asp Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala 
            180                 185                 190         


His Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn 
        195                 200                 205             


Pro Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr 
    210                 215                 220                 


Tyr Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp 
225                 230                 235                 240 


Ala Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu 
                245                 250                 255     


Asn Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly 
            260                 265                 270         


Asn Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn 
        275                 280                 285             


Phe Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr 
    290                 295                 300                 


Asp Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala 
305                 310                 315                 320 


Asp Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser 
                325                 330                 335     


Asp Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala 
            340                 345                 350         


Ser Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu 
        355                 360                 365             


Lys Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe 
    370                 375                 380                 


Phe Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala 
385                 390                 395                 400 


Ser Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met 
                405                 410                 415     


Asp Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu 
            420                 425                 430         


Arg Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His 
        435                 440                 445             


Leu Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro 
    450                 455                 460                 


Phe Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg 
465                 470                 475                 480 


Ile Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala 
                485                 490                 495     


Trp Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu 
            500                 505                 510         


Glu Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met 
        515                 520                 525             


Thr Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His 
    530                 535                 540                 


Ser Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val 
545                 550                 555                 560 


Lys Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu 
                565                 570                 575     


Gln Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val 
            580                 585                 590         


Thr Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe 
        595                 600                 605             


Asp Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu 
    610                 615                 620                 


Gly Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu 
625                 630                 635                 640 


Asp Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu 
                645                 650                 655     


Thr Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr 
            660                 665                 670         


Ala His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg 
        675                 680                 685             


Tyr Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg 
    690                 695                 700                 


Asp Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly 
705                 710                 715                 720 


Phe Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr 
                725                 730                 735     


Phe Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser 
            740                 745                 750         


Leu His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys 
        755                 760                 765             


Gly Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met 
    770                 775                 780                 


Gly Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn 
785                 790                 795                 800 


Gln Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg 
                805                 810                 815     


Ile Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His 
            820                 825                 830         


Pro Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr 
        835                 840                 845             


Leu Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn 
    850                 855                 860                 


Arg Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu 
865                 870                 875                 880 


Lys Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn 
                885                 890                 895     


Arg Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met 
            900                 905                 910         


Lys Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg 
        915                 920                 925             


Lys Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu 
    930                 935                 940                 


Asp Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile 
945                 950                 955                 960 


Thr Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr 
                965                 970                 975     


Asp Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys 
            980                 985                 990         


Ser Lys Leu Val Ser Asp Phe Arg  Lys Asp Phe Gln Phe  Tyr Lys Val 
        995                 1000                 1005             


Arg Glu  Ile Asn Asn Tyr His  His Ala His Asp Ala  Tyr Leu Asn 
    1010                 1015                 1020             


Ala Val  Val Gly Thr Ala Leu  Ile Lys Lys Tyr Pro  Lys Leu Glu 
    1025                 1030                 1035             


Ser Glu  Phe Val Tyr Gly Asp  Tyr Lys Val Tyr Asp  Val Arg Lys 
    1040                 1045                 1050             


Met Ile  Ala Lys Ser Glu Gln  Glu Ile Gly Lys Ala  Thr Ala Lys 
    1055                 1060                 1065             


Tyr Phe  Phe Tyr Ser Asn Ile  Met Asn Phe Phe Lys  Thr Glu Ile 
    1070                 1075                 1080             


Thr Leu  Ala Asn Gly Glu Ile  Arg Lys Arg Pro Leu  Ile Glu Thr 
    1085                 1090                 1095             


Asn Gly  Glu Thr Gly Glu Ile  Val Trp Asp Lys Gly  Arg Asp Phe 
    1100                 1105                 1110             


Ala Thr  Val Arg Lys Val Leu  Ser Met Pro Gln Val  Asn Ile Val 
    1115                 1120                 1125             


Lys Lys  Thr Glu Val Gln Thr  Gly Gly Phe Ser Lys  Glu Ser Ile 
    1130                 1135                 1140             


Leu Pro  Lys Arg Asn Ser Asp  Lys Leu Ile Ala Arg  Lys Lys Asp 
    1145                 1150                 1155             


Trp Asp  Pro Lys Lys Tyr Gly  Gly Phe Asp Ser Pro  Thr Val Ala 
    1160                 1165                 1170             


Tyr Ser  Val Leu Val Val Ala  Lys Val Glu Lys Gly  Lys Ser Lys 
    1175                 1180                 1185             


Lys Leu  Lys Ser Val Lys Glu  Leu Leu Gly Ile Thr  Ile Met Glu 
    1190                 1195                 1200             


Arg Ser  Ser Phe Glu Lys Asn  Pro Ile Asp Phe Leu  Glu Ala Lys 
    1205                 1210                 1215             


Gly Tyr  Lys Glu Val Lys Lys  Asp Leu Ile Ile Lys  Leu Pro Lys 
    1220                 1225                 1230             


Tyr Ser  Leu Phe Glu Leu Glu  Asn Gly Arg Lys Arg  Met Leu Ala 
    1235                 1240                 1245             


Ser Ala  Gly Glu Leu Gln Lys  Gly Asn Glu Leu Ala  Leu Pro Ser 
    1250                 1255                 1260             


Lys Tyr  Val Asn Phe Leu Tyr  Leu Ala Ser His Tyr  Glu Lys Leu 
    1265                 1270                 1275             


Lys Gly  Ser Pro Glu Asp Asn  Glu Gln Lys Gln Leu  Phe Val Glu 
    1280                 1285                 1290             


Gln His  Lys His Tyr Leu Asp  Glu Ile Ile Glu Gln  Ile Ser Glu 
    1295                 1300                 1305             


Phe Ser  Lys Arg Val Ile Leu  Ala Asp Ala Asn Leu  Asp Lys Val 
    1310                 1315                 1320             


Leu Ser  Ala Tyr Asn Lys His  Arg Asp Lys Pro Ile  Arg Glu Gln 
    1325                 1330                 1335             


Ala Glu  Asn Ile Ile His Leu  Phe Thr Leu Thr Asn  Leu Gly Ala 
    1340                 1345                 1350             


Pro Ala  Ala Phe Lys Tyr Phe  Asp Thr Thr Ile Asp  Arg Lys Arg 
    1355                 1360                 1365             


Tyr Thr  Ser Thr Lys Glu Val  Leu Asp Ala Thr Leu  Ile His Gln 
    1370                 1375                 1380             


Ser Ile  Thr Gly Leu Tyr Glu  Thr Arg Ile Asp Leu  Ser Gln Leu 
    1385                 1390                 1395             


Gly Gly  Asp Lys Arg Pro Ala  Ala Thr Lys Lys Ala  Gly Gln Ala 
    1400                 1405                 1410             


Lys Lys  Lys Lys 
    1415         


<210>  2
<211>  82
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic WT guide RNA sequence

<400>  2
gttttagagc tagaaatagc aagttaaaat aaggctagtc cgttatcaac ttgaaaaagt       60

ggcaccgagt cggtgctttt tt                                                82


<210>  3
<211>  25100
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic GST-TAL-FokI-linker-FokI

<400>  3
gcttaagcgg tcgacggatc gggagatctc ccgatcccct atggtgcact ctcagtacaa       60

tctgctctga tgccgcatag ttaagccagt atctgctccc tgcttgtgtg ttggaggtcg      120

ctgagtagtg cgcgagcaaa atttaagcta caacaaggca aggcttgacc gacaattgca      180

tgaagaatct gcttagggtt aggcgttttg cgctgcttcg cgatgtacgg gccagatata      240

cgcgttgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattagttc      300

atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac      360

cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa      420

tagggacttt ccattgacgt caatgggtgg agtatttacg gtaaactgcc cacttggcag      480

tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac ggtaaatggc      540

ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct      600

acgtattagt catcgctatt accatggtga tgcggttttg gcagtacatc aatgggcgtg      660

gatagcggtt tgactcacgg ggatttccaa gtctccaccc cattgacgtc aatgggagtt      720

tgttttggca ccaaaatcaa cgggactttc caaaatgtcg taacaactcc gccccattga      780

cgcaaatggg cggtaggcgt gtacggtggg aggtctatat aagcagcgcg ttttgcctgt      840

actgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta actagggaac      900

ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg      960

ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg gaaaatctct     1020

agcagtggcg cccgaacagg gacttgaaag cgaaagggaa accagaggag ctctctcgac     1080

gcaggactcg gcttgctgaa gcgcgcacgg caagaggcga ggggcggcga ctggtgagta     1140

cgccaaaaat tttgactagc ggaggctaga aggagagaga tgggtgcgag agcgtcagta     1200

ttaagcgggg gagaattaga tcgcgatggg aaaaaattcg gttaaggcca gggggaaaga     1260

aaaaatataa attaaaacat atagtatggg caagcaggga gctagaacga ttcgcagtta     1320

atcctggcct gttagaaaca tcagaaggct gtagacaaat actgggacag ctacaaccat     1380

cccttcagac aggatcagaa gaacttagat cattatataa tacagtagca accctctatt     1440

gtgtgcatca aaggatagag ataaaagaca ccaaggaagc tttagacaag atagaggaag     1500

agcaaaacaa aagtaagacc accgcacagc aagcggccgg ccgcgctgat cttcagacct     1560

ggaggaggag atatgaggga caattggaga agtgaattat ataaatataa agtagtaaaa     1620

attgaaccat taggagtagc acccaccaag gcaaagagaa gagtggtgca gagagaaaaa     1680

agagcagtgg gaataggagc tttgttcctt gggttcttgg gagcagcagg aagcactatg     1740

ggcgcagcgt caatgacgct gacggtacag gccagacaat tattgtctgg tatagtgcag     1800

cagcagaaca atttgctgag ggctattgag gcgcaacagc atctgttgca actcacagtc     1860

tggggcatca agcagctcca ggcaagaatc ctggctgtgg aaagatacct aaaggatcaa     1920

cagctcctgg ggatttgggg ttgctctgga aaactcattt gcaccactgc tgtgccttgg     1980

aatgctagtt ggagtaataa atctctggaa cagatttgga atcacacgac ctggatggag     2040

tgggacagag aaattaacaa ttacacaagc ttaatacact ccttaattga agaatcgcaa     2100

aaccagcaag aaaagaatga acaagaatta ttggaattag ataaatgggc aagtttgtgg     2160

aattggttta acataacaaa ttggctgtgg tatataaaat tattcataat gatagtagga     2220

ggcttggtag gtttaagaat agtttttgct gtactttcta tagtgaatag agttaggcag     2280

ggatattcac cattatcgtt tcagacccac ctcccaaccc cgaggggacc cgacaggccc     2340

gaaggaatag aagaagaagg tggagagaga gacagagaca gatccattcg attagtgaac     2400

ggatcggcac tgcgtgcgcc aattctgcag acaaatggca gtattcatcc acaattttaa     2460

aagaaaaggg gggattgggg ggtacagtgc aggggaaaga atagtagaca taatagcaac     2520

agacatacaa actaaagaat tacaaaaaca aattacaaaa attcaaaatt ttcgggttta     2580

ttacagggac agcagagatc cagtttggtt agtaccgggc cctagagatc acgagactag     2640

cctcgagaga tctgatcata atcagccata ccacatttgt agaggtttta cttgctttaa     2700

aaaacctccc acacctcccc ctgaacctga aacataaaat gaatgcaatt gttgttgtta     2760

acttgtttat tgcagcttat aatggttaca aataaggcaa tagcatcaca aatttcacaa     2820

ataaggcatt tttttcactg cattctagtt ttggtttgtc caaactcatc aatgtatctt     2880

atcatgtctg gatctcaaat ccctcggaag ctgcgcctgt catcgaattc ctgcagcccg     2940

gtgcatgact aagctagtac cggttaggat gcatgctagc tcagttagcc tcccccatct     3000

ctcgacgcgg ccgctttaca tggtgagcaa gggcgaggag ctgttcaccg gggtggtgcc     3060

catcctggtc gagctggacg gcgacgtaaa cggccacaag ttcagcgtgt ccggcgaggg     3120

cgagggcgat gccacctacg gcaagctgac cctgaagttc atctgcacca ccggcaagct     3180

gcccgtgccc tggcccaccc tcgtgaccac cctgacctac ggcgtgcagt gcttcagccg     3240

ctaccccgac cacatgaagc agcacgactt cttcaagtcc gccatgcccg aaggctacgt     3300

ccaggagcgc accatcttct tcaaggacga cggcaactac aagacccgcg ccgaggtgaa     3360

gttcgagggc gacaccctgg tgaaccgcat cgagctgaag ggcatcgact tcaaggagga     3420

cggcaacatc ctggggcaca agctggagta caactacaac agccacaacg tctatatcat     3480

ggccgacaag cagaagaacg gcatcaaggt gaacttcaag atccgccaca acatcgagga     3540

cggcagcgtg cagctcgccg accactacca gcagaacacc cccatcggcg acggccccgt     3600

gctgctgccc gacaaccact acctgagcac ccagtccgcc ctgagcaaag accccaacga     3660

gaagcgcgat cacatggtcc tgctggagtt cgtgaccgcc gccgggatca ctctcggcat     3720

ggacgagctg tacaaggtgg ctcgagcgga ggctggatcg gtcccggtgt cttctatgga     3780

ggtcaaaaca gcgtggatgg cgtctccagg cgatctgacg gttcactaaa cgagctctgc     3840

ttatataggc ctcccaccgt acacgcctac cctcgagaag cttgatatca ctagagctct     3900

agtgtgcccg tcagtgggca gagcgcacat cgcccacagt ccccgagaag ttggggggag     3960

gggtcggcaa ttgaaccggt gcctagagaa ggtggcgcgg ggtaaactgg gaaagtgatg     4020

tcgtgtactg gctccgcctt tttcccgagg gtgggggaga accgtatata agtgcagtag     4080

tcgccgtgaa cgttcttttt cgcaacgggt ttgccgccag aacagtgagc tagcgctacc     4140

ggtcgccacc cctaggatgt cccctatact aggttattgg aaaattaagg gccttgtgca     4200

acccactcga cttcttttgg aatatcttga agaaaaatat gaagagcatt tgtatgagcg     4260

cgatgaaggt gataaatggc gaaacaaaaa gtttgaattg ggtttggagt ttcccaatct     4320

tccttattat attgatggtg atgttaaatt aacacagtct atggccatca tacgttatat     4380

agctgacaag cacaacatgt tgggtggttg tccaaaagag cgtgcagaga tttcaatgct     4440

tgaaggagcg gttttggata ttagatacgg tgtttcgaga attgcatata gtaaagactt     4500

tgaaactctc aaagttgatt ttcttagcaa gctacctgaa atgctgaaaa tgttcgaaga     4560

tcgtttatgt cataaaacat atttaaatgg tgatcatgta acccatcctg acttcatgtt     4620

gtatgacgct cttgatgttg ttttatacat ggacccaatg tgcctggatg cgttcccaaa     4680

attagtttgt tttaaaaaac gtattgaagc tatcccacaa attgataagt acttgaaatc     4740

cagcaagtat atagcatggc ctttgcaggg ctggcaagcc acgtttggtg gtggcgacca     4800

tcctccaaaa tcggatctgg ttccgcgtgg atccggcggt agtttaaaca tggcttcctc     4860

ccctccaaag aaaaagagaa aggttagttg gaaggacgca agtggttggt ctagagtgga     4920

tctacgcacg ctcggctaca gtcagcagca gcaagagaag atcaaaccga aggtgcgttc     4980

gacagtggcg cagcaccacg aggcactggt gggccatggg tttacacacg cgcacatcgt     5040

tgcgctcagc caacacccgg cagcgttagg gaccgtcgct gtcacgtatc agcacataat     5100

cacggcgttg ccagaggcga cacacgaaga catcgttggc gtcggcaaac agtggtccgg     5160

cgcacgcgcc ctggaggcct tgctcacgga tgcgggggag ttgagaggtc cgccgttaca     5220

gttggacaca ggccaacttg tgaagattgc aaaacgtggc ggcgtgaccg caatggaggc     5280

agtgcatgca tcgcgcaatg cactgacggg tgcccccctg aacctgaccc cggaccaagt     5340

ggtggctatc gccagcaaca atggcggcaa gcaagcgctc gaaacggtgc agcggctgtt     5400

gccggtgctg tgccaggacc atggcctgac cccggaccaa gtggtggcta tcgccagcaa     5460

cggtggcggc aagcaagcgc tcgaaacggt gcagcggctg ttgccggtgc tgtgccagga     5520

ccatggcctg accccggacc aagtggtggc tatcgccagc aacaatggcg gcaagcaagc     5580

gctcgaaacg gtgcagcggc tgttgccggt gctgtgccag gaccatggcc tgaccccgga     5640

ccaagtggtg gctatcgcca gcaacattgg cggcaagcaa gcgctcgaaa cggtgcagcg     5700

gctgttgccg gtgctgtgcc aggaccatgg cctgaccccg gaccaagtgg tggctatcgc     5760

cagcaacaat ggcggcaagc aagcgctcga aacggtgcag cggctgttgc cggtgctgtg     5820

ccaggaccat ggcctgactc cggaccaagt ggtggctatc gccagccacg atggcggcaa     5880

gcaagcgctc gaaacggtgc agcggctgtt gccggtgctg tgccaggacc atggcctgac     5940

cccggaccaa gtggtggcta tcgccagcaa cattggcggc aagcaagcgc tcgaaacggt     6000

gcagcggctg ttgccggtgc tgtgccagga ccatggcctg actccggacc aagtggtggc     6060

tatcgccagc cacgatggcg gcaagcaagc gctcgaaacg gtgcagcggc tgttgccggt     6120

gctgtgccag gaccatggcc tgactccgga ccaagtggtg gctatcgcca gccacgatgg     6180

cggcaagcaa gcgctcgaaa cggtgcagcg gctgttgccg gtgctgtgcc aggaccatgg     6240

cctgactccg gaccaagtgg tggctatcgc cagccacgat ggcggcaagc aagcgctcga     6300

aacggtgcag cggctgttgc cggtgctgtg ccaggaccat ggcctgaccc cggaccaagt     6360

ggtggctatc gccagcaaca ttggcggcaa gcaagcgctc gaaacggtgc agcggctgtt     6420

gccggtgctg tgccaggacc atggcctgac cccggaccaa gtggtggcta tcgccagcaa     6480

caatggcggc aagcaagcgc tcgaaacggt gcagcggctg ttgccggtgc tgtgccagga     6540

ccatggcctg actccggacc aagtggtggc tatcgccagc cacgatggcg gcaagcaagc     6600

gctcgaaacg gtgcagcggc tgttgccggt gctgtgccag gaccatggcc tgaccccgga     6660

ccaagtggtg gctatcgcca gcaacaatgg cggcaagcaa gcgctcgaaa cggtgcagcg     6720

gctgttgccg gtgctgtgcc aggaccatgg cctgaccccg gaccaagtgg tggctatcgc     6780

cagcaacaat ggcggcaagc aagcgctcga aacggtgcag cggctgttgc cggtgctgtg     6840

ccaggaccat ggcctgaccc cggaccaagt ggtggctatc gccagcaaca ttggcggcaa     6900

gcaagcgctc gaaacggtgc agcggctgtt gccggtgctg tgccaggacc atggcctgac     6960

tccggaccaa gtggtggcta tcgccagcca cgatggcggc aagcaagcgc tcgaaacggt     7020

gcagcggctg ttgccggtgc tgtgccagga ccatggcctg actccggacc aagtggtggc     7080

tatcgccagc cacgatggcg gcaagcaagc gctcgaaacg gtgcagcggc tgttgccggt     7140

gctgtgccag gaccatggcc tgaccccgga ccaagtggtg gctatcgcca gcaacggtgg     7200

cggcaagcaa gcgctcgaaa cggtgcagcg gctgttgccg gtgctgtgcc aggaccatgg     7260

cctgactccg gaccaagtgg tggctatcgc cagccacgat ggcggcaagc aagcgctcga     7320

aacggtgcag cggctgttgc cggtgctgtg ccaggaccat ggcctgaccc cggaccaagt     7380

ggtggctatc gccagccacg atggcggcaa gcaagcgctc gaaacggtgc agcggctgtt     7440

gccggtgctg tgccaggacc atggcctgac cccggaccaa gtggtggcta tcgccagcaa     7500

cggtggcggc aagcaagcgc tcgaaacggt gcagcggctg ttgccggtgc tgtgccagga     7560

ccatggcctg actccggacc aagtggtggc tatcgccagc cacgatggcg gcaagcaagc     7620

gctcgaaacg gtgcagcggc tgttgccggt gctgtgccag gaccatggcc tgaccccgga     7680

ccaagtggtg gctatcgcca gcaacggtgg cggcaagcaa gcgctcgaaa gcattgtggc     7740

ccagctgagc cggcctgatc cggcgttggc cgcgttgacc aacgaccacc tcgtcgcctt     7800

ggcctgcctc ggcggacgtc ctgccatgga tgcagtgaaa aagggattgc cgcacgcgcc     7860

ggaattgatc agaagagtca atcgccgtat tggcgaacgc acgtcccatc gcgttgcctc     7920

tagatcccag cctgcaggtt cccaactagt caaaagtgaa ctggaggaga agaaatctga     7980

acttcgtcat aaattgaaat atgtgcctca tgaatatatt gaattaattg aaattgccag     8040

aaattccact caggatagaa ttcttgaaat gaaggtaatg gaatttttta tgaaagttta     8100

tggatataga ggtaaacatt tgggtggatc aaggaaaccg gacggagcaa tttatactgt     8160

cggatctcct attgattacg gtgtgatcgt ggatactaaa gcttatagcg gaggttataa     8220

tctgccaatt ggccaagcag atgaaatgca acgatatgtc gaagaaaatc aaacacgaaa     8280

caaacatatc aaccctaatg aatggtggaa agtctatcca tcttctgtaa cggaatttaa     8340

gtttttattt gtgagtggtc actttaaagg aaactacaaa gctcagctta cacgattaaa     8400

tcatatcact aattgtaatg gagctgttct tagtgtagaa gagcttttaa ttggtggaga     8460

aatgattaaa gccggcacat taaccttaga ggaagtgaga cggaaattta ataacggcga     8520

gataaacttt ggcgcgcctg gcggaggtgg aagtgcaggt gctggatccg gtagtggctc     8580

aggtggtggt ggcggttcag ctggcgctgg aagtggttca ggtagtggag gaggaggcgg     8640

ctctgcagga gcaggctctg gctccggatc tggaggaggt ggcggaagcg ctggtgcagg     8700

ctccggaagc ggaagtggag cgatcgcttc ccagctagtg aaatctgaat tggaagagaa     8760

gaaatctgaa cttagacata aattgaaata tgtgccacat gaatatattg aattgattga     8820

aatcgcaaga aattcaactc aggatagaat ccttgaaatg aaggtgatgg agttctttat     8880

gaaggtttat ggttatcgtg gtaaacattt gggtggatca aggaaaccag acggagcaat     8940

ttatactgtc ggatctccta ttgattacgg tgtgatcgtt gatactaagg catattcagg     9000

aggttataat cttccaattg gtcaagcaga tgaaatgcaa agatatgtcg aagagaatca     9060

aacaagaaac aagcatatca accctaatga atggtggaaa gtctatccat cttcagtaac     9120

agaatttaag ttcttgtttg tgagtggtca tttcaaagga aactacaaag ctcagcttac     9180

aagattgaat catatcacta attgtaatgg agctgttctt agtgtagaag agcttttgat     9240

tggtggagaa atgattaaag ctggtacatt gacacttgag gaagtgagaa ggaaatttaa     9300

taacggtgag ataaactttt agttaattaa gaattcgtcg agggacctaa taacttcgta     9360

tagcatacat tatacgaagt tatacatgtt taagggttcc ggttccacta ggtacaattc     9420

gatatcaagc ttatcgataa tcaacctctg gattacaaaa tttgtgaaag attgactggt     9480

attcttaact atgttgctcc ttttacgcta tgtggatacg ctgctttaat gcctttgtat     9540

catgctattg cttcccgtat ggctttcatt ttctcctcct tgtataaatc ctggttgctg     9600

tctctttatg aggagttgtg gcccgttgtc aggcaacgtg gcgtggtgtg cactgtgttt     9660

gctgacgcaa cccccactgg ttggggcatt gccaccacct gtcagctcct ttccgggact     9720

ttcgctttcc ccctccctat tgccacggcg gaactcatcg ccgcctgcct tgcccgctgc     9780

tggacagggg ctcggctgtt gggcactgac aattccgtgg tgttgtcggg gaaatcatcg     9840

tcctttcctt ggctgctcgc ctgtgttgcc acctggattc tgcgcgggac gtccttctgc     9900

tacgtccctt cggccctcaa tccagcggac cttccttccc gcggcctgct gccggctctg     9960

cggcctcttc cgcgtcttcg ccttcgccct cagacgagtc ggatctccct ttgggccgcc    10020

tccccgcatc gataccgtcg acctcgatcg agacctagaa aaacatggag caatcacaag    10080

tagcaataca gcagctacca atgctgattg tgcctggcta gaagcacaag aggaggagga    10140

ggtgggtttt ccagtcacac ctcaggtacc tttaagacca atgacttaca aggcagctgt    10200

agatcttagc cactttttaa aagaaaaggg gggactggaa gggctaattc actcccaacg    10260

aagacaagat atccttgatc tgtggatcta ccacacacaa ggctacttcc ctgattggca    10320

gaactacaca ccagggccag ggatcagata tccactgacc tttggatggt gctacaagct    10380

agtaccagtt gagcaagaga aggtagaaga agccaatgaa ggagagaaca cccgcttgtt    10440

acaccctgtg agcctgcatg ggatggatga cccggagaga gaagtattag agtggaggtt    10500

tgacagccgc ctagcatttc atcacatggc ccgagagctg catccggact gtactgggtc    10560

tctctggtta gaccagatct gagcctggga gctctctggc taactaggga acccactgct    10620

taagcctcaa taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga    10680

ctctggtaac tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagcat    10740

gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt    10800

ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg    10860

aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc    10920

tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt    10980

ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa    11040

gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta    11100

tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa    11160

caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa    11220

ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt    11280

cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt    11340

ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat    11400

cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat    11460

gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc    11520

aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc    11580

acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta    11640

gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga    11700

cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg    11760

cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc    11820

tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat    11880

cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag    11940

gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat    12000

cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa    12060

ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa    12120

gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga    12180

taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg    12240

gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc    12300

acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg    12360

aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact    12420

cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat    12480

atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt    12540

gccacctgac gcttaagcgg tcgacggatc gggagatctc ccgatcccct atggtgcact    12600

ctcagtacaa tctgctctga tgccgcatag ttaagccagt atctgctccc tgcttgtgtg    12660

ttggaggtcg ctgagtagtg cgcgagcaaa atttaagcta caacaaggca aggcttgacc    12720

gacaattgca tgaagaatct gcttagggtt aggcgttttg cgctgcttcg cgatgtacgg    12780

gccagatata cgcgttgaca ttgattattg actagttatt aatagtaatc aattacgggg    12840

tcattagttc atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg    12900

cctggctgac cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata    12960

gtaacgccaa tagggacttt ccattgacgt caatgggtgg agtatttacg gtaaactgcc    13020

cacttggcag tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac    13080

ggtaaatggc ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg    13140

cagtacatct acgtattagt catcgctatt accatggtga tgcggttttg gcagtacatc    13200

aatgggcgtg gatagcggtt tgactcacgg ggatttccaa gtctccaccc cattgacgtc    13260

aatgggagtt tgttttggca ccaaaatcaa cgggactttc caaaatgtcg taacaactcc    13320

gccccattga cgcaaatggg cggtaggcgt gtacggtggg aggtctatat aagcagcgcg    13380

ttttgcctgt actgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta    13440

actagggaac ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg    13500

tgcccgtctg ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg    13560

gaaaatctct agcagtggcg cccgaacagg gacttgaaag cgaaagggaa accagaggag    13620

ctctctcgac gcaggactcg gcttgctgaa gcgcgcacgg caagaggcga ggggcggcga    13680

ctggtgagta cgccaaaaat tttgactagc ggaggctaga aggagagaga tgggtgcgag    13740

agcgtcagta ttaagcgggg gagaattaga tcgcgatggg aaaaaattcg gttaaggcca    13800

gggggaaaga aaaaatataa attaaaacat atagtatggg caagcaggga gctagaacga    13860

ttcgcagtta atcctggcct gttagaaaca tcagaaggct gtagacaaat actgggacag    13920

ctacaaccat cccttcagac aggatcagaa gaacttagat cattatataa tacagtagca    13980

accctctatt gtgtgcatca aaggatagag ataaaagaca ccaaggaagc tttagacaag    14040

atagaggaag agcaaaacaa aagtaagacc accgcacagc aagcggccgg ccgcgctgat    14100

cttcagacct ggaggaggag atatgaggga caattggaga agtgaattat ataaatataa    14160

agtagtaaaa attgaaccat taggagtagc acccaccaag gcaaagagaa gagtggtgca    14220

gagagaaaaa agagcagtgg gaataggagc tttgttcctt gggttcttgg gagcagcagg    14280

aagcactatg ggcgcagcgt caatgacgct gacggtacag gccagacaat tattgtctgg    14340

tatagtgcag cagcagaaca atttgctgag ggctattgag gcgcaacagc atctgttgca    14400

actcacagtc tggggcatca agcagctcca ggcaagaatc ctggctgtgg aaagatacct    14460

aaaggatcaa cagctcctgg ggatttgggg ttgctctgga aaactcattt gcaccactgc    14520

tgtgccttgg aatgctagtt ggagtaataa atctctggaa cagatttgga atcacacgac    14580

ctggatggag tgggacagag aaattaacaa ttacacaagc ttaatacact ccttaattga    14640

agaatcgcaa aaccagcaag aaaagaatga acaagaatta ttggaattag ataaatgggc    14700

aagtttgtgg aattggttta acataacaaa ttggctgtgg tatataaaat tattcataat    14760

gatagtagga ggcttggtag gtttaagaat agtttttgct gtactttcta tagtgaatag    14820

agttaggcag ggatattcac cattatcgtt tcagacccac ctcccaaccc cgaggggacc    14880

cgacaggccc gaaggaatag aagaagaagg tggagagaga gacagagaca gatccattcg    14940

attagtgaac ggatcggcac tgcgtgcgcc aattctgcag acaaatggca gtattcatcc    15000

acaattttaa aagaaaaggg gggattgggg ggtacagtgc aggggaaaga atagtagaca    15060

taatagcaac agacatacaa actaaagaat tacaaaaaca aattacaaaa attcaaaatt    15120

ttcgggttta ttacagggac agcagagatc cagtttggtt agtaccgggc cctagagatc    15180

acgagactag cctcgagaga tctgatcata atcagccata ccacatttgt agaggtttta    15240

cttgctttaa aaaacctccc acacctcccc ctgaacctga aacataaaat gaatgcaatt    15300

gttgttgtta acttgtttat tgcagcttat aatggttaca aataaggcaa tagcatcaca    15360

aatttcacaa ataaggcatt tttttcactg cattctagtt ttggtttgtc caaactcatc    15420

aatgtatctt atcatgtctg gatctcaaat ccctcggaag ctgcgcctgt catcgaattc    15480

ctgcagcccg gtgcatgact aagctagtac cggttaggat gcatgctagc tcagttagcc    15540

tcccccatct ctcgacgcgg ccgctttaca tggtgagcaa gggcgaggag ctgttcaccg    15600

gggtggtgcc catcctggtc gagctggacg gcgacgtaaa cggccacaag ttcagcgtgt    15660

ccggcgaggg cgagggcgat gccacctacg gcaagctgac cctgaagttc atctgcacca    15720

ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac cctgacctac ggcgtgcagt    15780

gcttcagccg ctaccccgac cacatgaagc agcacgactt cttcaagtcc gccatgcccg    15840

aaggctacgt ccaggagcgc accatcttct tcaaggacga cggcaactac aagacccgcg    15900

ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat cgagctgaag ggcatcgact    15960

tcaaggagga cggcaacatc ctggggcaca agctggagta caactacaac agccacaacg    16020

tctatatcat ggccgacaag cagaagaacg gcatcaaggt gaacttcaag atccgccaca    16080

acatcgagga cggcagcgtg cagctcgccg accactacca gcagaacacc cccatcggcg    16140

acggccccgt gctgctgccc gacaaccact acctgagcac ccagtccgcc ctgagcaaag    16200

accccaacga gaagcgcgat cacatggtcc tgctggagtt cgtgaccgcc gccgggatca    16260

ctctcggcat ggacgagctg tacaaggtgg ctcgagcgga ggctggatcg gtcccggtgt    16320

cttctatgga ggtcaaaaca gcgtggatgg cgtctccagg cgatctgacg gttcactaaa    16380

cgagctctgc ttatataggc ctcccaccgt acacgcctac cctcgagaag cttgatatca    16440

ctagagctct agtgtgcccg tcagtgggca gagcgcacat cgcccacagt ccccgagaag    16500

ttggggggag gggtcggcaa ttgaaccggt gcctagagaa ggtggcgcgg ggtaaactgg    16560

gaaagtgatg tcgtgtactg gctccgcctt tttcccgagg gtgggggaga accgtatata    16620

agtgcagtag tcgccgtgaa cgttcttttt cgcaacgggt ttgccgccag aacagtgagc    16680

tagcgctacc ggtcgccacc cctaggatgt cccctatact aggttattgg aaaattaagg    16740

gccttgtgca acccactcga cttcttttgg aatatcttga agaaaaatat gaagagcatt    16800

tgtatgagcg cgatgaaggt gataaatggc gaaacaaaaa gtttgaattg ggtttggagt    16860

ttcccaatct tccttattat attgatggtg atgttaaatt aacacagtct atggccatca    16920

tacgttatat agctgacaag cacaacatgt tgggtggttg tccaaaagag cgtgcagaga    16980

tttcaatgct tgaaggagcg gttttggata ttagatacgg tgtttcgaga attgcatata    17040

gtaaagactt tgaaactctc aaagttgatt ttcttagcaa gctacctgaa atgctgaaaa    17100

tgttcgaaga tcgtttatgt cataaaacat atttaaatgg tgatcatgta acccatcctg    17160

acttcatgtt gtatgacgct cttgatgttg ttttatacat ggacccaatg tgcctggatg    17220

cgttcccaaa attagtttgt tttaaaaaac gtattgaagc tatcccacaa attgataagt    17280

acttgaaatc cagcaagtat atagcatggc ctttgcaggg ctggcaagcc acgtttggtg    17340

gtggcgacca tcctccaaaa tcggatctgg ttccgcgtgg atccggcggt agtttaaaca    17400

tggcttcctc ccctccaaag aaaaagagaa aggttagttg gaaggacgca agtggttggt    17460

ctagagtgga tctacgcacg ctcggctaca gtcagcagca gcaagagaag atcaaaccga    17520

aggtgcgttc gacagtggcg cagcaccacg aggcactggt gggccatggg tttacacacg    17580

cgcacatcgt tgcgctcagc caacacccgg cagcgttagg gaccgtcgct gtcacgtatc    17640

agcacataat cacggcgttg ccagaggcga cacacgaaga catcgttggc gtcggcaaac    17700

agtggtccgg cgcacgcgcc ctggaggcct tgctcacgga tgcgggggag ttgagaggtc    17760

cgccgttaca gttggacaca ggccaacttg tgaagattgc aaaacgtggc ggcgtgaccg    17820

caatggaggc agtgcatgca tcgcgcaatg cactgacggg tgcccccctg aacctgaccc    17880

cggaccaagt ggtggctatc gccagcaaca atggcggcaa gcaagcgctc gaaacggtgc    17940

agcggctgtt gccggtgctg tgccaggacc atggcctgac cccggaccaa gtggtggcta    18000

tcgccagcaa cggtggcggc aagcaagcgc tcgaaacggt gcagcggctg ttgccggtgc    18060

tgtgccagga ccatggcctg accccggacc aagtggtggc tatcgccagc aacaatggcg    18120

gcaagcaagc gctcgaaacg gtgcagcggc tgttgccggt gctgtgccag gaccatggcc    18180

tgaccccgga ccaagtggtg gctatcgcca gcaacattgg cggcaagcaa gcgctcgaaa    18240

cggtgcagcg gctgttgccg gtgctgtgcc aggaccatgg cctgaccccg gaccaagtgg    18300

tggctatcgc cagcaacaat ggcggcaagc aagcgctcga aacggtgcag cggctgttgc    18360

cggtgctgtg ccaggaccat ggcctgactc cggaccaagt ggtggctatc gccagccacg    18420

atggcggcaa gcaagcgctc gaaacggtgc agcggctgtt gccggtgctg tgccaggacc    18480

atggcctgac cccggaccaa gtggtggcta tcgccagcaa cattggcggc aagcaagcgc    18540

tcgaaacggt gcagcggctg ttgccggtgc tgtgccagga ccatggcctg actccggacc    18600

aagtggtggc tatcgccagc cacgatggcg gcaagcaagc gctcgaaacg gtgcagcggc    18660

tgttgccggt gctgtgccag gaccatggcc tgactccgga ccaagtggtg gctatcgcca    18720

gccacgatgg cggcaagcaa gcgctcgaaa cggtgcagcg gctgttgccg gtgctgtgcc    18780

aggaccatgg cctgactccg gaccaagtgg tggctatcgc cagccacgat ggcggcaagc    18840

aagcgctcga aacggtgcag cggctgttgc cggtgctgtg ccaggaccat ggcctgaccc    18900

cggaccaagt ggtggctatc gccagcaaca ttggcggcaa gcaagcgctc gaaacggtgc    18960

agcggctgtt gccggtgctg tgccaggacc atggcctgac cccggaccaa gtggtggcta    19020

tcgccagcaa caatggcggc aagcaagcgc tcgaaacggt gcagcggctg ttgccggtgc    19080

tgtgccagga ccatggcctg actccggacc aagtggtggc tatcgccagc cacgatggcg    19140

gcaagcaagc gctcgaaacg gtgcagcggc tgttgccggt gctgtgccag gaccatggcc    19200

tgaccccgga ccaagtggtg gctatcgcca gcaacaatgg cggcaagcaa gcgctcgaaa    19260

cggtgcagcg gctgttgccg gtgctgtgcc aggaccatgg cctgaccccg gaccaagtgg    19320

tggctatcgc cagcaacaat ggcggcaagc aagcgctcga aacggtgcag cggctgttgc    19380

cggtgctgtg ccaggaccat ggcctgaccc cggaccaagt ggtggctatc gccagcaaca    19440

ttggcggcaa gcaagcgctc gaaacggtgc agcggctgtt gccggtgctg tgccaggacc    19500

atggcctgac tccggaccaa gtggtggcta tcgccagcca cgatggcggc aagcaagcgc    19560

tcgaaacggt gcagcggctg ttgccggtgc tgtgccagga ccatggcctg actccggacc    19620

aagtggtggc tatcgccagc cacgatggcg gcaagcaagc gctcgaaacg gtgcagcggc    19680

tgttgccggt gctgtgccag gaccatggcc tgaccccgga ccaagtggtg gctatcgcca    19740

gcaacggtgg cggcaagcaa gcgctcgaaa cggtgcagcg gctgttgccg gtgctgtgcc    19800

aggaccatgg cctgactccg gaccaagtgg tggctatcgc cagccacgat ggcggcaagc    19860

aagcgctcga aacggtgcag cggctgttgc cggtgctgtg ccaggaccat ggcctgaccc    19920

cggaccaagt ggtggctatc gccagccacg atggcggcaa gcaagcgctc gaaacggtgc    19980

agcggctgtt gccggtgctg tgccaggacc atggcctgac cccggaccaa gtggtggcta    20040

tcgccagcaa cggtggcggc aagcaagcgc tcgaaacggt gcagcggctg ttgccggtgc    20100

tgtgccagga ccatggcctg actccggacc aagtggtggc tatcgccagc cacgatggcg    20160

gcaagcaagc gctcgaaacg gtgcagcggc tgttgccggt gctgtgccag gaccatggcc    20220

tgaccccgga ccaagtggtg gctatcgcca gcaacggtgg cggcaagcaa gcgctcgaaa    20280

gcattgtggc ccagctgagc cggcctgatc cggcgttggc cgcgttgacc aacgaccacc    20340

tcgtcgcctt ggcctgcctc ggcggacgtc ctgccatgga tgcagtgaaa aagggattgc    20400

cgcacgcgcc ggaattgatc agaagagtca atcgccgtat tggcgaacgc acgtcccatc    20460

gcgttgcctc tagatcccag cctgcaggtt cccaactagt caaaagtgaa ctggaggaga    20520

agaaatctga acttcgtcat aaattgaaat atgtgcctca tgaatatatt gaattaattg    20580

aaattgccag aaattccact caggatagaa ttcttgaaat gaaggtaatg gaatttttta    20640

tgaaagttta tggatataga ggtaaacatt tgggtggatc aaggaaaccg gacggagcaa    20700

tttatactgt cggatctcct attgattacg gtgtgatcgt ggatactaaa gcttatagcg    20760

gaggttataa tctgccaatt ggccaagcag atgaaatgca acgatatgtc gaagaaaatc    20820

aaacacgaaa caaacatatc aaccctaatg aatggtggaa agtctatcca tcttctgtaa    20880

cggaatttaa gtttttattt gtgagtggtc actttaaagg aaactacaaa gctcagctta    20940

cacgattaaa tcatatcact aattgtaatg gagctgttct tagtgtagaa gagcttttaa    21000

ttggtggaga aatgattaaa gccggcacat taaccttaga ggaagtgaga cggaaattta    21060

ataacggcga gataaacttt ggcgcgcctg gcggaggtgg aagtgcaggt gctggatccg    21120

gtagtggctc aggtggtggt ggcggttcag ctggcgctgg aagtggttca ggtagtggag    21180

gaggaggcgg ctctgcagga gcaggctctg gctccggatc tggaggaggt ggcggaagcg    21240

ctggtgcagg ctccggaagc ggaagtggag cgatcgcttc ccagctagtg aaatctgaat    21300

tggaagagaa gaaatctgaa cttagacata aattgaaata tgtgccacat gaatatattg    21360

aattgattga aatcgcaaga aattcaactc aggatagaat ccttgaaatg aaggtgatgg    21420

agttctttat gaaggtttat ggttatcgtg gtaaacattt gggtggatca aggaaaccag    21480

acggagcaat ttatactgtc ggatctccta ttgattacgg tgtgatcgtt gatactaagg    21540

catattcagg aggttataat cttccaattg gtcaagcaga tgaaatgcaa agatatgtcg    21600

aagagaatca aacaagaaac aagcatatca accctaatga atggtggaaa gtctatccat    21660

cttcagtaac agaatttaag ttcttgtttg tgagtggtca tttcaaagga aactacaaag    21720

ctcagcttac aagattgaat catatcacta attgtaatgg agctgttctt agtgtagaag    21780

agcttttgat tggtggagaa atgattaaag ctggtacatt gacacttgag gaagtgagaa    21840

ggaaatttaa taacggtgag ataaactttt agttaattaa gaattcgtcg agggacctaa    21900

taacttcgta tagcatacat tatacgaagt tatacatgtt taagggttcc ggttccacta    21960

ggtacaattc gatatcaagc ttatcgataa tcaacctctg gattacaaaa tttgtgaaag    22020

attgactggt attcttaact atgttgctcc ttttacgcta tgtggatacg ctgctttaat    22080

gcctttgtat catgctattg cttcccgtat ggctttcatt ttctcctcct tgtataaatc    22140

ctggttgctg tctctttatg aggagttgtg gcccgttgtc aggcaacgtg gcgtggtgtg    22200

cactgtgttt gctgacgcaa cccccactgg ttggggcatt gccaccacct gtcagctcct    22260

ttccgggact ttcgctttcc ccctccctat tgccacggcg gaactcatcg ccgcctgcct    22320

tgcccgctgc tggacagggg ctcggctgtt gggcactgac aattccgtgg tgttgtcggg    22380

gaaatcatcg tcctttcctt ggctgctcgc ctgtgttgcc acctggattc tgcgcgggac    22440

gtccttctgc tacgtccctt cggccctcaa tccagcggac cttccttccc gcggcctgct    22500

gccggctctg cggcctcttc cgcgtcttcg ccttcgccct cagacgagtc ggatctccct    22560

ttgggccgcc tccccgcatc gataccgtcg acctcgatcg agacctagaa aaacatggag    22620

caatcacaag tagcaataca gcagctacca atgctgattg tgcctggcta gaagcacaag    22680

aggaggagga ggtgggtttt ccagtcacac ctcaggtacc tttaagacca atgacttaca    22740

aggcagctgt agatcttagc cactttttaa aagaaaaggg gggactggaa gggctaattc    22800

actcccaacg aagacaagat atccttgatc tgtggatcta ccacacacaa ggctacttcc    22860

ctgattggca gaactacaca ccagggccag ggatcagata tccactgacc tttggatggt    22920

gctacaagct agtaccagtt gagcaagaga aggtagaaga agccaatgaa ggagagaaca    22980

cccgcttgtt acaccctgtg agcctgcatg ggatggatga cccggagaga gaagtattag    23040

agtggaggtt tgacagccgc ctagcatttc atcacatggc ccgagagctg catccggact    23100

gtactgggtc tctctggtta gaccagatct gagcctggga gctctctggc taactaggga    23160

acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc    23220

tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg tggaaaatct    23280

ctagcagcat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc    23340

tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc    23400

agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc    23460

tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt    23520

cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg    23580

ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat    23640

ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag    23700

ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt    23760

ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc    23820

cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta    23880

gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag    23940

atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga    24000

ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa    24060

gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa    24120

tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc    24180

ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga    24240

taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa    24300

gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt    24360

gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg    24420

ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc    24480

aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg    24540

gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag    24600

cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt    24660

actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt    24720

caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac    24780

gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac    24840

ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag    24900

caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa    24960

tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga    25020

gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc    25080

cccgaaaagt gccacctgac                                                25100


<210>  4
<211>  306
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide linker sequence

<400>  4
cctagggggg gagggtccgg cggcggttcc ggcggaggat cgggtggagg gtcaggtgga       60

ggctcaggcg gtggatcagg aggagggagc ggtggcggga gcggcggagg gtcgggagga      120

ggttcgggcg gaggctcggg cggtgggtcc ggaggtggct cgggaggcgg aagcggaggc      180

gggtccggtg gcggatcagg cggaggcagc ggaggaggat caggtggcgg aagcggaggc      240

ggctccggag gaggctccgg cggtggaagc ggtggaggaa gcggcggcgg atcgggaggt      300

gggtcg                                                                 306


<210>  5
<211>  102
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  synthetic protein linker sequence

<400>  5

Pro Arg Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly 
1               5                   10                  15      


Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly 
            20                  25                  30          


Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly 
        35                  40                  45              


Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly 
    50                  55                  60                  


Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly 
65                  70                  75                  80  


Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly 
                85                  90                  95      


Gly Ser Gly Gly Gly Ser 
            100         


<210>  6
<211>  180
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic linker nucleotide sequence

<400>  6
ggcggaggtg gaagtgcagg tgctggatcc ggtagtggct caggtggtgg tggcggttca       60

gctggcgctg gaagtggttc aggtagtgga ggaggaggcg gctctgcagg agcaggctct      120

ggctccggat ctggaggagg tggcggaagc gctggtgcag gctccggaag cggaagtgga      180


<210>  7
<211>  60
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  synthetic linker protein sequence

<400>  7

Gly Gly Gly Gly Ser Ala Gly Ala Gly Ser Gly Ser Gly Ser Gly Gly 
1               5                   10                  15      


Gly Gly Gly Ser Ala Gly Ala Gly Ser Gly Ser Gly Ser Gly Gly Gly 
            20                  25                  30          


Gly Gly Ser Ala Gly Ala Gly Ser Gly Ser Gly Ser Gly Gly Gly Gly 
        35                  40                  45              


Gly Ser Ala Gly Ala Gly Ser Gly Ser Gly Ser Gly 
    50                  55                  60  


<210>  8
<211>  15
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  synthetic linker sequence


<220>
<221>  MOD_RES
<222>  (5)..(15)
<223>  Xaa may be present or absent; if present, repeats as 5 amino 
       acids at a time with a sequence of Gly Gly Gly Gly Ser

<400>  8

Gly Gly Gly Gly Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1               5                   10                  15  


<210>  9
<211>  27
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  synthetic zinc finger motif


<220>
<221>  MOD_RES
<222>  (1)..(27)
<223>  Xaa is any amino acid

<220>
<221>  MOD_RES
<222>  (6)..(7)
<223>  Xaa may be present or absent; if present, both residues are 
       present

<220>
<221>  MOD_RES
<222>  (25)..(26)
<223>  Xaa may be present or absent

<400>  9

Xaa Xaa Cys Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1               5                   10                  15      


Xaa Xaa Xaa Xaa His Xaa Xaa Xaa Xaa Xaa His 
            20                  25          


<210>  10
<211>  100
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  synthetic Cas9:gRNA target sequence


<220>
<221>  misc_feature
<222>  (1)..(20)
<223>  n is a, c, g or u

<220>
<221>  misc_feature
<222>  (24)..(25)
<223>  n is u for both ribonucleosides or g for both ribonucleosides

<400>  10
nnnnnnnnnn nnnnnnnnnn guunnagagc uagaaauagc aaguuaammu aaggcuaguc       60

cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu                            100


<210>  11
<211>  12
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence

<400>  11
ccataaagta gg                                                           12


<210>  12
<211>  17
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence

<400>  12
ccataaagga tagtagg                                                      17


<210>  13
<211>  16
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence

<400>  13
ccataaagcg agtagg                                                       16


<210>  14
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence

<400>  14
ccataaagac caagtagg                                                     18


<210>  15
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence

<400>  15
ccataaagcc cccaagtagg                                                   20


<210>  16
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence

<400>  16
ccataaggct taaagtagg                                                    19


<210>  17
<211>  11
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic recombination sequence

<400>  17
cgtgtcgatc g                                                            11


<210>  18
<211>  11
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic recombination sequence

<400>  18
gcgcgtgcaa c                                                            11


<210>  19
<211>  13
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic recombination sequence

<400>  19
cgtgtcgatc ggc                                                          13


<210>  20
<211>  13
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic recombination sequence

<400>  20
gcgcctcgac acg                                                          13


<210>  21
<211>  96
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence


<220>
<221>  misc_feature
<222>  (85)..(86)
<223>  N is absent at 85-86; Or N at 85 is C and at 86 is T

<400>  21
caccctaact gtaaagtaat tgtgtgtttt gagactataa gtatccctag gagaaccacc       60

ttgttggtag cttctgggcg agttnntacg ggttag                                 96


<210>  22
<211>  100
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence


<220>
<221>  misc_feature
<222>  (85)..(90)
<223>  N is absent at 85-90; Or N at 85 is C, at 86 is T, and is C at 
       each of 87-90

<400>  22
caccctaact gtaaagtaat tgtgtgtttt gagactataa gtatccctag gagaaccacc       60

ttgttggtag cttctgggcg agttnnnnnn tacgggttag                            100


<210>  23
<211>  100
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence


<220>
<221>  misc_feature
<222>  (84)..(90)
<223>  N is absent at 84-90; Or N at 84 is C, at 85 is T, at 86-87 is A 
       and at 88-90 is C

<400>  23
accctaactg taaagtaatt gtgtgttttg agactataag tatccctagg agaaccacct       60

tgttggtagc ttctgggcga gttnnnnnnn tacgggttag                            100


<210>  24
<211>  56
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence

<400>  24
agtatcccta ggagaaccac cttgttggta gcttctgggc gagtttacgg gttaga           56


<210>  25
<211>  100
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence with barcode

<400>  25
agtatcccta ggagaaccac cttgttggta gcttctgggc gagttgctcc ctcgtgcgct       60

ccacctgttc cgacccttcc ggttgccggt acgggttaga                            100


<210>  26
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence

<400>  26
acgggttaga gctagaaata gcaagttaac ctaaggctag tccgttatca a                51


<210>  27
<211>  100
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence with barcode

<400>  27
atccctagga gaaccacctt gttggtagct tctgggcgag ttagaagcta cgggttagag       60

ctagaaatag caagttaacc taaggctagt ccgttatcaa                            100


<210>  28
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence

<400>  28
ccctggtgaa ccgcatcgag ctgaagggca                                        30


<210>  29
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence with deletion

<400>  29
ccctggtgaa ccgcatcgag ca                                                22


<210>  30
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence with barcode

<400>  30
ccctggtgaa ccgcatcgag caggggcccg aagggca                                37


<210>  31
<211>  40
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence

<400>  31
ttggtagctt ctgggcgagt ttacgggtta gagctagaaa                             40


<210>  32
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence with deletion

<400>  32
ttggtagctt ctgtacgggt tagagctaga aa                                     32


<210>  33
<211>  53
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence with barcode

<400>  33
ttggtagctt ctgggccctc ggcctcgagt ttcttacggg ttagagctag aaa              53


<210>  34
<211>  13
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence of barcode insertions

<400>  34
agaagttaaa agt                                                          13


<210>  35
<211>  13
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence of barcode insertions

<400>  35
agaagttaga agc                                                          13


<210>  36
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence of barcode insertions

<400>  36
agagctacgg cttagagc                                                     18


<210>  37
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence of barcode insertions

<400>  37
agagctagaa agacgggtta gaaa                                              24


<210>  38
<211>  11
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence of barcode insertions

<400>  38
agagttagaa a                                                            11


<210>  39
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic nucleotide sequence of barcode insertions

<400>  39
gagttaccgt aactctggg                                                    19


