                         SEQUENCE LISTING

<110>  ALPINE BIOTHERAPEUTICS CORPORATION
       WU, Ying
       ZHAO, Jiagang
 
<120>  NUCLEIC ACIDS AND METHODS FOR GENOME EDITING

<130>  046432-0456588

<140>  PCT/US18/12618
<141>  2018-01-05

<150>  62/443,515
<151>  2017-01-07

<160>  26    

<170>  PatentIn version 3.5

<210>  1
<211>  2666
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Argonaute polypeptide

<400>  1
ccaccgtgat cgacctggac tccaccacca ccgccgacga gctgacctcc ggccacacct       60

acgacatctc cgtgaccctg accggcgtgt acgacaacac cgacgagcag cacccccgga      120

tgtccctggc cttcgagcag gacaacggcg agcggcggta catcaccctg tggaagaaca      180

ccacccccaa ggacgtgttc acctacgact acgccaccgg ctccacctac atcttcacca      240

acatcgacta cgaggtgaag gacggctacg agaacctgac cgccacctac cagaccaccg      300

tggagaacgc caccgcccag gaggtgggca ccaccgacga ggacgagacc ttcgccggcg      360

gcgagcccct ggaccaccac ctggacgacg ccctgaacga gacccccgac gacgccgaga      420

ccgagtccga ctccggccac gtgatgacct ccttcgcctc ccgggaccag ctgcccgagt      480

ggaccctgca cacctacacc ctgaccgcca ccgacggcgc caagaccgac accgagtacg      540

cccggcggac cctggcctac accgtgcggc aggagctgta caccgaccac gacgccgccc      600

ccgtggccac cgacggcctg atgctgctga cccccgagcc cctgggcgag acccccctgg      660

acctggactg cggcgtgcgg gtggaggccg acgagacccg gaccctggac tacaccaccg      720

ccaaggaccg gctgctggcc cgggagctgg tggaggaggg cctgaagcgg tccctgtggg      780

acgactacct ggtgcggggc atcgacgagg tgctgtccaa ggagcccgtg ctgacctgcg      840

acgagttcga cctgcacgag cggtacgacc tgtccgtgga ggtgggccac tccggccggg      900

cctacctgca catcaacttc cggcaccggt tcgtgcccaa gctgaccctg gccgacatcg      960

acgacgacaa catctacccc ggcctgcggg tgaagaccac ctaccggccc cggcggggcc     1020

acatcgtgtg gggcctgcgg gacgagtgcg ccaccgactc cctgaacacc ctgggcaacc     1080

agtccgtggt ggcctaccac cggaacaacc agacccccat caacaccgac ctgctggacg     1140

ccatcgaggc cgccgaccgg cgggtggtgg agacccggcg gcagggccac ggcgacgacg     1200

ccgtgtcctt cccccaggag ctgctggccg tggagcccaa cacccaccag atcaagcagt     1260

tcgcctccga cggcttccac cagcaggccc ggtccaagac ccggctgtcc gcctcccggt     1320

gctccgagaa ggcccaggcc ttcgccgagc ggctggaccc cgtgcggctg aacggctcca     1380

ccgtggagtt ctcctccgag ttcttcaccg gcaacaacga gcagcagctg cggctgctgt     1440

acgagaacgg cgagtccgtg ctgaccttcc gggacggcgc ccggggcgcc caccccgacg     1500

agaccttctc caagggcatc gtgaaccccc ccgagtcctt cgaggtggcc gtggtgctgc     1560

ccgagcagca ggccgacacc tgcaaggccc agtgggacac catggccgac ctgctgaacc     1620

aggccggcgc cccccccacc cggtccgaga ccgtgcagta cgacgccttc tcctcccccg     1680

agtccatctc cctgaacgtg gccggcgcca tcgacccctc cgaggtggac gccgccttcg     1740

tggtgctgcc ccccgaccag gagggcttcg ccgacctggc ctcccccacc gagacctacg     1800

acgagctgaa gaaggccctg gccaacatgg gcatctactc ccagatggcc tacttcgacc     1860

ggttccggga cgccaagatc ttctacaccc ggaacgtggc cctgggcctg ctggccgccg     1920

ccggcggcgt ggccttcacc accgagcacg ccatgcccgg cgacgccgac atgttcatcg     1980

gcatcgacgt gtcccggtcc taccccgagg acggcgcctc cggccagatc aacatcgccg     2040

ccaccgccac cgccgtgtac aaggacggca ccatcctggg ccactcctcc acccggcccc     2100

agctgggcga gaagctgcag tccaccgacg tgcgggacat catgaagaac gccatcctgg     2160

gctaccagca ggtgaccggc gagtccccca cccacatcgt gatccaccgg gacggcttca     2220

tgaacgagga cctggacccc gccaccgagt tcctgaacga gcagggcgtg gagtacgaca     2280

tcgtggagat ccggaagcag ccccagaccc ggctgctggc cgtgtccgac gtgcagtacg     2340

acacccccgt gaagtccatc gccgccatca accagaacga gccccgggcc accgtggcca     2400

ccttcggcgc ccccgagtac ctggccaccc gggacggcgg cggcctgccc cggcccatcc     2460

agatcgagcg ggtggccggc gagaccgaca tcgagaccct gacccggcag gtgtacctgc     2520

tgtcccagtc ccacatccag gtgcacaact ccaccgcccg gctgcccatc accaccgcct     2580

acgccgacca ggcctccacc cacgccacca agggctacct ggtgcagacc ggcgccttcg     2640

agtccaacgt gggcttcctg tctaga                                          2666


<210>  2
<211>  1286
<212>  PRT
<213>  Natronobacterium gregoryi

<400>  2

Met Val Pro Lys Lys Lys Arg Lys Val Ala Thr Val Ile Asp Leu Asp 
1               5                   10                  15      


Ser Thr Thr Thr Ala Asp Glu Leu Thr Ser Gly His Thr Tyr Asp Ile 
            20                  25                  30          


Ser Val Thr Leu Thr Gly Val Tyr Asp Asn Thr Asp Glu Gln His Pro 
        35                  40                  45              


Arg Met Ser Leu Ala Phe Glu Gln Asp Asn Gly Glu Arg Arg Tyr Ile 
    50                  55                  60                  


Thr Leu Trp Lys Asn Thr Thr Pro Lys Asp Val Phe Thr Tyr Asp Tyr 
65                  70                  75                  80  


Ala Thr Gly Ser Thr Tyr Ile Phe Thr Asn Ile Asp Tyr Glu Val Lys 
                85                  90                  95      


Asp Gly Tyr Glu Asn Leu Thr Ala Thr Tyr Gln Thr Thr Val Glu Asn 
            100                 105                 110         


Ala Thr Ala Gln Glu Val Gly Thr Thr Asp Glu Asp Glu Thr Phe Ala 
        115                 120                 125             


Gly Gly Glu Pro Leu Asp His His Leu Asp Asp Ala Leu Asn Glu Thr 
    130                 135                 140                 


Pro Asp Asp Ala Glu Thr Glu Ser Asp Ser Gly His Val Met Thr Ser 
145                 150                 155                 160 


Phe Ala Ser Arg Asp Gln Leu Pro Glu Trp Thr Leu His Thr Tyr Thr 
                165                 170                 175     


Leu Thr Ala Thr Asp Gly Ala Lys Thr Asp Thr Glu Tyr Ala Arg Arg 
            180                 185                 190         


Thr Leu Ala Tyr Thr Val Arg Gln Glu Leu Tyr Thr Asp His Asp Ala 
        195                 200                 205             


Ala Pro Val Ala Thr Asp Gly Leu Met Leu Leu Thr Pro Glu Pro Leu 
    210                 215                 220                 


Gly Glu Thr Pro Leu Asp Leu Asp Cys Gly Val Arg Val Glu Ala Asp 
225                 230                 235                 240 


Glu Thr Arg Thr Leu Asp Tyr Thr Thr Ala Lys Asp Arg Leu Leu Ala 
                245                 250                 255     


Arg Glu Leu Val Glu Glu Gly Leu Lys Arg Ser Leu Trp Asp Asp Tyr 
            260                 265                 270         


Leu Val Arg Gly Ile Asp Glu Val Leu Ser Lys Glu Pro Val Leu Thr 
        275                 280                 285             


Cys Asp Glu Phe Asp Leu His Glu Arg Tyr Asp Leu Ser Val Glu Val 
    290                 295                 300                 


Gly His Ser Gly Arg Ala Tyr Leu His Ile Asn Phe Arg His Arg Phe 
305                 310                 315                 320 


Val Pro Lys Leu Thr Leu Ala Asp Ile Asp Asp Asp Asn Ile Tyr Pro 
                325                 330                 335     


Gly Leu Arg Val Lys Thr Thr Tyr Arg Pro Arg Arg Gly His Ile Val 
            340                 345                 350         


Trp Gly Leu Arg Asp Glu Cys Ala Thr Asp Ser Leu Asn Thr Leu Gly 
        355                 360                 365             


Asn Gln Ser Val Val Ala Tyr His Arg Asn Asn Gln Thr Pro Ile Asn 
    370                 375                 380                 


Thr Asp Leu Leu Asp Ala Ile Glu Ala Ala Asp Arg Arg Val Val Glu 
385                 390                 395                 400 


Thr Arg Arg Gln Gly His Gly Asp Asp Ala Val Ser Phe Pro Gln Glu 
                405                 410                 415     


Leu Leu Ala Val Glu Pro Asn Thr His Gln Ile Lys Gln Phe Ala Ser 
            420                 425                 430         


Asp Gly Phe His Gln Gln Ala Arg Ser Lys Thr Arg Leu Ser Ala Ser 
        435                 440                 445             


Arg Cys Ser Glu Lys Ala Gln Ala Phe Ala Glu Arg Leu Asp Pro Val 
    450                 455                 460                 


Arg Leu Asn Gly Ser Thr Val Glu Phe Ser Ser Glu Phe Phe Thr Gly 
465                 470                 475                 480 


Asn Asn Glu Gln Gln Leu Arg Leu Leu Tyr Glu Asn Gly Glu Ser Val 
                485                 490                 495     


Leu Thr Phe Arg Asp Gly Ala Arg Gly Ala His Pro Asp Glu Thr Phe 
            500                 505                 510         


Ser Lys Gly Ile Val Asn Pro Pro Glu Ser Phe Glu Val Ala Val Val 
        515                 520                 525             


Leu Pro Glu Gln Gln Ala Asp Thr Cys Lys Ala Gln Trp Asp Thr Met 
    530                 535                 540                 


Ala Asp Leu Leu Asn Gln Ala Gly Ala Pro Pro Thr Arg Ser Glu Thr 
545                 550                 555                 560 


Val Gln Tyr Asp Ala Phe Ser Ser Pro Glu Ser Ile Ser Leu Asn Val 
                565                 570                 575     


Ala Gly Ala Ile Asp Pro Ser Glu Val Asp Ala Ala Phe Val Val Leu 
            580                 585                 590         


Pro Pro Asp Gln Glu Gly Phe Ala Asp Leu Ala Ser Pro Thr Glu Thr 
        595                 600                 605             


Tyr Asp Glu Leu Lys Lys Ala Leu Ala Asn Met Gly Ile Tyr Ser Gln 
    610                 615                 620                 


Met Ala Tyr Phe Asp Arg Phe Arg Asp Ala Lys Ile Phe Tyr Thr Arg 
625                 630                 635                 640 


Asn Val Ala Leu Gly Leu Leu Ala Ala Ala Gly Gly Val Ala Phe Thr 
                645                 650                 655     


Thr Glu His Ala Met Pro Gly Asp Ala Asp Met Phe Ile Gly Ile Asp 
            660                 665                 670         


Val Ser Arg Ser Tyr Pro Glu Asp Gly Ala Ser Gly Gln Ile Asn Ile 
        675                 680                 685             


Ala Ala Thr Ala Thr Ala Val Tyr Lys Asp Gly Thr Ile Leu Gly His 
    690                 695                 700                 


Ser Ser Thr Arg Pro Gln Leu Gly Glu Lys Leu Gln Ser Thr Asp Val 
705                 710                 715                 720 


Arg Asp Ile Met Lys Asn Ala Ile Leu Gly Tyr Gln Gln Val Thr Gly 
                725                 730                 735     


Glu Ser Pro Thr His Ile Val Ile His Arg Asp Gly Phe Met Asn Glu 
            740                 745                 750         


Asp Leu Asp Pro Ala Thr Glu Phe Leu Asn Glu Gln Gly Val Glu Tyr 
        755                 760                 765             


Asp Ile Val Glu Ile Arg Lys Gln Pro Gln Thr Arg Leu Leu Ala Val 
    770                 775                 780                 


Ser Asp Val Gln Tyr Asp Thr Pro Val Lys Ser Ile Ala Ala Ile Asn 
785                 790                 795                 800 


Gln Asn Glu Pro Arg Ala Thr Val Ala Thr Phe Gly Ala Pro Glu Tyr 
                805                 810                 815     


Leu Ala Thr Arg Asp Gly Gly Gly Leu Pro Arg Pro Ile Gln Ile Glu 
            820                 825                 830         


Arg Val Ala Gly Glu Thr Asp Ile Glu Thr Leu Thr Arg Gln Val Tyr 
        835                 840                 845             


Leu Leu Ser Gln Ser His Ile Gln Val His Asn Ser Thr Ala Arg Leu 
    850                 855                 860                 


Pro Ile Thr Thr Ala Tyr Ala Asp Gln Ala Ser Thr His Ala Thr Lys 
865                 870                 875                 880 


Gly Tyr Leu Val Gln Thr Gly Ala Phe Glu Ser Asn Val Gly Phe Leu 
                885                 890                 895     


Arg Asp Pro Tyr Val Ser Lys Glu Ser Phe Glu Val Ala Val Val Leu 
            900                 905                 910         


Pro Glu Gln Gln Ala Asp Thr Cys Lys Ala Gln Trp Asp Thr Met Ala 
        915                 920                 925             


Asp Leu Leu Asn Gln Ala Gly Ala Pro Pro Thr Arg Ser Glu Thr Val 
    930                 935                 940                 


Gln Tyr Asp Ala Phe Ser Ser Pro Glu Ser Ile Ser Leu Asn Val Ala 
945                 950                 955                 960 


Gly Ala Ile Asp Pro Ser Glu Val Asp Ala Ala Phe Val Val Leu Pro 
                965                 970                 975     


Pro Asp Gln Glu Gly Phe Ala Asp Leu Ala Ser Pro Thr Glu Thr Tyr 
            980                 985                 990         


Asp Glu Leu Lys Lys Ala Leu Ala  Asn Met Gly Ile Tyr  Ser Gln Met 
        995                 1000                 1005             


Ala Tyr  Phe Asp Arg Phe Arg  Asp Ala Lys Ile Phe  Tyr Thr Arg 
    1010                 1015                 1020             


Asn Val  Ala Leu Gly Leu Leu  Ala Ala Ala Gly Gly  Val Ala Phe 
    1025                 1030                 1035             


Thr Thr  Glu His Ala Met Pro  Gly Asp Ala Asp Met  Phe Ile Gly 
    1040                 1045                 1050             


Ile Asp  Val Ser Arg Ser Tyr  Pro Glu Asp Gly Ala  Ser Gly Gln 
    1055                 1060                 1065             


Ile Asn  Ile Ala Ala Thr Ala  Thr Ala Val Tyr Lys  Asp Gly Thr 
    1070                 1075                 1080             


Ile Leu  Gly His Ser Ser Thr  Arg Pro Gln Leu Gly  Glu Lys Leu 
    1085                 1090                 1095             


Gln Ser  Thr Asp Val Arg Asp  Ile Met Lys Asn Ala  Ile Leu Gly 
    1100                 1105                 1110             


Tyr Gln  Gln Val Thr Gly Glu  Ser Pro Thr His Ile  Val Ile His 
    1115                 1120                 1125             


Arg Asp  Gly Phe Met Asn Glu  Asp Leu Asp Pro Ala  Thr Glu Phe 
    1130                 1135                 1140             


Leu Asn  Glu Gln Gly Val Glu  Tyr Asp Ile Val Glu  Ile Arg Lys 
    1145                 1150                 1155             


Gln Pro  Gln Thr Arg Leu Leu  Ala Val Ser Asp Val  Gln Tyr Asp 
    1160                 1165                 1170             


Thr Pro  Val Lys Ser Ile Ala  Ala Ile Asn Gln Asn  Glu Pro Arg 
    1175                 1180                 1185             


Ala Thr  Val Ala Thr Phe Gly  Ala Pro Glu Tyr Leu  Ala Thr Arg 
    1190                 1195                 1200             


Asp Gly  Gly Gly Leu Pro Arg  Pro Ile Gln Ile Glu  Arg Val Ala 
    1205                 1210                 1215             


Gly Glu  Thr Asp Ile Glu Thr  Leu Thr Arg Gln Val  Tyr Leu Leu 
    1220                 1225                 1230             


Ser Gln  Ser His Ile Gln Val  His Asn Ser Thr Ala  Arg Leu Pro 
    1235                 1240                 1245             


Ile Thr  Thr Ala Tyr Ala Asp  Gln Ala Ser Thr His  Ala Thr Lys 
    1250                 1255                 1260             


Gly Tyr  Leu Val Gln Thr Gly  Ala Phe Glu Ser Asn  Val Gly Phe 
    1265                 1270                 1275             


Leu Arg  Asp Pro Tyr Val Ser  Lys 
    1280                 1285     


<210>  3
<211>  2712
<212>  DNA
<213>  Natronobacterium gregoryi

<400>  3
atggtgccaa aaaagaagag aaaggtagcc acagtgattg acctcgattc gaccaccacc       60

gcagacgaac tgacatcggg acacacgtac gacatctcag tcacgctcac cggtgtctac      120

gataacaccg acgagcagca tcctcgcatg tctctcgcat tcgagcagga caacggcgag      180

cggcgttaca ttaccctgtg gaagaacacg acacccaagg atgtctttac atacgactac      240

gccacgggct cgacgtacat cttcactaac atcgactacg aagtgaagga cggctacgag      300

aatctgactg caacatacca gacgaccgtc gagaacgcta ccgctcagga agtcgggacg      360

actgacgagg acgaaacgtt cgcgggcggc gagccgctcg accatcactt ggacgacgcg      420

ctcaatgaga cgccagacga cgcggagaca gagagcgact caggccatgt gatgacctcg      480

ttcgcctccc gcgaccaact ccctgagtgg acgctgcata cgtatacgct aacagccaca      540

gacggcgcaa agacggacac ggagtacgcg cgacgaaccc tcgcatacac ggtacggcag      600

gaactctata ccgaccatga tgcggctccg gttgcaactg acgggctaat gcttctcacg      660

ccagagccgc tcggcgagac cccgcttgac ctcgattgcg gtgtccgggt cgaggcggac      720

gagactcgga cactcgatta caccacggcc aaagaccggt tactcgcccg cgaactcgtc      780

gaagaggggc tcaaacgctc cctctgggat gactacctcg ttcgcggcat cgatgaagtc      840

ctctcaaagg agcctgtgct gacttgcgat gagttcgacc tacatgagcg gtatgacctc      900

tctgtcgaag tcggtcacag tgggcgggcg taccttcaca tcaacttccg ccaccggttc      960

gtaccgaagc tgacgctcgc agacatcgat gatgacaaca tctatcctgg gctccgggtg     1020

aagacgacgt atcgcccccg gcgaggacat atcgtctggg gtctgcggga cgagtgcgcc     1080

accgactcgc tcaacacgct gggaaaccag tccgtcgttg cataccaccg caacaatcag     1140

acacctatta acactgacct cctcgacgct atcgaggccg ctgaccggcg agtcgtcgaa     1200

acccgacgtc aagggcacgg cgatgatgct gtctcattcc cccaagaact gcttgcggtc     1260

gaaccgaata cgcaccaaat taagcagttc gcctccgacg gattccacca acaggcccgc     1320

tcaaagacgc gtctctcggc ctcccgctgc agcgagaaag cgcaagcgtt cgccgagcgg     1380

cttgacccgg tgcgtctcaa tgggtccacg gtagagttct cctcggagtt tttcaccggg     1440

aacaacgagc agcaactgcg cctcctctac gagaacggtg agtcggttct gacgttccgc     1500

gacggggcgc gtggtgcgca ccccgacgag acattctcga aaggtatcgt caatccacca     1560

gagtcgttcg aggtggccgt agtactgccc gagcagcagg cagatacctg caaagcgcag     1620

tgggacacga tggctgacct cctcaaccaa gctggcgcgc caccgacacg gagcgagacc     1680

gtccaatatg atgcgttctc ctcgccagag agcatcagcc tcaatgtggc tggagccatc     1740

gaccctagcg aggtagacgc ggcattcgtc gtactgccgc cggaccaaga aggattcgca     1800

gacctcgcca gtccgacaga gacgtacgac gagctgaaga aggcgcttgc caacatgggc     1860

atttacagcc agatggcgta cttcgaccgg ttccgcgacg cgaaaatatt ctatactcgt     1920

aacgtggcac tcgggctgct ggcagccgct ggcggcgtcg cattcacaac cgaacatgcg     1980

atgcctgggg acgcagatat gttcattggg attgatgtct ctcggagcta ccccgaggac     2040

ggtgccagcg gccagataaa cattgccgcg acggcgaccg ccgtctacaa ggatggaact     2100

atcctcggcc actcgtccac ccgaccgcag ctcggggaga aactacagtc gacggatgtt     2160

cgtgacatta tgaagaatgc catcctcggc taccagcagg tgaccggtga gtcgccgacc     2220

catatcgtca tccaccgtga tggcttcatg aacgaagacc tcgaccccgc cacggaattc     2280

ctcaacgaac aaggcgtcga gtacgacatc gtcgaaatcc gcaagcagcc ccagacacgc     2340

ctgctggcag tctccgatgt gcagtacgat acgcctgtga agagcatcgc cgctatcaac     2400

cagaacgagc cacgggcaac ggtcgccacc ttcggcgcac ccgaatactt agcgacacgc     2460

gatggaggcg gccttccccg cccaatccaa attgaacgag tcgccggcga aaccgacatc     2520

gagacgctca ctcgccaagt ctatctgctc tcccagtcgc atatccaggt ccataactcg     2580

actgcgcgcc tacccatcac caccgcatac gccgaccagg caagtactca cgcgaccaag     2640

ggttacctcg tccagaccgg agcgttcgag tctaatgtcg gattcctccg ggatccatat     2700

gtaagtaagt aa                                                         2712


<210>  4
<211>  2706
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Argonaute polypeptide

<400>  4
gatatcgcca ccatggtgcc aaaaaagaag agaaaggtag ccaccgtgat cgacctggac       60

tccaccacca ccgccgacga gctgacctcc ggccacacct acgacatctc cgtgaccctg      120

accggcgtgt acgacaacac cgacgagcag cacccccgga tgtccctggc cttcgagcag      180

gacaacggcg agcggcggta catcaccctg tggaagaaca ccacccccaa ggacgtgttc      240

acctacgact acgccaccgg ctccacctac atcttcacca acatcgacta cgaggtgaag      300

gacggctacg agaacctgac cgccacctac cagaccaccg tggagaacgc caccgcccag      360

gaggtgggca ccaccgacga ggacgagacc ttcgccggcg gcgagcccct ggaccaccac      420

ctggacgacg ccctgaacga gacccccgac gacgccgaga ccgagtccga ctccggccac      480

gtgatgacct ccttcgcctc ccgggaccag ctgcccgagt ggaccctgca cacctacacc      540

ctgaccgcca ccgacggcgc caagaccgac accgagtacg cccggcggac cctggcctac      600

accgtgcggc aggagctgta caccgaccac gacgccgccc ccgtggccac cgacggcctg      660

atgctgctga cccccgagcc cctgggcgag acccccctgg acctggactg cggcgtgcgg      720

gtggaggccg acgagacccg gaccctggac tacaccaccg ccaaggaccg gctgctggcc      780

cgggagctgg tggaggaggg cctgaagcgg tccctgtggg acgactacct ggtgcggggc      840

atcgacgagg tgctgtccaa ggagcccgtg ctgacctgcg acgagttcga cctgcacgag      900

cggtacgacc tgtccgtgga ggtgggccac tccggccggg cctacctgca catcaacttc      960

cggcaccggt tcgtgcccaa gctgaccctg gccgacatcg acgacgacaa catctacccc     1020

ggcctgcggg tgaagaccac ctaccggccc cggcggggcc acatcgtgtg gggcctgcgg     1080

gacgagtgcg ccaccgactc cctgaacacc ctgggcaacc agtccgtggt ggcctaccac     1140

cggaacaacc agacccccat caacaccgac ctgctggacg ccatcgaggc cgccgaccgg     1200

cgggtggtgg agacccggcg gcagggccac ggcgacgacg ccgtgtcctt cccccaggag     1260

ctgctggccg tggagcccaa cacccaccag atcaagcagt tcgcctccga cggcttccac     1320

cagcaggccc ggtccaagac ccggctgtcc gcctcccggt gctccgagaa ggcccaggcc     1380

ttcgccgagc ggctggaccc cgtgcggctg aacggctcca ccgtggagtt ctcctccgag     1440

ttcttcaccg gcaacaacga gcagcagctg cggctgctgt acgagaacgg cgagtccgtg     1500

ctgaccttcc gggacggcgc ccggggcgcc caccccgacg agaccttctc caagggcatc     1560

gtgaaccccc ccgagtcctt cgaggtggcc gtggtgctgc ccgagcagca ggccgacacc     1620

tgcaaggccc agtgggacac catggccgac ctgctgaacc aggccggcgc cccccccacc     1680

cggtccgaga ccgtgcagta cgacgccttc tcctcccccg agtccatctc cctgaacgtg     1740

gccggcgcca tcgacccctc cgaggtggac gccgccttcg tggtgctgcc ccccgaccag     1800

gagggcttcg ccgacctggc ctcccccacc gagacctacg acgagctgaa gaaggccctg     1860

gccaacatgg gcatctactc ccagatggcc tacttcgacc ggttccggga cgccaagatc     1920

ttctacaccc ggaacgtggc cctgggcctg ctggccgccg ccggcggcgt ggccttcacc     1980

accgagcacg ccatgcccgg cgacgccgac atgttcatcg gcatcgacgt gtcccggtcc     2040

taccccgagg acggcgcctc cggccagatc aacatcgccg ccaccgccac cgccgtgtac     2100

aaggacggca ccatcctggg ccactcctcc acccggcccc agctgggcga gaagctgcag     2160

tccaccgacg tgcgggacat catgaagaac gccatcctgg gctaccagca ggtgaccggc     2220

gagtccccca cccacatcgt gatccaccgg gacggcttca tgaacgagga cctggacccc     2280

gccaccgagt tcctgaacga gcagggcgtg gagtacgaca tcgtggagat ccggaagcag     2340

ccccagaccc ggctgctggc cgtgtccgac gtgcagtacg acacccccgt gaagtccatc     2400

gccgccatca accagaacga gccccgggcc accgtggcca ccttcggcgc ccccgagtac     2460

ctggccaccc gggacggcgg cggcctgccc cggcccatcc agatcgagcg ggtggccggc     2520

gagaccgaca tcgagaccct gacccggcag gtgtacctgc tgtcccagtc ccacatccag     2580

gtgcacaact ccaccgcccg gctgcccatc accaccgcct acgccgacca ggcctccacc     2640

cacgccacca agggctacct ggtgcagacc ggcgccttcg agtccaacgt gggcttcctg     2700

tctaga                                                                2706


<210>  5
<211>  2477
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Argonaute polypeptide

<400>  5
atggtgccaa aaaagaagag aaaggtagcc accgtgatcg acctggactc caccaccacc       60

gccgacgagc tgacctccgg ccacacctac gacatctccg tgaccctgac cggcgtgtac      120

gacaacaccg acgagcagca cccccggatg tccctggcct tcgagcagga caacggcgag      180

cggcggtaca tcaccctgtg gaagaacacc acccccaagg acgtgttcac ctacgactac      240

gccaccggct ccacctacat cttcaccaac atcgactacg aggtgaagga cggctacgag      300

aacctgaccg ccacctacca gaccaccgtg gagaacgcca ccgcccagga ggtgggcacc      360

accgacgagg acgagacctt cgccggcggc gagcccctgg accaccacct ggacgacgcc      420

ctgaacgaga cccccgacga cgccgagacc gagtccgact ccggccacgt gatgacctcc      480

ttcgcctccc gggaccagct gcccgagtgg accctgcaca cctacaccct gaccgccacc      540

gacggcgcca agaccgacac cgagtacgcc cggcggaccc tggcctacac cgtgcggcag      600

gagctgtaca ccgaccacga cgccgccccc gtggccaccg acggcctgat gctgctgacc      660

cccgagcccc tgggcgagac ccccctggac ctggactgcg gcgtgcgggt ggaggccgac      720

gagacccgga ccctggacta caccaccgcc aaggaccggc tgctggcccg ggagctggtg      780

gaggagggcc tgaagcggtc cctgtgggac gactacctgg tgcggggcat cgacgaggtg      840

ctgtccaagg agcccgtgct gacctgcgac gagttcgacc tgcacgagcg gtacgacctg      900

gtgcccaagc tgaccctggc cgacatcgac gacgacaaca tctaccccgg cctgcgggtg      960

aagaccacct accggccccg gcggggccac atcgtgtggg gcctgcggga cgagtgcgcc     1020

accgactccc tgaacaccct gggcaaccag tccgtggtgg cctaccaccg gaacaaccag     1080

acccccatca acaccgacct gctggacgcc atcgaggccg ccgaccggcg ggtggtggag     1140

acccggcggc agggccacgg cgacgacgcc gtgtccttcc cccaggagct gctggccgtg     1200

gagcccaaca cccaccagat caagcagttc gcctccgacg gcttccacca gcaggcccgg     1260

tccaagaccc ggctgtccgc ctcccggtgc tccgagaagg cccaggcctt cgccgagcgg     1320

ctggaccccg tgcggctgaa cggctccacc gtggagttct cctccgagtt cttcaccggc     1380

aacaacgagc agcagctgcg gctgctgtac gagaacggcg agtccgtgct gaccttccgg     1440

gacggcgccc ggggcgccca ccccgacgag accttctcca agggcatcgt gaaccccccc     1500

gagtccttcg aggtggccgt ggtgctgccc gagcagcagg ccgacacctg caaggcccag     1560

tgggacacca tggccgacct gctgaaccag gccggcgccc cccccacccg gtccgagacc     1620

gtgcagtacg acgccttctc ctcccccgag tccatctccc tgaacgtggc cggcgccatc     1680

gacccctccg aggtggacgc cgccttcgtg gtgctgcccc ccgaccagga gggcttcgcc     1740

gacctggcct cccccaccga gacctacgac gagctgaaga aggccctggc atctactccc     1800

agatggccta cttcgaccgg taacgtggcc ctgggcctgc tggccgccgc cggcggcgtg     1860

gccttcacat gcccggcgac gccgacatgt tcatcggcat cgacgtgtcc cggtccggcg     1920

cctccggcca gatcaacatc gccgccaccg ccaccgccgt gtacaaggac ggcaccatcc     1980

tgggccactc ctccacccgg ccccagctgg gcgagaagct gcagtccacc gacgtgcggg     2040

acatcatgaa gaacgccatc ctgggctacc agcaggtgac cggcgagtcc cccacccaca     2100

tcgtgatcca ccgggacggc ttcatgaacg aggacctgga ccccgccacc gagttcctga     2160

acgagcaggg cgtggagtac gacatcgtgg agatccggaa gcagccccag acccggctgc     2220

tggccgtgtc cgacgtgcag tacgacaccc ccgtgaagtc catcgccaga acgagccccg     2280

ggccaccgtg gccaccttcg gcgcccccga gtacctggcc acccgggacg gcggcggcct     2340

gccccggccc atccagatcg agcgggtggc cggcgagacc ctgacccggc aggtgtacct     2400

gctgtcccag tcccacatcc aggtgcacaa ctccaccgcc cggctgccca tcaccaccgc     2460

ctacgccgac caggcct                                                    2477


<210>  6
<211>  2579
<212>  DNA
<213>  Natronobacterium gregoryi

<400>  6
atggtgccaa aaaagaagag aaaggtagcc acagtgattg acctcgattc gaccaccacc       60

gcagacgaac tgacatcggg acacacgtac gacatctcag tcacgctcac cggtgtctac      120

gataacaccg acgagcagca tcctcgcatg tctctcgcat tcgagcagga caacggcgag      180

cggcgttaca ttaccctgtg gaagaacacg acacccaagg atgtctttac atacgactac      240

gccacgggct cgacgtacat cttcactaac atcgactacg aagtgaagga cggctacgag      300

aatctgactg caacatacca gacgaccgtc gagaacgcta ccgctcagga agtcgggacg      360

actgacgagg acgaaacgtc gcgggcggcg agccgctcga ccatcacttg gacgacgcgc      420

tcaatgagac gccagacgac gcggagacag agagcgactc aggccatgtg atgacctcgt      480

tcgcctcccg cgaccaactc cctgagtgga cgctgcatac gtatacgcta acagccacag      540

acggcgcaaa gacggacacg gagtacgcgc gacgaaccct cgcatacacg gtacggcagg      600

aactctatac cgaccatgat gcggctccgg ttgcaactga cgggctaatg cttctcacgc      660

cagagccgct cggcgagacc ccgcttgacc tcgattgcgg tgtccgggtc gaggcggacg      720

agactcggac actcgattac accacggcca aagaccggtt actcgcccgc gaactcgtcg      780

aagaggggct caaacgctcc ctctgggatg actacctcgt tcgcggcatc gatgaagtcc      840

tctcaaagga gcctgtgctg acttgcgatg agttcgacct acatgagcgg tatgacctcg      900

taccgaagct gacgctcgca gacatcgatg atgacaacat ctatcctggg ctccgggtga      960

agacgacgta tcgcccccgg cgaggacata tcgtctgggg tctgcgggac gagtgcgcca     1020

ccgactcgct caacacgctg ggaaaccagt ccgtcgttgc ataccaccgc aacaatcaga     1080

cacctattaa cactgacctc ctcgacgcta tcgaggccgc tgaccggcga gtcgtcgaaa     1140

cccgacgtca agggcacggc gatgatgctg tctcattccc ccaagaactg cttgcggtcg     1200

aaccgaatac gcaccaaatt aagcagttcg cctccgacgg attccaccaa caggcccgct     1260

caaagacgcg tctctcggcc tcccgctgca gcgagaaagc gcaagcgttc gccgagcggc     1320

ttgacccggt gcgtctcaat gggtccacgg tagagttctc ctcggagttt ttcaccggga     1380

acaacgagca gcaactgcgc ctcctctacg agaacggtga gtcggttctg acgttccgcg     1440

acggggcgcg tggtgcgcac cccgacgaga cattctcgaa aggtatcgtc aatccaccag     1500

agtcgttcga ggtggccgta gtactgcccg agcagcaggc agatacctgc aaagcgcagt     1560

gggacacgat ggctgacctc ctcaaccaag ctggcgcgcc accgacacgg agcgagaccg     1620

tccaatatga tgcgttctcc tcgccagaga gcatcagcct caatgtggct ggagccatcg     1680

accctagcga ggtagacgcg gcattcgtcg tactgccgcc ggaccaagaa ggattcgcag     1740

acctcgccag tccgacagag acgtacgacg agctgaagaa ggcgcttgcc aacatgggca     1800

tttacagcca gatggcgtac ttcgaccggt tccgcgacgc gaaaatattc tatactcgta     1860

acgtggcact cgggctgctg gcagccgctg gcggcgtcgc attcacaacc gaacatgcga     1920

tgcctgggga cgcagatatg ttcattggga ttgatgtctc tcggagctac cccgaggacg     1980

gtgccagcgg ccagataaac attgccgcga cggcgaccgc cgtctacaag gatggaacta     2040

tcctcggcca ctcgtccacc cgaccgcagc tcggggagaa actacagtcg acggatgttc     2100

gtgacattat gaagaatgcc atcctcggct accagcaggt gaccggtgag tcgccgaccc     2160

atatcgtcat ccaccgtgat ggcttcatga acgaagacct cgaccccgcc acggaattcc     2220

tcaacgaaca aggcgtcgag tacgacatcg tcgaaatccg caagcagccc cagacacgcc     2280

tgctggcagt ctccgatgtg cagtacgata cgcctgtgaa gagcatcgcc gctatcaacc     2340

agaacgagcc acgggcaacg gtcgccacct tcggcgcacc cgaatactta gcgacacgcg     2400

atggaggcgg ccttccccgc ccaatccaaa ttgaacgagt cgccggcgaa accgacatcg     2460

agacgctcac tcgccaagtc tatctgctct cccagtcgca tatccaggtc cataactcga     2520

ctgcgcgcct acccatcacc accgcatacg ccgaccaggc aagtactcac gcgaccaag      2579


<210>  7
<211>  146
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  COL8A2 genomic sequence

<400>  7
tccactcctc cttttcagga ttcttgctct tgctctgccc cacataaccc gcgggggggg       60

tcctgctgcc ctggccaggt gaggaggaaa agtcctaaga acgagacggg gtgtattggg      120

cgcccccccc aggacgacgg gaccgg                                           146


<210>  8
<211>  180
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  COL8A2 genomic sequence

<400>  8
tccactcctc cttttcagga ttcttgctct gccccacatc cggatccatg gtggcctgaa       60

acccgcgggg ggggtcctgc tgccctggcc aggtgaggag gaaaagtcct aagaacgaga      120

cggggtgtag gcctaggtac caccggactt tgggcgcccc ccccaggacg acgggaccgg      180


<210>  9
<211>  526
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Knock-in of an EGFP-P2A-Puro donor fragment into the human COL8A2
       locus Sequence

<400>  9
agcggccgcg aattcgccct tcggcgtcta ctactttgct taccatgtgc acgtcaaggg       60

caccaacgtg tgggtggccc tgtacaagaa caacgtgccg gccacctata cctacgatga      120

gtacaagaag ggctacctgg accaggcatc tggtggggcc gtgctccagc tgcggcccaa      180

cgaccaggtc tgggtgcaga tgccgtcgga ccaggccaac ggcctctact ccacggagta      240

catccactcc tccttttcag gattcttgct ctgccccaca tccggatcca tggtgagcaa      300

gggcgaggag ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa      360

cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac      420

cctgaagttc atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac      480

cctgacctac ggcgtgcagt gcttcagccg ttaccaaggg cgaatt                     526


<210>  10
<211>  261
<212>  DNA
<213>  Homo sapiens

<400>  10
cggcgtctac tactttgctt accatgtgca cgtcaagggc accaacgtgt gggtggccct       60

gtacaagaac aacgtgccgg ccacctatac ctacgatgga gtacaagaag ggctacctgg      120

accaggcatc tggtggggcc gtgctccagc tgcggcccaa cgaccaggtc tgggtgcaga      180

tgccgtcgga ccaggccaac ggcctctact ccacggagta catccactcc tccttttcag      240

gattcttgct ctgccccaca t                                                261


<210>  11
<211>  260
<212>  DNA
<213>  Homo sapiens

<400>  11
cggcgtctac tactttgctt accatgtgca cgtcaagggc accaacgtgt gggtggccct       60

gtacaagaac aacgtgccgg ccacctatac ctacgatgag tacaagaagg gctacctgga      120

ccaggcatct ggtggggccg tgctccagct gcggcccaac gaccaggtct gggtgcagat      180

gccgtcggac caggccaacg gcctctactc cacggagtac atccactcct ccttttcagg      240

attcttgctc tgccccacat                                                  260


<210>  12
<211>  56
<212>  DNA
<213>  Homo sapiens

<400>  12
tggtgccgac tacaagcgaa ttactgtgaa agtcaatggt aagaattatt atagat           56


<210>  13
<211>  56
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DNA oligo guided HuAgo Sequence

<400>  13
tggtgccgac tacaagcgga ttactgtgaa agtcaatggt aagaattatt atagat           56


<210>  14
<211>  56
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DNA oligo guided HuAgo Sequence

<400>  14
tggtgccgac tacaagcgaa ttaccgtgaa agtcaatggt aagaattatt atagat           56


<210>  15
<211>  42
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Clonal PCR Amplicon Sequence

<400>  15
tacaagcgga ttactgtgaa agtcaatggt aagaattatt at                          42


<210>  16
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GuideDNA-COL8A2-p24

<400>  16
gccccacata acccgcgggg gggg                                              24


<210>  17
<211>  25
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  guideDNA-Col8A2-p21

<400>  17
caccggaccc cccccgcggg ttatg                                             25


<210>  18
<211>  59
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  COL-GFP-F59 Primer

<400>  18
atccactcct ccttttcagg attcttgctc tgccccacat atggtgagca agggcgagg        59


<210>  19
<211>  59
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  COL-PURO-R59 Primer

<400>  19
actaaagggg aggaggccag ggcagcagga ccccccccgc gggtttcagg caccgggct        59


<210>  20
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Col8A2 FP1

<400>  20
cggcgtctac tactttgctt ac                                                22


<210>  21
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GFP RP1

<400>  21
ggtaacggct gaagcactg                                                    19


<210>  22
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  COL8A2 RP1

<400>  22
agcctgcatg cagggagaaa g                                                 21


<210>  23
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Puro FP3

<400>  23
gagctgcaag aactcttcct c                                                 21


<210>  24
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Guide DNA oligo GD5

<400>  24
gcgaattact gtgaaagtca atgg                                              24


<210>  25
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CD274_461F/R primer

<400>  25
cctggctgca ctaattgtct at                                                22


<210>  26
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  461R primer

<400>  26
ctgtgttgtt tgttctggat ttc                                               23


